BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 015184
(411 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255549236|ref|XP_002515672.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223545215|gb|EEF46724.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 492
Score = 682 bits (1761), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 324/391 (82%), Positives = 356/391 (91%)
Query: 21 GVSSNHGVFSVKYRYAGRERSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKI 80
VSS+ GVFSVKYRYAG++RSLS LK HD RRQ RILAGVDLPLGGS RPD VGLYYAK+
Sbjct: 31 AVSSDSGVFSVKYRYAGQQRSLSDLKAHDDRRQLRILAGVDLPLGGSGRPDTVGLYYAKV 90
Query: 81 GIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEF 140
GIGTP KDYYVQVDTGSDIMWVNCIQC+ECPR SSLG+ELTLY+IKDS +GK V CD+EF
Sbjct: 91 GIGTPSKDYYVQVDTGSDIMWVNCIQCRECPRTSSLGMELTLYNIKDSVSGKLVPCDEEF 150
Query: 141 CHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFG 200
C+ V GGPL+ CTAN SCPYLEIYGDGSST GYFV+DVVQYD+VSGDLQTTS+NGS+IFG
Sbjct: 151 CYEVNGGPLSGCTANMSCPYLEIYGDGSSTAGYFVKDVVQYDRVSGDLQTTSSNGSVIFG 210
Query: 201 CGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIG 260
CGARQSG+L T+EEALDGI+GFGKSNSSMISQLA++ V+K+FAHCLDGINGGGIFAIG
Sbjct: 211 CGARQSGDLGPTSEEALDGILGFGKSNSSMISQLAATRKVKKIFAHCLDGINGGGIFAIG 270
Query: 261 HVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLP 320
HVVQP+VN TPL+PNQPHY++NMTAVQVG DFL+LPT+ F GD KG IIDSGTTLAYLP
Sbjct: 271 HVVQPKVNMTPLIPNQPHYNVNMTAVQVGEDFLHLPTEEFEAGDRKGAIIDSGTTLAYLP 330
Query: 321 EMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEY 380
E+VYEPLVSKIISQQPDLKVH V DEYTCFQYS SVD+GFPNVTFHFENSV LKV+PHEY
Sbjct: 331 EIVYEPLVSKIISQQPDLKVHIVRDEYTCFQYSGSVDDGFPNVTFHFENSVFLKVHPHEY 390
Query: 381 LFPFEDLWCIGWQNSGMQSRDRKNMTLLGDF 411
LFPFE LWCIGWQNSGMQSRDR+NMTLLGD
Sbjct: 391 LFPFEGLWCIGWQNSGMQSRDRRNMTLLGDL 421
>gi|224128838|ref|XP_002320434.1| predicted protein [Populus trichocarpa]
gi|222861207|gb|EEE98749.1| predicted protein [Populus trichocarpa]
Length = 485
Score = 670 bits (1729), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 316/390 (81%), Positives = 352/390 (90%)
Query: 22 VSSNHGVFSVKYRYAGRERSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIG 81
VS+N+GVFSVKY+YAG +RSLS LK HD +RQ RILAGVDLPLGG RPD +GLYYAKIG
Sbjct: 24 VSANNGVFSVKYKYAGLQRSLSDLKAHDDQRQLRILAGVDLPLGGIGRPDILGLYYAKIG 83
Query: 82 IGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFC 141
IGTP KDYYVQVDTGSDIMWVNCIQC+ECP+ SSLGI+LTLY+I +S TGK V CDQEFC
Sbjct: 84 IGTPTKDYYVQVDTGSDIMWVNCIQCRECPKTSSLGIDLTLYNINESDTGKLVPCDQEFC 143
Query: 142 HGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGC 201
+ + GG L CTAN SCPYLEIYGDGSST GYFV+DVVQY +VSGDL+TT+ NGS+IFGC
Sbjct: 144 YEINGGQLPGCTANMSCPYLEIYGDGSSTAGYFVKDVVQYARVSGDLKTTAANGSVIFGC 203
Query: 202 GARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGH 261
GARQSG+L S+NEEALDGI+GFGKSNSSMISQLA +G V+K+FAHCLDG NGGGIF IGH
Sbjct: 204 GARQSGDLGSSNEEALDGILGFGKSNSSMISQLAVTGKVKKIFAHCLDGTNGGGIFVIGH 263
Query: 262 VVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPE 321
VVQP+VN TPL+PNQPHY++NMTAVQVG +FL+LPTDVF GD KG IIDSGTTLAYLPE
Sbjct: 264 VVQPKVNMTPLIPNQPHYNVNMTAVQVGHEFLSLPTDVFEAGDRKGAIIDSGTTLAYLPE 323
Query: 322 MVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYL 381
MVY+PLVSKIISQQPDLKVHTV DEYTCFQYS+S+D+GFPNVTFHFENSV LKVYPHEYL
Sbjct: 324 MVYKPLVSKIISQQPDLKVHTVRDEYTCFQYSDSLDDGFPNVTFHFENSVILKVYPHEYL 383
Query: 382 FPFEDLWCIGWQNSGMQSRDRKNMTLLGDF 411
FPFE LWCIGWQNSG+QSRDR+NMTLLGD
Sbjct: 384 FPFEGLWCIGWQNSGVQSRDRRNMTLLGDL 413
>gi|297848856|ref|XP_002892309.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297338151|gb|EFH68568.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 484
Score = 639 bits (1648), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 300/390 (76%), Positives = 340/390 (87%)
Query: 22 VSSNHGVFSVKYRYAGRERSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIG 81
VSSN GVF+VKYRY + SLS LKEHD RRQ ILAG+DLPLGG+ RPD GLYYAKIG
Sbjct: 26 VSSNPGVFNVKYRYPRLQGSLSALKEHDDRRQLTILAGIDLPLGGTGRPDIPGLYYAKIG 85
Query: 82 IGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFC 141
IGTP K YYVQVDTGSDIMWVNCIQCK+CPRRS+LGIELTLY+I +S +GK V+CD +FC
Sbjct: 86 IGTPAKSYYVQVDTGSDIMWVNCIQCKQCPRRSTLGIELTLYNIDESDSGKLVSCDDDFC 145
Query: 142 HGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGC 201
+ + GGPL+ C AN SCPYLEIYGDGSST GYFV+DVVQYD V+GDL+T + NGS+IFGC
Sbjct: 146 YQISGGPLSGCKANMSCPYLEIYGDGSSTAGYFVKDVVQYDSVAGDLKTQTANGSVIFGC 205
Query: 202 GARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGH 261
GARQSG+LDS+NEEALDGI+GFGK+NSSMISQLASSG V+K+FAHCLDG NGGGIFAIG
Sbjct: 206 GARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCLDGRNGGGIFAIGR 265
Query: 262 VVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPE 321
VVQP+VN TPLVPNQPHY++NMTAVQVG +FLN+P D+F GD KG IIDSGTTLAYLPE
Sbjct: 266 VVQPKVNMTPLVPNQPHYNVNMTAVQVGQEFLNIPADLFQPGDRKGAIIDSGTTLAYLPE 325
Query: 322 MVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYL 381
++YEPLV KI SQ+P LKVH V +Y CFQYS VDEGFPNVTFHFENSV L+VYPH+YL
Sbjct: 326 IIYEPLVKKITSQEPALKVHIVDKDYKCFQYSGRVDEGFPNVTFHFENSVFLRVYPHDYL 385
Query: 382 FPFEDLWCIGWQNSGMQSRDRKNMTLLGDF 411
FP+E +WCIGWQNS MQSRDR+NMTLLGD
Sbjct: 386 FPYEGMWCIGWQNSAMQSRDRRNMTLLGDL 415
>gi|356523724|ref|XP_003530485.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 488
Score = 638 bits (1645), Expect = e-180, Method: Compositional matrix adjust.
Identities = 303/390 (77%), Positives = 343/390 (87%), Gaps = 2/390 (0%)
Query: 22 VSSNHGVFSVKYRYAGRERSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIG 81
V+++HGVF+VK +Y ++RSLS LK HD RRQ +LAGVDLPLGGS RPD VGLYYAKIG
Sbjct: 31 VNASHGVFNVKCKY--QDRSLSALKAHDYRRQLSLLAGVDLPLGGSGRPDAVGLYYAKIG 88
Query: 82 IGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFC 141
IGTPPK+YY+QVDTGSDIMWVNCIQCKECP RSSLG++LTLYDIK+SS+GK V CDQEFC
Sbjct: 89 IGTPPKNYYLQVDTGSDIMWVNCIQCKECPTRSSLGMDLTLYDIKESSSGKLVPCDQEFC 148
Query: 142 HGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGC 201
+ GG LT CTAN SCPYLEIYGDGSST GYFV+D+V YD+VSGDL+T S NGS++FGC
Sbjct: 149 KEINGGLLTGCTANISCPYLEIYGDGSSTAGYFVKDIVLYDQVSGDLKTDSANGSIVFGC 208
Query: 202 GARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGH 261
GARQSG+L S+NEEALDGI+GFGK+NSSMISQLASSG V+KMFAHCL+G+NGGGIFAIGH
Sbjct: 209 GARQSGDLSSSNEEALDGILGFGKANSSMISQLASSGKVKKMFAHCLNGVNGGGIFAIGH 268
Query: 262 VVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPE 321
VVQP+VN TPL+P+QPHYS+NMTAVQVG FL+L TD GD KGTIIDSGTTLAYLPE
Sbjct: 269 VVQPKVNMTPLLPDQPHYSVNMTAVQVGHTFLSLSTDTSAQGDRKGTIIDSGTTLAYLPE 328
Query: 322 MVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYL 381
+YEPLV K+ISQ PDLKV T+HDEYTCFQYSESVD+GFP VTF FEN +SLKVYPH+YL
Sbjct: 329 GIYEPLVYKMISQHPDLKVQTLHDEYTCFQYSESVDDGFPAVTFFFENGLSLKVYPHDYL 388
Query: 382 FPFEDLWCIGWQNSGMQSRDRKNMTLLGDF 411
FP + WCIGWQNSG QSRD KNMTLLGD
Sbjct: 389 FPSVNFWCIGWQNSGTQSRDSKNMTLLGDL 418
>gi|359478045|ref|XP_002267046.2| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
Length = 502
Score = 637 bits (1643), Expect = e-180, Method: Compositional matrix adjust.
Identities = 297/390 (76%), Positives = 345/390 (88%), Gaps = 2/390 (0%)
Query: 22 VSSNHGVFSVKYRYAGRERSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIG 81
VS+NHG FS+KY++AG++RSL+ LK HD RQ RILAGVDLPLGG+ RP+ VGLYYAKIG
Sbjct: 44 VSANHGFFSLKYKFAGQKRSLAALKAHDNSRQLRILAGVDLPLGGTGRPEAVGLYYAKIG 103
Query: 82 IGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFC 141
IGTP +DYYVQVDTGSDIMWVNCIQC ECP++SSLG+ELTLYDIK+S TGK V+CDQ+FC
Sbjct: 104 IGTPARDYYVQVDTGSDIMWVNCIQCNECPKKSSLGMELTLYDIKESLTGKLVSCDQDFC 163
Query: 142 HGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGC 201
+ + GGP + C AN SC Y EIY DGSS+ GYFV+D+VQYD+VSGDL+TTS NGS+IFGC
Sbjct: 164 YAINGGPPSYCIANMSCSYTEIYADGSSSFGYFVRDIVQYDQVSGDLETTSANGSVIFGC 223
Query: 202 GARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGH 261
A QSG+L S EEALDGI+GFGKSN+SMISQLASSG VRKMFAHCLDG+NGGGIFAIGH
Sbjct: 224 SATQSGDLSS--EEALDGILGFGKSNTSMISQLASSGKVRKMFAHCLDGLNGGGIFAIGH 281
Query: 262 VVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPE 321
+VQP+VN TPLVPNQ HY++NM AV+VG FLNLPTDVF VGD KGTIIDSGTTLAYLPE
Sbjct: 282 IVQPKVNTTPLVPNQTHYNVNMKAVEVGGYFLNLPTDVFDVGDKKGTIIDSGTTLAYLPE 341
Query: 322 MVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYL 381
+VY+ L+SKI S Q DLKVHT+HD++TCFQYSES+D+GFP VTFHFENS+ LKV+PHEYL
Sbjct: 342 VVYDQLLSKIFSWQSDLKVHTIHDQFTCFQYSESLDDGFPAVTFHFENSLYLKVHPHEYL 401
Query: 382 FPFEDLWCIGWQNSGMQSRDRKNMTLLGDF 411
F ++ LWCIGWQNSGMQSRDR+N+TLLGD
Sbjct: 402 FSYDGLWCIGWQNSGMQSRDRRNITLLGDL 431
>gi|449446233|ref|XP_004140876.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 498
Score = 637 bits (1642), Expect = e-180, Method: Compositional matrix adjust.
Identities = 295/392 (75%), Positives = 343/392 (87%)
Query: 20 GGVSSNHGVFSVKYRYAGRERSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAK 79
GGV +++G+FSVKY+YAGRERSLS LK HD RQ R LAG+D+PLGGS RPD VGLYYAK
Sbjct: 31 GGVYADNGIFSVKYKYAGRERSLSTLKAHDISRQLRFLAGIDIPLGGSGRPDAVGLYYAK 90
Query: 80 IGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQE 139
IGIGTP KDYYVQVDTGSDI+WVNCIQC+ECPR SSLG+ELT YD+++S+TGK V+CD++
Sbjct: 91 IGIGTPSKDYYVQVDTGSDIVWVNCIQCRECPRTSSLGMELTPYDLEESTTGKLVSCDEQ 150
Query: 140 FCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIF 199
FC V GGPL+ CT N SCPYL+IYGDGSST GYFV+D VQY++VSGDL+TT+ NGS+ F
Sbjct: 151 FCLEVNGGPLSGCTTNMSCPYLQIYGDGSSTAGYFVKDYVQYNRVSGDLETTAANGSIKF 210
Query: 200 GCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAI 259
GCGARQSG+L S+ EEALDGI+GFGKSNSS+ISQLAS+ V+KMFAHCLDG NGGGIFA+
Sbjct: 211 GCGARQSGDLGSSGEEALDGILGFGKSNSSIISQLASTRKVKKMFAHCLDGTNGGGIFAM 270
Query: 260 GHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYL 319
GHVVQP+VN TPLVPNQPHY++NMT VQVG LN+ DVF GD KGTIIDSGTTLAYL
Sbjct: 271 GHVVQPKVNMTPLVPNQPHYNVNMTGVQVGHIILNISADVFEAGDRKGTIIDSGTTLAYL 330
Query: 320 PEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHE 379
PE++YEPLV+KI+SQQ +L+V T+H EY CFQYSE VD+GFP V FHFENS+ LKVYPHE
Sbjct: 331 PELIYEPLVAKILSQQHNLEVQTIHGEYKCFQYSERVDDGFPPVIFHFENSLLLKVYPHE 390
Query: 380 YLFPFEDLWCIGWQNSGMQSRDRKNMTLLGDF 411
YLF +E+LWCIGWQNSGMQSRDRKN+TL GD
Sbjct: 391 YLFQYENLWCIGWQNSGMQSRDRKNVTLFGDL 422
>gi|296089645|emb|CBI39464.3| unnamed protein product [Vitis vinifera]
Length = 477
Score = 635 bits (1638), Expect = e-179, Method: Compositional matrix adjust.
Identities = 297/390 (76%), Positives = 345/390 (88%), Gaps = 2/390 (0%)
Query: 22 VSSNHGVFSVKYRYAGRERSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIG 81
VS+NHG FS+KY++AG++RSL+ LK HD RQ RILAGVDLPLGG+ RP+ VGLYYAKIG
Sbjct: 44 VSANHGFFSLKYKFAGQKRSLAALKAHDNSRQLRILAGVDLPLGGTGRPEAVGLYYAKIG 103
Query: 82 IGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFC 141
IGTP +DYYVQVDTGSDIMWVNCIQC ECP++SSLG+ELTLYDIK+S TGK V+CDQ+FC
Sbjct: 104 IGTPARDYYVQVDTGSDIMWVNCIQCNECPKKSSLGMELTLYDIKESLTGKLVSCDQDFC 163
Query: 142 HGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGC 201
+ + GGP + C AN SC Y EIY DGSS+ GYFV+D+VQYD+VSGDL+TTS NGS+IFGC
Sbjct: 164 YAINGGPPSYCIANMSCSYTEIYADGSSSFGYFVRDIVQYDQVSGDLETTSANGSVIFGC 223
Query: 202 GARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGH 261
A QSG+L S EEALDGI+GFGKSN+SMISQLASSG VRKMFAHCLDG+NGGGIFAIGH
Sbjct: 224 SATQSGDLSS--EEALDGILGFGKSNTSMISQLASSGKVRKMFAHCLDGLNGGGIFAIGH 281
Query: 262 VVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPE 321
+VQP+VN TPLVPNQ HY++NM AV+VG FLNLPTDVF VGD KGTIIDSGTTLAYLPE
Sbjct: 282 IVQPKVNTTPLVPNQTHYNVNMKAVEVGGYFLNLPTDVFDVGDKKGTIIDSGTTLAYLPE 341
Query: 322 MVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYL 381
+VY+ L+SKI S Q DLKVHT+HD++TCFQYSES+D+GFP VTFHFENS+ LKV+PHEYL
Sbjct: 342 VVYDQLLSKIFSWQSDLKVHTIHDQFTCFQYSESLDDGFPAVTFHFENSLYLKVHPHEYL 401
Query: 382 FPFEDLWCIGWQNSGMQSRDRKNMTLLGDF 411
F ++ LWCIGWQNSGMQSRDR+N+TLLGD
Sbjct: 402 FSYDGLWCIGWQNSGMQSRDRRNITLLGDL 431
>gi|356568907|ref|XP_003552649.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 490
Score = 635 bits (1638), Expect = e-179, Method: Compositional matrix adjust.
Identities = 302/386 (78%), Positives = 340/386 (88%), Gaps = 2/386 (0%)
Query: 26 HGVFSVKYRYAGRERSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTP 85
HGVF+VK +Y ++R+LS LK HD RRQ +LAGVDLPLGGS RPD VGLYYAKIGIGTP
Sbjct: 37 HGVFNVKCKY--QDRTLSALKAHDYRRQLSLLAGVDLPLGGSGRPDAVGLYYAKIGIGTP 94
Query: 86 PKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVY 145
PK+YY+QVDTGSDIMWVNCIQCKECP RS+LG++LTLYDIK+SS+GKFV CDQEFC +
Sbjct: 95 PKNYYLQVDTGSDIMWVNCIQCKECPTRSNLGMDLTLYDIKESSSGKFVPCDQEFCKEIN 154
Query: 146 GGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQ 205
GG LT CTAN SCPYLEIYGDGSST GYFV+D+V YD+VSGDL+T S NGS++FGCGARQ
Sbjct: 155 GGLLTGCTANISCPYLEIYGDGSSTAGYFVKDIVLYDQVSGDLKTDSANGSIVFGCGARQ 214
Query: 206 SGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHVVQP 265
SG+L S+NEEAL GI+GFGK+NSSMISQLASSG V+KMFAHCL+G+NGGGIFAIGHVVQP
Sbjct: 215 SGDLSSSNEEALGGILGFGKANSSMISQLASSGKVKKMFAHCLNGVNGGGIFAIGHVVQP 274
Query: 266 EVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYE 325
+VN TPL+P+QPHYS+NMTAVQVG FL+L TD GD KGTIIDSGTTLAYLPE +YE
Sbjct: 275 KVNMTPLLPDQPHYSVNMTAVQVGHAFLSLSTDTSTQGDRKGTIIDSGTTLAYLPEGIYE 334
Query: 326 PLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFE 385
PLV KIISQ PDLKV T+HDEYTCFQYSESVD+GFP VTF+FEN +SLKVYPH+YLFP
Sbjct: 335 PLVYKIISQHPDLKVRTLHDEYTCFQYSESVDDGFPAVTFYFENGLSLKVYPHDYLFPSG 394
Query: 386 DLWCIGWQNSGMQSRDRKNMTLLGDF 411
D WCIGWQNSG QSRD KNMTLLGD
Sbjct: 395 DFWCIGWQNSGTQSRDSKNMTLLGDL 420
>gi|18390579|ref|NP_563751.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|332189782|gb|AEE27903.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 485
Score = 633 bits (1633), Expect = e-179, Method: Compositional matrix adjust.
Identities = 302/409 (73%), Positives = 344/409 (84%), Gaps = 3/409 (0%)
Query: 3 LCLRNCLCIVLIATAAVGGVSSNHGVFSVKYRYAGRERSLSLLKEHDARRQQRILAGVDL 62
+C R L L A +V S N GVF+VKYRY + SL+ LKEHD RRQ ILAG+DL
Sbjct: 10 ICGRFTLIWFLTALVSV---SCNPGVFNVKYRYPRLQGSLTALKEHDDRRQLTILAGIDL 66
Query: 63 PLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTL 122
PLGG+ RPD GLYYAKIGIGTP K YYVQVDTGSDIMWVNCIQCK+CPRRS+LGIELTL
Sbjct: 67 PLGGTGRPDIPGLYYAKIGIGTPAKSYYVQVDTGSDIMWVNCIQCKQCPRRSTLGIELTL 126
Query: 123 YDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYD 182
Y+I +S +GK V+CD +FC+ + GGPL+ C AN SCPYLEIYGDGSST GYFV+DVVQYD
Sbjct: 127 YNIDESDSGKLVSCDDDFCYQISGGPLSGCKANMSCPYLEIYGDGSSTAGYFVKDVVQYD 186
Query: 183 KVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRK 242
V+GDL+T + NGS+IFGCGARQSG+LDS+NEEALDGI+GFGK+NSSMISQLASSG V+K
Sbjct: 187 SVAGDLKTQTANGSVIFGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGRVKK 246
Query: 243 MFAHCLDGINGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGV 302
+FAHCLDG NGGGIFAIG VVQP+VN TPLVPNQPHY++NMTAVQVG +FL +P D+F
Sbjct: 247 IFAHCLDGRNGGGIFAIGRVVQPKVNMTPLVPNQPHYNVNMTAVQVGQEFLTIPADLFQP 306
Query: 303 GDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPN 362
GD KG IIDSGTTLAYLPE++YEPLV KI SQ+P LKVH V +Y CFQYS VDEGFPN
Sbjct: 307 GDRKGAIIDSGTTLAYLPEIIYEPLVKKITSQEPALKVHIVDKDYKCFQYSGRVDEGFPN 366
Query: 363 VTFHFENSVSLKVYPHEYLFPFEDLWCIGWQNSGMQSRDRKNMTLLGDF 411
VTFHFENSV L+VYPH+YLFP E +WCIGWQNS MQSRDR+NMTLLGD
Sbjct: 367 VTFHFENSVFLRVYPHDYLFPHEGMWCIGWQNSAMQSRDRRNMTLLGDL 415
>gi|449518783|ref|XP_004166415.1| PREDICTED: aspartic proteinase-like protein 2-like, partial
[Cucumis sativus]
Length = 420
Score = 633 bits (1632), Expect = e-179, Method: Compositional matrix adjust.
Identities = 296/390 (75%), Positives = 342/390 (87%)
Query: 20 GGVSSNHGVFSVKYRYAGRERSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAK 79
GGV +++GVFSVKY+YAGRERSLS LK HD RQ R LAGVD+PLGGS RPD VGLYYAK
Sbjct: 31 GGVYADNGVFSVKYKYAGRERSLSTLKAHDISRQLRFLAGVDIPLGGSGRPDAVGLYYAK 90
Query: 80 IGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQE 139
IGIGTP KDYYVQVDTGSDI+WVNCIQC+ECPR SSLG+ELT YD+++S+TGK V+CD++
Sbjct: 91 IGIGTPSKDYYVQVDTGSDIVWVNCIQCRECPRTSSLGMELTPYDLEESTTGKLVSCDEQ 150
Query: 140 FCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIF 199
FC V GGPL+ CT N SCPYL+IYGDGSST GYFV+D VQY++VSGDL+TT+ NGS+ F
Sbjct: 151 FCLEVNGGPLSGCTTNMSCPYLQIYGDGSSTAGYFVKDYVQYNRVSGDLETTAANGSIKF 210
Query: 200 GCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAI 259
GCGARQSG+L S+ EEALDGI+GFGKSNSS+ISQLAS+ V+KMFAHCLDG NGGGIFA+
Sbjct: 211 GCGARQSGDLGSSGEEALDGILGFGKSNSSIISQLASTRKVKKMFAHCLDGTNGGGIFAM 270
Query: 260 GHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYL 319
GHVVQP+VN TPLVPNQPHY++NMT VQVG LN+ DVF GD KGTIIDSGTTLAYL
Sbjct: 271 GHVVQPKVNMTPLVPNQPHYNVNMTGVQVGHIILNISADVFEAGDRKGTIIDSGTTLAYL 330
Query: 320 PEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHE 379
PE++YEPLV+KI+SQQ +L+V T+H EY CFQYSE VD+GFP V FHFENS+ LKVYPHE
Sbjct: 331 PELIYEPLVAKILSQQHNLEVQTIHGEYKCFQYSERVDDGFPPVIFHFENSLLLKVYPHE 390
Query: 380 YLFPFEDLWCIGWQNSGMQSRDRKNMTLLG 409
YLF +E+LWCIGWQNSGMQSRDRKN+TL G
Sbjct: 391 YLFQYENLWCIGWQNSGMQSRDRKNVTLFG 420
>gi|357502759|ref|XP_003621668.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355496683|gb|AES77886.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 481
Score = 617 bits (1590), Expect = e-174, Method: Compositional matrix adjust.
Identities = 291/393 (74%), Positives = 343/393 (87%), Gaps = 3/393 (0%)
Query: 22 VSSNHGVFSVKYRYAG-RERSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKI 80
V++NHGVF+V+Y+++ ++RSLS+LK HD RRQ +L GVDLPLGG+ RPD VGLYYAKI
Sbjct: 18 VAANHGVFNVQYKFSDDQQRSLSVLKAHDYRRQISLLTGVDLPLGGTGRPDSVGLYYAKI 77
Query: 81 GIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEF 140
GIGTP KDYY+QVDTG+D+MWVNCIQCKECP RS+LG++LTLY+IK+SS+GK V CDQE
Sbjct: 78 GIGTPSKDYYLQVDTGTDMMWVNCIQCKECPTRSNLGMDLTLYNIKESSSGKLVPCDQEL 137
Query: 141 CHGVYGGPLTDCTANT--SCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLI 198
C + GG LT CT+ T SCPYLEIYGDGSST GYFV+DVV +D+VSGDL+T S NGS+I
Sbjct: 138 CKEINGGLLTGCTSKTNDSCPYLEIYGDGSSTAGYFVKDVVLFDQVSGDLKTASANGSVI 197
Query: 199 FGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFA 258
FGCGARQSG+L +NEEALDGI+GFGK+N SMISQL+SSG V+KMFAHCL+G+NGGGIFA
Sbjct: 198 FGCGARQSGDLSYSNEEALDGILGFGKANYSMISQLSSSGKVKKMFAHCLNGVNGGGIFA 257
Query: 259 IGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAY 318
IGHVVQP VN TPL+P+QPHYS+NMTA+QVG FLNL TD D+KGTIIDSGTTLAY
Sbjct: 258 IGHVVQPTVNTTPLLPDQPHYSVNMTAIQVGHTFLNLSTDASEQRDSKGTIIDSGTTLAY 317
Query: 319 LPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPH 378
LP+ +Y+PLV KI+SQQP+LKV T+HDEYTCFQYS SVD+GFPNVTF+FEN +SLKVYPH
Sbjct: 318 LPDGIYQPLVYKILSQQPNLKVQTLHDEYTCFQYSGSVDDGFPNVTFYFENGLSLKVYPH 377
Query: 379 EYLFPFEDLWCIGWQNSGMQSRDRKNMTLLGDF 411
+YLF E+LWCIGWQNSG QSRD KNMTLLGD
Sbjct: 378 DYLFLSENLWCIGWQNSGAQSRDSKNMTLLGDL 410
>gi|6850312|gb|AAF29389.1|AC009999_9 Contains similarity to nucellin from Hordeum vulgare gb|U87148.
ESTs gb|T22068, gb|F14251, gb|F14237, gb|F14242 come
from this gene [Arabidopsis thaliana]
Length = 388
Score = 570 bits (1470), Expect = e-160, Method: Compositional matrix adjust.
Identities = 277/381 (72%), Positives = 317/381 (83%), Gaps = 7/381 (1%)
Query: 3 LCLRNCLCIVLIATAAVGGVSSNHGVFSVKYRYAGRERSLSLLKEHDARRQQRILAGVDL 62
+C R L L A +V S N GVF+VKYRY + SL+ LKEHD RRQ ILAG+DL
Sbjct: 10 ICGRFTLIWFLTALVSV---SCNPGVFNVKYRYPRLQGSLTALKEHDDRRQLTILAGIDL 66
Query: 63 PLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTL 122
PLGG+ RPD GLYYAKIGIGTP K YYVQVDTGSDIMWVNCIQCK+CPRRS+LGIELTL
Sbjct: 67 PLGGTGRPDIPGLYYAKIGIGTPAKSYYVQVDTGSDIMWVNCIQCKQCPRRSTLGIELTL 126
Query: 123 YDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYD 182
Y+I +S +GK V+CD +FC+ + GGPL+ C AN SCPYLEIYGDGSST GYFV+DVVQYD
Sbjct: 127 YNIDESDSGKLVSCDDDFCYQISGGPLSGCKANMSCPYLEIYGDGSSTAGYFVKDVVQYD 186
Query: 183 KVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRK 242
V+GDL+T + NGS+IFGCGARQSG+LDS+NEEALDGI+GFGK+NSSMISQLASSG V+K
Sbjct: 187 SVAGDLKTQTANGSVIFGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGRVKK 246
Query: 243 MFAHCLDGINGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGV 302
+FAHCLDG NGGGIFAIG VVQP+VN TPLVPNQPHY++NMTAVQVG +FL +P D+F
Sbjct: 247 IFAHCLDGRNGGGIFAIGRVVQPKVNMTPLVPNQPHYNVNMTAVQVGQEFLTIPADLFQP 306
Query: 303 GDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPN 362
GD KG IIDSGTTLAYLPE++YEPLV K +P LKVH V +Y CFQYS VDEGFPN
Sbjct: 307 GDRKGAIIDSGTTLAYLPEIIYEPLVKK----EPALKVHIVDKDYKCFQYSGRVDEGFPN 362
Query: 363 VTFHFENSVSLKVYPHEYLFP 383
VTFHFENSV L+VYPH+YLFP
Sbjct: 363 VTFHFENSVFLRVYPHDYLFP 383
>gi|255547548|ref|XP_002514831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223545882|gb|EEF47385.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 488
Score = 534 bits (1376), Expect = e-149, Method: Compositional matrix adjust.
Identities = 243/402 (60%), Positives = 322/402 (80%), Gaps = 2/402 (0%)
Query: 11 IVLIATAAVGGVSSNHGVFSVKYRYAGRERSLSLLKEHDARRQQRILAGVDLPLGGSSRP 70
+V++ V +S+ + VF+V++++AG+ERSLS LK+HDARR +RIL+ VDLPLGG+ P
Sbjct: 17 VVIVELGFVVCLSNGNYVFNVQHKFAGKERSLSALKQHDARRHRRILSAVDLPLGGNGHP 76
Query: 71 DGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSST 130
GLY+AKIG+G PPKDYYVQVDTGSDI+WVNC C +CP +S LG++LTLYD + S++
Sbjct: 77 AEAGLYFAKIGLGNPPKDYYVQVDTGSDILWVNCANCDKCPTKSDLGVKLTLYDPQSSTS 136
Query: 131 GKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQT 190
+ CD +FC Y G L CT + C Y +YGDGSST G+FV+D +Q+D+V+G+LQT
Sbjct: 137 ATRIYCDDDFCAATYNGVLQGCTKDLPCQYSVVYGDGSSTAGFFVKDNLQFDRVTGNLQT 196
Query: 191 TSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDG 250
+S NGS+IFGCGA+QSG L T+ EALDGI+GFG++NSSMISQLA++G V+++FAHCLD
Sbjct: 197 SSANGSVIFGCGAKQSGEL-GTSSEALDGILGFGQANSSMISQLAAAGKVKRVFAHCLDN 255
Query: 251 INGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTII 310
+ GGGIFAIG VV P+VN TP+VPNQPHY++ M ++VG + L LPTD+F GD +GTII
Sbjct: 256 VKGGGIFAIGEVVSPKVNTTPMVPNQPHYNVVMKEIEVGGNVLELPTDIFDTGDRRGTII 315
Query: 311 DSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENS 370
DSGTTLAYLPE+VYE +++KI+S+QP LK+HTV +++TCFQY+ +V+EGFP V FHF S
Sbjct: 316 DSGTTLAYLPEVVYESMMTKIVSEQPGLKLHTVEEQFTCFQYTGNVNEGFPVVKFHFNGS 375
Query: 371 VSLKVYPHEYLFPF-EDLWCIGWQNSGMQSRDRKNMTLLGDF 411
+SL V PH+YLF E++WC GWQNSGMQS+D ++MTLLGD
Sbjct: 376 LSLTVNPHDYLFQIHEEVWCFGWQNSGMQSKDGRDMTLLGDL 417
>gi|359476756|ref|XP_002277082.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 2 [Vitis
vinifera]
Length = 560
Score = 514 bits (1323), Expect = e-143, Method: Compositional matrix adjust.
Identities = 241/390 (61%), Positives = 303/390 (77%), Gaps = 2/390 (0%)
Query: 22 VSSNHGVFSVKYRYAGRERSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIG 81
+ S + VF V++++ GR +SL L+ HD RR RIL+ VDLPLGG+ P GLY+AKIG
Sbjct: 101 IVSGNAVFRVQHKFKGRGKSLDALRAHDTRRHGRILSAVDLPLGGNGHPSEAGLYFAKIG 160
Query: 82 IGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFC 141
IGTP KDYYVQVDTGSDI+WVNC C CP +S LG++LTLYD+K S+T V CD FC
Sbjct: 161 IGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMKASTTSDAVGCDDNFC 220
Query: 142 HGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGC 201
+Y GPL C C Y +YGDGSSTTGYFVQD VQY+++SG+ QTT TNG+++FGC
Sbjct: 221 -SLYDGPLPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGNFQTTPTNGTVVFGC 279
Query: 202 GARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGH 261
G +QSG L S++E ALDGI+GFG++NSSM+SQLASSG V+K+F+HCLD ++GGGIFAIG
Sbjct: 280 GNKQSGELGSSSE-ALDGILGFGQANSSMLSQLASSGKVKKVFSHCLDNVDGGGIFAIGE 338
Query: 262 VVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPE 321
VV+P+VN TPLV NQ HY++ M ++VG D L++P+D F GD KGTIIDSGTTLAY P+
Sbjct: 339 VVEPKVNITPLVQNQAHYNVVMKEIEVGGDPLDVPSDAFESGDRKGTIIDSGTTLAYFPQ 398
Query: 322 MVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYL 381
VY PL+ KI+SQQPDL++HTV +TCF Y+ +VD+GFP VT HF+ S+SL VYPHEYL
Sbjct: 399 EVYVPLIEKILSQQPDLRLHTVEQAFTCFDYTGNVDDGFPTVTLHFDKSISLTVYPHEYL 458
Query: 382 FPFEDLWCIGWQNSGMQSRDRKNMTLLGDF 411
F E WCIGWQNSG Q++D K++TLLGD
Sbjct: 459 FQHEFEWCIGWQNSGAQTKDGKDLTLLGDL 488
>gi|297735249|emb|CBI17611.3| unnamed protein product [Vitis vinifera]
Length = 480
Score = 511 bits (1317), Expect = e-142, Method: Compositional matrix adjust.
Identities = 240/391 (61%), Positives = 304/391 (77%), Gaps = 3/391 (0%)
Query: 22 VSSNHGVFSVKYRYAGRERSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIG 81
+ S + VF V++++ GR +SL L+ HD RR RIL+ VDLPLGG+ P GLY+AKIG
Sbjct: 20 IVSGNAVFRVQHKFKGRGKSLDALRAHDTRRHGRILSAVDLPLGGNGHPSEAGLYFAKIG 79
Query: 82 IGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFC 141
IGTP KDYYVQVDTGSDI+WVNC C CP +S LG++LTLYD+K S+T V CD FC
Sbjct: 80 IGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMKASTTSDAVGCDDNFC 139
Query: 142 HGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGC 201
+Y GPL C C Y +YGDGSSTTGYFVQD VQY+++SG+ QTT TNG+++FGC
Sbjct: 140 -SLYDGPLPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGNFQTTPTNGTVVFGC 198
Query: 202 GARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGH 261
G +QSG L S++E ALDGI+GFG++NSSM+SQLASSG V+K+F+HCLD ++GGGIFAIG
Sbjct: 199 GNKQSGELGSSSE-ALDGILGFGQANSSMLSQLASSGKVKKVFSHCLDNVDGGGIFAIGE 257
Query: 262 VVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPE 321
VV+P+VN TPLV NQ HY++ M ++VG D L++P+D F GD KGTIIDSGTTLAY P+
Sbjct: 258 VVEPKVNITPLVQNQAHYNVVMKEIEVGGDPLDVPSDAFESGDRKGTIIDSGTTLAYFPQ 317
Query: 322 MVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYL 381
VY PL+ KI+SQQPDL++HTV +TCF Y+ +VD+GFP VT HF+ S+SL VYPHEYL
Sbjct: 318 EVYVPLIEKILSQQPDLRLHTVEQAFTCFDYTGNVDDGFPTVTLHFDKSISLTVYPHEYL 377
Query: 382 FPFEDL-WCIGWQNSGMQSRDRKNMTLLGDF 411
F ++ WCIGWQNSG Q++D K++TLLGD
Sbjct: 378 FQVKEFEWCIGWQNSGAQTKDGKDLTLLGDL 408
>gi|359476754|ref|XP_002277058.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 1 [Vitis
vinifera]
Length = 561
Score = 511 bits (1315), Expect = e-142, Method: Compositional matrix adjust.
Identities = 240/391 (61%), Positives = 304/391 (77%), Gaps = 3/391 (0%)
Query: 22 VSSNHGVFSVKYRYAGRERSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIG 81
+ S + VF V++++ GR +SL L+ HD RR RIL+ VDLPLGG+ P GLY+AKIG
Sbjct: 101 IVSGNAVFRVQHKFKGRGKSLDALRAHDTRRHGRILSAVDLPLGGNGHPSEAGLYFAKIG 160
Query: 82 IGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFC 141
IGTP KDYYVQVDTGSDI+WVNC C CP +S LG++LTLYD+K S+T V CD FC
Sbjct: 161 IGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMKASTTSDAVGCDDNFC 220
Query: 142 HGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGC 201
+Y GPL C C Y +YGDGSSTTGYFVQD VQY+++SG+ QTT TNG+++FGC
Sbjct: 221 -SLYDGPLPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGNFQTTPTNGTVVFGC 279
Query: 202 GARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGH 261
G +QSG L S++E ALDGI+GFG++NSSM+SQLASSG V+K+F+HCLD ++GGGIFAIG
Sbjct: 280 GNKQSGELGSSSE-ALDGILGFGQANSSMLSQLASSGKVKKVFSHCLDNVDGGGIFAIGE 338
Query: 262 VVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPE 321
VV+P+VN TPLV NQ HY++ M ++VG D L++P+D F GD KGTIIDSGTTLAY P+
Sbjct: 339 VVEPKVNITPLVQNQAHYNVVMKEIEVGGDPLDVPSDAFESGDRKGTIIDSGTTLAYFPQ 398
Query: 322 MVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYL 381
VY PL+ KI+SQQPDL++HTV +TCF Y+ +VD+GFP VT HF+ S+SL VYPHEYL
Sbjct: 399 EVYVPLIEKILSQQPDLRLHTVEQAFTCFDYTGNVDDGFPTVTLHFDKSISLTVYPHEYL 458
Query: 382 FPFEDL-WCIGWQNSGMQSRDRKNMTLLGDF 411
F ++ WCIGWQNSG Q++D K++TLLGD
Sbjct: 459 FQVKEFEWCIGWQNSGAQTKDGKDLTLLGDL 489
>gi|356535252|ref|XP_003536162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 475
Score = 502 bits (1293), Expect = e-139, Method: Compositional matrix adjust.
Identities = 232/407 (57%), Positives = 311/407 (76%), Gaps = 5/407 (1%)
Query: 6 RNCLCIVLIATAAVGGVSSNHGVFSVKYRYAGRERSLSLLKEHDARRQQRILAGVDLPLG 65
R L +V I A +G +++ + VF V+ R +RSL+ +K HDARR+ RIL+ VDL LG
Sbjct: 4 RAVLILVAILVAEIGCIANGNFVFPVERR----KRSLNAVKAHDARRRGRILSAVDLNLG 59
Query: 66 GSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDI 125
G+ P GLY+ K+G+G+PPKDYYVQVDTGSDI+WVNC++C CPR+S LGI+LTLYD
Sbjct: 60 GNGLPTETGLYFTKLGLGSPPKDYYVQVDTGSDILWVNCVKCSRCPRKSDLGIDLTLYDP 119
Query: 126 KDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVS 185
K S T + ++CDQEFC Y GP+ C + CPY YGDGS+TTGY+VQD + Y+ V+
Sbjct: 120 KGSETSELISCDQEFCSATYDGPIPGCKSEIPCPYSITYGDGSATTGYYVQDYLTYNHVN 179
Query: 186 GDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFA 245
+L+T N S+IFGCGA QSG L S++EEALDGIIGFG+SNSS++SQLA+SG V+K+F+
Sbjct: 180 DNLRTAPQNSSIIFGCGAVQSGTLSSSSEEALDGIIGFGQSNSSVLSQLAASGKVKKIFS 239
Query: 246 HCLDGINGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDN 305
HCLD I GGGIFAIG VV+P+V+ TPLVP HY++ + +++V D L LP+D+F G+
Sbjct: 240 HCLDNIRGGGIFAIGEVVEPKVSTTPLVPRMAHYNVVLKSIEVDTDILQLPSDIFDSGNG 299
Query: 306 KGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTF 365
KGTIIDSGTTLAYLP +VY+ L+ K++++QP LK++ V +++CFQY+ +VD GFP V
Sbjct: 300 KGTIIDSGTTLAYLPAIVYDELIPKVMARQPRLKLYLVEQQFSCFQYTGNVDRGFPVVKL 359
Query: 366 HFENSVSLKVYPHEYLFPFED-LWCIGWQNSGMQSRDRKNMTLLGDF 411
HFE+S+SL VYPH+YLF F+D +WCIGWQ S Q+++ K+MTLLGD
Sbjct: 360 HFEDSLSLTVYPHDYLFQFKDGIWCIGWQKSVAQTKNGKDMTLLGDL 406
>gi|449442281|ref|XP_004138910.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449506266|ref|XP_004162699.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 482
Score = 496 bits (1278), Expect = e-138, Method: Compositional matrix adjust.
Identities = 239/385 (62%), Positives = 298/385 (77%), Gaps = 2/385 (0%)
Query: 28 VFSVKYRYAGRERSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPK 87
V V++++ GRERSL K HD +R+ R L+ +DL LGG+ P GLY+AKIG+GTP +
Sbjct: 26 VLKVQHKFKGRERSLEAFKAHDIQRRGRFLSAIDLQLGGNGHPSESGLYFAKIGLGTPVQ 85
Query: 88 DYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGG 147
DYYVQVDTGSDI+WVNC C CP++S LGIEL+LY SST VTC+Q+FC Y G
Sbjct: 86 DYYVQVDTGSDILWVNCAGCTNCPKKSDLGIELSLYSPSSSSTSNRVTCNQDFCTSTYDG 145
Query: 148 PLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSG 207
P+ CT C Y YGDGSST GYFV+D V D+V+G+ QTTSTNGS++FGCGA+QSG
Sbjct: 146 PIPGCTPELLCEYRVAYGDGSSTAGYFVRDHVVLDRVTGNFQTTSTNGSIVFGCGAQQSG 205
Query: 208 NLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHVVQPEV 267
L +T+ ALDGI+GFG++NSSMISQLASSG V+++FAHCLD INGGGIFAIG VVQP+V
Sbjct: 206 QLGATSA-ALDGILGFGQANSSMISQLASSGKVKRVFAHCLDNINGGGIFAIGEVVQPKV 264
Query: 268 NKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPL 327
TPLVP Q HY++ M A++V + LNLPTDVF KGTIIDSGTTLAY P+++YEPL
Sbjct: 265 RTTPLVPQQAHYNVFMKAIEVDNEVLNLPTDVFDTDLRKGTIIDSGTTLAYFPDVIYEPL 324
Query: 328 VSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFE-D 386
+SKI ++Q LK+HTV +++TCF+Y +VD+GFP VTFHFE+S+SL VYPHEYLF + +
Sbjct: 325 ISKIFARQSTLKLHTVEEQFTCFEYDGNVDDGFPTVTFHFEDSLSLTVYPHEYLFDIDSN 384
Query: 387 LWCIGWQNSGMQSRDRKNMTLLGDF 411
WC+GWQNSG QSRD K+M LLGD
Sbjct: 385 KWCVGWQNSGAQSRDGKDMILLGDL 409
>gi|363808270|ref|NP_001242239.1| uncharacterized protein LOC100801883 [Glycine max]
gi|255641727|gb|ACU21134.1| unknown [Glycine max]
Length = 475
Score = 490 bits (1262), Expect = e-136, Method: Compositional matrix adjust.
Identities = 223/407 (54%), Positives = 310/407 (76%), Gaps = 5/407 (1%)
Query: 6 RNCLCIVLIATAAVGGVSSNHGVFSVKYRYAGRERSLSLLKEHDARRQQRILAGVDLPLG 65
R L +V + A +G V++ + VF V+ R +RSLS ++ HD RR+ RIL+ VDL LG
Sbjct: 4 RGVLILVAVLGAEIGSVANGNLVFPVERR----KRSLSAVRAHDVRRRGRILSAVDLNLG 59
Query: 66 GSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDI 125
G+ P GLY+ K+G+G+PP+DYYVQVDTGSDI+WVNC++C CPR+S LGI+LTLYD
Sbjct: 60 GNGLPTETGLYFTKLGLGSPPRDYYVQVDTGSDILWVNCVECSRCPRKSDLGIDLTLYDP 119
Query: 126 KDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVS 185
K S T V+CDQ+FC + GP+ C + CPY YGDGS+TTGY+VQD + Y++++
Sbjct: 120 KGSETSDVVSCDQDFCSATFDGPIPGCKSEIPCPYSITYGDGSATTGYYVQDYLTYNRIN 179
Query: 186 GDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFA 245
G+L+T+ N S+IFGCGA QSG L S++EEALDGIIGFG++NSS++SQLA+SG V+K+F+
Sbjct: 180 GNLRTSPQNSSIIFGCGAVQSGTLGSSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFS 239
Query: 246 HCLDGINGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDN 305
HCLD + GGGIFAIG VV+P+V+ TPLVP HY++ + +++V D L LP+D+F +
Sbjct: 240 HCLDNVRGGGIFAIGEVVEPKVSTTPLVPRMAHYNVVLKSIEVDTDILQLPSDIFDSVNG 299
Query: 306 KGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTF 365
KGT+IDSGTTLAYLP++VY+ L+ K++++QP LK++ V ++ CF Y+ +VD GFP V
Sbjct: 300 KGTVIDSGTTLAYLPDIVYDELIQKVLARQPGLKLYLVEQQFRCFLYTGNVDRGFPVVKL 359
Query: 366 HFENSVSLKVYPHEYLFPFED-LWCIGWQNSGMQSRDRKNMTLLGDF 411
HF++S+SL VYPH+YLF F+D +WCIGWQ S Q+++ K+MTLLGD
Sbjct: 360 HFKDSLSLTVYPHDYLFQFKDGIWCIGWQRSVAQTKNGKDMTLLGDL 406
>gi|147819672|emb|CAN76394.1| hypothetical protein VITISV_020864 [Vitis vinifera]
Length = 507
Score = 486 bits (1251), Expect = e-135, Method: Compositional matrix adjust.
Identities = 233/398 (58%), Positives = 300/398 (75%), Gaps = 11/398 (2%)
Query: 22 VSSNHGVFSVKYRYAGRERSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIG 81
+ S + VF V++++ GR +SL L+ HD RR RIL+ VDLPLGG+ P GLY+AKIG
Sbjct: 24 IVSGNAVFRVQHKFKGRGKSLDALRAHDTRRHGRILSAVDLPLGGNGHPSEAGLYFAKIG 83
Query: 82 IGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFC 141
IGTP KDYYVQVDTGSDI+WVNC C CP +S LG++LTLYD+K S+T V CD FC
Sbjct: 84 IGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMKASTTSDAVGCDDNFC 143
Query: 142 HGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGC 201
+Y GPL C C Y +YGDGSSTTGYFVQD VQY+++SG+ QTT TNG+++FGC
Sbjct: 144 -SLYDGPLPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGNFQTTPTNGTVVFGC 202
Query: 202 GARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGH 261
G +QSG L S++ EALDGI+GFG++NSSM+SQLASSG V+K+F+HCLD ++GGGIFAIG
Sbjct: 203 GNKQSGELGSSS-EALDGILGFGQANSSMLSQLASSGKVKKVFSHCLDNVDGGGIFAIGE 261
Query: 262 VVQPEVN--------KTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSG 313
VV+P+V L ++ HY++ M ++VG D L++P+D F GD KGTIIDSG
Sbjct: 262 VVEPKVRFLLMNSVMIVVLFLSRAHYNVVMKEIEVGGDPLDVPSDAFESGDRKGTIIDSG 321
Query: 314 TTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSL 373
TTLAY P+ VY PL+ KI+SQQPDL++HTV +TCF Y+ +VD+GFP VT HF+ S+SL
Sbjct: 322 TTLAYFPQEVYVPLIEKILSQQPDLRLHTVEQAFTCFDYTGNVDDGFPTVTLHFDKSISL 381
Query: 374 KVYPHEYLFPFEDL-WCIGWQNSGMQSRDRKNMTLLGD 410
VYPHEYLF ++ WCIGWQNSG Q++D K++TLLG+
Sbjct: 382 TVYPHEYLFQVKEFEWCIGWQNSGAQTKDGKDLTLLGE 419
>gi|449476186|ref|XP_004154665.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
2-like [Cucumis sativus]
Length = 478
Score = 481 bits (1238), Expect = e-133, Method: Compositional matrix adjust.
Identities = 220/385 (57%), Positives = 291/385 (75%), Gaps = 2/385 (0%)
Query: 28 VFSVKYRYAGRERSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPK 87
VF V++++ GRERSL+ LK HD RR R+L+ +DL LGG+ P GLYYA+IGIG+PP
Sbjct: 25 VFEVQHKFKGRERSLNALKSHDVRRHGRLLSVIDLELGGNGHPAETGLYYARIGIGSPPN 84
Query: 88 DYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGG 147
D++VQVDTGSDI+WVNC+ C CP++S +G++L LY+ K SST +TCDQ FC Y
Sbjct: 85 DFHVQVDTGSDILWVNCVGCSNCPKKSDIGVDLQLYNPKSSSTSTLITCDQPFCSATYDA 144
Query: 148 PLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSG 207
P+ C + C Y IYGDGS+T GYFV D +Q + G+ +T+ TNGS++FGCGA+QSG
Sbjct: 145 PIPGCKPDLLCQYKVIYGDGSATAGYFVNDYIQLQRAVGNHKTSETNGSIVFGCGAKQSG 204
Query: 208 NLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHVVQPEV 267
L S++ EALDGI+GFG++NSSMISQLA++G V+K+FAHCLD I+GGGIFAIG VV+P++
Sbjct: 205 ELGSSS-EALDGILGFGQANSSMISQLAATGKVKKIFAHCLDSISGGGIFAIGEVVEPKL 263
Query: 268 NKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPL 327
TP+VPNQ HY++ + V+VG L+LP +F +G IIDSGTTLAYLPE +Y PL
Sbjct: 264 XNTPVVPNQAHYNVVLNGVKVGDTALDLPLGLFETSYKRGAIIDSGTTLAYLPESIYLPL 323
Query: 328 VSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPF-ED 386
+ KI+ QPDLK+ TV D++TCF + ++VD+GFP VTF FE S+ L +YPHEYLF +D
Sbjct: 324 MEKILGAQPDLKLRTVDDQFTCFVFDKNVDDGFPTVTFKFEESLILTIYPHEYLFQIRDD 383
Query: 387 LWCIGWQNSGMQSRDRKNMTLLGDF 411
+WC+GWQNSG QS+D +TLLGD
Sbjct: 384 VWCVGWQNSGAQSKDGNEVTLLGDL 408
>gi|449442641|ref|XP_004139089.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 478
Score = 480 bits (1235), Expect = e-133, Method: Compositional matrix adjust.
Identities = 219/385 (56%), Positives = 291/385 (75%), Gaps = 2/385 (0%)
Query: 28 VFSVKYRYAGRERSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPK 87
VF V++++ GRERSL+ LK HD RR R+L+ +DL LGG+ P GLYYA+IGIG+PP
Sbjct: 25 VFEVQHKFKGRERSLNALKSHDVRRHGRLLSVIDLELGGNGHPAETGLYYARIGIGSPPN 84
Query: 88 DYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGG 147
D++VQVDTGSDI+WVNC+ C CP++S +G++L LY+ K SST +TCDQ FC Y
Sbjct: 85 DFHVQVDTGSDILWVNCVGCSNCPKKSDIGVDLQLYNPKSSSTSTLITCDQPFCSATYDA 144
Query: 148 PLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSG 207
P+ C + C Y IYGDGS+T GYFV D +Q + G+ +T+ TNGS++FGCGA+QSG
Sbjct: 145 PIPGCKPDLLCQYKVIYGDGSATAGYFVNDYIQLQRAVGNHKTSETNGSIVFGCGAKQSG 204
Query: 208 NLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHVVQPEV 267
L S++ EALDGI+GFG++NSSMISQLA++G V+K+FAHCLD I+GGGIFAIG VV+P++
Sbjct: 205 ELGSSS-EALDGILGFGQANSSMISQLAATGKVKKIFAHCLDSISGGGIFAIGEVVEPKL 263
Query: 268 NKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPL 327
TP+VPNQ HY++ + V+VG L+LP +F +G IIDSGTTLAYLP+ +Y PL
Sbjct: 264 KTTPVVPNQAHYNVVLNGVKVGDTALDLPLGLFETSYKRGAIIDSGTTLAYLPDSIYLPL 323
Query: 328 VSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPF-ED 386
+ KI+ QPDLK+ TV D++TCF + ++VD+GFP VTF FE S+ L +YPHEYLF +D
Sbjct: 324 MEKILGAQPDLKLRTVDDQFTCFVFDKNVDDGFPTVTFKFEESLILTIYPHEYLFQIRDD 383
Query: 387 LWCIGWQNSGMQSRDRKNMTLLGDF 411
+WC+GWQNSG QS+D +TLLGD
Sbjct: 384 VWCVGWQNSGAQSKDGNEVTLLGDL 408
>gi|147859621|emb|CAN83119.1| hypothetical protein VITISV_043393 [Vitis vinifera]
Length = 431
Score = 472 bits (1214), Expect = e-130, Method: Compositional matrix adjust.
Identities = 243/386 (62%), Positives = 286/386 (74%), Gaps = 35/386 (9%)
Query: 22 VSSNHGVFSVKYRYAGRERSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIG 81
VS+NHG FS+KY++AG++RSL+ LK HD RQ RILAGVDLPLGG+ RP+ VGLYYAKIG
Sbjct: 44 VSANHGFFSLKYKFAGQKRSLAALKAHDNSRQLRILAGVDLPLGGTGRPEAVGLYYAKIG 103
Query: 82 IGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFC 141
IGTP +DYYVQ +ELTLYDIK+S TGK V+CDQ+FC
Sbjct: 104 IGTPARDYYVQ-------------------------MELTLYDIKESLTGKLVSCDQDFC 138
Query: 142 HGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLI--- 198
+ + GGP + C AN SC Y EIY DGSS+ GYFV+ K + N L+
Sbjct: 139 YAINGGPPSYCIANMSCSYTEIYADGSSSFGYFVKGYCTASKYNSIPHLN--NNPLLEVP 196
Query: 199 FGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFA 258
C A QSG+L S EEALDGI+GFGKSN+SMISQLASSG VRKMFAHCLDG+NGGGIFA
Sbjct: 197 LRCSATQSGDLSS--EEALDGILGFGKSNTSMISQLASSGKVRKMFAHCLDGLNGGGIFA 254
Query: 259 IGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAY 318
IGH+VQP+VN TPLVPNQ HY++NM AV+VG FLNLPTDVF VGD KGTIIDSGTTLAY
Sbjct: 255 IGHIVQPKVNTTPLVPNQTHYNVNMKAVEVGGYFLNLPTDVFDVGDKKGTIIDSGTTLAY 314
Query: 319 LPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPH 378
LPE+VY+ L+SKI S Q DLKVHT+HD++TCFQYSES+D+GFP VTFHFENS+ LKV+PH
Sbjct: 315 LPEVVYDQLLSKIFSWQSDLKVHTIHDQFTCFQYSESLDDGFPAVTFHFENSLYLKVHPH 374
Query: 379 EYLFPFEDLWCIGWQNSGMQSRDRKN 404
EYLF + D IG +N + KN
Sbjct: 375 EYLFSYGD---IGEENGSICKLQMKN 397
>gi|115446115|ref|NP_001046837.1| Os02g0473200 [Oryza sativa Japonica Group]
gi|47497549|dbj|BAD19621.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
Group]
gi|47847591|dbj|BAD21978.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
Group]
gi|113536368|dbj|BAF08751.1| Os02g0473200 [Oryza sativa Japonica Group]
Length = 494
Score = 471 bits (1212), Expect = e-130, Method: Compositional matrix adjust.
Identities = 230/390 (58%), Positives = 294/390 (75%), Gaps = 7/390 (1%)
Query: 27 GVFSVKYRYA----GRERSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGI 82
GVF V+ ++ G E LS L+EHD RR R+LA +DLPLGGS GLY+ +IGI
Sbjct: 37 GVFEVQRKFTRHGDGGEGHLSALREHDGRRHGRLLAAIDLPLGGSGLATETGLYFTRIGI 96
Query: 83 GTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCH 142
GTP K YYVQVDTGSDI+WVNC+ C CPR+S+LGIELT+YD + S +G+ VTCDQ+FC
Sbjct: 97 GTPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIELTMYDPRGSQSGELVTCDQQFCV 156
Query: 143 GVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCG 202
YGG L CT+ + C Y YGDGSST G+FV D +QY++VSGD QTT N S+ FGCG
Sbjct: 157 ANYGGVLPSCTSTSPCEYSISYGDGSSTAGFFVTDFLQYNQVSGDGQTTPANASVSFGCG 216
Query: 203 ARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHV 262
A+ G+L S+N ALDGI+GFG+SNSSM+SQLA++G VRKMFAHCLD +NGGGIFAIG+V
Sbjct: 217 AKLGGDLGSSN-LALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCLDTVNGGGIFAIGNV 275
Query: 263 VQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEM 322
VQP+V TPLVP+ PHY++ + + VG L LPT++F G++KGTIIDSGTTLAY+PE
Sbjct: 276 VQPKVKTTPLVPDMPHYNVILKGIDVGGTALGLPTNIFDSGNSKGTIIDSGTTLAYVPEG 335
Query: 323 VYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLF 382
VY+ L + + + D+ V T+ D ++CFQYS SVD+GFP VTFHFE VSL V PH+YLF
Sbjct: 336 VYKALFAMVFDKHQDISVQTLQD-FSCFQYSGSVDDGFPEVTFHFEGDVSLIVSPHDYLF 394
Query: 383 PF-EDLWCIGWQNSGMQSRDRKNMTLLGDF 411
++L+C+G+QN G+Q++D K+M LLGD
Sbjct: 395 QNGKNLYCMGFQNGGVQTKDGKDMVLLGDL 424
>gi|218190722|gb|EEC73149.1| hypothetical protein OsI_07179 [Oryza sativa Indica Group]
Length = 494
Score = 468 bits (1204), Expect = e-129, Method: Compositional matrix adjust.
Identities = 229/390 (58%), Positives = 293/390 (75%), Gaps = 7/390 (1%)
Query: 27 GVFSVKYRYA----GRERSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGI 82
GVF V+ ++ G E LS L+EHD RR R+LA +DLPLGGS GLY+ +IGI
Sbjct: 37 GVFEVQRKFTRHGDGGEGHLSALREHDGRRHGRLLAAIDLPLGGSGLATETGLYFTRIGI 96
Query: 83 GTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCH 142
GTP K YYVQVDTGSDI+WVNC+ C CPR+S+LGIELT+YD + S +G+ VTCDQ+FC
Sbjct: 97 GTPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIELTMYDPRGSQSGELVTCDQQFCV 156
Query: 143 GVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCG 202
YGG L CT+ + C Y YGDGSST G+FV D +QY++VSGD QTT N S+ FGCG
Sbjct: 157 ANYGGVLPSCTSTSPCEYSISYGDGSSTAGFFVTDFLQYNQVSGDGQTTPANASVSFGCG 216
Query: 203 ARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHV 262
A+ G+L S+N ALDGI+GFG+SNSSM+SQLA++G VRKMFAHCLD +NGGGIFAIG+V
Sbjct: 217 AKLGGDLGSSN-LALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCLDTVNGGGIFAIGNV 275
Query: 263 VQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEM 322
VQP+V TPLV + PHY++ + + VG L LPT++F G++KGTIIDSGTTLAY+PE
Sbjct: 276 VQPKVKTTPLVSDMPHYNVILKGIDVGGTALGLPTNIFDSGNSKGTIIDSGTTLAYVPEG 335
Query: 323 VYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLF 382
VY+ L + + + D+ V T+ D ++CFQYS SVD+GFP VTFHFE VSL V PH+YLF
Sbjct: 336 VYKALFAMVFDKHQDISVQTLQD-FSCFQYSGSVDDGFPEVTFHFEGDVSLIVSPHDYLF 394
Query: 383 PF-EDLWCIGWQNSGMQSRDRKNMTLLGDF 411
++L+C+G+QN G+Q++D K+M LLGD
Sbjct: 395 QNGKNLYCMGFQNGGVQTKDGKDMVLLGDL 424
>gi|388518257|gb|AFK47190.1| unknown [Lotus japonicus]
Length = 478
Score = 464 bits (1193), Expect = e-128, Method: Compositional matrix adjust.
Identities = 217/392 (55%), Positives = 293/392 (74%), Gaps = 6/392 (1%)
Query: 22 VSSNHGVFSVKYRYAGRERSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIG 81
V++ + VF V+ R A SL+ +K HD+ R+ RIL+ VD LGG+ P GLY+ KIG
Sbjct: 19 VANANLVFPVQRRQA----SLTGIKAHDSSRRGRILSAVDFNLGGNGLPTVTGLYFTKIG 74
Query: 82 IGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFC 141
+G+P KDYYVQVDTGSDI+WVNC++C CPR+S +GI LTLYD K S T +FV+C+ FC
Sbjct: 75 LGSPSKDYYVQVDTGSDILWVNCVECTRCPRKSDIGIGLTLYDPKRSKTSEFVSCEHNFC 134
Query: 142 HGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGC 201
Y G + C A CPY YGDGS+TTGY+VQD + +++V+G+ T + N S+IFGC
Sbjct: 135 SSTYEGRILGCKAENPCPYSISYGDGSATTGYYVQDYLTFNRVNGNPHTATQNSSIIFGC 194
Query: 202 GARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGH 261
GA QSG S++EEALDGIIGFG++NSS++SQLA+SG V+K+F+HCLD GGGIF+IG
Sbjct: 195 GAAQSGTFASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCLDTNVGGGIFSIGE 254
Query: 262 VVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPE 321
VV+P+V TPLVPN HY++ + ++V D L LP+D F + KGT+IDSGTTLAYLP
Sbjct: 255 VVEPKVKTTPLVPNMAHYNVILKNIEVDGDILQLPSDTFDSENGKGTVIDSGTTLAYLPR 314
Query: 322 MVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYL 381
+VY+ L+SK++++QP LKV+ V ++Y+CFQY+ +VD GFP V HFE+S+SL VYPH+YL
Sbjct: 315 IVYDQLMSKVLAKQPRLKVYLVEEQYSCFQYTGNVDSGFPIVKLHFEDSLSLTVYPHDYL 374
Query: 382 FPF--EDLWCIGWQNSGMQSRDRKNMTLLGDF 411
F + + WCIGWQ S ++++ K+MTLLGDF
Sbjct: 375 FNYKGDSYWCIGWQKSASETKNGKDMTLLGDF 406
>gi|242057809|ref|XP_002458050.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
gi|241930025|gb|EES03170.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
Length = 489
Score = 463 bits (1192), Expect = e-128, Method: Compositional matrix adjust.
Identities = 218/391 (55%), Positives = 290/391 (74%), Gaps = 8/391 (2%)
Query: 27 GVFSVKYRY-----AGRERSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIG 81
GVF V+ ++ G ++S L+ HD RR R+LA DLPLGG P GLY+ +I
Sbjct: 30 GVFQVRRKFPAGVGGGASANISALRVHDGRRHGRLLAAADLPLGGLGLPTDTGLYFTEIK 89
Query: 82 IGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFC 141
+GTPPK YYVQVDTGSDI+WVNCI C++CPR+S LG++LT YD K SS+G V+CDQ FC
Sbjct: 90 LGTPPKRYYVQVDTGSDILWVNCISCEKCPRKSGLGLDLTFYDPKASSSGSTVSCDQGFC 149
Query: 142 HGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGC 201
YGG L CTAN C Y +YGDGSSTTG+FV D +Q+D+V+GD QT N ++ FGC
Sbjct: 150 AATYGGKLPGCTANVPCEYSVMYGDGSSTTGFFVTDALQFDQVTGDGQTQPGNATVTFGC 209
Query: 202 GARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGH 261
GA+Q G+L S+N +ALDGI+GFG++N+SM+SQLA++G V+K+FAHCLD I GGGIFAIG+
Sbjct: 210 GAQQGGDLGSSN-QALDGILGFGQANTSMLSQLAAAGKVKKIFAHCLDTIKGGGIFAIGN 268
Query: 262 VVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPE 321
VVQP+V TPLV + PHY++N+ ++ VG L LP VF G+ KGTIIDSGTTL YLPE
Sbjct: 269 VVQPKVKTTPLVADMPHYNVNLKSIDVGGTTLQLPAHVFETGERKGTIIDSGTTLTYLPE 328
Query: 322 MVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYL 381
+V++ +++ I ++ D+ H V D + CFQY SVD+GFP +TFHFE+ ++L VYPHEY
Sbjct: 329 LVFKEVMAAIFNKHQDIVFHNVQD-FMCFQYPGSVDDGFPTITFHFEDDLALHVYPHEYF 387
Query: 382 FPF-EDLWCIGWQNSGMQSRDRKNMTLLGDF 411
FP D++C+G+QN +QS+D K++ L+GD
Sbjct: 388 FPNGNDMYCVGFQNGALQSKDGKDIVLMGDL 418
>gi|357148754|ref|XP_003574882.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 488
Score = 461 bits (1185), Expect = e-127, Method: Compositional matrix adjust.
Identities = 227/410 (55%), Positives = 296/410 (72%), Gaps = 12/410 (2%)
Query: 9 LCIVLIATAAVGGVSSNHGVFSVKYRY------AGRERSLSLLKEHDARRQQRILAGVDL 62
L +L+A + GV + VF V+ ++ G + + L HD+ R+ R+LA D+
Sbjct: 13 LMAMLLAVVSSHGVGAT-SVFQVRRKFPRLGSKGGGDITAHL--THDSNRRGRLLAAADV 69
Query: 63 PLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTL 122
PLGG P GLYY +I IGTPPK Y+VQVDTGSDI+WVNCI C +CPR+S LGI+L L
Sbjct: 70 PLGGLGLPTDTGLYYTEIEIGTPPKQYHVQVDTGSDILWVNCISCNKCPRKSDLGIDLRL 129
Query: 123 YDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYD 182
YD K SS+G V+CDQ+FC YGG L C N C Y +YGDGSSTTGYFV D +QY+
Sbjct: 130 YDPKGSSSGSTVSCDQKFCAATYGGKLPGCAKNIPCEYSVMYGDGSSTTGYFVSDSLQYN 189
Query: 183 KVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRK 242
+VSGD QT N S+IFGCGA+Q G+L STN +ALDGIIGFG+SN+SM+SQLA++G V+K
Sbjct: 190 QVSGDGQTRHANASVIFGCGAQQGGDLGSTN-QALDGIIGFGQSNTSMLSQLAAAGEVKK 248
Query: 243 MFAHCLDGINGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGV 302
+F+HCLD I GGGIFAIG VVQP+V TPLVP+ PHY++N+ ++ VG L LP+ +F
Sbjct: 249 IFSHCLDTIKGGGIFAIGDVVQPKVKSTPLVPDMPHYNVNLESINVGGTTLQLPSHMFET 308
Query: 303 GDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPN 362
G+ KGTIIDSGTTL YLPE+VY+ +++ + ++ PD H+V D + C QY +SVD+GFP
Sbjct: 309 GEKKGTIIDSGTTLTYLPELVYKDVLAAVFAKHPDTTFHSVQD-FLCIQYFQSVDDGFPK 367
Query: 363 VTFHFENSVSLKVYPHEYLFPFED-LWCIGWQNSGMQSRDRKNMTLLGDF 411
+TFHFE+ + L VYPH+Y F D L+C G+QN G+QS+D K+M LLGD
Sbjct: 368 ITFHFEDDLGLNVYPHDYFFQNGDNLYCFGFQNGGLQSKDGKDMVLLGDL 417
>gi|38344991|emb|CAE01597.2| OSJNBa0008A08.5 [Oryza sativa Japonica Group]
gi|116309515|emb|CAH66581.1| OSIGBa0137O04.7 [Oryza sativa Indica Group]
gi|222628622|gb|EEE60754.1| hypothetical protein OsJ_14310 [Oryza sativa Japonica Group]
Length = 494
Score = 457 bits (1177), Expect = e-126, Method: Compositional matrix adjust.
Identities = 218/367 (59%), Positives = 275/367 (74%), Gaps = 3/367 (0%)
Query: 46 KEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCI 105
+ HD R+ R+LA D+PLGG P GLYY +IGIGTP K YYVQVDTGSDI+WVNCI
Sbjct: 59 RAHDGSRRGRLLAAADIPLGGLGLPTDTGLYYTEIGIGTPTKRYYVQVDTGSDILWVNCI 118
Query: 106 QCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYG 165
C CPR+S LG+ELTLYD KDSSTG V+CDQ FC YGG L CT + C Y YG
Sbjct: 119 SCDRCPRKSGLGLELTLYDPKDSSTGSKVSCDQGFCAATYGGLLPGCTTSLPCEYSVTYG 178
Query: 166 DGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGK 225
DGSSTTGYFV D++Q+D+VSGD QT N ++ FGCG++Q G+L S+N +ALDGIIGFG+
Sbjct: 179 DGSSTTGYFVSDLLQFDQVSGDGQTRPANSTVTFGCGSQQGGDLGSSN-QALDGIIGFGQ 237
Query: 226 SNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTA 285
SN+SM+SQL+++G V+K+FAHCLD INGGGIFAIG+VVQP+V TPLVPN PHY++N+ +
Sbjct: 238 SNTSMLSQLSAAGKVKKIFAHCLDTINGGGIFAIGNVVQPKVKTTPLVPNMPHYNVNLKS 297
Query: 286 VQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHD 345
+ VG L LP+ +F G+ KGTIIDSGTTL YLPE+VY+ ++ + ++ D+ H V
Sbjct: 298 IDVGGTALKLPSHMFDTGEKKGTIIDSGTTLTYLPEIVYKEIMLAVFAKHKDITFHNVQ- 356
Query: 346 EYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFED-LWCIGWQNSGMQSRDRKN 404
E+ CFQY VD+ FP +TFHFEN + L VYPH+Y F D L+C+G+QN G+QS+D K
Sbjct: 357 EFLCFQYVGRVDDDFPKITFHFENDLPLNVYPHDYFFENGDNLYCVGFQNGGLQSKDGKG 416
Query: 405 MTLLGDF 411
M LLGD
Sbjct: 417 MVLLGDL 423
>gi|222622847|gb|EEE56979.1| hypothetical protein OsJ_06707 [Oryza sativa Japonica Group]
Length = 494
Score = 457 bits (1177), Expect = e-126, Method: Compositional matrix adjust.
Identities = 224/382 (58%), Positives = 287/382 (75%), Gaps = 7/382 (1%)
Query: 27 GVFSVKYRYA----GRERSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGI 82
GVF V+ ++ G E LS L+EHD RR R+LA +DLPLGGS GLY+ +IGI
Sbjct: 37 GVFEVQRKFTRHGDGGEGHLSALREHDGRRHGRLLAAIDLPLGGSGLATETGLYFTRIGI 96
Query: 83 GTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCH 142
GTP K YYVQVDTGSDI+WVNC+ C CPR+S+LGIELT+YD + S +G+ VTCDQ+FC
Sbjct: 97 GTPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIELTMYDPRGSQSGELVTCDQQFCV 156
Query: 143 GVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCG 202
YGG L CT+ + C Y YGDGSST G+FV D +QY++VSGD QTT N S+ FGCG
Sbjct: 157 ANYGGVLPSCTSTSPCEYSISYGDGSSTAGFFVTDFLQYNQVSGDGQTTPANASVSFGCG 216
Query: 203 ARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHV 262
A+ G+L S+N ALDGI+GFG+SNSSM+SQLA++G VRKMFAHCLD +NGGGIFAIG+V
Sbjct: 217 AKLGGDLGSSN-LALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCLDTVNGGGIFAIGNV 275
Query: 263 VQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEM 322
VQP+V TPLVP+ PHY++ + + VG L LPT++F G++KGTIIDSGTTLAY+PE
Sbjct: 276 VQPKVKTTPLVPDMPHYNVILKGIDVGGTALGLPTNIFDSGNSKGTIIDSGTTLAYVPEG 335
Query: 323 VYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLF 382
VY+ L + + + D+ V T+ D ++CFQYS SVD+GFP VTFHFE VSL V PH+YLF
Sbjct: 336 VYKALFAMVFDKHQDISVQTLQD-FSCFQYSGSVDDGFPEVTFHFEGDVSLIVSPHDYLF 394
Query: 383 PF-EDLWCIGWQNSGMQSRDRK 403
++L+C+G+QN G +++D K
Sbjct: 395 QNGKNLYCMGFQNGGGKTKDGK 416
>gi|356531884|ref|XP_003534506.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 482
Score = 452 bits (1162), Expect = e-124, Method: Compositional matrix adjust.
Identities = 208/387 (53%), Positives = 282/387 (72%), Gaps = 5/387 (1%)
Query: 28 VFSVKYRYAGRERSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPK 87
VF V ++ G +L+ +K HDA R+ R L+ VDL LGG+ RP GLYY KIG+G P
Sbjct: 29 VFPVVRKFKGPAENLAAIKAHDAGRRGRFLSVVDLALGGNGRPTSTGLYYTKIGLG--PN 86
Query: 88 DYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGG 147
DYYVQVDTGSD +WVNC+ C CP++S LG+ELTLYD S T K V CD EFC Y G
Sbjct: 87 DYYVQVDTGSDTLWVNCVGCTTCPKKSGLGMELTLYDPNSSKTSKVVPCDDEFCTSTYDG 146
Query: 148 PLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSG 207
P++ C + SCPY YGDGS+T+G +++D + +D+V GDL+T N S+IFGCG++QSG
Sbjct: 147 PISGCKKDMSCPYSITYGDGSTTSGSYIKDDLTFDRVVGDLRTVPDNTSVIFGCGSKQSG 206
Query: 208 NLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHVVQPEV 267
L ST + +LDGIIGFG++NSS++SQLA++G V+++F+HCLD +NGGGIFAIG VVQP+V
Sbjct: 207 TLSSTTDTSLDGIIGFGQANSSVLSQLAAAGKVKRVFSHCLDTVNGGGIFAIGEVVQPKV 266
Query: 268 NKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPL 327
TPLVP HY++ + ++V D + LPTD+F +GTIIDSGTTLAYLP +Y+ L
Sbjct: 267 KTTPLVPRMAHYNVVLKDIEVAGDPIQLPTDIFDSTSGRGTIIDSGTTLAYLPVSIYDQL 326
Query: 328 VSKIISQQPDLKVHTVHDEYTCFQYSE--SVDEGFPNVTFHFENSVSLKVYPHEYLFPF- 384
+ K ++Q+ ++++ V D++TCF YS+ S+D+ FP V F FE ++L YPH+YLFPF
Sbjct: 327 LEKTLAQRSGMELYLVEDQFTCFHYSDEKSLDDAFPTVKFTFEEGLTLTAYPHDYLFPFK 386
Query: 385 EDLWCIGWQNSGMQSRDRKNMTLLGDF 411
ED+WCIGWQ S Q++D K++ LLGD
Sbjct: 387 EDMWCIGWQKSTAQTKDGKDLILLGDL 413
>gi|326512066|dbj|BAJ96014.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 485
Score = 451 bits (1159), Expect = e-124, Method: Compositional matrix adjust.
Identities = 223/389 (57%), Positives = 289/389 (74%), Gaps = 6/389 (1%)
Query: 27 GVFSVKY---RYAGRERSLSLLKEHDARRQQRILAG-VDLPLGGSSRPDGVGLYYAKIGI 82
GVF V+ R+ G + L+ L+ HDARR R LA VDLPLGG+ P GLY+ +IGI
Sbjct: 28 GVFEVRRKFPRHDGSGKHLANLRAHDARRHGRSLAAAVDLPLGGNGLPTETGLYFTQIGI 87
Query: 83 GTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCH 142
GTP K YYVQVDTGSDI+WVNC+ C CPR+S LGIELTLYD SS+G VTC Q+FC
Sbjct: 88 GTPAKSYYVQVDTGSDILWVNCVFCDTCPRKSGLGIELTLYDPSGSSSGTGVTCGQDFCV 147
Query: 143 GVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCG 202
+GG + C C Y YGDGSSTTG+FV D +QY++VSG+ QTT N S+ FGCG
Sbjct: 148 ATHGGVIPSCVPAAPCQYSISYGDGSSTTGFFVTDFLQYNQVSGNSQTTLANTSITFGCG 207
Query: 203 ARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHV 262
A+ G+L S++ +ALDGI+GFG+SNSSM+SQLA++G VRK+FAHCLD INGGGIFAIG V
Sbjct: 208 AKIGGDLGSSS-QALDGILGFGQSNSSMLSQLAAAGKVRKVFAHCLDTINGGGIFAIGDV 266
Query: 263 VQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEM 322
VQP+V+ TPLVP PHY++N+ A+ VG L LPT++F +G++KGTIIDSGTTLAYLP +
Sbjct: 267 VQPKVSTTPLVPGMPHYNVNLEAIDVGGVKLQLPTNIFDIGESKGTIIDSGTTLAYLPGV 326
Query: 323 VYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLF 382
VY ++SK+ +Q D+ + D + CF+YS SVD+GFP +TFHFE + L ++PH+YLF
Sbjct: 327 VYNAIMSKVFAQYGDMPLKNDQD-FQCFRYSGSVDDGFPIITFHFEGGLPLNIHPHDYLF 385
Query: 383 PFEDLWCIGWQNSGMQSRDRKNMTLLGDF 411
+L+C+G+Q G+Q++D K+M LLGD
Sbjct: 386 QNGELYCMGFQTGGLQTKDGKDMVLLGDL 414
>gi|242079765|ref|XP_002444651.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
gi|241941001|gb|EES14146.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
Length = 493
Score = 449 bits (1155), Expect = e-123, Method: Compositional matrix adjust.
Identities = 215/405 (53%), Positives = 291/405 (71%), Gaps = 11/405 (2%)
Query: 14 IATAAVGGVSSNHGVFSVKYRY------AGRERSLSLLKEHDARRQQRILAGVDLPLGGS 67
+A +A G ++ GVF V+ ++ ++S L+ HD R R+LA DLPLGG
Sbjct: 22 VAGSAPGATAT--GVFQVRRKFPVGVGGGAAGANISALRAHDGTRHGRLLATADLPLGGL 79
Query: 68 SRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKD 127
P GLYY ++ +GTPPK +YVQVDTGSDI+WVNCI C +CP +S LG++LTLYD K
Sbjct: 80 GLPTDTGLYYTEVRLGTPPKRFYVQVDTGSDILWVNCITCDQCPHKSGLGLDLTLYDPKA 139
Query: 128 SSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGD 187
SSTG V CDQ FC +GG L C+AN C Y YGDGSST G FV D +Q+D+V+GD
Sbjct: 140 SSTGSTVMCDQGFCADTFGGRLPKCSANVPCEYSVTYGDGSSTVGSFVNDALQFDQVTGD 199
Query: 188 LQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHC 247
QT N S+IFGCGA+Q G+L S++ +ALDGI+GFG++N+SM+SQLA++G V+K+FAHC
Sbjct: 200 GQTQPANASVIFGCGAQQGGDLGSSS-QALDGILGFGEANTSMLSQLATAGKVKKIFAHC 258
Query: 248 LDGINGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKG 307
LD I GGGIFAIG VVQP+V TPLV ++PHY++N+ + VG L LP D+F G+ +G
Sbjct: 259 LDTIKGGGIFAIGDVVQPKVKTTPLVADKPHYNVNLKTIDVGGTTLELPADIFKPGEKRG 318
Query: 308 TIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHF 367
TIIDSGTTL YLPE+V++ ++ + ++ D+ H V D + CF+YS SVD+GFP +TFHF
Sbjct: 319 TIIDSGTTLTYLPELVFKKVMLAVFNKHQDITFHDVQD-FLCFEYSGSVDDGFPTLTFHF 377
Query: 368 ENSVSLKVYPHEYLFPF-EDLWCIGWQNSGMQSRDRKNMTLLGDF 411
E+ ++L VYPHEY FP D++C+G+QN +QS+D K++ L+GD
Sbjct: 378 EDDLALHVYPHEYFFPNGNDVYCVGFQNGALQSKDGKDIVLMGDL 422
>gi|15232960|ref|NP_186923.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|6728988|gb|AAF26986.1|AC018363_31 putative aspartyl protease [Arabidopsis thaliana]
gi|21593593|gb|AAM65560.1| putative aspartyl protease [Arabidopsis thaliana]
gi|332640332|gb|AEE73853.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 488
Score = 447 bits (1149), Expect = e-123, Method: Compositional matrix adjust.
Identities = 210/396 (53%), Positives = 286/396 (72%), Gaps = 7/396 (1%)
Query: 18 AVGGVSSNHGVFSVKYRYAG-RERSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLY 76
+ +S + VF V+ ++AG R + L L+ HD R R+L+ +D+PLGG S+P+ +GLY
Sbjct: 26 STAATASENLVFEVRSKFAGKRVKDLGALRAHDVHRHSRLLSAIDIPLGGDSQPESIGLY 85
Query: 77 YAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTC 136
+AKIG+GTP +D++VQVDTGSDI+WVNC C CPR+S L +ELT YD+ SST K V+C
Sbjct: 86 FAKIGLGTPSRDFHVQVDTGSDILWVNCAGCIRCPRKSDL-VELTPYDVDASSTAKSVSC 144
Query: 137 DQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGS 196
FC Y ++C + ++C Y+ +YGDGSST GY V+DVV D V+G+ QT STNG+
Sbjct: 145 SDNFCS--YVNQRSECHSGSTCQYVIMYGDGSSTNGYLVKDVVHLDLVTGNRQTGSTNGT 202
Query: 197 LIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGI 256
+IFGCG++QSG L + + A+DGI+GFG+SNSS ISQLAS G V++ FAHCLD NGGGI
Sbjct: 203 IIFGCGSKQSGQLGES-QAAVDGIMGFGQSNSSFISQLASQGKVKRSFAHCLDNNNGGGI 261
Query: 257 FAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTL 316
FAIG VV P+V TP++ HYS+N+ A++VG L L ++ F GD+KG IIDSGTTL
Sbjct: 262 FAIGEVVSPKVKTTPMLSKSAHYSVNLNAIEVGNSVLELSSNAFDSGDDKGVIIDSGTTL 321
Query: 317 AYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVY 376
YLP+ VY PL+++I++ P+L +HTV + +TCF Y++ +D FP VTF F+ SVSL VY
Sbjct: 322 VYLPDAVYNPLLNEILASHPELTLHTVQESFTCFHYTDKLDR-FPTVTFQFDKSVSLAVY 380
Query: 377 PHEYLFPF-EDLWCIGWQNSGMQSRDRKNMTLLGDF 411
P EYLF ED WC GWQN G+Q++ ++T+LGD
Sbjct: 381 PREYLFQVREDTWCFGWQNGGLQTKGGASLTILGDM 416
>gi|297828736|ref|XP_002882250.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297328090|gb|EFH58509.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 488
Score = 446 bits (1148), Expect = e-123, Method: Compositional matrix adjust.
Identities = 219/412 (53%), Positives = 293/412 (71%), Gaps = 11/412 (2%)
Query: 3 LCLRNCLCIVL-IATAAVGGVSSNHGVFSVKYRYAG-RERSLSLLKEHDARRQQRILAGV 60
+ L L I L ++TAA + ++ VF V+ ++AG RE+ L L+ HD R R+L+ +
Sbjct: 13 ILLSAALLIELQLSTAAT---APDNLVFQVRSKFAGKREKDLGALRAHDVHRHSRLLSAI 69
Query: 61 DLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIEL 120
DLPLGG S+P+ +GLY+AKIG+GTP +D++VQVDTGSDI+WVNC C CPR+S L +EL
Sbjct: 70 DLPLGGDSQPESIGLYFAKIGLGTPSRDFHVQVDTGSDILWVNCAGCIRCPRKSDL-VEL 128
Query: 121 TLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQ 180
T YD SST K V+C FC Y ++C + ++C Y+ +YGDGSST GY V+DVV
Sbjct: 129 TPYDADASSTAKSVSCSDNFCS--YVNQRSECHSGSTCQYVILYGDGSSTNGYLVRDVVH 186
Query: 181 YDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGV 240
D V+G+ QT STNG++IFGCG++QSG L + + A+DGI+GFG+SNSS ISQLAS G V
Sbjct: 187 LDLVTGNRQTGSTNGTIIFGCGSKQSGQLGES-QAAVDGIMGFGQSNSSFISQLASQGKV 245
Query: 241 RKMFAHCLDGINGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVF 300
++ FAHCLD NGGGIFAIG VV P+V TP++ HYS+N+ A++VG L L +D F
Sbjct: 246 KRSFAHCLDNNNGGGIFAIGEVVSPKVKTTPMLSKSAHYSVNLNAIEVGNSVLQLSSDAF 305
Query: 301 GVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGF 360
GD+KG IIDSGTTL YLP+ VY PL+++I++ +L +HTV D +TCF Y + +D F
Sbjct: 306 DSGDDKGVIIDSGTTLVYLPDAVYNPLMNQILASHQELNLHTVQDSFTCFHYIDRLDR-F 364
Query: 361 PNVTFHFENSVSLKVYPHEYLFPF-EDLWCIGWQNSGMQSRDRKNMTLLGDF 411
P VTF F+ SVSL VYP EYLF ED WC GWQN G+Q++ ++T+LGD
Sbjct: 365 PTVTFQFDKSVSLAVYPQEYLFQVREDTWCFGWQNGGLQTKGGASLTILGDM 416
>gi|357142295|ref|XP_003572524.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 494
Score = 443 bits (1139), Expect = e-122, Method: Compositional matrix adjust.
Identities = 225/401 (56%), Positives = 286/401 (71%), Gaps = 15/401 (3%)
Query: 21 GVSSNHGVFSVKYRYA-------GRERSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGV 73
G ++ GVF V+ + G E L+ L++HD RR +L VDLPLGG+ P
Sbjct: 30 GRAAATGVFQVRRNFPRHQGNGPGGEEHLAALRKHDGRR---LLTAVDLPLGGNGIPTDT 86
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKF 133
GLY+ +IGIGTP K YYVQVDTGSDI+WVNCI C CPR+S LGI+LTLYD S++ K
Sbjct: 87 GLYFTQIGIGTPSKGYYVQVDTGSDILWVNCISCDSCPRKSGLGIDLTLYDPTASASSKT 146
Query: 134 VTCDQEFCH-GVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTS 192
VTC QEFC GG C AN+ C Y YGDGSSTTG+FV D +QYD+VSGD QT
Sbjct: 147 VTCGQEFCATATNGGVPPSCAANSPCQYSITYGDGSSTTGFFVADFLQYDQVSGDGQTNL 206
Query: 193 TNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGIN 252
N S+ FGCGA+ G L S+N ALDGI+GFG++NSSM+SQL S+G V K+F+HCLD +N
Sbjct: 207 ANASVTFGCGAKIGGALGSSN-VALDGILGFGQANSSMLSQLTSAGKVTKIFSHCLDTVN 265
Query: 253 GGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGV-GDNKGTIID 311
GGGIFAIG+VVQP+V TPLVP PHY++ + + VG L LPT++F + G ++GTIID
Sbjct: 266 GGGIFAIGNVVQPKVKTTPLVPGMPHYNVVLKTIDVGGSTLQLPTNIFDIGGGSRGTIID 325
Query: 312 SGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSV 371
SGTTLAYLPE+VY+ ++S + S PD+ + V D + CFQYS SVD GFP VTFHF+ +
Sbjct: 326 SGTTLAYLPEVVYKAVLSAVFSNHPDVTLKNVQD-FLCFQYSGSVDNGFPEVTFHFDGDL 384
Query: 372 SLKVYPHEYLFP-FEDLWCIGWQNSGMQSRDRKNMTLLGDF 411
L VYPH+YLF ED++C+G+Q+ G+QS+D K+M LLGD
Sbjct: 385 PLVVYPHDYLFQNTEDVYCVGFQSGGVQSKDGKDMVLLGDL 425
>gi|293332531|ref|NP_001169558.1| uncharacterized protein LOC100383437 precursor [Zea mays]
gi|224030089|gb|ACN34120.1| unknown [Zea mays]
gi|413925069|gb|AFW65001.1| hypothetical protein ZEAMMB73_160528 [Zea mays]
Length = 491
Score = 440 bits (1131), Expect = e-121, Method: Compositional matrix adjust.
Identities = 209/391 (53%), Positives = 277/391 (70%), Gaps = 8/391 (2%)
Query: 27 GVFSVKYRYAGRER-----SLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIG 81
G+F V+ ++ ++S L+ HD R R+LA DLPLGG P GLYY +I
Sbjct: 32 GIFQVRRKFTAGVGGGAGANISALRAHDGTRHGRLLAAADLPLGGLGLPTDTGLYYTEIK 91
Query: 82 IGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFC 141
+GTPPK YYVQVDTGSDI+WVNCI C++CP +S LG++LTLYD K SSTG V CDQ FC
Sbjct: 92 LGTPPKHYYVQVDTGSDILWVNCITCEQCPHKSGLGLDLTLYDPKASSTGSMVMCDQAFC 151
Query: 142 HGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGC 201
+GG L C AN C Y YGDGSST G FV D +Q+D+V+ D QT N S+IFGC
Sbjct: 152 AATFGGKLPKCGANVPCEYSVTYGDGSSTIGSFVTDALQFDQVTRDGQTQPANASVIFGC 211
Query: 202 GARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGH 261
GA+Q G+L S+N +ALDGI+GFG++N+SM+SQL ++G V+K+FAHCLD I GGGIF+IG
Sbjct: 212 GAQQGGDLGSSN-QALDGILGFGEANTSMLSQLTTAGKVKKIFAHCLDTIKGGGIFSIGD 270
Query: 262 VVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPE 321
VVQP+V TPLV ++PHY++N+ + VG L LP +F G+ KGTIIDSGTTL YLPE
Sbjct: 271 VVQPKVKTTPLVADKPHYNVNLKTIDVGGTTLQLPAHIFEPGEKKGTIIDSGTTLTYLPE 330
Query: 322 MVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYL 381
+V++ ++ + ++ D+ H V + CFQY SVD+GFP +TFHFE+ ++L VYPHEY
Sbjct: 331 LVFKEVMLAVFNKHQDITFHDVQG-FLCFQYPGSVDDGFPTITFHFEDDLALHVYPHEYF 389
Query: 382 FPF-EDLWCIGWQNSGMQSRDRKNMTLLGDF 411
F D++C+G+QN QS+D K++ L+GD
Sbjct: 390 FANGNDVYCVGFQNGASQSKDGKDIVLMGDL 420
>gi|357507803|ref|XP_003624190.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355499205|gb|AES80408.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 476
Score = 439 bits (1130), Expect = e-121, Method: Compositional matrix adjust.
Identities = 206/402 (51%), Positives = 287/402 (71%), Gaps = 2/402 (0%)
Query: 11 IVLIATAAVGGVSSNHGVFSVKYRYAGRERSLSLLKEHDARRQQRILAGVDLPLGGSSRP 70
++LI S+ + VF V+ ++ G RSL +K HD RR+ R LA +D+PLGG+ P
Sbjct: 7 LILIVFLLFVDASNANLVFPVQRKFNGPHRSLDAIKAHDDRRRGRFLAAIDVPLGGNGLP 66
Query: 71 DGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSST 130
GLYY K+G+G+P K++YVQVDTGSDI+WVNC C CP++S LG++LTLYD S T
Sbjct: 67 SSTGLYYTKVGLGSPAKEFYVQVDTGSDILWVNCAGCTACPKKSGLGMDLTLYDPNGSKT 126
Query: 131 GKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQT 190
V C FC Y GP++ C + SCPY YGDGS+T+G FV D + +D+VSG+L T
Sbjct: 127 SNAVPCGDGFCTDTYSGPISGCKQDMSCPYSITYGDGSTTSGSFVNDSLTFDEVSGNLHT 186
Query: 191 TSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDG 250
N S+IFGCGA+QSG+L S ++EALDGIIGFG++NSS++SQLA+SG V+++F+HCLD
Sbjct: 187 KPDNSSVIFGCGAKQSGSLSSNSDEALDGIIGFGQANSSVLSQLAASGKVKRIFSHCLDS 246
Query: 251 INGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTII 310
+GGGIF+IG V++P+ N TPLVP HY++ + + V + + LP +F G +GTII
Sbjct: 247 HHGGGIFSIGQVMEPKFNTTPLVPRMAHYNVILKDMDVDGEPILLPLYLFDSGSGRGTII 306
Query: 311 DSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENS 370
DSGTTLAYLP +Y L+ K++ +QP LK+ V D++TCF YS+ +DEGFP V FHFE
Sbjct: 307 DSGTTLAYLPLSIYNQLLPKVLGRQPGLKLMIVEDQFTCFHYSDKLDEGFPVVKFHFEG- 365
Query: 371 VSLKVYPHEYLFPF-EDLWCIGWQNSGMQSRDRKNMTLLGDF 411
+SL V+PH+YLF + ED++CIGWQ S Q+++ +++ L+GD
Sbjct: 366 LSLTVHPHDYLFLYKEDIYCIGWQKSSTQTKEGRDLILIGDL 407
>gi|356568507|ref|XP_003552452.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 481
Score = 437 bits (1123), Expect = e-120, Method: Compositional matrix adjust.
Identities = 209/412 (50%), Positives = 289/412 (70%), Gaps = 7/412 (1%)
Query: 5 LRNCLCIVLIATAAVGGVSSNHG--VFSVKYRYAGRERSLSLLKEHDARRQQRILAGVDL 62
LR L ++L+ + V + + VF V ++ G +L+ +K HDA R+ R L+ VD+
Sbjct: 3 LRESLVLLLVGSFVVQFCCNANANLVFPVVRKFKGPVENLAAIKAHDAGRRGRFLSVVDV 62
Query: 63 PLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTL 122
LGG+ RP GLYY KIG+G PKDYYVQVDTGSD +WVNC+ C CP++S LG++LTL
Sbjct: 63 ALGGNGRPTSNGLYYTKIGLG--PKDYYVQVDTGSDTLWVNCVGCTACPKKSGLGMDLTL 120
Query: 123 YDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYD 182
YD S T K V CD EFC Y G ++ CT SCPY YGDGS+T+G +++D + +D
Sbjct: 121 YDPNLSKTSKAVPCDDEFCTSTYDGQISGCTKGMSCPYSITYGDGSTTSGSYIKDDLTFD 180
Query: 183 KVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRK 242
+V GDL+T N S+IFGCG++QSG L ST + +LDGIIGFG++NSS++SQLA++G V++
Sbjct: 181 RVVGDLRTVPDNTSVIFGCGSKQSGTLSSTTDTSLDGIIGFGQANSSVLSQLAAAGKVKR 240
Query: 243 MFAHCLDGINGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGV 302
+F+HCLD I+GGGIFAIG VVQP+V TPL+ HY++ + ++V D + LP+D+
Sbjct: 241 IFSHCLDSISGGGIFAIGEVVQPKVKTTPLLQGMAHYNVVLKDIEVAGDPIQLPSDILDS 300
Query: 303 GDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYS--ESVDEGF 360
+GTIIDSGTTLAYLP +Y+ L+ KI++Q+ +K++ V D++TCF YS ESVD+ F
Sbjct: 301 SSGRGTIIDSGTTLAYLPVSIYDQLLEKILAQRSGMKLYLVEDQFTCFHYSDEESVDDLF 360
Query: 361 PNVTFHFENSVSLKVYPHEYLFPF-EDLWCIGWQNSGMQSRDRKNMTLLGDF 411
P V F FE ++L YP +YLF F ED+WC+GWQ S Q++D K + LLGD
Sbjct: 361 PTVKFTFEEGLTLTTYPRDYLFLFKEDMWCVGWQKSMAQTKDGKELILLGDL 412
>gi|414881575|tpg|DAA58706.1| TPA: hypothetical protein ZEAMMB73_168363 [Zea mays]
Length = 506
Score = 436 bits (1122), Expect = e-120, Method: Compositional matrix adjust.
Identities = 214/417 (51%), Positives = 291/417 (69%), Gaps = 26/417 (6%)
Query: 19 VGGVS--SNHGVFSVKYRYAG-----RERSLSLLKEHDARRQQRILAGVDLPLGGSSRPD 71
VG VS + G+F V+ + ++S L+ HD RR R+LA DLPLGG P
Sbjct: 23 VGSVSGAAAAGIFRVRRKLPAGVGGDTGANISALRAHDGRRHGRLLAAADLPLGGLGLPT 82
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
GLY+ +I +GTPPK YYVQVDTGSDI+WVNCI C +CPR+S LG++LT YD K SS+G
Sbjct: 83 DTGLYFTEIKLGTPPKRYYVQVDTGSDILWVNCISCSKCPRKSGLGLDLTFYDPKASSSG 142
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
V+CDQ FC YGG L CTAN C Y +YGDGSSTTG+F+ D +Q+D+V+GD QT
Sbjct: 143 STVSCDQGFCAATYGGKLPGCTANVPCEYSVMYGDGSSTTGFFITDALQFDQVTGDGQTQ 202
Query: 192 STNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI 251
N ++ FGCGA+Q G+L ++N +ALDGI+GFG++N+SM+SQLA++G +K+FAHCLD I
Sbjct: 203 PGNATITFGCGAQQGGDLGNSN-QALDGILGFGQANTSMLSQLAAAGKAKKIFAHCLDTI 261
Query: 252 NGGGIFAIGHVVQPE----------VNKTPL------VPNQPHYSINMTAVQVGLDFLNL 295
GGGIFAIG+VVQP+ + PL + ++PHY++N+ ++ VG L L
Sbjct: 262 KGGGIFAIGNVVQPKCYFVFFFAHGLLNIPLFLLVMILLSRPHYNVNLKSIDVGGTTLQL 321
Query: 296 PTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSES 355
P VF G+ KGTIIDSGTTL YLPE+V++ ++ + S+ D+ H + D + CFQYS S
Sbjct: 322 PAHVFETGEKKGTIIDSGTTLTYLPELVFKQVMDVVFSKHRDIAFHNLQD-FLCFQYSGS 380
Query: 356 VDEGFPNVTFHFENSVSLKVYPHEYLFPF-EDLWCIGWQNSGMQSRDRKNMTLLGDF 411
VD+GFP +TFHFE+ ++L VYPHEY FP D++C+G+QN +QS+D K++ L+GD
Sbjct: 381 VDDGFPTITFHFEDDLALHVYPHEYFFPNGNDIYCVGFQNGALQSKDGKDIVLMGDL 437
>gi|218194599|gb|EEC77026.1| hypothetical protein OsI_15382 [Oryza sativa Indica Group]
Length = 409
Score = 429 bits (1104), Expect = e-117, Method: Compositional matrix adjust.
Identities = 205/338 (60%), Positives = 258/338 (76%), Gaps = 3/338 (0%)
Query: 75 LYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFV 134
LYY +IGIGTP K YYVQVDTGSDI+WVNCI C CPR+S LG+ELTLYD KDSSTG V
Sbjct: 3 LYYTEIGIGTPTKRYYVQVDTGSDILWVNCISCDRCPRKSGLGLELTLYDPKDSSTGSKV 62
Query: 135 TCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTN 194
+CDQ FC YGG L CT + C Y YGDGSSTTGYFV D++Q+D+VSGD QT N
Sbjct: 63 SCDQGFCAATYGGLLPGCTTSLPCEYSVTYGDGSSTTGYFVSDLLQFDQVSGDGQTRPAN 122
Query: 195 GSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGG 254
++ FGCG++Q G+L S+N +ALDGIIGFG+SN+SM+SQL+++G V+K+FAHCLD INGG
Sbjct: 123 STVTFGCGSQQGGDLGSSN-QALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCLDTINGG 181
Query: 255 GIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGT 314
GIFAIG+VVQP+V TPLVPN PHY++N+ ++ VG L LP+ +F G+ KGTIIDSGT
Sbjct: 182 GIFAIGNVVQPKVKTTPLVPNMPHYNVNLKSIDVGGTALKLPSHMFDTGEKKGTIIDSGT 241
Query: 315 TLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLK 374
TL YLPE+VY+ ++ + ++ D+ H V E+ CFQY VD+ FP +TFHFEN + L
Sbjct: 242 TLTYLPEIVYKEIMLAVFAKHKDITFHNVQ-EFLCFQYVGRVDDDFPKITFHFENDLPLN 300
Query: 375 VYPHEYLFPFED-LWCIGWQNSGMQSRDRKNMTLLGDF 411
VYPH+Y F D L+C+G+QN G+QS+D K M LLGD
Sbjct: 301 VYPHDYFFENGDNLYCVGFQNGGLQSKDGKGMVLLGDL 338
>gi|223942467|gb|ACN25317.1| unknown [Zea mays]
gi|413936886|gb|AFW71437.1| pepsin A [Zea mays]
Length = 491
Score = 419 bits (1077), Expect = e-114, Method: Compositional matrix adjust.
Identities = 218/411 (53%), Positives = 288/411 (70%), Gaps = 14/411 (3%)
Query: 9 LCIVLIATAAVGGVSSNHGVFSVKY---RYAGR--ERSLSLLKEHDARRQQRILAGVDLP 63
L ++L A + G +S GVF V+ R+ GR L+ L+ HDA R R+L VDL
Sbjct: 14 LLVLLFALSV--GCASATGVFQVRRKFPRHGGRGVAEHLAALRRHDANRHGRLLGAVDLA 71
Query: 64 LGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLY 123
LGG P GLYY +I IG+PPK YYVQVDTGSDI+WVNCI+C CP RS LGIELT Y
Sbjct: 72 LGGVGLPTDTGLYYTRIEIGSPPKGYYVQVDTGSDILWVNCIRCDGCPTRSGLGIELTQY 131
Query: 124 DIKDSSTGKFVTCDQEFC--HGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQY 181
D + +G V C+QEFC + G P T + ++ C + YGDGS+TTG++V D VQY
Sbjct: 132 D--PAGSGTTVGCEQEFCVANSAGGVPPTCPSTSSPCQFRITYGDGSTTTGFYVTDFVQY 189
Query: 182 DKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVR 241
++VSG+ QTT++N S+ FGCGA+ G+L S+N+ ALDGI+GFG+S+SSM+SQLA++ VR
Sbjct: 190 NQVSGNGQTTTSNASITFGCGAQLGGDLGSSNQ-ALDGILGFGQSDSSMLSQLAAARRVR 248
Query: 242 KMFAHCLDGINGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFG 301
K+FAHCLD + GGGIFAIG+VVQP+V TPLVPN HY++N+ + VG L LPT F
Sbjct: 249 KIFAHCLDTVRGGGIFAIGNVVQPKVKTTPLVPNVTHYNVNLQGISVGGATLQLPTSTFD 308
Query: 302 VGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFP 361
GD+KGTIIDSGTTLAYLP VY L++ + + DL +H D + CFQ+S S+D+GFP
Sbjct: 309 SGDSKGTIIDSGTTLAYLPREVYRTLLAAVFDKYQDLPLHNYQD-FVCFQFSGSIDDGFP 367
Query: 362 NVTFHFENSVSLKVYPHEYLFP-FEDLWCIGWQNSGMQSRDRKNMTLLGDF 411
+TF FE ++L VYP +YLF DL+C+G+ + G+Q++D K+M LLGD
Sbjct: 368 VITFSFEGDLTLNVYPDDYLFQNRNDLYCMGFLDGGVQTKDGKDMLLLGDL 418
>gi|226492633|ref|NP_001149953.1| LOC100283580 precursor [Zea mays]
gi|195635701|gb|ACG37319.1| pepsin A [Zea mays]
Length = 491
Score = 417 bits (1073), Expect = e-114, Method: Compositional matrix adjust.
Identities = 217/411 (52%), Positives = 288/411 (70%), Gaps = 14/411 (3%)
Query: 9 LCIVLIATAAVGGVSSNHGVFSVKY---RYAGR--ERSLSLLKEHDARRQQRILAGVDLP 63
L ++L A + G +S GVF V+ R+ GR L+ L+ HDA R R+L VDL
Sbjct: 14 LLVLLFALSV--GCASATGVFQVRRKFPRHGGRGVAEHLAALRRHDANRHGRLLGAVDLA 71
Query: 64 LGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLY 123
LGG P GLYY +I IG+PPK YYVQVDTGSDI+WVNCI+C CP RS LGIELT Y
Sbjct: 72 LGGVGLPTDTGLYYTRIEIGSPPKGYYVQVDTGSDILWVNCIRCDGCPTRSGLGIELTQY 131
Query: 124 DIKDSSTGKFVTCDQEFC--HGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQY 181
D + +G V C+QEFC + G P T + ++ C + YGDGS+TTG++V D VQY
Sbjct: 132 D--PAGSGTTVGCEQEFCVANSAGGVPPTCPSTSSPCQFRITYGDGSTTTGFYVTDFVQY 189
Query: 182 DKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVR 241
++VSG+ QTT++N S+ FGCGA+ G+L S+N+ ALDGI+GFG+S+SSM+SQLA++ VR
Sbjct: 190 NQVSGNGQTTTSNASITFGCGAQLGGDLGSSNQ-ALDGILGFGQSDSSMLSQLAAARRVR 248
Query: 242 KMFAHCLDGINGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFG 301
K+FAHCLD + GGGIFAIG+VVQP+V TPLVPN HY++N+ + VG L LPT F
Sbjct: 249 KIFAHCLDTVRGGGIFAIGNVVQPKVKTTPLVPNVTHYNVNLQGISVGGATLQLPTSTFD 308
Query: 302 VGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFP 361
GD+KGTIIDSGTTLAYLP VY L++ + + DL +H D + CFQ+S S+D+GFP
Sbjct: 309 SGDSKGTIIDSGTTLAYLPREVYRTLLAAVFDKYQDLPLHNYQD-FVCFQFSGSIDDGFP 367
Query: 362 NVTFHFENSVSLKVYPHEYLFP-FEDLWCIGWQNSGMQSRDRKNMTLLGDF 411
+TF F+ ++L VYP +YLF DL+C+G+ + G+Q++D K+M LLGD
Sbjct: 368 VITFSFKGDLTLNVYPDDYLFQNRNDLYCMGFLDGGVQTKDGKDMLLLGDL 418
>gi|357133002|ref|XP_003568117.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 497
Score = 415 bits (1066), Expect = e-113, Method: Compositional matrix adjust.
Identities = 216/399 (54%), Positives = 283/399 (70%), Gaps = 10/399 (2%)
Query: 20 GGVSSNHGVFSVKYRYA--GRER--SLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGL 75
GGVS+ GVF V+ R+A G E +L+ HD R R+LA D+PLGG P G GL
Sbjct: 28 GGVSAA-GVFKVRRRFARPGGEGGGNLTAHLAHDGDRHGRLLAAADVPLGGLGLPTGTGL 86
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
YY KI IGTPPK ++VQVDTGSDI+WVNC+ C +CP +S LGI+L LYD K SS+G V+
Sbjct: 87 YYTKIEIGTPPKPFHVQVDTGSDILWVNCVSCDKCPTKSGLGIDLALYDPKGSSSGSAVS 146
Query: 136 CDQEFCHGVYGG--PLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTST 193
CD +FC YG L CTA C Y YGDGSST G FV D +QY+++SG+ QT
Sbjct: 147 CDNKFCAATYGSGEKLPGCTAGKPCEYRAEYGDGSSTAGSFVSDSLQYNQLSGNAQTRHA 206
Query: 194 NGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGING 253
++IFGCGA+Q G+L+STN+ ALDGIIGFG+SN+S +SQLAS+G V+K+F+HCLD I G
Sbjct: 207 KANVIFGCGAQQGGDLESTNQ-ALDGIIGFGQSNTSTLSQLASAGEVKKIFSHCLDTIKG 265
Query: 254 GGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSG 313
GGIFAIG VVQP+V TPL+PN HY++N+ ++ V + L LP +F + +GTIIDSG
Sbjct: 266 GGIFAIGEVVQPKVKSTPLLPNMSHYNVNLQSIDVAGNALQLPPHIFETSEKRGTIIDSG 325
Query: 314 TTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSL 373
TTL YLPE+VY+ +++ + + D+ T+ + CF+YSESVD+GFP +TFHFE+ + L
Sbjct: 326 TTLTYLPELVYKDILAAVFQKHQDITFRTIQG-FLCFEYSESVDDGFPKITFHFEDDLGL 384
Query: 374 KVYPHEYLFPFED-LWCIGWQNSGMQSRDRKNMTLLGDF 411
VYPH+Y F D L+C+G+QN G Q +D K+M LLGD
Sbjct: 385 NVYPHDYFFQNGDNLYCLGFQNGGFQPKDAKDMVLLGDL 423
>gi|297805182|ref|XP_002870475.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316311|gb|EFH46734.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 480
Score = 412 bits (1058), Expect = e-112, Method: Compositional matrix adjust.
Identities = 196/389 (50%), Positives = 272/389 (69%), Gaps = 5/389 (1%)
Query: 24 SNHGVFSVKYRYAGRERSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIG 83
S + VF+V +++AG+E+ LS LK HD+ R R+LA +DLPLGG SR D +GLY+ KI +G
Sbjct: 25 SGNFVFNVTHKFAGKEKQLSELKSHDSFRHARMLANIDLPLGGDSRADSIGLYFTKIKLG 84
Query: 84 TPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHG 143
+PPK+YYVQVDTGSDI+WVNC C +CP ++ LGI L+LYD K SST K V C+ FC
Sbjct: 85 SPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLSLYDSKASSTSKNVGCEDAFCSF 144
Query: 144 VYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGA 203
+ C A C Y +YGDGS++ G FV+D + D+V+G+L+T ++FGCG
Sbjct: 145 IMQSET--CGAKKPCSYHVVYGDGSTSDGDFVKDNITLDQVTGNLRTAPLAQEVVFGCGK 202
Query: 204 RQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHVV 263
QSG L T E A+DGI+GFG+SN+S+ISQLA+ G V+++F+HCLD +NGGGIFAIG V
Sbjct: 203 NQSGQLGQT-ESAVDGIMGFGQSNTSVISQLAAGGSVKRIFSHCLDNMNGGGIFAIGEVE 261
Query: 264 QPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMV 323
P V TPLVPNQ HY++ + + V + ++LP + + GTIIDSGTTLAYLP+ +
Sbjct: 262 SPVVKTTPLVPNQVHYNVILKGMDVDGEPIDLPPSLASTNGDGGTIIDSGTTLAYLPQNL 321
Query: 324 YEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFP 383
Y L+ KI ++Q +K+H V + + CF ++ + D+ FP V HFE+S+ L VYPH+YLF
Sbjct: 322 YNSLIEKITAKQ-QVKLHMVQETFACFSFTSNTDKAFPVVNLHFEDSLKLSVYPHDYLFS 380
Query: 384 F-EDLWCIGWQNSGMQSRDRKNMTLLGDF 411
ED++C GWQ+ GM ++D ++ LLGD
Sbjct: 381 LREDMYCFGWQSGGMTTQDGADVILLGDL 409
>gi|297840891|ref|XP_002888327.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297334168|gb|EFH64586.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 473
Score = 409 bits (1050), Expect = e-111, Method: Compositional matrix adjust.
Identities = 193/408 (47%), Positives = 278/408 (68%), Gaps = 8/408 (1%)
Query: 5 LRNCLCIVLIATAAVGGVSSNHGVFSVKYRYAGRERSLSLLKEHDARRQQRILAGVDLPL 64
LR LCIV+ V +S + VF V++++AG+E+ L K HD RR R+LA +DLPL
Sbjct: 3 LRRKLCIVVAVFVIVNEFASGNFVFKVQHKFAGKEKKLEHFKSHDTRRHSRMLASIDLPL 62
Query: 65 GGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYD 124
GG SR D VGLY+ KI +G+PPK+Y+VQVDTGSDI+WVNC C ECP +++L L+L+D
Sbjct: 63 GGDSRVDSVGLYFTKIKLGSPPKEYHVQVDTGSDILWVNCKPCPECPSKTNLNFHLSLFD 122
Query: 125 IKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKV 184
+ SST K V CD +FC + C C Y +Y D S++ G F++D + ++V
Sbjct: 123 VNASSTSKKVGCDDDFCSFISQS--DSCQPAVGCSYHIVYADESTSEGNFIRDKLTLEQV 180
Query: 185 SGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMF 244
+GDLQT ++FGCG+ QSG L ++ A+DG++GFG+SN+S++SQLA++G +++F
Sbjct: 181 TGDLQTGPLGQEVVFGCGSDQSGQL-GKSDSAVDGVMGFGQSNTSVLSQLAATGDAKRVF 239
Query: 245 AHCLDGINGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGD 304
+HCLD + GGGIFA+G V P+V TP+VPNQ HY++ + + V L+LP +
Sbjct: 240 SHCLDNVKGGGIFAVGVVDSPKVKTTPMVPNQMHYNVMLMGMDVDGTALDLPPSIM---R 296
Query: 305 NKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVT 364
N GTI+DSGTTLAY P+++Y+ L+ I+++QP +K+H V D + CF +SE+VD FP V+
Sbjct: 297 NGGTIVDSGTTLAYFPKVLYDSLIETILARQP-VKLHIVEDTFQCFSFSENVDVAFPPVS 355
Query: 365 FHFENSVSLKVYPHEYLFPFE-DLWCIGWQNSGMQSRDRKNMTLLGDF 411
F FE+SV L VYPH+YLF E +L+C GWQ G+ + +R + LLGD
Sbjct: 356 FEFEDSVKLTVYPHDYLFTLEKELYCFGWQAGGLTTGERTEVILLGDL 403
>gi|30692930|ref|NP_198475.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|66792626|gb|AAY56415.1| At5g36260 [Arabidopsis thaliana]
gi|332006680|gb|AED94063.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 482
Score = 408 bits (1048), Expect = e-111, Method: Compositional matrix adjust.
Identities = 191/389 (49%), Positives = 272/389 (69%), Gaps = 5/389 (1%)
Query: 24 SNHGVFSVKYRYAGRERSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIG 83
S + VF+V +++AG+E+ LS LK HD+ R R+LA +DLPLGG SR D +GLY+ KI +G
Sbjct: 26 SGNFVFNVTHKFAGKEKQLSELKSHDSFRHARMLANIDLPLGGDSRADSIGLYFTKIKLG 85
Query: 84 TPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHG 143
+PPK+YYVQVDTGSDI+WVNC C +CP ++ LGI L+LYD K SST K V C+ +FC
Sbjct: 86 SPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLSLYDSKTSSTSKNVGCEDDFCSF 145
Query: 144 VYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGA 203
+ C A C Y +YGDGS++ G F++D + ++V+G+L+T ++FGCG
Sbjct: 146 IMQSET--CGAKKPCSYHVVYGDGSTSDGDFIKDNITLEQVTGNLRTAPLAQEVVFGCGK 203
Query: 204 RQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHVV 263
QSG L T + A+DGI+GFG+SN+S+ISQLA+ G +++F+HCLD +NGGGIFA+G V
Sbjct: 204 NQSGQLGQT-DSAVDGIMGFGQSNTSIISQLAAGGSTKRIFSHCLDNMNGGGIFAVGEVE 262
Query: 264 QPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMV 323
P V TP+VPNQ HY++ + + V D ++LP + + GTIIDSGTTLAYLP+ +
Sbjct: 263 SPVVKTTPIVPNQVHYNVILKGMDVDGDPIDLPPSLASTNGDGGTIIDSGTTLAYLPQNL 322
Query: 324 YEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFP 383
Y L+ KI ++Q +K+H V + + CF ++ + D+ FP V HFE+S+ L VYPH+YLF
Sbjct: 323 YNSLIEKITAKQ-QVKLHMVQETFACFSFTSNTDKAFPVVNLHFEDSLKLSVYPHDYLFS 381
Query: 384 F-EDLWCIGWQNSGMQSRDRKNMTLLGDF 411
ED++C GWQ+ GM ++D ++ LLGD
Sbjct: 382 LREDMYCFGWQSGGMTTQDGADVILLGDL 410
>gi|9759039|dbj|BAB09366.1| aspartyl protease-like [Arabidopsis thaliana]
Length = 478
Score = 407 bits (1047), Expect = e-111, Method: Compositional matrix adjust.
Identities = 191/389 (49%), Positives = 272/389 (69%), Gaps = 5/389 (1%)
Query: 24 SNHGVFSVKYRYAGRERSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIG 83
S + VF+V +++AG+E+ LS LK HD+ R R+LA +DLPLGG SR D +GLY+ KI +G
Sbjct: 22 SGNFVFNVTHKFAGKEKQLSELKSHDSFRHARMLANIDLPLGGDSRADSIGLYFTKIKLG 81
Query: 84 TPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHG 143
+PPK+YYVQVDTGSDI+WVNC C +CP ++ LGI L+LYD K SST K V C+ +FC
Sbjct: 82 SPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLSLYDSKTSSTSKNVGCEDDFCSF 141
Query: 144 VYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGA 203
+ C A C Y +YGDGS++ G F++D + ++V+G+L+T ++FGCG
Sbjct: 142 IMQSET--CGAKKPCSYHVVYGDGSTSDGDFIKDNITLEQVTGNLRTAPLAQEVVFGCGK 199
Query: 204 RQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHVV 263
QSG L T + A+DGI+GFG+SN+S+ISQLA+ G +++F+HCLD +NGGGIFA+G V
Sbjct: 200 NQSGQLGQT-DSAVDGIMGFGQSNTSIISQLAAGGSTKRIFSHCLDNMNGGGIFAVGEVE 258
Query: 264 QPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMV 323
P V TP+VPNQ HY++ + + V D ++LP + + GTIIDSGTTLAYLP+ +
Sbjct: 259 SPVVKTTPIVPNQVHYNVILKGMDVDGDPIDLPPSLASTNGDGGTIIDSGTTLAYLPQNL 318
Query: 324 YEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFP 383
Y L+ KI ++Q +K+H V + + CF ++ + D+ FP V HFE+S+ L VYPH+YLF
Sbjct: 319 YNSLIEKITAKQ-QVKLHMVQETFACFSFTSNTDKAFPVVNLHFEDSLKLSVYPHDYLFS 377
Query: 384 F-EDLWCIGWQNSGMQSRDRKNMTLLGDF 411
ED++C GWQ+ GM ++D ++ LLGD
Sbjct: 378 LREDMYCFGWQSGGMTTQDGADVILLGDL 406
>gi|242065058|ref|XP_002453818.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
gi|241933649|gb|EES06794.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
Length = 490
Score = 407 bits (1046), Expect = e-111, Method: Compositional matrix adjust.
Identities = 208/394 (52%), Positives = 275/394 (69%), Gaps = 13/394 (3%)
Query: 27 GVFSVKYRYAGR------ERSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKI 80
GVF V+ ++ L+ L+ HD R R+L VDLPLGG P GLYY +I
Sbjct: 30 GVFQVRRKFPRHGGGGDVAEHLAALRRHDVGRHGRLLGAVDLPLGGVGLPTATGLYYTQI 89
Query: 81 GIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEF 140
IG+P K YYVQVDTGSDI+WVNCI+C CP S LGIELT YD + +G V CDQEF
Sbjct: 90 EIGSPSKGYYVQVDTGSDILWVNCIRCDGCPTTSGLGIELTQYD--PAGSGTTVGCDQEF 147
Query: 141 C--HGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLI 198
C + G P + ++ C + YGDGSSTTG++V D VQY++VSG+ QTT +N S+
Sbjct: 148 CVANSPNGLPPACPSTSSPCQFRIAYGDGSSTTGFYVSDSVQYNQVSGNGQTTPSNASIT 207
Query: 199 FGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFA 258
FGCGA+ G+L S+++ ALDGI+GFG+++SSM+SQLA++ VRK+FAHCLD ++GGGIFA
Sbjct: 208 FGCGAQLGGDLGSSSQ-ALDGILGFGQADSSMLSQLAAARKVRKIFAHCLDTVHGGGIFA 266
Query: 259 IGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAY 318
IG+VVQP+V TPLV N HY++N+ + VG L LP+ F GD+KGTIIDSGTTLAY
Sbjct: 267 IGNVVQPKVKTTPLVQNVTHYNVNLQGISVGGATLQLPSSTFDSGDSKGTIIDSGTTLAY 326
Query: 319 LPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPH 378
LP VY L++ + + DL +H D + CFQ+S S+D+GFP VTF FE ++L VYPH
Sbjct: 327 LPREVYRTLLTAVFDKYQDLALHNYQD-FVCFQFSGSIDDGFPVVTFSFEGEITLNVYPH 385
Query: 379 EYLFPFE-DLWCIGWQNSGMQSRDRKNMTLLGDF 411
+YLF E DL+C+G+ + G+Q++D K+M LLGD
Sbjct: 386 DYLFQNENDLYCMGFLDGGVQTKDGKDMVLLGDL 419
>gi|224144963|ref|XP_002325476.1| predicted protein [Populus trichocarpa]
gi|222862351|gb|EEE99857.1| predicted protein [Populus trichocarpa]
Length = 372
Score = 407 bits (1045), Expect = e-111, Method: Compositional matrix adjust.
Identities = 194/340 (57%), Positives = 249/340 (73%), Gaps = 22/340 (6%)
Query: 73 VGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGK 132
+ LY+AKIG+G P KDYYVQVDTGSDI+WVNCI C +CP +S LGI+LTLYD S +
Sbjct: 24 LSLYFAKIGLGNPSKDYYVQVDTGSDILWVNCIGCDKCPTKSDLGIKLTLYDPASSVSAT 83
Query: 133 FVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTS 192
V+CD +FC Y G L DC C Y +YGDGSST GYFV D VQ+++V+G+LQT
Sbjct: 84 RVSCDDDFCTSTYNGLLPDCKKELPCQYNVVYGDGSSTAGYFVSDAVQFERVTGNLQTGL 143
Query: 193 TNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGIN 252
+NG++ FGCGA+QSG L T+ EALDGI+G FAHCLD +N
Sbjct: 144 SNGTVTFGCGAQQSGGL-GTSGEALDGILG--------------------AFAHCLDNVN 182
Query: 253 GGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDS 312
GGGIFAIG +V P+VN TP+VPNQ HY++ M ++VG L LPTDVF GD +GTIIDS
Sbjct: 183 GGGIFAIGELVSPKVNTTPMVPNQAHYNVYMKEIEVGGTVLELPTDVFDSGDRRGTIIDS 242
Query: 313 GTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVS 372
GTTLAYLPE+VY+ ++++I SQQP L +HTV +++ CF+YS +VD+GFP++ FHF++S++
Sbjct: 243 GTTLAYLPEVVYDSMMNEIRSQQPGLSLHTVEEQFICFKYSGNVDDGFPDIKFHFKDSLT 302
Query: 373 LKVYPHEYLFPF-EDLWCIGWQNSGMQSRDRKNMTLLGDF 411
L VYPH+YLF ED+WC GWQN GMQS+D ++MTLLGD
Sbjct: 303 LTVYPHDYLFQISEDIWCFGWQNGGMQSKDGRDMTLLGDL 342
>gi|226532674|ref|NP_001151415.1| pepsin A precursor [Zea mays]
gi|195646632|gb|ACG42784.1| pepsin A [Zea mays]
Length = 492
Score = 404 bits (1038), Expect = e-110, Method: Compositional matrix adjust.
Identities = 209/396 (52%), Positives = 274/396 (69%), Gaps = 15/396 (3%)
Query: 27 GVFSVKYRYAGR------ERSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKI 80
G+F V+ ++ E L+ L HD R R+L VDLPLGG P GLYY +I
Sbjct: 30 GLFQVRRKFPRHGGGDVVEHRLAALLRHDMGRNGRLLGAVDLPLGGVGLPTATGLYYTRI 89
Query: 81 GIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEF 140
IG+PPK YYVQVDTGSDI+WVN I C CP RS LGIELT YD + +G V C+QEF
Sbjct: 90 EIGSPPKGYYVQVDTGSDILWVNGISCDGCPTRSGLGIELTQYD--PAGSGTTVGCEQEF 147
Query: 141 C---HGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSL 197
C G P +A + C + YGDGSSTTG++V D VQY++VSG+ QTT +N S+
Sbjct: 148 CVANSAASGVPPACPSAASPCQFRITYGDGSSTTGFYVTDFVQYNQVSGNGQTTPSNVSI 207
Query: 198 IFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIF 257
FGCGA+ G+L S++ +ALDGI+GFG+S++SM+SQLA++ VRK+FAHCLD + GGGIF
Sbjct: 208 TFGCGAQLGGDLGSSS-QALDGILGFGQSDASMLSQLAAARKVRKIFAHCLDTVRGGGIF 266
Query: 258 AIGHVVQPEVNK-TPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTL 316
AIG+VVQP + K TPLVPN HY++N+ + VG L LPT F GD+KGTIIDSGTTL
Sbjct: 267 AIGNVVQPPIVKTTPLVPNATHYNVNLQGISVGGATLQLPTSTFDSGDSKGTIIDSGTTL 326
Query: 317 AYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVY 376
AYLP VY L++ + + PDL V ++++ CFQ+S S+DE FP +TF FE ++L VY
Sbjct: 327 AYLPREVYRTLLTAVFDKHPDLAVRN-YEDFICFQFSGSLDEEFPVITFSFEGDLTLNVY 385
Query: 377 PHEYLFPF-EDLWCIGWQNSGMQSRDRKNMTLLGDF 411
PH+YLF DL+C+G+ + G+Q++D K+M LLGD
Sbjct: 386 PHDYLFQNGNDLYCMGFLDGGVQTKDGKDMVLLGDL 421
>gi|15217887|ref|NP_176703.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
gi|118572746|sp|Q9S9K4.2|ASPL2_ARATH RecName: Full=Aspartic proteinase-like protein 2; Flags: Precursor
gi|332196226|gb|AEE34347.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
Length = 475
Score = 394 bits (1013), Expect = e-107, Method: Compositional matrix adjust.
Identities = 187/408 (45%), Positives = 278/408 (68%), Gaps = 8/408 (1%)
Query: 5 LRNCLCIVLIATAAVGGVSSNHGVFSVKYRYAGRERSLSLLKEHDARRQQRILAGVDLPL 64
LR LCIV+ V +S + VF ++++AG++++L K HD RR R+LA +DLPL
Sbjct: 3 LRRKLCIVVAVFVIVIEFASANFVFKAQHKFAGKKKNLEHFKSHDTRRHSRMLASIDLPL 62
Query: 65 GGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYD 124
GG SR D VGLY+ KI +G+PPK+Y+VQVDTGSDI+W+NC C +CP +++L L+L+D
Sbjct: 63 GGDSRVDSVGLYFTKIKLGSPPKEYHVQVDTGSDILWINCKPCPKCPTKTNLNFRLSLFD 122
Query: 125 IKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKV 184
+ SST K V CD +FC + C C Y +Y D S++ G F++D++ ++V
Sbjct: 123 MNASSTSKKVGCDDDFCSFISQS--DSCQPALGCSYHIVYADESTSDGKFIRDMLTLEQV 180
Query: 185 SGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMF 244
+GDL+T ++FGCG+ QSG L + A+DG++GFG+SN+S++SQLA++G +++F
Sbjct: 181 TGDLKTGPLGQEVVFGCGSDQSGQL-GNGDSAVDGVMGFGQSNTSVLSQLAATGDAKRVF 239
Query: 245 AHCLDGINGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGD 304
+HCLD + GGGIFA+G V P+V TP+VPNQ HY++ + + V L+LP +
Sbjct: 240 SHCLDNVKGGGIFAVGVVDSPKVKTTPMVPNQMHYNVMLMGMDVDGTSLDLPRSIV---R 296
Query: 305 NKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVT 364
N GTI+DSGTTLAY P+++Y+ L+ I+++QP +K+H V + + CF +S +VDE FP V+
Sbjct: 297 NGGTIVDSGTTLAYFPKVLYDSLIETILARQP-VKLHIVEETFQCFSFSTNVDEAFPPVS 355
Query: 365 FHFENSVSLKVYPHEYLFPF-EDLWCIGWQNSGMQSRDRKNMTLLGDF 411
F FE+SV L VYPH+YLF E+L+C GWQ G+ + +R + LLGD
Sbjct: 356 FEFEDSVKLTVYPHDYLFTLEEELYCFGWQAGGLTTDERSEVILLGDL 403
>gi|4646203|gb|AAD26876.1|AC007230_10 Belongs to PF|00026 Eukaryotic aspartyl protease family
[Arabidopsis thaliana]
Length = 449
Score = 392 bits (1008), Expect = e-106, Method: Compositional matrix adjust.
Identities = 187/408 (45%), Positives = 278/408 (68%), Gaps = 8/408 (1%)
Query: 5 LRNCLCIVLIATAAVGGVSSNHGVFSVKYRYAGRERSLSLLKEHDARRQQRILAGVDLPL 64
LR LCIV+ V +S + VF ++++AG++++L K HD RR R+LA +DLPL
Sbjct: 3 LRRKLCIVVAVFVIVIEFASANFVFKAQHKFAGKKKNLEHFKSHDTRRHSRMLASIDLPL 62
Query: 65 GGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYD 124
GG SR D VGLY+ KI +G+PPK+Y+VQVDTGSDI+W+NC C +CP +++L L+L+D
Sbjct: 63 GGDSRVDSVGLYFTKIKLGSPPKEYHVQVDTGSDILWINCKPCPKCPTKTNLNFRLSLFD 122
Query: 125 IKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKV 184
+ SST K V CD +FC + C C Y +Y D S++ G F++D++ ++V
Sbjct: 123 MNASSTSKKVGCDDDFCSFISQS--DSCQPALGCSYHIVYADESTSDGKFIRDMLTLEQV 180
Query: 185 SGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMF 244
+GDL+T ++FGCG+ QSG L + A+DG++GFG+SN+S++SQLA++G +++F
Sbjct: 181 TGDLKTGPLGQEVVFGCGSDQSGQL-GNGDSAVDGVMGFGQSNTSVLSQLAATGDAKRVF 239
Query: 245 AHCLDGINGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGD 304
+HCLD + GGGIFA+G V P+V TP+VPNQ HY++ + + V L+LP +
Sbjct: 240 SHCLDNVKGGGIFAVGVVDSPKVKTTPMVPNQMHYNVMLMGMDVDGTSLDLPRSIV---R 296
Query: 305 NKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVT 364
N GTI+DSGTTLAY P+++Y+ L+ I+++QP +K+H V + + CF +S +VDE FP V+
Sbjct: 297 NGGTIVDSGTTLAYFPKVLYDSLIETILARQP-VKLHIVEETFQCFSFSTNVDEAFPPVS 355
Query: 365 FHFENSVSLKVYPHEYLFPF-EDLWCIGWQNSGMQSRDRKNMTLLGDF 411
F FE+SV L VYPH+YLF E+L+C GWQ G+ + +R + LLGD
Sbjct: 356 FEFEDSVKLTVYPHDYLFTLEEELYCFGWQAGGLTTDERSEVILLGDL 403
>gi|115457778|ref|NP_001052489.1| Os04g0337000 [Oryza sativa Japonica Group]
gi|113564060|dbj|BAF14403.1| Os04g0337000, partial [Oryza sativa Japonica Group]
Length = 321
Score = 362 bits (929), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 172/278 (61%), Positives = 216/278 (77%), Gaps = 2/278 (0%)
Query: 75 LYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFV 134
LYY +IGIGTP K YYVQVDTGSDI+WVNCI C CPR+S LG+ELTLYD KDSSTG V
Sbjct: 32 LYYTEIGIGTPTKRYYVQVDTGSDILWVNCISCDRCPRKSGLGLELTLYDPKDSSTGSKV 91
Query: 135 TCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTN 194
+CDQ FC YGG L CT + C Y YGDGSSTTGYFV D++Q+D+VSGD QT N
Sbjct: 92 SCDQGFCAATYGGLLPGCTTSLPCEYSVTYGDGSSTTGYFVSDLLQFDQVSGDGQTRPAN 151
Query: 195 GSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGG 254
++ FGCG++Q G+L S+N +ALDGIIGFG+SN+SM+SQL+++G V+K+FAHCLD INGG
Sbjct: 152 STVTFGCGSQQGGDLGSSN-QALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCLDTINGG 210
Query: 255 GIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGT 314
GIFAIG+VVQP+V TPLVPN PHY++N+ ++ VG L LP+ +F G+ KGTIIDSGT
Sbjct: 211 GIFAIGNVVQPKVKTTPLVPNMPHYNVNLKSIDVGGTALKLPSHMFDTGEKKGTIIDSGT 270
Query: 315 TLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQY 352
TL YLPE+VY+ ++ + ++ D+ H V E+ CFQY
Sbjct: 271 TLTYLPEIVYKEIMLAVFAKHKDITFHNVQ-EFLCFQY 307
>gi|357507805|ref|XP_003624191.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355499206|gb|AES80409.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 406
Score = 339 bits (870), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 159/306 (51%), Positives = 222/306 (72%), Gaps = 2/306 (0%)
Query: 107 CKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGD 166
C CP++S LG++LTLYD S T V C FC Y GP++ C + SCPY YGD
Sbjct: 33 CTACPKKSGLGMDLTLYDPNGSKTSNAVPCGDGFCTDTYSGPISGCKQDMSCPYSITYGD 92
Query: 167 GSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKS 226
GS+T+G FV D + +D+VSG+L T N S+IFGCGA+QSG+L S ++EALDGIIGFG++
Sbjct: 93 GSTTSGSFVNDSLTFDEVSGNLHTKPDNSSVIFGCGAKQSGSLSSNSDEALDGIIGFGQA 152
Query: 227 NSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAV 286
NSS++SQLA+SG V+++F+HCLD +GGGIF+IG V++P+ N TPLVP HY++ + +
Sbjct: 153 NSSVLSQLAASGKVKRIFSHCLDSHHGGGIFSIGQVMEPKFNTTPLVPRMAHYNVILKDM 212
Query: 287 QVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDE 346
V + + LP +F G +GTIIDSGTTLAYLP +Y L+ K++ +QP LK+ V D+
Sbjct: 213 DVDGEPILLPLYLFDSGSGRGTIIDSGTTLAYLPLSIYNQLLPKVLGRQPGLKLMIVEDQ 272
Query: 347 YTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPF-EDLWCIGWQNSGMQSRDRKNM 405
+TCF YS+ +DEGFP V FHFE +SL V+PH+YLF + ED++CIGWQ S Q+++ +++
Sbjct: 273 FTCFHYSDKLDEGFPVVKFHFEG-LSLTVHPHDYLFLYKEDIYCIGWQKSSTQTKEGRDL 331
Query: 406 TLLGDF 411
L+GD
Sbjct: 332 ILIGDL 337
>gi|215766660|dbj|BAG98888.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 433
Score = 337 bits (865), Expect = 4e-90, Method: Compositional matrix adjust.
Identities = 174/411 (42%), Positives = 257/411 (62%), Gaps = 19/411 (4%)
Query: 9 LCIVLIATAAVGGVSSNHGVFSVKYRY-----AGRERSLSLLKEHDA-RRQQRILAGVDL 62
L +V++A++ G +++ GVF V+ ++ + + L+ HD R ++R L +L
Sbjct: 12 LALVVVASSTHGTMAN--GVFQVRRKFHIVDGVYKGSDIGALQTHDENRHRRRNLMAAEL 69
Query: 63 PLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTL 122
PLGG + P G GLYY IGIGTP YYVQ+DTGS WVN I CK+CP S + +LT
Sbjct: 70 PLGGFNIPYGTGLYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTF 129
Query: 123 YDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYD 182
YD + S + K V CD C C CPY+ Y DG T G D++ Y
Sbjct: 130 YDPRSSVSSKEVKCDDTICTS-----RPPCNMTLRCPYITGYADGGLTMGILFTDLLHYH 184
Query: 183 KVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRK 242
++ G+ QT T+ S+ FGCG +QSG+L+++ A+DGIIGFG SN + +SQLA++G +K
Sbjct: 185 QLYGNGQTQPTSTSVTFGCGLQQSGSLNNS-AVAIDGIIGFGNSNQTALSQLAAAGKTKK 243
Query: 243 MFAHCLDGINGGGIFAIGHVVQPEVNKTPLVPNQPHYS-INMTAVQVGLDFLNLPTDVFG 301
+F+HCLD NGGGIFAIG VV+P+V TP+V N Y +N+ ++ V L LP ++FG
Sbjct: 244 IFSHCLDSTNGGGIFAIGEVVEPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFG 303
Query: 302 VGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFP 361
KGT IDSG+TL YLPE++Y L+ + ++ PD+ + +++ + CF + SVD+ FP
Sbjct: 304 TTKTKGTFIDSGSTLVYLPEIIYSELILAVFAKHPDITMGAMYN-FQCFHFLGSVDDKFP 362
Query: 362 NVTFHFENSVSLKVYPHEYLFPFE-DLWCIGWQNSGMQSRDRKNMTLLGDF 411
+TFHFEN ++L VYP++YL +E + +C G+Q++G+ K+M +LGD
Sbjct: 363 KITFHFENDLTLDVYPYDYLLEYEGNQYCFGFQDAGIHG--YKDMIILGDM 411
>gi|115457772|ref|NP_001052486.1| Os04g0334700 [Oryza sativa Japonica Group]
gi|113564057|dbj|BAF14400.1| Os04g0334700 [Oryza sativa Japonica Group]
Length = 482
Score = 337 bits (865), Expect = 5e-90, Method: Compositional matrix adjust.
Identities = 174/411 (42%), Positives = 257/411 (62%), Gaps = 19/411 (4%)
Query: 9 LCIVLIATAAVGGVSSNHGVFSVKYRY-----AGRERSLSLLKEHDA-RRQQRILAGVDL 62
L +V++A++ G +++ GVF V+ ++ + + L+ HD R ++R L +L
Sbjct: 12 LALVVVASSTHGTMAN--GVFQVRRKFHIVDGVYKGSDIGALQTHDENRHRRRNLMAAEL 69
Query: 63 PLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTL 122
PLGG + P G GLYY IGIGTP YYVQ+DTGS WVN I CK+CP S + +LT
Sbjct: 70 PLGGFNIPYGTGLYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTF 129
Query: 123 YDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYD 182
YD + S + K V CD C C CPY+ Y DG T G D++ Y
Sbjct: 130 YDPRSSVSSKEVKCDDTIC-----TSRPPCNMTLRCPYITGYADGGLTMGILFTDLLHYH 184
Query: 183 KVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRK 242
++ G+ QT T+ S+ FGCG +QSG+L+++ A+DGIIGFG SN + +SQLA++G +K
Sbjct: 185 QLYGNGQTQPTSTSVTFGCGLQQSGSLNNS-AVAIDGIIGFGNSNQTALSQLAAAGKTKK 243
Query: 243 MFAHCLDGINGGGIFAIGHVVQPEVNKTPLVPNQPHYS-INMTAVQVGLDFLNLPTDVFG 301
+F+HCLD NGGGIFAIG VV+P+V TP+V N Y +N+ ++ V L LP ++FG
Sbjct: 244 IFSHCLDSTNGGGIFAIGEVVEPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFG 303
Query: 302 VGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFP 361
KGT IDSG+TL YLPE++Y L+ + ++ PD+ + +++ + CF + SVD+ FP
Sbjct: 304 TTKTKGTFIDSGSTLVYLPEIIYSELILAVFAKHPDITMGAMYN-FQCFHFLGSVDDKFP 362
Query: 362 NVTFHFENSVSLKVYPHEYLFPFE-DLWCIGWQNSGMQSRDRKNMTLLGDF 411
+TFHFEN ++L VYP++YL +E + +C G+Q++G+ K+M +LGD
Sbjct: 363 KITFHFENDLTLDVYPYDYLLEYEGNQYCFGFQDAGIHG--YKDMIILGDM 411
>gi|218194598|gb|EEC77025.1| hypothetical protein OsI_15381 [Oryza sativa Indica Group]
Length = 422
Score = 333 bits (854), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 170/393 (43%), Positives = 246/393 (62%), Gaps = 17/393 (4%)
Query: 26 HGVFSVKYRY-----AGRERSLSLLKEHDA-RRQQRILAGVDLPLGGSSRPDGVGLYYAK 79
+GVF V+ ++ + + L+ HD R ++R L +LPLGG + P G GLYY
Sbjct: 3 NGVFQVRRKFHIVDGVYKGSDIGALQTHDENRHRRRNLMAAELPLGGFNIPYGTGLYYTD 62
Query: 80 IGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQE 139
IGIGTP YYVQ+DTGS WVN I CK+CP S + +LT YD + S + K V CD
Sbjct: 63 IGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTFYDPRSSVSSKEVKCDDT 122
Query: 140 FCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIF 199
C C CPY+ Y DG T G D++ Y ++ G+ QT T+ S+ F
Sbjct: 123 IC-----TSRPPCNMTLRCPYITGYADGGLTMGILFTDLLHYHQLYGNGQTQPTSTSVTF 177
Query: 200 GCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAI 259
GCG +QSG+L+++ A+DGIIGFG SN + +SQLA++G +K+F+HCLD NGGGIFAI
Sbjct: 178 GCGLQQSGSLNNS-AVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTNGGGIFAI 236
Query: 260 GHVVQPEVNKTPLVPNQPHYS-INMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAY 318
G VV+P+V TP+V N Y +N+ ++ V L LP ++FG KGT IDSG+TL Y
Sbjct: 237 GEVVEPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKTKGTFIDSGSTLVY 296
Query: 319 LPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPH 378
LPE++Y L+ + ++ PD+ + +++ + CF + SVD+ FP +TFHFEN ++L VYP+
Sbjct: 297 LPEIIYSELILAVFAKHPDITMGAMYN-FQCFHFLGSVDDKFPKITFHFENDLTLDVYPY 355
Query: 379 EYLFPFE-DLWCIGWQNSGMQSRDRKNMTLLGD 410
+YL +E + +C G+Q++G+ K+M +LGD
Sbjct: 356 DYLLEYEGNQYCFGFQDAGIHG--YKDMIILGD 386
>gi|32479948|emb|CAE01594.1| OSJNBa0008A08.2 [Oryza sativa Japonica Group]
gi|38347627|emb|CAE05222.2| OSJNBa0011K22.4 [Oryza sativa Japonica Group]
gi|38567678|emb|CAE75961.1| B1159F04.24 [Oryza sativa Japonica Group]
gi|116309512|emb|CAH66578.1| OSIGBa0137O04.4 [Oryza sativa Indica Group]
Length = 431
Score = 332 bits (852), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 170/394 (43%), Positives = 246/394 (62%), Gaps = 17/394 (4%)
Query: 26 HGVFSVKYRY-----AGRERSLSLLKEHDA-RRQQRILAGVDLPLGGSSRPDGVGLYYAK 79
+GVF V+ ++ + + L+ HD R ++R L +LPLGG + P G GLYY
Sbjct: 3 NGVFQVRRKFHIVDGVYKGSDIGALQTHDENRHRRRNLMAAELPLGGFNIPYGTGLYYTD 62
Query: 80 IGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQE 139
IGIGTP YYVQ+DTGS WVN I CK+CP S + +LT YD + S + K V CD
Sbjct: 63 IGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTFYDPRSSVSSKEVKCDDT 122
Query: 140 FCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIF 199
C C CPY+ Y DG T G D++ Y ++ G+ QT T+ S+ F
Sbjct: 123 ICTS-----RPPCNMTLRCPYITGYADGGLTMGILFTDLLHYHQLYGNGQTQPTSTSVTF 177
Query: 200 GCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAI 259
GCG +QSG+L+++ A+DGIIGFG SN + +SQLA++G +K+F+HCLD NGGGIFAI
Sbjct: 178 GCGLQQSGSLNNS-AVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTNGGGIFAI 236
Query: 260 GHVVQPEVNKTPLVPNQPHYS-INMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAY 318
G VV+P+V TP+V N Y +N+ ++ V L LP ++FG KGT IDSG+TL Y
Sbjct: 237 GEVVEPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKTKGTFIDSGSTLVY 296
Query: 319 LPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPH 378
LPE++Y L+ + ++ PD+ + +++ + CF + SVD+ FP +TFHFEN ++L VYP+
Sbjct: 297 LPEIIYSELILAVFAKHPDITMGAMYN-FQCFHFLGSVDDKFPKITFHFENDLTLDVYPY 355
Query: 379 EYLFPFE-DLWCIGWQNSGMQSRDRKNMTLLGDF 411
+YL +E + +C G+Q++G+ K+M +LGD
Sbjct: 356 DYLLEYEGNQYCFGFQDAGIHG--YKDMIILGDM 387
>gi|297723027|ref|NP_001173877.1| Os04g0336942 [Oryza sativa Japonica Group]
gi|255675342|dbj|BAH92605.1| Os04g0336942 [Oryza sativa Japonica Group]
Length = 388
Score = 324 bits (830), Expect = 6e-86, Method: Compositional matrix adjust.
Identities = 165/384 (42%), Positives = 240/384 (62%), Gaps = 16/384 (4%)
Query: 9 LCIVLIATAAVGGVSSNHGVFSVKYRY-----AGRERSLSLLKEHDA-RRQQRILAGVDL 62
L +V++A++ G +++ GVF V+ ++ + + L+ HD R ++R L +L
Sbjct: 12 LALVVVASSTHGTMAN--GVFQVRRKFHIVDGVYKGSDIGALQTHDENRHRRRNLMAAEL 69
Query: 63 PLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTL 122
PLGG + P G GLYY IGIGTP YYVQ+DTGS WVN I CK+CP S + +LT
Sbjct: 70 PLGGFNIPYGTGLYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTF 129
Query: 123 YDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYD 182
YD + S + K V CD C C CPY+ Y DG T G D++ Y
Sbjct: 130 YDPRSSVSSKEVKCDDTICTS-----RPPCNMTLRCPYITGYADGGLTMGILFTDLLHYH 184
Query: 183 KVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRK 242
++ G+ QT T+ S+ FGCG +QSG+L+++ A+DGIIGFG SN + +SQLA++G +K
Sbjct: 185 QLYGNGQTQPTSTSVTFGCGLQQSGSLNNS-AVAIDGIIGFGNSNQTALSQLAAAGKTKK 243
Query: 243 MFAHCLDGINGGGIFAIGHVVQPEVNKTPLVPNQPHYS-INMTAVQVGLDFLNLPTDVFG 301
+F+HCLD NGGGIFAIG VV+P+V TP+V N Y +N+ ++ V L LP ++FG
Sbjct: 244 IFSHCLDSTNGGGIFAIGEVVEPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFG 303
Query: 302 VGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFP 361
KGT IDSG+TL YLPE++Y L+ + ++ PD+ + +++ + CF + SVD+ FP
Sbjct: 304 TTKTKGTFIDSGSTLVYLPEIIYSELILAVFAKHPDITMGAMYN-FQCFHFLGSVDDKFP 362
Query: 362 NVTFHFENSVSLKVYPHEYLFPFE 385
+TFHFEN ++L VYP++YL +E
Sbjct: 363 KITFHFENDLTLDVYPYDYLLEYE 386
>gi|47497551|dbj|BAD19623.1| nucellin-like aspartic protease-like [Oryza sativa Japonica Group]
gi|47847593|dbj|BAD21980.1| nucellin-like aspartic protease-like [Oryza sativa Japonica Group]
Length = 297
Score = 320 bits (820), Expect = 9e-85, Method: Compositional matrix adjust.
Identities = 157/253 (62%), Positives = 193/253 (76%), Gaps = 5/253 (1%)
Query: 27 GVFSVKYRYA----GRERSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGI 82
GVF V+ ++ G E LS L+EHD RR R+LA +DLPLGGS GLY+ +IGI
Sbjct: 37 GVFEVQRKFTRHGDGGEGHLSALREHDGRRHGRLLAAIDLPLGGSGLATETGLYFTRIGI 96
Query: 83 GTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCH 142
GTP K YYVQVDTGSDI+WVNC+ C CPR+S+LGIELT+YD + S +G+ VTCDQ+FC
Sbjct: 97 GTPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIELTMYDPRGSQSGELVTCDQQFCV 156
Query: 143 GVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCG 202
YGG L CT+ + C Y YGDGSST G+FV D +QY++VSGD QTT N S+ FGCG
Sbjct: 157 ANYGGVLPSCTSTSPCEYSISYGDGSSTAGFFVTDFLQYNQVSGDGQTTPANASVSFGCG 216
Query: 203 ARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHV 262
A+ G+L S+N ALDGI+GFG+SNSSM+SQLA++G VRKMFAHCLD +NGGGIFAIG+V
Sbjct: 217 AKLGGDLGSSN-LALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCLDTVNGGGIFAIGNV 275
Query: 263 VQPEVNKTPLVPN 275
VQP+V TPLVP+
Sbjct: 276 VQPKVKTTPLVPD 288
>gi|20466302|gb|AAM20468.1| putative aspartyl protease [Arabidopsis thaliana]
gi|23198124|gb|AAN15589.1| putative aspartyl protease [Arabidopsis thaliana]
Length = 320
Score = 302 bits (774), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 142/250 (56%), Positives = 187/250 (74%), Gaps = 3/250 (1%)
Query: 163 IYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIG 222
+YGDGSST GY V+DVV D V+G+ QT STNG++IFGCG++QSG L + + A+DGI+G
Sbjct: 1 MYGDGSSTNGYLVKDVVHLDLVTGNRQTGSTNGTIIFGCGSKQSGQLGES-QAAVDGIMG 59
Query: 223 FGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHVVQPEVNKTPLVPNQPHYSIN 282
FG+SNSS ISQLAS G V++ FAHCLD NGGGIFAIG VV P+V TP++ HYS+N
Sbjct: 60 FGQSNSSFISQLASQGKVKRSFAHCLDNNNGGGIFAIGEVVSPKVKTTPMLSKSAHYSVN 119
Query: 283 MTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHT 342
+ A++VG L L ++ F GD+KG IIDSGTTL YLP+ VY PL+++I++ P+L +HT
Sbjct: 120 LNAIEVGNSVLELSSNAFDSGDDKGVIIDSGTTLVYLPDAVYNPLLNEILASHPELTLHT 179
Query: 343 VHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPF-EDLWCIGWQNSGMQSRD 401
V + +TCF Y++ +D FP VTF F+ SVSL VYP EYLF ED WC GWQN G+Q++
Sbjct: 180 VQESFTCFHYTDKLDR-FPTVTFQFDKSVSLAVYPREYLFQVREDTWCFGWQNGGLQTKG 238
Query: 402 RKNMTLLGDF 411
++T+LGD
Sbjct: 239 GASLTILGDM 248
>gi|255556768|ref|XP_002519417.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223541280|gb|EEF42831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 494
Score = 294 bits (752), Expect = 7e-77, Method: Compositional matrix adjust.
Identities = 152/382 (39%), Positives = 225/382 (58%), Gaps = 19/382 (4%)
Query: 42 LSLLKEHDARRQQRILAG-----VDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTG 96
L+ L+ D R R+L G VD + GSS P VGLY+ ++ +GTPP+++ VQ+DTG
Sbjct: 42 LAQLRARDHLRHARLLQGFVGGVVDFSVQGSSDPYLVGLYFTRVKLGTPPREFNVQIDTG 101
Query: 97 SDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANT 156
SD++WV C C CP+ S LGI+L +D SST + V C C T C +
Sbjct: 102 SDVLWVTCSSCSNCPQTSGLGIQLNYFDTTSSSTARLVPCSHPICTSQIQTTATQCPPQS 161
Query: 157 S-CPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEE 215
+ C Y YGDGS T+GY+V D +D V G+ +++ +++FGC QSG+L T ++
Sbjct: 162 NQCSYAFQYGDGSGTSGYYVSDTFYFDAVLGESLIANSSAAIVFGCSTYQSGDLTKT-DK 220
Query: 216 ALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGIN-GGGIFAIGHVVQPEVNKTPLVP 274
A+DGI GFG+ S+ISQL+S G ++F+HCL G + GGGI +G +++P + +PLVP
Sbjct: 221 AVDGIFGFGQGELSVISQLSSHGITPRVFSHCLKGEDSGGGILVLGEILEPGIVYSPLVP 280
Query: 275 NQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQ 334
+QPHY++++ ++ V L + F N+GTIID+GTTLAYL E Y+P VS I +
Sbjct: 281 SQPHYNLDLQSIAVSGQLLPIDPAAFATSSNRGTIIDTGTTLAYLVEEAYDPFVSAITAA 340
Query: 335 QPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFED-----LWC 389
L T++ C+ S SV E FP V+F+F ++ + P EYL + LWC
Sbjct: 341 VSQLATPTINKGNQCYLVSNSVSEVFPPVSFNFAGGATMLLKPEEYLMYLTNYAGAALWC 400
Query: 390 IGWQNSGMQSRDRKNMTLLGDF 411
IG+Q + + +T+LGD
Sbjct: 401 IGFQ------KIQGGITILGDL 416
>gi|356570798|ref|XP_003553571.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 500
Score = 292 bits (747), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 155/387 (40%), Positives = 226/387 (58%), Gaps = 20/387 (5%)
Query: 38 RERSLSLLKEHDARRQQRILAGV-----DLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQ 92
++ L L+ D R RIL GV D + G+S P VGLY+ K+ +G+P K++YVQ
Sbjct: 40 QQVELEALRARDRARHGRILQGVVGGVVDFSVQGTSDPYFVGLYFTKVKLGSPAKEFYVQ 99
Query: 93 VDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDC 152
+DTGSDI+W+NCI C CP S LGIEL +D SST V+C C ++C
Sbjct: 100 IDTGSDILWINCITCSNCPHSSGLGIELDFFDTAGSSTAALVSCGDPICSYAVQTATSEC 159
Query: 153 TANTS-CPYLEIYGDGSSTTGYFVQDVVQYDKVS-GDLQTTSTNGSLIFGCGARQSGNLD 210
++ + C Y YGDGS TTGY+V D + +D V G +++ ++IFGC QSG+L
Sbjct: 160 SSQANQCSYTFQYGDGSGTTGYYVSDTMYFDTVLLGQSVVANSSSTIIFGCSTYQSGDLT 219
Query: 211 STNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDG-INGGGIFAIGHVVQPEVNK 269
T ++A+DGI GFG S+ISQL+S G K+F+HCL G NGGG+ +G +++P +
Sbjct: 220 KT-DKAVDGIFGFGPGALSVISQLSSRGVTPKVFSHCLKGGENGGGVLVLGEILEPSIVY 278
Query: 270 TPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVS 329
+PLVP+QPHY++N+ ++ V L + ++VF +N+GTI+DSGTTLAYL + Y P V
Sbjct: 279 SPLVPSQPHYNLNLQSIAVNGQLLPIDSNVFATTNNQGTIVDSGTTLAYLVQEAYNPFVK 338
Query: 330 KIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYL--FPFED- 386
I + + C+ S SV + FP V+ +F S+ + P YL + F D
Sbjct: 339 AITAAVSQFSKPIISKGNQCYLVSNSVGDIFPQVSLNFMGGASMVLNPEHYLMHYGFLDG 398
Query: 387 --LWCIGWQNSGMQSRDRKNMTLLGDF 411
+WCIG+Q + + T+LGD
Sbjct: 399 AAMWCIGFQ------KVEQGFTILGDL 419
>gi|148907752|gb|ABR17002.1| unknown [Picea sitchensis]
Length = 454
Score = 291 bits (746), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 156/377 (41%), Positives = 218/377 (57%), Gaps = 9/377 (2%)
Query: 42 LSLLKEHDARRQQRILAG-VDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIM 100
+LK HD R R L VD L G++ P GLYY +I +GTPP+ +YVQ+DTGSDI+
Sbjct: 6 FEMLKAHDRARHGRSLNTIVDFTLQGTADPYVAGLYYTRIELGTPPRPFYVQIDTGSDIL 65
Query: 101 WVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPY 160
WVNC C CP S LG+ L +D + SST ++C C + CT + C Y
Sbjct: 66 WVNCKPCNACPLTSGLGVALNFFDPRGSSTASPLSCIDSKCVSSNQISESVCTTDRYCGY 125
Query: 161 LEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGI 220
YGDGS T GY+V D Y++ T + + + FGC QSG+L + + A+DGI
Sbjct: 126 SFEYGDGSGTLGYYVSDEFDYNQYVNQYVTNNASAKITFGCSYNQSGDL-TKPDRAVDGI 184
Query: 221 IGFGKSNSSMISQLASSGGVRKMFAHCLDGIN-GGGIFAIGHVVQPEVNKTPLVPNQPHY 279
GFG+++ S++SQL S G K+F+HCL+G + GGGI +G + +P + TP+VP+QPHY
Sbjct: 185 FGFGQNDLSVVSQLNSQGLAPKIFSHCLEGADPGGGILVLGEITEPGMVYTPIVPSQPHY 244
Query: 280 SINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLK 339
++N+ + V L++ VF + +GTIID GTTLAYL E YEP V+ II+
Sbjct: 245 NLNLQGIAVNGQQLSIDPQVFATTNTRGTIIDCGTTLAYLAEEAYEPFVNTIIAAVSQST 304
Query: 340 VHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFP-----FEDLWCIGWQN 394
+ CF S+DE FP+VT +FE + + + P +YL +WCIGWQ
Sbjct: 305 QPFMLKGNPCFLTVHSIDEIFPSVTLYFEGA-PMDLKPKDYLIQQLSPDSSPVWCIGWQK 363
Query: 395 SGMQSRDRKNMTLLGDF 411
SG Q+ D MT+LGD
Sbjct: 364 SGQQATDSSKMTILGDL 380
>gi|356505293|ref|XP_003521426.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 499
Score = 290 bits (742), Expect = 9e-76, Method: Compositional matrix adjust.
Identities = 154/387 (39%), Positives = 224/387 (57%), Gaps = 20/387 (5%)
Query: 38 RERSLSLLKEHDARRQQRILAGV-----DLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQ 92
++ L L+ D R RIL GV D + G+S P VGLY+ K+ +G+P KD+YVQ
Sbjct: 40 QQVELEALRARDRARHGRILQGVVGGVVDFSVQGTSDPYFVGLYFTKVKLGSPAKDFYVQ 99
Query: 93 VDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDC 152
+DTGSDI+W+NCI C CP S LGIEL +D SST V+C C + C
Sbjct: 100 IDTGSDILWINCITCSNCPHSSGLGIELDFFDTAGSSTAALVSCADPICSYAVQTATSGC 159
Query: 153 TANTS-CPYLEIYGDGSSTTGYFVQDVVQYDKVS-GDLQTTSTNGSLIFGCGARQSGNLD 210
++ + C Y YGDGS TTGY+V D + +D V G +++ +++FGC QSG+L
Sbjct: 160 SSQANQCSYTFQYGDGSGTTGYYVSDTMYFDTVLLGQSMVANSSSTIVFGCSTYQSGDLT 219
Query: 211 STNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDG-INGGGIFAIGHVVQPEVNK 269
T ++A+DGI GFG S+ISQL+S G K+F+HCL G NGGG+ +G +++P +
Sbjct: 220 KT-DKAVDGIFGFGPGALSVISQLSSRGVTPKVFSHCLKGGENGGGVLVLGEILEPSIVY 278
Query: 270 TPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVS 329
+PLVP+ PHY++N+ ++ V L + ++VF +N+GTI+DSGTTLAYL + Y P V
Sbjct: 279 SPLVPSLPHYNLNLQSIAVNGQLLPIDSNVFATTNNQGTIVDSGTTLAYLVQEAYNPFVD 338
Query: 330 KIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYL--FPFED- 386
I + + C+ S SV + FP V+ +F S+ + P YL + F D
Sbjct: 339 AITAAVSQFSKPIISKGNQCYLVSNSVGDIFPQVSLNFMGGASMVLNPEHYLMHYGFLDS 398
Query: 387 --LWCIGWQNSGMQSRDRKNMTLLGDF 411
+WCIG+Q + + T+LGD
Sbjct: 399 AAMWCIGFQ------KVERGFTILGDL 419
>gi|413952263|gb|AFW84912.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 509
Score = 287 bits (735), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 158/388 (40%), Positives = 224/388 (57%), Gaps = 24/388 (6%)
Query: 42 LSLLKEHDARR----QQRILAGV----DLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQV 93
L L+ DA R ++R+L GV D P+ GS+ P VGLY+ ++ +G P K+++VQ+
Sbjct: 49 LEELRRRDAARHRVSRRRLLGGVAGVVDFPVEGSANPYMVGLYFTRVKLGNPAKEFFVQI 108
Query: 94 DTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCT 153
DTGSDI+WV C C CP S L I+L ++ SST +TC + C + C
Sbjct: 109 DTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASRITCSDDRCTAGFQTGEAICQ 168
Query: 154 ANTS----CPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNL 209
+ S C Y YGDGS T+GY+V D + ++ V G+ QT +++ S++FGC QSG+L
Sbjct: 169 TSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGNEQTANSSASIVFGCSNSQSGDL 228
Query: 210 DSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI-NGGGIFAIGHVVQPEVN 268
+ + A+DGI GFG+ S+ISQL S G K+F+HCL G NGGGI +G +V+P +
Sbjct: 229 -TKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCLKGSDNGGGILVLGEIVEPGLV 287
Query: 269 KTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLV 328
TPLVP+QPHY++N+ ++ V L + + +F + +GTI+DSGTTLAYL + Y+P V
Sbjct: 288 YTPLVPSQPHYNLNLESIAVNGQKLPIDSSLFTTSNTQGTIVDSGTTLAYLADGAYDPFV 347
Query: 329 SKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFED-- 386
S I + V CF S SVD FP VT +F V++ V P YL
Sbjct: 348 SAIAAAVSPSVRSLVSKGSQCFITSSSVDSSFPTVTLYFMGGVAMSVKPENYLLQQASVD 407
Query: 387 ---LWCIGWQNSGMQSRDRKNMTLLGDF 411
LWCIGWQ + Q +T+LGD
Sbjct: 408 NSVLWCIGWQRNQGQ-----EITILGDL 430
>gi|226508052|ref|NP_001150337.1| LOC100283967 precursor [Zea mays]
gi|195638522|gb|ACG38729.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 507
Score = 287 bits (735), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 158/388 (40%), Positives = 224/388 (57%), Gaps = 24/388 (6%)
Query: 42 LSLLKEHDARR----QQRILAGV----DLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQV 93
L L+ DA R ++R+L GV D P+ GS+ P VGLY+ ++ +G P K+++VQ+
Sbjct: 47 LEELRRRDAARHRVSRRRLLGGVAGVVDFPVEGSANPYMVGLYFTRVKLGNPAKEFFVQI 106
Query: 94 DTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCT 153
DTGSDI+WV C C CP S L I+L ++ SST +TC + C + C
Sbjct: 107 DTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASRITCSDDRCTAGFQTGEAICQ 166
Query: 154 ANTS----CPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNL 209
+ S C Y YGDGS T+GY+V D + ++ V G+ QT +++ S++FGC QSG+L
Sbjct: 167 TSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGNEQTANSSASIVFGCSNSQSGDL 226
Query: 210 DSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI-NGGGIFAIGHVVQPEVN 268
+ + A+DGI GFG+ S+ISQL S G K+F+HCL G NGGGI +G +V+P +
Sbjct: 227 -TKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCLKGSDNGGGILVLGEIVEPGLV 285
Query: 269 KTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLV 328
TPLVP+QPHY++N+ ++ V L + + +F + +GTI+DSGTTLAYL + Y+P V
Sbjct: 286 YTPLVPSQPHYNLNLESIAVNGQKLPIDSSLFTTSNTQGTIVDSGTTLAYLADGAYDPFV 345
Query: 329 SKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFED-- 386
S I + V CF S SVD FP VT +F V++ V P YL
Sbjct: 346 SAIAAAVSPSVRSLVSKGSQCFITSSSVDSSFPTVTLYFMGGVAMSVKPENYLLQQASVD 405
Query: 387 ---LWCIGWQNSGMQSRDRKNMTLLGDF 411
LWCIGWQ + Q +T+LGD
Sbjct: 406 NSVLWCIGWQRNQGQ-----EITILGDL 428
>gi|449440161|ref|XP_004137853.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449521209|ref|XP_004167622.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 492
Score = 286 bits (731), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 162/378 (42%), Positives = 225/378 (59%), Gaps = 15/378 (3%)
Query: 42 LSLLKEHDARRQQRILAGV-DLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIM 100
L L+ D R RIL GV D + GSS P VGLY+ K+ +GTPP ++ VQ+DTGSDI+
Sbjct: 44 LETLRARDRLRHARILQGVVDFSVEGSSDPLLVGLYFTKVKLGTPPMEFTVQIDTGSDIL 103
Query: 101 WVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDC-TANTSCP 159
WVNC C CPR S LGI+L +D SS+ V+C C+ + T C T + C
Sbjct: 104 WVNCNSCNGCPRSSGLGIQLNFFDASSSSSSSLVSCSDPICNSAFQTTATQCLTQSNQCS 163
Query: 160 YLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDG 219
Y YGDGS T+GY+V + + +D V G +++ S++FGC QSG+L + ++ A+DG
Sbjct: 164 YTFQYGDGSGTSGYYVSESMYFDMVMGQSMIANSSASVVFGCSTYQSGDL-TKSDHAIDG 222
Query: 220 IIGFGKSNSSMISQLASSGGVRKMFAHCLDGI-NGGGIFAIGHVVQPEVNKTPLVPNQPH 278
I GFG + S+ISQL++ G K+F+HCL G NGGGI +G V++P + +PLVP+QPH
Sbjct: 223 IFGFGPGDLSVISQLSARGITPKVFSHCLKGEGNGGGILVLGEVLEPGIVYSPLVPSQPH 282
Query: 279 YSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDL 338
Y++ + ++ V L + VF N+GTIIDSGTTLAYL E Y P VS I +
Sbjct: 283 YNLYLQSISVNGQTLPIDPSVFATSINRGTIIDSGTTLAYLVEEAYTPFVSAITAAVSQS 342
Query: 339 KVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYL--FPFED---LWCIGWQ 393
T+ C+ S SV E FP V+ +F S S+ + P EYL F D LWCIG+Q
Sbjct: 343 VTPTISKGNQCYLVSTSVGEIFPLVSLNFAGSASMVLKPEEYLMHLGFYDGAALWCIGFQ 402
Query: 394 NSGMQSRDRKNMTLLGDF 411
+ ++ +T+LGD
Sbjct: 403 ------KVQEGVTILGDL 414
>gi|224104765|ref|XP_002313558.1| predicted protein [Populus trichocarpa]
gi|222849966|gb|EEE87513.1| predicted protein [Populus trichocarpa]
Length = 468
Score = 286 bits (731), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 157/389 (40%), Positives = 229/389 (58%), Gaps = 20/389 (5%)
Query: 36 AGRERSLSLLKEHDARRQQRILAG-----VDLPLGGSSRPDGVGLYYAKIGIGTPPKDYY 90
A + LS LKE D R R+L VD P+ G+ P VGLYY ++ +GTPP+D+Y
Sbjct: 7 ANYKLKLSKLKERDRVRHGRMLQSSGVGVVDFPVQGTFDPFLVGLYYTRLQLGTPPRDFY 66
Query: 91 VQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLT 150
VQ+DTGSD++WV+C C CP S L I L +D S T ++C + C +
Sbjct: 67 VQIDTGSDVLWVSCGSCNGCPVNSGLHIPLNFFDPGSSPTASLISCSDQRCSLGLQSSDS 126
Query: 151 DCTA-NTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNL 209
C+A N C Y YGDGS T+GY+V D++ +D V G +++ ++FGC A Q+G+L
Sbjct: 127 VCSAQNNLCGYNFQYGDGSGTSGYYVSDLLHFDTVLGGSVMNNSSAPIVFGCSALQTGDL 186
Query: 210 DSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGIN-GGGIFAIGHVVQPEVN 268
+ ++ A+DGI GFG+ + S++SQLAS G + F+HCL G + GGGI +G +V+P +
Sbjct: 187 -TKSDRAVDGIFGFGQQDMSVVSQLASQGISPRAFSHCLKGDDSGGGILVLGEIVEPNIV 245
Query: 269 KTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLV 328
TPLVP+QPHY++NM ++ V L + VFG ++GTIIDSGTTLAYL E Y+P +
Sbjct: 246 YTPLVPSQPHYNLNMQSISVNGQTLAIDPSVFGTSSSQGTIIDSGTTLAYLAEAAYDPFI 305
Query: 329 SKIIS-QQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFED- 386
S I S P ++ + + C+ S S+++ FP V+ +F S+ + P +YL
Sbjct: 306 SAITSIVSPSVRPYLSKGNH-CYLISSSINDIFPQVSLNFAGGASMILIPQDYLIQQSSI 364
Query: 387 ----LWCIGWQNSGMQSRDRKNMTLLGDF 411
LWCIG+Q Q +T+LGD
Sbjct: 365 GGAALWCIGFQKIQGQ-----GITILGDL 388
>gi|218189149|gb|EEC71576.1| hypothetical protein OsI_03949 [Oryza sativa Indica Group]
Length = 504
Score = 286 bits (731), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 154/383 (40%), Positives = 220/383 (57%), Gaps = 16/383 (4%)
Query: 38 RERSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGS 97
R+R+ + + VD P+ GS+ P VGLY+ ++ +G+PPK+Y+VQ+DTGS
Sbjct: 53 RDRARHGRRGLLGGGGGGVAGVVDFPVEGSANPFMVGLYFTRVKLGSPPKEYFVQIDTGS 112
Query: 98 DIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDC--TAN 155
DI+WV C C CP S L I+L ++ SST + C + C C + N
Sbjct: 113 DILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKIPCSDDRCTAALQTSEAVCQTSDN 172
Query: 156 TSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEE 215
+ C Y YGDGS T+GY+V D + +D V G+ QT +++ S++FGC QSG+L T +
Sbjct: 173 SPCGYTFTYGDGSGTSGYYVSDTMYFDSVMGNEQTANSSASIVFGCSNSQSGDLTKT-DR 231
Query: 216 ALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI-NGGGIFAIGHVVQPEVNKTPLVP 274
A+DGI GFG+ S++SQL S G K+F+HCL G NGGGI +G +V+P + TPLVP
Sbjct: 232 AVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLKGSDNGGGILVLGEIVEPGLVYTPLVP 291
Query: 275 NQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQ 334
+QPHY++N+ ++ V L + + +F + +GTI+DSGTTLAYL + Y+P V+ I +
Sbjct: 292 SQPHYNLNLESIVVNGQKLPIDSSLFTTSNTQGTIVDSGTTLAYLADGAYDPFVNAITAA 351
Query: 335 QPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFED-----LWC 389
V CF S SVD FP V+ +F V++ V P YL LWC
Sbjct: 352 VSPSVRSLVSKGNQCFVTSSSVDSSFPTVSLYFMGGVAMTVKPENYLLQQASIDNNVLWC 411
Query: 390 IGWQ-NSGMQSRDRKNMTLLGDF 411
IGWQ N G Q +T+LGD
Sbjct: 412 IGWQRNQGQQ------ITILGDL 428
>gi|53791672|dbj|BAD53242.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
Length = 504
Score = 285 bits (729), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 154/383 (40%), Positives = 220/383 (57%), Gaps = 16/383 (4%)
Query: 38 RERSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGS 97
R+R+ + + VD P+ GS+ P VGLY+ ++ +G+PPK+Y+VQ+DTGS
Sbjct: 53 RDRARHGRRGLLGGGGGGVAGVVDFPVEGSANPFMVGLYFTRVKLGSPPKEYFVQIDTGS 112
Query: 98 DIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDC--TAN 155
DI+WV C C CP S L I+L ++ SST + C + C C + N
Sbjct: 113 DILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKIPCSDDRCTAALQTSEAVCQTSDN 172
Query: 156 TSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEE 215
+ C Y YGDGS T+GY+V D + +D V G+ QT +++ S++FGC QSG+L T +
Sbjct: 173 SPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQTANSSASIVFGCSNSQSGDLTKT-DR 231
Query: 216 ALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI-NGGGIFAIGHVVQPEVNKTPLVP 274
A+DGI GFG+ S++SQL S G K+F+HCL G NGGGI +G +V+P + TPLVP
Sbjct: 232 AVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLKGSDNGGGILVLGEIVEPGLVYTPLVP 291
Query: 275 NQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQ 334
+QPHY++N+ ++ V L + + +F + +GTI+DSGTTLAYL + Y+P V+ I +
Sbjct: 292 SQPHYNLNLESIVVNGQKLPIDSSLFTTSNTQGTIVDSGTTLAYLADGAYDPFVNAITAA 351
Query: 335 QPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFED-----LWC 389
V CF S SVD FP V+ +F V++ V P YL LWC
Sbjct: 352 VSPSVRSLVSKGNQCFVTSSSVDSSFPTVSLYFMGGVAMTVKPENYLLQQASIDNNVLWC 411
Query: 390 IGWQ-NSGMQSRDRKNMTLLGDF 411
IGWQ N G Q +T+LGD
Sbjct: 412 IGWQRNQGQQ------ITILGDL 428
>gi|413936885|gb|AFW71436.1| hypothetical protein ZEAMMB73_738128, partial [Zea mays]
Length = 320
Score = 285 bits (728), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 153/287 (53%), Positives = 200/287 (69%), Gaps = 12/287 (4%)
Query: 9 LCIVLIATAAVGGVSSNHGVFSVKY---RYAGR--ERSLSLLKEHDARRQQRILAGVDLP 63
L ++L A + G +S GVF V+ R+ GR L+ L+ HDA R R+L VDL
Sbjct: 14 LLVLLFALSV--GCASATGVFQVRRKFPRHGGRGVAEHLAALRRHDANRHGRLLGAVDLA 71
Query: 64 LGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLY 123
LGG P GLYY +I IG+PPK YYVQVDTGSDI+WVNCI+C CP RS LGIELT Y
Sbjct: 72 LGGVGLPTDTGLYYTRIEIGSPPKGYYVQVDTGSDILWVNCIRCDGCPTRSGLGIELTQY 131
Query: 124 DIKDSSTGKFVTCDQEFC--HGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQY 181
D + +G V C+QEFC + G P T + ++ C + YGDGS+TTG++V D VQY
Sbjct: 132 D--PAGSGTTVGCEQEFCVANSAGGVPPTCPSTSSPCQFRITYGDGSTTTGFYVTDFVQY 189
Query: 182 DKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVR 241
++VSG+ QTT++N S+ FGCGA+ G+L S+N +ALDGI+GFG+S+SSM+SQLA++ VR
Sbjct: 190 NQVSGNGQTTTSNASITFGCGAQLGGDLGSSN-QALDGILGFGQSDSSMLSQLAAARRVR 248
Query: 242 KMFAHCLDGINGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQV 288
K+FAHCLD + GGGIFAIG+VVQP+V TPLVPN S+ + V +
Sbjct: 249 KIFAHCLDTVRGGGIFAIGNVVQPKVKTTPLVPNVYVVSVLFSPVYI 295
>gi|357440289|ref|XP_003590422.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355479470|gb|AES60673.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 498
Score = 284 bits (727), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 160/423 (37%), Positives = 235/423 (55%), Gaps = 33/423 (7%)
Query: 8 CLCIVLIATAAVGGVSSNHGVFSVKYRYAGRER-SLSLLKEHDARRQQRILAG-----VD 61
C +ATA G G ++ R + L+ D R RIL VD
Sbjct: 13 CCIFTFVATAVHGA-----GYLPLQRNVPLNHRVEIDTLRARDRVRHGRILRASVGGVVD 67
Query: 62 LPLGGSSRPD--GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIE 119
+ GSS P G GLY K+ +GTPP+++ VQ+DTGSDI+W+NC C CP+ S LGIE
Sbjct: 68 FRVQGSSDPSTLGYGLYTTKVKMGTPPREFTVQIDTGSDILWINCNTCSNCPKSSGLGIE 127
Query: 120 LTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTS-CPYLEIYGDGSSTTGYFVQDV 178
L +D SST V C C G C+ + C Y Y DGS T+G +V D
Sbjct: 128 LNFFDTVGSSTAALVPCSDPMCASAIQGAAAQCSPQVNQCSYTFQYEDGSGTSGVYVSDA 187
Query: 179 VQYDKVSGDLQTTSTN----GSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQL 234
+ +D + G Q+T N +++FGC QSG+L T ++A+DGI+GFG S++SQL
Sbjct: 188 MYFDMILG--QSTPANVASSATIVFGCSTYQSGDLTKT-DKAVDGILGFGPGELSVVSQL 244
Query: 235 ASSGGVRKMFAHCLDGI-NGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFL 293
+S G K+F+HCL G NGGGI +G +++P + +PLVP+QPHY++N+ ++ V L
Sbjct: 245 SSRGITPKVFSHCLKGDGNGGGILVLGEILEPSIVYSPLVPSQPHYNLNLQSIAVNGQVL 304
Query: 294 NLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYS 353
++ VF D +GTIIDSGTTL+YL + Y+PLV+ + + + C+
Sbjct: 305 SINPAVFATSDKRGTIIDSGTTLSYLVQEAYDPLVNAVDTAVSQFATSFISKGSQCYLVL 364
Query: 354 ESVDEGFPNVTFHFENSVSLKVYPHEYLFP--FED---LWCIGWQNSGMQSRDRKNMTLL 408
S+D+ FP V+F+FE S+ + P +YL F+D +WCIG+Q + ++ +T+L
Sbjct: 365 TSIDDSFPTVSFNFEGGASMDLKPSQYLLNRGFQDGAKMWCIGFQ------KVQEGVTIL 418
Query: 409 GDF 411
GD
Sbjct: 419 GDL 421
>gi|297812425|ref|XP_002874096.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297319933|gb|EFH50355.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 493
Score = 283 bits (723), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 153/387 (39%), Positives = 220/387 (56%), Gaps = 17/387 (4%)
Query: 36 AGRERSLSLLKEHDARRQQRILAG----VDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYV 91
A E LS LK D R R+L +D P+ G+ P VGLYY KI +G+PP+D+YV
Sbjct: 37 ANHEMELSQLKARDKARHGRLLQSLGGVIDFPVDGTFDPFVVGLYYTKIRLGSPPRDFYV 96
Query: 92 QVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTD 151
QVDTGSD++WV+C C CP+ S L I+L +D S T V+C + C +
Sbjct: 97 QVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTATPVSCSDQRCSWGIQSSDSG 156
Query: 152 CTA-NTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLD 210
C+ N C Y YGDGS T+G++V DV+Q+D + G ++ ++FGC Q+G+L
Sbjct: 157 CSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVPNSTAPVVFGCSTSQTGDLV 216
Query: 211 STNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGIN-GGGIFAIGHVVQPEVNK 269
+ + A+DGI GFG+ S+ISQLAS G ++F+HCL G N GGGI +G +V+P +
Sbjct: 217 KS-DRAVDGIFGFGQQGMSVISQLASQGLAPRVFSHCLKGENGGGGILVLGEIVEPNMVF 275
Query: 270 TPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVS 329
TPLVP+QPHY++N+ ++ V L + VF + +GTIID+GTTLAYL E Y P V
Sbjct: 276 TPLVPSQPHYNVNLLSISVNGQALPINPSVFSTSNGQGTIIDTGTTLAYLSEAAYVPFVE 335
Query: 330 KIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFED--- 386
I + V C+ + SV + FP V+ +F S+ + P +YL +
Sbjct: 336 AITNAVSQSVRPVVSKGNQCYVIATSVADIFPPVSLNFAGGASMFLNPQDYLIQQNNVGG 395
Query: 387 --LWCIGWQNSGMQSRDRKNMTLLGDF 411
+WCIG+Q + +T+LGD
Sbjct: 396 TAVWCIGFQR-----IQNQGITILGDL 417
>gi|356564743|ref|XP_003550608.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 490
Score = 282 bits (722), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 152/382 (39%), Positives = 225/382 (58%), Gaps = 19/382 (4%)
Query: 42 LSLLKEHDARRQQRILAG----VDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGS 97
LS L+ DA R +R+L VD + G+ P VGLYY K+ +GTPP ++ VQ+DTGS
Sbjct: 37 LSQLRARDALRHRRMLQSSNGVVDFSVQGTFDPFQVGLYYTKVQLGTPPVEFNVQIDTGS 96
Query: 98 DIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCH-GVYGGPLTDCTANT 156
D++WV+C C CP+ S L I+L +D SST + C + C+ G+ T + N
Sbjct: 97 DVLWVSCNSCSGCPQTSGLQIQLNFFDPGSSSTSSMIACSDQRCNNGIQSSDATCSSQNN 156
Query: 157 SCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEA 216
C Y YGDGS T+GY+V D++ + + TT++ ++FGC +Q+G+L + ++ A
Sbjct: 157 QCSYTFQYGDGSGTSGYYVSDMMHLNTIFEGSVTTNSTAPVVFGCSNQQTGDL-TKSDRA 215
Query: 217 LDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDG-INGGGIFAIGHVVQPEVNKTPLVPN 275
+DGI GFG+ S+ISQL+S G ++F+HCL G +GGGI +G +V+P + T LVP
Sbjct: 216 VDGIFGFGQQEMSVISQLSSQGIAPRVFSHCLKGDSSGGGILVLGEIVEPNIVYTSLVPA 275
Query: 276 QPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQ 335
QPHY++N+ ++ V L + + VF +++GTI+DSGTTLAYL E Y+P VS I +
Sbjct: 276 QPHYNLNLQSIAVNGQTLQIDSSVFATSNSRGTIVDSGTTLAYLAEEAYDPFVSAITASI 335
Query: 336 PDLKVHTVHDE-YTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFEDL-----WC 389
P VHTV C+ + SV E FP V+ +F S+ + P +YL + WC
Sbjct: 336 PQ-SVHTVVSRGNQCYLITSSVTEVFPQVSLNFAGGASMILRPQDYLIQQNSIGGAAVWC 394
Query: 390 IGWQNSGMQSRDRKNMTLLGDF 411
IG+Q Q +T+LGD
Sbjct: 395 IGFQKIQGQ-----GITILGDL 411
>gi|30688682|ref|NP_197676.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|110736370|dbj|BAF00154.1| protease-like protein [Arabidopsis thaliana]
gi|332005704|gb|AED93087.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 493
Score = 282 bits (722), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 152/387 (39%), Positives = 220/387 (56%), Gaps = 17/387 (4%)
Query: 36 AGRERSLSLLKEHDARRQQRILAG----VDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYV 91
A E LS LK D R R+L +D P+ G+ P VGLYY K+ +GTPP+D+YV
Sbjct: 37 ANHEMELSQLKARDEARHGRLLQSLGGVIDFPVDGTFDPFVVGLYYTKLRLGTPPRDFYV 96
Query: 92 QVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTD 151
QVDTGSD++WV+C C CP+ S L I+L +D S T ++C + C +
Sbjct: 97 QVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTASPISCSDQRCSWGIQSSDSG 156
Query: 152 CTA-NTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLD 210
C+ N C Y YGDGS T+G++V DV+Q+D + G ++ ++FGC Q+G+L
Sbjct: 157 CSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVPNSTAPVVFGCSTSQTGDLV 216
Query: 211 STNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGIN-GGGIFAIGHVVQPEVNK 269
+ + A+DGI GFG+ S+ISQLAS G ++F+HCL G N GGGI +G +V+P +
Sbjct: 217 KS-DRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLKGENGGGGILVLGEIVEPNMVF 275
Query: 270 TPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVS 329
TPLVP+QPHY++N+ ++ V L + VF + +GTIID+GTTLAYL E Y P V
Sbjct: 276 TPLVPSQPHYNVNLLSISVNGQALPINPSVFSTSNGQGTIIDTGTTLAYLSEAAYVPFVE 335
Query: 330 KIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFED--- 386
I + V C+ + SV + FP V+ +F S+ + P +YL +
Sbjct: 336 AITNAVSQSVRPVVSKGNQCYVITTSVGDIFPPVSLNFAGGASMFLNPQDYLIQQNNVGG 395
Query: 387 --LWCIGWQNSGMQSRDRKNMTLLGDF 411
+WCIG+Q + +T+LGD
Sbjct: 396 TAVWCIGFQR-----IQNQGITILGDL 417
>gi|297827153|ref|XP_002881459.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297327298|gb|EFH57718.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 507
Score = 281 bits (720), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 145/386 (37%), Positives = 221/386 (57%), Gaps = 23/386 (5%)
Query: 42 LSLLKEHDARRQQRILAG----------VDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYV 91
LS L+ D R RIL G VD P+ GSS P VGLY+ K+ +G+PP ++ V
Sbjct: 56 LSELRARDRVRHARILLGGGRQSSVGGVVDFPVQGSSDPYLVGLYFTKVKLGSPPTEFNV 115
Query: 92 QVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTD 151
Q+DTGSDI+WV C C CP S LGI+L +D S T VTC C V+
Sbjct: 116 QIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSFTAGSVTCSDPICSSVFQTTAAQ 175
Query: 152 CTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDS 211
C+ N C Y YGDGS T+GY++ D +D + G+ +++ ++FGC QSG+L +
Sbjct: 176 CSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANSSAPIVFGCSTYQSGDL-T 234
Query: 212 TNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI-NGGGIFAIGHVVQPEVNKT 270
+++A+DGI GFGK S++SQL+S G +F+HCL G +GGG+F +G ++ P + +
Sbjct: 235 KSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGDGSGGGVFVLGEILVPGMVYS 294
Query: 271 PLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSK 330
PL+P+QPHY++N+ ++ V L + VF + +GTI+D+GTTL YL + Y+P ++
Sbjct: 295 PLLPSQPHYNLNLLSIGVNGQILPIDAAVFEASNTRGTIVDTGTTLTYLVKEAYDPFLNA 354
Query: 331 IISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPF-----E 385
I + L + + C+ S S+ + FP V+ +F S+ + P +YLF +
Sbjct: 355 ISNSVSQLVTLIISNGEQCYLVSTSISDMFPPVSLNFAGGASMMLRPQDYLFHYGFYDGA 414
Query: 386 DLWCIGWQNSGMQSRDRKNMTLLGDF 411
+WCIG+Q + + T+LGD
Sbjct: 415 SMWCIGFQ------KAPEEQTILGDL 434
>gi|359482287|ref|XP_002263129.2| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
vinifera]
gi|297740017|emb|CBI30199.3| unnamed protein product [Vitis vinifera]
Length = 502
Score = 281 bits (719), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 160/412 (38%), Positives = 240/412 (58%), Gaps = 19/412 (4%)
Query: 13 LIATAAVGGVSSNHGVFSVKYRYAGRER-SLSLLKEHDARRQQRILAGV-----DLPLGG 66
++ TAAV S + +++ + +R L +L+ D R R+L GV D + G
Sbjct: 17 ILLTAAVVHCGSPASLLTLERAFPVNQRVELEVLRARDQARHGRLLRGVVGGVVDFTVYG 76
Query: 67 SSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIK 126
+S P VGLY+ K+ +G+PP+++ VQ+DTGSDI+WV C C +CPR S LGIEL+ +D
Sbjct: 77 TSDPYLVGLYFTKVKLGSPPREFNVQIDTGSDILWVTCNSCNDCPRTSGLGIELSFFDPS 136
Query: 127 DSSTGKFVTCDQEFCHGVYGGPLTDCTANTS-CPYLEIYGDGSSTTGYFVQDVVQYDKVS 185
SST V+C C + +C+ ++ C Y YGDGS TTGY+V D++ +D V
Sbjct: 137 SSSTTSLVSCSHPICTSLVQTTAAECSPQSNQCSYSFHYGDGSGTTGYYVSDMLYFDTVL 196
Query: 186 GDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFA 245
GD +++ S++FGC QSG+L ++A+DGI GFG+ + S++SQL+S G K+F+
Sbjct: 197 GDSLIANSSASIVFGCSTYQSGDLTKV-DKAIDGIFGFGQQDLSVVSQLSSLGITPKVFS 255
Query: 246 HCLDGI-NGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGD 304
HCL G +GGG +G +++P + +PLVP+Q HY++N+ ++ V L + VF +
Sbjct: 256 HCLKGEGDGGGKLVLGEILEPNIIYSPLVPSQSHYNLNLQSISVNGQLLPIDPAVFATSN 315
Query: 305 NKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVT 364
N+GTI+DSGTTL YL E Y+P VS I + + C+ S SVDE FP V+
Sbjct: 316 NQGTIVDSGTTLTYLVETAYDPFVSAITATVSSSTTPVLSKGNQCYLVSTSVDEIFPPVS 375
Query: 365 FHFENSVSLKVYPHEYL--FPFED---LWCIGWQNSGMQSRDRKNMTLLGDF 411
+F S+ + P EYL F D +WCIG+Q +T+LGD
Sbjct: 376 LNFAGGASMVLKPGEYLMHLGFSDGAAMWCIGFQKVA-----EPGITILGDL 422
>gi|10177232|dbj|BAB10606.1| protease-like protein [Arabidopsis thaliana]
Length = 539
Score = 281 bits (719), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 152/387 (39%), Positives = 220/387 (56%), Gaps = 17/387 (4%)
Query: 36 AGRERSLSLLKEHDARRQQRILAG----VDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYV 91
A E LS LK D R R+L +D P+ G+ P VGLYY K+ +GTPP+D+YV
Sbjct: 37 ANHEMELSQLKARDEARHGRLLQSLGGVIDFPVDGTFDPFVVGLYYTKLRLGTPPRDFYV 96
Query: 92 QVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTD 151
QVDTGSD++WV+C C CP+ S L I+L +D S T ++C + C +
Sbjct: 97 QVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTASPISCSDQRCSWGIQSSDSG 156
Query: 152 CTA-NTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLD 210
C+ N C Y YGDGS T+G++V DV+Q+D + G ++ ++FGC Q+G+L
Sbjct: 157 CSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVPNSTAPVVFGCSTSQTGDLV 216
Query: 211 STNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGIN-GGGIFAIGHVVQPEVNK 269
+ + A+DGI GFG+ S+ISQLAS G ++F+HCL G N GGGI +G +V+P +
Sbjct: 217 KS-DRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLKGENGGGGILVLGEIVEPNMVF 275
Query: 270 TPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVS 329
TPLVP+QPHY++N+ ++ V L + VF + +GTIID+GTTLAYL E Y P V
Sbjct: 276 TPLVPSQPHYNVNLLSISVNGQALPINPSVFSTSNGQGTIIDTGTTLAYLSEAAYVPFVE 335
Query: 330 KIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFED--- 386
I + V C+ + SV + FP V+ +F S+ + P +YL +
Sbjct: 336 AITNAVSQSVRPVVSKGNQCYVITTSVGDIFPPVSLNFAGGASMFLNPQDYLIQQNNVGG 395
Query: 387 --LWCIGWQNSGMQSRDRKNMTLLGDF 411
+WCIG+Q + +T+LGD
Sbjct: 396 TAVWCIGFQR-----IQNQGITILGDL 417
>gi|42571079|ref|NP_973613.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|110737616|dbj|BAF00749.1| putative protease [Arabidopsis thaliana]
gi|330254187|gb|AEC09281.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 507
Score = 281 bits (718), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 146/386 (37%), Positives = 221/386 (57%), Gaps = 23/386 (5%)
Query: 42 LSLLKEHDARRQQRILAG----------VDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYV 91
LS L+ D R RIL G VD P+ GSS P VGLY+ K+ +G+PP ++ V
Sbjct: 56 LSELRARDRVRHARILLGGGRQSSVGGVVDFPVQGSSDPYLVGLYFTKVKLGSPPTEFNV 115
Query: 92 QVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTD 151
Q+DTGSDI+WV C C CP S LGI+L +D S T VTC C V+
Sbjct: 116 QIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSVTCSDPICSSVFQTTAAQ 175
Query: 152 CTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDS 211
C+ N C Y YGDGS T+GY++ D +D + G+ +++ ++FGC QSG+L +
Sbjct: 176 CSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANSSAPIVFGCSTYQSGDL-T 234
Query: 212 TNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI-NGGGIFAIGHVVQPEVNKT 270
+++A+DGI GFGK S++SQL+S G +F+HCL G +GGG+F +G ++ P + +
Sbjct: 235 KSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGDGSGGGVFVLGEILVPGMVYS 294
Query: 271 PLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSK 330
PLVP+QPHY++N+ ++ V L L VF + +GTI+D+GTTL YL + Y+ ++
Sbjct: 295 PLVPSQPHYNLNLLSIGVNGQMLPLDAAVFEASNTRGTIVDTGTTLTYLVKEAYDLFLNA 354
Query: 331 IISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPF-----E 385
I + L + + C+ S S+ + FP+V+ +F S+ + P +YLF +
Sbjct: 355 ISNSVSQLVTPIISNGEQCYLVSTSISDMFPSVSLNFAGGASMMLRPQDYLFHYGIYDGA 414
Query: 386 DLWCIGWQNSGMQSRDRKNMTLLGDF 411
+WCIG+Q + + T+LGD
Sbjct: 415 SMWCIGFQ------KAPEEQTILGDL 434
>gi|4415912|gb|AAD20143.1| putative protease [Arabidopsis thaliana]
Length = 469
Score = 281 bits (718), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 146/386 (37%), Positives = 221/386 (57%), Gaps = 23/386 (5%)
Query: 42 LSLLKEHDARRQQRILAG----------VDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYV 91
LS L+ D R RIL G VD P+ GSS P VGLY+ K+ +G+PP ++ V
Sbjct: 56 LSELRARDRVRHARILLGGGRQSSVGGVVDFPVQGSSDPYLVGLYFTKVKLGSPPTEFNV 115
Query: 92 QVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTD 151
Q+DTGSDI+WV C C CP S LGI+L +D S T VTC C V+
Sbjct: 116 QIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSVTCSDPICSSVFQTTAAQ 175
Query: 152 CTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDS 211
C+ N C Y YGDGS T+GY++ D +D + G+ +++ ++FGC QSG+L +
Sbjct: 176 CSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANSSAPIVFGCSTYQSGDL-T 234
Query: 212 TNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI-NGGGIFAIGHVVQPEVNKT 270
+++A+DGI GFGK S++SQL+S G +F+HCL G +GGG+F +G ++ P + +
Sbjct: 235 KSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGDGSGGGVFVLGEILVPGMVYS 294
Query: 271 PLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSK 330
PLVP+QPHY++N+ ++ V L L VF + +GTI+D+GTTL YL + Y+ ++
Sbjct: 295 PLVPSQPHYNLNLLSIGVNGQMLPLDAAVFEASNTRGTIVDTGTTLTYLVKEAYDLFLNA 354
Query: 331 IISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPF-----E 385
I + L + + C+ S S+ + FP+V+ +F S+ + P +YLF +
Sbjct: 355 ISNSVSQLVTPIISNGEQCYLVSTSISDMFPSVSLNFAGGASMMLRPQDYLFHYGIYDGA 414
Query: 386 DLWCIGWQNSGMQSRDRKNMTLLGDF 411
+WCIG+Q + + T+LGD
Sbjct: 415 SMWCIGFQ------KAPEEQTILGDL 434
>gi|224068901|ref|XP_002326227.1| predicted protein [Populus trichocarpa]
gi|222833420|gb|EEE71897.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 280 bits (715), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 148/382 (38%), Positives = 219/382 (57%), Gaps = 20/382 (5%)
Query: 42 LSLLKEHDARRQQRILAG-----VDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTG 96
L L+ D R R+L G VD + GSS P VGLY+ K+ +G+PP+++ VQ+DTG
Sbjct: 27 LHQLRARDRLRHARLLQGFVGGVVDFSVQGSSDPYLVGLYFTKVKLGSPPREFNVQIDTG 86
Query: 97 SDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANT 156
SD++WV C C CPR S LGI+L +D SST V C C T C++ T
Sbjct: 87 SDVLWVCCNSCNNCPRTSGLGIQLNFFDSSSSSTAGQVRCSDPICTSAVQTTATQCSSQT 146
Query: 157 -SCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEE 215
C Y YGDGS T+GY+V D + +D + G +++ ++FGC A QSG+L T ++
Sbjct: 147 DQCSYTFQYGDGSGTSGYYVSDTLYFDAILGQSLIDNSSALIVFGCSAYQSGDLTKT-DK 205
Query: 216 ALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI-NGGGIFAIGHVVQPEVNKTPLVP 274
A+DGI GFG+ S+ISQL++ G ++F+HCL G +GGGI +G +++P + +PLVP
Sbjct: 206 AVDGIFGFGQGELSVISQLSTRGITPRVFSHCLKGDGSGGGILVLGEILEPGIVYSPLVP 265
Query: 275 NQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQ 334
+QPHY++N+ ++ V L + F +++GTI+DSGTTLAYL Y+P VS + +
Sbjct: 266 SQPHYNLNLLSIAVNGQLLPIDPAAFATSNSQGTIVDSGTTLAYLVAEAYDPFVSAVNAI 325
Query: 335 QPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPF-----EDLWC 389
C+ S SV + FP +F+F S+ + P +YL PF +WC
Sbjct: 326 VSPSVTPITSKGNQCYLVSTSVSQMFPLASFNFAGGASMVLKPEDYLIPFGSSGGSAMWC 385
Query: 390 IGWQNSGMQSRDRKNMTLLGDF 411
IG+Q + +T+LGD
Sbjct: 386 IGFQK-------VQGVTILGDL 400
>gi|356543524|ref|XP_003540210.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 493
Score = 277 bits (708), Expect = 8e-72, Method: Compositional matrix adjust.
Identities = 151/399 (37%), Positives = 225/399 (56%), Gaps = 29/399 (7%)
Query: 24 SNHGVFSVKYRYAGRERSLSLLKEHDARRQQRILAG----VDLPLGGSSRPDGVGLYYAK 79
+NHGV LS L+ D R +R+L VD + G+ P VGLYY K
Sbjct: 34 TNHGV------------ELSQLRARDELRHRRMLQSSSGVVDFSVQGTFDPFQVGLYYTK 81
Query: 80 IGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQE 139
+ +GTPP ++ VQ+DTGSD++WV+C C CP+ S L I+L +D SST + C +
Sbjct: 82 VQLGTPPVEFNVQIDTGSDVLWVSCNSCNGCPQTSGLQIQLNFFDPGSSSTSSMIACSDQ 141
Query: 140 FCH-GVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLI 198
C+ G T + N C Y YGDGS T+GY+V D++ + + TT++ ++
Sbjct: 142 RCNNGKQSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSDMMHLNTIFEGSMTTNSTAPVV 201
Query: 199 FGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDG-INGGGIF 257
FGC +Q+G+L + ++ A+DGI GFG+ S+ISQL+S G ++F+HCL G +GGGI
Sbjct: 202 FGCSNQQTGDL-TKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRIFSHCLKGDSSGGGIL 260
Query: 258 AIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLA 317
+G +V+P + T LVP QPHY++N+ ++ V L + + VF +++GTI+DSGTTLA
Sbjct: 261 VLGEIVEPNIVYTSLVPAQPHYNLNLQSISVNGQTLQIDSSVFATSNSRGTIVDSGTTLA 320
Query: 318 YLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYP 377
YL E Y+P VS I + P V C+ + SV + FP V+ +F S+ + P
Sbjct: 321 YLAEEAYDPFVSAITAAIPQSVRTVVSRGNQCYLITSSVTDVFPQVSLNFAGGASMILRP 380
Query: 378 HEYLFPFEDL-----WCIGWQNSGMQSRDRKNMTLLGDF 411
+YL + WCIG+Q Q +T+LGD
Sbjct: 381 QDYLIQQNSIGGAAVWCIGFQKIQGQ-----GITILGDL 414
>gi|255565531|ref|XP_002523756.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223537060|gb|EEF38696.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 507
Score = 277 bits (708), Expect = 8e-72, Method: Compositional matrix adjust.
Identities = 155/394 (39%), Positives = 229/394 (58%), Gaps = 24/394 (6%)
Query: 36 AGRERSLSLLKEHDARRQQRILAG------VDLPLGGSSRPDGVGLYYAKIGIGTPPKDY 89
A + LS LKE D+ R +RIL VD P+ G+ P VGLY+ ++ +G+PPKD+
Sbjct: 38 ASHKLELSQLKERDSFRHRRILQSTTSGGVVDFPVQGTFNPFLVGLYFTRVQLGSPPKDF 97
Query: 90 YVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPL 149
YVQ+DTGSD++WV+C C CP S L I LT +D S+T V+C + C
Sbjct: 98 YVQIDTGSDVLWVSCSSCNGCPVTSGLQIPLTFFDPGSSTTAALVSCSDQRCTAGIQSSD 157
Query: 150 TDCTANTS-CPYLEIYGDGSSTTGYFVQDVVQYDKV---SGDLQTT--STNGSLIFGCGA 203
+ C++ T+ C Y YGDGS T+GY+V D++ D + SG+L + + S+ F C
Sbjct: 158 SLCSSRTNQCGYTFQYGDGSGTSGYYVADLMHLDTLLLSSGELSQICQTYDSSVSFMCST 217
Query: 204 RQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGIN-GGGIFAIGHV 262
Q+G+L + ++ A+DGI GFG+ S+ISQLAS G ++F+HCL G + GGG+ +G +
Sbjct: 218 LQTGDL-TKSDRAVDGIFGFGQQEMSVISQLASQGITPRVFSHCLKGDDSGGGVLVLGEI 276
Query: 263 VQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEM 322
V+P + TPLVP+QPHY++ + ++ V L + VFG N+GTI+DSGTTLAYL E
Sbjct: 277 VEPNIVYTPLVPSQPHYNLYLQSISVAGQTLAIDPSVFGASSNQGTIVDSGTTLAYLAEG 336
Query: 323 VYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLF 382
Y+P VS I S + C+ + SV++ FP V+ +F SL + P +YL
Sbjct: 337 AYDPFVSAITSVVSLNARTYLSKGNQCYLVTSSVNDVFPQVSLNFAGGASLILNPQDYLL 396
Query: 383 PFED-----LWCIGWQNSGMQSRDRKNMTLLGDF 411
+WC+G+Q + Q +T+LGD
Sbjct: 397 QQNSVGGAAVWCVGFQKTPGQ-----QITILGDL 425
>gi|357125326|ref|XP_003564345.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 506
Score = 276 bits (707), Expect = 9e-72, Method: Compositional matrix adjust.
Identities = 153/387 (39%), Positives = 219/387 (56%), Gaps = 28/387 (7%)
Query: 45 LKEHDAR---RQQRILAG-------VDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVD 94
LKE D R++ +L G VD P+ GS+ P VGLY+ ++ +G P K+Y+VQ+D
Sbjct: 48 LKERDGAHHARRRGLLGGAPAVAGVVDFPVEGSANPYMVGLYFTRVKLGNPAKEYFVQID 107
Query: 95 TGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTA 154
TGSDI+WV C C CP S L I+L ++ SST + C + C C +
Sbjct: 108 TGSDILWVACSPCTGCPTSSGLNIQLEFFNPDSSSTSSRIPCSDDRCTAALQTGEAVCQS 167
Query: 155 NTS----CPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLD 210
+ S C Y YGDGS T+G++V D + +D V G+ QT +++ S++FGC QSG+L
Sbjct: 168 SDSPSSPCGYTFTYGDGSGTSGFYVSDTMYFDTVMGNEQTANSSASVVFGCSNSQSGDLM 227
Query: 211 STNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI-NGGGIFAIGHVVQPEVNK 269
T + A+DGI GFG+ S++SQL S G K F+HCL G NGGGI +G +V+P +
Sbjct: 228 KT-DRAVDGIFGFGQHQLSVVSQLYSLGVSPKTFSHCLKGSDNGGGILVLGEIVEPGLVF 286
Query: 270 TPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVS 329
TPLVP+QPHY++N+ ++ V L + + +F + +GTI+DSGTTL YL + Y+P ++
Sbjct: 287 TPLVPSQPHYNLNLESIAVSGQKLPIDSSLFATSNTQGTIVDSGTTLVYLVDGAYDPFIN 346
Query: 330 KIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFED--- 386
I + V CF + SVD FP T +F+ VS+ V P YL
Sbjct: 347 AIAAAVSPSVRSVVSKGIQCFVTTSSVDSSFPTATLYFKGGVSMTVKPENYLLQQGSVDN 406
Query: 387 --LWCIGWQNSGMQSRDRKNMTLLGDF 411
LWCIGWQ S + +T+LGD
Sbjct: 407 NVLWCIGWQRS-------QGITILGDL 426
>gi|42569679|ref|NP_181205.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|330254186|gb|AEC09280.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 512
Score = 275 bits (703), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 146/391 (37%), Positives = 221/391 (56%), Gaps = 28/391 (7%)
Query: 42 LSLLKEHDARRQQRILAG----------VDLPLGGSSRPDGVG-----LYYAKIGIGTPP 86
LS L+ D R RIL G VD P+ GSS P VG LY+ K+ +G+PP
Sbjct: 56 LSELRARDRVRHARILLGGGRQSSVGGVVDFPVQGSSDPYLVGSKMTMLYFTKVKLGSPP 115
Query: 87 KDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYG 146
++ VQ+DTGSDI+WV C C CP S LGI+L +D S T VTC C V+
Sbjct: 116 TEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSVTCSDPICSSVFQ 175
Query: 147 GPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQS 206
C+ N C Y YGDGS T+GY++ D +D + G+ +++ ++FGC QS
Sbjct: 176 TTAAQCSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANSSAPIVFGCSTYQS 235
Query: 207 GNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI-NGGGIFAIGHVVQP 265
G+L + +++A+DGI GFGK S++SQL+S G +F+HCL G +GGG+F +G ++ P
Sbjct: 236 GDL-TKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGDGSGGGVFVLGEILVP 294
Query: 266 EVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYE 325
+ +PLVP+QPHY++N+ ++ V L L VF + +GTI+D+GTTL YL + Y+
Sbjct: 295 GMVYSPLVPSQPHYNLNLLSIGVNGQMLPLDAAVFEASNTRGTIVDTGTTLTYLVKEAYD 354
Query: 326 PLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPF- 384
++ I + L + + C+ S S+ + FP+V+ +F S+ + P +YLF +
Sbjct: 355 LFLNAISNSVSQLVTPIISNGEQCYLVSTSISDMFPSVSLNFAGGASMMLRPQDYLFHYG 414
Query: 385 ----EDLWCIGWQNSGMQSRDRKNMTLLGDF 411
+WCIG+Q + + T+LGD
Sbjct: 415 IYDGASMWCIGFQ------KAPEEQTILGDL 439
>gi|224058947|ref|XP_002299658.1| predicted protein [Populus trichocarpa]
gi|222846916|gb|EEE84463.1| predicted protein [Populus trichocarpa]
Length = 451
Score = 275 bits (702), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 152/396 (38%), Positives = 225/396 (56%), Gaps = 26/396 (6%)
Query: 36 AGRERSLSLLKEHDARRQQRILAG-----VDLPLGGSSRPDGVG--------LYYAKIGI 82
A + LS LKE D R R+L VD P+ G+ P VG LYY ++ +
Sbjct: 37 ASHKLKLSQLKERDRVRHSRMLQSSGGGVVDFPVQGTFDPFLVGFYFGSFCRLYYTRLQL 96
Query: 83 GTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCH 142
G+PP+D+YVQ+DTGSD++WV+C C CP S L I L +D S T ++C + C
Sbjct: 97 GSPPRDFYVQIDTGSDVLWVSCSSCNGCPVSSGLHIPLNFFDPGSSPTASLISCSDQRCS 156
Query: 143 GVYGGPLTDCTA-NTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGC 201
+ C A N C Y YGDGS T+GY+V D++ +D + G +++ ++FGC
Sbjct: 157 LGLQSSDSVCAAQNNQCGYTFQYGDGSGTSGYYVSDLLHFDTILGGSVMKNSSAPIVFGC 216
Query: 202 GARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGIN-GGGIFAIG 260
Q+G+L + + A+DGI GFG+ + S+ISQLAS G ++F+HCL G + GGGI +G
Sbjct: 217 STLQTGDL-TKPDRAVDGIFGFGQQDMSVISQLASQGITPRVFSHCLKGDDSGGGILVLG 275
Query: 261 HVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLP 320
+V+P + TPLVP+QPHY++N+ ++ V L + VF N+GTIIDSGTTLAYL
Sbjct: 276 EIVEPNIVYTPLVPSQPHYNLNLQSIYVNGQTLAIDPSVFATSSNQGTIIDSGTTLAYLT 335
Query: 321 EMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEY 380
E Y+P +S I S + C+ S S+++ FP V+ +F S+ + P +Y
Sbjct: 336 EAAYDPFISAITSTVSPSVSPYLSKGNQCYLTSSSINDVFPQVSLNFAGGTSMILIPQDY 395
Query: 381 LFPFED-----LWCIGWQNSGMQSRDRKNMTLLGDF 411
L LWC+G+Q +Q ++ +T+LGD
Sbjct: 396 LIQQSSINGAALWCVGFQK--IQGQE---ITILGDL 426
>gi|356546036|ref|XP_003541438.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 486
Score = 274 bits (700), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 156/418 (37%), Positives = 237/418 (56%), Gaps = 23/418 (5%)
Query: 10 CIVLIATAAVGGVSSNHGVFSVKYRY---AGRERSLSLLKEHDARRQQRILAGV-----D 61
CI + +S+ HGVF R G ++ LK D R R+L GV D
Sbjct: 4 CIPTLLLVTTVLLSAVHGVFLPLERSIPPTGHRVEVAALKARDRARHARMLRGVAGGVVD 63
Query: 62 LPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELT 121
+ G+S P+ VGLYY K+ +GTPPK++ VQ+DTGSDI+WVNC C CP+ S LGIEL
Sbjct: 64 FSVQGTSDPNSVGLYYTKVKMGTPPKEFNVQIDTGSDILWVNCNTCSNCPQSSQLGIELN 123
Query: 122 LYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTS-CPYLEIYGDGSSTTGYFVQDVVQ 180
+D SST + C C G +C+ + C Y YGDGS T+GY+V D +
Sbjct: 124 FFDTVGSSTAALIPCSDPICTSRVQGAAAECSPRVNQCSYTFQYGDGSGTSGYYVSDAMY 183
Query: 181 YDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGV 240
+ + G +++ +++FGC QSG+L T ++A+DGI GFG S++SQL+S G
Sbjct: 184 FSLIMGQPPAVNSSATIVFGCSISQSGDLTKT-DKAVDGIFGFGPGPLSVVSQLSSRGIT 242
Query: 241 RKMFAHCLD-GINGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDV 299
K+F+HCL +GGG+ +G +++P + +PLVP+QPHY++N+ ++ V L + V
Sbjct: 243 PKVFSHCLKGDGDGGGVLVLGEILEPSIVYSPLVPSQPHYNLNLQSIAVNGQLLPINPAV 302
Query: 300 FGVGDNK-GTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDE 358
F + +N+ GTI+D GTTLAYL + Y+PLV+ I + T C+ S S+ +
Sbjct: 303 FSISNNRGGTIVDCGTTLAYLIQEAYDPLVTAINTAVSQSARQTNSKGNQCYLVSTSIGD 362
Query: 359 GFPNVTFHFENSVSLKVYPHEYL-----FPFEDLWCIGWQNSGMQSRDRKNMTLLGDF 411
FP+V+ +FE S+ + P +YL ++WCIG+Q + ++ ++LGD
Sbjct: 363 IFPSVSLNFEGGASMVLKPEQYLMHNGYLDGAEMWCIGFQ------KFQEGASILGDL 414
>gi|356539352|ref|XP_003538162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 489
Score = 272 bits (695), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 150/382 (39%), Positives = 223/382 (58%), Gaps = 18/382 (4%)
Query: 42 LSLLKEHDARRQQRILAG----VDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGS 97
LS L+ D+ R +R+L VD P+ G+ P VGLYY K+ +GTPP++ YVQ+DTGS
Sbjct: 39 LSELRARDSLRHRRMLQSTNYVVDFPVKGTFDPSQVGLYYTKVKLGTPPRELYVQIDTGS 98
Query: 98 DIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCH-GVYGGPLTDCTANT 156
D++WV+C C CP+ S L I+L +D SST ++C C GV + N
Sbjct: 99 DVLWVSCGSCNGCPQTSGLQIQLNYFDPGSSSTSSLISCLDRRCRSGVQTSDASCSGRNN 158
Query: 157 SCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEA 216
C Y YGDGS T+GY+V D++ + + TT+++ S++FGC Q+G+L + +E A
Sbjct: 159 QCTYTFQYGDGSGTSGYYVSDLMHFASIFEGTLTTNSSASVVFGCSILQTGDL-TKSERA 217
Query: 217 LDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGIN-GGGIFAIGHVVQPEVNKTPLVPN 275
+DGI GFG+ S+ISQL+S G ++F+HCL G N GGG+ +G +V+P + +PLVP+
Sbjct: 218 VDGIFGFGQQGMSVISQLSSQGIAPRVFSHCLKGDNSGGGVLVLGEIVEPNIVYSPLVPS 277
Query: 276 QPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQ 335
QPHY++N+ ++ V + + VF +N+GTI+DSGTTLAYL E Y P V I +
Sbjct: 278 QPHYNLNLQSISVNGQIVRIAPSVFATSNNRGTIVDSGTTLAYLAEEAYNPFVIAIAAVI 337
Query: 336 PDLKVHTVHDEYTCFQYSESVD-EGFPNVTFHFENSVSLKVYPHEYLFPFE-----DLWC 389
P + C+ + S + + FP V+ +F SL + P +YL +WC
Sbjct: 338 PQSVRSVLSRGNQCYLITTSSNVDIFPQVSLNFAGGASLVLRPQDYLMQQNFIGEGSVWC 397
Query: 390 IGWQNSGMQSRDRKNMTLLGDF 411
IG+Q QS +T+LGD
Sbjct: 398 IGFQKISGQS-----ITILGDL 414
>gi|51536458|gb|AAU05467.1| At5g22850 [Arabidopsis thaliana]
gi|55733777|gb|AAV59285.1| At5g22850 [Arabidopsis thaliana]
Length = 426
Score = 271 bits (694), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 143/352 (40%), Positives = 205/352 (58%), Gaps = 7/352 (1%)
Query: 36 AGRERSLSLLKEHDARRQQRILAG----VDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYV 91
A E LS LK D R R+L +D P+ G+ P VGLYY K+ +GTPP+D+YV
Sbjct: 37 ANHEMELSQLKARDEARHGRLLQSLGGVIDFPVDGTFDPFVVGLYYTKLRLGTPPRDFYV 96
Query: 92 QVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTD 151
QVDTGSD++WV+C C CP+ S L I+L +D S T ++C + C +
Sbjct: 97 QVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTASPISCSDQRCSWGIQSSDSG 156
Query: 152 CTA-NTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLD 210
C+ N C Y YGDGS T+G++V DV+Q+D + G ++ ++FGC Q+G+L
Sbjct: 157 CSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVPNSTAPVVFGCSTSQTGDL- 215
Query: 211 STNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGIN-GGGIFAIGHVVQPEVNK 269
++ A+DGI GFG+ S+ISQLAS G ++F+HCL G N GGGI +G +V+P +
Sbjct: 216 VKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLKGENGGGGILVLGEIVEPNMVF 275
Query: 270 TPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVS 329
TPLVP+QPHY++N+ ++ V L + VF + +GTIID+GTTLAYL E Y P V
Sbjct: 276 TPLVPSQPHYNVNLLSISVNGQALPINPSVFSTSNGQGTIIDTGTTLAYLSEAAYVPFVE 335
Query: 330 KIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYL 381
I + V C+ + SV + FP V+ +F S+ + P +YL
Sbjct: 336 AITNAVSQSVRPVVSKGNQCYVITTSVGDIFPPVSLNFAGGASMFLNPQDYL 387
>gi|255634819|gb|ACU17770.1| unknown [Glycine max]
Length = 354
Score = 270 bits (690), Expect = 9e-70, Method: Compositional matrix adjust.
Identities = 140/344 (40%), Positives = 208/344 (60%), Gaps = 10/344 (2%)
Query: 60 VDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIE 119
VD + G+ P VGLYY K+ +GTPP ++ VQ+DTGSD++WV+C C CP+ S L I+
Sbjct: 9 VDFSVQGTFDPFQVGLYYTKVQLGTPPVEFNVQIDTGSDVLWVSCNSCSGCPQTSGLQIQ 68
Query: 120 LTLYDIKDSSTGKFVTCDQEFCH-GVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDV 178
L +D SST + C + C+ G+ T + N C Y YGDGS T+GY+V D+
Sbjct: 69 LNFFDPGSSSTSSMIACSDQRCNNGIQSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSDM 128
Query: 179 VQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSG 238
+ + + TT++ ++FGC +Q+G+L + ++ A+DGI GFG+ S+ISQL+S G
Sbjct: 129 MHLNTIFEGSVTTNSTAPVVFGCSNQQTGDL-TKSDRAVDGIFGFGQQEMSVISQLSSQG 187
Query: 239 GVRKMFAHCLDG-INGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPT 297
++F+HCL G +GGGI +G +V+P + T LVP QPHY++N+ ++ V L + +
Sbjct: 188 IAPRVFSHCLKGDSSGGGILVLGEIVEPNIVYTSLVPAQPHYNLNLQSIAVNGQTLQIDS 247
Query: 298 DVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHT-VHDEYTCFQYSESV 356
VF +++GTI+DSGTTLAYL E Y+P VS I + P VHT V C+ + SV
Sbjct: 248 SVFATSNSRGTIVDSGTTLAYLAEEAYDPFVSAITASIPQ-SVHTAVSRGNQCYLITSSV 306
Query: 357 DEGFPNVTFHFENSVSLKVYPHEYLFPFEDL-----WCIGWQNS 395
E FP V+ +F S+ + P +YL + WCIG+Q S
Sbjct: 307 TEVFPQVSLNFAGGASMILRPQDYLIQQNSIGGAAVWCIGFQKS 350
>gi|194707632|gb|ACF87900.1| unknown [Zea mays]
Length = 423
Score = 270 bits (690), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 144/349 (41%), Positives = 204/349 (58%), Gaps = 16/349 (4%)
Query: 73 VGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGK 132
VGLY+ ++ +G P K+++VQ+DTGSDI+WV C C CP S L I+L ++ SST
Sbjct: 2 VGLYFTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTAS 61
Query: 133 FVTCDQEFCHGVYGGPLTDCTANTS----CPYLEIYGDGSSTTGYFVQDVVQYDKVSGDL 188
+TC + C + C + S C Y YGDGS T+GY+V D + ++ V G+
Sbjct: 62 RITCSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGNE 121
Query: 189 QTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL 248
QT +++ S++FGC QSG+L + + A+DGI GFG+ S+ISQL S G K+F+HCL
Sbjct: 122 QTANSSASIVFGCSNSQSGDL-TKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCL 180
Query: 249 DGI-NGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKG 307
G NGGGI +G +V+P + TPLVP+QPHY++N+ ++ V L + + +F + +G
Sbjct: 181 KGSDNGGGILVLGEIVEPGLVYTPLVPSQPHYNLNLESIAVNGQKLPIDSSLFTTSNTQG 240
Query: 308 TIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHF 367
TI+DSGTTLAYL + Y+P VS I + V CF S SVD FP VT +F
Sbjct: 241 TIVDSGTTLAYLADGAYDPFVSAIAAAVSPSVRSLVSKGSQCFITSSSVDSSFPTVTLYF 300
Query: 368 ENSVSLKVYPHEYLFPFED-----LWCIGWQNSGMQSRDRKNMTLLGDF 411
V++ V P YL LWCIGWQ + Q +T+LGD
Sbjct: 301 MGGVAMSVKPENYLLQQASVDNSVLWCIGWQRNQGQ-----EITILGDL 344
>gi|222619345|gb|EEE55477.1| hypothetical protein OsJ_03658 [Oryza sativa Japonica Group]
Length = 530
Score = 268 bits (686), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 143/345 (41%), Positives = 203/345 (58%), Gaps = 16/345 (4%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y+ ++ +G+PPK+Y+VQ+DTGSDI+WV C C CP S L I+L ++ SST +
Sbjct: 117 YFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKIP 176
Query: 136 CDQEFCHGVYGGPLTDC--TANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTST 193
C + C C + N+ C Y YGDGS T+GY+V D + +D V G+ QT ++
Sbjct: 177 CSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQTANS 236
Query: 194 NGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI-N 252
+ S++FGC QSG+L T + A+DGI GFG+ S++SQL S G K+F+HCL G N
Sbjct: 237 SASIVFGCSNSQSGDLTKT-DRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLKGSDN 295
Query: 253 GGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDS 312
GGGI +G +V+P + TPLVP+QPHY++N+ ++ V L + + +F + +GTI+DS
Sbjct: 296 GGGILVLGEIVEPGLVYTPLVPSQPHYNLNLESIVVNGQKLPIDSSLFTTSNTQGTIVDS 355
Query: 313 GTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVS 372
GTTLAYL + Y+P V+ I + V CF S SVD FP V+ +F V+
Sbjct: 356 GTTLAYLADGAYDPFVNAITAAVSPSVRSLVSKGNQCFVTSSSVDSSFPTVSLYFMGGVA 415
Query: 373 LKVYPHEYLFPFED-----LWCIGWQ-NSGMQSRDRKNMTLLGDF 411
+ V P YL LWCIGWQ N G Q +T+LGD
Sbjct: 416 MTVKPENYLLQQASIDNNVLWCIGWQRNQGQQ------ITILGDL 454
>gi|356542694|ref|XP_003539801.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 489
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 148/382 (38%), Positives = 225/382 (58%), Gaps = 18/382 (4%)
Query: 42 LSLLKEHDARRQQRILAG----VDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGS 97
LS L+ D+ R +R+L VD P+ G+ P VGLYY K+ +GTPP+++YVQ+DTGS
Sbjct: 39 LSELRARDSLRHRRMLQSTNYVVDFPVKGTFDPSQVGLYYTKVKLGTPPREFYVQIDTGS 98
Query: 98 DIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCH-GVYGGPLTDCTANT 156
D++WV+C C CP+ S L I+L +D + SST ++C C GV + + N
Sbjct: 99 DVLWVSCGSCNGCPQTSGLQIQLNYFDPRSSSTSSLISCSDRRCRSGVQTSDASCSSQNN 158
Query: 157 SCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEA 216
C Y YGDGS T+GY+V D++ + + TT+++ S++FGC Q+G+L + +E A
Sbjct: 159 QCTYTFQYGDGSGTSGYYVSDLMHFAGIFEGTLTTNSSASVVFGCSILQTGDL-TKSERA 217
Query: 217 LDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGIN-GGGIFAIGHVVQPEVNKTPLVPN 275
+DGI GFG+ S+ISQL+ G ++F+HCL G N GGG+ +G +V+P + +PLV +
Sbjct: 218 VDGIFGFGQQGMSVISQLSLQGIAPRVFSHCLKGDNSGGGVLVLGEIVEPNIVYSPLVQS 277
Query: 276 QPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQ 335
QPHY++N+ ++ V + + VF +N+GTI+DSGTTLAYL E Y P V+ I +
Sbjct: 278 QPHYNLNLQSISVNGQIVPIAPAVFATSNNRGTIVDSGTTLAYLAEEAYNPFVNAITALV 337
Query: 336 PDLKVHTVHDEYTCFQYSESVD-EGFPNVTFHFENSVSLKVYPHEYLFPFE-----DLWC 389
P + C+ + S + + FP V+ +F SL + P +YL +WC
Sbjct: 338 PQSVRSVLSRGNQCYLITTSSNVDIFPQVSLNFAGGASLVLRPQDYLMQQNYIGEGSVWC 397
Query: 390 IGWQNSGMQSRDRKNMTLLGDF 411
IG+Q QS +T+LGD
Sbjct: 398 IGFQRIPGQS-----ITILGDL 414
>gi|224140237|ref|XP_002323490.1| predicted protein [Populus trichocarpa]
gi|222868120|gb|EEF05251.1| predicted protein [Populus trichocarpa]
Length = 478
Score = 263 bits (673), Expect = 9e-68, Method: Compositional matrix adjust.
Identities = 147/383 (38%), Positives = 219/383 (57%), Gaps = 21/383 (5%)
Query: 42 LSLLKEHDARRQQRILAG-----VDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTG 96
LS L+ D R R+L G VD + GS P VGLY+ K+ +G+PP+++ VQ+DTG
Sbjct: 27 LSQLRARDRLRHARLLQGFVGGVVDFSVQGSPDPYLVGLYFTKVKLGSPPREFNVQIDTG 86
Query: 97 SDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANT 156
SD++WV C C CPR S LGI+L +D SST V C C +T C+ T
Sbjct: 87 SDVLWVCCNSCNNCPRTSGLGIQLNFFDSSSSSTAGLVHCSDPICTSAVQTTVTQCSPQT 146
Query: 157 S-CPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEE 215
+ C Y Y DGS T+GY+V D + +D + G+ +++ ++FGC QSG+L T ++
Sbjct: 147 NQCSYTFQYEDGSGTSGYYVSDTLYFDAILGESLVVNSSALIVFGCSTFQSGDLTMT-DK 205
Query: 216 ALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDG-INGGGIFAIGHVVQPEVNKTPLVP 274
A+DGI GFG+ S+ISQL++ G ++F+HCL G GGGI +G +++P + +PLVP
Sbjct: 206 AVDGIFGFGQGELSVISQLSTHGITPRVFSHCLKGEGIGGGILVLGEILEPGMVYSPLVP 265
Query: 275 NQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQ 334
+QPHY++N+ ++ V L + VF +++GTI+DSGTTLAYL Y+P VS +
Sbjct: 266 SQPHYNLNLQSIAVNGKLLPIDPSVFATSNSQGTIVDSGTTLAYLVAEAYDPFVSAVNVI 325
Query: 335 QPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPF------EDLW 388
+ C+ S SV + FP +F+F S+ + P +YL PF +W
Sbjct: 326 VSPSVTPIISKGNQCYLVSTSVSQMFPLASFNFAGGASMVLKPEDYLIPFGPSQGGSVMW 385
Query: 389 CIGWQNSGMQSRDRKNMTLLGDF 411
CIG+Q + +T+LGD
Sbjct: 386 CIGFQK-------VQGVTILGDL 401
>gi|297849132|ref|XP_002892447.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297338289|gb|EFH68706.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 493
Score = 261 bits (668), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 141/383 (36%), Positives = 216/383 (56%), Gaps = 17/383 (4%)
Query: 39 ERSLSLLKEHDARRQQRILAG-----VDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQV 93
E L+ L+ D+ R R+L V+ P+ G+S P VGLYY K+ +GTPP+++ VQ+
Sbjct: 42 ELGLTELRAFDSARHGRLLQSPVGGVVNFPVDGASDPFLVGLYYTKVKLGTPPREFNVQI 101
Query: 94 DTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCT 153
DTGSD++WV+C C CP+ S L I+L+ +D SS+ V+C C+ + + C+
Sbjct: 102 DTGSDVLWVSCTSCNGCPKTSELQIQLSFFDPGVSSSASLVSCSDRRCYSNFQTE-SGCS 160
Query: 154 ANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTN 213
N C Y YGDGS T+G+++ D + +D V +++ +FGC Q+G+L
Sbjct: 161 PNNLCSYSFKYGDGSGTSGFYISDFMSFDTVITSTLAINSSAPFVFGCSNLQTGDLQRP- 219
Query: 214 EEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDG-INGGGIFAIGHVVQPEVNKTPL 272
A+DGI G G+ + S+ISQLA G ++F+HCL G +GGGI +G + +P+ TPL
Sbjct: 220 RRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCLKGDKSGGGIMVLGQIKRPDTVYTPL 279
Query: 273 VPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKII 332
VP+QPHY++N+ ++ V L + VF + GTIID+GTTLAYLP+ Y P + I
Sbjct: 280 VPSQPHYNVNLQSIAVNGQILPIDPSVFTIATGDGTIIDTGTTLAYLPDEAYSPFIQAIA 339
Query: 333 SQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPF----EDLW 388
+ ++ Y CF+ + + FP V+ F S+ + PH YL F +W
Sbjct: 340 NAVSQYGRPITYESYQCFEITAGDVDVFPEVSLSFAGGASMVLRPHAYLQIFSSSGSSIW 399
Query: 389 CIGWQNSGMQSRDRKNMTLLGDF 411
CIG+Q + +T+LGD
Sbjct: 400 CIGFQR-----MSHRRITILGDL 417
>gi|357510893|ref|XP_003625735.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355500750|gb|AES81953.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 535
Score = 261 bits (666), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 155/437 (35%), Positives = 234/437 (53%), Gaps = 71/437 (16%)
Query: 38 RERSLSLLKEHD-ARRQQRILAG-----VDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYV 91
+ L+ LK D AR RIL +D + G+S P VGLY+ K+ +G+P K++YV
Sbjct: 27 HQVELTTLKARDRARHGGRILQDGGGGILDFSVQGTSDPYLVGLYFTKVKMGSPAKEFYV 86
Query: 92 QVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTD 151
Q+DTGSDI+W+NC C CP+ S LGI+L +D SST V+C C +
Sbjct: 87 QIDTGSDILWLNCNTCNNCPKSSGLGIDLNYFDTASSSTAALVSCSDPVCSYAVQTATSQ 146
Query: 152 CTANTS-CPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLD 210
C++ + C Y YGDGS T+GY+V D + +D + G ++++ +++FGC QSG+L
Sbjct: 147 CSSQANQCSYTFQYGDGSGTSGYYVYDAMYFDVIMGQSVFSNSSSTVVFGCSTYQSGDLA 206
Query: 211 STNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI-NGGGIFAIGHVVQPEVNK 269
T E+A+DGI GFG S++SQ++S G K+F+HCL G +GGGI +G +++P +
Sbjct: 207 RT-EKAVDGIFGFGPGALSVVSQVSSQGMAPKVFSHCLKGQGSGGGILVLGEILEPNIVY 265
Query: 270 TPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVS 329
TPLVP QPHY++N+ ++ V L + DVF G+N+GTI+DSGTTLAYL + Y+P ++
Sbjct: 266 TPLVPLQPHYNLNLQSIAVNGQILPIDQDVFATGNNRGTIVDSGTTLAYLVQEAYDPFLN 325
Query: 330 K----------------------IISQQPDLKVHTVHDEYT------------------- 348
+ Q +K H +DE T
Sbjct: 326 AGSPCHFFTHFNEPTNNIKYEDGNNNHQSRVKRH-YYDEVTLRLVLKHSAIITTTVSQFS 384
Query: 349 ---------CFQYSESVDEGFPNVTFHFENSVSLKVYPHEYL--FPFED---LWCIGWQN 394
C+ S+ + FP V+ +F S+ + P +YL + F D +WCIG+Q
Sbjct: 385 KPIISKGNQCYLVPTSLGDIFPLVSLNFMGGASMVLKPEQYLIHYGFLDGAAMWCIGFQ- 443
Query: 395 SGMQSRDRKNMTLLGDF 411
+ +K T+LGD
Sbjct: 444 -----KVQKGYTILGDL 455
>gi|18390865|ref|NP_563808.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|11993877|gb|AAG42922.1|AF329505_1 unknown protein [Arabidopsis thaliana]
gi|20260142|gb|AAM12969.1| unknown protein [Arabidopsis thaliana]
gi|22136092|gb|AAM91124.1| unknown protein [Arabidopsis thaliana]
gi|332190140|gb|AEE28261.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 492
Score = 260 bits (664), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 141/383 (36%), Positives = 215/383 (56%), Gaps = 17/383 (4%)
Query: 39 ERSLSLLKEHDARRQQRILAG-----VDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQV 93
E L+ L+ D+ R R+L V+ P+ G+S P VGLYY K+ +GTPP+++ VQ+
Sbjct: 42 ELGLTELRAFDSARHGRLLQSPVGGVVNFPVDGASDPFLVGLYYTKVKLGTPPREFNVQI 101
Query: 94 DTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCT 153
DTGSD++WV+C C CP+ S L I+L+ +D SS+ V+C C+ + + C+
Sbjct: 102 DTGSDVLWVSCTSCNGCPKTSELQIQLSFFDPGVSSSASLVSCSDRRCYSNFQTE-SGCS 160
Query: 154 ANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTN 213
N C Y YGDGS T+GY++ D + +D V +++ +FGC QSG+L
Sbjct: 161 PNNLCSYSFKYGDGSGTSGYYISDFMSFDTVITSTLAINSSAPFVFGCSNLQSGDLQRP- 219
Query: 214 EEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDG-INGGGIFAIGHVVQPEVNKTPL 272
A+DGI G G+ + S+ISQLA G ++F+HCL G +GGGI +G + +P+ TPL
Sbjct: 220 RRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCLKGDKSGGGIMVLGQIKRPDTVYTPL 279
Query: 273 VPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKII 332
VP+QPHY++N+ ++ V L + VF + GTIID+GTTLAYLP+ Y P + +
Sbjct: 280 VPSQPHYNVNLQSIAVNGQILPIDPSVFTIATGDGTIIDTGTTLAYLPDEAYSPFIQAVA 339
Query: 333 SQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPF----EDLW 388
+ ++ Y CF+ + + FP V+ F S+ + P YL F +W
Sbjct: 340 NAVSQYGRPITYESYQCFEITAGDVDVFPQVSLSFAGGASMVLGPRAYLQIFSSSGSSIW 399
Query: 389 CIGWQNSGMQSRDRKNMTLLGDF 411
CIG+Q + +T+LGD
Sbjct: 400 CIGFQR-----MSHRRITILGDL 417
>gi|449456068|ref|XP_004145772.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449496218|ref|XP_004160076.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 500
Score = 258 bits (658), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 151/399 (37%), Positives = 220/399 (55%), Gaps = 29/399 (7%)
Query: 24 SNHGVFSVKYRYAGRERSLSLLKEHDARRQQRILAG----VDLPLGGSSRPDGVGLYYAK 79
+NHGV ++ L+ D R R+L +D + G+ P VGLYY +
Sbjct: 39 TNHGV------------EIAHLRSRDRVRHGRMLQSSGGVIDFSVSGTYDPFLVGLYYTR 86
Query: 80 IGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQE 139
+ +G PPKD+YVQ+DTGSD++WV+C C CP S L I L +D S+T V+C +
Sbjct: 87 VQLGNPPKDFYVQIDTGSDVLWVSCNSCNGCPATSGLQIPLNFFDPGSSTTASLVSCSDQ 146
Query: 140 FCH-GVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLI 198
C GV + C Y+ YGDGS T+GY+V D++ D V T++++ S++
Sbjct: 147 ICALGVQSSDSACFGQSNQCAYVFQYGDGSGTSGYYVMDMIHLDVVIDSSVTSNSSASVV 206
Query: 199 FGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGIN-GGGIF 257
FGC Q+G+L + ++ A+DGI GFG+ + S+ISQL+S G K+F+HCL G + GGGI
Sbjct: 207 FGCSTSQTGDL-TKSDRAVDGIFGFGQQDLSVISQLSSRGIAPKVFSHCLKGDDSGGGIL 265
Query: 258 AIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLA 317
+G +V+P V TPLVP+QPHY++N+ ++ V L + VF ++GTIIDSGTTLA
Sbjct: 266 VLGEIVEPNVVYTPLVPSQPHYNLNLQSISVNGQVLPISPAVFATSSSQGTIIDSGTTLA 325
Query: 318 YLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYP 377
YL E Y V + + V C+ S SV + FP V+ +F SL +
Sbjct: 326 YLAEEAYNAFVVAVTNIVSQSTQSVVLKGNRCYVTSSSVSDIFPQVSLNFAGGASLVLGA 385
Query: 378 HEYLFPFED-----LWCIGWQNSGMQSRDRKNMTLLGDF 411
+YL +WCIG+Q Q +T+LGD
Sbjct: 386 QDYLIQQNSVGGTTVWCIGFQKIPGQ-----GITILGDL 419
>gi|302776054|ref|XP_002971323.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
gi|300161305|gb|EFJ27921.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
Length = 395
Score = 256 bits (655), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 150/375 (40%), Positives = 222/375 (59%), Gaps = 15/375 (4%)
Query: 49 DARRQQRILA-GVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQC 107
D R+ R LA GVD LGG++ P GLY+ ++G+G P K Y VQVDTGSD++WVNC C
Sbjct: 1 DRGRRGRFLAEGVDFSLGGTADPLSGGLYFTQVGLGNPVKHYIVQVDTGSDVLWVNCRPC 60
Query: 108 KECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANT-SCPYLEIYGD 166
CPR+S+L I LT+YD ++SST V+C C C+ T +C Y+ YGD
Sbjct: 61 SGCPRKSALNIPLTMYDPRESSTTSLVSCSDPLCVRGRRFAEAQCSQTTNNCEYIFSYGD 120
Query: 167 GSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKS 226
GS++ GY+V+D +QY+ +S + +T ++FGC RQ+G+L ST+++A+DGIIGFG+
Sbjct: 121 GSTSEGYYVRDAMQYNVISSN-GLANTTSQVLFGCSIRQTGDL-STSQQAVDGIIGFGQL 178
Query: 227 NSSMISQLASSGGVRKMFAHCLDG-INGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTA 285
S+ +QLA+ + ++F+HCL+G GGGI IG + +P + TPLVP+ HY++ +
Sbjct: 179 ELSVPNQLAAQQNIPRVFSHCLEGEKRGGGILVIGGIAEPGMTYTPLVPDSVHYNVVLRG 238
Query: 286 VQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHD 345
+ V + L + + F ++ G I+DSGTTLAY P Y V I V
Sbjct: 239 ISVNSNRLPIDAEDFSSTNDTGVIMDSGTTLAYFPSGAYNVFVQAIREATSATPVRVQGM 298
Query: 346 EYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLF-------PFEDLWCIGWQNSGMQ 398
+ CF S + + FPNVT +FE ++++ P YL D+WCIGWQ+S
Sbjct: 299 DTQCFLVSGRLSDLFPNVTLNFEGG-AMELQPDNYLMWGGTAPTGTTDVWCIGWQSSSSS 357
Query: 399 S--RDRKNMTLLGDF 411
+ +D +T+LGD
Sbjct: 358 AGPKDGSQLTILGDI 372
>gi|449482385|ref|XP_004156266.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 491
Score = 254 bits (648), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 141/361 (39%), Positives = 205/361 (56%), Gaps = 18/361 (4%)
Query: 60 VDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIE 119
V+ + GSS P VGLY+ K+ +G P +++ VQ+DTGSDI+WV C C CP S LGIE
Sbjct: 69 VNFSVKGSSNP-FVGLYFTKVKLGNPAREFNVQIDTGSDILWVTCSPCDGCPDSSGLGIE 127
Query: 120 LTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVV 179
L L+D SS+ + + C C V T C Y Y D S T+G++V D +
Sbjct: 128 LNLFDTTKSSSARVLPCTDPICAAVSTTTDQCLTQTDHCSYSFHYRDRSGTSGFYVTDSM 187
Query: 180 QYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGG 239
+D + G+ +++ +++FGC Q G+L +ALDGI GFG+ S+ISQL+S G
Sbjct: 188 HFDILLGESTIANSSATIVFGCSIYQYGDLTRAT-KALDGIFGFGQGEFSVISQLSSRGI 246
Query: 240 VRKMFAHCLD-GINGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTD 298
K+F+HCL G NGGGI +G +++P + +PL+P+QPHY++ + ++ + PT
Sbjct: 247 TPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLKLQSIALSGQLFPNPT- 305
Query: 299 VFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDE 358
+F + + TIIDSGTTLAYL E VY+ +VS I S T+ CF+ S SV +
Sbjct: 306 MFPISNAGETIIDSGTTLAYLVEEVYDWIVSVITSAVSQSATPTISRGSQCFRVSMSVAD 365
Query: 359 GFPNVTFHFENSVSLKVYPHEYL--------FPFEDLWCIGWQNSGMQSRDRKNMTLLGD 410
FP + F+FE S+ V P EYL + F LWCIG+Q + + +LGD
Sbjct: 366 IFPVLRFNFEGIASMVVTPEEYLQFDSIVSCYKFASLWCIGFQ------KAEDGLNILGD 419
Query: 411 F 411
Sbjct: 420 L 420
>gi|225436397|ref|XP_002272121.1| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
vinifera]
Length = 499
Score = 253 bits (647), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 145/381 (38%), Positives = 217/381 (56%), Gaps = 18/381 (4%)
Query: 42 LSLLKEHDARRQQRILAG----VDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGS 97
L LK D R R L VD P+ G+ P VGLY+ ++ +G+PPK++YVQ+DTGS
Sbjct: 45 LDELKARDRVRHGRFLQSSVGVVDFPVEGTYDPYRVGLYFTRVLLGSPPKEFYVQIDTGS 104
Query: 98 DIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCH-GVYGGPLTDCTANT 156
D++WV+C C CP+ S L I L +D SST ++C + C GV +
Sbjct: 105 DVLWVSCGSCNGCPQSSGLHIPLNFFDPGSSSTASLISCSDQRCSLGVQSSDAGCSSQGN 164
Query: 157 SCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEA 216
C Y YGDGS T+GY+V D++ +D + G T+++ S++FGC Q+G+L + ++ A
Sbjct: 165 QCIYTFQYGDGSGTSGYYVSDLLNFDAIVGS-SVTNSSASIVFGCSISQTGDL-TKSDRA 222
Query: 217 LDGIIGFGKSNSSMISQLASSGGVRKMFAHC-LDGINGGGIFAIGHVVQPEVNKTPLVPN 275
+DGI GFG+ + S+ISQ++S G K+F+HC GGGI +G +V+ ++ +PLVP+
Sbjct: 223 VDGIFGFGQQDMSVISQMSSQGITPKVFSHCLKGDGGGGGILVLGEIVEEDIVYSPLVPS 282
Query: 276 QPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQ 335
QPHY++N+ ++ V L + +VF N+GTI+DSGTTLAYL E Y+P VS I
Sbjct: 283 QPHYNLNLQSISVNGKSLAIDPEVFATSTNRGTIVDSGTTLAYLAEEAYDPFVSAITEAV 342
Query: 336 PDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFEDL-----WCI 390
+ C+ + SV FP V+ +F VS+ + P +YL + WCI
Sbjct: 343 SQSVRPLLSKGTQCYLITSSVKGIFPTVSLNFAGGVSMNLKPEDYLLQQNSIGDAAVWCI 402
Query: 391 GWQNSGMQSRDRKNMTLLGDF 411
G+Q Q +T+LGD
Sbjct: 403 GFQKIQGQ-----GITILGDL 418
>gi|297734873|emb|CBI17107.3| unnamed protein product [Vitis vinifera]
Length = 484
Score = 253 bits (646), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 145/381 (38%), Positives = 217/381 (56%), Gaps = 18/381 (4%)
Query: 42 LSLLKEHDARRQQRILAG----VDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGS 97
L LK D R R L VD P+ G+ P VGLY+ ++ +G+PPK++YVQ+DTGS
Sbjct: 30 LDELKARDRVRHGRFLQSSVGVVDFPVEGTYDPYRVGLYFTRVLLGSPPKEFYVQIDTGS 89
Query: 98 DIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCH-GVYGGPLTDCTANT 156
D++WV+C C CP+ S L I L +D SST ++C + C GV +
Sbjct: 90 DVLWVSCGSCNGCPQSSGLHIPLNFFDPGSSSTASLISCSDQRCSLGVQSSDAGCSSQGN 149
Query: 157 SCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEA 216
C Y YGDGS T+GY+V D++ +D + G T+++ S++FGC Q+G+L + ++ A
Sbjct: 150 QCIYTFQYGDGSGTSGYYVSDLLNFDAIVGS-SVTNSSASIVFGCSISQTGDL-TKSDRA 207
Query: 217 LDGIIGFGKSNSSMISQLASSGGVRKMFAHC-LDGINGGGIFAIGHVVQPEVNKTPLVPN 275
+DGI GFG+ + S+ISQ++S G K+F+HC GGGI +G +V+ ++ +PLVP+
Sbjct: 208 VDGIFGFGQQDMSVISQMSSQGITPKVFSHCLKGDGGGGGILVLGEIVEEDIVYSPLVPS 267
Query: 276 QPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQ 335
QPHY++N+ ++ V L + +VF N+GTI+DSGTTLAYL E Y+P VS I
Sbjct: 268 QPHYNLNLQSISVNGKSLAIDPEVFATSTNRGTIVDSGTTLAYLAEEAYDPFVSAITEAV 327
Query: 336 PDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFEDL-----WCI 390
+ C+ + SV FP V+ +F VS+ + P +YL + WCI
Sbjct: 328 SQSVRPLLSKGTQCYLITSSVKGIFPTVSLNFAGGVSMNLKPEDYLLQQNSIGDAAVWCI 387
Query: 391 GWQNSGMQSRDRKNMTLLGDF 411
G+Q Q +T+LGD
Sbjct: 388 GFQKIQGQ-----GITILGDL 403
>gi|302803839|ref|XP_002983672.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
gi|300148509|gb|EFJ15168.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
Length = 388
Score = 252 bits (644), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 148/378 (39%), Positives = 208/378 (55%), Gaps = 27/378 (7%)
Query: 42 LSLLKEHDARRQQRILA-GVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIM 100
+ LLK HD R ++ + V LP+ G + P GLY+ ++ +GTPP+ Y +QVDTGSD++
Sbjct: 1 MQLLKAHDRGRMVKLKSSAVSLPVEGVADPYIAGLYFTQVQLGTPPRTYNLQVDTGSDLL 60
Query: 101 WVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPY 160
WVNC C CP S L I + YD+K S++ V C C + + C C Y
Sbjct: 61 WVNCHPCIGCPAFSDLKIPIVPYDVKASASSSKVPCSDPSCTLITQISESGCNDQNQCGY 120
Query: 161 LEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGI 220
YGDGS T GY V+DV+ Y + ++IFGCG +QSG+L ST+E ALDGI
Sbjct: 121 SFQYGDGSGTLGYLVEDVLHY--------MVNATATVIFGCGFKQSGDL-STSERALDGI 171
Query: 221 IGFGKSNSSMISQLASSGGVRKMFAHCLD-GINGGGIFAIGHVVQPEVNKTPLVPNQPHY 279
IGFG S+ S SQLA G +FAHCLD G GGGI +G+V++P++ TPLVP HY
Sbjct: 172 IGFGASDLSFNSQLAKQGKTPNVFAHCLDGGERGGGILVLGNVIEPDIQYTPLVPYMYHY 231
Query: 280 SINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKI-ISQQPDL 338
++ + ++ V L + +F +GTI DSGTTLAYLP+ Y+ + + P L
Sbjct: 232 NVVLQSISVNNANLTIDPKLFSNDVMQGTIFDSGTTLAYLPDEAYQAFTQAVSLVVAPFL 291
Query: 339 KVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLF-----PFEDLWCIGWQ 393
T + S + + FPNV +FE + S+ + P EYL +WC+GWQ
Sbjct: 292 LCDT--------RLSRFIYKLFPNVVLYFEGA-SMTLTPAEYLIRQASAANAPIWCMGWQ 342
Query: 394 NSGMQSRDRKNMTLLGDF 411
+ G + T+ GD
Sbjct: 343 SMG-SAESELQYTIFGDL 359
>gi|302817726|ref|XP_002990538.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
gi|300141706|gb|EFJ08415.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
Length = 434
Score = 252 bits (643), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 148/378 (39%), Positives = 208/378 (55%), Gaps = 27/378 (7%)
Query: 42 LSLLKEHDARRQQRILA-GVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIM 100
+ LLK HD R ++ + V LP+ G + P GLY+ ++ +GTPP+ Y +QVDTGSD++
Sbjct: 1 MQLLKAHDRGRMVKLKSSAVSLPVEGVADPYIAGLYFTQVQLGTPPRTYNLQVDTGSDLL 60
Query: 101 WVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPY 160
WVNC C CP S L I + YD+K S++ V C C + + C C Y
Sbjct: 61 WVNCHPCIGCPAFSDLKIPIVPYDVKASASSSKVPCSDPSCTLITQISESGCNDQNQCGY 120
Query: 161 LEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGI 220
YGDGS T GY V+DV+ Y + ++IFGCG +QSG+L ST+E ALDGI
Sbjct: 121 SFQYGDGSGTLGYLVEDVLHY--------MVNATATVIFGCGFKQSGDL-STSERALDGI 171
Query: 221 IGFGKSNSSMISQLASSGGVRKMFAHCLD-GINGGGIFAIGHVVQPEVNKTPLVPNQPHY 279
IGFG S+ S SQLA G +FAHCLD G GGGI +G+V++P++ TPLVP HY
Sbjct: 172 IGFGASDLSFNSQLAKQGKTPNVFAHCLDGGERGGGILVLGNVIEPDIQYTPLVPYMSHY 231
Query: 280 SINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKI-ISQQPDL 338
++ + ++ V L + +F +GTI DSGTTLAYLP+ Y+ + + P L
Sbjct: 232 NVVLQSISVNNANLTIDPKLFSNDVMQGTIFDSGTTLAYLPDEAYQAFTQAVSLVVAPFL 291
Query: 339 KVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLF-----PFEDLWCIGWQ 393
T + S + + FPNV +FE + S+ + P EYL +WC+GWQ
Sbjct: 292 LCDT--------RLSRFIYKLFPNVVLYFEGA-SMTLTPAEYLIRQASAANAPIWCMGWQ 342
Query: 394 NSGMQSRDRKNMTLLGDF 411
+ G + T+ GD
Sbjct: 343 SMG-SAESELQYTIFGDL 359
>gi|356537015|ref|XP_003537027.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 476
Score = 251 bits (640), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 149/423 (35%), Positives = 229/423 (54%), Gaps = 31/423 (7%)
Query: 1 MGLCLRNCLCIVLIATAAVGGVSSNHGVFSVKYR----YAGRERSLSLLKEHDARRQQRI 56
M C+ L ++ + +AV HGVF R + R +L AR + +
Sbjct: 1 MRCCIPTLLAVITVLLSAV------HGVFLPLERSIPPTSHRVEVAALRARDRARHARML 54
Query: 57 LAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSL 116
VD + G+S P+ VG+Y G + VQ+DTGSDI+WVNC C CP+ S L
Sbjct: 55 RGVVDFSVQGTSDPNSVGMY------GXXXXXFNVQIDTGSDILWVNCNTCSNCPQSSQL 108
Query: 117 GIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTS-CPYLEIYGDGSSTTGYFV 175
GIEL +D SST + C C G +C+ + C Y YGDGS T+GY+V
Sbjct: 109 GIELNFFDTVGSSTAALIPCSDLICTSGVQGAAAECSPRVNQCSYTFQYGDGSGTSGYYV 168
Query: 176 QDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLA 235
D + ++ + G ++ +++FGC QSG+L T ++A+DGI GFG S++SQL+
Sbjct: 169 SDAMYFNLIMGQPPAVNSTATIVFGCSISQSGDLTKT-DKAVDGIFGFGPGPLSVVSQLS 227
Query: 236 SSGGVRKMFAHCLDGI-NGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLN 294
S G K+F+HCL G NGGGI +G +++P + +PLVP+QPHY++N+ ++ V L
Sbjct: 228 SQGITPKVFSHCLKGDGNGGGILVLGEILEPSIVYSPLVPSQPHYNLNLQSIAVNGQPLP 287
Query: 295 LPTDVFGVGDNK-GTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYS 353
+ VF + +N+ GTI+D GTTLAYL + Y+PLV+ I + T C+ S
Sbjct: 288 INPAVFSISNNRGGTIVDCGTTLAYLIQEAYDPLVTAINTAVSQSARQTNSKGNQCYLVS 347
Query: 354 ESVDEGFPNVTFHFENSVSLKVYPHEYL-----FPFEDLWCIGWQNSGMQSRDRKNMTLL 408
S+ + FP V+ +FE S+ + P +YL ++WC+G+Q + ++ ++L
Sbjct: 348 TSIGDIFPLVSLNFEGGASMVLKPEQYLMHNGYLDGAEMWCVGFQ------KLQEGASIL 401
Query: 409 GDF 411
GD
Sbjct: 402 GDL 404
>gi|449451076|ref|XP_004143288.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 488
Score = 249 bits (637), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 141/359 (39%), Positives = 205/359 (57%), Gaps = 17/359 (4%)
Query: 60 VDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIE 119
V+ + GSS P VGLY+ K+ +G P +++ VQ+DTGSDI+WV C C CP S LGIE
Sbjct: 69 VNFSVKGSSNP-FVGLYFTKVKLGNPAREFNVQIDTGSDILWVTCSPCDGCPDSSGLGIE 127
Query: 120 LTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVV 179
L L+D SS+ + + C C V T C Y Y D S T+G++V D +
Sbjct: 128 LNLFDTTKSSSARVLPCTDPICAAVSTTTDQCLTQTDHCSYSFHYRDRSGTSGFYVTDSM 187
Query: 180 QYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGG 239
+D + G+ +++ +++FGC Q G+L +ALDGI GFG+ S+ISQL+S G
Sbjct: 188 HFDILLGESTIANSSATIVFGCSIYQYGDLTRAT-KALDGIFGFGQGEFSVISQLSSRGI 246
Query: 240 VRKMFAHCLD-GINGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTD 298
K+F+HCL G NGGGI +G +++P + +PL+P+QPHY++ + ++ + PT
Sbjct: 247 TPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLKLQSIALSGQLFPNPT- 305
Query: 299 VFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDE 358
+F + + TIIDSGTTLAYL E VY+ +VS I S T+ CF+ S SV +
Sbjct: 306 MFPISNAGETIIDSGTTLAYLVEEVYDWIVSVITSAVSQSATPTISRGSQCFRVSMSVAD 365
Query: 359 GFPNVTFHFENSVSLKVYPHEYLFPFED------LWCIGWQNSGMQSRDRKNMTLLGDF 411
FP + F+FE S+ V P EYL F+ LWCIG+Q + + +LGD
Sbjct: 366 IFPVLRFNFEGIASMVVTPEEYL-QFDSIVREPALWCIGFQ------KAEDGLNILGDL 417
>gi|297720449|ref|NP_001172586.1| Os01g0776900 [Oryza sativa Japonica Group]
gi|255673740|dbj|BAH91316.1| Os01g0776900 [Oryza sativa Japonica Group]
Length = 381
Score = 244 bits (624), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 129/326 (39%), Positives = 190/326 (58%), Gaps = 4/326 (1%)
Query: 38 RERSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGS 97
R+R+ + + VD P+ GS+ P VGLY+ ++ +G+PPK+Y+VQ+DTGS
Sbjct: 53 RDRARHGRRGLLGGGGGGVAGVVDFPVEGSANPFMVGLYFTRVKLGSPPKEYFVQIDTGS 112
Query: 98 DIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDC--TAN 155
DI+WV C C CP S L I+L ++ SST + C + C C + N
Sbjct: 113 DILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKIPCSDDRCTAALQTSEAVCQTSDN 172
Query: 156 TSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEE 215
+ C Y YGDGS T+GY+V D + +D V G+ QT +++ S++FGC QSG+L T +
Sbjct: 173 SPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQTANSSASIVFGCSNSQSGDLTKT-DR 231
Query: 216 ALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI-NGGGIFAIGHVVQPEVNKTPLVP 274
A+DGI GFG+ S++SQL S G K+F+HCL G NGGGI +G +V+P + TPLVP
Sbjct: 232 AVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLKGSDNGGGILVLGEIVEPGLVYTPLVP 291
Query: 275 NQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQ 334
+QPHY++N+ ++ V L + + +F + +GTI+DSGTTLAYL + Y+P V+ I +
Sbjct: 292 SQPHYNLNLESIVVNGQKLPIDSSLFTTSNTQGTIVDSGTTLAYLADGAYDPFVNAITAA 351
Query: 335 QPDLKVHTVHDEYTCFQYSESVDEGF 360
V CF S + F
Sbjct: 352 VSPSVRSLVSKGNQCFVTSSRLASCF 377
>gi|168042409|ref|XP_001773681.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675069|gb|EDQ61569.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 399
Score = 243 bits (620), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 150/375 (40%), Positives = 206/375 (54%), Gaps = 20/375 (5%)
Query: 45 LKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNC 104
LK HD RR + A VD PL G P GLYY KI +GTPP YYVQVDTGSD+ W+NC
Sbjct: 9 LKAHDRRR---LAAVVDFPLTGDDDPFVTGLYYTKIYLGTPPVGYYVQVDTGSDVTWLNC 65
Query: 105 IQCKECPRRSSL-GIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEI 163
C C + L I+LT YD SST ++C C G CT+ C Y
Sbjct: 66 APCTSCVTETQLPSIKLTTYDPSRSSTDGALSCRDSNCGAALGSNEVSCTSAGYCAYSTT 125
Query: 164 YGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGF 223
YGDGSST GYF+QDV+ + ++ + Q T S+ FGCG QSGNL + ALDG+IGF
Sbjct: 126 YGDGSSTQGYFIQDVMTFQEIHNNTQVNGT-ASVYFGCGTTQSGNL-LMSSRALDGLIGF 183
Query: 224 GKSNSSMISQLASSGGVRKMFAHCLDGIN-GGGIFAIGHVVQPEVNKTPLVPNQPHYSIN 282
G++ S+ SQLAS G V FAHCL G N GGG IG V +P ++ TP+V ++ HY++
Sbjct: 184 GQAAVSIPSQLASMGKVGNRFAHCLQGDNQGGGTIVIGSVSEPNISYTPIV-SRNHYAVG 242
Query: 283 MTAVQVGLDFLNLPTDVFGVGDNK-GTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVH 341
M + V + P + G I+DSGTTLAYL + Y V+ + + + +
Sbjct: 243 MQNIAVNGRNVTTPASFDTTSTSAGGVIMDSGTTLAYLVDPAYTQFVNAVSTFESSM--F 300
Query: 342 TVHDEYTCFQYSE-SVDEGFPNVTFHFENSVSLKVYPHEYLF--PFED---LWCIGWQNS 395
+ H + C Q + S+ FP V F+ + + P YL+ P ++ +C+GWQ S
Sbjct: 301 SSHSQ--CLQLAWCSLQADFPTVKLFFDAGAVMNLTPRNYLYSQPLQNGQAAYCMGWQKS 358
Query: 396 GMQSRDRKNMTLLGD 410
++ + ++LGD
Sbjct: 359 TTKA-GYLSYSILGD 372
>gi|302756119|ref|XP_002961483.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
gi|300170142|gb|EFJ36743.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
Length = 388
Score = 239 bits (610), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 137/348 (39%), Positives = 206/348 (59%), Gaps = 14/348 (4%)
Query: 75 LYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFV 134
LY+ ++G+G P K Y VQVDTGSD++WVNC C CPR+S+L I LT+YD ++SST V
Sbjct: 1 LYFTQVGLGNPVKHYIVQVDTGSDVLWVNCRPCSGCPRKSALNIPLTMYDPRESSTTSLV 60
Query: 135 TCDQEFCHGVYGGPLTDCT-ANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTST 193
+C C C+ A +C Y+ YGDGS++ GY+V+D +QY+ +S + +T
Sbjct: 61 SCSDPLCVRGRRFAEAQCSQATNNCEYIFSYGDGSTSEGYYVRDAMQYNVISSN-GLANT 119
Query: 194 NGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDG-IN 252
++FGC RQ+G+L ST+++A+DGIIGFG+ S+ +QLA+ + ++F+HCL+G
Sbjct: 120 TSQVLFGCSIRQTGDL-STSQQAVDGIIGFGQLELSVPNQLAAQQNIPRVFSHCLEGEKR 178
Query: 253 GGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDS 312
GGGI IG + +P + TPLVP+ HY++ + + V + L + + F ++ G I+DS
Sbjct: 179 GGGILVIGGIAEPGMTYTPLVPDSVHYNVVLRGISVNSNRLPIDAEDFSSTNDTGVIMDS 238
Query: 313 GTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVS 372
GTTLAY P Y V I V + CF S + + FPNVT +FE +
Sbjct: 239 GTTLAYFPSGAYNVFVQAIREATSATPVRVQGMDTQCFLVSGRLSDLFPNVTLNFEGG-A 297
Query: 373 LKVYPHEYLF-------PFEDLWCIGWQNSGMQS--RDRKNMTLLGDF 411
+++ P YL D+WCIGWQ+S + +D +T+LGD
Sbjct: 298 MELQPDNYLMWGGTAPTGTTDVWCIGWQSSSSSAGPKDGSQLTILGDI 345
>gi|168064205|ref|XP_001784055.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664441|gb|EDQ51161.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 459
Score = 234 bits (598), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 130/344 (37%), Positives = 198/344 (57%), Gaps = 24/344 (6%)
Query: 45 LKEHDARRQQRILAGV-DLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVN 103
L+EHD RR +RIL V P+ G GLYY +I +GTPP+ +YV VDTGSD+ WVN
Sbjct: 16 LREHDQRRLRRILPEVVAFPISGDDDTFTTGLYYTRIYLGTPPQQFYVHVDTGSDVAWVN 75
Query: 104 CIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANT-SCPYLE 162
C+ C C R S++ + ++++D + S++ ++C E C Y + C+ N+ SCPY
Sbjct: 76 CVPCTNCKRASNVALPISIFDPEKSTSKTSISCTDEEC---YLASNSKCSFNSMSCPYST 132
Query: 163 IYGDGSSTTGYFVQDVVQYDKV-SGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGII 221
+YGDGSST GY + DV+ +++V SG+ TS L FGCG+ Q+G + DG++
Sbjct: 133 LYGDGSSTAGYLINDVLSFNQVPSGNSTATSGTARLTFGCGSNQTGTWLT------DGLV 186
Query: 222 GFGKSNSSMISQLASSGGVRKMFAHCLDGIN-GGGIFAIGHVVQPEVNKTPLVPNQPHYS 280
GFG++ S+ SQL+ +FAHCL G N G G IGH+ +P + TP+VP Q HY+
Sbjct: 187 GFGQAEVSLPSQLSKQNVSVNIFAHCLQGDNKGSGTLVIGHIREPGLVYTPIVPKQSHYN 246
Query: 281 INMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKI--ISQQPDL 338
+ + + V + PT F + ++ G I+DSGTTL YL + Y+ +K+ + L
Sbjct: 247 VELLNIGVSGTNVTTPT-AFDLSNSGGVIMDSGTTLTYLVQPAYDQFQAKVRDCMRSGVL 305
Query: 339 KVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLF 382
V FQ+ +++ FPNVT +F ++ + P YL+
Sbjct: 306 PV--------AFQFFCTIEGYFPNVTLYFAGGAAMLLSPSSYLY 341
>gi|6579210|gb|AAF18253.1|AC011438_15 T23G18.7 [Arabidopsis thaliana]
Length = 566
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 117/299 (39%), Positives = 171/299 (57%), Gaps = 29/299 (9%)
Query: 39 ERSLSLLKEHDARRQQRILAG-----VDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQV 93
E L+ L+ D+ R R+L V+ P+ G+S P VGLYY K+ +GTPP+++ VQ+
Sbjct: 90 ELGLTELRAFDSARHGRLLQSPVGGVVNFPVDGASDPFLVGLYYTKVKLGTPPREFNVQI 149
Query: 94 DTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCT 153
DTGSD++WV+C C CP+ S L I+L+ +D SS+ V+C C+ + + C+
Sbjct: 150 DTGSDVLWVSCTSCNGCPKTSELQIQLSFFDPGVSSSASLVSCSDRRCYSNFQTE-SGCS 208
Query: 154 ANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTN 213
N C Y YGDGS T+GY++ D F C QSG+L
Sbjct: 209 PNNLCSYSFKYGDGSGTSGYYISD---------------------FMCSNLQSGDLQRP- 246
Query: 214 EEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDG-INGGGIFAIGHVVQPEVNKTPL 272
A+DGI G G+ + S+ISQLA G ++F+HCL G +GGGI +G + +P+ TPL
Sbjct: 247 RRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCLKGDKSGGGIMVLGQIKRPDTVYTPL 306
Query: 273 VPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKI 331
VP+QPHY++N+ ++ V L + VF + GTIID+GTTLAYLP+ Y P + +
Sbjct: 307 VPSQPHYNVNLQSIAVNGQILPIDPSVFTIATGDGTIIDTGTTLAYLPDEAYSPFIQAV 365
Score = 40.8 bits (94), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 21/72 (29%), Positives = 34/72 (47%), Gaps = 9/72 (12%)
Query: 344 HDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPF----EDLWCIGWQNSGMQS 399
++ Y CF+ + + FP V+ F S+ + P YL F +WCIG+Q
Sbjct: 445 YESYQCFEITAGDVDVFPQVSLSFAGGASMVLGPRAYLQIFSSSGSSIWCIGFQR----- 499
Query: 400 RDRKNMTLLGDF 411
+ +T+LGD
Sbjct: 500 MSHRRITILGDL 511
>gi|255637574|gb|ACU19113.1| unknown [Glycine max]
Length = 290
Score = 205 bits (522), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 107/253 (42%), Positives = 161/253 (63%), Gaps = 7/253 (2%)
Query: 42 LSLLKEHDARRQQRILAG----VDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGS 97
LS L+ D+ R +R+L VD P+ G+ P VGLYY K+ +GTPP++ YVQ+DTGS
Sbjct: 39 LSELRARDSLRHRRMLQSTNYVVDFPVKGTFDPSQVGLYYTKVKLGTPPRELYVQIDTGS 98
Query: 98 DIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCH-GVYGGPLTDCTANT 156
D++WV+C C CP+ S L I+L +D SST ++C C GV + N
Sbjct: 99 DVLWVSCGSCNGCPQTSGLQIQLNYFDPGSSSTSSLISCLDRRCRSGVQTSDASCSGRNN 158
Query: 157 SCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEA 216
C Y YGDGS T+GY+V D++ + + TT+++ S++FGC Q+G+L + +E A
Sbjct: 159 QCTYTFQYGDGSGTSGYYVSDLMHFASIFEGTLTTNSSASVVFGCSILQTGDL-TKSERA 217
Query: 217 LDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGIN-GGGIFAIGHVVQPEVNKTPLVPN 275
+DGI GFG+ S+ISQL+S G ++F+HCL G N GGG+ +G +V+P + +PLVP+
Sbjct: 218 VDGIFGFGQQGMSVISQLSSQGIAPRVFSHCLKGDNSGGGVLVLGEIVEPNIVYSPLVPS 277
Query: 276 QPHYSINMTAVQV 288
QPHY++N+ ++ V
Sbjct: 278 QPHYNLNLQSISV 290
>gi|168022164|ref|XP_001763610.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685103|gb|EDQ71500.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 308
Score = 191 bits (484), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 110/302 (36%), Positives = 168/302 (55%), Gaps = 15/302 (4%)
Query: 45 LKEHDARRQQRILAGV-DLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVN 103
L++HD RR +R+L V P+ G + +GLYY +I +GTPP+ +YV VDTGS++ WV
Sbjct: 9 LRKHDQRRLRRMLPEVVSFPISGDNDIFAMGLYYTRISLGTPPQQFYVDVDTGSNVAWVK 68
Query: 104 CIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEI 163
C C C + + ++ +D + S+T ++C C GV L SCPY +
Sbjct: 69 CAPCTGCEHSGDVPVPMSTFDPRKSTTKISISCTDAEC-GVLNKKLQCSPERLSCPYSLL 127
Query: 164 YGDGSSTTGYFVQDVVQYDKVSGDLQTT-STNGSLIFGCGARQSGNLDSTNEEALDGIIG 222
YGDGSST GY++ DV +++V D T S L+FGCG Q+G+ ++DG++G
Sbjct: 128 YGDGSSTAGYYLNDVFTFNQVPSDNSTAKSGTARLVFGCGGTQTGSW------SVDGLLG 181
Query: 223 FGKSNSSMISQLASSGGVRKMFAHCLDG-INGGGIFAIGHVVQPEVNKTPLVPNQPHYSI 281
FG + S+ +QLA +FAHCL G ++G G IG + +P++ TP+V + HY++
Sbjct: 182 FGPTTVSLPNQLAQQNISVNIFAHCLQGDVSGRGSLVIGTIREPDLVYTPMVFGEDHYNV 241
Query: 282 NMTAVQVGLDFLNLPTDV-FGVGDNKGTIIDSGTTLAYLPEMVYEPLVS--KIISQQPDL 338
+ + +G+ N+ T F + G IIDSGTTL YL + Y+ + Q DL
Sbjct: 242 QL--LNIGISGRNVTTPASFDLEYTGGVIIDSGTTLTYLVQPAYDEFRRGVSVFKQSSDL 299
Query: 339 KV 340
V
Sbjct: 300 AV 301
>gi|357168204|ref|XP_003581534.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
2-like [Brachypodium distachyon]
Length = 436
Score = 190 bits (483), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 131/386 (33%), Positives = 197/386 (51%), Gaps = 44/386 (11%)
Query: 37 GRERSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTG 96
G E L ++ A ++Q+ + G L + P GLY + +G P + YY+ TG
Sbjct: 44 GVEELSELDRKRFAAKKQQGVTGFVL----EAMP---GLYCITVKLGNPSRHYYLAFHTG 96
Query: 97 SDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANT 156
SD+MWV C C +CP +G L LYD K+SST ++C + C C +
Sbjct: 97 SDVMWVPCSSCTDCPTPDDIGFSLDLYDPKNSSTSSEISCSDDRCADALKTGHAICHTSH 156
Query: 157 S----CPYLEIYGDGS-STTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDS 211
S C Y +IY DG +TTGY+V D + +D G+ S++ S+IFGC +SG+L +
Sbjct: 157 SSGDQCGYNQIYADGVLATTGYYVSDDIHFDIFMGNESFASSSASVIFGCSKSRSGHLQA 216
Query: 212 TNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL-DGINGGGIFAIGHVVQPEVNKT 270
DG+IGFGK S+ISQL +S GV F+ CL D +GGG+ + V +P + T
Sbjct: 217 ------DGVIGFGKDAPSLISQL-NSQGVSHAFSRCLDDSDDGGGVLILDEVGEPGLEFT 269
Query: 271 PLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSK 330
LV ++P Y++NM ++ V + + + +F +GT +DSGT+LAY P+ VY+P++
Sbjct: 270 SLVASRPCYNLNMKSIAVNNQNVPIDSSLFTTSSTQGTFLDSGTSLAYFPDGVYDPVIRA 329
Query: 331 IISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYL-----FPFE 385
I+ +S FP VT +FE ++KV P YL + +
Sbjct: 330 IL----------------FIYFSTRSFSSFPTVTXYFEGGAAMKVGPENYLLRRGSYDND 373
Query: 386 DLWCIGWQNSGMQSRDRKNMTLLGDF 411
CI +Q S D K T+LGD
Sbjct: 374 SYMCIAFQRS---EGDYKQTTILGDL 396
>gi|217073142|gb|ACJ84930.1| unknown [Medicago truncatula]
Length = 191
Score = 190 bits (482), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 83/160 (51%), Positives = 114/160 (71%)
Query: 38 RERSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGS 97
R+ +LS +K HD R+ R L+ VD LGG+ P GLY+ K+G+G+P KDYYVQVDTGS
Sbjct: 32 RKTTLSGIKHHDHHRRGRFLSSVDFNLGGNGLPTRTGLYFTKLGLGSPKKDYYVQVDTGS 91
Query: 98 DIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTS 157
DI+WVNC++C CP +S +G++LTLYD K S T + ++CD EFC Y GP+ C A T
Sbjct: 92 DILWVNCVECSRCPTKSQIGMDLTLYDPKGSHTSELISCDHEFCSSTYDGPIPGCRAETP 151
Query: 158 CPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSL 197
CPY YGDGS+TTGY+V+D + +D+++G+L T N S+
Sbjct: 152 CPYSITYGDGSATTGYYVRDYLTFDRINGNLHTAPQNSSI 191
>gi|413952262|gb|AFW84911.1| hypothetical protein ZEAMMB73_904583 [Zea mays]
Length = 312
Score = 180 bits (457), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 99/237 (41%), Positives = 140/237 (59%), Gaps = 12/237 (5%)
Query: 181 YDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGV 240
++ V G+ QT +++ S++FGC QSG+L + + A+DGI GFG+ S+ISQL S G
Sbjct: 3 FETVMGNEQTANSSASIVFGCSNSQSGDL-TKADRAVDGIFGFGQHQLSVISQLNSLGVS 61
Query: 241 RKMFAHCLDGI-NGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDV 299
K+F+HCL G NGGGI +G +V+P + TPLVP+QPHY++N+ ++ V L + + +
Sbjct: 62 PKVFSHCLKGSDNGGGILVLGEIVEPGLVYTPLVPSQPHYNLNLESIAVNGQKLPIDSSL 121
Query: 300 FGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEG 359
F + +GTI+DSGTTLAYL + Y+P VS I + V CF S SVD
Sbjct: 122 FTTSNTQGTIVDSGTTLAYLADGAYDPFVSAIAAAVSPSVRSLVSKGSQCFITSSSVDSS 181
Query: 360 FPNVTFHFENSVSLKVYPHEYLFPFED-----LWCIGWQNSGMQSRDRKNMTLLGDF 411
FP VT +F V++ V P YL LWCIGWQ + Q +T+LGD
Sbjct: 182 FPTVTLYFMGGVAMSVKPENYLLQQASVDNSVLWCIGWQRNQGQ-----EITILGDL 233
>gi|125589905|gb|EAZ30255.1| hypothetical protein OsJ_14305 [Oryza sativa Japonica Group]
Length = 213
Score = 169 bits (427), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 78/176 (44%), Positives = 120/176 (68%), Gaps = 5/176 (2%)
Query: 237 SGGVRKMFAHCLDGINGGGIFAIGHVVQPEVNKTPLVPNQPHYS-INMTAVQVGLDFLNL 295
+G +K+F+HCLD NGGGIFAIG VV+P+V TP+V N Y +N+ ++ V L L
Sbjct: 5 AGKTKKIFSHCLDSTNGGGIFAIGEVVEPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQL 64
Query: 296 PTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSES 355
P ++FG KGT IDSG+TL YLPE++Y L+ + ++ PD+ + +++ + CF + S
Sbjct: 65 PANIFGTTKTKGTFIDSGSTLVYLPEIIYSELILAVFAKHPDITMGAMYN-FQCFHFLGS 123
Query: 356 VDEGFPNVTFHFENSVSLKVYPHEYLFPFE-DLWCIGWQNSGMQSRDRKNMTLLGD 410
VD+ FP +TFHFEN ++L VYP++YL +E + +C G+Q++G+ K+M +LGD
Sbjct: 124 VDDKFPKITFHFENDLTLDVYPYDYLLEYEGNQYCFGFQDAGIHG--YKDMIILGD 177
>gi|302757345|ref|XP_002962096.1| hypothetical protein SELMODRAFT_403622 [Selaginella moellendorffii]
gi|300170755|gb|EFJ37356.1| hypothetical protein SELMODRAFT_403622 [Selaginella moellendorffii]
Length = 506
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 138/441 (31%), Positives = 194/441 (43%), Gaps = 71/441 (16%)
Query: 14 IATAAVGGVSSNHGVFSVKYRYAGRERS-------------LSLLKEHDARRQQRILAGV 60
+A V V+ GV +K+R++ E S L +H R +R L V
Sbjct: 15 VALGPVSKVTCGSGVLKLKHRFSELEGSSKQSGKRGMSEEHFRQLMDHTRARSRRFLLEV 74
Query: 61 DLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGI-- 118
DL L GSS D YYA+IG+G P + VDTGSDI+W C C+ C + ++ +
Sbjct: 75 DLMLNGSSTSDAT--YYAQIGVGHPVQFLNAIVDTGSDILWFKCKLCQGCSSKKNVIVCS 132
Query: 119 ------ELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTA-NTSCPYLEIYGDGSSTT 171
+TLYD + S T TC C GG C N SC Y Y D SS+T
Sbjct: 133 SIIMQGPITLYDPELSITASPATCSDPLCS--EGG---SCRGNNNSCAYDISYEDTSSST 187
Query: 172 GYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMI 231
G + +DVV S N ++ GC SG +DGI+GFG+S S+
Sbjct: 188 GIYFRDVVHLG------HKASLNTTMFLGCATSISGLW------PVDGIMGFGRSKVSVP 235
Query: 232 SQLASSGGVRKMFAHCLDG-INGGGIFAIGHVVQ-PEVNKTPLVPNQPHYSINMTAVQVG 289
+QLA+ G +F HCL G GGGI +G + PE+ TP++ N Y++ + ++ V
Sbjct: 236 NQLAAQAGSYNIFYHCLSGEKEGGGILVLGKNDEFPEMVYTPMLANDIVYNVKLVSLSVN 295
Query: 290 LDFLNLPTDVF---GVGDNKGTIIDSGTTLAYLPE---MVYEPLVSKIISQQPDLKVHTV 343
L + F N GTIIDSGT+ A P ++ VSK + P + +
Sbjct: 296 SKALPIEASEFEYNATVGNGGTIIDSGTSSATFPSKALALFVKAVSKFTTAIPTAPLESS 355
Query: 344 HDE-YTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYL-------------FPFEDLWC 389
+ SV+ FPNVT F+ ++++ H YL F L C
Sbjct: 356 GSPCFISISDRNSVEVDFPNVTLKFDGGATMELTAHNYLEAVVSRKLSESTHFQGVRLVC 415
Query: 390 IGWQNSGMQSRDRKNMTLLGD 410
I W N T+LGD
Sbjct: 416 ISWSVG--------NSTILGD 428
>gi|388517377|gb|AFK46750.1| unknown [Lotus japonicus]
Length = 210
Score = 165 bits (418), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 73/137 (53%), Positives = 104/137 (75%), Gaps = 2/137 (1%)
Query: 277 PHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQP 336
HY++ + ++V D L LP+D F + KGT+IDSGTTLAYLP +VY+ L+SK++++QP
Sbjct: 2 AHYNVILKNIEVDGDILQLPSDTFDSENGKGTVIDSGTTLAYLPRIVYDQLMSKVLAKQP 61
Query: 337 DLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPF--EDLWCIGWQN 394
LKV+ V ++Y+CFQY+ +VD GFP V HFE+S+SL VYPH+YLF + + WCIGWQ
Sbjct: 62 RLKVYLVEEQYSCFQYTGNVDSGFPIVKLHFEDSLSLTVYPHDYLFNYKGDSYWCIGWQK 121
Query: 395 SGMQSRDRKNMTLLGDF 411
S ++++ K+MTLLGDF
Sbjct: 122 SASETKNGKDMTLLGDF 138
>gi|413952261|gb|AFW84910.1| hypothetical protein ZEAMMB73_904583 [Zea mays]
Length = 298
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 92/217 (42%), Positives = 125/217 (57%), Gaps = 12/217 (5%)
Query: 201 CGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI-NGGGIFAI 259
C QSG+L + + A+DGI GFG+ S+ISQL S G K+F+HCL G NGGGI +
Sbjct: 9 CSNSQSGDL-TKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCLKGSDNGGGILVL 67
Query: 260 GHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYL 319
G +V+P + TPLVP+QPHY++N+ ++ V L + + +F + +GTI+DSGTTLAYL
Sbjct: 68 GEIVEPGLVYTPLVPSQPHYNLNLESIAVNGQKLPIDSSLFTTSNTQGTIVDSGTTLAYL 127
Query: 320 PEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHE 379
+ Y+P VS I + V CF S SVD FP VT +F V++ V P
Sbjct: 128 ADGAYDPFVSAIAAAVSPSVRSLVSKGSQCFITSSSVDSSFPTVTLYFMGGVAMSVKPEN 187
Query: 380 YLFPFED-----LWCIGWQNSGMQSRDRKNMTLLGDF 411
YL LWCIGWQ + Q +T+LGD
Sbjct: 188 YLLQQASVDNSVLWCIGWQRNQGQ-----EITILGDL 219
>gi|46275851|gb|AAS86401.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 197
Score = 155 bits (393), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 76/191 (39%), Positives = 114/191 (59%), Gaps = 3/191 (1%)
Query: 221 IGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHVVQPEVNKTPLVPNQPHYS 280
+G G SN+S++ QLA S +KMFAHCLDG GGIF +GH+V P+V KTPL Y
Sbjct: 1 MGLGPSNTSLVYQLAKSQKWKKMFAHCLDGKRSGGIFVLGHIVGPKVRKTPLDQTSSRYR 60
Query: 281 INMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKV 340
+ + VG L+L + TI+++G+ ++YLPE VY+ + I S D+ V
Sbjct: 61 TTLLEITVGETSLSLSAGNVEIKSQNMTILETGSLISYLPEKVYQSFLDSIFSDLEDISV 120
Query: 341 HTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFP--FEDLWCIGWQNSGMQ 398
+ Y+CF Y S+D FP V FHF+ ++L+VYPHEY+F E +C+G+ +S +
Sbjct: 121 INI-GGYSCFHYERSIDARFPEVVFHFKELLTLRVYPHEYMFHNMEEHYYCLGFLSSEQR 179
Query: 399 SRDRKNMTLLG 409
+ K++ +LG
Sbjct: 180 NHREKDLFILG 190
>gi|302783112|ref|XP_002973329.1| hypothetical protein SELMODRAFT_413603 [Selaginella moellendorffii]
gi|300159082|gb|EFJ25703.1| hypothetical protein SELMODRAFT_413603 [Selaginella moellendorffii]
Length = 437
Score = 155 bits (391), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 119/398 (29%), Positives = 192/398 (48%), Gaps = 47/398 (11%)
Query: 28 VFSVKYRYAGRERS---------------LSLLKEHDARRQQRILAGVDLPLGGSSRPDG 72
+ +++RY+G E S L L EH+ RR R L G+ PL G+
Sbjct: 23 ILKLQHRYSGLEGSSKQNEKLGLGMSKHHLQHLVEHNDRRG-RFLQGISFPLKGNY--SD 79
Query: 73 VGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGK 132
+GLYY +IG+G P + V VDTGSDI+WV C C+ C + + L++Y++ SST
Sbjct: 80 LGLYYTEIGLGNPVQKLKVIVDTGSDILWVKCSPCRSCLSKQDIIPPLSIYNLSASSTSS 139
Query: 133 FVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTS 192
+C C G + +N++C Y Y D S++ G +V+D + Y G+ +
Sbjct: 140 VSSCSDPLCTGEQ-AVCSRSGSNSACAYGISYQDKSTSIGAYVKDDMHYVLQGGN----A 194
Query: 193 TNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDG-I 251
T + FGC +G+ + DGI+GFG+ + ++ +Q+A+ + ++F+HCL G
Sbjct: 195 TTSHIFFGCAINITGSWPA------DGIMGFGQISKTVPNQIATQRNMSRVFSHCLGGEK 248
Query: 252 NGGGIFAIGHVVQPEVNK---TPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNK-- 306
+GGGI G +P + TPL+ HY++++ ++ V L + + F N
Sbjct: 249 HGGGILEFGE--EPNTTEMVFTPLLNVTTHYNVDLLSISVNSKVLPIDSKEFSYVSNSTN 306
Query: 307 --GTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSE--SVDEGFPN 362
G IIDSGT+ A L L S+ I K+ + CF +V+ FPN
Sbjct: 307 ETGVIIDSGTSFALLATKANRILFSE-IKNLTTAKLGPKLEGLQCFYLKSGLTVETSFPN 365
Query: 363 VTFHFENSVSLKVYPHEYLFPFE-----DLWCIGWQNS 395
VT F ++K+ P YL E + +C W ++
Sbjct: 366 VTLTFSGGSTMKLKPDNYLVMVELKKKRNGYCYAWSSA 403
>gi|217073140|gb|ACJ84929.1| unknown [Medicago truncatula]
Length = 198
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 70/135 (51%), Positives = 100/135 (74%), Gaps = 1/135 (0%)
Query: 278 HYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPD 337
HY++ + ++V D L LP+D+F G+ KGT+IDSGTTLAYLP +VY+ L+ KI ++QP+
Sbjct: 3 HYNVVLKNIEVDGDVLQLPSDIFDSGNGKGTVIDSGTTLAYLPVIVYDQLIPKIFARQPE 62
Query: 338 LKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFED-LWCIGWQNSG 396
LK+ + +++ CF Y+ +VD GFP V HFE S+SL VYPH+YLF ++ + CIGWQ S
Sbjct: 63 LKLARIEEQFKCFPYAGNVDGGFPVVKLHFEGSLSLTVYPHDYLFQYKAGVRCIGWQKSV 122
Query: 397 MQSRDRKNMTLLGDF 411
Q++D K+MTLLGD
Sbjct: 123 TQTKDGKDMTLLGDL 137
>gi|302789618|ref|XP_002976577.1| hypothetical protein SELMODRAFT_416663 [Selaginella moellendorffii]
gi|300155615|gb|EFJ22246.1| hypothetical protein SELMODRAFT_416663 [Selaginella moellendorffii]
Length = 437
Score = 152 bits (385), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 119/399 (29%), Positives = 190/399 (47%), Gaps = 49/399 (12%)
Query: 28 VFSVKYRYAGRERS---------------LSLLKEHDARRQQRILAGVDLPLGGSSRPDG 72
+ +++RY+G E S L L EH+ RR R L G+ PL G+
Sbjct: 23 ILKLQHRYSGLEGSSKQNEKLGLGMSKQHLQHLVEHNDRRG-RFLQGISFPLKGNY--SD 79
Query: 73 VGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGK 132
+GLYY +IG+G P + V VDTGSDI+WV C C+ C + + L++Y++ SST
Sbjct: 80 LGLYYTEIGLGNPVQKLKVIVDTGSDILWVKCSPCRSCLSKQDIIPPLSIYNLSASSTSS 139
Query: 133 FVTCDQEFCHGVYGGPLTDCTA---NTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQ 189
+C C G C+ N++C Y+ Y D S++ G +V+D + Y G+
Sbjct: 140 VSSCSDPLCT----GEEVVCSRSGNNSACAYVSSYQDKSASVGAYVRDDMHYVLHGGN-- 193
Query: 190 TTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLD 249
+T + FGC +G+ +DGI+GFG + ++ +Q+A+ + ++F+HCL
Sbjct: 194 --ATTSRIFFGCATNITGSW------PVDGIMGFGLISKTVPNQIATQRNMSRVFSHCLG 245
Query: 250 G-INGGGIFAIGHVVQ-PEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGV----G 303
G +GGGI G E+ TPL+ HY++++ ++ V L + F
Sbjct: 246 GEKHGGGILEFGEAPNTTEMVFTPLLNVTTHYNVDLLSISVNSKVLPIDPKEFSYVRNST 305
Query: 304 DNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSE--SVDEGFP 361
+N G IIDSGTT L L +I S K+ + CF +++ FP
Sbjct: 306 NNTGVIIDSGTTFVLLTTKANRMLFQEIKSLT-TAKLGPKLEGLECFYLKSGLTMETSFP 364
Query: 362 NVTFHFENSVSLKVYPHEYLFPFE-----DLWCIGWQNS 395
NVT F ++K+ P YL E + +C W ++
Sbjct: 365 NVTLTFSGGSTMKLKPDNYLVMAEYKKKRNGYCYAWSSA 403
>gi|255079464|ref|XP_002503312.1| predicted protein [Micromonas sp. RCC299]
gi|226518578|gb|ACO64570.1| predicted protein [Micromonas sp. RCC299]
Length = 649
Score = 138 bits (348), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 126/436 (28%), Positives = 206/436 (47%), Gaps = 70/436 (16%)
Query: 12 VLIATAAVGGVSSNHGVFSVKYRYA-GRERSLSLLKEHDARRQQRIL-------AGVDLP 63
V I A + VF+V+ R + +L+ L+EHDA R++RIL P
Sbjct: 42 VRIGGTAESSFDRSPAVFAVRRRESPSTPTALAHLREHDAHRRRRILESPAESPGASTFP 101
Query: 64 LGGSSRPDGVGLYYAKIGIGTP-PKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTL 122
L GS + G YYA I +G P P+ + V VDTGS + +V C C +C + T
Sbjct: 102 LHGSVKEHG--YYYANIALGDPSPRTFQVIVDTGSTLTYVPCATCAKCGTHTG----GTR 155
Query: 123 YDIKDSSTGKFVTCDQEFCHGVYGGPLT----DCTANTSCPYLEIYGDGSSTTGYFVQDV 178
+D TGK++TC ++ C GGP A C Y Y +GS +G V+D
Sbjct: 156 FD----PTGKWLTCQEKQCKAA-GGPGICAGGRGAAANRCTYSRTYAEGSGVSGDLVRDK 210
Query: 179 VQYDKVSGDLQTTSTNGSL--IFGCGARQSGNLDSTNEEALDGIIGFGKSN-SSMISQLA 235
+ + GD+ +TNG+L +FGC +SG + +++ DG+IG G + +S+ +QLA
Sbjct: 211 MHF---GGDI-APATNGTLDVVFGCTNAESGTI---HDQEADGLIGLGNNQFASIPNQLA 263
Query: 236 SSGGVRKMFAHCLDGINGGGIFAIGHVVQ----PEVNKTPLVPNQPH---YSINMTAVQV 288
+ G+ ++F+ C GGG + G + P + T + N+ H Y ++ A+++
Sbjct: 264 DTHGLPRVFSLCFGSFEGGGALSFGRLPATPHTPPLVYTDMRVNEAHPAYYVVSTAAMKI 323
Query: 289 GLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYE-----PLVSKIISQQPDLKVHTV 343
G + P+D+ VG GT++DSGTT Y+P V+ + + +P+ K+ V
Sbjct: 324 GDVAVATPSDL-AVG--YGTVMDSGTTFTYVPTKVFHATAAALDAAVTTNAKPEKKLAKV 380
Query: 344 ------HDEYTCFQYSESVD-----------EGFPNVTFHFE-NSVSLKVYPHEYLF--- 382
+ + CFQ + + E +P +T F+ SL + P YLF
Sbjct: 381 PGPDPSYPDDVCFQREGATEIEPIVTMANLGEYYPPLTIAFDGEGASLVLPPSNYLFVHG 440
Query: 383 PFEDLWCIGWQNSGMQ 398
+C+G ++ Q
Sbjct: 441 KKPGAFCLGVMDNKQQ 456
>gi|297805186|ref|XP_002870477.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316313|gb|EFH46736.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 287
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 83/259 (32%), Positives = 132/259 (50%), Gaps = 23/259 (8%)
Query: 39 ERSLSLLKEHDARRQQRIL-----AGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQV 93
E L+ L D+ R R+L P+ + P +YY + IGTPP+++ V +
Sbjct: 41 ELDLTQLGAFDSARHGRMLQSHVHGAFSFPVERGTNPIS-RIYYTTLQIGTPPREFNVVI 99
Query: 94 DTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCT 153
DTGSD++WV+CI C CP ++ +T +D SS+ + C + C +D
Sbjct: 100 DTGSDVLWVSCISCVGCPLQN-----VTFFDPGASSSAVKLACSDKRC-------FSDLH 147
Query: 154 ANTSCPYLEI---YGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLD 210
+ C LE Y DGS T+GY++ D++ ++ V T ++ +FGC +G L
Sbjct: 148 KKSGCSPLEYKVEYSDGSFTSGYYISDLISFETVMSSNLTVKSSAPFVFGCSNLHAG-LI 206
Query: 211 STNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLD-GINGGGIFAIGHVVQPEVNK 269
S E ++ GI+G GK ++SQL+S ++F+ CL G GGG+ +G P
Sbjct: 207 SLPETSIHGIVGLGKGRLLVVSQLSSQRLAPEVFSLCLSGGQEGGGVIILGENRLPNTVY 266
Query: 270 TPLVPNQPHYSINMTAVQV 288
TPLV +Q HY++N+ V
Sbjct: 267 TPLVRSQTHYNVNLKTFAV 285
>gi|255567949|ref|XP_002524952.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223535787|gb|EEF37449.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 394
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 109/355 (30%), Positives = 168/355 (47%), Gaps = 55/355 (15%)
Query: 64 LGGSSRPDGV----------GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRR 113
L GS+RP+ G Y +I IGTPP+ + + VDTGS + +V C C++C R
Sbjct: 68 LQGSARPNARMRLYDDLLLNGYYTTRIWIGTPPQTFALIVDTGSTVTYVPCSTCEQCGRH 127
Query: 114 SSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTS---CPYLEIYGDGSST 170
E L SST + V+C+ DCT + C Y Y + SS+
Sbjct: 128 QDPKFEPEL-----SSTYQPVSCN------------IDCTCDNERKQCVYERQYAEMSSS 170
Query: 171 TGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSM 230
+G +D++ + Q+ IFGC +++G+L S + DGI+G G+ + S+
Sbjct: 171 SGVLGEDIISFGN-----QSELVPQRAIFGCENQETGDLYS---QRADGIMGLGRGDLSI 222
Query: 231 ISQLASSGGVRKMFAHCLDGIN-GGGIFAIGHVVQPE---VNKTPLVPNQPHYSINMTAV 286
+ QL G + F+ C G++ GGG +G + P ++ V +Q +Y+I++ A+
Sbjct: 223 VDQLVEKGVISDSFSLCYGGMDIGGGAMILGGISPPSGMVFAESDPVRSQ-YYNIDLKAI 281
Query: 287 QVGLDFLNLPTDVFGVGDNK-GTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLK-VHTVH 344
V L+L +F D K GT++DSGTT AYLPE + ++ + LK +H
Sbjct: 282 HVAGKQLHLDPSIF---DGKHGTVLDSGTTYAYLPEAAFTAFKDAMMKELTSLKQIHGPD 338
Query: 345 DEYT--CFQYSES----VDEGFPNVTFHFENSVSLKVYPHEYLFPFE-DLWCIGW 392
Y CF +ES + FP V F N L + P YLF + L GW
Sbjct: 339 PNYNDICFSGAESDVSQLSNTFPAVEMVFSNGQKLSLSPENYLFQYYLGLESFGW 393
>gi|168064509|ref|XP_001784204.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664276|gb|EDQ51002.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 367
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 105/347 (30%), Positives = 153/347 (44%), Gaps = 46/347 (13%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y+A +G+GTP +D Y+ VDTGSDI W+ C C C ++ L++ SS+
Sbjct: 12 GTGEYFAVVGVGTPRRDMYLVVDTGSDITWLQCAPCTNCYKQKD-----ALFNPSSSSSF 66
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
K + C C + + C +N C Y YGDGS T G V D V D G Q
Sbjct: 67 KVLDCSSSLCLNL---DVMGCLSN-KCLYQADYGDGSFTMGELVTDNVVLDDAFGPGQVV 122
Query: 192 STNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI 251
TN L GCG G + GI+G G+ S + L +S R +F++CL
Sbjct: 123 LTNIPL--GCGHDNEGTFGTAA-----GILGLGRGPLSFPNNLDAS--TRNIFSYCLPDR 173
Query: 252 NGG---------GIFAIGHVVQPEVNKTPLVPN---QPHYSINMTAVQVGLDFL-NLPTD 298
G AI H V P + N +Y + +T + VG + L N+P
Sbjct: 174 ESDPNHKSTLVFGDAAIPHTATGSVKFIPQLRNPRVATYYYVQITGISVGGNLLTNIPAS 233
Query: 299 VFGVGD--NKGTIIDSGTTLAYLPEMVYEPLVSKI------ISQQPDLKVHTVHDEYTCF 350
VF + N GTI DSGTT+ L Y + ++ D K+ TC+
Sbjct: 234 VFQLDSHGNGGTIFDSGTTITRLEARAYTAVRDAFRAATMHLTSAADFKIFD-----TCY 288
Query: 351 QYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPF--EDLWCIGWQNS 395
++ P VTFHF+ V +++ P Y+ P +++C + S
Sbjct: 289 DFTGMNSISVPTVTFHFQGDVDMRLPPSNYIVPVSNNNIFCFAFAAS 335
>gi|168060150|ref|XP_001782061.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666472|gb|EDQ53125.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 423
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 110/360 (30%), Positives = 161/360 (44%), Gaps = 37/360 (10%)
Query: 45 LKEHDARRQQRILAG---VDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMW 101
L E D R + G V +GG+ PDG LYY + +G+PPK Y++ +DTGSD+ W
Sbjct: 8 LLERDLSRLGKSSVGNHSVRFHVGGNIYPDG--LYYMALLLGSPPKLYFLDMDTGSDLTW 65
Query: 102 VNC-IQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTS-CP 159
C C+ C ++G LY+ K + K V C C + G +C ++ C
Sbjct: 66 AQCDAPCRNC----AIGPH-GLYNPKKA---KVVDCHLPVCAQIQQGGSYECNSDVKQCD 117
Query: 160 YLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDG 219
Y Y DGSST G V+D + +G L T I GCG Q G L + + + DG
Sbjct: 118 YEVEYADGSSTMGVLVEDTLTVRLTNGTLIQTKA----IIGCGYDQQGTL-AKSPASTDG 172
Query: 220 IIGFGKSNSSMISQLASSGGVRKMFAHCL-DGINGGGIFAIGHVVQPE--VNKTPLV--P 274
+IG S ++ +QLA G ++ + HCL DG NGGG G + P + TP++ P
Sbjct: 173 VIGLSSSKVALPAQLAEKGIIKNVLGHCLADGSNGGGYLFFGDELVPSWGMTWTPMMGKP 232
Query: 275 NQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQ 334
Y + +++ G D L L D + DSGT+ YL Y ++S + Q
Sbjct: 233 EMLGYQARLQSIRYGGDSLVLNNDEDLTRSTSSVMFDSGTSFTYLVPQAYASVLSAVTKQ 292
Query: 335 QPDLKVHTVHDEYTC------FQYSESVDEGFPNVTFH------FENSVSLKVYPHEYLF 382
L+V + C FQ V + F +T F +L + P YL
Sbjct: 293 SGLLRVKSDTTLPYCWRGPSPFQSITDVHQYFKTLTLDFGGRNWFATDSTLDLSPQGYLI 352
>gi|226495123|ref|NP_001141522.1| uncharacterized protein LOC100273634 precursor [Zea mays]
gi|194704920|gb|ACF86544.1| unknown [Zea mays]
gi|223949445|gb|ACN28806.1| unknown [Zea mays]
gi|413924531|gb|AFW64463.1| pepsin A [Zea mays]
Length = 515
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 106/354 (29%), Positives = 164/354 (46%), Gaps = 33/354 (9%)
Query: 45 LKEHDARRQQRILAGVDLPLGGSSRPDGVGL---YYAKIGIGTPPKDYYVQVDTGSDIMW 101
L D +RQ+R LA + L GGS+ G L YYA + +GTP + V +DTGSD+ W
Sbjct: 62 LVRSDIQRQKRRLAVLSLSKGGSTFSPGNDLGWLYYAWVDVGTPATSFLVALDTGSDLFW 121
Query: 102 V--NCIQCKECP-RRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCT-ANTS 157
V +CIQC R +L +L +Y +S+T + + C E C V G CT
Sbjct: 122 VPCDCIQCAPLSGYRGNLDRDLRIYRPAESTTSRHLPCSHELCQSVPG-----CTNPKQP 176
Query: 158 CPY-LEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEA 216
CPY ++ + + ++++G ++D + + + N S+I GCG +QSG D + A
Sbjct: 177 CPYNIDYFSENTTSSGLLIEDTLHLNYREDHV---PVNASVIIGCGQKQSG--DYLDGIA 231
Query: 217 LDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHVVQPEVNKTPLVP-- 274
DG++G G ++ S+ S LA +G V+ F+ C + G IF G P TP VP
Sbjct: 232 PDGLLGLGMADISVPSFLARAGLVQNSFSMCFKEDSSGRIF-FGDQGVPSQQSTPFVPLY 290
Query: 275 -NQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIIS 333
Y++N+ +G L G + ++DSGT+ LP VY+ +
Sbjct: 291 GKLQTYAVNVDKSCIGHKCLE--------GTSFKALVDSGTSFTSLPFDVYKAFTMEFDK 342
Query: 334 QQPDLKVHTVHDEYT-CFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFED 386
Q +V + C+ S P +T F SL+ + PF D
Sbjct: 343 QMNATRVPYEDTTWKYCYSASPLEMPDVPTITLTFAADKSLQAV--NPILPFND 394
>gi|195619700|gb|ACG31680.1| pepsin A [Zea mays]
Length = 485
Score = 132 bits (333), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 106/354 (29%), Positives = 164/354 (46%), Gaps = 33/354 (9%)
Query: 45 LKEHDARRQQRILAGVDLPLGGSSRPDGVGL---YYAKIGIGTPPKDYYVQVDTGSDIMW 101
L D +RQ+R LA + L GGS+ G L YYA + +GTP + V +DTGSD+ W
Sbjct: 32 LVRSDIQRQKRRLAVLSLSKGGSTFSPGNDLGWLYYAWVDVGTPATSFLVALDTGSDLFW 91
Query: 102 V--NCIQCKECP-RRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCT-ANTS 157
V +CIQC R +L +L +Y +S+T + + C E C V G CT
Sbjct: 92 VPCDCIQCAPLSGYRGNLDRDLRIYRPAESTTSRHLPCSHELCQSVPG-----CTNPKQP 146
Query: 158 CPY-LEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEA 216
CPY ++ + + ++++G ++D + + + N S+I GCG +QSG D + A
Sbjct: 147 CPYNIDYFSENTTSSGLLIEDTLHLNYREDHV---PVNASVIIGCGQKQSG--DYLDGIA 201
Query: 217 LDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHVVQPEVNKTPLVP-- 274
DG++G G ++ S+ S LA +G V+ F+ C + G IF G P TP VP
Sbjct: 202 PDGLLGLGMADISVPSFLARAGLVQNSFSMCFKEDSSGRIF-FGDQGVPSQQSTPFVPLY 260
Query: 275 -NQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIIS 333
Y++N+ +G L G + ++DSGT+ LP VY+ +
Sbjct: 261 GKLQTYAVNVDKSCIGHKCLE--------GTSFKALVDSGTSFTSLPLDVYKAFTMEFDK 312
Query: 334 QQPDLKVHTVHDEYT-CFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFED 386
Q +V + C+ S P +T F SL+ + PF D
Sbjct: 313 QMNATRVPYEDTTWKYCYSASPLEMPDVPTITLTFAADKSLQAV--NPILPFND 364
>gi|413924530|gb|AFW64462.1| hypothetical protein ZEAMMB73_591827, partial [Zea mays]
Length = 469
Score = 132 bits (332), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 106/354 (29%), Positives = 164/354 (46%), Gaps = 33/354 (9%)
Query: 45 LKEHDARRQQRILAGVDLPLGGSSRPDGV---GLYYAKIGIGTPPKDYYVQVDTGSDIMW 101
L D +RQ+R LA + L GGS+ G LYYA + +GTP + V +DTGSD+ W
Sbjct: 62 LVRSDIQRQKRRLAVLSLSKGGSTFSPGNDLGWLYYAWVDVGTPATSFLVALDTGSDLFW 121
Query: 102 V--NCIQCKECP-RRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCT-ANTS 157
V +CIQC R +L +L +Y +S+T + + C E C V G CT
Sbjct: 122 VPCDCIQCAPLSGYRGNLDRDLRIYRPAESTTSRHLPCSHELCQSVPG-----CTNPKQP 176
Query: 158 CPY-LEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEA 216
CPY ++ + + ++++G ++D + + + N S+I GCG +QSG D + A
Sbjct: 177 CPYNIDYFSENTTSSGLLIEDTLHLNYREDHV---PVNASVIIGCGQKQSG--DYLDGIA 231
Query: 217 LDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHVVQPEVNKTPLVP-- 274
DG++G G ++ S+ S LA +G V+ F+ C + G IF G P TP VP
Sbjct: 232 PDGLLGLGMADISVPSFLARAGLVQNSFSMCFKEDSSGRIF-FGDQGVPSQQSTPFVPLY 290
Query: 275 -NQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIIS 333
Y++N+ +G L G + ++DSGT+ LP VY+ +
Sbjct: 291 GKLQTYAVNVDKSCIGHKCLE--------GTSFKALVDSGTSFTSLPFDVYKAFTMEFDK 342
Query: 334 QQPDLKVHTVHDEYT-CFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFED 386
Q +V + C+ S P +T F SL+ + PF D
Sbjct: 343 QMNATRVPYEDTTWKYCYSASPLEMPDVPTITLTFAADKSLQAV--NPILPFND 394
>gi|219887985|gb|ACL54367.1| unknown [Zea mays]
Length = 515
Score = 131 bits (329), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 105/354 (29%), Positives = 163/354 (46%), Gaps = 33/354 (9%)
Query: 45 LKEHDARRQQRILAGVDLPLGGSSRPDGVGL---YYAKIGIGTPPKDYYVQVDTGSDIMW 101
L D +RQ+R LA + L GGS+ G L YYA + +GTP + V +DTGSD+ W
Sbjct: 62 LVRSDIQRQKRRLAVLSLSKGGSTFSPGNDLGWLYYAWVDVGTPATSFLVALDTGSDLFW 121
Query: 102 V--NCIQCKECP-RRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCT-ANTS 157
V +CIQC R +L +L +Y +S+T + + C E C V G CT
Sbjct: 122 VPCDCIQCAPLSGYRGNLDRDLRIYRPAESTTSRHLPCSHELCQSVPG-----CTNPKQP 176
Query: 158 CPY-LEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEA 216
CPY ++ + + ++++G ++D + + + N S+I GCG +QSG D + A
Sbjct: 177 CPYNIDYFSENTTSSGLLIEDTLHLNYREDHV---PVNASVIIGCGQKQSG--DYLDGIA 231
Query: 217 LDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHVVQPEVNKTPLVP-- 274
DG++ G ++ S+ S LA +G V+ F+ C + G IF G P TP VP
Sbjct: 232 PDGLLALGMADISVPSFLARAGLVQNSFSMCFKEDSSGRIF-FGDQGVPSQQSTPFVPLY 290
Query: 275 -NQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIIS 333
Y++N+ +G L G + ++DSGT+ LP VY+ +
Sbjct: 291 GKLQTYAVNVDKSCIGHKCLE--------GTSFKALVDSGTSFTSLPFDVYKAFTMEFDK 342
Query: 334 QQPDLKVHTVHDEYT-CFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFED 386
Q +V + C+ S P +T F SL+ + PF D
Sbjct: 343 QMNATRVPYEDTTWKYCYSASPLEMPDVPTITLTFAADKSLQAV--NPILPFND 394
>gi|384252236|gb|EIE25712.1| acid protease [Coccomyxa subellipsoidea C-169]
Length = 599
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 107/358 (29%), Positives = 163/358 (45%), Gaps = 57/358 (15%)
Query: 56 ILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQC-KEC-PRR 113
+L LPL G+ + G +YA + +GTP + + V VDTGS I +V C C + C P
Sbjct: 44 LLRNATLPLHGAVKD--YGYFYATLHLGTPARQFAVIVDTGSTITYVPCASCGRNCGPHH 101
Query: 114 SSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGY 173
+D SS+ + CD + C + G P C+ C Y Y + SS+ G
Sbjct: 102 KD-----AAFDPASSSSSAVIGCDSDKC--ICGRPPCGCSEKRECTYQRTYAEQSSSAGL 154
Query: 174 FVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQ 233
V D LQ ++FGC +++G + N+EA DGI+G G S S+++Q
Sbjct: 155 LVSD---------QLQLRDGAVEVVFGCETKETGEI--YNQEA-DGILGLGNSEVSLVNQ 202
Query: 234 LASSGGVRKMFAHCLDGINGGGIFAIGHVVQPE----VNKTPLVPNQPH---YSINMTAV 286
LA SG + +FA C + G G +G V E + T L+ + H YS+ + A+
Sbjct: 203 LAGSGVIDDVFALCFGSVEGDGALMLGDVDAAEYDVALQYTALLSSLAHPHYYSVQLEAL 262
Query: 287 QVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIIS-----------QQ 335
VG L + + + G GT++DSGTT YLP ++ L + +S +
Sbjct: 263 WVGGQQLPVKPERYEEG--YGTVLDSGTTFTYLPSEAFQ-LFKEAVSAYALEHGLNSVKG 319
Query: 336 PDLKVHT---VHDEYTCFQYS--------ESVDEGFPNVTFHFENSVSLKVYPHEYLF 382
PD K + HD CF + +++ FP F + V L+ P YLF
Sbjct: 320 PDPKEKSFAQFHD--ICFGGAPHAGHADQSKLEKVFPVFELQFADGVRLRTGPLNYLF 375
>gi|125589909|gb|EAZ30259.1| hypothetical protein OsJ_14308 [Oryza sativa Japonica Group]
Length = 178
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 72/181 (39%), Positives = 97/181 (53%), Gaps = 11/181 (6%)
Query: 26 HGVFSVKYRY-----AGRERSLSLLKEHDA-RRQQRILAGVDLPLGGSSRPDGVGLYYAK 79
+GVF V+ ++ + + L+ HD R ++R L +LPLGG + P G GLYY
Sbjct: 3 NGVFQVRRKFHIVDGVYKGSDIGALQTHDENRHRRRNLMAAELPLGGFNIPYGTGLYYTD 62
Query: 80 IGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQE 139
IGIGTP YYVQ+DTGS WVN I CK+CP S + +LT YD + S + K V CD
Sbjct: 63 IGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTFYDPRSSVSSKEVKCDDT 122
Query: 140 FCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIF 199
C C CPY+ Y DG T G D++ Y ++ G+ QT T+ S+ F
Sbjct: 123 ICTS-----RPPCNMTLRCPYITGYADGGLTMGILFTDLLHYHQLYGNGQTQPTSTSVTF 177
Query: 200 G 200
G
Sbjct: 178 G 178
>gi|255545620|ref|XP_002513870.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223546956|gb|EEF48453.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 535
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 113/387 (29%), Positives = 178/387 (45%), Gaps = 47/387 (12%)
Query: 22 VSSNHGVFSVKYRYAGRERSLSLLKEHDARRQQRILAGVD---LPLGGSSR----PDGVG 74
+S N V S + + L LL ++D +RQ+ L + P GS D
Sbjct: 41 ISGNDNVSSQTWPNKNSFQYLQLLLDNDLKRQKMKLGAQNQLLFPSLGSHTFFYGNDLDW 100
Query: 75 LYYAKIGIGTPPKDYYVQVDTGSDIMWV--NCIQCKECPRRSS----LGIELTLYDIKDS 128
L+Y I IGTP + V +D GSD+ WV +CIQC P +S L +L+ Y S
Sbjct: 101 LHYTWIDIGTPNVSFLVALDAGSDLSWVPCDCIQCA--PLSASLYKPLDRDLSEYRPSLS 158
Query: 129 STGKFVTCDQEFCH-GVYGGPLTDCTANTSCPYLEIYGD-GSSTTGYFVQDVVQYDKVSG 186
+T + ++C+ + C G + L D CPY+ Y D +S++G+ V+D++ VS
Sbjct: 159 TTSRHLSCNHQLCELGSHCKNLKD-----PCPYIADYADPNTSSSGFLVEDILHLASVSD 213
Query: 187 DLQTTS--TNGSLIFGCGARQSGN-LDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKM 243
D +T S+I GCG +Q+G LD A DG++G G + S+ S LA +G +RK
Sbjct: 214 DSNSTQKRVQASVILGCGRKQTGGYLDGA---APDGVMGLGPGSISVPSLLAKAGLIRKS 270
Query: 244 FAHCLDGINGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVG 303
F+ C D +NG G G TPL+P Q +Y + V+ + VG
Sbjct: 271 FSLCFD-VNGSGTILFGDQGHTSQKSTPLLPTQGNYDAYLIEVE-----------SYCVG 318
Query: 304 DN------KGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYT-CFQYSESV 356
++ ++DSG + YLP VY +V + Q ++ + + C+ S
Sbjct: 319 NSCLKQSGFKALVDSGASFTYLPIDVYNKIVLEFDKQVNAQRISSQGGPWNYCYNTSSKQ 378
Query: 357 DEGFPNVTFHFENSVSLKVYPHEYLFP 383
+ P + F + SL ++ Y P
Sbjct: 379 LDNVPAMRLSFLMNQSLLIHNSTYYVP 405
>gi|168063189|ref|XP_001783556.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664943|gb|EDQ51645.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 414
Score = 127 bits (320), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 113/371 (30%), Positives = 163/371 (43%), Gaps = 41/371 (11%)
Query: 63 PLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTL 122
P+GG+ PDG LYY + IG P K YY+ +DTGSD+ W+ C + P RS L
Sbjct: 20 PIGGNIYPDG--LYYMAMRIGNPAKLYYLDMDTGSDLTWLQC----DAPCRSCAVGPHGL 73
Query: 123 YDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANT-SCPYLEIYGDGSSTTGYFVQDVVQY 181
YD K + + V C + C V G C+ + C Y Y DGSST G V+D +
Sbjct: 74 YDPKRA---RVVDCRRPTCAQVQRGGQFTCSGDVRQCDYEVDYVDGSSTMGILVEDTITL 130
Query: 182 DKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVR 241
+G T + GCG Q G L + DG+IG S S+ SQLA+ G
Sbjct: 131 VLTNG----TRFQTRAVIGCGYDQQGTL-AKAPAVTDGVIGLSSSKISLPSQLAAKGIAN 185
Query: 242 KMFAHCLD-GINGGGIFAIGHVVQPEVNK--TPLV--PNQPHYSINMTAVQVGLDFLNLP 296
+ HCL G NGGG G + P + TP++ P Y + +++ G + L L
Sbjct: 186 NVIGHCLAGGSNGGGYLFFGDTLVPALGMTWTPMIGRPLVEGYQARLRSIKYGGEVLELE 245
Query: 297 TDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTC------- 349
VG G + DSGT+ YL Y ++S ++ Q + + + T
Sbjct: 246 GTTDDVG---GAMFDSGTSFTYLVPNAYTAVLSAVVRQAQRSGLERIKTDTTLPFCWRGP 302
Query: 350 --FQYSESVDEGFPNVTFHFENSVS------LKVYPHEYLF-PFEDLWCIGWQNSGMQSR 400
F+ V F VT F S L++ P YL + C+G ++ + S
Sbjct: 303 SPFESVADVSAYFKTVTLDFGGSTWWSSGKLLELSPEGYLIVSTQGNVCLGVLDASVASL 362
Query: 401 DRKNMTLLGDF 411
+ N +LGD
Sbjct: 363 EVTN--ILGDI 371
>gi|147834977|emb|CAN67955.1| hypothetical protein VITISV_031916 [Vitis vinifera]
Length = 291
Score = 127 bits (320), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 69/165 (41%), Positives = 101/165 (61%), Gaps = 6/165 (3%)
Query: 42 LSLLKEHDARRQQRILAGV-----DLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTG 96
L +L+ D R R+L GV D + G+S P VGLY+ K+ +G+PP+++ VQ+DTG
Sbjct: 127 LEVLRARDQARHGRLLRGVVGGVVDFTVYGTSDPYLVGLYFTKVKLGSPPREFNVQIDTG 186
Query: 97 SDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANT 156
SDI+WV C C +CPR S LGIEL+ +D SST V+C C + +C+ +
Sbjct: 187 SDILWVTCNSCNDCPRTSGLGIELSFFDPSSSSTTSLVSCSHPICTSLVQTTAAECSPQS 246
Query: 157 S-CPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFG 200
+ C Y YGDGS TTGY+V D++ +D V GD +++ S++FG
Sbjct: 247 NQCSYSFHYGDGSGTTGYYVSDMLYFDTVLGDSLIANSSASIVFG 291
>gi|168025534|ref|XP_001765289.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162683608|gb|EDQ70017.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 372
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 97/308 (31%), Positives = 142/308 (46%), Gaps = 31/308 (10%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G + I +GTPP+ V +DTGSD+ W+ C+ C ++ ++D SST
Sbjct: 21 GYGEFLVPIYLGTPPQKAVVIIDTGSDLTWIQSEPCRACFEQAD-----PIFDPSKSSTY 75
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
+ C C + G C+A +C Y YGDGS T GYF ++ + +G+
Sbjct: 76 NKIACSSSACADLLG--TQTCSAAANCIYAYGYGDGSVTRGYFSKETITATDTAGE---- 129
Query: 192 STNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL-DG 250
+ FG +G T E GI+G G+ SM SQL S G + F++CL D
Sbjct: 130 ----EVKFGASVYNTGTFGDTGGE---GILGLGQGPVSMPSQLGSVLGNK--FSYCLVDW 180
Query: 251 INGGG-----IFAIGHVVQPEVNKTPLVPNQPH---YSINMTAVQVGLDFLNLPTDVFGV 302
++ G F V EV TP+VPN H Y I + + VG L++ V+ +
Sbjct: 181 LSAGSETSTMYFGDAAVPSGEVQYTPIVPNADHPTYYYIAVQGISVGGSLLDIDQSVYEI 240
Query: 303 --GDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGF 360
G + GTIIDSGTT+ YL + V+ LV+ SQ + CF + F
Sbjct: 241 DSGGSGGTIIDSGTTITYLQQEVFNALVAAYTSQVRYPTTTSATGLDLCFNTRGTGSPVF 300
Query: 361 PNVTFHFE 368
P +T H +
Sbjct: 301 PAMTIHLD 308
>gi|224092220|ref|XP_002309515.1| predicted protein [Populus trichocarpa]
gi|222855491|gb|EEE93038.1| predicted protein [Populus trichocarpa]
Length = 473
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 94/335 (28%), Positives = 160/335 (47%), Gaps = 35/335 (10%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQC-KECPRRSSLGIELTLYDIKDSST 130
G G Y +G+GTP K++ + DTGSD+ W C C K C ++ ++ T S++
Sbjct: 129 GSGDYAVTVGLGTPKKEFTLIFDTGSDLTWTQCEPCAKTCYKQKEPRLDPT-----KSTS 183
Query: 131 GKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQT 190
K ++C FC + C++ T C Y YGDGS + G+F + + L +
Sbjct: 184 YKNISCSSAFCKLLDTEGGESCSSPT-CLYQVQYGDGSYSIGFFATETLT-------LSS 235
Query: 191 TSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDG 250
++ + +FGCG + SG G++G G++ S+ SQ A +K+F++CL
Sbjct: 236 SNVFKNFLFGCGQQNSGLF-----RGAAGLLGLGRTKLSLPSQTAQK--YKKLFSYCLPA 288
Query: 251 INGG-GIFAIGHVVQPEVNKTPL---VPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNK 306
+ G + G V V TPL + P Y +++T + VG + L++ +F
Sbjct: 289 SSSSKGYLSFGGQVSKTVKFTPLSEDFKSTPFYGLDITELSVGGNKLSIDASIF---STS 345
Query: 307 GTIIDSGTTLAYLPEMVYEPLVS---KIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNV 363
GT+IDSGT + LP Y L S K+++ P +++ D TC+ +S++ P V
Sbjct: 346 GTVIDSGTVITRLPSTAYSALSSAFQKLMTDYPSTDGYSIFD--TCYDFSKNETIKIPKV 403
Query: 364 TFHFENSVSLKVYPHEYLFPFEDLW--CIGWQNSG 396
F+ V + + L+P L C+ + +G
Sbjct: 404 GVSFKGGVEMDIDVSGILYPVNGLKKVCLAFAGNG 438
>gi|297795137|ref|XP_002865453.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297311288|gb|EFH41712.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 665
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 101/351 (28%), Positives = 162/351 (46%), Gaps = 47/351 (13%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKF 133
G Y ++ IGTPP+++ + VDTGS + +V C CK+C + + L SS+ K
Sbjct: 78 GYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSTCKQCGKHQDPKFQPEL-----SSSYKA 132
Query: 134 VTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTST 193
+ C+ + C+ G L C Y Y + SS++G +D++ + ++ T
Sbjct: 133 LKCNPD-CNCDDEGKL--------CVYERRYAEMSSSSGVLSEDLISFGN-----ESQLT 178
Query: 194 NGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGIN- 252
+FGC ++G+L S + DGI+G G+ S++ QL G + +F+ C G+
Sbjct: 179 PQRAVFGCENVETGDLFS---QRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGMEV 235
Query: 253 GGGIFAIGHVVQPE----VNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGT 308
GGG +G + P + P P+Y+I++ + V L L VF GT
Sbjct: 236 GGGAMVLGKISPPAGMVFSHSDPF--RSPYYNIDLKQMHVAGKSLKLNPKVF--NGKHGT 291
Query: 309 IIDSGTTLAYLPEMVYEPLVSKIISQQPDLK-VHTVHDEY--TCFQYS----ESVDEGFP 361
++DSGTT AY P+ + + II + P LK +H Y CF + + FP
Sbjct: 292 VLDSGTTYAYFPKEAFIAIKDAIIKEIPSLKRIHGPDPNYDDVCFSGAGRDVAEIHNFFP 351
Query: 362 NVTFHFENSVSLKVYPHEYLFPFEDL---WCIGWQNSGMQSRDRKNMTLLG 409
+ F N L + P YLF + +C+G DR + TLLG
Sbjct: 352 EIDMEFGNGQKLILSPENYLFRHTKVRGAYCLGI------FPDRDSTTLLG 396
>gi|212722898|ref|NP_001132197.1| pepsin A precursor [Zea mays]
gi|194693730|gb|ACF80949.1| unknown [Zea mays]
gi|195605492|gb|ACG24576.1| pepsin A [Zea mays]
gi|413938914|gb|AFW73465.1| pepsin A [Zea mays]
Length = 519
Score = 124 bits (311), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 104/358 (29%), Positives = 163/358 (45%), Gaps = 37/358 (10%)
Query: 45 LKEHDARRQQRILAG----VDLPLGGSSRPDGVGL---YYAKIGIGTPPKDYYVQVDTGS 97
L D +RQ+R LAG + L GGS+ G L YYA + +GTP + V +DTGS
Sbjct: 62 LLRSDLQRQKRRLAGKNQLLSLSKGGSTFSPGNDLGWLYYAWVDVGTPTTSFLVALDTGS 121
Query: 98 DIMWV--NCIQCKECPR-RSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYG--GPLTDC 152
D+ WV +CIQC R +L +L +Y +S+T + + C E C G P C
Sbjct: 122 DLFWVPCDCIQCAPLSSYRGNLDRDLGIYKPAESTTSRHLPCSHELCQPGSGCTNPKQPC 181
Query: 153 TANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDST 212
T N ++ + + ++++G ++D + + G N S+I GCG +QSG D
Sbjct: 182 TYN-----IDYFSENTTSSGLLIEDSLHLNSREGH---APVNASVIIGCGRKQSG--DYL 231
Query: 213 NEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHVVQPEVNKTPL 272
+ A DG++G G ++ S+ S LA +G VR F+ C + G IF V + TP
Sbjct: 232 DGIAPDGLLGLGMADISVPSFLARAGLVRNSFSMCFKEDSSGRIFFGDQGVSSQ-QSTPF 290
Query: 273 VP---NQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVS 329
VP Y++N+ +G L G + ++DSGT+ LP VY+ +
Sbjct: 291 VPLYGKLQTYAVNVDKSCIGHKCLE--------GSSFQALVDSGTSFTSLPPDVYKAFTT 342
Query: 330 KIISQQPDLKVHTVHDEYT-CFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFED 386
+ Q +V + C+ S P + F + S + + PF D
Sbjct: 343 EFDKQINASRVPYEDSTWKYCYSASPLEMPDVPTIILAFAANKSFQAV--NPILPFND 398
>gi|449523529|ref|XP_004168776.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 461
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 102/360 (28%), Positives = 170/360 (47%), Gaps = 48/360 (13%)
Query: 34 RYAGRERSLSLLKEHDARRQQRILAGVDLPLGGSSRPD---GVGLYYAKIGIGTPPKDYY 90
R+ R ++ K R +LA + +G + G G + K+ IG+PP+ +
Sbjct: 66 RFERLRRGVARGKNRLHRLNAMVLAAANATVGDQVKAPVVAGNGEFLMKLAIGSPPRSFS 125
Query: 91 VQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLT 150
+DTGSD++W C C++C +S+ ++D K SS+ ++C E C + P +
Sbjct: 126 AIMDTGSDLIWTQCKPCQQCFDQST-----PIFDPKQSSSFYKISCSSELCGAL---PTS 177
Query: 151 DCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLD 210
C+++ C YL YGD SST G + + + D + G FGCG +G D
Sbjct: 178 TCSSD-GCEYLYTYGDSSSTQGVLAFETFTFGDSTEDQISIPGLG---FGCGNDNNG--D 231
Query: 211 STNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGING--------GGIFAIG-H 261
++ A G++G G+ S++SQL + FA+CL I+ G + I
Sbjct: 232 GFSQGA--GLVGLGRGPLSLVSQLK-----EQKFAYCLTAIDDSKPSSLLLGSLANITPK 284
Query: 262 VVQPEVNKTPLV--PNQPH-YSINMTAVQVGLDFLNLPTDVFGVGDN--KGTIIDSGTTL 316
+ E+ TPL+ P+QP Y +++ + VG L++P F + D+ G IIDSGTT+
Sbjct: 285 TSKDEMKTTPLIKNPSQPSFYYLSLQGISVGGTQLSIPKSTFELHDDGSGGVIIDSGTTI 344
Query: 317 AYLPEMVYEPLVSKIISQQPDLKVHTVHDEYT-----CFQYSESVDE-GFPNVTFHFENS 370
Y+ + L ++ I+Q V D T CF ++ P +TFHF+ +
Sbjct: 345 TYVENSAFTSLKNEFIAQMN----LPVDDSGTGGLDLCFNLPAGTNQVEVPKLTFHFKGA 400
>gi|449453902|ref|XP_004144695.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-1-like, partial [Cucumis sativus]
Length = 716
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 102/358 (28%), Positives = 169/358 (47%), Gaps = 48/358 (13%)
Query: 34 RYAGRERSLSLLKEHDARRQQRILAGVDLPLGGSSRPD---GVGLYYAKIGIGTPPKDYY 90
R+ R ++ K R +LA + +G + G G + K+ IG+PP+ +
Sbjct: 321 RFERLRRGVARGKNRLHRLNAMVLAAANATVGDQVKAPVVAGNGEFLMKLAIGSPPRSFS 380
Query: 91 VQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLT 150
+DTGSD++W C C++C +S+ ++D K SS+ ++C E C + P +
Sbjct: 381 AIMDTGSDLIWTQCKPCQQCFDQST-----PIFDPKQSSSFYKISCSSELCGAL---PTS 432
Query: 151 DCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLD 210
C+++ C YL YGD SST G + + + D + G FGCG +G D
Sbjct: 433 TCSSD-GCEYLYTYGDSSSTQGVLAFETFTFGDSTEDQISIPGLG---FGCGNDNNG--D 486
Query: 211 STNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGING--------GGIFAIG-H 261
++ A G++G G+ S++SQL + FA+CL I+ G + I
Sbjct: 487 GFSQGA--GLVGLGRGPLSLVSQLK-----EQKFAYCLTAIDDSKPSSLLLGSLANITPK 539
Query: 262 VVQPEVNKTPLV--PNQPH-YSINMTAVQVGLDFLNLPTDVFGVGDN--KGTIIDSGTTL 316
+ E+ TPL+ P+QP Y +++ + VG L++P F + D+ G IIDSGTT+
Sbjct: 540 TSKDEMKTTPLIKNPSQPSFYYLSLQGISVGGTQLSIPKSTFELHDDGSGGVIIDSGTTI 599
Query: 317 AYLPEMVYEPLVSKIISQQPDLKVHTVHDEYT-----CFQYSESVDE-GFPNVTFHFE 368
Y+ + L ++ I+Q V D T CF ++ P +TFHF+
Sbjct: 600 TYVENSAFTSLKNEFIAQMN----LPVDDSGTGGLDLCFNLPAGTNQVEVPKLTFHFK 653
>gi|414887402|tpg|DAA63416.1| TPA: hypothetical protein ZEAMMB73_414910 [Zea mays]
Length = 407
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 103/355 (29%), Positives = 165/355 (46%), Gaps = 54/355 (15%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKF 133
G Y ++ IGTPP+++ + VD+GS + +V C C++C + L SS+
Sbjct: 87 GYYTTRLYIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDL-----SSSYSP 141
Query: 134 VTCDQEFCHGVYGGPLTDCTANT---SCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQT 190
V C+ DCT ++ C Y Y + SS++G +D+V + + S +L+
Sbjct: 142 VKCN------------VDCTCDSDKKQCTYERQYAEMSSSSGVLGEDIVSFGRES-ELKA 188
Query: 191 TSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDG 250
+FGC ++G+L S + DGI+G G+ S++ QL G + F+ C G
Sbjct: 189 QRA----VFGCENSETGDLFSQHA---DGIMGLGRGQLSIMDQLVEKGVINDSFSLCYGG 241
Query: 251 IN-GGGIFAIGHVVQPE----VNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDN 305
++ GGG +G V P PL P+Y+I + + V L + + +F D+
Sbjct: 242 MDIGGGAMVLGGVPTPSDMVFSRSDPL--RSPYYNIELKEIHVAGKALRVDSRIF---DS 296
Query: 306 K-GTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHD---EYTCFQYSE----SVD 357
K GT++DSGTT AYLPE + + S+ LK D + CF + +
Sbjct: 297 KHGTVLDSGTTYAYLPEQAFMAFKDAVTSKVHSLKKIRGPDPSYKDICFAGARRNVSKLH 356
Query: 358 EGFPNVTFHFENSVSLKVYPHEYLFPFEDL---WCIGWQNSGMQSRDRKNMTLLG 409
E FP+V F N L + P YLF + +C+G +G + TLLG
Sbjct: 357 EVFPDVDMVFGNGQKLSLTPENYLFRHSKVDGAYCLGVFQNG-----KDPTTLLG 406
>gi|293334661|ref|NP_001168795.1| uncharacterized protein LOC100382594 precursor [Zea mays]
gi|223973065|gb|ACN30720.1| unknown [Zea mays]
Length = 631
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 102/354 (28%), Positives = 164/354 (46%), Gaps = 52/354 (14%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKF 133
G Y ++ IGTPP+++ + VD+GS + +V C C++C + L SS+
Sbjct: 86 GYYTTRLYIGTPPQEFALIVDSGSTVTYVPCSSCEQCGNHQDPRFQPDL-----SSSYSP 140
Query: 134 VTCDQEFCHGVYGGPLTDCTANT---SCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQT 190
V C+ DCT ++ C Y Y + SS++G +D+V + + S +L+
Sbjct: 141 VKCN------------VDCTCDSDKKQCTYERQYAEMSSSSGVLGEDIVSFGRES-ELKP 187
Query: 191 TSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDG 250
IFGC ++G+L S + DGI+G G+ S++ QL G + F+ C G
Sbjct: 188 QHA----IFGCENSETGDLFSQHA---DGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGG 240
Query: 251 IN-GGGIFAIGHVVQPE----VNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDN 305
++ GGG +G ++ P N PL P+Y+I + + V L + + +F
Sbjct: 241 MDIGGGAMVLGGMLAPPDMIFSNSDPL--RSPYYNIELKEIHVAGKALRVESRIF--NSK 296
Query: 306 KGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHD---EYTCF----QYSESVDE 358
GT++DSGTT AYLPE + + S+ LK D + CF + + E
Sbjct: 297 HGTVLDSGTTYAYLPEQAFVAFKEAVTSKVHSLKKIRGPDPSYKDICFAGAGRNVSKLHE 356
Query: 359 GFPNVTFHFENSVSLKVYPHEYLFPFEDL---WCIGWQNSGMQSRDRKNMTLLG 409
FP+V F N L + P YLF + +C+G +G + TLLG
Sbjct: 357 VFPDVDMVFGNGQKLSLTPENYLFRHSKVDGAYCLGVFQNG-----KDPTTLLG 405
>gi|42568291|ref|NP_199124.3| aspartyl protease family protein [Arabidopsis thaliana]
gi|332007527|gb|AED94910.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 631
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 97/351 (27%), Positives = 162/351 (46%), Gaps = 47/351 (13%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKF 133
G Y ++ IGTPP+++ + VDTGS + +V C CK+C + + L S++ +
Sbjct: 74 GYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSTCKQCGKHQDPKFQPEL-----STSYQA 128
Query: 134 VTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTST 193
+ C+ + C+ G L C Y Y + SS++G +D++ + ++ +
Sbjct: 129 LKCNPD-CNCDDEGKL--------CVYERRYAEMSSSSGVLSEDLISFGN-----ESQLS 174
Query: 194 NGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGIN- 252
+FGC ++G+L S + DGI+G G+ S++ QL G + +F+ C G+
Sbjct: 175 PQRAVFGCENEETGDLFS---QRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGMEV 231
Query: 253 GGGIFAIGHVVQPE----VNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGT 308
GGG +G + P + P P+Y+I++ + V L L VF GT
Sbjct: 232 GGGAMVLGKISPPPGMVFSHSDPF--RSPYYNIDLKQMHVAGKSLKLNPKVF--NGKHGT 287
Query: 309 IIDSGTTLAYLPEMVYEPLVSKIISQQPDLK-VHTVHDEY--TCFQYS----ESVDEGFP 361
++DSGTT AY P+ + + +I + P LK +H Y CF + + FP
Sbjct: 288 VLDSGTTYAYFPKEAFIAIKDAVIKEIPSLKRIHGPDPNYDDVCFSGAGRDVAEIHNFFP 347
Query: 362 NVTFHFENSVSLKVYPHEYLFPFEDL---WCIGWQNSGMQSRDRKNMTLLG 409
+ F N L + P YLF + +C+G DR + TLLG
Sbjct: 348 EIAMEFGNGQKLILSPENYLFRHTKVRGAYCLGI------FPDRDSTTLLG 392
>gi|9757837|dbj|BAB08274.1| unnamed protein product [Arabidopsis thaliana]
Length = 586
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 96/354 (27%), Positives = 160/354 (45%), Gaps = 53/354 (14%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKF 133
G Y ++ IGTPP+++ + VDTGS + +V C CK+C + + L S++ +
Sbjct: 74 GYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSTCKQCGKHQDPKFQPEL-----STSYQA 128
Query: 134 VTCDQEFCHGVYGGPLTDCTANTS---CPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQT 190
+ C+ DC + C Y Y + SS++G +D++ + ++
Sbjct: 129 LKCN------------PDCNCDDEGKLCVYERRYAEMSSSSGVLSEDLISFGN-----ES 171
Query: 191 TSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDG 250
+ +FGC ++G+L S + DGI+G G+ S++ QL G + +F+ C G
Sbjct: 172 QLSPQRAVFGCENEETGDLFS---QRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGG 228
Query: 251 IN-GGGIFAIGHVVQPE----VNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDN 305
+ GGG +G + P + P P+Y+I++ + V L L VF
Sbjct: 229 MEVGGGAMVLGKISPPPGMVFSHSDPF--RSPYYNIDLKQMHVAGKSLKLNPKVF--NGK 284
Query: 306 KGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLK-VHTVHDEY--TCFQYS----ESVDE 358
GT++DSGTT AY P+ + + +I + P LK +H Y CF + +
Sbjct: 285 HGTVLDSGTTYAYFPKEAFIAIKDAVIKEIPSLKRIHGPDPNYDDVCFSGAGRDVAEIHN 344
Query: 359 GFPNVTFHFENSVSLKVYPHEYLFPFEDL---WCIGWQNSGMQSRDRKNMTLLG 409
FP + F N L + P YLF + +C+G DR + TLLG
Sbjct: 345 FFPEIAMEFGNGQKLILSPENYLFRHTKVRGAYCLGI------FPDRDSTTLLG 392
>gi|115448709|ref|NP_001048134.1| Os02g0751100 [Oryza sativa Japonica Group]
gi|46390211|dbj|BAD15642.1| aspartyl protease-like [Oryza sativa Japonica Group]
gi|113537665|dbj|BAF10048.1| Os02g0751100 [Oryza sativa Japonica Group]
gi|222623681|gb|EEE57813.1| hypothetical protein OsJ_08401 [Oryza sativa Japonica Group]
Length = 520
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 109/365 (29%), Positives = 167/365 (45%), Gaps = 51/365 (13%)
Query: 45 LKEHDARRQQRILAG----VDLPLGGSSRPDGVGL---YYAKIGIGTPPKDYYVQVDTGS 97
L D +RQ+R + G + L GGS P G L YY + +GTP + V +DTGS
Sbjct: 64 LVRSDLQRQKRRVGGKYQLLSLSQGGSIFPSGNDLGWLYYTWVDVGTPNTSFLVALDTGS 123
Query: 98 DIMWV--NCIQCKECPR-RSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCT- 153
D+ WV +CIQC SL +L +Y +S+T + + C E C P + CT
Sbjct: 124 DLFWVPCDCIQCAPLSSYHGSLDRDLGIYKPSESTTSRHLPCSHELC-----SPASGCTN 178
Query: 154 ANTSCPY-LEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDST 212
CPY ++ + + ++++G ++D++ D G N S+I GCG +QSG+
Sbjct: 179 PKQPCPYNIDYFSENTTSSGLLIEDMLHLDSREGH---APVNASVIIGCGKKQSGSY--L 233
Query: 213 NEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHVVQPEVNKTPL 272
A DG++G G ++ S+ S LA +G VR F+ C + G IF G P TP
Sbjct: 234 EGIAPDGLLGLGMADISVPSFLARAGLVRNSFSMCFKKDDSGRIF-FGDQGVPTQQSTPF 292
Query: 273 VP---NQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVS 329
VP Y++N+ +G G ++D+GT+ LP Y
Sbjct: 293 VPMNGKLQTYAVNVDKYCIGHKCTE--------GAGFQALVDTGTSFTSLPLDAY----- 339
Query: 330 KIISQQPDLKVHTVH---DEYTCFQYSESVD----EGFPNVTFHF-ENSVSLKVYPHEYL 381
K I+ + D +++ D+Y+ F+Y S P +T F EN V P +
Sbjct: 340 KSITMEFDKQINASRASSDDYS-FEYCYSTGPLEMPDVPTITLTFAENKSFQAVNP---I 395
Query: 382 FPFED 386
PF D
Sbjct: 396 LPFND 400
>gi|307103543|gb|EFN51802.1| hypothetical protein CHLNCDRAFT_59135 [Chlorella variabilis]
Length = 746
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 101/378 (26%), Positives = 172/378 (45%), Gaps = 56/378 (14%)
Query: 33 YRYAGRERSLSL---LKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDY 89
YR++G+ S ++ ++ +L +PL G+ + G +YA + +GTP K +
Sbjct: 34 YRHSGKRTSFGFRVQARDFQPTFRRSLLRNSTMPLHGAVK--DYGYFYATLYLGTPAKKF 91
Query: 90 YVQVDTGSDIMWVNCIQCKECPRRSSLGI--ELTLYDIKDSSTGKFVTCDQEFCHGVYGG 147
V VDTGS + +V C C S G + +D + SST ++C C G
Sbjct: 92 AVIVDTGSTMTYVPCSSCG-----SGCGPNHQDAAFDPEASSTASRISCTSPKCS--CGS 144
Query: 148 PLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQ-YDKVSGDLQTTSTNGSLIFGCGARQS 206
P C+ C Y Y + SS++G ++DV+ +D + G +IFGC R++
Sbjct: 145 PRCGCSTQ-QCTYTRSYAEQSSSSGILLEDVLALHDGLPG--------APIIFGCETRET 195
Query: 207 GNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHVVQP- 265
G + + DG+ G G S++S+++QL +G + +F+ C + G G +G P
Sbjct: 196 GEI---FRQRADGLFGLGNSDASVVNQLVKAGVIDDVFSLCFGMVEGDGALLLGDAEVPG 252
Query: 266 --EVNKTPLVPNQPH---YSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLP 320
+ TPL+ + H Y++ M ++ V L + +F G GT++DSGTT Y+P
Sbjct: 253 SISLQYTPLLTSTTHPFYYNVKMLSLAVEGQLLPVSQSLFDQG--YGTVLDSGTTFTYMP 310
Query: 321 EMVYEPLVSKIISQQ----------PDLKVHTVHDEYTCFQYSESVDE------GFPNVT 364
V++ + PD + + CF + S D+ FP++
Sbjct: 311 SPVFKAFAGAVEKYALSHGLKRVPGPDPQFDDI-----CFGQAPSHDDLEALSSVFPSME 365
Query: 365 FHFENSVSLKVYPHEYLF 382
F+ SL + P YLF
Sbjct: 366 VQFDQGTSLVLGPLNYLF 383
>gi|218191589|gb|EEC74016.1| hypothetical protein OsI_08957 [Oryza sativa Indica Group]
Length = 520
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 109/365 (29%), Positives = 167/365 (45%), Gaps = 51/365 (13%)
Query: 45 LKEHDARRQQRILAG----VDLPLGGSSRPDGVGL---YYAKIGIGTPPKDYYVQVDTGS 97
L D +RQ+R + G + L GGS P G L YY + +GTP + V +DTGS
Sbjct: 64 LVRSDLQRQKRRVGGKYQLLSLSQGGSIFPSGNDLGWLYYTWVDVGTPNTSFLVALDTGS 123
Query: 98 DIMWV--NCIQCKECPR-RSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCT- 153
D+ WV +CIQC SL +L +Y +S+T + + C E C P + CT
Sbjct: 124 DLFWVPCDCIQCAPLSSYHGSLDRDLGIYKPSESTTSRHLPCSHELC-----SPASGCTN 178
Query: 154 ANTSCPY-LEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDST 212
CPY ++ + + ++++G ++D++ D G N S+I GCG +QSG+
Sbjct: 179 PKQPCPYNIDYFSENTTSSGLLIEDMLHLDSREGH---APVNASVIIGCGKKQSGSY--L 233
Query: 213 NEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHVVQPEVNKTPL 272
A DG++G G ++ S+ S LA +G VR F+ C + G IF G P TP
Sbjct: 234 EGIAPDGLLGLGMADISVPSFLARAGLVRNSFSMCFKKDDSGRIF-FGDQGVPTQQSTPF 292
Query: 273 VP---NQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVS 329
VP Y++N+ +G G ++D+GT+ LP Y
Sbjct: 293 VPMNGKLQTYAVNVDKYCIGHKCTE--------GAGFQALVDTGTSFTSLPLDAY----- 339
Query: 330 KIISQQPDLKVHTVH---DEYTCFQYSESVD----EGFPNVTFHF-ENSVSLKVYPHEYL 381
K I+ + D +++ D+Y+ F+Y S P +T F EN V P +
Sbjct: 340 KSITMEFDKQINASRASSDDYS-FEYCYSTGPLEMPDVPTITLTFAENKSFQAVNP---I 395
Query: 382 FPFED 386
PF D
Sbjct: 396 LPFND 400
>gi|357482719|ref|XP_003611646.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355512981|gb|AES94604.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 640
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 109/351 (31%), Positives = 169/351 (48%), Gaps = 46/351 (13%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKF 133
G Y ++ IGTPP+ + + VDTGS + +V C C+ C R + L S T +
Sbjct: 87 GYYTTRLWIGTPPQRFALIVDTGSTVTYVPCSTCEHCGRHQDPKFQPDL-----SETYQP 141
Query: 134 VTCDQEFCHGVYGGPLTDCTANTS-CPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTS 192
V C P +C +T+ C Y Y + SS++G +DVV + G+L +
Sbjct: 142 VKCT----------PDCNCDGDTNQCMYDRQYAEMSSSSGVLGEDVVSF----GNLSELA 187
Query: 193 TNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGIN 252
+ +FGC ++G+L S + DGI+G G+ + S++ QL + F+ C G++
Sbjct: 188 PQRA-VFGCENDETGDLYS---QRADGIMGLGRGDLSIMDQLVDKKVISDSFSLCYGGMD 243
Query: 253 -GGGIFAIGHVVQPE-VNKTPLVPNQ-PHYSINMTAVQVGLDFLNLPTDVFGVGDNK-GT 308
GGG +G + PE + T P++ P+Y+IN+ + V L L VF D K GT
Sbjct: 244 VGGGAMILGGISPPEDMVFTHSDPDRSPYYNINLKEMHVAGKKLQLNPKVF---DGKHGT 300
Query: 309 IIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHD---EYTCFQYS----ESVDEGFP 361
++DSGTT AYLPE + I+ ++ LK D + CF + + + FP
Sbjct: 301 VLDSGTTYAYLPETAFLAFKRAIMKERNSLKQINGPDPNYKDICFTGAGIDVSQLAKSFP 360
Query: 362 NVTFHFENSVSLKVYPHEYLFPFEDL---WCIGWQNSGMQSRDRKNMTLLG 409
V FEN L + P YLF + +C+G ++G R TLLG
Sbjct: 361 VVDMVFENGHKLSLSPENYLFRHSKVRGAYCLGVFSNG-----RDPTTLLG 406
>gi|356540982|ref|XP_003538963.1| PREDICTED: uncharacterized protein LOC100811106 [Glycine max]
Length = 813
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 63/133 (47%), Positives = 85/133 (63%), Gaps = 31/133 (23%)
Query: 169 STTGYFVQDVVQYDKVSGDLQTTSTNGSLIFG---------------------------- 200
++TGY+VQD + Y+ V+G+L+T N S+IFG
Sbjct: 641 NSTGYYVQDYLTYNHVNGNLRTAPQNSSIIFGRIMPAVNVQYERIILVVNGIFILLSQLF 700
Query: 201 ---CGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIF 257
CGA QS S++EEALDGIIGFG+SNSS++SQLA+SG V+K+F+HCLD I GGGIF
Sbjct: 701 LVMCGAVQSVTFSSSSEEALDGIIGFGQSNSSVLSQLAASGKVKKIFSHCLDNIRGGGIF 760
Query: 258 AIGHVVQPEVNKT 270
AIG VV+P+V+ +
Sbjct: 761 AIGEVVEPKVSNS 773
>gi|240255485|ref|NP_189841.4| aspartyl protease family protein [Arabidopsis thaliana]
gi|332644216|gb|AEE77737.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 430
Score = 121 bits (304), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 112/396 (28%), Positives = 169/396 (42%), Gaps = 86/396 (21%)
Query: 37 GRERSLSLLKEHDARRQQRILAGVDLPLGGS-----SRPDGV---GLYYAKIGIGTPPKD 88
E L+ L D+ R R+L P+ GS R + LYY + IGTPP++
Sbjct: 34 SHELDLTQLMTFDSARHGRLLQS---PVHGSFNWKVERDTSILLSALYYTTVQIGTPPRE 90
Query: 89 YYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGP 148
V +DTGSD++WV+C C CP + +T +D SS+ + C + C
Sbjct: 91 LDVVIDTGSDLVWVSCNSCVGCPLHN-----VTFFDPGASSSAVKLACSDKRCSSDLQKK 145
Query: 149 LTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGN 208
+ C+ SC Y YGDGS T+GY++ D++ +D +S D + + + RQ
Sbjct: 146 -SRCSLLESCTYKVEYGDGSVTSGYYISDLISFDTMS-DWTYIAFRDNSTWHPWVRQG-- 201
Query: 209 LDSTNEEALDGIIG-FGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHVVQPEV 267
IIG F S+ S ++S
Sbjct: 202 ----------AIIGTFPALCSTPCSTVSSQ------------------------------ 221
Query: 268 NKTPLVPNQPHYSINMTAVQVGLDFLNLPTD--VFGVGDNKGTIIDSGTTLAYLPEMVYE 325
PL N P +S MT V ++ L LP D VF V GTIIDSGTTL + P Y+
Sbjct: 222 ---PLYYN-PQFSHMMT---VAVNDLRLPIDPSVFSVAKGYGTIIDSGTTLVHFPGEAYD 274
Query: 326 PLVSKIISQQPDLKVHTVHDEYTCFQYSESVD------EGFPNVTFHFENSVSLKVYPHE 379
PL+ I++ ++ + CF + + + FP V F S+ + P
Sbjct: 275 PLIQAILNVVSQYGRPIPYESFQCFNITSGISSHLVIADMFPEVHLGFAGGASMVIKPEA 334
Query: 380 YLF-PFEDL----WCIGWQNSGMQSRDRKNMTLLGD 410
YLF F DL WC+G+ +S + +T++G+
Sbjct: 335 YLFQKFLDLTNAIWCLGFYSS-----TSRRITIIGE 365
>gi|242050744|ref|XP_002463116.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
gi|241926493|gb|EER99637.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
Length = 632
Score = 121 bits (303), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 102/354 (28%), Positives = 163/354 (46%), Gaps = 52/354 (14%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKF 133
G Y ++ IGTPP+++ + VD+GS + +V C C++C + L SS+
Sbjct: 87 GYYTTRLYIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDL-----SSSYSP 141
Query: 134 VTCDQEFCHGVYGGPLTDCTANT---SCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQT 190
V C+ DCT ++ C Y Y + SS++G +D+V + + S +L+
Sbjct: 142 VKCN------------VDCTCDSDKKQCTYERQYAEMSSSSGVLGEDIVSFGRES-ELKP 188
Query: 191 TSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDG 250
+FGC ++G+L S + DGI+G G+ S++ QL G + F+ C G
Sbjct: 189 QRA----VFGCENSETGDLFS---QHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGG 241
Query: 251 IN-GGGIFAIGHVVQPE----VNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDN 305
++ GGG +G V P + PL P+Y+I + + V L + + VF
Sbjct: 242 MDIGGGAMVLGGVPAPSDMVFSHSDPL--RSPYYNIELKEIHVAGKALRVDSRVF--NSK 297
Query: 306 KGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHD---EYTCF----QYSESVDE 358
GT++DSGTT AYLPE + + S+ LK D + CF + + E
Sbjct: 298 HGTVLDSGTTYAYLPEQAFVAFKDAVTSKVHSLKKIRGPDPNYKDICFAGAGRNVSKLHE 357
Query: 359 GFPNVTFHFENSVSLKVYPHEYLFPFEDL---WCIGWQNSGMQSRDRKNMTLLG 409
FP+V F N L + P YLF + +C+G +G + TLLG
Sbjct: 358 VFPDVDMVFGNGQKLSLTPENYLFRHSKVDGAYCLGVFQNG-----KDPTTLLG 406
>gi|302769978|ref|XP_002968408.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
gi|300164052|gb|EFJ30662.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
Length = 492
Score = 121 bits (303), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 110/379 (29%), Positives = 174/379 (45%), Gaps = 61/379 (16%)
Query: 42 LSLLKEHDARRQQRILAGVDLPLGGSSRPD------GVGLYYAKIGIGTPPKDYYVQVDT 95
L L+ RR + +L GS+R D G Y +++ IGTPP ++ + VDT
Sbjct: 3 LELVANSHRRRDRELL--------GSARMDLHDDLLTKGYYTSRVKIGTPPHEFSLIVDT 54
Query: 96 GSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQE----FCHGVYGGPLTD 151
GS + +V C C C L SS+ K + C E FC G
Sbjct: 55 GSTVTYVPCSSCTHCGNHQDPRFSPAL-----SSSYKPLECGSECSTGFCDG-------- 101
Query: 152 CTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDS 211
S Y Y + S+++G +DV+ + S DL L+FGC ++G+L
Sbjct: 102 -----SRKYQRQYAEKSTSSGVLGKDVIGFSN-SSDLG----GQRLVFGCETAETGDL-- 149
Query: 212 TNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGIN-GGGIFAIGHVVQP-EVNK 269
++ DGIIG G+ S+I QL + +F+ C G++ GGG +G P ++
Sbjct: 150 -YDQTADGIIGLGRGPLSIIDQLVEKNAMEDVFSLCYGGMDEGGGAMILGGFQPPKDMVF 208
Query: 270 TPLVPNQ-PHYSINMTAVQVGLDFLNLPTDVFGVGDNK-GTIIDSGTTLAYLPEMVYEPL 327
T P++ P+Y++ + ++VG L L +VF D K GT++DSGTT AY P ++
Sbjct: 209 TASDPHRSPYYNLMLKGIRVGGSPLRLKPEVF---DGKYGTVLDSGTTYAYFPGAAFQAF 265
Query: 328 VSKIISQQPDLKVHTVHDEY---TCFQYS----ESVDEGFPNVTFHFENSVSLKVYPHEY 380
S + Q LK DE C+ + ++ + FP+V F F + S+ + P Y
Sbjct: 266 KSAVKEQVGSLKEVPGPDEKFKDICYAGAGTNVSNLSQFFPSVDFVFGDGQSVTLSPENY 325
Query: 381 LFPFEDL---WCIGWQNSG 396
LF + +C+G +G
Sbjct: 326 LFRHTKISGAYCLGVFENG 344
>gi|255586860|ref|XP_002534040.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223525947|gb|EEF28344.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 518
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 96/309 (31%), Positives = 142/309 (45%), Gaps = 34/309 (11%)
Query: 45 LKEHDARRQQRILAGVDLPLGGSS-----RPDGVG-LYYAKIGIGTPPKDYYVQVDTGSD 98
L D + R L V+ PL S R +G L+Y + +GTP + V +DTGSD
Sbjct: 64 LAHRDQMLRGRKLYNVEAPLAFSDGNSTFRISSLGFLHYTTVELGTPGMKFMVALDTGSD 123
Query: 99 IMWVNCIQCKECPRRSSLG----IELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTA 154
+ WV C C +C + EL++YD K SST K VTC+ C C
Sbjct: 124 LFWVPC-DCSKCAPTQGVAYASDFELSIYDPKQSSTSKKVTCNNNLC-----AHRNRCLG 177
Query: 155 N-TSCPYLEIYGDG-SSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDST 212
+SCPY+ Y +ST+G V+DV+ S D S + FGCG QSG+
Sbjct: 178 TFSSCPYMVSYVSAQTSTSGILVEDVLHL--TSEDSNQESIKAYVTFGCGQVQSGSF--L 233
Query: 213 NEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHVVQPEVNKTPL 272
N A +G+ G G S+ S L+ G F+ C G +G G + G P+ +TP
Sbjct: 234 NTAAPNGLFGLGMDQISVPSILSREGLTADSFSMCF-GHDGVGRISFGDKGSPDQEETPF 292
Query: 273 --VPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSK 330
P+ P Y+I++T V+VG +++ + + DSGT+ YL +Y +
Sbjct: 293 NSNPSHPSYNISVTQVRVGTTLVDV---------DFTALFDSGTSFTYLINPIYAMVSEN 343
Query: 331 IISQQPDLK 339
+Q D +
Sbjct: 344 FHAQAQDKR 352
>gi|242073260|ref|XP_002446566.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
gi|241937749|gb|EES10894.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
Length = 452
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 113/377 (29%), Positives = 163/377 (43%), Gaps = 58/377 (15%)
Query: 37 GRERSLSLLKEHDAR---RQQRILA---GVDLPLGGSSRP----DGVGLYYAKIGIGTPP 86
G L LL+ R R R++A GV GG G G + + IGTP
Sbjct: 51 GNYSRLQLLQRAARRSHHRMSRLVARATGVKAVAGGGDLQVPVHAGNGEFLMDVAIGTPA 110
Query: 87 KDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYG 146
Y VDTGSD++W C C +C ++S+ ++D SST V C C +
Sbjct: 111 LSYAAIVDTGSDLVWTQCKPCVDCFKQST-----PVFDPSSSSTYATVPCSSALCSDL-- 163
Query: 147 GPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQS 206
P + CT+ + C Y YGD SST G + K L + FGCG
Sbjct: 164 -PTSTCTSASKCGYTYTYGDASSTQGVLASETFTLGKEKKKLP------GVAFGCGDTNE 216
Query: 207 GNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL----DG-------INGGG 255
G D + A G++G G+ S++SQL G+ K F++CL DG + G
Sbjct: 217 G--DGFTQGA--GLVGLGRGPLSLVSQL----GLDK-FSYCLTSLDDGDGKSPLLLGGSA 267
Query: 256 IFAIGHVVQPEVNKTPLV--PNQPH-YSINMTAVQVGLDFLNLPTDVFGVGDN--KGTII 310
V TPLV P+QP Y +++T + VG + LP F + D+ G I+
Sbjct: 268 AAISESAATAPVQTTPLVKNPSQPSFYYVSLTGLTVGSTRITLPASAFAIQDDGTGGVIV 327
Query: 311 DSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYT----CFQ-YSESVDE-GFPNVT 364
DSGT++ YL Y L ++Q + + TV CFQ ++ VDE P +
Sbjct: 328 DSGTSITYLELQGYRALKKAFVAQ---MALPTVDGSEIGLDLCFQGPAKGVDEVQVPKLV 384
Query: 365 FHFENSVSLKVYPHEYL 381
HF+ L + Y+
Sbjct: 385 LHFDGGADLDLPAENYM 401
>gi|225438908|ref|XP_002279194.1| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
Length = 634
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 111/389 (28%), Positives = 175/389 (44%), Gaps = 56/389 (14%)
Query: 40 RSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDI 99
R+LS + H R + A +PL P G Y +I IGTPP+ + + VDTGS +
Sbjct: 58 RTLSHSRRHLQRSESHSTATARMPLYDDLIP--YGYYTTRIWIGTPPQTFALIVDTGSTL 115
Query: 100 MWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTS-- 157
+V C C++C + + SST + + C E CT ++
Sbjct: 116 TYVPCSTCEQCGKHQDPNFQPDW-----SSTYQPLKCSME------------CTCDSEMM 158
Query: 158 -CPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEA 216
C Y Y + SS++G +D+V + K S +L+ T +FGC ++G++ S +
Sbjct: 159 HCVYDRQYAEMSSSSGVLGEDIVSFGKQS-ELKPQRT----VFGCENVETGDIYS---QR 210
Query: 217 LDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGIN-GGGIFAIGHVVQPE----VNKTP 271
DGI+G G+ + S++ QL G + F+ C G++ GGG +G + P + P
Sbjct: 211 ADGIMGLGRGDLSIVDQLVEKGVIGNSFSLCYGGMDVGGGAMVLGGISPPAGMVFTHSDP 270
Query: 272 LVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKI 331
+Y+I++ + + LP + GTI+DSGTT AYLPE ++ I
Sbjct: 271 A--RSAYYNIDLKEIHIAGK--QLPINPMVFDGKYGTILDSGTTYAYLPEPAFKAFKDAI 326
Query: 332 ISQQPDLKVHTVHDEY---TCFQYSES----VDEGFPNVTFHFENSVSLKVYPHEYLFPF 384
+ + LK+ D CF S + + FP V F N L + P YLF
Sbjct: 327 MKELNSLKLIQGPDRNYNDICFSGVGSDVSQLSKTFPAVDLVFSNGNRLSLSPENYLFQH 386
Query: 385 ED---LWCIG-WQNSGMQSRDRKNMTLLG 409
+C+G +QN Q+ TLLG
Sbjct: 387 SKAHGAYCLGIFQNENDQT------TLLG 409
>gi|125547762|gb|EAY93584.1| hypothetical protein OsI_15370 [Oryza sativa Indica Group]
Length = 202
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 75/207 (36%), Positives = 108/207 (52%), Gaps = 17/207 (8%)
Query: 3 LCLRNCLCIVLIATAAVGGVSSNHGVFSVKYRYA-----GRERSLSLLKEHDARRQQRIL 57
L L L +L+A++ G V+ G+F V+ +++ + + L+ HD R L
Sbjct: 4 LFLSAILSALLVASSTRGTVAI--GLFQVRRKFSIMGGGCKGSDIGALQTHDRNRHLSRL 61
Query: 58 AGVDLPLGG----SSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRR 113
D LGG S+ G Y + G+ ++ VDTGS WVNCI CK+CPR+
Sbjct: 62 VAADFSLGGLGGISTSSTG---YMLQCSFGSI---HFFLVDTGSSAFWVNCIPCKQCPRK 115
Query: 114 SSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGY 173
S + +LTLYD + S + K V CD FC +C + CP++ Y DG ST G
Sbjct: 116 SDILKKLTLYDPRSSVSSKVVKCDDMFCTSPDRDVQPECNTSLLCPFIATYADGGSTIGA 175
Query: 174 FVQDVVQYDKVSGDLQTTSTNGSLIFG 200
FV D+V Y+++SG+ T STN SL FG
Sbjct: 176 FVTDLVHYNQLSGNGLTQSTNTSLTFG 202
>gi|226504334|ref|NP_001141706.1| uncharacterized protein LOC100273835 precursor [Zea mays]
gi|194705620|gb|ACF86894.1| unknown [Zea mays]
gi|414885968|tpg|DAA61982.1| TPA: hypothetical protein ZEAMMB73_231717 [Zea mays]
Length = 477
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 98/355 (27%), Positives = 153/355 (43%), Gaps = 45/355 (12%)
Query: 54 QRILAGVDLPLGGSSRPDGVGL------------YYAKIGIGTPPKDYYVQVDTGSDIMW 101
+R +A V SS+P GV L Y+ + +GTP D V++DTGSD W
Sbjct: 101 RRKVAAVTT-AASSSKPKGVPLQVGWGKYLDTTNYFTSLRLGTPATDLLVELDTGSDQSW 159
Query: 102 VNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYL 161
+ C C +C + L+D SST +TC C + +C+++ CPY
Sbjct: 160 IQCKPCPDCYEQHE-----ALFDPSKSSTYSDITCSSRECQELGSSHKHNCSSDKKCPYE 214
Query: 162 EIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGII 221
Y D S T G +D + L T +FGCG +G+ +DG++
Sbjct: 215 ITYADDSYTVGNLARDTLT-------LSPTDAVPGFVFGCGHNNAGSFGE-----IDGLL 262
Query: 222 GFGKSNSSMISQLASSGGVRKMFAHCL---DGINGGGIFAIGHVVQP-EVNKTPLVPNQ- 276
G G+ +S+ SQ+A+ G F++CL G F+ P T +V Q
Sbjct: 263 GLGRGKASLSSQVAARYGAG--FSYCLPSSPSATGYLSFSGAAAAAPTNAQFTEMVAGQH 320
Query: 277 -PHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQ 335
Y +N+T + V + +P VF GTIIDSGT + LP Y L S + S
Sbjct: 321 PSFYYLNLTGITVAGRAIKVPPSVFATA--AGTIIDSGTAFSCLPPSAYAALRSSVRSAM 378
Query: 336 PDLK---VHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFEDL 387
K T+ D TC+ + P+V F + ++ ++P L+ + ++
Sbjct: 379 GRYKRAPSSTIFD--TCYDLTGHETVRIPSVALVFADGATVHLHPSGVLYTWSNV 431
>gi|296087361|emb|CBI33735.3| unnamed protein product [Vitis vinifera]
Length = 633
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 111/389 (28%), Positives = 175/389 (44%), Gaps = 56/389 (14%)
Query: 40 RSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDI 99
R+LS + H R + A +PL P G Y +I IGTPP+ + + VDTGS +
Sbjct: 58 RTLSHSRRHLQRSESHSTATARMPLYDDLIP--YGYYTTRIWIGTPPQTFALIVDTGSTL 115
Query: 100 MWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTS-- 157
+V C C++C + + SST + + C E CT ++
Sbjct: 116 TYVPCSTCEQCGKHQDPNFQPDW-----SSTYQPLKCSME------------CTCDSEMM 158
Query: 158 -CPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEA 216
C Y Y + SS++G +D+V + K S +L+ T +FGC ++G++ S +
Sbjct: 159 HCVYDRQYAEMSSSSGVLGEDIVSFGKQS-ELKPQRT----VFGCENVETGDIYS---QR 210
Query: 217 LDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGIN-GGGIFAIGHVVQPE----VNKTP 271
DGI+G G+ + S++ QL G + F+ C G++ GGG +G + P + P
Sbjct: 211 ADGIMGLGRGDLSIVDQLVEKGVIGNSFSLCYGGMDVGGGAMVLGGISPPAGMVFTHSDP 270
Query: 272 LVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKI 331
+Y+I++ + + LP + GTI+DSGTT AYLPE ++ I
Sbjct: 271 A--RSAYYNIDLKEIHIAGK--QLPINPMVFDGKYGTILDSGTTYAYLPEPAFKAFKDAI 326
Query: 332 ISQQPDLKVHTVHDEY---TCFQYSES----VDEGFPNVTFHFENSVSLKVYPHEYLFPF 384
+ + LK+ D CF S + + FP V F N L + P YLF
Sbjct: 327 MKELNSLKLIQGPDRNYNDICFSGVGSDVSQLSKTFPAVDLVFSNGNRLSLSPENYLFQH 386
Query: 385 ED---LWCIG-WQNSGMQSRDRKNMTLLG 409
+C+G +QN Q+ TLLG
Sbjct: 387 SKAHGAYCLGIFQNENDQT------TLLG 409
>gi|326511104|dbj|BAK03251.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 631
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 103/353 (29%), Positives = 160/353 (45%), Gaps = 50/353 (14%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKF 133
G Y ++ IGTPP+++ + VD+GS + +V C C++C + L SST
Sbjct: 86 GYYTTRLHIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDL-----SSTYSP 140
Query: 134 VTCDQEFCHGVYGGPLTDCTANT---SCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQT 190
V C+ DCT ++ C Y Y + SS++G +D+V + S +L+
Sbjct: 141 VKCN------------VDCTCDSDKNQCTYERQYAEMSSSSGVLGEDIVSFGTES-ELKP 187
Query: 191 TSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDG 250
+FGC ++G+L S + DGI+G G+ S++ QL G + F+ C G
Sbjct: 188 QRA----VFGCENSETGDLFSQHA---DGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGG 240
Query: 251 IN-GGGIFAIGHVVQPE--VNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNK- 306
++ GGG +G + P + P+Y+I + + V L + +F D K
Sbjct: 241 MDIGGGAMVLGAMPAPPGMIYTHSNAVRSPYYNIELKEMHVAGKALRVDPRIF---DGKH 297
Query: 307 GTIIDSGTTLAYLPEMVYEPLVSKIISQ-QPDLKVHTVHDEYT--CF----QYSESVDEG 359
GT++DSGTT AYLPE + + SQ P K+ Y CF + + E
Sbjct: 298 GTVLDSGTTYAYLPEQAFVAFKDAVSSQVHPLKKIRGPDSNYKDICFAGAGRNVSQLSEV 357
Query: 360 FPNVTFHFENSVSLKVYPHEYLF---PFEDLWCIGWQNSGMQSRDRKNMTLLG 409
FP V F N L + P YLF E +C+G +G + TLLG
Sbjct: 358 FPKVDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQNG-----KDPTTLLG 405
>gi|326501422|dbj|BAK02500.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 631
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 103/353 (29%), Positives = 160/353 (45%), Gaps = 50/353 (14%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKF 133
G Y ++ IGTPP+++ + VD+GS + +V C C++C + L SST
Sbjct: 86 GYYTTRLHIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDL-----SSTYSP 140
Query: 134 VTCDQEFCHGVYGGPLTDCTANT---SCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQT 190
V C+ DCT ++ C Y Y + SS++G +D+V + S +L+
Sbjct: 141 VKCN------------VDCTCDSDKNQCTYERQYAEMSSSSGVLGEDIVSFGTES-ELKP 187
Query: 191 TSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDG 250
+FGC ++G+L S + DGI+G G+ S++ QL G + F+ C G
Sbjct: 188 QRA----VFGCENSETGDLFSQHA---DGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGG 240
Query: 251 IN-GGGIFAIGHVVQPE--VNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNK- 306
++ GGG +G + P + P+Y+I + + V L + +F D K
Sbjct: 241 MDIGGGAMVLGAMPAPPGMIYTHSNAVRSPYYNIELKEMHVAGKALRVDPRIF---DGKH 297
Query: 307 GTIIDSGTTLAYLPEMVYEPLVSKIISQ-QPDLKVHTVHDEYT--CF----QYSESVDEG 359
GT++DSGTT AYLPE + + SQ P K+ Y CF + + E
Sbjct: 298 GTVLDSGTTYAYLPEQAFVAFKDAVSSQVHPLKKIRGPDPNYKDICFAGAGRNVSQLSEV 357
Query: 360 FPNVTFHFENSVSLKVYPHEYLF---PFEDLWCIGWQNSGMQSRDRKNMTLLG 409
FP V F N L + P YLF E +C+G +G + TLLG
Sbjct: 358 FPKVDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQNG-----KDPTTLLG 405
>gi|357122155|ref|XP_003562781.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 629
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 102/355 (28%), Positives = 163/355 (45%), Gaps = 54/355 (15%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKF 133
G Y ++ IGTPP+++ + VD+GS + +V C C++C + L SST
Sbjct: 83 GYYTTRLYIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDL-----SSTYSP 137
Query: 134 VTCDQEFCHGVYGGPLTDCTAN---TSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQT 190
V C DCT + + C Y Y + SS++G +D+V + S +L+
Sbjct: 138 VKCS------------ADCTCDSDKSQCTYERQYAEMSSSSGVLGEDIVSFGTES-ELKP 184
Query: 191 TSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDG 250
+FGC ++G+L S + DGI+G G+ S++ QL G + F+ C G
Sbjct: 185 QRA----VFGCENSETGDLFSQHA---DGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGG 237
Query: 251 IN-GGGIFAIGHVVQPE----VNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDN 305
++ GGG +G + P P+ P+Y+I + + V L L +F D+
Sbjct: 238 MDIGGGAMVLGAMPAPPDMVFSRSDPV--RSPYYNIELKEIHVAGKALRLDPRIF---DS 292
Query: 306 K-GTIIDSGTTLAYLPEMVYEPLVSKIISQ-QPDLKVHTVHDEYT--CF----QYSESVD 357
K GT++DSGTT AYLPE + + S+ +P K+ Y CF + +
Sbjct: 293 KHGTVLDSGTTYAYLPEQAFVAFKDAVTSKVRPLKKIRGPDPNYKDICFAGAGRNVSQLS 352
Query: 358 EGFPNVTFHFENSVSLKVYPHEYLF---PFEDLWCIGWQNSGMQSRDRKNMTLLG 409
+ FP+V F + L + P YLF E +C+G +G + TLLG
Sbjct: 353 QAFPDVDMVFGDGQKLSLSPENYLFRHSKVEGAYCLGVFQNG-----KDPTTLLG 402
>gi|356516413|ref|XP_003526889.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 111/369 (30%), Positives = 167/369 (45%), Gaps = 54/369 (14%)
Query: 29 FSVKYRYAGRERSLSLLK--EHDARRQQRILAGVD-LPLGGSSRPD-----------GVG 74
F V R+ ++L+ L+ +H +R + L ++ + L SS PD G G
Sbjct: 47 FRVMLRHVDSGKNLTKLERVQHGIKRGKSRLQKLNAMVLAASSTPDSEDQLEAPIHAGNG 106
Query: 75 LYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFV 134
Y ++ IGTPP Y +DTGSD++W C C C ++ + ++D K SS+ V
Sbjct: 107 EYLIELAIGTPPVSYPAVLDTGSDLIWTQCKPCTRCYKQPT-----PIFDPKKSSSFSKV 161
Query: 135 TCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTN 194
+C C L T + C Y+ YGD S T G + + K + +
Sbjct: 162 SCGSSLCSA-----LPSSTCSDGCEYVYSYGDYSMTQGVLATETFTFGKSKNKVSVHNIG 216
Query: 195 GSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGIN-- 252
FGCG G+ E G++G G+ S++SQL + F++CL I+
Sbjct: 217 ----FGCGEDNEGD----GFEQASGLVGLGRGPLSLVSQLK-----EQRFSYCLTPIDDT 263
Query: 253 GGGIFAIGHVVQ----PEVNKTPLVPN--QPH-YSINMTAVQVGLDFLNLPTDVFGVGD- 304
+ +G + + EV TPL+ N QP Y +++ A+ VG L++ F VGD
Sbjct: 264 KESVLLLGSLGKVKDAKEVVTTPLLKNPLQPSFYYLSLEAISVGDTRLSIEKSTFEVGDD 323
Query: 305 -NKGTIIDSGTTLAYLPEMVYEPLVSKIISQQP---DLKVHTVHDEYTCFQY-SESVDEG 359
N G IIDSGTT+ Y+ + YE L + ISQ D T D CF S S
Sbjct: 324 GNGGVIIDSGTTITYVQQKAYEALKKEFISQTKLALDKTSSTGLD--LCFSLPSGSTQVE 381
Query: 360 FPNVTFHFE 368
P + FHF+
Sbjct: 382 IPKLVFHFK 390
>gi|242062704|ref|XP_002452641.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
gi|241932472|gb|EES05617.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
Length = 466
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 90/324 (27%), Positives = 148/324 (45%), Gaps = 33/324 (10%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y +GIG+P + +DTGSD+ WV C C +C +L+D SST +
Sbjct: 131 YVITVGIGSPAVTQTMSMDTGSDVSWVQCKPCSQCHSEVD-----SLFDPSASSTYSPFS 185
Query: 136 CDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNG 195
C C + + +++ C Y+ Y DGSSTTG + D + L + + G
Sbjct: 186 CSSAACVQLSQSQQGNGCSSSQCQYIVSYVDGSSTTGTYSSDTL-------TLGSNAIKG 238
Query: 196 SLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGING-G 254
FGC +SG + DG++G G S++SQ A + G K F++CL G
Sbjct: 239 -FQFGCSQSESGGF----SDQTDGLMGLGGDAQSLVSQTAGTFG--KAFSYCLPPTPGSS 291
Query: 255 GIFAIGHVVQPEVNKTPLVPNQ---PHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIID 311
G +G + KTP++ + +Y + + A++VG LN+PT VF + G+++D
Sbjct: 292 GFLTLGAASRSGFVKTPMLRSTQIPTYYGVLLEAIRVGGQQLNIPTSVF----SAGSVMD 347
Query: 312 SGTTLAYLPEMVYEPLVSKI---ISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFE 368
SGT + LP Y L S + + P + + D TCF +S P+V F
Sbjct: 348 SGTVITRLPPTAYSALSSAFKAGMKKYPPAQPSGILD--TCFDFSGQSSVSIPSVALVFS 405
Query: 369 NSVSLKVYPHEYLFPFEDLWCIGW 392
+ + + + ++ WC+ +
Sbjct: 406 GGAVVNLDFNGIMLELDN-WCLAF 428
>gi|242044886|ref|XP_002460314.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
gi|241923691|gb|EER96835.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
Length = 444
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 106/351 (30%), Positives = 153/351 (43%), Gaps = 54/351 (15%)
Query: 40 RSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDI 99
+SL+ L DA RIL G Y ++GIGTP + Y +DTGSD+
Sbjct: 67 QSLATLAPGDAITAARILVLAS-----------DGEYLMEMGIGTPARFYSAILDTGSDL 115
Query: 100 MWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCP 159
+W C C C + + +D +SST + + C C+ +Y PL C T C
Sbjct: 116 IWTQCAPCLLCVDQPT-----PYFDPANSSTYRSLGCSAPACNALY-YPL--CYQKT-CV 166
Query: 160 YLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDG 219
Y YGD +ST G + + G T T + FGCG +G+L + + G
Sbjct: 167 YQYFYGDSASTAGVLANETFTF----GTNDTRVTLPRISFGCGNLNAGSLANGS-----G 217
Query: 220 IIGFGKSNSSMISQLASSGGVRKMFAHCLDG--------INGGGIFAIGHVVQPEVNKTP 271
++GFG+ + S++SQL S F++CL + G + V TP
Sbjct: 218 MVGFGRGSLSLVSQLGS-----PRFSYCLTSFLSPVRSRLYFGAYATLNSTNASTVQSTP 272
Query: 272 LV--PNQP-HYSINMTAVQVGLDFLNLPTDVFGVGDNK---GTIIDSGTTLAYLPEMVY- 324
+ P P Y +NMT + VG + L + V + D GTIIDSGTT+ YL E Y
Sbjct: 273 FIINPALPTMYFLNMTGISVGGNRLPIDPAVLAINDTDGTGGTIIDSGTTITYLAEPAYY 332
Query: 325 ---EPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEG--FPNVTFHFENS 370
E V + S P L V TCFQ+ + P + HF+ +
Sbjct: 333 AVREAFVLYLNSTLPLLDVTETSVLDTCFQWPPPPRQSVTLPQLVLHFDGA 383
>gi|359492825|ref|XP_002284255.2| PREDICTED: aspartic proteinase-like protein 1-like [Vitis vinifera]
Length = 531
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 110/377 (29%), Positives = 166/377 (44%), Gaps = 43/377 (11%)
Query: 22 VSSNHGVFSVKYRYAGRERSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGV-------G 74
VS N +F+ + LL D +RQ+ L L S D +
Sbjct: 42 VSRNGNIFADSWPKKRSFDYYRLLLSSDLKRQKLKLGAEYQLLFPSEGSDALFLGNEFGW 101
Query: 75 LYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSS-----LGIELTLYDIKDSS 129
L+Y I IGTP + V +D GSD++WV C C +C S+ LG +L Y SS
Sbjct: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWVPC-DCMQCAPLSASYYDRLGRDLNEYSPSLSS 160
Query: 130 TGKFVTCDQEFCHGVYGGPLTDC-TANTSCPYL-EIYGDGSSTTGYFVQDVVQYDKVSGD 187
T K ++C+ + C G +DC ++ CPYL Y + +S++G ++D + S
Sbjct: 161 TSKPLSCNDQLCE--LG---SDCKSSKDPCPYLASYYSENTSSSGLLIEDRLHLAPFSEH 215
Query: 188 LQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHC 247
+S S+I GCG +QSG ++ A DG++G G + S+ S LA +G VR F+ C
Sbjct: 216 ASRSSVWASVIIGCGRKQSGAF--SDGAAPDGLMGLGPGDLSVPSLLAKAGLVRNTFSIC 273
Query: 248 LDGINGGGIFA--IGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDN 305
D + G I G V Q + PL Y I + VG +L T F
Sbjct: 274 FDDNHSGTILFGDQGLVTQKSTSFVPLEGKFVTYLIEVEGYLVGSS--SLKTAGF----- 326
Query: 306 KGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYT------CFQYSESVDEG 359
++DSGT+ +LP +YE +V + D +V+ + C+ S
Sbjct: 327 -QALVDSGTSFTFLPYEIYEKIVVEF-----DKQVNATRSSFKGSPWKYCYNSSSQELLN 380
Query: 360 FPNVTFHFENSVSLKVY 376
P VT F + S V+
Sbjct: 381 IPTVTLVFAMNQSFIVH 397
>gi|302141912|emb|CBI19115.3| unnamed protein product [Vitis vinifera]
Length = 521
Score = 119 bits (297), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 110/377 (29%), Positives = 166/377 (44%), Gaps = 43/377 (11%)
Query: 22 VSSNHGVFSVKYRYAGRERSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGV-------G 74
VS N +F+ + LL D +RQ+ L L S D +
Sbjct: 32 VSRNGNIFADSWPKKRSFDYYRLLLSSDLKRQKLKLGAEYQLLFPSEGSDALFLGNEFGW 91
Query: 75 LYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSS-----LGIELTLYDIKDSS 129
L+Y I IGTP + V +D GSD++WV C C +C S+ LG +L Y SS
Sbjct: 92 LHYTWIDIGTPNVSFLVALDAGSDLLWVPC-DCMQCAPLSASYYDRLGRDLNEYSPSLSS 150
Query: 130 TGKFVTCDQEFCHGVYGGPLTDC-TANTSCPYL-EIYGDGSSTTGYFVQDVVQYDKVSGD 187
T K ++C+ + C G +DC ++ CPYL Y + +S++G ++D + S
Sbjct: 151 TSKPLSCNDQLCE--LG---SDCKSSKDPCPYLASYYSENTSSSGLLIEDRLHLAPFSEH 205
Query: 188 LQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHC 247
+S S+I GCG +QSG ++ A DG++G G + S+ S LA +G VR F+ C
Sbjct: 206 ASRSSVWASVIIGCGRKQSGAF--SDGAAPDGLMGLGPGDLSVPSLLAKAGLVRNTFSIC 263
Query: 248 LDGINGGGIFA--IGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDN 305
D + G I G V Q + PL Y I + VG +L T F
Sbjct: 264 FDDNHSGTILFGDQGLVTQKSTSFVPLEGKFVTYLIEVEGYLVGSS--SLKTAGFQA--- 318
Query: 306 KGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYT------CFQYSESVDEG 359
++DSGT+ +LP +YE +V + D +V+ + C+ S
Sbjct: 319 ---LVDSGTSFTFLPYEIYEKIVVEF-----DKQVNATRSSFKGSPWKYCYNSSSQELLN 370
Query: 360 FPNVTFHFENSVSLKVY 376
P VT F + S V+
Sbjct: 371 IPTVTLVFAMNQSFIVH 387
>gi|115452685|ref|NP_001049943.1| Os03g0318400 [Oryza sativa Japonica Group]
gi|108707841|gb|ABF95636.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548414|dbj|BAF11857.1| Os03g0318400 [Oryza sativa Japonica Group]
Length = 434
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 122/444 (27%), Positives = 190/444 (42%), Gaps = 69/444 (15%)
Query: 9 LCIVLIATAAVGGVSSNHGVFSVKYRY---------AGRERSLSLLKEHDARRQQRILAG 59
L V++ A +S + +V+ + A RE + AR +R+ +
Sbjct: 4 LAFVIVTLLAALAISRCNAAATVRMQLTHADAGRGLAARELMQRMALRSKARAARRLSSS 63
Query: 60 VDLPLGGSSRPDGVGL--YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLG 117
P+ + +GV Y + IGTPP+ + +DTGSD++W C C C ++
Sbjct: 64 ASAPVSPGTYDNGVPTTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQA--- 120
Query: 118 IELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTA-----NTSCPYLEIYGDGSSTTG 172
L +D SST +CD C G+ P+ C + N +C Y YGD S TTG
Sbjct: 121 --LPYFDPSTSSTLSLTSCDSTLCQGL---PVASCGSPKFWPNQTCVYTYSYGDKSVTTG 175
Query: 173 YFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMIS 232
+ ++ DK + S G + FGCG +G S NE GI GFG+ S+ S
Sbjct: 176 F-----LEVDKFTFVGAGASVPG-VAFGCGLFNNGVFKS-NET---GIAGFGRGPLSLPS 225
Query: 233 QLASSGGVRKMFAHCLDGING-----------GGIFAIGHVVQPEVNKTPLVPNQPH--- 278
QL F+HC +NG ++ G + V TPL+ N +
Sbjct: 226 QLKVGN-----FSHCFTAVNGLKPSTVLLDLPADLYKSG---RGAVQSTPLIQNPANPTF 277
Query: 279 YSINMTAVQVGLDFLNLPTDVFGVGDNK-GTIIDSGTTLAYLPEMVYEPLVSKIISQQPD 337
Y +++ + VG L +P F + + GTIIDSGT + LP VY LV + Q
Sbjct: 278 YYLSLKGITVGSTRLPVPESEFALKNGTGGTIIDSGTAMTSLPTRVYR-LVRDAFAAQVK 336
Query: 338 LKVHT--VHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFED----LWCI- 390
L V + D Y C P + HFE + ++ + Y+F ED + C+
Sbjct: 337 LPVVSGNTTDPYFCLSAPLRAKPYVPKLVLHFEGA-TMDLPRENYVFEVEDAGSSILCLA 395
Query: 391 ---GWQNSGMQSRDRKNMTLLGDF 411
G + + + + ++NM +L D
Sbjct: 396 IIEGGEVTTIGNFQQQNMHVLYDL 419
>gi|343172996|gb|AEL99201.1| aspartyl protease family protein, partial [Silene latifolia]
Length = 584
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 103/343 (30%), Positives = 157/343 (45%), Gaps = 46/343 (13%)
Query: 82 IGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFC 141
IGTPP+++ + VDTGS + +V C C +C + L D
Sbjct: 2 IGTPPQEFALIVDTGSTVTYVPCNSCDQCGNHQDPKFQPDLSDTY--------------- 46
Query: 142 HGVYGGPLTDC-TANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFG 200
H V P C T N C Y Y + SS++G +D+V + +S +L+ +FG
Sbjct: 47 HPVKCNPDCTCDTENDQCTYERQYAEMSSSSGILGEDLVSFGNMS-ELKPQRA----VFG 101
Query: 201 CGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGIN-GGGIFAI 259
C ++G+L S + DGI+G G+ + S++ QL G + F+ C G+ GGG +
Sbjct: 102 CENAETGDLFSQHA---DGIMGLGRGDLSIVDQLVEKGVINDSFSLCYGGMEVGGGAMVL 158
Query: 260 GHVVQPE--VNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNK-GTIIDSGTTL 316
G + P V P+Y+I + + V L++ VF D K GTI+DSGTT
Sbjct: 159 GQISPPSDMVFSHSDPDRSPYYNIELRGLHVAGKKLDINPQVF---DGKHGTILDSGTTY 215
Query: 317 AYLPEMVYEPLVSKIISQQPDLK-VHTVHDEY--TCFQYSES----VDEGFPNVTFHFEN 369
AYLPE + P + I S+ LK + Y CF + S + + FP+V F+N
Sbjct: 216 AYLPEAAFLPFIQAITSELHGLKQIRGPDPNYNDVCFSGAGSEIPELYKTFPSVDMVFDN 275
Query: 370 SVSLKVYPHEYLFPFEDL---WCIGWQNSGMQSRDRKNMTLLG 409
+ P YLF + +C+G +G + TLLG
Sbjct: 276 GEKYSLSPENYLFKHSKVHGAYCLGVFQNG-----KDPTTLLG 313
>gi|168013126|ref|XP_001759252.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689565|gb|EDQ75936.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 385
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 101/335 (30%), Positives = 142/335 (42%), Gaps = 32/335 (9%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y+ ++ +GTPP+ Y+ +DTGSDI+W+ C C C + ++D SST
Sbjct: 33 GSGEYFIRVSVGTPPRGMYLVMDTGSDILWLQCAPCVSCYHQCD-----EVFDPYKSSTY 87
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
+ C+ C + G C N C Y YGDGS +TG F D V + SG Q
Sbjct: 88 STLGCNSRQCLNLDVG---GCVGN-KCLYQVDYGDGSFSTGEFATDAVSLNSTSGGGQVV 143
Query: 192 STNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI 251
+ GCG G GK S +Q+ S G R F++CL G
Sbjct: 144 LNK--IPLGCGHDNEGYFVGAAGLLGL-----GKGPLSFPNQINSENGGR--FSYCLTGR 194
Query: 252 NGGG------IFAIGHVVQPEVNKTPLVPN---QPHYSINMTAVQVGLDFLNLPTDVFGV 302
+ IF V V TP N Y + MT + VG L +PT F +
Sbjct: 195 DTDSTERSSLIFGDAAVPPAGVRFTPQASNLRVSTFYYLKMTGISVGGSILTIPTSAFQL 254
Query: 303 GD--NKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEY-TCFQYSESVDEG 359
N G IIDSGT++ L Y L + DL + T + TC+ S+
Sbjct: 255 DSLGNGGVIIDSGTSVTRLQNAAYASLREAFRAGTSDLVLTTEFSLFDTCYNLSDLSSVD 314
Query: 360 FPNVTFHFENSVSLKVYPHEYLFPFED--LWCIGW 392
P VT HF+ LK+ YL P ++ +C+ +
Sbjct: 315 VPTVTLHFQGGADLKLPASNYLVPVDNSSTFCLAF 349
>gi|218192707|gb|EEC75134.1| hypothetical protein OsI_11325 [Oryza sativa Indica Group]
Length = 401
Score = 118 bits (296), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 119/415 (28%), Positives = 171/415 (41%), Gaps = 69/415 (16%)
Query: 9 LCIVLIATAAVGGVSSNHGVFSVKYRY---------AGRERSLSLLKEHDARRQQRILAG 59
L V++ A +S + +V+ + A RE + AR +R+ +
Sbjct: 4 LAFVIVTLLAALAISRCNAAATVRMQLTHADAGRGLAARELMQRMALRSKARAARRLSSS 63
Query: 60 VDLPLGGSSRPDGVGL--YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLG 117
P+ + +GV Y + IGTPP+ + +DTGSD++W C C C
Sbjct: 64 ASAPVSPGTYDNGVPTTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPAC-----FD 118
Query: 118 IELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTA-----NTSCPYLEIYGDGSSTTG 172
L +D SST +CD C G+ P+ C + N +C Y YGD S TTG
Sbjct: 119 QALPYFDPSTSSTLSLTSCDSTLCQGL---PVASCGSPKFWPNQTCVYTYSYGDKSVTTG 175
Query: 173 YFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMIS 232
+ ++ DK + S G + FGCG +G S NE GI GFG+ S+ S
Sbjct: 176 F-----LEVDKFTFVGAGASVPG-VAFGCGLFNNGVFKS-NET---GIAGFGRGPLSLPS 225
Query: 233 QLASSGGVRKMFAHCLDGING-----------GGIFAIGHVVQPEVNKTPLVPNQPH--- 278
QL F+HC +NG ++ G + V TPL+ N +
Sbjct: 226 QLKVGN-----FSHCFTAVNGLKPSTVLLDLPADLYKSG---RGAVQSTPLIQNPANPTF 277
Query: 279 YSINMTAVQVGLDFLNLPTDVFGVGDNK-GTIIDSGTTLAYLPEMVYEPLVSKIISQQPD 337
Y +++ + VG L +P F + + GTIIDSGT + LP VY LV + Q
Sbjct: 278 YYLSLKGITVGSTRLPVPESEFALKNGTGGTIIDSGTAMTSLPTRVYR-LVRDAFAAQVK 336
Query: 338 LKVHT--VHDEYTCFQYSESVDEGFPNVTFHF---------ENSVSLKVYPHEYL 381
L V + D Y C P + HF EN V LK YP L
Sbjct: 337 LPVVSGNTTDPYFCLSAPLRAKPYVPKLVLHFEGATMDLPRENYVWLKHYPKRLL 391
>gi|343172998|gb|AEL99202.1| aspartyl protease family protein, partial [Silene latifolia]
Length = 584
Score = 118 bits (296), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 103/343 (30%), Positives = 157/343 (45%), Gaps = 46/343 (13%)
Query: 82 IGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFC 141
IGTPP+++ + VDTGS + +V C C +C + L D
Sbjct: 2 IGTPPQEFALIVDTGSTVTYVPCNSCDQCGNHQDPKFQPDLSDTY--------------- 46
Query: 142 HGVYGGPLTDC-TANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFG 200
H V P C T N C Y Y + SS++G +D+V + +S +L+ +FG
Sbjct: 47 HPVKCNPDCTCDTENDQCTYERQYAEMSSSSGILGEDLVSFGNMS-ELKPQRA----VFG 101
Query: 201 CGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGIN-GGGIFAI 259
C ++G+L S + DGI+G G+ + S++ QL G + F+ C G+ GGG +
Sbjct: 102 CENAETGDLFS---QHADGIMGLGRGDLSIVDQLVEKGVINDSFSLCYGGMEVGGGAMVL 158
Query: 260 GHVVQPE--VNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNK-GTIIDSGTTL 316
G + P V P+Y+I + + V L++ VF D K GTI+DSGTT
Sbjct: 159 GQISPPSDMVFSHSDPDRSPYYNIELRGLHVAGKKLDINPQVF---DGKHGTILDSGTTY 215
Query: 317 AYLPEMVYEPLVSKIISQQPDLK-VHTVHDEY--TCFQYSES----VDEGFPNVTFHFEN 369
AYLPE + P + I S+ LK + Y CF + S + + FP+V F+N
Sbjct: 216 AYLPEAAFLPFIQAITSELHGLKQIRGPDPNYNDVCFSGAGSEIPELYKTFPSVDMVFDN 275
Query: 370 SVSLKVYPHEYLFPFEDL---WCIGWQNSGMQSRDRKNMTLLG 409
+ P YLF + +C+G +G + TLLG
Sbjct: 276 GEKYSLSPENYLFKHSKVHGAYCLGVFQNG-----KDPTTLLG 313
>gi|414589630|tpg|DAA40201.1| TPA: hypothetical protein ZEAMMB73_629620 [Zea mays]
Length = 443
Score = 118 bits (296), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 105/351 (29%), Positives = 155/351 (44%), Gaps = 54/351 (15%)
Query: 40 RSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDI 99
+SL+ L DA RIL G Y ++GIGTP + Y +DTGSD+
Sbjct: 65 QSLAALAPGDAITAARILVLAS-----------DGEYLMEMGIGTPTRYYSAILDTGSDL 113
Query: 100 MWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCP 159
+W C C C + + +D S+T + + C C+ +Y PL C C
Sbjct: 114 IWTQCAPCLLCVDQPT-----PYFDPARSATYRSLGCASPACNALY-YPL--CYQKV-CV 164
Query: 160 YLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDG 219
Y YGD +ST G + + G +T + + FGCG +G+L + + G
Sbjct: 165 YQYFYGDSASTAGVLANETFTF----GTNETRVSLPGISFGCGNLNAGSLANGS-----G 215
Query: 220 IIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGG-------GIFAI---GHVVQPEVNK 269
++GFG+ + S++SQL S F++CL G++A + V
Sbjct: 216 MVGFGRGSLSLVSQLGS-----PRFSYCLTSFLSPVPSRLYFGVYATLNSTNASSEPVQS 270
Query: 270 TPLV--PNQP-HYSINMTAVQVGLDFLNLPTDVFGVGDNK---GTIIDSGTTLAYLPEMV 323
TP V P P Y +NMT + VG L + VF + D GTIIDSGTT+ YL E
Sbjct: 271 TPFVVNPALPTMYFLNMTGISVGGYLLPIDPAVFAINDTDGTGGTIIDSGTTITYLAEPA 330
Query: 324 YEPLVSKIISQ--QPDLKVHTVHDEYTCFQYSESVDEG--FPNVTFHFENS 370
Y+ + + SQ P L V TCFQ+ + P + HF+ +
Sbjct: 331 YDAVRAAFASQITLPLLNVTDASVLDTCFQWPPPPRQSVTLPQLVLHFDGA 381
>gi|255568540|ref|XP_002525244.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223535541|gb|EEF37210.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 460
Score = 118 bits (295), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 97/336 (28%), Positives = 153/336 (45%), Gaps = 36/336 (10%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G YY K+G+G+PPK Y + +DTGS + W +QCK C ++ L++ S+T
Sbjct: 116 GSGNYYLKLGLGSPPKYYTMILDTGSSLSW---LQCKPCVVYCHSQVD-PLFEPSASNTY 171
Query: 132 KFVTCDQEFCHGVYGGPLTD--CTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQ 189
+ + C C + L D CTA+ C Y YGD S + GY +D++ L
Sbjct: 172 RPLYCSSSECSLLKAATLNDPLCTASGVCVYTASYGDASYSMGYLSRDLL-------TLT 224
Query: 190 TTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL- 248
+ T S +GCG G GI+G + SM++QL+ G F++CL
Sbjct: 225 PSQTLPSFTYGCGQDNEGLFGKA-----AGIVGLARDKLSMLAQLSPKYGY--AFSYCLP 277
Query: 249 -DGINGGGIFAIGHVVQPEVNKTPLVPNQPH---YSINMTAVQVGLDFLNLPTDVFGVGD 304
+GGG +IG + TP++ N + Y + + A+ V P V G
Sbjct: 278 TSTSSGGGFLSIGKISPSSYKFTPMIRNSQNPSLYFLRLAAITVA----GRPVGVAAAGY 333
Query: 305 NKGTIIDSGTTLAYLPEMVYEPL---VSKIISQQPD-LKVHTVHDEYTCFQYSESVDEGF 360
TIIDSGT + LP +Y L KI+S++ + +++ D TCF+ S G
Sbjct: 334 QVPTIIDSGTVVTRLPISIYAALREAFVKIMSRRYEQAPAYSILD--TCFKGSLKSMSGA 391
Query: 361 PNVTFHFENSVSLKVYPHEYLFPFED-LWCIGWQNS 395
P + F+ L + L + + C+ + +S
Sbjct: 392 PEIRMIFQGGADLSLRAPNILIEADKGIACLAFASS 427
>gi|224136884|ref|XP_002326969.1| predicted protein [Populus trichocarpa]
gi|222835284|gb|EEE73719.1| predicted protein [Populus trichocarpa]
Length = 626
Score = 118 bits (295), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 103/364 (28%), Positives = 162/364 (44%), Gaps = 50/364 (13%)
Query: 49 DARRQQRILAGVDLPLGGSSRPDGV---GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCI 105
D +R L +LP D + G Y ++ IGTPP+++ + VDTGS + +V C
Sbjct: 47 DGHYSRRHLQNSELPNARMRLFDDLLSNGYYTTRLFIGTPPQEFALIVDTGSTVTYVPCS 106
Query: 106 QCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDC-TANTSCPYLEIY 164
C++C + + L SST + V C+ P +C C Y Y
Sbjct: 107 SCEQCGKHQDPRFQPDL-----SSTYRPVKCN----------PSCNCDDEGKQCTYERRY 151
Query: 165 GDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFG 224
+ SS++G +DVV + S +L+ +FGC ++G+L S + DGI+G G
Sbjct: 152 AEMSSSSGVIAEDVVSFGNES-ELKPQRA----VFGCENVETGDLYS---QRADGIMGLG 203
Query: 225 KSNSSMISQLASSGGVRKMFAHCLDGIN-GGGIFAIGHVVQPE----VNKTPLVPNQPHY 279
+ S++ QL G + F+ C G++ GGG +G + P + P P+Y
Sbjct: 204 RGRLSVVDQLVDKGVIGDSFSLCYGGMDVGGGAMVLGQISPPPNMVFSHSNPY--RSPYY 261
Query: 280 SINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLK 339
+I + + V L L VF + GT++DSGTT AY PE + L I+ + LK
Sbjct: 262 NIELKELHVAGKPLKLKPKVF--DEKHGTVLDSGTTYAYFPEAAFHALKDAIMKEIRHLK 319
Query: 340 V-----HTVHDEYTCF----QYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFEDL--- 387
HD CF + + + FP V F + L + P YLF +
Sbjct: 320 QIPGPDPNYHD--ICFSGAGREVSHLSKVFPEVNMVFGSGQKLSLSPENYLFRHTKVSGA 377
Query: 388 WCIG 391
+C+G
Sbjct: 378 YCLG 381
>gi|168030587|ref|XP_001767804.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162680886|gb|EDQ67318.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 399
Score = 118 bits (295), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 100/359 (27%), Positives = 169/359 (47%), Gaps = 48/359 (13%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSS------LGIELTLYDIKD 127
G Y +++ IGTPP ++ + VDTGS + +V C C C + L + ++
Sbjct: 38 GYYTSRVFIGTPPNEFALIVDTGSTVTYVPCSSCTHCGHHQASFSTHRLFCRDPRFKPEN 97
Query: 128 SSTGKFVTCDQEFC-HGVYGGPLTDCTANT-SCPYLEIYGDGSSTTGYFVQDVVQYDKVS 185
SS+ + + C C G+ C +N+ C Y +Y + S++ G +D++ + S
Sbjct: 98 SSSYQKIGCRSSDCITGL-------CDSNSHQCKYERMYAEMSTSKGVLGKDLLDFGPAS 150
Query: 186 GDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFA 245
LQ+ L FGC +SG+L + DGI+G G+ S++ QL +G + F+
Sbjct: 151 -RLQSQ----LLSFGCETAESGDL---YLQVADGIMGLGRGPLSIVDQLVGNGAIEDSFS 202
Query: 246 HCLDGIN-GGGIFAIGHVVQPE----VNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVF 300
C G++ GGG +G + P P N +Y++ +T +QV L L ++VF
Sbjct: 203 LCYGGMDEGGGSMVLGAIPAPSGMVFAKSDPRRSN--YYNLELTEIQVQGASLKLDSNVF 260
Query: 301 GVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHD-EYTCFQYS------ 353
GTI+DSGTT AYLP+ +E +++Q L+ D Y Y+
Sbjct: 261 --NGKFGTILDSGTTYAYLPDRAFEAFTDAVVAQLGSLQAVDGPDPNYPDICYAGAGTDT 318
Query: 354 ESVDEGFPNVTFHFENSVSLKVYPHEYLFPFEDL---WCIGWQNSGMQSRDRKNMTLLG 409
+ + + FP V F F + + + P YLF + +C+G+ +++ TLLG
Sbjct: 319 KELGKHFPLVDFVFAENQKVSLAPENYLFKHTKVPGAYCLGF------FKNQDATTLLG 371
>gi|125543640|gb|EAY89779.1| hypothetical protein OsI_11321 [Oryza sativa Indica Group]
Length = 434
Score = 117 bits (294), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 122/444 (27%), Positives = 190/444 (42%), Gaps = 69/444 (15%)
Query: 9 LCIVLIATAAVGGVSSNHGVFSVKYRY---------AGRERSLSLLKEHDARRQQRILAG 59
L V++ A +S + +V+ + A RE + AR +R+ +
Sbjct: 4 LAFVIVTLLAALAISRCNAAATVRMQLTHADAGRGLAARELMQRMALRSKARAARRLSSS 63
Query: 60 VDLPLGGSSRPDGVGL--YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLG 117
P+ + +GV Y + IGTPP+ + +DTGSD++W C C C ++
Sbjct: 64 ASAPVSPGTYDNGVPTTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQA--- 120
Query: 118 IELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTA-----NTSCPYLEIYGDGSSTTG 172
L +D SST +CD C G+ P+ C + N +C Y YGD S TTG
Sbjct: 121 --LPYFDPSTSSTLSLTSCDSTLCQGL---PVASCGSPKFWPNQTCVYTYSYGDKSVTTG 175
Query: 173 YFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMIS 232
+ ++ DK + S G + FGCG +G S NE GI GFG+ S+ S
Sbjct: 176 F-----LEVDKFTFVGAGASVPG-VAFGCGLFNNGVFKS-NET---GIAGFGRGPLSLPS 225
Query: 233 QLASSGGVRKMFAHCLDGING-----------GGIFAIGHVVQPEVNKTPLVPNQPH--- 278
QL F+HC +NG ++ G + V TPL+ N +
Sbjct: 226 QLKVGN-----FSHCFTAVNGLKPSTVLLDLPADLYKSG---RGAVQSTPLIQNPANPTF 277
Query: 279 YSINMTAVQVGLDFLNLPTDVFGVGDNK-GTIIDSGTTLAYLPEMVYEPLVSKIISQQPD 337
Y +++ + VG L +P F + + GTIIDSGT + LP VY LV + Q
Sbjct: 278 YYLSLKGITVGSTRLPVPESEFTLKNGTGGTIIDSGTAMTSLPTRVYR-LVRDAFAAQVK 336
Query: 338 LKVHT--VHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFED----LWCI- 390
L V + D Y C P + HFE + ++ + Y+F ED + C+
Sbjct: 337 LPVVSGNTTDPYFCLSAPLRAKPYVPKLVLHFEGA-TMDLPRENYVFEVEDAGSSILCLA 395
Query: 391 ---GWQNSGMQSRDRKNMTLLGDF 411
G + + + + ++NM +L D
Sbjct: 396 IIEGGEVTTIGNFQQQNMHVLYDL 419
>gi|225458774|ref|XP_002283258.1| PREDICTED: aspartic proteinase-like protein 2 [Vitis vinifera]
gi|302142232|emb|CBI19435.3| unnamed protein product [Vitis vinifera]
Length = 659
Score = 117 bits (294), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 100/350 (28%), Positives = 159/350 (45%), Gaps = 45/350 (12%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKF 133
G Y ++ IGTPP+++ + VDTGS + +V C C+ C + + +SST
Sbjct: 86 GYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSDCEHCGKHQD-----PRFQPDESSTYHP 140
Query: 134 VTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTST 193
V C+ + C+ + G +C Y Y + SS++G +D++ + Q+
Sbjct: 141 VKCNMD-CNCDHDG--------VNCVYERRYAEMSSSSGVLGEDIISFGN-----QSEVV 186
Query: 194 NGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGIN- 252
+FGC ++G+L S + DGI+G G+ S++ QL + F+ C G++
Sbjct: 187 PQRAVFGCENVETGDLYS---QRADGIMGLGRGQLSIVDQLVDKNVINDSFSLCYGGMHV 243
Query: 253 GGGIFAIGHVVQPE--VNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNK-GTI 309
GGG +G + P V P+Y+I + + V L L F D K GT+
Sbjct: 244 GGGAMVLGGIPPPPDMVFSRSDPYRSPYYNIELKEIHVAGKPLKLSPSTF---DRKHGTV 300
Query: 310 IDSGTTLAYLPEMVYEPLVSKIISQQPDLK-VHTVHDEYT--CFQYS----ESVDEGFPN 362
+DSGTT AYLPE + II + +LK +H Y CF + + + FP
Sbjct: 301 LDSGTTYAYLPEEAFVAFRDAIIKKSHNLKQIHGPDPNYNDICFSGAGRDVSQLSKAFPE 360
Query: 363 VTFHFENSVSLKVYPHEYLFPFEDL---WCIGWQNSGMQSRDRKNMTLLG 409
V F N L + P YLF + +C+G R+ + TLLG
Sbjct: 361 VDMVFSNGQKLSLTPENYLFQHTKVHGAYCLG------IFRNGDSTTLLG 404
>gi|293332561|ref|NP_001170100.1| uncharacterized protein LOC100384018 precursor [Zea mays]
gi|224033441|gb|ACN35796.1| unknown [Zea mays]
gi|413944035|gb|AFW76684.1| hypothetical protein ZEAMMB73_746438 [Zea mays]
Length = 456
Score = 117 bits (293), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 96/307 (31%), Positives = 140/307 (45%), Gaps = 44/307 (14%)
Query: 44 LLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVN 103
LL H+ + R+ AG+ GG + + Y + +GTPP+ + +DTGSD++W
Sbjct: 58 LLSSHERPVRARVRAGLVAAAGGIATNE----YLVHLAVGTPPRPVALTLDTGSDLVWTQ 113
Query: 104 CIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEI 163
C C++C + L D SST + C C + P T C SC Y+
Sbjct: 114 CAPCRDC-----FDQGIPLLDPAASSTYAALPCGAPRCRAL---PFTSC-GGRSCVYVYH 164
Query: 164 YGDGSSTTGYFVQDVVQY---DKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGI 220
YGD S T G D + + +GD +T L FGCG G S NE GI
Sbjct: 165 YGDKSVTVGKIATDRFTFGDNGRRNGDGSLPATR-RLTFGCGHFNKGVFQS-NE---TGI 219
Query: 221 IGFGKSNSSMISQLASSGGVRKMFAHCLDGIN---------GGGIFAI-GHVVQPEVNKT 270
GFG+ S+ SQL ++ F++C + GG A+ H EV T
Sbjct: 220 AGFGRGRWSLPSQLNATS-----FSYCFTSMFDSKSSIVTLGGAPAALYSHAHSGEVRTT 274
Query: 271 PLV--PNQPH-YSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPL 327
PL P+QP Y +++ + VG L +P F + TIIDSG ++ LPE VYE +
Sbjct: 275 PLFKNPSQPSLYFLSLKGISVGKTRLPVPETKF-----RSTIIDSGASITTLPEEVYEAV 329
Query: 328 VSKIISQ 334
++ +Q
Sbjct: 330 KAEFAAQ 336
>gi|449508297|ref|XP_004163275.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 641
Score = 117 bits (293), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 97/354 (27%), Positives = 157/354 (44%), Gaps = 52/354 (14%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKF 133
G Y ++ IGTPP+ + + VDTGS + +V C C++C R +D + SST K
Sbjct: 81 GYYTTRLWIGTPPQQFALIVDTGSTVTYVPCSTCEQCGRHQD-----PKFDPESSSTYKP 135
Query: 134 VTCDQEFCHGVYGGPLTDCTANT---SCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQT 190
+ C+ DC ++ C Y Y + S+++G +DV+ + Q+
Sbjct: 136 IKCN------------IDCICDSDGVQCVYERQYAEMSTSSGVLGEDVISFGN-----QS 178
Query: 191 TSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDG 250
+FGC ++G+L S + DGI+G G + S++ QL G + F+ C G
Sbjct: 179 ELIPQRAVFGCENMETGDLFS---QRADGIMGLGTGDLSLVDQLVEKGAINDSFSLCYGG 235
Query: 251 IN-GGGIFAIGHVVQPE----VNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDN 305
++ GGG +G + P P+ P+Y++++ + V L L + +F
Sbjct: 236 MDIGGGAMVLGGISPPSDMIFTYSDPV--RSPYYNVDLKEIHVAGKKLPLSSGIF--DGR 291
Query: 306 KGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHD---EYTCFQYSES----VDE 358
G ++DSGTT AYLP + I+ + LK D + CF + S +
Sbjct: 292 YGAVLDSGTTYAYLPAEAFSAFKDAIMDEIHSLKKIDGPDPNFKDICFSGAGSDAAELSN 351
Query: 359 GFPNVTFHFENSVSLKVYPHEYLFPFEDL---WCIGWQNSGMQSRDRKNMTLLG 409
FP V FEN L + P Y F + +C+G +G TLLG
Sbjct: 352 KFPTVDMVFENGQKLSLTPENYFFRHSKVHGAYCLGIFENG-----NDQTTLLG 400
>gi|225217008|gb|ACN85295.1| aspartic proteinase nepenthesin-1 precursor [Oryza coarctata]
Length = 500
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 105/336 (31%), Positives = 147/336 (43%), Gaps = 45/336 (13%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQC-KECPRRSSLGIELTLYDIKDSST 130
G G Y IG+GTP Y V DTGSD WV C C C ++ + L+D SST
Sbjct: 157 GTGNYVVTIGLGTPAGRYTVVFDTGSDTTWVQCEPCVVVCYKQ-----QEKLFDPARSST 211
Query: 131 GKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVV---QYDKVSGD 187
++C C +Y + C+ C Y YGDGS + G+F D + YD + G
Sbjct: 212 YANISCAAPACSDLY---IKGCSGG-HCLYGVQYGDGSYSIGFFAMDTLTLSSYDAIKG- 266
Query: 188 LQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASS-GGVRKMFAH 246
FGCG R G E A G++G G+ +S+ Q GGV FAH
Sbjct: 267 ---------FRFGCGERNEGLY---GEAA--GLLGLGRGKTSLPVQAYDKYGGV---FAH 309
Query: 247 CLDGINGG-GIFAIGHVVQPEVNK---TP-LVPNQP-HYSINMTAVQVGLDFLNLPTDVF 300
C + G G G P V+ TP LV N P Y + +T ++VG L++P VF
Sbjct: 310 CFPARSSGTGYLDFGPGSLPAVSAKLTTPMLVDNGPTFYYVGLTGIRVGGKLLSIPQSVF 369
Query: 301 GVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDL---KVHTVHDEYTCFQYSESVD 357
GTI+DSGT + LP Y L S S + K + TC+ ++ +
Sbjct: 370 ---TTSGTIVDSGTVITRLPPAAYSSLRSAFASAMAERGYKKAPALSLLDTCYDFTGMSE 426
Query: 358 EGFPNVTFHFENSVSLKVYPHEYLFPFE-DLWCIGW 392
P V+ F+ SL V+ ++ C+G+
Sbjct: 427 VAIPTVSLLFQGGASLDVHASGIIYAASVSQACLGF 462
>gi|449463971|ref|XP_004149703.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 641
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 97/354 (27%), Positives = 157/354 (44%), Gaps = 52/354 (14%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKF 133
G Y ++ IGTPP+ + + VDTGS + +V C C++C R +D + SST K
Sbjct: 81 GYYTTRLWIGTPPQQFALIVDTGSTVTYVPCSTCEQCGRHQD-----PKFDPESSSTYKP 135
Query: 134 VTCDQEFCHGVYGGPLTDCTANT---SCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQT 190
+ C+ DC ++ C Y Y + S+++G +DV+ + Q+
Sbjct: 136 IKCN------------IDCICDSDGVQCVYERQYAEMSTSSGVLGEDVISFGN-----QS 178
Query: 191 TSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDG 250
+FGC ++G+L S + DGI+G G + S++ QL G + F+ C G
Sbjct: 179 ELIPQRAVFGCENMETGDLFS---QRADGIMGLGTGDLSLVDQLVEKGAINDSFSLCYGG 235
Query: 251 IN-GGGIFAIGHVVQPE----VNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDN 305
++ GGG +G + P P+ P+Y++++ + V L L + +F
Sbjct: 236 MDIGGGAMVLGGISPPSDMIFTYSDPV--RSPYYNVDLKEIHVAGKKLPLSSGIF--DGR 291
Query: 306 KGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHD---EYTCFQYSES----VDE 358
G ++DSGTT AYLP + I+ + LK D + CF + S +
Sbjct: 292 YGAVLDSGTTYAYLPAEAFSAFKDAIMDEIHSLKKIDGPDPNFKDICFSGAGSDAAELSN 351
Query: 359 GFPNVTFHFENSVSLKVYPHEYLFPFEDL---WCIGWQNSGMQSRDRKNMTLLG 409
FP V FEN L + P Y F + +C+G +G TLLG
Sbjct: 352 KFPTVDMVFENGQKLSLTPENYFFRHSKVHGAYCLGIFENG-----NDQTTLLG 400
>gi|224033419|gb|ACN35785.1| unknown [Zea mays]
gi|413934980|gb|AFW69531.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
Length = 543
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 99/318 (31%), Positives = 147/318 (46%), Gaps = 37/318 (11%)
Query: 43 SLLKEHDARRQQRILAG------VDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTG 96
S L HD R +R LAG + G + G LYYA++ +GTP + V +DTG
Sbjct: 72 SALSRHD--RARRALAGGADDGLLTFAAGNDTYQSGT-LYYAEVELGTPNATFLVALDTG 128
Query: 97 SDIMWV--NCIQCKECPRRSSLGIE---LTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTD 151
SD+ WV +C QC P + G + L Y + SST K V CD C G
Sbjct: 129 SDLFWVPCDCRQCATIPSANGTGQDAPSLRPYSPRRSSTSKQVACDNPLC-----GQRNG 183
Query: 152 CTA--NTSCPY-LEIYGDGSSTTGYFVQDVVQY--DKVSGDLQTTSTNGSLIFGCGARQS 206
C+A N SCPY ++ +S++G VQDV+ ++ + ++FGCG Q+
Sbjct: 184 CSAATNGSCPYEVQYVSANTSSSGVLVQDVLHLTRERPGPGAAGEALQAPVVFGCGQVQT 243
Query: 207 GNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRK-MFAHCLDGINGGGIFAIGHVVQP 265
G A+DG++G G S+ S LA+SG V F+ C G +G G G
Sbjct: 244 GAFLDGGGGAVDGLMGLGMGKVSVPSALAASGLVASDSFSMCF-GDDGVGRVNFGDAGSR 302
Query: 266 EVNKTPLVPN--QPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMV 323
+TP P Y+++ T++ VG + V ++DSGT+ YL +
Sbjct: 303 GQAETPFTVRSLNPTYNVSFTSIGVGSE---------SVAAEFAAVMDSGTSFTYLSDPE 353
Query: 324 YEPLVSKIISQQPDLKVH 341
Y L +K SQ + +V+
Sbjct: 354 YTQLATKFNSQVSERRVN 371
>gi|115473125|ref|NP_001060161.1| Os07g0592200 [Oryza sativa Japonica Group]
gi|29027762|dbj|BAC65898.1| putative CND41, chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|113611697|dbj|BAF22075.1| Os07g0592200 [Oryza sativa Japonica Group]
Length = 631
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 100/354 (28%), Positives = 161/354 (45%), Gaps = 52/354 (14%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKF 133
G Y ++ IGTP +++ + VD+GS + +V C C++C + L SST
Sbjct: 89 GYYTTRLYIGTPSQEFALIVDSGSTVTYVPCATCEQCGNHQDPRFQPDL-----SSTYSP 143
Query: 134 VTCDQEFCHGVYGGPLTDCTAN---TSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQT 190
V C+ DCT + + C Y Y + SS++G +D++ + K S +L+
Sbjct: 144 VKCN------------VDCTCDNERSQCTYERQYAEMSSSSGVLGEDIMSFGKES-ELKP 190
Query: 191 TSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDG 250
+FGC ++G+L S + DGI+G G+ S++ QL G + F+ C G
Sbjct: 191 QRA----VFGCENTETGDLFS---QHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGG 243
Query: 251 IN-GGGIFAIGHVVQPE----VNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDN 305
++ GGG +G + P + P+ P+Y+I + + V L L +F
Sbjct: 244 MDVGGGTMVLGGMPAPPDMVFSHSNPV--RSPYYNIELKEIHVAGKALRLDPKIF--NSK 299
Query: 306 KGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHD---EYTCF----QYSESVDE 358
GT++DSGTT AYLPE + + ++ LK D + CF + + E
Sbjct: 300 HGTVLDSGTTYAYLPEQAFVAFKDAVTNKVNSLKKIRGPDPNYKDICFAGAGRNVSQLSE 359
Query: 359 GFPNVTFHFENSVSLKVYPHEYLF---PFEDLWCIGWQNSGMQSRDRKNMTLLG 409
FP+V F N L + P YLF E +C+G +G + TLLG
Sbjct: 360 VFPDVDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQNG-----KDPTTLLG 408
>gi|242041115|ref|XP_002467952.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
gi|241921806|gb|EER94950.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
Length = 774
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 102/333 (30%), Positives = 145/333 (43%), Gaps = 43/333 (12%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y + IGTPP+ + +DTGSD++W C C C R+ L D +SST +
Sbjct: 415 YLVHLAIGTPPQPVQLILDTGSDLVWTQCRPCPVCFSRA-----LGPLDPSNSSTFDVLP 469
Query: 136 CDQEFCHGVYGGPLTDC----TANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
C C + + C N +C Y+ Y DGS TTG+ + + G Q T
Sbjct: 470 CSSPVCDNLT---WSSCGKHNWGNQTCVYVYAYADGSITTGHLDAETFTFAAADGTGQAT 526
Query: 192 STNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI 251
+ L FGCG +G + ++NE GI GFG+ S+ SQL F+HC I
Sbjct: 527 VPD--LAFGCGLFNNG-IFTSNET---GIAGFGRGALSLPSQLKVDN-----FSHCFTAI 575
Query: 252 NGG-------GIFA-IGHVVQPEVNKTPLVPN---QPHYSINMTAVQVGLDFLNLPTDVF 300
G G+ A + V TPLV N Y +++ + VG L +P F
Sbjct: 576 TGSEPSSVLLGLPANLYSDADGAVQSTPLVQNFSSLRAYYLSLKGITVGSTRLPIPESTF 635
Query: 301 GVGDN--KGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHD---EYTCFQYS-- 353
+ + GTIIDSGT + LP+ Y+ LV + Q L V CF +S
Sbjct: 636 ALKQDGTGGTIIDSGTGMTTLPQDAYK-LVHDAFTAQVRLPVDNATSSSLSRLCFSFSVP 694
Query: 354 ESVDEGFPNVTFHFENSVSLKVYPHEYLFPFED 386
P + HFE + +L + Y+F FED
Sbjct: 695 RRAKPDVPKLVLHFEGA-TLDLPRENYMFEFED 726
>gi|224286173|gb|ACN40797.1| unknown [Picea sitchensis]
Length = 383
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 107/359 (29%), Positives = 163/359 (45%), Gaps = 55/359 (15%)
Query: 70 PD-GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDS 128
PD G G Y ++ IGTP +DTGSD++W C C +C S +
Sbjct: 35 PDIGSGEYLIQMAIGTPALSLSAIMDTGSDLVWTKCNPCTDCSTSSIYDPSSS------- 87
Query: 129 STGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDL 188
ST V C C + C + C Y+ YGD SST+G +
Sbjct: 88 STYSKVLCQSSLCQPP---SIFSCNNDGDCEYVYPYGDRSSTSGILSDETFSI------- 137
Query: 189 QTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL 248
++ + ++ FGCG D+ + + G++GFG+ + S++SQL S G + F++CL
Sbjct: 138 -SSQSLPNITFGCGH------DNQGFDKVGGLVGFGRGSLSLVSQLGPSMGNK--FSYCL 188
Query: 249 ----DGINGGGIFAIGHVVQPE---VNKTPLVPNQP--HYSINMTAVQVGLDFLNLPTDV 299
D +F IG+ E V TPLV + HY +++ + VG L +PT
Sbjct: 189 VSRTDSSKTSPLF-IGNTASLEATTVGSTPLVQSSSTNHYYLSLEGISVGGQSLAIPTGT 247
Query: 300 FGVGDN--KGTIIDSGTTLAYLPEMVY----EPLVSKIISQQPDLKVHTVHDEYTCFQYS 353
F + + G IIDSGTTL +L + Y E +VS I Q D ++ CF
Sbjct: 248 FDIQSDGSGGLIIDSGTTLTFLQQTAYDAVKEAMVSSINLPQADGQLD------LCFNQQ 301
Query: 354 ESVDEGFPNVTFHFENSVSLKVYPHEYLFP--FEDLWCIGWQNSGMQSRDRKNMTLLGD 410
S + GFP++TFHF+ + V YLFP D+ C+ + + + NM + G+
Sbjct: 302 GSSNPGFPSMTFHFKGA-DYDVPKENYLFPDSTSDIVCLAMMPT---NSNLGNMAIFGN 356
>gi|413923780|gb|AFW63712.1| hypothetical protein ZEAMMB73_689747 [Zea mays]
Length = 470
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 98/348 (28%), Positives = 156/348 (44%), Gaps = 41/348 (11%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G Y +GTP ++VDTGSD+ WV QCK C S + L+D SS+
Sbjct: 133 GTSNYVVTASLGTPGMAQTLEVDTGSDLSWV---QCKPCAAPSCYRQKDPLFDPAQSSSY 189
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
V C + C G+ G + C Y+ YGDGS+TTG + D + L
Sbjct: 190 AAVPCGRSACAGL--GIYASACSAAQCGYVVSYGDGSNTTGVYSSDTL-------TLAAN 240
Query: 192 STNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI 251
+T +FGCG QSG L + +DG++GFG+ S++ Q A G +F++CL
Sbjct: 241 ATVQGFLFGCGHAQSGGLFT----GIDGLLGFGREQPSLVQQTA--GAYGGVFSYCLPTK 294
Query: 252 NG-GGIFAIGHV--VQPEVNKTPLV--PNQP-HYSINMTAVQVGLDFLNLPTDVFGVGDN 305
+ G +G V P + T L+ PN P +Y + +T + VG L++P F
Sbjct: 295 SSTTGYLTLGGPSGVAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQPLSVPASAFA---- 350
Query: 306 KGTIIDSGTTLAYLPEMVYEPLVSKI---ISQQPDLKVHTVHDEYTCFQYSESVDEGFPN 362
GT++D+GT + LP Y L S ++ P + D TC+ ++ +
Sbjct: 351 AGTVVDTGTVITRLPPAAYAALRSAFRSGMASYPSAPPIGILD--TCYSFAGYGTVNLTS 408
Query: 363 VTFHFENSVSLKVYPHEYLFPFEDLWCIGWQNSGMQSRDRKNMTLLGD 410
V F + ++ + + C+ + +SG +M +LG+
Sbjct: 409 VALTFSSGATMTLGADGIM----SFGCLAFASSG----SDGSMAILGN 448
>gi|242094226|ref|XP_002437603.1| hypothetical protein SORBIDRAFT_10g030330 [Sorghum bicolor]
gi|241915826|gb|EER88970.1| hypothetical protein SORBIDRAFT_10g030330 [Sorghum bicolor]
Length = 541
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 89/270 (32%), Positives = 129/270 (47%), Gaps = 29/270 (10%)
Query: 75 LYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSL----GIELTLYDIKDSST 130
LYYA + +GTP + V +DTGSD+ WV C CK+C +++ L Y ++SST
Sbjct: 110 LYYAVVEVGTPNATFLVALDTGSDLFWVPC-DCKQCASIANVTGQPATALRPYSPRESST 168
Query: 131 GKFVTCDQEFCHGVYGGPLTDCTA--NTSCPY-LEIYGDGSSTTGYFVQDVVQYDK---V 184
K VTCD C G C+A N SCPY ++ +ST+G VQDV+ +
Sbjct: 169 SKQVTCDNALCDRPNG-----CSAATNGSCPYEVQYLSANTSTSGVLVQDVLHLTRERPG 223
Query: 185 SGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRK-M 243
+ + ++FGCG Q+G + A DG++G G+ N S+ S LASSG V
Sbjct: 224 AAAEAGEALQAPVVFGCGQVQTGTF--LDGAAFDGLMGLGRENVSVPSVLASSGLVASDS 281
Query: 244 FAHCLDGINGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVG 303
F+ C G +G G G +TP + Y+++ TAV V + V
Sbjct: 282 FSMCF-GDDGVGRINFGDSGSSGQGETPFTGRRTLYNVSFTAVNV---------ETKSVA 331
Query: 304 DNKGTIIDSGTTLAYLPEMVYEPLVSKIIS 333
+IDSGT+ YL + Y L + S
Sbjct: 332 AEFAAVIDSGTSFTYLADPEYTELATNFNS 361
>gi|293335828|ref|NP_001170221.1| uncharacterized protein LOC100384173 precursor [Zea mays]
gi|224034427|gb|ACN36289.1| unknown [Zea mays]
Length = 443
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 105/351 (29%), Positives = 154/351 (43%), Gaps = 54/351 (15%)
Query: 40 RSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDI 99
+SL+ L DA RIL G Y ++GIGTP + Y +DTGSD+
Sbjct: 65 QSLAALAPGDAITAARILVLAS-----------DGEYLMEMGIGTPTRYYSAILDTGSDL 113
Query: 100 MWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCP 159
+W C C C + + +D S+T + + C C+ +Y PL C C
Sbjct: 114 IWTQCAPCLLCVDQPT-----PYFDPARSATYRSLGCASPACNALY-YPL--CYQKV-CV 164
Query: 160 YLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDG 219
Y YGD +ST G + + G +T + + FGCG +G L + + G
Sbjct: 165 YQYFYGDSASTAGVLANETFTF----GTNETRVSLPGISFGCGNLNAGLLANGS-----G 215
Query: 220 IIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGG-------GIFAI---GHVVQPEVNK 269
++GFG+ + S++SQL S F++CL G++A + V
Sbjct: 216 MVGFGRGSLSLVSQLGS-----PRFSYCLTSFLSPVPSRLYFGVYATLNSTNASSEPVQS 270
Query: 270 TPLV--PNQP-HYSINMTAVQVGLDFLNLPTDVFGVGDNK---GTIIDSGTTLAYLPEMV 323
TP V P P Y +NMT + VG L + VF + D GTIIDSGTT+ YL E
Sbjct: 271 TPFVVNPALPTMYFLNMTGISVGGYLLPIDPAVFAINDTDGTGGTIIDSGTTITYLAEPA 330
Query: 324 YEPLVSKIISQ--QPDLKVHTVHDEYTCFQYSESVDEG--FPNVTFHFENS 370
Y+ + + SQ P L V TCFQ+ + P + HF+ +
Sbjct: 331 YDAVRAAFASQITLPLLNVTDASVLDTCFQWPPPPRQSVTLPQLVLHFDGA 381
>gi|30680102|ref|NP_849967.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|17978947|gb|AAL47439.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
gi|22655368|gb|AAM98276.1| At2g17760/At2g17760 [Arabidopsis thaliana]
gi|330251585|gb|AEC06679.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 513
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 101/340 (29%), Positives = 155/340 (45%), Gaps = 39/340 (11%)
Query: 69 RPDGVG-LYYAKIGIGTPPKDYYVQVDTGSDIMWV-----NCIQCKECPRRSSLGIELTL 122
R D +G L+YA + +GTP + V +DTGSD+ W+ NC++ + P SSL +L +
Sbjct: 96 RVDALGFLHYANVTVGTPSDWFMVALDTGSDLFWLPCDCTNCVRELKAPGGSSL--DLNI 153
Query: 123 YDIKDSSTGKFVTCDQEFCH--GVYGGPLTDCTANTSCPY-LEIYGDGSSTTGYFVQDVV 179
Y SST V C+ C P +D CPY + +G+S+TG V+DV+
Sbjct: 154 YSPNASSTSTKVPCNSTLCTRGDRCASPESD------CPYQIRYLSNGTSSTGVLVEDVL 207
Query: 180 QYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGG 239
VS D + + + FGCG Q+G + A +G+ G G + S+ S LA G
Sbjct: 208 HL--VSNDKSSKAIPARVTFGCGQVQTGVFH--DGAAPNGLFGLGLEDISVPSVLAKEGI 263
Query: 240 VRKMFAHCLDGINGGGIFAIGHVVQPEVNKTPLVPNQPH--YSINMTAVQVGLDFLNLPT 297
F+ C G +G G + G + +TPL QPH Y+I +T + VG + +L
Sbjct: 264 AANSFSMCF-GNDGAGRISFGDKGSVDQRETPLNIRQPHPTYNITVTKISVGGNTGDLEF 322
Query: 298 DVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYT---CFQYSE 354
D + DSGT+ YL + Y + S D + T E C+ S
Sbjct: 323 DA---------VFDSGTSFTYLTDAAYTLISESFNSLALDKRYQTTDSELPFEYCYALSP 373
Query: 355 SVDE-GFPNVTFHFENSVSLKVYPHEYLFPFE--DLWCIG 391
+ D +P V + S VY + P + D++C+
Sbjct: 374 NKDSFQYPAVNLTMKGGSSYPVYHPLVVIPMKDTDVYCLA 413
>gi|357492389|ref|XP_003616483.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517818|gb|AES99441.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 97/332 (29%), Positives = 152/332 (45%), Gaps = 28/332 (8%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKF 133
G Y +GTPP Y DTGSDI+W+ C C++C +++ +++ SS+ K
Sbjct: 85 GGYLMTYSVGTPPTKIYGIADTGSDIVWLQCEPCEQCYNQTT-----PIFNPSKSSSYKN 139
Query: 134 VTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTST 193
+ C + CH V T C+ SC Y YGD S + G D + + SG + +
Sbjct: 140 IPCSSKLCHSVRD---TSCSDQNSCQYKISYGDSSHSQGDLSVDTLSLESTSG---SPVS 193
Query: 194 NGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI-- 251
++ GCG +G A GI+G G S+I+QL SS G + F++CL +
Sbjct: 194 FPKIVIGCGTDNAGTFGG----ASSGIVGLGGGPVSLITQLGSSIGGK--FSYCLVPLLN 247
Query: 252 ---NGGGIFAIGH--VVQPE-VNKTPLVPNQP-HYSINMTAVQVGLDFLNLPTDVFGVGD 304
N I + G VV + V TPL+ P Y + + A VG + G D
Sbjct: 248 KESNASSILSFGDAAVVSGDGVVSTPLIKKDPVFYFLTLQAFSVGNKRVEFGGSSEGGDD 307
Query: 305 NKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVT 364
IIDSGTTL +P VY L S ++ +V + +++ +S + FP +T
Sbjct: 308 EGNIIIDSGTTLTLIPSDVYTNLESAVVDLVKLDRVDDPNQQFSLCYSLKSNEYDFPIIT 367
Query: 365 FHFENSVSLKVYPHEYLFPFED-LWCIGWQNS 395
HF+ + ++++ P D + C +Q S
Sbjct: 368 VHFKGA-DVELHSISTFVPITDGIVCFAFQPS 398
>gi|449456843|ref|XP_004146158.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 547
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 97/318 (30%), Positives = 145/318 (45%), Gaps = 41/318 (12%)
Query: 22 VSSNHGVFSVKYRYAGRERSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIG 81
V ++H V S + R L+ L ++ R PLG LYYA++
Sbjct: 92 VRTDHFVHSRRLGQVQDHRPLTFLSGNETLRIS--------PLGF--------LYYAEVT 135
Query: 82 IGTPPKDYYVQVDTGSDIMWV--NCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQE 139
+GTP Y V +DTGSD+ W+ +C+ C + + +Y +SST K V C
Sbjct: 136 VGTPGVPYLVALDTGSDLFWLPCDCVNCITGLNTTQGPVNFNIYSPNNSSTSKEVQCSSS 195
Query: 140 FCHGVYGGPLTDCTANT-SCPYLEIY-GDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSL 197
C L C++ + +CPY Y D +S+TGY V+D++ + D+Q+ N +
Sbjct: 196 LCSH-----LDQCSSPSDTCPYQVSYLSDNTSSTGYLVEDILHL--TTNDVQSKPVNARI 248
Query: 198 IFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIF 257
GCG QSG S+ A +G+ G G N S+ S LA++G + F+ C G I
Sbjct: 249 TLGCGKDQSGAFLSS--AAPNGLFGLGIENVSVPSILANAGLISNSFSLCFGPARMGRI- 305
Query: 258 AIGHVVQPEVNKTP--LVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTT 315
G P N+TP L P Y++++T + VG +L DV I DSGT+
Sbjct: 306 EFGDKGSPGQNETPFNLGRRHPTYNVSITQIGVGGHISDL--DV-------AVIFDSGTS 356
Query: 316 LAYLPEMVYEPLVSKIIS 333
YL + Y K S
Sbjct: 357 FTYLNDPAYSLFADKFAS 374
>gi|168014188|ref|XP_001759635.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689174|gb|EDQ75547.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 485
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 101/355 (28%), Positives = 171/355 (48%), Gaps = 48/355 (13%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIK----DSS 129
G Y +++ IGTP +++ + VDTGS + +V C C C G +D + +SS
Sbjct: 97 GYYTSRVFIGTPAQEFALIVDTGSTVTYVPCSSCTHC------GHHQACFDPRFKPDNSS 150
Query: 130 TGKFVTCDQEFCHGVYGGPLTD-CTANT-SCPYLEIYGDGSSTTGYFVQDVVQYDKVSGD 187
+ + V+C+ C +T C A C Y +Y + SS+ G +D++ + S
Sbjct: 151 SYQTVSCNSPDC-------ITKMCDARVHQCKYERVYAEMSSSKGVLGKDLLGFGNGS-R 202
Query: 188 LQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHC 247
LQ L+FGC ++G+L + DGI+G G+ S++ QL +G + F+ C
Sbjct: 203 LQPH----PLLFGCETAETGDL---YLQHADGIMGLGRGPLSIVDQLVGTGAMEDSFSLC 255
Query: 248 LDGIN-GGGIFAIGHVVQPEVNK-TPLVPNQP-HYSINMTAVQVGLDFLNLPTDVFGVGD 304
G++ GGG +G + P PN+ +Y++ ++ +QV LN+P++VF
Sbjct: 256 YGGMDEGGGSMVLGAIPPPPAMVFAKSDPNRSNYYNLELSEIQVQGVSLNVPSEVF--NG 313
Query: 305 NKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEY---TCF----QYSESVD 357
GT++DSGTT AYLP+ ++ I Q L+ D CF S+++
Sbjct: 314 RLGTVLDSGTTYAYLPDKAFDAFKDAITQQLGSLQAVPGPDPSYPDVCFAGAGSDSKALG 373
Query: 358 EGFPNVTFHFENSVSLKVYPHEYLFPFEDL---WCIGWQNSGMQSRDRKNMTLLG 409
+ FP V F F + + + P YLF + +C+G+ +++ TLLG
Sbjct: 374 KHFPPVDFVFSGNQKVFLAPENYLFKHTKVPGAYCLGF------FKNQDATTLLG 422
>gi|226499286|ref|NP_001147826.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
gi|195613980|gb|ACG28820.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
Length = 545
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 98/329 (29%), Positives = 151/329 (45%), Gaps = 37/329 (11%)
Query: 32 KYRYAGRERSLSLLKEHDARRQQRILAG------VDLPLGGSSRPDGVGLYYAKIGIGTP 85
++ G S L HD R +R LAG + G + G LYYA++ +GTP
Sbjct: 63 RWPARGTPEYYSALSRHD--RARRALAGGADDGLLTFAAGNDTYQSGT-LYYAEVELGTP 119
Query: 86 PKDYYVQVDTGSDIMWV--NCIQCKECPRRSSLGIE---LTLYDIKDSSTGKFVTCDQEF 140
+ V +DTGSD+ WV +C QC P ++ G + L Y + SST + V CD
Sbjct: 120 NATFLVALDTGSDLFWVPCDCRQCATIPSANATGPDAPPLRPYSPRRSSTSEQVACDNPL 179
Query: 141 CHGVYGGPLTDCTA--NTSCPY-LEIYGDGSSTTGYFVQDVVQY--DKVSGDLQTTSTNG 195
C G C+A N SCPY ++ +S++G VQDV+ ++ +
Sbjct: 180 C-----GRRNGCSAATNGSCPYEVQYVSANTSSSGVLVQDVLHLTRERPGPGAAGEALQA 234
Query: 196 SLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRK-MFAHCLDGINGG 254
++FGCG Q+G A+DG++G G S+ S LA+SG V F+ C G +G
Sbjct: 235 PVVFGCGQVQTGAFLDDGGGAVDGLMGLGMGKVSVPSALAASGLVASDSFSMCF-GDDGV 293
Query: 255 GIFAIGHVVQPEVNKTPLVPN--QPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDS 312
G G +TP P Y+++ T++ +G + V ++DS
Sbjct: 294 GRVNFGDAGSRGQAETPFTVRSLNPTYNVSFTSIGIGSE---------SVAAEFAAVMDS 344
Query: 313 GTTLAYLPEMVYEPLVSKIISQQPDLKVH 341
GT+ YL + Y L +K SQ + +V+
Sbjct: 345 GTSFTYLSDPEYTQLATKFNSQVSERRVN 373
>gi|38344829|emb|CAD40873.2| OSJNBa0064H22.10 [Oryza sativa Japonica Group]
gi|116310063|emb|CAH67084.1| H0818E04.1 [Oryza sativa Indica Group]
gi|116310186|emb|CAH67198.1| OSIGBa0152K17.10 [Oryza sativa Indica Group]
Length = 444
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 99/329 (30%), Positives = 152/329 (46%), Gaps = 45/329 (13%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G + + IGTP Y VDTGSD++W C C +C ++S+ ++D SST
Sbjct: 91 GNGEFLMDVSIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQST-----PVFDPSSSSTY 145
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
V C C + P + CT+ + C Y YGD SST G + K
Sbjct: 146 ATVPCSSASCSDL---PTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAK-------- 194
Query: 192 STNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI 251
S ++FGCG G D ++ A G++G G+ S++SQL G+ K F++CL +
Sbjct: 195 SKLPGVVFGCGDTNEG--DGFSQGA--GLVGLGRGPLSLVSQL----GLDK-FSYCLTSL 245
Query: 252 NG--------GGIFAI--GHVVQPEVNKTPLV--PNQPH-YSINMTAVQVGLDFLNLPTD 298
+ G + I V TPL+ P+QP Y +++ A+ VG ++LP+
Sbjct: 246 DDTNNSPLLLGSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSS 305
Query: 299 VFGVGDN--KGTIIDSGTTLAYLPEMVYEPLVSKIISQQ--PDLKVHTVHDEYTCFQY-S 353
F V D+ G I+DSGT++ YL Y L +Q P V + CF+ +
Sbjct: 306 AFAVQDDGTGGVIVDSGTSITYLEVQGYRALKKAFAAQMALPAADGSGVGLDL-CFRAPA 364
Query: 354 ESVDE-GFPNVTFHFENSVSLKVYPHEYL 381
+ VD+ P + FHF+ L + Y+
Sbjct: 365 KGVDQVEVPRLVFHFDGGADLDLPAENYM 393
>gi|115458644|ref|NP_001052922.1| Os04g0448300 [Oryza sativa Japonica Group]
gi|113564493|dbj|BAF14836.1| Os04g0448300 [Oryza sativa Japonica Group]
gi|215766465|dbj|BAG98773.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767943|dbj|BAH00172.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 454
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 99/329 (30%), Positives = 152/329 (46%), Gaps = 45/329 (13%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G + + IGTP Y VDTGSD++W C C +C ++S+ ++D SST
Sbjct: 101 GNGEFLMDVSIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQST-----PVFDPSSSSTY 155
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
V C C + P + CT+ + C Y YGD SST G + K
Sbjct: 156 ATVPCSSASCSDL---PTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAK-------- 204
Query: 192 STNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI 251
S ++FGCG G D ++ A G++G G+ S++SQL G+ K F++CL +
Sbjct: 205 SKLPGVVFGCGDTNEG--DGFSQGA--GLVGLGRGPLSLVSQL----GLDK-FSYCLTSL 255
Query: 252 NG--------GGIFAI--GHVVQPEVNKTPLV--PNQPH-YSINMTAVQVGLDFLNLPTD 298
+ G + I V TPL+ P+QP Y +++ A+ VG ++LP+
Sbjct: 256 DDTNNSPLLLGSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSS 315
Query: 299 VFGVGDN--KGTIIDSGTTLAYLPEMVYEPLVSKIISQQ--PDLKVHTVHDEYTCFQY-S 353
F V D+ G I+DSGT++ YL Y L +Q P V + CF+ +
Sbjct: 316 AFAVQDDGTGGVIVDSGTSITYLEVQGYRALKKAFAAQMALPAADGSGVGLDL-CFRAPA 374
Query: 354 ESVDE-GFPNVTFHFENSVSLKVYPHEYL 381
+ VD+ P + FHF+ L + Y+
Sbjct: 375 KGVDQVEVPRLVFHFDGGADLDLPAENYM 403
>gi|24430421|dbj|BAC22609.1| 41 kD chloroplast nucleoid DNA binding protein (CND41) [Nicotiana
sylvestris]
Length = 502
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 100/334 (29%), Positives = 150/334 (44%), Gaps = 46/334 (13%)
Query: 67 SSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIK 126
S P G G Y +G+GTP KD + DTGSD+ W C C +S + ++D
Sbjct: 145 SGLPLGTGNYIVNVGLGTPKKDLSLIFDTGSDLTWTQCQPC----VKSCYAQQQPIFDPS 200
Query: 127 DSSTGKFVTCDQEFCHGVYG--GPLTDCTANTSCPYLEIYGDGSSTTGYFVQD---VVQY 181
S T ++C C G+ G C++ ++C Y YGD S T G+F +D + Q
Sbjct: 201 ASKTYSNISCTSTACSGLKSATGNSPGCSS-SNCVYGIQYGDSSFTVGFFAKDTLTLTQN 259
Query: 182 DKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVR 241
D G +FGCG G T G+IG G+ S++ Q A G
Sbjct: 260 DVFDG----------FMFGCGQNNRGLFGKT-----AGLIGLGRDPLSIVQQTAQKFG-- 302
Query: 242 KMFAHCL---DGINGGGIFAIGH------VVQPEVNKTPLVPNQ--PHYSINMTAVQVGL 290
K F++CL G NG F G+ V+ + TP +Q Y I++ + VG
Sbjct: 303 KYFSYCLPTSRGSNGHLTFGNGNGVKTSKAVKNGITFTPFASSQGATFYFIDVLGISVGG 362
Query: 291 DFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVS---KIISQQPDLKVHTVHDEY 347
L++ +F N GTIIDSGT + LP VY L S + +S+ P ++ D
Sbjct: 363 KALSISPMLF---QNAGTIIDSGTVITRLPSTVYGSLKSTFKQFMSKYPTAPALSLLD-- 417
Query: 348 TCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYL 381
TC+ S P ++F+F + ++ + P+ L
Sbjct: 418 TCYDLSNYTSISIPKISFNFNGNANVDLEPNGIL 451
>gi|218192703|gb|EEC75130.1| hypothetical protein OsI_11317 [Oryza sativa Indica Group]
Length = 440
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 111/395 (28%), Positives = 171/395 (43%), Gaps = 48/395 (12%)
Query: 36 AGRERSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGL--YYAKIGIGTPPKDYYVQV 93
+GRE + AR + + + P+ + DGV + Y + IGTPP+ + +
Sbjct: 49 SGRELMRRMALRSKARAPRLLSSSATAPVSPGAYDDGVPMTEYLLHLAIGTPPQPVQLTL 108
Query: 94 DTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCT 153
DTGSD++W C C C +S L YD SST +CD C +T C
Sbjct: 109 DTGSDLVWTQCQPCAVCFNQS-----LPYYDASRSSTFALPSCDSTQCK--LDPSVTMCV 161
Query: 154 ANT--SCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDS 211
T +C + YGD S+T G+ + V + V+G ++ ++FGCG +G S
Sbjct: 162 NQTVQTCAFSYSYGDKSATIGFLDVETVSF--VAG-----ASVPGVVFGCGLNNTGIFRS 214
Query: 212 TNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHVVQPEVNK-- 269
NE GI GFG+ S+ SQL F+HC ++G + + ++ K
Sbjct: 215 -NET---GIAGFGRGPLSLPSQLKVGN-----FSHCFTAVSGRKPSTVLFDLPADLYKNG 265
Query: 270 ------TPLVPNQPH---YSINMTAVQVGLDFLNLPTDVFGVGDNK-GTIIDSGTTLAYL 319
TPL+ N H Y +++ + VG L +P F + + GTIIDSGT L
Sbjct: 266 RGTVQTTPLIKNPAHPTFYYLSLKGITVGSTRLPVPESAFALKNGTGGTIIDSGTAFTSL 325
Query: 320 PEMVYEPLVSKIISQQPDLKVHTVHD--EYTCFQYSE-SVDEGFPNVTFHFENSVSLKVY 376
P VY LV + L V ++ CF P + HFE + ++ +
Sbjct: 326 PPRVYR-LVHDEFAAHVKLPVVPSNETGPLLCFSAPPLGKAPHVPKLVLHFEGA-TMHLP 383
Query: 377 PHEYLFPFEDLWCIGWQNSGMQSRDRKNMTLLGDF 411
Y+F +D G S + MT++G+F
Sbjct: 384 RENYVFEAKD----GGNCSICLAIIEGEMTIIGNF 414
>gi|17979392|gb|AAL49921.1| unknown protein [Arabidopsis thaliana]
Length = 439
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 113/425 (26%), Positives = 193/425 (45%), Gaps = 53/425 (12%)
Query: 5 LRNCLCIVLIATAAVGGVSSNHGVFSVKYRYAGRE----RSLSLLKE---HDARRQQRI- 56
++ L LI T + V+ + SV+ + A R+ + LS +++ D +R I
Sbjct: 1 MQKTLLSCLITTLLLITVADSMKDTSVRLKLAHRDTLLPKPLSRIEDVIGADQKRHSLIS 60
Query: 57 -----LAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECP 111
GV + LG S G Y+ +I +GTP K + V VDTGS++ WVNC
Sbjct: 61 RKRNSTVGVKMDLG-SGIDYGTAQYFTEIRVGTPAKKFRVVVDTGSELTWVNC------- 112
Query: 112 RRSSLGIE-LTLYDIKDSSTGKFVTCDQEFCH----GVYGGPLTDC-TANTSCPYLEIYG 165
R + G + ++ +S + K V C + C ++ LT C T +T C Y Y
Sbjct: 113 RYRARGKDNRRVFRADESKSFKTVGCLTQTCKVDLMNLFS--LTTCPTPSTPCSYDYRYA 170
Query: 166 DGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGK 225
DGS+ G F ++ + +G + G LI GC + +G + + DG++G
Sbjct: 171 DGSAAQGVFAKETITVGLTNGRMARLP--GHLI-GCSSSFTGQ----SFQGADGVLGLAF 223
Query: 226 SNSSMISQLASSGGVRKMFAHCL------DGINGGGIFAIGHVVQPEVNKTP---LVPNQ 276
S+ S S S G + F++CL ++ IF + +T L
Sbjct: 224 SDFSFTSTATSLYGAK--FSYCLVDHLSNKNVSNYLIFGSSRSTKTAFRRTTPLDLTRIP 281
Query: 277 PHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLV---SKIIS 333
P Y+IN+ + +G D L++P+ V+ GTI+DSGT+L L + Y+ +V ++ +
Sbjct: 282 PFYAINVIGISLGYDMLDIPSQVWDATSGGGTILDSGTSLTLLADAAYKQVVTGLARYLV 341
Query: 334 QQPDLKVHTVHDEYTCFQYSESVD-EGFPNVTFHFENSVSLKVYPHEYLF-PFEDLWCIG 391
+ +K V EY CF ++ + P +TFH + + + YL + C+G
Sbjct: 342 ELKRVKPEGVPIEY-CFSFTSGFNVSKLPQLTFHLKGGARFEPHRKSYLVDAAPGVKCLG 400
Query: 392 WQNSG 396
+ ++G
Sbjct: 401 FVSAG 405
>gi|222637379|gb|EEE67511.1| hypothetical protein OsJ_24961 [Oryza sativa Japonica Group]
Length = 641
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 100/359 (27%), Positives = 162/359 (45%), Gaps = 52/359 (14%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIEL-----TLYDIKDS 128
G Y ++ IGTP +++ + VD+GS + +V C C++C S + + S
Sbjct: 89 GYYTTRLYIGTPSQEFALIVDSGSTVTYVPCATCEQCGNHQSESPNIIEAHDPRFQPDLS 148
Query: 129 STGKFVTCDQEFCHGVYGGPLTDCTAN---TSCPYLEIYGDGSSTTGYFVQDVVQYDKVS 185
ST V C+ DCT + + C Y Y + SS++G +D++ + K S
Sbjct: 149 STYSPVKCN------------VDCTCDNERSQCTYERQYAEMSSSSGVLGEDIMSFGKES 196
Query: 186 GDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFA 245
+L+ +FGC ++G+L S + DGI+G G+ S++ QL G + F+
Sbjct: 197 -ELKPQRA----VFGCENTETGDLFS---QHADGIMGLGRGQLSIMDQLVEKGVISDSFS 248
Query: 246 HCLDGIN-GGGIFAIGHVVQPE----VNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVF 300
C G++ GGG +G + P + P+ P+Y+I + + V L L +F
Sbjct: 249 LCYGGMDVGGGTMVLGGMPAPPDMVFSHSNPV--RSPYYNIELKEIHVAGKALRLDPKIF 306
Query: 301 GVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHD---EYTCF----QYS 353
GT++DSGTT AYLPE + + ++ LK D + CF +
Sbjct: 307 --NSKHGTVLDSGTTYAYLPEQAFVAFKDAVTNKVNSLKKIRGPDPNYKDICFAGAGRNV 364
Query: 354 ESVDEGFPNVTFHFENSVSLKVYPHEYLF---PFEDLWCIGWQNSGMQSRDRKNMTLLG 409
+ E FP+V F N L + P YLF E +C+G +G + TLLG
Sbjct: 365 SQLSEVFPDVDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQNG-----KDPTTLLG 418
>gi|218199944|gb|EEC82371.1| hypothetical protein OsI_26705 [Oryza sativa Indica Group]
Length = 642
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 100/359 (27%), Positives = 162/359 (45%), Gaps = 52/359 (14%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIEL-----TLYDIKDS 128
G Y ++ IGTP +++ + VD+GS + +V C C++C S + + S
Sbjct: 90 GYYTTRLYIGTPSQEFALIVDSGSTVTYVPCATCEQCGNHQSESPNIIEAHDPRFQPDLS 149
Query: 129 STGKFVTCDQEFCHGVYGGPLTDCTAN---TSCPYLEIYGDGSSTTGYFVQDVVQYDKVS 185
ST V C+ DCT + + C Y Y + SS++G +D++ + K S
Sbjct: 150 STYSPVKCN------------VDCTCDNERSQCTYERQYAEMSSSSGVLGEDIMSFGKES 197
Query: 186 GDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFA 245
+L+ +FGC ++G+L S + DGI+G G+ S++ QL G + F+
Sbjct: 198 -ELKPQRA----VFGCENTETGDLFS---QHADGIMGLGRGQLSIMDQLVEKGVISDSFS 249
Query: 246 HCLDGIN-GGGIFAIGHVVQPE----VNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVF 300
C G++ GGG +G + P + P+ P+Y+I + + V L L +F
Sbjct: 250 LCYGGMDVGGGTMVLGGMPAPPDMVFSHSNPV--RSPYYNIELKEIHVAGKALRLDPKIF 307
Query: 301 GVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHD---EYTCF----QYS 353
GT++DSGTT AYLPE + + ++ LK D + CF +
Sbjct: 308 --NSKHGTVLDSGTTYAYLPEQAFVAFKDAVTNKVNSLKKIRGPDPNYKDICFAGAGRNV 365
Query: 354 ESVDEGFPNVTFHFENSVSLKVYPHEYLF---PFEDLWCIGWQNSGMQSRDRKNMTLLG 409
+ E FP+V F N L + P YLF E +C+G +G + TLLG
Sbjct: 366 SQLSEVFPDVDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQNG-----KDPTTLLG 419
>gi|30682289|ref|NP_187876.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|11994412|dbj|BAB02414.1| chloroplast nucleoid DNA binding protein-like [Arabidopsis
thaliana]
gi|332641715|gb|AEE75236.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 461
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 113/425 (26%), Positives = 193/425 (45%), Gaps = 53/425 (12%)
Query: 5 LRNCLCIVLIATAAVGGVSSNHGVFSVKYRYAGRE----RSLSLLKE---HDARRQQRI- 56
++ L LI T + V+ + SV+ + A R+ + LS +++ D +R I
Sbjct: 23 MQKTLLSCLITTLLLITVADSMKDTSVRLKLAHRDTLLPKPLSRIEDVIGADQKRHSLIS 82
Query: 57 -----LAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECP 111
GV + LG S G Y+ +I +GTP K + V VDTGS++ WVNC
Sbjct: 83 RKRNSTVGVKMDLG-SGIDYGTAQYFTEIRVGTPAKKFRVVVDTGSELTWVNC------- 134
Query: 112 RRSSLGIE-LTLYDIKDSSTGKFVTCDQEFCH----GVYGGPLTDC-TANTSCPYLEIYG 165
R + G + ++ +S + K V C + C ++ LT C T +T C Y Y
Sbjct: 135 RYRARGKDNRRVFRADESKSFKTVGCLTQTCKVDLMNLFS--LTTCPTPSTPCSYDYRYA 192
Query: 166 DGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGK 225
DGS+ G F ++ + +G + G LI GC + +G + + DG++G
Sbjct: 193 DGSAAQGVFAKETITVGLTNGRMARLP--GHLI-GCSSSFTGQ----SFQGADGVLGLAF 245
Query: 226 SNSSMISQLASSGGVRKMFAHCL------DGINGGGIFAIGHVVQPEVNKTP---LVPNQ 276
S+ S S S G + F++CL ++ IF + +T L
Sbjct: 246 SDFSFTSTATSLYGAK--FSYCLVDHLSNKNVSNYLIFGSSRSTKTAFRRTTPLDLTRIP 303
Query: 277 PHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLV---SKIIS 333
P Y+IN+ + +G D L++P+ V+ GTI+DSGT+L L + Y+ +V ++ +
Sbjct: 304 PFYAINVIGISLGYDMLDIPSQVWDATSGGGTILDSGTSLTLLADAAYKQVVTGLARYLV 363
Query: 334 QQPDLKVHTVHDEYTCFQYSESVD-EGFPNVTFHFENSVSLKVYPHEYLF-PFEDLWCIG 391
+ +K V EY CF ++ + P +TFH + + + YL + C+G
Sbjct: 364 ELKRVKPEGVPIEY-CFSFTSGFNVSKLPQLTFHLKGGARFEPHRKSYLVDAAPGVKCLG 422
Query: 392 WQNSG 396
+ ++G
Sbjct: 423 FVSAG 427
>gi|90399033|emb|CAJ86229.1| H0402C08.5 [Oryza sativa Indica Group]
gi|125550227|gb|EAY96049.1| hypothetical protein OsI_17922 [Oryza sativa Indica Group]
Length = 473
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 93/323 (28%), Positives = 141/323 (43%), Gaps = 34/323 (10%)
Query: 71 DGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSST 130
DG G Y+ ++G+G+PP D Y+ VD+GSD++WV C C++C ++ L+D SS+
Sbjct: 125 DGSGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTD-----PLFDPAASSS 179
Query: 131 GKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQT 190
V+C C + G C Y YGDGS T G + + L
Sbjct: 180 FSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLT-------LGG 232
Query: 191 TSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL-- 248
T+ G I GCG R SG G++G G S+I QL + G +F++CL
Sbjct: 233 TAVQGVAI-GCGHRNSGLFVGAA-----GLLGLGWGAMSLIGQLGGAAG--GVFSYCLAS 284
Query: 249 DGINGGGIFAIGHVVQPEVNK--TPLVPNQ---PHYSINMTAVQVGLDFLNLPTDVFGVG 303
G G G +G V PLV N Y + +T + VG + L L +F +
Sbjct: 285 RGAGGAGSLVLGRTEAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDGLFQLT 344
Query: 304 DN--KGTIIDSGTTLAYLPEMVYEPLVSKI---ISQQPDLKVHTVHDEYTCFQYSESVDE 358
++ G ++D+GT + LP Y L + P ++ D TC+ S
Sbjct: 345 EDGAGGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLD--TCYDLSGYASV 402
Query: 359 GFPNVTFHFENSVSLKVYPHEYL 381
P V+F+F+ L + L
Sbjct: 403 RVPTVSFYFDQGAVLTLPARNLL 425
>gi|449454654|ref|XP_004145069.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449470632|ref|XP_004153020.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449499016|ref|XP_004160697.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 434
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 90/307 (29%), Positives = 136/307 (44%), Gaps = 29/307 (9%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKF 133
G Y +I +GTPP DTGSD++W C C C ++++ ++D S+T K
Sbjct: 81 GEYLVEISVGTPPFSIVAVADTGSDVIWTQCKPCSNCYQQNA-----PMFDPSKSTTYKN 135
Query: 134 VTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTST 193
V C C Y G + C+ ++ C Y YGD S + G D V SG
Sbjct: 136 VACSSPVCS--YSGDGSSCSDDSECLYSIAYGDDSHSQGNLAVDTVTMQSTSGRPVAFPR 193
Query: 194 NGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGING 253
+ GCG +G ++ + GI+G G+ +S+++QL + G + F++CL I
Sbjct: 194 T---VIGCGHDNAGTFNAN----VSGIVGLGRGPASLVTQLGPATGGK--FSYCLIPIGT 244
Query: 254 GG--------IFAIGHVVQPEVNKTPLVPNQPH---YSINMTAVQVGLDFLNLPTDVFGV 302
G + +V TP+ + + YS+ + AV VG N P +
Sbjct: 245 GSTNDSTKLNFGSNANVSGSGTVSTPIYSSAQYKTFYSLKLEAVSVGDTKFNFPEGASKL 304
Query: 303 GDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDE-GFP 361
G IIDSGTTL YLP + S ISQ L E+ + ++ + D+ P
Sbjct: 305 GGESNIIIDSGTTLTYLPSALLNSFGSA-ISQSMSLPHAQDPSEFLDYCFATTTDDYEMP 363
Query: 362 NVTFHFE 368
VT HFE
Sbjct: 364 PVTMHFE 370
>gi|125548488|gb|EAY94310.1| hypothetical protein OsI_16079 [Oryza sativa Indica Group]
Length = 423
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 99/329 (30%), Positives = 152/329 (46%), Gaps = 45/329 (13%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G + + IGTP Y VDTGSD++W C C +C ++S+ ++D SST
Sbjct: 70 GNGEFLMDVSIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQST-----PVFDPSSSSTY 124
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
V C C + P + CT+ + C Y YGD SST G + K
Sbjct: 125 ATVPCSSASCSDL---PTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAK-------- 173
Query: 192 STNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI 251
S ++FGCG G D ++ A G++G G+ S++SQL G+ K F++CL +
Sbjct: 174 SKLPGVVFGCGDTNEG--DGFSQGA--GLVGLGRGPLSLVSQL----GLDK-FSYCLTSL 224
Query: 252 NG--------GGIFAI--GHVVQPEVNKTPLV--PNQPH-YSINMTAVQVGLDFLNLPTD 298
+ G + I V TPL+ P+QP Y +++ A+ VG ++LP+
Sbjct: 225 DDTNNSPLLLGSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSS 284
Query: 299 VFGVGDN--KGTIIDSGTTLAYLPEMVYEPLVSKIISQQ--PDLKVHTVHDEYTCFQY-S 353
F V D+ G I+DSGT++ YL Y L +Q P V + CF+ +
Sbjct: 285 AFAVQDDGTGGVIVDSGTSITYLEVQGYRALKKAFAAQMALPAADGSGVGLDL-CFRAPA 343
Query: 354 ESVDE-GFPNVTFHFENSVSLKVYPHEYL 381
+ VD+ P + FHF+ L + Y+
Sbjct: 344 KGVDQVEVPRLVFHFDGGADLDLPAENYM 372
>gi|242050426|ref|XP_002462957.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
gi|241926334|gb|EER99478.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
Length = 452
Score = 115 bits (287), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 95/328 (28%), Positives = 139/328 (42%), Gaps = 35/328 (10%)
Query: 71 DGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSST 130
+G G Y+ + +GTPP + +DTGSD+ W C C + LYD SST
Sbjct: 91 NGAGAYHMILSVGTPPLAFPAIIDTGSDLTWTQCAPCTT----ACFAQPTPLYDPARSST 146
Query: 131 GKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQT 190
+ C C + C A T C Y Y G T GY D + GD
Sbjct: 147 FSKLPCASPLCQALPSA-FRACNA-TGCVYDYRYAVG-FTAGYLAADTLAIGDGDGDGDA 203
Query: 191 TSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDG 250
+S+ + FGC G++D + GI+G G+S S++SQ+ GV + F++CL
Sbjct: 204 SSSFAGVAFGCSTANGGDMDGAS-----GIVGLGRSALSLLSQI----GVGR-FSYCLRS 253
Query: 251 INGGG----IF-AIGHVVQPEVNKTPLVPN-------QPHYSINMTAVQVGLDFLNLPTD 298
G +F A+ +V +V T L+ N P+Y +N+T + VG L + +
Sbjct: 254 DADAGASPILFGALANVTGDKVQSTALLRNPVAARRRAPYYYVNLTGIAVGSTDLPVTSS 313
Query: 299 VFG--VGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEY---TCFQYS 353
FG G I+DSGTT YL E Y L +SQ L ++ CF+ +
Sbjct: 314 TFGFTAAGAGGVIVDSGTTFTYLAEAGYTMLRQAFLSQTAGLLTRVSGAQFDFDLCFE-A 372
Query: 354 ESVDEGFPNVTFHFENSVSLKVYPHEYL 381
+ D P + F F V Y
Sbjct: 373 GAADTPVPRLVFRFAGGAEYAVPRQSYF 400
>gi|125592062|gb|EAZ32412.1| hypothetical protein OsJ_16623 [Oryza sativa Japonica Group]
Length = 473
Score = 115 bits (287), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 92/323 (28%), Positives = 141/323 (43%), Gaps = 34/323 (10%)
Query: 71 DGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSST 130
DG G Y+ ++G+G+PP D Y+ VD+GSD++WV C C++C ++ L+D SS+
Sbjct: 125 DGSGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTD-----PLFDPAASSS 179
Query: 131 GKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQT 190
V+C C + G C Y YGDGS T G + + L
Sbjct: 180 FSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLT-------LGG 232
Query: 191 TSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL-- 248
T+ G I GCG R SG G++G G S++ QL + G +F++CL
Sbjct: 233 TAVQGVAI-GCGHRNSGLFVGAA-----GLLGLGWGAMSLVGQLGGAAG--GVFSYCLAS 284
Query: 249 DGINGGGIFAIGHVVQPEVNK--TPLVPNQ---PHYSINMTAVQVGLDFLNLPTDVFGVG 303
G G G +G V PLV N Y + +T + VG + L L +F +
Sbjct: 285 RGAGGAGSLVLGRTEAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDSLFQLT 344
Query: 304 DN--KGTIIDSGTTLAYLPEMVYEPLVSKI---ISQQPDLKVHTVHDEYTCFQYSESVDE 358
++ G ++D+GT + LP Y L + P ++ D TC+ S
Sbjct: 345 EDGAGGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLD--TCYDLSGYASV 402
Query: 359 GFPNVTFHFENSVSLKVYPHEYL 381
P V+F+F+ L + L
Sbjct: 403 RVPTVSFYFDQGAVLTLPARNLL 425
>gi|414880708|tpg|DAA57839.1| TPA: hypothetical protein ZEAMMB73_997765 [Zea mays]
Length = 452
Score = 115 bits (287), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 91/348 (26%), Positives = 148/348 (42%), Gaps = 47/348 (13%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKF 133
G Y+A IG+G PP V +DTGSD++W+ C+ C+ C R+ + LYD ++S T +
Sbjct: 90 GEYFAVIGVGDPPTHALVVIDTGSDLIWLQCLPCRRCYRQVT-----PLYDPRNSKTHRR 144
Query: 134 VTCDQEFCHGVYGGPLTDCTANT-SCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTS 192
+ C C GV P C A T C Y+ +YGDGS+++G D + L +
Sbjct: 145 IPCASPQCRGVLRYP--GCDARTGGCVYMVVYGDGSASSGDLATDTLV-------LPDDT 195
Query: 193 TNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL---- 248
++ GCG G L S G++G G+ S +QLA + G +F++CL
Sbjct: 196 RVHNVTLGCGHDNEGLLASAA-----GLLGAGRGQLSFPTQLAPAYG--HVFSYCLGDRM 248
Query: 249 -DGINGGGIFAIGHVVQ-PEVNKTPLV--PNQPH-YSINMTAVQVGLD----FLNLPTDV 299
N G + P TPL P +P Y ++M VG + F N +
Sbjct: 249 SRARNSSSYLVFGRTPELPSTAFTPLRTNPRRPSLYYVDMVGFSVGGERVAGFSNASLAL 308
Query: 300 FGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEG 359
G ++DSGT ++ Y + +S + + ++++ F V
Sbjct: 309 NPATGRGGVVVDSGTAISRFTRDAYAAVRDAFVSHAAAAGMRRLRNKFSVFDTCYDVHGN 368
Query: 360 -------FPNVTFHFENSVSLKVYPHEYLFPF-----EDLWCIGWQNS 395
P++ HF + + + YL P +C+G Q +
Sbjct: 369 GPGTGVRVPSIVLHFAAAADMALPQANYLIPVVGGDRRTYFCLGLQAA 416
>gi|449434470|ref|XP_004135019.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
gi|449517144|ref|XP_004165606.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 508
Score = 115 bits (287), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 110/395 (27%), Positives = 171/395 (43%), Gaps = 56/395 (14%)
Query: 9 LCIVLIATAAVGGVSSNHGV---FSVKYRYAGRERSL----SLLKEHD------ARRQQR 55
+ +VL GG+ S H F++ +R++ + + L ++H + R
Sbjct: 11 MLLVLSVFFLAGGLRSGHAASFKFTIHHRFSDSIKEIFGSEGLPEKHTPGYYAAMVHRDR 70
Query: 56 ILAGVDLPLGGSSRP------------DGVG-LYYAKIGIGTPPKDYYVQVDTGSDIMWV 102
+L G +L P G+G LYYA + IGTP + V +DTGSD+ W+
Sbjct: 71 LLHGRNLATTNGDTPLMFSYGNETYELSGLGNLYYANVSIGTPGLYFLVALDTGSDLFWL 130
Query: 103 NCIQCKECP----RRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTAN-TS 157
C +C +CP +R + L Y SST V C C C++N +S
Sbjct: 131 PC-ECTKCPTYLTKRDNGKFWLNHYSSNASSTSIRVPCSSSLCELA-----NQCSSNKSS 184
Query: 158 CPYLEIY-GDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEA 216
CPY Y + SS+ GY VQD++ + D Q + + GCG Q+G +N A
Sbjct: 185 CPYQTHYLSENSSSAGYLVQDILH--MATDDSQLKPVDVKVTLGCGKVQTGKF--SNVTA 240
Query: 217 LDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHVVQPEVNKTPLVPNQ 276
+G+IG G S+ S LAS G F+ C G G G G + +TP P
Sbjct: 241 PNGLIGLGMGKVSVPSFLASQGLTTDSFSMCF-GYYGYGRIDFGDIGPVGQRETPFNPAS 299
Query: 277 PHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQP 336
Y++ + + V N PT+V + IIDSG + YL + Y + + +
Sbjct: 300 LSYNVTILQIIV----TNRPTNV-----HLTAIIDSGASFTYLTDPFYSIITENMDAAME 350
Query: 337 DLKVHTVHD---EYTCFQYSESVDEGFPNVTFHFE 368
++ + D EY C++ S + PN+ F E
Sbjct: 351 LERIKSDSDFPFEY-CYRLSLATIFQQPNLNFTME 384
>gi|449495082|ref|XP_004159729.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
1-like [Cucumis sativus]
Length = 524
Score = 115 bits (287), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 85/265 (32%), Positives = 127/265 (47%), Gaps = 25/265 (9%)
Query: 75 LYYAKIGIGTPPKDYYVQVDTGSDIMWV--NCIQCKECPRRSSLGIELTLYDIKDSSTGK 132
LYYA++ +GTP Y V +DTGSD+ W+ +C+ C + + +Y +SST K
Sbjct: 106 LYYAEVTVGTPGVPYLVALDTGSDLFWLPCDCVNCITGLNTTQGPVNFNIYSPNNSSTSK 165
Query: 133 FVTCDQEFCHGVYGGPLTDCTANT-SCPYLEIY-GDGSSTTGYFVQDVVQYDKVSGDLQT 190
V C C L C++ + +CPY Y D +S+TGY V+D++ + D+Q+
Sbjct: 166 EVQCSSSLCSH-----LDQCSSPSDTCPYQVSYLSDNTSSTGYLVEDILHL--TTNDVQS 218
Query: 191 TSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDG 250
N + GCG QSG S+ A +G+ G G N S+ S LA++G + F+ C
Sbjct: 219 KPVNARITLGCGKDQSGAFLSS--AAPNGLFGLGIENVSVPSILANAGLISNSFSLCFGP 276
Query: 251 INGGGIFAIGHVVQPEVNKTP--LVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGT 308
G I G P N+TP L P Y++++T + VG +L V
Sbjct: 277 ARMGRI-EFGDKGSPGQNETPFNLGRRHPTYNVSITQIGVGGHISDLDVAV--------- 326
Query: 309 IIDSGTTLAYLPEMVYEPLVSKIIS 333
I DSGT+ YL + Y K S
Sbjct: 327 IFDSGTSFTYLNDPAYSLFADKFAS 351
>gi|357159746|ref|XP_003578546.1| PREDICTED: aspartic proteinase-like protein 1-like [Brachypodium
distachyon]
Length = 530
Score = 115 bits (287), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 88/289 (30%), Positives = 137/289 (47%), Gaps = 29/289 (10%)
Query: 75 LYYAKIGIGTPPKDYYVQVDTGSDIMWV--NCIQCKECPRRSSLGIELTLYDIKDSSTGK 132
L+YA + +GTP + V +DTGSD+ WV +CI+C ++ +Y + SST +
Sbjct: 98 LHYAVVALGTPNVTFLVALDTGSDLFWVPCDCIKCAPLASPDYGDLKFDMYSPRKSSTSR 157
Query: 133 FVTCDQEFCHGVYGGPLTDCT-ANTSCPY-LEIYGDGSSTTGYFVQDVVQYDKVSGDLQT 190
V C C P DC+ A+ SCPY ++ + +S+ G V+DV+ SG Q+
Sbjct: 158 KVPCSSSLCD-----PQADCSAASNSCPYSIQYLSENTSSKGVLVEDVLYLTTESG--QS 210
Query: 191 TSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDG 250
T + FGCG QSG+ A +G++G G + S+ S LAS G F+ C G
Sbjct: 211 KITQAPITFGCGQVQSGSF--LGSAAPNGLLGLGMDSKSVPSLLASKGIAANSFSMCF-G 267
Query: 251 INGGGIFAIGHVVQPEVNKTPL--VPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNK-G 307
+G G G + +TPL P+Y+I++T VG D K
Sbjct: 268 EDGHGRINFGDTGSSDQLETPLNIYKQNPYYNISITGAMVGGKSF----------DTKFS 317
Query: 308 TIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESV 356
++DSGT+ L + +Y + S +Q + + H D F+Y S+
Sbjct: 318 AVVDSGTSFTALSDPMYTEITSTFNAQVKESRKHL--DASMPFEYCYSI 364
>gi|225217056|gb|ACN85339.1| aspartic proteinase nepenthesin-1 precursor [Oryza granulata]
Length = 521
Score = 114 bits (286), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 104/323 (32%), Positives = 135/323 (41%), Gaps = 44/323 (13%)
Query: 67 SSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQC-KECPRRSSLGIELTLYDI 125
S R G G Y IG+GTP Y V DTGSD WV C C C ++ + L+D
Sbjct: 173 SGRALGTGNYVVTIGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYKQ-----QEKLFDP 227
Query: 126 KDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVV---QYD 182
SST V+C C +Y T + C Y YGDGS + G+F D + YD
Sbjct: 228 ARSSTYANVSCAAPACSDLY----TRGCSGGHCLYSVQYGDGSYSIGFFAMDTLTLSSYD 283
Query: 183 KVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASS-GGVR 241
V G FGCG R G E A G++G G+ +S+ Q GGV
Sbjct: 284 AVKG----------FRFGCGERNEGLF---GEAA--GLLGLGRGKTSLPVQTYDKYGGV- 327
Query: 242 KMFAHCLDGINGG-GIFAIGHVVQPEVNKTPLVP----NQP-HYSINMTAVQVGLDFLNL 295
FAHCL + G G G V P N P Y + MT ++VG L++
Sbjct: 328 --FAHCLPARSSGTGYLDFGPGSPAAVGARQTTPMLTDNGPTFYYVGMTGIRVGGQLLSI 385
Query: 296 PTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDL---KVHTVHDEYTCFQY 352
P VF GTI+DSGT + LP Y L S S K + TC+ +
Sbjct: 386 PQSVF---STAGTIVDSGTVITRLPPAAYSSLRSAFASAMAARGYKKAPALSLLDTCYDF 442
Query: 353 SESVDEGFPNVTFHFENSVSLKV 375
+ + P V+ F+ L V
Sbjct: 443 TGMSEVAIPKVSLLFQGGAYLDV 465
>gi|357168101|ref|XP_003581483.1| PREDICTED: aspartic proteinase-like protein 1-like [Brachypodium
distachyon]
Length = 510
Score = 114 bits (286), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 92/304 (30%), Positives = 142/304 (46%), Gaps = 34/304 (11%)
Query: 42 LSLLKEHDARRQQRILAGVDLPL------GGSSRPDGVG-LYYAKIGIGTPPKDYYVQVD 94
++ L HD + R LA D P + + +G L+YA + +GTP + V +D
Sbjct: 64 VAALAGHD---RHRALAAADHPPLTFSEGNATLKVSNLGFLHYALVTVGTPGHTFMVALD 120
Query: 95 TGSDIMWVNCIQCKECPRRSS-LGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCT 153
TGSD+ W+ C QC CP +S + Y SST + V C+ +FC DC+
Sbjct: 121 TGSDLFWLPC-QCDGCPPPASGASGSASFYIPSMSSTSQAVPCNSDFCDH-----RKDCS 174
Query: 154 ANTSCPYLEIYGDG-SSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDST 212
+SCPY +Y +S++G+ V+DV+ Q ++FGCG Q+G+
Sbjct: 175 TTSSCPYKMVYVSADTSSSGFLVEDVLYLSTEDNHPQI--LKAQIMFGCGQVQTGSF--L 230
Query: 213 NEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHVVQPEVNKTPL 272
+ A +G+ G G S+ S LA G F+ C G +G G + G + +TPL
Sbjct: 231 DAAAPNGLFGLGIDMISVPSILAHKGLTSDSFSMCF-GRDGIGRISFGDQGSSDQEETPL 289
Query: 273 VPNQPH--YSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSK 330
NQ H Y+I +T + VG + ++L TI D+GTT YL + Y +
Sbjct: 290 DINQKHPTYAITITGITVGTEPMDL---------EFSTIFDTGTTFTYLADPAYTYITQS 340
Query: 331 IISQ 334
+Q
Sbjct: 341 FHTQ 344
>gi|356540838|ref|XP_003538891.1| PREDICTED: peroxidase [Glycine max]
Length = 829
Score = 114 bits (286), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 82/264 (31%), Positives = 127/264 (48%), Gaps = 23/264 (8%)
Query: 75 LYYAKIGIGTPPKDYYVQVDTGSDIMWV--NCIQCKECPRRSSLGIELTLYDIKDSSTGK 132
L++A + +GTPP + V +DTGSD+ W+ NC +C + I +YD+K SST +
Sbjct: 101 LHFANVSVGTPPLSFLVALDTGSDLFWLPCNCTKCVRGVESNGEKIAFNIYDLKGSSTSQ 160
Query: 133 FVTCDQEFCHGVYGGPLTDCTANTSCPY-LEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
V C+ C P +D + CPY + +G+STTG+ V+DV+ ++ D +T
Sbjct: 161 TVLCNSNLCELQRQCPSSD----SICPYEVNYLSNGTSTTGFLVEDVLHL--ITDDDETK 214
Query: 192 STNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI 251
+ + FGCG Q+G + A +G+ G G N S+ S LA G F+ C G
Sbjct: 215 DADTRITFGCGQVQTGAF--LDGAAPNGLFGLGMGNESVPSILAKEGLTSNSFSMCF-GS 271
Query: 252 NGGGIFAIGHVVQPEVNKTP--LVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTI 309
+G G G KTP L P Y+I +T + VG + +L I
Sbjct: 272 DGLGRITFGDNSSLVQGKTPFNLRALHPTYNITVTQIIVGGNAADLEFHA---------I 322
Query: 310 IDSGTTLAYLPEMVYEPLVSKIIS 333
DSGT+ +L + Y+ + + S
Sbjct: 323 FDSGTSFTHLNDPAYKQITNSFNS 346
>gi|356565521|ref|XP_003550988.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 634
Score = 114 bits (286), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 119/440 (27%), Positives = 190/440 (43%), Gaps = 70/440 (15%)
Query: 1 MGLCLRNCLCIVLIATAAVGGVSS-----NHG---VFSVKYRYAGRERSLSLLKEHDARR 52
M L + L ++LI AV G ++ +HG + + S S L D RR
Sbjct: 1 MARALTHHLSLILILIVAVAGDANLLRNRHHGSRPAMLLPLYLSAPNSSTSAL---DPRR 57
Query: 53 QQRILAGVDLPLGGSSRPDGV---GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKE 109
Q P D + G Y ++ IGTPP+ + + VDTGS + +V C C++
Sbjct: 58 QLTGSESKRHPNARMRLHDDLLLNGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQ 117
Query: 110 CPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANT---SCPYLEIYGD 166
C R + + SST + V C DC ++ C Y Y +
Sbjct: 118 CGRHQD-----PKFQPESSSTYQPVKCT------------IDCNCDSDRMQCVYERQYAE 160
Query: 167 GSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKS 226
S+++G +D++ + Q+ +FGC ++G+L S + DGI+G G+
Sbjct: 161 MSTSSGVLGEDLISFGN-----QSELAPQRAVFGCENVETGDLYSQHA---DGIMGLGRG 212
Query: 227 NSSMISQLASSGGVRKMFAHCLDGIN-GGGIFAIGHVVQPE----VNKTPLVPNQPHYSI 281
+ S++ QL + F+ C G++ GGG +G + P P+ P+Y+I
Sbjct: 213 DLSIMDQLVDKNVISDSFSLCYGGMDVGGGAMVLGGISPPSDMAFAYSDPV--RSPYYNI 270
Query: 282 NMTAVQVGLDFLNLPTDVFGVGDNK-GTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKV 340
++ + V L L +VF D K GT++DSGTT AYLPE + I+ + LK
Sbjct: 271 DLKEIHVAGKRLPLNANVF---DGKHGTVLDSGTTYAYLPEAAFLAFKDAIVKELQSLKK 327
Query: 341 HTVHDEY---TCFQYS----ESVDEGFPNVTFHFENSVSLKVYPHEYLFPFEDL---WCI 390
+ D CF + + + FP V FEN + P Y+F + +C+
Sbjct: 328 ISGPDPNYNDICFSGAGIDVSQLSKSFPVVDMVFENGQKYTLSPENYMFRHSKVRGAYCL 387
Query: 391 G-WQNSGMQSRDRKNMTLLG 409
G +QN Q+ TLLG
Sbjct: 388 GVFQNGNDQT------TLLG 401
>gi|359497446|ref|XP_003635520.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
vinifera]
Length = 354
Score = 114 bits (285), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 100/334 (29%), Positives = 148/334 (44%), Gaps = 37/334 (11%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G YY K+G+G+P + Y + VDTGS + W +QCK C + + L+D S T
Sbjct: 9 GSGNYYVKVGLGSPARYYSMIVDTGSSLSW---LQCKPCVVYCHVQAD-PLFDPSASKTY 64
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTS---CPYLEIYGDGSSTTGYFVQDVVQYDKVSGDL 188
K ++C C + L + TS C Y YGD S + GY QD++ L
Sbjct: 65 KSLSCTSSQCSSLVDATLNNPLCETSSNVCVYTASYGDSSYSMGYLSQDLL-------TL 117
Query: 189 QTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL 248
+ T ++GCG G GI+G G++ SM+ Q++S G F++CL
Sbjct: 118 APSQTLPGFVYGCGQDSEGLFGRA-----AGILGLGRNKLSMLGQVSSKFGY--AFSYCL 170
Query: 249 DGINGGGIFAIGH--VVQPEVNKTPLV--PNQPH-YSINMTAVQVGLDFLNLPTDVFGVG 303
GGG +IG + TP+ P P Y + +TA+ VG L + + V
Sbjct: 171 PTRGGGGFLSIGKASLAGSAYKFTPMTTDPGNPSLYFLRLTAITVGGRALGVAAAQYRV- 229
Query: 304 DNKGTIIDSGTTLAYLPEMVYEP----LVSKIISQQPDLKVHTVHDEYTCFQYSESVDEG 359
TIIDSGT + LP VY P V + S+ ++ D TCF+ + +
Sbjct: 230 ---PTIIDSGTVITRLPMSVYTPFQQAFVKIMSSKYARAPGFSILD--TCFKGNLKDMQS 284
Query: 360 FPNVTFHFENSVSLKVYPHEYLFPF-EDLWCIGW 392
P V F+ L + P L E L C+ +
Sbjct: 285 VPEVRLIFQGGADLNLRPVNVLLQVDEGLTCLAF 318
>gi|224063191|ref|XP_002301033.1| predicted protein [Populus trichocarpa]
gi|222842759|gb|EEE80306.1| predicted protein [Populus trichocarpa]
Length = 536
Score = 114 bits (285), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 98/334 (29%), Positives = 153/334 (45%), Gaps = 31/334 (9%)
Query: 75 LYYAKIGIGTPPKDYYVQVDTGSDIMWV--NCIQCKECPRRS---SLGIELTLYDIKDSS 129
L+Y I IGTP + V +D GSD++WV +CIQC SL +L+ Y SS
Sbjct: 106 LHYTWIDIGTPNVSFLVALDAGSDLLWVPCDCIQCAPLSASYYNISLDRDLSEYSPSLSS 165
Query: 130 TGKFVTCDQEFCHGVYGGPLTDC-TANTSCPYLEIYGDGSSTT--GYFVQDVVQYDKVSG 186
T + ++CD + C +G ++C CPY+ Y D +TT G+ V+D + V
Sbjct: 166 TSRHLSCDHQLCE--WG---SNCKNPKDPCPYIFNYDDFENTTSAGFLVEDKLHLASVGD 220
Query: 187 DLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAH 246
S++ GCG +Q G+ + A DG++G G + S+ S LA +G ++ F+
Sbjct: 221 HTARKMLQASVVLGCGRKQGGSF--FDGAAPDGVMGLGPGDISVPSLLAKAGLIQNCFSL 278
Query: 247 CLDGINGGGIF--AIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGD 304
C D + G I GH Q TP +P Q Y A VG++ +
Sbjct: 279 CFDENDSGRILFGDRGHASQ---QSTPFLPIQGTY----VAYFVGVESYCVGNSCLKRSG 331
Query: 305 NKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDE--GFPN 362
K ++DSG++ YLP VY LVS+ +Q + K + D + Y+ S E P
Sbjct: 332 FKA-LVDSGSSFTYLPSEVYNELVSE-FDKQVNAKRISFQDGLWDYCYNASSQELHDIPA 389
Query: 363 VTFHFENSVSLKVYPHEYLFPFED---LWCIGWQ 393
+ F + + V+ Y P ++C+ Q
Sbjct: 390 IQLKFPRNQNFVVHNPTYSIPHHQGFTMFCLSLQ 423
>gi|312282765|dbj|BAJ34248.1| unnamed protein product [Thellungiella halophila]
Length = 515
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 99/342 (28%), Positives = 151/342 (44%), Gaps = 48/342 (14%)
Query: 75 LYYAKIGIGTPPKDYYVQVDTGSDIMWV------NCIQCKECPRRSSLGIELTLYDIKDS 128
L+YA + +GTP + V +DTGSD+ W+ NC++ + P SSL +L +Y S
Sbjct: 103 LHYANVTVGTPSDWFLVALDTGSDLFWLPCDCSTNCVRELKAPGGSSL--DLNIYSPNAS 160
Query: 129 STGKFVTCDQEFCHGV--YGGPLTDCTANTSCPY-LEIYGDGSSTTGYFVQDVVQYDKVS 185
ST V C+ C V PL+D CPY + +G+S+TG V+DV+ VS
Sbjct: 161 STSSKVPCNSTLCTRVDRCASPLSD------CPYQIRYLSNGTSSTGVLVEDVLHL--VS 212
Query: 186 GDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFA 245
+ + + GCG Q+G + A +G+ G G + S+ S LA G F+
Sbjct: 213 MEKNSKPIRARITLGCGLVQTGVFH--DGAAPNGLFGLGLEDISVPSVLAKEGIAANSFS 270
Query: 246 HCLDGINGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDN 305
C G +G G + G + +TPL QPH + N+T Q+ VG N
Sbjct: 271 MCF-GDDGAGRISFGDKGSVDQRETPLNIRQPHPTYNVTVTQI------------SVGGN 317
Query: 306 KG-----TIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEG- 359
G + D+GT+ YL + Y + S D + T D F+Y +V
Sbjct: 318 TGDLEFDAVFDTGTSFTYLTDAPYTLISESFNSLALDKRYQT--DSELPFEYCYAVSPNK 375
Query: 360 ----FPNVTFHFENSVSLKVYPHEYLFPFED--LWCIGWQNS 395
+P+V + S VY + P ED ++C+ S
Sbjct: 376 KSFEYPDVNLTMKGGSSYPVYHPLIVVPIEDTVVYCLAIMKS 417
>gi|357450863|ref|XP_003595708.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355484756|gb|AES65959.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 407
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 98/331 (29%), Positives = 153/331 (46%), Gaps = 49/331 (14%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y ++ IGTPP Y + DTGSD++W CI C +C ++ + ++D + SS+ +T
Sbjct: 60 YLMELSIGTPPIKIYAEADTGSDLVWFQCIPCTKCYKQQN-----PMFDPRSSSSYTNIT 114
Query: 136 CDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNG 195
C E C+ + + T +C Y Y D S T G Q+ + +G + + G
Sbjct: 115 CGTESCNKLDSSLCS--TDQKTCNYTYSYADNSITQGVLAQETLTLTSTTG--EPVAFQG 170
Query: 196 SLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASS-GGVRKMFAHCLDGIN-- 252
+IFGCG SG N+ + G+IG G+ S+ISQ+ SS G MF+ CL N
Sbjct: 171 -IIFGCGHNNSG----FNDREM-GLIGLGRGPLSLISQIGSSLGAGGNMFSQCLVPFNTD 224
Query: 253 -----------GGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFG 301
G + G V P ++K + Y + + V + +NLP F
Sbjct: 225 PSITSQMNFGKGSEVLGNGTVSTPLISK-----DGTGYFATLLGISV--EDINLP---FS 274
Query: 302 VGDNKGTI------IDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSES 355
G + GTI IDSGTT+ YLPE Y L+ + + + L+ + C+Q +
Sbjct: 275 NGSSLGTITKGNILIDSGTTITYLPEEFYHRLIEQ-VRNKVALEPFRIDGYELCYQTPTN 333
Query: 356 VDEGFPNVTFHFENSVSLKVYPHEYLFPFED 386
++ P +T HFE L + P + P +D
Sbjct: 334 LNG--PTLTIHFEGGDVL-LTPAQMFIPVQD 361
>gi|225438315|ref|XP_002272802.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 436
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 97/326 (29%), Positives = 150/326 (46%), Gaps = 49/326 (15%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G + K+ IGTP + Y +DTGSD++W C CK+C + + ++D K SS+
Sbjct: 93 GNGEFLMKLAIGTPAETYSAIMDTGSDLIWTQCKPCKDCFDQPT-----PIFDPKKSSSF 147
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
+ C + C + P++ C+ C YL YGD SST G + + GD +
Sbjct: 148 SKLPCSSDLCAAL---PISSCSDG--CEYLYSYGDYSSTQGVLATETFAF----GDASVS 198
Query: 192 STNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI 251
FGCG G+ S G++G G+ S+ISQL F++CL +
Sbjct: 199 KIG----FGCGEDNDGSGFSQGA----GLVGLGRGPLSLISQLG-----EPKFSYCLTSM 245
Query: 252 -NGGGIFAI---GHVVQPEVNKTPLV--PNQPH-YSINMTAVQVGLDFLNLPTDVFGVGD 304
+ GI ++ TPL+ P+QP Y +++ + VG L + F + +
Sbjct: 246 DDSKGISSLLVGSEATMKNAITTPLIQNPSQPSFYYLSLEGISVGDTLLPIEKSTFSIQN 305
Query: 305 N--KGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYT----CFQY---SES 355
+ G IIDSGTT+ YL + + L + ISQ LK+ T CF + +
Sbjct: 306 DGSGGLIIDSGTTITYLEDSAFAALKKEFISQ---LKLDVDESGSTGLDLCFTLPPDAST 362
Query: 356 VDEGFPNVTFHFENSVSLKVYPHEYL 381
VD P + FHFE + LK+ Y+
Sbjct: 363 VD--VPQLVFHFEGA-DLKLPAENYI 385
>gi|297829808|ref|XP_002882786.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
lyrata]
gi|297328626|gb|EFH59045.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
lyrata]
Length = 449
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 101/369 (27%), Positives = 171/369 (46%), Gaps = 42/369 (11%)
Query: 52 RQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECP 111
R+++ GV + LG S G Y+ ++ +GTP K + V VDTGS++ WVNC +
Sbjct: 65 RKRKFKGGVKMDLG-SGIDYGTAQYFTEVRVGTPAKKFRVVVDTGSELTWVNC---RYRG 120
Query: 112 RRSSLGIELTLYDIKDSSTGKFVTCDQEFCH----GVYGGPLTDC-TANTSCPYLEIYGD 166
R ++ ++S + K V C + C ++ L+ C T +T C Y Y D
Sbjct: 121 RGKGKVKNRRVFRAEESKSFKTVGCFTQTCKVDLMNLFS--LSTCPTPSTPCSYDYRYAD 178
Query: 167 GSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKS 226
GS+ G F ++ + +G + G L+ GC + S + + DG++G S
Sbjct: 179 GSAAQGVFAKETITVGLTNG--RKARLRG-LLVGCSSSFS----GQSFQGADGVLGLAFS 231
Query: 227 NSSMISQLASSGGVRKMFAHCL-DGINGGGI---FAIGHVVQPEVNKTP----------L 272
+ S S S G + ++CL D ++ I G+ KT L
Sbjct: 232 DFSFTSTATSLFGAK--LSYCLVDHLSNKNISNYLIFGYSSSSTSTKTAPGRTTPLDLTL 289
Query: 273 VPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLV---S 329
+P P Y+IN+ + +G D L++PT V+ GTI+DSGT+L L E Y+P+V +
Sbjct: 290 IP--PFYAINIIGISIGDDMLDIPTQVWDATTGGGTILDSGTSLTLLAEAAYKPVVTGLA 347
Query: 330 KIISQQPDLKVHTVHDEYTCFQYSESVDEG-FPNVTFHFENSVSLKVYPHEYLF-PFEDL 387
+ + + +K + EY CF + +E P +TFH + + + YL +
Sbjct: 348 RYLVELKRVKPEGIPIEY-CFSSTSGFNESKLPQLTFHLKGGARFEPHRKSYLVDAAPGV 406
Query: 388 WCIGWQNSG 396
C+G+ ++G
Sbjct: 407 KCLGFMSAG 415
>gi|357492401|ref|XP_003616489.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517824|gb|AES99447.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 97/332 (29%), Positives = 151/332 (45%), Gaps = 28/332 (8%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKF 133
G Y +GTPP Y DTGSDI+W+ C C++C +++ +++ SS+ K
Sbjct: 85 GGYLMTYSVGTPPTKIYGIADTGSDIVWLQCEPCEQCYNQTT-----PIFNPSKSSSYKN 139
Query: 134 VTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTST 193
+ C + CH V T C+ SC Y YGD S + G D + + SG + +
Sbjct: 140 IPCLSKLCHSVRD---TSCSDQNSCQYKISYGDSSHSQGDLSVDTLSLESTSG---SPVS 193
Query: 194 NGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI-- 251
+ GCG +G A GI+G G S+I+QL SS G + F++CL +
Sbjct: 194 FPKTVIGCGTDNAGTFGG----ASSGIVGLGGGPVSLITQLGSSIGGK--FSYCLVPLLN 247
Query: 252 ---NGGGIFAIGH--VVQPE-VNKTPLVPNQP-HYSINMTAVQVGLDFLNLPTDVFGVGD 304
N I + G VV + V TPL+ P Y + + A VG + G D
Sbjct: 248 KESNASSILSFGDAAVVSGDGVVSTPLIKKDPVFYFLTLQAFSVGNKRVEFGGSSEGGDD 307
Query: 305 NKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVT 364
IIDSGTTL +P VY L S ++ +V + +++ +S + FP +T
Sbjct: 308 EGNIIIDSGTTLTLIPSDVYTNLESAVVDLVKLDRVDDPNQQFSLCYSLKSNEYDFPIIT 367
Query: 365 FHFENSVSLKVYPHEYLFPFED-LWCIGWQNS 395
HF+ + ++++ P D + C +Q S
Sbjct: 368 AHFKGA-DIELHSISTFVPITDGIVCFAFQPS 398
>gi|414584783|tpg|DAA35354.1| TPA: hypothetical protein ZEAMMB73_186928 [Zea mays]
Length = 464
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 94/352 (26%), Positives = 157/352 (44%), Gaps = 42/352 (11%)
Query: 53 QQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPR 112
+ ++++G+D +G G Y+ ++GIG+PP + Y+ VD+GSD++WV C C EC
Sbjct: 113 ESKVVSGLD---------EGSGEYFVRVGIGSPPTEQYLVVDSGSDVIWVQCKPCLECYA 163
Query: 113 RSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTG 172
++ L+D S+T V C C + + C + C Y YGDGS T G
Sbjct: 164 QAD-----PLFDPATSATFSAVPCGSAVCRTLR---TSGCGDSGGCDYEVSYGDGSYTKG 215
Query: 173 YFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMIS 232
+ + L T+ G I GCG R G G++G G S++
Sbjct: 216 ALALETLT-------LGGTAVEGVAI-GCGHRNRGLFVGAA-----GLLGLGWGPMSLVG 262
Query: 233 QLASSGGVRKMFAHCLDGINGGGIFAIGH--VVQPEVNKTPLV--PNQPH-YSINMTAVQ 287
QL + F++CL G G +G V PLV P P Y + ++ +
Sbjct: 263 QLGGA--AGGAFSYCLAS-RGAGSLVLGRSEAVPEGAVWVPLVRNPQAPSFYYVGLSGIG 319
Query: 288 VGLDFLNLPTDVFGVGDN--KGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDL-KVHTVH 344
VG + L L D+F + ++ G ++D+GT + LP+ Y L ++ L + V
Sbjct: 320 VGDERLPLQEDLFQLTEDGAGGVVMDTGTAVTRLPQEAYAALRDAFVAAVGALPRAPGVS 379
Query: 345 DEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFE-DLWCIGWQNS 395
TC+ S P V+F+F+ + +L + L + ++C+ + S
Sbjct: 380 LLDTCYDLSGYTSVRVPTVSFYFDGAATLTLPARNLLLEVDGGIYCLAFAPS 431
>gi|356496606|ref|XP_003517157.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 508
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 101/350 (28%), Positives = 159/350 (45%), Gaps = 35/350 (10%)
Query: 38 RERSLSLLKEHDARRQQRILAGVDLPL-----GGSSRPDGVG-LYYAKIGIGTPPKDYYV 91
R+ +++ R +R+ AG PL + + + G L++A + +GTPP + V
Sbjct: 57 RQYYVAMAHRDRIFRGRRLAAGYHSPLTFIPSNETYQIEAFGFLHFANVSVGTPPLSFLV 116
Query: 92 QVDTGSDIMWVNCIQCKECPRRSSLG----IELTLYDIKDSSTGKFVTCDQEFCHGVYGG 147
+DTGSD+ W+ C C +C L I +YD+K SST + V C+ C
Sbjct: 117 ALDTGSDLFWLPC-NCTKCVHGIGLSNGEKIAFNIYDLKGSSTSQPVLCNSSLCELQRQC 175
Query: 148 PLTDCTANTSCPY-LEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQS 206
P +D T CPY + +G+STTG+ V+DV+ ++ D +T + + FGCG Q+
Sbjct: 176 PSSD----TICPYEVNYLSNGTSTTGFLVEDVLHL--ITDDDKTKDADTRITFGCGQVQT 229
Query: 207 GNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHVVQPE 266
G + A +G+ G G SN S+ S LA G F+ C G +G G G
Sbjct: 230 GAF--LDGAAPNGLFGLGMSNESVPSILAKEGLTSNSFSMCF-GSDGLGRITFGDNSSLV 286
Query: 267 VNKTP--LVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVY 324
KTP L P Y+I +T + VG +L I DSGT+ YL + Y
Sbjct: 287 QGKTPFNLRALHPTYNITVTQIIVGEKVDDLEFHA---------IFDSGTSFTYLNDPAY 337
Query: 325 EPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLK 374
+ + + S+ + T F+Y + PN T ++++K
Sbjct: 338 KQITNSFNSEIKLQRHSTSSSNELPFEYCYELS---PNQTVELSINLTMK 384
>gi|357143847|ref|XP_003573077.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 420
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 103/389 (26%), Positives = 164/389 (42%), Gaps = 40/389 (10%)
Query: 7 NCLCIVLIATAAVGGVSSNHGVFSVKYRYAGRERSLSLLKEHDARRQQRILAGVDLPLGG 66
L +VL+ + AV S V + G ++ L++ R + R L+G D
Sbjct: 5 QALSLVLLTSLAVSAPSGYRLVLTHVDSKGGYTKT-ELMRRAVHRSRLRALSGYD---AT 60
Query: 67 SSRPDGVGL-YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDI 125
S R V + Y ++ IG PP + DTGSD+ W C CK C + +YD
Sbjct: 61 SPRLHSVQVEYLMELAIGKPPVPFVALADTGSDLTWTQCQPCKLC-----FPQDTPVYDP 115
Query: 126 KDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVS 185
SST + C C ++ +CT ++ C Y YGDG+ + G + + S
Sbjct: 116 SASSTFSPLPCSSATCLPIWS---RNCTPSSLCRYRYAYGDGAYSAGILGTETLTLGPSS 172
Query: 186 GDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFA 245
+ G + FGCG G DS N G +G G+ S+++QL GV K F+
Sbjct: 173 APVSV----GGVAFGCGTDNGG--DSLNST---GTVGLGRGTLSLLAQL----GVGK-FS 218
Query: 246 HCLDGINGGGI---FAIGHVVQ-----PEVNKTPLV--PNQP-HYSINMTAVQVGLDFLN 294
+CL + F +G + + V TPL+ P P Y +++ + +G L
Sbjct: 219 YCLTDFFNSALDSPFLLGTLAELAPGPSTVQSTPLLQSPQNPSRYFVSLQGISLGDVRLP 278
Query: 295 LPTDVFGV-GDNK-GTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQY 352
+P F + GD G I+DSGTT L E + +V ++ V+ + CF
Sbjct: 279 IPNGTFDLRGDGTGGMIVDSGTTFTILAESGFREVVGRVARVLGQPPVNASSLDAPCFPA 338
Query: 353 SESVDEGFPNVTFHFENSVSLKVYPHEYL 381
P++ HF +++Y Y+
Sbjct: 339 PAGEPPYMPDLVLHFAGGADMRLYRDNYM 367
>gi|125586057|gb|EAZ26721.1| hypothetical protein OsJ_10629 [Oryza sativa Japonica Group]
Length = 474
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 119/410 (29%), Positives = 169/410 (41%), Gaps = 58/410 (14%)
Query: 36 AGRERSL-SLLKEHDAR---RQQRILAG--VDLPLGGSSRPDGV--GLYYAKIGIGTPPK 87
AGR S LL+ AR R R+L+G + S DGV Y + IGTPP+
Sbjct: 63 AGRGLSTRELLRRMAARSKARSARLLSGRAASARMDPGSYTDGVPDTEYLVHMAIGTPPQ 122
Query: 88 DYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGG 147
+ +DTGSD+ W C C C R+S L ++ S T + CD C +
Sbjct: 123 PVQLILDTGSDLTWTQCAPCVSCFRQS-----LPRFNPSRSMTFSVLPCDLRICRDLTWS 177
Query: 148 PLTDCT-ANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQS 206
+ + N C Y Y D S TTG+ D + + S L FGCG +
Sbjct: 178 SCGEQSWGNGICVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVP-DLTFGCGLFNN 236
Query: 207 GNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHVVQPE 266
G S NE GI GF + SM +QL F++C I G + V P
Sbjct: 237 GIFVS-NET---GIAGFSRGALSMPAQLKVDN-----FSYCFTAITGSEPSPVFLGVPPN 287
Query: 267 ------------VNKTPLV----PNQPHYSINMTAVQVGLDFLNLPTDVFGVGDN--KGT 308
V T L+ Y I++ V VG L +P VF + ++ GT
Sbjct: 288 LYSDAAGGGHGVVQSTALIRYHSSQLKAYYISLKGVTVGTTRLPIPESVFALKEDGTGGT 347
Query: 309 IIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYT--CFQYSESVDEGFPNVTFH 366
I+DSGT + LPE VY LV Q L VH + CF P + H
Sbjct: 348 IVDSGTGMTMLPEAVYN-LVCDAFVAQTKLTVHNSTSSLSQLCFSVPPGAKPDVPALVLH 406
Query: 367 FENSVSLKVYPHEYLFPFED-----LWCIGWQNSGMQSRDRKNMTLLGDF 411
FE + +L + Y+F E+ L C+ N+G ++++++G+F
Sbjct: 407 FEGA-TLDLPRENYMFEIEEAGGIRLTCLAI-NAG------EDLSVIGNF 448
>gi|222624819|gb|EEE58951.1| hypothetical protein OsJ_10630 [Oryza sativa Japonica Group]
Length = 440
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 111/395 (28%), Positives = 170/395 (43%), Gaps = 48/395 (12%)
Query: 36 AGRERSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGL--YYAKIGIGTPPKDYYVQV 93
+GRE + AR + + + P+ + DGV + Y + IGTPP+ + +
Sbjct: 49 SGRELMRRMALRSKARAPRLLSSSATAPVSPGAYDDGVPMTEYLLHLAIGTPPQPVQLTL 108
Query: 94 DTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCT 153
DTGS ++W C C C +S L YD SST +CD C +T C
Sbjct: 109 DTGSVLVWTQCQPCAVCFNQS-----LPYYDASRSSTFALPSCDSTQCK--LDPSVTMCV 161
Query: 154 ANT--SCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDS 211
T +C Y YGD S+T G+ + V + V+G ++ ++FGCG +G S
Sbjct: 162 NQTVQTCAYSYSYGDKSATIGFLDVETVSF--VAG-----ASVPGVVFGCGLNNTGIFRS 214
Query: 212 TNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHVVQPEVNK-- 269
NE GI GFG+ S+ SQL F+HC ++G + + ++ K
Sbjct: 215 -NET---GIAGFGRGPLSLPSQLKVGN-----FSHCFTAVSGRKPSTVLFDLPADLYKNG 265
Query: 270 ------TPLVPNQPH---YSINMTAVQVGLDFLNLPTDVFGVGDNK-GTIIDSGTTLAYL 319
TPL+ N H Y +++ + VG L +P F + + GTIIDSGT L
Sbjct: 266 RGTVQTTPLIKNPAHPTFYYLSLKGITVGSTRLPVPESAFALKNGTGGTIIDSGTAFTSL 325
Query: 320 PEMVYEPLVSKIISQQPDLKVHTVHD--EYTCFQYSE-SVDEGFPNVTFHFENSVSLKVY 376
P VY LV + L V ++ CF P + HFE + ++ +
Sbjct: 326 PPRVYR-LVHDEFAAHVKLPVVPSNETGPLLCFSAPPLGKAPHVPKLVLHFEGA-TMHLP 383
Query: 377 PHEYLFPFEDLWCIGWQNSGMQSRDRKNMTLLGDF 411
Y+F +D G S + MT++G+F
Sbjct: 384 RENYVFEAKD----GGNCSICLAIIEGEMTIIGNF 414
>gi|2541876|dbj|BAA22813.1| CND41, chloroplast nucleoid DNA binding protein [Nicotiana tabacum]
Length = 502
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 98/334 (29%), Positives = 150/334 (44%), Gaps = 46/334 (13%)
Query: 67 SSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIK 126
S P G G Y +G+GTP KD + DTGSD+ W C C +S + ++D
Sbjct: 145 SGLPLGTGNYIVNVGLGTPKKDLSLIFDTGSDLTWTQCQPC----VKSCYAQQQPIFDPS 200
Query: 127 DSSTGKFVTCDQEFCHGVYG--GPLTDCTANTSCPYLEIYGDGSSTTGYFVQD---VVQY 181
S T ++C C + G C++ ++C Y YGD S T G+F +D + Q
Sbjct: 201 TSKTYSNISCTSAACSSLKSATGNSPGCSS-SNCVYGIQYGDSSFTIGFFAKDKLTLTQN 259
Query: 182 DKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVR 241
D G +FGCG G T G+IG G+ S++ Q A G
Sbjct: 260 DVFDG----------FMFGCGQNNKGLFGKT-----AGLIGLGRDPLSIVQQTAQKFG-- 302
Query: 242 KMFAHCL---DGINGGGIFAIGHVVQPE------VNKTPLVPNQ--PHYSINMTAVQVGL 290
K F++CL G NG F G+ V+ + TP +Q +Y I++ + VG
Sbjct: 303 KYFSYCLPTSRGSNGHLTFGNGNGVKASKAVKNGITFTPFASSQGTAYYFIDVLGISVGG 362
Query: 291 DFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVS---KIISQQPDLKVHTVHDEY 347
L++ +F N GTIIDSGT + LP Y L S + +S+ P ++ D
Sbjct: 363 KALSISPMLF---QNAGTIIDSGTVITRLPSTAYGSLKSAFKQFMSKYPTAPALSLLD-- 417
Query: 348 TCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYL 381
TC+ S P ++F+F + ++++ P+ L
Sbjct: 418 TCYDLSNYTSISIPKISFNFNGNANVELDPNGIL 451
>gi|115452683|ref|NP_001049942.1| Os03g0317300 [Oryza sativa Japonica Group]
gi|108707833|gb|ABF95628.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548413|dbj|BAF11856.1| Os03g0317300 [Oryza sativa Japonica Group]
Length = 448
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 119/410 (29%), Positives = 169/410 (41%), Gaps = 58/410 (14%)
Query: 36 AGRERSL-SLLKEHDAR---RQQRILAG--VDLPLGGSSRPDGV--GLYYAKIGIGTPPK 87
AGR S LL+ AR R R+L+G + S DGV Y + IGTPP+
Sbjct: 37 AGRGLSTRELLRRMAARSKARSARLLSGRAASARMDPGSYTDGVPDTEYLVHMAIGTPPQ 96
Query: 88 DYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGG 147
+ +DTGSD+ W C C C R+S L ++ S T + CD C +
Sbjct: 97 PVQLILDTGSDLTWTQCAPCVSCFRQS-----LPRFNPSRSMTFSVLPCDLRICRDLTWS 151
Query: 148 PLTDCT-ANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQS 206
+ + N C Y Y D S TTG+ D + + S L FGCG +
Sbjct: 152 SCGEQSWGNGICVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVP-DLTFGCGLFNN 210
Query: 207 GNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHVVQPE 266
G S NE GI GF + SM +QL F++C I G + V P
Sbjct: 211 GIFVS-NET---GIAGFSRGALSMPAQLKVDN-----FSYCFTAITGSEPSPVFLGVPPN 261
Query: 267 ------------VNKTPLV----PNQPHYSINMTAVQVGLDFLNLPTDVFGVGDN--KGT 308
V T L+ Y I++ V VG L +P VF + ++ GT
Sbjct: 262 LYSDAAGGGHGVVQSTALIRYHSSQLKAYYISLKGVTVGTTRLPIPESVFALKEDGTGGT 321
Query: 309 IIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYT--CFQYSESVDEGFPNVTFH 366
I+DSGT + LPE VY LV Q L VH + CF P + H
Sbjct: 322 IVDSGTGMTMLPEAVYN-LVCDAFVAQTKLTVHNSTSSLSQLCFSVPPGAKPDVPALVLH 380
Query: 367 FENSVSLKVYPHEYLFPFED-----LWCIGWQNSGMQSRDRKNMTLLGDF 411
FE + +L + Y+F E+ L C+ N+G ++++++G+F
Sbjct: 381 FEGA-TLDLPRENYMFEIEEAGGIRLTCLAI-NAG------EDLSVIGNF 422
>gi|312282457|dbj|BAJ34094.1| unnamed protein product [Thellungiella halophila]
Length = 424
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 109/386 (28%), Positives = 164/386 (42%), Gaps = 57/386 (14%)
Query: 50 ARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNC----I 105
A R R + V P+ G+ P +G Y I IG PP+ YY+ +DTGSD+ W+ C +
Sbjct: 33 ADRFTRAASSVVFPVHGNVYP--LGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCV 90
Query: 106 QCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYG 165
C E P LY + + C+ C ++ C C Y Y
Sbjct: 91 HCLEAPH--------PLYQ----PSNDLIPCNDPLCKALHFNGNHRCETPEQCDYEVEYA 138
Query: 166 DGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGK 225
DG S+ G V+DV + G L+ T L GCG Q ++ LDG++G G+
Sbjct: 139 DGGSSLGVLVRDVFSLNYTKG-LRLTP---RLALGCGYDQIPG--ASGHHPLDGVLGLGR 192
Query: 226 SNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHVV--QPEVNKTPLV-PNQPHYSIN 282
S++SQL S G V+ + HCL + GGGI G+ + V+ TP+ N HYS
Sbjct: 193 GKVSILSQLHSQGYVKNVVGHCLSSL-GGGILFFGNDLYDSSRVSWTPMARENSKHYSPA 251
Query: 283 MTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEP---LVSKIISQQPDLK 339
M L F T + N T+ DSG++ Y Y+ L+ + +S +P +
Sbjct: 252 MGG---ELLFGGRTTGL----KNLLTVFDSGSSYTYFNSKAYQAVTYLLKRELSGKPLKE 304
Query: 340 VHTVHDEYTCFQYS------ESVDEGFPNVTFHFE----NSVSLKVYPHEYL-FPFEDLW 388
H C+Q E V + F + F+ + ++ P YL +
Sbjct: 305 ARDDHTLPLCWQGRRPFMSIEEVKKYFKPLALSFKTGWRSKTLFEIPPEAYLIISMKGNV 364
Query: 389 CIGWQNS---GMQSRDRKNMTLLGDF 411
C+G N G+Q N+ L+GD
Sbjct: 365 CLGILNGTEIGLQ-----NLNLIGDI 385
>gi|168043550|ref|XP_001774247.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674374|gb|EDQ60883.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 419
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 107/362 (29%), Positives = 161/362 (44%), Gaps = 47/362 (12%)
Query: 39 ERSLSLLKEHDARRQQ---RILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDT 95
E L+ +K RR Q ILA + L + G G Y I G+PP+ V VDT
Sbjct: 42 EIFLAAVKRGAERRAQLSKHILA--EGRLFSTPVASGNGEYLIDISFGSPPQKASVIVDT 99
Query: 96 GSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTAN 155
GSD++W C+ C+ C +S+ ++D SST V+C FC + P CT
Sbjct: 100 GSDLIWTQCLPCETCNAAASV-----IFDPVKSSTYDTVSCASNFCSSL---PFQSCT-- 149
Query: 156 TSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEE 215
TSC Y +YGDGSST+G T T ++ FGCG G+
Sbjct: 150 TSCKYDYMYGDGSSTSGAL--------STETVTVGTGTIPNVAFGCGHTNLGSF-----A 196
Query: 216 ALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLD--GINGGGIFAIG-HVVQPEVNKTPL 272
GI+G G+ S+ISQ +S K F++CL G IG V T L
Sbjct: 197 GAAGIVGLGQGPLSLISQASSI--TSKKFSYCLVPLGSTKTSPMLIGDSAAAGGVAYTAL 254
Query: 273 VPNQPH---YSINMTAVQVGLDFLNLPTDVFGV--GDNKGTIIDSGTTLAYLPEMVYEPL 327
+ N + Y ++T + V + P F + G I+DSGTTL YL + L
Sbjct: 255 LTNTANPTFYYADLTGISVSGKAVTYPVGTFSIDASGQGGFILDSGTTLTYLETGAFNAL 314
Query: 328 VSKIISQQPDLKVH-TVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFED 386
V+ + ++ P + +++ CF + + +P +TFHF+ + +Y P E+
Sbjct: 315 VAALKAEVPFPEADGSLYGLDYCFSTAGVANPTYPTMTFHFKGA--------DYELPPEN 366
Query: 387 LW 388
++
Sbjct: 367 VF 368
>gi|326490862|dbj|BAJ90098.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 447
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 112/363 (30%), Positives = 152/363 (41%), Gaps = 64/363 (17%)
Query: 60 VDLPLGGSSRPDGVGLYYAKIGIGTP-PKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGI 118
V P+ S G Y GIGTP P+ ++VDTGSD++W C C +C
Sbjct: 76 VTAPVASGSHVVGYTEYLIHFGIGTPRPQQVALEVDTGSDVVWTQCRPCFDC-----FTQ 130
Query: 119 ELTLYDIKDSSTGKFVTCDQEFC-----HGVYGGPLTDCTANTSCPYLEIYGDGSSTTGY 173
L +D S T V C C H + G C Y YGD S T G
Sbjct: 131 PLPRFDTSASDTVHGVLCTDPICRALRPHACFLG---------GCTYQVNYGDNSVTIGQ 181
Query: 174 FVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQ 233
+D +D G T L+FGCG +GN S NE GI GFG+ S+ Q
Sbjct: 182 LAKDSFTFDGKGGGKVTVP---DLVFGCGQYNTGNFHS-NET---GIAGFGRGPLSLPRQ 234
Query: 234 LASSGGVRKMFAHCLDGING--------GGIFAIG---HVVQPEVNKTPLVPNQP-HYSI 281
L S F++C I GG A G H P + TP +PN P +Y +
Sbjct: 235 LGVSS-----FSYCFTTIFESKSTPVFLGGAPADGLRAHATGP-ILSTPFLPNHPEYYYL 288
Query: 282 NMTAVQVGLDFLNLPTDVFGVGDN--KGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLK 339
++ + VG L +P F V + GTIIDSGT + P V+ L ++Q P
Sbjct: 289 SLKGITVGKTRLAVPESAFVVKADGSGGTIIDSGTAITAFPRAVFRSLWEAFVAQVP--L 346
Query: 340 VHTVHDE-----YTCFQYSESVDEG----FPNVTFHFENS---VSLKVYPHEYLFPFEDL 387
HT +++ CF +ESV + P +T H E + + + Y EY P D
Sbjct: 347 PHTSYNDTGEPTLQCFS-TESVPDASKVPVPKMTLHLEGADWELPRENYMAEY--PDSDQ 403
Query: 388 WCI 390
C+
Sbjct: 404 LCV 406
>gi|61214232|sp|Q766C2.1|NEP2_NEPGR RecName: Full=Aspartic proteinase nepenthesin-2; AltName:
Full=Nepenthesin-II; Flags: Precursor
gi|41016423|dbj|BAD07475.1| aspartic proteinase nepenthesin II [Nepenthes gracilis]
Length = 438
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 104/396 (26%), Positives = 173/396 (43%), Gaps = 54/396 (13%)
Query: 36 AGRERSLSLLKEHDARRQQRILAGVDLPLGGSSRPD-----GVGLYYAKIGIGTPPKDYY 90
+G+ + L + +R +R + ++ L SS + G G Y + IGTP +
Sbjct: 51 SGKNLTKYELIKRAIKRGERRMRSINAMLQSSSGIETPVYAGDGEYLMNVAIGTPDSSFS 110
Query: 91 VQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLT 150
+DTGSD++W C C +C + + +++ +DSS+ + C+ ++C + P
Sbjct: 111 AIMDTGSDLIWTQCEPCTQCFSQPT-----PIFNPQDSSSFSTLPCESQYCQDL---PSE 162
Query: 151 DCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLD 210
C N C Y YGDGS+T GY + ++ TS+ ++ FGCG G
Sbjct: 163 TCN-NNECQYTYGYGDGSTTQGYMATETFTFE--------TSSVPNIAFGCGEDNQGFGQ 213
Query: 211 STNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLD--GINGGGIFAIGHVVQPEVN 268
G+IG G S+ SQL GV + F++C+ G + A+G
Sbjct: 214 GNGA----GLIGMGWGPLSLPSQL----GVGQ-FSYCMTSYGSSSPSTLALGSAASGVPE 264
Query: 269 KTPLVP------NQPHYSINMTAVQVGLDFLNLPTDVFGVGDN--KGTIIDSGTTLAYLP 320
+P N +Y I + + VG D L +P+ F + D+ G IIDSGTTL YLP
Sbjct: 265 GSPSTTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLP 324
Query: 321 EMVYEPLVSKIISQQPDLKVHTVHDE----YTCFQY-SESVDEGFPNVTFHFENSVSLKV 375
+ Y + Q + + TV + TCFQ S+ P ++ F+ V
Sbjct: 325 QDAYNAVAQAFTDQ---INLPTVDESSSGLSTCFQQPSDGSTVQVPEISMQFDGGVLNLG 381
Query: 376 YPHEYLFPFEDLWCIGWQNSGMQSRDRKNMTLLGDF 411
+ + P E + C+ M S + +++ G+
Sbjct: 382 EQNILISPAEGVICL-----AMGSSSQLGISIFGNI 412
>gi|125590542|gb|EAZ30892.1| hypothetical protein OsJ_14967 [Oryza sativa Japonica Group]
Length = 516
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 104/351 (29%), Positives = 160/351 (45%), Gaps = 65/351 (18%)
Query: 50 ARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKE 109
ARR++R+ PDG ++ IGTP Y VDTGSD++W C C +
Sbjct: 161 ARRERRV-------------PDG------RV-IGTPALAYSAIVDTGSDLVWTQCKPCVD 200
Query: 110 CPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSS 169
C ++S+ ++D SST V C C + P + CT+ + C Y YGD SS
Sbjct: 201 CFKQST-----PVFDPSSSSTYATVPCSSASCSDL---PTSKCTSASKCGYTYTYGDSSS 252
Query: 170 TTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSS 229
T G + K S ++FGCG G D ++ A G++G G+ S
Sbjct: 253 TQGVLATETFTLAK--------SKLPGVVFGCGDTNEG--DGFSQGA--GLVGLGRGPLS 300
Query: 230 MISQLASSGGVRKMFAHCLDGING--------GGIFAI--GHVVQPEVNKTPLV--PNQP 277
++SQL G+ K F++CL ++ G + I V TPL+ P+QP
Sbjct: 301 LVSQL----GLDK-FSYCLTSLDDTNNSPLLLGSLAGISEASAAASSVQTTPLIKNPSQP 355
Query: 278 H-YSINMTAVQVGLDFLNLPTDVFGVGDN--KGTIIDSGTTLAYLPEMVYEPLVSKIISQ 334
Y +++ A+ VG ++LP+ F V D+ G I+DSGT++ YL Y L +Q
Sbjct: 356 SFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVIVDSGTSITYLEVQGYRALKKAFAAQ 415
Query: 335 Q--PDLKVHTVHDEYTCFQY-SESVDE-GFPNVTFHFENSVSLKVYPHEYL 381
P V + CF+ ++ VD+ P + FHF+ L + Y+
Sbjct: 416 MALPAADGSGVGLDL-CFRAPAKGVDQVEVPRLVFHFDGGADLDLPAENYM 465
>gi|7413629|emb|CAB85978.1| putative protein [Arabidopsis thaliana]
Length = 356
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 103/363 (28%), Positives = 152/363 (41%), Gaps = 76/363 (20%)
Query: 37 GRERSLSLLKEHDARRQQRILAGVDLPLGGS-----SRPDGV---GLYYAKIGIGTPPKD 88
E L+ L D+ R R+L P+ GS R + LYY + IGTPP++
Sbjct: 34 SHELDLTQLMTFDSARHGRLLQS---PVHGSFNWKVERDTSILLSALYYTTVQIGTPPRE 90
Query: 89 YYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGP 148
V +DTGSD++WV+C C CP + +T +D SS+ + C + C
Sbjct: 91 LDVVIDTGSDLVWVSCNSCVGCPLHN-----VTFFDPGASSSAVKLACSDKRCSSDLQKK 145
Query: 149 LTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGN 208
+ C+ SC Y YGDGS T+GY++ D++ +D +S D + + + RQ
Sbjct: 146 -SRCSLLESCTYKVEYGDGSVTSGYYISDLISFDTMS-DWTYIAFRDNSTWHPWVRQG-- 201
Query: 209 LDSTNEEALDGIIG-FGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHVVQPEV 267
IIG F S+ S ++S
Sbjct: 202 ----------AIIGTFPALCSTPCSTVSSQ------------------------------ 221
Query: 268 NKTPLVPNQPHYSINMTAVQVGLDFLNLPTD--VFGVGDNKGTIIDSGTTLAYLPEMVYE 325
PL N P +S MT V ++ L LP D VF V GTIIDSGTTL + P Y+
Sbjct: 222 ---PLYYN-PQFSHMMT---VAVNDLRLPIDPSVFSVAKGYGTIIDSGTTLVHFPGEAYD 274
Query: 326 PLVSKIISQQPDLKVHTVHDEYTCFQYSESVD------EGFPNVTFHFENSVSLKVYPHE 379
PL+ I++ ++ + CF + + + FP V F S+ + P
Sbjct: 275 PLIQAILNVVSQYGRPIPYESFQCFNITSGISSHLVIADMFPEVHLGFAGGASMVIKPEA 334
Query: 380 YLF 382
YLF
Sbjct: 335 YLF 337
>gi|356540371|ref|XP_003538663.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 374
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 90/333 (27%), Positives = 155/333 (46%), Gaps = 27/333 (8%)
Query: 73 VGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGK 132
+G Y ++ IGTPP Y DTGSD+ W +C+ C +C ++ + ++D + S++ +
Sbjct: 22 LGHYLMEVSIGTPPFKIYGIADTGSDLTWTSCVPCNKCYKQRN-----PIFDPQKSTSYR 76
Query: 133 FVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTS 192
++CD + CH + G C+ C Y Y + T G Q+ + G ++
Sbjct: 77 NISCDSKLCHKLDTG---VCSPQKHCNYTYAYASAAITQGVLAQETITLSSTKG--ESVP 131
Query: 193 TNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL---- 248
G ++FGCG +G N+ + GIIG G S ISQ+ SS G ++ F+ CL
Sbjct: 132 LKG-IVFGCGHNNTGGF---NDREM-GIIGLGGGPVSFISQIGSSFGGKR-FSQCLVPFH 185
Query: 249 DGINGGGIFAIG---HVVQPEVNKTPLVPNQPH--YSINMTAVQVGLDFLNLPTDVFGVG 303
++ ++G V V TPLV Q Y + + + VG +L+
Sbjct: 186 TDVSVSSKMSLGKGSEVSGKGVVSTPLVAKQDKTPYFVTLLGISVGNTYLHFNGSSSQSV 245
Query: 304 DNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNV 363
+ +DSGT LP +Y+ LV+++ S+ V D Y + P +
Sbjct: 246 EKGNVFLDSGTPPTILPTQLYDRLVAQVRSEVAMKPVTNDLDLGPQLCYRTKNNLRGPVL 305
Query: 364 TFHFENSVSLKVYPHE-YLFPFEDLWCIGWQNS 395
T HFE +K+ P + ++ P + ++C+G+ N+
Sbjct: 306 TAHFEGG-DVKLLPTQTFVSPKDGVFCLGFTNT 337
>gi|356508918|ref|XP_003523200.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 106/367 (28%), Positives = 167/367 (45%), Gaps = 51/367 (13%)
Query: 29 FSVKYRYAGRERSLSLLK--EHDARRQQRILAGVDLPLGGSSRPD-----------GVGL 75
F V R+ ++L+ L+ +H +R + L ++ + +S D G G
Sbjct: 48 FRVMLRHVDSGKNLTKLERVQHGIKRGKSRLQRLNAMVLAASTLDSEDQLEAPIHAGNGE 107
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y ++ IGTPP Y +DTGSD++W C C +C ++ + ++D K SS+ V+
Sbjct: 108 YLMELAIGTPPVSYPAVLDTGSDLIWTQCKPCTQCYKQPT-----PIFDPKKSSSFSKVS 162
Query: 136 CDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNG 195
C C V P + C+ C Y+ YGD S T G + + K + +
Sbjct: 163 CGSSLCSAV---PSSTCSDG--CEYVYSYGDYSMTQGVLATETFTFGKSKNKVSVHNIG- 216
Query: 196 SLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGIN--G 253
FGCG G+ E G++G G+ S++SQL F++CL ++
Sbjct: 217 ---FGCGEDNEGD----GFEQASGLVGLGRGPLSLVSQLK-----EPRFSYCLTPMDDTK 264
Query: 254 GGIFAIGHVVQ----PEVNKTPLVPN--QPH-YSINMTAVQVGLDFLNLPTDVFGVGD-- 304
I +G + + EV TPL+ N QP Y +++ + VG L++ F VGD
Sbjct: 265 ESILLLGSLGKVKDAKEVVTTPLLKNPLQPSFYYLSLEGISVGDTRLSIEKSTFEVGDDG 324
Query: 305 NKGTIIDSGTTLAYLPEMVYEPLVSKIISQQ--PDLKVHTVHDEYTCFQY-SESVDEGFP 361
N G IIDSGTT+ Y+ + +E L + ISQ P K + + CF S S P
Sbjct: 325 NGGVIIDSGTTITYIEQKAFEALKKEFISQTKLPLDKTSSTGLDL-CFSLPSGSTQVEIP 383
Query: 362 NVTFHFE 368
+ FHF+
Sbjct: 384 KIVFHFK 390
>gi|226503109|ref|NP_001147206.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195608496|gb|ACG26078.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|413921850|gb|AFW61782.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 441
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 90/328 (27%), Positives = 152/328 (46%), Gaps = 42/328 (12%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKF 133
G Y + IGTPP Y +DTGSD++W C C C + + +D+K S+T +
Sbjct: 87 GEYLVDLAIGTPPLYYTAIMDTGSDLIWTQCAPCLLCADQPT-----PYFDVKKSATYRA 141
Query: 134 VTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTST 193
+ C C + + C Y YGD +ST G + + + + +T
Sbjct: 142 LPCRSSRCASLS----SPSCFKKMCVYQYYYGDTASTAGVLANETFTFGA-ANSTKVRAT 196
Query: 194 NGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGING 253
N + FGCG+ +G+L +++ G++GFG+ S++SQL S F++CL
Sbjct: 197 N--IAFGCGSLNAGDLANSS-----GMVGFGRGPLSLVSQLGPS-----RFSYCLTSYLS 244
Query: 254 G-------GIFA----IGHVVQPEVNKTPLV--PNQPH-YSINMTAVQVGLDFLNLPTDV 299
G++A V TP V P P+ Y +++ A+ +G L + V
Sbjct: 245 ATPSRLYFGVYANLSSTNTSSGSPVQSTPFVINPALPNMYFLSLKAISLGTKLLPIDPLV 304
Query: 300 FGVGDN--KGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEY-TCFQY--SE 354
F + D+ G IIDSGT++ +L + YE + ++S P ++ TCFQ+
Sbjct: 305 FAINDDGTGGVIIDSGTSITWLQQDAYEAVRRGLVSAIPLPAMNDTDIGLDTCFQWPPPP 364
Query: 355 SVDEGFPNVTFHFENSVSLKVYPHEYLF 382
+V P++ FHF+ S ++ + P Y+
Sbjct: 365 NVTVTVPDLVFHFD-SANMTLLPENYML 391
>gi|159464048|ref|XP_001690254.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
gi|158284242|gb|EDP09992.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
Length = 485
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 89/338 (26%), Positives = 150/338 (44%), Gaps = 40/338 (11%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
+Y + +GTP + + V +DTGS I ++ C C C + ++ +D S+T K +
Sbjct: 13 FYTTLKLGTPERTFSVIIDTGSTITYIPCKDCSHCGKHTA-----EWFDPDKSTTAKKLA 67
Query: 136 CDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNG 195
C C+ G P C N C Y Y + SS+ G+ ++D + ++
Sbjct: 68 CGDPLCNC--GTPSCTCN-NDRCYYSRTYAERSSSEGWMIEDTFGFPDSDSPVR------ 118
Query: 196 SLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGG 255
L+FGC ++G + + DGI+G G ++++ SQL + +F+ C G G
Sbjct: 119 -LVFGCENGETGEI---YRQMADGIMGMGNNHNAFQSQLVQRKVIEDVFSLCF-GYPKDG 173
Query: 256 IFAIGHVVQPEVNKTPLVPNQPH-----YSINMTAVQVGLDFLNLPTDVFGVGDNKGTII 310
I +G V PE T P H Y++ M + V L VF G GT++
Sbjct: 174 ILLLGDVTLPEGANTVYTPLLTHLHLHYYNVKMDGITVNGQTLAFDASVFDRG--YGTVL 231
Query: 311 DSGTTLAYLPEMVYEPLVSKI---ISQQPDLKVHTVHDEYT--CFQYS----ESVDEGFP 361
DSGTT YLP ++ + + + ++ +Y C++ + + +D+ FP
Sbjct: 232 DSGTTFTYLPTDAFKAMAKAVGDYVEKKGLQSTPGADPQYNDICWKGAPDQFKDLDKYFP 291
Query: 362 NVTFHFENSVSLKVYPHEYLF---PFEDLWCIGWQNSG 396
F F L + P YLF P E +C+G ++G
Sbjct: 292 PAEFVFGGGAKLTLPPLRYLFLSKPAE--YCLGIFDNG 327
>gi|225217022|gb|ACN85307.1| aspartic proteinase nepenthesin-1 precursor [Oryza ridleyi]
Length = 525
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 100/321 (31%), Positives = 138/321 (42%), Gaps = 44/321 (13%)
Query: 69 RPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQC-KECPRRSSLGIELTLYDIKD 127
R G G Y IG+GTP Y V DTGSD WV C C C + + L+D
Sbjct: 179 RALGTGNYVVTIGLGTPAGRYTVVFDTGSDTTWVQCEPCVVVCYEQ-----QEKLFDPAR 233
Query: 128 SSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVV---QYDKV 184
SST ++C C +Y T + C Y YGDGS + G+F D + YD +
Sbjct: 234 SSTDANISCAAPACSDLY----TKGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAI 289
Query: 185 SGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASS-GGVRKM 243
G FGCG R G E A G++G G+ +S+ Q GGV
Sbjct: 290 KG----------FRFGCGERNEGLF---GEAA--GLLGLGRGKTSLPVQAYDKYGGV--- 331
Query: 244 FAHCLDGINGG-GIFAIGHVVQPEVN---KTPLVPNQ--PHYSINMTAVQVGLDFLNLPT 297
FAHC + G G G P V+ TP++ + Y + +T ++VG L++P
Sbjct: 332 FAHCFPARSSGTGYLDFGPGSSPAVSTKLTTPMLVDNGLTFYYVGLTGIRVGGKLLSIPP 391
Query: 298 DVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKI---ISQQPDLKVHTVHDEYTCFQYSE 354
VF GTI+DSGT + LP Y L S I+ + K + TC+ ++
Sbjct: 392 SVF---TTAGTIVDSGTVITRLPPAAYSSLRSAFASAIAARGYKKAPALSLLDTCYDFTG 448
Query: 355 SVDEGFPNVTFHFENSVSLKV 375
P V+ F+ SL V
Sbjct: 449 MSQVAIPTVSLLFQGGASLDV 469
>gi|297832400|ref|XP_002884082.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297329922|gb|EFH60341.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 513
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 97/337 (28%), Positives = 153/337 (45%), Gaps = 33/337 (9%)
Query: 69 RPDGVG-LYYAKIGIGTPPKDYYVQVDTGSDIMWV-----NCIQCKECPRRSSLGIELTL 122
R D +G L+YA + +GTP + V +DTGSD+ W+ NC++ + P SSL +L +
Sbjct: 96 RVDALGFLHYANVTVGTPSDWFLVALDTGSDLFWLPCDCTNCVRELKAPGGSSL--DLNI 153
Query: 123 YDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPY-LEIYGDGSSTTGYFVQDVVQY 181
Y SST V C+ C G +N CPY + +G+S+TG V+DV+
Sbjct: 154 YSPNASSTSTKVPCNSTLC--TRGDRCASPESN--CPYQIRYLSNGTSSTGVLVEDVLHL 209
Query: 182 DKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVR 241
VS D + + + GCG Q+G + A +G+ G G + S+ S LA G
Sbjct: 210 --VSNDKSSKAIPARVTLGCGQVQTGVFH--DGAAPNGLFGLGLEDISVPSVLAKEGIAA 265
Query: 242 KMFAHCLDGINGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFG 301
F+ C G +G G + G + +TPL QPH + N+T ++ ++
Sbjct: 266 NSFSMCF-GNDGAGRISFGDKGSVDQRETPLNIRQPHPTYNITVTKISVE--------GN 316
Query: 302 VGDNK-GTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYT---CFQYSESVD 357
GD + + DSGT+ YL + Y + S D + T E C+ S + D
Sbjct: 317 TGDLEFDAVFDSGTSFTYLTDAAYTLISESFNSLALDKRYQTTDSELPFEYCYALSPNKD 376
Query: 358 E-GFPNVTFHFENSVSLKVYPHEYLFPFE--DLWCIG 391
+P V + S VY + P + D++C+
Sbjct: 377 SFQYPAVNLTMKGGSSYPVYHPLVVIPMKDTDVYCLA 413
>gi|449445106|ref|XP_004140314.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
gi|449479851|ref|XP_004155727.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 523
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 95/306 (31%), Positives = 137/306 (44%), Gaps = 30/306 (9%)
Query: 75 LYYAKIGIGTPPKDYYVQVDTGSDIMWV--NCIQCKECPRR--SSLGIELTLYDIKDSST 130
L+Y I +GTP + V +D GSD++WV +CIQC S L +L+ Y+ SST
Sbjct: 102 LHYTWIDLGTPSVPFLVALDVGSDLLWVPCDCIQCAPLSANYYSVLDRDLSEYNPALSST 161
Query: 131 GKFVTCDQEFCHGVYGGPLTDC-TANTSCPY-LEIYGDGSSTTGYFVQDVVQYDKVSGDL 188
K + C + C T C +AN C Y + Y D +ST+G+ ++D +Q S
Sbjct: 162 SKHLFCGHQLCAWS-----TTCKSANDPCTYKRDYYSDNTSTSGFMIEDKLQLTSFSKHG 216
Query: 189 QTTSTNGSLIFGCGARQSGN-LDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHC 247
+ S++FGCG +QSG+ LD A DG++G G N S+ + LA G VR F+ C
Sbjct: 217 THSLLQASVVFGCGRKQSGSYLDGA---APDGVMGLGPGNISVPTLLAQEGLVRNTFSLC 273
Query: 248 LDGINGGGIFAIGH---VVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGD 304
D NG G G Q PL Y I + + VG L
Sbjct: 274 FDN-NGSGRILFGDDGPATQQTTQFLPLFGEFAAYFIGVESFCVGSSCLQ--------RS 324
Query: 305 NKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYT---CFQYSESVDEGFP 361
++DSG++ YLP VY+ +V + Q V E C+ S V P
Sbjct: 325 GFQALVDSGSSFTYLPAEVYKKIVFEFDKQVKVNATRIVLRELPWNYCYNISTLVSFNIP 384
Query: 362 NVTFHF 367
++ F
Sbjct: 385 SMQLVF 390
>gi|297603570|ref|NP_001054261.2| Os04g0677100 [Oryza sativa Japonica Group]
gi|255675885|dbj|BAF16175.2| Os04g0677100 [Oryza sativa Japonica Group]
Length = 464
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 90/320 (28%), Positives = 141/320 (44%), Gaps = 37/320 (11%)
Query: 71 DGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSST 130
DG G Y+ ++G+G+PP D Y+ VD+GSD++WV C C++C ++ L+D SS+
Sbjct: 125 DGSGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTD-----PLFDPAASSS 179
Query: 131 GKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQT 190
V+C C + G C Y YGDGS T G + + L
Sbjct: 180 FSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLT-------LGG 232
Query: 191 TSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDG 250
T+ G I GCG R SG G++G G S++ QL + G +F++CL
Sbjct: 233 TAVQGVAI-GCGHRNSGLFVGAA-----GLLGLGWGAMSLVGQLGGAAG--GVFSYCLAS 284
Query: 251 INGGGIFAIGHVVQPEVNKTPLVPN----QPHYSINMTAVQVGLDFLNLPTDVFGVGDN- 305
GG G +V + +T VP Y + +T + VG + L L +F + ++
Sbjct: 285 RGAGG---AGSLV---LGRTEAVPRGRRASSFYYVGLTGIGVGGERLPLQDSLFQLTEDG 338
Query: 306 -KGTIIDSGTTLAYLPEMVYEPLVSKI---ISQQPDLKVHTVHDEYTCFQYSESVDEGFP 361
G ++D+GT + LP Y L + P ++ D TC+ S P
Sbjct: 339 AGGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLD--TCYDLSGYASVRVP 396
Query: 362 NVTFHFENSVSLKVYPHEYL 381
V+F+F+ L + L
Sbjct: 397 TVSFYFDQGAVLTLPARNLL 416
>gi|449515031|ref|XP_004164553.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 430
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 93/321 (28%), Positives = 146/321 (45%), Gaps = 38/321 (11%)
Query: 67 SSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIK 126
SS G G Y + IGTPP DY DTGSD+ W C+ C +C ++ +++
Sbjct: 83 SSIGPGSGEYLMSVSIGTPPVDYLGIADTGSDLTWAQCLPCLKCYQQLR-----PIFNPL 137
Query: 127 DSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSG 186
S++ V C+ + CH V G C C Y YGD + + G + ++K++
Sbjct: 138 KSTSFSHVPCNTQTCHAVDDG---HCGVQGVCDYSYTYGDRTYSKGD-----LGFEKIT- 188
Query: 187 DLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAH 246
S++ + GCG SG + G+IG G S++SQ++ + G+ + F++
Sbjct: 189 ---IGSSSVKSVIGCGHASSGGFGFAS-----GVIGLGGGQLSLVSQMSQTSGISRRFSY 240
Query: 247 CLDGI----NGGGIFAIGHVVQ-PEVNKTPLVPNQ--PHYSINMTAVQVGLDFLNLPTDV 299
CL + NG F VV P V TPL+ +Y I + A+ +G N
Sbjct: 241 CLPTLLSHANGKINFGENAVVSGPGVVSTPLISKNTVTYYYITLEAISIG----NERHMA 296
Query: 300 FGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEY-TCFQ--YSESV 356
F N IIDSGTTL LP+ +Y+ +VS ++ +V H CF + +
Sbjct: 297 FAKQGN--VIIDSGTTLTILPKELYDGVVSSLLKVVKAKRVKDPHGSLDLCFDDGINAAA 354
Query: 357 DEGFPNVTFHFENSVSLKVYP 377
G P +T HF ++ + P
Sbjct: 355 SLGIPVITAHFSGGANVNLLP 375
>gi|15229663|ref|NP_190574.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|6522926|emb|CAB62113.1| putative protein [Arabidopsis thaliana]
gi|53828539|gb|AAU94379.1| At3g50050 [Arabidopsis thaliana]
gi|55733749|gb|AAV59271.1| At3g50050 [Arabidopsis thaliana]
gi|332645100|gb|AEE78621.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 632
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 98/353 (27%), Positives = 168/353 (47%), Gaps = 49/353 (13%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKF 133
G Y ++ IGTPP+ + + VD+GS + +V C C++C + + + SST +
Sbjct: 91 GYYTTRLWIGTPPQMFALIVDSGSTVTYVPCSDCEQCGKHQDPKFQPEM-----SSTYQP 145
Query: 134 VTCDQEFCHGVYGGPLTDCTAN---TSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQT 190
V C+ DC + C Y Y + SS+ G +D++ + ++
Sbjct: 146 VKCNM------------DCNCDDDREQCVYEREYAEHSSSKGVLGEDLISFGN-----ES 188
Query: 191 TSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDG 250
T +FGC ++G+L S + DGIIG G+ + S++ QL G + F C G
Sbjct: 189 QLTPQRAVFGCETVETGDLYS---QRADGIIGLGQGDLSLVDQLVDKGLISNSFGLCYGG 245
Query: 251 IN-GGGIFAIGHVVQP-EVNKTPLVPNQ-PHYSINMTAVQVGLDFLNLPTDVFGVGDNKG 307
++ GGG +G P ++ T P++ P+Y+I++T ++V L+L + VF G
Sbjct: 246 MDVGGGSMILGGFDYPSDMVFTDSDPDRSPYYNIDLTGIRVAGKQLSLHSRVF--DGEHG 303
Query: 308 TIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHD---EYTCFQ-----YSESVDEG 359
++DSGTT AYLP+ + ++ + LK D + TCFQ Y + +
Sbjct: 304 AVLDSGTTYAYLPDAAFAAFEEAVMREVSTLKQIDGPDPNFKDTCFQVAASNYVSELSKI 363
Query: 360 FPNVTFHFENSVSLKVYPHEYLFPFEDL---WCIGWQNSGMQSRDRKNMTLLG 409
FP+V F++ S + P Y+F + +C+G +G + + TLLG
Sbjct: 364 FPSVEMVFKSGQSWLLSPENYMFRHSKVHGAYCLGVFPNG-----KDHTTLLG 411
>gi|125543634|gb|EAY89773.1| hypothetical protein OsI_11315 [Oryza sativa Indica Group]
Length = 474
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 120/424 (28%), Positives = 172/424 (40%), Gaps = 60/424 (14%)
Query: 24 SNHGVFSVKYRYAGRERSLS---LLKEHDAR---RQQRILAG--VDLPLGGSSRPDGV-- 73
S+ + +A R LS LL AR R R+L+G + S DGV
Sbjct: 49 SDAAALRLHATHADAGRGLSTRELLHRMAARSKARSARLLSGRAASARVDPGSYTDGVPD 108
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKF 133
Y + IGTPP+ + +DTGSD+ W C C C R+S L ++ S T
Sbjct: 109 TEYLVHMAIGTPPQPVQLILDTGSDLTWTQCAPCVSCFRQS-----LPRFNPSRSMTFSV 163
Query: 134 VTCDQEFCHGVYGGPLTDCT-ANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTS 192
+ CD C + + + N C Y Y D S TTG+ D + + S
Sbjct: 164 LPCDLRICRDLTWSSCGEQSWGNGICVYAYAYADHSITTGHLDSDTFSFASADHAIGGAS 223
Query: 193 TNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGIN 252
L FGCG +G S NE GI GF + SM +QL F++C I
Sbjct: 224 VP-DLTFGCGLFNNGIFVS-NET---GIAGFSRGALSMPAQLKVDN-----FSYCFTAIT 273
Query: 253 GGGIFAIGHVVQPE------------VNKTPLV----PNQPHYSINMTAVQVGLDFLNLP 296
G + V P V T L+ Y I++ V VG L +P
Sbjct: 274 GSEPSPVFLGVPPNLYSDAAGGGHGVVQSTALIRYHSSQLKAYYISLKGVTVGTTRLPIP 333
Query: 297 TDVFGVGDN--KGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYT--CFQY 352
VF + ++ GTI+DSGT + LPE VY LV Q L VH + CF
Sbjct: 334 ESVFALKEDGTGGTIVDSGTGMTMLPEAVYN-LVCDAFVAQTKLTVHNSTSSLSQLCFSV 392
Query: 353 SESVDEGFPNVTFHFENSVSLKVYPHEYLFPFED-----LWCIGWQNSGMQSRDRKNMTL 407
P + HFE + +L + Y+F E+ L C+ N+G +++++
Sbjct: 393 PPGAKPDVPALVLHFEGA-TLDLPRENYMFEIEEAGGIRLTCLAI-NAG------EDLSV 444
Query: 408 LGDF 411
+G+F
Sbjct: 445 IGNF 448
>gi|357167693|ref|XP_003581287.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 468
Score = 112 bits (280), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 103/330 (31%), Positives = 147/330 (44%), Gaps = 46/330 (13%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G + + IGTP Y VDTGSD++W C C EC +S+ ++D SST
Sbjct: 114 GNGEFLMDMSIGTPALAYAAIVDTGSDLVWTQCKPCVECFNQST-----PVFDPSSSSTY 168
Query: 132 KFVTCDQEFCHGVYGGPLTDCT-ANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQT 190
+ C C + P + CT A C Y YGD SST G + K
Sbjct: 169 STLPCSSSLCSDL---PTSTCTSAAKDCGYTYTYGDASSTQGVLAAETFTLAK------- 218
Query: 191 TSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDG 250
T G + FGCG G D + A G++G G+ S++SQL G+ K F++CL
Sbjct: 219 TKLPG-VAFGCGDTNEG--DGFTQGA--GLVGLGRGPLSLVSQL----GLGK-FSYCLTS 268
Query: 251 ING--------GGIFAIG--HVVQPEVNKTPLV--PNQPH-YSINMTAVQVGLDFLNLPT 297
++ G + AI + TPL+ P+QP Y + + A+ VG + LP
Sbjct: 269 LDDTSKSPLLLGSLAAISTDTASAAAIQTTPLIKNPSQPSFYYVTLKALTVGSTRIPLPG 328
Query: 298 DVFGVGDN--KGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVH--TVHDEYTCFQYS 353
F V D+ G I+DSGT++ YL Y PL K + Q L V + CF+
Sbjct: 329 SAFAVQDDGTGGVIVDSGTSITYLELQGYRPL-KKAFAAQMKLPVADGSAVGLDLCFKAP 387
Query: 354 ES-VDE-GFPNVTFHFENSVSLKVYPHEYL 381
S VD+ P + HF+ L + Y+
Sbjct: 388 ASGVDDVEVPKLVLHFDGGADLDLPAENYM 417
>gi|225431324|ref|XP_002269880.1| PREDICTED: aspartic proteinase-like protein 1 [Vitis vinifera]
gi|297739017|emb|CBI28369.3| unnamed protein product [Vitis vinifera]
Length = 518
Score = 112 bits (280), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 97/323 (30%), Positives = 146/323 (45%), Gaps = 36/323 (11%)
Query: 27 GVFSVKYRYAGRERSLSLLKEHDARRQQRILAGVDLPLGGSS-RPDGVG-LYYAKIGIGT 84
G F A R+R+L RR I + G S+ R +G L+Y + +GT
Sbjct: 58 GSFEYYAELAHRDRALR------GRRLSDIDGLLTFSDGNSTFRISSLGFLHYTTVSLGT 111
Query: 85 PPKDYYVQVDTGSDIMWVNCIQCKECPRRS----SLGIELTLYDIKDSSTGKFVTCDQEF 140
P K + V +DTGSD+ WV C C C + EL++Y+ K SST + VTCD
Sbjct: 112 PGKKFLVALDTGSDLFWVPC-DCSRCAPTEGTTYASDFELSIYNPKGSSTSRKVTCDNSL 170
Query: 141 CHGVYGGPLTDCTANTS-CPYLEIYGDG-SSTTGYFVQDVVQYDKVSGDLQTTSTNGSLI 198
C C S CPY+ Y +ST+G V+DV+ + D + +
Sbjct: 171 C-----AHRNRCLGTFSNCPYMVSYVSAETSTSGILVEDVLHL--TTEDNRQEFVEAYVT 223
Query: 199 FGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFA 258
FGCG Q+G+ + A +G+ G G S+ S L+ G F+ C G +G G +
Sbjct: 224 FGCGQVQTGSF--LDIAAPNGLFGLGLEKISVPSILSKEGFTADSFSMCF-GPDGIGRIS 280
Query: 259 IGHVVQPEVNKTPLVPN--QPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTL 316
G P+ +TP N P Y+I +T V+VG ++L + + DSGT+
Sbjct: 281 FGDKGSPDQEETPFNLNALHPTYNITVTQVRVGTTLIDL---------DFTALFDSGTSF 331
Query: 317 AYLPEMVYEPLVSKIISQQPDLK 339
YL + +Y ++ SQ D +
Sbjct: 332 TYLVDPIYTNVLKSFHSQAQDSR 354
>gi|168041176|ref|XP_001773068.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675615|gb|EDQ62108.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 365
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 96/327 (29%), Positives = 148/327 (45%), Gaps = 35/327 (10%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKF 133
G Y A + +GTP + + V VDTGSD+ WV C C +C ++ L+ S++
Sbjct: 11 GEYLATVRLGTPERVFSVIVDTGSDLTWVQCSPCGKCYSQND-----ALFLPNTSTSFTK 65
Query: 134 VTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTST 193
+ C C+G+ P C T+C Y YGDGS TTG FV D + D ++G Q
Sbjct: 66 LACGSALCNGL---PFPMCN-QTTCVYWYSYGDGSLTTGDFVYDTITMDGINGQKQQVP- 120
Query: 194 NGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL-DGI- 251
+ FGCG G+ DGI+G G+ S SQL S F++CL D +
Sbjct: 121 --NFAFGCGHDNEGSF-----AGADGILGLGQGPLSFHSQLKSV--YNGKFSYCLVDWLA 171
Query: 252 ----NGGGIFAIGHV-VQPEVNKTPLVPNQP---HYSINMTAVQVGLDFLNLPTDVFGVG 303
+F V + P+V P++ N +Y + + + VG + LN+ + VF +
Sbjct: 172 PPTQTSPLLFGDAAVPILPDVKYLPILANPKVPTYYYVKLNGISVGDNLLNISSTVFDID 231
Query: 304 D--NKGTIIDSGTTLAYLPEMVYEPLVSKI--ISQQPDLKVHTVHDEYTCFQ-YSESVDE 358
GTI DSGTT+ L E Y+ +++ + + K+ + C + +
Sbjct: 232 SVGGAGTIFDSGTTVTQLAEAAYKEVLAAMNASTMAYSRKIDDISRLDLCLSGFPKDQLP 291
Query: 359 GFPNVTFHFENSVSLKVYPHEYLFPFE 385
P +TFHFE + + P Y E
Sbjct: 292 TVPAMTFHFEGG-DMVLPPSNYFIYLE 317
>gi|357118064|ref|XP_003560779.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 472
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 111/391 (28%), Positives = 169/391 (43%), Gaps = 54/391 (13%)
Query: 29 FSVKYRYAGRERSLSLLKEHDARRQQRILAGVDLP--LGGSSRPDGVGLYYAKIGIGTPP 86
F+ + R + R R+ +L++ RR G +P LGG D + Y +GIGTP
Sbjct: 80 FAERLR-SDRARADHILRKASGRRMMSEGGGASIPTYLGGFV--DSL-EYVVTLGIGTPA 135
Query: 87 KDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGV-Y 145
V +DTGSD+ WV QCK C + L+D SST + C + C +
Sbjct: 136 VQQTVLIDTGSDLSWV---QCKPCNASDCYPQKDPLFDPSKSSTFATIPCASDACKQLPV 192
Query: 146 GGPLTDCTANTS-----CPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFG 200
G CT NTS C Y YG+G+ T G + + + L +++ S FG
Sbjct: 193 DGYDNGCTNNTSGMPPQCGYAIEYGNGAITEGVYSTETLA-------LGSSAVVKSFRFG 245
Query: 201 CGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAI- 259
CG+ Q G D DG++G G + S++SQ AS G F++CL +N G F
Sbjct: 246 CGSDQHGPYDK-----FDGLLGLGGAPESLVSQTASVYG--GAFSYCLPPLNSGAGFLTL 298
Query: 260 -----------GHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGT 308
G V P +P + Y + +T + VG L++P VF KG
Sbjct: 299 GAPNSTNNSNSGFVFTPMHAFSPKIAT--FYVVTLTGISVGGKALDIPPAVFA----KGN 352
Query: 309 IIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEY--TCFQYSESVDEGFPNVTFH 366
I+DSGT + +P Y+ L + S + + D TC+ ++ P V
Sbjct: 353 IVDSGTVITGIPTTAYKALRTAFRSAMAEYPLLPPADSALDTCYNFTGHGTVTVPKVALT 412
Query: 367 FENSVSLKV-YPHEYLFPFEDLWCIGWQNSG 396
F ++ + P L ED C+ + ++G
Sbjct: 413 FVGGATVDLDVPSGVL--VED--CLAFADAG 439
>gi|449511696|ref|XP_004164029.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 639
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 100/351 (28%), Positives = 156/351 (44%), Gaps = 46/351 (13%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKF 133
G Y ++ IG+PP+++ + VDTGS + +V C C +C + L SST +
Sbjct: 87 GYYTTRLWIGSPPQEFALIVDTGSTVTYVPCSNCVQCGNHQDPRFQPEL-----SSTYQP 141
Query: 134 VTCDQEFCHGVYGGPLTDCTAN-TSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTS 192
V C+ + +C N C Y Y + S+++G +DV+ + K S + +
Sbjct: 142 VKCNAD----------CNCDENGVQCTYERRYAEMSTSSGVLAEDVMSFGKESELVPQRA 191
Query: 193 TNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGIN 252
+FGC +SG+L + + DGI+G G+ S++ QL G V F+ C G++
Sbjct: 192 -----VFGCETMESGDLYT---QRADGIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGGMD 243
Query: 253 -GGGIFAIGHVVQPE--VNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNK-GT 308
GGG +G + P V P+Y+I + + V L L F D K G
Sbjct: 244 VGGGAMVLGGISSPPGMVFSHSDPSRSPYYNIELKEIHVAGKPLKLNPRTF---DGKYGA 300
Query: 309 IIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHD---EYTCFQYS----ESVDEGFP 361
I+DSGTT AY PE Y I+ + LK + D + CF + + + FP
Sbjct: 301 ILDSGTTYAYFPEKAYYAFKDAIMKKISFLKQISGPDPNFKDICFSGAGRDVTELPKVFP 360
Query: 362 NVTFHFENSVSLKVYPHEYLFPFEDL---WCIGWQNSGMQSRDRKNMTLLG 409
V F N + + P YLF + +C+G +G TLLG
Sbjct: 361 EVDMVFANGQKISLSPENYLFRHTKVSGAYCLGIFKNG-----NDQTTLLG 406
>gi|255538124|ref|XP_002510127.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223550828|gb|EEF52314.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 641
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 109/384 (28%), Positives = 168/384 (43%), Gaps = 53/384 (13%)
Query: 46 KEHDARRQQRILAGVDLPLGGSSRPDGV---GLYYAKIGIGTPPKDYYVQVDTGSDIMWV 102
K + +R L DLP D + G Y ++ IGTPP+++ + VDTGS + +V
Sbjct: 55 KPFTSNYHRRQLHNSDLPNAHMRLYDDLLSNGYYTTRLFIGTPPQEFALIVDTGSTVTYV 114
Query: 103 NCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDC-TANTSCPYL 161
C C++C + + + SST K + C+ P +C C Y
Sbjct: 115 PCSTCEQCGKHQD-----PRFQPESSSTYKPMQCN----------PSCNCDDEGKQCTYE 159
Query: 162 EIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGII 221
Y + SS++G +DV+ + ++ T IFGC ++G L S + DGI+
Sbjct: 160 RRYAEMSSSSGLLAEDVLSFGN-----ESELTPQRAIFGCETVETGELFS---QRADGIM 211
Query: 222 GFGKSNSSMISQLASSGGVRKMFAHCLDGIN-GGGIFAIGHVVQPE----VNKTPLVPNQ 276
G G+ S++ QL V F+ C G++ GG +G++ P + P
Sbjct: 212 GLGRGPLSVVDQLVIKEVVGNSFSLCYGGMDVVGGAMVLGNIPPPPDMVFAHSDPY--RS 269
Query: 277 PHYSINMTAVQVGLDFLNLPTDVFGVGDNK-GTIIDSGTTLAYLPEMVYEPLVSKIISQQ 335
+Y+I + + V L L VF D K GT++DSGTT AYLPE + II +
Sbjct: 270 AYYNIELKELHVAGKRLKLNPRVF---DGKHGTVLDSGTTYAYLPEEAFVAFKDAIIKEI 326
Query: 336 PDLK-VHTVHDEYT--CFQYS----ESVDEGFPNVTFHFENSVSLKVYPHEYLFPFEDL- 387
LK +H Y CF + + + FP V F N L + P YLF +
Sbjct: 327 KFLKQIHGPDPSYNDICFSGAGRDVSQLSKIFPEVNMVFGNGQKLSLSPENYLFRHTKVS 386
Query: 388 --WCIGWQNSGMQSRDRKNMTLLG 409
+C+G +G + TLLG
Sbjct: 387 GAYCLGIFQNG-----KDPTTLLG 405
>gi|312281631|dbj|BAJ33681.1| unnamed protein product [Thellungiella halophila]
Length = 502
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 106/383 (27%), Positives = 162/383 (42%), Gaps = 41/383 (10%)
Query: 23 SSNHGVFSVKYRYAGRERSLSLLKEHDARRQQRILAGVDLPL-GGSSRPDGVGLYYAKIG 81
SS + K R+A S LK D + + P+ G+S+ G G Y+++IG
Sbjct: 112 SSRVAGIAAKIRFAVEGIDRSDLKPVDIDETRFQPEDLTTPVVSGTSQ--GSGEYFSRIG 169
Query: 82 IGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFC 141
+GTP K+ YV +DTGSD+ W+ C+ C EC ++S ++D SST K +TC C
Sbjct: 170 VGTPAKEMYVVLDTGSDVNWIQCLPCSECYQQSD-----PIFDPTSSSTFKSLTCSDPKC 224
Query: 142 HGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGC 201
+ ++ C +N C Y YGDGS T G + D V + + SG + + GC
Sbjct: 225 ASL---DVSACRSN-KCLYQVSYGDGSFTVGNYATDTVTFGE-SGKVN------DVALGC 273
Query: 202 GARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL---DGINGGGIFA 258
G G G SM +Q+ + K F++CL D +
Sbjct: 274 GHDNEGLFTGAAGLLGL-----GGGALSMTNQIKA-----KSFSYCLVDRDSAKSSSLDF 323
Query: 259 IGHVVQPEVNKTPLVPNQP---HYSINMTAVQVGLDFLNLPTDVFGVGDN--KGTIIDSG 313
+ PL+ N Y + ++ VG +++P+ +F V + G I+D G
Sbjct: 324 NSVQIGAGDATAPLLRNSKMDTFYYVGLSGFSVGGQQVSIPSSLFEVDASGAGGVILDCG 383
Query: 314 TTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEY--TCFQYSESVDEGFPNVTFHFENSV 371
T + L Y L + D K T TC+ +S P VTFHF
Sbjct: 384 TAVTRLQTQAYNSLRDAFVKLTTDFKKGTSPISLFDTCYDFSSLSTVKVPTVTFHFTGGK 443
Query: 372 SLKVYPHEYLFPFED--LWCIGW 392
SL + YL P +D +C +
Sbjct: 444 SLNLPAKNYLIPIDDAGTFCFAF 466
>gi|449447285|ref|XP_004141399.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 609
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 100/351 (28%), Positives = 156/351 (44%), Gaps = 46/351 (13%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKF 133
G Y ++ IG+PP+++ + VDTGS + +V C C +C + L SST +
Sbjct: 87 GYYTTRLWIGSPPQEFALIVDTGSTVTYVPCSNCVQCGNHQDPRFQPEL-----SSTYQP 141
Query: 134 VTCDQEFCHGVYGGPLTDCTAN-TSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTS 192
V C+ + +C N C Y Y + S+++G +DV+ + K S + +
Sbjct: 142 VKCNAD----------CNCDENGVQCTYERRYAEMSTSSGVLAEDVMSFGKESELVPQRA 191
Query: 193 TNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGIN 252
+FGC +SG+L + + DGI+G G+ S++ QL G V F+ C G++
Sbjct: 192 -----VFGCETMESGDLYT---QRADGIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGGMD 243
Query: 253 -GGGIFAIGHVVQPE--VNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNK-GT 308
GGG +G + P V P+Y+I + + V L L F D K G
Sbjct: 244 VGGGAMVLGGISSPPGMVFSHSDPSRSPYYNIELKEIHVAGKPLKLNPRTF---DGKYGA 300
Query: 309 IIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHD---EYTCFQYS----ESVDEGFP 361
I+DSGTT AY PE Y I+ + LK + D + CF + + + FP
Sbjct: 301 ILDSGTTYAYFPEKAYYAFKDAIMKKISFLKQISGPDPNFKDICFSGAGRDVTELPKVFP 360
Query: 362 NVTFHFENSVSLKVYPHEYLFPFEDL---WCIGWQNSGMQSRDRKNMTLLG 409
V F N + + P YLF + +C+G +G TLLG
Sbjct: 361 EVDMVFANGQKISLSPENYLFRHTKVSGAYCLGIFKNG-----NDQTTLLG 406
>gi|357158688|ref|XP_003578209.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 443
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 95/320 (29%), Positives = 144/320 (45%), Gaps = 47/320 (14%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKF 133
G Y +GIGTPP+ Y +DTGSD++W C C C + + +D S +
Sbjct: 87 GEYLMSMGIGTPPRYYSAILDTGSDLIWTQCAPCMLCVDQPT-----PFFDPAQSPSYAK 141
Query: 134 VTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTST 193
+ C+ C+ +Y PL C N C Y YGD ++T G + + G T T
Sbjct: 142 LPCNSPMCNALY-YPL--CYRNV-CVYQYFYGDSANTAGVLSNETFTF----GTNDTRVT 193
Query: 194 NGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGING 253
+ FGCG +G+L + + G++GFG+ S++SQL S F++CL
Sbjct: 194 VPRIAFGCGNLNAGSLFNGS-----GMVGFGRGPLSLVSQLGS-----PRFSYCLTSFMS 243
Query: 254 G-------GIFAIGHVVQPE----VNKTPLV--PNQP-HYSINMTAVQVGLDFLNLPTDV 299
G +A + V TP + P P Y +NMT + VG + L + V
Sbjct: 244 PVPSRLYFGAYATLNSTSASTGEPVQSTPFIVNPGLPTMYYLNMTGISVGGELLPIDPSV 303
Query: 300 FGVGDNKGT---IIDSGTTLAYLPEMVYEPLVSKIISQQ---PDLKVHTVHDEY-TCFQY 352
F + D GT IIDSG+T+ YL Y+ +V + + Q P ++ D TCF +
Sbjct: 304 FAINDADGTGGVIIDSGSTITYLARAAYD-MVHQAFADQVGLPLTNATSLADVLDTCFVW 362
Query: 353 SESVDE--GFPNVTFHFENS 370
+ P + FHFE +
Sbjct: 363 PPPPRKIVTMPELAFHFEGA 382
>gi|242044888|ref|XP_002460315.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
gi|241923692|gb|EER96836.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
Length = 456
Score = 112 bits (279), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 95/334 (28%), Positives = 149/334 (44%), Gaps = 51/334 (15%)
Query: 63 PLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTL 122
P G S RP G Y + IGTPP+ +DTGSD++W C C C L L
Sbjct: 89 PTGVSVRPSGDLEYVVDLAIGTPPQPVSALLDTGSDLIWTQCAPCASC-----LAQPDPL 143
Query: 123 YDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYD 182
+ +S++ + + C + C + C +C Y YGDG+ T G + + +
Sbjct: 144 FAPGESASYEPMRCAGQLCSDILH---HGCEMPDTCTYRYNYGDGTMTMGVYATERFTFT 200
Query: 183 KVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRK 242
GD T G FGCG+ G+L++ + GI+GFG++ S++SQL+ +R+
Sbjct: 201 SSGGDRLMTVPLG---FGCGSMNVGSLNNGS-----GIVGFGRNPLSLVSQLS----IRR 248
Query: 243 MFAHCLD------------GINGGGIFAIGHVVQPEVNKTPL---VPNQPHYSINMTAVQ 287
F++CL G GG++ G P V TPL + N Y +++ +
Sbjct: 249 -FSYCLTSYGSGRKSTLLFGSLSGGVY--GDATGP-VQTTPLLQSLQNPTFYYVHLAGLT 304
Query: 288 VGLDFLNLPTDVFGVGDN--KGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHT--V 343
VG L +P F + + G I+DSGT L LP V +V + QQ L
Sbjct: 305 VGARRLRIPESAFALRPDGSGGVIVDSGTALTLLPGAVLAEVV-RAFRQQLRLPFANGGN 363
Query: 344 HDEYTCF-------QYSESVDEGFPNVTFHFENS 370
++ CF + S + P + FHF+++
Sbjct: 364 PEDGVCFLVPAAWRRSSSTSQVPVPRMVFHFQDA 397
>gi|108707838|gb|ABF95633.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 391
Score = 111 bits (278), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 103/336 (30%), Positives = 147/336 (43%), Gaps = 53/336 (15%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y + IGTPP+ + +DTGSD++W C C C ++ L +D SST +
Sbjct: 35 YLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQA-----LPYFDPSTSSTLSLTS 89
Query: 136 CDQEFCHGVYGGPLTDCTA-----NTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQT 190
CD C G+ P+ C + N +C Y YGD S TTG+ ++ DK +
Sbjct: 90 CDSTLCQGL---PVASCGSPKFWPNQTCVYTYSYGDKSVTTGF-----LEVDKFTFVGAG 141
Query: 191 TSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDG 250
S G + FGCG +G S NE GI GFG+ S+ SQL F+HC
Sbjct: 142 ASVPG-VAFGCGLFNNGVFKS-NET---GIAGFGRGPLSLPSQLKVGN-----FSHCFTT 191
Query: 251 ING-----------GGIFAIGHVVQPEVNKTPLV------PNQPHYSINMTAVQVGLDFL 293
I G +F+ G Q V TPL+ N Y +++ + VG L
Sbjct: 192 ITGAIPSTVLLDLPADLFSNG---QGAVQTTPLIQYAKNEANPTLYYLSLKGITVGSTRL 248
Query: 294 NLPTDVFGVGDNK-GTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKV--HTVHDEYTCF 350
+P F + + GTIIDSGT++ LP VY+ +V + Q L V YTCF
Sbjct: 249 PVPESAFALTNGTGGTIIDSGTSITSLPPQVYQ-VVRDEFAAQIKLPVVPGNATGHYTCF 307
Query: 351 QYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFED 386
P + HFE + ++ + Y+F D
Sbjct: 308 SAPSQAKPDVPKLVLHFEGA-TMDLPRENYVFEVPD 342
>gi|356502091|ref|XP_003519855.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 519
Score = 111 bits (278), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 104/348 (29%), Positives = 157/348 (45%), Gaps = 40/348 (11%)
Query: 17 AAVGGVSSNHGVFSVKY--RYAGRERSLSLLKEHDARRQQRILAGVDLPLGGSS-RPDGV 73
+A G+ + +V+Y A R+R L R+ +I AG+ G S+ R +
Sbjct: 43 SAAAGIPAPPEEGTVEYYAELADRDRLLR------GRKLSQIDAGLAFSDGNSTFRISSL 96
Query: 74 G-LYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLG----IELTLYDIKDS 128
G L+Y + IGTP + V +DTGSD+ WV C C C S +L +Y+ S
Sbjct: 97 GFLHYTTVQIGTPGVKFMVALDTGSDLFWVPC-DCTRCAASDSTAFASDFDLNVYNPNGS 155
Query: 129 STGKFVTCDQEFCHGVYGGPLTDCTANTS-CPYLEIYGDG-SSTTGYFVQDVVQYDKVSG 186
ST K VTC+ C + C S CPY+ Y +ST+G V+DV+ +
Sbjct: 156 STSKKVTCNNSLC-----THRSQCLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTQEDN 210
Query: 187 DLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAH 246
N +IFGCG QSG+ + A +G+ G G S+ S L+ G F+
Sbjct: 211 HHDLVEAN--VIFGCGQIQSGSF--LDVAAPNGLFGLGMEKISVPSMLSREGFTADSFSM 266
Query: 247 CLDGINGGGIFAIGHVVQPEVNKTP--LVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGD 304
C G +G G + G + ++TP L P+ P Y+I +T V+VG +++
Sbjct: 267 CF-GRDGIGRISFGDKGSFDQDETPFNLNPSHPTYNITVTQVRVGTTVIDV--------- 316
Query: 305 NKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQY 352
+ DSGT+ YL + Y L SQ D + + D F+Y
Sbjct: 317 EFTALFDSGTSFTYLVDPTYTRLTESFHSQVQDRRHRS--DSRIPFEY 362
>gi|115468912|ref|NP_001058055.1| Os06g0610800 [Oryza sativa Japonica Group]
gi|51091964|dbj|BAD35493.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|113596095|dbj|BAF19969.1| Os06g0610800 [Oryza sativa Japonica Group]
gi|218198535|gb|EEC80962.1| hypothetical protein OsI_23679 [Oryza sativa Indica Group]
Length = 519
Score = 111 bits (278), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 100/318 (31%), Positives = 134/318 (42%), Gaps = 41/318 (12%)
Query: 69 RPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQC-KECPRRSSLGIELTLYDIKD 127
R G G Y +G+GTP Y V DTGSD WV C C C + L+D
Sbjct: 176 RALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQRE-----KLFDPAS 230
Query: 128 SSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVV---QYDKV 184
SST V+C C + ++ C+ C Y YGDGS + G+F D + YD V
Sbjct: 231 SSTYANVSCAAPACSDL---DVSGCSGG-HCLYGVQYGDGSYSIGFFAMDTLTLSSYDAV 286
Query: 185 SGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSM-ISQLASSGGVRKM 243
G FGCG R G E A G++G G+ +S+ + GGV
Sbjct: 287 KG----------FRFGCGERNDGLF---GEAA--GLLGLGRGKTSLPVQTYGKYGGV--- 328
Query: 244 FAHCLDGIN-GGGIFAIGHVVQPEVNKTPLVP-NQP-HYSINMTAVQVGLDFLNLPTDVF 300
FAHCL + G G G P TP++ N P Y + MT ++VG L + VF
Sbjct: 329 FAHCLPARSTGTGYLDFGAGSPPATTTTPMLTGNGPTFYYVGMTGIRVGGRLLPIAPSVF 388
Query: 301 GVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPD---LKVHTVHDEYTCFQYSESVD 357
GTI+DSGT + LP Y L S + K V TC+ ++
Sbjct: 389 AA---AGTIVDSGTVITRLPPAAYSSLRSAFAAAMAARGYRKAAAVSLLDTCYDFTGMSQ 445
Query: 358 EGFPNVTFHFENSVSLKV 375
P V+ F+ +L V
Sbjct: 446 VAIPTVSLLFQGGAALDV 463
>gi|317106730|dbj|BAJ53226.1| JHL06P13.6 [Jatropha curcas]
Length = 445
Score = 111 bits (278), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 104/339 (30%), Positives = 159/339 (46%), Gaps = 44/339 (12%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y+ +I IGTPP + V DTGSD++WV C C+EC ++ S +++ K SST
Sbjct: 90 GGGEYFMRISIGTPPIEVLVIADTGSDLIWVQCQPCQECYKQKS-----PIFNPKQSSTY 144
Query: 132 KFVTCDQEFCHGVYGGPLTDCTAN---TSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDL 188
+ V C+ +C+ + + C+A+ +C Y YGD S T GY + + +
Sbjct: 145 RRVLCETRYCNAL-NSDMRACSAHGFFKACGYSYSYGDHSFTMGYLATERFIIGSTNNSI 203
Query: 189 QTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL 248
Q L FGCG GN D E GI+G G + S+ISQL + + F++CL
Sbjct: 204 Q------ELAFGCGNSNGGNFD----EVGSGIVGLGGGSLSLISQLGTK--IDNKFSYCL 251
Query: 249 DGINGGGIFAIGHVVQPEVN---------KTPLVPNQPH--YSINMTAVQVG---LDFLN 294
I F++G +V + + TPLV +P Y + + A+ VG L + N
Sbjct: 252 VPILEKSNFSLGKIVFGDNSFISGSDTYVSTPLVSKEPETFYYLTLEAISVGNERLAYEN 311
Query: 295 LPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQ--Y 352
D G + IIDSGTTL +L +Y L + + ++ V D F +
Sbjct: 312 SRND--GNVEKGNIIIDSGTTLTFLDSKLYNKLE---LVLEKAVEGERVSDPNGIFSICF 366
Query: 353 SESVDEGFPNVTFHFENS-VSLKVYPHEYLFPFEDLWCI 390
+ + P +T HF ++ V LK + + EDL C
Sbjct: 367 RDKIGIELPIITVHFTDADVELKPI-NTFAKAEEDLLCF 404
>gi|195647908|gb|ACG43422.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
gi|414587776|tpg|DAA38347.1| TPA: aspartic-type endopeptidase/ pepsin A [Zea mays]
Length = 498
Score = 111 bits (278), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 89/292 (30%), Positives = 138/292 (47%), Gaps = 27/292 (9%)
Query: 75 LYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKEC-PRRSSLGIELTLYDIKDSSTGKF 133
L+YA + +GTP + + V +DTGSD+ W+ C QC C P ++ T Y SST K
Sbjct: 108 LHYALVTVGTPGQTFMVALDTGSDLFWLPC-QCDGCTPPATAASGSATFYIPGMSSTSKA 166
Query: 134 VTCDQEFCHGVYGGPLTDCTANTSCPYLEIY-GDGSSTTGYFVQDVVQYDKVSGDLQTTS 192
V C+ FC +C+ CPY +Y G+S++G+ V+DV+ + Q
Sbjct: 167 VPCNSNFCDL-----QKECSTALQCPYKMVYVSAGTSSSGFLVEDVLYLSTENAHPQI-- 219
Query: 193 TNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGIN 252
++ GCG Q+G+ + A +G+ G G S+ S LA G F+ C G +
Sbjct: 220 LKAQIMLGCGQTQTGSF--LDAAAPNGLFGLGIDEVSVPSILAQKGLTSNSFSMCF-GRD 276
Query: 253 GGGIFAIGHVVQPEVNKTPLVPNQPH--YSINMTAVQVGLDFLNLPTDVFGVGDNKGTII 310
G G + G + +TPL N+ H Y+I ++ + VG N PTD+ + TI
Sbjct: 277 GIGRISFGDQESSDQEETPLDINRQHPTYAITISGITVG----NKPTDMDFI-----TIF 327
Query: 311 DSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEG-FP 361
D+GT+ YL + Y +++ Q H D F+Y + E FP
Sbjct: 328 DTGTSFTYLADPAYT-YITQSFHAQVQANRHAA-DSRIPFEYCYDLSEARFP 377
>gi|297798582|ref|XP_002867175.1| hypothetical protein ARALYDRAFT_328390 [Arabidopsis lyrata subsp.
lyrata]
gi|297313011|gb|EFH43434.1| hypothetical protein ARALYDRAFT_328390 [Arabidopsis lyrata subsp.
lyrata]
Length = 425
Score = 111 bits (278), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 114/423 (26%), Positives = 181/423 (42%), Gaps = 66/423 (15%)
Query: 11 IVLIATAAVGGVSSNHGVFSVKYRYAGRERSLSLLKEHDARRQQRILAGVDLPLGGSSRP 70
I+LI + V G SS +V +R+ R + + R R ++ V P+ G+ P
Sbjct: 10 ILLIVMSLVLGFSS-----AVDFRW----RKTAGFSD----RFTRAVSSVVFPVHGNVYP 56
Query: 71 DGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNC----IQCKECPRRSSLGIELTLYDIK 126
+G Y I IG PP+ YY+ +DTGSD+ W+ C ++C E P LY
Sbjct: 57 --LGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPH--------PLYQ-- 104
Query: 127 DSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSG 186
+ + C+ C ++ C C Y Y DG S+ G V+DV + G
Sbjct: 105 --PSSDLIPCNDPLCKALHLNSNQRCETPEQCDYEVEYADGGSSLGVLVRDVFSMNYTKG 162
Query: 187 DLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAH 246
L+ T L GCG Q +++ LDG++G G+ S++SQL S G V+ + H
Sbjct: 163 -LRLTP---RLALGCGYDQIPG--ASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGH 216
Query: 247 CLDGINGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVG-LDFLNLPTDVFGVGDN 305
CL + GGGI G + + ++ P YS + + G L F T + N
Sbjct: 217 CLSSL-GGGILFFGDDLY-DSSRVSWTPMSREYSKHYSPAMGGELLFGGRTTGL----KN 270
Query: 306 KGTIIDSGTTLAYLPEMVYEP---LVSKIISQQPDLKVHTVHDEYTCFQYS------ESV 356
T+ DSG++ Y Y+ L+ + +S +P + H C+Q E V
Sbjct: 271 LLTVFDSGSSYTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEV 330
Query: 357 DEGFPNVTFHFE----NSVSLKVYPHEYL-FPFEDLWCIGWQNS---GMQSRDRKNMTLL 408
+ F + F+ + ++ P YL + C+G N G+Q N+ L+
Sbjct: 331 KKYFKPLALSFKTGWRSKTLFEIPPEAYLIISMKGNVCLGILNGTEIGLQ-----NLNLI 385
Query: 409 GDF 411
GD
Sbjct: 386 GDI 388
>gi|356495754|ref|XP_003516738.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 420
Score = 111 bits (278), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 91/333 (27%), Positives = 156/333 (46%), Gaps = 28/333 (8%)
Query: 73 VGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGK 132
+G Y ++ IGTPP Y DTGSD+ W +C+ C C ++ + ++D + S+T +
Sbjct: 69 LGHYLMELSIGTPPFKIYGIADTGSDLTWTSCVPCNNCYKQRN-----PMFDPQKSTTYR 123
Query: 133 FVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTS 192
++CD + CH + G C+ C Y Y + T G Q+ + G ++
Sbjct: 124 NISCDSKLCHKLDTG---VCSPQKRCNYTYAYASAAITRGVLAQETITLSSTKG--KSVP 178
Query: 193 TNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL---- 248
G ++FGCG +G N+ + GIIG G S+ISQ+ SS G ++ F+ CL
Sbjct: 179 LKG-IVFGCGHNNTGGF---NDHEM-GIIGLGGGPVSLISQMGSSFGGKR-FSQCLVPFH 232
Query: 249 --DGINGGGIFAIGHVVQPE-VNKTPLVPNQPH--YSINMTAVQVGLDFLNLPTDVFGVG 303
++ F G V + V TPLV Q Y + + + V +L+ V
Sbjct: 233 TDVSVSSKMSFGKGSKVSGKGVVSTPLVAKQDKTPYFVTLLGISVENTYLHFNGSSQNV- 291
Query: 304 DNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNV 363
+ +DSGT LP +Y+ +V+++ S+ V D Y + P +
Sbjct: 292 EKGNMFLDSGTPPTILPTQLYDQVVAQVRSEVAMKPVTDDPDLGPQLCYRTKNNLRGPVL 351
Query: 364 TFHFENSVSLKVYPHE-YLFPFEDLWCIGWQNS 395
T HFE + +K+ P + ++ P + ++C+G+ N+
Sbjct: 352 TAHFEGA-DVKLSPTQTFISPKDGVFCLGFTNT 383
>gi|225216879|gb|ACN85177.1| aspartic proteinase nepenthesin-1 precursor [Oryza nivara]
gi|225216896|gb|ACN85193.1| aspartic proteinase nepenthesin-1 precursor [Oryza rufipogon]
Length = 515
Score = 111 bits (278), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 100/318 (31%), Positives = 134/318 (42%), Gaps = 41/318 (12%)
Query: 69 RPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQC-KECPRRSSLGIELTLYDIKD 127
R G G Y +G+GTP Y V DTGSD WV C C C + L+D
Sbjct: 172 RALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQRE-----KLFDPAS 226
Query: 128 SSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVV---QYDKV 184
SST V+C C + ++ C+ C Y YGDGS + G+F D + YD V
Sbjct: 227 SSTYANVSCAAPACSDL---DVSGCSGG-HCLYGVQYGDGSYSIGFFAMDTLTLSSYDAV 282
Query: 185 SGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSM-ISQLASSGGVRKM 243
G FGCG R G E A G++G G+ +S+ + GGV
Sbjct: 283 KG----------FRFGCGERNDGLF---GEAA--GLLGLGRGKTSLPVQTYGKYGGV--- 324
Query: 244 FAHCLDGIN-GGGIFAIGHVVQPEVNKTPLVP-NQP-HYSINMTAVQVGLDFLNLPTDVF 300
FAHCL + G G G P TP++ N P Y + MT ++VG L + VF
Sbjct: 325 FAHCLPARSTGTGYLDFGAGSPPATTTTPMLTGNGPTFYYVGMTGIRVGGRLLPIAPSVF 384
Query: 301 GVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPD---LKVHTVHDEYTCFQYSESVD 357
GTI+DSGT + LP Y L S + K V TC+ ++
Sbjct: 385 AA---AGTIVDSGTVITRLPPAAYSSLRSAFAAAMAARGYRKAAAVSLLDTCYDFTGMSQ 441
Query: 358 EGFPNVTFHFENSVSLKV 375
P V+ F+ +L V
Sbjct: 442 VAIPTVSLLFQGGAALDV 459
>gi|357158691|ref|XP_003578210.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 459
Score = 111 bits (278), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 94/360 (26%), Positives = 149/360 (41%), Gaps = 44/360 (12%)
Query: 34 RYAGRERSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQV 93
R R +LS+ + + + R G RP G Y + +GTPP+ +
Sbjct: 62 RSKARAAALSVARLGGSNKGARQQDQNQQQPGLPVRPSGDLEYLVDLAVGTPPQPVSALL 121
Query: 94 DTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCT 153
DTGSD++W C C C L ++ SS+ + + C E C+ + C
Sbjct: 122 DTGSDLIWTQCAPCASC-----LPQPDPIFSPGASSSYEPMRCAGELCNDILH---HSCQ 173
Query: 154 ANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTN 213
+C Y YGDG++T G + + + S +TT + L FGCG G+L++ +
Sbjct: 174 RPDTCTYRYSYGDGTTTRGVYATERFTFSSSSSGGETTKLSAPLGFGCGTMNKGSLNNGS 233
Query: 214 EEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLD------------GINGGGIFAIGH 261
GI+GFG++ S++SQLA +R+ F++CL G GG++
Sbjct: 234 -----GIVGFGRAPLSLVSQLA----IRR-FSYCLTPYASGRKSTLLFGSLRGGVYDAAT 283
Query: 262 VVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDN--KGTIIDSGTTLAYL 319
N Y + T V VG L +P F + + G I+DSGT L
Sbjct: 284 ATVQTTRLLRSRQNPTFYYVPFTGVTVGARRLRIPISAFALRPDGSGGAIVDSGTALTLF 343
Query: 320 PEMVYEPLVSKIISQQPDLKV------HTVHDEYTCFQYSES---VDEGFPNVTFHFENS 370
P V +V SQ L++ + D+ CF + S P + FH + +
Sbjct: 344 PAPVLAEVVRAFRSQ---LRLPFAANGSSGPDDGVCFAAAASRVPRPAVVPRMVFHLQGA 400
>gi|217426809|gb|ACK44517.1| AT5G10080-like protein [Arabidopsis arenosa]
Length = 506
Score = 111 bits (278), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 105/368 (28%), Positives = 167/368 (45%), Gaps = 50/368 (13%)
Query: 44 LLKEHDARRQQRILAG---VDLPLGGS----SRPDGVGLYYAKIGIGTPPKDYYVQVDTG 96
LL + D RRQ+ L +P GS S D L+Y I IGTP + V +DTG
Sbjct: 61 LLAKSDFRRQRMNLGAKFQSLVPSEGSKTISSGNDFGWLHYTWIDIGTPSVSFLVALDTG 120
Query: 97 SDIMWV--NCIQCKECPRR--SSLGI-ELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTD 151
SD++W+ NC+QC SSL +L Y+ SST K C + C +D
Sbjct: 121 SDLLWIPCNCVQCAPLTSTYYSSLATKDLNEYNPSSSSTSKVFLCSHKLCDSA-----SD 175
Query: 152 C-TANTSCPYLEIYGDG-SSTTGYFVQDVVQYDKVSGDLQ---TTSTNGSLIFGCGARQS 206
C + CPY Y G +S++G V+D++ + + ++S ++ GCG +QS
Sbjct: 176 CESPKEQCPYTVNYLSGNTSSSGLLVEDILHLTYNTNNRLMNGSSSVKARVVIGCGKKQS 235
Query: 207 GNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIF--AIGHVVQ 264
G D + A DG++G G + S+ S L+ +G +R F+ C D + G I+ +G +Q
Sbjct: 236 G--DYLDGVAPDGLMGLGPAEISVPSFLSKAGLMRNSFSLCFDEEDSGRIYFGDMGPSIQ 293
Query: 265 PEVNKTPL--VPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEM 322
TP + N Y + + A +G L + T IDSG + YLPE
Sbjct: 294 ---QSTPFLQLENNSGYIVGVEACCIGNSCLKQTSFT--------TFIDSGQSFTYLPEE 342
Query: 323 VYEPLVSKIISQQPDLKVHTVHDEYTC----FQYSESVDEGFPNVTFHFENSVSLKVYPH 378
+Y + +I D ++ + + Y SV+ P + F ++ + + H
Sbjct: 343 IYRKVALEI-----DRHINATSKSFEGVSWEYCYESSVEPKVPAIKLKFSHNNTFVI--H 395
Query: 379 EYLFPFED 386
+ LF F+
Sbjct: 396 KPLFVFQQ 403
>gi|226530755|ref|NP_001150335.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195638492|gb|ACG38714.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 461
Score = 111 bits (277), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 90/303 (29%), Positives = 140/303 (46%), Gaps = 37/303 (12%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y +G+G+P + +DTGSD+ WV C C +C ++ L+D SST +
Sbjct: 128 YLITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQAD-----PLFDPSSSSTYSPFS 182
Query: 136 CDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNG 195
C C + G C++++ C Y+ YGDGSSTTG + D + +S
Sbjct: 183 CGSAACAQL-GQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLALG--------SSAVK 233
Query: 196 SLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI-NGG 254
S FGC +SG D T DG++G G S++SQ A G + + F++CL +
Sbjct: 234 SFQFGCSNVESGFNDQT-----DGLMGLGGGAQSLVSQTA--GTLGRAFSYCLPPTPSSS 286
Query: 255 GIFAIGHVVQPEVN---KTPLVPNQ---PHYSINMTAVQVGLDFLNLPTDVFGVGDNKGT 308
G +G + KTP++ + Y + + A++VG L++P VF + GT
Sbjct: 287 GFLTLGAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVF----SAGT 342
Query: 309 IIDSGTTLAYLPEMVYEPLVSKI---ISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTF 365
++DSGT + LP Y L S + Q P + + D TCF +S P+V
Sbjct: 343 VMDSGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSGILD--TCFDFSGQSSVSIPSVAL 400
Query: 366 HFE 368
F
Sbjct: 401 VFS 403
>gi|26452545|dbj|BAC43357.1| putative nucellin [Arabidopsis thaliana]
Length = 413
Score = 111 bits (277), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 104/382 (27%), Positives = 164/382 (42%), Gaps = 53/382 (13%)
Query: 52 RQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNC----IQC 107
R R ++ V P+ G+ P +G Y I IG PP+ YY+ +DTGSD+ W+ C ++C
Sbjct: 26 RFTRAVSSVVFPVHGNVYP--LGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRC 83
Query: 108 KECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDG 167
E P LY + + C+ C ++ C C Y Y DG
Sbjct: 84 LEAPH--------PLYQ----PSSDLIPCNDPLCKALHLNSNQRCETPEQCDYEVEYADG 131
Query: 168 SSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSN 227
S+ G V+DV + G L+ T L GCG Q +++ LDG++G G+
Sbjct: 132 GSSLGVLVRDVFSMNYTQG-LRLTP---RLALGCGYDQIPG--ASSHHPLDGVLGLGRGK 185
Query: 228 SSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQ 287
S++SQL S G V+ + HCL + GGGI G + + ++ P YS + +
Sbjct: 186 VSILSQLHSQGYVKNVIGHCLSSL-GGGILFFGDDLY-DSSRVSWTPMSREYSKHYSPAM 243
Query: 288 VG-LDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEP---LVSKIISQQPDLKVHTV 343
G L F T + N T+ DSG++ Y Y+ L+ + +S +P +
Sbjct: 244 GGELLFGGRTTGL----KNLLTVFDSGSSYTYFNSKAYQAVTYLLKRELSGKPLKEARDD 299
Query: 344 HDEYTCFQYS------ESVDEGFPNVTFHFE----NSVSLKVYPHEYL-FPFEDLWCIGW 392
H C+Q E V + F + F+ + ++ P YL + C+G
Sbjct: 300 HTLPLCWQGRRPFMSIEEVKKYFKPLALSFKTGWRSKTLFEIPPEAYLIISMKGNVCLGI 359
Query: 393 QNS---GMQSRDRKNMTLLGDF 411
N G+Q N+ L+GD
Sbjct: 360 LNGTEIGLQ-----NLNLIGDI 376
>gi|334187133|ref|NP_001190905.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|21592493|gb|AAM64443.1| nucellin-like protein [Arabidopsis thaliana]
gi|332660834|gb|AEE86234.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 425
Score = 111 bits (277), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 114/423 (26%), Positives = 181/423 (42%), Gaps = 66/423 (15%)
Query: 11 IVLIATAAVGGVSSNHGVFSVKYRYAGRERSLSLLKEHDARRQQRILAGVDLPLGGSSRP 70
IVL+ + V G SS +V +R+ R + + R R ++ V P+ G+ P
Sbjct: 10 IVLMVMSLVLGFSS-----AVDFRW----RKTAGFSD----RFTRAVSSVVFPVHGNVYP 56
Query: 71 DGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNC----IQCKECPRRSSLGIELTLYDIK 126
+G Y I IG PP+ YY+ +DTGSD+ W+ C ++C E P LY
Sbjct: 57 --LGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPH--------PLYQ-- 104
Query: 127 DSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSG 186
+ + C+ C ++ C C Y Y DG S+ G V+DV + G
Sbjct: 105 --PSSDLIPCNDPLCKALHLNSNQRCETPEQCDYEVEYADGGSSLGVLVRDVFSMNYTQG 162
Query: 187 DLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAH 246
L+ T L GCG Q +++ LDG++G G+ S++SQL S G V+ + H
Sbjct: 163 -LRLTP---RLALGCGYDQIPG--ASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGH 216
Query: 247 CLDGINGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVG-LDFLNLPTDVFGVGDN 305
CL + GGGI G + + ++ P YS + + G L F T + N
Sbjct: 217 CLSSL-GGGILFFGDDLY-DSSRVSWTPMSREYSKHYSPAMGGELLFGGRTTGL----KN 270
Query: 306 KGTIIDSGTTLAYLPEMVYEP---LVSKIISQQPDLKVHTVHDEYTCFQYS------ESV 356
T+ DSG++ Y Y+ L+ + +S +P + H C+Q E V
Sbjct: 271 LLTVFDSGSSYTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEV 330
Query: 357 DEGFPNVTFHFE----NSVSLKVYPHEYL-FPFEDLWCIGWQNS---GMQSRDRKNMTLL 408
+ F + F+ + ++ P YL + C+G N G+Q N+ L+
Sbjct: 331 KKYFKPLALSFKTGWRSKTLFEIPPEAYLIISMKGNVCLGILNGTEIGLQ-----NLNLI 385
Query: 409 GDF 411
GD
Sbjct: 386 GDI 388
>gi|168008816|ref|XP_001757102.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162691600|gb|EDQ77961.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 406
Score = 111 bits (277), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 112/386 (29%), Positives = 157/386 (40%), Gaps = 42/386 (10%)
Query: 22 VSSNHGVFSVKYRYAGRERSLSLLKEHDARR-QQRILAGVDLPLGGSSRPDGVGLYYAKI 80
V+S HG + R RS + ++ Q +++G+ L G G Y+ +I
Sbjct: 12 VASIHGRINQTVNGLTRSRSRDRQTKVPSQDFQAPVVSGLSL---------GSGEYFIRI 62
Query: 81 GIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEF 140
+GTPP+ Y+ +DTGSDI+W+ C C C +S ++D SST + C
Sbjct: 63 SVGTPPRRMYLVMDTGSDILWLQCAPCVNCYHQSD-----AIFDPYKSSTYSTLGCSTRQ 117
Query: 141 CHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFG 200
C + G C AN C Y YGDGS TTG F D V + SG Q + G
Sbjct: 118 CLNLDIG---TCQAN-KCLYQVDYGDGSFTTGEFGTDDVSLNSTSGVGQVVLNK--IPLG 171
Query: 201 CGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL-----DGINGGG 255
CG G GK S +Q+ G R F++CL D G
Sbjct: 172 CGHDNEGYFVGAAGLLGL-----GKGPLSFPNQVDPQNGGR--FSYCLTDRETDSTEGSS 224
Query: 256 -IFAIGHVVQPEVNKTPLVPNQ---PHYSINMTAVQVGLDFLNLPTDVFGVGD--NKGTI 309
+F V TP N Y + MT + VG L +PT F + N G I
Sbjct: 225 LVFGEAAVPPAGARFTPQDSNMRVPTFYYLKMTGISVGGTILTIPTSAFQLDSLGNGGVI 284
Query: 310 IDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEY-TCFQYSESVDEGFPNVTFHFE 368
IDSGT++ L Y L + DL + TC+ S P VT HF+
Sbjct: 285 IDSGTSVTRLQNAAYASLRDAFRAGTSDLAPTAGFSLFDTCYDLSGLASVDVPTVTLHFQ 344
Query: 369 NSVSLKVYPHEYLFPFE--DLWCIGW 392
LK+ YL P + + +C+ +
Sbjct: 345 GGTDLKLPASNYLIPVDNSNTFCLAF 370
>gi|242072510|ref|XP_002446191.1| hypothetical protein SORBIDRAFT_06g003200 [Sorghum bicolor]
gi|241937374|gb|EES10519.1| hypothetical protein SORBIDRAFT_06g003200 [Sorghum bicolor]
Length = 499
Score = 111 bits (277), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 82/264 (31%), Positives = 128/264 (48%), Gaps = 24/264 (9%)
Query: 75 LYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKEC-PRRSSLGIELTLYDIKDSSTGKF 133
L+YA + +GTP + + V +DTGSD+ W+ C QC C P ++ T Y SST K
Sbjct: 107 LHYALVTVGTPGQTFMVALDTGSDLFWLPC-QCDGCTPPATAASGSATFYIPGMSSTSKA 165
Query: 134 VTCDQEFCHGVYGGPLTDCTANTSCPYLEIY-GDGSSTTGYFVQDVVQYDKVSGDLQTTS 192
V C+ FC +C+ CPY +Y G+S++G+ V+DV+ + Q
Sbjct: 166 VPCNSNFCDLQ-----KECSTALQCPYKMVYVSAGTSSSGFLVEDVLYLSTENAHPQI-- 218
Query: 193 TNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGIN 252
++ GCG Q+G+ + A +G+ G G S+ S LA G F+ C G +
Sbjct: 219 LKAQIMLGCGQTQTGSF--LDAAAPNGLFGLGIDEVSVPSILAQKGLTSNSFSMCF-GRD 275
Query: 253 GGGIFAIGHVVQPEVNKTPLVPNQPH--YSINMTAVQVGLDFLNLPTDVFGVGDNKGTII 310
G G + G + +TPL NQ H Y+I ++ + +G N PTD+ + TI
Sbjct: 276 GIGRISFGDQGSSDQEETPLNINQQHPTYAITISGITIG----NKPTDLDFI-----TIF 326
Query: 311 DSGTTLAYLPEMVYEPLVSKIISQ 334
D+GT+ YL + Y + +Q
Sbjct: 327 DTGTSFTYLADPAYTYITQSFHAQ 350
>gi|255576511|ref|XP_002529147.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223531426|gb|EEF33260.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 479
Score = 110 bits (276), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 104/403 (25%), Positives = 163/403 (40%), Gaps = 63/403 (15%)
Query: 26 HGVFSVKYRYAGRERSLSLLK-EHDARRQQRILAGVDLPLGGSSRPD------------- 71
H SV+ RSL+L + E D+ R + I +DL + G S D
Sbjct: 68 HSRTSVQKTKHPDYRSLTLSRLERDSARVKSINTRLDLAIHGLSTSDLKPLDTDSQFRAE 127
Query: 72 ------------GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIE 119
G G Y++++GIG P Y+ +DTGSD+ W+ C C +C ++
Sbjct: 128 DLQGPIISGTSQGSGEYFSRVGIGKPSSPVYMVLDTGSDVNWIQCAPCADCYHQAD---- 183
Query: 120 LTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVV 179
+++ S++ ++CD + C + +++C NT C Y YGDGS T G FV + +
Sbjct: 184 -PIFEPASSTSYSPLSCDTKQCQSL---DVSECRNNT-CLYEVSYGDGSYTVGDFVTETI 238
Query: 180 QYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGG 239
S D ++ GCG G G S SQ+ +S
Sbjct: 239 TLGSASVD--------NVAIGCGHNNEGLFIGAAGLLGLGGGKL-----SFPSQINASS- 284
Query: 240 VRKMFAHCL--DGINGGGIFAIGHVVQPEVNKTPLVPNQP---HYSINMTAVQVGLDFLN 294
F++CL + + P PL+ N+ Y + MT + VG + L+
Sbjct: 285 ----FSYCLVDRDSDSASTLEFNSALLPHAITAPLLRNRELDTFYYVGMTGLSVGGELLS 340
Query: 295 LPTDVFGVGD--NKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEY-TCFQ 351
+P +F + + N G IIDSGT + L Y L + DL V + + TC+
Sbjct: 341 IPESMFEMDESGNGGIIIDSGTAVTRLQTAAYNALRDAFVKGTKDLPVTSEVALFDTCYD 400
Query: 352 YSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFED--LWCIGW 392
S P VTFH L + YL P + +C +
Sbjct: 401 LSRKTSVEVPTVTFHLAGGKVLPLPATNYLIPVDSDGTFCFAF 443
>gi|357483911|ref|XP_003612242.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355513577|gb|AES95200.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 527
Score = 110 bits (276), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 87/275 (31%), Positives = 131/275 (47%), Gaps = 32/275 (11%)
Query: 75 LYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLG----IELTLYDIKDSST 130
L++A + +GTP Y V +DTGSD+ W+ C C +C L I +YD K+SST
Sbjct: 112 LHFANVSVGTPASSYLVALDTGSDLFWLPC-NCTKCVHGIQLSTGQKIAFNIYDNKESST 170
Query: 131 GKFVTCDQEFCHGVYGGPLTDCTANT--SCPY-LEIYGDGSSTTGYFVQDVVQYDKVSGD 187
K V C+ C T C++++ +CPY +E + +STTG+ V+DV+ D
Sbjct: 171 SKNVACNSSLCE-----QKTQCSSSSGGTCPYQVEYLSENTSTTGFLVEDVLHL-ITDND 224
Query: 188 LQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHC 247
QT N + FGCG Q+G + A +G+ G G S+ S+ S LA G F+ C
Sbjct: 225 DQTQHANPLITFGCGQVQTGAF--LDGAAPNGLFGLGMSDVSVPSILAKQGLTSNSFSMC 282
Query: 248 LDGINGGGIFAIGHVVQPEVNKTP--LVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDN 305
G I + + KTP + P+ Y+I +T + VG + +L +
Sbjct: 283 FAADGLGRITFGDNNSSLDQGKTPFNIRPSHSTYNITVTQIIVGGNSADLEFNA------ 336
Query: 306 KGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKV 340
I D+GT+ YL Y K I+Q D K+
Sbjct: 337 ---IFDTGTSFTYLNNPAY-----KQITQSFDSKI 363
>gi|224060469|ref|XP_002300215.1| predicted protein [Populus trichocarpa]
gi|222847473|gb|EEE85020.1| predicted protein [Populus trichocarpa]
Length = 439
Score = 110 bits (276), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 113/414 (27%), Positives = 189/414 (45%), Gaps = 60/414 (14%)
Query: 29 FSVKYRYAGRERSLSLLKE--HDARR-QQRILAGVDLPLGGSSRPD-------GVGLYYA 78
F V+ ++ ++L+ L+ H +R + R+ + L SS + G G +
Sbjct: 40 FRVRLKHVDSGKNLTKLERIRHGVKRGRNRLQRLQAMALVASSSSEIEAPVLPGNGEFLM 99
Query: 79 KIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQ 138
K+ IGTPP+ Y +DTGSD++W C C +C +S+ ++D K SS+ ++C
Sbjct: 100 KLAIGTPPETYSAILDTGSDLIWTQCKPCTQCFHQST-----PIFDPKKSSSFSKLSCSS 154
Query: 139 EFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLI 198
+ C + P + C N C YL YGD SST G + + + K S ++
Sbjct: 155 QLCEAL---PQSSC--NNGCEYLYSYGDYSSTQGILASETLTFGKASVP--------NVA 201
Query: 199 FGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGG---- 254
FGCGA G+ S G++G G+ S++SQL F++CL ++
Sbjct: 202 FGCGADNEGSGFSQGA----GLVGLGRGPLSLVSQLK-----EPKFSYCLTTVDDTKTST 252
Query: 255 ---GIFAIGHVVQPEVNKTPLVPNQPH---YSINMTAVQVGLDFLNLPTDVFGVGDN--K 306
G A + + TPL+ + H Y +++ + VG L + F + D+
Sbjct: 253 LLMGSLASVNASSSAIKTTPLIHSPAHPSFYYLSLEGISVGDTRLPIKKSTFSLQDDGSG 312
Query: 307 GTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDE--YTCFQY-SESVDEGFPNV 363
G IIDSGTT+ YL E + LV+K + + +L V + CF S S + P +
Sbjct: 313 GLIIDSGTTITYLEESAFN-LVAKEFTAKINLPVDSSGSTGLDVCFTLPSGSTNIEVPKL 371
Query: 364 TFHFENSVSLKVYPHEYLFPFEDL--WCIGW-QNSGMQ---SRDRKNMTLLGDF 411
FHF+ + L++ Y+ + C+ +SGM + ++NM +L D
Sbjct: 372 VFHFDGA-DLELPAENYMIGDSSMGVACLAMGSSSGMSIFGNVQQQNMLVLHDL 424
>gi|224031303|gb|ACN34727.1| unknown [Zea mays]
gi|413923868|gb|AFW63800.1| hypothetical protein ZEAMMB73_012138 [Zea mays]
Length = 557
Score = 110 bits (276), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 100/343 (29%), Positives = 141/343 (41%), Gaps = 48/343 (13%)
Query: 62 LPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNC-IQCKECPRRSSLGIEL 120
LP+ G+ PDG YY I IG PP+ Y++ VDTGSD+ W+ C C C +
Sbjct: 175 LPIKGNVFPDG--QYYTSIFIGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGP------ 226
Query: 121 TLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQ 180
+ + + K V C + G C C Y Y D SS+ G +D
Sbjct: 227 --HPLYKPAKEKIVPPRDLLCQELQGN-QNYCETCKQCDYEIEYADQSSSMGVLARD--- 280
Query: 181 YDKVSGDLQTTSTNG-----SLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLA 235
D+ +TNG +FGC Q G L S+ + DGI+G + S SQLA
Sbjct: 281 ------DMHMIATNGGREKLDFVFGCAYDQQGQLLSSPAKT-DGILGLSSAAISFPSQLA 333
Query: 236 SSGGVRKMFAHCLDGINGGGIFAI---GHVVQPEVNKTPLVPNQPH-YSINMTAVQVGLD 291
S G + +F HC+ GGG + +V + V T + + Y V+ G
Sbjct: 334 SHGIIANVFGHCITREQGGGGYMFLGDDYVPRWGVTWTSIRSGPDNLYHTQAHHVKYGDQ 393
Query: 292 FLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEY--TC 349
L P G I DSG++ YLP +YE LV+ I P V D C
Sbjct: 394 QLRRPEQ---AGSTVQVIFDSGSSYTYLPNEIYENLVAAIKYASPGF-VQDTSDRTLPLC 449
Query: 350 FQ------YSESVDEGFPNVTFHFEN-----SVSLKVYPHEYL 381
++ Y E V + F + HF S + + P +YL
Sbjct: 450 WKADFPVRYLEDVKQFFEPLNLHFGKKWLFMSKTFTISPEDYL 492
>gi|357117138|ref|XP_003560331.1| PREDICTED: aspartic proteinase-like protein 1-like, partial
[Brachypodium distachyon]
Length = 509
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 99/337 (29%), Positives = 152/337 (45%), Gaps = 40/337 (11%)
Query: 43 SLLKEHDARRQQRILAG------VDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTG 96
S L HD R +R+LAG + G S+ L+YAK+ +GTP + V +DTG
Sbjct: 46 SALSAHD--RARRVLAGGKGESLLSFADGNSTTRHAGSLHYAKVALGTPNATFVVALDTG 103
Query: 97 SDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANT 156
SD+ WV C CK C ++ L Y + SST K VTC C P N
Sbjct: 104 SDLFWVPC-DCKRCAPIANTSELLKPYSPRQSSTSKPVTCSHSLCD----RPNACGNGNG 158
Query: 157 SCPYLEIY-GDGSSTTGYFVQDVVQYDKVSGDLQTT-------STNGSLIFGCGARQSGN 208
SCPY Y +S++G V+DV+ + S ++ + ++FGCG Q+G
Sbjct: 159 SCPYTVKYVSANTSSSGVLVEDVLYMTRQSSSSRSGNGGNVGEAVGARVVFGCGQEQTGA 218
Query: 209 LDSTNEEALDGIIGFGKSNSSMISQLASSGGV-RKMFAHCLDGINGGGIFAIGHVVQPEV 267
+ A++G++G G S+ S LA++G V F+ C +G G G
Sbjct: 219 F--LDGAAMEGLLGLGMDRVSVPSLLAAAGLVGSDSFSMCFS-PDGNGRINFGEPSDAGA 275
Query: 268 -NKTPLV--PNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVY 324
N+TP + +P Y+I++TAV V + ++DSGT+ YL + Y
Sbjct: 276 QNETPFIVSKTRPTYNISVTAVNV--------KGKGAMAAEFAAVVDSGTSFTYLNDPAY 327
Query: 325 EPLVSKIISQQPDLKVH---TVHDEYTCFQYSESVDE 358
L + SQ + + + ++ EY C+ S E
Sbjct: 328 SLLATSFNSQVREKRANLSASIPFEY-CYALSRGQTE 363
>gi|242062640|ref|XP_002452609.1| hypothetical protein SORBIDRAFT_04g028990 [Sorghum bicolor]
gi|241932440|gb|EES05585.1| hypothetical protein SORBIDRAFT_04g028990 [Sorghum bicolor]
Length = 557
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 101/354 (28%), Positives = 146/354 (41%), Gaps = 70/354 (19%)
Query: 62 LPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNC-IQCKECPRRSSLGIEL 120
LP+ G+ PDG YY I +G PP+ Y++ VDTGSD+ W+ C C C +
Sbjct: 175 LPIKGNVFPDGQ--YYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGP------ 226
Query: 121 TLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQ 180
+ + + K V C + G C C Y Y D SS+ G +D
Sbjct: 227 --HPLYKPTKEKIVPPRDLLCQELQGN-QNYCETCKQCDYEIEYADQSSSMGVLARD--- 280
Query: 181 YDKVSGDLQTTSTNG-----SLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLA 235
D+ +TNG +FGC Q G L S+ + DGI+G + S+ SQLA
Sbjct: 281 ------DMHLIATNGGREKLDFVFGCAYDQQGQLLSSPAKT-DGILGLSNAAISLPSQLA 333
Query: 236 SSGGVRKMFAHCLDGINGGG--IFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFL 293
S G + +F HC+ GGG +F V P + I T+++ G D L
Sbjct: 334 SHGIISNIFGHCITREQGGGGYMFLGDDYV-------------PRWGITWTSIRSGPDNL 380
Query: 294 NLPTDVFGV-------------GDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKV 340
T+ V G+ I DSG++ YLP+ +YE LV+ I P V
Sbjct: 381 -YHTEAHHVKYGDQQLRMREQAGNTVQVIFDSGSSYTYLPDEIYENLVAAIKYASPGF-V 438
Query: 341 HTVHDEY--TCFQ------YSESVDEGFPNVTFHFEN-----SVSLKVYPHEYL 381
D C++ Y E V + F + HF S + + P +YL
Sbjct: 439 QDSSDRTLPLCWKADFPVRYLEDVKQFFKPLNLHFGKKWLFMSKTFTISPEDYL 492
>gi|297741705|emb|CBI32837.3| unnamed protein product [Vitis vinifera]
Length = 455
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 102/375 (27%), Positives = 173/375 (46%), Gaps = 28/375 (7%)
Query: 30 SVKYRYAGRERSL-SLLKEHD--ARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPP 86
+VK ++ +L S L H RQQ+ L D R + A + IG PP
Sbjct: 59 NVKAESLAKDTALESTLSRHAYLRARQQKALQPADFVPPPLIRDKSA--FLANLSIGNPP 116
Query: 87 KDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYG 146
+ YV +DTGSD+ W+ C C C ++ +Y+ S + + C++ C +
Sbjct: 117 TNVYVVLDTGSDLFWIQCEPCDVCYKQKD-----PIYNRTKSDSYTEMLCNEPPCLSL-- 169
Query: 147 GPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQS 206
G C+ + SC Y Y DGS T+G + V + D T+ G FGCG +
Sbjct: 170 GREGQCSDSGSCLYQTSYADGSRTSGLLSYEKVAFTSHYSDEDKTAQVG---FGCGLQ-- 224
Query: 207 GNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHC---LDGINGGGIFAIGHVV 263
NL+ G++G G S++SQL++ G V K FA+C L N GG G
Sbjct: 225 -NLNFVTSSRDGGVLGLGPGLVSLVSQLSAIGKVSKSFAYCFGNLSNPNAGGFLVFGDAT 283
Query: 264 QPEVNKTPLVPNQPHYSINMTAVQVGLD--FLNLPTDVFGVGDN--KGTIIDSGTTLAYL 319
+ TP+V + +Y +N+ + +G++ L++ + F + G IIDSG+TL+
Sbjct: 284 YLNGDMTPMVIAEFYY-VNLLGIGLGVEEPRLDINSSSFERKPDGSGGVIIDSGSTLSIF 342
Query: 320 PEMVYEPLVSKIISQ-QPDLKVHTVHDEYTCFQYSESVD-EGFPNVTFHFENSVSLKVYP 377
P VYE + + ++ + + + + CF+ D FP + + E++ L
Sbjct: 343 PPEVYEVVRNAVVDKLKKGYNISPLTSSPDCFEGKIGRDLPLFPTLVLYLESTGILNDRW 402
Query: 378 HEYLFPFEDLWCIGW 392
+L +++L+C+G+
Sbjct: 403 SIFLQRYDELFCLGF 417
>gi|413947545|gb|AFW80194.1| hypothetical protein ZEAMMB73_386053 [Zea mays]
Length = 456
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 85/328 (25%), Positives = 141/328 (42%), Gaps = 40/328 (12%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y + +GTPP DTGSD++WVNC S + ++ S+T ++
Sbjct: 100 YLMYVNVGTPPAQMLAIADTGSDLVWVNCSSNGGGGGASDGAV---VFHPSRSTTYSLLS 156
Query: 136 CDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNG 195
C C + C A++ C Y YGDGS T G + + G +
Sbjct: 157 CQSAACQALSQA---SCDADSECQYQYAYGDGSRTIGVLSTETFSFAAAGGGGEGQVRVP 213
Query: 196 SLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL----DGI 251
+ FGC +G+ S DG++G G S++SQL ++ + + F++CL
Sbjct: 214 RVSFGCSTGSAGSFRS------DGLVGLGAGALSLVSQLGAAARIARRFSYCLVPPYAAA 267
Query: 252 NGGGIFAIGH---VVQPEVNKTPLVPNQ--PHYSINMTAVQV-GLDFLNLPTDVFGVGDN 305
N + G V P TPLVP++ +Y++ + +V V G D + ++
Sbjct: 268 NSSSTLSFGARAVVSDPGAASTPLVPSEVDSYYTVALESVAVAGQDVAS--------ANS 319
Query: 306 KGTIIDSGTTLAYLPEMVYEPLVSKI-------ISQQPDLKVHTVHDEYTCFQYSESVDE 358
I+DSGTTL +L + PLV+++ +Q P+ + +D S++ D
Sbjct: 320 SRIIVDSGTTLTFLDPALLRPLVAELERRIRLPRAQPPEQLLQLCYDVQ---GKSQAEDF 376
Query: 359 GFPNVTFHFENSVSLKVYPHEYLFPFED 386
G P+VT F S+ + P E+
Sbjct: 377 GIPDVTLRFGGGASVTLRPENTFSLLEE 404
>gi|116787398|gb|ABK24493.1| unknown [Picea sitchensis]
Length = 479
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 97/335 (28%), Positives = 148/335 (44%), Gaps = 39/335 (11%)
Query: 61 DLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIEL 120
+LPL S+ G G Y G GTP K+ + +DTGSD+ W+ C C +C +
Sbjct: 124 NLPLQPGSKV-GTGNYIVTAGFGTPAKNSLLIIDTGSDVTWIQCKPCSDCYSQVD----- 177
Query: 121 TLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQ 180
+++ + SS+ K ++C C + + C C Y YGDGS + G F Q+ +
Sbjct: 178 PIFEPQQSSSYKHLSCLSSACTELT--TMNHCRLG-GCVYEINYGDGSRSQGDFSQETLT 234
Query: 181 YDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGV 240
S S FGCG +G + G++G G++ S SQ S G
Sbjct: 235 LGSDSFP--------SFAFGCGHTNTGLFKGS-----AGLLGLGRTALSFPSQTKSKYG- 280
Query: 241 RKMFAHCLDGI---NGGGIFAIGHVVQPEVNK-TPLVPNQPH---YSINMTAVQVGLDFL 293
F++CL G F++G P PLV N + Y + + + VG + L
Sbjct: 281 -GQFSYCLPDFVSSTSTGSFSVGQGSIPATATFVPLVSNSNYPSFYFVGLNGISVGGERL 339
Query: 294 NLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQ---PDLKVHTVHDEYTCF 350
++P V G G GTI+DSGT + L Y+ L + S+ P K ++ D TC+
Sbjct: 340 SIPPAVLGRG---GTIVDSGTVITRLVPQAYDALKTSFRSKTRNLPSAKPFSILD--TCY 394
Query: 351 QYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFE 385
S P +TFHF+N+ + V LF +
Sbjct: 395 DLSSYSQVRIPTITFHFQNNADVAVSAVGILFTIQ 429
>gi|297819684|ref|XP_002877725.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297323563|gb|EFH53984.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 633
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 99/353 (28%), Positives = 168/353 (47%), Gaps = 49/353 (13%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKF 133
G Y ++ IGTPP+ + + VD+GS + +V C C++C + + L SST +
Sbjct: 92 GYYTTRLWIGTPPQMFALIVDSGSTVTYVPCSDCEQCGKHQDPKFQPEL-----SSTYQP 146
Query: 134 VTCDQEFCHGVYGGPLTDCTAN---TSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQT 190
V C+ DC + C Y Y + SS+ G +D++ + ++
Sbjct: 147 VKCNM------------DCNCDDDKEQCVYEREYAEHSSSKGVLGEDLISFGN-----ES 189
Query: 191 TSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDG 250
T +FGC ++G+L S + DGIIG G+ + S++ QL G + F C G
Sbjct: 190 QLTPQRAVFGCETVETGDLYS---QRADGIIGLGQGDLSLVDQLVDKGLISNSFGLCYGG 246
Query: 251 IN-GGGIFAIGHVVQP-EVNKTPLVPNQ-PHYSINMTAVQVGLDFLNLPTDVFGVGDNKG 307
++ GGG +G P ++ T P++ P+Y+I++T ++V L+L + VF G
Sbjct: 247 MDVGGGSMILGGFDYPSDMIFTDSDPDRSPYYNIDLTGIRVAGKKLSLNSRVF--DGEHG 304
Query: 308 TIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHD---EYTCFQYSESVD-----EG 359
++DSGTT AYLP+ + ++ + LK D + TCF + S D +
Sbjct: 305 AVLDSGTTYAYLPDAAFAAFEEAVMREVSPLKQIDGPDPNFKDTCFLVAASNDVSELSKI 364
Query: 360 FPNVTFHFENSVSLKVYPHEYLFPFEDL---WCIGWQNSGMQSRDRKNMTLLG 409
FP+V F++ S + P Y+F + +C+G +G + + TLLG
Sbjct: 365 FPSVEMIFKSGQSWLLSPENYMFRHSKVHGAYCLGVFPNG-----KDHTTLLG 412
>gi|147858841|emb|CAN78694.1| hypothetical protein VITISV_037475 [Vitis vinifera]
Length = 442
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 101/375 (26%), Positives = 172/375 (45%), Gaps = 28/375 (7%)
Query: 30 SVKYRYAGRERSL-SLLKEHD--ARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPP 86
+VK ++ +L S L H RQQ+ L D R + A + IG PP
Sbjct: 46 NVKAESLAKDTALESTLSRHAYLRARQQKALQPADFVPPPLIRDKSA--FLANLSIGNPP 103
Query: 87 KDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYG 146
+ YV +DTGSD+ W+ C C C ++ +Y+ S + + C++ C V
Sbjct: 104 TNVYVVLDTGSDLFWIQCEPCDVCYKQKD-----PIYNRTKSDSYTEMLCNEPPC--VSL 156
Query: 147 GPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQS 206
G C+ + SC Y Y DG+ T+G + V + D T+ G FGCG Q+
Sbjct: 157 GREGQCSDSGSCLYQTAYADGARTSGLLSYEKVAFTSHYSDEDKTAQVG---FGCGL-QN 212
Query: 207 GNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI---NGGGIFAIGHVV 263
N ++N + + G S++SQL++ G V K FA+C I N GG G
Sbjct: 213 LNFITSNRDGGVLGL--GPGLVSLVSQLSAIGKVSKSFAYCFGNISNPNAGGFLVFGDAT 270
Query: 264 QPEVNKTPLVPNQPHYSINMTAVQVGL--DFLNLPTDVFGVGDN--KGTIIDSGTTLAYL 319
+ TP+V + +Y +N+ + +G+ L++ + F + G IIDSG+TL+
Sbjct: 271 YLNGDMTPMVIAEFYY-VNLLGIGLGVGEPRLDINSSSFERKPDGSGGVIIDSGSTLSVF 329
Query: 320 PEMVYEPLVSKIISQ-QPDLKVHTVHDEYTCFQYSESVD-EGFPNVTFHFENSVSLKVYP 377
P VYE + + ++ + + + + CF+ D FP + + E++ L
Sbjct: 330 PPEVYEVVRNAVVDKLKKGYNISPLTSSPDCFEGKIERDLPLFPTLVLYLESTGILNDRW 389
Query: 378 HEYLFPFEDLWCIGW 392
+L +++L+C+G+
Sbjct: 390 SIFLQRYDELFCLGF 404
>gi|222630453|gb|EEE62585.1| hypothetical protein OsJ_17388 [Oryza sativa Japonica Group]
Length = 275
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 55/136 (40%), Positives = 78/136 (57%), Gaps = 1/136 (0%)
Query: 221 IGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHVVQPEVNKTPLVPNQPHYS 280
+G G SN+S++ QLA S +KMFAHCLDG GGIF +GH+V P+V KTPL Y
Sbjct: 1 MGLGPSNTSLVYQLAKSQKWKKMFAHCLDGKRSGGIFVLGHIVGPKVRKTPLDQTSSRYR 60
Query: 281 INMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKV 340
+ + VG L+L + TI+++G+ ++YLPE VY+ + I S D+ V
Sbjct: 61 TTLLEITVGETSLSLSAGNVEIKSQNMTILETGSLISYLPEKVYQSFLDSIFSDLEDISV 120
Query: 341 HTVHDEYTCFQYSESV 356
+ Y+CF Y SV
Sbjct: 121 INI-GGYSCFHYERSV 135
>gi|413923784|gb|AFW63716.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 531
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 90/303 (29%), Positives = 140/303 (46%), Gaps = 37/303 (12%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y +G+G+P + +DTGSD+ WV C C +C ++ L+D SST +
Sbjct: 198 YLITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQAD-----PLFDPSSSSTYSPFS 252
Query: 136 CDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNG 195
C C + G C++++ C Y+ YGDGSSTTG + D + +S
Sbjct: 253 CGSADCAQL-GQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLALG--------SSAVR 303
Query: 196 SLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI-NGG 254
S FGC +SG D T DG++G G S++SQ A G + + F++CL +
Sbjct: 304 SFQFGCSNVESGFNDQT-----DGLMGLGGGAQSLVSQTA--GTLGRAFSYCLPPTPSSS 356
Query: 255 GIFAIGHVVQPEVN---KTPLVPNQ---PHYSINMTAVQVGLDFLNLPTDVFGVGDNKGT 308
G +G + KTP++ + Y + + A++VG L++P VF + GT
Sbjct: 357 GFLTLGAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVF----SAGT 412
Query: 309 IIDSGTTLAYLPEMVYEPLVSKI---ISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTF 365
++DSGT + LP Y L S + Q P + + D TCF +S P+V
Sbjct: 413 VMDSGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSGILD--TCFDFSGQSSVSIPSVAL 470
Query: 366 HFE 368
F
Sbjct: 471 VFS 473
>gi|326503488|dbj|BAJ86250.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 551
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 97/337 (28%), Positives = 140/337 (41%), Gaps = 42/337 (12%)
Query: 60 VDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNC-IQCKECPRRSSLGI 118
V LP+ G+ PDG YY I +G PP+ Y++ VDTGSD+ W+ C C C +
Sbjct: 177 VLLPIKGNVFPDG--QYYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGP---- 230
Query: 119 ELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDV 178
+ + + K V C + G C C Y Y D SS+ G +D
Sbjct: 231 ----HPLYKPAKEKIVPPRDSLCQELQGD-QNYCETCKQCDYEIEYADRSSSMGVLAKD- 284
Query: 179 VQYDKVSGDLQTTSTNG-----SLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQ 233
D+ +TNG +FGC Q G L S+ + DGI+G + S+ SQ
Sbjct: 285 --------DMHLIATNGGREKLDFVFGCAYDQQGQLLSSPAKT-DGILGLSSAAISLPSQ 335
Query: 234 LASSGGVRKMFAHCL-DGINGGGIFAIGHVVQPEVNKT-PLVPNQPHYSINMTAVQVGLD 291
LAS G + +F HC+ NGGG +G P T + P + A +V
Sbjct: 336 LASKGIISNVFGHCITRETNGGGYMFLGDDYVPRWGMTWAPIRGGPDNLYHTEAQKV--- 392
Query: 292 FLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEY--TC 349
N G++ I DSG++ YLPE +Y+ L+ I P V D C
Sbjct: 393 --NYGDQELHAGNSVQVIFDSGSSYTYLPEEMYKNLIDAIKEDSPSF-VQDSSDTTLPLC 449
Query: 350 FQYSESVDEGFPNVTFHFENS-----VSLKVYPHEYL 381
++ SV F + HF + + P +YL
Sbjct: 450 WKADFSVRSFFKPLNLHFGRRWFVVPKTFTIVPDDYL 486
>gi|223950123|gb|ACN29145.1| unknown [Zea mays]
gi|413923785|gb|AFW63717.1| hypothetical protein ZEAMMB73_445506 [Zea mays]
Length = 385
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 90/303 (29%), Positives = 140/303 (46%), Gaps = 37/303 (12%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y +G+G+P + +DTGSD+ WV C C +C ++ L+D SST +
Sbjct: 52 YLITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQAD-----PLFDPSSSSTYSPFS 106
Query: 136 CDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNG 195
C C + G C++++ C Y+ YGDGSSTTG + D + +S
Sbjct: 107 CGSADCAQL-GQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLALG--------SSAVR 157
Query: 196 SLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI-NGG 254
S FGC +SG D T DG++G G S++SQ A G + + F++CL +
Sbjct: 158 SFQFGCSNVESGFNDQT-----DGLMGLGGGAQSLVSQTA--GTLGRAFSYCLPPTPSSS 210
Query: 255 GIFAIGHVVQPEVN---KTPLVPNQ---PHYSINMTAVQVGLDFLNLPTDVFGVGDNKGT 308
G +G + KTP++ + Y + + A++VG L++P VF + GT
Sbjct: 211 GFLTLGAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVF----SAGT 266
Query: 309 IIDSGTTLAYLPEMVYEPLVSKI---ISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTF 365
++DSGT + LP Y L S + Q P + + D TCF +S P+V
Sbjct: 267 VMDSGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSGILD--TCFDFSGQSSVSIPSVAL 324
Query: 366 HFE 368
F
Sbjct: 325 VFS 327
>gi|79495937|ref|NP_567922.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332660833|gb|AEE86233.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 401
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 105/389 (26%), Positives = 167/389 (42%), Gaps = 57/389 (14%)
Query: 11 IVLIATAAVGGVSSNHGVFSVKYRYAGRERSLSLLKEHDARRQQRILAGVDLPLGGSSRP 70
IVL+ + V G SS +V +R+ R + + R R ++ V P+ G+ P
Sbjct: 7 IVLMVMSLVLGFSS-----AVDFRW----RKTAGFSD----RFTRAVSSVVFPVHGNVYP 53
Query: 71 DGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNC----IQCKECPRRSSLGIELTLYDIK 126
+G Y I IG PP+ YY+ +DTGSD+ W+ C ++C E P LY
Sbjct: 54 --LGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPH--------PLYQ-- 101
Query: 127 DSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSG 186
+ + C+ C ++ C C Y Y DG S+ G V+DV + G
Sbjct: 102 --PSSDLIPCNDPLCKALHLNSNQRCETPEQCDYEVEYADGGSSLGVLVRDVFSMNYTQG 159
Query: 187 DLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAH 246
L+ T L GCG Q +++ LDG++G G+ S++SQL S G V+ + H
Sbjct: 160 -LRLTP---RLALGCGYDQIPG--ASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGH 213
Query: 247 CLDGINGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVG-LDFLNLPTDVFGVGDN 305
CL + GGGI G + + ++ P YS + + G L F T + N
Sbjct: 214 CLSSL-GGGILFFGDDLY-DSSRVSWTPMSREYSKHYSPAMGGELLFGGRTTGL----KN 267
Query: 306 KGTIIDSGTTLAYLPEMVYEP---LVSKIISQQPDLKVHTVHDEYTCFQYS------ESV 356
T+ DSG++ Y Y+ L+ + +S +P + H C+Q E V
Sbjct: 268 LLTVFDSGSSYTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEV 327
Query: 357 DEGFPNVTFHFE----NSVSLKVYPHEYL 381
+ F + F+ + ++ P YL
Sbjct: 328 KKYFKPLALSFKTGWRSKTLFEIPPEAYL 356
>gi|293329689|dbj|BAJ04354.1| pollen allergen CPA63 [Cryptomeria japonica]
Length = 472
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 89/338 (26%), Positives = 153/338 (45%), Gaps = 36/338 (10%)
Query: 60 VDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIE 119
D+PL S + Y K+G GTPP+ +Y +DTGS+I W+ C C C +
Sbjct: 109 ADIPLA-SGQAISSSNYIIKLGFGTPPQSFYTVLDTGSNIAWIPCNPCSGCSSKQQ---- 163
Query: 120 LTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVV 179
++ SST ++TC + C + +D + N C + YGD S V +++
Sbjct: 164 --PFEPSKSSTYNYLTCASQQCQLLRVCTKSDNSVN--CSLTQRYGDQSE-----VDEIL 214
Query: 180 QYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGG 239
+ +S Q + +FGC G + T ++GFG++ S +SQ A+
Sbjct: 215 SSETLSVGSQQVE---NFVFGCSNAARGLIQRT-----PSLVGFGRNPLSFVSQTATL-- 264
Query: 240 VRKMFAHCL-----DGINGGGIFAIGHVVQPEVNKTPLVPNQPH---YSINMTAVQVGLD 291
F++CL G + + + TPL+ N + Y + + + VG +
Sbjct: 265 YDSTFSYCLPSLFSSAFTGSLLLGKEALSAQGLKFTPLLSNSRYPSFYYVGLNGISVGEE 324
Query: 292 FLNLPTDVFGVGDN--KGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEY-T 348
+++P + ++ +GTIIDSGT + L E Y + SQ +L + + D + T
Sbjct: 325 LVSIPAGTLSLDESTGRGTIIDSGTVITRLVEPAYNAMRDSFRSQLSNLTMASPTDLFDT 384
Query: 349 CFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFED 386
C+ S D FP +T HF++++ L + L+P D
Sbjct: 385 CYN-RPSGDVEFPLITLHFDDNLDLTLPLDNILYPGND 421
>gi|357137788|ref|XP_003570481.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 455
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 95/329 (28%), Positives = 147/329 (44%), Gaps = 38/329 (11%)
Query: 57 LAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCK-ECPRRSS 115
LA V L G S GVG Y ++G+GTP K Y + VDTGS + W+ C C+ C R+S
Sbjct: 101 LASVPLTPGTSV---GVGNYVTRMGLGTPAKPYIMVVDTGSSLTWLQCSPCRVSCHRQSG 157
Query: 116 LGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTD--CTANTSCPYLEIYGDGSSTTGY 173
++D K SS+ V+C C G+ L C+ + C Y YGD S + GY
Sbjct: 158 -----PVFDPKTSSSYAAVSCSSPQCDGLSTATLNPAVCSPSNVCIYQASYGDSSFSVGY 212
Query: 174 FVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQ 233
+D V + S + +GCG G + G++G ++ S++ Q
Sbjct: 213 LSKDTVSFGANSVP--------NFYYGCGQDNEGLFGRS-----AGLMGLARNKLSLLYQ 259
Query: 234 LASSGGVRKMFAHCLDGINGGGIFAIGHVVQPEVNKTPLVPNQ---PHYSINMTAVQVGL 290
LA + G F++CL + G +IG + TP+V N Y I+++ + V
Sbjct: 260 LAPTLGYS--FSYCLPSTSSSGYLSIGSYNPGGYSYTPMVSNTLDDSLYFISLSGMTVAG 317
Query: 291 DFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQ----PDLKVHTVHDE 346
L + + + + TIIDSGT + LP VY L + + +++ D
Sbjct: 318 KPLAVSSSEY---TSLPTIIDSGTVITRLPTSVYTALSKAVAAAMKGSTKRAAAYSILD- 373
Query: 347 YTCFQYSESVDEGFPNVTFHFENSVSLKV 375
TCF+ S P V+ F +LK+
Sbjct: 374 -TCFEGQASKLRAVPAVSMAFSGGATLKL 401
>gi|357143901|ref|XP_003573095.1| PREDICTED: aspartic proteinase-like protein 1-like [Brachypodium
distachyon]
Length = 627
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 108/375 (28%), Positives = 164/375 (43%), Gaps = 43/375 (11%)
Query: 45 LKEHDARRQQRILAGVDLPLGGSSRPDGV--------GLYYAKIGIGTPPKDYYVQVDTG 96
L D +RQ+R L G L S+ G+ LYY + +GTP + V +DTG
Sbjct: 169 LVRSDLQRQKRRLGGGKHQLLSFSKDGGIIPTGNDFGWLYYTWVDVGTPNTSFMVALDTG 228
Query: 97 SDIMWVNCIQCKECPRRS----SLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDC 152
SD+ W+ C C EC S SL +L +Y +S+T + + C E C + G +DC
Sbjct: 229 SDLFWIPC-DCIECAPLSGYHGSLDRDLGIYKPAESTTSRHLPCSHELC--LLG---SDC 282
Query: 153 T-ANTSCPYLEIY-GDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGN-L 209
T CPY Y + ++++G V+D++ D S+I GCG +QSG+ L
Sbjct: 283 TNQKQPCPYNTKYLQENTTSSGLLVEDILHLDSRESH---APVKASVIIGCGRKQSGSYL 339
Query: 210 DSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFA-IGHVVQPEVN 268
D A DG++G G ++ S+ S LA +G VR F+ C +G F G Q
Sbjct: 340 DGI---APDGLLGLGMADISVPSFLARAGLVRNSFSMCFTKDSGRIFFGDQGVSTQQSTP 396
Query: 269 KTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLV 328
PL Y++N+ VG + I+DSGT+ LP +Y+ +
Sbjct: 397 FVPLYGKLQTYTVNVDKSCVGHKCFE--------STSFQAIVDSGTSFTALPLDIYKAVA 448
Query: 329 SKIISQQPDLKVHTVHDEYTCFQY----SESVDEGFPNVTFHFENSVSLKVYPHEYLFPF 384
+ Q + + E T F Y S V P VT F + S + +L
Sbjct: 449 IEFDKQ---VNASRLPQEATSFDYCYSASPLVMPDVPTVTLTFAGNKSFQPVNPTFLLHD 505
Query: 385 EDLWCIGWQNSGMQS 399
E+ G+ + +QS
Sbjct: 506 EEGAVAGFCLAVVQS 520
>gi|194700652|gb|ACF84410.1| unknown [Zea mays]
gi|414587775|tpg|DAA38346.1| TPA: hypothetical protein ZEAMMB73_272638 [Zea mays]
Length = 500
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 82/264 (31%), Positives = 128/264 (48%), Gaps = 24/264 (9%)
Query: 75 LYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKEC-PRRSSLGIELTLYDIKDSSTGKF 133
L+YA + +GTP + + V +DTGSD+ W+ C QC C P ++ T Y SST K
Sbjct: 108 LHYALVTVGTPGQTFMVALDTGSDLFWLPC-QCDGCTPPATAASGSATFYIPGMSSTSKA 166
Query: 134 VTCDQEFCHGVYGGPLTDCTANTSCPYLEIY-GDGSSTTGYFVQDVVQYDKVSGDLQTTS 192
V C+ FC +C+ CPY +Y G+S++G+ V+DV+ + Q
Sbjct: 167 VPCNSNFCDL-----QKECSTALQCPYKMVYVSAGTSSSGFLVEDVLYLSTENAHPQI-- 219
Query: 193 TNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGIN 252
++ GCG Q+G+ + A +G+ G G S+ S LA G F+ C G +
Sbjct: 220 LKAQIMLGCGQTQTGSF--LDAAAPNGLFGLGIDEVSVPSILAQKGLTSNSFSMCF-GRD 276
Query: 253 GGGIFAIGHVVQPEVNKTPLVPNQPH--YSINMTAVQVGLDFLNLPTDVFGVGDNKGTII 310
G G + G + +TPL N+ H Y+I ++ + VG N PTD+ + TI
Sbjct: 277 GIGRISFGDQESSDQEETPLDINRQHPTYAITISGITVG----NKPTDMDFI-----TIF 327
Query: 311 DSGTTLAYLPEMVYEPLVSKIISQ 334
D+GT+ YL + Y + +Q
Sbjct: 328 DTGTSFTYLADPAYTYITQSFHAQ 351
>gi|148907930|gb|ABR17085.1| unknown [Picea sitchensis]
Length = 498
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 102/375 (27%), Positives = 169/375 (45%), Gaps = 52/375 (13%)
Query: 39 ERSLSLLKEHDARRQQRILAGVDLPLGG---SSRPDGVGLYYAKIGIGTPPKDYYVQVDT 95
ER+L+L K+ R + +A VD GG S G G Y+ +IG+GTP ++ Y+ +DT
Sbjct: 119 ERTLTLNKDPVNRYEN--VAEVDADFGGEVVSGMEQGSGEYFTRIGVGTPTREQYMVLDT 176
Query: 96 GSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTAN 155
GSD+ W+ C C+EC ++ +++ S++ V CD C + DC +
Sbjct: 177 GSDVAWIQCEPCRECYSQAD-----PIFNPSYSASFSTVGCDSAVCSQLDA---YDCHSG 228
Query: 156 TSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEE 215
C Y YGDGS +TG F + + + T++ ++ GCG + G
Sbjct: 229 -GCLYEASYGDGSYSTGSFATETLTFG--------TTSVANVAIGCGHKNVGLFIGAAGL 279
Query: 216 ALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL---DGINGGGI------FAIGHVVQPE 266
G S +Q+ + G F++CL + + G + +G + P
Sbjct: 280 LGL-----GAGALSFPNQIGTQTG--HTFSYCLVDRESDSSGPLQFGPKSVPVGSIFTP- 331
Query: 267 VNKTPLVPNQPHYSINMTAVQVGLDFLN-LPTDVFGVGDNKGT---IIDSGTTLAYLPEM 322
+ K P +P Y +++TA+ VG L+ +P +VF + + G IIDSGT + L
Sbjct: 332 LEKNPHLPT--FYYLSVTAISVGGALLDSIPPEVFRIDETSGHGGFIIDSGTVVTRLVTS 389
Query: 323 VYEPLVSKIIS---QQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHE 379
Y+ + ++ Q P ++ D TC+ S P V FHF N SL +
Sbjct: 390 AYDAVRDAFVAGTGQLPRTDAVSIFD--TCYDLSGLQFVSVPTVGFHFSNGASLILPAKN 447
Query: 380 YLFPFEDL--WCIGW 392
YL P + + +C +
Sbjct: 448 YLIPMDTVGTFCFAF 462
>gi|225216914|gb|ACN85210.1| aspartic proteinase nepenthesin-1 precursor [Oryza glaberrima]
Length = 516
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 100/318 (31%), Positives = 134/318 (42%), Gaps = 41/318 (12%)
Query: 69 RPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQC-KECPRRSSLGIELTLYDIKD 127
R G G Y +G+GTP Y V DTGSD WV C C C + L+D
Sbjct: 173 RALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQRE-----KLFDPAS 227
Query: 128 SSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVV---QYDKV 184
SST V+C C + ++ C+ C Y YGDGS + G+F D + YD V
Sbjct: 228 SSTYANVSCAAPACSDL---DVSGCSGG-HCLYGVQYGDGSYSIGFFAMDTLTLSSYDAV 283
Query: 185 SGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSM-ISQLASSGGVRKM 243
G FGCG R G E A G++G G+ +S+ + GGV
Sbjct: 284 KG----------FRFGCGERNDGLF---GEAA--GLLGLGRGKTSLPVQTYGKYGGV--- 325
Query: 244 FAHCLDGIN-GGGIFAIGHVVQPEVNKTPLVP-NQP-HYSINMTAVQVGLDFLNLPTDVF 300
FAHCL + G G G P TP++ N P Y + MT ++VG L + VF
Sbjct: 326 FAHCLPPRSTGTGYLDFGAGSPPATTTTPMLTGNGPTFYYVGMTGIRVGGRLLPIAPSVF 385
Query: 301 GVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPD---LKVHTVHDEYTCFQYSESVD 357
GTI+DSGT + LP Y L S + K V TC+ ++
Sbjct: 386 AA---AGTIVDSGTVITRLPPAAYSSLRSAFAAAMAARGYRKAAAVSLLDTCYDFTGMSQ 442
Query: 358 EGFPNVTFHFENSVSLKV 375
P V+ F+ +L V
Sbjct: 443 VAIPTVSLLFQGGAALDV 460
>gi|357137832|ref|XP_003570503.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 564
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 100/345 (28%), Positives = 143/345 (41%), Gaps = 48/345 (13%)
Query: 60 VDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNC-IQCKECPRRSSLGI 118
V LP+ G+ PDG YY I +G PP+ Y++ VDTGSD+ W+ C C C +
Sbjct: 180 VLLPIKGNVFPDGQ--YYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGP---- 233
Query: 119 ELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDV 178
+ + + K V C + G C C Y Y D SS+ G +D
Sbjct: 234 ----HPLYKPAKEKIVPPRDLLCQELQGD-QNYCATCKQCDYEIEYADRSSSMGVLAKD- 287
Query: 179 VQYDKVSGDLQTTSTNG-----SLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQ 233
D+ +TNG +FGC Q G L T+ DGI+G + S+ SQ
Sbjct: 288 --------DMHMIATNGGREKLDFVFGCAYDQQGQL-LTSPAKTDGILGLSSAAISLPSQ 338
Query: 234 LASSGGVRKMFAHCL-DGINGGGIFAIGHVVQPEVNKT--PLV--PNQPHYSINMTAVQV 288
LAS G + +F HC+ NGGG +G P T P+ P+ Y V
Sbjct: 339 LASQGIISNVFGHCITKEPNGGGYMFLGDDYVPRWGMTWAPIRGGPDN-LYHTEAQKVNY 397
Query: 289 GLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYT 348
G L + G + I DSG++ YLP+ +Y+ LV+ I P T
Sbjct: 398 GDQQLRMHGQ---AGSSIQVIFDSGSSYTYLPDEIYKKLVTAIKYDYPSFVQDTSDTTLP 454
Query: 349 -CFQ------YSESVDEGFPNVTFHFENSV-----SLKVYPHEYL 381
C++ Y E V + F + HF N + + P +YL
Sbjct: 455 LCWKADFDVRYLEDVKQFFKPLNLHFGNRWFVIPRTFTILPDDYL 499
>gi|225437854|ref|XP_002264056.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
Length = 436
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 105/359 (29%), Positives = 159/359 (44%), Gaps = 49/359 (13%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G + + IGTP + Y +DTGSD++W C CK C + + ++D + SS+
Sbjct: 93 GNGEFLMNLAIGTPAETYSAIMDTGSDLIWTQCKPCKVCFDQPT-----PIFDPEKSSSF 147
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
+ C + C + P++ C+ C Y YGD SST G + + GD +
Sbjct: 148 SKLPCSSDLCVAL---PISSCSDG--CEYRYSYGDHSSTQGVLATETFTF----GDASVS 198
Query: 192 STNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI 251
FGCG G S G++G G+ S+ISQL GV K F++CL I
Sbjct: 199 KIG----FGCGEDNRGRAYSQGA----GLVGLGRGPLSLISQL----GVPK-FSYCLTSI 245
Query: 252 N---GGGIFAIG-HVVQPEVNKTPLV--PNQPH-YSINMTAVQVGLDFLNLPTDVFGVGD 304
+ G +G TPL+ P++P Y +++ + VG L + F + D
Sbjct: 246 DDSKGISTLLVGSEATVKSAIPTPLIQNPSRPSFYYLSLEGISVGDTLLPIEKSTFSIQD 305
Query: 305 N--KGTIIDSGTTLAYLPEMVYEPLVSKIISQQP-DLKVHTVHDEYTCFQYS---ESVDE 358
+ G IIDSGTT+ YL + + L + ISQ D+ + CF VD
Sbjct: 306 DGSGGLIIDSGTTITYLKDSAFAALKKEFISQMKLDVDASGSTELELCFTLPPDGSPVD- 364
Query: 359 GFPNVTFHFENSVSLKVYPHEYLFPFEDLWCIGW---QNSGMQ---SRDRKNMTLLGDF 411
P + FHFE V LK+ Y+ L I +SGM + ++N+ +L D
Sbjct: 365 -VPQLVFHFEG-VDLKLPKENYIIEDSALRVICLTMGSSSGMSIFGNFQQQNIVVLHDL 421
>gi|125543638|gb|EAY89777.1| hypothetical protein OsI_11319 [Oryza sativa Indica Group]
Length = 390
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 107/369 (28%), Positives = 157/369 (42%), Gaps = 61/369 (16%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y + IGTPP+ + +DTGSD++W C C C L +D SST +
Sbjct: 35 YLVHLAIGTPPQPVQLTLDTGSDLIWTQCKPCVSC-----FDQPLPYFDTSRSSTNALLP 89
Query: 136 CDQEFCHGVYGGPLTDCT----ANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
C+ C +T C +C Y YGD S T G D ++ V+G T
Sbjct: 90 CESTQCK--LDPTVTVCVKLNQTVQTCAYYTSYGDNSVTIGLLAAD--KFTFVAG----T 141
Query: 192 STNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI 251
S G + FGCG +G +S NE GI GFG+ S+ SQL F+HC I
Sbjct: 142 SLPG-VTFGCGLNNTGVFNS-NET---GIAGFGRGPLSLPSQLKVGN-----FSHCFTTI 191
Query: 252 NG-----------GGIFAIGHVVQPEVNKTPLV------PNQPHYSINMTAVQVGLDFLN 294
G +F+ G Q V TPL+ N Y +++ + VG L
Sbjct: 192 TGAIPSTVLLDLPADLFSNG---QGAVQTTPLIQYAKNEANPTLYYLSLKGITVGSTRLP 248
Query: 295 LPTDVFGVGDNK-GTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKV--HTVHDEYTCFQ 351
+P F + + GTIIDSGT++ LP VY+ +V + Q L V YTCF
Sbjct: 249 VPESAFALTNGTGGTIIDSGTSITSLPPQVYQ-VVRDEFAAQIKLPVVPGNATGHYTCFS 307
Query: 352 YSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFED-----LWCI----GWQNSGMQSRDR 402
P + HFE + ++ + Y+F D + C+ G + + + + +
Sbjct: 308 APSQAKPDVPKLVLHFEGA-TMDLPRENYVFEVPDDAGNSIICLAINKGDETTIIGNFQQ 366
Query: 403 KNMTLLGDF 411
+NM +L D
Sbjct: 367 QNMHVLYDL 375
>gi|326523463|dbj|BAJ92902.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 633
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 55/100 (55%), Positives = 69/100 (69%), Gaps = 4/100 (4%)
Query: 27 GVFSVKYRYA---GRERSLSLLKEHDARRQQRILAG-VDLPLGGSSRPDGVGLYYAKIGI 82
GVF V+ ++ G + L+ L+ HDARR R LA VDLPLGG++ P GLY+ +IGI
Sbjct: 85 GVFEVRRKFPCHDGSGKHLANLRAHDARRHGRSLAAAVDLPLGGNALPYETGLYFTQIGI 144
Query: 83 GTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTL 122
GTP K YYVQVDT SDI WVNC+ C CPR+S LG+ +L
Sbjct: 145 GTPAKSYYVQVDTSSDIFWVNCVFCDTCPRKSGLGVLPSL 184
>gi|242092368|ref|XP_002436674.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
gi|241914897|gb|EER88041.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
Length = 461
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 87/280 (31%), Positives = 124/280 (44%), Gaps = 38/280 (13%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y + +GTPP+ + +DTGSD++W C C++C + L L D SST +
Sbjct: 92 YLVHLAVGTPPRPVALTLDTGSDLVWTQCAPCRDCFHQG-----LPLLDPAASSTYAALP 146
Query: 136 CDQEFCHGVYGGPLTDC---------TANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSG 186
C C + P T C N SC Y+ YGD S T G D + +G
Sbjct: 147 CGAPRCRAL---PFTSCGGGGRSSWGNGNRSCAYIYHYGDKSVTVGEIATDRFTFGGDNG 203
Query: 187 DLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQL---ASSGGVRKM 243
D + L FGCG G S NE GI GFG+ S+ SQL S M
Sbjct: 204 DGDSRLPTRRLTFGCGHFNKGVFQS-NET---GIAGFGRGRWSLPSQLNVTTFSYCFTSM 259
Query: 244 FAHCLDGINGGG------IFAIGHVVQPEVNKTPLV--PNQPH-YSINMTAVQVGLDFLN 294
F + GG +++ + EV TPL+ P+QP Y +++ + VG L
Sbjct: 260 FESKSSLVTLGGAPAAALLYSHAAHISGEVRTTPLLKNPSQPSLYFLSLKGISVGKTRLA 319
Query: 295 LPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQ 334
+P + TIIDSG ++ LPE VYE + ++ +Q
Sbjct: 320 VPEAKL-----RSTIIDSGASITTLPEAVYEAVKAEFAAQ 354
>gi|194707866|gb|ACF88017.1| unknown [Zea mays]
Length = 461
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 90/303 (29%), Positives = 140/303 (46%), Gaps = 37/303 (12%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y +G+G+P + +DTGSD+ WV C C +C ++ L+D SST +
Sbjct: 128 YLITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQAD-----PLFDPSSSSTYSPFS 182
Query: 136 CDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNG 195
C C + G C++++ C Y+ YGDGSSTTG + D + +S
Sbjct: 183 CGSADCAQL-GQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLALG--------SSAVR 233
Query: 196 SLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI-NGG 254
S FGC +SG D T DG++G G S++SQ A G + + F++CL +
Sbjct: 234 SFQFGCSNVESGFNDQT-----DGLMGLGGGAQSLVSQTA--GTLGRAFSYCLPPTPSSS 286
Query: 255 GIFAIGHVVQPEVN---KTPLVPNQ---PHYSINMTAVQVGLDFLNLPTDVFGVGDNKGT 308
G +G + KTP++ + Y + + A++VG L++P VF + GT
Sbjct: 287 GFLTLGAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVF----SAGT 342
Query: 309 IIDSGTTLAYLPEMVYEPLVSKI---ISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTF 365
++DSGT + LP Y L S + Q P + + D TCF +S P+V
Sbjct: 343 VMDSGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSGILD--TCFDFSGQSSVSIPSVAL 400
Query: 366 HFE 368
F
Sbjct: 401 VFS 403
>gi|54287450|gb|AAV31194.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 351
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 59/162 (36%), Positives = 83/162 (51%), Gaps = 1/162 (0%)
Query: 221 IGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHVVQPEVNKTPLVPNQPHYS 280
+G G SN+S++ QLA S +KMFAHCLDG GGIF +GH+V P+V KTPL Y
Sbjct: 1 MGLGPSNTSLVYQLAKSQKWKKMFAHCLDGKRSGGIFVLGHIVGPKVRKTPLDQTSSRYR 60
Query: 281 INMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKV 340
+ + VG L+L + TI+++G+ ++YLPE VY+ + I S D+ V
Sbjct: 61 TTLLEITVGETSLSLSAGNVEIKSQNMTILETGSLISYLPEKVYQSFLDSIFSDLEDISV 120
Query: 341 HTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLF 382
+ Y+CF Y E H V+ V YL
Sbjct: 121 INI-GGYSCFHYERRTKESSREGLVHSGRQVTKPVLELYYLM 161
>gi|356514298|ref|XP_003525843.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 663
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 100/354 (28%), Positives = 163/354 (46%), Gaps = 52/354 (14%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKF 133
G Y ++ IGTPP+ + + VDTGS + +V C C++C R + + SST +
Sbjct: 110 GYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQCGRHQD-----PKFQPESSSTYQP 164
Query: 134 VTCDQEFCHGVYGGPLTDCTAN---TSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQT 190
V C DC + C Y Y + S+++G +DV+ + Q+
Sbjct: 165 VKCT------------IDCNCDGDRMQCVYERQYAEMSTSSGVLGEDVISFGN-----QS 207
Query: 191 TSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDG 250
+FGC ++G+L S + DGI+G G+ + S++ QL + F+ C G
Sbjct: 208 ELAPQRAVFGCENVETGDLYSQHA---DGIMGLGRGDLSIMDQLVDKKVISDSFSLCYGG 264
Query: 251 IN-GGGIFAIGHVVQP-EVNKTPLVPNQ-PHYSINMTAVQVGLDFLNLPTDVFGVGDNK- 306
++ GGG +G + P ++ P++ P+Y+I++ + V L L +VF D K
Sbjct: 265 MDVGGGAMVLGGISPPSDMTFAYSDPDRSPYYNIDLKEMHVAGKRLPLNANVF---DGKH 321
Query: 307 GTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEY---TCFQYS----ESVDEG 359
GT++DSGTT AYLPE + I+ + LK + D CF + + +
Sbjct: 322 GTVLDSGTTYAYLPEAAFLAFKDAIVKELQSLKQISGPDPNYNDICFSGAGNDVSQLSKS 381
Query: 360 FPNVTFHFENSVSLKVYPHEYLFPFEDL---WCIG-WQNSGMQSRDRKNMTLLG 409
FP V F N + P Y+F + +C+G +QN Q+ TLLG
Sbjct: 382 FPVVDMVFGNGHKYSLSPENYMFRHSKVRGAYCLGIFQNGNDQT------TLLG 429
>gi|414587774|tpg|DAA38345.1| TPA: hypothetical protein ZEAMMB73_272638 [Zea mays]
Length = 520
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 83/266 (31%), Positives = 130/266 (48%), Gaps = 26/266 (9%)
Query: 75 LYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKEC--PRRSSLG-IELTLYDIKDSSTG 131
L+YA + +GTP + + V +DTGSD+ W+ C QC C P ++ G + T Y SST
Sbjct: 108 LHYALVTVGTPGQTFMVALDTGSDLFWLPC-QCDGCTPPATAASGSFQATFYIPGMSSTS 166
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIY-GDGSSTTGYFVQDVVQYDKVSGDLQT 190
K V C+ FC +C+ CPY +Y G+S++G+ V+DV+ + Q
Sbjct: 167 KAVPCNSNFCDL-----QKECSTALQCPYKMVYVSAGTSSSGFLVEDVLYLSTENAHPQI 221
Query: 191 TSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDG 250
++ GCG Q+G+ + A +G+ G G S+ S LA G F+ C G
Sbjct: 222 --LKAQIMLGCGQTQTGSF--LDAAAPNGLFGLGIDEVSVPSILAQKGLTSNSFSMCF-G 276
Query: 251 INGGGIFAIGHVVQPEVNKTPLVPNQPH--YSINMTAVQVGLDFLNLPTDVFGVGDNKGT 308
+G G + G + +TPL N+ H Y+I ++ + VG N PTD+ + T
Sbjct: 277 RDGIGRISFGDQESSDQEETPLDINRQHPTYAITISGITVG----NKPTDMDFI-----T 327
Query: 309 IIDSGTTLAYLPEMVYEPLVSKIISQ 334
I D+GT+ YL + Y + +Q
Sbjct: 328 IFDTGTSFTYLADPAYTYITQSFHAQ 353
>gi|255566835|ref|XP_002524401.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223536362|gb|EEF38012.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 455
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 98/347 (28%), Positives = 151/347 (43%), Gaps = 36/347 (10%)
Query: 71 DGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSST 130
G G Y+ + IGTPP+ + DTGSD++WV C C+ C RS + + + S+T
Sbjct: 81 SGSGQYFVSLRIGTPPQTLLLVADTGSDLIWVKCSPCRNCSHRS----PGSAFFARHSTT 136
Query: 131 GKFVTCDQEFCHGV---YGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGD 187
+ C C V + P ++ C Y Y D S+TTG+F ++ + + +G
Sbjct: 137 YSAIHCYSPQCQLVPHPHPNPCNRTRLHSPCRYQYTYADSSTTTGFFSKEALTLNTSTGK 196
Query: 188 LQTTSTNGSLIFGCGARQSG-NLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAH 246
++ NG L FGCG R SG +L + E G++G G++ S SQL G + F++
Sbjct: 197 VK--KLNG-LSFGCGFRISGPSLTGASFEGAQGVMGLGRAPISFSSQLGRRFGSK--FSY 251
Query: 247 CLDGIN----GGGIFAIGHVVQPEVNK------TPLV--PNQP-HYSINMTAVQVGLDFL 293
CL IG V+K TPL+ P P Y I + V V L
Sbjct: 252 CLMDYTLSPPPTSFLTIGGAQNVAVSKKGIMSFTPLLINPLSPTFYYIAIKGVYVNGVKL 311
Query: 294 NLPTDVFGVGD--NKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYT--- 348
+ V+ + D N GTIIDSGTTL ++ E Y ++ + +K+ + +
Sbjct: 312 PINPSVWSIDDLGNGGTIIDSGTTLTFITEPAYTEILKAFKKR---VKLPSPAEPTPGFD 368
Query: 349 -CFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFED-LWCIGWQ 393
C S P ++F+ P Y D + C+ Q
Sbjct: 369 LCMNVSGVTRPALPRMSFNLAGGSVFSPPPRNYFIETGDQIKCLAVQ 415
>gi|145693994|gb|ABP93697.1| unknown protein isoform 2 [Lemna minor]
Length = 351
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 97/305 (31%), Positives = 143/305 (46%), Gaps = 36/305 (11%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y +G GTP + V DTGSD+ W +QCK C R E L+D SST
Sbjct: 12 GSGNYVITVGFGTPTRTQTVVFDTGSDVNW---LQCKPCAVRCYAQQE-PLFDPSLSSTY 67
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
+ V+C + C G+ T ++++C Y YGDGSST G+ D L
Sbjct: 68 RNVSCTEPACVGLS----TRGCSSSTCLYGVFYGDGSSTIGFLAMDTFM-------LTPA 116
Query: 192 STNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNS-SMISQLASSGGVRKMFAHCLDG 250
+ IFGCG +G T G++G G+S++ S+ SQ+A S G +F++CL
Sbjct: 117 QKFKNFIFGCGQNNTGLFQGT-----AGLVGLGRSSTYSLNSQVAPSLG--NVFSYCLPS 169
Query: 251 INGG-GIFAIGHVVQPEVNKTPLVPNQ---PHYSINMTAVQVGLDFLNLPTDVFGVGDNK 306
+ G IG+ Q T ++ + Y I++ + VG L+L + VF +
Sbjct: 170 TSSATGYLNIGN-PQNTPGYTAMLTDTRVPTLYFIDLIGISVGGTRLSLSSTVF---QSV 225
Query: 307 GTIIDSGTTLAYLPEMVYEPL---VSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNV 363
GTIIDSGT + LP Y L V ++Q T+ D TC+ +S + +P +
Sbjct: 226 GTIIDSGTVITRLPPTAYSALKTAVRAAMTQYTLAPAVTILD--TCYDFSRTTSVVYPVI 283
Query: 364 TFHFE 368
HF
Sbjct: 284 VLHFA 288
>gi|168027607|ref|XP_001766321.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162682535|gb|EDQ68953.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 381
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 102/336 (30%), Positives = 141/336 (41%), Gaps = 39/336 (11%)
Query: 64 LGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLY 123
L G+ PDG LYY + IG P K YY+ +DTGSD+ W+ C + P RS LY
Sbjct: 13 LRGNIYPDG--LYYMAMLIGAPAKLYYLDMDTGSDLTWLQC----DAPCRSCASGPHGLY 66
Query: 124 DIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANT-SCPYLEIYGDGSSTTGYFVQDVVQYD 182
D K + + V C C V G C C Y Y DGSST G ++D +
Sbjct: 67 DPKKA---RLVDCRVPLCALVQQGGSYACGGPVRQCDYDVEYADGSSTMGVLMEDTITLL 123
Query: 183 KVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRK 242
+G T + + I GCG Q G L T + DG++G + S+ SQLA G VR
Sbjct: 124 LTNG----TRSKTTAIIGCGYDQQGTLAQT-PASTDGVMGLSSAKISLPSQLAKKGIVRN 178
Query: 243 MFAHCL-DGINGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFG 301
+ HCL G NGGG G + P + T + N+ D
Sbjct: 179 VIGHCLAGGSNGGGYLFFGDSLVPALGMTWTPIMGKSITGNIGGKSGDAD--------DK 230
Query: 302 VGDNKGTIIDSGTTLAYLPEMVYEPLVSKI---ISQQPDLKVHTVHDEYTC------FQY 352
GD G + DSGT+ YL Y ++S + + + +++ T + C F+
Sbjct: 231 TGDIGGVMFDSGTSFTYLVPEAYNAVLSAMEMQVEKSGLVRIKTDNTLPFCWRGPSPFES 290
Query: 353 SESVDEGFPNVTFHF------ENSVSLKVYPHEYLF 382
V F VT F S L++ P YL
Sbjct: 291 VADVQRYFKTVTLDFGKRNWYSASRVLELSPEGYLI 326
>gi|226501154|ref|NP_001146408.1| uncharacterized protein LOC100279988 [Zea mays]
gi|219887047|gb|ACL53898.1| unknown [Zea mays]
gi|414587777|tpg|DAA38348.1| TPA: hypothetical protein ZEAMMB73_272638 [Zea mays]
Length = 416
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 81/254 (31%), Positives = 125/254 (49%), Gaps = 24/254 (9%)
Query: 75 LYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKEC-PRRSSLGIELTLYDIKDSSTGKF 133
L+YA + +GTP + + V +DTGSD+ W+ C QC C P ++ T Y SST K
Sbjct: 6 LHYALVTVGTPGQTFMVALDTGSDLFWLPC-QCDGCTPPATAASGSATFYIPGMSSTSKA 64
Query: 134 VTCDQEFCHGVYGGPLTDCTANTSCPYLEIY-GDGSSTTGYFVQDVVQYDKVSGDLQTTS 192
V C+ FC +C+ CPY +Y G+S++G+ V+DV+ + Q
Sbjct: 65 VPCNSNFCDL-----QKECSTALQCPYKMVYVSAGTSSSGFLVEDVLYLSTENAHPQI-- 117
Query: 193 TNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGIN 252
++ GCG Q+G+ + A +G+ G G S+ S LA G F+ C G +
Sbjct: 118 LKAQIMLGCGQTQTGSF--LDAAAPNGLFGLGIDEVSVPSILAQKGLTSNSFSMCF-GRD 174
Query: 253 GGGIFAIGHVVQPEVNKTPLVPNQPH--YSINMTAVQVGLDFLNLPTDVFGVGDNKGTII 310
G G + G + +TPL N+ H Y+I ++ + VG N PTD+ + TI
Sbjct: 175 GIGRISFGDQESSDQEETPLDINRQHPTYAITISGITVG----NKPTDMDFI-----TIF 225
Query: 311 DSGTTLAYLPEMVY 324
D+GT+ YL + Y
Sbjct: 226 DTGTSFTYLADPAY 239
>gi|125563959|gb|EAZ09339.1| hypothetical protein OsI_31611 [Oryza sativa Indica Group]
Length = 453
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 90/330 (27%), Positives = 143/330 (43%), Gaps = 46/330 (13%)
Query: 65 GGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYD 124
G + R G Y + +GTPP+ +DTGSD++W C C C R+ L+
Sbjct: 87 GMAVRASGDLEYVLDLAVGTPPQPITALLDTGSDLIWTQCDTCTACLRQPD-----PLFS 141
Query: 125 IKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKV 184
+ SS+ + + C + C + C +C Y YGDG++T GY+ + +
Sbjct: 142 PRMSSSYEPMRCAGQLCGDILH---HSCVRPDTCTYRYSYGDGTTTLGYYATERFTFASS 198
Query: 185 SGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMF 244
SG+ Q+ L FGCG G+L++ + GI+GFG+ S++SQL+ +R+ F
Sbjct: 199 SGETQSV----PLGFGCGTMNVGSLNNAS-----GIVGFGRDPLSLVSQLS----IRR-F 244
Query: 245 AHCL--------DGINGGGIFAIGHV--VQPEVNKTPLV---PNQPHYSINMTAVQVGLD 291
++CL + G + +G V TP++ N Y + T V VG
Sbjct: 245 SYCLTPYASSRKSTLQFGSLADVGLYDDATGPVQTTPILQSAQNPTFYYVAFTGVTVGAR 304
Query: 292 FLNLPTDVFGVGDN--KGTIIDSGTTLAYLPEMVYEPLVSKIISQ-QPDLKVHTVHDEYT 348
L +P F + + G IIDSGT L P V +V SQ + + D+
Sbjct: 305 RLRIPASAFALRPDGSGGVIIDSGTALTLFPAAVLAEVVRAFRSQLRLPFANGSSPDDGV 364
Query: 349 CFQYSE--------SVDEGFPNVTFHFENS 370
CF + P + FHF+ +
Sbjct: 365 CFAAPAVAAGGGRMARQVAVPRMVFHFQGA 394
>gi|225430555|ref|XP_002285593.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 481
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 92/317 (29%), Positives = 143/317 (45%), Gaps = 35/317 (11%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y +G+GTP +D DTGSD+ W QC+ C R E +++ S++
Sbjct: 134 GTGNYVVTVGLGTPKRDLTFIFDTGSDLTWT---QCEPCARYCYHQQE-PIFNPSKSTSY 189
Query: 132 KFVTCDQEFCHGVYGGP--LTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQ 189
++C C + G C+A+T C Y YGD S + G+F QD + L
Sbjct: 190 TNISCSSPTCDELKSGTGNSPSCSAST-CVYGIQYGDQSYSVGFFAQDKLA-------LT 241
Query: 190 TTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLD 249
+T + +FGCG G + G+IG G++ S++SQ A G K+F++CL
Sbjct: 242 STDVFNNFLFGCGQNNRGLF-----VGVAGLIGLGRNALSLVSQTAQKYG--KLFSYCLP 294
Query: 250 GIN---GGGIFAIGHVVQPEVNKTPLVPNQ---PHYSINMTAVQVGLDFLNLPTDVFGVG 303
+ G F G V TP + N Y +N+ A+ VG L+ VF
Sbjct: 295 STSSSTGYLTFGSGGGTSKAVKFTPSLVNSQGPSFYFLNLIAISVGGRKLSTSASVFSTA 354
Query: 304 DNKGTIIDSGTTLAYLPEMVYEPLVSKI---ISQQPDLKVHTVHDEYTCFQYSESVDEGF 360
GTIIDSGT ++ LP Y L + +S+ P ++ D TC+ +S+
Sbjct: 355 ---GTIIDSGTVISRLPPTAYSDLRASFQQQMSKYPKAAPASILD--TCYDFSQYDTVDV 409
Query: 361 PNVTFHFENSVSLKVYP 377
P + +F + + + P
Sbjct: 410 PKINLYFSDGAEMDLDP 426
>gi|356559244|ref|XP_003547910.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 515
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 99/327 (30%), Positives = 147/327 (44%), Gaps = 38/327 (11%)
Query: 36 AGRERSLSLLKEHDARRQQRILAGVDLPLGGSS-RPDGVG-LYYAKIGIGTPPKDYYVQV 93
A R+R L R+ +I G+ G S+ R +G L+Y + IGTP + V +
Sbjct: 60 ADRDRLLR------GRKLSQIDDGLAFSDGNSTFRISSLGFLHYTTVQIGTPGVKFMVAL 113
Query: 94 DTGSDIMWVNCIQCKECPRRSSLG----IELTLYDIKDSSTGKFVTCDQEFCHGVYGGPL 149
DTGSD+ WV C C C S +L +Y+ SST K VTC+ C
Sbjct: 114 DTGSDLFWVPC-DCTRCAATDSSAFASDFDLNVYNPNGSSTSKKVTCNNSLCMH-----R 167
Query: 150 TDCTANTS-CPYLEIYGDG-SSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSG 207
+ C S CPY+ Y +ST+G V+DV+ + N +IFGCG QSG
Sbjct: 168 SQCLGTLSNCPYMVSYVSAETSTSGILVEDVLHLTQEDNHHDLVEAN--VIFGCGQIQSG 225
Query: 208 NLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHVVQPEV 267
+ + A +G+ G G S+ S L+ G F+ C G +G G + G +
Sbjct: 226 SF--LDVAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCF-GRDGIGRISFGDKGSFDQ 282
Query: 268 NKTP--LVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYE 325
++TP L P+ P Y+I +T V+VG +++ + DSGT+ YL + Y
Sbjct: 283 DETPFNLNPSHPTYNITVTQVRVGTTLIDV---------EFTALFDSGTSFTYLVDPTYT 333
Query: 326 PLVSKIISQQPDLKVHTVHDEYTCFQY 352
L SQ D + + D F+Y
Sbjct: 334 RLTESFHSQVQDRRHRS--DSRIPFEY 358
>gi|449515033|ref|XP_004164554.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 430
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 86/316 (27%), Positives = 144/316 (45%), Gaps = 38/316 (12%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y + IGTPP DY DTGSD+MW C+ C +C ++S ++D S++
Sbjct: 88 GSGEYLMSVSIGTPPVDYIGMADTGSDLMWAQCLPCLKCYKQSR-----PIFDPLKSTSF 142
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
V C+ + C + + C A C Y YGD + T G + ++K++
Sbjct: 143 SHVPCNSQNCKAIDD---SHCGAQGVCDYSYTYGDQTYTKGD-----LGFEKIT----IG 190
Query: 192 STNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI 251
S++ + GCG G+IG G S++SQ++ + G+ + F++CL +
Sbjct: 191 SSSVKSVIGCGHES-----GGGFGFASGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTL 245
Query: 252 ----NGGGIFAIGHVVQ-PEVNKTPLVPNQP--HYSINMTAVQVGLDFLNLPTDVFGVGD 304
NG F VV P V TPL+ P +Y + + A+ +G +
Sbjct: 246 LSHANGKINFGQNAVVSGPGVVSTPLISKNPVTYYYVTLEAISIGNE------RHMASAK 299
Query: 305 NKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEY-TCFQ--YSESVDEGFP 361
IIDSGTTL++LP+ +Y+ +VS ++ +V + + CF + + G P
Sbjct: 300 QGNVIIDSGTTLSFLPKELYDGVVSSLLKVVKAKRVKDPGNFWDLCFDDGINVATSSGIP 359
Query: 362 NVTFHFENSVSLKVYP 377
+T F ++ + P
Sbjct: 360 IITAQFSGGANVNLLP 375
>gi|147862576|emb|CAN79341.1| hypothetical protein VITISV_006338 [Vitis vinifera]
Length = 436
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 103/357 (28%), Positives = 158/357 (44%), Gaps = 45/357 (12%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G + + IGTP + Y +DTGSD++W C CK C + + ++D + SS+
Sbjct: 93 GNGEFLMNLAIGTPAETYSAIMDTGSDLIWTQCKPCKVCFDQPT-----PIFDPEKSSSF 147
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
+ C + C + P++ C+ C Y YGD SST G + + GD +
Sbjct: 148 SKLPCSSDLCVAL---PISSCSDG--CEYRYSYGDHSSTQGVLATETFTF----GDASVS 198
Query: 192 STNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI 251
FGCG G S G++G G+ S+ISQL GV K F++CL I
Sbjct: 199 KIG----FGCGEDNRGRAYSQGA----GLVGLGRGPLSLISQL----GVPK-FSYCLTSI 245
Query: 252 N---GGGIFAIG-HVVQPEVNKTPLV--PNQPH-YSINMTAVQVGLDFLNLPTDVFGVGD 304
+ G +G TPL+ P++P Y +++ + VG L + F + D
Sbjct: 246 DDSKGISTLLVGSEATVKSAIPTPLIQNPSRPSFYYLSLEGISVGDTLLPIEKSTFSIQD 305
Query: 305 N--KGTIIDSGTTLAYLPEMVYEPLVSKIISQQP-DLKVHTVHDEYTCFQY-SESVDEGF 360
+ G IIDSGTT+ YL + + L + ISQ D+ + CF +
Sbjct: 306 DGSGGLIIDSGTTITYLKDNAFAALKKEFISQMKLDVDASGSTELELCFTLPPDGSPVEV 365
Query: 361 PNVTFHFENSVSLKVYPHEYLFPFEDLWCIGW---QNSGMQ---SRDRKNMTLLGDF 411
P + FHFE V LK+ Y+ L I +SGM + ++N+ +L D
Sbjct: 366 PQLVFHFEG-VDLKLPKENYIIEDSALRVICLTMGSSSGMSIFGNFQQQNIVVLHDL 421
>gi|326499199|dbj|BAK06090.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 505
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 82/265 (30%), Positives = 128/265 (48%), Gaps = 25/265 (9%)
Query: 75 LYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECP--RRSSLGIELTLYDIKDSSTGK 132
L+YA + +GTP + V +DTGSD+ W+ C QC C S+ + Y SST +
Sbjct: 97 LHYALVTVGTPGHTFMVALDTGSDLFWLPC-QCDGCTPPPSSAASAPASFYIPSLSSTSQ 155
Query: 133 FVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDG-SSTTGYFVQDVVQYDKVSGDLQTT 191
V C+ +FC G +C+ +SCPY +Y +S++G+ V+DV+ + D
Sbjct: 156 AVPCNSDFC-----GLRKECSKTSSCPYKMVYVSADTSSSGFLVEDVLYLS--TEDTHPQ 208
Query: 192 STNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI 251
++FGCG Q+G+ + A +G+ G G S+ S LA G F+ C G
Sbjct: 209 FLKAQIMFGCGEVQTGSF--LDAAAPNGLFGLGVDMISVPSILAQKGLTSNSFSMCF-GR 265
Query: 252 NGGGIFAIGHVVQPEVNKTPLVPNQPH--YSINMTAVQVGLDFLNLPTDVFGVGDNKGTI 309
+G G + G + +TPL NQ H Y+I +T + VG + ++L TI
Sbjct: 266 DGIGRISFGDQGSSDQEETPLDINQKHPTYAITITGIAVGNNLMDLEVS---------TI 316
Query: 310 IDSGTTLAYLPEMVYEPLVSKIISQ 334
D+GT+ YL + Y + SQ
Sbjct: 317 FDTGTSFTYLADPAYTYITDGFHSQ 341
>gi|168040957|ref|XP_001772959.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675692|gb|EDQ62184.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 351
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 93/332 (28%), Positives = 147/332 (44%), Gaps = 45/332 (13%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y +I +GTPP+ + VDTGSD+ WV C C C + L+ SS+
Sbjct: 4 GSGEYVLQISLGTPPQQFSAIVDTGSDLCWVQCAPCARCFEQPD-----PLFIPLASSSY 58
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
+C C + P C+ +C Y YGDGS+T G F + V +
Sbjct: 59 SNASCTDSLCDAL---PRPTCSMRNTCTYSYSYGDGSNTRGDFAFETVTLNG-------- 107
Query: 192 STNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI 251
ST + FGCG Q G DG+IG G+ S+ SQL SS +F++CL
Sbjct: 108 STLARIGFGCGHNQEGTF-----AGADGLIGLGQGPLSLPSQLNSS--FTHIFSYCLVDQ 160
Query: 252 NGGGIFA---IGHVVQ-PEVNKTPLVPNQ---PHYSINMTAVQVGLDFLNLPTDVFGVGD 304
+ G F+ G+ + + TPL+ N+ +Y + + ++ VG + P F +
Sbjct: 161 STTGTFSPITFGNAAENSRASFTPLLQNEDNPSYYYVGVESISVGNRRVPTPPSAFRIDA 220
Query: 305 N--KGTIIDSGTTLAYLPEMVYEPLVSKIISQ----QPDLKVHTVHDEYTCFQYSESVDE 358
N G I+DSGTT+ Y + P+++++ Q + D + ++ Y S S
Sbjct: 221 NGVGGVILDSGTTITYWRLAAFIPILAELRRQISYPEADPTPYGLNLCYDISSVSAS-SL 279
Query: 359 GFPNVTFHFENSVSLKVYPHEYLFPFEDLWCI 390
P++T H N ++ P +LW +
Sbjct: 280 TLPSMTVHLTNV--------DFEIPVSNLWVL 303
>gi|326532354|dbj|BAK05106.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 564
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 101/366 (27%), Positives = 162/366 (44%), Gaps = 43/366 (11%)
Query: 26 HGVFSVKYRYAGRERSLSLLKEHDARRQQRILAGVDLPLGGSSRP--DGVGLYYAKIGIG 83
HG ++ G R+L +R+ ++L+ + GG P D LYY + +G
Sbjct: 93 HGARWPRHGSGGYYRALVRSDLQRQKRKHQLLSVSEA--GGIFSPGNDFGWLYYTWVDVG 150
Query: 84 TPPKDYYVQVDTGSDIMWVNCIQCKECPR----RSSLGIELTLYDIKDSSTGKFVTCDQE 139
TP + V +DTGSD+ WV C C EC R +L +L +Y +S+T + + C E
Sbjct: 151 TPNTSFMVALDTGSDLFWVPC-DCIECAPLAGYRETLDRDLGIYKPAESTTSRHLPCSHE 209
Query: 140 FCHGVYGGPLTDCTA-NTSCPYLEIY-GDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSL 197
C P + C++ CPY Y + ++++G ++D++ D S+
Sbjct: 210 LC-----PPGSGCSSPKQPCPYSTDYLQENTTSSGLLIEDILHLDSRESH---APVKASV 261
Query: 198 IFGCGARQSGN-LDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGI 256
+ GCG +QSG+ LD A DG++G G ++ S+ S LA +G VR F+ C +G
Sbjct: 262 VIGCGRKQSGSYLDGI---APDGLLGLGMADISVPSFLARAGLVRNSFSMCFKEDSGRIF 318
Query: 257 FA-IGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTT 315
F G +Q PL Y++N+ VG + ++DSGT+
Sbjct: 319 FGDQGVSIQQSTPFVPLYGKYQTYAVNVDKSCVGHKCFE--------ATSFEALVDSGTS 370
Query: 316 LAYLPEMVYEPLVSKIISQQPDLKVHT--VHDEYTCFQYSESVD----EGFPNVTFHFEN 369
LP VY K ++ + D +VH + E F+Y S P VT F
Sbjct: 371 FTALPLNVY-----KAVAVEFDKQVHAPRITQEDASFEYCYSASPLKMPDVPTVTLTFAA 425
Query: 370 SVSLKV 375
+ S +
Sbjct: 426 NKSFQA 431
>gi|357517935|ref|XP_003629256.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355523278|gb|AET03732.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 544
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 81/267 (30%), Positives = 129/267 (48%), Gaps = 29/267 (10%)
Query: 75 LYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPR----RSSLGIELTLYDIKDSST 130
L++A + +GTPP + V +DTGSD+ W+ C C C R ++ I+L +Y++ SST
Sbjct: 112 LHFANVSVGTPPLWFLVALDTGSDLFWLPC-NCTSCVRGLKTQNGKVIDLNIYELDKSST 170
Query: 131 GKFVTCDQEFCHGVYGGPLTDC-TANTSCPY-LEIYGDGSSTTGYFVQDVVQYDKVSGDL 188
K V C+ C T C ++ +SC Y +E + +S++G+ V+DV+ ++ +
Sbjct: 171 RKNVPCNSNMCKQ------TQCHSSGSSCRYEVEYLSNDTSSSGFLVEDVLHL--ITDND 222
Query: 189 QTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL 248
QT + + GCG Q+G N A +G+ G G N S+ S LA G + F+ C
Sbjct: 223 QTKDIDTQITIGCGQVQTGVF--LNGAAPNGLFGLGMENVSVPSILAQKGLISDSFSMCF 280
Query: 249 DGINGGGIFAIGHVVQPEVNKTP--LVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNK 306
G +G G G + KTP L + P Y++ +T + VG +
Sbjct: 281 -GSDGSGRITFGDTGSSDQGKTPFNLRESHPTYNVTITQIIVG---------GYAADHEF 330
Query: 307 GTIIDSGTTLAYLPEMVYEPLVSKIIS 333
I DSGT+ YL + Y + K S
Sbjct: 331 HAIFDSGTSFTYLNDPAYTLISEKFNS 357
>gi|326500240|dbj|BAK06209.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 505
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 82/265 (30%), Positives = 128/265 (48%), Gaps = 25/265 (9%)
Query: 75 LYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECP--RRSSLGIELTLYDIKDSSTGK 132
L+YA + +GTP + V +DTGSD+ W+ C QC C S+ + Y SST +
Sbjct: 97 LHYALVTVGTPGHTFMVALDTGSDLFWLPC-QCDGCTPPPSSAASAPASFYIPSLSSTSQ 155
Query: 133 FVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDG-SSTTGYFVQDVVQYDKVSGDLQTT 191
V C+ +FC G +C+ +SCPY +Y +S++G+ V+DV+ + D
Sbjct: 156 AVPCNSDFC-----GLRKECSKTSSCPYKMVYVSADTSSSGFLVEDVLYLS--TEDTHPQ 208
Query: 192 STNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI 251
++FGCG Q+G+ + A +G+ G G S+ S LA G F+ C G
Sbjct: 209 FLKAQIMFGCGEVQTGSF--LDAAAPNGLFGLGVDMISVPSILAQKGLTSNSFSMCF-GR 265
Query: 252 NGGGIFAIGHVVQPEVNKTPLVPNQPH--YSINMTAVQVGLDFLNLPTDVFGVGDNKGTI 309
+G G + G + +TPL NQ H Y+I +T + VG + ++L TI
Sbjct: 266 DGIGRISFGDQGSSDQEETPLDINQKHPTYAITITGIAVGNNLMDLEVS---------TI 316
Query: 310 IDSGTTLAYLPEMVYEPLVSKIISQ 334
D+GT+ YL + Y + SQ
Sbjct: 317 FDTGTSFTYLADPAYTYITDGFHSQ 341
>gi|115476828|ref|NP_001062010.1| Os08g0469000 [Oryza sativa Japonica Group]
gi|42407407|dbj|BAD09565.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113623979|dbj|BAF23924.1| Os08g0469000 [Oryza sativa Japonica Group]
gi|125603713|gb|EAZ43038.1| hypothetical protein OsJ_27627 [Oryza sativa Japonica Group]
Length = 448
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 93/334 (27%), Positives = 142/334 (42%), Gaps = 50/334 (14%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKF 133
G Y + IGTPP Y VDTGSD++W C C C + + + S+T +
Sbjct: 90 GEYLMDLAIGTPPLRYTAMVDTGSDLIWTQCAPCVLCADQPT-----PYFRPARSATYRL 144
Query: 134 VTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTST 193
V C C + P C + C Y YGD +ST G + + + S
Sbjct: 145 VPCRSPLCAAL---PYPACFQRSVCVYQYYYGDEASTAGVLASETFTFGAANSSKVMVS- 200
Query: 194 NGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDG--- 250
+ FGCG SG L +++ G++G G+ S++SQL S F++CL
Sbjct: 201 --DVAFGCGNINSGQLANSS-----GMVGLGRGPLSLVSQLGPS-----RFSYCLTSFLS 248
Query: 251 -------------INGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQ-VGLDFLNLP 296
+NG + G VQ TPLV N S+ +++ + L LP
Sbjct: 249 PEPSRLNFGVFATLNGTNASSSGSPVQ----STPLVVNAALPSLYFMSLKGISLGQKRLP 304
Query: 297 TD--VFGVGDN--KGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKV--HTVHDEYTCF 350
D VF + D+ G IDSGT+L +L + Y+ + +++S L T TCF
Sbjct: 305 IDPLVFAINDDGTGGVFIDSGTSLTWLQQDAYDAVRRELVSVLRPLPPTNDTEIGLETCF 364
Query: 351 QY--SESVDEGFPNVTFHFENSVSLKVYPHEYLF 382
+ SV P++ HF+ ++ V P Y+
Sbjct: 365 PWPPPPSVAVTVPDMELHFDGGANMTVPPENYML 398
>gi|242077672|ref|XP_002448772.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
gi|241939955|gb|EES13100.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
Length = 471
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 92/360 (25%), Positives = 155/360 (43%), Gaps = 49/360 (13%)
Query: 53 QQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPR 112
+ ++++G+D +G G Y+ ++GIG+PP + Y+ VD+GSD++WV C C EC
Sbjct: 111 ESKVVSGLD---------EGSGEYFVRVGIGSPPTEQYLVVDSGSDVIWVQCKPCLECYA 161
Query: 113 RSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTG 172
++ L+D S+T V+C C + + C + C Y YGDGS T G
Sbjct: 162 QAD-----PLFDPASSATFSAVSCGSAICRTLR---TSGCGDSGGCEYEVSYGDGSYTKG 213
Query: 173 YFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMIS 232
+ + L T+ G I GCG R G G++G G S++
Sbjct: 214 TLALETLT-------LGGTAVEGVAI-GCGHRNRGLFVGAA-----GLLGLGWGPMSLVG 260
Query: 233 QLASSGGVRKMFAHCLDGINGGG----------IFAIGHVVQPEVNKTPLV--PNQPH-Y 279
QL + F++CL G G + V PLV P P Y
Sbjct: 261 QLGGA--AGGAFSYCLASRGGSGSGAADAAGSLVLGRSEAVPEGAVWVPLVRNPQAPSFY 318
Query: 280 SINMTAVQVGLDFLNLPTDVFGVGDN--KGTIIDSGTTLAYLPEMVYEPLVSKIISQQPD 337
+ ++ + VG + L L +F + ++ G ++D+GT + LP+ Y L +
Sbjct: 319 YVGVSGIGVGDERLPLQDGLFQLTEDGGGGVVMDTGTAVTRLPQEAYAALRDAFVGAVGA 378
Query: 338 L-KVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFE-DLWCIGWQNS 395
L + V TC+ S P V+F+F+ + +L + L + ++C+ + S
Sbjct: 379 LPRAPGVSLLDTCYDLSGYTSVRVPTVSFYFDGAATLTLPARNLLLEVDGGIYCLAFAPS 438
>gi|108707835|gb|ABF95630.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 384
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 108/381 (28%), Positives = 165/381 (43%), Gaps = 48/381 (12%)
Query: 50 ARRQQRILAGVDLPLGGSSRPDGVGL--YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQC 107
AR + + + P+ + DGV + Y + IGTPP+ + +DTGS ++W C C
Sbjct: 7 ARAPRLLSSSATAPVSPGAYDDGVPMTEYLLHLAIGTPPQPVQLTLDTGSVLVWTQCQPC 66
Query: 108 KECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANT--SCPYLEIYG 165
C +S L YD SST +CD C +T C T +C Y YG
Sbjct: 67 AVCFNQS-----LPYYDASRSSTFALPSCDSTQCK--LDPSVTMCVNQTVQTCAYSYSYG 119
Query: 166 DGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGK 225
D S+T G+ + V + V+G ++ ++FGCG +G S NE GI GFG+
Sbjct: 120 DKSATIGFLDVETVSF--VAG-----ASVPGVVFGCGLNNTGIFRS-NET---GIAGFGR 168
Query: 226 SNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHVVQPEVNK--------TPLVPNQP 277
S+ SQL F+HC ++G + + ++ K TPL+ N
Sbjct: 169 GPLSLPSQLKVGN-----FSHCFTAVSGRKPSTVLFDLPADLYKNGRGTVQTTPLIKNPA 223
Query: 278 H---YSINMTAVQVGLDFLNLPTDVFGVGDNK-GTIIDSGTTLAYLPEMVYEPLVSKIIS 333
H Y +++ + VG L +P F + + GTIIDSGT LP VY LV +
Sbjct: 224 HPTFYYLSLKGITVGSTRLPVPESAFALKNGTGGTIIDSGTAFTSLPPRVYR-LVHDEFA 282
Query: 334 QQPDLKVHTVHD--EYTCFQYSE-SVDEGFPNVTFHFENSVSLKVYPHEYLFPFEDLWCI 390
L V ++ CF P + HFE + ++ + Y+F +D
Sbjct: 283 AHVKLPVVPSNETGPLLCFSAPPLGKAPHVPKLVLHFEGA-TMHLPRENYVFEAKD---- 337
Query: 391 GWQNSGMQSRDRKNMTLLGDF 411
G S + MT++G+F
Sbjct: 338 GGNCSICLAIIEGEMTIIGNF 358
>gi|225216995|gb|ACN85284.1| aspartic proteinase nepenthesin-1 precursor [Oryza australiensis]
Length = 519
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 101/323 (31%), Positives = 132/323 (40%), Gaps = 44/323 (13%)
Query: 67 SSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQC-KECPRRSSLGIELTLYDI 125
S R G G Y +G+GTP Y V DTGSD WV C C C + L+D
Sbjct: 171 SGRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQRE-----KLFDP 225
Query: 126 KDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVV---QYD 182
SST ++C C + T + +C Y YGDGS + G+F D + YD
Sbjct: 226 ARSSTYANISCAAPACSDLD----TRGCSGGNCLYGVQYGDGSYSIGFFAMDTLTLSSYD 281
Query: 183 KVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASS-GGVR 241
V G FGCG R G E A G++G G+ +S+ Q GGV
Sbjct: 282 AVKG----------FRFGCGERNEGLF---GEAA--GLLGLGRGKTSLPVQTYDKYGGV- 325
Query: 242 KMFAHCLDGINGG-GIFAIG----HVVQPEVNKTPLVPNQP-HYSINMTAVQVGLDFLNL 295
FAHCL + G G G + L N P Y + MT ++VG L++
Sbjct: 326 --FAHCLPARSSGTGYLDFGPGSPAAAGARLTTPMLTDNGPTFYYVGMTGIRVGGQLLSI 383
Query: 296 PTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDL---KVHTVHDEYTCFQY 352
P VF GTI+DSGT + LP Y L S S K V TC+ +
Sbjct: 384 PQSVF---TTAGTIVDSGTVITRLPPAAYSSLRSAFASAMAARGYKKAPAVSLLDTCYDF 440
Query: 353 SESVDEGFPNVTFHFENSVSLKV 375
+ P V+ F+ L V
Sbjct: 441 TGMSQVAIPTVSLLFQGGARLDV 463
>gi|225216988|gb|ACN85278.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
Length = 518
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 104/328 (31%), Positives = 133/328 (40%), Gaps = 54/328 (16%)
Query: 67 SSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQC-KECPRRSSLGIELTLYDI 125
S R G G Y +G+GTP Y V DTGSD WV C C C + + L+D
Sbjct: 170 SGRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQ-----QEKLFDP 224
Query: 126 KDSSTGKFVTCDQEFC-----HGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVV- 179
SST V+C C G GG C Y YGDGS + G+F D +
Sbjct: 225 ARSSTYANVSCAAPACFDLDTRGCSGG---------HCLYGVQYGDGSYSIGFFAMDTLT 275
Query: 180 --QYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASS 237
YD V G FGCG R G E A G++G G+ +S+ Q
Sbjct: 276 LSSYDAVKG----------FRFGCGERNEGLF---GEAA--GLLGLGRGKTSLPVQTYDK 320
Query: 238 -GGVRKMFAHCLDGINGG-GIFAIG----HVVQPEVNKTPLVPNQP-HYSINMTAVQVGL 290
GGV FAHCL + G G G + L N P Y + MT ++VG
Sbjct: 321 YGGV---FAHCLPARSSGTGYLDFGPGSPAAAGARLTTPMLTDNGPTFYYVGMTGIRVGG 377
Query: 291 DFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDL---KVHTVHDEY 347
L++P VF GTI+DSGT + LP Y L S +S K V
Sbjct: 378 QLLSIPQSVFA---TAGTIVDSGTVITRLPPPAYSSLRSAFVSAMAARGYKKAPAVSLLD 434
Query: 348 TCFQYSESVDEGFPNVTFHFENSVSLKV 375
TC+ ++ P V+ F+ L V
Sbjct: 435 TCYDFTGMSQVAIPTVSLLFQGGAILDV 462
>gi|218185380|gb|EEC67807.1| hypothetical protein OsI_35373 [Oryza sativa Indica Group]
Length = 418
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 88/322 (27%), Positives = 133/322 (41%), Gaps = 31/322 (9%)
Query: 73 VGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGK 132
G YY + IG P K Y++ VDTGSD+ W+ C + P +S + LY + K
Sbjct: 54 TGHYYVTMNIGDPAKPYFLDVDTGSDLTWLQC----DAPCQSCNKVPHPLYR---PTKNK 106
Query: 133 FVTCDQEFCHGVYGG--PLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQT 190
V C C ++ G P CT C Y Y D +S+ G V D S L+
Sbjct: 107 LVPCANSICTALHSGSSPNKKCTTQQQCDYQIKYTDKASSLGVLVTD-----SFSLPLRN 161
Query: 191 TS-TNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLD 249
S SL FGCG Q + DG++G G+ + S++SQL G + + HCL
Sbjct: 162 KSNVRPSLSFGCGYDQQVGKNGAAPATTDGLLGLGRGSVSLLSQLKQQGITKNVLGHCLS 221
Query: 250 GINGGGIFAIGHVVQP--EVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKG 307
+GGG G + P V P+V + + + + D +L T V
Sbjct: 222 -TSGGGFLFFGDDMVPTSRVTWVPMVRSTSGNYYSPGSATLYFDRRSLSTKPMEV----- 275
Query: 308 TIIDSGTTLAYLPEMVYEPLVSKI-------ISQQPDLKVHTVHDEYTCFQYSESVDEGF 360
+ DSG+T Y Y+ +S I + Q D + F+ V + F
Sbjct: 276 -VFDSGSTYTYFSAQPYQATISAIKGSLSKSLKQVSDPSLPLCWKGQKAFKSVSDVKKDF 334
Query: 361 PNVTFHFENSVSLKVYPHEYLF 382
++ F F + +++ P YL
Sbjct: 335 KSLQFIFGKNAVMEIPPENYLI 356
>gi|302765224|ref|XP_002966033.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
gi|300166847|gb|EFJ33453.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
Length = 357
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 91/346 (26%), Positives = 143/346 (41%), Gaps = 49/346 (14%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y+ ++GIG+P K Y+ +DTGSD+ W+ C CK C +++ ++D + SS+
Sbjct: 10 GSGEYFVRVGIGSPTKLQYLVMDTGSDVPWIQCSPCKSCYKQND-----AVFDPRASSSF 64
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTS----CPYLEIYGDGSSTTGYFVQDVVQYDKVSGD 187
+ ++C C L D A S C Y YGDGS T G D
Sbjct: 65 RRLSCSTPQCK------LLDVKACASTDNRCLYQVSYGDGSFTVGDLASDSF-------- 110
Query: 188 LQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHC 247
L + ++FGCG G G S SQL+S + F++C
Sbjct: 111 LVSRGRTSPVVFGCGHDNEGLFVGAAGLLGLGAGKL-----SFPSQLSS-----RKFSYC 160
Query: 248 L----DGINGGGIFAIGHVVQP---EVNKTPLVPN---QPHYSINMTAVQVGLDFLNLPT 297
L +G+ G P T L+ N Y ++ + +G L++P+
Sbjct: 161 LVSRDNGVRASSALLFGDSALPTSASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPS 220
Query: 298 DVFGVGDNK---GTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEY-TCFQYS 353
F + + G IIDSGT++ LP Y + S L + TC+ +S
Sbjct: 221 TAFKLSSSTGRGGVIIDSGTSVTRLPTYAYTVMRDAFRSATQKLPRAADFSLFDTCYDFS 280
Query: 354 ESVDEGFPNVTFHFENSVSLKVYPHEYLFPFED--LWCIGWQNSGM 397
P V+FHFE S+++ P YL P + +C + + +
Sbjct: 281 ALTSVTIPTVSFHFEGGASVQLPPSNYLVPVDTSGTFCFAFSKTSL 326
>gi|225216949|gb|ACN85242.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
Length = 519
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 104/328 (31%), Positives = 135/328 (41%), Gaps = 54/328 (16%)
Query: 67 SSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQC-KECPRRSSLGIELTLYDI 125
S R G G Y +G+GTP Y V DTGSD WV C C C + L+D
Sbjct: 171 SGRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQRE-----KLFDP 225
Query: 126 KDSSTGKFVTCDQEFC-----HGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVV- 179
SST V+C C HG GG C Y YGDGS + G+F D +
Sbjct: 226 ARSSTYANVSCAAPACSDLNIHGCSGG---------HCLYGVQYGDGSYSIGFFAMDTLT 276
Query: 180 --QYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASS 237
YD V G FGCG R G E A G++G G+ +S+ Q
Sbjct: 277 LSSYDAVKG----------FRFGCGERNEGLF---GEAA--GLLGLGRGKTSLPVQTYDK 321
Query: 238 -GGVRKMFAHCLDGINGGG---IFAIGHVVQPEVNKTP--LVPNQP-HYSINMTAVQVGL 290
GGV FAHCL + G F G + T L N P Y + MT ++VG
Sbjct: 322 YGGV---FAHCLPARSTGTGYLDFGAGSLAAARARLTTPMLTENGPTFYYVGMTGIRVGG 378
Query: 291 DFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPL---VSKIISQQPDLKVHTVHDEY 347
L++P VF GTI+DSGT + LP Y L + ++ + K V
Sbjct: 379 QLLSIPQSVFA---TAGTIVDSGTVITRLPPAAYSSLRYAFAAAMAARGYKKAPAVSLLD 435
Query: 348 TCFQYSESVDEGFPNVTFHFENSVSLKV 375
TC+ ++ P V+ F+ L V
Sbjct: 436 TCYDFTGMSQVAIPTVSLLFQGGARLDV 463
>gi|15217764|ref|NP_176663.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|5042418|gb|AAD38257.1|AC006193_13 Hypothetical Protein [Arabidopsis thaliana]
gi|332196174|gb|AEE34295.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 431
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 98/306 (32%), Positives = 141/306 (46%), Gaps = 31/306 (10%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKF 133
G Y I IGTPP DTGSD++W C C++C +++S L+D K+SST +
Sbjct: 84 GEYLMNISIGTPPVPILAIADTGSDLIWTQCNPCEDCYQQTS-----PLFDPKESSTYRK 138
Query: 134 VTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTST 193
V+C C + + T +C Y YGD S T G D V SG +
Sbjct: 139 VSCSSSQCRALEDASCS--TDENTCSYTITYGDNSYTKGDVAVDTVTMGS-SGRRPVSLR 195
Query: 194 NGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL----- 248
N +I GCG +G D A GIIG G ++S++SQL S + F++CL
Sbjct: 196 N--MIIGCGHENTGTFD----PAGSGIIGLGGGSTSLVSQLRKS--INGKFSYCLVPFTS 247
Query: 249 -DGINGGGIFAIGHVVQPE-VNKTPLVPNQP--HYSINMTAVQVGLDFLNLPTDVFGVGD 304
G+ F +V + V T +V P +Y +N+ A+ VG + + +FG G+
Sbjct: 248 ETGLTSKINFGTNGIVSGDGVVSTSMVKKDPATYYFLNLEAISVGSKKIQFTSTIFGTGE 307
Query: 305 NKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDE--YTCFQYSESVDEGFPN 362
+IDSGTTL LP Y L S + S +K V D Y +S P+
Sbjct: 308 GN-IVIDSGTTLTLLPSNFYYELESVVAST---IKAERVQDPDGILSLCYRDSSSFKVPD 363
Query: 363 VTFHFE 368
+T HF+
Sbjct: 364 ITVHFK 369
>gi|224085379|ref|XP_002307559.1| predicted protein [Populus trichocarpa]
gi|222857008|gb|EEE94555.1| predicted protein [Populus trichocarpa]
Length = 455
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 89/284 (31%), Positives = 133/284 (46%), Gaps = 36/284 (12%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y+ + IGTPPK Y + +DTGSD+ W+ C+ C +C ++ YD K+SS+
Sbjct: 86 GSGEYFMDVFIGTPPKHYSLILDTGSDLNWIQCVPCHDCFEQNG-----PYYDPKESSSF 140
Query: 132 KFVTCDQEFCHGVYG-GPLTDCTA-NTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQ 189
+ + C CH V P C A N +CPY YGD S+TTG F + + S +
Sbjct: 141 RNIGCHDPRCHLVSSPDPPLPCKAENQTCPYFYWYGDSSNTTGDFATETFTVNLTSPTGK 200
Query: 190 TTSTN-GSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL 248
+ +++FGCG G + G+ S SQL S G F++CL
Sbjct: 201 SEFKRVENVMFGCGHWNRGLFHGASGLLGL-----GRGPLSFSSQLQSLYG--HSFSYCL 253
Query: 249 ------DGINGGGIFAIGH--VVQPEVNKTPLV-----PNQPHYSINMTAVQVGLDFLNL 295
++ IF + PE+N T LV P Y + + ++ VG + LN+
Sbjct: 254 VDRNSDTNVSSKLIFGEDKDLLNHPELNFTTLVGGKENPVDTFYYVQIKSIMVGGEVLNI 313
Query: 296 PTDVF-----GVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQ 334
P + GVG GTI+DSGTTL+Y E Y+ + + +
Sbjct: 314 PESTWNMTSDGVG---GTIVDSGTTLSYFTEPAYQIIKDAFVKK 354
>gi|224083757|ref|XP_002307112.1| predicted protein [Populus trichocarpa]
gi|222856561|gb|EEE94108.1| predicted protein [Populus trichocarpa]
Length = 492
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 92/316 (29%), Positives = 144/316 (45%), Gaps = 36/316 (11%)
Query: 75 LYYAKIGIGTPPKDYYVQVDTGSDIMWV--NCIQCK--ECPRRSSLGIELTLYDIKDSST 130
L+Y I IGTP + V +D+GSD+ WV +C+QC SSL +L+ Y SST
Sbjct: 97 LHYTWIDIGTPHVSFMVALDSGSDLFWVPCDCVQCAPLSASHYSSLDRDLSEYSPSQSST 156
Query: 131 GKFVTCDQEFCHGVYGGPLTDC-TANTSCPY-LEIYGDGSSTTGYFVQDVVQYDKVSGDL 188
K ++C C GP +C SCPY + Y + +S++G V+D++ D
Sbjct: 157 SKQLSCSHRLCD---MGP--NCKNPKQSCPYSINYYTESTSSSGLLVEDIIHLASGGDDT 211
Query: 189 QTTSTNGSLIFGCGARQSGN-LDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHC 247
TS +I GCG +QSG LD A DG++G G S+ S LA +G ++ F+ C
Sbjct: 212 LNTSVKAPVIIGCGMKQSGGYLDGV---APDGLLGLGLQEISVPSFLAKAGLIQNSFSMC 268
Query: 248 LDGINGGGIF--AIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDN 305
+ + G IF G Q L N Y + + VG L +
Sbjct: 269 FNEDDSGRIFFGDQGPATQQSAPFLKLNGNYTTYIVGVEVCCVGTSCLK--------QSS 320
Query: 306 KGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYT------CFQYSESVDEG 359
++DSGT+ +LP+ V+E +I+++ D +V+ + C++ S
Sbjct: 321 FSALVDSGTSFTFLPDDVFE-----MIAEEFDTQVNASRSSFEGYSWKYCYKTSSQDLPK 375
Query: 360 FPNVTFHFENSVSLKV 375
P++ F + S V
Sbjct: 376 IPSLRLIFPQNNSFMV 391
>gi|356575389|ref|XP_003555824.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 473
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 92/340 (27%), Positives = 148/340 (43%), Gaps = 35/340 (10%)
Query: 67 SSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIK 126
S G G Y+ +IG+G+PP++ YV +D+GSDI+WV C C +C +S +++
Sbjct: 125 SGMEQGSGEYFVRIGVGSPPRNQYVVIDSGSDIIWVQCEPCTQCYHQSD-----PVFNPA 179
Query: 127 DSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSG 186
DSS+ V+C C V + C Y YGDGS T G + + + +
Sbjct: 180 DSSSYAGVSCASTVCSHVDNAGCHE----GRCRYEVSYGDGSYTKGTLALETLTFGR--- 232
Query: 187 DLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAH 246
+ ++ GCG G G++G G S + QL G F++
Sbjct: 233 -----TLIRNVAIGCGHHNQGMF-----VGAAGLLGLGSGPMSFVGQLGGQAG--GTFSY 280
Query: 247 CL--DGINGGGIFAIGHVVQP-EVNKTPLVPN---QPHYSINMTAVQVGLDFLNLPTDVF 300
CL GI G+ G P PL+ N Q Y + ++ + VG + + DVF
Sbjct: 281 CLVSRGIQSSGLLQFGREAVPVGAAWVPLIHNPRAQSFYYVGLSGLGVGGLRVPISEDVF 340
Query: 301 GVGD--NKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDL-KVHTVHDEYTCFQYSESVD 357
+ + + G ++D+GT + LP YE I+Q +L + V TC+ V
Sbjct: 341 KLSELGDGGVVMDTGTAVTRLPTAAYEAFRDAFIAQTTNLPRASGVSIFDTCYDLFGFVS 400
Query: 358 EGFPNVTFHFENSVSLKVYPHEYLFPFEDL--WCIGWQNS 395
P V+F+F L + +L P +D+ +C + S
Sbjct: 401 VRVPTVSFYFSGGPILTLPARNFLIPVDDVGSFCFAFAPS 440
>gi|125561848|gb|EAZ07296.1| hypothetical protein OsI_29544 [Oryza sativa Indica Group]
Length = 448
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 93/334 (27%), Positives = 142/334 (42%), Gaps = 50/334 (14%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKF 133
G Y + IGTPP Y VDTGSD++W C C C + + + S+T +
Sbjct: 90 GEYLMDLAIGTPPLRYTAMVDTGSDLIWTQCAPCVLCADQPT-----PYFRPARSATYRL 144
Query: 134 VTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTST 193
V C C + P C + C Y YGD +ST G + + + S
Sbjct: 145 VPCRSPLCAAL---PYPACFQRSVCVYQYYYGDEASTAGVLASETFTFGAANSSKVMVS- 200
Query: 194 NGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDG--- 250
+ FGCG SG L +++ G++G G+ S++SQL S F++CL
Sbjct: 201 --DVAFGCGNINSGQLANSS-----GMVGLGRGPLSLVSQLGPS-----RFSYCLTSFLS 248
Query: 251 -------------INGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQ-VGLDFLNLP 296
+NG + G VQ TPLV N S+ +++ + L LP
Sbjct: 249 PEPSRLNFGVFATLNGTNASSSGSPVQ----STPLVVNAALPSLYFMSLKGISLGQKRLP 304
Query: 297 TD--VFGVGDN--KGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKV--HTVHDEYTCF 350
D VF + D+ G IDSGT+L +L + Y+ + +++S L T TCF
Sbjct: 305 IDPLVFAINDDGTGGVFIDSGTSLTWLQQDAYDAVRHELVSVLRPLPPTNDTEIGLETCF 364
Query: 351 QY--SESVDEGFPNVTFHFENSVSLKVYPHEYLF 382
+ SV P++ HF+ ++ V P Y+
Sbjct: 365 PWPPPPSVAVTVPDMELHFDGGANMTVPPENYML 398
>gi|115479489|ref|NP_001063338.1| Os09g0452800 [Oryza sativa Japonica Group]
gi|51535938|dbj|BAD38020.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113631571|dbj|BAF25252.1| Os09g0452800 [Oryza sativa Japonica Group]
gi|125605918|gb|EAZ44954.1| hypothetical protein OsJ_29597 [Oryza sativa Japonica Group]
gi|215740967|dbj|BAG97462.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 453
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 90/330 (27%), Positives = 143/330 (43%), Gaps = 46/330 (13%)
Query: 65 GGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYD 124
G + R G Y + +GTPP+ +DTGSD++W C C C R+ L+
Sbjct: 87 GMAVRASGDLEYVLDLAVGTPPQPITALLDTGSDLIWTQCDTCTACLRQPD-----PLFS 141
Query: 125 IKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKV 184
+ SS+ + + C + C + C +C Y YGDG++T GY+ + +
Sbjct: 142 PRMSSSYEPMRCAGQLCGDILH---HSCVRPDTCTYRYSYGDGTTTLGYYATERFTFASS 198
Query: 185 SGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMF 244
SG+ Q+ L FGCG G+L++ + GI+GFG+ S++SQL+ +R+ F
Sbjct: 199 SGETQSV----PLGFGCGTMNVGSLNNAS-----GIVGFGRDPLSLVSQLS----IRR-F 244
Query: 245 AHCL--------DGINGGGIFAIGHV--VQPEVNKTPLV---PNQPHYSINMTAVQVGLD 291
++CL + G + +G V TP++ N Y + T V VG
Sbjct: 245 SYCLTPYASSRKSTLQFGSLADVGLYDDATGPVQTTPILQSAQNPTFYYVAFTGVTVGAR 304
Query: 292 FLNLPTDVFGVGDN--KGTIIDSGTTLAYLPEMVYEPLVSKIISQ-QPDLKVHTVHDEYT 348
L +P F + + G IIDSGT L P V +V SQ + + D+
Sbjct: 305 RLRIPASAFALRPDGSGGVIIDSGTALTLFPVAVLAEVVRAFRSQLRLPFANGSSPDDGV 364
Query: 349 CFQYSE--------SVDEGFPNVTFHFENS 370
CF + P + FHF+ +
Sbjct: 365 CFAAPAVAAGGGRMARQVAVPRMVFHFQGA 394
>gi|255544139|ref|XP_002513132.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223548143|gb|EEF49635.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 481
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 92/335 (27%), Positives = 146/335 (43%), Gaps = 35/335 (10%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y+ +IG+G+PP++ YV +D+GSDI+WV C C +C ++ ++D DS++
Sbjct: 138 GSGEYFIRIGVGSPPREQYVVIDSGSDIVWVQCQPCTQCYHQTD-----PVFDPADSASF 192
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
V C C + C A C Y +YGDGS T G + + + + T
Sbjct: 193 MGVPCSSSVCERIEN---AGCHAG-GCRYEVMYGDGSYTKGTLALETLTFGR------TV 242
Query: 192 STNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL--D 249
N + GCG R G G + S++ QL G F++CL
Sbjct: 243 VRN--VAIGCGHRNRGMFVGAAGLLGL-----GGGSMSLVGQLGGQTG--GAFSYCLVSR 293
Query: 250 GINGGGIFAIGHVVQP-EVNKTPLV--PNQP-HYSINMTAVQVGLDFLNLPTDVFGVGD- 304
G + G G P PL+ P P Y I ++ V VG + + DVF + +
Sbjct: 294 GTDSAGSLEFGRGAMPVGAAWIPLIRNPRAPSFYYIRLSGVGVGGMKVPISEDVFQLNEM 353
Query: 305 -NKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDL-KVHTVHDEYTCFQYSESVDEGFPN 362
N G ++D+GT + +P + Y I Q +L + V TC+ + V P
Sbjct: 354 GNGGVVMDTGTAVTRIPTVAYVAFRDAFIGQTGNLPRASGVSIFDTCYNLNGFVSVRVPT 413
Query: 363 VTFHFENSVSLKVYPHEYLFPFEDL--WCIGWQNS 395
V+F+F L + +L P +D+ +C + S
Sbjct: 414 VSFYFAGGPILTLPARNFLIPVDDVGTFCFAFAAS 448
>gi|297802338|ref|XP_002869053.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297314889|gb|EFH45312.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 522
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 90/295 (30%), Positives = 140/295 (47%), Gaps = 31/295 (10%)
Query: 67 SSRPDGVG-LYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRS----SLGIELT 121
+SR +G L+Y + +GTP + V +DTGSD+ WV C C +C + EL+
Sbjct: 95 TSRISSLGFLHYTTVKLGTPGMRFMVALDTGSDLFWVPC-DCGKCAPTEGATYASEFELS 153
Query: 122 LYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTAN-TSCPYLEIYGDG-SSTTGYFVQDVV 179
+Y+ K S+T K VTC+ C C ++CPY+ Y +ST+G ++DV+
Sbjct: 154 IYNPKISTTNKKVTCNNSLC-----AQRNQCLGTFSTCPYMVSYVSAQTSTSGILMEDVM 208
Query: 180 QYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGG 239
+ D + FGCG QSG+ + A +G+ G G S+ S LA G
Sbjct: 209 HL--TTEDKNPERVEAYVTFGCGQVQSGSF--LDIAAPNGLFGLGMEKISVPSVLAREGL 264
Query: 240 VRKMFAHCLDGINGGGIFAIGHVVQPEVNKTP--LVPNQPHYSINMTAVQVGLDFLNLPT 297
V F+ C G +G G + G + +TP L P+ P+Y+I +T V+VG ++
Sbjct: 265 VADSFSMCF-GHDGVGRISFGDKGSSDQEETPFNLNPSHPNYNITVTRVRVGTTLID--- 320
Query: 298 DVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQY 352
D + D+GT+ YL + +Y + SQ D K H+ D F+Y
Sbjct: 321 ------DEFTALFDTGTSFTYLVDPMYTTVSESFHSQAQD-KRHS-PDSRIPFEY 367
>gi|413952718|gb|AFW85367.1| hypothetical protein ZEAMMB73_231535 [Zea mays]
Length = 443
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 86/277 (31%), Positives = 125/277 (45%), Gaps = 41/277 (14%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y ++ +GTP + + +DTGSD++W C C++C +L + D SST +
Sbjct: 84 YLVRLAVGTPRRPVALTLDTGSDLVWTQCAPCRDC-----FDQDLPVLDPAASSTYAALP 138
Query: 136 CDQEFCHGVYGGPLTDCTANT-----SCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQT 190
C C + P T C T SC Y YGD S T G D + G ++
Sbjct: 139 CGAARCRAL---PFTSCGVRTLGNHRSCIYAYHYGDKSLTVGEIATDRFTFGDSGGSGES 195
Query: 191 TSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDG 250
T L FGCG G S NE GI GFG+ S+ SQL + F++C
Sbjct: 196 LHTR-RLTFGCGHLNKGVFQS-NE---TGIAGFGRGRWSLPSQLNVTS-----FSYCFTS 245
Query: 251 IN---------GGGIFAI-GHVVQPEVNKTPLV--PNQPH-YSINMTAVQVGLDFLNLPT 297
+ GG A+ H EV TP++ P+QP Y +++ + VG L +P
Sbjct: 246 MFESKSSLVTLGGSPAALYSHAHSGEVRTTPILKNPSQPSLYFLSLKGISVGKTRLPVPE 305
Query: 298 DVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQ 334
F + TIIDSG ++ LPE VYE + ++ +Q
Sbjct: 306 TKF-----RSTIIDSGASITTLPEEVYEAVKAEFAAQ 337
>gi|299471769|emb|CBN76990.1| aspartic protease PM5 [Ectocarpus siliculosus]
Length = 947
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 89/324 (27%), Positives = 146/324 (45%), Gaps = 35/324 (10%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G ++A + GTPP+ V +DTGS C +C+ C + +D S++
Sbjct: 122 GWGTHFAYVYAGTPPQRVSVIIDTGSHFTAFPCSECENCGSHTD-----PHWDQSKSTSS 176
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVV--------QYDK 183
VTC E CHG + C + C + + Y +GSS Y V+DV+ Q +K
Sbjct: 177 HIVTC--EDCHGSF-----RCQKDKRCGFSQRYSEGSSWRAYQVEDVLWVGELTLQQSEK 229
Query: 184 VSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVR-K 242
++ D S +FGC Q+G + + DGI+G + +++ QLA +G ++ +
Sbjct: 230 INHDESAYSVE--FMFGCIESQTGLFKT---QLADGIMGMSADSHTLVWQLAKAGKIKER 284
Query: 243 MFAHCLDGINGGGIFAIGH---VVQP--EVNKTPLVPNQPHYSINMTAVQVGLDFLNLPT 297
F+ C G NGG + G+ + +P E+ TP +++ +T + V +
Sbjct: 285 TFSLCF-GKNGGTMVIGGYDTRLNKPGHEMMYTPSTKTNGWFTVQVTDITVNRVSIAQDP 343
Query: 298 DVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVD 357
+F G KG I+DSGTT YLP V + S + D + C + +
Sbjct: 344 AIFQRG--KGIIVDSGTTDTYLPRSVAKGF-SAAWERATGSPYANCKDNHFCMILTSAEL 400
Query: 358 EGFPNVTFHFENSVSLKVYPHEYL 381
E P VT H + + + V P Y+
Sbjct: 401 EALPTVTIHMDGGLEVNVRPSGYM 424
>gi|449499012|ref|XP_004160696.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 435
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 109/413 (26%), Positives = 182/413 (44%), Gaps = 40/413 (9%)
Query: 9 LCIVLIATAAVGGVSSNHGVFSVK--YRYAGRERSLSLLKEHDAR----RQQRILAGVDL 62
+ I LI+TA V + F+V+ +R + + + L+ H R ++ I L
Sbjct: 10 VIIFLISTAVVSAATGPDYGFTVELIHRDSPKSPMYNPLENHYHRVADTLRRSISHNTGL 69
Query: 63 PLGGSSRP--DGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIEL 120
P + G Y K+ +GTPP DTGSDI+W C+ C C ++ +L
Sbjct: 70 VTNTVEAPIYNNRGEYLMKLSVGTPPFPIIAVADTGSDIIWTQCVPCTNCYQQ-----DL 124
Query: 121 TLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQ 180
+++ S+T + V+C C + G C+ C Y YGD S + G F D +
Sbjct: 125 PMFNPSKSTTYRKVSCSSPVCS--FTGEDNSCSFKPDCTYSISYGDNSHSQGDFAVDTLT 182
Query: 181 YDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGV 240
SG + GCG +G+ D+ + GI+G G +S+I Q+ S+ G
Sbjct: 183 MGSTSGRVVAFPRTA---IGCGHDNAGSFDAN----VSGIVGLGLGPASLIKQMGSAVGG 235
Query: 241 RKMFAHCLDGI--NGGGIFAIGHVVQPEVN-----KTPLVPN---QPHYSINMTAVQVGL 290
+ F++CL I + GG + V+ TP+ + + YS+ + AV VG
Sbjct: 236 K--FSYCLTPIGNDDGGSNKLNFGSNANVSGSGAVSTPIYISDKFKSFYSLKLKAVSVGR 293
Query: 291 DFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCF 350
+ T +G IIDSGTTL LP +Y +K IS +L+ +++ +
Sbjct: 294 NNTFYSTANSILGGKANIIIDSGTTLTLLPVDLYHNF-AKAISNSINLQRTDDPNQFLEY 352
Query: 351 QYSESVDE-GFPNVTFHFENSVSLKVYPHEYLFPFED-LWCIGWQNSGMQSRD 401
+ + D+ P + HFE + +L++ L D + C+ + +G Q D
Sbjct: 353 CFETTTDDYKVPFIAMHFEGA-NLRLQRENVLIRVSDNVICLAF--AGAQDND 402
>gi|115479815|ref|NP_001063501.1| Os09g0482200 [Oryza sativa Japonica Group]
gi|50725878|dbj|BAD33407.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50725881|dbj|BAD33410.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113631734|dbj|BAF25415.1| Os09g0482200 [Oryza sativa Japonica Group]
gi|125606112|gb|EAZ45148.1| hypothetical protein OsJ_29786 [Oryza sativa Japonica Group]
Length = 485
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 108/391 (27%), Positives = 165/391 (42%), Gaps = 49/391 (12%)
Query: 1 MGLCLRNCLCIVLIATAAVGGVSSNHG---------VFSVKYRYAGRERSLSLLKEHDAR 51
+G+ R+ C + A GG + H V S+ + AG + S++ A
Sbjct: 71 LGVVHRHGPCSPVQARPRGGGGAVTHAEILERDQARVDSIHRKVAGAGGAPSVVDPARAS 130
Query: 52 RQQRILAGVDLPLG-GSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKEC 110
Q GV LP G S G G Y +G+GTP K Y V DTGSD+ WV C C +C
Sbjct: 131 EQ-----GVSLPAQRGISL--GTGNYVVSVGLGTPAKQYAVIFDTGSDLSWVQCKPCADC 183
Query: 111 PRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSST 170
+ + L+D SST V C C + + C++++ C Y YGD S T
Sbjct: 184 YEQ-----QDPLFDPSLSSTYAAVACGAPECQELDA---SGCSSDSRCRYEVQYGDQSQT 235
Query: 171 TGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSM 230
G V+D + L + T +FGCG + +G +DG+ G G+ S+
Sbjct: 236 DGNLVRDTLT-------LSASDTLPGFVFGCGDQNAGLFGQ-----VDGLFGLGREKVSL 283
Query: 231 ISQLASSGGVRKMFAHCLDGINGG-GIFAIGHVVQPEVNKTPLV--PNQPHYSINMTAVQ 287
SQ A S G F +CL + G G ++G T L Y I++ ++
Sbjct: 284 PSQGAPSYG--PGFTYCLPSSSSGRGYLSLGGAPPANAQFTALADGATPSFYYIDLVGIK 341
Query: 288 VGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPL---VSKIISQQPDLKVHTVH 344
VG + +P GT+IDSGT + LP Y PL ++ ++Q ++
Sbjct: 342 VGGRAIRIPATA--FAAAGGTVIDSGTVITRLPPRAYAPLRAAFARSMAQYKKAPALSIL 399
Query: 345 DEYTCFQYSESVDEGFPNVTFHFENSVSLKV 375
D TC+ ++ P V F ++ +
Sbjct: 400 D--TCYDFTGHRTAQIPTVELAFAGGATVSL 428
>gi|242066140|ref|XP_002454359.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
gi|241934190|gb|EES07335.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
Length = 460
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 89/300 (29%), Positives = 137/300 (45%), Gaps = 35/300 (11%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y + +G+P K V +D+GSD+ WV C C +C + L+D SST +
Sbjct: 131 YLITVRLGSPAKTQTVLIDSGSDVSWVQCKPCLQCHSQVD-----PLFDPSLSSTYSPFS 185
Query: 136 CDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNG 195
C C + G C++++ C Y+ Y DGSSTTG + D + ++T
Sbjct: 186 CSSAACAQL-GQDGNGCSSSSQCQYIVRYADGSSTTGTYSSDTLALG--------SNTIS 236
Query: 196 SLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI-NGG 254
+ FGC +SG D T DG++G G S+ SQ A + G F++CL +
Sbjct: 237 NFQFGCSHVESGFNDLT-----DGLMGLGGGAPSLASQTAGTFGT--AFSYCLPPTPSSS 289
Query: 255 GIFAIGHVVQPEVNKTPLVPNQP---HYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIID 311
G +G V KTP++ + P Y + + A++VG L++PT VF + G ++D
Sbjct: 290 GFLTLGAGTSGFV-KTPMLRSSPVPTFYGVRLEAIRVGGTQLSIPTSVF----SAGMVMD 344
Query: 312 SGTTLAYLPEMVYEPLVSKI---ISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFE 368
SGT + LP Y L S + Q ++ D TCF +S P+V F
Sbjct: 345 SGTIITRLPRTAYSALSSAFKAGMKQYRPAPPRSIMD--TCFDFSGQSSVRLPSVALVFS 402
>gi|242058537|ref|XP_002458414.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
gi|241930389|gb|EES03534.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
Length = 448
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 98/381 (25%), Positives = 160/381 (41%), Gaps = 58/381 (15%)
Query: 43 SLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWV 102
S+ + D R + +++GV P G Y+A I +G PP V +DTGSD++W+
Sbjct: 64 SIAADDDDRLRSPVMSGV---------PFDSGEYFAVINVGDPPTRALVVIDTGSDLIWL 114
Query: 103 NCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANT-SCPYL 161
C+ C+ C R+ + LYD + SST + + C C V P C A T C Y+
Sbjct: 115 QCVPCRHCYRQVT-----PLYDPRSSSTHRRIPCASPRCRDVLRYP--GCDARTGGCVYM 167
Query: 162 EIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGII 221
+YGDGS+++G D + + T N +L GCG G L+S G++
Sbjct: 168 VVYGDGSASSGDLATDRLVFPD-----DTHVHNVTL--GCGHDNVGLLESAA-----GLL 215
Query: 222 GFGKSNSSMISQLASSGGVRKMFAHCL-----DGINGGGIFAIGHVVQPEVNK-TPLV-- 273
G G+ S +QLA + G +F++CL NG G +P TPL
Sbjct: 216 GVGRGQLSFPTQLAPAYG--HVFSYCLGDRLSRAQNGSSYLVFGRTPEPPSTAFTPLRTN 273
Query: 274 PNQPH-YSINMTAVQVGLD----FLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLV 328
P +P Y ++M VG + F N + G ++DSGT ++ Y +
Sbjct: 274 PRRPSLYYVDMVGFSVGGERVTGFSNASLALNPATGRGGIVVDSGTAISRFARDAYAAVR 333
Query: 329 SKIISQQPDL-KVHTVHDEYTCFQY--------SESVDEGFPNVTFHFENSVSLKVYPHE 379
S + + +++ F + + P++ HF + +
Sbjct: 334 DAFDSHAAAAGTMRKLATKFSVFDACYDLRGNGAPAAAVRVPSIVLHFAGGADMALPQAN 393
Query: 380 YLFPFE-----DLWCIGWQNS 395
YL P + +C+G Q +
Sbjct: 394 YLIPVQGGDRRTYFCLGLQAA 414
>gi|226509616|ref|NP_001152116.1| aspartic proteinase Asp1 precursor [Zea mays]
gi|195652765|gb|ACG45850.1| aspartic proteinase Asp1 precursor [Zea mays]
Length = 432
Score = 108 bits (270), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 103/371 (27%), Positives = 161/371 (43%), Gaps = 49/371 (13%)
Query: 62 LPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELT 121
PL G P G LYY + IG PPK Y++ VD+GSD+ W+ C + P RS +
Sbjct: 52 FPLYGDVYPHG--LYYVAMNIGNPPKPYFLDVDSGSDLTWLQC----DAPCRSCNEVPHP 105
Query: 122 LYDIKDSSTGKFVTCDQEFCHGVY----GGPLTDCTANTSCPYLEIYGDGSSTTGYFVQD 177
LY S K V C C ++ GG + + C Y+ Y D S+TG V D
Sbjct: 106 LYRPTKS---KLVPCVHRLCASLHNALTGGKHRCESPHEQCDYVIKYADQGSSTGVLVND 162
Query: 178 VVQYDKVSGDLQTTSTNGSLIFGCGARQ---SGNLDSTNEEALDGIIGFGKSNSSMISQL 234
+G + S+ FGCG Q SG+L S DG++G G + S++SQL
Sbjct: 163 SFALRLTNGSV----ARPSVAFGCGYDQQVRSGDLSSPT----DGVLGLGTGSVSLLSQL 214
Query: 235 ASSGGVRKMFAHCLDGINGGGIFAIGHVVQP--EVNKTPLVPN--QPHYSINMTAVQVGL 290
G + + HCL + GGG G + P TP+ + + +YS ++ G
Sbjct: 215 KQRGVTKNVVGHCLS-LRGGGFLFFGDDLVPYQRATWTPMARSAFRNYYSPGSASLYFGD 273
Query: 291 DFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLV-------SKIISQQPDLKVHTV 343
L GV K + DSG++ Y Y+ LV S+ + ++PD +
Sbjct: 274 RSL-------GVRLAK-VVFDSGSSFTYFAAKPYQALVTALKDGLSRTLEEEPDTSLPLC 325
Query: 344 HDEYTCFQYSESVDEGFPNVTFHFEN--SVSLKVYPHEYLFPFED-LWCIGWQNSGMQSR 400
F+ V + F ++ +F + +++ P YL E+ C+G N
Sbjct: 326 WKGQEPFKSVLDVRKEFKSLVLNFASGKKTLMEIPPENYLIVTENGNACLGILNG--SEI 383
Query: 401 DRKNMTLLGDF 411
K+++++GD
Sbjct: 384 GLKDLSIIGDI 394
>gi|42567433|ref|NP_195313.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|190576481|gb|ACE79041.1| At4g35880 [Arabidopsis thaliana]
gi|222423134|dbj|BAH19546.1| AT4G35880 [Arabidopsis thaliana]
gi|332661184|gb|AEE86584.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 524
Score = 108 bits (270), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 91/296 (30%), Positives = 141/296 (47%), Gaps = 31/296 (10%)
Query: 66 GSSRPDGVG-LYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKEC-PRRSSL---GIEL 120
+SR +G L+Y + +GTP + V +DTGSD+ WV C C +C P + EL
Sbjct: 96 STSRISSLGFLHYTTVKLGTPGMRFMVALDTGSDLFWVPC-DCGKCAPTEGATYASEFEL 154
Query: 121 TLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTAN-TSCPYLEIYGDG-SSTTGYFVQDV 178
++Y+ K S+T K VTC+ C C ++CPY+ Y +ST+G ++DV
Sbjct: 155 SIYNPKVSTTNKKVTCNNSLC-----AQRNQCLGTFSTCPYMVSYVSAQTSTSGILMEDV 209
Query: 179 VQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSG 238
+ + D + FGCG QSG+ + A +G+ G G S+ S LA G
Sbjct: 210 MHL--TTEDKNPERVEAYVTFGCGQVQSGSF--LDIAAPNGLFGLGMEKISVPSVLAREG 265
Query: 239 GVRKMFAHCLDGINGGGIFAIGHVVQPEVNKTP--LVPNQPHYSINMTAVQVGLDFLNLP 296
V F+ C G +G G + G + +TP L P+ P+Y+I +T V+VG ++
Sbjct: 266 LVADSFSMCF-GHDGVGRISFGDKGSSDQEETPFNLNPSHPNYNITVTRVRVGTTLID-- 322
Query: 297 TDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQY 352
D + D+GT+ YL + +Y + SQ D K H+ D F+Y
Sbjct: 323 -------DEFTALFDTGTSFTYLVDPMYTTVSESFHSQAQD-KRHS-PDSRIPFEY 369
>gi|302802500|ref|XP_002983004.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
gi|300149157|gb|EFJ15813.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
Length = 332
Score = 108 bits (270), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 89/332 (26%), Positives = 141/332 (42%), Gaps = 52/332 (15%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKF 133
G+YY+ I +G+PPKD+ + +DTGSD+ WV C C P SS +D S+T K
Sbjct: 1 GVYYSTITLGSPPKDFSLVMDTGSDLTWVRCDPCS--PDCSST------FDRLASNTYKA 52
Query: 134 VTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTST 193
+TC ++ +G YGDGS T G D ++ + D
Sbjct: 53 LTCADDYSYG--------------------YGDGSFTQGDLSVDTLKMAGAASD--ELEE 90
Query: 194 NGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL----- 248
+FGCG+ G + GI+ + S SQ+ G + F++CL
Sbjct: 91 FPGFVFGCGSLLKGLISGEV-----GILALSPGSLSFPSQIGEKYGNK--FSYCLLRQTA 143
Query: 249 -DGINGGGIF---AIGHVVQP------EVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTD 298
+ + + A + +P E+ TP+ + +Y++ + + VG L+L
Sbjct: 144 QNSLKKSPMVFGEAAVELKEPGSGKLQELQYTPIGESSIYYTVRLDGISVGNQRLDLSPS 203
Query: 299 VFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDE 358
F G +K TI DSGTTL LP V + + + S + + CF+ S +
Sbjct: 204 AFLNGQDKPTIFDSGTTLTMLPPGVCDSIKQSLASMVSGAEFVAIKGLDACFRVPPSSGQ 263
Query: 359 GFPNVTFHFENSVSLKVYPHEYLFPFEDLWCI 390
G P++TFHF P Y+ L C+
Sbjct: 264 GLPDITFHFNGGADFVTRPSNYVIDLGSLQCL 295
>gi|242092900|ref|XP_002436940.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
gi|241915163|gb|EER88307.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
Length = 465
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 89/313 (28%), Positives = 132/313 (42%), Gaps = 37/313 (11%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCK--EC-PRRSSLGIELTLYDIKDSSTGK 132
Y +G GTP + +DTGSD+ WV C C EC P++ L+D SST
Sbjct: 125 YMVTLGFGTPSVPQVLLMDTGSDVSWVQCAPCNSTECYPQKDP------LFDPSKSSTYA 178
Query: 133 FVTCDQEFCHGVYGGPLTDCTA-NTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
+ C + C+ + CT+ T C Y YGDGSST G + + + +
Sbjct: 179 PIACGADACNKLGDHYRNGCTSGGTQCGYRVEYGDGSSTRGVYSNETITF-------APG 231
Query: 192 STNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI 251
T FGCG Q G D DG++G G + S++ Q AS G F++CL +
Sbjct: 232 ITVKDFHFGCGHDQRGPSDK-----FDGLLGLGGAPESLVVQTASVYG--GAFSYCLPAL 284
Query: 252 NG-GGIFAIGHVVQPEVNKTPLV--------PNQPHYSINMTAVQVGLDFLNLPTDVFGV 302
N G A+G N + V + Y +NMT + VG L++P F
Sbjct: 285 NSEAGFLALGVRPSAATNTSAFVFTPMWHLPMDATSYMVNMTGISVGGKPLDIPRSAF-- 342
Query: 303 GDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPN 362
G +IDSGT + LPE Y L + + + D TC+ ++ + P
Sbjct: 343 --RGGMLIDSGTIVTELPETAYNALNAALRKAFAAYPMVASEDFDTCYNFTGYSNVTVPR 400
Query: 363 VTFHFENSVSLKV 375
V F ++ +
Sbjct: 401 VALTFSGGATIDL 413
>gi|356527089|ref|XP_003532146.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 488
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 97/352 (27%), Positives = 155/352 (44%), Gaps = 42/352 (11%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y+ +G+GTP +D + DTGSD+ W C C RS + ++D S++
Sbjct: 141 GSGNYFVVVGLGTPKRDLSLIFDTGSDLTWTQCEPCA----RSCYKQQDAIFDPSKSTSY 196
Query: 132 KFVTCDQEFCH--GVYGGPLTDCTANT-SCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDL 188
+TC C G C+A+T +C Y YGD S + GYF ++ L
Sbjct: 197 SNITCTSTLCTQLSTATGNEPGCSASTKACIYGIQYGDSSFSVGYFSRE---------RL 247
Query: 189 QTTSTN--GSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAH 246
T+T+ + +FGCG G + G+IG G+ S + Q A+ RK+F++
Sbjct: 248 SVTATDIVDNFLFGCGQNNQGLFGGSA-----GLIGLGRHPISFVQQTAAV--YRKIFSY 300
Query: 247 CLDGINGG-GIFAIGHVVQPEVNKTP---LVPNQPHYSINMTAVQVGLDFLNLPTDVFGV 302
CL + G + G V TP + Y +++T + VG L + + F
Sbjct: 301 CLPATSSSTGRLSFGTTTTSYVKYTPFSTISRGSSFYGLDITGISVGGAKLPVSSSTFST 360
Query: 303 GDNKGTIIDSGTTLAYLPEMVYEPLVSKI---ISQQPDLKVHTVHDEYTCFQYSESVDEG 359
G G IIDSGT + LP Y L S +S+ P ++ D TC+ S
Sbjct: 361 G---GAIIDSGTVITRLPPTAYTALRSAFRQGMSKYPSAGELSILD--TCYDLSGYEVFS 415
Query: 360 FPNVTFHFENSVSLKVYPHEYLF-PFEDLWCIGWQNSGMQSRDRKNMTLLGD 410
P + F F V++++ P L+ C+ + +G D ++T+ G+
Sbjct: 416 IPKIDFSFAGGVTVQLPPQGILYVASAKQVCLAFAANG----DDSDVTIYGN 463
>gi|297823357|ref|XP_002879561.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297325400|gb|EFH55820.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 447
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 108/371 (29%), Positives = 158/371 (42%), Gaps = 56/371 (15%)
Query: 50 ARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKE 109
+RR IL+ DL G G ++ I IGTPP + DTGSD+ WV C C++
Sbjct: 62 SRRLNNILSQTDLQSG---LIGADGEFFMSITIGTPPMKVFAIADTGSDLTWVQCKPCQQ 118
Query: 110 CPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSS 169
C + + ++D K SST K CD CH + + C Y YGD S
Sbjct: 119 CYKENG-----PIFDKKKSSTYKSEPCDSRNCHALSSSERGCDESKNVCKYRYSYGDQSF 173
Query: 170 TTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSS 229
+ G + + D SG S G+ +FGCG G D T + + S
Sbjct: 174 SKGDVATETISIDSASG--SPVSFPGT-VFGCGYNNGGTFDETGSGIIGLG----GGHLS 226
Query: 230 MISQLASSGGVRKMFAHCLD----GINGGGIFAIGHVVQPE-------VNKTPLVPNQP- 277
+ISQL SS + K F++CL NG + +G P V TPLV +P
Sbjct: 227 LISQLGSS--ISKKFSYCLSHKSATTNGTSVINLGTNSIPSSLSKDSGVISTPLVDKEPR 284
Query: 278 -HYSINMTAVQVGLDFLNLPTDVFGVGD-------NKGTIIDSGTTLAYLPEMVY----- 324
+Y + + A+ VG + + D + IIDSGTTL L +
Sbjct: 285 TYYYLTLEAISVGKKKIPYTGSSYNPNDGGIFSETSGNIIIDSGTTLTLLDSGFFDKFGA 344
Query: 325 --EPLV--SKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYP-HE 379
E LV +K +S L H CF+ S S + G P +T HF + +++ P +
Sbjct: 345 AVEELVTGAKRVSDPQGLLSH-------CFK-SGSAEIGLPEITVHFTGA-DVRLSPINA 395
Query: 380 YLFPFEDLWCI 390
++ ED+ C+
Sbjct: 396 FVKVSEDMVCL 406
>gi|225438361|ref|XP_002273988.1| PREDICTED: aspartic proteinase Asp1 [Vitis vinifera]
gi|296082608|emb|CBI21613.3| unnamed protein product [Vitis vinifera]
Length = 426
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 90/300 (30%), Positives = 130/300 (43%), Gaps = 38/300 (12%)
Query: 56 ILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNC----IQCKECP 111
I + V PL G+ P +G YY + IG PPK Y++ DTGSD+ W+ C ++C + P
Sbjct: 49 IQSSVVFPLYGNVYP--LGYYYVSLSIGQPPKPYFLDPDTGSDLSWLQCDAPCVRCTKAP 106
Query: 112 RRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTT 171
LY V C C ++ P C C Y Y DG S+
Sbjct: 107 H--------PLY----RPNNNLVICKDPMCASLHP-PGYKCEHPEQCDYEVEYADGGSSL 153
Query: 172 GYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMI 231
G V+DV + +G L GCG Q + + LDG++G GK SS++
Sbjct: 154 GVLVKDVFPLNFTNG----LRLAPRLALGCGYDQ---IPGQSYHPLDGVLGLGKGKSSIV 206
Query: 232 SQLASSGGVRKMFAHCLDGINGGGIFAIGHVV--QPEVNKTPLVPNQ-PHYSINMTAVQV 288
SQL S G +R + HC+ GGG G + V TP++ +Q HYS + +
Sbjct: 207 SQLHSQGVIRNVVGHCVSS-RGGGFLFFGDDLYDSSRVVWTPMLRDQHTHYSSGYAELIL 265
Query: 289 GLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYT 348
G T VF N DSG++ YL + Y+ LV + + + V D+ T
Sbjct: 266 GGK-----TTVF---KNLLVTFDSGSSYTYLNSLAYQALVHLVRKELSEKPVREALDDQT 317
>gi|302776610|ref|XP_002971459.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
gi|300160591|gb|EFJ27208.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
Length = 357
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 90/346 (26%), Positives = 142/346 (41%), Gaps = 49/346 (14%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y+ ++GIG+P K Y+ +DTGSD+ W+ C CK C +++ ++D + SS+
Sbjct: 10 GSGEYFVRVGIGSPTKLQYLVMDTGSDVPWIQCSPCKSCYKQND-----AVFDPRASSSF 64
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTS----CPYLEIYGDGSSTTGYFVQDVVQYDKVSGD 187
+ ++C C L D A S C Y YGDGS T G D +
Sbjct: 65 RRLSCSTPQCK------LLDVKACASTDNRCLYQVSYGDGSFTVGDLASDSFSVSR---- 114
Query: 188 LQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHC 247
++FGCG G G S SQL+S + F++C
Sbjct: 115 ----GRTSPVVFGCGHDNEGLFVGAAGLLGLGAGKL-----SFPSQLSS-----RKFSYC 160
Query: 248 L----DGINGGGIFAIGHVVQP---EVNKTPLVPN---QPHYSINMTAVQVGLDFLNLPT 297
L +G+ G P T L+ N Y ++ + +G L++P+
Sbjct: 161 LVSRDNGVRASSALLFGDSALPTSASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPS 220
Query: 298 DVFGVGDNK---GTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEY-TCFQYS 353
F + + G IIDSGT++ LP Y + S L + TC+ +S
Sbjct: 221 TAFKLSSSTGRGGVIIDSGTSVTRLPTYAYTVMRDAFRSATQKLPRAADFSLFDTCYDFS 280
Query: 354 ESVDEGFPNVTFHFENSVSLKVYPHEYLFPFED--LWCIGWQNSGM 397
P V+FHFE S+++ P YL P + +C + + +
Sbjct: 281 ALTSVTIPTVSFHFEGGASVQLPPSNYLVPVDTSGTFCFAFSKTSL 326
>gi|255585473|ref|XP_002533429.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223526717|gb|EEF28949.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 448
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 88/342 (25%), Positives = 147/342 (42%), Gaps = 28/342 (8%)
Query: 75 LYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFV 134
L+ +G P +DTGS+I+WV C CK C +++ L D SST +
Sbjct: 98 LFLVNFSMGQPATPQLAIMDTGSNILWVRCAPCKRCTQQNG-----PLLDPSKSSTYASL 152
Query: 135 TCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTN 194
C CH P C C Y Y G S+ G + + + S D + +
Sbjct: 153 PCTNTMCH---YAPSAYCNRLNQCGYNLSYATGLSSAGVLATEQLIFH--SSD-EGVNAV 206
Query: 195 GSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGIN-- 252
S++FGC + ++G+ + G+ G GK +S ++++ S F++CL I
Sbjct: 207 PSVVFGC-SHENGDY---KDRRFTGVFGLGKGITSFVTRMGSK------FSYCLGNIADP 256
Query: 253 --GGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGV-GDNKGTI 309
G G E TPL HY + + + VG L++ + F + G+ K +
Sbjct: 257 HYGYNQLVFGEKANFEGYSTPLKVVNGHYYVTLEGISVGEKRLDIDSTAFSMKGNEKSAL 316
Query: 310 IDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDE-GFPNVTFHFE 368
IDSGT L +L E + L +++ + + + C++ + S D GFP VTFHF
Sbjct: 317 IDSGTALTWLAESAFRALDNEVRQLLDGVLMPFWRGSFACYKGTVSQDLIGFPVVTFHFS 376
Query: 369 NSVSLKVYPHEYLF-PFEDLWCIGWQNSGMQSRDRKNMTLLG 409
L + + D+ CI + + D K+ +++G
Sbjct: 377 GGADLDLDTESMFYQATPDILCIAVRQASAYGNDFKSFSVIG 418
>gi|125564143|gb|EAZ09523.1| hypothetical protein OsI_31798 [Oryza sativa Indica Group]
Length = 485
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 108/391 (27%), Positives = 165/391 (42%), Gaps = 49/391 (12%)
Query: 1 MGLCLRNCLCIVLIATAAVGGVSSNHG---------VFSVKYRYAGRERSLSLLKEHDAR 51
+G+ R+ C + A GG + H V S+ + AG + S++ A
Sbjct: 71 LGVVHRHGPCSPVQARRRGGGGAVTHAEILERDQARVDSIHRKVAGAGGAPSVVDPARAS 130
Query: 52 RQQRILAGVDLPLG-GSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKEC 110
Q GV LP G S G G Y +G+GTP K Y V DTGSD+ WV C C +C
Sbjct: 131 EQ-----GVSLPAQRGISL--GTGNYVVSVGLGTPAKQYAVIFDTGSDLSWVQCKPCADC 183
Query: 111 PRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSST 170
+ + L+D SST V C C + + C++++ C Y YGD S T
Sbjct: 184 YEQ-----QDPLFDPSLSSTYAAVACGAPECQELDA---SGCSSDSRCRYEVQYGDQSQT 235
Query: 171 TGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSM 230
G V+D + L + T +FGCG + +G +DG+ G G+ S+
Sbjct: 236 DGNLVRDTLT-------LSASDTLPGFVFGCGDQNAGLFGQ-----VDGLFGLGREKVSL 283
Query: 231 ISQLASSGGVRKMFAHCLDGINGG-GIFAIGHVVQPEVNKTPLV--PNQPHYSINMTAVQ 287
SQ A S G F +CL + G G ++G T L Y I++ ++
Sbjct: 284 PSQGAPSYG--PGFTYCLPSSSSGRGYLSLGGAPPANAQFTALADGATPSFYYIDLVGIK 341
Query: 288 VGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPL---VSKIISQQPDLKVHTVH 344
VG + +P GT+IDSGT + LP Y PL ++ ++Q ++
Sbjct: 342 VGGRAIRIPATA--FAAAGGTVIDSGTVITRLPPRAYAPLRAAFARSMAQYKKAPALSIL 399
Query: 345 DEYTCFQYSESVDEGFPNVTFHFENSVSLKV 375
D TC+ ++ P V F ++ +
Sbjct: 400 D--TCYDFTGHRTAQIPTVELAFAGGATVSL 428
>gi|25347778|pir||B84556 hypothetical protein At2g17760 [imported] - Arabidopsis thaliana
Length = 473
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 96/342 (28%), Positives = 149/342 (43%), Gaps = 47/342 (13%)
Query: 75 LYYAKIGIGTPPKDYYVQVDTGSDIMWV-----NCIQCKECPRRSSLGIELTLYDIKDSS 129
L+YA + +GTP + V +DTGSD+ W+ NC++ + P SSL +L +Y SS
Sbjct: 54 LHYANVTVGTPSDWFMVALDTGSDLFWLPCDCTNCVRELKAPGGSSL--DLNIYSPNASS 111
Query: 130 TGKFVTCDQEFCH--GVYGGPLTDCTANTSCPY-LEIYGDGSSTTGYFVQDVVQYDKVSG 186
T V C+ C P +D CPY + +G+S+TG V+DV+ VS
Sbjct: 112 TSTKVPCNSTLCTRGDRCASPESD------CPYQIRYLSNGTSSTGVLVEDVLHL--VSN 163
Query: 187 DLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAH 246
D + + + FGCG Q+G + A +G+ G G + S+ S LA G F+
Sbjct: 164 DKSSKAIPARVTFGCGQVQTGVFH--DGAAPNGLFGLGLEDISVPSVLAKEGIAANSFSM 221
Query: 247 CLDGINGGGIFAIGHVVQPEVNKTPLVPNQPH--YSINMTAVQVGLDFLNLPTDVFGVGD 304
C G +G G + G + +TPL QPH Y+I +T + VG + +L D
Sbjct: 222 CF-GNDGAGRISFGDKGSVDQRETPLNIRQPHPTYNITVTKISVGGNTGDLEFDA----- 275
Query: 305 NKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDE-------------YTCFQ 351
+ DSGT+ YL + Y + S D + T E Y+
Sbjct: 276 ----VFDSGTSFTYLTDAAYTLISESFNSLALDKRYQTTDSELPFEYCYALRLPLYSGHH 331
Query: 352 YSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFE--DLWCIG 391
+ +P V + S VY + P + D++C+
Sbjct: 332 HPNKDSFQYPAVNLTMKGGSSYPVYHPLVVIPMKDTDVYCLA 373
>gi|242082978|ref|XP_002441914.1| hypothetical protein SORBIDRAFT_08g004800 [Sorghum bicolor]
gi|241942607|gb|EES15752.1| hypothetical protein SORBIDRAFT_08g004800 [Sorghum bicolor]
Length = 429
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 101/368 (27%), Positives = 157/368 (42%), Gaps = 45/368 (12%)
Query: 62 LPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELT 121
PL G P G LYY + IG PPK Y++ VDTGSD+ W+ C + P RS +
Sbjct: 54 FPLYGDVYPHG--LYYVAMNIGNPPKPYFLDVDTGSDLTWLQC----DAPCRSCNKVPHP 107
Query: 122 LYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANT---SCPYLEIYGDGSSTTGYFVQDV 178
LY + K V C + C ++ G ++ C Y+ Y D S+TG V D
Sbjct: 108 LY---RPTKNKLVPCVDQLCASLHNGLNRKHKCDSPYEQCDYVIKYADQGSSTGVLVNDS 164
Query: 179 VQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSG 238
+G + SL FGCG Q + S DG++G G + S++SQ G
Sbjct: 165 FALRLANGSV----VRPSLAFGCGYDQ--QVSSGEMSPTDGVLGLGTGSVSLLSQFKQHG 218
Query: 239 GVRKMFAHCLDGINGGGIFAIGHVVQP--EVNKTPLV--PNQPHYSINMTAVQVGLDFLN 294
+ + HCL + GGG G + P V TP+V P + +YS ++ G L
Sbjct: 219 VTKNVVGHCLS-LRGGGFLFFGDDLVPYQRVTWTPMVRSPLRNYYSPGSASLYFGDQSLR 277
Query: 295 LP-TDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLV-------SKIISQQPDLKVHTVHDE 346
+ T+V + DSG++ Y Y+ LV S+ + + D +
Sbjct: 278 VKLTEV---------VFDSGSSFTYFAAQPYQALVTALKGDLSRTLKEVSDPSLPLCWKG 328
Query: 347 YTCFQYSESVDEGFPNVTFHF--ENSVSLKVYPHEYLFPFE-DLWCIGWQNSGMQSRDRK 403
F+ V + F ++ +F N +++ P YL + C+G N K
Sbjct: 329 KKPFKSVLDVKKEFKSLVLNFGNGNKAFMEIPPQNYLIVTKYGNACLGILNG--SEVGLK 386
Query: 404 NMTLLGDF 411
++++LGD
Sbjct: 387 DLSILGDI 394
>gi|225463766|ref|XP_002267930.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 479
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 96/361 (26%), Positives = 152/361 (42%), Gaps = 45/361 (12%)
Query: 47 EHDARRQQRILAGVDLPLGGSSRPD------------GVGLYYAKIGIGTPPKDYYVQVD 94
+ DA+R ++ + GGS R D G G Y+ +IG+G+PP+ Y+ +D
Sbjct: 99 KRDAKRVASLIRRLSSGGGGSYRVDDFGTDVISGMEQGSGEYFVRIGVGSPPRSQYMVID 158
Query: 95 TGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTA 154
+GSDI+WV C C +C +S ++D DS++ V+C C + C A
Sbjct: 159 SGSDIVWVQCQPCTQCYHQSD-----PVFDPADSASFTGVSCSSSVCDRLEN---AGCHA 210
Query: 155 NTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNE 214
C Y YGDGS T G + + + + + S+ GCG R G
Sbjct: 211 G-RCRYEVSYGDGSYTKGTLALETLTFGR--------TMVRSVAIGCGHRNRGMFVGAAG 261
Query: 215 EALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL--DGINGGGIFAIGHVVQPE-VNKTP 271
G + S + QL G F++CL G + G G P P
Sbjct: 262 LLGL-----GGGSMSFVGQLGGQTG--GAFSYCLVSRGTDSSGSLVFGREALPAGAAWVP 314
Query: 272 LV--PNQPH-YSINMTAVQVGLDFLNLPTDVFGVGD--NKGTIIDSGTTLAYLPEMVYEP 326
LV P P Y I + + VG + + +VF + + + G ++D+GT + LP + Y+
Sbjct: 315 LVRNPRAPSFYYIGLAGLGVGGIRVPISEEVFRLTELGDGGVVMDTGTAVTRLPTLAYQA 374
Query: 327 LVSKIISQQPDLKVHT-VHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFE 385
++Q +L T V TC+ V P V+F+F L + +L P +
Sbjct: 375 FRDAFLAQTANLPRATGVAIFDTCYDLLGFVSVRVPTVSFYFSGGPILTLPARNFLIPMD 434
Query: 386 D 386
D
Sbjct: 435 D 435
>gi|145693992|gb|ABP93696.1| unknown protein isoform 1 [Lemna minor]
Length = 350
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 97/314 (30%), Positives = 146/314 (46%), Gaps = 41/314 (13%)
Query: 68 SRPDGVGLYYAK------IGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELT 121
S P +GLY +G GTP K+ V DTGS++ W IQCK C S +
Sbjct: 2 SIPARIGLYIGTANYVITVGFGTPKKNQTVIFDTGSNVNW---IQCKPC-VVSCYPQQEP 57
Query: 122 LYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQY 181
L+D SST + ++C C G+ C+ +T C Y YGDGSST G+ + +
Sbjct: 58 LFDPTLSSTYRNISCTSAACTGLSS---RGCSGST-CVYGVTYGDGSSTVGFLATET--F 111
Query: 182 DKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVR 241
+G++ + IFGCG G G+IG G+S S+ SQLA+S G
Sbjct: 112 TLAAGNVF-----NNFIFGCGQNNQGLF-----TGAAGLIGLGRSPYSLNSQLATSLG-- 159
Query: 242 KMFAHCLDGINGG-GIFAIGHVVQPEVNKTPLVPNQPH---YSINMTAVQVGLDFLNLPT 297
+F++CL + G IG+ ++ T ++ N Y I++ + VG L L +
Sbjct: 160 NIFSYCLPSTSSATGYLNIGNPLRTP-GYTAMLTNSRAPTLYFIDLIGISVGGTRLALSS 218
Query: 298 DVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKI---ISQQPDLKVHTVHDEYTCFQYSE 354
VF + GTIIDSGT + LP Y L + ++Q ++ D TC+ +S
Sbjct: 219 TVF---QSVGTIIDSGTVITRLPPTAYGALRTAFRAAMTQYTRAAAASILD--TCYDFSR 273
Query: 355 SVDEGFPNVTFHFE 368
+ FP + H+
Sbjct: 274 TTTVTFPTIKLHYT 287
>gi|195645150|gb|ACG42043.1| aspartic proteinase Asp1 precursor [Zea mays]
Length = 415
Score = 108 bits (269), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 85/321 (26%), Positives = 140/321 (43%), Gaps = 28/321 (8%)
Query: 73 VGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGK 132
G YY + IG P K Y++ VDTGSD+ W+ C + P RS + LY + +
Sbjct: 50 TGHYYVTMNIGNPAKPYFLDVDTGSDLTWLQC----DAPCRSCNKVPHPLYR---PTANR 102
Query: 133 FVTCDQEFCHGVYGGPLTD--CTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQT 190
V C C ++ G ++ C + C Y Y D +S+ G + D S +++
Sbjct: 103 LVPCANALCTALHSGQGSNNKCPSPKQCDYQIKYTDSASSQGVLIND-----SFSLPMRS 157
Query: 191 TSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDG 250
++ L FGCG Q + + A+DG++G G+ + S++SQL G + + HCL
Sbjct: 158 SNIRPGLTFGCGYDQQVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVGHCLS- 216
Query: 251 INGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTII 310
NGGG G V P ++ VP S N + G + + + GV + +
Sbjct: 217 TNGGGFLFFGDDVVPS-SRVTWVPMAQRTSGNYYSPGSGTLYFDRRS--LGVKPME-VVF 272
Query: 311 DSGTTLAYLPEMVYEPLV-------SKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNV 363
DSG+T Y Y+ +V SK + Q D + F+ V F ++
Sbjct: 273 DSGSTYTYFTAQPYQAVVSALKGGLSKSLKQVSDPTLPLCWKGQKAFKSVFDVKNEFKSM 332
Query: 364 TFHFENS--VSLKVYPHEYLF 382
F ++ ++++ P YL
Sbjct: 333 FLSFSSAKNAAMEIPPENYLI 353
>gi|297814776|ref|XP_002875271.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
lyrata]
gi|297321109|gb|EFH51530.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
lyrata]
Length = 451
Score = 108 bits (269), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 100/349 (28%), Positives = 153/349 (43%), Gaps = 41/349 (11%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y+ + IG PP+ + DTGSD++WV C C+ C S T++ + SST
Sbjct: 79 GSGQYFVDLRIGQPPQSLLLIADTGSDLVWVKCSACRNCSHHS----PATVFFPRHSSTF 134
Query: 132 KFVTCDQEFCHGV-YGGPLTDCT---ANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGD 187
C C V G C +++CPY Y DGS T+G F ++ SG
Sbjct: 135 SPAHCYDPVCRLVPKPGRAPRCNHTRIHSTCPYEYGYADGSLTSGLFARETTSLKTSSGK 194
Query: 188 LQTTSTNGSLIFGCGARQSG-NLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAH 246
+ S+ FGCG R SG ++ T+ +G++G G+ S SQL G + F++
Sbjct: 195 ---EAKLKSVAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQLGRRFGNK--FSY 249
Query: 247 CLDG-----------INGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNL 295
CL I G G A+ + + PL P Y + + +V V L +
Sbjct: 250 CLMDYTLSPPPTSYLIIGDGGDAVSKLFFTPLLTNPLSPT--FYYVKLKSVFVNGAKLRI 307
Query: 296 PTDVFGVGD--NKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYT----- 348
++ + D N GT++DSGTTLA+L + Y LV + Q+ +K+ DE T
Sbjct: 308 DPSIWEIDDSGNGGTVMDSGTTLAFLADPAYR-LVIAAVKQR--IKLPNA-DELTPGFDL 363
Query: 349 CFQYS--ESVDEGFPNVTFHFENSVSLKVYPHEYLFPFED-LWCIGWQN 394
C S ++ P + F F P Y E+ + C+ Q+
Sbjct: 364 CVNVSGVTKPEKILPRLKFEFSGGAVFVPPPRNYFIETEEQIQCLAIQS 412
>gi|409179880|gb|AFV26025.1| aspartic proteinase nepenthesin 2 [Nepenthes mirabilis]
Length = 437
Score = 108 bits (269), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 103/385 (26%), Positives = 170/385 (44%), Gaps = 51/385 (13%)
Query: 45 LKEHDARRQQRILAGVDLPLGGSSRPD-----GVGLYYAKIGIGTPPKDYYVQVDTGSDI 99
L + +R +R + ++ L SS + G G Y + IGTP +DTGSD+
Sbjct: 60 LIKRAIKRGERRMRSINAMLQSSSGIETPVYAGSGEYLMNVAIGTPASSLSAIMDTGSDL 119
Query: 100 MWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCP 159
+W C C +C + + +++ +DSS+ + C+ ++C + P C + C
Sbjct: 120 IWTQCEPCTQCFSQPT-----PIFNPQDSSSFSTLPCESQYCQDL---PSESCYND--CQ 169
Query: 160 YLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDG 219
Y YGDGSST GY + ++ TS+ ++ FGCG G G
Sbjct: 170 YTYGYGDGSSTQGYMATETFTFE--------TSSVPNIAFGCGEDNQGFGQGNGA----G 217
Query: 220 IIGFGKSNSSMISQLASSGGVRKMFAHCLD--GINGGGIFAIGHVVQPEVNKTPLVP--- 274
+IG G S+ SQL GV + F++C+ G + A+G +P
Sbjct: 218 LIGMGWGPLSLPSQL----GVGQ-FSYCMTSSGSSSPSTLALGSAASGVPEGSPSTTLIH 272
Query: 275 ---NQPHYSINMTAVQVGLDFLNLPTDVFGVGDN--KGTIIDSGTTLAYLPEMVYEPLVS 329
N +Y I + + VG D L +P+ F + D+ G IIDSGTTL YLP+ Y V+
Sbjct: 273 SSLNPTYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAYN-AVA 331
Query: 330 KIISQQPDLKV--HTVHDEYTCFQY-SESVDEGFPNVTFHFENSVSLKVYPHEYLFPFED 386
+ + Q +L + TCFQ S+ P ++ F+ V + + P E
Sbjct: 332 QAFTDQINLSPVDESSSGLSTCFQLPSDGSTVQVPEISMQFDGGVLNLGEENVLISPAEG 391
Query: 387 LWCIGWQNSGMQSRDRKNMTLLGDF 411
+ C+ M S ++ +++ G+
Sbjct: 392 VICL-----AMGSSSQQGISIFGNI 411
>gi|219886219|gb|ACL53484.1| unknown [Zea mays]
gi|219888509|gb|ACL54629.1| unknown [Zea mays]
gi|414588374|tpg|DAA38945.1| TPA: nucellin-like aspartic protease [Zea mays]
Length = 415
Score = 108 bits (269), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 85/321 (26%), Positives = 140/321 (43%), Gaps = 28/321 (8%)
Query: 73 VGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGK 132
G YY + IG P K Y++ VDTGSD+ W+ C + P RS + LY + +
Sbjct: 50 TGHYYVTMNIGNPAKPYFLDVDTGSDLTWLQC----DAPCRSCNKVPHPLYR---PTANR 102
Query: 133 FVTCDQEFCHGVYGGPLTD--CTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQT 190
V C C ++ G ++ C + C Y Y D +S+ G + D S +++
Sbjct: 103 LVPCANALCTALHSGQGSNNKCPSPKQCDYQIKYTDSASSQGVLIND-----SFSLPMRS 157
Query: 191 TSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDG 250
++ L FGCG Q + + A+DG++G G+ + S++SQL G + + HCL
Sbjct: 158 SNIRPGLTFGCGYDQQVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVGHCLS- 216
Query: 251 INGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTII 310
NGGG G V P ++ VP S N + G + + + GV + +
Sbjct: 217 TNGGGFLFFGDDVVPS-SRVTWVPMAQRTSGNYYSPGSGTLYFDRRS--LGVKPME-VVF 272
Query: 311 DSGTTLAYLPEMVYEPLV-------SKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNV 363
DSG+T Y Y+ +V SK + Q D + F+ V F ++
Sbjct: 273 DSGSTYTYFTAQPYQAVVSALKGGLSKSLKQVSDPTLPLCWKGQKAFKSVFDVKNEFKSM 332
Query: 364 TFHFENS--VSLKVYPHEYLF 382
F ++ ++++ P YL
Sbjct: 333 FLSFASAKNAAMEIPPENYLI 353
>gi|449486856|ref|XP_004157423.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 491
Score = 108 bits (269), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 100/375 (26%), Positives = 166/375 (44%), Gaps = 41/375 (10%)
Query: 31 VKYRYAGRERSLSLLKEHDARRQQRILAGVDLP---LGGSSRPDGVGLYYAKIGIGTPPK 87
VK Y E +LS LK D + + DL + G+S+ G G Y++++G+G P K
Sbjct: 109 VKSIYDRLEFALSELKRSDLEPLKTEILPEDLSTPIISGTSQ--GSGEYFSRVGVGQPAK 166
Query: 88 DYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGG 147
+Y+ +DTGSDI W+ C C +C +++ ++D + SS+ + C+ + C +
Sbjct: 167 PFYMVLDTGSDINWLQCQPCTDCYQQTD-----PIFDPRSSSSFASLPCESQQCQAL--- 218
Query: 148 PLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSG 207
+ C A + C Y YGDGS T G FV + + + SG + + GCG G
Sbjct: 219 ETSGCRA-SKCLYQVSYGDGSFTVGEFVTETLTFGN-SGMINDVAV------GCGHDNEG 270
Query: 208 NLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL--DGINGGGIFAIGHVVQP 265
+ G S+ SQ+ +S F++CL +
Sbjct: 271 LFVGSAGLLGL-----GGGPLSLTSQMKASS-----FSYCLVDRDSSSSSDLEFNSAAPS 320
Query: 266 EVNKTPLVPN---QPHYSINMTAVQVGLDFLNLPTDVFGVGDN--KGTIIDSGTTLAYLP 320
+ PL+ + Y + +T + VG L++P ++F + D+ G I+DSGT + L
Sbjct: 321 DSVNAPLLKSGKVDTFYYVGLTGMSVGGQLLSIPPNLFQMDDSGYGGIIVDSGTAITRLQ 380
Query: 321 EMVYEPLVSKIISQQPDLKVHTVHDEY-TCFQYSESVDEGFPNVTFHFENSVSLKVYPHE 379
Y L +S+ P LK + TC+ S P V+F F SL++ P
Sbjct: 381 TQAYNTLRDAFVSRTPYLKKTNGFALFDTCYDLSSQSRVTIPTVSFEFAGGKSLQLPPKN 440
Query: 380 YLFPFEDL--WCIGW 392
YL P + + +C +
Sbjct: 441 YLIPVDSVGTFCFAF 455
>gi|38344196|emb|CAE05761.2| OSJNBa0064G10.12 [Oryza sativa Japonica Group]
Length = 451
Score = 108 bits (269), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 85/316 (26%), Positives = 135/316 (42%), Gaps = 42/316 (13%)
Query: 71 DGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSST 130
DG G Y+ ++G+G+PP D Y+ VD+GSD++WV C C++C ++ L+D SS+
Sbjct: 125 DGSGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTD-----PLFDPAASSS 179
Query: 131 GKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQT 190
V+C C + G C Y YGDGS T G + + L
Sbjct: 180 FSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLT-------LGG 232
Query: 191 TSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDG 250
T+ G I GCG R SG G++G G S++ QL + G +F++CL
Sbjct: 233 TAVQGVAI-GCGHRNSGLFVGAA-----GLLGLGWGAMSLVGQLGGAAG--GVFSYCLAS 284
Query: 251 INGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDN--KGT 308
GG ++ Y + +T + VG + L L +F + ++ G
Sbjct: 285 RGAGGAGSLA---------------SSFYYVGLTGIGVGGERLPLQDSLFQLTEDGAGGV 329
Query: 309 IIDSGTTLAYLPEMVYEPLVSKI---ISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTF 365
++D+GT + LP Y L + P ++ D TC+ S P V+F
Sbjct: 330 VMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLD--TCYDLSGYASVRVPTVSF 387
Query: 366 HFENSVSLKVYPHEYL 381
+F+ L + L
Sbjct: 388 YFDQGAVLTLPARNLL 403
>gi|357514995|ref|XP_003627786.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521808|gb|AET02262.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 436
Score = 108 bits (269), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 105/346 (30%), Positives = 161/346 (46%), Gaps = 46/346 (13%)
Query: 70 PDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSS 129
PD +G Y +GTPP Y VDTGSDI+W+ C C+EC +++ +++ SS
Sbjct: 82 PD-IGEYLMTYSVGTPPFKLYGIVDTGSDIVWLQCEPCQECYNQTT-----PMFNPSKSS 135
Query: 130 TGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQ 189
+ K + C + C + T C C Y YGD S + G D + + +G
Sbjct: 136 SYKNIPCPSKLCQSMED---TSCNDKNYCEYSTYYGDNSHSGGDLSVDTLTLESTNG--- 189
Query: 190 TTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLD 249
T + +++ GCG + N+ S E A GI+GFG +S I+QL SS G + F++CL
Sbjct: 190 LTVSFPNIVIGCG---TNNILSY-EGASSGIVGFGSGPASFITQLGSSTGGK--FSYCLT 243
Query: 250 GI--------NGGGIFAIGHVVQPE---VNKTPLVPNQPH--YSINMTAVQVGLDFLNLP 296
+ N G V TP++ P Y + + A VG N
Sbjct: 244 PLFSVTNIQSNATSKLNFGDAATVSGDGVVTTPILKKDPETFYYLTLEAFSVG----NRR 299
Query: 297 TDVFGV--GDNKGT-IIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYS 353
++ GV GDN+G IIDSGTTL L + Y L S ++ +K+ V D
Sbjct: 300 VEIGGVPNGDNEGNIIIDSGTTLTSLTKDDYSFLESAVVDL---VKLERVDDPTQTLNLC 356
Query: 354 ESVD-EG--FPNVTFHFENSVSLKVYPHEYLFPFED-LWCIGWQNS 395
SV EG FP +T HF+ + + ++P D ++C+ +++S
Sbjct: 357 YSVKAEGYDFPIITMHFKGA-DVDLHPISTFVSVADGVFCLAFESS 401
>gi|302786560|ref|XP_002975051.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
gi|300157210|gb|EFJ23836.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
Length = 359
Score = 108 bits (269), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 105/354 (29%), Positives = 153/354 (43%), Gaps = 51/354 (14%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y ++ IGTPP+ +DTGSD++W+ C C C T++ SS+
Sbjct: 1 GEGEYMMELSIGTPPQLIPAMIDTGSDLVWLKCDNCDHCDLDHH---GETIFFSDASSSY 57
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
K + C+ C G+ + T C Y YGDGS T+G D + +
Sbjct: 58 KKLPCNSTHCSGMSSAGIGPRCEET-CKYKYEYGDGSRTSGDVGSDRISFRSHGAGEDHR 116
Query: 192 STNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI 251
S +FGCG + G+ + T G+IG G+ + S+I QL G + F++CL
Sbjct: 117 SFFDGFLFGCGRKLKGDWNFTQ-----GLIGLGQKSHSLIQQLGDKLGYK--FSYCLVSY 169
Query: 252 N-----------GGGIFAIGHVVQPEVNKTPLVP----NQPHYSINMTAVQVGLDFLNLP 296
+ G GH +V TP++ +Q Y +++ ++ VG +P
Sbjct: 170 DSPPSAKSFLFLGSSAALRGH----DVVSTPILHGDHLDQTLYYVDLQSITVG----GVP 221
Query: 297 TDVFG--VGDNKG--------TIIDSGTTLAYLPEMVYEPLVSKIISQ--QPDLKVHTVH 344
V+ G N T+IDSGTT L VYE + I Q P L
Sbjct: 222 VVVYDKESGHNTSVGPFLANKTVIDSGTTYTLLTPPVYEAMRKSIEEQVILPTLGNSAGL 281
Query: 345 DEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPF--EDLWCIGWQNSG 396
D CF S GFP+VTF+F N V L V P E +F D+ C+ +SG
Sbjct: 282 D--LCFNSSGDTSYGFPSVTFYFANQVQL-VLPFENIFQVTSRDVVCLSMDSSG 332
>gi|242095588|ref|XP_002438284.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
gi|241916507|gb|EER89651.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
Length = 487
Score = 108 bits (269), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 95/302 (31%), Positives = 139/302 (46%), Gaps = 37/302 (12%)
Query: 47 EHDARRQQRILAGVDLPLGGSSRPDGVGL------YYAKIGIGTPPKDYYVQVDTGSDIM 100
H R R L + ++ P +GL Y IGIGTPP+++ V DTGSD+
Sbjct: 87 RHRVRSIYRRLTAAETTTTTTTIPARLGLAFQSLEYVVTIGIGTPPRNFTVLFDTGSDLT 146
Query: 101 WVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPY 160
WV QC CP S + L+D SST V C CH + G T C A TSC Y
Sbjct: 147 WV---QCLPCPDSSCYPQQEPLFDPSKSSTYVDVPCSAPECH-IGGVQQTRCGA-TSCEY 201
Query: 161 LEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGI 220
YGD S T G ++ S L +T ++FGC + T + G+
Sbjct: 202 SVKYGDESETHGSLAEETFTLSPPS-PLAPAATG--VVFGCSHEYISVFNDTG-MGVAGL 257
Query: 221 IGFGKSNSSMISQ----LASSGGVRKMFAHCLD--GINGGGIFAIGHVVQPE-----VNK 269
+G G+ +SS++SQ + S GGV F++CL G + G + G P+ ++
Sbjct: 258 LGLGRGDSSILSQTRRSINSGGGV---FSYCLPPRGSSTGYLTIGGGAAAPQQQYSNLSF 314
Query: 270 TPLVPN----QPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYE 325
TPL+ + Y +N+ V V +++P F + G +IDSGT + ++P Y
Sbjct: 315 TPLITTISQLRSAYVVNLAGVSVNGAAVDIPASAFSL----GAVIDSGTVVTHMPAAAYY 370
Query: 326 PL 327
PL
Sbjct: 371 PL 372
>gi|413936884|gb|AFW71435.1| hypothetical protein ZEAMMB73_652585 [Zea mays]
Length = 287
Score = 108 bits (269), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 61/124 (49%), Positives = 75/124 (60%), Gaps = 13/124 (10%)
Query: 11 IVLIATAAVGGVSSNHGVFSVKYRYA-----GRERSLSLLKEHDARRQQRIL-AGVDLPL 64
+VL+ +V G + GVF V+ ++ G L+ L+ HD R R+L A VDL L
Sbjct: 16 LVLLFALSVVGRAGATGVFQVRRKFPRHGRRGVAEHLAALRRHDVGRHGRLLGAVVDLGL 75
Query: 65 GGSSRPDGVG-------LYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLG 117
GG P G LYY +I IG+PPK YYVQVDTGSDI+WVNCI+C CP RS LG
Sbjct: 76 GGVGLPTAAGCLPAQRSLYYTRIEIGSPPKGYYVQVDTGSDILWVNCIRCDGCPARSGLG 135
Query: 118 IELT 121
IELT
Sbjct: 136 IELT 139
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 25/60 (41%), Positives = 40/60 (66%), Gaps = 5/60 (8%)
Query: 353 SESVDEGFPNVTFHFENSVSLKVYPHEYLFP-FEDLWCIGWQNSGMQSRDRKNMTLLGDF 411
+ VD+GFP +TF FE +++ VYP +YLF DL+C+G+ + G+Q+ ++ LLGD
Sbjct: 158 NSGVDDGFPVITFSFEGGLTMNVYPDDYLFQNRNDLYCMGFLDGGVQT----DIVLLGDL 213
>gi|225216930|gb|ACN85225.1| aspartic proteinase nepenthesin-1 precursor [Oryza punctata]
gi|225216938|gb|ACN85232.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
Length = 516
Score = 108 bits (269), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 103/324 (31%), Positives = 131/324 (40%), Gaps = 52/324 (16%)
Query: 69 RPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQC-KECPRRSSLGIELTLYDIKD 127
R G G Y +G+GTP Y V DTGSD WV C C C + L+D
Sbjct: 172 RALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQRE-----KLFDPAR 226
Query: 128 SSTGKFVTCDQEFC-----HGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVV--- 179
SST V+C C G GG C Y YGDGS + G+F D +
Sbjct: 227 SSTYANVSCAAPACSDLDTRGCSGG---------HCLYGVQYGDGSYSIGFFAMDTLTLS 277
Query: 180 QYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASS-G 238
YD V G FGCG R G E A G++G G+ +S+ Q G
Sbjct: 278 SYDAVKG----------FRFGCGERNEGLF---GEAA--GLLGLGRGKTSLPVQTYDKYG 322
Query: 239 GVRKMFAHCLDGINGGG---IFAIGHVVQPEVNKTPLVPNQP-HYSINMTAVQVGLDFLN 294
GV FAHCL + G F G LV N P Y + +T ++VG L
Sbjct: 323 GV---FAHCLPARSTGTGYLDFGAGSPAARLTTTPMLVDNGPTFYYVGLTGIRVGGRLLY 379
Query: 295 LPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKI---ISQQPDLKVHTVHDEYTCFQ 351
+P VF GTI+DSGT + LP Y L S +S + K V TC+
Sbjct: 380 IPQSVFA---TAGTIVDSGTVITRLPPAAYSSLRSAFAAAMSARGYKKAPAVSLLDTCYD 436
Query: 352 YSESVDEGFPNVTFHFENSVSLKV 375
++ P V+ F+ L V
Sbjct: 437 FAGMSQVAIPTVSLLFQGGARLDV 460
>gi|356567196|ref|XP_003551807.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 490
Score = 108 bits (269), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 98/351 (27%), Positives = 155/351 (44%), Gaps = 39/351 (11%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y+ +G+GTP +D + DTGSD+ W C C RS + ++D S++
Sbjct: 142 GSGNYFVVVGLGTPKRDLSLIFDTGSDLTWTQCEPCA----RSCYKQQDVIFDPSKSTSY 197
Query: 132 KFVTCDQEFCHGVYGGPLTD--CTANT-SCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDL 188
+TC C + D C+A+T +C Y YGD S + GYF ++ + +
Sbjct: 198 SNITCTSALCTQLSTATGNDPGCSASTKACIYGIQYGDSSFSVGYFSRERLT-------V 250
Query: 189 QTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL 248
T + +FGCG G + G+IG G+ S + Q A+ RK+F++CL
Sbjct: 251 TATDVVDNFLFGCGQNNQGLFGGSA-----GLIGLGRHPISFVQQTAAK--YRKIFSYCL 303
Query: 249 DGINGG-GIFAIGHVVQPEVNK-TP---LVPNQPHYSINMTAVQVGLDFLNLPTDVFGVG 303
+ G + G K TP + Y +++TA+ VG L + + F G
Sbjct: 304 PSTSSSTGHLSFGPAATGRYLKYTPFSTISRGSSFYGLDITAIAVGGVKLPVSSSTFSTG 363
Query: 304 DNKGTIIDSGTTLAYLPEMVYEPLVSKI---ISQQPDLKVHTVHDEYTCFQYSESVDEGF 360
G IIDSGT + LP Y L S +S+ P ++ D TC+ S
Sbjct: 364 ---GAIIDSGTVITRLPPTAYGALRSAFRQGMSKYPSAGELSILD--TCYDLSGYKVFSI 418
Query: 361 PNVTFHFENSVSLKVYPHEYLF-PFEDLWCIGWQNSGMQSRDRKNMTLLGD 410
P + F F V++K+ P LF C+ + +G D ++T+ G+
Sbjct: 419 PTIEFSFAGGVTVKLPPQGILFVASTKQVCLAFAANG----DDSDVTIYGN 465
>gi|218196224|gb|EEC78651.1| hypothetical protein OsI_18747 [Oryza sativa Indica Group]
Length = 317
Score = 108 bits (269), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 53/134 (39%), Positives = 76/134 (56%), Gaps = 1/134 (0%)
Query: 221 IGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHVVQPEVNKTPLVPNQPHYS 280
+G G SN+S++ QLA S +KMFAHCLDG GGIF +GH+V P+V KTPL Y
Sbjct: 1 MGLGPSNTSLVYQLAKSQKWKKMFAHCLDGKRSGGIFVLGHIVGPKVRKTPLDQTSSRYR 60
Query: 281 INMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKV 340
+ + VG L+L + TI+++G+ ++YLPE VY+ + I S D+ V
Sbjct: 61 TTLLEITVGETSLSLSAGNVEIKSQNMTILETGSLISYLPEKVYQSFLDSIFSDLEDISV 120
Query: 341 HTVHDEYTCFQYSE 354
+ Y+CF Y
Sbjct: 121 INI-GGYSCFHYER 133
>gi|212721496|ref|NP_001131929.1| uncharacterized protein LOC100193320 precursor [Zea mays]
gi|194692946|gb|ACF80557.1| unknown [Zea mays]
Length = 424
Score = 108 bits (269), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 104/371 (28%), Positives = 163/371 (43%), Gaps = 50/371 (13%)
Query: 62 LPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELT 121
PL G P G LYY + IG PPK Y++ VD+GSD+ W+ C + P RS +
Sbjct: 45 FPLYGDVYPHG--LYYVAMNIGNPPKPYFLDVDSGSDLTWLQC----DAPCRSCNEVPHP 98
Query: 122 LYDIKDSSTGKFVTCDQEFCHGVYGGPLT---DC-TANTSCPYLEIYGDGSSTTGYFVQD 177
LY S K V C C ++ G LT C + + C Y+ Y D S+TG + D
Sbjct: 99 LYRPTKS---KLVPCVHRLCASLHNG-LTGKHRCDSPHEQCDYVIKYADQGSSTGVLIND 154
Query: 178 VVQYDKVSGDLQTTSTNGSLIFGCGARQ---SGNLDSTNEEALDGIIGFGKSNSSMISQL 234
+G + S+ FGCG Q SG+L S DG++G G + S++SQL
Sbjct: 155 SFALRLTNGSV----ARPSVAFGCGYDQQVRSGDLSSPT----DGVLGLGTGSVSLLSQL 206
Query: 235 ASSGGVRKMFAHCLDGINGGGIFAIGHVVQP--EVNKTPLVPN--QPHYSINMTAVQVGL 290
G + + HCL + GGG G + P TP+ + + +YS ++ G
Sbjct: 207 KQRGVTKNVVGHCLS-LRGGGFLFFGDDLVPYQRATWTPMARSAFRNYYSPGSASLYFGD 265
Query: 291 DFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLV-------SKIISQQPDLKVHTV 343
L GV K + DSG++ Y Y+ LV S+ + ++PD +
Sbjct: 266 RSL-------GVRLAK-VVFDSGSSFTYFAAKPYQALVTALKDGLSRTLEEEPDTSLPLC 317
Query: 344 HDEYTCFQYSESVDEGFPNVTFHFEN--SVSLKVYPHEYLFPFED-LWCIGWQNSGMQSR 400
F+ V + F ++ +F + +++ P YL E+ C+G N
Sbjct: 318 WKGQEPFKSVLDVRKEFKSLVLNFASGKKTLMEIPPENYLIVTENGNACLGILNG--SEI 375
Query: 401 DRKNMTLLGDF 411
K+++++GD
Sbjct: 376 GLKDLSIIGDI 386
>gi|238006986|gb|ACR34528.1| unknown [Zea mays]
gi|413916290|gb|AFW56222.1| aspartic proteinase Asp1 [Zea mays]
Length = 433
Score = 108 bits (269), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 104/371 (28%), Positives = 163/371 (43%), Gaps = 50/371 (13%)
Query: 62 LPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELT 121
PL G P G LYY + IG PPK Y++ VD+GSD+ W+ C + P RS +
Sbjct: 54 FPLYGDVYPHG--LYYVAMNIGNPPKPYFLDVDSGSDLTWLQC----DAPCRSCNEVPHP 107
Query: 122 LYDIKDSSTGKFVTCDQEFCHGVYGGPLT---DC-TANTSCPYLEIYGDGSSTTGYFVQD 177
LY S K V C C ++ G LT C + + C Y+ Y D S+TG + D
Sbjct: 108 LYRPTKS---KLVPCVHRLCASLHNG-LTGKHRCDSPHEQCDYVIKYADQGSSTGVLIND 163
Query: 178 VVQYDKVSGDLQTTSTNGSLIFGCGARQ---SGNLDSTNEEALDGIIGFGKSNSSMISQL 234
+G + S+ FGCG Q SG+L S DG++G G + S++SQL
Sbjct: 164 SFALRLTNGSV----ARPSVAFGCGYDQQVRSGDLSSPT----DGVLGLGTGSVSLLSQL 215
Query: 235 ASSGGVRKMFAHCLDGINGGGIFAIGHVVQP--EVNKTPLVPN--QPHYSINMTAVQVGL 290
G + + HCL + GGG G + P TP+ + + +YS ++ G
Sbjct: 216 KQRGVTKNVVGHCLS-LRGGGFLFFGDDLVPYQRATWTPMARSAFRNYYSPGSASLYFGD 274
Query: 291 DFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLV-------SKIISQQPDLKVHTV 343
L GV K + DSG++ Y Y+ LV S+ + ++PD +
Sbjct: 275 RSL-------GVRLAK-VVFDSGSSFTYFAAKPYQALVTALKDGLSRTLEEEPDTSLPLC 326
Query: 344 HDEYTCFQYSESVDEGFPNVTFHFEN--SVSLKVYPHEYLFPFED-LWCIGWQNSGMQSR 400
F+ V + F ++ +F + +++ P YL E+ C+G N
Sbjct: 327 WKGQEPFKSVLDVRKEFKSLVLNFASGKKTLMEIPPENYLIVTENGNACLGILNG--SEI 384
Query: 401 DRKNMTLLGDF 411
K+++++GD
Sbjct: 385 GLKDLSIIGDI 395
>gi|15238055|ref|NP_196570.1| aspartic proteinase-like protein 1 [Arabidopsis thaliana]
gi|75180764|sp|Q9LX20.1|ASPL1_ARATH RecName: Full=Aspartic proteinase-like protein 1; Flags: Precursor
gi|7960727|emb|CAB92049.1| putative protein [Arabidopsis thaliana]
gi|332004108|gb|AED91491.1| aspartic proteinase-like protein 1 [Arabidopsis thaliana]
Length = 528
Score = 108 bits (269), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 105/366 (28%), Positives = 167/366 (45%), Gaps = 46/366 (12%)
Query: 44 LLKEHDARRQQRIL-AGVD--LPLGGS----SRPDGVGLYYAKIGIGTPPKDYYVQVDTG 96
LL E D RRQ+ L A V +P GS S D L+Y I IGTP + V +DTG
Sbjct: 61 LLAESDFRRQRMNLGAKVQSLVPSEGSKTISSGNDFGWLHYTWIDIGTPSVSFLVALDTG 120
Query: 97 SDIMWV--NCIQCKECPRR--SSLGI-ELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTD 151
S+++W+ NC+QC SSL +L Y+ SST K C + C +D
Sbjct: 121 SNLLWIPCNCVQCAPLTSTYYSSLATKDLNEYNPSSSSTSKVFLCSHKLCDSA-----SD 175
Query: 152 C-TANTSCPYLEIYGDG-SSTTGYFVQDVVQYDKVSGDLQ---TTSTNGSLIFGCGARQS 206
C + CPY Y G +S++G V+D++ + + ++S ++ GCG +QS
Sbjct: 176 CESPKEQCPYTVNYLSGNTSSSGLLVEDILHLTYNTNNRLMNGSSSVKARVVIGCGKKQS 235
Query: 207 GNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIF--AIGHVVQ 264
G D + A DG++G G + S+ S L+ +G +R F+ C D + G I+ +G +Q
Sbjct: 236 G--DYLDGVAPDGLMGLGPAEISVPSFLSKAGLMRNSFSLCFDEEDSGRIYFGDMGPSIQ 293
Query: 265 PEVNKTPLVPNQ-PHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMV 323
L N+ Y + + A +G L + T IDSG + YLPE +
Sbjct: 294 QSTPFLQLDNNKYSGYIVGVEACCIGNSCLKQTSFT--------TFIDSGQSFTYLPEEI 345
Query: 324 YEPLVSKIISQQPDLKVHTVHDEYTC----FQYSESVDEGFPNVTFHFENSVSLKVYPHE 379
Y + +I D ++ + + Y S + P + F ++ + + H+
Sbjct: 346 YRKVALEI-----DRHINATSKNFEGVSWEYCYESSAEPKVPAIKLKFSHNNTFVI--HK 398
Query: 380 YLFPFE 385
LF F+
Sbjct: 399 PLFVFQ 404
>gi|449439383|ref|XP_004137465.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 491
Score = 107 bits (268), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 100/375 (26%), Positives = 167/375 (44%), Gaps = 41/375 (10%)
Query: 31 VKYRYAGRERSLSLLKEHDARRQQRILAGVDLP---LGGSSRPDGVGLYYAKIGIGTPPK 87
VK Y E +LS LK D + + DL + G+S+ G G Y++++G+G P K
Sbjct: 109 VKSIYDRLEFALSELKRSDLEPLKTEILPEDLSTPIISGTSQ--GSGEYFSRVGVGQPAK 166
Query: 88 DYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGG 147
+Y+ +DTGSDI W+ C C +C +++ ++D + SS+ + C+ + C +
Sbjct: 167 PFYMVLDTGSDINWLQCQPCTDCYQQTD-----PIFDPRSSSSFASLPCESQQCQAL--- 218
Query: 148 PLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSG 207
+ C A + C Y YGDGS T G FV + + + SG + + GCG G
Sbjct: 219 ETSGCRA-SKCLYQVSYGDGSFTVGEFVIETLTFGN-SGMINNVAV------GCGHDNEG 270
Query: 208 NLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL--DGINGGGIFAIGHVVQP 265
+ G + S+ SQ+ +S F++CL +
Sbjct: 271 LFVGSAGLLGL-----GGGSLSLTSQMKASS-----FSYCLVDRDSSSSSDLEFNSAAPS 320
Query: 266 EVNKTPLVPN---QPHYSINMTAVQVGLDFLNLPTDVFGVGDN--KGTIIDSGTTLAYLP 320
+ PL+ + Y + +T + VG L++P ++F + D+ G I+DSGT + L
Sbjct: 321 DSVNAPLLKSGKVDTFYYVGLTGMSVGGQLLSIPPNLFQMDDSGYGGIIVDSGTAITRLQ 380
Query: 321 EMVYEPLVSKIISQQPDLKVHTVHDEY-TCFQYSESVDEGFPNVTFHFENSVSLKVYPHE 379
Y L +S+ P LK + TC+ S P V+F F SL++ P
Sbjct: 381 TQAYNTLRDAFVSRTPYLKKTNGFALFDTCYDLSSQSRVTIPTVSFEFAGGKSLQLPPKN 440
Query: 380 YLFPFEDL--WCIGW 392
YL P + + +C +
Sbjct: 441 YLIPVDSVGTFCFAF 455
>gi|293333012|ref|NP_001168410.1| uncharacterized protein LOC100382179 precursor [Zea mays]
gi|223948083|gb|ACN28125.1| unknown [Zea mays]
gi|413953783|gb|AFW86432.1| hypothetical protein ZEAMMB73_737575 [Zea mays]
Length = 466
Score = 107 bits (268), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 102/349 (29%), Positives = 154/349 (44%), Gaps = 39/349 (11%)
Query: 32 KYRYAGRERSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYV 91
+ R A LS + A+ Q+ +GV +P S G Y + +GTP +
Sbjct: 89 QLRAANIHAKLSSPRNSSAKELQQ--SGVTIPTS-SGYSLGTPEYVITVSLGTPAVTQVM 145
Query: 92 QVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTD 151
+DTGSD+ WV QC C +S + L+D S+T +C C + GG
Sbjct: 146 SIDTGSDVSWV---QCAPCAAQSCSSQKDKLFDPAKSATYSAFSCSSAQCAQL-GGEGNG 201
Query: 152 CTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDS 211
C N+ C Y+ Y D S+TTG + D + L T+ + FGC R +G +
Sbjct: 202 CL-NSHCQYIVKYVDHSNTTGTYGSDTL-------GLTTSDAVKNFQFGCSHRANGFVGQ 253
Query: 212 TNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL--DGINGGGIFAIGHVV----QP 265
LDG++G G S++SQ A++ G K F++CL + GG +G
Sbjct: 254 -----LDGLMGLGGDTESLVSQTAATYG--KAFSYCLPPSSSSAGGFLTLGAAAGGTSSS 306
Query: 266 EVNKTPLVP-NQP-HYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMV 323
++TPLV N P Y + + A+ V LN+P VF + +++DSGT + LP
Sbjct: 307 RYSRTPLVRFNVPTFYGVFLQAITVAGTKLNVPASVF----SGASVVDSGTVITQLPPTA 362
Query: 324 YEPLVS---KIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFEN 369
Y+ L + K + P + D TCF +S P VT F
Sbjct: 363 YQALRTAFKKEMKAYPSAAPVGILD--TCFDFSGIKTVRVPVVTLTFSR 409
>gi|224143371|ref|XP_002324933.1| predicted protein [Populus trichocarpa]
gi|222866367|gb|EEF03498.1| predicted protein [Populus trichocarpa]
Length = 463
Score = 107 bits (268), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 97/361 (26%), Positives = 156/361 (43%), Gaps = 54/361 (14%)
Query: 50 ARRQQRILAGVD-----LPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNC 104
ARR + + V+ +P G S+ Y +GIGTP K+ + DTGS ++W C
Sbjct: 102 ARRSMNLTSSVEHMKSSVPFYGLSKITASD-YIVNVGIGTPKKEMPLIFDTGSGLIWTQC 160
Query: 105 IQCKEC-PRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEI 163
CK C P+ + ++D S++ K + C + C + G ++ C YL
Sbjct: 161 KPCKACYPK-------VPVFDPTKSASFKGLPCSSKLCQSIRQG-----CSSPKCTYLTA 208
Query: 164 YGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGF 223
Y D SS+TG + + + + D + +++ GC + SG +S E GI+G
Sbjct: 209 YVDNSSSTGTLATETISFSHLKYDFK------NILIGCSDQVSG--ESLGES---GIMGL 257
Query: 224 GKSNSSMISQLASSGGVRKMFAHCLDGINGG-GIFAIGHVVQPEVNKTPLVPNQPH--YS 280
+S S+ SQ A+ K+F++C+ G G G V +V +P+ P Y
Sbjct: 258 NRSPISLASQTANI--YDKLFSYCIPSTPGSTGHLTFGGKVPNDVRFSPVSKTAPSSDYD 315
Query: 281 INMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVS---KIISQQPD 337
I MT + VG L + F + + IDSG L LP Y L S +++ P
Sbjct: 316 IKMTGISVGGRKLLIDASAFKI----ASTIDSGAVLTRLPPKAYSALRSVFREMMKGYPL 371
Query: 338 LKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFEDLWCIGWQNSGM 397
L D TC+ +S P+++ FE V + + D+ I WQ G
Sbjct: 372 LDQDDFLD--TCYDFSNYSTVAIPSISVFFEGGVEMDI----------DVSGIMWQVPGS 419
Query: 398 Q 398
+
Sbjct: 420 K 420
>gi|225427550|ref|XP_002266461.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 439
Score = 107 bits (268), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 99/338 (29%), Positives = 150/338 (44%), Gaps = 47/338 (13%)
Query: 73 VGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGK 132
G Y + IGTPP VDTGSD+ W C C C ++ + L+D K+SST +
Sbjct: 89 AGEYLMNLYIGTPPVPVIAIVDTGSDLTWTQCRPCTHCYKQV-----VPLFDPKNSSTYR 143
Query: 133 FVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTS 192
+C FC + G C+ C + Y DGS T G + + D +G + S
Sbjct: 144 DSSCGTSFCLAL--GKDRSCSKEKKCTFRYSYADGSFTGGNLASETLTVDSTAG--KPVS 199
Query: 193 TNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL---- 248
G FGCG G D ++ GI+G G S+ISQL S+ + +F++CL
Sbjct: 200 FPG-FAFGCGHSSGGIFDKSSS----GIVGLGGGELSLISQLKST--INGLFSYCLLPVS 252
Query: 249 ------DGINGGGIFAIGHVVQPEVNKTPLVPNQP--HYSINMTAVQVGLDFL-----NL 295
IN G A G V TPLV P Y + + + VG L +
Sbjct: 253 TDSSISSRINFG---ASGRVSGYGTVSTPLVQKSPDTFYYLTLEGISVGKKRLPYKGYSK 309
Query: 296 PTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQ--YS 353
T+V + I+DSGTT +LP+ Y L + + +K V D F Y+
Sbjct: 310 KTEV----EEGNIIVDSGTTYTFLPQEFYSKLEKSVANS---IKGKRVRDPNGIFSLCYN 362
Query: 354 ESVDEGFPNVTFHFENSVSLKVYP-HEYLFPFEDLWCI 390
+ + P +T HF+++ ++++ P + ++ EDL C
Sbjct: 363 TTAEINAPIITAHFKDA-NVELQPLNTFMRMQEDLVCF 399
>gi|242066142|ref|XP_002454360.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
gi|241934191|gb|EES07336.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
Length = 466
Score = 107 bits (268), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 89/299 (29%), Positives = 132/299 (44%), Gaps = 34/299 (11%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y + +G+P K + +DTGSD+ WV C C +C ++ L+D SST +
Sbjct: 133 YLITVRLGSPGKSQTMLIDTGSDVSWVQCKPCSQCHSQAD-----PLFDPSSSSTYSPFS 187
Query: 136 CDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNG 195
C C + G + +++ C Y YGDGSSTTG + D + ++
Sbjct: 188 CSSAACAQL--GQEGNGCSSSQCQYTVTYGDGSSTTGTYSSDTLALG--------SNAVR 237
Query: 196 SLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGG 255
FGC +SG D T DG++G G S++SQ A + G F++CL +
Sbjct: 238 KFQFGCSNVESGFNDQT-----DGLMGLGGGAQSLVSQTAGTFGA--AFSYCLPATSSSS 290
Query: 256 IFAIGHVVQPEVNKTPLVPNQ---PHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDS 312
F KTP++ + Y + + A++VG L++PT VF + GTI+DS
Sbjct: 291 GFLTLGAGTSGFVKTPMLRSSQVPTFYGVRIQAIRVGGRQLSIPTSVF----SAGTIMDS 346
Query: 313 GTTLAYLPEMVYEPLVSKI---ISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFE 368
GT L LP Y L S + Q P + D TCF +S P V F
Sbjct: 347 GTVLTRLPPTAYSALSSAFKAGMKQYPSAPPSGILD--TCFDFSGQSSVSIPTVALVFS 403
>gi|356567798|ref|XP_003552102.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 520
Score = 107 bits (268), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 92/323 (28%), Positives = 152/323 (47%), Gaps = 37/323 (11%)
Query: 75 LYYAKIGIGTPPKDYYVQVDTGSDIMWV--NCIQCKECPRR--SSLGIELTLYDIKDSST 130
L+Y I IGTP + V +D GSD++W+ +C+QC S+L +L Y S +
Sbjct: 95 LHYTWIDIGTPSTSFLVALDAGSDLLWIPCDCVQCAPLSSSYYSNLDRDLNEYSPSRSLS 154
Query: 131 GKFVTCDQEFCHGVYGGPLTDC-TANTSCPYLEIY-GDGSSTTGYFVQDVVQYDKVSGDL 188
K ++C + C ++C ++ CPY+ Y + +S++G V+D++ + G L
Sbjct: 155 SKHLSCSHQLCDKG-----SNCKSSQQQCPYMVSYLSENTSSSGLLVEDILHL-QSGGSL 208
Query: 189 QTTSTNGSLIFGCGARQSGN-LDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHC 247
+S ++ GCG +QSG LD A DG++G G SS+ S LA SG + F+ C
Sbjct: 209 SNSSVQAPVVLGCGMKQSGGYLDGV---APDGLLGLGPGESSVPSFLAKSGLIHDSFSLC 265
Query: 248 LDGINGGGIF--AIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDN 305
+ + G IF G +Q + PL Y I + + VG L + + F V
Sbjct: 266 FNEDDSGRIFFGDQGPTIQQSTSFLPLDGLYSTYIIGVESCCVGNSCLKMTS--FKVQ-- 321
Query: 306 KGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYT------CFQYSESVDEG 359
+DSGT+ +LP VY I+++ D +V+ + C+ S
Sbjct: 322 ----VDSGTSFTFLPGHVY-----GAIAEEFDQQVNGSRSSFEGSPWEYCYVPSSQELPK 372
Query: 360 FPNVTFHFENSVSLKVYPHEYLF 382
P++T F+ + S VY ++F
Sbjct: 373 VPSLTLTFQQNNSFVVYDPVFVF 395
>gi|226495667|ref|NP_001146721.1| uncharacterized protein LOC100280323 [Zea mays]
gi|219888491|gb|ACL54620.1| unknown [Zea mays]
Length = 557
Score = 107 bits (268), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 101/342 (29%), Positives = 141/342 (41%), Gaps = 46/342 (13%)
Query: 62 LPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELT 121
LP+ G+ PDG YY I IG PP+ Y++ VDTGSD+ W+ C + P +
Sbjct: 175 LPIKGNVFPDG--QYYTSIFIGNPPRPYFLDVDTGSDLTWIQC----DAPCTNFAKGPHP 228
Query: 122 LYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQY 181
LY + K V C + G C C Y Y D SS+ G +D
Sbjct: 229 LY---KPAKEKIVPPRDLLCQELQGN-QNYCETCKQCDYEIEYADQSSSMGVLARD---- 280
Query: 182 DKVSGDLQTTSTNG-----SLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLAS 236
D+ +TNG +FGC Q G L S+ + DGI+G + S SQLAS
Sbjct: 281 -----DMHMIATNGGREKLDFVFGCAYDQQGQLLSSPAKT-DGILGLSSAAISFPSQLAS 334
Query: 237 SGGVRKMFAHCLDGINGGGIFAI---GHVVQPEVNKTPLVPNQPH-YSINMTAVQVGLDF 292
G + +F HC+ GGG + +V + V T + + Y V+ G
Sbjct: 335 HGIIANVFGHCITREQGGGGYMFLGDDYVPRWGVTWTSIRSGPDNLYHTQAHHVKYGDQQ 394
Query: 293 LNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEY--TCF 350
L P G I DSG++ YLP +YE LV+ I P V D C+
Sbjct: 395 LRRPEQ---AGSTVQVIFDSGSSYTYLPNEIYENLVAAIKYASPGF-VQDTSDRTLPLCW 450
Query: 351 Q------YSESVDEGFPNVTFHFEN-----SVSLKVYPHEYL 381
+ Y E V + F + HF S + + P +YL
Sbjct: 451 KADFPVRYLEDVKQFFEPLNLHFGKKWLFMSKTFTISPEDYL 492
>gi|294461757|gb|ADE76437.1| unknown [Picea sitchensis]
Length = 325
Score = 107 bits (268), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 90/334 (26%), Positives = 152/334 (45%), Gaps = 46/334 (13%)
Query: 90 YVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPL 149
++ +DTGSDI W+ C C +C ++ +L+ S+T K + C+ C +
Sbjct: 2 FLLIDTGSDITWIQCDPCPQCYKQQD-----SLFQPAGSATYKPLPCNSTMCQQLQS--F 54
Query: 150 TDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNL 209
+ N+SC Y+ YGD S+T G F + + + D + + FGCG G
Sbjct: 55 SHSCLNSSCNYMVSYGDKSTTRGDFALETL---TLRSDDTILVSVPNFAFGCGHANKGLF 111
Query: 210 DSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGING---GGIFAIGH--VVQ 264
+ G++G GKS+ +Q + + G K+F++CL ++ GI G ++
Sbjct: 112 NGAA-----GLMGLGKSSIGFPAQTSVAFG--KVFSYCLPSVSSTIPSGILHFGEAAMLD 164
Query: 265 PEVNKTPLV-----PNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYL 319
+V TPLV P+Q Y ++MT + VG + L + V ++DSGT ++
Sbjct: 165 YDVRFTPLVDSSSGPSQ--YFVSMTGINVGDELLPISATV---------MVDSGTVISRF 213
Query: 320 PEMVYEPLVSKIISQQPDLKVH-TVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPH 378
+ YE L P L+ +V TCF+ S D P +T HF + L++ P
Sbjct: 214 EQSAYERLRDAFTQILPGLQTAVSVAPFDTCFRVSTVDDINIPLITLHFRDDAELRLSPV 273
Query: 379 EYLFPFED-LWCIGWQNSGMQSRDRKNMTLLGDF 411
L+P +D + C + S ++LG+F
Sbjct: 274 HILYPVDDGVMCFAFAPSS------SGRSVLGNF 301
>gi|356495752|ref|XP_003516737.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 396
Score = 107 bits (268), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 90/330 (27%), Positives = 147/330 (44%), Gaps = 31/330 (9%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKF 133
G Y K+ +GTPP D Y VDTGSD++W C C+ C R+ S +++ S+T
Sbjct: 48 GDYLMKLTLGTPPVDVYGLVDTGSDLVWAQCTPCQGCYRQKS-----PMFEPLRSNTYTP 102
Query: 134 VTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTST 193
+ CD E C+ ++G C+ C Y Y D S T G ++ V + G+
Sbjct: 103 IPCDSEECNSLFG---HSCSPQKLCAYSYAYADSSVTKGVLARETVTFSSTDGEPVVV-- 157
Query: 194 NGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGING 253
G ++FGCG SG + + + S++SQ + G ++ F+ CL +
Sbjct: 158 -GDIVFGCGHSNSGTFNENDMGIIGLG----GGPLSLVSQFGNLYGSKR-FSQCLVPFHA 211
Query: 254 G----GIFAIGH---VVQPEVNKTPLVPN--QPHYSINMTAVQVGLDFLNL-PTDVFGVG 303
G + G V V TPLV Q Y + + + VG F++ +++ G
Sbjct: 212 DPHTLGTISFGDASDVSGEGVAATPLVSEEGQTPYLVTLEGISVGDTFVSFNSSEMLSKG 271
Query: 304 DNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNV 363
+ +IDSGT YLP+ Y+ LV ++ Q L + D T Y + P +
Sbjct: 272 N---IMIDSGTPATYLPQEFYDRLVKELKVQSNMLPIDDDPDLGTQLCYRSETNLEGPIL 328
Query: 364 TFHFENSVSLKVYPHEYLFPFED-LWCIGW 392
HFE + +++ P + P +D ++C
Sbjct: 329 IAHFEGA-DVQLMPIQTFIPPKDGVFCFAM 357
>gi|15231625|ref|NP_191467.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|15983376|gb|AAL11556.1|AF424562_1 AT3g59080/F17J16_130 [Arabidopsis thaliana]
gi|7529751|emb|CAB86936.1| putative protein [Arabidopsis thaliana]
gi|20466704|gb|AAM20669.1| putative protein [Arabidopsis thaliana]
gi|23198236|gb|AAN15645.1| putative protein [Arabidopsis thaliana]
gi|332646352|gb|AEE79873.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 535
Score = 107 bits (268), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 96/344 (27%), Positives = 158/344 (45%), Gaps = 39/344 (11%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y+ + +G+PPK + + +DTGSD+ W+ C+ C +C +++ YD K S++
Sbjct: 166 GSGEYFMDVLVGSPPKHFSLILDTGSDLNWIQCLPCYDCFQQNG-----AFYDPKASASY 220
Query: 132 KFVTCDQEFCHGVYG-GPLTDCTA-NTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQ 189
K +TC+ + C+ V P C + N SCPY YGD S+TTG F + + +
Sbjct: 221 KNITCNDQRCNLVSSPDPPMPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTNGGS 280
Query: 190 TTSTN-GSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL 248
+ N +++FGCG G G+ S SQL S G F++CL
Sbjct: 281 SELYNVENMMFGCGHWNRGLFHGAAGLLGL-----GRGPLSFSSQLQSLYG--HSFSYCL 333
Query: 249 ------DGINGGGIFAIGH--VVQPEVNKTPLVPNQPH-----YSINMTAVQVGLDFLNL 295
++ IF + P +N T V + + Y + + ++ V + LN+
Sbjct: 334 VDRNSDTNVSSKLIFGEDKDLLSHPNLNFTSFVAGKENLVDTFYYVQIKSILVAGEVLNI 393
Query: 296 PTDVFGVGDN--KGTIIDSGTTLAYLPEMVYEPLVSKIISQQ----PDLKVHTVHDEYTC 349
P + + + + GTIIDSGTTL+Y E YE + +KI + P + + D C
Sbjct: 394 PEETWNISSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRDFPILDP--C 451
Query: 350 FQYSESVDEGFPNVTFHFENSVSLKVYPHE--YLFPFEDLWCIG 391
F S + P + F + +P E +++ EDL C+
Sbjct: 452 FNVSGIHNVQLPELGIAFADGAVWN-FPTENSFIWLNEDLVCLA 494
>gi|224126751|ref|XP_002329464.1| predicted protein [Populus trichocarpa]
gi|222870144|gb|EEF07275.1| predicted protein [Populus trichocarpa]
Length = 439
Score = 107 bits (268), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 113/414 (27%), Positives = 185/414 (44%), Gaps = 60/414 (14%)
Query: 29 FSVKYRYAGRERSLSLLK--EHDARR-QQRILAGVDLPLGGSSRPD-------GVGLYYA 78
F K ++ ++L+ + +H +R + R+ + L SS + G G +
Sbjct: 40 FRAKLKHVDSGKNLTKFERIQHGVKRGRHRLQRFKAMALVASSNSEIDAPVLPGNGEFLM 99
Query: 79 KIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQ 138
K+ IGTPP+ Y +DTGSD++W C C +C + + ++D K SS+ ++C
Sbjct: 100 KLAIGTPPETYSAIMDTGSDLIWTQCKPCTQCFDQPT-----PIFDPKKSSSFSKLSCSS 154
Query: 139 EFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLI 198
+ C L T + C YL YGD SST G + + + KVS +
Sbjct: 155 KLCEA-----LPQSTCSDGCEYLYGYGDYSSTQGMLASETLTFGKVSVP--------EVA 201
Query: 199 FGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGG---- 254
FGCG G+ S G++G G+ S++SQL F++CL ++
Sbjct: 202 FGCGEDNEGSGFSQG----SGLVGLGRGPLSLVSQLK-----EPKFSYCLTSVDDTKAST 252
Query: 255 ---GIFAIGHVVQPEVNKTPLVPN--QPH-YSINMTAVQVGLDFLNLPTDVFGVGDN--K 306
G A E+ TPL+ N QP Y +++ + VG L + F + ++
Sbjct: 253 LLMGSLASVKASDSEIKTTPLIQNSAQPSFYYLSLEGISVGDTSLPIKKSTFSLQEDGSG 312
Query: 307 GTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDE--YTCFQY-SESVDEGFPNV 363
G IIDSGTT+ YL + ++ LV+K + Q +L V CF S S D P +
Sbjct: 313 GLIIDSGTTITYLEQSAFD-LVAKEFTSQINLPVDNSGSTGLEVCFTLPSGSTDIEVPKL 371
Query: 364 TFHFENSVSLKVYPHEYLFPFEDL--WCIGW-QNSGMQ---SRDRKNMTLLGDF 411
FHF+ + L++ Y+ + C+ +SGM + ++NM +L D
Sbjct: 372 VFHFDGA-DLELPAENYMIADASMGVACLAMGSSSGMSIFGNIQQQNMLVLHDL 424
>gi|359496797|ref|XP_002277380.2| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
vinifera]
Length = 358
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 89/285 (31%), Positives = 129/285 (45%), Gaps = 31/285 (10%)
Query: 51 RRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKEC 110
++ R V +PL + G G YY K+G G+P + Y + VDTGS + W +QCK C
Sbjct: 94 KKDIRFPKSVSVPLNPGAS-IGSGNYYVKVGFGSPARYYSMIVDTGSSLSW---LQCKPC 149
Query: 111 PRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTS---CPYLEIYGDG 167
+ + L+D S T K ++C C + L + TS C Y YGD
Sbjct: 150 VVYCHVQAD-PLFDPSASKTYKSLSCTSSQCSSLVDATLNNPLCETSSNVCVYTASYGDS 208
Query: 168 SSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSN 227
S + GY QD++ L + T ++GCG G GI+G G++
Sbjct: 209 SYSMGYLSQDLLT-------LAPSQTLPGFVYGCGQDSDGLFGRAA-----GILGLGRNK 256
Query: 228 SSMISQLASSGGVRKMFAHCLDGINGGGIFAIGH--VVQPEVNKTPLV--PNQPH-YSIN 282
SM+ Q++S G F++CL GGG +IG + TP+ P P Y +
Sbjct: 257 LSMLGQVSSKFGY--AFSYCLPTRGGGGFLSIGKASLAGSAYKFTPMTTDPGNPSLYFLR 314
Query: 283 MTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPL 327
+TA+ VG L + + V TIIDSGT + LP VY P
Sbjct: 315 LTAITVGGRALGVAAAQYRV----PTIIDSGTVITRLPMSVYTPF 355
>gi|302781668|ref|XP_002972608.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
gi|300160075|gb|EFJ26694.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
Length = 430
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 102/351 (29%), Positives = 148/351 (42%), Gaps = 54/351 (15%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKE----CPRRSSLGIELTLYDIKD 127
G+G Y + GTPP++ + DTGSD++W+ C CP+++ +
Sbjct: 50 GLGQYLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKAC--SRRPAFVASK 107
Query: 128 SSTGKFVTCDQEFCHGVYG----GPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDK 183
S+T V C C V GP A C Y Y DGSSTTG+ +D
Sbjct: 108 SATLSVVPCSAAQCLLVPAPRGHGPSCSPAAPVPCGYAYDYADGSSTTGFLARDTA---T 164
Query: 184 VSGDLQTTSTNGSLIFGCGAR-QSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRK 242
+S + + FGCG R Q G+ T G+IG G+ S +Q S +
Sbjct: 165 ISNGTSGGAAVRGVAFGCGTRNQGGSFSGTG-----GVIGLGQGQLSFPAQSGSL--FAQ 217
Query: 243 MFAHCLDGINGG------GIFAIGHVVQPEVNK----TPLVPN---QPHYSINMTAVQVG 289
F++CL + GG +G +PE TPLV N Y + + A++VG
Sbjct: 218 TFSYCLLDLEGGRRGRSSSFLFLG---RPERRAAFAYTPLVSNPLAPTFYYVGVVAIRVG 274
Query: 290 LDFLNLP-----TDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVH 344
L +P DV G N GT+IDSG+TL YL Y LVS + ++ +
Sbjct: 275 NRVLPVPGSEWAIDVLG---NGGTVIDSGSTLTYLRLGAYLHLVSAFAASVHLPRIPSSA 331
Query: 345 DEYT----CFQYSES-----VDEGFPNVTFHFENSVSLKVYPHEYLFPFED 386
+ C+ S S + GFP +T F +SL++ YL D
Sbjct: 332 TFFQGLELCYNVSSSSSLAPANGGFPRLTIDFAQGLSLELPTGNYLVDVAD 382
>gi|147786881|emb|CAN62311.1| hypothetical protein VITISV_008929 [Vitis vinifera]
Length = 408
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 90/369 (24%), Positives = 164/369 (44%), Gaps = 33/369 (8%)
Query: 43 SLLKEHDARRQQRILAGVDLPLGGSSRPDGVG-LYYAKIGIGTPPKDYYVQVDTGSDIMW 101
SL + + RR+ R A + + + D G + +G PP V +DTGSD++W
Sbjct: 25 SLDRNNVERRRTRRAAFIXDEIQANMVADDRGQAFLVNFSVGRPPVPQLVGIDTGSDLLW 84
Query: 102 VNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYL 161
V C C +C R+S+ ++D SST ++ D C P C Y
Sbjct: 85 VQCRPCADCFRQST-----PIFDPSKSSTYVDLSYDSPICP---NSPQKKYNHLNQCIYN 136
Query: 162 EIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGII 221
Y DGS+++G + + ++ Q T T S++FGCG G D GI+
Sbjct: 137 ASYADGSTSSGNLATEDIVFETSD---QGTVTVSSVVFGCGHSNRGRFDGQQS----GIL 189
Query: 222 GFGKSNSSMISQLASSGGVRKMFAHCL----DGINGGGIFAIGHVVQPEVNKTPLVPNQP 277
G + S++S+L S F++C+ D +G V+ E + TP
Sbjct: 190 GLSAGDQSIVSRLGSR------FSYCIGDLFDPHYTHNQLVLGDGVKMEGSSTPFHTFNG 243
Query: 278 HYSINMTAVQVGLDFLNLPTDVFGVGDN--KGTIIDSGTTLAYLPEMVYEPL---VSKII 332
Y + + + VG L++ +VF ++ G ++DSGTT +L + ++PL + +++
Sbjct: 244 FYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMDSGTTATFLAKDGFDPLSNEIQRLV 303
Query: 333 SQQPDLKVHTVHDEYTCFQYSESVD-EGFPNVTFHFENSVSLKVYPHE-YLFPFEDLWCI 390
++ + C++ + D GFP + FHF L + + ++ +D++C+
Sbjct: 304 RGHFQQVIYRTIPGWLCYKGRVNEDLRGFPELAFHFAEGADLVLDANSLFVQKNQDVFCL 363
Query: 391 GWQNSGMQS 399
S +++
Sbjct: 364 AVLESNLKN 372
>gi|296090291|emb|CBI40110.3| unnamed protein product [Vitis vinifera]
Length = 408
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 90/369 (24%), Positives = 164/369 (44%), Gaps = 33/369 (8%)
Query: 43 SLLKEHDARRQQRILAGVDLPLGGSSRPDGVG-LYYAKIGIGTPPKDYYVQVDTGSDIMW 101
SL + + RR+ R A + + + D G + +G PP V +DTGSD++W
Sbjct: 25 SLDRNNVERRRTRRAAFITDEIQANMVADDRGQAFLVNFSVGRPPVPQLVGIDTGSDLLW 84
Query: 102 VNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYL 161
V C C +C R+S+ ++D SST ++ D C P C Y
Sbjct: 85 VQCRPCADCFRQST-----PIFDPSKSSTYVDLSYDSPICP---NSPQKKYNHLNQCIYN 136
Query: 162 EIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGII 221
Y DGS+++G + + ++ Q T T S++FGCG G D GI+
Sbjct: 137 ASYADGSTSSGNLATEDIVFETSD---QGTVTVSSVVFGCGHSNRGRFDGQQS----GIL 189
Query: 222 GFGKSNSSMISQLASSGGVRKMFAHCL----DGINGGGIFAIGHVVQPEVNKTPLVPNQP 277
G + S++S+L S F++C+ D +G V+ E + TP
Sbjct: 190 GLSAGDQSIVSRLGSR------FSYCIGDLFDPHYTHNQLVLGDGVKMEGSSTPFHTFNG 243
Query: 278 HYSINMTAVQVGLDFLNLPTDVFGVGDN--KGTIIDSGTTLAYLPEMVYEPL---VSKII 332
Y + + + VG L++ +VF ++ G ++DSGTT +L + ++PL + +++
Sbjct: 244 FYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMDSGTTATFLAKDGFDPLSNEIQRLV 303
Query: 333 SQQPDLKVHTVHDEYTCFQYSESVD-EGFPNVTFHFENSVSLKVYPHE-YLFPFEDLWCI 390
++ + C++ + D GFP + FHF L + + ++ +D++C+
Sbjct: 304 RGHFQQVIYRTIPGWLCYKGRVNEDLRGFPELAFHFAEGADLVLDANSLFVQKNQDVFCL 363
Query: 391 GWQNSGMQS 399
S +++
Sbjct: 364 AVLESNLKN 372
>gi|302780575|ref|XP_002972062.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
gi|300160361|gb|EFJ26979.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
Length = 429
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 103/354 (29%), Positives = 147/354 (41%), Gaps = 60/354 (16%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKE----CPRRSSLGIELTLYDIKD 127
G+G Y + GTPP++ + DTGSD++W+ C CP+++ +
Sbjct: 49 GLGQYLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKAC--SRRPAFVASK 106
Query: 128 SSTGKFVTCDQEFCHGVYG----GPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDK 183
S+T V C C V GP A C Y Y DGSSTTG+ +D
Sbjct: 107 SATLSVVPCSAAQCLLVPAPRGHGPACSPAAPVPCGYAYDYADGSSTTGFLARDTA---T 163
Query: 184 VSGDLQTTSTNGSLIFGCGAR-QSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRK 242
+S + + FGCG R Q G+ T G+IG G+ S +Q S +
Sbjct: 164 ISNGTSGGAAVRGVAFGCGTRNQGGSFSGTG-----GVIGLGQGQLSFPAQ--SGSLFAQ 216
Query: 243 MFAHCLDGINGG------GIFAIGHVVQPEVNK----TPLVPN---QPHYSINMTAVQVG 289
F++CL + GG +G +PE TPLV N Y + + A++VG
Sbjct: 217 TFSYCLLDLEGGRRGRSSSFLFLG---RPERRAAFAYTPLVSNPLAPTFYYVGVVAIRVG 273
Query: 290 LDFLNLP-----TDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVH 344
L +P DV G N GT+IDSG+TL YL Y LVS + + + +
Sbjct: 274 NRVLPVPGSEWAIDVLG---NGGTVIDSGSTLTYLRLGAYLHLVSAFAA---SVHLPRIP 327
Query: 345 DEYTCFQYSE------------SVDEGFPNVTFHFENSVSLKVYPHEYLFPFED 386
T FQ E + GFP +T F +SL++ YL D
Sbjct: 328 SSATFFQGLELCYNVSSSSSSAPANGGFPRLTIDFAQGLSLELPTGNYLVDVAD 381
>gi|225446261|ref|XP_002265547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 440
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 90/369 (24%), Positives = 164/369 (44%), Gaps = 33/369 (8%)
Query: 43 SLLKEHDARRQQRILAGVDLPLGGSSRPDGVG-LYYAKIGIGTPPKDYYVQVDTGSDIMW 101
SL + + RR+ R A + + + D G + +G PP V +DTGSD++W
Sbjct: 57 SLDRNNVERRRTRRAAFITDEIQANMVADDRGQAFLVNFSVGRPPVPQLVGIDTGSDLLW 116
Query: 102 VNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYL 161
V C C +C R+S+ ++D SST ++ D C P C Y
Sbjct: 117 VQCRPCADCFRQST-----PIFDPSKSSTYVDLSYDSPICP---NSPQKKYNHLNQCIYN 168
Query: 162 EIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGII 221
Y DGS+++G + + ++ Q T T S++FGCG G D GI+
Sbjct: 169 ASYADGSTSSGNLATEDIVFETSD---QGTVTVSSVVFGCGHSNRGRFDGQQS----GIL 221
Query: 222 GFGKSNSSMISQLASSGGVRKMFAHCL----DGINGGGIFAIGHVVQPEVNKTPLVPNQP 277
G + S++S+L S F++C+ D +G V+ E + TP
Sbjct: 222 GLSAGDQSIVSRLGSR------FSYCIGDLFDPHYTHNQLVLGDGVKMEGSSTPFHTFNG 275
Query: 278 HYSINMTAVQVGLDFLNLPTDVFGVGDN--KGTIIDSGTTLAYLPEMVYEPL---VSKII 332
Y + + + VG L++ +VF ++ G ++DSGTT +L + ++PL + +++
Sbjct: 276 FYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMDSGTTATFLAKDGFDPLSNEIQRLV 335
Query: 333 SQQPDLKVHTVHDEYTCFQYSESVD-EGFPNVTFHFENSVSLKVYPHE-YLFPFEDLWCI 390
++ + C++ + D GFP + FHF L + + ++ +D++C+
Sbjct: 336 RGHFQQVIYRTIPGWLCYKGRVNEDLRGFPELAFHFAEGADLVLDANSLFVQKNQDVFCL 395
Query: 391 GWQNSGMQS 399
S +++
Sbjct: 396 AVLESNLKN 404
>gi|224133616|ref|XP_002327639.1| predicted protein [Populus trichocarpa]
gi|222836724|gb|EEE75117.1| predicted protein [Populus trichocarpa]
Length = 484
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 83/260 (31%), Positives = 127/260 (48%), Gaps = 24/260 (9%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWV--NCIQCKECPRRSSLGIELTLYDIKDSS 129
G L+YA + +GTP + V +DTGS+++W+ +C C R S ++L +Y SS
Sbjct: 58 GYILHYANVSVGTPSVSFLVALDTGSNLLWLPCDCSSCVHSLRSPSGTVDLNIYSPNTSS 117
Query: 130 TGKFVTCDQEFCHGVYGGPLTDCTANTS-CPYLEIY-GDGSSTTGYFVQDVVQYDKVSGD 187
T + V C+ C C ++ S CPY +Y +G+STTGY VQD++ +S D
Sbjct: 118 TSEKVPCNSTLCSQTQ---RDRCPSDQSNCPYQVVYLSNGTSTTGYIVQDLLHL--ISDD 172
Query: 188 LQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHC 247
Q+ + + + FGCG Q+G+ A +G+ G G SN S+ S LA +G F+ C
Sbjct: 173 SQSKAVDAKITFGCGKVQTGSF--LTGGAPNGLFGLGMSNISVPSTLAHNGYTSGSFSMC 230
Query: 248 LDGINGGGIFAIGHVVQPEVNKTPLVPNQPH---YSINMTAVQVGLDFLNLPTDVFGVGD 304
NG G + G +T QP Y+I++T +G +L V+
Sbjct: 231 FSP-NGIGRISFGDKGSTGQGETSFNQGQPRSSLYNISITQTSIGGQASDL---VYSA-- 284
Query: 305 NKGTIIDSGTTLAYLPEMVY 324
I DSGT+ YL + Y
Sbjct: 285 ----IFDSGTSFTYLNDPAY 300
>gi|242079447|ref|XP_002444492.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
gi|241940842|gb|EES13987.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
Length = 441
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 97/369 (26%), Positives = 159/369 (43%), Gaps = 53/369 (14%)
Query: 44 LLKEHDARRQQRILAGVDLPLGGSSRPDGV-----------GLYYAKIGIGTPPKDYYVQ 92
LL AR + R+ A + + D + G Y + IGTPP Y
Sbjct: 46 LLSRAIARSKARVAALQSAAVSPAPVADPITAARVLVTASSGEYLVDLAIGTPPLYYTAI 105
Query: 93 VDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDC 152
+DTGSD++W C C C + + +D+K S+T + + C C + C
Sbjct: 106 MDTGSDLIWTQCAPCLLCAAQPT-----PYFDVKRSATYRALPCRSSRCAALSS---PSC 157
Query: 153 TANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDST 212
C Y YGD +ST G + + S T ++ FGCG+ +G L ++
Sbjct: 158 FKKM-CVYQYYYGDTASTAGVLANETFTFGAAS---STKVRAANISFGCGSLNAGELANS 213
Query: 213 NEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGG-------GIFAIGHVVQP 265
+ G++GFG+ S++SQL S F++CL G+FA +
Sbjct: 214 S-----GMVGFGRGPLSLVSQLGPS-----RFSYCLTSYLSPTPSRLYFGVFANLNSTNT 263
Query: 266 E----VNKTPLV--PNQPH-YSINMTAVQVGLDFLNLPTDVFGVGDN--KGTIIDSGTTL 316
V TP V P P+ Y +++ + +G L + VF + D+ G IIDSGT++
Sbjct: 264 SSGSPVQSTPFVINPALPNMYFLSVKGISLGTKRLPIDPLVFAINDDGTGGVIIDSGTSI 323
Query: 317 AYLPEMVYEPLVSKIISQQPDLKVHTVHDEY-TCFQY--SESVDEGFPNVTFHFENSVSL 373
+L + YE + + S P ++ TCFQ+ +V P+ FHF+ + ++
Sbjct: 324 TWLQQDAYEAVRRGLASTIPLPAMNDTDIGLDTCFQWPPPPNVTVTVPDFVFHFDGA-NM 382
Query: 374 KVYPHEYLF 382
+ P Y+
Sbjct: 383 TLPPENYML 391
>gi|297825301|ref|XP_002880533.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
lyrata]
gi|297326372|gb|EFH56792.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
lyrata]
Length = 430
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 92/352 (26%), Positives = 156/352 (44%), Gaps = 40/352 (11%)
Query: 75 LYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFV 134
L++ +G PP + +DTGS ++W+ C CK C SS + +++ SST
Sbjct: 67 LFFVNFSVGQPPVPQFTIMDTGSSLLWIQCHPCKHC---SSNHMIHPVFNPALSSTFVEC 123
Query: 135 TCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTN 194
+CD FC P C++N C Y ++Y G+ + G ++ + + +G+ T
Sbjct: 124 SCDDRFCR---YAPNGHCSSN-KCVYEQVYISGTGSKGVLAKERLTFTTPNGN---TVVT 176
Query: 195 GSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGIN-- 252
+ FGCG L+S GI+G G +S+ QL S F++C+ +
Sbjct: 177 QPIAFGCGHENGEQLES----EFTGILGLGAKPTSLAVQLGSK------FSYCIGDLANK 226
Query: 253 --GGGIFAIGHVVQPEVNKTPLVPNQPH--YSINMTAVQVGLDFLNLPTDVFG-VGDNKG 307
G +G + TP+ + Y +N+ + VG LN+ VF G G
Sbjct: 227 NYGYNQLVLGEDADILGDPTPIEFETENGIYYMNLEGISVGDKQLNIEPVVFKRRGSRTG 286
Query: 308 TIIDSGTTLAYLPEMVYEPLVSKIIS-QQPDLKVHTVHDEYTCFQYSESVDE---GFPNV 363
I+D+GT +L ++ Y L ++I S P L+ D + C Y V+E GFP V
Sbjct: 287 VILDTGTLYTWLADIAYRELYNEIKSILDPKLERFWFRD-FLC--YHGRVNEELIGFPVV 343
Query: 364 TFHFENSVSLKVYPHEYLFP------FEDLWCIGWQNSGMQSRDRKNMTLLG 409
TFHF L + +P + +++C+ + + + K+ T +G
Sbjct: 344 TFHFAGGAELAMEATSMFYPMTESDTYHNVFCMSVRPTTEHGGEYKDFTAIG 395
>gi|4490316|emb|CAB38807.1| nucellin-like protein [Arabidopsis thaliana]
gi|7270297|emb|CAB80066.1| nucellin-like protein [Arabidopsis thaliana]
Length = 420
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 93/344 (27%), Positives = 148/344 (43%), Gaps = 44/344 (12%)
Query: 57 LAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNC----IQCKECPR 112
++ V P+ G+ P +G Y I IG PP+ YY+ +DTGSD+ W+ C ++C E P
Sbjct: 21 VSSVVFPVHGNVYP--LGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPH 78
Query: 113 RSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTG 172
LY + + C+ C ++ C C Y Y DG S+ G
Sbjct: 79 --------PLYQ----PSSDLIPCNDPLCKALHLNSNQRCETPEQCDYEVEYADGGSSLG 126
Query: 173 YFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMIS 232
V+DV + G L+ T L GCG Q +++ LDG++G G+ S++S
Sbjct: 127 VLVRDVFSMNYTQG-LRLTP---RLALGCGYDQIPG--ASSHHPLDGVLGLGRGKVSILS 180
Query: 233 QLASSGGVRKMFAHCLDGINGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVG-LD 291
QL S G V+ + HCL + GGGI G + + ++ P YS + + G L
Sbjct: 181 QLHSQGYVKNVIGHCLSSL-GGGILFFGDDLY-DSSRVSWTPMSREYSKHYSPAMGGELL 238
Query: 292 FLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEP---LVSKIISQQPDLKVHTVHDEYT 348
F T + N T+ DSG++ Y Y+ L+ + +S +P + H
Sbjct: 239 FGGRTTGL----KNLLTVFDSGSSYTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPL 294
Query: 349 CFQYS------ESVDEGFPNVTFHFE----NSVSLKVYPHEYLF 382
C+Q E V + F + F+ + ++ P YL
Sbjct: 295 CWQGRRPFMSIEEVKKYFKPLALSFKTGWRSKTLFEIPPEAYLI 338
>gi|326513755|dbj|BAJ87896.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 442
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 100/362 (27%), Positives = 146/362 (40%), Gaps = 63/362 (17%)
Query: 76 YYAKIGIGTP-PKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFV 134
Y + IG P + + +DTGSD++W C C EC L +D S+T + V
Sbjct: 92 YLIHLSIGAPRSQPVVLTLDTGSDVVWTQCEPCAEC-----FTQPLPRFDTAASNTVRSV 146
Query: 135 TCDQEFC--HGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTS 192
C C H +G L CT Y+ YGDGS + G+F++D +D G + T
Sbjct: 147 ACSDPLCNAHSEHGCFLHGCT------YVSGYGDGSLSFGHFLRDSFTFDDGKGGGKVTV 200
Query: 193 TNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHC----- 247
+ + FGCG +G T GI GFG+ S+ SQL VR+ F++C
Sbjct: 201 PD--IGFGCGMYNAGRFLQTET----GIAGFGRGPLSLPSQLK----VRQ-FSYCFTTRF 249
Query: 248 --------LDGINGGGIFAIGHVVQ-PEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTD 298
L G A G ++ P V P + HY ++ V VG LP
Sbjct: 250 EAKSSPVFLGGAGDLKAHATGPILSTPFVRSLPPGTDNSHYVLSFKGVTVGK--TRLPVP 307
Query: 299 VFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDE 358
+ T IDSGT + P+ V+ L S I+Q T ++ CF +
Sbjct: 308 EIKADGSGATFIDSGTDITTFPDAVFRQLKSAFIAQAALPVNKTADEDDICFSWDGKKTA 367
Query: 359 GFPNVTFHFENSVSLKVYPHEYLFPFEDL---------WCIGWQNSGMQSRDRKNMTLLG 409
P + FH E + ++ P E+ C+ SG R TL+G
Sbjct: 368 AMPKLVFHLEGA--------DWDLPRENYVTEDRESGQVCVAVSTSGQMDR-----TLIG 414
Query: 410 DF 411
+F
Sbjct: 415 NF 416
>gi|449454652|ref|XP_004145068.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449470630|ref|XP_004153019.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 435
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 109/413 (26%), Positives = 181/413 (43%), Gaps = 40/413 (9%)
Query: 9 LCIVLIATAAVGGVSSNHGVFSVK--YRYAGRERSLSLLKEHDAR----RQQRILAGVDL 62
+ I LI+TA V + F+V+ +R + + + L+ H R ++ I L
Sbjct: 10 VIIFLISTAVVSAATGPDYGFTVELIHRDSPKSPMYNPLENHYHRVADTLRRSISHNTGL 69
Query: 63 PLGGSSRP--DGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIEL 120
P + G Y K+ +GTPP DTGSDI+W C C C ++ +L
Sbjct: 70 VTNTVEAPIYNNRGEYLMKLSVGTPPFPIIAVADTGSDIIWTQCEPCTNCYQQ-----DL 124
Query: 121 TLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQ 180
+++ S+T + V+C C + G C+ C Y YGD S + G F D +
Sbjct: 125 PMFNPSKSTTYRKVSCSSPVCS--FTGEDNSCSFKPDCTYSISYGDNSHSQGDFAVDTLT 182
Query: 181 YDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGV 240
SG + GCG +G+ D+ + GI+G G +S+I Q+ S+ G
Sbjct: 183 MGSTSGRVVAFPRTA---IGCGHDNAGSFDAN----VSGIVGLGLGPASLIKQMGSAVGG 235
Query: 241 RKMFAHCLDGI--NGGGIFAIGHVVQPEVN-----KTPLVPN---QPHYSINMTAVQVGL 290
+ F++CL I + GG + V+ TP+ + + YS+ + AV VG
Sbjct: 236 K--FSYCLTPIGNDDGGSNKLNFGSNANVSGSGAVSTPIYISDKFKSFYSLKLKAVSVGR 293
Query: 291 DFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCF 350
+ T +G IIDSGTTL LP +Y +K IS +L+ +++ +
Sbjct: 294 NNTFYSTANSILGGKANIIIDSGTTLTLLPVDLYHNF-AKAISNSINLQRTDDPNQFLEY 352
Query: 351 QYSESVDE-GFPNVTFHFENSVSLKVYPHEYLFPFED-LWCIGWQNSGMQSRD 401
+ + D+ P + HFE + +L++ L D + C+ + +G Q D
Sbjct: 353 CFETTTDDYKVPFIAMHFEGA-NLRLQRENVLIRVSDNVICLAF--AGAQDND 402
>gi|357489329|ref|XP_003614952.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355516287|gb|AES97910.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 530
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 100/357 (28%), Positives = 157/357 (43%), Gaps = 35/357 (9%)
Query: 37 GRERSLSLLKEHDARRQQRILAGVD---LPLGGSSR----PDGVGLYYAKIGIGTPPKDY 89
G LL D RQ+ L D P GS D V L+Y I IGTP +
Sbjct: 56 GSSEYFRLLLNSDLTRQKMKLGSQDQSFYPSEGSKTLSFGNDFVWLHYTWIDIGTPNVSF 115
Query: 90 YVQVDTGSDIMWVNCIQCKECPRRS-----SLGIELTLYDIKDSSTGKFVTCDQEFCHGV 144
V +DTGSD+ WV C C EC S +L +L Y SS+ + + C + C+
Sbjct: 116 LVALDTGSDMFWVPC-DCIECAPLSAAFYNALDRDLNQYSPSLSSSSRHLPCGHQLCNQN 174
Query: 145 YGGPLTDCTA-NTSCPYLEIY-GDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCG 202
++C CPY++ Y D +S++G+ ++D + S + S S+I GCG
Sbjct: 175 -----SNCKGFKDRCPYIKEYTSDNTSSSGFLIEDKLHL--ASNNATKNSIQASVILGCG 227
Query: 203 ARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIF--AIG 260
+QSG A +G++G G + S+ + LA +G +R + CL+ G I G
Sbjct: 228 RKQSGYF--LEGAAPNGMLGLGPGSISVPALLAKAGLIRNSISICLNEKGSGRILFGDQG 285
Query: 261 HVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLP 320
H Q TP + + + VG++ + + + + K ID+GT+ YLP
Sbjct: 286 HATQRR--STPFLLDDGE----LLNYFVGVERFCVGSFCYKETEFKA-FIDTGTSFTYLP 338
Query: 321 EMVYEPLVSKIISQQPDLKV--HTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKV 375
+ VYE +V++ Q ++ D C+ S FP + F F + S +
Sbjct: 339 KGVYETVVAEFEKQVHATRITSQIQSDFNCCYNASSRESNNFPPMKFTFSKNQSFII 395
>gi|413944392|gb|AFW77041.1| hypothetical protein ZEAMMB73_800604 [Zea mays]
Length = 476
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 89/303 (29%), Positives = 137/303 (45%), Gaps = 35/303 (11%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCK-ECPRRSSLGIELTLYDIKDSSTGKFV 134
+ +G GTP + Y V DTGSD+ W+ C+ C C ++ ++D S+T V
Sbjct: 135 FVVTVGFGTPAQTYTVIFDTGSDVSWIQCLPCSGHCYKQHD-----PIFDPTKSATYSVV 189
Query: 135 TCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTN 194
C C G + C+ N +C Y YGDGSS+ G V+ ++ +S L +T
Sbjct: 190 PCGHPQCAAADG---SKCS-NGTCLYKVEYGDGSSSAG-----VLSHETLS--LTSTRAL 238
Query: 195 GSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGG 254
FGCG G+ +DG+IG G+ S+ SQ A+S G F++CL N
Sbjct: 239 PGFAFGCGQTNLGDFGD-----VDGLIGLGRGQLSLSSQAAASFG--GTFSYCLPSDNTT 291
Query: 255 -GIFAIGHVVQP---EVNKTPLVPNQPH---YSINMTAVQVGLDFLNLPTDVFGVGDNKG 307
G IG +V T +V Q + Y + + ++ +G L +P +F + G
Sbjct: 292 HGYLTIGPTTPASNDDVQYTAMVQKQDYPSFYFVELVSIDIGGYILPVPPTLF---TDDG 348
Query: 308 TIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEY-TCFQYSESVDEGFPNVTFH 366
T +DSGT L YLP Y L + K +D + TC+ ++ P V+F
Sbjct: 349 TFLDSGTILTYLPPEAYTALRDRFKFTMTQYKPAPAYDPFDTCYDFTGQSAIFIPAVSFK 408
Query: 367 FEN 369
F +
Sbjct: 409 FSD 411
>gi|147839328|emb|CAN63378.1| hypothetical protein VITISV_015700 [Vitis vinifera]
Length = 585
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 93/308 (30%), Positives = 140/308 (45%), Gaps = 36/308 (11%)
Query: 27 GVFSVKYRYAGRERSLSLLKEHDARRQQRILAGVDLPLGGSS-RPDGVG-LYYAKIGIGT 84
G F A R+R+L RR I + G S+ R +G L+Y + +GT
Sbjct: 58 GSFEYYAELAHRDRALR------GRRLSDIDGLLTFSDGNSTFRISSLGFLHYTTVSLGT 111
Query: 85 PPKDYYVQVDTGSDIMWVNCIQCKECPRRS----SLGIELTLYDIKDSSTGKFVTCDQEF 140
P K + V +DTGSD+ WV C C C + EL++Y+ K SST + VTC+
Sbjct: 112 PGKKFLVALDTGSDLFWVPC-DCSRCAPTEGTTYASDFELSIYNPKGSSTSRKVTCNNSL 170
Query: 141 CHGVYGGPLTDCTANTS-CPYLEIYGDG-SSTTGYFVQDVVQYDKVSGDLQTTSTNGSLI 198
C C S CPY+ Y +ST+G V+DV+ + D + +
Sbjct: 171 C-----AHRNRCLGTFSNCPYMVSYVSAETSTSGILVEDVLHL--TTEDNRQEFVEAYVT 223
Query: 199 FGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFA 258
FGCG Q+G+ + A +G+ G G S+ S L+ G F+ C G +G G +
Sbjct: 224 FGCGQVQTGSF--LDIAAPNGLFGLGLEKISVPSILSKEGFTADSFSMCF-GPDGIGRIS 280
Query: 259 IGHVVQPEVNKTPLVPN--QPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTL 316
G P+ +TP N P Y+I +T V+VG ++L + + DSGT+
Sbjct: 281 FGDKGGPDQEETPFNLNALHPTYNITVTQVRVGTTLIDL---------DFTALFDSGTSF 331
Query: 317 AYLPEMVY 324
YL + +Y
Sbjct: 332 TYLVDPIY 339
>gi|242066176|ref|XP_002454377.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
gi|241934208|gb|EES07353.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
Length = 474
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 102/351 (29%), Positives = 152/351 (43%), Gaps = 54/351 (15%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y + +GTP ++VDTGSD+ WV QC C + + L+D SS+ V
Sbjct: 140 YVVTVSLGTPGVAQTLEVDTGSDLSWV---QCTPCAAPACYSQKDPLFDPAQSSSYAAVP 196
Query: 136 CDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQY---DKVSGDLQTTS 192
C C G+ G + C+A C Y+ YGDGS TTG + D + D V G
Sbjct: 197 CGGPVCGGL-GIYASSCSA-AQCGYVVSYGDGSKTTGVYSSDTLTLSPNDAVRG------ 248
Query: 193 TNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASS-GGVRKMFAHCLDG- 250
FGCG QSG + DG++G G+ +S++ Q A + GGV F++CL
Sbjct: 249 ----FFFGCGHAQSGFTGN------DGLLGLGREEASLVEQTAGTYGGV---FSYCLPTR 295
Query: 251 INGGGIFAIG---HVVQPEVNKTPLV--PNQP-HYSINMTAVQVGLDFLNLPTDVFGVGD 304
+ G +G P + T L+ PN +Y + +T + VG L++P+ VF
Sbjct: 296 PSTTGYLTLGGPSGAAPPGFSTTQLLSSPNAATYYVVMLTGISVGGQQLSVPSSVFA--- 352
Query: 305 NKGTIIDSGTTLAYLPEMVYEPLVSKIISQQ-----PDLKVHTVHDEYTCFQYSESVDEG 359
GT++D+GT + LP Y L S S P + D TC+ +S
Sbjct: 353 -GGTVVDTGTVITRLPPTAYAALRSAFRSGMASYGYPSAPATGILD--TCYNFSGYGTVT 409
Query: 360 FPNVTFHFENSVSLKVYPHEYLFPFEDLWCIGWQNSGMQSRDRKNMTLLGD 410
PNV F ++ + L C+ + SG M +LG+
Sbjct: 410 LPNVALTFSGGATVTLGADGIL----SFGCLAFAPSG----SDGGMAILGN 452
>gi|168005153|ref|XP_001755275.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693403|gb|EDQ79755.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 429
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 103/360 (28%), Positives = 147/360 (40%), Gaps = 39/360 (10%)
Query: 47 EHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQ 106
E AR + +LAG L + G G Y I G PP+ VDTGSD+ WV C+
Sbjct: 63 ERRARLAKHVLAGDQL--FETPVASGNGEYLIDISYGNPPQKSTAIVDTGSDLNWVQCLP 120
Query: 107 CKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGD 166
CK C S +D S++ K + C FC + P C A SC Y +YGD
Sbjct: 121 CKSCYETLS-----AKFDPSKSASYKTLGCGSNFCQDL---PFQSCAA--SCQYDYMYGD 170
Query: 167 GSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKS 226
GSST+G D V T ++ FGCG G G
Sbjct: 171 GSSTSGALSTDDVTIG--------TGKIPNVAFGCGNSNLGTFAGAGGLVGLGKGPL--- 219
Query: 227 NSSMISQLASSGGVRKMFAHCLD--GINGGGIFAIG-HVVQPEVNKTPLVPNQPH---YS 280
S++SQL G K F++CL G IG + V TP++ N + Y
Sbjct: 220 --SLVSQLG--GTATKKFSYCLVPLGSTKTSPLYIGDSTLAGGVAYTPMLTNNNYPTFYY 275
Query: 281 INMTAVQVGLDFLNLPTDVFGVGDN--KGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDL 338
+ + V +N P + F + G I+DSGTTL YL + P+V+ + + P
Sbjct: 276 AELQGISVEGKAVNYPANTFDIAATGRGGLILDSGTTLTYLDVDAFNPMVAALKAALPYP 335
Query: 339 KVH-TVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFP--FEDLWCIGWQNS 395
+ + + CF + + +P V FHF N + + P FE C+ +S
Sbjct: 336 EADGSFYGLEYCFSTAGVANPTYPTVVFHF-NGADVALAPDNTFIALDFEGTTCLAMASS 394
>gi|357476337|ref|XP_003608454.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355509509|gb|AES90651.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 683
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 99/355 (27%), Positives = 157/355 (44%), Gaps = 54/355 (15%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKF 133
G Y ++ IGTPP+ + + VDTGS + +V C C++C R + L SST +
Sbjct: 79 GYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQCGRHQDPKFQPDL-----SSTYQP 133
Query: 134 VTCDQEFCHGVYGGPLTDCTANT---SCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQT 190
V C DC + C Y Y + S+++G +DVV + Q+
Sbjct: 134 VKCT------------LDCNCDNDRMQCVYERQYAEMSTSSGVLGEDVVSFGN-----QS 176
Query: 191 TSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDG 250
+FGC ++G+L S + DGI+G G+ + S++ QL V F+ C G
Sbjct: 177 ELAPQRAVFGCENVETGDLYS---QHADGIMGLGRGDLSIMDQLVDKNVVSDSFSLCYGG 233
Query: 251 IN-GGGIFAIGHVVQPE----VNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDN 305
++ GGG +G + P P+ P+Y+I++ + V L L VF D
Sbjct: 234 MDVGGGAMVLGGISPPSDMVFAQSDPV--RSPYYNIDLKEIHVAGKRLPLNPSVF---DG 288
Query: 306 K-GTIIDSGTTLAYLPEMVYEPLVSKIISQ-QPDLKVHTVHDEYT--CFQYS----ESVD 357
K G+++DSGTT AYLPE + I+ + Q ++ Y CF + +
Sbjct: 289 KHGSVLDSGTTYAYLPEEAFLAFKEAIVKELQSFSQISGPDPNYNDLCFSGAGIDVSQLS 348
Query: 358 EGFPNVTFHFENSVSLKVYPHEYLFPFEDL---WCIGWQNSGMQSRDRKNMTLLG 409
+ FP V F N + P Y+F + +C+G +G + TLLG
Sbjct: 349 KTFPVVDMIFGNGHKYSLSPENYMFRHSKVRGAYCLGIFQNG-----KDPTTLLG 398
>gi|145356007|ref|XP_001422234.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144582474|gb|ABP00551.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 488
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 93/341 (27%), Positives = 150/341 (43%), Gaps = 67/341 (19%)
Query: 87 KDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTC----DQEFCH 142
+ Y + VDTGS +V C C C + YD S + + C D C
Sbjct: 49 QTYDLIVDTGSARTYVPCKGCARCGEHAH-----GYYDYDRSMEFERLDCGEASDATLCE 103
Query: 143 GVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCG 202
G C ++ C Y+ Y +GSS+ GY V+D V+ L + + L FGC
Sbjct: 104 ETMKGT---CQSDGRCSYVVSYAEGSSSRGYVVRDRVR-------LGEGTLSAMLAFGC- 152
Query: 203 ARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGING-GGIFAIGH 261
+ ++ E+ DG+ GFG+ +++ +QLAS+G + +F+ C++G GG+ +G
Sbjct: 153 --EEAETNAIYEQKADGLFGFGRGTATVHAQLASAGLIENVFSFCVEGFGANGGVLTLGR 210
Query: 262 ----VVQPEVNKTPLV--PNQPHYSINMTAVQVGLDFLNLPTDVFGVGD-------NKGT 308
P + +TPLV P P F N+ T + +GD + T
Sbjct: 211 FDFGADAPALARTPLVADPANPA-------------FHNVRTSSWKLGDSLIEHLNSYTT 257
Query: 309 IIDSGTTLAYLPEMVYEPLVSKIISQ--QPDLKVHTVHD---EYTCFQYS---------- 353
+DSGTT ++P V+ +++ +Q Q L++ D + C+ S
Sbjct: 258 TLDSGTTFTFVPRSVWVSFKTRLDTQATQAGLEIVAGPDPQYDDVCYGVSAAAMNMTLSQ 317
Query: 354 ESVDEGFPNVTFHFENSVSLKVYPHEYLFPFED---LWCIG 391
+V E FP +T +E VSL + P YLF E +C+G
Sbjct: 318 STVSEWFPPLTIAYEGGVSLTLGPENYLFAHETNSAAFCVG 358
>gi|297817208|ref|XP_002876487.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
lyrata]
gi|297322325|gb|EFH52746.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
lyrata]
Length = 520
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 84/281 (29%), Positives = 133/281 (47%), Gaps = 30/281 (10%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y+ + +G+PPK + + +DTGSD+ W+ C+ C +C +++ YD K S++
Sbjct: 151 GSGEYFMDVLVGSPPKHFSLILDTGSDLNWIQCLPCHDCFQQNG-----AFYDPKASASY 205
Query: 132 KFVTCDQEFCHGVY-GGPLTDCTA-NTSCPYLEIYGDGSSTTGYFVQDVVQYD-KVSGDL 188
K +TC+ C+ V P C + N SCPY YGD S+TTG F + + SG
Sbjct: 206 KNITCNDPRCNLVSPPDPPKPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTSGGS 265
Query: 189 QTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL 248
+++FGCG G G+ S SQL S G F++CL
Sbjct: 266 SELYNVENMMFGCGHWNRGLFHGAAGLLGL-----GRGPLSFSSQLQSLYG--HSFSYCL 318
Query: 249 ------DGINGGGIFAIGH--VVQPEVNKTPLVPNQPH-----YSINMTAVQVGLDFLNL 295
++ IF + P +N T V + + Y + + ++ V + LN+
Sbjct: 319 VDRNSDTNVSSKLIFGEDKDLLSHPNLNFTSFVARKENLVDTFYYVQIKSIIVAGEVLNI 378
Query: 296 PTDVFGVGDN--KGTIIDSGTTLAYLPEMVYEPLVSKIISQ 334
P + + + + GTIIDSGTTL+Y E YE + +KI +
Sbjct: 379 PEETWNISSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEK 419
>gi|413916291|gb|AFW56223.1| hypothetical protein ZEAMMB73_420944 [Zea mays]
Length = 383
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 97/340 (28%), Positives = 149/340 (43%), Gaps = 47/340 (13%)
Query: 62 LPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELT 121
PL G P G LYY + IG PPK Y++ VD+GSD+ W+ C + P RS +
Sbjct: 54 FPLYGDVYPHG--LYYVAMNIGNPPKPYFLDVDSGSDLTWLQC----DAPCRSCNEVPHP 107
Query: 122 LYDIKDSSTGKFVTCDQEFCHGVYGGPLT---DC-TANTSCPYLEIYGDGSSTTGYFVQD 177
LY S K V C C ++ G LT C + + C Y+ Y D S+TG + D
Sbjct: 108 LYRPTKS---KLVPCVHRLCASLHNG-LTGKHRCDSPHEQCDYVIKYADQGSSTGVLIND 163
Query: 178 VVQYDKVSGDLQTTSTNGSLIFGCGARQ---SGNLDSTNEEALDGIIGFGKSNSSMISQL 234
+G + S+ FGCG Q SG+L S DG++G G + S++SQL
Sbjct: 164 SFALRLTNGSV----ARPSVAFGCGYDQQVRSGDLSSPT----DGVLGLGTGSVSLLSQL 215
Query: 235 ASSGGVRKMFAHCLDGINGGGIFAIGHVVQP--EVNKTPLVPN--QPHYSINMTAVQVGL 290
G + + HCL + GGG G + P TP+ + + +YS ++ G
Sbjct: 216 KQRGVTKNVVGHCLS-LRGGGFLFFGDDLVPYQRATWTPMARSAFRNYYSPGSASLYFGD 274
Query: 291 DFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLV-------SKIISQQPDLKVHTV 343
L GV K + DSG++ Y Y+ LV S+ + ++PD +
Sbjct: 275 RSL-------GVRLAK-VVFDSGSSFTYFAAKPYQALVTALKDGLSRTLEEEPDTSLPLC 326
Query: 344 HDEYTCFQYSESVDEGFPNVTFHFEN--SVSLKVYPHEYL 381
F+ V + F ++ +F + +++ P YL
Sbjct: 327 WKGQEPFKSVLDVRKEFKSLVLNFASGKKTLMEIPPENYL 366
>gi|255558694|ref|XP_002520371.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223540418|gb|EEF41987.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 557
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 86/272 (31%), Positives = 127/272 (46%), Gaps = 30/272 (11%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y+ + IGTPP+ + + +DTGSD+ W+ C+ C +C ++ YD K+SS+
Sbjct: 188 GSGEYFMDVFIGTPPRHFSLILDTGSDLNWIQCVPCYDCFVQNG-----PYYDPKESSSF 242
Query: 132 KFVTCDQEFCHGVYG-GPLTDCTA-NTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQ 189
K + C CH V P C A N +CPY YGD S+TTG F + + S +
Sbjct: 243 KNIGCHDPRCHLVSSPDPPQPCKAENQTCPYFYWYGDSSNTTGDFALETFTVNLTSPAGK 302
Query: 190 TTSTN-GSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL 248
+ +++FGCG G G+ S SQL S G F++CL
Sbjct: 303 SEFKRVENVMFGCGHWNRGLFHGAAGLLGL-----GRGPLSFSSQLQSLYG--HSFSYCL 355
Query: 249 ------DGINGGGIFAIGH--VVQPEVNKTPLV-----PNQPHYSINMTAVQVGLDFLNL 295
++ IF + PEVN T LV P Y + + ++ VG + L +
Sbjct: 356 VDRNSDTNVSSKLIFGEDKDLLNHPEVNFTSLVAGKENPVDTFYYVQIKSIMVGGEVLKI 415
Query: 296 PTDVFGVGDN--KGTIIDSGTTLAYLPEMVYE 325
P + + + GTI+DSGTTL+Y E YE
Sbjct: 416 PEETWHLSPEGAGGTIVDSGTTLSYFAEPSYE 447
>gi|356559246|ref|XP_003547911.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 516
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 104/387 (26%), Positives = 176/387 (45%), Gaps = 50/387 (12%)
Query: 37 GRERSLSLLKEHDARRQQRILAGVD--LPLGGSSRPDGVG------LYYAKIGIGTPPKD 88
G + +++ D + R LAG D PL ++ D L++A + +GTPP
Sbjct: 58 GTPQYYAVMAHRDRVFRGRRLAGADHHSPLTFAAGNDTHQIASSGFLHFANVSVGTPPLW 117
Query: 89 YYVQVDTGSDIMWV--NCIQCKECPRRSSLG--IELTLYDIKDSSTGKFVTCDQE-FCHG 143
+ V +DTGSD+ W+ +CI C R+ G ++ YD+ SST V+C+ FC
Sbjct: 118 FLVALDTGSDLFWLPCDCISCVHGGLRTRTGKILKFNTYDLDKSSTSNEVSCNNSTFCRQ 177
Query: 144 VYGGPLTDCTANTSCPY-LEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCG 202
P +A ++C Y ++ + +S+ G+ V+DV+ ++ D QT + + FGCG
Sbjct: 178 RQQCP----SAGSTCRYQVDYLSNDTSSRGFVVEDVLHL--ITDDDQTKDADTRIAFGCG 231
Query: 203 ARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHV 262
Q+G N A +G+ G G N S+ S LA G + F+ C G + G G
Sbjct: 232 QVQTGVF--LNGAAPNGLFGLGMDNISVPSILAREGLISNSFSMCF-GSDSAGRITFGDT 288
Query: 263 VQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNK-GTIIDSGTTLAYLPE 321
P+ KTP + H + N+T ++ ++ V D + I DSGT+ Y+ +
Sbjct: 289 GSPDQRKTPFNVRKLHPTYNITITKIIVE--------DSVADLEFHAIFDSGTSFTYIND 340
Query: 322 MVYEPLVSKIISQQPDLKVHTVH--DEYTCFQY------SESVDEGFPNVTF-----HFE 368
Y + ++ + + K H+ D F Y S++++ F N+T ++
Sbjct: 341 PAYT-RIGEMYNSKVKAKRHSSQSPDSNIPFDYCYDISISQTIEVPFLNLTMKGGDDYYV 399
Query: 369 NSVSLKVYPHEYLFPFEDLWCIGWQNS 395
++V E DL C+G Q S
Sbjct: 400 MDPIIQVSSEEE----GDLLCLGIQKS 422
>gi|255541796|ref|XP_002511962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223549142|gb|EEF50631.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 495
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 100/387 (25%), Positives = 161/387 (41%), Gaps = 60/387 (15%)
Query: 40 RSLSLLKEH-DARRQQRILAGVDLPLGGSSRPD-----------------------GVGL 75
++L L + H D+ R Q I + L L G S+ D G G
Sbjct: 99 KALVLSRLHRDSSRVQAITTRLQLILNGVSKSDLKPLQTEIQPQDLSTPVSSGTSQGSGE 158
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y+ ++G+G P K YY+ +DTGSDI W+ C C +C ++S ++ SS+ +T
Sbjct: 159 YFTRVGVGNPAKSYYMVLDTGSDINWIQCQPCSDCYQQSD-----PIFTPAASSSYSPLT 213
Query: 136 CDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNG 195
CD + C+ + ++ C N C Y YGDGS T G FV + + + + T
Sbjct: 214 CDSQQCNSLQ---MSSC-RNGQCRYQVNYGDGSFTFGDFVTETMSFGG-------SGTVN 262
Query: 196 SLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGG 255
S+ GCG G G S+ SQL ++ F++CL +
Sbjct: 263 SIALGCGHDNEGLFVGAAGLLGLGGGPL-----SLTSQLKATS-----FSYCLVNRDSAA 312
Query: 256 IFAIGHVVQPEVNK--TPLVPNQP---HYSINMTAVQVGLDFLNLPTDVFGVGD--NKGT 308
+ P + PL+ + Y + ++ + VG + L +P +VF + D + G
Sbjct: 313 SSTLDFNSAPVGDSVIAPLLKSSKIDTFYYVGLSGMSVGGELLRIPQEVFKLDDSGDGGV 372
Query: 309 IIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHT-VHDEYTCFQYSESVDEGFPNVTFHF 367
I+D GT + L Y L +S L+ + V TC+ S P V+FHF
Sbjct: 373 IVDCGTAITRLQSEAYNSLRDSFVSMSRHLRSTSGVALFDTCYDLSGQSSVKVPTVSFHF 432
Query: 368 ENSVSLKVYPHEYLFPFED--LWCIGW 392
+ S + YL P + +C +
Sbjct: 433 DGGKSWDLPAANYLIPVDSAGTYCFAF 459
>gi|302784853|ref|XP_002974198.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
gi|300157796|gb|EFJ24420.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
Length = 359
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 103/354 (29%), Positives = 152/354 (42%), Gaps = 51/354 (14%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y ++ IGTPP+ +DTGSD++W+ C C C T++ SS+
Sbjct: 1 GEGEYMMELSIGTPPQLIPAMIDTGSDLVWLKCDNCDHCDLDHH---GETIFFSDASSSY 57
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
K + C+ C G+ + T C Y YGDGS T+G D + +
Sbjct: 58 KKLPCNSTHCSGMSSAGIGPRCEET-CKYKYEYGDGSRTSGDVGSDRISFRSHGAGEDHR 116
Query: 192 STNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI 251
S +FGC + G+ + T G+IG G+ + S+I QL G + F++CL
Sbjct: 117 SFFDGFLFGCARKLKGDWNFTQ-----GLIGLGQKSHSLIQQLGDKLGYK--FSYCLVSY 169
Query: 252 N-----------GGGIFAIGHVVQPEVNKTPLVP----NQPHYSINMTAVQVGLDFLNLP 296
+ G GH +V TP++ +Q Y +++ ++ +G +P
Sbjct: 170 DSPPSAKSFLFLGSSAALRGH----DVVSTPILHGDHLDQTLYYVDLQSITIG----GVP 221
Query: 297 TDVFG--VGDNKG--------TIIDSGTTLAYLPEMVYEPLVSKIISQ--QPDLKVHTVH 344
V+ G N T+IDSGTT L VYE + I Q P L
Sbjct: 222 VVVYDKESGHNTSVGPFLANKTVIDSGTTYTLLTPPVYEAMRKSIEEQVILPTLGNSAGL 281
Query: 345 DEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPF--EDLWCIGWQNSG 396
D CF S GFP+VTF+F N V L V P E +F D+ C+ +SG
Sbjct: 282 D--LCFNSSGDTSYGFPSVTFYFANQVQL-VLPFENIFQVTSRDVVCLSMDSSG 332
>gi|125554848|gb|EAZ00454.1| hypothetical protein OsI_22475 [Oryza sativa Indica Group]
Length = 538
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 89/291 (30%), Positives = 130/291 (44%), Gaps = 26/291 (8%)
Query: 45 LKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNC 104
+K A + R + LP+ G+ PDG YY + IG PP+ Y++ VDTGSD+ W+ C
Sbjct: 130 VKPDSAGAEARENSSALLPIRGNVFPDGQ--YYTSMYIGNPPRPYFLDVDTGSDLTWIQC 187
Query: 105 -IQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEI 163
C C + + + V +C + G T+ C Y
Sbjct: 188 DAPCTNCAKGP--------HPLYKPEKPNVVPPRDSYCQELQGNQNYGDTSK-QCDYEIT 238
Query: 164 YGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGF 223
Y D SS+ G +D +Q G+ + N +FGCG Q GNL S+ DGI+G
Sbjct: 239 YADRSSSMGILARDNMQLITADGERE----NLDFVFGCGYDQQGNLLSSPANT-DGILGL 293
Query: 224 GKSNSSMISQLASSGGVRKMFAHCL--DGINGGGIFAIGHVVQPEVNKTPL-VPNQPH-- 278
+ S+ +QLAS G + +F HC+ D NGG +F +G P T + + N P
Sbjct: 294 SNAAISLPTQLASQGIISNVFGHCIAADPSNGGYMF-LGDDYVPRWGMTWMPIRNGPENL 352
Query: 279 YSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVS 329
YS + V G LN+ G I DSG++ YLP Y L++
Sbjct: 353 YSTEVQKVNYGDQQLNVRRK---AGKLTQVIFDSGSSYTYLPHDDYTNLIA 400
>gi|72384474|gb|AAZ67590.1| 80A08_5 [Brassica rapa subsp. pekinensis]
Length = 632
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 96/352 (27%), Positives = 154/352 (43%), Gaps = 30/352 (8%)
Query: 44 LLKEHDARRQQRILAGVDLPLGGSSRPDGVG-------LYYAKIGIGTPPKDYYVQVDTG 96
LL D+RRQ+ L L S + L+Y I IGTP + V +D+G
Sbjct: 58 LLTSIDSRRQKMNLGAKFQSLVPSEGSKTISPGNYFGWLHYTWIDIGTPSVSFLVALDSG 117
Query: 97 SDIMWV--NCIQCKECPRR--SSLGI-ELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTD 151
SD++W+ NC+QC SSL +L +D S+T K C + C P +
Sbjct: 118 SDLLWIPCNCVQCAPLSSAYYSSLATKDLNEFDPSASTTSKVFPCSHKLCE---SAPACE 174
Query: 152 CTANTSCPYLEIYG-DGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLD 210
+ CPY Y + +S++G V+DV+ + ++S ++ GCG +QSG
Sbjct: 175 -SPKEQCPYTVTYASENTSSSGLLVEDVLHLAYSAN--ASSSVKARVVVGCGEKQSGEF- 230
Query: 211 STNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHVVQPEVNKT 270
A DG++G G S+ S LA +G +R F+ C D + G I+ G V T
Sbjct: 231 -LKGIAPDGVMGLGPGEISVPSFLAKAGLMRNSFSMCFDEEDSGRIY-FGDVGPSTQQST 288
Query: 271 PLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSK 330
+P Y A VG++ + + T+IDSG + +LPE +Y + +
Sbjct: 289 RFLP----YKNEFVAYFVGVEVCCVGNSCLK-QSSFTTLIDSGQSFTFLPEEIYREVALE 343
Query: 331 IISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLF 382
I S + V + + Y S + P + F ++ + + H+ LF
Sbjct: 344 IDSHI-NATVKKIEGGPWEYCYETSFEPKVPAIKLKFSSNNTFVI--HKPLF 392
>gi|326523839|dbj|BAJ96930.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 86/337 (25%), Positives = 145/337 (43%), Gaps = 20/337 (5%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y+ + +GTP + + + DTGSD+ WV C + +S ++ +S +
Sbjct: 106 GTGQYFVQFRVGTPAQPFVLVADTGSDLTWVKCRGRRASSPDASPLASPRVFRPANSKSW 165
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTSCP----YLEIYGDGSSTTGYFVQDVVQYDKVSGD 187
+ C + C L +C+A T+ P Y Y D SS G D
Sbjct: 166 APIPCSSDTCKSYVPFSLANCSAGTTPPAPCGYDYRYKDKSSARGVVGTDAATIALSGSG 225
Query: 188 LQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHC 247
+ ++ GC + + D + ++ DG++ G SN S S+ A+ G R F++C
Sbjct: 226 SDRKAKLQEVVLGC----TTSYDGQSFQSSDGVLSLGNSNISFASRAAARFGGR--FSYC 279
Query: 248 L-DGI---NGGGIFAIGHV-VQPEVNKTPLVPNQ---PHYSINMTAVQVGLDFLNLPTDV 299
L D + N G V ++TPL+ + P Y++ + AV V LN+P +V
Sbjct: 280 LVDHLAPRNATSYLTFGPVGAAHSPSRTPLLLDAQVAPFYAVTVDAVSVAGKALNIPAEV 339
Query: 300 FGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSES-VDE 358
+ V N G I+DSGT+L L Y+ +V+ + Q + T+ C+ ++ +
Sbjct: 340 WDVKKNGGAILDSGTSLTILATPAYKAVVAALSKQLARVPRVTMDPFEYCYNWTATRRPP 399
Query: 359 GFPNVTFHFENSVSLKVYPHEYLF-PFEDLWCIGWQN 394
P + F S L+ Y+ + CIG Q
Sbjct: 400 AVPRLEVRFAGSARLRPPTKSYVIDAAPGVKCIGLQE 436
>gi|224081804|ref|XP_002306494.1| predicted protein [Populus trichocarpa]
gi|222855943|gb|EEE93490.1| predicted protein [Populus trichocarpa]
Length = 564
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 100/357 (28%), Positives = 159/357 (44%), Gaps = 58/357 (16%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKF 133
G Y ++ IGTPP+ + + VDTGS + +V C C++C R + L SST +
Sbjct: 11 GYYTTRLWIGTPPQRFALIVDTGSSVTYVPCSSCEQCGRHQDPKFQPDL-----SSTYQS 65
Query: 134 VTCDQEFCHGVYGGPLTDCTAN---TSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQT 190
V C+ DC + C Y Y + S+++G +D++ + G+L
Sbjct: 66 VKCN------------IDCNCDDEKQQCVYERQYAEMSTSSGVLGEDIISF----GNLSA 109
Query: 191 TSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHC--- 247
+ + +FGC ++G+L S + DGI+G G+ + S++ L G + F+ C
Sbjct: 110 LAPQRA-VFGCENMETGDLYSQHA---DGIMGMGRGDLSIVDHLVDKGVINDSFSLCYGG 165
Query: 248 ----LDGINGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVG 303
+ GGI ++V + + P+Y+I++ + V L L VF
Sbjct: 166 MGIGGGAMVLGGISPPSNMVFSQSDPV----RSPYYNIDLKEIHVAGKPLPLNPTVF--- 218
Query: 304 DNK-GTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLK-VHTVHDEYT--CFQYSES---- 355
D K GTI+DSGTT AYLPE + I+ + LK + Y CF + S
Sbjct: 219 DGKHGTILDSGTTYAYLPEAAFVSFKDAIMKELHSLKPIRGPDPNYNDICFSGAGSDISQ 278
Query: 356 VDEGFPNVTFHFENSVSLKVYPHEYLFPFEDL---WCIGWQNSGMQSRDRKNMTLLG 409
+ FP V F N L + P YLF + +C+G +G + TLLG
Sbjct: 279 LSSSFPAVEMVFGNGQKLLLSPENYLFRHSKVHGAYCLGIFQNG-----KDPTTLLG 330
>gi|356495496|ref|XP_003516613.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 645
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 97/337 (28%), Positives = 150/337 (44%), Gaps = 49/337 (14%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKF 133
G Y A++ IGTPP+ + + VDTGS + +V C C+ C + +DS T +
Sbjct: 91 GYYTARLWIGTPPQRFALIVDTGSTVTYVPCSTCRHCGSHQD-----PKFRPEDSETYQP 145
Query: 134 VTCDQEFCHGVYGGPLTDCTAN-TSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTS 192
V C + +C + C Y Y + S+++G +DVV + QT
Sbjct: 146 VKCTWQ----------CNCDNDRKQCTYERRYAEMSTSSGALGEDVVSFGN-----QTEL 190
Query: 193 TNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGIN 252
+ IFGC ++G D N+ A DGI+G G+ + S++ QL + F+ C G+
Sbjct: 191 SPQRAIFGCENDETG--DIYNQRA-DGIMGLGRGDLSIMDQLVEKKVISDSFSLCYGGMG 247
Query: 253 G-------GGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDN 305
GGI +V + P+Y+I++ + V L+L VF D
Sbjct: 248 VGGGAMVLGGISPPADMVFTRSDPV----RSPYYNIDLKEIHVAGKRLHLNPKVF---DG 300
Query: 306 K-GTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEY---TCFQYSE----SVD 357
K GT++DSGTT AYLPE + I+ + LK + D CF +E +
Sbjct: 301 KHGTVLDSGTTYAYLPESAFLAFKHAIMKETHSLKRISGPDPRYNDICFSGAEIDVSQIS 360
Query: 358 EGFPNVTFHFENSVSLKVYPHEYLFPFEDL---WCIG 391
+ FP V F N L + P YLF + +C+G
Sbjct: 361 KSFPVVEMVFGNGHKLSLSPENYLFRHSKVRGAYCLG 397
>gi|357450869|ref|XP_003595711.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355484759|gb|AES65962.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 443
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 101/339 (29%), Positives = 162/339 (47%), Gaps = 42/339 (12%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y ++ IGTPP Y QVDTGSD++W+ CI C C ++ + ++D + SST +
Sbjct: 59 YLMELSIGTPPVKTYAQVDTGSDLIWLQCIPCTNCYKQLN-----PMFDPQSSSTYSNIA 113
Query: 136 CDQEFCHGVYGGPLTDCTAN-TSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTN 194
E C +Y T C+ + +C Y Y D S T G Q+ + +G + +
Sbjct: 114 YGSESCSKLYS---TSCSPDQNNCNYTYSYEDDSITEGVLAQETLTLTSTTG--KPVALK 168
Query: 195 GSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL------ 248
G +IFGCG +G N++ + GIIG G+ S++SQ+ SS G KMF+ CL
Sbjct: 169 G-VIFGCGHNNNGVF---NDKEM-GIIGLGRGPLSLVSQIGSSFG-GKMFSQCLVPFHTN 222
Query: 249 DGINGGGIFAIG-HVVQPEVNKTPLVPNQPHYSIN-MTAVQVGLDFLNLPTDVFGVGDN- 305
I F G V+ V TPLV H + +T + + ++ +NLP F G +
Sbjct: 223 PSITSPMSFGKGSEVLGNGVVSTPLVSKNTHQAFYFVTLLGISVEDINLP---FNDGSSL 279
Query: 306 ----KGT-IIDSGTTLAYLPEMVYEPLVSKI---ISQQPDLKVHTVHDEYTCFQYSESVD 357
KG +IDSGT LPE Y LV ++ ++ P + + C++ ++
Sbjct: 280 EPITKGNMVIDSGTPTTLLPEDFYHRLVEEVRNKVALDP-IPIDPTLGYQLCYRTPTNLK 338
Query: 358 EGFPNVTFHFENSVSLKVYPHEYLFPFED-LWCIGWQNS 395
+T HFE + L + P + P +D ++C + ++
Sbjct: 339 GT--TLTAHFEGADVL-LTPTQIFIPVQDGIFCFAFTST 374
>gi|3805854|emb|CAA21474.1| putative protein [Arabidopsis thaliana]
gi|7270540|emb|CAB81497.1| putative protein [Arabidopsis thaliana]
Length = 455
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 90/301 (29%), Positives = 142/301 (47%), Gaps = 35/301 (11%)
Query: 67 SSRPDGVG-LYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKEC-PRRSSL---GIELT 121
+SR +G L+Y + +GTP + V +DTGSD+ WV C C +C P + EL+
Sbjct: 97 TSRISSLGFLHYTTVKLGTPGMRFMVALDTGSDLFWVPC-DCGKCAPTEGATYASEFELS 155
Query: 122 LYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTAN-TSCPYLEIYGDG-SSTTGYFVQDVV 179
+Y+ K S+T K VTC+ C C ++CPY+ Y +ST+G ++DV+
Sbjct: 156 IYNPKVSTTNKKVTCNNSLC-----AQRNQCLGTFSTCPYMVSYVSAQTSTSGILMEDVM 210
Query: 180 QYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGG 239
+ D + FGCG QSG+ + A +G+ G G S+ S LA G
Sbjct: 211 HL--TTEDKNPERVEAYVTFGCGQVQSGSF--LDIAAPNGLFGLGMEKISVPSVLAREGL 266
Query: 240 VRKMFAHCLDGINGGGIFAIGHVVQPEVNKTP--LVPNQPHYSINMTAVQVGLDFLNLPT 297
V F+ C G +G G + G + +TP L P+ P+Y+I +T V+VG ++
Sbjct: 267 VADSFSMCF-GHDGVGRISFGDKGSSDQEETPFNLNPSHPNYNITVTRVRVGTTLID--- 322
Query: 298 DVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVD 357
D + D+GT+ YL + +Y +S+ K H+ D F+Y +
Sbjct: 323 ------DEFTALFDTGTSFTYLVDPMY-----TTVSESAQDKRHS-PDSRIPFEYCYDMR 370
Query: 358 E 358
E
Sbjct: 371 E 371
>gi|224096686|ref|XP_002310698.1| predicted protein [Populus trichocarpa]
gi|222853601|gb|EEE91148.1| predicted protein [Populus trichocarpa]
Length = 441
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 88/309 (28%), Positives = 139/309 (44%), Gaps = 34/309 (11%)
Query: 45 LKEHDARRQQRILAGVDLPLGGSS-----RPDGVG-LYYAKIGIGTPPKDYYVQVDTGSD 98
L D + R L+ D L S R +G L+Y + +GTP + V +DTGSD
Sbjct: 60 LAHRDQMLRGRRLSDADASLAFSDGNSTFRISSLGFLHYTTVELGTPGVKFMVALDTGSD 119
Query: 99 IMWVNCIQCKECP----RRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTA 154
+ WV C C C + EL++Y+ ++SST K VTC+ + C C
Sbjct: 120 LFWVPC-DCSRCAPTHGASYASDFELSIYNPRESSTSKKVTCNNDMC-----AQRNRCLG 173
Query: 155 N-TSCPYLEIYGDG-SSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDST 212
+SCPY+ Y +ST+G V+DV+ G + + FGCG QSG+
Sbjct: 174 TFSSCPYIVSYVSAQTSTSGILVKDVLHLTTEDGGREF--VEAYVTFGCGQVQSGSF--L 229
Query: 213 NEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHVVQPEVNKTP- 271
+ A +G+ G G S+ S L+ G + F+ C G +G G + G P+ +TP
Sbjct: 230 DIAAPNGLFGLGMEKISVPSVLSREGLIADSFSMCF-GHDGIGRISFGDKGSPDQEETPF 288
Query: 272 -LVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSK 330
+ P P Y++ +T +VG +++ + DSGT+ Y+ + Y + K
Sbjct: 289 NVNPAHPTYNVTVTQARVGTMLIDV---------EFTALFDSGTSFTYMVDPAYSRVSEK 339
Query: 331 IISQQPDLK 339
S D +
Sbjct: 340 FHSLARDKR 348
>gi|255576176|ref|XP_002528982.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223531572|gb|EEF33401.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 542
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 81/267 (30%), Positives = 130/267 (48%), Gaps = 21/267 (7%)
Query: 75 LYYAKIGIGTPPKDYYVQVDTGSDIMWV--NCIQCK--ECPRRSSLGIELTLYDIKDSST 130
L+Y I IGTP + V +D GSD++WV +C+QC SSL +L Y SST
Sbjct: 112 LHYTWIDIGTPHVSFLVALDAGSDLLWVPCDCLQCAPLSASYYSSLDRDLNEYSPSHSST 171
Query: 131 GKFVTCDQEFCHGVYGGPLTDCTA-NTSCPY-LEIYGDGSSTTGYFVQDVVQYDKVSGDL 188
K ++C + C GP +C + CPY ++ Y + +S++G V+D++ +
Sbjct: 172 SKHLSCSHQLCE---LGP--NCNSPKQPCPYSMDYYTENTSSSGLLVEDILHLASNGDNA 226
Query: 189 QTTSTNGSLIFGCGARQSGN-LDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHC 247
+ S ++ GCG +QSG LD A DG++G G + S+ S LA +G +R F+ C
Sbjct: 227 LSYSVRAPVVIGCGMKQSGGYLDGV---APDGLMGLGLAEISVPSFLAKAGLIRNSFSMC 283
Query: 248 LDGINGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKG 307
D + G IF G TP + +Y T VG++ + + +
Sbjct: 284 FDEDDSGRIF-FGDQGPTTQQSTPFLTLDGNY----TTYVVGVEGFCVGSSCLKQTSFRA 338
Query: 308 TIIDSGTTLAYLPEMVYEPLVSKIISQ 334
++D+GT+ +LP VYE + + Q
Sbjct: 339 -LVDTGTSFTFLPNGVYERITEEFDRQ 364
>gi|115448471|ref|NP_001048015.1| Os02g0730700 [Oryza sativa Japonica Group]
gi|46390468|dbj|BAD15929.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
Group]
gi|46390864|dbj|BAD16368.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
Group]
gi|113537546|dbj|BAF09929.1| Os02g0730700 [Oryza sativa Japonica Group]
gi|215697021|dbj|BAG91015.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222623612|gb|EEE57744.1| hypothetical protein OsJ_08261 [Oryza sativa Japonica Group]
Length = 573
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 92/325 (28%), Positives = 141/325 (43%), Gaps = 45/325 (13%)
Query: 62 LPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNC-IQCKECPRRSSLGIEL 120
LP+ G+ PDG YY I +G PP+ Y++ VDTGSD+ W+ C C C +
Sbjct: 191 LPIKGNVFPDGQ--YYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGP------ 242
Query: 121 TLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQ 180
+ + + K V C + G C C Y Y D SS+ G +D
Sbjct: 243 --HPLYKPAKEKIVPPKDLLCQELQGN-QNYCETCKQCDYEIEYADRSSSMGVLARD--- 296
Query: 181 YDKVSGDLQTTSTNG-----SLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLA 235
D+ +TNG +FGC Q G L ++ + DGI+G + S+ SQLA
Sbjct: 297 ------DMHIITTNGGREKLDFVFGCAYDQQGQLLASPAKT-DGILGLSSAGISLPSQLA 349
Query: 236 SSGGVRKMFAHCL-DGINGGGIFAIGHVVQPE--VNKTPLVPNQPHYSINMTAVQV--GL 290
+ G + +F HC+ NGGG +G P + TP + + P + A +V G
Sbjct: 350 NQGIISNVFGHCITRDPNGGGYMFLGDDYVPRWGMTSTP-IRSAPDNLFHTEAQKVYYGD 408
Query: 291 DFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCF 350
L++ G++ I DSG++ YLP+ +Y+ L++ I P+ V D
Sbjct: 409 QQLSMRG---ASGNSVQVIFDSGSSYTYLPDEIYKNLIAAIKYAYPNF-VQDSSDRTLPL 464
Query: 351 --------QYSESVDEGFPNVTFHF 367
+Y E V + F + HF
Sbjct: 465 CLATDFPVRYLEDVKQLFKPLNLHF 489
>gi|326512608|dbj|BAJ99659.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 484
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 87/317 (27%), Positives = 129/317 (40%), Gaps = 48/317 (15%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y +G+GTP +D V DTGSD+ WV C C +C + L+D SST
Sbjct: 142 GTGNYVVSMGLGTPARDMTVVFDTGSDLSWVQCTPCSDCYEQKD-----PLFDPARSSTY 196
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQD---VVQYDKVSGDL 188
V C C G+ C+ + C Y +YGD S T G +D + Q D + G
Sbjct: 197 SAVPCASPECQGLDS---RSCSRDKKCRYEVVYGDQSQTDGALARDTLTLTQSDVLPG-- 251
Query: 189 QTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL 248
+FGCG + +G DG++G G+ S+ SQ AS G F++CL
Sbjct: 252 --------FVFGCGEQDTGLFGRA-----DGLVGLGREKVSLSSQAASKYGA--GFSYCL 296
Query: 249 -DGINGGGIFAIGHVVQPEVNKTPLVPNQ---PHYSINMTAVQVGLDFLNLPTDVFGVGD 304
+ G ++G T + Y + + V+V + + VF
Sbjct: 297 PSSPSAAGYLSLGGPAPANARFTAMETRHDSPSFYYVRLVGVKVAGRTVRVSPIVFSAA- 355
Query: 305 NKGTIIDSGTTLAYLPEMVYEPLVSKII--------SQQPDLKVHTVHDEYTCFQYSESV 356
GT+IDSGT + LP VY L S + P L + TC+ ++
Sbjct: 356 --GTVIDSGTVITRLPPRVYAALRSAFARSMGRYGYKRAPALSILD-----TCYDFTGHT 408
Query: 357 DEGFPNVTFHFENSVSL 373
P+V F ++
Sbjct: 409 TVRIPSVALVFAGGAAV 425
>gi|218191512|gb|EEC73939.1| hypothetical protein OsI_08807 [Oryza sativa Indica Group]
Length = 574
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 92/325 (28%), Positives = 141/325 (43%), Gaps = 45/325 (13%)
Query: 62 LPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNC-IQCKECPRRSSLGIEL 120
LP+ G+ PDG YY I +G PP+ Y++ VDTGSD+ W+ C C C +
Sbjct: 192 LPIKGNVFPDGQ--YYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGP------ 243
Query: 121 TLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQ 180
+ + + K V C + G C C Y Y D SS+ G +D
Sbjct: 244 --HPLYKPAKEKIVPPKDLLCQELQGN-QNYCETCKQCDYEIEYADRSSSMGVLARD--- 297
Query: 181 YDKVSGDLQTTSTNG-----SLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLA 235
D+ +TNG +FGC Q G L ++ + DGI+G + S+ SQLA
Sbjct: 298 ------DMHIITTNGGREKLDFVFGCAYDQQGQLLASPAKT-DGILGLSSAGISLPSQLA 350
Query: 236 SSGGVRKMFAHCL-DGINGGGIFAIGHVVQPE--VNKTPLVPNQPHYSINMTAVQV--GL 290
+ G + +F HC+ NGGG +G P + TP + + P + A +V G
Sbjct: 351 NQGIISNVFGHCITRDPNGGGYMFLGDDYVPRWGMTSTP-IRSAPDNLFHTEAQKVYYGD 409
Query: 291 DFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCF 350
L++ G++ I DSG++ YLP+ +Y+ L++ I P+ V D
Sbjct: 410 QQLSMRG---ASGNSVQVIFDSGSSYTYLPDEIYKNLIAAIKYAYPNF-VQDSSDRTLPL 465
Query: 351 --------QYSESVDEGFPNVTFHF 367
+Y E V + F + HF
Sbjct: 466 CLATDFPVRYLEDVKQLFKPLNLHF 490
>gi|297723777|ref|NP_001174252.1| Os05g0187600 [Oryza sativa Japonica Group]
gi|255676094|dbj|BAH92980.1| Os05g0187600 [Oryza sativa Japonica Group]
Length = 340
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 54/141 (38%), Positives = 79/141 (56%), Gaps = 9/141 (6%)
Query: 217 LDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHVVQPEVNKTPLVPNQ 276
+DG++G G SN+S++ QLA S +KMFAHCLDG GGIF +GH+V P+V KTPL
Sbjct: 89 VDGVMGLGPSNTSLVYQLAKSQKWKKMFAHCLDGKRSGGIFVLGHIVGPKVRKTPLDQTS 148
Query: 277 PHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQP 336
Y + + VG L+L + TI+++G+ ++YLPE KI S
Sbjct: 149 SRYRTTLLEITVGETSLSLSAGNVEIKSQNMTILETGSLISYLPE--------KIFSDLE 200
Query: 337 DLKVHTVHDEYTCFQYSESVD 357
D+ V + Y+CF Y ++
Sbjct: 201 DISVINIGG-YSCFHYERRMN 220
>gi|115467508|ref|NP_001057353.1| Os06g0268700 [Oryza sativa Japonica Group]
gi|53791766|dbj|BAD53531.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
Group]
gi|53793187|dbj|BAD54393.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
Group]
gi|113595393|dbj|BAF19267.1| Os06g0268700 [Oryza sativa Japonica Group]
gi|125596798|gb|EAZ36578.1| hypothetical protein OsJ_20919 [Oryza sativa Japonica Group]
gi|215767941|dbj|BAH00170.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 538
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 89/291 (30%), Positives = 130/291 (44%), Gaps = 26/291 (8%)
Query: 45 LKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNC 104
+K A + R + LP+ G+ PDG YY + IG PP+ Y++ VDTGSD+ W+ C
Sbjct: 130 VKPDGAGAEARENSSALLPIRGNVFPDGQ--YYTSMYIGNPPRPYFLDVDTGSDLTWIQC 187
Query: 105 -IQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEI 163
C C + + + V +C + G T+ C Y
Sbjct: 188 DAPCTNCAKGP--------HPLYKPEKPNVVPPRDSYCQELQGNQNYGDTSK-QCDYEIT 238
Query: 164 YGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGF 223
Y D SS+ G +D +Q G+ + N +FGCG Q GNL S+ DGI+G
Sbjct: 239 YADRSSSMGILARDNMQLITADGERE----NLDFVFGCGYDQQGNLLSSPANT-DGILGL 293
Query: 224 GKSNSSMISQLASSGGVRKMFAHCL--DGINGGGIFAIGHVVQPEVNKTPL-VPNQPH-- 278
+ S+ +QLAS G + +F HC+ D NGG +F +G P T + + N P
Sbjct: 294 SNAAISLPTQLASQGIISNVFGHCIAADPSNGGYMF-LGDDYVPRWGMTWMPIRNGPENL 352
Query: 279 YSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVS 329
YS + V G LN+ G I DSG++ YLP Y L++
Sbjct: 353 YSTEVQKVNYGDQQLNVRRK---AGKLTQVIFDSGSSYTYLPHDDYTNLIA 400
>gi|115484503|ref|NP_001065913.1| Os11g0183900 [Oryza sativa Japonica Group]
gi|62954888|gb|AAY23257.1| nucellin-like aspartic protease [Oryza sativa Japonica Group]
gi|77549017|gb|ABA91814.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113644617|dbj|BAF27758.1| Os11g0183900 [Oryza sativa Japonica Group]
gi|222615638|gb|EEE51770.1| hypothetical protein OsJ_33210 [Oryza sativa Japonica Group]
Length = 418
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 87/322 (27%), Positives = 131/322 (40%), Gaps = 31/322 (9%)
Query: 73 VGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGK 132
G YY + IG P K Y++ VDTGSD+ W+ C + P +S + LY + K
Sbjct: 54 TGHYYVTMNIGDPAKPYFLDVDTGSDLTWLQC----DAPCQSCNKVPHPLYR---PTKNK 106
Query: 133 FVTCDQEFCHGVYGG--PLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQT 190
V C C ++ G P CT C Y Y D +S+ G V D S L+
Sbjct: 107 LVPCANSICTALHSGSSPNKKCTTQQQCDYQIKYTDKASSLGVLVMD-----SFSLPLRN 161
Query: 191 TS-TNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLD 249
S SL FGCG Q + DG++G G+ + S++SQL G + + HCL
Sbjct: 162 KSNVRPSLSFGCGYDQQVGKNGAAPATTDGLLGLGRGSVSLLSQLKQQGITKNVLGHCLS 221
Query: 250 GINGGGIFAIGHVVQP--EVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKG 307
+GGG G + P V +V + + + + D +L T V
Sbjct: 222 -TSGGGFLFFGDDMVPTSRVTWVSMVRSTSGNYYSPGSATLYFDRRSLSTKPMEV----- 275
Query: 308 TIIDSGTTLAYLPEMVYEPLVSKI-------ISQQPDLKVHTVHDEYTCFQYSESVDEGF 360
+ DSG+T Y Y+ +S I + Q D + F+ V + F
Sbjct: 276 -VFDSGSTYTYFSAQPYQATISAIKGSLSKSLKQVSDPSLPLCWKGQKAFKSVSDVKKDF 334
Query: 361 PNVTFHFENSVSLKVYPHEYLF 382
++ F F + + + P YL
Sbjct: 335 KSLQFIFGKNAVMDIPPENYLI 356
>gi|326494754|dbj|BAJ94496.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326514480|dbj|BAJ96227.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 449
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 95/320 (29%), Positives = 142/320 (44%), Gaps = 50/320 (15%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G + + IGTP Y +DTGSD++W C C EC +S+ ++D SST
Sbjct: 98 GNGEFLMDMSIGTPAVAYAAIIDTGSDLVWTQCKPCVECFNQST-----PVFDPSSSSTY 152
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
+ C C + P + CT + C Y YGD SST G + K
Sbjct: 153 AALPCSSTLCSDL---PSSKCT-SAKCGYTYTYGDSSSTQGVLAAETFTLAK-------- 200
Query: 192 STNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI 251
+ + FGCG G D + A G++G G+ S++SQL G+ K F++CL +
Sbjct: 201 TKLPDVAFGCGDTNEG--DGFTQGA--GLVGLGRGPLSLVSQL----GLNK-FSYCLTSL 251
Query: 252 NG--------GGIFAI--GHVVQPEVNKTPLV--PNQPH-YSINMTAVQVGLDFLNLPTD 298
+ G + I V TPL+ P+QP Y +N+ + VG + LP+
Sbjct: 252 DDTSKSPLLLGSLATISESAAAASSVQTTPLIRNPSQPSFYYVNLKGLTVGSTHITLPSS 311
Query: 299 VFGVGDN--KGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDE----YTCFQY 352
F V D+ G I+DSGT++ YL Y L +Q +K+ TCF+
Sbjct: 312 AFAVQDDGTGGVIVDSGTSITYLELQGYRALKKAFAAQ---MKLPAADGSGIGLDTCFEA 368
Query: 353 SES-VDE-GFPNVTFHFENS 370
S VD+ P + FH + +
Sbjct: 369 PASGVDQVEVPKLVFHLDGA 388
>gi|148905906|gb|ABR16115.1| unknown [Picea sitchensis]
Length = 482
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 99/338 (29%), Positives = 146/338 (43%), Gaps = 39/338 (11%)
Query: 61 DLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIEL 120
+LPL S G G Y G GTP K+ + +DTGSD+ W+ C C +C +
Sbjct: 123 NLPLQ-SGTTVGTGNYIVTAGFGTPAKNSLLIIDTGSDLTWIQCKPCADCYSQVD----- 176
Query: 121 TLYDIKDSSTGKFVTCDQEFCHGVYGGPL--TDCTANTSCPYLEIYGDGSSTTGYFVQDV 178
+++ K SS+ K + C C + T C C Y YGDGSS+ G F Q+
Sbjct: 177 AIFEPKQSSSYKTLPCLSATCTELITSESNPTPCLLG-GCVYEINYGDGSSSQGDFSQET 235
Query: 179 VQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSG 238
+ S + FGCG +G ++ G++G G+++ S SQ S
Sbjct: 236 LTLGSDSFQ--------NFAFGCGHTNTGLFKGSS-----GLLGLGQNSLSFPSQSKSKY 282
Query: 239 GVRKMFAHCLDGINGGGIFAIGHVVQPEVNK----TPLVPN---QPHYSINMTAVQVGLD 291
G FA+CL V + + TPLV N Y + + + VG D
Sbjct: 283 G--GQFAYCLPDFGSSTSTGSFSVGKGSIPASAVFTPLVSNFMYPTFYFVGLNGISVGGD 340
Query: 292 FLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDL---KVHTVHDEYT 348
L++P V G G TI+DSGT + L Y L + S+ DL K ++ D T
Sbjct: 341 RLSIPPAVLGRGS---TIVDSGTVITRLLPQAYNALKTSFRSKTRDLPSAKPFSILD--T 395
Query: 349 CFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFED 386
C+ S P +TFHF+N+ + V L P ++
Sbjct: 396 CYDLSRHSQVRIPTITFHFQNNADVAVSDVGILVPVQN 433
>gi|414877221|tpg|DAA54352.1| TPA: hypothetical protein ZEAMMB73_214154 [Zea mays]
Length = 447
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 93/328 (28%), Positives = 153/328 (46%), Gaps = 48/328 (14%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKEC-PRRSSLGIELTLYDIKDSSTGKFV 134
Y ++ IGTPP + DTGSD+ W C CK C P+ + +YD SS+ V
Sbjct: 93 YLMELAIGTPPVPFVALADTGSDLTWTQCQPCKLCFPQ------DTPIYDTAVSSSFSPV 146
Query: 135 TCDQEFCHGVYGGPLTDCTANTS-CPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTST 193
C C ++ +CTA++S C Y YGDG+ + G + + + G +
Sbjct: 147 PCASATCLPIWSS--RNCTASSSPCRYRYAYGDGAYSAGVLGTETLTFPGAPG-----VS 199
Query: 194 NGSLIFGCGARQSG-NLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL-DGI 251
G + FGCG G + +ST G +G G+ + S+++QL GV K F++CL D
Sbjct: 200 VGGIAFGCGVDNGGLSYNST------GTVGLGRGSLSLVAQL----GVGK-FSYCLTDFF 248
Query: 252 N---GGGIF--AIGHVVQPE----VNKTPLV--PNQPH-YSINMTAVQVGLDFLNLPTDV 299
N G + A+ + P V TPLV P P Y +++ + +G L +P
Sbjct: 249 NTSLGSPVLFGALAELAAPSTGAAVQSTPLVQSPYVPTWYYVSLEGISLGDARLPIPNGT 308
Query: 300 FGVGDN--KGTIIDSGTTLAYLPEMVYEPLVSKI--ISQQPDLKVHTVHDEYTCFQYS-- 353
F + D+ G I+DSGTT +L E + +V + + +QP + ++ + CF +
Sbjct: 309 FDLRDDGSGGMIVDSGTTFTFLVESAFRVVVDHVAGVLRQPVVNASSL--DSPCFPAATG 366
Query: 354 ESVDEGFPNVTFHFENSVSLKVYPHEYL 381
E P++ HF ++++ Y+
Sbjct: 367 EQQLPAMPDMVLHFAGGADMRLHRDNYM 394
>gi|326491519|dbj|BAJ94237.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326524456|dbj|BAK00611.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 499
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 99/355 (27%), Positives = 153/355 (43%), Gaps = 44/355 (12%)
Query: 67 SSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQC-KECPRRSSLGIELTLYDI 125
S R G Y +G+GTP Y V DTGSD WV C C +C ++ + L+D
Sbjct: 154 SGRAVSTGNYVVTVGLGTPASKYTVVFDTGSDTTWVQCRPCVVKCYKQ-----KEPLFDP 208
Query: 126 KDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQD--VVQYDK 183
SST V+C C + CT C Y YGDGS T G+F QD + +D
Sbjct: 209 AKSSTYANVSCTDSACADL---DTNGCTGG-HCLYAVQYGDGSYTVGFFAQDTLTIAHDA 264
Query: 184 VSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKM 243
+ G FGCG + +G T G++G G+ +S+ Q + G
Sbjct: 265 IKG----------FRFGCGEKNNGLFGKTA-----GLMGLGRGKTSLTVQAYNKYG--GA 307
Query: 244 FAHCLDGI-NGGGIFAIGH-VVQPEVNKTPLVPN--QPHYSINMTAVQVGLDFLNLPTDV 299
FA+CL + G G G TP++ + Q Y + MT ++VG + + V
Sbjct: 308 FAYCLPALTTGTGYLDFGPGSAGNNARLTPMLTDKGQTFYYVGMTGIRVGGQQVPVAESV 367
Query: 300 FGVGDNKGTIIDSGTTLAYLPEMVYEPLVS---KIISQQPDLKVHTVHDEYTCFQYSESV 356
F GT++DSGT + LP Y L S K++ + K TC+ ++
Sbjct: 368 F---STAGTLVDSGTVITRLPATAYTALSSAFDKVMLARGYKKAPGYSILDTCYDFTGLS 424
Query: 357 DEGFPNVTFHFENSVSLKVYPHEYLFPF-EDLWCIGWQNSGMQSRDRKNMTLLGD 410
D P V+ F+ L V ++ E C+ + ++G D +++ ++G+
Sbjct: 425 DVELPTVSLVFQGGACLDVDVSGIVYAISEAQVCLAFASNG----DDESVAIVGN 475
>gi|116308959|emb|CAH66084.1| H0209A05.1 [Oryza sativa Indica Group]
Length = 530
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 84/285 (29%), Positives = 132/285 (46%), Gaps = 32/285 (11%)
Query: 75 LYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKEC-PRRSSLGIELTLYDIKDSSTGKF 133
L+YA + +GTP + + V +DTGSD+ W+ C QC C P S+ + Y SST +
Sbjct: 115 LHYALVTVGTPGQTFMVALDTGSDLFWLPC-QCDGCTPPASAASGSASFYIPSMSSTSQA 173
Query: 134 VTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDG-SSTTGYFVQDVVQYDKVSGDLQTTS 192
V C+ +FC +C+ + CPY +Y +S++G+ V+DV+ + D
Sbjct: 174 VPCNSQFCEL-----RKECSTTSQCPYKMVYVSADTSSSGFLVEDVLYLS--TEDAIPQI 226
Query: 193 TNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGIN 252
++FGCG Q+G+ + A +G+ G G S+ S LA G FA C +
Sbjct: 227 LKAQILFGCGQVQTGSF--LDAAAPNGLFGLGIDMISIPSILAQKGLTSNSFAMCFSR-D 283
Query: 253 GGGIFAIGHVVQPEVNKTPL--VPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTII 310
G G + G + +TPL P P Y+I+++ + VG +L TI
Sbjct: 284 GIGRISFGDQGSSDQEETPLDVNPQHPTYTISISEITVGNSLTDL---------EFSTIF 334
Query: 311 DSGTTLAYLPEMVYEPLVSKIISQQPDLKVHT---VHDEYTCFQY 352
D+GT+ YL + Y I+Q +VH D F+Y
Sbjct: 335 DTGTSFTYLADPAY-----TYITQSFHAQVHANRHAADSRIPFEY 374
>gi|115457374|ref|NP_001052287.1| Os04g0228000 [Oryza sativa Japonica Group]
gi|38346027|emb|CAE01958.2| OSJNBb0071D01.4 [Oryza sativa Japonica Group]
gi|113563858|dbj|BAF14201.1| Os04g0228000 [Oryza sativa Japonica Group]
gi|215740420|dbj|BAG97076.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222626225|gb|EEE60357.1| hypothetical protein OsJ_13479 [Oryza sativa Japonica Group]
Length = 530
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 84/285 (29%), Positives = 132/285 (46%), Gaps = 32/285 (11%)
Query: 75 LYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKEC-PRRSSLGIELTLYDIKDSSTGKF 133
L+YA + +GTP + + V +DTGSD+ W+ C QC C P S+ + Y SST +
Sbjct: 115 LHYALVTVGTPGQTFMVALDTGSDLFWLPC-QCDGCTPPASAASGSASFYIPSMSSTSQA 173
Query: 134 VTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDG-SSTTGYFVQDVVQYDKVSGDLQTTS 192
V C+ +FC +C+ + CPY +Y +S++G+ V+DV+ + D
Sbjct: 174 VPCNSQFCEL-----RKECSTTSQCPYKMVYVSADTSSSGFLVEDVLYLS--TEDAIPQI 226
Query: 193 TNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGIN 252
++FGCG Q+G+ + A +G+ G G S+ S LA G FA C +
Sbjct: 227 LKAQILFGCGQVQTGSF--LDAAAPNGLFGLGIDMISIPSILAQKGLTSNSFAMCFSR-D 283
Query: 253 GGGIFAIGHVVQPEVNKTPL--VPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTII 310
G G + G + +TPL P P Y+I+++ + VG +L TI
Sbjct: 284 GIGRISFGDQGSSDQEETPLDVNPQHPTYTISISEITVGNSLTDL---------EFSTIF 334
Query: 311 DSGTTLAYLPEMVYEPLVSKIISQQPDLKVHT---VHDEYTCFQY 352
D+GT+ YL + Y I+Q +VH D F+Y
Sbjct: 335 DTGTSFTYLADPAY-----TYITQSFHAQVHANRHAADSRIPFEY 374
>gi|118486628|gb|ABK95151.1| unknown [Populus trichocarpa]
Length = 393
Score = 105 bits (262), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 98/347 (28%), Positives = 139/347 (40%), Gaps = 47/347 (13%)
Query: 54 QRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNC----IQCKE 109
R+ + + LPL G+ P+G Y + IG P K Y++ VDTGSD+ W+ C +QC E
Sbjct: 14 NRVPSSIVLPLHGNVYPNGY--YNVTLNIGQPSKPYFLDVDTGSDLTWLQCDAPCVQCTE 71
Query: 110 CPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSS 169
P Y +++ V C C ++ C C Y Y DG S
Sbjct: 72 APH--------PYYRPRNN----LVPCMDPICQSLHSNGDHRCENPGQCDYEVEYADGGS 119
Query: 170 TTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSS 229
+ G V D + S + L GCG Q + +DG++G GK SS
Sbjct: 120 SFGVLVTDTFNLNFTSEKRHSPL----LALGCGYDQ---FPGGSHHPIDGVLGLGKGKSS 172
Query: 230 MISQLASSGGVRKMFAHCLDGINGGGIFAIGHVV-QPEVNKTPLVPNQPHYSINMTAVQV 288
++SQL+S G VR + HCL G GG +F + V TP+ P+ HYS
Sbjct: 173 IVSQLSSLGLVRNVIGHCLSGHGGGFLFFGDDLYDSSRVAWTPMSPDAKHYS-------P 225
Query: 289 GLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYT 348
GL L G N T DSG + YL Y+ L+S + + + D+ T
Sbjct: 226 GLAELTFDGKTTGF-KNLLTTFDSGASYTYLNSQAYQGLISLLKKELSGKPLREALDDQT 284
Query: 349 C---------FQYSESVDEGFPNVTFHFEN----SVSLKVYPHEYLF 382
F+ V + F F N L+ P YL
Sbjct: 285 LPLCWKGRKPFKSIRDVKKYFKTFALSFTNERKSKTELEFPPEAYLI 331
>gi|449530542|ref|XP_004172253.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
CELL 1-like [Cucumis sativus]
Length = 486
Score = 105 bits (262), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 96/331 (29%), Positives = 142/331 (42%), Gaps = 37/331 (11%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y++++GIG PP Y+ +DTGSD+ WV C C EC ++ E T S++
Sbjct: 147 GSGEYFSRVGIGRPPSPVYMVLDTGSDVSWVQCAPCAECYEQTDPXFEPT-----SSASF 201
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
++C+ E C + +++C N +C Y YGDGS T G FV + V L +T
Sbjct: 202 TSLSCETEQCKSL---DVSEC-RNGTCLYEVSYGDGSYTVGDFVTETVT-------LGST 250
Query: 192 STNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL--D 249
S G++ GCG G G + S SQL +S F++CL
Sbjct: 251 SL-GNIAIGCGHNNEGLFIGAAGLLGL-----GGGSLSFPSQLNASS-----FSYCLVDR 299
Query: 250 GINGGGIFAIGHVVQPEVNKTPLVPN---QPHYSINMTAVQVGLDFLNLPTDVFGVGD-- 304
+ + P+ PL N + + +T + VG L +P F + +
Sbjct: 300 DSDSTSTLDFNSPITPDAVTAPLHRNPNLDTFFYLGLTGMSVGGAVLPIPETSFQMSEDG 359
Query: 305 NKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLK-VHTVHDEYTCFQYSESVDEGFPNV 363
N G I+DSGT + L VY L + DL+ V TC+ S P V
Sbjct: 360 NGGIIVDSGTAVTRLQTTVYNVLRDAFVKSTHDLQTARGVALFDTCYDLSSKSRVEVPTV 419
Query: 364 TFHFENSVSLKVYPHEYLFPF--EDLWCIGW 392
+FHF N L + YL P E +C +
Sbjct: 420 SFHFANGNELPLPAKNYLIPVDSEGTFCFAF 450
>gi|242045120|ref|XP_002460431.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
gi|241923808|gb|EER96952.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
Length = 481
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 94/361 (26%), Positives = 152/361 (42%), Gaps = 33/361 (9%)
Query: 59 GVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGI 118
GV LP P G Y +G+GTP +D V DTGSD+ WV C C C ++
Sbjct: 122 GVSLP-ARRGVPLGTANYIVSVGLGTPKRDLLVVFDTGSDLSWVQCKPCDGCYQQHD--- 177
Query: 119 ELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDV 178
L+D S+T V C + C + G C++ C Y +YGD S T G +D
Sbjct: 178 --PLFDPSQSTTYSAVPCGAQECRRLDSG---SCSSG-KCRYEVVYGDMSQTDGNLARDT 231
Query: 179 VQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSG 238
+ S + +FGCG +G DG+ G G+ S+ SQ A+
Sbjct: 232 LTLGPSSSSSSSDQLQ-EFVFGCGDDDTGLFGKA-----DGLFGLGRDRVSLASQAAAKY 285
Query: 239 GVRKMFAHCLDGINGG-GIFAIGHVVQPEVNKTPLVPNQ---PHYSINMTAVQVGLDFLN 294
G F++CL + G ++G P T +V Y +N+ ++V +
Sbjct: 286 GA--GFSYCLPSSSTAEGYLSLGSAAPPNARFTAMVTRSDTPSFYYLNLVGIKVAGRTVR 343
Query: 295 LPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSK---IISQQPDLKVHTVHDEYTCFQ 351
+ VF GT+IDSGT + LP Y L S ++ + + + TC+
Sbjct: 344 VSPAVF---RTPGTVIDSGTVITRLPSRAYAALRSSFAGLMRRYSYKRAPALSILDTCYD 400
Query: 352 YSESVDEGFPNVTFHFENSVSLKVYPHEYLF-PFEDLWCIGWQNSGMQSRDRKNMTLLGD 410
++ P+V F+ +L + E L+ + C+ + ++G D ++ +LG+
Sbjct: 401 FTGRNKVQIPSVALLFDGGATLNLGFGEVLYVANKSQACLAFASNG----DDTSIAILGN 456
Query: 411 F 411
Sbjct: 457 M 457
>gi|30678047|ref|NP_565298.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|30102688|gb|AAP21262.1| At2g03200 [Arabidopsis thaliana]
gi|110736021|dbj|BAE99983.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330250580|gb|AEC05674.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 461
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 92/324 (28%), Positives = 144/324 (44%), Gaps = 51/324 (15%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G + ++ IG P Y VDTGSD++W C C EC + + ++D + SS+
Sbjct: 103 GSGEFLMELSIGNPAVKYSAIVDTGSDLIWTQCKPCTECFDQPT-----PIFDPEKSSSY 157
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANT-SCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQT 190
V C C+ + P ++C + +C YL YGD SST G + ++ D +
Sbjct: 158 SKVGCSSGLCNAL---PRSNCNEDKDACEYLYTYGDYSSTRGLLATETFTFE----DENS 210
Query: 191 TSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDG 250
S G FGCG G+ S G++G G+ S+ISQL + F++CL
Sbjct: 211 ISGIG---FGCGVENEGDGFSQGS----GLVGLGRGPLSLISQLKET-----KFSYCLTS 258
Query: 251 IN--------------GGGIFAIGHVVQPEVNKTPLV---PNQPH-YSINMTAVQVGLDF 292
I G + G + EV KT + P+QP Y + + + VG
Sbjct: 259 IEDSEASSSLFIGSLASGIVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKR 318
Query: 293 LNLPTDVFGVGDN--KGTIIDSGTTLAYLPEMVYEPLVSKIISQQP---DLKVHTVHDEY 347
L++ F + ++ G IIDSGTT+ YL E ++ L + S+ D T D
Sbjct: 319 LSVEKSTFELAEDGTGGMIIDSGTTITYLEETAFKVLKEEFTSRMSLPVDDSGSTGLD-- 376
Query: 348 TCFQYSESVDE-GFPNVTFHFENS 370
CF+ ++ P + FHF+ +
Sbjct: 377 LCFKLPDAAKNIAVPKMIFHFKGA 400
>gi|242091327|ref|XP_002441496.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
gi|241946781|gb|EES19926.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
Length = 466
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 91/345 (26%), Positives = 149/345 (43%), Gaps = 34/345 (9%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y+ + +GTP + + + DTGSD+ WV C + + G ++ S +
Sbjct: 97 GTGQYFVRFRVGTPAQPFVLVADTGSDLTWVKC-RGAGAAAGTGAGSPARVFRTAASKSW 155
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTS-CPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQT 190
+ C + C L +C++ S C Y Y DGS+ G VV D + L +
Sbjct: 156 APIACSSDTCTSYVPFSLANCSSPASPCAYDYRYRDGSAARG-----VVGTDSATIALSS 210
Query: 191 TSTNGS-------------LIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASS 237
S G ++ GC A D + ++ DG++ G SN S S+ A+
Sbjct: 211 GSGRGGGDSSGGRRAKLQGVVLGCAA----TYDGQSFQSSDGVLSLGNSNISFASRAAAR 266
Query: 238 GGVRKMFAHCL-DGI---NGGGIFAIGHVVQPEVNKTPLVPNQ---PHYSINMTAVQVGL 290
G R F++CL D + N G +TPL+ ++ P Y++ + AV V
Sbjct: 267 FGGR--FSYCLVDHLAPRNATSYLTFGPGATAPAAQTPLLLDRRMTPFYAVTVDAVYVAG 324
Query: 291 DFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCF 350
+ L++P DV+ V N G I+DSGT+L L Y +V+ + L T+ C+
Sbjct: 325 EALDIPADVWDVDRNGGAILDSGTSLTILATPAYRAVVTALSKHLAGLPRVTMDPFEYCY 384
Query: 351 QYSESVDEGFPNVTFHFENSVSLKVYPHEYLF-PFEDLWCIGWQN 394
++++ P + HF S L+ Y+ + CIG Q
Sbjct: 385 NWTDAGALEIPKMEVHFAGSARLEPPAKSYVIDAAPGVKCIGVQE 429
>gi|326520736|dbj|BAJ92731.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 499
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 99/355 (27%), Positives = 152/355 (42%), Gaps = 44/355 (12%)
Query: 67 SSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQC-KECPRRSSLGIELTLYDI 125
S R G Y +G+GTP Y V DTGSD WV C C +C ++ L+D
Sbjct: 154 SGRAVSTGNYVVTVGLGTPASKYTVVFDTGSDTTWVQCRPCVVKCYKQKG-----PLFDP 208
Query: 126 KDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQD--VVQYDK 183
SST V+C C + CT C Y YGDGS T G+F QD + +D
Sbjct: 209 AKSSTYANVSCTDSACADL---DTNGCTGG-HCLYAVQYGDGSYTVGFFAQDTLTIAHDA 264
Query: 184 VSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKM 243
+ G FGCG + +G T G++G G+ +S+ Q + G
Sbjct: 265 IKG----------FRFGCGEKNNGLFGKTA-----GLMGLGRGKTSLTVQAYNKYG--GA 307
Query: 244 FAHCLDGI-NGGGIFAIGH-VVQPEVNKTPLVPN--QPHYSINMTAVQVGLDFLNLPTDV 299
FA+CL + G G G TP++ + Q Y + MT ++VG + + V
Sbjct: 308 FAYCLPALTTGTGYLDFGPGSAGNNARLTPMLTDKGQTFYYVGMTGIRVGGQQVPVAESV 367
Query: 300 FGVGDNKGTIIDSGTTLAYLPEMVYEPLVS---KIISQQPDLKVHTVHDEYTCFQYSESV 356
F GT++DSGT + LP Y L S K++ + K TC+ ++
Sbjct: 368 F---STAGTLVDSGTVITRLPATAYTALSSAFDKVMLARGYKKAPGYSILDTCYDFTGLS 424
Query: 357 DEGFPNVTFHFENSVSLKVYPHEYLFPF-EDLWCIGWQNSGMQSRDRKNMTLLGD 410
D P V+ F+ L V ++ E C+ + ++G D +++ ++G+
Sbjct: 425 DVELPTVSLVFQGGACLDVDVSGIVYAISEAQVCLAFASNG----DDESVAIVGN 475
>gi|224074591|ref|XP_002304395.1| predicted protein [Populus trichocarpa]
gi|222841827|gb|EEE79374.1| predicted protein [Populus trichocarpa]
Length = 443
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 105/343 (30%), Positives = 156/343 (45%), Gaps = 38/343 (11%)
Query: 70 PDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSS 129
P+G G Y+ K+ IGTP + V DTGSD+ WV C+ C C R+ S L+D SS
Sbjct: 89 PNG-GEYFMKMSIGTPLVEVIVIADTGSDLTWVQCLPCDPCYRQKS-----PLFDPSRSS 142
Query: 130 TGKFVTCDQEFCHGVYGGPLTDCTANTS-CPYLEIYGDGSSTTGYFVQDVVQYDKVSGDL 188
+ + + C FC+ + CT +T+ C Y YGD S T G + S
Sbjct: 143 SYRHMLCGSRFCNALDVSEQA-CTMDTNICEYHYSYGDKSYTNGNLATEKFTIGSTSSRP 201
Query: 189 QTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL 248
S ++FGCG G D E GI+G G S++SQL+S ++ F++CL
Sbjct: 202 VHLS---PIVFGCGTGNGGTFD----ELGSGIVGLGGGALSLVSQLSSI--IKGKFSYCL 252
Query: 249 ------DGINGGGIFAIGHVVQ-PEVNKTPLVPNQP--HYSINMTAVQVGLDFLNLPTDV 299
+ F V+ P+V TPLV QP +Y + + A+ VG L +
Sbjct: 253 VPLSEQSNVTSKIKFGTDSVISGPQVVSTPLVSKQPDTYYYVTLEAISVGNKRLPYTNGL 312
Query: 300 FGVGDNKG-TIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEY----TCFQYSE 354
KG IIDSGTTL +L + L +++ + +K V D CF+ +
Sbjct: 313 LNGNVEKGNVIIDSGTTLTFLDSEFFTEL-ERVLEET--VKAERVSDPRGLFSVCFRSAG 369
Query: 355 SVDEGFPNVTFHFENSVSLKVYP-HEYLFPFEDLWCIGWQNSG 396
+D P + HF N +K+ P + ++ EDL C +S
Sbjct: 370 DID--LPVIAVHF-NDADVKLQPLNTFVKADEDLLCFTMISSN 409
>gi|356555248|ref|XP_003545946.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 102/377 (27%), Positives = 164/377 (43%), Gaps = 46/377 (12%)
Query: 32 KYRYAGRERSLSLLKEHDARRQQRIL----AGVDLPLGGSSRPDGVGLYYAKIGIGTPPK 87
K R + ++ +D+RR+ + A V++P+ S R D +G Y+A++ +G+P +
Sbjct: 66 KLRRQRMNQRWGVVSNYDSRRKGFEMTTTPAEVEMPMH-SGRDDALGEYFAEVKVGSPGQ 124
Query: 88 DYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGG 147
+++ VDTGS+ W+NC + E +S ++ L ++ F V
Sbjct: 125 RFWLVVDTGSEFTWLNCSKSFEAVTCASRKCKVDLSEL--------------FSLSVCPK 170
Query: 148 PLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSG 207
P C + S Y DGSS G+F D + +G Q N L GC +
Sbjct: 171 PSDPCLYDIS------YADGSSAKGFFGTDSITVGLTNGK-QGKLNN--LTIGC-TKSML 220
Query: 208 NLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL-DGINGGGI---FAIG--- 260
N + NEE GI+G G + S I + A+ G + F++CL D ++ + IG
Sbjct: 221 NGVNFNEET-GGILGLGFAKDSFIDKAANKYGAK--FSYCLVDHLSHRSVSSNLTIGGHH 277
Query: 261 -HVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYL 319
+ E+ +T L+ P Y +N+ + +G L +P V+ GT+IDSGTTL L
Sbjct: 278 NAKLLGEIRRTELILFPPFYGVNVVGISIGGQMLKIPPQVWDFNAEGGTLIDSGTTLTSL 337
Query: 320 PEMVYEPLVSKIISQQPDLKVHTVHD----EYTCFQYSESVDEGFPNVTFHFENSVSLKV 375
YE + + +K T D E+ CF D P + FHF +
Sbjct: 338 LLPAYEAVFEALTKSLTKVKRVTGEDFDALEF-CFDAEGFDDSVVPRLVFHFAGGARFEP 396
Query: 376 YPHEYLFPFEDL-WCIG 391
Y+ L CIG
Sbjct: 397 PVKSYIIDVAPLVKCIG 413
>gi|297847186|ref|XP_002891474.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297337316|gb|EFH67733.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 578
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 85/280 (30%), Positives = 133/280 (47%), Gaps = 29/280 (10%)
Query: 62 LPLGGSSRPDGVGLYYAKIGIGTPP--KDYYVQVDTGSDIMWVNC-IQCKECPRRSSLGI 118
P+GG+ PDG LYY +I +G P + Y++ +DTGSD+ W+ C C C + ++
Sbjct: 186 FPVGGNVYPDG--LYYTRILVGKPEDGQYYHLDIDTGSDLTWIQCDAPCTSCAKGAN--- 240
Query: 119 ELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTD-CTANTSCPYLEIYGDGSSTTGYFVQD 177
LY + + V + FC V LT+ C + C Y Y D S + G +D
Sbjct: 241 --QLYKPRKDN---LVRSSEPFCVEVQRNQLTEHCESCHQCDYEIEYADHSYSMGVLTKD 295
Query: 178 VVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASS 237
+G L ++FGCG Q G L +T + DGI+G ++ S+ SQLAS
Sbjct: 296 KFHLKLHNGSL----AESDIVFGCGYDQQGLLLNTLLKT-DGILGLSRAKISLPSQLASR 350
Query: 238 GGVRKMFAHCL-DGINGGGIFAIGHVVQPEVNKT--PLVPNQPH---YSINMTAVQVGLD 291
G + + HCL +NG G +G + P T P++ + PH Y + +T + G
Sbjct: 351 GIISNVVGHCLASDLNGEGYIFMGSDLVPSHGMTWVPML-HHPHLEVYQMQVTKMSYGNA 409
Query: 292 FLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKI 331
L+L + VG + D+G++ Y P Y LV+ +
Sbjct: 410 MLSLDGENGRVGK---VLFDTGSSYTYFPNQAYSQLVTSL 446
>gi|357130848|ref|XP_003567056.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 448
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 93/349 (26%), Positives = 139/349 (39%), Gaps = 42/349 (12%)
Query: 67 SSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIK 126
S P G Y+A +G+GTPP + +DTGSD++W+ C C C R+ S LYD +
Sbjct: 90 SGLPFASGEYFASVGVGTPPTPALLVIDTGSDVVWLQCKPCVHCYRQLS-----PLYDPR 144
Query: 127 DSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSG 186
SST C C P T C Y +YGD SST+G D + +
Sbjct: 145 GSSTYAQTPCSPPQCR----NPQTCDGTTGGCGYRIVYGDASSTSGNLATDRLVF----- 195
Query: 187 DLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAH 246
++ G++ GCG G S G++G + N+S +Q+A S G + FA+
Sbjct: 196 --SNDTSVGNVTLGCGHDNEGLFGSAA-----GLLGVARGNNSFATQVADSYG--RYFAY 246
Query: 247 CLDGINGGG------IFAIGHVVQPEVNKTPLV--PNQPH-YSINMTAVQVGLD----FL 293
CL G +F P TPL P +P Y ++M VG + F
Sbjct: 247 CLGDRTRSGSSSSYLVFGRTAPEPPSSVFTPLRSNPRRPSLYYVDMVGFSVGGEPVTGFS 306
Query: 294 NLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYT----C 349
N + G ++DSGT++ Y L ++ + + V + C
Sbjct: 307 NASLSLDPATGRGGVVVDSGTSITRFARDAYGALRDAFDARAAKVGMRKVGRGISVFDAC 366
Query: 350 FQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFED--LWCIGWQNSG 396
+ P V HF + + P YL P E C + +G
Sbjct: 367 YDLRGVAVADAPGVVLHFAGGADVALPPENYLVPEESGRYHCFALEAAG 415
>gi|224053042|ref|XP_002297678.1| predicted protein [Populus trichocarpa]
gi|222844936|gb|EEE82483.1| predicted protein [Populus trichocarpa]
Length = 482
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 93/305 (30%), Positives = 137/305 (44%), Gaps = 43/305 (14%)
Query: 87 KDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYG 146
+ V VDTGSD+ WV C CK C + +++ S + + V C C +
Sbjct: 144 RKMTVIVDTGSDLSWVQCQPCKRCYNQQD-----PVFNPSTSPSYRTVLCSSPTCQSLQS 198
Query: 147 --GPLTDCTAN-TSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGA 203
G L C +N SC Y+ YGDGS T G + + DL ++ + IFGCG
Sbjct: 199 ATGNLGVCGSNPPSCNYVVNYGDGSYTRGELGTEHL-------DLGNSTAVNNFIFGCGR 251
Query: 204 RQSGNLDSTNEEALDGIIGFGKSNSSMISQLASS-GGVRKMFAHCL--DGINGGGIFAIG 260
G + G++G G+S+ S+ISQ ++ GGV F++CL G +G
Sbjct: 252 NNQGLFGGAS-----GLVGLGRSSLSLISQTSAMFGGV---FSYCLPITETEASGSLVMG 303
Query: 261 HVVQPEVNKTP-----LVPNQ--PHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSG 313
N TP ++PN P Y +N+T + VG + P+ FG G +IDSG
Sbjct: 304 GNSSVYKNTTPISYTRMIPNPQLPFYFLNLTGITVGSVAVQAPS--FG---KDGMMIDSG 358
Query: 314 TTLAYLPEMVYEPLVSKIISQ---QPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENS 370
T + LP +Y+ L + + Q P + D TCF S + PN+ HFE +
Sbjct: 359 TVITRLPPSIYQALKDEFVKQFSGFPSAPAFMILD--TCFNLSGYQEVEIPNIKMHFEGN 416
Query: 371 VSLKV 375
L V
Sbjct: 417 AELNV 421
>gi|223949775|gb|ACN28971.1| unknown [Zea mays]
gi|414590177|tpg|DAA40748.1| TPA: hypothetical protein ZEAMMB73_257146 [Zea mays]
Length = 510
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 109/407 (26%), Positives = 172/407 (42%), Gaps = 59/407 (14%)
Query: 33 YRYAGRERSLSLLKEHDARR--QQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYY 90
+R A R + RR +R++A V+ S G G Y + +GTPP+ +
Sbjct: 109 HRRAARSGVARMPASSSPRRALSERMVATVE-----SGVAVGSGEYLIDVYVGTPPRRFR 163
Query: 91 VQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLT 150
+ +DTGSD+ W+ C C +C + ++D SS+ + VTC + C G+ P
Sbjct: 164 MIMDTGSDLNWLQCAPCLDCFEQRG-----PVFDPAASSSYRNVTCGDQRC-GLVAPPEA 217
Query: 151 DCT----ANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQS 206
A SCPY YGD S+TTG + + ++ + +G ++FGCG R
Sbjct: 218 PRACRRPAEDSCPYYYWYGDQSNTTGDLALESFTVN-LTAPGASRRVDG-VVFGCGHRNR 275
Query: 207 GNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL--DGINGGGIFAIGH--- 261
G G+ S SQL + G F++CL G + G G
Sbjct: 276 GLFHGAAGLLGL-----GRGPLSFASQLRAVYG--HTFSYCLVEHGSDAGSKVVFGEDYL 328
Query: 262 -VVQPEVNKTPLVPNQP----HYSINMTAVQVGLDFLNLPTDVFGVGDN--KGTIIDSGT 314
+ P++ T P Y + + V VG D LN+ +D + VG + GTIIDSGT
Sbjct: 329 VLAHPQLKYTAFAPTSSPADTFYYVKLKGVLVGGDLLNISSDTWDVGKDGSGGTIIDSGT 388
Query: 315 TLAYLPEMVYE-------PLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHF 367
TL+Y E Y+ L+S++ PD V C+ S P ++ F
Sbjct: 389 TLSYFVEPAYQVIRQAFVDLMSRLYPLIPDFPVLN-----PCYNVSGVERPEVPELSLLF 443
Query: 368 ENSVSLKVYPHEYLFPFED---LWCIGWQNSGMQSRDRKNMTLLGDF 411
+ +P E F D + C+ ++ R M+++G+F
Sbjct: 444 ADGAVWD-FPAENYFVRLDPDGIMCL-----AVRGTPRTGMSIIGNF 484
>gi|356557014|ref|XP_003546813.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 435
Score = 105 bits (261), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 101/330 (30%), Positives = 149/330 (45%), Gaps = 39/330 (11%)
Query: 70 PDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSS 129
PD G Y + IG+PP + VDTGS ++W+ C C C E L++ SS
Sbjct: 84 PDK-GEYLMRFYIGSPPVERLAMVDTGSSLIWLQCSPCHNC-----FPQETPLFEPLKSS 137
Query: 130 TGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQ 189
T K+ TCD + C + DC C Y +YGD S + G + + + +G Q
Sbjct: 138 TYKYATCDSQPCT-LLQPSQRDCGKLGQCIYGIMYGDKSFSVGILGTETLSFGS-TGGAQ 195
Query: 190 TTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL- 248
T S + IFGCG + + ++N+ + GI G G S++SQL + G + F++CL
Sbjct: 196 TVSFPNT-IFGCGVDNNFTIYTSNK--VMGIAGLGAGPLSLVSQLGAQIGHK--FSYCLL 250
Query: 249 --DGINGGGI-FAIGHVVQPE-VNKTPLV--PNQP-HYSINMTAVQVGLDFLNLPTDVFG 301
D + + F ++ V TPL+ P+ P +Y +N+ AV +G V
Sbjct: 251 PYDSTSTSKLKFGSEAIITTNGVVSTPLIIKPSLPTYYFLNLEAVTIG-------QKVVS 303
Query: 302 VGDNKGTI-IDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHD----EYTCFQYSESV 356
G G I IDSGT L YL Y V+ S Q L V + D TCF
Sbjct: 304 TGQTDGNIVIDSGTPLTYLENTFYNNFVA---SLQETLGVKLLQDLPSPLKTCF--PNRA 358
Query: 357 DEGFPNVTFHFENSVSLKVYPHEYLFPFED 386
+ P++ F F + S+ + P L P D
Sbjct: 359 NLAIPDIAFQFTGA-SVALRPKNVLIPLTD 387
>gi|116786826|gb|ABK24255.1| unknown [Picea sitchensis]
gi|148906052|gb|ABR16185.1| unknown [Picea sitchensis]
Length = 485
Score = 104 bits (260), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 95/337 (28%), Positives = 145/337 (43%), Gaps = 44/337 (13%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y+++IG+G P +D + +DTGSD+ W+ C C +C ++S +Y+ SS+
Sbjct: 141 GSGEYFSRIGVGAPRRDQLMVLDTGSDVTWIQCEPCSDCYQQSD-----PIYNPALSSSY 195
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
K V C C + ++ C+ N SC Y YGDGS T G F + + LQ
Sbjct: 196 KLVGCQANLCQQL---DVSGCSRNGSCLYQVSYGDGSYTQGNFATETLTLG--GAPLQNV 250
Query: 192 STNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI 251
+ GCG G G + S SQL G K+F++CL
Sbjct: 251 A------IGCGHDNEGLFVGAAGLLGL-----GGGSLSFPSQLTDENG--KIFSYCLVDR 297
Query: 252 N---------GGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGV 302
+ G G V+ P + + L Y ++++ + VG L++ VFG+
Sbjct: 298 DSESSSTLQFGRAAVPNGAVLAPMLKNSRL---DTFYYVSLSGISVGGKMLSISDSVFGI 354
Query: 303 --GDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDL-KVHTVHDEYTCFQYS--ESVD 357
N G I+DSGT + L Y+ L + +L V TC+ S ESVD
Sbjct: 355 DASGNGGVIVDSGTAVTRLQTAAYDSLRDAFRAGTKNLPSTDGVSLFDTCYDLSSKESVD 414
Query: 358 EGFPNVTFHFENSVSLKVYPHEYLFPFEDL--WCIGW 392
P V FHF S+ + YL P + + +C +
Sbjct: 415 --VPTVVFHFSGGGSMSLPAKNYLVPVDSMGTFCFAF 449
>gi|225216973|gb|ACN85264.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
Length = 517
Score = 104 bits (260), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 102/328 (31%), Positives = 134/328 (40%), Gaps = 54/328 (16%)
Query: 67 SSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQC-KECPRRSSLGIELTLYDI 125
S R G G Y +G+GTP Y V DTGSD WV C C C + + L+D
Sbjct: 169 SGRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQ-----QEKLFDP 223
Query: 126 KDSSTGKFVTCDQEFC-----HGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVV- 179
SST V+C C HG GG C Y YGDGS + G+F D +
Sbjct: 224 VRSSTYANVSCAAPACSDLNIHGCSGG---------HCLYGVQYGDGSYSIGFFAMDTLT 274
Query: 180 --QYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASS 237
YD V G FGCG R G E A G++G G+ +S+ Q
Sbjct: 275 LSSYDAVKG----------FRFGCGERNEGLF---GEAA--GLLGLGRGKTSLPVQTYDK 319
Query: 238 -GGVRKMFAHCLDGINGGGIF-----AIGHVVQPEVNKTPLVPNQP-HYSINMTAVQVGL 290
GGV FAHCL + G + + L N P Y I MT ++VG
Sbjct: 320 YGGV---FAHCLPARSTGTGYLDFGAGSPAAASARLTTPMLTDNGPTFYYIGMTGIRVGG 376
Query: 291 DFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPL---VSKIISQQPDLKVHTVHDEY 347
L++P VF GTI+DSGT + LP Y L + ++ + K V
Sbjct: 377 QLLSIPQSVFA---TAGTIVDSGTVITRLPPPAYSSLRYAFAAAMAARGYKKAPAVSLLD 433
Query: 348 TCFQYSESVDEGFPNVTFHFENSVSLKV 375
TC+ ++ P V+ F+ L V
Sbjct: 434 TCYDFTGMSQVAIPTVSLLFQGGARLDV 461
>gi|15230868|ref|NP_189198.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|11994761|dbj|BAB03090.1| chloroplast nucleoid DNA binding protein-like; nucellin-like
protein [Arabidopsis thaliana]
gi|189339286|gb|ACD89063.1| At3g25700 [Arabidopsis thaliana]
gi|332643533|gb|AEE77054.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 452
Score = 104 bits (260), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 96/350 (27%), Positives = 153/350 (43%), Gaps = 41/350 (11%)
Query: 71 DGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSST 130
G G Y+ + IG PP+ + DTGSD++WV C C+ C S T++ + SST
Sbjct: 79 SGSGQYFVDLRIGQPPQSLLLIADTGSDLVWVKCSACRNCSHHS----PATVFFPRHSST 134
Query: 131 GKFVTCDQEFCHGVYG---GPLTDCTA-NTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSG 186
C C V P+ + T +++C Y Y DGS T+G F ++ SG
Sbjct: 135 FSPAHCYDPVCRLVPKPDRAPICNHTRIHSTCHYEYGYADGSLTSGLFARETTSLKTSSG 194
Query: 187 DLQTTSTNGSLIFGCGARQSG-NLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFA 245
+ S+ FGCG R SG ++ T+ +G++G G+ S SQL G + F+
Sbjct: 195 K---EARLKSVAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQLGRRFGNK--FS 249
Query: 246 HCLDG-----------INGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLN 294
+CL I G G I + + PL P Y + + +V V L
Sbjct: 250 YCLMDYTLSPPPTSYLIIGNGGDGISKLFFTPLLTNPLSPT--FYYVKLKSVFVNGAKLR 307
Query: 295 LPTDVFGVGD--NKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYT---- 348
+ ++ + D N GT++DSGTTLA+L E Y +++ + + +K+ + D T
Sbjct: 308 IDPSIWEIDDSGNGGTVVDSGTTLAFLAEPAYRSVIAAVRRR---VKL-PIADALTPGFD 363
Query: 349 -CFQYS--ESVDEGFPNVTFHFENSVSLKVYPHEYLFPFED-LWCIGWQN 394
C S ++ P + F F P Y E+ + C+ Q+
Sbjct: 364 LCVNVSGVTKPEKILPRLKFEFSGGAVFVPPPRNYFIETEEQIQCLAIQS 413
>gi|255586856|ref|XP_002534038.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223525945|gb|EEF28342.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 533
Score = 104 bits (260), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 79/260 (30%), Positives = 126/260 (48%), Gaps = 30/260 (11%)
Query: 75 LYYAKIGIGTPPKDYYVQVDTGSDIMWV-------NCIQCKECPRRSSLGIELTLYDIKD 127
L+YA + IGTP Y V +DTGSD+ W+ C+Q + P S I+ +Y
Sbjct: 112 LHYANVSIGTPSLSYLVALDTGSDLFWLPCDCTNSGCVQGLQFP--SGEQIDFNIYRPNA 169
Query: 128 SSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPY-LEIYGDGSSTTGYFVQDVVQYDKVSG 186
SST + + C+ C P +A ++CPY ++ +G+S+TG V+D++ +
Sbjct: 170 SSTSQTIPCNNTLCSRQSRCP----SAQSTCPYQVQYLSNGTSSTGVLVEDLLHL--TTD 223
Query: 187 DLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAH 246
D Q+ + + +IFGCG Q+G+ + A +G+ G G +N S+ S LA G F+
Sbjct: 224 DAQSRALDAKIIFGCGRVQTGSF--LDGAAPNGLFGLGMTNISVPSTLAREGYTSNSFSM 281
Query: 247 CLDGINGGGIFAIGHVVQPEVNKTPLVPNQ--PHYSINMTAVQVGLDFLNLPTDVFGVGD 304
C G +G G + G +TP Q P Y++++T + VG +L
Sbjct: 282 CF-GRDGIGRISFGDTGSSGQGETPFNLRQLHPTYNVSITKINVGGRDADL--------- 331
Query: 305 NKGTIIDSGTTLAYLPEMVY 324
I DSGT+ YL + Y
Sbjct: 332 EFSAIFDSGTSFTYLNDPAY 351
>gi|297817972|ref|XP_002876869.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297322707|gb|EFH53128.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 462
Score = 104 bits (260), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 93/322 (28%), Positives = 142/322 (44%), Gaps = 51/322 (15%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G + ++ IG P Y VDTGSD++W C C EC + + ++D + SS+
Sbjct: 104 GSGEFLMELSIGNPAVKYAAIVDTGSDLIWTQCKPCTECFDQPT-----PIFDPEKSSSY 158
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANT-SCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQT 190
V C C+ + P ++C + SC YL YGD SST G + ++ D +
Sbjct: 159 SKVGCSSGLCNAL---PRSNCNEDKDSCEYLYTYGDYSSTRGLLATETFTFE----DENS 211
Query: 191 TSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDG 250
S G FGCG G+ S G++G G+ S+ISQL + F++CL
Sbjct: 212 ISGIG---FGCGVENEGDGFSQGS----GLVGLGRGPLSLISQLKET-----KFSYCLTS 259
Query: 251 IN--------------GGGIFAIGHVVQPEVNKTPLV---PNQPH-YSINMTAVQVGLDF 292
I G + G + EV KT + P+QP Y + + + VG
Sbjct: 260 IEDSEASSSLFIGSLASGIVNKTGANLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKR 319
Query: 293 LNLPTDVFGVGDN--KGTIIDSGTTLAYLPEMVYEPLVSKIISQQP---DLKVHTVHDEY 347
L++ F + ++ G IIDSGTT+ YL E ++ L + S+ D T D
Sbjct: 320 LSVEKSTFELSEDGTGGMIIDSGTTITYLEETAFKVLKEEFTSRMSLPVDDSGSTGLD-- 377
Query: 348 TCFQYSESVDE-GFPNVTFHFE 368
CF+ + P + FHF+
Sbjct: 378 LCFKLPNAAKNIAVPKLIFHFK 399
>gi|449434646|ref|XP_004135107.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 486
Score = 104 bits (260), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 94/331 (28%), Positives = 143/331 (43%), Gaps = 37/331 (11%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y++++GIG PP Y+ +DTGSD+ WV C C EC ++ +++ S++
Sbjct: 147 GSGEYFSRVGIGRPPSPVYMVLDTGSDVSWVQCAPCAECYEQTD-----PIFEPTSSASF 201
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
++C+ E C + +++C N +C Y YGDGS T G FV + V L +T
Sbjct: 202 TSLSCETEQCKSL---DVSEC-RNGTCLYEVSYGDGSYTVGDFVTETVT-------LGST 250
Query: 192 STNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL--D 249
S G++ GCG G G + S SQL +S F++CL
Sbjct: 251 SL-GNIAIGCGHNNEGLFIGAAGLLGL-----GGGSLSFPSQLNASS-----FSYCLVDR 299
Query: 250 GINGGGIFAIGHVVQPEVNKTPLVPN---QPHYSINMTAVQVGLDFLNLPTDVFGVGD-- 304
+ + P+ PL N + + +T + VG L +P F + +
Sbjct: 300 DSDSTSTLDFNSPITPDAVTAPLHRNPNLDTFFYLGLTGMSVGGAVLPIPETSFQMSEDG 359
Query: 305 NKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLK-VHTVHDEYTCFQYSESVDEGFPNV 363
N G I+DSGT + L VY L + DL+ V TC+ S P V
Sbjct: 360 NGGIIVDSGTAVTRLQTTVYNVLRDAFVKSTHDLQTARGVALFDTCYDLSSKSRVEVPTV 419
Query: 364 TFHFENSVSLKVYPHEYLFPF--EDLWCIGW 392
+FHF N L + YL P E +C +
Sbjct: 420 SFHFANGNELPLPAKNYLIPVDSEGTFCFAF 450
>gi|297842769|ref|XP_002889266.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297335107|gb|EFH65525.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 489
Score = 104 bits (260), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 103/360 (28%), Positives = 165/360 (45%), Gaps = 54/360 (15%)
Query: 38 RERSLSL-LKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTG 96
R +SL L +K + ++ ++ +PL + + + Y + +G K+ + VDTG
Sbjct: 100 RVQSLQLRIKAMTSSTTEQSVSETQIPLTSGIKLETLN-YIVTVELGG--KNMSLIVDTG 156
Query: 97 SDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYG-----GPLT- 150
SD+ WV C C+ C + LYD SS+ K V C+ C + GP
Sbjct: 157 SDLTWVQCQPCRSCYNQQG-----PLYDPSVSSSYKTVFCNSSTCQDLVAATGNSGPCGG 211
Query: 151 -DCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNL 209
+ T+C Y+ YGDGS T G D+ V GD + +L+FGCG G
Sbjct: 212 FNGVVKTTCEYVVSYGDGSYTRG----DLASESIVLGDTKLE----NLVFGCGRNNKGLF 263
Query: 210 DSTNEEALDGIIGFGKSNSSMISQ-LASSGGVRKMFAHCLDGINGG--GIFAIGHVVQPE 266
+ G++G G+S+ S++SQ L + GV F++CL + G G + G+
Sbjct: 264 GGAS-----GLMGLGRSSVSLVSQTLKTFNGV---FSYCLPSLEDGASGTLSFGNDFSVY 315
Query: 267 VNK-----TPLVPN---QPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAY 318
N TPLV N + Y +N+T +G + L T FG +G +IDSGT +
Sbjct: 316 KNSTSVFYTPLVQNPQLRSFYILNLTGASIG--GVELKTLSFG----RGILIDSGTVITR 369
Query: 319 LPEMVYEPLVSKIISQ---QPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKV 375
LP +Y+ + ++ + Q P +++ D TCF + D P + FE + L+V
Sbjct: 370 LPPSIYKAVKTEFLKQFSGFPSAPGYSILD--TCFNLTSYEDISIPTIKMIFEGNAELEV 427
>gi|168000296|ref|XP_001752852.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696015|gb|EDQ82356.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 384
Score = 104 bits (260), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 86/308 (27%), Positives = 137/308 (44%), Gaps = 30/308 (9%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y + +G+PP+ + V VDTGSD+ WV C+ C+ C ++ +D S +
Sbjct: 35 GNGEYLMTLTLGSPPQSFDVIVDTGSDLNWVQCLPCRVCYQQPG-----PKFDPSKSRSF 89
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
+ C C+ V PL C AN C Y YGD S+T G + + + +G T
Sbjct: 90 RKAACTDNLCN-VSALPLKACAANV-CQYQYTYGDQSNTNGDLAFETISLNNGAG----T 143
Query: 192 STNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI 251
+ + FGCG + G G++G G+ S+ SQL+ + F++CL +
Sbjct: 144 QSVPNFAFGCGTQNLGTF-----AGAAGLVGLGQGPLSLNSQLSHT--FANKFSYCLVSL 196
Query: 252 N--GGGIFAIGHV-VQPEVNKTPLVPNQPH---YSINMTAVQVGLDFLNLPTDVFGVGDN 305
N G + + T +V N H Y + + +++VG LNL VF + +
Sbjct: 197 NSLSASPLTFGSIAAAANIQYTSIVVNARHPTYYYVQLNSIEVGGQPLNLAPSVFAIDQS 256
Query: 306 K---GTIIDSGTTLAYLPEMVYEPLVSKIIS--QQPDLKVHTVHDEYTCFQYSESVDEGF 360
GTIIDSGTT+ L Y ++ S P L + + CF + +
Sbjct: 257 TGRGGTIIDSGTTITMLTLPAYSAVLRAYESFVNYPRLD-GSAYGLDLCFNIAGVSNPSV 315
Query: 361 PNVTFHFE 368
P++ F F+
Sbjct: 316 PDMVFKFQ 323
>gi|115479485|ref|NP_001063336.1| Os09g0452400 [Oryza sativa Japonica Group]
gi|51535935|dbj|BAD38017.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113631569|dbj|BAF25250.1| Os09g0452400 [Oryza sativa Japonica Group]
gi|215693279|dbj|BAG88661.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 441
Score = 104 bits (260), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 81/315 (25%), Positives = 137/315 (43%), Gaps = 38/315 (12%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKF 133
G Y +GIG+PP+ + +DTGSD++W C C C + + ++ S++
Sbjct: 86 GEYLMDVGIGSPPRYFSAMIDTGSDLIWTQCAPCLLCVEQPT-----PYFEPAKSTSYAS 140
Query: 134 VTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTST 193
+ C C+ +Y PL C N +C Y YGD +S+ G + + S +
Sbjct: 141 LPCSSAMCNALY-SPL--CFQN-ACVYQAFYGDSASSAGVLANETFTFGTNSTRVAVPRV 196
Query: 194 NGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLAS-----------SGGVRK 242
+ FGCG +G L + + G++GFG+ S++SQL S S +
Sbjct: 197 S----FGCGNMNAGTLFNGS-----GMVGFGRGALSLVSQLGSPRFSYCLTSFMSPATSR 247
Query: 243 MFAHCLDGINGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGV 302
++ +N + G V P +P Y +NMT + V D L + VF +
Sbjct: 248 LYFGAYATLNSTNTSSSGPVQSTPFIVNPALPTM--YFLNMTGISVAGDLLPIDPSVFAI 305
Query: 303 GDNKGT---IIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVH-TVHDEY-TCFQYSESVD 357
+ GT IIDSGTT+ +L + Y + ++ + + T D + TCF++
Sbjct: 306 NETDGTGGVIIDSGTTVTFLAQPAYAMVQGAFVAWVGLPRANATPSDTFDTCFKWPPPPR 365
Query: 358 E--GFPNVTFHFENS 370
P + HF+ +
Sbjct: 366 RMVTLPEMVLHFDGA 380
>gi|326524580|dbj|BAK00673.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 461
Score = 104 bits (260), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 93/354 (26%), Positives = 145/354 (40%), Gaps = 46/354 (12%)
Query: 69 RPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDS 128
RP G Y + IGTPP+ +DTGSD++W C C C L L+ S
Sbjct: 96 RPSGDLEYLIDLAIGTPPQPVSALLDTGSDLIWTQCAPCASC-----LAQPDPLFAPAAS 150
Query: 129 STGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDL 188
S+ + C + C+ + C +C Y YGDG++T G + + + SG+
Sbjct: 151 SSYVPMRCSGQLCNDILH---HSCQRPDTCTYRYNYGDGTTTLGVYATERFTFASSSGEK 207
Query: 189 QTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLA------------S 236
+ L FGCG G+L++ + GI+GFG+ S++SQL+ S
Sbjct: 208 LSV----PLGFGCGTMNVGSLNNGS-----GIVGFGRDPLSLVSQLSIRRFSYCLTPYTS 258
Query: 237 SGGVRKMFAHCLDGINGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLP 296
+ MF DG+ G A G V + ++ P Y + T V VG L +P
Sbjct: 259 TRKSTLMFGSLSDGVFEGDDAATGQVQTTRLLQSRQNPT--FYYVPFTGVTVGTRRLRIP 316
Query: 297 TDVFGVGDN--KGTIIDSGTTLAYLPEMVYEPLVSKIISQ-QPDLKVHTVHDEYTCFQYS 353
F + + G I+DSGT L P V ++ +Q + + D+ CF
Sbjct: 317 LSAFALRPDGSGGVIVDSGTALTLFPAAVLTEVLRAFRAQLRLPFTSSSSPDDGVCFATP 376
Query: 354 ESVDE---------GFPNVTFHFENSVSLKVYPHEYLF--PFEDLWCIGWQNSG 396
+ P + FHF+ + L++ Y+ P CI +SG
Sbjct: 377 MAAGGRRASAATVVSVPRMAFHFQGA-DLELPRRNYVLDDPRRGSLCILLADSG 429
>gi|125546587|gb|EAY92726.1| hypothetical protein OsI_14476 [Oryza sativa Indica Group]
Length = 530
Score = 104 bits (260), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 84/285 (29%), Positives = 132/285 (46%), Gaps = 32/285 (11%)
Query: 75 LYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKEC-PRRSSLGIELTLYDIKDSSTGKF 133
L+YA + +GTP + + V +DTGSD+ W+ C QC C P S+ + Y SST +
Sbjct: 115 LHYALVTVGTPGQTFMVALDTGSDLFWLPC-QCDGCTPPASAASGSASFYIPSMSSTSQA 173
Query: 134 VTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDG-SSTTGYFVQDVVQYDKVSGDLQTTS 192
V C+ +FC +C+ + CPY +Y +S++G+ V+DV+ + D
Sbjct: 174 VPCNSQFCEL-----RKECSTTSQCPYKMVYVSADTSSSGFLVEDVLYLS--TEDAIPQI 226
Query: 193 TNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGIN 252
++FGCG Q+G+ + A +G+ G G S+ S LA G FA C +
Sbjct: 227 LKAQILFGCGQVQTGSF--LDAAAPNGLFGLGIDMISIPSILAQKGLTSNSFAMCFSR-D 283
Query: 253 GGGIFAIGHVVQPEVNKTPL--VPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTII 310
G G + G + +TPL P P Y+I+++ + VG +L TI
Sbjct: 284 GIGRISFGDQGSSDQEETPLDVNPQHPTYTISISEMTVGNSLTDL---------EFSTIF 334
Query: 311 DSGTTLAYLPEMVYEPLVSKIISQQPDLKVHT---VHDEYTCFQY 352
D+GT+ YL + Y I+Q +VH D F+Y
Sbjct: 335 DTGTSFTYLADPAY-----TYITQSFHAQVHANRHAADSRIPFEY 374
>gi|255548662|ref|XP_002515387.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545331|gb|EEF46836.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 463
Score = 104 bits (260), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 87/310 (28%), Positives = 133/310 (42%), Gaps = 39/310 (12%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y IG+G+P KD + DTGSD+ W C + +D S++
Sbjct: 130 GTGNYIVSIGLGSPKKDLMLIFDTGSDLTWARCSAAET-------------FDPTKSTSY 176
Query: 132 KFVTCDQEFCHGVYG--GPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQ 189
V+C C V G + C A+T C Y YGDGS + G+ ++ + +
Sbjct: 177 ANVSCSTPLCSSVISATGNPSRCAAST-CVYGIQYGDGSYSIGFLGKERLT-------IG 228
Query: 190 TTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLD 249
+T + FGCG G G++G G+ S++SQ A ++F++CL
Sbjct: 229 STDIFNNFYFGCGQDVDGLFGKAA-----GLLGLGRDKLSVVSQTAPK--YNQLFSYCLP 281
Query: 250 GINGGGIFAIGHVVQPEVNKTPLVPN-QPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGT 308
+ G + G TPL Y++++T + VG L +P VF GT
Sbjct: 282 SSSSTGFLSFGSSQSKSAKFTPLSSGPSSFYNLDLTGITVGGQKLAIPLSVFSTA---GT 338
Query: 309 IIDSGTTLAYLPEMVYEPLVS---KIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTF 365
IIDSGT + LP Y L S K ++ P K ++ D TC+ +S+ P +
Sbjct: 339 IIDSGTVVTRLPPAAYSALRSAFRKAMASYPMGKPLSILD--TCYDFSKYKTIKVPKIVI 396
Query: 366 HFENSVSLKV 375
F V + V
Sbjct: 397 SFSGGVDVDV 406
>gi|356548395|ref|XP_003542587.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 525
Score = 104 bits (260), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 88/317 (27%), Positives = 137/317 (43%), Gaps = 43/317 (13%)
Query: 75 LYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSS-----LGIELTLYDIKDSS 129
L+Y I IGTP + V +D GSD++WV C C EC S+ L +L Y S+
Sbjct: 104 LHYTWIDIGTPNVSFLVALDAGSDMLWVPC-DCIECASLSAGNYNVLDRDLNQYRPSLSN 162
Query: 130 TGKFVTCDQEFC--HGVYGGPLTDCTANTSCPYLEIYGDG-SSTTGYFVQDVVQYDKVSG 186
T + + C + C H G + CPY Y +S++GY +D +
Sbjct: 163 TSRHLPCGHKLCDVHSFCKG------SKDPCPYEVQYASANTSSSGYVFEDKLHLTSDGK 216
Query: 187 DLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAH 246
+ S S+I GCG +Q+G D + DG++G G N S+ S LA +G ++ F+
Sbjct: 217 HAEQNSVQASIILGCGRKQTG--DYLHGAGPDGVLGLGPGNISVPSLLAKAGLIQNSFSI 274
Query: 247 CLDGINGGGIF--AIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGD 304
CLD G I GHV Q + TP +P + A VG+ + F VG
Sbjct: 275 CLDENESGRIIFGDQGHVTQ---HSTPFLP--------IIAYMVGV-------ESFCVGS 316
Query: 305 ------NKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDE 358
+IDSG++ +LP VY+ +V++ Q ++ C+ S
Sbjct: 317 LCLKETRFQALIDSGSSFTFLPNEVYQKVVTEFDKQVNASRIVLQSSWEYCYNASSQELV 376
Query: 359 GFPNVTFHFENSVSLKV 375
P + F + + +
Sbjct: 377 NIPPLKLAFSRNQTFLI 393
>gi|30686482|ref|NP_850251.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|122215044|sp|Q3EBM5.1|ASPR1_ARATH RecName: Full=Probable aspartic protease At2g35615; Flags:
Precursor
gi|330254036|gb|AEC09130.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 447
Score = 104 bits (259), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 106/366 (28%), Positives = 157/366 (42%), Gaps = 46/366 (12%)
Query: 50 ARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKE 109
+RR L+ DL G G ++ I IGTPP + DTGSD+ WV C C++
Sbjct: 62 SRRFNHQLSQTDLQSG---LIGADGEFFMSITIGTPPIKVFAIADTGSDLTWVQCKPCQQ 118
Query: 110 CPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSS 169
C + + ++D K SST K CD C + +N C Y YGD S
Sbjct: 119 CYKENG-----PIFDKKKSSTYKSEPCDSRNCQALSSTERGCDESNNICKYRYSYGDQSF 173
Query: 170 TTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSS 229
+ G + V D SG S G+ +FGCG G D T + + S
Sbjct: 174 SKGDVATETVSIDSASG--SPVSFPGT-VFGCGYNNGGTFDETGSGIIGLG----GGHLS 226
Query: 230 MISQLASSGGVRKMFAHCLD----GINGGGIFAIGHVVQPE-------VNKTPLVPNQP- 277
+ISQL SS + K F++CL NG + +G P V TPLV +P
Sbjct: 227 LISQLGSS--ISKKFSYCLSHKSATTNGTSVINLGTNSIPSSLSKDSGVVSTPLVDKEPL 284
Query: 278 -HYSINMTAVQVGLDFLNL------PTDVFGVGDNKGT-IIDSGTTLAYLPEMVYEPLVS 329
+Y + + A+ VG + P D + + G IIDSGTTL L ++ S
Sbjct: 285 TYYYLTLEAISVGKKKIPYTGSSYNPNDDGILSETSGNIIIDSGTTLTLLEAGFFDKFSS 344
Query: 330 KIISQQPDLKVHTVHDEY----TCFQYSESVDEGFPNVTFHFENSVSLKVYP-HEYLFPF 384
+ ++ V D CF+ S S + G P +T HF + +++ P + ++
Sbjct: 345 AV--EESVTGAKRVSDPQGLLSHCFK-SGSAEIGLPEITVHFTGA-DVRLSPINAFVKLS 400
Query: 385 EDLWCI 390
ED+ C+
Sbjct: 401 EDMVCL 406
>gi|326504502|dbj|BAJ91083.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 537
Score = 104 bits (259), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 90/307 (29%), Positives = 141/307 (45%), Gaps = 34/307 (11%)
Query: 75 LYYAKIGIGTPPKDYYVQVDTGSDIMWV--NCIQCKECPRRSSL--GIELTLYDIKDSST 130
L+YA++ +GTP + V +DTGSD+ WV +C QC S L G +L Y SST
Sbjct: 106 LHYAEVAVGTPNATFLVALDTGSDLFWVPCDCKQCAPIANASDLRGGPDLRPYSPGKSST 165
Query: 131 GKFVTCDQEFCHGVYGGPLTDCTA----NTSCPYLEIY-GDGSSTTGYFVQDVVQYDKVS 185
K VTC+ C C A +TSCPY Y +S++G V+DV+ + +
Sbjct: 166 SKAVTCEHALCERP-----NACAAAGNSSTSCPYTVRYVSANTSSSGVLVEDVLHLSREA 220
Query: 186 GDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRK-MF 244
+T+ ++ GCG Q+G + A+DG++G G S+ S L ++G V F
Sbjct: 221 AGGASTAVTAPVVLGCGQVQTGAF--LDGAAVDGLLGLGMDKVSVPSVLHAAGLVASDSF 278
Query: 245 AHCLDGINGGGIFAIGHVVQPEVNKTPLV--PNQPHYSINMTAVQVGLDFLNLPTDVFGV 302
+ C +G G G + +TP P Y+I++TA+ V V
Sbjct: 279 SMCFS-PDGFGRINFGDSGRRGQAETPFTVRNTHPTYNISVTAMSVSGK---------EV 328
Query: 303 GDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVH---TVHDEYTCFQYSESVDEG 359
I+DSGT+ YL + Y L + S+ + + + ++ EY C++ E
Sbjct: 329 AAEFAAIVDSGTSFTYLNDPAYTELATGFNSEVRERRANLSASIPFEY-CYELGRGQTEL 387
Query: 360 F-PNVTF 365
F P V+
Sbjct: 388 FVPEVSL 394
>gi|225423917|ref|XP_002281973.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 491
Score = 104 bits (259), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 99/380 (26%), Positives = 158/380 (41%), Gaps = 62/380 (16%)
Query: 40 RSLSLLK-EHDARRQQRILAGVDLPLGGSSRPD-----------------------GVGL 75
+SL L + E D+ R + + +DL + G ++ D G G
Sbjct: 95 KSLVLARLERDSDRVRSLATRMDLAIAGITKSDLKPVEKELEAEALETPLVSGASQGSGE 154
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y++++GIG+PPK Y+ VDTGSD+ WV C C +C +++ +++ SS+ +T
Sbjct: 155 YFSRVGIGSPPKHVYMVVDTGSDVNWVQCAPCADCYQQAD-----PIFEPSFSSSYAPLT 209
Query: 136 CDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNG 195
C+ C + +++C N SC Y YGDGS T G F + + D +++
Sbjct: 210 CETHQCKSL---DVSECR-NDSCLYEVSYGDGSYTVGDFATETITLDG-------SASLN 258
Query: 196 SLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL--DGING 253
++ GCG G G + S SQ+ +S F++CL +
Sbjct: 259 NVAIGCGHDNEGLFVGAAGLLGL-----GGGSLSFPSQINASS-----FSYCLVNRDTDS 308
Query: 254 GGIFAIGHVVQPEVNKTPLVPNQ---PHYSINMTAVQVGLDFLNLPTDVFGVGD--NKGT 308
+ PL+ N Y + MT + VG L++P F V + N G
Sbjct: 309 ASTLEFNSPIPSHSVTAPLLRNNQLDTFYYLGMTGIGVGGQMLSIPRSSFEVDESGNGGI 368
Query: 309 IIDSGTTLAYLPEMVYEPLVSKII---SQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTF 365
I+DSGT + L VY L + P + D TC+ S P V+F
Sbjct: 369 IVDSGTAVTRLQSDVYNSLRDSFVRGTQHLPSTSGVALFD--TCYDLSSRSSVEVPTVSF 426
Query: 366 HFENSVSLKVYPHEYLFPFE 385
HF + L + YL P +
Sbjct: 427 HFPDGKYLALPAKNYLIPVD 446
>gi|125563957|gb|EAZ09337.1| hypothetical protein OsI_31609 [Oryza sativa Indica Group]
gi|125605916|gb|EAZ44952.1| hypothetical protein OsJ_29595 [Oryza sativa Japonica Group]
Length = 438
Score = 104 bits (259), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 81/315 (25%), Positives = 137/315 (43%), Gaps = 38/315 (12%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKF 133
G Y +GIG+PP+ + +DTGSD++W C C C + + ++ S++
Sbjct: 83 GEYLMDVGIGSPPRYFSAMIDTGSDLIWTQCAPCLLCVEQPT-----PYFEPAKSTSYAS 137
Query: 134 VTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTST 193
+ C C+ +Y PL C N +C Y YGD +S+ G + + S +
Sbjct: 138 LPCSSAMCNALY-SPL--CFQN-ACVYQAFYGDSASSAGVLANETFTFGTNSTRVAVPRV 193
Query: 194 NGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLAS-----------SGGVRK 242
+ FGCG +G L + + G++GFG+ S++SQL S S +
Sbjct: 194 S----FGCGNMNAGTLFNGS-----GMVGFGRGALSLVSQLGSPRFSYCLTSFMSPATSR 244
Query: 243 MFAHCLDGINGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGV 302
++ +N + G V P +P Y +NMT + V D L + VF +
Sbjct: 245 LYFGAYATLNSTNTSSSGPVQSTPFIVNPALPTM--YFLNMTGISVAGDLLPIDPSVFAI 302
Query: 303 GDNKGT---IIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVH-TVHDEY-TCFQYSESVD 357
+ GT IIDSGTT+ +L + Y + ++ + + T D + TCF++
Sbjct: 303 NETDGTGGVIIDSGTTVTFLAQPAYAMVQGAFVAWVGLPRANATPSDTFDTCFKWPPPPR 362
Query: 358 E--GFPNVTFHFENS 370
P + HF+ +
Sbjct: 363 RMVTLPEMVLHFDGA 377
>gi|449439393|ref|XP_004137470.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
gi|449486840|ref|XP_004157418.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 570
Score = 104 bits (259), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 98/339 (28%), Positives = 148/339 (43%), Gaps = 36/339 (10%)
Query: 62 LPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNC-IQCKECPRRSSLGIEL 120
P+ G PDG LYY I +G PP+ Y++ +DTGSD+ WV C C C + S
Sbjct: 187 FPVRGDIYPDG--LYYTYIMVGEPPRPYFLDIDTGSDLTWVQCDAPCSSCGKGRS----- 239
Query: 121 TLYDIKDSSTGKFVTCDQEFCHGV---YGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQD 177
LY + + V+ C V Y G C A C Y Y D SS+ G V+D
Sbjct: 240 PLYKPRRENV---VSFKDSLCMEVQRNYDG--DQCAACQQCNYEVQYADQSSSLGVLVKD 294
Query: 178 VVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASS 237
+G L T + IFGC Q G L +T + DGI+G ++ S+ SQLAS
Sbjct: 295 EFTLRFSNGSL----TKLNAIFGCAYDQQGLLLNTLSKT-DGILGLSRAKVSLPSQLASR 349
Query: 238 GGVRKMFAHCLDG-INGGGIFAIGHVVQPEVNKTPL-VPNQPHYSINMTAVQVGLDFLNL 295
G + + HCL G GGG +G P+ + + + P T V V +D+ ++
Sbjct: 350 GIINNVVGHCLTGDPAGGGYLFLGDDFVPQWGMAWVAMLDSPSIDFYQTKV-VRIDYGSI 408
Query: 296 PTDVFGVGDNKGTII-DSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSE 354
P + G ++ ++ DSG++ Y + Y LV+ + + + C++ +
Sbjct: 409 PLSLDTWGSSREQVVFDSGSSYTYFTKEAYYQLVANLEEVSAFGLILQDSSDTICWKTEQ 468
Query: 355 S------VDEGFPNVTFHFEN-----SVSLKVYPHEYLF 382
S V F +T F + S L + P YL
Sbjct: 469 SIRSVKDVKHFFKPLTLQFGSRFWLVSTKLVILPENYLL 507
>gi|413952720|gb|AFW85369.1| hypothetical protein ZEAMMB73_571116 [Zea mays]
Length = 451
Score = 104 bits (259), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 94/330 (28%), Positives = 141/330 (42%), Gaps = 37/330 (11%)
Query: 63 PLGGSSRPDGVGLYYAKIGIGTP-PKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELT 121
P+ ++ P G Y IGTP P+ + +DTGSD++W C C C
Sbjct: 75 PVTATAVPSS-GEYLIHFNIGTPRPQRVALTMDTGSDLVWTQCTPCPVC-----FDQPFP 128
Query: 122 LYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANT-SCPYLEIYGDGSSTTGYFVQDVVQ 180
L+D SST + V C C G ++ C T C YL YGD S T GY +D
Sbjct: 129 LFDPSVSSTFRAVACPDPICRPSSGLSVSACALKTFRCFYLCSYGDKSITAGYIFKDTFT 188
Query: 181 YDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGV 240
+ +G+ L FGCG +G S NE GI GFG+ S+ SQL
Sbjct: 189 FMSPNGEGAPPVAVSGLAFGCGDYNTGVFAS-NES---GIAGFGRGPLSLPSQLRVG--- 241
Query: 241 RKMFAHCL---DGINGGGIFAIGHVVQPE---------VNKTPLV--PNQP-HYSINMTA 285
F++CL D A+ P TP++ P+ P Y +++
Sbjct: 242 --RFSYCLTSHDETESNKTSAVFLGTPPNGLRAHSSGPFRSTPIIHSPSFPTFYYLSLEG 299
Query: 286 VQVGLDFLNLPTDVFGVGDN--KGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTV 343
+ VG L + + VF + + GT+IDSGT + P V+E L ++ ++Q P +
Sbjct: 300 ITVGKTRLPVDSSVFALKKDGSGGTVIDSGTGVTTFPAAVFEQLKNEFVAQLPLPRYDNT 359
Query: 344 HD--EYTCFQYSESVDE-GFPNVTFHFENS 370
+ CFQ + + P + FH ++
Sbjct: 360 SEVGNLLCFQRPKGGKQVPVPKLIFHLASA 389
>gi|18400416|ref|NP_565559.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|20197296|gb|AAM15014.1| predicted protein [Arabidopsis thaliana]
gi|330252412|gb|AEC07506.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 458
Score = 104 bits (259), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 98/392 (25%), Positives = 167/392 (42%), Gaps = 42/392 (10%)
Query: 39 ERSLSLLKEHDARRQQRILAGVDLPLGGSS------RPDGVGLYYAKIGIGTPPKDYYVQ 92
E + L + + R + + +D LG S+ + L+ +G PP
Sbjct: 53 EDHIKHLTDISSARFKYLQNSIDKELGSSNFQVDVEQAIKTSLFLVNFSVGQPPVPQLTI 112
Query: 93 VDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDC 152
+DTGS ++W+ C CK C SS + +++ SST +CD FC G C
Sbjct: 113 MDTGSSLLWIQCQPCKHC---SSDHMIHPVFNPALSSTFVECSCDDRFCRYAPNG---HC 166
Query: 153 TANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDST 212
++ C Y ++Y G+ + G ++ + + +G+ T + FGCG L+S
Sbjct: 167 GSSNKCVYEQVYISGTGSKGVLAKERLTFTTPNGN---TVVTQPIAFGCGYENGEQLES- 222
Query: 213 NEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGIN----GGGIFAIGHVVQPEVN 268
GI+G G +S+ QL S F++C+ + G +G +
Sbjct: 223 ---HFTGILGLGAKPTSLAVQLGSK------FSYCIGDLANKNYGYNQLVLGEDADILGD 273
Query: 269 KTPLVPNQPH--YSINMTAVQVGLDFLNLPTDVFG-VGDNKGTIIDSGTTLAYLPEMVYE 325
TP+ + Y +N+ + VG LN+ VF G G I+DSGT +L ++ Y
Sbjct: 274 PTPIEFETENSIYYMNLEGISVGDTQLNIEPVVFKRRGPRTGVILDSGTLYTWLADIAYR 333
Query: 326 PLVSKIIS-QQPDLKVHTVHDEYTCF--QYSESVDEGFPNVTFHFENSVSLKVYPHEYLF 382
L ++I S P L+ D + C+ + SE + GFP VTFHF L + +
Sbjct: 334 ELYNEIKSILDPKLERFWFRD-FLCYHGRVSEELI-GFPVVTFHFAGGAELAMEATSMFY 391
Query: 383 PFED-----LWCIGWQNSGMQSRDRKNMTLLG 409
P + ++C+ + + + K T +G
Sbjct: 392 PLSEPNTFNVFCMSVKPTKEHGGEYKEFTAIG 423
>gi|414876414|tpg|DAA53545.1| TPA: hypothetical protein ZEAMMB73_483039 [Zea mays]
Length = 506
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 87/336 (25%), Positives = 144/336 (42%), Gaps = 42/336 (12%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y++++GIG+P + Y+ +DTGSD+ WV C C +C ++S ++D S++
Sbjct: 162 GSGEYFSRVGIGSPARQLYMVLDTGSDVTWVQCQPCADCYQQSD-----PVFDPSLSASY 216
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
V+CD + C + + T +C Y YGDGS T G F + + L +
Sbjct: 217 AAVSCDSQRCRDLDTAACRNATG--ACLYEVAYGDGSYTVGDFATETLT-------LGDS 267
Query: 192 STNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI 251
+ G++ GCG G G S SQ+++S F++CL
Sbjct: 268 TPVGNVAIGCGHDNEGLFVGAAGLLALGGGPL-----SFPSQISAS-----TFSYCLVDR 317
Query: 252 N---------GGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGV 302
+ G G G V P V ++P Y + ++ + VG L++P F +
Sbjct: 318 DSPAASTLQFGDGAAEAGTVTAPLV-RSPRTST--FYYVALSGISVGGQPLSIPASAFAM 374
Query: 303 GDNKGT---IIDSGTTLAYLPEMVYEPLVSKIISQQPDL-KVHTVHDEYTCFQYSESVDE 358
G+ I+DSGT + L Y L + P L + V TC+ S+
Sbjct: 375 DATSGSGGVIVDSGTAVTRLQSAAYAALRDAFVQGAPSLPRTSGVSLFDTCYDLSDRTSV 434
Query: 359 GFPNVTFHFENSVSLKVYPHEYLFPFEDL--WCIGW 392
P V+ FE +L++ YL P + +C+ +
Sbjct: 435 EVPAVSLRFEGGGALRLPAKNYLIPVDGAGTYCLAF 470
>gi|326520291|dbj|BAK07404.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 464
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 102/360 (28%), Positives = 151/360 (41%), Gaps = 50/360 (13%)
Query: 44 LLKEHDARRQQRILA--------GVDLPLGGSSRPDGVGLYYAKIGIGTP-PKDYYVQVD 94
LL+ AR + R+ + + P+ G Y +GIGTP P+ + +D
Sbjct: 54 LLRRMVARSKARLASLRSSACDTALTAPVDHGGSDVGSSEYLIHLGIGTPRPQRVVLHLD 113
Query: 95 TGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFC-HGVYGGPLTDCT 153
TGSD++W C C C + ++ S T V C C H VY PL+ C
Sbjct: 114 TGSDLVWTQC-ACTVC-----FDQPVPVFRASVSHTFSRVPCSDPLCGHAVYL-PLSGCA 166
Query: 154 A-NTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDST 212
A + SC Y Y D S TTG +D + K T + ++ FGCG G L +
Sbjct: 167 ARDRSCFYAYGYMDHSITTGKMAEDTFTF-KAPDRADTAAAVPNIRFGCGMMNYG-LFTP 224
Query: 213 NEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHVVQPE------ 266
N+ GI GFG S+ SQL VR+ F++C + + + +PE
Sbjct: 225 NQS---GIAGFGTGPLSLPSQLK----VRR-FSYCFTAMEESRVSPVILGGEPENIEAHA 276
Query: 267 ---VNKTPLVP--------NQPHYSINMTAVQVGLDFLNLPTDVFGV-GDNKG-TIIDSG 313
+ TP P +QP Y +++ V VG L F + GD G T IDSG
Sbjct: 277 TGPIQSTPFAPGPAGAPVGSQPFYFLSLRGVTVGETRLPFNASTFALKGDGSGGTFIDSG 336
Query: 314 TTLAYLPEMVYEPLVSKIISQ--QPDLKVHTVHDEYTCFQY-SESVDEGFPNVTFHFENS 370
T + + P+ V+ L ++Q P K +T D CF ++ P + H E +
Sbjct: 337 TAITFFPQAVFRSLREAFVAQVPLPVAKGYTDPDNLLCFSVPAKKKAPAVPKLILHLEGA 396
>gi|224093400|ref|XP_002309912.1| predicted protein [Populus trichocarpa]
gi|222852815|gb|EEE90362.1| predicted protein [Populus trichocarpa]
Length = 382
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 92/335 (27%), Positives = 143/335 (42%), Gaps = 35/335 (10%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y+ +IG+G+PP+ Y+ +D+GSDI+WV C C +C ++ L+D DS++
Sbjct: 39 GSGEYFVRIGLGSPPRSQYMVIDSGSDIVWVQCKPCTQCYHQTD-----PLFDPADSASF 93
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
V+C C V C + C Y YGDGS T G + + + + T
Sbjct: 94 MGVSCSSAVCDRVEN---AGCNSG-RCRYEVSYGDGSYTKGTLALETLTFGR------TV 143
Query: 192 STNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL--D 249
N + GCG G G + S + QL SG F++CL
Sbjct: 144 VRN--VAIGCGHSNRGMFVGAAGLLGL-----GGGSMSFMGQL--SGQTGNAFSYCLVSR 194
Query: 250 GINGGGIFAIGHVVQP-EVNKTPLV--PNQP-HYSINMTAVQVGLDFLNLPTDVFGVGD- 304
G N G G P PLV P P Y I + + VG + + DVF + +
Sbjct: 195 GTNTNGFLEFGSEAMPVGAAWIPLVRNPRAPSFYYIRLLGLGVGDTRVPVSEDVFQLNEL 254
Query: 305 -NKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDL-KVHTVHDEYTCFQYSESVDEGFPN 362
+ G ++D+GT + P + YE + I Q +L + V TC+ + P
Sbjct: 255 GSGGVVMDTGTAVTRFPTVAYEAFRNAFIEQTQNLPRASGVSIFDTCYNLFGFLSVRVPT 314
Query: 363 VTFHFENSVSLKVYPHEYLFPFEDL--WCIGWQNS 395
V+F+F L + + +L P +D +C + S
Sbjct: 315 VSFYFSGGPILTIPANNFLIPVDDAGTFCFAFAPS 349
>gi|224096119|ref|XP_002310541.1| predicted protein [Populus trichocarpa]
gi|222853444|gb|EEE90991.1| predicted protein [Populus trichocarpa]
Length = 379
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 98/347 (28%), Positives = 134/347 (38%), Gaps = 48/347 (13%)
Query: 55 RILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNC----IQCKEC 110
R+ + + LPL G+ P G Y + IG P K Y++ VDTGSD+ W+ C QC E
Sbjct: 1 RVPSSIVLPLHGNVYP--TGFYNVTLNIGQPSKPYFLDVDTGSDLTWLQCDVPRAQCTEA 58
Query: 111 PRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSST 170
P + V C C ++ G C C Y Y DG S+
Sbjct: 59 PHPYY------------KPSNNLVACKDPICQSLHTGGDQRCENPGQCDYEVEYADGGSS 106
Query: 171 TGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSM 230
G V+D + S Q+ L CG Q L +DG++G G+ S+
Sbjct: 107 LGVLVKDAFNLNFTSEKRQSPLLALGL---CGYDQ---LPGGTYHPIDGVLGLGRGKPSI 160
Query: 231 ISQLASSGGVRKMFAHCLDGINGGGIFAIGHVV-QPEVNKTPLVPNQPHYSINMTAVQVG 289
+SQL+ G VR + HCL G GG +F + V TP+ PN HYS
Sbjct: 161 VSQLSGLGLVRNVIGHCLSGRGGGFLFFGDDLYDSSRVAWTPMSPNAKHYSPG------- 213
Query: 290 LDFLNLPTDVFGVG-DNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYT 348
F L D G N DSG + YL VY+ L+S I + + D+ T
Sbjct: 214 --FAELTFDGKTTGFKNLIVAFDSGASYTYLNSQVYQGLISLIKRELSTKPLREALDDQT 271
Query: 349 C---------FQYSESVDEGFPNVTFHFEN----SVSLKVYPHEYLF 382
F+ V + F F N L+ P YL
Sbjct: 272 LPICWKGRKPFKSVRDVKKYFKTFALSFANDGKSKTQLEFPPEAYLI 318
>gi|225216960|gb|ACN85252.1| aspartic proteinase nepenthesin-1 precursor [Oryza officinalis]
Length = 519
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 101/328 (30%), Positives = 133/328 (40%), Gaps = 54/328 (16%)
Query: 67 SSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQC-KECPRRSSLGIELTLYDI 125
S R G G Y +G+GTP Y V DTGSD WV C C C + L+D
Sbjct: 171 SGRALGTGNYVVTVGLGTPVSRYTVVFDTGSDTTWVQCQPCVVVCYEQRE-----KLFDP 225
Query: 126 KDSSTGKFVTCDQEFC-----HGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVV- 179
SST V+C C HG GG C Y YGDGS + G+F D +
Sbjct: 226 ARSSTYANVSCAAPACSDLNIHGCSGG---------HCLYGVQYGDGSYSIGFFAMDTLT 276
Query: 180 --QYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASS 237
YD V G FGCG R G E A G++G G+ +S+ Q
Sbjct: 277 LSSYDAVKG----------FRFGCGERNEGLF---GEAA--GLLGLGRGKTSLPVQTYDK 321
Query: 238 -GGVRKMFAHCLDGINGGGIF-----AIGHVVQPEVNKTPLVPNQP-HYSINMTAVQVGL 290
GGV FAHCL + G + + L N P Y + MT ++VG
Sbjct: 322 YGGV---FAHCLPARSTGTGYLDFGAGSLAAASARLTTPMLTDNGPTFYYVGMTGIRVGG 378
Query: 291 DFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPL---VSKIISQQPDLKVHTVHDEY 347
L++P VF GTI+DSGT + LP Y L + ++ + K V
Sbjct: 379 QLLSIPQSVFA---TAGTIVDSGTVITRLPPAAYSSLRYAFAAAMAARGYKKAPAVSLLD 435
Query: 348 TCFQYSESVDEGFPNVTFHFENSVSLKV 375
TC+ ++ P V+ F+ L V
Sbjct: 436 TCYDFTGMSQVAIPTVSLLFQGGARLDV 463
>gi|357124567|ref|XP_003563970.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 395
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 81/268 (30%), Positives = 118/268 (44%), Gaps = 26/268 (9%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNC-IQCKECPRRSSLGIELTLYDIKDSSTGKFV 134
YY I IG PP+ Y++ +DTGSD W++C C C + + + + GK V
Sbjct: 16 YYTSINIGNPPRPYFLDIDTGSDFTWIHCDAPCTNCTKGP--------HPVYKPTEGKIV 67
Query: 135 TCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTN 194
C + G C C Y Y D SS+ G +D +Q G+++ N
Sbjct: 68 HPRDPLCEELQGN-QNYCETCKQCDYEITYADRSSSKGVLARDNMQLTTADGEMK----N 122
Query: 195 GSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL--DGIN 252
+FGC Q G L + + DGI+G S+ +QLA+SG + +F HC+ D +
Sbjct: 123 VDFVFGCAHNQQGKLLDS-PTSTDGILGLSNGAISLSTQLANSGIISNVFGHCMATDPSS 181
Query: 253 GGGIFAIGHVVQPEVNKTPLVP--NQPH--YSINMTAVQVGLDFLNLPTDVFGVGDNKGT 308
GG +F +G P T VP N P YS + V G LNL G
Sbjct: 182 GGYMF-LGDDYVPRWGMT-WVPIRNGPGNVYSTEVPKVNYGAQELNLRGQ---AGKLTQV 236
Query: 309 IIDSGTTLAYLPEMVYEPLVSKIISQQP 336
I DSG++ Y P +Y L++ + P
Sbjct: 237 IFDSGSSYTYFPHEIYTNLIALLEDASP 264
>gi|225436202|ref|XP_002271145.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 440
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 115/424 (27%), Positives = 176/424 (41%), Gaps = 60/424 (14%)
Query: 1 MGLCLRNCLCIVLIATAAVGGVSSNHGVFSVKYRYAGRERS-LSLLKEHDARRQQRILAG 59
+GL + + ++ A + +G FS+ + +S L E A R R
Sbjct: 7 LGLLFSIVIALSFVSVAHISAAEVKNGRFSIDLIHRDSPKSPLYNPSETPAERLDRFFRR 66
Query: 60 VDLPLGGSSRPD--------GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECP 111
S P+ G Y KI IGTPP D Y DTGSD+MW C+ C C
Sbjct: 67 FMSFSEASISPNTPEPPVSSNNGEYLMKISIGTPPFDVYGIYDTGSDLMWTQCLPCLSCY 126
Query: 112 RRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTS----CPYLEIYGDG 167
++ + ++D S++ K V+C+ + C L D + + C + YGDG
Sbjct: 127 KQKN-----PMFDPSKSTSFKEVSCESQQCR------LLDTVSCSQPQKLCDFSYGYGDG 175
Query: 168 SSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSN 227
S G + + + SG Q TS +++FGCG SG NE + G+ G G
Sbjct: 176 SLAQGVIATETLTLNSNSG--QPTSIL-NIVFGCGHNNSGTF---NENEM-GLFGTGGRP 228
Query: 228 SSMISQLASSGGVRKMFAHCL------DGINGGGIFAI-GHVVQPEVNKTPLVP--NQPH 278
S+ SQ+ S+ G + F+ CL I IF V +V TPLV + +
Sbjct: 229 LSLTSQIMSTLGSGRKFSQCLVPFRTDPSITSKIIFGPEAEVSGSDVVSTPLVTKDDPTY 288
Query: 279 YSINMTAVQVGLDFLNLPTDVFGVGDNKGTI-IDSGTTLAYLPEMVYEPLVSKIIS---- 333
Y + + + VG P KG + ID+GT LP Y LV +
Sbjct: 289 YFVTLDGISVGDKL--FPFSSSSPMATKGNVFIDAGTPPTLLPRDFYNRLVQGVKEAIPM 346
Query: 334 ---QQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENS-VSLKVYPHEYLFPFEDLWC 389
Q PDL+ C++ + +D P +T HF+ + V LK + ++ P E ++C
Sbjct: 347 EPVQDPDLQPQ------LCYRSATLIDG--PILTAHFDGADVQLKPL-NTFISPKEGVYC 397
Query: 390 IGWQ 393
Q
Sbjct: 398 FAMQ 401
>gi|449462553|ref|XP_004149005.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 418
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 86/306 (28%), Positives = 142/306 (46%), Gaps = 38/306 (12%)
Query: 82 IGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFC 141
IGTPP DY DTGSD+ W C+ C +C ++ +++ S++ V C+ + C
Sbjct: 86 IGTPPVDYLGIADTGSDLTWAQCLPCLKCYQQLR-----PIFNPLKSTSFSHVPCNTQTC 140
Query: 142 HGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGC 201
H V G C C Y YGD + + G + ++K++ S++ + GC
Sbjct: 141 HAVDDG---HCGVQGVCDYSYTYGDRTYSKGD-----LGFEKIT----IGSSSVKSVIGC 188
Query: 202 GARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI----NGGGIF 257
G SG + G+IG G S++SQ++ + G+ + F++CL + NG F
Sbjct: 189 GHASSGGFGFAS-----GVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHANGKINF 243
Query: 258 AIGHVVQ-PEVNKTPLVPNQ--PHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGT 314
VV P V TPL+ +Y I + A+ +G N F N IIDSGT
Sbjct: 244 GQNAVVSGPGVVSTPLISKNTVTYYYITLEAISIG----NERHMAFAKQGN--VIIDSGT 297
Query: 315 TLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEY-TCFQ--YSESVDEGFPNVTFHFENSV 371
TL++LP+ +Y+ +VS ++ +V + + CF + + G P +T F
Sbjct: 298 TLSFLPKELYDGVVSSLLKVVKAKRVKDPGNFWDLCFDDGINVATSSGIPIITAQFSGGA 357
Query: 372 SLKVYP 377
++ + P
Sbjct: 358 NVNLLP 363
>gi|409179878|gb|AFV26024.1| aspartic proteinase nepenthesin 1 [Nepenthes mirabilis]
Length = 437
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 100/379 (26%), Positives = 164/379 (43%), Gaps = 48/379 (12%)
Query: 47 EHDARRQQRILAGVDLPLGGSSRP--DGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNC 104
E +RR QR+ A ++ P G P G G Y + IGTP + + +DTGSD++W C
Sbjct: 65 ERGSRRLQRLEAMLNGP-SGVETPVYAGDGEYLMNLSIGTPAQPFSAIMDTGSDLIWTQC 123
Query: 105 IQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIY 164
C +C +S+ +++ + SS+ + C + C + + +N SC Y Y
Sbjct: 124 QPCTQCFNQST-----PIFNPQGSSSFSTLPCSSQLCQALQ----SPTCSNNSCQYTYGY 174
Query: 165 GDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFG 224
GDGS T G + + + VS ++ FGCG G + G++G G
Sbjct: 175 GDGSETQGSMGTETLTFGSVSIP--------NITFGCGENNQG----FGQGNGAGLVGMG 222
Query: 225 KSNSSMISQLASSGGVRKMFAHCLDGINGGG-----IFAIGHVVQPEVNKTPLVPNQ--- 276
+ S+ SQL V K F++C+ I + ++ + V T L+ +
Sbjct: 223 RGPLSLPSQL----DVTK-FSYCMTPIGSSTSSTLLLGSLANSVTAGSPNTTLIESSQIP 277
Query: 277 PHYSINMTAVQVGLDFLNLPTDVFGVGDNKGT---IIDSGTTLAYLPEMVYEPLVSKIIS 333
Y I + + VG L + VF + N GT IIDSGTTL Y + Y+ + IS
Sbjct: 278 TFYYITLNGLSVGSTPLPIDPSVFKLNSNNGTGGIIIDSGTTLTYFADNAYQAVRQAFIS 337
Query: 334 QQPDLKVHTVHDEY-TCFQY-SESVDEGFPNVTFHFENSVSLKVYPHEYLFPFEDLWCIG 391
Q V+ + CFQ S+ + P HF+ + + ++ P L C+
Sbjct: 338 QMNLSVVNGSSSGFDLCFQMPSDQSNLQIPTFVMHFDGGDLVLPSENYFISPSNGLICLA 397
Query: 392 WQNSGMQSRDRKNMTLLGD 410
+S + M++ G+
Sbjct: 398 MGSS------SQGMSIFGN 410
>gi|357143854|ref|XP_003573079.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 417
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 106/393 (26%), Positives = 169/393 (43%), Gaps = 41/393 (10%)
Query: 10 CIVLIATAAVGGVSSNHGVFSVKYRYAGRERSLSLLKEHDARRQQRILAGVDLPLGGSSR 69
C+VL+ + AV S + G ++ L++ R + + L+G D S R
Sbjct: 3 CLVLLTSLAVSAPSGYRLALTHVDSKIGFTKT-ELMRRAAHRSRLQALSGYD---ANSPR 58
Query: 70 PDGVGL-YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDS 128
V + Y ++ IGTPP + DTGSD+ W C CK C + +YD S
Sbjct: 59 LHSVQVEYLMELAIGTPPVPFVALADTGSDLTWTQCQPCKLC-----FPQDTPVYDPSAS 113
Query: 129 STGKFVTCDQEFCHGVYGGPLTDCTANTS-CPYLEIYGDGSSTTGYFVQDVVQY-DKVSG 186
ST V C C + +C+ +S C Y+ Y DG+ + G + + V G
Sbjct: 114 STFSPVPCSSATCLPTWRS--RNCSNPSSPCRYIYSYSDGAYSVGILGTETLTIGSSVPG 171
Query: 187 DLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAH 246
QT S GS+ FGCG G DS N G +G G+ S+++QL GV K F++
Sbjct: 172 --QTVSV-GSVAFGCGTDNGG--DSLNST---GTVGLGRGTLSLLAQL----GVGK-FSY 218
Query: 247 CLDGINGGGI---FAIGHVVQPE-----VNKTPLVP---NQPHYSINMTAVQVGLDFLNL 295
CL + F +G + + V TPL+ N Y +N+ + +G L +
Sbjct: 219 CLTDFFNSTMDSPFFLGTLAELAPGPGTVQSTPLLQSPLNPSRYFVNLQGISLGDVRLPI 278
Query: 296 PTDVFGV--GDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYS 353
P F + N G ++DSGTT L + + +V ++ V+ + CF S
Sbjct: 279 PNGTFDLRADGNGGMMVDSGTTFTILAKSGFREVVDRVAQLLGQPPVNASSLDSPCFP-S 337
Query: 354 ESVDEGFPNVTFHFENSVSLKVYPHEYLFPFED 386
+ P++ HF ++++ Y+ ED
Sbjct: 338 PDGEPFMPDLVLHFAGGADMRLHRDNYMSYNED 370
>gi|147814824|emb|CAN65806.1| hypothetical protein VITISV_015630 [Vitis vinifera]
Length = 449
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 91/345 (26%), Positives = 154/345 (44%), Gaps = 36/345 (10%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNC---IQCKECPRRSSLGIE-LTLYDIKD 127
G+G Y+ +GTP + + + DTGSD+ W++C + + C R + I ++
Sbjct: 79 GIGQYFVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRVFHANL 138
Query: 128 SSTGKFVTCDQEFCHGVYGG--PLTDC-TANTSCPYLEIYGDGSSTTGYFVQDVVQYDKV 184
SS+ K + C + C LT+C T T C Y Y DGS+ G+F + V +
Sbjct: 139 SSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANETVTVELK 198
Query: 185 SGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMF 244
G +++ GC S + + +A DG++G G S S + A G + F
Sbjct: 199 EGRKMKLH---NVLIGC----SESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGGK--F 249
Query: 245 AHCL----DGINGGGIFAIGHVVQPE-----VNKTPLVPNQPH--YSINMTAVQVGLDFL 293
++CL N G E + T LV + Y++NM + +G L
Sbjct: 250 SYCLVDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAML 309
Query: 294 NLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHD-----EYT 348
+P++V+ V GTI+DSG++L +L E Y+P+++ + + LK V EY
Sbjct: 310 KIPSEVWDVKGAGGTILDSGSSLTFLTEPAYQPVMAAL--RVSLLKFRKVEMDIGPLEY- 366
Query: 349 CFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFED-LWCIGW 392
CF + + P + FHF + + Y+ D + C+G+
Sbjct: 367 CFNSTGFEESLVPRLVFHFADGAEFEPPVKSYVISAADGVRCLGF 411
>gi|413921849|gb|AFW61781.1| hypothetical protein ZEAMMB73_702843 [Zea mays]
Length = 379
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 87/308 (28%), Positives = 140/308 (45%), Gaps = 44/308 (14%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKF 133
G Y + IGTPP Y +DTGSD++W C C C + + +D+K S+T +
Sbjct: 87 GEYLVDLAIGTPPLYYTAIMDTGSDLIWTQCAPCLLCADQPT-----PYFDVKKSATYRA 141
Query: 134 VTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTST 193
+ C C + C C Y YGD +ST G + + + + +T
Sbjct: 142 LPCRSSRCASLSS---PSCFKKM-CVYQYYYGDTASTAGVLANETFTFG-AANSTKVRAT 196
Query: 194 NGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGING 253
N + FGCG+ +G+L +++ G++GFG+ S++SQL S F++CL
Sbjct: 197 N--IAFGCGSLNAGDLANSS-----GMVGFGRGPLSLVSQLGPS-----RFSYCLTSYLS 244
Query: 254 G-------GIFA----IGHVVQPEVNKTPLV--PNQPH-YSINMTAVQVGLDFLNLPTDV 299
G++A V TP V P P+ Y +++ A+ +G L + V
Sbjct: 245 ATPSRLYFGVYANLSSTNTSSGSPVQSTPFVINPALPNMYFLSLKAISLGTKLLPIDPLV 304
Query: 300 FGVGDN--KGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEY-TCFQYSESV 356
F + D+ G IIDSGT++ +L + YE + ++S P ++ TCFQ+
Sbjct: 305 FAINDDGTGGVIIDSGTSITWLQQDAYEAVRRGLVSAIPLTAMNDTDIGLDTCFQWPPP- 363
Query: 357 DEGFPNVT 364
PNVT
Sbjct: 364 ----PNVT 367
>gi|293332735|ref|NP_001168472.1| uncharacterized protein LOC100382248 [Zea mays]
gi|223948487|gb|ACN28327.1| unknown [Zea mays]
Length = 434
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 83/268 (30%), Positives = 121/268 (45%), Gaps = 38/268 (14%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQC-KECPRRSSLGIELTLYDIKDSST 130
G G Y + +GTP + + V DTGSD WV C C C R+ L+D S+T
Sbjct: 92 GTGNYVVPVRLGTPAERFTVVFDTGSDTTWVQCQPCVAYCYRQKE-----PLFDPTKSAT 146
Query: 131 GKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQD--VVQYDKVSGDL 188
++C +C +Y ++ C+ C Y YGDGS T G++ QD + YD +
Sbjct: 147 YANISCSSSYCSDLY---VSGCSGG-HCLYGIQYGDGSYTIGFYAQDTLTLAYDTIK--- 199
Query: 189 QTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASS-GGVRKMFAHC 247
+ FGCG + G G++G G+ +S+ Q GGV FA+C
Sbjct: 200 -------NFRFGCGEKNRGLFGRAA-----GLLGLGRGKTSLPVQAYDKYGGV---FAYC 244
Query: 248 LDGINGGGIFAIGHVVQPEVNK--TPLVPNQ--PHYSINMTAVQVGLDFLNLPTDVFGVG 303
L + G F P N TP++ ++ Y + MT ++VG L +P VF
Sbjct: 245 LPATSAGTGFLDLGPGAPAANARLTPMLVDRGPTFYYVGMTGIKVGGHVLPIPGSVF--- 301
Query: 304 DNKGTIIDSGTTLAYLPEMVYEPLVSKI 331
GT++DSGT + LP Y PL S
Sbjct: 302 STAGTLVDSGTVITRLPPSAYAPLRSAF 329
>gi|413943688|gb|AFW76337.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
gi|413943689|gb|AFW76338.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
Length = 499
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 83/268 (30%), Positives = 121/268 (45%), Gaps = 38/268 (14%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQC-KECPRRSSLGIELTLYDIKDSST 130
G G Y + +GTP + + V DTGSD WV C C C R+ L+D S+T
Sbjct: 157 GTGNYVVPVRLGTPAERFTVVFDTGSDTTWVQCQPCVAYCYRQKE-----PLFDPTKSAT 211
Query: 131 GKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQD--VVQYDKVSGDL 188
++C +C +Y ++ C+ C Y YGDGS T G++ QD + YD +
Sbjct: 212 YANISCSSSYCSDLY---VSGCSGG-HCLYGIQYGDGSYTIGFYAQDTLTLAYDTIK--- 264
Query: 189 QTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASS-GGVRKMFAHC 247
+ FGCG + G G++G G+ +S+ Q GGV FA+C
Sbjct: 265 -------NFRFGCGEKNRGLFGRAA-----GLLGLGRGKTSLPVQAYDKYGGV---FAYC 309
Query: 248 LDGINGGGIFAIGHVVQPEVNK--TPLVPNQ--PHYSINMTAVQVGLDFLNLPTDVFGVG 303
L + G F P N TP++ ++ Y + MT ++VG L +P VF
Sbjct: 310 LPATSAGTGFLDLGPGAPAANARLTPMLVDRGPTFYYVGMTGIKVGGHVLPIPGSVF--- 366
Query: 304 DNKGTIIDSGTTLAYLPEMVYEPLVSKI 331
GT++DSGT + LP Y PL S
Sbjct: 367 STAGTLVDSGTVITRLPPSAYAPLRSAF 394
>gi|165292434|dbj|BAF98915.1| aspartic proteinase nepenthesin I [Nepenthes alata]
Length = 437
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 98/364 (26%), Positives = 158/364 (43%), Gaps = 42/364 (11%)
Query: 47 EHDARRQQRILAGVDLPLGGSSRP--DGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNC 104
E +RR QR+ A ++ P G P G G Y + IGTP + + +DTGSD++W C
Sbjct: 65 ERGSRRLQRLEAMLNGP-SGVETPVYAGDGEYLMNLSIGTPAQPFSAIMDTGSDLIWTQC 123
Query: 105 IQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIY 164
C +C +S+ +++ + SS+ + C + C + + +N SC Y Y
Sbjct: 124 QPCTQCFNQST-----PIFNPQGSSSFSTLPCSSQLCQALQ----SPTCSNNSCQYTYGY 174
Query: 165 GDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFG 224
GDGS T G + + + VS ++ FGCG G + G++G G
Sbjct: 175 GDGSETQGSMGTETLTFGSVSIP--------NITFGCGENNQG----FGQGNGAGLVGMG 222
Query: 225 KSNSSMISQLASSGGVRKMFAHCLDGINGGG-----IFAIGHVVQPEVNKTPLVPNQ--- 276
+ S+ SQL V K F++C+ I + ++ + V T L+ +
Sbjct: 223 RGPLSLPSQL----DVTK-FSYCMTPIGSSNSSTLLLGSLANSVTAGSPNTTLIQSSQIP 277
Query: 277 PHYSINMTAVQVGLDFLNLPTDVFGVGDNKGT---IIDSGTTLAYLPEMVYEPLVSKIIS 333
Y I + + VG L + VF + N GT IIDSGTTL Y + Y+ + IS
Sbjct: 278 TFYYITLNGLSVGSTPLPIDPSVFKLNSNNGTGGIIIDSGTTLTYFVDNAYQAVRQAFIS 337
Query: 334 QQPDLKVHTVHDEY-TCFQY-SESVDEGFPNVTFHFENSVSLKVYPHEYLFPFEDLWCIG 391
Q V+ + CFQ S+ + P HF+ + + ++ P L C+
Sbjct: 338 QMNLSVVNGSSSGFDLCFQMPSDQSNLQIPTFVMHFDGGDLVLPSENYFISPSNGLICLA 397
Query: 392 WQNS 395
+S
Sbjct: 398 MGSS 401
>gi|326507654|dbj|BAK03220.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 442
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 105/404 (25%), Positives = 170/404 (42%), Gaps = 66/404 (16%)
Query: 22 VSSNHGVFSVKYRYAGRERSLSLLKEHDARRQQRILAGVDLPLGGSSR---PDGVGLYYA 78
+ SN V + ++ R + AR + + + D + +R P+G G Y
Sbjct: 36 IHSNPDVSATEFVRDALRRDM----HRHARFTRELASSGDRTVAAPTRKDLPNG-GEYIM 90
Query: 79 KIGIGTPPKDYYVQVDTGSDIMWVNCIQC-KECPRRSSLGIELTLYDIKDSSTGKFVTCD 137
+ IGTPP Y DTGSD++W C C +C +++ Y+ S+T + C+
Sbjct: 91 TLAIGTPPLSYPAIADTGSDLIWTQCAPCGSQCFKQAG-----QPYNPSSSTTFGVLPCN 145
Query: 138 Q--EFCHGVYG-GPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTN 194
C + G P C SC Y + YG G T G +Q V + S T
Sbjct: 146 SSVSMCAALAGPSPPPGC----SCMYNQTYGTG-WTAG--IQSVETFTFGSTPADQTRVP 198
Query: 195 GSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLD----- 249
G + FGC N S + G++G G+ + S++SQL + MF++CL
Sbjct: 199 G-IAFGC-----SNASSDDWNGSAGLVGLGRGSMSLVSQLGAG-----MFSYCLTPFQDA 247
Query: 250 ------------GINGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPT 297
+NG G+ V P +K P+ +Y +N+T + +G L++P
Sbjct: 248 NSTSTLLLGPSAALNGTGVLTTPFVASP--SKAPM---STYYYLNLTGISIGTTALSIPP 302
Query: 298 DVFGVGDN--KGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEY---TCFQY 352
+ F + + G IIDSGTT+ L + Y+ V I L V D CF
Sbjct: 303 NAFALRTDGTGGLIIDSGTTITSLVDAAYQ-QVRAAIESLVTLPVADGSDSTGLDLCFAL 361
Query: 353 SE--SVDEGFPNVTFHFENSVSLKVYPHEYLFPFEDLWCIGWQN 394
+ S P++TFHF+ + + + Y+ +WC+ +N
Sbjct: 362 TSETSTPPSMPSMTFHFDGA-DMVLPVDNYMILGSGVWCLAMRN 404
>gi|168059885|ref|XP_001781930.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666576|gb|EDQ53226.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 355
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 93/340 (27%), Positives = 146/340 (42%), Gaps = 61/340 (17%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKF 133
G Y A + +GTP + + V VDTGSD+ WV C C C ++ +L+ S++
Sbjct: 1 GEYLATVRLGTPERVFSVIVDTGSDLTWVQCSPCGTCYSQND-----SLFIPNTSTSFTK 55
Query: 134 VTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTST 193
+ C E C+G+ P C T+C Y YGDGS +TG FV D + D ++G Q
Sbjct: 56 LACGTELCNGL---PYPMCN-QTTCVYWYSYGDGSLSTGDFVYDTITMDGINGQKQQVP- 110
Query: 194 NGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGING 253
+ FGCG G+ DGI+G G+ S SQL + F++CL
Sbjct: 111 --NFAFGCGHDNEGSF-----AGADGILGLGQGPLSFPSQLKTV--FNGKFSYCLV---- 157
Query: 254 GGIFAIGHVVQPEVNKTPL------VPNQP---------------HYSINMTAVQVGLDF 292
+ P +PL VP P +Y + + + VG
Sbjct: 158 -------DWLAPPTQTSPLLFGDAAVPTFPGVKYISLLTNPKVPTYYYVKLNGISVGGKL 210
Query: 293 LNLPTDVFGVGD--NKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCF 350
LN+ + F + GTI DSGTT+ L V++ +++ + + D + D+ +
Sbjct: 211 LNISSTAFDIDSVGRAGTIFDSGTTVTQLAGEVHQEVLAAMNASTMDYPRKS--DDSSGL 268
Query: 351 Q-----YSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFE 385
++E P++TFHFE +++ P Y E
Sbjct: 269 DLCLGGFAEGQLPTVPSMTFHFEGG-DMELPPSNYFIFLE 307
>gi|242067691|ref|XP_002449122.1| hypothetical protein SORBIDRAFT_05g005410 [Sorghum bicolor]
gi|241934965|gb|EES08110.1| hypothetical protein SORBIDRAFT_05g005410 [Sorghum bicolor]
Length = 407
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 99/346 (28%), Positives = 151/346 (43%), Gaps = 57/346 (16%)
Query: 64 LGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQ----CKECPRRSSLGIE 119
LGG P G +Y + IG P K Y++ +DTGS++ W+ C CK C + +
Sbjct: 30 LGGDVHP--TGHFYVTMNIGEPAKPYFLDIDTGSNLTWIKCHATPGPCKTCNK-----VP 82
Query: 120 LTLYDIKDSSTGKFVTCDQEFCHGVYG--GPLTDCTANT-SCPYLEIYGDGSSTTGYFVQ 176
LY K K V C C ++ G DC C Y Y DG+++ G
Sbjct: 83 HPLYRPK-----KLVPCADPLCDALHKDLGTTKDCREEPDQCHYQINYADGTTSLG---- 133
Query: 177 DVVQYDKVSGDLQTTSTNGSLIFGCG--ARQSGNLDSTNEEALDGIIGFGKSNSSMISQL 234
V+ DK S T + ++ FGCG Q + + +DGI+G G+ + ++SQL
Sbjct: 134 -VLLLDKFS---LPTGSARNIAFGCGYDQMQGPKKKAPEKVPVDGILGLGRGSVDLVSQL 189
Query: 235 ASSGGVRK-MFAHCLDGINGGGIFAIGHVVQPEVNKTPL----VPNQP-HYSINMTAVQV 288
SG V K + HCL GGG IG P + + + +P HYS + +
Sbjct: 190 KHSGAVSKNVIGHCLSS-KGGGYLFIGEENVPSSHLHIIYIYCISREPNHYSPGQATLHL 248
Query: 289 GLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVS-----------KIISQQPD 337
G + + T F I DSG+T YLPE ++ LVS K++S D
Sbjct: 249 GRNPIG--TKPFKA------IFDSGSTYTYLPENLHAQLVSALKASLIKSSLKLVSDT-D 299
Query: 338 LKVHTVHDEYTCFQYSESVDEGFPN-VTFHFENSVSLKVYPHEYLF 382
++H F+ + + F + VT F++ V++ + P YL
Sbjct: 300 TRLHLCWKGPKPFKTVHDLPKEFKSLVTLKFDHGVTMTIPPENYLI 345
>gi|359485189|ref|XP_002279141.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 546
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 87/282 (30%), Positives = 130/282 (46%), Gaps = 32/282 (11%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y+ + +GTPPK + + +DTGSD+ W+ C+ C EC ++ YD SS+
Sbjct: 177 GSGEYFIDVFVGTPPKHFSLILDTGSDLNWIQCVPCYECFEQNG-----PHYDPGQSSSY 231
Query: 132 KFVTCDQEFCHGVYG-GPLTDCTA-NTSCPYLEIYGDGSSTTGYFVQD--VVQYDKVSGD 187
+ + C CH V P C A N +CPY YGD S+TTG F + V SG
Sbjct: 232 RNIGCHDSRCHLVSSPDPPQPCKAENQTCPYYYWYGDSSNTTGDFALETFTVNLTMSSGK 291
Query: 188 LQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHC 247
+ +++FGCG G G+ S SQL S G F++C
Sbjct: 292 PELRRVE-NVMFGCGHWNRGLFHGAAGLLGL-----GRGPLSFSSQLQSLYG--HSFSYC 343
Query: 248 LDGINGGG------IFAIGH--VVQPEVNKTPLV-----PNQPHYSINMTAVQVGLDFLN 294
L N IF + PE+N T LV P Y + + ++ VG + +N
Sbjct: 344 LVDRNSDANVSSKLIFGEDKDLLSHPELNFTTLVAGKENPVDTFYYVQIKSIVVGGEVVN 403
Query: 295 LPTDVFGVGDN--KGTIIDSGTTLAYLPEMVYEPLVSKIISQ 334
+P + + + + GTIIDSGTTL+Y E Y+ + +++
Sbjct: 404 IPEEKWQIATDGSGGTIIDSGTTLSYFAEPAYQVIKEAFMAK 445
>gi|195607464|gb|ACG25562.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 478
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 99/349 (28%), Positives = 150/349 (42%), Gaps = 46/349 (13%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y +GTP ++VDTGSD+ WV C C P S + L+D SS+ V
Sbjct: 140 YVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCSAAP--SCYSQKDPLFDPAQSSSYAAVP 197
Query: 136 CDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNG 195
C C G+ G + C Y+ YGDGS+TTG + D + L +S
Sbjct: 198 CGGPVCAGL-GIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLT-------LSASSAVQ 249
Query: 196 SLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASS-GGVRKMFAHCLD-GING 253
FGCG QSG + +DG++G G+ S++ Q A + GGV F++CL +
Sbjct: 250 GFFFGCGHAQSGLFN-----GVDGLLGLGREQPSLVEQTAGTYGGV---FSYCLPTKPST 301
Query: 254 GGIFAIG----HVVQPEVNKTPLV--PNQP-HYSINMTAVQVGLDFLNLPTDVFGVGDNK 306
G +G P + T L+ PN P +Y + +T + VG L++P F
Sbjct: 302 AGYLTLGLGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFA----G 357
Query: 307 GTIIDSGTTLAYLPEMVYEPLVSKIISQQ-----PDLKVHTVHDEYTCFQYSESVDEGFP 361
GT++D+GT + LP Y L S S P + + D TC+ ++ P
Sbjct: 358 GTVVDTGTVITRLPPTAYAALRSAFRSGMASYGYPTAPSNGILD--TCYNFAGYGTVTLP 415
Query: 362 NVTFHFENSVSLKVYPHEYLFPFEDLWCIGWQNSGMQSRDRKNMTLLGD 410
NV F + ++ + L C+ + SG M +LG+
Sbjct: 416 NVALTFGSGATVMLGADGIL----SFGCLAFAPSG----SDGGMAILGN 456
>gi|326505434|dbj|BAJ95388.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 529
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 86/301 (28%), Positives = 140/301 (46%), Gaps = 31/301 (10%)
Query: 75 LYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSL---GIELTLYDIKDSSTG 131
L+YA + +GTP + V +DTGSD+ WV C C +C SS ++ +Y + SST
Sbjct: 107 LHYAVVALGTPNVTFLVALDTGSDLFWVPC-DCLKCAPLSSPDYGNLKFDVYSPRKSSTS 165
Query: 132 KFVTCDQEFCHGVYGGPLTDCT-ANTSCPY-LEIYGDGSSTTGYFVQDVVQYDKVSGDLQ 189
+ V C C T+C+ A+ SCPY +E D +S+ G V+DV+ SG
Sbjct: 166 RKVPCSSNMCDL-----QTECSAASNSCPYKIEYLSDNTSSKGVLVEDVMYLATESG--H 218
Query: 190 TTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLD 249
+ T + FGCG Q+G+ A +G++G G + S+ S LAS G F+ C
Sbjct: 219 SKITQAPITFGCGQVQTGSF--LGSAAPNGLLGLGMDSKSVPSLLASQGVAANSFSMCF- 275
Query: 250 GINGGGIFAIGHVVQPEVNKTPL--VPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKG 307
G +G G G + +TPL + P+Y+I++ G +
Sbjct: 276 GEDGHGRINFGDTGSADQLETPLNIYKHNPYYNISIVGAMAGGKTFS---------TKFS 326
Query: 308 TIIDSGTTLAYLPEMVYEPLVS---KIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVT 364
++DSGT+ L + +Y + S K + ++ + ++ EY C+ S PN++
Sbjct: 327 AVVDSGTSFTALSDPMYTEITSAFDKQVKEKRNPADSSLPFEY-CYTISSKGAVSPPNIS 385
Query: 365 F 365
Sbjct: 386 L 386
>gi|224111722|ref|XP_002315953.1| predicted protein [Populus trichocarpa]
gi|222864993|gb|EEF02124.1| predicted protein [Populus trichocarpa]
Length = 484
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 86/339 (25%), Positives = 147/339 (43%), Gaps = 39/339 (11%)
Query: 64 LGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLY 123
+ G+S+ G G Y++++GIG PP Y+ +DTGSD+ WV C C +C +++ ++
Sbjct: 139 ISGTSQ--GSGEYFSRVGIGKPPSQAYLILDTGSDVNWVQCAPCADCYQQAD-----PIF 191
Query: 124 DIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDK 183
+ S++ ++C+ C + +++C N +C Y YGDGS T G FV + +
Sbjct: 192 EPASSASFSTLSCNTRQCRSL---DVSEC-RNDTCLYEVSYGDGSYTVGDFVTETITLGS 247
Query: 184 VSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKM 243
D ++ GCG G G + S SQ+ ++
Sbjct: 248 APVD--------NVAIGCGHNNEGLFVGAAGLLGL-----GGGSLSFPSQINATS----- 289
Query: 244 FAHCL--DGINGGGIFAIGHVVQPEVNKTPLVPNQ---PHYSINMTAVQVGLDFLNLPTD 298
F++CL + P PL+ N Y + +T + VG + +++P
Sbjct: 290 FSYCLVDRDSESASTLEFNSTLPPNAVSAPLLRNHHLDTFYYVGLTGLSVGGELVSIPES 349
Query: 299 VFGVGD--NKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDL-KVHTVHDEYTCFQYSES 355
F + + N G I+DSGT + L VY L + + DL + + TC+ S
Sbjct: 350 AFQIDESGNGGVIVDSGTAITRLQTDVYNSLRDAFVKRTRDLPSTNGIALFDTCYDLSSK 409
Query: 356 VDEGFPNVTFHFENSVSLKVYPHEYLFPF--EDLWCIGW 392
+ P V+FHF + L + YL P E +C +
Sbjct: 410 GNVEVPTVSFHFPDGKELPLPAKNYLVPLDSEGTFCFAF 448
>gi|242067689|ref|XP_002449121.1| hypothetical protein SORBIDRAFT_05g005400 [Sorghum bicolor]
gi|241934964|gb|EES08109.1| hypothetical protein SORBIDRAFT_05g005400 [Sorghum bicolor]
Length = 358
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 86/320 (26%), Positives = 136/320 (42%), Gaps = 29/320 (9%)
Query: 73 VGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGK 132
G YY + IG P K Y++ VDTGSD+ W+ C + P RS + LY +S
Sbjct: 51 TGHYYVTMNIGNPAKPYFLDVDTGSDLTWLQC----DAPCRSCNKVPHPLYRPTANS--- 103
Query: 133 FVTCDQEFCHGVYGGPLTD--CTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQT 190
V C C ++ G ++ C + C Y Y D +S+ G + D S +++
Sbjct: 104 LVPCANALCTALHSGHGSNNKCPSPKQCDYQIKYTDSASSQGVLIND-----NFSLPMRS 158
Query: 191 TSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDG 250
++ L FGCG Q + + A DG++G G+ + S++SQL G + + HCL
Sbjct: 159 SNIRPGLTFGCGYDQQVGKNGAVQAATDGMLGLGRGSVSLVSQLKQQGITKNVLGHCLS- 217
Query: 251 INGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTII 310
NGGG G + P T V P I+ G L GV + +
Sbjct: 218 TNGGGFLFFGDDIVP----TSRVTWVPMAKISGNYYSPGSGTLYFDRRSLGVKPME-VVF 272
Query: 311 DSGTTLAYLPEMVYEPLV-------SKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNV 363
DSG+T Y Y+ +V SK + Q D + F+ V + F ++
Sbjct: 273 DSGSTYTYFTAQPYQAVVSALKSGLSKSLKQVSDPSLPLCWKGPKAFKSVFDVKKEFKSL 332
Query: 364 TFHFENSVS--LKVYPHEYL 381
F ++ + +++ P YL
Sbjct: 333 FLSFASAKNAVMEIPPENYL 352
>gi|356507437|ref|XP_003522473.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 440
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 84/297 (28%), Positives = 124/297 (41%), Gaps = 28/297 (9%)
Query: 55 RILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNC-IQCKECPRR 113
R + V P+ G+ P VG Y + IG PP+ Y++ +DTGSD+ W+ C C C +
Sbjct: 60 RAGSSVVFPVHGNVYP--VGFYNVTLNIGQPPRPYFLDIDTGSDLTWLQCDAPCSRCSQT 117
Query: 114 SSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGY 173
LY + V C C ++ DC C Y Y D S+ G
Sbjct: 118 PH-----PLY----RPSNDLVPCRHALCASLHLSDNYDCEVPHQCDYEVQYADHYSSLGV 168
Query: 174 FVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQ 233
+ DV + +G + GCG Q + LDG++G G+ +S+ SQ
Sbjct: 169 LLHDVYTLNFTNG----VQLKVRMALGCGYDQI--FPDPSHHPLDGMLGLGRGKTSLTSQ 222
Query: 234 LASSGGVRKMFAHCLDGINGGGIFAIGHVVQP-EVNKTPLVP-NQPHYSINMTAVQVGLD 291
L S G VR + HCL GG IF G V + TP+ + HYS+ G
Sbjct: 223 LNSQGLVRNVIGHCLSAQGGGYIF-FGDVYDSFRLTWTPMSSRDYKHYSV------AGAA 275
Query: 292 FLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYT 348
L GVG N + D+G++ Y Y+ L+S + + + HD+ T
Sbjct: 276 ELLFGGKKSGVG-NLHAVFDTGSSYTYFNSYAYQVLISWLKKESGGKPLKEAHDDQT 331
>gi|414886964|tpg|DAA62978.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 452
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 104/382 (27%), Positives = 152/382 (39%), Gaps = 60/382 (15%)
Query: 45 LKEHDARRQQRIL---AGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMW 101
+ H+AR+ A V P S G Y + IGTPP Y DTGSD++W
Sbjct: 61 MHRHNARKLALAASSGATVSAPTQDSPT---AGEYLMALAIGTPPLPYQAIADTGSDLIW 117
Query: 102 VNCIQC-KECPRRSSLGIELTLYDIKDSSTGKFVTCDQEF--CHGVYGGPLTDCTANTSC 158
C C +C R+ + LY+ S+T + C+ C G T +C
Sbjct: 118 TQCAPCTSQCFRQPT-----PLYNPSSSTTFAVLPCNSSLSVCAAALAGTGTAPPPGCAC 172
Query: 159 PYLEIYGDG------SSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDST 212
Y YG G S T F + +V G + FGC SG
Sbjct: 173 TYNVTYGSGWTSVFQGSETFTFGSTPAGHARVPG----------IAFGCSTASSG----F 218
Query: 213 NEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL---DGINGGGIFAIGHVVQ----P 265
N + G++G G+ S++SQL GV K F++CL N +G
Sbjct: 219 NASSASGLVGLGRGRLSLVSQL----GVPK-FSYCLTPYQDTNSTSTLLLGPSASLNGTA 273
Query: 266 EVNKTPLV------PNQPHYSINMTAVQVGLDFLNLPTDVFGVGDN--KGTIIDSGTTLA 317
V+ TP V P Y +N+T + +G L++P D F + + G IIDSGTT+
Sbjct: 274 GVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALSIPPDAFSLNADGTGGLIIDSGTTIT 333
Query: 318 YLPEMVYEPLVSKIIS----QQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSL 373
L Y+ + + ++S D T D S S P++T HF N +
Sbjct: 334 LLGNTAYQQVRAAVVSLVTLPTTDGSADTGLDLCFMLPSSTSAPPAMPSMTLHF-NGADM 392
Query: 374 KVYPHEYLFPFED-LWCIGWQN 394
+ Y+ + LWC+ QN
Sbjct: 393 VLPADSYMMSDDSGLWCLAMQN 414
>gi|225430551|ref|XP_002283470.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 490
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 94/351 (26%), Positives = 154/351 (43%), Gaps = 43/351 (12%)
Query: 50 ARRQQRILAGVDLPLGGSSRPD------GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVN 103
+R + + G +L ++ P G G Y +G+G+P +D DTGSD+ W
Sbjct: 115 SRLAKNLAGGSNLKASKATLPSKSASTLGSGNYVVTVGLGSPKRDLTFIFDTGSDLTWTQ 174
Query: 104 CIQC-KECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYG--GPLTDCTANTSCPY 160
C C C ++ ++D S + V+CD C + G C+++T C Y
Sbjct: 175 CEPCVGYCYQQRE-----HIFDPSTSLSYSNVSCDSPSCEKLESATGNSPGCSSST-CLY 228
Query: 161 LEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGI 220
YGDGS + G+F ++ + L +T + FGCG G T G+
Sbjct: 229 GIRYGDGSYSIGFFAREKL-------SLTSTDVFNNFQFGCGQNNRGLFGGTA-----GL 276
Query: 221 IGFGKSNSSMISQLASSGGVRKMFAHCL---DGINGGGIFAIGHVVQPEVNKTPLVPNQP 277
+G ++ S++SQ A G K+F++CL G F G V TP N
Sbjct: 277 LGLARNPLSLVSQTAQKYG--KVFSYCLPSSSSSTGYLSFGSGDGDSKAVKFTPSEVNSD 334
Query: 278 H---YSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVY---EPLVSKI 331
+ Y ++M + VG L +P VF GTIIDSGT ++ LP VY + + ++
Sbjct: 335 YPSFYFLDMVGISVGERKLPIPKSVFSTA---GTIIDSGTVISRLPPTVYSSVQKVFREL 391
Query: 332 ISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLF 382
+S P +K ++ D TC+ S+ P + +F + + P ++
Sbjct: 392 MSDYPRVKGVSILD--TCYDLSKYKTVKVPKIILYFSGGAEMDLAPEGIIY 440
>gi|326532056|dbj|BAK01404.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 506
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 92/318 (28%), Positives = 140/318 (44%), Gaps = 38/318 (11%)
Query: 33 YRYAGRERSLSLLKEHDARR--QQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYY 90
+R A S + ++ RR +R++A V+ S P G G Y + +GTPP+ +
Sbjct: 109 HRRAALSGSAAARRDSAPRRALSERVVATVE-----SGVPVGSGEYLVDVYLGTPPRRFR 163
Query: 91 VQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLT 150
+ +DTGSD+ W+ C C +C +S ++D S + + VTC + C V +
Sbjct: 164 MIMDTGSDLNWLQCAPCLDCFEQSG-----PIFDPAASISYRNVTCGDDRCRLVSPPAES 218
Query: 151 ---DCTANTS--CPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQ 205
+C S CPY YGD S+TTG + + T +G + FGCG R
Sbjct: 219 APRECRRPRSDPCPYYYWYGDQSNTTGDLALEAFTVNLTQSG--TRRVDG-VAFGCGHRN 275
Query: 206 SGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL--DGINGGGIFAIGH-- 261
G G+ S SQL G F++CL G G GH
Sbjct: 276 RGLFHGAAGLLGL-----GRGPLSFASQLRGVYG-GHAFSYCLVEHGSAAGSKIIFGHDD 329
Query: 262 --VVQPEVNKTPLVPN---QPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTL 316
+ P++N T P Y + + ++ VG + +N+ +D G GTIIDSGTTL
Sbjct: 330 ALLAHPQLNYTAFAPTTDADTFYYLQLKSILVGGEAVNISSDTLSAG---GTIIDSGTTL 386
Query: 317 AYLPEMVYEPLVSKIISQ 334
+Y PE Y+ + I +
Sbjct: 387 SYFPEPAYQAIRQAFIDR 404
>gi|297819836|ref|XP_002877801.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297323639|gb|EFH54060.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 513
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 90/357 (25%), Positives = 155/357 (43%), Gaps = 50/357 (14%)
Query: 75 LYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRR----------SSLGIELTLYD 124
L+YA + IGTP + + V +DTGSD+ W+ C C R ++ I L +Y+
Sbjct: 110 LHYANVTIGTPAQWFLVALDTGSDLFWLPCNCNSTCVRSMETDQGETHMNAQRIRLNIYN 169
Query: 125 IKDSSTGKFVTCDQEFC--HGVYGGPLTDCTANTSCPY-LEIYGDGSSTTGYFVQDVVQY 181
S++ VTC+ C PL+D CPY + GS +TG V+DV+
Sbjct: 170 PSISTSSSKVTCNSTLCALRNRCISPLSD------CPYRIRYLSPGSKSTGVLVEDVIHM 223
Query: 182 DKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVR 241
G+ + + + FGC Q G E A++GI+G ++ ++ + L +G
Sbjct: 224 STEEGEAR----DARITFGCSETQLGLF---QEVAVNGIMGLAMADIAVPNMLVKAGVAS 276
Query: 242 KMFAHCLDGINGGGIFAIGHVVQPEVNKTPL--VPNQPHYSINMTAVQVGLDFLNLPTDV 299
F+ C G NG G + G + ++TPL + Y +++T +VG
Sbjct: 277 DSFSMCF-GPNGKGTISFGDKGSSDQHETPLGGTISPLFYDVSITKFKVG---------K 326
Query: 300 FGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQY---SESV 356
V I DSGT + +L + Y L + PD ++ D F Y S S
Sbjct: 327 VTVETKFSAIFDSGTAVTWLLDPYYTALTTNFHLSVPDRRLPANVDSTFEFCYIITSTSD 386
Query: 357 DEGFPNVTFHFENSVSLKVYPHEYLFPFED----LWCIGWQNSGMQSRDRKNMTLLG 409
+E P+++F + + V+ +F D ++C+ + +D+ + ++G
Sbjct: 387 EEKLPSISFEMKGGAAYDVFSPILVFDTSDGSFQVYCL-----AVLKQDKADFNIIG 438
>gi|224142007|ref|XP_002324352.1| predicted protein [Populus trichocarpa]
gi|222865786|gb|EEF02917.1| predicted protein [Populus trichocarpa]
Length = 460
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 86/325 (26%), Positives = 153/325 (47%), Gaps = 34/325 (10%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQC-KECPRRSSLGIELTLYDIKDSST 130
G G Y +G+GTP K++ + DTGSDI W C C K C ++ + + S++
Sbjct: 115 GAGDYVVTVGLGTPKKEFTLIFDTGSDITWTQCEPCVKTCYKQKEPRL-----NPSTSTS 169
Query: 131 GKFVTCDQEFCHGVYGG-PLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQ 189
K ++C C V G + ++++C Y YGDGS + G+F + + L
Sbjct: 170 YKNISCSSALCKLVASGKKFSQSCSSSTCLYQVQYGDGSYSIGFFATETLT-------LS 222
Query: 190 TTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLD 249
+++ + +FGCG + ++ G++G G++ ++ SQ A + +K+F++CL
Sbjct: 223 SSNVFKNFLFGCGQQ-----NNGLFGGAAGLLGLGRTKLALPSQTAKT--YKKLFSYCLP 275
Query: 250 GINGG-GIFAIGHVVQPEVNKTPLVPN---QPHYSINMTAVQVGLDFLNLPTDVFGVGDN 305
+ G ++G V V TPL + P Y +++T + VG L++ F +
Sbjct: 276 ASSSSKGYLSLGGQVSKSVKFTPLSADFDSTPFYGLDITGLSVGGRKLSIDESAF----S 331
Query: 306 KGTIIDSGTTLAYLPEMVYEPLVS---KIISQQPDLKVHTVHDEYTCFQYSESVDEGFPN 362
GT+IDSGT + L Y L S +++ P +++ D TC+ +S+ P
Sbjct: 332 AGTVIDSGTVITRLSPTAYSELSSAFQNLMTDYPSTSGYSIFD--TCYDFSKYDTVRIPK 389
Query: 363 VTFHFENSVSLKVYPHEYLFPFEDL 387
V F+ V + + L+P L
Sbjct: 390 VGVTFKGGVEMDIDVSGILYPVNGL 414
>gi|226493786|ref|NP_001142400.1| uncharacterized protein LOC100274575 [Zea mays]
gi|194708650|gb|ACF88409.1| unknown [Zea mays]
Length = 392
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 108/405 (26%), Positives = 161/405 (39%), Gaps = 66/405 (16%)
Query: 45 LKEHDARRQQRIL---AGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMW 101
+ H+AR+ A V P S G Y + IGTPP Y DTGSD++W
Sbjct: 1 MHRHNARKLALAASSGATVSAPTQDSPT---AGEYLMALAIGTPPLPYQAIADTGSDLIW 57
Query: 102 VNCIQC-KECPRRSSLGIELTLYDIKDSSTGKFVTCDQEF--CHGVYGGPLTDCTANTSC 158
C C +C R+ + LY+ S+T + C+ C G T +C
Sbjct: 58 TQCAPCTSQCFRQPT-----PLYNPSSSTTFAVLPCNSSLSVCAAALAGTGTAPPPGCAC 112
Query: 159 PYLEIYGDG------SSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDST 212
Y YG G S T F + +V G + FGC SG
Sbjct: 113 TYNVTYGSGWTSVFQGSETFTFGSTPAGHARVPG----------IAFGCSTASSG----F 158
Query: 213 NEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL---DGINGGGIFAIGHVVQ----P 265
N + G++G G+ S++SQL GV K F++CL N +G
Sbjct: 159 NASSASGLVGLGRGRLSLVSQL----GVPK-FSYCLTPYQDTNSTSTLLLGPSASLNGTA 213
Query: 266 EVNKTPLV------PNQPHYSINMTAVQVGLDFLNLPTDVFGVGDN--KGTIIDSGTTLA 317
V+ TP V P Y +N+T + +G L++P D F + + G IIDSGTT+
Sbjct: 214 GVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALSIPPDAFSLNADGTGGLIIDSGTTIT 273
Query: 318 YLPEMVYEPLVSKIIS----QQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSL 373
L Y+ + + ++S D T D S S P++T HF N +
Sbjct: 274 LLGNTAYQQVRAAVVSLVTLPTTDGSADTGLDLCFMLPSSTSAPPAMPSMTLHF-NGADM 332
Query: 374 KVYPHEYLFPFED-LWCIGWQNSG------MQSRDRKNMTLLGDF 411
+ Y+ + LWC+ QN + + ++NM +L D
Sbjct: 333 VLPADSYMMSDDSGLWCLAMQNQTDGEVNILGNYQQQNMHILYDI 377
>gi|356518800|ref|XP_003528065.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 438
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 81/296 (27%), Positives = 122/296 (41%), Gaps = 26/296 (8%)
Query: 55 RILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNC-IQCKECPRR 113
R + V P+ G+ P VG Y + IG PP+ Y++ +DTGSD+ W+ C C C +
Sbjct: 58 RAGSSVVFPVHGNVYP--VGFYNVTLNIGQPPRPYFLDIDTGSDLTWLQCDAPCSRCSQT 115
Query: 114 SSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGY 173
LY + FV C C ++ DC C Y Y D S+ G
Sbjct: 116 PH-----PLY----RPSNDFVPCRHSLCASLHHSDNYDCEVPHQCDYEVQYADHYSSLGV 166
Query: 174 FVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQ 233
+ DV + +G + GCG Q + LDG++G G+ +S+ SQ
Sbjct: 167 LLHDVYTLNFTNG----VQLKVRMALGCGYDQI--FPDPSHHPLDGMLGLGRGKTSLTSQ 220
Query: 234 LASSGGVRKMFAHCLDGINGGGIFAIGHVVQPEVNKTPLVP-NQPHYSINMTAVQVGLDF 292
L S G VR + HCL GG IF + TP+ + HYS G
Sbjct: 221 LNSQGLVRNVIGHCLSAQGGGYIFFGDVYDSSRLTWTPMSSRDYKHYS------AAGAAE 274
Query: 293 LNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYT 348
L G+G + + D+G++ Y Y+ L+S + + + HD+ T
Sbjct: 275 LLFGGKKSGIG-SLHAVFDTGSSYTYFNPYAYQALISWLGKESGGKPLKEAHDDQT 329
>gi|147802609|emb|CAN73001.1| hypothetical protein VITISV_037997 [Vitis vinifera]
Length = 424
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 88/297 (29%), Positives = 127/297 (42%), Gaps = 32/297 (10%)
Query: 56 ILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNC-IQCKECPRRS 114
I + V PL G+ P +G YY + IG PP Y++ TGSD+ W+ C C C +
Sbjct: 49 IQSSVVFPLYGNVYP--LGYYYVSLSIGQPPXPYFLDPXTGSDLSWLQCDAPCVRCTKAX 106
Query: 115 SLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYF 174
LY V C C ++ P C C Y Y DG S+ G
Sbjct: 107 H-----XLY----RPNNNLVICKDPMCAXLHP-PGYKCEHPEQCDYEVEYADGGSSLGVL 156
Query: 175 VQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQL 234
V+DV + +G L GCG Q + + LDG++G GK SS++SQL
Sbjct: 157 VKDVFPLNFTNG----LRLAPRLALGCGYDQ---IPGXSYHPLDGVLGLGKGKSSIVSQL 209
Query: 235 ASSGGVRKMFAHCLDGINGGGIFAIGHVV--QPEVNKTPLVPNQ-PHYSINMTAVQVGLD 291
S G +R + HC+ +GGG G + V TP++ +Q HYS + +G
Sbjct: 210 HSQGVIRNVVGHCVSS-HGGGFLFFGDDLYDSSRVVWTPMLRDQHTHYSSGYAELILGGK 268
Query: 292 FLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYT 348
T VF N DSG++ YL + Y+ LV + + + V D+ T
Sbjct: 269 -----TTVF---KNLLVTFDSGSSYTYLNSLAYQALVHLVRKELSEKPVREALDDQT 317
>gi|224126221|ref|XP_002329620.1| predicted protein [Populus trichocarpa]
gi|222870359|gb|EEF07490.1| predicted protein [Populus trichocarpa]
Length = 357
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 85/323 (26%), Positives = 144/323 (44%), Gaps = 35/323 (10%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y+ ++G+G P + +Y+ +DTGSDI W+ C C +C +++ ++D SST
Sbjct: 16 GSGEYFTRVGVGNPARQFYMVLDTGSDINWLQCQPCTDCYQQTD-----PIFDPTASSTY 70
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
VTC + C + ++ C + C Y YGDGS T G F + V + SG ++
Sbjct: 71 APVTCQSQQCSSL---EMSSCRSG-QCLYQVNYGDGSYTFGDFATESVSFGN-SGSVK-- 123
Query: 192 STNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI 251
++ GCG G G S+ +QL ++ F++CL
Sbjct: 124 ----NVALGCGHDNEGLFVGAAGLLGLGGGPL-----SLTNQLKATS-----FSYCLVNR 169
Query: 252 NGGGIFAIG-HVVQPEVNK--TPLVPNQP---HYSINMTAVQVGLDFLNLPTDVFGVGD- 304
+ G + + Q V+ PL+ N+ Y + ++ + VG +++P F + +
Sbjct: 170 DSAGSSTLDFNSAQLGVDSVTAPLMKNRKIDTFYYVGLSGMSVGGQMVSIPESTFRLDES 229
Query: 305 -NKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEY-TCFQYSESVDEGFPN 362
N G I+D GT + L Y PL + +LK+ + + TC+ S P
Sbjct: 230 GNGGIIVDCGTAITRLQTQAYNPLRDAFVRMTQNLKLTSAVALFDTCYDLSGQASVRVPT 289
Query: 363 VTFHFENSVSLKVYPHEYLFPFE 385
V+FHF + S + YL P +
Sbjct: 290 VSFHFADGKSWNLPAANYLIPVD 312
>gi|224142009|ref|XP_002324353.1| predicted protein [Populus trichocarpa]
gi|222865787|gb|EEF02918.1| predicted protein [Populus trichocarpa]
Length = 412
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 86/325 (26%), Positives = 153/325 (47%), Gaps = 34/325 (10%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQC-KECPRRSSLGIELTLYDIKDSST 130
G G Y +G+GTP K++ + DTGSDI W C C K C ++ + + S++
Sbjct: 67 GAGDYVVTVGLGTPKKEFTLIFDTGSDITWTQCEPCVKTCYKQKEPRLNPST-----STS 121
Query: 131 GKFVTCDQEFCHGVYGG-PLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQ 189
K ++C C V G + ++++C Y YGDGS + G+F + + L
Sbjct: 122 YKNISCSSALCKLVASGKKFSQSCSSSTCLYQVQYGDGSYSIGFFATETLT-------LS 174
Query: 190 TTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLD 249
+++ + +FGCG + ++ G++G G++ ++ SQ A + +K+F++CL
Sbjct: 175 SSNVFKNFLFGCGQQ-----NNGLFGGAAGLLGLGRTKLALPSQTAKT--YKKLFSYCLP 227
Query: 250 GINGG-GIFAIGHVVQPEVNKTPLVPN---QPHYSINMTAVQVGLDFLNLPTDVFGVGDN 305
+ G ++G V V TPL + P Y +++T + VG L++ F +
Sbjct: 228 ASSSSKGYLSLGGQVSKSVKFTPLSADFDSTPFYGLDITGLSVGGRQLSIDESAF----S 283
Query: 306 KGTIIDSGTTLAYLPEMVYEPLVS---KIISQQPDLKVHTVHDEYTCFQYSESVDEGFPN 362
GT+IDSGT + L Y L S +++ P +++ D TC+ +S+ P
Sbjct: 284 AGTVIDSGTVITRLSPTAYSELSSAFQNLMTDYPSTSGYSIFD--TCYDFSKYDTVRIPK 341
Query: 363 VTFHFENSVSLKVYPHEYLFPFEDL 387
V F+ V + + L+P L
Sbjct: 342 VGVTFKGGVEMDIDVSGILYPVNGL 366
>gi|357448247|ref|XP_003594399.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355483447|gb|AES64650.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 452
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 116/380 (30%), Positives = 165/380 (43%), Gaps = 58/380 (15%)
Query: 44 LLKEHDARRQQRI----LAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDI 99
L K DA + LAG+ L G S G G YY K+G+G+P K Y + VDTGS
Sbjct: 70 LAKNSDANASSKKVGPKLAGIPLKSGLSM---GSGNYYVKMGLGSPTKYYTMIVDTGSSF 126
Query: 100 MWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTA---NT 156
W +QC+ C + E +++ S T K V C C + L + T +
Sbjct: 127 SW---LQCQPCTIYCHIQ-EDPVFNPSASKTYKTVPCSSSQCSSLKSATLNEPTCSKQSN 182
Query: 157 SCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEA 216
+C Y YGD S + GY QDV+ L + T S ++GCG G T
Sbjct: 183 ACVYKASYGDSSFSLGYLSQDVL-------TLTPSQTLSSFVYGCGQDNQGLFGRT---- 231
Query: 217 LDGIIGFGKSNSSMISQLASSGGVRKMFAHCL------DGINGGGIFAIG-HVVQPEVNK 269
DGIIG + SM+SQL SG F++CL G +IG + P +
Sbjct: 232 -DGIIGLANNELSMLSQL--SGKYGNAFSYCLPTSFSTPNSPKEGFLSIGTSSLTPSSSY 288
Query: 270 --TPLV--PNQPH-YSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVY 324
TPL+ PN P Y I++ ++ V L + + V TIIDSGT + LP VY
Sbjct: 289 KFTPLLKNPNNPSLYFIDLESITVAGRPLGVAASSYKV----PTIIDSGTVITRLPTPVY 344
Query: 325 EPL-------VSKIISQQPDLKVHTVHDEYTCFQYSES-VDEGFPNVTFHFENSVSLKVY 376
L +SK Q P + + TCF+ S + + E P++ F+ L++
Sbjct: 345 TTLKNAYVTILSKKYQQAPGISLLD-----TCFKGSLAGISEVAPDIRIIFKGGADLQLK 399
Query: 377 PHEYLFPFED-LWCIGWQNS 395
H L E + C+ S
Sbjct: 400 GHNSLVELETGITCLAMAGS 419
>gi|118484458|gb|ABK94105.1| unknown [Populus trichocarpa]
Length = 499
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 86/332 (25%), Positives = 147/332 (44%), Gaps = 37/332 (11%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y+ ++G+G P + +Y+ +DTGSDI W+ C C +C +++ ++D SST
Sbjct: 157 GSGEYFTRVGVGNPARQFYMVLDTGSDINWLQCQPCTDCYQQTD-----PIFDPTASSTY 211
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
VTC + C + ++ C + C Y YGDGS T G F + V + SG ++
Sbjct: 212 APVTCQSQQCSSL---EMSSCRSG-QCLYQVNYGDGSYTFGDFATESVSFGN-SGSVK-- 264
Query: 192 STNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI 251
++ GCG G G S+ +QL ++ F++CL
Sbjct: 265 ----NVALGCGHDNEGLFVGAAGLLGLGGGPL-----SLTNQLKATS-----FSYCLVNR 310
Query: 252 NGGGIFAIG-HVVQPEVNK--TPLVPNQP---HYSINMTAVQVGLDFLNLPTDVFGVGD- 304
+ G + + Q V+ PL+ N+ Y + ++ + VG +++P F + +
Sbjct: 311 DSAGSSTLDFNSAQLGVDSVTAPLMKNRKIDTFYYVGLSGMSVGGQMVSIPESTFRLDES 370
Query: 305 -NKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEY-TCFQYSESVDEGFPN 362
N G I+D GT + L Y PL + +LK+ + + TC+ S P
Sbjct: 371 GNGGIIVDCGTAITRLQTQAYNPLRDAFVRMTQNLKLTSAVALFDTCYDLSGQASVRVPT 430
Query: 363 VTFHFENSVSLKVYPHEYLFPFED--LWCIGW 392
V+FHF + S + YL P + +C +
Sbjct: 431 VSFHFADGKSWNLPAANYLIPVDSAGTYCFAF 462
>gi|255566010|ref|XP_002523993.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536720|gb|EEF38361.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 439
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 103/349 (29%), Positives = 146/349 (41%), Gaps = 33/349 (9%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKF 133
G Y K +GTP D DTGSD++W C C +C + + L+D K SST +
Sbjct: 90 GEYLMKFSLGTPAFDILAIADTGSDLIWTQCKPCDQCYEQ-----DAPLFDPKSSSTYRD 144
Query: 134 VTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTST 193
++C + C + G N +C Y YGD S T+G D + SG
Sbjct: 145 ISCSTKQCDLLKEGASCSGEGNKTCHYSYSYGDRSFTSGNVAADTITLGSTSGRPVLLP- 203
Query: 194 NGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGING 253
I GCG G+ E GI+G G S+ISQL S+ + F++CL ++
Sbjct: 204 --KAIIGCGHNNGGSF----TEKGSGIVGLGGGPISLISQLGST--IDGKFSYCLVPLSS 255
Query: 254 GGIFAI-------GHVVQPEVNKTPLVPNQP--HYSINMTAVQVGLDFLNLPTDVFGVGD 304
+ G V V TPL+ P Y + + AV VG + + P FG +
Sbjct: 256 NATNSSKLNFGSNGIVSGGGVQSTPLISKDPDTFYFLTLEAVSVGSERIKFPGSSFGTSE 315
Query: 305 NKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQ--YSESVDEGFPN 362
IIDSGTTL PE + L S + Q + V D YS D FP+
Sbjct: 316 GN-IIIDSGTTLTLFPEDFFSELSSAV---QDAVAGTPVEDPSGILSLCYSIDADLKFPS 371
Query: 363 VTFHFENSVSLKVYPHEYLFPFED-LWCIGWQ--NSGMQSRDRKNMTLL 408
+T HF+ + +K+ P D + C + NSG + M L
Sbjct: 372 ITAHFDGA-DVKLNPLNTFVQVSDTVLCFAFNPINSGAIFGNLAQMNFL 419
>gi|242095602|ref|XP_002438291.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
gi|241916514|gb|EER89658.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
Length = 464
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 88/315 (27%), Positives = 138/315 (43%), Gaps = 37/315 (11%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G Y + IGTP + +DTGSD+ WV QC C +S + L+D S+T
Sbjct: 125 GTTEYVITVTIGTPAVTQVMSIDTGSDVSWV---QCAPCAAQSCSSQKDKLFDPAMSATY 181
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
+C C + G + + C Y+ YGDGS+T G + D + L ++
Sbjct: 182 SAFSCGSAQCAQL--GDEGNGCLKSQCQYIVKYGDGSNTAGTYGSDTL-------SLTSS 232
Query: 192 STNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL--D 249
S FGC R +G + LDG++G G S++SQ A++ G K F++CL
Sbjct: 233 DAVKSFQFGCSHRAAGFVGE-----LDGLMGLGGDTESLVSQTAATYG--KAFSYCLPPP 285
Query: 250 GINGGGIF---AIGHVVQPEVNKTPLV--PNQPHYSINMTAVQVGLDFLNLPTDVFGVGD 304
+GGG A G + TP+V Y + + + V LN+P VF
Sbjct: 286 SSSGGGFLTLGAAGGASSSRYSHTPMVRFSVPTFYGVFLQGITVAGTMLNVPASVF---- 341
Query: 305 NKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVH----TVHDEYTCFQYSESVDEGF 360
+ +++DSGT + LP Y+ L + + ++K + V TCF +S
Sbjct: 342 SGASVVDSGTVITQLPPTAYQALRTAF---KKEMKAYPSAAPVGSLDTCFDFSGFNTITV 398
Query: 361 PNVTFHFENSVSLKV 375
P VT F ++ +
Sbjct: 399 PTVTLTFSRGAAMDL 413
>gi|224142005|ref|XP_002324351.1| predicted protein [Populus trichocarpa]
gi|222865785|gb|EEF02916.1| predicted protein [Populus trichocarpa]
Length = 472
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 86/325 (26%), Positives = 153/325 (47%), Gaps = 34/325 (10%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQC-KECPRRSSLGIELTLYDIKDSST 130
G G Y +G+GTP K++ + DTGSDI W C C K C ++ + + S++
Sbjct: 127 GAGDYVVTVGLGTPKKEFTLIFDTGSDITWTQCEPCVKTCYKQKEPRL-----NPSTSTS 181
Query: 131 GKFVTCDQEFCHGVYGG-PLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQ 189
K ++C C V G + ++++C Y YGDGS + G+F + + L
Sbjct: 182 YKNISCSSALCKLVASGKKFSQSCSSSTCLYQVQYGDGSYSIGFFATETLT-------LS 234
Query: 190 TTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLD 249
+++ + +FGCG + ++ G++G G++ ++ SQ A + +K+F++CL
Sbjct: 235 SSNVFKNFLFGCGQQ-----NNGLFGGAAGLLGLGRTKLALPSQTAKT--YKKLFSYCLP 287
Query: 250 GINGG-GIFAIGHVVQPEVNKTPLVPN---QPHYSINMTAVQVGLDFLNLPTDVFGVGDN 305
+ G ++G V V TPL + P Y +++T + VG L++ F +
Sbjct: 288 ASSSSKGYLSLGGQVSKSVKFTPLSADFDSTPFYGLDITGLSVGGRKLSIDESAF----S 343
Query: 306 KGTIIDSGTTLAYLPEMVYEPLVS---KIISQQPDLKVHTVHDEYTCFQYSESVDEGFPN 362
GT+IDSGT + L Y L S +++ P +++ D TC+ +S+ P
Sbjct: 344 AGTVIDSGTVITRLSPTAYSELSSAFQNLMTDYPSTSGYSIFD--TCYDFSKYDTVRIPK 401
Query: 363 VTFHFENSVSLKVYPHEYLFPFEDL 387
V F+ V + + L+P L
Sbjct: 402 VGVTFKGGVEMDIDVSGILYPVNGL 426
>gi|357517921|ref|XP_003629249.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355523271|gb|AET03725.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 553
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 94/318 (29%), Positives = 143/318 (44%), Gaps = 42/318 (13%)
Query: 30 SVKY--RYAGRERSLSLLKEHDARRQQRILAGVDLPLGGSS-RPDGVG-LYYAKIGIGTP 85
SV+Y A R+R L RR + AG+ G S+ R +G L+Y I +GTP
Sbjct: 57 SVEYYAELADRDRFLR------GRRLSQFDAGLAFSDGNSTFRISSLGFLHYTTIELGTP 110
Query: 86 PKDYYVQVDTGSDIMWVNCIQCKECP--------RRSSLGIELTLYDIKDSSTGKFVTCD 137
+ V +DTGSD+ WV C C C + +L++Y+ SST K VTC+
Sbjct: 111 GVKFMVALDTGSDLFWVPC-DCTRCSATRSSAFASALASDFDLSVYNPNGSSTSKKVTCN 169
Query: 138 QEFCHGVYGGPLTDCTANTS-CPYLEIYGDG-SSTTGYFVQDVVQYDKVSGDLQTTSTNG 195
C C S CPY+ Y +ST+G V+DV+ + + N
Sbjct: 170 NSLCTH-----RNQCLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTQPDDNHDLVEAN- 223
Query: 196 SLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGG 255
+IFGCG QSG+ + A +G+ G G S+ S L+ G F+ C G +G G
Sbjct: 224 -VIFGCGQVQSGSF--LDVAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCF-GRDGIG 279
Query: 256 IFAIGHVVQPEVNKTP--LVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSG 313
+ G + ++TP + P+ P Y+I + V+VG +++ + DSG
Sbjct: 280 RISFGDKGSLDQDETPFNVNPSHPTYNITINQVRVGTTLIDV---------EFTALFDSG 330
Query: 314 TTLAYLPEMVYEPLVSKI 331
T+ YL + Y L +
Sbjct: 331 TSFTYLVDPTYSRLSESV 348
>gi|413953792|gb|AFW86441.1| hypothetical protein ZEAMMB73_342504 [Zea mays]
Length = 459
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 88/312 (28%), Positives = 135/312 (43%), Gaps = 36/312 (11%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y +G+GTP + +DTGSD+ WV QC C + + L+D SST +
Sbjct: 120 YVVTVGLGTPAVSQVLLIDTGSDLSWV---QCAPCNSTTCYPQKDPLFDPSRSSTYAPIP 176
Query: 136 CDQEFCHGV----YGGPLTDCTANT----SCPYLEIYGDGSSTTGYFVQDVVQYDKVSGD 187
C+ + C + YG +DCT+ + C Y YGDGS TTG + + +
Sbjct: 177 CNTDACRDLTRDGYG---SDCTSGSGGGAQCGYAITYGDGSQTTGVYSNETLT------- 226
Query: 188 LQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHC 247
+ T FGCG Q G D DG++G G + S++ Q +S G F++C
Sbjct: 227 MAPGVTVKDFHFGCGHDQDGPNDK-----YDGLLGLGGAPESLVVQTSSVYG--GAFSYC 279
Query: 248 LDGING-GGIFAIGHVVQPEVN--KTPLV-PNQPHYSINMTAVQVGLDFLNLPTDVFGVG 303
L N G A+G V TP+V Q Y +NMT + VG + +++P F
Sbjct: 280 LPAANDQAGFLALGAPVNDASGFVFTPMVREQQTFYVVNMTGITVGGEPIDVPPSAF--- 336
Query: 304 DNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNV 363
+ G IIDSGT + L Y L + + + TC+ ++ + P V
Sbjct: 337 -SGGMIIDSGTVVTELQHTAYAALQAAFRKAMAAYPLLPNGELDTCYNFTGHSNVTVPRV 395
Query: 364 TFHFENSVSLKV 375
F ++ +
Sbjct: 396 ALTFSGGATVDL 407
>gi|449434468|ref|XP_004135018.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 568
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 94/309 (30%), Positives = 138/309 (44%), Gaps = 35/309 (11%)
Query: 70 PDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPR--RSSLGIELTL--YDI 125
PD LYYA + +GTP D+ V +DTGSD+ W+ C +C C +S G + L Y
Sbjct: 98 PDLGFLYYANVSVGTPSLDFLVALDTGSDLFWLPC-ECSSCFTYLNTSNGGKFMLNHYSP 156
Query: 126 KDSSTGKFVTCDQEFCHGVYGGPLTDCTANTS-CPY-LEIYGDGSSTTGYFVQDVVQYDK 183
DS+T V C C+ CT+N + CPY + +S+ GY V+DV+
Sbjct: 157 NDSTTSSTVPCTSSLCN--------RCTSNQNVCPYEMRYLSANTSSIGYLVEDVLHL-- 206
Query: 184 VSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKM 243
+ D + FGCG Q+G +T A +G+IG G S+ S LA G
Sbjct: 207 ATDDSLLKPVEAKITFGCGTVQTGIFATT--AAPNGLIGLGMEKISVPSFLADQGLTSNS 264
Query: 244 FAHCLDGINGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMT--AVQVGLDFLNLPTDVFG 301
F+ C G +G G G + +TP + S N+T + VG + P DV
Sbjct: 265 FSMCF-GADGYGRIDFGDTGPADQKQTPFNTMLEYQSYNVTFNVINVGGE----PNDV-- 317
Query: 302 VGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEG-- 359
I DSGT+ YL E Y ++K + LK +++ F+Y + G
Sbjct: 318 ---PFTAIFDSGTSFTYLTEPAYS-TITKQMDAGMKLKRYSLFGPNFPFEYCYEIPPGAK 373
Query: 360 -FPNVTFHF 367
F +T +F
Sbjct: 374 EFQYLTLNF 382
>gi|147811402|emb|CAN61225.1| hypothetical protein VITISV_006732 [Vitis vinifera]
Length = 440
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 110/424 (25%), Positives = 172/424 (40%), Gaps = 60/424 (14%)
Query: 1 MGLCLRNCLCIVLIATAAVGGVSSNHGVFSVKYRYAGRERS-LSLLKEHDARRQQRILAG 59
+GL + + ++ A + +G FS+ + +S L E A R R
Sbjct: 7 LGLLFSIVIALSFVSVAHISAAEVKNGRFSIDLIHRDSPKSPLYNPSETPAERLDRFFRR 66
Query: 60 VDLPLGGSSRPD--------GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECP 111
S P+ G Y KI IGTPP D Y DTGSD+MW C+ C C
Sbjct: 67 FMSFSEASISPNTPEPPVSSNNGEYLMKISIGTPPFDVYGIYDTGSDLMWTQCLPCLSCY 126
Query: 112 RRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTS----CPYLEIYGDG 167
++ + ++D S++ K V+C+ + C L D + + C + YGDG
Sbjct: 127 KQKN-----PMFDPSKSTSFKEVSCESQQCR------LLDTVSCSQPQKLCDFSYGYGDG 175
Query: 168 SSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSN 227
S G + + + SG + +++FGCG SG NE + G+ G G
Sbjct: 176 SLAQGVIATETLTLNSNSGQPXSIX---NIVFGCGHNNSGTF---NENEM-GLFGTGGRP 228
Query: 228 SSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHVVQPE-------VNKTPLVP--NQPH 278
S+ SQ+ S+ G + F+ CL + PE V TPLV + +
Sbjct: 229 LSLTSQIMSTLGSGRKFSQCLVPFRTDPSITSKIIFGPEAEVSGSXVVSTPLVTKDDPTY 288
Query: 279 YSINMTAVQVGLDFLNLPTDVFGVGDNKGTI-IDSGTTLAYLPEMVYEPLVSKIIS---- 333
Y + + + VG P KG + ID+GT LP Y LV +
Sbjct: 289 YFVTLDGISVGDKL--FPFSSSSPMATKGNVFIDAGTPPTLLPRDFYNRLVQGVKEAIPM 346
Query: 334 ---QQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENS-VSLKVYPHEYLFPFEDLWC 389
Q PDL+ C++ + +D P +T HF+ + V LK + ++ P E ++C
Sbjct: 347 EPVQDPDLQPQ------LCYRSATLIDG--PILTAHFDGADVQLKPL-NTFISPKEGVYC 397
Query: 390 IGWQ 393
Q
Sbjct: 398 FAMQ 401
>gi|326514838|dbj|BAJ99780.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 430
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 84/319 (26%), Positives = 131/319 (41%), Gaps = 31/319 (9%)
Query: 73 VGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNC-IQCKECPRRSSLGIELTLYDIKDSSTG 131
+G YY + IG P K Y++ VDTGSD+ W+ C C+ C + + +
Sbjct: 70 IGHYYVTMNIGDPAKPYFLDVDTGSDLTWLQCDAPCQSCNK--------VPHPWYKPTKN 121
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
K V C C + P C C Y Y D +S+ G + D + L+ +
Sbjct: 122 KIVPCAASLCTSL--TPNKKCAVPQQCDYQIKYTDKASSLGVLIAD-----NFTLSLRNS 174
Query: 192 ST-NGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDG 250
ST +L FGCG Q + + A DG++G GK S++SQL G + + HC
Sbjct: 175 STVRANLTFGCGYDQQVGKNGAVQAATDGLLGLGKGAVSLLSQLKQQGVTKNVLGHCFS- 233
Query: 251 INGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTII 310
NGGG G + P ++ VP S N + G + D +G ++
Sbjct: 234 TNGGGFLFFGDDIVP-TSRVTWVPMARTTSGNYYSPGSGTLYF----DRRSLGMKPMEVV 288
Query: 311 -DSGTTLAYLPEMVYEPLV-------SKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPN 362
DSG+T AY Y+ V SK + + D+ + F+ V F +
Sbjct: 289 FDSGSTYAYFAAEPYQATVSALKAGLSKSLKEVSDVSLPLCWKGQKVFKSVSEVKNDFKS 348
Query: 363 VTFHFENSVSLKVYPHEYL 381
+ F + +++ P YL
Sbjct: 349 LFLSFGKNSVMEIPPENYL 367
>gi|388502342|gb|AFK39237.1| unknown [Lotus japonicus]
Length = 440
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 96/310 (30%), Positives = 129/310 (41%), Gaps = 44/310 (14%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKF 133
G Y +I IGTP + DTGSD+ WV QC C LYD +SST
Sbjct: 94 GNYLMRIYIGTPSVERLAIADTGSDLTWV---QCSPCDNTKCFAQNTPLYDPLNSSTFTL 150
Query: 134 VTCDQEFCHGVYGGPLTD--CTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
+ CD + C + P + C+ C Y YGD S + G D ++ L
Sbjct: 151 LPCDSQPCTQL---PYSQYVCSDYGDCIYAYTYGDNSYSYGGLSSDSIRL-----MLLQL 202
Query: 192 STNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL--- 248
N + FGCG + D + + GI+G G S++SQL G + F++CL
Sbjct: 203 HYNSKICFGCGFQNKFTADKSGKTT--GIVGLGAGPLSLVSQLGDEIGHK--FSYCLLPF 258
Query: 249 -DGINGGGIFAIGHVVQPE-VNKTPLV--PNQPHYSINMTAVQVGLDFLNLPTDVFGVGD 304
N F +VQ V TPL+ P+ P Y +N+ + VG G
Sbjct: 259 SSNSNSKLKFGEAAIVQGNGVVSTPLIIKPDLPFYYLNLEGITVG-------AKTVKTGQ 311
Query: 305 NKGT-IIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYT------CFQYSESVD 357
G IIDSG+TL YL E Y VS + V D+Y CF Y E +
Sbjct: 312 TDGNIIIDSGSTLTYLEESFYNEFVSLVKE-----TVAVEEDQYIPYPFDFCFTYKEGMS 366
Query: 358 EGFPNVTFHF 367
P+V FHF
Sbjct: 367 TP-PDVVFHF 375
>gi|125556778|gb|EAZ02384.1| hypothetical protein OsI_24487 [Oryza sativa Indica Group]
Length = 551
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 90/280 (32%), Positives = 134/280 (47%), Gaps = 32/280 (11%)
Query: 69 RPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSL-------GIELT 121
R DG L+YA++ +GTP + V +DTGSD+ WV C CK+C +L G EL
Sbjct: 99 RLDG-SLHYAEVAVGTPNTTFLVALDTGSDLFWVPC-DCKQCAPLGNLTAVDGGGGPELR 156
Query: 122 LYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYG-DGSSTTGYFVQDVVQ 180
Y SST K VTC C P TA +SCPY Y +S++G V+DV+
Sbjct: 157 QYSPSKSSTSKTVTCASNLCD----QPNACATATSSCPYAVRYAMANTSSSGELVEDVLY 212
Query: 181 YDK---VSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASS 237
+ + + ++FGCG Q+G+ + A DG++G G S+ S LAS+
Sbjct: 213 LTREKGAAAAAAGAAVRTPVVFGCGQVQTGSF--LDGAAADGLMGLGMEKVSVPSILAST 270
Query: 238 GGVR-KMFAHCLDGINGGGIFAIGHVVQPEVNKTPLVPNQPH--YSINMTAVQVGLDFLN 294
G V+ F+ C +G G G + ++TP + H Y+I++T++ VG N
Sbjct: 271 GVVKSNSFSMCFSK-DGLGRINFGDTGSADQSETPFIVKSTHSYYNISITSMSVGDK--N 327
Query: 295 LPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQ 334
LP + I DSGT+ YL + Y + +Q
Sbjct: 328 LPLGFYA-------IADSGTSFTYLNDPAYTAYTTNFNAQ 360
>gi|356536463|ref|XP_003536757.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 475
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 90/335 (26%), Positives = 145/335 (43%), Gaps = 35/335 (10%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y+ +IG+G+PP++ YV +D+GSDI+WV C C +C +S +++ DSS+
Sbjct: 132 GSGEYFVRIGVGSPPRNQYVVMDSGSDIIWVQCEPCTQCYHQSD-----PVFNPADSSSF 186
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
V+C C V + C Y YGDGS T G + + + +
Sbjct: 187 SGVSCASTVCSHVDNAACHE----GRCRYEVSYGDGSYTKGTLALETITFGR-------- 234
Query: 192 STNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL--D 249
+ ++ GCG G G S + QL G F++CL
Sbjct: 235 TLIRNVAIGCGHHNQGMFVGAAGLLGLGGGPM-----SFVGQLGGQTG--GAFSYCLVSR 287
Query: 250 GINGGGIFAIGHVVQPE-VNKTPLVPN---QPHYSINMTAVQVGLDFLNLPTDVFGVGD- 304
GI G+ G P PL+ N Q Y I ++ + VG +++ DVF + +
Sbjct: 288 GIESSGLLEFGREAMPVGAAWVPLIHNPRAQSFYYIGLSGLGVGGLRVSISEDVFKLSEL 347
Query: 305 -NKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDL-KVHTVHDEYTCFQYSESVDEGFPN 362
+ G ++D+GT + LP + YE I+Q +L + V TC+ V P
Sbjct: 348 GDGGVVMDTGTAVTRLPTVAYEAFRDGFIAQTTNLPRASGVSIFDTCYDLFGFVSVRVPT 407
Query: 363 VTFHFENSVSLKVYPHEYLFPFEDL--WCIGWQNS 395
V+F+F L + +L P +D+ +C + S
Sbjct: 408 VSFYFSGGPILTLPARNFLIPVDDVGTFCFAFAPS 442
>gi|125540928|gb|EAY87323.1| hypothetical protein OsI_08727 [Oryza sativa Indica Group]
Length = 463
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 86/317 (27%), Positives = 132/317 (41%), Gaps = 34/317 (10%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y +G+GTP V +DTGSD+ WV C C P + G L+D SST + V+
Sbjct: 127 YVISVGLGTPAVTQTVTIDTGSDVSWVQCNPCPNPPCHAQTG---ALFDPAKSSTYRAVS 183
Query: 136 CDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNG 195
C C + N C Y YGDGS+T G + +D + S ++
Sbjct: 184 CAAAECAQLEQQGNGCGATNYECQYGVQYGDGSTTNGTYSRDTLTLSGASDAVK------ 237
Query: 196 SLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGG- 254
FGC +SG D T DG++G G S++SQ A++ G F++CL +G
Sbjct: 238 GFQFGCSHLESGFSDQT-----DGLMGLGGGAQSLVSQTAAAYG--NSFSYCLPPTSGSS 290
Query: 255 ------GIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGT 308
G V + ++ +P Y + + VG L L VF G+
Sbjct: 291 GFLTLGGGGGASGFVTTRMLRSKQIPT--FYGARLQDIAVGGKQLGLSPSVFAA----GS 344
Query: 309 IIDSGTTLAYLPEMVYEPLVSKI---ISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTF 365
++DSGT + LP Y L S + Q ++ D TCF ++ P V
Sbjct: 345 VVDSGTIITRLPPTAYSALSSAFKAGMKQYRSAPARSILD--TCFDFAGQTQISIPTVAL 402
Query: 366 HFENSVSLKVYPHEYLF 382
F ++ + P+ ++
Sbjct: 403 VFSGGAAIDLDPNGIMY 419
>gi|224080963|ref|XP_002306246.1| predicted protein [Populus trichocarpa]
gi|222855695|gb|EEE93242.1| predicted protein [Populus trichocarpa]
Length = 382
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 90/335 (26%), Positives = 144/335 (42%), Gaps = 35/335 (10%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y+ +IG+G+PP+ Y+ +D+GSDI+WV C C +C ++ L+D DS++
Sbjct: 39 GSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCKPCTQCYHQTD-----PLFDPADSASF 93
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
V+C C V C + C Y YGDGSST G + + + T
Sbjct: 94 MGVSCSSAVCDQVDN---AGCNSG-RCRYEVSYGDGSSTKGTLALETLTLGR------TV 143
Query: 192 STNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDG- 250
N + GCG G G + S + QL+ G F++CL
Sbjct: 144 VQN--VAIGCGHMNQGMFVGAAGLLGL-----GGGSMSFVGQLSRERG--NAFSYCLVSR 194
Query: 251 -INGGGIFAIGHVVQP-EVNKTPLV--PNQP-HYSINMTAVQVGLDFLNLPTDVFGVGD- 304
N G G P PL+ P+ P +Y I ++ + VG + + D+F + +
Sbjct: 195 VTNSNGFLEFGSEAMPVGAAWIPLIRNPHSPSYYYIGLSGLGVGDMKVPISEDIFELTEL 254
Query: 305 -NKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDL-KVHTVHDEYTCFQYSESVDEGFPN 362
N G ++D+GT + P + YE I Q +L + V TC+ + P
Sbjct: 255 GNGGVVMDTGTAVTRFPTVAYEAFRDAFIDQTGNLPRASGVSIFDTCYNLFGFLSVRVPT 314
Query: 363 VTFHFENSVSLKVYPHEYLFPFEDL--WCIGWQNS 395
V+F+F L + + +L P +D +C + S
Sbjct: 315 VSFYFSGGPILTLPANNFLIPVDDAGTFCFAFAPS 349
>gi|218186522|gb|EEC68949.1| hypothetical protein OsI_37668 [Oryza sativa Indica Group]
Length = 421
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 98/363 (26%), Positives = 146/363 (40%), Gaps = 56/363 (15%)
Query: 47 EHDARRQQRILAGVD-----LPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMW 101
+ AR + AG + PL G P G LYY + IG PP+ Y++ VDTGSD+ W
Sbjct: 26 DRPARGGLSVTAGAEESSAVFPLYGDVYPHG--LYYVAMSIGNPPRPYFLDVDTGSDLTW 83
Query: 102 VN----CIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLT---DC-T 153
+ C+ C + P + + + K V C + C ++GG LT C +
Sbjct: 84 LQCDAPCVSCSKVP-----------HPLYRPTKNKLVPCVDQMCAALHGG-LTGRHKCDS 131
Query: 154 ANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTN 213
C Y Y D S+ G V D + + L FGCG Q ST
Sbjct: 132 PKQQCDYEIKYADQGSSLGVLVTDSFALRLANSSI----VRPGLAFGCGYDQQVG-SSTE 186
Query: 214 EEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHVVQPEVNKT--P 271
A DG++G G + S++SQL G + + HCL GGG G + P T P
Sbjct: 187 VSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCLS-TRGGGFLFFGDDIVPYSRATWAP 245
Query: 272 LV--PNQPHYSINMTAVQVGLDFLNL-PTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLV 328
+ ++ +YS + G L + P +V + DSG++ Y Y+ LV
Sbjct: 246 MARSTSRNYYSPGSANLYFGGRPLGVRPMEV---------VFDSGSSFTYFSAQPYQALV 296
Query: 329 -------SKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFEN--SVSLKVYPHE 379
SK + + PD + F+ V + F V F N +++ P
Sbjct: 297 DAIKGDLSKNLKEVPDHSLPLCWKGKKPFKSVLDVKKEFKTVVLSFSNGKKALMEIPPEN 356
Query: 380 YLF 382
YL
Sbjct: 357 YLI 359
>gi|357143850|ref|XP_003573078.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 431
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 105/393 (26%), Positives = 168/393 (42%), Gaps = 48/393 (12%)
Query: 10 CIVLIATAAVGGVSSNHGVFSVKYRYAGRERSLSLLKEHDARRQQRILAGVDLPLGGSSR 69
C+VL+ + AV S + G ++ L++ R + R L+G D S R
Sbjct: 14 CLVLLTSLAVSASSGYRLALTHVDSKIGLTKT-ELMRRAAHRSRLRALSGYD---ANSPR 69
Query: 70 PDGVGL-YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDS 128
V + Y ++ IGTPP + DTGSD+ W C CK C + +YD S
Sbjct: 70 LHSVQVEYLMELAIGTPPVPFVALADTGSDLTWTQCQPCKLC-----FPQDTPVYDPSAS 124
Query: 129 STGKFVTCDQEFCHGVYGGPLTDCTANTS-CPYLEIYGDGSSTTGYFVQDVVQY-DKVSG 186
ST V C C V +C+ +S C Y Y DG+ + G + + V G
Sbjct: 125 STFSPVPCSSATCLPVLRS--RNCSTPSSLCRYGYSYSDGAYSAGILGTETLTLGSSVPG 182
Query: 187 DLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAH 246
+ S + FGCG G DS N G +G G+ S+++QL GV K F++
Sbjct: 183 QAVSVS---DVAFGCGTDNGG--DSLNST---GTVGLGRGTLSLLAQL----GVGK-FSY 229
Query: 247 CLDGINGGGI---FAIGHVVQP-----EVNKTPLVP---NQPHYSINMTAVQVGLDFLNL 295
CL + F +G + + V TPL+ N Y +++ + +G L +
Sbjct: 230 CLTDFFNSTLDSPFLLGTLAELAPGPGAVQSTPLLQSPLNPSRYVVSLQGITLGDVRLPI 289
Query: 296 PTDVFGVGDNK--GTIIDSGTTLAYLPEMVYEPL---VSKIISQQPDLKVHTVHDEYTCF 350
P F + N G ++DSGTT + LPE + + V++++ Q P V+ + CF
Sbjct: 290 PNKTFDLHANSTGGMVVDSGTTFSILPESGFRVVVDHVAQVLGQPP---VNASSLDSPCF 346
Query: 351 QYS--ESVDEGFPNVTFHFENSVSLKVYPHEYL 381
E P++ HF ++++ Y+
Sbjct: 347 PAPAGERQLPFMPDLVLHFAGGADMRLHRDNYM 379
>gi|297807039|ref|XP_002871403.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297317240|gb|EFH47662.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 529
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 103/366 (28%), Positives = 165/366 (45%), Gaps = 46/366 (12%)
Query: 44 LLKEHDARRQQRILAG---VDLPLGGS----SRPDGVGLYYAKIGIGTPPKDYYVQVDTG 96
LL + D RRQ+ L +P GS S D L+Y I IGTP + V +DTG
Sbjct: 61 LLAKSDFRRQRMNLGAKFQSLVPSEGSKTISSGNDFGWLHYTWIDIGTPSVSFLVALDTG 120
Query: 97 SDIMWV--NCIQCKECPRR--SSLGI-ELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTD 151
SD++W+ NC+QC SSL +L Y+ SS+ K C + C G +D
Sbjct: 121 SDLLWIPCNCVQCAPLTSTYYSSLATKDLNEYNPSSSSSSKVFLCSHKLC-----GSASD 175
Query: 152 C-TANTSCPYLEIYGDG-SSTTGYFVQDVVQYDKVSGDLQ---TTSTNGSLIFGCGARQS 206
C + C Y Y G +S++G V+D++ + + ++S ++ GCG +QS
Sbjct: 176 CDSPKEQCTYTVKYLSGNTSSSGLLVEDILHLTYNTNNRLMNGSSSVKARVVVGCGKKQS 235
Query: 207 GNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFA--IGHVVQ 264
G D + A DG++G G + S+ S L+ +G +R F+ C D + G I+ +G +Q
Sbjct: 236 G--DYLDGVAPDGLMGLGPAEISVPSFLSKAGLMRNSFSLCFDEEDSGRIYFGDMGPSIQ 293
Query: 265 PEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVY 324
L N Y + + A +G L + T IDSG + YLPE +Y
Sbjct: 294 QSAPFLQL-ENNSGYIVGVEACCIGNSCLKQTSFT--------TFIDSGQSFTYLPEEIY 344
Query: 325 EPLVSKIISQQPDLKVHTVHDEYTC----FQYSESVDEGFPNVTFHFENSVSLKVYPHEY 380
+ +I D ++ + + Y SV+ P + F ++ + + H+
Sbjct: 345 RKVALEI-----DRHINATSKSFEGVSWEYCYESSVEPKVPAIKLKFSHNNTFVI--HKP 397
Query: 381 LFPFED 386
LF F+
Sbjct: 398 LFVFQQ 403
>gi|18855042|gb|AAL79734.1|AC091774_25 putative chloroplast nucleoid DNA-binding protein [Oryza sativa
Japonica Group]
gi|54291046|dbj|BAD61723.1| aspartic proteinase nepenthesin II-like [Oryza sativa Japonica
Group]
gi|125598520|gb|EAZ38300.1| hypothetical protein OsJ_22678 [Oryza sativa Japonica Group]
Length = 551
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 90/280 (32%), Positives = 134/280 (47%), Gaps = 32/280 (11%)
Query: 69 RPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSL-------GIELT 121
R DG L+YA++ +GTP + V +DTGSD+ WV C CK+C +L G EL
Sbjct: 99 RLDG-SLHYAEVAVGTPNTTFLVALDTGSDLFWVPC-DCKQCAPLGNLTAVDGGGGPELR 156
Query: 122 LYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYG-DGSSTTGYFVQDVVQ 180
Y SST K VTC C P TA +SCPY Y +S++G V+DV+
Sbjct: 157 QYSPSKSSTSKTVTCASNLCD----QPNACATATSSCPYAVRYAMANTSSSGELVEDVLY 212
Query: 181 YDK---VSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASS 237
+ + + ++FGCG Q+G+ + A DG++G G S+ S LAS+
Sbjct: 213 LTREKGAAAAAAGAAVRTPVVFGCGQVQTGSF--LDGAAADGLMGLGMEKVSVPSILAST 270
Query: 238 GGVR-KMFAHCLDGINGGGIFAIGHVVQPEVNKTPLVPNQPH--YSINMTAVQVGLDFLN 294
G V+ F+ C +G G G + ++TP + H Y+I++T++ VG N
Sbjct: 271 GVVKSNSFSMCFSK-DGLGRINFGDTGSADQSETPFIVKSTHSYYNISITSMSVGDK--N 327
Query: 295 LPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQ 334
LP + I DSGT+ YL + Y + +Q
Sbjct: 328 LPLGFYA-------IADSGTSFTYLNDPAYTAYTTNFNAQ 360
>gi|218202547|gb|EEC84974.1| hypothetical protein OsI_32231 [Oryza sativa Indica Group]
Length = 513
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 75/267 (28%), Positives = 125/267 (46%), Gaps = 27/267 (10%)
Query: 75 LYYAKIGIGTPPKDYYVQVDTGSDIMWV--NCIQCKECPRRSSLGIELTLYDIKDSSTGK 132
L+YA + +GTP + V +DTGSD+ WV +C++C + ++ +Y S+T +
Sbjct: 98 LHYAVVALGTPNVTFLVALDTGSDLFWVPCDCLKCAPLQSPNYGSLKFDVYSPAQSTTSR 157
Query: 133 FVTCDQEFCHGVYGGPLTDC--TANTSCPY-LEIYGDGSSTTGYFVQDVVQYDKVSGDLQ 189
V C C L + + + SCPY ++ D +S++G V+DV+ S Q
Sbjct: 158 KVPCSSNLCD------LQNACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYL--TSDSAQ 209
Query: 190 TTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLD 249
+ ++FGCG Q+G+ A +G++G G + S+ S LAS G F+ C
Sbjct: 210 SKIVTAPIMFGCGQVQTGSF--LGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCF- 266
Query: 250 GINGGGIFAIGHVVQPEVNKTPL--VPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKG 307
G +G G G + +TPL P+Y+I +T + VG +
Sbjct: 267 GDDGHGRINFGDTGSSDQKETPLNVYKQNPYYNITITGITVGSK---------SISTEFS 317
Query: 308 TIIDSGTTLAYLPEMVYEPLVSKIISQ 334
I+DSGT+ L + +Y + S +Q
Sbjct: 318 AIVDSGTSFTALSDPMYTQITSSFDAQ 344
>gi|21805926|gb|AAM76716.1| nucellin-like aspartic protease [Zea mays]
Length = 357
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 82/312 (26%), Positives = 136/312 (43%), Gaps = 28/312 (8%)
Query: 82 IGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFC 141
IG P K Y++ VDTGSD+ W+ C + P RS + LY + + V C C
Sbjct: 1 IGNPAKPYFLDVDTGSDLTWLQC----DAPCRSCNKVPHPLYR---PTANRLVPCANALC 53
Query: 142 HGVYGGPLTD--CTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIF 199
++ G ++ C + C Y Y D +S+ G + D S +++++ L F
Sbjct: 54 TALHSGQGSNNKCPSPKQCDYQIKYTDSASSQGVLIND-----SFSLPMRSSNIRPGLTF 108
Query: 200 GCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAI 259
GCG Q + + A+DG++G G+ + S++SQL G + + HCL NGGG
Sbjct: 109 GCGYDQQVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVGHCLS-TNGGGFLFF 167
Query: 260 GHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYL 319
G V P ++ VP S N + G + + + GV + + DSG+T Y
Sbjct: 168 GDDVVPS-SRVTWVPMAQRTSGNYYSPGSGTLYFDRRS--LGVKPME-VVFDSGSTYTYF 223
Query: 320 PEMVYEPLV-------SKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENS-- 370
Y+ +V SK + Q D + F+ V F ++ F ++
Sbjct: 224 TAQPYQAVVSALKGGLSKSLKQVSDPTLPLCWKGQKAFKSVFDVKNEFKSMFLSFASAKN 283
Query: 371 VSLKVYPHEYLF 382
++++ P YL
Sbjct: 284 AAMEIPPENYLI 295
>gi|449459186|ref|XP_004147327.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 418
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 86/301 (28%), Positives = 130/301 (43%), Gaps = 39/301 (12%)
Query: 43 SLLKEHDARRQQRILA-----------GVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYV 91
S K+ R++ IL+ + LPL G+ P+G Y + +G PPK Y++
Sbjct: 15 SFFKDKPWERKRPILSVPTASSSFASSSIVLPLQGNVYPNG--FYNVTLYVGQPPKPYFL 72
Query: 92 QVDTGSDIMWVNC-IQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLT 150
DTGSD+ W+ C C++C TL+ + S V C C ++
Sbjct: 73 DPDTGSDLTWLQCDAPCQQCTE--------TLHPLYQPSN-DLVPCKDPLCMSLHSSMDH 123
Query: 151 DCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLD 210
C C Y Y DG S+ G V+DV + +GD L GCG Q +
Sbjct: 124 RCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGD----PIRPRLALGCGYDQ--DPG 177
Query: 211 STNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHVVQP-EVNK 269
S++ +DGI+G G+ S++SQL + G VR + HC + GG +F + P +
Sbjct: 178 SSSYHPMDGILGLGRGAVSIVSQLHNQGIVRNVVGHCFNSKGGGYLFFGDGIYDPYRLVW 237
Query: 270 TPLVPNQP-HYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLV 328
TP+ + P HYS L F T + N + DSG++ Y Y+ L
Sbjct: 238 TPMSRDYPKHYSPGFGE----LIFNGRSTGL----RNLFVVFDSGSSYTYFNAQAYQVLT 289
Query: 329 S 329
S
Sbjct: 290 S 290
>gi|356557203|ref|XP_003546907.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 470
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 91/316 (28%), Positives = 147/316 (46%), Gaps = 44/316 (13%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y +G+G+ ++ V VDTGSD+ WV C C+ C ++ L+ S + + +
Sbjct: 122 YIVTMGLGS--QNMSVIVDTGSDLTWVQCEPCRSCYNQNG-----PLFKPSTSPSYQPIL 174
Query: 136 CDQEFCHGV-YGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTN 194
C+ C + G +D + + +C Y+ YGDGS T+G + + + +S
Sbjct: 175 CNSTTCQSLELGACGSDPSTSATCDYVVNYGDGSYTSGELGIEKLGFGGISVS------- 227
Query: 195 GSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQL-ASSGGVRKMFAHCL---DG 250
+ +FGCG G + G++G G+S SMISQ A+ GGV F++CL D
Sbjct: 228 -NFVFGCGRNNKGLFGGAS-----GLMGLGRSELSMISQTNATFGGV---FSYCLPSTDQ 278
Query: 251 INGGGIFAIGHVVQPEVNKTP-----LVPN---QPHYSINMTAVQVGLDFLNLPTDVFGV 302
G +G+ N TP ++PN Y +N+T + VG L++ FG
Sbjct: 279 AGASGSLVMGNQSGVFKNVTPIAYTRMLPNLQLSNFYILNLTGIDVGGVSLHVQASSFG- 337
Query: 303 GDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQ---QPDLKVHTVHDEYTCFQYSESVDEG 359
N G I+DSGT ++ L VY+ L +K + Q P ++ D TCF +
Sbjct: 338 --NGGVILDSGTVISRLAPSVYKALKAKFLEQFSGFPSAPGFSILD--TCFNLTGYDQVN 393
Query: 360 FPNVTFHFENSVSLKV 375
P ++ +FE + L V
Sbjct: 394 IPTISMYFEGNAELNV 409
>gi|357520119|ref|XP_003630348.1| Aspartic proteinase Asp1 [Medicago truncatula]
gi|355524370|gb|AET04824.1| Aspartic proteinase Asp1 [Medicago truncatula]
Length = 435
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 91/344 (26%), Positives = 137/344 (39%), Gaps = 47/344 (13%)
Query: 58 AGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNC----IQCKECPRR 113
+ + P+ G+ P VG Y + IG PP+ Y++ VDTGS++ W+ C QC E P
Sbjct: 58 SSIVFPIYGNVYP--VGFYNVTLNIGQPPRPYFLDVDTGSELTWLQCDAPCSQCSETPH- 114
Query: 114 SSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGY 173
LY + F+ C C + C C Y Y D ST G
Sbjct: 115 -------PLY----KPSNDFIPCKDPLCASLQPTDDYTCEDPNQCDYEIKYADQYSTLGV 163
Query: 174 FVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQ 233
+ DV + +G + GCG Q + LDGI+G G+ +S+ISQ
Sbjct: 164 LLNDVYLLNFTNG----VQLKVRMALGCGYDQI--FSPSTYHPLDGILGLGRGKASLISQ 217
Query: 234 LASSGGVRKMFAHCLDGINGGGIFAIGHVVQPEVNKTPL--VPNQPHYSINMTAVQVGLD 291
L S G VR + HCL GG IF ++ TP+ + + HYS + G
Sbjct: 218 LNSQGLVRNVMGHCLSSRGGGYIFFGNVYDSSRMSWTPISSIDSGKHYSAGPAELVFG-- 275
Query: 292 FLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTC-- 349
GVG + I D+G++ Y Y+ ++S + + + D+ T
Sbjct: 276 -----GRKTGVG-SLNIIFDTGSSYTYFNSQAYQAMISLLNKELHRKPIKAAPDDQTLPM 329
Query: 350 -------FQYSESVDEGFPNVTFHFENSVSLK----VYPHEYLF 382
F+ V + F +T F N +K + P YL
Sbjct: 330 CWHGKRPFRSINEVKKYFKPLTLSFTNGGRVKPQFEIPPEAYLI 373
>gi|414584780|tpg|DAA35351.1| TPA: hypothetical protein ZEAMMB73_696016 [Zea mays]
Length = 524
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 95/369 (25%), Positives = 153/369 (41%), Gaps = 61/369 (16%)
Query: 53 QQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPR 112
+ ++++G+D +G G Y ++ +G+PP + Y+ VD+GSD+MWV C C EC
Sbjct: 157 ESKVVSGLD---------EGSGEYLVRVSVGSPPTEQYLVVDSGSDVMWVQCKPCLECYV 207
Query: 113 RSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANT--SCPYLEIYGDGSST 170
++ L+D S+T V+C C + P + C C Y Y DGS T
Sbjct: 208 QAD-----PLFDPATSATFSGVSCGSAICRIL---PTSACGDGELGGCEYEVSYADGSYT 259
Query: 171 TGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSM 230
G + + L T+ G ++ GCG R G G++G G S+
Sbjct: 260 KGALALETLT-------LGGTAVEG-VVIGCGHRNRGLFVGAA-----GLMGLGWGPMSL 306
Query: 231 ISQLASSGGVRKMFAHCLDGINGGG-----------IFAIGHVVQPEVNKTPLV--PNQP 277
+ QL G V F++CL G G + V PLV P P
Sbjct: 307 VGQLG--GEVGGAFSYCLASRGGYGSGAADDDAGWLVLGRSEAVPEGAVWVPLVRNPRAP 364
Query: 278 H-YSINMTAVQVGLDFLNLPTDVF-----GVGDNKGTIIDSGTTLAYLPEMVYEPL---- 327
Y + ++ ++VG + L L +F G GD ++D+GTT+ LP+ Y L
Sbjct: 365 SFYYVGLSGIEVGDERLPLQAGLFQLTEDGAGD---VVMDTGTTVTRLPQEAYAALRDAF 421
Query: 328 VSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFE-D 386
V + P + + TC+ S P V+F F+ L + L +
Sbjct: 422 VGALAGAVPRAQGVSSSVLDTCYDLSGYASVRVPTVSFCFDGDARLILAARNVLLEVDMG 481
Query: 387 LWCIGWQNS 395
++C+ + S
Sbjct: 482 IYCLAFAPS 490
>gi|357457681|ref|XP_003599121.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355488169|gb|AES69372.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 439
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 114/388 (29%), Positives = 171/388 (44%), Gaps = 49/388 (12%)
Query: 35 YAGRERSLSLLKE---HDARRQQRI-----LAGVDLPLGGSSRPDGVGLYYAKIGIGTPP 86
Y RE L + H +R + L+ DLP + P Y IGTPP
Sbjct: 42 YNIRETQLQRISNVVTHSIKRAHYLNHVFSLSHNDLP-KPTIIPYAGSYYVMSYSIGTPP 100
Query: 87 KDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYG 146
Y VDTGSD +W C CK C ++S +++ SST K + C C
Sbjct: 101 FQLYGVVDTGSDGIWFQCKPCKPCLNQTS-----PIFNPSKSSTYKNIRCSSPICK---R 152
Query: 147 GPLTDCTAN--TSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGAR 204
G T C++N C Y Y D S + G +D + + G + + ++ GCG +
Sbjct: 153 GEKTRCSSNRKRKCEYEITYLDRSGSQGDISKDTLTLNSNDG---SPISFPKIVIGCGHK 209
Query: 205 QSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL------DGINGGGIFA 258
S T E GIIGFG+ N S++SQL SS G + F++CL I+ F
Sbjct: 210 NS----LTTEGLASGIIGFGRGNFSIVSQLGSSIGGK--FSYCLASLFSKANISSKLYFG 263
Query: 259 IGHVVQPE-VNKTPLVPN--QPHYSINMTAVQVGLDFLNLPTDVFGVGDNKG-TIIDSGT 314
VV V TPL+ + +Y N+ A VG + L D + DN+G +IDSG+
Sbjct: 264 DMAVVSGHGVVSTPLIQSFYVGNYFTNLEAFSVGDHIIKL-KDSSLIPDNEGNAVIDSGS 322
Query: 315 TLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYT----CFQYSESVDEGFPNVTFHFENS 370
T+ LP VY L + +IS +K+ V D C++ + E P +T HF +
Sbjct: 323 TITQLPNDVYSQLETAVISM---VKLKRVKDPTQQLSLCYKTTLKKYE-VPIITAHFRGA 378
Query: 371 -VSLKVYPHEYLFPFEDLWCIGWQNSGM 397
V L + + ++ ++ C + +S
Sbjct: 379 DVKLNAF-NTFIQMNHEVMCFAFNSSAF 405
>gi|449519146|ref|XP_004166596.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 752
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 92/322 (28%), Positives = 142/322 (44%), Gaps = 44/322 (13%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y+ + IG+PPK + + +DTGSD+ W+ C+ C +C ++ YD KDS +
Sbjct: 192 GSGEYFIDVFIGSPPKHFSLILDTGSDLNWIQCVPCFDCFEQNG-----PYYDPKDSISF 246
Query: 132 KFVTCDQEFCHGVYG-GPLTDCTANT-SCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQ 189
+ +TC+ C V P C T SCPY YGD S+TTG F + +
Sbjct: 247 RNITCNDPRCQLVSSPDPPRPCKFETQSCPYFYWYGDSSNTTGDFALETFTVN------L 300
Query: 190 TTSTNG--------SLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVR 241
T+ST G +++FGCG G G+ S SQL S G
Sbjct: 301 TSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGL-----GRGPLSFSSQLQSLYG-- 353
Query: 242 KMFAHCL------DGINGGGIFAIGH--VVQPEVNKTPLV-----PNQPHYSINMTAVQV 288
F++CL ++ IF + PE+N T L+ P Y + + ++ V
Sbjct: 354 HSFSYCLVDRDSDTSVSSKLIFGEDKDLLTHPELNFTSLIAGKENPVDTFYYLQIKSIFV 413
Query: 289 GLDFLNLPTDVFGVGDN--KGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLK-VHTVHD 345
G + L +P + + + + GTIIDSGTTL+Y + Y + + + K V
Sbjct: 414 GGEKLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPI 473
Query: 346 EYTCFQYSESVDEGFPNVTFHF 367
+ C+ S + + FP F
Sbjct: 474 LHPCYNVSGTDELNFPEFLIQF 495
>gi|302774304|ref|XP_002970569.1| hypothetical protein SELMODRAFT_93861 [Selaginella moellendorffii]
gi|300162085|gb|EFJ28699.1| hypothetical protein SELMODRAFT_93861 [Selaginella moellendorffii]
Length = 490
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 104/379 (27%), Positives = 169/379 (44%), Gaps = 63/379 (16%)
Query: 42 LSLLKEHDARRQQRILAGVDLPLGGSSRPD------GVGLYYAKIGIGTPPKDYYVQVDT 95
L L+ RR + +L GS+R D G Y +++ IGTPP ++ + VD
Sbjct: 3 LELVANSHRRRDRELL--------GSARMDLHDDLLTKGYYTSRVKIGTPPHEFSLIVDR 54
Query: 96 GSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQE----FCHGVYGGPLTD 151
S + + C S ++ + SS+ K + C E FC G
Sbjct: 55 -SSFVSPKTMFC------SFFFLQDPRFSPALSSSYKPLECGNECSTGFCDG-------- 99
Query: 152 CTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDS 211
S Y Y + S+++G +DV+ + S DL L+FGC ++G+L
Sbjct: 100 -----SRKYQRQYAEKSTSSGVLGKDVISFSN-SSDLG----GQRLVFGCETAETGDL-- 147
Query: 212 TNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGIN-GGGIFAIGHVVQPE--VN 268
++ DGIIG G+ S+I QL + +F+ C G++ GGG +G P+ V
Sbjct: 148 -YDQTADGIIGLGRGPLSIIDQLVEKNAMEDVFSLCYGGMDEGGGAMILGGFQPPKDMVF 206
Query: 269 KTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNK-GTIIDSGTTLAYLPEMVYEPL 327
+ P+Y++ + ++VG L L +VF D K GT++DSGTT AY P ++
Sbjct: 207 TSSDPHRSPYYNLMLKGIRVGGSPLRLKPEVF---DGKYGTVLDSGTTYAYFPGAAFQAF 263
Query: 328 VSKIISQQPDLKVHTVHDEY---TCFQYS----ESVDEGFPNVTFHFENSVSLKVYPHEY 380
S + Q LK DE C+ + ++ + FP+V F F + S+ + P Y
Sbjct: 264 KSAVKEQVGSLKEVPGPDEKFKDICYAGAGTNVSNLSQFFPSVDFVFGDGQSVTLSPENY 323
Query: 381 LFPFEDL---WCIGWQNSG 396
LF + +C+G +G
Sbjct: 324 LFRHTKISGAYCLGVFENG 342
>gi|222628608|gb|EEE60740.1| hypothetical protein OsJ_14268 [Oryza sativa Japonica Group]
Length = 181
Score = 102 bits (253), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 69/207 (33%), Positives = 101/207 (48%), Gaps = 38/207 (18%)
Query: 3 LCLRNCLCIVLIATAAVGGVSSNHGVFSVKYRYA-----GRERSLSLLKEHDARRQQRIL 57
L L L +L+A++ G V+ G+F V+ +++ + + L+ HD R L
Sbjct: 4 LFLSAILSALLVASSTRGTVAI--GLFQVRRKFSIMGGGCKGSDIGALQTHDRNRHLSRL 61
Query: 58 AGVDLPLGG----SSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRR 113
D LGG S+ G Y + G+ ++ VDTGS WVNCI CK+CPR+
Sbjct: 62 VAADFSLGGLGGISTSSTG---YMLQCSFGSI---HFFLVDTGSSAFWVNCIPCKQCPRK 115
Query: 114 SSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGY 173
S + +LTLYD + S +C + CP++ Y DG ST G
Sbjct: 116 SDILKKLTLYDPRSSP---------------------ECNTSLLCPFIATYADGGSTIGA 154
Query: 174 FVQDVVQYDKVSGDLQTTSTNGSLIFG 200
FV D+V Y+++SG+ T STN SL FG
Sbjct: 155 FVTDLVHYNQLSGNGLTQSTNTSLTFG 181
>gi|449453872|ref|XP_004144680.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
CELL 1-like [Cucumis sativus]
Length = 757
Score = 102 bits (253), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 92/324 (28%), Positives = 143/324 (44%), Gaps = 44/324 (13%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y+ + IG+PPK + + +DTGSD+ W+ C+ C +C ++ YD KDS +
Sbjct: 192 GSGEYFIDVFIGSPPKHFSLILDTGSDLNWIQCVPCFDCFEQNG-----PYYDPKDSISF 246
Query: 132 KFVTCDQEFCHGVYG-GPLTDCTANT-SCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQ 189
+ +TC+ C V P C T SCPY YGD S+TTG F + +
Sbjct: 247 RNITCNDPRCQLVSSPDPPRPCKFETQSCPYFYWYGDSSNTTGDFALETFTVN------L 300
Query: 190 TTSTNG--------SLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVR 241
T+ST G +++FGCG G G+ S SQL S G
Sbjct: 301 TSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGL-----GRGPLSFSSQLQSLYG-- 353
Query: 242 KMFAHCL------DGINGGGIFAIGH--VVQPEVNKTPLV-----PNQPHYSINMTAVQV 288
F++CL ++ IF + PE+N T L+ P Y + + ++ V
Sbjct: 354 HSFSYCLVDRDSDTSVSSKLIFGEDKDLLTHPELNFTSLIAGKENPVDTFYYLQIKSIFV 413
Query: 289 GLDFLNLPTDVFGVGDN--KGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLK-VHTVHD 345
G + L +P + + + + GTIIDSGTTL+Y + Y + + + K V
Sbjct: 414 GGEKLQIPEENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPI 473
Query: 346 EYTCFQYSESVDEGFPNVTFHFEN 369
+ C+ S + + FP F +
Sbjct: 474 LHPCYNVSGTDELNFPEFLIQFAD 497
>gi|357157325|ref|XP_003577760.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 413
Score = 102 bits (253), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 87/317 (27%), Positives = 128/317 (40%), Gaps = 25/317 (7%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKF 133
G YY + IG P K Y++ +DTGSD+ W+ C + P +S + LY + K
Sbjct: 50 GHYYVTMNIGDPAKPYFLDIDTGSDLTWLQC----DAPCQSCNKVPHPLYK---PTKNKL 102
Query: 134 VTCDQEFCHGVYGG--PLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
V C C ++ P C C Y Y D +S+ G V D ++
Sbjct: 103 VPCAASICTTLHSAQSPNKKCAVPQQCDYQIKYTDSASSLGVLVTDNFTLPL----RNSS 158
Query: 192 STNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI 251
S S FGCG Q + + DG++G GK + S++SQL G + + HCL
Sbjct: 159 SVRPSFTFGCGYDQQVGKNGVVQATTDGLLGLGKGSVSLVSQLKVLGITKNVLGHCLS-T 217
Query: 252 NGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIID 311
NGGG G V P T VP S N + G + + + GV + + D
Sbjct: 218 NGGGFLFFGDNVVPTSRAT-WVPMVRSTSGNYYSPGSGTLYFDRRS--LGVKPME-VVFD 273
Query: 312 SGTTLAYLPEMVYEPLV-------SKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVT 364
SG+T Y Y+ V SK + Q D + F+ V F ++
Sbjct: 274 SGSTYTYFAAQPYQATVSALKAGLSKSLQQVSDPSLPLCWKGQKVFKSVSDVKNDFKSLF 333
Query: 365 FHFENSVSLKVYPHEYL 381
F + L++ P YL
Sbjct: 334 LSFVKNSVLEIPPENYL 350
>gi|115487628|ref|NP_001066301.1| Os12g0177500 [Oryza sativa Japonica Group]
gi|108862256|gb|ABA96613.2| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113648808|dbj|BAF29320.1| Os12g0177500 [Oryza sativa Japonica Group]
gi|215693997|dbj|BAG89196.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 421
Score = 102 bits (253), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 98/363 (26%), Positives = 146/363 (40%), Gaps = 56/363 (15%)
Query: 47 EHDARRQQRILAGVD-----LPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMW 101
+ AR + AG + PL G P G LYY + IG PP+ Y++ VDTGSD+ W
Sbjct: 26 DRPARGGLSVTAGAEESSAVFPLYGDVYPHG--LYYVAMSIGNPPRPYFLDVDTGSDLTW 83
Query: 102 VN----CIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLT---DC-T 153
+ C+ C + P + + + K V C + C ++GG LT C +
Sbjct: 84 LQCDAPCVSCSKVP-----------HPLYRPTKNKLVPCVDQMCAALHGG-LTGRHKCDS 131
Query: 154 ANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTN 213
C Y Y D S+ G V D + + L FGCG Q ST
Sbjct: 132 PKQQCDYEIKYADQGSSLGVLVTDSFALRLANSSI----VRPGLAFGCGYDQQVG-SSTE 186
Query: 214 EEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHVVQPEVNKT--P 271
A DG++G G + S++SQL G + + HCL GGG G + P T P
Sbjct: 187 VSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCLS-TRGGGFLFFGDDIVPYSRATWAP 245
Query: 272 LV--PNQPHYSINMTAVQVGLDFLNL-PTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLV 328
+ ++ +YS + G L + P +V + DSG++ Y Y+ LV
Sbjct: 246 MARSTSRNYYSPGSANLYFGGRPLGVRPMEV---------VFDSGSSFTYFSAQPYQALV 296
Query: 329 -------SKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFEN--SVSLKVYPHE 379
SK + + PD + F+ V + F V F N +++ P
Sbjct: 297 DAIKGDLSKNLKEVPDHSLPLCWKGKKPFKSVLDVKKEFRTVVLSFSNGKKALMEIPPEN 356
Query: 380 YLF 382
YL
Sbjct: 357 YLI 359
>gi|359494621|ref|XP_002265771.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 449
Score = 102 bits (253), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 91/345 (26%), Positives = 153/345 (44%), Gaps = 36/345 (10%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNC---IQCKECPRRSSLGIE-LTLYDIKD 127
G+G Y +GTP + + + DTGSD+ W++C + + C R + I ++
Sbjct: 79 GIGQYSVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRVFHANL 138
Query: 128 SSTGKFVTCDQEFCHGVYGG--PLTDC-TANTSCPYLEIYGDGSSTTGYFVQDVVQYDKV 184
SS+ K + C + C LT+C T T C Y Y DGS+ G+F + V +
Sbjct: 139 SSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANETVTVELK 198
Query: 185 SGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMF 244
G +++ GC S + + +A DG++G G S S + A G + F
Sbjct: 199 EGRKMKLH---NVLIGC----SESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGGK--F 249
Query: 245 AHCL----DGINGGGIFAIGHVVQPE-----VNKTPLVPNQPH--YSINMTAVQVGLDFL 293
++CL N G E + T LV + Y++NM + +G L
Sbjct: 250 SYCLVDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAML 309
Query: 294 NLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHD-----EYT 348
+P++V+ V GTI+DSG++L +L E Y+P+++ + + LK V EY
Sbjct: 310 KIPSEVWDVKGAGGTILDSGSSLTFLTEPAYQPVMAAL--RVSLLKFRKVEMDIGPLEY- 366
Query: 349 CFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFED-LWCIGW 392
CF + + P + FHF + + Y+ D + C+G+
Sbjct: 367 CFNSTGFEESLVPRLVFHFADGAEFEPPVKSYVISAADGVRCLGF 411
>gi|297742733|emb|CBI35367.3| unnamed protein product [Vitis vinifera]
Length = 521
Score = 102 bits (253), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 91/355 (25%), Positives = 149/355 (41%), Gaps = 52/355 (14%)
Query: 47 EHDARRQQRILAGVDLPLGGSSRPD------------GVGLYYAKIGIGTPPKDYYVQVD 94
+ DA+R ++ + GGS R D G G Y+ +IG+G+PP+ Y+ +D
Sbjct: 160 KRDAKRVASLIRRLSSGGGGSYRVDDFGTDVISGMEQGSGEYFVRIGVGSPPRSQYMVID 219
Query: 95 TGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTA 154
+GSDI+WV C C +C +S ++D DS++ V+C C + C A
Sbjct: 220 SGSDIVWVQCQPCTQCYHQSD-----PVFDPADSASFTGVSCSSSVCDRLEN---AGCHA 271
Query: 155 NTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNE 214
C Y YGDGS T G + + + + + S+ GCG R G
Sbjct: 272 G-RCRYEVSYGDGSYTKGTLALETLTFGR--------TMVRSVAIGCGHRNRGMFVGAAG 322
Query: 215 EALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHVVQPEVNKTPLVP 274
G + S + QL G F++CL + P V + P P
Sbjct: 323 LLGL-----GGGSMSFVGQLGGQTG--GAFSYCL----------VSAAWVPLV-RNPRAP 364
Query: 275 NQPHYSINMTAVQVGLDFLNLPTDVFGVGD--NKGTIIDSGTTLAYLPEMVYEPLVSKII 332
+ Y I + + VG + + +VF + + + G ++D+GT + LP + Y+ +
Sbjct: 365 S--FYYIGLAGLGVGGIRVPISEEVFRLTELGDGGVVMDTGTAVTRLPTLAYQAFRDAFL 422
Query: 333 SQQPDLKVHT-VHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFED 386
+Q +L T V TC+ V P V+F+F L + +L P +D
Sbjct: 423 AQTANLPRATGVAIFDTCYDLLGFVSVRVPTVSFYFSGGPILTLPARNFLIPMDD 477
>gi|300681506|emb|CBH32600.1| pepsin A, putative, expressed [Triticum aestivum]
Length = 477
Score = 102 bits (253), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 92/356 (25%), Positives = 147/356 (41%), Gaps = 41/356 (11%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKEC-----PRRSSLGIELTLYDIK 126
G+G Y+ + +GTP + + + DTGSD+ WV C + P S G + +
Sbjct: 93 GIGQYFVRFRVGTPAQPFLLVADTGSDLTWVKCRRPASANSSLSPADSGPGPGRA-FRPE 151
Query: 127 DSSTGKFVTCDQEFCHGVYGGPLTDC-TANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVS 185
DS T ++C + C L C T + C Y Y DGS+ G + +S
Sbjct: 152 DSRTWAPISCASDTCTKSLPFSLATCPTPGSPCAYDYRYKDGSAARGTVGTESATI-ALS 210
Query: 186 GDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFA 245
G + + L+ GC + +G + EA DG++ G S S S AS G R F+
Sbjct: 211 GREERKAKLKGLVLGCSSSYTG----PSFEASDGVLSLGYSGISFASHAASRFGGR--FS 264
Query: 246 HCL----DGINGGGIFAIG---HVVQPE------------VNKTPLVPNQ---PHYSINM 283
+CL N G V P +TPL+ ++ P Y +++
Sbjct: 265 YCLVDHLSPRNATSYLTFGPNPAVSSPRASPSSCAAAAPRARQTPLLLDRRMRPFYDVSL 324
Query: 284 TAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTV 343
A+ V +FL +P V+ V G I+DSGT+L L + Y +V+ + L T+
Sbjct: 325 KAISVAGEFLKIPRAVWDVEAGGGVILDSGTSLTVLAKPAYRAVVAALSKGLAGLPRVTM 384
Query: 344 HDEYTCFQYS----ESVDEGFPNVTFHFENSVSLKVYPHEYLF-PFEDLWCIGWQN 394
C+ ++ + D P + HF + L+ Y+ + CIG Q
Sbjct: 385 DPFEYCYNWTSPSGKDADVAVPKMAVHFAGAARLEPPGKSYVIDAAPGVKCIGLQE 440
>gi|449462551|ref|XP_004149004.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449515029|ref|XP_004164552.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 434
Score = 102 bits (253), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 91/333 (27%), Positives = 138/333 (41%), Gaps = 42/333 (12%)
Query: 70 PDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSS 129
PD G + I IGTPP + DTGSD+ W C+ C+EC +S +++ + SS
Sbjct: 85 PDS-GEFLMSIFIGTPPVNVIAIADTGSDLTWTQCLPCRECFNQSQ-----PIFNPRRSS 138
Query: 130 TGKFVTCDQEFCHGV---YGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSG 186
+ + V+C + C + + GP SC Y YGD S T G D + G
Sbjct: 139 SYRKVSCASDTCRSLESYHCGPDLQ-----SCSYGYSYGDRSFTYGDLASDQITI----G 189
Query: 187 DLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAH 246
+ T + GCG + G + + S++SQ+ + GV+ F++
Sbjct: 190 SFKLPKT----VIGCGHQNGGTFGGVTSGIIGLG----GGSLSLVSQMRTIAGVKPRFSY 241
Query: 247 CL----DGINGGGIFAIGH---VVQPEVNKTPLVPNQP--HYSINMTAVQVGLDFLNLPT 297
CL N G + G V +V TPLVP P Y + + A+ VG
Sbjct: 242 CLPTFFSNANITGTISFGRKAVVSGRQVVSTPLVPRSPDTFYFLTLEAISVGKKRFKAAN 301
Query: 298 DVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEY----TCFQYS 353
+ + ++ IIDSGTTL LP +Y + S + +K V D C+
Sbjct: 302 GISAMTNHGNIIIDSGTTLTLLPRSLYYGVFSTLARV---IKAKRVDDPSGILELCYSAG 358
Query: 354 ESVDEGFPNVTFHFENSVSLKVYPHEYLFPFED 386
+ D P +T HF +K+ P P D
Sbjct: 359 QVDDLNIPIITAHFAGGADVKLLPVNTFAPVAD 391
>gi|32526671|dbj|BAC79194.1| chloroplast nucleoid DNA-binding protein -like protein [Oryza
sativa Japonica Group]
Length = 732
Score = 102 bits (253), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 75/267 (28%), Positives = 125/267 (46%), Gaps = 27/267 (10%)
Query: 75 LYYAKIGIGTPPKDYYVQVDTGSDIMWV--NCIQCKECPRRSSLGIELTLYDIKDSSTGK 132
L+YA + +GTP + V +DTGSD+ WV +C++C + ++ +Y S+T +
Sbjct: 98 LHYAVVALGTPNVTFLVALDTGSDLFWVPCDCLKCAPFQSPNYGSLKFDVYSPAQSTTSR 157
Query: 133 FVTCDQEFCHGVYGGPLTDC--TANTSCPY-LEIYGDGSSTTGYFVQDVVQYDKVSGDLQ 189
V C C L + + + SCPY ++ D +S++G V+DV+ S Q
Sbjct: 158 KVPCSSNLCD------LQNACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYL--TSDSAQ 209
Query: 190 TTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLD 249
+ ++FGCG Q+G+ A +G++G G + S+ S LAS G F+ C
Sbjct: 210 SKIVTAPIMFGCGQVQTGSF--LGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCF- 266
Query: 250 GINGGGIFAIGHVVQPEVNKTPL--VPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKG 307
G +G G G + +TPL P+Y+I +T + VG +
Sbjct: 267 GDDGHGRINFGDTGSSDQKETPLNVYKQNPYYNITITGITVGSK---------SISTEFS 317
Query: 308 TIIDSGTTLAYLPEMVYEPLVSKIISQ 334
I+DSGT+ L + +Y + S +Q
Sbjct: 318 AIVDSGTSFTALSDPMYTQITSSFDAQ 344
>gi|356541713|ref|XP_003539318.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 640
Score = 102 bits (253), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 95/337 (28%), Positives = 149/337 (44%), Gaps = 49/337 (14%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKF 133
G Y ++ IGTPP+ + + VDTGS + +V C CK C + + S T +
Sbjct: 91 GYYTTRLWIGTPPQRFALIVDTGSTVTYVPCSTCKHCGSHQD-----PKFRPEASETYQP 145
Query: 134 VTCDQEFCHGVYGGPLTDCTAN-TSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTS 192
V C + +C + C Y Y + S+++G +DVV + Q+
Sbjct: 146 VKCTWQ----------CNCDDDRKQCTYERRYAEMSTSSGVLGEDVVSFGN-----QSEL 190
Query: 193 TNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGIN 252
+ IFGC ++G D N+ A DGI+G G+ + S++ QL + F+ C G+
Sbjct: 191 SPQRAIFGCENDETG--DIYNQRA-DGIMGLGRGDLSIMDQLVEKKVISDAFSLCYGGMG 247
Query: 253 G-------GGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDN 305
GGI +V + P+Y+I++ + V L+L VF D
Sbjct: 248 VGGGAMVLGGISPPADMVFTHSDPV----RSPYYNIDLKEIHVAGKRLHLNPKVF---DG 300
Query: 306 K-GTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEY---TCFQYSE----SVD 357
K GT++DSGTT AYLPE + I+ + LK + D + CF +E +
Sbjct: 301 KHGTVLDSGTTYAYLPESAFLAFKHAIMKETHSLKRISGPDPHYNDICFSGAEINVSQLS 360
Query: 358 EGFPNVTFHFENSVSLKVYPHEYLFPFEDL---WCIG 391
+ FP V F N L + P YLF + +C+G
Sbjct: 361 KSFPVVEMVFGNGHKLSLSPENYLFRHSKVRGAYCLG 397
>gi|226508080|ref|NP_001150678.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
gi|195641018|gb|ACG39977.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 450
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 104/382 (27%), Positives = 151/382 (39%), Gaps = 60/382 (15%)
Query: 45 LKEHDARRQQRIL---AGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMW 101
+ H+AR+ A V P S G Y + IGTPP Y DTGSD++W
Sbjct: 59 MHRHNARKLALAASSGATVSAPTQNSPT---AGEYLMALAIGTPPLPYQAIADTGSDLIW 115
Query: 102 VNCIQC-KECPRRSSLGIELTLYDIKDSSTGKFVTCDQEF--CHGVYGGPLTDCTANTSC 158
C C +C R+ + LY+ S+T + C+ C G T +C
Sbjct: 116 TQCAPCTSQCFRQPT-----PLYNPSSSTTFAVLPCNSSLSVCAAALAGTGTAPPPGCAC 170
Query: 159 PYLEIYGDG------SSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDST 212
Y YG G S T F +V G + FGC SG
Sbjct: 171 TYNVTYGSGWTSVFQGSETFTFGSTPAGQSRVPG----------IAFGCSTASSG----F 216
Query: 213 NEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL---DGINGGGIFAIGHVVQ----P 265
N + G++G G+ S++SQL GV K F++CL N +G
Sbjct: 217 NASSASGLVGLGRGRLSLVSQL----GVPK-FSYCLTPYQDTNSTSTLLLGPSASLNGTA 271
Query: 266 EVNKTPLV------PNQPHYSINMTAVQVGLDFLNLPTDVFGVGDN--KGTIIDSGTTLA 317
V+ TP V P Y +N+T + +G L++P D F + + G IIDSGTT+
Sbjct: 272 GVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALSIPPDAFLLNADGTGGLIIDSGTTIT 331
Query: 318 YLPEMVYEPLVSKIIS----QQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSL 373
L Y+ + + ++S D T D S S P++T HF N +
Sbjct: 332 LLGNTAYQQVRAAVVSLVTLPTTDGSAATGLDLCFMLPSSTSAPPAMPSMTLHF-NGADM 390
Query: 374 KVYPHEYLFPFED-LWCIGWQN 394
+ Y+ + LWC+ QN
Sbjct: 391 VLPADSYMMSDDSGLWCLAMQN 412
>gi|116787367|gb|ABK24480.1| unknown [Picea sitchensis]
Length = 496
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 101/391 (25%), Positives = 164/391 (41%), Gaps = 54/391 (13%)
Query: 39 ERSLSLLKEHDARRQQRILAGVDLPLGG---SSRPDGVGLYYAKIGIGTPPKDYYVQVDT 95
ER L L K+ + +AGV G S G G Y+ +IGIGTP ++ Y+ +DT
Sbjct: 116 ERKLKLKKDPAGSYEN--VAGVTAEFGSEVVSGMEQGSGEYFTRIGIGTPTREQYMVLDT 173
Query: 96 GSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTAN 155
GSD++W+ C C+EC ++ +++ S + V CD C + DC
Sbjct: 174 GSDVVWIQCEPCRECYSQAD-----PIFNPSSSVSFSTVGCDSAVCSQLDA---NDCHGG 225
Query: 156 TSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEE 215
C Y YGDGS T G + + + + T++ ++ GCG G
Sbjct: 226 -GCLYEVSYGDGSYTVGSYATETLTFG--------TTSIQNVAIGCGHDNVGLFVGAAGL 276
Query: 216 ALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL---DGINGGGI------FAIGHVVQPE 266
G S +QL + G + F++CL D + G + IG + P
Sbjct: 277 LGLGAGSL-----SFPAQLGTQTG--RAFSYCLVDRDSESSGTLEFGPESVPIGSIFTPL 329
Query: 267 VNKTPLVPNQPHYSINMTAVQVGLDFLN-LPTDVFGVGDNK---GTIIDSGTTLAYLPEM 322
V P +P Y ++M A+ VG L+ +P++ F + + G IIDSGT + L
Sbjct: 330 V-ANPFLPT--FYYLSMVAISVGGVILDSVPSEAFRIDETTGRGGIIIDSGTAVTRLQTS 386
Query: 323 VYEPLVSKIISQQPDL-KVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYL 381
Y+ L I+ L + + TC+ S P V FHF N + L
Sbjct: 387 AYDALRDAFIAGTQHLPRADGISIFDTCYDLSALQSVSIPAVGFHFSNGAGFILPAKNCL 446
Query: 382 FPFEDL--WCIGWQNSGMQSRDRKNMTLLGD 410
P + + +C + + N++++G+
Sbjct: 447 IPMDSMGTFCFAFAPAD------SNLSIMGN 471
>gi|217074470|gb|ACJ85595.1| unknown [Medicago truncatula]
gi|388505166|gb|AFK40649.1| unknown [Medicago truncatula]
Length = 452
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 119/396 (30%), Positives = 171/396 (43%), Gaps = 63/396 (15%)
Query: 33 YRYAGRERSLS-----LLKEHDA----RRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIG 83
Y +A E + L K DA ++ LAG+ L G S G G YY K+G+G
Sbjct: 54 YMFAKDEERIRYFHSRLAKNSDANASFKKVGPKLAGIPLKSGLSM---GSGNYYVKMGLG 110
Query: 84 TPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHG 143
+P K Y + VDTGS W +QC+ C + E +++ S T K V C C
Sbjct: 111 SPTKYYTMIVDTGSSFSW---LQCQPCTIYCHIQ-EDPVFNPSASKTYKTVPCSSSQCSS 166
Query: 144 VYGGPLTDCTA---NTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFG 200
+ L + T + +C Y YGD S + GY QDV+ L + T S ++G
Sbjct: 167 LKSATLNEPTCSKQSNACVYKASYGDSSFSLGYLSQDVL-------TLTPSQTLSSFVYG 219
Query: 201 CGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL------DGINGG 254
CG G T DGIIG + SM+SQL SG F++CL
Sbjct: 220 CGQDNQGLFGRT-----DGIIGLANNELSMLSQL--SGKYGNAFSYCLPTSFSTPNSPKE 272
Query: 255 GIFAIG-HVVQPEVNK--TPLV--PNQPH-YSINMTAVQVGLDFLNLPTDVFGVGDNKGT 308
G +IG + P + TPL+ PN P Y I++ ++ V L + + V T
Sbjct: 273 GFLSIGTSSLTPSSSYKFTPLLKNPNNPSLYFIDLESITVAGRPLGVAASSYKV----PT 328
Query: 309 IIDSGTTLAYLPEMVYEPL-------VSKIISQQPDLKVHTVHDEYTCFQYSES-VDEGF 360
IIDSGT + LP VY L +SK Q P + + TCF+ S + + E
Sbjct: 329 IIDSGTVITRLPTPVYTTLKNAYVTILSKKYQQAPGISLLD-----TCFKGSLAGISEVA 383
Query: 361 PNVTFHFENSVSLKVYPHEYLFPFED-LWCIGWQNS 395
P++ F+ L++ H L E + C+ S
Sbjct: 384 PDIRIIFKGGADLQLKGHNSLVELETGITCLAMAGS 419
>gi|357463449|ref|XP_003602006.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355491054|gb|AES72257.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 529
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 89/316 (28%), Positives = 136/316 (43%), Gaps = 33/316 (10%)
Query: 75 LYYAKIGIGTPPKDYYVQVDTGSDIMWV--NCIQCKECPRR--SSLGIELTLYDIKDSST 130
L+Y I IGTP + V +D GSD++WV +CI C S+L +L Y S +
Sbjct: 99 LHYTWIDIGTPSTSFLVALDAGSDLLWVPCDCIHCAPLSASFYSNLDRDLNEYSPSRSLS 158
Query: 131 GKFVTCDQEFCHGVYGGPLTDCTANTSCPY-LEIYGDGSSTTGYFVQDVVQYDKVSGDLQ 189
K ++C C G + CPY + D +S++G V+D+ G
Sbjct: 159 SKHLSCSHRLCD---MGSNCKTSKQQQCPYTINYLSDNTSSSGLLVEDIFHLQSGDGSTS 215
Query: 190 TTSTNGSLIFGCGARQSGN-LDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL 248
+S ++ GCG +QSG LD T A DG+IG G SS+ S LA SG +R F+ C
Sbjct: 216 NSSVQAPVVVGCGMKQSGGYLDGT---APDGLIGLGPGESSVPSFLAKSGLIRDSFSLCF 272
Query: 249 DGINGGGIF--AIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNK 306
+ + G +F G VQ TP + +S + V+ + P +
Sbjct: 273 NEDDSGRLFFGDQGSTVQ---QSTPFLLVDGMFSTYIVGVETCCIGNSCPKVT-----SF 324
Query: 307 GTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYT------CFQYSESVDEGF 360
DSGT+ +LP Y I+++ D +V+ + C+ S
Sbjct: 325 NAQFDSGTSFTFLPGHAY-----GAIAEEFDKQVNATRSTFQGSPWEYCYVPSSQQLPKI 379
Query: 361 PNVTFHFENSVSLKVY 376
P +T F+ + S VY
Sbjct: 380 PTLTLMFQQNNSFVVY 395
>gi|125540927|gb|EAY87322.1| hypothetical protein OsI_08726 [Oryza sativa Indica Group]
Length = 464
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 93/320 (29%), Positives = 137/320 (42%), Gaps = 55/320 (17%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y + +GTP ++VDTGSD+ WV QCK CP L+D SS+ V
Sbjct: 131 YVVTVSLGTPAVAQTLEVDTGSDVSWV---QCKPCPSPPCYSQRDPLFDPTRSSSYSAVP 187
Query: 136 CDQEFC-------HGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDL 188
C C +G GG C Y+ YGDGS+TTG + D L
Sbjct: 188 CAAASCSQLALYSNGCSGG---------QCGYVVSYGDGSTTTGVYSSDT---------L 229
Query: 189 QTTSTNG--SLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASS-GGVRKMFA 245
T +N +FGCG Q G +DG++G G+ S++SQ +S+ GGV F+
Sbjct: 230 TLTGSNALKGFLFGCGHAQQGLF-----AGVDGLLGLGRQGQSLVSQASSTYGGV---FS 281
Query: 246 HCLDGI-NGGGIFAIGHVVQPE-VNKTPLV--PNQPHYSINMTA-VQVGLDFLNLPTDVF 300
+CL N G ++G + TPL+ N P Y I M A + VG L++ VF
Sbjct: 282 YCLPPTQNSVGYISLGGPSSTAGFSTTPLLTASNDPTYYIVMLAGISVGGQPLSIDASVF 341
Query: 301 GVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQ-----PDLKVHTVHDEYTCFQYSES 355
G ++D+GT + LP Y L S + P + D TC+ ++
Sbjct: 342 A----SGAVVDTGTVVTRLPPTAYSALRSAFRAAMAPYGYPSAPATGILD--TCYDFTRY 395
Query: 356 VDEGFPNVTFHFENSVSLKV 375
P ++ F ++ +
Sbjct: 396 GTVTLPTISIAFGGGAAMDL 415
>gi|224083514|ref|XP_002307058.1| predicted protein [Populus trichocarpa]
gi|222856507|gb|EEE94054.1| predicted protein [Populus trichocarpa]
Length = 376
Score = 101 bits (252), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 95/346 (27%), Positives = 138/346 (39%), Gaps = 46/346 (13%)
Query: 55 RILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNC----IQCKEC 110
R+ + + LPL G+ P+G Y + IG P K Y++ VDTGSD+ W+ C +QC E
Sbjct: 1 RVPSSIVLPLHGNVYPNGY--YNVTLNIGQPSKPYFLDVDTGSDLTWLQCDAPCVQCTEA 58
Query: 111 PRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSST 170
P Y +++ V C C ++ C C Y Y DG S+
Sbjct: 59 PH--------PYYRPRNN----LVPCMDPICQSLHSNGDHRCENPGQCDYEVEYADGGSS 106
Query: 171 TGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSM 230
G V+D + T+ S + G + +DG++G GK SS+
Sbjct: 107 FGVLVRDTFNLN------FTSEKRHSPLLALGLCGYDQFPGGSHHPIDGVLGLGKGKSSI 160
Query: 231 ISQLASSGGVRKMFAHCLDGINGGGIFAIGHVV-QPEVNKTPLVPNQPHYSINMTAVQVG 289
+SQL+S G VR + HCL G GG +F + V TP+ P+ HYS G
Sbjct: 161 VSQLSSLGLVRNVIGHCLSGHGGGFLFFGDDLYDSSRVAWTPMSPDAKHYS-------PG 213
Query: 290 LDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTC 349
L L G N T DSG + YL Y+ L+S + + + D+ T
Sbjct: 214 LAELTFDGKTTGF-KNLLTTFDSGASYTYLNSQAYQGLISLLKKELSGKPLREALDDQTL 272
Query: 350 ---------FQYSESVDEGFPNVTFHFEN----SVSLKVYPHEYLF 382
F+ V + F F N L+ P YL
Sbjct: 273 PLCWKGRKPFKSIRDVKKYFKTFALSFTNERKSKTELEFPPEAYLI 318
>gi|118487542|gb|ABK95598.1| unknown [Populus trichocarpa]
Length = 471
Score = 101 bits (252), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 97/319 (30%), Positives = 144/319 (45%), Gaps = 40/319 (12%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCK-ECPRRSSLGIELTLYDIKDSST 130
G G YY K+G+GTPPK Y + +DTGS + W+ C C C ++ LYD S T
Sbjct: 121 GSGNYYVKLGLGTPPKYYAMILDTGSSLSWLQCQPCAVYCHAQAD-----PLYDPSVSKT 175
Query: 131 GKFVTCDQEFCHGVYGGPLTD--C-TANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGD 187
K ++C C + L D C T + +C Y YGD S + GY QD++
Sbjct: 176 YKKLSCASVECSRLKAATLNDPLCETDSNACLYTASYGDTSFSIGYLSQDLL-------T 228
Query: 188 LQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHC 247
L ++ T +GCG G GIIG + SM++QL++ G F++C
Sbjct: 229 LTSSQTLPQFTYGCGQDNQGLFGRA-----AGIIGLARDKLSMLAQLSTKYG--HAFSYC 281
Query: 248 LDGIN---GGGIFAIGHVVQPEVNK-TPLV---PNQPHYSINMTAVQVGLDFLNLPTDVF 300
L N GG F + P K TP++ N Y + +TA+ V L+L ++
Sbjct: 282 LPTANSGSSGGGFLSIGSISPTSYKFTPMLTDSKNPSLYFLRLTAITVSGRPLDLAAAMY 341
Query: 301 GVGDNKGTIIDSGTTLAYLPEMVYEPLVS---KIISQQ-PDLKVHTVHDEYTCFQYSESV 356
V T+IDSGT + LP +Y L KI+S + +++ D TCF+ S
Sbjct: 342 RV----PTLIDSGTVITRLPMSMYAALRQAFVKIMSTKYAKAPAYSILD--TCFKGSLKS 395
Query: 357 DEGFPNVTFHFENSVSLKV 375
P + F+ L +
Sbjct: 396 ISAVPEIKMIFQGGADLTL 414
>gi|45735840|dbj|BAD12875.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735966|dbj|BAD12995.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|125583491|gb|EAZ24422.1| hypothetical protein OsJ_08175 [Oryza sativa Japonica Group]
Length = 475
Score = 101 bits (252), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 93/320 (29%), Positives = 137/320 (42%), Gaps = 55/320 (17%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y + +GTP ++VDTGSD+ WV QCK CP L+D SS+ V
Sbjct: 142 YVVTVSLGTPAVAQTLEVDTGSDVSWV---QCKPCPSPPCYSQRDPLFDPTRSSSYSAVP 198
Query: 136 CDQEFC-------HGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDL 188
C C +G GG C Y+ YGDGS+TTG + D L
Sbjct: 199 CAAASCSQLALYSNGCSGG---------QCGYVVSYGDGSTTTGVYSSDT---------L 240
Query: 189 QTTSTNG--SLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASS-GGVRKMFA 245
T +N +FGCG Q G +DG++G G+ S++SQ +S+ GGV F+
Sbjct: 241 TLTGSNALKGFLFGCGHAQQGLF-----AGVDGLLGLGRQGQSLVSQASSTYGGV---FS 292
Query: 246 HCLDGI-NGGGIFAIGHVVQPE-VNKTPLV--PNQPHYSINMTA-VQVGLDFLNLPTDVF 300
+CL N G ++G + TPL+ N P Y I M A + VG L++ VF
Sbjct: 293 YCLPPTQNSVGYISLGGPSSTAGFSTTPLLTASNDPTYYIVMLAGISVGGQPLSIDASVF 352
Query: 301 GVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQ-----PDLKVHTVHDEYTCFQYSES 355
G ++D+GT + LP Y L S + P + D TC+ ++
Sbjct: 353 A----SGAVVDTGTVVTRLPPTAYSALRSAFRAAMAPYGYPSAPATGILD--TCYDFTRY 406
Query: 356 VDEGFPNVTFHFENSVSLKV 375
P ++ F ++ +
Sbjct: 407 GTVTLPTISIAFGGGAAMDL 426
>gi|449529194|ref|XP_004171586.1| PREDICTED: aspartic proteinase-like protein 1-like, partial
[Cucumis sativus]
Length = 417
Score = 101 bits (252), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 81/271 (29%), Positives = 127/271 (46%), Gaps = 28/271 (10%)
Query: 75 LYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKEC-PRRSSL---GIELTLYDIKDSST 130
L+Y + +GTP + V +DTGSD+ WV C C C P S EL++Y K SST
Sbjct: 3 LHYTTVQLGTPGTKFMVALDTGSDLFWVPC-DCSRCAPTEGSPYASDFELSVYSPKKSST 61
Query: 131 GKFVTCDQEFCHGVYGGPLTDCT-ANTSCPYLEIYGDG-SSTTGYFVQDVVQYDKVSGDL 188
K V C+ C CT A +CPY+ Y +STTG ++D++ + +
Sbjct: 62 SKTVPCNNSLC-----AQRDQCTEAFGNCPYVVSYVSAETSTTGILIEDLLHLK--TENK 114
Query: 189 QTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL 248
+ + FGCG QSG+ + A +G+ G G S+ S L+ G + F+ C
Sbjct: 115 HSEPIQAYITFGCGQVQSGSF--LDVAAPNGLFGLGMEQISVPSILSREGLMANSFSMCF 172
Query: 249 DGINGGGIFAIGHVVQPEVNKTPLVPNQ--PHYSINMTAVQVGLDFLNLPTDVFGVGDNK 306
+G G G E +TP NQ P+Y+I +T+++VG ++ +
Sbjct: 173 SD-DGVGRINFGDKGSLEQEETPFNLNQLHPNYNITVTSIRVGTTLIDA---------DI 222
Query: 307 GTIIDSGTTLAYLPEMVYEPLVSKIISQQPD 337
+ DSGT+ +Y + +Y L + +Q D
Sbjct: 223 TALFDSGTSFSYFTDPIYSKLSASFHAQTRD 253
>gi|414589628|tpg|DAA40199.1| TPA: hypothetical protein ZEAMMB73_627989 [Zea mays]
Length = 452
Score = 101 bits (252), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 101/368 (27%), Positives = 160/368 (43%), Gaps = 61/368 (16%)
Query: 34 RYAGRERSLSLLKEH---DARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYY 90
R R +LS ++ + +Q+ AGV LP+ RP G Y + IGTPP+
Sbjct: 56 RSKARAAALSAVRNRARFSGKNEQQTPAGV-LPV----RPSGDLEYVVDLAIGTPPQPVS 110
Query: 91 VQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLT 150
+DTGSD++W C C C L L+ S++ + + C C +
Sbjct: 111 ALLDTGSDLIWTQCAPCASC-----LSQPDPLFAPGQSASYEPMRCAGTLCSDILH---H 162
Query: 151 DCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLD 210
C +C Y YGDG+ T G + + + SG T+T L FGCG+ G+L+
Sbjct: 163 SCERPDTCTYRYNYGDGTMTVGVYATERFTFAS-SGGGGLTTTTVPLGFGCGSVNVGSLN 221
Query: 211 STNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL-----------------DGING 253
+ + GI+GFG++ S++SQL+ +R+ F++CL DG+ G
Sbjct: 222 NGS-----GIVGFGRNPLSLVSQLS----IRR-FSYCLTSYASRRQSTLLFGSLSDGVYG 271
Query: 254 GGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDN--KGTIID 311
A G V + ++P P Y ++ T + VG L +P F + + G I+D
Sbjct: 272 D---ATGRVQTTPLLQSPQNPT--FYYVHFTGLTVGARRLRIPESAFALRPDGSGGVIVD 326
Query: 312 SGTTLAYLPEMVYEPLVSKIISQQPDLKVHT--VHDEYTCF-------QYSESVDEGFPN 362
SGT L LP V +V + QQ L ++ CF + S + P
Sbjct: 327 SGTALTLLPAAVLAEVV-RAFRQQLRLPFANGGNPEDGVCFLVPAAWRRSSSTSQMPVPR 385
Query: 363 VTFHFENS 370
+ HF+ +
Sbjct: 386 MVLHFQGA 393
>gi|222616728|gb|EEE52860.1| hypothetical protein OsJ_35411 [Oryza sativa Japonica Group]
Length = 395
Score = 101 bits (252), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 98/363 (26%), Positives = 146/363 (40%), Gaps = 56/363 (15%)
Query: 47 EHDARRQQRILAGVD-----LPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMW 101
+ AR + AG + PL G P G LYY + IG PP+ Y++ VDTGSD+ W
Sbjct: 26 DRPARGGLSVTAGAEESSAVFPLYGDVYPHG--LYYVAMSIGNPPRPYFLDVDTGSDLTW 83
Query: 102 VN----CIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLT---DC-T 153
+ C+ C + P + + + K V C + C ++GG LT C +
Sbjct: 84 LQCDAPCVSCSKVP-----------HPLYRPTKNKLVPCVDQMCAALHGG-LTGRHKCDS 131
Query: 154 ANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTN 213
C Y Y D S+ G V D + + L FGCG Q ST
Sbjct: 132 PKQQCDYEIKYADQGSSLGVLVTDSFALRLANSSI----VRPGLAFGCGYDQQVG-SSTE 186
Query: 214 EEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHVVQPEVNKT--P 271
A DG++G G + S++SQL G + + HCL GGG G + P T P
Sbjct: 187 VSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCLS-TRGGGFLFFGDDIVPYSRATWAP 245
Query: 272 LV--PNQPHYSINMTAVQVGLDFLNL-PTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLV 328
+ ++ +YS + G L + P +V + DSG++ Y Y+ LV
Sbjct: 246 MARSTSRNYYSPGSANLYFGGRPLGVRPMEV---------VFDSGSSFTYFSAQPYQALV 296
Query: 329 -------SKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFEN--SVSLKVYPHE 379
SK + + PD + F+ V + F V F N +++ P
Sbjct: 297 DAIKGDLSKNLKEVPDHSLPLCWKGKKPFKSVLDVKKEFRTVVLSFSNGKKALMEIPPEN 356
Query: 380 YLF 382
YL
Sbjct: 357 YLI 359
>gi|449451908|ref|XP_004143702.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
gi|449529900|ref|XP_004171936.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 459
Score = 101 bits (252), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 90/338 (26%), Positives = 144/338 (42%), Gaps = 33/338 (9%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y+ I +GTPP+ + DTGSD++WV C C+ C L + SS+
Sbjct: 84 GSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACRNCSHHPPSSAFLP----RHSSSF 139
Query: 132 KFVTCDQEFCHGVYGGPLTDCT---ANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDL 188
C C + P C ++ C +L Y DGS ++G+F ++ +SG
Sbjct: 140 SPFHCFDPHCRLLPHAPHHLCNHTRLHSPCRFLYSYADGSLSSGFFSKETTTLKSLSGS- 198
Query: 189 QTTSTNGSLIFGCGARQSG-NLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHC 247
G L FGCG R SG ++ G++G G+ + S SQL G + F++C
Sbjct: 199 -EIHLKG-LSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFSSQLGRRFGNK--FSYC 254
Query: 248 LDGIN-----------GGGIFAIGHVVQPEVNKTPLV--PNQPHYSINMTAVQVGLDFLN 294
L GGG+ ++ +++ TPL P P + +T + +D +
Sbjct: 255 LMDYTLSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTF-YYITIHSITIDGVK 313
Query: 295 LPTD--VFGVGD--NKGTIIDSGTTLAYLPEMVYEPLVSKIIS--QQPDLKVHTVHDEYT 348
LP + V+ + + N GT++DSGTTL YL + YE ++ + + P+ T +
Sbjct: 314 LPINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVKLPNAAELTPGFDLC 373
Query: 349 CFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFED 386
ES P + F P Y E+
Sbjct: 374 VNASGESRRPSLPRLRFRLGGGAVFAPPPRNYFLETEE 411
>gi|222642011|gb|EEE70143.1| hypothetical protein OsJ_30189 [Oryza sativa Japonica Group]
Length = 671
Score = 101 bits (252), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 75/267 (28%), Positives = 125/267 (46%), Gaps = 27/267 (10%)
Query: 75 LYYAKIGIGTPPKDYYVQVDTGSDIMWV--NCIQCKECPRRSSLGIELTLYDIKDSSTGK 132
L+YA + +GTP + V +DTGSD+ WV +C++C + ++ +Y S+T +
Sbjct: 34 LHYAVVALGTPNVTFLVALDTGSDLFWVPCDCLKCAPFQSPNYGSLKFDVYSPAQSTTSR 93
Query: 133 FVTCDQEFCHGVYGGPLTDC--TANTSCPY-LEIYGDGSSTTGYFVQDVVQYDKVSGDLQ 189
V C C L + + + SCPY ++ D +S++G V+DV+ S Q
Sbjct: 94 KVPCSSNLCD------LQNACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYL--TSDSAQ 145
Query: 190 TTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLD 249
+ ++FGCG Q+G+ A +G++G G + S+ S LAS G F+ C
Sbjct: 146 SKIVTAPIMFGCGQVQTGSF--LGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCF- 202
Query: 250 GINGGGIFAIGHVVQPEVNKTPL--VPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKG 307
G +G G G + +TPL P+Y+I +T + VG +
Sbjct: 203 GDDGHGRINFGDTGSSDQKETPLNVYKQNPYYNITITGITVGSK---------SISTEFS 253
Query: 308 TIIDSGTTLAYLPEMVYEPLVSKIISQ 334
I+DSGT+ L + +Y + S +Q
Sbjct: 254 AIVDSGTSFTALSDPMYTQITSSFDAQ 280
>gi|52076082|dbj|BAD46595.1| aspartic proteinase nepenthesin II -like [Oryza sativa Japonica
Group]
Length = 476
Score = 101 bits (251), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 75/267 (28%), Positives = 125/267 (46%), Gaps = 27/267 (10%)
Query: 75 LYYAKIGIGTPPKDYYVQVDTGSDIMWV--NCIQCKECPRRSSLGIELTLYDIKDSSTGK 132
L+YA + +GTP + V +DTGSD+ WV +C++C + ++ +Y S+T +
Sbjct: 61 LHYAVVALGTPNVTFLVALDTGSDLFWVPCDCLKCAPFQSPNYGSLKFDVYSPAQSTTSR 120
Query: 133 FVTCDQEFCHGVYGGPLTDC--TANTSCPY-LEIYGDGSSTTGYFVQDVVQYDKVSGDLQ 189
V C C L + + + SCPY ++ D +S++G V+DV+ S Q
Sbjct: 121 KVPCSSNLCD------LQNACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYL--TSDSAQ 172
Query: 190 TTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLD 249
+ ++FGCG Q+G+ A +G++G G + S+ S LAS G F+ C
Sbjct: 173 SKIVTAPIMFGCGQVQTGSF--LGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCF- 229
Query: 250 GINGGGIFAIGHVVQPEVNKTPL--VPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKG 307
G +G G G + +TPL P+Y+I +T + VG +
Sbjct: 230 GDDGHGRINFGDTGSSDQKETPLNVYKQNPYYNITITGITVGSK---------SISTEFS 280
Query: 308 TIIDSGTTLAYLPEMVYEPLVSKIISQ 334
I+DSGT+ L + +Y + S +Q
Sbjct: 281 AIVDSGTSFTALSDPMYTQITSSFDAQ 307
>gi|356551638|ref|XP_003544181.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 880
Score = 101 bits (251), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 97/359 (27%), Positives = 152/359 (42%), Gaps = 46/359 (12%)
Query: 75 LYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSS-----LGIELTLYDIKDSS 129
L+Y I IGTP + V +D GSD++WV C C EC S+ L +L Y S+
Sbjct: 104 LHYTWIDIGTPNVSFLVALDAGSDMLWVPC-DCIECASLSAGNYNVLDRDLNQYRPSLSN 162
Query: 130 TGKFVTCDQEFC--HGVYGGPLTDCTANTSCPYLEIYGDG-SSTTGYFVQDVVQYDKVSG 186
T + + C + C H V G + CPY Y +S++GY +D +
Sbjct: 163 TSRHLPCGHKLCDVHSVCKG------SKDPCPYAVQYSSANTSSSGYVFEDKLHLTSNGK 216
Query: 187 DLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAH 246
+ S S+I GCG +Q+G + DG++G G N S+ S LA +G ++ F+
Sbjct: 217 HAEQNSVQASIILGCGRKQTG--EYLRGAGPDGVLGLGPGNISVPSLLAKAGLIQNSFSI 274
Query: 247 CLDGINGGGIF--AIGHVVQPEVNKTPLVPNQPHYSINMTAVQ-VGLDFLNLPTDVFGVG 303
C + G I GHV Q + TP +P ++ + V+ + L L F
Sbjct: 275 CFEENESGRIIFGDQGHVTQ---HSTPFLPIDGKFNAYIVGVESFCVGSLCLKETRF--- 328
Query: 304 DNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFP-- 361
+IDSG++ +LP VY+ +V + Q + + C+ S P
Sbjct: 329 ---QALIDSGSSFTFLPNEVYQKVVIEFDKQVNATSIVLQNSWEYCYNASSQELISIPPL 385
Query: 362 ------NVTFHFENSVSLKVYPHEYLF------PFEDLWCIGWQNSGMQSR---DRKNM 405
N T+ +N + + EY P +D + QN M R DR+N+
Sbjct: 386 NLAFSRNQTYLIQNPIFIDPASQEYTIFCLPVSPSDDDYAAIGQNFLMGYRMVFDRENL 444
>gi|224130878|ref|XP_002320947.1| predicted protein [Populus trichocarpa]
gi|222861720|gb|EEE99262.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 101 bits (251), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 93/333 (27%), Positives = 149/333 (44%), Gaps = 32/333 (9%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKF 133
G Y + +GTPP DTGSD++W C C+ C ++ L+D K S T +
Sbjct: 93 GEYLMSLSLGTPPFKIMGIADTGSDLIWTQCKPCERCYKQVD-----PLFDPKSSKTYRD 147
Query: 134 VTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTST 193
+CD C + + C+ N C Y YGD S T G D + D +G + +
Sbjct: 148 FSCDARQCSLL---DQSTCSGNI-CQYQYSYGDRSYTMGNVASDTITLDSTTG---SPVS 200
Query: 194 NGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL----- 248
+ GCG D T + GI+G G S+ISQ+ SS G + F++CL
Sbjct: 201 FPKTVIGCGHEN----DGTFSDKGSGIVGLGAGPLSLISQMGSSVGGK--FSYCLVPLSS 254
Query: 249 -DGINGGGIFAIGHVVQ-PEVNKTPLVPNQ---PHYSINMTAVQVGLDFLNLPTDVFGVG 303
G + F VV P V TPL+ ++ Y + + A+ VG + + G G
Sbjct: 255 RAGNSSKLNFGSNAVVSGPGVQSTPLLSSETMSSFYFLTLEAMSVGNERIKFGDSSLGTG 314
Query: 304 DNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNV 363
+ IIDSGTTL +P+ + L S + Q + + + YS + D P +
Sbjct: 315 EGN-IIIDSGTTLTIVPDDFFSNL-STAVGNQVEGRRAEDPSGFLSVCYSATSDLKVPAI 372
Query: 364 TFHFENSVSLKVYP-HEYLFPFEDLWCIGWQNS 395
T HF + +K+ P + ++ +D+ C+ + ++
Sbjct: 373 TAHFTGA-DVKLKPINTFVQVSDDVVCLAFAST 404
>gi|357143680|ref|XP_003573011.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 510
Score = 101 bits (251), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 77/279 (27%), Positives = 126/279 (45%), Gaps = 32/279 (11%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y ++ +GTPP+ + + +DTGSD+ W+ C C +C ++D S++
Sbjct: 146 GSGEYLVEVYVGTPPRRFQMIMDTGSDLNWLQCAPCLDC-----FDQRGPVFDPMASTSY 200
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTS----CPYLEIYGDGSSTTGYFVQDVVQYDKVSGD 187
+ VTC C G+ P T +S CPY YGD S+TTG + + +
Sbjct: 201 RNVTCGDTRC-GLVSPPAAPRTCRSSRSDPCPYYYWYGDQSNTTGDLALEAFTVNLTASS 259
Query: 188 LQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHC 247
+ ++ GCG R G G+ S SQL + G F++C
Sbjct: 260 SRRVD---GVVLGCGHRNRGLFHGAAGLLGL-----GRGPLSFASQLRAVYG--HAFSYC 309
Query: 248 L----DGINGGGIFAIGHVV--QPEVNKTPLVPNQPH---YSINMTAVQVGLDFLNLPTD 298
L + +F +V+ P++N T P+ Y + + + VG + L++P++
Sbjct: 310 LVDHGSAVGSKIVFGDDNVLLSHPQLNYTAFAPSAAENTFYYVQLKGILVGGEMLDIPSN 369
Query: 299 VFGVGDNK---GTIIDSGTTLAYLPEMVYEPLVSKIISQ 334
+GV GTIIDSGTTL+Y PE Y+ + + +
Sbjct: 370 TWGVSKEDGSGGTIIDSGTTLSYFPEPAYKAIRQAFVDR 408
>gi|357124861|ref|XP_003564115.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 477
Score = 101 bits (251), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 87/309 (28%), Positives = 131/309 (42%), Gaps = 44/309 (14%)
Query: 48 HDARRQQRILAGVDLPLGGSSRPDGVG------LYYAKIGIGTPPKDYYVQVDTGSDIMW 101
HD + + D P+ R G G Y + +GTPP+ + +DTGSD++W
Sbjct: 65 HDEKEE-----AADRPVRARVRTAGAGGGIVTNEYLVHLSVGTPPRPVALTLDTGSDLVW 119
Query: 102 VNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDC------TAN 155
C C C + ++ + D SST V CD C + P T C
Sbjct: 120 TQCAPCLNCFDQGAIPV----LDPAASSTHAAVRCDAPVCRAL---PFTSCGRGGSSWGE 172
Query: 156 TSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEE 215
SC Y+ YGD S T G D + + L FGCG G + NE
Sbjct: 173 RSCVYVYHYGDKSITVGKLASDRFTFGPGDNADGGGVSERRLTFGCGHFNKGIFQA-NET 231
Query: 216 ALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHVVQP-------EVN 268
GI GFG+ S+ SQL + F++C + + V P +V
Sbjct: 232 ---GIAGFGRGRWSLPSQLGVTS-----FSYCFTSMFESTSSLVTLGVAPAELHLTGQVQ 283
Query: 269 KTPLV--PNQPH-YSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYE 325
TPL+ P+QP Y +++ A+ VG + +P + + IIDSG ++ LPE VYE
Sbjct: 284 STPLLRDPSQPSLYFLSLKAITVGATRIPIPERRQRLREAS-AIIDSGASITTLPEDVYE 342
Query: 326 PLVSKIISQ 334
+ ++ ++Q
Sbjct: 343 AVKAEFVAQ 351
>gi|15228044|ref|NP_181826.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|20197868|gb|AAM15292.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197965|gb|AAD21712.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330255100|gb|AEC10194.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 527
Score = 101 bits (251), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 97/346 (28%), Positives = 155/346 (44%), Gaps = 41/346 (11%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y+ + +GTPPK + + +DTGSD+ W+ C+ C +C ++ + YD K S++
Sbjct: 156 GSGEYFMDVLVGTPPKHFSLILDTGSDLNWLQCLPCYDCFHQNGM-----FYDPKTSASF 210
Query: 132 KFVTCDQEFCHGVYG-GPLTDCTA-NTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQ 189
K +TC+ C + P C + N SCPY YGD S+TTG F + + + +
Sbjct: 211 KNITCNDPRCSLISSPDPPVQCESDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTEGG 270
Query: 190 TTSTN-GSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL 248
++ G+++FGCG G + G S SQL S G F++CL
Sbjct: 271 SSEYKVGNMMFGCGHWNRGLFSGASGLLGLGRGPLSFS-----SQLQSLYG--HSFSYCL 323
Query: 249 ----DGINGGGIFAIGH----VVQPEVNKTPLVPNQPH-----YSINMTAVQVGLDFLNL 295
N G + +N T V + + Y I + ++ VG L++
Sbjct: 324 VDRNSNTNVSSKLIFGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILVGGKALDI 383
Query: 296 PTDVFGVGD--NKGTIIDSGTTLAYLPEMVYEPLVSKIISQQ----PDLKVHTVHDEYTC 349
P + + + + GTIIDSGTTL+Y E YE + +K + P + V D C
Sbjct: 384 PEETWNISSDGDGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYPIFRDFPVLDP--C 441
Query: 350 FQYS--ESVDEGFPNVTFHFENSVSLKVYPHE--YLFPFEDLWCIG 391
F S E + P + F + +P E +++ EDL C+
Sbjct: 442 FNVSGIEENNIHLPELGIAFVDGTVWN-FPAENSFIWLSEDLVCLA 486
>gi|357481199|ref|XP_003610885.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512220|gb|AES93843.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 416
Score = 101 bits (251), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 86/305 (28%), Positives = 139/305 (45%), Gaps = 31/305 (10%)
Query: 73 VGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGK 132
+G + +I IGTPP VDTGSD++W+ C C C ++ ++D SST
Sbjct: 65 IGQHLMEIYIGTPPIKITGLVDTGSDLIWIQCAPCLGCYKQIK-----PMFDPLKSSTYN 119
Query: 133 FVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTS 192
++CD CH + G C+ C Y YGD S T G QD + +G + S
Sbjct: 120 NISCDSPLCHKLDTG---VCSPEKRCNYTYGYGDNSLTKGVLAQDTATFTSNTGKPVSLS 176
Query: 193 TNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL---- 248
+FGCG +G N+ + G+IG G +S+ISQ+ G +K F+ CL
Sbjct: 177 ---RFLFGCGHNNTGGF---NDHEM-GLIGLGGGPTSLISQIGPLFGGKK-FSQCLVPFL 228
Query: 249 --DGINGGGIFAIG-HVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDN 305
I+ F G V+ V TPLVP + S +T + + ++ P +
Sbjct: 229 TDIKISSRMSFGKGSQVLGNGVVTTPLVPREKDTSYFVTLLGISVEDTYFPMN--STIGK 286
Query: 306 KGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEY---TCFQYSESVDEGFPN 362
++DSGT LP+ +Y+ + +++ ++ + + + D+ T Y + P
Sbjct: 287 ANMLVDSGTPPILLPQQLYDKVFAEVRNK---VALKPITDDPSLGTQLCYRTQTNLKGPT 343
Query: 363 VTFHF 367
+TFHF
Sbjct: 344 LTFHF 348
>gi|115465373|ref|NP_001056286.1| Os05g0557100 [Oryza sativa Japonica Group]
gi|113579837|dbj|BAF18200.1| Os05g0557100 [Oryza sativa Japonica Group]
gi|125553268|gb|EAY98977.1| hypothetical protein OsI_20935 [Oryza sativa Indica Group]
Length = 494
Score = 101 bits (251), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 96/359 (26%), Positives = 149/359 (41%), Gaps = 43/359 (11%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELT---------- 121
G G Y+ + +GTP + + + DTGSD+ WV C + P ++
Sbjct: 106 GTGQYFVRFRVGTPAQPFVLIADTGSDLTWVKC-RGAASPSHATATASPAAAPSPAVAPP 164
Query: 122 -LYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTS-CPYLEIYGDGSSTTGYFVQD-- 177
++ DS T + C E C L +C+++T+ C Y Y D S+ G D
Sbjct: 165 RVFRPGDSKTWSPIPCSSETCKSTIPFSLANCSSSTAACSYDYRYNDNSAARGVVGTDSA 224
Query: 178 VVQYDKVSGDLQTTSTNGSL---IFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQL 234
V G L + GC +G EA DG++ G SN S S+
Sbjct: 225 TVALSGGRGGGGGGDRKAKLQGVVLGCTTAHAGQ----GFEASDGVLSLGYSNISFASRA 280
Query: 235 ASSGGVRKMFAHCL-DGIN----------GGGIFAIGHVVQPEVNKTPLVPN---QPHYS 280
AS G R F++CL D + G G A ++TPL+ + +P Y+
Sbjct: 281 ASRFGGR--FSYCLVDHLAPRNATSYLTFGAGPDAASSSAPAPGSRTPLLLDARVRPFYA 338
Query: 281 INMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKV 340
+ + +V V L++P +V+ VG N GTIIDSGT+L L Y+ +V+ + Q L
Sbjct: 339 VAVDSVSVDGVALDIPAEVWDVGSNGGTIIDSGTSLTVLATPAYKAVVAALSEQLAGLPR 398
Query: 341 HTVHDEYTCFQYSESVDEG----FPNVTFHFENSVSLKVYPHEYLF-PFEDLWCIGWQN 394
+ C+ ++ D G P + F S L+ Y+ + CIG Q
Sbjct: 399 VAMDPFDYCYNWTARGDGGGDLAVPKLAVQFAGSARLEPPAKSYVIDAAPGVKCIGVQE 457
>gi|356538031|ref|XP_003537508.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 521
Score = 101 bits (251), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 88/323 (27%), Positives = 147/323 (45%), Gaps = 37/323 (11%)
Query: 75 LYYAKIGIGTPPKDYYVQVDTGSDIMWV--NCIQCKECPRR--SSLGIELTLYDIKDSST 130
L+Y I IGTP + V +D GSD++W+ +C+QC S+L +L Y S +
Sbjct: 96 LHYTWIDIGTPSTSFLVALDAGSDLLWIPCDCVQCAPLSSSYYSNLDRDLNEYSPSRSLS 155
Query: 131 GKFVTCDQEFCHGVYGGPLTDC-TANTSCPYLEIY-GDGSSTTGYFVQDVVQYDKVSGDL 188
K ++C C ++C ++ CPY+ Y + +S++G V+D++ + G L
Sbjct: 156 SKHLSCSHRLCDKG-----SNCKSSQQQCPYMVSYLSENTSSSGLLVEDILHL-QSGGTL 209
Query: 189 QTTSTNGSLIFGCGARQSGN-LDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHC 247
+S ++ GCG +QSG LD A DG++G G SS+ S LA SG + F+ C
Sbjct: 210 SNSSVQAPVVLGCGMKQSGGYLDGV---APDGLLGLGPGESSVPSFLAKSGLIHYSFSLC 266
Query: 248 LDGINGGGIF--AIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDN 305
+ + G +F G Q + PL Y I + + +G L + +
Sbjct: 267 FNEDDSGRMFFGDQGPTSQQSTSFLPLDGLYSTYIIGVESCCIGNSCLKM--------TS 318
Query: 306 KGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYT------CFQYSESVDEG 359
+DSGT+ +LP VY I+++ D +V+ + C+ S
Sbjct: 319 FKAQVDSGTSFTFLPGHVY-----GAITEEFDQQVNGSRSSFEGSPWEYCYVPSSQDLPK 373
Query: 360 FPNVTFHFENSVSLKVYPHEYLF 382
P+ T F+ + S VY ++F
Sbjct: 374 VPSFTLMFQRNNSFVVYDPVFVF 396
>gi|297834758|ref|XP_002885261.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297331101|gb|EFH61520.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 500
Score = 101 bits (251), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 105/385 (27%), Positives = 161/385 (41%), Gaps = 45/385 (11%)
Query: 23 SSNHGVFSVKYRYA--GRERS-LSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAK 79
SS + K R+A G +RS L + D R Q L + S G G Y+++
Sbjct: 110 SSRVAGIAAKIRFAVEGIDRSDLKPVNNEDTRYQPEALTTPVV----SGVSQGSGEYFSR 165
Query: 80 IGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQE 139
IG+GTP K+ Y+ +DTGSD+ W+ C C +C ++S +++ SST K +TC
Sbjct: 166 IGVGTPAKEMYLVLDTGSDVNWIQCEPCSDCYQQSD-----PVFNPTSSSTYKSLTCSAP 220
Query: 140 FCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIF 199
C + + C +N C Y YGDGS T G D V + SG + +
Sbjct: 221 QCSLL---ETSACRSN-KCLYQVSYGDGSFTVGELATDTVTFGN-SGKIN------DVAL 269
Query: 200 GCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAI 259
GCG G G S+ +Q+ ++ F++CL + G ++
Sbjct: 270 GCGHDNEGLFTGAAGLLGL-----GGGALSITNQMKATS-----FSYCLVDRDSGKSSSL 319
Query: 260 G-HVVQ--PEVNKTPLVPNQP---HYSINMTAVQVGLDFLNLPTDVFGV--GDNKGTIID 311
+ VQ PL+ NQ Y + ++ VG + +P +F V + G I+D
Sbjct: 320 DFNSVQLGSGDATAPLLRNQKIDTFYYVGLSGFSVGGQKVMMPDAIFDVDASGSGGVILD 379
Query: 312 SGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEY--TCFQYSESVDEGFPNVTFHFEN 369
GT + L Y L + +LK T TC+ +S P V FHF
Sbjct: 380 CGTAVTRLQTQAYNSLRDAFLKLTTNLKKGTSSISLFDTCYDFSSLSSVKVPTVAFHFTG 439
Query: 370 SVSLKVYPHEYLFPFED--LWCIGW 392
SL + YL P +D +C +
Sbjct: 440 GKSLDLPAKNYLIPVDDNGTFCFAF 464
>gi|242095586|ref|XP_002438283.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
gi|241916506|gb|EER89650.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
Length = 470
Score = 101 bits (251), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 95/339 (28%), Positives = 144/339 (42%), Gaps = 38/339 (11%)
Query: 57 LAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQC-KECPRRSS 115
LA V L G S GVG Y ++G+GTP Y + VDTGS + W+ C C C R+
Sbjct: 118 LASVPLTPGTSV---GVGNYVTELGLGTPATSYAMVVDTGSSLTWLQCSPCVVSCHRQVG 174
Query: 116 LGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPL--TDCTANTSCPYLEIYGDGSSTTGY 173
LYD + SST V C C + L + C+ C Y YGD S + GY
Sbjct: 175 -----PLYDPRASSTYATVPCSASQCDELQAATLNPSACSVRNVCIYQASYGDSSFSVGY 229
Query: 174 FVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQ 233
+D V + + + + +GCG G + G+IG ++ S++ Q
Sbjct: 230 LSRDTVSFG--------SGSYPNFYYGCGQDNEGLFGRSA-----GLIGLARNKLSLLYQ 276
Query: 234 LASSGGVRKMFAHCLDGINGGGIFAIGHVVQPEVNKTPLVPNQ---PHYSINMTAVQVGL 290
LA S G F++CL G +IG + TP+ + Y + ++ + VG
Sbjct: 277 LAPSLGYS--FSYCLPTPASTGYLSIGPYTSGHYSYTPMASSSLDASLYFVTLSGMSVGG 334
Query: 291 DFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLK---VHTVHDEY 347
L + + + TIIDSGT + LP VY L + + ++ ++ D
Sbjct: 335 SPLAVSPAEY---SSLPTIIDSGTVITRLPTAVYTALSKAVAAAMVGVQSAPAFSILD-- 389
Query: 348 TCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFED 386
TCFQ ++ P V F +LK+ L +D
Sbjct: 390 TCFQ-GQASQLRVPAVAMAFAGGATLKLATQNVLIDVDD 427
>gi|115480451|ref|NP_001063819.1| Os09g0542100 [Oryza sativa Japonica Group]
gi|113632052|dbj|BAF25733.1| Os09g0542100, partial [Oryza sativa Japonica Group]
Length = 490
Score = 101 bits (251), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 75/267 (28%), Positives = 125/267 (46%), Gaps = 27/267 (10%)
Query: 75 LYYAKIGIGTPPKDYYVQVDTGSDIMWV--NCIQCKECPRRSSLGIELTLYDIKDSSTGK 132
L+YA + +GTP + V +DTGSD+ WV +C++C + ++ +Y S+T +
Sbjct: 75 LHYAVVALGTPNVTFLVALDTGSDLFWVPCDCLKCAPFQSPNYGSLKFDVYSPAQSTTSR 134
Query: 133 FVTCDQEFCHGVYGGPLTDC--TANTSCPY-LEIYGDGSSTTGYFVQDVVQYDKVSGDLQ 189
V C C L + + + SCPY ++ D +S++G V+DV+ S Q
Sbjct: 135 KVPCSSNLCD------LQNACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYL--TSDSAQ 186
Query: 190 TTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLD 249
+ ++FGCG Q+G+ A +G++G G + S+ S LAS G F+ C
Sbjct: 187 SKIVTAPIMFGCGQVQTGSF--LGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCF- 243
Query: 250 GINGGGIFAIGHVVQPEVNKTPL--VPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKG 307
G +G G G + +TPL P+Y+I +T + VG +
Sbjct: 244 GDDGHGRINFGDTGSSDQKETPLNVYKQNPYYNITITGITVGSK---------SISTEFS 294
Query: 308 TIIDSGTTLAYLPEMVYEPLVSKIISQ 334
I+DSGT+ L + +Y + S +Q
Sbjct: 295 AIVDSGTSFTALSDPMYTQITSSFDAQ 321
>gi|356569916|ref|XP_003553140.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 560
Score = 101 bits (251), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 84/272 (30%), Positives = 121/272 (44%), Gaps = 30/272 (11%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y+ + +GTPPK + + +DTGSD+ W+ C+ C C ++ YD KDSS+
Sbjct: 191 GSGEYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCYACFEQNG-----PYYDPKDSSSF 245
Query: 132 KFVTCDQEFCHGVYG-GPLTDCTANT-SCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQ 189
K +TC C V P C T SCPY YGD S+TTG F + + + + +
Sbjct: 246 KNITCHDPRCQLVSSPDPPQPCKGETQSCPYFYWYGDSSNTTGDFALETFTVNLTTPEGK 305
Query: 190 TT-STNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL 248
+++FGCG G G+ S +QL S G F++CL
Sbjct: 306 PELKIVENVMFGCGHWNRGLFHGAAGLLGL-----GRGPLSFATQLQSLYG--HSFSYCL 358
Query: 249 DGINGGG------IFAIGH--VVQPEVNKTPLV-----PNQPHYSINMTAVQVGLDFLNL 295
N IF + P +N T V P Y + + ++ VG + L +
Sbjct: 359 VDRNSNSSVSSKLIFGEDKELLSHPNLNFTSFVGGKENPVDTFYYVLIKSIMVGGEVLKI 418
Query: 296 PTDVFGVGDN--KGTIIDSGTTLAYLPEMVYE 325
P + + + GTIIDSGTTL Y E YE
Sbjct: 419 PEETWHLSAQGGGGTIIDSGTTLTYFAEPAYE 450
>gi|356522749|ref|XP_003530008.1| PREDICTED: lysine-specific histone demethylase 1 homolog 1-like
[Glycine max]
Length = 1336
Score = 101 bits (251), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 82/283 (28%), Positives = 133/283 (46%), Gaps = 37/283 (13%)
Query: 62 LPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNC-IQCKECPRRSSLGIEL 120
P+ G+ PDG LY+ + +G PPK Y++ VDTGSD+ W+ C C+ C + + + +
Sbjct: 182 FPVSGNVYPDG--LYFTILRVGNPPKSYFLDVDTGSDLTWMQCDAPCRSCGKGAHVQYKP 239
Query: 121 TLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQ 180
T ++ S + + +G + L C Y Y D SS+ G V+D
Sbjct: 240 TRSNVVSSVDSLCLDVQKNQKNGHHDESLLQCD------YEIQYADHSSSLGVLVRD--- 290
Query: 181 YDKVSGDLQTTSTNGS-----LIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLA 235
+L +TNGS ++FGCG Q G + +T + DGI+G ++ S+ QLA
Sbjct: 291 ------ELHLVTTNGSKTKLNVVFGCGYDQEGLILNTLAKT-DGIMGLSRAKVSLPYQLA 343
Query: 236 SSGGVRKMFAHCL--DGINGGGIFAIGHVVQPE--VNKTPLVPNQPHYSINMTAVQ---V 288
S G ++ + HCL DG GG +F +G P +N P+ Y++ Q +
Sbjct: 344 SKGLIKNVVGHCLSNDGAGGGYMF-LGDDFVPYWGMNWVPMA-----YTLTTDLYQTEIL 397
Query: 289 GLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKI 331
G+++ N G DSG++ Y P+ Y LV+ +
Sbjct: 398 GINYGNRQLKFDGQSKVGKVFFDSGSSYTYFPKEAYLDLVASL 440
>gi|302774174|ref|XP_002970504.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
gi|300162020|gb|EFJ28634.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
Length = 483
Score = 101 bits (251), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 92/347 (26%), Positives = 145/347 (41%), Gaps = 39/347 (11%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y+ ++G+GTP + ++ VDTGSD+ W+ C CK C +++ ++D ++SS+
Sbjct: 125 GSGEYFVRLGVGTPARSLFMVVDTGSDLPWLQCQPCKSCYKQAD-----PIFDPRNSSSF 179
Query: 132 KFVTCDQEFCHGVYGGPLTDCT----ANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGD 187
+ + C C + + C+ A + C Y YGDGS + G F D+
Sbjct: 180 QRIPCLSPLCKALE---IHSCSGSRGATSRCSYQVAYGDGSFSVGDFSSDLF-------T 229
Query: 188 LQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHC 247
L T S S+ FGCG G G S S I +++ F++C
Sbjct: 230 LGTGSKAMSVAFGCGFDNEGLFAGAAGLLGLGAGKL--SFPSQIFASSTNSSTANSFSYC 287
Query: 248 L-DGIN----GGGIFAIGHVVQPEVNK-TPLVPN---QPHYSINMTAVQVGLDFL--NLP 296
L D N G P +PL+ N Y M V VG L +L
Sbjct: 288 LVDRSNPMTRSSSSLIFGAAAIPSTAALSPLLKNPKLDTFYYAAMIGVSVGGAQLPISLK 347
Query: 297 TDVFGVGDNKGTIIDSGTTLAYLPEMVYEPL---VSKIISQQPDLKVHTVHDEYTCFQYS 353
+ + G IIDSGT++ P VY + + P +++ D TC+ +S
Sbjct: 348 SLQLSQSGSGGVIIDSGTSVTRFPTSVYATIRDAFRNATTNLPSAPRYSLFD--TCYNFS 405
Query: 354 ESVDEGFPNVTFHFENSVSLKVYPHEYLFPFEDL--WCIGWQNSGMQ 398
P + HFEN L++ P YL P +C+ + + M+
Sbjct: 406 GKASVDVPALVLHFENGADLQLPPTNYLIPINTAGSFCLAFAPTSME 452
>gi|224142011|ref|XP_002324354.1| predicted protein [Populus trichocarpa]
gi|222865788|gb|EEF02919.1| predicted protein [Populus trichocarpa]
Length = 471
Score = 100 bits (250), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 82/309 (26%), Positives = 136/309 (44%), Gaps = 31/309 (10%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCK-EC-PRRSSLGIELTLYDIKDSSTG 131
G Y +G+GTP KD+ + DTGSD+ W C C C P+ +D S++
Sbjct: 130 GGYAVTVGLGTPKKDFSLLFDTGSDLTWTQCEPCSGGCFPQNDE------KFDPTKSTSY 183
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
K ++C E C + C+++ SC Y YG G T G+ + + + +
Sbjct: 184 KNLSCSSEPCKSIGKESAQGCSSSNSCLYGVKYGTG-YTVGFLATETLT-------ITPS 235
Query: 192 STNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI 251
+ + GCG R G T G++G G+S ++ SQ +S+ + +F++CL
Sbjct: 236 DVFENFVIGCGERNGGRFSGT-----AGLLGLGRSPVALPSQTSST--YKNLFSYCLPAS 288
Query: 252 NGG-GIFAIGHVVQPEVNKTPLVPNQPH-YSINMTAVQVGLDFLNLPTDVFGVGDNKGTI 309
+ G + G V TP+ P Y ++++ + VG L + VF GTI
Sbjct: 289 SSSTGHLSFGGGVSQAAKFTPITSKIPELYGLDVSGISVGGRKLPIDPSVF---RTAGTI 345
Query: 310 IDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHT-VHDEYTCFQYSESVDEG--FPNVTFH 366
IDSGTTL YLP + L S + + C+ +S+ ++ P ++
Sbjct: 346 IDSGTTLTYLPSTAHSALSSAFQEMMTNYTLTKGTSGLQPCYDFSKHANDNITIPQISIF 405
Query: 367 FENSVSLKV 375
FE V + +
Sbjct: 406 FEGGVEVDI 414
>gi|20197342|gb|AAC34482.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
Length = 353
Score = 100 bits (250), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 90/317 (28%), Positives = 141/317 (44%), Gaps = 51/317 (16%)
Query: 79 KIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQ 138
++ IG P Y VDTGSD++W C C EC + + ++D + SS+ V C
Sbjct: 2 ELSIGNPAVKYSAIVDTGSDLIWTQCKPCTECFDQPT-----PIFDPEKSSSYSKVGCSS 56
Query: 139 EFCHGVYGGPLTDCTANT-SCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSL 197
C+ + P ++C + +C YL YGD SST G + ++ D + S G
Sbjct: 57 GLCNAL---PRSNCNEDKDACEYLYTYGDYSSTRGLLATETFTFE----DENSISGIG-- 107
Query: 198 IFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGIN----- 252
FGCG G+ S G++G G+ S+ISQL + F++CL I
Sbjct: 108 -FGCGVENEGDGFSQG----SGLVGLGRGPLSLISQLKET-----KFSYCLTSIEDSEAS 157
Query: 253 ---------GGGIFAIGHVVQPEVNKTPLV---PNQPH-YSINMTAVQVGLDFLNLPTDV 299
G + G + EV KT + P+QP Y + + + VG L++
Sbjct: 158 SSLFIGSLASGIVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKST 217
Query: 300 FGVGDN--KGTIIDSGTTLAYLPEMVYEPLVSKIISQQP---DLKVHTVHDEYTCFQYSE 354
F + ++ G IIDSGTT+ YL E ++ L + S+ D T D CF+ +
Sbjct: 218 FELAEDGTGGMIIDSGTTITYLEETAFKVLKEEFTSRMSLPVDDSGSTGLD--LCFKLPD 275
Query: 355 SVDE-GFPNVTFHFENS 370
+ P + FHF+ +
Sbjct: 276 AAKNIAVPKMIFHFKGA 292
>gi|449451627|ref|XP_004143563.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 532
Score = 100 bits (250), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 96/322 (29%), Positives = 140/322 (43%), Gaps = 33/322 (10%)
Query: 75 LYYAKIGIGTPPKDYYVQVDTGSDIMWV--NCIQCKECPRR--SSLGIELTLYDIKDSST 130
L+Y I IGTP + V +D GSD++WV NCIQC SL +L Y SST
Sbjct: 102 LHYTWIDIGTPSVSFLVALDAGSDLLWVPCNCIQCAPLSASYYGSLDKDLNEYRPSSSST 161
Query: 131 GKFVTCDQEFCHGVYGGPLTDC-TANTSCPYLEIY-GDGSSTTGYFVQDVVQYDKVSGDL 188
K ++C C C + SCPY+ Y + +S++G +QDV+ +
Sbjct: 162 SKHISCSHNLCDSG-----QSCQSPKQSCPYVIDYITENTSSSGLLIQDVLHLSSGCENS 216
Query: 189 QTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL 248
+ +I GCG +QSG S A DG+ G G S++S LA V+ F+ C
Sbjct: 217 SNCTIQAPVILGCGMKQSGGYLSG--VAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCF 274
Query: 249 DGINGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGT 308
+ G IF G T VP Y + VG++ + K
Sbjct: 275 NEDGSGRIF-FGDEGPASQQTTSFVPLDGKYETYI----VGVEACCIENSCLKQTSFKA- 328
Query: 309 IIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTV-------HDEYTCFQYSESVDEGFP 361
+IDSGT+ YLPE YE +V + D +++T + C++ S P
Sbjct: 329 LIDSGTSFTYLPEEAYENIVIEF-----DKRLNTTSAVSFKGYPWKYCYKISADAMPKVP 383
Query: 362 NVTFHFENSVSLKVYPHEYLFP 383
+VT F + S V H+ +FP
Sbjct: 384 SVTLLFPLNNSFVV--HDPVFP 403
>gi|356499344|ref|XP_003518501.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 561
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 87/279 (31%), Positives = 123/279 (44%), Gaps = 44/279 (15%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y+ + +GTPPK + + +DTGSD+ W+ C+ C C +S YD KDSS+
Sbjct: 193 GSGEYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCIACFEQSG-----PYYDPKDSSSF 247
Query: 132 KFVTCDQEFCHGVYG-GPLTDCTA-NTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQ 189
+ ++C C V P C A N SCPY YGDGS+TTG F + +
Sbjct: 248 RNISCHDPRCQLVSAPDPPKPCKAENQSCPYFYWYGDGSNTTGDFALETFT-------VN 300
Query: 190 TTSTNGS--------LIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVR 241
T+ NG+ ++FGCG G GK S SQ+ S G
Sbjct: 301 LTTPNGTSELKHVENVMFGCGHWNRGLFHGAAGLLGL-----GKGPLSFASQMQSLYG-- 353
Query: 242 KMFAHCLDGINGGG------IFAIGH--VVQPEVNKTPLVPNQ-----PHYSINMTAVQV 288
+ F++CL N IF + P +N T + Y + + +V V
Sbjct: 354 QSFSYCLVDRNSNASVSSKLIFGEDKELLSHPNLNFTSFGGGKDGSVDTFYYVQIKSVMV 413
Query: 289 GLDFLNLPTDVFGVGDN--KGTIIDSGTTLAYLPEMVYE 325
+ L +P + + + GTIIDSGTTL Y E YE
Sbjct: 414 DDEVLKIPEETWHLSSEGAGGTIIDSGTTLTYFAEPAYE 452
>gi|297819834|ref|XP_002877800.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297323638|gb|EFH54059.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 531
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 78/259 (30%), Positives = 118/259 (45%), Gaps = 27/259 (10%)
Query: 75 LYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRR-SSLG----IELTLYDIKDSS 129
LYYA + +GTPP + V +DTGSD+ W+ C C R +G + L LY S+
Sbjct: 101 LYYANVSVGTPPSSFLVALDTGSDLFWLPCNCGTTCIRDLEDIGVPQSVPLNLYTPNAST 160
Query: 130 TGKFVTCDQEFCHGVYGGPLTDCTANTS-CPYLEIYGDGSSTTGYFVQDVVQYDKVSGDL 188
T + C + C G C++ S CPY Y + + TTG +QDV+ +L
Sbjct: 161 TSSSIRCSDKRCFGS-----KKCSSPKSICPYQISYSNSTGTTGTLLQDVLHLATEDENL 215
Query: 189 QTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL 248
TN +L GCG +Q+G N +++G++G G S+ S LA + F+ C
Sbjct: 216 TPVKTNVTL--GCGQKQTGLFQRNN--SVNGVLGLGIKGYSVPSLLAKANITADSFSMCF 271
Query: 249 DGINGG-GIFAIGHVVQPEVNKTPLVPNQPH--YSINMTAVQVGLDFLNLPTDVFGVGDN 305
+ G G + G + +TP + P Y +N+T V VG D VG
Sbjct: 272 GRVIGNVGRISFGDKGYTDQEETPFISVAPSTAYGLNVTGVSVGGD---------PVGTR 322
Query: 306 KGTIIDSGTTLAYLPEMVY 324
D+G++ +L E Y
Sbjct: 323 LFAKFDTGSSFTHLMEPAY 341
>gi|108862257|gb|ABA96612.2| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
Length = 451
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 98/363 (26%), Positives = 146/363 (40%), Gaps = 56/363 (15%)
Query: 47 EHDARRQQRILAGVD-----LPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMW 101
+ AR + AG + PL G P G LYY + IG PP+ Y++ VDTGSD+ W
Sbjct: 26 DRPARGGLSVTAGAEESSAVFPLYGDVYPHG--LYYVAMSIGNPPRPYFLDVDTGSDLTW 83
Query: 102 VN----CIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLT---DC-T 153
+ C+ C + P + + + K V C + C ++GG LT C +
Sbjct: 84 LQCDAPCVSCSKVP-----------HPLYRPTKNKLVPCVDQMCAALHGG-LTGRHKCDS 131
Query: 154 ANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTN 213
C Y Y D S+ G V D + + L FGCG Q ST
Sbjct: 132 PKQQCDYEIKYADQGSSLGVLVTDSFALRLANSSI----VRPGLAFGCGYDQQVG-SSTE 186
Query: 214 EEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHVVQPEVNKT--P 271
A DG++G G + S++SQL G + + HCL GGG G + P T P
Sbjct: 187 VSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCLS-TRGGGFLFFGDDIVPYSRATWAP 245
Query: 272 LV--PNQPHYSINMTAVQVGLDFLNL-PTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLV 328
+ ++ +YS + G L + P +V + DSG++ Y Y+ LV
Sbjct: 246 MARSTSRNYYSPGSANLYFGGRPLGVRPMEV---------VFDSGSSFTYFSAQPYQALV 296
Query: 329 -------SKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFEN--SVSLKVYPHE 379
SK + + PD + F+ V + F V F N +++ P
Sbjct: 297 DAIKGDLSKNLKEVPDHSLPLCWKGKKPFKSVLDVKKEFRTVVLSFSNGKKALMEIPPEN 356
Query: 380 YLF 382
YL
Sbjct: 357 YLI 359
>gi|297736090|emb|CBI24128.3| unnamed protein product [Vitis vinifera]
Length = 378
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 91/345 (26%), Positives = 153/345 (44%), Gaps = 36/345 (10%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNC---IQCKECPRRSSLGIE-LTLYDIKD 127
G+G Y +GTP + + + DTGSD+ W++C + + C R + I ++
Sbjct: 8 GIGQYSVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRVFHANL 67
Query: 128 SSTGKFVTCDQEFCHGVYGG--PLTDC-TANTSCPYLEIYGDGSSTTGYFVQDVVQYDKV 184
SS+ K + C + C LT+C T T C Y Y DGS+ G+F + V +
Sbjct: 68 SSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANETVTVELK 127
Query: 185 SGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMF 244
G +++ GC S + + +A DG++G G S S + A G + F
Sbjct: 128 EGRKMKLH---NVLIGC----SESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGGK--F 178
Query: 245 AHCL----DGINGGGIFAIGHVVQPE-----VNKTPLVPNQPH--YSINMTAVQVGLDFL 293
++CL N G E + T LV + Y++NM + +G L
Sbjct: 179 SYCLVDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAML 238
Query: 294 NLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHD-----EYT 348
+P++V+ V GTI+DSG++L +L E Y+P+++ + + LK V EY
Sbjct: 239 KIPSEVWDVKGAGGTILDSGSSLTFLTEPAYQPVMAAL--RVSLLKFRKVEMDIGPLEY- 295
Query: 349 CFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFED-LWCIGW 392
CF + + P + FHF + + Y+ D + C+G+
Sbjct: 296 CFNSTGFEESLVPRLVFHFADGAEFEPPVKSYVISAADGVRCLGF 340
>gi|449533544|ref|XP_004173734.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
1-like, partial [Cucumis sativus]
Length = 408
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 105/355 (29%), Positives = 151/355 (42%), Gaps = 40/355 (11%)
Query: 49 DARRQQRILAG---VDLPLGGSSR----PDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMW 101
D RRQ+ L + P GS D L+Y I IGTP + V +D GSD++W
Sbjct: 69 DFRRQKMKLGSRFQLLFPSEGSXTIALGNDFGWLHYTWIDIGTPSVSFLVALDAGSDLLW 128
Query: 102 V--NCIQCK--ECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDC-TANT 156
V NCIQC SL +L Y SST K ++C C C +
Sbjct: 129 VPCNCIQCAPLSASYYGSLDKDLNEYRPSSSSTSKHISCSHNLCDSG-----QSCQSPKQ 183
Query: 157 SCPYLEIY-GDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEE 215
SCPY+ Y + +S++G +QDV+ + + +I GCG +QSG S
Sbjct: 184 SCPYVIDYITENTSSSGLLIQDVLHLSSGCENSSNCTIQAPVILGCGMKQSGGYLSG--V 241
Query: 216 ALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHVVQPEVNKTPLVPN 275
A DG+ G G S++S LA V+ F+ C + G IF G T VP
Sbjct: 242 APDGLFGLGLGEISVLSSLAKEELVQNSFSLCFNEDGSGRIF-FGDEGPASQQTTSFVPL 300
Query: 276 QPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQ 335
Y + VG++ + K +IDSGT+ YLPE YE +V +
Sbjct: 301 DGKYETYI----VGVEACCIENSCLKQTSFKA-LIDSGTSFTYLPEEAYENIVIEF---- 351
Query: 336 PDLKVHTV-------HDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFP 383
D +++T + C++ S P+VT F + S V H+ +FP
Sbjct: 352 -DKRLNTTSAVSFKGYPWKYCYKISADAMPKVPSVTLLFPLNNSFVV--HDPVFP 403
>gi|357122568|ref|XP_003562987.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 455
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 98/376 (26%), Positives = 157/376 (41%), Gaps = 69/376 (18%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQC--------KECPRRSSLGIELTLYDI 125
G Y + IGTPP Y DTGSD++W C C +C ++S LY+
Sbjct: 85 GEYIMTLSIGTPPLSYRAIADTGSDLIWTQCAPCGDTVTDTDNQCFKQSGC-----LYNP 139
Query: 126 KDSSTGKFVTCDQ--EFCHGVYG-GPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYD 182
S+T + C+ C + G P C +C Y + YG G T G VQ V +
Sbjct: 140 SSSTTFGVLPCNSPLSMCAAMAGPSPPPGC----ACMYNQTYGTG-WTAG--VQSVETFT 192
Query: 183 KVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRK 242
S ++ FGC N S + G++G G+ + S++SQL +
Sbjct: 193 FGSSSTPPAVRVPNIAFGC-----SNASSNDWNGSAGLVGLGRGSMSLVSQLGAGA---- 243
Query: 243 MFAHCLDGI---NGGGIFAIGHVVQPE------VNKTPLV------PNQPHYSINMTAVQ 287
F++CL N +G V TP V P +Y +N+T +
Sbjct: 244 -FSYCLTPFQDANSTSTLLLGPSAAAALKGTGPVRSTPFVAGPSKAPMSTYYYLNLTGIS 302
Query: 288 VGLDFLNLPTDVFGVGDN--KGTIIDSGTTLAYLPEMVYEP--------LVSKI-ISQQP 336
VG L +P D F + + G IIDSGTT+ L + Y+ LV+++ ++ P
Sbjct: 303 VGETALAIPPDAFSLRADGTGGLIIDSGTTITTLVDSAYQQVRAAVRSLLVTRLPLAHGP 362
Query: 337 DLKVHTVHDEYTCFQYSESV-DEGFPNVTFHFENSVSLKVYPHEYLFPFEDLWCIGWQNS 395
D H+ + CF S P++T HFE + + Y+ +WC+ +N
Sbjct: 363 D---HSTGLDL-CFALKASTPPPAMPSMTLHFEGGADMVLPVENYMILGSGVWCLAMRNQ 418
Query: 396 GMQSRDRKNMTLLGDF 411
+ + M+++G++
Sbjct: 419 TVGA-----MSMVGNY 429
>gi|356553775|ref|XP_003545228.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 559
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 84/272 (30%), Positives = 124/272 (45%), Gaps = 30/272 (11%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y+ + +GTPPK + + +DTGSD+ W+ C+ C C +S YD KDSS+
Sbjct: 191 GSGEYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCIACFEQSG-----PYYDPKDSSSF 245
Query: 132 KFVTCDQEFCHGVYG-GPLTDCTA-NTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQ 189
+ ++C C V P C A N SCPY YGDGS+TTG F + + + + +
Sbjct: 246 RNISCHDPRCQLVSSPDPPNPCKAENQSCPYFYWYGDGSNTTGDFALETFTVNLTTPNGK 305
Query: 190 TTSTN-GSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL 248
+ + +++FGCG G GK S SQ+ S G + F++CL
Sbjct: 306 SELKHVENVMFGCGHWNRGLFHGAAGLLGL-----GKGPLSFASQMQSLYG--QSFSYCL 358
Query: 249 DGINGGG------IFAIGH--VVQPEVNKTPLVPNQ-----PHYSINMTAVQVGLDFLNL 295
N IF + P +N T + Y + + +V V + L +
Sbjct: 359 VDRNSNASVSSKLIFGEDKELLSHPNLNFTSFGGGKDGSVDTFYYVQINSVMVDDEVLKI 418
Query: 296 PTDVFGVGDN--KGTIIDSGTTLAYLPEMVYE 325
P + + + GTIIDSGTTL Y E YE
Sbjct: 419 PEETWHLSSEGAGGTIIDSGTTLTYFAEPAYE 450
>gi|242050430|ref|XP_002462959.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
gi|241926336|gb|EER99480.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
Length = 448
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 113/444 (25%), Positives = 178/444 (40%), Gaps = 72/444 (16%)
Query: 11 IVLIATAAVGGVSSNHGVFSVKYR--YAGRERSLSLLKEHDARRQQRILAGVDLP----- 63
+V+ AT A G S G+ + E L+ R+Q R L G +L
Sbjct: 17 LVVCATLASGAASVRVGLTRIHSDPDITAPEFVRDALRRDMHRQQSRSLFGRELAESDGT 76
Query: 64 -LGGSSR---PDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIE 119
+ +R P+G G Y + IGTPP Y DTGSD++W QC C
Sbjct: 77 TVSARTRKDLPNG-GEYLMTLSIGTPPLSYPAIADTGSDLIWT---QCAPCSGDQCFAQP 132
Query: 120 LTLYDIKDSSTGKFVTCDQEF--CHGVYGG--PLTDCTANTSCPYLEIYGDGSSTTGYFV 175
LY+ S+T + C+ C GV G P C +C Y + YG G T G
Sbjct: 133 APLYNPASSTTFGVLPCNSSLSMCAGVLAGKAPPPGC----ACMYNQTYGTG-WTAGVQG 187
Query: 176 QDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLA 235
+ + + D + FGC N S++ G++G G+ + S++SQL
Sbjct: 188 SETFTFGSAAADQARVP---GIAFGC-----SNASSSDWNGSAGLVGLGRGSLSLVSQL- 238
Query: 236 SSGGVRKMFAHCLD-----------------GINGGGIFAIGHVVQPEVNKTPLVPNQPH 278
G R F++CL +NG G+ + V P K P+ +
Sbjct: 239 --GAGR--FSYCLTPFQDTNSTSTLLLGPSAALNGTGVRSTPFVASPA--KAPM---STY 289
Query: 279 YSINMTAVQVGLDFLNLPTDVFGVGDN--KGTIIDSGTTLAYLPEMVYEPLVSKIIS--- 333
Y +N+T + +G L++ D F + + G IIDSGTT+ L Y+ + + + S
Sbjct: 290 YYLNLTGISLGAKALSISPDAFSLKADGTGGLIIDSGTTITSLVNAAYQQVRAAVQSLVT 349
Query: 334 -QQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFEDLWCIGW 392
D T D S P++T HF+ + + + Y+ +WC+
Sbjct: 350 LPAIDGSDSTGLDLCYALPTPTSAPPAMPSMTLHFDGA-DMVLPADSYMISGSGVWCLAM 408
Query: 393 QN------SGMQSRDRKNMTLLGD 410
+N S + ++NM +L D
Sbjct: 409 RNQTDGAMSTFGNYQQQNMHILYD 432
>gi|414885970|tpg|DAA61984.1| TPA: hypothetical protein ZEAMMB73_915310 [Zea mays]
Length = 523
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 89/334 (26%), Positives = 133/334 (39%), Gaps = 44/334 (13%)
Query: 53 QQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPR 112
Q GV LP R G Y +G+GTP +D V DTGSD+ WV C C C +
Sbjct: 166 QSSASKGVSLPAHRGLRL-GTANYIVSVGLGTPRRDLLVVFDTGSDLSWVQCKPCNNCYK 224
Query: 113 RSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTG 172
+ L+D S+T V C + C + ++ C Y +YGD S T G
Sbjct: 225 QHD-----PLFDPSQSTTYSAVPCGAQECLD------SGTCSSGKCRYEVVYGDMSQTDG 273
Query: 173 YFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMIS 232
+D + S LQ +FGCG +G DG+ G G+ S+ S
Sbjct: 274 NLARDTLTLGPSSDQLQ------GFVFGCGDDDTGLFGRA-----DGLFGLGRDRVSLAS 322
Query: 233 QLASSGGVRKMFAHCL-DGINGGGIFAIGHVVQ-PEVNKTPLVPNQ---PHYSINMTAVQ 287
Q A+ G F++CL G ++G P T +V Y +++ ++
Sbjct: 323 QAAARYGA--GFSYCLPSSWRAEGYLSLGSAAAPPHAQFTAMVTRSDTPSFYYLDLVGIK 380
Query: 288 VGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKII------SQQPDLKVH 341
V + + VF GT+IDSGT + LP Y L S + P L +
Sbjct: 381 VAGRTVRVAPAVF---KAPGTVIDSGTVITRLPSRAYSALRSSFAGFMRRYKRAPALSIL 437
Query: 342 TVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKV 375
TC+ ++ P+V F+ +L +
Sbjct: 438 D-----TCYDFTGRTKVQIPSVALLFDGGATLNL 466
>gi|357481205|ref|XP_003610888.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512223|gb|AES93846.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 413
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 92/339 (27%), Positives = 157/339 (46%), Gaps = 41/339 (12%)
Query: 73 VGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGK 132
+G Y ++ IGTPP VDTGSD++WV C+ C C + + ++D SST
Sbjct: 61 IGQYLMELYIGTPPIKISGTVDTGSDLIWVQCVPCLGCYNQIN-----PMFDPLKSSTYT 115
Query: 133 FVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTS 192
++CD C+ Y G +C+ C Y Y D S T G Q+ V +G + S
Sbjct: 116 NISCDSPLCYKPYIG---ECSPEKRCDYTYGYADSSLTKGVLAQETVTLTSNTG--KPIS 170
Query: 193 TNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL---- 248
G ++FGCG +GN N+ + G+IG G +S++SQ+ G +K F+ CL
Sbjct: 171 LQG-ILFGCGHNNTGNF---NDHEM-GLIGLGGGPTSLVSQIGPLFGGKK-FSQCLVPFL 224
Query: 249 --DGINGGGIFAIGHVVQPE-VNKTPLVPNQ---PHYSINMTAVQVGLDFLNLPTDVFGV 302
I+ F G V E V TPLV + Y + + + V +L + + +
Sbjct: 225 TDITISSQMSFGKGSEVLGEGVVTTPLVQREQDMTSYYVTLLGISVEDTYLPMNSTI--- 281
Query: 303 GDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTC---FQYSESVDEG 359
+ ++DSGT LP+ +Y+ + ++ ++ P + + D+ + Y +
Sbjct: 282 -EKGNMLVDSGTPPNILPQQLYDRVYVEVKNKVP---LEPITDDPSLGPQLCYRTQTNLK 337
Query: 360 FPNVTFHFENSVSLKVYPHEYLFP----FEDLWCIGWQN 394
P +T+HFE + +L + P + P + ++C+ N
Sbjct: 338 GPTLTYHFEGA-NLLLTPIQTFIPPTPETKGVFCLAITN 375
>gi|357481191|ref|XP_003610881.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512216|gb|AES93839.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 427
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 93/333 (27%), Positives = 144/333 (43%), Gaps = 32/333 (9%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKF 133
G Y K+ +G+PP D Y VDTGSD++W C C C R+ S +++ S T
Sbjct: 80 GDYLMKLTLGSPPVDIYGLVDTGSDLVWAQCTPCGGCYRQKS-----PMFEPLRSKTYSP 134
Query: 134 VTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTST 193
+ C+ E C +G C+ C Y Y D S T G ++ + + GD
Sbjct: 135 IPCESEQC-SFFG---YSCSPQKMCAYSYSYADSSVTKGVLAREAITFSSTDGDPVVV-- 188
Query: 194 NGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL----- 248
G +IFGCG SG + + + S++SQ+ + G ++ F+ CL
Sbjct: 189 -GDIIFGCGHSNSGTFNENDMGIIGMG----GGPLSLVSQIGTLYGSKR-FSQCLVPFHT 242
Query: 249 DGINGGGI-FAIGHVVQPE-VNKTPLVPN--QPHYSINMTAVQVGLDFLNL-PTDVFGVG 303
D G I F V E V TPL Q Y + + + VG F+ ++ G
Sbjct: 243 DAHTSGTINFGEESDVSGEGVVTTPLASEEGQTSYLVTLEGISVGDTFVRFNSSETLSKG 302
Query: 304 DNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNV 363
+ +IDSGT Y+P+ YE LV ++ Q L + D T Y + P +
Sbjct: 303 N---IMIDSGTPATYIPQEFYERLVEELKVQSSLLPIEDDPDLGTQLCYRSETNLEGPIL 359
Query: 364 TFHFENSVSLKVYPHEYLFPFED-LWCIGWQNS 395
T HFE + +++ P + P +D ++C S
Sbjct: 360 TAHFEGA-DVQLLPIQTFIPPKDGVFCFAMAGS 391
>gi|255563827|ref|XP_002522914.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223537841|gb|EEF39457.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 100/364 (27%), Positives = 165/364 (45%), Gaps = 53/364 (14%)
Query: 29 FSVKYRYAGRERSLSLLK--EHDARRQQRILAGVD-LPLGGSSRPD-------GVGLYYA 78
F + ++ +++L+ + +H +R L ++ + L SS + G G +
Sbjct: 43 FRITLKHVDSDKNLTKFQRIQHGIKRANHRLERLNAMVLAASSNAEINSPVLSGNGEFLM 102
Query: 79 KIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQ 138
+ IGTPP+ Y +DTGSD++W C C +C + S ++D K SS+ ++C
Sbjct: 103 NLAIGTPPETYSAIMDTGSDLIWTQCKPCTQCFDQPS-----PIFDPKKSSSFSKLSCSS 157
Query: 139 EFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLI 198
+ C + P + C+ SC YL YGD SST G + + KVS ++
Sbjct: 158 QLCKAL---PQSSCS--DSCEYLYTYGDYSSTQGTMATETFTFGKVSIP--------NVG 204
Query: 199 FGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGG---- 254
FGCG G+ + G++G G+ S++SQL + F++CL I+
Sbjct: 205 FGCGEDNEGDGFTQGS----GLVGLGRGPLSLVSQLKEA-----KFSYCLTSIDDTKTST 255
Query: 255 ---GIFAIGHVVQPEVNKTPLVPN--QPH-YSINMTAVQVGLDFLNLPTDVFGVGDN--K 306
G A + + TPL+ N QP Y +++ + VG L + F + D+
Sbjct: 256 LLMGSLASVNGTSAAIRTTPLIQNPLQPSFYYLSLEGISVGGTRLPIKESTFQLQDDGTG 315
Query: 307 GTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHT--VHDEYTCFQY-SESVDEGFPNV 363
G IIDSGTT+ YL E ++ LV K + Q L V C+ S++ + P +
Sbjct: 316 GLIIDSGTTITYLEESAFD-LVKKEFTSQMGLPVDNSGATGLELCYNLPSDTSELEVPKL 374
Query: 364 TFHF 367
HF
Sbjct: 375 VLHF 378
>gi|326499093|dbj|BAK06037.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 471
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 79/273 (28%), Positives = 120/273 (43%), Gaps = 36/273 (13%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECP-----RRSSLGIELTLYDIKDSST 130
Y + IGTPP DTGSD++W+NC + P R + +D S+T
Sbjct: 100 YLMAVNIGTPPTRMVAIADTGSDLIWLNCSYGGDGPGLAAARDADAQPPGVQFDPSKSTT 159
Query: 131 GKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDL-- 188
+ V CD C + P C A++ C Y YGDGS T+G + + G
Sbjct: 160 FRLVDCDSVACSEL---PEASCGADSKCRYSYSYGDGSHTSGVLSTETFTFADAPGARGD 216
Query: 189 QTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL 248
TT+ ++ FGC G+ L + S++SQL + + + F++CL
Sbjct: 217 GTTTRVANVNFGCSTTFVGSSVGDGLVGLG------GGDLSLVSQLGADTSLGRRFSYCL 270
Query: 249 --------DGINGGGIFAIGHVVQPEVNKTPLVPNQ--PHYSINMTAVQVGLDFLNLPTD 298
+N G A V P TPL+P+Q +Y + + +V+VG
Sbjct: 271 VPYSVKASSALNFGPRAA---VTDPGAVTTPLIPSQVKAYYIVELRSVKVG-------NK 320
Query: 299 VFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKI 331
F D I+DSGTTL +LPE + +PLV ++
Sbjct: 321 TFEAPDRSPLIVDSGTTLTFLPEALVDPLVKEL 353
>gi|224286159|gb|ACN40790.1| unknown [Picea sitchensis]
Length = 452
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 100/384 (26%), Positives = 160/384 (41%), Gaps = 47/384 (12%)
Query: 26 HGVFSVKYRYAGRERSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTP 85
+ S K R G L LK R + A ++P+ G G Y ++ GTP
Sbjct: 74 ESLMSEKIR--GDANRLRFLKR--TSRSSKEDANANVPVRS-----GSGEYIIQVDFGTP 124
Query: 86 PKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVY 145
+ Y +DTGSD+ W+ C QC+ C + ++D SS+ K CD + C +
Sbjct: 125 KQSMYTLIDTGSDVAWIPCKQCQGCHSTAP------IFDPAKSSSYKPFACDSQPCQEIS 178
Query: 146 GGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQ 205
G +C N+ C + +YGDG+ G D + + + FGC
Sbjct: 179 G----NCGGNSKCQFEVLYGDGTQVDGTLASDAITLG--------SQYLPNFSFGCAESL 226
Query: 206 SGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAI----GH 261
S + S+ G + ++L GG F++CL + +
Sbjct: 227 SEDTYSSPGLMGLGGGSLSLLTQAPTAELF--GGT---FSYCLPSSSTSSGSLVLGKEAA 281
Query: 262 VVQPEVNKTPLV--PNQP-HYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAY 318
V + T L+ P+ P Y + + A+ VG +++P + GTIIDSGTT+ Y
Sbjct: 282 VSSSSLKFTTLIKDPSFPTFYFVTLKAISVGNTRISVPAT--NIASGGGTIIDSGTTITY 339
Query: 319 LPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQY-SESVDEGFPNVTFHFENSVSLKVYP 377
L Y+ L Q L+ V D TC+ S SVD P +T H + +V L V P
Sbjct: 340 LVPSAYKDLRDAFRQQLSSLQPTPVEDMDTCYDLSSSSVD--VPTITLHLDRNVDL-VLP 396
Query: 378 HEYLFPFED--LWCIGWQNSGMQS 399
E + ++ L C+ + ++ +S
Sbjct: 397 KENILITQESGLSCLAFSSTDSRS 420
>gi|413953772|gb|AFW86421.1| hypothetical protein ZEAMMB73_098827 [Zea mays]
Length = 482
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 87/297 (29%), Positives = 132/297 (44%), Gaps = 36/297 (12%)
Query: 44 LLKEHDARR--QQRILAGVDLPLGGSSRPDGVGL------YYAKIGIGTPPKDYYVQVDT 95
L ++H+ R +R+ D ++ P +GL Y IGIGTP +++ V DT
Sbjct: 89 LRRDHNRVRSIHRRLTGAGDT---AATIPASLGLAFHSLEYVVTIGIGTPARNFTVLFDT 145
Query: 96 GSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTAN 155
GSD+ WV C C + S + L+D SST V C C + GG C
Sbjct: 146 GSDLTWVQCKPCTD----SCYQQQEPLFDPSKSSTYVDVPCGTPQCK-IGGGQDLTC-GG 199
Query: 156 TSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEE 215
T+C Y YGD S T G Q+ + ++FGC S + EE
Sbjct: 200 TTCEYSVKYGDQSVTRGNLAQEAFTLSP------SAPPAAGVVFGCSHEYSSGVKGAEEE 253
Query: 216 -ALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI-NGGGIFAIGHVVQPEVNK--TP 271
++ G++G G+ +SS++SQ G +F++CL + G IG P+ N TP
Sbjct: 254 MSVAGLLGLGRGDSSILSQ-TRRGNSGDVFSYCLPPRGSSAGYLTIGAAAPPQSNLSFTP 312
Query: 272 LVPNQPH----YSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVY 324
LV + Y +N+ + V L + F + GT+IDSGT + ++P Y
Sbjct: 313 LVTDNSQLSSVYVVNLVGISVSGAALPIDASAFYI----GTVIDSGTVITHMPAAAY 365
>gi|356537728|ref|XP_003537377.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 543
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 95/376 (25%), Positives = 168/376 (44%), Gaps = 61/376 (16%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y+ + +GTPPK ++ +DTGSD+ W+ C C +C ++ + Y KDSST
Sbjct: 167 GTGEYFLDMFVGTPPKHVWLILDTGSDLSWIQCDPCYDCFEQNG-----SHYYPKDSSTY 221
Query: 132 KFVTCDQEFCHGVYGG-PLTDCTA-NTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQ 189
+ ++C C V PL C A N +CPY Y DGS+TTG F + +
Sbjct: 222 RNISCYDPRCQLVSSSDPLQHCKAENQTCPYFYDYADGSNTTGDFASETFT-------VN 274
Query: 190 TTSTNGS--------LIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVR 241
T NG ++FGCG G G++G G+ S SQ+ S G
Sbjct: 275 LTWPNGKEKFKQVVDVMFGCGHWNKGFF-----YGASGLLGLGRGPISFPSQIQSIYG-- 327
Query: 242 KMFAHCL------DGINGGGIFAIGHVV--QPEVNKTPLV-----PNQPHYSINMTAVQV 288
F++CL ++ IF + +N T L+ P++ Y + + ++ V
Sbjct: 328 HSFSYCLTDLFSNTSVSSKLIFGEDKELLNNHNLNFTTLLAGEETPDETFYYLQIKSIMV 387
Query: 289 GLDFLNLPTDVFGVGDN-------KGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVH 341
G + L++ + GTIIDSG+TL + P+ Y+ ++ + ++ L+
Sbjct: 388 GGEVLDISEQTWHWSSEGAAADAGGGTIIDSGSTLTFFPDSAYD-IIKEAFEKKIKLQ-Q 445
Query: 342 TVHDEYT---CFQYSESVDE-GFPNVTFHFENSVSLKVYPHEYLFPFE--DLWCIGWQNS 395
D++ C+ S ++ + P+ HF + Y + +E ++ C+
Sbjct: 446 IAADDFVMSPCYNVSGAMMQVELPDFGIHFADGGVWNFPAENYFYQYEPDEVICLAI--- 502
Query: 396 GMQSRDRKNMTLLGDF 411
M++ + ++T++G+
Sbjct: 503 -MKTPNHSHLTIIGNL 517
>gi|297828001|ref|XP_002881883.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
lyrata]
gi|297327722|gb|EFH58142.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
lyrata]
Length = 529
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 81/284 (28%), Positives = 134/284 (47%), Gaps = 30/284 (10%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y+ + +GTPPK + + +DTGSD+ W+ C+ C +C ++ YD K S++
Sbjct: 158 GSGEYFMDVLVGTPPKHFSLILDTGSDLNWLQCLPCYDCFHQNE-----AFYDPKTSASF 212
Query: 132 KFVTCDQEFCHGVYG-GPLTDCTA-NTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQ 189
K +TC+ C + P C + N SCPY YGD S+TTG F + + + + +
Sbjct: 213 KNITCNDPRCSLISSPEPPVQCKSDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTEGR 272
Query: 190 TTSTN-GSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL 248
++ +++FGCG G + G S SQL S G F++CL
Sbjct: 273 SSEYKVENMMFGCGHWNRGLFSGASGLLGLGRGPLSFS-----SQLQSLYG--HSFSYCL 325
Query: 249 ------DGINGGGIFAIGH--VVQPEVNKTPLVPNQPH-----YSINMTAVQVGLDFLNL 295
++ IF + +N T V + + Y I + ++ VG + L++
Sbjct: 326 VDRNSDTNVSSKLIFGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILVGGEALDI 385
Query: 296 PTDVFGVGDN--KGTIIDSGTTLAYLPEMVYEPLVSKIISQQPD 337
P + + + + GTIIDSGTTL+Y E YE + +K + +
Sbjct: 386 PEETWNISPDGAGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKE 429
>gi|413953655|gb|AFW86304.1| hypothetical protein ZEAMMB73_151223 [Zea mays]
Length = 535
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 91/330 (27%), Positives = 135/330 (40%), Gaps = 48/330 (14%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNC--IQCKECPRRSSLGIELTLYDIKDSSTG 131
GLYY I +G+PP+ Y++ VDTGS WV C C C + + LY + + T
Sbjct: 158 GLYYTAISLGSPPRPYFLDVDTGSHTTWVQCDAPPCASCAKGAH-----PLY--RPARTA 210
Query: 132 KFVTCDQEFCHGV-YGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQT 190
+ C G + P C Y Y DGSS+ G +V+D +Q+ G+ +
Sbjct: 211 DALPASDPLCEGAQHENP-------NQCDYEISYADGSSSMGVYVRDSMQFVGEDGERE- 262
Query: 191 TSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL-- 248
N ++FGCG Q G L + E DG++G S+ +QLAS G + F HC+
Sbjct: 263 ---NADIVFGCGYDQQGVLLNA-LETTDGVLGLTNKALSLPTQLASRGIISNAFGHCMST 318
Query: 249 DGINGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQV-----GLDFLNLPTDVFGVG 303
D GG +G P T VP + + ++ QV G LN G
Sbjct: 319 DPSGAGGYLFLGDDYIPRWGMT-WVPIRDGPADDVRRAQVKQINHGDQQLN------AQG 371
Query: 304 DNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCF--------QYSES 355
+ D+G+T Y P+ L+S + V D+ F + E
Sbjct: 372 KLTQVVFDTGSTYTYFPDEALTRLISSLKEAASPRFVQDDSDKTLPFCMKSDFPVRSVED 431
Query: 356 VDEGFPNVTFHFEN----SVSLKVYPHEYL 381
V F ++ FE S + + P YL
Sbjct: 432 VKHFFKPLSLQFEKRFFFSRTFNIRPEHYL 461
>gi|50508279|dbj|BAD32128.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|125600536|gb|EAZ40112.1| hypothetical protein OsJ_24555 [Oryza sativa Japonica Group]
Length = 455
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 93/292 (31%), Positives = 125/292 (42%), Gaps = 48/292 (16%)
Query: 71 DGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSST 130
+G G Y I +GTPP D+ V VDTGS+++W C C C R + L SST
Sbjct: 86 NGAGAYNMNISLGTPPLDFPVIVDTGSNLIWAQCAPCTRCFPRPTPAPVL---QPARSST 142
Query: 131 GKFVTCDQEFCH--GVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDL 188
+ C+ FC P T C A +C Y YG G T GY + + GD
Sbjct: 143 FSRLPCNGSFCQYLPTSSRPRT-CNATAACAYNYTYGSG-YTAGYLATETLTV----GD- 195
Query: 189 QTTSTNGSLIFGCGARQSGNLDSTNEEALD---GIIGFGKSNSSMISQLASSGGVRKMFA 245
T + FGC + E +D GI+G G+ S++SQLA F+
Sbjct: 196 ---GTFPKVAFGC----------STENGVDNSSGIVGLGRGPLSLVSQLAVG-----RFS 237
Query: 246 HCL--DGINGG---------GIFAIGHVVQP-EVNKTPLVPNQPHYSINMTAVQVGLDFL 293
+CL D +GG G VVQ + K P + HY +N+T + V L
Sbjct: 238 YCLRSDMADGGASPILFGSLAKLTEGSVVQSTPLLKNPYLQRSTHYYVNLTGIAVDSTEL 297
Query: 294 NLPTDVFG---VGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHT 342
+ FG G GTI+DSGTTL YL + Y + SQ +L T
Sbjct: 298 PVTGSTFGFTQTGLGGGTIVDSGTTLTYLAKDGYAMVKQAFQSQMANLNQTT 349
>gi|18402471|ref|NP_564539.1| aspartyl protease [Arabidopsis thaliana]
gi|7770346|gb|AAF69716.1|AC016041_21 F27J15.15 [Arabidopsis thaliana]
gi|13430540|gb|AAK25892.1|AF360182_1 unknown protein [Arabidopsis thaliana]
gi|14532748|gb|AAK64075.1| unknown protein [Arabidopsis thaliana]
gi|332194267|gb|AEE32388.1| aspartyl protease [Arabidopsis thaliana]
Length = 583
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 91/339 (26%), Positives = 146/339 (43%), Gaps = 53/339 (15%)
Query: 28 VFSVKYRYAGRERSLSLLKEHDARRQQRILAGVDL------------------------- 62
VF V ++ RE +L+E + + +DL
Sbjct: 131 VFPVYHKLRAREFHERILEEDLGLENENFVESMDLELVNPVKVNDVLSTSAGSIDSSTTI 190
Query: 63 -PLGGSSRPDGVGLYYAKIGIGTPP--KDYYVQVDTGSDIMWVNC-IQCKECPRRSSLGI 118
P+GG+ PDG LYY +I +G P + Y++ +DTGS++ W+ C C C + ++
Sbjct: 191 FPVGGNVYPDG--LYYTRILVGKPEDGQYYHLDIDTGSELTWIQCDAPCTSCAKGAN--- 245
Query: 119 ELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTD-CTANTSCPYLEIYGDGSSTTGYFVQD 177
LY + + V + FC V LT+ C C Y Y D S + G +D
Sbjct: 246 --QLYKPRKDN---LVRSSEAFCVEVQRNQLTEHCENCHQCDYEIEYADHSYSMGVLTKD 300
Query: 178 VVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASS 237
+G L ++FGCG Q G L +T + DGI+G ++ S+ SQLAS
Sbjct: 301 KFHLKLHNGSL----AESDIVFGCGYDQQGLLLNTLLKT-DGILGLSRAKISLPSQLASR 355
Query: 238 GGVRKMFAHCL-DGINGGGIFAIGHVVQPEVNKT--PLVPNQ--PHYSINMTAVQVGLDF 292
G + + HCL +NG G +G + P T P++ + Y + +T + G
Sbjct: 356 GIISNVVGHCLASDLNGEGYIFMGSDLVPSHGMTWVPMLHDSRLDAYQMQVTKMSYGQGM 415
Query: 293 LNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKI 331
L+L + VG + D+G++ Y P Y LV+ +
Sbjct: 416 LSLDGENGRVGK---VLFDTGSSYTYFPNQAYSQLVTSL 451
>gi|357494221|ref|XP_003617399.1| 60S ribosomal protein L18a [Medicago truncatula]
gi|355518734|gb|AET00358.1| 60S ribosomal protein L18a [Medicago truncatula]
Length = 749
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 82/272 (30%), Positives = 123/272 (45%), Gaps = 30/272 (11%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y+ + IGTPPK Y + +DTGSD+ W+ C+ C C +S YD K+SS+
Sbjct: 188 GSGEYFMDVFIGTPPKHYSLILDTGSDLNWIQCVPCIACFEQSG-----PYYDPKESSSF 242
Query: 132 KFVTCDQEFCHGVYG-GPLTDCT-ANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQ 189
+ +TC C V P C N +CPY YGD S+TTG F + + + + +
Sbjct: 243 ENITCHDPRCKLVSSPDPPKPCKDENQTCPYFYWYGDSSNTTGDFALETFTVNLTTPNGK 302
Query: 190 TTSTN-GSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL 248
+ + +++FGCG G G S SQL S G F++CL
Sbjct: 303 SEQKHVENVMFGCGHWNRGLFHGAAGLLGLGR-----GPLSFASQLQSIYG--HSFSYCL 355
Query: 249 ------DGINGGGIFAIGH--VVQPEVNKTPLVPNQPH-----YSINMTAVQVGLDFLNL 295
++ IF + P +N T V + + Y + + ++ V + L +
Sbjct: 356 VDRNSDTSVSSKLIFGEDKELLSHPNLNFTSFVGGEENSVDTFYYVGIKSIMVDGEVLKI 415
Query: 296 PTDVFGVGDN--KGTIIDSGTTLAYLPEMVYE 325
P + + + GTIIDSGTTL Y E YE
Sbjct: 416 PEETWHLSKEGGGGTIIDSGTTLTYFAEPAYE 447
>gi|255548660|ref|XP_002515386.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545330|gb|EEF46835.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 387
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 92/326 (28%), Positives = 140/326 (42%), Gaps = 36/326 (11%)
Query: 60 VDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQC-KECPRRSSLGI 118
D+P+ S P G G Y K+ +GTP + +DTGSDI W C C C R++
Sbjct: 30 ADIPVQ-SGIPLGAGNYLVKMALGTPKLSLSLALDTGSDITWTQCEPCVGSCYRQAQ--- 85
Query: 119 ELTLYDIKDSSTGKFVTCDQEFCHGVY-GGPLTDCTANTSCPYLEIYGDGSSTTGYFVQD 177
T +D + SS+ K V+C C + G C ++T C Y YGDGS + G+F +
Sbjct: 86 --TKFDPRKSSSYKNVSCSSSSCRIITDSGGARGCVSST-CIYKVQYGDGSYSVGFFATE 142
Query: 178 VVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASS 237
+ + + + +FGCG + +G + L +S
Sbjct: 143 KLT-------ISPSDVISNFLFGCGQQNAGRFGRIAGLLG-------LGRGKLSLALQTS 188
Query: 238 GGVRKMFAHCLDGINGG--GIFAIGHVVQPEVNKTPLVP---NQPHYSINMTAVQVGLDF 292
+F +CL + G +G V V TPL P N P Y I++ + VG
Sbjct: 189 EKYNNLFTYCLPSFSSSSTGHLTLGGQVPKSVKFTPLSPAFKNTPFYGIDIKGLSVGGHV 248
Query: 293 LNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSK---IISQQPDLKVHTVHDEYTC 349
L + VF N G IIDSGT + L VY L SK ++ P ++ D TC
Sbjct: 249 LPIDASVF---SNAGAIIDSGTVITRLQPTVYSALSSKFQQLMKDYPKTDGFSILD--TC 303
Query: 350 FQYSESVDEGFPNVTFHFENSVSLKV 375
+ +S + P ++F F+ V + +
Sbjct: 304 YDFSGNESISVPRISFFFKGGVEVDI 329
>gi|19424106|gb|AAL87345.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
Length = 500
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 102/378 (26%), Positives = 164/378 (43%), Gaps = 43/378 (11%)
Query: 28 VFSVKYRYAGRERS-LSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPP 86
V +++ G +RS L + D R Q L + G+S+ G G Y+++IG+GTP
Sbjct: 117 VAKIRFAVEGVDRSDLKPVYNEDTRYQTEDLTTP--VVSGASQ--GSGEYFSRIGVGTPA 172
Query: 87 KDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYG 146
KD Y+ +DTGSD+ W+ C C +C ++S +++ SST K +TC C +
Sbjct: 173 KDMYLVLDTGSDVNWIQCEPCADCYQQSD-----PVFNPTSSSTYKSLTCSAPQCSLL-- 225
Query: 147 GPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQS 206
+ C +N C Y YGDGS T G D V + SG + ++ GCG
Sbjct: 226 -ETSACRSN-KCLYQVSYGDGSFTVGELATDTVTFGN-SGKIN------NVALGCGHDNE 276
Query: 207 GNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIG-HVVQP 265
G G S+ +Q+ ++ F++CL + G ++ + VQ
Sbjct: 277 GLFTGAAGLLGLGGGVL-----SITNQMKATS-----FSYCLVDRDSGKSSSLDFNSVQL 326
Query: 266 --EVNKTPLVPNQP---HYSINMTAVQVGLDFLNLPTDVFGV--GDNKGTIIDSGTTLAY 318
PL+ N+ Y + ++ VG + + LP +F V + G I+D GT +
Sbjct: 327 GGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTAVTR 386
Query: 319 LPEMVYEPLVSKIISQQPDLK--VHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVY 376
L Y L + +LK ++ TC+ +S P V FHF SL +
Sbjct: 387 LQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLSTVKVPTVAFHFTGGKSLDLP 446
Query: 377 PHEYLFPFED--LWCIGW 392
YL P +D +C +
Sbjct: 447 AKNYLIPVDDSGTFCFAF 464
>gi|115484725|ref|NP_001067506.1| Os11g0215400 [Oryza sativa Japonica Group]
gi|77549255|gb|ABA92052.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113644728|dbj|BAF27869.1| Os11g0215400 [Oryza sativa Japonica Group]
Length = 428
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 95/346 (27%), Positives = 150/346 (43%), Gaps = 62/346 (17%)
Query: 75 LYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKEC---PRRSSLGIELTLYDIKDSSTG 131
LY +G+GTP K V++DTGS WV C +C C PR T + ++
Sbjct: 81 LYVISVGLGTPAKTQIVEIDTGSSTSWVFC-ECDGCHTNPR--------TFLQSRSTTCA 131
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTS---CPYLEIYGDGSSTTGYFVQDVVQYDKVSGDL 188
K V+C C + GG C + + CP+ Y DGS++ G QD + + D+
Sbjct: 132 K-VSCGTSMC--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF----SDV 184
Query: 189 QTTSTNGSLIFGC-----GARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKM 243
Q FGC GA + GN +DG++G G S++ Q S
Sbjct: 185 QKIP---GFSFGCNMDSFGANEFGN--------VDGLLGMGAGPMSVLKQ---SSPTFDC 230
Query: 244 FAHCL------DGI--NGGGIFAIGHVV-QPEVNKTPLVPNQPH---YSINMTAVQVGLD 291
F++CL G G F++G V + +V T +V + + + +++TA+ V +
Sbjct: 231 FSYCLPLQKSERGFFSKTTGYFSLGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGE 290
Query: 292 FLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQ 351
L L VF KG + DSG+ L+Y+P+ L +I E C+
Sbjct: 291 RLGLSPSVFS---RKGVVFDSGSELSYIPDRALSVLSQRIRELLLKRGAAEEESERNCYD 347
Query: 352 YSESVDEG-FPNVTFHFENSVSLKVYPH----EYLFPFEDLWCIGW 392
SVDEG P ++ HF++ + H E +D+WC+ +
Sbjct: 348 M-RSVDEGDMPAISLHFDDGARFDLGSHGVFVERSVQEQDVWCLAF 392
>gi|302793638|ref|XP_002978584.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
gi|300153933|gb|EFJ20570.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
Length = 407
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 91/346 (26%), Positives = 146/346 (42%), Gaps = 37/346 (10%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y+ ++G+GTP + ++ VDTGSD+ W+ C CK C +++ ++D ++SS+
Sbjct: 50 GSGEYFVRLGLGTPARSLFMVVDTGSDLPWLQCQPCKSCYKQAD-----PIFDPRNSSSF 104
Query: 132 KFVTCDQEFCHGVYGGPLTDCT----ANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGD 187
+ + C C + + C+ A + C Y YGDGS + G F D+
Sbjct: 105 QRIPCLSPLCKAL---EVHSCSGSRGATSRCSYQVAYGDGSFSVGDFSSDLFT------- 154
Query: 188 LQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHC 247
L T S S+ FGCG G G S S I +++ F++C
Sbjct: 155 LGTGSKAMSVAFGCGFDNEGLFAGAAGLLGLGAGKL--SFPSQIFASSTNSSTANSFSYC 212
Query: 248 L-DGIN------GGGIFAIGHVVQPEVNKTPLVPN---QPHYSINMTAVQVGLDFL--NL 295
L D N IF + + +PL+ N Y M V VG L +L
Sbjct: 213 LVDRSNPMTRSSSSLIFGVA-AIPSTAALSPLLKNPKLDTFYYAAMIGVSVGGAQLPISL 271
Query: 296 PTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEY-TCFQYSE 354
+ + G IIDSGT++ P VY + + +L + + TC+ +S
Sbjct: 272 KSLQLSQSGSGGVIIDSGTSVTRFPTSVYATIRDAFRNATINLPSAPRYSLFDTCYNFSG 331
Query: 355 SVDEGFPNVTFHFENSVSLKVYPHEYLFPFEDL--WCIGWQNSGMQ 398
P + HFEN L++ P YL P +C+ + + M+
Sbjct: 332 KASVDVPALVLHFENGADLQLPPTNYLIPINTAGSFCLAFAPTSME 377
>gi|242056193|ref|XP_002457242.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
gi|241929217|gb|EES02362.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
Length = 457
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 84/318 (26%), Positives = 134/318 (42%), Gaps = 38/318 (11%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y + +GTPP DTGSD++WVNC + G + + SST ++
Sbjct: 103 YLMYVNVGTPPTQLLAIADTGSDLVWVNCSSSGGGLADADAGGNVVFQPTR-SSTYSQLS 161
Query: 136 CDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNG 195
C C + C A++ C Y YGDGS T G + + G Q
Sbjct: 162 CQSNACQAL---SQASCDADSECQYQYSYGDGSRTIGVLSTETFSFVDGGGKGQVRVPR- 217
Query: 196 SLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLD---GIN 252
+ FGC +G S DG++G G S++SQL ++ + + ++CL N
Sbjct: 218 -VNFGCSTASAGTFRS------DGLVGLGAGAFSLVSQLGATTHIDRKLSYCLIPSYDAN 270
Query: 253 GGGIFAIGH---VVQPEVNKTPLVPN--QPHYSINMTAVQVGLDFLNLPTDVFGVGDNKG 307
G V +P TPLVP+ +Y++ + +V VG D++
Sbjct: 271 SSSTLNFGSRAVVSEPGAASTPLVPSDVDSYYTVALESVAVG-------GQEVATHDSR- 322
Query: 308 TIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQY-------SESVDEGF 360
I+DSGTTL +L + PLV+++ + +K+ V Q SE+ + G
Sbjct: 323 IIVDSGTTLTFLDPALLGPLVTEL---ERRIKLQRVQPPEQLLQLCYDVQGKSETDNFGI 379
Query: 361 PNVTFHFENSVSLKVYPH 378
P+VT F ++ + P
Sbjct: 380 PDVTLRFGGGAAVTLRPE 397
>gi|356529585|ref|XP_003533370.1| PREDICTED: lysine-specific histone demethylase 1 homolog 1-like
[Glycine max]
Length = 1388
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 83/283 (29%), Positives = 134/283 (47%), Gaps = 37/283 (13%)
Query: 62 LPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNC-IQCKECPRRSSLGIEL 120
P+ G+ PDG LY+ + +G PPK Y++ VDTGSD+ W+ C C C + + + +
Sbjct: 180 FPVSGNVYPDG--LYFTILRVGNPPKSYFLDVDTGSDLTWMQCDAPCISCGKGAHVLYKP 237
Query: 121 TLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQ 180
T ++ S + + +G + L C Y Y D SS+ G V+D
Sbjct: 238 TRSNVVSSVDALCLDVQKNQKNGHHDESLLQCD------YEIQYADHSSSLGVLVRD--- 288
Query: 181 YDKVSGDLQTTSTNGS-----LIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLA 235
+L +TNGS ++FGCG Q+G L +T + DGI+G ++ S+ QLA
Sbjct: 289 ------ELHLVTTNGSKTKLNVVFGCGYDQAGLLLNTLGKT-DGIMGLSRAKVSLPYQLA 341
Query: 236 SSGGVRKMFAHCL--DGINGGGIFAIGHVVQPE--VNKTPLVPNQPHYSINMTAVQ---V 288
S G ++ + HCL DG GG +F +G P +N P+ Y++ Q +
Sbjct: 342 SKGLIKNVVGHCLSNDGAGGGYMF-LGDDFVPYWGMNWVPMA-----YTLTTDLYQTEIL 395
Query: 289 GLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKI 331
G+++ N G + DSG++ Y P+ Y LV+ +
Sbjct: 396 GINYGNRQLRFDGQSKVGKMVFDSGSSYTYFPKEAYLDLVASL 438
>gi|115448349|ref|NP_001047954.1| Os02g0720600 [Oryza sativa Japonica Group]
gi|45735841|dbj|BAD12876.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
[Oryza sativa Japonica Group]
gi|45735967|dbj|BAD12996.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
[Oryza sativa Japonica Group]
gi|113537485|dbj|BAF09868.1| Os02g0720600 [Oryza sativa Japonica Group]
gi|125583492|gb|EAZ24423.1| hypothetical protein OsJ_08176 [Oryza sativa Japonica Group]
gi|215740989|dbj|BAG97484.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 463
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 86/317 (27%), Positives = 132/317 (41%), Gaps = 34/317 (10%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y +G+GTP V +DTGSD+ WV QC CP L+D SST + V+
Sbjct: 127 YVISVGLGTPAVTQTVTIDTGSDVSWV---QCNPCPNPPCYAQTGALFDPAKSSTYRAVS 183
Query: 136 CDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNG 195
C C + N C Y YGDGS+T G + +D + S ++
Sbjct: 184 CAAAECAQLEQQGNGCGATNYECQYGVQYGDGSTTNGTYSRDTLTLSGASDAVK------ 237
Query: 196 SLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGG- 254
FGC +SG D T DG++G G S++SQ A++ G F++CL +G
Sbjct: 238 GFQFGCSHVESGFSDQT-----DGLMGLGGGAQSLVSQTAAAYG--NSFSYCLPPTSGSS 290
Query: 255 ------GIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGT 308
G + V + ++ +P Y + + VG L L VF G+
Sbjct: 291 GFLTLGGGGGVSGFVTTRMLRSRQIPT--FYGARLQDIAVGGKQLGLSPSVFAA----GS 344
Query: 309 IIDSGTTLAYLPEMVYEPLVSKI---ISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTF 365
++DSGT + LP Y L S + Q ++ D TCF ++ P V
Sbjct: 345 VVDSGTIITRLPPTAYSALSSAFKAGMKQYRSAPARSILD--TCFDFAGQTQISIPTVAL 402
Query: 366 HFENSVSLKVYPHEYLF 382
F ++ + P+ ++
Sbjct: 403 VFSGGAAIDLDPNGIMY 419
>gi|242056497|ref|XP_002457394.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
gi|241929369|gb|EES02514.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
Length = 509
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 88/334 (26%), Positives = 143/334 (42%), Gaps = 38/334 (11%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y++++GIG+P ++ Y+ +DTGSD+ WV C C +C ++S ++D S++
Sbjct: 165 GSGEYFSRVGIGSPARELYMVLDTGSDVTWVQCQPCADCYQQSD-----PVFDPSLSASY 219
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
V+CD C + + T +C Y YGDGS T G F + + GD T
Sbjct: 220 AAVSCDSPRCRDLDTAACRNATG--ACLYEVAYGDGSYTVGDFATETLTL----GD-STP 272
Query: 192 STNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL--- 248
TN + GCG G G S SQ+++S F++CL
Sbjct: 273 VTN--VAIGCGHDNEGLFVGAAGLLALGGGPL-----SFPSQISAS-----TFSYCLVDR 320
Query: 249 DGINGGGIFAIGHVVQPEVNKTPLVPNQPH----YSINMTAVQVGLDFLNLPTDVFGVGD 304
D + + + PLV P Y + ++ + VG L++P+ F +
Sbjct: 321 DSPAASTLQFGADGAEADTVTAPLV-RSPRTGTFYYVALSGISVGGQALSIPSSAFAMDA 379
Query: 305 NKGT---IIDSGTTLAYLPEMVYEPLVSKIISQQPDL-KVHTVHDEYTCFQYSESVDEGF 360
G+ I+DSGT + L Y L + P L + V TC+ S+
Sbjct: 380 TSGSGGVIVDSGTAVTRLQSSAYAALRDAFVRGTPSLPRTSGVSLFDTCYDLSDRTSVEV 439
Query: 361 PNVTFHFENSVSLKVYPHEYLFPFEDL--WCIGW 392
P V+ FE +L++ YL P + +C+ +
Sbjct: 440 PAVSLRFEGGGALRLPAKNYLIPVDGAGTYCLAF 473
>gi|79407941|ref|NP_188636.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|75273243|sp|Q9LHE3.1|ASPG2_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 2;
Short=AtASPG2; Flags: Precursor
gi|11994777|dbj|BAB03167.1| nucleoid chloroplast DNA-binding protein-like [Arabidopsis
thaliana]
gi|28392860|gb|AAO41867.1| unknown protein [Arabidopsis thaliana]
gi|332642798|gb|AEE76319.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 470
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 94/335 (28%), Positives = 144/335 (42%), Gaps = 35/335 (10%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y+ +IG+G+PP+D Y+ +D+GSD++WV C CK C ++S ++D S +
Sbjct: 127 GSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSD-----PVFDPAKSGSY 181
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
V+C C + + C + C Y +YGDGS T G + + + K T
Sbjct: 182 TGVSCGSSVCDRIEN---SGCHSG-GCRYEVMYGDGSYTKGTLALETLTFAK------TV 231
Query: 192 STNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL--D 249
N + GCG R G G + S + QL SG F +CL
Sbjct: 232 VRN--VAMGCGHRNRGMFIGAAGLLGI-----GGGSMSFVGQL--SGQTGGAFGYCLVSR 282
Query: 250 GINGGGIFAIGHVVQP-EVNKTPLV--PNQPH-YSINMTAVQVGLDFLNLPTDVFGVGD- 304
G + G G P + PLV P P Y + + + VG + LP VF + +
Sbjct: 283 GTDSTGSLVFGREALPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTET 342
Query: 305 -NKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDL-KVHTVHDEYTCFQYSESVDEGFPN 362
+ G ++D+GT + LP Y SQ +L + V TC+ S V P
Sbjct: 343 GDGGVVMDTGTAVTRLPTAAYVAFRDGFKSQTANLPRASGVSIFDTCYDLSGFVSVRVPT 402
Query: 363 VTFHFENSVSLKVYPHEYLFPFED--LWCIGWQNS 395
V+F+F L + +L P +D +C + S
Sbjct: 403 VSFYFTEGPVLTLPARNFLMPVDDSGTYCFAFAAS 437
>gi|125555058|gb|EAZ00664.1| hypothetical protein OsI_22685 [Oryza sativa Indica Group]
Length = 465
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 95/340 (27%), Positives = 144/340 (42%), Gaps = 56/340 (16%)
Query: 65 GGSSRPDGVG------LYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGI 118
GG+S P +G Y +GIGTP V +DTGSD+ WV QCK C
Sbjct: 101 GGTSIPTFLGDSVDSLEYVVTLGIGTPAVQQIVLIDTGSDLSWV---QCKPCGAGECYAQ 157
Query: 119 ELTLYDIKDSSTGKFVTCDQEFCH----GVYGGPLTDCTANTS--CPYLEIYGDGSSTTG 172
+ L+D SS+ V CD + C G YG CT+ + C Y YG+ ++TTG
Sbjct: 158 KDPLFDPSSSSSYASVPCDSDACRKLAAGAYG---HGCTSGAAALCEYGIEYGNRATTTG 214
Query: 173 YFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMIS 232
+ + + L+ FGCG Q G E DG++G G + S++S
Sbjct: 215 VYSTETLT-------LKPGVVVADFGFGCGDHQHGPY-----EKFDGLLGLGGAPESLVS 262
Query: 233 QLASSGGVRKMFAHCLDGINGGGIF--------------AIGHVVQPEVNKTPLVPNQPH 278
Q +S G F++CL +GG F A G + P + + P VP
Sbjct: 263 QTSSQFG--GPFSYCLPPTSGGAGFLALGAPNSSSSSTAAAGFLFTP-MRRIPSVPT--F 317
Query: 279 YSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDL 338
Y + +T + VG L +P F + G +IDSGT + LP Y L S S +
Sbjct: 318 YVVTLTGISVGGAPLAVPPSAF----SSGMVIDSGTVITGLPATAYAALRSAFRSAMSEY 373
Query: 339 KVHTVHDEY---TCFQYSESVDEGFPNVTFHFENSVSLKV 375
++ + TC+ ++ + P + F ++ +
Sbjct: 374 RLLPPSNGAVLDTCYDFTGHTNVTVPTIALTFSGGATIDL 413
>gi|115472515|ref|NP_001059856.1| Os07g0532800 [Oryza sativa Japonica Group]
gi|50508274|dbj|BAD32123.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113611392|dbj|BAF21770.1| Os07g0532800 [Oryza sativa Japonica Group]
Length = 436
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 95/320 (29%), Positives = 143/320 (44%), Gaps = 42/320 (13%)
Query: 71 DGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSST 130
+GVG Y I +GTP + V DTGSD++W C C +C ++ + + SST
Sbjct: 81 NGVGGYNMNISVGTPLLTFSVVADTGSDLIWTQCAPCTKCFQQPA-----PPFQPASSST 135
Query: 131 GKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQT 190
+ C FC + C A T C Y YG G T GY + ++ S
Sbjct: 136 FSKLPCTSSFCQ-FLPNSIRTCNA-TGCVYNYKYGSG-YTAGYLATETLKVGDASFP--- 189
Query: 191 TSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDG 250
S+ FGC + ++G +ST+ GI G G+ S+I QL GV + F++CL
Sbjct: 190 -----SVAFGC-STENGVGNSTS-----GIAGLGRGALSLIPQL----GVGR-FSYCLRS 233
Query: 251 INGGG----IF-AIGHVVQPEVNKTPLVPNQP----HYSINMTAVQVGLDFLNLPTDVFG 301
+ G +F ++ ++ V TP V N +Y +N+T + VG L + T FG
Sbjct: 234 GSAAGASPILFGSLANLTDGNVQSTPFVNNPAVHPSYYYVNLTGITVGETDLPVTTSTFG 293
Query: 302 VGDN---KGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDL-KVHTVHDEYTCFQYSESVD 357
N GTI+DSGTTL YL + YE + +SQ D+ V+ CF+ +
Sbjct: 294 FTQNGLGGGTIVDSGTTLTYLAKDGYEMVKQAFLSQTADVTTVNGTRGLDLCFKSTGGGG 353
Query: 358 E--GFPNVTFHFENSVSLKV 375
P++ F+ V
Sbjct: 354 GGIAVPSLVLRFDGGAEYAV 373
>gi|297834938|ref|XP_002885351.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
gi|297331191|gb|EFH61610.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
Length = 471
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 94/335 (28%), Positives = 144/335 (42%), Gaps = 35/335 (10%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y+ +IG+G+PP+D Y+ +D+GSD++WV C CK C ++S ++D S +
Sbjct: 128 GSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSD-----PVFDPAKSGSY 182
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
V+C C + + C + C Y +YGDGS T G + + + K T
Sbjct: 183 TGVSCGSSVCDRIEN---SGCHSG-GCRYEVMYGDGSYTKGTLALETLTFAK------TV 232
Query: 192 STNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL--D 249
N + GCG R G G + S + QL SG F +CL
Sbjct: 233 VRN--VAMGCGHRNRGMFIGAAGLLGI-----GGGSMSFVGQL--SGQTGGAFGYCLVSR 283
Query: 250 GINGGGIFAIGHVVQP-EVNKTPLV--PNQPH-YSINMTAVQVGLDFLNLPTDVFGVGD- 304
G + G G P + PLV P P Y + + + VG + LP VF + +
Sbjct: 284 GTDSTGSLVFGREALPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTET 343
Query: 305 -NKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDL-KVHTVHDEYTCFQYSESVDEGFPN 362
+ G ++D+GT + LP Y SQ +L + V TC+ S V P
Sbjct: 344 GDGGVVMDTGTAVTRLPTGAYAAFRDGFKSQTANLPRASGVSIFDTCYDLSGFVSVRVPT 403
Query: 363 VTFHFENSVSLKVYPHEYLFPFED--LWCIGWQNS 395
V+F+F L + +L P +D +C + S
Sbjct: 404 VSFYFTEGPVLTLPARNFLMPVDDSGTYCFAFAAS 438
>gi|242050432|ref|XP_002462960.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
gi|241926337|gb|EER99481.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
Length = 445
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 101/370 (27%), Positives = 159/370 (42%), Gaps = 55/370 (14%)
Query: 73 VGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQC-KECPRRSSLGIELTLYDIKDSSTG 131
G Y + IGTPP Y DTGSD++W C C +C ++ + LY+ S+T
Sbjct: 83 AGEYLMTLAIGTPPVSYQAIADTGSDLIWTQCAPCSSQCFQQPT-----PLYNPSSSTTF 137
Query: 132 KFVTCDQEF--CHGVYGG--PLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGD 187
+ C+ C G P CT C Y YG G T+ Y + + +
Sbjct: 138 AVLPCNSSLSMCAAALAGTTPPPGCT----CMYNMTYGSG-WTSVYQGSETFTFGSSTPA 192
Query: 188 LQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHC 247
QT + FGC + SG N + G++G G+ + S++SQL GV K F++C
Sbjct: 193 NQTGVPG--IAFGC-SNASGGF---NTSSASGLVGLGRGSLSLVSQL----GVPK-FSYC 241
Query: 248 L---DGINGGGIFAIGHVVQPE----VNKTPLV------PNQPHYSINMTAVQVGLDFLN 294
L N +G V+ TP V P +Y +N+T + +G L+
Sbjct: 242 LTPYQDTNSTSTLLLGPSASLNDTGGVSSTPFVASPSDAPMSTYYYLNLTGISLGTTALS 301
Query: 295 LPTDVFGVGDN--KGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYT---- 348
+PT + + G IIDSGTT+ L Y+ + + ++S L T
Sbjct: 302 IPTTALSLKADGTGGFIIDSGTTITLLGNTAYQQVRAAVVSLV-TLPTTDGGSAATGLDL 360
Query: 349 CFQY--SESVDEGFPNVTFHFENSVSLKVYPHEYLFPFEDLWCIGWQN------SGMQSR 400
CF+ S S P++T HF+ + + + Y+ +LWC+ QN S + +
Sbjct: 361 CFELPSSTSAPPTMPSMTLHFDGA-DMVLPADSYMMLDSNLWCLAMQNQTDGGVSILGNY 419
Query: 401 DRKNMTLLGD 410
++NM +L D
Sbjct: 420 QQQNMHILYD 429
>gi|242050026|ref|XP_002462757.1| hypothetical protein SORBIDRAFT_02g031460 [Sorghum bicolor]
gi|241926134|gb|EER99278.1| hypothetical protein SORBIDRAFT_02g031460 [Sorghum bicolor]
Length = 523
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 83/268 (30%), Positives = 124/268 (46%), Gaps = 23/268 (8%)
Query: 75 LYYAKIGIGTPPKDYYVQVDTGSDIMWV--NCIQCKECPRRSSLGIELTLYDIKDSSTGK 132
L+YA + +GTP + V +DTGSD+ WV +CI C + ++ Y + SST +
Sbjct: 103 LHYAVVALGTPNVTFLVALDTGSDLFWVPCDCINCAPLVSPNYRDLKFDTYSPQKSSTSR 162
Query: 133 FVTCDQEFCHGVYGGPLTDCTANTSCPY-LEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
V C C +A++SCPY +E D +S+TG V+DV+ G Q
Sbjct: 163 KVPCSSNLCDLQ----SACRSASSSCPYSIEYLSDNTSSTGVLVEDVLYLITEYG--QPK 216
Query: 192 STNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI 251
+ FGCG Q+G+ A +G++G G + S+ S LAS G F+ C G
Sbjct: 217 IVTAPITFGCGRIQTGSF--LGSAAPNGLLGLGMDSISVPSLLASEGVAANSFSMCF-GD 273
Query: 252 NGGGIFAIGHVVQPEVNKTPL--VPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTI 309
+G G G + +TPL P+Y+I++T VG N N I
Sbjct: 274 DGRGRINFGDTGSSDQQETPLNIYKQNPYYNISITGAMVGSKSFNT---------NFNAI 324
Query: 310 IDSGTTLAYLPEMVYEPLVSKIISQQPD 337
+DSGT+ L + +Y + S SQ D
Sbjct: 325 VDSGTSFTALSDPMYSEITSSFNSQVQD 352
>gi|61214233|sp|Q766C3.1|NEP1_NEPGR RecName: Full=Aspartic proteinase nepenthesin-1; AltName:
Full=Nepenthesin-I; Flags: Precursor
gi|41016421|dbj|BAD07474.1| aspartic proteinase nepenthesin I [Nepenthes gracilis]
Length = 437
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 105/400 (26%), Positives = 175/400 (43%), Gaps = 57/400 (14%)
Query: 34 RYAGRERSLSLLKEHDARRQQRILAGVDLPLG-GSSRPDGVGLYYAKIGIGTPPKDYYVQ 92
++ ER++ E +RR QR+ A ++ P G +S G G Y + IGTP + +
Sbjct: 56 KFQLLERAI----ERGSRRLQRLEAMLNGPSGVETSVYAGDGEYLMNLSIGTPAQPFSAI 111
Query: 93 VDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDC 152
+DTGSD++W C C +C +S+ +++ + SS+ + C + C + +
Sbjct: 112 MDTGSDLIWTQCQPCTQCFNQST-----PIFNPQGSSSFSTLPCSSQLCQALS----SPT 162
Query: 153 TANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDST 212
+N C Y YGDGS T G + + + VS ++ FGCG G
Sbjct: 163 CSNNFCQYTYGYGDGSETQGSMGTETLTFGSVSIP--------NITFGCGENNQG----F 210
Query: 213 NEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGG-----IFAIGHVVQPEV 267
+ G++G G+ S+ SQL V K F++C+ I + ++ + V
Sbjct: 211 GQGNGAGLVGMGRGPLSLPSQL----DVTK-FSYCMTPIGSSTPSNLLLGSLANSVTAGS 265
Query: 268 NKTPLVPNQ---PHYSINMTAVQVGLDFLNLPTDVFGVGDNKGT---IIDSGTTLAYLPE 321
T L+ + Y I + + VG L + F + N GT IIDSGTTL Y
Sbjct: 266 PNTTLIQSSQIPTFYYITLNGLSVGSTRLPIDPSAFALNSNNGTGGIIIDSGTTLTYFVN 325
Query: 322 MVYEPLVSKIISQQPDLKVHTVHDEYT----CFQY-SESVDEGFPNVTFHFENSVSLKVY 376
Y+ + + ISQ + + V+ + CFQ S+ + P HF+ L++
Sbjct: 326 NAYQSVRQEFISQ---INLPVVNGSSSGFDLCFQTPSDPSNLQIPTFVMHFDGG-DLELP 381
Query: 377 PHEYLF-PFEDLWCI--GWQNSGMQ---SRDRKNMTLLGD 410
Y P L C+ G + GM + ++NM ++ D
Sbjct: 382 SENYFISPSNGLICLAMGSSSQGMSIFGNIQQQNMLVVYD 421
>gi|296085498|emb|CBI29230.3| unnamed protein product [Vitis vinifera]
Length = 542
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 96/331 (29%), Positives = 142/331 (42%), Gaps = 54/331 (16%)
Query: 73 VGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGK 132
G Y + IGTPP VDTGSD+ W C C C ++ + L+D K+SST +
Sbjct: 89 AGEYLMNLYIGTPPVPVIAIVDTGSDLTWTQCRPCTHCYKQV-----VPLFDPKNSSTYR 143
Query: 133 FVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTS 192
+C FC + G C+ C + Y DGS T G + + D +G + S
Sbjct: 144 DSSCGTSFCLAL--GKDRSCSKEKKCTFRYSYADGSFTGGNLASETLTVDSTAG--KPVS 199
Query: 193 TNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL---- 248
G FGCG G D ++ GI+G G S+ISQL S+ + +F++CL
Sbjct: 200 FPG-FAFGCGHSSGGIFDKSSS----GIVGLGGGELSLISQLKST--INGLFSYCLLPVS 252
Query: 249 ------DGINGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGV 302
IN G A G V TPL YS T V+ G
Sbjct: 253 TDSSISSRINFG---ASGRVSGYGTVSTPLRLPYKGYS-KKTEVEEG------------- 295
Query: 303 GDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQ--YSESVDEGF 360
I+DSGTT +LP+ Y L + + +K V D F Y+ + +
Sbjct: 296 ----NIIVDSGTTYTFLPQEFYSKLEKSVANS---IKGKRVRDPNGIFSLCYNTTAEINA 348
Query: 361 PNVTFHFENSVSLKVYP-HEYLFPFEDLWCI 390
P +T HF+++ ++++ P + ++ EDL C
Sbjct: 349 PIITAHFKDA-NVELQPLNTFMRMQEDLVCF 378
>gi|449434466|ref|XP_004135017.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 525
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 81/269 (30%), Positives = 125/269 (46%), Gaps = 28/269 (10%)
Query: 77 YAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKEC-PRRSSL---GIELTLYDIKDSSTGK 132
Y + +GTP + V +DTGSD+ WV C C C P S EL++Y K SST K
Sbjct: 113 YTTVQLGTPGTKFMVALDTGSDLFWVPC-DCSRCAPTEGSPYASDFELSVYSPKKSSTSK 171
Query: 133 FVTCDQEFCHGVYGGPLTDCT-ANTSCPYLEIYGDG-SSTTGYFVQDVVQYDKVSGDLQT 190
V C+ C CT A +CPY+ Y +STTG ++D++ + +
Sbjct: 172 TVPCNNNLC-----AQRDQCTEAFGNCPYVVSYVSAETSTTGILIEDLLHLK--TEHKHS 224
Query: 191 TSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDG 250
+ FGCG QSG+ + A +G+ G G S+ S L+ G + F+ C
Sbjct: 225 EPIQAYITFGCGQVQSGSF--LDVAAPNGLFGLGMEQISVPSILSREGLMANSFSMCFSD 282
Query: 251 INGGGIFAIGHVVQPEVNKTPLVPNQ--PHYSINMTAVQVGLDFLNLPTDVFGVGDNKGT 308
+G G G E +TP NQ P+Y+I +T+++VG ++ D+
Sbjct: 283 -DGVGRINFGDKGSLEQEETPFNLNQLHPNYNITVTSIRVGTTLID--ADI-------TA 332
Query: 309 IIDSGTTLAYLPEMVYEPLVSKIISQQPD 337
+ DSGT+ +Y + +Y L + +Q D
Sbjct: 333 LFDSGTSFSYFTDPIYSKLSASFHAQTRD 361
>gi|224142013|ref|XP_002324355.1| predicted protein [Populus trichocarpa]
gi|222865789|gb|EEF02920.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 92/365 (25%), Positives = 153/365 (41%), Gaps = 57/365 (15%)
Query: 51 RRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQC--- 107
+ Q + +P GG+ Y +G+GTP KD+ + DTGSD+ W C C
Sbjct: 123 KEMQTTIPASIVPTGGA--------YVVTVGLGTPKKDFTLSFDTGSDLTWTQCEPCLGG 174
Query: 108 ---KECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGG--PLTDCTANTSCPYLE 162
+ P+ +D S++ K V+C EFC + G P DC +NT C Y
Sbjct: 175 CFPQNQPK----------FDPTTSTSYKNVSCSSEFCKLIAEGNYPAQDCISNT-CLYGI 223
Query: 163 IYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIG 222
YG G T G+ + + + ++ + +FGC G + T G++G
Sbjct: 224 QYGSG-YTIGFLATETLA-------IASSDVFKNFLFGCSEESRGTFNGTT-----GLLG 270
Query: 223 FGKSNSSMISQLASSGGVRKMFAHCLDGI-NGGGIFAIGHVVQPEVNKTPLVPNQPH-YS 280
G+S ++ SQ ++ + +F++CL + G + G V TP+ P Y
Sbjct: 271 LGRSPIALPSQ--TTNKYKNLFSYCLPASPSSTGHLSFGVEVSQAAKSTPISPKLKQLYG 328
Query: 281 INMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKV 340
+N + V LP + G TIIDSGTT +LP Y L S + +
Sbjct: 329 LNTVGISV--RGRELPIN----GSISRTIIDSGTTFTFLPSPTYSALGSAFREMMANYTL 382
Query: 341 HTVHDEY-TCFQYSESVDEG---FPNVTFHFENSVSLKVYPHEYLFPFEDL--WCIGWQN 394
+ C+ +S ++ G P ++ FE V +++ + P L C+ + +
Sbjct: 383 TNGTSSFQPCYDFS-NIGNGTLTIPGISIFFEGGVEVEIDVSGIMIPVNGLKEVCLAFAD 441
Query: 395 SGMQS 399
+G S
Sbjct: 442 TGSDS 446
>gi|357130715|ref|XP_003566992.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 479
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 96/369 (26%), Positives = 150/369 (40%), Gaps = 64/369 (17%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNC--------------IQCKECPRRSSLG 117
G+G Y+ + +GTP + + + DTGSD+ WV C PRR+
Sbjct: 91 GIGQYFVRFRVGTPAQPFLLVADTGSDLTWVKCRPAKAAAASTNSSSSASASSPRRA--- 147
Query: 118 IELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDC-TANTSCPYLEIYGDGSSTTGYF-- 174
+ + S T + C + C L+ C T + C Y Y DGS+ G
Sbjct: 148 -----FRPEKSKTWAPIPCASDTCSKSLPFSLSTCPTPGSPCAYDYRYKDGSAARGTVGT 202
Query: 175 ----VQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSM 230
+ ++ G L+ GC +G+ + EA DG++ G SN S
Sbjct: 203 ESATIALSSSSSSSKNKVKKAKLQG-LVLGC----TGSYTGPSFEASDGVLSLGYSNVSF 257
Query: 231 ISQLASSGGVRKMFAHCL----DGINGGGIFAIG----------HVVQPEVNKTPLVPN- 275
S AS G R F++CL N G P +TPLV +
Sbjct: 258 ASHAASRFGGR--FSYCLVDHLSPRNATSYLTFGPNSALSGPCPAAAGPGARQTPLVLDS 315
Query: 276 --QPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVS---K 330
+P Y +++ A+ V + L +P DV+ V G I+DSGT+L L + Y +V+ K
Sbjct: 316 RMRPFYDVSIKAISVDGELLKIPRDVWEVDGGGGVIVDSGTSLTVLAKPAYRAVVAALGK 375
Query: 331 IISQQPDLKVHTVHDEYTCFQYSESV--DEG--FPNVTFHFENSVSLKVYPHEYLF-PFE 385
+++ P + + EY C+ ++ DEG P + HF S L+ Y+
Sbjct: 376 KLARFPRVAMDPF--EY-CYNWTSPSRKDEGDDLPKLAVHFAGSARLEPPSKSYVIDAAP 432
Query: 386 DLWCIGWQN 394
+ CIG Q
Sbjct: 433 GVKCIGVQE 441
>gi|357124468|ref|XP_003563922.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 450
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 96/352 (27%), Positives = 145/352 (41%), Gaps = 42/352 (11%)
Query: 45 LKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNC 104
L A + + +A +PL S GVG Y ++G+GTP Y + VD+GS + W+ C
Sbjct: 78 LASRLATKDKDWVAASSVPLA-SGASVGVGNYITRLGLGTPTTTYVMVVDSGSSLTWLQC 136
Query: 105 IQCK-ECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPL--TDCTANTSCPYL 161
C C ++ LYD + SST V C C + L + C+ + C Y
Sbjct: 137 APCAVSCHPQAG-----PLYDPRASSTYAAVPCSAPQCAELQAATLNPSSCSGSGVCQYQ 191
Query: 162 EIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGII 221
YGDGS + GY +D V L ++ + +GCG G G+I
Sbjct: 192 ASYGDGSFSFGYLSKDTV-------SLSSSGSFPGFYYGCGQDNVGLFGRA-----AGLI 239
Query: 222 GFGKSNSSMISQLASSGGVRKMFAHCL--DGINGGGIFAIGHVVQPEVNKTP-------L 272
G ++ S++SQLA S V FA+CL G + G NK P +
Sbjct: 240 GLARNKLSLLSQLAPS--VGNSFAYCLPTSAAASAGYLSFGSNSD---NKNPGKYSYTSM 294
Query: 273 VPNQ---PHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVS 329
V + Y +++ + V L +P+ +G + TIIDSGT + LP VY L
Sbjct: 295 VSSSLDASLYFVSLAGMSVAGSPLAVPSSEYG---SLPTIIDSGTVITRLPTPVYTALSK 351
Query: 330 KIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYL 381
+ + TCF+ + P V F +L++ P L
Sbjct: 352 AVGAALAAPSAPAYSILQTCFK-GQVAKLPVPAVNMAFAGGATLRLTPGNVL 402
>gi|302763741|ref|XP_002965292.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
gi|300167525|gb|EFJ34130.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
Length = 423
Score = 99.0 bits (245), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 89/353 (25%), Positives = 149/353 (42%), Gaps = 43/353 (12%)
Query: 71 DGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSST 130
DG G Y+ +G+GTPP+ + DTGSD++W+ C+ C+ C G L++ SST
Sbjct: 76 DGSGEYFVSLGVGTPPRTVNMVADTGSDVLWLQCLPCQSC-----YGQTDPLFNPSFSST 130
Query: 131 GKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQT 190
+ +TC C + + C N C Y YGDGS T G F + + +
Sbjct: 131 FQSITCGSSLCQQLL---IRGCRRN-QCLYQVSYGDGSFTVGEFSTETLSFG-------- 178
Query: 191 TSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDG 250
++ S+ GCG G T L G+ S S + QL S +F++CL
Sbjct: 179 SNAVNSVAIGCGHNNQGLF--TGAAGLLGLGKGLLSFPSQVGQLYGS-----VFSYCLPT 231
Query: 251 INGGGIFAI---GHVVQPEVNKTPLVPN---QPHYSINMTAVQVGLDFLNLPTDVFGVGD 304
G + V T L+ N Y + M ++VG +N+P +
Sbjct: 232 RESTGSVPLIFGNQAVASNAQFTTLLTNPKLDTFYYVEMVGIKVGGTSVNIPAGSLSLDS 291
Query: 305 ---NKGTIIDSGTTLAYLPEMVYEPLVSKIISQQP-DLKVHTVHDEY-TCFQYSESVDEG 359
N G I+DSGT + L Y P+ + P D K+ + + TC+ S
Sbjct: 292 STGNGGVILDSGTAVTRLVTSAYNPMRDAFRAGMPSDAKMTSGFSLFDTCYDLSGRSSIM 351
Query: 360 FPNVTFHFENSVSLKVYPHEYLFPFED--LWCIGWQNSGMQSRDRKNMTLLGD 410
P V+F F ++ + + P ++ +C+ + + + +N +++G+
Sbjct: 352 LPAVSFVFNGGATMALPAQNIMVPVDNSGTYCLAF------APNSENFSIIGN 398
>gi|125533812|gb|EAY80360.1| hypothetical protein OsI_35532 [Oryza sativa Indica Group]
Length = 428
Score = 99.0 bits (245), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 95/344 (27%), Positives = 150/344 (43%), Gaps = 58/344 (16%)
Query: 75 LYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKEC---PRRSSLGIELTLYDIKDSSTG 131
LY +G+GTP K V++DTGS WV C +C C PR T + ++
Sbjct: 81 LYVISVGLGTPAKTQIVEIDTGSSTSWVFC-ECDGCHTNPR--------TFLQSRSTTCA 131
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTS---CPYLEIYGDGSSTTGYFVQDVVQYDKVSGDL 188
K V+C C + GG C + + CP+ Y DGS++ G QD + + D+
Sbjct: 132 K-VSCGTSMC--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF----SDV 184
Query: 189 QTTSTNGSLIFGCGARQSGNLDS--TNEEA-LDGIIGFGKSNSSMISQLASSGGVRKMFA 245
Q S FGC NLDS NE +DG++G G S++ Q S F+
Sbjct: 185 QKIP---SFTFGC------NLDSFGANEFGNVDGLLGMGAGPMSVLKQ---SSPRFDGFS 232
Query: 246 HCLDGING--------GGIFAIGHVV-QPEVNKTPLVPNQPH---YSINMTAVQVGLDFL 293
+CL G F++G V + +V T +V + + + +++ A+ V + L
Sbjct: 233 YCLPLQKSERGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERL 292
Query: 294 NLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYS 353
L +F KG + DSG+ L+Y+P+ L +I E C+
Sbjct: 293 GLSPSIFS---RKGVVFDSGSELSYIPDRALSVLSQRIRELLLRRGAAEEESERNCYDM- 348
Query: 354 ESVDEG-FPNVTFHFENSVSLKVYPH----EYLFPFEDLWCIGW 392
SVDEG P ++ HF++ + H E +D+WC+ +
Sbjct: 349 RSVDEGDMPAISLHFDDGARFDLGSHGVFVERSVQEQDVWCLAF 392
>gi|413953782|gb|AFW86431.1| hypothetical protein ZEAMMB73_038825 [Zea mays]
Length = 462
Score = 99.0 bits (245), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 92/319 (28%), Positives = 141/319 (44%), Gaps = 37/319 (11%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCK-ECPRRSSLGIELTLYDIKDSSTGKFV 134
+ +G GTP + Y + DTGSD+ W+ C+ C C ++ ++D S+T V
Sbjct: 120 FVVTVGFGTPAQTYTLMFDTGSDVSWIQCLPCSGHCYKQHD-----PIFDPTKSATYSAV 174
Query: 135 TCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTN 194
C C G C++N +C Y YGDGSST G V+ ++ +S L +
Sbjct: 175 PCGHPQCAAAGG----KCSSNGTCLYKVQYGDGSSTAG-----VLSHETLS--LTSARAL 223
Query: 195 GSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGG 254
FGCG G+ +DG+IG G+ S+ SQ A+S F++CL N
Sbjct: 224 PGFAFGCGETNLGDFGD-----VDGLIGLGRGQLSLSSQAAAS--FGAAFSYCLPSYNTS 276
Query: 255 -GIFAIGHVVQPE----VNKTPLVPNQPH---YSINMTAVQVGLDFLNLPTDVFGVGDNK 306
G IG V T ++ Q + Y +++ ++ VG L +P +F
Sbjct: 277 HGYLTIGTTTPASGSDGVRYTAMIQKQDYPSFYFVDLVSIVVGGFVLPVPPILF---TRD 333
Query: 307 GTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEY-TCFQYSESVDEGFPNVTF 365
GT++DSGT L YLP Y L + K +D + TC+ ++ P V+F
Sbjct: 334 GTLLDSGTVLTYLPPEAYTALRDRFKFTMTQYKPAPAYDPFDTCYDFAGQNAIFMPLVSF 393
Query: 366 HFENSVSLKVYPHEYL-FP 383
F + S + P L FP
Sbjct: 394 KFSDGSSFDLSPFGVLIFP 412
>gi|125558629|gb|EAZ04165.1| hypothetical protein OsI_26307 [Oryza sativa Indica Group]
Length = 455
Score = 99.0 bits (245), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 93/292 (31%), Positives = 129/292 (44%), Gaps = 48/292 (16%)
Query: 71 DGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSST 130
+G G Y I +GTPP D+ V VDTGS+++W C C C R + L SST
Sbjct: 86 NGAGAYNMNISLGTPPLDFPVIVDTGSNLIWAQCAPCTRCFPRPTPAPVL---QPARSST 142
Query: 131 GKFVTCDQEFCH--GVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDL 188
+ C+ FC P T C A +C Y YG G T GY + + GD
Sbjct: 143 FSRLPCNGSFCQYLPTSSRPRT-CNATAACAYNYTYGSG-YTAGYLATETLTV----GD- 195
Query: 189 QTTSTNGSLIFGCGARQSGNLDSTNEEALD---GIIGFGKSNSSMISQLASSGGVRKMFA 245
T + FGC + E +D GI+G G+ S++SQLA F+
Sbjct: 196 ---GTFPKVAFGC----------STENGVDNSSGIVGLGRGPLSLVSQLAVG-----RFS 237
Query: 246 HCL--DGINGGG---IF-AIGHVVQPE-VNKTPLVPN-----QPHYSINMTAVQVGLDFL 293
+CL D +GG +F ++ + + V TPL+ N HY +N+T + V L
Sbjct: 238 YCLRSDMADGGASPILFGSLAKLTERSVVQSTPLLKNPYLQRSTHYYVNLTGIAVDSTEL 297
Query: 294 NLPTDVFG---VGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHT 342
+ FG G GTI+DSGTTL YL + Y + SQ +L T
Sbjct: 298 PVTGSTFGFTQTGLGGGTIVDSGTTLTYLAKDGYAMVKQAFQSQMANLNQTT 349
>gi|224126551|ref|XP_002329582.1| predicted protein [Populus trichocarpa]
gi|222870291|gb|EEF07422.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 99.0 bits (245), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 92/340 (27%), Positives = 142/340 (41%), Gaps = 43/340 (12%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKF 133
G Y + +GTPP + DTGSD++W C C +C ++ + L+D K S T +
Sbjct: 91 GEYLMSLSLGTPPFEILAIADTGSDLIWTQCTPCDKCYKQIA-----PLFDPKSSKTYRD 145
Query: 134 VTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTST 193
++CD C + G + C++ C Y YGD S T G D V ST
Sbjct: 146 LSCDTRQCQNL--GESSSCSSEQLCQYSYYYGDRSFTNGNLAVDTVTLP---------ST 194
Query: 194 NGSLIF------GCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHC 247
NG ++ GCG R +G D + GIIG G S+ISQ+ SS G + F++C
Sbjct: 195 NGGPVYFPKTVIGCGRRNNGTFDKKDS----GIIGLGGGPMSLISQMGSSVGGK--FSYC 248
Query: 248 L-------DGINGGGIFAIGHVVQPE-VNKTPLVPNQP--HYSINMTAVQVGLDFLNLPT 297
L G + F VV V TPL+ P Y + + A+ VG D
Sbjct: 249 LVPFSSESAGNSSKLHFGRNAVVSGSGVQSTPLISKNPDTFYYLTLEAMSVG-DKKIEFG 307
Query: 298 DVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQ--YSES 355
G IIDSGT+L P + + + + + D Y +
Sbjct: 308 GSSFGGSEGNIIIDSGTSLTLFPVNFFTEFATAV--ENAVINGERTQDASGLLSHCYRPT 365
Query: 356 VDEGFPNVTFHFENSVSLKVYPHEYLFPFEDLWCIGWQNS 395
D P +T HF + + + ++ +D+ C+ + ++
Sbjct: 366 PDLKVPVITAHFNGADVVLQTLNTFILISDDVLCLAFNST 405
>gi|326530426|dbj|BAJ97639.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 479
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 91/326 (27%), Positives = 144/326 (44%), Gaps = 36/326 (11%)
Query: 71 DGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSST 130
+G G Y+ ++G+G+PP + Y+ VD+GSD++W+ C C EC +++ L+D S++
Sbjct: 128 EGSGEYFVRVGVGSPPTEQYLVVDSGSDVIWIQCRPCAECYQQAD-----PLFDPAASAS 182
Query: 131 GKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQT 190
V CD C + GG + C + +C Y YGDGS T G + + + GD +
Sbjct: 183 FTAVPCDSGVCRTLPGGS-SGCADSGACRYQVSYGDGSYTQGVLAMETLTF----GD--S 235
Query: 191 TSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL-- 248
T G I GCG R G G++G G S++ QL + F++CL
Sbjct: 236 TPVQGVAI-GCGHRNRGLFVGAA-----GLLGLGWGPMSLVGQLGGA--AGGAFSYCLAS 287
Query: 249 ---DGINGGGIFAIGHVVQPEVNKTPLVPN--QPH-YSINMTAVQVGLDFLNLPTDVFGV 302
D G +F + PL+ N QP Y + +T + VG + L L +F +
Sbjct: 288 RGADAGAGSLVFGRDDAMPVGAVWVPLLRNAQQPSFYYVGLTGLGVGGERLPLQDGLFDL 347
Query: 303 GDN--KGTIIDSGTTLAYLPEMVYEPL----VSKIISQQPDLKVHTVHDEYTCFQYSESV 356
++ G ++D+GT + LP Y L S I P ++ D TC+ S
Sbjct: 348 TEDGGGGVVMDTGTAVTRLPPDAYAALRDAFASTIGGDLPRAPGVSLLD--TCYDLSGYA 405
Query: 357 DEGFPNVTFHFENSVSLKVYPHEYLF 382
P V +F + P L
Sbjct: 406 SVRVPTVALYFGRDGAALTLPARNLL 431
>gi|125553822|gb|EAY99427.1| hypothetical protein OsI_21398 [Oryza sativa Indica Group]
Length = 469
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 80/303 (26%), Positives = 130/303 (42%), Gaps = 38/303 (12%)
Query: 93 VDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTD- 151
+DT SD+ WV QC CP + LYD SS+ +C+ C + GP +
Sbjct: 148 LDTASDVTWV---QCSPCPTPPCYPQKDVLYDPTKSSSSGVFSCNSPTCTQL--GPYANG 202
Query: 152 CTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDS 211
CT N C Y Y DG+ST G ++ D++ + + S FGC G+
Sbjct: 203 CTNNNQCQYRVRYPDGTSTAGTYISDLL-------TITPATAVRSFQFGCSHGVQGSFSF 255
Query: 212 TNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIG--------HVV 263
+ A GI+ G S++SQ A++ G ++F+HC G F +G +V+
Sbjct: 256 GSSAA--GIMALGGGPESLVSQTAATYG--RVFSHCFPPPTRRGFFTLGVPRVAAWRYVL 311
Query: 264 QPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMV 323
P + K P +P Y + + A+ V + +P VF G +DS T + LP
Sbjct: 312 TPML-KNPAIPPT-FYMVRLEAIAVAGQRIAVPPTVFAA----GAALDSRTAITRLPPTA 365
Query: 324 YEPLVS----KIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHE 379
Y+ L ++ QP + TC+ + P +T F+ + ++++ P
Sbjct: 366 YQALRQAFRDRMAMYQPAPPKGPLD---TCYDMAGVRSFALPRITLVFDKNAAVELDPSG 422
Query: 380 YLF 382
LF
Sbjct: 423 VLF 425
>gi|302762735|ref|XP_002964789.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
gi|300167022|gb|EFJ33627.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
Length = 390
Score = 98.6 bits (244), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 98/369 (26%), Positives = 158/369 (42%), Gaps = 44/369 (11%)
Query: 46 KEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCI 105
+H RR + +L + G S G G Y+A++GIG+P + YY+++DTGSD+ W+ C
Sbjct: 18 SDHRHRRGRSLLQTAQVSSGLSL---GSGEYFARMGIGSPQRSYYLELDTGSDVTWIQCA 74
Query: 106 QCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTA--NTSCPYLEI 163
C C + +YD +SS+ + V C C + D +A C Y +
Sbjct: 75 PCSSCYSQVD-----PIYDPSNSSSYRRVYCGSALCQAL------DYSACQGMGCSYRVV 123
Query: 164 YGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGF 223
YGD S+++G D+ G +T+ ++ FGCG SG
Sbjct: 124 YGDSSASSG----DLGIESFYLGPNSSTAMR-NIAFGCGHSNSGLFRGEAGLLGM----- 173
Query: 224 GKSNSSMISQLASSGGVRKMFAHCL-----DGINGGGIFAIGHVVQPEVNK-TPLVPNQP 277
G S SQ+A+S G F++CL + G P + TPL+ N
Sbjct: 174 GGGTLSFFSQIAASIG--PAFSYCLVDRYSQLQSRSSPLIFGRTAIPFAARFTPLLKNPR 231
Query: 278 ----HYSINMTAVQVGLDFLNLPTDVFGVGDN--KGTIIDSGTTLAYLPEMVYEPLVSKI 331
+Y+I +T + VG L +P F + N G I+DSGT++ + Y L
Sbjct: 232 IDTFYYAI-LTGISVGGTALPIPPAQFALTGNGTGGAILDSGTSVTRVVPAAYAVLRDAY 290
Query: 332 ISQQPDL-KVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFED--LW 388
+ +L V+ TCF + P++ HF+N V + + L P + +
Sbjct: 291 RAASRNLPPAPGVYLLDTCFNFQGLPTVQIPSLVLHFDNDVDMVLPGGNILIPVDRSGTF 350
Query: 389 CIGWQNSGM 397
C+ + S M
Sbjct: 351 CLAFAPSSM 359
>gi|223946005|gb|ACN27086.1| unknown [Zea mays]
Length = 336
Score = 98.6 bits (244), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 82/309 (26%), Positives = 143/309 (46%), Gaps = 42/309 (13%)
Query: 93 VDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDC 152
+DTGSD++W C C C + + +D+K S+T + + C C + +
Sbjct: 1 MDTGSDLIWTQCAPCLLCADQPT-----PYFDVKKSATYRALPCRSSRCASLS----SPS 51
Query: 153 TANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDST 212
C Y YGD +ST G + + + + +TN + FGCG+ +G+L ++
Sbjct: 52 CFKKMCVYQYYYGDTASTAGVLANETFTF-GAANSTKVRATN--IAFGCGSLNAGDLANS 108
Query: 213 NEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGG-------GIFA----IGH 261
+ G++GFG+ S++SQL S F++CL G++A
Sbjct: 109 S-----GMVGFGRGPLSLVSQLGPS-----RFSYCLTSYLSATPSRLYFGVYANLSSTNT 158
Query: 262 VVQPEVNKTPLV--PNQPH-YSINMTAVQVGLDFLNLPTDVFGVGDN--KGTIIDSGTTL 316
V TP V P P+ Y +++ A+ +G L + VF + D+ G IIDSGT++
Sbjct: 159 SSGSPVQSTPFVINPALPNMYFLSLKAISLGTKLLPIDPLVFAINDDGTGGVIIDSGTSI 218
Query: 317 AYLPEMVYEPLVSKIISQQPDLKVHTVHDEY-TCFQY--SESVDEGFPNVTFHFENSVSL 373
+L + YE + ++S P ++ TCFQ+ +V P++ FHF+ S ++
Sbjct: 219 TWLQQDAYEAVRRGLVSAIPLPAMNDTDIGLDTCFQWPPPPNVTVTVPDLVFHFD-SANM 277
Query: 374 KVYPHEYLF 382
+ P Y+
Sbjct: 278 TLLPENYML 286
>gi|326503794|dbj|BAK02683.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 456
Score = 98.6 bits (244), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 104/380 (27%), Positives = 163/380 (42%), Gaps = 50/380 (13%)
Query: 9 LCIVLIATAAVGGVSSNHGVFSVKYRYAGRERSLSLLKEHDARRQ---QRILAGVD---- 61
+C V A+++ V NH + + ++ L EHD R QR L+G D
Sbjct: 52 VCSVTPASSSGTTVPLNHRYGPCSPAPSAKVPTILELLEHDQLRAKYIQRKLSGTDGLQP 111
Query: 62 ----LPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLG 117
+P S D + Y +GIG+P + +DTGSD+ WV C S+ G
Sbjct: 112 LDLTVPTTLGSALDTME-YVITVGIGSPAVTQTMMIDTGSDVSWVRC--------NSTDG 162
Query: 118 IELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQD 177
LTL+D S+T +C C + G D +N+ C Y YGDGS+TTG + D
Sbjct: 163 --LTLFDPSKSTTYAPFSCSSAACAQL--GNNGDGCSNSGCQYRVQYGDGSNTTGTYSSD 218
Query: 178 VVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASS 237
+ L + T FGC + + + E +DG++G G S++SQ A++
Sbjct: 219 TLA-------LSASDTVTDFHFGCSHHE----EDFDGEKIDGLMGLGGDAQSLVSQTAAT 267
Query: 238 GGVRKMFAHCLDGIN---GGGIFAIGHVVQPEVNKTPLV--PNQPH-YSINMTAVQVGLD 291
G K F++CL N G F + TP++ P P Y + + + VG
Sbjct: 268 YG--KSFSYCLPPTNRTSGFLTFGAPNGTSGGFVTTPMLRWPKAPTLYGVLLQDISVGGT 325
Query: 292 FLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEY---T 348
L + V + G+++DSGT + +LP Y L S S L+ T
Sbjct: 326 PLGIQPSVL----SNGSVMDSGTVITWLPRRAYSALSSAFRSSMTRLRHQRAAPLGILDT 381
Query: 349 CFQYSESVDEGFPNVTFHFE 368
C+ ++ V+ P V+ +
Sbjct: 382 CYDFTGLVNVSIPAVSLVLD 401
>gi|15222357|ref|NP_174430.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12322538|gb|AAG51267.1|AC027135_8 chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
gi|67633408|gb|AAY78629.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332193236|gb|AEE31357.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 445
Score = 98.6 bits (244), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 115/369 (31%), Positives = 159/369 (43%), Gaps = 58/369 (15%)
Query: 52 RQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECP 111
R +R DL G S G Y+ I IGTPP + DTGSD+ WV C C++C
Sbjct: 64 RSRRFTTKTDLQSGLISNG---GEYFMSISIGTPPSKVFAIADTGSDLTWVQCKPCQQCY 120
Query: 112 RRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTT 171
+++S L+D K SST K +CD + C + + C Y YGD S T
Sbjct: 121 KQNS-----PLFDKKKSSTYKTESCDSKTCQALSEHEEGCDESKDICKYRYSYGDNSFTK 175
Query: 172 GYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMI 231
G + + D S ++ + +FGCG G T EE GIIG G S++
Sbjct: 176 GDVATETISIDSSS---GSSVSFPGTVFGCGYNNGG----TFEETGSGIIGLGGGPLSLV 228
Query: 232 SQLASSGGVRKMFAHCLD----GINGGGIFAIGHVVQPE-------VNKTPLVPNQP--H 278
SQL SS G K F++CL NG + +G P TPL+ P +
Sbjct: 229 SQLGSSIG--KKFSYCLSHTAATTNGTSVINLGTNSIPSNPSKDSATLTTPLIQKDPETY 286
Query: 279 YSINMTAVQVGLDFLNLPTDVFGVGDNKGT-------IIDSGTTLAYLPEMVYEPLVS-- 329
Y + + AV VG LP G G N + IIDSGTTL L Y+ +
Sbjct: 287 YFLTLEAVTVGK--TKLPYTGGGYGLNGKSSKRTGNIIIDSGTTLTLLDSGFYDDFGTAV 344
Query: 330 -------KIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYP-HEYL 381
K +S L H CF+ S + G P +T HF N+ +K+ P + ++
Sbjct: 345 EESVTGAKRVSDPQGLLTH-------CFK-SGDKEIGLPAITMHFTNA-DVKLSPINAFV 395
Query: 382 FPFEDLWCI 390
ED C+
Sbjct: 396 KLNEDTVCL 404
>gi|296082464|emb|CBI21469.3| unnamed protein product [Vitis vinifera]
Length = 530
Score = 98.2 bits (243), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 76/266 (28%), Positives = 124/266 (46%), Gaps = 19/266 (7%)
Query: 75 LYYAKIGIGTPPKDYYVQVDTGSDIMWV--NCIQCK--ECPRRSSLGIELTLYDIKDSST 130
L+Y I IGTP + V +D GSD++W+ +CIQC SL +L Y SST
Sbjct: 99 LHYTWIDIGTPNISFLVALDAGSDLLWIPCDCIQCAPLSASYYGSLDRDLNQYSPSGSST 158
Query: 131 GKFVTCDQEFCHGVYGGPLTDCTANTSCPY-LEIYGDGSSTTGYFVQDVVQYDKVSGDLQ 189
K ++C + C P D + CPY + Y + +S++G ++D++ D
Sbjct: 159 SKHLSCSHQLCE---SSPNCD-SPKQLCPYTINYYSENTSSSGLLIEDILHLTSGIDDAS 214
Query: 190 TTSTNGSLIFGCGARQSGN-LDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL 248
+S +I GCG RQ+G LD A DG++G G S+ S L+ +G V+ F+ C
Sbjct: 215 NSSVRAPVIIGCGMRQTGGYLDGV---APDGLMGLGLGEISVPSFLSKAGLVKNSFSLCF 271
Query: 249 DGINGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGT 308
+ + G IF G T +P+ Y + VG++ + + +
Sbjct: 272 NDDDSGRIF-FGDQGLATQQTTLFLPSDGKYETYI----VGVEACCIGSSCIKQTSFRA- 325
Query: 309 IIDSGTTLAYLPEMVYEPLVSKIISQ 334
++DSG + +LP+ Y +V + Q
Sbjct: 326 LVDSGASFTFLPDESYRNVVDEFDKQ 351
>gi|115466060|ref|NP_001056629.1| Os06g0118700 [Oryza sativa Japonica Group]
gi|55296436|dbj|BAD68559.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113594669|dbj|BAF18543.1| Os06g0118700 [Oryza sativa Japonica Group]
gi|215767921|dbj|BAH00150.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 494
Score = 98.2 bits (243), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 80/303 (26%), Positives = 130/303 (42%), Gaps = 38/303 (12%)
Query: 93 VDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTD- 151
+DT SD+ WV QC CP + LYD SS+ +C+ C + GP +
Sbjct: 173 LDTASDVTWV---QCSPCPTPPCYPQKDVLYDPTKSSSSGVFSCNSPTCTQL--GPYANG 227
Query: 152 CTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDS 211
CT N C Y Y DG+ST G ++ D++ + + S FGC G+
Sbjct: 228 CTNNNQCQYRVRYPDGTSTAGTYISDLL-------TITPATAVRSFQFGCSHGVQGSFSF 280
Query: 212 TNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIG--------HVV 263
+ A GI+ G S++SQ A++ G ++F+HC G F +G +V+
Sbjct: 281 GSSAA--GIMALGGGPESLVSQTAATYG--RVFSHCFPPPTRRGFFTLGVPRVAAWRYVL 336
Query: 264 QPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMV 323
P + K P +P Y + + A+ V + +P VF G +DS T + LP
Sbjct: 337 TPML-KNPAIPPT-FYMVRLEAIAVAGQRIAVPPTVFAA----GAALDSRTAITRLPPTA 390
Query: 324 YEPLVS----KIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHE 379
Y+ L ++ QP + TC+ + P +T F+ + ++++ P
Sbjct: 391 YQALRQAFRDRMAMYQPAPPKGPLD---TCYDMAGVRSFALPRITLVFDKNAAVELDPSG 447
Query: 380 YLF 382
LF
Sbjct: 448 VLF 450
>gi|225438629|ref|XP_002281243.1| PREDICTED: aspartic proteinase-like protein 1-like [Vitis vinifera]
Length = 511
Score = 98.2 bits (243), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 76/266 (28%), Positives = 124/266 (46%), Gaps = 19/266 (7%)
Query: 75 LYYAKIGIGTPPKDYYVQVDTGSDIMWV--NCIQCK--ECPRRSSLGIELTLYDIKDSST 130
L+Y I IGTP + V +D GSD++W+ +CIQC SL +L Y SST
Sbjct: 80 LHYTWIDIGTPNISFLVALDAGSDLLWIPCDCIQCAPLSASYYGSLDRDLNQYSPSGSST 139
Query: 131 GKFVTCDQEFCHGVYGGPLTDCTANTSCPY-LEIYGDGSSTTGYFVQDVVQYDKVSGDLQ 189
K ++C + C P D + CPY + Y + +S++G ++D++ D
Sbjct: 140 SKHLSCSHQLCE---SSPNCD-SPKQLCPYTINYYSENTSSSGLLIEDILHLTSGIDDAS 195
Query: 190 TTSTNGSLIFGCGARQSGN-LDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL 248
+S +I GCG RQ+G LD A DG++G G S+ S L+ +G V+ F+ C
Sbjct: 196 NSSVRAPVIIGCGMRQTGGYLDGV---APDGLMGLGLGEISVPSFLSKAGLVKNSFSLCF 252
Query: 249 DGINGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGT 308
+ + G IF G T +P+ Y + VG++ + + +
Sbjct: 253 NDDDSGRIF-FGDQGLATQQTTLFLPSDGKYETYI----VGVEACCIGSSCIKQTSFRA- 306
Query: 309 IIDSGTTLAYLPEMVYEPLVSKIISQ 334
++DSG + +LP+ Y +V + Q
Sbjct: 307 LVDSGASFTFLPDESYRNVVDEFDKQ 332
>gi|15229656|ref|NP_188478.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75273882|sp|Q9LS40.1|ASPG1_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 1;
Short=AtASPG1; Flags: Precursor
gi|11994113|dbj|BAB01116.1| CND41, chloroplast nucleoid DNA binding protein-like [Arabidopsis
thaliana]
gi|23297732|gb|AAN13013.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
gi|332642583|gb|AEE76104.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 500
Score = 98.2 bits (243), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 101/378 (26%), Positives = 164/378 (43%), Gaps = 43/378 (11%)
Query: 28 VFSVKYRYAGRERS-LSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPP 86
V +++ G +RS L + D R Q L + G+S+ G G Y+++IG+GTP
Sbjct: 117 VAKIRFAVEGVDRSDLKPVYNEDTRYQTEDLTTP--VVSGASQ--GSGEYFSRIGVGTPA 172
Query: 87 KDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYG 146
K+ Y+ +DTGSD+ W+ C C +C ++S +++ SST K +TC C +
Sbjct: 173 KEMYLVLDTGSDVNWIQCEPCADCYQQSD-----PVFNPTSSSTYKSLTCSAPQCSLL-- 225
Query: 147 GPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQS 206
+ C +N C Y YGDGS T G D V + SG + ++ GCG
Sbjct: 226 -ETSACRSN-KCLYQVSYGDGSFTVGELATDTVTFGN-SGKIN------NVALGCGHDNE 276
Query: 207 GNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIG-HVVQP 265
G G S+ +Q+ ++ F++CL + G ++ + VQ
Sbjct: 277 GLFTGAAGLLGLGGGVL-----SITNQMKATS-----FSYCLVDRDSGKSSSLDFNSVQL 326
Query: 266 --EVNKTPLVPNQP---HYSINMTAVQVGLDFLNLPTDVFGV--GDNKGTIIDSGTTLAY 318
PL+ N+ Y + ++ VG + + LP +F V + G I+D GT +
Sbjct: 327 GGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTAVTR 386
Query: 319 LPEMVYEPLVSKIISQQPDLK--VHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVY 376
L Y L + +LK ++ TC+ +S P V FHF SL +
Sbjct: 387 LQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLSTVKVPTVAFHFTGGKSLDLP 446
Query: 377 PHEYLFPFED--LWCIGW 392
YL P +D +C +
Sbjct: 447 AKNYLIPVDDSGTFCFAF 464
>gi|242094534|ref|XP_002437757.1| hypothetical protein SORBIDRAFT_10g002060 [Sorghum bicolor]
gi|241915980|gb|EER89124.1| hypothetical protein SORBIDRAFT_10g002060 [Sorghum bicolor]
Length = 575
Score = 98.2 bits (243), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 81/266 (30%), Positives = 125/266 (46%), Gaps = 25/266 (9%)
Query: 75 LYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFV 134
L+YA++ +GTP + V +DTGSD+ W+ C +CK C + S T+Y SST K V
Sbjct: 120 LHYAEVEVGTPSSKFLVALDTGSDLFWLPC-ECKLCAKNGS-----TMYSPSLSSTSKTV 173
Query: 135 TCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSS-TTGYFVQDVVQYDKVSGDLQTTST 193
C C T +++SCPY Y ++ ++G V+DV+ G +
Sbjct: 174 PCGHPLCERP-DACATAGKSSSSCPYEVKYVSANTGSSGVLVEDVLHLVDGGGGGGGKAV 232
Query: 194 NGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRK-MFAHCLDGIN 252
++FGCG Q+G A G++G G S+ S LASSG V F+ C +
Sbjct: 233 QAPIVFGCGQVQTGAF--LRGAAAGGLMGLGLDKVSVPSALASSGLVASDSFSMCFS-RD 289
Query: 253 GGGIFAIGHVVQPEVNKTPLVPN---QP-HYSINMTAVQVGLDFLNLPTDVFGVGDNKGT 308
G G G P+ +TPL+ QP +Y+I++ A+ V D +
Sbjct: 290 GVGRINFGDAGSPDQAETPLIAAGSLQPSYYNISVGAITV---------DSKAMAVEFTA 340
Query: 309 IIDSGTTLAYLPEMVYEPLVSKIISQ 334
++DSGT+ YL + Y L + S+
Sbjct: 341 VVDSGTSFTYLDDPAYTFLTTNFNSR 366
>gi|168000300|ref|XP_001752854.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696017|gb|EDQ82358.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 525
Score = 98.2 bits (243), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 95/344 (27%), Positives = 154/344 (44%), Gaps = 53/344 (15%)
Query: 18 AVGGVSSNHGV-FSVKYRYAGRERSLSLLKEHDARRQ----QRILAG------VDLPLGG 66
A+ + S +G+ ++ + G ++L++HD R +RILA V +
Sbjct: 42 AIEAMRSRNGMDYAQDWPTEGTIEFQTMLRDHDVARHTRTARRILAASSMDQYVLIQGNA 101
Query: 67 SSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKEC---------PRRSSLG 117
+ + G GL+Y+ I IGTP + V +DTGSD++W+ C +C+ C PR S
Sbjct: 102 TEQLFGGGLHYSYIDIGTPNVQFLVVLDTGSDLLWIPC-ECESCAPLSAESKDPRTS--- 157
Query: 118 IELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANT-SCPY-LEIYGDGSSTTGYFV 175
+L Y SST K V C C + C A T CPY + +ST+G
Sbjct: 158 -QLNPYTPSLSSTAKPVLCSDPLCEMS-----STCMAPTDQCPYEINYVSANTSTSGALY 211
Query: 176 QDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLA 235
+D + + + SG + GCG Q+G+L A +G++G G ++ S+ ++LA
Sbjct: 212 EDYMYFMRESGG---NPVKLPVYLGCGKVQTGSL--LKGAAPNGLMGLGTTDISVPNKLA 266
Query: 236 SSGGVRKMFAHCLDGINGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNL 295
S+G + F+ C+ G G G TP++P S++M LD +
Sbjct: 267 STGQLADSFSLCISP-GGSGTLTFGDEGPAAQRTTPIIPK----SVSM------LDTYIV 315
Query: 296 PTDVFGVGDNK-----GTIIDSGTTLAYLPEMVYEPLVSKIISQ 334
D VG+ + D+GT+ YL + VY V +Q
Sbjct: 316 EIDSITVGNTNLLMASHALFDTGTSFTYLSKTVYPQFVQAYDAQ 359
>gi|356546372|ref|XP_003541600.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 433
Score = 98.2 bits (243), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 97/343 (28%), Positives = 150/343 (43%), Gaps = 50/343 (14%)
Query: 71 DGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSST 130
+G Y +GTP + +DTGSDI+W+ C CK+C +++ ++D S T
Sbjct: 84 SALGEYLISYSVGTPSLQVFGILDTGSDIIWLQCQPCKKCYEQTT-----PIFDSSKSQT 138
Query: 131 GKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQT 190
K + C C V G T C++ C Y Y DGS + G D L
Sbjct: 139 YKTLPCPSNTCQSVQG---TFCSSRKHCLYSIHYVDGSQSLG---------DLSVETLTL 186
Query: 191 TSTNGS------LIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMF 244
STNGS + GCG + + EE GI+G G+ S+I+QL+ S G + F
Sbjct: 187 GSTNGSPVQFPGTVIGCGRYNAIGI----EEKNSGIVGLGRGPMSLITQLSPSTGGK--F 240
Query: 245 AHCL--------DGINGGGIFAI---GHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFL 293
++CL +N G + G V P +K LV Y + + A VG + +
Sbjct: 241 SYCLVPGLSTASSKLNFGNAAVVSGRGTVSTPLFSKNGLV----FYFLTLEAFSVGRNRI 296
Query: 294 NLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEY-TCFQY 352
+ G G IIDSGTTL LP VY L + + +V + C++
Sbjct: 297 EFGSP--GSGGKGNIIIDSGTTLTALPNGVYSKLEAAVAKTVILQRVRDPNQVLGLCYKV 354
Query: 353 S-ESVDEGFPNVTFHFENS-VSLKVYPHEYLFPFEDLWCIGWQ 393
+ + +D P +T HF + V+L + ++ +D+ C +Q
Sbjct: 355 TPDKLDASVPVITAHFSGADVTLNAI-NTFVQVADDVVCFAFQ 396
>gi|226494448|ref|NP_001141341.1| uncharacterized protein LOC100273432 precursor [Zea mays]
gi|194704078|gb|ACF86123.1| unknown [Zea mays]
gi|413953775|gb|AFW86424.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 471
Score = 98.2 bits (243), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 97/340 (28%), Positives = 146/340 (42%), Gaps = 39/340 (11%)
Query: 57 LAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQC-KECPRRSS 115
LA V L G S GVG Y ++G+GTP Y + VDTGS + W+ C C C R+
Sbjct: 118 LASVPLSPGTSV---GVGNYVTQLGLGTPSTSYAMVVDTGSSLTWLQCSPCVVSCHRQVG 174
Query: 116 LGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPL--TDCTANTSCPYLEIYGDGSSTTGY 173
L+D + SST V C C + L + C+A+ C Y YGD S + GY
Sbjct: 175 -----PLFDPRASSTYTSVRCSASQCDELQAATLNPSACSASNVCIYQASYGDSSFSVGY 229
Query: 174 FVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQ 233
D V + S S +GCG G + G+IG ++ S++ Q
Sbjct: 230 LSTDTVSFGSTS--------YPSFYYGCGQDNEGLFGRSA-----GLIGLARNKLSLLYQ 276
Query: 234 LASSGGVRKMFAHCLDGINGGGIFAIG-HVVQPEVNKTPLVP---NQPHYSINMTAVQVG 289
LA S G F++CL G +IG + + TP+ + Y I ++ + VG
Sbjct: 277 LAPSLGYS--FSYCLPTAASTGYLSIGPYNTGHYYSYTPMASSSLDASLYFITLSGMSVG 334
Query: 290 LDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPL---VSKIISQQPDLKVHTVHDE 346
L + + + TIIDSGT + LP V+ L V++ ++ ++ D
Sbjct: 335 GSPLAVSPSEY---SSLPTIIDSGTVITRLPTAVHTALSKAVAQAMAGAQRAPAFSILD- 390
Query: 347 YTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFED 386
TCF+ ++ P V F S+K+ L +D
Sbjct: 391 -TCFE-GQASQLRVPTVVMAFAGGASMKLTTRNVLIDVDD 428
>gi|225427552|ref|XP_002266498.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 441
Score = 97.8 bits (242), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 103/335 (30%), Positives = 144/335 (42%), Gaps = 41/335 (12%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKF 133
G Y + IGTPP VDTGSD+ W C C C ++ + +D K+SST +
Sbjct: 90 GEYIMNLSIGTPPVPVIAIVDTGSDLTWTQCRPCTHCYKQV-----VPFFDPKNSSTYRD 144
Query: 134 VTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTST 193
+C FC + G C C ++ Y DGS T G + + +G + S
Sbjct: 145 SSCGTSFCLAL--GNDRSCRNGKKCTFMYSYADGSFTGGNLAVETLTVASTAG--KPVSF 200
Query: 194 NGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL----- 248
G FGC R G D E GI+G G + SMISQL S+ + F++CL
Sbjct: 201 PG-FAFGCVHRSGGIFD----EHSSGIVGLGVAELSMISQLKST--INGRFSYCLLPVFT 253
Query: 249 -----DGINGGGIFAIGHVVQPEVNKTPLV---PNQPHYSINMTAVQVGLDFLNLPTDVF 300
IN G G V TPLV P+ +Y I + VG L+
Sbjct: 254 DSSMSSRINFG---RSGIVSGAGTVSTPLVMKGPDTYYYLITLEGFSVGKKRLSYKGFSK 310
Query: 301 GVGDNKGTII-DSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDE--YTCFQYSESVD 357
+G II DSGTT YLP Y L + +K V D + Y+ +VD
Sbjct: 311 KAEVEEGNIIVDSGTTYTYLPLEFYVKLEESVAHS---IKGKRVRDPNGISSLCYNTTVD 367
Query: 358 E-GFPNVTFHFENS-VSLKVYPHEYLFPFEDLWCI 390
+ P +T HF+++ V L+ + + +L EDL C
Sbjct: 368 QIDAPIITAHFKDANVELQPW-NTFLRMQEDLVCF 401
>gi|115472517|ref|NP_001059857.1| Os07g0533600 [Oryza sativa Japonica Group]
gi|22831047|dbj|BAC15910.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50508280|dbj|BAD32129.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113611393|dbj|BAF21771.1| Os07g0533600 [Oryza sativa Japonica Group]
gi|215766673|dbj|BAG98901.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 441
Score = 97.8 bits (242), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 84/312 (26%), Positives = 128/312 (41%), Gaps = 40/312 (12%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y I IGTPP +DTGSD++W C + P R LY S+T V+
Sbjct: 92 YLVDIAIGTPPLPLTAVLDTGSDLIWTQC----DAPCRRCFPQPAPLYAPARSATYANVS 147
Query: 136 CDQEFCHGVYGGPLTDCT-ANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTN 194
C C + P + C+ +T C Y YGDG+ST G + L + +
Sbjct: 148 CRSPMCQALQ-SPWSRCSPPDTGCAYYFSYGDGTSTDGVLATETFT-------LGSDTAV 199
Query: 195 GSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGG 254
+ FGCG G+ D+++ G++G G+ S++SQL GV + F++C N
Sbjct: 200 RGVAFGCGTENLGSTDNSS-----GLVGMGRGPLSLVSQL----GVTR-FSYCFTPFNAT 249
Query: 255 G----IFAIGHVVQPEVNKTPLVPN--------QPHYSINMTAVQVGLDFLNLPTDVFGV 302
+ TP VP+ +Y +++ + VG L + VF +
Sbjct: 250 AASPLFLGSSARLSSAAKTTPFVPSPSGGARRRSSYYYLSLEGITVGDTLLPIDPAVFRL 309
Query: 303 ---GDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYT-CFQYSESVDE 358
GD G IIDSGTT L E + L + S+ H + CF +
Sbjct: 310 TPMGDG-GVIIDSGTTFTALEERAFVALARALASRVRLPLASGAHLGLSLCFAAASPEAV 368
Query: 359 GFPNVTFHFENS 370
P + HF+ +
Sbjct: 369 EVPRLVLHFDGA 380
>gi|326531454|dbj|BAJ97731.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 97.8 bits (242), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 92/319 (28%), Positives = 137/319 (42%), Gaps = 43/319 (13%)
Query: 72 GVGL--YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSS 129
GVG Y + +GTP V+VDTGSD+ WV QCK C + L+D SS
Sbjct: 137 GVGTFQYVVTVSLGTPGVSQTVEVDTGSDVSWV---QCKPCSAPACNSQRDQLFDPAKSS 193
Query: 130 TGKFVTCDQEFCH--GVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGD 187
T V C + C +Y C+ + C Y+ YGDGS+TTG + D +
Sbjct: 194 TYSAVPCGADACSELRIY---EAGCS-GSQCGYVVSYGDGSNTTGVYGSDTLA------- 242
Query: 188 LQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASS-GGVRKMFAH 246
L +T G+ +FGCG Q+G +DG++ G+ + S+ SQ A + GGV F++
Sbjct: 243 LAPGNTVGTFLFGCGHAQAGMF-----AGIDGLLALGRQSMSLKSQAAGAYGGV---FSY 294
Query: 247 CLDGI-NGGGIFAIGHVVQPE-VNKTPLV---PNQPHYSINMTAVQVGLDFLNLPTDVFG 301
CL + G +G T L+ Y + +T + VG + +P F
Sbjct: 295 CLPSKQSAAGYLTLGGPTSASGFATTGLLTAWAAPTFYMVMLTGISVGGQQVAVPASAFA 354
Query: 302 VGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQ-----QPDLKVHTVHDEYTCFQYSESV 356
GT++D+GT + LP Y L S P + + D TC+ +S
Sbjct: 355 ----GGTVVDTGTVITRLPPTAYAALRSAFRGAIAPYGYPSAPANGILD--TCYDFSRYG 408
Query: 357 DEGFPNVTFHFENSVSLKV 375
P V F +L +
Sbjct: 409 VVTLPTVALTFSGGATLAL 427
>gi|357487631|ref|XP_003614103.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355515438|gb|AES97061.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 431
Score = 97.8 bits (242), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 89/311 (28%), Positives = 134/311 (43%), Gaps = 42/311 (13%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKF 133
G Y +GTPP + Y VDTGSDI+W+ C C++C ++++ +++ SS+ K
Sbjct: 85 GEYLMTYSVGTPPFNVYGVVDTGSDIVWLQCKPCEQCYKQTT-----PIFNPSKSSSYKN 139
Query: 134 VTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTST 193
+ C C V T C SC Y + D S + G + + D +G + +
Sbjct: 140 IPCSSNLCQSVR---YTSCNKQNSCEYTINFSDQSYSQGELSVETLTLDSTTGH---SVS 193
Query: 194 NGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL----- 248
+ GCG G + GI+G G S+ +QL SS G + F++CL
Sbjct: 194 FPKTVIGCGHNNRGMF----QGETSGIVGLGIGPVSLTTQLKSSIGGK--FSYCLLPLLV 247
Query: 249 -----DGINGGGIFAI---GHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVF 300
+N G + G V P V K P Q Y + + A VG + +V
Sbjct: 248 DSNKTSKLNFGDAAVVSGDGVVSTPFVKKDP----QAFYYLTLEAFSVGNKRIEF--EVL 301
Query: 301 GVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQ--YSESVDE 358
+ I+DSGTTL LP VY L S + +K+ V D YS + D+
Sbjct: 302 DDSEEGNIILDSGTTLTLLPSHVYTNLESAVAQL---VKLDRVDDPNQLLNLCYSITSDQ 358
Query: 359 -GFPNVTFHFE 368
FP +T HF+
Sbjct: 359 YDFPIITAHFK 369
>gi|88174558|gb|ABD39354.1| chloroplast nucleoid DNA-binding protein [Oryza meridionalis]
Length = 321
Score = 97.8 bits (242), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 96/343 (27%), Positives = 150/343 (43%), Gaps = 58/343 (16%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKEC---PRRSSLGIELTLYDIKDSSTGK 132
Y +G+GTP K V++DTGS WV C +C C PR T + ++ K
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFC-ECDGCHTNPR--------TFLQSRSTTCAK 51
Query: 133 FVTCDQEFCHGVYGGPLTDCTANTS---CPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQ 189
V+C C + GG C + + CP+ Y DGS++ G QD + + D+Q
Sbjct: 52 -VSCGTSMC--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF----SDVQ 104
Query: 190 TTSTNGSLIFGCGARQSGNLDS--TNEEA-LDGIIGFGKSNSSMISQLASSGGVRKMFAH 246
FGC NLDS NE +DG++G G S++ Q S F++
Sbjct: 105 KIP---GFTFGC------NLDSFGANEFGNVDGLLGMGAGPMSVLKQ---SSPTFDGFSY 152
Query: 247 CL------DGI--NGGGIFAIGHVV-QPEVNKTPLVPNQPH---YSINMTAVQVGLDFLN 294
CL G G F++G V + +V T +V + + + +++TA+ V + L
Sbjct: 153 CLPLQMSERGFFSKTTGYFSLGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLG 212
Query: 295 LPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSE 354
L VF KG + DSG+ L+Y+P+ L +I E C+
Sbjct: 213 LSPSVFS---RKGVVFDSGSELSYIPDRALSVLRQRIRELLLKRGAAEEESERNCYDM-R 268
Query: 355 SVDEG-FPNVTFHFENSVSLKVYPH----EYLFPFEDLWCIGW 392
SVDEG P ++ HF++ + H E +D+WC+ +
Sbjct: 269 SVDEGDMPAISLHFDDGARFDLGSHGVFVERSVQEQDVWCLAF 311
>gi|224140735|ref|XP_002323734.1| predicted protein [Populus trichocarpa]
gi|222866736|gb|EEF03867.1| predicted protein [Populus trichocarpa]
Length = 184
Score = 97.8 bits (242), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 55/165 (33%), Positives = 85/165 (51%), Gaps = 13/165 (7%)
Query: 42 LSLLKEHDARRQQRILAG-----VDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTG 96
L LK D R R+L G VD + GSS P V LY+ K+ +G+PP+++ VQ++TG
Sbjct: 27 LHQLKARDRLRHARLLQGFVGGVVDFSVQGSSDPYLVELYFTKVKLGSPPREFNVQINTG 86
Query: 97 SDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANT 156
SD++WV C + P SS+ + T + + C C T C++ T
Sbjct: 87 SDVLWVCYNSCNKLPAFSSISLIPTAHQLLGG-------CSNPICTSAVQTTATQCSSQT 139
Query: 157 -SCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFG 200
C Y YGDGS T+GY+V D + +D + G +++ ++FG
Sbjct: 140 DQCSYTSQYGDGSGTSGYYVSDTLYFDAILGQSLIANSSVLIVFG 184
>gi|326495920|dbj|BAJ90582.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 92/326 (28%), Positives = 137/326 (42%), Gaps = 57/326 (17%)
Query: 72 GVGL--YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSS 129
GVG Y + +GTP V+VDTGSD+ WV QCK C + L+D SS
Sbjct: 137 GVGTFQYVVTVSLGTPGVSQTVEVDTGSDVSWV---QCKPCSAPACNSQRDQLFDPAKSS 193
Query: 130 TGKFVTCDQEFCH--GVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGD 187
T V C + C +Y C+ + C Y+ YGDGS+TTG + D +
Sbjct: 194 TYSAVPCGADACSELRIY---EAGCS-GSQCGYVVSYGDGSNTTGVYGSDTLA------- 242
Query: 188 LQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASS-GGVRKMFAH 246
L +T G+ +FGCG Q+G +DG++ G+ + S+ SQ A + GGV F++
Sbjct: 243 LAPGNTVGTFLFGCGHAQAGMF-----AGIDGLLALGRQSMSLKSQAAGAYGGV---FSY 294
Query: 247 CLD------------GINGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLN 294
CL G + FA ++ T Y + +T + VG +
Sbjct: 295 CLPSKQSAAGYLTLGGPSSASGFATTGLLTAWAAPT-------FYMVMLTGISVGGQQVA 347
Query: 295 LPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQ-----QPDLKVHTVHDEYTC 349
+P F GT++D+GT + LP Y L S P + + D TC
Sbjct: 348 VPASAFA----GGTVVDTGTVITRLPPTAYAALRSAFRGAIAPCGYPSAPANGILD--TC 401
Query: 350 FQYSESVDEGFPNVTFHFENSVSLKV 375
+ +S P V F +L +
Sbjct: 402 YDFSRYGVVTLPTVALTFSGGATLAL 427
>gi|255548664|ref|XP_002515388.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545332|gb|EEF46837.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 494
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 86/314 (27%), Positives = 134/314 (42%), Gaps = 35/314 (11%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQC-KECPRRSSLGIELTLYDIKDSST 130
G G Y+ +G+GTP KD+ + DTGSD+ W C C K C + + +++ S++
Sbjct: 149 GSGNYFVTVGLGTPKKDFSLIFDTGSDLTWTQCEPCVKSCYNQ-----KEAIFNPSQSTS 203
Query: 131 GKFVTCDQEFCHGVYG--GPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDL 188
++C C + G + +C A+++C Y YGD S + G+F ++ + L
Sbjct: 204 YANISCGSTLCDSLASATGNIFNC-ASSTCVYGIQYGDSSFSIGFFGKEKLS-------L 255
Query: 189 QTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL 248
T FGCG G G+ S++SQ A K+F++CL
Sbjct: 256 TATDVFNDFYFGCGQNNKGLFGGAAGLLGL-----GRDKLSLVSQTAQR--YNKIFSYCL 308
Query: 249 -DGINGGGIFAIGHVVQPEVNKTPLVP---NQPHYSINMTAVQVGLDFLNLPTDVFGVGD 304
+ G G + TPL Y +++T + VG L + VF
Sbjct: 309 PSSSSSTGFLTFGGSTSKSASFTPLATISGGSSFYGLDLTGISVGGRKLAISPSVFSTA- 367
Query: 305 NKGTIIDSGTTLAYLPEMVYEPLVS---KIISQQPDLKVHTVHDEYTCFQYSESVDEGFP 361
GTIIDSGT + LP Y L S K++SQ P ++ D TCF +S P
Sbjct: 368 --GTIIDSGTVITRLPPAAYSALSSTFRKLMSQYPAAPALSILD--TCFDFSNHDTISVP 423
Query: 362 NVTFHFENSVSLKV 375
+ F V + +
Sbjct: 424 KIGLFFSGGVVVDI 437
>gi|356569424|ref|XP_003552901.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 536
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 98/377 (25%), Positives = 163/377 (43%), Gaps = 69/377 (18%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y+ + +GTPPK ++ +DTGSD+ W+ C C +C ++ Y+ +SS+
Sbjct: 166 GTGEYFIDMFVGTPPKHVWLILDTGSDLSWIQCDPCYDCFEQNG-----PHYNPNESSSY 220
Query: 132 KFVTCDQEFCHGVYG-GPLTDC-TANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQ 189
+ ++C C V PL C T N +CPY Y DGS+TTG F + +
Sbjct: 221 RNISCYDPRCQLVSSPDPLQHCKTENQTCPYFYDYADGSNTTGDFALETFT-------VN 273
Query: 190 TTSTNGS--------LIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVR 241
T NG ++FGCG G G+ S SQL S G
Sbjct: 274 LTWPNGKEKFKHVVDVMFGCGHWNKGFFHGAGGLLGL-----GRGPLSFPSQLQSIYG-- 326
Query: 242 KMFAHCL------DGINGGGIFAIGHVV--QPEVNKTPLV-----PNQPHYSINMTAVQV 288
F++CL ++ IF + +N T L+ P+ Y + + ++ V
Sbjct: 327 HSFSYCLTDLFSNTSVSSKLIFGEDKELLNHHNLNFTKLLAGEETPDDTFYYLQIKSIVV 386
Query: 289 GLDFLNLPTDVF-----GVGDNKGTIIDSGTTLAYLPEMVY----EPLVSKIISQQPDLK 339
G + L++P + GVG GTIIDSG+TL + P+ Y E KI QQ
Sbjct: 387 GGEVLDIPEKTWHWSSEGVG---GTIIDSGSTLTFFPDSAYDVIKEAFEKKIKLQQ---- 439
Query: 340 VHTVHDEYT---CFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFE--DLWCIGWQN 394
D++ C+ S ++ P+ HF + Y + +E ++ C+
Sbjct: 440 --IAADDFIMSPCYNVSGAMQVELPDYGIHFADGAVWNFPAENYFYQYEPDEVICL---- 493
Query: 395 SGMQSRDRKNMTLLGDF 411
+ +++ + ++T++G+
Sbjct: 494 AILKTPNHSHLTIIGNL 510
>gi|224099307|ref|XP_002311432.1| predicted protein [Populus trichocarpa]
gi|222851252|gb|EEE88799.1| predicted protein [Populus trichocarpa]
Length = 458
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 83/280 (29%), Positives = 127/280 (45%), Gaps = 27/280 (9%)
Query: 71 DGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSST 130
G G Y+ I +G+PP+ + DTGSD+ WV C CK S+ + + + S+T
Sbjct: 78 SGSGQYFVSIRLGSPPQTLLLVADTGSDLTWVRCSACKT---NCSIHPPGSTFLARHSTT 134
Query: 131 GKFVTCDQEFCHGV---YGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGD 187
C C V P +++C Y +Y DGS T+G+F ++ + SG
Sbjct: 135 FSPTHCFSSLCQLVPQPNPNPCNHTRLHSTCRYEYVYSDGSKTSGFFSKETTTLNTSSGR 194
Query: 188 LQTTSTNGSLIFGCGARQSG-NLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAH 246
S+ FGCG SG +L ++ G++G G+ S SQL G + F++
Sbjct: 195 EMKLK---SIAFGCGFHASGPSLIGSSFNGASGVMGLGRGPISFASQLGRRFG--RSFSY 249
Query: 247 CLDGIN----GGGIFAIGHVVQPEVNK------TPLV--PNQP-HYSINMTAVQVGLDFL 293
CL IG VV + + TPL+ P P Y I++ V V L
Sbjct: 250 CLLDYTLSPPPTSYLMIGDVVSTKKDNKSMMSFTPLLINPEAPTFYYISIKGVFVDGVKL 309
Query: 294 NLPTDVFGVGD--NKGTIIDSGTTLAYLPEMVYEPLVSKI 331
++ V+ + + N GT+IDSGTTL +L E Y ++S
Sbjct: 310 HIDPSVWSLDELGNGGTVIDSGTTLTFLTEPAYREILSAF 349
>gi|357160697|ref|XP_003578847.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 421
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 87/329 (26%), Positives = 137/329 (41%), Gaps = 47/329 (14%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVN----CIQCKECPRRSSLGIELTLYDIKDSS 129
GLYY + IG PP+ Y++ VDTGSD+ W+ C+ C + P + + +
Sbjct: 56 GLYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCNKVP-----------HPLYRPT 104
Query: 130 TGKFVTCDQEFCHGVYGG--PLTDC-TANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSG 186
K V C + C ++GG C + C Y Y D S+ G + D +
Sbjct: 105 KNKIVPCVDQLCSSLHGGLSGKHKCDSPKQQCDYEIKYADQGSSLGVLLTDSFAVRLANS 164
Query: 187 DLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAH 246
+ SL FGCG Q ST DG++G G + S++SQL G + + H
Sbjct: 165 SI----VRPSLAFGCGYDQQVG-SSTEVAPTDGVLGLGSGSISLLSQLKQHGITKNVVGH 219
Query: 247 CLDGINGGGIFAIGHVVQPEVNKT--PLVPN--QPHYSINMTAVQVGLDFLNL-PTDVFG 301
CL I GGG G + P T P+V + + +YS ++ G L + P +V
Sbjct: 220 CLS-IRGGGFLFFGDNLVPYSRATWVPMVRSAFKNYYSPGTASLYFGGRSLGVRPMEV-- 276
Query: 302 VGDNKGTIIDSGTTLAYLPEMVYEPLV-------SKIISQQPDLKVHTVHDEYTCFQYSE 354
++DSG++ Y Y+ LV SK + + D + F+
Sbjct: 277 -------VLDSGSSFTYFGAQPYQALVTALKSDLSKTLKEVFDPSLPLCWKGKKPFKSVL 329
Query: 355 SVDEGFPNVTFHFEN--SVSLKVYPHEYL 381
V + F ++ F N +++ P YL
Sbjct: 330 DVKKEFKSLVLSFSNGKKALMEIPPENYL 358
>gi|242092898|ref|XP_002436939.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
gi|241915162|gb|EER88306.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
Length = 469
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 95/335 (28%), Positives = 142/335 (42%), Gaps = 44/335 (13%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCK--EC-PRRSSLGIELTLYDIKDSSTGK 132
Y +G GTP + +DTGSD+ WV C C +C P++ L+D SST
Sbjct: 131 YVVTLGFGTPSVPQVLLMDTGSDVSWVQCTPCNSTKCYPQKDP------LFDPSKSSTYA 184
Query: 133 FVTCDQEFCHGVYGGPLTDCTA-NTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
+ C+ + C + CT+ T C Y Y DGS + G + + + L
Sbjct: 185 PIACNTDACRKLGDHYHNGCTSGGTQCGYSVEYADGSHSRGVYSNETLT-------LAPG 237
Query: 192 STNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI 251
T FGCG Q G D DG++G G + S++ Q +S G F++CL +
Sbjct: 238 ITVEDFHFGCGRDQRGPSDK-----YDGLLGLGGAPVSLVVQTSSVYG--GAFSYCLPAL 290
Query: 252 NG-GGIFAIGHVVQPEVNKTPLV----PNQPHYS----INMTAVQVGLDFLNLPTDVFGV 302
N G +G P NK+ V + P Y+ + MT + VG L++P F
Sbjct: 291 NSEAGFLVLGS--PPSGNKSAFVFTPMRHLPGYATFYMVTMTGISVGGKPLHIPQSAF-- 346
Query: 303 GDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPN 362
G IIDSGT LPE Y L + + + D TC+ ++ + P
Sbjct: 347 --RGGMIIDSGTVDTELPETAYNALEAALRKALKAYPLVPSDDFDTCYNFTGYSNITVPR 404
Query: 363 VTFHFENSVSLKV-YPHEYLFPFEDLWCIGWQNSG 396
V F F ++ + P+ L D C+ +Q SG
Sbjct: 405 VAFTFSGGATIDLDVPNGIL--VND--CLAFQESG 435
>gi|125558631|gb|EAZ04167.1| hypothetical protein OsI_26309 [Oryza sativa Indica Group]
Length = 441
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 84/312 (26%), Positives = 128/312 (41%), Gaps = 40/312 (12%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y I IGTPP +DTGSD++W C + P R LY S+T V+
Sbjct: 92 YLVDIAIGTPPLPLTAVLDTGSDLIWTQC----DAPCRRCFPQPAPLYAPARSATYANVS 147
Query: 136 CDQEFCHGVYGGPLTDCT-ANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTN 194
C C + P + C+ +T C Y YGDG+ST G + L + +
Sbjct: 148 CRSPMCQALQ-SPWSRCSPPDTGCAYYFSYGDGTSTDGVLATETFT-------LGSDTAV 199
Query: 195 GSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGG 254
+ FGCG G+ D+++ G++G G+ S++SQL GV + F++C N
Sbjct: 200 RGVAFGCGTENLGSTDNSS-----GLVGMGRGPLSLVSQL----GVTR-FSYCFTPFNAT 249
Query: 255 G----IFAIGHVVQPEVNKTPLVPN--------QPHYSINMTAVQVGLDFLNLPTDVFGV 302
+ TP VP+ +Y +++ + VG L + VF +
Sbjct: 250 AASPLFLGSSARLSSAAKTTPFVPSPSGGARRRSSYYYLSLEGITVGDTLLPIDPAVFRL 309
Query: 303 ---GDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYT-CFQYSESVDE 358
GD G IIDSGTT L E + L + S+ H + CF +
Sbjct: 310 TPMGDG-GVIIDSGTTFTALEESAFVALARALASRVRLPLASGAHLGLSLCFAAASPEAV 368
Query: 359 GFPNVTFHFENS 370
P + HF+ +
Sbjct: 369 EVPRLVLHFDGA 380
>gi|302757235|ref|XP_002962041.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
gi|300170700|gb|EFJ37301.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
Length = 367
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 92/321 (28%), Positives = 139/321 (43%), Gaps = 44/321 (13%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKF 133
G Y +I +G+PPK + VDTGSD++W+ C C +C +S +YD SST
Sbjct: 2 GAYTMEIELGSPPKKFNAIVDTGSDLVWIQCKPCSQCYSQSD-----PIYDPSASSTFAK 56
Query: 134 VTCDQEFCHGVYGGPLTDCTANT-SCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTS 192
+C C + P + C+++ +C Y YGD SST G F + + G ++
Sbjct: 57 TSCSTSSCQSL---PASGCSSSAKTCIYGYQYGDSSSTQGDFALETLTLRSSGG---SSK 110
Query: 193 TNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGIN 252
+ FGCG SG+ GI+G G+ S+ +QL S+ + F++CL +
Sbjct: 111 AFPNFQFGCGRLNSGSFG-----GAAGIVGLGQGKISLSTQLGSA--INNKFSYCLVDFD 163
Query: 253 GGG------IFAIGHVVQPEVNKTPLVPN---QPHYSINMTAVQVGLDFLNLPT---DVF 300
IF TP++PN +Y + + + VG L+L T D
Sbjct: 164 DDSSKTSPLIFGSSASTGSGAISTPIIPNSGRSTYYFVGLEGISVGGKQLSLATRAIDFL 223
Query: 301 GVGDNK------------GTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEY- 347
V K GTI DSGTTL L + VY + S S V +
Sbjct: 224 SVRSKKKLRVRALEVNSGGTIFDSGTTLTLLDDAVYSKVKSAFASSVSLPTVDASSSGFD 283
Query: 348 TCFQYSESVDEGFPNVTFHFE 368
C+ S+S + FP +T F+
Sbjct: 284 LCYDVSKSKNFKFPALTLAFK 304
>gi|125558622|gb|EAZ04158.1| hypothetical protein OsI_26300 [Oryza sativa Indica Group]
Length = 435
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 91/294 (30%), Positives = 136/294 (46%), Gaps = 40/294 (13%)
Query: 71 DGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSST 130
+GVG Y I +GTP + V DTGSD++W C C +C ++ + + SST
Sbjct: 81 NGVGGYNMNISVGTPLLTFPVVADTGSDLIWTQCAPCTKCFQQPA-----PPFQPASSST 135
Query: 131 GKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQT 190
+ C FC + C A T C Y YG G T GY + ++ S
Sbjct: 136 FSKLPCTSSFCQ-FLPNSIRTCNA-TGCVYNYKYGSG-YTAGYLATETLKVGDASFP--- 189
Query: 191 TSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDG 250
S+ FGC + ++G +ST+ GI G G+ S+I QL GV + F++CL
Sbjct: 190 -----SVAFGC-STENGVGNSTS-----GIAGLGRGALSLIPQL----GVGR-FSYCLRS 233
Query: 251 INGGG----IF-AIGHVVQPEVNKTPLVPNQP----HYSINMTAVQVGLDFLNLPTDVFG 301
+ G +F ++ ++ V TP V N +Y +N+T + VG L + T FG
Sbjct: 234 GSAAGASPILFGSLANLTDGNVQSTPFVNNPAVHPSYYYVNLTGITVGETDLPVTTSTFG 293
Query: 302 VGDN---KGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDL-KVHTVHDEYTCFQ 351
N GTI+DSGTTL YL + YE + +SQ ++ V+ CF+
Sbjct: 294 FTQNGLGGGTIVDSGTTLTYLAKDGYEMVKQAFLSQTANVTTVNGTRGLDLCFK 347
>gi|148910602|gb|ABR18371.1| unknown [Picea sitchensis]
Length = 446
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 110/412 (26%), Positives = 173/412 (41%), Gaps = 55/412 (13%)
Query: 1 MGLCLRNCLCIVLIATAAVGGVSSNHGVFSVKYRYAG----RERSLSLLKEHDARRQQRI 56
MG+ L VL VS V+ ++ +Y E S + D R R
Sbjct: 1 MGVLTNVFLVFVLFCVCMC--VSQQADVYRLQPKYPAADNDEEGSKASFVSRDTNRIGRR 58
Query: 57 LAG---VDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVN----CIQCKE 109
L L G+ P GLYY + +G P K Y++ VD+GS++ W+ CI C +
Sbjct: 59 LQAHQTAIFSLKGNVVP--YGLYYVTMLVGNPSKPYFLDVDSGSELTWIQCDAPCISCAK 116
Query: 110 CPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGP---LTDCTANTSCPYLEIYGD 166
P LY +K G V C V G A+ C Y Y D
Sbjct: 117 GPH--------PLYKLK---KGSLVPSKDPLCAAVQAGSGHYHNHKEASQRCDYDVAYAD 165
Query: 167 GSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKS 226
+ G+ V+D V+ + + T ++ +FGCG Q +L ++ DGI+G G
Sbjct: 166 HGYSEGFLVRDSVRALLTNKTVLTANS----VFGCGYNQRESLPVSDART-DGILGLGSG 220
Query: 227 NSSMISQLASSGGVRKMFAHCL--DGINGGGIFAIGHVVQPE-VNKTPLV--PNQPHYSI 281
+S+ SQ A G ++ + HC+ G +GG +F +V + P++ P+ HY +
Sbjct: 221 MASLPSQWAKQGLIKNVIGHCIFGAGRDGGYMFFGDDLVSTSAMTWVPMLGRPSIKHYYV 280
Query: 282 NMTAVQVGLDFLNLPTDVFGVGDNKGTII-DSGTTLAYLPEMVYEPLVSKIISQQPDLKV 340
A Q ++F N P D G G G II DSG+T Y Y +S + ++
Sbjct: 281 G--AAQ--MNFGNKPLDKDGDGKKLGGIIFDSGSTYTYFTNQAYGAFLSVVKENLSGKQL 336
Query: 341 HT-VHDEY--TCFQYSE---SVDEG---FPNVTFHFENSVS--LKVYPHEYL 381
D + C++ E SV E F +T F ++ + ++++P YL
Sbjct: 337 EQDSSDSFLSLCWRRKEGFRSVAEAAAYFKPLTLKFRSTKTKQMEIFPEGYL 388
>gi|222635451|gb|EEE65583.1| hypothetical protein OsJ_21095 [Oryza sativa Japonica Group]
Length = 441
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 99/367 (26%), Positives = 157/367 (42%), Gaps = 52/367 (14%)
Query: 38 RERSLS---LLKEHDARRQQRILAGVDLPLGGSSRPDGVG------LYYAKIGIGTPPKD 88
R+R+ + + K R L+ D GG+S P +G Y +GIGTP
Sbjct: 46 RDRARTNYIVTKATGGRTAATALS--DAAGGGTSIPTFLGDSVNSLEYVVTLGIGTPAVQ 103
Query: 89 YYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCH----GV 144
V +DTGSD+ WV QCK C + L+D SS+ V CD + C G
Sbjct: 104 QTVLIDTGSDLSWV---QCKPCGAGECYAQKDPLFDPSSSSSYASVPCDSDACRKLAAGA 160
Query: 145 YGGPLTDCTANTS--CPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCG 202
YG T + + C Y YG+ ++TTG + + + L+ FGCG
Sbjct: 161 YGHGCTGVSGGAAALCEYGIEYGNRATTTGVYSTETLT-------LKPGVVVADFGFGCG 213
Query: 203 ARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGG-GIFAIG- 260
Q G E DG++G G + S++SQ +S G F++CL +GG G +G
Sbjct: 214 DHQHGPY-----EKFDGLLGLGGAPESLVSQTSSQFG--GPFSYCLPPTSGGAGFLTLGA 266
Query: 261 ------HVVQPEVNKTPL--VPNQP-HYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIID 311
++ TP+ +P+ P Y + +T + VG L +P F + G +ID
Sbjct: 267 PPNSSSSTAASGLSFTPMRRLPSVPTFYIVTLTGISVGGAPLAIPPSAF----SSGMVID 322
Query: 312 SGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEY---TCFQYSESVDEGFPNVTFHFE 368
SGT + LP Y L S S + ++ + TC+ ++ + P ++ F
Sbjct: 323 SGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNGGVLDTCYDFTGHANVTVPTISLTFS 382
Query: 369 NSVSLKV 375
++ +
Sbjct: 383 GGATIDL 389
>gi|414590468|tpg|DAA41039.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 469
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 98/417 (23%), Positives = 157/417 (37%), Gaps = 78/417 (18%)
Query: 32 KYRYAGRERSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYV 91
+ R GR+R L E D R DLP GG Y + IGTPP Y
Sbjct: 77 RSRSFGRDRDREL-AESDGRTTVSARTRKDLPNGGE--------YLMTLAIGTPPLPYAA 127
Query: 92 QVDTGSDIMWVNCIQC-KECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLT 150
DTGSD++W C C +C + + LY+ S+T + C+
Sbjct: 128 VADTGSDLIWTQCAPCGTQCFEQPA-----PLYNPASSTTFSVLPCNSSLSMCAGALAGA 182
Query: 151 DCTANTSCPYLEIYGDG------SSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGAR 204
+C Y + YG G S T F +V G + FGC
Sbjct: 183 APPPGCACMYNQTYGTGWTAGVQGSETFTFGSSAADQARVPG----------VAFGC--- 229
Query: 205 QSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLD--------------- 249
N S++ G++G G+ + S++SQL + F++CL
Sbjct: 230 --SNASSSDWNGSAGLVGLGRGSLSLVSQLGAG-----RFSYCLTPFQDTNSTSTLLLGP 282
Query: 250 --GINGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDN-- 305
+NG G+ + V P P +Y +N+T + +G L + F + +
Sbjct: 283 SAALNGTGVRSTPFVASPA-----RAPMSTYYYLNLTGISLGAKALPISPGAFSLKPDGT 337
Query: 306 KGTIIDSGTTLAYLPEMVYEPL---VSKIISQQPDLKVHTVHDEYTCFQYSESVDEG--- 359
G IIDSGTT+ L Y+ + V +++ P + CF
Sbjct: 338 GGLIIDSGTTITSLANAAYQQVRAAVKSLVTTLPTVDGSDSTGLDLCFALPAPTSAPPAV 397
Query: 360 FPNVTFHFENSVSLKVYPHEYLFPFEDLWCIGWQN------SGMQSRDRKNMTLLGD 410
P++T HF+ + + + Y+ +WC+ +N S + ++NM +L D
Sbjct: 398 LPSMTLHFDGA-DMVLPADSYMISGSGVWCLAMRNQTDGAMSTFGNYQQQNMHILYD 453
>gi|297846526|ref|XP_002891144.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297336986|gb|EFH67403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 445
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 108/347 (31%), Positives = 156/347 (44%), Gaps = 55/347 (15%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKF 133
G Y+ I IGTPP + DTGSD+ WV C C++C ++++ L+D K SST K
Sbjct: 83 GEYFMSISIGTPPSKFLAIADTGSDLTWVQCKPCQQCYKQNT-----PLFDKKKSSTYKT 137
Query: 134 VTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTST 193
+CD C+ + + +C Y YGD S T G + + D SG S
Sbjct: 138 ESCDSITCNALSEHEEGCDESRNACKYRYSYGDESFTKGEVATETISIDSSSG--SPVSF 195
Query: 194 NGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLD---- 249
G+ FGCG G T EE GIIG G S++SQL SS G K F++CL
Sbjct: 196 PGT-AFGCGYNNGG----TFEETGSGIIGLGGGPLSLVSQLGSSIG--KKFSYCLSHTSA 248
Query: 250 GINGGGIFAIG---HVVQPEVNK----TPLVPNQP--HYSINMTAVQVGLDFLNLP-TDV 299
NG + +G +P + TPL+ P +Y + + A+ VG LP T
Sbjct: 249 TTNGTSVINLGTNSMTSKPSKDSAILTTPLIQKDPETYYFLTLEAITVGK--TKLPYTGG 306
Query: 300 FGVGDNKGT------IIDSGTTLAYLPEMVYEPL---------VSKIISQQPDLKVHTVH 344
G N+ + IIDSGTTL L Y+ +K +S + H
Sbjct: 307 GGYSLNRKSKKTGNIIIDSGTTLTLLDSGFYDDFGAVVEESVTGAKRVSDPQGILTH--- 363
Query: 345 DEYTCFQYSESVDEGFPNVTFHFENSVSLKVYP-HEYLFPFEDLWCI 390
CF+ S + G P +T HF + +K+ P + ++ ED+ C+
Sbjct: 364 ----CFK-SGDKEIGLPTITMHFTGA-DVKLSPINSFVKLSEDIVCL 404
>gi|42565828|ref|NP_190704.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332645262|gb|AEE78783.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 488
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 88/333 (26%), Positives = 143/333 (42%), Gaps = 35/333 (10%)
Query: 66 GSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLG----IELT 121
G+S + L+YA + IGTP + + V +DTGSD+ W+ C C R I+L
Sbjct: 79 GNSTEEISFLHYANVTIGTPAQWFLVALDTGSDLFWLPCNCNSTCVRSMETDQGERIKLN 138
Query: 122 LYDIKDSSTGKFVTCDQEFC--HGVYGGPLTDCTANTSCPY-LEIYGDGSSTTGYFVQDV 178
+Y+ S + VTC+ C P++D CPY + GS +TG V+DV
Sbjct: 139 IYNPSKSKSSSKVTCNSTLCALRNRCISPVSD------CPYRIRYLSPGSKSTGVLVEDV 192
Query: 179 VQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSG 238
+ G+ + + + FGC Q G E A++GI+G ++ ++ + L +G
Sbjct: 193 IHMSTEEGEAR----DARITFGCSESQLGLF---KEVAVNGIMGLAIADIAVPNMLVKAG 245
Query: 239 GVRKMFAHCLDGINGGGIFAIGHVVQPEVNKTPL--VPNQPHYSINMTAVQVGLDFLNLP 296
F+ C G NG G + G + +TPL + Y +++T +VG
Sbjct: 246 VASDSFSMCF-GPNGKGTISFGDKGSSDQLETPLSGTISPMFYDVSITKFKVG------- 297
Query: 297 TDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQY---S 353
V DSGT + +L E Y L + PD ++ D F Y S
Sbjct: 298 --KVTVDTEFTATFDSGTAVTWLIEPYYTALTTNFHLSVPDRRLSKSVDSPFEFCYIITS 355
Query: 354 ESVDEGFPNVTFHFENSVSLKVYPHEYLFPFED 386
S ++ P+V+F + + V+ +F D
Sbjct: 356 TSDEDKLPSVSFEMKGGAAYDVFSPILVFDTSD 388
>gi|54290731|dbj|BAD62401.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 521
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 89/323 (27%), Positives = 136/323 (42%), Gaps = 47/323 (14%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y +GIGTP V +DTGSD+ WV QCK C + L+D SS+ V
Sbjct: 171 YVVTLGIGTPAVQQTVLIDTGSDLSWV---QCKPCGAGECYAQKDPLFDPSSSSSYASVP 227
Query: 136 CDQEFCH----GVYGGPLTDCTANTS--CPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQ 189
CD + C G YG T + + C Y YG+ ++TTG + + + L+
Sbjct: 228 CDSDACRKLAAGAYGHGCTGVSGGAAALCEYGIEYGNRATTTGVYSTETLT-------LK 280
Query: 190 TTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLD 249
FGCG Q G E DG++G G + S++SQ +S G F++CL
Sbjct: 281 PGVVVADFGFGCGDHQHGPY-----EKFDGLLGLGGAPESLVSQTSSQFG--GPFSYCLP 333
Query: 250 GINGGGIF--------------AIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNL 295
+GG F A G P + + P VP Y + +T + VG L +
Sbjct: 334 PTSGGAGFLTLGAPPNSSSSTAASGLSFTP-MRRLPSVPT--FYIVTLTGISVGGAPLAI 390
Query: 296 PTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEY---TCFQY 352
P F + G +IDSGT + LP Y L S S + ++ + TC+ +
Sbjct: 391 PPSAF----SSGMVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNGGVLDTCYDF 446
Query: 353 SESVDEGFPNVTFHFENSVSLKV 375
+ + P ++ F ++ +
Sbjct: 447 TGHANVTVPTISLTFSGGATIDL 469
>gi|88174575|gb|ABD39362.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
Group]
Length = 321
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 96/343 (27%), Positives = 149/343 (43%), Gaps = 58/343 (16%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKEC---PRRSSLGIELTLYDIKDSSTGK 132
Y +G+GTP K V++DTGS WV C +C C PR T + ++ K
Sbjct: 1 YVTSVGLGTPAKTQIVEIDTGSSTSWVFC-ECDGCHTNPR--------TFLQSRSTTCAK 51
Query: 133 FVTCDQEFCHGVYGGPLTDCTANTS---CPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQ 189
V+C C + GG C + + CP+ Y DGS++ G QD + + D+Q
Sbjct: 52 -VSCGTSMC--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF----SDVQ 104
Query: 190 TTSTNGSLIFGCGARQSGNLDS--TNEEA-LDGIIGFGKSNSSMISQLASSGGVRKMFAH 246
S FGC NLDS NE +DG++G G S++ Q S F++
Sbjct: 105 KIP---SFTFGC------NLDSFGANEFGNVDGLLGMGAGPMSVLKQ---SSPTFDGFSY 152
Query: 247 CL------DGI--NGGGIFAIGHVV-QPEVNKTPLVP---NQPHYSINMTAVQVGLDFLN 294
CL G G F++G V + +V T +V N + +++ A+ V + L
Sbjct: 153 CLPLQKSERGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLG 212
Query: 295 LPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSE 354
L +F KG + DSG+ L+Y+P+ L +I E C+
Sbjct: 213 LSPSIFS---RKGVVFDSGSELSYIPDRALSVLSQRIRELLLRRGAAEEESERNCYDM-R 268
Query: 355 SVDEG-FPNVTFHFENSVSLKVYPH----EYLFPFEDLWCIGW 392
SVDEG P ++ HF++ + H E +D+WC+ +
Sbjct: 269 SVDEGDMPAISLHFDDGARFDLGRHGVFVERSVQEQDVWCLAF 311
>gi|297841447|ref|XP_002888605.1| hypothetical protein ARALYDRAFT_475850 [Arabidopsis lyrata subsp.
lyrata]
gi|297334446|gb|EFH64864.1| hypothetical protein ARALYDRAFT_475850 [Arabidopsis lyrata subsp.
lyrata]
Length = 410
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 98/345 (28%), Positives = 141/345 (40%), Gaps = 47/345 (13%)
Query: 57 LAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNC-IQCKECPRRSS 115
L+ V L L G+ P +G Y + IG PPK + +DTGSDI WV C C C
Sbjct: 37 LSSVVLLLSGNVFP--LGYYSVLLQIGNPPKAFEFDIDTGSDITWVQCDAPCTGCNLPPK 94
Query: 116 LGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDC-TANTSCPYLEIYGDGSSTTGYF 174
L + G V C C ++ C C Y Y D S+ G
Sbjct: 95 L---------QYKPKGNTVPCSDPICLALHFPNNPQCPNPKEQCDYEVNYADQGSSMGAL 145
Query: 175 VQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQL 234
V D + ++G ++ L FGCG QS + A G++G G+ +++QL
Sbjct: 146 VIDQFPFKLLNG----SAMQPRLAFGCGYDQS-YPSAHPPPATAGVLGLGRGKIGLLTQL 200
Query: 235 ASSGGVRKMFAHCLDGINGGGIFAIGHVVQPE--VNKTPLVPNQPHYSINMTAVQVGLDF 292
S+G R + HCL GGG G + P V TPL+P HY T L F
Sbjct: 201 VSAGLTRNVVGHCLSS-KGGGYLFFGDTLIPSLGVAWTPLLPPDNHY----TTGPAELLF 255
Query: 293 LNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKV---HTVHDEYTC 349
PT + G+ I D+G++ Y Y+ +V+ I + DLKV ++ T
Sbjct: 256 NGKPTGLKGL----KLIFDTGSSYTYFNSKTYQTIVNLIGN---DLKVSPLKVAKEDKTL 308
Query: 350 ---------FQYSESVDEGFPNVTFHFENS---VSLKVYPHEYLF 382
F+ V F +T +F N+ L++ P YL
Sbjct: 309 PICWKGAKPFKSVLEVKNFFKTITINFTNARRNTQLQIPPESYLI 353
>gi|88174589|gb|ABD39369.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 96/343 (27%), Positives = 149/343 (43%), Gaps = 58/343 (16%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKEC---PRRSSLGIELTLYDIKDSSTGK 132
Y +G+GTP K V++DTGS WV C +C C PR T + ++ K
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSASWVFC-ECDGCHTNPR--------TFLQSRSTTCAK 51
Query: 133 FVTCDQEFCHGVYGGPLTDCTANTS---CPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQ 189
V+C C + GG C + + CP+ Y DGS++ G QD + + D+Q
Sbjct: 52 -VSCGTSMC--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF----SDVQ 104
Query: 190 TTSTNGSLIFGCGARQSGNLDS--TNEEA-LDGIIGFGKSNSSMISQLASSGGVRKMFAH 246
S FGC NLDS NE +DG++G G S++ Q S F++
Sbjct: 105 KIP---SFTFGC------NLDSFGANEFGNVDGLLGMGAGPMSVLKQ---SSPTFDGFSY 152
Query: 247 CL------DGI--NGGGIFAIGHVV-QPEVNKTPLVP---NQPHYSINMTAVQVGLDFLN 294
CL G G F++G V + +V T +V N + +++ A+ V + L
Sbjct: 153 CLPLQKSERGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLG 212
Query: 295 LPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSE 354
L +F KG + DSG+ L+Y+P+ L +I E C+
Sbjct: 213 LSPSIFS---RKGVVFDSGSELSYIPDRALSVLSQRIRELLLRRGAAEEESERNCYDM-R 268
Query: 355 SVDEG-FPNVTFHFENSVSLKVYPH----EYLFPFEDLWCIGW 392
SVDEG P ++ HF++ + H E +D+WC+ +
Sbjct: 269 SVDEGDMPAISLHFDDGARFDLGSHGVFVERSVQEQDVWCLAF 311
>gi|326506682|dbj|BAJ91382.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326525815|dbj|BAJ88954.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 96/348 (27%), Positives = 157/348 (45%), Gaps = 38/348 (10%)
Query: 43 SLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWV 102
SL + +D LA V L G S GVG Y ++G+GTP K Y + VDTGS + W+
Sbjct: 107 SLYRANDDAAVDGSLASVPLTPGTSY---GVGNYVTRMGLGTPAKPYIMVVDTGSSLTWL 163
Query: 103 NCIQCK-ECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPL--TDCTANTSCP 159
C C+ C R+S ++D K SS+ V+C C+ + L C+++ C
Sbjct: 164 QCSPCRVSCHRQSG-----PVFDPKTSSSYAAVSCSTPQCNDLSTATLNPAACSSSDVCI 218
Query: 160 YLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDG 219
Y YGD S + GY +D V + +++ + +GCG G + G
Sbjct: 219 YQASYGDSSFSVGYLSKDTVSFG--------SNSVPNFYYGCGQDNEGLFGRSA-----G 265
Query: 220 IIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHVVQPEVNKTPLVP---NQ 276
++G ++ S++ QLA + G F++CL + G +IG + + TP+V +
Sbjct: 266 LMGLARNKLSLLYQLAPTLGYS--FSYCLPSSSSSGYLSIGSYNPGQYSYTPMVSSTLDD 323
Query: 277 PHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQP 336
Y I ++ + V L + + + + TIIDSGT + LP VY+ L +
Sbjct: 324 SLYFIKLSGMTVAGKPLAVSSSEY---SSLPTIIDSGTVITRLPTTVYDALSKAVAGAMK 380
Query: 337 DLK---VHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYL 381
K +++ D TCF ++ P V+ F +LK+ L
Sbjct: 381 GTKRADAYSILD--TCF-VGQASSLRVPAVSMAFSGGAALKLSAQNLL 425
>gi|88174597|gb|ABD39373.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
gi|88174601|gb|ABD39375.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
gi|88174603|gb|ABD39376.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 96/343 (27%), Positives = 149/343 (43%), Gaps = 58/343 (16%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKEC---PRRSSLGIELTLYDIKDSSTGK 132
Y +G+GTP K V++DTGS WV C +C C PR T + ++ K
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFC-ECDGCHTNPR--------TFLQSRSTTCAK 51
Query: 133 FVTCDQEFCHGVYGGPLTDCTANTS---CPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQ 189
V+C C + GG C + + CP+ Y DGS++ G QD + + D+Q
Sbjct: 52 -VSCGTSMC--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF----SDVQ 104
Query: 190 TTSTNGSLIFGCGARQSGNLDS--TNEEA-LDGIIGFGKSNSSMISQLASSGGVRKMFAH 246
S FGC NLDS NE +DG++G G S++ Q S F++
Sbjct: 105 KIP---SFTFGC------NLDSFGANEFGNVDGLLGMGAGPMSVLKQ---SSPTFDGFSY 152
Query: 247 CL------DGI--NGGGIFAIGHVV-QPEVNKTPLVP---NQPHYSINMTAVQVGLDFLN 294
CL G G F++G V + +V T +V N + +++ A+ V + L
Sbjct: 153 CLPLQKSERGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLG 212
Query: 295 LPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSE 354
L +F KG + DSG+ L+Y+P+ L +I E C+
Sbjct: 213 LSPSIFS---RKGVVFDSGSELSYIPDRALSVLSQRIRELLLRRGAAEEESERNCYDM-R 268
Query: 355 SVDEG-FPNVTFHFENSVSLKVYPH----EYLFPFEDLWCIGW 392
SVDEG P ++ HF++ + H E +D+WC+ +
Sbjct: 269 SVDEGDMPAISLHFDDGARFDLGSHGVFVERSVQEQDVWCLAF 311
>gi|358345197|ref|XP_003636668.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355502603|gb|AES83806.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 294
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 75/235 (31%), Positives = 113/235 (48%), Gaps = 35/235 (14%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y ++ IGTPP Y Q DTGSD++W+ CI C C ++ + ++D + SST +
Sbjct: 59 YLMELSIGTPPVKIYAQADTGSDLIWLQCIPCTNCYKQLN-----PMFDSQSSSTFSNIA 113
Query: 136 CDQEFCHGVYGGPLTDCTAN-TSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTN 194
C E C +Y T C+ + +C Y Y DGS T G Q+ + +G+ +
Sbjct: 114 CGSESCSKLYS---TSCSPDQINCKYNYSYVDGSETQGVLAQETLTLTSTTGE--PVAFK 168
Query: 195 GSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGIN-- 252
G +IFGCG +G N++ + GIIG G+ S++SQ+ SS G MF+ CL N
Sbjct: 169 G-VIFGCGHNNNGAF---NDKEM-GIIGLGRGPLSLVSQIGSSLG-GNMFSQCLVPFNTN 222
Query: 253 -----------GGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLP 296
G + G V P V+KT Q Y + + + V + +NLP
Sbjct: 223 PSISSPMSFGKGSEVLGNGVVSTPLVSKTTY---QSFYFVTLLGISV--EDINLP 272
>gi|326490656|dbj|BAJ89995.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 456
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 89/302 (29%), Positives = 138/302 (45%), Gaps = 39/302 (12%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y +G+G+P + +DTGSD+ WV C C +C ++ +L+D SST +
Sbjct: 127 YLITVGMGSPAVAQTMLIDTGSDVSWVQCKPCSQCHSQAD-----SLFDPSSSSTYSAFS 181
Query: 136 CDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNG 195
C C + C+ ++ C Y YGDGS+ +G + D + +ST
Sbjct: 182 CTSAACAQLR---QRGCS-SSQCQYTVKYGDGSTGSGTYSSDTLALG--------SSTVE 229
Query: 196 SLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGING-G 254
+ FGC +SGNL ++ G++G G S+ +Q A + G K F++CL G
Sbjct: 230 NFQFGCSQSESGNL---LQDQTAGLMGLGGGAESLATQTAGTFG--KAFSYCLPPTPGSS 284
Query: 255 GIFAIGHVVQPEVNKTPL-----VPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTI 309
G +G V KTP+ VP+ +Y + + A++VG LN+P F + G+I
Sbjct: 285 GFLTLGASTSGFVVKTPMLRSTQVPS--YYGVLLQAIRVGGRQLNIPASAF----SAGSI 338
Query: 310 IDSGTTLAYLPEMVYEPLVSKI---ISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFH 366
+DSGT + LP Y L S + Q P + + D TCF +S P V
Sbjct: 339 MDSGTIITRLPRTAYSALSSAFKAGMKQYPPAQPMGIFD--TCFDFSGQSSVSIPTVALV 396
Query: 367 FE 368
F
Sbjct: 397 FS 398
>gi|148908573|gb|ABR17396.1| unknown [Picea sitchensis]
Length = 350
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 91/355 (25%), Positives = 151/355 (42%), Gaps = 49/355 (13%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y+ +IGIGTP ++ Y+ +DTGSD++W+ C C+EC ++ +++ S +
Sbjct: 4 GSGEYFTRIGIGTPTREQYMVLDTGSDVVWIQCEPCRECYSQAD-----PIFNPSSSVSF 58
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
V CD C + DC C Y YGDGS T G + + + + T
Sbjct: 59 STVGCDSAVCSQL---DANDCHGG-GCLYEVSYGDGSYTVGSYATETLTFG--------T 106
Query: 192 STNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL--- 248
++ ++ GCG G G S +QL + G + F++CL
Sbjct: 107 TSIQNVAIGCGHDNVGLFVGAAGLLGLGAGSL-----SFPAQLGTQTG--RAFSYCLVDR 159
Query: 249 DGINGGGI------FAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLN-LPTDVFG 301
D + G + IG + P V P +P Y ++M A+ VG L+ +P++ F
Sbjct: 160 DSESSGTLEFGPESVPIGSIFTPLV-ANPFLPT--FYYLSMVAISVGGVILDSVPSEAFR 216
Query: 302 VGDNK---GTIIDSGTTLAYLPEMVYEPLVSKIISQQPDL-KVHTVHDEYTCFQYSESVD 357
+ + G IIDSGT + L Y+ L I+ L + + TC+ S
Sbjct: 217 IDETTGRGGIIIDSGTAVTRLQTSAYDALRDAFIAGTQHLPRADGISIFDTCYDLSALQS 276
Query: 358 EGFPNVTFHFENSVSLKVYPHEYLFPFEDL--WCIGWQNSGMQSRDRKNMTLLGD 410
P V FHF N + L P + + +C + + N++++G+
Sbjct: 277 VSIPAVGFHFSNGAGFILPAKNCLIPMDSMGTFCFAFAPA------DSNLSIMGN 325
>gi|56692305|dbj|BAD80835.1| nucellin-like protein [Daucus carota]
Length = 426
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 83/274 (30%), Positives = 115/274 (41%), Gaps = 32/274 (11%)
Query: 62 LPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNC----IQCKECPRRSSLG 117
LPL G+ P G Y+ + IG PPK Y++ DTGSD+ W+ C IQC P
Sbjct: 55 LPLYGNVYPSG--YYHVQFNIGQPPKPYFLDPDTGSDLTWLQCDAPCIQCTPAPH----- 107
Query: 118 IELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQD 177
LY T V C C ++ C C Y Y DG S+ G V D
Sbjct: 108 ---PLY----QPTNDLVVCKDPICASLHPDNYR-CDDPDQCDYEVEYADGGSSIGVLVND 159
Query: 178 VVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASS 237
+ + SG L GCG Q L LDG++G G+ +SS+++QL+S
Sbjct: 160 LFPVNLTSG----MRARPRLTIGCGYDQ---LPGIAYHPLDGVLGLGRGSSSIVAQLSSQ 212
Query: 238 GGVRKMFAHCLDGINGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPT 297
G VR + HC GG +F + + +K P Y + T G L L
Sbjct: 213 GLVRNVVGHCFSRRGGGYLFFGDDIY--DSSKVIWTPMSRDYLKHYTP---GFAELILNG 267
Query: 298 DVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKI 331
G+ N + DSG++ Y Y+ L+S I
Sbjct: 268 RSSGL-KNLLVVFDSGSSYTYFNTQTYQTLLSFI 300
>gi|88174554|gb|ABD39352.1| chloroplast nucleoid DNA-binding protein [Oryza barthii]
Length = 321
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 93/345 (26%), Positives = 149/345 (43%), Gaps = 62/345 (17%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKEC---PRRSSLGIELTLYDIKDSSTGK 132
Y +G+GTP K V++DTGS WV C +C C PR T + ++ K
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSTSWVFC-ECDGCHTNPR--------TFLQSRSTTCAK 51
Query: 133 FVTCDQEFCHGVYGGPLTDCTANTS---CPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQ 189
V+C C + GG C + + CP+ Y DGS++ G QD + + D+Q
Sbjct: 52 -VSCGTSMC--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF----SDVQ 104
Query: 190 TTSTNGSLIFGC-----GARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMF 244
FGC GA + GN +DG++G G S++ Q S F
Sbjct: 105 KIP---GFSFGCNMDSFGANEFGN--------VDGLLGMGAGAMSVLKQ---SSPTFDCF 150
Query: 245 AHCL------DGI--NGGGIFAIGHVV-QPEVNKTPLVPNQPH---YSINMTAVQVGLDF 292
++CL G G F++G V + +V T +V + + + +++TA+ V +
Sbjct: 151 SYCLPLQKSERGFFSKTTGYFSLGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGER 210
Query: 293 LNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQY 352
L L +F KG + DSG+ L+Y+P+ L +I E C+
Sbjct: 211 LGLSPSIFS---RKGVVFDSGSELSYIPDRALSVLSQRIRELLLRRGAAEEESERNCYDM 267
Query: 353 SESVDEG-FPNVTFHFENSVSLKVYPH----EYLFPFEDLWCIGW 392
SVDEG P ++ HF++ + H E +D+WC+ +
Sbjct: 268 -RSVDEGDMPAISLHFDDGARFDLGSHGVFVERSVQEQDVWCLAF 311
>gi|302809855|ref|XP_002986620.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
gi|300145803|gb|EFJ12477.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
Length = 423
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 88/353 (24%), Positives = 149/353 (42%), Gaps = 43/353 (12%)
Query: 71 DGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSST 130
DG G Y+ +G+GTPP+ + DTGSD++W+ C+ C+ C G L++ SST
Sbjct: 76 DGSGEYFVSLGVGTPPRTVNMVADTGSDVLWLQCLPCQSC-----YGQTDPLFNPSFSST 130
Query: 131 GKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQT 190
+ +TC C + + C N C Y YGDGS T G F + + +
Sbjct: 131 FQSITCGSSLCQQLL---IRGCRRN-QCLYQVSYGDGSFTVGEFSTETLSFG-------- 178
Query: 191 TSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDG 250
++ S+ GCG G T L G+ S S + QL S +F++CL
Sbjct: 179 SNAVNSVAIGCGHNNQGLF--TGAAGLLGLGKGLLSFPSQVGQLYGS-----VFSYCLPT 231
Query: 251 INGGGIFAI---GHVVQPEVNKTPLVPN---QPHYSINMTAVQVGLDFLNLPTDVFGVGD 304
G + V T L+ N Y + M ++VG +++P +
Sbjct: 232 RESTGSVPLIFGNQAVASNAQFTTLLTNPKLDTFYYVEMVGIKVGGTSVSIPAGSLSLDS 291
Query: 305 ---NKGTIIDSGTTLAYLPEMVYEPLVSKIISQQP-DLKVHTVHDEY-TCFQYSESVDEG 359
N G I+DSGT + L Y P+ + P D K+ + + TC+ S
Sbjct: 292 STGNGGVILDSGTAVTRLVTSAYNPMRDAFRAGMPSDAKMTSGFSLFDTCYDLSGRSSIM 351
Query: 360 FPNVTFHFENSVSLKVYPHEYLFPFED--LWCIGWQNSGMQSRDRKNMTLLGD 410
P V+F F ++ + + P ++ +C+ + + + +N +++G+
Sbjct: 352 LPAVSFVFNGGATMALPAQNIMVPVDNSGTYCLAF------APNSENFSIIGN 398
>gi|88174587|gb|ABD39368.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
Group]
Length = 321
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 96/343 (27%), Positives = 149/343 (43%), Gaps = 58/343 (16%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKEC---PRRSSLGIELTLYDIKDSSTGK 132
Y +G+GTP K V++DTGS WV C +C C PR T + ++ K
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSASWVFC-ECDGCHTNPR--------TFLQSRSTTCAK 51
Query: 133 FVTCDQEFCHGVYGGPLTDCTANTS---CPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQ 189
V+C C + GG C + + CP+ Y DGS++ G QD + + D+Q
Sbjct: 52 -VSCGTSMC--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF----SDVQ 104
Query: 190 TTSTNGSLIFGCGARQSGNLDS--TNEEA-LDGIIGFGKSNSSMISQLASSGGVRKMFAH 246
S FGC NLDS NE +DG++G G S++ Q S F++
Sbjct: 105 KIP---SFTFGC------NLDSFGANEFGNVDGLLGMGAGPMSVLKQ---SSPTFDGFSY 152
Query: 247 CL------DGI--NGGGIFAIGHVV-QPEVNKTPLVP---NQPHYSINMTAVQVGLDFLN 294
CL G G F++G V + +V T +V N + +++ A+ V + L
Sbjct: 153 CLPLQKSERGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLG 212
Query: 295 LPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSE 354
L +F KG + DSG+ L+Y+P+ L +I E C+
Sbjct: 213 LSPSIFS---RKGVVFDSGSELSYIPDRALSVLSQRIRELLLRRGAAEEESERNCYDM-R 268
Query: 355 SVDEG-FPNVTFHFENSVSLKVYPH----EYLFPFEDLWCIGW 392
SVDEG P ++ HF++ + H E +D+WC+ +
Sbjct: 269 SVDEGDMPAISLHFDDGARFDLGSHGVFVERSVQEQDVWCLAF 311
>gi|226492465|ref|NP_001150925.1| aspartic proteinase nepenthesin-1 [Zea mays]
gi|195642996|gb|ACG40966.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 472
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 99/420 (23%), Positives = 162/420 (38%), Gaps = 81/420 (19%)
Query: 32 KYRYAGRERSLSLLKEHDARRQQRILAGV--DLPLGGSSRPDGVGLYYAKIGIGTPPKDY 89
+ R GR+R L E D R + A DLP GG Y + IGTPP Y
Sbjct: 77 RSRSFGRDRDREL-AESDGRTSTTVSARTRKDLPNGGE--------YLMTLAIGTPPLPY 127
Query: 90 YVQVDTGSDIMWVNCIQC-KECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGP 148
DTGSD++W C C +C + + LY+ S+T + C+
Sbjct: 128 AAVADTGSDLIWTQCAPCGTQCFEQPA-----PLYNPASSTTFSVLPCNSSLSMCAGALA 182
Query: 149 LTDCTANTSCPYLEIYGDG------SSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCG 202
+C Y + YG G S T F +V G + FGC
Sbjct: 183 GAAPPPGCACMYYQTYGTGWTAGVQGSETFTFGSSAADQARVPG----------VAFGC- 231
Query: 203 ARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLD------------- 249
N S++ G++G G+ + S++SQL + F++CL
Sbjct: 232 ----SNASSSDWNGSAGLVGLGRGSLSLVSQLGAG-----RFSYCLTPFQDTNSTSTLLL 282
Query: 250 ----GINGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDN 305
+NG G+ + V P P +Y +N+T + +G L + F + +
Sbjct: 283 GPSAALNGTGVRSTPFVASPA-----RAPMSTYYYLNLTGISLGAKALPISPGAFSLKPD 337
Query: 306 --KGTIIDSGTTLAYLPEMVYE----PLVSKIISQQPDLKVHTVHDEYTCFQY---SESV 356
G IIDSGTT+ L Y+ + S++++ P + CF + +
Sbjct: 338 GTGGLIIDSGTTITSLANAAYQQVRAAVKSQLVTTLPTVDGSDSTGLDLCFALPAPTSAP 397
Query: 357 DEGFPNVTFHFENSVSLKVYPHEYLFPFEDLWCIGWQN------SGMQSRDRKNMTLLGD 410
P++T HF+ + + + Y+ +WC+ +N S + ++NM +L D
Sbjct: 398 PAVLPSMTLHFDGA-DMVLPADSYMISGSGVWCLAMRNQTDGAMSTFGNYQQQNMHILYD 456
>gi|88174579|gb|ABD39364.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
Group]
gi|88174585|gb|ABD39367.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
Group]
gi|88174595|gb|ABD39372.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
gi|88174599|gb|ABD39374.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
gi|88174607|gb|ABD39378.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 93/345 (26%), Positives = 148/345 (42%), Gaps = 62/345 (17%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKEC---PRRSSLGIELTLYDIKDSSTGK 132
Y +G+GTP K V++DTGS WV C +C C PR T + ++ K
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFC-ECDGCHTNPR--------TFLQSRSTTCAK 51
Query: 133 FVTCDQEFCHGVYGGPLTDCTANTS---CPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQ 189
V+C C + GG C + + CP+ Y DGS++ G QD + + D+Q
Sbjct: 52 -VSCGTSMC--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF----SDVQ 104
Query: 190 TTSTNGSLIFGC-----GARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMF 244
FGC GA + GN +DG++G G S++ Q S F
Sbjct: 105 KIP---GFSFGCNMDSFGANEFGN--------VDGLLGMGAGPMSVLKQ---SSPTFDCF 150
Query: 245 AHCLDGING--------GGIFAIGHVV-QPEVNKTPLVPNQPH---YSINMTAVQVGLDF 292
++CL G F++G V + +V T +V + + + +++TA+ V +
Sbjct: 151 SYCLPLQKSERGFFSKTTGYFSLGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGER 210
Query: 293 LNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQY 352
L L VF KG + DSG+ L+Y+P+ L +I E C+
Sbjct: 211 LGLSPSVFS---RKGVVFDSGSELSYIPDRALSVLSQRIRELLLKRGAAEEESERNCYDM 267
Query: 353 SESVDEG-FPNVTFHFENSVSLKVYPH----EYLFPFEDLWCIGW 392
SVDEG P ++ HF++ + H E +D+WC+ +
Sbjct: 268 -RSVDEGDMPAISLHFDDGARFDLGSHGVFVERSVQEQDVWCLAF 311
>gi|356525748|ref|XP_003531485.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 436
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 96/359 (26%), Positives = 147/359 (40%), Gaps = 51/359 (14%)
Query: 47 EHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQ 106
H +++ L V +P G Y + IGTPP + DT SD++WV C
Sbjct: 69 SHSDLNEKKTLERVRIPNHGE--------YLMRFYIGTPPVERLAIADTASDLIWVQCSP 120
Query: 107 CKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFC--HGVYGGPLTDCTANTSCPYLEIY 164
C+ C + L++ SST ++CD + C +Y PL C Y Y
Sbjct: 121 CETC-----FPQDTPLFEPHKSSTFANLSCDSQPCTSSNIYYCPL----VGNLCLYTNTY 171
Query: 165 GDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFG 224
GDGSST G + + + T T IFGCG+ +N+ + GI+G G
Sbjct: 172 GDGSSTKGVLCTESIHFG------SQTVTFPKTIFGCGSNNDFMHQISNK--VTGIVGLG 223
Query: 225 KSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGH-----VVQPEVNKTPLV--PNQP 277
S++SQL G + F++CL + + V TPL+ P+ P
Sbjct: 224 AGPLSLVSQLGDQIGHK--FSYCLLPFTSTSTIKLKFGNDTTITGNGVVSTPLIIDPHYP 281
Query: 278 -HYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQP 336
+Y +++ + +G L + T N IID GT L YL Y V+ + +
Sbjct: 282 SYYFLHLVGITIGQKMLQVRTT---DHTNGNIIIDLGTVLTYLEVNFYHNFVTLL---RE 335
Query: 337 DLKVHTVHDEYTC---FQYSESVDEGFPNVTFHFENSVSLKVY--PHEYLFPFEDLWCI 390
L + D+ F + + FP + F F + KV+ P F F+DL I
Sbjct: 336 ALGISETKDDIPYPFDFCFPNQANITFPKIVFQFTGA---KVFLSPKNLFFRFDDLNMI 391
>gi|148906646|gb|ABR16474.1| unknown [Picea sitchensis]
Length = 538
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 107/403 (26%), Positives = 167/403 (41%), Gaps = 80/403 (19%)
Query: 36 AGRERSLSLLKEHDARR----QQRI----------------LAGVDLPLGG---SSRPDG 72
A ER L DARR +QRI +A V GG S G
Sbjct: 134 ASYERRLEETLRRDARRVRGLEQRIEKRLRLNKDPAGSHENVAEVAAEFGGEVVSGMAQG 193
Query: 73 VGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGK 132
G Y+ +IG+GTP ++ Y+ +DTGSD++W+ C C +C + +++ S++
Sbjct: 194 SGEYFTRIGVGTPMREQYMVLDTGSDVVWIQCEPCSKCYSQVD-----PIFNPSLSASFS 248
Query: 133 FVTCDQEFC-----HGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGD 187
+ C+ C + +GG C Y YGDGS T G F +++ +
Sbjct: 249 TLGCNSAVCSYLDAYNCHGG---------GCLYKVSYGDGSYTIGSFATEMLTFG----- 294
Query: 188 LQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHC 247
TTS I GCG +G G S SQL + G + F++C
Sbjct: 295 --TTSVRNVAI-GCGHDNAGLFVGAAGLLGL-----GAGLLSFPSQLGTQTG--RAFSYC 344
Query: 248 L-DGIN--------GGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLN-LPT 297
L D + G +G ++ P + P +P Y + + ++ VG L+ +P
Sbjct: 345 LVDRFSESSGTLEFGPESVPLGSILTPLLTN-PSLPT--FYYVPLISISVGGALLDSVPP 401
Query: 298 DVFGVGDNKGT---IIDSGTTLAYLPEMVYEPLVSKIIS---QQPDLKVHTVHDEYTCFQ 351
DVF + + G I+DSGT + L VY+ + ++ Q P + ++ D TC+
Sbjct: 402 DVFRIDETSGRGGFIVDSGTAVTRLQTPVYDAVRDAFVAGTRQLPKAEGVSIFD--TCYD 459
Query: 352 YSESVDEGFPNVTFHFENSVSLKVYPHEYLFP--FEDLWCIGW 392
S P V FHF N SL + Y+ P F +C +
Sbjct: 460 LSGLPLVNVPTVVFHFSNGASLILPAKNYMIPMDFMGTFCFAF 502
>gi|356515904|ref|XP_003526637.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 421
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 80/264 (30%), Positives = 114/264 (43%), Gaps = 27/264 (10%)
Query: 73 VGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNC-IQCKEC--PRRSSLGIELTLYDIKDSS 129
+G Y + IG PPK Y + +DTGSD+ WV C C+ C PR LY
Sbjct: 61 LGYYTVSLAIGNPPKVYDLDIDTGSDLTWVQCDAPCQGCTIPRNR-------LY----KP 109
Query: 130 TGKFVTCDQEFCHGVYGGPLTDCTA-NTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDL 188
G V C C + P C N C Y Y D S+ G ++D + +G L
Sbjct: 110 NGNLVKCGDPLCKAIQSAPNHHCAGPNEQCDYEVEYADQGSSLGVLLRDNIPLKFTNGSL 169
Query: 189 QTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL 248
L FGCG Q ++ + G++G G +S++SQL S G +R + HCL
Sbjct: 170 ----ARPILAFGCGYDQK-HVGHNPSASTAGVLGLGNGKTSILSQLHSLGLIRNVVGHCL 224
Query: 249 DGINGGGIFAIGHVV-QPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKG 307
GG +F +V Q V TPL+ Q + + L F PT V G+
Sbjct: 225 SERGGGFLFFGDQLVPQSGVVWTPLL--QSSSTQHYKTGPADLFFDRKPTSVKGL----Q 278
Query: 308 TIIDSGTTLAYLPEMVYEPLVSKI 331
I DSG++ Y ++ LV+ +
Sbjct: 279 LIFDSGSSYTYFNSKAHKALVNLV 302
>gi|302758750|ref|XP_002962798.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
gi|300169659|gb|EFJ36261.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
Length = 427
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 88/345 (25%), Positives = 153/345 (44%), Gaps = 31/345 (8%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y+ ++ +GTP K + + VDTGSD+ W+ C SS YD SS+
Sbjct: 55 GSGQYFVELRVGTPAKKFPLIVDTGSDLTWIQCNPPNTTANSSS--PPAPWYDKSSSSSY 112
Query: 132 KFVTCDQEFCHGVYGGPLTDC--TANTSCPYLEIYGDGSSTTGYFVQDVVQYD------K 183
+ + C + C + + C T+ + C Y Y D S TTG + + K
Sbjct: 113 REIPCTDDECQFLPAPIGSSCSITSPSPCDYTYGYSDQSRTTGILAYETISMKSRKRSGK 172
Query: 184 VSGDLQTTSTN-GSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASS--GGV 240
+G+ +T ++ GC G + G++G G+ S+ +Q + GG+
Sbjct: 173 RAGNHKTRRIRIKNVALGCSRESVG----ASFLGASGVLGLGQGPISLATQTRHTALGGI 228
Query: 241 RKMFAHC----LDGINGGGIFAIGHVVQPEVNKTPLVPN---QPHYSINMTAVQV-GLDF 292
F++C L G N +G ++ TP+V N Q Y +N+T V V G
Sbjct: 229 ---FSYCLVDYLRGSNASSFLVMGRTHWRKLAHTPIVRNPAAQSFYYVNVTGVAVDGKPV 285
Query: 293 LNLPTDVFGV-GD-NKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCF 350
+ + +G+ GD NKGTI DSGTTL+YL E Y ++ + + + + + +
Sbjct: 286 DGIASSDWGIDGDGNKGTIFDSGTTLSYLREPAYSKVLGALNASIYLPRAQEIPEGFELC 345
Query: 351 QYSESVDEGFPNVTFHFENSVSLKVYPHEYL-FPFEDLWCIGWQN 394
+++G P + F+ +++ + Y+ E++ C+ Q
Sbjct: 346 YNVTRMEKGMPKLGVEFQGGAVMELPWNNYMVLVAENVQCVALQK 390
>gi|88174591|gb|ABD39370.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 95/343 (27%), Positives = 148/343 (43%), Gaps = 58/343 (16%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKEC---PRRSSLGIELTLYDIKDSSTGK 132
Y +G+GTP K V++DTGS WV C +C C PR T + ++ K
Sbjct: 1 YVTSVGLGTPAKTQIVEIDTGSSTSWVFC-ECDGCHTNPR--------TFLQSRSTTCAK 51
Query: 133 FVTCDQEFCHGVYGGPLTDCTANTS---CPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQ 189
V+C C + GG C + + CP+ Y DGS++ G QD + + D+Q
Sbjct: 52 -VSCGTSMC--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF----SDVQ 104
Query: 190 TTSTNGSLIFGCGARQSGNLDS--TNEEA-LDGIIGFGKSNSSMISQLASSGGVRKMFAH 246
S FGC NLDS NE +DG++G G S++ Q S F++
Sbjct: 105 KIP---SFTFGC------NLDSFGANEFGNVDGLLGMGAGPMSVLKQ---SSPTFDGFSY 152
Query: 247 CLDGING--------GGIFAIGHVV-QPEVNKTPLVP---NQPHYSINMTAVQVGLDFLN 294
CL G F++G V + +V T +V N + +++ A+ V + L
Sbjct: 153 CLPLQKSERGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLG 212
Query: 295 LPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSE 354
L +F KG + DSG+ L+Y+P+ L +I E C+
Sbjct: 213 LSPSIFS---RKGVVFDSGSELSYIPDRALSVLSQRIRELLLRRGAAEEESERNCYDM-R 268
Query: 355 SVDEG-FPNVTFHFENSVSLKVYPH----EYLFPFEDLWCIGW 392
SVDEG P ++ HF++ + H E +D+WC+ +
Sbjct: 269 SVDEGDMPAISLHFDDGARFDLGIHGVFVERSVQEQDVWCLAF 311
>gi|357137345|ref|XP_003570261.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 458
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 90/322 (27%), Positives = 140/322 (43%), Gaps = 39/322 (12%)
Query: 58 AGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLG 117
A + LP S D + Y + IGTP V +DTGSD+ WV+C R+ G
Sbjct: 108 AAITLPTTLGSALDTLA-YVITVSIGTPAMTQAVMIDTGSDVSWVHCHA------RAGAG 160
Query: 118 IELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQD 177
L +D SST +C C + G C+ N++C Y YGDGS+TTG + D
Sbjct: 161 SSL-FFDPGKSSTYTPFSCSSAACTRLEGRD-NGCSLNSTCQYTVRYGDGSNTTGTYGSD 218
Query: 178 VVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASS 237
+ L +T + FGC + S + +E+ DG++G G S++SQ A++
Sbjct: 219 TLA-------LNSTEKVENFQFGC-SETSDPGEGLDEDQTDGLMGLGGGAPSLVSQTAAT 270
Query: 238 GGVRKMFAHCLDGIN--------GGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVG 289
G F++CL G G V P + ++ P Y + + + VG
Sbjct: 271 YG--SAFSYCLPATTRSSGFLTLGASTGTSGFVTTP-MFRSRRAPT--FYFVILQGINVG 325
Query: 290 LDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKI---ISQQPDLKVHTVHDE 346
D + + VF G+I+DSGT + LP Y L + + + P + ++ D
Sbjct: 326 GDPVAISPTVFAA----GSIMDSGTIITRLPPRAYSALSAAFRAGMRRYPRARAFSILD- 380
Query: 347 YTCFQYSESVDEGFPNVTFHFE 368
TCF ++ + P V F
Sbjct: 381 -TCFDFTGQDNVSIPAVELVFS 401
>gi|88174561|gb|ABD39355.1| chloroplast nucleoid DNA-binding protein [Oryza longistaminata]
Length = 321
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 93/345 (26%), Positives = 150/345 (43%), Gaps = 62/345 (17%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKEC---PRRSSLGIELTLYDIKDSSTGK 132
Y +G+GTP K +++DTGS WV C +C C PR T + ++ K
Sbjct: 1 YVISVGLGTPSKTQILEIDTGSSTSWVFC-ECDGCHTNPR--------TFLQSRSTTCAK 51
Query: 133 FVTCDQEFCHGVYGGPLTDCTANTS---CPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQ 189
V+C C + GG C + + CP+ Y DGS++ G QD + + D+Q
Sbjct: 52 -VSCGTSMC--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF----SDVQ 104
Query: 190 TTSTNGSLIFGC-----GARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMF 244
S FGC GA + GN +DG++G G S++ Q S F
Sbjct: 105 KIP---SFSFGCNMDSFGANEFGN--------VDGLLGMGAGPMSVLKQ---SSPTFDGF 150
Query: 245 AHCL------DGI--NGGGIFAIGHVV-QPEVNKTPLVPNQPH---YSINMTAVQVGLDF 292
++CL G G F++G V + +V T +V + + + +++TA+ V +
Sbjct: 151 SYCLPLQMSERGFFSKTTGYFSLGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGER 210
Query: 293 LNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQY 352
L L +F KG + DSG+ L+Y+P+ L +I E C+
Sbjct: 211 LGLSPSIFS---RKGVVFDSGSELSYIPDRALSVLSQRIRELLLRRGAAEEESERNCYDM 267
Query: 353 SESVDEG-FPNVTFHFENSVSLKVYPH----EYLFPFEDLWCIGW 392
SVDEG P ++ HF++ + H E +D+WC+ +
Sbjct: 268 -RSVDEGDMPAISLHFDDGARFDLGSHGVFVERSVQEQDVWCLAF 311
>gi|88174605|gb|ABD39377.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 95/343 (27%), Positives = 150/343 (43%), Gaps = 58/343 (16%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKEC---PRRSSLGIELTLYDIKDSSTGK 132
Y +G+GTP K V++DTGS WV C +C C PR T + ++ K
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFC-ECDGCHTNPR--------TFLQSRSTTCAK 51
Query: 133 FVTCDQEFCHGVYGGPLTDCTANTS---CPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQ 189
V+C C + GG C + + CP+ Y DGS++ G QD + + D+Q
Sbjct: 52 -VSCGTSMC--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF----SDVQ 104
Query: 190 TTSTNGSLIFGCGARQSGNLDS--TNEEA-LDGIIGFGKSNSSMISQLASSGGVRKMFAH 246
S FGC NLDS NE +DG++G G S++ Q S F++
Sbjct: 105 KIP---SFTFGC------NLDSFGANEFGNVDGLLGMGAGPMSVLKQ---SSPRFDGFSY 152
Query: 247 CL------DGI--NGGGIFAIGHVV-QPEVNKTPLVPNQPH---YSINMTAVQVGLDFLN 294
CL G G F++G V + +V T +V + + + +++ A+ V + L
Sbjct: 153 CLPLQKSERGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLG 212
Query: 295 LPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSE 354
L +F KG + DSG+ L+Y+P+ L +I E C+
Sbjct: 213 LSPSIFS---RKGVVFDSGSELSYIPDRALSVLSQRIRELLLRRGAAEEESERNCYDM-R 268
Query: 355 SVDEG-FPNVTFHFENSVSLKVYPH----EYLFPFEDLWCIGW 392
SVDEG P ++ HF++ + H E +D+WC+ +
Sbjct: 269 SVDEGDMPAISLHFDDGARFDLGSHGVFVERSVQEQDVWCLAF 311
>gi|42565826|ref|NP_190703.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332645261|gb|AEE78782.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 528
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 73/259 (28%), Positives = 117/259 (45%), Gaps = 27/259 (10%)
Query: 75 LYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRR-SSLG----IELTLYDIKDSS 129
LYYA + +GTPP + V +DTGSD+ W+ C C R +G + L LY S+
Sbjct: 101 LYYANVSVGTPPSSFLVALDTGSDLFWLPCNCGTTCIRDLEDIGVPQSVPLNLYTPNAST 160
Query: 130 TGKFVTCDQEFCHGVYGGPLTDCTANTS-CPYLEIYGDGSSTTGYFVQDVVQYDKVSGDL 188
T + C + C G C++ +S CPY Y + + T G +QDV+ + D
Sbjct: 161 TSSSIRCSDKRCFGS-----KKCSSPSSICPYQISYSNSTGTKGTLLQDVLHL--ATEDE 213
Query: 189 QTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL 248
T ++ GCG +Q+G N +++G++G G S+ S LA + F+ C
Sbjct: 214 NLTPVKANVTLGCGQKQTGLFQRNN--SVNGVLGLGIKGYSVPSLLAKANITANSFSMCF 271
Query: 249 DGINGG-GIFAIGHVVQPEVNKTPLVPNQPH--YSINMTAVQVGLDFLNLPTDVFGVGDN 305
+ G G + G + +TP + P Y +N++ V V D P D+
Sbjct: 272 GRVIGNVGRISFGDRGYTDQEETPFISVAPSTAYGVNISGVSVAGD----PVDIRLFAK- 326
Query: 306 KGTIIDSGTTLAYLPEMVY 324
D+G++ +L E Y
Sbjct: 327 ----FDTGSSFTHLREPAY 341
>gi|302753526|ref|XP_002960187.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
gi|300171126|gb|EFJ37726.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
Length = 398
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 87/319 (27%), Positives = 139/319 (43%), Gaps = 40/319 (12%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y I +GTP K + V DTGSD++W+ C C+ C + ++D + SS+
Sbjct: 36 GGGDYVTTISLGTPAKVFSVIADTGSDLIWIQCKPCQACFNQKD-----PIFDPEGSSSY 90
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
++C C + P C+ N C Y YGDGS T G + V G+ +
Sbjct: 91 TTMSCGDTLCDSL---PRKSCSPN--CDYSYGYGDGSGTRGTLSSETVTLTSTQGE-KLA 144
Query: 192 STNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL--- 248
+ N + FGCG G+ + + G++G G+ N S +SQL G + F++CL
Sbjct: 145 AKN--IAFGCGHLNRGSFNDAS-----GLVGLGRGNLSFVSQLGDLFGHK--FSYCLVPW 195
Query: 249 -DGINGGGIFAIG-----HVVQPEVNK--TPLVPN---QPHYSINMTAVQVGLDFLNLPT 297
D + G H +++ TP++ N + Y + + + + L +P
Sbjct: 196 RDAPSKTSPMFFGDESSSHSSGKKLHYAFTPMIHNPAMESFYYVKLKDISIAGRALRIPA 255
Query: 298 DVFGVGDN--KGTIIDSGTTLAYLPEMVYE----PLVSKIISQQPDLKVHTVHDEYTCFQ 351
F + + G I DSGTTL LP+ Y+ L SK+ + D + Y
Sbjct: 256 GSFDIKPDGSGGMIFDSGTTLTLLPDAPYQIVLRALRSKVSFPEIDGSSAGLDLCYDVSG 315
Query: 352 YSESVDEGFPNVTFHFENS 370
S + P + FHFE +
Sbjct: 316 SKASYKKKIPAMVFHFEGA 334
>gi|18461217|dbj|BAB84414.1| chloroplast nucleoid DNA-binding protein cnd41-like [Oryza sativa
Japonica Group]
Length = 446
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 81/329 (24%), Positives = 134/329 (40%), Gaps = 36/329 (10%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKF 133
G Y+A +G+GTP + +DTGSD++W+ C C+ C ++D + SST +
Sbjct: 84 GEYFALVGVGTPSTKAMLVIDTGSDLVWLQCSPCRRC-----YAQRGQVFDPRRSSTYRR 138
Query: 134 VTCDQEFCHGV-YGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTS 192
V C C + + G + A C Y+ YGDGSS+TG D + + T
Sbjct: 139 VPCSSPQCRALRFPGCDSGGAAGGGCRYMVAYGDGSSSTGDLATDKLAF------ANDTY 192
Query: 193 TNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGIN 252
N ++ GCG G DS G++G G+ S+ +Q+A + G +F +CL
Sbjct: 193 VN-NVTLGCGRDNEGLFDSAA-----GLLGVGRGKISISTQVAPAYG--SVFEYCLGDRT 244
Query: 253 G----GGIFAIGHVVQPEVNK-TPLV--PNQPH-YSINMTAVQVGLD----FLNLPTDVF 300
G +P T L+ P +P Y ++M VG + F N +
Sbjct: 245 SRSTRSSYLVFGRTPEPPSTAFTALLSNPRRPSLYYVDMAGFSVGGERVTGFSNASLALD 304
Query: 301 GVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYT----CFQYSESV 356
G ++DSGT ++ Y L ++ + + E++ C+
Sbjct: 305 TATGRGGVVVDSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHSVFDACYDLRGRP 364
Query: 357 DEGFPNVTFHFENSVSLKVYPHEYLFPFE 385
P + HF + + P Y P +
Sbjct: 365 AASAPLIVLHFAGGADMALPPENYFLPVD 393
>gi|302764208|ref|XP_002965525.1| hypothetical protein SELMODRAFT_406966 [Selaginella moellendorffii]
gi|300166339|gb|EFJ32945.1| hypothetical protein SELMODRAFT_406966 [Selaginella moellendorffii]
Length = 464
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 84/332 (25%), Positives = 137/332 (41%), Gaps = 48/332 (14%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKF 133
G+YY+ I +G+PPKD+ + +DTGSD+ WV C C P SS +D S+T K
Sbjct: 122 GVYYSSITLGSPPKDFSLVMDTGSDLTWVRCDPCS--PDCSST------FDRLASNTYKA 173
Query: 134 VTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTST 193
+TC + P L +G ++D ++ + D
Sbjct: 174 LTCADDL----------------RLPVLLRLWRRLFHSGRSLRDTLKMAGAASD--ELEE 215
Query: 194 NGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL----- 248
+FGCG+ G + GI+ + S SQ+ G + F++CL
Sbjct: 216 FPGFVFGCGSLLKGLISGEV-----GILALSPGSLSFPSQIGEKYGNK--FSYCLLRQTA 268
Query: 249 -DGINGGGIF---AIGHVVQP------EVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTD 298
+ + + A + +P E+ TP+ + +Y++ + + VG L+L
Sbjct: 269 QNSLKKSPMVFGEAAVELKEPGSGKPQELQYTPIGESSIYYTVRLDGISVGNQRLDLSPS 328
Query: 299 VFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDE 358
F G +K TI DSGTTL LP V + + + S + + CF+ S +
Sbjct: 329 TFLNGQDKPTIFDSGTTLTMLPSGVCDSIKQSLASMVSGAEFVAIKGLDACFRVPPSSGQ 388
Query: 359 GFPNVTFHFENSVSLKVYPHEYLFPFEDLWCI 390
G P++TFHF P Y+ L C+
Sbjct: 389 GLPDITFHFNGGADFVTRPSNYVIDLGSLQCL 420
>gi|255541790|ref|XP_002511959.1| protein with unknown function [Ricinus communis]
gi|223549139|gb|EEF50628.1| protein with unknown function [Ricinus communis]
Length = 583
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 83/283 (29%), Positives = 129/283 (45%), Gaps = 25/283 (8%)
Query: 58 AGVD----LPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNC-IQCKECPR 112
A VD P+ G+ PDG LY+ I +G PP+ YY+ +DT SD+ W+ C C C +
Sbjct: 188 AAVDSSSVFPVRGNVYPDG--LYFTYILVGNPPRPYYLDIDTASDLTWIQCDAPCTSCAK 245
Query: 113 RSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTD-CTANTSCPYLEIYGDGSSTT 171
++ LY + + VT C ++ C C Y Y D SS+
Sbjct: 246 GAN-----ALYKPRRDN---IVTPKDSLCVELHRNQKAGYCETCQQCDYEIEYADHSSSM 297
Query: 172 GYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMI 231
G +D + +G +STN FGC Q G L +T + DGI+G K+ S+
Sbjct: 298 GVLARDELHLTMANG----SSTNLKFNFGCAYDQQGLLLNTLVKT-DGILGLSKAKVSLP 352
Query: 232 SQLASSGGVRKMFAHCL-DGINGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQV-G 289
SQLA+ G + + HCL + + GGG +G P + VP SI+ Q+
Sbjct: 353 SQLANRGIINNVVGHCLANDVVGGGYMFLGDDFVPRWGMS-WVPMLDSPSIDSYQTQIMK 411
Query: 290 LDFLNLPTDVFGVGDN-KGTIIDSGTTLAYLPEMVYEPLVSKI 331
L++ + P + G + + DSG++ Y + Y LV+ +
Sbjct: 412 LNYGSGPLSLGGQERRVRRIVFDSGSSYTYFTKEAYSELVASL 454
>gi|302817380|ref|XP_002990366.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
gi|300141928|gb|EFJ08635.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
Length = 420
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 85/333 (25%), Positives = 138/333 (41%), Gaps = 37/333 (11%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y+A+IG+GTP + Y+ DTGSD+ W+ C C++C R+ + +++ SS+
Sbjct: 77 GSGDYFARIGVGTPARSVYMVADTGSDVSWLQCSPCRKCYRQ-----QDPIFNPSLSSSF 131
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
K + C C + + C+ C Y YGDGS T G F + + + +
Sbjct: 132 KPLACASSICGKLK---IKGCSRKNECMYQVSYGDGSFTVGDFSTETLSFGE-------- 180
Query: 192 STNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL--- 248
S+ GCG G G+ S SQ +S +F++CL
Sbjct: 181 HAVRSVAMGCGRNNQGLFHGAAGLLGL-----GRGPLSFPSQTGTS--YASVFSYCLPRR 233
Query: 249 -DGINGGGIFAIGHVVQPEVNK-TPLVPNQ---PHYSINMTAVQVGLDFLNLPTDVFGVG 303
I +F G PE + T L+PN+ +Y + + ++V +N+P D F +G
Sbjct: 234 ESAIAASLVF--GPSAVPEKARFTKLLPNRRLDTYYYVGLARIRVAGSPVNIPPDAFAMG 291
Query: 304 DN--KGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFP 361
G I+DSGT ++ L Y L S + TC+ S P
Sbjct: 292 SRGTGGVIVDSGTAISRLTTPAYTALRDAFRSLVTFPSAPGISLFDTCYDLSSMKTATLP 351
Query: 362 NVTFHFENSVSLKVYPHEYLFPFED--LWCIGW 392
V F+ S+ + L +D +C+ +
Sbjct: 352 AVVLDFDGGASMPLPADGILVNVDDEGTYCLAF 384
>gi|302806531|ref|XP_002985015.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
gi|300147225|gb|EFJ13890.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
Length = 533
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 94/360 (26%), Positives = 147/360 (40%), Gaps = 72/360 (20%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y+ + +G PP+ + + +DTGSD+ W+ C CK C +S ++D S++
Sbjct: 167 GAGEYFMDVFVGNPPRHFLLIIDTGSDLTWLQCKPCKACFDQSG-----PVFDPSQSTSF 221
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTS---CPYLEIYGDGSSTTGYFVQDVVQYDKVSGDL 188
K + C+ C V D ++ TS C Y YGD S T SGDL
Sbjct: 222 KIIPCNAAACDLVVHDECRDNSSKTSPKTCKYFYWYGDSSRT--------------SGDL 267
Query: 189 QTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNS-----------------SMI 231
S + SL + ++ E D +IG G SN S
Sbjct: 268 ALESLSVSL----------SDHPSSLEIRDMVIGCGHSNKGLFQGAGGLLGLGQGALSFP 317
Query: 232 SQLASSGGVRKMFAHCL----------DGINGGGIFAIGHVVQPEVNKTPLVPN----QP 277
SQL SS + + F++CL I+ G FA+ ++ TP V +
Sbjct: 318 SQLRSS-PIGQSFSYCLVDRTNNLSVSSAISFGAGFALSRHFD-QMRFTPFVRTNNSVET 375
Query: 278 HYSINMTAVQVGLDFLNLPTDVFGVGDN--KGTIIDSGTTLAYLPEMVYEPLVSKIISQQ 335
Y + + +++ + L +P + F + N GTIIDSGTTL YL Y + S +++
Sbjct: 376 FYYLGIQGIKIDQELLPIPAERFAIAPNGSGGTIIDSGTTLTYLNRDAYRAVESAFLARI 435
Query: 336 PDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLF----PFEDLWCIG 391
+ C+ + FP ++ F+N L + P E F P E C+
Sbjct: 436 SYPRADPFDILGICYNATGRTAVPFPTLSIVFQNGAELDL-PQENYFIQPDPQEAKHCLA 494
>gi|302809015|ref|XP_002986201.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
gi|300146060|gb|EFJ12732.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
Length = 449
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 94/360 (26%), Positives = 147/360 (40%), Gaps = 72/360 (20%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y+ + +G PP+ + + +DTGSD+ W+ C CK C +S ++D S++
Sbjct: 83 GAGEYFMDVFVGNPPRHFLLIIDTGSDLTWLQCKPCKACFDQSG-----PVFDPSQSTSF 137
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTS---CPYLEIYGDGSSTTGYFVQDVVQYDKVSGDL 188
K + C+ C V D ++ TS C Y YGD S T SGDL
Sbjct: 138 KIIPCNAAACDLVVHDECRDNSSKTSPKTCKYFYWYGDSSRT--------------SGDL 183
Query: 189 QTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNS-----------------SMI 231
S + SL + ++ E D +IG G SN S
Sbjct: 184 ALESLSVSL----------SDHPSSLEIRDMVIGCGHSNKGLFQGAGGLLGLGQGALSFP 233
Query: 232 SQLASSGGVRKMFAHCL----------DGINGGGIFAIGHVVQPEVNKTPLVPN----QP 277
SQL SS + + F++CL I+ G FA+ ++ TP V +
Sbjct: 234 SQLRSS-PIGQSFSYCLVDRTNNLSVSSAISFGAGFALSRHFD-QMKFTPFVRTNNSVET 291
Query: 278 HYSINMTAVQVGLDFLNLPTDVFGVGDN--KGTIIDSGTTLAYLPEMVYEPLVSKIISQQ 335
Y + + +++ + L +P + F + N GTIIDSGTTL YL Y + S +++
Sbjct: 292 FYYLGIQGIKIDQELLPIPAERFAIATNGSGGTIIDSGTTLTYLNRDAYRAVESAFLARI 351
Query: 336 PDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLF----PFEDLWCIG 391
+ C+ + FP ++ F+N L + P E F P E C+
Sbjct: 352 SYPRADPFDILGICYNATGRAAVPFPALSIVFQNGAELDL-PQENYFIQPDPQEAKHCLA 410
>gi|88174577|gb|ABD39363.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
Group]
Length = 321
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 96/343 (27%), Positives = 149/343 (43%), Gaps = 58/343 (16%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKEC---PRRSSLGIELTLYDIKDSSTGK 132
Y +G+GTP K V++DTGS I WV C +C C PR T + ++ K
Sbjct: 1 YVTSVGLGTPAKTQIVEIDTGSSISWVFC-ECDGCHTNPR--------TFLQSRSTTCAK 51
Query: 133 FVTCDQEFCHGVYGGPLTDCTANTS---CPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQ 189
V+C C + GG C + + CP+ Y DGS++ G QD + + D+Q
Sbjct: 52 -VSCGTSMC--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF----SDVQ 104
Query: 190 TTSTNGSLIFGCGARQSGNLDS--TNEEA-LDGIIGFGKSNSSMISQLASSGGVRKMFAH 246
S FGC NLDS NE +DG++G G S++ Q S F++
Sbjct: 105 KIP---SFTFGC------NLDSFGANEFGNVDGLLGMGAGPMSVLKQ---SSPTFDGFSY 152
Query: 247 CL------DGI--NGGGIFAIGHVV-QPEVNKTPLVP---NQPHYSINMTAVQVGLDFLN 294
CL G G F++G V + +V T +V N + +++ A+ V + L
Sbjct: 153 CLPLQKSERGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLG 212
Query: 295 LPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSE 354
L +F KG + DSG+ L+Y+P+ L +I E C+
Sbjct: 213 LSPSIFS---RKGVVFDSGSELSYIPDRALSVLSQRIRELLLRRGAAEEESERNCYDM-R 268
Query: 355 SVDEG-FPNVTFHFENSVSLKVYPH----EYLFPFEDLWCIGW 392
SVDEG P ++ HF++ + E +D+WC+ +
Sbjct: 269 SVDEGDMPAISLHFDDGARFDLGSSGVFVERSVQEQDVWCLAF 311
>gi|359473000|ref|XP_002278677.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 458
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 97/348 (27%), Positives = 149/348 (42%), Gaps = 40/348 (11%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y+ + +GTPP+ + DTGSD++WV C C+ C R + L + S+T
Sbjct: 85 GSGQYFVDLRLGTPPQKLLLVADTGSDLVWVKCSACRNCTRHTPGSAFLA----RHSTTF 140
Query: 132 KFVTCDQEFCHGVYGGPLTDCT---ANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDL 188
C C V C ++ C Y YGDGS T+G+F ++ + SG
Sbjct: 141 SPNHCYDSACQLVPLPKHHRCNHARLHSPCRYEYSYGDGSKTSGFFSKETTTLNTSSG-- 198
Query: 189 QTTSTNGSLIFGCGARQSG-NLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHC 247
+ G + FGC R SG ++ + G++G G+ S+ SQL G + F++C
Sbjct: 199 REAKLKG-IAFGCAFRISGPSVSGASFNGAHGVMGLGRGPISLSSQLGHRFGNK--FSYC 255
Query: 248 LD----GINGGGIFAIGHV---VQP--------EVNKTPLVPNQPHYSINMTAVQVGLDF 292
L + IG V P ++ PL P Y I + +V V D
Sbjct: 256 LMDHDISPSPTSYLLIGSTQNDVAPGKRRMRFTPLHINPLSPT--FYYIGIESVSV--DG 311
Query: 293 LNLPTD--VFGVGD--NKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDL--KVHTVHDE 346
+ LP + V+ + + N GTI+DSGTTL +LPE Y +++ +I ++ L
Sbjct: 312 IKLPINPSVWALDELGNGGTIVDSGTTLTFLPEPAYLQILT-VIKRRVRLPSPAEPTPGF 370
Query: 347 YTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPF-EDLWCIGWQ 393
C SE P ++F P Y ED+ C+ Q
Sbjct: 371 DLCVNVSEIEHPRLPKLSFKLGGDSVFSPPPRNYFVDTDEDVKCLALQ 418
>gi|357162717|ref|XP_003579500.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 488
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 84/293 (28%), Positives = 129/293 (44%), Gaps = 43/293 (14%)
Query: 70 PDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCI---QCKECPRRSSLGIELTLYDIK 126
P G Y + +GTPP+ V +DTGS + WV C QC+ C S + ++ K
Sbjct: 85 PHSYGGYAFSVSLGTPPQPLPVLLDTGSHLSWVPCTSSYQCRNCSSSPSAMSAMAVFHPK 144
Query: 127 DSSTGKFVTCDQEFCHGVYGGPLTDC-------TANTSCPYLEIYGDGSSTTGYFVQDVV 179
+SS+ + V C C ++ + C + PYL +YG GS T+G + D +
Sbjct: 145 NSSSSRLVGCRNPACRWIHSKSPSTCGSTGNNGNGDVCPPYLVVYGSGS-TSGLLISDTL 203
Query: 180 QYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGG 239
+ S + GC ++ S ++ G+ GFG+ S+ SQL
Sbjct: 204 RLSPSSSSSAPAPFR-NFAIGC------SIVSVHQPP-SGLAGFGRGAPSVPSQLK---- 251
Query: 240 VRKMFAHCL------DGINGGGIFAIGHVVQPEVNK------TPLVPN---QPHYSI--- 281
V K F++CL D G +G + P K PL+ N +P YS+
Sbjct: 252 VPK-FSYCLLSRRFDDNSAVSGELVLGDAMVPAGKKKTTMQYVPLLNNAASKPPYSVYYY 310
Query: 282 -NMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIIS 333
+T + VG +NLP+ F G IIDSGTT YL V++P+ + + S
Sbjct: 311 LALTGISVGGKPVNLPSRAFVPSSGGGAIIDSGTTFTYLDPTVFKPVAAAMES 363
>gi|50508275|dbj|BAD32124.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
Length = 451
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 87/289 (30%), Positives = 122/289 (42%), Gaps = 45/289 (15%)
Query: 71 DGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSST 130
+ G Y + IGTPP + V DTGS ++W C C EC R + + SST
Sbjct: 85 NSAGAYNMNLSIGTPPVTFSVLADTGSSLIWTQCAPCTECAARPA-----PPFQPASSST 139
Query: 131 GKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQT 190
+ C C P C A T C Y YG G T GY + + S
Sbjct: 140 FSKLPCASSLCQ-FLTSPYLTCNA-TGCVYYYPYGMG-FTAGYLATETLHVGGASFP--- 193
Query: 191 TSTNGSLIFGCGARQS-GNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLD 249
+ FGC GN S GI+G G+S S++SQ+ GV + F++CL
Sbjct: 194 -----GVAFGCSTENGVGNSSS-------GIVGLGRSPLSLVSQV----GVGR-FSYCLR 236
Query: 250 GINGGG----IF-AIGHVVQPEVNKTPLV-----PNQPHYSINMTAVQVGLDFLNLPTDV 299
G +F ++ V V TPL+ P+ +Y +N+T + VG L + +
Sbjct: 237 SDADAGDSPILFGSLAKVTGGNVQSTPLLENPEMPSSSYYYVNLTGITVGATDLPVTSTT 296
Query: 300 F------GVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHT 342
F G G GTI+DSGTTL YL + Y + +SQ + T
Sbjct: 297 FGFTRGAGAGLVGGTIVDSGTTLTYLVKEGYAMVKRAFLSQMATANLTT 345
>gi|357464807|ref|XP_003602685.1| Aspartic proteinase Asp1 [Medicago truncatula]
gi|355491733|gb|AES72936.1| Aspartic proteinase Asp1 [Medicago truncatula]
Length = 440
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 80/285 (28%), Positives = 117/285 (41%), Gaps = 30/285 (10%)
Query: 58 AGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNC-IQCKECPRRSSL 116
+ V P+ G+ P VG Y I IG PP+ Y++ +DTGSD+ W+ C C C +
Sbjct: 69 SSVVFPVHGNVYP--VGFYNVTINIGYPPRPYFLDIDTGSDLTWLQCDAPCSRCSQTPH- 125
Query: 117 GIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQ 176
LY + V C C V+ +C C Y Y D S+ G V
Sbjct: 126 ----PLY----RPSNDLVPCRHPLCASVHQTDNYECEVEHQCDYEVEYADHYSSLGVLVN 177
Query: 177 DVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLAS 236
DV + +G + GCG Q ++ +DG++G G+ SS+ISQL
Sbjct: 178 DVYVLNFTNG----VQLKVRMALGCGYDQI--FPDSSYHPVDGMLGLGRGKSSLISQLNG 231
Query: 237 SGGVRKMFAHCLDGINGGGIFAIGHVVQPEVNKTPLVP-NQPHYSINMTAVQVGLDFLNL 295
G VR + HCL GG IF + TP+ + HYS + +G
Sbjct: 232 QGLVRNVVGHCLSAQGGGYIFFGDVYDSSRLAWTPMSSRDYKHYSAGAAELVLG------ 285
Query: 296 PTDVFGVGDNKGTIIDSGTTLAYLPEMVYE---PLVSKIISQQPD 337
G G N + D+G++ Y Y+ L K I + P+
Sbjct: 286 -GKRTGFG-NLLAVFDAGSSYTYFNSNAYQLTKELAGKPIKEAPE 328
>gi|218188634|gb|EEC71061.1| hypothetical protein OsI_02803 [Oryza sativa Indica Group]
Length = 479
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 86/316 (27%), Positives = 128/316 (40%), Gaps = 32/316 (10%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIEL-TLYDIKDSSTGKFV 134
Y +G+G+P V +DTGSD+ WV QC+ CP S L+D SST
Sbjct: 135 YVISVGLGSPAMTQRVVIDTGSDVSWV---QCEPCPAPSPCHAHAGALFDPAASSTYAAF 191
Query: 135 TCDQEFCHGVY-GGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTST 193
C C + G C A + C Y+ YGDGS+TTG + DV+ L +
Sbjct: 192 NCSAAACAQLGDSGEANGCDAKSRCQYIVKYGDGSNTTGTYSSDVLT-------LSGSDV 244
Query: 194 NGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGING 253
FGC L + ++ DG+IG G S++SQ A+ G K F++CL
Sbjct: 245 VRGFQFGC---SHAELGAGMDDKTDGLIGLGGDAQSLVSQTAARYG--KSFSYCLPATPA 299
Query: 254 GGIF-------AIGHVVQPEVNKTPLVPNQP---HYSINMTAVQVGLDFLNLPTDVFGVG 303
F + G TP++ ++ +Y + + VG L L VF
Sbjct: 300 SSGFLTLGAPASGGGGGASRFATTPMLRSKKVPTYYFAALEDIAVGGKKLGLSPSVFAA- 358
Query: 304 DNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPD-LKVHTVHDEYTCFQYSESVDEGFPN 362
G+++DSGT + LP Y L S + + + TCF ++ P
Sbjct: 359 ---GSLVDSGTVITRLPPAAYAALSSAFRAGMTRYARAEPLGILDTCFNFTGLDKVSIPT 415
Query: 363 VTFHFENSVSLKVYPH 378
V F + + H
Sbjct: 416 VALVFAGGAVVDLDAH 431
>gi|115448353|ref|NP_001047956.1| Os02g0720900 [Oryza sativa Japonica Group]
gi|45735844|dbj|BAD12879.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735970|dbj|BAD12999.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|113537487|dbj|BAF09870.1| Os02g0720900 [Oryza sativa Japonica Group]
gi|125540930|gb|EAY87325.1| hypothetical protein OsI_08729 [Oryza sativa Indica Group]
gi|215692622|dbj|BAG88042.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 458
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 101/357 (28%), Positives = 156/357 (43%), Gaps = 39/357 (10%)
Query: 39 ERSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSD 98
R+ SL + DA LA V L G S GVG Y ++G+GTP Y + VDTGS
Sbjct: 89 ARATSLDADADAGLAGS-LASVPLSPGASV---GVGNYVTRMGLGTPATQYVMVVDTGSS 144
Query: 99 IMWVNCIQC-KECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPL--TDCTAN 155
+ W+ C C C R+S +++ K SST V C + C + L + C+++
Sbjct: 145 LTWLQCSPCLVSCHRQSG-----PVFNPKSSSTYASVGCSAQQCSDLPSATLNPSACSSS 199
Query: 156 TSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEE 215
C Y YGD S + GY +D V + S + +GCG G +
Sbjct: 200 NVCIYQASYGDSSFSVGYLSKDTVSFGSTSLP--------NFYYGCGQDNEGLFGRSA-- 249
Query: 216 ALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHVVQPEVNKTPLVPN 275
G+IG ++ S++ QLA S G F +CL + G ++G + + TP+V +
Sbjct: 250 ---GLIGLARNKLSLLYQLAPSLGYS--FTYCLPSSSSSGYLSLGSYNPGQYSYTPMVSS 304
Query: 276 Q---PHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPL---VS 329
Y I ++ + V + L + + TIIDSGT + LP VY L V+
Sbjct: 305 SLDDSLYFIKLSGMTVAGNPL---SVSSSAYSSLPTIIDSGTVITRLPTSVYSALSKAVA 361
Query: 330 KIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFED 386
+ +++ D TCF+ ++ P VT F +LK+ L +D
Sbjct: 362 AAMKGTSRASAYSILD--TCFK-GQASRVSAPAVTMSFAGGAALKLSAQNLLVDVDD 415
>gi|88174583|gb|ABD39366.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
Group]
Length = 321
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 96/343 (27%), Positives = 149/343 (43%), Gaps = 58/343 (16%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKEC---PRRSSLGIELTLYDIKDSSTGK 132
Y +G+GTP K V++DTGS WV C +C C PR T + ++ K
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSTSWVFC-ECDGCHTNPR--------TFLQSRSTTCAK 51
Query: 133 FVTCDQEFCHGVYGGPLTDCTANTS---CPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQ 189
V+C C + GG C + + CP+ Y DGS++ G QD + + D+Q
Sbjct: 52 -VSCGTSMC--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF----SDVQ 104
Query: 190 TTSTNGSLIFGCGARQSGNLDS--TNEEA-LDGIIGFGKSNSSMISQLASSGGVRKMFAH 246
S FGC NLDS NE +DG++G G S++ Q S F++
Sbjct: 105 KIP---SFTFGC------NLDSFGANEFGNVDGLLGMGAGPMSVLKQ---SSPRFDGFSY 152
Query: 247 CL------DGI--NGGGIFAIGHVV-QPEVNKTPLVP---NQPHYSINMTAVQVGLDFLN 294
CL G G F++G V + +V T +V N + +++ A+ V + L
Sbjct: 153 CLPLQKSERGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLG 212
Query: 295 LPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSE 354
L +F KG + DSG+ L+Y+P+ L +I E C+
Sbjct: 213 LSPSIFS---RKGVVFDSGSELSYIPDRALSVLSQRIRELLLRRGAAEEESERNCYDM-R 268
Query: 355 SVDEG-FPNVTFHFENSVSLKVYPH----EYLFPFEDLWCIGW 392
SVDEG P ++ HF++ + H E +D+WC+ +
Sbjct: 269 SVDEGDMPAISLHFDDGARFDLGSHGVFVERSVQEQDVWCLAF 311
>gi|308813706|ref|XP_003084159.1| Aspartyl protease (ISS) [Ostreococcus tauri]
gi|116056042|emb|CAL58575.1| Aspartyl protease (ISS) [Ostreococcus tauri]
Length = 478
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 84/321 (26%), Positives = 144/321 (44%), Gaps = 49/321 (15%)
Query: 93 VDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDC 152
VDTGS ++ C C C + YD S+ V C C G+ G C
Sbjct: 51 VDTGSSRTYLPCKGCASCGAHEAG----RYYDYDASADFSRVECSA--CAGIGG----KC 100
Query: 153 TANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDST 212
+ C Y Y +GS + GY V+DVV L + N +++FGC R+ L S
Sbjct: 101 GTSGVCRYDVHYLEGSGSEGYLVRDVVS-------LGGSVGNATVVFGCEERE---LGSI 150
Query: 213 NEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGING------GGIFAIGH----V 262
+++ DG+ GFG+ ++ +QLAS+ + +F+ C++G GG+ +G+
Sbjct: 151 KQQSADGLFGFGRQAYALRAQLASASVIDDLFSMCVEGYEKLSGEHVGGLLTLGNFDFGA 210
Query: 263 VQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEM 322
P + TP+V + +Y + T+ +G + V TIIDSGT+ Y+P
Sbjct: 211 DAPALVYTPMVSSAMYYQVTTTSWTLGNSVVEGSRGVL-------TIIDSGTSYTYVPGN 263
Query: 323 VYEPL--VSKIISQQPDLKVHTVHDEY--TCFQYS-----ESVDEGFPNVTFHFENSVSL 373
++ +++ +++ L+ ++Y CF S +V E FP + + S L
Sbjct: 264 MHARFLQLAEDAARESGLEKVAPPEDYPDLCFGNSGGLGWSTVSEYFPALKIEYHGSARL 323
Query: 374 KVYPHEYLFPFE---DLWCIG 391
+ P YL+ + +C+G
Sbjct: 324 TLSPETYLYWHQKNASAFCVG 344
>gi|125555056|gb|EAZ00662.1| hypothetical protein OsI_22683 [Oryza sativa Indica Group]
Length = 491
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 94/332 (28%), Positives = 146/332 (43%), Gaps = 43/332 (12%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKEC----PRRSSLGIELTLYDIKDSSTG 131
+ +G+GTP + + DTGSD+ WV C C P++ L+D SST
Sbjct: 149 FVVAVGLGTPAQPSALIFDTGSDLSWVQCQPCGSSGHCHPQQDP------LFDPSKSSTY 202
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
V C + C G D NT+C YL YGDGSSTTG +D + L ++
Sbjct: 203 AAVHCGEPQCAAAGGLCSED---NTTCLYLVHYGDGSSTTGVLSRDTLA-------LTSS 252
Query: 192 STNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI 251
FGCG R G+ +DG++G G+ S+ SQ A+S G +F++CL
Sbjct: 253 RALAGFPFGCGTRNLGDFGR-----VDGLLGLGRGELSLPSQAAASFGA--VFSYCLPSS 305
Query: 252 NG-GGIFAIGHVVQPEVN--------KTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGV 302
N G IG + + P P+ Y + + ++ +G L +P VF
Sbjct: 306 NSTTGYLTIGATPATDTGAAQYTAMLRKPQFPS--FYFVELVSIDIGGYILPVPPAVFTR 363
Query: 303 GDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEY-TCFQYSESVDEGFP 361
G GT++DSGT L YLP YE L + +D C+ ++ + P
Sbjct: 364 G---GTLLDSGTVLTYLPAQAYELLRDRFRLTMERYTPAPPNDVLDACYDFAGESEVIVP 420
Query: 362 NVTFHFENSVSLKV-YPHEYLFPFEDLWCIGW 392
V+F F + ++ + +F E++ C+ +
Sbjct: 421 AVSFRFGDGAVFELDFFGVMIFLDENVGCLAF 452
>gi|449464952|ref|XP_004150193.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
gi|449526850|ref|XP_004170426.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 476
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 87/336 (25%), Positives = 133/336 (39%), Gaps = 35/336 (10%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y+ +IG+G+PP+ YV +D+GSDI+WV C C EC ++S ++D S+T
Sbjct: 133 GSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSD-----PVFDPAGSATY 187
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
++CD C + D C Y YGDGS T G + + + +V
Sbjct: 188 AGISCDSSVCDRLDNAGCND----GRCRYEVSYGDGSYTRGTLALETLTFGRV------- 236
Query: 192 STNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL--D 249
++ GCG G G S + QL G F++CL
Sbjct: 237 -LIRNIAIGCGHMNRGMFIGAAGLLGL-----GGGAMSFVGQLGGQTG--GAFSYCLVSR 288
Query: 250 GINGGGIFAIGHVVQP-EVNKTPLV--PNQPHYSINMTAVQVGLDF-LNLPTDVFGVGD- 304
G G G P PL+ P P + + + +P +F + D
Sbjct: 289 GTESTGTLEFGRGAMPVGAAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDL 348
Query: 305 -NKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDL-KVHTVHDEYTCFQYSESVDEGFPN 362
G ++D+GT + LP YE I Q +L + V TC+ + V P
Sbjct: 349 GYGGVVMDTGTAVTRLPAPAYEAFRDTFIGQTANLPRSDRVSIFDTCYNLNGFVSVRVPT 408
Query: 363 VTFHFENSVSLKVYPHEYLFPF--EDLWCIGWQNSG 396
V+F+F L + +L P E +C + S
Sbjct: 409 VSFYFSGGPILTLPARNFLIPVDGEGTFCFAFAASA 444
>gi|115467014|ref|NP_001057106.1| Os06g0209100 [Oryza sativa Japonica Group]
gi|51091210|dbj|BAD35903.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113595146|dbj|BAF19020.1| Os06g0209100 [Oryza sativa Japonica Group]
gi|125554496|gb|EAZ00102.1| hypothetical protein OsI_22105 [Oryza sativa Indica Group]
Length = 454
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 81/282 (28%), Positives = 120/282 (42%), Gaps = 47/282 (16%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y + +GTPP+ + +DTGSD++W C C +C + + + D SST +
Sbjct: 90 YLMHVSVGTPPRPVALTLDTGSDLVWTQCAPCLDCFEQGAAPV----LDPAASSTHAALP 145
Query: 136 CDQEFCHGVYGGPLTDCTANT----SCPYLEIYGDGSSTTGYFVQDVVQY--DKVSGDLQ 189
CD C + P T C + SC Y+ YGD S T G D + D +G L
Sbjct: 146 CDAPLCRAL---PFTSCGGRSWGDRSCVYVYHYGDRSLTVGQLATDSFTFGGDDNAGGLA 202
Query: 190 TTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLD 249
+ FGCG G + NE GI GFG+ S+ SQL + F++C
Sbjct: 203 AR----RVTFGCGHINKGIFQA-NET---GIAGFGRGRWSLPSQLNVTS-----FSYCFT 249
Query: 250 -------------GINGGGIFAIGHVVQP-EVNKTPLV--PNQPH-YSINMTAVQVGLDF 292
G + H +V T L+ P+QP Y + + + VG
Sbjct: 250 SMFDTKSSSVVTLGAAAAELLHTHHAAHTGDVRTTRLIKNPSQPSLYFVPLRGISVGGAR 309
Query: 293 LNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQ 334
+ +P TIIDSG ++ LPE VYE + ++ +SQ
Sbjct: 310 VAVPESRL----RSSTIIDSGASITTLPEDVYEAVKAEFVSQ 347
>gi|242058093|ref|XP_002458192.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
gi|241930167|gb|EES03312.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
Length = 468
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 89/341 (26%), Positives = 139/341 (40%), Gaps = 50/341 (14%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y +G+GTP + +DTGSD+ WV QC+ C + + L+D SST +
Sbjct: 124 YVVTVGLGTPSVSQVLLIDTGSDLSWV---QCQPCNSTTCYPQKDPLFDPSKSSTYAPIP 180
Query: 136 CDQEFCHGV----YGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
C+ + C + YGG C + YGDGS T G + + + L
Sbjct: 181 CNTDACRDLTDDGYGGGCASGDGAAQCGFAITYGDGSQTRGVYSNETLA-------LAPG 233
Query: 192 STNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI 251
FGCG Q G D DG++G G + S++ Q AS G F++CL +
Sbjct: 234 VAVKDFRFGCGHDQDGANDK-----YDGLLGLGGAPESLVVQTASVYG--GAFSYCLPAL 286
Query: 252 NG---------------GGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLP 296
N G + G V P + + + Y +NMT + VG + +++P
Sbjct: 287 NNQVGFLALGGGGAPSGGVVNTSGFVFTPMIRE-----EETFYVVNMTGITVGGEPIDVP 341
Query: 297 TDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESV 356
F + G IIDSGT + L Y L + + + TC+ +S
Sbjct: 342 PSAF----SGGMIIDSGTVVTELQHTAYNALQAAFRKAMAAYPLVRNGELDTCYDFSGYS 397
Query: 357 DEGFPNVTFHFENSVSLKV-YPHEYLFPFEDLWCIGWQNSG 396
+ P V F ++ + P+ L +D C+ +Q SG
Sbjct: 398 NVTLPKVALTFSGGATIDLDVPNGIL--LDD--CLAFQESG 434
>gi|145523035|ref|XP_001447356.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124414867|emb|CAK79959.1| unnamed protein product [Paramecium tetraurelia]
Length = 548
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 88/347 (25%), Positives = 145/347 (41%), Gaps = 55/347 (15%)
Query: 73 VGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGK 132
+G YY I IG + V VDTGS +NC QC +C + + + S
Sbjct: 41 LGYYYMNIYIGENMTKHSVIVDTGSQATTINCNQCHQCGQHQNPPYSFNEKNYNSSDLRI 100
Query: 133 FVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQD-------VVQYDKVS 185
C N C + Y +GSS G++ +D ++Q D
Sbjct: 101 DFNC--------------SSFENDRCNFASYYVEGSSIAGFYFKDKVLIGDGLIQLD--- 143
Query: 186 GDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNS------SMISQLASSG- 238
D + I GC ++G L ++ DGI G N+ S+I +A
Sbjct: 144 -DRYIEQESFESILGCTQFETGQL---YQQMADGIFGLAPINNHSQYPPSLIDFIAKKDK 199
Query: 239 --GVRKMFAHCLDG----INGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDF 292
+++ F+ CL+ I+ GG + ++NK P Q Y +N+T + G
Sbjct: 200 ALSLKRRFSICLNDDYGYISVGGYDLLRQDPDFKINKIKFKPTQ-QYQVNLTKIAFGDQT 258
Query: 293 LNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKI-----ISQQPDLKVHTVHDEY 347
+ ++ G +GT IDSG T++Y+ +Y LV I +++ P + T+
Sbjct: 259 FTVNNKIYTGG--QGTFIDSGATISYMDREIYSQLVQSIKDHFELNKAP---ITTILQSQ 313
Query: 348 TCFQYSESVDEG---FPNVTFHFENSVSLKVYPHEYLFPFEDLWCIG 391
CF++++ V + FP + F F++ V + P EYL E+ CIG
Sbjct: 314 VCFKFTQDVLDQYSYFPTIKFIFDDDVEIYWKPQEYLNIQENQVCIG 360
>gi|302795261|ref|XP_002979394.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
gi|300153162|gb|EFJ19802.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
Length = 353
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 85/333 (25%), Positives = 138/333 (41%), Gaps = 37/333 (11%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y+A+IG+GTP + Y+ DTGSD+ W+ C C++C R+ + +++ SS+
Sbjct: 10 GSGDYFARIGVGTPARSVYMVADTGSDVSWLQCSPCRKCYRQ-----QDPIFNPSLSSSF 64
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
K + C C + + C+ C Y YGDGS T G F + + + +
Sbjct: 65 KPLACASSICGKLK---IKGCSRKNKCMYQVSYGDGSFTVGDFSTETLSFGE-------- 113
Query: 192 STNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL--- 248
S+ GCG G G+ S SQ +S +F++CL
Sbjct: 114 HAVRSVAMGCGRNNQGLFHGAAGLLGL-----GRGPLSFPSQTGTS--YASVFSYCLPRR 166
Query: 249 -DGINGGGIFAIGHVVQPEVNK-TPLVPNQ---PHYSINMTAVQVGLDFLNLPTDVFGVG 303
I +F G PE + T L+PN+ +Y + + ++V +N+P D F +G
Sbjct: 167 ESAIAASLVF--GPSAVPEKARFTKLLPNRRLDTYYYVGLARIRVAGSPVNIPPDAFAMG 224
Query: 304 DN--KGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFP 361
G I+DSGT ++ L Y L S + TC+ S P
Sbjct: 225 SRGTGGVIVDSGTAISRLTTPAYTALRDAFRSLVTFPSAPGISLFDTCYDLSSMKTATLP 284
Query: 362 NVTFHFENSVSLKVYPHEYLFPFED--LWCIGW 392
V F+ S+ + L +D +C+ +
Sbjct: 285 AVVLDFDGGASMPLPADGILVNVDDEGTYCLAF 317
>gi|15222611|ref|NP_173922.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12321511|gb|AAG50814.1|AC079281_16 hypothetical protein [Arabidopsis thaliana]
gi|20466516|gb|AAM20575.1| unknown protein [Arabidopsis thaliana]
gi|23198172|gb|AAN15613.1| unknown protein [Arabidopsis thaliana]
gi|110736960|dbj|BAF00436.1| hypothetical protein [Arabidopsis thaliana]
gi|332192515|gb|AEE30636.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 483
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 95/403 (23%), Positives = 165/403 (40%), Gaps = 63/403 (15%)
Query: 26 HGVFSVKYRYAGRERSLSLLK-EHDARRQQRILAGVDLPLGGSSRPD------------- 71
H SV+ +SL+L + D R + ++ +DL + S+ D
Sbjct: 72 HSRVSVRGTEHSDYKSLTLARLNRDTARVKSLITRLDLAINNISKADLKPISTMYTTEEQ 131
Query: 72 ------------GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIE 119
G G Y+ ++GIG P ++ Y+ +DTGSD+ W+ C C +C ++
Sbjct: 132 DIEAPLISGTTQGSGEYFTRVGIGKPAREVYMVLDTGSDVNWLQCTPCADCYHQTE---- 187
Query: 120 LTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVV 179
+++ SS+ + ++CD C+ + +++C N +C Y YGDGS T G F + +
Sbjct: 188 -PIFEPSSSSSYEPLSCDTPQCNAL---EVSEC-RNATCLYEVSYGDGSYTVGDFATETL 242
Query: 180 QYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGG 239
++ ++ GCG G G ++ SQL ++
Sbjct: 243 TIG--------STLVQNVAVGCGHSNEGLFVGAAGLLGL-----GGGLLALPSQLNTTS- 288
Query: 240 VRKMFAHCL--DGINGGGIFAIGHVVQPEVNKTPLVPNQ---PHYSINMTAVQVGLDFLN 294
F++CL + G + P+ PL+ N Y + +T + VG + L
Sbjct: 289 ----FSYCLVDRDSDSASTVDFGTSLSPDAVVAPLLRNHQLDTFYYLGLTGISVGGELLQ 344
Query: 295 LPTDVFGVGD--NKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDL-KVHTVHDEYTCFQ 351
+P F + + + G IIDSGT + L +Y L + DL K V TC+
Sbjct: 345 IPQSSFEMDESGSGGIIIDSGTAVTRLQTEIYNSLRDSFVKGTLDLEKAAGVAMFDTCYN 404
Query: 352 YSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFEDL--WCIGW 392
S P V FHF L + Y+ P + + +C+ +
Sbjct: 405 LSAKTTVEVPTVAFHFPGGKMLALPAKNYMIPVDSVGTFCLAF 447
>gi|88174593|gb|ABD39371.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 92/345 (26%), Positives = 148/345 (42%), Gaps = 62/345 (17%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKEC---PRRSSLGIELTLYDIKDSSTGK 132
Y +G+GTP K V++DTGS WV C +C C PR T + ++ K
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFC-ECDGCHTNPR--------TFLQSRSTTCAK 51
Query: 133 FVTCDQEFCHGVYGGPLTDCTANTS---CPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQ 189
V+C C + GG C + + CP+ Y DGS++ G QD + + D+Q
Sbjct: 52 -VSCGTSMC--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF----SDVQ 104
Query: 190 TTSTNGSLIFGC-----GARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMF 244
FGC GA + GN +DG++G G S++ Q S F
Sbjct: 105 KIP---GFSFGCNMDSFGANEFGN--------VDGLLGMGAGPMSVLKQ---SSPTFDCF 150
Query: 245 AHCLDGING--------GGIFAIGHVV-QPEVNKTPLVPNQPH---YSINMTAVQVGLDF 292
++CL G F++G V + +V T +V + + + +++ A+ V +
Sbjct: 151 SYCLPLQKSERGFFSKTTGYFSLGKVATRTDVRYTKMVARKKNTELFFVDLIAISVDGER 210
Query: 293 LNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQY 352
L L VF KG + DSG+ L+Y+P+ L +I E C+
Sbjct: 211 LGLSPSVFS---RKGVVFDSGSELSYIPDRALSVLSQRIRELLLKRGAAEEESERNCYDM 267
Query: 353 SESVDEG-FPNVTFHFENSVSLKVYPH----EYLFPFEDLWCIGW 392
SVDEG P ++ HF+++ + H E +D+WC+ +
Sbjct: 268 -RSVDEGDMPAISLHFDDAARFDLGSHGVFVERSVQEQDVWCLAF 311
>gi|297820186|ref|XP_002877976.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
lyrata]
gi|297323814|gb|EFH54235.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
lyrata]
Length = 425
Score = 95.9 bits (237), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 86/301 (28%), Positives = 127/301 (42%), Gaps = 39/301 (12%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y + IGTP + V +DT +D W+ C C C L+D SS+ + +
Sbjct: 88 YIVRANIGTPAQAMLVALDTSNDAAWIPCSGCVGCSSS-------VLFDPSKSSSSRTLQ 140
Query: 136 CDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNG 195
C+ C P CT + SC + YG GS+ Y QD + T
Sbjct: 141 CEAPQCK---QAPNPSCTVSKSCGFNMTYG-GSAIEAYLTQDTLTL--------ATDVIP 188
Query: 196 SLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL---DGIN 252
+ FGC + SG G++G G+ S+ISQ S + F++CL N
Sbjct: 189 NYTFGCINKASG-----TSLPAQGLMGLGRGPLSLISQ--SQNLYQSTFSYCLPNSKSSN 241
Query: 253 GGGIFAIGHVVQP-EVNKTPLVPNQPH---YSINMTAVQVGLDFLNLPTD--VFGVGDNK 306
G +G QP + TPL+ N Y +N+ ++VG +++PT F
Sbjct: 242 FSGSLRLGPKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGA 301
Query: 307 GTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFH 366
GTI DSGT L E Y + ++ + + ++ TC YS SV FP+VTF
Sbjct: 302 GTIFDSGTVYTRLVEPAYVAMRNEFRRRVKNANATSLGGFDTC--YSGSVV--FPSVTFM 357
Query: 367 F 367
F
Sbjct: 358 F 358
>gi|222624820|gb|EEE58952.1| hypothetical protein OsJ_10633 [Oryza sativa Japonica Group]
Length = 415
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 101/363 (27%), Positives = 147/363 (40%), Gaps = 78/363 (21%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y + IGTPP+ + +DTGSD++W C C C ++ L +D SST +
Sbjct: 89 YLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQA-----LPYFDPSTSSTLSLTS 143
Query: 136 CDQEFCHG--VYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTST 193
CD C G V P +D G G+S G
Sbjct: 144 CDSTLCQGLPVASLPRSD--------KFTFVGAGASVPG--------------------- 174
Query: 194 NGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGING 253
+ FGCG +G S NE GI GFG+ S+ SQL F+HC I G
Sbjct: 175 ---VAFGCGLFNNGVFKS-NET---GIAGFGRGPLSLPSQLKVGN-----FSHCFTTITG 222
Query: 254 -----------GGIFAIGHVVQPEVNKTPLVPNQPH---YSINMTAVQVGLDFLNLPTDV 299
+F+ G Q V TPL+ N + Y +++ + VG L +P
Sbjct: 223 AIPSTVLLDLPADLFSNG---QGAVQTTPLIQNPANPTFYYLSLKGITVGSTRLPVPESE 279
Query: 300 FGVGDNK-GTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHT--VHDEYTCFQYSESV 356
F + + GTIIDSGT + LP VY LV + Q L V + D Y C
Sbjct: 280 FALKNGTGGTIIDSGTAMTSLPTRVYR-LVRDAFAAQVKLPVVSGNTTDPYFCLSAPLRA 338
Query: 357 DEGFPNVTFHFENSVSLKVYPHEYLFPFED----LWCI----GWQNSGMQSRDRKNMTLL 408
P + HFE + ++ + Y+F ED + C+ G + + + + ++NM +L
Sbjct: 339 KPYVPKLVLHFEGA-TMDLPRENYVFEVEDAGSSILCLAIIEGGEVTTIGNFQQQNMHVL 397
Query: 409 GDF 411
D
Sbjct: 398 YDL 400
>gi|21617933|gb|AAM66983.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
Length = 425
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 87/301 (28%), Positives = 130/301 (43%), Gaps = 39/301 (12%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y + IGTP + V +DT +D W+ C C C L+D SS+ + +
Sbjct: 88 YIVRANIGTPAQPMLVALDTSNDAAWIPCSGCVGCSSS-------VLFDPSKSSSSRTLQ 140
Query: 136 CDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNG 195
C+ C P CT + SC + YG GS+ Y QD + ++ D+ T
Sbjct: 141 CEAPQCK---QAPNPSCTVSKSCGFNMTYG-GSTIEAYLTQDTL---TLASDVIPNYT-- 191
Query: 196 SLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL---DGIN 252
FGC + SG G++G G+ S+ISQ S + F++CL N
Sbjct: 192 ---FGCINKASG-----TSLPAQGLMGLGRGPLSLISQ--SQNLYQSTFSYCLPNSKSSN 241
Query: 253 GGGIFAIGHVVQP-EVNKTPLVPNQPH---YSINMTAVQVGLDFLNLPTD--VFGVGDNK 306
G +G QP + TPL+ N Y +N+ ++VG +++PT F
Sbjct: 242 FSGSLRLGPKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGA 301
Query: 307 GTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFH 366
GTI DSGT L E Y + ++ + + ++ TC YS SV FP+VTF
Sbjct: 302 GTIFDSGTVYTRLVEPAYVAVRNEFRRRVKNANATSLGGFDTC--YSGSVV--FPSVTFM 357
Query: 367 F 367
F
Sbjct: 358 F 358
>gi|15232503|ref|NP_191008.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|7288018|emb|CAB81805.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|17979257|gb|AAL49945.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
gi|21700851|gb|AAM70549.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
gi|332645705|gb|AEE79226.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 425
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 87/301 (28%), Positives = 130/301 (43%), Gaps = 39/301 (12%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y + IGTP + V +DT +D W+ C C C L+D SS+ + +
Sbjct: 88 YIVRANIGTPAQPMLVALDTSNDAAWIPCSGCVGCSSS-------VLFDPSKSSSSRTLQ 140
Query: 136 CDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNG 195
C+ C P CT + SC + YG GS+ Y QD + ++ D+ T
Sbjct: 141 CEAPQCK---QAPNPSCTVSKSCGFNMTYG-GSTIEAYLTQDTL---TLASDVIPNYT-- 191
Query: 196 SLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL---DGIN 252
FGC + SG G++G G+ S+ISQ S + F++CL N
Sbjct: 192 ---FGCINKASG-----TSLPAQGLMGLGRGPLSLISQ--SQNLYQSTFSYCLPNSKSSN 241
Query: 253 GGGIFAIGHVVQP-EVNKTPLVPNQPH---YSINMTAVQVGLDFLNLPTD--VFGVGDNK 306
G +G QP + TPL+ N Y +N+ ++VG +++PT F
Sbjct: 242 FSGSLRLGPKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGA 301
Query: 307 GTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFH 366
GTI DSGT L E Y + ++ + + ++ TC YS SV FP+VTF
Sbjct: 302 GTIFDSGTVYTRLVEPAYVAVRNEFRRRVKNANATSLGGFDTC--YSGSVV--FPSVTFM 357
Query: 367 F 367
F
Sbjct: 358 F 358
>gi|148910443|gb|ABR18297.1| unknown [Picea sitchensis]
Length = 452
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 98/384 (25%), Positives = 157/384 (40%), Gaps = 47/384 (12%)
Query: 26 HGVFSVKYRYAGRERSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTP 85
+ S K R G L LK +Q A ++P+ G G Y ++ GTP
Sbjct: 74 ESLMSEKIR--GDANRLRFLKRTSRSSKQD--ANANVPVRS-----GSGEYIIQVDFGTP 124
Query: 86 PKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVY 145
+ Y +DTGSD+ W+ C QC+ C + ++D SS+ K CD + C +
Sbjct: 125 KQSMYTLIDTGSDVAWIPCKQCQGCHSTAP------IFDPAKSSSYKPFACDSQPCQEIS 178
Query: 146 GGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQ 205
G +C N+ C + YGDG+ G D + + + FGC
Sbjct: 179 G----NCGGNSKCQFEVSYGDGTQVDGTLASDAITLG--------SQYLPNFSFGCAESL 226
Query: 206 SGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAI----GH 261
S + + G + ++L GG F++CL + +
Sbjct: 227 SEDTSPSPGLMGLGGGSLSLLTQAPTAELF--GGT---FSYCLPSSSTSSGSLVLGKEAA 281
Query: 262 VVQPEVNKTPLV--PNQP-HYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAY 318
V + T L+ P+ P Y + + A+ VG +++P + GTIIDSGTT+ +
Sbjct: 282 VSSSSLKFTTLIKDPSIPTFYFVTLKAISVGNTRISVPGT--NIASGGGTIIDSGTTITH 339
Query: 319 LPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQY-SESVDEGFPNVTFHFENSVSLKVYP 377
L Y L Q L+ V D TC+ S SVD P +T H + +V L V P
Sbjct: 340 LVPSAYTALRDAFRQQLSSLQPTPVEDMDTCYDLSSSSVD--VPTITLHLDRNVDL-VLP 396
Query: 378 HEYLFPFED--LWCIGWQNSGMQS 399
E + ++ L C+ + ++ +S
Sbjct: 397 KENILITQESGLACLAFSSTDSRS 420
>gi|326501496|dbj|BAK02537.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 98/396 (24%), Positives = 157/396 (39%), Gaps = 48/396 (12%)
Query: 38 RERSLSLLKEHDARRQQRILAG-----VDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQ 92
R+R ++ + H RR + AG ++PL + G+G Y+ + +GTP + + +
Sbjct: 53 RQR-MAFIASHGRRRARETAAGSSAAAFEMPLTSGAY-TGIGQYFVRFRVGTPAQPFLLV 110
Query: 93 VDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDC 152
DTGSD+ WV C + S + +DS T ++C + C L C
Sbjct: 111 ADTGSDLTWVKCRR-PAANSSESGSGSGRAFRPEDSRTWAPISCASDTCTKSLPFSLATC 169
Query: 153 -TANTSCPYLEIYGDGSSTTGYF-VQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLD 210
T + C Y Y DGS+ G + G + + L+ GC + +G
Sbjct: 170 PTPGSPCAYDYRYKDGSAARGTVGTESATIALSGRGREERKAKLKGLVLGCTSSYTG--- 226
Query: 211 STNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHC----LDGINGGGIFAIG------ 260
+ E DG++ G S+ S S AS R F++C L N G
Sbjct: 227 -PSFEVSDGVLSLGYSDVSFASHAASRFAGR--FSYCLVDHLSPRNATSYLTFGPNPAVA 283
Query: 261 -----------------HVVQPEVNKTPLVPN---QPHYSINMTAVQVGLDFLNLPTDVF 300
+P +TPL+ + +P Y + + AV V FL +P V+
Sbjct: 284 SSSSPSSPAPASCTAAAPRPRPRARQTPLLLDRRMRPFYDVAVKAVSVAGQFLKIPRAVW 343
Query: 301 GVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQY-SESVDEG 359
V G I+DSGT+L L + Y +V+ + L T+ C+ + S S D
Sbjct: 344 DVDAGGGVILDSGTSLTVLAKPAYRAVVAALSEGLAGLPRVTMDPFEYCYNWTSPSGDVT 403
Query: 360 FPNVTFHFENSVSLKVYPHEYLF-PFEDLWCIGWQN 394
P + HF + L+ Y+ + CIG Q
Sbjct: 404 LPKMAVHFAGAARLEPPGKSYVIDAAPGVKCIGLQE 439
>gi|414888271|tpg|DAA64285.1| TPA: hypothetical protein ZEAMMB73_923514, partial [Zea mays]
Length = 335
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 91/309 (29%), Positives = 136/309 (44%), Gaps = 28/309 (9%)
Query: 36 AGRERSLSLLKEHDARRQQRILAG---VDLPLGGSS-RPDGVG-LYYAKIGIGTPPKDYY 90
AG + L HD RR R LAG V G + R + +G L+YA + +GTP +
Sbjct: 45 AGTAEYYAALAGHDLRR--RSLAGGGEVAFADGNDTYRLNELGFLHYAVVALGTPNVTFL 102
Query: 91 VQVDTGSDIMWV--NCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGP 148
V +DTGSD+ WV +CI C + ++ Y + SST + V C C
Sbjct: 103 VALDTGSDLFWVPCDCINCAPLVSPNYRDLKFDTYSPQKSSTSRKVPCSSNLCDEQSACR 162
Query: 149 LTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGN 208
+ S YL D +S+TG V+DV+ Y Q + FGCG Q+G+
Sbjct: 163 SASSSCPYSIQYLS---DNTSSTGVLVEDVL-YLVTEYGRQPKIVTAPITFGCGRTQTGS 218
Query: 209 LDSTNEEALDGIIGFGKSNSSMISQLASSG-GVRKMFAHCLDGINGGGIFAIGHVVQPEV 267
T A +G++G G S+ S LAS G F+ C +G G G +
Sbjct: 219 FLGT--AAPNGLLGLGMDTISVPSLLASQGVAAANSFSMCF-AQDGHGRINFGDTGSSDQ 275
Query: 268 NKTPL--VPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYE 325
+TPL P+Y+I++T VG ++ + I+DSGT+ L + +Y
Sbjct: 276 QETPLNMYKQNPYYNISITGATVGSKSIHTKFNA---------IVDSGTSFTALSDPMYT 326
Query: 326 PLVSKIISQ 334
+ S + Q
Sbjct: 327 QITSSVSVQ 335
>gi|449508697|ref|XP_004163385.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase Asp1-like
[Cucumis sativus]
Length = 418
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 86/301 (28%), Positives = 128/301 (42%), Gaps = 39/301 (12%)
Query: 43 SLLKEHDARRQQRILA-----------GVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYV 91
S K+ R++ IL+ + LPL G+ P+G Y + +G PPK Y++
Sbjct: 15 SFFKDKPWERKRPILSVPTASSSFASSSIVLPLQGNVYPNG--FYNVTLYVGQPPKPYFL 72
Query: 92 QVDTGSDIMWVNC-IQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLT 150
DTGSD+ W+ C C++C TL+ + S V C C ++
Sbjct: 73 DPDTGSDLTWLQCDAPCQQCTE--------TLHPLYQPSN-DLVPCKDPLCMSLHSSMDH 123
Query: 151 DCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLD 210
C C Y Y DG S+ G V+DV + +GD L GCG Q
Sbjct: 124 RCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGD----PIRPRLALGCGYDQDPG-- 177
Query: 211 STNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHVVQP-EVNK 269
S++ +DGI+G G+ S++SQL + G VR + HC + GG F + P +
Sbjct: 178 SSSYHPMDGILGLGRGAVSIVSQLHNQGIVRNVVGHCFNSKGGGYXFFGDGIYDPYRLVW 237
Query: 270 TPLVPNQP-HYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLV 328
TP+ + P HYS L F T + N + DSG++ Y Y+ L
Sbjct: 238 TPMSRDYPKHYSPGFGE----LIFNGRSTGL----RNLFVVFDSGSSYTYFNAQAYQVLT 289
Query: 329 S 329
S
Sbjct: 290 S 290
>gi|358346443|ref|XP_003637277.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355503212|gb|AES84415.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 94/338 (27%), Positives = 152/338 (44%), Gaps = 35/338 (10%)
Query: 73 VGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGK 132
+G Y +GTPP Y +DTGS+I+W+ C C C ++S +++ SS+ K
Sbjct: 86 LGEYLISYSVGTPPFKVYGFMDTGSNIVWLQCQPCNTCFNQTS-----PIFNPSKSSSYK 140
Query: 133 FVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTS 192
+ C C ++ C Y YG + + G D + D SG ++
Sbjct: 141 NIPCTSSTCKDTNDTHISCSNGGDVCEYSITYGGDAKSQGDLSNDSLTLDSTSG---SSV 197
Query: 193 TNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGIN 252
+++ GCG N+ N ++ G++G G+ S+I Q+ SS V F++CL N
Sbjct: 198 LFPNIVIGCGHI---NVLQDNSQS-SGVVGMGRGPMSLIKQVGSS-SVGSKFSYCLIPYN 252
Query: 253 GGG------IFAIGHVVQPE-VNKTPLVP---NQPHYSINMTAVQVGLDFLNLPTDVFGV 302
IF VV E V TP+V + +Y + + A VG + + +G
Sbjct: 253 SDSNSSSKLIFGEDVVVSGEIVVSTPMVKVNGQENYYFLTLEAFSVGNNRIE-----YGE 307
Query: 303 GDNKGT---IIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDE- 358
N T +IDSGT L LP + LVS ++Q+ L D + Y+ + +
Sbjct: 308 RSNASTQNILIDSGTPLTMLPNLFLSKLVS-YVAQEVKLPRIEPPDHHLSLCYNTTGKQL 366
Query: 359 GFPNVTFHFENSVSLKVYPHEYLFPFED-LWCIGWQNS 395
P++T HF N +K+ + FPFED + C G+ +S
Sbjct: 367 NVPDITAHF-NGADVKLNSNGTFFPFEDGIMCFGFISS 403
>gi|356546370|ref|XP_003541599.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 434
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 80/273 (29%), Positives = 122/273 (44%), Gaps = 36/273 (13%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKF 133
G Y +G PP Y +DTGSD++W+ C C++C +++ ++D S+T K
Sbjct: 84 GEYLISYSVGIPPFQLYGIIDTGSDMIWLQCKPCEKCYNQTT-----RIFDPSKSNTYKI 138
Query: 134 VTCDQEFCHGVYGGPLTDCTANTS--CPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
+ C V T C+++ C Y YGDGS + G D L
Sbjct: 139 LPFSSTTCQSVED---TSCSSDNRKMCEYTIYYGDGSYSQG---------DLSVETLTLG 186
Query: 192 STNGS------LIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQL-ASSGGVRKMF 244
STNGS + GCG + + + + GI+G G S+I+QL S + + F
Sbjct: 187 STNGSSVKFRRTVIGCGRNNTVSFEGKSS----GIVGLGNGPVSLINQLRRRSSSIGRKF 242
Query: 245 AHCL---DGINGGGIFAIGHVVQPE-VNKTPLVPNQPH--YSINMTAVQVGLDFLNLPTD 298
++CL I+ F VV + TP+V + P Y + + A VG + + +
Sbjct: 243 SYCLASMSNISSKLNFGDAAVVSGDGTVSTPIVTHDPKVFYYLTLEAFSVGNNRIEFTSS 302
Query: 299 VFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKI 331
F G+ IIDSGTTL LP +Y L S +
Sbjct: 303 SFRFGEKGNIIIDSGTTLTLLPNDIYSKLESAV 335
>gi|356531224|ref|XP_003534178.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 492
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 85/323 (26%), Positives = 140/323 (43%), Gaps = 37/323 (11%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y++++G+G P K +Y+ +DTGSD+ W+ C C +C ++S ++D SS+
Sbjct: 153 GSGEYFSRVGVGQPSKPFYMVLDTGSDVNWLQCKPCSDCYQQSD-----PIFDPTASSSY 207
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
+TCD + C + ++ C N C Y YGDGS T G +V + V + S +
Sbjct: 208 NPLTCDAQQCQDL---EMSAC-RNGKCLYQVSYGDGSFTVGEYVTETVSFGAGSVN---- 259
Query: 192 STNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI 251
+ GCG G + G S+ SQ+ ++ F++CL
Sbjct: 260 ----RVAIGCGHDNEGLFVGSAGLLGL-----GGGPLSLTSQIKATS-----FSYCLVDR 305
Query: 252 NGGGIFAIGHVVQPEVNKT---PLVPNQP---HYSINMTAVQVGLDFLNLPTDVFGVGDN 305
+ G + P + PL+ NQ Y + +T V VG + + +P + F V +
Sbjct: 306 DSGKSSTL-EFNSPRPGDSVVAPLLKNQKVNTFYYVELTGVSVGGEIVTVPPETFAVDQS 364
Query: 306 --KGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLK-VHTVHDEYTCFQYSESVDEGFPN 362
G I+DSGT + L Y + + +L+ V TC+ S P
Sbjct: 365 GAGGVIVDSGTAITRLRTQAYNSVRDAFKRKTSNLRPAEGVALFDTCYDLSSLQSVRVPT 424
Query: 363 VTFHFENSVSLKVYPHEYLFPFE 385
V+FHF + + YL P +
Sbjct: 425 VSFHFSGDRAWALPAKNYLIPVD 447
>gi|357486591|ref|XP_003613583.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355514918|gb|AES96541.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 437
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 92/303 (30%), Positives = 133/303 (43%), Gaps = 28/303 (9%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y IGTPP Y +DT +D +W C CK C +S ++D SST K +
Sbjct: 89 YIISFLIGTPPFQLYGVMDTANDNIWFQCNPCKPCFNTTS-----PMFDPSKSSTYKTIP 143
Query: 136 CDQEFCHGVYGGPLTDCTANTS--CPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTST 193
C C V T C+++ C Y YG + + G D + ++ + T +
Sbjct: 144 CSSPKCKNVEN---THCSSDDKKVCEYSFTYGGEAYSQGDLSIDTLT---LNSNNDTPIS 197
Query: 194 NGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL----- 248
+++ GCG R G L E + G IG G+ S ISQL SS G + F++CL
Sbjct: 198 FKNIVIGCGHRNKGPL----EGYVSGNIGLGRGPLSFISQLNSSIGGK--FSYCLVPLFS 251
Query: 249 -DGINGGGIFAIGHVVQ-PEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNK 306
+GI+G F VV TP+ + YS + A+ VG + DN
Sbjct: 252 NEGISGKLHFGDKSVVSGVGTVSTPITAGEIGYSTTLNALSVGDHIIKFENST-SKNDNL 310
Query: 307 G-TIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTF 365
G TIIDSGTTL LPE VY L S + S + + + ++ + + P +T
Sbjct: 311 GNTIIDSGTTLTILPENVYSRLESIVTSMVKLERAKSPNQQFKLCYKATLKNLDVPIITA 370
Query: 366 HFE 368
HF
Sbjct: 371 HFN 373
>gi|226508202|ref|NP_001141111.1| hypothetical protein precursor [Zea mays]
gi|194702684|gb|ACF85426.1| unknown [Zea mays]
gi|414590469|tpg|DAA41040.1| TPA: hypothetical protein ZEAMMB73_571218 [Zea mays]
Length = 439
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 101/414 (24%), Positives = 172/414 (41%), Gaps = 59/414 (14%)
Query: 22 VSSNHGVFSVKYRYAGRERSLSLLKEHDARRQQRILA--GVDLPLGGSSRPDGVGLYYAK 79
V ++ V + ++ A R + H+AR+ + V P+ ++ P G +
Sbjct: 35 VHADPSVTASQFVRAALHRDM---HRHNARKLAASSSDGTVSAPVSPTTVP---GEFLMT 88
Query: 80 IGIGTPPKDYYVQVDTGSDIMWVNCIQC-KECPRRSSLGIELTLYDIKDSSTGKFVTCDQ 138
+ IGTPP + DTGSD++W C C ++C ++ + LY+ S+T + C+
Sbjct: 89 LAIGTPPLPFLAIADTGSDLIWTQCAPCSRQCFQQPT-----PLYNPSSSTTFSALPCNS 143
Query: 139 EFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLI 198
L C +C Y YG G + Y Q + S +
Sbjct: 144 S---------LGLCAPACACMYNMTYGSGWT---YVFQGTETFTFGSSTPADQVRVPGIA 191
Query: 199 FGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL---DGINGGG 255
FGC SG N + G++G G+ + S++SQL G K F++CL N
Sbjct: 192 FGCSNASSG----FNASSASGLVGLGRGSLSLVSQL----GAPK-FSYCLTPYQDTNSTS 242
Query: 256 IFAIGHVVQPE----VNKTPLV--PNQPHYSINMTAVQVGLDFLNLPTDVFGVGDN--KG 307
+G V+ TP V P+ +Y +N+T + +G L +P + F + + G
Sbjct: 243 TLLLGPSASLNDTGVVSSTPFVASPSSIYYYLNLTGISLGTTALPIPPNAFSLKADGTGG 302
Query: 308 TIIDSGTTLAYLPEMVYEPLVSKIIS--QQPDLKVHTVHDEYTCFQY--SESVDEGFPNV 363
IIDSGTT+ L Y+ + + ++S P CF+ S S P++
Sbjct: 303 LIIDSGTTITMLGNTAYQQVRAAVLSLVTLPTTDGSAATGLDLCFELPSSTSAPPSMPSM 362
Query: 364 TFHFENSVSLKVYPHEYLF------PFEDLWCIGWQNSGMQSRDRKNMTLLGDF 411
T HF+ + + + Y+ LWC+ QN D +++LG++
Sbjct: 363 TLHFDGA-DMVLPADNYMMSLSDPDSDSSLWCLAMQN--QTDTDGVVVSILGNY 413
>gi|125561849|gb|EAZ07297.1| hypothetical protein OsI_29545 [Oryza sativa Indica Group]
Length = 451
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 101/399 (25%), Positives = 162/399 (40%), Gaps = 59/399 (14%)
Query: 31 VKYRYAGRERSLSLLKEHDARRQQRILAGVDLPLGG--SSRPDGVGL------------Y 76
+ + YAG S + H AR + A + L G S+R GV +
Sbjct: 32 LDHPYAGSSLSRHDVVRHGARASKTRAAWLTAKLAGVLSNRRGGVSPADVRLSPLSDQGH 91
Query: 77 YAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTC 136
+GIGTPP+ + VDTGSD++W C + G +YD +SST F+ C
Sbjct: 92 SLTVGIGTPPQPRKLIVDTGSDLIWTQCKLSSSTAVAARHG-SPPVYDPGESSTFAFLPC 150
Query: 137 DQEFCH-GVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNG 195
C G + +CT+ C Y ++YG ++ G + + G + S
Sbjct: 151 SDRLCQEGQFS--FKNCTSKNRCVYEDVYGSAAA-VGVLASETFTF----GARRAVSLR- 202
Query: 196 SLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL----DGI 251
L FGCGA +G+L GI+G + S+I+QL + F++CL D
Sbjct: 203 -LGFGCGALSAGSLIGAT-----GILGLSPESLSLITQLKI-----QRFSYCLTPFADKK 251
Query: 252 NGGGIFAI-----GHVVQPEVNKTPLVPN---QPHYSINMTAVQVGLDFLNLPTDVFGVG 303
+F H + T +V N +Y + + + +G L +P +
Sbjct: 252 TSPLLFGAMADLSRHKTTRPIQTTAIVSNPVKTVYYYVPLVGISLGHKRLAVPAASLAMR 311
Query: 304 DN--KGTIIDSGTTLAYLPEMVYEPLVSKIIS-QQPDLKVHTVHDEYTCFQYSESVDEG- 359
+ GTI+DSG+T+AYL E +E + ++ + + TV D CF
Sbjct: 312 PDGGGGTIVDSGSTVAYLVEAAFEAVKEAVMDVVRLPVANRTVEDYELCFVLPRRTAAAA 371
Query: 360 -----FPNVTFHFENSVSLKVYPHEYLF--PFEDLWCIG 391
P + HF+ ++ V P + F P L C+
Sbjct: 372 MEAVQVPPLVLHFDGGAAM-VLPRDNYFQEPRAGLMCLA 409
>gi|424513106|emb|CCO66690.1| predicted protein [Bathycoccus prasinos]
Length = 802
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 106/384 (27%), Positives = 161/384 (41%), Gaps = 74/384 (19%)
Query: 62 LPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELT 121
L L G +R G +YA + IGTP + V VDTGS +V C C C + S
Sbjct: 126 LELNGKARD--TGYFYATVLIGTPGHQFEVIVDTGSTYTFVTCYPCASCGQHGSNAP--- 180
Query: 122 LYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQY 181
YD SS+ + V C G C A+ C Y E + + S G+ V DV+
Sbjct: 181 -YDAAKSSSYERVPCGSGCIFGA-------CRASGLCEYDEKFSEDSQVGGHVVSDVID- 231
Query: 182 DKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASS---- 237
V G L T + FGC + ++ L + + +G+I G++ + + QL
Sbjct: 232 --VGGSLGTPRIH----FGCNSLETNMLKT---QKANGMIALGRAEAGLHRQLKKKAYPP 282
Query: 238 GGVRKMFAHCLDGINGGGIFAIGHVVQPEVNKTPLVPNQPH--------------YSINM 283
G F CL GGG+ ++G + PE + V + H Y++ +
Sbjct: 283 GSYDGTFGLCLGSFEGGGVLSLGKL--PEQHYANFVTRKTHTSTVKLVKGSKSQYYNVEV 340
Query: 284 TAVQVGLDFLNLPT-----DVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPD- 337
+ V L P+ + F G GT++DSGTT YL E V+ P +S+I + +
Sbjct: 341 HRMFVRNTELKKPSGAELMEAFRAG--YGTVLDSGTTYTYLHEDVFIPFISEIEDKVVND 398
Query: 338 -----LKVHTVHDEY---TCF-------QYSES-VDEGFPNVTFHF----ENSVSLKVYP 377
+V Y C+ Q SES V+ FP F E + ++ P
Sbjct: 399 HGANFFRVRGGDPNYPNDVCWRSLNENKQLSESNVNYLFPTFNLTFIGVNEEELPIEFLP 458
Query: 378 HEYLF--PFE-DLWCIGWQNSGMQ 398
YLF P E + +C+G ++G Q
Sbjct: 459 ENYLFVHPNEPNAFCVGVFDNGQQ 482
>gi|388504358|gb|AFK40245.1| unknown [Medicago truncatula]
Length = 480
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 98/356 (27%), Positives = 161/356 (45%), Gaps = 52/356 (14%)
Query: 38 RERSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGS 97
+ R + + H++ Q + + +PL + + Y IG+G ++ V +DTGS
Sbjct: 99 QNRIRAKVSGHNSSEQS---SEIQIPLASGINLETLN-YIVTIGLGN--QNMTVIIDTGS 152
Query: 98 DIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVY--GGPLTDCTAN 155
D+ WV C C C + +++ +SS+ + C+ C + G C +N
Sbjct: 153 DLTWVQCDPCMSCYSQQG-----PVFNPSNSSSYNSLLCNSSTCQNLQFTTGNTEACESN 207
Query: 156 --TSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTN 213
+SC + YGDGS T G + + + +S + +FGCG G
Sbjct: 208 NPSSCNHTVSYGDGSFTDGELGVEHLSFGGISVS--------NFVFGCGRNNKGLFG--- 256
Query: 214 EEALDGIIGFGKSNSSMISQLASS-GGVRKMFAHCLDGINGG--GIFAIGHVVQPEVNKT 270
+ GI+G G+SN SMISQ ++ GGV F++CL + G G IG+ N T
Sbjct: 257 --GVSGIMGLGRSNLSMISQTNTTFGGV---FSYCLPTTDSGASGSLVIGNESSLFKNLT 311
Query: 271 PL----VPNQPH----YSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEM 322
P+ + + P Y +N+T + VG + + FG N G +IDSGT + L
Sbjct: 312 PIAYTSMVSNPQLSNFYVLNLTGIDVG--GVAIQDTSFG---NGGILIDSGTVITRLAPS 366
Query: 323 VYEPLVSKIISQ---QPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKV 375
+Y L ++ + Q P ++ D TCF + + P ++ HFEN+V L V
Sbjct: 367 LYNALKAEFLKQFSGYPIAPALSILD--TCFNLTGIEEVSIPTLSMHFENNVDLNV 420
>gi|297734190|emb|CBI15437.3| unnamed protein product [Vitis vinifera]
Length = 473
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 82/281 (29%), Positives = 122/281 (43%), Gaps = 34/281 (12%)
Query: 62 LPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNC-IQCKECPRRSSLGIEL 120
P+ G P+G LY+ I +G+PP+ Y++ +DTGSD+ W+ C C C + +
Sbjct: 89 FPVRGDVYPNG--LYFTHIFVGSPPRRYFLDMDTGSDLTWIQCDAPCTSCAKGPN----- 141
Query: 121 TLYDIKDSSTGKFVTCDQEFCHGVYGGPLTD-CTANTSCPYLEIYGDGSSTTGYFVQDVV 179
LY K G V C V T C C Y Y D SS+ G D
Sbjct: 142 PLYKPK---KGNLVPLKDSLCVEVQRNLKTGYCETCEQCDYEIEYADHSSSMGVLASD-- 196
Query: 180 QYDKVSGDLQTTSTNGSL-----IFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQL 234
DL NGSL +FGC Q G L ++ + DGI+G K+ S+ SQL
Sbjct: 197 -------DLHLMLANGSLTKLGIMFGCAYDQQGLLLNSLAKT-DGILGLSKAKVSLPSQL 248
Query: 235 ASSGGVRKMFAHCLDG-INGGGIFAIGHVVQPE--VNKTPLV-PNQPHYSINMTAVQVGL 290
AS + + HCL GGG +G P + P++ + P+Y + + G
Sbjct: 249 ASQRIINNVLGHCLTSDATGGGYMFLGDDFVPYWGMAWVPMLNSHSPNYHSQIMKISHGS 308
Query: 291 DFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKI 331
L+L G + + D+G++ Y P+ Y LV+ +
Sbjct: 309 RQLSLGRQ---DGRTERVVFDTGSSYTYFPKEAYYALVASL 346
>gi|297845610|ref|XP_002890686.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297336528|gb|EFH66945.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 486
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 95/404 (23%), Positives = 165/404 (40%), Gaps = 64/404 (15%)
Query: 26 HGVFSVKYRYAGRERSLSLLK-EHDARRQQRILAGVDLPLGGSSRPD------------- 71
H SV+ +SL+L + D R + ++ +DL + S+ D
Sbjct: 74 HSRVSVRGTEHSDYKSLTLARLNRDTARVKSLITRLDLAINNISKADLKPVTTMYTTTEE 133
Query: 72 -------------GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGI 118
G G Y+ ++GIG P ++ Y+ +DTGSD+ W+ C C +C ++
Sbjct: 134 EDIEAPLISGTTQGSGEYFTRVGIGNPAREVYMVLDTGSDVNWLQCTPCADCYHQTE--- 190
Query: 119 ELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDV 178
+++ SS+ + ++CD C+ + +++C N +C Y YGDGS T G F +
Sbjct: 191 --PIFEPSSSSSYEPLSCDTPQCNAL---EVSEC-RNATCLYEVSYGDGSYTVGDFATET 244
Query: 179 VQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSG 238
+ ++ ++ GCG G G ++ SQL ++
Sbjct: 245 LTIG--------STLVQNVAVGCGHSNEGLFVGAAGLLGL-----GGGLLALPSQLNTTS 291
Query: 239 GVRKMFAHCL--DGINGGGIFAIGHVVQPEVNKTPLVPNQ---PHYSINMTAVQVGLDFL 293
F++CL + G + P+ PL+ N Y + +T + VG + L
Sbjct: 292 -----FSYCLVDRDSDSASTVEFGTSLPPDAVVAPLLRNHQLDTFYYLGLTGISVGGELL 346
Query: 294 NLPTDVFGVGD--NKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDL-KVHTVHDEYTCF 350
+P F + + + G IIDSGT + L +Y L + DL K V TC+
Sbjct: 347 QIPQSSFEMDESGSGGIIIDSGTAVTRLQTGIYNSLRDSFLKGTSDLEKAAGVAMFDTCY 406
Query: 351 QYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFEDL--WCIGW 392
S P V FHF L + Y+ P + + +C+ +
Sbjct: 407 NLSAKTTIEVPTVAFHFPGGKMLALPAKNYMIPVDSVGTFCLAF 450
>gi|242067693|ref|XP_002449123.1| hypothetical protein SORBIDRAFT_05g005430 [Sorghum bicolor]
gi|241934966|gb|EES08111.1| hypothetical protein SORBIDRAFT_05g005430 [Sorghum bicolor]
Length = 408
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 100/341 (29%), Positives = 145/341 (42%), Gaps = 45/341 (13%)
Query: 64 LGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLY 123
L GS P VG +Y + IG P + Y++ +DTGS W+ C K+ P ++ + LY
Sbjct: 29 LDGSVYP--VGHFYVTMNIGEPAEPYFLDIDTGSSFTWLEC-HAKDGPCKTCNKVPHPLY 85
Query: 124 DIKDSSTGKFVTCDQEFCHGVYG--GPLTDCT--ANTSCPYLEIYGDGSSTTGYFVQDVV 179
+ + K V C C ++ G CT C Y Y DG S+ G + D
Sbjct: 86 RL---TRKKLVPCADPLCDALHKDLGTTKKCTDVRKNQCDYKVKYQDGLSSLGVLLLDKF 142
Query: 180 QYDKVSGDLQTTSTNGSLIFGCGARQ-SGNLDSTNEEA-LDGIIGFGKSNSSMISQLASS 237
T ++ FGCG Q G+ E+ +DGI+G G+ + + SQL S
Sbjct: 143 SL--------PTGGARNIAFGCGYDQMKGSKKKAPEKVPVDGILGLGRGSVDLASQLKHS 194
Query: 238 GGVRK-MFAHCLDGINGGGIFAIG--HVVQPEVNKTPLVPNQP----HYSINMTAVQVGL 290
G V K + HCL GGG IG +V V P+ P P HYS G
Sbjct: 195 GAVSKNVIGHCLSS-KGGGYLFIGEENVPSSHVTWVPMAPTTPGEPNHYS-------PGQ 246
Query: 291 DFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEY--- 347
L+L ++ G K I DSG+T YLPE ++ LVS + + + V D
Sbjct: 247 ATLHLDSNPIGTKPLKA-IFDSGSTYTYLPENLHAQLVSALKASLSKSSLKQVSDPALPL 305
Query: 348 -----TCFQYSESVDEGFPN-VTFHFENSVSLKVYPHEYLF 382
F+ + F + VT F+ V++ + P YL
Sbjct: 306 CWKGPKPFKTVHDTPKEFKSLVTLKFDLGVTMIIPPENYLI 346
>gi|116787333|gb|ABK24467.1| unknown [Picea sitchensis]
Length = 497
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 89/332 (26%), Positives = 134/332 (40%), Gaps = 48/332 (14%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y+ ++G+GTPP+ Y+ +DTGSDIMW+ C+ C +C G L++ SST
Sbjct: 149 GSGEYFTRLGVGTPPRYTYMVLDTGSDIMWIQCLPCAKC-----YGQTDPLFNPAASSTY 203
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
+ V C C + ++ C C Y YGDGS T G F +T
Sbjct: 204 RKVPCATPLCKKL---DISGCRNKRYCEYQVSYGDGSFTVGDF------------STETL 248
Query: 192 STNGSLI----FGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVR--KMFA 245
+ G +I GCG + E L S G + K F+
Sbjct: 249 TFRGQVIRRVALGCG---------HDNEGLFIGAAGLLGLGRGSLSFPSQTGAQFSKRFS 299
Query: 246 HCL-----DGINGGGIFAIGHVVQPEVNKTPLVPN---QPHYSINMTAVQVGLDFL-NLP 296
+CL G IF + + + TPL+ N Y + + + VG L ++P
Sbjct: 300 YCLVDRSASGTASSLIFGKAAIPKSAIF-TPLLSNPKLDTFYYVELVGISVGGRRLTSIP 358
Query: 297 TDVFGVGD--NKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEY-TCFQYS 353
VF + N G IIDSGT++ L + Y + +LK + TC+ S
Sbjct: 359 ASVFRMDATGNGGVIIDSGTSVTRLVDSAYSTMRDAFRVGTGNLKSAGGFSLFDTCYDLS 418
Query: 354 ESVDEGFPNVTFHFENSVSLKVYPHEYLFPFE 385
P + FHF+ + + YL P +
Sbjct: 419 GLKTVKVPTLVFHFQGGAHISLPATNYLIPVD 450
>gi|449458736|ref|XP_004147103.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
gi|449518669|ref|XP_004166359.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 482
Score = 95.1 bits (235), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 85/337 (25%), Positives = 139/337 (41%), Gaps = 39/337 (11%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y+ +IG+G+PP++ Y+ +D+GSDI+WV C C C ++S ++D DSS+
Sbjct: 139 GSGEYFVRIGVGSPPRNQYMVIDSGSDIVWVQCKPCSRCYQQSD-----PVFDPADSSSF 193
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
V+C + C + T C A C Y YGDGS T G + + +V
Sbjct: 194 AGVSCGSDVCDRLEN---TGCNAG-RCRYEVSYGDGSYTKGTLALETLTVGQV------- 242
Query: 192 STNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL--D 249
+ GCG G G + S I QL G F++CL
Sbjct: 243 -MIRDVAIGCGHTNQGMFIGAAGLLGL-----GGGSMSFIGQLGGQTG--GAFSYCLVSR 294
Query: 250 GINGGGIFAIGHVVQP------EVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVG 303
G G G P + + P P+ Y I + + VG +++P + F +
Sbjct: 295 GTGSTGALEFGRGALPVGATWISLIRNPRAPS--FYYIGLAGIGVGGVRVSVPEETFQLT 352
Query: 304 D--NKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDL-KVHTVHDEYTCFQYSESVDEGF 360
+ G ++D+GT + P Y +Q +L + V TC+ +
Sbjct: 353 EYGTNGVVMDTGTAVTRFPTAAYVAFRDSFTAQTSNLPRAPGVSIFDTCYDLNGFESVRV 412
Query: 361 PNVTFHFENSVSLKVYPHEYLFPFE--DLWCIGWQNS 395
P V+F+F + L + +L P + +C+ + S
Sbjct: 413 PTVSFYFSDGPVLTLPARNFLIPVDGGGTFCLAFAPS 449
>gi|115434870|ref|NP_001042193.1| Os01g0178600 [Oryza sativa Japonica Group]
gi|55296112|dbj|BAD67831.1| putative CDR1 [Oryza sativa Japonica Group]
gi|55296252|dbj|BAD67993.1| putative CDR1 [Oryza sativa Japonica Group]
gi|113531724|dbj|BAF04107.1| Os01g0178600 [Oryza sativa Japonica Group]
gi|125569253|gb|EAZ10768.1| hypothetical protein OsJ_00604 [Oryza sativa Japonica Group]
Length = 454
Score = 95.1 bits (235), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 86/352 (24%), Positives = 146/352 (41%), Gaps = 41/352 (11%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y + +G+PP+ DTGSD++WV C + SS T +D SST V+
Sbjct: 101 YLMTVNLGSPPRSMLAIADTGSDLVWVKCKKGNN--DTSSAAAPTTQFDPSRSSTYGRVS 158
Query: 136 CDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDK-VSGDLQTTSTN 194
C + C + G T C ++C YL YGDGS+TTG + +D SG
Sbjct: 159 CQTDACEAL--GRAT-CDDGSNCAYLYAYGDGSNTTGVLSTETFTFDDGGSGRSPRQVRV 215
Query: 195 GSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL--DGIN 252
G + FGC +G+ + L S+++QL + + + F++CL +N
Sbjct: 216 GGVKFGCSTATAGSFPADGLVGLG------GGAVSLVTQLGGATSLGRRFSYCLVPHSVN 269
Query: 253 GGGIF---AIGHVVQPEVNKTPLVPN--QPHYSINMTAVQVGLDFLNLPTDVFGVGDNKG 307
A+ V +P TPLV +Y++ + +V+VG +
Sbjct: 270 ASSALNFGALADVTEPGAASTPLVAGDVDTYYTVVLDSVKVG-------NKTVASAASSR 322
Query: 308 TIIDSGTTLAYLPEMVYEPLVSKIIS-------QQPDLKVHTVHDEYTCFQYSESVDEGF 360
I+DSGTTL +L + P+V ++ Q PD + Y E
Sbjct: 323 IIVDSGTTLTFLDPSLLGPIVDELSRRITLPPVQSPDGLLQLC---YNVAGREVEAGESI 379
Query: 361 PNVTFHFENSVSLKVYPHEYLFPF-EDLWCIGWQNSGMQSRDRKNMTLLGDF 411
P++T F ++ + P E C+ + + +++ +++LG+
Sbjct: 380 PDLTLEFGGGAAVALKPENAFVAVQEGTLCLAI----VATTEQQPVSILGNL 427
>gi|356558300|ref|XP_003547445.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 447
Score = 95.1 bits (235), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 90/312 (28%), Positives = 138/312 (44%), Gaps = 45/312 (14%)
Query: 78 AKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFV--- 134
A I IG PP V +DTGSDI+WV C C C + LG+ L+D SST +
Sbjct: 103 ANISIGQPPIPQLVVMDTGSDILWVMCTPCTNC--DNDLGL---LFDPSKSSTFSPLCKT 157
Query: 135 TCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTN 194
CD E C C P+ Y D S+ +G F +D V ++ + TS
Sbjct: 158 PCDFEGCR---------CDP---IPFTVTYADNSTASGTFGRDTVVFETTD---EGTSRI 202
Query: 195 GSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL----DG 250
++FGCG N+ + +GI+G S++++L + F++C+ D
Sbjct: 203 SDVLFGCGH----NIGHDTDPGHNGILGLNNGPDSLVTKLG------QKFSYCIGNLADP 252
Query: 251 INGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNK--GT 308
+G E TP Y + M + VG L++ + F + +N+ G
Sbjct: 253 YYNYHQLILGEGADLEGYSTPFEVYNGFYYVTMEGISVGEKRLDIAPETFEMKENRAGGV 312
Query: 309 IIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYT----CFQYSESVD-EGFPNV 363
IID+G+T+ +L + V++ L+SK + E + CF S S D GFP V
Sbjct: 313 IIDTGSTITFLVDSVHK-LLSKEVRNLLGWSFRQATIEKSPWMQCFYGSISRDLVGFPVV 371
Query: 364 TFHFENSVSLKV 375
TFHF + L +
Sbjct: 372 TFHFSDGADLAL 383
>gi|125524353|gb|EAY72467.1| hypothetical protein OsI_00323 [Oryza sativa Indica Group]
Length = 500
Score = 95.1 bits (235), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 84/331 (25%), Positives = 136/331 (41%), Gaps = 35/331 (10%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y++++G+G+P + Y+ +DTGSD+ WV C C +C ++S ++D S++
Sbjct: 159 GSGEYFSRVGVGSPARQLYMVLDTGSDVTWVQCQPCADCYQQSD-----PVFDPSLSTSY 213
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
V CD CH + + T +C Y YGDGS T G F + + GD
Sbjct: 214 ASVACDNPRCHDLDAAACRNSTG--ACLYEVAYGDGSYTVGDFATETLTL----GDSAPV 267
Query: 192 STNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI 251
S S+ GCG G G S SQ++++ F++CL
Sbjct: 268 S---SVAIGCGHDNEGLFVGAAGLLALGGGPL-----SFPSQISAT-----TFSYCLVDR 314
Query: 252 N--GGGIFAIGHVVQPEVNKTPLVPN---QPHYSINMTAVQVGLDFLNLPTDVFGVGDN- 305
+ G EV PL+ + Y + ++ + VG L++P F +
Sbjct: 315 DSPSSSTLQFGDAADAEVTA-PLIRSPRTSTFYYVGLSGISVGGQILSIPPSAFAMDGTG 373
Query: 306 -KGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDL-KVHTVHDEYTCFQYSESVDEGFPNV 363
G I+DSGT + L Y L + L + V TC+ S+ P V
Sbjct: 374 AGGVIVDSGTAVTRLQSSAYAALRDAFVRGTQSLPRTSGVSLFDTCYDLSDRTSVEVPAV 433
Query: 364 TFHFENSVSLKVYPHEYLFPFEDL--WCIGW 392
+ F L++ YL P + +C+ +
Sbjct: 434 SLRFAGGGELRLPAKNYLIPVDGAGTYCLAF 464
>gi|413953788|gb|AFW86437.1| hypothetical protein ZEAMMB73_618532 [Zea mays]
Length = 469
Score = 95.1 bits (235), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 92/311 (29%), Positives = 136/311 (43%), Gaps = 32/311 (10%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y A +G+GTP + +DTGS + WV QCK C L L+D SS+ V
Sbjct: 129 YVATVGLGTPAVPQTLILDTGSSLTWV---QCKPCNSSQCYPQRLPLFDPNTSSSYSPVP 185
Query: 136 CDQEFCHGVYGGPLTD-CTA--NTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTS 192
CD + C + G D CT+ + C Y YG G++ G + D + L +
Sbjct: 186 CDSQECRALAAGIDGDGCTSDGDWGCAYEIHYGSGATPAGEYSTDALT-------LGPGA 238
Query: 193 TNGSLIFGCG-ARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI 251
FGCG +Q G D DG++G G+ S+ Q ++ G +F+HCL
Sbjct: 239 IVKRFHFGCGHHQQRGKFDMA-----DGVLGLGRLPQSLAWQASARRG-GGVFSHCLPPT 292
Query: 252 N-GGGIFAIG--HVVQPEVNKTPLVP--NQP-HYSINMTAVQVGLDFLNLPTDVFGVGDN 305
G A+G H V TPL+ +QP Y + TA+ V L++P VF
Sbjct: 293 GVSTGFLALGAPHDTSAFVF-TPLLTMDDQPWFYQLMPTAISVAGQLLDIPPAVF----R 347
Query: 306 KGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVH-TVHDEYTCFQYSESVDEGFPNVT 364
+G I DSGT L+ L E Y L + S + + V TCF ++ + P V+
Sbjct: 348 EGVITDSGTVLSALQETAYTALRTAFRSAMAEYPLAPPVGHLDTCFNFTGYDNVTVPTVS 407
Query: 365 FHFENSVSLKV 375
F ++ +
Sbjct: 408 LTFRGGATVHL 418
>gi|115434442|ref|NP_001041979.1| Os01g0140100 [Oryza sativa Japonica Group]
gi|12328547|dbj|BAB21205.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113531510|dbj|BAF03893.1| Os01g0140100 [Oryza sativa Japonica Group]
gi|125568961|gb|EAZ10476.1| hypothetical protein OsJ_00309 [Oryza sativa Japonica Group]
gi|215697206|dbj|BAG91200.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 504
Score = 94.7 bits (234), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 84/331 (25%), Positives = 136/331 (41%), Gaps = 35/331 (10%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y++++G+G+P + Y+ +DTGSD+ WV C C +C ++S ++D S++
Sbjct: 163 GSGEYFSRVGVGSPARQLYMVLDTGSDVTWVQCQPCADCYQQSD-----PVFDPSLSTSY 217
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
V CD CH + + T +C Y YGDGS T G F + + GD
Sbjct: 218 ASVACDNPRCHDLDAAACRNSTG--ACLYEVAYGDGSYTVGDFATETLTL----GDSAPV 271
Query: 192 STNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI 251
S S+ GCG G G S SQ++++ F++CL
Sbjct: 272 S---SVAIGCGHDNEGLFVGAAGLLALGGGPL-----SFPSQISAT-----TFSYCLVDR 318
Query: 252 N--GGGIFAIGHVVQPEVNKTPLVPN---QPHYSINMTAVQVGLDFLNLPTDVFGVGDN- 305
+ G EV PL+ + Y + ++ + VG L++P F +
Sbjct: 319 DSPSSSTLQFGDAADAEVTA-PLIRSPRTSTFYYVGLSGLSVGGQILSIPPSAFAMDSTG 377
Query: 306 -KGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDL-KVHTVHDEYTCFQYSESVDEGFPNV 363
G I+DSGT + L Y L + L + V TC+ S+ P V
Sbjct: 378 AGGVIVDSGTAVTRLQSSAYAALRDAFVRGTQSLPRTSGVSLFDTCYDLSDRTSVEVPAV 437
Query: 364 TFHFENSVSLKVYPHEYLFPFEDL--WCIGW 392
+ F L++ YL P + +C+ +
Sbjct: 438 SLRFAGGGELRLPAKNYLIPVDGAGTYCLAF 468
>gi|357132618|ref|XP_003567926.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 468
Score = 94.7 bits (234), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 82/318 (25%), Positives = 135/318 (42%), Gaps = 24/318 (7%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y+ ++ +GTP + + + DTGSD+ WV C + ++ S +
Sbjct: 100 GTGQYFVRLRVGTPAQPFVLVADTGSDLTWVKCSSPSSSSSSPAASPPQRVFRPAGSKSW 159
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTS-CPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQT 190
+ CD + C L +C++ C Y Y D SS G D D
Sbjct: 160 SPLPCDSDTCKSYVPFSLANCSSPPDPCSYDYRYKDNSSARGVVGLDSATVSLSGNDGTR 219
Query: 191 TSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL-D 249
+ ++ GC + + D + ++ DG++ G SN S S+ AS G R F++CL D
Sbjct: 220 KAKLQEVVLGC----TTSYDGQSFKSSDGVLSLGNSNISFASRAASRFGGR--FSYCLVD 273
Query: 250 GI---NGGGIFAIGH-----VVQPEVNKTPLV-----PNQPHYSINMTAVQVGLDFLNLP 296
+ N G+ +TPLV +P Y +++ AV V + L +
Sbjct: 274 HLAPRNATSFLTFGNGDSSPGDDSSSRRTPLVLLEDARTRPFYFVSVDAVTVAGERLEIL 333
Query: 297 TDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDL-KVHTVHDEYTCFQYSES 355
DV+ N G I+DSGT+L L Y+ +V I Q + +V+ EY C+ ++
Sbjct: 334 PDVWDFRKNGGAILDSGTSLTILATPAYDAVVKAISKQFAGVPRVNMDPFEY-CYNWT-G 391
Query: 356 VDEGFPNVTFHFENSVSL 373
V P + F + +L
Sbjct: 392 VSAEIPRMELRFAGAATL 409
>gi|88174567|gb|ABD39358.1| chloroplast nucleoid DNA-binding protein [Oryza glumipatula]
Length = 323
Score = 94.7 bits (234), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 93/347 (26%), Positives = 147/347 (42%), Gaps = 64/347 (18%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKEC---PRRSSLGIELTLYDIKDSSTGK 132
Y +G+GTP K V++DTGS WV C +C C PR T + ++ K
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFC-ECDGCHTNPR--------TFLQSRSTTCAK 51
Query: 133 FVTCDQEFCHGVYGGPLTDCTANTS---CPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQ 189
V+C C + GG C + + CP+ Y DGS++ G QD + + D+Q
Sbjct: 52 -VSCGTSMC--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF----SDVQ 104
Query: 190 TTSTNGSLIFGC-----GARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMF 244
FGC GA + GN +DG++G G S++ Q S F
Sbjct: 105 KIP---GFTFGCNMDSFGANEFGN--------VDGLLGMGAGQMSVLKQ---SSPTFDGF 150
Query: 245 AHCL------DGI--NGGGIFAIG---HVVQPEVNKTPLVP---NQPHYSINMTAVQVGL 290
++CL G G F++G + +V T +V N + +++TA+ V
Sbjct: 151 SYCLPLQMSERGFFSKTTGYFSLGGKIAATRTDVRYTKMVARRKNTELFFVDLTAISVDG 210
Query: 291 DFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCF 350
+ L L +F KG + DSG+ L+Y+P+ L +I E C+
Sbjct: 211 ERLGLSPSIF---SRKGVVFDSGSELSYIPDRALSVLSQRIRELLLRRGAAEEESERNCY 267
Query: 351 QYSESVDEG-FPNVTFHFENSVSLKVYPH----EYLFPFEDLWCIGW 392
SVDEG P ++ HF++ + H E +D+WC+ +
Sbjct: 268 DM-RSVDEGDMPAISLHFDDGARFDLGSHGVFVERSVQEQDVWCLAF 313
>gi|226491934|ref|NP_001140743.1| uncharacterized protein LOC100272818 [Zea mays]
gi|194700872|gb|ACF84520.1| unknown [Zea mays]
Length = 351
Score = 94.7 bits (234), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 81/305 (26%), Positives = 127/305 (41%), Gaps = 41/305 (13%)
Query: 91 VQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLT 150
V +D+ SD+ WV C+ C P + + YD S T +C C + GP
Sbjct: 31 VVLDSASDVPWVQCVPCPIPPCHPQVD---SFYDPSRSPTSAAFSCSSPTCTAL--GPYA 85
Query: 151 DCTANTSCPYLEIYGDGSSTTGYFVQDVVQYD---KVSGDLQTTSTNGSLIFGCGARQSG 207
+ AN C YL Y DGSST+G ++ D++ D VSG FGC + G
Sbjct: 86 NGCANNQCQYLVRYPDGSSTSGAYIADLLTLDAGNAVSG----------FKFGCSHAEQG 135
Query: 208 NLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI-NGGGIFAIG------ 260
+ D+ GI+ G S++SQ AS G F++C+ + G F +G
Sbjct: 136 SFDARAA----GIMALGGGPESLLSQTASRYG--NAFSYCIPATASDSGFFTLGVPRRAS 189
Query: 261 --HVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAY 318
+VV P V Y + + + VG L + VF G+++DS T +
Sbjct: 190 SRYVVTPMVR---FRQAATFYGVLLRTITVGGQRLGVAPAVFAA----GSVLDSRTAITR 242
Query: 319 LPEMVYEPLVSKIISQQPDLKVHTVHDEY-TCFQYSESVDEGFPNVTFHFENSVSLKVYP 377
LP Y+ L + S + TC+ ++ V+ P ++ F+ + L + P
Sbjct: 243 LPPTAYQALRAAFRSSMTMYRSAPPKGYLDTCYDFTGVVNIRLPKISLVFDRNAVLPLDP 302
Query: 378 HEYLF 382
LF
Sbjct: 303 SGILF 307
>gi|125527523|gb|EAY75637.1| hypothetical protein OsI_03542 [Oryza sativa Indica Group]
Length = 446
Score = 94.7 bits (234), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 80/329 (24%), Positives = 133/329 (40%), Gaps = 36/329 (10%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKF 133
G Y+A +G+GTP + +DTGSD++W+ C C+ C ++D + SST +
Sbjct: 84 GEYFALVGVGTPSTKAMLVIDTGSDLVWLQCSPCRRC-----YAQRGQVFDPRRSSTYRR 138
Query: 134 VTCDQEFCHGV-YGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTS 192
V C C + + G + A C Y+ YGDGSS+TG D + + T
Sbjct: 139 VPCSSPQCRALRFPGCDSGGAAGGGCRYMVAYGDGSSSTGELATDKLAF------ANDTY 192
Query: 193 TNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGIN 252
N ++ GCG G DS G++G + S+ +Q+A + G +F +CL
Sbjct: 193 VN-NVTLGCGRDNEGLFDSAA-----GLLGVARGKISISTQVAPAYG--SVFEYCLGDRT 244
Query: 253 G----GGIFAIGHVVQPEVNK-TPLV--PNQPH-YSINMTAVQVGLD----FLNLPTDVF 300
G +P T L+ P +P Y ++M VG + F N +
Sbjct: 245 SRSTRSSYLVFGRTPEPPSTAFTALLSNPRRPSLYYVDMAGFSVGGERVTGFSNASLALD 304
Query: 301 GVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYT----CFQYSESV 356
G ++DSGT ++ Y L ++ + + E++ C+
Sbjct: 305 TATGRGGVVVDSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHSVFDACYDLRGRP 364
Query: 357 DEGFPNVTFHFENSVSLKVYPHEYLFPFE 385
P + HF + + P Y P +
Sbjct: 365 AASAPLIVLHFAGGADMALPPENYFLPVD 393
>gi|302768196|ref|XP_002967518.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
gi|300165509|gb|EFJ32117.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
Length = 398
Score = 94.7 bits (234), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 90/337 (26%), Positives = 144/337 (42%), Gaps = 48/337 (14%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y I +GTP K + V DTGSD++W+ C C+ C + ++D + SS+
Sbjct: 36 GGGDYVTTISLGTPAKVFSVIADTGSDLIWIQCKPCQACFNQKD-----PIFDPEGSSSY 90
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
++C C + P C+ + C Y YGDGS T G + V G+ +
Sbjct: 91 TTMSCGDTLCDSL---PRKSCSPD--CDYSYGYGDGSGTRGTLSSETVTLTSTQGE-KLA 144
Query: 192 STNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL--- 248
+ N + FGCG G+ + + G++G G+ N S +SQL G + F++CL
Sbjct: 145 AKN--IAFGCGHLNRGSFNDAS-----GLVGLGRGNLSFVSQLGDLFGHK--FSYCLVPW 195
Query: 249 -DGINGGGIFAIG-----HVVQPEVNK--TPLVPN---QPHYSINMTAVQVGLDFLNLPT 297
D + G H +++ TP++ N + Y + + + + L +P
Sbjct: 196 RDAPSKTSPMFFGDESSSHSSGKKLHYAFTPMIHNPAMESFYYVKLKDISIAGRALRIPA 255
Query: 298 DVFGVGDN--KGTIIDSGTTLAYLPEMVYE----PLVSKIISQQPDLKVHTVHDEYTCFQ 351
F + + G I DSGTTL LP+ Y+ L SKI + D + Y
Sbjct: 256 GSFDIKPDGSGGMIFDSGTTLTLLPDAPYQIVLRALRSKISFPKIDGSSAGLDLCYDVSG 315
Query: 352 YSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFEDLW 388
S P + FHFE + +Y P E+ +
Sbjct: 316 SKASYKMKIPAMVFHFEGA--------DYQLPVENYF 344
>gi|116790042|gb|ABK25480.1| unknown [Picea sitchensis]
Length = 460
Score = 94.7 bits (234), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 94/312 (30%), Positives = 143/312 (45%), Gaps = 39/312 (12%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G + K+ IGTP + +DTGSD+ W C C +C + + +YD SST
Sbjct: 111 GNGEFLMKMAIGTPSLSFSAILDTGSDLTWTQCKPCTDCYPQPT-----PIYDPSQSSTY 165
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
V C C + P+ C+ +C YL YGD SST G ++ Y+ + T+
Sbjct: 166 SKVPCSSSMCQAL---PMYSCSG-ANCEYLYSYGDQSSTQG-----ILSYESFT---LTS 213
Query: 192 STNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL--- 248
+ + FGCG + G++GFG+ S+ISQL S G + F++CL
Sbjct: 214 QSLPHIAFGCGQEN----EGGGFSQGGGLVGFGRGPLSLISQLGQSLGNK--FSYCLVSI 267
Query: 249 -DGINGGGIFAIGHVVQ---PEVNKTPLVPNQPH---YSINMTAVQVGLDFLNLPTDVFG 301
D + IG V+ TPLV ++ Y +++ + VG L++ F
Sbjct: 268 TDSPSKTSPLFIGKTASLNAKTVSSTPLVQSRSRPTFYYLSLEGISVGGQLLDIADGTFD 327
Query: 302 --VGDNKGTIIDSGTTLAYLPEMVYEPLVSKIIS--QQPDLKVHTVHDEYTCFQ-YSESV 356
+ G IIDSGTT+ YL + Y+ + +IS P + + + CF+ S S
Sbjct: 328 LQLDGTGGVIIDSGTTVTYLEQSGYDVVKKAVISSINLPQVDGSNIGLDL-CFEPQSGSS 386
Query: 357 DEGFPNVTFHFE 368
FP +TFHFE
Sbjct: 387 TSHFPTITFHFE 398
>gi|88174569|gb|ABD39359.1| chloroplast nucleoid DNA-binding protein [Oryza nivara]
Length = 321
Score = 94.7 bits (234), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 95/343 (27%), Positives = 148/343 (43%), Gaps = 58/343 (16%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKEC---PRRSSLGIELTLYDIKDSSTGK 132
Y +G+GTP K V++DTGS WV C +C C PR T + ++ K
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTTWVFC-ECDGCHTNPR--------TFLQSRSTTCAK 51
Query: 133 FVTCDQEFCHGVYGGPLTDCTANTS---CPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQ 189
V+C C + GG C + + CP+ Y DGS++ G QD + + D+Q
Sbjct: 52 -VSCGTSMC--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF----SDVQ 104
Query: 190 TTSTNGSLIFGCGARQSGNLDS--TNEEA-LDGIIGFGKSNSSMISQLASSGGVRKMFAH 246
S FGC NLDS NE +DG++G G S++ Q S F++
Sbjct: 105 KIP---SFTFGC------NLDSFGANEFGNVDGLLGMGAGPMSVLKQ---SSPTFDGFSY 152
Query: 247 CL------DGI--NGGGIFAIGHVV-QPEVNKTPLVP---NQPHYSINMTAVQVGLDFLN 294
CL G G F++G V + +V T +V N + +++ A+ V + L
Sbjct: 153 CLPLQKSERGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLG 212
Query: 295 LPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSE 354
L +F KG + DSG+ L+Y+P+ L +I E C+
Sbjct: 213 LSPSIFS---RKGVVFDSGSELSYIPDRALSVLSQRIRELLLRRGAAEEESERNCYDM-R 268
Query: 355 SVDEG-FPNVTFHFENSVSLKVYPH----EYLFPFEDLWCIGW 392
SVDEG P ++ HF++ + E +D+WC+ +
Sbjct: 269 SVDEGDMPAISLHFDDGARFDLGSRGVFVERSVQEQDVWCLAF 311
>gi|356509401|ref|XP_003523438.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 407
Score = 94.7 bits (234), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 78/264 (29%), Positives = 116/264 (43%), Gaps = 25/264 (9%)
Query: 73 VGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNC-IQCKEC--PRRSSLGIELTLYDIKDSS 129
+G Y + IG PPK Y + +DTGSD+ WV C CK C PR D +
Sbjct: 45 LGYYSVNLAIGNPPKAYELDIDTGSDLTWVQCDAPCKGCTLPR-----------DRQYKP 93
Query: 130 TGKFVTCDQEFCHGVYGGPLTDCT-ANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDL 188
G V C C + P C N C Y Y D S+ G V+D++ +G L
Sbjct: 94 HGNLVKCVDPLCAAIQSAPNPPCVNPNEQCDYEVEYADQGSSLGVLVRDIIPLKLTNGTL 153
Query: 189 QTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL 248
T+ L FGCG Q+ ++ + G++G G +S++SQL S G +R + HCL
Sbjct: 154 ----THSMLAFGCGYDQT-HVGHNPPPSAAGVLGLGNGRASILSQLNSKGLIRNVVGHCL 208
Query: 249 DGINGGGIFAIGHVV-QPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKG 307
G GG +F ++ Q V TP++ + + + F T V G+
Sbjct: 209 SGTGGGFLFFGDQLIPQSGVVWTPILQSSSSLLKHYKTGPADMFFNGKATSVKGL----E 264
Query: 308 TIIDSGTTLAYLPEMVYEPLVSKI 331
DSG++ Y + ++ LV I
Sbjct: 265 LTFDSGSSYTYFNSLAHKALVDLI 288
>gi|413918484|gb|AFW58416.1| hypothetical protein ZEAMMB73_998053 [Zea mays]
Length = 475
Score = 94.7 bits (234), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 89/338 (26%), Positives = 141/338 (41%), Gaps = 53/338 (15%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G + + +GTP Y VDTGSD++W C C EC +++ ++D SST
Sbjct: 112 GNGEFLMDLSVGTPALPYAAIVDTGSDLVWTQCKPCVECFNQTT-----PVFDPAASSTY 166
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTSCP----YLEIYGDGSSTTGYFVQD--VVQYDKVS 185
+ C C + ++++S Y YGD SST G + + KV
Sbjct: 167 AALPCSSALCADLPTSTCASSSSSSSASSPCGYTYTYGDASSTQGVLATETFTLARQKVP 226
Query: 186 GDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFA 245
G + FGCG G D + A G++G G+ S++SQL G+ + F+
Sbjct: 227 G----------VAFGCGDTNEG--DGFTQGA--GLVGLGRGPLSLVSQL----GIDR-FS 267
Query: 246 HCLDGINGGG----------IFAIGHVVQPEVNKTPLV--PNQPH-YSINMTAVQVGLDF 292
+CL ++ TPLV P+QP Y +++T + VG
Sbjct: 268 YCLTSLDDAAGRSPLLLGSAAGISASAATAPAQTTPLVKNPSQPSFYYVSLTGLTVGSTR 327
Query: 293 LNLPTDVFGVGDN--KGTIIDSGTTLAYLPEMVYEPLVSKIISQQ--PDLKVHTVHDEYT 348
L LP+ F + D+ G I+DSGT++ YL Y L ++ P + + +
Sbjct: 328 LALPSSAFAIQDDGTGGVIVDSGTSITYLELRAYRALRKAFVAHMSLPTVDASEIGLDL- 386
Query: 349 CFQ-----YSESVDEGFPNVTFHFENSVSLKVYPHEYL 381
CFQ + V P + HF+ L + Y+
Sbjct: 387 CFQGPAGAVDQDVQVQVPKLVLHFDGGADLDLPAENYM 424
>gi|357490961|ref|XP_003615768.1| F-box protein [Medicago truncatula]
gi|355517103|gb|AES98726.1| F-box protein [Medicago truncatula]
Length = 688
Score = 94.7 bits (234), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 78/234 (33%), Positives = 114/234 (48%), Gaps = 38/234 (16%)
Query: 63 PLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDT---GSDIMWVNCIQCKECPRRSSLGIE 119
P+G S D + K G G D Q+ G + V I C CP+ S L IE
Sbjct: 317 PIGAGSNGD----IFFKAGDGKLVFDLRTQMIEKLDGVEKFRVFSISCNGCPQTSRLQIE 372
Query: 120 LTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTS---CPYLEIYGDGSSTTGYFVQ 176
C+ G L+D T ++ C Y YGDGS T+GY+V
Sbjct: 373 ----------------CNS-------GIQLSDATCSSQTKQCSYTFQYGDGSGTSGYYVS 409
Query: 177 DVVQYDKV-SGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLA 235
D + D + G ++ S + C QSG+L + ++ A+DGI GF + S+ISQL+
Sbjct: 410 DTMHLDTIFEGSDYKFFSSCSFLGDCSNEQSGDL-TKSDRAVDGIFGFWQQQMSVISQLS 468
Query: 236 SSGGVRKMFAHCLDG-INGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQV 288
S G +F+HCL G +GGGI +G +V+P + TP+VP++ S+N A+QV
Sbjct: 469 SQGIASGVFSHCLRGDSSGGGIPVLGEIVEPNIVYTPIVPSR--ISVNGQALQV 520
>gi|242045118|ref|XP_002460430.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
gi|241923807|gb|EER96951.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
Length = 488
Score = 94.7 bits (234), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 103/406 (25%), Positives = 161/406 (39%), Gaps = 38/406 (9%)
Query: 5 LRNCLCIVLIATAAVGG---VSSNHGVFS-VKYRYAGRERSLSLLKEHDARR---QQRIL 57
L N +C AA V HG S ++ R +G +L+ R ++++
Sbjct: 55 LPNTVCTSTKGPAAAPSSLTVVHRHGPCSPLRSRGSGAPSHTEILRRDQDRVDAIRRKVT 114
Query: 58 AGVDLPLGGSSRPDGVGL------YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECP 111
A + P GG S G Y A + +GTP + V++DTGSD WV C C +C
Sbjct: 115 ASSNKPKGGVSLLANWGKSLSTTNYVASLRLGTPATELVVELDTGSDQSWVQCKPCADCY 174
Query: 112 RRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGV---YGGPLTDCTANTSCPYLEIYGDGS 168
+ ++D SST V C C + N +CPY Y D S
Sbjct: 175 EQRD-----PVFDPTASSTYSAVPCGARECQELASSSSSRNCSSDNNKNCPYEVSYDDDS 229
Query: 169 STTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNS 228
T G +D + + G +FGCG +G +DG++G G +
Sbjct: 230 HTVGDLARDTLTLSPSPSPSPADTVPG-FVFGCGHSNAGTFGE-----VDGLLGLGLGKA 283
Query: 229 SMISQLASSGGVRKMFAHCL-DGINGGGIFAI-GHVVQPEVNKTPLVPNQ--PHYSINMT 284
S+ SQ+A+ G F++CL + G + G + T +V Q Y +N+T
Sbjct: 284 SLPSQVAARYGA--AFSYCLPSSPSAAGYLSFGGAAARANAQFTEMVTGQDPTSYYLNLT 341
Query: 285 AVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVH 344
+ V + +P F GTIIDSGT + LP Y L S S +
Sbjct: 342 GIVVAGRAIKVPASAFATA--AGTIIDSGTAFSRLPPSAYAALRSSFRSAMGRYRYKRAP 399
Query: 345 DEY---TCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFEDL 387
TC+ ++ P V F + ++ ++P L+ + D+
Sbjct: 400 SSPIFDTCYDFTGHETVRIPAVELVFADGATVHLHPSGVLYTWNDV 445
>gi|88174573|gb|ABD39361.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
Group]
Length = 321
Score = 94.7 bits (234), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 95/343 (27%), Positives = 148/343 (43%), Gaps = 58/343 (16%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKEC---PRRSSLGIELTLYDIKDSSTGK 132
Y +G+GTP K V++DTGS WV C +C C PR T + ++ K
Sbjct: 1 YVTSVGLGTPSKTQIVEIDTGSSTSWVFC-ECDGCHTNPR--------TFLQSRSTTCAK 51
Query: 133 FVTCDQEFCHGVYGGPLTDCTANTS---CPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQ 189
V+C C + GG C + + CP+ Y DGS++ G QD + + D+Q
Sbjct: 52 -VSCGTSMC--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF----SDVQ 104
Query: 190 TTSTNGSLIFGCGARQSGNLDS--TNEEA-LDGIIGFGKSNSSMISQLASSGGVRKMFAH 246
S FGC NLDS NE +DG++G G S++ Q S F++
Sbjct: 105 KIP---SFTFGC------NLDSFGANEFGNVDGLLGMGAGPMSVLKQ---SSPTFDGFSY 152
Query: 247 CL------DGI--NGGGIFAIGHVV-QPEVNKTPLVP---NQPHYSINMTAVQVGLDFLN 294
CL G G F++G V + +V T +V N + +++ A+ V + L
Sbjct: 153 CLPLQKSERGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLG 212
Query: 295 LPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSE 354
L +F KG + DSG+ L+Y+P+ L +I E C+
Sbjct: 213 LSPSIFS---RKGVVFDSGSELSYIPDRALSVLSQRIRELLLRRGAAEEESERNCYDM-R 268
Query: 355 SVDEG-FPNVTFHFENSVSLKVYPH----EYLFPFEDLWCIGW 392
SVDEG P ++ HF++ + E +D+WC+ +
Sbjct: 269 SVDEGDMPAISLHFDDGARFDLGSRGVFVERSVQEQDVWCLAF 311
>gi|88174565|gb|ABD39357.1| chloroplast nucleoid DNA-binding protein [Oryza glumipatula]
Length = 323
Score = 94.7 bits (234), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 93/347 (26%), Positives = 147/347 (42%), Gaps = 64/347 (18%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKEC---PRRSSLGIELTLYDIKDSSTGK 132
Y +G+GTP K V++DTGS WV C +C C PR T + ++ K
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSTSWVFC-ECDGCHTNPR--------TFLQSRSTTCAK 51
Query: 133 FVTCDQEFCHGVYGGPLTDCTANTS---CPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQ 189
V+C C + GG C + + CP+ Y DGS++ G QD + + D+Q
Sbjct: 52 -VSCGTSMC--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF----SDVQ 104
Query: 190 TTSTNGSLIFGC-----GARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMF 244
FGC GA + GN +DG++G G S++ Q S F
Sbjct: 105 KIP---GFTFGCNMDSFGANEFGN--------VDGLLGMGAGQMSVLKQ---SSPTFDGF 150
Query: 245 AHCL------DGI--NGGGIFAIG---HVVQPEVNKTPLVP---NQPHYSINMTAVQVGL 290
++CL G G F++G + +V T +V N + +++TA+ V
Sbjct: 151 SYCLPLQMSERGFFSKTTGYFSLGGKIAATRTDVRYTKMVARRKNTELFFVDLTAISVDG 210
Query: 291 DFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCF 350
+ L L +F KG + DSG+ L+Y+P+ L +I E C+
Sbjct: 211 ERLGLSPSIFS---RKGVVFDSGSELSYIPDRALSVLSQRIRELLLRRGAAEEESERNCY 267
Query: 351 QYSESVDEG-FPNVTFHFENSVSLKVYPH----EYLFPFEDLWCIGW 392
SVDEG P ++ HF++ + H E +D+WC+ +
Sbjct: 268 DM-RSVDEGDMPAISLHFDDGARFDLGRHGVFVERSVQEQDVWCLAF 313
>gi|242041575|ref|XP_002468182.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
gi|241922036|gb|EER95180.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
Length = 490
Score = 94.7 bits (234), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 93/355 (26%), Positives = 152/355 (42%), Gaps = 46/355 (12%)
Query: 68 SRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKD 127
SR G Y AKI +GTP + + +DT SD+ W+ C C+ C +S ++D +
Sbjct: 130 SRAPTSGEYIAKIAVGTPGVEALLALDTASDLTWLQCQPCRRCYPQSG-----PVFDPRH 184
Query: 128 SSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGD 187
S++ + ++ + C + D T C Y YGDGS+T G F+++ + + +G
Sbjct: 185 STSYREMSFNAADCQALGRSGGGDAKRGT-CVYTVGYGDGSTTVGDFIEETLTF---AGG 240
Query: 188 LQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHC 247
++ + GCG G + GI+G G+ S +Q+ +G F++C
Sbjct: 241 VRLPRIS----IGCGHDNKGLFGAPAA----GILGLGRGLMSFPNQIDHNG----TFSYC 288
Query: 248 L-DGINGGG------IFAIGHV-VQPEVNKTPLVPN---QPHYSINMTAVQV------GL 290
L D ++G G F G V P V+ TP V N Y + +T + V G+
Sbjct: 289 LVDFLSGPGSLSSTLTFGAGAVDTSPPVSFTPTVLNLNMPTFYYVRLTGISVGGVRVPGV 348
Query: 291 DFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEY--- 347
+L D + G I+DSGT + L Y + DL ++
Sbjct: 349 TERDLQLDPY--TGRGGVIVDSGTAVTRLARPAYTAFRDAFRAVAVDLGQVSIGGPSGFF 406
Query: 348 -TCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFEDLW--CIGWQNSGMQS 399
TC+ + P V+ HF SV +K+ P YL P + + C + +G S
Sbjct: 407 DTCYTVGGRGMKKVPTVSMHFAGSVEVKLQPKNYLIPVDSMGTVCFAFAATGDHS 461
>gi|302756591|ref|XP_002961719.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
gi|300170378|gb|EFJ36979.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
Length = 357
Score = 94.4 bits (233), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 91/342 (26%), Positives = 144/342 (42%), Gaps = 39/342 (11%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y+A++GIG P + YY+++DTGSD+ W+ C C C + +YD +SS+
Sbjct: 8 GSGEYFARMGIGNPQRSYYLELDTGSDVTWIQCAPCSSCYSQVD-----PIYDPSNSSSY 62
Query: 132 KFVTCDQEFCHGVYGGPLTDCTA--NTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQ 189
+ V C C + D +A C Y +YGD S+++G D+ G
Sbjct: 63 RRVYCGSALCQAL------DYSACQGMGCSYRVVYGDSSASSG----DLGIESFYLGPNS 112
Query: 190 TTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL- 248
+T+ ++ FGCG SG G S SQ+A+S G F++CL
Sbjct: 113 STAMR-NIAFGCGHSNSGLFRGEAGLLGM-----GGGTLSFFSQIAASIG--PAFSYCLV 164
Query: 249 ----DGINGGGIFAIGHVVQPEVNK-TPLVPN---QPHYSINMTAVQVGLDFLNLPTDVF 300
+ G P + TPL+ N Y +T + VG L +P F
Sbjct: 165 DRYSQLQSRSSPLIFGRTAIPFAARFTPLLKNPRINTFYYAVLTGISVGGTPLPIPPAQF 224
Query: 301 GVGDN--KGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDL-KVHTVHDEYTCFQYSESVD 357
+ N G I+DSGT++ + Y L + +L V+ TCF +
Sbjct: 225 ALTGNGTGGAILDSGTSVTRVVPPAYAVLRDAYRAASRNLPPAPGVYLLDTCFNFQGLPT 284
Query: 358 EGFPNVTFHFENSVSLKVYPHEYLFPFED--LWCIGWQNSGM 397
P++ HF+N V + + L P + +C+ + S M
Sbjct: 285 VQIPSLVLHFDNGVDMVLPGGNILIPVDRSGTFCLAFAPSSM 326
>gi|302758122|ref|XP_002962484.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
gi|300169345|gb|EFJ35947.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
Length = 395
Score = 94.4 bits (233), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 86/345 (24%), Positives = 154/345 (44%), Gaps = 31/345 (8%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y+ ++ +GTP K + + +DTGSD+ W+ C SS YD SS+
Sbjct: 23 GSGQYFVELRVGTPAKKFPLIIDTGSDLTWIQCNPPNTTANSSS--PPAPWYDKSSSSSY 80
Query: 132 KFVTCDQEFCHGVYGGPLTDCT--ANTSCPYLEIYGDGSSTTGYFVQDVVQYD------K 183
+ + C + C + + C+ + + C Y Y D S TTG + + K
Sbjct: 81 REIPCTDDECLFLPAPIGSSCSIKSPSPCDYTYGYSDQSRTTGILAYETISMKSRKRSGK 140
Query: 184 VSGDLQTTSTN-GSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASS--GGV 240
+G+ +T + ++ GC G + G++G G+ S+ +Q + GG+
Sbjct: 141 RAGNHKTRTIRIKNVALGCSRESVG----ASFLGASGVLGLGQGPISLATQTRHTALGGI 196
Query: 241 RKMFAHC----LDGINGGGIFAIGHVVQPEVNKTPLVPN---QPHYSINMTAVQV-GLDF 292
F++C L G N +G ++ TP+V N Q Y +N+T V V G
Sbjct: 197 ---FSYCLVDYLRGSNASSFLVMGRTRWRKLAHTPIVRNPAAQSFYYVNVTGVAVDGKPV 253
Query: 293 LNLPTDVFGV-GD-NKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCF 350
+ + +G+ GD NKGTI DSGTTL+YL E Y ++ + + + + + +
Sbjct: 254 DGIASSDWGIDGDGNKGTIFDSGTTLSYLREPAYSKVLGALNASIYLPRAQEIPEGFELC 313
Query: 351 QYSESVDEGFPNVTFHFENSVSLKVYPHEYL-FPFEDLWCIGWQN 394
+++G P + F+ +++ + Y+ E++ C+ Q
Sbjct: 314 YNVTRMEKGMPKLGVEFQGGAVMELPWNNYMVLVAENVQCVALQK 358
>gi|242092892|ref|XP_002436936.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
gi|241915159|gb|EER88303.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
Length = 469
Score = 94.4 bits (233), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 102/361 (28%), Positives = 154/361 (42%), Gaps = 43/361 (11%)
Query: 38 RERSLSLLKEHDARRQ--------QRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDY 89
R +L+ ARR +RI GV +P + D + Y +G GTP
Sbjct: 77 RPSPAEMLRRDRARRNHILRKASGRRITLGVSIPTSLGAFVDSL-QYVVTLGFGTPAVPQ 135
Query: 90 YVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGV----Y 145
+ +DTGSD+ WV QC+ C + + ++D SST V C E C + Y
Sbjct: 136 VLLIDTGSDLSWV---QCQPCNSSTCYPQKDPVFDPSASSTYAPVPCGSEACRDLDPDSY 192
Query: 146 GGPLTDCTANTS-CPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGAR 204
T+ ++ S C Y YG+G +T G + + + +S + T N S FGCG
Sbjct: 193 ANGCTNSSSGASLCQYGIQYGNGDTTVGVYSTETL---TLSPEAATVVNNFS--FGCGLV 247
Query: 205 QSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGING-GGIFAIGHVV 263
Q G D + G + S++SQ ++G F++CL N G A+G
Sbjct: 248 QKGVFDLFDGLLGL-----GGAPESLVSQ--TTGTYGGAFSYCLPAGNSTAGFLALGAPA 300
Query: 264 QPEVNK-----TPL-VPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLA 317
N TPL V Y + +T + VG L++ VF G IIDSGT +
Sbjct: 301 TGGNNTAGFQFTPLQVVETTFYLVKLTGISVGGKQLDIEPTVFA----GGMIIDSGTIVT 356
Query: 318 YLPEMVYEPLVSKI---ISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLK 374
LPE Y L + +S P L + D TC+ ++ + + P V FE V++
Sbjct: 357 GLPETAYSALRTAFRSAMSAYPLLPPNDDEDLDTCYDFTGNTNVTVPTVALTFEGGVTID 416
Query: 375 V 375
+
Sbjct: 417 L 417
>gi|225455900|ref|XP_002275943.1| PREDICTED: aspartic proteinase Asp1-like [Vitis vinifera]
Length = 686
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 82/281 (29%), Positives = 122/281 (43%), Gaps = 34/281 (12%)
Query: 62 LPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNC-IQCKECPRRSSLGIEL 120
P+ G P+G LY+ I +G+PP+ Y++ +DTGSD+ W+ C C C + +
Sbjct: 302 FPVRGDVYPNG--LYFTHIFVGSPPRRYFLDMDTGSDLTWIQCDAPCTSCAKGPN----- 354
Query: 121 TLYDIKDSSTGKFVTCDQEFCHGVYGGPLTD-CTANTSCPYLEIYGDGSSTTGYFVQDVV 179
LY K G V C V T C C Y Y D SS+ G D
Sbjct: 355 PLYKPKK---GNLVPLKDSLCVEVQRNLKTGYCETCEQCDYEIEYADHSSSMGVLASD-- 409
Query: 180 QYDKVSGDLQTTSTNGSL-----IFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQL 234
DL NGSL +FGC Q G L ++ + DGI+G K+ S+ SQL
Sbjct: 410 -------DLHLMLANGSLTKLGIMFGCAYDQQGLLLNSLAKT-DGILGLSKAKVSLPSQL 461
Query: 235 ASSGGVRKMFAHCLDG-INGGGIFAIGHVVQPE--VNKTPLV-PNQPHYSINMTAVQVGL 290
AS + + HCL GGG +G P + P++ + P+Y + + G
Sbjct: 462 ASQRIINNVLGHCLTSDATGGGYMFLGDDFVPYWGMAWVPMLNSHSPNYHSQIMKISHGS 521
Query: 291 DFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKI 331
L+L G + + D+G++ Y P+ Y LV+ +
Sbjct: 522 RQLSLGRQ---DGRTERVVFDTGSSYTYFPKEAYYALVASL 559
>gi|302822373|ref|XP_002992845.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
gi|300139393|gb|EFJ06135.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
Length = 510
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 86/349 (24%), Positives = 153/349 (43%), Gaps = 40/349 (11%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
YY + +GTP + + +DTGSD+ W+ C+ CK+C + ++ + SS+ +
Sbjct: 138 YYVPLQLGTPAVEVVLIMDTGSDVSWIQCVPCKDC-----VPALRPPFNPRHSSSFFKLP 192
Query: 136 CDQEFCHGVYGGPLTDCTAN-TSCPYLEIYGDGSSTTGYFVQDVVQYDKVS-GDLQTTST 193
C C VY G C+ + +C + YGDGS ++G + + + + GD +
Sbjct: 193 CASSTCTNVYQGVKPFCSPSGRTCLFSIQYGDGSLSSGLLAMETIAGNTPNFGDGEPVKL 252
Query: 194 NGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHC----LD 249
+ ++ GC L + G++G + S SQL+S + F+HC +
Sbjct: 253 S-NITLGCADIDREGLPT----GASGLLGMDRRPISFPSQLSSRYA--RKFSHCFPDKIA 305
Query: 250 GINGGGIFAIGH--VVQPEVNKTPLVPNQPHYSINMTAVQVGL-----DFLNLPT----- 297
+N G+ G ++ P + TPLV N S ++ VGL D LP
Sbjct: 306 HLNSSGLVFFGESDIISPYLRYTPLVQNPAVPSASLDYYYVGLVGISVDESRLPLSHKNF 365
Query: 298 DVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYT-CFQYSESV 356
D+ V + GTIIDSGT YL + ++ + + +++ L + +T C+ +
Sbjct: 366 DIDKVTGSGGTIIDSGTAFTYLKKPAFQAMRREFLARTSHLAKVDDNSGFTPCYNITSGT 425
Query: 357 ----DEGFPNVTFHFENSVSLKVYPHEYLFPF-----EDLWCIGWQNSG 396
P++T HF + + + + L P + C+ +Q SG
Sbjct: 426 AALESTILPSITLHFRGGLDVVLPKNSILIPVSSSEEQTTLCLAFQMSG 474
>gi|356502456|ref|XP_003520035.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 484
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 89/334 (26%), Positives = 140/334 (41%), Gaps = 43/334 (12%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y+ ++GIG PP YV +DTGSD+ W+ C C EC ++S ++D S++
Sbjct: 145 GSGEYFLRVGIGKPPSQAYVVLDTGSDVSWIQCAPCSECYQQSD-----PIFDPVSSNSY 199
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
+ CD C + L++C N +C Y YGDGS T G F + V T
Sbjct: 200 SPIRCDAPQCKSL---DLSECR-NGTCLYEVSYGDGSYTVGEFATETVTLG--------T 247
Query: 192 STNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKM-FAHCLDG 250
+ ++ GCG N E L +G +L+ V F++CL
Sbjct: 248 AAVENVAIGCGH---------NNEGL--FVGAAGLLGLGGGKLSFPAQVNATSFSYCLVN 296
Query: 251 INGGGIFAI-------GHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGV- 302
+ + + +VV + + P + Y + + + VG + L +P +F V
Sbjct: 297 RDSDAVSTLEFNSPLPRNVVTAPLRRNPEL--DTFYYLGLKGISVGGEALPIPESIFEVD 354
Query: 303 -GDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDL-KVHTVHDEYTCFQYSESVDEGF 360
G IIDSGT + L VY+ L + + K + V TC+ S
Sbjct: 355 AIGGGGIIIDSGTAVTRLRSEVYDALRDAFVKGAKGIPKANGVSLFDTCYDLSSRESVQV 414
Query: 361 PNVTFHFENSVSLKVYPHEYLFPFEDL--WCIGW 392
P V+FHF L + YL P + + +C +
Sbjct: 415 PTVSFHFPEGRELPLPARNYLIPVDSVGTFCFAF 448
>gi|88174581|gb|ABD39365.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
Group]
Length = 321
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 94/343 (27%), Positives = 149/343 (43%), Gaps = 58/343 (16%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKEC---PRRSSLGIELTLYDIKDSSTGK 132
Y +G+GTP K V++DTGS WV C +C C PR T + ++ K
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFC-ECDGCHTNPR--------TFLQSRSTTCAK 51
Query: 133 FVTCDQEFCHGVYGGPLTDCTANTS---CPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQ 189
V+C C + GG C + + CP+ Y DGS++ G QD + + D+Q
Sbjct: 52 -VSCGTSMC--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF----SDVQ 104
Query: 190 TTSTNGSLIFGCGARQSGNLDS--TNEEA-LDGIIGFGKSNSSMISQLASSGGVRKMFAH 246
S FGC NLDS NE +DG++G G S++ Q S F++
Sbjct: 105 KIP---SFTFGC------NLDSFGANEFGNVDGLLGMGAGPMSVLKQ---SSPRFDGFSY 152
Query: 247 CL------DGI--NGGGIFAIGHVV-QPEVNKTPLVPNQPH---YSINMTAVQVGLDFLN 294
CL G G F++G V + +V T +V + + + +++ A+ V + L
Sbjct: 153 CLPLQKSERGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLG 212
Query: 295 LPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSE 354
L +F KG + DSG+ L+Y+P+ L +I E C+
Sbjct: 213 LSPSIFS---RKGVVFDSGSELSYIPDRALSVLSQRIRELLLRRGAAEEESERNCYDM-R 268
Query: 355 SVDEG-FPNVTFHFENSVSLKVYPH----EYLFPFEDLWCIGW 392
SVDEG P ++ HF++ + E +D+WC+ +
Sbjct: 269 SVDEGDMPAISLHFDDGARFDLGRRGVFVERSVQEQDVWCLAF 311
>gi|356524289|ref|XP_003530762.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Glycine max]
Length = 392
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 86/341 (25%), Positives = 150/341 (43%), Gaps = 41/341 (12%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCK-ECPRRSSLGIELTLYDIKDSST 130
G Y +G+GTP +D + DTGSD+ W C C C ++ + ++D SS+
Sbjct: 42 GSANYVVVVGLGTPKRDLSLVFDTGSDLTWTQCEPCAGSCYKQ-----QDAIFDPSKSSS 96
Query: 131 GKFVTCDQEFCHGVYG-GPLTDCTANT--SCPYLEIYGDGSSTTGYFVQDVVQYDKVSGD 187
+TC C + G ++C+++T SC Y YGD S++ G+ Q+ +
Sbjct: 97 YTNITCTSSLCTQLTSDGIKSECSSSTDASCIYDAKYGDNSTSVGFLSQERLT------- 149
Query: 188 LQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHC 247
+ T +FGCG G + + G++G G+ S++ Q +S+ K+F++C
Sbjct: 150 ITATDIVDDFLFGCGQDNEGLFNGSA-----GLMGLGRHPISIVQQTSSN--YNKIFSYC 202
Query: 248 LDGIN---GGGIFAIGHVVQPEVNKTPLVP---NQPHYSINMTAVQVGLDFL-NLPTDVF 300
L + G F + TPL + Y +++ ++ VG L + + F
Sbjct: 203 LPATSSSLGHLTFGASAATNASLIYTPLSTISGDNSFYGLDIVSISVGGTKLPAVSSSTF 262
Query: 301 GVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEY----TCFQYSESV 356
G G+IIDSGT + L VY L S + ++ + V +E TC+ S
Sbjct: 263 SAG---GSIIDSGTVITRLAPTVYAALRSAF---RRXMEKYPVANEAGLLDTCYDLSGYK 316
Query: 357 DEGFPNVTFHFENSVSLKVYPHEYL-FPFEDLWCIGWQNSG 396
+ P + F F V++++ L E C+ + +G
Sbjct: 317 EISVPRIDFEFSGGVTVELXHRGILXVESEQQVCLAFAANG 357
>gi|298707682|emb|CBJ25999.1| aspartyl protease [Ectocarpus siliculosus]
Length = 547
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 85/320 (26%), Positives = 135/320 (42%), Gaps = 26/320 (8%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G ++A I GTPP+ V ++TGS C +C+ C + +D SST
Sbjct: 104 GYGTHFAYIYAGTPPQRASVIINTGSHFSAFPCSECRSCGNHTD-----PYWDPSQSSTA 158
Query: 132 KFVTCDQ-EFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQY-DKVSGDLQ 189
VTCD+ E CHG Y C ++ C E Y +GSS V D++ ++ D Q
Sbjct: 159 HIVTCDETERCHGAY-----KCQSDKKCVLREHYTEGSSWRAKQVDDLLWVGERTLSDSQ 213
Query: 190 TTSTNG---SLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGV-RKMFA 245
+ FGC +G + + DGI+G + ++I+QLA++G + + F+
Sbjct: 214 KHDDSAFSVDFTFGCIESLTGLFKT---QLADGIMGLNADSRTLITQLATAGKISERKFS 270
Query: 246 HCLDGING----GGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFG 301
C G GG + + E+ TP ++ +T V L+ +++ TD
Sbjct: 271 LCFSETGGTMVIGGYDPLLNKPGSEMQYTPSTGEISAPTVKVT--DVTLNGVSITTDASV 328
Query: 302 VGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFP 361
G I SGTT YLP V E + + +E+ C + E P
Sbjct: 329 FQKGTGIKIVSGTTNTYLPRAVAEGFSAAWEAATGSPYATCKMNEF-CMTRTTVELEALP 387
Query: 362 NVTFHFENSVSLKVYPHEYL 381
+ H + V + V P Y+
Sbjct: 388 VLMIHMDGGVEVNVRPEAYM 407
>gi|312283333|dbj|BAJ34532.1| unnamed protein product [Thellungiella halophila]
Length = 428
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 86/301 (28%), Positives = 130/301 (43%), Gaps = 39/301 (12%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y + IGTP + V +DT +D WV C C C L+D SS+ + +
Sbjct: 91 YIVRANIGTPAQPMLVALDTSNDAAWVPCSGCVGC-------ASSVLFDPSKSSSSRNLQ 143
Query: 136 CDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNG 195
CD C P CTA SC + YG GS+ QD + ++ D+ + T
Sbjct: 144 CDAPQCK---QAPNPTCTAGKSCGFNMTYG-GSTIEASLTQDTL---TLANDVIKSYT-- 194
Query: 196 SLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL---DGIN 252
FGC ++ +G G++G G+ S+ISQ + F++CL N
Sbjct: 195 ---FGCISKATG-----TSLPAQGLMGLGRGPLSLISQ--TQNLYMSTFSYCLPNSKSSN 244
Query: 253 GGGIFAIGHVVQP-EVNKTPLVPNQPH---YSINMTAVQVGLDFLNLPTD--VFGVGDNK 306
G +G QP + TPL+ N Y +N+ ++VG +++PT F
Sbjct: 245 FSGSLRLGPKYQPVRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDASTGA 304
Query: 307 GTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFH 366
GTI DSGT L E Y + ++ + + ++ TC YS SV +P+VTF
Sbjct: 305 GTIFDSGTVFTRLVEPAYVAVRNEFRRRIKNANATSLGGFDTC--YSGSVV--YPSVTFM 360
Query: 367 F 367
F
Sbjct: 361 F 361
>gi|125547728|gb|EAY93550.1| hypothetical protein OsI_15341 [Oryza sativa Indica Group]
Length = 418
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 98/365 (26%), Positives = 149/365 (40%), Gaps = 65/365 (17%)
Query: 38 RERSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGL--YYAKIGIGTPPKDYYVQVDT 95
+ R+ LL D + R P+ + DG Y + GTPP++ + +DT
Sbjct: 51 KARATHLLSAQDQSGRGR---SASAPVNPGAYDDGFPFTEYLVHLAAGTPPQEVQLTLDT 107
Query: 96 GSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYG-GPLTDCTA 154
GSDI W QCK CP + L L+D SS+ + C C G D T+
Sbjct: 108 GSDITWT---QCKRCPASACFNQTLPLFDPSASSSFASLPCSSPACETTPPCGGGNDATS 164
Query: 155 NTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNE 214
C Y YGDGS + G ++V + +G+ + + G L+FGCG G S NE
Sbjct: 165 R-PCNYSISYGDGSVSRGEIGREVFTFASGTGEGSSAAVPG-LVFGCGHANRGVFTS-NE 221
Query: 215 EALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHVVQPEVNKTPLVP 274
GI GFG+ + S+ SQL F+HC I G +KT
Sbjct: 222 T---GIAGFGRGSLSLPSQLKVGN-----FSHCFTTITG--------------SKT---- 255
Query: 275 NQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTI--------IDSGTTLAYLPEMVYEP 326
+AV +GL + P +G +G+ +SGT++ LP Y
Sbjct: 256 ---------SAVLLGLPGV-APPSASPLGRRRGSYRCRSTPRSSNSGTSITSLPPRTYRA 305
Query: 327 LVSKIISQQPDLKVHTVH----DEYTCFQYS-ESVDEGFPNVTFHFENSVSLKVYPHEYL 381
+ + +Q +K+ V D +TCF P + HFE + ++++ Y+
Sbjct: 306 VREEFAAQ---VKLPVVPGNATDPFTCFSAPLRGPKPDVPTMALHFEGA-TMRLPQENYV 361
Query: 382 FPFED 386
F D
Sbjct: 362 FEVVD 366
>gi|326517992|dbj|BAK07248.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 500
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 86/331 (25%), Positives = 137/331 (41%), Gaps = 35/331 (10%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y++++G+G P + Y+ +DTGSD+ W+ C C +C +S +YD S++
Sbjct: 159 GSGEYFSRVGVGRPARQLYMVLDTGSDVTWLQCQPCADCYAQSD-----PVYDPSVSTSY 213
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
V CD C + + T SC Y YGDGS T G F + + GD
Sbjct: 214 ATVGCDSPRCRDLDAAACRNSTG--SCLYEVAYGDGSYTVGDFATETLTL----GDSAPV 267
Query: 192 STNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI 251
S ++ GCG G G S SQ++++ F++CL
Sbjct: 268 S---NVAIGCGHDNEGLFVGAAGLLALGGGPL-----SFPSQISAT-----TFSYCLVDR 314
Query: 252 N--GGGIFAIGHVVQPEVNKTPLVPN---QPHYSINMTAVQVGLDFLNLPTDVFGVGD-- 304
+ G QP V PL+ + Y + ++ + VG + L++P+ F + D
Sbjct: 315 DSPSSSTLQFGDSEQPAVT-APLIRSPRTNTFYYVALSGISVGGEALSIPSSAFAMDDAG 373
Query: 305 NKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDL-KVHTVHDEYTCFQYSESVDEGFPNV 363
+ G I+DSGT + L Y L + L + V TC+ + P V
Sbjct: 374 SGGVIVDSGTAVTRLQSGAYGALREAFVQGTQSLPRASGVSLFDTCYDLAGRSSVQVPAV 433
Query: 364 TFHFENSVSLKVYPHEYLFPFEDL--WCIGW 392
FE LK+ YL P + +C+ +
Sbjct: 434 ALWFEGGGELKLPAKNYLIPVDAAGTYCLAF 464
>gi|225470916|ref|XP_002263964.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
gi|147788999|emb|CAN64659.1| hypothetical protein VITISV_009613 [Vitis vinifera]
Length = 489
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 87/341 (25%), Positives = 140/341 (41%), Gaps = 45/341 (13%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y+ ++G+GTPPK Y+ +DTGSD++W+ C C++C ++ ++D K S +
Sbjct: 143 GSGEYFTRLGVGTPPKYVYMVLDTGSDVVWIQCAPCRKCYSQTD-----PVFDPKKSGSF 197
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
++C C + C + SC Y YGDGS T G F + + +
Sbjct: 198 SSISCRSPLCLRLDS---PGCNSRQSCLYQVAYGDGSFTFGEFSTETLTFRG-------- 246
Query: 192 STNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVR--KMFAHCLD 249
+ + GCG + E L + G+R + F++CL
Sbjct: 247 TRVPKVALGCGH---------DNEGLFVGAAGLLGLGRGRLSFPTQTGLRFGRKFSYCLV 297
Query: 250 GINGGG-----IFAIGHVVQPEVNKTPLVPN---QPHYSINMTAVQV-GLDFLNLPTDVF 300
+ +F V + V TPL+ N Y + +T + V G + +F
Sbjct: 298 DRSASSKPSSVVFGQSAVSRTAVF-TPLITNPKLDTFYYLELTGISVGGARVAGITASLF 356
Query: 301 GV--GDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEY-TCFQYSESVD 357
+ N G IIDSGT++ L Y L + DLK + + TCF S +
Sbjct: 357 KLDTAGNGGVIIDSGTSVTRLTRRAYVSLRDAFRAGAADLKRAPDYSLFDTCFDLSGKTE 416
Query: 358 EGFPNVTFHFENS-VSLKVYPHEYLFPFED--LWCIGWQNS 395
P V HF + VSL YL P + ++C + +
Sbjct: 417 VKVPTVVMHFRGADVSLPA--TNYLIPVDTNGVFCFAFAGT 455
>gi|356528623|ref|XP_003532899.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 507
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 99/409 (24%), Positives = 155/409 (37%), Gaps = 81/409 (19%)
Query: 45 LKEHDARRQ---QRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMW 101
+ +D RR+ V++P+ + R D +G Y+ ++ +G+P + +++ DTGS+ W
Sbjct: 78 VSNYDRRRKGLETTTTTEVEMPMR-AGRDDALGEYFTEVKVGSPGQRFWLAADTGSEFTW 136
Query: 102 VNCIQ---------------------------------------------CKE--CPRRS 114
NC+ CK CP RS
Sbjct: 137 FNCVMRNATTTATTKKTRKNKTKKKHHHHSKRNRTRTTRRTKKKKAKSNPCKGVFCPHRS 196
Query: 115 SLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYF 174
+ + +S + Q F + P C + S Y DGSS G+F
Sbjct: 197 K-----SFQAVTCASQKCKIDLSQLFSLSLCPKPSDPCLYDIS------YADGSSAKGFF 245
Query: 175 VQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQL 234
D + D +G + +L GC + N + NE+ GI+G G + S I +
Sbjct: 246 GTDTITVDLKNGKEGKLN---NLTIGC-TKSMENGVNFNEDT-GGILGLGFAKDSFIDKA 300
Query: 235 ASSGGVRKMFAHCL----DGINGGGIFAIG----HVVQPEVNKTPLVPNQPHYSINMTAV 286
A G + F++CL N IG + E+ +T L+ P Y +N+ +
Sbjct: 301 AYEYGAK--FSYCLVDHLSHRNVSSYLTIGGHHNAKLLGEIKRTELILFPPFYGVNVVGI 358
Query: 287 QVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDE 346
+G L +P V+ GT+IDSGTTL L YEP+ +I +K T D
Sbjct: 359 SIGGQMLKIPPQVWDFNSQGGTLIDSGTTLTALLVPAYEPVFEALIKSLTKVKRVTGEDF 418
Query: 347 YT---CFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFEDL-WCIG 391
CF D P + FHF + Y+ L CIG
Sbjct: 419 GALDFCFDAEGFDDSVVPRLVFHFAGGARFEPPVKSYIIDVAPLVKCIG 467
>gi|356523155|ref|XP_003530207.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 412
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 97/347 (27%), Positives = 157/347 (45%), Gaps = 53/347 (15%)
Query: 52 RQQRILAGVDLPLGGSSRPDGVGL------YYAKIGIGTPPKDYYVQVDTGSDIMWVNCI 105
R +R+++ ++ + P G+ Y +G+G+ + V +DTGSD+ WV C
Sbjct: 35 RIRRVVSSHNVEASQTQIPLSSGINLQTLNYIVTMGLGS--TNMTVIIDTGSDLTWVQCE 92
Query: 106 QCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVY--GGPLTDCTAN-TSCPYLE 162
C C + ++ SS+ + V+C+ C + G C +N ++C Y+
Sbjct: 93 PCMSCYNQQG-----PIFKPSTSSSYQSVSCNSSTCQSLQFATGNTGACGSNPSTCNYVV 147
Query: 163 IYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIG 222
YGDGS T G + + + VS +FGCG G + G++G
Sbjct: 148 NYGDGSYTNGELGVEQLSFGGVSVS--------DFVFGCGRNNKGLFG-----GVSGLMG 194
Query: 223 FGKSNSSMISQL-ASSGGVRKMFAHCLDGINGG--GIFAIGHVVQPEVNKTP-----LVP 274
G+S S++SQ A+ GGV F++CL G G +G+ N TP ++P
Sbjct: 195 LGRSYLSLVSQTNATFGGV---FSYCLPTTESGASGSLVMGNESSVFKNVTPITYTRMLP 251
Query: 275 N---QPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKI 331
N Y +N+T + V L +P+ FG N G +IDSGT + LP VY+ L +
Sbjct: 252 NPQLSNFYILNLTGIDVDGVALQVPS--FG---NGGVLIDSGTVITRLPSSVYKALKALF 306
Query: 332 ISQ---QPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKV 375
+ Q P ++ D TCF + + P ++ HFE + LKV
Sbjct: 307 LKQFTGFPSAPGFSILD--TCFNLTGYDEVSIPTISMHFEGNAELKV 351
>gi|413921976|gb|AFW61908.1| hypothetical protein ZEAMMB73_608282 [Zea mays]
Length = 459
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 89/331 (26%), Positives = 148/331 (44%), Gaps = 44/331 (13%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y ++ IGTPP + DTGSD+ W C CK C + +YD S++ V
Sbjct: 95 YLMELAIGTPPVPFVALADTGSDLTWTQCKPCKLC-----FPQDTPIYDTAASASFSPVP 149
Query: 136 CDQEFCHGVYGGPLTDCTANTSCP--YLEIYGDGSSTTGYFVQDVVQYDKVS-GDLQTTS 192
C C ++ +CTA T+ P Y Y DG+ + G + + + S G
Sbjct: 150 CASATCLPIWRSS-RNCTATTTSPCRYRYAYDDGAYSAGVLGTETLTFAGSSPGAPGPGV 208
Query: 193 TNGSLIFGCGARQSG-NLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL-DG 250
+ G + FGCG G + +ST G +G G+ + S+++QL GV K F++CL D
Sbjct: 209 SVGGVAFGCGVDNGGLSYNST------GTVGLGRGSLSLVAQL----GVGK-FSYCLTDF 257
Query: 251 IN---GGGIF--AIGHVVQPE------VNKTPLV--PNQP-HYSINMTAVQVGLDFLNLP 296
N G + ++ + P V TPLV P P Y +++ + +G L +P
Sbjct: 258 FNTSLGSPVLFGSLAELAAPSTIGGAAVQSTPLVQGPYNPSRYYVSLEGISLGDARLPIP 317
Query: 297 TDVFGVGDN--KGTIIDSGTTLAYLPEMVYEPLVSKI--ISQQPDLKVHTVHDEYTCFQY 352
F + D+ G I+DSGT L E + +V+ + + QP + ++ + CF
Sbjct: 318 NGTFDLRDDGSGGMIVDSGTIFTVLVESAFRVVVNHVAGVLNQPVVNASSL--DSPCFPA 375
Query: 353 S--ESVDEGFPNVTFHFENSVSLKVYPHEYL 381
+ E P++ HF ++++ Y+
Sbjct: 376 TAGEQQLPDMPDMLLHFAGGADMRLHRDNYM 406
>gi|326515172|dbj|BAK03499.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 494
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 81/300 (27%), Positives = 124/300 (41%), Gaps = 28/300 (9%)
Query: 91 VQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLT 150
V VDT SDI WV QC CP + LYD SST + C C +
Sbjct: 171 VVVDTSSDIPWV---QCLPCPIPQCHLQKDPLYDPAKSSTFAPIPCGSPACKELGSSYGN 227
Query: 151 DCTANT-SCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNL 209
C+ T C Y+ YGDG +TTG +V D + + T FGC G+
Sbjct: 228 GCSPTTDECKYIVNYGDGKATTGTYVTDTLT-------MSPTIVVKDFRFGCSHAVRGSF 280
Query: 210 DSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHVVQPEV-- 267
+ N GI+ G S++ Q A + G F++C+ + G ++G V+ +
Sbjct: 281 SNQNA----GILALGGGRGSLLEQTADAYG--NAFSYCIPKPSSAGFLSLGGPVEASLKF 334
Query: 268 NKTPLVPNQ---PHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVY 324
+ TPL+ N+ Y +++ A+ V L +P F G ++DSG + LP VY
Sbjct: 335 SYTPLIKNKHAPTFYIVHLEAIIVAGKQLAVPPTAFAT----GAVMDSGAVVTQLPPQVY 390
Query: 325 EPLVSKIISQQPDLK--VHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLF 382
L + S V + TC+ ++ D P V+ F +L + P +
Sbjct: 391 AALRAAFRSAMAAYGPLAAPVRNLDTCYDFTRFPDVKVPKVSLVFAGGATLDLEPASIIL 450
>gi|302781726|ref|XP_002972637.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
gi|300160104|gb|EFJ26723.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
Length = 393
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 99/369 (26%), Positives = 153/369 (41%), Gaps = 61/369 (16%)
Query: 21 GVSSNHGVFSVKYRYAGRERSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKI 80
GV + + + + R R ++ + V+ PL PDG G Y I
Sbjct: 5 GVKRSEAIRGLVAKSHARVRWMAARANSSSWSSMAGTTDVESPL----HPDGGG-YVMDI 59
Query: 81 GIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEF 140
+GTP K + DTGSD++WV C C S G T++D + SST + + C +
Sbjct: 60 SVGTPGKRFRAIADTGSDLVWVQSEPCTGC----SGG---TIFDPRQSSTFREMDCSSQL 112
Query: 141 CHGVYGGPLTDCTANTS-CPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIF 199
C + G C +S C Y YG G T G F +D + SG Q S
Sbjct: 113 CTELPG----SCEPGSSACSYSYEYGSG-ETEGEFARDTISLGTTSGGSQKFP---SFAV 164
Query: 200 GCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAI 259
GCG SG + +DG++G G+ S+ SQL S + F++CL IN
Sbjct: 165 GCGMVNSGF------DGVDGLVGLGQGPVSLTSQL--SAAIDSKFSYCLVDINS------ 210
Query: 260 GHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVF----------------GVG 303
+ +PL+ P +++ T +Q + P+D + +G
Sbjct: 211 ------QSESSPLLFG-PSAALHGTGIQS--TKITPPSDTYPTYYLLTVNGIAVAGQTMG 261
Query: 304 DNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVH-TVHDEYTCFQYSESVDEGFPN 362
TIIDSGTTL Y+P VY ++S++ S +V + C+ S + + FP
Sbjct: 262 SPGTTIIDSGTTLTYVPSGVYGRVLSRMESMVTLPRVDGSSMGLDLCYDRSSNRNYKFPA 321
Query: 363 VTFHFENSV 371
+T +
Sbjct: 322 LTIRLAGAT 330
>gi|195625122|gb|ACG34391.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 471
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 95/340 (27%), Positives = 146/340 (42%), Gaps = 39/340 (11%)
Query: 57 LAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQC-KECPRRSS 115
LA V L G S GVG Y ++G+GTP Y + VDTGS + W+ C C C R+
Sbjct: 118 LASVPLSPGTSV---GVGNYVTQLGLGTPSTSYAMVVDTGSSLTWLQCSPCVVSCHRQVG 174
Query: 116 LGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPL--TDCTANTSCPYLEIYGDGSSTTGY 173
L+D + SST V C C + L + C+A+ C Y YGD S + G
Sbjct: 175 -----PLFDPRASSTYASVRCSASQCDELQAATLNPSACSASNVCIYQASYGDSSFSVGS 229
Query: 174 FVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQ 233
D V + ++ S +GCG G + G+IG ++ S++ Q
Sbjct: 230 LSTDTVSFG--------STRYPSFYYGCGQDNEGLFGRSA-----GLIGLARNKLSLLYQ 276
Query: 234 LASSGGVRKMFAHCLDGINGGGIFAIG-HVVQPEVNKTPLVP---NQPHYSINMTAVQVG 289
LA S G F++CL G +IG + + TP+ + Y I ++ + VG
Sbjct: 277 LAPSLGYS--FSYCLPTAASTGYLSIGPYNTGHYYSYTPMASSSLDASLYFITLSGMSVG 334
Query: 290 LDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPL---VSKIISQQPDLKVHTVHDE 346
L + + + TIIDSGT + LP V+ L V++ ++ ++ D
Sbjct: 335 GSPLAVSPSEY---SSLPTIIDSGTVITRLPTAVHTALSKAVAQAMAGAQRAPAFSILD- 390
Query: 347 YTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFED 386
TCF+ ++ P V F S+K+ L +D
Sbjct: 391 -TCFE-GQASQLRVPTVAMAFAGGASMKLTTRNVLIDVDD 428
>gi|88174571|gb|ABD39360.1| chloroplast nucleoid DNA-binding protein [Oryza nivara]
Length = 321
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 94/343 (27%), Positives = 149/343 (43%), Gaps = 58/343 (16%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKEC---PRRSSLGIELTLYDIKDSSTGK 132
Y +G+GTP K V++DTGS WV C +C C PR T + ++ K
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFC-ECDGCHTNPR--------TFLQSRSTTCAK 51
Query: 133 FVTCDQEFCHGVYGGPLTDCTANTS---CPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQ 189
V+C C + GG C + + CP+ Y DGS++ G QD + + D+Q
Sbjct: 52 -VSCGTSMC--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF----SDVQ 104
Query: 190 TTSTNGSLIFGCGARQSGNLDS--TNEEA-LDGIIGFGKSNSSMISQLASSGGVRKMFAH 246
S FGC NLDS NE +DG++G G S++ Q S F++
Sbjct: 105 KIP---SFTFGC------NLDSFGANEFGNVDGLLGMGAGPMSVLKQ---SSPRFDGFSY 152
Query: 247 CL------DGI--NGGGIFAIGHVV-QPEVNKTPLVPNQPH---YSINMTAVQVGLDFLN 294
CL G G F++G V + +V T +V + + + +++ A+ V + L
Sbjct: 153 CLPLQKSERGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLG 212
Query: 295 LPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSE 354
L +F KG + DSG+ L+Y+P+ L +I E C+
Sbjct: 213 LSPSIFS---RKGVVFDSGSELSYIPDRALSVLSQRIRELLLRRGAAEEESERNCYDM-R 268
Query: 355 SVDEG-FPNVTFHFENSVSLKVYPH----EYLFPFEDLWCIGW 392
SVDEG P ++ HF++ + E +D+WC+ +
Sbjct: 269 SVDEGDMPAISLHFDDGARFDLGSKGVFVERSVQEQDVWCLAF 311
>gi|225455876|ref|XP_002275164.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 496
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 89/334 (26%), Positives = 137/334 (41%), Gaps = 42/334 (12%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y+ ++GIG P K +Y+ +DTGSD+ W+ C C +C ++ ++D SS+
Sbjct: 156 GSGEYFLRVGIGRPSKTFYMVIDTGSDVNWLQCKPCDDCYQQVD-----PIFDPASSSSF 210
Query: 132 KFVTCDQEFCHGVYGGPLTDCTA--NTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQ 189
+ C C + D A N SC Y YGDGS T G F + V + SG +
Sbjct: 211 SRLGCQTPQCRNL------DVFACRNDSCLYQVSYGDGSYTVGDFATETVSFGN-SGSVD 263
Query: 190 TTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL- 248
+ GCG G G S+ SQ+ +S F++CL
Sbjct: 264 ------KVAIGCGHDNEGLFVGAAGLIGLGGGPL-----SLTSQIKASS-----FSYCLV 307
Query: 249 --DGINGGGIFAIGHVVQPEVNKTPLVPNQP---HYSINMTAVQVGLDFLNLPTDVFGV- 302
D ++ + + P+ N Y + +T + VG + L +P +F V
Sbjct: 308 NRDSVDSSTL-EFNSAKPSDSVTAPIFKNSKVDTFYYVGITGMSVGGEKLAIPPSIFEVD 366
Query: 303 GDNK-GTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEY-TCFQYSESVDEGF 360
G K G I+D GT + L Y L + DL + + TC+ S
Sbjct: 367 GSGKGGIIVDCGTAVTRLQTQAYNALRDTFVKLTKDLPSTSGFALFDTCYNLSSRTSVRV 426
Query: 361 PNVTFHFENSVSLKVYPHEYLFPFED--LWCIGW 392
P V F F+ SL + P YL P + +C+ +
Sbjct: 427 PTVAFLFDGGKSLPLPPSNYLIPVDSAGTFCLAF 460
>gi|297811185|ref|XP_002873476.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
lyrata]
gi|297319313|gb|EFH49735.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
lyrata]
Length = 475
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 106/392 (27%), Positives = 159/392 (40%), Gaps = 67/392 (17%)
Query: 41 SLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIM 100
S L H ++ Q DLP S G G Y +G+GTP D + DTGSD+
Sbjct: 104 SKKLTTNHVSQSQS-----TDLPAKDGSTL-GSGNYIVTVGLGTPKNDLSLIFDTGSDLT 157
Query: 101 WVNCIQCKECPRRSSLGIELTLYDIKD------SSTGKF-VTCDQEFCHGVYGGPLTDCT 153
W QC+ C R T YD K+ ST + V+C C G L+ T
Sbjct: 158 WT---QCQPCVR--------TCYDQKEPIFNPSKSTSYYNVSCSSAAC-----GSLSSAT 201
Query: 154 AN------TSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSG 207
N ++C Y YGD S + G+ +D ++ S D+ + FGCG G
Sbjct: 202 GNAGSCSASNCIYGIQYGDQSFSVGFLAKD--KFTLTSSDVFD-----GVYFGCGENNQG 254
Query: 208 NLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL-DGINGGGIFAIGHV-VQP 265
+ G++G G+ S SQ A++ K+F++CL + G G +
Sbjct: 255 LF-----TGVAGLLGLGRDKLSFPSQTATA--YNKIFSYCLPSSASYTGHLTFGSAGISR 307
Query: 266 EVNKTP---LVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEM 322
V TP + Y +N+ A+ VG L +P+ VF G +IDSGT + LP
Sbjct: 308 SVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVF---STPGALIDSGTVITRLPPK 364
Query: 323 VYEPLVSKI---ISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHE 379
Y L S +S+ P ++ D TCF S P V F F +++
Sbjct: 365 AYAALRSSFKAKMSKYPTTSGVSILD--TCFDLSGFKTVTIPKVAFSFSGGAVVELGSKG 422
Query: 380 YLFPFE-DLWCIGWQNSGMQSRDRKNMTLLGD 410
+ F+ C+ + + D N + G+
Sbjct: 423 IFYAFKISQVCLAFAG----NSDDSNAAIFGN 450
>gi|255563835|ref|XP_002522918.1| nucellin, putative [Ricinus communis]
gi|223537845|gb|EEF39461.1| nucellin, putative [Ricinus communis]
Length = 433
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 87/281 (30%), Positives = 122/281 (43%), Gaps = 32/281 (11%)
Query: 62 LPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELT 121
PL G+ P G Y + IG P K Y++ VDTGSD+ W+ C + P R +
Sbjct: 59 FPLHGNVYP--AGYYNVTLSIGQPAKPYFLDVDTGSDLTWLQC----DAPCRQCIEAPHP 112
Query: 122 LYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQY 181
LY + V C+ C + + +C C Y Y DG S+ G V+DV
Sbjct: 113 LY----RPSNNLVICEDPLCASLQPPGVHNCQDPDQCDYEVEYADGGSSLGVLVKDVFVL 168
Query: 182 DKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVR 241
+ +G N L GCG Q L + LDGI+G G+ SS+ SQL+S G V
Sbjct: 169 NFTNG----KRLNPLLALGCGYDQ---LPGRSNHPLDGILGLGRGISSIPSQLSSQGLVS 221
Query: 242 KMFAHCLDGINGGGIFAIGHVVQPE-VNKTPLVPNQ-PHYSINMTAVQVGLDFLNLPTDV 299
+ HCL G GG +F + V TP+ + HYS F L D
Sbjct: 222 NVIGHCLSGRGGGFLFFGEDIYDSSGVTWTPMSRDHLKHYSPG---------FAELIFDG 272
Query: 300 FGVG-DNKGTIIDSGTTLAYLPEMVYEPLV---SKIISQQP 336
G N + DSG++ YL Y+ LV + +S++P
Sbjct: 273 KSTGIRNLLVVFDSGSSYTYLNAQAYQHLVFSLKRELSRKP 313
>gi|297723019|ref|NP_001173873.1| Os04g0331600 [Oryza sativa Japonica Group]
gi|255675338|dbj|BAH92601.1| Os04g0331600, partial [Oryza sativa Japonica Group]
Length = 72
Score = 93.6 bits (231), Expect = 2e-16, Method: Composition-based stats.
Identities = 42/73 (57%), Positives = 61/73 (83%), Gaps = 1/73 (1%)
Query: 204 RQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHVV 263
+Q+G+L+++ E A+DGIIGFG SN +++SQLA++G +K+F+HCLD NGGGIFAIG VV
Sbjct: 1 QQTGSLNNS-ELAIDGIIGFGNSNQTLLSQLAAAGKTKKIFSHCLDSTNGGGIFAIGEVV 59
Query: 264 QPEVNKTPLVPNQ 276
+P+V TP+V N+
Sbjct: 60 EPKVKTTPIVKNK 72
>gi|238010910|gb|ACR36490.1| unknown [Zea mays]
gi|413942664|gb|AFW75313.1| hypothetical protein ZEAMMB73_520329 [Zea mays]
Length = 481
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 81/305 (26%), Positives = 127/305 (41%), Gaps = 41/305 (13%)
Query: 91 VQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLT 150
V +D+ SD+ WV C+ C P + + YD S + +C C + GP
Sbjct: 161 VVLDSASDVPWVQCVPCPIPPCHPQVD---SFYDPSRSPSSAPFSCSSPTCTAL--GPYA 215
Query: 151 DCTANTSCPYLEIYGDGSSTTGYFVQDVVQYD---KVSGDLQTTSTNGSLIFGCGARQSG 207
+ AN C YL Y DGSST+G ++ D++ D VSG FGC + G
Sbjct: 216 NGCANNQCQYLVRYPDGSSTSGAYIADLLTLDAGNAVSG----------FKFGCSHAEQG 265
Query: 208 NLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI-NGGGIFAIG------ 260
+ D+ GI+ G S++SQ AS G F++C+ + G F +G
Sbjct: 266 SFDARAA----GIMALGGGPESLLSQTASRYG--NAFSYCIPATASDSGFFTLGVPRRAS 319
Query: 261 --HVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAY 318
+VV P V Y + + + VG L + VF G+++DS T +
Sbjct: 320 SRYVVTPMVR---FRQAATFYGVLLRTITVGGQRLGVAPAVFAA----GSVLDSRTAITR 372
Query: 319 LPEMVYEPLVSKIISQQPDLKVHTVHDEY-TCFQYSESVDEGFPNVTFHFENSVSLKVYP 377
LP Y+ L S S + TC+ ++ V+ P ++ F+ + L + P
Sbjct: 373 LPPTAYQALRSAFRSSMTMYRSAPPKGYLDTCYDFTGVVNIRLPKISLVFDRNAVLPLDP 432
Query: 378 HEYLF 382
LF
Sbjct: 433 SGILF 437
>gi|413953656|gb|AFW86305.1| hypothetical protein ZEAMMB73_151223 [Zea mays]
Length = 406
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 76/257 (29%), Positives = 111/257 (43%), Gaps = 34/257 (13%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNC--IQCKECPRRSSLGIELTLYDIKDSSTG 131
GLYY I +G+PP+ Y++ VDTGS WV C C C + + LY + + T
Sbjct: 158 GLYYTAISLGSPPRPYFLDVDTGSHTTWVQCDAPPCASCAKGAH-----PLY--RPARTA 210
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
+ C G C Y Y DGSS+ G +V+D +Q+ G+ +
Sbjct: 211 DALPASDPLCEGA------QHENPNQCDYEISYADGSSSMGVYVRDSMQFVGEDGERE-- 262
Query: 192 STNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL--D 249
N ++FGCG Q G L + E DG++G S+ +QLAS G + F HC+ D
Sbjct: 263 --NADIVFGCGYDQQGVLLNA-LETTDGVLGLTNKALSLPTQLASRGIISNAFGHCMSTD 319
Query: 250 GINGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQV-----GLDFLNLPTDVFGVGD 304
GG +G P T VP + + ++ QV G LN G
Sbjct: 320 PSGAGGYLFLGDDYIPRWGMT-WVPIRDGPADDVRRAQVKQINHGDQQLN------AQGK 372
Query: 305 NKGTIIDSGTTLAYLPE 321
+ D+G+T Y P+
Sbjct: 373 LTQVVFDTGSTYTYFPD 389
>gi|219362525|ref|NP_001136612.1| uncharacterized protein LOC100216735 [Zea mays]
gi|194696366|gb|ACF82267.1| unknown [Zea mays]
gi|413953802|gb|AFW86451.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
Length = 411
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 84/309 (27%), Positives = 130/309 (42%), Gaps = 37/309 (11%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y ++ GTP V +DTGSD+ W +QCK C + LYD SST V
Sbjct: 79 YVVRVSFGTPAVPQVVVIDTGSDVSW---LQCKPCSSGQCFPQKDPLYDPSHSSTYSAVP 135
Query: 136 CDQEFCHGV----YGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
C + C + YG + CT+ C + Y DG+ST G + QD + L
Sbjct: 136 CASDVCKKLAADAYG---SGCTSGKQCGFAISYADGTSTVGAYSQDKLT-------LAPG 185
Query: 192 STNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI 251
+ + FGCG + DG++G G+ S+ ++ GGV F++CL +
Sbjct: 186 AIVQNFYFGCGHGK-----HAVRGLFDGVLGLGRLRESLGARY---GGV---FSYCLPSV 234
Query: 252 NGG-GIFAIGHVVQPE-VNKTPL--VPNQPHYS-INMTAVQVGLDFLNLPTDVFGVGDNK 306
+ G A+G P TP+ VP QP +S + + + VG L+L F +
Sbjct: 235 SSKPGFLALGAGKNPSGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLRPSAF----SG 290
Query: 307 GTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFH 366
G I+DSGT + L Y L S ++ D TC+ + + P +
Sbjct: 291 GMIVDSGTVITGLQSTAYRALRSAFRKAMEAYRLLPNGDLDTCYNLTGYKNVVVPKIALT 350
Query: 367 FENSVSLKV 375
F ++ +
Sbjct: 351 FTGGATINL 359
>gi|167997964|ref|XP_001751688.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696786|gb|EDQ83123.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 418
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 95/354 (26%), Positives = 145/354 (40%), Gaps = 53/354 (14%)
Query: 47 EHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQ 106
HD Q +++G L G G Y+ +GTPP+ + + VD+GSD++WV C
Sbjct: 44 SHDYGFQSPVVSGSTL---------GSGQYFVDFFLGTPPQKFSLIVDSGSDLLWVQCSP 94
Query: 107 CKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFC---HGVYGGPLTDCTANTSCPYLEI 163
C++C + S LY +SST V C C G P D +C Y +
Sbjct: 95 CRQCYAQDS-----PLYVPSNSSTFSPVPCLSSDCLLIPATEGFP-CDFRYPGACAYEYL 148
Query: 164 YGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGF 223
Y D SS+ G F + D V D + FGCG+ G+ A G++G
Sbjct: 149 YADTSSSKGVFAYESATVDGVRID--------KVAFGCGSDNQGSF-----AAAGGVLGL 195
Query: 224 GKSNSSMISQLASSGGVRKMFAHCLDG-----------INGGGIFAIGHVVQPEVNKTPL 272
G+ S SQ+ + G + FA+CL I G + + H +Q TP+
Sbjct: 196 GQGPLSFGSQVGYAYGNK--FAYCLVNYLDPTSVSSSLIFGDELISTIHDMQ----YTPI 249
Query: 273 V--PNQPH-YSINMTAVQVGLDFLNLPTDVFGVG--DNKGTIIDSGTTLAYLPEMVYEPL 327
V P P Y + + V VG L + + + N G+I DSGTTL Y Y +
Sbjct: 250 VSNPKSPTLYYVQIEKVTVGGKSLPISDSAWEIDLLGNGGSIFDSGTTLTYWFPSAYSHI 309
Query: 328 VSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYL 381
++ S + +V C + + FP+ T F++ + Y
Sbjct: 310 LAAFDSGVHYPRAESVQGLDLCVELTGVDQPSFPSFTIEFDDGAVFQPEAENYF 363
>gi|224072755|ref|XP_002303865.1| predicted protein [Populus trichocarpa]
gi|222841297|gb|EEE78844.1| predicted protein [Populus trichocarpa]
Length = 412
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 97/354 (27%), Positives = 156/354 (44%), Gaps = 54/354 (15%)
Query: 45 LKEHDARRQQRILAG-------VDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGS 97
L+ +R + IL+G +PL R + Y + +G + V VDTGS
Sbjct: 29 LRSLQSRIKNIILSGNIDDSVDTQIPLTSGIRLQSLN-YIVTVELGG--RKMTVIVDTGS 85
Query: 98 DIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVY--GGPLTDCTAN 155
D+ WV C C C + +++ S + + V C+ C + G C +N
Sbjct: 86 DLSWVQCQPCNRCYNQQD-----PVFNPSKSPSYRTVLCNSLTCRSLQLATGNSGVCGSN 140
Query: 156 -TSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNE 214
+C Y+ YGDGS T+G + + +L T+ N + IFGCG + G +
Sbjct: 141 PPTCNYVVNYGDGSYTSGEVGMEHL-------NLGNTTVN-NFIFGCGRKNQGLFGGAS- 191
Query: 215 EALDGIIGFGKSNSSMISQLASS-GGVRKMFAHCLDGI--NGGGIFAIGHVVQPEVNKTP 271
G++G G+++ S+ISQ++ GGV F++CL G +G N TP
Sbjct: 192 ----GLVGLGRTDLSLISQISPMFGGV---FSYCLPTTEAEASGSLVMGGNSSVYKNTTP 244
Query: 272 LV-------PNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVY 324
+ P P Y +N+T + VG + P+ FG IIDSGT ++ LP +Y
Sbjct: 245 ISYTRMIHNPLLPFYFLNLTGITVGGVEVQAPS--FG---KDRMIIDSGTVISRLPPSIY 299
Query: 325 EPLVSKIISQ---QPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKV 375
+ L ++ + Q P + D +CF S + P++ +FE S L V
Sbjct: 300 QALKAEFVKQFSGYPSAPSFMILD--SCFNLSGYQEVKIPDIKMYFEGSAELNV 351
>gi|224082314|ref|XP_002306645.1| predicted protein [Populus trichocarpa]
gi|222856094|gb|EEE93641.1| predicted protein [Populus trichocarpa]
Length = 410
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 101/379 (26%), Positives = 156/379 (41%), Gaps = 40/379 (10%)
Query: 47 EHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNC-I 105
E R+ + V + G+ P G Y + IG PPK + +DTGSD+ WV C
Sbjct: 27 ESSTPANDRVGSSVFFRVTGNVYP--TGYYSVILNIGNPPKAFDFDIDTGSDLTWVQCDA 84
Query: 106 QCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTA-NTSCPYLEIY 164
CK C + LY K++ V C C V G C A + C Y Y
Sbjct: 85 PCKGCTKPRD-----KLYKPKNN----LVPCSNSLCQAVSTGENYHCDAPDDQCDYEIEY 135
Query: 165 GDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFG 224
D S+ G + D +G L + FGCG Q +L GI+G G
Sbjct: 136 ADLGSSIGVLLSDSFPLRLSNGTL----LQPKMAFGCGYDQK-HLGPHPPPDTAGILGLG 190
Query: 225 KSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHVV-QPEVNKTPLVPNQPHYSINM 283
+ S++SQL + G + + HC GG +F H+ + TP++ + +
Sbjct: 191 RGKVSILSQLRTLGITQNVVGHCFSRARGGFLFFGDHLFPSSRITWTPMLRSSSDTLYSS 250
Query: 284 TAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEP---LVSKIISQQPDLKV 340
+ L F PT + G+ I DSG++ Y VY+ LV K ++ +P LK
Sbjct: 251 GPAE--LLFGGKPTGIKGL----QLIFDSGSSYTYFNAQVYQSILNLVRKDLAGKP-LKD 303
Query: 341 HTVHDEYTCFQYSE------SVDEGFPNVTFHFENS--VSLKVYPHEYLFPFED-LWCIG 391
+ C++ ++ + F +T F N+ V L++ P +YL +D C+G
Sbjct: 304 APEKELAVCWKTAKPIKSILDIKSYFKPLTISFMNAKNVQLQLAPEDYLIITKDGNVCLG 363
Query: 392 WQNSGMQSRDRKNMTLLGD 410
N Q N ++GD
Sbjct: 364 ILNGSEQQLG--NFNVIGD 380
>gi|224066811|ref|XP_002302227.1| predicted protein [Populus trichocarpa]
gi|222843953|gb|EEE81500.1| predicted protein [Populus trichocarpa]
Length = 422
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 94/352 (26%), Positives = 149/352 (42%), Gaps = 40/352 (11%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNC-IQCKECPRRSSLGIELTLYDIKDSSTGK 132
G Y + IG PPK + + +DTGSD+ WV C CK C + LY K++
Sbjct: 66 GHYSVILNIGNPPKAFDLDIDTGSDLTWVQCDAPCKGCTKPLD-----KLYKPKNNR--- 117
Query: 133 FVTCDQEFCHGVYGGPLTDCTANTS-CPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
V C C + +C T C Y Y D S+ G + D +G L
Sbjct: 118 -VPCASSLCQAIQN---NNCDIPTEQCDYEVEYADLGSSLGVLLSDYFPLRLNNGSL--- 170
Query: 192 STNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI 251
+ FGCG Q L + GI+G G+ +S++SQL + G + + HC +
Sbjct: 171 -LQPRIAFGCGYDQK-YLGPHSPPDTAGILGLGRGKASILSQLRTLGITQNVVGHCFSRV 228
Query: 252 NGGGIFAIGHVVQPE-VNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTII 310
GG +F H++ P + TP++ + + + L F PT + G+ I
Sbjct: 229 TGGFLFFGDHLLPPSGITWTPMLRSSSDTLYSSGPAE--LLFGGKPTGIKGL----QLIF 282
Query: 311 DSGTTLAYLPEMVYEP---LVSKIISQQPDLKVHTVHDEYTCFQYSE------SVDEGFP 361
DSG++ Y VY+ LV K +S P C++ ++ + F
Sbjct: 283 DSGSSYTYFNAQVYQSILNLVRKDLSGMPLKDAPEEKALAVCWKTAKPIKSILDIKSFFK 342
Query: 362 NVTFHF--ENSVSLKVYPHEYLFPFED-LWCIGWQNSGMQSRDRKNMTLLGD 410
+T +F +V L++ P +YL +D C+G N G Q N+ ++GD
Sbjct: 343 PLTINFIKAKNVQLQLAPEDYLIITKDGNVCLGILNGGEQGLG--NLNVIGD 392
>gi|54290717|dbj|BAD62387.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|215734915|dbj|BAG95637.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 469
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 92/337 (27%), Positives = 145/337 (43%), Gaps = 38/337 (11%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCK-ECPRRSSLGIELTLYDIKDSST 130
VG Y ++G+GTP Y + VDTGS + W+ C C C R++ ++D + S T
Sbjct: 127 AVGNYVTRLGLGTPATSYVMVVDTGSSLTWLQCSPCSVSCHRQAG-----PVFDPRASGT 181
Query: 131 GKFVTCDQEFCHGVYGGPL--TDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDL 188
V C C + L + C+ + C Y YGD S + GY +D V + SG
Sbjct: 182 YAAVQCSSSECGELQAATLNPSACSVSNVCIYQASYGDSSYSVGYLSKDTVSFG--SGSF 239
Query: 189 QTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL 248
+GCG G + G+IG K+ S++ QLA S G F++CL
Sbjct: 240 P------GFYYGCGQDNEGLFGRSA-----GLIGLAKNKLSLLYQLAPSLGY--AFSYCL 286
Query: 249 DGIN-GGGIFAIGHVVQPEVNKTPLVP---NQPHYSINMTAVQVGLDFLNLPTDVFGVGD 304
+ G +IG + + TP+ + Y + ++ + V L +P +
Sbjct: 287 PTSSAAAGYLSIGSYNPGQYSYTPMASSSLDASLYFVTLSGISVAGAPLAVPPSEY---R 343
Query: 305 NKGTIIDSGTTLAYLPEMVYEPL----VSKIISQQPDLKVHTVHDEYTCFQYSESVDEGF 360
+ TIIDSGT + LP VY L + + S P +++ D TCF+ S +
Sbjct: 344 SLPTIIDSGTVITRLPPNVYTALSRAVAAAMASAAPRAPTYSILD--TCFRGS-AAGLRV 400
Query: 361 PNVTFHFENSVSLKVYPHEYLFPFED-LWCIGWQNSG 396
P V F +L + P L +D C+ + +G
Sbjct: 401 PRVDMAFAGGATLALSPGNVLIDVDDSTTCLAFAPTG 437
>gi|194696934|gb|ACF82551.1| unknown [Zea mays]
gi|413936470|gb|AFW71021.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
gi|413953801|gb|AFW86450.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
Length = 445
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 84/309 (27%), Positives = 130/309 (42%), Gaps = 37/309 (11%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y ++ GTP V +DTGSD+ W +QCK C + LYD SST V
Sbjct: 113 YVVRVSFGTPAVPQVVVIDTGSDVSW---LQCKPCSSGQCFPQKDPLYDPSHSSTYSAVP 169
Query: 136 CDQEFCHGV----YGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
C + C + YG + CT+ C + Y DG+ST G + QD + L
Sbjct: 170 CASDVCKKLAADAYG---SGCTSGKQCGFAISYADGTSTVGAYSQDKLT-------LAPG 219
Query: 192 STNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI 251
+ + FGCG + DG++G G+ S+ ++ GGV F++CL +
Sbjct: 220 AIVQNFYFGCGHGK-----HAVRGLFDGVLGLGRLRESLGARY---GGV---FSYCLPSV 268
Query: 252 NGG-GIFAIGHVVQPE-VNKTPL--VPNQPHYS-INMTAVQVGLDFLNLPTDVFGVGDNK 306
+ G A+G P TP+ VP QP +S + + + VG L+L F +
Sbjct: 269 SSKPGFLALGAGKNPSGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLRPSAF----SG 324
Query: 307 GTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFH 366
G I+DSGT + L Y L S ++ D TC+ + + P +
Sbjct: 325 GMIVDSGTVITGLQSTAYRALRSAFRKAMEAYRLLPNGDLDTCYNLTGYKNVVVPKIALT 384
Query: 367 FENSVSLKV 375
F ++ +
Sbjct: 385 FTGGATINL 393
>gi|88174556|gb|ABD39353.1| chloroplast nucleoid DNA-binding protein [Oryza barthii]
Length = 321
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 92/345 (26%), Positives = 148/345 (42%), Gaps = 62/345 (17%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKEC---PRRSSLGIELTLYDIKDSSTGK 132
Y +G+GTP K V++DTGS WV C +C C PR T + ++ K
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSTSWVFC-ECDGCHTNPR--------TFLQSRSTTCAK 51
Query: 133 FVTCDQEFCHGVYGGPLTDCTANTS---CPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQ 189
V+C C + GG C + + CP+ Y DGS++ G QD + + D+Q
Sbjct: 52 -VSCGTSMC--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF----SDVQ 104
Query: 190 TTSTNGSLIFGC-----GARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMF 244
FGC GA + GN +DG++G G S++ Q S F
Sbjct: 105 KIP---GFSFGCNMDSFGANEFGN--------VDGLLGMGAGAMSVLKQ---SSPTFDCF 150
Query: 245 AHCL------DGI--NGGGIFAIGHVV-QPEVNKTPLVPNQPH---YSINMTAVQVGLDF 292
++CL G G F++G V + +V T +V + + + +++TA+ V +
Sbjct: 151 SYCLPLQKSERGFFSKTTGYFSLGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGER 210
Query: 293 LNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQY 352
L L +F KG + DSG+ L+Y+P+ L +I E C+
Sbjct: 211 LGLSPSIFS---RKGVVFDSGSELSYIPDRALSVLSQRIRELLLRRGAAEEESERNCYDM 267
Query: 353 SESVDEG-FPNVTFHFENSVSLKVYPH----EYLFPFEDLWCIGW 392
SVDEG P ++ HF++ + E +D+WC+ +
Sbjct: 268 -RSVDEGDMPAISLHFDDGARFDLGRGGVFVERSVQEQDVWCLAF 311
>gi|356532386|ref|XP_003534754.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 463
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 96/337 (28%), Positives = 138/337 (40%), Gaps = 48/337 (14%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G YY KIG+GTP K + + VDTGS + W +QC+ C + ++ ++ S T
Sbjct: 109 GSGNYYVKIGLGTPAKYFSMIVDTGSSLSW---LQCQPCVIYCHVQVD-PIFTPSTSKTY 164
Query: 132 KFVTCDQEFCHGVYGGPLTD--CT-ANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDL 188
K + C C + L C+ A +C Y YGD S + GY QDV+
Sbjct: 165 KALPCSSSQCSSLKSSTLNAPGCSNATGACVYKASYGDTSFSIGYLSQDVLTLTP----- 219
Query: 189 QTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL 248
+ + + ++GCG G ++ GIIG SM+ QL+ G F++CL
Sbjct: 220 -SEAPSSGFVYGCGQDNQGLFGRSS-----GIIGLANDKISMLGQLSKKYG--NAFSYCL 271
Query: 249 DGING-------GGIFAIG--HVVQPEVNKTPLVPNQP---HYSINMTAVQVGLDFLNLP 296
G +IG + TPLV NQ Y +++T + V P
Sbjct: 272 PSSFSAPNSSSLSGFLSIGASSLTSSPYKFTPLVKNQKIPSLYFLDLTTITVA----GKP 327
Query: 297 TDVFGVGDNKGTIIDSGTTLAYLPEMVYEPL-------VSKIISQQPDLKVHTVHDEYTC 349
V N TIIDSGT + LP VY L +SK +Q P + TC
Sbjct: 328 LGVSASSYNVPTIIDSGTVITRLPVAVYNALKKSFVLIMSKKYAQAPGFSILD-----TC 382
Query: 350 FQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFED 386
F+ S P + F L++ H L E
Sbjct: 383 FKGSVKEMSTVPEIQIIFRGGAGLELKAHNSLVEIEK 419
>gi|15219354|ref|NP_175079.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12320825|gb|AAG50556.1|AC074228_11 nucellin, putative [Arabidopsis thaliana]
gi|332193902|gb|AEE32023.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 405
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 93/343 (27%), Positives = 137/343 (39%), Gaps = 47/343 (13%)
Query: 58 AGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNC-IQCKECPRRSSL 116
+ V PL G+ P +G Y + IG+PPK + +DTGSD+ WV C C C +L
Sbjct: 33 SSVVFPLSGNVFP--LGYYSVLMQIGSPPKAFQFDIDTGSDLTWVQCDAPCSGCTLPPNL 90
Query: 117 GIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDC-TANTSCPYLEIYGDGSSTTGYFV 175
+ G + C C ++ C C Y Y D S+ G V
Sbjct: 91 QYK---------PKGNIIPCSNPICTALHWPNKPHCPNPQEQCDYEVKYADQGSSMGALV 141
Query: 176 QDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLA 235
D V+G + FGCG QS + A G++G G+ +++QL
Sbjct: 142 TDQFPLKLVNGSFMQP----PVAFGCGYDQS-YPSAHPPPATAGVLGLGRGKIGLLTQLV 196
Query: 236 SSGGVRKMFAHCLDGINGGGIFAIGHVVQPE--VNKTPLVPNQPHYSINMTAVQVGLDFL 293
S+G R + HCL GGG G + P V TPL+ HY T L F
Sbjct: 197 SAGLTRNVVGHCLSS-KGGGFLFFGDNLVPSIGVAWTPLLSQDNHY----TTGPADLLFN 251
Query: 294 NLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKV---HTVHDEYTC- 349
PT + G+ I D+G++ Y Y+ +++ I + DLKV ++ T
Sbjct: 252 GKPTGLKGL----KLIFDTGSSYTYFNSKAYQTIINLIGN---DLKVSPLKVAKEDKTLP 304
Query: 350 --------FQYSESVDEGFPNVTFHFEN---SVSLKVYPHEYL 381
F+ V F +T +F N + L + P YL
Sbjct: 305 ICWKGAKPFKSVLEVKNFFKTITINFTNGRRNTQLYLAPELYL 347
>gi|362799904|dbj|BAL41445.1| aspartyl protease 1 [Linum grandiflorum]
Length = 449
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 107/397 (26%), Positives = 162/397 (40%), Gaps = 65/397 (16%)
Query: 38 RERSLSLLKEHDARRQQRILAGVDLPLGGSSR---------PDGVGLYYAKIGIGTPPKD 88
R+ LS L + R+ A + SR P G G Y + IGTPP
Sbjct: 34 RDSPLSPLHTPNLTFSDRLQASFLRAISRQSRHVDFQTDLLPSG-GEYMMNLSIGTPPFP 92
Query: 89 YYVQVDTGSDIMWVNCIQCKEC-PRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGG 147
DTGSD+ W+ C +C P++ ++D +S+T + C C+ +
Sbjct: 93 ILAIADTGSDLTWLQSKPCDQCYPQKGP------IFDPSNSTTFHKLPCTTAPCNALDES 146
Query: 148 PLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSG 207
+ CT T+C Y YGD S TTGY D V S ++ ++ FGCG R G
Sbjct: 147 ARS-CTDPTTCGYTYSYGDHSYTTGYLASDTVTVGNASVQIR------NVAFGCGTRNGG 199
Query: 208 NLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGING-------------- 253
N D E GI+G G N S +SQL + G K F++CL +
Sbjct: 200 NFD----EQGSGIVGLGGGNLSFVSQLGDTIG--KKFSYCLLPLENEISSQPSDSPATSR 253
Query: 254 -----GGIFAIGHVVQPEVNKTPLVPNQP--HYSINMTAVQVGLDFL-----NLPTDVFG 301
+F+ TPLV +P +Y + + A+ VG L + T +
Sbjct: 254 IVFGDNPVFSSSSTNGVVFATTPLVNKEPSTYYYLTIEAITVGRKKLLYSSSSSKTASYD 313
Query: 302 VGDNKGT-----IIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEY--TCFQYSE 354
G IIDSGTTL +L E Y L + ++ + +V+ V + CF+ +
Sbjct: 314 SGSKSSVEEGNIIIDSGTTLTFLEEEFYGALEAALVEEIKMERVNDVKNSMFSLCFKSGK 373
Query: 355 SVDEGFPNVTFHFENSVSLKVYP-HEYLFPFEDLWCI 390
E P + HF +++ P + ++ E L C
Sbjct: 374 EEVE-LPLMKVHFRGGADVELKPVNTFVRAEEGLVCF 409
>gi|242066168|ref|XP_002454373.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
gi|241934204|gb|EES07349.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
Length = 458
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 98/347 (28%), Positives = 149/347 (42%), Gaps = 39/347 (11%)
Query: 39 ERSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSD 98
R L + + LA V L G S GVG Y ++G+GTP K Y + VDTGS
Sbjct: 87 SRPTKLRRGSSSSPDAESLASVPLGPGTSV---GVGNYVTRMGLGTPAKSYVMVVDTGSS 143
Query: 99 IMWVNCIQC-KECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTS 157
+ W+ C C C R+S +++ + SS+ V+C C + L T +TS
Sbjct: 144 LTWLQCSPCLVSCHRQSG-----PVFNPRSSSSYASVSCSAPQCDALTTATLNPSTCSTS 198
Query: 158 --CPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEE 215
C Y YGD S + GY +D V + S + +GCG G +
Sbjct: 199 NVCIYQASYGDSSFSVGYLSKDTVSFGSTSVP--------NFYYGCGQDNEGLFGQSA-- 248
Query: 216 ALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHVVQP-EVNKTPLVP 274
G+IG ++ S++ QLA S G F++CL + + P + + TP+
Sbjct: 249 ---GLIGLARNKLSLLYQLAPSMGYS--FSYCLPTSSSSSGYLSIGSYNPGQYSYTPMAK 303
Query: 275 NQ---PHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPL---V 328
+ Y I MT + V L++ + + TIIDSGT + LP VY L V
Sbjct: 304 SSLDDSLYFIKMTGITVAGKPLSVSASAY---SSLPTIIDSGTVITRLPTDVYSALSKAV 360
Query: 329 SKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKV 375
+ + P ++ D TCFQ ++ P V+ F +LK+
Sbjct: 361 AGAMKGTPRASAFSILD--TCFQ-GQASRLRVPQVSMAFAGGAALKL 404
>gi|357469591|ref|XP_003605080.1| Aspartic proteinase Asp1 [Medicago truncatula]
gi|355506135|gb|AES87277.1| Aspartic proteinase Asp1 [Medicago truncatula]
Length = 425
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 95/372 (25%), Positives = 151/372 (40%), Gaps = 42/372 (11%)
Query: 28 VFSVKYRYAGRERSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPK 87
+F + A + S+ H +++ + + G+ PDG LY I IG PPK
Sbjct: 22 IFPHHFSAANKNNSIPPTSIHS------LISSLVYTIKGNVYPDG--LYTVSINIGNPPK 73
Query: 88 DYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGG 147
Y + +DTGSD+ WV C + P G + + + + V C C
Sbjct: 74 PYELDIDTGSDLTWVQC----DGPDAPCKGCTMPKDKLYKPNGKQVVKCSDPICVATQST 129
Query: 148 PLTD--CTANT-SCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGAR 204
+ C+ + C Y Y D +ST G V+D + G +++ + + FGCG
Sbjct: 130 HVLGQICSKQSPPCVYNVQYADHASTLGVLVRDYMHI----GSPSSSTKDPLVAFGCGYE 185
Query: 205 QSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHVVQ 264
Q + + GI+G G +S++SQL S G + + HCL GGG +G
Sbjct: 186 QKFSGPTPPHSKPAGILGLGNGKTSILSQLTSIGFIHNVLGHCLSA-EGGGYLFLGDKFV 244
Query: 265 PE--VNKTPLVPN--QPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLP 320
P + TP++ + + HY+ V L F PT G+ I DSG++ Y
Sbjct: 245 PSSGIVWTPIIQSSLEKHYNTG----PVDLFFNGKPTPAKGL----QIIFDSGSSYTYFS 296
Query: 321 EMVY--------EPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVS 372
VY L K +S+ D + F+ V+ F +T F S +
Sbjct: 297 SPVYTIVANMVNNDLKGKPLSRVKDPSLPICWKGVKPFKSLNEVNNYFKPLTLSFTKSKN 356
Query: 373 L--KVYPHEYLF 382
L ++ P YL
Sbjct: 357 LQFQLPPVAYLI 368
>gi|4063754|gb|AAC98462.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197956|gb|AAM15330.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
Length = 389
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 95/308 (30%), Positives = 133/308 (43%), Gaps = 50/308 (16%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y K+ IGTPP + +DTGS+ +W C+ C C +++ ++D SST K +
Sbjct: 59 YLMKLQIGTPPFEIEAVLDTGSEHIWTQCLPCVHCYNQTA-----PIFDPSKSSTFKEIR 113
Query: 136 CDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNG 195
CD T + SCPY +YG S T G V + V SG
Sbjct: 114 CD---------------THDHSCPYELVYGGKSYTKGTLVTETVTIHSTSGQPFVMPET- 157
Query: 196 SLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDG----- 250
I GCG SG + G++G + S+I+Q+ G + ++C G
Sbjct: 158 --IIGCGRNNSG-----FKPGFAGVVGLDRGPKSLITQMG--GEYPGLMSYCFAGKGTSK 208
Query: 251 INGG--GIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDN--K 306
IN G I A VV V P Y +N+ AV VG N + G + K
Sbjct: 209 INFGANAIVAGDGVVSTTVFVKTAKPG--FYYLNLDAVSVG----NTRIETVGTPFHALK 262
Query: 307 GTI-IDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTF 365
G I IDSG+TL Y PE Y LV K + +Q V + C+ YS+++D FP +T
Sbjct: 263 GNIVIDSGSTLTYFPES-YCNLVRKAV-EQVVTAVRFPRSDILCY-YSKTIDI-FPVITM 318
Query: 366 HFENSVSL 373
HF L
Sbjct: 319 HFSGGADL 326
>gi|242093566|ref|XP_002437273.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
gi|241915496|gb|EER88640.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
Length = 503
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 99/357 (27%), Positives = 143/357 (40%), Gaps = 60/357 (16%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQC-KECPRRSSLGIELTLYDIKDSSTGK 132
G Y I +GTP + V DTGSD WV C C C ++ L+ S+T
Sbjct: 163 GNYVVPIRLGTPAARFTVVFDTGSDTTWVQCQPCVAYCYQQKE-----PLFTPTKSATYA 217
Query: 133 FVTCDQEFC-----HGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQ--YDKVS 185
++C +C G GG C Y YGDGS T G++ QD + YD V
Sbjct: 218 NISCTSSYCSDLDTRGCSGG---------HCLYAVQYGDGSYTVGFYAQDTLTLGYDTVK 268
Query: 186 GDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFA 245
FGCG + G G++G G+ +S+ Q +FA
Sbjct: 269 ----------DFRFGCGEKNRGLFGKAA-----GLMGLGRGKTSVPVQAYDK--YSGVFA 311
Query: 246 HCLDGINGGG---IFAIGHVVQPEVNKTP-LVPNQP-HYSINMTAVQVGLDFLNLPTDVF 300
+C+ + G F G TP LV N P Y + MT ++VG L++P VF
Sbjct: 312 YCIPATSSGTGFLDFGPGAPAAANARLTPMLVDNGPTFYYVGMTGIKVGGHLLSIPATVF 371
Query: 301 GVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEY---TCFQ---YSE 354
+ G ++DSGT + LP YEPL S L T TC+ Y
Sbjct: 372 ---SDAGALVDSGTVITRLPPSAYEPLRSAFAKGMEGLGYKTAPAFSILDTCYDLTGYQG 428
Query: 355 SVDEGFPNVTFHFENSVSLKVYPHEYLFPFE-DLWCIGWQNSGMQSRDRKNMTLLGD 410
S+ P V+ F+ L V L+ + C+ + + D +MT++G+
Sbjct: 429 SI--ALPAVSLVFQGGACLDVDASGILYVADVSQACLAF----AANDDDTDMTIVGN 479
>gi|145324889|ref|NP_001077691.1| aspartyl protease [Arabidopsis thaliana]
gi|332194268|gb|AEE32389.1| aspartyl protease [Arabidopsis thaliana]
Length = 410
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 76/266 (28%), Positives = 122/266 (45%), Gaps = 25/266 (9%)
Query: 75 LYYAKIGIGTPP--KDYYVQVDTGSDIMWVNC-IQCKECPRRSSLGIELTLYDIKDSSTG 131
LYY +I +G P + Y++ +DTGS++ W+ C C C + ++ LY + +
Sbjct: 29 LYYTRILVGKPEDGQYYHLDIDTGSELTWIQCDAPCTSCAKGAN-----QLYKPRKDN-- 81
Query: 132 KFVTCDQEFCHGVYGGPLTD-CTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQT 190
V + FC V LT+ C C Y Y D S + G +D +G L
Sbjct: 82 -LVRSSEAFCVEVQRNQLTEHCENCHQCDYEIEYADHSYSMGVLTKDKFHLKLHNGSL-- 138
Query: 191 TSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL-D 249
++FGCG Q G L +T + DGI+G ++ S+ SQLAS G + + HCL
Sbjct: 139 --AESDIVFGCGYDQQGLLLNTLLKT-DGILGLSRAKISLPSQLASRGIISNVVGHCLAS 195
Query: 250 GINGGGIFAIGHVVQPEVNKT--PLVPNQ--PHYSINMTAVQVGLDFLNLPTDVFGVGDN 305
+NG G +G + P T P++ + Y + +T + G L+L + VG
Sbjct: 196 DLNGEGYIFMGSDLVPSHGMTWVPMLHDSRLDAYQMQVTKMSYGQGMLSLDGENGRVGK- 254
Query: 306 KGTIIDSGTTLAYLPEMVYEPLVSKI 331
+ D+G++ Y P Y LV+ +
Sbjct: 255 --VLFDTGSSYTYFPNQAYSQLVTSL 278
>gi|224115494|ref|XP_002332148.1| predicted protein [Populus trichocarpa]
gi|222875198|gb|EEF12329.1| predicted protein [Populus trichocarpa]
Length = 483
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 99/344 (28%), Positives = 150/344 (43%), Gaps = 63/344 (18%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNC----IQCKECPRRSSLGIELTLYDIKDSSTG 131
Y + IGTPP+ V +DTGSD+ W C C EC + + + + SS+
Sbjct: 80 YLISLSIGTPPQVIQVYMDTGSDLTWAPCGNISFDCIECDNYRNNRM-MASFSPSHSSSS 138
Query: 132 KFVTCDQEFCHGVYGG--PLTDCT---------ANTSC-----PYLEIYGDGSSTTGYFV 175
+C FC V+ PL CT +C P+ YG G TG
Sbjct: 139 HRDSCTSPFCIDVHSSDNPLDPCTMAGCSLSTLVKATCSWPCPPFAYTYGAGGVVTGTLT 198
Query: 176 QDVVQYDKVSG-DLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQL 234
+D + +V G +L T FGC A S+ E + GI GFG+ S+ SQL
Sbjct: 199 RDTL---RVHGRNLGVTQEIPRFCFGCVA-------SSYREPI-GIAGFGRGALSLPSQL 247
Query: 235 ASSGGVRKMFAHCL------DGINGGGIFAIGHVVQPEVN--------KTPLVPNQPHYS 280
G +RK F+HC + N IG + + K+P+ PN +Y
Sbjct: 248 ---GFLRKGFSHCFLAFKYANNPNISSPLIIGDIALTSKDDMQFTPMLKSPMYPN--YYY 302
Query: 281 INMTAVQVG-LDFLNLPTDV--FGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIIS---- 333
+ + A+ VG + +P+ + F N G ++DSGTT +LPE Y ++S + S
Sbjct: 303 VGLEAITVGNVSATEVPSSLREFDSLGNGGMLVDSGTTYTHLPEPFYSQVLSVLQSIINY 362
Query: 334 -QQPDLKVHTVHD---EYTCFQYSESVDEGFPNVTFHFENSVSL 373
+ D+++ T D + C S + P++TFHF N+ SL
Sbjct: 363 PRATDMEMRTGFDLCYKVPCQNNSILTGDLLPSITFHFLNNASL 406
>gi|343198386|gb|AEM05966.1| nodulin 41 [Phaseolus vulgaris]
Length = 437
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 99/319 (31%), Positives = 138/319 (43%), Gaps = 45/319 (14%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKF 133
G Y + IGTPP + DTGSD++WV C C C +S+ L+ SST
Sbjct: 88 GEYLMRFYIGTPPVERLATADTGSDLIWVQCSPCASCFPQST-----PLFQPLKSSTFMP 142
Query: 134 VTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSS-TTGYFVQDVVQYDKVSGDLQTTS 192
TC + C + C + C Y YGD S + G + +++D G +QT +
Sbjct: 143 TTCRSQPCTLLLPE-QKGCGKSGECIYTYKYGDQYSFSEGLLSTETLRFDS-QGGVQTVA 200
Query: 193 TNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI- 251
S FGCG N+ L GI+G G S++SQ+ G + F++CL +
Sbjct: 201 FPNSF-FGCGLYN--NITVFPSYKLTGIMGLGAGPLSLVSQIGDQIGHK--FSYCLLPLG 255
Query: 252 ----------NGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFG 301
N I G V P + K P +P +Y +N+ AV V +PT
Sbjct: 256 STSTSKLKFGNESIITGEGVVSTPMIIK-PWLPT--YYFLNLEAVTVAQK--TVPT---- 306
Query: 302 VGDNKG-TIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYT----CFQYSESV 356
G G IIDSGT L YL E Y + + Q L V V D + CF Y ++
Sbjct: 307 -GSTDGNVIIDSGTLLTYLGESFYYNFAASL---QESLAVELVQDVLSPLPFCFPYRDNF 362
Query: 357 DEGFPNVTFHFENS-VSLK 374
FP + F F + VSLK
Sbjct: 363 V--FPEIAFQFTGARVSLK 379
>gi|30683732|ref|NP_180371.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|28392898|gb|AAO41885.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|56382011|gb|AAV85724.1| At2g28040 [Arabidopsis thaliana]
gi|330252978|gb|AEC08072.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 395
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 95/308 (30%), Positives = 133/308 (43%), Gaps = 50/308 (16%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y K+ IGTPP + +DTGS+ +W C+ C C +++ ++D SST K +
Sbjct: 65 YLMKLQIGTPPFEIEAVLDTGSEHIWTQCLPCVHCYNQTA-----PIFDPSKSSTFKEIR 119
Query: 136 CDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNG 195
CD T + SCPY +YG S T G V + V SG
Sbjct: 120 CD---------------THDHSCPYELVYGGKSYTKGTLVTETVTIHSTSGQPFVMPET- 163
Query: 196 SLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDG----- 250
I GCG SG + G++G + S+I+Q+ G + ++C G
Sbjct: 164 --IIGCGRNNSG-----FKPGFAGVVGLDRGPKSLITQMG--GEYPGLMSYCFAGKGTSK 214
Query: 251 INGG--GIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDN--K 306
IN G I A VV V P Y +N+ AV VG N + G + K
Sbjct: 215 INFGANAIVAGDGVVSTTVFVKTAKPG--FYYLNLDAVSVG----NTRIETVGTPFHALK 268
Query: 307 GTI-IDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTF 365
G I IDSG+TL Y PE Y LV K + +Q V + C+ YS+++D FP +T
Sbjct: 269 GNIVIDSGSTLTYFPES-YCNLVRKAV-EQVVTAVRFPRSDILCY-YSKTIDI-FPVITM 324
Query: 366 HFENSVSL 373
HF L
Sbjct: 325 HFSGGADL 332
>gi|3036792|emb|CAA18482.1| putative protein (fragment) [Arabidopsis thaliana]
Length = 335
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 74/242 (30%), Positives = 115/242 (47%), Gaps = 28/242 (11%)
Query: 91 VQVDTGSDIMWVNCIQCKECPRRS----SLGIELTLYDIKDSSTGKFVTCDQEFCHGVYG 146
V +DTGSD+ WV C C +C + EL++Y+ K S+T K VTC+ C
Sbjct: 2 VALDTGSDLFWVPC-DCGKCAPTEGATYASEFELSIYNPKVSTTNKKVTCNNSLC----- 55
Query: 147 GPLTDCTAN-TSCPYLEIYGDG-SSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGAR 204
C ++CPY+ Y +ST+G ++DV+ + D + FGCG
Sbjct: 56 AQRNQCLGTFSTCPYMVSYVSAQTSTSGILMEDVMHL--TTEDKNPERVEAYVTFGCGQV 113
Query: 205 QSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHVVQ 264
QSG+ + A +G+ G G S+ S LA G V F+ C G +G G + G
Sbjct: 114 QSGSF--LDIAAPNGLFGLGMEKISVPSVLAREGLVADSFSMCF-GHDGVGRISFGDKGS 170
Query: 265 PEVNKTP--LVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEM 322
+ +TP L P+ P+Y+I +T V+VG ++ D + D+GT+ YL +
Sbjct: 171 SDQEETPFNLNPSHPNYNITVTRVRVGTTLID---------DEFTALFDTGTSFTYLVDP 221
Query: 323 VY 324
+Y
Sbjct: 222 MY 223
>gi|88174563|gb|ABD39356.1| chloroplast nucleoid DNA-binding protein [Oryza longistaminata]
Length = 323
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 92/347 (26%), Positives = 147/347 (42%), Gaps = 64/347 (18%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKEC---PRRSSLGIELTLYDIKDSSTGK 132
Y +G+GTP K +++DTGS WV C +C C PR T + ++ K
Sbjct: 1 YVISVGLGTPSKTQILEIDTGSSTSWVFC-ECDGCHTNPR--------TFLQSRSTTCAK 51
Query: 133 FVTCDQEFCHGVYGGPLTDCTANTS---CPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQ 189
V+C C + GG C + + CP+ Y DGS++ G QD + + D+Q
Sbjct: 52 -VSCGTSMC--LLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF----SDVQ 104
Query: 190 TTSTNGSLIFGC-----GARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMF 244
FGC GA + GN +DG++G G S++ Q S F
Sbjct: 105 KIP---GFSFGCNMDSFGANEFGN--------VDGLLGMGAGPMSVLKQ---SSPTFDGF 150
Query: 245 AHCL------DGI--NGGGIFAIG---HVVQPEVNKTPLVP---NQPHYSINMTAVQVGL 290
++CL G G F++G + +V T +V N + +++TA+ V
Sbjct: 151 SYCLPLQMSERGFFSKTTGYFSLGGKIAATRTDVRYTKMVARRKNTELFFVDLTAISVDG 210
Query: 291 DFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCF 350
+ L L +F KG + DSG+ L+Y+P+ L +I E C+
Sbjct: 211 ERLGLSPSIFS---RKGVVFDSGSELSYIPDRALSVLSQRIRELLLRRGAAEEESERNCY 267
Query: 351 QYSESVDEG-FPNVTFHFENSVSLKVYPH----EYLFPFEDLWCIGW 392
SVDEG P ++ HF++ + H E +D+WC+ +
Sbjct: 268 DM-RSVDEGDMPAISLHFDDGARFDLGSHGVFVERSVQEQDVWCLAF 313
>gi|30699261|ref|NP_850981.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|17065172|gb|AAL32740.1| nucellin-like protein [Arabidopsis thaliana]
gi|24899795|gb|AAN65112.1| nucellin-like protein [Arabidopsis thaliana]
gi|332197863|gb|AEE35984.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 466
Score = 91.7 bits (226), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 100/362 (27%), Positives = 144/362 (39%), Gaps = 44/362 (12%)
Query: 52 RQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNC-IQCKEC 110
+ +R+ + V P+ G+ P +G YY + IG PPK + + +DTGSD+ WV C C C
Sbjct: 45 QNRRLSSTVVFPVSGNVYP--LGYYYVLLNIGNPPKLFDLDIDTGSDLTWVQCDAPCNGC 102
Query: 111 --PRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCT-ANTSCPYLEIYGDG 167
PR + C C G+ C C Y Y D
Sbjct: 103 TKPRAKQY-----------KPNHNTLPCSHILCSGLDLPQDRPCADPEDQCDYEIGYSDH 151
Query: 168 SSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSN 227
+S+ G V D V +G + N L FGCG Q N GI+G G+
Sbjct: 152 ASSIGALVTDEVPLKLANGSIM----NLRLTFGCGYDQQ-NPGPHPPPPTAGILGLGRGK 206
Query: 228 SSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHVVQPE--VNKTPLVPNQPHYSINMTA 285
+ +QL S G + + HCL G G +IG + P V T L N P S N A
Sbjct: 207 VGLSTQLKSLGITKNVIVHCLSH-TGKGFLSIGDELVPSSGVTWTSLATNSP--SKNYMA 263
Query: 286 VQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEP---LVSKIISQQPDLKVHT 342
L F + T V G+ + DSG++ Y Y+ L+ K ++ +P
Sbjct: 264 GPAELLFNDKTTGVKGI----NVVFDSGSSYTYFNAEAYQAILDLIRKDLNGKPLTDTKD 319
Query: 343 VHDEYTCFQYS---ESVDEG---FPNVTFHFENSVS---LKVYPHEYLFPFED-LWCIGW 392
C++ +S+DE F +T F N + +V P YL E C+G
Sbjct: 320 DKSLPVCWKGKKPLKSLDEVKKYFKTITLRFGNQKNGQLFQVPPESYLIITEKGRVCLGI 379
Query: 393 QN 394
N
Sbjct: 380 LN 381
>gi|30699263|ref|NP_177872.3| aspartyl protease-like protein [Arabidopsis thaliana]
gi|332197862|gb|AEE35983.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 432
Score = 91.7 bits (226), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 100/362 (27%), Positives = 144/362 (39%), Gaps = 44/362 (12%)
Query: 52 RQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNC-IQCKEC 110
+ +R+ + V P+ G+ P +G YY + IG PPK + + +DTGSD+ WV C C C
Sbjct: 45 QNRRLSSTVVFPVSGNVYP--LGYYYVLLNIGNPPKLFDLDIDTGSDLTWVQCDAPCNGC 102
Query: 111 --PRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCT-ANTSCPYLEIYGDG 167
PR + C C G+ C C Y Y D
Sbjct: 103 TKPRAKQY-----------KPNHNTLPCSHILCSGLDLPQDRPCADPEDQCDYEIGYSDH 151
Query: 168 SSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSN 227
+S+ G V D V +G + N L FGCG Q N GI+G G+
Sbjct: 152 ASSIGALVTDEVPLKLANGSIM----NLRLTFGCGYDQQ-NPGPHPPPPTAGILGLGRGK 206
Query: 228 SSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHVVQPE--VNKTPLVPNQPHYSINMTA 285
+ +QL S G + + HCL G G +IG + P V T L N P S N A
Sbjct: 207 VGLSTQLKSLGITKNVIVHCLSH-TGKGFLSIGDELVPSSGVTWTSLATNSP--SKNYMA 263
Query: 286 VQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEP---LVSKIISQQPDLKVHT 342
L F + T V G+ + DSG++ Y Y+ L+ K ++ +P
Sbjct: 264 GPAELLFNDKTTGVKGI----NVVFDSGSSYTYFNAEAYQAILDLIRKDLNGKPLTDTKD 319
Query: 343 VHDEYTCFQYS---ESVDEG---FPNVTFHFENSVS---LKVYPHEYLFPFED-LWCIGW 392
C++ +S+DE F +T F N + +V P YL E C+G
Sbjct: 320 DKSLPVCWKGKKPLKSLDEVKKYFKTITLRFGNQKNGQLFQVPPESYLIITEKGRVCLGI 379
Query: 393 QN 394
N
Sbjct: 380 LN 381
>gi|356532672|ref|XP_003534895.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 449
Score = 91.7 bits (226), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 86/309 (27%), Positives = 132/309 (42%), Gaps = 38/309 (12%)
Query: 78 AKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCD 137
A I IG PP V +DTGSDI+WV C C C L L+D SST F
Sbjct: 103 ANISIGQPPIPQLVVMDTGSDILWVMCTPCTNCDNHLGL-----LFDPSMSST--FSPLC 155
Query: 138 QEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSL 197
+ C C+ P+ Y D S+ +G F +D V ++ + TS +
Sbjct: 156 KTPCD------FKGCSRCDPIPFTVTYADNSTASGMFGRDTVVFETTD---EGTSRIPDV 206
Query: 198 IFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL----DGING 253
+FGCG N+ + +GI+G S+ +++ + F++C+ D
Sbjct: 207 LFGCGH----NIGQDTDPGHNGILGLNNGPDSLATKIG------QKFSYCIGDLADPYYN 256
Query: 254 GGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNK--GTIID 311
+G E TP + Y + M + VG L++ + F + N+ G IID
Sbjct: 257 YHQLILGEGADLEGYSTPFEVHNGFYYVTMEGISVGEKRLDIAPETFEMKKNRTGGVIID 316
Query: 312 SGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYT----CFQYSESVD-EGFPNVTFH 366
+G+T+ +L + V+ L+SK + E + CF S S D GFP VTFH
Sbjct: 317 TGSTITFLVDSVHR-LLSKEVRNLLGWSFRQTTIEKSPWMQCFYGSISRDLVGFPVVTFH 375
Query: 367 FENSVSLKV 375
F + L +
Sbjct: 376 FADGADLAL 384
>gi|242081123|ref|XP_002445330.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
gi|241941680|gb|EES14825.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
Length = 543
Score = 91.7 bits (226), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 91/321 (28%), Positives = 128/321 (39%), Gaps = 49/321 (15%)
Query: 83 GTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFC- 141
G+P + V VDTGSD+ WV C C C + L+D S+T V C+ C
Sbjct: 197 GSPAANLTVIVDTGSDLTWVQCKPCSACYAQRD-----PLFDPAGSATYAAVRCNASACA 251
Query: 142 ---HGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLI 198
G P + N C Y YGDGS + G D V S D +
Sbjct: 252 ASLKAATGTPGSCGGGNERCYYALAYGDGSFSRGVLATDTVALGGASLD--------GFV 303
Query: 199 FGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLA-SSGGVRKMFAHCLDGINGG--- 254
FGCG G T G++G G++ S++SQ A GGV F++CL G
Sbjct: 304 FGCGLSNRGLFGGTA-----GLMGLGRTELSLVSQTALRYGGV---FSYCLPATTSGDAS 355
Query: 255 GIFAIGHVVQPEVNKTPLV-------PNQ-PHYSINMTAVQVGLDFLNLPTDVFGVGDNK 306
G ++G N TP+ P Q P Y +N+T VG L G+G +
Sbjct: 356 GSLSLGGDASSYRNTTPVAYTRMIADPAQPPFYFLNVTGAAVGGTALA----AQGLGASN 411
Query: 307 GTIIDSGTTLAYLPEMVYEPLVSKIISQ-----QPDLKVHTVHDEYTCFQYSESVDEGFP 361
+IDSGT + L VY + ++ Q P ++ D TC+ + + P
Sbjct: 412 -VLIDSGTVITRLAPSVYRGVRAEFTRQFAAAGYPTAPGFSILD--TCYDLTGHDEVKVP 468
Query: 362 NVTFHFENSVSLKVYPHEYLF 382
+T E + V LF
Sbjct: 469 LLTLRLEGGAEVTVDAAGMLF 489
>gi|12323376|gb|AAG51657.1|AC010704_1 nucellin-like protein; 27671-25467 [Arabidopsis thaliana]
Length = 427
Score = 91.7 bits (226), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 99/360 (27%), Positives = 143/360 (39%), Gaps = 45/360 (12%)
Query: 52 RQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNC-IQCKEC 110
+ +R+ + V P+ G+ P +G YY + IG PPK + + +DTGSD+ WV C C C
Sbjct: 45 QNRRLSSTVVFPVSGNVYP--LGYYYVLLNIGNPPKLFDLDIDTGSDLTWVQCDAPCNGC 102
Query: 111 PRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCT-ANTSCPYLEIYGDGSS 169
K + C C G+ C C Y Y D +S
Sbjct: 103 --------------TKYKPNHNTLPCSHILCSGLDLPQDRPCADPEDQCDYEIGYSDHAS 148
Query: 170 TTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSS 229
+ G V D V +G + N L FGCG Q N GI+G G+
Sbjct: 149 SIGALVTDEVPLKLANGSIM----NLRLTFGCGYDQQ-NPGPHPPPPTAGILGLGRGKVG 203
Query: 230 MISQLASSGGVRKMFAHCLDGINGGGIFAIGHVVQPE--VNKTPLVPNQPHYSINMTAVQ 287
+ +QL S G + + HCL G G +IG + P V T L N P S N A
Sbjct: 204 LSTQLKSLGITKNVIVHCLSH-TGKGFLSIGDELVPSSGVTWTSLATNSP--SKNYMAGP 260
Query: 288 VGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEP---LVSKIISQQPDLKVHTVH 344
L F + T V G+ + DSG++ Y Y+ L+ K ++ +P
Sbjct: 261 AELLFNDKTTGVKGI----NVVFDSGSSYTYFNAEAYQAILDLIRKDLNGKPLTDTKDDK 316
Query: 345 DEYTCFQYS---ESVDEG---FPNVTFHFENSVS---LKVYPHEYLFPFED-LWCIGWQN 394
C++ +S+DE F +T F N + +V P YL E C+G N
Sbjct: 317 SLPVCWKGKKPLKSLDEVKKYFKTITLRFGNQKNGQLFQVPPESYLIITEKGRVCLGILN 376
>gi|302780643|ref|XP_002972096.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
gi|300160395|gb|EFJ27013.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
Length = 393
Score = 91.7 bits (226), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 98/369 (26%), Positives = 153/369 (41%), Gaps = 61/369 (16%)
Query: 21 GVSSNHGVFSVKYRYAGRERSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKI 80
GV + + ++ + R R ++ + V+ PL PDG G Y I
Sbjct: 5 GVKRSEAIRALVAKSHARVRWMAARANSSSWSSMAGTTDVESPL----HPDGGG-YVMDI 59
Query: 81 GIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEF 140
+GTP K + DTGSD++WV C C S G T++D + SST + + C +
Sbjct: 60 SVGTPGKRFRAIADTGSDLVWVQSEPCTGC----SGG---TIFDPRQSSTFREMDCSSQL 112
Query: 141 CHGVYGGPLTDCTANTS-CPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIF 199
C + G C +S C Y YG G T G F +D + S Q S
Sbjct: 113 CAELPG----SCEPGSSTCSYSYEYGSG-ETEGEFARDTISLGTTSDGSQKFP---SFAV 164
Query: 200 GCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAI 259
GCG SG + +DG++G G+ S+ SQL S + F++CL IN
Sbjct: 165 GCGMVNSGF------DGVDGLVGLGQGPVSLTSQL--SAAIDSKFSYCLVDINS------ 210
Query: 260 GHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVF----------------GVG 303
+ +PL+ P +++ T +Q + P+D + +G
Sbjct: 211 ------QSESSPLLFG-PSAALHGTGIQS--TKITPPSDTYPTYYLLTVNGIAVAGQTMG 261
Query: 304 DNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVH-TVHDEYTCFQYSESVDEGFPN 362
TIIDSGTTL Y+P VY ++S++ S +V + C+ S + + FP
Sbjct: 262 SPGTTIIDSGTTLTYVPSGVYGRVLSRMESMVTLPRVDGSSMGLDLCYDRSSNRNYKFPA 321
Query: 363 VTFHFENSV 371
+T +
Sbjct: 322 LTIRLAGAT 330
>gi|297720195|ref|NP_001172459.1| Os01g0608366 [Oryza sativa Japonica Group]
gi|53792202|dbj|BAD52835.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
Group]
gi|255673454|dbj|BAH91189.1| Os01g0608366 [Oryza sativa Japonica Group]
Length = 452
Score = 91.7 bits (226), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 86/316 (27%), Positives = 126/316 (39%), Gaps = 32/316 (10%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIEL-TLYDIKDSSTGKFV 134
Y +G+G+P V +DTGSD+ WV QC+ CP S L+D SST
Sbjct: 108 YVISVGLGSPAVTQRVVIDTGSDVSWV---QCEPCPAPSPCHAHAGALFDPAASSTYAAF 164
Query: 135 TCDQEFCHGVY-GGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTST 193
C C + G C A + C Y+ YGDGS+TTG + DV+ L +
Sbjct: 165 NCSAAACAQLGDSGEANGCDAKSRCQYIVKYGDGSNTTGTYSSDVLT-------LSGSDV 217
Query: 194 NGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGING 253
FGC L + ++ DG+IG G S +SQ A+ G K F +CL
Sbjct: 218 VRGFQFGC---SHAELGAGMDDKTDGLIGLGGDAQSPVSQTAARYG--KSFFYCLPATPA 272
Query: 254 GGIF-------AIGHVVQPEVNKTPLVPNQP---HYSINMTAVQVGLDFLNLPTDVFGVG 303
F + G TP++ ++ +Y + + VG L L VF
Sbjct: 273 SSGFLTLGAPASGGGGGASRFATTPMLRSKKVPTYYFAALEDIAVGGKKLGLSPSVFAA- 331
Query: 304 DNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPD-LKVHTVHDEYTCFQYSESVDEGFPN 362
G+++DSGT + LP Y L S + + + TCF ++ P
Sbjct: 332 ---GSLVDSGTVITRLPPAAYAALSSAFRAGMTRYARAEPLGILDTCFNFTGLDKVSIPT 388
Query: 363 VTFHFENSVSLKVYPH 378
V F + + H
Sbjct: 389 VALVFAGGAVVDLDAH 404
>gi|215694947|dbj|BAG90138.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 100
Score = 91.7 bits (226), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 45/89 (50%), Positives = 54/89 (60%), Gaps = 2/89 (2%)
Query: 42 LSLLKEHDARRQQRILAGVDLPLGG--SSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDI 99
+ L+ HD R L D LGG GLYY +IGIGTP +YYVQVDTGS
Sbjct: 10 IGALQTHDRNRHLSRLVAADFSLGGLGGISTSSTGLYYTEIGIGTPAMEYYVQVDTGSSA 69
Query: 100 MWVNCIQCKECPRRSSLGIELTLYDIKDS 128
WVNCI CK+CPR+S + +LTLYD + S
Sbjct: 70 FWVNCIPCKQCPRKSDILKKLTLYDPRSS 98
>gi|302811785|ref|XP_002987581.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
gi|300144735|gb|EFJ11417.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
Length = 511
Score = 91.3 bits (225), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 85/349 (24%), Positives = 152/349 (43%), Gaps = 40/349 (11%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
YY + +GTP + + +DTGSD+ W+ C+ CK+C + ++ + SS+ +
Sbjct: 139 YYVPLQVGTPAVEVVLIMDTGSDVSWIQCVPCKDC-----VPALRPPFNPRHSSSFFKLP 193
Query: 136 CDQEFCHGVYGGPLTDCTAN-TSCPYLEIYGDGSSTTGYFVQDVVQYDKVS-GDLQTTST 193
C C VY G C+ + +C + YGDGS ++G + + + + GD +
Sbjct: 194 CASSTCTNVYQGVKPFCSPSGRTCLFSIQYGDGSLSSGLLAMETIAGNTPNFGDGEPVKL 253
Query: 194 NGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHC----LD 249
+ ++ GC L + G++G + S SQL+S + F+HC +
Sbjct: 254 S-NITLGCADIDREGLPT----GASGLLGMDRRPISFPSQLSSRYA--RKFSHCFPDKIA 306
Query: 250 GINGGGIFAIGH--VVQPEVNKTPLVPNQPHYSINMTAVQVGL-----DFLNLPT----- 297
+N G+ G ++ P + TPLV N S ++ VGL D LP
Sbjct: 307 HLNSSGLVFFGESDIISPYLRYTPLVQNPAVPSASLDYYYVGLVGISVDESRLPLSHKNF 366
Query: 298 DVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYT-CFQYSESV 356
D+ V + GTIIDSGT YL + ++ + + +++ L + +T C+ +
Sbjct: 367 DIDKVTGSGGTIIDSGTAFTYLKKPAFQAMRREFLARTSHLAKVDDNSGFTPCYNITSGT 426
Query: 357 ----DEGFPNVTFHFENSVSLKVYPHEYLFPF-----EDLWCIGWQNSG 396
P++T HF + + + + L P + C+ + SG
Sbjct: 427 AALESTILPSITLHFRGGLDVVLPKNSILIPVSSSEEQTTLCLAFLMSG 475
>gi|356540369|ref|XP_003538662.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
At2g35615-like [Glycine max]
Length = 364
Score = 91.3 bits (225), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 86/328 (26%), Positives = 137/328 (41%), Gaps = 42/328 (12%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKF 133
G Y K+ +GTPP D Y VDT SD++W C C+ C Y K+
Sbjct: 29 GDYLMKLTLGTPPVDVYGLVDTDSDLVWAQCTPCQGC------------YKQKNPMFDPL 76
Query: 134 VTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTST 193
C+ F H C+ +C Y+ Y D S+T G +++ + G S
Sbjct: 77 KECNSFFDHS--------CSPEKACDYVYAYADDSATKGMLAKEIATFSSTDGKPIVES- 127
Query: 194 NGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGING 253
+IFGCG +G + + + S++SQ+ + G ++ F+ CL +
Sbjct: 128 ---IIFGCGHNNTGVFNENDMGLIGLG----GGPLSLVSQMGNLYGSKR-FSQCLVPFHA 179
Query: 254 ----GGIFAIGH---VVQPEVNKTPLVPN--QPHYSINMTAVQVGLDFLNLPTDVFGVGD 304
G ++G V V TPLV Q Y + + + VG F+ P + +
Sbjct: 180 DPHTSGTISLGEASDVSGEGVVTTPLVSEEGQTPYLVTLEGISVGDTFV--PFNSSEMLS 237
Query: 305 NKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVT 364
+IDSGT YLP+ Y+ LV ++ Q +H D T Y + P +T
Sbjct: 238 KGNIMIDSGTPETYLPQEFYDRLVEELKVQINLPPIHVDPDLGTQLCYKSETNLEGPILT 297
Query: 365 FHFENSVSLKVYPHEYLFPFED-LWCIG 391
HFE + +K+ P + P +D ++C
Sbjct: 298 AHFEGA-DVKLLPLQTFIPPKDGVFCFA 324
>gi|125527257|gb|EAY75371.1| hypothetical protein OsI_03267 [Oryza sativa Indica Group]
Length = 484
Score = 91.3 bits (225), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 99/414 (23%), Positives = 152/414 (36%), Gaps = 74/414 (17%)
Query: 38 RERSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGS 97
RER ++ + RR + +PL + G G Y+ + +GTP + + + DTGS
Sbjct: 51 RER-MAFISSRGRRRAAETASAFAMPLSSGAY-TGTGQYFVRFRVGTPAQPFLLVADTGS 108
Query: 98 DIMWVNC------------------IQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQE 139
D+ WV C PRR+ + S T + C
Sbjct: 109 DLTWVKCHRAAAAASASPRNASSLPAPAPASPRRT--------FRPDKSRTWAPIPCSSA 160
Query: 140 FCHGVYGGPLTDC-TANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLI 198
C L C T C Y Y DGS+ G D +SG + ++
Sbjct: 161 TCRESLPFSLAACATPANPCAYDYRYKDGSAARGTVGVDSATI-ALSGRAARKAKLRGVV 219
Query: 199 FGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL-DGI---NGG 254
GC +G + A DG++ G SN S S+ AS G R F++CL D + N
Sbjct: 220 LGCTTSYNGQ----SFLASDGVLSLGYSNISFASRAASRFGGR--FSYCLVDHLAPRNAT 273
Query: 255 GIFAIG-----HVVQPE---------------------VNKTPLV---PNQPHYSINMTA 285
G +P +TPLV +P Y++ +
Sbjct: 274 SYLTFGPNPAFSSRRPSEGIASCKPAPAPTPAPAGAPGARQTPLVLDHRTRPFYAVTVKG 333
Query: 286 VQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHD 345
V V + L +P V+ V G I+DSGT+L L + Y +V+ + + L T+
Sbjct: 334 VSVAGELLKIPRAVWDVEQGGGAILDSGTSLTMLAKPAYRAVVAALSKRLAGLPRVTMDP 393
Query: 346 EYTCFQYSES----VDEGFPNVTFHFENSVSLKVYPHEYLF-PFEDLWCIGWQN 394
C+ ++ V P + HF S L+ Y+ + CIG Q
Sbjct: 394 FDYCYNWTSPSGSDVAAPLPMLAVHFAGSARLEPPAKSYVIDAAPGVKCIGLQE 447
>gi|449457263|ref|XP_004146368.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 469
Score = 90.9 bits (224), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 86/321 (26%), Positives = 143/321 (44%), Gaps = 42/321 (13%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
+ + IG+PP V VDTGS ++WV C+ C C ++S+ + +D S + K +
Sbjct: 104 FLVNLSIGSPPVTQLVVVDTGSSLLWVQCLPCINCFQQST-----SWFDPLKSVSFKTLG 158
Query: 136 CDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQD-----------VVQYDKV 184
C + + G C Y Y G S+ G ++ V QY+ +
Sbjct: 159 CGFPGYNYING---YKCNRFNQAEYKLRYLGGDSSQGILAKESLLFETLDEGRVFQYNAI 215
Query: 185 SGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGK-SNSSMISQLASSGGVRKM 243
S + + ++ FGCG N+ + N++A +G+ G G + +M +QL +
Sbjct: 216 STQISKIKKS-NITFGCGHM---NIKTNNDDAYNGVFGLGAYPHITMATQLGNK------ 265
Query: 244 FAHCLDGINGGGIFAIGHVV-----QPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTD 298
F++C+ IN ++ H+V E + TPL + HY + + ++ VG L + +
Sbjct: 266 FSYCIGDIN-NPLYTHNHLVLGQGSYIEGDSTPLQIHFGHYYVTLQSISVGSKTLKIDPN 324
Query: 299 VFGVGDN--KGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDL--KVHTVHD-EYTCFQYS 353
F + + G +IDSG T L +E L +I+ L ++ T E CF+
Sbjct: 325 AFKISSDGSGGVLIDSGMTYTKLANGGFELLYDEIVDLMKGLLERIPTQRKFEGLCFKGV 384
Query: 354 ESVD-EGFPNVTFHFENSVSL 373
S D GFP VTFHF L
Sbjct: 385 VSRDLVGFPAVTFHFAGGADL 405
>gi|224091849|ref|XP_002309371.1| predicted protein [Populus trichocarpa]
gi|222855347|gb|EEE92894.1| predicted protein [Populus trichocarpa]
Length = 438
Score = 90.9 bits (224), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 78/307 (25%), Positives = 131/307 (42%), Gaps = 26/307 (8%)
Query: 75 LYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFV 134
L+ +G PP +DTGS ++W+ C CK C ++ I ++D SST +
Sbjct: 101 LFLVNFSMGQPPVPQLAIMDTGSSLLWIQCAPCKSCSQQ----IIGPMFDPSISSTYDSL 156
Query: 135 TCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTN 194
+C C P +C +++ C Y + Y +G + G + + + S D + N
Sbjct: 157 SCKNIICR---YAPSGECDSSSQCVYNQTYVEGLPSVGVIATEQLIFG--SSDEGRNAVN 211
Query: 195 GSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGG 254
+++FGC R +GN + G+ G G +S+++Q+ S F++C+ I
Sbjct: 212 -NVLFGCSHR-NGNY---KDRRFTGVFGLGSGITSVVNQMGSK------FSYCIGNIADP 260
Query: 255 GI----FAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDN-KGTI 309
+ V E TPL HY + + + VG L + F + + I
Sbjct: 261 DYSYNQLVLSEGVNMEGYSTPLDVVDGHYQVILEGISVGETRLVIDPSAFKRTEKQRRVI 320
Query: 310 IDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVD-EGFPNVTFHFE 368
IDSGT +L E Y L ++ + + + + C++ D GFP VTFHF
Sbjct: 321 IDSGTAPTWLAENEYRALEREVRNLLDRFLTPFMRESFLCYKGKVGQDLVGFPAVTFHFA 380
Query: 369 NSVSLKV 375
L V
Sbjct: 381 EGADLVV 387
>gi|77808087|gb|AAS48510.2| aspartic protease [Fagopyrum esculentum]
gi|82780908|gb|ABB88696.2| aspartic proteinase-like protein [Fagopyrum esculentum]
Length = 447
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 86/364 (23%), Positives = 151/364 (41%), Gaps = 46/364 (12%)
Query: 43 SLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWV 102
SL + +R + V LP + P G Y +GTPP+ + +DTGS ++W
Sbjct: 45 SLSRARHLKRPPTLTGKVTLP----AYPRSYGGYSVIFSLGTPPQKVSLVLDTGSSLVWT 100
Query: 103 NCI------QCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANT 156
C C+ C ++ +Y SST + + C C+ V+G L +C+
Sbjct: 101 PCTIPTATYTCQNCTFSGVDPTKIPIYARNKSSTVQSLPCRSPKCNWVFGSDL-NCSTTK 159
Query: 157 SCPYLEI-YGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEE 215
CPY + YG G STTG V DV+ K+ + +FGC +L S +
Sbjct: 160 RCPYYGLEYGLG-STTGQLVSDVLGLSKL-------NRIPDFLFGC------SLVSNRQP 205
Query: 216 ALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAI------------GHVV 263
+GI GFG+ +S+ +QL + + +H D G + G
Sbjct: 206 --EGIAGFGRGLASIPAQLGLTKFSYCLVSHRFDDTPQSGDLVLHRGRRHADAAANGVAY 263
Query: 264 QPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTD--VFGVGDNKGTIIDSGTTLAYLPE 321
P L P +Y I+++ + VG + +P V + G I+DSG+T ++
Sbjct: 264 APFTKSPALSPYSEYYYISLSKILVGGKDVPIPPRYLVPSKEGDGGMIVDSGSTFTFMER 323
Query: 322 MVYEPLVSKIISQQPDLK-VHTVHDEY---TCFQYSESVDEGFPNVTFHFENSVSLKVYP 377
++++P+ ++ K + D C+ + + P +TF F+ ++ +
Sbjct: 324 IIFDPVARELEKHMTKYKRAKEIEDSSGLGPCYNITGQSEVDVPKLTFSFKGGANMDLPL 383
Query: 378 HEYL 381
+Y
Sbjct: 384 TDYF 387
>gi|297805036|ref|XP_002870402.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316238|gb|EFH46661.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 435
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 97/330 (29%), Positives = 143/330 (43%), Gaps = 36/330 (10%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKF 133
G Y + +GTPP DTGS+++W C C +C + L+D K SST K
Sbjct: 92 GEYLMNLSLGTPPSPIMAVADTGSNLIWTQCKPCDDCYTQVD-----PLFDPKASSTYKD 146
Query: 134 VTCDQEFCHGVYGGPLTDC-TANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTS 192
V+C C + C T + +C YL Y DGS T G F D + S D +
Sbjct: 147 VSCSSSQCTALENQ--ASCSTEDKTCSYLVSYADGSYTMGKFAVDTLTLG--STDNRPVQ 202
Query: 193 TNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGIN 252
++I GCG + T G++G G S+I QL S + F++CL N
Sbjct: 203 LK-NIIIGCGQNNA----VTFRNKSSGVVGLGGGAVSLIKQLGDS--IDGKFSYCLVPEN 255
Query: 253 GGGI---FAIGHVVQ-PEVNKTPLV--PNQPHYSINMTAVQVGLDFLNLP-TDVFGVGDN 305
F VV P TPLV Y + + ++ VG + P +++ G
Sbjct: 256 DQTSKINFGTNAVVSGPGTVSTPLVVKSRDTFYYLTLKSISVGSKNMQTPDSNIKG---- 311
Query: 306 KGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEY--TCFQYSESVDEGFPNV 363
+IDSGTTL LP Y + + + S + DE + Y+ + D P +
Sbjct: 312 -NMVIDSGTTLTLLPVKYYIEIENAVASL---INADKSKDERIGSSLCYNATADLNIPVI 367
Query: 364 TFHFENSVSLKVYPHEYLFPF-EDLWCIGW 392
T HFE + +K+YP+ F EDL C+ +
Sbjct: 368 TMHFEGA-DVKLYPYNSFFKVTEDLVCLAF 396
>gi|356557010|ref|XP_003546811.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 437
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 98/341 (28%), Positives = 140/341 (41%), Gaps = 45/341 (13%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKF 133
G Y + IGTPP + DTGSD++WV C C+ C + L++ SST K
Sbjct: 90 GEYLMTLYIGTPPVERLAIADTGSDLIWVQCSPCQNC-----FPQDTPLFEPLKSSTFKA 144
Query: 134 VTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTST 193
TCD + C V C C Y YGD S T G + + + +GD QT S
Sbjct: 145 ATCDSQPCTSVPPS-QRQCGKVGQCIYSYSYGDKSFTVGVVGTETLSFGS-TGDAQTVSF 202
Query: 194 NGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGING 253
S IFGCG + ++++ + G S++SQL G + F++CL +
Sbjct: 203 PSS-IFGCGVYNNFTFHTSDKVTGLVGL--GGGPLSLVSQLGPQIGYK--FSYCLLPFSS 257
Query: 254 G----------GIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVG 303
I VV + PL P+ Y +N+ AV +G V G
Sbjct: 258 NSTSKLKFGSEAIVTTNGVVSTPLIIKPLFPS--FYFLNLEAVTIG-------QKVVPTG 308
Query: 304 DNKGT-IIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDE----YTCFQYSESVDE 358
G IIDSGT L YL + Y V+ + Q L V + D CF Y D
Sbjct: 309 RTDGNIIIDSGTVLTYLEQTFYNNFVASL---QEVLSVESAQDLPFPFKFCFPYR---DM 362
Query: 359 GFPNVTFHFENSVSLKVYPHEYLFPFED--LWCIGWQNSGM 397
P + F F + S+ + P L +D + C+ S +
Sbjct: 363 TIPVIAFQFTGA-SVALQPKNLLIKLQDRNMLCLAVVPSSL 402
>gi|54290728|dbj|BAD62398.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 486
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 94/333 (28%), Positives = 147/333 (44%), Gaps = 45/333 (13%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKEC----PRRSSLGIELTLYDIKDSSTG 131
+ +G+GTP + + DTGSD+ WV C C P++ L+D SST
Sbjct: 144 FVVAVGLGTPAQPSALIFDTGSDLSWVQCQPCGSSGHCHPQQDP------LFDPSKSSTY 197
Query: 132 KFVTCDQEFCHGVYGGPLTDCTA-NTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQT 190
V C + C G L C+ NT+C YL YGDGSSTTG +D + L +
Sbjct: 198 AAVHCGEPQCAA--AGDL--CSEDNTTCLYLVRYGDGSSTTGVLSRDTLA-------LTS 246
Query: 191 TSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDG 250
+ FGCG R G+ +DG++G G+ S+ SQ A+S G +F++CL
Sbjct: 247 SRALTGFPFGCGTRNLGDFGR-----VDGLLGLGRGELSLPSQAAASFGA--VFSYCLPS 299
Query: 251 ING-GGIFAIGHVVQPEVN--------KTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFG 301
N G IG + + P P+ Y + + ++ +G L +P VF
Sbjct: 300 SNSTTGYLTIGATPATDTGAAQYTAMLRKPQFPS--FYFVELVSIDIGGYVLPVPPAVFT 357
Query: 302 VGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEY-TCFQYSESVDEGF 360
G GT++DSGT L YLP Y L + +D C+ ++ +
Sbjct: 358 RG---GTLLDSGTVLTYLPAQAYALLRDRFRLTMERYTPAPPNDVLDACYDFAGESEVVV 414
Query: 361 PNVTFHFENSVSLKV-YPHEYLFPFEDLWCIGW 392
P V+F F + ++ + +F E++ C+ +
Sbjct: 415 PAVSFRFGDGAVFELDFFGVMIFLDENVGCLAF 447
>gi|15234607|ref|NP_194733.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|4938480|emb|CAB43839.1| putative protein [Arabidopsis thaliana]
gi|7269904|emb|CAB80997.1| putative protein [Arabidopsis thaliana]
gi|67633776|gb|AAY78812.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332660311|gb|AEE85711.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 427
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 99/399 (24%), Positives = 171/399 (42%), Gaps = 65/399 (16%)
Query: 8 CLCIVLIATAA----VGGVSSNHGVFSVK--YRYAGRERSLSLLKEHDARRQQRILAGVD 61
C I ++ +A V + ++ ++S K + Y +E S+ L+ A+ I+A +
Sbjct: 16 CFSISVVHLSASPTLVLNLVHSYHIYSRKPPHVYHIKEASVERLEYLKAKTTGDIIAHL- 74
Query: 62 LPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELT 121
+ P + I IG+PP + +DT SD++W+ C+ C C +S L
Sbjct: 75 ----SPNVPIIPQAFLVNISIGSPPITQLLHMDTASDLLWIQCLPCINCYAQS-----LP 125
Query: 122 LYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANT-SCPYLEIYGDGSSTTGYFVQDVVQ 180
++D S T + TC Y P ANT SC Y Y D + + G ++++
Sbjct: 126 IFDPSRSYTHRNETCRT----SQYSMPSLKFNANTRSCEYSMRYVDDTGSKGILAREMLL 181
Query: 181 YDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEAL--DGIIGFGKSNSSMISQLASSG 238
++ + D +++ ++FGCG G E L GI+G G S++ +
Sbjct: 182 FNTIY-DESSSAALHDVVFGCGHDNYG-------EPLVGTGILGLGYGEFSLVHRFG--- 230
Query: 239 GVRKMFAHCLDGING----GGIFAIGHVVQPEV-NKTPLVPNQPHYSINMTAVQVGLDFL 293
K F++C ++ + +G + + TPL + Y + + A+ V D +
Sbjct: 231 ---KKFSYCFGSLDDPSYPHNVLVLGDDGANILGDTTPLEIHNGFYYVTIEAISV--DGI 285
Query: 294 NLPTDVFGVGDNK-----GTIIDSGTTLAYLPEMVYEPLVSKI------------ISQQP 336
LP D N GTIID+G +L L E Y+PL ++I +SQ
Sbjct: 286 ILPIDPRVFNRNHQTGLGGTIIDTGNSLTSLVEEAYKPLKNRIEDIFEGRFTAADVSQDD 345
Query: 337 DLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKV 375
+K+ + + + V+ GFP VTFHF L +
Sbjct: 346 MIKMECYNGNFE----RDLVESGFPIVTFHFSEGAELSL 380
>gi|18409320|ref|NP_566948.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|27754243|gb|AAO22575.1| unknown protein [Arabidopsis thaliana]
gi|332645259|gb|AEE78780.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 529
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 92/333 (27%), Positives = 150/333 (45%), Gaps = 34/333 (10%)
Query: 75 LYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRR-SSLGIE----LTLYDIKDSS 129
L+YA + +GTP + V +DTGSD+ W+ C C R +G+ L LY SS
Sbjct: 101 LHYANVSVGTPATWFLVALDTGSDLFWLPCNCGSTCIRDLKEVGLSQSRPLNLYSPNTSS 160
Query: 130 TGKFVTCDQEFCHGVYGGPLTDCTANTSCPY-LEIYGDGSSTTGYFVQDVVQYDKVSGDL 188
T + C + C G + +SCPY ++ + TTG +DV+ V+ D
Sbjct: 161 TSSSIRCSDDRCFGSS----RCSSPASSCPYQIQYLSKDTFTTGTLFEDVLHL--VTEDE 214
Query: 189 QTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL 248
++ GCG Q+G L S+ A++G++G G + S+ S LA + F+ C
Sbjct: 215 GLEPVKANITLGCGKNQTGFLQSS--AAVNGLLGLGLKDYSVPSILAKAKITANSFSMCF 272
Query: 249 -DGINGGGIFAIGHVVQPEVNKTPLVPNQPH--YSINMTAVQVGLDFLNLPTDVFGVGDN 305
+ I+ G + G + +TPL+P +P Y++++T V VG D VG
Sbjct: 273 GNIIDVVGRISFGDKGYTDQMETPLLPTEPSPTYAVSVTEVSVGGD---------AVGVQ 323
Query: 306 KGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTC-FQYSESVDEG---FP 361
+ D+GT+ +L E Y L++K K + E F Y S ++ FP
Sbjct: 324 LLALFDTGTSFTHLLEPEYG-LITKAFDDHVTDKRRPIDPELPFEFCYDLSPNKTTILFP 382
Query: 362 NVTFHFENSVSLKVYPHEYLFPFED---LWCIG 391
V FE + + ++ ED ++C+G
Sbjct: 383 RVAMTFEGGSQMFLRNPLFIVWNEDNSAMYCLG 415
>gi|449516339|ref|XP_004165204.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 456
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 83/311 (26%), Positives = 143/311 (45%), Gaps = 35/311 (11%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
+ + IG+PP V VDTGS ++WV C+ C C ++S+ + +D S + K +
Sbjct: 104 FLVNLSIGSPPVTQLVVVDTGSSLLWVQCLPCINCFQQST-----SWFDPLKSVSFKTLG 158
Query: 136 CDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKV-SGDLQTTSTN 194
C + + G C Y Y G S+ G ++ + ++ + G ++ +
Sbjct: 159 CGFPGYNYING---YKCNRFNQAEYKLRYLGGDSSQGILAKESLLFETLDEGKIKKS--- 212
Query: 195 GSLIFGCGARQSGNLDSTNEEALDGIIGFGK-SNSSMISQLASSGGVRKMFAHCLDGING 253
++ FGCG N+ + N++A +G+ G G + +M +QL + F++C+ IN
Sbjct: 213 -NITFGCGHM---NIKTNNDDAYNGVFGLGAYPHITMATQLGNK------FSYCIGDIN- 261
Query: 254 GGIFAIGHVV-----QPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDN--K 306
++ H+V E + TPL + HY + + ++ VG L + + F + +
Sbjct: 262 NPLYTHNHLVLGQGSYIEGDSTPLQIHFGHYYVTLQSISVGSKTLKIDPNAFKISSDGSG 321
Query: 307 GTIIDSGTTLAYLPEMVYEPLVSKIISQQPDL--KVHTVHD-EYTCFQYSESVD-EGFPN 362
G +IDSG T L +E L +I+ L ++ T E CF+ S D GFP
Sbjct: 322 GVLIDSGMTYTKLANGGFELLYDEIVDLMKGLLERIPTQRKFEGLCFKGVVSRDLVGFPA 381
Query: 363 VTFHFENSVSL 373
VTFHF L
Sbjct: 382 VTFHFAGGADL 392
>gi|224140036|ref|XP_002323393.1| predicted protein [Populus trichocarpa]
gi|222868023|gb|EEF05154.1| predicted protein [Populus trichocarpa]
Length = 459
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 98/378 (25%), Positives = 150/378 (39%), Gaps = 64/378 (16%)
Query: 41 SLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIM 100
SLSL + H + + + + PL P G Y + GTPP+ +DTGS ++
Sbjct: 52 SLSLSRAHHIKSPKTNFSLIKTPL----FPRSYGGYSISLNFGTPPQTTKFVMDTGSSLV 107
Query: 101 WVNCIQ---CKEC--PRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLT----- 150
W C C EC P GI L K SS+ K + C C ++G +
Sbjct: 108 WFPCTSRYLCSECNFPNIKKTGIPTFL--PKLSSSSKLIGCKNPRCSMIFGPEIQSKCQE 165
Query: 151 -DCTAN----TSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQ 205
D TA T PY+ YG GS T G + + + D T + GC
Sbjct: 166 CDSTAQNCTQTCPPYVIQYGSGS-TAGLLLSETL-------DFPNKKTIPDFLVGC---- 213
Query: 206 SGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL----------------D 249
++ S + +GI GFG+S S+ SQL G++K F++CL D
Sbjct: 214 --SIFSIKQP--EGIAGFGRSPESLPSQL----GLKK-FSYCLVSHAFDDTPTSSDLVLD 264
Query: 250 GINGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTD--VFGVGDNKG 307
+G G+ + K P + +Y + + + +G + +P V G N G
Sbjct: 265 TGSGSGVTKTAGLSHTPFLKNPTTAFRDYYYVLLRNIVIGDTHVKVPYKFLVPGTDGNGG 324
Query: 308 TIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYT----CFQYSESVDEGFPNV 363
TI+DSGTT ++ VYE + + Q V T T C+ S P++
Sbjct: 325 TIVDSGTTFTFMENPVYELVAKEFEKQMAHYTVATEIQNLTGLRPCYNISGEKSLSVPDL 384
Query: 364 TFHFENSVSLKVYPHEYL 381
F F+ + + Y
Sbjct: 385 IFQFKGGAKMALPLSNYF 402
>gi|195627138|gb|ACG35399.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 431
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 85/320 (26%), Positives = 131/320 (40%), Gaps = 47/320 (14%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y + G+GTP + + +DT +D W +C C CP S + SS+ +
Sbjct: 79 YVVRAGLGTPVQQLLLALDTSADATWSHCAPCDTCPAGSR-------FIPASSSSYASLP 131
Query: 136 CDQEFCHGVYGGPLTDCTAN-------TSCPYLEIYGDGSSTTGYFVQDVVQY--DKVSG 186
C ++C G P C AN +C + + + D +S D ++ D ++G
Sbjct: 132 CASDWCPLFEGQP---CPANQDASAPLPACAFSKPFAD-TSFQASLGSDTLRLGKDAIAG 187
Query: 187 DLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAH 246
FGC +G T G++G G+ S++SQ S+ +F++
Sbjct: 188 ----------YAFGCVGAVAG---PTTNLPKQGLLGLGRGPMSLLSQTGST--YNGVFSY 232
Query: 247 CLDGING---GGIFAIGHVVQPE-VNKTPLVPNQPH----YSINMTAVQVGLDFLNLPTD 298
CL G +G QP V TPL+ N PH Y +N+T + VG ++ +P
Sbjct: 233 CLPSYRSYYFSGSLRLGAAGQPRNVRYTPLLTN-PHRPSLYYVNVTGLSVGRTWVKVPAG 291
Query: 299 VFGV--GDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEY-TCFQYSES 355
F GT+IDSGT + VY L + Q +T + TCF E
Sbjct: 292 SFAFDPATGAGTVIDSGTVITRWTAPVYAALREEFRRQVAAPSGYTSLGAFDTCFNTDEV 351
Query: 356 VDEGFPNVTFHFENSVSLKV 375
G P VT H + V L +
Sbjct: 352 AAGGAPPVTLHMDGGVDLTL 371
>gi|413923782|gb|AFW63714.1| hypothetical protein ZEAMMB73_300584 [Zea mays]
Length = 458
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 87/306 (28%), Positives = 134/306 (43%), Gaps = 31/306 (10%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y +GIG+P + +DTGSD+ WV C C +C +L+D SST +
Sbjct: 122 YVITVGIGSPAVTQTMSMDTGSDVSWVQCKPCSQCHSEVD-----SLFDPSSSSTYSPFS 176
Query: 136 CDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNG 195
C C + + ++ C Y+ YGD SSTTG + D + +S
Sbjct: 177 CSSAPCAQLSQSQEGNGCMSSQCQYIVNYGDSSSTTGTYSSDTLTLG--------SSAMT 228
Query: 196 SLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGG 255
FGC +SG + + DG++G G S+ SQ A + G F++CL +G
Sbjct: 229 DFQFGCSQSESGGFN----DQTDGLMGLGGGAQSLASQTAGTFGT--AFSYCLPPTSGSS 282
Query: 256 IFAIGHVVQPEVNKTPLVPNQ---PHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDS 312
F KTP++ + +Y + + +++VG LNLPT VF + G+++DS
Sbjct: 283 GFLTLGTGSSGFVKTPMLRSTQIPTYYVVLLESIKVGSQQLNLPTSVF----SAGSLMDS 338
Query: 313 GTTLAYLPEMVYEPLVSKI---ISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFEN 369
GT + LP Y L S + Q P + D TCF +S P VT F
Sbjct: 339 GTIITRLPPTAYSALSSAFKAGMQQYPPATPSGILD--TCFDFSGQSSISIPTVTLVFSG 396
Query: 370 SVSLKV 375
++ +
Sbjct: 397 GAAVDL 402
>gi|190896608|gb|ACE96817.1| aspartyl protease [Populus tremula]
Length = 339
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 86/305 (28%), Positives = 135/305 (44%), Gaps = 42/305 (13%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y ++ +GTP + ++ +DT +D WV C C C T + S+T +
Sbjct: 45 YVVRVKLGTPGQQMFMVLDTSNDAAWVPCSGCTGC--------SSTTFLPNASTTLGSLD 96
Query: 136 CDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQY--DKVSGDLQTTST 193
C + C V G T +++C + + YG SS VQD + D + G
Sbjct: 97 CSEAQCSQVRGFSC-PATGSSACLFNQSYGGDSSLAATLVQDAITLANDVIPG------- 148
Query: 194 NGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGING 253
FGC SG G++G G+ S+ISQ + +F++CL
Sbjct: 149 ---FTFGCINAVSG-----GSIPPQGLLGLGRGPISLISQAGAM--YSGVFSYCLPSFKS 198
Query: 254 ---GGIFAIGHVVQPE-VNKTPLVPNQPH----YSINMTAVQVGLDFLNLPTD--VFGVG 303
G +G V QP+ + TPL+ N PH Y +N+T V VG + +P++ VF
Sbjct: 199 YYFSGSLKLGPVGQPKSIRTTPLLRN-PHRPSLYYVNLTGVSVGRIKVPIPSEQLVFDPN 257
Query: 304 DNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNV 363
GTIIDSGT + + VY + + +Q + + ++ TCF +E+ + P V
Sbjct: 258 TGAGTIIDSGTVITRFVQPVYFAIRDE-FRKQVNGPISSLGAFDTCF--AETNEAEAPAV 314
Query: 364 TFHFE 368
T HFE
Sbjct: 315 TLHFE 319
>gi|255553149|ref|XP_002517617.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223543249|gb|EEF44781.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 449
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 96/332 (28%), Positives = 149/332 (44%), Gaps = 43/332 (12%)
Query: 55 RILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRS 114
R L D+ GG G Y +I IG P + DTGSD++WV C C+ C +++
Sbjct: 79 RALVQSDIVPGG-------GEYLMRISIGNPQVEILAIADTGSDLIWVQCQPCEMCYKQN 131
Query: 115 SLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTAN---TSCPYLEIYGDGSSTT 171
S ++D + SS+ + V C EFC+ + G C A +C Y YGD S +
Sbjct: 132 S-----PIFDPRRSSSYRNVLCGNEFCNKL-DGEARSCDARGFVKTCGYTYSYGDQSFSD 185
Query: 172 GYFVQDVVQYDKVSGDLQTTSTNG---SLIFGCGARQSGNLDSTNEEALDGIIGFGKSNS 228
G+ + ++ S + T++ + FGCG + G D E GIIG G +
Sbjct: 186 GHLA--IERFGIGSTNSNTSAAIAYFQEVAFGCGTKNGGTFD----ELGSGIIGLGGGSM 239
Query: 229 SMISQLASSGGVRKMFAHCL----------DGINGGGIFAIGHVVQPEVNKTPLVPNQPH 278
S++SQL + F++CL IN G I V TPL+P +P
Sbjct: 240 SLVSQLGPK--LSGKFSYCLVPTSEQSNYTSKINFGNDINISG-SNYNVVSTPLLPKKPE 296
Query: 279 YSINMTAVQVGLDFLNLP-TDVFGVGDNKGT-IIDSGTTLAYLPEMVYEPLVSKIISQQP 336
+T + ++ LP T+++ KG IIDSGTTL +L + L S +
Sbjct: 297 TYYYLTLEAISVENKRLPYTNLWNGEVEKGNIIIDSGTTLTFLDSEFFNNLDSAVEEAVK 356
Query: 337 DLKVHTVHDEYT-CFQYSESVDEGFPNVTFHF 367
+V H + CF+ ++++ P +T HF
Sbjct: 357 GERVSDPHGLFNICFKDEKAIE--LPIITAHF 386
>gi|449433371|ref|XP_004134471.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449495479|ref|XP_004159853.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 424
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 95/350 (27%), Positives = 155/350 (44%), Gaps = 36/350 (10%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKF 133
G Y IG P +DT + ++WV C C G+ K S T +
Sbjct: 73 GEYLMSFNIGNPSSQVMGFLDTSNGLIWVQCSNCNSQCEPEKRGLTTKFLSSK-SFTYEM 131
Query: 134 VTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTST 193
C FC+ + G + +++ C Y +YGD +T+G D +D G L
Sbjct: 132 EPCGSNFCNSLTGFQTCN-SSDKWCKYRLVYGDNKATSGILSSDSFGFDTSDGMLVDV-- 188
Query: 194 NGSLIFGCG-ARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGIN 252
G L FGC A +G +E++ G +G ++ S+ISQL G++K F++CL N
Sbjct: 189 -GFLNFGCSEAPLTG-----DEQSYTGNVGLNQTPLSLISQL----GIKK-FSYCLVPFN 237
Query: 253 GGGIFA---IGHVVQPEVNKTPLV-PNQPHYSINMTAVQVGLD--FLNLPTDVFGVGDNK 306
G + G + +TPL+ PN Y + + + +G D + DV+ V D
Sbjct: 238 NLGSTSKMYFGSLPVTSGGQTPLLYPNSDAYYVKVLGISIGNDEPHFDGVFDVYEVRD-- 295
Query: 307 GTIIDSGTTLAYLPEMVYEPLVSKIISQQ--PDLKVHTVHDEYTCFQYSESVD-EGFPNV 363
G IID+G T + L ++ L++K ++ + P K CF+ + D E FP+V
Sbjct: 296 GWIIDTGITYSSLETDAFDSLLAKFLTLKDFPQRKDDPKERFELCFELQNANDLESFPDV 355
Query: 364 TFHFENSVSLKVYPHEYLFPFED--LWCIGWQNSGMQSRDRKNMTLLGDF 411
T HF+ + L + ED ++C+ SG +++LG+F
Sbjct: 356 TVHFDGA-DLILNVESTFVKIEDDGIFCLALLRSG------SPVSILGNF 398
>gi|115465837|ref|NP_001056518.1| Os05g0596000 [Oryza sativa Japonica Group]
gi|55733881|gb|AAV59388.1| unknown protein [Oryza sativa Japonica Group]
gi|57900669|gb|AAW57794.1| unknown protein [Oryza sativa Japonica Group]
gi|113580069|dbj|BAF18432.1| Os05g0596000 [Oryza sativa Japonica Group]
gi|215697162|dbj|BAG91156.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215768162|dbj|BAH00391.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 535
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 90/373 (24%), Positives = 150/373 (40%), Gaps = 65/373 (17%)
Query: 38 RERSLSLLKEHDARRQQR----ILAGVD---LPLGGSSRPDGVGLYYAKIGIGTPPKDYY 90
R L D RR R +++ D LP+ + VG+Y + IGTP Y
Sbjct: 65 RREHFRALMAKDMRRMMRQVPELMSKTDMFELPMRSALNIAQVGMYVVVVRIGTPALPYS 124
Query: 91 VQVDTGSDIMWVNCIQCKE----------CPRRSSLGIE--------------------L 120
+ ++T +++ W+NC + P +++ I+ +
Sbjct: 125 LALETANEVTWINCRLRRRKGKHPGRPHVPPAATTMSIQVDDDGGGGGSGGKSKVTKVIM 184
Query: 121 TLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTA---NTSCPYLEIYGDGSSTTGYFVQD 177
Y SS+ + C Q C + P C + NTSC Y ++ D + T+G + Q+
Sbjct: 185 NWYRPAKSSSWRRFRCSQRACMDL---PYNTCESPDQNTSCTYYQVMKDSTITSGIYGQE 241
Query: 178 VVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASS 237
G ++ L+ GC + G +++ DGI+ G S SS A
Sbjct: 242 KATVAVSDGTMKKLP---GLVIGCSTFEHGGAVNSH----DGILSLGNSPSSFGIAAARR 294
Query: 238 GGVRKMFAHCL----DGINGGGIFAIGH---VVQPEVNKTPLVPNQPHYSINMTAVQVGL 290
G R F CL G N G V P +TPL+ Y ++T + VG
Sbjct: 295 FGGRLSF--CLLATTSGRNASSYLTFGANPAVQAPGTMETPLLYRDVAYGAHVTGILVGG 352
Query: 291 DFLNLPTDVFGVG----DN--KGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVH 344
L++P +V+ G DN G I+D+GT++ YL VY+P+ + + S L +
Sbjct: 353 QPLDIPPEVWDEGPLGNDNPEAGIILDTGTSITYLVSAVYDPVTAALDSHLAHLPKAEIK 412
Query: 345 DEYTCFQYSESVD 357
C+ ++ + D
Sbjct: 413 GFEYCYNWTFAGD 425
>gi|242092878|ref|XP_002436929.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
gi|241915152|gb|EER88296.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
Length = 505
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 80/314 (25%), Positives = 132/314 (42%), Gaps = 38/314 (12%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCK-ECPRRSSLGIELTLYDIKDSSTGKFV 134
+ +G G+P ++Y + +DTGSD+ W+ C+ C C ++ ++D S+T V
Sbjct: 161 FVVTVGFGSPAQNYTLSIDTGSDVSWIQCLPCSGHCYKQHD-----PVFDPTKSATYSAV 215
Query: 135 TCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTN 194
C C G C+ + +C Y YGDGSST G V+ ++ +S L +T
Sbjct: 216 PCGHPQCAAAGG----KCSNSGTCLYKVTYGDGSSTAG-----VLSHETLS--LSSTRDL 264
Query: 195 GSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGG 254
FGCG G + G+ S+ SQ A++ G F++CL +
Sbjct: 265 PGFAFGCGQTNLGEFGGVDGLVGL-----GRGALSLPSQAAATFGA--TFSYCLPSYDTT 317
Query: 255 -GIFAIGHVVQP------EVNKTPLVPNQPH---YSINMTAVQVGLDFLNLPTDVFGVGD 304
G +G +V T ++ + + Y + + ++ +G L +P VF
Sbjct: 318 HGYLTMGSTTPAASNDDDDVQYTAMIQKEDYPSLYFVEVVSIDIGGYILPVPPTVF---T 374
Query: 305 NKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEY-TCFQYSESVDEGFPNV 363
GT+ DSGT L YLP Y L + K +D + TC+ ++ P V
Sbjct: 375 RDGTLFDSGTILTYLPPEAYASLRDRFKFTMTQYKPAPAYDPFDTCYDFTGHNAIFMPAV 434
Query: 364 TFHFENSVSLKVYP 377
F F + + P
Sbjct: 435 AFKFSDGAVFDLSP 448
>gi|225440729|ref|XP_002275391.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
gi|147789748|emb|CAN67404.1| hypothetical protein VITISV_025615 [Vitis vinifera]
Length = 450
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 94/393 (23%), Positives = 167/393 (42%), Gaps = 61/393 (15%)
Query: 41 SLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIM 100
S SL + H + + + P+ S P G + + GTPP+ VDTGSD++
Sbjct: 48 SASLSRAHHLKHGK-----TNPPVKTSLFPHSYGGHSISLSFGTPPQKLSFLVDTGSDVV 102
Query: 101 WVNCI---QCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVY------GGPLTD 151
W C C C ++ ++ ++D K SS+ K + C C Y G P
Sbjct: 103 WAPCTTDYTCTNCSFSAADPKKVPIFDPKLSSSSKILDCRNPKCVSTYFPYVHLGCP--R 160
Query: 152 CTANT-----SCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQS 206
C N+ +CPY YG G+S +GYF+ + +++ + T + + GC +
Sbjct: 161 CNGNSKHCSYACPYSTQYGTGAS-SGYFLLENLKFPR--------KTIRNFLLGCTTSAA 211
Query: 207 GNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHVV--- 263
L S D + GFG+S S+ Q+ GV+K FA+CL+ + G ++
Sbjct: 212 RELSS------DALAGFGRSMFSLPIQM----GVKK-FAYCLNSHDYDDTRNSGKLILDY 260
Query: 264 ----QPEVNKTPLVPNQP----HYSINMTAVQVGLDFLNLPTDVFGVGDN--KGTIIDSG 313
++ TP + + P +Y + + +++G L +P+ G + G IIDSG
Sbjct: 261 RDGKTKGLSYTPFLKSPPASAFYYHLGVKDIKIGNKLLRIPSKYLAPGSDGRSGVIIDSG 320
Query: 314 TTLA-YLPEMVYEPLVSKIISQ----QPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFE 368
A Y+ V++ + +++ Q + L+ T C+ ++ P + + F
Sbjct: 321 YGGAGYMTGPVFKIVTNELKKQMSKYRRSLEAETQTGLTPCYNFTGHKSIKIPPLIYQFR 380
Query: 369 NSVSLKVYPHEY--LFPFEDLWCIGWQNSGMQS 399
++ V Y + P E L C +G +
Sbjct: 381 GGANMVVPGKNYFGISPQESLACFLMDTNGTNA 413
>gi|414887401|tpg|DAA63415.1| TPA: hypothetical protein ZEAMMB73_414910 [Zea mays]
Length = 242
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 77/258 (29%), Positives = 122/258 (47%), Gaps = 34/258 (13%)
Query: 168 SSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSN 227
SS++G +D+V + + S +L+ +FGC ++G+L S + DGI+G G+
Sbjct: 2 SSSSGVLGEDIVSFGRES-ELKAQRA----VFGCENSETGDLFSQHA---DGIMGLGRGQ 53
Query: 228 SSMISQLASSGGVRKMFAHCLDGIN-GGGIFAIGHVVQPE----VNKTPLVPNQPHYSIN 282
S++ QL G + F+ C G++ GGG +G V P PL P+Y+I
Sbjct: 54 LSIMDQLVEKGVINDSFSLCYGGMDIGGGAMVLGGVPTPSDMVFSRSDPL--RSPYYNIE 111
Query: 283 MTAVQVGLDFLNLPTDVFGVGDNK-GTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVH 341
+ + V L + + +F D+K GT++DSGTT AYLPE + + S+ LK
Sbjct: 112 LKEIHVAGKALRVDSRIF---DSKHGTVLDSGTTYAYLPEQAFMAFKDAVTSKVHSLKKI 168
Query: 342 TVHD---EYTCFQYSE----SVDEGFPNVTFHFENSVSLKVYPHEYLFPFEDL---WCIG 391
D + CF + + E FP+V F N L + P YLF + +C+G
Sbjct: 169 RGPDPSYKDICFAGARRNVSKLHEVFPDVDMVFGNGQKLSLTPENYLFRHSKVDGAYCLG 228
Query: 392 WQNSGMQSRDRKNMTLLG 409
+G + TLLG
Sbjct: 229 VFQNG-----KDPTTLLG 241
>gi|147809812|emb|CAN71447.1| hypothetical protein VITISV_040904 [Vitis vinifera]
Length = 988
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 78/266 (29%), Positives = 114/266 (42%), Gaps = 49/266 (18%)
Query: 71 DGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSST 130
D G + + GTPP+ + + +DTGS I W C C C + S +D SST
Sbjct: 122 DEDGNFLVDVAFGTPPQKFKLILDTGSSITWTQCKACVHCLKDSHRH-----FDSLASST 176
Query: 131 GKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQT 190
F +C + NT Y YGD S++ G + D + L+
Sbjct: 177 YSFGSC------------IPSTVGNT---YNMTYGDKSTSVGNYGCDTMT-------LEP 214
Query: 191 TSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDG 250
+ FGCG G+ S DG++G G+ S +SQ AS +K+F++CL
Sbjct: 215 SDVFQKFQFGCGRNNEGDFGS----GADGMLGLGQGQLSTVSQTASK--FKKVFSYCLPE 268
Query: 251 INGGGIFAIGHVVQPEVNK---TPLVPNQP---------HYSINMTAVQVGLDFLNLPTD 298
N G G + + T LV N P +Y + + + VG LN+P+
Sbjct: 269 ENSIGSLLFGEKATSQSSSLKFTSLV-NGPGTSGLEESGYYFVKLLDISVGNKRLNIPSS 327
Query: 299 VFGVGDNKGTIIDSGTTLAYLPEMVY 324
VF + GTIIDSGT + LP+ Y
Sbjct: 328 VFA---SPGTIIDSGTVITRLPQRAY 350
>gi|22326716|ref|NP_196638.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|18700103|gb|AAL77663.1| AT5g10770/T30N20_40 [Arabidopsis thaliana]
gi|24111269|gb|AAN46758.1| At5g10770/T30N20_40 [Arabidopsis thaliana]
gi|332004211|gb|AED91594.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 474
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 100/373 (26%), Positives = 150/373 (40%), Gaps = 62/373 (16%)
Query: 60 VDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIE 119
DLP S G G Y +G+GTP D + DTGSD+ W QC+ C R
Sbjct: 117 TDLPAKDGSTL-GSGNYIVTVGLGTPKNDLSLIFDTGSDLTWT---QCQPCVR------- 165
Query: 120 LTLYDIKD------SSTGKF-VTCDQEFCHGVYGGPLTDCTAN------TSCPYLEIYGD 166
T YD K+ ST + V+C C G L+ T N ++C Y YGD
Sbjct: 166 -TCYDQKEPIFNPSKSTSYYNVSCSSAAC-----GSLSSATGNAGSCSASNCIYGIQYGD 219
Query: 167 GSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKS 226
S + G+ ++ L + + FGCG G + G++G G+
Sbjct: 220 QSFSVGFLAKEKFT-------LTNSDVFDGVYFGCGENNQGLF-----TGVAGLLGLGRD 267
Query: 227 NSSMISQLASSGGVRKMFAHCL-DGINGGGIFAIGHV-VQPEVNKTP---LVPNQPHYSI 281
S SQ A++ K+F++CL + G G + V TP + Y +
Sbjct: 268 KLSFPSQTATA--YNKIFSYCLPSSASYTGHLTFGSAGISRSVKFTPISTITDGTSFYGL 325
Query: 282 NMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKI---ISQQPDL 338
N+ A+ VG L +P+ VF G +IDSGT + LP Y L S +S+ P
Sbjct: 326 NIVAITVGGQKLPIPSTVF---STPGALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTT 382
Query: 339 KVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFE-DLWCIGWQNSGM 397
++ D TCF S P V F F +++ + F+ C+ +
Sbjct: 383 SGVSILD--TCFDLSGFKTVTIPKVAFSFSGGAVVELGSKGIFYVFKISQVCLAFAG--- 437
Query: 398 QSRDRKNMTLLGD 410
+ D N + G+
Sbjct: 438 -NSDDSNAAIFGN 449
>gi|194698750|gb|ACF83459.1| unknown [Zea mays]
gi|194703964|gb|ACF86066.1| unknown [Zea mays]
gi|219886221|gb|ACL53485.1| unknown [Zea mays]
gi|219886359|gb|ACL53554.1| unknown [Zea mays]
gi|223950085|gb|ACN29126.1| unknown [Zea mays]
gi|414865218|tpg|DAA43775.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 431
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 85/320 (26%), Positives = 130/320 (40%), Gaps = 47/320 (14%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y + G+GTP + + +DT +D W +C C CP S + SS+ +
Sbjct: 79 YVVRAGLGTPVQQLLLALDTSADATWSHCAPCDTCPAGSR-------FIPASSSSYASLP 131
Query: 136 CDQEFCHGVYGGPLTDCTAN-------TSCPYLEIYGDGSSTTGYFVQDVVQY--DKVSG 186
C ++C G P C AN +C + + + D +S D ++ D ++G
Sbjct: 132 CASDWCPLFEGQP---CPANQDASAPLPACAFSKPFAD-TSFQASLGSDTLRLGKDAIAG 187
Query: 187 DLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAH 246
FGC +G T G++G G+ S++SQ S +F++
Sbjct: 188 ----------YAFGCVGAVAG---PTTNLPKQGLLGLGRGPMSLLSQTGSR--YNGVFSY 232
Query: 247 CLDGING---GGIFAIGHVVQPE-VNKTPLVPNQPH----YSINMTAVQVGLDFLNLPTD 298
CL G +G QP V TPL+ N PH Y +N+T + VG ++ +P
Sbjct: 233 CLPSYRSYYFSGSLRLGAAGQPRNVRYTPLLTN-PHRPSLYYVNVTGLSVGRTWVKVPAG 291
Query: 299 VFGV--GDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEY-TCFQYSES 355
F GT+IDSGT + VY L + Q +T + TCF E
Sbjct: 292 SFAFDPATGAGTVIDSGTVITRWTAPVYAALREEFRRQVAAPSGYTSLGAFDTCFNTDEV 351
Query: 356 VDEGFPNVTFHFENSVSLKV 375
G P VT H + V L +
Sbjct: 352 AAGGAPPVTLHMDGGVDLTL 371
>gi|125553570|gb|EAY99279.1| hypothetical protein OsI_21243 [Oryza sativa Indica Group]
gi|125605796|gb|EAZ44832.1| hypothetical protein OsJ_29469 [Oryza sativa Japonica Group]
Length = 534
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 90/373 (24%), Positives = 150/373 (40%), Gaps = 65/373 (17%)
Query: 38 RERSLSLLKEHDARRQQR----ILAGVD---LPLGGSSRPDGVGLYYAKIGIGTPPKDYY 90
R L D RR R +++ D LP+ + VG+Y + IGTP Y
Sbjct: 64 RREHFRALMAKDMRRMMRQVPELMSKTDMFELPMRSALNIAQVGMYVVVVRIGTPALPYS 123
Query: 91 VQVDTGSDIMWVNCIQCKE----------CPRRSSLGIE--------------------L 120
+ ++T +++ W+NC + P +++ I+ +
Sbjct: 124 LALETANEVTWINCRLRRRKGKHPGRPHVPPAATTMSIQVDDDGGGGGSGGKSKVTKVIM 183
Query: 121 TLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTA---NTSCPYLEIYGDGSSTTGYFVQD 177
Y SS+ + C Q C + P C + NTSC Y ++ D + T+G + Q+
Sbjct: 184 NWYRPAKSSSWRRFRCSQRACMDL---PYNTCESPDQNTSCTYYQVMKDSTITSGIYGQE 240
Query: 178 VVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASS 237
G ++ L+ GC + G +++ DGI+ G S SS A
Sbjct: 241 KATVAVSDGTMKKLP---GLVIGCSTFEHGGAVNSH----DGILSLGNSPSSFGIAAARR 293
Query: 238 GGVRKMFAHCL----DGINGGGIFAIGH---VVQPEVNKTPLVPNQPHYSINMTAVQVGL 290
G R F CL G N G V P +TPL+ Y ++T + VG
Sbjct: 294 FGGRLSF--CLLATTSGRNASSYLTFGANPAVQAPGTMETPLLYRDVAYGAHVTGILVGG 351
Query: 291 DFLNLPTDVFGVG----DN--KGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVH 344
L++P +V+ G DN G I+D+GT++ YL VY+P+ + + S L +
Sbjct: 352 QPLDIPPEVWDEGPLGNDNPEAGIILDTGTSITYLVSAVYDPVTAALDSHLAHLPKAEIK 411
Query: 345 DEYTCFQYSESVD 357
C+ ++ + D
Sbjct: 412 GFEYCYNWTFAGD 424
>gi|356498306|ref|XP_003517994.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 484
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 87/332 (26%), Positives = 136/332 (40%), Gaps = 39/332 (11%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y+ ++GIG PP YV +DTGSD+ W+ C C EC ++S ++D S++
Sbjct: 145 GSGEYFLRVGIGKPPSQAYVVLDTGSDVSWIQCAPCSECYQQSD-----PIFDPISSNSY 199
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
+ CD+ C + L++C N +C Y YGDGS T G F + V + +
Sbjct: 200 SPIRCDEPQCKSL---DLSECR-NGTCLYEVSYGDGSYTVGEFATETVTLGSAAVE---- 251
Query: 192 STNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKM-FAHCLDG 250
++ GCG N E L +G +L+ V F++CL
Sbjct: 252 ----NVAIGCGH---------NNEGL--FVGAAGLLGLGGGKLSFPAQVNATSFSYCLVN 296
Query: 251 INGGGI--FAIGHVVQPEVNKTPLVPN---QPHYSINMTAVQVGLDFLNLPTDVFGV--G 303
+ + + PL+ N Y + + + VG + L +P F V
Sbjct: 297 RDSDAVSTLEFNSPLPRNAATAPLMRNPELDTFYYLGLKGISVGGEALPIPESSFEVDAI 356
Query: 304 DNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDL-KVHTVHDEYTCFQYSESVDEGFPN 362
G IIDSGT + L VY+ L + + K + V TC+ S P
Sbjct: 357 GGGGIIIDSGTAVTRLRSEVYDALRDAFVKGAKGIPKANGVSLFDTCYDLSSRESVEIPT 416
Query: 363 VTFHFENSVSLKVYPHEYLFPFEDL--WCIGW 392
V+F F L + YL P + + +C +
Sbjct: 417 VSFRFPEGRELPLPARNYLIPVDSVGTFCFAF 448
>gi|357143657|ref|XP_003573000.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Brachypodium distachyon]
Length = 464
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 80/299 (26%), Positives = 120/299 (40%), Gaps = 49/299 (16%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y + IG+P + +DTGSD+ W+ C + LYD SST +
Sbjct: 131 YVITVSIGSPAVAXTMFIDTGSDVSWLRC--------------KSRLYDPGTSSTYAPFS 176
Query: 136 CDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNG 195
C C + G T C++ ++C Y YGDGS+TTG + D + S L +
Sbjct: 177 CSAPACAQL-GRRGTGCSSGSTCVYSVKYGDGSNTTGTYGSDTLTLAGTSEPLIS----- 230
Query: 196 SLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI-NGG 254
FGC A + G E+ DG++G G S +SQ A++ G F++CL N
Sbjct: 231 GFQFGCSAVEHG----FEEDNTDGLMGLGGDAQSFVSQTAATYG--SAFSYCLPPTWNSS 284
Query: 255 GIFAIGHVVQPEVNKTPLVP------NQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGT 308
G +G P Y + + + VG L +P+ VF + G+
Sbjct: 285 GFLTLGAPSSSTSAAFSTTPMLRSKQAATFYGLLLRGISVGGKTLEIPSSVF----SAGS 340
Query: 309 IIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHF 367
I+DSGT + LP Y L + D +QY + G + F F
Sbjct: 341 IVDSGTVITRLPPTAYGALSAAF------------RDGMARYQYQPAAPRGLLDTCFDF 387
>gi|8979711|emb|CAB96832.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
thaliana]
Length = 446
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 100/373 (26%), Positives = 150/373 (40%), Gaps = 62/373 (16%)
Query: 60 VDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIE 119
DLP S G G Y +G+GTP D + DTGSD+ W QC+ C R
Sbjct: 89 TDLPAKDGSTL-GSGNYIVTVGLGTPKNDLSLIFDTGSDLTWT---QCQPCVR------- 137
Query: 120 LTLYDIKD------SSTGKF-VTCDQEFCHGVYGGPLTDCTAN------TSCPYLEIYGD 166
T YD K+ ST + V+C C G L+ T N ++C Y YGD
Sbjct: 138 -TCYDQKEPIFNPSKSTSYYNVSCSSAAC-----GSLSSATGNAGSCSASNCIYGIQYGD 191
Query: 167 GSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKS 226
S + G+ ++ L + + FGCG G + G++G G+
Sbjct: 192 QSFSVGFLAKEKFT-------LTNSDVFDGVYFGCGENNQGLF-----TGVAGLLGLGRD 239
Query: 227 NSSMISQLASSGGVRKMFAHCL-DGINGGGIFAIGHV-VQPEVNKTP---LVPNQPHYSI 281
S SQ A++ K+F++CL + G G + V TP + Y +
Sbjct: 240 KLSFPSQTATA--YNKIFSYCLPSSASYTGHLTFGSAGISRSVKFTPISTITDGTSFYGL 297
Query: 282 NMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKI---ISQQPDL 338
N+ A+ VG L +P+ VF G +IDSGT + LP Y L S +S+ P
Sbjct: 298 NIVAITVGGQKLPIPSTVF---STPGALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTT 354
Query: 339 KVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFE-DLWCIGWQNSGM 397
++ D TCF S P V F F +++ + F+ C+ +
Sbjct: 355 SGVSILD--TCFDLSGFKTVTIPKVAFSFSGGAVVELGSKGIFYVFKISQVCLAFAG--- 409
Query: 398 QSRDRKNMTLLGD 410
+ D N + G+
Sbjct: 410 -NSDDSNAAIFGN 421
>gi|190896584|gb|ACE96805.1| aspartyl protease [Populus tremula]
gi|190896586|gb|ACE96806.1| aspartyl protease [Populus tremula]
gi|190896588|gb|ACE96807.1| aspartyl protease [Populus tremula]
gi|190896590|gb|ACE96808.1| aspartyl protease [Populus tremula]
gi|190896592|gb|ACE96809.1| aspartyl protease [Populus tremula]
gi|190896594|gb|ACE96810.1| aspartyl protease [Populus tremula]
gi|190896596|gb|ACE96811.1| aspartyl protease [Populus tremula]
gi|190896598|gb|ACE96812.1| aspartyl protease [Populus tremula]
gi|190896600|gb|ACE96813.1| aspartyl protease [Populus tremula]
gi|190896602|gb|ACE96814.1| aspartyl protease [Populus tremula]
gi|190896604|gb|ACE96815.1| aspartyl protease [Populus tremula]
gi|190896606|gb|ACE96816.1| aspartyl protease [Populus tremula]
gi|190896610|gb|ACE96818.1| aspartyl protease [Populus tremula]
gi|190896612|gb|ACE96819.1| aspartyl protease [Populus tremula]
gi|190896614|gb|ACE96820.1| aspartyl protease [Populus tremula]
gi|190896616|gb|ACE96821.1| aspartyl protease [Populus tremula]
gi|190896618|gb|ACE96822.1| aspartyl protease [Populus tremula]
gi|190896620|gb|ACE96823.1| aspartyl protease [Populus tremula]
gi|190896622|gb|ACE96824.1| aspartyl protease [Populus tremula]
gi|190896624|gb|ACE96825.1| aspartyl protease [Populus tremula]
gi|190896626|gb|ACE96826.1| aspartyl protease [Populus tremula]
gi|190896628|gb|ACE96827.1| aspartyl protease [Populus tremula]
gi|190896630|gb|ACE96828.1| aspartyl protease [Populus tremula]
gi|190896632|gb|ACE96829.1| aspartyl protease [Populus tremula]
gi|190896634|gb|ACE96830.1| aspartyl protease [Populus tremula]
gi|190896636|gb|ACE96831.1| aspartyl protease [Populus tremula]
gi|190896638|gb|ACE96832.1| aspartyl protease [Populus tremula]
gi|190896640|gb|ACE96833.1| aspartyl protease [Populus tremula]
gi|190896642|gb|ACE96834.1| aspartyl protease [Populus tremula]
gi|190896644|gb|ACE96835.1| aspartyl protease [Populus tremula]
gi|190896646|gb|ACE96836.1| aspartyl protease [Populus tremula]
gi|190896648|gb|ACE96837.1| aspartyl protease [Populus tremula]
gi|190896650|gb|ACE96838.1| aspartyl protease [Populus tremula]
gi|190896652|gb|ACE96839.1| aspartyl protease [Populus tremula]
gi|190896654|gb|ACE96840.1| aspartyl protease [Populus tremula]
gi|190896656|gb|ACE96841.1| aspartyl protease [Populus tremula]
gi|190896658|gb|ACE96842.1| aspartyl protease [Populus tremula]
Length = 339
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 85/305 (27%), Positives = 133/305 (43%), Gaps = 42/305 (13%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y ++ +GTP + ++ +DT +D WV C C C T + S+T +
Sbjct: 45 YVVRVKLGTPGQQMFMVLDTSNDAAWVPCSGCTGC--------SSTTFLPNASTTLGSLD 96
Query: 136 CDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQY--DKVSGDLQTTST 193
C + C V G T +++C + + YG SS VQD + D + G
Sbjct: 97 CSEAQCSQVRGFSC-PATGSSACLFNQSYGGDSSLAATLVQDAITLANDVIPG------- 148
Query: 194 NGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGING 253
FGC SG G++G G+ S+ISQ + +F++CL
Sbjct: 149 ---FTFGCINAVSG-----GSIPPQGLLGLGRGPISLISQAGAM--YSGVFSYCLPSFKS 198
Query: 254 ---GGIFAIGHVVQPE-VNKTPLVPNQPH----YSINMTAVQVGLDFLNLPTD--VFGVG 303
G +G V QP+ + TPL+ N PH Y +N+T V VG + +P++ VF
Sbjct: 199 YYFSGSLKLGPVGQPKSIRTTPLLRN-PHRPSLYYVNLTGVSVGRIKVPIPSEQLVFDPN 257
Query: 304 DNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNV 363
GTIIDSGT + + VY + + +Q + + ++ TCF + + P V
Sbjct: 258 TGAGTIIDSGTVITRFVQPVYFAIRDE-FRKQVNGPISSLGAFDTCFAATNEAEA--PAV 314
Query: 364 TFHFE 368
T HFE
Sbjct: 315 TLHFE 319
>gi|255543383|ref|XP_002512754.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223547765|gb|EEF49257.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 414
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 86/305 (28%), Positives = 134/305 (43%), Gaps = 43/305 (14%)
Query: 87 KDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVY- 145
++ V VDTGSD+ WV C C+ C + L++ S + + + C+ C +
Sbjct: 76 RNMTVIVDTGSDLTWVQCQPCRLCYNQQD-----PLFNPSGSPSYQTILCNSSTCQSLQY 130
Query: 146 -GGPLTDCTANT-SCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGA 203
G L C +NT +C Y+ YGDGS T G + + +L TT + + IFGCG
Sbjct: 131 ATGNLGVCGSNTPTCNYVVNYGDGSYTRGDLGMEQL-------NLGTTHVS-NFIFGCGR 182
Query: 204 RQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL--DGINGGGIFAIGH 261
G + G++G GKS+ S++SQ +S +F++CL + G +G
Sbjct: 183 NNKGLFGGAS-----GLMGLGKSDLSLVSQ--TSAIFEGVFSYCLPTTAADASGSLILGG 235
Query: 262 VVQPEVNKTPLV-------PNQP-HYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSG 313
N TP+ P P Y +N+T + +G L P G +IDSG
Sbjct: 236 NSSVYKNTTPISYTRMIANPQLPTFYFLNLTGISIGGVALQAPNY-----RQSGILIDSG 290
Query: 314 TTLAYLPEMVYEPLVSKIISQ---QPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENS 370
T + LP VY L ++ + Q P ++ D TCF + + P + FE +
Sbjct: 291 TVITRLPPPVYRDLKAEFLKQFSGFPSAPPFSILD--TCFNLNGYDEVDIPTIRMQFEGN 348
Query: 371 VSLKV 375
L V
Sbjct: 349 AELTV 353
>gi|212722554|ref|NP_001131154.1| uncharacterized protein LOC100192462 precursor [Zea mays]
gi|194690728|gb|ACF79448.1| unknown [Zea mays]
Length = 431
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 85/320 (26%), Positives = 130/320 (40%), Gaps = 47/320 (14%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y + G+GTP + + +DT +D W +C C CP S + SS+ +
Sbjct: 79 YVVRAGLGTPVQQLLLALDTSADATWSHCAPCDTCPAGSR-------FIPASSSSYASLP 131
Query: 136 CDQEFCHGVYGGPLTDCTAN-------TSCPYLEIYGDGSSTTGYFVQDVVQY--DKVSG 186
C ++C G P C AN +C + + + D +S D ++ D ++G
Sbjct: 132 CASDWCPLFEGQP---CPANQDASAPLPACAFSKPFAD-TSFQASLGSDTLRLGKDAIAG 187
Query: 187 DLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAH 246
FGC +G T G++G G+ S++SQ S +F++
Sbjct: 188 ----------YAFGCVGAVAG---PTTNLPKQGLLGLGRGPMSLLSQTGSR--YNGVFSY 232
Query: 247 CLDGING---GGIFAIGHVVQPE-VNKTPLVPNQPH----YSINMTAVQVGLDFLNLPTD 298
CL G +G QP V TPL+ N PH Y +N+T + VG ++ +P
Sbjct: 233 CLPSYRSYYFSGSLRLGAAGQPRNVRYTPLLTN-PHRPSLYYVNVTGLSVGRTWVKVPAG 291
Query: 299 VFGV--GDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEY-TCFQYSES 355
F GT+IDSGT + VY L + Q +T + TCF E
Sbjct: 292 SFAFDPATGAGTVIDSGTVITRWTAPVYAALREEFRRQVAAPSGYTSLGAFDTCFNTDEV 351
Query: 356 VDEGFPNVTFHFENSVSLKV 375
G P VT H + V L +
Sbjct: 352 AAGGAPPVTLHMDGGVDLTL 371
>gi|413946455|gb|AFW79104.1| hypothetical protein ZEAMMB73_209101 [Zea mays]
Length = 480
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 88/354 (24%), Positives = 147/354 (41%), Gaps = 49/354 (13%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKE----CPRRSSLGIELTLYDIKD 127
G G Y+ + +GTP + + + DTGSD+ WV C + PRR ++
Sbjct: 108 GTGQYFVRFRVGTPAQPFVLVADTGSDLTWVKCSGAGDGTGDAPRR--------VFRAAA 159
Query: 128 SSTGKFVTCDQEFCHGVYGGPLTDCTANTS-CPYLEIYGDGSSTTGYFVQDVVQY----- 181
S + + C + C L +C++ S C Y Y DGS+ G D
Sbjct: 160 SRSWAPIACSSDTCTSYVPFSLANCSSPASPCAYDYRYNDGSAARGVVGTDSATIALSGS 219
Query: 182 DKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVR 241
+ G + G ++ GC A + D + ++ DG++ G SN S S+ A+ G R
Sbjct: 220 ESRDGGGRRAKLQG-VVLGCTA----SYDGQSFQSSDGVLSLGNSNISFASRAAARFGGR 274
Query: 242 KMFAHCL-----------------DGINGGGIFAIGHVVQPEVNKTPLVPNQ---PHYSI 281
F++CL G GG + +TPL+ ++ P Y++
Sbjct: 275 --FSYCLVDHLAPRNATSYLTFGPPGPEGGAAASSSSSSA--AARTPLLLDRRMSPFYAV 330
Query: 282 NMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVH 341
+ AV V + L++P DV+ V G I+DSGT+L L Y +V+ + + L
Sbjct: 331 AVDAVHVAGEALDIPADVWDVARGGGAILDSGTSLTVLATPAYRAVVAALSERLAGLPRV 390
Query: 342 TVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLF-PFEDLWCIGWQN 394
++ C+ ++ + E P + F S L+ Y+ + CIG Q
Sbjct: 391 SMDPFEYCYNWTAAALE-IPGLEVRFAGSARLQPPAKSYVVDAAPGVKCIGVQE 443
>gi|297826117|ref|XP_002880941.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297326780|gb|EFH57200.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 397
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 86/324 (26%), Positives = 124/324 (38%), Gaps = 49/324 (15%)
Query: 64 LGGSSRPDGV---GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIEL 120
LG S D V +Y ++ +GTPP + ++DTGSD++W C+ C C + +
Sbjct: 46 LGASPYADTVFDYSIYLMRLQLGTPPFEIVAEIDTGSDLIWTQCMPCPNCYTQFA----- 100
Query: 121 TLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQ 180
++D SST K ++ CHG SCPY IY D S +TG + V
Sbjct: 101 PIFDPSKSSTFK-----EKRCHG------------NSCPYEIIYADESYSTGILATETVT 143
Query: 181 YDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQL------ 234
SG+ + GCG S + + GI+G SS+ISQ+
Sbjct: 144 IQSTSGEPFVMAETS---IGCGLNNSNLMTPGYAASSSGIVGLNMGPSSLISQMDLPIPG 200
Query: 235 -----ASSGGVRKMFAHCLDGINGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVG 289
SS G K+ + G G A ++ + QP Y +N+ AV VG
Sbjct: 201 LISYCFSSQGTSKINFGTNAVVAGDGTVAADMFIKKD---------QPFYYLNLDAVSVG 251
Query: 290 LDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTC 349
+ F D IDSGTT YLP + + +
Sbjct: 252 DKRIETLGTPFHAQDGN-IFIDSGTTYTYLPTSYCNLVREAVAASVVAANQVPDPSSENL 310
Query: 350 FQYSESVDEGFPNVTFHFENSVSL 373
Y+ E FP +T HF L
Sbjct: 311 LCYNWDTMEIFPVITLHFAGGADL 334
>gi|255537017|ref|XP_002509575.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223549474|gb|EEF50962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 459
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 97/386 (25%), Positives = 159/386 (41%), Gaps = 59/386 (15%)
Query: 38 RERSLSLLKEHDARR------QQRILAGVDLPLGGSSRPDG---------VGLYYAKIGI 82
++R+ +LK +AR +R A VD G +S D + + I
Sbjct: 57 KDRAKRMLKNSNARFDYVQAISKRNSAVVDYDGGDTSAADDAYEASLLSELCTFLVNFSI 116
Query: 83 GTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYD----IKDSSTGKFVTCDQ 138
G PP Y +DTGS + W+ C C C ++ LY+ S F D
Sbjct: 117 GQPPVPQYAVMDTGSSLTWIQCEPCINCHQQKG-----PLYNPSSSSTYVSCSDFDRTDT 171
Query: 139 EFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLI 198
F T + C Y + Y D ++T G + ++ + ++ + + +I
Sbjct: 172 TFT----------ATHGSDCNYSQTYADKTTTRGTYAREQLLFETPDDGI---TIMHDVI 218
Query: 199 FGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL----DGINGG 254
FGCG + T + G+ G G S SS+IS+L F++C+ D + G
Sbjct: 219 FGCGHNNTQLPGPTGYAS--GVFGLGDSGSSIISKLGFG------FSYCIGNIGDPLYGF 270
Query: 255 GIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKG----TII 310
+G+ ++ E TPLVP +Y I + + +G + L++ VF D G +I
Sbjct: 271 HRLTLGNKLKIEGYSTPLVPRGLYY-ITLVGISIGQERLDIDPIVFQRVDLNGISSRIVI 329
Query: 311 DSGTTLAYLPEMVYEPL---VSKIISQQPDLKVHTVHDEYTCFQYSESVD-EGFPNVTFH 366
DSG TL+Y+P Y + VS I+S + C+ + D +GFP+ TFH
Sbjct: 330 DSGATLSYIPRQAYNVVRDKVSSILSGFLSRYRYIARHLSLCYIGKLNQDLQGFPDATFH 389
Query: 367 FENSVSLKVYPHEYLFPFED-LWCIG 391
+ L F + D + C+
Sbjct: 390 LADGADLVFQVEGLFFQYTDNVLCLA 415
>gi|356524287|ref|XP_003530761.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 481
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 89/358 (24%), Positives = 153/358 (42%), Gaps = 44/358 (12%)
Query: 32 KYRYAGRERSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYV 91
+ +Y S +L +E+ + L LP S G Y+ +G+GTP +D +
Sbjct: 97 RVKYIQSRLSKNLGRENSVKE----LDSTTLP-AKSGSLIGSANYFVVVGLGTPKRDLSL 151
Query: 92 QVDTGSDIMWVNCIQCK-ECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVY-GGPL 149
DTGSD+ W C C C ++ + ++D SS+ +TC C + G
Sbjct: 152 VFDTGSDLTWTQCEPCAGSCYKQ-----QDAIFDPSKSSSYINITCTSSLCTQLTSAGIK 206
Query: 150 TDCTAN-TSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGN 208
+ C+++ T+C Y YGD S++ G+ Q+ + + T +FGCG G
Sbjct: 207 SRCSSSTTACIYGIQYGDKSTSVGFLSQERLT-------ITATDIVDDFLFGCGQDNEGL 259
Query: 209 LDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGIN---GGGIFAIGHVVQP 265
+ G+IG G+ S + Q +S K+F++CL + G F
Sbjct: 260 FSGS-----AGLIGLGRHPISFVQQTSSI--YNKIFSYCLPSTSSSLGHLTFGASAATNA 312
Query: 266 EVNKTPLVP---NQPHYSINMTAVQVGLDFL-NLPTDVFGVGDNKGTIIDSGTTLAYLPE 321
+ TPL + Y +++ + VG L + + F G G+IIDSGT + L
Sbjct: 313 NLKYTPLSTISGDNTFYGLDIVGISVGGTKLPAVSSSTFSAG---GSIIDSGTVITRLAP 369
Query: 322 MVYEPLVSKIISQQPDLKVHTVHDE----YTCFQYSESVDEGFPNVTFHFENSVSLKV 375
Y L S + ++ + V +E TC+ +S + P + F F V++++
Sbjct: 370 TAYAALRSAF---RQGMEKYPVANEDGLFDTCYDFSGYKEISVPKIDFEFAGGVTVEL 424
>gi|225427554|ref|XP_002266533.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 447
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 99/336 (29%), Positives = 144/336 (42%), Gaps = 37/336 (11%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKF 133
G Y I +GTPP + DTGSD++W C C C + IE ++D S T +
Sbjct: 93 GEYLMNISLGTPPVSMHGIADTGSDLLWRQCKPCDSCYEQ----IE-PIFDPAKSKTYQI 147
Query: 134 VTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTST 193
++C+ + C + G C+ + +C Y YGDGS T+G D + +G +
Sbjct: 148 LSCEGKSCSNL--GGQGGCSDDNTCIYSYSYGDGSHTSGDLAVDTLTIGSTTGRPVSVP- 204
Query: 194 NGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI-N 252
++FGCG G T E G++G G SMISQL G R F++CL + N
Sbjct: 205 --KVVFGCGHNNGG----TFELHGSGLVGLGGGPLSMISQLRPLIGGR--FSYCLVPLGN 256
Query: 253 GGGIFAIGH------VVQPEVNKTPLVPNQP--HYSINMTAVQVGLDFLNLP-----TDV 299
+ + H V TPL QP Y + + ++ VG L
Sbjct: 257 DPSVSSKMHFGSRGIVSGAGAVSTPLASRQPDTFYYLTLESMSVGSKKLAYKGFSKVGSP 316
Query: 300 FGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQ--YSESVD 357
D IIDSGTTL LP+ Y L S ++S + V D F YS
Sbjct: 317 LADADEGNIIIDSGTTLTLLPQDFYGTLESNVVSA---IGGKPVRDPNNVFSLCYSNLSG 373
Query: 358 EGFPNVTFHFENSVSLKVYP-HEYLFPFEDLWCIGW 392
P +T HF + L++ P + ++ EDL+C
Sbjct: 374 LRIPTITAHFVGA-DLELKPLNTFVQVQEDLFCFAM 408
>gi|297798978|ref|XP_002867373.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297313209|gb|EFH43632.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 434
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 79/309 (25%), Positives = 138/309 (44%), Gaps = 32/309 (10%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
+ A I IG PP + +DTGSD+ W+ C+ CK P+ + + SST + +
Sbjct: 88 FLANISIGDPPVPQLLLIDTGSDLTWIQCLPCKCYPQ------TIPFFHPSRSSTYRNAS 141
Query: 136 CDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNG 195
C+ H + + T N C Y Y D S+T G ++ + + L +
Sbjct: 142 CESA-PHAMPQIFRDEKTGN--CRYHLRYRDFSNTRGILAKEKLTFQTSDEGL---ISKP 195
Query: 196 SLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGG 255
+++FGCG SG + G++G G S++++ S F +D
Sbjct: 196 NIVFGCGQDNSGFTQYS------GVLGLGPGTFSIVTRNFGS-KFSYCFGSLIDPTYPHN 248
Query: 256 IFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNK-GTIIDSGT 314
+G+ + E + TPL Q Y +++ A+ +G L++ +F +K GT+ID+G
Sbjct: 249 FLILGNGARIEGDPTPLQIFQDRYYLDLQAISLGEKLLDIEPGIFQRYRSKGGTVIDTGC 308
Query: 315 TLAYLPEMVYEP-------LVSKIISQQPDLKVHTVHDEYTCFQYSESVD-EGFPNVTFH 366
+ L YE L+ +++ + D + +T H C++ + +D GFP VTFH
Sbjct: 309 SPTILAREAYETLSEEIDFLLGEVLRRVKDWEQYTNH----CYEGNLKLDLYGFPVVTFH 364
Query: 367 FENSVSLKV 375
F L +
Sbjct: 365 FAGGAELAL 373
>gi|125536419|gb|EAY82907.1| hypothetical protein OsI_38120 [Oryza sativa Indica Group]
Length = 448
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 103/404 (25%), Positives = 160/404 (39%), Gaps = 50/404 (12%)
Query: 29 FSVKYRYAGRERSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKD 88
F R+LSL E RR +G + G G Y + IG PP
Sbjct: 41 FRASLIRTAESRNLSLAAERSRRRLSVYTSGTGTKAPVTKSQKG-GKYIMQFSIGEPPLL 99
Query: 89 YYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGP 148
+ +VDTGSD+MWV C C C S LYD S + + C + C + G
Sbjct: 100 IWAEVDTGSDLMWVKCSPCNGCNPPPS-----PLYDPARSRSSGKLPCSSQLCQALGRGR 154
Query: 149 LTD--CTANTS-CPYLEIYGDGS--STTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGA 203
+ C+ + C Y YG ST G + + GD + ++ FG
Sbjct: 155 IISDQCSDDPPLCGYHYAYGHSGDHSTQGVLGTETFTF----GDGYVAN---NVSFG--- 204
Query: 204 RQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDG-------INGGGI 256
+S +D + G++G G+ + S++SQL + FA+CL I G +
Sbjct: 205 -RSDTIDGSQFGGTAGLVGLGRGHLSLVSQLGAG-----RFAYCLAADPNVYSTILFGSL 258
Query: 257 FAIGHVVQPEVNKTPLVPN-QP----HYSINMTAVQVGLDFLNLPTDVFGVGDN--KGTI 309
A+ +V+ TPLV N +P HY +N+ + VG L + F + + G
Sbjct: 259 AAL-DTSAGDVSSTPLVTNPKPDRDTHYYVNLQGISVGGSRLPIKDGTFAINSDGSGGVF 317
Query: 310 IDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCF-QYSESVDEGFPNVTFHFE 368
DSG L + Y+ + I S+ L D TCF ++ P + HF+
Sbjct: 318 FDSGAIDTSLKDAAYQVVRQAITSEIQRLGYDAGDD--TCFVAANQQAVAQMPPLVLHFD 375
Query: 369 NSVSLKVYPHEYLF-----PFEDLWCIGWQNSGMQSRDRKNMTL 407
+ + + YL P E L C+ ++S + NM +
Sbjct: 376 DGADMSLNGRNYLKTSTKGPSEVLVCMAIKSSSDSEVSQSNMNV 419
>gi|224067042|ref|XP_002302336.1| predicted protein [Populus trichocarpa]
gi|222844062|gb|EEE81609.1| predicted protein [Populus trichocarpa]
Length = 438
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 88/306 (28%), Positives = 134/306 (43%), Gaps = 44/306 (14%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y ++ +GTP + ++ +DT +D WV C C C T + S+T +
Sbjct: 98 YVVRVKLGTPGQQMFMVLDTSNDAAWVPCSGCTGC--------SSTTFLPNASTTLGSLD 149
Query: 136 CDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQY--DKVSGDLQTTST 193
C C V G T +++C + + YG SS T VQD + D + G
Sbjct: 150 CSGAQCSQVRGFS-CPATGSSACLFNQSYGGDSSLTATLVQDAITLANDVIPG------- 201
Query: 194 NGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQL-ASSGGVRKMFAHCLDGIN 252
FGC SG G++G G+ S+ISQ A GV F++CL
Sbjct: 202 ---FTFGCINAVSG-----GSIPPQGLLGLGRGPISLISQAGAMYSGV---FSYCLPSFK 250
Query: 253 G---GGIFAIGHVVQPE-VNKTPLVPNQPH----YSINMTAVQVGLDFLNLPTD--VFGV 302
G +G V QP+ + TPL+ N PH Y +N+T V VG + +P++ VF
Sbjct: 251 SYYFSGSLKLGPVGQPKSIRTTPLLRN-PHRPSLYYVNLTGVSVGRIKVPIPSEQLVFDP 309
Query: 303 GDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPN 362
GTIIDSGT + + VY + + +Q + + ++ TCF + + P
Sbjct: 310 NTGAGTIIDSGTVITRFVQPVYFAIRDE-FRKQVNGPISSLGAFDTCFAATNEAEA--PA 366
Query: 363 VTFHFE 368
+T HFE
Sbjct: 367 ITLHFE 372
>gi|357127293|ref|XP_003565317.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial
[Brachypodium distachyon]
Length = 540
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 85/337 (25%), Positives = 138/337 (40%), Gaps = 32/337 (9%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y+++IGIG+P + Y+ +DTGSD+ W+ C C +C +S L+D SS+
Sbjct: 192 GSGEYFSRIGIGSPARQLYMVLDTGSDVTWLQCAPCADCYAQSD-----PLFDPALSSSY 246
Query: 132 KFVTCDQEFCHGVYGGPLTDCTA--NTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQ 189
V CD C + + A N+SC Y YGDGS T G F + + + GD
Sbjct: 247 ATVPCDSPHCRALDASACHNNAANGNSSCVYEVAYGDGSYTVGDFATETL---TLGGD-- 301
Query: 190 TTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLD 249
++ + GCG G G S SQ++++ F++CL
Sbjct: 302 GSAAVHDVAIGCGHDNEGLFVGAAGLLALGGGPL-----SFPSQISAT-----EFSYCLV 351
Query: 250 GINGGGIFAIGHVVQPEVNKTPLVPNQPH----YSINMTAVQVGLDFL-NLPTDVFGVGD 304
+ + T + P Y + + + VG + L ++P F + +
Sbjct: 352 DRDSPSASTLQFGASDSSTVTAPLMRSPRSNTFYYVALNGISVGGETLSDIPPAAFAMDE 411
Query: 305 --NKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDL-KVHTVHDEYTCFQYSESVDEGFP 361
+ G I+DSGT + L Y L + L + V TC+ + P
Sbjct: 412 QGSGGVIVDSGTAVTRLQSSAYSALRDAFVRGTQALPRASGVSLFDTCYDLAGRSSVQVP 471
Query: 362 NVTFHFENSVSLKVYPHEYLFPFEDL--WCIGWQNSG 396
V+ FE LK+ YL P + +C+ + +G
Sbjct: 472 AVSLRFEGGGELKLPAKNYLIPVDGAGTYCLAFAATG 508
>gi|302824729|ref|XP_002994005.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
gi|300138167|gb|EFJ04945.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
Length = 462
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 82/332 (24%), Positives = 141/332 (42%), Gaps = 43/332 (12%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKF 133
G YY I +G+P ++ + VDTGS++ W+ C+ CK C T+YD S++ +
Sbjct: 98 GEYYTSIKLGSPGQEAILIVDTGSELTWLQCLPCKVCAPSVD-----TIYDAARSASYRP 152
Query: 134 VTCDQ-EFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTS 192
VTC+ + C G C + C + YGDGS + G D + + V G T
Sbjct: 153 VTCNNSQLCSNSSQGTYAYCARGSQCQFAAFYGDGSFSYGSLSTDTLIMETVVGGKPVTV 212
Query: 193 TNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL---- 248
+ FGC G+L+ A GI+G ++ QL G + F+HC
Sbjct: 213 QD--FAFGCA---QGDLELVPTGA-SGILGLNAGKMALPMQLGQRFGWK--FSHCFPDRS 264
Query: 249 DGINGGGIFAIGHVVQPEVNKTPLVPNQPHY-SINMTAVQVGLDFLNLPTDVFGVGDN-- 305
+N G+ G+ P Q Y S+ +T ++ F ++ + +
Sbjct: 265 SHLNSTGVVFFGNAELPH--------EQVQYTSVALTNSELQRKFYHVALKGVSINSHEL 316
Query: 306 ----KGT--IIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEY----TCFQYS-E 354
+G+ I+DSG++ + + L + +P H D + TCF+ S +
Sbjct: 317 VFLPRGSVVILDSGSSFSSFVRPFHSQLREAFLKHRPPSLKHLEGDSFGDLGTCFKVSND 376
Query: 355 SVDE---GFPNVTFHFENSVSLKVYPHEYLFP 383
+DE P+++ FE+ V++ + L P
Sbjct: 377 DIDELHRTLPSLSLVFEDGVTIGIPSIGVLLP 408
>gi|219120658|ref|XP_002181063.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217407779|gb|EEC47715.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 448
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 94/351 (26%), Positives = 148/351 (42%), Gaps = 51/351 (14%)
Query: 58 AGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLG 117
A V LPL + G ++ +G PP+ + VDTGS + C C +C ++
Sbjct: 73 ATVRLPLHAVA-----GTHHVTAWMGEPPQAQTLIVDTGSRLTATACEPCSQC--GTTHA 125
Query: 118 IELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQD 177
D + SST ++ C G+ +C A C + Y +GSS T V D
Sbjct: 126 HPFPHLDPQRSSTLRYTQCGSCLLSGI-----QECAAEQKCGINQRYTEGSSWTAVEVSD 180
Query: 178 --VVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLA 235
V+ ++S Q S FGC + G + + +GI+G +S+ S+I +L
Sbjct: 181 TFVLGGPEISSLEQYVSFTIIFAFGCQQKVRGLFRT---QYANGILGLERSDLSLIKRLW 237
Query: 236 SSGGV-RKMFAHCLDGING----GGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGL 290
+ R+ F+ C+ G GG H + TP Q Y++++ V VG
Sbjct: 238 KENVIPRESFSLCMTPFEGYIGLGGPLRDKHT--ESMKYTPFTSTQSWYAVHVVRVFVGD 295
Query: 291 DFL--NLPTD-------VFGVGDNKGTIIDSGTTLAYLPEMV---YEPLVSKIISQ--QP 336
+ L N D V + KGTI+DSGTT YLP+ V + +++ + QP
Sbjct: 296 ECLTSNDQHDTVVEHALVEAFAEGKGTILDSGTTDTYLPKAVAGRMREIWARLSNTPFQP 355
Query: 337 DLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFEDL 387
+DE+ P VTF N+V+L+ P ++ EDL
Sbjct: 356 SSTYAYTYDEF----------RSLPIVTFELANNVTLQALPKNFM---EDL 393
>gi|224109494|ref|XP_002315215.1| predicted protein [Populus trichocarpa]
gi|222864255|gb|EEF01386.1| predicted protein [Populus trichocarpa]
Length = 444
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 80/293 (27%), Positives = 140/293 (47%), Gaps = 52/293 (17%)
Query: 67 SSRPDGVGLYY------AKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIEL 120
SS+ G L++ A + IGTPP++ + +DTGS++ W ++CK+ P +S
Sbjct: 52 SSKTTGKLLFHHNVTLTASLTIGTPPQNITMVLDTGSELSW---LRCKKEPNFTS----- 103
Query: 121 TLYDIKDSSTGKFVTCDQEFCHGVYGG---PLTDCTANTSCPYLEIYGDGSSTTGYFVQD 177
+++ S T + C + C P+T C C ++ Y D SS G+ +
Sbjct: 104 -IFNPLASKTYTKIPCSSQTCKTRTSDLTLPVT-CDPAKLCHFIISYADASSVEGHLAFE 161
Query: 178 VVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEA-LDGIIGFGKSNSSMISQLAS 236
++ + T + +FGC SG+ +T E+A G++G + + S ++Q+
Sbjct: 162 TFRFGSL--------TRPATVFGC--MDSGSSSNTEEDAKTTGLMGMNRGSLSFVNQM-- 209
Query: 237 SGGVRKMFAHCLDGINGGGIFAIGHV----VQPEVNKTPLVP--------NQPHYSINMT 284
G RK F++C+ G++ G +G ++P +N TPLV ++ YS+ +
Sbjct: 210 --GFRK-FSYCISGLDSTGFLLLGEARYSWLKP-LNYTPLVQISTPLPYFDRVAYSVQLE 265
Query: 285 AVQVGLDFLNLPTDVFGVGDNKG---TIIDSGTTLAYLPEMVYEPLVSKIISQ 334
++V L LP VF V D+ G T++DSGT +L VY L + + Q
Sbjct: 266 GIKVNNKVLPLPKSVF-VPDHTGAGQTMVDSGTQFTFLLGPVYSALRKEFLLQ 317
>gi|326498555|dbj|BAJ98705.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 508
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 72/266 (27%), Positives = 109/266 (40%), Gaps = 22/266 (8%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNC-IQCKECPRRSSLGIELTLYDIKDSSTGKFV 134
YY I IG P + Y++ VDTGS + W+ C C C + + + + V
Sbjct: 129 YYTSINIGNPARPYFLDVDTGSALTWIQCDAPCTNCTKGP--------HPLYKPAKENIV 180
Query: 135 TCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTN 194
C + G C C Y Y D SS+ G +D ++ G+ + N
Sbjct: 181 PPRDSHCQELQGN-QNYCDTCKQCDYEIAYADRSSSAGVLARDNMELITADGERE----N 235
Query: 195 GSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGG 254
L+FGC Q G L + + DGI+G S+ +QLA G + +F HC+ G
Sbjct: 236 MDLVFGCAHDQQGKLLGSPASS-DGILGLSNGAMSLPTQLAKQGIISNVFGHCIATDPSG 294
Query: 255 GIFA-IGHVVQPEVNKTPL-VPNQPH--YSINMTAVQVGLDFLNLPTDVFGVGDNKGTII 310
+ +G P T + V N P YS + V G LN+ G I
Sbjct: 295 SAYMFLGDDYVPRWGMTWVPVRNGPEDVYSTVVQKVNYGCQELNVREQ---AGKLTQVIF 351
Query: 311 DSGTTLAYLPEMVYEPLVSKIISQQP 336
DSG++ Y P +Y L++ + + P
Sbjct: 352 DSGSSYTYFPHEIYTSLITSLEAVSP 377
>gi|219120652|ref|XP_002181060.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217407776|gb|EEC47712.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 453
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 97/367 (26%), Positives = 154/367 (41%), Gaps = 53/367 (14%)
Query: 44 LLKEHDARRQQRIL--AGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMW 101
L + HD ++ A V LPL + G ++ +G PP+ + VDTGS +
Sbjct: 53 LRRRHDNGNTVELVPNATVRLPLHAVA-----GTHHVTAWMGEPPQAQTLIVDTGSRLTA 107
Query: 102 VNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYL 161
C C +C ++ D + SST ++ C G+ +C A C
Sbjct: 108 TACEPCSQC--GTTHAHPFPHLDPQRSSTLRYTQCGSCLLSGI-----QECAAEQKCGIN 160
Query: 162 EIYGDGSSTTGYFVQD--VVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDG 219
+ Y +GSS T V D V+ ++S Q S FGC + G + + +G
Sbjct: 161 QRYTEGSSWTAVEVSDTFVLGGPEISSLEQYVSFTIIFAFGCQQKVRGLFRT---QYANG 217
Query: 220 IIGFGKSNSSMISQLASSGGV-RKMFAHCLDGING----GGIFAIGHVVQPEVNKTPLVP 274
I+G +S+ S+I +L + R+ F+ C+ G GG H + TP
Sbjct: 218 ILGLERSDLSLIKRLWKENVIPRESFSLCMTPFEGYIGLGGPLRDKHT--ESMKYTPFTS 275
Query: 275 NQPHYSINMTAVQVGLDFL--NLPTD-------VFGVGDNKGTIIDSGTTLAYLPEMV-- 323
Q Y++++ V VG + L N D V + KGTI+DSGTT YLP+ V
Sbjct: 276 TQSWYAVHVVRVFVGDECLTSNDQHDTVVEHALVEAFAEGKGTILDSGTTDTYLPKAVAG 335
Query: 324 -YEPLVSKIISQ--QPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEY 380
+ +++ + QP +DE+ P VTF N+V+L+ P +
Sbjct: 336 RMREIWARLSNTPFQPSSTYAYTYDEF----------RSLPIVTFELANNVTLQALPKNF 385
Query: 381 LFPFEDL 387
+ EDL
Sbjct: 386 M---EDL 389
>gi|118486912|gb|ABK95290.1| unknown [Populus trichocarpa]
Length = 438
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 88/306 (28%), Positives = 134/306 (43%), Gaps = 44/306 (14%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y ++ +GTP + ++ +DT +D WV C C G T + S+T +
Sbjct: 98 YVVRVKLGTPGQQMFMVLDTSNDAAWVPCSGCT--------GFSSTTFLPNASTTLGSLD 149
Query: 136 CDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQY--DKVSGDLQTTST 193
C C V G T +++C + + YG SS T VQD + D + G
Sbjct: 150 CSGAQCSQVRGFS-CPATGSSACLFNQSYGGDSSLTATLVQDAITLANDVIPG------- 201
Query: 194 NGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQL-ASSGGVRKMFAHCLDGIN 252
FGC SG G++G G+ S+ISQ A GV F++CL
Sbjct: 202 ---FTFGCINAVSG-----GSIPPQGLLGLGRGPISLISQAGAMYSGV---FSYCLPSFK 250
Query: 253 G---GGIFAIGHVVQPE-VNKTPLVPNQPH----YSINMTAVQVGLDFLNLPTD--VFGV 302
G +G V QP+ + TPL+ N PH Y +N+T V VG + +P++ VF
Sbjct: 251 SYYFSGSLKLGPVGQPKSIRTTPLLRN-PHRPSLYYVNLTGVSVGRIKVPIPSEQLVFDP 309
Query: 303 GDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPN 362
GTIIDSGT + + VY + + +Q + + ++ TCF + + P
Sbjct: 310 NTGAGTIIDSGTVITRFVQPVYFAIRDE-FRKQVNGPISSLGAFDTCFAATNEAEA--PA 366
Query: 363 VTFHFE 368
+T HFE
Sbjct: 367 ITLHFE 372
>gi|356509399|ref|XP_003523437.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 421
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 81/264 (30%), Positives = 114/264 (43%), Gaps = 27/264 (10%)
Query: 73 VGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNC-IQCKEC--PRRSSLGIELTLYDIKDSS 129
+G Y + IG PPK Y + +DTGSD+ WV C CK C PR LY
Sbjct: 61 LGYYTVSLAIGNPPKVYDLDIDTGSDLTWVQCDAPCKGCTLPRNR-------LY----KP 109
Query: 130 TGKFVTCDQEFCHGVYGGPLTDCT-ANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDL 188
G V C C + P C N C Y Y D S+ G ++D + +G L
Sbjct: 110 HGDLVKCVDPLCAAIQSAPNHHCAGPNEQCDYEVEYADQGSSLGVLLRDNIPLKFTNGSL 169
Query: 189 QTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL 248
L FGCG Q+ + + G++G G +S++SQL S G +R + HCL
Sbjct: 170 ----ARPMLAFGCGYDQTHH-GQNPPPSTAGVLGLGNGRTSILSQLHSLGLIRNVVGHCL 224
Query: 249 DGINGGGIFAIGHVVQPE-VNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKG 307
G GG +F ++ P V TPL+ Q + + L F T V G+
Sbjct: 225 SGRGGGFLFFGDQLIPPSGVVWTPLL--QSSSAQHYKTGPADLFFDRKTTSVKGL----E 278
Query: 308 TIIDSGTTLAYLPEMVYEPLVSKI 331
I DSG++ Y ++ LV+ I
Sbjct: 279 LIFDSGSSYTYFNSQAHKALVNLI 302
>gi|356526294|ref|XP_003531753.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 414
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 89/318 (27%), Positives = 143/318 (44%), Gaps = 47/318 (14%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y +G+G+ K+ V +DTGSD+ WV C C C + ++ SS+ + V+
Sbjct: 65 YIVTMGLGS--KNMTVIIDTGSDLTWVQCEPCMSCYNQQG-----PIFKPSTSSSYQSVS 117
Query: 136 CDQEFCHGVY--GGPLTDCTAN--TSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
C+ C + G C ++ ++C Y+ YGDGS T G + + + VS
Sbjct: 118 CNSSTCQSLQFATGNTGACGSSNPSTCNYVVNYGDGSYTNGELGVEALSFGGVSVS---- 173
Query: 192 STNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQL-ASSGGVRKMFAHCLDG 250
+FGCG G + G++G G+S S++SQ A+ GGV F++CL
Sbjct: 174 ----DFVFGCGRNNKGLFG-----GVSGLMGLGRSYLSLVSQTNATFGGV---FSYCLPT 221
Query: 251 INGG--GIFAIGHVVQPEVNKTPL----VPNQPH----YSINMTAVQVGLDFLNLPTDVF 300
G G +G+ N P+ + + P Y +N+T + VG L P F
Sbjct: 222 TEAGSSGSLVMGNESSVFKNANPITYTRMLSNPQLSNFYILNLTGIDVGGVALKAPLS-F 280
Query: 301 GVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQ---QPDLKVHTVHDEYTCFQYSESVD 357
G N G +IDSGT + LP VY+ L ++ + + P ++ D TCF + +
Sbjct: 281 G---NGGILIDSGTVITRLPSSVYKALKAEFLKKFTGFPSAPGFSILD--TCFNLTGYDE 335
Query: 358 EGFPNVTFHFENSVSLKV 375
P ++ FE + L V
Sbjct: 336 VSIPTISLRFEGNAQLNV 353
>gi|6580159|emb|CAB62657.2| putative protein [Arabidopsis thaliana]
Length = 475
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 61/211 (28%), Positives = 97/211 (45%), Gaps = 16/211 (7%)
Query: 75 LYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRR-SSLG----IELTLYDIKDSS 129
LYYA + +GTPP + V +DTGSD+ W+ C C R +G + L LY S+
Sbjct: 101 LYYANVSVGTPPSSFLVALDTGSDLFWLPCNCGTTCIRDLEDIGVPQSVPLNLYTPNAST 160
Query: 130 TGKFVTCDQEFCHGVYGGPLTDCTANTS-CPYLEIYGDGSSTTGYFVQDVVQYDKVSGDL 188
T + C + C G C++ +S CPY Y + + T G +QDV+ + D
Sbjct: 161 TSSSIRCSDKRCFGS-----KKCSSPSSICPYQISYSNSTGTKGTLLQDVLHL--ATEDE 213
Query: 189 QTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL 248
T ++ GCG +Q+G N +++G++G G S+ S LA + F+ C
Sbjct: 214 NLTPVKANVTLGCGQKQTGLFQRNN--SVNGVLGLGIKGYSVPSLLAKANITANSFSMCF 271
Query: 249 DGINGG-GIFAIGHVVQPEVNKTPLVPNQPH 278
+ G G + G + +TP + P
Sbjct: 272 GRVIGNVGRISFGDRGYTDQEETPFISVAPR 302
>gi|21595063|gb|AAM66069.1| putative aspartyl protease [Arabidopsis thaliana]
Length = 484
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 99/360 (27%), Positives = 160/360 (44%), Gaps = 54/360 (15%)
Query: 38 RERSLSL-LKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTG 96
R +SL L +K + ++ ++ +PL + + + Y + +G K+ + VDTG
Sbjct: 97 RVQSLQLKIKAMTSSTTEQSVSETQIPLTSGIKLESLN-YIVTVELG--GKNMSLIVDTG 153
Query: 97 SDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYG-----GPL-- 149
SD+ WV C C+ C + LYD SS+ K V C+ C + GP
Sbjct: 154 SDLTWVQCQPCRSCYNQQG-----PLYDPSVSSSYKTVFCNSSTCQDLVAATSNSGPCGG 208
Query: 150 TDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNL 209
+ T C Y+ YGDGS T G D+ + GD + + +FGCG G
Sbjct: 209 NNGVVKTPCEYVVSYGDGSYTRG----DLASESILLGDTKLE----NFVFGCGRNNKGLF 260
Query: 210 DSTNEEALDGIIGFGKSNSSMISQ-LASSGGVRKMFAHCL----DGINGGGIFAIGHVV- 263
++ G+S+ S++SQ L + GV F++CL DG +G F V
Sbjct: 261 GGSSGLMGL-----GRSSVSLVSQTLKTFNGV---FSYCLPSLEDGASGSLSFGNDSSVY 312
Query: 264 --QPEVNKTPLVPN---QPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAY 318
V+ TPLV N + Y +N+T +G + L + FG +G +IDSGT +
Sbjct: 313 TNSTSVSYTPLVQNPQLRSFYILNLTGASIG--GVELKSSSFG----RGILIDSGTVITR 366
Query: 319 LPEMVYEPLVSKIISQ---QPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKV 375
LP +Y+ + + + Q P +++ D TCF + D P + F+ + L+V
Sbjct: 367 LPPSIYKAVKIEFLKQFSGFPTAPGYSILD--TCFNLTSYEDISIPIIKMIFQGNAELEV 424
>gi|293335955|ref|NP_001168399.1| uncharacterized protein LOC100382168 precursor [Zea mays]
gi|223948009|gb|ACN28088.1| unknown [Zea mays]
gi|413922066|gb|AFW61998.1| hypothetical protein ZEAMMB73_694403 [Zea mays]
Length = 507
Score = 88.6 bits (218), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 91/327 (27%), Positives = 130/327 (39%), Gaps = 55/327 (16%)
Query: 83 GTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCH 142
G+P + V VDTGSD+ WV C C C + L+D S+T V C+ C
Sbjct: 155 GSPAANLTVIVDTGSDLTWVQCKPCSACYAQRD-----PLFDPAGSATYAAVRCNASACA 209
Query: 143 -------GVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNG 195
G G + + C Y YGDGS + G D V S G
Sbjct: 210 DSLRAATGTPGSCGSTGAGSEKCYYALAYGDGSFSRGVLATDTVALGGAS--------LG 261
Query: 196 SLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASS-GGVRKMFAHCLDGINGG 254
+FGCG G T G++G G++ S++SQ AS GGV F++CL G
Sbjct: 262 GFVFGCGLSNRGLFGGTA-----GLMGLGRTELSLVSQTASRYGGV---FSYCLPAATSG 313
Query: 255 ---GIFAIG---HVVQPEVNKTPLV-------PNQ-PHYSINMTAVQVGLDFLNLPTDVF 300
G ++G N TP+ P Q P Y +N+T VG L
Sbjct: 314 DASGSLSLGGGDDAASSYRNTTPVAYTRMIADPAQPPFYFLNVTGAAVGGTALA----AQ 369
Query: 301 GVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQ-----QPDLKVHTVHDEYTCFQYSES 355
G+G + +IDSGT + L VY + ++ + Q P ++ D TC+ +
Sbjct: 370 GLGASN-VLIDSGTVITRLAPSVYRAVRAEFMRQFGAAGYPAAPGFSILD--TCYDLTGH 426
Query: 356 VDEGFPNVTFHFENSVSLKVYPHEYLF 382
+ P +T E + V LF
Sbjct: 427 DEVKVPLLTLRLEGGADVTVDAAGMLF 453
>gi|125571687|gb|EAZ13202.1| hypothetical protein OsJ_03122 [Oryza sativa Japonica Group]
Length = 453
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 89/333 (26%), Positives = 134/333 (40%), Gaps = 43/333 (12%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y GIGTP + DTGSD++W C C C R S Y SS+
Sbjct: 88 GSGDYAMSFGIGTPATGLSGEADTGSDLIWTKCGACARCSPRGS-----PSYYPTSSSSA 142
Query: 132 KFVTCDQEFCHGVYGGPLTDCTA-----NTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSG 186
FV C C G PL A + +C Y YG+ T Y ++ G
Sbjct: 143 AFVACGDRTC-GELPRPLCSNVAGGGSGSGNCSYHYAYGNARDTHHYTEGILMTETFTFG 201
Query: 187 DLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQL-ASSGGVRK--- 242
D + + FGC R G + + G++G G+ S+++QL + G R
Sbjct: 202 D--DAAAFPGIAFGCTLRSEGGFGTGS-----GLVGLGRGKLSLVTQLNVEAFGYRLSSD 254
Query: 243 -------MFAHCLDGINGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNL 295
F D G G + + P+V + P Y + +T + VG + +
Sbjct: 255 LSAPSPISFGSLADVTGGNG----DSFMSTPLLTNPVVQDLPFYYVGLTGISVGGKLVQI 310
Query: 296 PTDVFGVGDNKGT---IIDSGTTLAYLPE----MVYEPLVSKIISQQPDLKVHTVHDEYT 348
P+ F + G I DSGTTL LP+ +V + L+S++ Q+P + D+
Sbjct: 311 PSGTFSFDRSTGAGGVIFDSGTTLTMLPDPAYTLVRDELLSQMGFQKPPPAAN--DDDLI 368
Query: 349 CFQYSESVDEGFPNVTFHFENSVSLKVYPHEYL 381
CF S FP++ HF+ + + YL
Sbjct: 369 CFTGGSSTTT-FPSMVLHFDGGADMDLSTENYL 400
>gi|18412482|ref|NP_565219.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|19699359|gb|AAL91289.1| At1g79720/F19K16_30 [Arabidopsis thaliana]
gi|26450464|dbj|BAC42346.1| unknown protein [Arabidopsis thaliana]
gi|115646741|gb|ABJ17101.1| At1g79720 [Arabidopsis thaliana]
gi|332198170|gb|AEE36291.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 484
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 99/360 (27%), Positives = 160/360 (44%), Gaps = 54/360 (15%)
Query: 38 RERSLSL-LKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTG 96
R +SL L +K + ++ ++ +PL + + + Y + +G K+ + VDTG
Sbjct: 97 RVQSLQLKIKAMTSSTTEQSVSETQIPLTSGIKLESLN-YIVTVELG--GKNMSLIVDTG 153
Query: 97 SDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYG-----GPL-- 149
SD+ WV C C+ C + LYD SS+ K V C+ C + GP
Sbjct: 154 SDLTWVQCQPCRSCYNQQG-----PLYDPSVSSSYKTVFCNSSTCQDLVAATSNSGPCGG 208
Query: 150 TDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNL 209
+ T C Y+ YGDGS T G D+ + GD + + +FGCG G
Sbjct: 209 NNGVVKTPCEYVVSYGDGSYTRG----DLASESILLGDTKLE----NFVFGCGRNNKGLF 260
Query: 210 DSTNEEALDGIIGFGKSNSSMISQ-LASSGGVRKMFAHCL----DGINGGGIFAIGHVV- 263
++ G+S+ S++SQ L + GV F++CL DG +G F V
Sbjct: 261 GGSSGLMGL-----GRSSVSLVSQTLKTFNGV---FSYCLPSLEDGASGSLSFGNDSSVY 312
Query: 264 --QPEVNKTPLVPN---QPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAY 318
V+ TPLV N + Y +N+T +G + L + FG +G +IDSGT +
Sbjct: 313 TNSTSVSYTPLVQNPQLRSFYILNLTGASIG--GVELKSSSFG----RGILIDSGTVITR 366
Query: 319 LPEMVYEPLVSKIISQ---QPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKV 375
LP +Y+ + + + Q P +++ D TCF + D P + F+ + L+V
Sbjct: 367 LPPSIYKAVKIEFLKQFSGFPTAPGYSILD--TCFNLTSYEDISIPIIKMIFQGNAELEV 424
>gi|224091907|ref|XP_002309394.1| predicted protein [Populus trichocarpa]
gi|222855370|gb|EEE92917.1| predicted protein [Populus trichocarpa]
Length = 469
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 98/379 (25%), Positives = 148/379 (39%), Gaps = 57/379 (15%)
Query: 41 SLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIM 100
SLSL + H + + + + PL P G Y + GTPP+ +DTGS ++
Sbjct: 61 SLSLSRAHHIKSPKTKFSLLKTPL----FPRSYGGYSISLNFGTPPQTTKFVMDTGSSLV 116
Query: 101 WVNCIQCKECPRRSSLGIELT---LYDIKDSSTGKFVTCDQEFCHGVYG----------G 147
W C C R IE+T + K SS+ + C C ++G
Sbjct: 117 WFPCTSRYLCSRCDFPNIEVTGIPTFIPKQSSSSNLIGCKNHKCSWLFGPKVQSKCQECD 176
Query: 148 PLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSG 207
P T + PY+ YG G ST G + + + D T + GC
Sbjct: 177 PTTQNCTQSCPPYVIQYGLG-STAGLLLSETL-------DFPHKKTIPGFLVGC------ 222
Query: 208 NLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHVVQPEV 267
+L S + +GI GFG+S S+ SQL + +H D + +
Sbjct: 223 SLFSIRQP--EGIAGFGRSPESLPSQLGLKKFSYCLVSHAFDDTPASSDLVLDTGSGSDD 280
Query: 268 NKTPLVPNQP-----------HYSINMTAVQVGLDFLNLPTD--VFGVGDNKGTIIDSGT 314
KTP + P +Y + + + +G + +P V G N GTI+DSGT
Sbjct: 281 TKTPGLSYTPFQKNPTAAFRDYYYVLLRNIVIGDTHVKVPYKFLVPGSDGNGGTIVDSGT 340
Query: 315 TLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEY-------TCFQYSESVDEGFPNVTFHF 367
T ++ + VYE LV+K +Q + +TV E CF S P FHF
Sbjct: 341 TFTFMEKPVYE-LVAKEFEKQ--VAHYTVATEVQNQTGLRPCFNISGEKSVSVPEFIFHF 397
Query: 368 ENSVSLKVYPHEYLFPFED 386
+ + + P F F D
Sbjct: 398 KGGAKMAL-PLANYFSFVD 415
>gi|302794412|ref|XP_002978970.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
gi|300153288|gb|EFJ19927.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
Length = 462
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 83/332 (25%), Positives = 140/332 (42%), Gaps = 43/332 (12%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKF 133
G YY I +G+P ++ + VDTGS++ W+ C+ CK C T+YD S + K
Sbjct: 98 GEYYTSIKLGSPGQEAILIVDTGSELTWLKCLPCKVCAPSVD-----TIYDAARSVSYKP 152
Query: 134 VTCDQ-EFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTS 192
VTC+ + C G C + C + YGDGS + G D + + V G T
Sbjct: 153 VTCNNSQLCSNSSQGTYAYCARGSQCQFAAFYGDGSFSYGSLSTDTLIMETVVGGKPVTV 212
Query: 193 TNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL---- 248
+ FGC G+L+ A GI+G ++ QL G + F+HC
Sbjct: 213 QD--FAFGCA---QGDLELVPTGA-SGILGLNAGKMALPMQLGQRFGWK--FSHCFPDRS 264
Query: 249 DGINGGGIFAIGHVVQPEVNKTPLVPNQPHY-SINMTAVQVGLDFLNLPTDVFGVGDN-- 305
+N G+ G+ P Q Y S+ +T ++ F ++ + +
Sbjct: 265 SHLNSTGVVFFGNAELPH--------EQVQYTSVALTNSELQRKFYHVALKGVSINSHEL 316
Query: 306 ----KGT--IIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEY----TCFQYS-E 354
+G+ I+DSG++ + + L + +P H D + TCF+ S +
Sbjct: 317 VLLPRGSVVILDSGSSFSSFVRPFHSQLREAFLKHRPPSLKHLEGDSFGDLGTCFKVSND 376
Query: 355 SVDE---GFPNVTFHFENSVSLKVYPHEYLFP 383
+DE P+++ FE+ V++ + L P
Sbjct: 377 DIDELHRTLPSLSLVFEDGVTIGIPSIGVLLP 408
>gi|218187618|gb|EEC70045.1| hypothetical protein OsI_00635 [Oryza sativa Indica Group]
Length = 570
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 83/350 (23%), Positives = 140/350 (40%), Gaps = 54/350 (15%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y + +G+PP+ DTGSD++WV C + SS T +D SST V+
Sbjct: 101 YLMTVNLGSPPRSMLAIADTGSDLVWVKCKKGNN--DTSSAAAPTTQFDPSRSSTYGRVS 158
Query: 136 CDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDK-VSGDLQTTSTN 194
C + C + G T C ++C YL YGDGS+TTG + +D +G
Sbjct: 159 CQTDACEAL--GRAT-CDDGSNCAYLYAYGDGSNTTGVLSTETFTFDDGGAGRSPRQVRI 215
Query: 195 GSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL--DGIN 252
G + FGC +G+ + L S+++QL + + + F++CL +N
Sbjct: 216 GGVKFGCSTATAGSFPADGLVGLG------GGAVSLVTQLGGATSLGRRFSYCLVPHSVN 269
Query: 253 GGGIF---AIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTI 309
A+ V +P TPLV N+ S + + I
Sbjct: 270 ASSALNFGALADVTEPGAASTPLVGNKTVASAASSRI----------------------I 307
Query: 310 IDSGTTLAYLPEMVYEPLVSKIIS-------QQPDLKVHTVHDEYTCFQYSESVDEGFPN 362
+DSGTTL +L + P+V ++ Q PD + Y E P+
Sbjct: 308 VDSGTTLTFLDPSLLGPIVDELSRRITLPPVQSPDGLLQLC---YNVAGREVEAGESIPD 364
Query: 363 VTFHFENSVSLKVYPHEYLFPF-EDLWCIGWQNSGMQSRDRKNMTLLGDF 411
+T F ++ + P E C+ + + +++ +++LG+
Sbjct: 365 LTLEFGGGAAVALKPENAFVAVQEGTLCLAI----VATTEQQPVSILGNL 410
>gi|125527370|gb|EAY75484.1| hypothetical protein OsI_03384 [Oryza sativa Indica Group]
Length = 453
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 89/333 (26%), Positives = 134/333 (40%), Gaps = 43/333 (12%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y GIGTP + DTGSD++W C C C R S Y SS+
Sbjct: 88 GSGDYAMSFGIGTPATGLSGEADTGSDLIWTKCGACARCSPRGS-----PSYYPTSSSSA 142
Query: 132 KFVTCDQEFCHGVYGGPLTDCTA-----NTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSG 186
FV C C G PL A + +C Y YG+ T Y ++ G
Sbjct: 143 AFVACGDRTC-GELPRPLCSNVAGGGSGSGNCSYHYAYGNARDTHHYTEGILMTETFTFG 201
Query: 187 DLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQL-ASSGGVRK--- 242
D + + FGC R G + + G++G G+ S+++QL + G R
Sbjct: 202 D--DAAAFPGIAFGCTLRSEGGFGTGS-----GLVGLGRGKLSLVTQLNVEAFGYRLSSD 254
Query: 243 -------MFAHCLDGINGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNL 295
F D G G + + P+V + P Y + +T + VG + +
Sbjct: 255 LSAPSPISFGSLADVTGGNG----DSFMSTPLLTNPVVQDLPFYYVGLTGISVGGKLVQI 310
Query: 296 PTDVFGVGDNKGT---IIDSGTTLAYLPE----MVYEPLVSKIISQQPDLKVHTVHDEYT 348
P+ F + G I DSGTTL LP+ +V + L+S++ Q+P + D+
Sbjct: 311 PSGTFSFDRSTGAGGVIFDSGTTLTMLPDPAYTLVRDELLSQMGFQKPPPAAN--DDDLI 368
Query: 349 CFQYSESVDEGFPNVTFHFENSVSLKVYPHEYL 381
CF S FP++ HF+ + + YL
Sbjct: 369 CFTGGSSTTT-FPSMVLHFDGGADMDLSTENYL 400
>gi|413944387|gb|AFW77036.1| hypothetical protein ZEAMMB73_461996 [Zea mays]
Length = 472
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 98/350 (28%), Positives = 149/350 (42%), Gaps = 43/350 (12%)
Query: 44 LLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVN 103
+ ++ A + L+ V +P + D + Y +GIGTP V +DTGSD+ WV
Sbjct: 96 ITRKAKASGRTTTLSDVSIPTSLGAAVDSLE-YVVTLGIGTPAVQQTVLIDTGSDLSWV- 153
Query: 104 CIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGV----YGGPLTDCTANTSCP 159
QCK C S + LYD SST V CD + C + Y T+ + + C
Sbjct: 154 --QCKPCNSSSCYPQKDPLYDPTASSTYAPVPCDSKACKDLVPDAYDHGCTNSSGTSLCQ 211
Query: 160 YLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDG 219
Y YG+ +T G + + + L + FGCG Q G D +
Sbjct: 212 YGIEYGNRDTTVGVYSTETLT-------LSPQVSVKDFGFGCGLVQQGTFDLFDGLLG-- 262
Query: 220 IIGFGKSNSSMISQLASSGGVRKMFAHCL-DGINGGGIFAIGHVVQPEVNK-------TP 271
G + S++SQ A + G F++CL G + G A+G P N TP
Sbjct: 263 ---LGGAPESLVSQTAETYG--GAFSYCLPPGNSTTGFLALG---APTNNNDTAGFLFTP 314
Query: 272 L--VPNQP-HYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLV 328
L +P Q Y +N+T V VG L++P V + G IIDSGT + LP+ Y L
Sbjct: 315 LHSLPEQATFYLVNLTGVSVGGKPLDIPPTVL----SGGMIIDSGTIITGLPDTAYSALR 370
Query: 329 SKI---ISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKV 375
+ +S P L + TC+ ++ + P V F+ ++ +
Sbjct: 371 TAFRTAMSAYPLLPPNNDDVLDTCYNFTGIANVTVPTVALTFDGGATIDL 420
>gi|7715602|gb|AAF68120.1|AC010793_15 F20B17.14 [Arabidopsis thaliana]
gi|12324588|gb|AAG52249.1|AC011717_17 putative aspartyl protease; 105611-106921 [Arabidopsis thaliana]
Length = 436
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 99/360 (27%), Positives = 160/360 (44%), Gaps = 54/360 (15%)
Query: 38 RERSLSL-LKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTG 96
R +SL L +K + ++ ++ +PL + + + Y + +G K+ + VDTG
Sbjct: 49 RVQSLQLKIKAMTSSTTEQSVSETQIPLTSGIKLESLN-YIVTVELG--GKNMSLIVDTG 105
Query: 97 SDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYG-----GPL-- 149
SD+ WV C C+ C + LYD SS+ K V C+ C + GP
Sbjct: 106 SDLTWVQCQPCRSCYNQQG-----PLYDPSVSSSYKTVFCNSSTCQDLVAATSNSGPCGG 160
Query: 150 TDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNL 209
+ T C Y+ YGDGS T G D+ + GD + + +FGCG G
Sbjct: 161 NNGVVKTPCEYVVSYGDGSYTRG----DLASESILLGDTKLE----NFVFGCGRNNKGLF 212
Query: 210 DSTNEEALDGIIGFGKSNSSMISQ-LASSGGVRKMFAHCL----DGINGGGIFAIGHVV- 263
++ G+S+ S++SQ L + GV F++CL DG +G F V
Sbjct: 213 GGSSGLMGL-----GRSSVSLVSQTLKTFNGV---FSYCLPSLEDGASGSLSFGNDSSVY 264
Query: 264 --QPEVNKTPLVPN---QPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAY 318
V+ TPLV N + Y +N+T +G + L + FG +G +IDSGT +
Sbjct: 265 TNSTSVSYTPLVQNPQLRSFYILNLTGASIG--GVELKSSSFG----RGILIDSGTVITR 318
Query: 319 LPEMVYEPLVSKIISQ---QPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKV 375
LP +Y+ + + + Q P +++ D TCF + D P + F+ + L+V
Sbjct: 319 LPPSIYKAVKIEFLKQFSGFPTAPGYSILD--TCFNLTSYEDISIPIIKMIFQGNAELEV 376
>gi|414878073|tpg|DAA55204.1| TPA: hypothetical protein ZEAMMB73_344109 [Zea mays]
Length = 440
Score = 88.2 bits (217), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 79/280 (28%), Positives = 120/280 (42%), Gaps = 42/280 (15%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y A+ IG PP+ +DTGS+++W QC C L+ YD S T + V
Sbjct: 71 YIAEYLIGDPPQQAEAIIDTGSNLIWT---QCSTCQPAGCFSQNLSFYDPSRSRTARPVA 127
Query: 136 CDQEFCHGVYGGPLTDCTA-NTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTN 194
C+ C G T C N +C L YG G + V+ + + Q S N
Sbjct: 128 CNDTACA---LGSETRCARDNKACAVLTAYGAG------VIGGVLGTEAFT--FQPQSEN 176
Query: 195 GSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQL-------------ASSGGVR 241
SL FGC A + L + + GIIG G+ N S++SQL + S
Sbjct: 177 VSLAFGCIA--ATRLTPGSLDGASGIIGLGRGNLSLVSQLGDNKFSYCLTPYFSQSTNTS 234
Query: 242 KMFAHCLDGINGGGIFA--IGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDV 299
++F G++ GG A + + P+V+ P Y + +T + VG L +P
Sbjct: 235 RLFVGASAGLSSGGAPATSVPFLKNPDVD-----PFSTFYYLPLTGITVGDAKLAVPEAA 289
Query: 300 F-----GVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQ 334
F G GT+IDSG+ L ++ Y+ L +++ Q
Sbjct: 290 FDLRQVATGLWAGTLIDSGSPFTSLVDVAYQALRDELVQQ 329
>gi|238011160|gb|ACR36615.1| unknown [Zea mays]
Length = 461
Score = 88.2 bits (217), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 85/380 (22%), Positives = 141/380 (37%), Gaps = 63/380 (16%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQ-----------------------CK 108
G G Y+ + +GTP + + + DTGSD+ WV C +
Sbjct: 51 GTGQYFVRFRVGTPARPFLLVADTGSDLTWVKCRRHAAPAPAPAPAPGYNYGYGAPASND 110
Query: 109 ECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDC-TANTSCPYLEIYGDG 167
++ ++ S T + C + C L C T + C Y Y DG
Sbjct: 111 SSSVSAAASSPARVFRPDRSRTWAPIPCSSDTCTASLPFSLAACPTPGSPCAYEYRYKDG 170
Query: 168 SSTTGYFVQD---VVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFG 224
S+ G D + + +G Q + ++ GC +G + A DG++ G
Sbjct: 171 SAARGTVGTDSATIALSGRRAGKKQRRAKLRGVVLGCTTSYTGE----SFLASDGVLSLG 226
Query: 225 KSNSSMISQLASSGGVRKMFAHCL---------------------DGINGGGIFAIGHVV 263
SN S S+ A+ G R F++CL + G
Sbjct: 227 YSNVSFASRAAARFGGR--FSYCLVDHLAPRNATSYLTFGPNPAVSSASASRTACAGSAA 284
Query: 264 QPEVNKTPLVPN---QPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLP 320
P +TPL+ + +P Y++ + V V + L +P V+ V G I+DSGT+L L
Sbjct: 285 APGARQTPLLLDHRMRPFYAVAVNGVSVDGELLRIPRLVWDVQKGGGAILDSGTSLTVLV 344
Query: 321 EMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYS-----ESVDEGFPNVTFHFENSVSLKV 375
Y +V+ + + L + C+ ++ E + P + HF S L+
Sbjct: 345 SPAYRAVVAALGKKLVGLPRVAMDPFDYCYNWTSPLTGEDLAVAVPALAVHFAGSARLQP 404
Query: 376 YPHEYLF-PFEDLWCIGWQN 394
P Y+ + CIG Q
Sbjct: 405 PPKSYVIDAAPGVKCIGLQE 424
>gi|224130234|ref|XP_002328687.1| predicted protein [Populus trichocarpa]
gi|222838863|gb|EEE77214.1| predicted protein [Populus trichocarpa]
Length = 603
Score = 88.2 bits (217), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 93/348 (26%), Positives = 143/348 (41%), Gaps = 82/348 (23%)
Query: 28 VFSVKYRYAGRERSLSLLKEHDAR--RQQRILAGVD------------------------ 61
VF + ++ RE L+ L+ H R ++ ++A VD
Sbjct: 127 VFPLYHKLGIREIPLNDLENHLRRFVYKENLVASVDHLNGPHKISKLASSNAAAAMDSSA 186
Query: 62 -LPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNC-IQCKEC--------- 110
P+ G+ PDG PP+ YY+ DTGSD+ W+ C C C
Sbjct: 187 IFPVRGNLYPDG------------PPQPYYLDFDTGSDLTWIQCDAPCTSCAKGANAWYK 234
Query: 111 PRRSSL--GIELTLYDI-KDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDG 167
PRR ++ +L ++ ++ G TCDQ C Y Y D
Sbjct: 235 PRRGNIVPPKDLLCMEVQRNQKAGYCETCDQ-------------------CDYEIEYADH 275
Query: 168 SSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSN 227
SS+ G D + +G L T + IFGC Q G L T + DGI+G ++
Sbjct: 276 SSSMGVLATDKLLLMVANGSL----TKLNFIFGCAYDQQGLLLKTLVKT-DGILGLSRAK 330
Query: 228 SSMISQLASSGGVRKMFAHCLDG-INGGGIFAIGHVVQPE--VNKTPLVPNQPHYSINMT 284
S+ SQLAS G + + HCL + GGG +G P + P++ + P T
Sbjct: 331 VSLPSQLASQGIINNVIGHCLTTDLGGGGYMFLGDDFVPRWGMAWVPML-DSPSMEFYHT 389
Query: 285 AVQVGLDFLNLPTDVFGVGDN-KGTIIDSGTTLAYLPEMVYEPLVSKI 331
V V L++ + P + G+ K + DSG++ Y P+ Y LV+ +
Sbjct: 390 EV-VKLNYGSSPLSLGGMESRVKHILFDSGSSYTYFPKEAYSELVASL 436
>gi|224102847|ref|XP_002312826.1| predicted protein [Populus trichocarpa]
gi|222849234|gb|EEE86781.1| predicted protein [Populus trichocarpa]
Length = 445
Score = 88.2 bits (217), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 94/365 (25%), Positives = 154/365 (42%), Gaps = 54/365 (14%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQ---CKEC-PRRSSLGIELTLYDIKDSS 129
G Y + GTPP+ +DTGSDI+W C CK C SS + + K+SS
Sbjct: 65 GGYSVSLSFGTPPQTLSFIMDTGSDIVWFPCTSHYLCKHCSFSSSSPSSRIQPFIPKESS 124
Query: 130 TGKFVTCDQEFCHGVYGGPLT---DCTA----NTSC-PYLEIYGDGSSTTGYFVQDVVQY 181
+ K + C C ++ + DC+ N +C PY+ YG G +T G + + +
Sbjct: 125 SSKLLGCKNPKCSWIHHSNINCDQDCSIKSCLNQTCPPYMIFYGSG-TTGGVALSETLHL 183
Query: 182 DKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVR 241
+S + + GC ++ S+++ A GI GFG+ SS+ SQL
Sbjct: 184 HSLS--------KPNFLVGC------SVFSSHQPA--GIAGFGRGLSSLPSQLGLGKFSY 227
Query: 242 KMFAHCLDGINGGGIFAIGHVVQPEVNK-------TPLVPNQP---------HYSINMTA 285
+ +H D + + Q + +K TP V N +Y + +
Sbjct: 228 CLLSHRFDDDTKKSSSLVLDMEQLDSDKKTNALVYTPFVKNPKVDNKSSFSVYYYLGLRR 287
Query: 286 VQVGLDFLNLPTDVFGVGD--NKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDL-KVHT 342
+ VG + +P G+ N G IIDSGTT ++ +EPL + I Q D +V
Sbjct: 288 ITVGGHHVKVPYKYLSPGEDGNGGVIIDSGTTFTFMAREAFEPLSDEFIRQIKDYRRVKE 347
Query: 343 VHDEY---TCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFE--DLWCIGWQNSGM 397
+ D CF S++ FP + +F+ + + P E F F ++ C+ G+
Sbjct: 348 IEDAIGLRPCFNVSDAKTVSFPELRLYFKGGADVAL-PVENYFAFVGGEVACLTVVTDGV 406
Query: 398 QSRDR 402
+R
Sbjct: 407 AGPER 411
>gi|125529158|gb|EAY77272.1| hypothetical protein OsI_05246 [Oryza sativa Indica Group]
Length = 426
Score = 87.8 bits (216), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 92/341 (26%), Positives = 142/341 (41%), Gaps = 41/341 (12%)
Query: 50 ARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKE 109
A+ + + A +P+ + + Y A+ G+GTP + V +D +D WV C C
Sbjct: 57 AKPKPKNRANPPVPIAPGRQILSIPNYIARAGLGTPAQTLLVAIDPSNDAAWVPCSACAG 116
Query: 110 CPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTA--NTSCPYLEIYGDG 167
C S + SST + V C C V P C A +SC + Y
Sbjct: 117 CAASSP------SFSPTQSSTYRTVPCGSPQCAQV---PSPSCPAGVGSSCGFNLTYAAS 167
Query: 168 SSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSN 227
+ Q V+ D ++ + S FGC SG N G+IGFG+
Sbjct: 168 T------FQAVLGQDSLALENNVVV---SYTFGCLRVVSG-----NSVPPQGLIGFGRGP 213
Query: 228 SSMISQLASSGGVRKMFAHCLDGI---NGGGIFAIGHVVQPE-VNKTPLV--PNQPH-YS 280
S +SQ + G +F++CL N G +G + QP+ + TPL+ P++P Y
Sbjct: 214 LSFLSQTKDTYG--SVFSYCLPNYRSSNFSGTLKLGPIGQPKRIKTTPLLYNPHRPSLYY 271
Query: 281 INMTAVQVGLDFLNLPTD--VFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDL 338
+NM ++VG + +P F GTIID+GT L VY + +
Sbjct: 272 VNMIGIRVGSKVVQVPQSALAFNPVTGSGTIIDAGTMFTRLAAPVYAAVRDAFRGRVRTP 331
Query: 339 KVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHE 379
+ TC+ + SV P VTF F +V++ + P E
Sbjct: 332 VAPPLGGFDTCYNVTVSV----PTVTFMFAGAVAVTL-PEE 367
>gi|449466304|ref|XP_004150866.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449485213|ref|XP_004157102.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 473
Score = 87.8 bits (216), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 88/321 (27%), Positives = 137/321 (42%), Gaps = 47/321 (14%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKD---- 127
G G Y +G+GTP K + DTGSD+ W QC+ C R Y+ KD
Sbjct: 127 GSGNYIVSVGLGTPKKYLSLIFDTGSDLTWT---QCQPCARY--------CYNQKDPVFV 175
Query: 128 ---SSTGKFVTCDQEFCHGVYGGPLTD--CTANTSCPYLEIYGDGSSTTGYFVQDVVQYD 182
S+T ++C C + G C+A +C Y YGD S + GYF ++ +
Sbjct: 176 PSQSTTYSNISCSSPDCSQLESGTGNQPGCSAARACIYGIQYGDQSFSVGYFAKETL--- 232
Query: 183 KVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRK 242
L +T + +FGCG G S G+IG G+ S++ Q A G +
Sbjct: 233 ----TLTSTDVIENFLFGCGQNNRGLFGSAA-----GLIGLGQDKISIVKQTAQKYG--Q 281
Query: 243 MFAHCLDGING--GGIFAIGHVVQPEVNKTPLVPNQ---PHYSINMTAVQVGLDFLNLPT 297
+F++CL + G + G + TP+ Y +++ ++VG + + +
Sbjct: 282 VFSYCLPKTSSSTGYLTFGGGGGGGALKYTPITKAHGVANFYGVDIVGMKVGGTQIPISS 341
Query: 298 DVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVS---KIISQQPDLKVHTVHDEYTCFQYSE 354
VF G IIDSGT + LP Y L S K +++ P ++ D TC+ S+
Sbjct: 342 SVF---STSGAIIDSGTVITRLPPDAYSALKSAFEKGMAKYPKAPELSILD--TCYDLSK 396
Query: 355 SVDEGFPNVTFHFENSVSLKV 375
P V F F+ L +
Sbjct: 397 YSTIQIPKVGFVFKGGEELDL 417
>gi|255564685|ref|XP_002523337.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223537425|gb|EEF39053.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 469
Score = 87.8 bits (216), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 97/367 (26%), Positives = 144/367 (39%), Gaps = 44/367 (11%)
Query: 34 RYAGRERSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQV 93
R A R ++S L E A +R+ G + S G G Y+ +IG+GTPP+ Y+ +
Sbjct: 86 RDAARVEAISYLAE-TAGTGKRVGTGFSSSVI-SGLAQGSGEYFTRIGVGTPPRYVYMVL 143
Query: 94 DTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCT 153
DTGSDI+W+ C CK C +S ++D + S + + C CH + P + T
Sbjct: 144 DTGSDIVWIQCAPCKRCYAQSD-----PVFDPRKSRSFASIACRSPLCHRL-DSPGCN-T 196
Query: 154 ANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTN 213
+C Y YGDGS T G F + + + + + + GCG +
Sbjct: 197 QKQTCMYQVSYGDGSFTFGDFSTETLTFRR--------TRVARVALGCGH---------D 239
Query: 214 EEALDGIIGFGKSNSSMISQLASSGGVR--KMFAHCLDGINGGG-----IFAIGHVVQPE 266
E L S G R F++CL + +F V
Sbjct: 240 NEGLFVGAAGLLGLGRGRLSFPSQTGRRFNHKFSYCLVDRSASSKPSSMVFG-DSAVSRT 298
Query: 267 VNKTPLVPN---QPHYSINMTAVQV-GLDFLNLPTDVFGVGD--NKGTIIDSGTTLAYLP 320
TPLV N Y + + + V G + +F + N G IIDSGT++ L
Sbjct: 299 ARFTPLVSNPKLDTFYYVELLGISVGGTRVPGITASLFKLDQTGNGGVIIDSGTSVTRLT 358
Query: 321 EMVYEPLVSKIISQQPDLKVHTVHDEY-TCFQYSESVDEGFPNVTFHFENS-VSLKVYPH 378
Y + +LK + TCF S + P V HF + VSL
Sbjct: 359 RPAYIAFRDAFRAGASNLKRAPQFSLFDTCFDLSGKTEVKVPTVVLHFRGADVSLPA--S 416
Query: 379 EYLFPFE 385
YL P +
Sbjct: 417 NYLIPVD 423
>gi|186510920|ref|NP_190702.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332645260|gb|AEE78781.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 530
Score = 87.8 bits (216), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 74/259 (28%), Positives = 114/259 (44%), Gaps = 27/259 (10%)
Query: 75 LYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKEC-----PRRSSLGIELTLYDIKDSS 129
L+YA + +GTP + V +DTGSD+ W+ C C R S + L LY S+
Sbjct: 102 LHYANVSLGTPATWFLVALDTGSDLFWLPCNCGTTCIHDLKDARFSESVPLNLYTPNAST 161
Query: 130 TGKFVTCDQEFCHGVYGGPLTDCTANTS-CPYLEIYGDGSSTTGYFVQDVVQYDKVSGDL 188
T + C + C G C++ S CPY + TTG +QDV+ V+ D
Sbjct: 162 TSSSIRCSDKRCFGS-----GKCSSPESICPYQIALSSNTVTTGTLLQDVLHL--VTEDE 214
Query: 189 QTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL 248
N ++ GCG Q+G + + A++G++G S+ S LA + F+ C
Sbjct: 215 DLKPVNANVTLGCGQNQTGAFQT--DIAVNGVLGLSMKEYSVPSLLAKANITANSFSMCF 272
Query: 249 DG-INGGGIFAIGHVVQPEVNKTPLV--PNQPHYSINMTAVQVGLDFLNLPTDVFGVGDN 305
I+ G + G + +TPLV Y +N+T V VG +P DV
Sbjct: 273 GRIISVVGRISFGDKGYTDQEETPLVSLETSTAYGVNVTGVSVG----GVPVDVPLFA-- 326
Query: 306 KGTIIDSGTTLAYLPEMVY 324
+ D+G++ L E Y
Sbjct: 327 ---LFDTGSSFTLLLESAY 342
>gi|6562286|emb|CAB62656.1| putative protein [Arabidopsis thaliana]
Length = 518
Score = 87.8 bits (216), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 74/259 (28%), Positives = 114/259 (44%), Gaps = 27/259 (10%)
Query: 75 LYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKEC-----PRRSSLGIELTLYDIKDSS 129
L+YA + +GTP + V +DTGSD+ W+ C C R S + L LY S+
Sbjct: 90 LHYANVSLGTPATWFLVALDTGSDLFWLPCNCGTTCIHDLKDARFSESVPLNLYTPNAST 149
Query: 130 TGKFVTCDQEFCHGVYGGPLTDCTANTS-CPYLEIYGDGSSTTGYFVQDVVQYDKVSGDL 188
T + C + C G C++ S CPY + TTG +QDV+ V+ D
Sbjct: 150 TSSSIRCSDKRCFGS-----GKCSSPESICPYQIALSSNTVTTGTLLQDVLHL--VTEDE 202
Query: 189 QTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL 248
N ++ GCG Q+G + + A++G++G S+ S LA + F+ C
Sbjct: 203 DLKPVNANVTLGCGQNQTGAFQT--DIAVNGVLGLSMKEYSVPSLLAKANITANSFSMCF 260
Query: 249 DG-INGGGIFAIGHVVQPEVNKTPLV--PNQPHYSINMTAVQVGLDFLNLPTDVFGVGDN 305
I+ G + G + +TPLV Y +N+T V VG +P DV
Sbjct: 261 GRIISVVGRISFGDKGYTDQEETPLVSLETSTAYGVNVTGVSVG----GVPVDVPLFA-- 314
Query: 306 KGTIIDSGTTLAYLPEMVY 324
+ D+G++ L E Y
Sbjct: 315 ---LFDTGSSFTLLLESAY 330
>gi|242086034|ref|XP_002443442.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
gi|241944135|gb|EES17280.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
Length = 443
Score = 87.8 bits (216), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 95/407 (23%), Positives = 161/407 (39%), Gaps = 68/407 (16%)
Query: 39 ERSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSD 98
R+++L ++ + + GV P+ ++R Y A+ +G PP+ +DTGS
Sbjct: 54 RRAIALSRQINLASTRAEGGGVSAPVHWATRQ-----YIAEYMVGDPPQRAEALIDTGSS 108
Query: 99 IMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSC 158
++W QC C R+ + +L ++ S + V C + C G Y L C + +C
Sbjct: 109 LIWT---QCTACLRKVCVRQDLPYFNASSSGSFAPVPCQDKACAGNY---LHFCALDGTC 162
Query: 159 PYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALD 218
+ YG G G+ D + S +L FGC + T A D
Sbjct: 163 TFRVTYGAG-GIIGFLGTDAFTFQ---------SGGATLAFGC-------VSFTRFAAPD 205
Query: 219 ------GIIGFGKSNSSMISQLAS-------------SGGVRKMFAHCLDGINGGGIFAI 259
G+IG G+ S+ SQ + +G +F ++GGG
Sbjct: 206 VLHGASGLIGLGRGRLSLASQTGAKRFSYCLTPYFHNNGASSHLFVGAAASLSGGG---- 261
Query: 260 GHVVQPEVNKTPL-VPNQPHYSINMTAVQVGLDFLNLPTDVFGVGD------NKGTIIDS 312
G V+ ++P P Y + + + VG L +P+ F + + G IIDS
Sbjct: 262 GAVMSMAFVESPKDYPYSTFYYLPLVGITVGETKLAIPSTAFDLQEVEEGFWEGGVIIDS 321
Query: 313 GTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDE----YTCFQYSESVDEGFPNVTFHFE 368
G+ L E YEPL+ ++ Q V ++ C + +D P + HF
Sbjct: 322 GSPFTSLVEDAYEPLMGELARQLNGSLVPPPGEDDGGMALCVARGD-LDRVVPTLVLHFS 380
Query: 369 NSVSLKVYPHEYLFPFE-DLWCI----GWQNSGMQSRDRKNMTLLGD 410
+ + P Y P E C+ G+ S + + ++NM +L D
Sbjct: 381 GGADMALPPENYWAPLEKSTACMAIVRGYLQSIIGNFQQQNMHILFD 427
>gi|115442309|ref|NP_001045434.1| Os01g0954900 [Oryza sativa Japonica Group]
gi|20161865|dbj|BAB90778.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
gi|113534965|dbj|BAF07348.1| Os01g0954900 [Oryza sativa Japonica Group]
gi|215766867|dbj|BAG99095.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 445
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 92/341 (26%), Positives = 142/341 (41%), Gaps = 41/341 (12%)
Query: 50 ARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKE 109
A+ + + A +P+ + + Y A+ G+GTP + V +D +D WV C C
Sbjct: 76 AKPKPKNRANPPVPIAPGRQILSIPNYIARAGLGTPAQTLLVAIDPSNDAAWVPCSACAG 135
Query: 110 CPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTA--NTSCPYLEIYGDG 167
C S + SST + V C C V P C A +SC + Y
Sbjct: 136 CAASSP------SFSPTQSSTYRTVPCGSPQCAQV---PSPSCPAGVGSSCGFNLTYAAS 186
Query: 168 SSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSN 227
+ Q V+ D ++ + S FGC SG N G+IGFG+
Sbjct: 187 T------FQAVLGQDSLALENNVVV---SYTFGCLRVVSG-----NSVPPQGLIGFGRGP 232
Query: 228 SSMISQLASSGGVRKMFAHCLDGI---NGGGIFAIGHVVQPE-VNKTPLV--PNQPH-YS 280
S +SQ + G +F++CL N G +G + QP+ + TPL+ P++P Y
Sbjct: 233 LSFLSQTKDTYG--SVFSYCLPNYRSSNFSGTLKLGPIGQPKRIKTTPLLYNPHRPSLYY 290
Query: 281 INMTAVQVGLDFLNLPTD--VFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDL 338
+NM ++VG + +P F GTIID+GT L VY + +
Sbjct: 291 VNMIGIRVGSKVVQVPQSALAFNPVTGSGTIIDAGTMFTRLAAPVYAAVRDAFRGRVRTP 350
Query: 339 KVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHE 379
+ TC+ + SV P VTF F +V++ + P E
Sbjct: 351 VAPPLGGFDTCYNVTVSV----PTVTFMFAGAVAVTL-PEE 386
>gi|281200780|gb|EFA74998.1| putative aspartyl protease [Polysphondylium pallidum PN500]
Length = 394
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 90/312 (28%), Positives = 134/312 (42%), Gaps = 52/312 (16%)
Query: 89 YYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGP 148
+ VQVDTGS +M + + C C R S YD S K V+C E C G P
Sbjct: 52 FTVQVDTGSSLMAIPMVNCNTCHDRPS-------YDPTHSQYSKVVSCFSEHCLGSGSAP 104
Query: 149 LTDCT--ANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQS 206
C A C ++ +YGDGS +G QDVV +SG FG ++
Sbjct: 105 -PQCKNRAEDDCDFVILYGDGSRVSGKIYQDVVNLSGLSGIAN---------FGANRIET 154
Query: 207 GNLDSTNEEALDGIIGFGKSNSSMI----SQLASSGGVRKMFAHCLDGINGGGIFAIGHV 262
G+ + DGI+GFG+S + + L + G++ +FA +D G G ++G +
Sbjct: 155 GDFEYPRA---DGIVGFGRSCKTCVPTVFESLVQAHGLKNIFAMSMD-YEGRGTLSLGEL 210
Query: 263 VQP----EVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAY 318
E+ TPL + P Y+I T +V D + LP + G + I+DSG++
Sbjct: 211 NPSNHIGEIQYTPLFEDGPFYNIKPTNFKVD-DTVILPR-LLG----RQVIVDSGSSALS 264
Query: 319 LPEMVYEPLVSKI---------ISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFEN 369
L Y+ LV I P ++ D C+ + S+D P + FE
Sbjct: 265 LASGAYDALVHHFRKNYCHVAGICDSP-----SILDGSICYNSASSLDL-LPTIYLTFEG 318
Query: 370 SVSLKVYPHEYL 381
V + V P YL
Sbjct: 319 GVKVAVPPKNYL 330
>gi|326516344|dbj|BAJ92327.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 464
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 106/395 (26%), Positives = 162/395 (41%), Gaps = 60/395 (15%)
Query: 15 ATAAVGGVSSNHGVFSVKYRYAGRERSLS-------LLKEHDARRQQRILAGVDLPLGGS 67
A+ + GG S +V A R SL L++ DA ++ +P+
Sbjct: 49 ASFSSGGKSRAEEAHAVLASDAARVSSLQRRIGSYGLIRSSDAASASKL---AQVPVTSG 105
Query: 68 SRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKD 127
+R + Y A +GIG + V VDT S++ WV C C C + + L+D
Sbjct: 106 ARLRTLN-YVATVGIGG--GEATVIVDTASELTWVQCEPCDACHDQ-----QEPLFDPSS 157
Query: 128 SSTGKFVTCDQEFCH------GVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQY 181
S + V C+ C G+ G D A +C Y Y DGS + G D +
Sbjct: 158 SPSYAAVPCNSSSCDALRVATGMSGQACDDQPA--ACSYTLSYRDGSYSRGVLAHD--RL 213
Query: 182 DKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQ-LASSGGV 240
D+Q +FGCG G T+ G++G G+S S+ISQ + GGV
Sbjct: 214 SLAGEDIQ------GFVFGCGTSNQGPFGGTS-----GLMGLGRSQLSLISQTMDQFGGV 262
Query: 241 RKMFAHCLDGINGG--GIFAIGHVVQPEVNKTPLV-------PNQ-PHYSINMTAVQVGL 290
F++CL G G +G N TP+V P Q P Y N+T + VG
Sbjct: 263 ---FSYCLPPKESGSSGSLVLGDDASVYRNSTPIVYTAMVSDPLQGPFYLANLTGITVGG 319
Query: 291 DFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQ---QPDLKVHTVHDEY 347
+ + P F G I+DSGT + L VY + ++ +SQ P ++ D
Sbjct: 320 EDVQSPG--FSAGGGGKAIVDSGTIITSLVPSVYAAVRAEFVSQLAEYPQAAPFSILD-- 375
Query: 348 TCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLF 382
TCF + + P++ F+ ++V L+
Sbjct: 376 TCFDLTGLREVQVPSLKLVFDGGAEVEVDSKGVLY 410
>gi|328875414|gb|EGG23778.1| putative aspartyl protease [Dictyostelium fasciculatum]
Length = 507
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 94/338 (27%), Positives = 141/338 (41%), Gaps = 46/338 (13%)
Query: 89 YYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGP 148
+ VQVDTGS +M + C C + SST V C + C G P
Sbjct: 133 FLVQVDTGSLLMAIPLEGCNTCVESRPV--------YHPSSTSTKVACSSDQCKGSGSTP 184
Query: 149 --LTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQS 206
+ ++ SC + YGDGS +GY +DVV + G FG ++
Sbjct: 185 PSCSRTSSGESCDFQIRYGDGSHVSGYIYEDVVNLAGLQGKAN---------FGANDEET 235
Query: 207 GNLDSTNEEALDGIIGFGKSNSSMI----SQLASSGGVRKMFAHCLDGINGGGIFAIGHV 262
G+ + DGIIGFG++ SS + L S G++ F L+ GGG ++G +
Sbjct: 236 GDFEYPRA---DGIIGFGRTCSSCVPTVWDSLVSDLGLKNQFGMLLN-YEGGGSLSLGEI 291
Query: 263 VQP----EVNKTPLV-PNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLA 317
++ TPLV N P YS+ T +++ D+ +P G + I+DSG+T
Sbjct: 292 NTSYYTGDIRYTPLVQKNTPFYSVKSTGIRIN-DY-TIPGSKLG----QEVIVDSGSTAL 345
Query: 318 YLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQ-----YSESVDEGFPNVTFHFENSVS 372
L Y+ L + Q + V + FQ S+ V FP + F F+ V
Sbjct: 346 SLASGAYDQLRNYF--QTHYCSIQGVCENPNIFQGSICYSSDDVLSKFPTLYFTFDGGVQ 403
Query: 373 LKVYPHEYLFPFEDLWCIGWQNSGMQSRDRKNMTLLGD 410
+ + P YL L + M R MT+LGD
Sbjct: 404 VAIPPKNYLVK-APLTNGKYGYCFMIERADSTMTILGD 440
>gi|356500374|ref|XP_003519007.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase Asp1-like
[Glycine max]
Length = 454
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 88/372 (23%), Positives = 152/372 (40%), Gaps = 49/372 (13%)
Query: 29 FSVKYRYAGRERSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKD 88
FS + R A + ++ + R+ + L G+ P +G Y + IG PPK
Sbjct: 24 FSAQPRNAKKPKT-----PYSDNNHHRLSSSAVFKLQGNVYP--LGHYTVSLNIGYPPKL 76
Query: 89 YYVQVDTGSDIMWVNC-IQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGG 147
Y + +D+GSD+ WV C CK C + LY V C + C V+
Sbjct: 77 YDLDIDSGSDLTWVQCDAPCKGCTKPRD-----QLY----KPNHNLVQCVDQLCSEVHLS 127
Query: 148 PLTDC-TANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQS 206
+C + + C Y Y D S+ G V+D + + +G + + FGCG Q
Sbjct: 128 MAYNCPSPDDPCDYEVEYADHGSSLGVLVRDYIPFQFTNGSV----VRPRVAFGCGYDQK 183
Query: 207 GNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHVVQPE 266
+ S + A G++G G +S++SQL S G +R + HCL GGG G
Sbjct: 184 YS-GSNSPPATSGVLGLGNGRASILSQLHSLGLIRNVVGHCLSA-QGGGFLFFG------ 235
Query: 267 VNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGT-----IIDSGTTLAYLPE 321
+P+ +M + + + P ++ G I DSG++ Y
Sbjct: 236 ---DDFIPSSGIVWTSMLSSSSEKHYSSGPAELVFNGKATAVKGLELIFDSGSSYTYFNS 292
Query: 322 MVYEPLVSKIISQQPDLKVHTVHDEYT---CFQYSES------VDEGFPNVTFHFENSVS 372
Y+ +V + ++ D+ + C++ ++S V + F + F+ S +
Sbjct: 293 QAYQAVVDLVTKDLKGKQLKRATDDPSLPICWKGAKSFESLSDVKKYFKPLALSFKKSXN 352
Query: 373 LKVY--PHEYLF 382
L+++ P YL
Sbjct: 353 LQMHLPPESYLI 364
>gi|296085638|emb|CBI29432.3| unnamed protein product [Vitis vinifera]
Length = 337
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 66/213 (30%), Positives = 98/213 (46%), Gaps = 22/213 (10%)
Query: 51 RRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKEC 110
++ R V +PL + G G YY K+G G+P + Y + VDTGS + W +QCK C
Sbjct: 94 KKDIRFPKSVSVPLNPGAS-IGSGNYYVKVGFGSPARYYSMIVDTGSSLSW---LQCKPC 149
Query: 111 PRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTS---CPYLEIYGDG 167
+ + L+D S T K ++C C + L + TS C Y YGD
Sbjct: 150 VVYCHVQAD-PLFDPSASKTYKSLSCTSSQCSSLVDATLNNPLCETSSNVCVYTASYGDS 208
Query: 168 SSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSN 227
S + GY QD++ L + T ++GCG G GI+G G++
Sbjct: 209 SYSMGYLSQDLL-------TLAPSQTLPGFVYGCGQDSDGLFGRA-----AGILGLGRNK 256
Query: 228 SSMISQLASSGGVRKMFAHCLDGINGGGIFAIG 260
SM+ Q++S G F++CL GGG +IG
Sbjct: 257 LSMLGQVSSKFGY--AFSYCLPTRGGGGFLSIG 287
>gi|242051593|ref|XP_002454942.1| hypothetical protein SORBIDRAFT_03g001790 [Sorghum bicolor]
gi|241926917|gb|EES00062.1| hypothetical protein SORBIDRAFT_03g001790 [Sorghum bicolor]
Length = 431
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 100/391 (25%), Positives = 154/391 (39%), Gaps = 52/391 (13%)
Query: 39 ERSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSD 98
RS K AR + R+ + +PL S Y IGIGTPP+ + + DT SD
Sbjct: 58 RRSARASKARVARLEARLTGDMSVPLARISDEG----YTVTIGIGTPPQLHTLIADTASD 113
Query: 99 IMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSC 158
+ W C + ++ L+D SS+ FVTC + C P T +N +C
Sbjct: 114 LTWTQCNLFNDTAKQVE-----PLFDPAKSSSFAFVTCSSKLC--TEDNPGTKRCSNKTC 166
Query: 159 ----PYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNE 214
PY+ + G V+ Y+ + S FGCGA GNL +
Sbjct: 167 RYVYPYVSVEAAG----------VLAYESFTLSDNNQHICMSFGFGCGALTDGNLLGAS- 215
Query: 215 EALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL----DGINGGGIFAIGHVVQPEVNKT 270
GI+G + SM+SQLA F++CL D + F +
Sbjct: 216 ----GILGMSPAILSMVSQLAI-----PKFSYCLTPYTDRKSSPLFFGAWADLGRYKTTG 266
Query: 271 PLVPNQP-HYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVS 329
P+ + +Y + + + +G L++P F + GT++D G T+ L E + L
Sbjct: 267 PIQKSLTFYYYVPLVGLSLGTRRLDVPAATFAL-KQGGTVVDLGCTVGQLAEPAFTALKE 325
Query: 330 KII-SQQPDLKVHTVHDEYTCFQYSESVDEGF---PNVTFHFENSVSLKVYPHEYLF--P 383
++ + L TV D CF V G P + +F+ + V P + F P
Sbjct: 326 AVLHTLNLPLTNRTVKDYKVCFALPSGVAMGAVQTPPLVLYFDGGADM-VLPRDNYFQEP 384
Query: 384 FEDLWCIGWQNSGMQS----RDRKNMTLLGD 410
L C+ G S ++N LL D
Sbjct: 385 TAGLMCLALVPGGGMSIIGNVQQQNFHLLFD 415
>gi|242091325|ref|XP_002441495.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
gi|241946780|gb|EES19925.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
Length = 466
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 85/350 (24%), Positives = 140/350 (40%), Gaps = 32/350 (9%)
Query: 58 AGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLG 117
+ V LP+ + G G Y+ K+ +GTP +++ + DTGSD+ WV C R
Sbjct: 99 SAVSLPMSSGAY-SGTGQYFVKLRVGTPVQEFTLVADTGSDLTWVKCAGASPPGR----- 152
Query: 118 IELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTS-CPYLEIYGDGSSTTGYFVQ 176
++ K S + + C + C L +C++ S C Y Y +GS+ V
Sbjct: 153 ----VFRPKTSRSWAPIPCSSDTCKLDVPFTLANCSSPASPCTYDYRYKEGSAGARGIVG 208
Query: 177 DVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLAS 236
+ G + ++ GC S + D + + DG++ G + S +Q A+
Sbjct: 209 TESATIALPGG--KVAQLKDVVLGC----SSSHDGQSFRSADGVLSLGNAKISFATQAAA 262
Query: 237 SGGVRKMFAHCL------DGINGGGIFAIGHVVQPEVNKTPLV--PNQPHYSINMTAVQV 288
G F++CL G F G V + +T L P P Y + + A+ V
Sbjct: 263 RFG--GSFSYCLVDHLAPRNATGYLAFGPGQVPRTPATQTKLFLDPEMPFYGVKVDAIHV 320
Query: 289 GLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLV---SKIISQQPDLKVHTVHD 345
L++P +V+ + G I+DSG TL L Y+ +V SK + P +
Sbjct: 321 AGKALDIPAEVWD-AKSGGVILDSGNTLTVLAAPAYKAVVAALSKHLDGVPKVSFPPFEH 379
Query: 346 EYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFED-LWCIGWQN 394
Y E P + F S L+ Y+ + + CIG Q
Sbjct: 380 CYNWTARRPGAPEIIPKLAVQFAGSARLEPPAKSYVIDVKPGVKCIGVQE 429
>gi|45735845|dbj|BAD12880.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735971|dbj|BAD13000.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
Length = 333
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 86/316 (27%), Positives = 138/316 (43%), Gaps = 35/316 (11%)
Query: 80 IGIGTPPKDYYVQVDTGSDIMWVNCIQC-KECPRRSSLGIELTLYDIKDSSTGKFVTCDQ 138
+G+GTP Y + VDTGS + W+ C C C R+S +++ K SST V C
Sbjct: 1 MGLGTPATQYVMVVDTGSSLTWLQCSPCLVSCHRQSG-----PVFNPKSSSTYASVGCSA 55
Query: 139 EFCHGVYGGPL--TDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGS 196
+ C + L + C+++ C Y YGD S + GY +D V + S +
Sbjct: 56 QQCSDLPSATLNPSACSSSNVCIYQASYGDSSFSVGYLSKDTVSFGSTSLP--------N 107
Query: 197 LIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGI 256
+GCG G + G+IG ++ S++ QLA S G F +CL + G
Sbjct: 108 FYYGCGQDNEGLFGRS-----AGLIGLARNKLSLLYQLAPSLGYS--FTYCLPSSSSSGY 160
Query: 257 FAIGHVVQPEVNKTPLVPNQ---PHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSG 313
++G + + TP+V + Y I ++ + V + L + + TIIDSG
Sbjct: 161 LSLGSYNPGQYSYTPMVSSSLDDSLYFIKLSGMTVAGNPL---SVSSSAYSSLPTIIDSG 217
Query: 314 TTLAYLPEMVYEPL---VSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENS 370
T + LP VY L V+ + +++ D TCF+ ++ P VT F
Sbjct: 218 TVITRLPTSVYSALSKAVAAAMKGTSRASAYSILD--TCFK-GQASRVSAPAVTMSFAGG 274
Query: 371 VSLKVYPHEYLFPFED 386
+LK+ L +D
Sbjct: 275 AALKLSAQNLLVDVDD 290
>gi|212722026|ref|NP_001131674.1| uncharacterized protein LOC100193034 precursor [Zea mays]
gi|194692214|gb|ACF80191.1| unknown [Zea mays]
gi|413946454|gb|AFW79103.1| hypothetical protein ZEAMMB73_752316 [Zea mays]
Length = 441
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 99/390 (25%), Positives = 162/390 (41%), Gaps = 47/390 (12%)
Query: 30 SVKYRYAG-RERSLSLLKEHDARR--QQRILA------GVDLPLGGSSRPDGVGLYYAKI 80
SV R G R R + + +RR +QR+ A V LP+ + G G Y+ K+
Sbjct: 37 SVTARARGDRRRHAYISAQLPSRRGGRQRVAAEVASSSAVSLPMSSGAY-AGTGQYFVKV 95
Query: 81 GIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEF 140
+GTP +++ + DTGS++ WV C P G+ ++ + S + V C +
Sbjct: 96 LVGTPAQEFTLVADTGSELTWVKCAGGASPP-----GL---VFRPEASKSWAPVPCSSDT 147
Query: 141 CHGVYGGPLTDCTANTS-CPYLEIYGDGSS-TTGYFVQDVVQYDKVSGDLQTTSTNGSLI 198
C L +C+++ S C Y Y +GS+ G D G + ++
Sbjct: 148 CKLDVPFSLANCSSSASPCSYDYRYKEGSAGALGVVGTDSATIALPGGKVAQLQ---DVV 204
Query: 199 FGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL------DGIN 252
GC S D + +++DG++ G + S S+ A+ G F++CL
Sbjct: 205 LGC----SSTHDGQSFKSVDGVLSLGNAKISFASRAAARFG--GSFSYCLVDHLAPRNAT 258
Query: 253 GGGIFAIGHVVQPEVNKTPLV--PNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNK--GT 308
G F G V + +T L P P Y + + AV V L++P +V+ D K G
Sbjct: 259 GYLAFGPGQVPRTPATQTKLFLDPAMPFYGVKVDAVHVAGQALDIPAEVW---DPKSGGV 315
Query: 309 IIDSGTTLAYLPEMVYEPLVS---KIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTF 365
I+DSGTTL L Y+ +V+ K+++ P + Y E P +
Sbjct: 316 ILDSGTTLTVLATPAYKAVVAALTKLLAGVPKVDFPPFEHCYNWTAPRPGAPE-IPKLAV 374
Query: 366 HFENSVSLKVYPHEYLFPFED-LWCIGWQN 394
F L+ Y+ + + CIG Q
Sbjct: 375 QFTGCARLEPPAKSYVIDVKPGVKCIGLQE 404
>gi|222637180|gb|EEE67312.1| hypothetical protein OsJ_24552 [Oryza sativa Japonica Group]
Length = 420
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 79/272 (29%), Positives = 116/272 (42%), Gaps = 39/272 (14%)
Query: 71 DGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSST 130
+GVG Y I +GTP + V DTGSD++W C C +C ++ + + SST
Sbjct: 81 NGVGGYNMNISVGTPLLTFSVVADTGSDLIWTQCAPCTKCFQQPA-----PPFQPASSST 135
Query: 131 GKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQT 190
+ C FC + C A T C Y YG G T GY + ++ S
Sbjct: 136 FSKLPCTSSFCQ-FLPNSIRTCNA-TGCVYNYKYGSG-YTAGYLATETLKVGDASFP--- 189
Query: 191 TSTNGSLIFGCGARQS-GNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLD 249
S+ FGC G LD +G G+ + + S S+ G + L
Sbjct: 190 -----SVAFGCSTENGLGQLD----------LGVGRFSYCLRS--GSAAGASPILFGSLA 232
Query: 250 GINGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDN---K 306
+ G + + V P V+ + +Y +N+T + VG L + T FG N
Sbjct: 233 NLTDGNVQSTPFVNNPAVHPS-------YYYVNLTGITVGETDLPVTTSTFGFTQNGLGG 285
Query: 307 GTIIDSGTTLAYLPEMVYEPLVSKIISQQPDL 338
GTI+DSGTTL YL + YE + +SQ D+
Sbjct: 286 GTIVDSGTTLTYLAKDGYEMVKQAFLSQTADV 317
>gi|226509408|ref|NP_001141440.1| uncharacterized protein LOC100273550 precursor [Zea mays]
gi|194704586|gb|ACF86377.1| unknown [Zea mays]
gi|413938617|gb|AFW73168.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 478
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 97/349 (27%), Positives = 145/349 (41%), Gaps = 46/349 (13%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y +GTP ++VDTGSD+ WV C C P S + L+D SS+ V
Sbjct: 140 YVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCAAAP--SCYSQKDPLFDPAQSSSYAAVP 197
Query: 136 CDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNG 195
C C G+ G + C Y+ YGDGS+TTG + D + L +S
Sbjct: 198 CGGPVCAGL-GIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLT-------LSASSAVQ 249
Query: 196 SLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASS-GGVRKMFAHCLD-GING 253
FGCG QSG + +DG++G G+ S++ Q A + GGV F++CL +
Sbjct: 250 GFFFGCGHAQSGLFN-----GVDGLLGLGREQPSLVEQTAGTYGGV---FSYCLPTKPST 301
Query: 254 GGIFAIG----HVVQPEVNKTPLV--PNQP-HYSINMTAVQVGLDFLNLPTDVFGVGDNK 306
G +G P + T L+ PN P +Y + +T + VG L++P F G
Sbjct: 302 AGYLTLGVGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTVV 361
Query: 307 GTIIDSGTTLAYLPEMVYEPLVSKIISQQ-----PDLKVHTVHDEYTCFQYSESVDEGFP 361
T T + LP Y L S S P + + D TC+ ++ P
Sbjct: 362 DTG----TVVTRLPPTAYAALRSAFRSGMASYGYPTAPSNGILD--TCYNFAGYGTVTLP 415
Query: 362 NVTFHFENSVSLKVYPHEYLFPFEDLWCIGWQNSGMQSRDRKNMTLLGD 410
NV F + ++ + L C+ + SG M +LG+
Sbjct: 416 NVALTFGSGATVTLGADGIL----SFGCLAFAPSG----SDGGMAILGN 456
>gi|242085924|ref|XP_002443387.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
gi|241944080|gb|EES17225.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
Length = 460
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 78/280 (27%), Positives = 119/280 (42%), Gaps = 39/280 (13%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y A+ IG PP+ +DTGS+++W QC C G +LT YD S T K V
Sbjct: 84 YIAEYLIGDPPQQAAAIIDTGSNLIWT---QCSTCRANGCFGQDLTFYDPSRSRTAKPVA 140
Query: 136 CDQEFCHGVYGGPLTDCTAN-TSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTN 194
C+ C G T C + +C L YG G + G+ +V + G Q++ N
Sbjct: 141 CNDTAC---LLGSETRCARDGKACAVLTAYGAG-AIGGFLGTEVFTF----GHGQSSENN 192
Query: 195 GSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL-----D 249
SL FGC + L + + GIIG G+ S+ SQL + F++CL D
Sbjct: 193 VSLAFGC--ITASRLTPGSLDGASGIIGLGRGKLSLPSQLGDN-----KFSYCLTPYFSD 245
Query: 250 GINGGGIFA----IGHVVQPEVNKTPLVPN------QPHYSINMTAVQVGLDFLNLPTDV 299
N +F P + N Y + +T + VG L++P
Sbjct: 246 AANTSTLFVGASAGLSGGGAPATSVPFLKNPDDDPFDSFYYLPLTGITVGTAKLDVPAAA 305
Query: 300 FGVGDNK-----GTIIDSGTTLAYLPEMVYEPLVSKIISQ 334
F + + GT+IDSG+ L ++ Y+ L +++ Q
Sbjct: 306 FDLREVAPAKWGGTLIDSGSPFTSLIDVAYQALRDELVRQ 345
>gi|297805038|ref|XP_002870403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316239|gb|EFH46662.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 440
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 101/339 (29%), Positives = 140/339 (41%), Gaps = 42/339 (12%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKF 133
G Y I +GTPP DTGSD++W C C +C + L+D K SST K
Sbjct: 92 GEYLMNISLGTPPFPIMAIADTGSDLLWTQCKPCDDCYTQVD-----PLFDPKASSTYKD 146
Query: 134 VTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTST 193
V+C C + NT C Y YGD S T G D + L +T T
Sbjct: 147 VSCSSSQCTALENQASCSTEDNT-CSYSTSYGDRSYTKGNIAVDTLT-------LGSTDT 198
Query: 194 N----GSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLD 249
++I GCG +G + + GI+G G S+I+QL S + F++CL
Sbjct: 199 RPVQLKNIIIGCGHNNAGTFN----KKGSGIVGLGGGAVSLITQLGDS--IDGKFSYCLV 252
Query: 250 GINGGGI------FAIGHVVQPE-VNKTPLVPN--QPHYSINMTAVQVGLDFLNLPTDVF 300
+ F VV V TPL+ + Y + + ++ VG + P
Sbjct: 253 PLTSENDRTSKINFGTNAVVSGTGVVSTPLIAKSQETFYYLTLKSISVGSKEVQYPGSDS 312
Query: 301 GVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQ--YSESVDE 358
G G+ IIDSGTTL LP Y L + S + D T YS + D
Sbjct: 313 GSGEG-NIIIDSGTTLTLLPTEFYSELEDAVASS---IDAEKKQDPQTGLSLCYSATGDL 368
Query: 359 GFPNVTFHFENS-VSLKVYPHEYLFPF-EDLWCIGWQNS 395
P +T HF+ + V+LK P EDL C ++ S
Sbjct: 369 KVPAITMHFDGADVNLK--PSNCFVQISEDLVCFAFRGS 405
>gi|356528671|ref|XP_003532923.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 440
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 86/342 (25%), Positives = 147/342 (42%), Gaps = 31/342 (9%)
Query: 38 RERSLSLLKEHDARRQQRILAGVDLPLGGSSRPD-GVGLYYAKIGIGTPPKDYYVQVDTG 96
R ++ L ++R+ R+ D G + PD + Y + IGTPP + + DTG
Sbjct: 53 RIKNTVLRSFARSKRRLRLSQNDDRSPGTITIPDEPITEYLMRFYIGTPPVERFAIADTG 112
Query: 97 SDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANT 156
SD++WV C C++C +++ L+D + SST K V CD + C + +
Sbjct: 113 SDLIWVQCAPCEKCVPQNA-----PLFDPRKSSTFKTVPCDSQPCTLLPPSQRACVGKSG 167
Query: 157 SCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEA 216
C Y IYGD + +G + + + G L FGC + +D +
Sbjct: 168 QCYYQYIYGDHTLVSGILGFESINF----GSKNNAIKFPKLTFGCTFSNNDTVDESKRNM 223
Query: 217 LDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGI----FAIGHVVQ--PEVNKT 270
G++G G S+ISQL G + F++C ++ F +V+ V T
Sbjct: 224 --GLVGLGVGPLSLISQLGYQIG--RKFSYCFPPLSSNSTSKMRFGNDAIVKQIKGVVST 279
Query: 271 PLVPNQ---PHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPL 327
PL+ +Y +N+ V +G N + +IDSGT+ L + Y
Sbjct: 280 PLIIKSIGPSYYYLNLEGVSIG----NKKVKTSESQTDGNILIDSGTSFTILKQSFYNKF 335
Query: 328 VS--KIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHF 367
V+ K + +K+ + + CF+ ++ + FP+V F F
Sbjct: 336 VALVKEVYGVEAVKIPPLVYNF-CFE-NKGKRKRFPDVVFLF 375
>gi|357123876|ref|XP_003563633.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 503
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 92/346 (26%), Positives = 144/346 (41%), Gaps = 56/346 (16%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQC-KECPRRSSLGIELTLYDIKDSST 130
G Y IG+GTPP + V DTGSD WV C C C ++ L+D SST
Sbjct: 159 GTANYVVPIGLGTPPSRFTVVFDTGSDTTWVQCRPCVVSCYKQKD-----RLFDPAKSST 213
Query: 131 GKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQD--VVQYDKVSGDL 188
V+C C + + C A C Y YGDGS T G+F +D V D + G
Sbjct: 214 YANVSCADPACADL---DASGCNAG-HCLYGIQYGDGSYTVGFFAKDTLAVAQDAIKG-- 267
Query: 189 QTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL 248
FGCG + G T G++G G+ +S+ Q G F++CL
Sbjct: 268 --------FKFGCGEKNRGLFGQTA-----GLLGLGRGPTSITVQAYEKYG--GSFSYCL 312
Query: 249 DGINGGGIFAIGHV---------VQPEVNKTPLVPNQ--PHYSINMTAVQVGLDFLN-LP 296
+ A G++ TP++ ++ Y + +T ++VG L +P
Sbjct: 313 PASSA----ATGYLEFGPLSPSSSGSNAKTTPMLTDKGPTFYYVGLTGIRVGGKQLGAIP 368
Query: 297 TDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQ-----PDLKVHTVHDEYTCFQ 351
VF N GT++DSGT + LP+ Y L S + +++ D TC+
Sbjct: 369 ESVF---SNSGTLVDSGTVITRLPDTAYAALSSAFAAAMAASGYKKAAAYSILD--TCYD 423
Query: 352 YSESVDEGFPNVTFHFENSVSLKVYPHEYLFPF-EDLWCIGWQNSG 396
++ P V+ F+ L + ++ + C+G+ ++G
Sbjct: 424 FTGLSQVSLPTVSLVFQGGACLDLDASGIVYAISQSQVCLGFASNG 469
>gi|413938607|gb|AFW73158.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 478
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 97/349 (27%), Positives = 145/349 (41%), Gaps = 46/349 (13%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y +GTP ++VDTGSD+ WV C C P S + L+D SS+ V
Sbjct: 140 YVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCSAAP--SCYSQKDPLFDPAQSSSYAAVP 197
Query: 136 CDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNG 195
C C G+ G + C Y+ YGDGS+TTG + D + L +S
Sbjct: 198 CGGPVCAGL-GIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTL-------TLSASSAVQ 249
Query: 196 SLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASS-GGVRKMFAHCLD-GING 253
FGCG QSG + +DG++G G+ S++ Q A + GGV F++CL +
Sbjct: 250 GFFFGCGHAQSGLFN-----GVDGLLGLGREQPSLVEQTAGTYGGV---FSYCLPTKPST 301
Query: 254 GGIFAIG----HVVQPEVNKTPLV--PNQP-HYSINMTAVQVGLDFLNLPTDVFGVGDNK 306
G +G P + T L+ PN P +Y + +T + VG L++P F G
Sbjct: 302 AGYLTLGVGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTVV 361
Query: 307 GTIIDSGTTLAYLPEMVYEPLVSKIISQQ-----PDLKVHTVHDEYTCFQYSESVDEGFP 361
T T + LP Y L S S P + + D TC+ ++ P
Sbjct: 362 DTG----TVVTRLPPTAYAALRSAFRSGMASYGYPTAPSNGILD--TCYNFAGYGTVTLP 415
Query: 362 NVTFHFENSVSLKVYPHEYLFPFEDLWCIGWQNSGMQSRDRKNMTLLGD 410
NV F + ++ + L C+ + SG M +LG+
Sbjct: 416 NVALTFGSGATVTLGADGIL----SFGCLAFAPSG----SDGGMAILGN 456
>gi|356527532|ref|XP_003532363.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 429
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 100/382 (26%), Positives = 144/382 (37%), Gaps = 53/382 (13%)
Query: 22 VSSNHGVFSVKYRYAGRERSLSLLKEHDARRQQRIL----AGVDLPLGGSSRPDGVGLYY 77
+SS F +GR S+L + R+L + + LPL G+ P VG Y
Sbjct: 16 MSSCSAWFGGNKHKSGRN---SILPSEATSSRSRLLNPAGSSIVLPLYGNVYP--VGFYN 70
Query: 78 AKIGIGTPPKDYYVQVDTGSDIMWVN----CIQCKECPRRSSLGIELTLYDIKDSSTGKF 133
+ IG P + Y++ VDTGSD+ W+ C C E P LY + F
Sbjct: 71 VTLNIGQPARPYFLDVDTGSDLTWLQCDAPCTHCSETPH--------PLY----RPSNDF 118
Query: 134 VTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTST 193
V C C + +C C Y Y D ST G + DV + +G
Sbjct: 119 VPCRDPLCASLQPTEDYNCEHPDQCDYEINYADQYSTFGVLLNDVYLLNFTNG----VQL 174
Query: 194 NGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGING 253
+ GCG Q + S + +G GK +S+ISQL S G VR + HCL G
Sbjct: 175 KVRMALGCGYDQVFSPSSYHPLDGLLGLGRGK--ASLISQLNSQGLVRNVIGHCLSAQGG 232
Query: 254 GGIFAIGHVVQPEVNKTPLVP-NQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDS 312
G IF V TP+ + HYS + G GVG + + D+
Sbjct: 233 GYIFFGNAYDSARVTWTPISSVDSKHYSAGPAELVFG-------GRKTGVG-SLTAVFDT 284
Query: 313 GTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTC---------FQYSESVDEGFPNV 363
G++ Y Y+ L+S + + + D+ T F V + F V
Sbjct: 285 GSSYTYFNSHAYQALLSWLKKELSGKPLKVAPDDQTLPLCWHGKRPFTSLREVRKYFKPV 344
Query: 364 TFHFEN----SVSLKVYPHEYL 381
F N ++ P YL
Sbjct: 345 ALGFTNGGRTKAQFEILPEAYL 366
>gi|330794218|ref|XP_003285177.1| hypothetical protein DICPUDRAFT_96947 [Dictyostelium purpureum]
gi|325084898|gb|EGC38316.1| hypothetical protein DICPUDRAFT_96947 [Dictyostelium purpureum]
Length = 817
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 94/347 (27%), Positives = 150/347 (43%), Gaps = 52/347 (14%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWV---NCIQCKECPRRSSL----GIELTLYDIKDS 128
Y+ I +GTPP+ + VQVDTGS + V NC K ++S G LY +++S
Sbjct: 205 YFIPILVGTPPQMFTVQVDTGSTSLAVPGSNCYLYKSQSIKTSCSCSDGNLDGLYSLEES 264
Query: 129 STGKFVTC-DQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVS-- 185
+ + C D C+ + +N CP++ YGDGS G V D V +
Sbjct: 265 ISSNQLNCSDTSNCNTC-----KNNKSNKPCPFVLKYGDGSFIAGSLVIDHVTIGDFTVP 319
Query: 186 ---GDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFG------KSNSSMISQLAS 236
G++Q S + S + C + Q + DGI+G + + S++ +
Sbjct: 320 AKFGNIQKESLSFSQL-TCPSTQRS------QAVRDGILGLSFQQLDPDNGDDIFSKIVA 372
Query: 237 SGGVRKMFAHCLDGINGGGIFAIG----HVVQPEVNKTPLVPNQPHYSINMTAVQVGLDF 292
+ +F+ CL GG+ IG H+ Q TP+ + +YSI +T + VG D
Sbjct: 373 HYNIPNVFSMCLG--KDGGLLTIGGTNDHITQETPKYTPIFDSH-YYSITVTNIYVGNDS 429
Query: 293 LNL-PTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLK--VHTVHDEYTC 349
LNL P D+ +I+DSGTTL Y + ++ +V + + +L + E C
Sbjct: 430 LNLAPPDL------STSIVDSGTTLLYFSDEIFYSIVRNLEEKHCELPGICNDPFWEGNC 483
Query: 350 FQYSESVDEGFPNVTFHF-----ENSVSLKVYPHEYLFPFEDLWCIG 391
E + +P + E S L+V P Y L+C G
Sbjct: 484 HHLEEKLISEYPTIYLEMKGMNGEPSFKLEVPPDLYFLNINGLYCFG 530
>gi|224136436|ref|XP_002322329.1| predicted protein [Populus trichocarpa]
gi|222869325|gb|EEF06456.1| predicted protein [Populus trichocarpa]
Length = 486
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 99/359 (27%), Positives = 155/359 (43%), Gaps = 72/359 (20%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNC----IQCKECP--RRSSLGIELTLYDIKDSS 129
Y + IGTPP+ V +DTGSD+ WV C C EC R + L + + SS
Sbjct: 82 YLISLNIGTPPQVIQVLMDTGSDLTWVPCGNLSFDCMECDDYRNNKL---MATFSPSYSS 138
Query: 130 TGKFVTCDQEFCHGVYGG--PLTDCTA-------------NTSCP-YLEIYGDGSSTTGY 173
+ +C FC ++ PL CT + CP + YG G TG
Sbjct: 139 SSYRASCASPFCIDIHSSDNPLDTCTVAGCSLSTLVKATCSRPCPSFAYTYGAGGVVTGI 198
Query: 174 FVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQ 233
+D ++ + S + FGC + S E + GI GFG+ SM+SQ
Sbjct: 199 LTRDTLRVNGSSPGVAKEIPK--FCFGC-------VGSAYREPI-GIAGFGRGTLSMVSQ 248
Query: 234 LASSGGVRKMFAHCL------DGINGGGIFAIGHVV---------QPEVNKTPLVPNQPH 278
L G ++K F+HC + N +G + P +N +P+ PN
Sbjct: 249 L---GFLQKGFSHCFLAFKYANNPNISSPLVVGDIALTSKDDMQFTPMLN-SPMYPN--F 302
Query: 279 YSINMTAVQVG-LDFLNLPTDV--FGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIIS-- 333
Y + + A+ VG + +P+ + F N G IDSGTT +LPE Y ++S + S
Sbjct: 303 YYVGLEAITVGNVSATEVPSSLREFDSLGNGGMKIDSGTTYTHLPEPFYSQVLSILQSTI 362
Query: 334 ---QQPDLKVHTVHDEYTCFQYSE------SVDEGFPNVTFHFENSVSLKVYPHEYLFP 383
+ +++ T D C++ + D+ P++TFHF N+VSL + + +P
Sbjct: 363 NYPRDTGMEMQTGFD--LCYKVPRPNNNTLTSDDLLPSITFHFLNNVSLVLPQGNHFYP 419
>gi|359476193|ref|XP_003631802.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Vitis vinifera]
Length = 496
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 84/322 (26%), Positives = 128/322 (39%), Gaps = 54/322 (16%)
Query: 71 DGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSST 130
D G + + GTPP+ + + +DTGS I W C C C + S +D S T
Sbjct: 157 DEDGNFLVDVAFGTPPQKFTLILDTGSSITWTQCKPCVRCLKASR-----RHFDPSASLT 211
Query: 131 GKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQT 190
+C + NT Y YGD S++ G + D + L+
Sbjct: 212 YSLGSC------------IPSTVGNT---YNMTYGDKSTSVGNYGCDTMT-------LEH 249
Query: 191 TSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL-- 248
+ FGCG G+ S DG++G G+ S +SQ AS +K+F++CL
Sbjct: 250 SDVFPKFQFGCGRNNEGDFGS----GADGMLGLGQGQLSTVSQTASK--FKKVFSYCLPE 303
Query: 249 -DGINGGGIFAIGHVVQPEVNKTPLVPNQP---------HYSINMTAVQVGLDFLNLPTD 298
D I G +F Q K + N P +Y + + + VG LN+P+
Sbjct: 304 EDSI-GSLLFGEKATSQSSSLKFTSLVNGPGTSGLEESGYYFVKLLDISVGNKRLNIPSS 362
Query: 299 VFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEY-----TCFQYS 353
VF + GTIIDSGT + LP+ Y L + + + TC+ S
Sbjct: 363 VFA---SPGTIIDSGTVITRLPQRAYSALKAAFKKAMAKYPLSNGRRKKGDILDTCYNLS 419
Query: 354 ESVDEGFPNVTFHFENSVSLKV 375
D P + HF +++
Sbjct: 420 GRKDVLLPEIVLHFGEGADVRL 441
>gi|225427558|ref|XP_002266614.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 444
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 92/340 (27%), Positives = 141/340 (41%), Gaps = 45/340 (13%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y I +GTPP DTGSD++W C+ C C + L+D K+S T
Sbjct: 90 GGGAYLMNISLGTPPVPMLGIADTGSDLIWRQCLPCPNCYEQVE-----PLFDPKESETY 144
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
K + CD EFC + G C + +C Y YGD S T G D + GD
Sbjct: 145 KTLDCDNEFCQDL--GQQGSCDDDNTCTYSYSYGDRSYTRGDLSSDTLTIGSTEGD---P 199
Query: 192 STNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI 251
++ + FGCG G + + + S++ QL+S G + F++CL +
Sbjct: 200 ASFPGIAFGCGHDNGGTFNEKDGGLIGLG----GGPLSLVMQLSSEVGGQ--FSYCLVPL 253
Query: 252 NGGGIFA-------IGHVVQPEVNKTPLVPNQP--HYSINMTAVQVGLDFLNLPTDVFGV 302
+ + G V TPL+ P Y + + + VG + + G
Sbjct: 254 SSDSTVSSKINFGKSGVVSGSGTVSTPLIKGTPDTFYYLTLEGLSVGSETVAFK----GF 309
Query: 303 GDNKGT---------IIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQ-- 351
+NK + IIDSGTTL LP+ Y + S + + + T D F
Sbjct: 310 SENKSSPAAVEEGNIIIDSGTTLTLLPQDFYTDVESALTNA---IGGQTTTDPNGIFSLC 366
Query: 352 YSESVDEGFPNVTFHFENSVSLKVYP-HEYLFPFEDLWCI 390
YS + P +T HF + +++ P + ++ EDL C
Sbjct: 367 YSSVNNLEIPTITAHFTGA-DVQLPPLNTFVQVQEDLVCF 405
>gi|413938615|gb|AFW73166.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 386
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 98/349 (28%), Positives = 147/349 (42%), Gaps = 46/349 (13%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y +GTP ++VDTGSD+ WV C C P S + L+D SS+ V
Sbjct: 48 YVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCAAAP--SCYSQKDPLFDPAQSSSYAAVP 105
Query: 136 CDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNG 195
C C G+ G + C Y+ YGDGS+TTG + D + L +S
Sbjct: 106 CGGPVCAGL-GIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLT-------LSASSAVQ 157
Query: 196 SLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASS-GGVRKMFAHCL-DGING 253
FGCG QSG + +DG++G G+ S++ Q A + GGV F++CL +
Sbjct: 158 GFFFGCGHAQSGLFN-----GVDGLLGLGREQPSLVEQTAGTYGGV---FSYCLPTKPST 209
Query: 254 GGIFAIG----HVVQPEVNKTPLV--PNQP-HYSINMTAVQVGLDFLNLPTDVFGVGDNK 306
G +G P + T L+ PN P +Y + +T + VG L++P F G
Sbjct: 210 AGYLTLGVGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTVV 269
Query: 307 GTIIDSGTTLAYLPEMVYEPLVSKIISQQ-----PDLKVHTVHDEYTCFQYSESVDEGFP 361
D+GT + LP Y L S S P + + D TC+ ++ P
Sbjct: 270 ----DTGTVVTRLPPTAYAALRSAFRSGMASYGYPTAPSNGILD--TCYNFAGYGTVTLP 323
Query: 362 NVTFHFENSVSLKVYPHEYLFPFEDLWCIGWQNSGMQSRDRKNMTLLGD 410
NV F + ++ + L C+ + SG M +LG+
Sbjct: 324 NVALTFGSGATVTLGADGIL----SFGCLAFAPSG----SDGGMAILGN 364
>gi|359476204|ref|XP_002262813.2| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Vitis vinifera]
Length = 460
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 80/319 (25%), Positives = 127/319 (39%), Gaps = 50/319 (15%)
Query: 71 DGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSST 130
D G + + GTP + + +DTGS I W C C C + S+ +D SST
Sbjct: 123 DEDGNFLVDVAFGTPXTEIXLILDTGSSITWTQCKACVNCLQDSN-----RYFDSSASST 177
Query: 131 GKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQT 190
F +C + N Y YGD S++ G + D + L+
Sbjct: 178 YSFGSC------------IPSTVENN---YNMTYGDDSTSVGNYGCDTMT-------LEP 215
Query: 191 TSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDG 250
+ FGCG G+ S +DG++G G+ S +SQ AS K+F++CL
Sbjct: 216 SDVFQKFQFGCGRNNKGDFGS----GVDGMLGLGQGQLSTVSQTASK--FNKVFSYCLPE 269
Query: 251 INGGGIFAIGHVVQPE---------VNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFG 301
+ G G + VN + +Y +N++ + VG + LN+P+ VF
Sbjct: 270 EDSIGSLLFGEKATSQSSSLKFTSLVNGPGTLQESGYYFVNLSDISVGNERLNIPSSVFA 329
Query: 302 VGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEY-----TCFQYSESV 356
+ GTIIDS T + LP+ Y L + + + TC+ S
Sbjct: 330 ---SPGTIIDSRTVITRLPQRAYSALKAAFKKAMAKYPLSNGRRKKGDILDTCYNLSGRK 386
Query: 357 DEGFPNVTFHFENSVSLKV 375
D P + HF +++
Sbjct: 387 DVLLPEIVLHFGGGADVRL 405
>gi|242045564|ref|XP_002460653.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
gi|241924030|gb|EER97174.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
Length = 525
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 88/321 (27%), Positives = 138/321 (42%), Gaps = 47/321 (14%)
Query: 33 YRYAGRERSLSLLKEHDARR--QQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYY 90
YR A R + RR +R++A V+ S G G Y + +GTPP+ +
Sbjct: 111 YRRAARSGGGRMPASSSPRRALSERMVATVE-----SGVAVGSGEYLMDVYVGTPPRRFR 165
Query: 91 VQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLT 150
+ +DTGSD+ W+ C C +C + ++D SS+ + VTC C V P
Sbjct: 166 MIMDTGSDLNWLQCAPCLDCFEQRG-----PVFDPAASSSYRNVTCGDHRCGHVAPPPEP 220
Query: 151 DCTANTS--------CPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCG 202
+ ++ + CPY YGD S+TTG + + ++ + +G ++FGCG
Sbjct: 221 EASSPRTCRRPGEDPCPYYYWYGDQSNTTGDLALESFTVN-LTAPGASRRVDG-VVFGCG 278
Query: 203 ARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL----DGINGGGIF- 257
R G G+ S SQL + G F++CL + +F
Sbjct: 279 HRNRGLFHGAAGLLGL-----GRGPLSFASQLRAVYG--HTFSYCLVDHGSDVGSKVVFG 331
Query: 258 ----AIGHVVQPEVNKTPL-------VPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDN- 305
A+ P++ T P Y + + V VG + LN+ +D + VG +
Sbjct: 332 EDDDALALAAHPQLKYTAFAPASSSSSPADTFYYVKLKGVLVGGELLNISSDTWDVGKDG 391
Query: 306 -KGTIIDSGTTLAYLPEMVYE 325
GTIIDSGTTL+Y E Y+
Sbjct: 392 SGGTIIDSGTTLSYFVEPAYQ 412
>gi|15234606|ref|NP_194732.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|4938479|emb|CAB43838.1| putative protein [Arabidopsis thaliana]
gi|7269903|emb|CAB80996.1| putative protein [Arabidopsis thaliana]
gi|67633774|gb|AAY78811.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332660310|gb|AEE85710.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 424
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 83/309 (26%), Positives = 139/309 (44%), Gaps = 32/309 (10%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
+ A I IG PP + +DTGSD+ W++C+ CK P+ + + SST + +
Sbjct: 78 FLANISIGNPPVPQLLLIDTGSDLTWIHCLPCKCYPQ------TIPFFHPSRSSTYRNAS 131
Query: 136 CDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNG 195
C H + + T N C Y Y D S+T G ++ + ++ S D + N
Sbjct: 132 CVSA-PHAMPQIFRDEKTGN--CQYHLRYRDFSNTRGILAEEKLTFE-TSDDGLISKQN- 186
Query: 196 SLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGING-- 253
++FGCG SG G++G G S++++ S F++C +
Sbjct: 187 -IVFGCGQDNSGF------TKYSGVLGLGPGTFSIVTRNFGSK-----FSYCFGSLTNPT 234
Query: 254 --GGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNK-GTII 310
I +G+ + E + TPL Q Y +++ A+ G L++ F ++ GT+I
Sbjct: 235 YPHNILILGNGAKIEGDPTPLQIFQDRYYLDLQAISFGEKLLDIEPGTFQRYRSQGGTVI 294
Query: 311 DSGTTLAYLPEMVYEPLVSKIISQQPD-LKVHTVHDEYT--CFQYSESVD-EGFPNVTFH 366
D+G + L YE L +I + L+ D+YT C++ + +D GFP VTFH
Sbjct: 295 DTGCSPTILAREAYETLSEEIDFLLGEVLRRVKDWDQYTTPCYEGNLKLDLYGFPVVTFH 354
Query: 367 FENSVSLKV 375
F L +
Sbjct: 355 FAGGAELAL 363
>gi|224114179|ref|XP_002332420.1| predicted protein [Populus trichocarpa]
gi|222832373|gb|EEE70850.1| predicted protein [Populus trichocarpa]
Length = 449
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 89/339 (26%), Positives = 153/339 (45%), Gaps = 43/339 (12%)
Query: 75 LYYAKIGIG--------TPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGI--ELTLYD 124
L+ A++G+G T K YY Q+DTG+++ W IQC+ C + ++ + Y
Sbjct: 79 LFLAQVGVGSFQEKSHRTHFKTYYFQIDTGNELSW---IQCEGCQNKGNMCFPHKDPPYT 135
Query: 125 IKDSSTGKFVTCDQE-FCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDK 183
S + K V+C+Q FC C C Y YG GS T+G + +
Sbjct: 136 SSQSKSYKPVSCNQHSFCEP------NQCKEGL-CAYNVTYGPGSYTSGNLANETFTFYS 188
Query: 184 VSGDLQTTSTNGSLIFGCGARQSGNLDS--TNEEALDGIIGFGKSNSSMISQLASSGGVR 241
G S+ FGC + + ++ + G++G G S ++QL S +
Sbjct: 189 NHGKHTALK---SISFGCSTDSRNMIYAFLLDKNPVSGVLGMGWGPRSFLAQLGSISHGK 245
Query: 242 KMFAHCLDGINGGGIFAI--GHVVQPE-VNKTPLVPNQPH--YSINMTAVQVGLDFLNLP 296
F++C+ N + HVV+ + + T ++ +P Y +N+ + V LN+
Sbjct: 246 --FSYCITANNTHNTYLRFGKHVVKSKNLQTTKIMQVKPSAAYHVNLLGISVNGVKLNIT 303
Query: 297 TDVFGVGDN--KGTIIDSGTTLAYLPEMVYEPL---VSKIISQQPDLK---VHTVHDEYT 348
V + +G IID+GT L + +++ L +S +S +LK +H +H +
Sbjct: 304 KTDLAVRKDGSRGCIIDAGTLATLLVKPIFDTLHTALSNHLSSNQNLKRWVIHKLHKDLC 363
Query: 349 CFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFEDL 387
Q S++ + P VTFH EN+ L+V P E +F F +
Sbjct: 364 YEQLSDAGRKNLPVVTFHLENA-DLEVKP-EAIFLFREF 400
>gi|15226358|ref|NP_180389.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4803959|gb|AAD29831.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252998|gb|AEC08092.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 756
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 92/340 (27%), Positives = 138/340 (40%), Gaps = 52/340 (15%)
Query: 51 RRQQRILAGVD----LPLGGSSRPDGV---GLYYAKIGIGTPPKDYYVQVDTGSDIMWVN 103
R Q L G D L G S D + +Y K+ +GTPP + ++DTGSDI+W
Sbjct: 389 RAQNNFLVGYDSSSLLLQGASPYADTLYDYSIYLMKLQVGTPPFEIVAEIDTGSDIIWTQ 448
Query: 104 CIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEI 163
C+ C C + + ++D SST + ++ C+G SC Y I
Sbjct: 449 CMPCPNCYSQFA-----PIFDPSKSSTFR-----EQRCNG------------NSCHYEII 486
Query: 164 YGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGF 223
Y D + + G + V SG+ + GCG + S + GI+G
Sbjct: 487 YADKTYSKGILATETVTIPSTSGEPFVMAETK---IGCGLDNTNLQYSGFASSSSGIVGL 543
Query: 224 GKSNSSMISQLASSGGVRKMFAHCLDGINGGGI-FAIGHVVQPE---VNKTPLVPNQPHY 279
S+ISQ+ + ++C G I F +V + + + P Y
Sbjct: 544 NMGPLSLISQMDLP--YPGLISYCFSGQGTSKINFGTNAIVAGDGTVAADMFIKKDNPFY 601
Query: 280 SINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQ----- 334
+N+ AV V + + F D IDSGTTL Y P M Y LV + + Q
Sbjct: 602 YLNLDAVSVEDNLIATLGTPFHAEDGN-IFIDSGTTLTYFP-MSYCNLVREAVEQVVTAV 659
Query: 335 -QPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSL 373
PD+ D C+ YS+++D FP +T HF L
Sbjct: 660 KVPDMG----SDNLLCY-YSDTIDI-FPVITMHFSGGADL 693
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 80/309 (25%), Positives = 130/309 (42%), Gaps = 45/309 (14%)
Query: 75 LYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFV 134
+Y K+ +GTPP + ++DTGSD++W C+ C +C + ++D SS
Sbjct: 81 IYLMKLQVGTPPFEIAAEIDTGSDLIWTQCMPCPDCYSQFD-----PIFDPSKSS----- 130
Query: 135 TCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTN 194
T +++ CHG SC Y IY D + + G + V SG+ +
Sbjct: 131 TFNEQRCHG------------KSCHYEIIYEDNTYSKGILATETVTIHSTSGEPFVMAET 178
Query: 195 GSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGG 254
GCG + +S + GI+G S+ISQ+ + ++C G
Sbjct: 179 ---TIGCGLHNTDLDNSGFASSSSGIVGLNMGPRSLISQMDLP--YPGLISYCFSGQGTS 233
Query: 255 GI-FAIGHVVQPE---VNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTII 310
I F +V + + + P Y +N+ AV V + + F D +I
Sbjct: 234 KINFGTNAIVAGDGTVAADMFIKKDNPFYYLNLDAVSVEDNRIETLGTPFHAEDGN-IVI 292
Query: 311 DSGTTLAYLPEMVYEPLVSKIISQ------QPDLKVHTVHDEYTCFQYSESVDEGFPNVT 364
DSG+T+ Y P + Y LV K + Q PD ++ C+ +SE++D FP +T
Sbjct: 293 DSGSTVTYFP-VSYCNLVRKAVEQVVTAVRVPDPS----GNDMLCY-FSETIDI-FPVIT 345
Query: 365 FHFENSVSL 373
HF L
Sbjct: 346 MHFSGGADL 354
>gi|255550723|ref|XP_002516410.1| pepsin A, putative [Ricinus communis]
gi|223544445|gb|EEF45965.1| pepsin A, putative [Ricinus communis]
Length = 416
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 99/343 (28%), Positives = 140/343 (40%), Gaps = 60/343 (17%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNC----IQCKECP--RRSSL----GIELTLYDI 125
Y + IGTPP+ V +DTGSD+ WV C C +C R S L +
Sbjct: 12 YLISLNIGTPPQVIQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNSKLMSAFSPSHSSSSY 71
Query: 126 KDSSTGKFVT----CDQEF----CHGVYGGPLTDCTANTSCP-YLEIYGDGSSTTGYFVQ 176
+DS + T D F G L T CP + YG G TG +
Sbjct: 72 RDSCASPYCTDIHSSDNSFDPCTVAGCSLSTLIKATCARPCPSFAYTYGAGGVVTGTLTR 131
Query: 177 DVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLAS 236
D ++ + G + T FGC + ST E + GI GF + S SQL
Sbjct: 132 DTLRVHE--GPARVTKDIPKFCFGC-------VGSTYHEPI-GIAGFVRGTLSFPSQL-- 179
Query: 237 SGGVRKMFAHCL------DGINGGGIFAIGHVVQPEVN--------KTPLVPNQPHYSIN 282
G ++K F+HC + N IG + K+P+ PN +Y I
Sbjct: 180 -GLLKKGFSHCFLAFKYANNPNISSPLVIGDTALSSKDNMQFTPMLKSPMYPN--YYYIG 236
Query: 283 MTAVQVG-LDFLNLPTDV--FGVGDNKGTIIDSGTTLAYLPEMVYEPLVS---KIISQQP 336
+ A+ VG + +P ++ F N G +IDSGTT +LPE Y L+S II+
Sbjct: 237 LEAITVGNVSATTVPLNLREFDSQGNGGMLIDSGTTYTHLPEPFYSQLLSIFKAIITYPR 296
Query: 337 DLKVHTVHDEYTCFQYS------ESVDEGFPNVTFHFENSVSL 373
+V C++ D FP++TFHF N+VS
Sbjct: 297 ATEVEMRAGFDLCYKVPCPNNRLTDDDNLFPSITFHFLNNVSF 339
>gi|226492391|ref|NP_001140482.1| uncharacterized protein LOC100272542 precursor [Zea mays]
gi|224030447|gb|ACN34299.1| unknown [Zea mays]
gi|414887506|tpg|DAA63520.1| TPA: hypothetical protein ZEAMMB73_432695 [Zea mays]
Length = 512
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 93/370 (25%), Positives = 152/370 (41%), Gaps = 55/370 (14%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G Y + +GTPP+ + + +DTGSD+ W+ C C +C + ++D SS+
Sbjct: 142 GSAEYLMDVYVGTPPRRFQMIMDTGSDLNWLQCAPCLDCFEQRG-----PVFDPAASSSY 196
Query: 132 KFVTCDQEFCHGVYGGPLTDCT-----ANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSG 186
+ +TC C V CPY YGD S++TG + + ++
Sbjct: 197 RNLTCGDPRCGHVAPPEAPAPRACRRPGEDPCPYYYWYGDQSNSTGDLALESFTVN-LTA 255
Query: 187 DLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQL-ASSGGVRKMFA 245
++ +G ++FGCG R G G+ S SQL A GG F+
Sbjct: 256 PGASSRVDG-VVFGCGHRNRGLFHGAAGLLGL-----GRGPLSFASQLRAVYGG--HTFS 307
Query: 246 HCL----DGINGGGIF----AIGHVVQPEVNKTPLVPNQP----HYSINMTAVQVGLDFL 293
+CL + +F A+ P + T P Y + +T V VG + L
Sbjct: 308 YCLVDHGSDVASKVVFGEDDALALAAHPRLKYTAFAPASSPADTFYYVRLTGVLVGGELL 367
Query: 294 NLPTDVFGV--GDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQ-------PDLKVHTVH 344
N+ +D + G + GTIIDSGTTL+Y E Y+ + I + PD V +
Sbjct: 368 NISSDTWDASEGGSGGTIIDSGTTLSYFVEPAYQVIRRAFIDRMSGSYPPVPDFPVLS-- 425
Query: 345 DEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFED---LWCIGWQNSGMQSRD 401
C+ S P ++ F + ++ +P E F D + C+ +
Sbjct: 426 ---PCYNVSGVERPEVPELSLLFADG-AVWDFPAENYFIRLDPDGIMCL-----AVLGTP 476
Query: 402 RKNMTLLGDF 411
R M+++G+F
Sbjct: 477 RTGMSIIGNF 486
>gi|297803034|ref|XP_002869401.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
gi|297315237|gb|EFH45660.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
Length = 438
Score = 85.9 bits (211), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 92/369 (24%), Positives = 155/369 (42%), Gaps = 57/369 (15%)
Query: 35 YAGRERSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGL---YYAKIGIGTPPKDYYV 91
Y+ + +S +KE R + + A + P+ + + I IG+PP +
Sbjct: 41 YSLQSPQVSHIKEASVERLEYLKAKATGDIIAHLSPNVPIIPQAFLVNISIGSPPVTQLL 100
Query: 92 QVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTD 151
+DT SD++W+ C C C +S L ++D S T + +C Y P
Sbjct: 101 HMDTASDLLWLQCRPCINCYAQS-----LPIFDPSRSYTHRNESCRT----SQYSMPSLR 151
Query: 152 CTANT-SCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLD 210
A T SC Y Y DG+ + G ++++ ++ + D +++ ++FGCG G
Sbjct: 152 FNAKTRSCEYSMRYMDGTGSKGILAKEMLMFNTIY-DESSSAALHDVVFGCGHDNYG--- 207
Query: 211 STNEEAL--DGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGING----GGIFAIGHVVQ 264
E L GI+G G S++ + + F++C ++ + +G
Sbjct: 208 ----EPLVGTGILGLGYGEFSLVHRFGTK------FSYCFGSLDDPSYPHNVLVLGDDGA 257
Query: 265 PEV-NKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNK-----GTIIDSGTTLAY 318
+ + TPL Y + + A+ V D + LP D + N GTIID+G +L
Sbjct: 258 NILGDTTPLEIYNGFYYVTIEAISV--DGIILPIDPWVFNRNHQTGLGGTIIDTGNSLTS 315
Query: 319 LPEMVYEPLVSKI------------ISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFH 366
L E Y+PL +KI ++Q KV Y + V+ GFP VTFH
Sbjct: 316 LVEEAYKPLKNKIEDYFEGRFTAADVNQDDMFKVEC----YNGNLERDLVESGFPIVTFH 371
Query: 367 FENSVSLKV 375
F + L +
Sbjct: 372 FSDGAELSL 380
>gi|356539555|ref|XP_003538263.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 438
Score = 85.9 bits (211), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 95/349 (27%), Positives = 145/349 (41%), Gaps = 53/349 (15%)
Query: 39 ERSLSLLKEHDARRQ--QRILAGVDL-PLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDT 95
E L L + AR Q ++AG + P+ + Y + IGTPP+ + +DT
Sbjct: 57 ESVLQLQAKDQARLQFLASMVAGRSIVPIASGRQIIQSPTYIVRAKIGTPPQTLLLAIDT 116
Query: 96 GSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTAN 155
+D W+ C C C TL+ + S+T K V+C C+ V P C
Sbjct: 117 SNDAAWIPCTACDGC--------TSTLFAPEKSTTFKNVSCGSPECNKV---PSPSC-GT 164
Query: 156 TSCPYLEIYGDGSSTTGYFVQDVVQY--DKVSGDLQTTSTNGSLIFGCGARQSGNLDSTN 213
++C + YG SS VQD V D + G FGC A+ +G ST
Sbjct: 165 SACTFNLTYGS-SSIAANVVQDTVTLATDPIPG----------YTFGCVAKTTG--PSTP 211
Query: 214 EEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL---DGINGGGIFAIGHVVQP-EVNK 269
+ L G+ S S L S F++CL +N G +G V QP +
Sbjct: 212 PQGLLGLGRGPLSLLSQTQNLYQS-----TFSYCLPSFKSLNFSGSLRLGPVAQPIRIKY 266
Query: 270 TPLVPNQPH---YSINMTAVQVGLDFLNLPTD--VFGVGDNKGTIIDSGTTLAYLPEMVY 324
TPL+ N Y +N+ A++VG +++P F GT+ DSGT L VY
Sbjct: 267 TPLLKNPRRSSLYYVNLFAIRVGRKIVDIPPAALAFNAATGAGTVFDSGTVFTRLVAPVY 326
Query: 325 EPLVSKI-----ISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFE 368
+ + ++ + +L V ++ TC+ +V P +TF F
Sbjct: 327 TAVRDEFRRRVAMAAKANLTVTSLGGFDTCY----TVPIVAPTITFMFS 371
>gi|413948408|gb|AFW81057.1| hypothetical protein ZEAMMB73_038743 [Zea mays]
Length = 469
Score = 85.9 bits (211), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 90/343 (26%), Positives = 144/343 (41%), Gaps = 31/343 (9%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y+ + +GTP + + + DTGSD+ WV C P E + +S +
Sbjct: 101 GTGQYFVRFRVGTPAQPFVLVADTGSDLTWVKCRGAAGPPASDPPARE---FRASESRSW 157
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTS-CPYLEIYGDGSSTTGYFVQDVV--------QYD 182
+ C + C L +C++ S C Y Y DGS+ G D D
Sbjct: 158 APLACSSDTCTSYVPFSLANCSSPASPCAYDYRYKDGSAARGVVGTDAATIALSGSGSED 217
Query: 183 KVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRK 242
G + G ++ GC A D + ++ DG++ G SN S S+ A+ G R
Sbjct: 218 GSGGGGRRAKLQG-VVLGCTA----TYDGQSFQSSDGVLSLGNSNISFASRAAARFGGR- 271
Query: 243 MFAHCL-DGI---NGGGIFAIGHVVQPEVN---KTPLVPNQ---PHYSINMTAVQVGLDF 292
F++CL D + N G + +TPLV ++ P Y++ + AV V +
Sbjct: 272 -FSYCLVDHLAPRNASSYLTFGPGPEGGGAPAARTPLVLDRRVSPFYAVAVDAVYVAGEA 330
Query: 293 LNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQY 352
L++P DV+ VG G I+DSGT+L L Y +V+ + + L + C+ +
Sbjct: 331 LDIPADVWDVGRGGGAILDSGTSLTVLATPAYRAVVAALGGRLAALPRVAMDPFEYCYNW 390
Query: 353 SESVDEGFPNVTFHFENSVSLKVYPHEYLF-PFEDLWCIGWQN 394
+ E P + F S L+ Y+ + CIG Q
Sbjct: 391 TAGAPE-IPKLEVSFAGSARLEPPAKSYVIDAAPGVKCIGVQE 432
>gi|359492489|ref|XP_002285867.2| PREDICTED: aspartic proteinase Asp1-like [Vitis vinifera]
Length = 453
Score = 85.9 bits (211), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 90/348 (25%), Positives = 143/348 (41%), Gaps = 52/348 (14%)
Query: 55 RILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNC----IQCKEC 110
R+ V PL G+ P G Y + IG PPK Y + +D+GSD+ W+ C + C +
Sbjct: 49 RMGHTVVFPLQGNVYPQG--FYSVSLRIGNPPKPYTLDIDSGSDLTWLQCDAPCVSCTKA 106
Query: 111 PRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTA-NTSCPYLEIYGDGSS 169
P + + G +TC+ C ++ C A + C Y Y D S
Sbjct: 107 P-----------HPPYKPNKGP-ITCNDPMCSALHWPSKPPCKASHEQCDYEVSYADHGS 154
Query: 170 TTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSS 229
+ G V D+ +G L L FGCG QS +DG++G G SS
Sbjct: 155 SLGVLVHDIFSLQLTNGTLAAPR----LAFGCGYDQS-YPGPNAPPFVDGVLGLGYGKSS 209
Query: 230 MISQLASSGGVRKMFAHCLDGINGGGIFAIGHV-VQPEVNKTPLVPNQPHYSINMTAVQV 288
+++QL S G +R + HCL G GG +F + P + TP+ +A +
Sbjct: 210 IVTQLRSLGLIRSIVGHCLSGRGGGFLFLGDGLSTTPGIIWTPMSRKS-----GESAYAL 264
Query: 289 GLDFLNLPTDVFGVGDNKGT-----IIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTV 343
G P D+ G N G + DSG++ Y Y+ +S ++ + + K+
Sbjct: 265 G------PADLLFNGQNSGVKGLRLVFDSGSSYTYFNAQAYKTTLS-LVRKYLNGKLKET 317
Query: 344 HDEY--TCFQYSESVDEGFP--------NVTFHFENSVSLKVYPHEYL 381
DE C++ ++ F ++F S L++ P YL
Sbjct: 318 ADESLPVCWRGAKPFKSIFEVKNYFKPFALSFTKAKSAQLQLPPESYL 365
>gi|226531872|ref|NP_001147022.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195606574|gb|ACG25117.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 491
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 89/360 (24%), Positives = 144/360 (40%), Gaps = 70/360 (19%)
Query: 61 DLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCI---QCKECPRRSSLG 117
+P + P G Y +GTPP+ V +DTGS + WV C +C+ C S+
Sbjct: 84 SVPATAALYPHSYGGYAFTASLGTPPQPLPVLLDTGSHLTWVPCTSSYECRNCSSPSASA 143
Query: 118 IELTLYDIKDSSTGKFVTCDQEFCHGVYGGP--LTDC---------------TANTSCPY 160
+ ++ K+SS+ + V C C V+ T C +N PY
Sbjct: 144 VP--VFHPKNSSSSRLVGCRNPSCQWVHSAANLATKCRRAPCSPGAANCPAAASNVCPPY 201
Query: 161 LEIYGDGSSTTGYFVQDVVQYD--KVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALD 218
+YG G ST G + D ++ V G + GC +L S ++
Sbjct: 202 AVVYGSG-STAGLLIADTLRAPGRAVPG----------FVLGC------SLVSVHQPP-S 243
Query: 219 GIIGFGKSNSSMISQLASSGGVRKMFAHCL-------DGINGGGIFAIGHVVQPEVNKTP 271
G+ GFG+ S+ +QL G+ K F++CL + G + G + P
Sbjct: 244 GLAGFGRGAPSVPAQL----GLPK-FSYCLLSRRFDDNAAVSGSLVLGGTGGGEGMQYVP 298
Query: 272 LV--------PNQPHYSINMTAVQVGLDFLNLPTDVFG--VGDNKGTIIDSGTTLAYLPE 321
LV P +Y + + V VG + LP F + GTI+DSGTT YL
Sbjct: 299 LVKSAAGDKLPYGVYYYLALRGVTVGGKAVRLPARAFAGNAAGSGGTIVDSGTTFTYLDP 358
Query: 322 MVYEPLVSKIISQQPDLKVHTVHDE-----YTCFQYSESVDE-GFPNVTFHFENSVSLKV 375
V++P+ +++ + E + CF + P ++FHFE +++
Sbjct: 359 TVFQPVADAVVAAVGGRYKRSKDAEDGLGLHPCFALPQGARSMALPELSFHFEGGAVMQL 418
>gi|145511131|ref|XP_001441493.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124408743|emb|CAK74096.1| unnamed protein product [Paramecium tetraurelia]
Length = 490
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 87/336 (25%), Positives = 147/336 (43%), Gaps = 51/336 (15%)
Query: 73 VGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIK-DSSTG 131
+G Y+ I +G PP+ V +DTGS I C C + S GI L Y I+ +SST
Sbjct: 31 LGYYFVNIYVGNPPQRQSVIIDTGSSI---TAFPCDACDQTKSCGIHLDQYYIRNNSSTQ 87
Query: 132 KFVTCDQEFCHGVYGGPLTDCTA----NTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGD 187
+ + C +F +CT N C + Y +GS G++++D V + GD
Sbjct: 88 EELDCKSQF---------GECTCLRCLNQQCIFSISYSEGSHLEGFYLKDQV----IFGD 134
Query: 188 LQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFG-KSNSSM-----ISQLASS-GGV 240
L + + + +FGC R++ NL T + +GI+G K+N+S+ + + + G+
Sbjct: 135 LLMEANSVTSVFGCTTRET-NLFKTQQA--NGIMGLSPKTNTSLAFPNIVDDIHTQHNGM 191
Query: 241 RKMFAHCLDGINGGGIFAIGHVVQPEVNKTPL--------VPNQPHYSINMTAVQVGLDF 292
FA C+ I+ G IG K N+P Y + ++ ++V
Sbjct: 192 NLFFAICIGRID--GYMTIGQYDYSRHQKNSAYYTIQYMHTQNKPVYGVKISQIKVHNKT 249
Query: 293 LNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQY 352
+ D+ G G+ IDSG+TL V LV+ + + + +D+ C+ Y
Sbjct: 250 ILAGADLQSGG---GSFIDSGSTLVNAHPDVTRALVNFFVCESANCPQMQFNDDLACYVY 306
Query: 353 SESVD-------EGFPNVTFHFENSVSLKVYPHEYL 381
++++ FP F EN+ P +YL
Sbjct: 307 NKTLHGSFEQFISFFPTYQFIMENNFIFDWTPRDYL 342
>gi|356528627|ref|XP_003532901.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 430
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 91/348 (26%), Positives = 144/348 (41%), Gaps = 36/348 (10%)
Query: 35 YAGRERSLSLLKEHDAR---RQQRI--LAGVDLPLGG--SSRPDGVGLYYAKIGIGTPPK 87
Y + L+K R R +R+ + + PL + PD G Y + +GTP
Sbjct: 41 YNSQMTQTELVKSAALRSITRSKRVNFIGQISPPLSPIITPIPDH-GEYLMRFSLGTPSV 99
Query: 88 DYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGG 147
+ DTGSD+ W+ C CK C + E L+D SST V C+ + C ++
Sbjct: 100 ERLAIFDTGSDLSWLQCTPCKTCYPQ-----EAPLFDPTQSSTYVDVPCESQPC-TLFPQ 153
Query: 148 PLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSG 207
+C ++ C YL YG S T G D + + +G Q +T +FGC +
Sbjct: 154 NQRECGSSKQCIYLHQYGTDSFTIGRLGYDTISFSS-TGMGQGGATFPKSVFGCAFYSNF 212
Query: 208 NLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGIN--GGGIFAIGHVVQP 265
+ + +G +G G S+ SQL G + F++C+ + G G +
Sbjct: 213 TFKISTKA--NGFVGLGPGPLSLASQLGDQIGHK--FSYCMVPFSSTSTGKLKFGSMAPT 268
Query: 266 -EVNKTPLV--PNQP-HYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPE 321
EV TP + P+ P +Y +N+ + VG V IIDS L +L +
Sbjct: 269 NEVVSTPFMINPSYPSYYVLNLEGITVGQK------KVLTGQIGGNIIIDSVPILTHLEQ 322
Query: 322 MVYEPLVSKIISQQPDLKVHTVHDEYTCFQY--SESVDEGFPNVTFHF 367
+Y +S + + + V D T F+Y + FP FHF
Sbjct: 323 GIYTDFISSV---KEAINVEVAEDAPTPFEYCVRNPTNLNFPEFVFHF 367
>gi|255554715|ref|XP_002518395.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223542240|gb|EEF43782.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 489
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 92/379 (24%), Positives = 160/379 (42%), Gaps = 42/379 (11%)
Query: 37 GRERSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTP-PKDYYVQVDT 95
R + +S L+ R+ + +P+ S G Y+ I IGTP P+ + + DT
Sbjct: 81 ARRQMISSLRHGTRRKAFEVSHTAQIPIH-SGADSGQSQYFVSIRIGTPRPQKFILVTDT 139
Query: 96 GSDIMWVNC-IQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGG--PLTDC 152
GSD+ W+NC CK CP+ + + + DSS+ + + C + C LT+C
Sbjct: 140 GSDLTWMNCEYWCKSCPKPNPHPGRV--FRANDSSSFRTIPCSSDDCKIELQDYFSLTEC 197
Query: 153 -TANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDS 211
N C + Y +G G F + V V + ++ GC + + +
Sbjct: 198 PNPNAPCLFDYRYLNGPRAIGVFANETVT---VGLNDHKKIRLFDVLIGC----TESFNE 250
Query: 212 TNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL----DGINGGGIFAIGHVVQ--- 264
TN DG++G G S+ +LA G + F++CL N + G + +
Sbjct: 251 TNGFP-DGVMGLGYRKHSLALRLAEIFGNK--FSYCLVDHLSSSNHKNFLSFGDIPEMKL 307
Query: 265 PEVNKTPLVPN--QPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEM 322
P++ T L+ Y +N++ + VG L++ +D++ V G I+DSGT+L L
Sbjct: 308 PKMQHTELLLGYINAFYPVNVSGISVGGSMLSISSDIWNVTGVGGMIVDSGTSLTMLAGE 367
Query: 323 VYEPLVSKII----SQQPDLKVHTVHDEYTCFQYSESVDEGF-----PNVTFHFENSVSL 373
Y+ +V + + + + CF+ D+GF P + HF +
Sbjct: 368 AYDKVVDALKPIFDKHKKVVPIELPELNNFCFE-----DKGFDRAAVPRLLIHFADGAIF 422
Query: 374 KVYPHEYLFPF-EDLWCIG 391
K Y+ E + C+G
Sbjct: 423 KPPVKSYIIDVAEGIKCLG 441
>gi|357514989|ref|XP_003627783.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521805|gb|AET02259.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 416
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 95/339 (28%), Positives = 141/339 (41%), Gaps = 65/339 (19%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKF 133
G Y +GTPP Y DTGSDI+W+ C CKEC +++ + SST K
Sbjct: 85 GEYLMTYSVGTPPFKLYGIADTGSDIVWLQCEPCKECYNQTT-----PKFKPSKSSTYKN 139
Query: 134 VTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTST 193
+ C + C G L+ T LE S+TG+ + + K
Sbjct: 140 IPCSSDLCKSGQQGNLSVDTLT-----LE------SSTGH----PISFPKT--------- 175
Query: 194 NGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL----- 248
+ GCG + + E A GI+G G +S+I+QL SS + F++CL
Sbjct: 176 ----VIGCGTDNTVSF----EGASSGIVGLGGGPASLITQLGSS--IDAKFSYCLLPNPV 225
Query: 249 -----DGINGGGIFAI---GHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVF 300
+N G + G V P V K P+V Y + + A VG +
Sbjct: 226 ESNTTSKLNFGDTAVVSGDGVVSTPIVKKDPIV----FYYLTLEAFSVGNKRIEFEGSSN 281
Query: 301 GVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDE-- 358
G G IIDSGTTL +P VY L S ++ +K+ V+D F SV
Sbjct: 282 G-GHEGNIIIDSGTTLTVIPTDVYNNLESAVLEL---VKLKRVNDPTRLFNLCYSVTSDG 337
Query: 359 -GFPNVTFHFENSVSLKVYPHEYLFPFED-LWCIGWQNS 395
FP +T HF+ + +K++P D + C+ + +
Sbjct: 338 YDFPIITTHFKGA-DVKLHPISTFVDVADGIVCLAFATT 375
>gi|357118074|ref|XP_003560784.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial
[Brachypodium distachyon]
Length = 452
Score = 85.5 bits (210), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 88/300 (29%), Positives = 135/300 (45%), Gaps = 38/300 (12%)
Query: 80 IGIGTPPKDYYVQVDTGSDIMWVNCIQCK-ECPRRSSLGIELTLYDIKDSSTGKFVTCDQ 138
+G G+P + DTGSD+ W+ C C C ++ ++D SS+ V C
Sbjct: 116 VGFGSPAQTSATMFDTGSDLSWIQCQPCSGHCYKQHD-----PVFDPAKSSSYAVVPCGT 170
Query: 139 EFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLI 198
C G +C T+C Y YGDGSSTTG ++ + + ++S I
Sbjct: 171 TECAAAGG----ECN-GTTCVYGVEYGDGSSTTGVLARETLTF-------SSSSEFTGFI 218
Query: 199 FGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGIN-GGGIF 257
FGCG G+ +DG++G G+ + S+ SQ A + G +F++CL N G
Sbjct: 219 FGCGETNLGDFGE-----VDGLLGLGRGSLSLSSQAAPAFG--GIFSYCLPSYNTTPGYL 271
Query: 258 AIGHVV---QPEVNKTPLVPNQPHYS----INMTAVQVGLDFLNLPTDVFGVGDNKGTII 310
+IG Q V T +V N+P Y I + ++ +G L +P F GT++
Sbjct: 272 SIGATPVTGQIPVQYTAMV-NKPDYPSFYFIELVSINIGGYVLPVPPSEF---TKTGTLL 327
Query: 311 DSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEY-TCFQYSESVDEGFPNVTFHFEN 369
DSGT L YLP Y L + K +DE TC+ ++ P V+F+F +
Sbjct: 328 DSGTILTYLPPPAYTALRDRFKFTMQGSKPAPPYDELDTCYDFTGQSGILIPGVSFNFSD 387
>gi|357153697|ref|XP_003576537.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 474
Score = 85.5 bits (210), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 89/323 (27%), Positives = 139/323 (43%), Gaps = 52/323 (16%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y A +G+G + V VDT S++ WV C C+ C + + L+D S + V
Sbjct: 120 YVATVGLGA--AEATVVVDTASELTWVQCQPCESCHDQ-----QDPLFDPSSSPSYAAVP 172
Query: 136 CDQEFCHGV---YGGPLTDCTANT----SCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDL 188
C+ C + + C + +C Y Y DGS + G +D ++ D+
Sbjct: 173 CNSSSCDALRVAMAAGTSPCADDNEQQPACSYALSYRDGSYSRGVLARDKLRL--AGQDI 230
Query: 189 QTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQ-LASSGGVRKMFAHC 247
+ +FGCG G G++G G+S+ S++SQ + GGV F++C
Sbjct: 231 E------GFVFGCGTSNQG----APFGGTSGLMGLGRSHVSLVSQTMDQFGGV---FSYC 277
Query: 248 LDGINGG--GIFAIGHVVQPEVNKTPLV---------PNQ-PHYSINMTAVQVGLDFLNL 295
L G G +G N TP+V P Q P Y +N+T + VG +
Sbjct: 278 LPMRESGSSGSLVLGDDSSAYRNSTPIVYTAMVSDSGPLQGPFYFLNLTGITVGGQEVES 337
Query: 296 PTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQ---QPDLKVHTVHDEYTCFQY 352
P F G IIDSGT + L VY + ++ +SQ P ++ D TCF
Sbjct: 338 PW--FSAGR---VIIDSGTIITTLVPSVYNAVRAEFLSQLAEYPQAPAFSILD--TCFNL 390
Query: 353 SESVDEGFPNVTFHFENSVSLKV 375
+ + P++ F FE SV ++V
Sbjct: 391 TGLKEVQVPSLKFVFEGSVEVEV 413
>gi|125542690|gb|EAY88829.1| hypothetical protein OsI_10302 [Oryza sativa Indica Group]
Length = 440
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 86/364 (23%), Positives = 138/364 (37%), Gaps = 52/364 (14%)
Query: 39 ERSLSLLKEHDAR----RQQRILAGVD-LPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQV 93
E ++L ++ DAR + AGV P+ P Y + G+G+P + + +
Sbjct: 42 ESIIALARDDDARLLFLSSKAATAGVSSAPVASGQAPPS---YVVRAGLGSPSQQLLLAL 98
Query: 94 DTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCT 153
DT +D W +C C CP S L+ +SS+ + C +C G
Sbjct: 99 DTSADATWAHCSPCGTCPSSS-------LFAPANSSSYASLPCSSSWCPLFQG------- 144
Query: 154 ANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSL----------IFGCGA 203
+CP + GD + Q + +L FGC +
Sbjct: 145 --QACPAPQGGGDAAPPPATLPTCAFSKPFADASFQAALASDTLRLGKDAIPNYTFGCVS 202
Query: 204 RQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDG-----INGGGIFA 258
+G T G++G G+ +++SQ S +F++CL +G
Sbjct: 203 SVTG---PTTNMPRQGLLGLGRGPMALLSQAGSL--YNGVFSYCLPSYRSYYFSGSLRLG 257
Query: 259 IGHVVQPEVNKTPLVPNQPH----YSINMTAVQVGLDFLNLPTDVFG--VGDNKGTIIDS 312
G V TP++ N PH Y +N+T + VG ++ +P F GT++DS
Sbjct: 258 AGGGQPRSVRYTPMLRN-PHRSSLYYVNVTGLSVGRAWVKVPAGSFAFDAATGAGTVVDS 316
Query: 313 GTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEY-TCFQYSESVDEGFPNVTFHFENSV 371
GT + VY L + Q +T + TCF E G P VT H + V
Sbjct: 317 GTVITRWTAPVYAALREEFRRQVAAPSGYTSLGAFDTCFNTDEVAAGGAPAVTVHMDGGV 376
Query: 372 SLKV 375
L +
Sbjct: 377 DLAL 380
>gi|242081367|ref|XP_002445452.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
gi|241941802|gb|EES14947.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
Length = 459
Score = 85.1 bits (209), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 85/334 (25%), Positives = 143/334 (42%), Gaps = 49/334 (14%)
Query: 80 IGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQE 139
+G+GTPP+ V +D GSD++W C ++ ++D SS+ + CD +
Sbjct: 111 VGVGTPPQPSKVILDLGSDLLWTQCSLVGPTAKQLE-----PVFDAARSSSFSVLPCDSK 165
Query: 140 FCHGVYGGPLTDCT-ANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLI 198
C G T+ T + C Y YG ++ TG + + G + +L
Sbjct: 166 LCE---AGTFTNKTCTDRKCAYENDYGIMTA-TGVLATETFTFGAHHG------VSANLT 215
Query: 199 FGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL--------DG 250
FGCG +G + + GI+G SM+ QLA + F++CL
Sbjct: 216 FGCGKLANGTIAEAS-----GILGLSPGPLSMLKQLAITK-----FSYCLTPFADRKTSP 265
Query: 251 INGGGIFAIG-HVVQPEVNKTPLVPN---QPHYSINMTAVQVGLDFLNLPTDVFGVGDN- 305
+ G + +G + +V PL+ N +Y + M + VG L++P + + +
Sbjct: 266 VMFGAMADLGKYKTTGKVQTIPLLKNPVEDIYYYVPMVGMSVGSKRLDVPQETLAIKPDG 325
Query: 306 -KGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKV--HTVHDEYTCFQYSESVD-EG-- 359
GT++DS TTLAYL E + L K + + L V +V D CF+ + EG
Sbjct: 326 TGGTVLDSATTLAYLVEPAFTEL-KKAVMEGIKLPVANRSVDDYPVCFELPRGMSMEGVQ 384
Query: 360 FPNVTFHFENSVSLKVYPHEYLF--PFEDLWCIG 391
P + HF+ + + P + F P + C+
Sbjct: 385 VPPLVLHFDGDAEMSL-PRDNYFQEPSPGMMCLA 417
>gi|302141796|emb|CBI18999.3| unnamed protein product [Vitis vinifera]
Length = 390
Score = 85.1 bits (209), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 90/349 (25%), Positives = 143/349 (40%), Gaps = 52/349 (14%)
Query: 55 RILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNC----IQCKEC 110
R+ V PL G+ P G Y + IG PPK Y + +D+GSD+ W+ C + C +
Sbjct: 16 RMGHTVVFPLQGNVYPQG--FYSVSLRIGNPPKPYTLDIDSGSDLTWLQCDAPCVSCTKA 73
Query: 111 PRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTA-NTSCPYLEIYGDGSS 169
P + + G +TC+ C ++ C A + C Y Y D S
Sbjct: 74 P-----------HPPYKPNKGP-ITCNDPMCSALHWPSKPPCKASHEQCDYEVSYADHGS 121
Query: 170 TTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSS 229
+ G V D+ +G L L FGCG QS +DG++G G SS
Sbjct: 122 SLGVLVHDIFSLQLTNGTLAAPR----LAFGCGYDQS-YPGPNAPPFVDGVLGLGYGKSS 176
Query: 230 MISQLASSGGVRKMFAHCLDGINGGGIFAIGHV-VQPEVNKTPLVPNQPHYSINMTAVQV 288
+++QL S G +R + HCL G GG +F + P + TP+ +A +
Sbjct: 177 IVTQLRSLGLIRSIVGHCLSGRGGGFLFLGDGLSTTPGIIWTPM-----SRKSGESAYAL 231
Query: 289 GLDFLNLPTDVFGVGDNKGT-----IIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTV 343
G P D+ G N G + DSG++ Y Y+ +S ++ + + K+
Sbjct: 232 G------PADLLFNGQNSGVKGLRLVFDSGSSYTYFNAQAYKTTLS-LVRKYLNGKLKET 284
Query: 344 HDEY--TCFQYSESVDEGFP--------NVTFHFENSVSLKVYPHEYLF 382
DE C++ ++ F ++F S L++ P YL
Sbjct: 285 ADESLPVCWRGAKPFKSIFEVKNYFKPFALSFTKAKSAQLQLPPESYLI 333
>gi|91806508|gb|ABE65981.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 203
Score = 84.7 bits (208), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 53/156 (33%), Positives = 80/156 (51%), Gaps = 17/156 (10%)
Query: 39 ERSLSLLKEHDARRQQRILAGVDLPLGGS-----SRPDGV---GLYYAKIGIGTPPKDYY 90
E L+ L D+ R R+L P+ GS R + LYY + IGTPP++
Sbjct: 36 ELDLTQLMTFDSARHGRLLQS---PVHGSFNWKVERDTSILLSALYYTTVQIGTPPRELD 92
Query: 91 VQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLT 150
V +DTGSD++WV+C C CP + +T +D SS+ + C + C +
Sbjct: 93 VVIDTGSDLVWVSCNSCVGCPLHN-----VTFFDPGASSSAVKLACSDKRCSSDLQKK-S 146
Query: 151 DCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSG 186
C+ SC Y YGDGS T+GY++ D++ +D +SG
Sbjct: 147 RCSLLESCTYKVEYGDGSVTSGYYISDLISFDTMSG 182
>gi|223946655|gb|ACN27411.1| unknown [Zea mays]
Length = 378
Score = 84.7 bits (208), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 73/274 (26%), Positives = 120/274 (43%), Gaps = 27/274 (9%)
Query: 119 ELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCT-ANTSCPY-LEIYGDGSSTTGYFVQ 176
+L +Y +S+T + + C E C V G CT CPY ++ + + ++++G ++
Sbjct: 5 DLRIYRPAESTTSRHLPCSHELCQSVPG-----CTNPKQPCPYNIDYFSENTTSSGLLIE 59
Query: 177 DVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLAS 236
D + + + N S+I GCG +QSG D + A DG++G G ++ S+ S LA
Sbjct: 60 DTLHLNYREDHV---PVNASVIIGCGQKQSG--DYLDGIAPDGLLGLGMADISVPSFLAR 114
Query: 237 SGGVRKMFAHCLDGINGGGIFAIGHVVQPEVNKTPLVP---NQPHYSINMTAVQVGLDFL 293
+G V+ F+ C + G IF G P TP VP Y++N+ +G L
Sbjct: 115 AGLVQNSFSMCFKEDSSGRIF-FGDQGVPSQQSTPFVPLYGKLQTYAVNVDKSCIGHKCL 173
Query: 294 NLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYT-CFQY 352
G + ++DSGT+ LP VY+ + Q +V + C+
Sbjct: 174 E--------GTSFKALVDSGTSFTSLPFDVYKAFTMEFDKQMNATRVPYEDTTWKYCYSA 225
Query: 353 SESVDEGFPNVTFHFENSVSLKVYPHEYLFPFED 386
S P +T F SL+ + PF D
Sbjct: 226 SPLEMPDVPTITLTFAADKSLQAV--NPILPFND 257
>gi|224092218|ref|XP_002309514.1| predicted protein [Populus trichocarpa]
gi|222855490|gb|EEE93037.1| predicted protein [Populus trichocarpa]
Length = 474
Score = 84.7 bits (208), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 85/311 (27%), Positives = 133/311 (42%), Gaps = 32/311 (10%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQC-KEC-PRRSSLGIELTLYDIKDSS 129
G G Y +G+GTP +D+ + DTGS I W C C C P++ +D S+
Sbjct: 131 GTGNYVVTVGLGTPKEDFTLVFDTGSGITWTQCQPCLGSCYPQKEQ------KFDPTKST 184
Query: 130 TGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQ 189
+ V+C C+ + +N++C Y IYGD S + G+F + + S D+
Sbjct: 185 SYNNVSCSSASCNLLPTSERGCSASNSTCLYQIIYGDQSYSQGFFATETLTIS--SSDVF 242
Query: 190 TTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLD 249
T + +FGCG +G G++G S+ S+ SQ A +K F++CL
Sbjct: 243 T-----NFLFGCGQSNNGLFGQAA-----GLLGLSSSSVSLPSQTAEK--YQKQFSYCLP 290
Query: 250 GI-NGGGIFAIGHVVQPEVNKTPLVPN-QPHYSINMTAVQVGLDFLNLPTDVFGVGDNKG 307
+ G G V TP+ P Y I++ + V L + +F G
Sbjct: 291 STPSSTGYLNFGGKVSQTAGFTPISPAFSSFYGIDIVGISVAGSQLPIDPSIF---TTSG 347
Query: 308 TIIDSGTTLAYLPEMVYEPLVSKI---ISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVT 364
IIDSGT + LP Y+ L +S P + D TC+ +S FP V+
Sbjct: 348 AIIDSGTVITRLPPTAYKALKEAFDEKMSNYPKTNGDELLD--TCYDFSNYTTVSFPKVS 405
Query: 365 FHFENSVSLKV 375
F+ V + +
Sbjct: 406 VSFKGGVEVDI 416
>gi|297842525|ref|XP_002889144.1| hypothetical protein ARALYDRAFT_476912 [Arabidopsis lyrata subsp.
lyrata]
gi|297334985|gb|EFH65403.1| hypothetical protein ARALYDRAFT_476912 [Arabidopsis lyrata subsp.
lyrata]
Length = 467
Score = 84.7 bits (208), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 96/358 (26%), Positives = 139/358 (38%), Gaps = 55/358 (15%)
Query: 52 RQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNC-IQCKEC 110
+ +R+ + V P+ G+ P +G YY + IG PPK + + +DTGSD+ WV C C C
Sbjct: 46 QNRRLGSSVVFPVSGNVYP--LGYYYVLLNIGNPPKLFDLDIDTGSDLTWVQCDAPCNGC 103
Query: 111 --PRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANT-------SCPYL 161
PR + C C G+ D T N C Y
Sbjct: 104 TKPRAKQY-----------KPNHNTLPCSHLLCSGL------DLTQNRPCDDPEDQCDYE 146
Query: 162 EIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGII 221
Y D +S+ G V D +G + N L FGCG Q N GI+
Sbjct: 147 IGYSDHASSIGALVTDEFPLKLANGSIM----NPHLTFGCGYDQQ-NPGPHPPPPTAGIL 201
Query: 222 GFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHVVQPE--VNKTPLVPNQPHY 279
G G+ + +QL S G + + HCL G G +IG + P V T L N
Sbjct: 202 GLGRGKVGISTQLKSLGITKNVIVHCLSHT-GKGFLSIGDELVPSSGVTWTSLATNSA-- 258
Query: 280 SINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEP---LVSKIISQQP 336
S N L F + T V G+ + DSG++ Y Y+ L+ K ++ +P
Sbjct: 259 SKNYMTGPAELLFNDKTTGVKGI----NVVFDSGSSYTYFNAEAYQAILDLIRKDLNGKP 314
Query: 337 DLKVHTVHDEYTCFQYS---ESVDEG---FPNVTFHF---ENSVSLKVYPHEYLFPFE 385
C++ +S+DE F +T F +N +V P YL E
Sbjct: 315 LTDTKDDKSLPVCWKGKKPLKSLDEVKKYFKTITLRFGYQKNGQLFQVPPESYLIITE 372
>gi|90399145|emb|CAJ86169.1| H0913C04.10 [Oryza sativa Indica Group]
gi|125550292|gb|EAY96114.1| hypothetical protein OsI_17992 [Oryza sativa Indica Group]
Length = 491
Score = 84.7 bits (208), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 90/317 (28%), Positives = 136/317 (42%), Gaps = 63/317 (19%)
Query: 51 RRQQRILAGVDLP--LGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCI--- 105
R + R G P + S P G Y + +GTPP+ V +DTGS + WV C
Sbjct: 62 RPRPRSRQGTAPPPSVRASLYPHSYGGYAFTVSLGTPPQPLPVLLDTGSHLSWVPCTSSY 121
Query: 106 QCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGP-LTDCTANTSC------ 158
QC+ C S+ L ++ K+SS+ + + C C ++ L+DC A +SC
Sbjct: 122 QCRNCSSLSAAS-PLHVFHPKNSSSSRLIGCRNPSCLWIHSPDHLSDCRAASSCPGANCT 180
Query: 159 -----------PYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSG 207
PYL +YG G ST G + D ++ + + + GC
Sbjct: 181 PRNANANNVCPPYLVVYGSG-STAGLLISDTLRTPGRAVR--------NFVIGC------ 225
Query: 208 NLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL--------DGINGGGIF-- 257
+L S ++ G+ GFG+ S+ SQL G+ K F++CL ++G I
Sbjct: 226 SLASVHQPP-SGLAGFGRGAPSVPSQL----GLTK-FSYCLLSRRFDDNAAVSGELILGG 279
Query: 258 AIGHVVQPEVNKTPLV-------PNQPHYSINMTAVQVGLDFLNLPTDVF-GVGDNKGTI 309
A G + PL P +Y + +TA+ VG + LP F G G I
Sbjct: 280 AGGKDGGVGMQYAPLARSASARPPYSVYYYLALTAITVGGKSVQLPERAFVAGGAGGGAI 339
Query: 310 IDSGTTLAYLPEMVYEP 326
+DSGTT +Y V+EP
Sbjct: 340 VDSGTTFSYFDRTVFEP 356
>gi|403222804|dbj|BAM40935.1| aspartyl(acid) protease [Theileria orientalis strain Shintoku]
Length = 509
Score = 84.7 bits (208), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 91/363 (25%), Positives = 155/363 (42%), Gaps = 72/363 (19%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
YY +GIG P + +DTGS ++ V C +CKEC L Y++ S T K +
Sbjct: 80 YYVYVGIGNPKTKQMLIIDTGSQLINVACGKCKECGNHL-----LPNYELGASVTHKLID 134
Query: 136 CDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNG 195
CD EFC V G C + SC + E Y +GS+ G V D++ +D + D ST
Sbjct: 135 CDSEFCKAVEG----KCGLDESCLFNESYSEGSNVEGKVVGDLISFD-IKKDSSYLSTFF 189
Query: 196 SLIFGCGARQSGNLDSTNEEALDGIIGFGKSNS------------SMISQLASS--GGVR 241
+ I GC +S + S + +GI+G KS+ S I + + ++
Sbjct: 190 NYI-GCVTNESQLIKS---QITNGILGLAKSDKPTLISHEYFETQSFIEKYLTDHFRPMK 245
Query: 242 KMFAHCLDGINGGGIFAIGHV---VQPEVNKT------PLVPNQPHYSINMTAVQVGLDF 292
K+F+ CL GG+ +G V + ++ T PLV ++ Y I + +
Sbjct: 246 KIFSLCLS--ENGGVMTLGGVDDQLNLKIKNTTQLIWAPLVKSE-FYIIKVLDASFQENK 302
Query: 293 LNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPL----------VSKIISQQPDLKVHT 342
+ NK ++D+GTT++ L + V+ + ++K+ +++ T
Sbjct: 303 IEFK--------NKNFVLDTGTTISTLEKEVFNKIHKIFEGLCEDITKLSNEKKTSSKCT 354
Query: 343 VHDE--YTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFED--------LWCIGW 392
V + CF S P++ FEN + + Y+ + WC+G
Sbjct: 355 VDKKTGKMCF----SDISKLPSIVLTFENGSNFEWTSDSYMINRTNKRTVNDYSWWCLGI 410
Query: 393 QNS 395
++S
Sbjct: 411 ESS 413
>gi|115451209|ref|NP_001049205.1| Os03g0186900 [Oryza sativa Japonica Group]
gi|49532749|dbj|BAD26705.1| Radc1 [Oryza sativa Japonica Group]
gi|108706569|gb|ABF94364.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113547676|dbj|BAF11119.1| Os03g0186900 [Oryza sativa Japonica Group]
gi|215692805|dbj|BAG88249.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767626|dbj|BAG99854.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 438
Score = 84.7 bits (208), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 86/364 (23%), Positives = 138/364 (37%), Gaps = 52/364 (14%)
Query: 39 ERSLSLLKEHDAR----RQQRILAGVD-LPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQV 93
E ++L ++ DAR + AGV P+ P Y + G+G+P + + +
Sbjct: 40 ESIIALARDDDARLLFLSSKAATAGVSSAPVASGQAPPS---YVVRAGLGSPSQQLLLAL 96
Query: 94 DTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCT 153
DT +D W +C C CP S L+ +SS+ + C +C G
Sbjct: 97 DTSADATWAHCSPCGTCPSSS-------LFAPANSSSYASLPCSSSWCPLFQG------- 142
Query: 154 ANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSL----------IFGCGA 203
+CP + GD + Q + +L FGC +
Sbjct: 143 --QACPAPQGGGDAAPPPATLPTCAFSKPFADASFQAALASDTLRLGKDAIPNYTFGCVS 200
Query: 204 RQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDG-----INGGGIFA 258
+G T G++G G+ +++SQ S +F++CL +G
Sbjct: 201 SVTG---PTTNMPRQGLLGLGRGPMALLSQAGSL--YNGVFSYCLPSYRSYYFSGSLRLG 255
Query: 259 IGHVVQPEVNKTPLVPNQPH----YSINMTAVQVGLDFLNLPTDVFG--VGDNKGTIIDS 312
G V TP++ N PH Y +N+T + VG ++ +P F GT++DS
Sbjct: 256 AGGGQPRSVRYTPMLRN-PHRSSLYYVNVTGLSVGHAWVKVPAGSFAFDAATGAGTVVDS 314
Query: 313 GTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEY-TCFQYSESVDEGFPNVTFHFENSV 371
GT + VY L + Q +T + TCF E G P VT H + V
Sbjct: 315 GTVITRWTAPVYAALREEFRRQVAAPSGYTSLGAFDTCFNTDEVAAGGAPAVTVHMDGGV 374
Query: 372 SLKV 375
L +
Sbjct: 375 DLAL 378
>gi|242079451|ref|XP_002444494.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
gi|241940844|gb|EES13989.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
Length = 445
Score = 84.7 bits (208), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 97/340 (28%), Positives = 146/340 (42%), Gaps = 53/340 (15%)
Query: 69 RPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDS 128
RP G + + IGTPP+ + +DTGSD++W QCK R E LYD S
Sbjct: 82 RPFGRLHHTLTVSIGTPPQPRTLILDTGSDLIWT---QCKLFDTRQHR--EKPLYDPAKS 136
Query: 129 STGKFVTCDQEFCH-GVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGD 187
S+ CD C G + +C+ N C Y YG ++T G + + G+
Sbjct: 137 SSFAAAPCDGRLCETGSFN--TKNCSRN-KCIYTYNYGS-ATTKGELASETFTF----GE 188
Query: 188 LQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHC 247
+ S SL FGCG SG+L + GI+G S++SQL F++C
Sbjct: 189 HRRVSV--SLDFGCGKLTSGSLPGAS-----GILGISPDRLSLVSQLQI-----PRFSYC 236
Query: 248 ----LDGINGGGIF--AIGHVVQPE----VNKTPLVPNQP----HYSINMTAVQVGLDFL 293
LD IF A+ + + + T LV N +Y + + + VG L
Sbjct: 237 LTPFLDRNTTSHIFFGAMADLSKYRTTGPIQTTSLVTNPDGSNYYYYVPLIGISVGTKRL 296
Query: 294 NLPTDVFGVGDN--KGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHD---EYT 348
N+P F +G + GT +DSG T LP +V E L ++ + L V D EY
Sbjct: 297 NVPVSSFAIGRDGSGGTFVDSGDTTGMLPSVVMEALKEAMV-EAVKLPVVNATDHGYEYE 355
Query: 349 -CFQYSE----SVDEG--FPNVTFHFENSVSLKVYPHEYL 381
CFQ +V+ P + +HF+ ++ + Y+
Sbjct: 356 LCFQLPRNGGGAVETAVQVPPLVYHFDGGAAMLLRRDSYM 395
>gi|357118076|ref|XP_003560785.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 477
Score = 84.7 bits (208), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 90/301 (29%), Positives = 134/301 (44%), Gaps = 40/301 (13%)
Query: 80 IGIGTPPKDYYVQVDTGSDIMWVNCIQCK-ECPRRSSLGIELTLYDIKDSSTGKFVTCDQ 138
+G GTP + + +DTGSD+ W+ C C C R+ +D SS+ V C
Sbjct: 141 VGFGTPAQTAAIILDTGSDLSWIQCKPCSGHCYRQHDPD-----FDPAKSSSYAAVPCGT 195
Query: 139 EFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLI 198
C G T+C Y YGDGSSTTG +D + ++ ++S
Sbjct: 196 PVCAAAGG-----MCNGTTCLYGVQYGDGSSTTGVLSRDTLTFN-------SSSKFTGFT 243
Query: 199 FGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASS-GGVRKMFAHCLDGIN-GGGI 256
FGCG + G+ +DG++G G+ S+ SQ A S GGV F++CL N G
Sbjct: 244 FGCGEKNIGDFGE-----VDGLLGLGRGKLSLPSQAAPSFGGV---FSYCLPSYNTTPGY 295
Query: 257 FAIGHVVQP----EVNKTPLV--PNQPH-YSINMTAVQVGLDFLNLPTDVFGVGDNKGTI 309
IG +P V T ++ P P Y I + ++ +G L +P VF GT+
Sbjct: 296 LNIG-ATKPTSTVPVQYTAMIKKPQYPSFYFIELVSINIGGYILPVPPSVF---TKTGTL 351
Query: 310 IDSGTTLAYLPEMVYEPLVSKI-ISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFE 368
+DSGT L YLP Y L + + Q + TC+ ++ P V+F+F
Sbjct: 352 LDSGTILTYLPPPAYTSLRDRFKFTMQGNKPAPPYEPLDTCYDFTGQGAIVIPAVSFNFS 411
Query: 369 N 369
+
Sbjct: 412 D 412
>gi|449461377|ref|XP_004148418.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
gi|449518059|ref|XP_004166061.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 436
Score = 84.3 bits (207), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 85/322 (26%), Positives = 133/322 (41%), Gaps = 43/322 (13%)
Query: 60 VDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIE 119
V P+ + VG Y ++ +GTP + Y+ +DT +D W C C C
Sbjct: 79 VAAPIASGQQVLNVGNYVVRVQLGTPGQTMYMVLDTSNDAAWAPCSGCIGCS-------S 131
Query: 120 LTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDC--TANTSCPYLEIYGDGSSTTGYFVQD 177
T + ++SST + C + C G C T N C + + YG S+ + VQD
Sbjct: 132 TTTFSAQNSSTFATLDCSKPECTQARG---LSCPTTGNVDCLFNQTYGGDSTFSATLVQD 188
Query: 178 VVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASS 237
+ + + FGC + SG + G++G G+ S+ISQ S
Sbjct: 189 SLHLGP--------NVIPNFSFGCISSASG-----SSIPPQGLMGLGRGPLSLISQ---S 232
Query: 238 GGVRK-MFAHCLDGING---GGIFAIGHVVQPE-VNKTPLVPNQPH----YSINMTAVQV 288
G + +F++CL G +G V QP+ + TPL+ N PH Y +N+T + V
Sbjct: 233 GSLYSGLFSYCLPSFKSYYFSGSLKLGPVGQPKAIRTTPLLHN-PHRPSLYYVNLTGISV 291
Query: 289 GLDFLNLPTDVFGVGDN--KGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDE 346
G + + ++ N GTIIDSGT + +Y V +Q +
Sbjct: 292 GRVLVPISPELLAFDPNTGAGTIIDSGTVITRFVPAIYT-AVRDEFRKQVGGSFSPLGAF 350
Query: 347 YTCFQYSESVDEGFPNVTFHFE 368
TCF + V P +T H
Sbjct: 351 DTCFATNNEVSA--PAITLHLS 370
>gi|449464178|ref|XP_004149806.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 437
Score = 84.3 bits (207), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 84/302 (27%), Positives = 131/302 (43%), Gaps = 48/302 (15%)
Query: 42 LSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMW 101
LSL K + R+L+ V PL G+ P +G Y I IG + + +D+GSD+ W
Sbjct: 27 LSLRK----KNSDRLLSSVVFPLKGNVYP--LGYYSVSINIGKGDEAFEFDIDSGSDLTW 80
Query: 102 VNC-IQCKEC--PRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTD--C-TAN 155
V C C C PR LY +++ + C + C ++ P+T+ C +A+
Sbjct: 81 VQCDAPCTHCTKPREQ-------LYKPNNNA----LNCFEPLCTSLH--PITNHHCKSAD 127
Query: 156 TSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNL-DSTNE 214
C Y Y D S+ G V D V +G L + FGCG ++ DS+
Sbjct: 128 DQCQYEIEYADHGSSLGVLVNDHVPLKLTNGSLAAPR----IAFGCGYDHKYSVPDSSPP 183
Query: 215 EALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHVVQPEVNKTPLVP 274
A G++G G S ISQL+S G VR + HCL + GG G VP
Sbjct: 184 TA--GVLGLGNGEVSFISQLSSMGVVRNVVGHCLS--DEGGFLFFGD---------EFVP 230
Query: 275 NQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGT-----IIDSGTTLAYLPEMVYEPLVS 329
+ +M+ +G + + P +V+ G G + DSG++ Y Y +++
Sbjct: 231 SSGVTWTSMSHESIGSYYSSGPAEVYFSGKATGIKDLTLVFDSGSSYTYFNSQAYNSILA 290
Query: 330 KI 331
+
Sbjct: 291 LV 292
>gi|242079449|ref|XP_002444493.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
gi|241940843|gb|EES13988.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
Length = 449
Score = 84.3 bits (207), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 94/374 (25%), Positives = 158/374 (42%), Gaps = 71/374 (18%)
Query: 80 IGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELT-----LYDIKDSSTGKFV 134
+GIGTPP+ + VDTGSD++W QC RR+ + LY+ + SS+ ++
Sbjct: 88 VGIGTPPQPRTLIVDTGSDLIWT---QCSMLSRRTRTAASASRQREPLYEPRRSSSFAYL 144
Query: 135 TCDQEFCH-GVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQY---DKVSGDLQT 190
C C G + +C N C Y E+YG + G + + KVS L
Sbjct: 145 PCSDRLCQEGQFS--YKNCARNNRCMYDELYGSAEA-GGVLASETFTFGVNAKVSLPLG- 200
Query: 191 TSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDG 250
FGCGA +G+L + G++G S++SQL+ F++CL
Sbjct: 201 --------FGCGALSAGDLVGAS-----GLMGLSPGIMSLVSQLSV-----PRFSYCLTP 242
Query: 251 INGG-------GIFA-------IGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLP 296
G A G V + + P + +Y + + + +G L++P
Sbjct: 243 FAERKTSPLLFGAMADLRRYRTTGTVQTTSILRNPAM-ETAYYYVPLVGLSLGTKRLDVP 301
Query: 297 TDVFGV---GDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDE-YTCFQY 352
G+ + GTI+DSG+T++YL E + V K + + L V DE Y ++
Sbjct: 302 ATSLGMIKPDGSGGTIVDSGSTMSYLEETAFR-AVKKAVVEAVRLPVANGTDEDYDDYEL 360
Query: 353 SESVDEGF-------PNVTFHFENSVSLKVYPHEYLF--PFEDLWCI-------GWQNSG 396
++ G P + HF+ ++ + P + F P L C+ G+ S
Sbjct: 361 CFALPTGVAMEAVKTPPLVLHFDGGAAMTL-PRDNYFQEPRAGLMCLAVGTSPDGFGVSI 419
Query: 397 MQSRDRKNMTLLGD 410
+ + ++NM +L D
Sbjct: 420 IGNVQQQNMHVLFD 433
>gi|297822477|ref|XP_002879121.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297324960|gb|EFH55380.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 711
Score = 84.3 bits (207), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 88/303 (29%), Positives = 128/303 (42%), Gaps = 59/303 (19%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y K+ IGTPP + +DTGS+++W C+ C C + + ++D SST K
Sbjct: 65 YLMKLQIGTPPFEVEAVLDTGSELIWTQCLPCLHCYDQKA-----PIFDPSKSSTFKETR 119
Query: 136 CDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNG 195
C+ T + SCPY +Y D S T G + V SG
Sbjct: 120 CN---------------TPDHSCPYKLVYDDKSYTQGTLATETVTIHSTSG---VPFVMP 161
Query: 196 SLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGG 255
I GC SG S + GI+G + + S+ISQ+ GG A+ DG+
Sbjct: 162 ETIIGCSRNNSG---SGFRPSSSGIVGLSRGSLSLISQM---GG-----AYPGDGVVSTT 210
Query: 256 IFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVG---LDFLNLPTDVFGVGDNKGTIIDS 312
+FA KT + Y +N+ AV VG ++ + P N +IDS
Sbjct: 211 MFA----------KT---AKRGQYYLNLDAVSVGDTRIETVGTPFHAL----NGNIVIDS 253
Query: 313 GTTLAYLPEMVYEPLVSKIISQ--QPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENS 370
GT L Y P + Y LV K + + D V ++ C+ YS ++ E FP +T HF
Sbjct: 254 GTPLTYFP-VSYCNLVRKAVERVVTADRVVDPSRNDMLCY-YSNTI-EIFPVITVHFSGG 310
Query: 371 VSL 373
L
Sbjct: 311 ADL 313
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 85/312 (27%), Positives = 127/312 (40%), Gaps = 55/312 (17%)
Query: 75 LYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFV 134
+Y K+ +GTPP + +DTGS+I W C+ C C ++++ ++D SST K
Sbjct: 379 VYLMKLQVGTPPFEIEAVIDTGSEITWTQCLPCVHCYKQNA-----PIFDPSKSSTFK-- 431
Query: 135 TCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTN 194
++ CH + SCPY Y D + T G D V SG+ +
Sbjct: 432 ---EKRCH------------DHSCPYEVDYFDKTYTKGTLATDTVTIHSTSGEPFVMA-- 474
Query: 195 GSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDG---- 250
I GCG +S + +G +G S+I+Q+ G + ++C G
Sbjct: 475 -ETIIGCGRN-----NSWFRPSFEGFVGLNWGPLSLITQMG--GEYPGLMSYCFAGNGTS 526
Query: 251 -INGG--GIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVG---LDFLNLPTDVFGVGD 304
IN G I G VV + T P Y +N+ AV VG ++ L P
Sbjct: 527 KINFGTNAIVGGGGVVSTTMFVTTARPG--FYYLNLDAVSVGDTRIETLGTPFHAL---- 580
Query: 305 NKGTIIDSGTTLAYLPEM---VYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFP 361
+IDSGTTL Y PE + V ++ P ++ C Y + E FP
Sbjct: 581 EGNIVIDSGTTLTYFPESYCNLVRQAVEHVVPAVP--AADPTGNDLLC--YYSNTTEIFP 636
Query: 362 NVTFHFENSVSL 373
+T HF L
Sbjct: 637 VITMHFSGGADL 648
>gi|125600538|gb|EAZ40114.1| hypothetical protein OsJ_24557 [Oryza sativa Japonica Group]
Length = 412
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 78/300 (26%), Positives = 116/300 (38%), Gaps = 45/300 (15%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y I IGTPP +DTGSD++W C + P R LY S+T V+
Sbjct: 92 YLVDIAIGTPPLPLTAVLDTGSDLIWTQC----DAPCRRCFPQPAPLYAPARSATYANVS 147
Query: 136 CDQEFCHGVYGGPLTDCT-ANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTN 194
C C + P + C+ +T C Y YGDG+ST G + L + +
Sbjct: 148 CRSPMCQALQ-SPWSRCSPPDTGCAYYFSYGDGTSTDGVLATETFT-------LGSDTAV 199
Query: 195 GSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGG 254
+ FGCG G+ D+++ G++G G+ S++SQL GV + C
Sbjct: 200 RGVAFGCGTENLGSTDNSS-----GLVGMGRGPLSLVSQL----GVTRPRRSC------- 243
Query: 255 GIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGV---GDNKGTIID 311
P + + + VG L + VF + GD G IID
Sbjct: 244 -----------RARAAARGGGAPTTTSPLEGITVGDTLLPIDPAVFRLTPMGDG-GVIID 291
Query: 312 SGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYT-CFQYSESVDEGFPNVTFHFENS 370
SGTT L E + L + S+ H + CF + P + HF+ +
Sbjct: 292 SGTTFTALEERAFVALARALASRVRLPLASGAHLGLSLCFAAASPEAVEVPRLVLHFDGA 351
>gi|255565759|ref|XP_002523869.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223536957|gb|EEF38595.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 447
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 93/372 (25%), Positives = 152/372 (40%), Gaps = 57/372 (15%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQ---CKECPRRSSLGIELTLYDIKDSST 130
G Y + GTPP+ +DTGS +W C C C S ++ + K SS+
Sbjct: 75 GGYSISLSFGTPPQTLSFVMDTGSSFVWFPCTLRYLCNNC----SFTSRISPFLPKHSSS 130
Query: 131 GKFVTCDQEFCHGVYGGPL--TDCTANT-SC-----PYLEIYGDGSSTTGYFVQDVVQYD 182
K + C C ++ L TDC N+ +C PYL +YG G+ T G + + +
Sbjct: 131 SKIIGCKNPKCSWIHQTDLRCTDCDNNSRNCSQICPPYLILYGSGT-TGGVALSETLH-- 187
Query: 183 KVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRK 242
+ G + G +F +RQ GI GFG+ SS+ SQL +
Sbjct: 188 -LHGLIVPNFLVGCSVFS--SRQPA-----------GIAGFGRGPSSLPSQLGLTKFSYC 233
Query: 243 MFAHCLDGINGGGIFAI----------GHVVQPEVNKTPLVPNQP----HYSINMTAVQV 288
+ +H D + ++ + K P V ++P +Y +++ + +
Sbjct: 234 LLSHKFDDTQESSSLVLDSQSDSDKKTAALMYTPLVKNPKVQDKPAFSVYYYVSLRRISI 293
Query: 289 GLDFLNLPTDVFGVGD--NKGTIIDSGTTLAYLPEMVYEPLVSKIISQ----QPDLKVHT 342
G + +P N GTIIDSGTT Y+ +E L ++ ISQ + L V
Sbjct: 294 GGRSVKIPYKYLSPDKDGNGGTIIDSGTTFTYMSTEAFEILSNEFISQVKNYERALMVEA 353
Query: 343 VHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPF---EDLWCIGWQNSGMQS 399
+ CF S + + P + HF+ +++ P E F F ++ C G +
Sbjct: 354 LSGLKPCFNVSGAKELELPQLRLHFKGGADVEL-PLENYFAFLGSREVACFTVVTDGAEK 412
Query: 400 RDRKNMTLLGDF 411
M +LG+F
Sbjct: 413 ASGPGM-ILGNF 423
>gi|259490398|ref|NP_001159203.1| uncharacterized protein LOC100304289 [Zea mays]
gi|223942623|gb|ACN25395.1| unknown [Zea mays]
Length = 378
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 86/342 (25%), Positives = 140/342 (40%), Gaps = 29/342 (8%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y+ + +GTP + + + DTGSD+ WV C P E + +S +
Sbjct: 10 GTGQYFVRFRVGTPAQPFVLVADTGSDLTWVKCRGAAGPPASDPPARE---FRASESRSW 66
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTS-CPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQT 190
+ C + C L +C++ S C Y Y DGS+ G D +
Sbjct: 67 APLACSSDTCTSYVPFSLANCSSPASPCAYDYRYKDGSAARGVVGTDAATIALSGSGSED 126
Query: 191 TSTNGS-------LIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKM 243
S G ++ GC A D + ++ DG++ G SN S S+ A+ G R
Sbjct: 127 GSGGGGRRAKLQGVVLGCTA----TYDGQSFQSSDGVLSLGNSNISFASRAAARFGGR-- 180
Query: 244 FAHCL-------DGINGGGIFAIGHVVQPEVNKTPLVPNQ---PHYSINMTAVQVGLDFL 293
F++CL + + +TPLV ++ P Y++ + AV V + L
Sbjct: 181 FSYCLVDHLAPRNASSYLTFGPGPEGGGAPAARTPLVLDRRVSPFYAVAVDAVYVAGEAL 240
Query: 294 NLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYS 353
++P DV+ VG G I+DSGT+L L Y +V+ + + L + C+ ++
Sbjct: 241 DIPADVWDVGRGGGAILDSGTSLTVLATPAYRAVVAALGGRLAALPRVAMDPFEYCYNWT 300
Query: 354 ESVDEGFPNVTFHFENSVSLKVYPHEYLF-PFEDLWCIGWQN 394
E P + F S L+ Y+ + CIG Q
Sbjct: 301 AGAPE-IPKLEVSFAGSARLEPPAKSYVIDAAPGVKCIGVQE 341
>gi|6562285|emb|CAB62655.1| putative protein [Arabidopsis thaliana]
Length = 519
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 87/305 (28%), Positives = 136/305 (44%), Gaps = 33/305 (10%)
Query: 75 LYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRR-SSLGIE----LTLYDIKDSS 129
L+YA + +GTP + V +DTGSD+ W+ C C R +G+ L LY SS
Sbjct: 101 LHYANVSVGTPATWFLVALDTGSDLFWLPCNCGSTCIRDLKEVGLSQSRPLNLYSPNTSS 160
Query: 130 TGKFVTCDQEFCHGVYGGPLTDCTANTSCPY-LEIYGDGSSTTGYFVQDVVQYDKVSGDL 188
T + C + C G + +SCPY ++ + TTG +DV+ V+ D
Sbjct: 161 TSSSIRCSDDRCFGSS----RCSSPASSCPYQIQYLSKDTFTTGTLFEDVLHL--VTEDE 214
Query: 189 QTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL 248
++ GCG Q+G L S+ A++G++G G + S+ S LA + F+ C
Sbjct: 215 GLEPVKANITLGCGKNQTGFLQSS--AAVNGLLGLGLKDYSVPSILAKAKITANSFSMCF 272
Query: 249 -DGINGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKG 307
+ I+ G + G + +TPL+P +P ++T V VG D VG
Sbjct: 273 GNIIDVVGRISFGDKGYTDQMETPLLPTEP----SVTEVSVGGD---------AVGVQLL 319
Query: 308 TIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTC-FQYSESVDEG---FPNV 363
+ D+GT+ +L E Y L++K K + E F Y S ++ FP V
Sbjct: 320 ALFDTGTSFTHLLEPEYG-LITKAFDDHVTDKRRPIDPELPFEFCYDLSPNKTTILFPRV 378
Query: 364 TFHFE 368
FE
Sbjct: 379 AMTFE 383
>gi|357154085|ref|XP_003576664.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 509
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 84/330 (25%), Positives = 131/330 (39%), Gaps = 53/330 (16%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y +G+GTP +D V DTGSD+ WV QC C + L+ DSST
Sbjct: 150 GTGNYVVSVGLGTPARDLTVVFDTGSDLSWV---QCGPCSSGGCYKQQDPLFAPSDSSTF 206
Query: 132 KFVTCDQEFCHGVY---GGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDL 188
V C C G P D CPY +YGD S T G+ D + ++
Sbjct: 207 SAVRCGARECRARQSCGGSPGDD-----RCPYEVVYGDKSRTQGHLGNDTLTLGTMAPAN 261
Query: 189 QTTSTNGSL---IFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFA 245
+ + L +FGCG +G DG+ G G+ S+ SQ A G + F+
Sbjct: 262 ASAENDNKLPGFVFGCGENNTGLFGQA-----DGLFGLGRGKVSLSSQAAGKFG--EGFS 314
Query: 246 HCLD--GINGGGIFAIGHVV--------QPEVNKTPLVPNQPHYSINMTAVQVGLDFLNL 295
+CL + G ++G V P +N+T P+ Y + + ++V + +
Sbjct: 315 YCLPSSSSSAPGYLSLGTPVPAPAHAQFTPMLNRT-TTPS--FYYVKLVGIRVAGRAIRV 371
Query: 296 PTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIIS--------QQPDLKVHTVHDEY 347
+ + I+DSGT + L Y L + +S + P L +
Sbjct: 372 SSPRVAL----PLIVDSGTVITRLAPRAYRALRAAFLSAMGKYGYKRAPRLSILD----- 422
Query: 348 TCFQYSESVDE--GFPNVTFHFENSVSLKV 375
TC+ ++ + P V F ++ V
Sbjct: 423 TCYDFTAHANATVSIPAVALVFAGGATISV 452
>gi|242084330|ref|XP_002442590.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
gi|241943283|gb|EES16428.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
Length = 494
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 97/357 (27%), Positives = 144/357 (40%), Gaps = 43/357 (12%)
Query: 68 SRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKD 127
SR G Y AKI +GTP + +DT SD+ W+ C C+ C +S ++D +
Sbjct: 126 SRAPTSGEYMAKIAVGTPAVQALLALDTASDLTWLQCQPCRRCYPQSG-----PVFDPRH 180
Query: 128 SSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYD-KVSG 186
S++ + D C + D T C Y YGDG +T V D+V+ +G
Sbjct: 181 STSYGEMNYDAPDCQALGRSGGGDAKRGT-CIYTVQYGDGHGSTSTSVGDLVEETLTFAG 239
Query: 187 DLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAH 246
++ L GCG G + GI+G G+ S+ Q+A G F++
Sbjct: 240 GVR----QAYLSIGCGHDNKGLFGAPAA----GILGLGRGQISIPHQIAFLG-YNASFSY 290
Query: 247 CL-DGINGGG------IFAIGHV-VQPEVNKTPLVPNQ---PHYSINMTAVQVG------ 289
CL D I+G G F G V P + TP V NQ Y + + V VG
Sbjct: 291 CLVDFISGPGSPSSTLTFGAGAVDTSPPASFTPTVLNQNMPTFYYVRLIGVSVGGVRVPG 350
Query: 290 LDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDL-KVHT-----V 343
+ +L D + G I+DSGTT+ L Y + L +V T +
Sbjct: 351 VTERDLQLDPYT--GRGGVILDSGTTVTRLARPAYVAFRDAFRAAATSLGQVSTGGPSGL 408
Query: 344 HDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFEDLWCIGWQNSGMQSR 400
D TC+ P V+ HF V + + P YL P + + + +G R
Sbjct: 409 FD--TCYTVGGRAGVKVPAVSMHFAGGVEVSLQPKNYLIPVDSRGTVCFAFAGTGDR 463
>gi|168014386|ref|XP_001759733.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689272|gb|EDQ75645.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 392
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 79/288 (27%), Positives = 117/288 (40%), Gaps = 47/288 (16%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y+ +GTP + +++ VDTGSD+ +V C C C + LY +SST
Sbjct: 30 GSGQYFVDFSLGTPEQKFHLIVDTGSDLAFVQCAPCDLCYEQDG-----PLYQPSNSSTF 84
Query: 132 KFVTCDQEFC---HGVYGGPLT----DCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKV 184
V CD C G P + + +C Y YGD SST G F + +
Sbjct: 85 TPVPCDSAECLLIPAPVGAPCSSSYPESPPQGACSYEYRYGDNSSTVGVFAYETATVGGI 144
Query: 185 SGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMF 244
+ + FGCG R G+ S G++G G+ S SQ + F
Sbjct: 145 RVN--------HVAFGCGNRNQGSFVSAG-----GVLGLGQGALSFTSQAGYA--FENKF 189
Query: 245 AHCLDG-----------INGGGIFAIGHVVQPEVNKTPLVPNQPH---YSINMTAVQVGL 290
A+CL I G + + H +Q TPLV N + Y + + + G
Sbjct: 190 AYCLTSYLSPTSVFSSLIFGDDMMSTIHDLQ----FTPLVSNPLNPSVYYVQIVRICFGG 245
Query: 291 DFLNLPTDVFGVGD--NKGTIIDSGTTLAYLPEMVYEPLVSKIISQQP 336
+ L +P + + N GTI DSGTT+ Y Y +++ P
Sbjct: 246 ETLLIPDSAWKIDSVGNGGTIFDSGTTVTYWSPQAYARIIAAFEKSVP 293
>gi|414881704|tpg|DAA58835.1| TPA: hypothetical protein ZEAMMB73_701358 [Zea mays]
Length = 485
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 96/396 (24%), Positives = 152/396 (38%), Gaps = 59/396 (14%)
Query: 31 VKYRYAGRERSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYY 90
+K+R +R + + E GV P+ S G G Y+ KIG+GTP
Sbjct: 85 LKHRLQRDKRRAARISEAAGAGGGNGRKGVAAPVV-SGLAQGSGEYFTKIGVGTPATQAL 143
Query: 91 VQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLT 150
+ +DTGSD++WV C C+ C +S ++D + SS+ V C C + G
Sbjct: 144 MVLDTGSDVVWVQCAPCRRCYEQSG-----PVFDPRRSSSYGAVGCGAALCRRLDSG--- 195
Query: 151 DCTANT-SCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNL 209
C +C Y YGDGS T G FV + + + +G + + GCG G
Sbjct: 196 GCDLRRGACMYQVAYGDGSVTAGDFVTETLTF---AGGARVA----RVALGCGHDNEGLF 248
Query: 210 DSTNEEALDGIIG----------FGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAI 259
+ G G +G+S S + SSG +H ++ F
Sbjct: 249 VAAAGLLGLGRGGLSFPTQISRRYGRSFSYCLVDRTSSGAGAAPGSHRSSTVS----FGA 304
Query: 260 GHVVQPEVNKTPLVPN---QPHYSINMTAVQVGLDFLNLPTDVFGVGDNK---------- 306
G V + TP+V N + Y + + + VG V GV ++
Sbjct: 305 GSVGASSASFTPMVRNPRMETFYYVQLVGISVG------GARVPGVAESDLRLDPSTGRG 358
Query: 307 GTIIDSGTTLAYLPEMVYEPLVSKIISQQP-DLKVH----TVHDEYTCFQYSESVDEGFP 361
G I+DSGT++ L Y L + L++ ++ D TC+ P
Sbjct: 359 GVIVDSGTSVTRLARASYSALRDAFRAAAAGGLRLSPGGFSLFD--TCYDLGGRRVVKVP 416
Query: 362 NVTFHFENSVSLKVYPHEYLFPFED--LWCIGWQNS 395
V+ HF + P YL P + +C + +
Sbjct: 417 TVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGT 452
>gi|449529533|ref|XP_004171754.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 437
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 84/302 (27%), Positives = 131/302 (43%), Gaps = 48/302 (15%)
Query: 42 LSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMW 101
LSL K + R+L+ V PL G+ P +G Y I IG + + +D+GSD+ W
Sbjct: 27 LSLRK----KNSDRLLSSVVFPLKGNVYP--LGYYSVSINIGKGDEAFEFDIDSGSDLTW 80
Query: 102 VNC-IQCKEC--PRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTD--C-TAN 155
V C C C PR LY +++ + C + C ++ P+T+ C +A+
Sbjct: 81 VQCDAPCTHCTKPREQ-------LYKPNNNA----LNCFEPLCTSLH--PITNHHCKSAD 127
Query: 156 TSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNL-DSTNE 214
C Y Y D S+ G V D V +G L + FGCG ++ DS+
Sbjct: 128 DQCQYEIEYADHGSSLGVLVNDHVPLKLTNGSLAAPR----IAFGCGYDHKYSVPDSSPP 183
Query: 215 EALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHVVQPEVNKTPLVP 274
A G++G G S ISQL+S G VR + HCL + GG G VP
Sbjct: 184 TA--GVLGLGNGEVSFISQLSSMGVVRNVVGHCLS--DEGGFLFFGD---------EFVP 230
Query: 275 NQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGT-----IIDSGTTLAYLPEMVYEPLVS 329
+ +M+ +G + + P +V+ G G + DSG++ Y Y +++
Sbjct: 231 SSGVTWTSMSHESIGSYYSSGPAEVYFGGKATGIKDLTLVFDSGSSYTYFNSQAYNSILA 290
Query: 330 KI 331
+
Sbjct: 291 LV 292
>gi|79315693|ref|NP_001030891.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332646353|gb|AEE79874.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 499
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 85/342 (24%), Positives = 140/342 (40%), Gaps = 71/342 (20%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y+ + +G+PPK + + +DTGSD+ W+ C+ C +C +++
Sbjct: 166 GSGEYFMDVLVGSPPKHFSLILDTGSDLNWIQCLPCYDCFQQND---------------- 209
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
N SCPY YGD S+TTG F + + + +
Sbjct: 210 -----------------------NQSCPYYYWYGDSSNTTGDFAVETFTVNLTTNGGSSE 246
Query: 192 STN-GSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL-- 248
N +++FGCG G G + S SQL S G F++CL
Sbjct: 247 LYNVENMMFGCGHWNRGLFHGAAGLLGLG-----RGPLSFSSQLQSLYG--HSFSYCLVD 299
Query: 249 ----DGINGGGIFAIGH--VVQPEVNKTPLVPNQPH-----YSINMTAVQVGLDFLNLPT 297
++ IF + P +N T V + + Y + + ++ V + LN+P
Sbjct: 300 RNSDTNVSSKLIFGEDKDLLSHPNLNFTSFVAGKENLVDTFYYVQIKSILVAGEVLNIPE 359
Query: 298 DVFGVGDNK--GTIIDSGTTLAYLPEMVYEPLVSKIISQQ----PDLKVHTVHDEYTCFQ 351
+ + + + GTIIDSGTTL+Y E YE + +KI + P + + D CF
Sbjct: 360 ETWNISSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRDFPILDP--CFN 417
Query: 352 YSESVDEGFPNVTFHFENSVSLKVYPHE--YLFPFEDLWCIG 391
S + P + F + +P E +++ EDL C+
Sbjct: 418 VSGIHNVQLPELGIAFADGAVWN-FPTENSFIWLNEDLVCLA 458
>gi|356498789|ref|XP_003518231.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 446
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 82/315 (26%), Positives = 135/315 (42%), Gaps = 39/315 (12%)
Query: 75 LYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFV 134
++ IG PP +DTGS + WV C C C ++S + ++D SST +
Sbjct: 92 VFLMNFSIGEPPIPQLAVMDTGSSLTWVMCHPCSSCSQQS-----VPIFDPSKSSTYSNL 146
Query: 135 TCDQEFCHGVYGGPLTDC-TANTSCPY-LEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTS 192
+C + C+ C N CPY +E G GSS G + ++ + + + +
Sbjct: 147 SCSE--CN--------KCDVVNGECPYSVEYVGSGSS-QGIYAREQLTLETIDESIIKVP 195
Query: 193 TNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGIN 252
SLIFGCG + S + + + ++G+ G G S++ K F++C+ +
Sbjct: 196 ---SLIFGCGRKFSISSNGYPYQGINGVFGLGSGRFSLLPSFG------KKFSYCIGNLR 246
Query: 253 GGGI----FAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFG---VGDN 305
+G + + T L Y +N+ A+ +G L++ +F +N
Sbjct: 247 NTNYKFNRLVLGDKANMQGDSTTLNVINGLYYVNLEAISIGGRKLDIDPTLFERSITDNN 306
Query: 306 KGTIIDSGTTLAYLPEMVYEPL---VSKIISQQPDLKVHTVHDEYT-CFQYSESVD-EGF 360
G IIDSG +L + +E L V ++ L H+ YT C+ S D GF
Sbjct: 307 SGVIIDSGADHTWLTKYGFEVLSFEVENLLEGVLVLAQQDKHNPYTLCYSGVVSQDLSGF 366
Query: 361 PNVTFHFENSVSLKV 375
P VTFHF L +
Sbjct: 367 PLVTFHFAEGAVLDL 381
>gi|125586059|gb|EAZ26723.1| hypothetical protein OsJ_10631 [Oryza sativa Japonica Group]
Length = 339
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 91/325 (28%), Positives = 136/325 (41%), Gaps = 55/325 (16%)
Query: 82 IGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFC 141
+GTPP ++++ G++++W + EC ++ E + S F +C
Sbjct: 1 MGTPPNPVKLKLENGNELIWNHSNPSPECFEQAFPYFEPLTF----SRGLPFASC----- 51
Query: 142 HGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGC 201
G P N +C Y YGD S TTG+ ++ DK + S G + FGC
Sbjct: 52 ----GSP--KFWPNQTCVYTYSYGDKSVTTGF-----LEVDKFTFVGAGASVPG-VAFGC 99
Query: 202 GARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGG------- 254
G +G S NE GI GFG+ S+ SQL F+HC I G
Sbjct: 100 GLFNNGVFKS-NET---GIAGFGRGPLSLPSQLKVGN-----FSHCFTTITGAIPSTVLL 150
Query: 255 ----GIFAIGHVVQPEVNKTPLV------PNQPHYSINMTAVQVGLDFLNLPTDVFGVGD 304
+F+ G Q V TPL+ N Y +++ + VG L +P F + +
Sbjct: 151 DLPADLFSNG---QGAVQTTPLIQYAKNEANPTLYYLSLKGITVGSTRLPVPESAFALTN 207
Query: 305 NKG-TIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKV--HTVHDEYTCFQYSESVDEGFP 361
G TIIDSGT++ LP VY+ +V + Q L V YTCF P
Sbjct: 208 GTGGTIIDSGTSITSLPPQVYQ-VVRDEFAAQIKLPVVPGNATGHYTCFSAPSQAKPDVP 266
Query: 362 NVTFHFENSVSLKVYPHEYLFPFED 386
+ HFE + ++ + Y+F D
Sbjct: 267 KLVLHFEGA-TMDLPRENYVFEVPD 290
>gi|356554625|ref|XP_003545645.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 452
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 89/354 (25%), Positives = 147/354 (41%), Gaps = 42/354 (11%)
Query: 46 KEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNC- 104
K+ + R+ + + G+ P +G Y + IG PPK Y + +D+GSD+ WV C
Sbjct: 36 KKLSSDNHHRLSSSAVFKVQGNVYP--LGHYTVSLNIGYPPKLYDLDIDSGSDLTWVQCD 93
Query: 105 IQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTA-NTSCPYLEI 163
CK C + LY V C + C V C + + C Y
Sbjct: 94 APCKGCTKPRD-----QLY----KPNHNLVQCVDQLCSEVQLSMEYTCASPDDQCDYEVE 144
Query: 164 YGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGF 223
Y D S+ G V+D + + +G + + FGCG Q + S + A G++G
Sbjct: 145 YADHGSSLGVLVRDYIPFQFTNGSV----VRPRVAFGCGYDQKYS-GSNSPPATSGVLGL 199
Query: 224 GKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHVVQPE--VNKTPLVP--NQPHY 279
G +S++SQL S G + + HCL GGG G P + T ++P ++ HY
Sbjct: 200 GNGRASILSQLHSLGLIHNVVGHCLSA-RGGGFLFFGDDFIPSSGIVWTSMLPSSSEKHY 258
Query: 280 SINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLK 339
S + L F T V G+ I DSG++ Y Y+ +V + +
Sbjct: 259 S----SGPAELVFNGKATVVKGL----ELIFDSGSSYTYFNSQAYQAVVDLVTQDLKGKQ 310
Query: 340 VHTVHDEYT---CFQYSES------VDEGFPNVTFHFENSVSLKVY--PHEYLF 382
+ D+ + C++ ++S V + F + F + L+++ P YL
Sbjct: 311 LKRATDDPSLPICWKGAKSFKSLSDVKKYFKPLALSFTKTKILQMHLPPEAYLI 364
>gi|357469587|ref|XP_003605078.1| Aspartic proteinase Asp1 [Medicago truncatula]
gi|355506133|gb|AES87275.1| Aspartic proteinase Asp1 [Medicago truncatula]
Length = 418
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 93/368 (25%), Positives = 146/368 (39%), Gaps = 52/368 (14%)
Query: 28 VFSVKYRYAGRERSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPK 87
+F + A + S+ H +++ + + G+ PDG+ Y I IG PP
Sbjct: 22 IFPHHFSAANKNNSIPPTSIHS------LISSLVYTIKGNVYPDGI--YTVSINIGNPPN 73
Query: 88 DYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGG 147
Y + +DTGSD+ WV C + P G L + + + V C C V
Sbjct: 74 PYELDIDTGSDLTWVQC----DGPDAPCKGCTLPKDKLYKPNGNQLVKCSDPICAAVQ-P 128
Query: 148 PLT----DCTAN-TSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCG 202
P + C C Y Y D + +TG +D + SG S ++FGCG
Sbjct: 129 PFSTFGQKCAKPIPPCVYKVEYADNAESTGALARDYMHIGSPSG-----SNVPLVVFGCG 183
Query: 203 ARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGGIFAIGHV 262
Q + + ++G G S++SQL S G + + HCL GGG +G
Sbjct: 184 YEQKFSGPTPPPSTPG-VLGLGNGKISILSQLHSMGFIHNVLGHCLSA-EGGGYLFLGDK 241
Query: 263 VQPE--VNKTPLVPN--QPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAY 318
P + TP++ + + HYS V L F PT G+ I DSG++ Y
Sbjct: 242 FIPSSGIFWTPIIQSSLEKHYSTG----PVDLFFNGKPTPAKGL----QIIFDSGSSYTY 293
Query: 319 LPEMVYEPLVSKIISQQPDLKVHTVHDE------------YTCFQYSESVDEGFPNVTFH 366
VY +V+ +++ DLK + E F+ V+ F +T
Sbjct: 294 FSPRVYT-IVANMVNN--DLKGKPLRRETKDPSLPICWKGVKPFKSLNEVNNYFKPLTLS 350
Query: 367 FENSVSLK 374
F S +L+
Sbjct: 351 FTKSKNLQ 358
>gi|359492937|ref|XP_002283889.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 439
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 86/322 (26%), Positives = 134/322 (41%), Gaps = 44/322 (13%)
Query: 59 GVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGI 118
+P+ + +G Y ++ +GTP + ++ +DT D WV C C C +
Sbjct: 82 ATSVPIASGQQVLNIGNYVVRVKLGTPGQLMFMVLDTSRDAAWVPCADCAGCSSPT---- 137
Query: 119 ELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDC--TANTSCPYLEIYGDGSSTTGYFVQ 176
+ SST + C C V G C T +C + + YG SS + Q
Sbjct: 138 ----FSPNTSSTYASLQCSVPQCTQVRG---LSCPTTGTAACFFNQTYGGDSSFSAMLSQ 190
Query: 177 DVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLAS 236
D + T S FGC SG + G++G G+ S++SQ
Sbjct: 191 DSLGL--------AVDTLPSYSFGCVNAVSG-----STLPPQGLLGLGRGPMSLLSQ--- 234
Query: 237 SGGVRK-MFAHCLDGING---GGIFAIGHVVQPE-VNKTPLVPNQPH----YSINMTAVQ 287
SG + +F++C G +G + QP+ + TPL+ N PH Y +N+T V
Sbjct: 235 SGSLYSGVFSYCFPSFKSYYFSGSLRLGPLGQPKNIRTTPLLRN-PHRPTLYYVNLTGVS 293
Query: 288 VGLDFLNLPTDVFGVGDN--KGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHD 345
VG + + ++ N GTIIDSGT + E VY + + +Q T+
Sbjct: 294 VGRVLVPVAPELLAFDPNTGAGTIIDSGTVITRFVEPVYAAIRDE-FRKQVKGPFATIGA 352
Query: 346 EYTCFQYSESVDEGFPNVTFHF 367
TCF + ++ P VTFHF
Sbjct: 353 FDTCFAATN--EDIAPPVTFHF 372
>gi|413925432|gb|AFW65364.1| hypothetical protein ZEAMMB73_378208 [Zea mays]
Length = 418
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 100/368 (27%), Positives = 148/368 (40%), Gaps = 59/368 (16%)
Query: 24 SNHGVFSVKYRYAGRERSLSLLKE-HDARRQQRILAG-VDLPLGGSSRP-----DGVGLY 76
S H VF + E +++ + H +R + ILA + GS++ G G Y
Sbjct: 24 SQHQVF--RATMTRHEPTINFTRAAHRSRERLSILATRLGAASAGSAQSPLQMDSGGGAY 81
Query: 77 YAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTC 136
+GTPP+ DTGSD++W C CK C R S + Y K SS K + C
Sbjct: 82 DMTFSMGTPPQTLSALADTGSDLIWAKCGACKRCAPRGS----ASYYPTKSSSFSK-LPC 136
Query: 137 DQEFCHGVYGGPLTDCTANTS----CPYLEIYGDGSS----TTGYFVQDVVQY--DKVSG 186
C + L C + C Y YG S+ T GY + D V G
Sbjct: 137 SSALCRTLESQSLATCGGTRARGAVCSYRYSYGLSSNPHHYTQGYMGSETFTLGSDAVQG 196
Query: 187 DLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAH 246
+ FGC + + G++G G+ S++ QL F++
Sbjct: 197 ----------IGFGC-----TTMSEGGYGSGSGLVGLGRGKLSLVRQLKVGA-----FSY 236
Query: 247 CLD---GINGGGIFAIGHVVQPEVNKTPLV--PNQPHYSINMTAVQVGLDFLNLPTDVFG 301
CL + +F G + P V TPLV Y++N+ ++ +G G
Sbjct: 237 CLTSDPSTSSPLLFGAGALTGPGVQSTPLVNLKTSTFYTVNLDSISIG------AAKTPG 290
Query: 302 VGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEY-TCFQYSESVDEGF 360
G + G I DSGTTL +L E Y + ++SQ +L D Y CFQ S F
Sbjct: 291 TGRH-GIIFDSGTTLTFLAEPAYTLAEAGLLSQTTNLTRVPGTDGYEVCFQTSGGAV--F 347
Query: 361 PNVTFHFE 368
P++ HF+
Sbjct: 348 PSMVLHFD 355
>gi|242063796|ref|XP_002453187.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
gi|241933018|gb|EES06163.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
Length = 493
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 86/308 (27%), Positives = 130/308 (42%), Gaps = 31/308 (10%)
Query: 76 YYAKIGIGTPP-KDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFV 134
Y + +G+PP K + +DTGSDI WV +CK C ++ ++ L+D SST
Sbjct: 140 YVITVRLGSPPGKSQTMLIDTGSDISWV---RCKPCWQQCRPQVD-PLFDPSLSSTYSPF 195
Query: 135 TCDQEFCHGVYG-GPLTDCTANTSCPYLEIYGDGS-STTGYFVQDVVQYDKVSGDLQTTS 192
+C C ++ G C+++ C Y+ +YGDGS TTG + D + G T
Sbjct: 196 SCSSAACAQLFQEGNANGCSSSGQCQYIAMYGDGSVGTTGTYSSDTLAL----GSNSNTV 251
Query: 193 TNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI- 251
FGC ++G T G S++SQ A + G F++CL
Sbjct: 252 VVSKFRFGCSHAETGITGLTAGLMGL-----GGGAQSLVSQTAGTFGT-TAFSYCLPPTP 305
Query: 252 NGGGIFAIGHVVQPEVN--KTPLVPNQ---PHYSINMTAVQVGLDFLNLPTDVFGVGDNK 306
+ G +G KTP++ + Y + + A++VG L++PT VF +
Sbjct: 306 SSSGFLTLGAAGTSSAGFVKTPMLRSSQVPAFYGVRLEAIRVGGRQLSIPTTVF----SA 361
Query: 307 GTIIDSGTTLAYLPEMVYEPLVSKI---ISQQPDLKVHTVHDEY-TCFQYSESVDEGFPN 362
G I+DSGT + LP Y L S + Q P TCF S P
Sbjct: 362 GMIMDSGTVVTRLPPTAYSSLSSAFKAGMKQYPPAPSSAGGGFLDTCFDMSGQSSVSMPT 421
Query: 363 VTFHFENS 370
V F +
Sbjct: 422 VALVFSGA 429
>gi|325183199|emb|CCA17657.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 873
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 89/344 (25%), Positives = 144/344 (41%), Gaps = 39/344 (11%)
Query: 34 RYAGRERSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGL--YYAKIGIGTPPKDYYV 91
R R+L LK R+ +++ PL + P GVGL +YA++ IG PP+ V
Sbjct: 2 RIPSASRNLEPLKIELKRKTRQLKNQTSPPLVYNDAPLGVGLGTHYAELYIGIPPQRASV 61
Query: 92 QVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCD-QEFCHGVYGGPLT 150
+DTGS + C +C +C + +D S++ FV C +E C
Sbjct: 62 ILDTGSGLTAFPCDKCVDCGTHTD-----PKFDATKSTSINFVQCKYEEGC--------- 107
Query: 151 DCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNG---SLIFGCGARQSG 207
D + C + Y +GS +QD++ V D FGC R++G
Sbjct: 108 DTCRDNLCVIHQRYSEGSMWEAVVMQDLIWVGNVDSDRAEMIMRRYGIRFKFGCQTRETG 167
Query: 208 NLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRK-MFAHCLDGINGGGIFAIGHV---- 262
+ E +GI+G G +++ +++ + V + FA C GG F IG V
Sbjct: 168 LFITQVE---NGIMGLGIGRNNIATEMYKAKRVEEHKFALCFG--QKGGSFVIGGVDYSH 222
Query: 263 VQPEVNKTPLVPN-QPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPE 321
++ TPL + +Y I + V++G L + + F G +G I+DSGTT Y P
Sbjct: 223 HTTKIAYTPLAKHGTSNYPIEVKDVRIGGISLQVDAEHFKSG--RGAIVDSGTTDTYFPS 280
Query: 322 MVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTF 365
P Q+ ++ V + + E PNV+
Sbjct: 281 AAATPF------QEAFKRITGVEYNENKMNLTPEMVETLPNVSL 318
>gi|222629809|gb|EEE61941.1| hypothetical protein OsJ_16693 [Oryza sativa Japonica Group]
Length = 648
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 85/296 (28%), Positives = 128/296 (43%), Gaps = 61/296 (20%)
Query: 70 PDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCI---QCKECPRRSSLGIELTLYDIK 126
P G Y + +GTPP+ V +DTGS + WV C QC+ C S+ L ++ K
Sbjct: 83 PHSYGGYAFTVSLGTPPQPLPVLLDTGSHLSWVPCTSSYQCRNCSSLSAAS-PLHVFHPK 141
Query: 127 DSSTGKFVTCDQEFCHGVYGGP-LTDCTANTSC-----------------PYLEIYGDGS 168
+SS+ + + C C ++ L+DC A +SC PYL +YG G
Sbjct: 142 NSSSSRLIGCRNPSCLWIHSPDHLSDCRAASSCPGANCTPRNANANNVCPPYLVVYGSG- 200
Query: 169 STTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNS 228
ST G + D ++ + + GC +L S ++ G+ GFG+
Sbjct: 201 STAGLLISDTLR--------TPGRAVRNFVIGC------SLASVHQPP-SGLAGFGRGAP 245
Query: 229 SMISQLASSGGVRKMFAHCL--------DGINGGGIF--AIGHVVQPEVNKTPLV----- 273
S+ SQL G+ K F++CL ++G I A G + PL
Sbjct: 246 SVPSQL----GLTK-FSYCLLSRRFDDNAAVSGELILGGAGGKDGGVGMQYAPLARSASA 300
Query: 274 --PNQPHYSINMTAVQVGLDFLNLPTDVF-GVGDNKGTIIDSGTTLAYLPEMVYEP 326
P +Y + +TA+ VG + LP F G G I+DSGTT +Y V+EP
Sbjct: 301 RPPYSVYYYLALTAITVGGKSVQLPERAFVAGGAGGGAIVDSGTTFSYFDRTVFEP 356
>gi|449529638|ref|XP_004171805.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like, partial
[Cucumis sativus]
Length = 384
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 88/331 (26%), Positives = 128/331 (38%), Gaps = 45/331 (13%)
Query: 71 DGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSST 130
G G Y+ +IG+GTPPK Y+ +DTGSDI+W+ C CK C ++ + +K S
Sbjct: 37 QGSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQT----DPVFNPVKSGSF 92
Query: 131 GKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQT 190
K V C C + C +C Y YGDGS TTG FV + + + + +
Sbjct: 93 AK-VLCRTPLCRRLES---PGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVE--- 145
Query: 191 TSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDG 250
+ GCG G G+ S SQ + + F++CL
Sbjct: 146 -----QVALGCGHDNEGLFVGAAGLLGL-----GRGGLSFPSQAGRT--FNQKFSYCLVD 193
Query: 251 INGGG-----IFAIGHVVQPEVNKTPLVPN---QPHYSINMTAVQV------GLDFLNLP 296
+ +F V TPL+ N Y + + + V G+ +
Sbjct: 194 RSASSKPSSVVFG-NSAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSGITASHFK 252
Query: 297 TDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEY-TCFQYSES 355
D G N G IID GT++ L + Y L + LK + TC+ S
Sbjct: 253 LDRTG---NGGVIIDCGTSVTRLNKPAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGK 309
Query: 356 VDEGFPNVTFHFENS-VSLKVYPHEYLFPFE 385
P V HF + VSL YL P +
Sbjct: 310 TTVKVPTVVLHFRGADVSLPA--SNYLIPVD 338
>gi|255563741|ref|XP_002522872.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223537956|gb|EEF39570.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 448
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 85/344 (24%), Positives = 143/344 (41%), Gaps = 39/344 (11%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y K+ IG+P Y+ DTGS + W QC+ C RR +++ S T + +
Sbjct: 91 YLVKVIIGSPGVPLYLVPDTGSGLFWT---QCEPCTRR--FRQLPPIFNSTASRTYRDLP 145
Query: 136 CDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNG 195
C +FC + C + C Y Y GS+T G QD++Q +
Sbjct: 146 CQHQFCTN--NQNVFQCR-DDKCVYRIAYAGGSATAGVAAQDILQ--------SAENDRI 194
Query: 196 SLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRK-MFAHCLDGIN-- 252
FGC +R + N + E+ G N S +S L + K F++CL+ +
Sbjct: 195 PFYFGC-SRDNQNFSTF--ESSGKGGGIIGLNMSPVSLLQQMNHITKNRFSYCLNLFDLS 251
Query: 253 ----GGGIFAIGHVVQPEVNK---TPLVPNQ--PHYSINMTAVQVGLDFLNLPTDVFGVG 303
+ G+ ++ K TP V + P+Y +N+ V V + + +P F +
Sbjct: 252 SPSHATSLLRFGNDIRKSRRKYLSTPFVSPRGMPNYFLNLIDVSVAGNRMQIPPGTFALK 311
Query: 304 DN--KGTIIDSGTTLAYLPEMVYEPLVSKI---ISQQPDLKVHTVHDEYTCFQYSESVDE 358
+ GTIIDSGT + Y+ + Y P+++ Q +V+ Y C++
Sbjct: 312 PDGTGGTIIDSGTAVTYISQTAYFPVITAFKNYFDQHGFQRVNIQLSGYICYKQQGHTFH 371
Query: 359 GFPNVTFHFENSVSLKVYPHEYLFPFED--LWCIGWQNSGMQSR 400
+P++ FHF+ + V P +D +C+ Q Q R
Sbjct: 372 NYPSMAFHFQGA-DFFVEPEYVYLTVQDRGAFCVALQPISPQQR 414
>gi|224142001|ref|XP_002324349.1| predicted protein [Populus trichocarpa]
gi|222865783|gb|EEF02914.1| predicted protein [Populus trichocarpa]
Length = 490
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 87/317 (27%), Positives = 132/317 (41%), Gaps = 41/317 (12%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y +G+GTP KD + DTGSDI W C C RS + ++D S+
Sbjct: 145 GSGNYIVTVGLGTPKKDLSLIFDTGSDITWTQCQPCA----RSCYKQKEQIFDPSQST-- 198
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANT------SCPYLEIYGDGSSTTGYFVQDVVQYDKVS 185
+ C LT T NT +C Y YGD S + G+F + +
Sbjct: 199 ---SYTNISCSSSICNSLTSATGNTPGCASSACVYGIQYGDSSFSVGFFGTEKLT----- 250
Query: 186 GDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFA 245
L +T ++ FGCG + G++G G+ S++SQ A K+F+
Sbjct: 251 --LTSTDAFNNIYFGCGQN-----NQGLFGGSAGLLGLGRDKLSVVSQTAQK--YNKIFS 301
Query: 246 HCLDGINGG-GIFAIGHVVQPEVNKTPL--VPNQPH-YSINMTAVQVGLDFLNLPTDVFG 301
+CL + G G TPL + P Y ++ T + VG L + VF
Sbjct: 302 YCLPSSSSSTGFLTFGGSASKNAKFTPLSTISAGPSFYGLDFTGISVGGKKLAISASVFS 361
Query: 302 VGDNKGTIIDSGTTLAYLPEMVYEPLVSK---IISQQPDLKVHTVHDEYTCFQYSESVDE 358
G IIDSGT + LP Y L + ++S+ P K ++ D TC+ +S
Sbjct: 362 TA---GAIIDSGTVITRLPPAAYSALRASFRNLMSKYPMTKALSILD--TCYDFSSYTTI 416
Query: 359 GFPNVTFHFENSVSLKV 375
P + F F + + + +
Sbjct: 417 SVPKIGFSFSSGIEVDI 433
>gi|242070719|ref|XP_002450636.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
gi|241936479|gb|EES09624.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
Length = 410
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 96/349 (27%), Positives = 148/349 (42%), Gaps = 57/349 (16%)
Query: 39 ERSLSLLKE-HDARRQQRILAGV--DLPLGGSSRP----DGVGLYYAKIGIGTPPKDYYV 91
E +++L + H + ++ +LA D G + P G G Y IGTPP++
Sbjct: 38 EPAINLTRAAHKSHQRLSMLAARLDDAASGSAQTPLQLDSGGGAYDMTFSIGTPPQELSA 97
Query: 92 QVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTD 151
DTGSD++W C C C + S + Y K SS K + C C + P +
Sbjct: 98 LADTGSDLIWAKCGACTRCVPQGS----PSYYPNKSSSFSK-LPCSGSLCSDL---PSSQ 149
Query: 152 CTA-NTSCPYLEIYGDGSS----TTGYFVQDVVQY--DKVSGDLQTTSTNGSLIFGCGAR 204
C+A C Y YG S T GY + D V G + FGC
Sbjct: 150 CSAGGAECDYKYSYGLASDPHHYTQGYLGSETFTLGSDAVPG----------IGFGC--- 196
Query: 205 QSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGINGGG---IFAIGH 261
+ + G++G G+ S++SQL F++CL +F G
Sbjct: 197 --TTMSEGGYGSGSGLVGLGRGPLSLVSQLNVGA-----FSYCLTSDAAKTSPLLFGSGA 249
Query: 262 VVQPEVNKTPLVPNQP-HYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLP 320
+ V TPL+ +Y++N+ ++ +G G G + G I DSGTT+A+L
Sbjct: 250 LTGAGVQSTPLLRTSTYYYTVNLESISIGA------ATTAGTG-SSGIIFDSGTTVAFLA 302
Query: 321 EMVYEPLVSKIISQQPDLKVHTVHDEY-TCFQYSESVDEGFPNVTFHFE 368
E Y ++SQ +L + + D Y CFQ S +V FP++ HF+
Sbjct: 303 EPAYTLAKEAVLSQTTNLTMASGRDGYEVCFQTSGAV---FPSMVLHFD 348
>gi|212275300|ref|NP_001130675.1| uncharacterized protein LOC100191778 precursor [Zea mays]
gi|194706308|gb|ACF87238.1| unknown [Zea mays]
Length = 467
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 94/320 (29%), Positives = 143/320 (44%), Gaps = 40/320 (12%)
Query: 60 VDLPLG-GSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQC-KECPRRSSLG 117
+PLG G+S GVG Y ++G+GTP K Y + VDTGS + W+ C C C R+S
Sbjct: 114 ASVPLGPGTSV--GVGNYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSG-- 169
Query: 118 IELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTS--CPYLEIYGDGSSTTGYFV 175
+++ K SS+ V+C + C + L+ + +TS C Y YGD S + GY
Sbjct: 170 ---PVFNPKASSSYTSVSCSAQQCSDLTTATLSPASCSTSNVCIYQASYGDSSFSVGYLS 226
Query: 176 QDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLA 235
+D V + S + +GCG G + G+IG ++ S++ QLA
Sbjct: 227 KDTVSFGSTSVP--------NFYYGCGQDNEGLFGQSA-----GLIGLARNKLSLLYQLA 273
Query: 236 SSGGVRKMFAHCLDGINGGGIFAIGHVV-QP-EVNKTPLVPNQ---PHYSINMTAVQVGL 290
S G F++CL + + P + + TP+ + Y I MT ++V
Sbjct: 274 PSMGYS--FSYCLPTSSSSSSGYLSIGSYNPGQYSYTPMASSSLDDSLYFIKMTGIKVAG 331
Query: 291 DFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPL---VSKIISQQPDLKVHTVHDEY 347
L + + TIIDSGT + LP VY L V+ + P ++ D
Sbjct: 332 KPL---SVSSSAYSSLPTIIDSGTVITRLPTGVYSALSKAVAGAMKGTPRASAFSILD-- 386
Query: 348 TCFQYSESVDEGFPNVTFHF 367
TCFQ ++ P VT F
Sbjct: 387 TCFQ-GQAARLRVPEVTMAF 405
>gi|86438622|emb|CAJ26370.1| chloroplast nucleoid binding protein [Brachypodium sylvaticum]
Length = 443
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 85/330 (25%), Positives = 129/330 (39%), Gaps = 50/330 (15%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y +G+GTP +D V DTGSD+ WV QC C + L+ SST
Sbjct: 81 GTGNYVVSVGLGTPARDLTVVFDTGSDLSWV---QCGPCSSGGCYHQQDPLFAPSSSSTF 137
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
V C + C + + CPY +YGD S T G+ D + T
Sbjct: 138 SAVRCGEPECPRARQS-CSSSPGDDRCPYEVVYGDKSRTVGHLGNDTLTLGT------TP 190
Query: 192 STNGS---------LIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRK 242
STN S +FGCG +G DG+ G G+ S+ SQ A G +
Sbjct: 191 STNASENNSNKLPGFVFGCGENNTGLFGKA-----DGLFGLGRGKVSLSSQAAGKYG--E 243
Query: 243 MFAHCL--DGINGGGIFAIGHVVQPEVNK--TPLV--PNQPH-YSINMTAVQVGLDFLNL 295
F++CL N G ++G + TP++ N P Y + + ++V + +
Sbjct: 244 GFSYCLPSSSSNAHGYLSLGTPAPAPAHARFTPMLNRSNTPSFYYVKLVGIRVAGRAIKV 303
Query: 296 PTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIIS--------QQPDLKVHTVHDEY 347
+ G I+DSGT + L Y L + +S + P L +
Sbjct: 304 SSRP--ALWPAGLIVDSGTVITRLAPRAYSALRTAFLSAMGKYGYKRAPRLSILD----- 356
Query: 348 TCFQYSESVDE--GFPNVTFHFENSVSLKV 375
TC+ ++ + P V F ++ V
Sbjct: 357 TCYDFTAHANATVSIPAVALVFAGGATISV 386
>gi|242072067|ref|XP_002451310.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
gi|241937153|gb|EES10298.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
Length = 509
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 76/306 (24%), Positives = 128/306 (41%), Gaps = 43/306 (14%)
Query: 93 VDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDC 152
+DT SD+ WV QC CP LYD S + + C C + GP +
Sbjct: 186 LDTASDVAWV---QCFPCPASQCYAQTDVLYDPSKSRSSESFACSSPTCRQL--GPYANG 240
Query: 153 TANTS-----CPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSG 207
+++S C Y Y DGS+T+G V D + L TS FGC G
Sbjct: 241 CSSSSNSAGQCQYRVRYPDGSTTSGTLVADQL-------SLSPTSQVPKFEFGCSHAARG 293
Query: 208 NLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL-DGINGGGIFAIG------ 260
+ + GI+ G+ S++SQ ++ G ++F++C + G F +G
Sbjct: 294 SFSRSKTA---GIMALGRGVQSLVSQTSTKYG--QVFSYCFPPTASHKGFFVLGVPRRSS 348
Query: 261 --HVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAY 318
+ V P + KTP++ Y + + A+ V L++P VF G +DS T +
Sbjct: 349 SRYAVTPML-KTPML-----YQVRLEAIAVAGQRLDVPPTVFAA----GAALDSRTVITR 398
Query: 319 LPEMVYEPLVSKIISQQPDLKVHTVHDEY-TCFQYSESVDEGFPNVTFHFENS-VSLKVY 376
LP Y+ L S + + + + TC+ ++ P ++ F+ + +++
Sbjct: 399 LPPTAYQALRSAFRDKMSMYRPAAANGQLDTCYDFTGVSSIMLPTISLVFDRTGAGVQLD 458
Query: 377 PHEYLF 382
P LF
Sbjct: 459 PSGVLF 464
>gi|357152725|ref|XP_003576216.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase Asp1-like,
partial [Brachypodium distachyon]
Length = 354
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 95/354 (26%), Positives = 136/354 (38%), Gaps = 75/354 (21%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKF 133
G Y + IG K Y++ +DTGS + W+
Sbjct: 34 GHIYVTMSIGEQEKPYFLDIDTGSTLTWLE------------------------------ 63
Query: 134 VTCDQEFCHGVYGGPLTDCTAN-TSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTS 192
D F H DC N C Y Y G S+ G + D K S L
Sbjct: 64 ---DVRFKH--------DCKENPNQCDYDVRYAGGESSLGVLIAD-----KFS--LPGRD 105
Query: 193 TNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRK-MFAHCLDGI 251
+L FGCG Q G E +DG++G G+ + SQL G + + + HCL I
Sbjct: 106 ARPTLTFGCGYDQEGG---KAEMPVDGVLGIGRGTRDLASQLKQQGAIAENVIGHCLR-I 161
Query: 252 NGGGIFAIGHVVQPE--VNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTI 309
GGG GH P V P+VPN +YS + A+ + N P V + +
Sbjct: 162 QGGGYLFFGHEKVPSSVVTWVPMVPNNHYYSPGLAALHFNGNLGN-PISVAPME----VV 216
Query: 310 IDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEY--TCFQYSE------SVDEGFP 361
IDSG+T Y+P Y LV +I+ + V D C+ E V + F
Sbjct: 217 IDSGSTYTYMPTETYRRLVFVVIASLSKSSLTLVRDPALPVCWAGKEPFKXIGDVKDKFK 276
Query: 362 NVTFHFENSVS---LKVYPHEYL-FPFEDLWCIGWQNSGMQSRDRKNMTLLGDF 411
+ F S +++ P YL E C+G + G Q+ RK + ++GD
Sbjct: 277 PLELAFIQGTSQAIMEIPPENYLIISGEGNVCMGILD-GTQAGLRK-LNVIGDI 328
>gi|449432044|ref|XP_004133810.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 471
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 87/332 (26%), Positives = 126/332 (37%), Gaps = 49/332 (14%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y+ +IG+GTPPK Y+ +DTGSDI+W+ C CK C ++ + +K S
Sbjct: 125 GSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQT----DPVFNPVKSGSFA 180
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
K V C C + C +C Y YGDGS TTG FV + + + + +
Sbjct: 181 K-VLCRTPLCRRLES---PGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVE---- 232
Query: 192 STNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGG--VRKMFAHCLD 249
+ GCG + E L S G + F++CL
Sbjct: 233 ----QVALGCGH---------DNEGLFVGAAGLLGLGRGGLSFPSQAGRTFNQKFSYCLV 279
Query: 250 GINGGG-----IFAIGHVVQPEVNKTPLVPN---QPHYSINMTAVQV------GLDFLNL 295
+ +F V TPL+ N Y + + + V G+ +
Sbjct: 280 DRSASSKPSSVVFG-NSAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSGITASHF 338
Query: 296 PTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEY-TCFQYSE 354
D G N G IID GT++ L + Y L + LK + TC+ S
Sbjct: 339 KLDRTG---NGGVIIDCGTSVTRLNKPAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSG 395
Query: 355 SVDEGFPNVTFHFENS-VSLKVYPHEYLFPFE 385
P V HF + VSL YL P +
Sbjct: 396 KTTVKVPTVVLHFRGADVSLPA--SNYLIPVD 425
>gi|449527149|ref|XP_004170575.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 487
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 85/349 (24%), Positives = 152/349 (43%), Gaps = 45/349 (12%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIEL-TLYDIKDSSTGKFV 134
Y A+IG+G P K +Y+ DTGSD+ W +QC+ C ++ + ++D K SS+ +
Sbjct: 148 YLAQIGVGQPVKLFYLVPDTGSDVTW---LQCQPCASENTCYKQFDPIFDPKSSSSYSPL 204
Query: 135 TCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTN 194
+C+ + C + +C ++T C Y YGDGS TTG + + + +++
Sbjct: 205 SCNSQQCKLLDKA---NCNSDT-CIYQVHYGDGSFTTGELATETLSFGN-------SNSI 253
Query: 195 GSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI--N 252
+L GCG G G S+ SQL +S F++CL + +
Sbjct: 254 PNLPIGCGHDNEGLFAGGAGLIGL-----GGGAISLSSQLKASS-----FSYCLVNLDSD 303
Query: 253 GGGIFAIGHVVQPEVNKTPLVPNQPHYS---INMTAVQVGLDFLNLPTDVFGVGDN--KG 307
+ + +PLV N +S + + + VG L + F + ++ G
Sbjct: 304 SSSTLEFNSYMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESGLGG 363
Query: 308 TIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVH---TVHDEYTCFQYSESVDEGFPNVT 364
I+DSGT ++ LP VYE L + L +V D TC+ +S + P +
Sbjct: 364 IIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFD--TCYNFSGQSNVEVPTIA 421
Query: 365 FHFENSVSLKVYPHEYLFPFED--LWCIGWQNSGMQSRDRKNMTLLGDF 411
F SL++ YL + +C+ + + + +++++G F
Sbjct: 422 FVLSEGTSLRLPARNYLIMLDTAGTYCLAFI------KTKSSLSIIGSF 464
>gi|293336306|ref|NP_001168599.1| uncharacterized protein LOC100382383 [Zea mays]
gi|223949441|gb|ACN28804.1| unknown [Zea mays]
Length = 326
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 79/315 (25%), Positives = 129/315 (40%), Gaps = 42/315 (13%)
Query: 93 VDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDC 152
+DTGSD+ WV C C +C ++S ++D S++ V+CD + C + +
Sbjct: 3 LDTGSDVTWVQCQPCADCYQQSD-----PVFDPSLSASYAAVSCDSQRCRDLDTAACRNA 57
Query: 153 TANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDST 212
T +C Y YGDGS T G F + + L ++ G++ GCG G
Sbjct: 58 TG--ACLYEVAYGDGSYTVGDFATETLT-------LGDSTPVGNVAIGCGHDNEGLFVGA 108
Query: 213 NEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGIN---------GGGIFAIGHVV 263
G S SQ+++S F++CL + G G G V
Sbjct: 109 AGLLALGGGPL-----SFPSQISAS-----TFSYCLVDRDSPAASTLQFGDGAAEAGTVT 158
Query: 264 QPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGT---IIDSGTTLAYLP 320
P V ++P Y + ++ + VG L++P F + G+ I+DSGT + L
Sbjct: 159 APLV-RSPRTST--FYYVALSGISVGGQPLSIPASAFAMDATSGSGGVIVDSGTAVTRLQ 215
Query: 321 EMVYEPLVSKIISQQPDL-KVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHE 379
Y L + P L + V TC+ S+ P V+ FE +L++
Sbjct: 216 SAAYAALRDAFVQGAPSLPRTSGVSLFDTCYDLSDRTSVEVPAVSLRFEGGGALRLPAKN 275
Query: 380 YLFPFEDL--WCIGW 392
YL P + +C+ +
Sbjct: 276 YLIPVDGAGTYCLAF 290
>gi|15242307|ref|NP_199325.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|9758987|dbj|BAB09497.1| chloroplast nucleoid DNA-binding protein-like [Arabidopsis
thaliana]
gi|332007824|gb|AED95207.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 491
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 109/395 (27%), Positives = 160/395 (40%), Gaps = 93/395 (23%)
Query: 41 SLSLLKEHDARRQQRILAGVDL---PLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGS 97
SL K R ++ L+ VD+ PL DG Y + IGTPP+ V +DTGS
Sbjct: 50 SLPTPKSQTQERIKKPLSSVDVVMEPLREVR--DG---YLITLNIGTPPQAVQVYLDTGS 104
Query: 98 DIMWVNC----IQCKECPRRSSLGIE-LTLYDIKDSSTGKFVTCDQEFCHGVYGG--PLT 150
D+ WV C C EC + ++ +++ SST +C FC ++ P
Sbjct: 105 DLTWVPCGNLSFDCIECYDLKNNDLKSPSVFSPLHSSTSFRDSCASSFCVEIHSSDNPFD 164
Query: 151 DC-------------TANTSCP-YLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGS 196
C T CP + YG+G +G +D+++ T
Sbjct: 165 PCAVAGCSVSMLLKSTCVRPCPSFAYTYGEGGLISGILTRDILK--------ARTRDVPR 216
Query: 197 LIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL-------- 248
FGC + ST E + GI GFG+ S+ SQL G + K F+HC
Sbjct: 217 FSFGC-------VTSTYREPI-GIAGFGRGLLSLPSQL---GFLEKGFSHCFLPFKFVNN 265
Query: 249 -----DGINGGGIFAIGHV----VQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDV 299
I G +I P +N TP+ PN Y I + ++ +G + PT V
Sbjct: 266 PNISSPLILGASALSINLTDSLQFTPMLN-TPMYPNS--YYIGLESITIGTNI--TPTQV 320
Query: 300 ------FGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQP-------------DL-- 338
F N G ++DSGTT +LPE Y L++ + S DL
Sbjct: 321 PLTLRQFDSQGNGGMLVDSGTTYTHLPEPFYSQLLTTLQSTITYPRATETESRTGFDLCY 380
Query: 339 KVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSL 373
KV ++ T + V FP++TFHF N+ +L
Sbjct: 381 KVPCPNNNLTSLE--NDVMMIFPSITFHFLNNATL 413
>gi|357127503|ref|XP_003565419.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
distachyon]
Length = 486
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 90/364 (24%), Positives = 154/364 (42%), Gaps = 49/364 (13%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y I +GTPP DTGSD++WV C + K+ S+ + S+ G+ V
Sbjct: 110 YLMAIEVGTPPVRVLAIADTGSDLVWVKC-KGKDNDNNSTAPPSVYFVPSASSTYGR-VG 167
Query: 136 CDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTN- 194
CD + C + C+ + SC YL YGDGS +G + + ++ +T S
Sbjct: 168 CDTKACRALSSA--ASCSPDGSCEYLYSYGDGSRASGQLSTETFTFSTIADSSKTNSHGN 225
Query: 195 -------------GSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVR 241
L FGC +G + DG++G G S+ SQL ++ +
Sbjct: 226 NNNNSSSHGQVEIAKLDFGCSTTTTGTFRA------DGLVGLGGGPVSLASQLGATTSLG 279
Query: 242 KMFAHCL---DGINGGGIFAIGH---VVQPEVNKTPLVPN--QPHYSINMTAVQVGLDFL 293
+ F++CL N G V +P TPL+ + +Y+I + ++ V
Sbjct: 280 RKFSYCLAPYANTNASSALNFGSRAVVSEPGAASTPLITGEVETYYTIALDSINVA--GT 337
Query: 294 NLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEY--TCFQ 351
PT I+DSGTTL YL + PLV K ++++ L ++ C+
Sbjct: 338 KRPT----TAAQAHIIVDSGTTLTYLDSALLTPLV-KDLTRRIKLPRAESPEKILDLCYD 392
Query: 352 YSESVDE---GFPNVTFHFENSVSLKVYP-HEYLFPFEDLWCIGWQNSGMQSRDRKNMTL 407
S E G P+VT + + P + ++ E + C+ + + +R+++++
Sbjct: 393 ISGVRGEDALGIPDVTLVLGGGGEVTLKPDNTFVVVQEGVLCLAL----VATSERQSVSI 448
Query: 408 LGDF 411
LG+
Sbjct: 449 LGNI 452
>gi|255558640|ref|XP_002520345.1| nucellin, putative [Ricinus communis]
gi|223540564|gb|EEF42131.1| nucellin, putative [Ricinus communis]
Length = 424
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 89/341 (26%), Positives = 136/341 (39%), Gaps = 50/341 (14%)
Query: 62 LPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNC-IQCKECPRRSSLGIEL 120
LP+ G+ P +G Y + IG PPK + + +DTGSD+ WV C C C +
Sbjct: 55 LPVFGNVYP--LGYYSVSLYIGNPPKLFELDIDTGSDLTWVQCDAPCTGCTKPLH----- 107
Query: 121 TLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDC-TANTSCPYLEIYGDGSSTTGYFVQDVV 179
LY +++ ++C C V C +A C Y Y D S+ G V D
Sbjct: 108 HLYKPRNN----LLSCIDPLCSAVQNSGTYQCQSATDQCDYEIQYADEGSSLGVLVTDYF 163
Query: 180 QYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGG 239
++G + FGCG Q + G++G G +S+ISQL + G
Sbjct: 164 PLRLMNGSF----LRPKMTFGCGYDQK-SPGPVAPPPTTGVLGLGNGKTSIISQLQALGV 218
Query: 240 VRKMFAHCLDGINGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAV-QVGLD--FLNLP 296
+ + HCL GG +F + P+ P + I+ + Q LD + + P
Sbjct: 219 MGNVIGHCLSRKGGGFLF---------FGQDPV----PSFGISWAPMSQKSLDKYYASGP 265
Query: 297 TDVFGVGDNKGT-----IIDSGTTLAYLPEMVYEP---LVSKIISQQPDLKVHTVHDEYT 348
++ G GT I DSG++ Y VY+ L+ K +S +P
Sbjct: 266 AELLYGGKPTGTKAEEFIFDSGSSYTYFNAQVYQSTLNLIRKELSGKPLRDAPEEKALAI 325
Query: 349 C------FQYSESVDEGFPNVTFHF--ENSVSLKVYPHEYL 381
C F+ V F F SV L++ P +YL
Sbjct: 326 CWKGTKRFKSVNEVKSYFKPFALSFTKAKSVQLQIPPEDYL 366
>gi|115484513|ref|NP_001065918.1| Os11g0184800 [Oryza sativa Japonica Group]
gi|122221757|sp|Q0IU52.1|ASP1_ORYSJ RecName: Full=Aspartic proteinase Asp1; Short=OSAP1; Short=OsAsp1;
AltName: Full=Nucellin-like protein; Flags: Precursor
gi|33340111|gb|AAQ14543.1|AF308691_1 nucellin-like protein [Oryza sativa Japonica Group]
gi|33340113|gb|AAQ14544.1|AF308692_1 nucellin-like protein [Oryza sativa Japonica Group]
gi|62954898|gb|AAY23267.1| nucellin-like protein [Oryza sativa Japonica Group]
gi|77548967|gb|ABA91764.1| Aspartic proteinase Asp1 precursor, putative, expressed [Oryza
sativa Japonica Group]
gi|113644622|dbj|BAF27763.1| Os11g0184800 [Oryza sativa Japonica Group]
gi|215766817|dbj|BAG99045.1| unnamed protein product [Oryza sativa Japonica Group]
gi|385717694|gb|AFI71282.1| aspartic proteinase [Oryza sativa Japonica Group]
Length = 410
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 76/282 (26%), Positives = 115/282 (40%), Gaps = 30/282 (10%)
Query: 58 AGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNC-IQCKECPRRSSL 116
+ V L L G+ P +G ++ + IG P K Y++ +DTGS + W+ C C C
Sbjct: 22 SAVVLELHGNVYP--IGHFFITMNIGDPAKSYFLDIDTGSTLTWLQCDAPCTNC------ 73
Query: 117 GIELTLYDIKDSSTGKFVTCDQEFCHGVYG--GPLTDCTANTSCPYLEIYGDGSSTTGYF 174
+ + + + K VTC C +Y G C + C Y+ Y D SS+ G
Sbjct: 74 --NIVPHVLYKPTPKKLVTCADSLCTDLYTDLGKPKRCGSQKQCDYVIQYVD-SSSMGVL 130
Query: 175 VQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQL 234
V D +G TT + FGCG Q G + +D I+G + +++SQL
Sbjct: 131 VIDRFSLSASNGTNPTT-----IAFGCGYDQ-GKKNRNVPIPVDSILGLSRGKVTLLSQL 184
Query: 235 ASSGGVRK-MFAHCLDGINGGGIFAIGHVVQPE--VNKTPLVPNQPHYSINMTAVQVGLD 291
S G + K + HC+ GGG G P V TP+ +YS + +
Sbjct: 185 KSQGVITKHVLGHCISS-KGGGFLFFGDAQVPTSGVTWTPMNREHKYYSPGHGTLHFDSN 243
Query: 292 FLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIIS 333
+ V I DSG T Y Y+ +S + S
Sbjct: 244 SKAISAAPMAV------IFDSGATYTYFAAQPYQATLSVVKS 279
>gi|223973231|gb|ACN30803.1| unknown [Zea mays]
Length = 459
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 97/392 (24%), Positives = 155/392 (39%), Gaps = 71/392 (18%)
Query: 36 AGRERSLSLLKEHDARRQQRILAG-VDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVD 94
A R+L L + Q+ G +P + P G Y +GTPP+ V +D
Sbjct: 26 ASLARALHLKRRDPNHHSQKGSGGHPSVPATAALYPHSYGGYAFTASLGTPPQPLPVLLD 85
Query: 95 TGSDIMWVNCI---QCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGP--L 149
TGS + WV C +C+ C S+ + ++ K+SS+ + V C C V+
Sbjct: 86 TGSHLTWVPCTSSYECRNCSSPSASAVP--VFHPKNSSSSRLVGCRNPSCQWVHSAANLA 143
Query: 150 TDC---------------TANTSCPYLEIYGDGSSTTGYFVQDVVQYD--KVSGDLQTTS 192
T C +N PY +YG G ST G + D ++ V G
Sbjct: 144 TKCRRAPCSPGAANCPAAASNVCPPYAVVYGSG-STAGLLIADTLRAPGRAVPG------ 196
Query: 193 TNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL---- 248
+ GC +L S ++ G+ GFG+ S+ +QL G+ K F++CL
Sbjct: 197 ----FVLGC------SLVSVHQPP-SGLAGFGRGAPSVPAQL----GLPK-FSYCLLSRR 240
Query: 249 ---DGINGGGIFAIGHVVQPEVNKTPLV--------PNQPHYSINMTAVQVGLDFLNLPT 297
+ G + G + PLV P +Y + + V VG + LP
Sbjct: 241 FDDNAAVSGSLVLGGTGGGEGMQYVPLVKSAAGDKLPYGVYYYLALRGVTVGGKAVRLPA 300
Query: 298 DVFGVGD--NKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDL--KVHTVHDE---YTCF 350
F + GTI+DSGTT YL V++P+ +++ + DE + CF
Sbjct: 301 RAFAANAAGSGGTIVDSGTTFTYLDPTVFQPVADAVVAAVGGRYKRSKDAEDELGLHPCF 360
Query: 351 QYSESVDE-GFPNVTFHFENSVSLKVYPHEYL 381
+ P ++FHFE +++ Y
Sbjct: 361 ALPQGARSMALPELSFHFEGGAVMQLPVENYF 392
>gi|219886223|gb|ACL53486.1| unknown [Zea mays]
gi|238015146|gb|ACR38608.1| unknown [Zea mays]
gi|413938611|gb|AFW73162.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
gi|413938612|gb|AFW73163.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
gi|413938613|gb|AFW73164.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
gi|413938614|gb|AFW73165.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
Length = 467
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 93/320 (29%), Positives = 141/320 (44%), Gaps = 40/320 (12%)
Query: 60 VDLPLG-GSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQC-KECPRRSSLG 117
+PLG G+S GVG Y ++G+GTP K Y + VDTGS + W+ C C C R+S
Sbjct: 114 ASVPLGPGTSV--GVGNYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSG-- 169
Query: 118 IELTLYDIKDSSTGKFVTCDQEFCHGVYGGPL--TDCTANTSCPYLEIYGDGSSTTGYFV 175
+++ K SS+ V+C + C + L C+ + C Y YGD S + GY
Sbjct: 170 ---PVFNPKASSSYTSVSCSAQQCSDLTTATLNPASCSTSNVCIYQASYGDSSFSVGYLS 226
Query: 176 QDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLA 235
+D V + S + +GCG G + G+IG ++ S++ QLA
Sbjct: 227 KDTVSFGSTSVP--------NFYYGCGQDNEGLFGQSA-----GLIGLARNKLSLLYQLA 273
Query: 236 SSGGVRKMFAHCLDGINGGGIFAIGHVV-QP-EVNKTPLVPNQ---PHYSINMTAVQVGL 290
S G F++CL + + P + + TP+ + Y I MT ++V
Sbjct: 274 PSMGYS--FSYCLPTSSSSSSGYLSIGSYNPGQYSYTPMASSSLDDSLYFIKMTGIKVAG 331
Query: 291 DFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPL---VSKIISQQPDLKVHTVHDEY 347
L + + TIIDSGT + LP VY L V+ + P ++ D
Sbjct: 332 KPL---SVSSSAYSSLPTIIDSGTVITRLPTGVYSALSKAVAGAMKGTPRASAFSILD-- 386
Query: 348 TCFQYSESVDEGFPNVTFHF 367
TCFQ ++ P VT F
Sbjct: 387 TCFQ-GQAARLRVPEVTMAF 405
>gi|413938618|gb|AFW73169.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 324
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 93/334 (27%), Positives = 140/334 (41%), Gaps = 46/334 (13%)
Query: 91 VQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLT 150
++VDTGSD+ WV C C P S + L+D SS+ V C C G+ G
Sbjct: 1 MEVDTGSDLSWVQCKPCAAAP--SCYSQKDPLFDPAQSSSYAAVPCGGPVCAGL-GIYAA 57
Query: 151 DCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLD 210
+ C Y+ YGDGS+TTG + D + L +S FGCG QSG +
Sbjct: 58 SACSAAQCGYVVSYGDGSNTTGVYSSDTLT-------LSASSAVQGFFFGCGHAQSGLFN 110
Query: 211 STNEEALDGIIGFGKSNSSMISQLASS-GGVRKMFAHCLD-GINGGGIFAIG----HVVQ 264
+DG++G G+ S++ Q A + GGV F++CL + G +G
Sbjct: 111 -----GVDGLLGLGREQPSLVEQTAGTYGGV---FSYCLPTKPSTAGYLTLGVGGPSGAA 162
Query: 265 PEVNKTPLV--PNQP-HYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPE 321
P + T L+ PN P +Y + +T + VG L++P F G T T + LP
Sbjct: 163 PGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTVVDTG----TVVTRLPP 218
Query: 322 MVYEPLVSKIISQQ-----PDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVY 376
Y L S S P + + D TC+ ++ PNV F + ++ +
Sbjct: 219 TAYAALRSAFRSGMASYGYPTAPSNGILD--TCYNFAGYGTVTLPNVALTFGSGATVTLG 276
Query: 377 PHEYLFPFEDLWCIGWQNSGMQSRDRKNMTLLGD 410
L C+ + SG M +LG+
Sbjct: 277 ADGIL----SFGCLAFAPSG----SDGGMAILGN 302
>gi|222615640|gb|EEE51772.1| hypothetical protein OsJ_33215 [Oryza sativa Japonica Group]
Length = 775
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 85/324 (26%), Positives = 128/324 (39%), Gaps = 40/324 (12%)
Query: 24 SNHGVFSVKYRYAGRERSLSLLKEHDAR-----RQQRILAGVDLPLGGSSRPDGVGL--- 75
+ HG + V + G S +++ E R RQ A P G + GVG
Sbjct: 347 NTHGSWGVTHDDRGVPHSEAIIHETPNRKVGTARQPSSPA----PTGAAILCRGVGAPRH 402
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNC-IQCKECPRRSSLGIELTLYDIKDSSTGKFV 134
++ + IG P K Y++ +DTGS + W+ C C C + + + + K V
Sbjct: 403 FFITMNIGDPAKSYFLDIDTGSTLTWLQCDAPCTNC--------NIVPHVLYKPTPKKLV 454
Query: 135 TCDQEFCHGVYG--GPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTS 192
TC C +Y G C + C Y+ Y D SS+ G V D +G TT
Sbjct: 455 TCADSLCTDLYTDLGKPKRCGSQKQCDYVIQYVD-SSSMGVLVIDRFSLSASNGTNPTT- 512
Query: 193 TNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRK-MFAHCLDGI 251
+ FGCG Q G + +D I+G + +++SQL S G + K + HC+
Sbjct: 513 ----IAFGCGYDQ-GKKNRNVPIPVDSILGLSRGKVTLLSQLKSQGVITKHVLGHCISS- 566
Query: 252 NGGGIFAIGHVVQPE--VNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTI 309
GGG G P V TP+ +YS + + + V I
Sbjct: 567 KGGGFLFFGDAQVPTSGVTWTPMNREHKYYSPGHGTLHFDSNSKAISAAPMAV------I 620
Query: 310 IDSGTTLAYLPEMVYEPLVSKIIS 333
DSG T Y Y+ +S + S
Sbjct: 621 FDSGATYTYFAAQPYQATLSVVKS 644
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 64/266 (24%), Positives = 105/266 (39%), Gaps = 36/266 (13%)
Query: 156 TSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEE 215
T C Y Y DG+ST G + D +++ T +L FGCG Q +
Sbjct: 27 TQCDYEIKYADGASTIGALIVDQFSLPRIA-------TRPNLPFGCGYNQGIGENFQQTS 79
Query: 216 ALDGIIGFGKSNSSMISQLASSGGVRK-MFAHCLDGINGGGIFAIGHVVQPEVNKTPLVP 274
++GI+G + S +SQL G + K + HCL GGG+ +G + + ++
Sbjct: 80 PVNGILGLDRGKVSFVSQLKMLGIITKHVVGHCLSS-GGGGLLFVG-----DGDGNLVLL 133
Query: 275 NQPHYSINMTAVQVGLDFLNL-PTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKI-- 331
+ +YS + L + P DV + DSG+T Y Y+ V I
Sbjct: 134 HANYYSPGSATLYFDRHSLGMNPMDV---------VFDSGSTYTYFTAQPYQATVYAIKG 184
Query: 332 ------ISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFE 385
+ Q D + F+ V + F ++ +F N+ +++ P YL E
Sbjct: 185 GLSSTSLEQVSDPSLPLCWKGQKAFESVFDVKKEFKSLQLNFGNNAVMEIPPENYLIVTE 244
Query: 386 DLWCIGWQNSGMQSRDRKNMTLLGDF 411
G G+ R N ++GD
Sbjct: 245 ----YGNVCLGILHGCRLNFNIIGDI 266
>gi|449455475|ref|XP_004145478.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
gi|449518962|ref|XP_004166504.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 449
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 95/355 (26%), Positives = 147/355 (41%), Gaps = 73/355 (20%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNC----IQCKECP--RRSSLGIELTLYDIKDSS 129
Y + IGTPP+ V +DTGSD+ WV C C++C + + G L + SS
Sbjct: 21 YLMSLSIGTPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNISGPRLAAFLPTHSS 80
Query: 130 TGKFVTCDQEFCHGVYG--GPLTDCT---------ANTSCP-----YLEIYGDGSSTTGY 173
T TC FC ++ P CT +CP + YG TG
Sbjct: 81 TSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLASLVKGTCPRPCPSFAYTYGASGVVTGS 140
Query: 174 FVQDVV-QYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMIS 232
+DV+ + + + FGC + +T E + GI GFG+ S+
Sbjct: 141 LTRDVLFTHGNYNNNNNNNKQIPRFCFGC-------VGATYREPI-GIAGFGRGLLSLPF 192
Query: 233 QLASSGGVRKMFAHCL------DGINGGGIFAIGHVVQPEVN---------KTPLVPNQP 277
QL S K F+HC + N +G++ + K+P+ PN
Sbjct: 193 QLGFS---HKGFSHCFLPFKFSNNPNFSSPLILGNLAISSKDENLQFTPLLKSPMYPN-- 247
Query: 278 HYSINMTAVQVGLDFLNLPTDVFGVG---------DNKGTIIDSGTTLAYLPEMVYEPLV 328
+Y I + ++ +G N FGV N G +IDSGTT +LPE +Y L+
Sbjct: 248 YYYIGLESITIGNGDNNF---RFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLI 304
Query: 329 SKI-----ISQQPDLKVHTVHD---EYTCFQYSESV--DEGFPNVTFHFENSVSL 373
S + + ++++T D + C + S D P++TFHF N+VS+
Sbjct: 305 SNLELVIGYPRAKQVELNTGFDLCYKVPCKNNNSSFVDDAQLPSITFHFLNNVSV 359
>gi|449440933|ref|XP_004138238.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 487
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 85/349 (24%), Positives = 152/349 (43%), Gaps = 45/349 (12%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIEL-TLYDIKDSSTGKFV 134
Y A+IG+G P K +Y+ DTGSD+ W +QC+ C ++ + ++D K SS+ +
Sbjct: 148 YLAQIGVGQPVKLFYLVPDTGSDVTW---LQCQPCASENTCYKQFDPIFDPKSSSSYSPL 204
Query: 135 TCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTN 194
+C+ + C + +C ++T C Y YGDGS TTG + + + +++
Sbjct: 205 SCNSQQCKLLDKA---NCNSDT-CIYQVHYGDGSFTTGELATETLSFGN-------SNSI 253
Query: 195 GSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDGI--N 252
+L GCG G G S+ SQL +S F++CL + +
Sbjct: 254 PNLPIGCGHDNEGLFAGGAGLIGL-----GGGAISLSSQLKASS-----FSYCLVNLDSD 303
Query: 253 GGGIFAIGHVVQPEVNKTPLVPNQPHYS---INMTAVQVGLDFLNLPTDVFGVGDN--KG 307
+ + +PLV N +S + + + VG L + F + ++ G
Sbjct: 304 SSSTLEFNSNMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESGLGG 363
Query: 308 TIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVH---TVHDEYTCFQYSESVDEGFPNVT 364
I+DSGT ++ LP VYE L + L +V D TC+ +S + P +
Sbjct: 364 IIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFD--TCYNFSGQSNVEVPTIA 421
Query: 365 FHFENSVSLKVYPHEYLFPFED--LWCIGWQNSGMQSRDRKNMTLLGDF 411
F SL++ YL + +C+ + + + +++++G F
Sbjct: 422 FVLSEGTSLRLPARNYLIMLDTAGTYCLAFI------KTKSSLSIIGSF 464
>gi|21450872|gb|AAK44106.2|AF370291_1 unknown protein [Arabidopsis thaliana]
Length = 375
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 91/321 (28%), Positives = 139/321 (43%), Gaps = 43/321 (13%)
Query: 73 VGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGK 132
+G Y + +GTPP+ ++ +DT +D +W+ C C C S+ ST
Sbjct: 27 IGNYVVRAKLGTPPQLMFMVLDTSNDAVWLPCSGCSGC---SNASTSFNTNSSSTYST-- 81
Query: 133 FVTCDQEFCHGVYGGPLTDCTANTS-CPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
V+C C G + S C + + YG SS + VQD + ++ D+
Sbjct: 82 -VSCSTAQCTQARGLTCPSSSPQPSVCSFNQSYGGDSSFSASLVQDTL---TLAPDVIP- 136
Query: 192 STNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLAS--SGGVRKMFAHCLD 249
+ FGC SG N G++G G+ S++SQ S SG +F++CL
Sbjct: 137 ----NFSFGCINSASG-----NSLPPQGLMGLGRGPMSLVSQTTSLYSG----VFSYCLP 183
Query: 250 GING---GGIFAIGHVVQPE-VNKTPLV--PNQPH-YSINMTAVQVGLDFLNLPTD---- 298
G +G + QP+ + TPL+ P +P Y +N+T V VG + +P D
Sbjct: 184 SFRSFYFSGSLKLGLLGQPKSIRYTPLLRNPRRPSLYYVNLTGVSVG--SVQVPVDPVYL 241
Query: 299 VFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDE 358
F GTIIDSGT + + VYE + + Q T+ TCF S +
Sbjct: 242 TFDANSGAGTIIDSGTVITRFAQPVYEAIRDEFRKQVNVSSFSTLGAFDTCF--SADNEN 299
Query: 359 GFPNVTFHFENSVSLKVYPHE 379
P +T H S+ LK+ P E
Sbjct: 300 VAPKITLHM-TSLDLKL-PME 318
>gi|66815065|ref|XP_641634.1| hypothetical protein DDB_G0279453 [Dictyostelium discoideum AX4]
gi|60469677|gb|EAL67665.1| hypothetical protein DDB_G0279453 [Dictyostelium discoideum AX4]
Length = 864
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 92/346 (26%), Positives = 150/346 (43%), Gaps = 51/346 (14%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWV---NCIQCKECPRRSSL----GIELTLYDIKDS 128
Y+ I +GTPP+ + VQVDTGS + V NC K ++S G LY+ DS
Sbjct: 165 YFIPILVGTPPQMFTVQVDTGSTSLAVPGLNCYLYKSQTIKTSCSCSDGNLDGLYNFDDS 224
Query: 129 STGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVS--- 185
+G + C C+ D +CP++ YGDGS G V D V + +
Sbjct: 225 VSGIALNCSASVCNNSCQNKNHD-----NCPFMLKYGDGSFIAGSLVIDNVTIGQFTVPA 279
Query: 186 --GDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFG------KSNSSMISQLASS 237
G++Q S + S + C + ++ ++ DGI+G + + S++ SS
Sbjct: 280 KFGNIQKESLSFSQL-TCPS------NARSQAVRDGILGLSFQELDPYNGDDIFSKIVSS 332
Query: 238 GGVRKMFAHCLDGINGGGIFAIGHV---VQPEVNKTPLVPNQPHYSINMTAVQVGLDFLN 294
G+ +F+ CL GGI IG + V E K + + +YSI++ + V + L
Sbjct: 333 YGIPNVFSMCLG--KDGGILTIGGINERVNIETPKYTPIIDFHYYSIHVLNIYVENESLK 390
Query: 295 LPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHD----EYTCF 350
F D +I+DSGTTL Y + ++ ++ + +Q K+ + + E C
Sbjct: 391 -----FTPNDFISSIVDSGTTLLYFNDEIFYSIIKNL--EQSYSKLPGIGEDKFWEGNCH 443
Query: 351 QYSESVDEGFPNVTFHFE-----NSVSLKVYPHEYLFPFEDLWCIG 391
SE E +P + + S L + P Y +L C G
Sbjct: 444 YLSEESVELYPTIYLELDGSGASGSFKLAIPPSLYFLKINNLHCFG 489
>gi|223975883|gb|ACN32129.1| unknown [Zea mays]
gi|223975971|gb|ACN32173.1| unknown [Zea mays]
gi|224034191|gb|ACN36171.1| unknown [Zea mays]
gi|413938623|gb|AFW73174.1| aspartic proteinase nepenthesin-1 isoform 1 [Zea mays]
gi|413938624|gb|AFW73175.1| aspartic proteinase nepenthesin-1 isoform 2 [Zea mays]
Length = 465
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 93/320 (29%), Positives = 141/320 (44%), Gaps = 40/320 (12%)
Query: 60 VDLPLG-GSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQC-KECPRRSSLG 117
+PLG G+S GVG Y ++G+GTP K Y + VDTGS + W+ C C C R+S
Sbjct: 112 ASVPLGPGTSV--GVGNYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSG-- 167
Query: 118 IELTLYDIKDSSTGKFVTCDQEFCHGVYGGPL--TDCTANTSCPYLEIYGDGSSTTGYFV 175
+++ K SS+ V+C + C + L C+ + C Y YGD S + GY
Sbjct: 168 ---PVFNPKASSSYASVSCSAQQCSDLTTATLNPASCSTSNVCIYQASYGDSSFSVGYLS 224
Query: 176 QDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLA 235
+D V + S + +GCG G + G+IG ++ S++ QLA
Sbjct: 225 KDTVSFGSTSVP--------NFYYGCGQDNEGLFGQSA-----GLIGLARNKLSLLYQLA 271
Query: 236 SSGGVRKMFAHCLDGINGGGIFAIGHVV-QP-EVNKTPLVPNQ---PHYSINMTAVQVGL 290
S G F++CL + + P + + TP+ + Y I MT ++V
Sbjct: 272 PSMGYS--FSYCLPTSSSSSSGYLSIGSYNPGQYSYTPMASSSLDDSLYFIKMTGIKVAG 329
Query: 291 DFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPL---VSKIISQQPDLKVHTVHDEY 347
L + + TIIDSGT + LP VY L V+ + P ++ D
Sbjct: 330 KPL---SVSSSAYSSLPTIIDSGTVITRLPTGVYSALSKAVAGAMKGTPRASAFSILD-- 384
Query: 348 TCFQYSESVDEGFPNVTFHF 367
TCFQ ++ P VT F
Sbjct: 385 TCFQ-GQAARLRVPEVTMAF 403
>gi|255543963|ref|XP_002513044.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223548055|gb|EEF49547.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 431
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 87/343 (25%), Positives = 143/343 (41%), Gaps = 45/343 (13%)
Query: 39 ERSLSLLKEHDARRQQRILAGVD----LPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVD 94
E S+ ++ D R Q + + V +P+ + Y + IGTP + + +D
Sbjct: 55 EESVLQMQAKDQARLQFLSSLVARKSVVPIASGRQIVQSPTYIVRAKIGTPAQTMLLAMD 114
Query: 95 TGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTA 154
T +D W+ C C +G T+++ S+T K V C+ C V P + C
Sbjct: 115 TSNDAAWIPCSGC--------VGCSSTVFNNVKSTTFKTVGCEAPQCKQV---PNSKC-G 162
Query: 155 NTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNE 214
++C + YG SS QDVV T + S FGC +G +
Sbjct: 163 GSACAFNMTYGS-SSIAANLSQDVVTL--------ATDSIPSYTFGCLTEATG-----SS 208
Query: 215 EALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLD---GINGGGIFAIGHVVQPE-VNKT 270
G++G G+ S++SQ + + F++CL +N G +G V QP+ + T
Sbjct: 209 IPPQGLLGLGRGPMSLLSQ--TQNLYQSTFSYCLPSFRSLNFSGSLRLGPVGQPKRIKTT 266
Query: 271 PLVPNQPH---YSINMTAVQVGLDFLNLPTD--VFGVGDNKGTIIDSGTTLAYLPEMVYE 325
PL+ N Y +N+ A++VG +++P F GTI DSGT L Y
Sbjct: 267 PLLKNPRRSSLYYVNLMAIRVGRRVVDIPPSALAFNPTTGAGTIFDSGTVFTRLVAPAYT 326
Query: 326 PLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFE 368
+ + + V ++ TC Y+ + P +TF F
Sbjct: 327 AVRDAFRKRVGNATVTSLGGFDTC--YTSPIVA--PTITFMFS 365
>gi|356555807|ref|XP_003546221.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 457
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 95/337 (28%), Positives = 137/337 (40%), Gaps = 48/337 (14%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G YY KIG+GTP K + + VDTGS + W +QC+ C + ++ ++ S T
Sbjct: 103 GSGNYYVKIGVGTPAKYFSMIVDTGSSLSW---LQCQPCVIYCHVQVD-PIFTPSVSKTY 158
Query: 132 KFVTCDQEFCHGVYGGPLTD--CT-ANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDL 188
K ++C C + L C+ A +C Y YGD S + GY QDV+
Sbjct: 159 KALSCSSSQCSSLKSSTLNAPGCSNATGACVYKASYGDTSFSIGYLSQDVLTLTP----- 213
Query: 189 QTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL 248
+ + + ++GCG G + GIIG SM+ QL++ G F++CL
Sbjct: 214 -SAAPSSGFVYGCGQDNQGLFGRS-----AGIIGLANDKLSMLGQLSNKYG--NAFSYCL 265
Query: 249 DGINGG-------GIFAIGHVVQPEV--NKTPLV--PNQPH-YSINMTAVQVGLDFLNLP 296
G +IG TPLV P P Y + +T + V P
Sbjct: 266 PSSFSAQPNSSVSGFLSIGASSLSSSPYKFTPLVKNPKIPSLYFLGLTTITVA----GKP 321
Query: 297 TDVFGVGDNKGTIIDSGTTLAYLPEMVYEPL-------VSKIISQQPDLKVHTVHDEYTC 349
V N TIIDSGT + LP +Y L +SK +Q P + TC
Sbjct: 322 LGVSASSYNVPTIIDSGTVITRLPVAIYNALKKSFVMIMSKKYAQAPGFSILD-----TC 376
Query: 350 FQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFED 386
F+ S P + F L++ H L E
Sbjct: 377 FKGSVKEMSTVPEIRIIFRGGAGLELKVHNSLVEIEK 413
>gi|297819828|ref|XP_002877797.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297323635|gb|EFH54056.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 530
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 89/333 (26%), Positives = 148/333 (44%), Gaps = 34/333 (10%)
Query: 75 LYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRR-SSLGIE----LTLYDIKDSS 129
L+YA + +GTP + V +DTGS++ W+ C C R +G+ L LY SS
Sbjct: 102 LHYANVSVGTPATWFLVALDTGSNLFWLPCNCGSTCIRDLKDIGLSQSRPLNLYSPNTSS 161
Query: 130 TGKFVTCDQEFCHGVYGGPLTDCTANTSCPY-LEIYGDGSSTTGYFVQDVVQYDKVSGDL 188
T + C+ + C G + +SCPY ++ + TTG +DV+ V+ D+
Sbjct: 162 TSSSIRCNDDRCFGSS----QCSSPASSCPYQIQYLSKDTFTTGTLFEDVLHL--VTEDV 215
Query: 189 QTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL 248
++ GCG Q+G L S+ A++G++G G + S+ S LA + F+ C
Sbjct: 216 DLKPVKANITLGCGRNQTGFLQSS--AAINGLLGLGMKDYSVPSILAKAKITANSFSMCF 273
Query: 249 -DGINGGGIFAIGHVVQPEVNKTPLVPNQPH--YSINMTAVQVGLDFLNLPTDVFGVGDN 305
+ I+ G + G + +TPL+P +P Y++N+T V VG
Sbjct: 274 GNIIDVIGRISFGDKGYTDQMETPLLPTEPSPTYAVNVTEVS---------VGGDVVGVQ 324
Query: 306 KGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYT---CFQYS-ESVDEGFP 361
+ D+GT+ +L E Y L++K K + E C+ S S FP
Sbjct: 325 LLALFDTGTSFTHLLEPEYG-LITKAFDDHVTDKRRPIDPEIPFEFCYDLSPNSTTILFP 383
Query: 362 NVTFHFENSVSLKVYPHEYLFPFED---LWCIG 391
V FE + + ++ ED ++C+G
Sbjct: 384 RVAMTFEGGSLMFLRNPLFIVWNEDNTAMYCLG 416
>gi|357439021|ref|XP_003589787.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355478835|gb|AES60038.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 456
Score = 81.6 bits (200), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 84/337 (24%), Positives = 141/337 (41%), Gaps = 41/337 (12%)
Query: 67 SSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIK 126
S +G G Y+ +IGIG+P Y+ +D+GSDI+W+ C C +C ++ +++
Sbjct: 120 SGTEEGSGEYFVRIGIGSPAIYQYMVIDSGSDIVWIQCEPCDQCYNQTD-----PIFNPA 174
Query: 127 DSSTGKFVTCDQEFCHGVYGGPLTDCTA--NTSCPYLEIYGDGSSTTGYFVQDVVQYDKV 184
S++ V C C+ L D A C Y YGDGS T G + + +
Sbjct: 175 TSASFIGVACSSNVCN-----QLDDDVACRKGRCGYQVAYGDGSYTKGTLALETITIGRT 229
Query: 185 SGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQL-ASSGGVRKM 243
+Q T+ GCG G G S + QL A +GG
Sbjct: 230 V--IQDTA------IGCGHWNEGMFVGAAGLLGL-----GGGPMSFVGQLGAQTGGA--- 273
Query: 244 FAHCLDGINGGGIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVG 303
F +CL +G + P ++ P P+ Y ++++ + VG + + +F +
Sbjct: 274 FGYCLV----SRAMPVGAMWVPLIHN-PFYPS--FYYVSLSGLAVGGIRVPISEQIFQLT 326
Query: 304 D--NKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDL-KVHTVHDEYTCFQYSESVDEGF 360
D G ++D+GT + LP + Y I+Q +L + V TC+ + V
Sbjct: 327 DIGTGGVVMDTGTAITRLPTVAYNAFRDAFIAQTTNLPRAPGVSIFDTCYDLNGFVTVRV 386
Query: 361 PNVTFHFENSVSLKVYPHEYLFPFEDL--WCIGWQNS 395
P V+F+F L +L P +D+ +C + S
Sbjct: 387 PTVSFYFSGGQILTFPARNFLIPADDVGTFCFAFAPS 423
>gi|302783200|ref|XP_002973373.1| hypothetical protein SELMODRAFT_98841 [Selaginella moellendorffii]
gi|300159126|gb|EFJ25747.1| hypothetical protein SELMODRAFT_98841 [Selaginella moellendorffii]
Length = 389
Score = 81.6 bits (200), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 90/318 (28%), Positives = 135/318 (42%), Gaps = 43/318 (13%)
Query: 80 IGIGTPPK--DYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELT---LYDIKDSSTGKFV 134
+ +GTPP+ ++ + VD+G WV C SS I T L+ S++ +
Sbjct: 3 LSLGTPPQPLNFTLAVDSG--FSWVAC--------SSSCAINCTTASLFQPGLSTSHTKL 52
Query: 135 TCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTN 194
C C + T C ++SC Y YG S+ G V D+ D V +
Sbjct: 53 PCGSPSCSA-FSAVSTSCGPSSSCSYNTSYGTNFSSAGDLVSDIATMDSV----RNRKVA 107
Query: 195 GSLIFGCGARQSGNLDSTNEEALD--GIIGFGKSNSSMISQLASSGGVRKMFAHCLDGIN 252
+L GCG R SG L E LD G +GF K N S + QL S+ G R F +CL
Sbjct: 108 ANLSLGCG-RDSGGL----LELLDTSGFVGFDKGNVSFMGQL-SALGYRSKFIYCLPSDT 161
Query: 253 GGGIFAIGHV------VQPEVNKTPLVPNQPH---YSINMTAVQVGLDFLNLPTDVFGVG 303
G IG+ + + TP++ N Y IN++ + + + +P F
Sbjct: 162 FRGKLVIGNYKLRNASISSSMAYTPMITNPQAAELYFINLSTISIDKNKFQVPIQGFLSN 221
Query: 304 DNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDL--KVHTVHDEYTC-FQYSESVDEGF 360
GT+ID+ T L+YL Y LV I + +L +V D Y+ S + F
Sbjct: 222 GTGGTVIDTTTFLSYLTSDFYTQLVQAIKNYTTNLVEVSSSVADALGVELCYNISANSDF 281
Query: 361 P---NVTFHFENSVSLKV 375
P +T+HF ++V
Sbjct: 282 PPPATLTYHFLGGAGVEV 299
>gi|357118398|ref|XP_003560942.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 478
Score = 81.6 bits (200), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 88/307 (28%), Positives = 120/307 (39%), Gaps = 45/307 (14%)
Query: 76 YYAKIGIGTP-PKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFV 134
Y + IGTP P+ + +DTGSD++W C C C +D S T V
Sbjct: 100 YLIHLSIGTPRPQRVALTLDTGSDLVWTQC-ACHVC-----FAQPFPTFDALASQTTLAV 153
Query: 135 TCDQEFC-HGVYGGPLTDCTAN-TSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTS 192
C C G Y PL+ CT N +C YL Y D S T+G V+D + G+ + +
Sbjct: 154 PCSDPICTSGKY--PLSGCTFNDNTCFYLYDYADKSITSGRIVEDTFTFRSPQGNNGSKA 211
Query: 193 TNGSLI----FGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL 248
G + FGCG G S NE GI GF + S+ SQL + F+HC
Sbjct: 212 HAGVAVPNVRFGCGQYNKGIFKS-NES---GIAGFSRGPMSLPSQLKVA-----RFSHCF 262
Query: 249 DGINGGGIFAI------------GHVVQPEVNKTPLV-PNQPHYSINMTAVQVGLDFLNL 295
I + H P V TP N Y + + + VG L L
Sbjct: 263 TAIADARTSPVFLGGAPGPDNLGAHATGP-VQSTPFANSNGSLYYLTLKGITVGKTRLPL 321
Query: 296 PTDVFGVGDNKGT----IIDSGTTLAYLPEMVYEPLVSKIIS--QQPDLKVHTVHDEYT- 348
F IIDSGT + LP +Y L + ++ + P E T
Sbjct: 322 NALAFAGKGTGSGSGGTIIDSGTGIRTLPGPMYRSLRAAFVARVKLPVANESAADAESTL 381
Query: 349 CFQYSES 355
CF+ + S
Sbjct: 382 CFEAARS 388
>gi|357155293|ref|XP_003577072.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
At2g35615-like [Brachypodium distachyon]
Length = 429
Score = 81.6 bits (200), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 96/353 (27%), Positives = 156/353 (44%), Gaps = 36/353 (10%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCK-ECPRRSSLGIELTLYDIKDSSTGK 132
G ++ I +GTPP V VDTGS + WV C +C+ C ++ +++D S+T +
Sbjct: 73 GKFFMDISLGTPPVANLVTVDTGSTLSWVVCQRCQISC--HTTAPEAGSVFDPDKSTTYE 130
Query: 133 FVTCDQEFCHGVYG---GPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQ 189
V C C V P +C Y YG G S G + + DK++
Sbjct: 131 LVGCSSRDCADVQRSLVAPFGCIEETDTCLYSLRYGSGPS--GQYSAGRLGTDKLTLASS 188
Query: 190 TTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLD 249
++ +G IFGC D + + G+IGFG +N S +Q+A R F++C
Sbjct: 189 SSIIDG-FIFGCSG------DDSFKGYESGVIGFGGANFSFFNQVARQTNYRA-FSYCFP 240
Query: 250 GINGG-GIFAIGHVVQPEVNKTPLVP---NQPHYSINMTAVQVGLDFLNLPTDVFGVGDN 305
G + G +IG + E+ T L+P ++ YS+ + V + L + +
Sbjct: 241 GDHTAEGFLSIGAYPKDELVYTNLIPHFGDRSVYSLQQIDMMVDGNRLQVDQSEY---TK 297
Query: 306 KGTIIDSGTTLAYLPEMVYEPLVSKIIS--QQPDLKVHTVHDEYTCFQYS--ESVDEG-F 360
+ ++DSGT +L V++ + S Q TV E TCF+ + +SVD G
Sbjct: 298 RMMVVDSGTVDTFLLGPVFDAFSKAMASAMQAKGFLSDTVGTE-TCFRPNGGDSVDSGDL 356
Query: 361 PNVTFHFENSVSLKVYPHEY---LFPFEDLWCIGWQNSGMQSRDRKNMTLLGD 410
P V F + +LK+ P L P D C+ ++ R N+ +LG+
Sbjct: 357 PTVEMRFIGT-TLKLPPENVFHDLLPSHDKICLAFKPDVAGVR---NVQILGN 405
>gi|195638734|gb|ACG38835.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 465
Score = 81.6 bits (200), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 93/320 (29%), Positives = 141/320 (44%), Gaps = 40/320 (12%)
Query: 60 VDLPLG-GSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQC-KECPRRSSLG 117
+PLG G+S GVG Y ++G+GTP K Y + VDTGS + W+ C C C R+S
Sbjct: 112 ASVPLGPGTSV--GVGNYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSG-- 167
Query: 118 IELTLYDIKDSSTGKFVTCDQEFCHGVYGGPL--TDCTANTSCPYLEIYGDGSSTTGYFV 175
+++ K SS+ V+C + C + L C+ + C Y YGD S + GY
Sbjct: 168 ---PVFNPKASSSYASVSCSAQQCSDLTTATLNPASCSTSNVCIYQASYGDSSFSVGYLS 224
Query: 176 QDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLA 235
+D V + S + +GCG G + G+IG ++ S++ QLA
Sbjct: 225 KDTVSFGSTSVP--------NFYYGCGQDNEGLFGQSA-----GLIGLARNKLSLLYQLA 271
Query: 236 SSGGVRKMFAHCLDGINGGGIFAIGHVV-QP-EVNKTPLVPNQ---PHYSINMTAVQVGL 290
S G F++CL + + P + + TP+ + Y I MT ++V
Sbjct: 272 PSMGYS--FSYCLPTSSSSSSGYLSIGSYNPGQYSYTPMASSSLDDSLYFIKMTGIKVAG 329
Query: 291 DFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPL---VSKIISQQPDLKVHTVHDEY 347
L + + TIIDSGT + LP VY L V+ + P ++ D
Sbjct: 330 KPL---SVSSSAYSSLPTIIDSGTVITRLPTGVYSALSKAVAGAMKGTPRASAFSILD-- 384
Query: 348 TCFQYSESVDEGFPNVTFHF 367
TCFQ ++ P VT F
Sbjct: 385 TCFQ-GQAARLRVPEVTMAF 403
>gi|356532674|ref|XP_003534896.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 446
Score = 81.6 bits (200), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 93/366 (25%), Positives = 146/366 (39%), Gaps = 44/366 (12%)
Query: 42 LSLLKEHDARRQQRILAGVDLPL------GGSSRPDGVG-LYYAKIGIGTPPKDYYVQVD 94
+ L EH A R I A ++ L S P G + IG P V +D
Sbjct: 60 MELDIEHSAARLAYIQARIEGSLVYNNDYTASVSPSLTGRTILVNLSIGQPSIPQLVVMD 119
Query: 95 TGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTA 154
TGSDI+W+ C C C L L+D SST C G C
Sbjct: 120 TGSDILWIMCNPCTNCDNHLGL-----LFDPSMSSTFS------PLCKTPCGFKGCKCDP 168
Query: 155 NTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNE 214
P+ Y D SS +G F +D++ ++ + TS +I GCG N+ ++
Sbjct: 169 ---IPFTISYVDNSSASGTFGRDILVFETTD---EGTSQISDVIIGCGH----NIGFNSD 218
Query: 215 EALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL----DGINGGGIFAIGHVVQPEVNKT 270
+GI+G +S+ +Q+ + F++C+ D +G E T
Sbjct: 219 PGYNGILGLNNGPNSLATQIG------RKFSYCIGNLADPYYNYNQLRLGEGADLEGYST 272
Query: 271 PLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDN--KGTIIDSGTTLAYLPEMVYEPL- 327
P Y + M + VG L++ + F + N G I+DSGTT+ YL + ++ L
Sbjct: 273 PFEVYHGFYYVTMEGISVGEKRLDIALETFEMKRNGTGGVILDSGTTITYLVDSAHKLLY 332
Query: 328 --VSKIISQQPDLKVHTVHDEYTCFQYSESVD-EGFPNVTFHFENSVSLKVYPHEYLFPF 384
V ++ + C+ S D GFP VTFHF + L + +
Sbjct: 333 NEVRNLLKWSFRQVIFENAPWKLCYYGIISRDLVGFPVVTFHFVDGADLALDTGSFFSQR 392
Query: 385 EDLWCI 390
+D++C+
Sbjct: 393 DDIFCM 398
>gi|115476830|ref|NP_001062011.1| Os08g0469100 [Oryza sativa Japonica Group]
gi|42407408|dbj|BAD09566.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113623980|dbj|BAF23925.1| Os08g0469100 [Oryza sativa Japonica Group]
Length = 373
Score = 81.6 bits (200), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 85/336 (25%), Positives = 138/336 (41%), Gaps = 48/336 (14%)
Query: 80 IGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQE 139
+GI P K + VDTGSD++W C + G +YD +SST F+ C
Sbjct: 20 VGIVQPRK---LIVDTGSDLIWTQCKLSSSTAAAARHG-SPPVYDPGESSTFAFLPCSDR 75
Query: 140 FCH-GVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLI 198
C G + +CT+ C Y ++YG ++ G + + G + S L
Sbjct: 76 LCQEGQFS--FKNCTSKNRCVYEDVYGSAAAV-GVLASETFTF----GARRAVSLR--LG 126
Query: 199 FGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL----DGINGG 254
FGCGA +G+L GI+G + S+I+QL + F++CL D
Sbjct: 127 FGCGALSAGSLIGAT-----GILGLSPESLSLITQLKI-----QRFSYCLTPFADKKTSP 176
Query: 255 GIFAI-----GHVVQPEVNKTPLVPN---QPHYSINMTAVQVGLDFLNLPTDVFGVGDNK 306
+F H + T +V N +Y + + + +G L +P + +
Sbjct: 177 LLFGAMADLSRHKTTRPIQTTAIVSNPVETVYYYVPLVGISLGHKRLAVPAASLAMRPDG 236
Query: 307 G--TIIDSGTTLAYLPEMVYEPLVSKIIS-QQPDLKVHTVHDEYTCFQYSESVDEG---- 359
G TI+DSG+T+AYL E +E + ++ + + TV D CF
Sbjct: 237 GGGTIVDSGSTVAYLVEAAFEAVKEAVMDVVRLPVANRTVEDYELCFVLPRRTAAAAMEA 296
Query: 360 --FPNVTFHFENSVSLKVYPHEYLF--PFEDLWCIG 391
P + HF+ ++ V P + F P L C+
Sbjct: 297 VQVPPLVLHFDGGAAM-VLPRDNYFQEPRAGLMCLA 331
>gi|18391062|ref|NP_563851.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|2160166|gb|AAB60729.1| F21M12.13 gene product [Arabidopsis thaliana]
gi|21593996|gb|AAM65914.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|26983826|gb|AAN86165.1| unknown protein [Arabidopsis thaliana]
gi|332190367|gb|AEE28488.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 449
Score = 81.6 bits (200), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 89/317 (28%), Positives = 137/317 (43%), Gaps = 42/317 (13%)
Query: 73 VGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGK 132
+G Y + +GTPP+ ++ +DT +D +W+ C C C S+ ST
Sbjct: 101 IGNYVVRAKLGTPPQLMFMVLDTSNDAVWLPCSGCSGC---SNASTSFNTNSSSTYST-- 155
Query: 133 FVTCDQEFCHGVYGGPLTDCTANTS-CPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
V+C C G + S C + + YG SS + VQD + ++ D+
Sbjct: 156 -VSCSTAQCTQARGLTCPSSSPQPSVCSFNQSYGGDSSFSASLVQDTL---TLAPDVIP- 210
Query: 192 STNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLAS--SGGVRKMFAHCLD 249
+ FGC SG N G++G G+ S++SQ S SG +F++CL
Sbjct: 211 ----NFSFGCINSASG-----NSLPPQGLMGLGRGPMSLVSQTTSLYSG----VFSYCLP 257
Query: 250 GING---GGIFAIGHVVQPE-VNKTPLV--PNQPH-YSINMTAVQVGLDFLNLPTD---- 298
G +G + QP+ + TPL+ P +P Y +N+T V VG + +P D
Sbjct: 258 SFRSFYFSGSLKLGLLGQPKSIRYTPLLRNPRRPSLYYVNLTGVSVG--SVQVPVDPVYL 315
Query: 299 VFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDE 358
F GTIIDSGT + + VYE + + Q T+ TCF S +
Sbjct: 316 TFDANSGAGTIIDSGTVITRFAQPVYEAIRDEFRKQVNVSSFSTLGAFDTCF--SADNEN 373
Query: 359 GFPNVTFHFENSVSLKV 375
P +T H S+ LK+
Sbjct: 374 VAPKITLHM-TSLDLKL 389
>gi|242069057|ref|XP_002449805.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
gi|241935648|gb|EES08793.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
Length = 430
Score = 81.6 bits (200), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 83/333 (24%), Positives = 133/333 (39%), Gaps = 57/333 (17%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G Y ++ IGTPP + DTGSD+ W C CK C G + +YD SS+
Sbjct: 79 GQAEYLMELAIGTPPVPFIALADTGSDLTWTQCKPCKLC-----FGQDTPIYDTTTSSSF 133
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
+ C C ++ + T + +C Y Y DG+ Y +
Sbjct: 134 SPLPCSSATCLPIWSSRCS--TPSATCRYRYAYDDGA------------YSPECAGISV- 178
Query: 192 STNGSLIFGCGARQSG-NLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLD- 249
G + FGCG G + +ST G +G G+ + S+++QL GV K F++CL
Sbjct: 179 ---GGIAFGCGVDNGGLSYNST------GTVGLGRGSLSLVAQL----GVGK-FSYCLTD 224
Query: 250 ------------GINGGGIFAIGHVVQPEVNKTPLV--PNQP-HYSINMTAVQVGLDFLN 294
G + V TPLV P P Y +++ + +G L
Sbjct: 225 FFNTSLSSPVFFGSLAELAASSASADAAVVQSTPLVQSPYNPSRYYVSLEGISLGDARLP 284
Query: 295 LPTDVFGVGDNKGT---IIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQ 351
+P F + D+ G+ I+DSGT L E + +V + V+ + CF
Sbjct: 285 IPNGTFDLNDDDGSGGMIVDSGTIFTILVETGFRVVVDHVAGVLGQPVVNASSLDRPCFP 344
Query: 352 YSES-VDE--GFPNVTFHFENSVSLKVYPHEYL 381
+ V E P++ HF ++++ Y+
Sbjct: 345 APAAGVQELPDMPDMVLHFAGGADMRLHRDNYM 377
>gi|297852200|ref|XP_002893981.1| hypothetical protein ARALYDRAFT_314121 [Arabidopsis lyrata subsp.
lyrata]
gi|297339823|gb|EFH70240.1| hypothetical protein ARALYDRAFT_314121 [Arabidopsis lyrata subsp.
lyrata]
Length = 354
Score = 81.3 bits (199), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 70/230 (30%), Positives = 97/230 (42%), Gaps = 25/230 (10%)
Query: 57 LAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNC-IQCKECPRRSS 115
L+ V LPL G+ P +G Y + IGTPPK + +DTGSD+ WV C C C
Sbjct: 37 LSSVVLPLSGNVFP--LGYYSVLLQIGTPPKAFEFDIDTGSDLTWVQCDAPCTGC----- 89
Query: 116 LGIELTLYDIKD-SSTGKFVTCDQEFCHGVYGGPLTDC-TANTSCPYLEIYGDGSSTTGY 173
TL I+ G V C C ++ C C Y Y D S+ G
Sbjct: 90 -----TLPPIRQYKPKGNTVPCLDPICLALHFPNKPQCPNPKEQCDYEVNYADQGSSMGA 144
Query: 174 FVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQ 233
V D ++G ++ L FGCG Q + A G++G G+ ++ Q
Sbjct: 145 LVIDQFPLKLLNG----SAMQPRLAFGCGYDQILP-KAHPPPATAGVLGLGRGKIGVLPQ 199
Query: 234 LASSGGVRKMFAHCLDGINGGGIFAIGHVVQPE--VNKTPLVPNQPHYSI 281
L ++G R + HCL GGG G + P V TPL+ P Y+
Sbjct: 200 LVAAGLTRNVVGHCLSS-KGGGYLFFGDTLIPTLGVAWTPLL--SPEYTF 246
>gi|302790323|ref|XP_002976929.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
gi|300155407|gb|EFJ22039.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
Length = 373
Score = 81.3 bits (199), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 83/351 (23%), Positives = 144/351 (41%), Gaps = 39/351 (11%)
Query: 82 IGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFC 141
IGTPP++ + VDT S++ WV C C ++ ++ SS+ C C
Sbjct: 5 IGTPPREVLLLVDTASELTWVQGTSCTNCSP-----TKVPPFNPGLSSSFISEPCTSSVC 59
Query: 142 HGVYG-GPLTDCTANT-SCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIF 199
G G + C +T SC + Y DGS G +++ G ST G +IF
Sbjct: 60 LGRSKLGFQSACNRSTGSCSFQVAYLDGSEAYGVIAREIFSLQSWDG---AASTLGDVIF 116
Query: 200 GCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLA--SSGGVRKMFAHCL----DGING 253
GC ++ +L + + G +G + + S +Q+ S G+ F++C + +N
Sbjct: 117 GCASK---DLQRPVDFS-SGTLGLNRGSFSFPAQIGSRSKSGLSDRFSYCFPNRAEHLNS 172
Query: 254 GGIFAIGHVVQPE--------VNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGD- 304
G+ G P + P+ Y + + + VG + L++P F +
Sbjct: 173 SGVIIFGDSGIPAHHFQYLSLEQEPPIASIVDFYYVGLQGISVGGELLHIPRSAFKIDRL 232
Query: 305 -NKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEG---- 359
N GT DSGTT+++L E + LV + L + D Y + +
Sbjct: 233 GNGGTYFDSGTTVSFLVEPAHTALVEAFGRRVLHLNRTSGSDFTKELCYDVAAGDARLPT 292
Query: 360 FPNVTFHFENSVSLKVYPHEYLFPFED-----LWCIGWQNSGMQSRDRKNM 405
P VT HF+N+V +++ P C+ + N+G ++ N+
Sbjct: 293 APLVTLHFKNNVDMELREASVWVPLARTPQVVTICLAFVNAGAVAQGGVNV 343
>gi|357135412|ref|XP_003569303.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 494
Score = 81.3 bits (199), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 92/374 (24%), Positives = 149/374 (39%), Gaps = 65/374 (17%)
Query: 54 QRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRR 113
+R +GV P+ S G G Y+ KIG+GTP + +DTGSD++W+ C C+ C +
Sbjct: 121 RRTGSGVVAPVV-SGLAQGSGEYFTKIGVGTPATPALMVLDTGSDVVWLQCAPCRRCYDQ 179
Query: 114 SSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDC-TANTSCPYLEIYGDGSSTTG 172
S ++D + S + V C C + G C +C Y YGDGS T G
Sbjct: 180 SG-----QVFDPRRSRSYGAVGCSAPLCRRLDSG---GCDLRRKACLYQVAYGDGSVTAG 231
Query: 173 YFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMIS 232
F + + + +G + + GCG G + G+ + S +
Sbjct: 232 DFATETLTF---AGGARVA----RIALGCGHDNEGLFVAAAGLLGL-----GRGSLSFPA 279
Query: 233 QLASSGGVRKMFAHCL-DGINGG-----------GIFAIGHVVQPEVNKTPLVPN---QP 277
Q++ G + F++CL D + G A+G V + TP+V N +
Sbjct: 280 QISRRYG--RSFSYCLVDRTSSANPASHSSTVTFGSGAVGSTV--AASFTPMVKNPRMET 335
Query: 278 HYSINMTAVQVGLDFLNLPTDVFGVGDNK----------GTIIDSGTTLAYLPEMVYEPL 327
Y + + + VG V GV D+ G I+DSGT++ L Y L
Sbjct: 336 FYYVQLVGISVG------GARVSGVADSDLRLDPSSGRGGVIVDSGTSVTRLARPAYSAL 389
Query: 328 VSKIISQQPDLKVH----TVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFP 383
+ L++ ++ D TC+ S P V+ HF + P YL P
Sbjct: 390 RDAFRAAAAGLRLSPGGFSLFD--TCYDLSGRKVVKVPTVSMHFAGGAEAALPPENYLIP 447
Query: 384 FED--LWCIGWQNS 395
+ +C + +
Sbjct: 448 VDSKGTFCFAFAGT 461
>gi|356511197|ref|XP_003524315.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 431
Score = 81.3 bits (199), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 88/342 (25%), Positives = 127/342 (37%), Gaps = 46/342 (13%)
Query: 58 AGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVN----CIQCKECPRR 113
+ + PL G+ P VG Y + IG P + Y++ VDTGSD+ W+ C C E P
Sbjct: 55 SSIVFPLYGNVYP--VGFYNVTLNIGQPARPYFLDVDTGSDLTWLQCDAPCTHCSETPHP 112
Query: 114 SSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGY 173
+ FV C C + +C C Y Y D ST G
Sbjct: 113 ------------LHRPSNDFVPCRDPLCASLQPTEDYNCEHPDQCDYEINYADQYSTYGV 160
Query: 174 FVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQ 233
+ DV + +G + GCG Q + S + +G GK +S+ISQ
Sbjct: 161 LLNDVYLLNSSNG----VQLKVRMALGCGYDQVFSPSSYHPLDGLLGLGRGK--ASLISQ 214
Query: 234 LASSGGVRKMFAHCLDGINGGGIFAIGHVVQPEVNKTPLVP-NQPHYSINMTAVQVGLDF 292
L S G VR + HCL GG IF V TP+ + HYS + G
Sbjct: 215 LNSQGLVRNVIGHCLSSQGGGYIFFGNAYDSARVTWTPISSVDSKHYSAGPAELVFG--- 271
Query: 293 LNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTC--- 349
GVG + + D+G++ Y Y+ L+S + + + D+ T
Sbjct: 272 ----GRKTGVG-SLTAVFDTGSSYTYFNSHAYQALLSWLNKELSGKPLKVAPDDQTLSLC 326
Query: 350 ------FQYSESVDEGFPNVTFHFEN----SVSLKVYPHEYL 381
F V + F V F N ++ P YL
Sbjct: 327 WHGKRPFTSLREVRKYFKPVALSFTNGGRVKAQFEIPPEAYL 368
>gi|348685429|gb|EGZ25244.1| pepsin-like aspartic protease A1 [Phytophthora sojae]
Length = 467
Score = 81.3 bits (199), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 73/298 (24%), Positives = 121/298 (40%), Gaps = 32/298 (10%)
Query: 93 VDTGSDIMWVNCIQCKEC-PRRSSLGIELTLYDIKDSSTGKFVTCDQEFC-HGVYGGPLT 150
+DTGS C+ C C +R LT +++CD+ +G P
Sbjct: 77 IDTGSGKTAFVCVGCNNCGSKRRHEPFVLT-------GNTTYLSCDRSMTLQTSWGEPAC 129
Query: 151 DCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLD 210
N C Y + Y +G + Y D++Q + S + FGC QSG
Sbjct: 130 MACENGKCKYGQTYVEGDHWSAYKASDMMQL--------SPSFEARIEFGCIYEQSGVF- 180
Query: 211 STNEEALDGIIGFGKSNSSMISQLASSGGVR-KMFAHCLDGINGGGIFAIGHV-----VQ 264
++ DGI+GF + S+ Q ++F+ CL GGG+ IG V +
Sbjct: 181 --LDQPSDGIMGFSRHPDSIFEQFYRQKVTHSRIFSQCL--TEGGGMLTIGGVDLTRHTE 236
Query: 265 PEVNKTPLVPN-QPHYSINMTAVQVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMV 323
P V TPL ++++ + +V VG L D + ++G ++DSGTT Y+PE
Sbjct: 237 P-VRYTPLRSTGYQYWTVTLQSVSVGNQSNTLQVDTYEYNADRGCVLDSGTTFLYMPERT 295
Query: 324 YEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYL 381
EP ++ + + T + + P++ F +N V + + P Y
Sbjct: 296 KEPF--RLAWSRAVGSFSYIPQSDTFYSMTPDQVAALPDICFWLKNDVHICLPPSRYF 351
>gi|125558632|gb|EAZ04168.1| hypothetical protein OsI_26310 [Oryza sativa Indica Group]
gi|125600539|gb|EAZ40115.1| hypothetical protein OsJ_24558 [Oryza sativa Japonica Group]
Length = 453
Score = 81.3 bits (199), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 96/380 (25%), Positives = 153/380 (40%), Gaps = 66/380 (17%)
Query: 61 DLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKE-CPRRSSLGIE 119
DLP GG Y + IGTPP+ Y DTGSD++W C C E C ++ S
Sbjct: 85 DLPNGGE--------YIMTLAIGTPPQSYPAIADTGSDLVWTQCAPCGERCFKQPS---- 132
Query: 120 LTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCT--ANTSCPYLEIYGDGSSTTGYFVQD 177
LY+ S T + + C L T +C Y + YG G T+G +
Sbjct: 133 -PLYNPSSSPTFRVLPCSSALNLCAAEARLAGATPPPGCACRYNQTYGTG-WTSGLQGSE 190
Query: 178 VVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASS 237
+ D + FGC N S + G++G G+ S++SQLA+
Sbjct: 191 TFTFGSSPADQVRVP---GIAFGC-----SNASSDDWNGSAGLVGLGRGGLSLVSQLAAG 242
Query: 238 GGVRKMFAHCLD--------------------GINGGGIFAIGHVVQPEVNKTPLVPNQP 277
MF++CL +NG G+ + V P +K P+
Sbjct: 243 -----MFSYCLTPFQDTKSKSTLLLGPAAAAAALNGTGVRSTPFV--PSPSKPPM---ST 292
Query: 278 HYSINMTAVQVGLDFLNLPTDVFGVGDN--KGTIIDSGTTLAYLPEMVYEPLVSKIIS-- 333
+Y +N+T + VG L +P F + + G IIDSGTT+ L + Y+ + + + S
Sbjct: 293 YYYLNLTGISVGAAALPIPPGAFALRADGTGGLIIDSGTTITSLVDAAYKRVRAAVRSLV 352
Query: 334 QQPDLKVHTVHDEYTCFQY--SESVDEGFPNVTFHFENSVSLKVYPHEYLFPFEDLWCIG 391
+ P CF S + P++T HF + + Y+ +WC+
Sbjct: 353 KLPVTDGSNATGLDLCFALPSSSAPPATLPSMTLHFGGGADMVLPVENYMILDGGMWCL- 411
Query: 392 WQNSGMQSRDRKNMTLLGDF 411
M+S+ ++ LG++
Sbjct: 412 ----AMRSQTDGELSTLGNY 427
>gi|212274713|ref|NP_001130791.1| uncharacterized protein LOC100191895 precursor [Zea mays]
gi|194690124|gb|ACF79146.1| unknown [Zea mays]
gi|194708040|gb|ACF88104.1| unknown [Zea mays]
gi|223950469|gb|ACN29318.1| unknown [Zea mays]
gi|414885521|tpg|DAA61535.1| TPA: hypothetical protein ZEAMMB73_650724 [Zea mays]
Length = 500
Score = 81.3 bits (199), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 87/327 (26%), Positives = 136/327 (41%), Gaps = 57/327 (17%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y A +G+G + V VDT S++ WV C C+ C + + L+D S + V
Sbjct: 143 YVATVGLGG--GEATVIVDTASELTWVQCAPCESCHDQ-----QGPLFDPSSSPSYAAVP 195
Query: 136 CDQEFCHGV---------YGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSG 186
CD C + G P D +C Y Y DGS + G V+ +D++S
Sbjct: 196 CDSPSCDALQQQLATGAGAGAPPCDAGRPAACSYALSYRDGSYSRG-----VLAHDRLS- 249
Query: 187 DLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASS-GGVRKMFA 245
L +G +FGCG G G++G G+S S++SQ GGV F+
Sbjct: 250 -LAGEVIDG-FVFGCGTSNQG----PPFGGTSGLMGLGRSQLSLVSQTVDQFGGV---FS 300
Query: 246 HCL---DGINGGGIFAIGHVVQPEVNKTPLVPNQ-----------PHYSINMTAVQVGLD 291
+CL + G +G N TP+V P Y +N+T + VG
Sbjct: 301 YCLPLSRESDASGSLVLGDDPSAYRNSTPVVYTSMVSNSDPLLQGPFYLVNLTGITVG-- 358
Query: 292 FLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQ---QPDLKVHTVHDEYT 348
+V G + I+DSGT + L VY + ++ +SQ P ++ D T
Sbjct: 359 ----GQEVESTGFSARAIVDSGTVITSLVPSVYNAVRAEFMSQLAEYPQAPGFSILD--T 412
Query: 349 CFQYSESVDEGFPNVTFHFENSVSLKV 375
CF + + P++T F+ ++V
Sbjct: 413 CFNMTGLKEVQVPSLTLVFDGGAEVEV 439
>gi|297843774|ref|XP_002889768.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
lyrata]
gi|297335610|gb|EFH66027.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
lyrata]
Length = 449
Score = 81.3 bits (199), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 91/317 (28%), Positives = 139/317 (43%), Gaps = 43/317 (13%)
Query: 73 VGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGK 132
+G Y + +GTPP+ ++ +DT +D +W+ C C C S+ ST
Sbjct: 102 IGNYVVRARLGTPPQLMFMVLDTSNDAVWLPCSGCSGC---SNASTSFNTNSSSTYST-- 156
Query: 133 FVTCDQEFCHGVYGGPLTDCTANTS-CPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
V+C C G T S C + + YG SS + VQD + +S D+
Sbjct: 157 -VSCSTTQCTQARGLTCPSSTPQPSICSFNQSYGGDSSFSANLVQDTL---TLSPDVIP- 211
Query: 192 STNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLAS--SGGVRKMFAHCLD 249
+ FGC SG N G++G G+ S++SQ S SG +F++CL
Sbjct: 212 ----NFSFGCINSASG-----NSLPPQGLMGLGRGPMSLVSQTTSLYSG----VFSYCLP 258
Query: 250 GING---GGIFAIGHVVQPE-VNKTPLV--PNQPH-YSINMTAVQVGLDFLNLPTD---- 298
G +G + QP+ + TPL+ P +P Y +N+T V VG + +P D
Sbjct: 259 SFRSFYFSGSLKLGLLGQPKSIRYTPLLRNPRRPSLYYVNLTGVSVG--SVQVPVDPVYL 316
Query: 299 VFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDE 358
F GTIIDSGT + + VYE + + +Q + T+ TCF S +
Sbjct: 317 TFDSNSGAGTIIDSGTVITRFAQPVYEAIRDE-FRKQVNGSFSTLGAFDTCF--SADNEN 373
Query: 359 GFPNVTFHFENSVSLKV 375
P +T H S+ LK+
Sbjct: 374 VTPKITLHMT-SLDLKL 389
>gi|356527091|ref|XP_003532147.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 482
Score = 80.9 bits (198), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 85/343 (24%), Positives = 133/343 (38%), Gaps = 37/343 (10%)
Query: 44 LLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVN 103
L K + + L LP S R G YY +G+GTP +D + DTGS + W
Sbjct: 109 LSKNLGGENRVKELDSTTLP-AKSGRLIGSADYYVVVGLGTPKRDLSLIFDTGSYLTWTQ 167
Query: 104 CIQCK-ECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTANTSCPYLE 162
C C C ++ ++D SS+ + C C + T + SC Y
Sbjct: 168 CEPCAGSCYKQQD-----PIFDPSKSSSYTNIKCTSSLCTQFRSAGCSSST-DASCIYDV 221
Query: 163 IYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIG 222
YGD S + G+ Q+ + + T +FGCG G T G++G
Sbjct: 222 KYGDNSISRGFLSQERLT-------ITATDIVHDFLFGCGQDNEGLFRGT-----AGLMG 269
Query: 223 FGKSNSSMISQLASSGGVRKMFAHCLDGIN---GGGIFAIGHVVQPEVNKTPLVP---NQ 276
+ S + Q +S K+F++CL G F + TP
Sbjct: 270 LSRHPISFVQQTSSI--YNKIFSYCLPSTPSSLGHLTFGASAATNANLKYTPFSTISGEN 327
Query: 277 PHYSINMTAVQVGLDFL-NLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQ 335
Y +++ + VG L + + F G G+IIDSGT + LP Y L S +Q
Sbjct: 328 SFYGLDIVGISVGGTKLPAVSSSTFSAG---GSIIDSGTVITRLPPTAYAALRSAF--RQ 382
Query: 336 PDLKVHTVHDEY---TCFQYSESVDEGFPNVTFHFENSVSLKV 375
+K + TC+ +S + P + F F V +++
Sbjct: 383 FMMKYPVAYGTRLLDTCYDFSGYKEISVPRIDFEFAGGVKVEL 425
>gi|15238250|ref|NP_196637.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|8979710|emb|CAB96831.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
thaliana]
gi|18176136|gb|AAL59990.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
gi|22136986|gb|AAM91722.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
gi|110740988|dbj|BAE98588.1| nucleoid DNA-binding protein cnd41 - like protein [Arabidopsis
thaliana]
gi|332004210|gb|AED91593.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 464
Score = 80.9 bits (198), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 80/314 (25%), Positives = 125/314 (39%), Gaps = 41/314 (13%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y IGIGTP D + DTGSD+ W C C S + ++ SST
Sbjct: 128 GSGNYIVTIGIGTPKHDLSLVFDTGSDLTWTQCEPC----LGSCYSQKEPKFNPSSSSTY 183
Query: 132 KFVTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTT 191
+ V+C C C+A ++C Y +YGD S T G+ ++ + T
Sbjct: 184 QNVSCSSPMCEDA-----ESCSA-SNCVYSIVYGDKSFTQGFLAKE-----------KFT 226
Query: 192 STNGSLI----FGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHC 247
TN ++ FGCG G D + ++ +F++C
Sbjct: 227 LTNSDVLEDVYFGCGENNQGLFDGVAGLLG-------LGPGKLSLPAQTTTTYNNIFSYC 279
Query: 248 LDGI--NGGGIFAIGHV-VQPEVNKTPL--VPNQPHYSINMTAVQVGLDFLNLPTDVFGV 302
L N G G + V TP+ P+ +Y I++ + VG L + + F
Sbjct: 280 LPSFTSNSTGHLTFGSAGISESVKFTPISSFPSAFNYGIDIIGISVGDKELAITPNSFST 339
Query: 303 GDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEY-TCFQYSESVDEGFP 361
+G IIDSGT LP VY L S + K + + + TC+ ++ +P
Sbjct: 340 ---EGAIIDSGTVFTRLPTKVYAELRSVFKEKMSSYKSTSGYGLFDTCYDFTGLDTVTYP 396
Query: 362 NVTFHFENSVSLKV 375
+ F F S +++
Sbjct: 397 TIAFSFAGSTVVEL 410
>gi|255587337|ref|XP_002534234.1| pepsin A, putative [Ricinus communis]
gi|223525662|gb|EEF28148.1| pepsin A, putative [Ricinus communis]
Length = 468
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 93/394 (23%), Positives = 158/394 (40%), Gaps = 66/394 (16%)
Query: 33 YRYAGRERSLSLLKEHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQ 92
+ Y + S+ + H + + + + PL S G Y + +GTP + +
Sbjct: 45 WEYLNHLATTSISRAHHLKSPKTNFSLIKTPLFSRS----YGGYSMSLSLGTPSQTVKLI 100
Query: 93 VDTGSDIMWVNCIQ---CKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGG-- 147
+DTGS ++W C C C ++ ++ + + SS+ K + C C V+G
Sbjct: 101 MDTGSSLVWFPCTSRYVCASCNFPNTDITKIPKFMPRLSSSSKLIGCKNPKCAWVFGSSV 160
Query: 148 --------PLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIF 199
P PY+ YG GS T G + + + + T +
Sbjct: 161 QSKCHNCNPQAQNCTQACPPYIIQYGLGS-TAGLLLSETINFPN--------KTISDFLA 211
Query: 200 GCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL-------DGIN 252
GC +L ST + +GI GFG+S S+ QL G++K F++CL ++
Sbjct: 212 GC------SLLSTRQP--EGIAGFGRSQESLPLQL----GLKK-FSYCLVSRRFDDSPVS 258
Query: 253 GGGIFAIGHVVQPE----VNKTPLVPN---------QPHYSINMTAVQVGLDFLNLPTD- 298
I +G ++ TP N Q +Y + + + VG + +P
Sbjct: 259 SDLILDMGPSTSDSKTTGLSYTPFQKNLASQSNPAFQEYYYVMLRKIIVGKTHVKVPYSF 318
Query: 299 -VFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYT----CFQYS 353
V G N GTI+DSG+T ++ V+E L + Q + V T + T CF S
Sbjct: 319 LVPGSDGNGGTIVDSGSTFTFVEGHVFELLAKEFEKQMANYTVATNVQKLTGLRPCFDIS 378
Query: 354 ESVDEGFPNVTFHFENSVSLKVYPHEYLFPFEDL 387
P++TF F+ +++ P F F D+
Sbjct: 379 GEKSVVIPDLTFQFKGGAKMQL-PLSNYFAFVDM 411
>gi|449522369|ref|XP_004168199.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Cucumis sativus]
Length = 457
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 88/337 (26%), Positives = 142/337 (42%), Gaps = 64/337 (18%)
Query: 70 PDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQ---CKEC--PRRSSLGIELTLYD 124
P G Y + GTP + ++ DTGS ++W C C EC P+ GI +
Sbjct: 75 PHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKIDPTGIP--RFV 132
Query: 125 IKDSSTGKFVTCDQEFCHGVYG----------GPLTDCTANTSCPYLEIYGDGSSTTGYF 174
K SS+ K V C C ++G P T+ T Y+ YG G ST G
Sbjct: 133 PKLSSSSKLVGCQNPKCSWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSG-STAGLL 191
Query: 175 VQDVVQY-DKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQ 233
+ + + + DK + + GC + S ++ + GI GFG+ + S+ SQ
Sbjct: 192 LSETLDFPDKXIPN---------FVVGC------SFLSIHQPS--GIAGFGRGSESLPSQ 234
Query: 234 LASSGGVRKMFAHCLDG-------------INGGGIFAIGHVVQPEVNKTPLVPN---QP 277
+ G++K FA+CL ++ G+ + G P + P V N +
Sbjct: 235 M----GLKK-FAYCLASRKFDDSPHSGQLILDSTGVKSSGLTYTP-FRQNPSVSNNAYKE 288
Query: 278 HYSINMTAVQVGLDFLNLPTD--VFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQ 335
+Y +N+ + VG + +P V G N G+IIDSG+T ++ + V E + + Q
Sbjct: 289 YYYLNIRKIIVGNQAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQL 348
Query: 336 PDLK----VHTVHDEYTCFQYSESVDEGFPNVTFHFE 368
+ V T+ CF S+ FP + F F+
Sbjct: 349 ANWTRATDVETLTGLRPCFDISKEKSVKFPELIFQFK 385
>gi|296090179|emb|CBI39998.3| unnamed protein product [Vitis vinifera]
Length = 334
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 89/338 (26%), Positives = 135/338 (39%), Gaps = 82/338 (24%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKF 133
G Y KI IGTPP D Y DTGSD+MW C+ C C ++ + ++D S++ K
Sbjct: 22 GEYLMKISIGTPPFDVYGIYDTGSDLMWTQCLPCLSCYKQKN-----PMFDPSKSTSFKE 76
Query: 134 VTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTST 193
V+C+ + C L T ++
Sbjct: 77 VSCESQQCRL--------------------------------------------LDTPTS 92
Query: 194 NGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL----- 248
+++FGCG SG NE + G+ G G S+ SQ+ S+ G + F+ CL
Sbjct: 93 ILNIVFGCGHNNSGTF---NENEM-GLFGTGGRPLSLTSQIMSTLGSGRKFSQCLVPFRT 148
Query: 249 -DGINGGGIFAI-GHVVQPEVNKTPLVP--NQPHYSINMTAVQVGLDFLNLPTDVFGVGD 304
I IF V +V TPLV + +Y + + + VG P
Sbjct: 149 DPSITSKIIFGPEAEVSGSDVVSTPLVTKDDPTYYFVTLDGISVGDKL--FPFSSSSPMA 206
Query: 305 NKGTI-IDSGTTLAYLPEMVYEPLVSKIIS-------QQPDLKVHTVHDEYTCFQYSESV 356
KG + ID+GT LP Y LV + Q PDL+ C++ + +
Sbjct: 207 TKGNVFIDAGTPPTLLPRDFYNRLVQGVKEAIPMEPVQDPDLQPQ------LCYRSATLI 260
Query: 357 DEGFPNVTFHFENS-VSLKVYPHEYLFPFEDLWCIGWQ 393
D P +T HF+ + V LK + ++ P E ++C Q
Sbjct: 261 DG--PILTAHFDGADVQLKPL-NTFISPKEGVYCFAMQ 295
>gi|242046218|ref|XP_002460980.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
gi|241924357|gb|EER97501.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
Length = 517
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 96/370 (25%), Positives = 151/370 (40%), Gaps = 55/370 (14%)
Query: 72 GVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTG 131
G G Y + +GTPP+ + + +DTGSD+ W+ C C +C +G ++D SS+
Sbjct: 147 GSGEYLMDVYVGTPPRRFRMIMDTGSDLNWLQCAPCLDC--FDQVG---PVFDPAASSSY 201
Query: 132 KFVTCDQEFCHGVYGG-PLTDC--TANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDL 188
+ VTC + C V P C SCPY YGD S+TTG + + +
Sbjct: 202 RNVTCGDQRCGLVAPPEPPRACRRPGEDSCPYYYWYGDQSNTTGDLALESFTVNLTAPGA 261
Query: 189 QTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL 248
+ ++FGCG G G+ S SQL + G F++CL
Sbjct: 262 SRRVDD--VVFGCGHWNRGLFHGAAGLLGL-----GRGPLSFASQLRAVYG--HTFSYCL 312
Query: 249 ---------DGINGGGIFAIGHVVQPEVNKTPLVPNQP----HYSINMTAVQVGLDFLNL 295
+ G P++N T P Y + + V VG + LN+
Sbjct: 313 VDHGSDVASKVVFGEDDALALAAAHPQLNYTAFAPASSPADTFYYVKLKGVLVGGELLNI 372
Query: 296 PTDVF----GVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQ-------PDLKVHTVH 344
+D + G G + GTIIDSGTTL+Y E Y+ + I + PD V +
Sbjct: 373 SSDTWGVGEGEGGSGGTIIDSGTTLSYFVEPAYQVIRQAFIDRMGRSYPLIPDFPVLS-- 430
Query: 345 DEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFED---LWCIGWQNSGMQSRD 401
C+ S P ++ F + ++ +P E F D + C+ +
Sbjct: 431 ---PCYNVSGVDRPEVPELSLLFADG-AVWDFPAENYFIRLDPDGIMCL-----AVLGTP 481
Query: 402 RKNMTLLGDF 411
R M+++G+F
Sbjct: 482 RTGMSIIGNF 491
>gi|15226315|ref|NP_180368.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4510415|gb|AAD21501.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252975|gb|AEC08069.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 396
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 84/313 (26%), Positives = 128/313 (40%), Gaps = 57/313 (18%)
Query: 75 LYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFV 134
+Y K+ +GTPP + +DTGS+I W C+ C C +++ ++D SST K
Sbjct: 64 VYLMKLQVGTPPFEIQAIIDTGSEITWTQCLPCVHCYEQNA-----PIFDPSKSSTFKEK 118
Query: 135 TCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTN 194
CD SCPY Y D + T G + + SG+
Sbjct: 119 RCDGH-----------------SCPYEVDYFDHTYTMGTLATETITLHSTSGEPFVMP-- 159
Query: 195 GSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCLDG---- 250
I GCG +S + + G++G SS+I+Q+ G + ++C G
Sbjct: 160 -ETIIGCGHN-----NSWFKPSFSGMVGLNWGPSSLITQMG--GEYPGLMSYCFSGQGTS 211
Query: 251 -INGG--GIFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGVGDNKG 307
IN G I A VV + T P Y +N+ AV VG + F +
Sbjct: 212 KINFGANAIVAGDGVVSTTMFMTTAKPG--FYYLNLDAVSVGNTRIETMGTTFHALEGN- 268
Query: 308 TIIDSGTTLAYLP-------EMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVDEGF 360
+IDSGTTL Y P E +V+ + + P ++ C+ S+++D F
Sbjct: 269 IVIDSGTTLTYFPVSYCNLVRQAVEHVVTAVRAADP------TGNDMLCYN-SDTIDI-F 320
Query: 361 PNVTFHFENSVSL 373
P +T HF V L
Sbjct: 321 PVITMHFSGGVDL 333
>gi|356508308|ref|XP_003522900.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 439
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 92/347 (26%), Positives = 142/347 (40%), Gaps = 49/347 (14%)
Query: 39 ERSLSLLKEHDARRQ--QRILAGVDL-PLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDT 95
E L L + AR Q ++AG + P+ + Y + IG+PP+ + +DT
Sbjct: 58 ESVLQLQAKDQARLQFLASMVAGRSVVPIASGRQIIQSPTYIVRAKIGSPPQTLLLAMDT 117
Query: 96 GSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCTAN 155
+D W+ C C C TL+ + S+T K V+C C+ V P C
Sbjct: 118 SNDAAWIPCTACDGC--------TSTLFAPEKSTTFKNVSCGSPQCNQV---PNPSC-GT 165
Query: 156 TSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEE 215
++C + YG SS VQD V T FGC A+ +G S +
Sbjct: 166 SACTFNLTYG-SSSIAANVVQDTVTL--------ATDPIPDYTFGCVAKTTG--ASAPPQ 214
Query: 216 ALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL---DGINGGGIFAIGHVVQP-EVNKTP 271
L G+ S S L S F++CL +N G +G V QP + TP
Sbjct: 215 GLLGLGRGPLSLLSQTQNLYQS-----TFSYCLPSFKSLNFSGSLRLGPVAQPIRIKYTP 269
Query: 272 LVPNQPH---YSINMTAVQVGLDFLNLPTD--VFGVGDNKGTIIDSGTTLAYLPEMVYEP 326
L+ N Y +N+ A++VG +++P + F GT+ DSGT L Y
Sbjct: 270 LLKNPRRSSLYYVNLVAIRVGRKVVDIPPEALAFNAATGAGTVFDSGTVFTRLVAPAYTA 329
Query: 327 LVSKI-----ISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFE 368
+ + I+ + +L V ++ TC+ +V P +TF F
Sbjct: 330 VRDEFQRRVAIAAKANLTVTSLGGFDTCY----TVPIVAPTITFMFS 372
>gi|449509162|ref|XP_004163513.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 417
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 85/342 (24%), Positives = 148/342 (43%), Gaps = 47/342 (13%)
Query: 53 QQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPR 112
Q L+ +P+ +R + Y +GIG ++ + VDTGSD+ WV C+ C+ C
Sbjct: 44 QTHQLSDSQIPISSGARLQTLN-YIVTVGIGG--QNSTLIVDTGSDLTWVQCLPCRLCYN 100
Query: 113 RSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVY----GGPLTDCTANTSCPYLEIYGDGS 168
+ + L++ +SS+ + C+ C + L +TSC Y YGDGS
Sbjct: 101 Q-----QEPLFNPSNSSSFLSLPCNSPTCVALQPTAGSSGLCSNKNSTSCDYQIDYGDGS 155
Query: 169 STTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNS 228
+ G + + K D + IFGCG G + G++G +S
Sbjct: 156 YSRGELGFEKLTLGKTEID--------NFIFGCGRNNKGLFGGAS-----GLMGLARSEL 202
Query: 229 SMISQLASSGGVRKMFAHCL--DGINGGGIFAIGHV-------VQPEVNKTPLVPN---Q 276
S++SQ +S G +F++CL G+ G +G + P ++ T ++ N
Sbjct: 203 SLVSQTSSLFG--SVFSYCLPTTGVGSSGSLTLGGADFSNFKNISP-ISYTRMIQNPQMS 259
Query: 277 PHYSINMTAVQVGLDFLNLPTDVFGVGDNKG--TIIDSGTTLAYLPEMVYEPLVSKIISQ 334
Y +N+T + +G LN+P + N+G +++DSGT + L +Y+ ++ Q
Sbjct: 260 NFYFLNLTGISIGGVNLNVPR----LSSNEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQ 315
Query: 335 QPDLKVHTVHDEY-TCFQYSESVDEGFPNVTFHFENSVSLKV 375
+ TCF + + P V F FE + + V
Sbjct: 316 FSGYRTTPGFSILNTCFNLTGYEEVNIPTVKFIFEGNAEMIV 357
>gi|449436215|ref|XP_004135889.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 496
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 86/347 (24%), Positives = 149/347 (42%), Gaps = 57/347 (16%)
Query: 53 QQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPR 112
Q L+ +P+ +R + Y +GIG ++ + VDTGSD+ WV C+ C+ C
Sbjct: 123 QTHQLSDSQIPISSGARLQTLN-YIVTVGIGG--QNSTLIVDTGSDLTWVQCLPCRLCYN 179
Query: 113 RSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVY----GGPLTDCTANTSCPYLEIYGDGS 168
+ + L++ +SS+ + C+ C + L +TSC Y YGDGS
Sbjct: 180 Q-----QEPLFNPSNSSSFLSLPCNSPTCVALQPTAGSSGLCSNKNSTSCDYQIDYGDGS 234
Query: 169 STTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNS 228
+ G + + K D + IFGCG G + G++G +S
Sbjct: 235 YSRGELGFEKLTLGKTEID--------NFIFGCGRNNKGLFGGAS-----GLMGLARSEL 281
Query: 229 SMISQLASSGGVRKMFAHCL--DGINGGGIFAIGHV-------VQPEVNKTPLVPN---Q 276
S++SQ +S G +F++CL G+ G +G + P ++ T ++ N
Sbjct: 282 SLVSQTSSLFG--SVFSYCLPTTGVGSSGSLTLGGADFSNFKNISP-ISYTRMIQNPQMS 338
Query: 277 PHYSINMTAVQVGLDFLNLPTDVFGVGDNKG--TIIDSGTTLAYLPEMVYEPLVSKIISQ 334
Y +N+T + +G LN+P + N+G +++DSGT + L +Y+ ++ Q
Sbjct: 339 NFYFLNLTGISIGGVNLNVPR----LSSNEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQ 394
Query: 335 ------QPDLKVHTVHDEYTCFQYSESVDEGFPNVTFHFENSVSLKV 375
P + TCF + + P V F FE + + V
Sbjct: 395 FSGYRTTPGFSILN-----TCFNLTGYEEVNIPTVKFIFEGNAEMIV 436
>gi|209881472|ref|XP_002142174.1| eukaryotic aspartyl protease family protein [Cryptosporidium muris
RN66]
gi|209557780|gb|EEA07825.1| eukaryotic aspartyl protease family protein [Cryptosporidium muris
RN66]
Length = 442
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 84/365 (23%), Positives = 154/365 (42%), Gaps = 67/365 (18%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKF 133
G Y+ + IGTP + + +DTGS + +C C +C + ++ Y++ S+T K+
Sbjct: 40 GYYFVDVYIGTPTQKQSLIIDTGSSHIGFSCATCLQCGKH-----DVQPYNLSKSTTAKW 94
Query: 134 VTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTST 193
E H + C Y++IY +GS +G + +D++ +++ + D++
Sbjct: 95 CNL-SENNHNI-------------CKYVQIYNEGSIVSGEYFEDILSFEEPNSDVKYFFN 140
Query: 194 NGSLIF---GCGARQSGNLDSTNEEALDGIIGFGKSNSSM-----------ISQLASSGG 239
+ + GC ++ + N GI+G G N + +S+ +
Sbjct: 141 GFRMHYNKLGCHEIETQLFINQNAS---GIMGLGIRNKDLQDNFINFLLLSVSRYYENEN 197
Query: 240 VRKMFAHCLDGINGGGIFAIGHV--------------VQPEVNKTPLVPNQPHYSINMTA 285
+ + CL + GGI IG ++ ++ PLV + Y I +
Sbjct: 198 SDIILSLCL--LKDGGIMNIGRYNDDIIEFDPENNIEIKNQILWIPLVLDTSVYRIKLEI 255
Query: 286 VQVGLDFLNLPTDVFG-VGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQ---QPDLKVH 341
+ D L FG D G +ID+G+T ++ P+ +Y+ L+ K Q D K
Sbjct: 256 IMKSSDIL----WAFGNTEDAIGVVIDTGSTFSHFPKSIYK-LIRKNFDQLCTAIDQKFG 310
Query: 342 T---VHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYP-HEYLFPFED-LWCIGWQNSG 396
T VHD C+ + ++ FPN+T F + + H YL+ LWC+ +
Sbjct: 311 TCRIVHD-ILCWTNIKDINNKFPNITMKFLGQPNYITWTYHSYLYKTNSGLWCLAIEEHK 369
Query: 397 MQSRD 401
QS +
Sbjct: 370 FQSYE 374
>gi|357515001|ref|XP_003627789.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355521811|gb|AET02265.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 415
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 87/317 (27%), Positives = 136/317 (42%), Gaps = 56/317 (17%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKF 133
G Y IGTPP + VDTGSD++W+ C CK+C + + ++D SS+ +
Sbjct: 86 GEYLMSYSIGTPPFKVFGFVDTGSDLVWLQCEPCKQCYPQIT-----PIFDPSLSSSYQN 140
Query: 134 VTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTST 193
+ C + CH + TSC GY + + D +G + S
Sbjct: 141 IPCLSDTCHSM---------RTTSC----------DVRGYLSVETLTLDSTTG--YSVSF 179
Query: 194 NGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL----- 248
++I GCG R +G + GI+G G S+ SQL +S G + F++CL
Sbjct: 180 PKTMI-GCGYRNTGTFHGPSS----GIVGLGSGPMSLPSQLGTSIGGK--FSYCLGPWLP 232
Query: 249 ---DGINGGG---IFAIGHVVQPEVNKTPLVPNQPHYSINMTAVQVGLDFLNLPTDVFGV 302
+N G ++ G + P V K Q Y + + A VG + +G
Sbjct: 233 NSTSKLNFGDAAIVYGDGAMTTPIVKKDA----QSGYYLTLEAFSVGNKLIEFGGPTYG- 287
Query: 303 GDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYTCFQYSESVD-EGF- 360
G+ +IDSGTT +LP VY S + + + V D F+ +V GF
Sbjct: 288 GNEGNILIDSGTTFTFLPYDVYYRFESAVAEY---INLEHVEDPNGTFKLCYNVAYHGFE 344
Query: 361 -PNVTFHFENSVSLKVY 376
P +T HF+ + +K+Y
Sbjct: 345 APLITAHFKGA-DIKLY 360
>gi|357500973|ref|XP_003620775.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|357500991|ref|XP_003620784.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355495790|gb|AES76993.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355495799|gb|AES77002.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 438
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 88/341 (25%), Positives = 149/341 (43%), Gaps = 40/341 (11%)
Query: 74 GLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKF 133
G Y +GTPP Y VDTGSDI+W+ C C++C +++ ++ SS+ K
Sbjct: 85 GDYIMSYSVGTPPIKSYGIVDTGSDIVWLQCEPCEQCYNQTT-----PKFNPSKSSSYKN 139
Query: 134 VTCDQEFCHGVYGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTST 193
++C + C V T C +C Y YG+ S + G + + + +G + S
Sbjct: 140 ISCSSKLCQSVRD---TSCNDKKNCEYSINYGNQSHSQGDLSLETLTLESTTG--RPVSF 194
Query: 194 NGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL----- 248
++I GCG G+ + G++G G +S+I+QL S G + F++CL
Sbjct: 195 PKTVI-GCGTNNIGSF----KRVSSGVVGLGGGPASLITQLGPSIGGK--FSYCLVRMSI 247
Query: 249 ---------DGINGGGIFAI-GHVVQPEVNKTPLVPNQP--HYSINMTAVQVGLDFLNLP 296
+N G + + GH V TP+V Y + + A VG +
Sbjct: 248 TLKNMSMGSSKLNFGDVAIVSGH----NVLSTPIVKKDHSFFYYLTIEAFSVGDKRVEFA 303
Query: 297 TDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEYT-CFQYSES 355
GV + IIDS T + ++P VY L S I+ +V + +++ C+ S
Sbjct: 304 GSSKGV-EEGNIIIDSSTIVTFVPSDVYTKLNSAIVDLVTLERVDDPNQQFSLCYNVSSD 362
Query: 356 VDEGFPNVTFHFENSVSLKVYPHEYLFPFEDLWCIGWQNSG 396
+ FP +T HF+ + L + ++ D+ C + S
Sbjct: 363 EEYDFPYMTAHFKGADILLYATNTFVEVARDVLCFAFAPSN 403
>gi|66817422|ref|XP_642564.1| hypothetical protein DDB_G0277581 [Dictyostelium discoideum AX4]
gi|60470632|gb|EAL68608.1| hypothetical protein DDB_G0277581 [Dictyostelium discoideum AX4]
Length = 492
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 91/372 (24%), Positives = 154/372 (41%), Gaps = 57/372 (15%)
Query: 63 PLGGSSRPDGVGLYYA-KIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELT 121
PL + D G +Y + + + + +QVDTGS + + C C +
Sbjct: 82 PLSKNIDIDMQGNFYQINVNVLIGQQKFILQVDTGSTLTAIPLKGCNSCKDNRPV----- 136
Query: 122 LYDIKDSSTGKFVTCDQEFCHGVYGGPLTDC----TANTSCPYLEIYGDGSSTTGYFVQD 177
YD SS+ + + C + C G G C A ++C ++ +YGDGS G D
Sbjct: 137 -YDPALSSSSQLIPCSSDKCLGS-GSASPSCKLHQNAKSTCDFIILYGDGSKIKGKVFSD 194
Query: 178 VVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKS--NSSMI---- 231
+ VS ++ FG + G + DGI+G G++ N +++
Sbjct: 195 EITVSGVS---------STIYFGANVEEVGAFEYPRA---DGIMGLGRTSNNKNLVPTIF 242
Query: 232 -SQLASSGGVRKMFAHCLDGINGGGIFAIG----HVVQPEVNKTPLVPNQPHYSINMTAV 286
S + S+ ++ +F LD +G G ++G H + TP+ P P Y+I T+
Sbjct: 243 DSMVRSNSSIKNIFGIYLD-YHGQGYLSLGKINHHYYIGSIQYTPIQPAGPFYAIKPTSF 301
Query: 287 QVGLDFLNLPTDVFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKV----HT 342
+V D + P + G I+DSGT+ L VY+ L+ + + +
Sbjct: 302 RV--DNTSFPANSMG-----QVIVDSGTSDLILTSRVYDHLIQYFRKHYCHIDMVCSYPS 354
Query: 343 VHDEYTCFQYSESVDEGFPNVTFHFENSVSLKVYPHEYLFPFED----LWCIGWQNSGMQ 398
+ CF+ E FP + F FE V + + P Y+ E ++ W G+
Sbjct: 355 IFSSRVCFEKEEDF-ATFPWLHFGFEGGVRIAIPPKNYMIKTESNQQGVYGYCW---GID 410
Query: 399 SRDRKNMTLLGD 410
D +MT+LGD
Sbjct: 411 RGD--DMTILGD 420
>gi|449437856|ref|XP_004136706.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 457
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 88/337 (26%), Positives = 142/337 (42%), Gaps = 64/337 (18%)
Query: 70 PDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQ---CKEC--PRRSSLGIELTLYD 124
P G Y + GTP + ++ DTGS ++W C C EC P+ GI +
Sbjct: 75 PHSYGAYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKIDPTGIP--RFV 132
Query: 125 IKDSSTGKFVTCDQEFCHGVYG----------GPLTDCTANTSCPYLEIYGDGSSTTGYF 174
K SS+ K V C C ++G P T+ T Y+ YG G ST G
Sbjct: 133 PKLSSSSKLVGCQNPKCSWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSG-STAGLL 191
Query: 175 VQDVVQY-DKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQ 233
+ + + + DK + + GC + S ++ + GI GFG+ + S+ SQ
Sbjct: 192 LSETLDFPDKKIPN---------FVVGC------SFLSIHQPS--GIAGFGRGSESLPSQ 234
Query: 234 LASSGGVRKMFAHCLDG-------------INGGGIFAIGHVVQPEVNKTPLVPN---QP 277
+ G++K FA+CL ++ G+ + G P + P V N +
Sbjct: 235 M----GLKK-FAYCLASRKFDDSPHSGQLILDSTGVKSSGLTYTP-FRQNPSVSNNAYKE 288
Query: 278 HYSINMTAVQVGLDFLNLPTD--VFGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQ 335
+Y +N+ + VG + +P V G N G+IIDSG+T ++ + V E + + Q
Sbjct: 289 YYYLNIRKIIVGNQAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQL 348
Query: 336 PDLK----VHTVHDEYTCFQYSESVDEGFPNVTFHFE 368
+ V T+ CF S+ FP + F F+
Sbjct: 349 ANWTRATDVETLTGLRPCFDISKEKSVKFPELIFQFK 385
>gi|294463081|gb|ADE77078.1| unknown [Picea sitchensis]
Length = 370
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 98/360 (27%), Positives = 146/360 (40%), Gaps = 60/360 (16%)
Query: 93 VDTGSDIMWVNCIQ---CKECPRRS-SLGIELTLYDIKDSSTGKFVTCDQEFCHGVYG-- 146
+DTGSD++WV C + C CP S S G+ L + SS+ VTC C +YG
Sbjct: 1 MDTGSDLVWVPCTRNYSCINCPEDSASNGVFLP----RMSSSLHLVTCADSNCKTLYGNN 56
Query: 147 ---------GPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSL 197
G L +C+ T PY YG GS T G + + + +G+ T+
Sbjct: 57 TELLCQSCAGSLKNCS-ETCPPYGIQYGRGS-TAGLLLTETLNLPLENGEGARAITH--F 112
Query: 198 IFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAHCL-----DGIN 252
GC S + GI GFG+ SM SQL G + FA+CL D N
Sbjct: 113 AVGCSIVSS--------QQPSGIAGFGRGALSMPSQLGEHIG-KDRFAYCLQSHRFDEEN 163
Query: 253 GGGIFAIGHVVQPE---VNKTPLVPNQP---------HYSINMTAVQVGLDFLN-LPTDV 299
+ +G P +N TP + N +Y I + V +G L LP+ +
Sbjct: 164 KKSLMVLGDKALPNNIPLNYTPFLTNSRAPPSSQYGVYYYIGLRGVSIGGKRLKQLPSKL 223
Query: 300 --FGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQQPDLKVHTVHDEY---TCFQYSE 354
F N GTIIDSGTT + +++ + + SQ + V D+ C+ +
Sbjct: 224 LRFDTKGNGGTIIDSGTTFTVFSDEIFKHIAAGFASQIGYRRAGEVEDKTGMGLCYDVTG 283
Query: 355 SVDEGFPNVTFHFENSVSLKVYP----HEYLFPFEDLWCIGWQNSGMQSRDRKNMTLLGD 410
+ P FHF+ + V P Y F+ + + G+ D +LG+
Sbjct: 284 LENIVLPEFAFHFKGGSDM-VLPVANYFSYFSSFDSICLTMISSRGLLEVDSGPAVILGN 342
>gi|242050428|ref|XP_002462958.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
gi|241926335|gb|EER99479.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
Length = 460
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 74/271 (27%), Positives = 103/271 (38%), Gaps = 43/271 (15%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y IGTPP +DTGSD++W C + P R LY S T V+
Sbjct: 100 YLVDFAIGTPPLALSAVLDTGSDLIWTQC----DAPCRRCFPQPAPLYAPARSVTYANVS 155
Query: 136 CDQEFCHGV---------YGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSG 186
C C + C Y YGDGSST G + +
Sbjct: 156 CGSRLCDALPSLRPSSRCSASASAPAPERGGCTYYYSYGDGSSTDGVLATETFTFGA--- 212
Query: 187 DLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASSGGVRKMFAH 246
+T L FGCG G D+++ G++G G+ S++SQL GV K F++
Sbjct: 213 ----GTTVHDLAFGCGTDNLGGTDNSS-----GLVGMGRGPLSLVSQL----GVTK-FSY 258
Query: 247 CLDGINGGG-----IFAIGHVVQPEVNKTPLVPN------QPHYSINMTAVQVGLDFLNL 295
C N + P TP VP+ +Y +++ + VG L +
Sbjct: 259 CFTPFNDTTTSSPLFLGSSASLSPAAKSTPFVPSPSGPRRSSYYYLSLEGITVGDTLLPI 318
Query: 296 PTDVFGV--GDNKGTIIDSGTTLAYLPEMVY 324
VF + G IIDSGTT L E +
Sbjct: 319 DPAVFRLTASGRGGLIIDSGTTFTALEERAF 349
>gi|115479237|ref|NP_001063212.1| Os09g0423500 [Oryza sativa Japonica Group]
gi|50725896|dbj|BAD33424.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|50726136|dbj|BAD33657.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|113631445|dbj|BAF25126.1| Os09g0423500 [Oryza sativa Japonica Group]
gi|215767614|dbj|BAG99842.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222641596|gb|EEE69728.1| hypothetical protein OsJ_29412 [Oryza sativa Japonica Group]
Length = 473
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 87/319 (27%), Positives = 134/319 (42%), Gaps = 52/319 (16%)
Query: 76 YYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKECPRRSSLGIELTLYDIKDSSTGKFVT 135
Y A +G+G + V VDT S++ WV C C C + L+D S + +
Sbjct: 127 YVATVGLGG--GEATVIVDTASELTWVQCAPCASCHDQQG-----PLFDPASSPSYAVLP 179
Query: 136 CDQEFCHGV-----YGGPLTDCTANTSCPYLEIYGDGSSTTGYFVQDVVQYDKVSGDLQT 190
C+ C + SC Y Y DGS + Q V+ +DK+S L
Sbjct: 180 CNSSSCDALQVATGSAAGACGGGEQPSCSYTLSYRDGS-----YSQGVLAHDKLS--LAG 232
Query: 191 TSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQ-LASSGGVRKMFAHC-- 247
+G +FGCG G T+ G++G G+S S+ISQ + GGV F++C
Sbjct: 233 EVIDG-FVFGCGTSNQGPFGGTS-----GLMGLGRSQLSLISQTMDQFGGV---FSYCLP 283
Query: 248 LDGINGGGIFAIGHVVQPEVNKTPLV-------PNQ-PHYSINMTAVQVGLDFLNLPTDV 299
L G +G N TP+V P Q P Y +N+T + +G +
Sbjct: 284 LKESESSGSLVLGDDTSVYRNSTPIVYTTMVSDPVQGPFYFVNLTGITIGGQEVE----- 338
Query: 300 FGVGDNKGTIIDSGTTLAYLPEMVYEPLVSKIISQ---QPDLKVHTVHDEYTCFQYSESV 356
I+DSGT + L VY + ++ +SQ P ++ D TCF +
Sbjct: 339 ---SSAGKVIVDSGTIITSLVPSVYNAVKAEFLSQFAEYPQAPGFSILD--TCFNLTGFR 393
Query: 357 DEGFPNVTFHFENSVSLKV 375
+ P++ F FE +V ++V
Sbjct: 394 EVQIPSLKFVFEGNVEVEV 412
>gi|22831049|dbj|BAC15912.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50508281|dbj|BAD32130.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 453
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 96/380 (25%), Positives = 153/380 (40%), Gaps = 66/380 (17%)
Query: 61 DLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKE-CPRRSSLGIE 119
DLP GG Y + IGTPP+ Y DTGSD++W C C E C ++ S
Sbjct: 85 DLPNGGE--------YIMTLAIGTPPQSYPAIADTGSDLVWTQCAPCGERCFKQPS---- 132
Query: 120 LTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCT--ANTSCPYLEIYGDGSSTTGYFVQD 177
LY+ S T + + C L T +C Y + YG G T+G +
Sbjct: 133 -PLYNPSSSPTFRVLPCSSALNLCAAEARLAGATPPPGCACRYNQTYGTG-WTSGLQGSE 190
Query: 178 VVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASS 237
+ D + FGC N S + G++G G+ S++SQLA+
Sbjct: 191 TFTFGSSPADQVRVP---GIAFGC-----SNASSDDWNGSAGLVGLGRGGLSLVSQLAAG 242
Query: 238 GGVRKMFAHCLD--------------------GINGGGIFAIGHVVQPEVNKTPLVPNQP 277
MF++CL +NG G+ + V P +K P+
Sbjct: 243 -----MFSYCLTPFQDTKSKSTLLLGPAAAAAALNGTGVRSTPFV--PSPSKPPM---ST 292
Query: 278 HYSINMTAVQVGLDFLNLPTDVFGVGDN--KGTIIDSGTTLAYLPEMVYEPLVSKIIS-- 333
+Y +N+T + VG L +P F + + G IIDSGTT+ L + Y+ + + + S
Sbjct: 293 YYYLNLTGISVGPAALPIPPGAFALRADGTGGLIIDSGTTITSLVDAAYKRVRAAVRSLV 352
Query: 334 QQPDLKVHTVHDEYTCFQY--SESVDEGFPNVTFHFENSVSLKVYPHEYLFPFEDLWCIG 391
+ P CF S + P++T HF + + Y+ +WC+
Sbjct: 353 KLPVTDGSNATGLDLCFALPSSSAPPATLPSMTLHFGGGADMVLPVENYMILDGGMWCL- 411
Query: 392 WQNSGMQSRDRKNMTLLGDF 411
M+S+ ++ LG++
Sbjct: 412 ----AMRSQTDGELSTLGNY 427
>gi|168054484|ref|XP_001779661.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162668975|gb|EDQ55572.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 419
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 87/336 (25%), Positives = 134/336 (39%), Gaps = 51/336 (15%)
Query: 47 EHDARRQQRILAGVDLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQ 106
HD Q +++G L G G Y+ +GTPP+ + + VD+GSD++WV C
Sbjct: 45 SHDHDFQSPVVSGSTL---------GSGQYFVDFFLGTPPQKFSLIVDSGSDLLWVQCAP 95
Query: 107 CKECPRRSSLGIELTLYDIKDSSTGKFVTCDQEFCHGVYG--GPLTDCTANTSCPYLEIY 164
C +C + LY +SST V C C + G D +C Y Y
Sbjct: 96 CLQC-----YAQDTPLYAPSNSSTFNPVPCLSPECLLIPATEGFPCDFHYPGACAYEYRY 150
Query: 165 GDGSSTTGYFVQDVVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFG 224
D S + G F + D V D + FGCG G+ A G++G G
Sbjct: 151 ADTSLSKGVFAYESATVDDVRID--------KVAFGCGRDNQGSF-----AAAGGVLGLG 197
Query: 225 KSNSSMISQLASSGGVRKMFAHCLDG-----------INGGGIFAIGHVVQPEVNKTPLV 273
+ S SQ+ + G + FA+CL I G + + H +Q TP+V
Sbjct: 198 QGPLSFGSQVGYAYGNK--FAYCLVNYLDPTSVSSWLIFGDELISTIHDLQ----FTPIV 251
Query: 274 PNQPH---YSINMTAVQVGLDFLNLPTDVFGVG--DNKGTIIDSGTTLAYLPEMVYEPLV 328
N + Y + + V VG + L + + + N G+I DSGTT+ Y Y ++
Sbjct: 252 SNSRNPTLYYVQIEKVMVGGESLPISHSAWSLDFLGNGGSIFDSGTTVTYWLPPAYRNIL 311
Query: 329 SKIISQQPDLKVHTVHDEYTCFQYSESVDEGFPNVT 364
+ + +V C + FP+ T
Sbjct: 312 AAFDKNVRYPRAASVQGLDLCVDVTGVDQPSFPSFT 347
>gi|115472519|ref|NP_001059858.1| Os07g0533800 [Oryza sativa Japonica Group]
gi|113611394|dbj|BAF21772.1| Os07g0533800 [Oryza sativa Japonica Group]
Length = 458
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 96/380 (25%), Positives = 153/380 (40%), Gaps = 66/380 (17%)
Query: 61 DLPLGGSSRPDGVGLYYAKIGIGTPPKDYYVQVDTGSDIMWVNCIQCKE-CPRRSSLGIE 119
DLP GG Y + IGTPP+ Y DTGSD++W C C E C ++ S
Sbjct: 90 DLPNGGE--------YIMTLAIGTPPQSYPAIADTGSDLVWTQCAPCGERCFKQPS---- 137
Query: 120 LTLYDIKDSSTGKFVTCDQEFCHGVYGGPLTDCT--ANTSCPYLEIYGDGSSTTGYFVQD 177
LY+ S T + + C L T +C Y + YG G T+G +
Sbjct: 138 -PLYNPSSSPTFRVLPCSSALNLCAAEARLAGATPPPGCACRYNQTYGTG-WTSGLQGSE 195
Query: 178 VVQYDKVSGDLQTTSTNGSLIFGCGARQSGNLDSTNEEALDGIIGFGKSNSSMISQLASS 237
+ D + FGC N S + G++G G+ S++SQLA+
Sbjct: 196 TFTFGSSPADQVRVP---GIAFGC-----SNASSDDWNGSAGLVGLGRGGLSLVSQLAAG 247
Query: 238 GGVRKMFAHCLD--------------------GINGGGIFAIGHVVQPEVNKTPLVPNQP 277
MF++CL +NG G+ + V P +K P+
Sbjct: 248 -----MFSYCLTPFQDTKSKSTLLLGPAAAAAALNGTGVRSTPFV--PSPSKPPM---ST 297
Query: 278 HYSINMTAVQVGLDFLNLPTDVFGVGDN--KGTIIDSGTTLAYLPEMVYEPLVSKIIS-- 333
+Y +N+T + VG L +P F + + G IIDSGTT+ L + Y+ + + + S
Sbjct: 298 YYYLNLTGISVGPAALPIPPGAFALRADGTGGLIIDSGTTITSLVDAAYKRVRAAVRSLV 357
Query: 334 QQPDLKVHTVHDEYTCFQY--SESVDEGFPNVTFHFENSVSLKVYPHEYLFPFEDLWCIG 391
+ P CF S + P++T HF + + Y+ +WC+
Sbjct: 358 KLPVTDGSNATGLDLCFALPSSSAPPATLPSMTLHFGGGADMVLPVENYMILDGGMWCL- 416
Query: 392 WQNSGMQSRDRKNMTLLGDF 411
M+S+ ++ LG++
Sbjct: 417 ----AMRSQTDGELSTLGNY 432
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.319 0.138 0.425
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,876,662,550
Number of Sequences: 23463169
Number of extensions: 309344366
Number of successful extensions: 633796
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 2118
Number of HSP's successfully gapped in prelim test: 1496
Number of HSP's that attempted gapping in prelim test: 626596
Number of HSP's gapped (non-prelim): 4368
length of query: 411
length of database: 8,064,228,071
effective HSP length: 145
effective length of query: 266
effective length of database: 8,957,035,862
effective search space: 2382571539292
effective search space used: 2382571539292
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 78 (34.7 bits)