BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 037706
(394 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255550723|ref|XP_002516410.1| pepsin A, putative [Ricinus communis]
gi|223544445|gb|EEF45965.1| pepsin A, putative [Ricinus communis]
Length = 416
Score = 662 bits (1708), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 323/394 (81%), Positives = 354/394 (89%), Gaps = 3/394 (0%)
Query: 1 VIQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNI 60
VIQVYMDTGSDLTWVPCGNLSFDCMDCDDYRN+KLMS FSPS SSSS RD+CAS +C +I
Sbjct: 24 VIQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNSKLMSAFSPSHSSSSYRDSCASPYCTDI 83
Query: 61 HSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
HSSDN FDPCT++GCSLSTL+K+TC RPCPSFAYTYG GG+VTG LTRDTL+VH +
Sbjct: 84 HSSDNSFDPCTVAGCSLSTLIKATCARPCPSFAYTYGAGGVVTGTLTRDTLRVHEGPARV 143
Query: 121 IREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISS 180
++IPKFCFGCVGSTY EPIGIAGF RG LS PSQLG L+KGFSHCFLAFKYAN+PNISS
Sbjct: 144 TKDIPKFCFGCVGSTYHEPIGIAGFVRGTLSFPSQLGLLKKGFSHCFLAFKYANNPNISS 203
Query: 181 PLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNG 240
PLVIGD A+SSKDN+QFTPMLKSPMYPNYYYIGLEAIT+GN S T VPL+LREFDSQGNG
Sbjct: 204 PLVIGDTALSSKDNMQFTPMLKSPMYPNYYYIGLEAITVGNVSATTVPLNLREFDSQGNG 263
Query: 241 GLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDD 300
G+L+DSGTTYTHLPEPFYSQLLSI ++ IT YPRA EVE R GFDLCY+VPCPNN TDD
Sbjct: 264 GMLIDSGTTYTHLPEPFYSQLLSIFKAIIT-YPRATEVEMRAGFDLCYKVPCPNNRLTDD 322
Query: 301 --LFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQ 358
LFPSITFHFLNNVS VLPQGNHFYAMSAPSNS+ VKCLLFQSM D DYGP+GVFGSFQ
Sbjct: 323 DNLFPSITFHFLNNVSFVLPQGNHFYAMSAPSNSTVVKCLLFQSMADSDYGPAGVFGSFQ 382
Query: 359 QQNVEVVYDLEKERIGFQPMDCASTASAQGLHKK 392
QQNV++VYDLEKERIGFQPMDCAS A +QGLH++
Sbjct: 383 QQNVQIVYDLEKERIGFQPMDCASAAVSQGLHRE 416
>gi|224136436|ref|XP_002322329.1| predicted protein [Populus trichocarpa]
gi|222869325|gb|EEF06456.1| predicted protein [Populus trichocarpa]
Length = 486
Score = 647 bits (1668), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 319/393 (81%), Positives = 355/393 (90%), Gaps = 3/393 (0%)
Query: 1 VIQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNI 60
VIQV MDTGSDLTWVPCGNLSFDCM+CDDYRNNKLM+ FSPS SSSS R +CAS FC++I
Sbjct: 94 VIQVLMDTGSDLTWVPCGNLSFDCMECDDYRNNKLMATFSPSYSSSSYRASCASPFCIDI 153
Query: 61 HSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
HSSDNP D CT++GCSLSTL+K+TC RPCPSFAYTYG GG+VTGILTRDTL+V+GSSPG+
Sbjct: 154 HSSDNPLDTCTVAGCSLSTLVKATCSRPCPSFAYTYGAGGVVTGILTRDTLRVNGSSPGV 213
Query: 121 IREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISS 180
+EIPKFCFGCVGS YREPIGIAGFGRG LS+ SQLGFLQKGFSHCFLAFKYAN+PNISS
Sbjct: 214 AKEIPKFCFGCVGSAYREPIGIAGFGRGTLSMVSQLGFLQKGFSHCFLAFKYANNPNISS 273
Query: 181 PLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNG 240
PLV+GD+A++SKD++QFTPML SPMYPN+YY+GLEAIT+GN S TEVP SLREFDS GNG
Sbjct: 274 PLVVGDIALTSKDDMQFTPMLNSPMYPNFYYVGLEAITVGNVSATEVPSSLREFDSLGNG 333
Query: 241 GLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCP-NNTFT- 298
G+ +DSGTTYTHLPEPFYSQ+LSILQSTI YPR +E +TGFDLCY+VP P NNT T
Sbjct: 334 GMKIDSGTTYTHLPEPFYSQVLSILQSTIN-YPRDTGMEMQTGFDLCYKVPRPNNNTLTS 392
Query: 299 DDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQ 358
DDL PSITFHFLNNVSLVLPQGNHFY +SAP N + VKCL+FQS DDGD GP+GVFGSFQ
Sbjct: 393 DDLLPSITFHFLNNVSLVLPQGNHFYPVSAPGNPAVVKCLMFQSTDDGDDGPAGVFGSFQ 452
Query: 359 QQNVEVVYDLEKERIGFQPMDCASTASAQGLHK 391
QQNVEVVYDLEKERIGFQPMDCAS AS+QGLHK
Sbjct: 453 QQNVEVVYDLEKERIGFQPMDCASAASSQGLHK 485
>gi|224115494|ref|XP_002332148.1| predicted protein [Populus trichocarpa]
gi|222875198|gb|EEF12329.1| predicted protein [Populus trichocarpa]
Length = 483
Score = 642 bits (1656), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 308/392 (78%), Positives = 356/392 (90%), Gaps = 2/392 (0%)
Query: 1 VIQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNI 60
VIQVYMDTGSDLTW PCGN+SFDC++CD+YRNN++M++FSPS SSSS RD+C S FC+++
Sbjct: 92 VIQVYMDTGSDLTWAPCGNISFDCIECDNYRNNRMMASFSPSHSSSSHRDSCTSPFCIDV 151
Query: 61 HSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
HSSDNP DPCTM+GCSLSTL+K+TC PCP FAYTYG GG+VTG LTRDTL+VHG + G+
Sbjct: 152 HSSDNPLDPCTMAGCSLSTLVKATCSWPCPPFAYTYGAGGVVTGTLTRDTLRVHGRNLGV 211
Query: 121 IREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISS 180
+EIP+FCFGCV S+YREPIGIAGFGRGALS+PSQLGFL+KGFSHCFLAFKYAN+PNISS
Sbjct: 212 TQEIPRFCFGCVASSYREPIGIAGFGRGALSLPSQLGFLRKGFSHCFLAFKYANNPNISS 271
Query: 181 PLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNG 240
PL+IGD+A++SKD++QFTPMLKSPMYPNYYY+GLEAIT+GN S TEVP SLREFDS GNG
Sbjct: 272 PLIIGDIALTSKDDMQFTPMLKSPMYPNYYYVGLEAITVGNVSATEVPSSLREFDSLGNG 331
Query: 241 GLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNT-FTD 299
G+LVDSGTTYTHLPEPFYSQ+LS+LQS I YPRA ++E RTGFDLCY+VPC NN+ T
Sbjct: 332 GMLVDSGTTYTHLPEPFYSQVLSVLQSIIN-YPRATDMEMRTGFDLCYKVPCQNNSILTG 390
Query: 300 DLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQ 359
DL PSITFHFLNN SLVL +G+HFYAMSAPSNS+ VKCLLFQSMDDGDYGP+GV GSFQQ
Sbjct: 391 DLLPSITFHFLNNASLVLSRGSHFYAMSAPSNSTVVKCLLFQSMDDGDYGPAGVLGSFQQ 450
Query: 360 QNVEVVYDLEKERIGFQPMDCASTASAQGLHK 391
Q+VEVVYD+EKERIGF+PMDCAS AS QG +K
Sbjct: 451 QDVEVVYDMEKERIGFRPMDCASAASFQGFNK 482
>gi|296085344|emb|CBI29076.3| unnamed protein product [Vitis vinifera]
Length = 434
Score = 635 bits (1639), Expect = e-180, Method: Compositional matrix adjust.
Identities = 309/394 (78%), Positives = 345/394 (87%), Gaps = 3/394 (0%)
Query: 1 VIQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNI 60
VIQVYMDTGSDLTWVPCGNLSFDCMDC+DYRNNKLMS +SPS SSSS RD C S C ++
Sbjct: 41 VIQVYMDTGSDLTWVPCGNLSFDCMDCNDYRNNKLMSTYSPSYSSSSLRDLCVSPLCSDV 100
Query: 61 HSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
HSSDN +DPC ++GCSLSTL+K TC RPCPSFAYTYG GG+V G LTRDTL HGSSP
Sbjct: 101 HSSDNSYDPCAVAGCSLSTLVKGTCPRPCPSFAYTYGAGGVVIGTLTRDTLTTHGSSPSF 160
Query: 121 IREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISS 180
RE+P FCFGCVGSTYREPIGIAGFGRG LS+PSQLGFLQKGFSHCFL FK+AN+PNISS
Sbjct: 161 TREVPNFCFGCVGSTYREPIGIAGFGRGVLSLPSQLGFLQKGFSHCFLGFKFANNPNISS 220
Query: 181 PLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNG 240
PLVIGD+AISS D+LQFT +LK+PMYPNYYYIGLEAIT+GN++ +VP SLREFDS GNG
Sbjct: 221 PLVIGDLAISSNDHLQFTSLLKNPMYPNYYYIGLEAITVGNATAIQVPSSLREFDSHGNG 280
Query: 241 GLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFT-- 298
G+++DSGTTYTHLP PFY+QLLS+LQS IT YPRA+E E RTGFDLCYR+PCPNN T
Sbjct: 281 GMIIDSGTTYTHLPGPFYTQLLSMLQSIIT-YPRAQEQEARTGFDLCYRIPCPNNVVTDH 339
Query: 299 DDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQ 358
D L PSI+FHF NNVSLVLPQGNHFYAM APSNS+ VKCLL Q+MDD D GP+GVFGSFQ
Sbjct: 340 DHLLPSISFHFSNNVSLVLPQGNHFYAMGAPSNSTVVKCLLLQNMDDSDSGPAGVFGSFQ 399
Query: 359 QQNVEVVYDLEKERIGFQPMDCASTASAQGLHKK 392
QQNV+VVYDLEKERIGFQPMDCAS A++QG+ K
Sbjct: 400 QQNVKVVYDLEKERIGFQPMDCASAAASQGIIHK 433
>gi|359484086|ref|XP_002263357.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 417
Score = 635 bits (1638), Expect = e-179, Method: Compositional matrix adjust.
Identities = 309/394 (78%), Positives = 345/394 (87%), Gaps = 3/394 (0%)
Query: 1 VIQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNI 60
VIQVYMDTGSDLTWVPCGNLSFDCMDC+DYRNNKLMS +SPS SSSS RD C S C ++
Sbjct: 24 VIQVYMDTGSDLTWVPCGNLSFDCMDCNDYRNNKLMSTYSPSYSSSSLRDLCVSPLCSDV 83
Query: 61 HSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
HSSDN +DPC ++GCSLSTL+K TC RPCPSFAYTYG GG+V G LTRDTL HGSSP
Sbjct: 84 HSSDNSYDPCAVAGCSLSTLVKGTCPRPCPSFAYTYGAGGVVIGTLTRDTLTTHGSSPSF 143
Query: 121 IREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISS 180
RE+P FCFGCVGSTYREPIGIAGFGRG LS+PSQLGFLQKGFSHCFL FK+AN+PNISS
Sbjct: 144 TREVPNFCFGCVGSTYREPIGIAGFGRGVLSLPSQLGFLQKGFSHCFLGFKFANNPNISS 203
Query: 181 PLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNG 240
PLVIGD+AISS D+LQFT +LK+PMYPNYYYIGLEAIT+GN++ +VP SLREFDS GNG
Sbjct: 204 PLVIGDLAISSNDHLQFTSLLKNPMYPNYYYIGLEAITVGNATAIQVPSSLREFDSHGNG 263
Query: 241 GLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFT-- 298
G+++DSGTTYTHLP PFY+QLLS+LQS IT YPRA+E E RTGFDLCYR+PCPNN T
Sbjct: 264 GMIIDSGTTYTHLPGPFYTQLLSMLQSIIT-YPRAQEQEARTGFDLCYRIPCPNNVVTDH 322
Query: 299 DDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQ 358
D L PSI+FHF NNVSLVLPQGNHFYAM APSNS+ VKCLL Q+MDD D GP+GVFGSFQ
Sbjct: 323 DHLLPSISFHFSNNVSLVLPQGNHFYAMGAPSNSTVVKCLLLQNMDDSDSGPAGVFGSFQ 382
Query: 359 QQNVEVVYDLEKERIGFQPMDCASTASAQGLHKK 392
QQNV+VVYDLEKERIGFQPMDCAS A++QG+ K
Sbjct: 383 QQNVKVVYDLEKERIGFQPMDCASAAASQGIIHK 416
>gi|15242307|ref|NP_199325.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|9758987|dbj|BAB09497.1| chloroplast nucleoid DNA-binding protein-like [Arabidopsis
thaliana]
gi|332007824|gb|AED95207.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 491
Score = 598 bits (1543), Expect = e-168, Method: Compositional matrix adjust.
Identities = 291/402 (72%), Positives = 337/402 (83%), Gaps = 18/402 (4%)
Query: 1 VIQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSN--FSPSRSSSSSRDTCASSFCL 58
+QVY+DTGSDLTWVPCGNLSFDC++C D +NN L S FSP SS+S RD+CASSFC+
Sbjct: 95 AVQVYLDTGSDLTWVPCGNLSFDCIECYDLKNNDLKSPSVFSPLHSSTSFRDSCASSFCV 154
Query: 59 NIHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSP 118
IHSSDNPFDPC ++GCS+S LLKSTC RPCPSFAYTYGEGGL++GILTRD LK
Sbjct: 155 EIHSSDNPFDPCAVAGCSVSMLLKSTCVRPCPSFAYTYGEGGLISGILTRDILKAR---- 210
Query: 119 GIIREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNI 178
R++P+F FGCV STYREPIGIAGFGRG LS+PSQLGFL+KGFSHCFL FK+ N+PNI
Sbjct: 211 --TRDVPRFSFGCVTSTYREPIGIAGFGRGLLSLPSQLGFLEKGFSHCFLPFKFVNNPNI 268
Query: 179 SSPLVIGDVAISSK--DNLQFTPMLKSPMYPNYYYIGLEAITIG-NSSLTEVPLSLREFD 235
SSPL++G A+S D+LQFTPML +PMYPN YYIGLE+ITIG N + T+VPL+LR+FD
Sbjct: 269 SSPLILGASALSINLTDSLQFTPMLNTPMYPNSYYIGLESITIGTNITPTQVPLTLRQFD 328
Query: 236 SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNN 295
SQGNGG+LVDSGTTYTHLPEPFYSQLL+ LQSTITY PRA E E RTGFDLCY+VPCPNN
Sbjct: 329 SQGNGGMLVDSGTTYTHLPEPFYSQLLTTLQSTITY-PRATETESRTGFDLCYKVPCPNN 387
Query: 296 TFTD------DLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYG 349
T +FPSITFHFLNN +L+LPQGN FYAMSAPS+ S V+CLLFQ+M+DGDYG
Sbjct: 388 NLTSLENDVMMIFPSITFHFLNNATLLLPQGNSFYAMSAPSDGSVVQCLLFQNMEDGDYG 447
Query: 350 PSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASAQGLHK 391
P+GVFGSFQQQNV+VVYDLEKERIGFQ MDC A++ GL++
Sbjct: 448 PAGVFGSFQQQNVKVVYDLEKERIGFQAMDCVLEAASHGLNQ 489
>gi|297794789|ref|XP_002865279.1| hypothetical protein ARALYDRAFT_494467 [Arabidopsis lyrata subsp.
lyrata]
gi|297311114|gb|EFH41538.1| hypothetical protein ARALYDRAFT_494467 [Arabidopsis lyrata subsp.
lyrata]
Length = 419
Score = 588 bits (1516), Expect = e-165, Method: Compositional matrix adjust.
Identities = 289/402 (71%), Positives = 336/402 (83%), Gaps = 18/402 (4%)
Query: 1 VIQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSN--FSPSRSSSSSRDTCASSFCL 58
+QVYMDTGSDLTWVPCGNLSFDC+DC+D ++N L S+ FSP SSSS R +CASSFC
Sbjct: 23 AVQVYMDTGSDLTWVPCGNLSFDCIDCNDLKSNNLKSSSIFSPLHSSSSFRASCASSFCA 82
Query: 59 NIHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSP 118
IHSSDNPFDPC ++GCS+S LLKSTC RPCPSFAYTYGEGGLV+GILTRD LK
Sbjct: 83 EIHSSDNPFDPCAIAGCSVSMLLKSTCIRPCPSFAYTYGEGGLVSGILTRDILKAR---- 138
Query: 119 GIIREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNI 178
R++P+F FGCV STY EPIGIAGFGRG LS+PSQLGFL+KGFSHCFL FK+ N+PNI
Sbjct: 139 --TRDVPRFSFGCVTSTYHEPIGIAGFGRGLLSLPSQLGFLEKGFSHCFLPFKFVNNPNI 196
Query: 179 SSPLVIGDVAISSK--DNLQFTPMLKSPMYPNYYYIGLEAITIG-NSSLTEVPLSLREFD 235
SSPL++G A+S D+LQFTPML +P+YPN YYIGLE+ITIG N + T+VPL+LR+FD
Sbjct: 197 SSPLILGASALSINLTDSLQFTPMLNTPVYPNSYYIGLESITIGTNITPTQVPLTLRQFD 256
Query: 236 SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNN 295
SQGNGG+LVDSGTTYTHLP PFYSQLL+ILQSTITY PRA E E RTGFDLCY+VPCPNN
Sbjct: 257 SQGNGGMLVDSGTTYTHLPNPFYSQLLTILQSTITY-PRATETESRTGFDLCYKVPCPNN 315
Query: 296 TFTD------DLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYG 349
T +FPSITF+FLNN +L+LPQGN FYAMSAPS+ S V+CLLFQ+M+DG+YG
Sbjct: 316 NLTSLENDVMMVFPSITFNFLNNATLLLPQGNSFYAMSAPSDGSVVQCLLFQNMEDGNYG 375
Query: 350 PSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASAQGLHK 391
P+GVFGSFQQQNV+VVYDLEKERIGFQ MDC A++ GL++
Sbjct: 376 PAGVFGSFQQQNVKVVYDLEKERIGFQAMDCVLEAASHGLNQ 417
>gi|449455475|ref|XP_004145478.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
gi|449518962|ref|XP_004166504.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 449
Score = 568 bits (1465), Expect = e-159, Method: Compositional matrix adjust.
Identities = 285/412 (69%), Positives = 336/412 (81%), Gaps = 21/412 (5%)
Query: 1 VIQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNN---KLMSNFSPSRSSSSSRDTCASSFC 57
V+QVYMDTGSDLTWVPCGNLSFDC DC++Y+NN ++ F P+ SS+S RDTC SSFC
Sbjct: 33 VVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNISGPRLAAFLPTHSSTSIRDTCGSSFC 92
Query: 58 LNIHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHG-- 115
++IHSSDNPFDPCT++GCSL++L+K TC RPCPSFAYTYG G+VTG LTRD L HG
Sbjct: 93 MDIHSSDNPFDPCTIAGCSLASLVKGTCPRPCPSFAYTYGASGVVTGSLTRDVLFTHGNY 152
Query: 116 -SSPGIIREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYAN 174
++ ++IP+FCFGCVG+TYREPIGIAGFGRG LS+P QLGF KGFSHCFL FK++N
Sbjct: 153 NNNNNNNKQIPRFCFGCVGATYREPIGIAGFGRGLLSLPFQLGFSHKGFSHCFLPFKFSN 212
Query: 175 DPNISSPLVIGDVAISSKD-NLQFTPMLKSPMYPNYYYIGLEAITIGNSSLT---EVPLS 230
+PN SSPL++G++AISSKD NLQFTP+LKSPMYPNYYYIGLE+ITIGN V
Sbjct: 213 NPNFSSPLILGNLAISSKDENLQFTPLLKSPMYPNYYYIGLESITIGNGDNNFRFGVSFK 272
Query: 231 LREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRV 290
LRE D++GNGG+L+DSGTTYTHLPEP YSQL+S L+ I YPRAK+VE TGFDLCY+V
Sbjct: 273 LREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLELVIG-YPRAKQVELNTGFDLCYKV 331
Query: 291 PCPNN--TFTDDL-FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSM---- 343
PC NN +F DD PSITFHFLNNVS+VLPQGN+FYAM+AP NS+ VKCLL+QSM
Sbjct: 332 PCKNNNSSFVDDAQLPSITFHFLNNVSVVLPQGNNFYAMAAPINSTVVKCLLYQSMDGVG 391
Query: 344 ---DDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASAQGLHKK 392
D D GP+G+FGSFQQQN+EVVYDLEKER+GFQPMDC S A+ QGLHK
Sbjct: 392 DDNDSDDNGPAGIFGSFQQQNIEVVYDLEKERLGFQPMDCVSVAAKQGLHKN 443
>gi|125552953|gb|EAY98662.1| hypothetical protein OsI_20585 [Oryza sativa Indica Group]
Length = 429
Score = 447 bits (1151), Expect = e-123, Method: Compositional matrix adjust.
Identities = 223/391 (57%), Positives = 289/391 (73%), Gaps = 11/391 (2%)
Query: 1 VIQVYMDTGSDLTWVPCG-NLSFDCMDC-DDYRNNKLMSNFSPSRSSSSSRDTCASSFCL 58
V QVY+DTGSDLTWVPCG N S+ C++C +++ +K + +FSPS+SSS+ ++ C S FC+
Sbjct: 37 VFQVYLDTGSDLTWVPCGTNSSYQCLECGNEHSTSKPIPSFSPSQSSSNMKELCGSRFCV 96
Query: 59 NIHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSP 118
+IHSSDN DPC GC++ + + C RPCP F+YTYG G LV G L +D + +HGS
Sbjct: 97 DIHSSDNSHDPCAAVGCAIPSFMSGLCTRPCPPFSYTYGGGALVLGSLAKDIVTLHGSIF 156
Query: 119 GI--IREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDP 176
GI + ++P FCFGCVGS+ REPIGIAGFG+G LS+PSQLGFL KGFSHCFL F++A +P
Sbjct: 157 GIAILLDVPGFCFGCVGSSIREPIGIAGFGKGILSLPSQLGFLDKGFSHCFLGFRFARNP 216
Query: 177 NISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
N +S L++GD+A+S+KD+ FTPMLKS PN+YYIGLE ++IG+ + P SL DS
Sbjct: 217 NFTSSLIMGDLALSAKDDFLFTPMLKSITNPNFYYIGLEGVSIGDGAAIAAPPSLSSIDS 276
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNT 296
+GNGG++VD+GTTYTHLP+PFY+ +LS L S I Y R+ ++E RTGFDLC+++PC +
Sbjct: 277 EGNGGMIVDTGTTYTHLPDPFYTAILSSLASVI-LYERSYDLEMRTGFDLCFKIPCTHTP 335
Query: 297 FTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGD------YGP 350
T D P I FHFL +V L LP+ + +YA++AP NS VKCLLFQ MDD D GP
Sbjct: 336 CTQDELPLINFHFLGDVKLTLPKDSCYYAVTAPKNSVVVKCLLFQRMDDEDDVGGANNGP 395
Query: 351 SGVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
V GSFQ QNVEVVYD+E RIGFQP DCA
Sbjct: 396 GAVLGSFQMQNVEVVYDMEAGRIGFQPKDCA 426
>gi|297724243|ref|NP_001174485.1| Os05g0511050 [Oryza sativa Japonica Group]
gi|222632192|gb|EEE64324.1| hypothetical protein OsJ_19161 [Oryza sativa Japonica Group]
gi|255676482|dbj|BAH93213.1| Os05g0511050 [Oryza sativa Japonica Group]
Length = 432
Score = 440 bits (1132), Expect = e-121, Method: Compositional matrix adjust.
Identities = 220/394 (55%), Positives = 287/394 (72%), Gaps = 14/394 (3%)
Query: 1 VIQVYMDTGSDLTWVPCG-NLSFDCMDC-DDYRNNKLMSNFSPSRSSSSSRDTCASSFCL 58
V QVY+DTGSDLTWVPCG N S+ C++C +++ +K + +FSPS+SSS+ ++ C S FC+
Sbjct: 37 VFQVYLDTGSDLTWVPCGTNSSYQCLECGNEHSTSKPIPSFSPSQSSSNMKELCGSRFCV 96
Query: 59 NIHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSP 118
+IHSSDN DPC GC++ + + C RPCP F+YTYG G LV G L +D + +HGS
Sbjct: 97 DIHSSDNSHDPCAAVGCAIPSFMSDLCTRPCPPFSYTYGGGALVLGSLAKDIVTLHGSIF 156
Query: 119 GI--IREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDP 176
GI + ++P FCFGCVGS+ REPIGIAGFG+G LS+PSQLGFL KGFSHCFL F++A +P
Sbjct: 157 GIAILLDVPGFCFGCVGSSIREPIGIAGFGKGILSLPSQLGFLDKGFSHCFLGFRFARNP 216
Query: 177 NISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
N +S L++GD+A+S+KD+ FTPMLKS PN+YYIGLE ++IG+ + P SL DS
Sbjct: 217 NFTSSLIMGDLALSAKDDFLFTPMLKSITNPNFYYIGLEGVSIGDGAAIAAPPSLSSIDS 276
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNT 296
+GNGG++VD+GTTYTHLP+PFY+ +LS L S I Y R+ ++E RTGFDLC+++PC +
Sbjct: 277 EGNGGMIVDTGTTYTHLPDPFYTAILSSLASVI-LYERSYDLEMRTGFDLCFKIPCTHTP 335
Query: 297 FTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSM---------DDGD 347
T D P I FHFL +V L LP+ + +YA++AP NS VKCLLFQ M +
Sbjct: 336 CTQDELPLINFHFLGDVKLTLPKDSCYYAVTAPKNSVVVKCLLFQRMDNDDDDDDVGGAN 395
Query: 348 YGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
GP V GSFQ QNVEVVYD+E RIGFQP DCA
Sbjct: 396 NGPGAVLGSFQMQNVEVVYDMEAGRIGFQPKDCA 429
>gi|357128791|ref|XP_003566053.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 441
Score = 432 bits (1112), Expect = e-118, Method: Compositional matrix adjust.
Identities = 227/406 (55%), Positives = 281/406 (69%), Gaps = 21/406 (5%)
Query: 1 VIQVYMDTGSDLTWVPCG-NLSFDCMDC-DDYRNNKLMSNFSPSRSSSSSRDTCASSFCL 58
V QVY+DTGSDLTWVPCG N S+ C++C +++ +K FS S+S SS+RD C S FC+
Sbjct: 37 VFQVYLDTGSDLTWVPCGTNTSYQCLECGNEHSISKPTPAFSLSQSYSSTRDLCGSRFCV 96
Query: 59 NIHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSP 118
++HSSDN D C +GCS+ + C R CP FAYTYG LV G L RDT+ +HGS
Sbjct: 97 DVHSSDNSHDACAAAGCSIPVFMSGLCTRLCPPFAYTYGGRALVLGSLARDTIALHGSIY 156
Query: 119 GIIR--EIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDP 176
GI E P FCFGCVGS+ REPIGIAGFG+G LS+PSQLGFL KGFSHCFL F +A +P
Sbjct: 157 GISVPIEFPGFCFGCVGSSIREPIGIAGFGKGKLSLPSQLGFLDKGFSHCFLGFWFARNP 216
Query: 177 NISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
NI+SP+VIGD+A+S KD FTPMLKS YPN+YYIGLE +TIG+++ P SL DS
Sbjct: 217 NITSPMVIGDLALSVKDGFLFTPMLKSLTYPNFYYIGLEGVTIGDNAAIPAPPSLSGIDS 276
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNT 296
+GNGG++VD+GTTYTHL +PFY+ +LS L ST+ Y R+ E+E RTGFDLC +VPC +
Sbjct: 277 EGNGGVIVDTGTTYTHLSDPFYASVLSSLSSTVPYN-RSYELEIRTGFDLCLKVPCMHAP 335
Query: 297 FTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDY-------- 348
DD P IT H +V+L LP+ + +YA++AP NS +KCLLFQ DD
Sbjct: 336 CNDDELPPITVHLGGDVTLALPKESCYYAVTAPRNSVVIKCLLFQRKDDDGVFSADNDDG 395
Query: 349 --------GPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASA 386
GP+ V GSFQ QNVEVVYDLE R+GFQP DCA A
Sbjct: 396 EDASFSAGGPAAVLGSFQMQNVEVVYDLESGRVGFQPRDCALGVGA 441
>gi|242091057|ref|XP_002441361.1| hypothetical protein SORBIDRAFT_09g025220 [Sorghum bicolor]
gi|241946646|gb|EES19791.1| hypothetical protein SORBIDRAFT_09g025220 [Sorghum bicolor]
Length = 439
Score = 418 bits (1075), Expect = e-114, Method: Compositional matrix adjust.
Identities = 211/402 (52%), Positives = 270/402 (67%), Gaps = 23/402 (5%)
Query: 1 VIQVYMDTGSDLTWVPCGNLS-FDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLN 59
V QVY+DTGSDLTWVPCG+ S + C+DC + K F PS S+S++RD C S FC++
Sbjct: 37 VFQVYLDTGSDLTWVPCGSSSSYQCLDCGS--SVKPTPTFLPSESTSNTRDLCGSRFCVD 94
Query: 60 IHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPG 119
+HSSDN FDPC +GC++ C RPCP F+YTYG G LV G L+RD++ +HGS+ G
Sbjct: 95 VHSSDNRFDPCAAAGCAIPAFTGGQCPRPCPPFSYTYGGGALVLGSLSRDSVTLHGSTHG 154
Query: 120 -------IIREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKY 172
+ P F FGCVGS+ REP+GIAGFGRGALS+PSQLGFL KGFSHCFL F++
Sbjct: 155 SGAGAGPLPVAFPGFGFGCVGSSIREPLGIAGFGRGALSLPSQLGFLGKGFSHCFLGFRF 214
Query: 173 ANDPNISSPLVIGDVAISSKD---NLQFTPMLKSPMYPNYYYIGLEAITIGN---SSLTE 226
A +PN +SPLV+GD+A+SS FTPML S YPN+YY+GLE + +G+ S
Sbjct: 215 ARNPNFTSPLVMGDLALSSASTDGGFVFTPMLTSATYPNFYYVGLEGVVLGDDDGGSAMA 274
Query: 227 VPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDL 286
P SL D+QGNGG+LVD+GTTYT LP+PFY+ +L+ L S Y R++++E RTGFDL
Sbjct: 275 APPSLSGIDAQGNGGVLVDTGTTYTQLPDPFYASVLASLISAAPPYERSRDLEARTGFDL 334
Query: 287 CYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMD-- 344
C++VPC DD P IT H L LP+ + +Y ++A +S VKCLLFQ M+
Sbjct: 335 CFKVPCARAPCADDELPPITLHLAGGARLALPKLSSYYPVTAIRDSVVVKCLLFQRMEME 394
Query: 345 -----DGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
GP+ V GSFQ QNVEVVYDL R+GF+P DCA
Sbjct: 395 DDGDGTSGGGPAAVLGSFQMQNVEVVYDLAAGRVGFRPRDCA 436
>gi|359474399|ref|XP_003631454.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 485
Score = 251 bits (642), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 151/407 (37%), Positives = 218/407 (53%), Gaps = 44/407 (10%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
I +YMDTGSDL W PC F+C+ C+ + SP +SS+ +C S C H
Sbjct: 87 ISLYMDTGSDLVWFPCA--PFECILCEGKYDTAATGGLSPPNITSSASVSCKSPACSAAH 144
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCR-PCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
+S + D C M+ C L + S C CP F Y YG+G LV L RD+L + SSP +
Sbjct: 145 TSLSSSDLCAMARCPLELIETSDCSSFSCPPFYYAYGDGSLVAR-LYRDSLSMPASSPLV 203
Query: 121 IREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLG----FLQKGFSHCFLAFKY-AND 175
+ F FGC + EP+G+AGFGRG LS+P+QL L FS+C ++ + A+
Sbjct: 204 LHN---FTFGCAHTALGEPVGVAGFGRGVLSLPAQLASFSPHLGNQFSYCLVSHSFDADR 260
Query: 176 PNISSPLVIGDVAISSKDNLQ---------FTPMLKSPMYPNYYYIGLEAITIGNSSLTE 226
SPL++G ++ + + +T ML +P +P +Y +GLE IT+GN +
Sbjct: 261 VRRPSPLILGRYSLDDEKKKRVGHDRGEFVYTAMLDNPKHPYFYCVGLEGITVGNRKI-P 319
Query: 227 VPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTI-TYYPRAKEVEERTGFD 285
VP L+ D +GNGG++VDSGTT+T LP Y L++ + Y RA ++EERTG
Sbjct: 320 VPEILKRVDRRGNGGMVVDSGTTFTMLPAGLYESLVTEFNHRMGRVYKRATQIEERTGLG 379
Query: 286 LCYRVPCPNNTFTDD---LFPSITFHFLNNVSLVLPQGNHFY----AMSAPSNSSAVKCL 338
CY ++DD P++ HF+ N +++LP+ N++Y V CL
Sbjct: 380 PCY--------YSDDSAAKVPAVALHFVGNSTVILPRNNYYYEFFDGRDGQKKKRKVGCL 431
Query: 339 LFQSMDDGDY----GPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
+ M+ GD GP+ G++QQQ EVVYDLEK R+GF CA
Sbjct: 432 ML--MNGGDEAESGGPAATLGNYQQQGFEVVYDLEKHRVGFARRKCA 476
>gi|255576064|ref|XP_002528927.1| pepsin A, putative [Ricinus communis]
gi|223531629|gb|EEF33456.1| pepsin A, putative [Ricinus communis]
Length = 493
Score = 237 bits (604), Expect = 9e-60, Method: Compositional matrix adjust.
Identities = 150/404 (37%), Positives = 219/404 (54%), Gaps = 37/404 (9%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ +Y+DTGSDL W PC F+C+ C+ N S P SS++ C SS C H
Sbjct: 96 VSLYLDTGSDLVWFPCK--PFECILCEGKAENTTASTPPPRLSSTARSVHCKSSACSAAH 153
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCR-PCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
S+ D C ++ C L ++ S C CPSF Y YG+G LV L D++K+ ++P +
Sbjct: 154 SNLPTSDLCAIADCPLESIETSDCHSFSCPSFYYAYGDGSLVAR-LYHDSIKLPLATPSL 212
Query: 121 IREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGF----LQKGFSHCFLAFKYANDP 176
+ F FGC + EP+G+AGFGRG LS+P+QL L FS+C ++ + +D
Sbjct: 213 --SLHNFTFGCAHTALAEPVGVAGFGRGVLSLPAQLASFAPQLGNRFSYCLVSHSFNSDR 270
Query: 177 -NISSPLVIG----DVAISSKDNLQF--TPMLKSPMYPNYYYIGLEAITIGNSSLTEVPL 229
+ SPL++G +KD++QF T ML +P +P +Y +GLE I+IG + P
Sbjct: 271 LRLPSPLILGHSDDKEKRVNKDDVQFVYTSMLDNPKHPYFYCVGLEGISIGKKKI-PAPE 329
Query: 230 SLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTI-TYYPRAKEVEERTGFDLCY 288
L+ D +G+GG++VDSGTT+T LP Y+ +++ + + Y RAKEVE++TG CY
Sbjct: 330 FLKRVDREGSGGVVVDSGTTFTMLPASLYNSVVAEFDNRVGRVYERAKEVEDKTGLGPCY 389
Query: 289 RVPCPNNTFTDDLFPSITFHFL-NNVSLVLPQGNHFYAM----SAPSNSSAVKCLLFQSM 343
N PS+ HF+ N S+VLP+ N+FY V CL+ M
Sbjct: 390 YYDTVVN------IPSLVLHFVGNESSVVLPKKNYFYDFLDGGDGVRRKRRVGCLML--M 441
Query: 344 DDGDY-----GPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCAS 382
+ G+ GP G++QQ EVVYDLE+ R+GF CAS
Sbjct: 442 NGGEEAELTGGPGATLGNYQQHGFEVVYDLEQRRVGFARRKCAS 485
>gi|220702733|gb|ACL81165.1| aspartyl protease [Mirabilis jalapa]
Length = 499
Score = 236 bits (603), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 147/397 (37%), Positives = 208/397 (52%), Gaps = 26/397 (6%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ VYMDTGSD+ W PC F+C+ C+ +P S SS +C S C H
Sbjct: 105 LSVYMDTGSDIVWFPCS--PFECILCEGKFEP---GTLTPLNVSKSSLISCKSRACSTAH 159
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCR-PCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
+S + D C ++ C L + S C CPSF Y YG+G L+ + + + S+
Sbjct: 160 NSPSTSDLCAIAKCPLDEIETSDCSNYHCPSFYYAYGDGSLIAKLHKHNLIMPSTSNKPF 219
Query: 121 IREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQ----KGFSHCFLAFKY-AND 175
+ F FGC S EPIG+AGFG G+LS+P+QL L FS+C ++ + +
Sbjct: 220 --SLKDFTFGCAHSALGEPIGVAGFGFGSLSLPAQLANLSPDLGNQFSYCLVSHSFDSTK 277
Query: 176 PNISSPLVIGDVAISSKDNLQ---FTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLR 232
+ SPL++G V D + +TPML +P +P +Y + +EAI++G SS P +L
Sbjct: 278 LHHPSPLILGKVKERDFDEITQFVYTPMLDNPKHPYFYSVSMEAISVG-SSRVRAPNALI 336
Query: 233 EFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTI-TYYPRAKEVEERTGFDLCYRVP 291
D GNGG++VDSGTTYT LP FY+ + + L + + RA E E +TG CY +
Sbjct: 337 RIDRDGNGGVVVDSGTTYTMLPTGFYNSVATELDRRVGRVFKRASETESKTGLSPCYYLE 396
Query: 292 CPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAM---SAPSNSSAVKCLLFQSMDDGDY 348
+ P + FHF N S+VLP+ N+FY V CL+ MD GD
Sbjct: 397 GNGVERLGLVVPRLAFHFGGNYSVVLPRRNYFYEFLDGEDEKKGRKVGCLML--MDGGDE 454
Query: 349 ---GPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCAS 382
GP G++QQQ +VVYDLE+ R+GF P CAS
Sbjct: 455 SEGGPGATLGNYQQQGFQVVYDLEERRVGFAPRKCAS 491
>gi|357476865|ref|XP_003608718.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355509773|gb|AES90915.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 482
Score = 228 bits (582), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 151/405 (37%), Positives = 219/405 (54%), Gaps = 44/405 (10%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSP-SRSSSSSRDTCASSFCLNI 60
I +YMDTGSDL W PC F+C+ C+ KL S+ SP + S S+ +C S C
Sbjct: 88 ITLYMDTGSDLVWFPC--TPFNCILCE--LKPKLTSDPSPPTNISHSTPISCNSHACSVA 143
Query: 61 HSSDNPFDPCTMSGCSLSTLLKSTCCR-PCPSFAYTYGEGGLVTGILTRDTLKVHGSSPG 119
HSS D CTM+ C L ++ C CP F Y YG+G L+ L RDTL +
Sbjct: 144 HSSTPSSDLCTMAHCPLDSIETKDCGSFHCPPFYYAYGDGSLIAS-LYRDTLSLS----- 197
Query: 120 IIREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGF----LQKGFSHCFLAFKYAND 175
++ F FGC +T+ EP G+AGFGRG LS+P+QL L FS+C ++ + ++
Sbjct: 198 -TLQLTNFTFGCAHTTFSEPTGVAGFGRGLLSLPAQLATHSPQLGNRFSYCLVSHSFRSE 256
Query: 176 P-NISSPLVIGDVAISSKDN------LQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVP 228
SPL++G + N +T ML++P + +Y +GL+ I++G ++ P
Sbjct: 257 RIRKPSPLILGRYNDEKQSNGDEVVEFVYTSMLENPKHSYFYTVGLKGISVGKKTV-PAP 315
Query: 229 LSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSIL-QSTITYYPRAKEVEERTGFDLC 287
LR + +G+GG++VDSGTT+T LPE FY+ ++ + RA E+E++TG C
Sbjct: 316 KILRRVNKKGDGGVVVDSGTTFTMLPEKFYNSVVEGFDRRARKSNRRAPEIEQKTGLSPC 375
Query: 288 YRVPCPNNTFTDDLFPSITFHFLN-NVSLVLPQGNHFYAM----SAPSNSSAVKCLLFQS 342
Y + T + P++T F+ N S+VLP+ N+FY V CL+F
Sbjct: 376 YYLN------TAAIVPAVTLRFVGMNSSVVLPRKNYFYEFMDGGDGVRRKERVGCLMF-- 427
Query: 343 MDDGDY-----GPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCAS 382
M+ GD GP GV G++QQQ EV YDLEK+R+GF CAS
Sbjct: 428 MNGGDEAEMSGGPGGVLGNYQQQGFEVEYDLEKKRVGFARRKCAS 472
>gi|356540510|ref|XP_003538731.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 417
Score = 228 bits (581), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 144/398 (36%), Positives = 210/398 (52%), Gaps = 40/398 (10%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
I +YMDTGSDL W PC F+C+ C+ N P + S R +C S C H
Sbjct: 33 ITLYMDTGSDLVWFPCA--PFECILCEGKFNAT-----KPLNITRSHRVSCQSPACSTAH 85
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRP-CPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
SS + D C ++ C L + S C CP F Y YG+G + L RDTL +
Sbjct: 86 SSVSSHDLCAIARCPLDNIETSDCSSATCPPFYYAYGDGSFIAH-LHRDTLSMSQLF--- 141
Query: 121 IREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQ----KGFSHCFLAFKYANDP 176
+ F FGC + EP G+AGFGRG LS+P+QL L FS+C ++ + +
Sbjct: 142 ---LKNFTFGCAHTALAEPTGVAGFGRGLLSLPAQLATLSPNLGNRFSYCLVSHSFDKER 198
Query: 177 -NISSPLVIGDVAISSKDNLQF--TPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLRE 233
SPL++G S + ++F T ML++P + +Y +GL I++G ++ P LR
Sbjct: 199 VRKPSPLILGHYDDYSSERVEFVYTSMLRNPKHSYFYCVGLTGISVGKRTIL-APEMLRR 257
Query: 234 FDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTI-TYYPRAKEVEERTGFDLCYRVPC 292
D +G+GG++VDSGTT+T LP Y+ +++ + + RA EVEE+TG CY
Sbjct: 258 VDRRGDGGVVVDSGTTFTMLPASLYNSVVAEFDRRVGRVHKRASEVEEKTGLGPCY---- 313
Query: 293 PNNTFTDDLF--PSITFHFL-NNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGD-- 347
F + L P++T+HFL NN +++LP+ N+FY + + K M+ GD
Sbjct: 314 ----FLEGLVEVPTVTWHFLGNNSNVMLPRMNYFYEFLDGEDEARRKVGCLMLMNGGDDT 369
Query: 348 ---YGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCAS 382
GP + G++QQQ EVVYDLE +R+GF CAS
Sbjct: 370 ELSGGPGAILGNYQQQGFEVVYDLENQRVGFAKRQCAS 407
>gi|356563324|ref|XP_003549914.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 480
Score = 218 bits (554), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 147/405 (36%), Positives = 215/405 (53%), Gaps = 45/405 (11%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
I +YMDTGSDL W PC F C+ C+ N S P+ + S +C S C H
Sbjct: 85 ITLYMDTGSDLVWFPCA--PFKCILCEGKPNEPNAS--PPTNITQSVAVSCKSPACSAAH 140
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCR-PCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
+ P D C + C L ++ S C CP F Y YG+G L+ L RDTL + S
Sbjct: 141 NLAPPSDLCAAARCPLESIETSDCANFKCPPFYYAYGDGSLIAR-LYRDTLSL---SSLF 196
Query: 121 IREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFL--QKG--FSHCFLAFKYANDP 176
+R F FGC +T EP G+AGFGRG LS+P+QL L Q G FS+C ++ + ++
Sbjct: 197 LRN---FTFGCAHTTLAEPTGVAGFGRGLLSLPAQLATLSPQLGNRFSYCLVSHSFDSER 253
Query: 177 -NISSPLVIGDVAISSKDNLQ-------FTPMLKSPMYPNYYYIGLEAITIGNSSLTEVP 228
SPL++G K+ + +T ML++P +P +Y + L I +G ++ P
Sbjct: 254 VRKPSPLILGRYEEKEKEKIGGGVAEFVYTSMLENPKHPYFYTVSLIGIAVGKRTI-PAP 312
Query: 229 LSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITY-YPRAKEVEERTGFDLC 287
LR +++G+GG++VDSGTT+T LP FY+ ++ + RA+++EE+TG C
Sbjct: 313 EMLRRVNNRGDGGVVVDSGTTFTMLPAGFYNSVVDEFDRRVGRDNKRARKIEEKTGLAPC 372
Query: 288 YRVPCPNNTFTDDLFPSITFHFL--NNVSLVLPQGNHFYAMSAPSNSS----AVKCLLFQ 341
Y + N+ D P++T F N S+VLP+ N+FY S S+ + V CL+
Sbjct: 373 YYL----NSVAD--VPALTLRFAGGKNSSVVLPRKNYFYEFSDGSDGAKGKRKVGCLML- 425
Query: 342 SMDDGDY-----GPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
M+ GD GP G++QQQ EV YDLE++R+GF CA
Sbjct: 426 -MNGGDEADLSGGPGATLGNYQQQGFEVEYDLEEKRVGFARRQCA 469
>gi|224101053|ref|XP_002334311.1| predicted protein [Populus trichocarpa]
gi|222871031|gb|EEF08162.1| predicted protein [Populus trichocarpa]
Length = 496
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 150/408 (36%), Positives = 215/408 (52%), Gaps = 39/408 (9%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCD-DYRNNKLMSNFSPSRSSSSSRDTCASSFCLNI 60
I +Y+DTGSDL W PC F+C+ C+ N L S P S +++ +C SS C
Sbjct: 93 IFLYLDTGSDLVWFPCQ--PFECILCEGKAENTSLASTPPPKLSKTATPVSCKSSACSAA 150
Query: 61 HSSDNPFDPCTMSGCSLSTLLKSTCCR-PCPSFAYTYGEGGLVTGILTRDTLKVHGSSPG 119
HS+ D C +S C L ++ S C + CP F Y YG+G L+ L RD++ + S+P
Sbjct: 151 HSNLPSSDLCAISNCPLESIETSDCQKHSCPQFYYAYGDGSLIAR-LYRDSISLPLSNPT 209
Query: 120 IIREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFL--QKG--FSHCFLAFKYAND 175
+ + F FGC + EPIG+AGFGRG LS+P+QL L Q G FS+C ++ + +D
Sbjct: 210 NLI-VNNFTFGCAHTALAEPIGVAGFGRGVLSLPAQLATLSPQLGNQFSYCLVSHSFDSD 268
Query: 176 P-NISSPLVIG---------DVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLT 225
SPL++G V +K +T ML + +P +Y +GLE I+IG +
Sbjct: 269 RLRRPSPLILGRYDHDEKERRVNGVNKPRFVYTSMLDNLEHPYFYCVGLEGISIGRKKI- 327
Query: 226 EVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTI-TYYPRAKEVEERTGF 284
P LR+ D +G+GGL+VDSGTT+T LP Y +++ ++ + RA+ +EE TG
Sbjct: 328 PAPGFLRKVDGEGSGGLVVDSGTTFTMLPASLYGSVVAEFENRVGRVNERARVIEEDTGL 387
Query: 285 DLCYRVPCPNNTFTDDLFPSITFHFL-NNVSLVLPQGNHFYAM----SAPSNSSAVKCLL 339
CY S+ HF+ N S+VLP+ N+FY V CL+
Sbjct: 388 SPCYYFDNNVVNVP-----SVVLHFVGNGSSVVLPRRNYFYEFLDGGDGKGKKRKVGCLM 442
Query: 340 FQSMDDGDY-----GPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCAS 382
M+ GD GP G++QQQ EVVYDLE +R+GF CAS
Sbjct: 443 L--MNGGDEAELSGGPGATLGNYQQQGFEVVYDLENKRVGFARRQCAS 488
>gi|356513737|ref|XP_003525567.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial [Glycine
max]
Length = 455
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 146/406 (35%), Positives = 213/406 (52%), Gaps = 48/406 (11%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
I +YMDTGSDL W PC F C+ C+ N P ++ S +C S C H
Sbjct: 63 ITLYMDTGSDLVWFPCA--PFKCILCEGKPNAS-----PPVNTTRSVAVSCKSPACSAAH 115
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCR-PCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
+ +P D C + C L ++ S C CP F Y YG+G L+ L RDTL + S
Sbjct: 116 NLASPSDLCAAARCPLESIETSDCANFKCPPFYYAYGDGSLI-ARLYRDTLSL---SSLF 171
Query: 121 IREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFL--QKG--FSHCFLAFKYANDP 176
+R F FGC +T EP G+AGFGRG LS+P+QL L Q G FS+C ++ + ++
Sbjct: 172 LRN---FTFGCAYTTLAEPTGVAGFGRGLLSLPAQLATLSPQLGNRFSYCLVSHSFDSER 228
Query: 177 -NISSPLVIGDVAISSKD--------NLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEV 227
SPL++G ++ +TPML++P +P +Y +GL I++G +
Sbjct: 229 VRKPSPLILGRYEEEEEEEKVGGGVAEFVYTPMLENPKHPYFYTVGLIGISVGK-RIVPA 287
Query: 228 PLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYY-PRAKEVEERTGFDL 286
P LR +++G+GG++VDSGTT+T LP FY+ ++ + RA+++EE+TG
Sbjct: 288 PEMLRRVNNRGDGGVVVDSGTTFTMLPAGFYNSVVDEFDRGVGRVNERARKIEEKTGLAP 347
Query: 287 CYRVPCPNNTFTDDLFPSITFHFL-NNVSLVLPQGNHFY----AMSAPSNSSAVKCLLFQ 341
CY + N+ + P +T F N S+VLP+ N+FY A V CL+
Sbjct: 348 CYYL----NSVAE--VPVLTLRFAGGNSSVVLPRKNYFYEFLDGRDAAKGKRRVGCLML- 400
Query: 342 SMDDGDY-----GPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCAS 382
M+ GD GP G++QQQ EV YDLE++R+GF CAS
Sbjct: 401 -MNGGDEAELSGGPGATLGNYQQQGFEVEYDLEEKRVGFARRQCAS 445
>gi|224074147|ref|XP_002304273.1| predicted protein [Populus trichocarpa]
gi|222841705|gb|EEE79252.1| predicted protein [Populus trichocarpa]
Length = 496
Score = 215 bits (548), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 149/412 (36%), Positives = 215/412 (52%), Gaps = 39/412 (9%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCD-DYRNNKLMSNFSPSRSSSSSRDTCASSFCLNI 60
I +Y+DTGSDL W PC F+C+ C+ N L S P S +++ +C SS C +
Sbjct: 93 ISLYLDTGSDLVWFPCQ--PFECILCEGKAENASLASTPPPKLSKTATPVSCKSSACSAV 150
Query: 61 HSSDNPFDPCTMSGCSLSTLLKSTCCR-PCPSFAYTYGEGGLVTGILTRDTLKVHGSSPG 119
HS+ D C +S C L ++ S C + CP F Y YG+G L+ L RD++++ S+
Sbjct: 151 HSNLPSSDLCAISNCPLESIEISDCRKHSCPQFYYAYGDGSLIAR-LYRDSIRLPLSNQT 209
Query: 120 IIREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFL--QKG--FSHCFLAFKYAND 175
+ F FGC +T EPIG+AGFGRG LS+P+QL L Q G FS+C ++ + +D
Sbjct: 210 NL-IFNNFTFGCAHTTLAEPIGVAGFGRGVLSLPAQLATLSPQLGNQFSYCLVSHSFDSD 268
Query: 176 P-NISSPLVIGDVAISSKD---------NLQFTPMLKSPMYPNYYYIGLEAITIGNSSLT 225
SPL++G K+ + +T ML +P +P +Y +GLE I+IG +
Sbjct: 269 RVRRPSPLILGRYDHDEKERRVNGVKKPSFVYTSMLDNPRHPYFYCVGLEGISIGRKKI- 327
Query: 226 EVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTI-TYYPRAKEVEERTGF 284
P LR+ D +G+GG++VDSGTT+T LP Y +++ ++ + RA +EE TG
Sbjct: 328 PAPDFLRKVDRKGSGGVVVDSGTTFTMLPASLYDFVVAEFENRVGRVNERASVIEENTGL 387
Query: 285 DLCYRVPCPNNTFTDDLFPSITFHFL-NNVSLVLPQGNHFYAM----SAPSNSSAVKCLL 339
CY + HF+ N S+VLP+ N+FY V CL+
Sbjct: 388 SPCYYFDNNVVNVP-----RVVLHFVGNGSSVVLPRRNYFYEFLDGGHGKGKKRKVGCLM 442
Query: 340 FQSMDDGDY-----GPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASA 386
M+ GD GP G++QQQ EVVYDLE R+GF CAS A
Sbjct: 443 L--MNGGDEAELSGGPGATLGNYQQQGFEVVYDLENRRVGFARRQCASLWEA 492
>gi|224138580|ref|XP_002326638.1| predicted protein [Populus trichocarpa]
gi|222833960|gb|EEE72437.1| predicted protein [Populus trichocarpa]
Length = 496
Score = 214 bits (546), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 149/408 (36%), Positives = 215/408 (52%), Gaps = 39/408 (9%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCD-DYRNNKLMSNFSPSRSSSSSRDTCASSFCLNI 60
I +Y+DTGSDL W PC F+C+ C+ N L S P S +++ +C SS C
Sbjct: 93 IFLYLDTGSDLVWFPCQ--PFECILCEGKAENTSLASTPPPKLSKTATPVSCKSSACSAA 150
Query: 61 HSSDNPFDPCTMSGCSLSTLLKSTCCR-PCPSFAYTYGEGGLVTGILTRDTLKVHGSSPG 119
HS+ D C +S C L ++ S C + CP F Y YG+G L+ L RD++ + S+P
Sbjct: 151 HSNLPSSDLCAISNCPLESIETSDCQKHSCPQFYYAYGDGSLIAR-LYRDSISLPLSNPT 209
Query: 120 IIREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFL--QKG--FSHCFLAFKYAND 175
+ + F FGC + EPIG+AGFGRG LS+P+QL L Q G FS+C ++ + +D
Sbjct: 210 NLI-VNNFTFGCAHTALAEPIGVAGFGRGVLSLPAQLATLSPQLGNQFSYCLVSHSFDSD 268
Query: 176 P-NISSPLVIG---------DVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLT 225
SPL++G V +K +T ML + +P +Y +GLE I+IG +
Sbjct: 269 RLRRPSPLILGRYDHDEKERRVNGVNKPRFVYTSMLDNLEHPYFYCVGLEGISIGRKKI- 327
Query: 226 EVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTI-TYYPRAKEVEERTGF 284
P LR+ D +G+GGL+VDSGTT+T LP Y +++ ++ + RA+ +EE TG
Sbjct: 328 PAPGFLRKVDGEGSGGLVVDSGTTFTMLPASLYGSVVAEFENRVGRVNERARVIEEDTGL 387
Query: 285 DLCYRVPCPNNTFTDDLFPSITFHFL-NNVSLVLPQGNHFYAM----SAPSNSSAVKCLL 339
CY S+ HF+ N S+VLP+ N+FY V CL+
Sbjct: 388 SPCYYFDNNVVNVP-----SVVLHFVGNGSSVVLPRRNYFYEFLDGGDGKGKKRKVGCLM 442
Query: 340 FQSMDDGDY-----GPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCAS 382
M+ G+ GP G++QQQ EVVYDLE +R+GF CAS
Sbjct: 443 L--MNGGEEAELSGGPGATLGNYQQQGFEVVYDLENKRVGFARRQCAS 488
>gi|226495677|ref|NP_001146995.1| pepsin A precursor [Zea mays]
gi|195606284|gb|ACG24972.1| pepsin A [Zea mays]
Length = 504
Score = 214 bits (544), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 135/401 (33%), Positives = 195/401 (48%), Gaps = 33/401 (8%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ +++DTGSDL W PC F CM C+ P S R CAS C H
Sbjct: 105 VSLFLDTGSDLVWFPCA--PFTCMLCEGKPTPGRSGPLPPP--PDSRRIPCASPLCSAAH 160
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTC--CRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPG 119
+S P D C + C L + +C CP Y YG+G LV + G+
Sbjct: 161 ASAPPSDLCAAARCPLEDIETGSCGASHACPPLYYAYGDGSLVAHLRRGRVALGAGARAS 220
Query: 120 IIREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNI 178
+ + F F C + EP+G+AGFGRG LS+P QL G FS+C ++ + D I
Sbjct: 221 VAVAVDNFTFACAHTALGEPVGVAGFGRGPLSLPGQLSPQLSGRFSYCLVSHSFRADRLI 280
Query: 179 S-SPLVIGD------VAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSL 231
SPL++G A + D +TP+L +P +P +Y + LEA+++G + + P L
Sbjct: 281 RPSPLILGRSPDDADAAAAETDGFVYTPLLHNPKHPYFYSVALEAVSVGAARIQARP-EL 339
Query: 232 REFDSQGNGGLLVDSGTTYTHLPEPFYSQL--LSILQSTITYYPRAKEVEERTGFDLCYR 289
D GNGG++VDSGTT+T LP Y+++ + RA+ EE+TG CYR
Sbjct: 340 ARVDRAGNGGMVVDSGTTFTMLPNEMYARVAEAFARAMAAAGFARAERAEEQTGLTPCYR 399
Query: 290 VPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMS-----APSNSSAVKCLLFQ--- 341
+D P + HF N ++ LP+ N+F A + V CL+
Sbjct: 400 Y-----AASDRGVPPLALHFRGNATVALPRRNYFMGFKSEDAGAGTRKDDVGCLMLMNGG 454
Query: 342 --SMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
S ++GD GP+G G+FQQQ EVVYD++ R+GF C
Sbjct: 455 DASGEEGD-GPAGTLGNFQQQGFEVVYDVDAGRVGFARRRC 494
>gi|414586111|tpg|DAA36682.1| TPA: pepsin A [Zea mays]
Length = 503
Score = 213 bits (543), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 134/400 (33%), Positives = 195/400 (48%), Gaps = 32/400 (8%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ +++DTGSDL W PC F CM C+ S R CAS C H
Sbjct: 105 VSLFLDTGSDLVWFPCA--PFTCMLCEG--KPTPGRLGPLPPPPDSRRIPCASPLCSAAH 160
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTC--CRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPG 119
+S P D C ++ C L + +C CP Y YG+G LV + G+
Sbjct: 161 ASAPPSDLCAVARCPLEDIETGSCGASHACPPLYYAYGDGSLVAHLRRGRVALGAGARAS 220
Query: 120 IIREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLG-FLQKGFSHCFLAFKYANDPNI 178
+ + F F C + EP+G+AGFGRG LS+P QL L FS+C ++ + D I
Sbjct: 221 VAVAVDNFTFACAHTALGEPVGVAGFGRGPLSLPGQLSPQLSGRFSYCLVSHSFRADRLI 280
Query: 179 S-SPLVIGD-----VAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLR 232
SPL++G A + D +TP+L +P +P +Y + LEA+++G + + P L
Sbjct: 281 RPSPLILGRSPDDAAAAAETDGFVYTPLLHNPKHPYFYSVALEAVSVGAARIQARP-ELA 339
Query: 233 EFDSQGNGGLLVDSGTTYTHLPEPFYSQL--LSILQSTITYYPRAKEVEERTGFDLCYRV 290
D GNGG++VDSGTT+T LP Y+++ + RA+ EE+TG CYR
Sbjct: 340 RVDRAGNGGMVVDSGTTFTMLPNEMYARVAEAFARAMAAAGFARAERAEEQTGLTPCYRY 399
Query: 291 PCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMS-----APSNSSAVKCLLFQ---- 341
+D P + HF N ++ LP+ N+F A + V CL+
Sbjct: 400 -----AASDRGVPPLALHFRGNATVALPRRNYFMGFKSEDAGAGTRKDDVGCLMLMNGGD 454
Query: 342 -SMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
S ++GD GP+G G+FQQQ EVVYD++ R+GF C
Sbjct: 455 ASGEEGD-GPAGTLGNFQQQGFEVVYDVDAGRVGFARRRC 493
>gi|357482031|ref|XP_003611301.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355512636|gb|AES94259.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 481
Score = 213 bits (542), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 143/403 (35%), Positives = 207/403 (51%), Gaps = 42/403 (10%)
Query: 1 VIQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNI 60
+I +YMDTGSDL W PC F+C+ C+ +N + S S C S C
Sbjct: 88 LITLYMDTGSDLVWFPCS--PFECILCEGKPQTTKPANITKQTHSVS----CQSPACSAA 141
Query: 61 HSSDNPFDPCTMSGCSLSTLLKSTCCR-PCPSFAYTYGEGGLVTGILTRDTLKVHGSSPG 119
H+S + + C +S C L + S C CP F Y YG+G V L + TL +
Sbjct: 142 HASMSSSNLCAISRCPLDYIETSDCSSFSCPPFYYAYGDGSFVAN-LYQQTLSLSS---- 196
Query: 120 IIREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLG----FLQKGFSHCFLAFKYAND 175
+ F FGC + EP G+AGFGRG LS+P+QL L FS+C ++ + D
Sbjct: 197 --LHLQNFTFGCAHTALAEPTGVAGFGRGILSLPAQLSTLSPHLGNRFSYCLVSHSFDGD 254
Query: 176 P-NISSPLVIG---DVAISSKD----NLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEV 227
SPL++G D + D +T ML +P +P YY +GL I++G ++
Sbjct: 255 RLRRPSPLILGRHNDTITGAGDGESVEFVYTSMLSNPKHPYYYCVGLAGISVGKRTV-PA 313
Query: 228 PLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTIT-YYPRAKEVEERTGFDL 286
P L+ D +GNGG++VDSGTT+T LPE FY+ +++ + ++ RA E+E +TG
Sbjct: 314 PEILKRVDEKGNGGMVVDSGTTFTMLPESFYNAVVNEFDKRVNRFHKRASEIETKTGLGP 373
Query: 287 CYRVPCPNNTFTDDLFPSITFHFL-NNVSLVLPQGNHFYAM----SAPSNSSAVKCLLFQ 341
CY + N + P + HF+ NN +VLP+ N+FY V C++
Sbjct: 374 CYYL----NGLSQ--IPVLKLHFVGNNSDVVLPRKNYFYEFMDGGDGIRRKGKVGCMMLM 427
Query: 342 SMDDG---DYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
+ +D D GP G++QQQ EVVYDLEKER+GF +CA
Sbjct: 428 NGEDETELDGGPGATLGNYQQQGFEVVYDLEKERVGFAKKECA 470
>gi|294463081|gb|ADE77078.1| unknown [Picea sitchensis]
Length = 370
Score = 212 bits (539), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 138/386 (35%), Positives = 204/386 (52%), Gaps = 27/386 (6%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDC-DDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSD 64
MDTGSDL WVPC ++ C++C +D +N + F P SSS TCA S C ++ ++
Sbjct: 1 MDTGSDLVWVPC-TRNYSCINCPEDSASNGV---FLPRMSSSLHLVTCADSNCKTLYGNN 56
Query: 65 NPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREI 124
C SL C CP + YG G G+L +TL + + R I
Sbjct: 57 TEL-LCQSCAGSLKN-----CSETCPPYGIQYGRGS-TAGLLLTETLNLPLENGEGARAI 109
Query: 125 PKFCFGCVGSTYREPIGIAGFGRGALSVPSQLG--FLQKGFSHCFLAFKYANDPNISSPL 182
F GC + ++P GIAGFGRGALS+PSQLG + F++C + ++ ++ N S +
Sbjct: 110 THFAVGCSIVSSQQPSGIAGFGRGALSMPSQLGEHIGKDRFAYCLQSHRF-DEENKKSLM 168
Query: 183 VIGDVAISSKDNLQFTPMLK------SPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
V+GD A+ + L +TP L S Y YYYIGL ++IG L ++P L FD+
Sbjct: 169 VLGDKALPNNIPLNYTPFLTNSRAPPSSQYGVYYYIGLRGVSIGGKRLKQLPSKLLRFDT 228
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNT 296
+GNGG ++DSGTT+T + + + + S I Y RA EVE++TG LCY V N
Sbjct: 229 KGNGGTIIDSGTTFTVFSDEIFKHIAAGFASQIGYR-RAGEVEDKTGMGLCYDVTGLENI 287
Query: 297 FTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGS 356
+ P FHF +VLP N+F S+ +S + + + + + D GP+ + G+
Sbjct: 288 ----VLPEFAFHFKGGSDMVLPVANYFSYFSS-FDSICLTMISSRGLLEVDSGPAVILGN 342
Query: 357 FQQQNVEVVYDLEKERIGFQPMDCAS 382
QQQ+ ++YD EK R+GF C +
Sbjct: 343 DQQQDFYLLYDREKNRLGFTQQTCKT 368
>gi|242076594|ref|XP_002448233.1| hypothetical protein SORBIDRAFT_06g023740 [Sorghum bicolor]
gi|241939416|gb|EES12561.1| hypothetical protein SORBIDRAFT_06g023740 [Sorghum bicolor]
Length = 508
Score = 210 bits (534), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 133/404 (32%), Positives = 195/404 (48%), Gaps = 39/404 (9%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDY----RNNKLMSNFSPSRSSSSSRDTCASSFC 57
+ +++DTGSDL W PC F CM C+ + + S R CAS C
Sbjct: 109 VSLFLDTGSDLVWFPCA--PFTCMLCEGKPTPSGGHSSSAPLPLPPPPDSRRVPCASPLC 166
Query: 58 LNIHSSDNPFDPCTMSGCSLSTLLKSTC---CRPCPSFAYTYGEGGLVTGILTRDTLKVH 114
H+S P D C +GC L + +C CP Y YG+G LV L R + +
Sbjct: 167 SAAHASAPPSDLCAAAGCPLEDIETGSCRGASHACPPLYYAYGDGSLVA-HLRRGRVGL- 224
Query: 115 GSSPGIIREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYA 173
G + F F C + EP+G+AGFGRG LS+P QL G FS+C ++ +
Sbjct: 225 ----GASVAVDNFTFACAHTALGEPVGVAGFGRGPLSLPGQLAPQLSGRFSYCLVSHSFR 280
Query: 174 NDPNIS-SPLVIGDV--AISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLS 230
D I SPL++G A + +TP+L +P +P +Y + LEA+++G + + P
Sbjct: 281 ADRLIRPSPLILGRSPDAAAETGGFVYTPLLHNPKHPYFYSVALEAVSVGATRIQARP-E 339
Query: 231 LREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRA--KEVEERTGFDLCY 288
L D GNGG++VDSGTT+T LP Y+++ + A + EE+TG CY
Sbjct: 340 LARVDRAGNGGMVVDSGTTFTMLPNETYARVAEAFARAMAAAGFARAERAEEQTGLTPCY 399
Query: 289 RVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSA------VKCLLFQS 342
+ +D P + HF N ++ LP+ N+F + + V CL+ +
Sbjct: 400 -----HYAASDRGVPPLALHFRGNATVALPRRNYFMGFKSEEEAGGAGRKDDVGCLMLMN 454
Query: 343 ------MDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
D GD GP+G G+FQQQ EVVYD++ R+GF C
Sbjct: 455 GGDVSGEDGGDDGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRC 498
>gi|115459640|ref|NP_001053420.1| Os04g0535200 [Oryza sativa Japonica Group]
gi|113564991|dbj|BAF15334.1| Os04g0535200 [Oryza sativa Japonica Group]
gi|116310090|emb|CAH67110.1| H0502G05.1 [Oryza sativa Indica Group]
gi|116310464|emb|CAH67468.1| OSIGBa0159I10.13 [Oryza sativa Indica Group]
gi|215715343|dbj|BAG95094.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215765807|dbj|BAG87504.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767550|dbj|BAG99778.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218195278|gb|EEC77705.1| hypothetical protein OsI_16781 [Oryza sativa Indica Group]
Length = 492
Score = 209 bits (533), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 136/395 (34%), Positives = 202/395 (51%), Gaps = 31/395 (7%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ +++DTGSDL W PC F CM C+ + S R +CAS C H
Sbjct: 103 VSLFLDTGSDLVWFPCA--PFTCMLCEGKATPGGNHSSPLPPPIDSRRISCASPLCSAAH 160
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCC-RPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
SS D C + C L + +C CP Y YG+G LV L R + + S
Sbjct: 161 SSAPTSDLCAAARCPLDAIETDSCASHACPPLYYAYGDGSLVAN-LRRGRVGLAAS---- 215
Query: 121 IREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLG-FLQKGFSHCFLAFKYANDPNI- 178
+ F F C + EP+G+AGFGRG LS+P+QL L FS+C +A + D I
Sbjct: 216 -MAVENFTFACAHTALAEPVGVAGFGRGPLSLPAQLAPSLSGRFSYCLVAHSFRADRLIR 274
Query: 179 SSPLVIG---DVAI--SSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLRE 233
SSPL++G D A +S+ + +TP+L +P +P +Y + LEA+++G + P L +
Sbjct: 275 SSPLILGRSTDAAAIGASETDFVYTPLLHNPKHPYFYSVALEAVSVGGKRIQAQP-ELGD 333
Query: 234 FDSQGNGGLLVDSGTTYTHLPEPFYSQLL--SILQSTITYYPRAKEVEERTGFDLCYRVP 291
D GNGG++VDSGTT+T LP ++++ + RA+ E +TG P
Sbjct: 334 VDRDGNGGMVVDSGTTFTMLPSDTFARVADEFARAMAAARFTRAEGAEAQTGL-----AP 388
Query: 292 CPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSM----DDGD 347
C + + +D P + HF N ++ LP+ N+F + S V CL+ ++ DDG+
Sbjct: 389 CYHYSPSDRAVPPVALHFRGNATVALPRRNYFMGFKSEEGRS-VGCLMLMNVGGNNDDGE 447
Query: 348 --YGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
GP+G G+FQQQ EVVYD++ R+GF C
Sbjct: 448 DGGGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRC 482
>gi|38605896|emb|CAD41523.2| OSJNBb0020O11.8 [Oryza sativa Japonica Group]
Length = 519
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 136/395 (34%), Positives = 202/395 (51%), Gaps = 31/395 (7%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ +++DTGSDL W PC F CM C+ + S R +CAS C H
Sbjct: 103 VSLFLDTGSDLVWFPCA--PFTCMLCEGKATPGGNHSSPLPPPIDSRRISCASPLCSAAH 160
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCC-RPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
SS D C + C L + +C CP Y YG+G LV L R + + S
Sbjct: 161 SSAPTSDLCAAARCPLDAIETDSCASHACPPLYYAYGDGSLVAN-LRRGRVGLAAS---- 215
Query: 121 IREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLG-FLQKGFSHCFLAFKYANDPNI- 178
+ F F C + EP+G+AGFGRG LS+P+QL L FS+C +A + D I
Sbjct: 216 -MAVENFTFACAHTALAEPVGVAGFGRGPLSLPAQLAPSLSGRFSYCLVAHSFRADRLIR 274
Query: 179 SSPLVIG---DVAI--SSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLRE 233
SSPL++G D A +S+ + +TP+L +P +P +Y + LEA+++G + P L +
Sbjct: 275 SSPLILGRSTDAAAIGASETDFVYTPLLHNPKHPYFYSVALEAVSVGGKRIQAQP-ELGD 333
Query: 234 FDSQGNGGLLVDSGTTYTHLPEPFYSQLL--SILQSTITYYPRAKEVEERTGFDLCYRVP 291
D GNGG++VDSGTT+T LP ++++ + RA+ E +TG P
Sbjct: 334 VDRDGNGGMVVDSGTTFTMLPSDTFARVADEFARAMAAARFTRAEGAEAQTGL-----AP 388
Query: 292 CPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSM----DDGD 347
C + + +D P + HF N ++ LP+ N+F + S V CL+ ++ DDG+
Sbjct: 389 CYHYSPSDRAVPPVALHFRGNATVALPRRNYFMGFKSEEGRS-VGCLMLMNVGGNNDDGE 447
Query: 348 --YGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
GP+G G+FQQQ EVVYD++ R+GF C
Sbjct: 448 DGGGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRC 482
>gi|449458942|ref|XP_004147205.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
gi|449505000|ref|XP_004162350.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 480
Score = 203 bits (516), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 141/399 (35%), Positives = 207/399 (51%), Gaps = 34/399 (8%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
I +YMDTGSDL W PC F+C+ C+ K+ S P +++ S A++
Sbjct: 89 ISLYMDTGSDLVWFPCS--PFECILCEG--KPKIQSPL-PKIANNKSVSCSAAACSAAHG 143
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCR-PCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
S + C +S C L ++ S C CP F Y YG+G LV L RD+L + +P
Sbjct: 144 GSLSASHLCAISRCPLESIEISECSSFSCPPFYYAYGDGSLVAR-LYRDSLSLPTPAPSP 202
Query: 121 IREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGF----LQKGFSHCFLAFKYANDP 176
+ F FGC +T EP+G+AGFGRG LS+PSQL L FS+C ++ +A D
Sbjct: 203 PINVRNFTFGCAHTTLGEPVGVAGFGRGVLSMPSQLATFSPQLGNRFSYCLVSHSFAADR 262
Query: 177 -NISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFD 235
SPL++G + + +T +L++P +P +Y +GL I++GN + P L + D
Sbjct: 263 VRRPSPLILGRY-YTGETEFIYTSLLENPKHPYFYSVGLAGISVGNIRI-PAPEFLTKVD 320
Query: 236 SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQS-TITYYPRAKEVEERTGFDLCYRVPCPN 294
G+GG++VDSGTT+T LP Y +++ ++ T RA+ +EE TG CY
Sbjct: 321 EGGSGGVVVDSGTTFTMLPAGLYESVVAEFENRTGKVANRARRIEENTGLSPCYYY---E 377
Query: 295 NTFTDDLFPSITFHFLNNVS-LVLPQGNHFYAM-----SAPSNSSAVKCLLFQSMDDGDY 348
N+ P + HF+ S +VLP+ N+FY V CL+ M+ GD
Sbjct: 378 NSVG---VPRVVLHFVGEKSNVVLPRKNYFYEFLDGGDGVVGRKRKVGCLML--MNGGDE 432
Query: 349 -----GPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCAS 382
GP G++QQQ EVVYDLEK R+GF C++
Sbjct: 433 AELAGGPGATLGNYQQQGFEVVYDLEKNRVGFARRQCST 471
>gi|357164972|ref|XP_003580227.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 492
Score = 202 bits (515), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 128/396 (32%), Positives = 192/396 (48%), Gaps = 28/396 (7%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDY----RNNKLMSNFSPSRSSSSSRDTCASSFC 57
+ +++DTGSDL W PC F CM C+ NN + P + S R CAS FC
Sbjct: 98 VSLFLDTGSDLVWFPCA--PFTCMLCEGKPTPPGNNNSSNPLPPP--TDSRRIPCASPFC 153
Query: 58 LNIHSSDNPFDPCTMSGCSLSTLLKSTCC--RPCPSFAYTYGEGGLVTGILTRDTLKVHG 115
HSS P D C + C L + +C CP Y YG+G LV L R + +
Sbjct: 154 SAAHSSAPPADLCAAARCPLDDIETGSCAASHACPPLYYAYGDGSLVA-RLRRGRVGIAA 212
Query: 116 SSPGIIREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLG--FLQKGFSHCFLAFKY- 172
S + F F C + EP+G+AGFGRG LS+P+QL L FS+C +A +
Sbjct: 213 SV-----AVENFTFACAHTALGEPVGVAGFGRGPLSLPAQLAPAALSGRFSYCLVAHSFR 267
Query: 173 ANDPNISSPLVIGDVA---ISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPL 229
A+ P SPL++G +S+ + +TP+L +P +P +Y + LEA+++G + + P
Sbjct: 268 ADRPIRPSPLILGRSPGEDPASETGIVYTPLLHNPKHPYFYSVALEAVSVGGTRIPARP- 326
Query: 230 SLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVE--ERTGFDLC 287
L G+GG++VDSGTT+T LP Y+++ + + ++TG C
Sbjct: 327 ELGRVGRAGDGGMVVDSGTTFTMLPNETYARVAEEFGRAMAAARFERAEAAEDQTGLAPC 386
Query: 288 Y---RVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMD 344
Y + P + HF ++VLP+ N+F + +L +
Sbjct: 387 YYYDHDASAAEEGSARAVPPLAMHFRGEATVVLPRRNYFMGFRSEERRRVGCLMLMNGGE 446
Query: 345 DGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
D GP+G G+FQQQ EVVYD++ R+GF C
Sbjct: 447 DDGGGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRC 482
>gi|223973231|gb|ACN30803.1| unknown [Zea mays]
Length = 459
Score = 195 bits (496), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 133/401 (33%), Positives = 204/401 (50%), Gaps = 38/401 (9%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ V +DTGS LTWVPC + S++C +C + + F P SSSS C + C +H
Sbjct: 80 LPVLLDTGSHLTWVPCTS-SYECRNCSS-PSASAVPVFHPKNSSSSRLVGCRNPSCQWVH 137
Query: 62 SSDNPFDPCTMSGCSLSTL-LKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
S+ N C + CS + CP +A YG G G+L DTL+ G
Sbjct: 138 SAANLATKCRRAPCSPGAANCPAAASNVCPPYAVVYGSGS-TAGLLIADTLRAPG----- 191
Query: 121 IREIPKFCFGC-VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNIS 179
R +P F GC + S ++ P G+AGFGRGA SVP+QLG + FS+C L+ ++ ++ +S
Sbjct: 192 -RAVPGFVLGCSLVSVHQPPSGLAGFGRGAPSVPAQLGLPK--FSYCLLSRRFDDNAAVS 248
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPM-----YPNYYYIGLEAITIGNSSLTEVPLSLREF 234
LV+ + +Q+ P++KS Y YYY+ L +T+G ++ +P
Sbjct: 249 GSLVL--GGTGGGEGMQYVPLVKSAAGDKLPYGVYYYLALRGVTVGGKAV-RLPARAFAA 305
Query: 235 DSQGNGGLLVDSGTTYTHL-PEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCP 293
++ G+GG +VDSGTT+T+L P F +++ + Y R+K+ E+ G C+ +P
Sbjct: 306 NAAGSGGTIVDSGTTFTYLDPTVFQPVADAVVAAVGGRYKRSKDAEDELGLHPCFALPQG 365
Query: 294 NNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDD-------- 345
+ P ++FHF + LP N+F + AV+ + + D
Sbjct: 366 ARSMA---LPELSFHFEGGAVMQLPVENYFVV----AGRGAVEAICLAVVTDFSGGSGAG 418
Query: 346 -GDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTAS 385
GP+ + GSFQQQN V YDLEKER+GF+ C S+ S
Sbjct: 419 NEGSGPAIILGSFQQQNYLVEYDLEKERLGFRRQSCTSSPS 459
>gi|224035171|gb|ACN36661.1| unknown [Zea mays]
Length = 378
Score = 194 bits (492), Expect = 9e-47, Method: Compositional matrix adjust.
Identities = 121/355 (34%), Positives = 177/355 (49%), Gaps = 28/355 (7%)
Query: 47 SSRDTCASSFCLNIHSSDNPFDPCTMSGCSLSTLLKSTC--CRPCPSFAYTYGEGGLVTG 104
S R CAS C H+S P D C ++ C L + +C CP Y YG+G LV
Sbjct: 21 SRRIPCASPLCSAAHASAPPSDLCAVARCPLEDIETGSCGASHACPPLYYAYGDGSLVAH 80
Query: 105 ILTRDTLKVHGSSPGIIREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLG-FLQKGF 163
+ G+ + + F F C + EP+G+AGFGRG LS+P QL L F
Sbjct: 81 LRRGRVALGAGARASVAVAVDNFTFACAHTALGEPVGVAGFGRGPLSLPGQLSPQLSGRF 140
Query: 164 SHCFLAFKYANDPNIS-SPLVIGD-----VAISSKDNLQFTPMLKSPMYPNYYYIGLEAI 217
S+C ++ + D I SPL++G A + D +TP+L +P +P +Y + LEA+
Sbjct: 141 SYCLVSHSFRADRLIRPSPLILGRSPDDAAAAAETDGFVYTPLLHNPKHPYFYSVALEAV 200
Query: 218 TIGNSSLTEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQL--LSILQSTITYYPRA 275
++G + + P L D GNGG++VDSGTT+T LP Y+++ + RA
Sbjct: 201 SVGAARIQARP-ELARVDRAGNGGMVVDSGTTFTMLPNEMYARVAEAFARAMAAAGFARA 259
Query: 276 KEVEERTGFDLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMS-----APS 330
+ EE+TG CYR +D P + HF N ++ LP+ N+F A +
Sbjct: 260 ERAEEQTGLTPCYRY-----AASDRGVPPLALHFRGNATVALPRRNYFMGFKSEDAGAGT 314
Query: 331 NSSAVKCLLFQ-----SMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
V CL+ S ++GD GP+G G+FQQQ EVVYD++ R+GF C
Sbjct: 315 RKDDVGCLMLMNGGDASGEEGD-GPAGTLGNFQQQGFEVVYDVDAGRVGFARRRC 368
>gi|226531872|ref|NP_001147022.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195606574|gb|ACG25117.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 491
Score = 193 bits (490), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 133/401 (33%), Positives = 204/401 (50%), Gaps = 38/401 (9%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ V +DTGS LTWVPC + S++C +C + + F P SSSS C + C +H
Sbjct: 112 LPVLLDTGSHLTWVPCTS-SYECRNCSS-PSASAVPVFHPKNSSSSRLVGCRNPSCQWVH 169
Query: 62 SSDNPFDPCTMSGCSLSTL-LKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
S+ N C + CS + CP +A YG G G+L DTL+ G
Sbjct: 170 SAANLATKCRRAPCSPGAANCPAAASNVCPPYAVVYGSGS-TAGLLIADTLRAPG----- 223
Query: 121 IREIPKFCFGC-VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNIS 179
R +P F GC + S ++ P G+AGFGRGA SVP+QLG + FS+C L+ ++ ++ +S
Sbjct: 224 -RAVPGFVLGCSLVSVHQPPSGLAGFGRGAPSVPAQLGLPK--FSYCLLSRRFDDNAAVS 280
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPM-----YPNYYYIGLEAITIGNSSLTEVPLSLREF 234
LV+ + +Q+ P++KS Y YYY+ L +T+G ++ +P
Sbjct: 281 GSLVL--GGTGGGEGMQYVPLVKSAAGDKLPYGVYYYLALRGVTVGGKAV-RLPARAFAG 337
Query: 235 DSQGNGGLLVDSGTTYTHL-PEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCP 293
++ G+GG +VDSGTT+T+L P F +++ + Y R+K+ E+ G C+ +P
Sbjct: 338 NAAGSGGTIVDSGTTFTYLDPTVFQPVADAVVAAVGGRYKRSKDAEDGLGLHPCFALPQG 397
Query: 294 NNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDD-------- 345
+ P ++FHF + LP N+F + AV+ + + D
Sbjct: 398 ARSMA---LPELSFHFEGGAVMQLPVENYFVV----AGRGAVEAICLAVVTDFGGGSGAG 450
Query: 346 -GDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTAS 385
GP+ + GSFQQQN V YDLEKER+GF+ C S+ S
Sbjct: 451 NEGSGPAIILGSFQQQNYLVEYDLEKERLGFRRQSCTSSPS 491
>gi|222629275|gb|EEE61407.1| hypothetical protein OsJ_15596 [Oryza sativa Japonica Group]
Length = 466
Score = 190 bits (483), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 125/388 (32%), Positives = 186/388 (47%), Gaps = 43/388 (11%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ +++DTGSDL W PC F CM C+ + S R +CAS C H
Sbjct: 103 VSLFLDTGSDLVWFPCA--PFTCMLCEGKATPGGNHSSPLPPPIDSRRISCASPLCSAAH 160
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCC-RPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
SS D C + C L + +C CP Y YG+G LV L R + + S
Sbjct: 161 SSAPTSDLCAAARCPLDAIETDSCASHACPPLYYAYGDGSLVAN-LRRGRVGLAAS---- 215
Query: 121 IREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISS 180
+ F F C + EP+G+AGFGRG LS+P+QL P++S
Sbjct: 216 -MAVENFTFACAHTALAEPVGVAGFGRGPLSLPAQLA------------------PSLSG 256
Query: 181 PLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNG 240
+ S D +TP+L +P +P +Y + LEA+++G + P L + D GNG
Sbjct: 257 STDAAAIGASETD-FVYTPLLHNPKHPYFYSVALEAVSVGGKRIQAQP-ELGDVDRDGNG 314
Query: 241 GLLVDSGTTYTHLPEPFYSQLL--SILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFT 298
G++VDSGTT+T LP ++++ + RA+ E +TG CY + + +
Sbjct: 315 GMVVDSGTTFTMLPSDTFARVADEFARAMAAARFTRAEGAEAQTGLAPCY-----HYSPS 369
Query: 299 DDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSM----DDGD--YGPSG 352
D P + HF N ++ LP+ N+F + S V CL+ ++ DDG+ GP+G
Sbjct: 370 DRAVPPVALHFRGNATVALPRRNYFMGFKSEEGRS-VGCLMLMNVGGNNDDGEDGGGPAG 428
Query: 353 VFGSFQQQNVEVVYDLEKERIGFQPMDC 380
G+FQQQ EVVYD++ R+GF C
Sbjct: 429 TLGNFQQQGFEVVYDVDAGRVGFARRRC 456
>gi|194708432|gb|ACF88300.1| unknown [Zea mays]
Length = 452
Score = 189 bits (480), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 130/395 (32%), Positives = 200/395 (50%), Gaps = 38/395 (9%)
Query: 8 TGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDNPF 67
+GS LTWVPC + S++C +C + + F P SSSS C + C +HS+ N
Sbjct: 79 SGSHLTWVPCTS-SYECRNCSS-PSASAVPVFHPKNSSSSRLVGCRNPSCQWVHSAANLA 136
Query: 68 DPCTMSGCSLSTL-LKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIPK 126
C + CS + CP +A YG G G+L DTL+ G R +P
Sbjct: 137 TKCRRAPCSPGAANCPAAASNVCPPYAVVYGSGS-TAGLLIADTLRAPG------RAVPG 189
Query: 127 FCFGC-VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSPLVIG 185
F GC + S ++ P G+AGFGRGA SVP+QLG + FS+C L+ ++ ++ +S LV+
Sbjct: 190 FVLGCSLVSVHQPPSGLAGFGRGAPSVPAQLGLPK--FSYCLLSRRFDDNAAVSGSLVL- 246
Query: 186 DVAISSKDNLQFTPMLKSPM-----YPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNG 240
+ +Q+ P++KS Y YYY+ L +T+G ++ +P ++ G+G
Sbjct: 247 -GGTGGGEGMQYVPLVKSAAGDKLPYGVYYYLALRGVTVGGKAV-RLPARAFAANAAGSG 304
Query: 241 GLLVDSGTTYTHL-PEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTD 299
G +VDSGTT+T+L P F +++ + Y R+K+ E+ G C+ +P +
Sbjct: 305 GTIVDSGTTFTYLDPTVFQPVADAVVAAVGGRYKRSKDAEDELGLHPCFALPQGARSMA- 363
Query: 300 DLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDD---------GDYGP 350
P ++FHF + LP N+F + AV+ + + D GP
Sbjct: 364 --LPELSFHFEGGAVMQLPVENYFVV----AGRGAVEAICLAVVTDFSGGSGAGNEGSGP 417
Query: 351 SGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTAS 385
+ + GSFQQQN V YDLEKER+GF+ C S+ S
Sbjct: 418 AIILGSFQQQNYLVEYDLEKERLGFRRQSCTSSPS 452
>gi|449493359|ref|XP_004159266.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 511
Score = 181 bits (459), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 127/384 (33%), Positives = 182/384 (47%), Gaps = 31/384 (8%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCD-DYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSD 64
DTGS L W PC + C C Y + +S F P SSS C + C I
Sbjct: 149 FDTGSSLVWFPC-TAGYRCSRCSFPYVDPATISKFVPKLSSSVKVVGCRNPKCAWI---- 203
Query: 65 NPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREI 124
F P S C C CP + YG G GIL +TL + + +
Sbjct: 204 --FGPNLKSRCRNCNSKSRKCSDSCPGYGLQYGSGA-TAGILLSETLDLEN------KRV 254
Query: 125 PKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSPLVI 184
P F GC + +P GIAGFGRG S+PSQ+ K FSHC ++ + + P +SSPLV+
Sbjct: 255 PDFLVGCSVMSVHQPAGIAGFGRGPESLPSQMRL--KRFSHCLVSRGFDDSP-VSSPLVL 311
Query: 185 GDVAISSKDNLQ---FTPMLKSPMYPN-----YYYIGLEAITIGNSSLTEVPLSLREFDS 236
+ S + + + P ++P N YYY+ L I IG + + P DS
Sbjct: 312 DSGSESDESKTKSFIYAPFRENPSVSNAAFREYYYLSLRRILIGGKPV-KFPYKYLVPDS 370
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNT 296
GNGG ++DSG+T+T L +P + + L+ + YPRAK+VE ++G C+ +P +
Sbjct: 371 TGNGGAIIDSGSTFTFLDKPIFEAIADELEKQLVKYPRAKDVEAQSGLRPCFNIPKEEES 430
Query: 297 FTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGS 356
FP + F L L N+ AM + + +++ G GP+ + G+
Sbjct: 431 AE---FPDVVLKFKGGGKLSLAAENYL-AMVTDEGVVCLTMMTDEAVVGGGGGPAIILGA 486
Query: 357 FQQQNVEVVYDLEKERIGFQPMDC 380
FQQQNV V YDL K+RIGF+ C
Sbjct: 487 FQQQNVLVEYDLAKQRIGFRKQKC 510
>gi|222629809|gb|EEE61941.1| hypothetical protein OsJ_16693 [Oryza sativa Japonica Group]
Length = 648
Score = 180 bits (456), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 139/414 (33%), Positives = 203/414 (49%), Gaps = 53/414 (12%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ V +DTGS L+WVPC + S+ C +C + F P SSSS C + CL IH
Sbjct: 102 LPVLLDTGSHLSWVPCTS-SYQCRNCSSLSAASPLHVFHPKNSSSSRLIGCRNPSCLWIH 160
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRP--------CPSFAYTYGEGGLVTGILTRDTLKV 113
S D+ +S C ++ C P CP + YG G G+L DTL+
Sbjct: 161 SPDH------LSDCRAASSCPGANCTPRNANANNVCPPYLVVYGSGS-TAGLLISDTLRT 213
Query: 114 HGSSPGIIREIPKFCFGC-VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKY 172
G R + F GC + S ++ P G+AGFGRGA SVPSQLG + FS+C L+ ++
Sbjct: 214 PG------RAVRNFVIGCSLASVHQPPSGLAGFGRGAPSVPSQLGLTK--FSYCLLSRRF 265
Query: 173 ANDPNISSPLVIGDVAISSKDN-LQFTPMLKS----PMYPNYYYIGLEAITIGNSSLTEV 227
++ +S L++G +Q+ P+ +S P Y YYY+ L AIT+G S V
Sbjct: 266 DDNAAVSGELILGGAGGKDGGVGMQYAPLARSASARPPYSVYYYLALTAITVGGKS---V 322
Query: 228 PLSLREF-DSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTIT-YYPRAKEVEERTGFD 285
L R F GG +VDSGTT+++ + + + + + + Y R+K VEE G
Sbjct: 323 QLPERAFVAGGAGGGAIVDSGTTFSYFDRTVFEPVAAAVVAAVGGRYSRSKVVEEGLGLS 382
Query: 286 LCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMS------APSNSSAVKCLL 339
C+ +P T P ++ HF + LP N+F AP+ + A+ CL
Sbjct: 383 PCFAMPPGTKTME---LPEMSLHFKGGSVMNLPVENYFVVAGPAPSGGAPAMAEAI-CLA 438
Query: 340 FQS--------MDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTAS 385
S GP+ + GSFQQQN + YDLEKER+GF+ CAS+++
Sbjct: 439 VVSDVPTSSGGAGVSSGGPAIILGSFQQQNYYIEYDLEKERLGFRRQQCASSSN 492
>gi|90399145|emb|CAJ86169.1| H0913C04.10 [Oryza sativa Indica Group]
gi|125550292|gb|EAY96114.1| hypothetical protein OsI_17992 [Oryza sativa Indica Group]
Length = 491
Score = 179 bits (453), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 139/413 (33%), Positives = 202/413 (48%), Gaps = 53/413 (12%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ V +DTGS L+WVPC + S+ C +C + F P SSSS C + CL IH
Sbjct: 102 LPVLLDTGSHLSWVPCTS-SYQCRNCSSLSAASPLHVFHPKNSSSSRLIGCRNPSCLWIH 160
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRP--------CPSFAYTYGEGGLVTGILTRDTLKV 113
S D+ +S C ++ C P CP + YG G G+L DTL+
Sbjct: 161 SPDH------LSDCRAASSCPGANCTPRNANANNVCPPYLVVYGSGS-TAGLLISDTLRT 213
Query: 114 HGSSPGIIREIPKFCFGC-VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKY 172
G R + F GC + S ++ P G+AGFGRGA SVPSQLG + FS+C L+ ++
Sbjct: 214 PG------RAVRNFVIGCSLASVHQPPSGLAGFGRGAPSVPSQLGLTK--FSYCLLSRRF 265
Query: 173 ANDPNISSPLVIGDVAISSKDN-LQFTPMLKS----PMYPNYYYIGLEAITIGNSSLTEV 227
++ +S L++G +Q+ P+ +S P Y YYY+ L AIT+G S V
Sbjct: 266 DDNAAVSGELILGGAGGKDGGVGMQYAPLARSASARPPYSVYYYLALTAITVGGKS---V 322
Query: 228 PLSLREF-DSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTIT-YYPRAKEVEERTGFD 285
L R F GG +VDSGTT+++ + + + + + + Y R+K VEE G
Sbjct: 323 QLPERAFVAGGAGGGAIVDSGTTFSYFDRTVFEPVAAAVVAAVGGRYSRSKVVEEGLGLS 382
Query: 286 LCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMS------APSNSSAVKCLL 339
C+ +P T P ++ HF + LP N+F AP+ + A+ CL
Sbjct: 383 PCFAMPPGTKTME---LPEMSLHFKGGSVMNLPVENYFVVAGPAPSGGAPAMAEAI-CLA 438
Query: 340 FQS--------MDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTA 384
S GP+ + GSFQQQN + YDLEKER+GF+ CAS++
Sbjct: 439 VVSDVPTSSGGAGVSSGGPAIILGSFQQQNYYIEYDLEKERLGFRRQQCASSS 491
>gi|242074844|ref|XP_002447358.1| hypothetical protein SORBIDRAFT_06g033560 [Sorghum bicolor]
gi|241938541|gb|EES11686.1| hypothetical protein SORBIDRAFT_06g033560 [Sorghum bicolor]
Length = 497
Score = 178 bits (452), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 138/411 (33%), Positives = 206/411 (50%), Gaps = 57/411 (13%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ V +DTGS LTWVPC + ++DC +C + F P SSSS C + CL +H
Sbjct: 116 LPVLLDTGSQLTWVPCTS-NYDCRNCSS-PFAAAVPVFHPKNSSSSRLVGCRNPSCLWVH 173
Query: 62 SSDNPFD---PCTMSGCSLSTLLKSTCCRP----CPSFAYTYGEGGLVTGILTRDTLKVH 114
S+++ PC+ + C P CP +A YG G G+L DTL+
Sbjct: 174 SAEHVAKCRAPCS----------RGANCTPASNVCPPYAVVYGSGS-TAGLLIADTLRAP 222
Query: 115 GSSPGIIREIPKFCFGC-VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYA 173
G R + F GC + S ++ P G+AGFGRGA SVP+QLG + FS+C L+ ++
Sbjct: 223 G------RAVSGFVLGCSLVSVHQPPSGLAGFGRGAPSVPAQLGLSK--FSYCLLSRRFD 274
Query: 174 NDPNISSPLVIGDVAISSKDNLQFTPMLKSPM-----YPNYYYIGLEAITIGNSSLTEVP 228
++ +S LV+G D +Q+ P++KS Y YYY+ L +T+G ++ +P
Sbjct: 275 DNAAVSGSLVLG----GDNDGMQYVPLVKSAAGDKQPYAVYYYLALSGVTVGGKAV-RLP 329
Query: 229 LSLREFDSQGNGGLLVDSGTTYTHL-PEPFYSQLLSILQSTITYYPRAKEVEERTGFDLC 287
++ G+GG +VDSGTT+T+L P F +++ + Y R+K+VEE G C
Sbjct: 330 ARAFAANAAGSGGAIVDSGTTFTYLDPTVFQPVADAVVAAVGGRYKRSKDVEEGLGLHPC 389
Query: 288 YRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFY-AMSAPSNS-------------S 333
+ +P + P ++ HF + LP N+F A AP +
Sbjct: 390 FALPQGAKSMA---LPELSLHFKGGAVMQLPLENYFVVAGRAPVPGAGAGAGAAEAICLA 446
Query: 334 AVKCLLFQSMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTA 384
V D GP+ + GSFQQQN V YDLEKER+GF+ CAS++
Sbjct: 447 VVTDFGGSGAGDEGGGPAIILGSFQQQNYLVEYDLEKERLGFRRQPCASSS 497
>gi|297800470|ref|XP_002868119.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297313955|gb|EFH44378.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 499
Score = 178 bits (451), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 141/416 (33%), Positives = 198/416 (47%), Gaps = 59/416 (14%)
Query: 5 YMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSD 64
Y+DTGSDL W PC F C+ C+ S +++ S + + S HSS
Sbjct: 97 YLDTGSDLVWFPC--RPFTCILCESKPLPPSPPPTLSSSATTVSCSSPSCS---AAHSSL 151
Query: 65 NPFDPCTMSGCSLSTLLKSTC---CRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
D C +S C L + C PCP F Y YG+G LV + + D+L S P +
Sbjct: 152 PSSDLCAISNCPLDYIETGDCNTSSYPCPPFYYAYGDGSLVAKLFS-DSL----SLPSV- 205
Query: 122 REIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGF----LQKGFSHCFLAFKYANDP- 176
+ F FGC +T EPIG+AGFGRG LS+P+QL L FS+C ++ + +D
Sbjct: 206 -SVANFTFGCAHTTLAEPIGVAGFGRGRLSLPAQLSVHSPHLGNSFSYCLVSHSFDSDRV 264
Query: 177 NISSPLVIGDVAISSKDNLQ-------------------FTPMLKSPMYPNYYYIGLEAI 217
SPL++G + + FT ML +P +P +Y + L+ I
Sbjct: 265 RRPSPLILGRFVDKKEKRVATTDDDDDGDETKKKKNEFVFTEMLVNPKHPYFYSVSLQGI 324
Query: 218 TIGNSSLTEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTI-TYYPRAK 276
+IG ++ P LR D G GG++VDSGTT+T LP FY+ ++ S + + RA
Sbjct: 325 SIGKRNI-PAPAMLRRIDKNGGGGVVVDSGTTFTMLPAKFYNSVVEEFDSRVGRVHERAD 383
Query: 277 EVEERTGFDLCYRVPCPNNTFTDDLFPSITFHFLNNVSLV-LPQGNHFYAM----SAPSN 331
VE +G CY + N T P++ HF N S V LP+ N+FY
Sbjct: 384 RVEPSSGMSPCYYL---NQTVK---VPALVLHFAGNGSTVTLPRRNYFYEFMDGGDGKEE 437
Query: 332 SSAVKCLLFQSMDDGDY-----GPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCAS 382
V CL+ M+ GD G + G++QQQ EVVYDL R+GF CAS
Sbjct: 438 KRKVGCLML--MNGGDESELRGGTGAILGNYQQQGFEVVYDLLNRRVGFAKRKCAS 491
>gi|326529727|dbj|BAK04810.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 488
Score = 178 bits (451), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 143/408 (35%), Positives = 209/408 (51%), Gaps = 42/408 (10%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ V +DTGS LTWVPC + ++ C +C + F P SSSS +C+S CL IH
Sbjct: 99 LPVLLDTGSHLTWVPCTS-NYQCQNCSAAAGS--FPVFHPKSSSSSLLVSCSSPSCLWIH 155
Query: 62 SSDNPFD------PCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHG 115
S + D PC S + S +T CP + YG G G+L DTL++
Sbjct: 156 SKSHLSDCARDSAPCRPSTANCS----ATATNVCPPYLVVYGSGS-TAGLLVSDTLRL-- 208
Query: 116 SSPGIIREIPKFCFGC-VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYAN 174
S G F GC + S ++ P G+AGFGRGA SVP+QLG FS+C L+ ++ +
Sbjct: 209 SPRGAASR--NFAVGCSLASVHQPPSGLAGFGRGAPSVPAQLGV--NKFSYCLLSRRFDD 264
Query: 175 DPNISSPLVIG-DVAISSKDNLQFTPMLKS----PMYPNYYYIGLEAITIGNSSLTEVPL 229
D IS LV+G A +K +Q+ P+LK+ P Y YYY+ L I +G S+
Sbjct: 265 DAAISGELVLGASSAGKAKAMMQYAPLLKNAGARPPYSVYYYLSLTGIAVGGKSVALPAR 324
Query: 230 SLREFDSQGNGGLLVDSGTTYTHL-PEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCY 288
+L G GG ++DSGTT+T+L P F +++ + Y R+K+VE G C+
Sbjct: 325 ALAPVSGGGGGGAIIDSGTTFTYLDPTVFKPVAAAMVAAVGGRYNRSKDVEGALGLRPCF 384
Query: 289 RVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDY 348
+P T DL P ++ HF + LP N+F A + P++ A + + + D
Sbjct: 385 ALPAGARTM--DL-PELSLHFSGGAEMRLPIENYFLA-AGPASGVAPEAICLAVVSDVSS 440
Query: 349 G-----------PSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTAS 385
P+ + GSFQQQN +V YDLEK R+GF+ C+S++S
Sbjct: 441 ASGGAGVSGGGGPAIILGSFQQQNYQVEYDLEKNRLGFRQQPCSSSSS 488
>gi|255565759|ref|XP_002523869.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223536957|gb|EEF38595.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 447
Score = 178 bits (451), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 134/387 (34%), Positives = 188/387 (48%), Gaps = 46/387 (11%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
MDTGS W PC L + C +C +S F P SSSS C + C IH +D
Sbjct: 94 MDTGSSFVWFPC-TLRYLCNNCS---FTSRISPFLPKHSSSSKIIGCKNPKCSWIHQTDL 149
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
C + C + CP + YG G G+ +TL +HG +I +P
Sbjct: 150 RCTDCDNNS--------RNCSQICPPYLILYGSG-TTGGVALSETLHLHG----LI--VP 194
Query: 126 KFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSPLVIG 185
F GC + R+P GIAGFGRG S+PSQLG + FS+C L+ K+ +D SS LV+
Sbjct: 195 NFLVGCSVFSSRQPAGIAGFGRGPSSLPSQLGLTK--FSYCLLSHKF-DDTQESSSLVLD 251
Query: 186 DVAISSKDN--LQFTPMLKSP------MYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
+ S K L +TP++K+P + YYY+ L I+IG S+ ++P D
Sbjct: 252 SQSDSDKKTAALMYTPLVKNPKVQDKPAFSVYYYVSLRRISIGGRSV-KIPYKYLSPDKD 310
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTF 297
GNGG ++DSGTT+T++ + L + S + Y RA VE +G PC N +
Sbjct: 311 GNGGTIIDSGTTFTYMSTEAFEILSNEFISQVKNYERALMVEALSGLK-----PCFNVSG 365
Query: 298 TDDL-FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDY---GPSGV 353
+L P + HF + LP N+F + S V C F + DG GP +
Sbjct: 366 AKELELPQLRLHFKGGADVELPLENYFAFL----GSREVAC--FTVVTDGAEKASGPGMI 419
Query: 354 FGSFQQQNVEVVYDLEKERIGFQPMDC 380
G+FQ QN V YDL+ ER+GF+ C
Sbjct: 420 LGNFQMQNFYVEYDLQNERLGFKKESC 446
>gi|224091907|ref|XP_002309394.1| predicted protein [Populus trichocarpa]
gi|222855370|gb|EEE92917.1| predicted protein [Populus trichocarpa]
Length = 469
Score = 177 bits (448), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 130/389 (33%), Positives = 189/389 (48%), Gaps = 40/389 (10%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKL--MSNFSPSRSSSSSRDTCASSFCLNIHSS 63
MDTGS L W PC + + C CD + N ++ + F P +SSSS+ C + C +
Sbjct: 109 MDTGSSLVWFPCTS-RYLCSRCD-FPNIEVTGIPTFIPKQSSSSNLIGCKNHKCSWL--- 163
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
F P S C C + CP + YG G G+L +TL +
Sbjct: 164 ---FGPKVQSKCQECDPTTQNCTQSCPPYVIQYGLGS-TAGLLLSETLDFPHK-----KT 214
Query: 124 IPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSPLV 183
IP F GC + R+P GIAGFGR S+PSQLG K FS+C ++ + + P SS LV
Sbjct: 215 IPGFLVGCSLFSIRQPEGIAGFGRSPESLPSQLGL--KKFSYCLVSHAFDDTP-ASSDLV 271
Query: 184 IGDVAISSKDN----LQFTPMLKSP--MYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
+ D S D L +TP K+P + +YYY+ L I IG++ + +VP S
Sbjct: 272 L-DTGSGSDDTKTPGLSYTPFQKNPTAAFRDYYYVLLRNIVIGDTHV-KVPYKFLVPGSD 329
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTF 297
GNGG +VDSGTT+T + +P Y + + + +Y A EV+ +TG C+ + +
Sbjct: 330 GNGGTIVDSGTTFTFMEKPVYELVAKEFEKQVAHYTVATEVQNQTGLRPCFNISGEKSVS 389
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDY----GPSGV 353
P FHF + LP N+F + S V CL S + GP+ +
Sbjct: 390 V----PEFIFHFKGGAKMALPLANYFSFV-----DSGVICLTIVSDNMSGSGIGGGPAII 440
Query: 354 FGSFQQQNVEVVYDLEKERIGFQPMDCAS 382
G++QQ+N V +DL+ ER GF+ +C S
Sbjct: 441 LGNYQQRNFHVEFDLKNERFGFKQQNCVS 469
>gi|168025534|ref|XP_001765289.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162683608|gb|EDQ70017.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 372
Score = 174 bits (442), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 126/383 (32%), Positives = 198/383 (51%), Gaps = 55/383 (14%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V +DTGSDLTW+ S C C + + F PS+SS+ ++ C+SS C +
Sbjct: 40 VIIDTGSDLTWIQ----SEPCRACFEQADPI----FDPSKSSTYNKIACSSSACAD---- 87
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPS--FAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
LL + C + +AY YG+G + G +++T+ ++ +
Sbjct: 88 ----------------LLGTQTCSAAANCIYAYGYGDGSVTRGYFSKETITATDTAGEEV 131
Query: 122 R-EIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLG-FLQKGFSHCFLAFKYANDPNIS 179
+ + G G T E GI G G+G +S+PSQLG L FS+C + + A +
Sbjct: 132 KFGASVYNTGTFGDTGGE--GILGLGQGPVSMPSQLGSVLGNKFSYCLVDWLSAGSE--T 187
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
S + GD A+ S + +Q+TP++ + +P YYYI ++ I++G S L ++ S+ E DS G+
Sbjct: 188 STMYFGDAAVPSGE-VQYTPIVPNADHPTYYYIAVQGISVGGS-LLDIDQSVYEIDSGGS 245
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTD 299
GG ++DSGTT T+L + ++ L++ S + Y TG DLC+ +
Sbjct: 246 GGTIIDSGTTITYLQQEVFNALVAAYTSQVRY----PTTTSATGLDLCFNTRGTGSP--- 298
Query: 300 DLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQ 359
+FP++T H L+ V L LP N F ++ + + CL F S D P +FG+ QQ
Sbjct: 299 -VFPAMTIH-LDGVHLELPTANTFISLE-----TNIICLAFASALDF---PIAIFGNIQQ 348
Query: 360 QNVEVVYDLEKERIGFQPMDCAS 382
QN ++VYDL+ RIGF P DCAS
Sbjct: 349 QNFDIVYDLDNMRIGFAPADCAS 371
>gi|449437856|ref|XP_004136706.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 457
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 124/391 (31%), Positives = 188/391 (48%), Gaps = 38/391 (9%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKL-MSNFSPSRSSSSSRDTCASSFCLNI 60
+ + DTGS L W PC + + C +C + + + F P SSSS C + C I
Sbjct: 94 LHLIFDTGSSLVWFPCTS-RYLCSECSFPKIDPTGIPRFVPKLSSSSKLVGCQNPKCSWI 152
Query: 61 HSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
F P S C C + CP++ YG G G+L +TL
Sbjct: 153 ------FGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGS-TAGLLLSETLDFPD----- 200
Query: 121 IREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISS 180
++IP F GC + +P GIAGFGRG+ S+PSQ+G K F++C + K+ + P+ S
Sbjct: 201 -KKIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGL--KKFAYCLASRKFDDSPH-SG 256
Query: 181 PLVIGDVAISSKDNLQFTPMLKSP-----MYPNYYYIGLEAITIGNSSLTEVPLSLREFD 235
L++ + S L +TP ++P Y YYY+ + I +GN ++ +VP
Sbjct: 257 QLILDSTGVKS-SGLTYTPFRQNPSVSNNAYKEYYYLNIRKIIVGNQAV-KVPYKFLVPG 314
Query: 236 SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNN 295
GNGG ++DSG+T+T + +P + + + + RA +VE TG C+ + +
Sbjct: 315 PDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLANWTRATDVETLTGLRPCFDISKEKS 374
Query: 296 TFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCL--LFQSMDD---GDYGP 350
FP + F F LP N+F +S SS V CL + M+D G GP
Sbjct: 375 V----KFPELIFQFKGGAKWALPLNNYFALVS----SSGVACLTVVTHQMEDGGGGGGGP 426
Query: 351 SGVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
S + G+FQQQN V YDL +R+GF+ C+
Sbjct: 427 SVILGAFQQQNFYVEYDLVNQRLGFRQQTCS 457
>gi|18414692|ref|NP_567506.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|15809800|gb|AAL06828.1| AT4g16560/dl4305c [Arabidopsis thaliana]
gi|18377815|gb|AAL67094.1| AT4g16560/dl4305c [Arabidopsis thaliana]
gi|332658370|gb|AEE83770.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 499
Score = 173 bits (438), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 136/416 (32%), Positives = 194/416 (46%), Gaps = 59/416 (14%)
Query: 5 YMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSD 64
Y+DTGSDL W PC F C+ C+ + S +++ S + + S HSS
Sbjct: 99 YLDTGSDLVWFPC--RPFTCILCESKPLPPSPPSSLSSSATTVSCSSPSCS---AAHSSL 153
Query: 65 NPFDPCTMSGCSLSTLLKSTC---CRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
D C +S C L + C PCP F Y YG+G LV + + S
Sbjct: 154 PSSDLCAISNCPLDFIETGDCNTSSYPCPPFYYAYGDGSLVAKLYSDSLSLPSVSVS--- 210
Query: 122 REIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGF----LQKGFSHCFLAFKYANDP- 176
F FGC +T EPIG+AGFGRG LS+P+QL L FS+C ++ + +D
Sbjct: 211 ----NFTFGCAHTTLAEPIGVAGFGRGRLSLPAQLAVHSPHLGNSFSYCLVSHSFDSDRV 266
Query: 177 NISSPLVIGDVAISSK-------------------DNLQFTPMLKSPMYPNYYYIGLEAI 217
SPL++G + + FT ML++P +P +Y + L+ I
Sbjct: 267 RRPSPLILGRFVDKKEKRVGTTDDHDDGDDEKKKKNEFVFTEMLENPKHPYFYSVSLQGI 326
Query: 218 TIGNSSLTEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTI-TYYPRAK 276
+IG ++ P LR D G GG++VDSGTT+T LP FY+ ++ S + + RA
Sbjct: 327 SIGKRNI-PAPAMLRRIDKNGGGGVVVDSGTTFTMLPAKFYNSVVEEFDSRVGRVHERAD 385
Query: 277 EVEERTGFDLCYRVPCPNNTFTDDLFPSITFHFL-NNVSLVLPQGNHFYAM----SAPSN 331
VE +G CY + N T P++ HF N S+ LP+ N+FY
Sbjct: 386 RVEPSSGMSPCYYL---NQTVK---VPALVLHFAGNRSSVTLPRRNYFYEFMDGGDGKEE 439
Query: 332 SSAVKCLLFQSMDDGDY-----GPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCAS 382
+ CL+ M+ GD G + G++QQQ EVVYDL R+GF CAS
Sbjct: 440 KRKIGCLML--MNGGDESELRGGTGAILGNYQQQGFEVVYDLLNRRVGFAKRKCAS 493
>gi|449522369|ref|XP_004168199.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Cucumis sativus]
Length = 457
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 124/391 (31%), Positives = 187/391 (47%), Gaps = 38/391 (9%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKL-MSNFSPSRSSSSSRDTCASSFCLNI 60
+ + DTGS L W PC + + C +C + + + F P SSSS C + C I
Sbjct: 94 LHLIFDTGSSLVWFPCTS-RYLCSECSFPKIDPTGIPRFVPKLSSSSKLVGCQNPKCSWI 152
Query: 61 HSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
F P S C C + CP++ YG G G+L +TL
Sbjct: 153 ------FGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGS-TAGLLLSETLDFPD----- 200
Query: 121 IREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISS 180
+ IP F GC + +P GIAGFGRG+ S+PSQ+G K F++C + K+ + P+ S
Sbjct: 201 -KXIPNFVVGCSFLSIHQPSGIAGFGRGSESLPSQMGL--KKFAYCLASRKFDDSPH-SG 256
Query: 181 PLVIGDVAISSKDNLQFTPMLKSP-----MYPNYYYIGLEAITIGNSSLTEVPLSLREFD 235
L++ + S L +TP ++P Y YYY+ + I +GN ++ +VP
Sbjct: 257 QLILDSTGVKS-SGLTYTPFRQNPSVSNNAYKEYYYLNIRKIIVGNQAV-KVPYKFLVPG 314
Query: 236 SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNN 295
GNGG ++DSG+T+T + +P + + + + RA +VE TG C+ + +
Sbjct: 315 PDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLANWTRATDVETLTGLRPCFDISKEKS 374
Query: 296 TFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCL--LFQSMDD---GDYGP 350
FP + F F LP N+F +S SS V CL + M+D G GP
Sbjct: 375 V----KFPELIFQFKGGAKWALPLNNYFALVS----SSGVACLTVVTHQMEDGGGGGGGP 426
Query: 351 SGVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
S + G+FQQQN V YDL +R+GF+ C+
Sbjct: 427 SVILGAFQQQNFYVEYDLVNQRLGFRQQTCS 457
>gi|356498711|ref|XP_003518193.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 466
Score = 172 bits (435), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 122/391 (31%), Positives = 184/391 (47%), Gaps = 43/391 (10%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGS L W+PC + + C C+ + N F P SSSS C + C + D
Sbjct: 103 LDTGSTLVWLPCSS-HYLCSKCNSFSNTP---KFIPKNSSSSKFVGCTNPKCAWVFGPDV 158
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
C + + C + CP++ YG G G L + L ++
Sbjct: 159 KSHCCRQDKAAFNN-----CSQTCPAYTVQYGLGS-TAGFLLSENLNFP------TKKYS 206
Query: 126 KFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSPLVIG 185
F GC + +P GIAGFGRG S+PSQ+ + FS+C L+ ++ + I+S LV+
Sbjct: 207 DFLLGCSVVSVYQPAGIAGFGRGEESLPSQMNLTR--FSYCLLSHQFDDSATITSNLVLE 264
Query: 186 DVAISSKDN----LQFTPMLKSP------MYPNYYYIGLEAITIGNSSLTEVPLSLREFD 235
SS+D + +TP LK+P + YYYI L+ I +G + VP L E +
Sbjct: 265 TA--SSRDGKTNGVSYTPFLKNPTTKKNPAFGAYYYITLKRIVVGEKRV-RVPRRLLEPN 321
Query: 236 SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNN 295
G+GG +VDSG+T+T + P + + ++Y RA+E E++ G C+ +
Sbjct: 322 VDGDGGFIVDSGSTFTFMERPIFDLVAQEFAKQVSY-TRAREAEKQFGLSPCFVLAGGAE 380
Query: 296 TFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMD----DGDYGPS 351
T + FP + F F + LP N+F + V CL S D G GP+
Sbjct: 381 TAS---FPELRFEFRGGAKMRLPVANYFSLV----GKGDVACLTIVSDDVAGSGGTVGPA 433
Query: 352 GVFGSFQQQNVEVVYDLEKERIGFQPMDCAS 382
+ G++QQQN V YDLE ER GF+ C +
Sbjct: 434 VILGNYQQQNFYVEYDLENERFGFRSQSCQT 464
>gi|168041176|ref|XP_001773068.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675615|gb|EDQ62108.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 365
Score = 172 bits (435), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 125/388 (32%), Positives = 191/388 (49%), Gaps = 57/388 (14%)
Query: 1 VIQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNI 60
V V +DTGSDLTWV C C C Y N + F P+ S+S ++ C S+ C +
Sbjct: 25 VFSVIVDTGSDLTWVQCS----PCGKC--YSQNDAL--FLPNTSTSFTKLACGSALCNGL 76
Query: 61 HSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
PF C + C + Y+YG+G L TG DT+ + G + G
Sbjct: 77 -----PFPMCNQTTCV---------------YWYSYGDGSLTTGDFVYDTITMDGIN-GQ 115
Query: 121 IREIPKFCFGC---VGSTYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDP 176
+++P F FGC ++ GI G G+G LS SQL + G FS+C + + P
Sbjct: 116 KQQVPNFAFGCGHDNEGSFAGADGILGLGQGPLSFHSQLKSVYNGKFSYCLV--DWLAPP 173
Query: 177 NISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
+SPL+ GD A+ ++++ P+L +P P YYY+ L I++G+ +L + ++ + DS
Sbjct: 174 TQTSPLLFGDAAVPILPDVKYLPILANPKVPTYYYVKLNGISVGD-NLLNISSTVFDIDS 232
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNT 296
G G + DSGTT T L E Y ++L+ + ++ Y R ++++ + DLC +
Sbjct: 233 VGGAGTIFDSGTTVTQLAEAAYKEVLAAMNASTMAYSR--KIDDISRLDLCL------SG 284
Query: 297 FTDDLFPSI---TFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGV 353
F D P++ TFHF +VLP N+F + SS C S D + +
Sbjct: 285 FPKDQLPTVPAMTFHFEGG-DMVLPPSNYFIYL----ESSQSYCFAMTSSPDVN-----I 334
Query: 354 FGSFQQQNVEVVYDLEKERIGFQPMDCA 381
GS QQQN +V YD ++GF P DC
Sbjct: 335 IGSVQQQNFQVYYDTAGRKLGFVPKDCV 362
>gi|224102847|ref|XP_002312826.1| predicted protein [Populus trichocarpa]
gi|222849234|gb|EEE86781.1| predicted protein [Populus trichocarpa]
Length = 445
Score = 172 bits (435), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 131/392 (33%), Positives = 181/392 (46%), Gaps = 47/392 (11%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKL--MSNFSPSRSSSSSRDTCASSFCLNIHSS 63
MDTGSD+ W PC + + C C ++ + F P SSSS C + C IH S
Sbjct: 84 MDTGSDIVWFPCTS-HYLCKHCSFSSSSPSSRIQPFIPKESSSSKLLGCKNPKCSWIHHS 142
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
+ D CS+ + L TC P + YG G G+ +TL +H S
Sbjct: 143 NINCD----QDCSIKSCLNQTC----PPYMIFYGSG-TTGGVALSETLHLHSLSK----- 188
Query: 124 IPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSPLV 183
P F GC + +P GIAGFGRG S+PSQLG + FS+C L+ ++ +D SS LV
Sbjct: 189 -PNFLVGCSVFSSHQPAGIAGFGRGLSSLPSQLGLGK--FSYCLLSHRFDDDTKKSSSLV 245
Query: 184 IGDVAISSKDN---LQFTPMLKSPMYPN------YYYIGLEAITIGNSSLTEVPLSLREF 234
+ + S L +TP +K+P N YYY+GL IT+G + +VP
Sbjct: 246 LDMEQLDSDKKTNALVYTPFVKNPKVDNKSSFSVYYYLGLRRITVGGHHV-KVPYKYLSP 304
Query: 235 DSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPN 294
GNGG+++DSGTT+T + + L I Y R KE+E+ G C+ V
Sbjct: 305 GEDGNGGVIIDSGTTFTFMAREAFEPLSDEFIRQIKDYRRVKEIEDAIGLRPCFNVSDAK 364
Query: 295 NTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGV- 353
FP + +F + LP N+F + V CL + DG GP V
Sbjct: 365 TV----SFPELRLYFKGGADVALPVENYFAFVGG-----EVACLTV--VTDGVAGPERVG 413
Query: 354 -----FGSFQQQNVEVVYDLEKERIGFQPMDC 380
G+FQ QN V YDL ER+GF+ C
Sbjct: 414 GPGMILGNFQMQNFYVEYDLRNERLGFKQEKC 445
>gi|225464832|ref|XP_002272243.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 467
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 126/393 (32%), Positives = 188/393 (47%), Gaps = 43/393 (10%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + MDTGSDL W PC + + C +C +N + F P SSSS C + C IH
Sbjct: 103 LPLIMDTGSDLVWFPCTH-RYVCRNCSFSTSNPSSNIFIPKSSSSSKVLGCVNPKCGWIH 161
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
S S C C + CP + YG G + GI+ +TL + G
Sbjct: 162 GSK------VQSRCRDCEPTSPNCTQICPPYLVFYGSG-ITGGIMLSETLDLPG------ 208
Query: 122 REIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSP 181
+ +P F GC + +P GI+GFGRG S+PSQLG K FS+C L+ +Y +D SS
Sbjct: 209 KGVPNFIVGCSVLSTSQPAGISGFGRGPPSLPSQLGL--KKFSYCLLSRRY-DDTTESSS 265
Query: 182 LVIGDVAISSKDN--LQFTPMLKSP------MYPNYYYIGLEAITIGNSSLTEVPLSLRE 233
LV+ + S + L +TP +++P + YYY+GL IT+G + ++P
Sbjct: 266 LVLDGESDSGEKTAGLSYTPFVQNPKVAGKHAFSVYYYLGLRHITVGGKHV-KIPYKYLI 324
Query: 234 FDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCP 293
+ G+GG ++DSGTT+T++ + + + + + RA EVE TG C+ +
Sbjct: 325 PGADGDGGTIIDSGTTFTYMKGEIFELVAAEFEKQVQSK-RATEVEGITGLRPCFNI--- 380
Query: 294 NNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYG---- 349
+ FP +T F + LP N+ + V CL + DG G
Sbjct: 381 -SGLNTPSFPELTLKFRGGAEMELPLANYVAFLGG----DDVVCLTI--VTDGAAGKEFS 433
Query: 350 --PSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
P+ + G+FQQQN V YDL ER+GF+ C
Sbjct: 434 GGPAIILGNFQQQNFYVEYDLRNERLGFRQQSC 466
>gi|32488713|emb|CAE03456.1| OSJNBa0088H09.14 [Oryza sativa Japonica Group]
Length = 490
Score = 167 bits (423), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 136/413 (32%), Positives = 199/413 (48%), Gaps = 54/413 (13%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ V ++TGS L+WVP + S +C + F P SSSS C + CL IH
Sbjct: 102 LPVLLETGSHLSWVP--STSSYSANCSSLSAASPLHVFHPKNSSSSRLIGCRNPSCLWIH 159
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRP--------CPSFAYTYGEGGLVTGILTRDTLKV 113
S D+ +S C ++ C P CP + YG G G+L DTL+
Sbjct: 160 SPDH------LSDCRAASSCPGANCTPRNANANNVCPPYLVVYGSGS-TAGLLISDTLRT 212
Query: 114 HGSSPGIIREIPKFCFGC-VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKY 172
G R + F GC + S ++ P G+AGFGRGA SVPSQLG + FS+C L+ ++
Sbjct: 213 PG------RAVRNFVIGCSLASVHQPPSGLAGFGRGAPSVPSQLGLTK--FSYCLLSRRF 264
Query: 173 ANDPNISSPLVIGDVAISSKDN-LQFTPMLKS----PMYPNYYYIGLEAITIGNSSLTEV 227
++ +S L++G +Q+ P+ +S P Y YYY+ L AIT+G S V
Sbjct: 265 DDNAAVSGELILGGAGGKDGGVGMQYAPLARSASARPPYSVYYYLALTAITVGGKS---V 321
Query: 228 PLSLREF-DSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTIT-YYPRAKEVEERTGFD 285
L R F GG +VDSGTT+++ + + + + + + Y R+K VEE G
Sbjct: 322 QLPERAFVAGGAGGGAIVDSGTTFSYFDRTVFEPVAAAVVAAVGGRYSRSKVVEEGLGLS 381
Query: 286 LCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMS------APSNSSAVKCLL 339
C+ +P T P ++ HF + LP N+F AP+ + A+ CL
Sbjct: 382 PCFAMPPGTKTME---LPEMSLHFKGGSVMNLPVENYFVVAGPAPSGGAPAMAEAI-CLA 437
Query: 340 FQS--------MDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTA 384
S GP+ + GSFQQQN + YDLEKER+GF+ CAS++
Sbjct: 438 VVSDVPTSSGGAGVSSGGPAIILGSFQQQNYYIEYDLEKERLGFRRQQCASSS 490
>gi|77808087|gb|AAS48510.2| aspartic protease [Fagopyrum esculentum]
gi|82780908|gb|ABB88696.2| aspartic proteinase-like protein [Fagopyrum esculentum]
Length = 447
Score = 167 bits (423), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 118/394 (29%), Positives = 191/394 (48%), Gaps = 53/394 (13%)
Query: 2 IQVYMDTGSDLTWVPCG--NLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLN 59
+ + +DTGS L W PC ++ C +C S P++ +R+ ++ L
Sbjct: 87 VSLVLDTGSSLVWTPCTIPTATYTCQNCT-------FSGVDPTKIPIYARNKSSTVQSL- 138
Query: 60 IHSSDNPFDPCTMSGCS--LSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSS 117
PC C+ + L + + CP + YG G TG L D L +
Sbjct: 139 ---------PCRSPKCNWVFGSDLNCSTTKRCPYYGLEYGLGS-TTGQLVSDVLGLSK-- 186
Query: 118 PGIIREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPN 177
+ IP F FGC + R+P GIAGFGRG S+P+QLG + FS+C ++ ++ + P
Sbjct: 187 ---LNRIPDFLFGCSLVSNRQPEGIAGFGRGLASIPAQLGLTK--FSYCLVSHRFDDTPQ 241
Query: 178 ISSPLVI---GDVAISSKDNLQFTPMLKSPM---YPNYYYIGLEAITIGNSSLTEVPLSL 231
S LV+ A ++ + + + P KSP Y YYYI L I +G +VP+
Sbjct: 242 -SGDLVLHRGRRHADAAANGVAYAPFTKSPALSPYSEYYYISLSKILVGGK---DVPIPP 297
Query: 232 REF--DSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYR 289
R +G+GG++VDSG+T+T + + + L+ +T Y RAKE+E+ +G CY
Sbjct: 298 RYLVPSKEGDGGMIVDSGSTFTFMERIIFDPVARELEKHMTKYKRAKEIEDSSGLGPCYN 357
Query: 290 VPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDD---G 346
+ ++ P +TF F ++ LP ++F + + V C+ + D
Sbjct: 358 I----TGQSEVDVPKLTFSFKGGANMDLPLTDYFSLV-----TDGVVCMTVLTDPDEPGS 408
Query: 347 DYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
GP+ + G++QQQN + YDL+K+R GF+P C
Sbjct: 409 TTGPAIILGNYQQQNFYIEYDLKKQRFGFKPQQC 442
>gi|224140036|ref|XP_002323393.1| predicted protein [Populus trichocarpa]
gi|222868023|gb|EEF05154.1| predicted protein [Populus trichocarpa]
Length = 459
Score = 167 bits (423), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 126/386 (32%), Positives = 180/386 (46%), Gaps = 36/386 (9%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKL-MSNFSPSRSSSSSRDTCASSFCLNIHSSD 64
MDTGS L W PC + + C +C+ K + F P SSSS C + C I
Sbjct: 100 MDTGSSLVWFPCTS-RYLCSECNFPNIKKTGIPTFLPKLSSSSKLIGCKNPRCSMI---- 154
Query: 65 NPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREI 124
F P S C C + CP + YG G G+L +TL + I
Sbjct: 155 --FGPEIQSKCQECDSTAQNCTQTCPPYVIQYGSGS-TAGLLLSETLDFPNK-----KTI 206
Query: 125 PKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSPLVI 184
P F GC + ++P GIAGFGR S+PSQLG K FS+C ++ + + P SS LV+
Sbjct: 207 PDFLVGCSIFSIKQPEGIAGFGRSPESLPSQLGL--KKFSYCLVSHAFDDTPT-SSDLVL 263
Query: 185 ---GDVAISSKDNLQFTPMLKSP--MYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
++ L TP LK+P + +YYY+ L I IG++ + +VP + GN
Sbjct: 264 DTGSGSGVTKTAGLSHTPFLKNPTTAFRDYYYVLLRNIVIGDTHV-KVPYKFLVPGTDGN 322
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTD 299
GG +VDSGTT+T + P Y + + + +Y A E++ TG CY + +
Sbjct: 323 GGTIVDSGTTFTFMENPVYELVAKEFEKQMAHYTVATEIQNLTGLRPCYNISGEKSLSVP 382
Query: 300 DLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSG----VFG 355
DL F F + LP N+F + S V CL S + G G + G
Sbjct: 383 DLI----FQFKGGAKMALPLSNYFSIV-----DSGVICLTIVSDNVAGPGLGGGPAIILG 433
Query: 356 SFQQQNVEVVYDLEKERIGFQPMDCA 381
++QQ+N V +DLE E+ GF+ CA
Sbjct: 434 NYQQRNFYVEFDLENEKFGFKQQSCA 459
>gi|302822373|ref|XP_002992845.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
gi|300139393|gb|EFJ06135.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
Length = 510
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 132/398 (33%), Positives = 199/398 (50%), Gaps = 56/398 (14%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + MDTGSD++W+ C C DC L F+P SSS + CASS C N++
Sbjct: 151 VVLIMDTGSDVSWIQC----VPCKDC----VPALRPPFNPRHSSSFFKLPCASSTCTNVY 202
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSP--- 118
PF C+ SG TC F+ YG+G L +G+L +T + G++P
Sbjct: 203 QGVKPF--CSPSG--------RTCL-----FSIQYGDGSLSSGLLAMET--IAGNTPNFG 245
Query: 119 -GIIREIPKFCFGCVGSTYREPI-----GIAGFGRGALSVPSQLG-FLQKGFSHCFLAFK 171
G ++ GC RE + G+ G R +S PSQL + FSHCF K
Sbjct: 246 DGEPVKLSNITLGC-ADIDREGLPTGASGLLGMDRRPISFPSQLSSRYARKFSHCF-PDK 303
Query: 172 YANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYP----NYYYIGLEAITIGNSSLTEV 227
A+ N S + G+ I S L++TP++++P P +YYY+GL I++ S L
Sbjct: 304 IAH-LNSSGLVFFGESDIISP-YLRYTPLVQNPAVPSASLDYYYVGLVGISVDESRL--- 358
Query: 228 PLSLREFD---SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGF 284
PLS + FD G+GG ++DSGT +T+L +P + + + ++ + V++ +GF
Sbjct: 359 PLSHKNFDIDKVTGSGGTIIDSGTAFTYLKKPAFQAMRREFLARTSHLAK---VDDNSGF 415
Query: 285 DLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMD 344
CY + + PSIT HF + +VLP+ + +S+ S CL FQ
Sbjct: 416 TPCYNITSGTAALESTILPSITLHFRGGLDVVLPKNSILIPVSS-SEEQTTLCLAFQM-- 472
Query: 345 DGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCAS 382
GD P + G++QQQN+ V YDLEK R+G P CA+
Sbjct: 473 SGDI-PFNIIGNYQQQNLWVEYDLEKLRLGIAPAQCAT 509
>gi|18409620|ref|NP_566966.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|13430562|gb|AAK25903.1|AF360193_1 unknown protein [Arabidopsis thaliana]
gi|4886277|emb|CAB43423.1| putative protein [Arabidopsis thaliana]
gi|14532764|gb|AAK64083.1| unknown protein [Arabidopsis thaliana]
gi|15450892|gb|AAK96717.1| Unknown protein [Arabidopsis thaliana]
gi|30387567|gb|AAP31949.1| At3g52500 [Arabidopsis thaliana]
gi|332645431|gb|AEE78952.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 469
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 127/394 (32%), Positives = 186/394 (47%), Gaps = 42/394 (10%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCD-DYRNNKLMSNFSPSRSSSSSRDTCASSFCLNI 60
I DTGS L W+PC + + C CD + L+ F P SSSS C S C +
Sbjct: 103 IPFVFDTGSSLVWLPCTS-RYLCSGCDFSGLDPTLIPRFIPKNSSSSKIIGCQSPKCQFL 161
Query: 61 HSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
+ + GC +T C CP + YG G G+L + L P +
Sbjct: 162 YGPN-----VQCRGCDPNT---RNCTVGCPPYILQYGLGS-TAGVLITEKLDF----PDL 208
Query: 121 IREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISS 180
+P F GC + R+P GIAGFGRG +S+PSQ+ K FSHC ++ ++ +D N+++
Sbjct: 209 T--VPDFVVGCSIISTRQPAGIAGFGRGPVSLPSQMNL--KRFSHCLVSRRF-DDTNVTT 263
Query: 181 PLVI----GDVAISSKDNLQFTPMLKSPMYPN-----YYYIGLEAITIGNSSLTEVPLSL 231
L + G + S L +TP K+P N YYY+ L I +G + ++P
Sbjct: 264 DLDLDTGSGHNSGSKTPGLTYTPFRKNPNVSNKAFLEYYYLNLRRIYVGRKHV-KIPYKY 322
Query: 232 REFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVP 291
+ G+GG +VDSG+T+T + P + + S ++ Y R K++E+ TG C+ +
Sbjct: 323 LAPGTNGDGGSIVDSGSTFTFMERPVFELVAEEFASQMSNYTREKDLEKETGLGPCFNIS 382
Query: 292 CPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMD----DGD 347
D P + F F L LP N+F + N+ V CL S G
Sbjct: 383 GKG----DVTVPELIFEFKGGAKLELPLSNYFTFV---GNTDTV-CLTVVSDKTVNPSGG 434
Query: 348 YGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
GP+ + GSFQQQN V YDLE +R GF C+
Sbjct: 435 TGPAIILGSFQQQNYLVEYDLENDRFGFAKKKCS 468
>gi|356503843|ref|XP_003520712.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 474
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 122/393 (31%), Positives = 183/393 (46%), Gaps = 41/393 (10%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCD-DYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSD 64
+DTGS L W PC + + C C+ + + F P SS++ C + C I SD
Sbjct: 109 LDTGSSLVWFPCTS-RYLCSHCNFPNIDTTKIPTFIPKNSSTAKLLGCRNPKCGYIFGSD 167
Query: 65 NPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREI 124
F C C CP++ YG G G L D L G + +
Sbjct: 168 VQFR------CPQCKPESQNCSLTCPAYIIQYGLGS-TAGFLLLDNLNFPG------KTV 214
Query: 125 PKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSPLV- 183
P+F GC + R+P GIAGFGRG S+PSQ+ K FS+C ++ ++ + P SS LV
Sbjct: 215 PQFLVGCSILSIRQPSGIAGFGRGQESLPSQMNL--KRFSYCLVSHRFDDTPQ-SSDLVL 271
Query: 184 -IGDVAISSKDNLQFTPM-----LKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
I + + L +TP +P + YYY+ L + +G + ++P + E S
Sbjct: 272 QISSTGDTKTNGLSYTPFRSNPSTNNPAFKEYYYLTLRKVIVGGKDV-KIPYTFLEPGSD 330
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLL-SILQSTITYYPRAKEVEERTGFDLCYRVPCPNNT 296
GNGG +VDSG+T+T + P Y+ + ++ Y RA++ E ++G C+ + +
Sbjct: 331 GNGGTIVDSGSTFTFMERPVYNLVAQEFVKQLEKNYSRAEDAETQSGLSPCFNI----SG 386
Query: 297 FTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDG-----DYGPS 351
FP +TF F + P N+F + + V CL S D G GP+
Sbjct: 387 VKTVTFPELTFKFKGGAKMTQPLQNYFSLV----GDAEVVCLTVVS-DGGAGPPKTTGPA 441
Query: 352 GVFGSFQQQNVEVVYDLEKERIGFQPMDCASTA 384
+ G++QQQN + YDLE ER GF P C A
Sbjct: 442 IILGNYQQQNFYIEYDLENERFGFGPRSCRRKA 474
>gi|297819968|ref|XP_002877867.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297323705|gb|EFH54126.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 469
Score = 164 bits (415), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 127/394 (32%), Positives = 185/394 (46%), Gaps = 42/394 (10%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCD-DYRNNKLMSNFSPSRSSSSSRDTCASSFCLNI 60
I DTGS L W PC + + C DC+ + + F P SSSS C + C +
Sbjct: 103 IPFVFDTGSSLVWFPCTS-RYLCSDCNFSGLDPTQIPRFIPKNSSSSRVIGCQNPKCQFL 161
Query: 61 HSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
++ GC +T C PCP + YG G GIL + L P +
Sbjct: 162 FGAN-----VQCRGCDPNT---RNCTVPCPPYILQYGLGS-TAGILISEKLDF----PDL 208
Query: 121 IREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISS 180
+P F GC + R P GIAGFGRG S+PSQ+ K FSHC ++ ++ +D N+++
Sbjct: 209 T--VPDFVVGCSVISTRTPAGIAGFGRGPESLPSQMKL--KSFSHCLVSRRF-DDTNVTT 263
Query: 181 PLVI----GDVAISSKDNLQFTPMLKSPMYPN-----YYYIGLEAITIGNSSLTEVPLSL 231
L + G + S L +TP K+P N YYY+ L I +G S ++P
Sbjct: 264 DLGLDTGSGHKSGSKTPGLSYTPFRKNPNVSNTAFLEYYYLNLRRIYVG-SKHVKIPYKF 322
Query: 232 REFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVP 291
+ GNGG +VDSG+T+T + P + + + ++ Y R K++E+ +G C+ +
Sbjct: 323 LAPGTNGNGGSIVDSGSTFTFMERPVFELVAEEFATQMSNYTREKDLEKVSGIAPCFNIS 382
Query: 292 CPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDD----GD 347
D P + F F + LP N+F S N+ V CL S + G
Sbjct: 383 GKG----DVTVPELIFEFKGGAKMELPLSNYF---SFVGNADTV-CLTVVSDNTVNPGGG 434
Query: 348 YGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
GP+ + GSFQQQN V YDLE +R GF C+
Sbjct: 435 TGPAIILGSFQQQNYLVEYDLENDRFGFAKKKCS 468
>gi|302811785|ref|XP_002987581.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
gi|300144735|gb|EFJ11417.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
Length = 511
Score = 164 bits (415), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 131/398 (32%), Positives = 199/398 (50%), Gaps = 56/398 (14%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + MDTGSD++W+ C C DC L F+P SSS + CASS C N++
Sbjct: 152 VVLIMDTGSDVSWIQC----VPCKDC----VPALRPPFNPRHSSSFFKLPCASSTCTNVY 203
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSP--- 118
PF C+ SG TC F+ YG+G L +G+L +T + G++P
Sbjct: 204 QGVKPF--CSPSG--------RTCL-----FSIQYGDGSLSSGLLAMET--IAGNTPNFG 246
Query: 119 -GIIREIPKFCFGCVGSTYREPI-----GIAGFGRGALSVPSQLG-FLQKGFSHCFLAFK 171
G ++ GC RE + G+ G R +S PSQL + FSHCF K
Sbjct: 247 DGEPVKLSNITLGC-ADIDREGLPTGASGLLGMDRRPISFPSQLSSRYARKFSHCF-PDK 304
Query: 172 YANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYP----NYYYIGLEAITIGNSSLTEV 227
A+ N S + G+ I S L++TP++++P P +YYY+GL I++ S L
Sbjct: 305 IAH-LNSSGLVFFGESDIISP-YLRYTPLVQNPAVPSASLDYYYVGLVGISVDESRL--- 359
Query: 228 PLSLREFD---SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGF 284
PLS + FD G+GG ++DSGT +T+L +P + + + ++ + V++ +GF
Sbjct: 360 PLSHKNFDIDKVTGSGGTIIDSGTAFTYLKKPAFQAMRREFLARTSHLAK---VDDNSGF 416
Query: 285 DLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMD 344
CY + + PSIT HF + +VLP+ + +S+ S CL F +
Sbjct: 417 TPCYNITSGTAALESTILPSITLHFRGGLDVVLPKNSILIPVSS-SEEQTTLCLAF--LM 473
Query: 345 DGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCAS 382
GD P + G++QQQN+ V YDLEK R+G P CA+
Sbjct: 474 SGDI-PFNIIGNYQQQNLWVEYDLEKLRLGIAPAQCAT 510
>gi|357167693|ref|XP_003581287.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 468
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 128/388 (32%), Positives = 186/388 (47%), Gaps = 68/388 (17%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGSDL W C C++C N+ F PS SS+ S C+SS C SD
Sbjct: 135 VDTGSDLVWTQCK----PCVEC----FNQSTPVFDPSSSSTYSTLPCSSSLC-----SDL 181
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
P CT + + C + YTYG+ G+L +T + + ++P
Sbjct: 182 PTSTCT------------SAAKDC-GYTYTYGDASSTQGVLAAETFTLAKT------KLP 222
Query: 126 KFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSP 181
FGC G + + G+ G GRG LS+ SQLG + FS+C + D SP
Sbjct: 223 GVAFGCGDTNEGDGFTQGAGLVGLGRGPLSLVSQLGLGK--FSYCLTSL----DDTSKSP 276
Query: 182 LVIGDVAISSKDN-----LQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
L++G +A S D +Q TP++K+P P++YY+ L+A+T+G+ T +PL F
Sbjct: 277 LLLGSLAAISTDTASAAAIQTTPLIKNPSQPSFYYVTLKALTVGS---TRIPLPGSAFAV 333
Query: 237 Q--GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPN 294
Q G GG++VDSGT+ T+L Y L + + P A G DLC++ P
Sbjct: 334 QDDGTGGVIVDSGTSITYLELQGYRPLKKAFAAQMKL-PVADG--SAVGLDLCFKAPASG 390
Query: 295 NTFTDDL-FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGV 353
DD+ P + HF L LP N+ SA S CL G G S +
Sbjct: 391 ---VDDVEVPKLVLHFDGGADLDLPAENYMVLDSA----SGALCLTVM----GSRGLS-I 438
Query: 354 FGSFQQQNVEVVYDLEKERIGFQPMDCA 381
G+FQQQN++ VYD++K+ + F P+ CA
Sbjct: 439 IGNFQQQNIQFVYDVDKDTLSFAPVQCA 466
>gi|356537161|ref|XP_003537098.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 601
Score = 162 bits (411), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 117/386 (30%), Positives = 178/386 (46%), Gaps = 33/386 (8%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGS L W+PC + + C C+ + NN F P S SS C + C + SD
Sbjct: 233 LDTGSSLVWLPCYS-HYLCSKCNSFSNNN-TPKFIPKDSFSSKFVGCRNPKCAWVFGSDV 290
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
C ++ + S + C + CP++ YG G G L + L + +
Sbjct: 291 TSHCCKLAKAAFSN--NNNCSQTCPAYTVQYGLGS-TAGFLLSENLNFPA------KNVS 341
Query: 126 KFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSPLVIG 185
F GC + +P GIAGFGRG S+P+Q+ + FS+C L+ ++ P S ++
Sbjct: 342 DFLVGCSVVSVYQPGGIAGFGRGEESLPAQMNLTR--FSYCLLSHQFDESPENSDLVMEA 399
Query: 186 DVAISSK--DNLQFT-----PMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQG 238
+ K + + +T P K P + YYYI L I +G + VP + E D G
Sbjct: 400 TNSGEGKKTNGVSYTAFLKNPSTKKPAFGAYYYITLRKIVVGEKRV-RVPRRMLEPDVNG 458
Query: 239 NGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFT 298
+GG +VDSG+T T + P + + + Y RA+E+E++ G C+ + T +
Sbjct: 459 DGGFIVDSGSTLTFMERPIFDLVAEEFVKQVNY-TRARELEKQFGLSPCFVLAGGAETAS 517
Query: 299 DDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMD----DGDYGPSGVF 354
FP + F F + LP N+F + V CL S D G GP+ +
Sbjct: 518 ---FPEMRFEFRGGAKMRLPVANYFSRVG----KGDVACLTIVSDDVAGQGGAVGPAVIL 570
Query: 355 GSFQQQNVEVVYDLEKERIGFQPMDC 380
G++QQQN V DLE ER GF+ C
Sbjct: 571 GNYQQQNFYVECDLENERFGFRSQSC 596
>gi|255587337|ref|XP_002534234.1| pepsin A, putative [Ricinus communis]
gi|223525662|gb|EEF28148.1| pepsin A, putative [Ricinus communis]
Length = 468
Score = 161 bits (407), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 124/401 (30%), Positives = 188/401 (46%), Gaps = 50/401 (12%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKL--MSNFSPSRSSSSSRDTCASSFCLN 59
+++ MDTGS L W PC + + C C+ + N + + F P SSSS C + C
Sbjct: 97 VKLIMDTGSSLVWFPCTS-RYVCASCN-FPNTDITKIPKFMPRLSSSSKLIGCKNPKCAW 154
Query: 60 IHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPG 119
+ F S C C + CP + YG G G+L +T+
Sbjct: 155 V------FGSSVQSKCHNCNPQAQNCTQACPPYIIQYGLGS-TAGLLLSETINFPN---- 203
Query: 120 IIREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNIS 179
+ I F GC + R+P GIAGFGR S+P QLG K FS+C ++ ++ + P +S
Sbjct: 204 --KTISDFLAGCSLLSTRQPEGIAGFGRSQESLPLQLGL--KKFSYCLVSRRFDDSP-VS 258
Query: 180 SPLVIGDVAISSKDN----LQFTPMLKS------PMYPNYYYIGLEAITIGNSSLTEVPL 229
S L++ D+ S+ D+ L +TP K+ P + YYY+ L I +G + + +VP
Sbjct: 259 SDLIL-DMGPSTSDSKTTGLSYTPFQKNLASQSNPAFQEYYYVMLRKIIVGKTHV-KVPY 316
Query: 230 SLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYR 289
S S GNGG +VDSG+T+T + + L + + Y A V++ TG C+
Sbjct: 317 SFLVPGSDGNGGTIVDSGSTFTFVEGHVFELLAKEFEKQMANYTVATNVQKLTGLRPCFD 376
Query: 290 VPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMD----- 344
+ + + P +TF F + LP N+F + V CL S +
Sbjct: 377 ISGEKSV----VIPDLTFQFKGGAKMQLPLSNYFAFVDM-----GVVCLTIVSDNAAALG 427
Query: 345 -DGDY---GPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
DG GP+ + G+FQQQN + YDLE +R GF+ CA
Sbjct: 428 GDGGVRSSGPAIILGNFQQQNFYIEYDLENDRFGFKEQSCA 468
>gi|168064509|ref|XP_001784204.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664276|gb|EDQ51002.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 367
Score = 160 bits (406), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 122/383 (31%), Positives = 185/383 (48%), Gaps = 52/383 (13%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DTGSD+TW+ C C +C Y+ + F+PS SSS C+SS CLN+
Sbjct: 31 LVVDTGSDITWLQCA----PCTNC--YKQKDAL--FNPSSSSSFKVLDCSSSLCLNLD-- 80
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGS-SPG--I 120
+ GC L + C + YG+G G L D + + + PG +
Sbjct: 81 --------VMGC-----LSNKCL-----YQADYGDGSFTMGELVTDNVVLDDAFGPGQVV 122
Query: 121 IREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNIS 179
+ IP C T+ GI G GRG LS P+ L + FS+C +DPN
Sbjct: 123 LTNIPLGCGHDNEGTFGTAAGILGLGRGPLSFPNNLDASTRNIFSYCLP--DRESDPNHK 180
Query: 180 SPLVIGDVAI--SSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
S LV GD AI ++ +++F P L++P YYY+ + I++G + LT +P S+ + DS
Sbjct: 181 STLVFGDAAIPHTATGSVKFIPQLRNPRVATYYYVQITGISVGGNLLTNIPASVFQLDSH 240
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTF 297
GNGG + DSGTT T L Y+ + ++ + A + + FD CY N+
Sbjct: 241 GNGGTIFDSGTTITRLEARAYTAVRDAFRAATMHLTSAADFKI---FDTCYDFTGMNSIS 297
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSF 357
P++TFHF +V + LP N+ P +++ + C F + GPS V G+
Sbjct: 298 V----PTVTFHFQGDVDMRLPPSNYI----VPVSNNNIFCFAFAA----SMGPS-VIGNV 344
Query: 358 QQQNVEVVYDLEKERIGFQPMDC 380
QQQ+ V+YD ++IG P C
Sbjct: 345 QQQSFRVIYDNVHKQIGLLPDQC 367
>gi|356570895|ref|XP_003553619.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 470
Score = 160 bits (406), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 124/394 (31%), Positives = 182/394 (46%), Gaps = 43/394 (10%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCD-DYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSD 64
+DTGS L W PC + + C C+ + + F P SS++ C + C +
Sbjct: 105 LDTGSSLVWFPCTS-HYLCSHCNFPNIDPTKIPTFIPKNSSTAKLLGCRNPKCGYL---- 159
Query: 65 NPFDPCTMSGCSLSTLLKS-TCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
F P S C S C CPS+ YG G G L D L G +
Sbjct: 160 --FGPDVESRCPQCKKPGSQNCSLTCPSYIIQYGLGA-TAGFLLLDNLNFPG------KT 210
Query: 124 IPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSPLV 183
+P+F GC + R+P GIAGFGRG S+PSQ+ K FS+C ++ ++ + P SS LV
Sbjct: 211 VPQFLVGCSILSIRQPSGIAGFGRGQESLPSQMNL--KRFSYCLVSHRFDDTPQ-SSDLV 267
Query: 184 --IGDVAISSKDNLQFTPMLKSP----MYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
I + + L +TP +P ++ YYY+ L + +G + ++P E S
Sbjct: 268 LQISSTGDTKTNGLSYTPFRSNPSNNSVFREYYYVTLRKLIVGGVDV-KIPYKFLEPGSD 326
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLL-SILQSTITYYPRAKEVEERTGFDLCYRVPCPNNT 296
GNGG +VDSG+T+T + P Y+ + L+ Y R + VE ++G C+ + +
Sbjct: 327 GNGGTIVDSGSTFTFMERPVYNLVAQEFLRQLGKKYSREENVEAQSGLSPCFNI----SG 382
Query: 297 FTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDG------DYGP 350
FP TF F + P N+F S + L F + DG GP
Sbjct: 383 VKTISFPEFTFQFKGGAKMSQPLLNYF------SFVGDAEVLCFTVVSDGGAGQPKTAGP 436
Query: 351 SGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTA 384
+ + G++QQQN V YDLE ER GF P +C A
Sbjct: 437 AIILGNYQQQNFYVEYDLENERFGFGPRNCKRKA 470
>gi|16209647|gb|AAL14384.1| AT3g52500/F22O6_120 [Arabidopsis thaliana]
Length = 469
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 126/394 (31%), Positives = 186/394 (47%), Gaps = 42/394 (10%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCD-DYRNNKLMSNFSPSRSSSSSRDTCASSFCLNI 60
I DTGS L +PC + + C CD + L+ F P SSSS C S C +
Sbjct: 103 IPFVFDTGSSLVCLPCTS-RYLCSGCDFSGLDPTLIPRFIPKNSSSSKIIGCQSPKCQFL 161
Query: 61 HSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
+ + GC +T C CP + YG G G+L + L P +
Sbjct: 162 YGPN-----VQCRGCDPNT---RNCTVGCPPYILQYGLGS-TAGVLITEKLDF----PDL 208
Query: 121 IREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISS 180
+P F GC + R+P GIAGFGRG +S+PSQ+ K FSHC ++ ++ +D N+++
Sbjct: 209 T--VPDFVVGCSIISTRQPAGIAGFGRGPVSLPSQMNL--KRFSHCLVSRRF-DDTNVTT 263
Query: 181 PLVI----GDVAISSKDNLQFTPMLKSPMYPN-----YYYIGLEAITIGNSSLTEVPLSL 231
L + G + S L +TP K+P N YYY+ L I +G + ++P
Sbjct: 264 DLDLDTGSGHNSGSKTPGLTYTPFRKNPNVSNKAFLEYYYLNLRRIYVGRKHV-KIPYKY 322
Query: 232 REFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVP 291
+ G+GG +VDSG+T+T + P + + S ++ Y R K++E+ TG C+ +
Sbjct: 323 LAPGTNGDGGSIVDSGSTFTFMERPVFELVAEEFASQMSNYTREKDLEKETGLGPCFNI- 381
Query: 292 CPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMD----DGD 347
+ D P + F F L LP N+F + N+ V CL S G
Sbjct: 382 ---SGKGDVTVPELIFEFKGGAKLELPLSNYFTFV---GNTDTV-CLTVVSDKTVNPSGG 434
Query: 348 YGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
GP+ + GSFQQQN V YDLE +R GF C+
Sbjct: 435 TGPAIILGSFQQQNYLVEYDLENDRFGFAKKKCS 468
>gi|168059885|ref|XP_001781930.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666576|gb|EDQ53226.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 355
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 123/385 (31%), Positives = 181/385 (47%), Gaps = 51/385 (13%)
Query: 1 VIQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNI 60
V V +DTGSDLTWV C C C Y N S F P+ S+S ++ C + C +
Sbjct: 15 VFSVIVDTGSDLTWVQCS----PCGTC--YSQND--SLFIPNTSTSFTKLACGTELCNGL 66
Query: 61 HSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
P+ C + C + Y+YG+G L TG DT+ + G + G
Sbjct: 67 -----PYPMCNQTTCV---------------YWYSYGDGSLSTGDFVYDTITMDGIN-GQ 105
Query: 121 IREIPKFCFGCVG---STYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDP 176
+++P F FGC ++ GI G G+G LS PSQL + G FS+C + + P
Sbjct: 106 KQQVPNFAFGCGHDNEGSFAGADGILGLGQGPLSFPSQLKTVFNGKFSYCLV--DWLAPP 163
Query: 177 NISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
+SPL+ GD A+ + +++ +L +P P YYY+ L I++G L + + + DS
Sbjct: 164 TQTSPLLFGDAAVPTFPGVKYISLLTNPKVPTYYYVKLNGISVGG-KLLNISSTAFDIDS 222
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNT 296
G G + DSGTT T L + ++L+ + ++ YPR ++ +G DLC
Sbjct: 223 VGRAGTIFDSGTTVTQLAGEVHQEVLAAMNASTMDYPRKS--DDSSGLDLCLGGFAEGQL 280
Query: 297 FTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGS 356
T PS+TFHF + LP N+F + SS C S D + GS
Sbjct: 281 PT---VPSMTFHFEGG-DMELPPSNYFIFL----ESSQSYCFSMVSSPD-----VTIIGS 327
Query: 357 FQQQNVEVVYDLEKERIGFQPMDCA 381
QQQN +V YD +IGF P C
Sbjct: 328 IQQQNFQVYYDTVGRKIGFVPKSCV 352
>gi|357162717|ref|XP_003579500.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 488
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 129/400 (32%), Positives = 194/400 (48%), Gaps = 38/400 (9%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSN-FSPSRSSSSSRDTCASSFCLNI 60
+ V +DTGS L+WVPC + S+ C +C + F P SSSS C + C I
Sbjct: 104 LPVLLDTGSHLSWVPCTS-SYQCRNCSSSPSAMSAMAVFHPKNSSSSRLVGCRNPACRWI 162
Query: 61 HSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
HS C +G + + + CP + YG G +G+L DTL++ SS
Sbjct: 163 HSKSP--STCGSTGNNGNGDV-------CPPYLVVYGSGS-TSGLLISDTLRLSPSSSSS 212
Query: 121 IRE-IPKFCFGC-VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNI 178
F GC + S ++ P G+AGFGRGA SVPSQL + FS+C L+ ++ ++ +
Sbjct: 213 APAPFRNFAIGCSIVSVHQPPSGLAGFGRGAPSVPSQLKVPK--FSYCLLSRRFDDNSAV 270
Query: 179 SSPLVIGDVAISS---KDNLQFTPMLKS----PMYPNYYYIGLEAITIGNSSLTEVPLSL 231
S LV+GD + + K +Q+ P+L + P Y YYY+ L I++G + L
Sbjct: 271 SGELVLGDAMVPAGKKKTTMQYVPLLNNAASKPPYSVYYYLALTGISVGGKPVN---LPS 327
Query: 232 REFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTIT-YYPRAKEVEERTGFDLCYRV 290
R F GG ++DSGTT+T+L + + + ++S + Y R++ VE+ G C+ +
Sbjct: 328 RAFVPSSGGGAIIDSGTTFTYLDPTVFKPVAAAMESAVGGRYNRSRPVEDALGLRPCFAL 387
Query: 291 PCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNS---------SAVKCLLFQ 341
P P +L P + F + LP N+F A + V L
Sbjct: 388 P-PGPGGAMEL-PDLELKFKGGAVMRLPVENYFVAAGPAGGPAAGPVAICLAVVSDLPAS 445
Query: 342 SMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
D GP+ + GSFQQQN + YDL KER+GF+ CA
Sbjct: 446 GGDGAAAGPAIILGSFQQQNYHIEYDLGKERLGFRQQPCA 485
>gi|2245012|emb|CAB10432.1| hypothetical protein [Arabidopsis thaliana]
gi|7268406|emb|CAB78698.1| hypothetical protein [Arabidopsis thaliana]
Length = 1046
Score = 157 bits (397), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 133/430 (30%), Positives = 194/430 (45%), Gaps = 78/430 (18%)
Query: 5 YMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSD 64
Y+DTGSDL W PC F C+ C+ + S +++ S + + S HSS
Sbjct: 129 YLDTGSDLVWFPC--RPFTCILCESKPLPPSPPSSLSSSATTVSCSSPSCS---AAHSSL 183
Query: 65 NPFDPCTMSGCSLSTLLKSTC---CRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
D C +S C L + C PCP F Y YG+G LV + + S
Sbjct: 184 PSSDLCAISNCPLDFIETGDCNTSSYPCPPFYYAYGDGSLVAKLYSDSLSLPSVSVS--- 240
Query: 122 REIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGF----LQKGFSHCFLAFKYANDP- 176
F FGC +T EPIG+AGFGRG LS+P+QL L FS+C ++ + +D
Sbjct: 241 ----NFTFGCAHTTLAEPIGVAGFGRGRLSLPAQLAVHSPHLGNSFSYCLVSHSFDSDRV 296
Query: 177 NISSPLVIGDVAISSK-------------------DNLQFTPMLKSPMYPNYYYIGLEAI 217
SPL++G + + FT ML++P +P +Y + L+ I
Sbjct: 297 RRPSPLILGRFVDKKEKRVGTTDDHDDGDDEKKKKNEFVFTEMLENPKHPYFYSVSLQGI 356
Query: 218 TIGNSSLTEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKE 277
+IG ++ P LR D G GG++VDSGTT+T LP FY+ ++ S R
Sbjct: 357 SIGKRNI-PAPAMLRRIDKNGGGGVVVDSGTTFTMLPAKFYNSVVEEFDS------RVGR 409
Query: 278 VEERTGFDLCYRVPCPNNTFTDDLFPS--ITFHFL-NNVSLVLPQGNHFYAM----SAPS 330
V ER D + PS + HF N S+ LP+ N+FY
Sbjct: 410 VHER----------------ADRVEPSSALVLHFAGNRSSVTLPRRNYFYEFMDGGDGKE 453
Query: 331 NSSAVKCLLFQSMDDGDY-----GPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTAS 385
+ CL+ M+ GD G + G++QQQ EVVYDL R+GF + + S
Sbjct: 454 EKRKIGCLML--MNGGDESELRGGTGAILGNYQQQGFEVVYDLLNRRVGFAKRNLLAIQS 511
Query: 386 AQ--GLHKKK 393
++ L+++K
Sbjct: 512 SRIPKLYRRK 521
>gi|224099307|ref|XP_002311432.1| predicted protein [Populus trichocarpa]
gi|222851252|gb|EEE88799.1| predicted protein [Populus trichocarpa]
Length = 458
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 125/395 (31%), Positives = 182/395 (46%), Gaps = 47/395 (11%)
Query: 1 VIQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNI 60
+ + DTGSDLTWV C +C + S F S++ S C SS C +
Sbjct: 95 TLLLVADTGSDLTWVRCSACKTNC------SIHPPGSTFLARHSTTFSPTHCFSSLC-QL 147
Query: 61 HSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
NP +PC T L STC + Y Y +G +G +++T ++ SS G
Sbjct: 148 VPQPNP-NPCN------HTRLHSTC-----RYEYVYSDGSKTSGFFSKETTTLNTSS-GR 194
Query: 121 IREIPKFCFGC---------VGSTYREPIGIAGFGRGALSVPSQLGF-LQKGFSHCFLAF 170
++ FGC +GS++ G+ G GRG +S SQLG + FS+C L
Sbjct: 195 EMKLKSIAFGCGFHASGPSLIGSSFNGASGVMGLGRGPISFASQLGRRFGRSFSYCLL-- 252
Query: 171 KYANDPNISSPLVIGDVAISSKDN---LQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEV 227
Y P +S L+IGDV + KDN + FTP+L +P P +YYI ++ + + L
Sbjct: 253 DYTLSPPPTSYLMIGDVVSTKKDNKSMMSFTPLLINPEAPTFYYISIKGVFVDGVKLHID 312
Query: 228 PLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITY-YPRAKEVEERTGFDL 286
P S+ D GNGG ++DSGTT T L EP Y ++LS + + P R+GFDL
Sbjct: 313 P-SVWSLDELGNGGTVIDSGTTLTFLTEPAYREILSAFKREVKLPSPTPGGASTRSGFDL 371
Query: 287 CYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDG 346
C V + FP ++ P N+F + S +KCL Q + +
Sbjct: 372 CVNV----TGVSRPRFPRLSLELGGESLYSPPPRNYFIDI-----SEGIKCLAIQPV-EA 421
Query: 347 DYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
+ G V G+ QQ + +D K R+GF CA
Sbjct: 422 ESGRFSVIGNLMQQGFLLEFDRGKSRLGFSRRGCA 456
>gi|225438315|ref|XP_002272802.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 436
Score = 156 bits (394), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 127/383 (33%), Positives = 184/383 (48%), Gaps = 71/383 (18%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
MDTGSDL W C C DC D + F P +SSS S+ C+S C +
Sbjct: 114 MDTGSDLIWTQCK----PCKDCFD----QPTPIFDPKKSSSFSKLPCSSDLCAAL----- 160
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
P C+ GC + Y+YG+ G+L +T +S +
Sbjct: 161 PISSCS-DGCE---------------YLYSYGDYSSTQGVLATETFAFGDAS------VS 198
Query: 126 KFCFGCV----GSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSP 181
K FGC GS + + G+ G GRG LS+ SQLG + FS+C + +D S
Sbjct: 199 KIGFGCGEDNDGSGFSQGAGLVGLGRGPLSLISQLG--EPKFSYCLTSM---DDSKGISS 253
Query: 182 LVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ--GN 239
L++G A + N TP++++P P++YY+ LE I++G+ T +P+ F Q G+
Sbjct: 254 LLVGSEA--TMKNAITTPLIQNPSQPSFYYLSLEGISVGD---TLLPIEKSTFSIQNDGS 308
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEER--TGFDLCYRVPCPNNTF 297
GGL++DSGTT T+L + ++ L S + +V+E TG DLC+ +P P+ +
Sbjct: 309 GGLIIDSGTTITYLEDSAFAALKKEFISQLKL-----DVDESGSTGLDLCFTLP-PDAST 362
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSF 357
D P + FHF L LP N+ A S V CL G +FG+F
Sbjct: 363 VD--VPQLVFHF-EGADLKLPAENYIIADSGL----GVICLTM-----GSSSGMSIFGNF 410
Query: 358 QQQNVEVVYDLEKERIGFQPMDC 380
QQQN+ V++DLEKE I F P C
Sbjct: 411 QQQNIVVLHDLEKETISFAPAQC 433
>gi|225440731|ref|XP_002280866.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 469
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 122/393 (31%), Positives = 180/393 (45%), Gaps = 41/393 (10%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCD-DYRNNKLMSNFSPSRSSSSSRDTCASSFCLNI 60
+ MDTGS L W PC + + C C + + F P SSS+ C + C +
Sbjct: 103 LSFVMDTGSSLVWFPCTS-RYVCTRCSFPNIDPAKIPTFIPKLSSSAKIVGCLNPKCGFV 161
Query: 61 HSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
S+ + C + C + CP++A YG G V +L +
Sbjct: 162 MDSE------VRTRCPGCDQNSANCTKACPTYAIQYGLGTTVGLLLLESLVFAE------ 209
Query: 121 IREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISS 180
R P F GC + R+P GIAGFGRG S+P Q+G K FS+C L+ ++ + P S
Sbjct: 210 -RTEPDFVVGCSILSSRQPSGIAGFGRGPSSLPKQMGL--KKFSYCLLSHRFDDSPKSSK 266
Query: 181 PLVIGDVAISSKDN----LQFTPMLKSPMYPN-----YYYIGLEAITIGNSSLTEVPLSL 231
+ V SKD+ L +TP K+P+ N YYY+ L I +G+ + +VP S
Sbjct: 267 MTLY--VGPDSKDDKTGGLSYTPFRKNPVSSNSAFKEYYYVTLRHIIVGDKRV-KVPYSF 323
Query: 232 REFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVP 291
S GNGG +VDSG+T+T + +P + + + + Y RA +VE +G C+ +
Sbjct: 324 MVAGSDGNGGTIVDSGSTFTFMEKPVFEAVATEFDRQMANYTRAADVEALSGLKPCFNLS 383
Query: 292 CPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDG----D 347
+ PS+ F F + LP N+F + S V CL S +
Sbjct: 384 GVGSV----ALPSLVFQFKGGAKMELPVANYFSLVGDLS----VLCLTIVSNEAVGSTLS 435
Query: 348 YGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
GPS + G++Q QN YDLE ER GF+ C
Sbjct: 436 SGPSIILGNYQSQNFYTEYDLENERFGFRRQRC 468
>gi|449523529|ref|XP_004168776.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 461
Score = 153 bits (387), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 120/385 (31%), Positives = 178/385 (46%), Gaps = 60/385 (15%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
MDTGSDL W C C C D F P +SSS + +C+S C +
Sbjct: 128 MDTGSDLIWTQCK----PCQQCFDQST----PIFDPKQSSSFYKISCSSELCGAL----- 174
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
P C+ GC + YTYG+ G+L +T S+ I IP
Sbjct: 175 PTSTCSSDGCE---------------YLYTYGDSSSTQGVLAFETFTFGDSTEDQI-SIP 218
Query: 126 KFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSP 181
FGC G + + G+ G GRG LS+ SQL ++ F++C A D + S
Sbjct: 219 GLGFGCGNDNNGDGFSQGAGLVGLGRGPLSLVSQLK--EQKFAYCLTAI----DDSKPSS 272
Query: 182 LVIGDVA----ISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
L++G +A +SKD ++ TP++K+P P++YY+ L+ I++G + L+ +P S E
Sbjct: 273 LLLGSLANITPKTSKDEMKTTPLIKNPSQPSFYYLSLQGISVGGTQLS-IPKSTFELHDD 331
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTF 297
G+GG+++DSGTT T++ S S+ I + G DLC+ +P N
Sbjct: 332 GSGGVIIDSGTTITYVEN---SAFTSLKNEFIAQMNLPVDDSGTGGLDLCFNLPAGTNQV 388
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSF 357
P +TFHF L LP N+ S + + CL G +FG+
Sbjct: 389 E---VPKLTFHF-KGADLELPGENYMIGDS----KAGLLCLAI-----GSSRGMSIFGNL 435
Query: 358 QQQNVEVVYDLEKERIGFQPMDCAS 382
QQQN VV+DL++E + F P C S
Sbjct: 436 QQQNFMVVHDLQEETLSFLPTQCDS 460
>gi|449453902|ref|XP_004144695.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-1-like, partial [Cucumis sativus]
Length = 716
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 120/385 (31%), Positives = 179/385 (46%), Gaps = 60/385 (15%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
MDTGSDL W C C C D + F P +SSS + +C+S C +
Sbjct: 383 MDTGSDLIWTQCK----PCQQCFD----QSTPIFDPKQSSSFYKISCSSELCGAL----- 429
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
P C+ GC + YTYG+ G+L +T S+ I IP
Sbjct: 430 PTSTCSSDGCE---------------YLYTYGDSSSTQGVLAFETFTFGDSTEDQI-SIP 473
Query: 126 KFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSP 181
FGC G + + G+ G GRG LS+ SQL ++ F++C A D + S
Sbjct: 474 GLGFGCGNDNNGDGFSQGAGLVGLGRGPLSLVSQLK--EQKFAYCLTAI----DDSKPSS 527
Query: 182 LVIGDVA----ISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
L++G +A +SKD ++ TP++K+P P++YY+ L+ I++G + L+ +P S E
Sbjct: 528 LLLGSLANITPKTSKDEMKTTPLIKNPSQPSFYYLSLQGISVGGTQLS-IPKSTFELHDD 586
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTF 297
G+GG+++DSGTT T++ S S+ I + G DLC+ +P N
Sbjct: 587 GSGGVIIDSGTTITYVEN---SAFTSLKNEFIAQMNLPVDDSGTGGLDLCFNLPAGTNQV 643
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSF 357
P +TFHF L LP N+ S + + CL G +FG+
Sbjct: 644 E---VPKLTFHF-KGADLELPGENYMIGDS----KAGLLCLAI-----GSSRGMSIFGNL 690
Query: 358 QQQNVEVVYDLEKERIGFQPMDCAS 382
QQQN VV+DL++E + F P C S
Sbjct: 691 QQQNFMVVHDLQEETLSFLPTQCDS 715
>gi|147789749|emb|CAN67405.1| hypothetical protein VITISV_025616 [Vitis vinifera]
Length = 609
Score = 151 bits (381), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 121/389 (31%), Positives = 178/389 (45%), Gaps = 41/389 (10%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCD-DYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSD 64
MDTGS L W PC + + C C + + F P SSS+ C + C + S+
Sbjct: 107 MDTGSSLVWFPCTS-RYVCTRCSFPNIDPAKIPTFIPKLSSSAKIVGCLNPKCGFVMDSE 165
Query: 65 NPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREI 124
+ C + C + CP++A YG G V +L + R
Sbjct: 166 ------VRTRCPGCDQNSANCTKACPTYAIQYGLGTTVGLLLLESLVFAE-------RTE 212
Query: 125 PKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSPLVI 184
P F GC + R+P GIAGFGRG S+P Q+G K FS+C L+ ++ + P S +
Sbjct: 213 PDFVVGCSILSSRQPSGIAGFGRGPSSLPKQMGL--KKFSYCLLSHRFDDSPKSSKMTLY 270
Query: 185 GDVAISSKDN----LQFTPMLKSPMYPN-----YYYIGLEAITIGNSSLTEVPLSLREFD 235
V SKD+ L +TP K+P+ N YYY+ L I +G+ + + P S
Sbjct: 271 --VGPDSKDDKTGGLSYTPFRKNPVSSNSAFKEYYYVTLRHIIVGDKRV-KXPYSFMVAG 327
Query: 236 SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNN 295
S GNGG +VDSG+T+T + +P + + + + Y RA +VE +G C+ + +
Sbjct: 328 SDGNGGTIVDSGSTFTFMEKPVFEAVATEFDRQMANYTRAADVEALSGLKPCFNLSGVGS 387
Query: 296 TFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDG----DYGPS 351
PS+ F F + LP N+F + S V CL S + GPS
Sbjct: 388 V----ALPSLVFQFKGGAKMELPVANYFSLVGDLS----VLCLTIVSNEAVGSTLSSGPS 439
Query: 352 GVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
+ G++Q QN YDLE ER GF+ C
Sbjct: 440 IILGNYQSQNFYTEYDLENERFGFRRQRC 468
>gi|115458644|ref|NP_001052922.1| Os04g0448300 [Oryza sativa Japonica Group]
gi|113564493|dbj|BAF14836.1| Os04g0448300 [Oryza sativa Japonica Group]
gi|215766465|dbj|BAG98773.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767943|dbj|BAH00172.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 454
Score = 150 bits (380), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 120/384 (31%), Positives = 183/384 (47%), Gaps = 63/384 (16%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGSDL W C C+DC ++ + + F PS SS+ + C+S+ C SD
Sbjct: 122 VDTGSDLVWTQCK----PCVDC--FKQSTPV--FDPSSSSTYATVPCSSASC-----SDL 168
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
P CT + S C + YTYG+ G+L +T + S ++P
Sbjct: 169 PTSKCTSA---------SKC-----GYTYTYGDSSSTQGVLATETFTLAKS------KLP 208
Query: 126 KFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSP 181
FGC G + + G+ G GRG LS+ SQLG + FS+C + N+ SP
Sbjct: 209 GVVFGCGDTNEGDGFSQGAGLVGLGRGPLSLVSQLGLDK--FSYCLTSLDDTNN----SP 262
Query: 182 LVIGDVA-----ISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
L++G +A ++ ++Q TP++K+P P++YY+ L+AIT+G++ ++ +P S
Sbjct: 263 LLLGSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRIS-LPSSAFAVQD 321
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNT 296
G GG++VDSGT+ T+L Y L + + P A G DLC+R P
Sbjct: 322 DGTGGVIVDSGTSITYLEVQGYRALKKAFAAQMA-LPAAD--GSGVGLDLCFRAPAKGVD 378
Query: 297 FTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGS 356
+ P + FHF L LP N+ S CL G G S + G+
Sbjct: 379 QVE--VPRLVFHFDGGADLDLPAENYMVL----DGGSGALCLTVM----GSRGLS-IIGN 427
Query: 357 FQQQNVEVVYDLEKERIGFQPMDC 380
FQQQN + VYD+ + + F P+ C
Sbjct: 428 FQQQNFQFVYDVGHDTLSFAPVQC 451
>gi|38344829|emb|CAD40873.2| OSJNBa0064H22.10 [Oryza sativa Japonica Group]
gi|116310063|emb|CAH67084.1| H0818E04.1 [Oryza sativa Indica Group]
gi|116310186|emb|CAH67198.1| OSIGBa0152K17.10 [Oryza sativa Indica Group]
Length = 444
Score = 150 bits (379), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 120/384 (31%), Positives = 183/384 (47%), Gaps = 63/384 (16%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGSDL W C C+DC ++ + + F PS SS+ + C+S+ C SD
Sbjct: 112 VDTGSDLVWTQCK----PCVDC--FKQSTPV--FDPSSSSTYATVPCSSASC-----SDL 158
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
P CT + S C + YTYG+ G+L +T + S ++P
Sbjct: 159 PTSKCTSA---------SKC-----GYTYTYGDSSSTQGVLATETFTLAKS------KLP 198
Query: 126 KFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSP 181
FGC G + + G+ G GRG LS+ SQLG + FS+C + N+ SP
Sbjct: 199 GVVFGCGDTNEGDGFSQGAGLVGLGRGPLSLVSQLGLDK--FSYCLTSLDDTNN----SP 252
Query: 182 LVIGDVA-----ISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
L++G +A ++ ++Q TP++K+P P++YY+ L+AIT+G++ ++ +P S
Sbjct: 253 LLLGSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRIS-LPSSAFAVQD 311
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNT 296
G GG++VDSGT+ T+L Y L + + P A G DLC+R P
Sbjct: 312 DGTGGVIVDSGTSITYLEVQGYRALKKAFAAQMA-LPAAD--GSGVGLDLCFRAPAKGVD 368
Query: 297 FTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGS 356
+ P + FHF L LP N+ S CL G G S + G+
Sbjct: 369 QVE--VPRLVFHFDGGADLDLPAENYMVL----DGGSGALCLTVM----GSRGLS-IIGN 417
Query: 357 FQQQNVEVVYDLEKERIGFQPMDC 380
FQQQN + VYD+ + + F P+ C
Sbjct: 418 FQQQNFQFVYDVGHDTLSFAPVQC 441
>gi|125590542|gb|EAZ30892.1| hypothetical protein OsJ_14967 [Oryza sativa Japonica Group]
Length = 516
Score = 150 bits (378), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 120/384 (31%), Positives = 183/384 (47%), Gaps = 63/384 (16%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGSDL W C C+DC ++ + + F PS SS+ + C+S+ C SD
Sbjct: 184 VDTGSDLVWTQCK----PCVDC--FKQSTPV--FDPSSSSTYATVPCSSASC-----SDL 230
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
P CT + S C + YTYG+ G+L +T + S ++P
Sbjct: 231 PTSKCTSA---------SKC-----GYTYTYGDSSSTQGVLATETFTLAKS------KLP 270
Query: 126 KFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSP 181
FGC G + + G+ G GRG LS+ SQLG + FS+C + N+ SP
Sbjct: 271 GVVFGCGDTNEGDGFSQGAGLVGLGRGPLSLVSQLGLDK--FSYCLTSLDDTNN----SP 324
Query: 182 LVIGDVA-----ISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
L++G +A ++ ++Q TP++K+P P++YY+ L+AIT+G++ ++ +P S
Sbjct: 325 LLLGSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRIS-LPSSAFAVQD 383
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNT 296
G GG++VDSGT+ T+L Y L + + P A G DLC+R P
Sbjct: 384 DGTGGVIVDSGTSITYLEVQGYRALKKAFAAQMA-LPAAD--GSGVGLDLCFRAPAKGVD 440
Query: 297 FTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGS 356
+ P + FHF L LP N+ S CL G G S + G+
Sbjct: 441 QVE--VPRLVFHFDGGADLDLPAENYMVL----DGGSGALCLTVM----GSRGLS-IIGN 489
Query: 357 FQQQNVEVVYDLEKERIGFQPMDC 380
FQQQN + VYD+ + + F P+ C
Sbjct: 490 FQQQNFQFVYDVGHDTLSFAPVQC 513
>gi|359473000|ref|XP_002278677.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 458
Score = 150 bits (378), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 124/402 (30%), Positives = 186/402 (46%), Gaps = 69/402 (17%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + DTGSDL WV C C +C +R T S+F L H
Sbjct: 102 LLLVADTGSDLVWVKCSA----CRNC--------------------TRHTPGSAF-LARH 136
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCR------PCPSFAYTYGEGGLVTGILTRDTLKVHG 115
S+ + C S C L L K C PC + Y+YG+G +G +++T ++
Sbjct: 137 STTFSPNHCYDSACQLVPLPKHHRCNHARLHSPC-RYEYSYGDGSKTSGFFSKETTTLNT 195
Query: 116 SSPGIIREIPKFCFGCV---------GSTYREPIGIAGFGRGALSVPSQLGF-LQKGFSH 165
SS G ++ FGC G+++ G+ G GRG +S+ SQLG FS+
Sbjct: 196 SS-GREAKLKGIAFGCAFRISGPSVSGASFNGAHGVMGLGRGPISLSSQLGHRFGNKFSY 254
Query: 166 CFLAFKYANDPNISSPLVIG----DVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGN 221
C + + P+ +S L+IG DVA K ++FTP+ +P+ P +YYIG+E++++
Sbjct: 255 CLM--DHDISPSPTSYLLIGSTQNDVA-PGKRRMRFTPLHINPLSPTFYYIGIESVSVDG 311
Query: 222 SSLTEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEER 281
L P S+ D GNGG +VDSGTT T LPEP Y Q+L++++ + E
Sbjct: 312 IKLPINP-SVWALDELGNGGTIVDSGTTLTFLPEPAYLQILTVIKRRVRL---PSPAEPT 367
Query: 282 TGFDLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQ 341
GFDLC V + P ++F + P N+F VKCL Q
Sbjct: 368 PGFDLCVNV----SEIEHPRLPKLSFKLGGDSVFSPPPRNYFV-----DTDEDVKCLALQ 418
Query: 342 SMDDGDYGPSG--VFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
++ PSG V G+ QQ + +D ++ R+GF CA
Sbjct: 419 AV----MTPSGFSVIGNLMQQGFLLEFDKDRTRLGFSRHGCA 456
>gi|125548488|gb|EAY94310.1| hypothetical protein OsI_16079 [Oryza sativa Indica Group]
Length = 423
Score = 149 bits (377), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 120/384 (31%), Positives = 183/384 (47%), Gaps = 63/384 (16%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGSDL W C C+DC ++ + + F PS SS+ + C+S+ C SD
Sbjct: 91 VDTGSDLVWTQCK----PCVDC--FKQSTPV--FDPSSSSTYATVPCSSASC-----SDL 137
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
P CT + S C + YTYG+ G+L +T + S ++P
Sbjct: 138 PTSKCTSA---------SKC-----GYTYTYGDSSSTQGVLATETFTLAKS------KLP 177
Query: 126 KFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSP 181
FGC G + + G+ G GRG LS+ SQLG + FS+C + N+ SP
Sbjct: 178 GVVFGCGDTNEGDGFSQGAGLVGLGRGPLSLVSQLGLDK--FSYCLTSLDDTNN----SP 231
Query: 182 LVIGDVA-----ISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
L++G +A ++ ++Q TP++K+P P++YY+ L+AIT+G++ ++ +P S
Sbjct: 232 LLLGSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRIS-LPSSAFAVQD 290
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNT 296
G GG++VDSGT+ T+L Y L + + P A G DLC+R P
Sbjct: 291 DGTGGVIVDSGTSITYLEVQGYRALKKAFAAQMA-LPAAD--GSGVGLDLCFRAPAKGVD 347
Query: 297 FTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGS 356
+ P + FHF L LP N+ S CL G G S + G+
Sbjct: 348 QVE--VPRLVFHFDGGADLDLPAENYMVL----DGGSGALCLTVM----GSRGLS-IIGN 396
Query: 357 FQQQNVEVVYDLEKERIGFQPMDC 380
FQQQN + VYD+ + + F P+ C
Sbjct: 397 FQQQNFQFVYDVGHDTLSFAPVQC 420
>gi|225440722|ref|XP_002275223.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
gi|147841923|emb|CAN65212.1| hypothetical protein VITISV_039022 [Vitis vinifera]
Length = 458
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 117/390 (30%), Positives = 174/390 (44%), Gaps = 42/390 (10%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ +DTGS + W PC + C +C + N K + F+P SSS C C N
Sbjct: 100 LSFLVDTGSHVVWAPC-TTHYTCTNCS-FSNPKKVPIFNPELSSSDKILGCRDPKCANTS 157
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
S D GC C CP + YG G +G + L G
Sbjct: 158 SPD------VHLGCPRCNGNSKKCSHACPQYTLQYGTGA-ASGFFLLENLDFPG------ 204
Query: 122 REIPKFCFGCVGSTYREPI--GIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNIS 179
+ I KF GC S REP +AGFGR S+P Q+G K F++C + Y + N
Sbjct: 205 KTIHKFLVGCTTSADREPSSDALAGFGRTMFSLPMQMGV--KKFAYCLNSHDYDDTRN-- 260
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPM-YPNYYYIGLEAITIGNSSLTEVPLSLREFDSQG 238
S +I D + L + P LK+P YP YYY+G++ + IGN L +P S
Sbjct: 261 SGKLILDYSDGETQGLSYAPFLKNPPDYPFYYYLGVKDMKIGNK-LLRIPGKYLTPGSDS 319
Query: 239 NGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFT 298
GG+++DSG Y ++ P + + + L+ ++ Y R+ E E ++G PC N T
Sbjct: 320 RGGVMIDSGFAYGYMTLPVFKIVTNELKKQMSKYRRSLEAETQSGL-----TPCYNFTGH 374
Query: 299 DDL-FPSITFHFLNNVSLVLPQGNHFYAMSAPS-------NSSAVKCLLFQSMDDGDYGP 350
+ P + + F ++V+P N+F S S S L F GP
Sbjct: 375 KSIKIPDLIYQFTGGANMVVPGMNYFLLFSEASLGCFPVTTDSPTNNLEFTP------GP 428
Query: 351 SGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
S + G++QQ + V +DL+ ER+GF+ C
Sbjct: 429 SIILGNYQQVDHYVEFDLKNERLGFRQQTC 458
>gi|302768196|ref|XP_002967518.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
gi|300165509|gb|EFJ32117.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
Length = 398
Score = 148 bits (374), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 118/393 (30%), Positives = 185/393 (47%), Gaps = 57/393 (14%)
Query: 1 VIQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNI 60
V V DTGSDL W+ C C C + ++ F P SSS + +C + C
Sbjct: 52 VFSVIADTGSDLIWIQCK----PCQACFNQKD----PIFDPEGSSSYTTMSCGDTLC--- 100
Query: 61 HSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
+L + +C C ++Y YG+G G L+ +T+ + S+ G
Sbjct: 101 -----------------DSLPRKSCSPDC-DYSYGYGDGSGTRGTLSSETVTLT-STQGE 141
Query: 121 IREIPKFCFGCVG---STYREPIGIAGFGRGALSVPSQLG-FLQKGFSHCFLAFKYANDP 176
FGC ++ + G+ G GRG LS SQLG FS+C + ++ A P
Sbjct: 142 KLAAKNIAFGCGHLNRGSFNDASGLVGLGRGNLSFVSQLGDLFGHKFSYCLVPWRDA--P 199
Query: 177 NISSPLVIGDVAIS----SKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLR 232
+ +SP+ GD + S K + FTPM+ +P ++YY+ L+ I+I +L +P
Sbjct: 200 SKTSPMFFGDESSSHSSGKKLHYAFTPMIHNPAMESFYYVKLKDISIAGRAL-RIPAGSF 258
Query: 233 EFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPC 292
+ G+GG++ DSGTT T LP+ Y +L L+S I++ K G DLCY V
Sbjct: 259 DIKPDGSGGMIFDSGTTLTLLPDAPYQIVLRALRSKISF---PKIDGSSAGLDLCYDVSG 315
Query: 293 PNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQS--MDDGDYGP 350
++ + P++ FHF LP N+F A +++ + CL S MD
Sbjct: 316 SKASYKMKI-PAMVFHF-EGADYQLPVENYFIAA---NDAGTIVCLAMVSSNMD------ 364
Query: 351 SGVFGSFQQQNVEVVYDLEKERIGFQPMDCAST 383
G++G+ QQN V+YD+ +IG+ P C S+
Sbjct: 365 IGIYGNMMQQNFRVMYDIGSSKIGWAPSQCDSS 397
>gi|225437854|ref|XP_002264056.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
Length = 436
Score = 148 bits (374), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 122/381 (32%), Positives = 181/381 (47%), Gaps = 67/381 (17%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
MDTGSDL W C C C D + F P +SSS S+ C+S C+ +
Sbjct: 114 MDTGSDLIWTQCK----PCKVCFD----QPTPIFDPEKSSSFSKLPCSSDLCVAL----- 160
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
P C+ GC + Y+YG+ G+L +T +S +
Sbjct: 161 PISSCS-DGCE---------------YRYSYGDHSSTQGVLATETFTFGDAS------VS 198
Query: 126 KFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSP 181
K FGC G Y + G+ G GRG LS+ SQLG + FS+C + + IS+
Sbjct: 199 KIGFGCGEDNRGRAYSQGAGLVGLGRGPLSLISQLGVPK--FSYCLTSID--DSKGISTL 254
Query: 182 LVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ--GN 239
LV + + S TP++++P P++YY+ LE I++G+ T +P+ F Q G+
Sbjct: 255 LVGSEATVKSAIP---TPLIQNPSRPSFYYLSLEGISVGD---TLLPIEKSTFSIQDDGS 308
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTD 299
GGL++DSGTT T+L + ++ L + I+ + T +LC+ +P P+ + D
Sbjct: 309 GGLIIDSGTTITYLKDSAFAALK---KEFISQMKLDVDASGSTELELCFTLP-PDGSPVD 364
Query: 300 DLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQ 359
P + FHF V L LP+ N+ SA V CL G +FG+FQQ
Sbjct: 365 --VPQLVFHF-EGVDLKLPKENYIIEDSALR----VICLTM-----GSSSGMSIFGNFQQ 412
Query: 360 QNVEVVYDLEKERIGFQPMDC 380
QN+ V++DLEKE I F P C
Sbjct: 413 QNIVVLHDLEKETISFAPAQC 433
>gi|302753526|ref|XP_002960187.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
gi|300171126|gb|EFJ37726.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
Length = 398
Score = 148 bits (374), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 117/394 (29%), Positives = 185/394 (46%), Gaps = 59/394 (14%)
Query: 1 VIQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNI 60
V V DTGSDL W+ C C C + ++ F P SSS + +C + C +
Sbjct: 52 VFSVIADTGSDLIWIQCK----PCQACFNQKD----PIFDPEGSSSYTTMSCGDTLCDS- 102
Query: 61 HSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
L C P ++Y YG+G G L+ +T+ + S+ G
Sbjct: 103 --------------------LPRKSCSPNCDYSYGYGDGSGTRGTLSSETVTLT-STQGE 141
Query: 121 IREIPKFCFGCVG---STYREPIGIAGFGRGALSVPSQLG-FLQKGFSHCFLAFKYANDP 176
FGC ++ + G+ G GRG LS SQLG FS+C + ++ A P
Sbjct: 142 KLAAKNIAFGCGHLNRGSFNDASGLVGLGRGNLSFVSQLGDLFGHKFSYCLVPWRDA--P 199
Query: 177 NISSPLVIGDVAIS----SKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLR 232
+ +SP+ GD + S K + FTPM+ +P ++YY+ L+ I+I +L +P
Sbjct: 200 SKTSPMFFGDESSSHSSGKKLHYAFTPMIHNPAMESFYYVKLKDISIAGRAL-RIPAGSF 258
Query: 233 EFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVE-ERTGFDLCYRVP 291
+ G+GG++ DSGTT T LP+ Y +L L+S +++ E++ G DLCY V
Sbjct: 259 DIKPDGSGGMIFDSGTTLTLLPDAPYQIVLRALRSKVSF----PEIDGSSAGLDLCYDVS 314
Query: 292 CPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQS--MDDGDYG 349
++ + P++ FHF LP N+F A +++ + CL S MD
Sbjct: 315 GSKASYKKKI-PAMVFHF-EGADHQLPVENYFIAA---NDAGTIVCLAMVSSNMD----- 364
Query: 350 PSGVFGSFQQQNVEVVYDLEKERIGFQPMDCAST 383
G++G+ QQN V+YD+ +IG+ P C S+
Sbjct: 365 -IGIYGNMMQQNFRVMYDIGSSKIGWAPSQCDSS 397
>gi|326494754|dbj|BAJ94496.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326514480|dbj|BAJ96227.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 449
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 118/385 (30%), Positives = 182/385 (47%), Gaps = 65/385 (16%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGSDL W C C++C N+ F PS SS+ + C+S+ C SD
Sbjct: 119 IDTGSDLVWTQCK----PCVEC----FNQSTPVFDPSSSSTYAALPCSSTLC-----SDL 165
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
P CT + C + YTYG+ G+L +T + + ++P
Sbjct: 166 PSSKCTSAKCG---------------YTYTYGDSSSTQGVLAAETFTLAKT------KLP 204
Query: 126 KFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSP 181
FGC G + + G+ G GRG LS+ SQLG FS+C + D SP
Sbjct: 205 DVAFGCGDTNEGDGFTQGAGLVGLGRGPLSLVSQLGL--NKFSYCLTSL----DDTSKSP 258
Query: 182 LVIGDVAI-----SSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
L++G +A ++ ++Q TP++++P P++YY+ L+ +T+G++ +T +P S
Sbjct: 259 LLLGSLATISESAAAASSVQTTPLIRNPSQPSFYYVNLKGLTVGSTHIT-LPSSAFAVQD 317
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNT 296
G GG++VDSGT+ T+L Y L + + P A G D C+ P
Sbjct: 318 DGTGGVIVDSGTSITYLELQGYRALKKAFAAQMK-LPAAD--GSGIGLDTCFEAPASGVD 374
Query: 297 FTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGS 356
+ P + FH L+ L LP N+ M S S A+ CL G G S + G+
Sbjct: 375 QVE--VPKLVFH-LDGADLDLPAENY---MVLDSGSGAL-CLTVM----GSRGLS-IIGN 422
Query: 357 FQQQNVEVVYDLEKERIGFQPMDCA 381
FQQQN++ VYD+ + + F P+ CA
Sbjct: 423 FQQQNIQFVYDVGENTLSFAPVQCA 447
>gi|380719867|gb|AFD63134.1| aspartyl protease [Vitis quinquangularis]
Length = 458
Score = 146 bits (369), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 116/386 (30%), Positives = 175/386 (45%), Gaps = 42/386 (10%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
MDTGS + W PC + C +C + N K + F+P SSS C C + S B
Sbjct: 104 MDTGSHVVWAPC-TTHYTCTNCS-FSNPKKVPIFNPELSSSDKILGCRDPKCADTSSPBV 161
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
+G S C CP + YG G +G + L G + I
Sbjct: 162 HLGXPRCNGNS------KKCSHACPQYTLQYGTGA-ASGFFLLENLDFPG------KTIH 208
Query: 126 KFCFGCVGSTYREPI--GIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSPLV 183
KF GC S REP +AGFGR S+P Q+G K F++C + Y + N S +
Sbjct: 209 KFLVGCTTSADREPSSDALAGFGRTMFSLPMQMGV--KKFAYCLNSHDYDDTRN--SGKL 264
Query: 184 IGDVAISSKDNLQFTPMLKSPM-YPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGGL 242
I D + L + P K+P YP YYY+G++ + IGN L +P S GG+
Sbjct: 265 ILDYSDGETQGLSYAPFXKNPPDYPIYYYLGVKDMKIGNKVL-RIPGKYLTPGSDSRGGV 323
Query: 243 LVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDL- 301
++DSG Y+++ P + + + L+ ++ Y R+ E+E +TG PC N T +
Sbjct: 324 VIDSGFAYSYMTLPVFKIVTNELKKQMSKYRRSLELEAQTGV-----TPCYNFTGHKSIK 378
Query: 302 FPSITFHFLNNVSLVLPQGNHFYAMSAPS-------NSSAVKCLLFQSMDDGDYGPSGVF 354
P + + F ++V+P N+F S S S L F GPS +
Sbjct: 379 IPDLIYQFTGGANMVVPGMNYFLLFSEASLGCFPVTTDSPTSNLEFTP------GPSIIL 432
Query: 355 GSFQQQNVEVVYDLEKERIGFQPMDC 380
G++QQ + V +DL+ ER+GF+ C
Sbjct: 433 GNYQQVDHYVEFDLKNERLGFRQQTC 458
>gi|147862576|emb|CAN79341.1| hypothetical protein VITISV_006338 [Vitis vinifera]
Length = 436
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 121/381 (31%), Positives = 181/381 (47%), Gaps = 67/381 (17%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
MDTGSDL W C C C D + F P +SSS S+ C+S C+ +
Sbjct: 114 MDTGSDLIWTQCK----PCKVCFD----QPTPIFDPEKSSSFSKLPCSSDLCVAL----- 160
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
P C+ GC + Y+YG+ G+L +T +S +
Sbjct: 161 PISSCS-DGCE---------------YRYSYGDHSSTQGVLATETFTFGDAS------VS 198
Query: 126 KFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSP 181
K FGC G Y + G+ G GRG LS+ SQLG + FS+C + + IS+
Sbjct: 199 KIGFGCGEDNRGRAYSQGAGLVGLGRGPLSLISQLGVPK--FSYCLTSID--DSKGISTL 254
Query: 182 LVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ--GN 239
LV + + S TP++++P P++YY+ LE I++G+ T +P+ F Q G+
Sbjct: 255 LVGSEATVKSAIP---TPLIQNPSRPSFYYLSLEGISVGD---TLLPIEKSTFSIQDDGS 308
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTD 299
GGL++DSGTT T+L + ++ L + I+ + T +LC+ +P P+ + +
Sbjct: 309 GGLIIDSGTTITYLKDNAFAALK---KEFISQMKLDVDASGSTELELCFTLP-PDGSPVE 364
Query: 300 DLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQ 359
P + FHF V L LP+ N+ SA V CL G +FG+FQQ
Sbjct: 365 --VPQLVFHF-EGVDLKLPKENYIIEDSALR----VICLTM-----GSSSGMSIFGNFQQ 412
Query: 360 QNVEVVYDLEKERIGFQPMDC 380
QN+ V++DLEKE I F P C
Sbjct: 413 QNIVVLHDLEKETISFAPAQC 433
>gi|297814776|ref|XP_002875271.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
lyrata]
gi|297321109|gb|EFH51530.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
lyrata]
Length = 451
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 120/390 (30%), Positives = 179/390 (45%), Gaps = 46/390 (11%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + DTGSDL WV C C +C + + F P SS+ S C C +
Sbjct: 96 LLLIADTGSDLVWVKCSA----CRNCSHHSPATV---FFPRHSSTFSPAHCYDPVCRLVP 148
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
P C+ T + STC + Y Y +G L +G+ R+T + SS G
Sbjct: 149 ------KPGRAPRCN-HTRIHSTC-----PYEYGYADGSLTSGLFARETTSLKTSS-GKE 195
Query: 122 REIPKFCFGC---------VGSTYREPIGIAGFGRGALSVPSQLGF-LQKGFSHCFLAFK 171
++ FGC G+++ G+ G GRG +S SQLG FS+C +
Sbjct: 196 AKLKSVAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQLGRRFGNKFSYCLM--D 253
Query: 172 YANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSL 231
Y P +S L+IGD + L FTP+L +P+ P +YY+ L+++ + + L P S+
Sbjct: 254 YTLSPPPTSYLIIGDGG-DAVSKLFFTPLLTNPLSPTFYYVKLKSVFVNGAKLRIDP-SI 311
Query: 232 REFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVP 291
E D GNGG ++DSGTT L +P Y +++ ++ I P A E+ GFDLC V
Sbjct: 312 WEIDDSGNGGTVMDSGTTLAFLADPAYRLVIAAVKQRIK-LPNADELTP--GFDLCVNV- 367
Query: 292 CPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPS 351
T + + P + F F V P N+F ++CL QS+D G S
Sbjct: 368 -SGVTKPEKILPRLKFEFSGGAVFVPPPRNYFI-----ETEEQIQCLAIQSVDP-KVGFS 420
Query: 352 GVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
V G+ QQ +D ++ R+GF CA
Sbjct: 421 -VIGNLMQQGFLFEFDRDRSRLGFSRRGCA 449
>gi|356516413|ref|XP_003526889.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 145 bits (367), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 121/380 (31%), Positives = 178/380 (46%), Gaps = 59/380 (15%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGSDL W C C C Y+ + F P +SSS S+ +C SS C
Sbjct: 125 LDTGSDLIWTQCK----PCTRC--YKQPTPI--FDPKKSSSFSKVSCGSSLC-------- 168
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
S L STC C + Y+YG+ + G+L +T S + +
Sbjct: 169 ------------SALPSSTCSDGC-EYVYSYGDYSMTQGVLATETFTFGKSKNKV--SVH 213
Query: 126 KFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSP 181
FGC G + + G+ G GRG LS+ SQL ++ FS+C D S
Sbjct: 214 NIGFGCGEDNEGDGFEQASGLVGLGRGPLSLVSQLK--EQRFSYCLTPI----DDTKESV 267
Query: 182 LVIGDVA-ISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNG 240
L++G + + + TP+LK+P+ P++YY+ LEAI++G++ L+ + S E GNG
Sbjct: 268 LLLGSLGKVKDAKEVVTTPLLKNPLQPSFYYLSLEAISVGDTRLS-IEKSTFEVGDDGNG 326
Query: 241 GLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDD 300
G+++DSGTT T++ + Y L + I+ A + TG DLC+ +P + T
Sbjct: 327 GVIIDSGTTITYVQQKAYEAL---KKEFISQTKLALDKTSSTGLDLCFSLPSGS---TQV 380
Query: 301 LFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQ 360
P + FHF L LP N+ S + V CL G +FG+ QQQ
Sbjct: 381 EIPKLVFHFKGG-DLELPAENYMIGDS----NLGVACLAM-----GASSGMSIFGNVQQQ 430
Query: 361 NVEVVYDLEKERIGFQPMDC 380
N+ V +DLEKE I F P C
Sbjct: 431 NILVNHDLEKETISFVPTSC 450
>gi|326507654|dbj|BAK03220.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 442
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 121/384 (31%), Positives = 183/384 (47%), Gaps = 56/384 (14%)
Query: 7 DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASS--FCLNIHSSD 64
DTGSDL W C C + ++PS S++ C SS C +
Sbjct: 106 DTGSDLIWTQCAPCGSQCF-------KQAGQPYNPSSSTTFGVLPCNSSVSMCAALAGPS 158
Query: 65 NPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREI 124
P GCS C + TYG G GI + +T GS+P +
Sbjct: 159 PP------PGCS---------CM----YNQTYGTG-WTAGIQSVETF-TFGSTPADQTRV 197
Query: 125 PKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSP 181
P FGC ++ + G+ G GRG++S+ SQLG FS+C F+ D N +S
Sbjct: 198 PGIAFGCSNASSDDWNGSAGLVGLGRGSMSLVSQLG--AGMFSYCLTPFQ---DANSTST 252
Query: 182 LVIGDVAISSKDNLQFTPMLKSPM---YPNYYYIGLEAITIGNSSLTEVPLSLREFDSQG 238
L++G A + + TP + SP YYY+ L I+IG ++L+ +P + + G
Sbjct: 253 LLLGPSAALNGTGVLTTPFVASPSKAPMSTYYYLNLTGISIGTTALS-IPPNAFALRTDG 311
Query: 239 NGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFT 298
GGL++DSGTT T L + Y Q+ + ++S +T P A + + TG DLC+ + + T T
Sbjct: 312 TGGLIIDSGTTITSLVDAAYQQVRAAIESLVTL-PVA-DGSDSTGLDLCFALT--SETST 367
Query: 299 DDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQ 358
PS+TFHF + +VLP N+ S V CL +M + G FG++Q
Sbjct: 368 PPSMPSMTFHF-DGADMVLPVDNYMIL------GSGVWCL---AMRNQTVGAMSTFGNYQ 417
Query: 359 QQNVEVVYDLEKERIGFQPMDCAS 382
QQNV ++YD+ +E + F P C++
Sbjct: 418 QQNVHLLYDIHEETLSFAPAKCST 441
>gi|414590468|tpg|DAA41039.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 469
Score = 145 bits (366), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 112/382 (29%), Positives = 178/382 (46%), Gaps = 49/382 (12%)
Query: 7 DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDNP 66
DTGSDL W C C + + ++P+ S++ S C SS +
Sbjct: 130 DTGSDLIWTQCAPCGTQCFE-------QPAPLYNPASSTTFSVLPCNSSLSM-------- 174
Query: 67 FDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIPK 126
C + + C + TYG G G+ +T GSS +P
Sbjct: 175 ---CAGALAGAAPPPGCACM-----YNQTYGTG-WTAGVQGSETF-TFGSSAADQARVPG 224
Query: 127 FCFGCVGST---YREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSPLV 183
FGC ++ + G+ G GRG+LS+ SQLG + FS+C F+ D N +S L+
Sbjct: 225 VAFGCSNASSSDWNGSAGLVGLGRGSLSLVSQLGAGR--FSYCLTPFQ---DTNSTSTLL 279
Query: 184 IGDVAISSKDNLQFTPMLKSPMYP---NYYYIGLEAITIGNSSLTEVPLSLREFDSQGNG 240
+G A + ++ TP + SP YYY+ L I++G +L P + G G
Sbjct: 280 LGPSAALNGTGVRSTPFVASPARAPMSTYYYLNLTGISLGAKALPISPGAF-SLKPDGTG 338
Query: 241 GLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDD 300
GL++DSGTT T L Y Q+ + ++S +T P + + TG DLC+ +P P +
Sbjct: 339 GLIIDSGTTITSLANAAYQQVRAAVKSLVTTLPTV-DGSDSTGLDLCFALPAPTSA-PPA 396
Query: 301 LFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQ 360
+ PS+T HF + +VLP ++ + S V CL ++ DG FG++QQQ
Sbjct: 397 VLPSMTLHF-DGADMVLPADSYMI------SGSGVWCLAMRNQTDGAMS---TFGNYQQQ 446
Query: 361 NVEVVYDLEKERIGFQPMDCAS 382
N+ ++YD+ +E + F P C++
Sbjct: 447 NMHILYDVREETLSFAPAKCST 468
>gi|293329689|dbj|BAJ04354.1| pollen allergen CPA63 [Cryptomeria japonica]
Length = 472
Score = 144 bits (364), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 120/382 (31%), Positives = 175/382 (45%), Gaps = 59/382 (15%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGS++ W+PC C C + F PS+SS+ + TCAS C +
Sbjct: 141 LDTGSNIAWIPCN----PCSGCSSKQQP-----FEPSKSSTYNYLTCASQQCQLLRV--- 188
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
CT S S++ S YG+ V IL+ +TL V +++
Sbjct: 189 ----CTKSDNSVNC-----------SLTQRYGDQSEVDEILSSETLSVGS------QQVE 227
Query: 126 KFCFGCV----GSTYREPIGIAGFGRGALSVPSQLGFL-QKGFSHCFLA-FKYANDPNIS 179
F FGC G R P + GFGR LS SQ L FS+C + F A +
Sbjct: 228 NFVFGCSNAARGLIQRTP-SLVGFGRNPLSFVSQTATLYDSTFSYCLPSLFSSA----FT 282
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
L++G A+S++ L+FTP+L + YP++YY+GL I++G L +P D
Sbjct: 283 GSLLLGKEALSAQ-GLKFTPLLSNSRYPSFYYVGLNGISVGE-ELVSIPAGTLSLDESTG 340
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTD 299
G ++DSGT T L EP Y+ + +S ++ A + FD CY P D
Sbjct: 341 RGTIIDSGTVITRLVEPAYNAMRDSFRSQLSNLTMASPTDL---FDTCYNRPS-----GD 392
Query: 300 DLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSS-AVKCLLFQSMDDGDYGPSGVFGSFQ 358
FP IT HF +N+ L LP N Y P N +V CL F G FG++Q
Sbjct: 393 VEFPLITLHFDDNLDLTLPLDNILY----PGNDDGSVLCLAFGLPPGGGDDVLSTFGNYQ 448
Query: 359 QQNVEVVYDLEKERIGFQPMDC 380
QQ + +V+D+ + R+G +C
Sbjct: 449 QQKLRIVHDVAESRLGIASENC 470
>gi|242050430|ref|XP_002462959.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
gi|241926336|gb|EER99480.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
Length = 448
Score = 144 bits (363), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 117/385 (30%), Positives = 179/385 (46%), Gaps = 56/385 (14%)
Query: 7 DTGSDLTWVPCGNLSFD-CMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSF--CLNIHSS 63
DTGSDL W C S D C + ++P+ S++ C SS C + +
Sbjct: 110 DTGSDLIWTQCAPCSGDQCF-------AQPAPLYNPASSTTFGVLPCNSSLSMCAGVLAG 162
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
P C C + TYG G G+ +T GS+
Sbjct: 163 KAPPPGC--------------ACM----YNQTYGTG-WTAGVQGSETF-TFGSAAADQAR 202
Query: 124 IPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISS 180
+P FGC ++ + G+ G GRG+LS+ SQLG + FS+C F+ D N +S
Sbjct: 203 VPGIAFGCSNASSSDWNGSAGLVGLGRGSLSLVSQLGAGR--FSYCLTPFQ---DTNSTS 257
Query: 181 PLVIGDVAISSKDNLQFTPMLKSPM---YPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
L++G A + ++ TP + SP YYY+ L I++G +L+ P + +
Sbjct: 258 TLLLGPSAALNGTGVRSTPFVASPAKAPMSTYYYLNLTGISLGAKALSISPDAF-SLKAD 316
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTF 297
G GGL++DSGTT T L Y Q+ + +QS +T A + + TG DLCY +P P T
Sbjct: 317 GTGGLIIDSGTTITSLVNAAYQQVRAAVQSLVTL--PAIDGSDSTGLDLCYALPTP--TS 372
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSF 357
PS+T HF + +VLP ++ + S V CL ++ DG FG++
Sbjct: 373 APPAMPSMTLHF-DGADMVLPADSYMI------SGSGVWCLAMRNQTDGAMS---TFGNY 422
Query: 358 QQQNVEVVYDLEKERIGFQPMDCAS 382
QQQN+ ++YD+ E + F P C++
Sbjct: 423 QQQNMHILYDVRNEMLSFAPAKCST 447
>gi|255566835|ref|XP_002524401.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223536362|gb|EEF38012.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 455
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 117/396 (29%), Positives = 176/396 (44%), Gaps = 55/396 (13%)
Query: 1 VIQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFC-LN 59
+ + DTGSDL WV C C +C S F S++ S C S C L
Sbjct: 98 TLLLVADTGSDLIWVKCS----PCRNCSHRSPG---SAFFARHSTTYSAIHCYSPQCQLV 150
Query: 60 IHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPG 119
H NP + T L S C + YTY + TG +++ L ++ S+ G
Sbjct: 151 PHPHPNPCN---------RTRLHSPC-----RYQYTYADSSTTTGFFSKEALTLNTST-G 195
Query: 120 IIREIPKFCFGC---------VGSTYREPIGIAGFGRGALSVPSQLGF-LQKGFSHCFLA 169
++++ FGC G+++ G+ G GR +S SQLG FS+C +
Sbjct: 196 KVKKLNGLSFGCGFRISGPSLTGASFEGAQGVMGLGRAPISFSSQLGRRFGSKFSYCLM- 254
Query: 170 FKYANDPNISSPLVIG---DVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTE 226
Y P +S L IG +VA+S K + FTP+L +P+ P +YYI ++ + + L
Sbjct: 255 -DYTLSPPPTSFLTIGGAQNVAVSKKGIMSFTPLLINPLSPTFYYIAIKGVYVNGVKLPI 313
Query: 227 VPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDL 286
P S+ D GNGG ++DSGTT T + EP Y+++L + + E GFDL
Sbjct: 314 NP-SVWSIDDLGNGGTIIDSGTTLTFITEPAYTEILKAFKKRVKL---PSPAEPTPGFDL 369
Query: 287 CYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSM-DD 345
C V + T P ++F+ P N+F +KCL Q + D
Sbjct: 370 CMNV----SGVTRPALPRMSFNLAGGSVFSPPPRNYFI-----ETGDQIKCLAVQPVSQD 420
Query: 346 GDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
G + V G+ QQ + +D +K R+GF CA
Sbjct: 421 GGF---SVLGNLMQQGFLLEFDRDKSRLGFTRRGCA 453
>gi|357122568|ref|XP_003562987.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 455
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 117/393 (29%), Positives = 180/393 (45%), Gaps = 52/393 (13%)
Query: 3 QVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCAS--SFCLNI 60
+ DTGSDL W C D D+ + ++PS S++ C S S C +
Sbjct: 101 RAIADTGSDLIWTQCAPCGDTVTDTDNQCFKQSGCLYNPSSSTTFGVLPCNSPLSMCAAM 160
Query: 61 HSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
P GC+ C + TYG G G+ + +T SS
Sbjct: 161 AGPSPP------PGCA---------CM----YNQTYGTG-WTAGVQSVETFTFGSSSTPP 200
Query: 121 IREIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPN 177
+P FGC ++ + G+ G GRG++S+ SQLG FS+C F+ D N
Sbjct: 201 AVRVPNIAFGCSNASSNDWNGSAGLVGLGRGSMSLVSQLG--AGAFSYCLTPFQ---DAN 255
Query: 178 ISSPLVIG---DVAISSKDNLQFTPML----KSPMYPNYYYIGLEAITIGNSSLTEVPLS 230
+S L++G A+ ++ TP + K+PM YYY+ L I++G ++L +P
Sbjct: 256 STSTLLLGPSAAAALKGTGPVRSTPFVAGPSKAPMS-TYYYLNLTGISVGETAL-AIPPD 313
Query: 231 LREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQST-ITYYPRAKEVEERTGFDLCYR 289
+ G GGL++DSGTT T L + Y Q+ + ++S +T P A + TG DLC+
Sbjct: 314 AFSLRADGTGGLIIDSGTTITTLVDSAYQQVRAAVRSLLVTRLPLAHGPDHSTGLDLCFA 373
Query: 290 VPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYG 349
+ + PS+T HF +VLP N+ S V CL +M + G
Sbjct: 374 L---KASTPPPAMPSMTLHFEGGADMVLPVENYMIL------GSGVWCL---AMRNQTVG 421
Query: 350 PSGVFGSFQQQNVEVVYDLEKERIGFQPMDCAS 382
+ G++QQQN+ V+YD+ KE + F P C+S
Sbjct: 422 AMSMVGNYQQQNIHVLYDVRKETLSFAPAVCSS 454
>gi|15230868|ref|NP_189198.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|11994761|dbj|BAB03090.1| chloroplast nucleoid DNA binding protein-like; nucellin-like
protein [Arabidopsis thaliana]
gi|189339286|gb|ACD89063.1| At3g25700 [Arabidopsis thaliana]
gi|332643533|gb|AEE77054.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 452
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 118/385 (30%), Positives = 173/385 (44%), Gaps = 46/385 (11%)
Query: 7 DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDNP 66
DTGSDL WV C C +C + + F P SS+ S C C + D
Sbjct: 102 DTGSDLVWVKCSA----CRNCSHHSPATV---FFPRHSSTFSPAHCYDPVCRLVPKPDR- 153
Query: 67 FDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIPK 126
+ T + STC + Y Y +G L +G+ R+T + SS G +
Sbjct: 154 ------APICNHTRIHSTC-----HYEYGYADGSLTSGLFARETTSLKTSS-GKEARLKS 201
Query: 127 FCFGC---------VGSTYREPIGIAGFGRGALSVPSQLGF-LQKGFSHCFLAFKYANDP 176
FGC G+++ G+ G GRG +S SQLG FS+C + Y P
Sbjct: 202 VAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQLGRRFGNKFSYCLM--DYTLSP 259
Query: 177 NISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
+S L+IG+ L FTP+L +P+ P +YY+ L+++ + + L P S+ E D
Sbjct: 260 PPTSYLIIGNGG-DGISKLFFTPLLTNPLSPTFYYVKLKSVFVNGAKLRIDP-SIWEIDD 317
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNT 296
GNGG +VDSGTT L EP Y +++ ++ + P A + GFDLC V T
Sbjct: 318 SGNGGTVVDSGTTLAFLAEPAYRSVIAAVRRRVK-LPIADALTP--GFDLCVNV--SGVT 372
Query: 297 FTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGS 356
+ + P + F F V P N+F ++CL QS+D G S V G+
Sbjct: 373 KPEKILPRLKFEFSGGAVFVPPPRNYFI-----ETEEQIQCLAIQSVDP-KVGFS-VIGN 425
Query: 357 FQQQNVEVVYDLEKERIGFQPMDCA 381
QQ +D ++ R+GF CA
Sbjct: 426 LMQQGFLFEFDRDRSRLGFSRRGCA 450
>gi|167997964|ref|XP_001751688.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696786|gb|EDQ83123.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 418
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 120/382 (31%), Positives = 183/382 (47%), Gaps = 51/382 (13%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+D+GSDL WV C C C Y + + + PS SS+ S C SS CL I +++
Sbjct: 81 VDSGSDLLWVQCS----PCRQC--YAQDSPL--YVPSNSSTFSPVPCLSSDCLLIPATEG 132
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
PC C ++ Y Y + G+ ++ V G I
Sbjct: 133 --FPCDFR-------YPGAC-----AYEYLYADTSSSKGVFAYESATVDG------VRID 172
Query: 126 KFCFGCVGS----TYREPIGIAGFGRGALSVPSQLGFLQ-KGFSHCFLAFKYANDPNISS 180
K FGC GS ++ G+ G G+G LS SQ+G+ F++C + Y + ++SS
Sbjct: 173 KVAFGC-GSDNQGSFAAAGGVLGLGQGPLSFGSQVGYAYGNKFAYCLV--NYLDPTSVSS 229
Query: 181 PLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNG 240
L+ GD IS+ ++Q+TP++ +P P YY+ +E +T+G SL + S E D GNG
Sbjct: 230 SLIFGDELISTIHDMQYTPIVSNPKSPTLYYVQIEKVTVGGKSL-PISDSAWEIDLLGNG 288
Query: 241 GLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDD 300
G + DSGTT T+ YS +L+ S + +YPRA+ V+ G DLC +
Sbjct: 289 GSIFDSGTTLTYWFPSAYSHILAAFDSGV-HYPRAESVQ---GLDLCVEL----TGVDQP 340
Query: 301 LFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQ 360
FPS T F ++ ++ P+ +++ AP+ V+CL + G G+ QQ
Sbjct: 341 SFPSFTIEF-DDGAVFQPEAENYFVDVAPN----VRCLAMAGLAS-PLGGFNTIGNLLQQ 394
Query: 361 NVEVVYDLEKERIGFQPMDCAS 382
N V YD E+ IGF P C+S
Sbjct: 395 NFFVQYDREENLIGFAPAKCSS 416
>gi|242073260|ref|XP_002446566.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
gi|241937749|gb|EES10894.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
Length = 452
Score = 143 bits (360), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 118/388 (30%), Positives = 183/388 (47%), Gaps = 68/388 (17%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGSDL W C C+DC ++ + + F PS SS+ + C+S+ C SD
Sbjct: 117 VDTGSDLVWTQCK----PCVDC--FKQSTPV--FDPSSSSTYATVPCSSALC-----SDL 163
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
P CT + S C + YTYG+ G+L +T + +++P
Sbjct: 164 PTSTCTSA---------SKC-----GYTYTYGDASSTQGVLASETFTLGKEK----KKLP 205
Query: 126 KFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSP 181
FGC G + + G+ G GRG LS+ SQLG + FS+C + +D + SP
Sbjct: 206 GVAFGCGDTNEGDGFTQGAGLVGLGRGPLSLVSQLGLDK--FSYCLTSL---DDGDGKSP 260
Query: 182 LVIGDVAISSKDN-----LQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
L++G A + ++ +Q TP++K+P P++YY+ L +T+G++ +T +P S
Sbjct: 261 LLLGGSAAAISESAATAPVQTTPLVKNPSQPSFYYVSLTGLTVGSTRIT-LPASAFAIQD 319
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNT 296
G GG++VDSGT+ T+L Y L + + P E G DLC++ P
Sbjct: 320 DGTGGVIVDSGTSITYLELQGYRALKKAFVAQMA-LPTVDGSE--IGLDLCFQGPAKG-- 374
Query: 297 FTDDL-FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPS---G 352
D++ P + HF L LP N+ SA S CL PS
Sbjct: 375 -VDEVQVPKLVLHFDGGADLDLPAENYMVLDSA----SGALCLTV--------APSRGLS 421
Query: 353 VFGSFQQQNVEVVYDLEKERIGFQPMDC 380
+ G+FQQQN + VYD+ + + F P+ C
Sbjct: 422 IIGNFQQQNFQFVYDVAGDTLSFAPVQC 449
>gi|225440720|ref|XP_002275202.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 479
Score = 142 bits (359), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 118/392 (30%), Positives = 180/392 (45%), Gaps = 40/392 (10%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCD-DYRNNKLMSNFSPSRSSSSSRDTCASSFCLNI 60
+ +DTGS + W PC + C +C K + F+P SSSS C + C+N
Sbjct: 100 LSFLVDTGSHVVWAPC-TTHYTCTNCSFSDAEPKKVPIFNPKLSSSSKILGCRNPKCVNT 158
Query: 61 HSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
S D GC C CP ++ YG TG + D L + + PG
Sbjct: 159 SSPD------VHLGCPPCNGNSKNCSHACPPYSLQYG-----TGASSGDFLLENLNFPG- 206
Query: 121 IREIPKFCFGCVGSTYRE--PIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNI 178
+ I +F GC S E +AGFGR S+P Q+G K F++C + Y +D
Sbjct: 207 -KTIHEFLVGCTTSAVGEVTSAALAGFGRSMFSLPMQMGV--KKFAYCLNSHDY-DDTRN 262
Query: 179 SSPLVIGDVAISSKDNLQFTPMLKSPM-YPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
SS L++ D + L + P LK+P +P YYY+G++ I IGN L +P S
Sbjct: 263 SSKLIL-DYSDGETKGLSYAPFLKNPPDFPIYYYLGVKDIKIGNK-LLRIPSKYLAPGSD 320
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTF 297
G GGL++DSG Y ++ P + ++ + L+ ++ Y R+ E E G CY N T
Sbjct: 321 GRGGLMIDSGFAYGYMTGPVFKKVTNELKKRMSKYRRSLEAEAEIGVTPCY-----NFTG 375
Query: 298 TDDL-FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDY------GP 350
+ P + + F ++V+P N+F + S + C + D G GP
Sbjct: 376 QKSIKIPDLIYQFRGGATMVVPGKNYFVLIPEIS----LACFPL-TTDAGTNTLEFTPGP 430
Query: 351 SGVFGSFQQQNVEVVYDLEKERIGFQPMDCAS 382
S + G+ Q + V +DL+ ER+GF+ C S
Sbjct: 431 SIILGNSQHVDYYVEFDLKNERLGFRQQTCQS 462
>gi|226492465|ref|NP_001150925.1| aspartic proteinase nepenthesin-1 [Zea mays]
gi|195642996|gb|ACG40966.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 472
Score = 142 bits (357), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 110/382 (28%), Positives = 176/382 (46%), Gaps = 48/382 (12%)
Query: 7 DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDNP 66
DTGSDL W C C + + ++P+ S++ S C SS +
Sbjct: 132 DTGSDLIWTQCAPCGTQCFE-------QPAPLYNPASSTTFSVLPCNSSLSM-------- 176
Query: 67 FDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIPK 126
C + + C + TYG G G+ +T GSS +P
Sbjct: 177 ---CAGALAGAAPPPGCACM-----YYQTYGTG-WTAGVQGSETF-TFGSSAADQARVPG 226
Query: 127 FCFGCVGST---YREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSPLV 183
FGC ++ + G+ G GRG+LS+ SQLG + FS+C F+ D N +S L+
Sbjct: 227 VAFGCSNASSSDWNGSAGLVGLGRGSLSLVSQLGAGR--FSYCLTPFQ---DTNSTSTLL 281
Query: 184 IGDVAISSKDNLQFTPMLKSPMYP---NYYYIGLEAITIGNSSLTEVPLSLREFDSQGNG 240
+G A + ++ TP + SP YYY+ L I++G +L P + G G
Sbjct: 282 LGPSAALNGTGVRSTPFVASPARAPMSTYYYLNLTGISLGAKALPISPGAF-SLKPDGTG 340
Query: 241 GLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDD 300
GL++DSGTT T L Y Q+ + ++S + + + TG DLC+ +P P +
Sbjct: 341 GLIIDSGTTITSLANAAYQQVRAAVKSQLVTTLPTVDGSDSTGLDLCFALPAPTSA-PPA 399
Query: 301 LFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQ 360
+ PS+T HF + +VLP ++ + S V CL ++ DG FG++QQQ
Sbjct: 400 VLPSMTLHF-DGADMVLPADSYMI------SGSGVWCLAMRNQTDGAM---STFGNYQQQ 449
Query: 361 NVEVVYDLEKERIGFQPMDCAS 382
N+ ++YD+ +E + F P C++
Sbjct: 450 NMHILYDVREETLSFAPAKCST 471
>gi|302781476|ref|XP_002972512.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
gi|300159979|gb|EFJ26598.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
Length = 496
Score = 141 bits (355), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 120/405 (29%), Positives = 189/405 (46%), Gaps = 50/405 (12%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ +DTGS+ V CG+ S D P+ S S + C S CL +
Sbjct: 113 LSAIIDTGSEAVLVQCGSRSRPVFD--------------PAASQSYRQVPCISQLCLAVQ 158
Query: 62 --SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGS-SP 118
+S+ PC S +TC +++ +YG+ TG ++D + ++ + S
Sbjct: 159 QQTSNGSSQPCVNS--------SATC-----TYSLSYGDSRNSTGDFSQDVIFLNSTNSS 205
Query: 119 GIIREIPKFCFGCVGS-----TYREPIGIAGFGRGALSVPSQLGFLQKG--FSHCFLAFK 171
G + FGC S +GI GF RG LS+PSQL G FS+CF +
Sbjct: 206 GQAVQFRDVAFGCAHSPQGFLVDLGSLGIVGFNRGNLSLPSQLKDRLGGSKFSYCFPSQP 265
Query: 172 YANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYP---NYYYIGLEAITIGNSSLTEVP 228
+ P + + +GD +S K + +TP+L +P+ P YY+GL +I++ +L +P
Sbjct: 266 W--QPRATGVIFLGDSGLS-KSKVGYTPLLDNPVTPARSQLYYVGLTSISVDGKTLA-IP 321
Query: 229 LSLREFD-SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLC 287
S + D S G+GG ++DSGTT+T + + Y+ + ++ R K+V GFD C
Sbjct: 322 ESAFKLDPSTGDGGTVLDSGTTFTRVVDDAYTAFRNAFAASNRSGLR-KKVGAAAGFDDC 380
Query: 288 YRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGD 347
Y + ++ P + NNV L L + F +SA N V CL S
Sbjct: 381 YNISAGSSL---PGVPEVRLSLQNNVRLELRFEHLFVPVSAAGNEVTV-CLAILSSQKSG 436
Query: 348 YGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASAQGLHKK 392
+G V G++QQ N V YD E+ R+GF+ DC+ A + +H K
Sbjct: 437 FGKINVLGNYQQSNYLVEYDNERSRVGFERADCSGAAGSFLVHSK 481
>gi|168000296|ref|XP_001752852.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696015|gb|EDQ82356.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 384
Score = 141 bits (355), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 119/382 (31%), Positives = 171/382 (44%), Gaps = 56/382 (14%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V +DTGSDL WV C C C K F PS+S S + C + C + S
Sbjct: 54 VIVDTGSDLNWVQC----LPCRVCYQQPGPK----FDPSKSRSFRKAACTDNLC---NVS 102
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
P C + C + YTYG+ G L +T+ ++ + +
Sbjct: 103 ALPLKACAANVCQ---------------YQYTYGDQSNTNGDLAFETISLNNGAG--TQS 145
Query: 124 IPKFCFGCVGS---TYREPIGIAGFGRGALSVPSQLG--FLQKGFSHCFLAFKYANDPNI 178
+P F FGC T+ G+ G G+G LS+ SQL F K FS+C ++ +
Sbjct: 146 VPNFAFGCGTQNLGTFAGAAGLVGLGQGPLSLNSQLSHTFANK-FSYCLVSLNSLS---- 200
Query: 179 SSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQG 238
+SPL G +A ++ N+Q+T ++ + +P YYY+ L +I +G L P S G
Sbjct: 201 ASPLTFGSIAAAA--NIQYTSIVVNARHPTYYYVQLNSIEVGGQPLNLAPSVFAIDQSTG 258
Query: 239 NGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFT 298
GG ++DSGTT T L P YS +L +S + YPR G DLC+ + +N
Sbjct: 259 RGGTIIDSGTTITMLTLPAYSAVLRAYESFVN-YPRLD--GSAYGLDLCFNIAGVSNPSV 315
Query: 299 DDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQ 358
P + F F + N F + S+ CL G G S + G+ Q
Sbjct: 316 ----PDMVFKF-QGADFQMRGENLFVLV---DTSATTLCLAM----GGSQGFS-IIGNIQ 362
Query: 359 QQNVEVVYDLEKERIGFQPMDC 380
QQN VVYDLE ++IGF DC
Sbjct: 363 QQNHLVVYDLEAKKIGFATADC 384
>gi|115461432|ref|NP_001054316.1| Os04g0685200 [Oryza sativa Japonica Group]
gi|113565887|dbj|BAF16230.1| Os04g0685200, partial [Oryza sativa Japonica Group]
Length = 330
Score = 140 bits (354), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 109/318 (34%), Positives = 160/318 (50%), Gaps = 38/318 (11%)
Query: 89 CPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIPKFCFGC-VGSTYREPIGIAGFGR 147
CP + YG G G+L DTL+ G R + F GC + S ++ P G+AGFGR
Sbjct: 29 CPPYLVVYGSGS-TAGLLISDTLRTPG------RAVRNFVIGCSLASVHQPPSGLAGFGR 81
Query: 148 GALSVPSQLGFLQKGFSHCFLAFKYANDPNISSPLVIGDVAISSKDN-LQFTPMLKS--- 203
GA SVPSQLG + FS+C L+ ++ ++ +S L++G +Q+ P+ +S
Sbjct: 82 GAPSVPSQLGLTK--FSYCLLSRRFDDNAAVSGELILGGAGGKDGGVGMQYAPLARSASA 139
Query: 204 -PMYPNYYYIGLEAITIGNSSLTEVPLSLREF-DSQGNGGLLVDSGTTYTHLPEPFYSQL 261
P Y YYY+ L AIT+G S V L R F GG +VDSGTT+++ + +
Sbjct: 140 RPPYSVYYYLALTAITVGGKS---VQLPERAFVAGGAGGGAIVDSGTTFSYFDRTVFEPV 196
Query: 262 LSILQSTIT-YYPRAKEVEERTGFDLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQG 320
+ + + + Y R+K VEE G C+ +P T P ++ HF + LP
Sbjct: 197 AAAVVAAVGGRYSRSKVVEEGLGLSPCFAMPPGTKTME---LPEMSLHFKGGSVMNLPVE 253
Query: 321 NHFYAMS------APSNSSAVKCLLFQS--------MDDGDYGPSGVFGSFQQQNVEVVY 366
N+F AP+ + A+ CL S GP+ + GSFQQQN + Y
Sbjct: 254 NYFVVAGPAPSGGAPAMAEAI-CLAVVSDVPTSSGGAGVSSGGPAIILGSFQQQNYYIEY 312
Query: 367 DLEKERIGFQPMDCASTA 384
DLEKER+GF+ CAS++
Sbjct: 313 DLEKERLGFRRQQCASSS 330
>gi|356508918|ref|XP_003523200.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 121/380 (31%), Positives = 175/380 (46%), Gaps = 59/380 (15%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGSDL W C C C Y+ + F P +SSS S+ +C SS C
Sbjct: 125 LDTGSDLIWTQCK----PCTQC--YKQPTPI--FDPKKSSSFSKVSCGSSLC-------- 168
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
S + STC C + Y+YG+ + G+L +T S + +
Sbjct: 169 ------------SAVPSSTCSDGC-EYVYSYGDYSMTQGVLATETFTFGKSKNKV--SVH 213
Query: 126 KFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSP 181
FGC G + + G+ G GRG LS+ SQL + FS+C D S
Sbjct: 214 NIGFGCGEDNEGDGFEQASGLVGLGRGPLSLVSQLK--EPRFSYCLTPM----DDTKESI 267
Query: 182 LVIGDVA-ISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNG 240
L++G + + + TP+LK+P+ P++YY+ LE I++G++ L+ + S E GNG
Sbjct: 268 LLLGSLGKVKDAKEVVTTPLLKNPLQPSFYYLSLEGISVGDTRLS-IEKSTFEVGDDGNG 326
Query: 241 GLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDD 300
G+++DSGTT T++ + + L S T P K TG DLC+ +P + T
Sbjct: 327 GVIIDSGTTITYIEQKAFEALKKEFISQ-TKLPLDK--TSSTGLDLCFSLPSGS---TQV 380
Query: 301 LFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQ 360
P I FHF L LP N+ S + V CL G +FG+ QQQ
Sbjct: 381 EIPKIVFHFKGG-DLELPAENYMIGDS----NLGVACLAM-----GASSGMSIFGNVQQQ 430
Query: 361 NVEVVYDLEKERIGFQPMDC 380
N+ V +DLEKE I F P C
Sbjct: 431 NILVNHDLEKETISFVPTSC 450
>gi|226493786|ref|NP_001142400.1| uncharacterized protein LOC100274575 [Zea mays]
gi|194708650|gb|ACF88409.1| unknown [Zea mays]
Length = 392
Score = 140 bits (352), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 115/390 (29%), Positives = 181/390 (46%), Gaps = 54/390 (13%)
Query: 3 QVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSF--CLNI 60
Q DTGSDL W C + C +R + N PS S++ + C SS C
Sbjct: 46 QAIADTGSDLIWTQCAPCTSQC-----FRQPTPLYN--PSSSTTFAVLPCNSSLSVCAAA 98
Query: 61 HSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
+ P GC+ + + TYG G T + GS+P
Sbjct: 99 LAGTGTAPP---PGCACT-------------YNVTYGSG--WTSVFQGSETFTFGSTPAG 140
Query: 121 IREIPKFCFGCV----GSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDP 176
+P FGC G G+ G GRG LS+ SQLG + FS+C ++ D
Sbjct: 141 HARVPGIAFGCSTASSGFNASSASGLVGLGRGRLSLVSQLGVPK--FSYCLTPYQ---DT 195
Query: 177 NISSPLVIGDVA-ISSKDNLQFTPMLKSPMYP---NYYYIGLEAITIGNSSLTEVPLSLR 232
N +S L++G A ++ + TP + SP +YY+ L I++G ++L+ +P
Sbjct: 196 NSTSTLLLGPSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALS-IPPDAF 254
Query: 233 EFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPC 292
++ G GGL++DSGTT T L Y Q+ + + S +T + TG DLC+ +P
Sbjct: 255 SLNADGTGGLIIDSGTTITLLGNTAYQQVRAAVVSLVTL--PTTDGSADTGLDLCFMLP- 311
Query: 293 PNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSG 352
++T PS+T HF N +VLP ++ S+ S + CL Q+ DG+
Sbjct: 312 -SSTSAPPAMPSMTLHF-NGADMVLPADSYMM-----SDDSGLWCLAMQNQTDGEVN--- 361
Query: 353 VFGSFQQQNVEVVYDLEKERIGFQPMDCAS 382
+ G++QQQN+ ++YD+ +E + F P C++
Sbjct: 362 ILGNYQQQNMHILYDIGQETLSFAPAKCSA 391
>gi|297817972|ref|XP_002876869.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297322707|gb|EFH53128.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 462
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 121/389 (31%), Positives = 175/389 (44%), Gaps = 66/389 (16%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGSDL W C C +C D + F P +SSS S+ C+S C + S+
Sbjct: 125 VDTGSDLIWTQCK----PCTECFD----QPTPIFDPEKSSSYSKVGCSSGLCNALPRSNC 176
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
D K +C + YTYG+ G+L +T + I
Sbjct: 177 NED-------------KDSC-----EYLYTYGDYSSTRGLLATETFTFEDEN-----SIS 213
Query: 126 KFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSP 181
FGC G + + G+ G GRG LS+ SQL + FS+C + + D SS
Sbjct: 214 GIGFGCGVENEGDGFSQGSGLVGLGRGPLSLISQLK--ETKFSYCLTSIE---DSEASSS 268
Query: 182 LVIGDVA--ISSKDNLQF-------TPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLR 232
L IG +A I +K +L++P P++YY+ L+ IT+G L+ V S
Sbjct: 269 LFIGSLASGIVNKTGANLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLS-VEKSTF 327
Query: 233 EFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPC 292
E G GG+++DSGTT T+L E + L S ++ + TG DLC+++P
Sbjct: 328 ELSEDGTGGMIIDSGTTITYLEETAFKVLKEEFTSRMSL---PVDDSGSTGLDLCFKLP- 383
Query: 293 PNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSG 352
N + P + FHF L LP N+ A S S+ V CL G
Sbjct: 384 --NAAKNIAVPKLIFHF-KGADLELPGENYMVADS----STGVLCLAM-----GSSNGMS 431
Query: 353 VFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
+FG+ QQQN V++DLEKE + F P +C
Sbjct: 432 IFGNVQQQNFNVLHDLEKETVTFVPTECG 460
>gi|414886964|tpg|DAA62978.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 452
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 115/390 (29%), Positives = 181/390 (46%), Gaps = 54/390 (13%)
Query: 3 QVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSF--CLNI 60
Q DTGSDL W C + C +R + N PS S++ + C SS C
Sbjct: 106 QAIADTGSDLIWTQCAPCTSQC-----FRQPTPLYN--PSSSTTFAVLPCNSSLSVCAAA 158
Query: 61 HSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
+ P GC+ + + TYG G T + GS+P
Sbjct: 159 LAGTGTAPP---PGCACT-------------YNVTYGSG--WTSVFQGSETFTFGSTPAG 200
Query: 121 IREIPKFCFGCV----GSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDP 176
+P FGC G G+ G GRG LS+ SQLG + FS+C ++ D
Sbjct: 201 HARVPGIAFGCSTASSGFNASSASGLVGLGRGRLSLVSQLGVPK--FSYCLTPYQ---DT 255
Query: 177 NISSPLVIGDVA-ISSKDNLQFTPMLKSPMYP---NYYYIGLEAITIGNSSLTEVPLSLR 232
N +S L++G A ++ + TP + SP +YY+ L I++G ++L+ +P
Sbjct: 256 NSTSTLLLGPSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALS-IPPDAF 314
Query: 233 EFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPC 292
++ G GGL++DSGTT T L Y Q+ + + S +T + TG DLC+ +P
Sbjct: 315 SLNADGTGGLIIDSGTTITLLGNTAYQQVRAAVVSLVTL--PTTDGSADTGLDLCFMLP- 371
Query: 293 PNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSG 352
++T PS+T HF N +VLP ++ S+ S + CL Q+ DG+
Sbjct: 372 -SSTSAPPAMPSMTLHF-NGADMVLPADSYMM-----SDDSGLWCLAMQNQTDGEVN--- 421
Query: 353 VFGSFQQQNVEVVYDLEKERIGFQPMDCAS 382
+ G++QQQN+ ++YD+ +E + F P C++
Sbjct: 422 ILGNYQQQNMHILYDIGQETLSFAPAKCSA 451
>gi|224286173|gb|ACN40797.1| unknown [Picea sitchensis]
Length = 383
Score = 139 bits (350), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 121/384 (31%), Positives = 182/384 (47%), Gaps = 63/384 (16%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCL--N 59
+ MDTGSDL W C C DC S + PS SS+ S+ C SS C +
Sbjct: 55 LSAIMDTGSDLVWTKCN----PCTDC------STSSIYDPSSSSTYSKVLCQSSLCQPPS 104
Query: 60 IHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPG 119
I S +N D C + Y YG+ +GIL+ +T + S
Sbjct: 105 IFSCNNDGD-----------------CE----YVYPYGDRSSTSGILSDETFSISSQS-- 141
Query: 120 IIREIPKFCFGC--VGSTYREPIGIAGFGRGALSVPSQLG-FLQKGFSHCFLAFKYANDP 176
+P FGC + + G+ GFGRG+LS+ SQLG + FS+C ++ D
Sbjct: 142 ----LPNITFGCGHDNQGFDKVGGLVGFGRGSLSLVSQLGPSMGNKFSYCLVS---RTDS 194
Query: 177 NISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
+ +SPL IG+ A + TP+++S N+YY+ LE I++G SL +P + S
Sbjct: 195 SKTSPLFIGNTASLEATTVGSTPLVQSS-STNHYYLSLEGISVGGQSLA-IPTGTFDIQS 252
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNT 296
G+GGL++DSGTT T L + Y + + S+I P+A DLC+ +N
Sbjct: 253 DGSGGLIIDSGTTLTFLQQTAYDAVKEAMVSSIN-LPQADGQ-----LDLCFNQQGSSNP 306
Query: 297 FTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGS 356
FPS+TFHF +P+ N+ + P ++S + CL + + G +FG+
Sbjct: 307 G----FPSMTFHF-KGADYDVPKENYLF----PDSTSDIVCLAMMPTNS-NLGNMAIFGN 356
Query: 357 FQQQNVEVVYDLEKERIGFQPMDC 380
QQQN +++YD E + F P C
Sbjct: 357 VQQQNYQILYDNENNVLSFAPTAC 380
>gi|226508080|ref|NP_001150678.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
gi|195641018|gb|ACG39977.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 450
Score = 139 bits (350), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 115/390 (29%), Positives = 181/390 (46%), Gaps = 54/390 (13%)
Query: 3 QVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSF--CLNI 60
Q DTGSDL W C + C +R + N PS S++ + C SS C
Sbjct: 104 QAIADTGSDLIWTQCAPCTSQC-----FRQPTPLYN--PSSSTTFAVLPCNSSLSVCAAA 156
Query: 61 HSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
+ P GC+ + + TYG G T + GS+P
Sbjct: 157 LAGTGTAPP---PGCACT-------------YNVTYGSG--WTSVFQGSETFTFGSTPAG 198
Query: 121 IREIPKFCFGCV----GSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDP 176
+P FGC G G+ G GRG LS+ SQLG + FS+C ++ D
Sbjct: 199 QSRVPGIAFGCSTASSGFNASSASGLVGLGRGRLSLVSQLGVPK--FSYCLTPYQ---DT 253
Query: 177 NISSPLVIGDVA-ISSKDNLQFTPMLKSPMYP---NYYYIGLEAITIGNSSLTEVPLSLR 232
N +S L++G A ++ + TP + SP +YY+ L I++G ++L+ +P
Sbjct: 254 NSTSTLLLGPSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALS-IPPDAF 312
Query: 233 EFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPC 292
++ G GGL++DSGTT T L Y Q+ + + S +T + TG DLC+ +P
Sbjct: 313 LLNADGTGGLIIDSGTTITLLGNTAYQQVRAAVVSLVTL--PTTDGSAATGLDLCFMLP- 369
Query: 293 PNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSG 352
++T PS+T HF N +VLP ++ S+ S + CL Q+ DG+
Sbjct: 370 -SSTSAPPAMPSMTLHF-NGADMVLPADSYMM-----SDDSGLWCLAMQNQTDGEVN--- 419
Query: 353 VFGSFQQQNVEVVYDLEKERIGFQPMDCAS 382
+ G++QQQN+ ++YD+ +E + F P C++
Sbjct: 420 ILGNYQQQNMHILYDIGQETLSFAPAKCSA 449
>gi|302781668|ref|XP_002972608.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
gi|300160075|gb|EFJ26694.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
Length = 430
Score = 139 bits (349), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 115/391 (29%), Positives = 182/391 (46%), Gaps = 38/391 (9%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + DTGSDL W+ C + C ++ F S+S++ S C+++ CL +
Sbjct: 67 VLLIADTGSDLIWLQCSTTAAPPAFCPKKACSR-RPAFVASKSATLSVVPCSAAQCLLVP 125
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCP-SFAYTYGEGGLVTGILTRDTLKV-HGSSPG 119
+ C+ + P P +AY Y +G TG L RDT + +G+S G
Sbjct: 126 APRGHGPSCSPAA-------------PVPCGYAYDYADGSSTTGFLARDTATISNGTSGG 172
Query: 120 IIREIPKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFL-QKGFSHCFLAFKYAN 174
+ FGC G ++ G+ G G+G LS P+Q G L + FS+C L +
Sbjct: 173 A--AVRGVAFGCGTRNQGGSFSGTGGVIGLGQGQLSFPAQSGSLFAQTFSYCLLDLEGGR 230
Query: 175 DPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREF 234
SS L +G + +TP++ +P+ P +YY+G+ AI +GN L VP S
Sbjct: 231 RGRSSSFLFLGRP--ERRAAFAYTPLVSNPLAPTFYYVGVVAIRVGNRVL-PVPGSEWAI 287
Query: 235 DSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRA-KEVEERTGFDLCYRVPCP 293
D GNGG ++DSG+T T+L Y L+S +++ + PR G +LCY V
Sbjct: 288 DVLGNGGTVIDSGSTLTYLRLGAYLHLVSAFAASV-HLPRIPSSATFFQGLELCYNVSSS 346
Query: 294 NNTF-TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQ-SMDDGDYGPS 351
++ + FP +T F +SL LP GN+ + + VKCL + ++ +
Sbjct: 347 SSLAPANGGFPRLTIDFAQGLSLELPTGNYLVDV-----ADDVKCLAIRPTLSPFAF--- 398
Query: 352 GVFGSFQQQNVEVVYDLEKERIGFQPMDCAS 382
V G+ QQ V +D RIGF +C +
Sbjct: 399 NVLGNLMQQGYHVEFDRASARIGFARTECVA 429
>gi|168054484|ref|XP_001779661.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162668975|gb|EDQ55572.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 419
Score = 138 bits (348), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 117/381 (30%), Positives = 175/381 (45%), Gaps = 49/381 (12%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+D+GSDL WV C C+ C Y + + ++PS SS+ + C S CL I +++
Sbjct: 82 VDSGSDLLWVQCA----PCLQC--YAQDTPL--YAPSNSSTFNPVPCLSPECLLIPATEG 133
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
PC C ++ Y Y + L G+ ++ V I
Sbjct: 134 --FPCDFH-------YPGAC-----AYEYRYADTSLSKGVFAYESATVDDV------RID 173
Query: 126 KFCFGC---VGSTYREPIGIAGFGRGALSVPSQLGFLQ-KGFSHCFLAFKYANDPNISSP 181
K FGC ++ G+ G G+G LS SQ+G+ F++C + Y + ++SS
Sbjct: 174 KVAFGCGRDNQGSFAAAGGVLGLGQGPLSFGSQVGYAYGNKFAYCLV--NYLDPTSVSSW 231
Query: 182 LVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGG 241
L+ GD IS+ +LQFTP++ + P YY+ +E + +G SL + S D GNGG
Sbjct: 232 LIFGDELISTIHDLQFTPIVSNSRNPTLYYVQIEKVMVGGESL-PISHSAWSLDFLGNGG 290
Query: 242 LLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDL 301
+ DSGTT T+ P Y +L+ + YPRA V+ G DLC V
Sbjct: 291 SIFDSGTTVTYWLPPAYRNILAAFDKNVR-YPRAASVQ---GLDLCVDV----TGVDQPS 342
Query: 302 FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQN 361
FPS T L ++ PQ +++ AP+ V+CL + G G+ QQN
Sbjct: 343 FPSFTI-VLGGGAVFQPQQGNYFVDVAPN----VQCLAMAGLPS-SVGGFNTIGNLLQQN 396
Query: 362 VEVVYDLEKERIGFQPMDCAS 382
V YD E+ RIGF P C+S
Sbjct: 397 FLVQYDREENRIGFAPAKCSS 417
>gi|302780575|ref|XP_002972062.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
gi|300160361|gb|EFJ26979.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
Length = 429
Score = 138 bits (347), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 115/391 (29%), Positives = 183/391 (46%), Gaps = 38/391 (9%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + DTGSDL W+ C + C ++ F S+S++ S C+++ CL +
Sbjct: 66 VLLIADTGSDLIWLQCSTTAAPPAFCPKKACSR-RPAFVASKSATLSVVPCSAAQCLLVP 124
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCP-SFAYTYGEGGLVTGILTRDTLKV-HGSSPG 119
+ C+ + P P +AY Y +G TG L RDT + +G+S G
Sbjct: 125 APRGHGPACSPAA-------------PVPCGYAYDYADGSSTTGFLARDTATISNGTSGG 171
Query: 120 IIREIPKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFL-QKGFSHCFLAFKYAN 174
+ FGC G ++ G+ G G+G LS P+Q G L + FS+C L +
Sbjct: 172 A--AVRGVAFGCGTRNQGGSFSGTGGVIGLGQGQLSFPAQSGSLFAQTFSYCLLDLEGGR 229
Query: 175 DPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREF 234
SS L +G + +TP++ +P+ P +YY+G+ AI +GN L VP S
Sbjct: 230 RGRSSSFLFLGRP--ERRAAFAYTPLVSNPLAPTFYYVGVVAIRVGNRVL-PVPGSEWAI 286
Query: 235 DSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRA-KEVEERTGFDLCYRVPCP 293
D GNGG ++DSG+T T+L Y L+S +++ + PR G +LCY V
Sbjct: 287 DVLGNGGTVIDSGSTLTYLRLGAYLHLVSAFAASV-HLPRIPSSATFFQGLELCYNVSSS 345
Query: 294 NNTF-TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQ-SMDDGDYGPS 351
+++ + FP +T F +SL LP GN+ + + VKCL + ++ +
Sbjct: 346 SSSAPANGGFPRLTIDFAQGLSLELPTGNYLVDV-----ADDVKCLAIRPTLSPFAF--- 397
Query: 352 GVFGSFQQQNVEVVYDLEKERIGFQPMDCAS 382
V G+ QQ V +D RIGF +C +
Sbjct: 398 NVLGNLMQQGYHVEFDRASARIGFARTECVA 428
>gi|226508202|ref|NP_001141111.1| hypothetical protein precursor [Zea mays]
gi|194702684|gb|ACF85426.1| unknown [Zea mays]
gi|414590469|tpg|DAA41040.1| TPA: hypothetical protein ZEAMMB73_571218 [Zea mays]
Length = 439
Score = 137 bits (346), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 116/383 (30%), Positives = 177/383 (46%), Gaps = 54/383 (14%)
Query: 7 DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDNP 66
DTGSDL W C S C + ++PS S++ S C SS L
Sbjct: 103 DTGSDLIWTQCAPCSRQCF-------QQPTPLYNPSSSTTFSALPCNSSLGL-------- 147
Query: 67 FDPCTMSGCSLSTLLKSTCCRPCPSFAY--TYGEGGLVTGILTRDTLKVHGSSPGIIREI 124
C P + Y TYG G +T S+P +
Sbjct: 148 -------------------CAPACACMYNMTYGSGWTYV-FQGTETFTFGSSTPADQVRV 187
Query: 125 PKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISS 180
P FGC G G+ G GRG+LS+ SQLG FS+C ++ D N +S
Sbjct: 188 PGIAFGCSNASSGFNASSASGLVGLGRGSLSLVSQLG--APKFSYCLTPYQ---DTNSTS 242
Query: 181 PLVIGDVA-ISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
L++G A ++ + TP + SP YYY+ L I++G ++L +P + + G
Sbjct: 243 TLLLGPSASLNDTGVVSSTPFVASPSS-IYYYLNLTGISLGTTAL-PIPPNAFSLKADGT 300
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTD 299
GGL++DSGTT T L Y Q+ + + S +T + TG DLC+ + P++T
Sbjct: 301 GGLIIDSGTTITMLGNTAYQQVRAAVLSLVTL--PTTDGSAATGLDLCFEL--PSSTSAP 356
Query: 300 DLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQ 359
PS+T HF + +VLP N+ ++S P + S++ CL Q+ D D + G++QQ
Sbjct: 357 PSMPSMTLHF-DGADMVLPADNYMMSLSDPDSDSSLWCLAMQNQTDTDGVVVSILGNYQQ 415
Query: 360 QNVEVVYDLEKERIGFQPMDCAS 382
QN+ ++YD+ KE + F P C++
Sbjct: 416 QNMHILYDVGKETLSFAPAKCST 438
>gi|30678047|ref|NP_565298.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|30102688|gb|AAP21262.1| At2g03200 [Arabidopsis thaliana]
gi|110736021|dbj|BAE99983.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330250580|gb|AEC05674.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 461
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 120/389 (30%), Positives = 174/389 (44%), Gaps = 66/389 (16%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGSDL W C C +C D + F P +SSS S+ C+S C + S+
Sbjct: 124 VDTGSDLIWTQCK----PCTECFD----QPTPIFDPEKSSSYSKVGCSSGLCNALPRSNC 175
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
D K C + YTYG+ G+L +T + I
Sbjct: 176 NED-------------KDAC-----EYLYTYGDYSSTRGLLATETFTFEDEN-----SIS 212
Query: 126 KFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSP 181
FGC G + + G+ G GRG LS+ SQL + FS+C + + D SS
Sbjct: 213 GIGFGCGVENEGDGFSQGSGLVGLGRGPLSLISQLK--ETKFSYCLTSIE---DSEASSS 267
Query: 182 LVIGDVA--ISSKDNLQF-------TPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLR 232
L IG +A I +K +L++P P++YY+ L+ IT+G L+ V S
Sbjct: 268 LFIGSLASGIVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLS-VEKSTF 326
Query: 233 EFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPC 292
E G GG+++DSGTT T+L E + L S ++ + TG DLC+++P
Sbjct: 327 ELAEDGTGGMIIDSGTTITYLEETAFKVLKEEFTSRMSL---PVDDSGSTGLDLCFKLP- 382
Query: 293 PNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSG 352
+ + P + FHF L LP N+ A S S+ V CL G
Sbjct: 383 --DAAKNIAVPKMIFHF-KGADLELPGENYMVADS----STGVLCLAM-----GSSNGMS 430
Query: 353 VFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
+FG+ QQQN V++DLEKE + F P +C
Sbjct: 431 IFGNVQQQNFNVLHDLEKETVSFVPTECG 459
>gi|20197342|gb|AAC34482.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
Length = 353
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 120/389 (30%), Positives = 174/389 (44%), Gaps = 66/389 (16%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGSDL W C C +C D + F P +SSS S+ C+S C + S+
Sbjct: 16 VDTGSDLIWTQCK----PCTECFD----QPTPIFDPEKSSSYSKVGCSSGLCNALPRSNC 67
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
D K C + YTYG+ G+L +T + I
Sbjct: 68 NED-------------KDAC-----EYLYTYGDYSSTRGLLATETFTFEDENS-----IS 104
Query: 126 KFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSP 181
FGC G + + G+ G GRG LS+ SQL + FS+C + + D SS
Sbjct: 105 GIGFGCGVENEGDGFSQGSGLVGLGRGPLSLISQLK--ETKFSYCLTSIE---DSEASSS 159
Query: 182 LVIGDVA--ISSKDNLQF-------TPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLR 232
L IG +A I +K +L++P P++YY+ L+ IT+G L+ V S
Sbjct: 160 LFIGSLASGIVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLS-VEKSTF 218
Query: 233 EFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPC 292
E G GG+++DSGTT T+L E + L S ++ + TG DLC+++P
Sbjct: 219 ELAEDGTGGMIIDSGTTITYLEETAFKVLKEEFTSRMSL---PVDDSGSTGLDLCFKLP- 274
Query: 293 PNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSG 352
+ + P + FHF L LP N+ A S S+ V CL G
Sbjct: 275 --DAAKNIAVPKMIFHF-KGADLELPGENYMVADS----STGVLCLAM-----GSSNGMS 322
Query: 353 VFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
+FG+ QQQN V++DLEKE + F P +C
Sbjct: 323 IFGNVQQQNFNVLHDLEKETVSFVPTECG 351
>gi|224060469|ref|XP_002300215.1| predicted protein [Populus trichocarpa]
gi|222847473|gb|EEE85020.1| predicted protein [Populus trichocarpa]
Length = 439
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 120/383 (31%), Positives = 182/383 (47%), Gaps = 68/383 (17%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGSDL W C C C ++ F P +SSS S+ +C+S C
Sbjct: 114 LDTGSDLIWTQCK----PCTQC----FHQSTPIFDPKKSSSFSKLSCSSQLC-------- 157
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
L +S+C C + Y+YG+ GIL +TL +S +P
Sbjct: 158 ------------EALPQSSCNNGC-EYLYSYGDYSSTQGILASETLTFGKAS------VP 198
Query: 126 KFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSP 181
FGC GS + + G+ G GRG LS+ SQL + FS+C D +S
Sbjct: 199 NVAFGCGADNEGSGFSQGAGLVGLGRGPLSLVSQLK--EPKFSYCLTTV----DDTKTST 252
Query: 182 LVIGDVAI--SSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ-- 237
L++G +A +S ++ TP++ SP +P++YY+ LE I++G+ T +P+ F Q
Sbjct: 253 LLMGSLASVNASSSAIKTTPLIHSPAHPSFYYLSLEGISVGD---TRLPIKKSTFSLQDD 309
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTF 297
G+GGL++DSGTT T+L E ++ + + I + TG D+C+ +P +
Sbjct: 310 GSGGLIIDSGTTITYLEESAFNLVAKEFTAKINL---PVDSSGSTGLDVCFTLPSGS--- 363
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSF 357
T+ P + FHF + L LP N+ S S V CL G +FG+
Sbjct: 364 TNIEVPKLVFHF-DGADLELPAENYMIGDS----SMGVACLAM-----GSSSGMSIFGNV 413
Query: 358 QQQNVEVVYDLEKERIGFQPMDC 380
QQQN+ V++DLEKE + F P C
Sbjct: 414 QQQNMLVLHDLEKETLSFLPTQC 436
>gi|168014386|ref|XP_001759733.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689272|gb|EDQ75645.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 392
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 118/384 (30%), Positives = 184/384 (47%), Gaps = 49/384 (12%)
Query: 3 QVYMDTGSDLTWVPCGNLSFDCMDCDD-YRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ +DTGSDL +V C CD Y + + + PS SS+ + C S+ CL I
Sbjct: 48 HLIVDTGSDLAFV-------QCAPCDLCYEQDGPL--YQPSNSSTFTPVPCDSAECLLI- 97
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
P + S+ +S C S+ Y YG+ G+ +T V G I
Sbjct: 98 -------PAPVGAPCSSSYPESPPQGAC-SYEYRYGDNSSTVGVFAYETATVGG-----I 144
Query: 122 REIPKFCFGCVG---STYREPIGIAGFGRGALSVPSQLGF-LQKGFSHCFLAFKYANDPN 177
R + FGC ++ G+ G G+GALS SQ G+ + F++C + Y + +
Sbjct: 145 R-VNHVAFGCGNRNQGSFVSAGGVLGLGQGALSFTSQAGYAFENKFAYCLTS--YLSPTS 201
Query: 178 ISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
+ S L+ GD +S+ +LQFTP++ +P+ P+ YY+ + I G +L +P S + DS
Sbjct: 202 VFSSLIFGDDMMSTIHDLQFTPLVSNPLNPSVYYVQIVRICFGGETLL-IPDSAWKIDSV 260
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTF 297
GNGG + DSGTT T+ Y+++++ + ++ YPRA + G LC V +
Sbjct: 261 GNGGTIFDSGTTVTYWSPQAYARIIAAFEKSVP-YPRAPPSPQ--GLPLCVNV----SGI 313
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCL-LFQSMDDGDYGPSGVFGS 356
++PS T F + QGN+F +S + CL + +S DG V G+
Sbjct: 314 DHPIYPSFTIEFDQGATYRPNQGNYFIEVSP-----NIDCLAMLESSSDG----FNVIGN 364
Query: 357 FQQQNVEVVYDLEKERIGFQPMDC 380
QQN V YD E+ RIGF +C
Sbjct: 365 IIQQNYLVQYDREEHRIGFAHANC 388
>gi|218192703|gb|EEC75130.1| hypothetical protein OsI_11317 [Oryza sativa Indica Group]
Length = 440
Score = 136 bits (343), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 116/385 (30%), Positives = 179/385 (46%), Gaps = 57/385 (14%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+Q+ +DTGSDL W C C C N+ + + SRSS+ + +C S+ C
Sbjct: 104 VQLTLDTGSDLVWTQCQ----PCAVC----FNQSLPYYDASRSSTFALPSCDSTQC---- 151
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLK-VHGSSPGI 120
DP +++ C T+ TC +F+Y+YG+ G L +T+ V G+S
Sbjct: 152 ----KLDP-SVTMCVNQTV--QTC-----AFSYSYGDKSATIGFLDVETVSFVAGAS--- 196
Query: 121 IREIPKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDP 176
+P FGC G GIAGFGRG LS+PSQL FSHCF A P
Sbjct: 197 ---VPGVVFGCGLNNTGIFRSNETGIAGFGRGPLSLPSQLKV--GNFSHCFTAVS-GRKP 250
Query: 177 NISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
+ + D+ + + +Q TP++K+P +P +YY+ L+ IT+G++ L VP S +
Sbjct: 251 STVLFDLPADLYKNGRGTVQTTPLIKNPAHPTFYYLSLKGITVGSTRL-PVPESAFALKN 309
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEV-EERTGFDLCYRVPCPNN 295
G GG ++DSGT +T LP Y ++ + + V TG LC+ P
Sbjct: 310 -GTGGTIIDSGTAFTSLPPRVY----RLVHDEFAAHVKLPVVPSNETGPLLCFSAPPLGK 364
Query: 296 TFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFG 355
P + HF ++ LP+ N+ + N S ++ M + G
Sbjct: 365 A---PHVPKLVLHF-EGATMHLPRENYVFEAKDGGNCSICLAIIEGEMT--------IIG 412
Query: 356 SFQQQNVEVVYDLEKERIGFQPMDC 380
+FQQQN+ V+YDL+ ++ F C
Sbjct: 413 NFQQQNMHVLYDLKNSKLSFVRAKC 437
>gi|242050432|ref|XP_002462960.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
gi|241926337|gb|EER99481.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
Length = 445
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 113/390 (28%), Positives = 176/390 (45%), Gaps = 55/390 (14%)
Query: 3 QVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSF--CLNI 60
Q DTGSDL W C S C + ++PS S++ + C SS C
Sbjct: 100 QAIADTGSDLIWTQCAPCSSQCF-------QQPTPLYNPSSSTTFAVLPCNSSLSMCAAA 152
Query: 61 HSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
+ P CT C + TYG G + +T S+P
Sbjct: 153 LAGTTPPPGCT--------------CM----YNMTYGSG-WTSVYQGSETFTFGSSTPAN 193
Query: 121 IREIPKFCFGCV----GSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDP 176
+P FGC G G+ G GRG+LS+ SQLG + FS+C ++ D
Sbjct: 194 QTGVPGIAFGCSNASGGFNTSSASGLVGLGRGSLSLVSQLGVPK--FSYCLTPYQ---DT 248
Query: 177 NISSPLVIGDVA-ISSKDNLQFTPMLKSPM---YPNYYYIGLEAITIGNSSLTEVPLSLR 232
N +S L++G A ++ + TP + SP YYY+ L I++G ++L+ +P +
Sbjct: 249 NSTSTLLLGPSASLNDTGGVSSTPFVASPSDAPMSTYYYLNLTGISLGTTALS-IPTTAL 307
Query: 233 EFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPC 292
+ G GG ++DSGTT T L Y Q+ + + S +T P TG DLC+ +P
Sbjct: 308 SLKADGTGGFIIDSGTTITLLGNTAYQQVRAAVVSLVTL-PTTDGGSAATGLDLCFELP- 365
Query: 293 PNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSG 352
++T PS+T HF + +VLP ++ S + CL Q+ DG
Sbjct: 366 -SSTSAPPTMPSMTLHF-DGADMVLPADSYMML------DSNLWCLAMQNQTDGGVS--- 414
Query: 353 VFGSFQQQNVEVVYDLEKERIGFQPMDCAS 382
+ G++QQQN+ ++YD+ +E + F P C++
Sbjct: 415 ILGNYQQQNMHILYDVGQETLTFAPAKCST 444
>gi|413952718|gb|AFW85367.1| hypothetical protein ZEAMMB73_231535 [Zea mays]
Length = 443
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 114/387 (29%), Positives = 173/387 (44%), Gaps = 55/387 (14%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + +DTGSDL W C C DC D + + P+ SS+ + C ++ C +
Sbjct: 97 VALTLDTGSDLVWTQCA----PCRDCFD----QDLPVLDPAASSTYAALPCGAARCRAL- 147
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSS-PGI 120
PF + C + TL C +AY YG+ L G + D S G
Sbjct: 148 ----PF-----TSCGVRTLGNHRSC----IYAYHYGDKSLTVGEIATDRFTFGDSGGSGE 194
Query: 121 IREIPKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDP 176
+ FGC G GIAGFGRG S+PSQL FS+CF + +
Sbjct: 195 SLHTRRLTFGCGHLNKGVFQSNETGIAGFGRGRWSLPSQLNV--TSFSYCFTSMFESKSS 252
Query: 177 NIS---SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLRE 233
++ SP + A S + ++ TP+LK+P P+ Y++ L+ I++G T +P+ +
Sbjct: 253 LVTLGGSPAALYSHAHSGE--VRTTPILKNPSQPSLYFLSLKGISVGK---TRLPVPETK 307
Query: 234 FDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCP 293
F S ++DSG + T LPE Y + + + + P E + DLC+ +P
Sbjct: 308 FRST-----IIDSGASITTLPEEVYEAVKAEFAAQVGLPPSGVE---GSALDLCFALPV- 358
Query: 294 NNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGV 353
+ PS+T H L LP+ N+ + + V C++ D G V
Sbjct: 359 TALWRRPAVPSLTLH-LEGADWELPRSNYVFE----DLGARVMCIVL----DAAPGEQTV 409
Query: 354 FGSFQQQNVEVVYDLEKERIGFQPMDC 380
G+FQQQN VVYDLE +R+ F P C
Sbjct: 410 IGNFQQQNTHVVYDLENDRLSFAPARC 436
>gi|147866226|emb|CAN79938.1| hypothetical protein VITISV_027777 [Vitis vinifera]
Length = 454
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 119/395 (30%), Positives = 174/395 (44%), Gaps = 58/395 (14%)
Query: 1 VIQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNI 60
+ + MDTGSDL W PC + + C +C +N + F P SSSS C + C I
Sbjct: 102 TLPLIMDTGSDLVWFPCTH-RYVCRNCSFSTSNPSSNIFIPKSSSSSKVLGCVNPKCGWI 160
Query: 61 HSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTL-KVHGSSPG 119
H S S C C + CP + R L +H S
Sbjct: 161 HGSK------VQSRCRDCEPTSPNCTQICPPYLNFLRFWDHRRSQFHRRMLCPLHQS--- 211
Query: 120 IIREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNIS 179
T RE I+GFGRG S+PSQLG K FS+C L+ +Y +D S
Sbjct: 212 ---------------TRRE---ISGFGRGPPSLPSQLGL--KKFSYCLLSRRY-DDTTES 250
Query: 180 SPLVIGDVAISSKDN--LQFTPMLKSP------MYPNYYYIGLEAITIGNSSLTEVPLSL 231
S LV+ + S + L +TP +++P + YYY+GL IT+G + ++P
Sbjct: 251 SSLVLDGESDSGEKTAGLSYTPFVQNPKVAGKHAFSVYYYLGLRHITVGGKHV-KIPYKY 309
Query: 232 REFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVP 291
+ G+GG ++DSGTT+T++ + + + + + RA EVE TG C+ +
Sbjct: 310 LIPGADGDGGTIIDSGTTFTYMKGEIFELVAAEFEKQVQSK-RATEVEGITGLRPCFNI- 367
Query: 292 CPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYG-- 349
+ FP +T F + LP N+ + V CL + DG G
Sbjct: 368 ---SGLNTPSFPELTLKFRGGAEMELPLANYVAFLGG----DDVVCLTI--VTDGAAGKE 418
Query: 350 ----PSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
P+ + G+FQQQN V YDL ER+GF+ C
Sbjct: 419 FSGGPAIILGNFQQQNFYVEYDLRNERLGFRQQSC 453
>gi|125543634|gb|EAY89773.1| hypothetical protein OsI_11315 [Oryza sativa Indica Group]
Length = 474
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 118/394 (29%), Positives = 178/394 (45%), Gaps = 61/394 (15%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+Q+ +DTGSDLTW C C+ C +R + + F+PSRS + S C C ++
Sbjct: 124 VQLILDTGSDLTWTQCA----PCVSC--FRQS--LPRFNPSRSMTFSVLPCDLRICRDL- 174
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
T S C + C +AY Y + + TG L DT + I
Sbjct: 175 ---------TWSSCGEQSWGNGICV-----YAYAYADHSITTGHLDSDTFSFASADHAIG 220
Query: 122 -REIPKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDP 176
+P FGC G GIAGF RGALS+P+QL FS+CF A ++P
Sbjct: 221 GASVPDLTFGCGLFNNGIFVSNETGIAGFSRGALSMPAQLKV--DNFSYCFTAIT-GSEP 277
Query: 177 NISSPLVIG-------DVAISSKDNLQFTPMLK-SPMYPNYYYIGLEAITIGNSSLTEVP 228
SP+ +G D A +Q T +++ YYI L+ +T+G + L +P
Sbjct: 278 ---SPVFLGVPPNLYSDAAGGGHGVVQSTALIRYHSSQLKAYYISLKGVTVGTTRL-PIP 333
Query: 229 LSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLS--ILQSTITYYPRAKEVEERTGFDL 286
S+ G GG +VDSGT T LPE Y+ + + Q+ +T + + + L
Sbjct: 334 ESVFALKEDGTGGTIVDSGTGMTMLPEAVYNLVCDAFVAQTKLTVHNSTSSLSQ-----L 388
Query: 287 CYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDG 346
C+ VP P P++ HF +L LP+ N+ + + + + CL + +D
Sbjct: 389 CFSVP-PG---AKPDVPALVLHF-EGATLDLPRENYMFEIE-EAGGIRLTCLAINAGED- 441
Query: 347 DYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
V G+FQQQN+ V+YDL + + F P C
Sbjct: 442 ----LSVIGNFQQQNMHVLYDLANDMLSFVPARC 471
>gi|115452683|ref|NP_001049942.1| Os03g0317300 [Oryza sativa Japonica Group]
gi|108707833|gb|ABF95628.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548413|dbj|BAF11856.1| Os03g0317300 [Oryza sativa Japonica Group]
Length = 448
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 118/394 (29%), Positives = 178/394 (45%), Gaps = 61/394 (15%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+Q+ +DTGSDLTW C C+ C +R + + F+PSRS + S C C ++
Sbjct: 98 VQLILDTGSDLTWTQCA----PCVSC--FRQS--LPRFNPSRSMTFSVLPCDLRICRDL- 148
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
T S C + C +AY Y + + TG L DT + I
Sbjct: 149 ---------TWSSCGEQSWGNGICV-----YAYAYADHSITTGHLDSDTFSFASADHAIG 194
Query: 122 -REIPKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDP 176
+P FGC G GIAGF RGALS+P+QL FS+CF A ++P
Sbjct: 195 GASVPDLTFGCGLFNNGIFVSNETGIAGFSRGALSMPAQLKV--DNFSYCFTAIT-GSEP 251
Query: 177 NISSPLVIG-------DVAISSKDNLQFTPMLK-SPMYPNYYYIGLEAITIGNSSLTEVP 228
SP+ +G D A +Q T +++ YYI L+ +T+G + L +P
Sbjct: 252 ---SPVFLGVPPNLYSDAAGGGHGVVQSTALIRYHSSQLKAYYISLKGVTVGTTRL-PIP 307
Query: 229 LSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLS--ILQSTITYYPRAKEVEERTGFDL 286
S+ G GG +VDSGT T LPE Y+ + + Q+ +T + + + L
Sbjct: 308 ESVFALKEDGTGGTIVDSGTGMTMLPEAVYNLVCDAFVAQTKLTVHNSTSSLSQ-----L 362
Query: 287 CYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDG 346
C+ VP P P++ HF +L LP+ N+ + + + + CL + +D
Sbjct: 363 CFSVP-PG---AKPDVPALVLHF-EGATLDLPRENYMFEIEE-AGGIRLTCLAINAGED- 415
Query: 347 DYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
V G+FQQQN+ V+YDL + + F P C
Sbjct: 416 ----LSVIGNFQQQNMHVLYDLANDMLSFVPARC 445
>gi|125586057|gb|EAZ26721.1| hypothetical protein OsJ_10629 [Oryza sativa Japonica Group]
Length = 474
Score = 135 bits (340), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 118/394 (29%), Positives = 178/394 (45%), Gaps = 61/394 (15%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+Q+ +DTGSDLTW C C+ C +R + + F+PSRS + S C C ++
Sbjct: 124 VQLILDTGSDLTWTQCA----PCVSC--FRQS--LPRFNPSRSMTFSVLPCDLRICRDL- 174
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
T S C + C +AY Y + + TG L DT + I
Sbjct: 175 ---------TWSSCGEQSWGNGICV-----YAYAYADHSITTGHLDSDTFSFASADHAIG 220
Query: 122 -REIPKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDP 176
+P FGC G GIAGF RGALS+P+QL FS+CF A ++P
Sbjct: 221 GASVPDLTFGCGLFNNGIFVSNETGIAGFSRGALSMPAQLKV--DNFSYCFTAIT-GSEP 277
Query: 177 NISSPLVIG-------DVAISSKDNLQFTPMLK-SPMYPNYYYIGLEAITIGNSSLTEVP 228
SP+ +G D A +Q T +++ YYI L+ +T+G + L +P
Sbjct: 278 ---SPVFLGVPPNLYSDAAGGGHGVVQSTALIRYHSSQLKAYYISLKGVTVGTTRL-PIP 333
Query: 229 LSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLS--ILQSTITYYPRAKEVEERTGFDL 286
S+ G GG +VDSGT T LPE Y+ + + Q+ +T + + + L
Sbjct: 334 ESVFALKEDGTGGTIVDSGTGMTMLPEAVYNLVCDAFVAQTKLTVHNSTSSLSQ-----L 388
Query: 287 CYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDG 346
C+ VP P P++ HF +L LP+ N+ + + + + CL + +D
Sbjct: 389 CFSVP-PG---AKPDVPALVLHF-EGATLDLPRENYMFEIE-EAGGIRLTCLAINAGED- 441
Query: 347 DYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
V G+FQQQN+ V+YDL + + F P C
Sbjct: 442 ----LSVIGNFQQQNMHVLYDLANDMLSFVPARC 471
>gi|224126751|ref|XP_002329464.1| predicted protein [Populus trichocarpa]
gi|222870144|gb|EEF07275.1| predicted protein [Populus trichocarpa]
Length = 439
Score = 135 bits (339), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 121/383 (31%), Positives = 181/383 (47%), Gaps = 68/383 (17%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
MDTGSDL W C C C D + F P +SSS S+ +C+S C
Sbjct: 114 MDTGSDLIWTQCK----PCTQCFD----QPTPIFDPKKSSSFSKLSCSSKLC-------- 157
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
L +STC C + Y YG+ G+L +TL S +P
Sbjct: 158 ------------EALPQSTCSDGC-EYLYGYGDYSSTQGMLASETLTFGKVS------VP 198
Query: 126 KFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSP 181
+ FGC GS + + G+ G GRG LS+ SQL + FS+C + D +S
Sbjct: 199 EVAFGCGEDNEGSGFSQGSGLVGLGRGPLSLVSQLK--EPKFSYCLTSV----DDTKAST 252
Query: 182 LVIGDVA--ISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ-- 237
L++G +A +S ++ TP++++ P++YY+ LE I++G++SL P+ F Q
Sbjct: 253 LLMGSLASVKASDSEIKTTPLIQNSAQPSFYYLSLEGISVGDTSL---PIKKSTFSLQED 309
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTF 297
G+GGL++DSGTT T+L + + + S I + TG ++C+ +P +
Sbjct: 310 GSGGLIIDSGTTITYLEQSAFDLVAKEFTSQINL---PVDNSGSTGLEVCFTLPSGS--- 363
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSF 357
TD P + FHF + L LP N+ A S V CL G +FG+
Sbjct: 364 TDIEVPKLVFHF-DGADLELPAENYMIA----DASMGVACLAM-----GSSSGMSIFGNI 413
Query: 358 QQQNVEVVYDLEKERIGFQPMDC 380
QQQN+ V++DLEKE + F P C
Sbjct: 414 QQQNMLVLHDLEKETLSFLPTQC 436
>gi|302790323|ref|XP_002976929.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
gi|300155407|gb|EFJ22039.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
Length = 373
Score = 134 bits (338), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 114/391 (29%), Positives = 176/391 (45%), Gaps = 44/391 (11%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + +DT S+LTWV C +C + + F+P SSS + C SS CL
Sbjct: 12 VLLLVDTASELTWVQ----GTSCTNCSPTK----VPPFNPGLSSSFISEPCTSSVCLG-- 61
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
S F S C+ ST +C SF Y +G G++ R+ + S G
Sbjct: 62 RSKLGFQ----SACNRST---GSC-----SFQVAYLDGSEAYGVIAREIFSLQ-SWDGAA 108
Query: 122 REIPKFCFGCVGSTYREPI----GIAGFGRGALSVPSQLGFLQKG-----FSHCFLAFKY 172
+ FGC + P+ G G RG+ S P+Q+G K FS+CF
Sbjct: 109 STLGDVIFGCASKDLQRPVDFSSGTLGLNRGSFSFPAQIGSRSKSGLSDRFSYCFP--NR 166
Query: 173 ANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYP---NYYYIGLEAITIGNSSLTEVPL 229
A N S ++ GD I + + Q+ + + P ++YY+GL+ I++G L +P
Sbjct: 167 AEHLNSSGVIIFGDSGIPAH-HFQYLSLEQEPPIASIVDFYYVGLQGISVGGE-LLHIPR 224
Query: 230 SLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYR 289
S + D GNGG DSGTT + L EP ++ L+ + + R + +LCY
Sbjct: 225 SAFKIDRLGNGGTYFDSGTTVSFLVEPAHTALVEAFGRRVLHLNRTSGSDFTK--ELCYD 282
Query: 290 VPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYG 349
V + P +T HF NNV + L + + + + A + CL F + G
Sbjct: 283 VAAGDARLPTA--PLVTLHFKNNVDMELREASVWVPL-ARTPQVVTICLAFVNAGAVAQG 339
Query: 350 PSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
V G++QQQ+ + +DLE+ RIGF P +C
Sbjct: 340 GVNVIGNYQQQDYLIEHDLERSRIGFAPANC 370
>gi|242041115|ref|XP_002467952.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
gi|241921806|gb|EER94950.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
Length = 774
Score = 134 bits (338), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 112/385 (29%), Positives = 166/385 (43%), Gaps = 47/385 (12%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+Q+ +DTGSDL W C C C ++ + PS SS+ C+S C N+
Sbjct: 428 VQLILDTGSDLVWTQCR----PCPVC----FSRALGPLDPSNSSTFDVLPCSSPVCDNL- 478
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
T S C TC + Y Y +G + TG L +T +
Sbjct: 479 ---------TWSSCGKHNWGNQTCV-----YVYAYADGSITTGHLDAETFTFAAADGTGQ 524
Query: 122 REIPKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPN 177
+P FGC G GIAGFGRGALS+PSQL FSHCF A ++P+
Sbjct: 525 ATVPDLAFGCGLFNNGIFTSNETGIAGFGRGALSLPSQLKV--DNFSHCFTAIT-GSEPS 581
Query: 178 ISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
+ ++ + +Q TP++++ YY+ L+ IT+G++ L +P S
Sbjct: 582 SVLLGLPANLYSDADGAVQSTPLVQNFSSLRAYYLSLKGITVGSTRL-PIPESTFALKQD 640
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPR--AKEVEERTGFDLCYRVPCPNN 295
G GG ++DSGT T LP+ Y ++ T R + LC+ P
Sbjct: 641 GTGGTIIDSGTGMTTLPQDAYK----LVHDAFTAQVRLPVDNATSSSLSRLCFSFSVPRR 696
Query: 296 TFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFG 355
D P + HF +L LP+ N+ + S V CL + DD + G
Sbjct: 697 AKPD--VPKLVLHF-EGATLDLPRENYMFEFEDAGGS--VTCLAINAGDD-----LTIIG 746
Query: 356 SFQQQNVEVVYDLEKERIGFQPMDC 380
++QQQN+ V+YDL + + F P C
Sbjct: 747 NYQQQNLHVLYDLVRNMLSFVPAQC 771
>gi|125558622|gb|EAZ04158.1| hypothetical protein OsI_26300 [Oryza sativa Indica Group]
Length = 435
Score = 134 bits (337), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 119/387 (30%), Positives = 170/387 (43%), Gaps = 63/387 (16%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V DTGSDL W C C C + F P+ SS+ S+ C SSFC +
Sbjct: 101 VVADTGSDLIWTQCA----PCTKC----FQQPAPPFQPASSSTFSKLPCTSSFCQFL--- 149
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
N C +GC + Y YG G G L +TLKV +S
Sbjct: 150 PNSIRTCNATGCV---------------YNYKYGSG-YTAGYLATETLKVGDAS------ 187
Query: 124 IPKFCFGC-----VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNI 178
P FGC VG++ GIAG GRGALS+ QLG + FS+C + A
Sbjct: 188 FPSVAFGCSTENGVGNSTS---GIAGLGRGALSLIPQLGVGR--FSYCLRSGSAAG---- 238
Query: 179 SSPLVIGDVAISSKDNLQFTPMLKSP-MYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
+SP++ G +A + N+Q TP + +P ++P+YYY+ L IT+G T++P++ F
Sbjct: 239 ASPILFGSLANLTDGNVQSTPFVNNPAVHPSYYYVNLTGITVGE---TDLPVTTSTFGFT 295
Query: 238 GNG---GLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPN 294
NG G +VDSGTT T+L + Y + Q+ ++ V G DLC++
Sbjct: 296 QNGLGGGTIVDSGTTLTYLAKDGYEM---VKQAFLSQTANVTTVNGTRGLDLCFKSTGGG 352
Query: 295 NTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVF 354
PS+ F +P +F + S S L GD P V
Sbjct: 353 GGIA---VPSLVLRFDGGAEYAVP--TYFAGVETDSQGSVTVACLMMLPAKGDQ-PMSVI 406
Query: 355 GSFQQQNVEVVYDLEKERIGFQPMDCA 381
G+ Q ++ ++YDL+ F P DCA
Sbjct: 407 GNVMQMDMHLLYDLDGGIFSFSPADCA 433
>gi|302821814|ref|XP_002992568.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
gi|300139637|gb|EFJ06374.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
Length = 368
Score = 134 bits (337), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 116/393 (29%), Positives = 181/393 (46%), Gaps = 50/393 (12%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ +DTGS+ V CG+ S D P+ S S + C S CL +
Sbjct: 12 LSAIIDTGSEAVLVQCGSRSRPVFD--------------PAASQSYRQVPCISQLCLAVQ 57
Query: 62 --SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGS-SP 118
+S+ PC S S C +++ +YG+ TG ++D + ++ + S
Sbjct: 58 QQTSNGSSQPCVNS---------SAAC----TYSLSYGDSRNSTGDFSQDVIFLNSTNSS 104
Query: 119 GIIREIPKFCFGCVGS-----TYREPIGIAGFGRGALSVPSQLGFLQKG--FSHCFLAFK 171
+ FGC S +GI GF RG LS+PSQL G FS+CF +
Sbjct: 105 SQAVQFRDVAFGCAHSPQGFLVDLGSLGIVGFNRGNLSLPSQLKDRLGGSKFSYCFPSQP 164
Query: 172 YANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYP---NYYYIGLEAITIGNSSLTEVP 228
+ P + + +GD +S K + +TP+L +P+ P YY+GL +I++ +L +P
Sbjct: 165 W--QPRATGVIFLGDSGLS-KSKVSYTPLLDNPVTPARSQLYYVGLTSISVDGKTLA-IP 220
Query: 229 LSLREFD-SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLC 287
S + D S G+GG ++DSGTT+T + + Y+ + ++ R K+V GFD C
Sbjct: 221 ESAFKLDPSTGDGGTVLDSGTTFTRVVDDAYTAFRNAFAASNRSGLR-KKVGAAAGFDDC 279
Query: 288 YRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGD 347
Y + ++ P + NNV L L + F +SA N V CL S
Sbjct: 280 YNISAGSSL---PGVPEVRLSLQNNVRLELRFEHLFVPVSAAGNEVTV-CLAILSSQKSG 335
Query: 348 YGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
+G V G++QQ N V YD E+ R+GF+ DC
Sbjct: 336 FGKINVLGNYQQSNYLVEYDNERSRVGFERADC 368
>gi|116787398|gb|ABK24493.1| unknown [Picea sitchensis]
Length = 479
Score = 134 bits (336), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 114/384 (29%), Positives = 173/384 (45%), Gaps = 62/384 (16%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DTGSD+TW+ C C DC +++ F P +SSS +C SS C + +
Sbjct: 153 LIIDTGSDVTWIQCK----PCSDC----YSQVDPIFEPQQSSSYKHLSCLSSACTELTTM 204
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
++ C + GC + YG+G G +++TL + S
Sbjct: 205 NH----CRLGGCV---------------YEINYGDGSRSQGDFSQETLTLGSDS------ 239
Query: 124 IPKFCFGCVGST----YREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNI 178
P F FGC G T ++ G+ G GR ALS PSQ G FS+C F +
Sbjct: 240 FPSFAFGC-GHTNTGLFKGSAGLLGLGRTALSFPSQTKSKYGGQFSYCLPDFVSSTS--- 295
Query: 179 SSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQG 238
+ +G +I + F P++ + YP++Y++GL I++G L+ P L G
Sbjct: 296 TGSFSVGQGSIPATAT--FVPLVSNSNYPSFYFVGLNGISVGGERLSIPPAVL------G 347
Query: 239 NGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFT 298
GG +VDSGT T L Y L + +S P AK D CY + ++++
Sbjct: 348 RGGTIVDSGTVITRLVPQAYDALKTSFRSKTRNLPSAKPFSI---LDTCYDL----SSYS 400
Query: 299 DDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQ 358
P+ITFHF NN + + + + S+ S V CL F S + + G+FQ
Sbjct: 401 QVRIPTITFHFQNNADVAVSAVGILFTIQ--SDGSQV-CLAFASASQSI--STNIIGNFQ 455
Query: 359 QQNVEVVYDLEKERIGFQPMDCAS 382
QQ + V +D RIGF P CA+
Sbjct: 456 QQRMRVAFDTGAGRIGFAPGSCAT 479
>gi|302758750|ref|XP_002962798.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
gi|300169659|gb|EFJ36261.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
Length = 427
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 113/396 (28%), Positives = 179/396 (45%), Gaps = 62/396 (15%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DTGSDLTW+ C +P ++++S A + + SS
Sbjct: 74 LIVDTGSDLTWIQC----------------------NPPNTTANSSSPPAPWYDKSSSSS 111
Query: 64 DNPFDPCTMSGCS-LSTLLKSTCCRPCPS---FAYTYGEGGLVTGILTRDTL-----KVH 114
PCT C L + S+C PS + Y Y + TGIL +T+ K
Sbjct: 112 YREI-PCTDDECQFLPAPIGSSCSITSPSPCDYTYGYSDQSRTTGILAYETISMKSRKRS 170
Query: 115 GSSPGIIR----EIPKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKG--FS 164
G G + I GC VG+++ G+ G G+G +S+ +Q G FS
Sbjct: 171 GKRAGNHKTRRIRIKNVALGCSRESVGASFLGASGVLGLGQGPISLATQTRHTALGGIFS 230
Query: 165 HCFLAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSL 224
+C + Y N SS LV+G + L TP++++P ++YY+ + + + +
Sbjct: 231 YCLV--DYLRGSNASSFLVMGR---THWRKLAHTPIVRNPAAQSFYYVNVTGVAVDGKPV 285
Query: 225 TEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGF 284
+ S D GN G + DSGTT ++L EP YS++L L ++I Y PRA+E+ E GF
Sbjct: 286 DGIASSDWGIDGDGNKGTIFDSGTTLSYLREPAYSKVLGALNASI-YLPRAQEIPE--GF 342
Query: 285 DLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMD 344
+LCY V T + P + F + LP N+ + + V+C+ Q +
Sbjct: 343 ELCYNV-----TRMEKGMPKLGVEFQGGAVMELPWNNYMVLV-----AENVQCVALQKVT 392
Query: 345 DGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
+ S + G+ QQ+ + YDL K RIGF+ C
Sbjct: 393 TTN--GSNILGNLLQQDHHIEYDLAKARIGFKWSPC 426
>gi|115472515|ref|NP_001059856.1| Os07g0532800 [Oryza sativa Japonica Group]
gi|50508274|dbj|BAD32123.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113611392|dbj|BAF21770.1| Os07g0532800 [Oryza sativa Japonica Group]
Length = 436
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 119/387 (30%), Positives = 170/387 (43%), Gaps = 62/387 (16%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V DTGSDL W C C C + F P+ SS+ S+ C SSFC +
Sbjct: 101 VVADTGSDLIWTQCA----PCTKC----FQQPAPPFQPASSSTFSKLPCTSSFCQFL--- 149
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
N C +GC + Y YG G G L +TLKV +S
Sbjct: 150 PNSIRTCNATGCV---------------YNYKYGSG-YTAGYLATETLKVGDAS------ 187
Query: 124 IPKFCFGC-----VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNI 178
P FGC VG++ GIAG GRGALS+ QLG + FS+C + A
Sbjct: 188 FPSVAFGCSTENGVGNSTS---GIAGLGRGALSLIPQLGVGR--FSYCLRSGSAAG---- 238
Query: 179 SSPLVIGDVAISSKDNLQFTPMLKSP-MYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
+SP++ G +A + N+Q TP + +P ++P+YYY+ L IT+G T++P++ F
Sbjct: 239 ASPILFGSLANLTDGNVQSTPFVNNPAVHPSYYYVNLTGITVGE---TDLPVTTSTFGFT 295
Query: 238 GNG---GLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPN 294
NG G +VDSGTT T+L + Y + Q+ ++ V G DLC++
Sbjct: 296 QNGLGGGTIVDSGTTLTYLAKDGYEM---VKQAFLSQTADVTTVNGTRGLDLCFK--STG 350
Query: 295 NTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVF 354
PS+ F +P +F + S S L GD P V
Sbjct: 351 GGGGGIAVPSLVLRFDGGAEYAVP--TYFAGVETDSQGSVTVACLMMLPAKGDQ-PMSVI 407
Query: 355 GSFQQQNVEVVYDLEKERIGFQPMDCA 381
G+ Q ++ ++YDL+ F P DCA
Sbjct: 408 GNVMQMDMHLLYDLDGGIFSFAPADCA 434
>gi|115452685|ref|NP_001049943.1| Os03g0318400 [Oryza sativa Japonica Group]
gi|108707841|gb|ABF95636.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548414|dbj|BAF11857.1| Os03g0318400 [Oryza sativa Japonica Group]
Length = 434
Score = 133 bits (335), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 120/399 (30%), Positives = 177/399 (44%), Gaps = 82/399 (20%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFC--LN 59
+Q+ +DTGSDL W C C C D + + F PS SS+ S +C S+ C L
Sbjct: 95 VQLTLDTGSDLIWTQCQ----PCPACFD----QALPYFDPSTSSTLSLTSCDSTLCQGLP 146
Query: 60 IHSSDNP-FDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSP 118
+ S +P F P TC + Y+YG+ + TG L D G+
Sbjct: 147 VASCGSPKFWP------------NQTCV-----YTYSYGDKSVTTGFLEVDKFTFVGAG- 188
Query: 119 GIIREIPKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYAN 174
+P FGC G GIAGFGRG LS+PSQL FSHCF A
Sbjct: 189 ---ASVPGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKV--GNFSHCFTAVN-GL 242
Query: 175 DPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREF 234
P+ + D+ S + +Q TP++++P P +YY+ L+ IT+G+ T +P+ EF
Sbjct: 243 KPSTVLLDLPADLYKSGRGAVQSTPLIQNPANPTFYYLSLKGITVGS---TRLPVPESEF 299
Query: 235 D-SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCP 293
G GG ++DSGT T LP Y + R F ++P
Sbjct: 300 ALKNGTGGTIIDSGTAMTSLPTRVYRLV-------------------RDAFAAQVKLPVV 340
Query: 294 NNTFTDDLF------------PSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQ 341
+ TD F P + HF ++ LP+ N+ + + S++ CL
Sbjct: 341 SGNTTDPYFCLSAPLRAKPYVPKLVLHF-EGATMDLPRENYVFEVE--DAGSSILCL--- 394
Query: 342 SMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
++ +G G G+FQQQN+ V+YDL+ ++ F P C
Sbjct: 395 AIIEG--GEVTTIGNFQQQNMHVLYDLQNSKLSFVPAQC 431
>gi|165292434|dbj|BAF98915.1| aspartic proteinase nepenthesin I [Nepenthes alata]
Length = 437
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 115/382 (30%), Positives = 168/382 (43%), Gaps = 60/382 (15%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
MDTGSDL W C C C N+ F+P SSS S C+S C + S
Sbjct: 112 MDTGSDLIWTQCQ----PCTQCF----NQSTPIFNPQGSSSFSTLPCSSQLCQALQSP-- 161
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
TC + Y YG+G G + +TL S IP
Sbjct: 162 ------------------TCSNNSCQYTYGYGDGSETQGSMGTETLTFGSVS------IP 197
Query: 126 KFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSP 181
FGC G G+ G GRG LS+PSQL + FS+C +N SS
Sbjct: 198 NITFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDVTK--FSYCMTPIGSSN----SST 251
Query: 182 LVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGG 241
L++G +A S T +++S P +YYI L +++G++ L P + + G GG
Sbjct: 252 LLLGSLANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVGSTPLPIDPSVFKLNSNNGTGG 311
Query: 242 LLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDL 301
+++DSGTT T+ + Y ++ Q+ I+ + +GFDLC+++P +
Sbjct: 312 IIIDSGTTLTYFVDNAYQ---AVRQAFISQMNLSVVNGSSSGFDLCFQMPSDQSNLQ--- 365
Query: 302 FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQN 361
P+ HF + LVLP N+F +PSN + CL S G +FG+ QQQN
Sbjct: 366 IPTFVMHF-DGGDLVLPSENYFI---SPSN--GLICLAMGSSSQG----MSIFGNIQQQN 415
Query: 362 VEVVYDLEKERIGFQPMDCAST 383
+ VVYD + F C ++
Sbjct: 416 LLVVYDTGNSVVSFLSAQCGAS 437
>gi|449443786|ref|XP_004139658.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449475416|ref|XP_004154449.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 453
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 114/385 (29%), Positives = 170/385 (44%), Gaps = 61/385 (15%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + +DTGSD+ W+ C C C Y + + F+P +S S + C+S C +
Sbjct: 123 LYMVLDTGSDVVWLQCS----PCRKC--YSQSDPI--FNPYKSKSFAGIPCSSPLCRRLD 174
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
SS GCS + TC + +YG+G TG +TL G+
Sbjct: 175 SS----------GCSTR---RHTCL-----YQVSYGDGSFTTGDFATETLTFRGN----- 211
Query: 122 REIPKFCFGCVGSTYREPIGIAGFGRGAL-----SVPSQLGF-LQKGFSHCFLAFKYAND 175
+I K GC + E + + G L S PSQ G FS+C + ++
Sbjct: 212 -KIAKVALGC--GHHNEGLFVGAAGLLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASSK 268
Query: 176 PNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFD 235
P S +V GD AIS +FTP++++P +YY+GL I++G + V SL + D
Sbjct: 269 P---SSMVFGDAAISRLA--RFTPLIRNPKLDTFYYVGLIGISVGGVRVRGVSPSLFKLD 323
Query: 236 SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNN 295
S GNGG+++DSGT+ T L P Y+ L + + R E FD CY + ++
Sbjct: 324 SAGNGGVIIDSGTSVTRLTRPAYTALRDAFRVGARHLKRGPEFSL---FDTCYDLSGQSS 380
Query: 296 TFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFG 355
P++ HF + LP N+ P + + C F G + G
Sbjct: 381 VKV----PTVVLHF-RGADMALPATNYL----IPVDENGSFCFAFAGTISG----LSIIG 427
Query: 356 SFQQQNVEVVYDLEKERIGFQPMDC 380
+ QQQ VVYDL RIGF P C
Sbjct: 428 NIQQQGFRVVYDLAGSRIGFAPRGC 452
>gi|125543640|gb|EAY89779.1| hypothetical protein OsI_11321 [Oryza sativa Indica Group]
Length = 434
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 120/399 (30%), Positives = 177/399 (44%), Gaps = 82/399 (20%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFC--LN 59
+Q+ +DTGSDL W C C C D + + F PS SS+ S +C S+ C L
Sbjct: 95 VQLTLDTGSDLIWTQCQ----PCPACFD----QALPYFDPSTSSTLSLTSCDSTLCQGLP 146
Query: 60 IHSSDNP-FDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSP 118
+ S +P F P TC + Y+YG+ + TG L D G+
Sbjct: 147 VASCGSPKFWP------------NQTCV-----YTYSYGDKSVTTGFLEVDKFTFVGAG- 188
Query: 119 GIIREIPKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYAN 174
+P FGC G GIAGFGRG LS+PSQL FSHCF A
Sbjct: 189 ---ASVPGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKV--GNFSHCFTAVN-GL 242
Query: 175 DPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREF 234
P+ + D+ S + +Q TP++++P P +YY+ L+ IT+G+ T +P+ EF
Sbjct: 243 KPSTVLLDLPADLYKSGRGAVQSTPLIQNPANPTFYYLSLKGITVGS---TRLPVPESEF 299
Query: 235 D-SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCP 293
G GG ++DSGT T LP Y + R F ++P
Sbjct: 300 TLKNGTGGTIIDSGTAMTSLPTRVYRLV-------------------RDAFAAQVKLPVV 340
Query: 294 NNTFTDDLF------------PSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQ 341
+ TD F P + HF ++ LP+ N+ + + S++ CL
Sbjct: 341 SGNTTDPYFCLSAPLRAKPYVPKLVLHF-EGATMDLPRENYVFEVE--DAGSSILCL--- 394
Query: 342 SMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
++ +G G G+FQQQN+ V+YDL+ ++ F P C
Sbjct: 395 AIIEG--GEVTTIGNFQQQNMHVLYDLQNSKLSFVPAQC 431
>gi|302758122|ref|XP_002962484.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
gi|300169345|gb|EFJ35947.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
Length = 395
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 112/394 (28%), Positives = 177/394 (44%), Gaps = 62/394 (15%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGSDLTW+ C +P ++++S A + + SS
Sbjct: 44 IDTGSDLTWIQC----------------------NPPNTTANSSSPPAPWYDKSSSSSYR 81
Query: 66 PFDPCTMSGCS-LSTLLKSTCCRPCPS---FAYTYGEGGLVTGILTRDTL---------K 112
PCT C L + S+C PS + Y Y + TGIL +T+ K
Sbjct: 82 EI-PCTDDECLFLPAPIGSSCSIKSPSPCDYTYGYSDQSRTTGILAYETISMKSRKRSGK 140
Query: 113 VHGSSPGIIREIPKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKG--FSHC 166
G+ I GC VG+++ G+ G G+G +S+ +Q G FS+C
Sbjct: 141 RAGNHKTRTIRIKNVALGCSRESVGASFLGASGVLGLGQGPISLATQTRHTALGGIFSYC 200
Query: 167 FLAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTE 226
+ Y N SS LV+G + L TP++++P ++YY+ + + + +
Sbjct: 201 LV--DYLRGSNASSFLVMGR---TRWRKLAHTPIVRNPAAQSFYYVNVTGVAVDGKPVDG 255
Query: 227 VPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDL 286
+ S D GN G + DSGTT ++L EP YS++L L ++I Y PRA+E+ E GF+L
Sbjct: 256 IASSDWGIDGDGNKGTIFDSGTTLSYLREPAYSKVLGALNASI-YLPRAQEIPE--GFEL 312
Query: 287 CYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDG 346
CY V T + P + F + LP N+ + + V+C+ Q +
Sbjct: 313 CYNV-----TRMEKGMPKLGVEFQGGAVMELPWNNYMVLV-----AENVQCVALQKVTTT 362
Query: 347 DYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
+ S + G+ QQ+ + YDL K RIGF+ C
Sbjct: 363 N--GSNILGNLLQQDHHIEYDLAKARIGFKWSPC 394
>gi|168040957|ref|XP_001772959.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675692|gb|EDQ62184.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 351
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 114/380 (30%), Positives = 181/380 (47%), Gaps = 57/380 (15%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGSDL WV C C C + + F P SSS S +C S C +
Sbjct: 25 VDTGSDLCWVQCA----PCARCFEQPDPL----FIPLASSSYSNASCTDSLCDAL----- 71
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
P C+M ++TC +++Y+YG+G G +T+ ++GS+ +
Sbjct: 72 PRPTCSM---------RNTC-----TYSYSYGDGSNTRGDFAFETVTLNGST------LA 111
Query: 126 KFCFGC---VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCF-LAFKYANDPNISSP 181
+ FGC T+ G+ G G+G LS+PSQL F+H F + SP
Sbjct: 112 RIGFGCGHNQEGTFAGADGLIGLGQGPLSLPSQL---NSSFTHIFSYCLVDQSTTGTFSP 168
Query: 182 LVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGG 241
+ G+ A +S+ FTP+L++ P+YYY+G+E+I++GN + P + R D+ G GG
Sbjct: 169 ITFGNAAENSR--ASFTPLLQNEDNPSYYYVGVESISVGNRRVPTPPSAFR-IDANGVGG 225
Query: 242 LLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDL 301
+++DSGTT T+ + +L+ L+ I+ YP A G +LCY + + + +
Sbjct: 226 VILDSGTTITYWRLAAFIPILAELRRQIS-YPEADPTPY--GLNLCYDI--SSVSASSLT 280
Query: 302 FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQN 361
PS+T H L NV +P N + + N C + D + G+ QQQN
Sbjct: 281 LPSMTVH-LTNVDFEIPVSNLWVLV---DNFGETVCTAMSTSDQ-----FSIIGNVQQQN 331
Query: 362 VEVVYDLEKERIGFQPMDCA 381
+V D+ R+GF DC+
Sbjct: 332 NLIVTDVANSRVGFLATDCS 351
>gi|414877221|tpg|DAA54352.1| TPA: hypothetical protein ZEAMMB73_214154 [Zea mays]
Length = 447
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 119/385 (30%), Positives = 181/385 (47%), Gaps = 62/385 (16%)
Query: 7 DTGSDLTWVPCGNLSFDCMDCDD-YRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
DTGSDLTW C C D + + S+FSP CAS+ CL I SS N
Sbjct: 111 DTGSDLTWTQCQPCKL-CFPQDTPIYDTAVSSSFSPV--------PCASATCLPIWSSRN 161
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
CT S S+ CR + Y YG+G G+L +TL G+ PG+ +
Sbjct: 162 ----CTAS---------SSPCR----YRYAYGDGAYSAGVLGTETLTFPGA-PGV--SVG 201
Query: 126 KFCFGCV---GSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSPL 182
FGC G G G GRG+LS+ +QLG + FS+C F + ++ SP+
Sbjct: 202 GIAFGCGVDNGGLSYNSTGTVGLGRGSLSLVAQLGVGK--FSYCLTDFF---NTSLGSPV 256
Query: 183 VIGDVAI----SSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQG 238
+ G +A S+ +Q TP+++SP P +YY+ LE I++G++ L +P + G
Sbjct: 257 LFGALAELAAPSTGAAVQSTPLVQSPYVPTWYYVSLEGISLGDARL-PIPNGTFDLRDDG 315
Query: 239 NGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPC-PNNTF 297
+GG++VDSGTT+T L E + ++ + + + V + D PC P T
Sbjct: 316 SGGMIVDSGTTFTFLVESAFRVVVDHVAGVLR-----QPVVNASSLD----SPCFPAATG 366
Query: 298 TDDL--FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFG 355
L P + HF + L + N+ MS S+ CL D + G
Sbjct: 367 EQQLPAMPDMVLHFAGGADMRLHRDNY---MSFNQEESSF-CLNIAGSPSADV---SILG 419
Query: 356 SFQQQNVEVVYDLEKERIGFQPMDC 380
+FQQQN+++++D+ ++ F P DC
Sbjct: 420 NFQQQNIQMLFDITVGQLSFMPTDC 444
>gi|222624819|gb|EEE58951.1| hypothetical protein OsJ_10630 [Oryza sativa Japonica Group]
Length = 440
Score = 132 bits (331), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 114/385 (29%), Positives = 178/385 (46%), Gaps = 57/385 (14%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+Q+ +DTGS L W C C C N+ + + SRSS+ + +C S+ C
Sbjct: 104 VQLTLDTGSVLVWTQCQ----PCAVC----FNQSLPYYDASRSSTFALPSCDSTQC---- 151
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLK-VHGSSPGI 120
DP +++ C T+ TC +++Y+YG+ G L +T+ V G+S
Sbjct: 152 ----KLDP-SVTMCVNQTV--QTC-----AYSYSYGDKSATIGFLDVETVSFVAGAS--- 196
Query: 121 IREIPKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDP 176
+P FGC G GIAGFGRG LS+PSQL FSHCF A P
Sbjct: 197 ---VPGVVFGCGLNNTGIFRSNETGIAGFGRGPLSLPSQLKV--GNFSHCFTAVS-GRKP 250
Query: 177 NISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
+ + D+ + + +Q TP++K+P +P +YY+ L+ IT+G++ L VP S +
Sbjct: 251 STVLFDLPADLYKNGRGTVQTTPLIKNPAHPTFYYLSLKGITVGSTRL-PVPESAFALKN 309
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEV-EERTGFDLCYRVPCPNN 295
G GG ++DSGT +T LP Y ++ + + V TG LC+ P
Sbjct: 310 -GTGGTIIDSGTAFTSLPPRVY----RLVHDEFAAHVKLPVVPSNETGPLLCFSAPPLGK 364
Query: 296 TFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFG 355
P + HF ++ LP+ N+ + N S ++ M + G
Sbjct: 365 A---PHVPKLVLHF-EGATMHLPRENYVFEAKDGGNCSICLAIIEGEMT--------IIG 412
Query: 356 SFQQQNVEVVYDLEKERIGFQPMDC 380
+FQQQN+ V+YDL+ ++ F C
Sbjct: 413 NFQQQNMHVLYDLKNSKLSFVRAKC 437
>gi|242092368|ref|XP_002436674.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
gi|241914897|gb|EER88041.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
Length = 461
Score = 131 bits (330), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 110/396 (27%), Positives = 174/396 (43%), Gaps = 51/396 (12%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + +DTGSDL W C C DC ++ + P+ SS+ + C + C +
Sbjct: 105 VALTLDTGSDLVWTQCA----PCRDC----FHQGLPLLDPAASSTYAALPCGAPRCRAL- 155
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
PF C G S +C ++ Y YG+ + G + D G +
Sbjct: 156 ----PFTSCGGGGRSSWGNGNRSC-----AYIYHYGDKSVTVGEIATDRFTFGGDNGDGD 206
Query: 122 REIP--KFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYAND 175
+P + FGC G GIAGFGRG S+PSQL FS+CF + +
Sbjct: 207 SRLPTRRLTFGCGHFNKGVFQSNETGIAGFGRGRWSLPSQLNVTT--FSYCFTSMFESKS 264
Query: 176 PNIS-----SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLS 230
++ + ++ A ++ TP+LK+P P+ Y++ L+ I++G + L
Sbjct: 265 SLVTLGGAPAAALLYSHAAHISGEVRTTPLLKNPSQPSLYFLSLKGISVGKTRLAVPEAK 324
Query: 231 LREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRV 290
LR ++DSG + T LPE Y + + + + P V E + DLC+ +
Sbjct: 325 LRS--------TIIDSGASITTLPEAVYEAVKAEFAAQVGLPPTG--VVEGSALDLCFAL 374
Query: 291 PCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGP 350
P + PS+T H L+ LP+GN+ + A + V C++ D G
Sbjct: 375 PV-TALWRRPPVPSLTLH-LDGADWELPRGNYVFEDLA----ARVMCVVL----DAAPGD 424
Query: 351 SGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASA 386
V G+FQQQN VVYDLE + + F P C S ++
Sbjct: 425 QTVIGNFQQQNTHVVYDLENDWLSFAPARCDSLVAS 460
>gi|108707835|gb|ABF95630.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 384
Score = 131 bits (330), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 114/385 (29%), Positives = 178/385 (46%), Gaps = 57/385 (14%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+Q+ +DTGS L W C C C N+ + + SRSS+ + +C S+ C
Sbjct: 48 VQLTLDTGSVLVWTQCQ----PCAVC----FNQSLPYYDASRSSTFALPSCDSTQC---- 95
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLK-VHGSSPGI 120
DP +++ C T+ TC +++Y+YG+ G L +T+ V G+S
Sbjct: 96 ----KLDP-SVTMCVNQTV--QTC-----AYSYSYGDKSATIGFLDVETVSFVAGAS--- 140
Query: 121 IREIPKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDP 176
+P FGC G GIAGFGRG LS+PSQL FSHCF A P
Sbjct: 141 ---VPGVVFGCGLNNTGIFRSNETGIAGFGRGPLSLPSQLKV--GNFSHCFTAVS-GRKP 194
Query: 177 NISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
+ + D+ + + +Q TP++K+P +P +YY+ L+ IT+G++ L VP S +
Sbjct: 195 STVLFDLPADLYKNGRGTVQTTPLIKNPAHPTFYYLSLKGITVGSTRL-PVPESAFALKN 253
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEV-EERTGFDLCYRVPCPNN 295
G GG ++DSGT +T LP Y ++ + + V TG LC+ P
Sbjct: 254 -GTGGTIIDSGTAFTSLPPRVY----RLVHDEFAAHVKLPVVPSNETGPLLCFSAPPLGK 308
Query: 296 TFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFG 355
P + HF ++ LP+ N+ + N S ++ M + G
Sbjct: 309 A---PHVPKLVLHF-EGATMHLPRENYVFEAKDGGNCSICLAIIEGEMT--------IIG 356
Query: 356 SFQQQNVEVVYDLEKERIGFQPMDC 380
+FQQQN+ V+YDL+ ++ F C
Sbjct: 357 NFQQQNMHVLYDLKNSKLSFVRAKC 381
>gi|302776610|ref|XP_002971459.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
gi|300160591|gb|EFJ27208.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
Length = 357
Score = 131 bits (330), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 112/385 (29%), Positives = 173/385 (44%), Gaps = 64/385 (16%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFC--LNIH 61
+ MDTGSD+ W+ C C C Y+ N + F P SSS R +C++ C L++
Sbjct: 29 LVMDTGSDVPWIQCS----PCKSC--YKQNDAV--FDPRASSSFRRLSCSTPQCKLLDVK 80
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKV-HGSSPGI 120
+ + + C + +YG+G G L D+ V G + +
Sbjct: 81 ACASTDNRCL--------------------YQVSYGDGSFTVGDLASDSFSVSRGRTSPV 120
Query: 121 IREIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPN 177
+ FGC + G+ G G G LS PSQL + FS+C ++ N
Sbjct: 121 V-------FGCGHDNEGLFVGAAGLLGLGAGKLSFPSQLS--SRKFSYCLVSRD--NGVR 169
Query: 178 ISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
SS L+ GD A+ + + +T +LK+P +YY GL I+IG + L+ + + S
Sbjct: 170 ASSALLFGDSALPTSASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAFKLSSST 229
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTF 297
G GG+++DSGT+ T LP Y+ + +S PRA + FD CY +
Sbjct: 230 GRGGVIIDSGTSVTRLPTYAYTVMRDAFRSATQKLPRAADFSL---FDTCYDF----SAL 282
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQ--SMDDGDYGPSGVFG 355
T P+++FHF S+ LP N+ P ++S C F S+D + G
Sbjct: 283 TSVTIPTVSFHFEGGASVQLPPSNYL----VPVDTSGTFCFAFSKTSLD------LSIIG 332
Query: 356 SFQQQNVEVVYDLEKERIGFQPMDC 380
+ QQQ + V DL+ R+GF P C
Sbjct: 333 NIQQQTMRVAIDLDSSRVGFAPRQC 357
>gi|302802500|ref|XP_002983004.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
gi|300149157|gb|EFJ15813.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
Length = 332
Score = 131 bits (330), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 115/389 (29%), Positives = 172/389 (44%), Gaps = 86/389 (22%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ MDTGSDLTWV C S DC SS+ D AS+ + +
Sbjct: 18 LVMDTGSDLTWVRCDPCSPDC---------------------SSTFDRLASNTYKALTCA 56
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
D+ ++Y YG+G G L+ DTLK+ G++ + E
Sbjct: 57 DD--------------------------YSYGYGDGSFTQGDLSVDTLKMAGAASDELEE 90
Query: 124 IPKFCFGCVGSTYR----EPIGIAGFGRGALSVPSQLG-FLQKGFSHCFLAFKYANDPNI 178
P F FGC GS + +GI G+LS PSQ+G FS+C L + A +
Sbjct: 91 FPGFVFGC-GSLLKGLISGEVGILALSPGSLSFPSQIGEKYGNKFSYCLLR-QTAQNSLK 148
Query: 179 SSPLVIGDVAISSKD-------NLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSL 231
SP+V G+ A+ K+ LQ+TP+ +S + YY + L+ I++GN L LS
Sbjct: 149 KSPMVFGEAAVELKEPGSGKLQELQYTPIGESSI---YYTVRLDGISVGNQRLD---LSP 202
Query: 232 REFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVP 291
F + + + DSGTT T LP + L S ++ E G D C+RVP
Sbjct: 203 SAFLNGQDKPTIFDSGTTLTMLPPGVCDSIKQSLASMVS----GAEFVAIKGLDACFRVP 258
Query: 292 CPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPS 351
+ P ITFHF V N+ + +++CL+F ++
Sbjct: 259 PSSGQG----LPDITFHFNGGADFVTRPSNYVIDL------GSLQCLIFVPTNE-----V 303
Query: 352 GVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
+FG+ QQQ+ V++D++ RIGF+ DC
Sbjct: 304 SIFGNLQQQDFFVLHDMDNRRIGFKETDC 332
>gi|297740190|emb|CBI30372.3| unnamed protein product [Vitis vinifera]
Length = 445
Score = 131 bits (329), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 101/332 (30%), Positives = 152/332 (45%), Gaps = 43/332 (12%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCD-DYRNNKLMSNFSPSRSSSSSRDTCASSFCLNI 60
+ MDTGS L W PC + + C C + + F P SSS+ C + C +
Sbjct: 119 LSFVMDTGSSLVWFPCTS-RYVCTRCSFPNIDPAKIPTFIPKLSSSAKIVGCLNPKCGFV 177
Query: 61 HSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
S+N + C + CP++A YG G V +L +
Sbjct: 178 MDSEN----------------SANCTKACPTYAIQYGLGTTVGLLLLESLVFAE------ 215
Query: 121 IREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISS 180
R P F GC + R+P GIAGFGRG S+P Q+G K FS+C L+ ++ + P S
Sbjct: 216 -RTEPDFVVGCSILSSRQPSGIAGFGRGPSSLPKQMGL--KKFSYCLLSHRFDDSPKSSK 272
Query: 181 PLVIGDVAISSKDN----LQFTPMLKSPMYPN-----YYYIGLEAITIGNSSLTEVPLSL 231
+ V SKD+ L +TP K+P+ N YYY+ L I +G+ + +VP S
Sbjct: 273 MTLY--VGPDSKDDKTGGLSYTPFRKNPVSSNSAFKEYYYVTLRHIIVGDKRV-KVPYSF 329
Query: 232 REFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVP 291
S GNGG +VDSG+T+T + +P + + + + Y RA +VE +G C+ +
Sbjct: 330 MVAGSDGNGGTIVDSGSTFTFMEKPVFEAVATEFDRQMANYTRAADVEALSGLKPCFNLS 389
Query: 292 CPNNTFTDDLFPSITFHFLNNVSLVLPQGNHF 323
+ PS+ F F + LP N+F
Sbjct: 390 GVGSV----ALPSLVFQFKGGAKMELPVANYF 417
>gi|116790042|gb|ABK25480.1| unknown [Picea sitchensis]
Length = 460
Score = 130 bits (328), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 117/380 (30%), Positives = 183/380 (48%), Gaps = 59/380 (15%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGSDLTW C C DC Y + + PS+SS+ S+ C+SS C +
Sbjct: 132 LDTGSDLTWTQCK----PCTDC--YPQPTPI--YDPSQSSTYSKVPCSSSMCQAL----- 178
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
P C+ + C + Y+YG+ GIL+ ++ + S +P
Sbjct: 179 PMYSCSGANCE---------------YLYSYGDQSSTQGILSYESFTLTSQS------LP 217
Query: 126 KFCFGC----VGSTYREPIGIAGFGRGALSVPSQLG-FLQKGFSHCFLAFKYANDPNISS 180
FGC G + + G+ GFGRG LS+ SQLG L FS+C ++ + P+ +S
Sbjct: 218 HIAFGCGQENEGGGFSQGGGLVGFGRGPLSLISQLGQSLGNKFSYCLVSI--TDSPSKTS 275
Query: 181 PLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNG 240
PL IG A + + TP+++S P +YY+ LE I++G L ++ + G G
Sbjct: 276 PLFIGKTASLNAKTVSSTPLVQSRSRPTFYYLSLEGISVGG-QLLDIADGTFDLQLDGTG 334
Query: 241 GLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDD 300
G+++DSGTT T+L + Y + + S+I P+ G DLC+ P + +
Sbjct: 335 GVIIDSGTTVTYLEQSGYDVVKKAVISSIN-LPQVD--GSNIGLDLCFE---PQSGSSTS 388
Query: 301 LFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQ 360
FP+ITFHF LP+ N+ Y ++SS + CL + +FG+ QQQ
Sbjct: 389 HFPTITFHF-EGADFNLPKENYIY-----TDSSGIACLAMLPSNG-----MSIFGNIQQQ 437
Query: 361 NVEVVYDLEKERIGFQPMDC 380
N +++YD E+ + F P C
Sbjct: 438 NYQILYDNERNVLSFAPTVC 457
>gi|302765224|ref|XP_002966033.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
gi|300166847|gb|EFJ33453.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
Length = 357
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 112/385 (29%), Positives = 173/385 (44%), Gaps = 64/385 (16%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFC--LNIH 61
+ MDTGSD+ W+ C C C Y+ N + F P SSS R +C++ C L++
Sbjct: 29 LVMDTGSDVPWIQCS----PCKSC--YKQNDAV--FDPRASSSFRRLSCSTPQCKLLDVK 80
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKV-HGSSPGI 120
+ + + C + +YG+G G L D+ V G + +
Sbjct: 81 ACASTDNRCL--------------------YQVSYGDGSFTVGDLASDSFLVSRGRTSPV 120
Query: 121 IREIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPN 177
+ FGC + G+ G G G LS PSQL + FS+C ++ N
Sbjct: 121 V-------FGCGHDNEGLFVGAAGLLGLGAGKLSFPSQLS--SRKFSYCLVSRD--NGVR 169
Query: 178 ISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
SS L+ GD A+ + + +T +LK+P +YY GL I+IG + L+ + + S
Sbjct: 170 ASSALLFGDSALPTSASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAFKLSSST 229
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTF 297
G GG+++DSGT+ T LP Y+ + +S PRA + FD CY +
Sbjct: 230 GRGGVIIDSGTSVTRLPTYAYTVMRDAFRSATQKLPRAADFSL---FDTCYDF----SAL 282
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQ--SMDDGDYGPSGVFG 355
T P+++FHF S+ LP N+ P ++S C F S+D + G
Sbjct: 283 TSVTIPTVSFHFEGGASVQLPPSNYL----VPVDTSGTFCFAFSKTSLD------LSIIG 332
Query: 356 SFQQQNVEVVYDLEKERIGFQPMDC 380
+ QQQ + V DL+ R+GF P C
Sbjct: 333 NIQQQTMRVAIDLDSSRVGFAPRQC 357
>gi|242050426|ref|XP_002462957.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
gi|241926334|gb|EER99478.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
Length = 452
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 115/387 (29%), Positives = 166/387 (42%), Gaps = 58/387 (14%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGSDLTW C + C + + P+RSS+ S+ CAS C + S+
Sbjct: 113 IDTGSDLTWTQCAPCTTACF-------AQPTPLYDPARSSTFSKLPCASPLCQALPSA-- 163
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE-- 123
F C +GC + Y Y G G L DTL +
Sbjct: 164 -FRACNATGCV---------------YDYRYAVG-FTAGYLAADTLAIGDGDGDGDASSS 206
Query: 124 IPKFCFGCV---GSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISS 180
FGC G GI G GR ALS+ SQ+G + FS+C + A +S
Sbjct: 207 FAGVAFGCSTANGGDMDGASGIVGLGRSALSLLSQIGVGR--FSYCLRSDADAG----AS 260
Query: 181 PLVIGDVAISSKDNLQFTPMLKSPMYPN----YYYIGLEAITIGNSSLTEVPLSLREFDS 236
P++ G +A + D +Q T +L++P+ YYY+ L I +G++ L V S F +
Sbjct: 261 PILFGALANVTGDKVQSTALLRNPVAARRRAPYYYVNLTGIAVGSTDL-PVTSSTFGFTA 319
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLL-SILQSTITYYPRAKEVEERTGFDLCYRVPCPNN 295
G GG++VDSGTT+T+L E Y+ L + L T R + FDLC+
Sbjct: 320 AGAGGVIVDSGTTFTYLAEAGYTMLRQAFLSQTAGLLTRVSGAQFD--FDLCFEAGA--- 374
Query: 296 TFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFG 355
D P + F F +P+ ++F A+ V CLL V G
Sbjct: 375 --ADTPVPRLVFRFAGGAEYAVPRQSYFDAV---DEGGRVACLLVLPTRG-----VSVIG 424
Query: 356 SFQQQNVEVVYDLEKERIGFQPMDCAS 382
+ Q ++ V+YDL+ F P DCAS
Sbjct: 425 NVMQMDLHVLYDLDGATFSFAPADCAS 451
>gi|357143850|ref|XP_003573078.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 431
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 112/382 (29%), Positives = 171/382 (44%), Gaps = 54/382 (14%)
Query: 7 DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDNP 66
DTGSDLTW C C D + PS SS+ S C+S+ CL + S N
Sbjct: 95 DTGSDLTWTQCQPCKL-CFPQD-------TPVYDPSASSTFSPVPCSSATCLPVLRSRNC 146
Query: 67 FDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIPK 126
P S+ CR + Y+Y +G GIL +TL + S PG +
Sbjct: 147 STP-------------SSLCR----YGYSYSDGAYSAGILGTETLTLGSSVPGQAVSVSD 189
Query: 127 FCFGC---VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSPLV 183
FGC G G G GRG LS+ +QLG + FS+C F + + SP +
Sbjct: 190 VAFGCGTDNGGDSLNSTGTVGLGRGTLSLLAQLGVGK--FSYCLTDFF---NSTLDSPFL 244
Query: 184 IGDVA--ISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN-- 239
+G +A +Q TP+L+SP+ P+ Y + L+ IT+G+ +P+ + FD N
Sbjct: 245 LGTLAELAPGPGAVQSTPLLQSPLNPSRYVVSLQGITLGD---VRLPIPNKTFDLHANST 301
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTD 299
GG++VDSGTT++ LPE + ++ + + P + F P P
Sbjct: 302 GGMVVDSGTTFSILPESGFRVVVDHVAQVLGQPPVNASSLDSPCF------PAPAGERQL 355
Query: 300 DLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQ 359
P + HF + L + N+ MS S+ CL G + G+FQQ
Sbjct: 356 PFMPDLVLHFAGGADMRLHRDNY---MSYNQEDSSF-CLNIV----GTTSTWSMLGNFQQ 407
Query: 360 QNVEVVYDLEKERIGFQPMDCA 381
QN+++++D+ ++ F P DC+
Sbjct: 408 QNIQMLFDMTVGQLSFLPTDCS 429
>gi|449451908|ref|XP_004143702.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
gi|449529900|ref|XP_004171936.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 459
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 115/394 (29%), Positives = 176/394 (44%), Gaps = 51/394 (12%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + DTGSDL WV C C +C + + S F P SSS S C C +
Sbjct: 101 LLLVADTGSDLVWVKCS----ACRNCSHHPPS---SAFLPRHSSSFSPFHCFDPHCRLL- 152
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
P P + C+ T L S C F Y+Y +G L +G +++T + S I
Sbjct: 153 ----PHAPHHL--CN-HTRLHSPC-----RFLYSYADGSLSSGFFSKETTTLKSLSGSEI 200
Query: 122 REIPKFCFGC---------VGSTYREPIGIAGFGRGALSVPSQLGF-LQKGFSHCFLAFK 171
+ FGC G+ + G+ G GRG++S SQLG FS+C +
Sbjct: 201 -HLKGLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFSSQLGRRFGNKFSYCLM--D 257
Query: 172 YANDPNISSPLVIG----DVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEV 227
Y P +S L+IG + +++ + +TP+ +P+ P +YYI + +ITI L
Sbjct: 258 YTLSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFYYITIHSITIDGVKLPIN 317
Query: 228 PLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLC 287
P ++ E D QGNGG +VDSGTT T+L + Y ++L ++ + P A E+ GFDLC
Sbjct: 318 P-AVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVK-LPNAAELTP--GFDLC 373
Query: 288 YRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGD 347
+ P + F P N+F V CL ++++ G+
Sbjct: 374 VNA---SGESRRPSLPRLRFRLGGGAVFAPPPRNYFL-----ETEEGVMCLAIRAVESGN 425
Query: 348 YGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
V G+ QQ + +D E+ R+GF C
Sbjct: 426 G--FSVIGNLMQQGFLLEFDKEESRLGFTRRGCG 457
>gi|409179878|gb|AFV26024.1| aspartic proteinase nepenthesin 1 [Nepenthes mirabilis]
Length = 437
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 114/382 (29%), Positives = 167/382 (43%), Gaps = 60/382 (15%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
MDTGSDL W C C C N+ F+P SSS S C+S C + S
Sbjct: 112 MDTGSDLIWTQCQ----PCTQCF----NQSTPIFNPQGSSSFSTLPCSSQLCQALQSP-- 161
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
TC + Y YG+G G + +TL S IP
Sbjct: 162 ------------------TCSNNSCQYTYGYGDGSETQGSMGTETLTFGSVS------IP 197
Query: 126 KFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSP 181
FGC G G+ G GRG LS+PSQL + FS+C + SS
Sbjct: 198 NITFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDVTK--FSYCMTPIGSST----SST 251
Query: 182 LVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGG 241
L++G +A S T +++S P +YYI L +++G++ L P + + G GG
Sbjct: 252 LLLGSLANSVTAGSPNTTLIESSQIPTFYYITLNGLSVGSTPLPIDPSVFKLNSNNGTGG 311
Query: 242 LLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDL 301
+++DSGTT T+ + Y ++ Q+ I+ + +GFDLC+++P +
Sbjct: 312 IIIDSGTTLTYFADNAYQ---AVRQAFISQMNLSVVNGSSSGFDLCFQMPSDQSNLQ--- 365
Query: 302 FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQN 361
P+ HF + LVLP N+F +PSN + CL S G +FG+ QQQN
Sbjct: 366 IPTFVMHF-DGGDLVLPSENYFI---SPSN--GLICLAMGSSSQG----MSIFGNIQQQN 415
Query: 362 VEVVYDLEKERIGFQPMDCAST 383
+ VVYD + F C ++
Sbjct: 416 LLVVYDTGNSVVSFLFAQCGAS 437
>gi|168043550|ref|XP_001774247.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674374|gb|EDQ60883.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 419
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 114/382 (29%), Positives = 175/382 (45%), Gaps = 65/382 (17%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V +DTGSDL W C C C N F P +SS+ +CAS+FC S
Sbjct: 95 VIVDTGSDLIWTQC----LPCETC----NAAASVIFDPVKSSTYDTVSCASNFC-----S 141
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
PF CT T C+ + Y YG+G +G L+ +T+ V
Sbjct: 142 SLPFQSCT------------TSCK----YDYMYGDGSSTSGALSTETVTVG------TGT 179
Query: 124 IPKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFL-QKGFSHCFLAFKYANDPNI 178
IP FGC +GS + GI G G+G LS+ SQ + K FS+C +
Sbjct: 180 IPNVAFGCGHTNLGS-FAGAAGIVGLGQGPLSLISQASSITSKKFSYCLVPLGSTK---- 234
Query: 179 SSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQG 238
+SP++IGD A + + +T +L + P +YY L I++ ++T P+ D+ G
Sbjct: 235 TSPMLIGDSA--AAGGVAYTALLTNTANPTFYYADLTGISVSGKAVT-YPVGTFSIDASG 291
Query: 239 NGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFT 298
GG ++DSGTT T+L ++ L++ L++ + +P A G D C+ N
Sbjct: 292 QGGFILDSGTTLTYLETGAFNALVAALKAEVP-FPEAD--GSLYGLDYCFSTAGVAN--- 345
Query: 299 DDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQ 358
+P++TFHF LP N F A+ ++ CL + + G+ Q
Sbjct: 346 -PTYPTMTFHF-KGADYELPPENVFVAL----DTGGSICLAMAASTGFS-----IMGNIQ 394
Query: 359 QQNVEVVYDLEKERIGFQPMDC 380
QQN +V+DL +R+GF+ +C
Sbjct: 395 QQNHLIVHDLVNQRVGFKEANC 416
>gi|61214232|sp|Q766C2.1|NEP2_NEPGR RecName: Full=Aspartic proteinase nepenthesin-2; AltName:
Full=Nepenthesin-II; Flags: Precursor
gi|41016423|dbj|BAD07475.1| aspartic proteinase nepenthesin II [Nepenthes gracilis]
Length = 438
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 115/382 (30%), Positives = 165/382 (43%), Gaps = 60/382 (15%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
MDTGSDL W C C C ++ F+P SSS S C S +C D
Sbjct: 113 MDTGSDLIWTQC----EPCTQC----FSQPTPIFNPQDSSSFSTLPCESQYC-----QDL 159
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
P + C + C + Y YG+G G + +T SS +P
Sbjct: 160 PSETCNNNECQ---------------YTYGYGDGSTTQGYMATETFTFETSS------VP 198
Query: 126 KFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSP 181
FGC G G+ G G G LS+PSQLG Q FS+C ++ ++ S
Sbjct: 199 NIAFGCGEDNQGFGQGNGAGLIGMGWGPLSLPSQLGVGQ--FSYCMTSYGSSS----PST 252
Query: 182 LVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGG 241
L +G A + T ++ S + P YYYI L+ IT+G +L +P S + G GG
Sbjct: 253 LALGSAASGVPEGSPSTTLIHSSLNPTYYYITLQGITVGGDNLG-IPSSTFQLQDDGTGG 311
Query: 242 LLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDL 301
+++DSGTT T+LP+ Y+ + I P E +G C++ P +T
Sbjct: 312 MIIDSGTTLTYLPQDAYNAVAQAFTDQIN-LPTVD--ESSSGLSTCFQQPSDGSTVQ--- 365
Query: 302 FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQN 361
P I+ F V L L + N S + V CL S G S +FG+ QQQ
Sbjct: 366 VPEISMQFDGGV-LNLGEQNILI-----SPAEGVICLAMGS--SSQLGIS-IFGNIQQQE 416
Query: 362 VEVVYDLEKERIGFQPMDCAST 383
+V+YDL+ + F P C ++
Sbjct: 417 TQVLYDLQNLAVSFVPTQCGAS 438
>gi|255563827|ref|XP_002522914.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223537841|gb|EEF39457.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 119/384 (30%), Positives = 174/384 (45%), Gaps = 68/384 (17%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
MDTGSDL W C C C D + F P +SSS S+ +C+S C
Sbjct: 117 MDTGSDLIWTQCK----PCTQCFDQPS----PIFDPKKSSSFSKLSCSSQLC-------- 160
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
L +S+C C + YTYG+ G + +T S IP
Sbjct: 161 ------------KALPQSSCSDSC-EYLYTYGDYSSTQGTMATETFTFGKVS------IP 201
Query: 126 KFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSP 181
FGC G + + G+ G GRG LS+ SQL + FS+C + D +S
Sbjct: 202 NVGFGCGEDNEGDGFTQGSGLVGLGRGPLSLVSQLK--EAKFSYCLTSI----DDTKTST 255
Query: 182 LVIGDVAI--SSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ-- 237
L++G +A + ++ TP++++P+ P++YY+ LE I++G T +P+ F Q
Sbjct: 256 LLMGSLASVNGTSAAIRTTPLIQNPLQPSFYYLSLEGISVGG---TRLPIKESTFQLQDD 312
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTF 297
G GGL++DSGTT T+L E + + S + + TG +LCY +P +
Sbjct: 313 GTGGLIIDSGTTITYLEESAFDLVKKEFTSQMGL---PVDNSGATGLELCYNLPSDTSEL 369
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSF 357
P + HF L LP N+ A S S V CL G G +FG+
Sbjct: 370 E---VPKLVLHF-TGADLELPGENYMIADS----SMGVICLAM-----GSSGGMSIFGNV 416
Query: 358 QQQNVEVVYDLEKERIGFQPMDCA 381
QQQN+ V +DLEKE + F P +C
Sbjct: 417 QQQNMFVSHDLEKETLSFLPTNCG 440
>gi|225427550|ref|XP_002266461.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 439
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 126/382 (32%), Positives = 175/382 (45%), Gaps = 61/382 (15%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGSDLTW C C C Y+ +++ F P SS+ +C +SFCL + +
Sbjct: 109 VDTGSDLTWTQCR----PCTHC--YK--QVVPLFDPKNSSTYRDSSCGTSFCLALGKDRS 160
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
CS K C +F Y+Y +G G L +TL V S+ G P
Sbjct: 161 ---------CS-----KEKKC----TFRYSYADGSFTGGNLASETLTVD-STAGKPVSFP 201
Query: 126 KFCFGCVGSTY----REPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNISS 180
F FGC S+ + GI G G G LS+ SQL G FS+C L + D +ISS
Sbjct: 202 GFAFGCGHSSGGIFDKSSSGIVGLGGGELSLISQLKSTINGLFSYCLLPV--STDSSISS 259
Query: 181 PLVIGDVAISSKDNLQFTPML-KSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
+ G S TP++ KSP +YY+ LE I++G L S + +GN
Sbjct: 260 RINFGASGRVSGYGTVSTPLVQKSP--DTFYYLTLEGISVGKKRLPYKGYSKKTEVEEGN 317
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTG-FDLCYRVPCPNNTFT 298
++VDSGTTYT LP+ FYS+L + ++I + K V + G F LCY NT
Sbjct: 318 --IIVDSGTTYTFLPQEFYSKLEKSVANSI----KGKRVRDPNGIFSLCY------NTTA 365
Query: 299 DDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQ 358
+ P IT HF + ++ L N F M + C D GV G+
Sbjct: 366 EINAPIITAHF-KDANVELQPLNTFMRM-----QEDLVCFTVAPTSD-----IGVLGNLA 414
Query: 359 QQNVEVVYDLEKERIGFQPMDC 380
Q N V +DL K+R+ F+ DC
Sbjct: 415 QVNFLVGFDLRKKRVSFKAADC 436
>gi|357124861|ref|XP_003564115.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 477
Score = 129 bits (324), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 110/399 (27%), Positives = 169/399 (42%), Gaps = 57/399 (14%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + +DTGSDL W C C++C D + P+ SS+ + C + C +
Sbjct: 107 VALTLDTGSDLVWTQCA----PCLNCFD---QGAIPVLDPAASSTHAAVRCDAPVCRAL- 158
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKV---HGSSP 118
PF C G S R C + Y YG+ + G L D +
Sbjct: 159 ----PFTSCGRGGSSWGE-------RSC-VYVYHYGDKSITVGKLASDRFTFGPGDNADG 206
Query: 119 GIIREIPKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYAN 174
G + E + FGC G GIAGFGRG S+PSQLG FS+CF + +
Sbjct: 207 GGVSE-RRLTFGCGHFNKGIFQANETGIAGFGRGRWSLPSQLGVTS--FSYCFTSMFEST 263
Query: 175 DPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREF 234
++ L + + +Q TP+L+ P P+ Y++ L+AIT+G T +P+ R
Sbjct: 264 SSLVT--LGVAPAELHLTGQVQSTPLLRDPSQPSLYFLSLKAITVGA---TRIPIPERRQ 318
Query: 235 DSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVP--- 291
+ ++DSG + T LPE Y + + + + A E + DLC+ +P
Sbjct: 319 RLR-EASAIIDSGASITTLPEDVYEAVKAEFVAQVGLPVSAVE---GSALDLCFALPSAA 374
Query: 292 CPNNTFTDDL----------FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQ 341
P + F P + FH LP+ N+ + + V CL+
Sbjct: 375 APKSAFGWRWRGRGRAMPVRVPRLVFHLGGGADWELPRENYVFE----DYGARVMCLVLD 430
Query: 342 SMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
+ G + V G++QQQN VVYDLE + + F P C
Sbjct: 431 AATGGG-DQTVVIGNYQQQNTHVVYDLENDVLSFAPARC 468
>gi|413918484|gb|AFW58416.1| hypothetical protein ZEAMMB73_998053 [Zea mays]
Length = 475
Score = 128 bits (322), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 113/386 (29%), Positives = 178/386 (46%), Gaps = 57/386 (14%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGSDL W C C++C N+ F P+ SS+ + C+S+ C ++ +S
Sbjct: 133 VDTGSDLVWTQCK----PCVEC----FNQTTPVFDPAASSTYAALPCSSALCADLPTS-- 182
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
+ ++ S+ + YTYG+ G+L +T + +++P
Sbjct: 183 ----------TCASSSSSSSASSPCGYTYTYGDASSTQGVLATETFTL------ARQKVP 226
Query: 126 KFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSP 181
FGC G + + G+ G GRG LS+ SQLG + FS+C + +D SP
Sbjct: 227 GVAFGCGDTNEGDGFTQGAGLVGLGRGPLSLVSQLGIDR--FSYCLTSL---DDAAGRSP 281
Query: 182 LVIGDVAISSKDNL----QFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
L++G A S Q TP++K+P P++YY+ L +T+G++ L +P S
Sbjct: 282 LLLGSAAGISASAATAPAQTTPLVKNPSQPSFYYVSLTGLTVGSTRLA-LPSSAFAIQDD 340
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTF 297
G GG++VDSGT+ T+L Y L + ++ P E G DLC++ P
Sbjct: 341 GTGGVIVDSGTSITYLELRAYRALRKAFVAHMS-LPTVDASE--IGLDLCFQGPA--GAV 395
Query: 298 TDDL---FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVF 354
D+ P + HF L LP N+ SA S CL + G S +
Sbjct: 396 DQDVQVQVPKLVLHFDGGADLDLPAENYMVLDSA----SGALCLTVMA----SRGLS-II 446
Query: 355 GSFQQQNVEVVYDLEKERIGFQPMDC 380
G+FQQQN + VYD+ + + F P +C
Sbjct: 447 GNFQQQNFQFVYDVAGDTLSFAPAEC 472
>gi|409179880|gb|AFV26025.1| aspartic proteinase nepenthesin 2 [Nepenthes mirabilis]
Length = 437
Score = 127 bits (320), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 112/386 (29%), Positives = 165/386 (42%), Gaps = 61/386 (15%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ MDTGSDL W C C C ++ F+P SSS S C S +C ++
Sbjct: 109 LSAIMDTGSDLIWTQC----EPCTQC----FSQPTPIFNPQDSSSFSTLPCESQYCQDLP 160
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
S +C C + Y YG+G G + +T SS
Sbjct: 161 SE--------------------SCYNDC-QYTYGYGDGSSTQGYMATETFTFETSS---- 195
Query: 122 REIPKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPN 177
+P FGC G G+ G G G LS+PSQLG Q FS+C + ++
Sbjct: 196 --VPNIAFGCGEDNQGFGQGNGAGLIGMGWGPLSLPSQLGVGQ--FSYCMTSSGSSS--- 248
Query: 178 ISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
S L +G A + T ++ S + P YYYI L+ IT+G +L +P S +
Sbjct: 249 -PSTLALGSAASGVPEGSPSTTLIHSSLNPTYYYITLQGITVGGDNLG-IPSSTFQLQDD 306
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTF 297
G GG+++DSGTT T+LP+ Y+ + I P E +G C+++P +T
Sbjct: 307 GTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQINLSPVD---ESSSGLSTCFQLPSDGSTV 363
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSF 357
P I+ F V L L + N S + V CL +M +FG+
Sbjct: 364 Q---VPEISMQFDGGV-LNLGEENVLI-----SPAEGVICL---AMGSSSQQGISIFGNI 411
Query: 358 QQQNVEVVYDLEKERIGFQPMDCAST 383
QQQ +V+YDL+ + F P C ++
Sbjct: 412 QQQETQVLYDLQNLAVSFVPTQCGAS 437
>gi|357143854|ref|XP_003573079.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 417
Score = 127 bits (320), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 110/380 (28%), Positives = 173/380 (45%), Gaps = 53/380 (13%)
Query: 7 DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDNP 66
DTGSDLTW C C D + PS SS+ S C+S+ CL S N
Sbjct: 84 DTGSDLTWTQCQPCKL-CFPQD-------TPVYDPSASSTFSPVPCSSATCLPTWRSRNC 135
Query: 67 FDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIPK 126
+P S+ CR + Y+Y +G GIL +TL + S PG +
Sbjct: 136 SNP-------------SSPCR----YIYSYSDGAYSVGILGTETLTIGSSVPGQTVSVGS 178
Query: 127 FCFGC---VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSPLV 183
FGC G G G GRG LS+ +QLG + FS+C F + + SP
Sbjct: 179 VAFGCGTDNGGDSLNSTGTVGLGRGTLSLLAQLGVGK--FSYCLTDFF---NSTMDSPFF 233
Query: 184 IGDVA--ISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGG 241
+G +A +Q TP+L+SP+ P+ Y++ L+ I++G+ L +P + + GNGG
Sbjct: 234 LGTLAELAPGPGTVQSTPLLQSPLNPSRYFVNLQGISLGDVRL-PIPNGTFDLRADGNGG 292
Query: 242 LLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDL 301
++VDSGTT+T L + + +++ + + P + F P P+ +
Sbjct: 293 MMVDSGTTFTILAKSGFREVVDRVAQLLGQPPVNASSLDSPCF------PSPDG---EPF 343
Query: 302 FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQN 361
P + HF + L + N+ MS + S+ CL G G+FQQQN
Sbjct: 344 MPDLVLHFAGGADMRLHRDNY---MSYNEDDSSF-CLNIV----GSPSTWSRLGNFQQQN 395
Query: 362 VEVVYDLEKERIGFQPMDCA 381
+++++D+ ++ F P DC+
Sbjct: 396 IQMLFDMTVGQLSFLPTDCS 415
>gi|224067990|ref|XP_002302634.1| predicted protein [Populus trichocarpa]
gi|222844360|gb|EEE81907.1| predicted protein [Populus trichocarpa]
Length = 490
Score = 127 bits (320), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 114/382 (29%), Positives = 173/382 (45%), Gaps = 57/382 (14%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DTGSD+ W+ C C C Y + F+P++S S + C S C +
Sbjct: 162 MVLDTGSDVVWIQCA----PCKKC--YSQTDPV--FNPTKSRSFANIPCGSPLCRRL--- 210
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
D+P GCS K C + +YG+G G + +TL G+ G
Sbjct: 211 DSP-------GCSTK---KHICL-----YQVSYGDGSFTYGEFSTETLTFRGTRVG---- 251
Query: 124 IPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGF-LQKGFSHCFLAFKYANDPNIS 179
+ GC + G+ G GRG LS PSQ+G + FS+C + ++ P
Sbjct: 252 --RVALGCGHDNEGLFIGAAGLLGLGRGRLSFPSQIGRRFSRKFSYCLVDRSASSKP--- 306
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
S +V GD AIS +FTP++ +P +YY+ L +++G + + + SL + DS GN
Sbjct: 307 SYMVFGDSAISR--TARFTPLVSNPKLDTFYYVELLGVSVGGTRVPGITASLFKLDSTGN 364
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTD 299
GG+++DSGT+ T L P Y L + + RA E FD C+ + + T+
Sbjct: 365 GGVIIDSGTSVTRLTRPAYVALRDAFRVGASNLKRAPEFSL---FDTCFDL----SGKTE 417
Query: 300 DLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQ 359
P++ HF + LP N+ P ++S C F G + G+ QQ
Sbjct: 418 VKVPTVVLHF-RGADVSLPASNYLI----PVDNSGSFCFAFAGTMSG----LSIVGNIQQ 468
Query: 360 QNVEVVYDLEKERIGFQPMDCA 381
Q VVYDL R+GF P CA
Sbjct: 469 QGFRVVYDLAASRVGFAPRGCA 490
>gi|357135412|ref|XP_003569303.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 494
Score = 127 bits (319), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 110/387 (28%), Positives = 163/387 (42%), Gaps = 59/387 (15%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DTGSD+ W+ C C C D F P RS S C++ C + S
Sbjct: 157 MVLDTGSDVVWLQCA----PCRRCYDQSGQV----FDPRRSRSYGAVGCSAPLCRRLDSG 208
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
GC L + C + YG+G + G +TL G +
Sbjct: 209 ----------GCDLR---RKACL-----YQVAYGDGSVTAGDFATETLTFAGGA-----R 245
Query: 124 IPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGF-LQKGFSHCFL-AFKYANDPNI 178
+ + GC + G+ G GRG+LS P+Q+ + FS+C + AN +
Sbjct: 246 VARIALGCGHDNEGLFVAAAGLLGLGRGSLSFPAQISRRYGRSFSYCLVDRTSSANPASH 305
Query: 179 SSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLS-LREFDSQ 237
SS + G A+ S FTPM+K+P +YY+ L I++G + ++ V S LR S
Sbjct: 306 SSTVTFGSGAVGSTVAASFTPMVKNPRMETFYYVQLVGISVGGARVSGVADSDLRLDPSS 365
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTG----FDLCYRVPCP 293
G GG++VDSGT+ T L P YS L ++ A + G FD CY +
Sbjct: 366 GRGGVIVDSGTSVTRLARPAYSALRDAFRAA------AAGLRLSPGGFSLFDTCYDLSGR 419
Query: 294 NNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGV 353
P+++ HF LP N+ P +S C F D G +
Sbjct: 420 KVV----KVPTVSMHFAGGAEAALPPENYLI----PVDSKGTFCFAFAGTDGG----VSI 467
Query: 354 FGSFQQQNVEVVYDLEKERIGFQPMDC 380
G+ QQQ VV+D + +R+GF P C
Sbjct: 468 IGNIQQQGFRVVFDGDGQRVGFVPKGC 494
>gi|356522504|ref|XP_003529886.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 473
Score = 127 bits (319), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 113/384 (29%), Positives = 173/384 (45%), Gaps = 57/384 (14%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + +DTGSD+ W+ C C C Y + F PS+S S + C S C +
Sbjct: 143 LYMVLDTGSDVVWLQCK----PCTKC--YSQTDQI--FDPSKSKSFAGIPCYSPLCRRL- 193
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
D+P GCSL K+ C+ + +YG+G G + +TL ++
Sbjct: 194 --DSP-------GCSL----KNNLCQ----YQVSYGDGSFTFGDFSTETLTFRRAA---- 232
Query: 122 REIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGF-LQKGFSHCFLAFKYANDPN 177
+P+ GC + G+ G GRG LS P+Q G FS+C + P
Sbjct: 233 --VPRVAIGCGHDNEGLFVGAAGLLGLGRGGLSFPTQTGTRFNNKFSYCLTDRTASAKP- 289
Query: 178 ISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
S +V GD A+S +FTP++K+P +YY+ L I++G + + + S DS
Sbjct: 290 --SSIVFGDSAVSR--TARFTPLVKNPKLDTFYYVELLGISVGGAPVRGISASFFRLDST 345
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTF 297
GNGG+++DSGT+ T L P Y L + ++ RA E FD CY + +
Sbjct: 346 GNGGVIIDSGTSVTRLTRPAYVSLRDAFRVGASHLKRAPEFSL---FDTCYDL----SGL 398
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSF 357
++ P++ HF + LP N+ P ++S C F G + G+
Sbjct: 399 SEVKVPTVVLHF-RGADVSLPAANYL----VPVDNSGSFCFAFAGTMSG----LSIIGNI 449
Query: 358 QQQNVEVVYDLEKERIGFQPMDCA 381
QQQ VV+DL R+GF P CA
Sbjct: 450 QQQGFRVVFDLAGSRVGFAPRGCA 473
>gi|168005153|ref|XP_001755275.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693403|gb|EDQ79755.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 429
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 115/381 (30%), Positives = 177/381 (46%), Gaps = 63/381 (16%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGSDL WV C C C + L + F PS+S+S C S+FC D
Sbjct: 107 VDTGSDLNWVQC----LPCKSCYE----TLSAKFDPSKSASYKTLGCGSNFC-----QDL 153
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
PF C S C+ + Y YG+G +G L+ D + + +IP
Sbjct: 154 PFQSCAAS------------CQ----YDYMYGDGSSTSGALSTDDVTIG------TGKIP 191
Query: 126 KFCFGCVGS---TYREPIGIAGFGRGALSVPSQLG-FLQKGFSHCFLAFKYANDPNISSP 181
FGC S T+ G+ G G+G LS+ SQLG K FS+C + +SP
Sbjct: 192 NVAFGCGNSNLGTFAGAGGLVGLGKGPLSLVSQLGGTATKKFSYCLVPLGSTK----TSP 247
Query: 182 LVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGG 241
L IGD ++ + +TPML + YP +YY L+ I++ ++ P + + + G GG
Sbjct: 248 LYIGDSTLAG--GVAYTPMLTNNNYPTFYYAELQGISVEGKAV-NYPANTFDIAATGRGG 304
Query: 242 LLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDL 301
L++DSGTT T+L ++ +++ L++ + YP A G + C+ N
Sbjct: 305 LILDSGTTLTYLDVDAFNPMVAALKAALP-YPEAD--GSFYGLEYCFSTAGVAN----PT 357
Query: 302 FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQN 361
+P++ FHF N + L N F A+ + CL S +FG+ QQ N
Sbjct: 358 YPTVVFHF-NGADVALAPDNTFIAL----DFEGTTCLAMASSTG-----FSIFGNIQQLN 407
Query: 362 VEVVYDLEKERIGFQPMDCAS 382
+V+DL +RIGF+ +C +
Sbjct: 408 HVIVHDLVNKRIGFKSANCET 428
>gi|125563957|gb|EAZ09337.1| hypothetical protein OsI_31609 [Oryza sativa Indica Group]
gi|125605916|gb|EAZ44952.1| hypothetical protein OsJ_29595 [Oryza sativa Japonica Group]
Length = 438
Score = 127 bits (318), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 106/380 (27%), Positives = 161/380 (42%), Gaps = 51/380 (13%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGSDL W C C+ C + + F P++S+S + C+S+ C ++S
Sbjct: 102 IDTGSDLIWTQCA----PCLLCVE----QPTPYFEPAKSTSYASLPCSSAMCNALYS--- 150
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
P C + C YG+ G+L +T +S + +P
Sbjct: 151 PL--CFQNACVYQAF---------------YGDSASSAGVLANETFTFGTNSTRV--AVP 191
Query: 126 KFCFGC---VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNI--SS 180
+ FGC T G+ GFGRGALS+ SQLG FS+C +F + +
Sbjct: 192 RVSFGCGNMNAGTLFNGSGMVGFGRGALSLVSQLG--SPRFSYCLTSFMSPATSRLYFGA 249
Query: 181 PLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNG 240
+ SS +Q TP + +P P Y++ + I++ L P ++ G G
Sbjct: 250 YATLNSTNTSSSGPVQSTPFIVNPALPTMYFLNMTGISVAGDLLPIDPSVFAINETDGTG 309
Query: 241 GLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDD 300
G+++DSGTT T L +P Y+ + + + PRA T FD C++ P P
Sbjct: 310 GVIIDSGTTVTFLAQPAYAMVQGAFVAWVG-LPRANATPSDT-FDTCFKWPPPPRRMVT- 366
Query: 301 LFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQ 360
P + HF + + LP N+ N CL DDG + GSFQ Q
Sbjct: 367 -LPEMVLHF-DGADMELPLENYMVMDGGTGN----LCLAMLPSDDGS-----IIGSFQHQ 415
Query: 361 NVEVVYDLEKERIGFQPMDC 380
N ++YDLE + F P C
Sbjct: 416 NFHMLYDLENSLLSFVPAPC 435
>gi|115479485|ref|NP_001063336.1| Os09g0452400 [Oryza sativa Japonica Group]
gi|51535935|dbj|BAD38017.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113631569|dbj|BAF25250.1| Os09g0452400 [Oryza sativa Japonica Group]
gi|215693279|dbj|BAG88661.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 441
Score = 127 bits (318), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 106/380 (27%), Positives = 161/380 (42%), Gaps = 51/380 (13%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGSDL W C C+ C + + F P++S+S + C+S+ C ++S
Sbjct: 105 IDTGSDLIWTQCA----PCLLCVE----QPTPYFEPAKSTSYASLPCSSAMCNALYS--- 153
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
P C + C YG+ G+L +T +S + +P
Sbjct: 154 PL--CFQNACVYQAF---------------YGDSASSAGVLANETFTFGTNSTRV--AVP 194
Query: 126 KFCFGC---VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNI--SS 180
+ FGC T G+ GFGRGALS+ SQLG FS+C +F + +
Sbjct: 195 RVSFGCGNMNAGTLFNGSGMVGFGRGALSLVSQLG--SPRFSYCLTSFMSPATSRLYFGA 252
Query: 181 PLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNG 240
+ SS +Q TP + +P P Y++ + I++ L P ++ G G
Sbjct: 253 YATLNSTNTSSSGPVQSTPFIVNPALPTMYFLNMTGISVAGDLLPIDPSVFAINETDGTG 312
Query: 241 GLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDD 300
G+++DSGTT T L +P Y+ + + + PRA T FD C++ P P
Sbjct: 313 GVIIDSGTTVTFLAQPAYAMVQGAFVAWVG-LPRANATPSDT-FDTCFKWPPPPRRMVT- 369
Query: 301 LFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQ 360
P + HF + + LP N+ N CL DDG + GSFQ Q
Sbjct: 370 -LPEMVLHF-DGADMELPLENYMVMDGGTGN----LCLAMLPSDDGS-----IIGSFQHQ 418
Query: 361 NVEVVYDLEKERIGFQPMDC 380
N ++YDLE + F P C
Sbjct: 419 NFHMLYDLENSLLSFVPAPC 438
>gi|50508275|dbj|BAD32124.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
Length = 451
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 117/393 (29%), Positives = 178/393 (45%), Gaps = 63/393 (16%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V DTGS L W C C +C + F P+ SS+ S+ CASS C + S
Sbjct: 105 VLADTGSSLIWTQCA----PCTECAA----RPAPPFQPASSSTFSKLPCASSLCQFLTS- 155
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
P+ C +GC + Y YG G G L +TL V G+S
Sbjct: 156 --PYLTCNATGCV---------------YYYPYGMG-FTAGYLATETLHVGGAS------ 191
Query: 124 IPKFCFGC-----VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNI 178
P FGC VG++ GI G GR LS+ SQ+G + FS+C + A D
Sbjct: 192 FPGVAFGCSTENGVGNSSS---GIVGLGRSPLSLVSQVGVGR--FSYCLRSDADAGD--- 243
Query: 179 SSPLVIGDVAISSKDNLQFTPMLKSPMYPN--YYYIGLEAITIGNSSLTEVPLSLREFD- 235
SP++ G +A + N+Q TP+L++P P+ YYY+ L IT+G T++P++ F
Sbjct: 244 -SPILFGSLAKVTGGNVQSTPLLENPEMPSSSYYYVNLTGITVGA---TDLPVTSTTFGF 299
Query: 236 SQGNG-----GLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVE-ERTGFDLCYR 289
++G G G +VDSGTT T+L + Y+ + S + V R GFDLC+
Sbjct: 300 TRGAGAGLVGGTIVDSGTTLTYLVKEGYAMVKRAFLSQMATANLTTTVNGTRFGFDLCFD 359
Query: 290 VPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNS-SAVKCLLFQSMDDGDY 348
+ P++ F + + ++ ++ S +AV+CLL + +
Sbjct: 360 ATAAGGG-SGVPVPTLVLRFAGGAEYAVRRRSYVGVVAVDSQGRAAVECLLV--LPASEK 416
Query: 349 GPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
+ G+ Q ++ V+YDL+ F P DCA
Sbjct: 417 LSISIIGNVMQMDLHVLYDLDGGMFSFAPADCA 449
>gi|226504334|ref|NP_001141706.1| uncharacterized protein LOC100273835 precursor [Zea mays]
gi|194705620|gb|ACF86894.1| unknown [Zea mays]
gi|414885968|tpg|DAA61982.1| TPA: hypothetical protein ZEAMMB73_231717 [Zea mays]
Length = 477
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 116/386 (30%), Positives = 179/386 (46%), Gaps = 61/386 (15%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ V +DTGSD +W+ C C DC Y ++ + F PS+SS+ S TC+S C +
Sbjct: 147 LLVELDTGSDQSWIQCK----PCPDC--YEQHEAL--FDPSKSSTYSDITCSSRECQELG 198
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCC--RPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPG 119
SS K C + CP + TY + G L RDTL + SP
Sbjct: 199 SSH-----------------KHNCSSDKKCP-YEITYADDSYTVGNLARDTLTL---SP- 236
Query: 120 IIREIPKFCFGCV---GSTYREPIGIAGFGRGALSVPSQLGF-LQKGFSHCFLAFKYAND 175
+P F FGC ++ E G+ G GRG S+ SQ+ GFS+C +
Sbjct: 237 -TDAVPGFVFGCGHNNAGSFGEIDGLLGLGRGKASLSSQVAARYGAGFSYCL-----PSS 290
Query: 176 PNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFD 235
P+ + L A ++ N QFT M+ +P++YY+ L IT+ ++ +VP S+ F
Sbjct: 291 PSATGYLSFSGAAAAAPTNAQFTEMVAG-QHPSFYYLNLTGITVAGRAI-KVPPSV--FA 346
Query: 236 SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNN 295
+ G ++DSGT ++ LP Y+ L S ++S + Y RA T FD CY +
Sbjct: 347 TAA--GTIIDSGTAFSCLPPSAYAALRSSVRSAMGRYKRAPS---STIFDTCYDLTGHET 401
Query: 296 TFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFG 355
PS+ F + ++ L Y S S + CL F + + D GV G
Sbjct: 402 V----RIPSVALVFADGATVHLHPSGVLYTWSNVSQT----CLAF--LPNPDDTSLGVLG 451
Query: 356 SFQQQNVEVVYDLEKERIGFQPMDCA 381
+ QQ+ + V+YD++ +++GF CA
Sbjct: 452 NTQQRTLAVIYDVDNQKVGFGANGCA 477
>gi|255564685|ref|XP_002523337.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223537425|gb|EEF39053.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 469
Score = 125 bits (314), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 115/384 (29%), Positives = 171/384 (44%), Gaps = 59/384 (15%)
Query: 4 VYM--DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
VYM DTGSD+ W+ C C C Y + + F P +S S + C S C H
Sbjct: 139 VYMVLDTGSDIVWIQCA----PCKRC--YAQSDPV--FDPRKSRSFASIACRSPLC---H 187
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
D+P GC+ K TC + +YG+G G + +TL +
Sbjct: 188 RLDSP-------GCNTQ---KQTCM-----YQVSYGDGSFTFGDFSTETLTFRRT----- 227
Query: 122 REIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGF-LQKGFSHCFLAFKYANDPN 177
+ + GC + G+ G GRG LS PSQ G FS+C + ++ P
Sbjct: 228 -RVARVALGCGHDNEGLFVGAAGLLGLGRGRLSFPSQTGRRFNHKFSYCLVDRSASSKP- 285
Query: 178 ISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
S +V GD A+S +FTP++ +P +YY+ L I++G + + + SL + D
Sbjct: 286 --SSMVFGDSAVSR--TARFTPLVSNPKLDTFYYVELLGISVGGTRVPGITASLFKLDQT 341
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTF 297
GNGG+++DSGT+ T L P Y ++ + RA + FD C+ +
Sbjct: 342 GNGGVIIDSGTSVTRLTRPAYIAFRDAFRAGASNLKRAPQFSL---FDTCFDLSGK---- 394
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSF 357
T+ P++ HF + LP N+ P ++S CL F G G + G+
Sbjct: 395 TEVKVPTVVLHF-RGADVSLPASNYLI----PVDTSGNFCLAFA----GTMGGLSIIGNI 445
Query: 358 QQQNVEVVYDLEKERIGFQPMDCA 381
QQQ VVYDL R+GF P CA
Sbjct: 446 QQQGFRVVYDLAGSRVGFAPHGCA 469
>gi|224130548|ref|XP_002320868.1| predicted protein [Populus trichocarpa]
gi|222861641|gb|EEE99183.1| predicted protein [Populus trichocarpa]
Length = 488
Score = 125 bits (314), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 115/384 (29%), Positives = 169/384 (44%), Gaps = 57/384 (14%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + +DTGSD+ W+ C C+ C Y + F P++S S + C S C +
Sbjct: 158 VYMVLDTGSDIVWIQCA----PCIKC--YSQTDPV--FDPTKSRSFANIPCGSPLCRRL- 208
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
D P GCS K C + +YG+G G + +TL G+ G
Sbjct: 209 --DYP-------GCSTK---KQICL-----YQVSYGDGSFTVGEFSTETLTFRGTRVG-- 249
Query: 122 REIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGF-LQKGFSHCFLAFKYANDPN 177
+ GC + G+ G GRG LS PSQ+G FS+C ++ P
Sbjct: 250 ----RVVLGCGHDNEGLFVGAAGLLGLGRGRLSFPSQIGRRFNSKFSYCLGDRSASSRP- 304
Query: 178 ISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
S +V GD AIS +FTP+L +P +YY+ L I++G + ++ + SL + DS
Sbjct: 305 --SSIVFGDSAISR--TTRFTPLLSNPKLDTFYYVELLGISVGGTRVSGISASLFKLDST 360
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTF 297
GNGG+++DSGT+ T L Y L + RA E FD C+ +
Sbjct: 361 GNGGVIIDSGTSVTRLTRAAYVALRDAFLVGASNLKRAPEFSL---FDTCFDLSGK---- 413
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSF 357
T+ P++ HF + LP N+ P ++S C F G + G+
Sbjct: 414 TEVKVPTVVLHF-RGADVPLPASNYLI----PVDNSGSFCFAFAGTASG----LSIIGNI 464
Query: 358 QQQNVEVVYDLEKERIGFQPMDCA 381
QQQ VVYDL R+GF P CA
Sbjct: 465 QQQGFRVVYDLATSRVGFAPRGCA 488
>gi|359492937|ref|XP_002283889.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 439
Score = 125 bits (314), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 108/382 (28%), Positives = 162/382 (42%), Gaps = 62/382 (16%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DT D WVPC DC C FSP+ SS+ + C+ C +
Sbjct: 114 MVLDTSRDAAWVPCA----DCAGCSS-------PTFSPNTSSTYASLQCSVPQCTQVR-- 160
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
G S T + C F TYG + +L++D+L + +
Sbjct: 161 ----------GLSCPTTGTAACF-----FNQTYGGDSSFSAMLSQDSLGLA------VDT 199
Query: 124 IPKFCFGCV----GSTYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNI 178
+P + FGCV GST P G+ G GRG +S+ SQ G L G FS+CF +FK
Sbjct: 200 LPSYSFGCVNAVSGSTL-PPQGLLGLGRGPMSLLSQSGSLYSGVFSYCFPSFK---SYYF 255
Query: 179 SSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQG 238
S L +G + N++ TP+L++P P YY+ L +++G L V L FD
Sbjct: 256 SGSLRLGP--LGQPKNIRTTPLLRNPHRPTLYYVNLTGVSVGR-VLVPVAPELLAFDPNT 312
Query: 239 NGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFT 298
G ++DSGT T EP Y+ + + + FD C+
Sbjct: 313 GAGTIIDSGTVITRFVEPVYAAIRDEFRKQV-----KGPFATIGAFDTCFAAT------N 361
Query: 299 DDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQ 358
+D+ P +TFHF + L LP N SA S + CL + + V + Q
Sbjct: 362 EDIAPPVTFHF-TGMDLKLPLENTLIHSSAGS----LACLAMAAAPNNVNSVLNVIANLQ 416
Query: 359 QQNVEVVYDLEKERIGFQPMDC 380
QQN+ +++D+ R+G C
Sbjct: 417 QQNLRIMFDVTNSRLGIARELC 438
>gi|326506192|dbj|BAJ86414.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326516358|dbj|BAJ92334.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 492
Score = 125 bits (314), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 111/387 (28%), Positives = 164/387 (42%), Gaps = 59/387 (15%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DTGSD+ W+ C C C Y + + F P RS S + CA+ C + S
Sbjct: 155 MVLDTGSDVVWLQCA----PCRRC--YEQSGQV--FDPRRSRSYNAVGCAAPLCRRLDSG 206
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
GC L +S C + YG+G + G +TL G +
Sbjct: 207 ----------GCDLR---RSACL-----YQVAYGDGSVTAGDFATETLTFAGGA-----R 243
Query: 124 IPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGF-LQKGFSHCFL-AFKYANDPNI 178
+ + GC + G+ G GRG+LS P+Q+ + FS+C + AN +
Sbjct: 244 VARVALGCGHDNEGLFVAAAGLLGLGRGSLSFPTQISRRYGRSFSYCLVDRTSSANTASR 303
Query: 179 SSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLS-LREFDSQ 237
SS + G A+ S FTPM+K+P +YY+ L I++G + + V S LR S
Sbjct: 304 SSTVTFGSGAVGSTVASSFTPMVKNPRMETFYYVQLIGISVGGARVPGVANSDLRLDPSS 363
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTG----FDLCYRVPCP 293
G GG++VDSGT+ T L P YS L + A + G FD CY +
Sbjct: 364 GRGGVIVDSGTSVTRLARPAYSALRDAFRGA------AAGLRLSPGGFSLFDTCYDLSGR 417
Query: 294 NNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGV 353
P+++ HF LP N+ P +S C F D G +
Sbjct: 418 KVV----KVPTVSMHFAGGAEAALPPENYLI----PVDSKGTFCFAFAGTDGG----VSI 465
Query: 354 FGSFQQQNVEVVYDLEKERIGFQPMDC 380
G+ QQQ VV+D + +R+ F P C
Sbjct: 466 IGNIQQQGFRVVFDGDGQRVAFTPKGC 492
>gi|242044886|ref|XP_002460314.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
gi|241923691|gb|EER96835.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
Length = 444
Score = 124 bits (312), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 107/381 (28%), Positives = 159/381 (41%), Gaps = 54/381 (14%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGSDL W C C+ C D + F P+ SS+ C++ C
Sbjct: 109 LDTGSDLIWTQCA----PCLLCVD----QPTPYFDPANSSTYRSLGCSAPAC-------- 152
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
+ L C + + Y YG+ G+L +T + + +P
Sbjct: 153 ------------NALYYPLCYQKTCVYQYFYGDSASTAGVLANETFTFGTNDTRV--TLP 198
Query: 126 KFCFGC---VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSPL 182
+ FGC + G+ GFGRG+LS+ SQLG FS+C +F + S L
Sbjct: 199 RISFGCGNLNAGSLANGSGMVGFGRGSLSLVSQLG--SPRFSYCLTSFLSP----VRSRL 252
Query: 183 VIGDVAISSKDN---LQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
G A + N +Q TP + +P P Y++ + I++G + L P L D+ G
Sbjct: 253 YFGAYATLNSTNASTVQSTPFIINPALPTMYFLNMTGISVGGNRLPIDPAVLAINDTDGT 312
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTD 299
GG ++DSGTT T+L EP Y + + +V E + D C++ P P
Sbjct: 313 GGTIIDSGTTITYLAEPAYYAVREAFVLYLNSTLPLLDVTETSVLDTCFQWPPPPRQSVT 372
Query: 300 DLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQ 359
P + HF + LP N Y + PS CL + DG + GS+Q
Sbjct: 373 --LPQLVLHF-DGADWELPLQN--YMLVDPSTGG--LCLAMATSSDGS-----IIGSYQH 420
Query: 360 QNVEVVYDLEKERIGFQPMDC 380
QN V+YDLE + F P C
Sbjct: 421 QNFNVLYDLENSLLSFVPAPC 441
>gi|61214233|sp|Q766C3.1|NEP1_NEPGR RecName: Full=Aspartic proteinase nepenthesin-1; AltName:
Full=Nepenthesin-I; Flags: Precursor
gi|41016421|dbj|BAD07474.1| aspartic proteinase nepenthesin I [Nepenthes gracilis]
Length = 437
Score = 124 bits (312), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 115/383 (30%), Positives = 162/383 (42%), Gaps = 62/383 (16%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
MDTGSDL W C C C N+ F+P SSS S C+S C
Sbjct: 112 MDTGSDLIWTQCQ----PCTQCF----NQSTPIFNPQGSSSFSTLPCSSQLC-------- 155
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
L TC + Y YG+G G + +TL S IP
Sbjct: 156 ------------QALSSPTCSNNFCQYTYGYGDGSETQGSMGTETLTFGSVS------IP 197
Query: 126 KFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSP 181
FGC G G+ G GRG LS+PSQL + FS+C + N
Sbjct: 198 NITFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDVTK--FSYCMTPIGSSTPSN---- 251
Query: 182 LVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGG 241
L++G +A S T +++S P +YYI L +++G++ L P + + G GG
Sbjct: 252 LLLGSLANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVGSTRLPIDPSAFALNSNNGTGG 311
Query: 242 LLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPC-PNNTFTDD 300
+++DSGTT T+ Y S+ Q I+ +GFDLC++ P P+N
Sbjct: 312 IIIDSGTTLTYFVNNAYQ---SVRQEFISQINLPVVNGSSSGFDLCFQTPSDPSNL---- 364
Query: 301 LFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQ 360
P+ HF + L LP N+F +PSN + CL S G +FG+ QQQ
Sbjct: 365 QIPTFVMHF-DGGDLELPSENYFI---SPSN--GLICLAMGSSSQG----MSIFGNIQQQ 414
Query: 361 NVEVVYDLEKERIGFQPMDCAST 383
N+ VVYD + F C ++
Sbjct: 415 NMLVVYDTGNSVVSFASAQCGAS 437
>gi|125543638|gb|EAY89777.1| hypothetical protein OsI_11319 [Oryza sativa Indica Group]
Length = 390
Score = 124 bits (312), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 116/390 (29%), Positives = 178/390 (45%), Gaps = 61/390 (15%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+Q+ +DTGSDL W C C+ C D + + F SRSS+++ C S+ C
Sbjct: 48 VQLTLDTGSDLIWTQCK----PCVSCFD----QPLPYFDTSRSSTNALLPCESTQC---- 95
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLK-VHGSSPGI 120
DP T++ C TC ++ +YG+ + G+L D V G+S
Sbjct: 96 ----KLDP-TVTVCVKLNQTVQTC-----AYYTSYGDNSVTIGLLAADKFTFVAGTS--- 142
Query: 121 IREIPKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDP 176
+P FGC G GIAGFGRG LS+PSQL FSHCF A
Sbjct: 143 ---LPGVTFGCGLNNTGVFNSNETGIAGFGRGPLSLPSQLKV--GNFSHCFTTITGA--- 194
Query: 177 NISSPLVI---GDVAISSKDNLQFTPML---KSPMYPNYYYIGLEAITIGNSSLTEVPLS 230
I S +++ D+ + + +Q TP++ K+ P YY+ L+ IT+G++ L VP S
Sbjct: 195 -IPSTVLLDLPADLFSNGQGAVQTTPLIQYAKNEANPTLYYLSLKGITVGSTRL-PVPES 252
Query: 231 LREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRV 290
+ G GG ++DSGT+ T LP Y + + I P TG C+
Sbjct: 253 AFAL-TNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKL-PVVPG--NATGHYTCFSA 308
Query: 291 PCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGP 350
P + P + HF ++ LP+ N+ + + + +S + CL D+
Sbjct: 309 P----SQAKPDVPKLVLHF-EGATMDLPRENYVFEVPDDAGNSII-CLAINKGDE----- 357
Query: 351 SGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
+ + G+FQQQN+ V+YDL+ + F C
Sbjct: 358 TTIIGNFQQQNMHVLYDLQNNMLSFVAAQC 387
>gi|225427552|ref|XP_002266498.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 441
Score = 124 bits (311), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 120/382 (31%), Positives = 172/382 (45%), Gaps = 59/382 (15%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGSDLTW C C C Y+ +++ F P SS+ +C +SFCL + +
Sbjct: 109 VDTGSDLTWTQCR----PCTHC--YK--QVVPFFDPKNSSTYRDSSCGTSFCLALGN--- 157
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
D +G + C +F Y+Y +G G L +TL V S+ G P
Sbjct: 158 --DRSCRNG------------KKC-TFMYSYADGSFTGGNLAVETLTV-ASTAGKPVSFP 201
Query: 126 KFCFGCV---GSTYRE-PIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNISS 180
F FGCV G + E GI G G LS+ SQL G FS+C L D ++SS
Sbjct: 202 GFAFGCVHRSGGIFDEHSSGIVGLGVAELSMISQLKSTINGRFSYCLLPV--FTDSSMSS 259
Query: 181 PLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNG 240
+ G I S TP++ YY I LE ++G L+ S + +GN
Sbjct: 260 RINFGRSGIVSGAGTVSTPLVMKGPDTYYYLITLEGFSVGKKRLSYKGFSKKAEVEEGN- 318
Query: 241 GLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGF-DLCYRVPCPNNTFTD 299
++VDSGTTYT+LP FY + L+ ++ + + K V + G LCY NT D
Sbjct: 319 -IIVDSGTTYTYLPLEFYVK----LEESVAHSIKGKRVRDPNGISSLCY------NTTVD 367
Query: 300 DL-FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQ 358
+ P IT HF + ++ L N F M + F + D G+ G+
Sbjct: 368 QIDAPIITAHF-KDANVELQPWNTFLRMQE-------DLVCFTVLPTSDI---GILGNLA 416
Query: 359 QQNVEVVYDLEKERIGFQPMDC 380
Q N V +DL K+R+ F+ DC
Sbjct: 417 QVNFLVGFDLRKKRVSFKAADC 438
>gi|115467014|ref|NP_001057106.1| Os06g0209100 [Oryza sativa Japonica Group]
gi|51091210|dbj|BAD35903.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113595146|dbj|BAF19020.1| Os06g0209100 [Oryza sativa Japonica Group]
gi|125554496|gb|EAZ00102.1| hypothetical protein OsI_22105 [Oryza sativa Indica Group]
Length = 454
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 107/397 (26%), Positives = 169/397 (42%), Gaps = 58/397 (14%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + +DTGSDL W C C+DC + ++ P+ SS+ + C + C +
Sbjct: 103 VALTLDTGSDLVWTQCA----PCLDCFEQGAAPVLD---PAASSTHAALPCDAPLCRAL- 154
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
PF C +S R C + Y YG+ L G L D+ G
Sbjct: 155 ----PFTSCGG---------RSWGDRSC-VYVYHYGDRSLTVGQLATDSFTFGGDDNAGG 200
Query: 122 REIPKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPN 177
+ FGC G GIAGFGRG S+PSQL FS+CF + D
Sbjct: 201 LAARRVTFGCGHINKGIFQANETGIAGFGRGRWSLPSQLNV--TSFSYCFTSM---FDTK 255
Query: 178 ISSPLVIGDVAI--------SSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPL 229
SS + +G A + +++ T ++K+P P+ Y++ L I++G + + VP
Sbjct: 256 SSSVVTLGAAAAELLHTHHAAHTGDVRTTRLIKNPSQPSLYFVPLRGISVGGARVA-VP- 313
Query: 230 SLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYR 289
+S+ ++DSG + T LPE Y + + S + DLC+
Sbjct: 314 -----ESRLRSSTIIDSGASITTLPEDVYEAVKAEFVSQVGL---PAAAAGSAALDLCFA 365
Query: 290 VPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYG 349
+P + P++T H LP+GN+ + ++ V C++ D G
Sbjct: 366 LPVAA-LWRRPAVPALTLHLDGGADWELPRGNYVF----EDYAARVLCVVL----DAAAG 416
Query: 350 PSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASA 386
V G++QQQN VVYDLE + + F P C A++
Sbjct: 417 EQVVIGNYQQQNTHVVYDLENDVLSFAPARCDKLAAS 453
>gi|357158688|ref|XP_003578209.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 443
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 108/385 (28%), Positives = 156/385 (40%), Gaps = 60/385 (15%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGSDL W C CM C D + F P++S S ++ C S C
Sbjct: 106 LDTGSDLIWTQCA----PCMLCVD----QPTPFFDPAQSPSYAKLPCNSPMC-------- 149
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
+ L C R + Y YG+ G+L+ +T + + +P
Sbjct: 150 ------------NALYYPLCYRNVCVYQYFYGDSANTAGVLSNETFTFGTNDTRV--TVP 195
Query: 126 KFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSP 181
+ FGC GS + G+ GFGRG LS+ SQLG FS+C +F + S
Sbjct: 196 RIAFGCGNLNAGSLFNGS-GMVGFGRGPLSLVSQLG--SPRFSYCLTSFMSP----VPSR 248
Query: 182 LVIGDVAI------SSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFD 235
L G A S+ + +Q TP + +P P YY+ + I++G L P D
Sbjct: 249 LYFGAYATLNSTSASTGEPVQSTPFIVNPGLPTMYYLNMTGISVGGELLPIDPSVFAIND 308
Query: 236 SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNN 295
+ G GG+++DSG+T T+L Y + + P D C+ P P
Sbjct: 309 ADGTGGVIIDSGSTITYLARAAYDMVHQAFADQVG-LPLTNATSLADVLDTCFVWPPPPR 367
Query: 296 TFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFG 355
P + FHF ++ LP N+ N CL + DDG + G
Sbjct: 368 KIVT--MPELAFHF-EGANMELPLENYMLIDGDTGN----LCLAIAASDDGS-----IIG 415
Query: 356 SFQQQNVEVVYDLEKERIGFQPMDC 380
SFQ QN V+YD E + F P C
Sbjct: 416 SFQHQNFHVLYDNENSLLSFTPATC 440
>gi|115472517|ref|NP_001059857.1| Os07g0533600 [Oryza sativa Japonica Group]
gi|22831047|dbj|BAC15910.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50508280|dbj|BAD32129.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113611393|dbj|BAF21771.1| Os07g0533600 [Oryza sativa Japonica Group]
gi|215766673|dbj|BAG98901.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 441
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 113/390 (28%), Positives = 171/390 (43%), Gaps = 67/390 (17%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ +DTGSDL W C C + ++P+RS++ + +C S C +
Sbjct: 105 LTAVLDTGSDLIWTQCDAPCRRCFP-------QPAPLYAPARSATYANVSCRSPMCQALQ 157
Query: 62 SSDNPFDPCTM--SGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPG 119
S P+ C+ +GC+ + ++YG+G G+L +T + G
Sbjct: 158 S---PWSRCSPPDTGCA---------------YYFSYGDGTSTDGVLATETFTL-----G 194
Query: 120 IIREIPKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYAND 175
+ FGC +GST G+ G GRG LS+ SQLG + FS+CF F +
Sbjct: 195 SDTAVRGVAFGCGTENLGSTDNSS-GLVGMGRGPLSLVSQLGVTR--FSYCFTPF----N 247
Query: 176 PNISSPLVIGDVAISSKDNLQFTPMLKSP-----MYPNYYYIGLEAITIGNSSLTEVPLS 230
+SPL +G A S + TP + SP +YYY+ LE IT+G++ L P
Sbjct: 248 ATAASPLFLGSSARLSSAA-KTTPFVPSPSGGARRRSSYYYLSLEGITVGDTLLPIDPAV 306
Query: 231 LREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRV 290
R G+GG+++DSGTT+T L E + L L S + P A G LC+
Sbjct: 307 FR-LTPMGDGGVIIDSGTTFTALEERAFVALARALASRVRL-PLASGAH--LGLSLCFAA 362
Query: 291 PCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGP 350
P P + HF + + L + ++ S+ V CL S
Sbjct: 363 ASPEAVEV----PRLVLHF-DGADMELRRESYV----VEDRSAGVACLGMVSARG----- 408
Query: 351 SGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
V GS QQQN ++YDLE+ + F+P C
Sbjct: 409 MSVLGSMQQQNTHILYDLERGILSFEPAKC 438
>gi|125558631|gb|EAZ04167.1| hypothetical protein OsI_26309 [Oryza sativa Indica Group]
Length = 441
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 113/390 (28%), Positives = 171/390 (43%), Gaps = 67/390 (17%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ +DTGSDL W C C + ++P+RS++ + +C S C +
Sbjct: 105 LTAVLDTGSDLIWTQCDAPCRRCFP-------QPAPLYAPARSATYANVSCRSPMCQALQ 157
Query: 62 SSDNPFDPCTM--SGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPG 119
S P+ C+ +GC+ + ++YG+G G+L +T + G
Sbjct: 158 S---PWSRCSPPDTGCA---------------YYFSYGDGTSTDGVLATETFTL-----G 194
Query: 120 IIREIPKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYAND 175
+ FGC +GST G+ G GRG LS+ SQLG + FS+CF F +
Sbjct: 195 SDTAVRGVAFGCGTENLGSTDNSS-GLVGMGRGPLSLVSQLGVTR--FSYCFTPF----N 247
Query: 176 PNISSPLVIGDVAISSKDNLQFTPMLKSP-----MYPNYYYIGLEAITIGNSSLTEVPLS 230
+SPL +G A S + TP + SP +YYY+ LE IT+G++ L P
Sbjct: 248 ATAASPLFLGSSARLSSAA-KTTPFVPSPSGGARRRSSYYYLSLEGITVGDTLLPIDPAV 306
Query: 231 LREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRV 290
R G+GG+++DSGTT+T L E + L L S + P A G LC+
Sbjct: 307 FR-LTPMGDGGVIIDSGTTFTALEESAFVALARALASRVRL-PLASGAH--LGLSLCFAA 362
Query: 291 PCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGP 350
P P + HF + + L + ++ S+ V CL S
Sbjct: 363 ASPEAVEV----PRLVLHF-DGADMELRRESYV----VEDRSAGVACLGMVSARG----- 408
Query: 351 SGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
V GS QQQN ++YDLE+ + F+P C
Sbjct: 409 MSVLGSMQQQNTHILYDLERGILSFEPAKC 438
>gi|297820186|ref|XP_002877976.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
lyrata]
gi|297323814|gb|EFH54235.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
lyrata]
Length = 425
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 113/384 (29%), Positives = 159/384 (41%), Gaps = 69/384 (17%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V +DT +D W+PC C+ C F PS+SSSS C + C
Sbjct: 103 VALDTSNDAAWIPCSG----CVGCSSS------VLFDPSKSSSSRTLQCEAPQC-----K 147
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
P CT+S + C F TYG G + LT+DTL +
Sbjct: 148 QAPNPSCTVS-------------KSC-GFNMTYG-GSAIEAYLTQDTLTLA------TDV 186
Query: 124 IPKFCFGCVGS---TYREPIGIAGFGRGALSVPSQLGFL-QKGFSHCFLAFKYANDPNIS 179
IP + FGC+ T G+ G GRG LS+ SQ L Q FS+C PN
Sbjct: 187 IPNYTFGCINKASGTSLPAQGLMGLGRGPLSLISQSQNLYQSTFSYCL--------PNSK 238
Query: 180 SPLVIGDVAISSKDN---LQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
S G + + K+ ++ TP+LK+P + YY+ L I +GN + ++P S FD
Sbjct: 239 SSNFSGSLRLGPKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNK-IVDIPTSALAFDP 297
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNT 296
G + DSGT YT L EP Y + + + + + GFD CY
Sbjct: 298 ATGAGTIFDSGTVYTRLVEPAYVAMRNEFRRRV----KNANATSLGGFDTCYS------- 346
Query: 297 FTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGS 356
+FPS+TF F +++ LP N SA + + CL + V S
Sbjct: 347 -GSVVFPSVTFMFA-GMNVTLPPDNLLIHSSAGN----LSCLAMAAAPTNVNSVLNVIAS 400
Query: 357 FQQQNVEVVYDLEKERIGFQPMDC 380
QQQN V+ D+ R+G C
Sbjct: 401 MQQQNHRVLIDVPNSRLGISRETC 424
>gi|413952720|gb|AFW85369.1| hypothetical protein ZEAMMB73_571116 [Zea mays]
Length = 451
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 117/395 (29%), Positives = 174/395 (44%), Gaps = 63/395 (15%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + MDTGSDL W C C C D + F PS SS+ C C
Sbjct: 101 VALTMDTGSDLVWTQCT----PCPVCFD----QPFPLFDPSVSSTFRAVACPDPIC---- 148
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKV-----HGS 116
P ++S C+L T C +YG+ + G + +DT G+
Sbjct: 149 ---RPSSGLSVSACALKTFRCFYLC--------SYGDKSITAGYIFKDTFTFMSPNGEGA 197
Query: 117 SPGIIREIPKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKY 172
P + + FGC G GIAGFGRG LS+PSQL + FS+C +
Sbjct: 198 PPVAVSGL---AFGCGDYNTGVFASNESGIAGFGRGPLSLPSQLRVGR--FSYCLTSHD- 251
Query: 173 ANDPNISSPLVIGD----VAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVP 228
+ N +S + +G + S + TP++ SP +P +YY+ LE IT+G + L V
Sbjct: 252 ETESNKTSAVFLGTPPNGLRAHSSGPFRSTPIIHSPSFPTFYYLSLEGITVGKTRL-PVD 310
Query: 229 LSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLS--ILQSTITYYPRAKEVEERTGFDL 286
S+ G+GG ++DSGT T P + QL + + Q + Y EV G L
Sbjct: 311 SSVFALKKDGSGGTVIDSGTGVTTFPAAVFEQLKNEFVAQLPLPRYDNTSEV----GNLL 366
Query: 287 CYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNS-SAVKCLLFQSMDD 345
C++ P P + FH L + + LP+ N+ P ++ S V CL M +
Sbjct: 367 CFQRPKGGKQVP---VPKLIFH-LASADMDLPRENYI-----PEDTDSGVMCL----MIN 413
Query: 346 GDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
G + G+FQQQN+ +VYD+E ++ F C
Sbjct: 414 GAEVDMVLIGNFQQQNMHIVYDVENSKLLFASAQC 448
>gi|413944392|gb|AFW77041.1| hypothetical protein ZEAMMB73_800604 [Zea mays]
Length = 476
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 114/383 (29%), Positives = 174/383 (45%), Gaps = 62/383 (16%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V DTGSD++W+ C S C Y+ + + F P++S++ S C C S
Sbjct: 150 VIFDTGSDVSWIQCLPCSGHC-----YKQHDPI--FDPTKSATYSVVPCGHPQCAAADGS 202
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
CS T L + YG+G G+L+ +TL + + R
Sbjct: 203 K----------CSNGTCL----------YKVEYGDGSSSAGVLSHETLSLTST-----RA 237
Query: 124 IPKFCFGCVGST----YREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNI 178
+P F FGC G T + + G+ G GRG LS+ SQ G FS+C +D
Sbjct: 238 LPGFAFGC-GQTNLGDFGDVDGLIGLGRGQLSLSSQAAASFGGTFSYCL-----PSDNTT 291
Query: 179 SSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQG 238
L IG +S D++Q+T M++ YP++Y++ L +I IG L VP +L D
Sbjct: 292 HGYLTIGPTTPASNDDVQYTAMVQKQDYPSFYFVELVSIDIGGYIL-PVPPTLFTDD--- 347
Query: 239 NGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFT 298
G +DSGT T+LP Y+ L + T+T Y A + FD CY + F
Sbjct: 348 --GTFLDSGTILTYLPPEAYTALRDRFKFTMTQYKPAPAYDP---FDTCYDFTGQSAIF- 401
Query: 299 DDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSS-AVKCLLFQSMDDGDYGPSGVFGSF 357
P+++F F + L + F + P +++ A+ CL F + P + G+
Sbjct: 402 ---IPAVSFKFSDGSVFDL---SFFGILIFPDDTAPAIGCLGFVARPSA--MPFTIVGNM 453
Query: 358 QQQNVEVVYDLEKERIGFQPMDC 380
QQ+N EV+YD+ E+IGF C
Sbjct: 454 QQRNTEVIYDVAAEKIGFASASC 476
>gi|148905906|gb|ABR16115.1| unknown [Picea sitchensis]
Length = 482
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 112/384 (29%), Positives = 171/384 (44%), Gaps = 58/384 (15%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DTGSDLTW+ C C DC +++ + F P +SSS C S+ C + +S
Sbjct: 152 LIIDTGSDLTWIQCK----PCADC----YSQVDAIFEPKQSSSYKTLPCLSATCTELITS 203
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
++ PC + GC + YG+G G +++TL + S
Sbjct: 204 ESNPTPCLLGGCV---------------YEINYGDGSSSQGDFSQETLTLGSDS------ 242
Query: 124 IPKFCFGCVGST----YREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNI 178
F FGC G T ++ G+ G G+ +LS PSQ G F++C L ++
Sbjct: 243 FQNFAFGC-GHTNTGLFKGSSGLLGLGQNSLSFPSQSKSKYGGQFAYC-LPDFGSSTSTG 300
Query: 179 SSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQG 238
S + G + S+ FTP++ + MYP +Y++GL I++G L+ P L G
Sbjct: 301 SFSVGKGSIPASAV----FTPLVSNFMYPTFYFVGLNGISVGGDRLSIPPAVL------G 350
Query: 239 NGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFT 298
G +VDSGT T L Y+ L + +S P AK D CY + + +
Sbjct: 351 RGSTIVDSGTVITRLLPQAYNALKTSFRSKTRDLPSAKPFSI---LDTCYDL----SRHS 403
Query: 299 DDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQ 358
P+ITFHF NN + + + + N + CL F S D + G+FQ
Sbjct: 404 QVRIPTITFHFQNNADVAV---SDVGILVPVQNGGSQVCLAFASASQMD--GFNIIGNFQ 458
Query: 359 QQNVEVVYDLEKERIGFQPMDCAS 382
QQ + V +D RIGF CA+
Sbjct: 459 QQRMRVAFDTGAGRIGFASGSCAA 482
>gi|15232503|ref|NP_191008.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|7288018|emb|CAB81805.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|17979257|gb|AAL49945.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
gi|21700851|gb|AAM70549.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
gi|332645705|gb|AEE79226.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 425
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 113/384 (29%), Positives = 159/384 (41%), Gaps = 69/384 (17%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V +DT +D W+PC C+ C F PS+SSSS C + C
Sbjct: 103 VALDTSNDAAWIPCSG----CVGCSSS------VLFDPSKSSSSRTLQCEAPQC-----K 147
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
P CT+S + C F TYG G + LT+DTL +
Sbjct: 148 QAPNPSCTVS-------------KSC-GFNMTYG-GSTIEAYLTQDTLTLASD------V 186
Query: 124 IPKFCFGCVGS---TYREPIGIAGFGRGALSVPSQLGFL-QKGFSHCFLAFKYANDPNIS 179
IP + FGC+ T G+ G GRG LS+ SQ L Q FS+C PN
Sbjct: 187 IPNYTFGCINKASGTSLPAQGLMGLGRGPLSLISQSQNLYQSTFSYCL--------PNSK 238
Query: 180 SPLVIGDVAISSKDN---LQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
S G + + K+ ++ TP+LK+P + YY+ L I +GN + ++P S FD
Sbjct: 239 SSNFSGSLRLGPKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNK-IVDIPTSALAFDP 297
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNT 296
G + DSGT YT L EP Y + + + + + GFD CY
Sbjct: 298 ATGAGTIFDSGTVYTRLVEPAYVAVRNEFRRRV----KNANATSLGGFDTCYS------- 346
Query: 297 FTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGS 356
+FPS+TF F +++ LP N SA + + CL + V S
Sbjct: 347 -GSVVFPSVTFMFA-GMNVTLPPDNLLIHSSAGN----LSCLAMAAAPVNVNSVLNVIAS 400
Query: 357 FQQQNVEVVYDLEKERIGFQPMDC 380
QQQN V+ D+ R+G C
Sbjct: 401 MQQQNHRVLIDVPNSRLGISRETC 424
>gi|21617933|gb|AAM66983.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
Length = 425
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 113/384 (29%), Positives = 159/384 (41%), Gaps = 69/384 (17%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V +DT +D W+PC C+ C F PS+SSSS C + C
Sbjct: 103 VALDTSNDAAWIPCSG----CVGCSSS------VLFDPSKSSSSRTLQCEAPQC-----K 147
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
P CT+S + C F TYG G + LT+DTL +
Sbjct: 148 QAPNPSCTVS-------------KSC-GFNMTYG-GSTIEAYLTQDTLTLASD------V 186
Query: 124 IPKFCFGCVGS---TYREPIGIAGFGRGALSVPSQLGFL-QKGFSHCFLAFKYANDPNIS 179
IP + FGC+ T G+ G GRG LS+ SQ L Q FS+C PN
Sbjct: 187 IPNYTFGCINKASGTSLPAQGLMGLGRGPLSLISQSQNLYQSTFSYCL--------PNSK 238
Query: 180 SPLVIGDVAISSKDN---LQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
S G + + K+ ++ TP+LK+P + YY+ L I +GN + ++P S FD
Sbjct: 239 SSNFSGSLRLGPKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNK-IVDIPTSALAFDP 297
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNT 296
G + DSGT YT L EP Y + + + + + GFD CY
Sbjct: 298 ATGAGTIFDSGTVYTRLVEPAYVAVRNEFRRRV----KNANATSLGGFDTCYS------- 346
Query: 297 FTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGS 356
+FPS+TF F +++ LP N SA + + CL + V S
Sbjct: 347 -GSVVFPSVTFMFA-GMNVTLPPDNLLIHSSAGN----LSCLAMAAAPVNVNSVLNVIAS 400
Query: 357 FQQQNVEVVYDLEKERIGFQPMDC 380
QQQN V+ D+ R+G C
Sbjct: 401 MQQQNHRVLIDVPNSRLGISRETC 424
>gi|116787333|gb|ABK24467.1| unknown [Picea sitchensis]
Length = 497
Score = 121 bits (304), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 110/378 (29%), Positives = 167/378 (44%), Gaps = 51/378 (13%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DTGSD+ W+ C C C Y + F+P+ SS+ + CA+ C +
Sbjct: 168 MVLDTGSDIMWIQC----LPCAKC--YGQTDPL--FNPAASSTYRKVPCATPLCKKLD-- 217
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
+SGC R C + +YG+G G + +TL G +IR
Sbjct: 218 --------ISGCRNK--------RYC-EYQVSYGDGSFTVGDFSTETLTFRGQ---VIRR 257
Query: 124 IPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGF-LQKGFSHCFLAFKYANDPNISSPL 182
+ C + G+ G GRG+LS PSQ G K FS+C + + +S L
Sbjct: 258 VALGCGHDNEGLFIGAAGLLGLGRGSLSFPSQTGAQFSKRFSYCLVD---RSASGTASSL 314
Query: 183 VIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGGL 242
+ G AI + FTP+L +P +YY+ L I++G LT +P S+ D+ GNGG+
Sbjct: 315 IFGKAAI--PKSAIFTPLLSNPKLDTFYYVELVGISVGGRRLTSIPASVFRMDATGNGGV 372
Query: 243 LVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDLF 302
++DSGT+ T L + YS + + K + FD CY +
Sbjct: 373 IIDSGTSVTRLVDSAYSTMRDAFRVGT---GNLKSAGGFSLFDTCYDLSGLKTV----KV 425
Query: 303 PSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQNV 362
P++ FHF + LP N+ P +SSA C F G+ G + G+ QQQ
Sbjct: 426 PTLVFHFQGGAHISLPATNYLI----PVDSSATFCFAFA----GNTGGLSIIGNIQQQGY 477
Query: 363 EVVYDLEKERIGFQPMDC 380
VV+D R+GF+ C
Sbjct: 478 RVVFDSLANRVGFKAGSC 495
>gi|242058537|ref|XP_002458414.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
gi|241930389|gb|EES03534.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
Length = 448
Score = 121 bits (303), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 109/387 (28%), Positives = 159/387 (41%), Gaps = 50/387 (12%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V +DTGSDL W+ C C C YR ++ + P SS+ R CAS C ++
Sbjct: 103 VVIDTGSDLIWLQC----VPCRHC--YR--QVTPLYDPRSSSTHRRIPCASPRCRDV--- 151
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
GC T C + YG+G +G L D L +
Sbjct: 152 ------LRYPGCDART---GGCV-----YMVVYGDGSASSGDLATDRLVFPDDT-----H 192
Query: 124 IPKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCF---LAFKYANDP 176
+ GC VG G+ G GRG LS P+QL + H F L + +
Sbjct: 193 VHNVTLGCGHDNVG-LLESAAGLLGVGRGQLSFPTQLA---PAYGHVFSYCLGDRLSRAQ 248
Query: 177 NISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVP-LSLREFD 235
N SS LV G + FTP+ +P P+ YY+ + ++G +T SL
Sbjct: 249 NGSSYLVFGRT--PEPPSTAFTPLRTNPRRPSLYYVDMVGFSVGGERVTGFSNASLALNP 306
Query: 236 SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEER-TGFDLCYRVPCPN 294
+ G GG++VDSGT + Y+ + S +++ + + FD CY +
Sbjct: 307 ATGRGGIVVDSGTAISRFARDAYAAVRDAFDSHAAAAGTMRKLATKFSVFDACYDLRGNG 366
Query: 295 NTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVF 354
PSI HF + LPQ N+ + + CL Q+ DDG V
Sbjct: 367 APAAAVRVPSIVLHFAGGADMALPQANYLIPVQG-GDRRTYFCLGLQAADDG----LNVL 421
Query: 355 GSFQQQNVEVVYDLEKERIGFQPMDCA 381
G+ QQQ +V+D+E+ RIGF P C+
Sbjct: 422 GNVQQQGFGLVFDVERGRIGFTPNGCS 448
>gi|242044888|ref|XP_002460315.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
gi|241923692|gb|EER96836.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
Length = 456
Score = 121 bits (303), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 110/388 (28%), Positives = 167/388 (43%), Gaps = 55/388 (14%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNI- 60
+ +DTGSDL W C + C+ D F+P S+S CA C +I
Sbjct: 115 VSALLDTGSDLIWTQCAPCA-SCLAQPD-------PLFAPGESASYEPMRCAGQLCSDIL 166
Query: 61 -HSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPG 119
H + P D CT + Y YG+G + G+ + S
Sbjct: 167 HHGCEMP-DTCT--------------------YRYNYGDGTMTMGVYATERFTFTSSGGD 205
Query: 120 IIREIPKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYAND 175
+ +P FGC VGS GI GFGR LS+ SQL + FS+C ++
Sbjct: 206 RLMTVP-LGFGCGSMNVGS-LNNGSGIVGFGRNPLSLVSQLSI--RRFSYCLTSYGSGRK 261
Query: 176 PNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFD 235
+ + G V + +Q TP+L+S P +YY+ L +T+G L +P S
Sbjct: 262 STLLFGSLSGGVYGDATGPVQTTPLLQSLQNPTFYYVHLAGLTVGARRL-RIPESAFALR 320
Query: 236 SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCP-- 293
G+GG++VDSGT T LP ++++ + + P A G +C+ VP
Sbjct: 321 PDGSGGVIVDSGTALTLLPGAVLAEVVRAFRQQLR-LPFANGGNPEDG--VCFLVPAAWR 377
Query: 294 -NNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSG 352
+++ + P + FHF + L LP+ N+ + CLL D GD G +
Sbjct: 378 RSSSTSQVPVPRMVFHF-QDADLDLPRRNYVL----DDHRKGRLCLLLA--DSGDDGST- 429
Query: 353 VFGSFQQQNVEVVYDLEKERIGFQPMDC 380
G+ QQ++ V+YDLE E + F P C
Sbjct: 430 -IGNLVQQDMRVLYDLEAETLSFAPAQC 456
>gi|108707838|gb|ABF95633.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 391
Score = 121 bits (303), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 113/389 (29%), Positives = 173/389 (44%), Gaps = 58/389 (14%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+Q+ +DTGSDL W C C C D + + F PS SS+ S +C S+ C +
Sbjct: 48 VQLTLDTGSDLIWTQCQ----PCPACFD----QALPYFDPSTSSTLSLTSCDSTLCQGL- 98
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
++ C + C + Y+YG+ + TG L D G+
Sbjct: 99 ---------PVASCGSPKFWPNQTCV----YTYSYGDKSVTTGFLEVDKFTFVGAG---- 141
Query: 122 REIPKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPN 177
+P FGC G GIAGFGRG LS+PSQL FSHCF A
Sbjct: 142 ASVPGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKV--GNFSHCFTTITGA---- 195
Query: 178 ISSPLVI---GDVAISSKDNLQFTPML---KSPMYPNYYYIGLEAITIGNSSLTEVPLSL 231
I S +++ D+ + + +Q TP++ K+ P YY+ L+ IT+G++ L VP S
Sbjct: 196 IPSTVLLDLPADLFSNGQGAVQTTPLIQYAKNEANPTLYYLSLKGITVGSTRL-PVPESA 254
Query: 232 REFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVP 291
+ G GG ++DSGT+ T LP Y + + I P TG C+ P
Sbjct: 255 FAL-TNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIK-LPVVP--GNATGHYTCFSAP 310
Query: 292 CPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPS 351
+ P + HF ++ LP+ N+ + + + +S + CL D+ +
Sbjct: 311 ----SQAKPDVPKLVLHF-EGATMDLPRENYVFEVPDDAGNSII-CLAINKGDE-----T 359
Query: 352 GVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
+ G+FQQQN+ V+YDL+ + F C
Sbjct: 360 TIIGNFQQQNMHVLYDLQNNMLSFVAAQC 388
>gi|302764208|ref|XP_002965525.1| hypothetical protein SELMODRAFT_406966 [Selaginella moellendorffii]
gi|300166339|gb|EFJ32945.1| hypothetical protein SELMODRAFT_406966 [Selaginella moellendorffii]
Length = 464
Score = 120 bits (302), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 114/391 (29%), Positives = 168/391 (42%), Gaps = 82/391 (20%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ MDTGSDLTWV C S DC S F S++ TCA L +
Sbjct: 139 LVMDTGSDLTWVRCDPCSPDCS-----------STFDRLASNTYKALTCADDLRLPV--- 184
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
L L + +G RDTLK+ G++ + E
Sbjct: 185 -------------LLRLWRRL----------------FHSGRSLRDTLKMAGAASDELEE 215
Query: 124 IPKFCFGCVGSTYRE----PIGIAGFGRGALSVPSQLG-FLQKGFSHCFLAFKYANDPNI 178
P F FGC GS + +GI G+LS PSQ+G FS+C L + A +
Sbjct: 216 FPGFVFGC-GSLLKGLISGEVGILALSPGSLSFPSQIGEKYGNKFSYCLLR-QTAQNSLK 273
Query: 179 SSPLVIGDVAISSKD-------NLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSL 231
SP+V G+ A+ K+ LQ+TP+ +S +Y Y + L+ I++GN L LS
Sbjct: 274 KSPMVFGEAAVELKEPGSGKPQELQYTPIGESSIY---YTVRLDGISVGNQRLD---LSP 327
Query: 232 REFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVP 291
F + + + DSGTT T LP + L S ++ E G D C+RVP
Sbjct: 328 STFLNGQDKPTIFDSGTTLTMLPSGVCDSIKQSLASMVS----GAEFVAIKGLDACFRVP 383
Query: 292 CPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPS 351
+ P ITFHF V N+ + + ++CL+F ++
Sbjct: 384 PSSGQG----LPDITFHFNGGADFVTRPSNYVIDLGS------LQCLIFVPTNE-----V 428
Query: 352 GVFGSFQQQNVEVVYDLEKERIGFQPMDCAS 382
+FG+ QQQ+ V++D++ RIGF+ DC +
Sbjct: 429 SIFGNLQQQDFFVLHDMDNRRIGFKETDCGA 459
>gi|168013126|ref|XP_001759252.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689565|gb|EDQ75936.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 385
Score = 120 bits (302), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 111/381 (29%), Positives = 169/381 (44%), Gaps = 52/381 (13%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ MDTGSD+ W+ C CD+ F P +SS+ S C S CLN+
Sbjct: 52 LVMDTGSDILWLQCAPCVSCYHQCDEV--------FDPYKSSTYSTLGCNSRQCLNLD-- 101
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPG---I 120
+ GC + L + YG+G TG D + ++ +S G +
Sbjct: 102 --------VGGCVGNKCL----------YQVDYGDGSFSTGEFATDAVSLNSTSGGGQVV 143
Query: 121 IREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNIS 179
+ +IP C + G+ G G+G LS P+Q+ G FS+C D
Sbjct: 144 LNKIPLGCGHDNEGYFVGAAGLLGLGKGPLSFPNQINSENGGRFSYCLTGRD--TDSTER 201
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
S L+ GD A+ ++FTP + +YY+ + I++G S LT +P S + DS GN
Sbjct: 202 SSLIFGDAAVPPA-GVRFTPQASNLRVSTFYYLKMTGISVGGSILT-IPTSAFQLDSLGN 259
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTD 299
GG+++DSGT+ T L Y+ L ++ + E FD CY + + +
Sbjct: 260 GGVIIDSGTSVTRLQNAAYASLREAFRAGTSDLVLTTEFSL---FDTCYNL----SDLSS 312
Query: 300 DLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQ 359
P++T HF L LP N+ P ++S+ CL F G GPS + G+ QQ
Sbjct: 313 VDVPTVTLHFQGGADLKLPASNYL----VPVDNSSTFCLAFA----GTTGPS-IIGNIQQ 363
Query: 360 QNVEVVYDLEKERIGFQPMDC 380
Q V+YD ++GF P C
Sbjct: 364 QGFRVIYDNLHNQVGFVPSQC 384
>gi|357143847|ref|XP_003573077.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 420
Score = 120 bits (302), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 111/380 (29%), Positives = 171/380 (45%), Gaps = 55/380 (14%)
Query: 7 DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDNP 66
DTGSDLTW C C D + PS SS+ S C+S+ CL I S +
Sbjct: 89 DTGSDLTWTQCQPCKL-CFPQD-------TPVYDPSASSTFSPLPCSSATCLPIWSRN-- 138
Query: 67 FDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIPK 126
CT S+ CR + Y YG+G GIL +TL + SS + +
Sbjct: 139 ---CT----------PSSLCR----YRYAYGDGAYSAGILGTETLTLGPSSAPV--SVGG 179
Query: 127 FCFGCV---GSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSPLV 183
FGC G G G GRG LS+ +QLG + FS+C F + + SP +
Sbjct: 180 VAFGCGTDNGGDSLNSTGTVGLGRGTLSLLAQLGVGK--FSYCLTDFF---NSALDSPFL 234
Query: 184 IGDVA--ISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGG 241
+G +A +Q TP+L+SP P+ Y++ L+ I++G+ L +P + G GG
Sbjct: 235 LGTLAELAPGPSTVQSTPLLQSPQNPSRYFVSLQGISLGDVRL-PIPNGTFDLRGDGTGG 293
Query: 242 LLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDL 301
++VDSGTT+T L E + +++ + + P V + C+ P +
Sbjct: 294 MIVDSGTTFTILAESGFREVVGRVARVLGQPP----VNASSLDAPCFPAPAGEPPY---- 345
Query: 302 FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQN 361
P + HF + L + N+ MS S+ CL + V G+FQQQN
Sbjct: 346 MPDLVLHFAGGADMRLYRDNY---MSYNEEDSSF-CLNIAGTTPES---TSVLGNFQQQN 398
Query: 362 VEVVYDLEKERIGFQPMDCA 381
+++++D ++ F P DC+
Sbjct: 399 IQMLFDTTVGQLSFLPTDCS 418
>gi|413951280|gb|AFW83929.1| hypothetical protein ZEAMMB73_279135 [Zea mays]
Length = 451
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 109/380 (28%), Positives = 162/380 (42%), Gaps = 59/380 (15%)
Query: 1 VIQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNI 60
+ V +D +D WVPC + +F P+RSS+ C + C
Sbjct: 119 ALLVAIDPSNDAAWVPCAACA----------GCARAPSFDPTRSSTYRPVRCGAPQC--- 165
Query: 61 HSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
S P C L S+C +F +Y +L +D L +H
Sbjct: 166 --SQAPAPSCPGG-------LGSSC-----AFNLSYAASTF-QALLGQDALALHDD---- 206
Query: 121 IREIPKFCFGC---VGSTYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDP 176
+ + + FGC V P G+ GFGRG LS PSQ + FS+C ++K +N
Sbjct: 207 VDAVAAYTFGCLHVVTGGSVPPQGLVGFGRGPLSFPSQTKDVYGSVFSYCLPSYKSSN-- 264
Query: 177 NISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
S L +G ++ TP+L +P P+ YY+ + I +G + VP S FD
Sbjct: 265 -FSGTLRLGPAG--QPKRIKTTPLLSNPHRPSLYYVNMVGIRVGGRPV-PVPASALAFDP 320
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNT 296
G +VD+GT +T L P Y+ + + +S + RA GFD CY N T
Sbjct: 321 TSGRGTIVDAGTMFTRLSAPVYAAVRDVFRSRV----RAPVAGPLGGFDTCY-----NVT 371
Query: 297 FTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQS-MDDGDYGPSGVFG 355
+ P++TF F VS+ LP+ N S+S + CL + DG V
Sbjct: 372 IS---VPTVTFSFDGRVSVTLPEENVVIR----SSSGGIACLAMAAGPPDGVDAALNVLA 424
Query: 356 SFQQQNVEVVYDLEKERIGF 375
S QQQN V++D+ R+GF
Sbjct: 425 SMQQQNHRVLFDVANGRVGF 444
>gi|125558632|gb|EAZ04168.1| hypothetical protein OsI_26310 [Oryza sativa Indica Group]
gi|125600539|gb|EAZ40115.1| hypothetical protein OsJ_24558 [Oryza sativa Japonica Group]
Length = 453
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 109/384 (28%), Positives = 177/384 (46%), Gaps = 49/384 (12%)
Query: 7 DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDNP 66
DTGSDL W C C + + ++PS S + C+S+ LN+ +++
Sbjct: 110 DTGSDLVWT-------QCAPCGERCFKQPSPLYNPSSSPTFRVLPCSSA--LNLCAAEAR 160
Query: 67 FDPCTMS-GCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
T GC+ CR + TYG G +G+ +T GSSP +P
Sbjct: 161 LAGATPPPGCA---------CR----YNQTYGTG-WTSGLQGSETF-TFGSSPADQVRVP 205
Query: 126 KFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNISSPLVI 184
FGC ++ + G AG S + L G FS+C F+ D S L++
Sbjct: 206 GIAFGCSNASSDDWNGSAGLVGLGRGGLSLVSQLAAGMFSYCLTPFQ---DTKSKSTLLL 262
Query: 185 GDVAISSKDN---LQFTPMLKSPMYP---NYYYIGLEAITIGNSSLTEVPLSLREFDSQG 238
G A ++ N ++ TP + SP P YYY+ L I++G ++L +P + G
Sbjct: 263 GPAAAAAALNGTGVRSTPFVPSPSKPPMSTYYYLNLTGISVGAAAL-PIPPGAFALRADG 321
Query: 239 NGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFT 298
GGL++DSGTT T L + Y ++ + ++S + + TG DLC+ +P +++
Sbjct: 322 TGGLIIDSGTTITSLVDAAYKRVRAAVRSLVKL--PVTDGSNATGLDLCFALP--SSSAP 377
Query: 299 DDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQ 358
PS+T HF +VLP N+ + CL +S DG+ G++Q
Sbjct: 378 PATLPSMTLHFGGGADMVLPVENYMIL------DGGMWCLAMRSQTDGELS---TLGNYQ 428
Query: 359 QQNVEVVYDLEKERIGFQPMDCAS 382
QQN+ ++YD++KE + F P C++
Sbjct: 429 QQNLHILYDVQKETLSFAPAKCST 452
>gi|357158691|ref|XP_003578210.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 459
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 113/392 (28%), Positives = 164/392 (41%), Gaps = 62/392 (15%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNI- 60
+ +DTGSDL W C + C+ D FSP SSS CA C +I
Sbjct: 117 VSALLDTGSDLIWTQCAPCA-SCLPQPD-------PIFSPGASSSYEPMRCAGELCNDIL 168
Query: 61 -HSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILT--RDTLKVHGSS 117
HS P D CT + Y+YG+G G+ R T S
Sbjct: 169 HHSCQRP-DTCT--------------------YRYSYGDGTTTRGVYATERFTFSSSSSG 207
Query: 118 PGIIREIPKFCFGCVGSTYREPI----GIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYA 173
+ FGC G+ + + GI GFGR LS+ SQL + FS+C +
Sbjct: 208 GETTKLSAPLGFGC-GTMNKGSLNNGSGIVGFGRAPLSLVSQLAI--RRFSYCLTPYASG 264
Query: 174 NDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLRE 233
+ + G V ++ +Q T +L+S P +YY+ +T+G L +P+S
Sbjct: 265 RKSTLLFGSLRGGVYDAATATVQTTRLLRSRQNPTFYYVPFTGVTVGARRL-RIPISAFA 323
Query: 234 FDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCY----- 288
G+GG +VDSGT T P P ++++ +S + P A +C+
Sbjct: 324 LRPDGSGGAIVDSGTALTLFPAPVLAEVVRAFRSQLR-LPFAANGSSGPDDGVCFAAAAS 382
Query: 289 RVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDY 348
RVP P + P + FH L L LP+ N+ N CLL D GD
Sbjct: 383 RVPRPA------VVPRMVFH-LQGADLDLPRRNYVLDDQRKGN----LCLLL--ADSGDS 429
Query: 349 GPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
G + G+F QQ++ V+YDLE + + F P C
Sbjct: 430 GTT--IGNFVQQDMRVLYDLEADTLSFAPAQC 459
>gi|225440729|ref|XP_002275391.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
gi|147789748|emb|CAN67404.1| hypothetical protein VITISV_025615 [Vitis vinifera]
Length = 450
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 110/385 (28%), Positives = 168/385 (43%), Gaps = 39/385 (10%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCD-DYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSD 64
+DTGSD+ W PC + C +C + K + F P SSSS C + C++
Sbjct: 95 VDTGSDVVWAPC-TTDYTCTNCSFSAADPKKVPIFDPKLSSSSKILDCRNPKCVST---- 149
Query: 65 NPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREI 124
+ P GC C CP ++ YG G +G + LK + I
Sbjct: 150 --YFPYVHLGCPRCNGNSKHCSYACP-YSTQYGTGA-SSGYFLLENLKFPR------KTI 199
Query: 125 PKFCFGCVGSTYRE--PIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSPL 182
F GC S RE +AGFGR S+P Q+G K F++C + Y + N S
Sbjct: 200 RNFLLGCTTSAARELSSDALAGFGRSMFSLPIQMGV--KKFAYCLNSHDYDDTRN--SGK 255
Query: 183 VIGDVAISSKDNLQFTPMLKSPMYPNYYY-IGLEAITIGNSSLTEVPLSLREFDSQGNGG 241
+I D L +TP LKSP +YY +G++ I IGN L +P S G G
Sbjct: 256 LILDYRDGKTKGLSYTPFLKSPPASAFYYHLGVKDIKIGNK-LLRIPSKYLAPGSDGRSG 314
Query: 242 LLVDSGTTYT-HLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDD 300
+++DSG ++ P + + + L+ ++ Y R+ E E +TG PC N T
Sbjct: 315 VIIDSGYGGAGYMTGPVFKIVTNELKKQMSKYRRSLEAETQTGL-----TPCYNFTGHKS 369
Query: 301 L-FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDY----GPSGVFG 355
+ P + + F ++V+P N+F S ++ C L + PS + G
Sbjct: 370 IKIPPLIYQFRGGANMVVPGKNYF----GISPQESLACFLMDTNGTNALEITPDPSIILG 425
Query: 356 SFQQQNVEVVYDLEKERIGFQPMDC 380
+ Q + V YDL+ +R GF+ C
Sbjct: 426 NSQHVDYYVEYDLKNDRFGFRRQTC 450
>gi|22831049|dbj|BAC15912.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50508281|dbj|BAD32130.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 453
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 109/384 (28%), Positives = 177/384 (46%), Gaps = 49/384 (12%)
Query: 7 DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDNP 66
DTGSDL W C C + + ++PS S + C+S+ LN+ +++
Sbjct: 110 DTGSDLVWT-------QCAPCGERCFKQPSPLYNPSSSPTFRVLPCSSA--LNLCAAEAR 160
Query: 67 FDPCTMS-GCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
T GC+ CR + TYG G +G+ +T GSSP +P
Sbjct: 161 LAGATPPPGCA---------CR----YNQTYGTG-WTSGLQGSETF-TFGSSPADQVRVP 205
Query: 126 KFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNISSPLVI 184
FGC ++ + G AG S + L G FS+C F+ D S L++
Sbjct: 206 GIAFGCSNASSDDWNGSAGLVGLGRGGLSLVSQLAAGMFSYCLTPFQ---DTKSKSTLLL 262
Query: 185 GDVAISSKDN---LQFTPMLKSPMYP---NYYYIGLEAITIGNSSLTEVPLSLREFDSQG 238
G A ++ N ++ TP + SP P YYY+ L I++G ++L +P + G
Sbjct: 263 GPAAAAAALNGTGVRSTPFVPSPSKPPMSTYYYLNLTGISVGPAAL-PIPPGAFALRADG 321
Query: 239 NGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFT 298
GGL++DSGTT T L + Y ++ + ++S + + TG DLC+ +P +++
Sbjct: 322 TGGLIIDSGTTITSLVDAAYKRVRAAVRSLVKL--PVTDGSNATGLDLCFALP--SSSAP 377
Query: 299 DDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQ 358
PS+T HF +VLP N+ + CL +S DG+ G++Q
Sbjct: 378 PATLPSMTLHFGGGADMVLPVENYMIL------DGGMWCLAMRSQTDGELS---TLGNYQ 428
Query: 359 QQNVEVVYDLEKERIGFQPMDCAS 382
QQN+ ++YD++KE + F P C++
Sbjct: 429 QQNLHILYDVQKETLSFAPAKCST 452
>gi|18461217|dbj|BAB84414.1| chloroplast nucleoid DNA-binding protein cnd41-like [Oryza sativa
Japonica Group]
Length = 446
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 106/383 (27%), Positives = 155/383 (40%), Gaps = 41/383 (10%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DTGSDL W+ C C C R F P RSS+ R C+S C +
Sbjct: 101 LVIDTGSDLVWLQCS----PCRRCYAQRGQV----FDPRRSSTYRRVPCSSPQCRALR-- 150
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
F C G + CR + YG+G TG L D L + +
Sbjct: 151 ---FPGCDSGGAA------GGGCR----YMVAYGDGSSSTGDLATDKLAFANDT--YVNN 195
Query: 124 IPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLG-FLQKGFSHCFLAFKYANDPNISSPL 182
+ C + G+ G GRG +S+ +Q+ F +C + SS L
Sbjct: 196 VTLGCGRDNEGLFDSAAGLLGVGRGKISISTQVAPAYGSVFEYCL--GDRTSRSTRSSYL 253
Query: 183 VIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ-GNGG 241
V G + FT +L +P P+ YY+ + ++G +T + D+ G GG
Sbjct: 254 VFGRT--PEPPSTAFTALLSNPRRPSLYYVDMAGFSVGGERVTGFSNASLALDTATGRGG 311
Query: 242 LLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDL 301
++VDSGT + Y+ L + + E + FD CY +
Sbjct: 312 VVVDSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHSVFDACYDL----RGRPAAS 367
Query: 302 FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSA--VKCLLFQSMDDGDYGPSGVFGSFQQ 359
P I HF + LP N+F + +A +CL F++ DDG V G+ QQ
Sbjct: 368 APLIVLHFAGGADMALPPENYFLPVDGGRRRAASYRRCLGFEAADDG----LSVIGNVQQ 423
Query: 360 QNVEVVYDLEKERIGFQPMDCAS 382
Q VV+D+EKERIGF P C S
Sbjct: 424 QGFRVVFDVEKERIGFAPKGCTS 446
>gi|357127293|ref|XP_003565317.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial
[Brachypodium distachyon]
Length = 540
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 105/380 (27%), Positives = 163/380 (42%), Gaps = 49/380 (12%)
Query: 3 QVYM--DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNI 60
Q+YM DTGSD+TW+ C C DC Y + + F P+ SSS + C S C +
Sbjct: 208 QLYMVLDTGSDVTWLQCA----PCADC--YAQSDPL--FDPALSSSYATVPCDSPHCRAL 259
Query: 61 HSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
+ S C + ++ C + YG+G G +TL + G
Sbjct: 260 DA----------SACHNNAANGNSSC----VYEVAYGDGSYTVGDFATETLTLGGDGSAA 305
Query: 121 IREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISS 180
+ ++ C + G+ G G LS PSQ+ + FS+C + D +S
Sbjct: 306 VHDVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQISATE--FSYCLV----DRDSPSAS 359
Query: 181 PLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNG 240
L G +S + P+++SP +YY+ L I++G +L+++P + D QG+G
Sbjct: 360 TLQFG----ASDSSTVTAPLMRSPRSNTFYYVALNGISVGGETLSDIPPAAFAMDEQGSG 415
Query: 241 GLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDD 300
G++VDSGT T L YS L PRA V FD CY + ++
Sbjct: 416 GVIVDSGTAVTRLQSSAYSALRDAFVRGTQALPRASGVSL---FDTCYDLAGRSSV---- 468
Query: 301 LFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQ 360
P+++ F L LP N+ P + + CL F + G + G+ QQQ
Sbjct: 469 QVPAVSLRFEGGGELKLPAKNYLI----PVDGAGTYCLAFAATG----GAVSIVGNVQQQ 520
Query: 361 NVEVVYDLEKERIGFQPMDC 380
+ V +D K +GF P C
Sbjct: 521 GIRVSFDTAKNTVGFSPNKC 540
>gi|414880708|tpg|DAA57839.1| TPA: hypothetical protein ZEAMMB73_997765 [Zea mays]
Length = 452
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 110/385 (28%), Positives = 156/385 (40%), Gaps = 48/385 (12%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V +DTGSDL W+ C C C YR ++ + P S + R CAS C +
Sbjct: 107 VVIDTGSDLIWLQC----LPCRRC--YR--QVTPLYDPRNSKTHRRIPCASPQCRGV--- 155
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
GC T C + YG+G +G L DTL + +
Sbjct: 156 ------LRYPGCDART---GGCV-----YMVVYGDGSASSGDLATDTLVLPDDT-----R 196
Query: 124 IPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGFLQKGFSHCF---LAFKYANDPN 177
+ GC G+ G GRG LS P+QL + H F L + + N
Sbjct: 197 VHNVTLGCGHDNEGLLASAAGLLGAGRGQLSFPTQLA---PAYGHVFSYCLGDRMSRARN 253
Query: 178 ISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVP-LSLREFDS 236
SS LV G + FTP+ +P P+ YY+ + ++G + SL +
Sbjct: 254 SSSYLVFGRT--PELPSTAFTPLRTNPRRPSLYYVDMVGFSVGGERVAGFSNASLALNPA 311
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNT 296
G GG++VDSGT + Y+ + S + + + FD CY V N
Sbjct: 312 TGRGGVVVDSGTAISRFTRDAYAAVRDAFVSHAAAAGMRRLRNKFSVFDTCYDVHG-NGP 370
Query: 297 FTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGS 356
T PSI HF + LPQ N+ + + CL Q+ DDG V G+
Sbjct: 371 GTGVRVPSIVLHFAAAADMALPQANYLIPVVG-GDRRTYFCLGLQAADDG----LNVLGN 425
Query: 357 FQQQNVEVVYDLEKERIGFQPMDCA 381
QQQ VV+D+E+ RIGF P C+
Sbjct: 426 VQQQGFGVVFDVERGRIGFTPNGCS 450
>gi|115472519|ref|NP_001059858.1| Os07g0533800 [Oryza sativa Japonica Group]
gi|113611394|dbj|BAF21772.1| Os07g0533800 [Oryza sativa Japonica Group]
Length = 458
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 109/384 (28%), Positives = 177/384 (46%), Gaps = 49/384 (12%)
Query: 7 DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDNP 66
DTGSDL W C C + + ++PS S + C+S+ LN+ +++
Sbjct: 115 DTGSDLVWT-------QCAPCGERCFKQPSPLYNPSSSPTFRVLPCSSA--LNLCAAEAR 165
Query: 67 FDPCTMS-GCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
T GC+ CR + TYG G +G+ +T GSSP +P
Sbjct: 166 LAGATPPPGCA---------CR----YNQTYGTG-WTSGLQGSETF-TFGSSPADQVRVP 210
Query: 126 KFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNISSPLVI 184
FGC ++ + G AG S + L G FS+C F+ D S L++
Sbjct: 211 GIAFGCSNASSDDWNGSAGLVGLGRGGLSLVSQLAAGMFSYCLTPFQ---DTKSKSTLLL 267
Query: 185 GDVAISSKDN---LQFTPMLKSPMYP---NYYYIGLEAITIGNSSLTEVPLSLREFDSQG 238
G A ++ N ++ TP + SP P YYY+ L I++G ++L +P + G
Sbjct: 268 GPAAAAAALNGTGVRSTPFVPSPSKPPMSTYYYLNLTGISVGPAAL-PIPPGAFALRADG 326
Query: 239 NGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFT 298
GGL++DSGTT T L + Y ++ + ++S + + TG DLC+ +P +++
Sbjct: 327 TGGLIIDSGTTITSLVDAAYKRVRAAVRSLVKL--PVTDGSNATGLDLCFALP--SSSAP 382
Query: 299 DDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQ 358
PS+T HF +VLP N+ + CL +S DG+ G++Q
Sbjct: 383 PATLPSMTLHFGGGADMVLPVENYMIL------DGGMWCLAMRSQTDGELS---TLGNYQ 433
Query: 359 QQNVEVVYDLEKERIGFQPMDCAS 382
QQN+ ++YD++KE + F P C++
Sbjct: 434 QQNLHILYDVQKETLSFAPAKCST 457
>gi|358346443|ref|XP_003637277.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355503212|gb|AES84415.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 113/384 (29%), Positives = 171/384 (44%), Gaps = 63/384 (16%)
Query: 5 YMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSD 64
+MDTGS++ W+ C C C N+ F+PS+SSS C SS C ++
Sbjct: 105 FMDTGSNIVWLQCQ----PCNTCF----NQTSPIFNPSKSSSYKNIPCTSSTC---KDTN 153
Query: 65 NPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREI 124
+ C+ G C ++ TYG G L+ D+L + +S +
Sbjct: 154 DTHISCSNGG---------DVCE----YSITYGGDAKSQGDLSNDSLTLDSTSGSSVL-F 199
Query: 125 PKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKG--FSHCFLAFKYANDPNI 178
P GC V + G+ G GRG +S+ Q+G G FS+C + Y +D N
Sbjct: 200 PNIVIGCGHINVLQDNSQSSGVVGMGRGPMSLIKQVGSSSVGSKFSYCLIP--YNSDSNS 257
Query: 179 SSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQG 238
SS L+ G+ + S + + TPM+K NYY++ LEA ++GN+ + E+ +
Sbjct: 258 SSKLIFGEDVVVSGEIVVSTPMVKVNGQENYYFLTLEAFSVGNNRI--------EYGERS 309
Query: 239 NG---GLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNN 295
N +L+DSGT T LP F S+L+S + + PR + + LCY N
Sbjct: 310 NASTQNILIDSGTPLTMLPNLFLSKLVSYVAQEVK-LPRIEPPDHH--LSLCY-----NT 361
Query: 296 TFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFG 355
T P IT HF N + L F+ + C F S + + +FG
Sbjct: 362 TGKQLNVPDITAHF-NGADVKLNSNGTFFPF-----EDGIMCFGFISSNGLE-----IFG 410
Query: 356 SFQQQNVEVVYDLEKERIGFQPMD 379
+ Q N+ + YDLEKE I F+P D
Sbjct: 411 NIAQNNLLIDYDLEKEIISFKPTD 434
>gi|242056193|ref|XP_002457242.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
gi|241929217|gb|EES02362.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
Length = 457
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 111/388 (28%), Positives = 172/388 (44%), Gaps = 60/388 (15%)
Query: 7 DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDNP 66
DTGSDL WV C + D D N F P+RSS+ S+ +C S+ C + +
Sbjct: 121 DTGSDLVWVNCSSSGGGLADADAGGNVV----FQPTRSSTYSQLSCQSNACQALSQASCD 176
Query: 67 FDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLK-VHGSSPGIIREIP 125
D S C + Y+YG+G G+L+ +T V G G +R +P
Sbjct: 177 AD--------------SEC-----QYQYSYGDGSRTIGVLSTETFSFVDGGGKGQVR-VP 216
Query: 126 KFCFGCV---GSTYREPIGIAGFGRGALSVPSQLG---FLQKGFSHCFLAFKYANDPNIS 179
+ FGC T+R G+ G G GA S+ SQLG + + S+C + + D N S
Sbjct: 217 RVNFGCSTASAGTFRSD-GLVGLGAGAFSLVSQLGATTHIDRKLSYCLIP---SYDANSS 272
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
S L G A+ S+ TP++ S + +YY + LE++ +G + DS+
Sbjct: 273 STLNFGSRAVVSEPGAASTPLVPSDV-DSYYTVALESVAVGGQEVAT-------HDSR-- 322
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTD 299
++VDSGTT T L L++ L+ I R + E+ LCY V + TD
Sbjct: 323 --IIVDSGTTLTFLDPALLGPLVTELERRIKLQ-RVQPPEQL--LQLCYDVQ--GKSETD 375
Query: 300 DL-FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQ 358
+ P +T F ++ L N F + CL+ + + P + G+
Sbjct: 376 NFGIPDVTLRFGGGAAVTLRPENTFSLLQ-----EGTLCLVLVPVSESQ--PVSILGNIA 428
Query: 359 QQNVEVVYDLEKERIGFQPMDCASTASA 386
QQN V YDL+ + F DCA ++++
Sbjct: 429 QQNFHVGYDLDARTVTFAAADCARSSAS 456
>gi|326490862|dbj|BAJ90098.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 447
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 104/390 (26%), Positives = 162/390 (41%), Gaps = 62/390 (15%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + +DTGSD+ W C FDC + + F S S + C C +
Sbjct: 106 VALEVDTGSDVVWTQC-RPCFDCF-------TQPLPRFDTSASDTVHGVLCTDPICRALR 157
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
C + GC+ + YG+ + G L +D+ G G +
Sbjct: 158 PH-----ACFLGGCT---------------YQVNYGDNSVTIGQLAKDSFTFDGKGGGKV 197
Query: 122 REIPKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPN 177
+P FGC G+ + GIAGFGRG LS+P QLG FS+CF +
Sbjct: 198 -TVPDLVFGCGQYNTGNFHSNETGIAGFGRGPLSLPRQLGV--SSFSYCFTTIFESK--- 251
Query: 178 ISSPLVIGDVAISSKDNLQFTPMLKSPMYPN---YYYIGLEAITIGNSSLTEVPLSLREF 234
S+P+ +G P+L +P PN YYY+ L+ IT+G + L VP S
Sbjct: 252 -STPVFLGGAPADGLRAHATGPILSTPFLPNHPEYYYLSLKGITVGKTRLA-VPESAFVV 309
Query: 235 DSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPN 294
+ G+GG ++DSGT T P + S+ ++ + P TG + C +
Sbjct: 310 KADGSGGTIIDSGTAITAFPRAVFR---SLWEAFVAQVPLPHTSYNDTGEPT---LQCFS 363
Query: 295 NTFTDDL----FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGP 350
D P +T H L LP+ N+ M+ +S + ++ DD
Sbjct: 364 TESVPDASKVPVPKMTLH-LEGADWELPRENY---MAEYPDSDQLCVVVLAGDDD----- 414
Query: 351 SGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
+ G+FQQQN+ +V+DL ++ +P C
Sbjct: 415 RTMIGNFQQQNMHIVHDLAGNKLVIEPAQC 444
>gi|225470916|ref|XP_002263964.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
gi|147788999|emb|CAN64659.1| hypothetical protein VITISV_009613 [Vitis vinifera]
Length = 489
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 108/384 (28%), Positives = 170/384 (44%), Gaps = 58/384 (15%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + +DTGSD+ W+ C C C Y + F P +S S S +C S CL +
Sbjct: 160 VYMVLDTGSDVVWIQCA----PCRKC--YSQTDPV--FDPKKSGSFSSISCRSPLCLRL- 210
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
D+P GC+ + +C + YG+G G + +TL G+
Sbjct: 211 --DSP-------GCNS----RQSCL-----YQVAYGDGSFTFGEFSTETLTFRGT----- 247
Query: 122 REIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGF-LQKGFSHCFLAFKYANDPN 177
+PK GC + G+ G GRG LS P+Q G + FS+C + ++ P
Sbjct: 248 -RVPKVALGCGHDNEGLFVGAAGLLGLGRGRLSFPTQTGLRFGRKFSYCLVDRSASSKP- 305
Query: 178 ISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
S +V G A+S FTP++ +P +YY+ L I++G + + + SL + D+
Sbjct: 306 --SSVVFGQSAVSR--TAVFTPLITNPKLDTFYYLELTGISVGGARVAGITASLFKLDTA 361
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTF 297
GNGG+++DSGT+ T L Y L ++ RA + FD C+ +
Sbjct: 362 GNGGVIIDSGTSVTRLTRRAYVSLRDAFRAGAADLKRAPDYSL---FDTCFDLSGK---- 414
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSF 357
T+ P++ HF + LP N+ P +++ V C F G + G+
Sbjct: 415 TEVKVPTVVMHF-RGADVSLPATNYLI----PVDTNGVFCFAFAGTMSG----LSIIGNI 465
Query: 358 QQQNVEVVYDLEKERIGFQPMDCA 381
QQQ VV+D+ RIGF CA
Sbjct: 466 QQQGFRVVFDVAASRIGFAARGCA 489
>gi|125563959|gb|EAZ09339.1| hypothetical protein OsI_31611 [Oryza sativa Indica Group]
Length = 453
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 110/388 (28%), Positives = 164/388 (42%), Gaps = 54/388 (13%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
I +DTGSDL W C C C R + FSP SSS CA C +I
Sbjct: 111 ITALLDTGSDLIWTQCDT----CTAC--LRQPDPL--FSPRMSSSYEPMRCAGQLCGDI- 161
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRP--CPSFAYTYGEGGLVTGILTRDTLKVHGSSPG 119
L +C RP C ++ Y+YG+G G + SS G
Sbjct: 162 -------------------LHHSCVRPDTC-TYRYSYGDGTTTLGYYATERF-TFASSSG 200
Query: 120 IIREIPKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYAND 175
+ +P FGC VGS GI GFGR LS+ SQL + FS+C + +
Sbjct: 201 ETQSVP-LGFGCGTMNVGS-LNNASGIVGFGRDPLSLVSQLSI--RRFSYCLTPYASSRK 256
Query: 176 PNISSPLVIGDVAI--SSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLRE 233
+ + DV + + +Q TP+L+S P +YY+ +T+G L +P S
Sbjct: 257 STLQFG-SLADVGLYDDATGPVQTTPILQSAQNPTFYYVAFTGVTVGARRL-RIPASAFA 314
Query: 234 FDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITY-YPRAKEVEERTGFDLCYRVPC 292
G+GG+++DSGT T P ++++ +S + + ++ F
Sbjct: 315 LRPDGSGGVIIDSGTALTLFPAAVLAEVVRAFRSQLRLPFANGSSPDDGVCFAAPAVAAG 374
Query: 293 PNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSG 352
P + FHF L LP+ N+ + C+L D GD G +
Sbjct: 375 GGRMARQVAVPRMVFHF-QGADLDLPRENYVLE----DHRRGHLCVLLG--DSGDDGAT- 426
Query: 353 VFGSFQQQNVEVVYDLEKERIGFQPMDC 380
G+F QQ++ VVYDLE+E + F P++C
Sbjct: 427 -IGNFVQQDMRVVYDLERETLSFAPVEC 453
>gi|413921976|gb|AFW61908.1| hypothetical protein ZEAMMB73_608282 [Zea mays]
Length = 459
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 114/390 (29%), Positives = 178/390 (45%), Gaps = 60/390 (15%)
Query: 7 DTGSDLTWVPCGNLSFDCMDCDD-YRNNKLMSNFSPSRSSSSSRDTCASSFCLNI-HSSD 64
DTGSDLTW C C D + ++FSP CAS+ CL I SS
Sbjct: 113 DTGSDLTWTQCKPCKL-CFPQDTPIYDTAASASFSPV--------PCASATCLPIWRSSR 163
Query: 65 NPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIR-- 122
N CT +T PC + Y Y +G G+L +TL GSSPG
Sbjct: 164 N----CT-----------ATTTSPC-RYRYAYDDGAYSAGVLGTETLTFAGSSPGAPGPG 207
Query: 123 -EIPKFCFGCV---GSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNI 178
+ FGC G G G GRG+LS+ +QLG + FS+C F + ++
Sbjct: 208 VSVGGVAFGCGVDNGGLSYNSTGTVGLGRGSLSLVAQLGVGK--FSYCLTDFF---NTSL 262
Query: 179 SSPLVIGDVAISSKDN------LQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLR 232
SP++ G +A + + +Q TP+++ P P+ YY+ LE I++G++ L +P
Sbjct: 263 GSPVLFGSLAELAAPSTIGGAAVQSTPLVQGPYNPSRYYVSLEGISLGDARL-PIPNGTF 321
Query: 233 EFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDL-CYRVP 291
+ G+GG++VDSGT +T L E + +++ + + + V + D C+
Sbjct: 322 DLRDDGSGGMIVDSGTIFTVLVESAFRVVVNHVAGVLN-----QPVVNASSLDSPCFPAT 376
Query: 292 CPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPS 351
D P + HF + L + N+ MS SS+ CL YG
Sbjct: 377 AGEQQLPD--MPDMLLHFAGGADMRLHRDNY---MSFNQESSSF-CLNIAGAPSA-YG-- 427
Query: 352 GVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
+ G+FQQQN+++++D+ ++ F P DC+
Sbjct: 428 SILGNFQQQNIQMLFDITVGQLSFVPTDCS 457
>gi|115438214|ref|NP_001043485.1| Os01g0598600 [Oryza sativa Japonica Group]
gi|15290061|dbj|BAB63755.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
Group]
gi|113533016|dbj|BAF05399.1| Os01g0598600 [Oryza sativa Japonica Group]
gi|125526702|gb|EAY74816.1| hypothetical protein OsI_02707 [Oryza sativa Indica Group]
Length = 500
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 108/388 (27%), Positives = 162/388 (41%), Gaps = 60/388 (15%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DTGSD+ W+ C C C D F P S S CA+ C + S
Sbjct: 162 MVLDTGSDVVWLQCA----PCRRCYDQSGQM----FDPRASHSYGAVDCAAPLCRRLDSG 213
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
GC L + C + YG+G + G +TL +
Sbjct: 214 ----------GCDLR---RKACL-----YQVAYGDGSVTAGDFATETLTFASGA-----R 250
Query: 124 IPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGF-LQKGFSHCFL--AFKYANDPN 177
+P+ GC + G+ G GRG+LS PSQ+ + FS+C + A+ +
Sbjct: 251 VPRVALGCGHDNEGLFVAAAGLLGLGRGSLSFPSQISRRFGRSFSYCLVDRTSSSASATS 310
Query: 178 ISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLS-LREFDS 236
SS + G A+ FTPM+K+P +YY+ L I++G + + V +S LR S
Sbjct: 311 RSSTVTFGSGAVGPSAAASFTPMVKNPRMETFYYVQLMGISVGGARVPGVAVSDLRLDPS 370
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTG----FDLCYRVPC 292
G GG++VDSGT+ T L P Y+ L ++ A + G FD CY +
Sbjct: 371 TGRGGVIVDSGTSVTRLARPAYAALRDAFRAA------AAGLRLSPGGFSLFDTCYDL-- 422
Query: 293 PNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSG 352
+ P+++ HF LP N+ P +S C F D G
Sbjct: 423 --SGLKVVKVPTVSMHFAGGAEAALPPENYLI----PVDSRGTFCFAFAGTDGG----VS 472
Query: 353 VFGSFQQQNVEVVYDLEKERIGFQPMDC 380
+ G+ QQQ VV+D + +R+GF P C
Sbjct: 473 IIGNIQQQGFRVVFDGDGQRLGFVPKGC 500
>gi|125558629|gb|EAZ04165.1| hypothetical protein OsI_26307 [Oryza sativa Indica Group]
Length = 455
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 110/387 (28%), Positives = 165/387 (42%), Gaps = 48/387 (12%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V +DTGS+L W C C C + P+RSS+ SR C SFC + +S
Sbjct: 106 VIVDTGSNLIWAQCA----PCTRC--FPRPTPAPVLQPARSSTFSRLPCNGSFCQYLPTS 159
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
P + C+ + YTYG G G L +TL V +
Sbjct: 160 SRPRTCNATAACA---------------YNYTYGSG-YTAGYLATETLTVGDGT------ 197
Query: 124 IPKFCFGC-VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSPL 182
PK FGC + GI G GRG LS+ SQL + FS+C + +SP+
Sbjct: 198 FPKVAFGCSTENGVDNSSGIVGLGRGPLSLVSQLAVGR--FSYCL---RSDMADGGASPI 252
Query: 183 VIGDVA-ISSKDNLQFTPMLKSP--MYPNYYYIGLEAITIGNSSLTEVPLSLREF---DS 236
+ G +A ++ + +Q TP+LK+P +YY+ L I + + TE+P++ F +
Sbjct: 253 LFGSLAKLTERSVVQSTPLLKNPYLQRSTHYYVNLTGIAVDS---TELPVTGSTFGFTQT 309
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERT-GFDLCYRVPCPNN 295
GG +VDSGTT T+L + Y+ + QS + + DLCY+ P
Sbjct: 310 GLGGGTIVDSGTTLTYLAKDGYAMVKQAFQSQMANLNQTTPASGAPYDLDLCYK-PSAGG 368
Query: 296 TFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSA-VKCLLFQSMDDGDYGPSGVF 354
P + F +P N+F + A S V CLL D P +
Sbjct: 369 GGKAVRVPRLALRFAGGAKYNVPVQNYFAGVEADSQGRVTVACLLVLPATDDL--PISII 426
Query: 355 GSFQQQNVEVVYDLEKERIGFQPMDCA 381
G+ Q ++ ++YD++ F P DCA
Sbjct: 427 GNLMQMDMHLLYDIDGGMFSFAPADCA 453
>gi|357492389|ref|XP_003616483.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517818|gb|AES99441.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 118/392 (30%), Positives = 177/392 (45%), Gaps = 83/392 (21%)
Query: 7 DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDNP 66
DTGSD+ W+ C C C N+ F+PS+SSS C+S C HS
Sbjct: 105 DTGSDIVWLQCE----PCEQC----YNQTTPIFNPSKSSSYKNIPCSSKLC---HS---- 149
Query: 67 FDPCTMSGCSLSTLLKSTCCRPCPSFAY--TYGEGGLVTGILTRDTLKVHGSSPGIIREI 124
++ T C S Y +YG+ G L+ DTL + +S G
Sbjct: 150 --------------VRDTSCSDQNSCQYKISYGDSSHSQGDLSVDTLSLESTS-GSPVSF 194
Query: 125 PKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNIS 179
PK GC G+ GI G G G +S+ +QLG G FS+C + + N S
Sbjct: 195 PKIVIGCGTDNAGTFGGASSGIVGLGGGPVSLITQLGSSIGGKFSYCLVPL-LNKESNAS 253
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
S L GD A+ S D + TP++K P +Y++ L+A ++GN + E S D +GN
Sbjct: 254 SILSFGDAAVVSGDGVVSTPLIKKD--PVFYFLTLQAFSVGNKRV-EFGGSSEGGDDEGN 310
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEE-RTGFDLCYRVPCPNNTFT 298
+++DSGTT T +P Y+ L +S + + V++ F LCY + +N +
Sbjct: 311 --IIIDSGTTLTLIPSDVYTNL----ESAVVDLVKLDRVDDPNQQFSLCYSLK--SNEYD 362
Query: 299 DDLFPSITFHF------LNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPS- 351
FP IT HF L+++S +P + + C FQ PS
Sbjct: 363 ---FPIITVHFKGADVELHSISTFVPI------------TDGIVCFAFQ--------PSP 399
Query: 352 ---GVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
+FG+ QQN+ V YDL+++ + F+P DC
Sbjct: 400 QLGSIFGNLAQQNLLVGYDLQQKTVSFKPTDC 431
>gi|224074591|ref|XP_002304395.1| predicted protein [Populus trichocarpa]
gi|222841827|gb|EEE79374.1| predicted protein [Populus trichocarpa]
Length = 443
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 123/388 (31%), Positives = 177/388 (45%), Gaps = 61/388 (15%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDD-YRNNKLMSNFSPSRSSSSSRDTCASSFCLNI 60
+ V DTGSDLTWV C+ CD YR + F PSRSSS C S FC +
Sbjct: 107 VIVIADTGSDLTWV-------QCLPCDPCYRQKSPL--FDPSRSSSYRHMLCGSRFCNAL 157
Query: 61 HSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
S+ CTM + C + Y+YG+ G L + + +S
Sbjct: 158 DVSEQA---CTM---------DTNICE----YHYSYGDKSYTNGNLATEKFTIGSTSSRP 201
Query: 121 IREIPKFCFGC---VGSTYRE-PIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYAND 175
+ P FGC G T+ E GI G G GALS+ SQL + KG FS+C + +
Sbjct: 202 VHLSP-IVFGCGTGNGGTFDELGSGIVGLGGGALSLVSQLSSIIKGKFSYCLVPL--SEQ 258
Query: 176 PNISSPLVIGDVAISSKDNLQFTPML-KSPMYPNYYYIGLEAITIGNSSLTEVPLSLREF 234
N++S + G ++ S + TP++ K P YYY+ LEAI++GN L L
Sbjct: 259 SNVTSKIKFGTDSVISGPQVVSTPLVSKQP--DTYYYVTLEAISVGNKRLPYTNGLLNGN 316
Query: 235 DSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTG-FDLCYRVPCP 293
+GN +++DSGTT T L F+++L +L+ T+ +A+ V + G F +C+R
Sbjct: 317 VEKGN--VIIDSGTTLTFLDSEFFTELERVLEETV----KAERVSDPRGLFSVCFR---- 366
Query: 294 NNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGV 353
+ D P I HF N+ + L N F L F + G +
Sbjct: 367 --SAGDIDLPVIAVHF-NDADVKLQPLNTFVKADE-------DLLCFTMISSNQIG---I 413
Query: 354 FGSFQQQNVEVVYDLEKERIGFQPMDCA 381
FG+ Q + V YDLEK + F+P DC
Sbjct: 414 FGNLAQMDFLVGYDLEKRTVSFKPTDCT 441
>gi|356526294|ref|XP_003531753.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 414
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 118/387 (30%), Positives = 173/387 (44%), Gaps = 57/387 (14%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ V +DTGSDLTWV C CM C N+ F PS SSS +C SS C ++
Sbjct: 76 MTVIIDTGSDLTWVQCE----PCMSC----YNQQGPIFKPSTSSSYQSVSCNSSTCQSLQ 127
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
+ C S STC ++ YG+G G L + L G S
Sbjct: 128 FATGNTGACGSSN-------PSTC-----NYVVNYGDGSYTNGELGVEALSFGGVS---- 171
Query: 122 REIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPN 177
+ F FGC + + G+ G GR LS+ SQ G FS+C +
Sbjct: 172 --VSDFVFGCGRNNKGLFGGVSGLMGLGRSYLSLVSQTNATFGGVFSYCLPT----TEAG 225
Query: 178 ISSPLVIGDVAISSKDN--LQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFD 235
S LV+G+ + K+ + +T ML +P N+Y + L I +G +L + PLS
Sbjct: 226 SSGSLVMGNESSVFKNANPITYTRMLSNPQLSNFYILNLTGIDVGGVAL-KAPLSF---- 280
Query: 236 SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNN 295
GNGG+L+DSGT T LP Y L + T +P A GF + C N
Sbjct: 281 --GNGGILIDSGTVITRLPSSVYKALKAEFLKKFTGFPSAP------GFSILD--TCFNL 330
Query: 296 TFTDDL-FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVF 354
T D++ P+I+ F N L + FY + ++S V CL S+ D + +
Sbjct: 331 TGYDEVSIPTISLRFEGNAQLNVDATGTFYVV--KEDASQV-CLALASLSDAY--DTAII 385
Query: 355 GSFQQQNVEVVYDLEKERIGFQPMDCA 381
G++QQ+N V+YD ++ ++GF C+
Sbjct: 386 GNYQQRNQRVIYDTKQSKVGFAEEPCS 412
>gi|15223368|ref|NP_171637.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|9665144|gb|AAF97328.1|AC023628_9 Unknown protein [Arabidopsis thaliana]
gi|22135930|gb|AAM91547.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
gi|30387595|gb|AAP31963.1| At1g01300 [Arabidopsis thaliana]
gi|332189147|gb|AEE27268.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 485
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 108/382 (28%), Positives = 171/382 (44%), Gaps = 53/382 (13%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + +DTGSD+ W+ C C C Y + + F P +S + + C+S C +
Sbjct: 155 VYMVLDTGSDIVWLQCA----PCRRC--YSQSDPI--FDPRKSKTYATIPCSSPHCRRLD 206
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
S+ GC+ + TC + +YG+G G + +TL + +
Sbjct: 207 SA----------GCNTR---RKTCL-----YQVSYGDGSFTVGDFSTETLTFRRNR---V 245
Query: 122 REIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLG--FLQKGFSHCFLAFKYANDPNIS 179
+ + C + G+ G G+G LS P Q G F QK FS+C + ++ P
Sbjct: 246 KGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQK-FSYCLVDRSASSKP--- 301
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
S +V G+ A+S +FTP+L +P +YY+GL I++G + + V SL + D GN
Sbjct: 302 SSVVFGNAAVSRI--ARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGN 359
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTD 299
GG+++DSGT+ T L P Y + + RA + FD C+ + N
Sbjct: 360 GGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSL---FDTCFDLSNMNEV--- 413
Query: 300 DLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQ 359
P++ HF + LP N+ P +++ C F G G + G+ QQ
Sbjct: 414 -KVPTVVLHF-RGADVSLPATNYLI----PVDTNGKFCFAFA----GTMGGLSIIGNIQQ 463
Query: 360 QNVEVVYDLEKERIGFQPMDCA 381
Q VVYDL R+GF P CA
Sbjct: 464 QGFRVVYDLASSRVGFAPGGCA 485
>gi|226503109|ref|NP_001147206.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195608496|gb|ACG26078.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|413921850|gb|AFW61782.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 441
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 108/385 (28%), Positives = 165/385 (42%), Gaps = 62/385 (16%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
MDTGSDL W C C+ C D + F +S++ C SS C ++ S
Sbjct: 106 MDTGSDLIWTQCA----PCLLCAD----QPTPYFDVKKSATYRALPCRSSRCASLSS--- 154
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
+C + + Y YG+ G+L +T ++ +R
Sbjct: 155 -----------------PSCFKKMCVYQYYYGDTASTAGVLANETFTFGAANSTKVRAT- 196
Query: 126 KFCFGCVGSTYREPI----GIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSP 181
FGC GS + G+ GFGRG LS+ SQLG FS+C ++ A S
Sbjct: 197 NIAFGC-GSLNAGDLANSSGMVGFGRGPLSLVSQLG--PSRFSYCLTSYLSAT----PSR 249
Query: 182 LVIGDVAISSKDN------LQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFD 235
L G A S N +Q TP + +P PN Y++ L+AI++G L PL + +
Sbjct: 250 LYFGVYANLSSTNTSSGSPVQSTPFVINPALPNMYFLSLKAISLGTKLLPIDPL-VFAIN 308
Query: 236 SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNN 295
G GG+++DSGT+ T L + Y + L S I P + G D C++ P P N
Sbjct: 309 DDGTGGVIIDSGTSITWLQQDAYEAVRRGLVSAI---PLPAMNDTDIGLDTCFQWPPPPN 365
Query: 296 TFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFG 355
P + FHF + +LP+ A +++ CL+ G + G
Sbjct: 366 VTVT--VPDLVFHFDSANMTLLPENYMLIA-----STTGYLCLVMAPTGVGT-----IIG 413
Query: 356 SFQQQNVEVVYDLEKERIGFQPMDC 380
++QQQN+ ++YD+ + F P C
Sbjct: 414 NYQQQNLHLLYDIGNSFLSFVPAPC 438
>gi|115479489|ref|NP_001063338.1| Os09g0452800 [Oryza sativa Japonica Group]
gi|51535938|dbj|BAD38020.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113631571|dbj|BAF25252.1| Os09g0452800 [Oryza sativa Japonica Group]
gi|125605918|gb|EAZ44954.1| hypothetical protein OsJ_29597 [Oryza sativa Japonica Group]
gi|215740967|dbj|BAG97462.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 453
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 110/388 (28%), Positives = 164/388 (42%), Gaps = 54/388 (13%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
I +DTGSDL W C C C R + FSP SSS CA C +I
Sbjct: 111 ITALLDTGSDLIWTQCDT----CTAC--LRQPDPL--FSPRMSSSYEPMRCAGQLCGDI- 161
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRP--CPSFAYTYGEGGLVTGILTRDTLKVHGSSPG 119
L +C RP C ++ Y+YG+G G + SS G
Sbjct: 162 -------------------LHHSCVRPDTC-TYRYSYGDGTTTLGYYATERF-TFASSSG 200
Query: 120 IIREIPKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYAND 175
+ +P FGC VGS GI GFGR LS+ SQL + FS+C + +
Sbjct: 201 ETQSVP-LGFGCGTMNVGS-LNNASGIVGFGRDPLSLVSQLSI--RRFSYCLTPYASSRK 256
Query: 176 PNISSPLVIGDVAI--SSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLRE 233
+ + DV + + +Q TP+L+S P +YY+ +T+G L +P S
Sbjct: 257 STLQFG-SLADVGLYDDATGPVQTTPILQSAQNPTFYYVAFTGVTVGARRL-RIPASAFA 314
Query: 234 FDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITY-YPRAKEVEERTGFDLCYRVPC 292
G+GG+++DSGT T P ++++ +S + + ++ F
Sbjct: 315 LRPDGSGGVIIDSGTALTLFPVAVLAEVVRAFRSQLRLPFANGSSPDDGVCFAAPAVAAG 374
Query: 293 PNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSG 352
P + FHF L LP+ N+ + C+L D GD G +
Sbjct: 375 GGRMARQVAVPRMVFHF-QGADLDLPRENYVLE----DHRRGHLCVLLG--DSGDDGAT- 426
Query: 353 VFGSFQQQNVEVVYDLEKERIGFQPMDC 380
G+F QQ++ VVYDLE+E + F P++C
Sbjct: 427 -IGNFVQQDMRVVYDLERETLSFAPVEC 453
>gi|242050428|ref|XP_002462958.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
gi|241926335|gb|EER99479.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
Length = 460
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 113/388 (29%), Positives = 167/388 (43%), Gaps = 48/388 (12%)
Query: 1 VIQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNI 60
+ +DTGSDL W C C + ++P+RS + + +C S C +
Sbjct: 112 ALSAVLDTGSDLIWTQCDAPCRRCFP-------QPAPLYAPARSVTYANVSCGSRLCDAL 164
Query: 61 HSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
S S ++ R ++ Y+YG+G G+L +T G
Sbjct: 165 PS-------LRPSSRCSASASAPAPERGGCTYYYSYGDGSSTDGVLATETFTF-----GA 212
Query: 121 IREIPKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDP 176
+ FGC +G T G+ G GRG LS+ SQLG + FS+CF F ND
Sbjct: 213 GTTVHDLAFGCGTDNLGGTDNSS-GLVGMGRGPLSLVSQLGVTK--FSYCFTPF---NDT 266
Query: 177 NISSPLVIGDVAISSKDNLQFTPMLKSPMYP---NYYYIGLEAITIGNSSLTEVPLSLRE 233
SSPL +G A S + TP + SP P +YYY+ LE IT+G++ L P R
Sbjct: 267 TTSSPLFLGSSA-SLSPAAKSTPFVPSPSGPRRSSYYYLSLEGITVGDTLLPIDPAVFR- 324
Query: 234 FDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCP 293
+ G GGL++DSGTT+T L E + L + P A G +C+ P
Sbjct: 325 LTASGRGGLIIDSGTTFTALEERAFVVLARA-VAARVALPLASGAH--LGLSVCFAAPQG 381
Query: 294 NNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGV 353
D+ P + HF + + LP+ + + V CL S V
Sbjct: 382 RGPEAVDV-PRLVLHF-DGADMELPRSSAVVE----DRVAGVACLGIVSARG-----MSV 430
Query: 354 FGSFQQQNVEVVYDLEKERIGFQPMDCA 381
GS QQQN+ V YD+ ++ + F+P +C
Sbjct: 431 LGSMQQQNMHVRYDVGRDVLSFEPANCG 458
>gi|359806832|ref|NP_001241567.1| uncharacterized protein LOC100819698 precursor [Glycine max]
gi|255638149|gb|ACU19388.1| unknown [Glycine max]
Length = 437
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 102/373 (27%), Positives = 162/373 (43%), Gaps = 61/373 (16%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
MDT +D +WVPC C+ C + F+P++S++ + C +S C + N
Sbjct: 115 MDTSNDASWVPCT----ACVGCST------TTPFAPAKSTTFKKVGCGASQCKQVR---N 161
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
P C S C+ F +TYG V L +DT+ + + P +P
Sbjct: 162 PT--CDGSACA---------------FNFTYGTSS-VAASLVQDTVTL-ATDP-----VP 197
Query: 126 KFCFGCV----GSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSP 181
+ FGC+ GS+ + + Q FS+C +FK N S
Sbjct: 198 AYAFGCIQKVTGSSVPPQGLLGLGRGPLSLLAQTQKLYQSTFSYCLPSFKTLN---FSGS 254
Query: 182 LVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGG 241
L +G VA ++FTP+LK+P + YY+ L AI +G + ++P F++ G
Sbjct: 255 LRLGPVA--QPKRIKFTPLLKNPRRSSLYYVNLVAIRVGR-RIVDIPPEALAFNANTGAG 311
Query: 242 LLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDL 301
+ DSGT +T L EP Y+ + + + I + + V GFD CY P +
Sbjct: 312 TVFDSGTVFTRLVEPAYNAVRNEFRRRIAVH-KKLTVTSLGGFDTCYTAPI--------V 362
Query: 302 FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQN 361
P+ITF F + +++ LP N S + +V CL D V + QQQN
Sbjct: 363 APTITFMF-SGMNVTLPPDNILIH----STAGSVTCLAMAPAPDNVNSVLNVIANMQQQN 417
Query: 362 VEVVYDLEKERIG 374
V++D+ R+G
Sbjct: 418 HRVLFDVPNSRLG 430
>gi|223946005|gb|ACN27086.1| unknown [Zea mays]
Length = 336
Score = 118 bits (296), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 108/385 (28%), Positives = 165/385 (42%), Gaps = 62/385 (16%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
MDTGSDL W C C+ C D + F +S++ C SS C ++ S
Sbjct: 1 MDTGSDLIWTQCA----PCLLCAD----QPTPYFDVKKSATYRALPCRSSRCASLSS--- 49
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
+C + + Y YG+ G+L +T ++ +R
Sbjct: 50 -----------------PSCFKKMCVYQYYYGDTASTAGVLANETFTFGAANSTKVRAT- 91
Query: 126 KFCFGCVGSTYREPI----GIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSP 181
FGC GS + G+ GFGRG LS+ SQLG FS+C ++ A S
Sbjct: 92 NIAFGC-GSLNAGDLANSSGMVGFGRGPLSLVSQLG--PSRFSYCLTSYLSAT----PSR 144
Query: 182 LVIGDVAISSKDN------LQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFD 235
L G A S N +Q TP + +P PN Y++ L+AI++G L PL + +
Sbjct: 145 LYFGVYANLSSTNTSSGSPVQSTPFVINPALPNMYFLSLKAISLGTKLLPIDPL-VFAIN 203
Query: 236 SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNN 295
G GG+++DSGT+ T L + Y + L S I P + G D C++ P P N
Sbjct: 204 DDGTGGVIIDSGTSITWLQQDAYEAVRRGLVSAI---PLPAMNDTDIGLDTCFQWPPPPN 260
Query: 296 TFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFG 355
P + FHF + +LP+ A +++ CL+ G + G
Sbjct: 261 VTVT--VPDLVFHFDSANMTLLPENYMLIA-----STTGYLCLVMAPTGVGT-----IIG 308
Query: 356 SFQQQNVEVVYDLEKERIGFQPMDC 380
++QQQN+ ++YD+ + F P C
Sbjct: 309 NYQQQNLHLLYDIGNSFLSFVPAPC 333
>gi|125527523|gb|EAY75637.1| hypothetical protein OsI_03542 [Oryza sativa Indica Group]
Length = 446
Score = 118 bits (295), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 105/383 (27%), Positives = 154/383 (40%), Gaps = 41/383 (10%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DTGSDL W+ C C C R F P RSS+ R C+S C +
Sbjct: 101 LVIDTGSDLVWLQCS----PCRRCYAQRGQV----FDPRRSSTYRRVPCSSPQCRALR-- 150
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
F C G + CR + YG+G TG L D L + +
Sbjct: 151 ---FPGCDSGGAA------GGGCR----YMVAYGDGSSSTGELATDKLAFANDT--YVNN 195
Query: 124 IPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLG-FLQKGFSHCFLAFKYANDPNISSPL 182
+ C + G+ G RG +S+ +Q+ F +C + SS L
Sbjct: 196 VTLGCGRDNEGLFDSAAGLLGVARGKISISTQVAPAYGSVFEYCL--GDRTSRSTRSSYL 253
Query: 183 VIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ-GNGG 241
V G + FT +L +P P+ YY+ + ++G +T + D+ G GG
Sbjct: 254 VFGRT--PEPPSTAFTALLSNPRRPSLYYVDMAGFSVGGERVTGFSNASLALDTATGRGG 311
Query: 242 LLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDL 301
++VDSGT + Y+ L + + E + FD CY +
Sbjct: 312 VVVDSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHSVFDACYDL----RGRPAAS 367
Query: 302 FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSA--VKCLLFQSMDDGDYGPSGVFGSFQQ 359
P I HF + LP N+F + +A +CL F++ DDG V G+ QQ
Sbjct: 368 APLIVLHFAGGADMALPPENYFLPVDGGRRRAASYRRCLGFEAADDG----LSVIGNVQQ 423
Query: 360 QNVEVVYDLEKERIGFQPMDCAS 382
Q VV+D+EKERIGF P C S
Sbjct: 424 QGFRVVFDVEKERIGFAPKGCTS 446
>gi|21594980|gb|AAM66061.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
Length = 485
Score = 118 bits (295), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 109/385 (28%), Positives = 169/385 (43%), Gaps = 59/385 (15%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + +DTGSD+ W+ C C C Y + + F P +S + + C+S C +
Sbjct: 155 VYMVLDTGSDIVWLQCA----PCRRC--YSQSDPI--FDPRKSKTYATIPCSSPHCRRLD 206
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
S+ GC+ + TC + +YG+G G + +TL +
Sbjct: 207 SA----------GCNTR---RKTCL-----YQVSYGDGSFTVGDFSTETLTFRRN----- 243
Query: 122 REIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLG--FLQKGFSHCFLAFKYANDP 176
+ GC + G+ G G+G LS P Q G F QK FS+C + ++ P
Sbjct: 244 -RVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQK-FSYCLVDRSASSKP 301
Query: 177 NISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
S +V G+ A+S +FTP+L +P +YY+GL I++G + + V SL + D
Sbjct: 302 ---SSVVFGNAAVSRI--ARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQ 356
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNT 296
GNGG+++DSGT+ T L P Y + + RA FD C+ + N
Sbjct: 357 IGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPNFSL---FDTCFDLSNMNEV 413
Query: 297 FTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGS 356
P++ HF + LP N+ P +++ C F G G + G+
Sbjct: 414 ----KVPTVVLHF-RRADVSLPATNYLI----PVDTNGKFCFAFA----GTMGGLSIIGN 460
Query: 357 FQQQNVEVVYDLEKERIGFQPMDCA 381
QQQ VVYDL R+GF P CA
Sbjct: 461 IQQQGFRVVYDLASSRVGFAPGGCA 485
>gi|125529158|gb|EAY77272.1| hypothetical protein OsI_05246 [Oryza sativa Indica Group]
Length = 426
Score = 117 bits (294), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 106/384 (27%), Positives = 159/384 (41%), Gaps = 60/384 (15%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V +D +D WVPC C C +FSP++SS+ C S C + S
Sbjct: 98 VAIDPSNDAAWVPCS----ACAGCAASS-----PSFSPTQSSTYRTVPCGSPQCAQVPSP 148
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
P + S+C F TY +L +D+L + +
Sbjct: 149 SCPAG------------VGSSC-----GFNLTYAASTF-QAVLGQDSLALENN------V 184
Query: 124 IPKFCFGC---VGSTYREPIGIAGFGRGALSVPSQLG-FLQKGFSHCFLAFKYANDPNIS 179
+ + FGC V P G+ GFGRG LS SQ FS+C ++ +N S
Sbjct: 185 VVSYTFGCLRVVSGNSVPPQGLIGFGRGPLSFLSQTKDTYGSVFSYCLPNYRSSN---FS 241
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
L +G I ++ TP+L +P P+ YY+ + I +G S + +VP S F+
Sbjct: 242 GTLKLGP--IGQPKRIKTTPLLYNPHRPSLYYVNMIGIRVG-SKVVQVPQSALAFNPVTG 298
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTD 299
G ++D+GT +T L P Y+ + + + R GFD CY V
Sbjct: 299 SGTIIDAGTMFTRLAAPVYAAVRDAFRGRV----RTPVAPPLGGFDTCYNV--------T 346
Query: 300 DLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQS-MDDGDYGPSGVFGSFQ 358
P++TF F V++ LP+ N S+S V CL + DG V S Q
Sbjct: 347 VSVPTVTFMFAGAVAVTLPEENVMIH----SSSGGVACLAMAAGPSDGVNAALNVLASMQ 402
Query: 359 QQNVEVVYDLEKERIGFQPMDCAS 382
QQN V++D+ R+GF C +
Sbjct: 403 QQNQRVLFDVANGRVGFSRELCTA 426
>gi|449529638|ref|XP_004171805.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like, partial
[Cucumis sativus]
Length = 384
Score = 117 bits (294), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 110/385 (28%), Positives = 173/385 (44%), Gaps = 60/385 (15%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + +DTGSD+ W+ C C +C Y + F+P +S S ++ C + C +
Sbjct: 55 VYMVLDTGSDIVWLQCA----PCKNC--YSQTDPV--FNPVKSGSFAKVLCRTPLCRRLE 106
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
S GC+ + TC + +YG+G TG +TL +
Sbjct: 107 SP----------GCNQ----RQTCL-----YQVSYGDGSYTTGEFVTETLTFRRT----- 142
Query: 122 REIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLG--FLQKGFSHCFLAFKYANDP 176
++ + GC + G+ G GRG LS PSQ G F QK FS+C + ++ P
Sbjct: 143 -KVEQVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRTFNQK-FSYCLVDRSASSKP 200
Query: 177 NISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
S +V G+ A+S +FTP+L +P +YY+ L I++G + ++ + S + D
Sbjct: 201 ---SSVVFGNSAVSR--TARFTPLLTNPRLDTFYYVELLGISVGGTPVSGITASHFKLDR 255
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNT 296
GNGG+++D GT+ T L +P Y L ++ + K E + FD CY + +
Sbjct: 256 TGNGGVIIDCGTSVTRLNKPAYIALRDAFRAGAS---SLKSAPEFSLFDTCYDL----SG 308
Query: 297 FTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGS 356
T P++ HF + LP N+ P + S C F G + G+
Sbjct: 309 KTTVKVPTVVLHF-RGADVSLPASNYLI----PVDGSGRFCFAFAGTTSG----LSIIGN 359
Query: 357 FQQQNVEVVYDLEKERIGFQPMDCA 381
QQQ VVYDL R+GF P CA
Sbjct: 360 IQQQGFRVVYDLASSRVGFSPRGCA 384
>gi|218194599|gb|EEC77026.1| hypothetical protein OsI_15382 [Oryza sativa Indica Group]
Length = 409
Score = 117 bits (294), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 120/408 (29%), Positives = 185/408 (45%), Gaps = 82/408 (20%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDY-RNNKL---MSNFSPSRSSSSSRDTCASSFCLN 59
V +DTGSD+ WV +C+ CD R + L ++ + P SS+ S+ +C FC
Sbjct: 19 VQVDTGSDILWV-------NCISCDRCPRKSGLGLELTLYDPKDSSTGSKVSCDQGFCAA 71
Query: 60 IHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKV-HGSSP 118
+ P GC+ S PC ++ TYG+G TG D L+ S
Sbjct: 72 TYGGLLP-------GCTTSL--------PC-EYSVTYGDGSSTTGYFVSDLLQFDQVSGD 115
Query: 119 GIIREI-PKFCFGC-------VGSTYREPIGIAGFGRGALSVPSQL---GFLQKGFSHCF 167
G R FGC +GS+ + GI GFG+ S+ SQL G ++K F+HC
Sbjct: 116 GQTRPANSTVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCL 175
Query: 168 LAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKS-PMYPN--YYYIGLEAITIGNSSL 224
+ N IG+V P +K+ P+ PN +Y + L++I +G ++L
Sbjct: 176 ------DTINGGGIFAIGNVV---------QPKVKTTPLVPNMPHYNVNLKSIDVGGTAL 220
Query: 225 TEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSIL---QSTITYYPRAKEVEER 281
L FD+ G ++DSGTT T+LPE Y +++ + IT++ V+E
Sbjct: 221 ---KLPSHMFDTGEKKGTIIDSGTTLTYLPEIVYKEIMLAVFAKHKDITFH----NVQEF 273
Query: 282 TGFDLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQ 341
F RV DD FP ITFHF N++ L + ++F+ N + C+ FQ
Sbjct: 274 LCFQYVGRV--------DDDFPKITFHFENDLPLNVYPHDYFF-----ENGDNLYCVGFQ 320
Query: 342 S--MDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASAQ 387
+ + D + G N VVYDLE + IG+ +C+S+ +
Sbjct: 321 NGGLQSKDGKGMVLLGDLVLSNKLVVYDLENQVIGWTEYNCSSSIKIK 368
>gi|225436202|ref|XP_002271145.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 440
Score = 117 bits (294), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 112/388 (28%), Positives = 180/388 (46%), Gaps = 74/388 (19%)
Query: 7 DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFC--LNIHSSD 64
DTGSDL W C C+ C Y+ M F PS+S+S +C S C L+ S
Sbjct: 109 DTGSDLMWTQC----LPCLSC--YKQKNPM--FDPSKSTSFKEVSCESQQCRLLDTVSCS 160
Query: 65 NPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREI 124
P C F+Y YG+G L G++ +TL ++ +S G I
Sbjct: 161 QPQKLC--------------------DFSYGYGDGSLAQGVIATETLTLNSNS-GQPTSI 199
Query: 125 PKFCFGC----VGSTYREPIGIAGFGRGALSVPSQ----LGFLQKGFSHCFLAFKYANDP 176
FGC G+ +G+ G G LS+ SQ LG +K FS C + F+ DP
Sbjct: 200 LNIVFGCGHNNSGTFNENEMGLFGTGGRPLSLTSQIMSTLGSGRK-FSQCLVPFR--TDP 256
Query: 177 NISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSL---TEVPLSLRE 233
+I+S ++ G A S ++ TP++ P YY++ L+ I++G+ + P++ +
Sbjct: 257 SITSKIIFGPEAEVSGSDVVSTPLVTKD-DPTYYFVTLDGISVGDKLFPFSSSSPMATK- 314
Query: 234 FDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYP-RAKEVEERTGFDLCYRVPC 292
G + +D+GT T LP FY++L+ ++ I P + +++ + LCYR
Sbjct: 315 ------GNVFIDAGTPPTLLPRDFYNRLVQGVKEAIPMEPVQDPDLQPQ----LCYR--- 361
Query: 293 PNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSG 352
+ T D P +T HF + + L N F S V C Q +D G +G
Sbjct: 362 -SATLIDG--PILTAHF-DGADVQLKPLNTFI-----SPKEGVYCFAMQPID----GDTG 408
Query: 353 VFGSFQQQNVEVVYDLEKERIGFQPMDC 380
+FG+F Q N + +DL+ +++ F+ +DC
Sbjct: 409 IFGNFVQMNFLIGFDLDGKKVSFKAVDC 436
>gi|115442309|ref|NP_001045434.1| Os01g0954900 [Oryza sativa Japonica Group]
gi|20161865|dbj|BAB90778.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
gi|113534965|dbj|BAF07348.1| Os01g0954900 [Oryza sativa Japonica Group]
gi|215766867|dbj|BAG99095.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 445
Score = 117 bits (293), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 106/384 (27%), Positives = 159/384 (41%), Gaps = 60/384 (15%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V +D +D WVPC C C +FSP++SS+ C S C + S
Sbjct: 117 VAIDPSNDAAWVPCS----ACAGCAASS-----PSFSPTQSSTYRTVPCGSPQCAQVPSP 167
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
P + S+C F TY +L +D+L + +
Sbjct: 168 SCPAG------------VGSSC-----GFNLTYAASTF-QAVLGQDSLALENN------V 203
Query: 124 IPKFCFGC---VGSTYREPIGIAGFGRGALSVPSQLG-FLQKGFSHCFLAFKYANDPNIS 179
+ + FGC V P G+ GFGRG LS SQ FS+C ++ +N S
Sbjct: 204 VVSYTFGCLRVVSGNSVPPQGLIGFGRGPLSFLSQTKDTYGSVFSYCLPNYRSSN---FS 260
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
L +G I ++ TP+L +P P+ YY+ + I +G S + +VP S F+
Sbjct: 261 GTLKLGP--IGQPKRIKTTPLLYNPHRPSLYYVNMIGIRVG-SKVVQVPQSALAFNPVTG 317
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTD 299
G ++D+GT +T L P Y+ + + + R GFD CY V
Sbjct: 318 SGTIIDAGTMFTRLAAPVYAAVRDAFRGRV----RTPVAPPLGGFDTCYNV--------T 365
Query: 300 DLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQS-MDDGDYGPSGVFGSFQ 358
P++TF F V++ LP+ N S+S V CL + DG V S Q
Sbjct: 366 VSVPTVTFMFAGAVAVTLPEENVMIH----SSSGGVACLAMAAGPSDGVNAALNVLASMQ 421
Query: 359 QQNVEVVYDLEKERIGFQPMDCAS 382
QQN V++D+ R+GF C +
Sbjct: 422 QQNQRVLFDVANGRVGFSRELCTA 445
>gi|356523155|ref|XP_003530207.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 412
Score = 117 bits (293), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 117/387 (30%), Positives = 174/387 (44%), Gaps = 59/387 (15%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ V +DTGSDLTWV C CM C N+ F PS SSS +C SS C ++
Sbjct: 76 MTVIIDTGSDLTWVQCE----PCMSC----YNQQGPIFKPSTSSSYQSVSCNSSTCQSLQ 127
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
+ C + STC ++ YG+G G L + L G S
Sbjct: 128 FATGNTGACGSN--------PSTC-----NYVVNYGDGSYTNGELGVEQLSFGGVS---- 170
Query: 122 REIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPN 177
+ F FGC + + G+ G GR LS+ SQ G FS+C +
Sbjct: 171 --VSDFVFGCGRNNKGLFGGVSGLMGLGRSYLSLVSQTNATFGGVFSYCLPT----TESG 224
Query: 178 ISSPLVIGDVAISSKDN--LQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFD 235
S LV+G+ + K+ + +T ML +P N+Y + L I + +L +VP
Sbjct: 225 ASGSLVMGNESSVFKNVTPITYTRMLPNPQLSNFYILNLTGIDVDGVAL-QVP------- 276
Query: 236 SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNN 295
S GNGG+L+DSGT T LP Y L ++ T +P A GF + C N
Sbjct: 277 SFGNGGVLIDSGTVITRLPSSVYKALKALFLKQFTGFPSAP------GFSILD--TCFNL 328
Query: 296 TFTDDL-FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVF 354
T D++ P+I+ HF N L + FY + ++S V CL S+ D + +
Sbjct: 329 TGYDEVSIPTISMHFEGNAELKVDATGTFYVV--KEDASQV-CLALASLSDAY--DTAII 383
Query: 355 GSFQQQNVEVVYDLEKERIGFQPMDCA 381
G++QQ+N V+YD ++ ++GF C+
Sbjct: 384 GNYQQRNQRVIYDTKQSKVGFAEESCS 410
>gi|50508279|dbj|BAD32128.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|125600536|gb|EAZ40112.1| hypothetical protein OsJ_24555 [Oryza sativa Japonica Group]
Length = 455
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 110/387 (28%), Positives = 164/387 (42%), Gaps = 48/387 (12%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V +DTGS+L W C C C + P+RSS+ SR C SFC + +S
Sbjct: 106 VIVDTGSNLIWAQCA----PCTRC--FPRPTPAPVLQPARSSTFSRLPCNGSFCQYLPTS 159
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
P + C+ + YTYG G G L +TL V +
Sbjct: 160 SRPRTCNATAACA---------------YNYTYGSG-YTAGYLATETLTVGDGT------ 197
Query: 124 IPKFCFGC-VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSPL 182
PK FGC + GI G GRG LS+ SQL + FS+C + +SP+
Sbjct: 198 FPKVAFGCSTENGVDNSSGIVGLGRGPLSLVSQLAVGR--FSYCL---RSDMADGGASPI 252
Query: 183 VIGDVA-ISSKDNLQFTPMLKSP--MYPNYYYIGLEAITIGNSSLTEVPLSLREF---DS 236
+ G +A ++ +Q TP+LK+P +YY+ L I + + TE+P++ F +
Sbjct: 253 LFGSLAKLTEGSVVQSTPLLKNPYLQRSTHYYVNLTGIAVDS---TELPVTGSTFGFTQT 309
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERT-GFDLCYRVPCPNN 295
GG +VDSGTT T+L + Y+ + QS + + DLCY+ P
Sbjct: 310 GLGGGTIVDSGTTLTYLAKDGYAMVKQAFQSQMANLNQTTPASGAPYDLDLCYK-PSAGG 368
Query: 296 TFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSA-VKCLLFQSMDDGDYGPSGVF 354
P + F +P N+F + A S V CLL D P +
Sbjct: 369 GGKAVRVPRLALRFAGGAKYNVPVQNYFAGVEADSQGRVTVACLLVLPATDDL--PISII 426
Query: 355 GSFQQQNVEVVYDLEKERIGFQPMDCA 381
G+ Q ++ ++YD++ F P DCA
Sbjct: 427 GNLMQMDMHLLYDIDGGMFSFAPADCA 453
>gi|357514995|ref|XP_003627786.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521808|gb|AET02262.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 436
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 118/383 (30%), Positives = 176/383 (45%), Gaps = 61/383 (15%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGSD+ W+ C C +C N+ F+PS+SSS C S C ++ +
Sbjct: 104 VDTGSDIVWLQCE----PCQEC----YNQTTPMFNPSKSSSYKNIPCPSKLCQSMEDT-- 153
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
S K+ C ++ YG+ G L+ DTL + S+ G+ P
Sbjct: 154 ------------SCNDKNYC-----EYSTYYGDNSHSGGDLSVDTLTLE-STNGLTVSFP 195
Query: 126 KFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFL-AFKYAN-DPNI 178
GC + S GI GFG G S +QLG G FS+C F N N
Sbjct: 196 NIVIGCGTNNILSYEGASSGIVGFGSGPASFITQLGSSTGGKFSYCLTPLFSVTNIQSNA 255
Query: 179 SSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQG 238
+S L GD A S D + TP+LK +YY+ LEA ++GN + E+ + D++G
Sbjct: 256 TSKLNFGDAATVSGDGVVTTPILKKDP-ETFYYLTLEAFSVGNRRV-EIG-GVPNGDNEG 312
Query: 239 NGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTG-FDLCYRVPCPNNTF 297
N +++DSGTT T L + YS L+S + + + V++ T +LCY V F
Sbjct: 313 N--IIIDSGTTLTSLTKDDYS----FLESAVVDLVKLERVDDPTQTLNLCYSVKAEGYDF 366
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSF 357
P IT HF + L + F +++ V CL F+S D +FG+
Sbjct: 367 -----PIITMHF-KGADVDLHPISTFVSVA-----DGVFCLAFESSQD-----HAIFGNL 410
Query: 358 QQQNVEVVYDLEKERIGFQPMDC 380
QQN+ V YDL+++ + F+P DC
Sbjct: 411 AQQNLMVGYDLQQKIVSFKPSDC 433
>gi|449432044|ref|XP_004133810.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 471
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 110/385 (28%), Positives = 173/385 (44%), Gaps = 60/385 (15%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + +DTGSD+ W+ C C +C Y + F+P +S S ++ C + C +
Sbjct: 142 VYMVLDTGSDIVWLQCA----PCKNC--YSQTDPV--FNPVKSGSFAKVLCRTPLCRRLE 193
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
S GC+ + TC + +YG+G TG +TL +
Sbjct: 194 SP----------GCNQ----RQTCL-----YQVSYGDGSYTTGEFVTETLTFRRT----- 229
Query: 122 REIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLG--FLQKGFSHCFLAFKYANDP 176
++ + GC + G+ G GRG LS PSQ G F QK FS+C + ++ P
Sbjct: 230 -KVEQVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRTFNQK-FSYCLVDRSASSKP 287
Query: 177 NISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
S +V G+ A+S +FTP+L +P +YY+ L I++G + ++ + S + D
Sbjct: 288 ---SSVVFGNSAVSR--TARFTPLLTNPRLDTFYYVELLGISVGGTPVSGITASHFKLDR 342
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNT 296
GNGG+++D GT+ T L +P Y L ++ + K E + FD CY + +
Sbjct: 343 TGNGGVIIDCGTSVTRLNKPAYIALRDAFRAGAS---SLKSAPEFSLFDTCYDL----SG 395
Query: 297 FTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGS 356
T P++ HF + LP N+ P + S C F G + G+
Sbjct: 396 KTTVKVPTVVLHF-RGADVSLPASNYLI----PVDGSGRFCFAFAGTTSG----LSIIGN 446
Query: 357 FQQQNVEVVYDLEKERIGFQPMDCA 381
QQQ VVYDL R+GF P CA
Sbjct: 447 IQQQGFRVVYDLASSRVGFSPRGCA 471
>gi|224286159|gb|ACN40790.1| unknown [Picea sitchensis]
Length = 452
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 109/385 (28%), Positives = 167/385 (43%), Gaps = 70/385 (18%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGSD+ W+PC C C + F P++SSS C S C I +
Sbjct: 132 IDTGSDVAWIPCKQ----CQGC-----HSTAPIFDPAKSSSYKPFACDSQPCQEISGNCG 182
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
C F YG+G V G L D + + GS + +P
Sbjct: 183 GNSKC--------------------QFEVLYGDGTQVDGTLASDAITL-GS-----QYLP 216
Query: 126 KFCFGCVGS----TYREPIGIAGFGRGALSVPSQ--LGFLQKGFSHCFLAFKYANDPNIS 179
F FGC S TY P + G + FS+C + + S
Sbjct: 217 NFSFGCAESLSEDTYSSPGLMGLGGGSLSLLTQAPTAELFGGTFSYCLPSSSTS-----S 271
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
LV+G A S +L+FT ++K P +P +Y++ L+AI++GN+ ++ VP + +
Sbjct: 272 GSLVLGKEAAVSSSSLKFTTLIKDPSFPTFYFVTLKAISVGNTRIS-VPAT----NIASG 326
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTD 299
GG ++DSGTT T+L Y L + ++ + VE+ D CY + +++ D
Sbjct: 327 GGTIIDSGTTITYLVPSAYKDLRDAFRQQLSSL-QPTPVED---MDTCYDL---SSSSVD 379
Query: 300 DLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQ 359
P+IT H NV LVLP+ N + S + CL F S D + G+ QQ
Sbjct: 380 --VPTITLHLDRNVDLVLPKENILI-----TQESGLSCLAFSSTDS-----RSIIGNVQQ 427
Query: 360 QNVEVVYDLEKERIGFQPMDCASTA 384
QN +V+D+ ++GF CA+ A
Sbjct: 428 QNWRIVFDVPNSQVGFAQEQCAAPA 452
>gi|38344991|emb|CAE01597.2| OSJNBa0008A08.5 [Oryza sativa Japonica Group]
gi|116309515|emb|CAH66581.1| OSIGBa0137O04.7 [Oryza sativa Indica Group]
gi|222628622|gb|EEE60754.1| hypothetical protein OsJ_14310 [Oryza sativa Japonica Group]
Length = 494
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 120/408 (29%), Positives = 185/408 (45%), Gaps = 82/408 (20%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDY-RNNKL---MSNFSPSRSSSSSRDTCASSFCLN 59
V +DTGSD+ WV +C+ CD R + L ++ + P SS+ S+ +C FC
Sbjct: 104 VQVDTGSDILWV-------NCISCDRCPRKSGLGLELTLYDPKDSSTGSKVSCDQGFCAA 156
Query: 60 IHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKV-HGSSP 118
+ P GC+ S PC ++ TYG+G TG D L+ S
Sbjct: 157 TYGGLLP-------GCTTSL--------PC-EYSVTYGDGSSTTGYFVSDLLQFDQVSGD 200
Query: 119 GIIREI-PKFCFGC-------VGSTYREPIGIAGFGRGALSVPSQL---GFLQKGFSHCF 167
G R FGC +GS+ + GI GFG+ S+ SQL G ++K F+HC
Sbjct: 201 GQTRPANSTVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCL 260
Query: 168 LAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKS-PMYPN--YYYIGLEAITIGNSSL 224
+ N IG+V P +K+ P+ PN +Y + L++I +G ++L
Sbjct: 261 ------DTINGGGIFAIGNVV---------QPKVKTTPLVPNMPHYNVNLKSIDVGGTAL 305
Query: 225 TEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSIL---QSTITYYPRAKEVEER 281
L FD+ G ++DSGTT T+LPE Y +++ + IT++ V+E
Sbjct: 306 ---KLPSHMFDTGEKKGTIIDSGTTLTYLPEIVYKEIMLAVFAKHKDITFH----NVQEF 358
Query: 282 TGFDLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQ 341
F RV DD FP ITFHF N++ L + ++F+ N + C+ FQ
Sbjct: 359 LCFQYVGRV--------DDDFPKITFHFENDLPLNVYPHDYFF-----ENGDNLYCVGFQ 405
Query: 342 S--MDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASAQ 387
+ + D + G N VVYDLE + IG+ +C+S+ +
Sbjct: 406 NGGLQSKDGKGMVLLGDLVLSNKLVVYDLENQVIGWTEYNCSSSIKIK 453
>gi|357450869|ref|XP_003595711.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355484759|gb|AES65962.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 443
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 110/395 (27%), Positives = 182/395 (46%), Gaps = 62/395 (15%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGSDL W+ C C +C Y+ +L F P SS+ S S C
Sbjct: 76 VDTGSDLIWLQC----IPCTNC--YK--QLNPMFDPQSSSTYSNIAYGSESC-------- 119
Query: 66 PFDPCTMSGCSLSTLLKSTCCRP----CPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
+ L ST C P C ++ Y+Y + + G+L ++TL + S+ G
Sbjct: 120 -------------SKLYSTSCSPDQNNC-NYTYSYEDDSITEGVLAQETLTL-TSTTGKP 164
Query: 122 REIPKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLG--FLQKGFSHCFLAFKYAND 175
+ FGC G + +GI G GRG LS+ SQ+G F K FS C + F +
Sbjct: 165 VALKGVIFGCGHNNNGVFNDKEMGIIGLGRGPLSLVSQIGSSFGGKMFSQCLVPFH--TN 222
Query: 176 PNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFD 235
P+I+SP+ G + + + TP++ + +Y++ L I++ + +L S E
Sbjct: 223 PSITSPMSFGKGSEVLGNGVVSTPLVSKNTHQAFYFVTLLGISVEDINLPFNDGSSLEPI 282
Query: 236 SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNN 295
++GN +++DSGT T LPE FY +L+ +++ + P ++ G+ LCYR P
Sbjct: 283 TKGN--MVIDSGTPTTLLPEDFYHRLVEEVRNKVALDPIP--IDPTLGYQLCYRTP---- 334
Query: 296 TFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFG 355
T+ ++T HF L+ P + P + C F S +Y G++G
Sbjct: 335 --TNLKGTTLTAHFEGADVLLTPT-----QIFIPVQ-DGIFCFAFTSTFSNEY---GIYG 383
Query: 356 SFQQQNVEVVYDLEKERIGFQPMDCASTASAQGLH 390
+ Q N + +DLEK+ + F+ DC + A ++
Sbjct: 384 NHAQSNYLIGFDLEKQLVSFKATDCTNLQDAPSIN 418
>gi|147811402|emb|CAN61225.1| hypothetical protein VITISV_006732 [Vitis vinifera]
Length = 440
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 112/388 (28%), Positives = 179/388 (46%), Gaps = 74/388 (19%)
Query: 7 DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFC--LNIHSSD 64
DTGSDL W C C+ C Y+ M F PS+S+S +C S C L+ S
Sbjct: 109 DTGSDLMWTQC----LPCLSC--YKQKNPM--FDPSKSTSFKEVSCESQQCRLLDTVSCS 160
Query: 65 NPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREI 124
P C F+Y YG+G L G++ +TL ++ +S G I
Sbjct: 161 QPQKLC--------------------DFSYGYGDGSLAQGVIATETLTLNSNS-GQPXSI 199
Query: 125 PKFCFGC----VGSTYREPIGIAGFGRGALSVPSQ----LGFLQKGFSHCFLAFKYANDP 176
FGC G+ +G+ G G LS+ SQ LG +K FS C + F+ DP
Sbjct: 200 XNIVFGCGHNNSGTFNENEMGLFGTGGRPLSLTSQIMSTLGSGRK-FSQCLVPFR--TDP 256
Query: 177 NISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSL---TEVPLSLRE 233
+I+S ++ G A S + TP++ P YY++ L+ I++G+ + P++ +
Sbjct: 257 SITSKIIFGPEAEVSGSXVVSTPLVTKD-DPTYYFVTLDGISVGDKLFPFSSSSPMATK- 314
Query: 234 FDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYP-RAKEVEERTGFDLCYRVPC 292
G + +D+GT T LP FY++L+ ++ I P + +++ + LCYR
Sbjct: 315 ------GNVFIDAGTPPTLLPRDFYNRLVQGVKEAIPMEPVQDPDLQPQ----LCYR--- 361
Query: 293 PNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSG 352
+ T D P +T HF + + L N F S V C Q +D G +G
Sbjct: 362 -SATLIDG--PILTAHF-DGADVQLKPLNTFI-----SPKEGVYCFAMQPID----GDTG 408
Query: 353 VFGSFQQQNVEVVYDLEKERIGFQPMDC 380
+FG+F Q N + +DL+ +++ F+ +DC
Sbjct: 409 IFGNFVQMNFLIGFDLDGKKVSFKAVDC 436
>gi|357133002|ref|XP_003568117.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 497
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 117/404 (28%), Positives = 178/404 (44%), Gaps = 78/404 (19%)
Query: 3 QVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKL-MSNFSPSRSSSSSRDTCASSFCLNIH 61
V +DTGSD+ WV C C C + ++ + P SSS S +C + FC +
Sbjct: 101 HVQVDTGSDILWVNC----VSCDKCPTKSGLGIDLALYDPKGSSSGSAVSCDNKFCAATY 156
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
S CT +PC + YG+G G D+L+ + S
Sbjct: 157 GSGEKLPGCTAG-------------KPC-EYRAEYGDGSSTAGSFVSDSLQYNQLSGNAQ 202
Query: 122 REIPK--FCFGC-------VGSTYREPIGIAGFGRGALSVPSQL---GFLQKGFSHCFLA 169
K FGC + ST + GI GFG+ S SQL G ++K FSHC
Sbjct: 203 TRHAKANVIFGCGAQQGGDLESTNQALDGIIGFGQSNTSTLSQLASAGEVKKIFSHCLDT 262
Query: 170 FKYANDPNISSPLVIGDVAISSKDNLQFTPMLKS-PMYPN--YYYIGLEAITIGNSSLTE 226
K IG+V P +KS P+ PN +Y + L++I + ++L
Sbjct: 263 IKGGG------IFAIGEVV---------QPKVKSTPLLPNMSHYNVNLQSIDVAGNALQL 307
Query: 227 VPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERT--GF 284
P F++ G ++DSGTT T+LPE Y +L+ + + + +++ RT GF
Sbjct: 308 PP---HIFETSEKRGTIIDSGTTLTYLPELVYKDILAAV------FQKHQDITFRTIQGF 358
Query: 285 DLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMD 344
LC+ + DD FP ITFHF +++ L + ++F+ N + CL FQ
Sbjct: 359 -LCFEY----SESVDDGFPKITFHFEDDLGLNVYPHDYFF-----QNGDNLYCLGFQ--- 405
Query: 345 DGDYGPSG-----VFGSFQQQNVEVVYDLEKERIGFQPMDCAST 383
+G + P + G N VVYDLEK+ IG+ +C+S+
Sbjct: 406 NGGFQPKDAKDMVLLGDLVLSNKVVVYDLEKQVIGWTDYNCSSS 449
>gi|356528671|ref|XP_003532923.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 440
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 109/384 (28%), Positives = 175/384 (45%), Gaps = 64/384 (16%)
Query: 7 DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDNP 66
DTGSDL WV C C C +N L F P +SS+ C S
Sbjct: 110 DTGSDLIWVQCA----PCEKCVP-QNAPL---FDPRKSSTFKTVPCDS------------ 149
Query: 67 FDPCTMSGCSLSTLL-KSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
PCT+ S + KS C + Y YG+ LV+GIL +++ + I + P
Sbjct: 150 -QPCTLLPPSQRACVGKSGQCY----YQYIYGDHTLVSGILGFESINFGSKNNAI--KFP 202
Query: 126 KFCFGCVGST------YREPIGIAGFGRGALSVPSQLGF-LQKGFSHCFLAFKYANDPNI 178
K FGC S + +G+ G G G LS+ SQLG+ + + FS+CF N
Sbjct: 203 KLTFGCTFSNNDTVDESKRNMGLVGLGVGPLSLISQLGYQIGRKFSYCFPPL----SSNS 258
Query: 179 SSPLVIGDVAISSK-DNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
+S + G+ AI + + TP++ + P+YYY+ LE ++IGN + + +SQ
Sbjct: 259 TSKMRFGNDAIVKQIKGVVSTPLIIKSIGPSYYYLNLEGVSIGNKKV-------KTSESQ 311
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTF 297
+G +L+DSGT++T L + FY++ +++++ Y A ++ ++ C+ N
Sbjct: 312 TDGNILIDSGTSFTILKQSFYNKFVALVKE--VYGVEAVKIPPLV-YNFCFE-----NKG 363
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSF 357
FP + F F + + N F A + + C++ D D +FG+
Sbjct: 364 KRKRFPDVVFLF-TGAKVRVDASNLFEA-----EDNNLLCMVALPTSDED---DSIFGNH 414
Query: 358 QQQNVEVVYDLEKERIGFQPMDCA 381
Q +V YDL+ + F P DCA
Sbjct: 415 AQIGYQVEYDLQGGMVSFAPADCA 438
>gi|242079447|ref|XP_002444492.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
gi|241940842|gb|EES13987.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
Length = 441
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 108/384 (28%), Positives = 159/384 (41%), Gaps = 60/384 (15%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
MDTGSDL W C C+ C + F RS++ C SS C + S
Sbjct: 106 MDTGSDLIWTQCA----PCLLCAA----QPTPYFDVKRSATYRALPCRSSRCAALSS--- 154
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
+ K C + Y YG+ G+L +T +S +R
Sbjct: 155 ------------PSCFKKMCV-----YQYYYGDTASTAGVLANETFTFGAASSTKVRAA- 196
Query: 126 KFCFGCVGSTYREPI---GIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSPL 182
FGC E G+ GFGRG LS+ SQLG FS+C ++ S L
Sbjct: 197 NISFGCGSLNAGELANSSGMVGFGRGPLSLVSQLG--PSRFSYCLTSYLSPTP----SRL 250
Query: 183 VIGDVA------ISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
G A SS +Q TP + +P PN Y++ ++ I++G L PL + +
Sbjct: 251 YFGVFANLNSTNTSSGSPVQSTPFVINPALPNMYFLSVKGISLGTKRLPIDPL-VFAIND 309
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNT 296
G GG+++DSGT+ T L + Y + L STI P + G D C++ P P N
Sbjct: 310 DGTGGVIIDSGTSITWLQQDAYEAVRRGLASTI---PLPAMNDTDIGLDTCFQWPPPPNV 366
Query: 297 FTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGS 356
P FHF + ++ LP N+ S ++ CL G + G+
Sbjct: 367 TVT--VPDFVFHF-DGANMTLPPENYMLIAS----TTGYLCLAMAPTSVGT-----IIGN 414
Query: 357 FQQQNVEVVYDLEKERIGFQPMDC 380
+QQQN+ ++YD+ + F P C
Sbjct: 415 YQQQNLHLLYDIANSFLSFVPAPC 438
>gi|356508308|ref|XP_003522900.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 439
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 112/374 (29%), Positives = 168/374 (44%), Gaps = 62/374 (16%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
MDT +D W+PC C C + F+P +S++ +C S C +
Sbjct: 115 MDTSNDAAWIPCT----ACDGCTS-------TLFAPEKSTTFKNVSCGSPQCNQV----- 158
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
P C S C+ F TYG + ++ +DT+ + + P IP
Sbjct: 159 PNPSCGTSACT---------------FNLTYGSSSIAANVV-QDTVTL-ATDP-----IP 196
Query: 126 KFCFGCVGSTY---REPIGIAGFGRGALSVPSQLGFL-QKGFSHCFLAFKYANDPNISSP 181
+ FGCV T P G+ G GRG LS+ SQ L Q FS+C +FK N S
Sbjct: 197 DYTFGCVAKTTGASAPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLN---FSGS 253
Query: 182 LVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGG 241
L +G VA + +++TP+LK+P + YY+ L AI +G + ++P F++ G
Sbjct: 254 LRLGPVAQPIR--IKYTPLLKNPRRSSLYYVNLVAIRVGR-KVVDIPPEALAFNAATGAG 310
Query: 242 LLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAK-EVEERTGFDLCYRVPCPNNTFTDD 300
+ DSGT +T L P Y+ + Q + +A V GFD CY VP
Sbjct: 311 TVFDSGTVFTRLVAPAYTAVRDEFQRRVAIAAKANLTVTSLGGFDTCYTVPI-------- 362
Query: 301 LFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQ 360
+ P+ITF F + +++ LP+ N +A S + CL S D V + QQQ
Sbjct: 363 VAPTITFMF-SGMNVTLPEDNILIHSTAGSTT----CLAMASAPDNVNSVLNVIANMQQQ 417
Query: 361 NVEVVYDLEKERIG 374
N V+YD+ R+G
Sbjct: 418 NHRVLYDVPNSRLG 431
>gi|356557014|ref|XP_003546813.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 435
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 111/387 (28%), Positives = 160/387 (41%), Gaps = 70/387 (18%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGS L W+ C C +C F P +SS+ TC S
Sbjct: 106 VDTGSSLIWLQCS----PCHNCFPQETPL----FEPLKSSTYKYATCDS----------- 146
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
PCT+ S K C + YG+ GIL +TL + P
Sbjct: 147 --QPCTLLQPSQRDCGKLGQCI----YGIMYGDKSFSVGILGTETLSFGSTGGAQTVSFP 200
Query: 126 KFCFGC------VGSTYREPIGIAGFGRGALSVPSQLGF-LQKGFSHCFLAFKYANDPNI 178
FGC T + +GIAG G G LS+ SQLG + FS+C L + D
Sbjct: 201 NTIFGCGVDNNFTIYTSNKVMGIAGLGAGPLSLVSQLGAQIGHKFSYCLLPY----DSTS 256
Query: 179 SSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQG 238
+S L G AI + + + TP++ P P YY++ LEA+TIG ++ Q
Sbjct: 257 TSKLKFGSEAIITTNGVVSTPLIIKPSLPTYYFLNLEAVTIGQKVVST---------GQT 307
Query: 239 NGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNT-- 296
+G +++DSGT T+L FY+ ++ LQ T+ G L +P P T
Sbjct: 308 DGNIVIDSGTPLTYLENTFYNNFVASLQETL-------------GVKLLQDLPSPLKTCF 354
Query: 297 --FTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVF 354
+ P I F F + P+ + P S + CL + G S +F
Sbjct: 355 PNRANLAIPDIAFQFTGASVALRPKN-----VLIPLTDSNILCL--AVVPSSGIGIS-LF 406
Query: 355 GSFQQQNVEVVYDLEKERIGFQPMDCA 381
GS Q + +V YDLE +++ F P DCA
Sbjct: 407 GSIAQYDFQVEYDLEGKKVSFAPTDCA 433
>gi|357492401|ref|XP_003616489.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517824|gb|AES99447.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 118/392 (30%), Positives = 176/392 (44%), Gaps = 83/392 (21%)
Query: 7 DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDNP 66
DTGSD+ W+ C C C N+ F+PS+SSS C S C HS
Sbjct: 105 DTGSDIVWLQCE----PCEQC----YNQTTPIFNPSKSSSYKNIPCLSKLC---HS---- 149
Query: 67 FDPCTMSGCSLSTLLKSTCCRPCPSFAY--TYGEGGLVTGILTRDTLKVHGSSPGIIREI 124
++ T C S Y +YG+ G L+ DTL + +S G
Sbjct: 150 --------------VRDTSCSDQNSCQYKISYGDSSHSQGDLSVDTLSLESTS-GSPVSF 194
Query: 125 PKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNIS 179
PK GC G+ GI G G G +S+ +QLG G FS+C + + N S
Sbjct: 195 PKTVIGCGTDNAGTFGGASSGIVGLGGGPVSLITQLGSSIGGKFSYCLVPL-LNKESNAS 253
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
S L GD A+ S D + TP++K P +Y++ L+A ++GN + E S D +GN
Sbjct: 254 SILSFGDAAVVSGDGVVSTPLIKKD--PVFYFLTLQAFSVGNKRV-EFGGSSEGGDDEGN 310
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEE-RTGFDLCYRVPCPNNTFT 298
+++DSGTT T +P Y+ L +S + + V++ F LCY + +N +
Sbjct: 311 --IIIDSGTTLTLIPSDVYTNL----ESAVVDLVKLDRVDDPNQQFSLCYSLK--SNEYD 362
Query: 299 DDLFPSITFHF------LNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPS- 351
FP IT HF L+++S +P + + C FQ PS
Sbjct: 363 ---FPIITAHFKGADIELHSISTFVPI------------TDGIVCFAFQ--------PSP 399
Query: 352 ---GVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
+FG+ QQN+ V YDL+++ + F+P DC
Sbjct: 400 QLGSIFGNLAQQNLLVGYDLQQKTVSFKPTDC 431
>gi|414589628|tpg|DAA40199.1| TPA: hypothetical protein ZEAMMB73_627989 [Zea mays]
Length = 452
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 111/390 (28%), Positives = 166/390 (42%), Gaps = 57/390 (14%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNI- 60
+ +DTGSDL W C + C+ D F+P +S+S CA + C +I
Sbjct: 109 VSALLDTGSDLIWTQCAPCA-SCLSQPD-------PLFAPGQSASYEPMRCAGTLCSDIL 160
Query: 61 -HSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILT--RDTLKVHGSS 117
HS + P D CT + Y YG+G + G+ R T G
Sbjct: 161 HHSCERP-DTCT--------------------YRYNYGDGTMTVGVYATERFTFASSGGG 199
Query: 118 PGIIREIPKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYA 173
+P FGC VGS GI GFGR LS+ SQL + FS+C ++
Sbjct: 200 GLTTTTVP-LGFGCGSVNVGS-LNNGSGIVGFGRNPLSLVSQLSI--RRFSYCLTSYASR 255
Query: 174 NDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLRE 233
+ + V + +Q TP+L+SP P +YY+ +T+G L +P S
Sbjct: 256 RQSTLLFGSLSDGVYGDATGRVQTTPLLQSPQNPTFYYVHFTGLTVGARRL-RIPESAFA 314
Query: 234 FDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCP 293
G+GG++VDSGT T LP ++++ + + P A G +C+ VP
Sbjct: 315 LRPDGSGGVIVDSGTALTLLPAAVLAEVVRAFRQQLR-LPFANGGNPEDG--VCFLVPAA 371
Query: 294 --NNTFTDDL-FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGP 350
++ T + P + HF L LP+ N+ + CLL D GD G
Sbjct: 372 WRRSSSTSQMPVPRMVLHF-QGADLDLPRRNYVL----DDHRRGRLCLLL--ADSGDDGS 424
Query: 351 SGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
+ G+ QQ++ V+YDLE E + P C
Sbjct: 425 T--IGNLVQQDMRVLYDLEAETLSIAPARC 452
>gi|224130878|ref|XP_002320947.1| predicted protein [Populus trichocarpa]
gi|222861720|gb|EEE99262.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 116/386 (30%), Positives = 173/386 (44%), Gaps = 63/386 (16%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
I DTGSDL W C C C Y+ + F P +SS + RD
Sbjct: 108 IMGIADTGSDLIWTQCK----PCERC--YKQVDPL--FDP-KSSKTYRDF---------- 148
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
C CSL L +STC + Y+YG+ G + DT+ + S+ G
Sbjct: 149 -------SCDARQCSL--LDQSTCSGNICQYQYSYGDRSYTMGNVASDTITLD-STTGSP 198
Query: 122 REIPKFCFGCV----GSTYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDP 176
PK GC G+ + GI G G G LS+ SQ+G G FS+C + ++
Sbjct: 199 VSFPKTVIGCGHENDGTFSDKGSGIVGLGAGPLSLISQMGSSVGGKFSYCLVPL--SSRA 256
Query: 177 NISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
SS L G A+ S +Q TP+L S ++Y++ LEA+++GN + SL
Sbjct: 257 GNSSKLNFGSNAVVSGPGVQSTPLLSSETMSSFYFLTLEAMSVGNERIKFGDSSL----G 312
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGF-DLCYRVPCPNN 295
G G +++DSGTT T +P+ F+S L + + + + + E+ +GF +CY
Sbjct: 313 TGEGNIIIDSGTTLTIVPDDFFSNLSTAVGNQV----EGRRAEDPSGFLSVCYSA----- 363
Query: 296 TFTDDL-FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVF 354
T DL P+IT HF + L N F + S V CL F S G ++
Sbjct: 364 --TSDLKVPAITAHF-TGADVKLKPINTFVQV-----SDDVVCLAFASTTSG----ISIY 411
Query: 355 GSFQQQNVEVVYDLEKERIGFQPMDC 380
G+ Q N V Y+++ + + F+P DC
Sbjct: 412 GNVAQMNFLVEYNIQGKSLSFKPTDC 437
>gi|356539555|ref|XP_003538263.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 438
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 112/374 (29%), Positives = 168/374 (44%), Gaps = 62/374 (16%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DT +D W+PC C C + F+P +S++ +C S C +
Sbjct: 114 IDTSNDAAWIPCT----ACDGCTS-------TLFAPEKSTTFKNVSCGSPECNKV----- 157
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
P C S C+ F TYG + ++ +DT+ + + P IP
Sbjct: 158 PSPSCGTSACT---------------FNLTYGSSSIAANVV-QDTVTL-ATDP-----IP 195
Query: 126 KFCFGCVGSTY---REPIGIAGFGRGALSVPSQLGFL-QKGFSHCFLAFKYANDPNISSP 181
+ FGCV T P G+ G GRG LS+ SQ L Q FS+C +FK N S
Sbjct: 196 GYTFGCVAKTTGPSTPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLN---FSGS 252
Query: 182 LVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGG 241
L +G VA +++TP+LK+P + YY+ L AI +G + ++P + F++ G
Sbjct: 253 LRLGPVA--QPIRIKYTPLLKNPRRSSLYYVNLFAIRVGR-KIVDIPPAALAFNAATGAG 309
Query: 242 LLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAK-EVEERTGFDLCYRVPCPNNTFTDD 300
+ DSGT +T L P Y+ + + + +A V GFD CY VP
Sbjct: 310 TVFDSGTVFTRLVAPVYTAVRDEFRRRVAMAAKANLTVTSLGGFDTCYTVPI-------- 361
Query: 301 LFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQ 360
+ P+ITF F + +++ LPQ N +A S S CL S D V + QQQ
Sbjct: 362 VAPTITFMF-SGMNVTLPQDNILIHSTAGSTS----CLAMASAPDNVNSVLNVIANMQQQ 416
Query: 361 NVEVVYDLEKERIG 374
N V+YD+ R+G
Sbjct: 417 NHRVLYDVPNSRLG 430
>gi|312283333|dbj|BAJ34532.1| unnamed protein product [Thellungiella halophila]
Length = 428
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 111/384 (28%), Positives = 156/384 (40%), Gaps = 69/384 (17%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V +DT +D WVPC C+ C F PS+SSSS C + C
Sbjct: 106 VALDTSNDAAWVPCSG----CVGCASS------VLFDPSKSSSSRNLQCDAPQC-----K 150
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
P CT + C F TYG G + LT+DTL +
Sbjct: 151 QAPNPTCTAG-------------KSC-GFNMTYG-GSTIEASLTQDTLTLAND------V 189
Query: 124 IPKFCFGCVGS---TYREPIGIAGFGRGALSVPSQLGFL-QKGFSHCFLAFKYANDPNIS 179
I + FGC+ T G+ G GRG LS+ SQ L FS+C PN
Sbjct: 190 IKSYTFGCISKATGTSLPAQGLMGLGRGPLSLISQTQNLYMSTFSYCL--------PNSK 241
Query: 180 SPLVIGDVAISSK---DNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
S G + + K ++ TP+LK+P + YY+ L I +GN + ++P S FD+
Sbjct: 242 SSNFSGSLRLGPKYQPVRIKTTPLLKNPRRSSLYYVNLVGIRVGNK-IVDIPTSALAFDA 300
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNT 296
G + DSGT +T L EP Y + + + I + GFD CY
Sbjct: 301 STGAGTIFDSGTVFTRLVEPAYVAVRNEFRRRI----KNANATSLGGFDTCYS------- 349
Query: 297 FTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGS 356
++PS+TF F +++ LP N S+S + CL + + V S
Sbjct: 350 -GSVVYPSVTFMFA-GMNVTLPPDNLLIH----SSSGSTSCLAMAAAPNNVNSVLNVIAS 403
Query: 357 FQQQNVEVVYDLEKERIGFQPMDC 380
QQQN V+ DL R+G C
Sbjct: 404 MQQQNHRVLIDLPNSRLGISRETC 427
>gi|148910443|gb|ABR18297.1| unknown [Picea sitchensis]
Length = 452
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 109/386 (28%), Positives = 167/386 (43%), Gaps = 72/386 (18%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGSD+ W+PC C C + F P++SSS C S C I +
Sbjct: 132 IDTGSDVAWIPCKQ----CQGC-----HSTAPIFDPAKSSSYKPFACDSQPCQEISGNCG 182
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
C F +YG+G V G L D + + GS + +P
Sbjct: 183 GNSKC--------------------QFEVSYGDGTQVDGTLASDAITL-GS-----QYLP 216
Query: 126 KFCFGCVGSTYREP-------IGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNI 178
F FGC S + G P+ F FS+C + +
Sbjct: 217 NFSFGCAESLSEDTSPSPGLMGLGGGSLSLLTQAPTAELF-GGTFSYCLPSSSTS----- 270
Query: 179 SSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQG 238
S LV+G A S +L+FT ++K P P +Y++ L+AI++GN+ ++ VP + +
Sbjct: 271 SGSLVLGKEAAVSSSSLKFTTLIKDPSIPTFYFVTLKAISVGNTRIS-VPGT----NIAS 325
Query: 239 NGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFT 298
GG ++DSGTT THL Y+ L + ++ + VE+ D CY + +++
Sbjct: 326 GGGTIIDSGTTITHLVPSAYTALRDAFRQQLSSL-QPTPVED---MDTCYDL---SSSSV 378
Query: 299 DDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQ 358
D P+IT H NV LVLP+ N + S + CL F S D + G+ Q
Sbjct: 379 D--VPTITLHLDRNVDLVLPKENILI-----TQESGLACLAFSSTDS-----RSIIGNVQ 426
Query: 359 QQNVEVVYDLEKERIGFQPMDCASTA 384
QQN +V+D+ ++GF CA+ A
Sbjct: 427 QQNWRIVFDVPNSQVGFAQEQCAAPA 452
>gi|297843774|ref|XP_002889768.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
lyrata]
gi|297335610|gb|EFH66027.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
lyrata]
Length = 449
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 101/381 (26%), Positives = 161/381 (42%), Gaps = 56/381 (14%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DT +D W+PC C C + + ++ S + S S C + L
Sbjct: 120 MVLDTSNDAVWLPCSG----CSGCSNASTSFNTNSSSTYSTVSCSTTQCTQARGLT---- 171
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
C ST S C SF +YG + L +DTL + SP +I
Sbjct: 172 -----------CPSSTPQPSIC-----SFNQSYGGDSSFSANLVQDTLTL---SPDVI-- 210
Query: 124 IPKFCFGCVGSTYRE---PIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNIS 179
P F FGC+ S P G+ G GRG +S+ SQ L G FS+C +F+ S
Sbjct: 211 -PNFSFGCINSASGNSLPPQGLMGLGRGPMSLVSQTTSLYSGVFSYCLPSFRSFY---FS 266
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
L +G + ++++TP+L++P P+ YY+ L +++G+ + P+ L FDS
Sbjct: 267 GSLKLG--LLGQPKSIRYTPLLRNPRRPSLYYVNLTGVSVGSVQVPVDPVYL-TFDSNSG 323
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTD 299
G ++DSGT T +P Y + + + FD C+ +
Sbjct: 324 AGTIIDSGTVITRFAQPVYEAIRDEFRKQVN-----GSFSTLGAFDTCFSAD------NE 372
Query: 300 DLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQ 359
++ P IT H + ++ L LP N SA + CL + V + QQ
Sbjct: 373 NVTPKITLH-MTSLDLKLPMENTLIHSSA----GTLTCLSMAGIRQNANAVLNVIANLQQ 427
Query: 360 QNVEVVYDLEKERIGFQPMDC 380
QN+ +++D+ RIG P C
Sbjct: 428 QNLRILFDVPNSRIGIAPEPC 448
>gi|242053503|ref|XP_002455897.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
gi|241927872|gb|EES01017.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
Length = 493
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 108/386 (27%), Positives = 158/386 (40%), Gaps = 60/386 (15%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGSD+ W+ C C C D F P RSSS CA+ C + S
Sbjct: 157 LDTGSDVVWLQCAP----CRRCYDQSG----PVFDPRRSSSYGAVDCAAPLCRRLDSG-- 206
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
GC L + C + YG+G + G +TL G + +
Sbjct: 207 --------GCDLR---RRACL-----YQVAYGDGSVTAGDFATETLTFAGGA-----RVA 245
Query: 126 KFCFGCVGST---YREPIGIAGFGRGALSVPSQLGF-LQKGFSHCFLAFKYANDPNISSP 181
+ GC + G+ G GRG+LS P+Q+ K FS+C + ++ +S
Sbjct: 246 RVALGCGHDNEGLFVAAAGLLGLGRGSLSFPTQISRRYGKSFSYCLVDRTSSSSSGAASR 305
Query: 182 LVIGDVAIS--SKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLS-LREFDSQG 238
V S FTPM+++P +YY+ L I++G + + V S LR S G
Sbjct: 306 SRSSTVTFGPPSASAASFTPMVRNPRMETFYYVQLVGISVGGARVPGVAESDLRLDPSTG 365
Query: 239 NGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTG----FDLCYRVPCPN 294
GG++VDSGT+ T L P YS L ++ A + G FD CY +
Sbjct: 366 RGGVIVDSGTSVTRLARPSYSALRDAFRAA------AAGLRLSPGGFSLFDTCYDLGGRK 419
Query: 295 NTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVF 354
P+++ HF LP N+ P +S C F D G +
Sbjct: 420 VV----KVPTVSMHFAGGAEAALPPENYLI----PVDSRGTFCFAFAGTDGG----VSII 467
Query: 355 GSFQQQNVEVVYDLEKERIGFQPMDC 380
G+ QQQ VV+D + +R+GF P C
Sbjct: 468 GNIQQQGFRVVFDGDGQRVGFAPKGC 493
>gi|356573235|ref|XP_003554768.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 472
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 108/384 (28%), Positives = 171/384 (44%), Gaps = 59/384 (15%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + +DTGSD+ W+ C C C + F P++S + + C + C +
Sbjct: 142 VYMVLDTGSDVVWLQCA----PCRKC----YTQADPVFDPTKSRTYAGIPCGAPLCRRL- 192
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
D+P GC+ K+ C+ + +YG+G G + +TL +
Sbjct: 193 --DSP-------GCNN----KNKVCQ----YQVSYGDGSFTFGDFSTETLTFRRT----- 230
Query: 122 REIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLG--FLQKGFSHCFLAFKYANDP 176
+ + GC + G+ G GRG LS P Q G F QK FS+C + + P
Sbjct: 231 -RVTRVALGCGHDNEGLFIGAAGLLGLGRGRLSFPVQTGRRFNQK-FSYCLVDRSASAKP 288
Query: 177 NISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
S +V GD A+S +FTP++K+P +YY+ L I++G S + + SL D+
Sbjct: 289 ---SSVVFGDSAVSR--TARFTPLIKNPKLDTFYYLELLGISVGGSPVRGLSASLFRLDA 343
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNT 296
GNGG+++DSGT+ T L P Y L + ++ RA E FD C+ + +
Sbjct: 344 AGNGGVIIDSGTSVTRLTRPAYIALRDAFRVGASHLKRAAEFSL---FDTCFDL----SG 396
Query: 297 FTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGS 356
T+ P++ HF + LP N+ P ++S C F G + G+
Sbjct: 397 LTEVKVPTVVLHF-RGADVSLPATNYLI----PVDNSGSFCFAFAGTMSG----LSIIGN 447
Query: 357 FQQQNVEVVYDLEKERIGFQPMDC 380
QQQ V +DL R+GF P C
Sbjct: 448 IQQQGFRVSFDLAGSRVGFAPRGC 471
>gi|297848386|ref|XP_002892074.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297337916|gb|EFH68333.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 485
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 108/385 (28%), Positives = 169/385 (43%), Gaps = 59/385 (15%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + +DTGSD+ W+ C C C Y + + F P +S + + C+S C +
Sbjct: 155 VYMVLDTGSDIVWLQCA----PCRRC--YSQSDPI--FDPRKSKTYATIPCSSPHCRRLD 206
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
S+ GC+ + TC + +YG+G G + +TL +
Sbjct: 207 SA----------GCNTR---RKTCL-----YQVSYGDGSFTVGDFSTETLTFRRN----- 243
Query: 122 REIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLG--FLQKGFSHCFLAFKYANDP 176
+ GC + G+ G G+G LS P Q G F QK FS+C + ++ P
Sbjct: 244 -RVKGVALGCGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQK-FSYCLVDRSASSKP 301
Query: 177 NISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
S +V G+ A+S +FTP+L +P +YY+ L I++G + + V SL + D
Sbjct: 302 ---SSVVFGNAAVSRI--ARFTPLLSNPKLDTFYYVELLGISVGGTRVPGVAASLFKLDQ 356
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNT 296
GNGG+++DSGT+ T L P Y + + RA + FD C+ + N
Sbjct: 357 IGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKALKRAPDFSL---FDTCFDLSNMNEV 413
Query: 297 FTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGS 356
P++ HF + LP N+ P +++ C F G G + G+
Sbjct: 414 ----KVPTVVLHF-RGADVSLPATNYLI----PVDTNGKFCFAFA----GTMGGLSIIGN 460
Query: 357 FQQQNVEVVYDLEKERIGFQPMDCA 381
QQQ VVYDL R+GF P CA
Sbjct: 461 IQQQGFRVVYDLASSRVGFAPGGCA 485
>gi|356555042|ref|XP_003545848.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 431
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 113/378 (29%), Positives = 170/378 (44%), Gaps = 57/378 (15%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DT SD+ WV C C C Y + M F PS S + C+S+ C ++
Sbjct: 105 VDTASDIIWVQCQL----CETC--YNDTSPM--FDPSYSKTYKNLPCSSTTCKSVQ---- 152
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
G S S+ + C Y +G G L +T+ + GS P
Sbjct: 153 --------GTSCSSDERKIC-----EHTVNYKDGSHSQGDLIVETVTL-GSYNDPFVHFP 198
Query: 126 KFCFGCVGSTYR--EPIGIAGFGRGALSVPSQLGF-LQKGFSHCFLAFKYANDPNISSPL 182
+ GC+ +T + IGI G G G +S+ QL + K FS+C A + SS L
Sbjct: 199 RTVIGCIRNTNVSFDSIGIVGLGGGPVSLVPQLSSSISKKFSYCL-----APISDRSSKL 253
Query: 183 VIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGGL 242
GD A+ S D T ++ + +YY+ LEA ++GN+ + S G G +
Sbjct: 254 KFGDAAMVSGDGTVSTRIVFKD-WKKFYYLTLEAFSVGNN---RIEFRSSSSRSSGKGNI 309
Query: 243 LVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDLF 302
++DSGTT+T LP+ YS+L S + + RA++ ++ F LCY+ +T+
Sbjct: 310 IIDSGTTFTVLPDDVYSKLESAVADVVK-LERAEDPLKQ--FSLCYK-----STYDKVDV 361
Query: 303 PSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQNV 362
P IT HF + + L N F S V CL F S G +FG+ QQN
Sbjct: 362 PVITAHF-SGADVKLNALNTFIVA-----SHRVVCLAFLSSQSG-----AIFGNLAQQNF 410
Query: 363 EVVYDLEKERIGFQPMDC 380
V YDL+++ + F+P DC
Sbjct: 411 LVGYDLQRKIVSFKPTDC 428
>gi|326524580|dbj|BAK00673.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 461
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 115/395 (29%), Positives = 165/395 (41%), Gaps = 65/395 (16%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNI- 60
+ +DTGSDL W C + C+ D F+P+ SSS C+ C +I
Sbjct: 116 VSALLDTGSDLIWTQCAPCA-SCLAQPD-------PLFAPAASSSYVPMRCSGQLCNDIL 167
Query: 61 -HSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPG 119
HS P D CT + Y YG+G G+ + SS G
Sbjct: 168 HHSCQRP-DTCT--------------------YRYNYGDGTTTLGVYATERF-TFASSSG 205
Query: 120 IIREIPKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYAND 175
+P FGC VGS GI GFGR LS+ SQL + FS+C +
Sbjct: 206 EKLSVP-LGFGCGTMNVGS-LNNGSGIVGFGRDPLSLVSQLSI--RRFSYCLTPYTSTRK 261
Query: 176 P-----NISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLS 230
++S + GD A + + +Q T +L+S P +YY+ +T+G L +PLS
Sbjct: 262 STLMFGSLSDGVFEGDDAATGQ--VQTTRLLQSRQNPTFYYVPFTGVTVGTRRL-RIPLS 318
Query: 231 LREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRV 290
G+GG++VDSGT T P +++L ++ + P G +C+
Sbjct: 319 AFALRPDGSGGVIVDSGTALTLFPAAVLTEVLRAFRAQL-RLPFTSSSSPDDG--VCFAT 375
Query: 291 PCPNNTFTDDL-----FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDD 345
P P + FHF L LP+ N Y + P S C+L D
Sbjct: 376 PMAAGGRRASAATVVSVPRMAFHF-QGADLELPRRN--YVLDDPRRGSL--CILL--ADS 428
Query: 346 GDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
GD G G+F QQ++ V+YDLE E + F P C
Sbjct: 429 GDSG--ATIGNFVQQDMRVLYDLEAETLSFAPAQC 461
>gi|302784853|ref|XP_002974198.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
gi|300157796|gb|EFJ24420.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
Length = 359
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 115/391 (29%), Positives = 172/391 (43%), Gaps = 64/391 (16%)
Query: 1 VIQVYMDTGSDLTWVPCGNLSFDCMDCD-DYRNNKLMSNFSPSRSSSSSRDTCASSFCLN 59
+I +DTGSDL W+ C N C CD D+ + F SSS + C S+ C
Sbjct: 17 LIPAMIDTGSDLVWLKCDN----CDHCDLDHHGETI---FFSDASSSYKKLPCNSTHC-- 67
Query: 60 IHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRD--TLKVHGSS 117
MS + + TC + Y YG+G +G + D + + HG+
Sbjct: 68 ----------SGMSSAGIGPRCEETC-----KYKYEYGDGSRTSGDVGSDRISFRSHGAG 112
Query: 118 PGIIREIPKFCFGC---VGSTYREPIGIAGFGRGALSVPSQLG-FLQKGFSHCFLAFKYA 173
F FGC + + G+ G G+ + S+ QLG L FS+C ++ Y
Sbjct: 113 EDHRSFFDGFLFGCARKLKGDWNFTQGLIGLGQKSHSLIQQLGDKLGYKFSYCLVS--YD 170
Query: 174 NDPNISSPLVIGDVAISSKDNLQFTPMLKSP-MYPNYYYIGLEAITIGNSSLTEVPLSLR 232
+ P+ S L +G A ++ TP+L + YY+ L++ITIG VP+ +
Sbjct: 171 SPPSAKSFLFLGSSAALRGHDVVSTPILHGDHLDQTLYYVDLQSITIGG-----VPVVVY 225
Query: 233 EFDSQGNGGL--------LVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGF 284
+ +S N + ++DSGTTYT L P Y + ++ + + G
Sbjct: 226 DKESGHNTSVGPFLANKTVIDSGTTYTLLTPPVYEAMRKSIEEQVIL----PTLGNSAGL 281
Query: 285 DLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMD 344
DLC+ ++ T FPS+TF+F N V LVLP N F S V CL SMD
Sbjct: 282 DLCFN----SSGDTSYGFPSVTFYFANQVQLVLPFENIFQV-----TSRDVVCL---SMD 329
Query: 345 DGDYGPSGVFGSFQQQNVEVVYDLEKERIGF 375
G + G+ QQQN ++YDL +I F
Sbjct: 330 SSG-GDLSIIGNMQQQNFHILYDLVASQISF 359
>gi|356546370|ref|XP_003541599.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 434
Score = 114 bits (286), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 121/394 (30%), Positives = 181/394 (45%), Gaps = 74/394 (18%)
Query: 3 QVY--MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNI 60
Q+Y +DTGSD+ W+ C C C N+ F PS+S++ +S+ C ++
Sbjct: 98 QLYGIIDTGSDMIWLQCK----PCEKC----YNQTTRIFDPSKSNTYKILPFSSTTCQSV 149
Query: 61 H----SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGS 116
SSDN R + YG+G G L+ +TL + GS
Sbjct: 150 EDTSCSSDN---------------------RKMCEYTIYYGDGSYSQGDLSVETLTL-GS 187
Query: 117 SPGIIREIPKFCFGC----VGSTYREPIGIAGFGRGALSVPSQL----GFLQKGFSHCFL 168
+ G + + GC S + GI G G G +S+ +QL + + FS+C
Sbjct: 188 TNGSSVKFRRTVIGCGRNNTVSFEGKSSGIVGLGNGPVSLINQLRRRSSSIGRKFSYCL- 246
Query: 169 AFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPN-YYYIGLEAITIGNSSLTEV 227
A+ NISS L GD A+ S D TP++ P +YY+ LEA ++GN+ +
Sbjct: 247 ----ASMSNISSKLNFGDAAVVSGDGTVSTPIVTHD--PKVFYYLTLEAFSVGNNRIEFT 300
Query: 228 PLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLC 287
S R F +GN +++DSGTT T LP YS+L S + + + R K+ ++ LC
Sbjct: 301 SSSFR-FGEKGN--IIIDSGTTLTLLPNDIYSKLESAV-ADLVELDRVKDPLKQ--LSLC 354
Query: 288 YRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGD 347
YR +TF + P I HF + + L N F + V CL F S
Sbjct: 355 YR-----STFDELNAPVIMAHF-SGADVKLNAVNTFIEVE-----QGVTCLAFIS---SK 400
Query: 348 YGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
GP +FG+ QQN V YDL+K+ + F+P DC+
Sbjct: 401 IGP--IFGNMAQQNFLVGYDLQKKIVSFKPTDCS 432
>gi|302786560|ref|XP_002975051.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
gi|300157210|gb|EFJ23836.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
Length = 359
Score = 114 bits (286), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 114/391 (29%), Positives = 172/391 (43%), Gaps = 64/391 (16%)
Query: 1 VIQVYMDTGSDLTWVPCGNLSFDCMDCD-DYRNNKLMSNFSPSRSSSSSRDTCASSFCLN 59
+I +DTGSDL W+ C N C CD D+ + F SSS + C S+ C
Sbjct: 17 LIPAMIDTGSDLVWLKCDN----CDHCDLDHHGETI---FFSDASSSYKKLPCNSTHC-- 67
Query: 60 IHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRD--TLKVHGSS 117
MS + + TC + Y YG+G +G + D + + HG+
Sbjct: 68 ----------SGMSSAGIGPRCEETC-----KYKYEYGDGSRTSGDVGSDRISFRSHGAG 112
Query: 118 PGIIREIPKFCFGC---VGSTYREPIGIAGFGRGALSVPSQLG-FLQKGFSHCFLAFKYA 173
F FGC + + G+ G G+ + S+ QLG L FS+C ++ Y
Sbjct: 113 EDHRSFFDGFLFGCGRKLKGDWNFTQGLIGLGQKSHSLIQQLGDKLGYKFSYCLVS--YD 170
Query: 174 NDPNISSPLVIGDVAISSKDNLQFTPMLKSP-MYPNYYYIGLEAITIGNSSLTEVPLSLR 232
+ P+ S L +G A ++ TP+L + YY+ L++IT+G VP+ +
Sbjct: 171 SPPSAKSFLFLGSSAALRGHDVVSTPILHGDHLDQTLYYVDLQSITVGG-----VPVVVY 225
Query: 233 EFDSQGNGGL--------LVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGF 284
+ +S N + ++DSGTTYT L P Y + ++ + + G
Sbjct: 226 DKESGHNTSVGPFLANKTVIDSGTTYTLLTPPVYEAMRKSIEEQVIL----PTLGNSAGL 281
Query: 285 DLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMD 344
DLC+ ++ T FPS+TF+F N V LVLP N F S V CL SMD
Sbjct: 282 DLCFN----SSGDTSYGFPSVTFYFANQVQLVLPFENIFQV-----TSRDVVCL---SMD 329
Query: 345 DGDYGPSGVFGSFQQQNVEVVYDLEKERIGF 375
G + G+ QQQN ++YDL +I F
Sbjct: 330 SSG-GDLSIIGNMQQQNFHILYDLVASQISF 359
>gi|414589630|tpg|DAA40201.1| TPA: hypothetical protein ZEAMMB73_629620 [Zea mays]
Length = 443
Score = 114 bits (286), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 110/383 (28%), Positives = 160/383 (41%), Gaps = 57/383 (14%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGSDL W C C+ C D + F P+RS++ CAS C
Sbjct: 107 LDTGSDLIWTQCA----PCLLCVD----QPTPYFDPARSATYRSLGCASPAC-------- 150
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
+ L C + + Y YG+ G+L +T + + +P
Sbjct: 151 ------------NALYYPLCYQKVCVYQYFYGDSASTAGVLANETFTFGTNETRV--SLP 196
Query: 126 KFCFGC---VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSPL 182
FGC + G+ GFGRG+LS+ SQLG FS+C +F + S L
Sbjct: 197 GISFGCGNLNAGSLANGSGMVGFGRGSLSLVSQLG--SPRFSYCLTSFLSP----VPSRL 250
Query: 183 VIGDVAI-----SSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
G A +S + +Q TP + +P P Y++ + I++G L P D+
Sbjct: 251 YFGVYATLNSTNASSEPVQSTPFVVNPALPTMYFLNMTGISVGGYLLPIDPAVFAINDTD 310
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTF 297
G GG ++DSGTT T+L EP Y + + S IT V + + D C++ P P
Sbjct: 311 GTGGTIIDSGTTITYLAEPAYDAVRAAFASQITL--PLLNVTDASVLDTCFQWPPPPRQS 368
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSF 357
P + HF + LP N Y + PS + CL S D + GS+
Sbjct: 369 VT--LPQLVLHF-DGADWELPLQN--YMLVDPSTGGGL-CLAMASSSD-----GSIIGSY 417
Query: 358 QQQNVEVVYDLEKERIGFQPMDC 380
Q QN V+YDLE + F P C
Sbjct: 418 QHQNFNVLYDLENSLMSFVPAPC 440
>gi|255543963|ref|XP_002513044.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223548055|gb|EEF49547.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 431
Score = 114 bits (286), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 103/380 (27%), Positives = 162/380 (42%), Gaps = 67/380 (17%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
MDT +D W+PC C+ C + F+ +S++ C + C +
Sbjct: 113 MDTSNDAAWIPCSG----CVGCSS-------TVFNNVKSTTFKTVGCEAPQCKQV----- 156
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
P C S C+ F TYG + L++D + + S IP
Sbjct: 157 PNSKCGGSACA---------------FNMTYGSSSIAAN-LSQDVVTLATDS------IP 194
Query: 126 KFCFGCV----GSTYREPIGIAGFGRGALSVPSQLGFL-QKGFSHCFLAFKYANDPNISS 180
+ FGC+ GS+ P G+ G GRG +S+ SQ L Q FS+C +F+ N S
Sbjct: 195 SYTFGCLTEATGSSI-PPQGLLGLGRGPMSLLSQTQNLYQSTFSYCLPSFRSLN---FSG 250
Query: 181 PLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNG 240
L +G V ++ TP+LK+P + YY+ L AI +G + ++P S F+
Sbjct: 251 SLRLGPVG--QPKRIKTTPLLKNPRRSSLYYVNLMAIRVGRR-VVDIPPSALAFNPTTGA 307
Query: 241 GLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDD 300
G + DSGT +T L P Y+ + + + V GFD CY P
Sbjct: 308 GTIFDSGTVFTRLVAPAYTAVRDAFRKRVGN----ATVTSLGGFDTCYTSPI-------- 355
Query: 301 LFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQ 360
+ P+ITF F + +++ LP N S +S++ CL + D V + QQQ
Sbjct: 356 VAPTITFMF-SGMNVTLPPDNLLIH----STASSITCLAMAAAPDNVNSVLNVIANMQQQ 410
Query: 361 NVEVVYDLEKERIGFQPMDC 380
N +++D+ R+G C
Sbjct: 411 NHRILFDVPNSRLGVAREPC 430
>gi|388504358|gb|AFK40245.1| unknown [Medicago truncatula]
Length = 480
Score = 114 bits (285), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 115/387 (29%), Positives = 174/387 (44%), Gaps = 59/387 (15%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ V +DTGSDLTWV C CM C + F+PS SSS + C SS C N+
Sbjct: 144 MTVIIDTGSDLTWVQCD----PCMSCYSQQG----PVFNPSNSSSYNSLLCNSSTCQNLQ 195
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
+ + C + S+C + +YG+G G L + L G S
Sbjct: 196 FTTGNTEACESNN-------PSSC-----NHTVSYGDGSFTDGELGVEHLSFGGIS---- 239
Query: 122 REIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPN 177
+ F FGC + + GI G GR LS+ SQ G FS+C D
Sbjct: 240 --VSNFVFGCGRNNKGLFGGVSGIMGLGRSNLSMISQTNTTFGGVFSYCL----PTTDSG 293
Query: 178 ISSPLVIGDVAISSKDN--LQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFD 235
S LVIG+ + K+ + +T M+ +P N+Y + L I +G ++ +
Sbjct: 294 ASGSLVIGNESSLFKNLTPIAYTSMVSNPQLSNFYVLNLTGIDVGGVAIQDTSF------ 347
Query: 236 SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNN 295
GNGG+L+DSGT T L Y+ L + + YP A + + D C+ N
Sbjct: 348 --GNGGILIDSGTVITRLAPSLYNALKAEFLKQFSGYPIAPAL---SILDTCF-----NL 397
Query: 296 TFTDDL-FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVF 354
T +++ P+++ HF NNV L + Y P + S V CL S+ D + +
Sbjct: 398 TGIEEVSIPTLSMHFENNVDLNVDAVGILYM---PKDGSQV-CLALASLSDEN--DMAII 451
Query: 355 GSFQQQNVEVVYDLEKERIGFQPMDCA 381
G++QQ+N V+YD ++ +IGF DC+
Sbjct: 452 GNYQQRNQRVIYDAKQSKIGFAREDCS 478
>gi|357456413|ref|XP_003598487.1| Peptidase A1, putative [Medicago truncatula]
gi|355487535|gb|AES68738.1| Peptidase A1, putative [Medicago truncatula]
Length = 414
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 112/373 (30%), Positives = 166/373 (44%), Gaps = 64/373 (17%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
MDT +D W+PC C C + F+P +S++ +CA+ C +
Sbjct: 95 MDTSNDAAWIPCT----ACDGCAS-------TLFAPEKSTTFKNVSCAAPECKQV----- 138
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
P C +S C+ F TYG + L +DT+ + + P +P
Sbjct: 139 PNPGCGVSSCN---------------FNLTYGSSSIAAN-LVQDTITL-ATDP-----VP 176
Query: 126 KFCFGCVGSTY---REPIGIAGFGRGALSVPSQLGFL-QKGFSHCFLAFKYANDPNISSP 181
+ FGCV T P G+ G GRG LS+ SQ L Q FS+C +FK N S
Sbjct: 177 SYTFGCVSKTTGTSAPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLN---FSGS 233
Query: 182 LVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGG 241
L +G VA +++TP+LK+P + YY+ LEAI +G + ++P + F+ G
Sbjct: 234 LRLGPVA--QPKRIKYTPLLKNPRRSSLYYVNLEAIRVGR-KVVDIPPAALAFNPTTGAG 290
Query: 242 LLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDL 301
+ DSGT +T L P Y + + + P+ V GFD CY VP +
Sbjct: 291 TIFDSGTVFTRLVAPVYVAVRDEFRRRVG--PKL-TVTSLGGFDTCYNVPI--------V 339
Query: 302 FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQN 361
P+ITF F +++ LPQ N +A S + CL D V + QQQN
Sbjct: 340 VPTITFIF-TGMNVTLPQDNILIHSTAGSTT----CLAMAGAPDNVNSVLNVIANMQQQN 394
Query: 362 VEVVYDLEKERIG 374
V+YD+ R+G
Sbjct: 395 HRVLYDVPNSRVG 407
>gi|18391062|ref|NP_563851.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|2160166|gb|AAB60729.1| F21M12.13 gene product [Arabidopsis thaliana]
gi|21593996|gb|AAM65914.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|26983826|gb|AAN86165.1| unknown protein [Arabidopsis thaliana]
gi|332190367|gb|AEE28488.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 449
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 98/381 (25%), Positives = 161/381 (42%), Gaps = 55/381 (14%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DT +D W+PC C C + + ++ S + S S C + L
Sbjct: 119 MVLDTSNDAVWLPCSG----CSGCSNASTSFNTNSSSTYSTVSCSTAQCTQARGLT---- 170
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
C S+ S C SF +YG + L +DTL + +P +I
Sbjct: 171 -----------CPSSSPQPSVC-----SFNQSYGGDSSFSASLVQDTLTL---APDVI-- 209
Query: 124 IPKFCFGCVGSTYRE---PIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNIS 179
P F FGC+ S P G+ G GRG +S+ SQ L G FS+C +F+ S
Sbjct: 210 -PNFSFGCINSASGNSLPPQGLMGLGRGPMSLVSQTTSLYSGVFSYCLPSFRSFY---FS 265
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
L +G + ++++TP+L++P P+ YY+ L +++G+ + P+ L FD+
Sbjct: 266 GSLKLG--LLGQPKSIRYTPLLRNPRRPSLYYVNLTGVSVGSVQVPVDPVYL-TFDANSG 322
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTD 299
G ++DSGT T +P Y + + + FD C+ +
Sbjct: 323 AGTIIDSGTVITRFAQPVYEAIRDEFRKQV----NVSSFSTLGAFDTCFSAD------NE 372
Query: 300 DLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQ 359
++ P IT H + ++ L LP N SA + CL + V + QQ
Sbjct: 373 NVAPKITLH-MTSLDLKLPMENTLIHSSA----GTLTCLSMAGIRQNANAVLNVIANLQQ 427
Query: 360 QNVEVVYDLEKERIGFQPMDC 380
QN+ +++D+ RIG P C
Sbjct: 428 QNLRILFDVPNSRIGIAPEPC 448
>gi|302757235|ref|XP_002962041.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
gi|300170700|gb|EFJ37301.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
Length = 367
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 116/392 (29%), Positives = 178/392 (45%), Gaps = 62/392 (15%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGSDL W+ C C C Y + + + PS SS+ ++ +C++S C ++ +
Sbjct: 21 VDTGSDLVWIQCK----PCSQC--YSQSDPI--YDPSASSTFAKTSCSTSSCQSLPA--- 69
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
SGCS S TC + Y YG+ G +TL + SS G + P
Sbjct: 70 -------SGCSSSA---KTCI-----YGYQYGDSSSTQGDFALETLTLR-SSGGSSKAFP 113
Query: 126 KFCFGC---VGSTYREPIGIAGFGRGALSVPSQLG-FLQKGFSHCFLAFKYANDPNISSP 181
F FGC ++ GI G G+G +S+ +QLG + FS+C + F +D + +SP
Sbjct: 114 NFQFGCGRLNSGSFGGAAGIVGLGQGKISLSTQLGSAINNKFSYCLVDFD--DDSSKTSP 171
Query: 182 LVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSL-------------TEVP 228
L+ G A + + TP++ + YY++GLE I++G L ++
Sbjct: 172 LIFGSSASTGSGAIS-TPIIPNSGRSTYYFVGLEGISVGGKQLSLATRAIDFLSVRSKKK 230
Query: 229 LSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCY 288
L +R + +GG + DSGTT T L + YS++ S S+++ P +GFDLCY
Sbjct: 231 LRVRALEVN-SGGTIFDSGTTLTLLDDAVYSKVKSAFASSVS-LPTVD--ASSSGFDLCY 286
Query: 289 RVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDY 348
V N FP++T F PQ N+F + + V CL +M
Sbjct: 287 DVSKSKNF----KFPALTLAF-KGTKFSPPQKNYFVIV---DTAETVACL---AMGGSGS 335
Query: 349 GPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
G+ G+ QQN VVYD I P C
Sbjct: 336 LGLGIIGNLMQQNYHVVYDRGTSTISMSPAQC 367
>gi|293335828|ref|NP_001170221.1| uncharacterized protein LOC100384173 precursor [Zea mays]
gi|224034427|gb|ACN36289.1| unknown [Zea mays]
Length = 443
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 110/383 (28%), Positives = 159/383 (41%), Gaps = 57/383 (14%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGSDL W C C+ C D + F P+RS++ CAS C
Sbjct: 107 LDTGSDLIWTQCA----PCLLCVD----QPTPYFDPARSATYRSLGCASPAC-------- 150
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
+ L C + + Y YG+ G+L +T + + +P
Sbjct: 151 ------------NALYYPLCYQKVCVYQYFYGDSASTAGVLANETFTFGTNETRV--SLP 196
Query: 126 KFCFGC---VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSPL 182
FGC G+ GFGRG+LS+ SQLG FS+C +F + S L
Sbjct: 197 GISFGCGNLNAGLLANGSGMVGFGRGSLSLVSQLG--SPRFSYCLTSFLSP----VPSRL 250
Query: 183 VIGDVAI-----SSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
G A +S + +Q TP + +P P Y++ + I++G L P D+
Sbjct: 251 YFGVYATLNSTNASSEPVQSTPFVVNPALPTMYFLNMTGISVGGYLLPIDPAVFAINDTD 310
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTF 297
G GG ++DSGTT T+L EP Y + + S IT V + + D C++ P P
Sbjct: 311 GTGGTIIDSGTTITYLAEPAYDAVRAAFASQITL--PLLNVTDASVLDTCFQWPPPPRQS 368
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSF 357
P + HF + LP N Y + PS + CL S D + GS+
Sbjct: 369 VT--LPQLVLHF-DGADWELPLQN--YMLVDPSTGGGL-CLAMASSSD-----GSIIGSY 417
Query: 358 QQQNVEVVYDLEKERIGFQPMDC 380
Q QN V+YDLE + F P C
Sbjct: 418 QHQNFNVLYDLENSLMSFVPAPC 440
>gi|224072901|ref|XP_002303933.1| predicted protein [Populus trichocarpa]
gi|222841365|gb|EEE78912.1| predicted protein [Populus trichocarpa]
Length = 370
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 104/381 (27%), Positives = 163/381 (42%), Gaps = 67/381 (17%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+D D W+PC C+ C + F+ +S++ C + C + N
Sbjct: 52 LDNSYDAAWIPCKG----CVGCSS-------TVFNTVKSTTFKTLGCGAPQCKQV---PN 97
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
P C S C+ +T TYG +++ LTRDT+ + + +P
Sbjct: 98 PI--CGGSTCTWNT---------------TYGSSTILSN-LTRDTIALS------MDPVP 133
Query: 126 KFCFGCV----GSTYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNISS 180
+ FGC+ GS+ P G+ GFGRG LS SQ L K FS+C +F+ N S
Sbjct: 134 YYAFGCIQKATGSSV-PPQGLLGFGRGPLSFLSQTQNLYKSTFSYCLPSFRTLN---FSG 189
Query: 181 PLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNG 240
L +G V ++ TP+LK+P + YY+ L I +G + ++P S F+
Sbjct: 190 SLRLGPVG--QPPRIKTTPLLKNPRRSSLYYVKLNGIRVGRK-IVDIPRSALAFNPTTGA 246
Query: 241 GLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDD 300
G + DSGT +T L P Y + + + + V GFD CY VP
Sbjct: 247 GTIFDSGTVFTRLVAPAYIAVRNEFRKRVG----NATVSSLGGFDTCYSVPI-------- 294
Query: 301 LFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQ 360
+ P+ITF F + +++ +P N S + CL + D V S QQQ
Sbjct: 295 VPPTITFMF-SGMNVTMPPENLLIH----STAGVTSCLAMAAAPDNVNSVLNVIASMQQQ 349
Query: 361 NVEVVYDLEKERIGFQPMDCA 381
N +++D+ R+G C+
Sbjct: 350 NHRILFDVPNSRLGVAREQCS 370
>gi|413944032|gb|AFW76681.1| hypothetical protein ZEAMMB73_606599 [Zea mays]
Length = 315
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 91/298 (30%), Positives = 137/298 (45%), Gaps = 35/298 (11%)
Query: 95 TYGEGGLVTGILTRDTLKVHGSSPGIIREIPKFCFGC----VGSTYREPIGIAGFGRGAL 150
+YG+ + G + +DT S G+ + + FGC G GIAGFGRG
Sbjct: 38 SYGDRSITAGHIFKDTFTFM-SPNGVPVAVSELAFGCGDYNTGLFVSNESGIAGFGRGPQ 96
Query: 151 SVPSQLGFLQKGFSHCFLAFKYANDPNISSPLVIGD------VAISSKDNLQFTPMLKSP 204
S+PSQL + FS+C + SS +++G + + Q TP++ +P
Sbjct: 97 SLPSQLKVGR--FSYCLTLVTESK----SSVVILGTPPDPDGLRAHTTGPFQSTPIIYNP 150
Query: 205 MYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLS- 263
+ P +YY+ LE IT+G + L S+ G+GG ++DSGT+ T LPE + L
Sbjct: 151 LIPTFYYLSLEGITVGKTRL-PFDKSVFALKKDGSGGTVIDSGTSLTTLPEAVFELLQEE 209
Query: 264 -ILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNH 322
+ Q + Y EV +R LC+R P P + H L + LP+ N+
Sbjct: 210 LVAQFPLPRYDNTPEVGDR----LCFRRPKGGKQVP---VPKLILH-LAGADMDLPRDNY 261
Query: 323 FYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
F S V CL +D + G+FQQQN+ VVYD+E ++ F P C
Sbjct: 262 FVE----EPDSGVMCLQINGAEDTTMV---LIGNFQQQNMHVVYDVENNKLLFAPAQC 312
>gi|356504173|ref|XP_003520873.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 461
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 104/384 (27%), Positives = 168/384 (43%), Gaps = 57/384 (14%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + +DTGSD+ W+ C C C ++ F P++S + + C + C +
Sbjct: 131 VYMVLDTGSDVVWLQCA----PCRKCYTQTDHV----FDPTKSRTYAGIPCGAPLCRRL- 181
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
D+P GCS K+ C+ + +YG+G G + +TL +
Sbjct: 182 --DSP-------GCSN----KNKVCQ----YQVSYGDGSFTFGDFSTETLTFRRN----- 219
Query: 122 REIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGF-LQKGFSHCFLAFKYANDPN 177
+ + GC + G+ G GRG LS P Q G FS+C + + P
Sbjct: 220 -RVTRVALGCGHDNEGLFTGAAGLLGLGRGRLSFPVQTGRRFNHKFSYCLVDRSASAKP- 277
Query: 178 ISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
S ++ GD A+S FTP++K+P +YY+ L I++G + + + SL D+
Sbjct: 278 --SSVIFGDSAVSR--TAHFTPLIKNPKLDTFYYLELLGISVGGAPVRGLSASLFRLDAA 333
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTF 297
GNGG+++DSGT+ T L P Y L + ++ RA E FD C+ + +
Sbjct: 334 GNGGVIIDSGTSVTRLTRPAYIALRDAFRIGASHLKRAPEFSL---FDTCFDL----SGL 386
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSF 357
T+ P++ HF + LP N+ P ++S C F G + G+
Sbjct: 387 TEVKVPTVVLHF-RGADVSLPATNYLI----PVDNSGSFCFAFAGTMSG----LSIIGNI 437
Query: 358 QQQNVEVVYDLEKERIGFQPMDCA 381
QQQ + YDL R+GF P C
Sbjct: 438 QQQGFRISYDLTGSRVGFAPRGCV 461
>gi|357131735|ref|XP_003567490.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 458
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 108/391 (27%), Positives = 167/391 (42%), Gaps = 56/391 (14%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ V +D +D WVPC C+ C ++ +F P++SS+ C + C +
Sbjct: 113 LLVAIDPSNDAAWVPCS----ACLGCAPGASSP---SFDPTQSSTYRPVRCGAPQCAQVP 165
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
P P +G S +F +Y L +L +D L + S+ +
Sbjct: 166 ----PATPSCPAGPGASC-----------AFNLSYASSTL-HAVLGQDALSLSDSNGAAV 209
Query: 122 REIPKFCFGCV-------GSTYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYA 173
+ + FGC+ GS P G+ GFGRG LS SQ FS+C ++K +
Sbjct: 210 PDD-HYTFGCLRVVTGSGGSV--PPQGLVGFGRGPLSFLSQTKATYGSIFSYCLPSYKSS 266
Query: 174 NDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLRE 233
N S L +G ++ TP+L +P P+ YY+ + + + N +P S
Sbjct: 267 N---FSGTLRLGPAG--QPRRIKTTPLLSNPHRPSLYYVAMVGVRV-NGKAVPIPASALA 320
Query: 234 FDSQ-GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPC 292
D+ G GG +VD+GT +T L P Y+ L + + ++ A GFD CY V
Sbjct: 321 LDAATGRGGTIVDAGTMFTRLSPPAYAALRNAFRRGVS----APAAPALGGFDTCYYV-- 374
Query: 293 PNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQS-MDDGDYGPS 351
N T + P++ F F + LP+ N + S S V CL + DG
Sbjct: 375 -NGTKS---VPAVAFVFAGGARVTLPEENVVIS----STSGGVACLAMAAGPSDGVNAGL 426
Query: 352 GVFGSFQQQNVEVVYDLEKERIGFQPMDCAS 382
V S QQQN VV+D+ R+GF C +
Sbjct: 427 NVLASMQQQNHRVVFDVGNGRVGFSRELCTA 457
>gi|21450872|gb|AAK44106.2|AF370291_1 unknown protein [Arabidopsis thaliana]
Length = 375
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 98/381 (25%), Positives = 161/381 (42%), Gaps = 55/381 (14%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DT +D W+PC C C + + ++ S + S S C + L
Sbjct: 45 MVLDTSNDAVWLPCSG----CSGCSNASTSFNTNSSSTYSTVSCSTAQCTQARGLT---- 96
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
C S+ S C SF +YG + L +DTL + +P +I
Sbjct: 97 -----------CPSSSPQPSVC-----SFNQSYGGDSSFSASLVQDTLTL---APDVI-- 135
Query: 124 IPKFCFGCVGSTYRE---PIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNIS 179
P F FGC+ S P G+ G GRG +S+ SQ L G FS+C +F+ S
Sbjct: 136 -PNFSFGCINSASGNSLPPQGLMGLGRGPMSLVSQTTSLYSGVFSYCLPSFRSFY---FS 191
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
L +G + ++++TP+L++P P+ YY+ L +++G+ + P+ L FD+
Sbjct: 192 GSLKLG--LLGQPKSIRYTPLLRNPRRPSLYYVNLTGVSVGSVQVPVDPVYL-TFDANSG 248
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTD 299
G ++DSGT T +P Y + + + FD C+ +
Sbjct: 249 AGTIIDSGTVITRFAQPVYEAIRDEFRKQV----NVSSFSTLGAFDTCFSAD------NE 298
Query: 300 DLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQ 359
++ P IT H + ++ L LP N SA + CL + V + QQ
Sbjct: 299 NVAPKITLH-MTSLDLKLPMENTLIHSSA----GTLTCLSMAGIRQNANAVLNVIANLQQ 353
Query: 360 QNVEVVYDLEKERIGFQPMDC 380
QN+ +++D+ RIG P C
Sbjct: 354 QNLRILFDVPNSRIGIAPEPC 374
>gi|449527515|ref|XP_004170756.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial [Cucumis
sativus]
Length = 364
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 103/381 (27%), Positives = 156/381 (40%), Gaps = 63/381 (16%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DT +D W+PC C+ C + FS +SSS C S C +
Sbjct: 43 LDTSNDAAWIPCSG----CIGCPS------TTVFSSDKSSSFRPLPCQSPQCNQV----- 87
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
P C+ S C F TYG V L +D L + S +P
Sbjct: 88 PNPSCSGSACG---------------FNLTYGSS-TVAADLVQDNLTLATDS------VP 125
Query: 126 KFCFGCV----GSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSP 181
+ FGC+ GS+ + + Q FS+C +FK N S
Sbjct: 126 SYTFGCIRKATGSSVPPQGLLGLGRGPLSLLGQSQSLYQSTFSYCLPSFKSVN---FSGS 182
Query: 182 LVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGG 241
L +G VA +++TP+L++P + YY+ L +I +G + ++P S F+S G
Sbjct: 183 LRLGPVA--QPIRIKYTPLLRNPRRSSLYYVNLISIRVGRK-IVDIPPSALAFNSATGAG 239
Query: 242 LLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDL 301
++DSGTT+T L P Y+ + + + R V GFD CY VP +
Sbjct: 240 TVIDSGTTFTRLVAPAYTAVRDEFRRRVG---RNVTVSSLGGFDTCYTVPI--------I 288
Query: 302 FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQN 361
P+ITF F +++ LP N S S + CL + D V S QQQN
Sbjct: 289 SPTITFMFAG-MNVTLPPDNFLIH----STSGSTTCLAMAAAPDNVNSVLNVIASMQQQN 343
Query: 362 VEVVYDLEKERIGFQPMDCAS 382
+++D+ R+G C+S
Sbjct: 344 HRILFDIPNSRVGVARESCSS 364
>gi|326520291|dbj|BAK07404.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 464
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 109/396 (27%), Positives = 167/396 (42%), Gaps = 59/396 (14%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ +++DTGSDL W C C C D + + F S S + SR C+ C H
Sbjct: 108 VVLHLDTGSDLVWTQCA-----CTVCFD----QPVPVFRASVSHTFSRVPCSDPLC--GH 156
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHG-SSPGI 120
+ P +SGC+ R C +AY Y + + TG + DT
Sbjct: 157 AVYLP-----LSGCAARD-------RSC-FYAYGYMDHSITTGKMAEDTFTFKAPDRADT 203
Query: 121 IREIPKFCFGCVGSTYR----EPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDP 176
+P FGC Y GIAGFG G LS+PSQL + FS+CF A + +
Sbjct: 204 AAAVPNIRFGCGMMNYGLFTPNQSGIAGFGTGPLSLPSQLKV--RRFSYCFTAMEESR-- 259
Query: 177 NISSPLVIG----DVAISSKDNLQFTPMLKSPM-----YPNYYYIGLEAITIGNSSLTEV 227
SP+++G ++ + +Q TP P +Y++ L +T+G T +
Sbjct: 260 --VSPVILGGEPENIEAHATGPIQSTPFAPGPAGAPVGSQPFYFLSLRGVTVGE---TRL 314
Query: 228 PLSLREF--DSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFD 285
P + F G+GG +DSGT T P+ + L + + P AK +
Sbjct: 315 PFNASTFALKGDGSGGTFIDSGTAITFFPQAVFRSLREAFVAQVPL-PVAKGYTDPDNL- 372
Query: 286 LCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVK-CLLFQSMD 344
LC+ VP P + H L LP+ N+ + + K C++ S
Sbjct: 373 LCFSVPAKKKA---PAVPKLILH-LEGADWELPRENYVLDNDDDGSGAGRKLCVVILSAG 428
Query: 345 DGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
+ + + G+FQQQN+ +VYDLE ++ F P C
Sbjct: 429 NSN---GTIIGNFQQQNMHIVYDLESNKMVFAPARC 461
>gi|357487631|ref|XP_003614103.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355515438|gb|AES97061.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 431
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 114/385 (29%), Positives = 165/385 (42%), Gaps = 70/385 (18%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGSD+ W+ C C C Y+ + F+PS+SSS C+S+ C ++
Sbjct: 104 VDTGSDIVWLQCK----PCEQC--YKQTTPI--FNPSKSSSYKNIPCSSNLCQSVR---- 151
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFA-YTYGEGGLVTGILTRDTLKVHGSSPGIIREI 124
+ C K C +F+ +Y +G L LT D+ H S
Sbjct: 152 -YTSCN----------KQNSCEYTINFSDQSYSQGELSVETLTLDSTTGHSVS------F 194
Query: 125 PKFCFGCVGSTYR-----EPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNI 178
PK GC G R E GI G G G +S+ +QL G FS+C L D N
Sbjct: 195 PKTVIGC-GHNNRGMFQGETSGIVGLGIGPVSLTTQLKSSIGGKFSYCLLPLLV--DSNK 251
Query: 179 SSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQG 238
+S L GD A+ S D + TP +K +YY+ LEA ++GN + D
Sbjct: 252 TSKLNFGDAAVVSGDGVVSTPFVKKDPQA-FYYLTLEAFSVGNKRI-----EFEVLDDSE 305
Query: 239 NGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEE-RTGFDLCYRVPCPNNTF 297
G +++DSGTT T LP Y+ L +S + + V++ +LCY +
Sbjct: 306 EGNIILDSGTTLTLLPSHVYTNL----ESAVAQLVKLDRVDDPNQLLNLCYSI------- 354
Query: 298 TDDL--FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFG 355
T D FP IT HF + P + + V CL F S G +FG
Sbjct: 355 TSDQYDFPIITAHFKGADIKLNPISTFAHV------ADGVVCLAFTSSQTGP-----IFG 403
Query: 356 SFQQQNVEVVYDLEKERIGFQPMDC 380
+ Q N+ V YDL++ + F+P DC
Sbjct: 404 NLAQLNLLVGYDLQQNIVSFKPSDC 428
>gi|357457681|ref|XP_003599121.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355488169|gb|AES69372.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 439
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 123/387 (31%), Positives = 172/387 (44%), Gaps = 59/387 (15%)
Query: 3 QVY--MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNI 60
Q+Y +DTGSD W C C C N+ F+PS+SS+ C+S C
Sbjct: 102 QLYGVVDTGSDGIWFQCK----PCKPCL----NQTSPIFNPSKSSTYKNIRCSSPICKR- 152
Query: 61 HSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
T S R C + TY + G +++DTL ++ S+ G
Sbjct: 153 ---------------GEKTRCSSNRKRKC-EYEITYLDRSGSQGDISKDTLTLN-SNDGS 195
Query: 121 IREIPKFCFGC--VGSTYREPI--GIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYAND 175
PK GC S E + GI GFGRG S+ SQLG G FS+C + +
Sbjct: 196 PISFPKIVIGCGHKNSLTTEGLASGIIGFGRGNFSIVSQLGSSIGGKFSYCLASL--FSK 253
Query: 176 PNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFD 235
NISS L GD+A+ S + TP+++S Y Y+ LEA ++G+ + SL D
Sbjct: 254 ANISSKLYFGDMAVVSGHGVVSTPLIQS-FYVGNYFTNLEAFSVGDHIIKLKDSSLIP-D 311
Query: 236 SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERT-GFDLCYRVPCPN 294
++GN ++DSG+T T LP YSQL + + S + + K V++ T LCY+
Sbjct: 312 NEGNA--VIDSGSTITQLPNDVYSQLETAVISMV----KLKRVKDPTQQLSLCYKT---- 361
Query: 295 NTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVF 354
T P IT HF + L N F M + V C F S P V+
Sbjct: 362 -TLKKYEVPIITAHF-RGADVKLNAFNTFIQM-----NHEVMCFAFNS----SAFPWVVY 410
Query: 355 GSFQQQNVEVVYDLEKERIGFQPMDCA 381
G+ QQN V YD K I F+P +C
Sbjct: 411 GNIAQQNFLVGYDTLKNIISFKPTNCT 437
>gi|168008816|ref|XP_001757102.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162691600|gb|EDQ77961.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 406
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 111/382 (29%), Positives = 171/382 (44%), Gaps = 54/382 (14%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ MDTGSD+ W+ C C++C Y + + F P +SS+ S C++ CLN+
Sbjct: 73 LVMDTGSDILWLQCA----PCVNC--YHQSDAI--FDPYKSSTYSTLGCSTRQCLNL--- 121
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVH---GSSPGI 120
+ T + C + YG+G TG D + ++ G +
Sbjct: 122 ------------DIGTCQANKCL-----YQVDYGDGSFTTGEFGTDDVSLNSTSGVGQVV 164
Query: 121 IREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNIS 179
+ +IP C + G+ G G+G LS P+Q+ G FS+C D
Sbjct: 165 LNKIPLGCGHDNEGYFVGAAGLLGLGKGPLSFPNQVDPQNGGRFSYCLT--DRETDSTEG 222
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
S LV G+ A+ +FTP + P +YY+ + I++G + LT +P S + DS GN
Sbjct: 223 SSLVFGEAAVPPA-GARFTPQDSNMRVPTFYYLKMTGISVGGTILT-IPTSAFQLDSLGN 280
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQS-TITYYPRAKEVEERTGFDLCYRVPCPNNTFT 298
GG+++DSGT+ T L Y+ L ++ T P A + FD CY + +
Sbjct: 281 GGVIIDSGTSVTRLQNAAYASLRDAFRAGTSDLAPTAG----FSLFDTCYDL----SGLA 332
Query: 299 DDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQ 358
P++T HF L LP N+ P ++S CL F G GPS + G+ Q
Sbjct: 333 SVDVPTVTLHFQGGTDLKLPASNYLI----PVDNSNTFCLAFA----GTTGPS-IIGNIQ 383
Query: 359 QQNVEVVYDLEKERIGFQPMDC 380
QQ V+YD ++GF P C
Sbjct: 384 QQGFRVIYDNLHNQVGFVPSQC 405
>gi|449449334|ref|XP_004142420.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 441
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 103/381 (27%), Positives = 158/381 (41%), Gaps = 63/381 (16%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DT +D W+PC C+ C + FS +SSS C S C +
Sbjct: 120 LDTSNDAAWIPCSG----CIGCPS------TTVFSSDKSSSFRPLPCQSPQCNQV----- 164
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
P C+ S C F TYG V L +D L + S +P
Sbjct: 165 PNPSCSGSACG---------------FNLTYGSS-TVAADLVQDNLTLATDS------VP 202
Query: 126 KFCFGCV----GSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSP 181
+ FGC+ GS+ + + Q FS+C +FK N S
Sbjct: 203 SYTFGCIRKATGSSVPPQGLLGLGRGPLSLLGQSQSLYQSTFSYCLPSFKSVN---FSGS 259
Query: 182 LVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGG 241
L +G VA + +++TP+L++P + YY+ L +I +G + ++P S F+S G
Sbjct: 260 LRLGPVAQPIR--IKYTPLLRNPRRSSLYYVNLISIRVGRK-IVDIPPSALAFNSATGAG 316
Query: 242 LLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDL 301
++DSGTT+T L P Y+ + + + R V GFD CY VP +
Sbjct: 317 TVIDSGTTFTRLVAPAYTAVRDEFRRRVG---RNVTVSSLGGFDTCYTVPI--------I 365
Query: 302 FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQN 361
P+ITF F +++ LP N +A S + CL + D V S QQQN
Sbjct: 366 SPTITFMFA-GMNVTLPPDNFLIHSTAGSTT----CLAMAAAPDNVNSVLNVIASMQQQN 420
Query: 362 VEVVYDLEKERIGFQPMDCAS 382
+++D+ R+G C+S
Sbjct: 421 HRILFDIPNSRVGVARESCSS 441
>gi|388495448|gb|AFK35790.1| unknown [Medicago truncatula]
Length = 434
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 99/374 (26%), Positives = 158/374 (42%), Gaps = 63/374 (16%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DT SD W+PC C+ C + F+P +S+S +C S C +
Sbjct: 114 LDTSSDAAWIPCSG----CVGCSTSKP------FAPIKSTSFRNVSCGSPHCKQV----- 158
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
P C S C+ F +TYG + ++ +DTL + IP
Sbjct: 159 PNPTCGGSACA---------------FNFTYGSSSIAASVV-QDTLTLAAD------PIP 196
Query: 126 KFCFGCV----GSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSP 181
+ FGCV GS+ + + + + FS+C +FK N S
Sbjct: 197 GYTFGCVNKTTGSSAPQQGLLGLGRGPLSLLSQSQNLYKSTFSYCLPSFKSIN---FSGS 253
Query: 182 LVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGG 241
L +G V +++TP+L++P + YY+ L AI +G + ++P + F+ G
Sbjct: 254 LRLGPVY--QPKRIKYTPLLRNPRRSSLYYVNLVAIKVGR-KIVDIPPAALAFNPTTGAG 310
Query: 242 LLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDL 301
+ DSGT +T L EP Y+ + + + + P+ V GFD CY VP +
Sbjct: 311 TIFDSGTVFTRLAEPVYTAVRNEFRRRVG--PKL-PVTTLGGFDTCYNVPI--------V 359
Query: 302 FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQN 361
P+ITF F + +++ LP N +A S + CL D V + QQQN
Sbjct: 360 VPTITFLF-SGMNVALPPDNIVIHSTAGSTT----CLAMAGAPDNVNSVLNVIANMQQQN 414
Query: 362 VEVVYDLEKERIGF 375
V++D+ RIG
Sbjct: 415 HRVLFDVPNSRIGI 428
>gi|357148754|ref|XP_003574882.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 488
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 117/402 (29%), Positives = 180/402 (44%), Gaps = 76/402 (18%)
Query: 3 QVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKL---MSNFSPSRSSSSSRDTCASSFCLN 59
V +DTGSD+ WV C C C R + L + + P SSS S +C FC
Sbjct: 97 HVQVDTGSDILWVNC----ISCNKCP--RKSDLGIDLRLYDPKGSSSGSTVSCDQKFCAA 150
Query: 60 IHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHG-SSP 118
+ P GC+ + PC ++ YG+G TG D+L+ + S
Sbjct: 151 TYGGKLP-------GCAKNI--------PC-EYSVMYGDGSSTTGYFVSDSLQYNQVSGD 194
Query: 119 GIIREI-PKFCFGC-------VGSTYREPIGIAGFGRGALSVPSQL---GFLQKGFSHCF 167
G R FGC +GST + GI GFG+ S+ SQL G ++K FSHC
Sbjct: 195 GQTRHANASVIFGCGAQQGGDLGSTNQALDGIIGFGQSNTSMLSQLAAAGEVKKIFSHCL 254
Query: 168 LAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKS-PMYPN--YYYIGLEAITIGNSSL 224
K IGDV P +KS P+ P+ +Y + LE+I +G ++L
Sbjct: 255 DTIKGGG------IFAIGDVV---------QPKVKSTPLVPDMPHYNVNLESINVGGTTL 299
Query: 225 TEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGF 284
++P + F++ G ++DSGTT T+LPE Y +L+ + + + + +
Sbjct: 300 -QLPSHM--FETGEKKGTIIDSGTTLTYLPELVYKDVLAAV------FAKHPDTTFHSVQ 350
Query: 285 D-LCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQS- 342
D LC + DD FP ITFHF +++ L + ++F+ N + C FQ+
Sbjct: 351 DFLCIQY----FQSVDDGFPKITFHFEDDLGLNVYPHDYFF-----QNGDNLYCFGFQNG 401
Query: 343 -MDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCAST 383
+ D + G N VVYDLE + +G+ +C+S+
Sbjct: 402 GLQSKDGKDMVLLGDLVLSNKVVVYDLENQVVGWTDYNCSSS 443
>gi|222637180|gb|EEE67312.1| hypothetical protein OsJ_24552 [Oryza sativa Japonica Group]
Length = 420
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 109/385 (28%), Positives = 157/385 (40%), Gaps = 74/385 (19%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V DTGSDL W C C C + F P+ SS+ S+ C SSFC +
Sbjct: 101 VVADTGSDLIWTQCA----PCTKC----FQQPAPPFQPASSSTFSKLPCTSSFCQFL--- 149
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
N C +GC + Y YG G G L +TLKV +S
Sbjct: 150 PNSIRTCNATGCV---------------YNYKYGSG-YTAGYLATETLKVGDAS------ 187
Query: 124 IPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKG---FSHCFLAFKYANDPNISS 180
P FGC S + LG L G FS+C + A +S
Sbjct: 188 FPSVAFGC-------------------STENGLGQLDLGVGRFSYCLRSGSAAG----AS 224
Query: 181 PLVIGDVAISSKDNLQFTPMLKSP-MYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
P++ G +A + N+Q TP + +P ++P+YYY+ L IT+G T++P++ F N
Sbjct: 225 PILFGSLANLTDGNVQSTPFVNNPAVHPSYYYVNLTGITVGE---TDLPVTTSTFGFTQN 281
Query: 240 G---GLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNT 296
G G +VDSGTT T+L + Y + Q+ ++ V G DLC++
Sbjct: 282 GLGGGTIVDSGTTLTYLAKDGYEM---VKQAFLSQTADVTTVNGTRGLDLCFK--STGGG 336
Query: 297 FTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGS 356
PS+ F +P +F + S S L GD P V G+
Sbjct: 337 GGGIAVPSLVLRFDGGAEYAVP--TYFAGVETDSQGSVTVACLMMLPAKGDQ-PMSVIGN 393
Query: 357 FQQQNVEVVYDLEKERIGFQPMDCA 381
Q ++ ++YDL+ F P DCA
Sbjct: 394 VMQMDMHLLYDLDGGIFSFAPADCA 418
>gi|224126551|ref|XP_002329582.1| predicted protein [Populus trichocarpa]
gi|222870291|gb|EEF07422.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 113/387 (29%), Positives = 176/387 (45%), Gaps = 63/387 (16%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDD-YRNNKLMSNFSPSRSSSSSRD-TCASSFCLN 59
I DTGSDL W C CD Y+ ++ F P +SS + RD +C + C N
Sbjct: 106 ILAIADTGSDLIWT-------QCTPCDKCYK--QIAPLFDP-KSSKTYRDLSCDTRQCQN 155
Query: 60 IHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPG 119
+ S S CS L + ++Y YG+ G L DT+ + ++ G
Sbjct: 156 LGES---------SSCSSEQLCQ---------YSYYYGDRSFTNGNLAVDTVTLPSTNGG 197
Query: 120 IIREIPKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYAN 174
+ PK GC G+ ++ GI G G G +S+ SQ+G G FS+C + F +
Sbjct: 198 PVY-FPKTVIGCGRRNNGTFDKKDSGIIGLGGGPMSLISQMGSSVGGKFSYCLVPFSSES 256
Query: 175 DPNISSPLVIGDVAISSKDNLQFTPML-KSPMYPNYYYIGLEAITIGNSSLTEVPLSLRE 233
N SS L G A+ S +Q TP++ K+P +YY+ LEA+++G+ + S
Sbjct: 257 AGN-SSKLHFGRNAVVSGSGVQSTPLISKNP--DTFYYLTLEAMSVGDKKIEFGGSSFGG 313
Query: 234 FDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCP 293
+ +++DSGT+ T P F+++ + +++ + R ++ CYR P P
Sbjct: 314 SEGN----IIIDSGTSLTLFPVNFFTEFATAVENAVINGERTQDASGL--LSHCYR-PTP 366
Query: 294 NNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGV 353
D P IT HF N +VL N F +S V CL F S G +
Sbjct: 367 -----DLKVPVITAHF-NGADVVLQTLNTFILIS-----DDVLCLAFNSTQSG-----AI 410
Query: 354 FGSFQQQNVEVVYDLEKERIGFQPMDC 380
FG+ Q N + YD++ + + F+P DC
Sbjct: 411 FGNVAQMNFLIGYDIQGKSVSFKPTDC 437
>gi|413937238|gb|AFW71789.1| hypothetical protein ZEAMMB73_638381 [Zea mays]
Length = 598
Score = 112 bits (280), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 89/287 (31%), Positives = 132/287 (45%), Gaps = 37/287 (12%)
Query: 104 GILTRDTLKVHGSSPGIIREIPKFCFGC---VGSTYREPIGIAGFGRGALSVPSQ----L 156
+L +D L +H + + + FGC V P G+ GFG G LS PSQ
Sbjct: 341 ALLGQDALALHDD----VDVVAAYTFGCLRVVTGGSVPPQGLVGFGCGPLSFPSQNKDVY 396
Query: 157 GFLQKGFSHCFLAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEA 216
GF+ FS+C ++K +N SS L +G ++ TP+L +P P+ YY+ +
Sbjct: 397 GFV---FSYCLPSYKSSN---FSSTLRLGPAG--QPKRIKMTPLLSNPHRPSLYYVNMVG 448
Query: 217 ITIGNSSLTEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAK 276
I +G + VP S FD G +VD+GT +T L P Y+ + + +S + RA
Sbjct: 449 IHVGGRPML-VPASALAFDPASGRGTIVDAGTMFTRLSAPVYAAVRDVFRSRV----RAP 503
Query: 277 EVEERTGFDLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVK 336
GFD CY V P++TF F VS+ LP+ N + S+S +
Sbjct: 504 VTGPLGGFDTCYNVTIS--------VPTVTFSFDGRVSVTLPEEN----VVIRSSSDGIA 551
Query: 337 CLLFQSM-DDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCAS 382
CL + DG V S QQQN V++D+ R+GF C +
Sbjct: 552 CLAMAAGPSDGVDAVLNVLASMQQQNHRVLFDVANGRVGFSRELCTT 598
>gi|255566010|ref|XP_002523993.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536720|gb|EEF38361.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 439
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 126/387 (32%), Positives = 182/387 (47%), Gaps = 63/387 (16%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRD-TCASSFCLNI 60
I DTGSDL W C C C Y + + F P +SSS+ RD +C++ C +
Sbjct: 105 ILAIADTGSDLIWTQCK----PCDQC--YEQDAPL--FDP-KSSSTYRDISCSTKQCDLL 155
Query: 61 HSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
G S S TC ++Y+YG+ +G + DT+ + GS+ G
Sbjct: 156 KE-----------GASCSGEGNKTC-----HYSYSYGDRSFTSGNVAADTITL-GSTSGR 198
Query: 121 IREIPKFCFGCV----GSTYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYAND 175
+PK GC GS + GI G G G +S+ SQLG G FS+C + +N
Sbjct: 199 PVLLPKAIIGCGHNNGGSFTEKGSGIVGLGGGPISLISQLGSTIDGKFSYCLVPLS-SNA 257
Query: 176 PNISSPLVIGDVAISSKDNLQFTPML-KSPMYPNYYYIGLEAITIGNSSLTEVPLSLREF 234
N SS L G I S +Q TP++ K P +Y++ LEA+++G+ + S
Sbjct: 258 TN-SSKLNFGSNGIVSGGGVQSTPLISKDP--DTFYFLTLEAVSVGSERIKFPGSSFGT- 313
Query: 235 DSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTG-FDLCYRVPCP 293
S+GN +++DSGTT T PE F+S+L S +Q + P VE+ +G LCY +
Sbjct: 314 -SEGN--IIIDSGTTLTLFPEDFFSELSSAVQDAVAGTP----VEDPSGILSLCYSIDA- 365
Query: 294 NNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGV 353
D FPSIT HF + + L N F + S V C F ++ G +
Sbjct: 366 -----DLKFPSITAHF-DGADVKLNPLNTFVQV-----SDTVLCFAFNPINSG-----AI 409
Query: 354 FGSFQQQNVEVVYDLEKERIGFQPMDC 380
FG+ Q N V YDLE + + F+P DC
Sbjct: 410 FGNLAQMNFLVGYDLEGKTVSFKPTDC 436
>gi|356540371|ref|XP_003538663.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 374
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 109/384 (28%), Positives = 166/384 (43%), Gaps = 66/384 (17%)
Query: 7 DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDNP 66
DTGSDLTW C C C RN F P +S+S +C S C
Sbjct: 43 DTGSDLTWTSC----VPCNKCYKQRN----PIFDPQKSTSYRNISCDSKLCHK------- 87
Query: 67 FDPCTMSGCSLSTLLKSTCCRPCP--SFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREI 124
L + C P ++ Y Y + G+L ++T+ + S+ G +
Sbjct: 88 --------------LDTGVCSPQKHCNYTYAYASAAITQGVLAQETITLS-STKGESVPL 132
Query: 125 PKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLG--FLQKGFSHCFLAFKYANDPNI 178
FGC G +GI G G G +S SQ+G F K FS C + F D ++
Sbjct: 133 KGIVFGCGHNNTGGFNDREMGIIGLGGGPVSFISQIGSSFGGKRFSQCLVPFH--TDVSV 190
Query: 179 SSPLVIGDVAISSKDNLQFTPML-KSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
SS + +G + S + TP++ K P Y++ L I++GN+ L + S
Sbjct: 191 SSKMSLGKGSEVSGKGVVSTPLVAKQDKTP--YFVTLLGISVGNTYLH---FNGSSSQSV 245
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTF 297
G + +DSGT T LP Y +L++ ++S + P +++ G LCYR T
Sbjct: 246 EKGNVFLDSGTPPTILPTQLYDRLVAQVRSEVAMKPVTNDLD--LGPQLCYR------TK 297
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLF-QSMDDGDYGPSGVFGS 356
+ P +T HF +LP + S V CL F + DG GV+G+
Sbjct: 298 NNLRGPVLTAHFEGGDVKLLP------TQTFVSPKDGVFCLGFTNTSSDG-----GVYGN 346
Query: 357 FQQQNVEVVYDLEKERIGFQPMDC 380
F Q N + +DL+++ + F+PMDC
Sbjct: 347 FAQSNYLIGFDLDRQVVSFKPMDC 370
>gi|125561848|gb|EAZ07296.1| hypothetical protein OsI_29544 [Oryza sativa Indica Group]
Length = 448
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 98/380 (25%), Positives = 158/380 (41%), Gaps = 48/380 (12%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGSDL W C C+ C D + F P+RS++ C S C +
Sbjct: 109 VDTGSDLIWTQCA----PCVLCAD----QPTPYFRPARSATYRLVPCRSPLCAAL----- 155
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLK--VHGSSPGIIRE 123
P+ C +S C + Y YG+ G+L +T SS ++ +
Sbjct: 156 PYPAC---------FQRSVCV-----YQYYYGDEASTAGVLASETFTFGAANSSKVMVSD 201
Query: 124 IPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSPLV 183
+ C G+ G GRG LS+ SQLG FS+C +F ++ +
Sbjct: 202 VAFGCGNINSGQLANSSGMVGLGRGPLSLVSQLG--PSRFSYCLTSFLSPEPSRLNFGVF 259
Query: 184 I---GDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNG 240
G A SS +Q TP++ + P+ Y++ L+ I++G L PL + G G
Sbjct: 260 ATLNGTNASSSGSPVQSTPLVVNAALPSLYFMSLKGISLGQKRLPIDPLVF-AINDDGTG 318
Query: 241 GLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDD 300
G+ +DSGT+ T L + Y + L S + P + E G + C+ P P +
Sbjct: 319 GVFIDSGTSLTWLQQDAYDAVRHELVSVLRPLPPTNDTE--IGLETCFPWPPPPSVAVT- 375
Query: 301 LFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQ 360
P + HF ++ +P N+ A L + GD + + G++QQQ
Sbjct: 376 -VPDMELHFDGGANMTVPPENYMLI------DGATGFLCLAMIRSGD---ATIIGNYQQQ 425
Query: 361 NVEVVYDLEKERIGFQPMDC 380
N+ ++YD+ + F P C
Sbjct: 426 NMHILYDIANSLLSFVPAPC 445
>gi|115476828|ref|NP_001062010.1| Os08g0469000 [Oryza sativa Japonica Group]
gi|42407407|dbj|BAD09565.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113623979|dbj|BAF23924.1| Os08g0469000 [Oryza sativa Japonica Group]
gi|125603713|gb|EAZ43038.1| hypothetical protein OsJ_27627 [Oryza sativa Japonica Group]
Length = 448
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 98/380 (25%), Positives = 158/380 (41%), Gaps = 48/380 (12%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGSDL W C C+ C D + F P+RS++ C S C +
Sbjct: 109 VDTGSDLIWTQCA----PCVLCAD----QPTPYFRPARSATYRLVPCRSPLCAAL----- 155
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLK--VHGSSPGIIRE 123
P+ C +S C + Y YG+ G+L +T SS ++ +
Sbjct: 156 PYPAC---------FQRSVCV-----YQYYYGDEASTAGVLASETFTFGAANSSKVMVSD 201
Query: 124 IPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSPLV 183
+ C G+ G GRG LS+ SQLG FS+C +F ++ +
Sbjct: 202 VAFGCGNINSGQLANSSGMVGLGRGPLSLVSQLG--PSRFSYCLTSFLSPEPSRLNFGVF 259
Query: 184 I---GDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNG 240
G A SS +Q TP++ + P+ Y++ L+ I++G L PL + G G
Sbjct: 260 ATLNGTNASSSGSPVQSTPLVVNAALPSLYFMSLKGISLGQKRLPIDPLVF-AINDDGTG 318
Query: 241 GLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDD 300
G+ +DSGT+ T L + Y + L S + P + E G + C+ P P +
Sbjct: 319 GVFIDSGTSLTWLQQDAYDAVRRELVSVLRPLPPTNDTE--IGLETCFPWPPPPSVAVT- 375
Query: 301 LFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQ 360
P + HF ++ +P N+ A L + GD + + G++QQQ
Sbjct: 376 -VPDMELHFDGGANMTVPPENYMLI------DGATGFLCLAMIRSGD---ATIIGNYQQQ 425
Query: 361 NVEVVYDLEKERIGFQPMDC 380
N+ ++YD+ + F P C
Sbjct: 426 NMHILYDIANSLLSFVPAPC 445
>gi|24430421|dbj|BAC22609.1| 41 kD chloroplast nucleoid DNA binding protein (CND41) [Nicotiana
sylvestris]
Length = 502
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 107/391 (27%), Positives = 166/391 (42%), Gaps = 66/391 (16%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + DTGSDLTW C C Y + + F PS S + S +C S+ C +
Sbjct: 167 LSLIFDTGSDLTWTQCQPCVKSC-----YAQQQPI--FDPSASKTYSNISCTSTACSGLK 219
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
S+ C+ S C + YG+ G +DTL + +
Sbjct: 220 SATGNSPGCSSSNCV---------------YGIQYGDSSFTVGFFAKDTLTLTQNDV--- 261
Query: 122 REIPKFCFGCVGSTYR----EPIGIAGFGRGALSVPSQLG-FLQKGFSHCFLAFKYANDP 176
F FGC G R + G+ G GR LS+ Q K FS+C + +N
Sbjct: 262 --FDGFMFGC-GQNNRGLFGKTAGLIGLGRDPLSIVQQTAQKFGKYFSYCLPTSRGSN-- 316
Query: 177 NISSPLVIGD-----VAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSL 231
L G+ + + K+ + FTP S +Y+I + I++G +L+ P+
Sbjct: 317 ---GHLTFGNGNGVKTSKAVKNGITFTP-FASSQGATFYFIDVLGISVGGKALSISPMLF 372
Query: 232 REFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVP 291
+ N G ++DSGT T LP Y L S + ++ YP A + D CY +
Sbjct: 373 Q------NAGTIIDSGTVITRLPSTVYGSLKSTFKQFMSKYPTAPALSL---LDTCYDL- 422
Query: 292 CPNNTFTDDLFPSITFHFLNNVSLVL-PQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGP 350
+ +T P I+F+F N ++ L P G +N ++ CL F +GD
Sbjct: 423 ---SNYTSISIPKISFNFNGNANVDLEPNGILI------TNGASQVCLAFAG--NGDDDT 471
Query: 351 SGVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
G+FG+ QQQ +EVVYD+ ++GF C+
Sbjct: 472 IGIFGNIQQQTLEVVYDVAGGQLGFGYKGCS 502
>gi|414584783|tpg|DAA35354.1| TPA: hypothetical protein ZEAMMB73_186928 [Zea mays]
Length = 464
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 106/384 (27%), Positives = 171/384 (44%), Gaps = 68/384 (17%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +D+GSD+ WV C C++C Y + F P+ S++ S C S+ C + +S
Sbjct: 142 LVVDSGSDVIWVQCK----PCLEC--YAQADPL--FDPATSATFSAVPCGSAVCRTLRTS 193
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
GC S C + +YG+G G L +TL + G++
Sbjct: 194 ----------GCG-----DSGGC----DYEVSYGDGSYTKGALALETLTLGGTA------ 228
Query: 124 IPKFCFGCVGSTYRE----PIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNI 178
+ GC G R G+ G G G +S+ QLG G FS+C LA + A
Sbjct: 229 VEGVAIGC-GHRNRGLFVGAAGLLGLGWGPMSLVGQLGGAAGGAFSYC-LASRGAGS--- 283
Query: 179 SSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLS--LREFDS 236
LV+G + + + + P++++P P++YY+GL I +G+ L PL L +
Sbjct: 284 ---LVLGR-SEAVPEGAVWVPLVRNPQAPSFYYVGLSGIGVGDERL---PLQEDLFQLTE 336
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNT 296
G GG+++D+GT T LP+ Y+ L + + PRA V D CY + +
Sbjct: 337 DGAGGVVMDTGTAVTRLPQEAYAALRDAFVAAVGALPRAPGVSL---LDTCYDL----SG 389
Query: 297 FTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGS 356
+T P+++F+F +L LP N + + CL F GPS + G+
Sbjct: 390 YTSVRVPTVSFYFDGAATLTLPARNLLLEVDG-----GIYCLAFAPSSS---GPS-ILGN 440
Query: 357 FQQQNVEVVYDLEKERIGFQPMDC 380
QQ+ +++ D IGF P C
Sbjct: 441 IQQEGIQITVDSANGYIGFGPTTC 464
>gi|302756591|ref|XP_002961719.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
gi|300170378|gb|EFJ36979.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
Length = 357
Score = 112 bits (279), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 113/381 (29%), Positives = 167/381 (43%), Gaps = 58/381 (15%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGSD+TW+ C C C +++ + PS SSS R C S+ C +
Sbjct: 29 LDTGSDVTWIQCA----PCSSC----YSQVDPIYDPSNSSSYRRVYCGSALCQALD---- 76
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
+ C GCS + YG+ +G L ++ + +S +R I
Sbjct: 77 -YSACQGMGCS---------------YRVVYGDSSASSGDLGIESFYLGPNSSTAMRNI- 119
Query: 126 KFCFGCVGST---YREPIGIAGFGRGALSVPSQLGF-LQKGFSHCFLAFKYANDPNISSP 181
FGC S +R G+ G G G LS SQ+ + FS+C L +Y+ + SSP
Sbjct: 120 --AFGCGHSNSGLFRGEAGLLGMGGGTLSFFSQIAASIGPAFSYC-LVDRYSQLQSRSSP 176
Query: 182 LVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGG 241
L+ G AI +FTP+LK+P +YY L I++G + L +P + G GG
Sbjct: 177 LIFGRTAIPFAA--RFTPLLKNPRINTFYYAVLTGISVGGTPL-PIPPAQFALTGNGTGG 233
Query: 242 LLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDL 301
++DSGT+ T + P Y+ L ++ P A V D C+
Sbjct: 234 AILDSGTSVTRVVPPAYAVLRDAYRAASRNLPPAPGVYL---LDTCFNF----QGLPTVQ 286
Query: 302 FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLF--QSMDDGDYGPSGVFGSFQQ 359
PS+ HF N V +VLP GN + P + S CL F SM P V G+ QQ
Sbjct: 287 IPSLVLHFDNGVDMVLPGGN----ILIPVDRSGTFCLAFAPSSM------PISVIGNVQQ 336
Query: 360 QNVEVVYDLEKERIGFQPMDC 380
Q + +DL++ I P +C
Sbjct: 337 QTFRIGFDLQRSLIAIAPREC 357
>gi|413937239|gb|AFW71790.1| hypothetical protein ZEAMMB73_638381 [Zea mays]
Length = 537
Score = 112 bits (279), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 89/287 (31%), Positives = 131/287 (45%), Gaps = 37/287 (12%)
Query: 104 GILTRDTLKVHGSSPGIIREIPKFCFGC---VGSTYREPIGIAGFGRGALSVPSQ----L 156
+L +D L +H + + + FGC V P G+ GFG G LS PSQ
Sbjct: 280 ALLGQDALALHDD----VDVVAAYTFGCLRVVTGGSVPPQGLVGFGCGPLSFPSQNKDVY 335
Query: 157 GFLQKGFSHCFLAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEA 216
GF+ FS+C ++K N SS L +G ++ TP+L +P P+ YY+ +
Sbjct: 336 GFV---FSYCLPSYK---SSNFSSTLRLGPAG--QPKRIKMTPLLSNPHRPSLYYVNMVG 387
Query: 217 ITIGNSSLTEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAK 276
I +G + VP S FD G +VD+GT +T L P Y+ + + +S + RA
Sbjct: 388 IHVGGRPML-VPASALAFDPASGRGTIVDAGTMFTRLSAPVYAAVRDVFRSRV----RAP 442
Query: 277 EVEERTGFDLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVK 336
GFD CY V P++TF F VS+ LP+ N + S+S +
Sbjct: 443 VTGPLGGFDTCYNVTIS--------VPTVTFSFDGRVSVTLPEEN----VVIRSSSDGIA 490
Query: 337 CLLFQSM-DDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCAS 382
CL + DG V S QQQN V++D+ R+GF C +
Sbjct: 491 CLAMAAGPSDGVDAVLNVLASMQQQNHRVLFDVANGRVGFSRELCTT 537
>gi|222624820|gb|EEE58952.1| hypothetical protein OsJ_10633 [Oryza sativa Japonica Group]
Length = 415
Score = 111 bits (278), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 110/399 (27%), Positives = 168/399 (42%), Gaps = 108/399 (27%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+Q+ +DTGSDL W C C C D + + F PS SS+ S +C S+ C
Sbjct: 102 VQLTLDTGSDLIWTQCQ----PCPACFD----QALPYFDPSTSSTLSLTSCDSTLC---- 149
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
G +++L +S +T+ G
Sbjct: 150 -----------QGLPVASLPRSD--------KFTFVGAG--------------------- 169
Query: 122 REIPKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPN 177
+P FGC G GIAGFGRG LS+PSQL FSHCF A
Sbjct: 170 ASVPGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKV--GNFSHCFTTITGA---- 223
Query: 178 ISSPLVI---GDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREF 234
I S +++ D+ + + +Q TP++++P P +YY+ L+ IT+G+ T +P+ EF
Sbjct: 224 IPSTVLLDLPADLFSNGQGAVQTTPLIQNPANPTFYYLSLKGITVGS---TRLPVPESEF 280
Query: 235 D-SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCP 293
G GG ++DSGT T LP Y + R F ++P
Sbjct: 281 ALKNGTGGTIIDSGTAMTSLPTRVYRLV-------------------RDAFAAQVKLPVV 321
Query: 294 NNTFTDDLF------------PSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQ 341
+ TD F P + HF ++ LP+ N+ + + S++ CL
Sbjct: 322 SGNTTDPYFCLSAPLRAKPYVPKLVLHF-EGATMDLPRENYVFEVE--DAGSSILCL--- 375
Query: 342 SMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
++ +G G G+FQQQN+ V+YDL+ ++ F P C
Sbjct: 376 AIIEG--GEVTTIGNFQQQNMHVLYDLQNSKLSFVPAQC 412
>gi|357515189|ref|XP_003627883.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521905|gb|AET02359.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 111 bits (278), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 99/374 (26%), Positives = 158/374 (42%), Gaps = 63/374 (16%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DT SD W+PC C+ C + F+P +S+S +C S C +
Sbjct: 114 LDTSSDAAWIPCSG----CVGCSTSKP------FAPIKSTSFRNVSCGSPHCKQV----- 158
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
P C S C+ F +TYG + ++ +DTL + IP
Sbjct: 159 PNPTCGGSACA---------------FNFTYGSSSIAASVV-QDTLTLATD------PIP 196
Query: 126 KFCFGCV----GSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSP 181
+ FGCV GS+ + + + + FS+C +FK N S
Sbjct: 197 GYTFGCVNKTTGSSAPQQGLLGLGRGPLSLLSQSQNLYKSTFSYCLPSFKSIN---FSGS 253
Query: 182 LVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGG 241
L +G V +++TP+L++P + YY+ L AI +G + ++P + F+ G
Sbjct: 254 LRLGPVY--QPKRIKYTPLLRNPRRSSLYYVNLVAIKVGR-KIVDIPPAALAFNPTTGAG 310
Query: 242 LLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDL 301
+ DSGT +T L EP Y+ + + + + P+ V GFD CY VP +
Sbjct: 311 TIFDSGTVFTRLAEPVYTAVRNEFRRRVG--PKL-PVTTLGGFDTCYNVPI--------V 359
Query: 302 FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQN 361
P+ITF F + +++ LP N +A S + CL D V + QQQN
Sbjct: 360 VPTITFLF-SGMNVTLPPDNIVIHSTAGSTT----CLAMAGAPDNVNSVLNVIANMQQQN 414
Query: 362 VEVVYDLEKERIGF 375
V++D+ RIG
Sbjct: 415 HRVLFDVPNSRIGI 428
>gi|302809015|ref|XP_002986201.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
gi|300146060|gb|EFJ12732.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
Length = 449
Score = 111 bits (278), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 119/392 (30%), Positives = 180/392 (45%), Gaps = 58/392 (14%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFC-LNIHS 62
+ +DTGSDLTW+ C C C D F PS+S+S C ++ C L +H
Sbjct: 102 LIIDTGSDLTWLQCK----PCKACFDQSG----PVFDPSQSTSFKIIPCNAAACDLVVH- 152
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIR 122
D C + S TC + Y YG+ +G L ++L V S
Sbjct: 153 -----DECRDNSSKTS---PKTC-----KYFYWYGDSSRTSGDLALESLSVSLSDHPSSL 199
Query: 123 EIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGF--LQKGFSHCFLAFKYANDPN 177
EI GC S ++ G+ G G+GALS PSQL + + FS+C + N+ +
Sbjct: 200 EIRDMVIGCGHSNKGLFQGAGGLLGLGQGALSFPSQLRSSPIGQSFSYCLV--DRTNNLS 257
Query: 178 ISSPLVIG-DVAISSK-DNLQFTPMLKSP-MYPNYYYIGLEAITIGNSSLTEVPLSLREF 234
+SS + G A+S D ++FTP +++ +YY+G++ I I + L +P
Sbjct: 258 VSSAISFGAGFALSRHFDQMKFTPFVRTNNSVETFYYLGIQGIKI-DQELLPIPAERFAI 316
Query: 235 DSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCY----RV 290
+ G+GG ++DSGTT T+L Y + S + I+Y PRA + +CY R
Sbjct: 317 ATNGSGGTIIDSGTTLTYLNRDAYRAVESAFLARISY-PRADPFDI---LGICYNATGRA 372
Query: 291 PCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGP 350
P FP+++ F N L LPQ N+F P A CL D
Sbjct: 373 AVP--------FPALSIVFQNGAELDLPQENYFIQ---PDPQEAKHCLAILPTDG----- 416
Query: 351 SGVFGSFQQQNVEVVYDLEKERIGFQPMDCAS 382
+ G+FQQQN+ +YD++ R+GF DC++
Sbjct: 417 MSIIGNFQQQNIHFLYDVQHARLGFANTDCSA 448
>gi|242077672|ref|XP_002448772.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
gi|241939955|gb|EES13100.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
Length = 471
Score = 111 bits (277), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 100/386 (25%), Positives = 169/386 (43%), Gaps = 63/386 (16%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +D+GSD+ WV C C++C Y + F P+ S++ S +C S+ C + +S
Sbjct: 140 LVVDSGSDVIWVQCK----PCLEC--YAQADPL--FDPASSATFSAVSCGSAICRTLRTS 191
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
GC S C + +YG+G G L +TL + G++
Sbjct: 192 ----------GCG-----DSGGCE----YEVSYGDGSYTKGTLALETLTLGGTA------ 226
Query: 124 IPKFCFGCVGSTYRE----PIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYAND--P 176
+ GC G R G+ G G G +S+ QLG G FS+C + +
Sbjct: 227 VEGVAIGC-GHRNRGLFVGAAGLLGLGWGPMSLVGQLGGAAGGAFSYCLASRGGSGSGAA 285
Query: 177 NISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPL--SLREF 234
+ + LV+G + + + + P++++P P++YY+G+ I +G+ L PL L +
Sbjct: 286 DAAGSLVLGR-SEAVPEGAVWVPLVRNPQAPSFYYVGVSGIGVGDERL---PLQDGLFQL 341
Query: 235 DSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPN 294
G GG+++D+GT T LP+ Y+ L + PRA V D CY +
Sbjct: 342 TEDGGGGVVMDTGTAVTRLPQEAYAALRDAFVGAVGALPRAPGVSL---LDTCYDL---- 394
Query: 295 NTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVF 354
+ +T P+++F+F +L LP N + + CL F G +
Sbjct: 395 SGYTSVRVPTVSFYFDGAATLTLPARNLLLEVDG-----GIYCLAFAPSSSG----LSIL 445
Query: 355 GSFQQQNVEVVYDLEKERIGFQPMDC 380
G+ QQ+ +++ D IGF P C
Sbjct: 446 GNIQQEGIQITVDSANGYIGFGPATC 471
>gi|356524287|ref|XP_003530761.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 481
Score = 110 bits (276), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 111/385 (28%), Positives = 164/385 (42%), Gaps = 59/385 (15%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + DTGSDLTW C + C D + F PS+SSS TC SS C +
Sbjct: 149 LSLVFDTGSDLTWTQCEPCAGSCYKQQD-------AIFDPSKSSSYINITCTSSLCTQLT 201
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
S+ S CS ST T C + YG+ G L+++ L + +
Sbjct: 202 SAG------IKSRCSSST----TACI----YGIQYGDKSTSVGFLSQERLTITATDI--- 244
Query: 122 REIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGFL-QKGFSHCFLAFKYANDPN 177
+ F FGC + G+ G GR +S Q + K FS+C P+
Sbjct: 245 --VDDFLFGCGQDNEGLFSGSAGLIGLGRHPISFVQQTSSIYNKIFSYCL--------PS 294
Query: 178 ISSPL--VIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFD 235
SS L + + ++ NL++TP+ +Y + + I++G + L V S F
Sbjct: 295 TSSSLGHLTFGASAATNANLKYTPLSTISGDNTFYGLDIVGISVGGTKLPAVSSS--TFS 352
Query: 236 SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNN 295
+ GG ++DSGT T L Y+ L S + + YP A E FD CY +
Sbjct: 353 A---GGSIIDSGTVITRLAPTAYAALRSAFRQGMEKYPVA---NEDGLFDTCYDF----S 402
Query: 296 TFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFG 355
+ + P I F F V++ LP SA CL F + +G+ +FG
Sbjct: 403 GYKEISVPKIDFEFAGGVTVELPLVGILIGRSAQQ-----VCLAFAA--NGNDNDITIFG 455
Query: 356 SFQQQNVEVVYDLEKERIGFQPMDC 380
+ QQ+ +EVVYD+E RIGF C
Sbjct: 456 NVQQKTLEVVYDVEGGRIGFGAAGC 480
>gi|326513755|dbj|BAJ87896.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 442
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 109/391 (27%), Positives = 162/391 (41%), Gaps = 69/391 (17%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + +DTGSD+ W C C +C + + F + S++ C+ C N H
Sbjct: 106 VVLTLDTGSDVVWTQCE----PCAEC----FTQPLPRFDTAASNTVRSVACSDPLC-NAH 156
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
S C + GC+ + YG+G L G RD+ G
Sbjct: 157 SEHG----CFLHGCT---------------YVSGYGDGSLSFGHFLRDSFTFDDGKGGGK 197
Query: 122 REIPKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPN 177
+P FGC G + GIAGFGRG LS+PSQL Q FS+CF A
Sbjct: 198 VTVPDIGFGCGMYNAGRFLQTETGIAGFGRGPLSLPSQLKVRQ--FSYCFTTRFEAK--- 252
Query: 178 ISSPLVI---GDVAISSKDNLQFTPMLKS--PMYPNYYYI-GLEAITIGNSSLTEVPLSL 231
SSP+ + GD+ + + TP ++S P N +Y+ + +T+G + L VP
Sbjct: 253 -SSPVFLGGAGDLKAHATGPILSTPFVRSLPPGTDNSHYVLSFKGVTVGKTRL-PVP--- 307
Query: 232 REFDSQGNGGLLVDSGTTYTHLPEPFYSQLLS--ILQSTITYYPRAKEVEERTGFDLCYR 289
E + G+G +DSGT T P+ + QL S I Q+ + P K +E D+C+
Sbjct: 308 -EIKADGSGATFIDSGTDITTFPDAVFRQLKSAFIAQAAL---PVNKTADED---DICFS 360
Query: 290 VPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYG 349
P + FH L LP+ N+ S C+ + D
Sbjct: 361 WDGKKTA----AMPKLVFH-LEGADWDLPRENYV----TEDRESGQVCVAVSTSGQMD-- 409
Query: 350 PSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
+ G+FQQQN +VYDL ++ P C
Sbjct: 410 -RTLIGNFQQQNTHIVYDLAAGKLLLVPAQC 439
>gi|343198386|gb|AEM05966.1| nodulin 41 [Phaseolus vulgaris]
Length = 437
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 115/397 (28%), Positives = 170/397 (42%), Gaps = 91/397 (22%)
Query: 7 DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDNP 66
DTGSDL WV C C C ++ L F P +SS+ TC S
Sbjct: 108 DTGSDLIWVQCS----PCASCFP-QSTPL---FQPLKSSTFMPTTCRS------------ 147
Query: 67 FDPCTM-----SGCSLSTLLKSTCCRPCPSFAYTYGEG-GLVTGILTRDTLKVHGSSPGI 120
PCT+ GC KS C + Y YG+ G+L+ +TL+
Sbjct: 148 -QPCTLLLPEQKGCG-----KSGECI----YTYKYGDQYSFSEGLLSTETLRFDSQGGVQ 197
Query: 121 IREIPKFCFGC-------VGSTYREPIGIAGFGRGALSVPSQLG-FLQKGFSHCFLAFKY 172
P FGC V +Y+ GI G G G LS+ SQ+G + FS+C L
Sbjct: 198 TVAFPNSFFGCGLYNNITVFPSYKL-TGIMGLGAGPLSLVSQIGDQIGHKFSYCLLPLGS 256
Query: 173 ANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLR 232
+ +S L G+ +I + + + TPM+ P P YY++ LEA+T+ + VP
Sbjct: 257 TS----TSKLKFGNESIITGEGVVSTPMIIKPWLPTYYFLNLEAVTVAQKT---VP---- 305
Query: 233 EFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPC 292
+G +++DSGT T+L E FY + LQ ++ A E+ + L + P
Sbjct: 306 --TGSTDGNVIIDSGTLLTYLGESFYYNFAASLQESL-----AVELVQDVLSPLPFCFPY 358
Query: 293 PNNTFTDDLFPSITFHFLN--------NVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMD 344
+N +FP I F F N+ ++ N M APS+ S +
Sbjct: 359 RDNF----VFPEIAFQFTGARVSLKPANLFVMTEDRNTVCLMIAPSSVSGIS-------- 406
Query: 345 DGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
+FGSF Q + +V YDLE +++ FQP DC+
Sbjct: 407 --------IFGSFSQIDFQVEYDLEGKKVSFQPTDCS 435
>gi|2541876|dbj|BAA22813.1| CND41, chloroplast nucleoid DNA binding protein [Nicotiana tabacum]
Length = 502
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 105/390 (26%), Positives = 166/390 (42%), Gaps = 64/390 (16%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + DTGSDLTW C C Y + + F PS S + S +C S+ C ++
Sbjct: 167 LSLIFDTGSDLTWTQCQPCVKSC-----YAQQQPI--FDPSTSKTYSNISCTSAACSSLK 219
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
S+ C+ S C + YG+ G +D L + +
Sbjct: 220 SATGNSPGCSSSNCV---------------YGIQYGDSSFTIGFFAKDKLTLTQNDV--- 261
Query: 122 REIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLG-FLQKGFSHCFLAFKYANDPN 177
F FGC + + + G+ G GR LS+ Q K FS+C + +N
Sbjct: 262 --FDGFMFGCGQNNKGLFGKTAGLIGLGRDPLSIVQQTAQKFGKYFSYCLPTSRGSN--- 316
Query: 178 ISSPLVIGD-----VAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLR 232
L G+ + + K+ + FTP S YY+I + I++G +L+ P+ +
Sbjct: 317 --GHLTFGNGNGVKASKAVKNGITFTP-FASSQGTAYYFIDVLGISVGGKALSISPMLFQ 373
Query: 233 EFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPC 292
N G ++DSGT T LP Y L S + ++ YP A + D CY +
Sbjct: 374 ------NAGTIIDSGTVITRLPSTAYGSLKSAFKQFMSKYPTAPALSL---LDTCYDL-- 422
Query: 293 PNNTFTDDLFPSITFHFLNNVSLVL-PQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPS 351
+ +T P I+F+F N ++ L P G +N ++ CL F +GD
Sbjct: 423 --SNYTSISIPKISFNFNGNANVELDPNGILI------TNGASQVCLAFAG--NGDDDSI 472
Query: 352 GVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
G+FG+ QQQ +EVVYD+ ++GF C+
Sbjct: 473 GIFGNIQQQTLEVVYDVAGGQLGFGYKGCS 502
>gi|357481199|ref|XP_003610885.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512220|gb|AES93843.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 416
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 108/386 (27%), Positives = 170/386 (44%), Gaps = 64/386 (16%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGSDL W+ C C+ C Y+ K M F P +SS+ + +C S C H D
Sbjct: 85 VDTGSDLIWIQCA----PCLGC--YKQIKPM--FDPLKSSTYNNISCDSPLC---HKLD- 132
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEG--GLVTGILTRDTLKVHGSSPGIIRE 123
+ C P YTYG G L G+L +DT S+ G
Sbjct: 133 -----------------TGVCSPEKRCNYTYGYGDNSLTKGVLAQDT-ATFTSNTGKPVS 174
Query: 124 IPKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLG--FLQKGFSHCFLAFKYANDPN 177
+ +F FGC G +G+ G G G S+ SQ+G F K FS C + F D
Sbjct: 175 LSRFLFGCGHNNTGGFNDHEMGLIGLGGGPTSLISQIGPLFGGKKFSQCLVPF--LTDIK 232
Query: 178 ISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
ISS + G + + + TP++ Y++ L I++ + T P++ +
Sbjct: 233 ISSRMSFGKGSQVLGNGVVTTPLVPREK-DTSYFVTLLGISVED---TYFPMN----STI 284
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTF 297
G +LVDSGT LP+ Y ++ + +++ + P + G LCYR T
Sbjct: 285 GKANMLVDSGTPPILLPQQLYDKVFAEVRNKVALKPITDD--PSLGTQLCYR------TQ 336
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPS-NSSAVKCLLFQSMDDGDYGPSGVFGS 356
T+ P++TFHF+ L+ P P+ + + CL + + D GV+G+
Sbjct: 337 TNLKGPTLTFHFVGANVLLTP----IQTFIPPTPQTKGIFCLAIYNRTNSD---PGVYGN 389
Query: 357 FQQQNVEVVYDLEKERIGFQPMDCAS 382
F Q N + +DL+++ + F+P DC
Sbjct: 390 FAQSNYLIGFDLDRQVVSFKPTDCTK 415
>gi|15228618|ref|NP_191741.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|6850873|emb|CAB71112.1| putative protein [Arabidopsis thaliana]
gi|332646739|gb|AEE80260.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 483
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 108/381 (28%), Positives = 161/381 (42%), Gaps = 47/381 (12%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + +DTGSD+ W+ C C C N+ + F P +S + + C S C +
Sbjct: 148 VYMVLDTGSDVVWLQCS----PCKAC----YNQTDAIFDPKKSKTFATVPCGSRLCRRLD 199
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
S C T TC + +YG+G G + +TL HG+ +
Sbjct: 200 DSSE----CV-------TRRSKTCL-----YQVSYGDGSFTEGDFSTETLTFHGAR---V 240
Query: 122 REIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFL-AFKYANDPNIS 179
+P C + G+ G GRG LS PSQ G FS+C + +
Sbjct: 241 DHVPLGCGHDNEGLFVGAAGLLGLGRGGLSFPSQTKNRYNGKFSYCLVDRTSSGSSSKPP 300
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
S +V G+ A+ FTP+L +P +YY+ L I++G S + V S + D+ GN
Sbjct: 301 STIVFGNAAVPKTS--VFTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSESQFKLDATGN 358
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTD 299
GG+++DSGT+ T L +P Y L + T RA FD C+ + + T
Sbjct: 359 GGVIIDSGTSVTRLTQPAYVALRDAFRLGATKLKRAPSYSL---FDTCFDL----SGMTT 411
Query: 300 DLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQ 359
P++ FHF + LP N+ P N+ C F G G + G+ QQ
Sbjct: 412 VKVPTVVFHF-GGGEVSLPASNYLI----PVNTEGRFCFAFA----GTMGSLSIIGNIQQ 462
Query: 360 QNVEVVYDLEKERIGFQPMDC 380
Q V YDL R+GF C
Sbjct: 463 QGFRVAYDLVGSRVGFLSRAC 483
>gi|357514989|ref|XP_003627783.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521805|gb|AET02259.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 416
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 115/385 (29%), Positives = 172/385 (44%), Gaps = 85/385 (22%)
Query: 7 DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDNP 66
DTGSD+ W+ C C +C N+ F PS+SS+ C+S C + +
Sbjct: 105 DTGSDIVWLQCE----PCKEC----YNQTTPKFKPSKSSTYKNIPCSSDLCKSGQQGNLS 156
Query: 67 FDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIPK 126
D T L+S+ P SF T V G T +T+ G+S GI+
Sbjct: 157 VDTLT---------LESSTGHPI-SFPKT------VIGCGTDNTVSFEGASSGIV----- 195
Query: 127 FCFGCVGSTYREPIGIAGFGRGALSVPSQLG-FLQKGFSHCFLAFKYANDPNISSPLVIG 185
G G G S+ +QLG + FS+C L + N +S L G
Sbjct: 196 -----------------GLGGGPASLITQLGSSIDAKFSYCLLPNPV--ESNTTSKLNFG 236
Query: 186 DVAISSKDNLQFTPMLKS-PMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGG--- 241
D A+ S D + TP++K P+ +YY+ LEA ++GN + EF+ NGG
Sbjct: 237 DTAVVSGDGVVSTPIVKKDPIV--FYYLTLEAFSVGNKRI--------EFEGSSNGGHEG 286
Query: 242 -LLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTG-FDLCYRVPCPNNTFTD 299
+++DSGTT T +P Y+ L+S + + K V + T F+LCY V T
Sbjct: 287 NIIIDSGTTLTVIPTDVYNN----LESAVLELVKLKRVNDPTRLFNLCYSV-----TSDG 337
Query: 300 DLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGV---FGS 356
FP IT HF + L + F + + + CL F + + PS V FG+
Sbjct: 338 YDFPIITTHF-KGADVKLHPISTFVDV-----ADGIVCLAFATT--SAFIPSDVVSIFGN 389
Query: 357 FQQQNVEVVYDLEKERIGFQPMDCA 381
QQN+ V YDL+++ + F+P DC+
Sbjct: 390 LAQQNLLVGYDLQQKIVSFKPTDCS 414
>gi|302762735|ref|XP_002964789.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
gi|300167022|gb|EFJ33627.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
Length = 390
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 112/381 (29%), Positives = 168/381 (44%), Gaps = 58/381 (15%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGSD+TW+ C C C +++ + PS SSS R C S+ C +
Sbjct: 62 LDTGSDVTWIQCA----PCSSC----YSQVDPIYDPSNSSSYRRVYCGSALCQALD---- 109
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
+ C GCS + YG+ +G L ++ + +S +R I
Sbjct: 110 -YSACQGMGCS---------------YRVVYGDSSASSGDLGIESFYLGPNSSTAMRNI- 152
Query: 126 KFCFGCVGST---YREPIGIAGFGRGALSVPSQLGF-LQKGFSHCFLAFKYANDPNISSP 181
FGC S +R G+ G G G LS SQ+ + FS+C L +Y+ + SSP
Sbjct: 153 --AFGCGHSNSGLFRGEAGLLGMGGGTLSFFSQIAASIGPAFSYC-LVDRYSQLQSRSSP 209
Query: 182 LVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGG 241
L+ G AI +FTP+LK+P +YY L I++G ++L +P + G GG
Sbjct: 210 LIFGRTAIPFA--ARFTPLLKNPRIDTFYYAILTGISVGGTAL-PIPPAQFALTGNGTGG 266
Query: 242 LLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDL 301
++DSGT+ T + Y+ L ++ P A V D C+
Sbjct: 267 AILDSGTSVTRVVPAAYAVLRDAYRAASRNLPPAPGVYL---LDTCFNF----QGLPTVQ 319
Query: 302 FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLF--QSMDDGDYGPSGVFGSFQQ 359
PS+ HF N+V +VLP GN + P + S CL F SM P V G+ QQ
Sbjct: 320 IPSLVLHFDNDVDMVLPGGN----ILIPVDRSGTFCLAFAPSSM------PISVIGNVQQ 369
Query: 360 QNVEVVYDLEKERIGFQPMDC 380
Q + +DL++ I P +C
Sbjct: 370 QTFRIGFDLQRSLIAIAPREC 390
>gi|302806531|ref|XP_002985015.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
gi|300147225|gb|EFJ13890.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
Length = 533
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 119/392 (30%), Positives = 179/392 (45%), Gaps = 58/392 (14%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFC-LNIHS 62
+ +DTGSDLTW+ C C C D F PS+S+S C ++ C L +H
Sbjct: 186 LIIDTGSDLTWLQCK----PCKACFDQSG----PVFDPSQSTSFKIIPCNAAACDLVVH- 236
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIR 122
D C + S TC + Y YG+ +G L ++L V S
Sbjct: 237 -----DECRDNSSKTS---PKTC-----KYFYWYGDSSRTSGDLALESLSVSLSDHPSSL 283
Query: 123 EIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGF--LQKGFSHCFLAFKYANDPN 177
EI GC S ++ G+ G G+GALS PSQL + + FS+C + N+ +
Sbjct: 284 EIRDMVIGCGHSNKGLFQGAGGLLGLGQGALSFPSQLRSSPIGQSFSYCLV--DRTNNLS 341
Query: 178 ISSPLVIG-DVAISSK-DNLQFTPMLKSP-MYPNYYYIGLEAITIGNSSLTEVPLSLREF 234
+SS + G A+S D ++FTP +++ +YY+G++ I I + L +P
Sbjct: 342 VSSAISFGAGFALSRHFDQMRFTPFVRTNNSVETFYYLGIQGIKI-DQELLPIPAERFAI 400
Query: 235 DSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCY----RV 290
G+GG ++DSGTT T+L Y + S + I+Y PRA + +CY R
Sbjct: 401 APNGSGGTIIDSGTTLTYLNRDAYRAVESAFLARISY-PRADPFDI---LGICYNATGRT 456
Query: 291 PCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGP 350
P FP+++ F N L LPQ N+F P A CL D
Sbjct: 457 AVP--------FPTLSIVFQNGAELDLPQENYFIQ---PDPQEAKHCLAILPTDG----- 500
Query: 351 SGVFGSFQQQNVEVVYDLEKERIGFQPMDCAS 382
+ G+FQQQN+ +YD++ R+GF DC++
Sbjct: 501 MSIIGNFQQQNIHFLYDVQHARLGFANTDCSA 532
>gi|357116170|ref|XP_003559856.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 460
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 102/387 (26%), Positives = 164/387 (42%), Gaps = 54/387 (13%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DT +D WVPC C C +F+P+ S++ C + C S
Sbjct: 111 VDTSNDAAWVPCAG----CHGCP-----TTAPSFNPASSATFRPVPCGAPPC-----SQA 156
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
P CT S K++C F+ +YG+ L L++D L V + G+I+
Sbjct: 157 PNPSCTSLAKS-----KNSC-----GFSLSYGDSSL-DATLSQDNLAVTANG-GVIK--- 201
Query: 126 KFCFGCV----GSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSP 181
+ FGC+ GS + V G + FS+C ++ Y + N S
Sbjct: 202 GYTFGCLTKSNGSAAPAQGLLGLGRGPLGFVAQTKGIYEGTFSYCLPSY-YRSAANFSGS 260
Query: 182 LVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGG 241
L +G + + ++ TP+L SP P+ YY+ + + IG S+ +P S FD+ G
Sbjct: 261 LTLGRKGQPAPEKMKTTPLLASPHRPSLYYVAMTGVRIGKKSV-PIPPSALAFDAATGAG 319
Query: 242 LLVDSGTTYTHLPEPFYSQLLSILQSTIT-------YYPRAKEVEERTGFDLCYRVPCPN 294
++DSGT + L +P Y+ + ++ + + V GFD CY V
Sbjct: 320 TVLDSGTMFARLAQPAYAAVRDEVRRRVAGSLRRRGGGGASVSVSSLGGFDTCYNV---- 375
Query: 295 NTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCL-LFQSMDDGDYGPSGV 353
+ +P++T F + + LP+ N + S S CL + S DG V
Sbjct: 376 ---STVAWPAVTLVFGGGMEVRLPEENVVIRSTYGSTS----CLAMAASPADGVNAALNV 428
Query: 354 FGSFQQQNVEVVYDLEKERIGFQPMDC 380
GS QQQN V++D+ R+GF C
Sbjct: 429 IGSLQQQNHRVLFDVPNARVGFARERC 455
>gi|388502484|gb|AFK39308.1| unknown [Medicago truncatula]
Length = 425
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 112/371 (30%), Positives = 167/371 (45%), Gaps = 64/371 (17%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
MDT +D W+PC C C + F+P +S++ +CA+ C + N
Sbjct: 110 MDTSNDAAWIPCT----ACDGCAS-------TLFAPEKSTTFKNVSCAAPECKQV---PN 155
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
P GC +S+ +F TYG + L +DT+ + + P +P
Sbjct: 156 P-------GCGVSSR----------NFNLTYGSSSIAAN-LVQDTITL-ATDP-----VP 191
Query: 126 KFCFGCVGSTY---REPIGIAGFGRGALSVPSQLGFL-QKGFSHCFLAFKYANDPNISSP 181
+ FGCV T P G+ G GRG LS+ SQ L Q FS+C +FK N S
Sbjct: 192 SYTFGCVSKTTGTSAPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLN---FSGS 248
Query: 182 LVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGG 241
L +G VA + +++TP+LK+P + YY+ LEAI +G + ++P + F+ G
Sbjct: 249 LRLGPVAQPKR--IKYTPLLKNPRRSSLYYVNLEAIRVGR-KVVDIPPAALAFNPTTGAG 305
Query: 242 LLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDL 301
+ DSGT +T L P Y + + + P+ V GFD CY VP +
Sbjct: 306 TIFDSGTVFTRLVAPVYVAVRDEFRRRVG--PKL-TVTSLGGFDTCYNVPI--------V 354
Query: 302 FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQN 361
P+ITF F +++ LPQ N +A S + CL D V + QQQN
Sbjct: 355 VPTITFIF-TGMNVTLPQDNILIHSTAGSTT----CLAMAGAPDNVNSVLNVIANMQQQN 409
Query: 362 VEVVYDLEKER 372
V+YD+ R
Sbjct: 410 HRVLYDVPNSR 420
>gi|357118076|ref|XP_003560785.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 477
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 116/391 (29%), Positives = 170/391 (43%), Gaps = 79/391 (20%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DTGSDL+W+ C S C YR + +F P++SSS + C + C
Sbjct: 152 IILDTGSDLSWIQCKPCSGHC-----YRQHD--PDFDPAKSSSYAAVPCGTPVC------ 198
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
+ + T C + YG+G TG+L+RDTL + SS +
Sbjct: 199 -----------AAAGGMCNGTTCL----YGVQYGDGSSTTGVLSRDTLTFNSSS-----K 238
Query: 124 IPKFCFGCVGSTYREPIGIAGFGR---------GALSVPSQLGFLQKG-FSHCFLAFKYA 173
F FGC I FG G LS+PSQ G FS+C + Y
Sbjct: 239 FTGFTFGCGEKN------IGDFGEVDGLLGLGRGKLSLPSQAAPSFGGVFSYCLPS--YN 290
Query: 174 NDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLRE 233
P L IG +S +Q+T M+K P YP++Y+I L +I IG L VP S+
Sbjct: 291 TTPGY---LNIGATKPTSTVPVQYTAMIKKPQYPSFYFIELVSINIGGYIL-PVPPSVFT 346
Query: 234 FDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCP 293
G L+DSGT T+LP P Y+ L + T+ A E D CY
Sbjct: 347 -----KTGTLLDSGTILTYLPPPAYTSLRDRFKFTMQGNKPAPPYEP---LDTCYD---- 394
Query: 294 NNTFTDD---LFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSA-VKCLLFQSMDDGDYG 349
FT + P+++F+F + L + + M P ++ + CL F S
Sbjct: 395 ---FTGQGAIVIPAVSFNFSDGAVFDL---DFYGIMIFPDDAKPLIGCLAFVSRPAA--M 446
Query: 350 PSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
P + G+ QQ+ EV+YD+ ++IGF P+ C
Sbjct: 447 PFSIVGNTQQRAAEVIYDVPSQKIGFIPISC 477
>gi|125542690|gb|EAY88829.1| hypothetical protein OsI_10302 [Oryza sativa Indica Group]
Length = 440
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 107/394 (27%), Positives = 162/394 (41%), Gaps = 71/394 (18%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DT +D TW C C C S F+P+ SSS + C+SS+C
Sbjct: 98 LDTSADATWAHCS----PCGTCPS------SSLFAPANSSSYASLPCSSSWCPLFQGQAC 147
Query: 66 PFD---------PCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGS 116
P P T+ C+ S +P FA + L + DTL++
Sbjct: 148 PAPQGGGDAAPPPATLPTCAFS--------KP---FADASFQAALAS-----DTLRLGKD 191
Query: 117 SPGIIREIPKFCFGCV----GSTYREPI-GIAGFGRGALSVPSQLGFLQKG-FSHCFLAF 170
+ IP + FGCV G T P G+ G GRG +++ SQ G L G FS+C ++
Sbjct: 192 A------IPNYTFGCVSSVTGPTTNMPRQGLLGLGRGPMALLSQAGSLYNGVFSYCLPSY 245
Query: 171 KYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLS 230
+ S L +G ++++TPML++P + YY+ + +++G + +VP
Sbjct: 246 R---SYYFSGSLRLG-AGGGQPRSVRYTPMLRNPHRSSLYYVNVTGLSVGRA-WVKVPAG 300
Query: 231 LREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRV 290
FD+ G +VDSGT T P Y+ L + + FD C+
Sbjct: 301 SFAFDAATGAGTVVDSGTVITRWTAPVYAALREEFRRQVA---APSGYTSLGAFDTCFN- 356
Query: 291 PCPNNTFTDDLF----PSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDG 346
TD++ P++T H V L LP N SA + + CL
Sbjct: 357 -------TDEVAAGGAPAVTVHMDGGVDLALPMENTLIHSSA----TPLACLAMAEAPQN 405
Query: 347 DYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
V + QQQN+ VV+D+ RIGF C
Sbjct: 406 VNSVVNVIANLQQQNIRVVFDVANSRIGFAKESC 439
>gi|294461757|gb|ADE76437.1| unknown [Picea sitchensis]
Length = 325
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 105/393 (26%), Positives = 169/393 (43%), Gaps = 87/393 (22%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DTGSD+TW+ C C C Y+ S F P+ S++ C S+ C + S
Sbjct: 3 LLIDTGSDITWIQCD----PCPQC--YKQQD--SLFQPAGSATYKPLPCNSTMCQQLQSF 54
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
+ + L S+C ++ +YG+ G +TL + S I+
Sbjct: 55 SH-------------SCLNSSC-----NYMVSYGDKSTTRGDFALETLTLR-SDDTILVS 95
Query: 124 IPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGF-LQKGFSHCFLAFKYANDPNIS 179
+P F FGC + + G+ G G+ ++ P+Q K FS+C P++S
Sbjct: 96 VPNFAFGCGHANKGLFNGAAGLMGLGKSSIGFPAQTSVAFGKVFSYCL--------PSVS 147
Query: 180 SP-----LVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREF 234
S L G+ A+ D ++FTP++ S P+ Y++ + I +G+ L P+S
Sbjct: 148 STIPSGILHFGEAAMLDYD-VRFTPLVDSSSGPSQYFVSMTGINVGDELL---PIS---- 199
Query: 235 DSQGNGGLLVDSGTTYTHLPEPFY-------SQLLSILQSTITYYPRAKEVEERTGFDLC 287
++VDSGT + + Y +Q+L LQ+ ++ P FD C
Sbjct: 200 -----ATVMVDSGTVISRFEQSAYERLRDAFTQILPGLQTAVSVAP----------FDTC 244
Query: 288 YRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGD 347
+RV +T D P IT HF ++ L L + Y + V C F G
Sbjct: 245 FRV----STVDDINIPLITLHFRDDAELRLSPVHILYPVD-----DGVMCFAFAPSSSG- 294
Query: 348 YGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
V G+FQQQN+ VYD+ K R+G +C
Sbjct: 295 ---RSVLGNFQQQNLRFVYDIPKSRLGISAFEC 324
>gi|357515001|ref|XP_003627789.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355521811|gb|AET02265.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 415
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 117/387 (30%), Positives = 164/387 (42%), Gaps = 90/387 (23%)
Query: 5 YMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSD 64
++DTGSDL W+ C C C ++ F PS SSS C S C HS
Sbjct: 104 FVDTGSDLVWLQCE----PCKQCYP----QITPIFDPSLSSSYQNIPCLSDTC---HS-- 150
Query: 65 NPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREI 124
+++T C V G L+ +TL + S+ G
Sbjct: 151 ----------------MRTTSCD--------------VRGYLSVETLTLD-STTGYSVSF 179
Query: 125 PKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNIS 179
PK GC G+ + GI G G G +S+PSQLG G FS+C + PN +
Sbjct: 180 PKTMIGCGYRNTGTFHGPSSGIVGLGSGPMSLPSQLGTSIGGKFSYCLGPWL----PNST 235
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
S L GD AI D TP++K YY + LEA ++GN L EF
Sbjct: 236 SKLNFGDAAIVYGDGAMTTPIVKKDAQSGYY-LTLEAFSVGNK--------LIEFGGPTY 286
Query: 240 GG----LLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTG-FDLCYRVPCPN 294
GG +L+DSGTT+T LP Y + +S + Y + VE+ G F LCY V
Sbjct: 287 GGNEGNILIDSGTTFTFLPYDVYYRF----ESAVAEYINLEHVEDPNGTFKLCYNV---- 338
Query: 295 NTFTDDLFPSITFHFLN-NVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGV 353
+ P IT HF ++ L +Y + S + CL F + +
Sbjct: 339 -AYHGFEAPLITAHFKGADIKL-------YYISTFIKVSDGIACLAFIPSQ------TAI 384
Query: 354 FGSFQQQNVEVVYDLEKERIGFQPMDC 380
FG+ QQN+ V Y+L + + F+P+DC
Sbjct: 385 FGNVAQQNLLVGYNLVQNTVTFKPVDC 411
>gi|115451209|ref|NP_001049205.1| Os03g0186900 [Oryza sativa Japonica Group]
gi|49532749|dbj|BAD26705.1| Radc1 [Oryza sativa Japonica Group]
gi|108706569|gb|ABF94364.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113547676|dbj|BAF11119.1| Os03g0186900 [Oryza sativa Japonica Group]
gi|215692805|dbj|BAG88249.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767626|dbj|BAG99854.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 438
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 105/394 (26%), Positives = 159/394 (40%), Gaps = 71/394 (18%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DT +D TW C C C S F+P+ SSS + C+SS+C
Sbjct: 96 LDTSADATWAHCS----PCGTCPS------SSLFAPANSSSYASLPCSSSWCPLFQGQAC 145
Query: 66 PFD---------PCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGS 116
P P T+ C+ S SF L DTL++
Sbjct: 146 PAPQGGGDAAPPPATLPTCAFSKPFADA------SF----------QAALASDTLRLGKD 189
Query: 117 SPGIIREIPKFCFGCV----GSTYREPI-GIAGFGRGALSVPSQLGFLQKG-FSHCFLAF 170
+ IP + FGCV G T P G+ G GRG +++ SQ G L G FS+C ++
Sbjct: 190 A------IPNYTFGCVSSVTGPTTNMPRQGLLGLGRGPMALLSQAGSLYNGVFSYCLPSY 243
Query: 171 KYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLS 230
+ S L +G ++++TPML++P + YY+ + +++G++ +VP
Sbjct: 244 R---SYYFSGSLRLG-AGGGQPRSVRYTPMLRNPHRSSLYYVNVTGLSVGHA-WVKVPAG 298
Query: 231 LREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRV 290
FD+ G +VDSGT T P Y+ L + + FD C+
Sbjct: 299 SFAFDAATGAGTVVDSGTVITRWTAPVYAALREEFRRQVA---APSGYTSLGAFDTCFN- 354
Query: 291 PCPNNTFTDDLF----PSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDG 346
TD++ P++T H V L LP N SA + + CL
Sbjct: 355 -------TDEVAAGGAPAVTVHMDGGVDLALPMENTLIHSSA----TPLACLAMAEAPQN 403
Query: 347 DYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
V + QQQN+ VV+D+ R+GF C
Sbjct: 404 VNSVVNVIANLQQQNIRVVFDVANSRVGFAKESC 437
>gi|224085379|ref|XP_002307559.1| predicted protein [Populus trichocarpa]
gi|222857008|gb|EEE94555.1| predicted protein [Populus trichocarpa]
Length = 455
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 115/393 (29%), Positives = 162/393 (41%), Gaps = 63/393 (16%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGSDL W+ C C DC + N + P SSS C C + S D
Sbjct: 107 LDTGSDLNWIQC----VPCHDC--FEQNGPY--YDPKESSSFRNIGCHDPRCHLVSSPDP 158
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSP---GIIR 122
P PC + CP F Y YG+ TG +T V+ +SP +
Sbjct: 159 PL-PCKAEN------------QTCPYF-YWYGDSSNTTGDFATETFTVNLTSPTGKSEFK 204
Query: 123 EIPKFCFGCVGSTYREPIGIAGFGRGA----------LSVPSQLGFLQ-KGFSHCFLAFK 171
+ FGC G R G GA LS SQL L FS+C +
Sbjct: 205 RVENVMFGC-GHWNR------GLFHGASGLLGLGRGPLSFSSQLQSLYGHSFSYCLV--D 255
Query: 172 YANDPNISSPLVIG-DVAISSKDNLQFTPMLKSPMYP--NYYYIGLEAITIGNSSLTEVP 228
+D N+SS L+ G D + + L FT ++ P +YY+ +++I +G L +P
Sbjct: 256 RNSDTNVSSKLIFGEDKDLLNHPELNFTTLVGGKENPVDTFYYVQIKSIMVGGEVLN-IP 314
Query: 229 LSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCY 288
S S G GG +VDSGTT ++ EP Y + + YP V++ D CY
Sbjct: 315 ESTWNMTSDGVGGTIVDSGTTLSYFTEPAYQIIKDAFVKKVKGYPI---VQDFPILDPCY 371
Query: 289 RVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDY 348
V DL P F + P N+F + + V CL
Sbjct: 372 NVSGVEKI---DL-PDFGILFADGAVWNFPVENYFIRL----DPEEVVCLAILGTPRSAL 423
Query: 349 GPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
+ G++QQQN V+YD +K R+G+ PM+CA
Sbjct: 424 S---IIGNYQQQNFHVLYDTKKSRLGYAPMNCA 453
>gi|326512608|dbj|BAJ99659.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 484
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 105/384 (27%), Positives = 157/384 (40%), Gaps = 62/384 (16%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ V DTGSDL+WV C C DC + ++ F P+RSS+ S CAS C +
Sbjct: 159 MTVVFDTGSDLSWVQC----TPCSDCYEQKDPL----FDPARSSTYSAVPCASPECQGLD 210
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
S CS + CR + YG+ G L RDTL + S
Sbjct: 211 SRS----------CS-----RDKKCR----YEVVYGDQSQTDGALARDTLTLTQSD---- 247
Query: 122 REIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGF-LQKGFSHCFLAFKYANDPN 177
+P F FGC + G+ G GR +S+ SQ GFS+C + P+
Sbjct: 248 -VLPGFVFGCGEQDTGLFGRADGLVGLGREKVSLSSQAASKYGAGFSYCL-----PSSPS 301
Query: 178 ISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
+ L +G A + N +FT M P++YY+ L + + ++ P+
Sbjct: 302 AAGYLSLGGPAPA---NARFTAMETRHDSPSFYYVRLVGVKVAGRTVRVSPIVFSA---- 354
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTF 297
G ++DSGT T LP Y+ L S ++ Y K + D CY
Sbjct: 355 --AGTVIDSGTVITRLPPRVYAALRSAFARSMGRYGY-KRAPALSILDTCYDF----TGH 407
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSF 357
T PS+ F ++ L Y CL F +GD +G+ G+
Sbjct: 408 TTVRIPSVALVFAGGAAVGLDFSGVLYVAKVSQ-----ACLAFAP--NGDGADAGIIGNT 460
Query: 358 QQQNVEVVYDLEKERIGFQPMDCA 381
QQ+ + VVYD+ +++IGF C+
Sbjct: 461 QQKTLAVVYDVARQKIGFGANGCS 484
>gi|307103543|gb|EFN51802.1| hypothetical protein CHLNCDRAFT_59135 [Chlorella variabilis]
Length = 746
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 116/408 (28%), Positives = 168/408 (41%), Gaps = 85/408 (20%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V +DTGS +T+VPC + C N + F P SS++SR +C S C S
Sbjct: 93 VIVDTGSTMTYVPCSSCGSGCGP------NHQDAAFDPEASSTASRISCTSPKC----SC 142
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
+P C+ C+ + +Y E +GIL D L +H PG
Sbjct: 143 GSPRCGCSTQQCT---------------YTRSYAEQSSSSGILLEDVLALHDGLPGA--- 184
Query: 124 IPKFCFGC----VGSTYREPI-GIAGFGRGALSVPSQL---GFLQKGFSHCFLAFKYAND 175
FGC G +R+ G+ G G SV +QL G + FS CF +
Sbjct: 185 --PIIFGCETRETGEIFRQRADGLFGLGNSDASVVNQLVKAGVIDDVFSLCFGMVEG--- 239
Query: 176 PNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFD 235
L++GD + +LQ+TP+L S +P YY + + ++ + L P+S FD
Sbjct: 240 ---DGALLLGDAEVPGSISLQYTPLLTSTTHPFYYNVKMLSLAVEGQLL---PVSQSLFD 293
Query: 236 SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNN 295
QG G +L DSGTT+T++P P + A VE+ RVP P+
Sbjct: 294 -QGYGTVL-DSGTTFTYMPSPVFKAF-------------AGAVEKYALSHGLKRVPGPDP 338
Query: 296 TFTD----------------DLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLL 339
F D +FPS+ F SLVL N+ + + +S CL
Sbjct: 339 QFDDICFGQAPSHDDLEALSSVFPSMEVQFDQGTSLVLGPLNYLFVHTF---NSGKYCLG 395
Query: 340 FQSMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASAQ 387
D+G G + G +NV V YD +R+GF P C Q
Sbjct: 396 V--FDNGRAGT--LLGGITFRNVLVRYDRANQRVGFGPALCKELGEMQ 439
>gi|357118074|ref|XP_003560784.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial
[Brachypodium distachyon]
Length = 452
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 117/393 (29%), Positives = 168/393 (42%), Gaps = 87/393 (22%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
DTGSDL+W+ C S C Y+ + + F P++SSS + C ++ C N
Sbjct: 129 FDTGSDLSWIQCQPCSGHC-----YKQHDPV--FDPAKSSSYAVVPCGTTECAAAGGECN 181
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
+TC + YG+G TG+L R+TL SS E
Sbjct: 182 ----------------GTTCV-----YGVEYGDGSSTTGVLARETLTFSSSS-----EFT 215
Query: 126 KFCFGCVGSTYREPIGIAGFGR--------------GALSVPSQLGFLQKGFSHCFLAFK 171
F FGC G T + FG + + P+ G FS+C +
Sbjct: 216 GFIFGC-GET-----NLGDFGEVDGLLGLGRGSLSLSSQAAPAFGGI----FSYCLPS-- 263
Query: 172 YANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSL 231
Y P L IG ++ + +Q+T M+ P YP++Y+I L +I IG L P+
Sbjct: 264 YNTTPGY---LSIGATPVTGQIPVQYTAMVNKPDYPSFYFIELVSINIGGYVL---PVPP 317
Query: 232 REFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVP 291
EF G L+DSGT T+LP P Y+ L + T+ A +E D CY
Sbjct: 318 SEFTKTGT---LLDSGTILTYLPPPAYTALRDRFKFTMQGSKPAPPYDE---LDTCYD-- 369
Query: 292 CPNNTFTDD---LFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSS-AVKCLLFQSMDDGD 347
FT L P ++F+F + L N F M+ P ++ AV CL F S D
Sbjct: 370 -----FTGQSGILIPGVSFNFSDGAVFNL---NFFGIMTFPDDTKPAVGCLAFVSR-PAD 420
Query: 348 YGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
P V GS Q++ EV+YD+ ++IGF P C
Sbjct: 421 M-PFSVVGSTTQRSAEVIYDVPAQKIGFIPASC 452
>gi|326517992|dbj|BAK07248.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 500
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 110/383 (28%), Positives = 161/383 (42%), Gaps = 62/383 (16%)
Query: 3 QVYM--DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNI 60
Q+YM DTGSD+TW+ C C DC Y + + + PS S+S + C S C ++
Sbjct: 175 QLYMVLDTGSDVTWLQCQP----CADC--YAQSDPV--YDPSVSTSYATVGCDSPRCRDL 226
Query: 61 HSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
+ + C ST +C + YG+G G +TL + S+P
Sbjct: 227 DA----------AACRNST---GSCL-----YEVAYGDGSYTVGDFATETLTLGDSAP-- 266
Query: 121 IREIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPN 177
+ GC + G+ G G LS PSQ+ FS+C + D
Sbjct: 267 ---VSNVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQISATT--FSYCLV----DRDSP 317
Query: 178 ISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
SS L GD S+ P+++SP +YY+ L I++G +L+ +P S D
Sbjct: 318 SSSTLQFGD----SEQPAVTAPLIRSPRTNTFYYVALSGISVGGEALS-IPSSAFAMDDA 372
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTF 297
G+GG++VDSGT T L Y L PRA V FD CY + ++
Sbjct: 373 GSGGVIVDSGTAVTRLQSGAYGALREAFVQGTQSLPRASGVSL---FDTCYDLAGRSSV- 428
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSF 357
P++ F L LP N+ P +++ CL F G GP + G+
Sbjct: 429 ---QVPAVALWFEGGGELKLPAKNYLI----PVDAAGTYCLAFA----GTSGPVSIIGNV 477
Query: 358 QQQNVEVVYDLEKERIGFQPMDC 380
QQQ V V +D K +GF C
Sbjct: 478 QQQGVRVSFDTAKNTVGFTADKC 500
>gi|388502342|gb|AFK39237.1| unknown [Lotus japonicus]
Length = 440
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 111/400 (27%), Positives = 172/400 (43%), Gaps = 100/400 (25%)
Query: 7 DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDNP 66
DTGSDLTWV C C + + N + + P SS+ + C S C +
Sbjct: 114 DTGSDLTWVQCS----PCDNTKCFAQNTPL--YDPLNSSTFTLLPCDSQPCTQL------ 161
Query: 67 FDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDT-----LKVHGSSPGII 121
P + CS C +AYTYG+ G L+ D+ L++H +S
Sbjct: 162 --PYSQYVCSD----YGDCI-----YAYTYGDNSYSYGGLSSDSIRLMLLQLHYNS---- 206
Query: 122 REIPKFCFGC------VGSTYREPIGIAGFGRGALSVPSQLG-FLQKGFSHCFLAFKYAN 174
K CFGC + GI G G G LS+ SQLG + FS+C L F
Sbjct: 207 ----KICFGCGFQNKFTADKSGKTTGIVGLGAGPLSLVSQLGDEIGHKFSYCLLPFS--- 259
Query: 175 DPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREF 234
N +S L G+ AI + + TP++ P P +YY+ LE IT+G ++
Sbjct: 260 -SNSNSKLKFGEAAIVQGNGVVSTPLIIKPDLP-FYYLNLEGITVGAKTVKT-------- 309
Query: 235 DSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERT----GFDLCYRV 290
Q +G +++DSG+T T+L E FY++ +S+++ T+ VEE FD C+
Sbjct: 310 -GQTDGNIIIDSGSTLTYLEESFYNEFVSLVKETVA-------VEEDQYIPYPFDFCF-- 359
Query: 291 PCPNNTFTDDLF--PSITFHFLNN-------VSLVLPQGNHFYAMSAPSNSSAVKCLLFQ 341
T+ + + P + FHF +LVL + N + PS+ +
Sbjct: 360 -----TYKEGMSTPPDVVFHFTGGDVVLKPMNTLVLIEDNLICSTVVPSHFDGI------ 408
Query: 342 SMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
+FG+ Q + V YD++ ++ F P DC+
Sbjct: 409 ----------AIFGNLGQIDFHVGYDIQGGKVSFAPTDCS 438
>gi|54290728|dbj|BAD62398.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 486
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 116/393 (29%), Positives = 169/393 (43%), Gaps = 81/393 (20%)
Query: 4 VYMDTGSDLTWV---PCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCL-- 58
+ DTGSDL+WV PCG+ S C D F PS+SS+ + C C
Sbjct: 159 LIFDTGSDLSWVQCQPCGS-SGHCHPQQD-------PLFDPSKSSTYAAVHCGEPQCAAA 210
Query: 59 -NIHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSS 117
++ S DN +TC + YG+G TG+L+RDTL + S
Sbjct: 211 GDLCSEDN-----------------TTCL-----YLVRYGDGSSTTGVLSRDTLALTSS- 247
Query: 118 PGIIREIPKFCFGCVGSTYREPIGIAGFGR---------GALSVPSQLGF-LQKGFSHCF 167
R + F FGC G+ + FGR G LS+PSQ FS+C
Sbjct: 248 ----RALTGFPFGC-GTR-----NLGDFGRVDGLLGLGRGELSLPSQAAASFGAVFSYCL 297
Query: 168 LAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEV 227
+ + + L IG + Q+T ML+ P +P++Y++ L +I IG L
Sbjct: 298 -----PSSNSTTGYLTIGATPATDTGAAQYTAMLRKPQFPSFYFVELVSIDIGGYVLPVP 352
Query: 228 PLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLC 287
P GG L+DSGT T+LP Y+ L + T+ Y A + D C
Sbjct: 353 PAVFTR------GGTLLDSGTVLTYLPAQAYALLRDRFRLTMERYTPAPPNDV---LDAC 403
Query: 288 YRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGD 347
Y + + P+++F F + L + F M + V CL F +MD G
Sbjct: 404 YDFAGESEV----VVPAVSFRFGDGAVFEL---DFFGVMIFLDEN--VGCLAFAAMDTGG 454
Query: 348 YGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
P + G+ QQ++ EV+YD+ E+IGF P C
Sbjct: 455 L-PLSIIGNTQQRSAEVIYDVAAEKIGFVPASC 486
>gi|357113696|ref|XP_003558637.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 432
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 100/389 (25%), Positives = 160/389 (41%), Gaps = 57/389 (14%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
I + +DT +D TW C C C S F+P+ S+S + C+S+ C +
Sbjct: 90 ILLALDTSADATWAHCS----PCGTCPSSG-----SLFAPANSTSYAPLPCSSTMCTVLQ 140
Query: 62 S----SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSS 117
+ +P+D S+ P +F + + L D L + +
Sbjct: 141 GQPCPAQDPYD--------------SSAPLPMCAFTKPFADASF-QASLASDWLHLGKDA 185
Query: 118 PGIIREIPKFCFGCV----GSTYREPI-GIAGFGRGALSVPSQLGFLQKG-FSHCFLAFK 171
IP + FGCV G T P G+ G GRG +++ SQ+G + G FS+C ++K
Sbjct: 186 ------IPNYAFGCVSAVSGPTANLPKQGLLGLGRGPMALLSQVGNMYNGVFSYCLPSYK 239
Query: 172 YANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSL 231
S L +G A +++TPMLK+P + YY+ + +++G + + +VP
Sbjct: 240 SYY---FSGSLRLG--AAGQPRGVRYTPMLKNPNRSSLYYVNVTGLSVGRAPV-KVPAGS 293
Query: 232 REFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVP 291
FD G +VDSGT T P Y+ L + + FD C+
Sbjct: 294 FAFDPATGAGTVVDSGTVITRWTPPVYAALREEFRRHVA---APSGYTSLGAFDTCFN-- 348
Query: 292 CPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPS 351
+ + P++T H + L LP N SA + + CL
Sbjct: 349 --TDEVAAGVAPAVTVHMDGGLDLALPMENTLIHSSA----TPLACLAMAEAPQNVNAVV 402
Query: 352 GVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
V + QQQN+ VV+D+ R+GF C
Sbjct: 403 NVLANLQQQNLRVVFDVANSRVGFARESC 431
>gi|255548662|ref|XP_002515387.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545331|gb|EEF46836.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 463
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 105/386 (27%), Positives = 156/386 (40%), Gaps = 75/386 (19%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + DTGSDLTW C F P++S+S + +C++ C ++
Sbjct: 147 LMLIFDTGSDLTWARC----------------SAAETFDPTKSTSYANVSCSTPLCSSVI 190
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
S+ C S C + YG+G G L ++ L + G
Sbjct: 191 SATGNPSRCAASTCV---------------YGIQYGDGSYSIGFLGKERLTI-----GST 230
Query: 122 REIPKFCFGC---VGSTYREPIGIAGFGRGALSVPSQLG-FLQKGFSHCFLAFKYANDPN 177
F FGC V + + G+ G GR LSV SQ + FS+C P+
Sbjct: 231 DIFNNFYFGCGQDVDGLFGKAAGLLGLGRDKLSVVSQTAPKYNQLFSYCL--------PS 282
Query: 178 ISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
SS + S + +FTP+ P ++Y + L IT+G L +PLS+
Sbjct: 283 SSSTGFL-SFGSSQSKSAKFTPLSSGP--SSFYNLDLTGITVGGQKLA-IPLSVFS---- 334
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTF 297
G ++DSGT T LP YS L S + + YP K + D CY + +
Sbjct: 335 -TAGTIIDSGTVVTRLPPAAYSALRSAFRKAMASYPMGKPLSI---LDTCYDF----SKY 386
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGP--SGVFG 355
P I F V + + Q F A N CL F G+ G + +FG
Sbjct: 387 KTIKVPKIVISFSGGVDVDVDQAGIFVA-----NGLKQVCLAFA----GNTGARDTAIFG 437
Query: 356 SFQQQNVEVVYDLEKERIGFQPMDCA 381
+ QQ+N EVVYD+ ++GF P C+
Sbjct: 438 NTQQRNFEVVYDVSGGKVGFAPASCS 463
>gi|242081367|ref|XP_002445452.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
gi|241941802|gb|EES14947.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
Length = 459
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 103/386 (26%), Positives = 159/386 (41%), Gaps = 54/386 (13%)
Query: 3 QVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHS 62
+V +D GSDL W C + +L F +RSSS S
Sbjct: 121 KVILDLGSDLLWTQCSLVGPTA--------KQLEPVFDAARSSSFS-------------- 158
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCC-RPCPSFAYTYGEGGL-VTGILTRDTLKVHGSSPGI 120
PC C T TC R C AY G + TG+L +T G+ G+
Sbjct: 159 ----VLPCDSKLCEAGTFTNKTCTDRKC---AYENDYGIMTATGVLATETF-TFGAHHGV 210
Query: 121 IREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISS 180
+ C T E GI G G LS+ QL + FS+C F +S
Sbjct: 211 SANLTFGCGKLANGTIAEASGILGLSPGPLSMLKQLAITK--FSYCLTPFADRK----TS 264
Query: 181 PLVIGDVAISSK----DNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
P++ G +A K +Q P+LK+P+ YYY+ + +++G+ L +VP
Sbjct: 265 PVMFGAMADLGKYKTTGKVQTIPLLKNPVEDIYYYVPMVGMSVGSKRL-DVPQETLAIKP 323
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNT 296
G GG ++DS TT +L EP +++L + I + V++ + +C+ +P +
Sbjct: 324 DGTGGTVLDSATTLAYLVEPAFTELKKAVMEGIKLPVANRSVDD---YPVCFELP-RGMS 379
Query: 297 FTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGS 356
P + HF + + LP+ N+F S AV F+ G V G+
Sbjct: 380 MEGVQVPPLVLHFDGDAEMSLPRDNYFQEPSPGMMCLAVMQAPFE-------GAPNVIGN 432
Query: 357 FQQQNVEVVYDLEKERIGFQPMDCAS 382
QQQN+ V+YD+ + + P C S
Sbjct: 433 VQQQNMHVLYDVGNRKFSYAPTKCDS 458
>gi|224067042|ref|XP_002302336.1| predicted protein [Populus trichocarpa]
gi|222844062|gb|EEE81609.1| predicted protein [Populus trichocarpa]
Length = 438
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 96/381 (25%), Positives = 158/381 (41%), Gaps = 60/381 (15%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DT +D WVPC C C + F P+ S++ C+ + C +
Sbjct: 113 MVLDTSNDAAWVPCSG----CTGCSS-------TTFLPNASTTLGSLDCSGAQCSQVRGF 161
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
C +G S+ C F +YG +T L +D + +
Sbjct: 162 S-----CPATG--------SSACL----FNQSYGGDSSLTATLVQDAITLAND------V 198
Query: 124 IPKFCFGCVGSTYR---EPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNIS 179
IP F FGC+ + P G+ G GRG +S+ SQ G + G FS+C +FK S
Sbjct: 199 IPGFTFGCINAVSGGSIPPQGLLGLGRGPISLISQAGAMYSGVFSYCLPSFKSYY---FS 255
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
L +G V +++ TP+L++P P+ YY+ L +++G + +P FD
Sbjct: 256 GSLKLGPVG--QPKSIRTTPLLRNPHRPSLYYVNLTGVSVGRIKV-PIPSEQLVFDPNTG 312
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTD 299
G ++DSGT T +P Y + + + + FD C+ +
Sbjct: 313 AGTIIDSGTVITRFVQPVYFAIRDEFRKQVN-----GPISSLGAFDTCFAAT------NE 361
Query: 300 DLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQ 359
P+IT HF ++LVLP N S+S ++ CL + + V + QQ
Sbjct: 362 AEAPAITLHF-EGLNLVLPMENSLIH----SSSGSLACLSMAAAPNNVNSVLNVIANLQQ 416
Query: 360 QNVEVVYDLEKERIGFQPMDC 380
QN+ +++D R+G C
Sbjct: 417 QNLRIMFDTTNSRLGIARELC 437
>gi|359482097|ref|XP_002271077.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 458
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 110/394 (27%), Positives = 169/394 (42%), Gaps = 59/394 (14%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + +DTGS+L+W+ C + + F P+RSSS S C+S C
Sbjct: 98 VSMVLDTGSELSWLRCN------------KTQTFQTTFDPNRSSSYSPVPCSSLTC---- 141
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
+D D + C + L + +Y + G L DT + S
Sbjct: 142 -TDRTRDFPIPASCDSNQLCHAIL---------SYADASSSEGNLASDTFYIGNS----- 186
Query: 122 REIPKFCFGCVGSTYR-------EPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYAN 174
++P FGC+ S++ + G+ G RG+LS SQ+ F + FS+C +
Sbjct: 187 -DMPGTIFGCMDSSFSTNTEEDSKNTGLMGMNRGSLSFVSQMDFPK--FSYCI------S 237
Query: 175 DPNISSPLVIGDVAISSKDNLQFTPMLK-SPMYPNY----YYIGLEAITIGNSSLTEVPL 229
D + S L++GD S L +TP+++ S P + Y + LE I + +S L +P
Sbjct: 238 DSDFSGVLLLGDANFSWLMPLNYTPLIQISTPLPYFDRVAYTVQLEGIKV-SSKLLPLPK 296
Query: 230 SLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEE---RTGFDL 286
S+ D G G +VDSGT +T L P YS L + + + R E + G DL
Sbjct: 297 SVFVPDHTGAGQTMVDSGTQFTFLLGPVYSALRNEFLNQTSQILRVLEDPNYVFQGGMDL 356
Query: 287 CYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDG 346
CYRVP + P+++ F V + S +V C F + D
Sbjct: 357 CYRVPLSQTSLP--WLPTVSLMFRGAEMKVSGDRLLYRVPGEVRGSDSVYCFTFGNSDLL 414
Query: 347 DYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
+ V G QQNV + +DLEK RIGF + C
Sbjct: 415 AV-EAYVIGHHHQQNVWMEFDLEKSRIGFAQVQC 447
>gi|449433371|ref|XP_004134471.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449495479|ref|XP_004159853.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 424
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 108/387 (27%), Positives = 175/387 (45%), Gaps = 61/387 (15%)
Query: 5 YMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSD 64
++DT + L WV C N + C + L + F S+S + + C S+FC +S
Sbjct: 91 FLDTSNGLIWVQCSNCNSQC----EPEKRGLTTKFLSSKSFTYEMEPCGSNFC----NSL 142
Query: 65 NPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREI 124
F C S C+ + YG+ +GIL+ D+ +S G++ ++
Sbjct: 143 TGFQTCNSS---------DKWCK----YRLVYGDNKATSGILSSDSFGFD-TSDGMLVDV 188
Query: 125 PKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISS 180
FGC + + G G + LS+ SQLG K FS+C + F N+ +S
Sbjct: 189 GFLNFGCSEAPLTGDEQSYTGNVGLNQTPLSLISQLGI--KKFSYCLVPF---NNLGSTS 243
Query: 181 PLVIGDVAISSKDNLQFTPMLKSPMYPN--YYYIGLEAITIGNSSLTEVPLSLREFDS-Q 237
+ G + ++S TP+L YPN YY+ + I+IGN + P FD +
Sbjct: 244 KMYFGSLPVTSGGQ---TPLL----YPNSDAYYVKVLGISIGN----DEPHFDGVFDVYE 292
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTF 297
G ++D+G TY+ L + LL+ + + R + +ER F+LC+ + N+
Sbjct: 293 VRDGWIIDTGITYSSLETDAFDSLLAKFLTLKDFPQRKDDPKER--FELCFELQNANDL- 349
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCL-LFQSMDDGDYGPSGVFGS 356
+ FP +T HF + L+L + F + + CL L +S P + G+
Sbjct: 350 --ESFPDVTVHF-DGADLILNVESTFVKIE----DDGIFCLALLRSG-----SPVSILGN 397
Query: 357 FQQQNVEVVYDLEKERIGFQPMDCAST 383
FQ QN V YDLE + I F P+DCA +
Sbjct: 398 FQLQNYHVGYDLEAQVISFAPVDCADS 424
>gi|224063191|ref|XP_002301033.1| predicted protein [Populus trichocarpa]
gi|222842759|gb|EEE80306.1| predicted protein [Populus trichocarpa]
Length = 536
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 113/409 (27%), Positives = 170/409 (41%), Gaps = 84/409 (20%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCD----DYRNNKL---MSNFSPSRSSSSSRDTCASSF 56
V +D GSDL WVPC DC+ C Y N L +S +SPS SS+S +C
Sbjct: 122 VALDAGSDLLWVPC-----DCIQCAPLSASYYNISLDRDLSEYSPSLSSTSRHLSCDHQL 176
Query: 57 CLNIHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGS 116
C + NP DPC F Y E G L D L +
Sbjct: 177 CEWGSNCKNPKDPCPY------------------IFNYDDFENTTSAGFLVEDKLHLASV 218
Query: 117 SPGIIREI--PKFCFGC---VGSTYRE---PIGIAGFGRGALSVPSQL---GFLQKGFSH 165
R++ GC G ++ + P G+ G G G +SVPS L G +Q FS
Sbjct: 219 GDHTARKMLQASVVLGCGRKQGGSFFDGAAPDGVMGLGPGDISVPSLLAKAGLIQNCFSL 278
Query: 166 CFLAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLT 225
CF D N S ++ GD +S+ + F P+ + + Y++G+E+ +GNS L
Sbjct: 279 CF-------DENDSGRILFGDRGHASQQSTPFLPIQGTYV---AYFVGVESYCVGNSCLK 328
Query: 226 EVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTG-F 284
LVDSG+++T+LP Y++L+S + AK + + G +
Sbjct: 329 RSGFK-----------ALVDSGSSFTYLPSEVYNELVSEFDKQVN----AKRISFQDGLW 373
Query: 285 DLCYRVPCPNNTFTDDL--FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSS-AVKCLLFQ 341
D CY N + +L P+I F N + V+ H S P + + CL Q
Sbjct: 374 DYCY------NASSQELHDIPAIQLKFPRNQNFVV----HNPTYSIPHHQGFTMFCLSLQ 423
Query: 342 SMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASAQGLH 390
D G G+ G +V+D+E ++G+ C T+ + +H
Sbjct: 424 PTD----GSYGIIGQNFMIGYRMVFDIENLKLGWSNSSCQDTSDSADVH 468
>gi|359485189|ref|XP_002279141.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 546
Score = 108 bits (270), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 111/389 (28%), Positives = 167/389 (42%), Gaps = 55/389 (14%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGSDL W+ C C +C + N ++ P +SSS C S C ++ SS +
Sbjct: 198 LDTGSDLNWIQC----VPCYEC--FEQNG--PHYDPGQSSSYRNIGCHDSRC-HLVSSPD 248
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVH---GSSPGIIR 122
P PC + CP + Y YG+ TG +T V+ S +R
Sbjct: 249 PPQPCKAEN------------QTCP-YYYWYGDSSNTTGDFALETFTVNLTMSSGKPELR 295
Query: 123 EIPKFCFGC---VGSTYREPIGIAGFGRGALSVPSQLGFLQ-KGFSHCFLAFKYANDPNI 178
+ FGC + G+ G GRG LS SQL L FS+C + +D N+
Sbjct: 296 RVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLV--DRNSDANV 353
Query: 179 SSPLVIG-DVAISSKDNLQFTPMLKSPMYP--NYYYIGLEAITIGNSSLTEVPLSLREFD 235
SS L+ G D + S L FT ++ P +YY+ +++I +G + +P +
Sbjct: 354 SSKLIFGEDKDLLSHPELNFTTLVAGKENPVDTFYYVQIKSIVVG-GEVVNIPEEKWQIA 412
Query: 236 SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNN 295
+ G+GG ++DSGTT ++ EP Y + + + YP K D PC N
Sbjct: 413 TDGSGGTIIDSGTTLSYFAEPAYQVIKEAFMAKVKGYPVVK--------DFPVLEPCYNV 464
Query: 296 TFTDDL-FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSG-- 352
T + P F + P N+F + V CL PS
Sbjct: 465 TGVEQPDLPDFGIVFSDGAVWNFPVENYFIEIEP----REVVCLAILGTP-----PSALS 515
Query: 353 VFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
+ G++QQQN ++YD +K R+GF P CA
Sbjct: 516 IIGNYQQQNFHILYDTKKSRLGFAPTKCA 544
>gi|356527089|ref|XP_003532146.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 488
Score = 108 bits (270), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 102/371 (27%), Positives = 161/371 (43%), Gaps = 57/371 (15%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + DTGSDLTW C + C D + F PS+S+S S TC S+ C +
Sbjct: 158 LSLIFDTGSDLTWTQCEPCARSCYKQQD-------AIFDPSKSTSYSNITCTSTLCTQLS 210
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
++ +P GCS ST + C + YG+ G +R+ L V +
Sbjct: 211 TATGN-EP----GCSAST-------KACI-YGIQYGDSSFSVGYFSRERLSVTATDI--- 254
Query: 122 REIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGFL-QKGFSHCFLAFKYANDPN 177
+ F FGC + + G+ G GR +S Q + +K FS+C P
Sbjct: 255 --VDNFLFGCGQNNQGLFGGSAGLIGLGRHPISFVQQTAAVYRKIFSYCL--------PA 304
Query: 178 ISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
SS ++ +++TP ++Y + + I++G + L P+S F +
Sbjct: 305 TSSSTGRLSFGTTTTSYVKYTPFSTISRGSSFYGLDITGISVGGAKL---PVSSSTFST- 360
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTF 297
GG ++DSGT T LP Y+ L S + ++ YP A E+ D CY + + +
Sbjct: 361 --GGAIIDSGTVITRLPPTAYTALRSAFRQGMSKYPSAGELSI---LDTCYDL----SGY 411
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSF 357
P I F F V++ LP Y SA CL F + +GD ++G+
Sbjct: 412 EVFSIPKIDFSFAGGVTVQLPPQGILYVASAKQ-----VCLAFAA--NGDDSDVTIYGNV 464
Query: 358 QQQNVEVVYDL 368
QQ+ +EVVYD+
Sbjct: 465 QQKTIEVVYDV 475
>gi|302774174|ref|XP_002970504.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
gi|300162020|gb|EFJ28634.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
Length = 483
Score = 108 bits (270), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 115/392 (29%), Positives = 169/392 (43%), Gaps = 68/392 (17%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFC--LNIH 61
+ +DTGSDL W+ C C C Y+ + F P SSS R C S C L IH
Sbjct: 144 MVVDTGSDLPWLQCQ----PCKSC--YKQADPI--FDPRNSSSFQRIPCLSPLCKALEIH 195
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
S CS S S C S+ YG+G G + D + S +
Sbjct: 196 S------------CSGSRGATSRC-----SYQVAYGDGSFSVGDFSSDLFTLGTGSKAM- 237
Query: 122 REIPKFCFGC---VGSTYREPIGIAGFGRGALSVPSQL------GFLQKGFSHCFLAFKY 172
FGC + G+ G G G LS PSQ+ FS+C L +
Sbjct: 238 ----SVAFGCGFDNEGLFAGAAGLLGLGAGKLSFPSQIFASSTNSSTANSFSYC-LVDRS 292
Query: 173 ANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLR 232
SS L+ G AI S L +P+LK+P +YY + +++G + L P+SL+
Sbjct: 293 NPMTRSSSSLIFGAAAIPSTAAL--SPLLKNPKLDTFYYAAMIGVSVGGAQL---PISLK 347
Query: 233 --EFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRV 290
+ G+GG+++DSGT+ T P Y+ + ++ T P A + FD CY
Sbjct: 348 SLQLSQSGSGGVIIDSGTSVTRFPTSVYATIRDAFRNATTNLPSAPRY---SLFDTCYNF 404
Query: 291 PCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQ--SMDDGDY 348
+ + D+ P++ HF N L LP N+ P N++ CL F SM+
Sbjct: 405 ---SGKASVDV-PALVLHFENGADLQLPPTNYLI----PINTAGSFCLAFAPTSME---- 452
Query: 349 GPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
G+ G+ QQQ+ + +DL+K + F P C
Sbjct: 453 --LGIIGNIQQQSFRIGFDLQKSHLAFAPQQC 482
>gi|356567196|ref|XP_003551807.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 490
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 109/375 (29%), Positives = 168/375 (44%), Gaps = 64/375 (17%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + DTGSDLTW C + C Y+ ++ F PS+S+S S TC S+ C +
Sbjct: 159 LSLIFDTGSDLTWTQCEPCARSC-----YKQQDVI--FDPSKSTSYSNITCTSALCTQLS 211
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
++ DP GCS ST + C + YG+ G +R+ L V +
Sbjct: 212 TATGN-DP----GCSAST-------KACI-YGIQYGDSSFSVGYFSRERLTVTATDV--- 255
Query: 122 REIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGF-LQKGFSHCFLAFKYANDPN 177
+ F FGC + + G+ G GR +S Q +K FS+C P+
Sbjct: 256 --VDNFLFGCGQNNQGLFGGSAGLIGLGRHPISFVQQTAAKYRKIFSYCL--------PS 305
Query: 178 ISSP---LVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREF 234
SS L G A + L++TP ++Y + + AI +G ++P+S F
Sbjct: 306 TSSSTGHLSFGPAA--TGRYLKYTPFSTISRGSSFYGLDITAIAVGG---VKLPVSSSTF 360
Query: 235 DSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPN 294
+ GG ++DSGT T LP Y L S + ++ YP A E+ D CY +
Sbjct: 361 ST---GGAIIDSGTVITRLPPTAYGALRSAFRQGMSKYPSAGELSI---LDTCYDL---- 410
Query: 295 NTFTDDLFPSITFHFLNNVSLVL-PQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGV 353
+ + P+I F F V++ L PQG F A S+ CL F + +GD +
Sbjct: 411 SGYKVFSIPTIEFSFAGGVTVKLPPQGILFVA------STKQVCLAFAA--NGDDSDVTI 462
Query: 354 FGSFQQQNVEVVYDL 368
+G+ QQ+ +EVVYD+
Sbjct: 463 YGNVQQRTIEVVYDV 477
>gi|297811185|ref|XP_002873476.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
lyrata]
gi|297319313|gb|EFH49735.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
lyrata]
Length = 475
Score = 108 bits (269), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 104/384 (27%), Positives = 157/384 (40%), Gaps = 58/384 (15%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + DTGSDLTW C C D + F+PS+S+S +C+S+ C ++
Sbjct: 146 LSLIFDTGSDLTWTQCQPCVRTCYDQKE-------PIFNPSKSTSYYNVSCSSAACGSLS 198
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
S+ C+ S C + YG+ G L +D + S
Sbjct: 199 SATGNAGSCSASNCI---------------YGIQYGDQSFSVGFLAKDKFTLTSSDV--- 240
Query: 122 REIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGFL-QKGFSHCFLAFKYANDPN 177
FGC + + G+ G GR LS PSQ K FS+C + +
Sbjct: 241 --FDGVYFGCGENNQGLFTGVAGLLGLGRDKLSFPSQTATAYNKIFSYCL-----PSSAS 293
Query: 178 ISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
+ L G IS +++FTP+ ++Y + + AIT+G L P+ F +
Sbjct: 294 YTGHLTFGSAGISR--SVKFTPISTITDGTSFYGLNIVAITVGGQKL---PIPSTVFSTP 348
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTF 297
G L+DSGT T LP Y+ L S ++ ++ YP V D C+ + + F
Sbjct: 349 G---ALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSI---LDTCFDL----SGF 398
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSF 357
P + F F + L FYA CL F + D + +FG+
Sbjct: 399 KTVTIPKVAFSFSGGAVVELGSKGIFYAFKISQ-----VCLAFAG--NSDDSNAAIFGNV 451
Query: 358 QQQNVEVVYDLEKERIGFQPMDCA 381
QQQ +EVVYD R+GF P C+
Sbjct: 452 QQQTLEVVYDGAGGRVGFAPNGCS 475
>gi|242041575|ref|XP_002468182.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
gi|241922036|gb|EER95180.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
Length = 490
Score = 108 bits (269), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 107/382 (28%), Positives = 162/382 (42%), Gaps = 53/382 (13%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DT SDLTW+ C C C Y + + F P R S+S R+ ++ +++D
Sbjct: 155 LDTASDLTWLQCQ----PCRRC--YPQSGPV--FDP-RHSTSYRE-------MSFNAAD- 197
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
C G S K C + YG+G G +TL G +P
Sbjct: 198 ----CQALGRSGGGDAKRGTC----VYTVGYGDGSTTVGDFIEETLTFAGGV-----RLP 244
Query: 126 KFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSP 181
+ GC G GI G GRG +S P+Q+ FS+C + F + ++SS
Sbjct: 245 RISIGCGHDNKGLFGAPAAGILGLGRGLMSFPNQIDH-NGTFSYCLVDF-LSGPGSLSST 302
Query: 182 LVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGN---SSLTEVPLSLREFDSQG 238
L G A+ + + FTP + + P +YY+ L I++G +TE L L + G
Sbjct: 303 LTFGAGAVDTSPPVSFTPTVLNLNMPTFYYVRLTGISVGGVRVPGVTERDLQLDPY--TG 360
Query: 239 NGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFT 298
GG++VDSGT T L P Y+ ++ + FD CY V
Sbjct: 361 RGGVIVDSGTAVTRLARPAYTAFRDAFRAVAVDLGQVSIGGPSGFFDTCYTV----GGRG 416
Query: 299 DDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQ 358
P+++ HF +V + L N+ P +S C F + GD+ S + G+ Q
Sbjct: 417 MKKVPTVSMHFAGSVEVKLQPKNYLI----PVDSMGTVCFAFAAT--GDHSVS-IIGNIQ 469
Query: 359 QQNVEVVYDLEKERIGFQPMDC 380
QQ +VYD+ R+GF P C
Sbjct: 470 QQGFRIVYDI-GGRVGFAPNSC 490
>gi|413947545|gb|AFW80194.1| hypothetical protein ZEAMMB73_386053 [Zea mays]
Length = 456
Score = 108 bits (269), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 109/388 (28%), Positives = 168/388 (43%), Gaps = 58/388 (14%)
Query: 7 DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDNP 66
DTGSDL WV C + D F PSRS++ S +C S+ C + +
Sbjct: 118 DTGSDLVWVNCSSNGGGGGASDG------AVVFHPSRSTTYSLLSCQSAACQALSQASCD 171
Query: 67 FDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREI-- 124
D S C + Y YG+G G+L+ +T + G ++
Sbjct: 172 AD--------------SEC-----QYQYAYGDGSRTIGVLSTETFSFAAAGGGGEGQVRV 212
Query: 125 PKFCFGC-VGS--TYREPIGIAGFGRGALSVPSQLGF---LQKGFSHCFLAFKYANDPNI 178
P+ FGC GS ++R G+ G G GALS+ SQLG + + FS+C + YA N
Sbjct: 213 PRVSFGCSTGSAGSFRSD-GLVGLGAGALSLVSQLGAAARIARRFSYCLVP-PYAA-ANS 269
Query: 179 SSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQG 238
SS L G A+ S TP++ S + +YY + LE++ + ++ S
Sbjct: 270 SSTLSFGARAVVSDPGAASTPLVPSEV-DSYYTVALESVAVAG----------QDVASAN 318
Query: 239 NGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFT 298
+ ++VDSGTT T L L++ L+ I PRA+ E+ LCY V +
Sbjct: 319 SSRIIVDSGTTLTFLDPALLRPLVAELERRI-RLPRAQPPEQL--LQLCYDVQGKSQA-E 374
Query: 299 DDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQ 358
D P +T F S+ L N F + CL+ + + P + G+
Sbjct: 375 DFGIPDVTLRFGGGASVTLRPENTFSLLE-----EGTLCLVLVPVSESQ--PVSILGNIA 427
Query: 359 QQNVEVVYDLEKERIGFQPMDCASTASA 386
QQN V YDL+ + F +DC ++++
Sbjct: 428 QQNFHVGYDLDARTVTFAAVDCTRSSAS 455
>gi|225430555|ref|XP_002285593.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 481
Score = 108 bits (269), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 109/381 (28%), Positives = 161/381 (42%), Gaps = 63/381 (16%)
Query: 7 DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDNP 66
DTGSDLTW C C Y ++ F+PS+S+S + +C+S C + S
Sbjct: 156 DTGSDLTWT-------QCEPCARYCYHQQEPIFNPSKSTSYTNISCSSPTCDELKSGTGN 208
Query: 67 FDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIPK 126
C+ S C + YG+ G +D L + +
Sbjct: 209 SPSCSASTCV---------------YGIQYGDQSYSVGFFAQDKLALTSTD-----VFNN 248
Query: 127 FCFGCVGSTYREPIGIAGF---GRGALSVPSQLG-FLQKGFSHCFLAFKYANDPNISSP- 181
F FGC + +G+AG GR ALS+ SQ K FS+C P+ SS
Sbjct: 249 FLFGCGQNNRGLFVGVAGLIGLGRNALSLVSQTAQKYGKLFSYCL--------PSTSSST 300
Query: 182 --LVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
L G +SK ++FTP L + P++Y++ L AI++G L+ S F + G
Sbjct: 301 GYLTFGSGGGTSK-AVKFTPSLVNSQGPSFYFLNLIAISVGGRKLST---SASVFSTAGT 356
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTD 299
++DSGT + LP YS L + Q ++ YP+A D CY + +
Sbjct: 357 ---IIDSGTVISRLPPTAYSDLRASFQQQMSKYPKAAPASI---LDTCYDF----SQYDT 406
Query: 300 DLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQ 359
P I +F + + L FY + N S V CL F + D + G+ QQ
Sbjct: 407 VDVPKINLYFSDGAEMDLDPSGIFYIL----NISQV-CLAFAG--NSDATDIAILGNVQQ 459
Query: 360 QNVEVVYDLEKERIGFQPMDC 380
+ +VVYD+ RIGF P C
Sbjct: 460 KTFDVVYDVAGGRIGFAPGGC 480
>gi|125555056|gb|EAZ00662.1| hypothetical protein OsI_22683 [Oryza sativa Indica Group]
Length = 491
Score = 107 bits (268), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 116/393 (29%), Positives = 167/393 (42%), Gaps = 81/393 (20%)
Query: 4 VYMDTGSDLTWV---PCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCL-- 58
+ DTGSDL+WV PCG+ S C D F PS+SS+ + C C
Sbjct: 164 LIFDTGSDLSWVQCQPCGS-SGHCHPQQD-------PLFDPSKSSTYAAVHCGEPQCAAA 215
Query: 59 -NIHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSS 117
+ S DN +TC + YG+G TG+L+RDTL + S
Sbjct: 216 GGLCSEDN-----------------TTCL-----YLVHYGDGSSTTGVLSRDTLALTSS- 252
Query: 118 PGIIREIPKFCFGCVGSTYREPIGIAGFGR---------GALSVPSQLGF-LQKGFSHCF 167
R + F FGC G+ + FGR G LS+PSQ FS+C
Sbjct: 253 ----RALAGFPFGC-GTR-----NLGDFGRVDGLLGLGRGELSLPSQAAASFGAVFSYCL 302
Query: 168 LAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEV 227
+ + + L IG + Q+T ML+ P +P++Y++ L +I IG L
Sbjct: 303 -----PSSNSTTGYLTIGATPATDTGAAQYTAMLRKPQFPSFYFVELVSIDIGGYILPVP 357
Query: 228 PLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLC 287
P GG L+DSGT T+LP Y L + T+ Y A + D C
Sbjct: 358 PAVFTR------GGTLLDSGTVLTYLPAQAYELLRDRFRLTMERYTPAPPNDV---LDAC 408
Query: 288 YRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGD 347
Y + + P+++F F + L + F M + V CL F +MD G
Sbjct: 409 YDFAGESEV----IVPAVSFRFGDGAVFEL---DFFGVMIFLDEN--VGCLAFAAMDAGG 459
Query: 348 YGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
P + G+ QQ++ EV+YD+ E+IGF P C
Sbjct: 460 L-PLSIIGNTQQRSAEVIYDVAAEKIGFVPASC 491
>gi|357142295|ref|XP_003572524.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 494
Score = 107 bits (268), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 117/401 (29%), Positives = 186/401 (46%), Gaps = 74/401 (18%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDY-RNNKL---MSNFSPSRSSSSSRDTCASSFCLN 59
V +DTGSD+ WV +C+ CD R + L ++ + P+ S+SS TC FC
Sbjct: 104 VQVDTGSDILWV-------NCISCDSCPRKSGLGIDLTLYDPTASASSKTVTCGQEFCAT 156
Query: 60 IHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLK---VHGS 116
+ P C+ ++ PC ++ TYG+G TG D L+ V G
Sbjct: 157 ATNGGVP------PSCAANS--------PC-QYSITYGDGSSTTGFFVADFLQYDQVSGD 201
Query: 117 SPGIIREIPKFCFGC-------VGSTYREPIGIAGFGRGALSVPSQL---GFLQKGFSHC 166
+ FGC +GS+ GI GFG+ S+ SQL G + K FSHC
Sbjct: 202 GQTNLAN-ASVTFGCGAKIGGALGSSNVALDGILGFGQANSSMLSQLTSAGKVTKIFSHC 260
Query: 167 FLAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTE 226
+ N IG+V + ++ TP++ P P+Y + L+ I +G S+L +
Sbjct: 261 L------DTVNGGGIFAIGNVV---QPKVKTTPLV--PGMPHYNVV-LKTIDVGGSTL-Q 307
Query: 227 VPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRA--KEVEERTGF 284
+P ++ + G+ G ++DSGTT +LPE Y +LS + S +P K V++
Sbjct: 308 LPTNIFDIGG-GSRGTIIDSGTTLAYLPEVVYKAVLSAVFSN---HPDVTLKNVQDF--- 360
Query: 285 DLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQS-- 342
LC++ D+ FP +TFHF ++ LV+ ++ + N+ V C+ FQS
Sbjct: 361 -LCFQYSGS----VDNGFPEVTFHFDGDLPLVVYPHDYLF-----QNTEDVYCVGFQSGG 410
Query: 343 MDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCAST 383
+ D + G N VVYDLE + IG+ +C+S+
Sbjct: 411 VQSKDGKDMVLLGDLALSNKLVVYDLENQVIGWTNYNCSSS 451
>gi|125543639|gb|EAY89778.1| hypothetical protein OsI_11320 [Oryza sativa Indica Group]
Length = 488
Score = 107 bits (268), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 87/293 (29%), Positives = 132/293 (45%), Gaps = 27/293 (9%)
Query: 92 FAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIPKFCFGC----VGSTYREPIGIAGFGR 147
+ Y Y + + TG+L D G +P FGC G GIAGFGR
Sbjct: 216 YTYYYNDKSVTTGLLEVDKFTF-----GAGASVPGVAFGCGLFNNGVFKSNETGIAGFGR 270
Query: 148 GALSVPSQLGFLQKGFSHCFLAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYP 207
G LS+PSQL FSHCF A + L + D+ + + +Q TP++++ P
Sbjct: 271 GPLSLPSQLKV--GNFSHCFTAVNGLKQSTVLLDL-LADLYKNGRGAVQSTPLIQNSANP 327
Query: 208 NYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQS 267
YY+ L+ IT+G++ L VP S + G GG ++DSGT+ T LP Y + +
Sbjct: 328 TLYYLSLKGITVGSTRL-PVPESAFAL-TNGTGGTIIDSGTSITSLPPQVYQVVRDEFAA 385
Query: 268 TITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMS 327
I P TG C+ P + P + HF ++ LP+ N+ + +
Sbjct: 386 QIK-LPVVP--GNATGPYTCFSAP----SQAKPDVPKLVLHF-EGATMDLPRENYVFEVP 437
Query: 328 APSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
+ +S + CL + D G+FQQQN+ V+YDL+ + F C
Sbjct: 438 DDAGNSMI-CLAINELGD----ERATIGNFQQQNMHVLYDLQNNMLSFVAAQC 485
Score = 41.6 bits (96), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 41/153 (26%), Positives = 68/153 (44%), Gaps = 16/153 (10%)
Query: 213 GLEAITIGNSSLTEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYY 272
G IT+G++ L VP S + G GG ++DSGT+ T LP Y + + I
Sbjct: 38 GRPGITVGSTRL-PVPESAFAL-TNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIK-L 94
Query: 273 PRAKEVEERTGFDLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNS 332
P TG C+ P + P + HF ++ LP+ N+ + + + +
Sbjct: 95 PVVP--GNATGPYTCFSAP----SQAKPDVPKLVLHF-EGATMDLPRENYVFEVPDDAGN 147
Query: 333 SAVKCLLFQSMDDGDYGPSGVFGSFQQQNVEVV 365
S + CL D+ + + G+FQQQN+ +
Sbjct: 148 SII-CLAINKGDE-----TTIIGNFQQQNMHAL 174
>gi|125592062|gb|EAZ32412.1| hypothetical protein OsJ_16623 [Oryza sativa Japonica Group]
Length = 473
Score = 107 bits (268), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 102/378 (26%), Positives = 162/378 (42%), Gaps = 54/378 (14%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+D+GSD+ WV C C C Y + F P+ SSS S +C S+ C
Sbjct: 147 VDSGSDVIWVQC----RPCEQC--YAQTDPL--FDPAASSSFSGVSCGSAICR------- 191
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
T+SG + C ++ TYG+G G L +TL + G++ ++ +
Sbjct: 192 -----TLSGTGCGGGGDAGKC----DYSVTYGDGSYTKGELALETLTLGGTA---VQGVA 239
Query: 126 KFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNISSPLVI 184
C + G+ G G GA+S+ QLG G FS+C LA + A + LV+
Sbjct: 240 IGCGHRNSGLFVGAAGLLGLGWGAMSLVGQLGGAAGGVFSYC-LASRGAGG---AGSLVL 295
Query: 185 GDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPL--SLREFDSQGNGGL 242
G + + P++++ ++YY+GL I +G L PL SL + G GG+
Sbjct: 296 GRTEAVPVGAV-WVPLVRNNQASSFYYVGLTGIGVGGERL---PLQDSLFQLTEDGAGGV 351
Query: 243 LVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDLF 302
++D+GT T LP Y+ L + PR+ V D CY + + +
Sbjct: 352 VMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSL---LDTCYDL----SGYASVRV 404
Query: 303 PSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQNV 362
P+++F+F L LP N + AV CL F G + G+ QQ+ +
Sbjct: 405 PTVSFYFDQGAVLTLPARNLLVEVGG-----AVFCLAFAPSSSG----ISILGNIQQEGI 455
Query: 363 EVVYDLEKERIGFQPMDC 380
++ D +GF P C
Sbjct: 456 QITVDSANGYVGFGPNTC 473
>gi|242092878|ref|XP_002436929.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
gi|241915152|gb|EER88296.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
Length = 505
Score = 107 bits (268), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 111/384 (28%), Positives = 171/384 (44%), Gaps = 65/384 (16%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGSD++W+ C S C Y+ + + F P++S++ S C C
Sbjct: 178 IDTGSDVSWIQCLPCSGHC-----YKQHDPV--FDPTKSATYSAVPCGHPQCAAAGGK-- 228
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
C+ SG L + TYG+G G+L+ +TL + + R++P
Sbjct: 229 ----CSNSGTCL--------------YKVTYGDGSSTAGVLSHETLSLSST-----RDLP 265
Query: 126 KFCFGCVGSTYRE---PIGIAGFGRGALSVPSQLGF-LQKGFSHCFLAFKYANDPNISSP 181
F FGC + E G+ G GRGALS+PSQ FS+C ++ +
Sbjct: 266 GFAFGCGQTNLGEFGGVDGLVGLGRGALSLPSQAAATFGATFSYCLPSYDTTH-----GY 320
Query: 182 LVIGD---VAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQG 238
L +G A + D++Q+T M++ YP+ Y++ + +I IG L VP ++ D
Sbjct: 321 LTMGSTTPAASNDDDDVQYTAMIQKEDYPSLYFVEVVSIDIGGYIL-PVPPTVFTRD--- 376
Query: 239 NGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFT 298
G L DSGT T+LP Y+ L + T+T Y A + FD CY N F
Sbjct: 377 --GTLFDSGTILTYLPPEAYASLRDRFKFTMTQYKPAPAYDP---FDTCYDFTGHNAIF- 430
Query: 299 DDLFPSITFHFLNNVSLVL-PQGNHFYAMSAPSNSS-AVKCLLFQSMDDGDYGPSGVFGS 356
P++ F F + L P Y P +++ A CL F + P + G+
Sbjct: 431 ---MPAVAFKFSDGAVFDLSPVAILIY----PDDTAPATGCLAF--VPRPSTMPFNIIGN 481
Query: 357 FQQQNVEVVYDLEKERIGFQPMDC 380
QQ+ EV+YD+ E+IGF C
Sbjct: 482 TQQRGTEVIYDVAAEKIGFGQFTC 505
>gi|357450863|ref|XP_003595708.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355484756|gb|AES65959.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 407
Score = 107 bits (268), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 110/393 (27%), Positives = 173/393 (44%), Gaps = 71/393 (18%)
Query: 2 IQVYM--DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLN 59
I++Y DTGSDL W C C C Y+ M F P SSS + TC + C
Sbjct: 71 IKIYAEADTGSDLVWFQC----IPCTKC--YKQQNPM--FDPRSSSSYTNITCGTESCNK 122
Query: 60 IHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPG 119
+ SS L + + TC ++ Y+Y + + G+L ++TL + S+ G
Sbjct: 123 LDSS-------------LCSTDQKTC-----NYTYSYADNSITQGVLAQETLTLT-STTG 163
Query: 120 IIREIPKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGF-LQKG---FSHCFLAFK 171
FGC G RE +G+ G GRG LS+ SQ+G L G FS C + F
Sbjct: 164 EPVAFQGIIFGCGHNNSGFNDRE-MGLIGLGRGPLSLISQIGSSLGAGGNMFSQCLVPFN 222
Query: 172 YANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSL 231
DP+I+S + G + + TP++ Y+ L I S+ ++ L
Sbjct: 223 --TDPSITSQMNFGKGSEVLGNGTVSTPLISKD--GTGYFATLLGI-----SVEDINLPF 273
Query: 232 REFDSQG---NGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCY 288
S G G +L+DSGTT T+LPE FY +L+ +++ + P + G++LCY
Sbjct: 274 SNGSSLGTITKGNILIDSGTTITYLPEEFYHRLIEQVRNKVALEPFRID-----GYELCY 328
Query: 289 RVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDY 348
+ P N P++T HF L+ P M P +F + ++
Sbjct: 329 QTPTNLNG------PTLTIHFEGGDVLLTPA-----QMFIPVQDDNFCFAVFDTNEE--- 374
Query: 349 GPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
+G++ Q N + +DLE++ + F+ DC
Sbjct: 375 --YVTYGNYAQSNYLIGFDLERQVVSFKATDCT 405
>gi|242092898|ref|XP_002436939.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
gi|241915162|gb|EER88306.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
Length = 469
Score = 107 bits (268), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 107/382 (28%), Positives = 163/382 (42%), Gaps = 67/382 (17%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
MDTGSD++WV C C Y + F PS+SS+ + C + C + D+
Sbjct: 148 MDTGSDVSWVQC----TPCNSTKCYPQKDPL--FDPSKSSTYAPIACNTDACRKL--GDH 199
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
+ CT G T C ++ Y +G G+ + +TL + +PGI E
Sbjct: 200 YHNGCTSGG---------TQC----GYSVEYADGSHSRGVYSNETLTL---APGITVE-- 241
Query: 126 KFCFGCVGSTYREPI----GIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNISS 180
F FGC G R P G+ G G +S+ Q + G FS+C A + +
Sbjct: 242 DFHFGC-GRDQRGPSDKYDGLLGLGGAPVSLVVQTSSVYGGAFSYCLPALN-----SEAG 295
Query: 181 PLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNG 240
LV+G +K FTPM P Y +Y + + I++G L +P S G
Sbjct: 296 FLVLGSPPSGNKSAFVFTPMRHLPGYATFYMVTMTGISVGGKPL-HIP------QSAFRG 348
Query: 241 GLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDD 300
G+++DSGT T LPE Y+ L + L+ + YP + FD CY +++
Sbjct: 349 GMIIDSGTVDTELPETAYNALEAALRKALKAYPLVPSDD----FDTCYNF----TGYSNI 400
Query: 301 LFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSM--DDGDYGPSGVFGSFQ 358
P + F F ++ L P+ CL FQ DDG G+ G+
Sbjct: 401 TVPRVAFTFSGGATIDL---------DVPNGILVNDCLAFQESGPDDG----LGIIGNVN 447
Query: 359 QQNVEVVYDLEKERIGFQPMDC 380
Q+ +EV+YD + +GF+ C
Sbjct: 448 QRTLEVLYDAGRGNVGFRAGAC 469
>gi|218192707|gb|EEC75134.1| hypothetical protein OsI_11325 [Oryza sativa Indica Group]
Length = 401
Score = 107 bits (268), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 103/350 (29%), Positives = 148/350 (42%), Gaps = 75/350 (21%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFC--LN 59
+Q+ +DTGSDL W C C C D + + F PS SS+ S +C S+ C L
Sbjct: 95 VQLTLDTGSDLIWTQCQ----PCPACFD----QALPYFDPSTSSTLSLTSCDSTLCQGLP 146
Query: 60 IHSSDNP-FDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSP 118
+ S +P F P TC + Y+YG+ + TG L D G+
Sbjct: 147 VASCGSPKFWP------------NQTCV-----YTYSYGDKSVTTGFLEVDKFTFVGAG- 188
Query: 119 GIIREIPKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYAN 174
+P FGC G GIAGFGRG LS+PSQL FSHCF A
Sbjct: 189 ---ASVPGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKV--GNFSHCFTAVN-GL 242
Query: 175 DPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREF 234
P+ + D+ S + +Q TP++++P P +YY+ L+ IT+G+ T +P+ EF
Sbjct: 243 KPSTVLLDLPADLYKSGRGAVQSTPLIQNPANPTFYYLSLKGITVGS---TRLPVPESEF 299
Query: 235 D-SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCP 293
G GG ++DSGT T LP Y + R F ++P
Sbjct: 300 ALKNGTGGTIIDSGTAMTSLPTRVYRLV-------------------RDAFAAQVKLPVV 340
Query: 294 NNTFTDDLF------------PSITFHFLNNVSLVLPQGNHFYAMSAPSN 331
+ TD F P + HF ++ LP+ N+ + P
Sbjct: 341 SGNTTDPYFCLSAPLRAKPYVPKLVLHF-EGATMDLPRENYVWLKHYPKR 389
>gi|242057809|ref|XP_002458050.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
gi|241930025|gb|EES03170.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
Length = 489
Score = 107 bits (268), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 109/404 (26%), Positives = 184/404 (45%), Gaps = 74/404 (18%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKL-MSNFSPSRSSSSSRDTCASSFCLNIHS 62
V +DTGSD+ WV C C C L ++ + P SSS S +C FC +
Sbjct: 99 VQVDTGSDILWVNC----ISCEKCPRKSGLGLDLTFYDPKASSSGSTVSCDQGFCAATYG 154
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLK---VHG---S 116
P GC+ + PC ++ YG+G TG D L+ V G +
Sbjct: 155 GKLP-------GCTANV--------PC-EYSVMYGDGSSTTGFFVTDALQFDQVTGDGQT 198
Query: 117 SPGIIREIPKFCFGC-------VGSTYREPIGIAGFGRGALSVPSQL---GFLQKGFSHC 166
PG FGC +GS+ + GI GFG+ S+ SQL G ++K F+HC
Sbjct: 199 QPGN----ATVTFGCGAQQGGDLGSSNQALDGILGFGQANTSMLSQLAAAGKVKKIFAHC 254
Query: 167 FLAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTE 226
K IG+V + ++ TP++ P +Y + L++I +G ++L
Sbjct: 255 LDTIKGG------GIFAIGNVV---QPKVKTTPLVAD--MP-HYNVNLKSIDVGGTTLQ- 301
Query: 227 VPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFD- 285
L F++ G ++DSGTT T+LPE + ++++ + + + +++ D
Sbjct: 302 --LPAHVFETGERKGTIIDSGTTLTYLPELVFKEVMAAI------FNKHQDIVFHNVQDF 353
Query: 286 LCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQ--SM 343
+C++ P DD FP+ITFHF ++++L + +F+ N + + C+ FQ ++
Sbjct: 354 MCFQYPGS----VDDGFPTITFHFEDDLALHVYPHEYFFP-----NGNDMYCVGFQNGAL 404
Query: 344 DDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASAQ 387
D + G N V+YDLE + IG+ +C+S+ +
Sbjct: 405 QSKDGKDIVLMGDLVLSNKLVIYDLENQVIGWTDYNCSSSIKIE 448
>gi|242084330|ref|XP_002442590.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
gi|241943283|gb|EES16428.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
Length = 494
Score = 107 bits (268), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 107/384 (27%), Positives = 153/384 (39%), Gaps = 49/384 (12%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DT SDLTW+ C C C Y + + F P S+S +N + D
Sbjct: 151 LDTASDLTWLQC----QPCRRC--YPQSGPV--FDPRHSTSYGE--------MNYDAPD- 193
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
C G S K C + +G G L +TL G +R+
Sbjct: 194 ----CQALGRSGGGDAKRGTCIYTVQYGDGHGSTSTSVGDLVEETLTFAGG----VRQA- 244
Query: 126 KFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFL--QKGFSHCFLAFKYANDPNIS 179
GC G GI G GRG +S+P Q+ FL FS+C + F + + S
Sbjct: 245 YLSIGCGHDNKGLFGAPAAGILGLGRGQISIPHQIAFLGYNASFSYCLVDF-ISGPGSPS 303
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGN---SSLTEVPLSLREFDS 236
S L G A+ + FTP + + P +YY+ L +++G +TE L L +
Sbjct: 304 STLTFGAGAVDTSPPASFTPTVLNQNMPTFYYVRLIGVSVGGVRVPGVTERDLQLDPY-- 361
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNT 296
G GG+++DSGTT T L P Y ++ T + FD CY V
Sbjct: 362 TGRGGVILDSGTTVTRLARPAYVAFRDAFRAAATSLGQVSTGGPSGLFDTCYTV----GG 417
Query: 297 FTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGS 356
P+++ HF V + L N+ P +S C F D V G+
Sbjct: 418 RAGVKVPAVSMHFAGGVEVSLQPKNYLI----PVDSRGTVCFAFAGTGDRSVS---VIGN 470
Query: 357 FQQQNVEVVYDLEKERIGFQPMDC 380
QQ VVYDL +R+GF P +C
Sbjct: 471 ILQQGFRVVYDLAGQRVGFAPNNC 494
>gi|356527091|ref|XP_003532147.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 482
Score = 107 bits (267), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 109/383 (28%), Positives = 164/383 (42%), Gaps = 58/383 (15%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + DTGS LTW C + C D F PS+SSS + C SS C
Sbjct: 153 LSLIFDTGSYLTWTQCEPCAGSCYKQQD-------PIFDPSKSSSYTNIKCTSSLCTQFR 205
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
S+ GCS ST ++C + YG+ + G L+++ L + ++ I+
Sbjct: 206 SA----------GCSSST--DASCI-----YDVKYGDNSISRGFLSQERLTI--TATDIV 246
Query: 122 REIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGFL-QKGFSHCFLAFKYANDPN 177
+ F FGC +R G+ G R +S Q + K FS+C + P+
Sbjct: 247 HD---FLFGCGQDNEGLFRGTAGLMGLSRHPISFVQQTSSIYNKIFSYCL-----PSTPS 298
Query: 178 ISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
L G A ++ NL++TP ++Y + + I++G + L V S F +
Sbjct: 299 SLGHLTFGASA-ATNANLKYTPFSTISGENSFYGLDIVGISVGGTKLPAVSSS--TFSA- 354
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTF 297
GG ++DSGT T LP Y+ L S + + YP A D CY + +
Sbjct: 355 --GGSIIDSGTVITRLPPTAYAALRSAFRQFMMKYPVAYGTRL---LDTCYDF----SGY 405
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSF 357
+ P I F F V + LP Y SA CL F + +G+ +FG+
Sbjct: 406 KEISVPRIDFEFAGGVKVELPLVGILYGESAQQ-----LCLAFAANGNGN--DITIFGNV 458
Query: 358 QQQNVEVVYDLEKERIGFQPMDC 380
QQ+ +EVVYD+E RIGF C
Sbjct: 459 QQKTLEVVYDVEGGRIGFGAAGC 481
>gi|356569424|ref|XP_003552901.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 536
Score = 107 bits (267), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 105/393 (26%), Positives = 171/393 (43%), Gaps = 50/393 (12%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + +DTGSDL+W+ C C DC + N +++P+ SSS +C C +
Sbjct: 183 VWLILDTGSDLSWIQCD----PCYDC--FEQNG--PHYNPNESSSYRNISCYDPRC-QLV 233
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPG-- 119
SS +P C T + CP F Y Y +G TG +T V+ + P
Sbjct: 234 SSPDPLQHC------------KTENQTCPYF-YDYADGSNTTGDFALETFTVNLTWPNGK 280
Query: 120 -IIREIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQL-GFLQKGFSHCFLAFKYAN 174
+ + FGC + G+ G GRG LS PSQL FS+C +
Sbjct: 281 EKFKHVVDVMFGCGHWNKGFFHGAGGLLGLGRGPLSFPSQLQSIYGHSFSYCLTDL--FS 338
Query: 175 DPNISSPLVIG-DVAISSKDNLQFTPMLKSPMYPN--YYYIGLEAITIGNSSLTEVPLSL 231
+ ++SS L+ G D + + NL FT +L P+ +YY+ +++I +G L ++P
Sbjct: 339 NTSVSSKLIFGEDKELLNHHNLNFTKLLAGEETPDDTFYYLQIKSIVVGGEVL-DIPEKT 397
Query: 232 REFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVP 291
+ S+G GG ++DSG+T T P+ Y + + I ++ D P
Sbjct: 398 WHWSSEGVGGTIIDSGSTLTFFPDSAYDVIKEAFEKKIKL--------QQIAADDFIMSP 449
Query: 292 CPNNTFTDDL-FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGP 350
C N + + P HF + P N+FY V CL + ++
Sbjct: 450 CYNVSGAMQVELPDYGIHFADGAVWNFPAENYFYQYEP----DEVICLAI--LKTPNHSH 503
Query: 351 SGVFGSFQQQNVEVVYDLEKERIGFQPMDCAST 383
+ G+ QQN ++YD+++ R+G+ P CA
Sbjct: 504 LTIIGNLLQQNFHILYDVKRSRLGYSPRRCAEV 536
>gi|145693992|gb|ABP93696.1| unknown protein isoform 1 [Lemna minor]
Length = 350
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 108/382 (28%), Positives = 156/382 (40%), Gaps = 68/382 (17%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V DTGS++ W+ C C Y + + F P+ SS+ +C S+ C + S
Sbjct: 31 VIFDTGSNVNWIQCKPCVVSC-----YPQQEPL--FDPTLSSTYRNISCTSAACTGLSSR 83
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
GCS ST + + TYG+G G L +T + +
Sbjct: 84 ----------GCSGSTCV----------YGVTYGDGSSTVGFLATETFTLAAGN-----V 118
Query: 124 IPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGF-LQKGFSHCFLAFKYAND-PNI 178
F FGC + + G+ G GR S+ SQL L FS+C + A NI
Sbjct: 119 FNNFIFGCGQNNQGLFTGAAGLIGLGRSPYSLNSQLATSLGNIFSYCLPSTSSATGYLNI 178
Query: 179 SSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQG 238
+PL +T ML + P Y+I L I++G T + LS F S G
Sbjct: 179 GNPL----------RTPGYTAMLTNSRAPTLYFIDLIGISVGG---TRLALSSTVFQSVG 225
Query: 239 NGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFT 298
++DSGT T LP Y L + ++ +T Y RA D CY + T
Sbjct: 226 T---IIDSGTVITRLPPTAYGALRTAFRAAMTQYTRAAAASI---LDTCYDF----SRTT 275
Query: 299 DDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQ 358
FP+I H+ + + +P FY + SS+ CL F D G+ G+ Q
Sbjct: 276 TVTFPTIKLHY-TGLDVTIPGAGVFYVI-----SSSQVCLAFAGNSDST--QIGIIGNVQ 327
Query: 359 QQNVEVVYDLEKERIGFQPMDC 380
Q+ +EV YD +RIGF C
Sbjct: 328 QRTMEVTYDNALKRIGFAAGAC 349
>gi|118486912|gb|ABK95290.1| unknown [Populus trichocarpa]
Length = 438
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 95/381 (24%), Positives = 159/381 (41%), Gaps = 60/381 (15%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DT +D WVPC C + + + F P+ S++ C+ + C +
Sbjct: 113 MVLDTSNDAAWVPCSG-------CTGFSS----TTFLPNASTTLGSLDCSGAQCSQVRGF 161
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
C +G S+ C F +YG +T L +D + +
Sbjct: 162 S-----CPATG--------SSACL----FNQSYGGDSSLTATLVQDAITLAND------V 198
Query: 124 IPKFCFGCVGSTYR---EPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNIS 179
IP F FGC+ + P G+ G GRG +S+ SQ G + G FS+C +FK S
Sbjct: 199 IPGFTFGCINAVSGGSIPPQGLLGLGRGPISLISQAGAMYSGVFSYCLPSFKSYY---FS 255
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
L +G V +++ TP+L++P P+ YY+ L +++G + +P FD
Sbjct: 256 GSLKLGPVG--QPKSIRTTPLLRNPHRPSLYYVNLTGVSVGRIKV-PIPSEQLVFDPNTG 312
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTD 299
G ++DSGT T +P Y + + + + FD C+ +
Sbjct: 313 AGTIIDSGTVITRFVQPVYFAIRDEFRKQVN-----GPISSLGAFDTCFAAT------NE 361
Query: 300 DLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQ 359
P+IT HF ++LVLP N S+S ++ CL + + V + QQ
Sbjct: 362 AEAPAITLHF-EGLNLVLPMENSLIH----SSSGSLACLSMAAAPNNVNSVLNVIANLQQ 416
Query: 360 QNVEVVYDLEKERIGFQPMDC 380
QN+ +++D R+G C
Sbjct: 417 QNLRIMFDTTNSRLGIARELC 437
>gi|224092220|ref|XP_002309515.1| predicted protein [Populus trichocarpa]
gi|222855491|gb|EEE93038.1| predicted protein [Populus trichocarpa]
Length = 473
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 105/381 (27%), Positives = 155/381 (40%), Gaps = 60/381 (15%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ DTGSDLTW C + C + R P++S+S +C+S+FC
Sbjct: 148 LIFDTGSDLTWTQCEPCAKTCYKQKEPR-------LDPTKSTSYKNISCSSAFCK----- 195
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
L T +C P + YG+G G +TL + SS + +
Sbjct: 196 ------------LLDTEGGESCSSPTCLYQVQYGDGSYSIGFFATETLTL--SSSNVFKN 241
Query: 124 IPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLG-FLQKGFSHCFLAFKYANDPNIS 179
F FGC +R G+ G GR LS+PSQ +K FS+C P S
Sbjct: 242 ---FLFGCGQQNSGLFRGAAGLLGLGRTKLSLPSQTAQKYKKLFSYCL--------PASS 290
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
S ++FTP+ + +Y + + +++G + L+ + F + G
Sbjct: 291 SSKGYLSFGGQVSKTVKFTPLSEDFKSTPFYGLDITELSVGGNKLS---IDASIFSTSGT 347
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTD 299
++DSGT T LP YS L S Q +T YP + + FD CY N T
Sbjct: 348 ---VIDSGTVITRLPSTAYSALSSAFQKLMTDYP---STDGYSIFDTCYDFS-KNETIK- 399
Query: 300 DLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQ 359
P + F V + + Y P N CL F +GD + +FG+ QQ
Sbjct: 400 --IPKVGVSFKGGVEMDIDVSGILY----PVNGLKKVCLAFAG--NGDDVKAAIFGNTQQ 451
Query: 360 QNVEVVYDLEKERIGFQPMDC 380
+ +VVYD K R+GF P C
Sbjct: 452 KTYQVVYDDAKGRVGFAPSGC 472
>gi|115446115|ref|NP_001046837.1| Os02g0473200 [Oryza sativa Japonica Group]
gi|47497549|dbj|BAD19621.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
Group]
gi|47847591|dbj|BAD21978.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
Group]
gi|113536368|dbj|BAF08751.1| Os02g0473200 [Oryza sativa Japonica Group]
Length = 494
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 110/397 (27%), Positives = 182/397 (45%), Gaps = 68/397 (17%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKL-MSNFSPSRSSSSSRDTCASSFCLNIHS 62
V +DTGSD+ WV C C C N + ++ + P S S TC FC+ +
Sbjct: 105 VQVDTGSDILWVNC----VSCDGCPRKSNLGIELTMYDPRGSQSGELVTCDQQFCVANYG 160
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIR 122
P CT + PC ++ +YG+G G D L+ + S G +
Sbjct: 161 GVLP--SCTST-------------SPC-EYSISYGDGSSTAGFFVTDFLQYNQVS-GDGQ 203
Query: 123 EIPK---FCFGC-------VGSTYREPIGIAGFGRGALSVPSQL---GFLQKGFSHCFLA 169
P FGC +GS+ GI GFG+ S+ SQL G ++K F+HC
Sbjct: 204 TTPANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCL-- 261
Query: 170 FKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPL 229
+ N IG+V + ++ TP++ P P+Y I L+ I +G ++L +P
Sbjct: 262 ----DTVNGGGIFAIGNVV---QPKVKTTPLV--PDMPHYNVI-LKGIDVGGTALG-LPT 310
Query: 230 SLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDL-CY 288
++ FDS + G ++DSGTT ++PE Y L +++ + + +++ +T D C+
Sbjct: 311 NI--FDSGNSKGTIIDSGTTLAYVPEGVYKALFAMV------FDKHQDISVQTLQDFSCF 362
Query: 289 RVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQS--MDDG 346
+ DD FP +TFHF +VSL++ ++ + N + C+ FQ+ +
Sbjct: 363 QYSGS----VDDGFPEVTFHFEGDVSLIVSPHDYLF-----QNGKNLYCMGFQNGGVQTK 413
Query: 347 DYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCAST 383
D + G N V+YDLE + IG+ +C+S+
Sbjct: 414 DGKDMVLLGDLVLSNKLVLYDLENQAIGWADYNCSSS 450
>gi|308044575|ref|NP_001183392.1| uncharacterized protein LOC100501808 [Zea mays]
gi|238011188|gb|ACR36629.1| unknown [Zea mays]
Length = 342
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 106/385 (27%), Positives = 160/385 (41%), Gaps = 55/385 (14%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGSD+ WV C C C Y + + F P RSSS C ++ C + S
Sbjct: 3 LDTGSDVVWVQCA----PCRRC--YEQSGPV--FDPRRSSSYGAVGCGAALCRRLDSG-- 52
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
GC L + C + YG+G + G +TL G + +
Sbjct: 53 --------GCDLR---RGACM-----YQVAYGDGSVTAGDFVTETLTFAGGA-----RVA 91
Query: 126 KFCFGCVGST---YREPIGIAGFGRGALSVPSQLGF-LQKGFSHCFL-----AFKYANDP 176
+ GC + G+ G GRG LS P+Q+ + FS+C + A
Sbjct: 92 RVALGCGHDNEGLFVAAAGLLGLGRGGLSFPTQISRRYGRSFSYCLVDRTSSGAGAAPGS 151
Query: 177 NISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLS-LREFD 235
+ SS + G ++ + + FTPM+++P +YY+ L I++G + + V S LR
Sbjct: 152 HRSSTVSFGAGSVGAS-SASFTPMVRNPRMETFYYVQLVGISVGGARVPGVAESDLRLDP 210
Query: 236 SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNN 295
S G GG++VDSGT+ T L YS L ++ R FD CY +
Sbjct: 211 STGRGGVIVDSGTSVTRLARASYSALRDAFRAAAAGGLRLSPGGFSL-FDTCYDLGGRRV 269
Query: 296 TFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFG 355
P+++ HF LP N+ P +S C F D G + G
Sbjct: 270 V----KVPTVSMHFAGGAEAALPPENYLI----PVDSRGTFCFAFAGTDGG----VSIIG 317
Query: 356 SFQQQNVEVVYDLEKERIGFQPMDC 380
+ QQQ VV+D + +R+GF P C
Sbjct: 318 NIQQQGFRVVFDGDGQRVGFAPKGC 342
>gi|449440014|ref|XP_004137780.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
gi|449483406|ref|XP_004156582.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 449
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 114/404 (28%), Positives = 173/404 (42%), Gaps = 58/404 (14%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + +DTGS+L+W+ C ++ S F+P SSS S C+SS C +
Sbjct: 86 VTMVIDTGSELSWLHCNT---------SQNSSSSSSTFNPVWSSSYSPIPCSSSTCTD-Q 135
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
+ D P P C + +T +Y + G L DT + S
Sbjct: 136 TRDFPIRP----SCDSNQFCHATL---------SYADASSSEGNLATDTFYIGSSG---- 178
Query: 122 REIPKFCFGCVGSTYR-------EPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYAN 174
IP FGC+ S + + G+ G RG+LS SQ+GF + FS+C + +
Sbjct: 179 --IPNVVFGCMDSIFSSNSEEDSKNTGLMGMNRGSLSFVSQMGFPK--FSYCISEYDF-- 232
Query: 175 DPNISSPLVIGDVAISSKDNLQFTPMLK-SPMYPNY----YYIGLEAITIGNSSLTEVPL 229
S L++GD S L +TP+++ S P + Y + LE I + + L +P
Sbjct: 233 ----SGLLLLGDANFSWLAPLNYTPLIEMSTPLPYFDRVAYTVQLEGIKVAHK-LLPIPE 287
Query: 230 SLREFDSQGNGGLLVDSGTTYTHLPEPFYSQL----LSILQSTITYYPRAKEVEERTGFD 285
S+ E D G G +VDSGT +T L P Y+ L L+ ++ Y + V + D
Sbjct: 288 SVFEPDHTGAGQTMVDSGTQFTFLLGPAYTALRDHFLNKTAGSLRVYEDSNFVFQ-GAMD 346
Query: 286 LCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDD 345
LCYRVP N T L PS+T F V + + ++ C F + D
Sbjct: 347 LCYRVPT-NQTRLPPL-PSVTLVFRGAEMTVTGDRILYRVPGERRGNDSIHCFTFGNSDL 404
Query: 346 GDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASAQGL 389
+ V G QQNV + +DL+K RIG + C G+
Sbjct: 405 LGV-EAFVIGHLHQQNVWMEFDLKKSRIGLAEIRCDLAGQKLGM 447
>gi|356525748|ref|XP_003531485.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 436
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 106/390 (27%), Positives = 163/390 (41%), Gaps = 78/390 (20%)
Query: 7 DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDNP 66
DT SDL WV C C C + + + F P +SS+ + +C S
Sbjct: 108 DTASDLIWVQCS----PCETC--FPQDTPL--FEPHKSSTFANLSCDS------------ 147
Query: 67 FDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIPK 126
PCT S L+ + C + TYG+G G+L ++ +H S + PK
Sbjct: 148 -QPCTSSNIYYCPLVGNLCL-----YTNTYGDGSSTKGVLCTES--IHFGSQTV--TFPK 197
Query: 127 FCFGC------VGSTYREPIGIAGFGRGALSVPSQLG-FLQKGFSHCFLAF--------K 171
FGC + + GI G G G LS+ SQLG + FS+C L F K
Sbjct: 198 TIFGCGSNNDFMHQISNKVTGIVGLGAGPLSLVSQLGDQIGHKFSYCLLPFTSTSTIKLK 257
Query: 172 YANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSL 231
+ ND I+ V+ TP++ P YP+YY++ L ITIG L +
Sbjct: 258 FGNDTTITGNGVVS------------TPLIIDPHYPSYYFLHLVGITIGQKM-----LQV 300
Query: 232 REFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVP 291
R D NG +++D GT T+L FY +++L+ + ++ FD C+
Sbjct: 301 RTTD-HTNGNIIIDLGTVLTYLEVNFYHNFVTLLREALGISETKDDIPY--PFDFCF--- 354
Query: 292 CPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPS 351
PN + FP I F F + P+ N F+ + + + D
Sbjct: 355 -PNQ--ANITFPKIVFQFTGAKVFLSPK-NLFFRF------DDLNMICLAVLPDFYAKGF 404
Query: 352 GVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
VFG+ Q + +V YD + +++ F P DC+
Sbjct: 405 SVFGNLAQVDFQVEYDRKGKKVSFAPADCS 434
>gi|388518257|gb|AFK47190.1| unknown [Lotus japonicus]
Length = 478
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 115/407 (28%), Positives = 183/407 (44%), Gaps = 77/407 (18%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKL-MSNFSPSRSSSSSRDTCASSFCLNIHS 62
V +DTGSD+ WV C +C C + + ++ + P RS +S +C +FC + +
Sbjct: 84 VQVDTGSDILWVNC----VECTRCPRKSDIGIGLTLYDPKRSKTSEFVSCEHNFCSSTYE 139
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTL---KVHGSSPG 119
+ GC PCP ++ +YG+G TG +D L +V+G+ P
Sbjct: 140 G-------RILGCKAE--------NPCP-YSISYGDGSATTGYYVQDYLTFNRVNGN-PH 182
Query: 120 IIREIPKFCFGC-------VGSTYREPI-GIAGFGRGALSVPSQL---GFLQKGFSHCFL 168
+ FGC S+ E + GI GFG+ SV SQL G ++K FSHC
Sbjct: 183 TATQNSSIIFGCGAAQSGTFASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCL- 241
Query: 169 AFKYANDPNISSPLV-IGDVAISSKDNLQFTPMLKS-PMYPN--YYYIGLEAITIGNSSL 224
D N+ + IG+V P +K+ P+ PN +Y + L+ I + + +
Sbjct: 242 ------DTNVGGGIFSIGEVV---------EPKVKTTPLVPNMAHYNVILKNIEV-DGDI 285
Query: 225 TEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKE--VEERT 282
++P FDS+ G ++DSGTT +LP Y QL+S + PR K VEE+
Sbjct: 286 LQLPSD--TFDSENGKGTVIDSGTTLAYLPRIVYDQLMS---KVLAKQPRLKVYLVEEQY 340
Query: 283 GFDLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQ- 341
C++ D FP + HF +++SL + ++ + S C+ +Q
Sbjct: 341 S---CFQYTGN----VDSGFPIVKLHFEDSLSLTVYPHDYLFNYKGDS----YWCIGWQK 389
Query: 342 SMDDGDYGPS-GVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASAQ 387
S + G + G F N VVYDLE IG+ +C+S+ +
Sbjct: 390 SASETKNGKDMTLLGDFVLSNKLVVYDLENMTIGWTDYNCSSSIKVK 436
>gi|90399033|emb|CAJ86229.1| H0402C08.5 [Oryza sativa Indica Group]
gi|125550227|gb|EAY96049.1| hypothetical protein OsI_17922 [Oryza sativa Indica Group]
Length = 473
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 101/380 (26%), Positives = 162/380 (42%), Gaps = 54/380 (14%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +D+GSD+ WV C C C Y + F P+ SSS S +C S+ C
Sbjct: 145 LVVDSGSDVIWVQC----RPCEQC--YAQTDPL--FDPAASSSFSGVSCGSAICR----- 191
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
T+SG + C ++ TYG+G G L +TL + G++ ++
Sbjct: 192 -------TLSGTGCGGGGDAGKC----DYSVTYGDGSYTKGELALETLTLGGTA---VQG 237
Query: 124 IPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNISSPL 182
+ C + G+ G G GA+S+ QLG G FS+C LA + A + L
Sbjct: 238 VAIGCGHRNSGLFVGAAGLLGLGWGAMSLIGQLGGAAGGVFSYC-LASRGAGG---AGSL 293
Query: 183 VIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPL--SLREFDSQGNG 240
V+G + + P++++ ++YY+GL I +G L PL L + G G
Sbjct: 294 VLGRTEAVPVGAV-WVPLVRNNQASSFYYVGLTGIGVGGERL---PLQDGLFQLTEDGAG 349
Query: 241 GLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDD 300
G+++D+GT T LP Y+ L + PR+ V D CY + + +
Sbjct: 350 GVVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSL---LDTCYDL----SGYASV 402
Query: 301 LFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQ 360
P+++F+F L LP N + AV CL F G + G+ QQ+
Sbjct: 403 RVPTVSFYFDQGAVLTLPARNLLVEVGG-----AVFCLAFAPSSSG----ISILGNIQQE 453
Query: 361 NVEVVYDLEKERIGFQPMDC 380
+++ D +GF P C
Sbjct: 454 GIQITVDSANGYVGFGPNTC 473
>gi|255558694|ref|XP_002520371.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223540418|gb|EEF41987.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 557
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 108/390 (27%), Positives = 163/390 (41%), Gaps = 57/390 (14%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSN---FSPSRSSSSSRDTCASSFCLNIHS 62
+DTGSDL W+ C C DC + N + P SSS C C ++ S
Sbjct: 209 LDTGSDLNWIQC----VPCYDC-------FVQNGPYYDPKESSSFKNIGCHDPRC-HLVS 256
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPG--- 119
S +P PC + CP F Y YG+ TG +T V+ +SP
Sbjct: 257 SPDPPQPCKAEN------------QTCPYF-YWYGDSSNTTGDFALETFTVNLTSPAGKS 303
Query: 120 IIREIPKFCFGC---VGSTYREPIGIAGFGRGALSVPSQLGFLQ-KGFSHCFLAFKYAND 175
+ + FGC + G+ G GRG LS SQL L FS+C + +D
Sbjct: 304 EFKRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLV--DRNSD 361
Query: 176 PNISSPLVIG-DVAISSKDNLQFTPMLKSPMYP--NYYYIGLEAITIGNSSLTEVPLSLR 232
N+SS L+ G D + + + FT ++ P +YY+ +++I +G L ++P
Sbjct: 362 TNVSSKLIFGEDKDLLNHPEVNFTSLVAGKENPVDTFYYVQIKSIMVGGEVL-KIPEETW 420
Query: 233 EFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPC 292
+G GG +VDSGTT ++ EP Y + + YP K D PC
Sbjct: 421 HLSPEGAGGTIVDSGTTLSYFAEPSYEIIKDAFVKKVKGYPVIK--------DFPILDPC 472
Query: 293 PNNTFTDDL-FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPS 351
N + + + P F + P N+F + + CL
Sbjct: 473 YNVSGVEKMELPEFRILFEDGAVWNFPVENYFIKLEP----EEIVCLAILGTPRSALS-- 526
Query: 352 GVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
+ G++QQQN ++YD +K R+G+ PM CA
Sbjct: 527 -IIGNYQQQNFHILYDTKKSRLGYAPMKCA 555
>gi|125536523|gb|EAY83011.1| hypothetical protein OsI_38231 [Oryza sativa Indica Group]
Length = 469
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 80/292 (27%), Positives = 131/292 (44%), Gaps = 27/292 (9%)
Query: 89 CPSFAYTYGEGGLVT-GILTRDTLKVHGSSPGIIREIPKFCFGCVGSTYRE---PIGIAG 144
C S++ TYG T G L DT ++ +P FGC ++Y + G+ G
Sbjct: 175 CDSYSLTYGGSAANTSGYLATDTFTFGATA------VPGVVFGCSDASYGDFAGASGVIG 228
Query: 145 FGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSP 204
GRG LS+ SQL F + FS+ LA + +D + S + GD A+ Q TP+L S
Sbjct: 229 IGRGNLSLISQLQFGK--FSYQLLAPEATDDGSADSVIRFGDDAVPKTKRGQSTPLLSST 286
Query: 205 MYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSI 264
+YP++YY+ L + + + L +P + + G GG+++ S T T+L + Y + +
Sbjct: 287 LYPDFYYVNLTGVRVDGNRLDAIPAGTFDLRANGTGGVILSSTTPVTYLEQAAYDVVRAA 346
Query: 265 LQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFY 324
+ S I A DLCY ++ P +T F + L N+FY
Sbjct: 347 VASRIGL--PAVNGSAALELDLCYNA----SSMAKVKVPKLTLVFDGGADMDLSAANYFY 400
Query: 325 AMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQ 376
N + ++CL G V G+ Q ++YD++ R+ F+
Sbjct: 401 I----DNDTGLECLTMLPSQGGS-----VLGTLLQTGTNMIYDVDAGRLTFE 443
>gi|414885970|tpg|DAA61984.1| TPA: hypothetical protein ZEAMMB73_915310 [Zea mays]
Length = 523
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 111/384 (28%), Positives = 158/384 (41%), Gaps = 65/384 (16%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ V DTGSDL+WV C C +C Y+ + + F PS+S++ S C + CL+
Sbjct: 201 LLVVFDTGSDLSWVQCK----PCNNC--YKQHDPL--FDPSQSTTYSAVPCGAQECLD-- 250
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
S S CR + YG+ G L RDTL + SS
Sbjct: 251 ----------------SGTCSSGKCR----YEVVYGDMSQTDGNLARDTLTLGPSSD--- 287
Query: 122 REIPKFCFGCVG---STYREPIGIAGFGRGALSVPSQLGF-LQKGFSHCFLAFKYANDPN 177
++ F FGC + G+ G GR +S+ SQ GFS+C + A
Sbjct: 288 -QLQGFVFGCGDDDTGLFGRADGLFGLGRDRVSLASQAAARYGAGFSYCLPSSWRAE--- 343
Query: 178 ISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
L +G A + + QFT M+ P++YY+ L I + ++ P +
Sbjct: 344 --GYLSLGSAA--APPHAQFTAMVTRSDTPSFYYLDLVGIKVAGRTVRVAPAVFKA---- 395
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTF 297
G ++DSGT T LP YS L S + Y RA + D CY
Sbjct: 396 --PGTVIDSGTVITRLPSRAYSALRSSFAGFMRRYKRAPALSI---LDTCYDF----TGR 446
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSF 357
T PS+ F +L L G Y +N S CL F S +GD G+ G+
Sbjct: 447 TKVQIPSVALLFDGGATLNLGFGGVLYV----ANRSQA-CLAFAS--NGDDTSVGILGNM 499
Query: 358 QQQNVEVVYDLEKERIGFQPMDCA 381
QQ+ VVYDL ++IGF C+
Sbjct: 500 QQKTFAVVYDLANQKIGFGAKGCS 523
>gi|383143501|gb|AFG53178.1| Pinus taeda anonymous locus 2_9704_01 genomic sequence
gi|383143503|gb|AFG53179.1| Pinus taeda anonymous locus 2_9704_01 genomic sequence
gi|383143507|gb|AFG53181.1| Pinus taeda anonymous locus 2_9704_01 genomic sequence
gi|383143509|gb|AFG53182.1| Pinus taeda anonymous locus 2_9704_01 genomic sequence
gi|383143517|gb|AFG53186.1| Pinus taeda anonymous locus 2_9704_01 genomic sequence
gi|383143519|gb|AFG53187.1| Pinus taeda anonymous locus 2_9704_01 genomic sequence
Length = 135
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 57/133 (42%), Positives = 79/133 (59%), Gaps = 6/133 (4%)
Query: 177 NISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
N SS +V+G+ A+ +L +TP++ +P+YP +YY+GLEA++IG L +P + FDS
Sbjct: 7 NNSSKIVVGNKAVPGDISLTYTPLIINPIYPFFYYLGLEAVSIGRKRL-NLPFNSATFDS 65
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNT 296
+GNGG ++DSGT++T PE YSQ+ S I Y R E TG LCY V NT
Sbjct: 66 KGNGGTIIDSGTSFTIFPEAMYSQIAGEFASQIG-YKRVPGAESTTGLGLCYNVSGVENT 124
Query: 297 FTDDLFPSITFHF 309
FP FHF
Sbjct: 125 ----QFPQFAFHF 133
>gi|414881704|tpg|DAA58835.1| TPA: hypothetical protein ZEAMMB73_701358 [Zea mays]
Length = 485
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 106/385 (27%), Positives = 161/385 (41%), Gaps = 55/385 (14%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGSD+ WV C C C Y + + F P RSSS C ++ C + S
Sbjct: 146 LDTGSDVVWVQCA----PCRRC--YEQSGPV--FDPRRSSSYGAVGCGAALCRRLDSG-- 195
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
GC L + C + YG+G + G +TL G + +
Sbjct: 196 --------GCDLR---RGACM-----YQVAYGDGSVTAGDFVTETLTFAGGA-----RVA 234
Query: 126 KFCFGCVGST---YREPIGIAGFGRGALSVPSQLGF-LQKGFSHCFL-----AFKYANDP 176
+ GC + G+ G GRG LS P+Q+ + FS+C + A
Sbjct: 235 RVALGCGHDNEGLFVAAAGLLGLGRGGLSFPTQISRRYGRSFSYCLVDRTSSGAGAAPGS 294
Query: 177 NISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLS-LREFD 235
+ SS + G ++ + + FTPM+++P +YY+ L I++G + + V S LR
Sbjct: 295 HRSSTVSFGAGSVGAS-SASFTPMVRNPRMETFYYVQLVGISVGGARVPGVAESDLRLDP 353
Query: 236 SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNN 295
S G GG++VDSGT+ T L YS L ++ R + FD CY +
Sbjct: 354 STGRGGVIVDSGTSVTRLARASYSALRDAFRAAAAGGLRLSP-GGFSLFDTCYDLGGRRV 412
Query: 296 TFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFG 355
P+++ HF LP N+ P +S C F D G + G
Sbjct: 413 V----KVPTVSMHFAGGAEAALPPENYLI----PVDSRGTFCFAFAGTDGG----VSIIG 460
Query: 356 SFQQQNVEVVYDLEKERIGFQPMDC 380
+ QQQ VV+D + +R+GF P C
Sbjct: 461 NIQQQGFRVVFDGDGQRVGFAPKGC 485
>gi|86438622|emb|CAJ26370.1| chloroplast nucleoid binding protein [Brachypodium sylvaticum]
Length = 443
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 108/390 (27%), Positives = 161/390 (41%), Gaps = 54/390 (13%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ V DTGSDL+WV CG C Y + F+PS SS+ S C C
Sbjct: 98 LTVVFDTGSDLSWVQCG----PCSSGGCYHQQDPL--FAPSSSSTFSAVRCGEPEC---- 147
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI- 120
P CS S CP + YG+ G L DTL + G++P
Sbjct: 148 -------PRARQSCSSSPGDDR-----CP-YEVVYGDKSRTVGHLGNDTLTL-GTTPSTN 193
Query: 121 -----IREIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQL-GFLQKGFSHCFLAFK 171
++P F FGC + + + G+ G GRG +S+ SQ G +GFS+C +
Sbjct: 194 ASENNSNKLPGFVFGCGENNTGLFGKADGLFGLGRGKVSLSSQAAGKYGEGFSYCLPS-- 251
Query: 172 YANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSL 231
+ N L +G A + + +FTPML P++YY+ L I + ++ +
Sbjct: 252 --SSSNAHGYLSLGTPA-PAPAHARFTPMLNRSNTPSFYYVKLVGIRVAGRAI-----KV 303
Query: 232 REFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVP 291
+ GL+VDSGT T L YS L + S + Y K + D CY
Sbjct: 304 SSRPALWPAGLIVDSGTVITRLAPRAYSALRTAFLSAMGKYGY-KRAPRLSILDTCYDFT 362
Query: 292 CPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPS 351
N P++ F ++ + Y A CL F +G +
Sbjct: 363 AHANATVS--IPAVALVFAGGATISVDFSGVLYVAKV-----AQACLAFAPNGNGR--SA 413
Query: 352 GVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
G+ G+ QQ+ V VVYD+ +++IGF C+
Sbjct: 414 GILGNTQQRTVAVVYDVGRQKIGFAAKGCS 443
>gi|125586059|gb|EAZ26723.1| hypothetical protein OsJ_10631 [Oryza sativa Japonica Group]
Length = 339
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 89/300 (29%), Positives = 140/300 (46%), Gaps = 38/300 (12%)
Query: 92 FAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIPKFCFGC----VGSTYREPIGIAGFGR 147
+ Y+YG+ + TG L D G+ +P FGC G GIAGFGR
Sbjct: 64 YTYSYGDKSVTTGFLEVDKFTFVGAG----ASVPGVAFGCGLFNNGVFKSNETGIAGFGR 119
Query: 148 GALSVPSQLGFLQKGFSHCFLAFKYANDPNISSPLVI---GDVAISSKDNLQFTPML--- 201
G LS+PSQL FSHCF A I S +++ D+ + + +Q TP++
Sbjct: 120 GPLSLPSQLKV--GNFSHCFTTITGA----IPSTVLLDLPADLFSNGQGAVQTTPLIQYA 173
Query: 202 KSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQL 261
K+ P YY+ L+ IT+G++ L VP S + G GG ++DSGT+ T LP Q+
Sbjct: 174 KNEANPTLYYLSLKGITVGSTRL-PVPESAFAL-TNGTGGTIIDSGTSITSLPP----QV 227
Query: 262 LSILQSTITYYPRAKEVE-ERTGFDLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQG 320
+++ + V TG C+ P + P + HF ++ LP+
Sbjct: 228 YQVVRDEFAAQIKLPVVPGNATGHYTCFSAP----SQAKPDVPKLVLHF-EGATMDLPRE 282
Query: 321 NHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
N+ + + + +S + CL D+ + + G+FQQQN+ V+YDL+ + F C
Sbjct: 283 NYVFEVPDDAGNSII-CLAINKGDE-----TTIIGNFQQQNMHVLYDLQNNMLSFVAAQC 336
>gi|255566006|ref|XP_002523991.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536718|gb|EEF38359.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 455
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 119/390 (30%), Positives = 172/390 (44%), Gaps = 57/390 (14%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
I +DTGS++ W+PC N C DC N+ S F+P SS+ C S C
Sbjct: 111 IHAAIDTGSNVIWIPCIN----CKDC----FNQSSSIFNPLASSTYQDAPCDSYQCETTS 162
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
SS + C S C L CP+ G + DT+ + SS G
Sbjct: 163 SSCQSDNVCLYS-CDEKHQLN------CPN------------GRIAVDTMTL-TSSDGRP 202
Query: 122 REIPKFCFGCVGSTYR--EPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNI 178
+P F C S Y+ +G+ G GRGALS+ S+L L G FS+C LA Y+ P
Sbjct: 203 FPLPYSDFVCGNSIYKTFAGVGVIGLGRGALSLTSKLYHLSDGKFSYC-LADYYSKQP-- 259
Query: 179 SSPLVIGDVAISSKDNLQF-TPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFD-- 235
S + G + S D+L+ + L + YY+ LE I++G E L D
Sbjct: 260 -SKINFGLQSFISDDDLEVVSTTLGHHRHSGNYYVTLEGISVG-----EKRQDLYYVDDP 313
Query: 236 -SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRV---P 291
+ G +L+DSGT +T LP+ FY L S + I P+ R F + + P
Sbjct: 314 FAPPVGNMLIDSGTMFTLLPKDFYDYLWSTVSYAIPENPQNHPHNSRFPFSMDNTLKLSP 373
Query: 292 CPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPS 351
C + + FP IT HF + + L N F + + V C F + G S
Sbjct: 374 C-FWYYPELKFPKITIHF-TDADVELSDDNSFIRV-----AEDVVCFAFAATQPGQ---S 423
Query: 352 GVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
V+GS+QQ N + YDL++ + F+ DC+
Sbjct: 424 TVYGSWQQMNFILGYDLKRGTVSFKRTDCS 453
>gi|302781726|ref|XP_002972637.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
gi|300160104|gb|EFJ26723.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
Length = 393
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 106/384 (27%), Positives = 160/384 (41%), Gaps = 64/384 (16%)
Query: 3 QVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHS 62
+ DTGSDL WV S C C + F P +SS+ C+S C +
Sbjct: 69 RAIADTGSDLVWVQ----SEPCTGCSG------GTIFDPRQSSTFREMDCSSQLCTELPG 118
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIR 122
S P S+ C S++Y YG G G RDT+ + G++ G +
Sbjct: 119 SCEP---------------GSSAC----SYSYEYGSG-ETEGEFARDTISL-GTTSGGSQ 157
Query: 123 EIPKFCFGC--VGSTYREPIGIAGFGRGALSVPSQL-GFLQKGFSHCFLAFKYANDPNIS 179
+ P F GC V S + G+ G G+G +S+ SQL + FS+C + N + S
Sbjct: 158 KFPSFAVGCGMVNSGFDGVDGLVGLGQGPVSLTSQLSAAIDSKFSYCLVDI---NSQSES 214
Query: 180 SPLVIGDVAISSKDNLQFTPMLK-SPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQG 238
SPL+ G A +Q T + S YP YY + + I + ++
Sbjct: 215 SPLLFGPSAALHGTGIQSTKITPPSDTYPTYYLLTVNGIAVAGQTMGS------------ 262
Query: 239 NGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFT 298
G ++DSGTT T++P Y ++LS ++S +T PR G DLCY N
Sbjct: 263 PGTTIIDSGTTLTYVPSGVYGRVLSRMESMVT-LPRVD--GSSMGLDLCYDRSSNRNY-- 317
Query: 299 DDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQ 358
FP++T L ++ P N+F + +S CL +M P + G+
Sbjct: 318 --KFPALTIR-LAGATMTPPSSNYFLVV---DDSGDTVCL---AMGSAGGLPVSIIGNVM 368
Query: 359 QQNVEVVYDLEKERIGFQPMDCAS 382
QQ ++YD + F C S
Sbjct: 369 QQGYHILYDRGSSELSFVQAKCES 392
>gi|297810815|ref|XP_002873291.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
lyrata]
gi|297319128|gb|EFH49550.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
lyrata]
Length = 439
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 108/382 (28%), Positives = 162/382 (42%), Gaps = 64/382 (16%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
MDT SD+ W+PC C+ C + FSP++S+S +C++ C +
Sbjct: 116 MDTSSDVAWIPCSG----CVGCPSN------TAFSPAKSTSFKNVSCSAPQCKQV----- 160
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
P C CS F TYG + L++DT+++ I
Sbjct: 161 PNPACGARACS---------------FNLTYGSSSIAAN-LSQDTIRLAAD------PIK 198
Query: 126 KFCFGCV-----GSTYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNIS 179
F FGCV G T P G+ G GRG LS+ SQ + K FS+C +F+ S
Sbjct: 199 AFTFGCVNKVAGGGTIPPPQGLLGLGRGPLSLMSQAQSVYKSTFSYCLPSFRSLT---FS 255
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
L +G S +++T +L++P + YY+ L AI +G + ++P + F+
Sbjct: 256 GSLRLGPT--SQPQRVKYTQLLRNPRRSSLYYVNLVAIRVGRK-VVDLPPAAIAFNPSTG 312
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTD 299
G + DSGT YT L +P Y + + + + P V GFD CY
Sbjct: 313 AGTIFDSGTVYTRLAKPVYEAVRNEFRKRVK--PPTAVVTSLGGFDTCYS--------GQ 362
Query: 300 DLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQ 359
P+ITF F V++ +P N +A S S CL S + V S QQ
Sbjct: 363 VKVPTITFMF-KGVNMTMPADNLMLHSTAGSTS----CLAMASAPENVNSVVNVIASMQQ 417
Query: 360 QNVEVVYDLEKERIGFQPMDCA 381
QN V+ D+ R+G C+
Sbjct: 418 QNHRVLIDVPNGRLGLARERCS 439
>gi|414878073|tpg|DAA55204.1| TPA: hypothetical protein ZEAMMB73_344109 [Zea mays]
Length = 440
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 108/401 (26%), Positives = 172/401 (42%), Gaps = 67/401 (16%)
Query: 3 QVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHS 62
+ +DTGS+L W C C + N +S + PSRS ++
Sbjct: 85 EAIIDTGSNLIWTQCST----CQPAGCFSQN--LSFYDPSRSRTA--------------- 123
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEG-GLVTGILTRDTLKVHGSSPGII 121
P C + C+L + ++ C R + A G G++ G+L + S +
Sbjct: 124 --RPV-ACNDTACALGS--ETRCARDNKACAVLTAYGAGVIGGVLGTEAFTFQPQSENV- 177
Query: 122 REIPKFCFGCVGSTYREP------IGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYAND 175
FGC+ +T P GI G GRG LS+ SQLG FS+C + ++
Sbjct: 178 ----SLAFGCIAATRLTPGSLDGASGIIGLGRGNLSLVSQLG--DNKFSYCLTPY-FSQS 230
Query: 176 PNISSPLVIGDVAISSKDN-LQFTPMLKSP---MYPNYYYIGLEAITIGNSSLT--EVPL 229
N S V +SS P LK+P + +YY+ L IT+G++ L E
Sbjct: 231 TNTSRLFVGASAGLSSGGAPATSVPFLKNPDVDPFSTFYYLPLTGITVGDAKLAVPEAAF 290
Query: 230 SLREFDSQGNGGLLVDSGTTYTHLPEPFYS----QLLSILQSTITYYPRAKEVEERTGFD 285
LR+ + G L+DSG+ +T L + Y +L+ L ++I P E G D
Sbjct: 291 DLRQVATGLWAGTLIDSGSPFTSLVDVAYQALRDELVQQLGASIVPPPAGAE-----GLD 345
Query: 286 LCYRVPCPNNTFTDDLFPSITFHFLNNVS-LVLPQGNHFYAMSAPSNSSAVKCLLFQSMD 344
LC V + L P + HF + + +P N++ P + S ++F S
Sbjct: 346 LCAAVA---HGDVGKLVPPLVLHFGSGGGDVAVPPENYW----GPVDDSTACMVVFSSGG 398
Query: 345 DGDYGP---SGVFGSFQQQNVEVVYDLEKERIGFQPMDCAS 382
P + + G++ QQ++ ++YDLEK + FQP DC+S
Sbjct: 399 PNSTLPMNETTIIGNYMQQDMHLLYDLEKGMLSFQPADCSS 439
>gi|79507883|ref|NP_196320.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332003717|gb|AED91100.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 455
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 107/382 (28%), Positives = 162/382 (42%), Gaps = 64/382 (16%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
MDT SD+ W+PC C+ C + FSP++S+S +C++ C +
Sbjct: 132 MDTSSDVAWIPCSG----CVGCPSN------TAFSPAKSTSFKNVSCSAPQCKQV----- 176
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
P C CS F TYG + L++DT+++ I
Sbjct: 177 PNPTCGARACS---------------FNLTYGSSSIAAN-LSQDTIRLAAD------PIK 214
Query: 126 KFCFGCV-----GSTYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNIS 179
F FGCV G T P G+ G GRG LS+ SQ + K FS+C +F+ S
Sbjct: 215 AFTFGCVNKVAGGGTIPPPQGLLGLGRGPLSLMSQAQSIYKSTFSYCLPSFRSLT---FS 271
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
L +G S +++T +L++P + YY+ L AI +G + ++P + F+
Sbjct: 272 GSLRLGPT--SQPQRVKYTQLLRNPRRSSLYYVNLVAIRVGRK-VVDLPPAAIAFNPSTG 328
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTD 299
G + DSGT YT L +P Y + + + + P V GFD CY
Sbjct: 329 AGTIFDSGTVYTRLAKPVYEAVRNEFRKRVK--PTTAVVTSLGGFDTCYS--------GQ 378
Query: 300 DLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQ 359
P+ITF F V++ +P N +A S S CL + + V S QQ
Sbjct: 379 VKVPTITFMF-KGVNMTMPADNLMLHSTAGSTS----CLAMAAAPENVNSVVNVIASMQQ 433
Query: 360 QNVEVVYDLEKERIGFQPMDCA 381
QN V+ D+ R+G C+
Sbjct: 434 QNHRVLIDVPNGRLGLARERCS 455
>gi|302793638|ref|XP_002978584.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
gi|300153933|gb|EFJ20570.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
Length = 407
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 113/392 (28%), Positives = 167/392 (42%), Gaps = 68/392 (17%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFC--LNIH 61
+ +DTGSDL W+ C C C Y+ + F P SSS R C S C L +H
Sbjct: 69 MVVDTGSDLPWLQCQ----PCKSC--YKQADPI--FDPRNSSSFQRIPCLSPLCKALEVH 120
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
S CS S S C S+ YG+G G + D + S +
Sbjct: 121 S------------CSGSRGATSRC-----SYQVAYGDGSFSVGDFSSDLFTLGTGSKAM- 162
Query: 122 REIPKFCFGC---VGSTYREPIGIAGFGRGALSVPSQL------GFLQKGFSHCFLAFKY 172
FGC + G+ G G G LS PSQ+ FS+C L +
Sbjct: 163 ----SVAFGCGFDNEGLFAGAAGLLGLGAGKLSFPSQIFASSTNSSTANSFSYC-LVDRS 217
Query: 173 ANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLR 232
SS L+ G AI S L +P+LK+P +YY + +++G + L P+SL+
Sbjct: 218 NPMTRSSSSLIFGVAAIPSTAAL--SPLLKNPKLDTFYYAAMIGVSVGGAQL---PISLK 272
Query: 233 --EFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRV 290
+ G+GG+++DSGT+ T P Y+ + ++ P A FD CY
Sbjct: 273 SLQLSQSGSGGVIIDSGTSVTRFPTSVYATIRDAFRNATINLPSAPRYSL---FDTCYNF 329
Query: 291 PCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQ--SMDDGDY 348
+ + D+ P++ HF N L LP N+ P N++ CL F SM+
Sbjct: 330 ---SGKASVDV-PALVLHFENGADLQLPPTNYLI----PINTAGSFCLAFAPTSME---- 377
Query: 349 GPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
G+ G+ QQQ+ + +DL+K + F P C
Sbjct: 378 --LGIIGNIQQQSFRIGFDLQKSHLAFAPQQC 407
>gi|356537728|ref|XP_003537377.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 543
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 101/399 (25%), Positives = 165/399 (41%), Gaps = 56/399 (14%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + +DTGSDL+W+ C C DC + N S++ P SS+ +C C +
Sbjct: 184 VWLILDTGSDLSWIQCD----PCYDC--FEQNG--SHYYPKDSSTYRNISCYDPRCQLVS 235
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPG-- 119
SSD P C + CP F Y Y +G TG +T V+ + P
Sbjct: 236 SSD-PLQHCKAEN------------QTCPYF-YDYADGSNTTGDFASETFTVNLTWPNGK 281
Query: 120 -IIREIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQL-GFLQKGFSHCFLAFKYAN 174
+++ FGC + G+ G GRG +S PSQ+ FS+C +
Sbjct: 282 EKFKQVVDVMFGCGHWNKGFFYGASGLLGLGRGPISFPSQIQSIYGHSFSYCLTDL--FS 339
Query: 175 DPNISSPLVIG-DVAISSKDNLQFTPMLKSPMYPN--YYYIGLEAITIGNSSLTEVPLSL 231
+ ++SS L+ G D + + NL FT +L P+ +YY+ +++I +G L ++
Sbjct: 340 NTSVSSKLIFGEDKELLNNHNLNFTTLLAGEETPDETFYYLQIKSIMVGGEVL-DISEQT 398
Query: 232 REFDSQGNGGL-----LVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDL 286
+ S+G ++DSG+T T P+ Y + + I ++ D
Sbjct: 399 WHWSSEGAAADAGGGTIIDSGSTLTFFPDSAYDIIKEAFEKKIKL--------QQIAADD 450
Query: 287 CYRVPCPN--NTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMD 344
PC N P HF + P N+FY V CL M
Sbjct: 451 FVMSPCYNVSGAMMQVELPDFGIHFADGGVWNFPAENYFYQYEP----DEVICLAI--MK 504
Query: 345 DGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCAST 383
++ + G+ QQN ++YD+++ R+G+ P CA
Sbjct: 505 TPNHSHLTIIGNLLQQNFHILYDVKRSRLGYSPRRCAEV 543
>gi|383143511|gb|AFG53183.1| Pinus taeda anonymous locus 2_9704_01 genomic sequence
Length = 135
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 56/133 (42%), Positives = 79/133 (59%), Gaps = 6/133 (4%)
Query: 177 NISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
N SS +V+G+ A+ +L +TP++ +P+YP +YY+GLEA++IG + +P + FDS
Sbjct: 7 NNSSKIVVGNKAVPGDISLTYTPLIINPIYPFFYYLGLEAVSIGRKRM-NLPFNSATFDS 65
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNT 296
+GNGG ++DSGT++T PE YSQ+ S I Y R E TG LCY V NT
Sbjct: 66 KGNGGTIIDSGTSFTIFPEAMYSQIAGEFASQIG-YKRVPGAESTTGLGLCYNVSGVENT 124
Query: 297 FTDDLFPSITFHF 309
FP FHF
Sbjct: 125 ----QFPQFAFHF 133
>gi|9759559|dbj|BAB11161.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|21553652|gb|AAM62745.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|109134179|gb|ABG25087.1| At5g07030 [Arabidopsis thaliana]
Length = 439
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 107/382 (28%), Positives = 162/382 (42%), Gaps = 64/382 (16%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
MDT SD+ W+PC C+ C + FSP++S+S +C++ C +
Sbjct: 116 MDTSSDVAWIPCSG----CVGCPSN------TAFSPAKSTSFKNVSCSAPQCKQV----- 160
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
P C CS F TYG + L++DT+++ I
Sbjct: 161 PNPTCGARACS---------------FNLTYGSSSIAAN-LSQDTIRLAAD------PIK 198
Query: 126 KFCFGCV-----GSTYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNIS 179
F FGCV G T P G+ G GRG LS+ SQ + K FS+C +F+ S
Sbjct: 199 AFTFGCVNKVAGGGTIPPPQGLLGLGRGPLSLMSQAQSIYKSTFSYCLPSFRSLT---FS 255
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
L +G S +++T +L++P + YY+ L AI +G + ++P + F+
Sbjct: 256 GSLRLGPT--SQPQRVKYTQLLRNPRRSSLYYVNLVAIRVGRK-VVDLPPAAIAFNPSTG 312
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTD 299
G + DSGT YT L +P Y + + + + P V GFD CY
Sbjct: 313 AGTIFDSGTVYTRLAKPVYEAVRNEFRKRVK--PTTAVVTSLGGFDTCYS--------GQ 362
Query: 300 DLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQ 359
P+ITF F V++ +P N +A S S CL + + V S QQ
Sbjct: 363 VKVPTITFMF-KGVNMTMPADNLMLHSTAGSTS----CLAMAAAPENVNSVVNVIASMQQ 417
Query: 360 QNVEVVYDLEKERIGFQPMDCA 381
QN V+ D+ R+G C+
Sbjct: 418 QNHRVLIDVPNGRLGLARERCS 439
>gi|115434442|ref|NP_001041979.1| Os01g0140100 [Oryza sativa Japonica Group]
gi|12328547|dbj|BAB21205.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113531510|dbj|BAF03893.1| Os01g0140100 [Oryza sativa Japonica Group]
gi|125568961|gb|EAZ10476.1| hypothetical protein OsJ_00309 [Oryza sativa Japonica Group]
gi|215697206|dbj|BAG91200.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 504
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 109/383 (28%), Positives = 158/383 (41%), Gaps = 62/383 (16%)
Query: 3 QVYM--DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNI 60
Q+YM DTGSD+TWV C C DC Y+ + + F PS S+S + C + C ++
Sbjct: 179 QLYMVLDTGSDVTWVQCQ----PCADC--YQQSDPV--FDPSLSTSYASVACDNPRCHDL 230
Query: 61 HSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
+ + C ST C + YG+G G +TL + S+P
Sbjct: 231 DA----------AACRNST---GACL-----YEVAYGDGSYTVGDFATETLTLGDSAP-- 270
Query: 121 IREIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPN 177
+ GC + G+ G G LS PSQ+ FS+C + D
Sbjct: 271 ---VSSVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQIS--ATTFSYCLV----DRDSP 321
Query: 178 ISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
SS L GD A D P+++SP +YY+GL +++G L+ +P S DS
Sbjct: 322 SSSTLQFGDAA----DAEVTAPLIRSPRTSTFYYVGLSGLSVGGQILS-IPPSAFAMDST 376
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTF 297
G GG++VDSGT T L Y+ L PR V FD CY + +
Sbjct: 377 GAGGVIVDSGTAVTRLQSSAYAALRDAFVRGTQSLPRTSGVSL---FDTCYDL----SDR 429
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSF 357
T P+++ F L LP N+ P + + CL F + + G+
Sbjct: 430 TSVEVPAVSLRFAGGGELRLPAKNYLI----PVDGAGTYCLAFAPTN----AAVSIIGNV 481
Query: 358 QQQNVEVVYDLEKERIGFQPMDC 380
QQQ V +D K +GF C
Sbjct: 482 QQQGTRVSFDTAKSTVGFTTNKC 504
>gi|302780643|ref|XP_002972096.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
gi|300160395|gb|EFJ27013.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
Length = 393
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 107/384 (27%), Positives = 159/384 (41%), Gaps = 64/384 (16%)
Query: 3 QVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHS 62
+ DTGSDL WV S C C + F P +SS+ C+S C +
Sbjct: 69 RAIADTGSDLVWVQ----SEPCTGCSG------GTIFDPRQSSTFREMDCSSQLCAELPG 118
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIR 122
S P STC S++Y YG G G RDT+ + +S G +
Sbjct: 119 SCEPG--------------SSTC-----SYSYEYGSG-ETEGEFARDTISLGTTSDGS-Q 157
Query: 123 EIPKFCFGC--VGSTYREPIGIAGFGRGALSVPSQL-GFLQKGFSHCFLAFKYANDPNIS 179
+ P F GC V S + G+ G G+G +S+ SQL + FS+C + N + S
Sbjct: 158 KFPSFAVGCGMVNSGFDGVDGLVGLGQGPVSLTSQLSAAIDSKFSYCLVDI---NSQSES 214
Query: 180 SPLVIGDVAISSKDNLQFTPMLK-SPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQG 238
SPL+ G A +Q T + S YP YY + + I + ++
Sbjct: 215 SPLLFGPSAALHGTGIQSTKITPPSDTYPTYYLLTVNGIAVAGQTMGS------------ 262
Query: 239 NGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFT 298
G ++DSGTT T++P Y ++LS ++S +T PR G DLCY N
Sbjct: 263 PGTTIIDSGTTLTYVPSGVYGRVLSRMESMVT-LPRVD--GSSMGLDLCYDRSSNRNY-- 317
Query: 299 DDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQ 358
FP++T L ++ P N+F + +S CL +M P + G+
Sbjct: 318 --KFPALTIR-LAGATMTPPSSNYFLVV---DDSGDTVCL---AMGSASGLPVSIIGNVM 368
Query: 359 QQNVEVVYDLEKERIGFQPMDCAS 382
QQ ++YD + F C S
Sbjct: 369 QQGYHILYDRGSSELSFVQAKCES 392
>gi|297821064|ref|XP_002878415.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297324253|gb|EFH54674.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 486
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 110/381 (28%), Positives = 161/381 (42%), Gaps = 49/381 (12%)
Query: 4 VYM--DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
VYM DTGSD+ W+ C C C Y + ++ F P +S + + C S C +
Sbjct: 151 VYMVLDTGSDVVWLQCS----PCKAC--YNQSDVI--FDPKKSKTFATVPCGSRLCRRLD 202
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
S C T TC + +YG+G G + +TL HG+ +
Sbjct: 203 DSSE----CV-------TRRSKTCL-----YQVSYGDGSFTEGDFSTETLTFHGAR---V 243
Query: 122 REIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFL-AFKYANDPNIS 179
+P C + G+ G GRG LS PSQ G FS+C + +
Sbjct: 244 DHVPLGCGHDNEGLFVGAAGLLGLGRGGLSFPSQTKSRYNGKFSYCLVDRTSSGSSSKPP 303
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
S +V G+ A+ FTP+L +P +YY+ L I++G S + V S + D+ GN
Sbjct: 304 STIVFGNDAVPKTS--VFTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSESQFKLDATGN 361
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTD 299
GG+++DSGT+ T L + Y L + T RA FD C+ + + T
Sbjct: 362 GGVIIDSGTSVTRLTQSAYVALRDAFRLGATKLKRAPSYSL---FDTCFDL----SGMTT 414
Query: 300 DLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQ 359
P++ FHF + LP N+ P N+ C F G G + G+ QQ
Sbjct: 415 VKVPTVVFHF-GGGEVSLPASNYLI----PVNTEGRFCFAFA----GTMGSLSIIGNIQQ 465
Query: 360 QNVEVVYDLEKERIGFQPMDC 380
Q V YDL R+GF C
Sbjct: 466 QGFRVAYDLVGSRVGFLSRAC 486
>gi|449454654|ref|XP_004145069.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449470632|ref|XP_004153020.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449499016|ref|XP_004160697.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 434
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 109/387 (28%), Positives = 165/387 (42%), Gaps = 55/387 (14%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
I DTGSD+ W C C +C Y+ N M F PS+S++ C+S C
Sbjct: 96 IVAVADTGSDVIWTQCK----PCSNC--YQQNAPM--FDPSKSTTYKNVACSSPVC---- 143
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
+ SG S S C ++ YG+ G L DT+ + +S G
Sbjct: 144 ---------SYSGDGSSCSDDSECL-----YSIAYGDDSHSQGNLAVDTVTMQSTS-GRP 188
Query: 122 REIPKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDP 176
P+ GC G+ GI G GRG S+ +QLG G FS+C + +
Sbjct: 189 VAFPRTVIGCGHDNAGTFNANVSGIVGLGRGPASLVTQLGPATGGKFSYCLIPIGTGST- 247
Query: 177 NISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
N S+ L G A S TP+ S Y +Y + LEA+++G++ P +
Sbjct: 248 NDSTKLNFGSNANVSGSGTVSTPIYSSAQYKTFYSLKLEAVSVGDTKF-NFPEGASKLGG 306
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNT 296
+ N +++DSGTT T+LP + S + +++ P A++ E D C+ T
Sbjct: 307 ESN--IIIDSGTTLTYLPSALLNSFGSAISQSMS-LPHAQDPSEF--LDYCFA------T 355
Query: 297 FTDDL-FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFG 355
TDD P +T HF + L + N F + S CL F S D + ++G
Sbjct: 356 TTDDYEMPPVTMHF-EGADVPLQRENLFVRL-----SDDTICLAFGSFPDDNI---FIYG 406
Query: 356 SFQQQNVEVVYDLEKERIGFQPMDCAS 382
+ Q N V YD++ + FQP C +
Sbjct: 407 NIAQSNFLVGYDIKNLAVSFQPAHCGA 433
>gi|225217008|gb|ACN85295.1| aspartic proteinase nepenthesin-1 precursor [Oryza coarctata]
Length = 500
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 112/390 (28%), Positives = 157/390 (40%), Gaps = 78/390 (20%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V DTGSD TWV C C Y+ + + F P+RSS+ + +CA+ C +++
Sbjct: 176 VVFDTGSDTTWVQCEPCVVVC-----YKQQEKL--FDPARSSTYANISCAAPACSDLY-- 226
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
+ GCS L + YG+G G DTL +
Sbjct: 227 --------IKGCSGGHCL----------YGVQYGDGSYSIGFFAMDTLTLSS-----YDA 263
Query: 124 IPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGFLQKG-FSHCF---------LAF 170
I F FGC Y E G+ G GRG S+P Q G F+HCF L F
Sbjct: 264 IKGFRFGCGERNEGLYGEAAGLLGLGRGKTSLPVQAYDKYGGVFAHCFPARSSGTGYLDF 323
Query: 171 KYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLS 230
+ P +S+ L TPML P +YY+GL I +G L +P S
Sbjct: 324 GPGSLPAVSAKLT--------------TPMLVDNG-PTFYYVGLTGIRVGGK-LLSIPQS 367
Query: 231 LREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRV 290
+ F + G +VDSGT T LP YS L S S + K+ + D CY
Sbjct: 368 V--FTTSGT---IVDSGTVITRLPPAAYSSLRSAFASAMAERGY-KKAPALSLLDTCYDF 421
Query: 291 PCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGP 350
++ P+++ F SL + YA S + CL F + D
Sbjct: 422 ----TGMSEVAIPTVSLLFQGGASLDVHASGIIYAASV-----SQACLGFAGNKEDD--D 470
Query: 351 SGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
G+ G+ Q + VVYD+ K+ +GF P C
Sbjct: 471 VGIVGNTQLKTFGVVYDIGKKVVGFCPGAC 500
>gi|326512066|dbj|BAJ96014.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 485
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 112/400 (28%), Positives = 180/400 (45%), Gaps = 75/400 (18%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDY-RNNKL---MSNFSPSRSSSSSRDTCASSFCLN 59
V +DTGSD+ WV +C+ CD R + L ++ + PS SSS + TC FC+
Sbjct: 96 VQVDTGSDILWV-------NCVFCDTCPRKSGLGIELTLYDPSGSSSGTGVTCGQDFCVA 148
Query: 60 IHSSDNPFDPCTMSGCSLSTLLKSTCCRPCP-SFAYTYGEGGLVTGILTRDTLK---VHG 115
H P +C P ++ +YG+G TG D L+ V G
Sbjct: 149 THGGVIP-----------------SCVPAAPCQYSISYGDGSSTTGFFVTDFLQYNQVSG 191
Query: 116 SSPGIIREIPKFCFGC-------VGSTYREPIGIAGFGRGALSVPSQL---GFLQKGFSH 165
+S + FGC +GS+ + GI GFG+ S+ SQL G ++K F+H
Sbjct: 192 NSQTTLANT-SITFGCGAKIGGDLGSSSQALDGILGFGQSNSSMLSQLAAAGKVRKVFAH 250
Query: 166 CFLAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLT 225
C + N IGDV + + TP++ P P +Y + LEAI +G L
Sbjct: 251 CL------DTINGGGIFAIGDVV---QPKVSTTPLV--PGMP-HYNVNLEAIDVGGVKL- 297
Query: 226 EVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFD 285
++P ++ FD + G ++DSGTT +LP Y+ ++S + + P + + +
Sbjct: 298 QLPTNI--FDIGESKGTIIDSGTTLAYLPGVVYNAIMSKVFAQYGDMPLKNDQDFQ---- 351
Query: 286 LCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQS--M 343
C+R DD FP ITFHF + L + ++ + + + C+ FQ+ +
Sbjct: 352 -CFRYSGS----VDDGFPIITFHFEGGLPLNIHPHDYLF------QNGELYCMGFQTGGL 400
Query: 344 DDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCAST 383
D + G N V+YDLE + IG+ +C+S+
Sbjct: 401 QTKDGKDMVLLGDLAFSNRLVLYDLENQVIGWTDYNCSSS 440
>gi|326491519|dbj|BAJ94237.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326524456|dbj|BAK00611.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 499
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 105/384 (27%), Positives = 159/384 (41%), Gaps = 69/384 (17%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V DTGSD TWV C C Y+ + + F P++SS+ + +C S C ++ ++
Sbjct: 178 VVFDTGSDTTWVQCRPCVVKC-----YKQKEPL--FDPAKSSTYANVSCTDSACADLDTN 230
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
GC+ L +A YG+G G +DTL + +
Sbjct: 231 ----------GCTGGHCL----------YAVQYGDGSYTVGFFAQDTLTIAHDA------ 264
Query: 124 IPKFCFGCV---GSTYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNIS 179
I F FGC + + G+ G GRG S+ Q G F++C A
Sbjct: 265 IKGFRFGCGEKNNGLFGKTAGLMGLGRGKTSLTVQAYNKYGGAFAYCLPALTTGTGYLDF 324
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
P S+ +N + TPML +YY+G+ I +G +VP++ F + G
Sbjct: 325 GPG-------SAGNNARLTPMLTDKGQ-TFYYVGMTGIRVGGQ---QVPVAESVFSTAGT 373
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGF---DLCYRVPCPNNT 296
LVDSGT T LP Y+ L S + A+ ++ G+ D CY
Sbjct: 374 ---LVDSGTVITRLPATAYTALSSAFDKVML----ARGYKKAPGYSILDTCYDF----TG 422
Query: 297 FTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGS 356
+D P+++ F L + YA+S A CL F S +GD + G+
Sbjct: 423 LSDVELPTVSLVFQGGACLDVDVSGIVYAIS-----EAQVCLAFAS--NGDDESVAIVGN 475
Query: 357 FQQQNVEVVYDLEKERIGFQPMDC 380
QQ+ V+YDL K+ +GF P C
Sbjct: 476 TQQKTYGVLYDLGKKTVGFAPGSC 499
>gi|226492150|ref|NP_001146362.1| hypothetical protein precursor [Zea mays]
gi|219886805|gb|ACL53777.1| unknown [Zea mays]
gi|414878074|tpg|DAA55205.1| TPA: hypothetical protein ZEAMMB73_415404 [Zea mays]
Length = 440
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 110/402 (27%), Positives = 160/402 (39%), Gaps = 69/402 (17%)
Query: 3 QVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHS 62
+ +DTGS+L W C C +R N + + PSRS ++ C + C
Sbjct: 85 EAIIDTGSNLIWTQCSRCRPTC-----FRQN--LPYYDPSRSRAARAVGCNDAAC----- 132
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIR 122
G L + C + YG G + G L + L
Sbjct: 133 ---------ALGSETQCLSDNKTC----AVVTGYGAGN-IAGTLATENLTFQS------- 171
Query: 123 EIPKFCFGCVGSTYREP------IGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDP 176
E FGC+ T P GI G GRG LS+PSQLG FS+C Y D
Sbjct: 172 ETVSLVFGCIVVTKLSPGSLNGASGIIGLGRGKLSLPSQLG--DTRFSYCLT--PYFEDT 227
Query: 177 NISSPLVIGDVA-----ISSKDNLQFTPMLKSPM---YPNYYYIGLEAITIGNSSLT--E 226
S +V+G A +S + P ++SP + +YY+ L IT G L
Sbjct: 228 IEPSHMVVGASAGLINGSASSTPVTTVPFVRSPSDDPFSTFYYLPLTGITAGKVKLAVPS 287
Query: 227 VPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDL 286
LR+ G +DSG T L + Y L + L + + + TGFDL
Sbjct: 288 AAFDLRQVAPGMWTGTFIDSGAPLTSLVDVAYQALRAELARQLGA-ALVQPLAGTTGFDL 346
Query: 287 CYRVPCPNNTFTDDLFPSITFHFL----NNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQS 342
C + + L P + HF LV+P N++ AP +S+ ++F S
Sbjct: 347 CVAL-----KDAERLVPPLVLHFGGGSGTGTDLVVPPANYW----APVDSATACMVVFSS 397
Query: 343 MDDGD--YGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCAS 382
+D + V G++ QQN+ V+YDL + FQP DC+S
Sbjct: 398 VDRKSLPMNETTVIGNYMQQNMHVLYDLAGGVLSFQPADCSS 439
>gi|15232960|ref|NP_186923.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|6728988|gb|AAF26986.1|AC018363_31 putative aspartyl protease [Arabidopsis thaliana]
gi|21593593|gb|AAM65560.1| putative aspartyl protease [Arabidopsis thaliana]
gi|332640332|gb|AEE73853.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 488
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 112/399 (28%), Positives = 175/399 (43%), Gaps = 77/399 (19%)
Query: 3 QVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHS 62
V +DTGSD+ WV C C+ C + ++ + SS++ +C+ +FC S
Sbjct: 99 HVQVDTGSDILWVNCAG----CIRCPRKSDLVELTPYDVDASSTAKSVSCSDNFC----S 150
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVH-------- 114
N C SG STC + YG+G G L +D + +
Sbjct: 151 YVNQRSEC-HSG--------STC-----QYVIMYGDGSSTNGYLVKDVVHLDLVTGNRQT 196
Query: 115 GSSPGIIREIPKFCFGC-------VGSTYREPIGIAGFGRGALSVPSQL---GFLQKGFS 164
GS+ G I FGC +G + GI GFG+ S SQL G +++ F+
Sbjct: 197 GSTNGTI------IFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKVKRSFA 250
Query: 165 HCFLAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSL 224
HC ++ N IG+V +S K ++ TPML + Y + L AI +GNS L
Sbjct: 251 HCL------DNNNGGGIFAIGEV-VSPK--VKTTPMLSKSAH---YSVNLNAIEVGNSVL 298
Query: 225 TEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGF 284
LS FDS + G+++DSGTT +LP+ Y+ LL+ + +P E T
Sbjct: 299 ---ELSSNAFDSGDDKGVIIDSGTTLVYLPDAVYNPLLN---EILASHP------ELTLH 346
Query: 285 DLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMD 344
+ C + T D FP++TF F +VSL + + + + + C +Q+
Sbjct: 347 TVQESFTCFHYTDKLDRFPTVTFQFDKSVSLAVYPREYLFQVREDT-----WCFGWQNGG 401
Query: 345 DGDYGPSG--VFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
G + + G N VVYD+E + IG+ +C+
Sbjct: 402 LQTKGGASLTILGDMALSNKLVVYDIENQVIGWTNHNCS 440
>gi|225217022|gb|ACN85307.1| aspartic proteinase nepenthesin-1 precursor [Oryza ridleyi]
Length = 525
Score = 105 bits (261), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 113/392 (28%), Positives = 158/392 (40%), Gaps = 82/392 (20%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V DTGSD TWV C C + + KL F P+RSS+ + +CA+ C ++++
Sbjct: 201 VVFDTGSDTTWVQCEPCVVVCYE----QQEKL---FDPARSSTDANISCAAPACSDLYTK 253
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
GCS L + YG+G G DTL +
Sbjct: 254 ----------GCSGGHCL----------YGVQYGDGSYSIGFFAMDTLTLSS-----YDA 288
Query: 124 IPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGFLQKG-FSHCF---------LAF 170
I F FGC + E G+ G GRG S+P Q G F+HCF L F
Sbjct: 289 IKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQAYDKYGGVFAHCFPARSSGTGYLDF 348
Query: 171 KYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLS 230
+ P +S+ L TPML +YY+GL I +G L +P S
Sbjct: 349 GPGSSPAVSTKLT--------------TPMLVDNGL-TFYYVGLTGIRVGGK-LLSIPPS 392
Query: 231 LREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITY--YPRAKEVEERTGFDLCY 288
+ F + G +VDSGT T LP YS L S S I Y +A + D CY
Sbjct: 393 V--FTTAGT---IVDSGTVITRLPPAAYSSLRSAFASAIAARGYKKAPALSL---LDTCY 444
Query: 289 RVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDY 348
+ P+++ F SL + YA S + CL F + ++ D
Sbjct: 445 DF----TGMSQVAIPTVSLLFQGGASLDVDASGIIYAASV-----SQACLGFAANEEDD- 494
Query: 349 GPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
G+ G+ Q + VVYD+ K+ +GF P C
Sbjct: 495 -DVGIVGNTQLKTFGVVYDIGKKVVGFSPGAC 525
>gi|116787367|gb|ABK24480.1| unknown [Picea sitchensis]
Length = 496
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 109/385 (28%), Positives = 161/385 (41%), Gaps = 66/385 (17%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DTGSD+ W+ C C +C Y + F+PS S S S C S+ C + ++
Sbjct: 169 MVLDTGSDVVWIQCE----PCREC--YSQADPI--FNPSSSVSFSTVGCDSAVCSQLDAN 220
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
D C GC + +YG+G G +TL +S
Sbjct: 221 D-----CHGGGCL---------------YEVSYGDGSYTVGSYATETLTFGTTS------ 254
Query: 124 IPKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQ-KGFSHCFLAFKYANDPNI 178
I GC VG + G+ G G G+LS P+QLG + FS+C + D
Sbjct: 255 IQNVAIGCGHDNVG-LFVGAAGLLGLGAGSLSFPAQLGTQTGRAFSYCLVD----RDSES 309
Query: 179 SSPLVIG--DVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVP-LSLREFD 235
S L G V I S FTP++ +P P +YY+ + AI++G L VP + R +
Sbjct: 310 SGTLEFGPESVPIGSI----FTPLVANPFLPTFYYLSMVAISVGGVILDSVPSEAFRIDE 365
Query: 236 SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNN 295
+ G GG+++DSGT T L Y L + + PRA + + FD CY + +
Sbjct: 366 TTGRGGIIIDSGTAVTRLQTSAYDALRDAFIAGTQHLPRADGI---SIFDTCYDL----S 418
Query: 296 TFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFG 355
P++ FHF N +LP N P +S C F D + G
Sbjct: 419 ALQSVSIPAVGFHFSNGAGFILPAKNCLI----PMDSMGTFCFAFAPADSN----LSIMG 470
Query: 356 SFQQQNVEVVYDLEKERIGFQPMDC 380
+ QQQ + V +D +GF C
Sbjct: 471 NIQQQGIRVSFDSANSLVGFAIDQC 495
>gi|225216988|gb|ACN85278.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
Length = 518
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 109/383 (28%), Positives = 155/383 (40%), Gaps = 64/383 (16%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V DTGSD TWV C C + + KL F P+RSS+ + +CA+ C ++ +
Sbjct: 194 VVFDTGSDTTWVQCQPCVVVCYE----QQEKL---FDPARSSTYANVSCAAPACFDLDTR 246
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
GCS L + YG+G G DTL +
Sbjct: 247 ----------GCSGGHCL----------YGVQYGDGSYSIGFFAMDTLTLSS-----YDA 281
Query: 124 IPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNIS 179
+ F FGC + E G+ G GRG S+P Q G F+HC A +
Sbjct: 282 VKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSSG-----T 336
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
L G + ++ TPML + P +YY+G+ I +G L +P S+
Sbjct: 337 GYLDFGPGSPAAAGARLTTPML-TDNGPTFYYVGMTGIRVGGQ-LLSIPQSVFA-----T 389
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITY--YPRAKEVEERTGFDLCYRVPCPNNTF 297
G +VDSGT T LP P YS L S S + Y +A V D CY
Sbjct: 390 AGTIVDSGTVITRLPPPAYSSLRSAFVSAMAARGYKKAPAVSL---LDTCYDF----TGM 442
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSF 357
+ P+++ F L + YA S + CL F + +DG G G+ G+
Sbjct: 443 SQVAIPTVSLLFQGGAILDVDASGIMYAASV-----SQVCLGFAANEDG--GDVGIVGNT 495
Query: 358 QQQNVEVVYDLEKERIGFQPMDC 380
Q + V YD+ K+ +GF P C
Sbjct: 496 QLKTFGVAYDIGKKVVGFSPGAC 518
>gi|148908573|gb|ABR17396.1| unknown [Picea sitchensis]
Length = 350
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 109/385 (28%), Positives = 160/385 (41%), Gaps = 66/385 (17%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DTGSD+ W+ C C +C Y + F+PS S S S C S+ C + ++
Sbjct: 23 MVLDTGSDVVWIQCE----PCREC--YSQADPI--FNPSSSVSFSTVGCDSAVCSQLDAN 74
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
D C GC + +YG+G G +TL +S
Sbjct: 75 D-----CHGGGCL---------------YEVSYGDGSYTVGSYATETLTFGTTS------ 108
Query: 124 IPKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQ-KGFSHCFLAFKYANDPNI 178
I GC VG + G+ G G G+LS P+QLG + FS+C + D
Sbjct: 109 IQNVAIGCGHDNVG-LFVGAAGLLGLGAGSLSFPAQLGTQTGRAFSYCLVD----RDSES 163
Query: 179 SSPLVIG--DVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVP-LSLREFD 235
S L G V I S FTP++ +P P +YY+ + AI++G L VP + R +
Sbjct: 164 SGTLEFGPESVPIGSI----FTPLVANPFLPTFYYLSMVAISVGGVILDSVPSEAFRIDE 219
Query: 236 SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNN 295
+ G GG+++DSGT T L Y L + + PRA + FD CY + +
Sbjct: 220 TTGRGGIIIDSGTAVTRLQTSAYDALRDAFIAGTQHLPRADGISI---FDTCYDLSALQS 276
Query: 296 TFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFG 355
P++ FHF N +LP N P +S C F D + G
Sbjct: 277 V----SIPAVGFHFSNGAGFILPAKNCLI----PMDSMGTFCFAFAPADSN----LSIMG 324
Query: 356 SFQQQNVEVVYDLEKERIGFQPMDC 380
+ QQQ + V +D +GF C
Sbjct: 325 NIQQQGIRVSFDSANSLVGFAIDQC 349
>gi|449462551|ref|XP_004149004.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449515029|ref|XP_004164552.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 434
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 112/385 (29%), Positives = 168/385 (43%), Gaps = 68/385 (17%)
Query: 7 DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDNP 66
DTGSDLTW C C +C N+ F+P RSSS + +CAS C ++ S
Sbjct: 108 DTGSDLTWTQC----LPCREC----FNQSQPIFNPRRSSSYRKVSCASDTCRSLESYHCG 159
Query: 67 FDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIPK 126
D + CS + Y+YG+ G L D + + GS ++PK
Sbjct: 160 PD---LQSCS---------------YGYSYGDRSFTYGDLASDQITI-GSF-----KLPK 195
Query: 127 FCFGC-------VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNIS 179
GC G IG+ G +S + ++ FS+C F ++ NI+
Sbjct: 196 TVIGCGHQNGGTFGGVTSGIIGLGGGSLSLVSQMRTIAGVKPRFSYCLPTF--FSNANIT 253
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPMYPN-YYYIGLEAITIGNSSLTEVPLSLREFDSQG 238
+ G A+ S + TP++ P P+ +Y++ LEAI++G + + + G
Sbjct: 254 GTISFGRKAVVSGRQVVSTPLV--PRSPDTFYFLTLEAISVGKKRF-KAANGISAMTNHG 310
Query: 239 NGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTG-FDLCYRVPCPNNTF 297
N +++DSGTT T LP Y + S L I +AK V++ +G +LCY +
Sbjct: 311 N--IIIDSGTTLTLLPRSLYYGVFSTLARVI----KAKRVDDPSGILELCY-----SAGQ 359
Query: 298 TDDL-FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGS 356
DDL P IT HF + L N F ++ V CL F +FG+
Sbjct: 360 VDDLNIPIITAHFAGGADVKLLPVNTFAPVA-----DNVTCLTFAPATQ-----VAIFGN 409
Query: 357 FQQQNVEVVYDLEKERIGFQPMDCA 381
Q N EV YDL +R+ F+P CA
Sbjct: 410 LAQINFEVGYDLGNKRLSFEPKLCA 434
>gi|125524353|gb|EAY72467.1| hypothetical protein OsI_00323 [Oryza sativa Indica Group]
Length = 500
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 109/383 (28%), Positives = 157/383 (40%), Gaps = 62/383 (16%)
Query: 3 QVYM--DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNI 60
Q+YM DTGSD+TWV C C DC Y+ + + F PS S+S + C + C ++
Sbjct: 175 QLYMVLDTGSDVTWVQCQ----PCADC--YQQSDPV--FDPSLSTSYASVACDNPRCHDL 226
Query: 61 HSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
+ + C ST C + YG+G G +TL + S+P
Sbjct: 227 DA----------AACRNST---GACL-----YEVAYGDGSYTVGDFATETLTLGDSAP-- 266
Query: 121 IREIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPN 177
+ GC + G+ G G LS PSQ+ FS+C + D
Sbjct: 267 ---VSSVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQIS--ATTFSYCLV----DRDSP 317
Query: 178 ISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
SS L GD A D P+++SP +YY+GL I++G L+ +P S D
Sbjct: 318 SSSTLQFGDAA----DAEVTAPLIRSPRTSTFYYVGLSGISVGGQILS-IPPSAFAMDGT 372
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTF 297
G GG++VDSGT T L Y+ L PR V FD CY + +
Sbjct: 373 GAGGVIVDSGTAVTRLQSSAYAALRDAFVRGTQSLPRTSGVSL---FDTCYDL----SDR 425
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSF 357
T P+++ F L LP N+ P + + CL F + + G+
Sbjct: 426 TSVEVPAVSLRFAGGGELRLPAKNYLI----PVDGAGTYCLAFAPTN----AAVSIIGNV 477
Query: 358 QQQNVEVVYDLEKERIGFQPMDC 380
QQQ V +D K +GF C
Sbjct: 478 QQQGTRVSFDTAKSTVGFTSNKC 500
>gi|293332531|ref|NP_001169558.1| uncharacterized protein LOC100383437 precursor [Zea mays]
gi|224030089|gb|ACN34120.1| unknown [Zea mays]
gi|413925069|gb|AFW65001.1| hypothetical protein ZEAMMB73_160528 [Zea mays]
Length = 491
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 110/397 (27%), Positives = 178/397 (44%), Gaps = 68/397 (17%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKL-MSNFSPSRSSSSSRDTCASSFCLNIHS 62
V +DTGSD+ WV C C C L ++ + P SS+ S C +FC
Sbjct: 101 VQVDTGSDILWVNC----ITCEQCPHKSGLGLDLTLYDPKASSTGSMVMCDQAFC----- 151
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKV-HGSSPGII 121
T G L K PC ++ TYG+G G D L+ + G
Sbjct: 152 ------AATFGG----KLPKCGANVPC-EYSVTYGDGSSTIGSFVTDALQFDQVTRDGQT 200
Query: 122 REI-PKFCFGC-------VGSTYREPIGIAGFGRGALSVPSQL---GFLQKGFSHCFLAF 170
+ FGC +GS+ + GI GFG S+ SQL G ++K F+HC
Sbjct: 201 QPANASVIFGCGAQQGGDLGSSNQALDGILGFGEANTSMLSQLTTAGKVKKIFAHCLDTI 260
Query: 171 KYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLS 230
K IGDV + ++ TP++ +Y + L+ I +G ++L ++P
Sbjct: 261 KGGG------IFSIGDVV---QPKVKTTPLVADK---PHYNVNLKTIDVGGTTL-QLPAH 307
Query: 231 LREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEV--EERTGFDLCY 288
+ F+ G ++DSGTT T+LPE + +++ + + + +++ + GF LC+
Sbjct: 308 I--FEPGEKKGTIIDSGTTLTYLPELVFKEVM------LAVFNKHQDITFHDVQGF-LCF 358
Query: 289 RVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQ--SMDDG 346
+ P DD FP+ITFHF ++++L + +F+A N + V C+ FQ +
Sbjct: 359 QYPGS----VDDGFPTITFHFEDDLALHVYPHEYFFA-----NGNDVYCVGFQNGASQSK 409
Query: 347 DYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCAST 383
D + G N V+YDLE IG+ +C+S+
Sbjct: 410 DGKDIVLMGDLVLSNKLVIYDLENRVIGWTDYNCSSS 446
>gi|218190722|gb|EEC73149.1| hypothetical protein OsI_07179 [Oryza sativa Indica Group]
Length = 494
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 109/397 (27%), Positives = 181/397 (45%), Gaps = 68/397 (17%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKL-MSNFSPSRSSSSSRDTCASSFCLNIHS 62
V +DTGSD+ WV C C C N + ++ + P S S TC FC+ +
Sbjct: 105 VQVDTGSDILWVNC----VSCDGCPRKSNLGIELTMYDPRGSQSGELVTCDQQFCVANYG 160
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIR 122
P CT + PC ++ +YG+G G D L+ + S G +
Sbjct: 161 GVLP--SCTST-------------SPC-EYSISYGDGSSTAGFFVTDFLQYNQVS-GDGQ 203
Query: 123 EIPK---FCFGC-------VGSTYREPIGIAGFGRGALSVPSQL---GFLQKGFSHCFLA 169
P FGC +GS+ GI GFG+ S+ SQL G ++K F+HC
Sbjct: 204 TTPANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCL-- 261
Query: 170 FKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPL 229
+ N IG+V + ++ TP++ P+Y I L+ I +G ++L +P
Sbjct: 262 ----DTVNGGGIFAIGNVV---QPKVKTTPLVSD--MPHYNVI-LKGIDVGGTALG-LPT 310
Query: 230 SLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDL-CY 288
++ FDS + G ++DSGTT ++PE Y L +++ + + +++ +T D C+
Sbjct: 311 NI--FDSGNSKGTIIDSGTTLAYVPEGVYKALFAMV------FDKHQDISVQTLQDFSCF 362
Query: 289 RVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQS--MDDG 346
+ DD FP +TFHF +VSL++ ++ + N + C+ FQ+ +
Sbjct: 363 QYSGS----VDDGFPEVTFHFEGDVSLIVSPHDYLF-----QNGKNLYCMGFQNGGVQTK 413
Query: 347 DYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCAST 383
D + G N V+YDLE + IG+ +C+S+
Sbjct: 414 DGKDMVLLGDLVLSNKLVLYDLENQAIGWADYNCSSS 450
>gi|22326716|ref|NP_196638.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|18700103|gb|AAL77663.1| AT5g10770/T30N20_40 [Arabidopsis thaliana]
gi|24111269|gb|AAN46758.1| At5g10770/T30N20_40 [Arabidopsis thaliana]
gi|332004211|gb|AED91594.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 474
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 102/384 (26%), Positives = 157/384 (40%), Gaps = 58/384 (15%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + DTGSDLTW C C D + F+PS+S+S +C+S+ C ++
Sbjct: 145 LSLIFDTGSDLTWTQCQPCVRTCYDQKE-------PIFNPSKSTSYYNVSCSSAACGSLS 197
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
S+ C+ S C + YG+ G L ++ + S
Sbjct: 198 SATGNAGSCSASNCI---------------YGIQYGDQSFSVGFLAKEKFTLTNSDV--- 239
Query: 122 REIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGFL-QKGFSHCFLAFKYANDPN 177
FGC + + G+ G GR LS PSQ K FS+C + +
Sbjct: 240 --FDGVYFGCGENNQGLFTGVAGLLGLGRDKLSFPSQTATAYNKIFSYCL-----PSSAS 292
Query: 178 ISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
+ L G IS +++FTP+ ++Y + + AIT+G L P+ F +
Sbjct: 293 YTGHLTFGSAGISR--SVKFTPISTITDGTSFYGLNIVAITVGGQKL---PIPSTVFSTP 347
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTF 297
G L+DSGT T LP Y+ L S ++ ++ YP V D C+ + + F
Sbjct: 348 G---ALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSI---LDTCFDL----SGF 397
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSF 357
P + F F + L FY + CL F + D + +FG+
Sbjct: 398 KTVTIPKVAFSFSGGAVVELGSKGIFYVFKI-----SQVCLAFAG--NSDDSNAAIFGNV 450
Query: 358 QQQNVEVVYDLEKERIGFQPMDCA 381
QQQ +EVVYD R+GF P C+
Sbjct: 451 QQQTLEVVYDGAGGRVGFAPNGCS 474
>gi|8979711|emb|CAB96832.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
thaliana]
Length = 446
Score = 104 bits (260), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 102/384 (26%), Positives = 156/384 (40%), Gaps = 58/384 (15%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + DTGSDLTW C C D + F+PS+S+S +C+S+ C ++
Sbjct: 117 LSLIFDTGSDLTWTQCQPCVRTCYDQKE-------PIFNPSKSTSYYNVSCSSAACGSLS 169
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
S+ C+ S C + YG+ G L ++ + S
Sbjct: 170 SATGNAGSCSASNCI---------------YGIQYGDQSFSVGFLAKEKFTLTNSDV--- 211
Query: 122 REIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLG-FLQKGFSHCFLAFKYANDPN 177
FGC + + G+ G GR LS PSQ K FS+C + +
Sbjct: 212 --FDGVYFGCGENNQGLFTGVAGLLGLGRDKLSFPSQTATAYNKIFSYCL-----PSSAS 264
Query: 178 ISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
+ L G IS +++FTP+ ++Y + + AIT+G L P+ F +
Sbjct: 265 YTGHLTFGSAGISR--SVKFTPISTITDGTSFYGLNIVAITVGGQKL---PIPSTVFSTP 319
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTF 297
G L+DSGT T LP Y+ L S ++ ++ YP V D C+ + + F
Sbjct: 320 G---ALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSI---LDTCFDL----SGF 369
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSF 357
P + F F + L FY CL F + D + +FG+
Sbjct: 370 KTVTIPKVAFSFSGGAVVELGSKGIFYVFKISQ-----VCLAFAG--NSDDSNAAIFGNV 422
Query: 358 QQQNVEVVYDLEKERIGFQPMDCA 381
QQQ +EVVYD R+GF P C+
Sbjct: 423 QQQTLEVVYDGAGGRVGFAPNGCS 446
>gi|242117573|dbj|BAH80056.1| hypothetical protein [Oryza sativa Indica Group]
Length = 469
Score = 104 bits (260), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 79/292 (27%), Positives = 131/292 (44%), Gaps = 27/292 (9%)
Query: 89 CPSFAYTYGEGGLVT-GILTRDTLKVHGSSPGIIREIPKFCFGCVGSTYRE---PIGIAG 144
C S++ TYG T G L DT ++ +P FGC ++Y + G+ G
Sbjct: 175 CDSYSLTYGGSAANTSGYLATDTFTFGATA------VPGVVFGCSDASYGDFAGASGVIG 228
Query: 145 FGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSP 204
GRG LS+ SQL F + FS+ LA + +D + S + GD A+ + TP+L S
Sbjct: 229 IGRGNLSLISQLQFGK--FSYQLLAPEATDDGSADSVIRFGDDAVPKTKRGRSTPLLSST 286
Query: 205 MYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSI 264
+YP++YY+ L + + + L +P + + G GG+++ S T T+L + Y + +
Sbjct: 287 LYPDFYYVNLTGVRVDGNRLDAIPAGTFDLRANGTGGVILSSTTPVTYLEQAAYDVVRAA 346
Query: 265 LQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFY 324
+ S I A DLCY ++ P +T F + L N+FY
Sbjct: 347 VASRIGL--PAVNGSAALELDLCYNA----SSMAKVKVPKLTLVFDGGADMDLSAANYFY 400
Query: 325 AMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQ 376
N + ++CL G V G+ Q ++YD++ R+ F+
Sbjct: 401 I----DNDTGLECLTMLPSQGGS-----VLGTLLQTGTNMIYDVDAGRLTFE 443
>gi|357486591|ref|XP_003613583.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355514918|gb|AES96541.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 437
Score = 104 bits (260), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 119/389 (30%), Positives = 174/389 (44%), Gaps = 60/389 (15%)
Query: 3 QVY--MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNI 60
Q+Y MDT +D W C C C N F PS+SS+ C+S C N+
Sbjct: 101 QLYGVMDTANDNIWFQCN----PCKPCF----NTTSPMFDPSKSSTYKTIPCSSPKCKNV 152
Query: 61 HSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
++ D K C +++TYG G L+ DTL ++ ++
Sbjct: 153 ENTHCSSDD------------KKVC-----EYSFTYGGEAYSQGDLSIDTLTLNSNNDTP 195
Query: 121 IREIPKFCFGCVGSTYREPI-----GIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYAN 174
I GC G + P+ G G GRG LS SQL G FS+C + ++N
Sbjct: 196 I-SFKNIVIGC-GHRNKGPLEGYVSGNIGLGRGPLSFISQLNSSIGGKFSYCLVPL-FSN 252
Query: 175 DPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREF 234
+ IS L GD ++ S TP+ + Y L A+++G+ + + S +
Sbjct: 253 E-GISGKLHFGDKSVVSGVGTVSTPITAGEIG---YSTTLNALSVGDH-IIKFENSTSKN 307
Query: 235 DSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPN 294
D+ GN ++DSGTT T LPE YS+L SI+ S + RAK ++ F LCY+
Sbjct: 308 DNLGN--TIIDSGTTLTILPENVYSRLESIVTSMVK-LERAKSPNQQ--FKLCYKA---- 358
Query: 295 NTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVF 354
T + P IT HF N + L N FY + V C F S+ G++ P +
Sbjct: 359 -TLKNLDVPIITAHF-NGADVHLNSLNTFYPI-----DHEVVCFAFVSV--GNF-PGTII 408
Query: 355 GSFQQQNVEVVYDLEKERIGFQPMDCAST 383
G+ QQN V +DL+K I F+P DC +
Sbjct: 409 GNIAQQNFLVGFDLQKNIISFKPTDCTKS 437
>gi|225217056|gb|ACN85339.1| aspartic proteinase nepenthesin-1 precursor [Oryza granulata]
Length = 521
Score = 104 bits (260), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 110/386 (28%), Positives = 158/386 (40%), Gaps = 70/386 (18%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V DTGSD TWV C C Y+ + + F P+RSS+ + +CA+ C ++++
Sbjct: 197 VVFDTGSDTTWVQCQPCVVVC-----YKQQEKL--FDPARSSTYANVSCAAPACSDLYTR 249
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
GCS L ++ YG+G G DTL +
Sbjct: 250 ----------GCSGGHCL----------YSVQYGDGSYSIGFFAMDTLTLSS-----YDA 284
Query: 124 IPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYAN---DP 176
+ F FGC + E G+ G GRG S+P Q G F+HC A D
Sbjct: 285 VKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSSGTGYLDF 344
Query: 177 NISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
SP +G Q TPML + P +YY+G+ I +G L +P S+ F +
Sbjct: 345 GPGSPAAVG--------ARQTTPML-TDNGPTFYYVGMTGIRVGGQ-LLSIPQSV--FST 392
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITY--YPRAKEVEERTGFDLCYRVPCPN 294
G +VDSGT T LP YS L S S + Y +A + D CY
Sbjct: 393 AGT---IVDSGTVITRLPPAAYSSLRSAFASAMAARGYKKAPALSL---LDTCYDF---- 442
Query: 295 NTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVF 354
++ P ++ F L + YA S + CL F + +D D G+
Sbjct: 443 TGMSEVAIPKVSLLFQGGAYLDVNASGIMYAASL-----SQVCLGFAANEDDD--DVGIV 495
Query: 355 GSFQQQNVEVVYDLEKERIGFQPMDC 380
G+ Q + VVYD+ K+ +GF P C
Sbjct: 496 GNTQLKTFGVVYDIGKKTVGFSPGAC 521
>gi|414871328|tpg|DAA49885.1| TPA: hypothetical protein ZEAMMB73_545054 [Zea mays]
Length = 565
Score = 104 bits (260), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 87/292 (29%), Positives = 129/292 (44%), Gaps = 47/292 (16%)
Query: 104 GILTRDTLKVHGSSPGIIREIPKFCFGCV-----GSTYREPIGIAGFGRGALSVPSQ-LG 157
+L +D L +H + I + FGC+ GS + G+ GF RG LS PSQ
Sbjct: 308 ALLGQDALALHDD----VDAIAAYTFGCLCVVTGGSVPSQ--GLVGFNRGPLSFPSQNKN 361
Query: 158 FLQKGFSHCFLAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAI 217
FS+C ++K N S L +G ++ TP+L +P P+ YY+ + I
Sbjct: 362 VYGSVFSYCLPSYK---SSNFSGTLRLGPAG--QPKRIKTTPLLSNPHRPSLYYVNMVGI 416
Query: 218 TIGNSSLTEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKE 277
+G + VP S FD G +VD+GT +T L P Y+ + + +S + RA
Sbjct: 417 RVGGRPVA-VPASALAFDPASGHGTIVDAGTMFTRLSAPVYAAVCDVFRSRV----RAPV 471
Query: 278 VEERTGFDLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKC 337
GFD CY V P++TF F VS+ LP+ N + S+ + C
Sbjct: 472 AGPLGGFDTCYNVTIS--------VPTVTFLFDGRVSVTLPEEN----VVIRSSLDGIAC 519
Query: 338 LLFQSMDDGDYGPS-------GVFGSFQQQNVEVVYDLEKERIGFQPMDCAS 382
L + GPS V S QQQN V++D+ R+GF C +
Sbjct: 520 LAMAA------GPSDSVDAVLNVMASMQQQNHRVLFDVANGRVGFSRELCTA 565
>gi|326520736|dbj|BAJ92731.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 499
Score = 104 bits (260), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 105/384 (27%), Positives = 158/384 (41%), Gaps = 69/384 (17%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V DTGSD TWV C C Y+ + F P++SS+ + +C S C ++ ++
Sbjct: 178 VVFDTGSDTTWVQCRPCVVKC-----YKQKGPL--FDPAKSSTYANVSCTDSACADLDTN 230
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
GC+ L +A YG+G G +DTL + +
Sbjct: 231 ----------GCTGGHCL----------YAVQYGDGSYTVGFFAQDTLTIAHDA------ 264
Query: 124 IPKFCFGCV---GSTYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNIS 179
I F FGC + + G+ G GRG S+ Q G F++C A
Sbjct: 265 IKGFRFGCGEKNNGLFGKTAGLMGLGRGKTSLTVQAYNKYGGAFAYCLPALTTGTGYLDF 324
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
P S+ +N + TPML +YY+G+ I +G +VP++ F + G
Sbjct: 325 GPG-------SAGNNARLTPMLTDKGQ-TFYYVGMTGIRVGGQ---QVPVAESVFSTAGT 373
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGF---DLCYRVPCPNNT 296
LVDSGT T LP Y+ L S + A+ ++ G+ D CY
Sbjct: 374 ---LVDSGTVITRLPATAYTALSSAFDKVML----ARGYKKAPGYSILDTCYDF----TG 422
Query: 297 FTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGS 356
+D P+++ F L + YA+S A CL F S +GD + G+
Sbjct: 423 LSDVELPTVSLVFQGGACLDVDVSGIVYAIS-----EAQVCLAFAS--NGDDESVAIVGN 475
Query: 357 FQQQNVEVVYDLEKERIGFQPMDC 380
QQ+ V+YDL K+ +GF P C
Sbjct: 476 TQQKTYGVLYDLGKKTVGFAPGSC 499
>gi|224142011|ref|XP_002324354.1| predicted protein [Populus trichocarpa]
gi|222865788|gb|EEF02919.1| predicted protein [Populus trichocarpa]
Length = 471
Score = 104 bits (260), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 111/382 (29%), Positives = 165/382 (43%), Gaps = 62/382 (16%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ DTGSDLTW C S C +D + F P++S+S +C+S C +I
Sbjct: 147 LLFDTGSDLTWTQCEPCSGGCFPQNDEK-------FDPTKSTSYKNLSCSSEPCKSIGKE 199
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
+ GCS S ++C + YG G V G L +TL + +P + E
Sbjct: 200 -------SAQGCSSS----NSCL-----YGVKYGTGYTV-GFLATETLTI---TPSDVFE 239
Query: 124 IPKFCFGCV---GSTYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNIS 179
F GC G + G+ G GR +++PSQ K FS+C A + +
Sbjct: 240 --NFVIGCGERNGGRFSGTAGLLGLGRSPVALPSQTSSTYKNLFSYCLPA----SSSSTG 293
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
G V+ ++K FTP+ + P Y + + I++G L P R
Sbjct: 294 HLSFGGGVSQAAK----FTPI--TSKIPELYGLDVSGISVGGRKLPIDPSVFR------T 341
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTD 299
G ++DSGTT T+LP +S L S Q +T Y K +G CY N D
Sbjct: 342 AGTIIDSGTTLTYLPSTAHSALSSAFQEMMTNYTLTKGT---SGLQPCYDFSKHAN---D 395
Query: 300 DL-FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQ 358
++ P I+ F V + + F A +N CL F+ D+G+ +FG+ Q
Sbjct: 396 NITIPQISIFFEGGVEVDIDDSGIFIA----ANGLEEVCLAFK--DNGNDTDVAIFGNVQ 449
Query: 359 QQNVEVVYDLEKERIGFQPMDC 380
Q+ EVVYD+ K +GF P C
Sbjct: 450 QKTYEVVYDVAKGMVGFAPGGC 471
>gi|147821993|emb|CAN70318.1| hypothetical protein VITISV_016757 [Vitis vinifera]
Length = 429
Score = 104 bits (260), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 114/407 (28%), Positives = 172/407 (42%), Gaps = 67/407 (16%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + +DTGS+L+W+ C + L S F P RSSS S C S C
Sbjct: 69 VTMVLDTGSELSWLHCK------------KAPNLHSVFDPLRSSSYSPIPCTSPTC---- 112
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
D C K C S Y + + G L DT + S+
Sbjct: 113 -RTRTRDFSIPVSCD-----KKKLCHAIIS----YADASSIEGNLASDTFHIGNSA---- 158
Query: 122 REIPKFCFGCVGSTYR-------EPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYAN 174
IP FGC+ S + + G+ G RG+LS +Q+G LQK FS+C +
Sbjct: 159 --IPATIFGCMDSGFSSNSDEDSKTTGLIGMNRGSLSFVTQMG-LQK-FSYCI------S 208
Query: 175 DPNISSPLVIGDVAISSKDNLQFTPMLK-SPMYPNY----YYIGLEAITIGNSSLTEVPL 229
+ S L+ G+ + S L++TP+++ S P + Y + LE I + NS L ++P
Sbjct: 209 GQDSSGILLFGESSFSWLKALKYTPLVQISTPLPYFDRVAYTVQLEGIKVANSML-QLPK 267
Query: 230 SLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLS--ILQSTITYYPRAKEVEE-----RT 282
S+ D G G +VDSGT +T L P Y+ L + + Q+ + K +E+ +
Sbjct: 268 SVYAPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEFVRQTKASL----KVLEDPNFVFQG 323
Query: 283 GFDLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQS 342
DLCYRVP T P++T F V + + S +V C F +
Sbjct: 324 AMDLCYRVPLTRRTLPP--LPTVTLMFRGAEMSVSAERLMYRVPGVIRGSDSVYCFTFGN 381
Query: 343 MDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASAQGL 389
+ S + G QQNV + +DL K R+GF + C G+
Sbjct: 382 SELLGV-ESYIIGHHHQQNVWMEFDLAKSRVGFAEVRCXLAGQRLGV 427
>gi|255581508|ref|XP_002531560.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223528821|gb|EEF30826.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 407
Score = 104 bits (260), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 109/404 (26%), Positives = 173/404 (42%), Gaps = 58/404 (14%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + +DTGS+L+W+ C + + F+ +RS S C+SS C N
Sbjct: 44 VSMVIDTGSELSWLYCNKTT---------TTTSYPTTFNQTRSISYRPIPCSSSTCTN-- 92
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
D + C ++L +T +Y + G L DT + S
Sbjct: 93 ---QTRDFSIPASCDSNSLCHATL---------SYADASSSEGNLASDTFHMGAS----- 135
Query: 122 REIPKFCFGCVGSTYR-------EPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYAN 174
+IP FGC+ S + + G+ G RG+LS SQ+GF + FS+C +
Sbjct: 136 -DIPGMVFGCMDSVFSSNSDEDSKNTGLMGMNRGSLSFVSQMGFPK--FSYCI------S 186
Query: 175 DPNISSPLVIGDVAISSKDNLQFTPMLK-SPMYPNY----YYIGLEAITIGNSSLTEVPL 229
+ S L++G+ + L +TP+++ S P + Y + LE I + + L +P
Sbjct: 187 GTDFSGMLLLGESNFTWAVPLNYTPLVQISTPLPYFDRIAYTVQLEGIKVSDR-LLPIPK 245
Query: 230 SLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEE---RTGFDL 286
S+ E D G G +VDSGT +T L P Y+ L S + T + R E + + DL
Sbjct: 246 SVFEPDHTGAGQTMVDSGTQFTFLLGPAYTALRSEFLNQTTGFLRVLEDPDFVFQGAMDL 305
Query: 287 CYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAP-SNSSAVKCLLFQSMDD 345
CYRVP P+++ F N + + Y + + +V CL F + D
Sbjct: 306 CYRVPISQRVLPR--LPTVSLVF-NGAEMTVADERVLYRVPGEIRGNDSVHCLSFGNSDL 362
Query: 346 GDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASAQGL 389
+ V G QQNV + +DLE+ RIG + C GL
Sbjct: 363 LGV-EAYVIGHHHQQNVWMEFDLERSRIGLAQVRCDLAGKRFGL 405
>gi|225216949|gb|ACN85242.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
Length = 519
Score = 104 bits (259), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 111/385 (28%), Positives = 154/385 (40%), Gaps = 68/385 (17%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFC--LNIH 61
V DTGSD TWV C C + + KL F P+RSS+ + +CA+ C LNIH
Sbjct: 195 VVFDTGSDTTWVQCQPCVVVCYE----QREKL---FDPARSSTYANVSCAAPACSDLNIH 247
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
GCS L + YG+G G DTL +
Sbjct: 248 ------------GCSGGHCL----------YGVQYGDGSYSIGFFAMDTLTLSS-----Y 280
Query: 122 REIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPN 177
+ F FGC + E G+ G GRG S+P Q G F+HC A
Sbjct: 281 DAVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSTG---- 336
Query: 178 ISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
+ L G ++++ TPML + P +YY+G+ I +G L +P S+
Sbjct: 337 -TGYLDFGAGSLAAARARLTTPML-TENGPTFYYVGMTGIRVGGQ-LLSIPQSVFA---- 389
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQL--LSILQSTITYYPRAKEVEERTGFDLCYRVPCPNN 295
G +VDSGT T LP YS L Y +A V D CY
Sbjct: 390 -TAGTIVDSGTVITRLPPAAYSSLRYAFAAAMAARGYKKAPAVSL---LDTCYDF----T 441
Query: 296 TFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFG 355
+ P+++ F L + YA SA + CL F + +DG G G+ G
Sbjct: 442 GMSQVAIPTVSLLFQGGARLDVDASGIMYAASA-----SQVCLAFAANEDG--GDVGIVG 494
Query: 356 SFQQQNVEVVYDLEKERIGFQPMDC 380
+ Q + V YD+ K+ +GF P C
Sbjct: 495 NTQLKTFGVAYDIGKKVVGFYPGAC 519
>gi|357481205|ref|XP_003610888.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512223|gb|AES93846.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 413
Score = 104 bits (259), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 102/385 (26%), Positives = 170/385 (44%), Gaps = 61/385 (15%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGSDL WV C C+ C N++ F P +SS+ + +C S C + +
Sbjct: 81 VDTGSDLIWVQC----VPCLGC----YNQINPMFDPLKSSTYTNISCDSPLCYKPYIGE- 131
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYG--EGGLVTGILTRDTLKVHGSSPGIIRE 123
C P YTYG + L G+L ++T+ + S+ G
Sbjct: 132 --------------------CSPEKRCDYTYGYADSSLTKGVLAQETVTLT-SNTGKPIS 170
Query: 124 IPKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLG--FLQKGFSHCFLAFKYANDPN 177
+ FGC G+ +G+ G G G S+ SQ+G F K FS C + F D
Sbjct: 171 LQGILFGCGHNNTGNFNDHEMGLIGLGGGPTSLVSQIGPLFGGKKFSQCLVPF--LTDIT 228
Query: 178 ISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
ISS + G + + + TP+++ YY+ L I++ ++ L P++ +
Sbjct: 229 ISSQMSFGKGSEVLGEGVVTTPLVQREQDMTSYYVTLLGISVEDTYL---PMN----STI 281
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTF 297
G +LVDSGT LP+ Y ++ +++ + P + G LCYR T
Sbjct: 282 EKGNMLVDSGTPPNILPQQLYDRVYVEVKNKVPLEPITDD--PSLGPQLCYR------TQ 333
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSF 357
T+ P++T+HF +L+L F + + V CL + + D G++G+F
Sbjct: 334 TNLKGPTLTYHF-EGANLLLTPIQTF--IPPTPETKGVFCLAITNCANSD---PGIYGNF 387
Query: 358 QQQNVEVVYDLEKERIGFQPMDCAS 382
Q N + +DL+++ + F+P DC
Sbjct: 388 AQTNYLIGFDLDRQIVSFKPTDCTK 412
>gi|449453872|ref|XP_004144680.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
CELL 1-like [Cucumis sativus]
Length = 757
Score = 104 bits (259), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 112/392 (28%), Positives = 169/392 (43%), Gaps = 52/392 (13%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGSDL W+ C C DC + N + P S S TC C + S D
Sbjct: 213 LDTGSDLNWIQC----VPCFDC--FEQNG--PYYDPKDSISFRNITCNDPRCQLVSSPDP 264
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI----I 121
P PC S CP F Y YG+ TG +T V+ +S
Sbjct: 265 P-RPCKFETQS------------CPYF-YWYGDSSNTTGDFALETFTVNLTSSTTGKSEF 310
Query: 122 REIPKFCFGC---VGSTYREPIGIAGFGRGALSVPSQLGFLQ-KGFSHCFLAFKYANDPN 177
R + FGC + G+ G GRG LS SQL L FS+C + +D +
Sbjct: 311 RRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLV--DRDSDTS 368
Query: 178 ISSPLVIG-DVAISSKDNLQFTPMLKSPMYP--NYYYIGLEAITIGNSSLTEVPLSLREF 234
+SS L+ G D + + L FT ++ P +YY+ +++I +G L ++P
Sbjct: 369 VSSKLIFGEDKDLLTHPELNFTSLIAGKENPVDTFYYLQIKSIFVGGEKL-QIPEENWNL 427
Query: 235 DSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPN 294
+ G GG ++DSGTT ++ +P Y + + Y K VE+ F + + PC N
Sbjct: 428 SADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGY---KLVED---FPILH--PCYN 479
Query: 295 NTFTDDL-FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGV 353
+ TD+L FP F + P N+F + + CL +M +
Sbjct: 480 VSGTDELNFPEFLIQFADGAVWNFPVENYFIRIQQLD----IVCL---AMLGTPKSALSI 532
Query: 354 FGSFQQQNVEVVYDLEKERIGFQPMDCASTAS 385
G++QQQN ++YD + R+G+ PM CA +
Sbjct: 533 IGNYQQQNFHILYDTKNSRLGYAPMRCAEIEA 564
>gi|357154085|ref|XP_003576664.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 509
Score = 104 bits (259), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 106/390 (27%), Positives = 161/390 (41%), Gaps = 57/390 (14%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ V DTGSDL+WV CG C Y+ + F+PS SS+ S C + C
Sbjct: 167 LTVVFDTGSDLSWVQCG----PCSSGGCYKQQDPL--FAPSDSSTFSAVRCGARECRARQ 220
Query: 62 S-SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
S +P D CP + YG+ G L DTL + +P
Sbjct: 221 SCGGSPGD------------------DRCP-YEVVYGDKSRTQGHLGNDTLTLGTMAPAN 261
Query: 121 I-----REIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQL-GFLQKGFSHCFLAFK 171
++P F FGC + + + G+ G GRG +S+ SQ G +GFS+C L
Sbjct: 262 ASAENDNKLPGFVFGCGENNTGLFGQADGLFGLGRGKVSLSSQAAGKFGEGFSYC-LPSS 320
Query: 172 YANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSL 231
++ P L +G + + + QFTPML P++YY+ L I + ++
Sbjct: 321 SSSAPGY---LSLG-TPVPAPAHAQFTPMLNRTTTPSFYYVKLVGIRVAGRAI------- 369
Query: 232 REFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVP 291
R + L+VDSGT T L Y L + S + Y K + D CY
Sbjct: 370 RVSSPRVALPLIVDSGTVITRLAPRAYRALRAAFLSAMGKYGY-KRAPRLSILDTCYDFT 428
Query: 292 CPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPS 351
N P++ F ++ + Y A CL F +GD +
Sbjct: 429 AHANATVS--IPAVALVFAGGATISVDFSGVLYVAKV-----AQACLAFAP--NGDGRSA 479
Query: 352 GVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
G+ G+ QQ+ + VVYD+ +++IGF C+
Sbjct: 480 GILGNTQQRTLAVVYDVARQKIGFAAKGCS 509
>gi|356569916|ref|XP_003553140.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 560
Score = 104 bits (259), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 110/386 (28%), Positives = 162/386 (41%), Gaps = 49/386 (12%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGSDL W+ C C C + N + P SSS TC C + S D
Sbjct: 212 LDTGSDLNWIQC----VPCYAC--FEQNGPY--YDPKDSSSFKNITCHDPRCQLVSSPDP 263
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPG---IIR 122
P PC + CP F Y YG+ TG +T V+ ++P ++
Sbjct: 264 P-QPCKGE------------TQSCPYF-YWYGDSSNTTGDFALETFTVNLTTPEGKPELK 309
Query: 123 EIPKFCFGC---VGSTYREPIGIAGFGRGALSVPSQLGFLQ-KGFSHCFLAFKYANDPNI 178
+ FGC + G+ G GRG LS +QL L FS+C + ++ ++
Sbjct: 310 IVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFATQLQSLYGHSFSYCLV--DRNSNSSV 367
Query: 179 SSPLVIG-DVAISSKDNLQFTPMLKSPMYP--NYYYIGLEAITIGNSSLTEVPLSLREFD 235
SS L+ G D + S NL FT + P +YY+ +++I +G L ++P
Sbjct: 368 SSKLIFGEDKELLSHPNLNFTSFVGGKENPVDTFYYVLIKSIMVGGEVL-KIPEETWHLS 426
Query: 236 SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNN 295
+QG GG ++DSGTT T+ EP Y + I +P VE CY V +
Sbjct: 427 AQGGGGTIIDSGTTLTYFAEPAYEIIKEAFMRKIKGFPL---VETFPPLKPCYNV----S 479
Query: 296 TFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFG 355
P F + P N+F + V CL + G
Sbjct: 480 GVEKMELPEFAILFADGAMWDFPVENYFIQIEPED----VVCLAILGTPRSALS---IIG 532
Query: 356 SFQQQNVEVVYDLEKERIGFQPMDCA 381
++QQQN ++YDL+K R+G+ PM CA
Sbjct: 533 NYQQQNFHILYDLKKSRLGYAPMKCA 558
>gi|302756119|ref|XP_002961483.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
gi|300170142|gb|EFJ36743.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
Length = 388
Score = 104 bits (259), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 97/396 (24%), Positives = 172/396 (43%), Gaps = 63/396 (15%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKL---MSNFSPSRSSSSSRDTCASSFCLNI 60
V +DTGSD+ WV C C C R + L ++ + P SS++S +C+
Sbjct: 17 VQVDTGSDVLWVNCR----PCSGCP--RKSALNIPLTMYDPRESSTTSLVSCS------- 63
Query: 61 HSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHG-SSPG 119
DP + G + S C + ++YG+G G RD ++ + SS G
Sbjct: 64 -------DPLCVRGRRFAEAQCSQATNNC-EYIFSYGDGSTSEGYYVRDAMQYNVISSNG 115
Query: 120 IIREIPKFCFGC-------VGSTYREPIGIAGFGRGALSVPSQLGFLQ---KGFSHCFLA 169
+ + FGC + ++ + GI GFG+ LSVP+QL Q + FSHC
Sbjct: 116 LANTTSQVLFGCSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIPRVFSHCLEG 175
Query: 170 FKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPL 229
K + ++ + +TP++ ++ Y + L I++ ++ L P+
Sbjct: 176 EKRGGGILVI--------GGIAEPGMTYTPLVPDSVH---YNVVLRGISVNSNRL---PI 221
Query: 230 SLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYR 289
+F S + G+++DSGTT + P Y+ + ++ + P + + F + R
Sbjct: 222 DAEDFSSTNDTGVIMDSGTTLAYFPSGAYNVFVQAIREATSATPVRVQGMDTQCFLVSGR 281
Query: 290 VPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYG 349
+ DLFP++T +F + P + +AP+ ++ V C+ +QS G
Sbjct: 282 L--------SDLFPNVTLNFEGGAMELQPDNYLMWGGTAPTGTTDVWCIGWQS-SSSSAG 332
Query: 350 PSG-----VFGSFQQQNVEVVYDLEKERIGFQPMDC 380
P + G ++ VVYDL+ RIG+ +C
Sbjct: 333 PKDGSQLTILGDIVLKDKLVVYDLDNSRIGWMSYNC 368
>gi|225449446|ref|XP_002283126.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 436
Score = 104 bits (259), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 113/398 (28%), Positives = 170/398 (42%), Gaps = 67/398 (16%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + +DTGS+L+W+ C + L S F P RSSS S C S C
Sbjct: 76 VTMVLDTGSELSWLHCK------------KAPNLHSVFDPLRSSSYSPIPCTSPTC---- 119
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
D C K C S Y + + G L DT + S+
Sbjct: 120 -RTRTRDFSIPVSCD-----KKKLCHAIIS----YADASSIEGNLASDTFHIGNSA---- 165
Query: 122 REIPKFCFGCVGSTYR-------EPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYAN 174
IP FGC+ S + + G+ G RG+LS +Q+G LQK FS+C +
Sbjct: 166 --IPATIFGCMDSGFSSNSDEDSKTTGLIGMNRGSLSFVTQMG-LQK-FSYCI------S 215
Query: 175 DPNISSPLVIGDVAISSKDNLQFTPMLK-SPMYPNY----YYIGLEAITIGNSSLTEVPL 229
+ S L+ G+ + S L++TP+++ S P + Y + LE I + NS L ++P
Sbjct: 216 GQDSSGILLFGESSFSWLKALKYTPLVQISTPLPYFDRVAYTVQLEGIKVANSML-QLPK 274
Query: 230 SLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLS--ILQSTITYYPRAKEVEE-----RT 282
S+ D G G +VDSGT +T L P Y+ L + + Q+ + K +E+ +
Sbjct: 275 SVYAPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEFVRQTKASL----KVLEDPNFVFQG 330
Query: 283 GFDLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQS 342
DLCYRVP T P++T F V + + S +V C F +
Sbjct: 331 AMDLCYRVPLTRRTLPP--LPTVTLMFRGAEMSVSAERLMYRVPGVIRGSDSVYCFTFGN 388
Query: 343 MDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
+ S + G QQNV + +DL K R+GF + C
Sbjct: 389 SELLGV-ESYIIGHHHQQNVWMEFDLAKSRVGFAEVRC 425
>gi|413953782|gb|AFW86431.1| hypothetical protein ZEAMMB73_038825 [Zea mays]
Length = 462
Score = 104 bits (259), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 102/384 (26%), Positives = 166/384 (43%), Gaps = 63/384 (16%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ DTGSD++W+ C+ C + + F P++S++ S C C
Sbjct: 135 LMFDTGSDVSWI-------QCLPCSGHCYKQHDPIFDPTKSATYSAVPCGHPQCAAAGGK 187
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
C+ +G L + YG+G G+L+ +TL + + R
Sbjct: 188 ------CSSNGTCL--------------YKVQYGDGSSTAGVLSHETLSLTSA-----RA 222
Query: 124 IPKFCFGCVGST----YREPIGIAGFGRGALSVPSQLGFLQKGFS-HCFLAFKYANDPNI 178
+P F FGC G T + + G+ G GRG LS+ SQ +C ++ ++
Sbjct: 223 LPGFAFGC-GETNLGDFGDVDGLIGLGRGQLSLSSQAAASFGAAFSYCLPSYNTSH---- 277
Query: 179 SSPLVIGDVA-ISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
L IG S D +++T M++ YP++Y++ L +I +G L P+
Sbjct: 278 -GYLTIGTTTPASGSDGVRYTAMIQKQDYPSFYFVDLVSIVVGGFVLPVPPILFTR---- 332
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTF 297
G L+DSGT T+LP Y+ L + T+T Y A + FD CY N F
Sbjct: 333 --DGTLLDSGTVLTYLPPEAYTALRDRFKFTMTQYKPAPAYDP---FDTCYDFAGQNAIF 387
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSS-AVKCLLFQSMDDGDYGPSGVFGS 356
P ++F F + S L + F + P +++ A CL F + P + G+
Sbjct: 388 ----MPLVSFKFSDGSSFDL---SPFGVLIFPDDTAPATGCLAF--VPRPSTMPFTIVGN 438
Query: 357 FQQQNVEVVYDLEKERIGFQPMDC 380
QQ+N E++YD+ E+IGF C
Sbjct: 439 TQQRNTEMIYDVAAEKIGFVSGSC 462
>gi|15226315|ref|NP_180368.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4510415|gb|AAD21501.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252975|gb|AEC08069.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 396
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 105/386 (27%), Positives = 167/386 (43%), Gaps = 75/386 (19%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
IQ +DTGS++TW C C+ C Y N + F PS+SS+ C C
Sbjct: 78 IQAIIDTGSEITWTQC----LPCVHC--YEQNAPI--FDPSKSSTFKEKRCDGHSC---- 125
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
P++ F +TY G L T +T+ +H +S G
Sbjct: 126 ----PYE--------------------VDYFDHTYTMGTLAT-----ETITLHSTS-GEP 155
Query: 122 REIPKFCFGC-VGSTYREPI--GIAGFGRGALSVPSQLGFLQKGF-SHCFLAFKYANDPN 177
+P+ GC +++ +P G+ G G S+ +Q+G G S+CF
Sbjct: 156 FVMPETIIGCGHNNSWFKPSFSGMVGLNWGPSSLITQMGGEYPGLMSYCF-------SGQ 208
Query: 178 ISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
+S + G AI + D + T M + P +YY+ L+A+++GN+ + + + +
Sbjct: 209 GTSKINFGANAIVAGDGVVSTTMFMTTAKPGFYYLNLDAVSVGNTRIETMGTTFHALE-- 266
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFD-LCYRVPCPNNT 296
G +++DSGTT T+ P + + + ++ +T A + TG D LCY N+
Sbjct: 267 --GNIVIDSGTTLTYFPVSYCNLVRQAVEHVVT----AVRAADPTGNDMLCY------NS 314
Query: 297 FTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGS 356
T D+FP IT HF V LVL + Y M SN+ V CL +FG+
Sbjct: 315 DTIDIFPVITMHFSGGVDLVLDK----YNMYMESNNGGVFCLAIICNSPTQ---EAIFGN 367
Query: 357 FQQQNVEVVYDLEKERIGFQPMDCAS 382
Q N V YD + F P +C++
Sbjct: 368 RAQNNFLVGYDSSSLLVSFSPTNCSA 393
>gi|297828736|ref|XP_002882250.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297328090|gb|EFH58509.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 488
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 110/399 (27%), Positives = 174/399 (43%), Gaps = 77/399 (19%)
Query: 3 QVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHS 62
V +DTGSD+ WV C C+ C + ++ + SS++ +C+ +FC S
Sbjct: 99 HVQVDTGSDILWVNCAG----CIRCPRKSDLVELTPYDADASSTAKSVSCSDNFC----S 150
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVH-------- 114
N C SG STC + YG+G G L RD + +
Sbjct: 151 YVNQRSEC-HSG--------STC-----QYVILYGDGSSTNGYLVRDVVHLDLVTGNRQT 196
Query: 115 GSSPGIIREIPKFCFGC-------VGSTYREPIGIAGFGRGALSVPSQL---GFLQKGFS 164
GS+ G I FGC +G + GI GFG+ S SQL G +++ F+
Sbjct: 197 GSTNGTI------IFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKVKRSFA 250
Query: 165 HCFLAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSL 224
HC ++ N IG+V +S K ++ TPML + Y + L AI +GNS L
Sbjct: 251 HCL------DNNNGGGIFAIGEV-VSPK--VKTTPMLSKSAH---YSVNLNAIEVGNSVL 298
Query: 225 TEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGF 284
LS FDS + G+++DSGTT +LP+ Y+ L++ + ++ + T F
Sbjct: 299 ---QLSSDAFDSGDDKGVIIDSGTTLVYLPDAVYNPLMNQILASHQELNLHTVQDSFTCF 355
Query: 285 DLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMD 344
R+ D FP++TF F +VSL + + + + + C +Q+
Sbjct: 356 HYIDRL---------DRFPTVTFQFDKSVSLAVYPQEYLFQVREDT-----WCFGWQNGG 401
Query: 345 DGDYGPSG--VFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
G + + G N VVYD+E + IG+ +C+
Sbjct: 402 LQTKGGASLTILGDMALSNKLVVYDIENQVIGWTNHNCS 440
>gi|195627138|gb|ACG35399.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 431
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 103/381 (27%), Positives = 153/381 (40%), Gaps = 52/381 (13%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DT +D TW C CD S F P+ SSS + CAS +C
Sbjct: 96 LDTSADATWS-------HCAPCDTCPAG---SRFIPASSSSYASLPCASDWCPLFEG--- 142
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
PC + + + L +P FA T + L + DTL++ + I
Sbjct: 143 --QPCPANQDASAPLPACAFSKP---FADTSFQASLGS-----DTLRLGKDA------IA 186
Query: 126 KFCFGCVGS-----TYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNIS 179
+ FGCVG+ T G+ G GRG +S+ SQ G G FS+C +++ S
Sbjct: 187 GYAFGCVGAVAGPTTNLPKQGLLGLGRGPMSLLSQTGSTYNGVFSYCLPSYRSYY---FS 243
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
L +G A N+++TP+L +P P+ YY+ + +++G + +VP FD
Sbjct: 244 GSLRLG--AAGQPRNVRYTPLLTNPHRPSLYYVNVTGLSVGR-TWVKVPAGSFAFDPATG 300
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTD 299
G ++DSGT T P Y+ L + + FD C+ +
Sbjct: 301 AGTVIDSGTVITRWTAPVYAALREEFRRQVA---APSGYTSLGAFDTCFN----TDEVAA 353
Query: 300 DLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQ 359
P +T H V L LP N SA + + CL V + QQ
Sbjct: 354 GGAPPVTLHMDGGVDLTLPMENTLIHSSA----TPLACLAMAEAPQNVNAVVNVVANLQQ 409
Query: 360 QNVEVVYDLEKERIGFQPMDC 380
QNV VV D+ R+GF C
Sbjct: 410 QNVRVVVDVAGSRVGFAREPC 430
>gi|449519146|ref|XP_004166596.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 752
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 112/392 (28%), Positives = 169/392 (43%), Gaps = 52/392 (13%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGSDL W+ C C DC + N + P S S TC C + S D
Sbjct: 213 LDTGSDLNWIQC----VPCFDC--FEQNG--PYYDPKDSISFRNITCNDPRCQLVSSPDP 264
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI----I 121
P PC S CP F Y YG+ TG +T V+ +S
Sbjct: 265 P-RPCKFETQS------------CPYF-YWYGDSSNTTGDFALETFTVNLTSSTTGKSEF 310
Query: 122 REIPKFCFGC---VGSTYREPIGIAGFGRGALSVPSQLGFLQ-KGFSHCFLAFKYANDPN 177
R + FGC + G+ G GRG LS SQL L FS+C + +D +
Sbjct: 311 RRVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLV--DRDSDTS 368
Query: 178 ISSPLVIG-DVAISSKDNLQFTPMLKSPMYP--NYYYIGLEAITIGNSSLTEVPLSLREF 234
+SS L+ G D + + L FT ++ P +YY+ +++I +G L ++P
Sbjct: 369 VSSKLIFGEDKDLLTHPELNFTSLIAGKENPVDTFYYLQIKSIFVGGEKL-QIPEENWNL 427
Query: 235 DSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPN 294
+ G GG ++DSGTT ++ +P Y + + Y K VE+ F + + PC N
Sbjct: 428 SADGAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGY---KLVED---FPILH--PCYN 479
Query: 295 NTFTDDL-FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGV 353
+ TD+L FP F + P N+F + + CL +M +
Sbjct: 480 VSGTDELNFPEFLIQFADGAVWNFPVENYFIRIQQLD----IVCL---AMLGTPKSALSI 532
Query: 354 FGSFQQQNVEVVYDLEKERIGFQPMDCASTAS 385
G++QQQN ++YD + R+G+ PM CA +
Sbjct: 533 IGNYQQQNFHILYDTKNSRLGYAPMRCAEIEA 564
>gi|194698750|gb|ACF83459.1| unknown [Zea mays]
gi|194703964|gb|ACF86066.1| unknown [Zea mays]
gi|219886221|gb|ACL53485.1| unknown [Zea mays]
gi|219886359|gb|ACL53554.1| unknown [Zea mays]
gi|223950085|gb|ACN29126.1| unknown [Zea mays]
gi|414865218|tpg|DAA43775.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 431
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 103/381 (27%), Positives = 153/381 (40%), Gaps = 52/381 (13%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DT +D TW C CD S F P+ SSS + CAS +C
Sbjct: 96 LDTSADATWS-------HCAPCDTCPAG---SRFIPASSSSYASLPCASDWCPLFEG--- 142
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
PC + + + L +P FA T + L + DTL++ + I
Sbjct: 143 --QPCPANQDASAPLPACAFSKP---FADTSFQASLGS-----DTLRLGKDA------IA 186
Query: 126 KFCFGCVGS-----TYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNIS 179
+ FGCVG+ T G+ G GRG +S+ SQ G G FS+C +++ S
Sbjct: 187 GYAFGCVGAVAGPTTNLPKQGLLGLGRGPMSLLSQTGSRYNGVFSYCLPSYRSYY---FS 243
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
L +G A N+++TP+L +P P+ YY+ + +++G + +VP FD
Sbjct: 244 GSLRLG--AAGQPRNVRYTPLLTNPHRPSLYYVNVTGLSVGR-TWVKVPAGSFAFDPATG 300
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTD 299
G ++DSGT T P Y+ L + + FD C+ +
Sbjct: 301 AGTVIDSGTVITRWTAPVYAALREEFRRQVA---APSGYTSLGAFDTCFN----TDEVAA 353
Query: 300 DLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQ 359
P +T H V L LP N SA + + CL V + QQ
Sbjct: 354 GGAPPVTLHMDGGVDLTLPMENTLIHSSA----TPLACLAMAEAPQNVNAVVNVVANLQQ 409
Query: 360 QNVEVVYDLEKERIGFQPMDC 380
QNV VV D+ R+GF C
Sbjct: 410 QNVRVVVDVAGSRVGFAREPC 430
>gi|212722554|ref|NP_001131154.1| uncharacterized protein LOC100192462 precursor [Zea mays]
gi|194690728|gb|ACF79448.1| unknown [Zea mays]
Length = 431
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 103/381 (27%), Positives = 153/381 (40%), Gaps = 52/381 (13%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DT +D TW C CD S F P+ SSS + CAS +C
Sbjct: 96 LDTSADATWS-------HCAPCDTCPAG---SRFIPASSSSYASLPCASDWCPLFEG--- 142
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
PC + + + L +P FA T + L + DTL++ + I
Sbjct: 143 --QPCPANQDASAPLPACAFSKP---FADTSFQASLGS-----DTLRLGKDA------IA 186
Query: 126 KFCFGCVGS-----TYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNIS 179
+ FGCVG+ T G+ G GRG +S+ SQ G G FS+C +++ S
Sbjct: 187 GYAFGCVGAVAGPTTNLPKQGLLGLGRGPMSLLSQTGSRYNGVFSYCLPSYRSYY---FS 243
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
L +G A N+++TP+L +P P+ YY+ + +++G + +VP FD
Sbjct: 244 GSLRLG--AAGQPRNVRYTPLLTNPHRPSLYYVNVTGLSVGR-TWVKVPAGSFAFDPATG 300
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTD 299
G ++DSGT T P Y+ L + + FD C+ +
Sbjct: 301 AGTVIDSGTVITRWTAPVYAALREEFRRQVA---APSGYTSLGAFDTCFN----TDEVAA 353
Query: 300 DLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQ 359
P +T H V L LP N SA + + CL V + QQ
Sbjct: 354 GGAPPVTLHMDGGVDLTLPMENTLIHSSA----TPLACLAMAEAPQNVNAVVNVVANLQQ 409
Query: 360 QNVEVVYDLEKERIGFQPMDC 380
QNV VV D+ R+GF C
Sbjct: 410 QNVRVVVDVAGSRVGFAREPC 430
>gi|255543383|ref|XP_002512754.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223547765|gb|EEF49257.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 414
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 102/386 (26%), Positives = 169/386 (43%), Gaps = 57/386 (14%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ V +DTGSDLTWV C C C N+ F+PS S S C SS C ++
Sbjct: 78 MTVIVDTGSDLTWVQCQ----PCRLC----YNQQDPLFNPSGSPSYQTILCNSSTCQSLQ 129
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
+ C + P ++ YG+G G L + L + +
Sbjct: 130 YATGNLGVCGSN-------------TPTCNYVVNYGDGSYTRGDLGMEQLNLGTT----- 171
Query: 122 REIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPN 177
+ F FGC + + G+ G G+ LS+ SQ + +G FS+C +
Sbjct: 172 -HVSNFIFGCGRNNKGLFGGASGLMGLGKSDLSLVSQTSAIFEGVFSYCLPT----TAAD 226
Query: 178 ISSPLVIGDVAISSKDN--LQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFD 235
S L++G + K+ + +T M+ +P P +Y++ L I+IG +L + P + R+
Sbjct: 227 ASGSLILGGNSSVYKNTTPISYTRMIANPQLPTFYFLNLTGISIGGVAL-QAP-NYRQ-- 282
Query: 236 SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNN 295
G+L+DSGT T LP P Y L + + +P A D C+ + N
Sbjct: 283 ----SGILIDSGTVITRLPPPVYRDLKAEFLKQFSGFPSAPPFSI---LDTCFNL----N 331
Query: 296 TFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFG 355
+ + P+I F N L + FY + +++S V CL S+ D P + G
Sbjct: 332 GYDEVDIPTIRMQFEGNAELTVDVTGIFYFVK--TDASQV-CLALASLSFDDEIP--IIG 386
Query: 356 SFQQQNVEVVYDLEKERIGFQPMDCA 381
++QQ+N V+Y+ ++ ++GF C+
Sbjct: 387 NYQQRNQRVIYNTKESKLGFAAEACS 412
>gi|302776054|ref|XP_002971323.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
gi|300161305|gb|EFJ27921.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
Length = 395
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 97/396 (24%), Positives = 172/396 (43%), Gaps = 63/396 (15%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKL---MSNFSPSRSSSSSRDTCASSFCLNI 60
V +DTGSD+ WV C C C R + L ++ + P SS++S +C+
Sbjct: 44 VQVDTGSDVLWVNC----RPCSGCP--RKSALNIPLTMYDPRESSTTSLVSCS------- 90
Query: 61 HSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHG-SSPG 119
DP + G + S C + ++YG+G G RD ++ + SS G
Sbjct: 91 -------DPLCVRGRRFAEAQCSQTTNNC-EYIFSYGDGSTSEGYYVRDAMQYNVISSNG 142
Query: 120 IIREIPKFCFGC-------VGSTYREPIGIAGFGRGALSVPSQLGFLQ---KGFSHCFLA 169
+ + FGC + ++ + GI GFG+ LSVP+QL Q + FSHC
Sbjct: 143 LANTTSQVLFGCSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIPRVFSHCLEG 202
Query: 170 FKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPL 229
K + ++ + +TP++ ++ Y + L I++ ++ L P+
Sbjct: 203 EKRGGGILVI--------GGIAEPGMTYTPLVPDSVH---YNVVLRGISVNSNRL---PI 248
Query: 230 SLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYR 289
+F S + G+++DSGTT + P Y+ + ++ + P + + F + R
Sbjct: 249 DAEDFSSTNDTGVIMDSGTTLAYFPSGAYNVFVQAIREATSATPVRVQGMDTQCFLVSGR 308
Query: 290 VPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYG 349
+ DLFP++T +F + P + +AP+ ++ V C+ +QS G
Sbjct: 309 L--------SDLFPNVTLNFEGGAMELQPDNYLMWGGTAPTGTTDVWCIGWQS-SSSSAG 359
Query: 350 PSG-----VFGSFQQQNVEVVYDLEKERIGFQPMDC 380
P + G ++ VVYDL+ RIG+ +C
Sbjct: 360 PKDGSQLTILGDIVLKDKLVVYDLDNSRIGWMSYNC 395
>gi|108707839|gb|ABF95634.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 330
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 85/287 (29%), Positives = 132/287 (45%), Gaps = 28/287 (9%)
Query: 92 FAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIPKFCFGC----VGSTYREPIGIAGFGR 147
+ Y Y + + TG++ D G +P FGC G GIAGFGR
Sbjct: 64 YTYYYNDKSVTTGLIEVDKFTF-----GAGASVPGVAFGCGLFNNGVFKSNETGIAGFGR 118
Query: 148 GALSVPSQLGFLQKGFSHCFLAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYP 207
G LS+PSQL FSHCF A + L D+ + + +Q TP++++ P
Sbjct: 119 GPLSLPSQLKV--GNFSHCFTAVNGLKQSTVLLDLP-ADLYKNGRGAVQSTPLIQNSANP 175
Query: 208 NYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQS 267
+YY+ L+ IT+G++ L VP S + G GG ++DSGT+ T LP Y + +
Sbjct: 176 TFYYLSLKGITVGSTRL-PVPESAFAL-TNGTGGTIIDSGTSITSLPPQVYQVVRDEFAA 233
Query: 268 TITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMS 327
I P TG C+ P + P + HF ++ LP+ N+ + +
Sbjct: 234 QIK-LPVVP--GNATGPYTCFSAP----SQAKPDVPKLVLHF-EGATMDLPRENYVFEVP 285
Query: 328 APSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIG 374
+ +S + CL D+ + + G+FQQQN+ V+YDL+ G
Sbjct: 286 DDAGNSII-CLAINKGDE-----TTIIGNFQQQNMHVLYDLQNMHRG 326
>gi|242089623|ref|XP_002440644.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
gi|241945929|gb|EES19074.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
Length = 469
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 107/391 (27%), Positives = 157/391 (40%), Gaps = 78/391 (19%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ DTGSDL W C D S++ P+ SS+ +R C+ C +
Sbjct: 113 LTALADTGSDLIWTKC--------DAGGGAAWGGSSSYHPNASSTFTRLPCSDRLCAALR 164
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGG---LVTGILTRDTLKVHGSSP 118
S C G + Y YG G G L +T + G +
Sbjct: 165 SYS--LARCAAGGAECD-------------YKYAYGLGDDPDFTQGFLGSETFTLGGDA- 208
Query: 119 GIIREIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYAND 175
+P FGC + Y E G+ G GRG LS+ SQL F +C A D
Sbjct: 209 -----VPGVGFGCTTALEGDYGEGAGLVGLGRGPLSLVSQLD--AGTFMYCLTA-----D 256
Query: 176 PNISSPLVIGDVAI--SSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLRE 233
+ +SPL+ G +A + +Q T +L S +Y + L +ITIG+++ V
Sbjct: 257 ASKASPLLFGALATMTGAGAGVQSTGLLAST---TFYAVNLRSITIGSATTAGVGGPGGV 313
Query: 234 FDSQGNGGLLVDSGTTYTHLPEPFYSQL-LSILQSTITYYPRAKEVEERTGFDLCYRVPC 292
+ DSGTT T+L EP Y++ + L T + P VE R GF+ CY P
Sbjct: 314 ---------VFDSGTTLTYLAEPAYTEAKAAFLSQTTSLTP----VEGRYGFEACYEKPD 360
Query: 293 PNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPS- 351
L P++ HF + LP N+ + V C + Q PS
Sbjct: 361 SAR-----LIPAMVLHFDGGADMALPVANYVVEV-----DDGVVCWVVQR------SPSL 404
Query: 352 GVFGSFQQQNVEVVYDLEKERIGFQPMDCAS 382
+ G+ Q N V++D+ K + FQP +C S
Sbjct: 405 SIIGNIMQMNYLVLHDVRKSVLSFQPANCDS 435
>gi|356556809|ref|XP_003546713.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-1-like [Glycine max]
Length = 444
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 96/373 (25%), Positives = 154/373 (41%), Gaps = 62/373 (16%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
MDT +D WVPC C+ C + F+P +S++ + C +S C + N
Sbjct: 123 MDTSNDAAWVPCT----ACVGCST------TTPFAPPKSTTFKKVGCGASQCKQVR---N 169
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
P C S C+ F +TYG V L +DT+ + + P +P
Sbjct: 170 PT--CDGSACA---------------FNFTYGTSS-VAASLVQDTVTL-ATDP-----VP 205
Query: 126 KFCFGCV----GSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSP 181
+ FGC+ GS+ + + Q FS+C +FK N
Sbjct: 206 AYTFGCIQKATGSSLPPQGLLGLGRGPLSLLAQTQKLYQSTFSYCLPSFKTLNFSGHX-- 263
Query: 182 LVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGG 241
D+ ++ Q P K+P + YY+ L AI +G + ++P F+ G
Sbjct: 264 ----DLXPVAQPRDQVYPSFKNPRRSSLYYVNLVAIRVGRR-IVDIPPEALAFNPXTGAG 318
Query: 242 LLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDL 301
+ DSGT +T L EP Y+ + + + ++ + + V GFD CY VP +
Sbjct: 319 TVFDSGTVFTRLVEPAYTAVRNEFRRRVSVH-KKLTVTSLGGFDTCYTVPI--------V 369
Query: 302 FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQN 361
P+ITF F + +++ LP N S + +V CL D V + QQQN
Sbjct: 370 APTITFMF-SGMNVTLPPDNILIH----STAGSVTCLAMAPAPDNVNSVLNVIANMQQQN 424
Query: 362 VEVVYDLEKERIG 374
V++D+ R+G
Sbjct: 425 HRVLFDVPNSRLG 437
>gi|356557203|ref|XP_003546907.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 470
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 112/389 (28%), Positives = 168/389 (43%), Gaps = 58/389 (14%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ V +DTGSDLTWV C C C Y N + F PS S S C S+ C ++
Sbjct: 133 MSVIVDTGSDLTWVQCE----PCRSC--YNQNGPL--FKPSTSPSYQPILCNSTTCQSLE 184
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
DP T + C + YG+G +G L + L G S
Sbjct: 185 LGACGSDPSTSATCD---------------YVVNYGDGSYTSGELGIEKLGFGGIS---- 225
Query: 122 REIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPN 177
+ F FGC + + G+ G GR LS+ SQ G FS+C + +
Sbjct: 226 --VSNFVFGCGRNNKGLFGGASGLMGLGRSELSMISQTNATFGGVFSYCLPS---TDQAG 280
Query: 178 ISSPLVIGDVAISSKDN--LQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFD 235
S LV+G+ + K+ + +T ML + N+Y + L I +G SL +
Sbjct: 281 ASGSLVMGNQSGVFKNVTPIAYTRMLPNLQLSNFYILNLTGIDVGGVSLH------VQAS 334
Query: 236 SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNN 295
S GNGG+++DSGT + L Y L + + +P A GF + C N
Sbjct: 335 SFGNGGVILDSGTVISRLAPSVYKALKAKFLEQFSGFPSAP------GFSILD--TCFNL 386
Query: 296 TFTDDL-FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVF 354
T D + P+I+ +F N L + FY + ++S V CL S+ D +Y G+
Sbjct: 387 TGYDQVNIPTISMYFEGNAELNVDATGIFYLVK--EDASRV-CLALASLSD-EY-EMGII 441
Query: 355 GSFQQQNVEVVYDLEKERIGFQPMDCAST 383
G++QQ+N V+YD + ++GF C T
Sbjct: 442 GNYQQRNQRVLYDAKLSQVGFAKEPCTFT 470
>gi|242070719|ref|XP_002450636.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
gi|241936479|gb|EES09624.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
Length = 410
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 109/387 (28%), Positives = 166/387 (42%), Gaps = 82/387 (21%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ DTGSDL W CG C C + ++ P++SSS S+ C+ S C
Sbjct: 95 LSALADTGSDLIWAKCGA----CTRCVPQGS----PSYYPNKSSSFSKLPCSGSLC---- 142
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGG----LVTGILTRDTLKVHGSS 117
SD P C+ G + Y+YG G L +T + +
Sbjct: 143 -SDLPSSQCSAGGAECD-------------YKYSYGLASDPHHYTQGYLGSETFTLGSDA 188
Query: 118 PGIIREIPKFCFGCV---GSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYAN 174
+P FGC Y G+ G GRG LS+ SQL FS+C +
Sbjct: 189 ------VPGIGFGCTTMSEGGYGSGSGLVGLGRGPLSLVSQLNV--GAFSYCL-----TS 235
Query: 175 DPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREF 234
D +SPL+ G A++ +Q TP+L++ Y YY + LE+I+IG ++
Sbjct: 236 DAAKTSPLLFGSGALTGA-GVQSTPLLRTSTY--YYTVNLESISIGAATTA--------- 283
Query: 235 DSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPN 294
G+ G++ DSGTT L EP Y+ + S T A R G+++C++
Sbjct: 284 -GTGSSGIIFDSGTTVAFLAEPAYTLAKEAVLSQTTNLTMA---SGRDGYEVCFQT---- 335
Query: 295 NTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPS-GV 353
+ +FPS+ HF + + LP N+F A+ +V C + Q PS +
Sbjct: 336 ---SGAVFPSMVLHF-DGGDMDLPTENYFGAVD-----DSVSCWIVQK------SPSLSI 380
Query: 354 FGSFQQQNVEVVYDLEKERIGFQPMDC 380
G+ Q N + YD+EK + FQP +C
Sbjct: 381 VGNIMQMNYHIRYDVEKSMLSFQPANC 407
>gi|115445765|ref|NP_001046662.1| Os02g0314600 [Oryza sativa Japonica Group]
gi|46391044|dbj|BAD15987.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
Japonica Group]
gi|113536193|dbj|BAF08576.1| Os02g0314600 [Oryza sativa Japonica Group]
gi|125581836|gb|EAZ22767.1| hypothetical protein OsJ_06441 [Oryza sativa Japonica Group]
gi|215697168|dbj|BAG91162.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 514
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 109/390 (27%), Positives = 162/390 (41%), Gaps = 50/390 (12%)
Query: 3 QVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHS 62
Q+ MDTGSDL W+ C C+DC + R F P+ S S TC C +
Sbjct: 166 QMIMDTGSDLNWLQCA----PCLDCFEQRG----PVFDPAASLSYRNVTCGDPRCGLVAP 217
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIR 122
P C + PCP + Y YG+ TG L + V+ ++PG R
Sbjct: 218 ------PTAPRAC------RRPHSDPCPYY-YWYGDQSNTTGDLALEAFTVNLTAPGASR 264
Query: 123 EIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQL-GFLQKGFSHCFLAFKYANDPNI 178
+ FGC S + G+ G GRGALS SQL FS+C + + ++
Sbjct: 265 RVDDVVFGCGHSNRGLFHGAAGLLGLGRGALSFASQLRAVYGHAFSYCLVD----HGSSV 320
Query: 179 SSPLVIGDV-AISSKDNLQFTPMLKSPMYP--NYYYIGLEAITIGNSSLTEVPLSLREFD 235
S +V GD A+ L +T S +YY+ L+ + +G L P S +
Sbjct: 321 GSKIVFGDDDALLGHPRLNYTAFAPSAAAAADTFYYVQLKGVLVGGEKLNISP-STWDVG 379
Query: 236 SQGNGGLLVDSGTTYTHLPEPFYSQLL-SILQSTITYYPRAKEVEERTGFDLCYRVPCPN 294
G+GG ++DSGTT ++ EP Y + + ++ YP D PC N
Sbjct: 380 KDGSGGTIIDSGTTLSYFAEPAYEVIRRAFVERMDKAYPLVA--------DFPVLSPCYN 431
Query: 295 NTFTDDL-FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGV 353
+ + + P + F + P N+F + + + CL +
Sbjct: 432 VSGVERVEVPEFSLLFADGAVWDFPAENYFVRL----DPDGIMCLAVLGTPRSAMS---I 484
Query: 354 FGSFQQQNVEVVYDLEKERIGFQPMDCAST 383
G+FQQQN V+YDL+ R+GF P CA
Sbjct: 485 IGNFQQQNFHVLYDLQNNRLGFAPRRCAEV 514
>gi|224090744|ref|XP_002309070.1| predicted protein [Populus trichocarpa]
gi|222855046|gb|EEE92593.1| predicted protein [Populus trichocarpa]
Length = 404
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 112/397 (28%), Positives = 170/397 (42%), Gaps = 65/397 (16%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMS---NFSPSRSSSSSRDTCASSFCL 58
+ + +DTGS+L+W+ C NK +S F P+RS+S C+S C
Sbjct: 44 VSMVIDTGSELSWLHC---------------NKTLSYPTTFDPTRSTSYQTIPCSSPTCT 88
Query: 59 NIHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSP 118
N + D P + C + L +T +Y + G L D + GSS
Sbjct: 89 N-RTQDFPIP----ASCDSNNLCHAT---------LSYADASSSDGNLASDVFHI-GSS- 132
Query: 119 GIIREIPKFCFGCVGSTYR-------EPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFK 171
+I FGC+ S + + G+ G RG+LS SQLGF + FS+C
Sbjct: 133 ----DISGLVFGCMDSVFSSNSDEDSKSTGLMGMNRGSLSFVSQLGFPK--FSYCI---- 182
Query: 172 YANDPNISSPLVIGDVAISSKDNLQFTPMLK-SPMYPNY----YYIGLEAITIGNSSLTE 226
+ + S L++G+ ++ L +TP+++ S P + Y + LE I + + L
Sbjct: 183 --SGTDFSGLLLLGESNLTWSVPLNYTPLIQISTPLPYFDRVAYTVQLEGIKVLDK-LLP 239
Query: 227 VPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEE---RTG 283
+P S E D G G +VDSGT +T L P Y+ L S + + R E + +
Sbjct: 240 IPKSTFEPDHTGAGQTMVDSGTQFTFLLGPVYNALRSAFLNQTSSVLRVLEDPDFVFQGA 299
Query: 284 FDLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSM 343
DLCY VP L P++T F V + + +V CL F +
Sbjct: 300 MDLCYLVPLSQRVLP--LLPTVTLVFRGAEMTVSGDRVLYRVPGELRGNDSVHCLSFGNS 357
Query: 344 DDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
D + V G QQNV + +DLEK RIG + C
Sbjct: 358 DLLGV-EAYVIGHHHQQNVWMEFDLEKSRIGLAQVRC 393
>gi|125539168|gb|EAY85563.1| hypothetical protein OsI_06935 [Oryza sativa Indica Group]
Length = 514
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 109/390 (27%), Positives = 162/390 (41%), Gaps = 50/390 (12%)
Query: 3 QVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHS 62
Q+ MDTGSDL W+ C C+DC + R F P+ S S TC C +
Sbjct: 166 QMIMDTGSDLNWLQCA----PCLDCFEQRG----PVFDPATSLSYRNVTCGDPRCGLVAP 217
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIR 122
P C + PCP + Y YG+ TG L + V+ ++PG R
Sbjct: 218 ------PTAPRAC------RRPHSDPCPYY-YWYGDQSNTTGDLALEAFTVNLTAPGASR 264
Query: 123 EIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQL-GFLQKGFSHCFLAFKYANDPNI 178
+ FGC S + G+ G GRGALS SQL FS+C + + ++
Sbjct: 265 RVDDVVFGCGHSNRGLFHGAAGLLGLGRGALSFASQLRAVYGHAFSYCLVD----HGSSV 320
Query: 179 SSPLVIGDV-AISSKDNLQFTPMLKSPMYP--NYYYIGLEAITIGNSSLTEVPLSLREFD 235
S +V GD A+ L +T S +YY+ L+ + +G L P S +
Sbjct: 321 GSKIVFGDDDALLGHPRLNYTAFAPSAAAAADTFYYVQLKGVLVGGEKLNISP-STWDVG 379
Query: 236 SQGNGGLLVDSGTTYTHLPEPFYSQLL-SILQSTITYYPRAKEVEERTGFDLCYRVPCPN 294
G+GG ++DSGTT ++ EP Y + + ++ YP D PC N
Sbjct: 380 KDGSGGTIIDSGTTLSYFAEPAYEVIRRAFVERMDKAYPLVA--------DFPVLSPCYN 431
Query: 295 NTFTDDL-FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGV 353
+ + + P + F + P N+F + + + CL +
Sbjct: 432 VSGVERVEVPEFSLLFADGAVWDFPAENYFVRL----DPDGIMCLAVLGTPRSAMS---I 484
Query: 354 FGSFQQQNVEVVYDLEKERIGFQPMDCAST 383
G+FQQQN V+YDL+ R+GF P CA
Sbjct: 485 IGNFQQQNFHVLYDLQNNRLGFAPRRCAEV 514
>gi|15231625|ref|NP_191467.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|15983376|gb|AAL11556.1|AF424562_1 AT3g59080/F17J16_130 [Arabidopsis thaliana]
gi|7529751|emb|CAB86936.1| putative protein [Arabidopsis thaliana]
gi|20466704|gb|AAM20669.1| putative protein [Arabidopsis thaliana]
gi|23198236|gb|AAN15645.1| putative protein [Arabidopsis thaliana]
gi|332646352|gb|AEE79873.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 535
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 109/387 (28%), Positives = 167/387 (43%), Gaps = 51/387 (13%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGSDL W+ C C DC ++ N + P S+S TC C N+ SS +
Sbjct: 187 LDTGSDLNWIQC----LPCYDC--FQQNGAF--YDPKASASYKNITCNDQRC-NLVSSPD 237
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE-- 123
P PC S CP + Y YG+ TG +T V+ ++ G E
Sbjct: 238 PPMPCKSDNQS------------CP-YYYWYGDSSNTTGDFAVETFTVNLTTNGGSSELY 284
Query: 124 -IPKFCFGC---VGSTYREPIGIAGFGRGALSVPSQLGFLQ-KGFSHCFLAFKYANDPNI 178
+ FGC + G+ G GRG LS SQL L FS+C + +D N+
Sbjct: 285 NVENMMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLV--DRNSDTNV 342
Query: 179 SSPLVIG-DVAISSKDNLQFTPML--KSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFD 235
SS L+ G D + S NL FT + K + +YY+ +++I + L +P
Sbjct: 343 SSKLIFGEDKDLLSHPNLNFTSFVAGKENLVDTFYYVQIKSILVAGEVLN-IPEETWNIS 401
Query: 236 SQGNGGLLVDSGTTYTHLPEPFYSQLLS-ILQSTITYYPRAKEVEERTGFDLCYRVPCPN 294
S G GG ++DSGTT ++ EP Y + + I + YP ++ D C+ V +
Sbjct: 402 SDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRDFPI---LDPCFNVSGIH 458
Query: 295 NTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVF 354
N P + F + P N F ++ + CL + +
Sbjct: 459 NV----QLPELGIAFADGAVWNFPTENSFIWLNED-----LVCLAMLGTPKSAFS---II 506
Query: 355 GSFQQQNVEVVYDLEKERIGFQPMDCA 381
G++QQQN ++YD ++ R+G+ P CA
Sbjct: 507 GNYQQQNFHILYDTKRSRLGYAPTKCA 533
>gi|383130042|gb|AFG45741.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
Length = 155
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 60/147 (40%), Positives = 81/147 (55%), Gaps = 11/147 (7%)
Query: 182 LVIGDVAISSKDNLQFTPML-----KSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
LV+GD A+ ++ +L +TP L S Y +YYI L ++IG L +P L FDS
Sbjct: 2 LVLGDKALPTEMSLNYTPFLINTKASSSGYHTFYYIDLRGVSIGRKRL-NLPSKLFSFDS 60
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNT 296
+GNGG ++DSGTT+T E FY + + S I + RA EVE RTG LCY V ++
Sbjct: 61 KGNGGTIIDSGTTFTIFNEEFYKNITAAFASQIG-FRRASEVEARTGMRLCYNVSGVDHV 119
Query: 297 FTDDLFPSITFHFLNNVSLVLPQGNHF 323
L P FHF +VLP N+F
Sbjct: 120 ----LLPDFAFHFKGGSDMVLPVANYF 142
>gi|242045564|ref|XP_002460653.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
gi|241924030|gb|EER97174.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
Length = 525
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 106/399 (26%), Positives = 168/399 (42%), Gaps = 56/399 (14%)
Query: 3 QVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHS 62
++ MDTGSDL W+ C C+DC + R F P+ SSS TC C ++
Sbjct: 165 RMIMDTGSDLNWLQCA----PCLDCFEQRG----PVFDPAASSSYRNVTCGDHRCGHVAP 216
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRP----CPSFAYTYGEGGLVTGILTRDTLKVHGSSP 118
TC RP CP + Y YG+ TG L ++ V+ ++P
Sbjct: 217 PP-----------EPEASSPRTCRRPGEDPCPYY-YWYGDQSNTTGDLALESFTVNLTAP 264
Query: 119 GIIREIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQL-GFLQKGFSHCFLAFKYAN 174
G R + FGC + G+ G GRG LS SQL FS+C + +
Sbjct: 265 GASRRVDGVVFGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRAVYGHTFSYCLVD----H 320
Query: 175 DPNISSPLVIGD----VAISSKDNLQFTPMLKSPMYP----NYYYIGLEAITIGNSSLTE 226
++ S +V G+ +A+++ L++T + +YY+ L+ + +G L
Sbjct: 321 GSDVGSKVVFGEDDDALALAAHPQLKYTAFAPASSSSSPADTFYYVKLKGVLVGGE-LLN 379
Query: 227 VPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITY-YPRAKEVEERTGFD 285
+ + G+GG ++DSGTT ++ EP Y + ++ YP E +
Sbjct: 380 ISSDTWDVGKDGSGGTIIDSGTTLSYFVEPAYQVIRHAFMDRMSRSYPLVPEFPVLS--- 436
Query: 286 LCYRVPCPNNTFTDD-LFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMD 344
PC N + + P ++ F + P N+F + P S + + +
Sbjct: 437 -----PCYNVSGVERPEVPELSLLFADGAVWDFPAENYFIRLD-PDGGSIMCLAVLGTPR 490
Query: 345 DGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCAST 383
G + G+FQQQN VVYDL+ R+GF P CA
Sbjct: 491 TG----MSIIGNFQQQNFHVVYDLQNNRLGFAPRRCAEV 525
>gi|115468912|ref|NP_001058055.1| Os06g0610800 [Oryza sativa Japonica Group]
gi|51091964|dbj|BAD35493.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|113596095|dbj|BAF19969.1| Os06g0610800 [Oryza sativa Japonica Group]
gi|218198535|gb|EEC80962.1| hypothetical protein OsI_23679 [Oryza sativa Indica Group]
Length = 519
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 108/383 (28%), Positives = 152/383 (39%), Gaps = 67/383 (17%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V DTGSD TWV C C + + KL F P+ SS+ + +CA+ C ++
Sbjct: 198 VVFDTGSDTTWVQCQPCVVACYE----QREKL---FDPASSSTYANVSCAAPACSDLD-- 248
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
+SGCS L + YG+G G DTL +
Sbjct: 249 --------VSGCSGGHCL----------YGVQYGDGSYSIGFFAMDTLTLSS-----YDA 285
Query: 124 IPKFCFGCVGST---YREPIGIAGFGRGALSVPSQL-GFLQKGFSHCFLAFKYANDPNIS 179
+ F FGC + E G+ G GRG S+P Q G F+HC P S
Sbjct: 286 VKGFRFGCGERNDGLFGEAAGLLGLGRGKTSLPVQTYGKYGGVFAHCL--------PARS 337
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
+ D S TPML P +YY+G+ I +G L P++ F + G
Sbjct: 338 TGTGYLDFGAGSPPATTTTPMLTGNG-PTFYYVGMTGIRVGGRLL---PIAPSVFAAAGT 393
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITY--YPRAKEVEERTGFDLCYRVPCPNNTF 297
+VDSGT T LP YS L S + + Y +A V D CY
Sbjct: 394 ---IVDSGTVITRLPPAAYSSLRSAFAAAMAARGYRKAAAVSL---LDTCYDF----TGM 443
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSF 357
+ P+++ F +L + Y +SA CL F +DG G G+ G+
Sbjct: 444 SQVAIPTVSLLFQGGAALDVDASGIMYTVSASQ-----VCLAFAGNEDG--GDVGIVGNT 496
Query: 358 QQQNVEVVYDLEKERIGFQPMDC 380
Q + V YD+ K+ +GF P C
Sbjct: 497 QLKTFGVAYDIGKKVVGFSPGAC 519
>gi|255541796|ref|XP_002511962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223549142|gb|EEF50631.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 495
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 105/380 (27%), Positives = 162/380 (42%), Gaps = 61/380 (16%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DTGSD+ W+ C C DC Y+ + + F+P+ SSS S TC S C ++
Sbjct: 174 MVLDTGSDINWIQCQ----PCSDC--YQQSDPI--FTPAASSSYSPLTCDSQQCNSLQ-- 223
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
MS C ++ CR + YG+G G +T+ GS G +
Sbjct: 224 --------MSSC------RNGQCR----YQVNYGDGSFTFGDFVTETMSFGGS--GTVNS 263
Query: 124 IPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNI---SS 180
I C + G+ G G G LS+ SQL FS+C + A + S+
Sbjct: 264 IALGCGHDNEGLFVGAAGLLGLGGGPLSLTSQLK--ATSFSYCLVNRDSAASSTLDFNSA 321
Query: 181 PLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNG 240
P +GD I+ P+LKS +YY+GL +++G L +P + + D G+G
Sbjct: 322 P--VGDSVIA--------PLLKSSKIDTFYYVGLSGMSVGGE-LLRIPQEVFKLDDSGDG 370
Query: 241 GLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDD 300
G++VD GT T L Y+ S+ S ++ + FD CY + ++
Sbjct: 371 GVIVDCGTAITRLQSEAYN---SLRDSFVSMSRHLRSTSGVALFDTCYDLSGQSSV---- 423
Query: 301 LFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQ 360
P+++FHF S LP N+ P +S+ C F + G+ QQQ
Sbjct: 424 KVPTVSFHFDGGKSWDLPAANYLI----PVDSAGTYCFAFAPTTSS----LSIIGNVQQQ 475
Query: 361 NVEVVYDLEKERIGFQPMDC 380
V +DL R+GF C
Sbjct: 476 GTRVSFDLANNRVGFSTNKC 495
>gi|293335955|ref|NP_001168399.1| uncharacterized protein LOC100382168 precursor [Zea mays]
gi|223948009|gb|ACN28088.1| unknown [Zea mays]
gi|413922066|gb|AFW61998.1| hypothetical protein ZEAMMB73_694403 [Zea mays]
Length = 507
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 114/392 (29%), Positives = 166/392 (42%), Gaps = 61/392 (15%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ V +DTGSDLTWV C C C R+ F P+ S++ + C +S C
Sbjct: 161 LTVIVDTGSDLTWVQCK----PCSACYAQRDPL----FDPAGSATYAAVRCNASAC---- 208
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
+D+ G ST S C +A YG+G G+L DT+ + G+S G
Sbjct: 209 -ADSLRAATGTPGSCGSTGAGSEKCY----YALAYGDGSFSRGVLATDTVALGGASLG-- 261
Query: 122 REIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPN 177
F FGC S + G+ G GR LS+ SQ G FS+C A + D +
Sbjct: 262 ----GFVFGCGLSNRGLFGGTAGLMGLGRTELSLVSQTASRYGGVFSYCLPA-ATSGDAS 316
Query: 178 ISSPLVIGDVAISSKDN---LQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREF 234
S L GD A SS N + +T M+ P P +Y++ + +G ++L L
Sbjct: 317 GSLSLGGGDDAASSYRNTTPVAYTRMIADPAQPPFYFLNVTGAAVGGTALAAQGL----- 371
Query: 235 DSQGNGGLLVDSGTTYTHLPEPFYSQLLS--ILQSTITYYPRAKEVEERTGF---DLCYR 289
G +L+DSGT T L Y + + + Q YP A GF D CY
Sbjct: 372 ---GASNVLIDSGTVITRLAPSVYRAVRAEFMRQFGAAGYPAAP------GFSILDTCYD 422
Query: 290 VPCPNNTFTDDL-FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDY 348
+ T D++ P +T + + + + + S V CL S+ D
Sbjct: 423 L-----TGHDEVKVPLLTLRLEGGADVTVDAAGMLFVVR--KDGSQV-CLAMASLSYEDE 474
Query: 349 GPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
P + G++QQ+N VVYD R+GF DC
Sbjct: 475 TP--IIGNYQQKNKRVVYDTLGSRLGFADEDC 504
>gi|145693994|gb|ABP93697.1| unknown protein isoform 2 [Lemna minor]
Length = 351
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 114/383 (29%), Positives = 165/383 (43%), Gaps = 69/383 (18%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V DTGSD+ W+ C + C Y + + F PS SS+ +C C+ + +
Sbjct: 31 VVFDTGSDVNWLQCKPCAVRC-----YAQQEPL--FDPSLSSTYRNVSCTEPACVGLSTR 83
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
GCS ST L + YG+G G L DT + +P ++
Sbjct: 84 ----------GCSSSTCL----------YGVFYGDGSSTIGFLAMDTFML---TPA--QK 118
Query: 124 IPKFCFGCVGST---YREPIGIAGFGRGAL-SVPSQLG-FLQKGFSHCFLAFKYANDPNI 178
F FGC + ++ G+ G GR + S+ SQ+ L FS+C P+
Sbjct: 119 FKNFIFGCGQNNTGLFQGTAGLVGLGRSSTYSLNSQVAPSLGNVFSYCL--------PST 170
Query: 179 SSPLVIGDVAISSKDNL-QFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
SS G + I + N +T ML P Y+I L I++G + L+ LS F S
Sbjct: 171 SS--ATGYLNIGNPQNTPGYTAMLTDTRVPTLYFIDLIGISVGGTRLS---LSSTVFQSV 225
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTF 297
G ++DSGT T LP YS L + +++ +T Y A V T D CY +
Sbjct: 226 GT---IIDSGTVITRLPPTAYSALKTAVRAAMTQYTLAPAV---TILDTCYDF----SRT 275
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSF 357
T ++P I HF + + +P F+ NSS V CL F D G+ G+
Sbjct: 276 TSVVYPVIVLHF-AGLDVRIPATGVFFVF----NSSQV-CLAFAGNTDSTM--IGIIGNV 327
Query: 358 QQQNVEVVYDLEKERIGFQPMDC 380
QQ +EV YD E +RIGF C
Sbjct: 328 QQLTMEVTYDNELKRIGFSAGAC 350
>gi|225216995|gb|ACN85284.1| aspartic proteinase nepenthesin-1 precursor [Oryza australiensis]
Length = 519
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 109/383 (28%), Positives = 156/383 (40%), Gaps = 64/383 (16%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V DTGSD TWV C C + + KL F P+RSS+ + +CA+ C ++ +
Sbjct: 195 VVFDTGSDTTWVQCQPCVVVCYE----QREKL---FDPARSSTYANISCAAPACSDLDTR 247
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
GCS L + YG+G G DTL +
Sbjct: 248 ----------GCSGGNCL----------YGVQYGDGSYSIGFFAMDTLTLSS-----YDA 282
Query: 124 IPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNIS 179
+ F FGC + E G+ G GRG S+P Q G F+HC A +
Sbjct: 283 VKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSSG-----T 337
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
L G + ++ TPML + P +YY+G+ I +G L +P S+ F + G
Sbjct: 338 GYLDFGPGSPAAAGARLTTPML-TDNGPTFYYVGMTGIRVGGQ-LLSIPQSV--FTTAGT 393
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITY--YPRAKEVEERTGFDLCYRVPCPNNTF 297
+VDSGT T LP YS L S S + Y +A V D CY
Sbjct: 394 ---IVDSGTVITRLPPAAYSSLRSAFASAMAARGYKKAPAVSL---LDTCYDF----TGM 443
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSF 357
+ P+++ F L + YA S + CL F + +DG G G+ G+
Sbjct: 444 SQVAIPTVSLLFQGGARLDVDASGIMYAASV-----SQVCLGFAANEDG--GDVGIVGNT 496
Query: 358 QQQNVEVVYDLEKERIGFQPMDC 380
Q + V YD+ K+ +GF P C
Sbjct: 497 QLKTFGVAYDIGKKVVGFSPGAC 519
>gi|225216879|gb|ACN85177.1| aspartic proteinase nepenthesin-1 precursor [Oryza nivara]
gi|225216896|gb|ACN85193.1| aspartic proteinase nepenthesin-1 precursor [Oryza rufipogon]
Length = 515
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 108/383 (28%), Positives = 152/383 (39%), Gaps = 67/383 (17%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V DTGSD TWV C C + + KL F P+ SS+ + +CA+ C ++
Sbjct: 194 VVFDTGSDTTWVQCQPCVVACYE----QREKL---FDPASSSTYANVSCAAPACSDLD-- 244
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
+SGCS L + YG+G G DTL +
Sbjct: 245 --------VSGCSGGHCL----------YGVQYGDGSYSIGFFAMDTLTLSS-----YDA 281
Query: 124 IPKFCFGCVGST---YREPIGIAGFGRGALSVPSQL-GFLQKGFSHCFLAFKYANDPNIS 179
+ F FGC + E G+ G GRG S+P Q G F+HC P S
Sbjct: 282 VKGFRFGCGERNDGLFGEAAGLLGLGRGKTSLPVQTYGKYGGVFAHCL--------PARS 333
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
+ D S TPML P +YY+G+ I +G L P++ F + G
Sbjct: 334 TGTGYLDFGAGSPPATTTTPMLTGNG-PTFYYVGMTGIRVGGRLL---PIAPSVFAAAGT 389
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITY--YPRAKEVEERTGFDLCYRVPCPNNTF 297
+VDSGT T LP YS L S + + Y +A V D CY
Sbjct: 390 ---IVDSGTVITRLPPAAYSSLRSAFAAAMAARGYRKAAAVSL---LDTCYDF----TGM 439
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSF 357
+ P+++ F +L + Y +SA CL F +DG G G+ G+
Sbjct: 440 SQVAIPTVSLLFQGGAALDVDASGIMYTVSASQ-----VCLAFAGNEDG--GDVGIVGNT 492
Query: 358 QQQNVEVVYDLEKERIGFQPMDC 380
Q + V YD+ K+ +GF P C
Sbjct: 493 QLKTFGVAYDIGKKVVGFSPGAC 515
>gi|297805038|ref|XP_002870403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316239|gb|EFH46662.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 440
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 116/388 (29%), Positives = 178/388 (45%), Gaps = 64/388 (16%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
I DTGSDL W C C DC ++ F P SS+ +C+SS C +
Sbjct: 107 IMAIADTGSDLLWTQCK----PCDDC----YTQVDPLFDPKASSTYKDVSCSSSQCTALE 158
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
+ + CS +TC S++ +YG+ G + DTL + GS+
Sbjct: 159 N---------QASCSTE---DNTC-----SYSTSYGDRSYTKGNIAVDTLTL-GSTDTRP 200
Query: 122 REIPKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDP 176
++ GC G+ ++ GI G G GA+S+ +QLG G FS+C + ND
Sbjct: 201 VQLKNIIIGCGHNNAGTFNKKGSGIVGLGGGAVSLITQLGDSIDGKFSYCLVPLTSENDR 260
Query: 177 NISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
+S + G A+ S + TP++ +YY+ L++I++G+ + + P S DS
Sbjct: 261 --TSKINFGTNAVVSGTGVVSTPLIAKSQ-ETFYYLTLKSISVGSKEV-QYPGS----DS 312
Query: 237 -QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNN 295
G G +++DSGTT T LP FYS+L + S+I K+ + +TG LCY
Sbjct: 313 GSGEGNIIIDSGTTLTLLPTEFYSELEDAVASSI---DAEKKQDPQTGLSLCYSA----- 364
Query: 296 TFTDDL-FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPS-GV 353
T DL P+IT HF + + L N F +S + C F+ PS +
Sbjct: 365 --TGDLKVPAITMHF-DGADVNLKPSNCFVQISED-----LVCFAFRG------SPSFSI 410
Query: 354 FGSFQQQNVEVVYDLEKERIGFQPMDCA 381
+G+ Q N V YD + + F+P DCA
Sbjct: 411 YGNVAQMNFLVGYDTVSKTVSFKPTDCA 438
>gi|359482287|ref|XP_002263129.2| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
vinifera]
gi|297740017|emb|CBI30199.3| unnamed protein product [Vitis vinifera]
Length = 502
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 116/397 (29%), Positives = 187/397 (47%), Gaps = 68/397 (17%)
Query: 3 QVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKL---MSNFSPSRSSSSSRDTCASSFCLN 59
V +DTGSD+ WV C + C DC R + L +S F PS SS++S +C+ C +
Sbjct: 100 NVQIDTGSDILWVTCNS----CNDCP--RTSGLGIELSFFDPSSSSTTSLVSCSHPICTS 153
Query: 60 IHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTL---KVHGS 116
+ + T + CS +S C S+++ YG+G TG D L V G
Sbjct: 154 LVQT-------TAAECSP----QSNQC----SYSFHYGDGSGTTGYYVSDMLYFDTVLGD 198
Query: 117 SPGIIREIPKFCFGCVGSTY---------REPIGIAGFGRGALSVPSQL---GFLQKGFS 164
S I FGC STY + GI GFG+ LSV SQL G K FS
Sbjct: 199 SL-IANSSASIVFGC--STYQSGDLTKVDKAIDGIFGFGQQDLSVVSQLSSLGITPKVFS 255
Query: 165 HCFLAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSL 224
HC + + LV+G++ + N+ ++P++ S ++Y + L++I++ L
Sbjct: 256 HCL-----KGEGDGGGKLVLGEIL---EPNIIYSPLVPSQ---SHYNLNLQSISVNGQLL 304
Query: 225 TEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGF 284
P+ F + N G +VDSGTT T+L E Y +S + +T++ + +
Sbjct: 305 ---PIDPAVFATSNNQGTIVDSGTTLTYLVETAYDPFVSAITATVS----SSTTPVLSKG 357
Query: 285 DLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMD 344
+ CY V +T D++FP ++ +F S+VL G + + S+ +A+ C+ FQ +
Sbjct: 358 NQCYLV----STSVDEIFPPVSLNFAGGASMVLKPGEYLMHLGF-SDGAAMWCIGFQKVA 412
Query: 345 DGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
+ + G ++ VYDL +RIG+ DC+
Sbjct: 413 EPGI---TILGDLVLKDKIFVYDLAHQRIGWANYDCS 446
>gi|357500973|ref|XP_003620775.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|357500991|ref|XP_003620784.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355495790|gb|AES76993.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355495799|gb|AES77002.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 438
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 110/383 (28%), Positives = 174/383 (45%), Gaps = 58/383 (15%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGSD+ W+ C C C N+ F+PS+SSS +C+S C ++ +
Sbjct: 104 VDTGSDIVWLQCE----PCEQC----YNQTTPKFNPSKSSSYKNISCSSKLCQSVRDT-- 153
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
S K C ++ YG G L+ +TL + S+ G P
Sbjct: 154 ------------SCNDKKNC-----EYSINYGNQSHSQGDLSLETLTLE-STTGRPVSFP 195
Query: 126 KFCFGC----VGSTYREPIGIAGFGRGALSVPSQLG-FLQKGFSHCF--LAFKYANDPNI 178
K GC +GS R G+ G G G S+ +QLG + FS+C ++ N
Sbjct: 196 KTVIGCGTNNIGSFKRVSSGVVGLGGGPASLITQLGPSIGGKFSYCLVRMSITLKNMSMG 255
Query: 179 SSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQG 238
SS L GDVAI S N+ TP++K + +YY+ +EA ++G+ + E S + +
Sbjct: 256 SSKLNFGDVAIVSGHNVLSTPIVKKD-HSFFYYLTIEAFSVGDKRV-EFAGSSKGVE--- 310
Query: 239 NGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFT 298
G +++DS T T +P Y++L S + +T R + ++ F LCY V
Sbjct: 311 EGNIIIDSSTIVTFVPSDVYTKLNSAIVDLVTL-ERVDDPNQQ--FSLCYNVSSDE---- 363
Query: 299 DDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQ 358
+ FP +T HF ++L N F ++ V C F + G +FGSF
Sbjct: 364 EYDFPYMTAHF-KGADILLYATNTFVEVARD-----VLCFAFAPSNGG-----AIFGSFS 412
Query: 359 QQNVEVVYDLEKERIGFQPMDCA 381
QQ+ V YDL+++ + F+ +DC
Sbjct: 413 QQDFMVGYDLQQKTVSFKSVDCT 435
>gi|297817208|ref|XP_002876487.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
lyrata]
gi|297322325|gb|EFH52746.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
lyrata]
Length = 520
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 109/387 (28%), Positives = 166/387 (42%), Gaps = 51/387 (13%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGSDL W+ C C DC ++ N + P S+S TC C N+ S +
Sbjct: 172 LDTGSDLNWIQC----LPCHDC--FQQNGAF--YDPKASASYKNITCNDPRC-NLVSPPD 222
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE-- 123
P PC S CP + Y YG+ TG +T V+ ++ G E
Sbjct: 223 PPKPCKSDNQS------------CP-YYYWYGDSSNTTGDFAVETFTVNLTTSGGSSELY 269
Query: 124 -IPKFCFGC---VGSTYREPIGIAGFGRGALSVPSQLGFLQ-KGFSHCFLAFKYANDPNI 178
+ FGC + G+ G GRG LS SQL L FS+C + +D N+
Sbjct: 270 NVENMMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLV--DRNSDTNV 327
Query: 179 SSPLVIG-DVAISSKDNLQFTPML--KSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFD 235
SS L+ G D + S NL FT + K + +YY+ +++I + L +P
Sbjct: 328 SSKLIFGEDKDLLSHPNLNFTSFVARKENLVDTFYYVQIKSIIVAGEVLN-IPEETWNIS 386
Query: 236 SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNN 295
S G GG ++DSGTT ++ EP Y +++ I + K R D PC N
Sbjct: 387 SDGAGGTIIDSGTTLSYFAEPAYE----FIKNKIAEKAKGKYPVYR---DFPILDPCFNV 439
Query: 296 TFTDDL-FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVF 354
+ D + P + F + P N F ++ + CL + +
Sbjct: 440 SGIDSIQLPELGIAFADGAVWNFPTENSFIWLN-----EDLVCLAILGTPKSAFS---II 491
Query: 355 GSFQQQNVEVVYDLEKERIGFQPMDCA 381
G++QQQN ++YD ++ R+G+ P CA
Sbjct: 492 GNYQQQNFHILYDTKRSRLGYAPTKCA 518
>gi|449515031|ref|XP_004164553.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 430
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 112/386 (29%), Positives = 168/386 (43%), Gaps = 76/386 (19%)
Query: 7 DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDNP 66
DTGSDLTW C C+ C +L F+P +S+S S C + C H+ D+
Sbjct: 110 DTGSDLTWAQC----LPCLKC----YQQLRPIFNPLKSTSFSHVPCNTQTC---HAVDD- 157
Query: 67 FDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIPK 126
C + G ++YTYG+ G L + + + SS K
Sbjct: 158 -GHCGVQGVC--------------DYSYTYGDRTYSKGDLGFEKITIGSSSV-------K 195
Query: 127 FCFGCVGST---YREPIGIAGFGRGALSVPSQLG---FLQKGFSHCF-LAFKYANDPNIS 179
GC ++ + G+ G G G LS+ SQ+ + + FS+C +AN
Sbjct: 196 SVIGCGHASSGGFGFASGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHAN----- 250
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
+ G+ A+ S + TP++ S YYYI LEAI+IGN F QGN
Sbjct: 251 GKINFGENAVVSGPGVVSTPLI-SKNTVTYYYITLEAISIGNERHMA-------FAKQGN 302
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTG-FDLCYRVPCPNNTFT 298
+++DSGTT T LP+ Y ++S L + +AK V++ G DLC+ N
Sbjct: 303 --VIIDSGTTLTILPKELYDGVVSSLLKVV----KAKRVKDPHGSLDLCFDDGI--NAAA 354
Query: 299 DDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPS---GVFG 355
P IT HF ++ L N F ++ V CL ++ P+ G+ G
Sbjct: 355 SLGIPVITAHFSGGANVNLLPINTFRKVA-----DNVNCLTLKAAS-----PTTEFGIIG 404
Query: 356 SFQQQNVEVVYDLEKERIGFQPMDCA 381
+ Q N + YDLE +R+ F+P CA
Sbjct: 405 NLAQANFLIGYDLEAKRLSFKPTVCA 430
>gi|225216930|gb|ACN85225.1| aspartic proteinase nepenthesin-1 precursor [Oryza punctata]
gi|225216938|gb|ACN85232.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
Length = 516
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 110/386 (28%), Positives = 151/386 (39%), Gaps = 72/386 (18%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V DTGSD TWV C C + + KL F P+RSS+ + +CA+ C ++ +
Sbjct: 194 VVFDTGSDTTWVQCQPCVVVCYE----QREKL---FDPARSSTYANVSCAAPACSDLDTR 246
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
GCS L + YG+G G DTL +
Sbjct: 247 ----------GCSGGHCL----------YGVQYGDGSYSIGFFAMDTLTLSS-----YDA 281
Query: 124 IPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYAN---DP 176
+ F FGC + E G+ G GRG S+P Q G F+HC A D
Sbjct: 282 VKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSTGTGYLDF 341
Query: 177 NISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
SP L TPML P +YY+GL I +G L +P S+
Sbjct: 342 GAGSPAA----------RLTTTPMLVDNG-PTFYYVGLTGIRVGGR-LLYIPQSVFA--- 386
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITY--YPRAKEVEERTGFDLCYRVPCPN 294
G +VDSGT T LP YS L S + ++ Y +A V D CY
Sbjct: 387 --TAGTIVDSGTVITRLPPAAYSSLRSAFAAAMSARGYKKAPAVSL---LDTCYDFA--- 438
Query: 295 NTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVF 354
+ P+++ F L + YA SA CL F + +DG G G+
Sbjct: 439 -GMSQVAIPTVSLLFQGGARLDVDASGIMYAASASQ-----VCLAFAANEDG--GDVGIV 490
Query: 355 GSFQQQNVEVVYDLEKERIGFQPMDC 380
G+ Q + V YD+ K+ + F P C
Sbjct: 491 GNTQLKTFGVAYDIGKKVVSFSPGAC 516
>gi|414876414|tpg|DAA53545.1| TPA: hypothetical protein ZEAMMB73_483039 [Zea mays]
Length = 506
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 106/387 (27%), Positives = 159/387 (41%), Gaps = 67/387 (17%)
Query: 3 QVYM--DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNI 60
Q+YM DTGSD+TWV C C DC Y+ + + F PS S+S + +C S C +
Sbjct: 178 QLYMVLDTGSDVTWVQCQ----PCADC--YQQSDPV--FDPSLSASYAAVSCDSQRCRD- 228
Query: 61 HSSDNPFDPCTMSGCSLSTLLKSTCCRPCPS---FAYTYGEGGLVTGILTRDTLKVHGSS 117
L + CR + YG+G G +TL + S+
Sbjct: 229 --------------------LDTAACRNATGACLYEVAYGDGSYTVGDFATETLTLGDST 268
Query: 118 PGIIREIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYAN 174
P + GC + G+ G G LS PSQ+ FS+C +
Sbjct: 269 P-----VGNVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQIS--ASTFSYCLV----DR 317
Query: 175 DPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREF 234
D +S L GD A ++ P+++SP +YY+ L I++G L+ +P S
Sbjct: 318 DSPAASTLQFGDGA--AEAGTVTAPLVRSPRTSTFYYVALSGISVGGQPLS-IPASAFAM 374
Query: 235 DS-QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCP 293
D+ G+GG++VDSGT T L Y+ L PR V FD CY +
Sbjct: 375 DATSGSGGVIVDSGTAVTRLQSAAYAALRDAFVQGAPSLPRTSGVSL---FDTCYDL--- 428
Query: 294 NNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGV 353
+ T P+++ F +L LP N+ P + + CL F + +
Sbjct: 429 -SDRTSVEVPAVSLRFEGGGALRLPAKNYLI----PVDGAGTYCLAFAPTN----AAVSI 479
Query: 354 FGSFQQQNVEVVYDLEKERIGFQPMDC 380
G+ QQQ V +D + +GF P C
Sbjct: 480 IGNVQQQGTRVSFDTARGAVGFTPNKC 506
>gi|297805182|ref|XP_002870475.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316311|gb|EFH46734.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 480
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 107/406 (26%), Positives = 182/406 (44%), Gaps = 80/406 (19%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDT-CASSFCLNIHS 62
V +DTGSD+ WV C C C + + + S++SS+S++ C +FC
Sbjct: 92 VQVDTGSDILWVNCA----PCPKCPVKTDLGIPLSLYDSKASSTSKNVGCEDAFC----- 142
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCC---RPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPG 119
S +++S C +PC S+ YG+G G +D + + + G
Sbjct: 143 ---------------SFIMQSETCGAKKPC-SYHVVYGDGSTSDGDFVKDNITLDQVT-G 185
Query: 120 IIREIP---KFCFGC-------VGSTYREPIGIAGFGRGALSVPSQL---GFLQKGFSHC 166
+R P + FGC +G T GI GFG+ SV SQL G +++ FSHC
Sbjct: 186 NLRTAPLAQEVVFGCGKNQSGQLGQTESAVDGIMGFGQSNTSVISQLAAGGSVKRIFSHC 245
Query: 167 FLAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKS-PMYPN--YYYIGLEAITIGNSS 223
++ N IG+V +P++K+ P+ PN +Y + L+ + +
Sbjct: 246 L------DNMNGGGIFAIGEVE---------SPVVKTTPLVPNQVHYNVILKGMDVDGEP 290
Query: 224 LTEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTG 283
+ ++P SL + G+GG ++DSGTT +LP+ Y+ L+ + T + V+E
Sbjct: 291 I-DLPPSLAS--TNGDGGTIIDSGTTLAYLPQNLYNSLIEKI--TAKQQVKLHMVQETFA 345
Query: 284 FDLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQS- 342
C+ + TD FP + HF +++ L + ++ +++ + C +QS
Sbjct: 346 ---CFSF----TSNTDKAFPVVNLHFEDSLKLSVYPHDYLFSLRED-----MYCFGWQSG 393
Query: 343 -MDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASAQ 387
M D + G N VVYDLE E IG+ +C+S+ +
Sbjct: 394 GMTTQDGADVILLGDLVLSNKLVVYDLENEVIGWADHNCSSSIKVK 439
>gi|356505293|ref|XP_003521426.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 499
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 105/393 (26%), Positives = 179/393 (45%), Gaps = 62/393 (15%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKL-MSNFSPSRSSSSSRDTCASSFCLNIHS 62
V +DTGSD+ W+ C C +C + + F + SS+++ +CA C
Sbjct: 98 VQIDTGSDILWINC----ITCSNCPHSSGLGIELDFFDTAGSSTAALVSCADPICSYAVQ 153
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTL----------K 112
+ SGCS + C S+ + YG+G TG DT+
Sbjct: 154 T-------ATSGCS-------SQANQC-SYTFQYGDGSGTTGYYVSDTMYFDTVLLGQSM 198
Query: 113 VHGSSPGIIREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQL---GFLQKGFSHCFLA 169
V SS I+ + G + T + GI GFG GALSV SQL G K FSHC
Sbjct: 199 VANSSSTIVFGCSTYQSGDLTKTDKAVDGIFGFGPGALSVISQLSSRGVTPKVFSHCL-- 256
Query: 170 FKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPL 229
N LV+G++ + ++ ++P++ P P +Y + L++I + L P+
Sbjct: 257 ---KGGENGGGVLVLGEIL---EPSIVYSPLV--PSLP-HYNLNLQSIAVNGQLL---PI 304
Query: 230 SLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYR 289
F + N G +VDSGTT +L + Y+ + + + ++ + +K + + + CY
Sbjct: 305 DSNVFATTNNQGTIVDSGTTLAYLVQEAYNPFVDAITAAVSQF--SKPIISKG--NQCYL 360
Query: 290 VPCPNNTFTDDLFPSITFHFLNNVSLVL-PQGNHFYAMSAPSNSSAVKCLLFQSMDDGDY 348
V + D+FP ++ +F+ S+VL P+ H+ +S+A+ C+ FQ ++ G
Sbjct: 361 V----SNSVGDIFPQVSLNFMGGASMVLNPE--HYLMHYGFLDSAAMWCIGFQKVERG-- 412
Query: 349 GPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
+ G ++ VYDL +RIG+ +C+
Sbjct: 413 --FTILGDLVLKDKIFVYDLANQRIGWADYNCS 443
>gi|302817726|ref|XP_002990538.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
gi|300141706|gb|EFJ08415.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
Length = 434
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 101/393 (25%), Positives = 170/393 (43%), Gaps = 75/393 (19%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKL-MSNFSPSRSSSSSRDTCASSFCLNIHSSD 64
+DTGSDL WV C C+ C + + K+ + + S+SSS+
Sbjct: 53 VDTGSDLLWVNC----HPCIGCPAFSDLKIPIVPYDVKASASSSKV-------------- 94
Query: 65 NPFDPCTMSGCSLSTLLKSTCCR---PCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
PC+ C+L T + + C C +++ YG+G G L D L ++
Sbjct: 95 ----PCSDPSCTLITQISESGCNDQNQC-GYSFQYGDGSGTLGYLVEDVLHY------MV 143
Query: 122 REIPKFCFGC-------VGSTYREPIGIAGFGRGALSVPSQLGFLQKG---FSHCFLAFK 171
FGC + ++ R GI GFG LS SQL K F+HC +
Sbjct: 144 NATATVIFGCGFKQSGDLSTSERALDGIIGFGASDLSFNSQLAKQGKTPNVFAHCLDGGE 203
Query: 172 YANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSL 231
LV+G+V + ++Q+TP++ Y ++Y + L++I++ N++LT P
Sbjct: 204 RGG-----GILVLGNVI---EPDIQYTPLVP---YMSHYNVVLQSISVNNANLTIDP--- 249
Query: 232 REFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVP 291
+ F + G + DSGTT +LP+ Y + + + R
Sbjct: 250 KLFSNDVMQGTIFDSGTTLAYLPDEAYQAFTQAVSLVVAPFLLCDTRLSR---------- 299
Query: 292 CPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPS 351
F LFP++ +F S+ L + ++ +N+ + C+ +QSM +
Sbjct: 300 -----FIYKLFPNVVLYF-EGASMTLTPAEYLIRQASAANA-PIWCMGWQSMGSAESELQ 352
Query: 352 -GVFGSFQQQNVEVVYDLEKERIGFQPMDCAST 383
+FG +N VVYDLE+ RIG++P DC ++
Sbjct: 353 YTIFGDLVLKNKLVVYDLERGRIGWRPFDCKTS 385
>gi|449461377|ref|XP_004148418.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
gi|449518059|ref|XP_004166061.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 436
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 105/383 (27%), Positives = 154/383 (40%), Gaps = 63/383 (16%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DT +D W PC C+ C + FS SS+ + C+ C
Sbjct: 110 MVLDTSNDAAWAPCSG----CIGCSS------TTTFSAQNSSTFATLDCSKPECTQAR-- 157
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
G S T C F TYG + L +D+L + P +I
Sbjct: 158 ----------GLSCPTTGNVDCL-----FNQTYGGDSTFSATLVQDSLHL---GPNVI-- 197
Query: 124 IPKFCFGCVGSTYRE---PIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNIS 179
P F FGC+ S P G+ G GRG LS+ SQ G L G FS+C +FK S
Sbjct: 198 -PNFSFGCISSASGSSIPPQGLMGLGRGPLSLISQSGSLYSGLFSYCLPSFKSYY---FS 253
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLS--LREFDSQ 237
L +G V ++ TP+L +P P+ YY+ L I++G VP+S L FD
Sbjct: 254 GSLKLGPVG--QPKAIRTTPLLHNPHRPSLYYVNLTGISVGR---VLVPISPELLAFDPN 308
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTF 297
G ++DSGT T Y+ + + + FD C+ NN
Sbjct: 309 TGAGTIIDSGTVITRFVPAIYTAVRDEFRKQV-----GGSFSPLGAFDTCFAT---NNEV 360
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSF 357
+ P+IT H L+ + L LP N SA S + CL + + V +
Sbjct: 361 SA---PAITLH-LSGLDLKLPMENSLIHSSAGS----LACLAMAAAPNNVNSVVNVIANL 412
Query: 358 QQQNVEVVYDLEKERIGFQPMDC 380
QQQN +++D+ ++G C
Sbjct: 413 QQQNHRILFDINNSKLGIARELC 435
>gi|356532674|ref|XP_003534896.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 446
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 112/388 (28%), Positives = 169/388 (43%), Gaps = 73/388 (18%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V MDTGSD+ W+ C C +CD N L F PS SS+ S C +
Sbjct: 116 VVMDTGSDILWIMCN----PCTNCD----NHLGLLFDPSMSSTFS-PLCKT--------- 157
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
PC GC C P P F +Y + +G RD L + G +
Sbjct: 158 -----PCGFKGCK---------CDPIP-FTISYVDNSSASGTFGRDILVFETTDEGT-SQ 201
Query: 124 IPKFCFGC---VGSTYREP--IGIAGFGRGALSVPSQLGFLQKGFSHCF--LAFKYANDP 176
I GC +G +P GI G G S+ +Q+G + FS+C LA Y N
Sbjct: 202 ISDVIIGCGHNIGFN-SDPGYNGILGLNNGPNSLATQIG---RKFSYCIGNLADPYYN-- 255
Query: 177 NISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
+ L +G+ A + F +Y +YY+ +E I++G L ++ L E
Sbjct: 256 --YNQLRLGEGADLEGYSTPF------EVYHGFYYVTMEGISVGEKRL-DIALETFEMKR 306
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNT 296
G GG+++DSGTT T+L + + L + +++ + + R + + E + LCY
Sbjct: 307 NGTGGVILDSGTTITYLVDSAHKLLYNEVRNLLKWSFR-QVIFENAPWKLCYY-----GI 360
Query: 297 FTDDL--FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQ--SMDDGDYGPSG 352
+ DL FP +TFHF++ L L G+ F S + C+ S+ + PS
Sbjct: 361 ISRDLVGFPVVTFHFVDGADLALDTGSFF------SQRDDIFCMTVSPASILNTTISPS- 413
Query: 353 VFGSFQQQNVEVVYDLEKERIGFQPMDC 380
V G QQ+ V YDL + + FQ +DC
Sbjct: 414 VIGLLAQQSYNVGYDLVNQFVYFQRIDC 441
>gi|242066140|ref|XP_002454359.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
gi|241934190|gb|EES07335.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
Length = 460
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 110/386 (28%), Positives = 168/386 (43%), Gaps = 80/386 (20%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V +D+GSD++WV C C+ C ++++ F PS SS+ S +C+S+ C +
Sbjct: 146 VLIDSGSDVSWVQCK----PCLQC----HSQVDPLFDPSLSSTYSPFSCSSAACAQLGQD 197
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
N GCS S+ + + Y +G TG + DTL + ++
Sbjct: 198 GN--------GCSSSSQCQ---------YIVRYADGSSTTGTYSSDTLALGSNT------ 234
Query: 124 IPKFCFGC--VGSTYREPI-GIAGFGRGALSVPSQ-LGFLQKGFSHCFLAFKYANDPNIS 179
I F FGC V S + + G+ G G GA S+ SQ G FS+C P+ S
Sbjct: 235 ISNFQFGCSHVESGFNDLTDGLMGLGGGAPSLASQTAGTFGTAFSYCL-----PPTPSSS 289
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
L +G + TPML+S P +Y + LEAI +G + L+ +P S+ +
Sbjct: 290 GFLTLG----AGTSGFVKTPMLRSSPVPTFYGVRLEAIRVGGTQLS-IPTSVF------S 338
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTD 299
G+++DSGT T LP YS L S ++ + Y + R+ D C
Sbjct: 339 AGMVMDSGTIITRLPRTAYSALSSAFKAGMKQY---RPAPPRSIMDTC------------ 383
Query: 300 DLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSA-----VKCLLFQSMDDGDYGPSGVF 354
F F S+ LP ++ A N A CL F + D D P G+
Sbjct: 384 -------FDFSGQSSVRLPSVALVFSGGAVVNLDANGIILGNCLAFAANSD-DSSP-GIV 434
Query: 355 GSFQQQNVEVVYDLEKERIGFQPMDC 380
G+ QQ+ EV+YD+ +GF+ C
Sbjct: 435 GNVQQRTFEVLYDVGGGAVGFKAGAC 460
>gi|383130052|gb|AFG45746.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
Length = 155
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 59/147 (40%), Positives = 81/147 (55%), Gaps = 11/147 (7%)
Query: 182 LVIGDVAISSKDNLQFTPML-----KSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
LV+GD A+ ++ +L +TP L S Y +YYI L ++IG L +P L FD+
Sbjct: 2 LVLGDKALPTEMSLNYTPFLINTKASSSGYHTFYYIDLRGVSIGRKRL-NLPSKLFSFDT 60
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNT 296
+GNGG ++DSGTT+T E FY + + S I + RA EVE RTG LCY V ++
Sbjct: 61 KGNGGTIIDSGTTFTIFNEEFYKNITAAFSSQIG-FRRASEVEARTGMRLCYNVSGVDHV 119
Query: 297 FTDDLFPSITFHFLNNVSLVLPQGNHF 323
L P FHF +VLP N+F
Sbjct: 120 ----LLPDFAFHFKGGSDMVLPVANYF 142
>gi|312282359|dbj|BAJ34045.1| unnamed protein product [Thellungiella halophila]
Length = 484
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 106/380 (27%), Positives = 159/380 (41%), Gaps = 53/380 (13%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGSD+ W+ C C C N+ F+P++S + + C S C + S
Sbjct: 153 LDTGSDVVWLQCS----PCKVC----YNQSDPVFNPAKSKTFATVPCGSRLCRRLDDSSE 204
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
C +S C + +YG+G G + +TL HG+ +
Sbjct: 205 ----CVSR--------RSKACL----YQVSYGDGSFTVGDFSTETLTFHGA------RVD 242
Query: 126 KFCFGCVGST---YREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFL-AFKYANDPNISS 180
GC + G+ G GRG LS PSQ G FS+C + + S
Sbjct: 243 HVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQTKNRYNGKFSYCLVDRTSSGSSSKPPS 302
Query: 181 PLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNG 240
+V G+ A+ FTP+L +P +YY+ L I++G S + V S + D+ GNG
Sbjct: 303 TIVFGNGAV--PKTAVFTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSESQFKLDATGNG 360
Query: 241 GLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDD 300
G+++DSGT+ T L + Y L + T R K + FD C+ + + T
Sbjct: 361 GVIIDSGTSVTRLTQSAYVALRDAFRLGAT---RLKRAPSYSLFDTCFDL----SGMTTV 413
Query: 301 LFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQ 360
P++ FHF + LP N+ P N+ C F G G + G+ QQQ
Sbjct: 414 KVPTVVFHFTGG-EVSLPASNYLI----PVNNQGRFCFAFA----GTMGSLSIIGNIQQQ 464
Query: 361 NVEVVYDLEKERIGFQPMDC 380
V YDL R+GF C
Sbjct: 465 GFRVAYDLVGSRVGFLSRAC 484
>gi|225216914|gb|ACN85210.1| aspartic proteinase nepenthesin-1 precursor [Oryza glaberrima]
Length = 516
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 108/383 (28%), Positives = 152/383 (39%), Gaps = 67/383 (17%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V DTGSD TWV C C + + KL F P+ SS+ + +CA+ C ++
Sbjct: 195 VVFDTGSDTTWVQCQPCVVACYE----QREKL---FDPASSSTYANVSCAAPACSDLD-- 245
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
+SGCS L + YG+G G DTL +
Sbjct: 246 --------VSGCSGGHCL----------YGVQYGDGSYSIGFFAMDTLTLSS-----YDA 282
Query: 124 IPKFCFGCVGST---YREPIGIAGFGRGALSVPSQL-GFLQKGFSHCFLAFKYANDPNIS 179
+ F FGC + E G+ G GRG S+P Q G F+HC P S
Sbjct: 283 VKGFRFGCGERNDGLFGEAAGLLGLGRGKTSLPVQTYGKYGGVFAHCL--------PPRS 334
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
+ D S TPML P +YY+G+ I +G L P++ F + G
Sbjct: 335 TGTGYLDFGAGSPPATTTTPMLTGNG-PTFYYVGMTGIRVGGRLL---PIAPSVFAAAGT 390
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITY--YPRAKEVEERTGFDLCYRVPCPNNTF 297
+VDSGT T LP YS L S + + Y +A V D CY
Sbjct: 391 ---IVDSGTVITRLPPAAYSSLRSAFAAAMAARGYRKAAAVSL---LDTCYDF----TGM 440
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSF 357
+ P+++ F +L + Y +SA CL F +DG G G+ G+
Sbjct: 441 SQVAIPTVSLLFQGGAALDVDASGIMYTVSASQ-----VCLAFAGNEDG--GDVGIVGNT 493
Query: 358 QQQNVEVVYDLEKERIGFQPMDC 380
Q + V YD+ K+ +GF P C
Sbjct: 494 QLKTFGVAYDIGKKVVGFSPGAC 516
>gi|212274314|ref|NP_001130524.1| uncharacterized protein LOC100191623 [Zea mays]
gi|194689376|gb|ACF78772.1| unknown [Zea mays]
gi|224031455|gb|ACN34803.1| unknown [Zea mays]
gi|238011528|gb|ACR36799.1| unknown [Zea mays]
gi|238015454|gb|ACR38762.1| unknown [Zea mays]
Length = 304
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 93/318 (29%), Positives = 142/318 (44%), Gaps = 27/318 (8%)
Query: 73 SGCSLSTLLKSTCCRP-CPSFAYTYGEGGLVTGILT--RDTLKVHGSSPGIIREIPKFCF 129
+G S +L +C RP ++ Y YG+G + G+ R T G +P F
Sbjct: 4 AGTLCSDILHHSCERPDTCTYRYNYGDGTMTVGVYATERFTFASSGGGGLTTTTVP-LGF 62
Query: 130 GC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSPLVIG 185
GC VGS GI GFGR LS+ SQL + FS+C ++ + +
Sbjct: 63 GCGSVNVGS-LNNGSGIVGFGRNPLSLVSQLSI--RRFSYCLTSYASRRQSTLLFGSLSD 119
Query: 186 DVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGGLLVD 245
V + +Q TP+L+SP P +YY+ +T+G L +P S G+GG++VD
Sbjct: 120 GVYGDATGRVQTTPLLQSPQNPTFYYVHFTGLTVGARRL-RIPESAFALRPDGSGGVIVD 178
Query: 246 SGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCP--NNTFTDDL-F 302
SGT T LP ++++ + + P A G +C+ VP ++ T +
Sbjct: 179 SGTALTLLPAAVLAEVVRAFRQQLR-LPFANGGNPEDG--VCFLVPAAWRRSSSTSQMPV 235
Query: 303 PSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQNV 362
P + HF L LP+ N+ + CLL D GD G + G+ QQ++
Sbjct: 236 PRMVLHF-QGADLDLPRRNYVL----DDHRRGRLCLLLA--DSGDDGST--IGNLVQQDM 286
Query: 363 EVVYDLEKERIGFQPMDC 380
V+YDLE E + P C
Sbjct: 287 RVLYDLEAETLSIAPARC 304
>gi|297849132|ref|XP_002892447.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297338289|gb|EFH68706.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 493
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 110/400 (27%), Positives = 178/400 (44%), Gaps = 79/400 (19%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKL-MSNFSPSRSSSSSRDTCASSFCLNIHS 62
V +DTGSD+ WV C + C C ++ +S F P SSS+S +C+ C +
Sbjct: 99 VQIDTGSDVLWVSCTS----CNGCPKTSELQIQLSFFDPGVSSSASLVSCSDRRCYSNFQ 154
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRD----------TLK 112
++ SGCS + L S+++ YG+G +G D TL
Sbjct: 155 TE--------SGCSPNNLC---------SYSFKYGDGSGTSGFYISDFMSFDTVITSTLA 197
Query: 113 VHGSSPGIIREIPKFCFGC-------VGSTYREPIGIAGFGRGALSVPSQL---GFLQKG 162
++ S+P F FGC + R GI G G+G+LSV SQL G +
Sbjct: 198 INSSAP--------FVFGCSNLQTGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRV 249
Query: 163 FSHCFLAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNS 222
FSHC D + +V+G + + + +TP++ P P +Y + L++I +
Sbjct: 250 FSHCL-----KGDKSGGGIMVLGQI---KRPDTVYTPLV--PSQP-HYNVNLQSIAVNGQ 298
Query: 223 SLTEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERT 282
L P+ F G ++D+GTT +LP+ YS + + + ++ Y R E
Sbjct: 299 IL---PIDPSVFTIATGDGTIIDTGTTLAYLPDEAYSPFIQAIANAVSQYGRPITYESYQ 355
Query: 283 GFDLCYRVPCPNNTFTD-DLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQ 341
F++ T D D+FP ++ F S+VL H Y S+ S++ C+ FQ
Sbjct: 356 CFEI---------TAGDVDVFPEVSLSFAGGASMVLRP--HAYLQIFSSSGSSIWCIGFQ 404
Query: 342 SMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
M + + G ++ VVYDL ++RIG+ DC+
Sbjct: 405 RM---SHRRITILGDLVLKDKVVVYDLVRQRIGWAEYDCS 441
>gi|383130038|gb|AFG45739.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
Length = 154
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 59/147 (40%), Positives = 81/147 (55%), Gaps = 11/147 (7%)
Query: 182 LVIGDVAISSKDNLQFTPML-----KSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
LV+GD A+ ++ +L +TP L S Y +YYI L ++IG L +P L FD+
Sbjct: 2 LVLGDKALPTEMSLNYTPFLINTKASSSGYHTFYYIDLRGVSIGRKRL-NLPSKLFSFDT 60
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNT 296
+GNGG ++DSGTT+T E FY + + S I + RA EVE RTG LCY V ++
Sbjct: 61 KGNGGTIIDSGTTFTIFNEEFYKNITAAFASQIG-FRRASEVEARTGMRLCYNVSGVDHV 119
Query: 297 FTDDLFPSITFHFLNNVSLVLPQGNHF 323
L P FHF +VLP N+F
Sbjct: 120 ----LLPDFAFHFKGGSDMVLPVANYF 142
>gi|361067845|gb|AEW08234.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
gi|383130032|gb|AFG45736.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
gi|383130034|gb|AFG45737.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
gi|383130036|gb|AFG45738.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
gi|383130046|gb|AFG45743.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
gi|383130048|gb|AFG45744.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
gi|383130050|gb|AFG45745.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
gi|383130054|gb|AFG45747.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
gi|383130056|gb|AFG45748.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
Length = 155
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 59/147 (40%), Positives = 81/147 (55%), Gaps = 11/147 (7%)
Query: 182 LVIGDVAISSKDNLQFTPML-----KSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
LV+GD A+ ++ +L +TP L S Y +YYI L ++IG L +P L FD+
Sbjct: 2 LVLGDKALPTEMSLNYTPFLINTKASSSGYHTFYYIDLRGVSIGRKRL-NLPSKLFSFDT 60
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNT 296
+GNGG ++DSGTT+T E FY + + S I + RA EVE RTG LCY V ++
Sbjct: 61 KGNGGTIIDSGTTFTIFNEEFYKNITAAFASQIG-FRRASEVEARTGMRLCYNVSGVDHV 119
Query: 297 FTDDLFPSITFHFLNNVSLVLPQGNHF 323
L P FHF +VLP N+F
Sbjct: 120 ----LLPDFAFHFKGGSDMVLPVANYF 142
>gi|356524289|ref|XP_003530762.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Glycine max]
Length = 392
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 107/385 (27%), Positives = 165/385 (42%), Gaps = 58/385 (15%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + DTGSDLTW C + C Y+ + F PS+SSS + TC SS C +
Sbjct: 59 LSLVFDTGSDLTWTQCEPCAGSC-----YKQQDAI--FDPSKSSSYTNITCTSSLCTQLT 111
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
S S CS ST ++C + YG+ G L+++ L + +
Sbjct: 112 SDG------IKSECSSST--DASCI-----YDAKYGDNSTSVGFLSQERLTITATDI--- 155
Query: 122 REIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGF-LQKGFSHCFLAFKYANDPN 177
+ F FGC + G+ G GR +S+ Q K FS+C P
Sbjct: 156 --VDDFLFGCGQDNEGLFNGSAGLMGLGRHPISIVQQTSSNYNKIFSYCL--------PA 205
Query: 178 ISSPL--VIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFD 235
SS L + + ++ +L +TP+ ++Y + + +I++G + L V S F
Sbjct: 206 TSSSLGHLTFGASAATNASLIYTPLSTISGDNSFYGLDIVSISVGGTKLPAVSSS--TFS 263
Query: 236 SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNN 295
+ GG ++DSGT T L Y+ L S + + YP A E D CY + +
Sbjct: 264 A---GGSIIDSGTVITRLAPTVYAALRSAFRRXMEKYPVANEAGL---LDTCYDL----S 313
Query: 296 TFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFG 355
+ + P I F F V++ L H + S CL F + +G VFG
Sbjct: 314 GYKEISVPRIDFEFSGGVTVELX---HRGILXVESEQQV--CLAFAA--NGSDNDITVFG 366
Query: 356 SFQQQNVEVVYDLEKERIGFQPMDC 380
+ QQ+ +EVVYD++ RIGF C
Sbjct: 367 NVQQKTLEVVYDVKGGRIGFGAAGC 391
>gi|222622847|gb|EEE56979.1| hypothetical protein OsJ_06707 [Oryza sativa Japonica Group]
Length = 494
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 109/399 (27%), Positives = 178/399 (44%), Gaps = 72/399 (18%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKL-MSNFSPSRSSSSSRDTCASSFCLNIHS 62
V +DTGSD+ WV C C C N + ++ + P S S TC FC+ +
Sbjct: 105 VQVDTGSDILWVNC----VSCDGCPRKSNLGIELTMYDPRGSQSGELVTCDQQFCVANYG 160
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIR 122
L T PC ++ +YG+G G D L+ + S G +
Sbjct: 161 G---------------VLPSCTSTSPC-EYSISYGDGSSTAGFFVTDFLQYNQVS-GDGQ 203
Query: 123 EIPK---FCFGC-------VGSTYREPIGIAGFGRGALSVPSQL---GFLQKGFSHCFLA 169
P FGC +GS+ GI GFG+ S+ SQL G ++K F+HC
Sbjct: 204 TTPANASVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCL-- 261
Query: 170 FKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPL 229
+ N IG+V + ++ TP++ P P+Y I L+ I +G ++L +P
Sbjct: 262 ----DTVNGGGIFAIGNVV---QPKVKTTPLV--PDMPHYNVI-LKGIDVGGTALG-LPT 310
Query: 230 SLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDL-CY 288
++ FDS + G ++DSGTT ++PE Y L +++ + + +++ +T D C+
Sbjct: 311 NI--FDSGNSKGTIIDSGTTLAYVPEGVYKALFAMV------FDKHQDISVQTLQDFSCF 362
Query: 289 RVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQS----MD 344
+ DD FP +TFHF +VSL++ ++ + N + C+ FQ+
Sbjct: 363 QYSGS----VDDGFPEVTFHFEGDVSLIVSPHDYLF-----QNGKNLYCMGFQNGGGKTK 413
Query: 345 DGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCAST 383
DG N V+YDLE + IG+ +C+S+
Sbjct: 414 DGKDLGLLG--DLVLSNKLVLYDLENQAIGWADYNCSSS 450
>gi|242079765|ref|XP_002444651.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
gi|241941001|gb|EES14146.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
Length = 493
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 111/410 (27%), Positives = 177/410 (43%), Gaps = 72/410 (17%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKL-MSNFSPSRSSSSSRDTCASSFCLNIHS 62
V +DTGSD+ WV C C C L ++ + P SS+ S C FC +
Sbjct: 103 VQVDTGSDILWVNC----ITCDQCPHKSGLGLDLTLYDPKASSTGSTVMCDQGFCADTFG 158
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIR 122
P K + PC ++ TYG+G G D L+ G +
Sbjct: 159 GRLP---------------KCSANVPC-EYSVTYGDGSSTVGSFVNDALQFD-QVTGDGQ 201
Query: 123 EIPK---FCFGC-------VGSTYREPIGIAGFGRGALSVPSQL---GFLQKGFSHCFLA 169
P FGC +GS+ + GI GFG S+ SQL G ++K F+HC
Sbjct: 202 TQPANASVIFGCGAQQGGDLGSSSQALDGILGFGEANTSMLSQLATAGKVKKIFAHCLDT 261
Query: 170 FKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPL 229
K IGDV + ++ TP++ +Y + L+ I +G ++L E+P
Sbjct: 262 IKGGG------IFAIGDVV---QPKVKTTPLVADK---PHYNVNLKTIDVGGTTL-ELPA 308
Query: 230 SLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSIL---QSTITYYPRAKEVEERTGFDL 286
+ F G ++DSGTT T+LPE + +++ + IT++ +V++ F+
Sbjct: 309 DI--FKPGEKRGTIIDSGTTLTYLPELVFKKVMLAVFNKHQDITFH----DVQDFLCFEY 362
Query: 287 CYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQ--SMD 344
V DD FP++TFHF ++++L + +F+ N + V C+ FQ ++
Sbjct: 363 SGSV--------DDGFPTLTFHFEDDLALHVYPHEYFFP-----NGNDVYCVGFQNGALQ 409
Query: 345 DGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASAQGLHKKKT 394
D + G N VVYDLE IG+ +C+S+ + KT
Sbjct: 410 SKDGKDIVLMGDLVLSNKLVVYDLENRVIGWTDYNCSSSIKIKDDKTGKT 459
>gi|224101015|ref|XP_002312106.1| predicted protein [Populus trichocarpa]
gi|222851926|gb|EEE89473.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 114/410 (27%), Positives = 168/410 (40%), Gaps = 73/410 (17%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
I + +DTGS+L+W+ C + S F+P S + ++ C+S C
Sbjct: 80 ITMVLDTGSELSWLHCK------------KEPNFNSIFNPLASKTYTKIPCSSPTC-ETR 126
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCP--SFAYTYGEGGLVTGILTRDTLKVHGSSPG 119
+ D P C P F +Y + V G L +T +V GS G
Sbjct: 127 TRDLPL---------------PVSCDPAKLCHFIISYADASSVEGNLAFETFRV-GSVTG 170
Query: 120 IIREIPKFCFGCVGSTYR-------EPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKY 172
P FGC+ S + + G+ G RG+LS +Q+GF + FS+C
Sbjct: 171 -----PATVFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVNQMGF--RKFSYCI----- 218
Query: 173 ANDPNISSPLVIGDVAISSKDNLQFTPMLK-SPMYPNY----YYIGLEAITIGNSSLTEV 227
+D + S L++G+ + S L +TP+++ S P + Y + LE I + + L+ +
Sbjct: 219 -SDRDSSGVLLLGEASFSWLKPLNYTPLVEMSTPLPYFDRVAYSVQLEGIRVSDKVLS-L 276
Query: 228 PLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLS--ILQST-ITYYPRAKEVEERTGF 284
P S+ D G G +VDSGT +T L P YS L +LQ+ + +
Sbjct: 277 PKSVFVPDHTGAGQTMVDSGTQFTFLLGPVYSALKQEFLLQTKGVLRVLNEPRYVFQGAM 336
Query: 285 DLCY-----RVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLL 339
DLCY R PN P + F V Q + +V C
Sbjct: 337 DLCYLIEPTRAALPN-------LPVVNLMFRGAEMSVSGQRLLYRVPGEVRGKDSVWCFT 389
Query: 340 FQSMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASAQGL 389
F + D S V G QQQNV + YDLEK RIGF + C GL
Sbjct: 390 FGNSDSLGI-ESFVIGHHQQQNVWMEYDLEKSRIGFAEVRCDLAGQRLGL 438
>gi|226530755|ref|NP_001150335.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195638492|gb|ACG38714.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 461
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 108/381 (28%), Positives = 166/381 (43%), Gaps = 66/381 (17%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DTGSD++WV C C C +++ F PS SS+ S +C S+ C +
Sbjct: 143 MLIDTGSDVSWVQCK----PCSQC----HSQADPLFDPSSSSTYSPFSCGSAACAQLGQE 194
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
N GCS S+ + + TYG+G TG + DTL + S+
Sbjct: 195 GN--------GCSSSSQCQ---------YIVTYGDGSSTTGTYSSDTLALGSSA------ 231
Query: 124 IPKFCFGC--VGSTYREPI-GIAGFGRGALSVPSQ-LGFLQKGFSHCFLAFKYANDPNIS 179
+ F FGC V S + + G+ G G GA S+ SQ G L + FS+C P+ S
Sbjct: 232 VKSFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQTAGTLGRAFSYCL-----PPTPSSS 286
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
L +G S TPML+S P +Y + L+AI +G L+ +P S+ +
Sbjct: 287 GFLTLGAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLS-IPASVF------S 339
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTD 299
G ++DSGT T LP YS L S ++ + YP A+ D C+ ++
Sbjct: 340 AGTVMDSGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSGI---LDTCFDFSGQSSVS-- 394
Query: 300 DLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQ 359
PS+ F + L + CL F + + D G+ G+ QQ
Sbjct: 395 --IPSVALVFSGGAVVSLDASGIILS----------NCLAFAA--NSDDSSLGIIGNVQQ 440
Query: 360 QNVEVVYDLEKERIGFQPMDC 380
+ EV+YD+ + +GF+ C
Sbjct: 441 RTFEVLYDVGRGVVGFRAGAC 461
>gi|383143497|gb|AFG53176.1| Pinus taeda anonymous locus 2_9704_01 genomic sequence
gi|383143499|gb|AFG53177.1| Pinus taeda anonymous locus 2_9704_01 genomic sequence
gi|383143505|gb|AFG53180.1| Pinus taeda anonymous locus 2_9704_01 genomic sequence
gi|383143513|gb|AFG53184.1| Pinus taeda anonymous locus 2_9704_01 genomic sequence
gi|383143515|gb|AFG53185.1| Pinus taeda anonymous locus 2_9704_01 genomic sequence
Length = 135
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 55/133 (41%), Positives = 77/133 (57%), Gaps = 6/133 (4%)
Query: 177 NISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
N SS +V+G+ A+ +L +TP++ +P+YP +YY+GLEA++IG L +P + FDS
Sbjct: 7 NNSSKIVVGNKAVPGDISLTYTPLIINPIYPFFYYLGLEAVSIGRKRL-NLPFNSATFDS 65
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNT 296
+GNGG ++DSGT++T PE YSQ+ S I Y R E T LCY V N
Sbjct: 66 KGNGGTIIDSGTSFTIFPEAMYSQIAGEFASQIG-YKRVPGAESTTALGLCYNVSGVENI 124
Query: 297 FTDDLFPSITFHF 309
FP FHF
Sbjct: 125 ----QFPQFAFHF 133
>gi|297803034|ref|XP_002869401.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
gi|297315237|gb|EFH45660.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
Length = 438
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 104/389 (26%), Positives = 165/389 (42%), Gaps = 66/389 (16%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
++MDT SDL W+ C C++C + + F PSRS + ++C +S
Sbjct: 100 LHMDTASDLLWLQCR----PCINC----YAQSLPIFDPSRSYTHRNESCRTS-------- 143
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHG----SSPG 119
S+ +L + R C ++ Y +G GIL ++ L + SS
Sbjct: 144 ----------QYSMPSLRFNAKTRSC-EYSMRYMDGTGSKGILAKEMLMFNTIYDESSSA 192
Query: 120 IIREIPKFCFGCVGSTYREPI---GIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDP 176
+ ++ FGC Y EP+ GI G G G S+ + G FS+CF + + P
Sbjct: 193 ALHDV---VFGCGHDNYGEPLVGTGILGLGYGEFSLVHRFG---TKFSYCFGSLDDPSYP 246
Query: 177 NISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
+ + LV+GD + + TP+ +Y +YY+ +EAI++ L P
Sbjct: 247 H--NVLVLGDDGANILGDT--TPL---EIYNGFYYVTIEAISVDGIILPIDPWVFNRNHQ 299
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNT 296
G GG ++D+G + T L E Y L++ I Y + D ++V C N
Sbjct: 300 TGLGGTIIDTGNSLTSLVEEAYKP----LKNKIEDYFEGRFTAADVNQDDMFKVECYNGN 355
Query: 297 FTDDL----FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSG 352
DL FP +TFHF + L L + F +S V CL G
Sbjct: 356 LERDLVESGFPIVTFHFSDGAELSLDVKSVFMKLSP-----NVFCLAVTP------GNMN 404
Query: 353 VFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
G+ QQ+ + YDLE ++I F+ +DC
Sbjct: 405 SIGATAQQSYNIGYDLEAKKISFERIDCG 433
>gi|414881575|tpg|DAA58706.1| TPA: hypothetical protein ZEAMMB73_168363 [Zea mays]
Length = 506
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 111/411 (27%), Positives = 181/411 (44%), Gaps = 80/411 (19%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKL-MSNFSPSRSSSSSRDTCASSFCLNIHS 62
V +DTGSD+ WV C C C L ++ + P SSS S +C FC +
Sbjct: 102 VQVDTGSDILWVNC----ISCSKCPRKSGLGLDLTFYDPKASSSGSTVSCDQGFCAATYG 157
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTL---KVHG---S 116
P GC+ + PC ++ YG+G TG D L +V G +
Sbjct: 158 GKLP-------GCTANV--------PC-EYSVMYGDGSSTTGFFITDALQFDQVTGDGQT 201
Query: 117 SPGIIREIPKFCFGC-------VGSTYREPIGIAGFGRGALSVPSQL---GFLQKGFSHC 166
PG FGC +G++ + GI GFG+ S+ SQL G +K F+HC
Sbjct: 202 QPG----NATITFGCGAQQGGDLGNSNQALDGILGFGQANTSMLSQLAAAGKAKKIFAHC 257
Query: 167 FLAFKYANDPNISSPLVIGDVAISSKDNLQF---TPMLKSPMY--------PNYYYIGLE 215
K IG+V + K F +L P++ +Y + L+
Sbjct: 258 LDTIKGGG------IFAIGNV-VQPKCYFVFFFAHGLLNIPLFLLVMILLSRPHYNVNLK 310
Query: 216 AITIGNSSLTEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRA 275
+I +G ++L L F++ G ++DSGTT T+LPE + Q++ ++ + +
Sbjct: 311 SIDVGGTTLQ---LPAHVFETGEKKGTIIDSGTTLTYLPELVFKQVMDVV------FSKH 361
Query: 276 KEVEERTGFD-LCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSA 334
+++ D LC++ DD FP+ITFHF ++++L + +F+ N +
Sbjct: 362 RDIAFHNLQDFLCFQYSGS----VDDGFPTITFHFEDDLALHVYPHEYFF-----PNGND 412
Query: 335 VKCLLFQ--SMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCAST 383
+ C+ FQ ++ D + G N VVYDLE + IG+ +C+S+
Sbjct: 413 IYCVGFQNGALQSKDGKDIVLMGDLVLSNKLVVYDLENQVIGWTDYNCSSS 463
>gi|357494221|ref|XP_003617399.1| 60S ribosomal protein L18a [Medicago truncatula]
gi|355518734|gb|AET00358.1| 60S ribosomal protein L18a [Medicago truncatula]
Length = 749
Score = 102 bits (253), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 109/392 (27%), Positives = 163/392 (41%), Gaps = 52/392 (13%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGSDL W+ C C+ C + + P SSS TC C + S D
Sbjct: 209 LDTGSDLNWIQC----VPCIACFEQSG----PYYDPKESSSFENITCHDPRCKLVSSPDP 260
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE-- 123
P PC + CP F Y YG+ TG +T V+ ++P E
Sbjct: 261 P-KPCKDEN------------QTCPYF-YWYGDSSNTTGDFALETFTVNLTTPNGKSEQK 306
Query: 124 -IPKFCFGC---VGSTYREPIGIAGFGRGALSVPSQL-GFLQKGFSHCFLAFKYANDPNI 178
+ FGC + G+ G GRG LS SQL FS+C + +D ++
Sbjct: 307 HVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFASQLQSIYGHSFSYCLV--DRNSDTSV 364
Query: 179 SSPLVIG-DVAISSKDNLQFTPML--KSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFD 235
SS L+ G D + S NL FT + + +YY+G+++I + L ++P
Sbjct: 365 SSKLIFGEDKELLSHPNLNFTSFVGGEENSVDTFYYVGIKSIMVDGEVL-KIPEETWHLS 423
Query: 236 SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNN 295
+G GG ++DSGTT T+ EP Y + I Y E GF PC N
Sbjct: 424 KEGGGGTIIDSGTTLTYFAEPAYEIIKEAFMKKIKGY------ELVEGFPPL--KPCYNV 475
Query: 296 TFTDDL-FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVF 354
+ + + P F + P N+F + + CL +
Sbjct: 476 SGIEKMELPDFGILFSDGAMWDFPVENYFIQIEP-----DLVCLAILGTPKSALS---II 527
Query: 355 GSFQQQNVEVVYDLEKERIGFQPMDCASTASA 386
G++QQQN ++YD++K R+G+ PM C +T S
Sbjct: 528 GNYQQQNFHILYDMKKSRLGYAPMKCTATTSG 559
>gi|449446233|ref|XP_004140876.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 498
Score = 102 bits (253), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 109/404 (26%), Positives = 174/404 (43%), Gaps = 73/404 (18%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSP---SRSSSSSRDTCASSFCLNI 60
V +DTGSD+ WV C C +C R + L +P S++ +C FCL +
Sbjct: 102 VQVDTGSDIVWVNC----IQCRECP--RTSSLGMELTPYDLEESTTGKLVSCDEQFCLEV 155
Query: 61 HSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
+ +SGC T CP + YG+G G +D ++ + S +
Sbjct: 156 NGG-------PLSGC--------TTNMSCP-YLQIYGDGSSTAGYFVKDYVQYNRVSGDL 199
Query: 121 IREIPK--FCFGC-------VGSTYREPI-GIAGFGRGALSVPSQLG---FLQKGFSHCF 167
FGC +GS+ E + GI GFG+ S+ SQL ++K F+HC
Sbjct: 200 ETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNSSIISQLASTRKVKKMFAHCL 259
Query: 168 LAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPN--YYYIGLEAITIGNSSLT 225
+ N +G V + K N+ +P+ PN +Y + + + +G+ L
Sbjct: 260 ------DGTNGGGIFAMGHV-VQPKVNM-------TPLVPNQPHYNVNMTGVQVGHIILN 305
Query: 226 EVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFD 285
+S F++ G ++DSGTT +LPE Y L++ + S EV+ G
Sbjct: 306 ---ISADVFEAGDRKGTIIDSGTTLAYLPELIYEPLVAKILSQ----QHNLEVQTIHGEY 358
Query: 286 LCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQS--M 343
C++ + DD FP + FHF N SL+L H Y + + C+ +Q+ M
Sbjct: 359 KCFQY----SERVDDGFPPVIFHFEN--SLLLKVYPHEYLFQYEN----LWCIGWQNSGM 408
Query: 344 DDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASAQ 387
D +FG N V+YDLE + IG+ +C+S+ Q
Sbjct: 409 QSRDRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNCSSSIKVQ 452
>gi|255547548|ref|XP_002514831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223545882|gb|EEF47385.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 488
Score = 102 bits (253), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 105/403 (26%), Positives = 178/403 (44%), Gaps = 71/403 (17%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKL-MSNFSPSRSSSSSRDTCASSFCLNIHS 62
V +DTGSD+ WV C N C C + + ++ + P S+S++R C FC ++
Sbjct: 97 VQVDTGSDILWVNCAN----CDKCPTKSDLGVKLTLYDPQSSTSATRIYCDDDFCAATYN 152
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIR 122
+ GC+ PC ++ YG+G G +D L+ + +
Sbjct: 153 G-------VLQGCTKDL--------PC-QYSVVYGDGSSTAGFFVKDNLQFDRVTGNLQT 196
Query: 123 EIPK--FCFGC-------VGSTYREPIGIAGFGRGALSVPSQL---GFLQKGFSHCFLAF 170
FGC +G++ GI GFG+ S+ SQL G +++ F+HC
Sbjct: 197 SSANGSVIFGCGAKQSGELGTSSEALDGILGFGQANSSMISQLAAAGKVKRVFAHCL--- 253
Query: 171 KYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPN--YYYIGLEAITIGNSSLTEVP 228
++ IG+V +S K N +PM PN +Y + ++ I +G + L E+P
Sbjct: 254 ---DNVKGGGIFAIGEV-VSPKVN-------TTPMVPNQPHYNVVMKEIEVGGNVL-ELP 301
Query: 229 LSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAK--EVEERTGFDL 286
+ FD+ G ++DSGTT +LPE Y S++ ++ P K VEE+
Sbjct: 302 TDI--FDTGDRRGTIIDSGTTLAYLPEVVYE---SMMTKIVSEQPGLKLHTVEEQF---T 353
Query: 287 CYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQS--MD 344
C++ N + FP + FHF ++SL + ++ + + V C +Q+ M
Sbjct: 354 CFQYTGNVN----EGFPVVKFHFNGSLSLTVNPHDYLFQI-----HEEVWCFGWQNSGMQ 404
Query: 345 DGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASAQ 387
D + G N V+YDLE + IG+ +C+S+ +
Sbjct: 405 SKDGRDMTLLGDLVLSNKLVLYDLENQAIGWTDYNCSSSIKVR 447
>gi|30688682|ref|NP_197676.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|110736370|dbj|BAF00154.1| protease-like protein [Arabidopsis thaliana]
gi|332005704|gb|AED93087.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 493
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 106/399 (26%), Positives = 177/399 (44%), Gaps = 64/399 (16%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNF-SPSRSSSSSRDTCASSFC-LNIH 61
V +DTGSD+ WV C + C C ++ NF P S ++S +C+ C I
Sbjct: 96 VQVDTGSDVLWVSCAS----CNGCPQTSGLQIQLNFFDPGSSVTASPISCSDQRCSWGIQ 151
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLK---VHGSSP 118
SSD SGCS+ ++ C ++ + YG+G +G D L+ + GSS
Sbjct: 152 SSD--------SGCSV----QNNLC----AYTFQYGDGSGTSGFYVSDVLQFDMIVGSSL 195
Query: 119 GIIREIPKFCFGCVGS-------TYREPIGIAGFGRGALSVPSQL---GFLQKGFSHCFL 168
+ FGC S + R GI GFG+ +SV SQL G + FSHC
Sbjct: 196 -VPNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCL- 253
Query: 169 AFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVP 228
+ LV+G++ + N+ FTP++ P P +Y + L +I++ +L P
Sbjct: 254 ----KGENGGGGILVLGEIV---EPNMVFTPLV--PSQP-HYNVNLLSISVNGQAL---P 300
Query: 229 LSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCY 288
++ F + G ++D+GTT +L E Y + + + ++ R + CY
Sbjct: 301 INPSVFSTSNGQGTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPVVSKGNQ----CY 356
Query: 289 RVPCPNNTFTDDLFPSITFHFLNNVSLVL-PQGNHFYAMSAPSNSSAVKCLLFQSMDDGD 347
+ T D+FP ++ +F S+ L PQ + +AV C+ FQ + +
Sbjct: 357 VI----TTSVGDIFPPVSLNFAGGASMFLNPQ--DYLIQQNNVGGTAVWCIGFQRIQNQG 410
Query: 348 YGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASA 386
+ G ++ VYDL +RIG+ DC+++ +
Sbjct: 411 I---TILGDLVLKDKIFVYDLVGQRIGWANYDCSTSVNV 446
>gi|356570798|ref|XP_003553571.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 500
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 105/395 (26%), Positives = 174/395 (44%), Gaps = 66/395 (16%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKL-MSNFSPSRSSSSSRDTCASSFCLNIHS 62
V +DTGSD+ W+ C C +C + + F + SS+++ +C
Sbjct: 98 VQIDTGSDILWINC----ITCSNCPHSSGLGIELDFFDTAGSSTAALVSCG--------- 144
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTL----------K 112
DP +T S+ C S+ + YG+G TG DT+
Sbjct: 145 -----DPICSYAVQTATSECSSQANQC-SYTFQYGDGSGTTGYYVSDTMYFDTVLLGQSV 198
Query: 113 VHGSSPGIIREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQL---GFLQKGFSHCFLA 169
V SS II + G + T + GI GFG GALSV SQL G K FSHC
Sbjct: 199 VANSSSTIIFGCSTYQSGDLTKTDKAVDGIFGFGPGALSVISQLSSRGVTPKVFSHCL-- 256
Query: 170 FKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPN--YYYIGLEAITIGNSSLTEV 227
N LV+G++ S ++ SP+ P+ +Y + L++I + L
Sbjct: 257 ---KGGENGGGVLVLGEILEPS--------IVYSPLVPSQPHYNLNLQSIAVNGQLL--- 302
Query: 228 PLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLC 287
P+ F + N G +VDSGTT +L + Y+ + + + ++ + +K + + + C
Sbjct: 303 PIDSNVFATTNNQGTIVDSGTTLAYLVQEAYNPFVKAITAAVSQF--SKPIISKG--NQC 358
Query: 288 YRVPCPNNTFTDDLFPSITFHFLNNVSLVL-PQGNHFYAMSAPSNSSAVKCLLFQSMDDG 346
Y V + D+FP ++ +F+ S+VL P+ H+ + +A+ C+ FQ ++ G
Sbjct: 359 YLV----SNSVGDIFPQVSLNFMGGASMVLNPE--HYLMHYGFLDGAAMWCIGFQKVEQG 412
Query: 347 DYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
+ G ++ VYDL +RIG+ DC+
Sbjct: 413 ----FTILGDLVLKDKIFVYDLANQRIGWADYDCS 443
>gi|383130040|gb|AFG45740.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
Length = 155
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 59/147 (40%), Positives = 80/147 (54%), Gaps = 11/147 (7%)
Query: 182 LVIGDVAISSKDNLQFTPML-----KSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
LV+GD A+ + +L +TP L S Y +YYI L ++IG L +P L FD+
Sbjct: 2 LVLGDKALPTAMSLNYTPFLINTKASSSGYHTFYYIDLRGVSIGRKRL-NLPSKLFSFDT 60
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNT 296
+GNGG ++DSGTT+T E FY + + S I + RA EVE RTG LCY V ++
Sbjct: 61 KGNGGTIIDSGTTFTIFNEEFYKNITAAFASQIG-FRRASEVEARTGMRLCYNVSGVDHV 119
Query: 297 FTDDLFPSITFHFLNNVSLVLPQGNHF 323
L P FHF +VLP N+F
Sbjct: 120 ----LLPDFAFHFKGGSDMVLPVANYF 142
>gi|224128838|ref|XP_002320434.1| predicted protein [Populus trichocarpa]
gi|222861207|gb|EEE98749.1| predicted protein [Populus trichocarpa]
Length = 485
Score = 101 bits (252), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 107/403 (26%), Positives = 176/403 (43%), Gaps = 71/403 (17%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKL-MSNFSPSRSSSSSRDTCASSFCLNIHS 62
V +DTGSD+ WV C C +C + + ++ ++ + S + C FC I+
Sbjct: 93 VQVDTGSDIMWVNC----IQCRECPKTSSLGIDLTLYNINESDTGKLVPCDQEFCYEING 148
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIR 122
P GC T CP + YG+G G +D ++ + G ++
Sbjct: 149 GQLP-------GC--------TANMSCP-YLEIYGDGSSTAGYFVKDVVQ-YARVSGDLK 191
Query: 123 EIPK---FCFGC-------VGSTYREPI-GIAGFGRGALSVPSQL---GFLQKGFSHCFL 168
FGC +GS+ E + GI GFG+ S+ SQL G ++K F+HC
Sbjct: 192 TTAANGSVIFGCGARQSGDLGSSNEEALDGILGFGKSNSSMISQLAVTGKVKKIFAHCL- 250
Query: 169 AFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPN--YYYIGLEAITIGNSSLTE 226
+ N VIG V + K N+ +P+ PN +Y + + A+ +G+ L+
Sbjct: 251 -----DGTNGGGIFVIGHV-VQPKVNM-------TPLIPNQPHYNVNMTAVQVGHEFLS- 296
Query: 227 VPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDL 286
L F++ G ++DSGTT +LPE Y L+S I+ P K R +
Sbjct: 297 --LPTDVFEAGDRKGAIIDSGTTLAYLPEMVYKPLVS---KIISQQPDLKVHTVRDEYT- 350
Query: 287 CYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQS--MD 344
C++ + DD FP++TFHF N+V L + + + + C+ +Q+ +
Sbjct: 351 CFQY----SDSLDDGFPNVTFHFENSVILKVYPHEYLFPF------EGLWCIGWQNSGVQ 400
Query: 345 DGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASAQ 387
D + G N V+YDLE + IG+ +C+S+ Q
Sbjct: 401 SRDRRNMTLLGDLVLSNKLVLYDLENQAIGWTEYNCSSSIQVQ 443
>gi|357123876|ref|XP_003563633.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 503
Score = 101 bits (251), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 111/382 (29%), Positives = 157/382 (41%), Gaps = 61/382 (15%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V DTGSD TWV C C Y+ + F P++SS+ + +CA C ++ +S
Sbjct: 178 VVFDTGSDTTWVQCRPCVVSC-----YKQKDRL--FDPAKSSTYANVSCADPACADLDAS 230
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
GC+ L + YG+G G +DTL V +
Sbjct: 231 ----------GCNAGHCL----------YGIQYGDGSYTVGFFAKDTLAVAQDA------ 264
Query: 124 IPKFCFGCVGSTYR----EPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNI 178
I F FGC G R + G+ G GRG S+ Q G FS+C A A
Sbjct: 265 IKGFKFGC-GEKNRGLFGQTAGLLGLGRGPTSITVQAYEKYGGSFSYCLPASSAAT---- 319
Query: 179 SSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQG 238
SS N + TPML + P +YY+GL I +G L +P S+
Sbjct: 320 GYLEFGPLSPSSSGSNAKTTPML-TDKGPTFYYVGLTGIRVGGKQLGAIPESVFS----- 373
Query: 239 NGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFT 298
N G LVDSGT T LP+ Y+ L S + + K+ + D CY +
Sbjct: 374 NSGTLVDSGTVITRLPDTAYAALSSAFAAAMAASGY-KKAAAYSILDTCYDF----TGLS 428
Query: 299 DDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQ 358
P+++ F L L YA+S + CL F S +GD G+ G+ Q
Sbjct: 429 QVSLPTVSLVFQGGACLDLDASGIVYAIS-----QSQVCLGFAS--NGDDESVGIVGNTQ 481
Query: 359 QQNVEVVYDLEKERIGFQPMDC 380
Q+ V+YD+ K+ +GF P C
Sbjct: 482 QRTYGVLYDVSKKVVGFAPGAC 503
>gi|326503602|dbj|BAJ86307.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 461
Score = 101 bits (251), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 119/408 (29%), Positives = 173/408 (42%), Gaps = 66/408 (16%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + +DTGS+L+W+ C RN +F P SS+ + CAS+ C
Sbjct: 98 VTMVLDTGSELSWLLCAPAG--------ARNKFSAMSFRPRASSTFAAVPCASAQC---R 146
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
S D P P G S S C S + +Y +G G L D V GS P +
Sbjct: 147 SRDLP-SPPACDGAS------SRC-----SVSLSYADGSSSDGALATDVFAV-GSGPPL- 192
Query: 122 REIPKFCFGCVGSTY-REPIGIA-----GFGRGALSVPSQLGFLQKGFSHCFLAFKYAND 175
+ FGC+ S + P G+A G RGALS SQ + FS+C +D
Sbjct: 193 ----RAAFGCMSSAFDSSPDGVASAGLLGMNRGALSFVSQAS--TRRFSYCI------SD 240
Query: 176 PNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYY------IGLEAITIGNSSLTEVPL 229
+ + L++G + + L +TPM + P P Y+ + L I +G L +P
Sbjct: 241 RDDAGVLLLGHSDLPTFLPLNYTPMYQ-PALPLPYFDRVAYSVQLLGIRVGGKHL-PIPA 298
Query: 230 SLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEE-----RTGF 284
S+ D G G +VDSGT +T L YS L + + T P +++ + F
Sbjct: 299 SVLAPDHTGAGQTMVDSGTQFTFLLGDAYSALKA--EFTRQARPLLPALDDPSFAFQEAF 356
Query: 285 DLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSS-AVKCLLFQSM 343
D C+RVP + T L P +T F N + + Y + V CL F
Sbjct: 357 DTCFRVPQGRSPPTARL-PGVTLLF-NGAEMAVAGDRLLYKVPGERRGGDGVWCLTF--- 411
Query: 344 DDGDYGP--SGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASAQGL 389
+ D P + V G Q NV V YDLE+ R+G P+ C + GL
Sbjct: 412 GNADMVPIMAYVIGHHHQMNVWVEYDLERGRVGLAPVRCDVASQRLGL 459
>gi|356495754|ref|XP_003516738.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 420
Score = 101 bits (251), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 106/391 (27%), Positives = 164/391 (41%), Gaps = 81/391 (20%)
Query: 7 DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDNP 66
DTGSDLTW C C +C RN F P +S++ +C S C
Sbjct: 90 DTGSDLTWTSC----VPCNNCYKQRN----PMFDPQKSTTYRNISCDSKLCHK------- 134
Query: 67 FDPCTMSGCSLSTLLKSTCCRPCP--SFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREI 124
L + C P ++ Y Y + G+L ++T+ + S+ G +
Sbjct: 135 --------------LDTGVCSPQKRCNYTYAYASAAITRGVLAQETITL-SSTKGKSVPL 179
Query: 125 PKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLG--FLQKGFSHCFLAFKYANDPNI 178
FGC G +GI G G G +S+ SQ+G F K FS C + F D ++
Sbjct: 180 KGIVFGCGHNNTGGFNDHEMGIIGLGGGPVSLISQMGSSFGGKRFSQCLVPFH--TDVSV 237
Query: 179 SSPLVIGDVAISSKDNLQFTPML----KSPMYPNYYYIGLEAITIGNSSLTEVPLSLREF 234
SS + G + S + TP++ K+P Y++ L I++ N+ L F
Sbjct: 238 SSKMSFGKGSKVSGKGVVSTPLVAKQDKTP-----YFVTLLGISVENTYL--------HF 284
Query: 235 DSQG----NGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRV 290
+ G + +DSGT T LP Y Q+++ ++S + P + + G LCYR
Sbjct: 285 NGSSQNVEKGNMFLDSGTPPTILPTQLYDQVVAQVRSEVAMKPVTDDPD--LGPQLCYR- 341
Query: 291 PCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLF-QSMDDGDYG 349
T + P +T HF + L F S V CL F + DG
Sbjct: 342 -----TKNNLRGPVLTAHF-EGADVKLSPTQTFI-----SPKDGVFCLGFTNTSSDG--- 387
Query: 350 PSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
GV+G+F Q N + +DL+++ + F+P DC
Sbjct: 388 --GVYGNFAQSNYLIGFDLDRQVVSFKPKDC 416
>gi|242086412|ref|XP_002443631.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
gi|241944324|gb|EES17469.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
Length = 507
Score = 101 bits (251), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 107/386 (27%), Positives = 154/386 (39%), Gaps = 47/386 (12%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DT SDLTW+ C C C Y + + F P S+S +N + D
Sbjct: 158 LDTASDLTWLQC----QPCRRC--YPQSGPV--FDPRHSTSYGE--------MNYDAPD- 200
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVT--GILTRDTLKVHGSSPGIIRE 123
C G S K C + G G T G L +TL G +R+
Sbjct: 201 ----CQALGRSGGGDAKRGTCIYTVLYGDGDGHGSTSTSVGDLVEETLTFAGG----VRQ 252
Query: 124 IPKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFL--QKGFSHCFLAFKYANDPN 177
GC G GI G RG +S+P Q+ FL FS+C + F + +
Sbjct: 253 A-YLSIGCGHDNKGLFGAPAAGILGLSRGQISIPHQIAFLGYNASFSYCLVDF-ISGPGS 310
Query: 178 ISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGN---SSLTEVPLSLREF 234
SS L G A+ + FTP + + P +YY+ L +++G +TE L L +
Sbjct: 311 PSSTLTFGAGAVDTSPPASFTPTVLNQNMPTFYYVRLIGVSVGGVRVPGVTERDLQLDPY 370
Query: 235 DSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPN 294
G+GG+++DSGTT T L P Y+ ++ T + FD CY V
Sbjct: 371 --TGHGGVILDSGTTVTRLARPAYTAFRDAFRAAATGLGQVSTGGPSGLFDTCYTVGGRA 428
Query: 295 NTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVF 354
P+++ HF V L L N+ + +S C F D V
Sbjct: 429 GLRHCVKVPAVSMHFAGGVELSLQPKNYLITV----DSRGTVCFAFAGTGDRSVS---VI 481
Query: 355 GSFQQQNVEVVYDLEKERIGFQPMDC 380
G+ QQ VVYD+ +R+GF P C
Sbjct: 482 GNILQQGFRVVYDIGGQRVGFAPNSC 507
>gi|356537015|ref|XP_003537027.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 476
Score = 101 bits (251), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 103/398 (25%), Positives = 172/398 (43%), Gaps = 72/398 (18%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V +DTGSD+ WV C C +C + NF + SS++
Sbjct: 83 VQIDTGSDILWVNCNT----CSNCPQSSQLGIELNFFDTVGSSTAA-------------- 124
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRP----CPSFAYTYGEGGLVTGILTRDTL---KVHGS 116
PC+ C+ + C P C S+ + YG+G +G D + + G
Sbjct: 125 ---LIPCSDLICTSGVQGAAAECSPRVNQC-SYTFQYGDGSGTSGYYVSDAMYFNLIMGQ 180
Query: 117 SPGIIREIPKFCFGC-------VGSTYREPIGIAGFGRGALSVPSQL---GFLQKGFSHC 166
P + FGC + T + GI GFG G LSV SQL G K FSHC
Sbjct: 181 PPAV-NSTATIVFGCSISQSGDLTKTDKAVDGIFGFGPGPLSVVSQLSSQGITPKVFSHC 239
Query: 167 FLAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPN--YYYIGLEAITIGNSSL 224
D N LV+G++ S ++ SP+ P+ +Y + L++I + L
Sbjct: 240 L-----KGDGNGGGILVLGEILEPS--------IVYSPLVPSQPHYNLNLQSIAVNGQPL 286
Query: 225 TEVPLSLREFDSQGN-GGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTG 283
P++ F N GG +VD GTT +L + Y L++ + + ++ R + +
Sbjct: 287 ---PINPAVFSISNNRGGTIVDCGTTLAYLIQEAYDPLVTAINTAVSQSAR----QTNSK 339
Query: 284 FDLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSM 343
+ CY V +T D+FP ++ +F S+VL + + + + + + C+ FQ +
Sbjct: 340 GNQCYLV----STSIGDIFPLVSLNFEGGASMVL-KPEQYLMHNGYLDGAEMWCVGFQKL 394
Query: 344 DDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
+G + + G ++ VVYD+ ++RIG+ DC+
Sbjct: 395 QEG----ASILGDLVLKDKIVVYDIAQQRIGWANYDCS 428
>gi|302803839|ref|XP_002983672.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
gi|300148509|gb|EFJ15168.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
Length = 388
Score = 101 bits (251), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 101/390 (25%), Positives = 167/390 (42%), Gaps = 75/390 (19%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKL-MSNFSPSRSSSSSRDTCASSFCLNIHSSD 64
+DTGSDL WV C C+ C + + K+ + + S+SSS+
Sbjct: 53 VDTGSDLLWVNC----HPCIGCPAFSDLKIPIVPYDVKASASSSKV-------------- 94
Query: 65 NPFDPCTMSGCSLSTLLKSTCCR---PCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
PC+ C+L T + + C C +++ YG+G G L D L ++
Sbjct: 95 ----PCSDPSCTLITQISESGCNDQNQC-GYSFQYGDGSGTLGYLVEDVLHY------MV 143
Query: 122 REIPKFCFGC-------VGSTYREPIGIAGFGRGALSVPSQLGFLQKG---FSHCFLAFK 171
FGC + ++ R GI GFG LS SQL K F+HC +
Sbjct: 144 NATATVIFGCGFKQSGDLSTSERALDGIIGFGASDLSFNSQLAKQGKTPNVFAHCLDGGE 203
Query: 172 YANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSL 231
LV+G+V + ++Q+TP++ Y +Y + L++I++ N++LT P
Sbjct: 204 RGG-----GILVLGNVI---EPDIQYTPLVP---YMYHYNVVLQSISVNNANLTIDP--- 249
Query: 232 REFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVP 291
+ F + G + DSGTT +LP+ Y + + + R
Sbjct: 250 KLFSNDVMQGTIFDSGTTLAYLPDEAYQAFTQAVSLVVAPFLLCDTRLSR---------- 299
Query: 292 CPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPS 351
F LFP++ +F S+ L + ++ +N+ + C+ +QSM +
Sbjct: 300 -----FIYKLFPNVVLYF-EGASMTLTPAEYLIRQASAANA-PIWCMGWQSMGSAESELQ 352
Query: 352 -GVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
+FG +N VVYDLE+ RIG++P DC
Sbjct: 353 YTIFGDLVLKNKLVVYDLERGRIGWRPFDC 382
>gi|242086034|ref|XP_002443442.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
gi|241944135|gb|EES17280.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
Length = 443
Score = 101 bits (251), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 107/396 (27%), Positives = 160/396 (40%), Gaps = 69/396 (17%)
Query: 3 QVYMDTGSDLTWVPC-GNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ +DTGS L W C L C+ D + F+ S S S + C C
Sbjct: 100 EALIDTGSSLIWTQCTACLRKVCVRQD-------LPYFNASSSGSFAPVPCQDKAC---- 148
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
+ N C + G TC +F TYG GG++ G L D
Sbjct: 149 -AGNYLHFCALDG---------TC-----TFRVTYGAGGII-GFLGTDAFTFQSGGA--- 189
Query: 122 REIPKFCFGCVGST-YREP------IGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYAN 174
FGCV T + P G+ G GRG LS+ SQ G K FS+C + + N
Sbjct: 190 ----TLAFGCVSFTRFAAPDVLHGASGLIGLGRGRLSLASQTG--AKRFSYCLTPYFHNN 243
Query: 175 DPNISSPLVIGDVAISSKDNLQFTPM--LKSPM---YPNYYYIGLEAITIGNSSLT--EV 227
SS L +G A S M ++SP Y +YY+ L IT+G + L
Sbjct: 244 --GASSHLFVGAAASLSGGGGAVMSMAFVESPKDYPYSTFYYLPLVGITVGETKLAIPST 301
Query: 228 PLSLREFDSQ-GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDL 286
L+E + GG+++DSG+ +T L E Y L+ L + E+ G L
Sbjct: 302 AFDLQEVEEGFWEGGVIIDSGSPFTSLVEDAYEPLMGELARQLNGSLVPPPGEDDGGMAL 361
Query: 287 CYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDG 346
C D + P++ HF + LP N++ + + A+ QS
Sbjct: 362 CV-----ARGDLDRVVPTLVLHFSGGADMALPPENYWAPLEKSTACMAIVRGYLQS---- 412
Query: 347 DYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCAS 382
+ G+FQQQN+ +++D+ R+ FQ DC++
Sbjct: 413 ------IIGNFQQQNMHILFDVGGGRLSFQNADCST 442
>gi|194707632|gb|ACF87900.1| unknown [Zea mays]
Length = 423
Score = 101 bits (251), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 103/393 (26%), Positives = 171/393 (43%), Gaps = 59/393 (15%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRN-NKLMSNFSPSRSSSSSRDTCASSFCLNIHS 62
V +DTGSD+ WV C C C N + +F+P SS++SR TC+
Sbjct: 20 VQIDTGSDILWVTCS----PCTGCPTSSGLNIQLESFNPDSSSTASRITCSD-------- 67
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCP-SFAYTYGEGGLVTGILTRDTL---KVHGSSP 118
D CT + + +++ + P + +TYG+G +G DT+ V G+
Sbjct: 68 -----DRCTAGFQTGEAICQTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGNEQ 122
Query: 119 GIIREIPKFCFGCVGSTY-------REPIGIAGFGRGALSVPSQL---GFLQKGFSHCFL 168
FGC S R GI GFG+ LSV SQL G K FSHC
Sbjct: 123 -TANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCL- 180
Query: 169 AFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVP 228
N LV+G++ + L +TP++ P P +Y + LE+I + L P
Sbjct: 181 ----KGSDNGGGILVLGEIV---EPGLVYTPLV--PSQP-HYNLNLESIAVNGQKL---P 227
Query: 229 LSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCY 288
+ F + G +VDSGTT +L + Y +S + + ++ P + + + C+
Sbjct: 228 IDSSLFTTSNTQGTIVDSGTTLAYLADGAYDPFVSAIAAAVS--PSVRSLVSKG--SQCF 283
Query: 289 RVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDY 348
++ D FP++T +F+ V++ + N+ ++ N S + C+ +Q +
Sbjct: 284 ----ITSSSVDSSFPTVTLYFMGGVAMSVKPENYLLQQASVDN-SVLWCIGWQRNQGQEI 338
Query: 349 GPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
+ G ++ VYDL R+G+ DC+
Sbjct: 339 ---TILGDLVLKDKIFVYDLANMRMGWADYDCS 368
>gi|293336306|ref|NP_001168599.1| uncharacterized protein LOC100382383 [Zea mays]
gi|223949441|gb|ACN28804.1| unknown [Zea mays]
Length = 326
Score = 101 bits (251), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 103/384 (26%), Positives = 157/384 (40%), Gaps = 65/384 (16%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DTGSD+TWV C C DC Y+ + + F PS S+S + +C S C +
Sbjct: 1 MVLDTGSDVTWVQCQP----CADC--YQQSDPV--FDPSLSASYAAVSCDSQRCRD---- 48
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPS---FAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
L + CR + YG+G G +TL + S+P
Sbjct: 49 -----------------LDTAACRNATGACLYEVAYGDGSYTVGDFATETLTLGDSTP-- 89
Query: 121 IREIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPN 177
+ GC + G+ G G LS PSQ+ FS+C + D
Sbjct: 90 ---VGNVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQIS--ASTFSYCLVD----RDSP 140
Query: 178 ISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS- 236
+S L GD A ++ P+++SP +YY+ L I++G L+ +P S D+
Sbjct: 141 AASTLQFGDGA--AEAGTVTAPLVRSPRTSTFYYVALSGISVGGQPLS-IPASAFAMDAT 197
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNT 296
G+GG++VDSGT T L Y+ L PR V FD CY + +
Sbjct: 198 SGSGGVIVDSGTAVTRLQSAAYAALRDAFVQGAPSLPRTSGVSL---FDTCYDL----SD 250
Query: 297 FTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGS 356
T P+++ F +L LP N+ P + + CL F + + G+
Sbjct: 251 RTSVEVPAVSLRFEGGGALRLPAKNYLI----PVDGAGTYCLAFAPTN----AAVSIIGN 302
Query: 357 FQQQNVEVVYDLEKERIGFQPMDC 380
QQQ V +D + +GF P C
Sbjct: 303 VQQQGTRVSFDTARGAVGFTPNKC 326
>gi|302795261|ref|XP_002979394.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
gi|300153162|gb|EFJ19802.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
Length = 353
Score = 101 bits (251), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 109/380 (28%), Positives = 164/380 (43%), Gaps = 56/380 (14%)
Query: 4 VYM--DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
VYM DTGSD++W+ C C C YR + F+PS SSS CASS C +
Sbjct: 27 VYMVADTGSDVSWLQCS----PCRKC--YRQQDPI--FNPSLSSSFKPLACASSICGKLK 78
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
+ GCS K+ C + +YG+G G + +TL + +
Sbjct: 79 ----------IKGCSR----KNKCM-----YQVSYGDGSFTVGDFSTETLSFGEHA---V 116
Query: 122 REIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGF-LQKGFSHCFLAFKYANDPNISS 180
R + C + G+ G GRG LS PSQ G FS+C + A I++
Sbjct: 117 RSVAMGCGRNNQGLFHGAAGLLGLGRGPLSFPSQTGTSYASVFSYCLPRRESA----IAA 172
Query: 181 PLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNG 240
LV G A+ K +FT +L + YYY+GL I + S + +P S+G G
Sbjct: 173 SLVFGPSAVPEK--ARFTKLLPNRRLDTYYYVGLARIRVAGSPV-NIPPDAFAMGSRGTG 229
Query: 241 GLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDD 300
G++VDSGT + L P Y+ L +S +T +P A + FD CY + ++
Sbjct: 230 GVIVDSGTAISRLTTPAYTALRDAFRSLVT-FPSAPGISL---FDTCYDL----SSMKTA 281
Query: 301 LFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQ 360
P++ F S+ LP + + CL F ++ + G+ QQQ
Sbjct: 282 TLPAVVLDFDGGASMPLPADGILVNV----DDEGTYCLAFAPEEEA----FSIIGNVQQQ 333
Query: 361 NVEVVYDLEKERIGFQPMDC 380
+ D +KE++G P C
Sbjct: 334 TFRISIDNQKEQMGIAPDQC 353
>gi|302817380|ref|XP_002990366.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
gi|300141928|gb|EFJ08635.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
Length = 420
Score = 101 bits (251), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 109/380 (28%), Positives = 164/380 (43%), Gaps = 56/380 (14%)
Query: 4 VYM--DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
VYM DTGSD++W+ C C C YR + F+PS SSS CASS C +
Sbjct: 94 VYMVADTGSDVSWLQCS----PCRKC--YRQQDPI--FNPSLSSSFKPLACASSICGKLK 145
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
+ GCS K+ C + +YG+G G + +TL + +
Sbjct: 146 ----------IKGCSR----KNECM-----YQVSYGDGSFTVGDFSTETLSFGEHA---V 183
Query: 122 REIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGF-LQKGFSHCFLAFKYANDPNISS 180
R + C + G+ G GRG LS PSQ G FS+C + A I++
Sbjct: 184 RSVAMGCGRNNQGLFHGAAGLLGLGRGPLSFPSQTGTSYASVFSYCLPRRESA----IAA 239
Query: 181 PLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNG 240
LV G A+ K +FT +L + YYY+GL I + S + +P S+G G
Sbjct: 240 SLVFGPSAVPEK--ARFTKLLPNRRLDTYYYVGLARIRVAGSPV-NIPPDAFAMGSRGTG 296
Query: 241 GLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDD 300
G++VDSGT + L P Y+ L +S +T +P A + FD CY + ++
Sbjct: 297 GVIVDSGTAISRLTTPAYTALRDAFRSLVT-FPSAPGISL---FDTCYDL----SSMKTA 348
Query: 301 LFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQ 360
P++ F S+ LP + + CL F ++ + G+ QQQ
Sbjct: 349 TLPAVVLDFDGGASMPLPADGILVNV----DDEGTYCLAFAPEEEA----FSIIGNVQQQ 400
Query: 361 NVEVVYDLEKERIGFQPMDC 380
+ D +KE++G P C
Sbjct: 401 TFRISIDNQKEQMGIAPDQC 420
>gi|356557010|ref|XP_003546811.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 437
Score = 101 bits (251), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 106/389 (27%), Positives = 166/389 (42%), Gaps = 79/389 (20%)
Query: 7 DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDNP 66
DTGSDL WV C C +C ++ L F P +SS+ TC S C ++ S
Sbjct: 110 DTGSDLIWVQCS----PCQNCFP-QDTPL---FEPLKSSTFKAATCDSQPCTSVPPSQRQ 161
Query: 67 FDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIPK 126
C G + ++Y+YG+ G++ +TL + P
Sbjct: 162 ---CGKVGQCI--------------YSYSYGDKSFTVGVVGTETLSFGSTGDAQTVSFPS 204
Query: 127 FCFGC------VGSTYREPIGIAGFGRGALSVPSQLG-FLQKGFSHCFLAFKYANDPNIS 179
FGC T + G+ G G G LS+ SQLG + FS+C L F N +
Sbjct: 205 SIFGCGVYNNFTFHTSDKVTGLVGLGGGPLSLVSQLGPQIGYKFSYCLLPFS----SNST 260
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
S L G AI + + + TP++ P++P++Y++ LEA+TIG VP + +
Sbjct: 261 SKLKFGSEAIVTTNGVVSTPLIIKPLFPSFYFLNLEAVTIGQKV---VP------TGRTD 311
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTD 299
G +++DSGT T+L + FY+ ++ LQ ++ VE ++ P + D
Sbjct: 312 GNIIIDSGTVLTYLEQTFYNNFVASLQEVLS-------VESAQDLPFPFKFCFP---YRD 361
Query: 300 DLFPSITFHFL--------NNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPS 351
P I F F N+ + L N PS+ S +
Sbjct: 362 MTIPVIAFQFTGASVALQPKNLLIKLQDRNMLCLAVVPSSLSGI---------------- 405
Query: 352 GVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
+FG+ Q + +VVYDLE +++ F P DC
Sbjct: 406 SIFGNVAQFDFQVVYDLEGKKVSFAPTDC 434
>gi|223949775|gb|ACN28971.1| unknown [Zea mays]
gi|414590177|tpg|DAA40748.1| TPA: hypothetical protein ZEAMMB73_257146 [Zea mays]
Length = 510
Score = 101 bits (251), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 109/398 (27%), Positives = 165/398 (41%), Gaps = 67/398 (16%)
Query: 3 QVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHS 62
++ MDTGSDL W+ C C+DC + R F P+ SSS TC C +
Sbjct: 163 RMIMDTGSDLNWLQCA----PCLDCFEQRG----PVFDPAASSSYRNVTCGDQRCGLVAP 214
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRP----CPSFAYTYGEGGLVTGILTRDTLKVHGSSP 118
+ P C RP CP + Y YG+ TG L ++ V+ ++P
Sbjct: 215 PEAP----------------RACRRPAEDSCPYY-YWYGDQSNTTGDLALESFTVNLTAP 257
Query: 119 GIIREIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQL-GFLQKGFSHCFLAFKYAN 174
G R + FGC + G+ G GRG LS SQL FS+C + ++ +
Sbjct: 258 GASRRVDGVVFGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRAVYGHTFSYCLV--EHGS 315
Query: 175 DPNISSPLVIG-DVAISSKDNLQFTPML-KSPMYPNYYYIGLEAITIGNSSLTEVPLSLR 232
D S +V G D + + L++T S +YY+ L+ + +G L +
Sbjct: 316 D--AGSKVVFGEDYLVLAHPQLKYTAFAPTSSPADTFYYVKLKGVLVGGD-LLNISSDTW 372
Query: 233 EFDSQGNGGLLVDSGTTYTHLPEPFYS-------QLLSILQSTITYYPRAKEVEERTGFD 285
+ G+GG ++DSGTT ++ EP Y L+S L I +P +
Sbjct: 373 DVGKDGSGGTIIDSGTTLSYFVEPAYQVIRQAFVDLMSRLYPLIPDFPV---------LN 423
Query: 286 LCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDD 345
CY V + P ++ F + P N+F + + + CL +
Sbjct: 424 PCYNV----SGVERPEVPELSLLFADGAVWDFPAENYFVRL----DPDGIMCLAVRGTPR 475
Query: 346 GDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCAST 383
+ G+FQQQN VVYDL+ R+GF P CA
Sbjct: 476 TGMS---IIGNFQQQNFHVVYDLQNNRLGFAPRRCAEV 510
>gi|302824729|ref|XP_002994005.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
gi|300138167|gb|EFJ04945.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
Length = 462
Score = 101 bits (251), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 106/391 (27%), Positives = 177/391 (45%), Gaps = 60/391 (15%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASS-FCLNIHS 62
+ +DTGS+LTW+ C C C + + + +RS+S TC +S C N S
Sbjct: 115 LIVDTGSELTWLQC----LPCKVCAP----SVDTIYDAARSASYRPVTCNNSQLCSN--S 164
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIR 122
S + C + + C+ FA YG+G G L+ DTL + G
Sbjct: 165 SQGTYAYCA----------RGSQCQ----FAAFYGDGSFSYGSLSTDTLIMETVVGGKPV 210
Query: 123 EIPKFCFGCV-GSTYREPIG---IAGFGRGALSVPSQLG--FLQKGFSHCFLAFKYANDP 176
+ F FGC G P G I G G +++P QLG F K FSHCF ++
Sbjct: 211 TVQDFAFGCAQGDLELVPTGASGILGLNAGKMALPMQLGQRFGWK-FSHCFP--DRSSHL 267
Query: 177 NISSPLVIGDVAISSKDNLQFT--PMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREF 234
N + + G+ + + +Q+T + S + +Y++ L+ ++I + L +P
Sbjct: 268 NSTGVVFFGNAELP-HEQVQYTSVALTNSELQRKFYHVALKGVSINSHELVFLPR----- 321
Query: 235 DSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDL--CYRVPC 292
+++DSG++++ PF+SQL P K +E + DL C++V
Sbjct: 322 ----GSVVILDSGSSFSSFVRPFHSQLREAFLKHRP--PSLKHLEGDSFGDLGTCFKV-- 373
Query: 293 PNNTFTDDL---FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYG 349
+N D+L PS++ F + V++ +P ++ N + C F +DG
Sbjct: 374 -SNDDIDELHRTLPSLSLVFEDGVTIGIPSIGVLLPVARFQNHVKM-CFAF---EDGGPN 428
Query: 350 PSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
P V G++QQQN+ V YD+++ R+GF C
Sbjct: 429 PVNVIGNYQQQNLWVEYDIQRSRVGFARASC 459
>gi|10177232|dbj|BAB10606.1| protease-like protein [Arabidopsis thaliana]
Length = 539
Score = 101 bits (251), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 106/399 (26%), Positives = 178/399 (44%), Gaps = 64/399 (16%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNF-SPSRSSSSSRDTCASSFC-LNIH 61
V +DTGSD+ WV C + C C ++ NF P S ++S +C+ C I
Sbjct: 96 VQVDTGSDVLWVSCAS----CNGCPQTSGLQIQLNFFDPGSSVTASPISCSDQRCSWGIQ 151
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLK---VHGSSP 118
SSD SGCS+ ++ C ++ + YG+G +G D L+ + GSS
Sbjct: 152 SSD--------SGCSV----QNNLC----AYTFQYGDGSGTSGFYVSDVLQFDMIVGSSL 195
Query: 119 GIIREIPKFCFGCVGS-------TYREPIGIAGFGRGALSVPSQL---GFLQKGFSHCFL 168
+ FGC S + R GI GFG+ +SV SQL G + FSHC
Sbjct: 196 -VPNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCL- 253
Query: 169 AFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVP 228
+ LV+G++ + N+ FTP++ P P +Y + L +I++ +L P
Sbjct: 254 ----KGENGGGGILVLGEIV---EPNMVFTPLV--PSQP-HYNVNLLSISVNGQAL---P 300
Query: 229 LSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCY 288
++ F + G ++D+GTT +L E Y + + + ++ R + + CY
Sbjct: 301 INPSVFSTSNGQGTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPVVSKG----NQCY 356
Query: 289 RVPCPNNTFTDDLFPSITFHFLNNVSLVL-PQGNHFYAMSAPSNSSAVKCLLFQSMDDGD 347
+ T D+FP ++ +F S+ L PQ + +AV C+ FQ + +
Sbjct: 357 VI----TTSVGDIFPPVSLNFAGGASMFLNPQ--DYLIQQNNVGGTAVWCIGFQRIQNQG 410
Query: 348 YGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASA 386
+ G ++ VYDL +RIG+ DC+++ +
Sbjct: 411 I---TILGDLVLKDKIFVYDLVGQRIGWANYDCSTSVNV 446
>gi|356553775|ref|XP_003545228.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 559
Score = 101 bits (251), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 106/388 (27%), Positives = 166/388 (42%), Gaps = 50/388 (12%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DTGSDL W+ C C+ C + + P SSS +C C + S
Sbjct: 210 LILDTGSDLNWIQC----VPCIACFEQSG----PYYDPKDSSSFRNISCHDPRCQLVSSP 261
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPG---I 120
D P +PC S CP F Y YG+G TG +T V+ ++P
Sbjct: 262 DPP-NPCKAENQS------------CPYF-YWYGDGSNTTGDFALETFTVNLTTPNGKSE 307
Query: 121 IREIPKFCFGC---VGSTYREPIGIAGFGRGALSVPSQLGFLQ-KGFSHCFLAFKYANDP 176
++ + FGC + G+ G G+G LS SQ+ L + FS+C + ++
Sbjct: 308 LKHVENVMFGCGHWNRGLFHGAAGLLGLGKGPLSFASQMQSLYGQSFSYCLV--DRNSNA 365
Query: 177 NISSPLVIG-DVAISSKDNLQFTPML--KSPMYPNYYYIGLEAITIGNSSLTEVPLSLRE 233
++SS L+ G D + S NL FT K +YY+ + ++ + + L ++P
Sbjct: 366 SVSSKLIFGEDKELLSHPNLNFTSFGGGKDGSVDTFYYVQINSVMVDDEVL-KIPEETWH 424
Query: 234 FDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCP 293
S+G GG ++DSGTT T+ EP Y + I Y + VE CY V
Sbjct: 425 LSSEGAGGTIIDSGTTLTYFAEPAYEIIKEAFVRKIKGY---ELVEGLPPLKPCYNV--- 478
Query: 294 NNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGV 353
+ P F + P N+F + V CL ++ +
Sbjct: 479 -SGIEKMELPDFGILFADGAVWNFPVENYFIQIDP-----DVVCL---AILGNPRSALSI 529
Query: 354 FGSFQQQNVEVVYDLEKERIGFQPMDCA 381
G++QQQN ++YD++K R+G+ PM CA
Sbjct: 530 IGNYQQQNFHILYDMKKSRLGYAPMKCA 557
>gi|168008086|ref|XP_001756738.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162691976|gb|EDQ78335.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 174
Score = 100 bits (250), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 66/196 (33%), Positives = 103/196 (52%), Gaps = 28/196 (14%)
Query: 194 NLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLS--LREFDSQGNGGLLVDSGTTYT 251
+L+FTP+LK P+ +Y++ L A+ + + L P+S + + +S+GNGG ++D T +T
Sbjct: 1 HLEFTPLLKHPLVETFYFVNLVAVAVNGAKL---PISSKVLKMNSEGNGGAILDMSTRFT 57
Query: 252 HLPEPFYSQLLSILQSTI----TYYPRAKEVEERTGFDLCYRVPCPNNTFTDDLFPSITF 307
P + L+ L++ I PR F LCY NT T + P++T
Sbjct: 58 RFPNSAFDHLVKALKALIRLPTMVVPR---------FQLCYSTV---NTGTL-IIPTVTL 104
Query: 308 HFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQNVEVVYD 367
F N V + LP N F +++ + V CL +M G+ G + V GS QQQN +V D
Sbjct: 105 IFENGVRMRLPMENTFVSVTEQGD---VMCL---AMVPGNPGTATVIGSAQQQNFLIVID 158
Query: 368 LEKERIGFQPMDCAST 383
E R+GF P+ CAS+
Sbjct: 159 REASRLGFAPLQCASS 174
>gi|225427554|ref|XP_002266533.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 447
Score = 100 bits (250), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 114/384 (29%), Positives = 166/384 (43%), Gaps = 63/384 (16%)
Query: 7 DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDNP 66
DTGSDL W C C C + ++ F P++S + +C C N+
Sbjct: 113 DTGSDLLWRQCK----PCDSCYE----QIEPIFDPAKSKTYQILSCEGKSCSNLGG---- 160
Query: 67 FDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIPK 126
GCS +TC ++Y+YG+G +G L DTL + GS+ G +PK
Sbjct: 161 -----QGGCSD----DNTCI-----YSYSYGDGSHTSGDLAVDTLTI-GSTTGRPVSVPK 205
Query: 127 FCFGC---VGSTYREPIGIAGFGRGAL-SVPSQLGFLQKG-FSHCFLAFKYANDPNISSP 181
FGC G T+ G S+ SQL L G FS+C + NDP++SS
Sbjct: 206 VVFGCGHNNGGTFELHGSGLVGLGGGPLSMISQLRPLIGGRFSYCLVPL--GNDPSVSSK 263
Query: 182 LVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSL-----TEVPLSLREFDS 236
+ G I S TP L S +YY+ LE++++G+ L ++V L + D
Sbjct: 264 MHFGSRGIVSGAGAVSTP-LASRQPDTFYYLTLESMSVGSKKLAYKGFSKVGSPLADAD- 321
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNT 296
G +++DSGTT T LP+ FY L S + S I P + F LCY +
Sbjct: 322 --EGNIIIDSGTTLTLLPQDFYGTLESNVVSAIGGKPVR---DPNNVFSLCY------SN 370
Query: 297 FTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGS 356
+ P+IT HF+ L L N F + + C + D +FG+
Sbjct: 371 LSGLRIPTITAHFV-GADLELKPLNTFVQV-----QEDLFCFAMIPVSD-----LAIFGN 419
Query: 357 FQQQNVEVVYDLEKERIGFQPMDC 380
Q N V YDL+ + F+P DC
Sbjct: 420 LAQMNFLVGYDLKSRTVSFKPTDC 443
>gi|357481191|ref|XP_003610881.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512216|gb|AES93839.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 427
Score = 100 bits (250), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 113/385 (29%), Positives = 172/385 (44%), Gaps = 65/385 (16%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGSDL W C C C YR M F P RS + S C S C S +
Sbjct: 99 VDTGSDLVWAQCT----PCGGC--YRQKSPM--FEPLRSKTYSPIPCESEQCSFFGYSCS 150
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSS--PGIIRE 123
P C +++Y+Y + + G+L R+ + + P ++ +
Sbjct: 151 PQKMC--------------------AYSYSYADSSVTKGVLAREAITFSSTDGDPVVVGD 190
Query: 124 IPKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFL--QKGFSHCFLAFKYANDPN 177
I FGC G+ +GI G G G LS+ SQ+G L K FS C + F D +
Sbjct: 191 I---IFGCGHSNSGTFNENDMGIIGMGGGPLSLVSQIGTLYGSKRFSQCLVPFH--TDAH 245
Query: 178 ISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
S + G+ + S + + TP L S Y + LE I++G+ T V + E S+
Sbjct: 246 TSGTINFGEESDVSGEGVVTTP-LASEEGQTSYLVTLEGISVGD---TFVRFNSSETLSK 301
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTF 297
GN +++DSGT T++P+ FY +L+ L+ + P E + G LCYR +
Sbjct: 302 GN--IMIDSGTPATYIPQEFYERLVEELKVQSSLLPI--EDDPDLGTQLCYR------SE 351
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSF 357
T+ P +T HF +LP P + V C DGDY +FG+F
Sbjct: 352 TNLEGPILTAHFEGADVQLLP----IQTFIPPKD--GVFCFAMAGSTDGDY----IFGNF 401
Query: 358 QQQNVEVVYDLEKERIGFQPMDCAS 382
Q N+ + +DL+++ I F+P DC +
Sbjct: 402 AQSNILMGFDLDRKTISFKPTDCTN 426
>gi|356499344|ref|XP_003518501.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 561
Score = 100 bits (250), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 105/386 (27%), Positives = 165/386 (42%), Gaps = 50/386 (12%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGSDL W+ C C+ C + + P SSS +C C + + D
Sbjct: 214 LDTGSDLNWIQC----VPCIACFEQSG----PYYDPKDSSSFRNISCHDPRCQLVSAPDP 265
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPG---IIR 122
P PC S CP F Y YG+G TG +T V+ ++P ++
Sbjct: 266 P-KPCKAENQS------------CPYF-YWYGDGSNTTGDFALETFTVNLTTPNGTSELK 311
Query: 123 EIPKFCFGC---VGSTYREPIGIAGFGRGALSVPSQLGFLQ-KGFSHCFLAFKYANDPNI 178
+ FGC + G+ G G+G LS SQ+ L + FS+C + ++ ++
Sbjct: 312 HVENVMFGCGHWNRGLFHGAAGLLGLGKGPLSFASQMQSLYGQSFSYCLV--DRNSNASV 369
Query: 179 SSPLVIG-DVAISSKDNLQFTPML--KSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFD 235
SS L+ G D + S NL FT K +YY+ ++++ + + L ++P
Sbjct: 370 SSKLIFGEDKELLSHPNLNFTSFGGGKDGSVDTFYYVQIKSVMVDDEVL-KIPEETWHLS 428
Query: 236 SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNN 295
S+G GG ++DSGTT T+ EP Y + I Y + VE CY V +
Sbjct: 429 SEGAGGTIIDSGTTLTYFAEPAYEIIKEAFVRKIKGY---QLVEGLPPLKPCYNV----S 481
Query: 296 TFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFG 355
P F + P N+F + V CL ++ + G
Sbjct: 482 GIEKMELPDFGILFADEAVWNFPVENYFIWIDP-----EVVCL---AILGNPRSALSIIG 533
Query: 356 SFQQQNVEVVYDLEKERIGFQPMDCA 381
++QQQN ++YD++K R+G+ PM CA
Sbjct: 534 NYQQQNFHILYDMKKSRLGYAPMKCA 559
>gi|297603570|ref|NP_001054261.2| Os04g0677100 [Oryza sativa Japonica Group]
gi|255675885|dbj|BAF16175.2| Os04g0677100 [Oryza sativa Japonica Group]
Length = 464
Score = 100 bits (250), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 101/380 (26%), Positives = 159/380 (41%), Gaps = 63/380 (16%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +D+GSD+ WV C C C Y + F P+ SSS S +C S+ C
Sbjct: 145 LVVDSGSDVIWVQC----RPCEQC--YAQTDPL--FDPAASSSFSGVSCGSAICR----- 191
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
T+SG + C ++ TYG+G G L +TL + G++ ++
Sbjct: 192 -------TLSGTGCGGGGDAGKC----DYSVTYGDGSYTKGELALETLTLGGTA---VQG 237
Query: 124 IPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNISSPL 182
+ C + G+ G G GA+S+ QLG G FS+C LA + A + L
Sbjct: 238 VAIGCGHRNSGLFVGAAGLLGLGWGAMSLVGQLGGAAGGVFSYC-LASRGAGG---AGSL 293
Query: 183 VIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPL--SLREFDSQGNG 240
V+G + + + ++YY+GL I +G L PL SL + G G
Sbjct: 294 VLG----------RTEAVPRGRRASSFYYVGLTGIGVGGERL---PLQDSLFQLTEDGAG 340
Query: 241 GLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDD 300
G+++D+GT T LP Y+ L + PR+ V D CY + + +
Sbjct: 341 GVVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSL---LDTCYDL----SGYASV 393
Query: 301 LFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQ 360
P+++F+F L LP N + AV CL F G + G+ QQ+
Sbjct: 394 RVPTVSFYFDQGAVLTLPARNLLVEVGG-----AVFCLAFAPSSSG----ISILGNIQQE 444
Query: 361 NVEVVYDLEKERIGFQPMDC 380
+++ D +GF P C
Sbjct: 445 GIQITVDSANGYVGFGPNTC 464
>gi|356532672|ref|XP_003534895.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 449
Score = 100 bits (250), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 105/386 (27%), Positives = 164/386 (42%), Gaps = 67/386 (17%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V MDTGSD+ WV C C +CD N L F PS SS+ S C +
Sbjct: 116 VVMDTGSDILWVMCT----PCTNCD----NHLGLLFDPSMSSTFS-PLCKT--------- 157
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
PC GCS C P P F TY + +G+ RDT+ + G R
Sbjct: 158 -----PCDFKGCSR--------CDPIP-FTVTYADNSTASGMFGRDTVVFETTDEGTSR- 202
Query: 124 IPKFCFGCVGSTYREPI----GIAGFGRGALSVPSQLGFLQKGFSHCF--LAFKYANDPN 177
IP FGC + ++ GI G G S+ +++G + FS+C LA Y N
Sbjct: 203 IPDVLFGCGHNIGQDTDPGHNGILGLNNGPDSLATKIG---QKFSYCIGDLADPYYN--- 256
Query: 178 ISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
L++G+ A + F ++ +YY+ +E I++G L P + E
Sbjct: 257 -YHQLILGEGADLEGYSTPFE------VHNGFYYVTMEGISVGEKRLDIAPETF-EMKKN 308
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTF 297
GG+++D+G+T T L + + L +++ + + R +E+ Y +
Sbjct: 309 RTGGVIIDTGSTITFLVDSVHRLLSKEVRNLLGWSFRQTTIEKSPWMQCFY------GSI 362
Query: 298 TDDL--FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPS-GVF 354
+ DL FP +TFHF + L L G+ F + + V C+ + + +
Sbjct: 363 SRDLVGFPVVTFHFADGADLALDSGSFFNQL-----NDNVFCMTVGPVSSLNLKSKPSLI 417
Query: 355 GSFQQQNVEVVYDLEKERIGFQPMDC 380
G QQ+ V YDL + + FQ +DC
Sbjct: 418 GLLAQQSYSVGYDLVNQFVYFQRIDC 443
>gi|302794412|ref|XP_002978970.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
gi|300153288|gb|EFJ19927.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
Length = 462
Score = 100 bits (250), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 106/391 (27%), Positives = 176/391 (45%), Gaps = 60/391 (15%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASS-FCLNIHS 62
+ +DTGS+LTW+ C C C + + + +RS S TC +S C N S
Sbjct: 115 LIVDTGSELTWLKC----LPCKVCAP----SVDTIYDAARSVSYKPVTCNNSQLCSN--S 164
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIR 122
S + C + + C+ FA YG+G G L+ DTL + G
Sbjct: 165 SQGTYAYCA----------RGSQCQ----FAAFYGDGSFSYGSLSTDTLIMETVVGGKPV 210
Query: 123 EIPKFCFGCV-GSTYREPIG---IAGFGRGALSVPSQLG--FLQKGFSHCFLAFKYANDP 176
+ F FGC G P G I G G +++P QLG F K FSHCF ++
Sbjct: 211 TVQDFAFGCAQGDLELVPTGASGILGLNAGKMALPMQLGQRFGWK-FSHCFP--DRSSHL 267
Query: 177 NISSPLVIGDVAISSKDNLQFT--PMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREF 234
N + + G+ + + +Q+T + S + +Y++ L+ ++I + L +P
Sbjct: 268 NSTGVVFFGNAELP-HEQVQYTSVALTNSELQRKFYHVALKGVSINSHELVLLPR----- 321
Query: 235 DSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDL--CYRVPC 292
+++DSG++++ PF+SQL P K +E + DL C++V
Sbjct: 322 ----GSVVILDSGSSFSSFVRPFHSQLREAFLKHRP--PSLKHLEGDSFGDLGTCFKV-- 373
Query: 293 PNNTFTDDL---FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYG 349
+N D+L PS++ F + V++ +P ++ N + C F +DG
Sbjct: 374 -SNDDIDELHRTLPSLSLVFEDGVTIGIPSIGVLLPVARYQNHVKM-CFAF---EDGGPN 428
Query: 350 PSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
P V G++QQQN+ V YD+++ R+GF C
Sbjct: 429 PVNVIGNYQQQNLWVEYDIQRSRVGFARASC 459
>gi|18390865|ref|NP_563808.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|11993877|gb|AAG42922.1|AF329505_1 unknown protein [Arabidopsis thaliana]
gi|20260142|gb|AAM12969.1| unknown protein [Arabidopsis thaliana]
gi|22136092|gb|AAM91124.1| unknown protein [Arabidopsis thaliana]
gi|332190140|gb|AEE28261.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 492
Score = 100 bits (250), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 110/400 (27%), Positives = 178/400 (44%), Gaps = 79/400 (19%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKL-MSNFSPSRSSSSSRDTCASSFCLNIHS 62
V +DTGSD+ WV C + C C ++ +S F P SSS+S +C+ C +
Sbjct: 99 VQIDTGSDVLWVSCTS----CNGCPKTSELQIQLSFFDPGVSSSASLVSCSDRRCYSNFQ 154
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRD----------TLK 112
++ SGCS + L S+++ YG+G +G D TL
Sbjct: 155 TE--------SGCSPNNLC---------SYSFKYGDGSGTSGYYISDFMSFDTVITSTLA 197
Query: 113 VHGSSPGIIREIPKFCFGC-------VGSTYREPIGIAGFGRGALSVPSQL---GFLQKG 162
++ S+P F FGC + R GI G G+G+LSV SQL G +
Sbjct: 198 INSSAP--------FVFGCSNLQSGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRV 249
Query: 163 FSHCFLAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNS 222
FSHC D + +V+G + + + +TP++ P P +Y + L++I +
Sbjct: 250 FSHCL-----KGDKSGGGIMVLGQI---KRPDTVYTPLV--PSQP-HYNVNLQSIAVNGQ 298
Query: 223 SLTEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERT 282
L P+ F G ++D+GTT +LP+ YS + + + ++ Y R E
Sbjct: 299 IL---PIDPSVFTIATGDGTIIDTGTTLAYLPDEAYSPFIQAVANAVSQYGRPITYESYQ 355
Query: 283 GFDLCYRVPCPNNTFTD-DLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQ 341
F++ T D D+FP ++ F S+VL G Y S+ S++ C+ FQ
Sbjct: 356 CFEI---------TAGDVDVFPQVSLSFAGGASMVL--GPRAYLQIFSSSGSSIWCIGFQ 404
Query: 342 SMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
M + + G ++ VVYDL ++RIG+ DC+
Sbjct: 405 RM---SHRRITILGDLVLKDKVVVYDLVRQRIGWAEYDCS 441
>gi|326529233|dbj|BAK01010.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 441
Score = 100 bits (250), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 112/394 (28%), Positives = 169/394 (42%), Gaps = 73/394 (18%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGS+L W CG + C + + ++ SRSS+ + CA S L
Sbjct: 101 IDTGSNLIWTQCGT-TCGLKAC----AKQDLPYYNLSRSSTFAAVPCADSAKL------- 148
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
C +G L L +C +FA +YG G + + T G++
Sbjct: 149 ----CAANGVHLCGL-DGSC-----TFAASYGAGSVFGSLGTEAFTFQSGAA-------- 190
Query: 126 KFCFGCVGST------YREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNIS 179
K FGCV T G+ G GRG LS+ SQ G + FS+C Y + S
Sbjct: 191 KLGFGCVSLTRITKGALNGASGLIGLGRGRLSLVSQTGATK--FSYCLT--PYLRNHGAS 246
Query: 180 SPLVIGDVAISSKDNLQFT--PMLKSPM---YPNYYYIGLEAITIGNSSLT--EVPLSLR 232
S L +G A S T P +KSP Y +YY+ L I++G + L LR
Sbjct: 247 SHLFVGASASLSGGGGAVTSIPFVKSPEDYPYSTFYYLPLVGISVGETKLPIPSAAFELR 306
Query: 233 EFDSQ-GNGGLLVDSGTTYTHLPEPFYSQL----LSILQSTITYYPRAKEVEERTGFDLC 287
+ +GG+++D+G+ T L E YS L L ++ P TG DLC
Sbjct: 307 RVAAGYWSGGVIIDTGSPVTSLAEAAYSALSDEVARQLNRSLVQPP------ADTGLDLC 360
Query: 288 YRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGD 347
D + P + FHF + + G+++ P + S C+L +++G
Sbjct: 361 V-----ARQDVDKVVPVLVFHFGGGADMAVSAGSYW----GPVDKS-TACML---IEEGG 407
Query: 348 YGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
Y V G+FQQQ+V ++YD+ K + FQ DC+
Sbjct: 408 Y--ETVIGNFQQQDVHLLYDIGKGELSFQTADCS 439
>gi|413952263|gb|AFW84912.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 509
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 104/392 (26%), Positives = 167/392 (42%), Gaps = 57/392 (14%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRN-NKLMSNFSPSRSSSSSRDTCASSFCLNIHS 62
V +DTGSD+ WV C C C N + +F+P SS++SR TC+ C
Sbjct: 106 VQIDTGSDILWVTCS----PCTGCPTSSGLNIQLESFNPDSSSTASRITCSDDRCTAGFQ 161
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTL---KVHGSSPG 119
+ C S S C + +TYG+G +G DT+ V G+
Sbjct: 162 TGEAI-------CQTSNSQSSPC-----GYTFTYGDGSGTSGYYVSDTMFFETVMGNEQ- 208
Query: 120 IIREIPKFCFGCVGSTY-------REPIGIAGFGRGALSVPSQL---GFLQKGFSHCFLA 169
FGC S R GI GFG+ LSV SQL G K FSHC
Sbjct: 209 TANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCL-- 266
Query: 170 FKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPL 229
N LV+G++ + L +TP++ P P +Y + LE+I + L P+
Sbjct: 267 ---KGSDNGGGILVLGEIV---EPGLVYTPLV--PSQP-HYNLNLESIAVNGQKL---PI 314
Query: 230 SLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYR 289
F + G +VDSGTT +L + Y +S + + ++ P + + + C+
Sbjct: 315 DSSLFTTSNTQGTIVDSGTTLAYLADGAYDPFVSAIAAAVS--PSVRSLVSKG--SQCF- 369
Query: 290 VPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYG 349
++ D FP++T +F+ V++ + N+ ++ N S + C+ +Q +
Sbjct: 370 ---ITSSSVDSSFPTVTLYFMGGVAMSVKPENYLLQQASVDN-SVLWCIGWQRNQGQEI- 424
Query: 350 PSGVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
+ G ++ VYDL R+G+ DC+
Sbjct: 425 --TILGDLVLKDKIFVYDLANMRMGWADYDCS 454
>gi|224111722|ref|XP_002315953.1| predicted protein [Populus trichocarpa]
gi|222864993|gb|EEF02124.1| predicted protein [Populus trichocarpa]
Length = 484
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 102/383 (26%), Positives = 161/383 (42%), Gaps = 68/383 (17%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DTGSD+ WV C C DC Y+ + F P+ S+S S +C + C ++ S
Sbjct: 164 LILDTGSDVNWVQCA----PCADC--YQQADPI--FEPASSASFSTLSCNTRQCRSLDVS 215
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
+ D C + +YG+G G +T+ + GS+P
Sbjct: 216 ECRNDTCL--------------------YEVSYGDGSYTVGDFVTETITL-GSAP----- 249
Query: 124 IPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISS 180
+ GC + + G+ G G G+LS PSQ+ FS+C + + +
Sbjct: 250 VDNVAIGCGHNNEGLFVGAAGLLGLGGGSLSFPSQIN--ATSFSYCLVDRDSESASTLEF 307
Query: 181 PLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNG 240
+ A+S+ P+L++ +YY+GL +++G L +P S + D GNG
Sbjct: 308 NSTLPPNAVSA-------PLLRNHHLDTFYYVGLTGLSVGGE-LVSIPESAFQIDESGNG 359
Query: 241 GLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTG---FDLCYRVPCPNNTF 297
G++VDSGT T L Y+ L + R +++ G FD CY + N
Sbjct: 360 GVIVDSGTAITRLQTDVYNSLRD------AFVKRTRDLPSTNGIALFDTCYDLSSKGNV- 412
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSF 357
P+++FHF + L LP N+ P +S C F + G+
Sbjct: 413 ---EVPTVSFHFPDGKELPLPAKNYL----VPLDSEGTFCFAFAPTASS----LSIIGNV 461
Query: 358 QQQNVEVVYDLEKERIGFQPMDC 380
QQQ VVYDL +GF P C
Sbjct: 462 QQQGTRVVYDLVNHLVGFVPNKC 484
>gi|226508052|ref|NP_001150337.1| LOC100283967 precursor [Zea mays]
gi|195638522|gb|ACG38729.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 507
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 104/392 (26%), Positives = 167/392 (42%), Gaps = 57/392 (14%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRN-NKLMSNFSPSRSSSSSRDTCASSFCLNIHS 62
V +DTGSD+ WV C C C N + +F+P SS++SR TC+ C
Sbjct: 104 VQIDTGSDILWVTCS----PCTGCPTSSGLNIQLESFNPDSSSTASRITCSDDRCTAGFQ 159
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTL---KVHGSSPG 119
+ C S S C + +TYG+G +G DT+ V G+
Sbjct: 160 TGEAI-------CQTSNSQSSPC-----GYTFTYGDGSGTSGYYVSDTMFFETVMGNEQ- 206
Query: 120 IIREIPKFCFGCVGSTY-------REPIGIAGFGRGALSVPSQL---GFLQKGFSHCFLA 169
FGC S R GI GFG+ LSV SQL G K FSHC
Sbjct: 207 TANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCL-- 264
Query: 170 FKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPL 229
N LV+G++ + L +TP++ P P +Y + LE+I + L P+
Sbjct: 265 ---KGSDNGGGILVLGEIV---EPGLVYTPLV--PSQP-HYNLNLESIAVNGQKL---PI 312
Query: 230 SLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYR 289
F + G +VDSGTT +L + Y +S + + ++ P + + + C+
Sbjct: 313 DSSLFTTSNTQGTIVDSGTTLAYLADGAYDPFVSAIAAAVS--PSVRSLVSKG--SQCF- 367
Query: 290 VPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYG 349
++ D FP++T +F+ V++ + N+ ++ N S + C+ +Q +
Sbjct: 368 ---ITSSSVDSSFPTVTLYFMGGVAMSVKPENYLLQQASVDN-SVLWCIGWQRNQGQEI- 422
Query: 350 PSGVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
+ G ++ VYDL R+G+ DC+
Sbjct: 423 --TILGDLVLKDKIFVYDLANMRMGWADYDCS 452
>gi|357143680|ref|XP_003573011.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 510
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 108/388 (27%), Positives = 164/388 (42%), Gaps = 48/388 (12%)
Query: 3 QVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHS 62
Q+ MDTGSDL W+ C C+DC D R F P S+S TC + C +
Sbjct: 164 QMIMDTGSDLNWLQCA----PCLDCFDQRG----PVFDPMASTSYRNVTCGDTRCGLVSP 215
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIR 122
P C +S+ PCP + Y YG+ TG L + V+ ++ R
Sbjct: 216 ------PAAPRTC------RSSRSDPCPYY-YWYGDQSNTTGDLALEAFTVNLTASSS-R 261
Query: 123 EIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQL-GFLQKGFSHCFLAFKYANDPNI 178
+ GC + G+ G GRG LS SQL FS+C + A +
Sbjct: 262 RVDGVVLGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRAVYGHAFSYCLVDHGSA----V 317
Query: 179 SSPLVIGD-VAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
S +V GD + S L +T S +YY+ L+ I +G L ++P + +
Sbjct: 318 GSKIVFGDDNVLLSHPQLNYTAFAPSAAENTFYYVQLKGILVGGEML-DIPSNTWGVSKE 376
Query: 238 -GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNT 296
G+GG ++DSGTT ++ PEP Y +I Q+ + +A + D PC N +
Sbjct: 377 DGSGGTIIDSGTTLSYFPEPAYK---AIRQAFVDRMDKAYPLIA----DFPVLSPCYNVS 429
Query: 297 FTDDL-FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFG 355
+ + P + F + P N+F + ++ + CL + G
Sbjct: 430 GVERVEVPEFSLLFADGAVWDFPAENYFIRL----DTEGIMCLAVLGTPRS---AMSIIG 482
Query: 356 SFQQQNVEVVYDLEKERIGFQPMDCAST 383
++QQQN V+YDL R+GF P CA
Sbjct: 483 NYQQQNFHVLYDLHHNRLGFAPRRCAEV 510
>gi|296085498|emb|CBI29230.3| unnamed protein product [Vitis vinifera]
Length = 542
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 119/373 (31%), Positives = 159/373 (42%), Gaps = 80/373 (21%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGSDLTW C C C Y+ +++ F P SS+ +C +SFCL + +
Sbjct: 109 VDTGSDLTWTQCR----PCTHC--YK--QVVPLFDPKNSSTYRDSSCGTSFCLALGKDRS 160
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
CS K C +F Y+Y +G G L +TL V S+ G P
Sbjct: 161 ---------CS-----KEKKC----TFRYSYADGSFTGGNLASETLTVD-STAGKPVSFP 201
Query: 126 KFCFGCVGSTY----REPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNISS 180
F FGC S+ + GI G G G LS+ SQL G FS+C L + D +ISS
Sbjct: 202 GFAFGCGHSSGGIFDKSSSGIVGLGGGELSLISQLKSTINGLFSYCLLPV--STDSSISS 259
Query: 181 PLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNG 240
+ G S TP L+ P Y Y S TEV G
Sbjct: 260 RINFGASGRVSGYGTVSTP-LRLP-YKGY------------SKKTEVE----------EG 295
Query: 241 GLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTG-FDLCYRVPCPNNTFTD 299
++VDSGTTYT LP+ FYS+L + ++I + K V + G F LCY NT +
Sbjct: 296 NIIVDSGTTYTFLPQEFYSKLEKSVANSI----KGKRVRDPNGIFSLCY------NTTAE 345
Query: 300 DLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQ 359
P IT HF + ++ L N F M + C D GV G+ Q
Sbjct: 346 INAPIITAHF-KDANVELQPLNTFMRM-----QEDLVCFTVAPTSD-----IGVLGNLAQ 394
Query: 360 QNVEVVYDLEKER 372
N V +DL K+R
Sbjct: 395 VNFLVGFDLRKKR 407
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 46/144 (31%), Positives = 68/144 (47%), Gaps = 23/144 (15%)
Query: 239 NGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGF-DLCYRVPCPNNTF 297
G ++VDSGTTYT+LP FY + L+ ++ + + K V + G LCY NT
Sbjct: 417 EGNIIVDSGTTYTYLPLEFYVK----LEESVAHSIKGKRVRDPNGISSLCY------NTT 466
Query: 298 TDDL-FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGS 356
D + P IT HF + ++ L N F M + F + D G+ G+
Sbjct: 467 VDQIDAPIITAHF-KDANVELQPWNTFLRMQE-------DLVCFTVLPTSDI---GILGN 515
Query: 357 FQQQNVEVVYDLEKERIGFQPMDC 380
Q N V +DL K+R+ F+ DC
Sbjct: 516 LAQVNFLVGFDLRKKRVSFKAADC 539
>gi|388508518|gb|AFK42325.1| unknown [Lotus japonicus]
Length = 204
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 71/223 (31%), Positives = 112/223 (50%), Gaps = 26/223 (11%)
Query: 160 QKGFSHCFLAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITI 219
+ FS+C + D + +S L++G +A ++KD + TP+L +P P++YY+ LE I +
Sbjct: 3 EAKFSYCLTSM----DDSKASVLLLGSLAKATKDAIS-TPLLTNPSQPSFYYLSLEGIPV 57
Query: 220 GNSSLTEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLS--ILQSTITYYPRAKE 277
G + L+ + S+ + G+GG+++DSGTT T+L + + L I QS + +
Sbjct: 58 GGTQLS-IEQSIFDVSDDGSGGVIIDSGTTITYLEKSVFDTLKKEFISQSNLQL-----D 111
Query: 278 VEERTGFDLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKC 337
TG D+C+ +P P + FHF L LP ++ A S V C
Sbjct: 112 KSSSTGLDVCFSLPSETTQVE---VPKLVFHFKGG-DLELPAESYMIADSKL----GVAC 163
Query: 338 LLFQSMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
L G +FG+ QQQN+ V +DLEKE I F P C
Sbjct: 164 LAM-----GASNGMSIFGNVQQQNILVNHDLEKETISFVPTQC 201
>gi|356564743|ref|XP_003550608.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 490
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 104/392 (26%), Positives = 174/392 (44%), Gaps = 60/392 (15%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNF-SPSRSSSSSRDTCASSFCLN-IH 61
V +DTGSD+ WV C + C C ++ NF P SS+SS C+ C N I
Sbjct: 90 VQIDTGSDVLWVSCNS----CSGCPQTSGLQIQLNFFDPGSSSTSSMIACSDQRCNNGIQ 145
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
SSD + CS + C S+ + YG+G +G D + ++ G +
Sbjct: 146 SSD--------ATCSSQ---NNQC-----SYTFQYGDGSGTSGYYVSDMMHLNTIFEGSV 189
Query: 122 --REIPKFCFGC-------VGSTYREPIGIAGFGRGALSVPSQL---GFLQKGFSHCFLA 169
FGC + + R GI GFG+ +SV SQL G + FSHC
Sbjct: 190 TTNSTAPVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRVFSHCL-- 247
Query: 170 FKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPL 229
D + LV+G++ + N+ +T ++ P P +Y + L++I + +L +
Sbjct: 248 ---KGDSSGGGILVLGEIV---EPNIVYTSLV--PAQP-HYNLNLQSIAVNGQTL---QI 295
Query: 230 SLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYR 289
F + + G +VDSGTT +L E Y +S + ++I P++ G + CY
Sbjct: 296 DSSVFATSNSRGTIVDSGTTLAYLAEEAYDPFVSAITASI---PQSVHTVVSRG-NQCYL 351
Query: 290 VPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYG 349
+ ++FP ++ +F S++L ++ ++ +AV C+ FQ +
Sbjct: 352 ITSS----VTEVFPQVSLNFAGGASMILRPQDYLIQQNSI-GGAAVWCIGFQKIQGQGI- 405
Query: 350 PSGVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
+ G ++ VVYDL +RIG+ DC+
Sbjct: 406 --TILGDLVLKDKIVVYDLAGQRIGWANYDCS 435
>gi|357118064|ref|XP_003560779.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 472
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 109/391 (27%), Positives = 160/391 (40%), Gaps = 72/391 (18%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFC--LNIH 61
V +DTGSDL+WV C C D Y + F PS+SS+ + CAS C L +
Sbjct: 140 VLIDTGSDLSWVQCK----PCNASDCYPQKDPL--FDPSKSSTFATIPCASDACKQLPVD 193
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
DN GC+ +T C +A YG G + G+ + +TL + S+
Sbjct: 194 GYDN--------GCTNNTSGMPPQC----GYAIEYGNGAITEGVYSTETLALGSSA---- 237
Query: 122 REIPKFCFGCVGSTYREPI----GIAGFGRGALSVPSQLGFLQKG-FSHCF------LAF 170
+ F FGC GS P G+ G G S+ SQ + G FS+C F
Sbjct: 238 -VVKSFRFGC-GSDQHGPYDKFDGLLGLGGAPESLVSQTASVYGGAFSYCLPPLNSGAGF 295
Query: 171 KYANDPNISSPLVIGDVAISSKDNLQFTPMLK-SPMYPNYYYIGLEAITIGNSSLTEVPL 229
PN ++ +S FTPM SP +Y + L I++G +L P
Sbjct: 296 LTLGAPNSTN---------NSNSGFVFTPMHAFSPKIATFYVVTLTGISVGGKALDIPPA 346
Query: 230 SLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYR 289
++GN +VDSGT T +P Y L + +S + YP + + D CY
Sbjct: 347 VF----AKGN---IVDSGTVITGIPTTAYKALRTAFRSAMAEYPLLPPAD--SALDTCYN 397
Query: 290 VPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYG 349
+ T T P + F+ ++ L PS CL F DG +
Sbjct: 398 F-TGHGTVT---VPKVALTFVGGATVDL---------DVPSGVLVEDCLAFADAGDGSF- 443
Query: 350 PSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
G+ G+ + +EV+YD K +GF+ C
Sbjct: 444 --GIIGNVNTRTIEVLYDSGKGHLGFRAGAC 472
>gi|225216973|gb|ACN85264.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
Length = 517
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 111/389 (28%), Positives = 150/389 (38%), Gaps = 76/389 (19%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFC--LNIH 61
V DTGSD TWV C C + + KL F P RSS+ + +CA+ C LNIH
Sbjct: 193 VVFDTGSDTTWVQCQPCVVVCYE----QQEKL---FDPVRSSTYANVSCAAPACSDLNIH 245
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
GCS L + YG+G G DTL +
Sbjct: 246 ------------GCSGGHCL----------YGVQYGDGSYSIGFFAMDTLTLSSYD---- 279
Query: 122 REIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPN 177
+ F FGC + E G+ G GRG S+P Q G F+HC A
Sbjct: 280 -AVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSTGT--- 335
Query: 178 ISSPLVIGDVAISSKDNLQFTPMLKSPMY----PNYYYIGLEAITIGNSSLTEVPLSLRE 233
G + + + L +PM P +YYIG+ I +G L +P S+
Sbjct: 336 -------GYLDFGAGSPAAASARLTTPMLTDNGPTFYYIGMTGIRVGGQ-LLSIPQSVFA 387
Query: 234 FDSQGNGGLLVDSGTTYTHLPEPFYSQL--LSILQSTITYYPRAKEVEERTGFDLCYRVP 291
G +VDSGT T LP P YS L Y +A V D CY
Sbjct: 388 -----TAGTIVDSGTVITRLPPPAYSSLRYAFAAAMAARGYKKAPAVSL---LDTCYDF- 438
Query: 292 CPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPS 351
+ P+++ F L + YA SA + CL F + +DG G
Sbjct: 439 ---TGMSQVAIPTVSLLFQGGARLDVDASGIMYAASA-----SQVCLAFAANEDG--GDV 488
Query: 352 GVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
G+ G+ Q + V YD+ K+ +GF P C
Sbjct: 489 GIVGNTQLKTFGVAYDIGKKVVGFYPGVC 517
>gi|225430551|ref|XP_002283470.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 490
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 103/383 (26%), Positives = 153/383 (39%), Gaps = 67/383 (17%)
Query: 7 DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDNP 66
DTGSDLTW C C Y + F PS S S S +C S C + S+
Sbjct: 165 DTGSDLTWT-------QCEPCVGYCYQQREHIFDPSTSLSYSNVSCDSPSCEKLESATG- 216
Query: 67 FDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIPK 126
GCS ST L + YG+G G R+ L + +
Sbjct: 217 ----NSPGCSSSTCL----------YGIRYGDGSYSIGFFAREKLSLTSTD-----VFNN 257
Query: 127 FCFGCVGSTYRE----PIGIAGFGRGALSVPSQLG-FLQKGFSHCFLAFKYANDPNISSP 181
F FGC G R G+ G R LS+ SQ K FS+C + +
Sbjct: 258 FQFGC-GQNNRGLFGGTAGLLGLARNPLSLVSQTAQKYGKVFSYCLPSSSSST------- 309
Query: 182 LVIGDVAISSKDN----LQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
G ++ S D ++FTP + YP++Y++ + I++G L P+ F +
Sbjct: 310 ---GYLSFGSGDGDSKAVKFTPSEVNSDYPSFYFLDMVGISVGERKL---PIPKSVFSTA 363
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTF 297
G ++DSGT + LP YS + + + ++ YPR K V + D CY + + +
Sbjct: 364 GT---IIDSGTVISRLPPTVYSSVQKVFRELMSDYPRVKGV---SILDTCYDL----SKY 413
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSF 357
P I +F + L Y + + CL F D D + G+
Sbjct: 414 KTVKVPKIILYFSGGAEMDLAPEGIIYVLKV-----SQVCLAFAGNSDDD--EVAIIGNV 466
Query: 358 QQQNVEVVYDLEKERIGFQPMDC 380
QQ+ + VVYD + R+GF P C
Sbjct: 467 QQKTIHVVYDDAEGRVGFAPSGC 489
>gi|125561849|gb|EAZ07297.1| hypothetical protein OsI_29545 [Oryza sativa Indica Group]
Length = 451
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 104/395 (26%), Positives = 168/395 (42%), Gaps = 68/395 (17%)
Query: 3 QVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHS 62
++ +DTGSDL W C LS + + + + P SS+ + C+ C
Sbjct: 105 KLIVDTGSDLIWTQC-KLSSSTAVAARHGSPPV---YDPGESSTFAFLPCSDRLCQEGQF 160
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIR 122
S F CT K+ C + YG V G+L +T G R
Sbjct: 161 S---FKNCTS---------KNRCV-----YEDVYGSAAAV-GVLASETFTF-----GARR 197
Query: 123 EIP-KFCFGC--------VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYA 173
+ + FGC +G+T GI G +LS+ +QL + FS+C F
Sbjct: 198 AVSLRLGFGCGALSAGSLIGAT-----GILGLSPESLSLITQLKIQR--FSYCLTPFADK 250
Query: 174 NDPNISSPLVIGDVAISSKDN----LQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPL 229
+SPL+ G +A S+ +Q T ++ +P+ YYY+ L I++G+ L VP
Sbjct: 251 K----TSPLLFGAMADLSRHKTTRPIQTTAIVSNPVKTVYYYVPLVGISLGHKRLA-VPA 305
Query: 230 SLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYR 289
+ G GG +VDSG+T +L E + + + + + VE+ ++LC+
Sbjct: 306 ASLAMRPDGGGGTIVDSGSTVAYLVEAAFEAVKEAVMDVVRLPVANRTVED---YELCFV 362
Query: 290 VPCPNNTFTDDLF--PSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGD 347
+P + P + HF ++VLP+ N+F A + CL DG
Sbjct: 363 LPRRTAAAAMEAVQVPPLVLHFDGGAAMVLPRDNYFQEPRA-----GLMCLAVGKTTDG- 416
Query: 348 YGPSGV--FGSFQQQNVEVVYDLEKERIGFQPMDC 380
SGV G+ QQQN+ V++D++ + F P C
Sbjct: 417 ---SGVSIIGNVQQQNMHVLFDVQHHKFSFAPTQC 448
>gi|115476830|ref|NP_001062011.1| Os08g0469100 [Oryza sativa Japonica Group]
gi|42407408|dbj|BAD09566.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113623980|dbj|BAF23925.1| Os08g0469100 [Oryza sativa Japonica Group]
Length = 373
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 102/393 (25%), Positives = 167/393 (42%), Gaps = 64/393 (16%)
Query: 3 QVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHS 62
++ +DTGSDL W C LS + + + + P SS+ + C+ C
Sbjct: 27 KLIVDTGSDLIWTQC-KLSSSTAAAARHGSPPV---YDPGESSTFAFLPCSDRLC---QE 79
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIR 122
F CT K+ C + YG V G+L +T G R
Sbjct: 80 GQFSFKNCTS---------KNRCV-----YEDVYGSAAAV-GVLASETFTF-----GARR 119
Query: 123 EIP-KFCFGC--------VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYA 173
+ + FGC +G+T GI G +LS+ +QL + FS+C F
Sbjct: 120 AVSLRLGFGCGALSAGSLIGAT-----GILGLSPESLSLITQLKI--QRFSYCLTPFADK 172
Query: 174 NDPNISSPLVIGDVAISSKDN----LQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPL 229
+SPL+ G +A S+ +Q T ++ +P+ YYY+ L I++G+ L VP
Sbjct: 173 K----TSPLLFGAMADLSRHKTTRPIQTTAIVSNPVETVYYYVPLVGISLGHKRLA-VPA 227
Query: 230 SLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYR 289
+ G GG +VDSG+T +L E + + + + + VE+ ++LC+
Sbjct: 228 ASLAMRPDGGGGTIVDSGSTVAYLVEAAFEAVKEAVMDVVRLPVANRTVED---YELCFV 284
Query: 290 VPCPNNTFTDDLF--PSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGD 347
+P + P + HF ++VLP+ N+F A + CL DG
Sbjct: 285 LPRRTAAAAMEAVQVPPLVLHFDGGAAMVLPRDNYFQEPRA-----GLMCLAVGKTTDGS 339
Query: 348 YGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
G S + G+ QQQN+ V++D++ + F P C
Sbjct: 340 -GVS-IIGNVQQQNMHVLFDVQHHKFSFAPTQC 370
>gi|414584780|tpg|DAA35351.1| TPA: hypothetical protein ZEAMMB73_696016 [Zea mays]
Length = 524
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 95/390 (24%), Positives = 165/390 (42%), Gaps = 63/390 (16%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +D+GSD+ WV C C++C Y + F P+ S++ S +C S+ C +
Sbjct: 186 LVVDSGSDVMWVQC----KPCLEC--YVQADPL--FDPATSATFSGVSCGSAICRIL--- 234
Query: 64 DNPFDPC---TMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
P C + GC + +Y +G G L +TL + G++
Sbjct: 235 --PTSACGDGELGGCE---------------YEVSYADGSYTKGALALETLTLGGTA--- 274
Query: 121 IREIPKFCFGCVGSTYRE----PIGIAGFGRGALSVPSQLGFLQKG-FSHCFLA---FKY 172
+ GC G R G+ G G G +S+ QLG G FS+C + +
Sbjct: 275 ---VEGVVIGC-GHRNRGLFVGAAGLMGLGWGPMSLVGQLGGEVGGAFSYCLASRGGYGS 330
Query: 173 ANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLR 232
+ + LV+G + + + + P++++P P++YY+GL I +G+ L + L
Sbjct: 331 GAADDDAGWLVLGR-SEAVPEGAVWVPLVRNPRAPSFYYVGLSGIEVGDERL-PLQAGLF 388
Query: 233 EFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTIT-YYPRAKEVEERTGFDLCYRVP 291
+ G G +++D+GTT T LP+ Y+ L + PRA+ V D CY +
Sbjct: 389 QLTEDGAGDVVMDTGTTVTRLPQEAYAALRDAFVGALAGAVPRAQGVSSSV-LDTCYDL- 446
Query: 292 CPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPS 351
+ + P+++F F + L+L N + + CL F G
Sbjct: 447 ---SGYASVRVPTVSFCFDGDARLILAARNVLLEVDM-----GIYCLAFAPSSSG----L 494
Query: 352 GVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
+ G+ QQ +++ D IGF P +C
Sbjct: 495 SIMGNTQQAGIQITVDSANGYIGFGPANCG 524
>gi|125558627|gb|EAZ04163.1| hypothetical protein OsI_26305 [Oryza sativa Indica Group]
Length = 404
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 105/387 (27%), Positives = 159/387 (41%), Gaps = 96/387 (24%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V DTGS L W C C +C + F P+ SS+ S+ CASS C + S
Sbjct: 105 VLADTGSSLIWTQCA----PCTECAA----RPAPPFQPASSSTFSKLPCASSLCQFLTS- 155
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
P+ C +GC + Y YG G G L +TL V G+S
Sbjct: 156 --PYRTCNATGCV---------------YYYPYGMG-FTAGYLATETLHVGGAS------ 191
Query: 124 IPKFCFGC-----VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNI 178
P FGC VG++ GI G GR LS+ SQ+G + FS+C + A D
Sbjct: 192 FPGVTFGCSTENGVGNSSS---GIVGLGRSPLSLVSQVGVAR--FSYCLRSNADAGD--- 243
Query: 179 SSPLVIGDVAISSKDNLQFTPMLKSPMYP--NYYYIGLEAITIGNSSLTEVPLSLREFDS 236
SP++ G +A + N+Q TP+L++P P +YYY+ L IT+G T++P+++ +
Sbjct: 244 -SPILFGSLAKVTGGNVQSTPLLENPEMPSSSYYYVNLTGITVGA---TDLPMAMANLTT 299
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNT 296
NG R GFDLC+
Sbjct: 300 V-NG---------------------------------------TRFGFDLCFDA-TAAGG 318
Query: 297 FTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNS-SAVKCLLFQSMDDGDYGPSGVFG 355
P++ F + + ++F + S +AV+CLL + + + G
Sbjct: 319 GGGVPVPTLVLRFAGGAEYAVRRRSYFGVVEVDSQGRAAVECLLV--LPASEKLSISIIG 376
Query: 356 SFQQQNVEVVYDLEKERIGFQPMDCAS 382
+ Q ++ V+YDL+ F P DCA+
Sbjct: 377 NVMQMDLHVLYDLDGGMFSFAPADCAN 403
>gi|194707866|gb|ACF88017.1| unknown [Zea mays]
Length = 461
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 108/381 (28%), Positives = 165/381 (43%), Gaps = 66/381 (17%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DTGSD++WV C C C +++ F PS SS+ S +C S+ C +
Sbjct: 143 MLIDTGSDVSWVQCK----PCSQC----HSQADPLFDPSSSSTYSPFSCGSADCAQLGQE 194
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
N GCS S+ + + TYG+G TG + DTL + S+
Sbjct: 195 GN--------GCSSSSQCQ---------YIVTYGDGSSTTGTYSSDTLALGSSA------ 231
Query: 124 IPKFCFGC--VGSTYREPI-GIAGFGRGALSVPSQ-LGFLQKGFSHCFLAFKYANDPNIS 179
+ F FGC V S + + G+ G G GA S+ SQ G L + FS+C P+ S
Sbjct: 232 VRSFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQTAGTLGRAFSYCL-----PPTPSSS 286
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
L +G S TPML+S P +Y + L+AI +G L+ +P S+ +
Sbjct: 287 GFLTLGAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLS-IPASVF------S 339
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTD 299
G ++DSGT T LP YS L S ++ + YP A+ D C+ ++
Sbjct: 340 AGTVMDSGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSGI---LDTCFDFSGQSSVS-- 394
Query: 300 DLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQ 359
PS+ F + L + CL F + D G+ G+ QQ
Sbjct: 395 --IPSVALVFSGGAVVSLDASGIILS----------NCLAF--AGNSDDSSLGIIGNVQQ 440
Query: 360 QNVEVVYDLEKERIGFQPMDC 380
+ EV+YD+ + +GF+ C
Sbjct: 441 RTFEVLYDVGRGVVGFRAGAC 461
>gi|449499012|ref|XP_004160696.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 435
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 106/388 (27%), Positives = 165/388 (42%), Gaps = 58/388 (14%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
I DTGSD+ W C C +C Y+ + M F+PS+S++ + +C+S C +
Sbjct: 98 IIAVADTGSDIIWTQC----VPCTNC--YQQDLPM--FNPSKSTTYRKVSCSSPVC-SFT 148
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
DN CS +P +++ +YG+ G DTL + GS+ G +
Sbjct: 149 GEDN--------SCSF---------KPDCTYSISYGDNSHSQGDFAVDTLTM-GSTSGRV 190
Query: 122 REIPKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDP 176
P+ GC GS GI G G G S+ Q+G G FS+C ND
Sbjct: 191 VAFPRTAIGCGHDNAGSFDANVSGIVGLGLGPASLIKQMGSAVGGKFSYCLTPI--GNDD 248
Query: 177 NISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIG-NSSLTEVPLSLREFD 235
S+ L G A S TP+ S + ++Y + L+A+++G N++ S+
Sbjct: 249 GGSNKLNFGSNANVSGSGAVSTPIYISDKFKSFYSLKLKAVSVGRNNTFYSTANSIL--- 305
Query: 236 SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNN 295
G +++DSGTT T LP Y + ++I + + + C+
Sbjct: 306 -GGKANIIIDSGTTLTLLPVDLYHNFAKAISNSINL---QRTDDPNQFLEYCFE------ 355
Query: 296 TFTDDL-FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVF 354
T TDD P I HF +L L + N + S V CL F D D ++
Sbjct: 356 TTTDDYKVPFIAMHF-EGANLRLQRENVLIRV-----SDNVICLAFAGAQDNDI---SIY 406
Query: 355 GSFQQQNVEVVYDLEKERIGFQPMDCAS 382
G+ Q N V YD+ + F+PM+C +
Sbjct: 407 GNIAQINFLVGYDVTNMSLSFKPMNCVA 434
>gi|222637181|gb|EEE67313.1| hypothetical protein OsJ_24553 [Oryza sativa Japonica Group]
Length = 414
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 103/358 (28%), Positives = 160/358 (44%), Gaps = 56/358 (15%)
Query: 39 FSPSRSSSSSRDTCASSFCLNIHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGE 98
F P+ SS+ S+ CASS C + S P+ C +GC + Y YG
Sbjct: 96 FQPASSSTFSKLPCASSLCQFLTS---PYLTCNATGCV---------------YYYPYGM 137
Query: 99 GGLVTGILTRDTLKVHGSSPGIIREIPKFCFGC-----VGSTYREPIGIAGFGRGALSVP 153
G G L +TL V G+S P FGC VG++ GI G GR LS+
Sbjct: 138 G-FTAGYLATETLHVGGAS------FPGVAFGCSTENGVGNSSS---GIVGLGRSPLSLV 187
Query: 154 SQLGFLQKGFSHCFLAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPN--YYY 211
SQ+G + FS+C + A D SP++ G +A + +L++P P+ YYY
Sbjct: 188 SQVGVGR--FSYCLRSDADAGD----SPILFGSLAKVTGGK-SSPAILENPEMPSSSYYY 240
Query: 212 IGLEAITIGNSSLTEVPLSLREFD-SQGNG-----GLLVDSGTTYTHLPEPFYSQLLSIL 265
+ L IT+G T++P++ F ++G G G +VDSGTT T+L + Y+ +
Sbjct: 241 VNLTGITVGA---TDLPVTSTTFGFTRGAGAGLVGGTIVDSGTTLTYLVKEGYAMVKRAF 297
Query: 266 QSTITYYPRAKEVE-ERTGFDLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFY 324
S + V R GFDLC+ + P++ F + + ++
Sbjct: 298 LSQMATANLTTTVNGTRFGFDLCFDANAAGGG-SGVPVPTLVLRFAGGAEYAVRRRSYVG 356
Query: 325 AMSAPSNS-SAVKCLLFQSMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
+ S +AV+CLL + + + G+ Q ++ V+YDL+ F P DCA
Sbjct: 357 VVEVDSQGRAAVECLLV--LPASEKLSISIIGNVMQMDLHVLYDLDGGMFSFAPADCA 412
>gi|242062704|ref|XP_002452641.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
gi|241932472|gb|EES05617.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
Length = 466
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 111/387 (28%), Positives = 169/387 (43%), Gaps = 80/387 (20%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
MDTGSD++WV C C C ++++ S F PS SS+ S +C+S+ C+ + S
Sbjct: 148 MDTGSDVSWVQCK----PCSQC----HSEVDSLFDPSASSTYSPFSCSSAACVQLSQSQQ 199
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
+GCS S+ C+ + +Y +G TG + DTL + ++ I
Sbjct: 200 G------NGCS------SSQCQ----YIVSYVDGSSTTGTYSSDTLTLGSNA------IK 237
Query: 126 KFCFGCV----GSTYREPIGIAGFGRGALSVPSQ-LGFLQKGFSHCFLAFKYANDPNISS 180
F FGC G + G+ G G A S+ SQ G K FS+C P S
Sbjct: 238 GFQFGCSQSESGGFSDQTDGLMGLGGDAQSLVSQTAGTFGKAFSYCL-----PPTPGSSG 292
Query: 181 PLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNG 240
L +G +S+ TPML+S P YY + LEAI +G L +P S+ +
Sbjct: 293 FLTLG---AASRSGFVKTPMLRSTQIPTYYGVLLEAIRVGGQQL-NIPTSVF------SA 342
Query: 241 GLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDD 300
G ++DSGT T LP YS L S ++ + YP A + D C+ ++
Sbjct: 343 GSVMDSGTVITRLPPTAYSALSSAFKAGMKKYPPA---QPSGILDTCFDFSGQSSVS--- 396
Query: 301 LFPSITFHF-------LNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGV 353
PS+ F L+ ++L N A +A S+ S++ G
Sbjct: 397 -IPSVALVFSGGAVVNLDFNGIMLELDNWCLAFAANSDDSSL----------------GF 439
Query: 354 FGSFQQQNVEVVYDLEKERIGFQPMDC 380
G+ QQ+ EV+YD+ +GF+ C
Sbjct: 440 IGNVQQRTFEVLYDVGGGAVGFRAGAC 466
>gi|383130044|gb|AFG45742.1| Pinus taeda anonymous locus 2_3758_01 genomic sequence
Length = 155
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 58/147 (39%), Positives = 79/147 (53%), Gaps = 11/147 (7%)
Query: 182 LVIGDVAISSKDNLQFTPML-----KSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
LV+GD A+ + +L +TP L S Y +YYI L ++IG L +P L FD+
Sbjct: 2 LVLGDKALPTAMSLNYTPFLINTKASSSGYNTFYYIDLRGVSIGRKRL-NLPSKLFSFDN 60
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNT 296
+GNGG ++DSGTT+T E FY + + S I + RA EVE RTG LCY ++
Sbjct: 61 KGNGGTIIDSGTTFTIFNEEFYKNITAAFASQIG-FRRASEVEARTGMRLCYNASGVDHV 119
Query: 297 FTDDLFPSITFHFLNNVSLVLPQGNHF 323
L P FHF +VLP N+F
Sbjct: 120 ----LLPDFAFHFKGGSDMVLPVANYF 142
>gi|225423917|ref|XP_002281973.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 491
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 106/382 (27%), Positives = 158/382 (41%), Gaps = 61/382 (15%)
Query: 3 QVYM--DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNI 60
VYM DTGSD+ WV C C DC Y+ + F PS SSS + TC + C ++
Sbjct: 167 HVYMVVDTGSDVNWVQCA----PCADC--YQQADPI--FEPSFSSSYAPLTCETHQCKSL 218
Query: 61 HSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
S+ D C + +YG+G G +T+ + GS+
Sbjct: 219 DVSECRNDSCL--------------------YEVSYGDGSYTVGDFATETITLDGSAS-- 256
Query: 121 IREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKY--ANDPNI 178
+ + C + G+ G G G+LS PSQ+ FS+C + A+
Sbjct: 257 LNNVAIGCGHDNEGLFVGAAGLLGLGGGSLSFPSQIN--ASSFSYCLVNRDTDSASTLEF 314
Query: 179 SSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQG 238
+SP+ V P+L++ +YY+G+ I +G L+ +P S E D G
Sbjct: 315 NSPIPSHSVT---------APLLRNNQLDTFYYLGMTGIGVGGQMLS-IPRSSFEVDESG 364
Query: 239 NGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFT 298
NGG++VDSGT T L Y+ L + P V FD CY + ++
Sbjct: 365 NGGIIVDSGTAVTRLQSDVYNSLRDSFVRGTQHLPSTSGVAL---FDTCYDLSSRSSVEV 421
Query: 299 DDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQ 358
P+++FHF + L LP N+ P +S+ C F + G+ Q
Sbjct: 422 ----PTVSFHFPDGKYLALPAKNYLI----PVDSAGTFCFAFAPTTSA----LSIIGNVQ 469
Query: 359 QQNVEVVYDLEKERIGFQPMDC 380
QQ V YDL +GF P C
Sbjct: 470 QQGTRVSYDLSNSLVGFSPNGC 491
>gi|384252236|gb|EIE25712.1| acid protease [Coccomyxa subellipsoidea C-169]
Length = 599
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 107/409 (26%), Positives = 169/409 (41%), Gaps = 79/409 (19%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V +DTGS +T+VPC + +C + + F P+ SSSS+ C S C+
Sbjct: 77 VIVDTGSTITYVPCASCGRNCGP------HHKDAAFDPASSSSSAVIGCDSDKCI----C 126
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
P PC GCS R C ++ TY E G+L D L++ + ++
Sbjct: 127 GRP--PC---GCSEK--------REC-TYQRTYAEQSSSAGLLVSDQLQLRDGAVEVV-- 170
Query: 124 IPKFCFGC----VGSTY-REPIGIAGFGRGALSVPSQL---GFLQKGFSHCFLAFKYAND 175
FGC G Y +E GI G G +S+ +QL G + F+ CF + +
Sbjct: 171 -----FGCETKETGEIYNQEADGILGLGNSEVSLVNQLAGSGVIDDVFALCFGSVEG--- 222
Query: 176 PNISSPLVIGDVAISSKD-NLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREF 234
L++GDV + D LQ+T +L S +P+YY + LEA+ +G L P E
Sbjct: 223 ---DGALMLGDVDAAEYDVALQYTALLSSLAHPHYYSVQLEALWVGGQQLPVKPERYEE- 278
Query: 235 DSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYY-----------PRAKEVEERTG 283
G ++DSGTT+T+LP S+ + + ++ Y P KE
Sbjct: 279 ----GYGTVLDSGTTFTYLP----SEAFQLFKEAVSAYALEHGLNSVKGPDPKEKSFAQF 330
Query: 284 FDLCY----RVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLL 339
D+C+ + + + +FP F + V L N+ + + + +
Sbjct: 331 HDICFGGAPHAGHADQSKLEKVFPVFELQFADGVRLRTGPLNYLFMHTGEMGAYCLGVF- 389
Query: 340 FQSMDDGDYGPSG-VFGSFQQQNVEVVYDLEKERIGFQPMDCASTASAQ 387
D G SG + G +N+ V YD R+GF C + Q
Sbjct: 390 -------DNGASGTLLGGISFRNILVQYDRRNRRVGFGAASCQEIGARQ 431
>gi|223950123|gb|ACN29145.1| unknown [Zea mays]
gi|413923785|gb|AFW63717.1| hypothetical protein ZEAMMB73_445506 [Zea mays]
Length = 385
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 109/381 (28%), Positives = 166/381 (43%), Gaps = 66/381 (17%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DTGSD++WV C C C +++ F PS SS+ S +C S+ C +
Sbjct: 67 MLIDTGSDVSWVQCK----PCSQC----HSQADPLFDPSSSSTYSPFSCGSADCAQLGQE 118
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
N GCS S+ C+ + TYG+G TG + DTL + S+
Sbjct: 119 GN--------GCS-----SSSQCQ----YIVTYGDGSSTTGTYSSDTLALGSSA------ 155
Query: 124 IPKFCFGC--VGSTYREPI-GIAGFGRGALSVPSQ-LGFLQKGFSHCFLAFKYANDPNIS 179
+ F FGC V S + + G+ G G GA S+ SQ G L + FS+C P+ S
Sbjct: 156 VRSFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQTAGTLGRAFSYCL-----PPTPSSS 210
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
L +G S TPML+S P +Y + L+AI +G L+ +P S+ +
Sbjct: 211 GFLTLGAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLS-IPASVF------S 263
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTD 299
G ++DSGT T LP YS L S ++ + YP A+ D C+ ++
Sbjct: 264 AGTVMDSGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSGI---LDTCFDFSGQSSVS-- 318
Query: 300 DLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQ 359
PS+ F + L + CL F + D G+ G+ QQ
Sbjct: 319 --IPSVALVFSGGAVVSLDASGIILS----------NCLAF--AGNSDDSSLGIIGNVQQ 364
Query: 360 QNVEVVYDLEKERIGFQPMDC 380
+ EV+YD+ + +GF+ C
Sbjct: 365 RTFEVLYDVGRGVVGFRAGAC 385
>gi|242092900|ref|XP_002436940.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
gi|241915163|gb|EER88307.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
Length = 465
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 103/383 (26%), Positives = 158/383 (41%), Gaps = 67/383 (17%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
MDTGSD++WV C C + Y + F PS+SS+ + C + C + D+
Sbjct: 142 MDTGSDVSWVQCA----PCNSTECYPQKDPL--FDPSKSSTYAPIACGADACNKL--GDH 193
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
+ CT G T C + YG+G G+ + +T+ +PGI +
Sbjct: 194 YRNGCTSGG---------TQC----GYRVEYGDGSSTRGVYSNETITF---APGIT--VK 235
Query: 126 KFCFGCVGSTYREPI----GIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNISS 180
F FGC G R P G+ G G S+ Q + G FS+C A N
Sbjct: 236 DFHFGC-GHDQRGPSDKFDGLLGLGGAPESLVVQTASVYGGAFSYCLPAL---NSEAGFL 291
Query: 181 PLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNG 240
L + A ++ FTPM PM Y + + I++G L ++P S G
Sbjct: 292 ALGVRPSAATNTSAFVFTPMWHLPMDATSYMVNMTGISVGGKPL-DIPRSAFR------G 344
Query: 241 GLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDD 300
G+L+DSGT T LPE Y+ L + L+ YP + FD CY +++
Sbjct: 345 GMLIDSGTIVTELPETAYNALNAALRKAFAAYPMVASED----FDTCYNF----TGYSNV 396
Query: 301 LFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPS---GVFGSF 357
P + F ++ L P+ CL F+ + GP G+ G+
Sbjct: 397 TVPRVALTFSGGATIDL---------DVPNGILVKDCLAFR-----ESGPDVGLGIIGNV 442
Query: 358 QQQNVEVVYDLEKERIGFQPMDC 380
Q+ +EV+YD ++GF+ C
Sbjct: 443 NQRTLEVLYDAGHGKVGFRAGAC 465
>gi|449454652|ref|XP_004145068.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449470630|ref|XP_004153019.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 435
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 106/388 (27%), Positives = 165/388 (42%), Gaps = 58/388 (14%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
I DTGSD+ W C C +C Y+ + M F+PS+S++ + +C+S C +
Sbjct: 98 IIAVADTGSDIIWTQCE----PCTNC--YQQDLPM--FNPSKSTTYRKVSCSSPVC-SFT 148
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
DN CS +P +++ +YG+ G DTL + GS+ G +
Sbjct: 149 GEDN--------SCSF---------KPDCTYSISYGDNSHSQGDFAVDTLTM-GSTSGRV 190
Query: 122 REIPKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDP 176
P+ GC GS GI G G G S+ Q+G G FS+C ND
Sbjct: 191 VAFPRTAIGCGHDNAGSFDANVSGIVGLGLGPASLIKQMGSAVGGKFSYCLTPI--GNDD 248
Query: 177 NISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIG-NSSLTEVPLSLREFD 235
S+ L G A S TP+ S + ++Y + L+A+++G N++ S+
Sbjct: 249 GGSNKLNFGSNANVSGSGAVSTPIYISDKFKSFYSLKLKAVSVGRNNTFYSTANSIL--- 305
Query: 236 SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNN 295
G +++DSGTT T LP Y + ++I + + + C+
Sbjct: 306 -GGKANIIIDSGTTLTLLPVDLYHNFAKAISNSINL---QRTDDPNQFLEYCFE------ 355
Query: 296 TFTDDL-FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVF 354
T TDD P I HF +L L + N + S V CL F D D ++
Sbjct: 356 TTTDDYKVPFIAMHF-EGANLRLQRENVLIRV-----SDNVICLAFAGAQDNDI---SIY 406
Query: 355 GSFQQQNVEVVYDLEKERIGFQPMDCAS 382
G+ Q N V YD+ + F+PM+C +
Sbjct: 407 GNIAQINFLVGYDVTNMSLSFKPMNCVA 434
>gi|242056497|ref|XP_002457394.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
gi|241929369|gb|EES02514.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
Length = 509
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 105/387 (27%), Positives = 160/387 (41%), Gaps = 67/387 (17%)
Query: 3 QVYM--DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNI 60
++YM DTGSD+TWV C C DC Y+ + + F PS S+S + +C S C +
Sbjct: 181 ELYMVLDTGSDVTWVQCQP----CADC--YQQSDPV--FDPSLSASYAAVSCDSPRCRD- 231
Query: 61 HSSDNPFDPCTMSGCSLSTLLKSTCCRPCPS---FAYTYGEGGLVTGILTRDTLKVHGSS 117
L + CR + YG+G G +TL + S+
Sbjct: 232 --------------------LDTAACRNATGACLYEVAYGDGSYTVGDFATETLTLGDST 271
Query: 118 PGIIREIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYAN 174
P + GC + G+ G G LS PSQ+ FS+C +
Sbjct: 272 P-----VTNVAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQIS--ASTFSYCLVD----R 320
Query: 175 DPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREF 234
D +S L G A ++ + P+++SP +YY+ L I++G +L+ +P S
Sbjct: 321 DSPAASTLQFG--ADGAEADTVTAPLVRSPRTGTFYYVALSGISVGGQALS-IPSSAFAM 377
Query: 235 DS-QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCP 293
D+ G+GG++VDSGT T L Y+ L PR V FD CY +
Sbjct: 378 DATSGSGGVIVDSGTAVTRLQSSAYAALRDAFVRGTPSLPRTSGVSL---FDTCYDL--- 431
Query: 294 NNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGV 353
+ T P+++ F +L LP N+ P + + CL F + +
Sbjct: 432 -SDRTSVEVPAVSLRFEGGGALRLPAKNYLI----PVDGAGTYCLAFAPTN----AAVSI 482
Query: 354 FGSFQQQNVEVVYDLEKERIGFQPMDC 380
G+ QQQ V +D K +GF P C
Sbjct: 483 IGNVQQQGTRVSFDTAKGVVGFTPNKC 509
>gi|297812425|ref|XP_002874096.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297319933|gb|EFH50355.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 493
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 105/394 (26%), Positives = 174/394 (44%), Gaps = 64/394 (16%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNF-SPSRSSSSSRDTCASSFC-LNIH 61
V +DTGSD+ WV C + C C ++ NF P S +++ +C+ C I
Sbjct: 96 VQVDTGSDVLWVSCAS----CNGCPQTSGLQIQLNFFDPGSSVTATPVSCSDQRCSWGIQ 151
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLK---VHGSSP 118
SSD SGCS+ ++ C ++ + YG+G +G D L+ + GSS
Sbjct: 152 SSD--------SGCSV----QNNLC----AYTFQYGDGSGTSGFYVSDVLQFDMIVGSSL 195
Query: 119 GIIREIPKFCFGCVGS-------TYREPIGIAGFGRGALSVPSQL---GFLQKGFSHCFL 168
+ FGC S + R GI GFG+ +SV SQL G + FSHC
Sbjct: 196 -VPNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGLAPRVFSHCL- 253
Query: 169 AFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVP 228
+ LV+G++ + N+ FTP++ P P +Y + L +I++ +L P
Sbjct: 254 ----KGENGGGGILVLGEIV---EPNMVFTPLV--PSQP-HYNVNLLSISVNGQAL---P 300
Query: 229 LSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCY 288
++ F + G ++D+GTT +L E Y + + + ++ R + CY
Sbjct: 301 INPSVFSTSNGQGTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPVVSKGNQ----CY 356
Query: 289 RVPCPNNTFTDDLFPSITFHFLNNVSLVL-PQGNHFYAMSAPSNSSAVKCLLFQSMDDGD 347
+ T D+FP ++ +F S+ L PQ + +AV C+ FQ + +
Sbjct: 357 VIA----TSVADIFPPVSLNFAGGASMFLNPQ--DYLIQQNNVGGTAVWCIGFQRIQNQG 410
Query: 348 YGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
+ G ++ VYDL +RIG+ DC+
Sbjct: 411 I---TILGDLVLKDKIFVYDLVGQRIGWANYDCS 441
>gi|242045120|ref|XP_002460431.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
gi|241923808|gb|EER96952.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
Length = 481
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 107/384 (27%), Positives = 157/384 (40%), Gaps = 61/384 (15%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDD-YRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHS 62
V DTGSDL+WV C CD Y+ + + F PS+S++ S C + C + S
Sbjct: 153 VVFDTGSDLSWV-------QCKPCDGCYQQHDPL--FDPSQSTTYSAVPCGAQECRRLDS 203
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVH-GSSPGII 121
CS S CR + YG+ G L RDTL + SS
Sbjct: 204 GS----------CS------SGKCR----YEVVYGDMSQTDGNLARDTLTLGPSSSSSSS 243
Query: 122 REIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGF-LQKGFSHCFLAFKYANDPN 177
++ +F FGC + + G+ G GR +S+ SQ GFS+C P+
Sbjct: 244 DQLQEFVFGCGDDDTGLFGKADGLFGLGRDRVSLASQAAAKYGAGFSYCL--------PS 295
Query: 178 ISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
S+ + ++ N +FT M+ P++YY+ L I + ++ P R
Sbjct: 296 SSTAEGYLSLGSAAPPNARFTAMVTRSDTPSFYYLNLVGIKVAGRTVRVSPAVFR----- 350
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTF 297
G ++DSGT T LP Y+ L S + Y K + D CY N
Sbjct: 351 -TPGTVIDSGTVITRLPSRAYAALRSSFAGLMRRYSY-KRAPALSILDTCYDFTGRNKV- 407
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSF 357
PS+ F +L L G Y +N S CL F S +GD + G+
Sbjct: 408 ---QIPSVALLFDGGATLNLGFGEVLYV----ANKSQA-CLAFAS--NGDDTSIAILGNM 457
Query: 358 QQQNVEVVYDLEKERIGFQPMDCA 381
QQ+ VVYD+ ++IGF C+
Sbjct: 458 QQKTFAVVYDVANQKIGFGAKGCS 481
>gi|413923784|gb|AFW63716.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 531
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 110/381 (28%), Positives = 165/381 (43%), Gaps = 66/381 (17%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DTGSD++WV C C C +++ F PS SS+ S +C S+ C +
Sbjct: 213 MLIDTGSDVSWVQCK----PCSQC----HSQADPLFDPSSSSTYSPFSCGSADCAQLGQE 264
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
N GCS S S C + TYG+G TG + DTL + S+
Sbjct: 265 GN--------GCSSS----SQC-----QYIVTYGDGSSTTGTYSSDTLALGSSA------ 301
Query: 124 IPKFCFGC--VGSTYREPI-GIAGFGRGALSVPSQ-LGFLQKGFSHCFLAFKYANDPNIS 179
+ F FGC V S + + G+ G G GA S+ SQ G L + FS+C P+ S
Sbjct: 302 VRSFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQTAGTLGRAFSYCL-----PPTPSSS 356
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
L +G S TPML+S P +Y + L+AI +G L+ +P S+ +
Sbjct: 357 GFLTLGAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLS-IPASVF------S 409
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTD 299
G ++DSGT T LP YS L S ++ + YP A+ D C+ ++
Sbjct: 410 AGTVMDSGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSGI---LDTCFDFSGQSSVS-- 464
Query: 300 DLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQ 359
PS+ F + L + CL F + D G+ G+ QQ
Sbjct: 465 --IPSVALVFSGGAVVSLDASGIILS----------NCLAF--AGNSDDSSLGIIGNVQQ 510
Query: 360 QNVEVVYDLEKERIGFQPMDC 380
+ EV+YD+ + +GF+ C
Sbjct: 511 RTFEVLYDVGRGVVGFRAGAC 531
>gi|356543524|ref|XP_003540210.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 493
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 100/391 (25%), Positives = 169/391 (43%), Gaps = 58/391 (14%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNF-SPSRSSSSSRDTCASSFCLNIHS 62
V +DTGSD+ WV C + C C ++ NF P SS+SS C+ C N
Sbjct: 93 VQIDTGSDVLWVSCNS----CNGCPQTSGLQIQLNFFDPGSSSTSSMIACSDQRCNNGKQ 148
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII- 121
S + + CS + + YG+G +G D + ++ G +
Sbjct: 149 SSDATCSSQNNQCS---------------YTFQYGDGSGTSGYYVSDMMHLNTIFEGSMT 193
Query: 122 -REIPKFCFGC-------VGSTYREPIGIAGFGRGALSVPSQL---GFLQKGFSHCFLAF 170
FGC + + R GI GFG+ +SV SQL G + FSHC
Sbjct: 194 TNSTAPVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRIFSHCL--- 250
Query: 171 KYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLS 230
D + LV+G++ + N+ +T ++ P P +Y + L++I++ +L +
Sbjct: 251 --KGDSSGGGILVLGEIV---EPNIVYTSLV--PAQP-HYNLNLQSISVNGQTL---QID 299
Query: 231 LREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRV 290
F + + G +VDSGTT +L E Y +S + + I P++ G + CY +
Sbjct: 300 SSVFATSNSRGTIVDSGTTLAYLAEEAYDPFVSAITAAI---PQSVRTVVSRG-NQCYLI 355
Query: 291 PCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGP 350
D+FP ++ +F S++L ++ ++ +AV C+ FQ +
Sbjct: 356 TSS----VTDVFPQVSLNFAGGASMILRPQDYLIQQNSI-GGAAVWCIGFQKIQGQGI-- 408
Query: 351 SGVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
+ G ++ VVYDL +RIG+ DC+
Sbjct: 409 -TILGDLVLKDKIVVYDLAGQRIGWANYDCS 438
>gi|30692930|ref|NP_198475.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|66792626|gb|AAY56415.1| At5g36260 [Arabidopsis thaliana]
gi|332006680|gb|AED94063.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 482
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 105/406 (25%), Positives = 180/406 (44%), Gaps = 80/406 (19%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDT-CASSFCLNIHS 62
V +DTGSD+ WV C C C + + + S++SS+S++ C FC
Sbjct: 93 VQVDTGSDILWVNCA----PCPKCPVKTDLGIPLSLYDSKTSSTSKNVGCEDDFC----- 143
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCC---RPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPG 119
S +++S C +PC S+ YG+G G +D + + + G
Sbjct: 144 ---------------SFIMQSETCGAKKPC-SYHVVYGDGSTSDGDFIKDNITLEQVT-G 186
Query: 120 IIREIP---KFCFGC-------VGSTYREPIGIAGFGRGALSVPSQL---GFLQKGFSHC 166
+R P + FGC +G T GI GFG+ S+ SQL G ++ FSHC
Sbjct: 187 NLRTAPLAQEVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAAGGSTKRIFSHC 246
Query: 167 FLAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKS-PMYPN--YYYIGLEAITIGNSS 223
++ N +G+V +P++K+ P+ PN +Y + L+ + +
Sbjct: 247 L------DNMNGGGIFAVGEVE---------SPVVKTTPIVPNQVHYNVILKGMDVDGDP 291
Query: 224 LTEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTG 283
+ ++P SL + G+GG ++DSGTT +LP+ Y+ L+ + T + V+E
Sbjct: 292 I-DLPPSLAS--TNGDGGTIIDSGTTLAYLPQNLYNSLIEKI--TAKQQVKLHMVQETFA 346
Query: 284 FDLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQS- 342
C+ + TD FP + HF +++ L + ++ +++ + C +QS
Sbjct: 347 ---CFSF----TSNTDKAFPVVNLHFEDSLKLSVYPHDYLFSLRED-----MYCFGWQSG 394
Query: 343 -MDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASAQ 387
M D + G N VVYDLE E IG+ +C+S+ +
Sbjct: 395 GMTTQDGADVILLGDLVLSNKLVVYDLENEVIGWADHNCSSSIKVK 440
>gi|225216960|gb|ACN85252.1| aspartic proteinase nepenthesin-1 precursor [Oryza officinalis]
Length = 519
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 109/389 (28%), Positives = 150/389 (38%), Gaps = 76/389 (19%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFC--LNIH 61
V DTGSD TWV C C + + KL F P+RSS+ + +CA+ C LNIH
Sbjct: 195 VVFDTGSDTTWVQCQPCVVVCYE----QREKL---FDPARSSTYANVSCAAPACSDLNIH 247
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
GCS L + YG+G G DTL +
Sbjct: 248 ------------GCSGGHCL----------YGVQYGDGSYSIGFFAMDTLTLSS-----Y 280
Query: 122 REIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPN 177
+ F FGC + E G+ G GRG S+P Q G F+HC A
Sbjct: 281 DAVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSTGT--- 337
Query: 178 ISSPLVIGDVAISSKDNLQFTPMLKSPMY----PNYYYIGLEAITIGNSSLTEVPLSLRE 233
G + + + L +PM P +YY+G+ I +G L +P S+
Sbjct: 338 -------GYLDFGAGSLAAASARLTTPMLTDNGPTFYYVGMTGIRVGGQ-LLSIPQSVFA 389
Query: 234 FDSQGNGGLLVDSGTTYTHLPEPFYSQL--LSILQSTITYYPRAKEVEERTGFDLCYRVP 291
G +VDSGT T LP YS L Y +A V D CY
Sbjct: 390 -----TAGTIVDSGTVITRLPPAAYSSLRYAFAAAMAARGYKKAPAVSL---LDTCYDF- 440
Query: 292 CPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPS 351
+ P+++ F L + YA SA + CL F + +DG G
Sbjct: 441 ---TGMSQVAIPTVSLLFQGGARLDVDASGIMYAASA-----SQVCLAFAANEDG--GDV 490
Query: 352 GVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
G+ G+ Q + V YD+ K+ +GF P C
Sbjct: 491 GIVGNTQLKTFGVAYDIGKKVVGFYPGAC 519
>gi|255556768|ref|XP_002519417.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223541280|gb|EEF42831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 494
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 98/394 (24%), Positives = 172/394 (43%), Gaps = 67/394 (17%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V +DTGSD+ WV C + C +C + N+ + SSS++R
Sbjct: 96 VQIDTGSDVLWVTCSS----CSNCPQTSGLGIQLNYFDTTSSSTAR-------------- 137
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCP---SFAYTYGEGGLVTGILTRDTLK-------- 112
PC+ C+ +T C P S+A+ YG+G +G DT
Sbjct: 138 ---LVPCSHPICTSQIQTTATQCPPQSNQCSYAFQYGDGSGTSGYYVSDTFYFDAVLGES 194
Query: 113 -VHGSSPGIIREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQL---GFLQKGFSHCFL 168
+ SS I+ + G + T + GI GFG+G LSV SQL G + FSHC
Sbjct: 195 LIANSSAAIVFGCSTYQSGDLTKTDKAVDGIFGFGQGELSVISQLSSHGITPRVFSHCL- 253
Query: 169 AFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVP 228
+ + LV+G++ + + ++P++ P P +Y + L++I + L P
Sbjct: 254 ----KGEDSGGGILVLGEIL---EPGIVYSPLV--PSQP-HYNLDLQSIAVSGQLL---P 300
Query: 229 LSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYY--PRAKEVEERTGFDL 286
+ F + N G ++D+GTT +L E Y +S + + ++ P + +
Sbjct: 301 IDPAAFATSSNRGTIIDTGTTLAYLVEEAYDPFVSAITAAVSQLATPTINKGNQ------ 354
Query: 287 CYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDG 346
CY V + ++FP ++F+F +++L + ++ + +A+ C+ FQ + G
Sbjct: 355 CYLV----SNSVSEVFPPVSFNFAGGATMLLKPEEYLMYLTNYAG-AALWCIGFQKIQGG 409
Query: 347 DYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
+ G ++ VYDL +RIG+ DC
Sbjct: 410 ----ITILGDLVLKDKIFVYDLAHQRIGWANYDC 439
>gi|297822477|ref|XP_002879121.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297324960|gb|EFH55380.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 711
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 103/386 (26%), Positives = 164/386 (42%), Gaps = 75/386 (19%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
I+ +DTGS++TW C C+ C Y+ N + F PS+SS+ C H
Sbjct: 393 IEAVIDTGSEITWTQC----LPCVHC--YKQNAPI--FDPSKSSTFKEKRC--------H 436
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSS--PG 119
P++ F TY +G L T DT+ +H +S P
Sbjct: 437 DHSCPYE--------------------VDYFDKTYTKGTLAT-----DTVTIHSTSGEPF 471
Query: 120 IIREIPKFCFGCVGSTYREPI-GIAGFGRGALSVPSQLGFLQKGF-SHCFLAFKYANDPN 177
++ E C G S +R G G G LS+ +Q+G G S+CF N
Sbjct: 472 VMAETIIGC-GRNNSWFRPSFEGFVGLNWGPLSLITQMGGEYPGLMSYCFAG-------N 523
Query: 178 ISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
+S + G AI + T M + P +YY+ L+A+++G++ + + +
Sbjct: 524 GTSKINFGTNAIVGGGGVVSTTMFVTTARPGFYYLNLDAVSVGDTRIETLGTPFHALE-- 581
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFD-LCYRVPCPNNT 296
G +++DSGTT T+ PE + + + ++ + P A + TG D LCY +
Sbjct: 582 --GNIVIDSGTTLTYFPESYCNLVRQAVEHVVPAVPAA----DPTGNDLLCYY------S 629
Query: 297 FTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGS 356
T ++FP IT HF LVL + Y M S S + CL + +FG+
Sbjct: 630 NTTEIFPVITMHFSGGADLVLDK----YNMFMESYSGGLFCLAIICNNPTQ---EAIFGN 682
Query: 357 FQQQNVEVVYDLEKERIGFQPMDCAS 382
Q N V YD + F+P +C++
Sbjct: 683 RAQNNFLVGYDSSSLLVSFKPTNCSA 708
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 90/372 (24%), Positives = 142/372 (38%), Gaps = 97/372 (26%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
++ +DTGS+L W C C+ C D + F PS+SS+ C
Sbjct: 78 VEAVLDTGSELIWTQC----LPCLHCYDQK----APIFDPSKSSTFKETRC--------- 120
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
+T CP + Y + G L +T+ +H +S G+
Sbjct: 121 ---------------------NTPDHSCP-YKLVYDDKSYTQGTLATETVTIHSTS-GVP 157
Query: 122 REIPKFCFGCV----GSTYR-EPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDP 176
+P+ GC GS +R GI G RG+LS+ SQ+G G
Sbjct: 158 FVMPETIIGCSRNNSGSGFRPSSSGIVGLSRGSLSLISQMGGAYPG-------------- 203
Query: 177 NISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
D + T M YY+ L+A+++G++ + V
Sbjct: 204 ----------------DGVVSTTMFAKTAKRGQYYLNLDAVSVGDTRIETVGTPFHAL-- 245
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFD-LCYRVPCPNN 295
NG +++DSGT T+ P + + + ++ +T A V + + D LCY +N
Sbjct: 246 --NGNIVIDSGTPLTYFPVSYCNLVRKAVERVVT----ADRVVDPSRNDMLCYY----SN 295
Query: 296 TFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFG 355
T ++FP IT HF LVL + N + + N V CL + +FG
Sbjct: 296 TI--EIFPVITVHFSGGADLVLDKYNMYMEL----NRGGVFCLAIICNNPTQV---AIFG 346
Query: 356 SFQQQNVEVVYD 367
+ Q N V YD
Sbjct: 347 NRAQNNFLVGYD 358
>gi|242085924|ref|XP_002443387.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
gi|241944080|gb|EES17225.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
Length = 460
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 113/404 (27%), Positives = 174/404 (43%), Gaps = 72/404 (17%)
Query: 6 MDTGSDLTWVPCGNLSFD-CMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSD 64
+DTGS+L W C + C D ++ + PSRS ++ C + CL +
Sbjct: 101 IDTGSNLIWTQCSTCRANGCFGQD-------LTFYDPSRSRTAKPVACNDTACLLGSETR 153
Query: 65 NPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKV-HGSSPGIIRE 123
C G + + L YG G + G L + HG S
Sbjct: 154 -----CARDGKACAVLT-------------AYGAGA-IGGFLGTEVFTFGHGQSS---EN 191
Query: 124 IPKFCFGCVGSTYREP------IGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPN 177
FGC+ ++ P GI G GRG LS+PSQLG FS+C Y +D
Sbjct: 192 NVSLAFGCITASRLTPGSLDGASGIIGLGRGKLSLPSQLG--DNKFSYCLT--PYFSDAA 247
Query: 178 ISSPLVIGDVAISSKDNLQFT--PMLKSP---MYPNYYYIGLEAITIGNSSLTEVPLS-- 230
+S L +G A S T P LK+P + ++YY+ L IT+G + L +VP +
Sbjct: 248 NTSTLFVGASAGLSGGGAPATSVPFLKNPDDDPFDSFYYLPLTGITVGTAKL-DVPAAAF 306
Query: 231 -LREFDSQGNGGLLVDSGTTYTHLPEPFYS----QLLSILQSTITYYPRAKEVEERTGFD 285
LRE GG L+DSG+ +T L + Y +L+ L +++ P E G D
Sbjct: 307 DLREVAPAKWGGTLIDSGSPFTSLIDVAYQALRDELVRQLGASVVPPPAGAE-----GLD 361
Query: 286 LCYRVPCPNNTFTDDLFPSITFHFLNNV----SLVLPQGNHFYAMSAPSNSSAVKCLLFQ 341
LC P + L P + HF + +V+P N++ P + S ++F
Sbjct: 362 LCVGGVAPGDA--GKLVPPLVLHFGSGGGGGGDVVVPPENYW----GPVDDSTACMVVFS 415
Query: 342 SMDDGDYGP---SGVFGSFQQQNVEVVYDLEKERIGFQPMDCAS 382
S P + + G++ QQ++ ++YDL + + FQP DC+S
Sbjct: 416 SGGPNSTLPLNETTIIGNYMQQDMHLLYDLGQGVLSFQPADCSS 459
>gi|357137345|ref|XP_003570261.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 458
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 101/385 (26%), Positives = 154/385 (40%), Gaps = 74/385 (19%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V +DTGSD++WV C + L F P +SS+ + +C+S+ C +
Sbjct: 140 VMIDTGSDVSWVHC--------HARAGAGSSLF--FDPGKSSTYTPFSCSSAACTRLEGR 189
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
DN GCSL STC + YG+G TG DTL ++ + +
Sbjct: 190 DN--------GCSL----NSTC-----QYTVRYGDGSNTTGTYGSDTLALNST-----EK 227
Query: 124 IPKFCFGCV-------GSTYREPIGIAGFGRGALSVPSQLGFL-QKGFSHCFLAFKYAND 175
+ F FGC G + G+ G G GA S+ SQ FS+C A +
Sbjct: 228 VENFQFGCSETSDPGEGLDEDQTDGLMGLGGGAPSLVSQTAATYGSAFSYCLPATTRS-- 285
Query: 176 PNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFD 235
S L +G A + TPM +S P +Y++ L+ I +G + P
Sbjct: 286 ---SGFLTLG--ASTGTSGFVTTPMFRSRRAPTFYFVILQGINVGGDPVAISPTVFAA-- 338
Query: 236 SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNN 295
G ++DSGT T LP YS L + ++ + YPRA+ D C+ +N
Sbjct: 339 -----GSIMDSGTIITRLPPRAYSALSAAFRAGMRRYPRARAFSI---LDTCFDFTGQDN 390
Query: 296 TFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFG 355
P++ F + L Y CL F G + G
Sbjct: 391 VS----IPAVELVFSGGAVVDLDADGIMYG----------SCLAFAPATGGI---GSIIG 433
Query: 356 SFQQQNVEVVYDLEKERIGFQPMDC 380
+ QQ+ EV++D+ + +GF+P C
Sbjct: 434 NVQQRTFEVLHDVGQSVLGFRPGAC 458
>gi|9759039|dbj|BAB09366.1| aspartyl protease-like [Arabidopsis thaliana]
Length = 478
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 105/406 (25%), Positives = 180/406 (44%), Gaps = 80/406 (19%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDT-CASSFCLNIHS 62
V +DTGSD+ WV C C C + + + S++SS+S++ C FC
Sbjct: 89 VQVDTGSDILWVNCA----PCPKCPVKTDLGIPLSLYDSKTSSTSKNVGCEDDFC----- 139
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCC---RPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPG 119
S +++S C +PC S+ YG+G G +D + + + G
Sbjct: 140 ---------------SFIMQSETCGAKKPC-SYHVVYGDGSTSDGDFIKDNITLEQVT-G 182
Query: 120 IIREIP---KFCFGC-------VGSTYREPIGIAGFGRGALSVPSQL---GFLQKGFSHC 166
+R P + FGC +G T GI GFG+ S+ SQL G ++ FSHC
Sbjct: 183 NLRTAPLAQEVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAAGGSTKRIFSHC 242
Query: 167 FLAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKS-PMYPN--YYYIGLEAITIGNSS 223
++ N +G+V +P++K+ P+ PN +Y + L+ + +
Sbjct: 243 L------DNMNGGGIFAVGEVE---------SPVVKTTPIVPNQVHYNVILKGMDVDGDP 287
Query: 224 LTEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTG 283
+ ++P SL + G+GG ++DSGTT +LP+ Y+ L+ + T + V+E
Sbjct: 288 I-DLPPSLAS--TNGDGGTIIDSGTTLAYLPQNLYNSLIEKI--TAKQQVKLHMVQETFA 342
Query: 284 FDLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQS- 342
C+ + TD FP + HF +++ L + ++ +++ + C +QS
Sbjct: 343 ---CFSF----TSNTDKAFPVVNLHFEDSLKLSVYPHDYLFSLRED-----MYCFGWQSG 390
Query: 343 -MDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASAQ 387
M D + G N VVYDLE E IG+ +C+S+ +
Sbjct: 391 GMTTQDGADVILLGDLVLSNKLVVYDLENEVIGWADHNCSSSIKVK 436
>gi|115465771|ref|NP_001056485.1| Os05g0590000 [Oryza sativa Japonica Group]
gi|49328116|gb|AAT58814.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113580036|dbj|BAF18399.1| Os05g0590000 [Oryza sativa Japonica Group]
Length = 481
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 103/392 (26%), Positives = 162/392 (41%), Gaps = 68/392 (17%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DTGSD+ W+ C C C Y + + F P RS S + C + C + S+
Sbjct: 143 MVLDTGSDVVWLQCA----PCRHC--YAQSGRV--FDPRRSRSYAAVDCVAPICRRLDSA 194
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
GC +++C + YG+G + G +TL +
Sbjct: 195 ----------GCDRR---RNSCL-----YQVAYGDGSVTAGDFASETLTFARGA-----R 231
Query: 124 IPKFCFGCVGSTYREPIGIAG-----FGRGALSVPSQLGF-LQKGFSHCFLAFKYANDPN 177
+ + GC E + IA GRG LS PSQ+ + FS+C + + P+
Sbjct: 232 VQRVAIGC--GHDNEGLFIAASGLLGLGRGRLSFPSQIARSFGRSFSYCLVDRTSSVRPS 289
Query: 178 --ISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLS-LREF 234
SS + G A+++ FTPM ++P +YY+ L ++G + + V S LR
Sbjct: 290 STRSSTVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSDLRLN 349
Query: 235 DSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTG------FDLCY 288
+ G GG+++DSGT+ T L P Y + RA V R FD CY
Sbjct: 350 PTTGRGGVILDSGTSVTRLARPVYEAVRDAF--------RAAAVGLRVSPGGFSLFDTCY 401
Query: 289 RVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDY 348
+ P+++ H S+ LP N+ P ++S C D G
Sbjct: 402 NLSGRRVV----KVPTVSMHLAGGASVALPPENYLI----PVDTSGTFCFAMAGTDGG-- 451
Query: 349 GPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
+ G+ QQQ VV+D + +R+GF P C
Sbjct: 452 --VSIIGNIQQQGFRVVFDGDAQRVGFVPKSC 481
>gi|224053042|ref|XP_002297678.1| predicted protein [Populus trichocarpa]
gi|222844936|gb|EEE82483.1| predicted protein [Populus trichocarpa]
Length = 482
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 101/385 (26%), Positives = 168/385 (43%), Gaps = 57/385 (14%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ V +DTGSDL+WV C C C + ++ F+PS S S C+S C ++
Sbjct: 146 MTVIVDTGSDLSWVQCQ----PCKRCYNQQD----PVFNPSTSPSYRTVLCSSPTCQSLQ 197
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
S+ C + P ++ YG+G G L + L + S+
Sbjct: 198 SATGNLGVCGSN-------------PPSCNYVVNYGDGSYTRGELGTEHLDLGNST---- 240
Query: 122 REIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPN 177
+ F FGC + + G+ G GR +LS+ SQ + G FS+C +
Sbjct: 241 -AVNNFIFGCGRNNQGLFGGASGLVGLGRSSLSLISQTSAMFGGVFSYCLPI----TETE 295
Query: 178 ISSPLVIGDVAISSKDN--LQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFD 235
S LV+G + K+ + +T M+ +P P +Y++ L IT+G+ ++ + P
Sbjct: 296 ASGSLVMGGNSSVYKNTTPISYTRMIPNPQLP-FYFLNLTGITVGSVAV-QAP------- 346
Query: 236 SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNN 295
S G G+++DSGT T LP Y L + +P A D C+ + +
Sbjct: 347 SFGKDGMMIDSGTVITRLPPSIYQALKDEFVKQFSGFPSAPAFMI---LDTCFNL----S 399
Query: 296 TFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFG 355
+ + P+I HF N L + FY + +++S V CL S+ + G+ G
Sbjct: 400 GYQEVEIPNIKMHFEGNAELNVDVTGVFYFVK--TDASQV-CLAIASLSYEN--EVGIIG 454
Query: 356 SFQQQNVEVVYDLEKERIGFQPMDC 380
++QQ+N V+YD + +GF C
Sbjct: 455 NYQQKNQRVIYDTKGSMLGFAAEAC 479
>gi|115448349|ref|NP_001047954.1| Os02g0720600 [Oryza sativa Japonica Group]
gi|45735841|dbj|BAD12876.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
[Oryza sativa Japonica Group]
gi|45735967|dbj|BAD12996.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
[Oryza sativa Japonica Group]
gi|113537485|dbj|BAF09868.1| Os02g0720600 [Oryza sativa Japonica Group]
gi|125583492|gb|EAZ24423.1| hypothetical protein OsJ_08176 [Oryza sativa Japonica Group]
gi|215740989|dbj|BAG97484.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 463
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 102/381 (26%), Positives = 152/381 (39%), Gaps = 63/381 (16%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V +DTGSD++WV C C + Y + F P++SS+ +CA++ C +
Sbjct: 142 VTIDTGSDVSWVQCN----PCPNPPCYAQTGAL--FDPAKSSTYRAVSCAAAECAQLEQQ 195
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
N GC + C+ + YG+G G +RDTL + G+S +
Sbjct: 196 GN--------GCGATNYE----CQ----YGVQYGDGSTTNGTYSRDTLTLSGASDAV--- 236
Query: 124 IPKFCFGC--VGSTYREPI-GIAGFGRGALSVPSQLGFL-QKGFSHCFLAFKYANDPNIS 179
F FGC V S + + G+ G G GA S+ SQ FS+C P
Sbjct: 237 -KGFQFGCSHVESGFSDQTDGLMGLGGGAQSLVSQTAAAYGNSFSYCL-------PPTSG 288
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
S + T ML+S P +Y L+ I +G L P S
Sbjct: 289 SSGFLTLGGGGGVSGFVTTRMLRSRQIPTFYGARLQDIAVGGKQLGLSP-------SVFA 341
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTD 299
G +VDSGT T LP YS L S ++ + Y + R+ D C+ T
Sbjct: 342 AGSVVDSGTIITRLPPTAYSALSSAFKAGMKQY---RSAPARSILDTCFDFAGQ----TQ 394
Query: 300 DLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQ 359
P++ F ++ L Y CL F + GD G +G+ G+ QQ
Sbjct: 395 ISIPTVALVFSGGAAIDLDPNGIMYG----------NCLAFAAT--GDDGTTGIIGNVQQ 442
Query: 360 QNVEVVYDLEKERIGFQPMDC 380
+ EV+YD+ +GF+ C
Sbjct: 443 RTFEVLYDVGSSTLGFRSGAC 463
>gi|242069057|ref|XP_002449805.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
gi|241935648|gb|EES08793.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
Length = 430
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 109/386 (28%), Positives = 169/386 (43%), Gaps = 69/386 (17%)
Query: 7 DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDNP 66
DTGSDLTW C C D + + SSS S C+S+ CL I SS
Sbjct: 101 DTGSDLTWTQCKPCKL-CFGQD-------TPIYDTTTSSSFSPLPCSSATCLPIWSSR-- 150
Query: 67 FDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIPK 126
CS S CR + Y Y +G + + G S G I
Sbjct: 151 --------CST----PSATCR----YRYAYDDGAY--------SPECAGISVGGI----- 181
Query: 127 FCFGCV---GSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSPLV 183
FGC G G G GRG+LS+ +QLG + FS+C F + ++SSP+
Sbjct: 182 -AFGCGVDNGGLSYNSTGTVGLGRGSLSLVAQLGVGK--FSYCLTDFF---NTSLSSPVF 235
Query: 184 IGDVAISSKDN-------LQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
G +A + + +Q TP+++SP P+ YY+ LE I++G++ L + D
Sbjct: 236 FGSLAELAASSASADAAVVQSTPLVQSPYNPSRYYVSLEGISLGDARLPIPNGTFDLNDD 295
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDL-CYRVPCPNN 295
G+GG++VDSGT +T L E + ++ + + + V + D C+ P
Sbjct: 296 DGSGGMIVDSGTIFTILVETGFRVVVDHVAGVL-----GQPVVNASSLDRPCFPAPAAGV 350
Query: 296 TFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFG 355
D+ P + HF + L + N+ MS S+ CL + V G
Sbjct: 351 QELPDM-PDMVLHFAGGADMRLHRDNY---MSFNEEESSF-CLNIVGTESAS---GSVLG 402
Query: 356 SFQQQNVEVVYDLEKERIGFQPMDCA 381
+FQQQN+++++D+ ++ F P DC+
Sbjct: 403 NFQQQNIQMLFDITVGQLSFMPTDCS 428
>gi|222640709|gb|EEE68841.1| hypothetical protein OsJ_27628 [Oryza sativa Japonica Group]
Length = 375
Score = 99.0 bits (245), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 98/387 (25%), Positives = 160/387 (41%), Gaps = 77/387 (19%)
Query: 3 QVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHS 62
++ +DTGSDL W C LS + + L S +P+R+ + +R
Sbjct: 54 KLIVDTGSDLIWTQC-KLSSSTAAAARHGSPPL-SRTAPARTGAFTRT------------ 99
Query: 63 SDNPFDPCTMSGCSLSTLLKST---CCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPG 119
CT S ++ L T R S +G G L G L
Sbjct: 100 -------CTASAAAVGVLASETFTFGARRAVSLRLGFGCGALSAGSL------------- 139
Query: 120 IIREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNIS 179
+G+T GI G +LS+ +QL + FS+C F +
Sbjct: 140 ------------IGAT-----GILGLSPESLSLITQLKI--QRFSYCLTPFADKK----T 176
Query: 180 SPLVIGDVAISSKDN----LQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFD 235
SPL+ G +A S+ +Q T ++ +P+ YYY+ L I++G+ L VP +
Sbjct: 177 SPLLFGAMADLSRHKTTRPIQTTAIVSNPVETVYYYVPLVGISLGHKRLA-VPAASLAMR 235
Query: 236 SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNN 295
G GG +VDSG+T +L E + + + + + VE+ ++LC+ +P
Sbjct: 236 PDGGGGTIVDSGSTVAYLVEAAFEAVKEAVMDVVRLPVANRTVED---YELCFVLPRRTA 292
Query: 296 TFTDDLF--PSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGV 353
+ P + HF ++VLP+ N+F A + CL DG G S +
Sbjct: 293 AAAMEAVQVPPLVLHFDGGAAMVLPRDNYFQEPRA-----GLMCLAVGKTTDGS-GVS-I 345
Query: 354 FGSFQQQNVEVVYDLEKERIGFQPMDC 380
G+ QQQN+ V++D++ + F P C
Sbjct: 346 IGNVQQQNMHVLFDVQHHKFSFAPTQC 372
>gi|148907930|gb|ABR17085.1| unknown [Picea sitchensis]
Length = 498
Score = 99.0 bits (245), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 108/382 (28%), Positives = 164/382 (42%), Gaps = 64/382 (16%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGSD+ W+ C C +C Y + F+PS S+S S C S+ C + + D
Sbjct: 174 LDTGSDVAWIQCE----PCREC--YSQADPI--FNPSYSASFSTVGCDSAVCSQLDAYD- 224
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
C GC + +YG+G TG +TL +S +
Sbjct: 225 ----CHSGGCL---------------YEASYGDGSYSTGSFATETLTFGTTS------VA 259
Query: 126 KFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKG--FSHCFLAFKYANDPNIS 179
GC VG + G+ G G GALS P+Q+G Q G FS+C + + + S
Sbjct: 260 NVAIGCGHKNVG-LFIGAAGLLGLGAGALSFPNQIG-TQTGHTFSYCLVD----RESDSS 313
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFD-SQG 238
PL G ++ FTP+ K+P P +YY+ + AI++G + L +P + D + G
Sbjct: 314 GPLQFGPKSVPVGS--IFTPLEKNPHLPTFYYLSVTAISVGGALLDSIPPEVFRIDETSG 371
Query: 239 NGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFT 298
+GG ++DSGT T L Y + + PR V + FD CY + F
Sbjct: 372 HGGFIIDSGTVVTRLVTSAYDAVRDAFVAGTGQLPRTDAV---SIFDTCYDL--SGLQFV 426
Query: 299 DDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQ 358
P++ FHF N SL+LP N+ P ++ C F + G+ Q
Sbjct: 427 S--VPTVGFHFSNGASLILPAKNYLI----PMDTVGTFCFAFAPAASS----VSIMGNTQ 476
Query: 359 QQNVEVVYDLEKERIGFQPMDC 380
QQ++ V +D +GF C
Sbjct: 477 QQHIRVSFDSANSLVGFAFDQC 498
>gi|168064205|ref|XP_001784055.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664441|gb|EDQ51161.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 459
Score = 99.0 bits (245), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 104/392 (26%), Positives = 165/392 (42%), Gaps = 60/392 (15%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKL-MSNFSPSRSSSSSRDTCASSFCLNIHS 62
V++DTGSD+ WV C C +C N L +S F P +S+S + +C C +
Sbjct: 63 VHVDTGSDVAWVNC----VPCTNCKRASNVALPISIFDPEKSTSKTSISCTDEECYLASN 118
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTL---KVHGSSPG 119
S F+ +MS C STL YG+G G L D L +V +
Sbjct: 119 SKCSFN--SMS-CPYSTL---------------YGDGSSTAGYLINDVLSFNQVPSGNST 160
Query: 120 IIREIPKFCFGCVGSTYREPI--GIAGFGRGALSVPSQLGFLQKG---FSHCFLAFKYAN 174
+ FGC + + G+ GFG+ +S+PSQL F+HC
Sbjct: 161 ATSGTARLTFGCGSNQTGTWLTDGLVGFGQAEVSLPSQLSKQNVSVNIFAHCL-----QG 215
Query: 175 DPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREF 234
D S LVIG + + L +TP++ P + +E + IG S T V + F
Sbjct: 216 DNKGSGTLVIGHI---REPGLVYTPIV-----PKQSHYNVELLNIGVSG-TNVT-TPTAF 265
Query: 235 DSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPN 294
D +GG+++DSGTT T+L +P Y Q + + ++ + ++ C
Sbjct: 266 DLSNSGGVIMDSGTTLTYLVQPAYDQ----------FQAKVRDCMRSGVLPVAFQFFCT- 314
Query: 295 NTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVF 354
+ FP++T +F +++L ++ Y + SA +S Y +F
Sbjct: 315 ---IEGYFPNVTLYFAGGAAMLLSPSSYLYKEMLTTGLSAYCFSWLESTSVYGYLSYTIF 371
Query: 355 GSFQQQNVEVVYDLEKERIGFQPMDCASTASA 386
G ++ VVYD RIG++ DC S
Sbjct: 372 GDNVLKDQLVVYDNVNNRIGWKNFDCTKEISV 403
>gi|47777372|gb|AAT38006.1| unknow protein [Oryza sativa Japonica Group]
Length = 475
Score = 99.0 bits (245), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 103/392 (26%), Positives = 162/392 (41%), Gaps = 68/392 (17%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DTGSD+ W+ C C C Y + + F P RS S + C + C + S+
Sbjct: 137 MVLDTGSDVVWLQCA----PCRHC--YAQSGRV--FDPRRSRSYAAVDCVAPICRRLDSA 188
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
GC +++C + YG+G + G +TL +
Sbjct: 189 ----------GCDRR---RNSCL-----YQVAYGDGSVTAGDFASETLTFARGA-----R 225
Query: 124 IPKFCFGCVGSTYREPIGIAG-----FGRGALSVPSQLGF-LQKGFSHCFLAFKYANDPN 177
+ + GC E + IA GRG LS PSQ+ + FS+C + + P+
Sbjct: 226 VQRVAIGC--GHDNEGLFIAASGLLGLGRGRLSFPSQIARSFGRSFSYCLVDRTSSVRPS 283
Query: 178 --ISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLS-LREF 234
SS + G A+++ FTPM ++P +YY+ L ++G + + V S LR
Sbjct: 284 STRSSTVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSDLRLN 343
Query: 235 DSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTG------FDLCY 288
+ G GG+++DSGT+ T L P Y + RA V R FD CY
Sbjct: 344 PTTGRGGVILDSGTSVTRLARPVYEAVRDAF--------RAAAVGLRVSPGGFSLFDTCY 395
Query: 289 RVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDY 348
+ P+++ H S+ LP N+ P ++S C D G
Sbjct: 396 NLSGRRVV----KVPTVSMHLAGGASVALPPENYLI----PVDTSGTFCFAMAGTDGG-- 445
Query: 349 GPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
+ G+ QQQ VV+D + +R+GF P C
Sbjct: 446 --VSIIGNIQQQGFRVVFDGDAQRVGFVPKSC 475
>gi|255549236|ref|XP_002515672.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223545215|gb|EEF46724.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 492
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 109/404 (26%), Positives = 170/404 (42%), Gaps = 73/404 (18%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKL---MSNFSPSRSSSSSRDTCASSFCLNI 60
V +DTGSD+ WV C C +C R + L ++ ++ S S C FC +
Sbjct: 101 VQVDTGSDIMWVNC----IQCRECP--RTSSLGMELTLYNIKDSVSGKLVPCDEEFCYEV 154
Query: 61 HSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
+ +SGC T CP + YG+G G +D ++ S +
Sbjct: 155 NGG-------PLSGC--------TANMSCP-YLEIYGDGSSTAGYFVKDVVQYDRVSGDL 198
Query: 121 --IREIPKFCFGC-------VGSTYREPI-GIAGFGRGALSVPSQLGF---LQKGFSHCF 167
FGC +G T E + GI GFG+ S+ SQL ++K F+HC
Sbjct: 199 QTTSSNGSVIFGCGARQSGDLGPTSEEALDGILGFGKSNSSMISQLAATRKVKKIFAHCL 258
Query: 168 LAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPN--YYYIGLEAITIGNSSLT 225
+ N IG V + K N+ +P+ PN +Y + + A+ +G L
Sbjct: 259 ------DGINGGGIFAIGHV-VQPKVNM-------TPLIPNQPHYNVNMTAVQVGEDFLH 304
Query: 226 EVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFD 285
L EF++ G ++DSGTT +LPE Y L+S I+ P K R +
Sbjct: 305 ---LPTEEFEAGDRKGAIIDSGTTLAYLPEIVYEPLVS---KIISQQPDLKVHIVRDEYT 358
Query: 286 LCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQS--M 343
C++ DD FP++TFHF N+V L + + + + C+ +Q+ M
Sbjct: 359 -CFQYSGS----VDDGFPNVTFHFENSVFLKVHPHEYLFPF------EGLWCIGWQNSGM 407
Query: 344 DDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASAQ 387
D + G N V+YDLE + IG+ +C+S+ Q
Sbjct: 408 QSRDRRNMTLLGDLVLSNKLVLYDLENQAIGWTEYNCSSSIKVQ 451
>gi|53791672|dbj|BAD53242.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
Length = 504
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 101/397 (25%), Positives = 174/397 (43%), Gaps = 59/397 (14%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRN-NKLMSNFSPSRSSSSSRDTCASSFCLNIHS 62
V +DTGSD+ WV C C C N + F+P SS+SS+ C+
Sbjct: 106 VQIDTGSDILWVACS----PCTGCPSSSGLNIQLEFFNPDTSSTSSKIPCSD-------- 153
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTL---KVHGSSPG 119
D CT + + + +++ PC + +TYG+G +G DT+ V G+
Sbjct: 154 -----DRCTAALQTSEAVCQTSDNSPC-GYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQ- 206
Query: 120 IIREIPKFCFGCVGS-------TYREPIGIAGFGRGALSVPSQL---GFLQKGFSHCFLA 169
FGC S T R GI GFG+ LSV SQL G K FSHC
Sbjct: 207 TANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCL-- 264
Query: 170 FKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPL 229
N LV+G++ + L +TP++ P P +Y + LE+I + L P+
Sbjct: 265 ---KGSDNGGGILVLGEIV---EPGLVYTPLV--PSQP-HYNLNLESIVVNGQKL---PI 312
Query: 230 SLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYR 289
F + G +VDSGTT +L + Y ++ + + ++ P + + + + C+
Sbjct: 313 DSSLFTTSNTQGTIVDSGTTLAYLADGAYDPFVNAITAAVS--PSVRSLVSKG--NQCFV 368
Query: 290 VPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYG 349
++ D FP+++ +F+ V++ + N+ ++ N + + C+ +Q
Sbjct: 369 T----SSSVDSSFPTVSLYFMGGVAMTVKPENYLLQQASIDN-NVLWCIGWQRNQGQQI- 422
Query: 350 PSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASA 386
+ G ++ VYDL R+G+ DC+++ +
Sbjct: 423 --TILGDLVLKDKIFVYDLANMRMGWTDYDCSTSVNV 457
>gi|218189149|gb|EEC71576.1| hypothetical protein OsI_03949 [Oryza sativa Indica Group]
Length = 504
Score = 98.6 bits (244), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 101/397 (25%), Positives = 174/397 (43%), Gaps = 59/397 (14%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRN-NKLMSNFSPSRSSSSSRDTCASSFCLNIHS 62
V +DTGSD+ WV C C C N + F+P SS+SS+ C+
Sbjct: 106 VQIDTGSDILWVACS----PCTGCPSSSGLNIQLEFFNPDTSSTSSKIPCSD-------- 153
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTL---KVHGSSPG 119
D CT + + + +++ PC + +TYG+G +G DT+ V G+
Sbjct: 154 -----DRCTAALQTSEAVCQTSDNSPC-GYTFTYGDGSGTSGYYVSDTMYFDSVMGNEQ- 206
Query: 120 IIREIPKFCFGCVGS-------TYREPIGIAGFGRGALSVPSQL---GFLQKGFSHCFLA 169
FGC S T R GI GFG+ LSV SQL G K FSHC
Sbjct: 207 TANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCL-- 264
Query: 170 FKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPL 229
N LV+G++ + L +TP++ P P +Y + LE+I + L P+
Sbjct: 265 ---KGSDNGGGILVLGEIV---EPGLVYTPLV--PSQP-HYNLNLESIVVNGQKL---PI 312
Query: 230 SLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYR 289
F + G +VDSGTT +L + Y ++ + + ++ P + + + + C+
Sbjct: 313 DSSLFTTSNTQGTIVDSGTTLAYLADGAYDPFVNAITAAVS--PSVRSLVSKG--NQCFV 368
Query: 290 VPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYG 349
++ D FP+++ +F+ V++ + N+ ++ N + + C+ +Q
Sbjct: 369 T----SSSVDSSFPTVSLYFMGGVAMTVKPENYLLQQASIDN-NVLWCIGWQRNQGQQI- 422
Query: 350 PSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASA 386
+ G ++ VYDL R+G+ DC+++ +
Sbjct: 423 --TILGDLVLKDKIFVYDLANMRMGWTDYDCSTSVNV 457
>gi|77555282|gb|ABA98078.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
Length = 409
Score = 98.6 bits (244), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 74/283 (26%), Positives = 125/283 (44%), Gaps = 26/283 (9%)
Query: 97 GEGGLVTGILTRDTLKVHGSSPGIIREIPKFCFGCVGSTYRE---PIGIAGFGRGALSVP 153
G +G L DT ++ +P FGC ++Y + G+ G GRG LS+
Sbjct: 124 GSAANTSGYLATDTFTFGATA------VPGVVFGCSDASYGDFAGASGVIGIGRGNLSLI 177
Query: 154 SQLGFLQKGFSHCFLAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIG 213
SQL F + FS+ LA + +D + S + GD A+ + TP+L S +YP++YY+
Sbjct: 178 SQLQFGK--FSYQLLAPEATDDGSADSVIRFGDDAVPKTKRGRSTPLLSSTLYPDFYYVN 235
Query: 214 LEAITIGNSSLTEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYP 273
L + + + L +P + + G GG+++ S T T+L + Y + + + S I
Sbjct: 236 LTGVRVDGNRLDAIPAGTFDLRANGTGGVILSSTTPVTYLEQAAYDVVRAAVASRIGL-- 293
Query: 274 RAKEVEERTGFDLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSS 333
A DLCY ++ P +T F + L N+FY N +
Sbjct: 294 PAVNGSAALELDLCYNA----SSMAKVKVPKLTLVFDGGADMDLSAANYFYI----DNDT 345
Query: 334 AVKCLLFQSMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQ 376
++CL G V G+ Q ++YD++ R+ F+
Sbjct: 346 GLECLTMLPSQGGS-----VLGTLLQTGTNMIYDVDAGRLTFE 383
>gi|357118398|ref|XP_003560942.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 478
Score = 98.6 bits (244), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 112/405 (27%), Positives = 165/405 (40%), Gaps = 69/405 (17%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + +DTGSDL W C C C + F S ++ C+ C
Sbjct: 114 VALTLDTGSDLVWTQCA-----CHVC----FAQPFPTFDALASQTTLAVPCSDPICT--- 161
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKV------HG 115
S P CT + +TC + Y Y + + +G + DT +G
Sbjct: 162 SGKYPLSGCTFN--------DNTCF-----YLYDYADKSITSGRIVEDTFTFRSPQGNNG 208
Query: 116 SSPGIIREIPKFCFGCVGSTYREPI------GIAGFGRGALSVPSQLGFLQKGFSHCFLA 169
S +P FGC Y + I GIAGF RG +S+PSQL + FSHCF A
Sbjct: 209 SKAHAGVAVPNVRFGC--GQYNKGIFKSNESGIAGFSRGPMSLPSQLKVAR--FSHCFTA 264
Query: 170 FKYANDPNISSPLVIG------DVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSS 223
A +SP+ +G ++ + +Q TP S + YY+ L+ IT+G
Sbjct: 265 IADAR----TSPVFLGGAPGPDNLGAHATGPVQSTPFANS--NGSLYYLTLKGITVGK-- 316
Query: 224 LTEVPLSLREF----DSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVE 279
T +PL+ F G+GG ++DSGT LP P Y L + + + P A E
Sbjct: 317 -TRLPLNALAFAGKGTGSGSGGTIIDSGTGIRTLPGPMYRSLRAAFVARVKL-PVANESA 374
Query: 280 ERTGFDLCY---RVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVK 336
LC+ R P + H + LP+ ++ + + S
Sbjct: 375 ADAESTLCFEAARSASLPPEAPAPALPKVVLH-VAGADWDLPRESYVLDLLEDEDGSGSG 433
Query: 337 -CLLFQSMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
CL+ S D D + G+FQQQN+ V YDLEK ++ F P C
Sbjct: 434 LCLVMNSAGDSDLT---IIGNFQQQNMHVAYDLEKNKLVFVPARC 475
>gi|356546372|ref|XP_003541600.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 433
Score = 98.6 bits (244), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 112/391 (28%), Positives = 175/391 (44%), Gaps = 72/391 (18%)
Query: 2 IQVY--MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLN 59
+QV+ +DTGSD+ W+ C C C + + F S+S + C S+ C +
Sbjct: 100 LQVFGILDTGSDIIWLQCQ----PCKKCYE----QTTPIFDSSKSQTYKTLPCPSNTCQS 151
Query: 60 IHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPG 119
+ + CS R ++ Y +G G L+ +TL + GS+ G
Sbjct: 152 VQGTF----------CS---------SRKHCLYSIHYVDGSQSLGDLSVETLTL-GSTNG 191
Query: 120 IIREIPKFCFGC-----VGSTYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYA 173
+ P GC +G + GI G GRG +S+ +QL G FS+C +
Sbjct: 192 SPVQFPGTVIGCGRYNAIGIEEKNS-GIVGLGRGPMSLITQLSPSTGGKFSYCLVP---- 246
Query: 174 NDPNISSPLVIGDVAISSKDNLQFTPML-KSPMYPNYYYIGLEAITIGNSSLTEVPLSLR 232
SS L G+ A+ S TP+ K+ + +Y++ LEA ++G + +
Sbjct: 247 GLSTASSKLNFGNAAVVSGRGTVSTPLFSKNGLV--FYFLTLEAFSVGRNRI-------- 296
Query: 233 EFDSQGNGG---LLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYR 289
EF S G+GG +++DSGTT T LP YS+L + + T+ R ++ + G LCY+
Sbjct: 297 EFGSPGSGGKGNIIIDSGTTLTALPNGVYSKLEAAVAKTVI-LQRVRDPNQVLG--LCYK 353
Query: 290 VPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYG 349
V P+ D P IT HF + + L N F + + V C FQ + G
Sbjct: 354 V-TPDK--LDASVPVITAHF-SGADVTLNAINTFVQV-----ADDVVCFAFQPTETG--- 401
Query: 350 PSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
VFG+ QQN+ V YDL+ + F+ DC
Sbjct: 402 --AVFGNLAQQNLLVGYDLQMNTVSFKHTDC 430
>gi|168042409|ref|XP_001773681.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675069|gb|EDQ61569.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 399
Score = 98.6 bits (244), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 107/394 (27%), Positives = 176/394 (44%), Gaps = 66/394 (16%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V +DTGSD+TW+ C + C+ + KL + + PSRSS+ +C S C S
Sbjct: 52 VQVDTGSDVTWLNCAPCT-SCVTETQLPSIKL-TTYDPSRSSTDGALSCRDSNCGAALGS 109
Query: 64 DNPFDPCTMSG-CSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTL---KVHGSSPG 119
+ CT +G C+ ST TYG+G G +D + ++H ++
Sbjct: 110 NEV--SCTSAGYCAYST---------------TYGDGSSTQGYFIQDVMTFQEIHNNTQ- 151
Query: 120 IIREIPKFCFGCVGSTY--------REPIGIAGFGRGALSVPSQLGFLQK---GFSHCFL 168
+ FGC G+T R G+ GFG+ A+S+PSQL + K F+HC
Sbjct: 152 -VNGTASVYFGC-GTTQSGNLLMSSRALDGLIGFGQAAVSIPSQLASMGKVGNRFAHCL- 208
Query: 169 AFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVP 228
D +VIG V S+ N+ +TP++ N+Y +G++ I + ++T P
Sbjct: 209 ----QGDNQGGGTIVIGSV---SEPNISYTPIVSR----NHYAVGMQNIAVNGRNVT-TP 256
Query: 229 LSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCY 288
S + S GG+++DSGTT +L +P Y+Q ++ + + E + C
Sbjct: 257 ASF-DTTSTSAGGVIMDSGTTLAYLVDPAYTQFVNAVSTF--------ESSMFSSHSQCL 307
Query: 289 RVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAP-SNSSAVKCLLFQ-SMDDG 346
++ + D FP++ F + L N+ Y S P N A C+ +Q S
Sbjct: 308 QLAWC--SLQAD-FPTVKLFFDAGAVMNLTPRNYLY--SQPLQNGQAAYCMGWQKSTTKA 362
Query: 347 DYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
Y + G ++ VVYD + +G++ DC
Sbjct: 363 GYLSYSILGDIVLKDHLVVYDNDNRVVGWKSFDC 396
>gi|449466304|ref|XP_004150866.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449485213|ref|XP_004157102.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 473
Score = 98.6 bits (244), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 103/387 (26%), Positives = 159/387 (41%), Gaps = 65/387 (16%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + DTGSDLTW C C Y N+ F PS+S++ S +C+S C +
Sbjct: 144 LSLIFDTGSDLTWT-------QCQPCARYCYNQKDPVFVPSQSTTYSNISCSSPDCSQLE 196
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
S GCS + R C + YG+ G ++TL + +
Sbjct: 197 SGTG-----NQPGCSAA--------RACI-YGIQYGDQSFSVGYFAKETLTLTSTD---- 238
Query: 122 REIPKFCFGCVGSTYR----EPIGIAGFGRGALSVPSQLG-FLQKGFSHCFLAFKYANDP 176
I F FGC G R G+ G G+ +S+ Q + FS+C P
Sbjct: 239 -VIENFLFGC-GQNNRGLFGSAAGLIGLGQDKISIVKQTAQKYGQVFSYCL--------P 288
Query: 177 NISSPL-VIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFD 235
SS + L++TP+ K+ N+Y + + + +G T++P+S F
Sbjct: 289 KTSSSTGYLTFGGGGGGGALKYTPITKAHGVANFYGVDIVGMKVGG---TQIPISSSVFS 345
Query: 236 SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNN 295
+ G ++DSGT T LP YS L S + + YP+A E+ D CY + +
Sbjct: 346 TSG---AIIDSGTVITRLPPDAYSALKSAFEKGMAKYPKAPELSI---LDTCYDL----S 395
Query: 296 TFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPS--GV 353
++ P + F F L L Y S++ CL F D PS +
Sbjct: 396 KYSTIQIPKVGFVFKGGEELDLDGIGIMYGA-----STSQVCLAFAGNQD----PSTVAI 446
Query: 354 FGSFQQQNVEVVYDLEKERIGFQPMDC 380
G+ QQ+ ++VVYD+ +IGF C
Sbjct: 447 IGNVQQKTLQVVYDVGGGKIGFGYNGC 473
>gi|361066669|gb|AEW07646.1| Pinus taeda anonymous locus 0_7863_01 genomic sequence
Length = 136
Score = 98.6 bits (244), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 58/140 (41%), Positives = 80/140 (57%), Gaps = 12/140 (8%)
Query: 217 ITIGNSSLTEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAK 276
ITIG L ++P SL FD +GNGGL+VDSGTT+T LPE Y ++L L+S I Y R+
Sbjct: 1 ITIGGQRL-KLPSSLTTFDKEGNGGLIVDSGTTFTMLPESLYREVLKKLKSAIR-YSRSV 58
Query: 277 EVEERTGFDLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMS--------A 328
E G DLCY +P +F +FP+ + HF +N ++ LP N+ MS +
Sbjct: 59 RYEAALGLDLCYELPSEVGSF--PVFPTFSLHFKDNATIRLPAENYMSMMSDTYDATRPS 116
Query: 329 PSNSSAVKCLLFQSMDDGDY 348
S ++AV CL+ S D Y
Sbjct: 117 TSATAAVGCLIILSSGDEVY 136
>gi|222619345|gb|EEE55477.1| hypothetical protein OsJ_03658 [Oryza sativa Japonica Group]
Length = 530
Score = 98.2 bits (243), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 101/397 (25%), Positives = 174/397 (43%), Gaps = 59/397 (14%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRN-NKLMSNFSPSRSSSSSRDTCASSFCLNIHS 62
V +DTGSD+ WV C C C N + F+P SS+SS+ C+
Sbjct: 132 VQIDTGSDILWVACS----PCTGCPSSSGLNIQLEFFNPDTSSTSSKIPCSD-------- 179
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTL---KVHGSSPG 119
D CT + + + +++ PC + +TYG+G +G DT+ V G+
Sbjct: 180 -----DRCTAALQTSEAVCQTSDNSPC-GYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQ- 232
Query: 120 IIREIPKFCFGCVGS-------TYREPIGIAGFGRGALSVPSQL---GFLQKGFSHCFLA 169
FGC S T R GI GFG+ LSV SQL G K FSHC
Sbjct: 233 TANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCL-- 290
Query: 170 FKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPL 229
N LV+G++ + L +TP++ P P +Y + LE+I + L P+
Sbjct: 291 ---KGSDNGGGILVLGEIV---EPGLVYTPLV--PSQP-HYNLNLESIVVNGQKL---PI 338
Query: 230 SLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYR 289
F + G +VDSGTT +L + Y ++ + + ++ P + + + + C+
Sbjct: 339 DSSLFTTSNTQGTIVDSGTTLAYLADGAYDPFVNAITAAVS--PSVRSLVSKG--NQCFV 394
Query: 290 VPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYG 349
++ D FP+++ +F+ V++ + N+ ++ N + + C+ +Q
Sbjct: 395 T----SSSVDSSFPTVSLYFMGGVAMTVKPENYLLQQASIDN-NVLWCIGWQRNQGQQI- 448
Query: 350 PSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASA 386
+ G ++ VYDL R+G+ DC+++ +
Sbjct: 449 --TILGDLVLKDKIFVYDLANMRMGWTDYDCSTSVNV 483
>gi|255576511|ref|XP_002529147.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223531426|gb|EEF33260.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 479
Score = 98.2 bits (243), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 102/382 (26%), Positives = 156/382 (40%), Gaps = 64/382 (16%)
Query: 4 VYM--DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
VYM DTGSD+ W+ C C DC Y + F P+ S+S S +C + C ++
Sbjct: 157 VYMVLDTGSDVNWIQCA----PCADC--YHQADPI--FEPASSTSYSPLSCDTKQCQSLD 208
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
S+ C +T L + +YG+G G +T+ + +S
Sbjct: 209 VSE----------CRNNTCL----------YEVSYGDGSYTVGDFVTETITLGSAS---- 244
Query: 122 REIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNI 178
+ GC + + G+ G G G LS PSQ+ FS+C + D +
Sbjct: 245 --VDNVAIGCGHNNEGLFIGAAGLLGLGGGKLSFPSQIN--ASSFSYCLVD----RDSDS 296
Query: 179 SSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQG 238
+S L + P+L++ +YY+G+ +++G L +P S+ E D G
Sbjct: 297 ASTLEFNSALLPHAIT---APLLRNRELDTFYYVGMTGLSVGGE-LLSIPESMFEMDESG 352
Query: 239 NGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFT 298
NGG+++DSGT T L Y+ L P EV FD CY + + T
Sbjct: 353 NGGIIIDSGTAVTRLQTAAYNALRDAFVKGTKDLPVTSEV---ALFDTCYDL----SRKT 405
Query: 299 DDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQ 358
P++TFH L LP N+ P +S C F + G+ Q
Sbjct: 406 SVEVPTVTFHLAGGKVLPLPATNYLI----PVDSDGTFCFAFAPTSSA----LSIIGNVQ 457
Query: 359 QQNVEVVYDLEKERIGFQPMDC 380
QQ V +DL +GF+P C
Sbjct: 458 QQGTRVGFDLANSLVGFEPRQC 479
>gi|297842769|ref|XP_002889266.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297335107|gb|EFH65525.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 489
Score = 98.2 bits (243), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 97/388 (25%), Positives = 169/388 (43%), Gaps = 54/388 (13%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + +DTGSDLTWV C C C N+ + PS SSS C SS C ++
Sbjct: 149 MSLIVDTGSDLTWVQCQ----PCRSC----YNQQGPLYDPSVSSSYKTVFCNSSTCQDLV 200
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
++ PC + ++K+TC + +YG+G G L +++ V G +
Sbjct: 201 AATGNSGPCG----GFNGVVKTTC-----EYVVSYGDGSYTRGDLASESI-VLGDT---- 246
Query: 122 REIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQ-LGFLQKGFSHCFLAFKYANDPN 177
++ FGC + + G+ G GR ++S+ SQ L FS+C + +
Sbjct: 247 -KLENLVFGCGRNNKGLFGGASGLMGLGRSSVSLVSQTLKTFNGVFSYCLPSL----EDG 301
Query: 178 ISSPLVIGD--VAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFD 235
S L G+ + ++ +TP++++P ++Y + L +IG L +
Sbjct: 302 ASGTLSFGNDFSVYKNSTSVFYTPLVQNPQLRSFYILNLTGASIGGVELKTLSFGR---- 357
Query: 236 SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNN 295
G+L+DSGT T LP Y + + + +P A + D C+ +
Sbjct: 358 -----GILIDSGTVITRLPPSIYKAVKTEFLKQFSGFPSAPGY---SILDTCFNL----T 405
Query: 296 TFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFG 355
++ D P+I F N L + FY + +++ CL S+ + G+ G
Sbjct: 406 SYEDISIPTIKMIFEGNAELEVDVTGVFYFVKP---DASLVCLALASLSYEN--EVGIIG 460
Query: 356 SFQQQNVEVVYDLEKERIGFQPMDCAST 383
++QQ+N V+YD +ER+G +C T
Sbjct: 461 NYQQKNQRVIYDTTQERLGIAGENCMPT 488
>gi|255566008|ref|XP_002523992.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536719|gb|EEF38360.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 98.2 bits (243), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 112/389 (28%), Positives = 164/389 (42%), Gaps = 70/389 (17%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ V + TGSDL W+PC + +CD + F P SS+ C S C +
Sbjct: 111 LLVNVATGSDLVWIPCLSFKPCTHNCD-------LRFFDPMESSTYKNVPCDSYRCQITN 163
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRP-----CPSFAYTYGEGGLVTGILTRDTLKVHGS 116
++ F C S C P CP G L DTL ++ S
Sbjct: 164 AATCQFSDCFYS------------CDPRHQDSCPD------------GDLAMDTLTLN-S 198
Query: 117 SPGIIREIPKFCFGC---VGSTYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKY 172
+ G +P F C +G Y +GI G G G+LS+ +++ L G FSHC + +
Sbjct: 199 TTGKSFMLPNTGFICGNRIGGDY-PGVGILGLGHGSLSLLNRISHLIDGKFSHCIVPYS- 256
Query: 173 ANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLR 232
N +S L GD A+ S + F+ L P Y + I++GN S++ +
Sbjct: 257 ---SNQTSKLSFGDKAVVSGSAM-FSTRLDMTGGPYSYTLSFYGISVGNKSISAGGIGSD 312
Query: 233 EFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPC 292
+ GL +DSGT +T+ PE FYSQL ++ I P + R LCYR
Sbjct: 313 YY----MNGLGMDSGTMFTYFPEYFYSQLEYDVRYAIQQEPLYPDPTRR--LRLCYR--- 363
Query: 293 PNNTFTDDLF-PSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPS 351
++ D P+IT HF S+ L N F M + + CL F +
Sbjct: 364 ----YSPDFSPPTITMHF-EGGSVELSSSNSFIRM-----TEDIVCLAFATSSSEQ---D 410
Query: 352 GVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
VFG +QQ N+ + YDL+ + F DC
Sbjct: 411 AVFGYWQQTNLLIGYDLDAGFLSFLKTDC 439
>gi|148907478|gb|ABR16870.1| unknown [Picea sitchensis]
Length = 242
Score = 98.2 bits (243), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 54/126 (42%), Positives = 73/126 (57%), Gaps = 9/126 (7%)
Query: 2 IQVYMDTGSDLTWVPCG----NLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFC 57
++V+MDTGSDL WVPC SF+C+ C+D + FS +S+SS C+S C
Sbjct: 106 LEVFMDTGSDLVWVPCSANSSKPSFECIMCEDLD----IPTFSAFQSNSSRPAVCSSDSC 161
Query: 58 LNIHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSS 117
IH+SDNP D CTM+GC ++ C PCP+F Y YG+G L L RD L VH +
Sbjct: 162 SAIHNSDNPKDLCTMAGCPFESIDIDPCLAPCPAFYYAYGDGSL-RAELMRDRLSVHLAK 220
Query: 118 PGIIRE 123
G ++
Sbjct: 221 GGAKKK 226
>gi|225465839|ref|XP_002264668.1| PREDICTED: aspartic proteinase nepenthesin-1-like isoform 2 [Vitis
vinifera]
Length = 451
Score = 98.2 bits (243), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 100/373 (26%), Positives = 155/373 (41%), Gaps = 50/373 (13%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
MDT SD+ W+PC C+ C N SP+ ++ S C ++ C + +
Sbjct: 118 MDTSSDVAWIPCNG----CLGCSSTLFN------SPASTTYKSLG-CQAAQCKQVLHLLS 166
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
P S S + K TC SF TYG L L++DT+ + + +P
Sbjct: 167 PLL------TSPSVVPKPTCGGGVCSFNLTYGGSSLAAN-LSQDTITLATDA------VP 213
Query: 126 KFCFGCV----GSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSP 181
+ FGC+ G + + + Q FS+C +FK N S
Sbjct: 214 GYSFGCIQKATGGSLPAQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLN---FSGS 270
Query: 182 LVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGG 241
L +G V +++TP+LK+P P+ Y++ L A+ +G + P S F+ G
Sbjct: 271 LRLGPVG--QPKRIKYTPLLKNPRRPSLYFVNLMAVRVGRRVVDVPPGSF-TFNPSTGAG 327
Query: 242 LLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDL 301
+ DSGT +T L P Y + ++ + R V GFD CY VP
Sbjct: 328 TIFDSGTVFTRLVTPAYIAVRDAFRNRVG---RNLTVTSLGGFDTCYTVPI--------A 376
Query: 302 FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQN 361
P+ITF F +++ LP N +A S + CL + D V + QQQN
Sbjct: 377 APTITFMF-TGMNVTLPPDNLLIHSTAGSTT----CLAMAAAPDNVNSVLNVIANLQQQN 431
Query: 362 VEVVYDLEKERIG 374
++YD+ R+G
Sbjct: 432 HRLLYDVPNSRLG 444
>gi|116831531|gb|ABK28718.1| unknown [Arabidopsis thaliana]
Length = 438
Score = 97.8 bits (242), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 109/387 (28%), Positives = 169/387 (43%), Gaps = 61/387 (15%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
I DTGSDL W C C DC ++ F P SS+ +C+SS C +
Sbjct: 103 IMAIADTGSDLLWTQCA----PCDDC----YTQVDPLFDPKTSSTYKDVSCSSSQCTALE 154
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
+ + CS + +TC S++ +YG+ G + DTL + GSS
Sbjct: 155 N---------QASCSTN---DNTC-----SYSLSYGDNSYTKGNIAVDTLTL-GSSDTRP 196
Query: 122 REIPKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDP 176
++ GC G+ ++ GI G G G +S+ QLG G FS+C + D
Sbjct: 197 MQLKNIIIGCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVPLTSKKDQ 256
Query: 177 NISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
+S + G AI S + TP++ +YY+ L++I++G+ + +
Sbjct: 257 --TSKINFGTNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQYSGSDSESSEG 314
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNT 296
+++DSGTT T LP FYS+L + S+I K+ + ++G LCY
Sbjct: 315 N----IIIDSGTTLTLLPTEFYSELEDAVASSID---AEKKQDPQSGLSLCYSA------ 361
Query: 297 FTDDL-FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPS-GVF 354
T DL P IT HF + + L N F + S + C F+ PS ++
Sbjct: 362 -TGDLKVPVITMHF-DGADVKLDSSNAFVQV-----SEDLVCFAFRG------SPSFSIY 408
Query: 355 GSFQQQNVEVVYDLEKERIGFQPMDCA 381
G+ Q N V YD + + F+P DCA
Sbjct: 409 GNVAQMNFLVGYDTVSKTVSFKPTDCA 435
>gi|15242803|ref|NP_198319.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75130158|sp|Q6XBF8.1|CDR1_ARATH RecName: Full=Aspartic proteinase CDR1; AltName: Full=Protein
CONSTITUTIVE DISEASE RESISTANCE 1; Flags: Precursor
gi|37935737|gb|AAP72988.1| CDR1 [Arabidopsis thaliana]
gi|91806924|gb|ABE66189.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|109946613|gb|ABG48485.1| At5g33340 [Arabidopsis thaliana]
gi|332006513|gb|AED93896.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 437
Score = 97.8 bits (242), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 110/387 (28%), Positives = 169/387 (43%), Gaps = 61/387 (15%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
I DTGSDL W C C DC ++ F P SS+ +C+SS C +
Sbjct: 103 IMAIADTGSDLLWTQCA----PCDDC----YTQVDPLFDPKTSSTYKDVSCSSSQCTALE 154
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
+ + CS + +TC S++ +YG+ G + DTL + GSS
Sbjct: 155 N---------QASCSTN---DNTC-----SYSLSYGDNSYTKGNIAVDTLTL-GSSDTRP 196
Query: 122 REIPKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDP 176
++ GC G+ ++ GI G G G +S+ QLG G FS+C + D
Sbjct: 197 MQLKNIIIGCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVPLTSKKDQ 256
Query: 177 NISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
+S + G AI S + TP++ +YY+ L++I++G+ +
Sbjct: 257 --TSKINFGTNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQYSGSDS----E 310
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNT 296
G +++DSGTT T LP FYS+L + S+I K+ + ++G LCY
Sbjct: 311 SSEGNIIIDSGTTLTLLPTEFYSELEDAVASSID---AEKKQDPQSGLSLCYSA------ 361
Query: 297 FTDDL-FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPS-GVF 354
T DL P IT HF + + L N F + S + C F+ PS ++
Sbjct: 362 -TGDLKVPVITMHF-DGADVKLDSSNAFVQV-----SEDLVCFAFRG------SPSFSIY 408
Query: 355 GSFQQQNVEVVYDLEKERIGFQPMDCA 381
G+ Q N V YD + + F+P DCA
Sbjct: 409 GNVAQMNFLVGYDTVSKTVSFKPTDCA 435
>gi|242045118|ref|XP_002460430.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
gi|241923807|gb|EER96951.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
Length = 488
Score = 97.8 bits (242), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 113/388 (29%), Positives = 166/388 (42%), Gaps = 59/388 (15%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ V +DTGSD +WV C C DC + R+ F P+ SS+ S C + C
Sbjct: 152 LVVELDTGSDQSWVQCK----PCADCYEQRDPV----FDPTASSTYSAVPCGARECQE-- 201
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
++ S S S + CP + +Y + G L RDTL + S
Sbjct: 202 ----------LASSSSSRNCSSDNNKNCP-YEVSYDDDSHTVGDLARDTLTLSPSPSPSP 250
Query: 122 RE-IPKFCFGCVGS---TYREPIGIAGFGRGALSVPSQLGF-LQKGFSHCFLAFKYANDP 176
+ +P F FGC S T+ E G+ G G G S+PSQ+ FS+C + P
Sbjct: 251 ADTVPGFVFGCGHSNAGTFGEVDGLLGLGLGKASLPSQVAARYGAAFSYCL-----PSSP 305
Query: 177 NISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
+ + L G A ++ N QFT M+ P YY+ L I + ++ +VP S F +
Sbjct: 306 SAAGYLSFGGAA--ARANAQFTEMVTG-QDPTSYYLNLTGIVVAGRAI-KVPAS--AFAT 359
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNT 296
G ++DSGT ++ LP Y+ L S +S + Y R K FD CY
Sbjct: 360 AA--GTIIDSGTAFSRLPPSAYAALRSSFRSAMGRY-RYKRAPSSPIFDTCY-------D 409
Query: 297 FTDD---LFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGV 353
FT P++ F + ++ L Y N A CL F D G+
Sbjct: 410 FTGHETVRIPAVELVFADGATVHLHPSGVLYTW----NDVAQTCLAFVPNHD-----LGI 460
Query: 354 FGSFQQQNVEVVYDLEKERIGFQPMDCA 381
G+ QQ+ + V+YD+ +RIGF CA
Sbjct: 461 LGNTQQRTLAVIYDVGSQRIGFGRKGCA 488
>gi|242063796|ref|XP_002453187.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
gi|241933018|gb|EES06163.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
Length = 493
Score = 97.8 bits (242), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 104/380 (27%), Positives = 167/380 (43%), Gaps = 45/380 (11%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DTGSD++WV C C ++ F PS SS+ S +C+S+ C +
Sbjct: 156 MLIDTGSDISWV-------RCKPCWQQCRPQVDPLFDPSLSSTYSPFSCSSAACAQLFQE 208
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGL-VTGILTRDTLKVHGSSPGIIR 122
N +GCS S C+ + YG+G + TG + DTL + +S ++
Sbjct: 209 GN------ANGCS-----SSGQCQ----YIAMYGDGSVGTTGTYSSDTLALGSNSNTVV- 252
Query: 123 EIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSP- 181
+ KF FGC S I G L +Q Q + AF Y P SS
Sbjct: 253 -VSKFRFGC--SHAETGITGLTAGLMGLGGGAQSLVSQTAGTFGTTAFSYCLPPTPSSSG 309
Query: 182 -LVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNG 240
L +G SS ++ TPML+S P +Y + LEAI +G L+ +P ++ +
Sbjct: 310 FLTLGAAGTSSAGFVK-TPMLRSSQVPAFYGVRLEAIRVGGRQLS-IPTTVF------SA 361
Query: 241 GLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDD 300
G+++DSGT T LP YS L S ++ + YP A D C+ + ++
Sbjct: 362 GMIMDSGTVVTRLPPTAYSSLSSAFKAGMKQYPPAPSSAGGGFLDTCFDMSGQSSVSMPT 421
Query: 301 LFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQ 360
+ ++ F + L M +S++ CL F + D G +G+ G+ QQ+
Sbjct: 422 V--ALVFSGAGGAVVNLDASGILLQM----ETSSIFCLAFVATSDD--GSTGIIGNVQQR 473
Query: 361 NVEVVYDLEKERIGFQPMDC 380
+V+YD+ +GF+ C
Sbjct: 474 TFQVLYDVAGGAVGFKAGAC 493
>gi|125553531|gb|EAY99240.1| hypothetical protein OsI_21202 [Oryza sativa Indica Group]
Length = 475
Score = 97.8 bits (242), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 102/392 (26%), Positives = 162/392 (41%), Gaps = 68/392 (17%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DTGSD+ W+ C C C Y + + F P RS S + C + C + S+
Sbjct: 137 MVLDTGSDVVWLQCA----PCRHC--YAQSGRV--FDPRRSRSYAAVDCVAPICRRLDSA 188
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
GC +++C + YG+G + G +TL +
Sbjct: 189 ----------GCDRR---RNSCL-----YQVAYGDGSVTAGDFASETLTFARGA-----R 225
Query: 124 IPKFCFGCVGSTYREPIGIAG-----FGRGALSVPSQLGF-LQKGFSHCFLAFKYANDPN 177
+ + GC E + IA GRG LS P+Q+ + FS+C + + P+
Sbjct: 226 VQRVAIGC--GHDNEGLFIAASGLLGLGRGRLSFPTQIARSFGRSFSYCLVDRTSSVRPS 283
Query: 178 --ISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLS-LREF 234
SS + G A+++ FTPM ++P +YY+ L ++G + + V S LR
Sbjct: 284 STRSSTVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSDLRLN 343
Query: 235 DSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTG------FDLCY 288
+ G GG+++DSGT+ T L P Y + RA V R FD CY
Sbjct: 344 PTTGRGGVILDSGTSVTRLARPVYEAVRDAF--------RAAAVGLRVSPGGFSLFDTCY 395
Query: 289 RVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDY 348
+ P+++ H S+ LP N+ P ++S C D G
Sbjct: 396 NLSGRRVV----KVPTVSMHLAGGASVALPPENYLI----PVDTSGTFCFAMAGTDGG-- 445
Query: 349 GPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
+ G+ QQQ VV+D + +R+GF P C
Sbjct: 446 --VSIIGNIQQQGFRVVFDGDAQRVGFVPKSC 475
>gi|148906646|gb|ABR16474.1| unknown [Picea sitchensis]
Length = 538
Score = 97.8 bits (242), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 105/382 (27%), Positives = 161/382 (42%), Gaps = 60/382 (15%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DTGSD+ W+ C C C +++ F+PS S+S S C S+ C + +
Sbjct: 212 MVLDTGSDVVWIQCE----PCSKC----YSQVDPIFNPSLSASFSTLGCNSAVCSYLDAY 263
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
+ C GC + +YG+G G + L +S +R
Sbjct: 264 N-----CHGGGCL---------------YKVSYGDGSYTIGSFATEMLTFGTTS---VRN 300
Query: 124 IPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQ-KGFSHCFLAFKYANDPNISSPL 182
+ C + G+ G G G LS PSQLG + FS+C L +++ S L
Sbjct: 301 VAIGCGHDNAGLFVGAAGLLGLGAGLLSFPSQLGTQTGRAFSYC-LVDRFSES---SGTL 356
Query: 183 VIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFD-SQGNGG 241
G ++ L TP+L +P P +YY+ L +I++G + L VP + D + G GG
Sbjct: 357 EFGPESVPLGSIL--TPLLTNPSLPTFYYVPLISISVGGALLDSVPPDVFRIDETSGRGG 414
Query: 242 LLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYR---VPCPNNTFT 298
+VDSGT T L P Y + + P+A+ V + FD CY +P N
Sbjct: 415 FIVDSGTAVTRLQTPVYDAVRDAFVAGTRQLPKAEGV---SIFDTCYDLSGLPLVN---- 467
Query: 299 DDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQ 358
P++ FHF N SL+LP N+ P + C F + G+ Q
Sbjct: 468 ---VPTVVFHFSNGASLILPAKNYMI----PMDFMGTFCFAFAPATSD----LSIMGNIQ 516
Query: 359 QQNVEVVYDLEKERIGFQPMDC 380
QQ + V +D +GF C
Sbjct: 517 QQGIRVSFDTANSLVGFALRQC 538
>gi|356558300|ref|XP_003547445.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 447
Score = 97.8 bits (242), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 105/386 (27%), Positives = 161/386 (41%), Gaps = 68/386 (17%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V MDTGSD+ WV C C +CD N L F PS+SS+ S C +
Sbjct: 116 VVMDTGSDILWVMCT----PCTNCD----NDLGLLFDPSKSSTFS-PLCKT--------- 157
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
PC GC C P P F TY + +G RDT+ + G R
Sbjct: 158 -----PCDFEGCR---------CDPIP-FTVTYADNSTASGTFGRDTVVFETTDEGTSR- 201
Query: 124 IPKFCFGCVGSTYREPI----GIAGFGRGALSVPSQLGFLQKGFSHCF--LAFKYANDPN 177
I FGC + + GI G G S+ ++LG + FS+C LA Y N
Sbjct: 202 ISDVLFGCGHNIGHDTDPGHNGILGLNNGPDSLVTKLG---QKFSYCIGNLADPYYN--- 255
Query: 178 ISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
L++G+ A + F +Y +YY+ +E I++G L P + E
Sbjct: 256 -YHQLILGEGADLEGYSTPFE------VYNGFYYVTMEGISVGEKRLDIAPETF-EMKEN 307
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTF 297
GG+++D+G+T T L + + L +++ + + R +E+ Y +
Sbjct: 308 RAGGVIIDTGSTITFLVDSVHKLLSKEVRNLLGWSFRQATIEKSPWMQCFY------GSI 361
Query: 298 TDDL--FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPS-GVF 354
+ DL FP +TFHF + L L G+ F + + V C+ + + +
Sbjct: 362 SRDLVGFPVVTFHFSDGADLALDSGSFFNQL-----NDNVFCMTVGPVSSLNIKSKPSLI 416
Query: 355 GSFQQQNVEVVYDLEKERIGFQPMDC 380
G QQ+ V YDL + + FQ +DC
Sbjct: 417 GLLAQQSYNVGYDLVNQFVYFQRIDC 442
>gi|356495752|ref|XP_003516737.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 396
Score = 97.8 bits (242), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 110/392 (28%), Positives = 176/392 (44%), Gaps = 68/392 (17%)
Query: 2 IQVY--MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLN 59
+ VY +DTGSDL W C C C YR M F P RS++ + C S C
Sbjct: 61 VDVYGLVDTGSDLVWAQCT----PCQGC--YRQKSPM--FEPLRSNTYTPIPCDSEEC-- 110
Query: 60 IHSSDNPFDPCTMSGCSLSTLLKSTCC-RPCPSFAYTYGEGGLVTGILTRDTLKVHGS-- 116
++L +C + +++Y Y + + G+L R+T+ +
Sbjct: 111 ------------------NSLFGHSCSPQKLCAYSYAYADSSVTKGVLARETVTFSSTDG 152
Query: 117 SPGIIREIPKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFL--QKGFSHCFLAF 170
P ++ +I FGC G+ +GI G G G LS+ SQ G L K FS C + F
Sbjct: 153 EPVVVGDI---VFGCGHSNSGTFNENDMGIIGLGGGPLSLVSQFGNLYGSKRFSQCLVPF 209
Query: 171 KYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLS 230
DP+ + GD + S + + TP++ S Y + LE I++G+ T V +
Sbjct: 210 H--ADPHTLGTISFGDASDVSGEGVAATPLV-SEEGQTPYLVTLEGISVGD---TFVSFN 263
Query: 231 LREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRV 290
E S+GN +++DSGT T+LP+ FY +L+ L+ P + + G LCYR
Sbjct: 264 SSEMLSKGN--IMIDSGTPATYLPQEFYDRLVKELKVQSNMLPIDDDPD--LGTQLCYR- 318
Query: 291 PCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGP 350
+ T+ P + HF ++P P + V C DG+Y
Sbjct: 319 -----SETNLEGPILIAHFEGADVQLMP----IQTFIPPKD--GVFCFAMAGTTDGEY-- 365
Query: 351 SGVFGSFQQQNVEVVYDLEKERIGFQPMDCAS 382
+FG+F Q NV + +DL+++ + F+ DC++
Sbjct: 366 --IFGNFAQSNVLIGFDLDRKTVSFKATDCSN 395
>gi|224104765|ref|XP_002313558.1| predicted protein [Populus trichocarpa]
gi|222849966|gb|EEE87513.1| predicted protein [Populus trichocarpa]
Length = 468
Score = 97.8 bits (242), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 103/397 (25%), Positives = 176/397 (44%), Gaps = 60/397 (15%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNF-SPSRSSSSSRDTCASSFC-LNIH 61
V +DTGSD+ WV CG+ C C + NF P S ++S +C+ C L +
Sbjct: 67 VQIDTGSDVLWVSCGS----CNGCPVNSGLHIPLNFFDPGSSPTASLISCSDQRCSLGLQ 122
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLK--------- 112
SSD S CS ++ C + + YG+G +G D L
Sbjct: 123 SSD--------SVCSA----QNNLC----GYNFQYGDGSGTSGYYVSDLLHFDTVLGGSV 166
Query: 113 VHGSSPGIIREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQL---GFLQKGFSHCFLA 169
++ SS I+ G + + R GI GFG+ +SV SQL G + FSHC
Sbjct: 167 MNNSSAPIVFGCSALQTGDLTKSDRAVDGIFGFGQQDMSVVSQLASQGISPRAFSHCL-- 224
Query: 170 FKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPL 229
D + LV+G++ + N+ +TP++ P P +Y + +++I++ +L P
Sbjct: 225 ---KGDDSGGGILVLGEIV---EPNIVYTPLV--PSQP-HYNLNMQSISVNGQTLAIDPS 275
Query: 230 SLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYR 289
SQG ++DSGTT +L E Y +S + S ++ P + + + CY
Sbjct: 276 VFGTSSSQGT---IIDSGTTLAYLAEAAYDPFISAITSIVS--PSVRPYLSKG--NHCYL 328
Query: 290 VPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYG 349
+ N D+FP ++ +F S++L ++ S+ +A+ C+ FQ +
Sbjct: 329 ISSSIN----DIFPQVSLNFAGGASMILIPQDYLIQQSSI-GGAALWCIGFQKIQGQGI- 382
Query: 350 PSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASA 386
+ G ++ VYD+ +RIG+ DC+ + +
Sbjct: 383 --TILGDLVLKDKIFVYDIANQRIGWANYDCSMSVNV 417
>gi|357153697|ref|XP_003576537.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 474
Score = 97.8 bits (242), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 104/387 (26%), Positives = 170/387 (43%), Gaps = 58/387 (14%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V +DT S+LTWV C C C D ++ F PS S S + C SS C + +
Sbjct: 133 VVVDTASELTWVQCQ----PCESCHDQQDPL----FDPSSSPSYAAVPCNSSSCDALRVA 184
Query: 64 DNP-FDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIR 122
PC +P S+A +Y +G G+L RD L++ G +
Sbjct: 185 MAAGTSPCA----------DDNEQQPACSYALSYRDGSYSRGVLARDKLRLAG------Q 228
Query: 123 EIPKFCFGCVGSTYREPIG----IAGFGRGALSVPSQ-LGFLQKGFSHCFLAFKYANDPN 177
+I F FGC S P G + G GR +S+ SQ + FS+C +
Sbjct: 229 DIEGFVFGCGTSNQGAPFGGTSGLMGLGRSHVSLVSQTMDQFGGVFSYCL----PMRESG 284
Query: 178 ISSPLVIGDVAISSKDN--LQFTPMLK--SPMYPNYYYIGLEAITIGNSSLTEVPLSLRE 233
S LV+GD + + +++ + +T M+ P+ +Y++ L IT+G + S
Sbjct: 285 SSGSLVLGDDSSAYRNSTPIVYTAMVSDSGPLQGPFYFLNLTGITVGGQEVESPWFSA-- 342
Query: 234 FDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCP 293
G +++DSGT T L Y+ + + S + YP+A + D C+ +
Sbjct: 343 ------GRVIIDSGTIITTLVPSVYNAVRAEFLSQLAEYPQAPAF---SILDTCFNL--- 390
Query: 294 NNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGV 353
+ PS+ F F +V + + Y +S S++S V CL S+ +Y S +
Sbjct: 391 -TGLKEVQVPSLKFVFEGSVEVEVDSKGVLYFVS--SDASQV-CLALASLKS-EYDTS-I 444
Query: 354 FGSFQQQNVEVVYDLEKERIGFQPMDC 380
G++QQ+N+ V++D +IGF C
Sbjct: 445 IGNYQQKNLRVIFDTLGSQIGFAQETC 471
>gi|307136234|gb|ADN34070.1| aspartic proteinase nepenthesin-1 precursor [Cucumis melo subsp.
melo]
Length = 412
Score = 97.8 bits (242), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 110/403 (27%), Positives = 170/403 (42%), Gaps = 60/403 (14%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + +DTGS+L+W+ C ++ L S F+P SSS S C+S C
Sbjct: 53 VTMVLDTGSELSWLHC------------KKSPNLTSVFNPLSSSSYSPIPCSSPVC-RTR 99
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
+ D P +P T C L + +Y + + G L D ++ S+
Sbjct: 100 TRDLP-NPVT---CDPKKLCHAIV---------SYADASSLEGNLASDNFRIGSSA---- 142
Query: 122 REIPKFCFGCVGSTYR-------EPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYAN 174
+P FGC+ S + + G+ G RG+LS +QLG + FS+C +
Sbjct: 143 --LPGTLFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLGLPK--FSYCI------S 192
Query: 175 DPNISSPLVIGDVAISSKDNLQFTPMLK-SPMYPNY----YYIGLEAITIGNSSLTEVPL 229
+ S L+ GD +S NL +TP+++ S P + Y + L+ I +GN L +P
Sbjct: 193 GRDSSGVLLFGDSHLSWLGNLTYTPLVQISTPLPYFDRVAYTVQLDGIRVGNKIL-PLPK 251
Query: 230 SLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLS--ILQSTITYYPRAK-EVEERTGFDL 286
S+ D G G +VDSGT +T L P Y+ L + + Q+ P + DL
Sbjct: 252 SIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALRNEFLEQTKGVLAPLGDPNFVFQGAMDL 311
Query: 287 CYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDG 346
CYRVP P+++ F +V + + V CL F + D
Sbjct: 312 CYRVPAGGKL---PELPAVSLMFRGAEMVVGGEVLLYKVPGMMKGKEWVYCLTFGNSDLL 368
Query: 347 DYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASAQGL 389
+ V G QQNV + +DL K R+GF C GL
Sbjct: 369 GI-EAFVIGHHHQQNVWMEFDLVKSRVGFVETRCDLAGQRLGL 410
>gi|317106730|dbj|BAJ53226.1| JHL06P13.6 [Jatropha curcas]
Length = 445
Score = 97.8 bits (242), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 110/388 (28%), Positives = 173/388 (44%), Gaps = 60/388 (15%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ V DTGSDL WV C C +C Y+ + F+P +SS+ R C + +C N
Sbjct: 107 VLVIADTGSDLIWVQCQ----PCQEC--YKQKSPI--FNPKQSSTYRRVLCETRYC-NAL 157
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
+SD M CS K+ ++Y+YG+ G L + + GS+ I
Sbjct: 158 NSD-------MRACSAHGFFKAC------GYSYSYGDHSFTMGYLATERFII-GSTNNSI 203
Query: 122 REIPKFCFGCVGSTYREPIGIAGFGRGAL-SVPSQLGF-LQKGFSHCFLAFKYANDPNIS 179
+E+ C G + E G S+ SQLG + FS+C + ++ ++
Sbjct: 204 QELAFGCGNSNGGNFDEVGSGIVGLGGGSLSLISQLGTKIDNKFSYCLVPILEKSNFSLG 263
Query: 180 SPLVIGDVA-ISSKDNLQFTPML-KSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
+V GD + IS D TP++ K P +YY+ LEAI++GN L + R +
Sbjct: 264 K-IVFGDNSFISGSDTYVSTPLVSKEP--ETFYYLTLEAISVGNERLAYE--NSRNDGNV 318
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTG-FDLCYRVPCPNNT 296
G +++DSGTT T L Y++L +L+ + + V + G F +C+R
Sbjct: 319 EKGNIIIDSGTTLTFLDSKLYNKLELVLEKAV----EGERVSDPNGIFSICFR------- 367
Query: 297 FTDDL---FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGV 353
D + P IT HF + + L N F + A + LL +M + +
Sbjct: 368 --DKIGIELPIITVHF-TDADVELKPINTF--------AKAEEDLLCFTMIPSN--GIAI 414
Query: 354 FGSFQQQNVEVVYDLEKERIGFQPMDCA 381
FG+ Q N V YDL+K + F P DC+
Sbjct: 415 FGNLAQMNFLVGYDLDKNCVSFMPTDCS 442
>gi|125563764|gb|EAZ09144.1| hypothetical protein OsI_31414 [Oryza sativa Indica Group]
Length = 472
Score = 97.4 bits (241), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 110/387 (28%), Positives = 169/387 (43%), Gaps = 66/387 (17%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V +DT S+LTWV C C C D + F P+ S S + C SS C + +
Sbjct: 139 VIVDTASELTWVQCA----PCASCHDQQGPL----FDPASSPSYAVLPCNSSSCDALQVA 190
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
+ +P S+ +Y +G G+L D L + G
Sbjct: 191 T-----------GSAAGACGGGEQPSCSYTLSYRDGSYSQGVLAHDKLSLAG------EV 233
Query: 124 IPKFCFGCVGSTYREPIG----IAGFGRGALSVPSQ-LGFLQKGFSHCFLAFKYANDPNI 178
I F FGC G++ + P G + G GR LS+ SQ + FS+C L K +
Sbjct: 234 IDGFVFGC-GTSNQGPFGGTSGLMGLGRSQLSLISQTMDQFGGVFSYC-LPLK---ESES 288
Query: 179 SSPLVIGDVAISSKDN--LQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
S LV+GD +++ + +T M+ P+ +Y++ L ITIG +E +S
Sbjct: 289 SGSLVLGDDTSVYRNSTPIVYTTMVSDPVQGPFYFVNLTGITIGG----------QEVES 338
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGF---DLCYRVPCP 293
G ++VDSGT T L Y+ + + S YP+A GF D C+ +
Sbjct: 339 SA-GKVIVDSGTIITSLVPSVYNAVKAEFLSQFAEYPQAP------GFSILDTCFNL--- 388
Query: 294 NNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGV 353
F + PS+ F F NV + + Y +S S+SS V CL S+ +Y S +
Sbjct: 389 -TGFREVQIPSLKFVFEGNVEVEVDSSGVLYFVS--SDSSQV-CLALASLKS-EYETS-I 442
Query: 354 FGSFQQQNVEVVYDLEKERIGFQPMDC 380
G++QQ+N+ V++D +IGF C
Sbjct: 443 IGNYQQKNLRVIFDTLGSQIGFAQETC 469
>gi|302809855|ref|XP_002986620.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
gi|300145803|gb|EFJ12477.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
Length = 423
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 101/380 (26%), Positives = 164/380 (43%), Gaps = 52/380 (13%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + DTGSD+ W+ C C C Y + F+PS SS+ TC SS C
Sbjct: 94 VNMVADTGSDVLWLQC----LPCQSC--YGQTDPL--FNPSFSSTFQSITCGSSLC---- 141
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
LL C R + +YG+G G + +TL ++ +
Sbjct: 142 ----------------QQLLIRGCRRNQCLYQVSYGDGSFTVGEFSTETLSFGSNA---V 182
Query: 122 REIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNISS 180
+ C + G+ G G+G LS PSQ+G L FS+C + S
Sbjct: 183 NSVAIGCGHNNQGLFTGAAGLLGLGKGLLSFPSQVGQLYGSVFSYCL----PTRESTGSV 238
Query: 181 PLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNG 240
PL+ G+ A++S N QFT +L +P +YY+ + I +G +S++ SL S GNG
Sbjct: 239 PLIFGNQAVAS--NAQFTTLLTNPKLDTFYYVEMVGIKVGGTSVSIPAGSLSLDSSTGNG 296
Query: 241 GLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDD 300
G+++DSGT T L Y+ + ++ + AK + FD CY + ++
Sbjct: 297 GVILDSGTAVTRLVTSAYNPMRDAFRAGMP--SDAKMTSGFSLFDTCYDLSGRSSI---- 350
Query: 301 LFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQ 360
+ P+++F F ++ LP N + P ++S CL F + + G+ QQQ
Sbjct: 351 MLPAVSFVFNGGATMALPAQN----IMVPVDNSGTYCLAFAPNSEN----FSIIGNIQQQ 402
Query: 361 NVEVVYDLEKERIGFQPMDC 380
+ + +D R+G C
Sbjct: 403 SFRMSFDSTGNRVGIGANQC 422
>gi|242066142|ref|XP_002454360.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
gi|241934191|gb|EES07336.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
Length = 466
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 108/381 (28%), Positives = 169/381 (44%), Gaps = 66/381 (17%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DTGSD++WV C C C +++ F PS SS+ S +C+S+ C +
Sbjct: 148 MLIDTGSDVSWVQCK----PCSQC----HSQADPLFDPSSSSTYSPFSCSSAACAQLGQE 199
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
N GCS S+ C+ + TYG+G TG + DTL + ++
Sbjct: 200 GN--------GCS------SSQCQ----YTVTYGDGSSTTGTYSSDTLALGSNA------ 235
Query: 124 IPKFCFGC--VGSTYREPI-GIAGFGRGALSVPSQ-LGFLQKGFSHCFLAFKYANDPNIS 179
+ KF FGC V S + + G+ G G GA S+ SQ G FS+C A + S
Sbjct: 236 VRKFQFGCSNVESGFNDQTDGLMGLGGGAQSLVSQTAGTFGAAFSYCLPATSSS-----S 290
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
L +G + TPML+S P +Y + ++AI +G L+ +P S+ +
Sbjct: 291 GFLTLG----AGTSGFVKTPMLRSSQVPTFYGVRIQAIRVGGRQLS-IPTSVF------S 339
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTD 299
G ++DSGT T LP YS L S ++ + YP A D C+ ++
Sbjct: 340 AGTIMDSGTVLTRLPPTAYSALSSAFKAGMKQYPSAPPSGI---LDTCFDFSGQSSVS-- 394
Query: 300 DLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQ 359
P++ F + + M SNS + CL F + + D G+ G+ QQ
Sbjct: 395 --IPTVALVFSGGAVVDIASDG---IMLQTSNS--ILCLAFAA--NSDDSSLGIIGNVQQ 445
Query: 360 QNVEVVYDLEKERIGFQPMDC 380
+ EV+YD+ +GF+ C
Sbjct: 446 RTFEVLYDVGGGAVGFKAGAC 466
>gi|356528627|ref|XP_003532901.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 430
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 107/388 (27%), Positives = 166/388 (42%), Gaps = 76/388 (19%)
Query: 7 DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDNP 66
DTGSDL+W+ C C C Y + F P++SS+ C S
Sbjct: 106 DTGSDLSWLQCT----PCKTC--YPQEAPL--FDPTQSSTYVDVPCES------------ 145
Query: 67 FDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIR---E 123
PCT+ + S C + + YG G L DT+ SS G+ +
Sbjct: 146 -QPCTLFPQNQRECGSSKQCI----YLHQYGTDSFTIGRLGYDTISF--SSTGMGQGGAT 198
Query: 124 IPKFCFGCV---GSTYR---EPIGIAGFGRGALSVPSQLG-FLQKGFSHCFLAFKYANDP 176
PK FGC T++ + G G G G LS+ SQLG + FS+C + F +
Sbjct: 199 FPKSVFGCAFYSNFTFKISTKANGFVGLGPGPLSLASQLGDQIGHKFSYCMVPFSSTS-- 256
Query: 177 NISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
+ L G +A + + TP + +P YP+YY + LE IT+G +
Sbjct: 257 --TGKLKFGSMA--PTNEVVSTPFMINPSYPSYYVLNLEGITVGQK---------KVLTG 303
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEER--TGFDLCYRVPCPN 294
Q G +++DS THL + Y+ +S ++ I EV E T F+ C R P
Sbjct: 304 QIGGNIIIDSVPILTHLEQGIYTDFISSVKEAINV-----EVAEDAPTPFEYCVRNP--- 355
Query: 295 NTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVF 354
T+ FP FHF +VL N F A+ +N + + + + +F
Sbjct: 356 ---TNLNFPEFVFHF-TGADVVLGPKNMFIALD--NNLVCMTVVPSKGIS--------IF 401
Query: 355 GSFQQQNVEVVYDLEKERIGFQPMDCAS 382
G++ Q N +V YDL ++++ F P +C++
Sbjct: 402 GNWAQVNFQVEYDLGEKKVSFAPTNCST 429
>gi|115479237|ref|NP_001063212.1| Os09g0423500 [Oryza sativa Japonica Group]
gi|50725896|dbj|BAD33424.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|50726136|dbj|BAD33657.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|113631445|dbj|BAF25126.1| Os09g0423500 [Oryza sativa Japonica Group]
gi|215767614|dbj|BAG99842.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222641596|gb|EEE69728.1| hypothetical protein OsJ_29412 [Oryza sativa Japonica Group]
Length = 473
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 110/387 (28%), Positives = 169/387 (43%), Gaps = 66/387 (17%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V +DT S+LTWV C C C D + F P+ S S + C SS C + +
Sbjct: 140 VIVDTASELTWVQCA----PCASCHDQQGPL----FDPASSPSYAVLPCNSSSCDALQVA 191
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
+ +P S+ +Y +G G+L D L + G
Sbjct: 192 T-----------GSAAGACGGGEQPSCSYTLSYRDGSYSQGVLAHDKLSLAG------EV 234
Query: 124 IPKFCFGCVGSTYREPIG----IAGFGRGALSVPSQ-LGFLQKGFSHCFLAFKYANDPNI 178
I F FGC G++ + P G + G GR LS+ SQ + FS+C L K +
Sbjct: 235 IDGFVFGC-GTSNQGPFGGTSGLMGLGRSQLSLISQTMDQFGGVFSYC-LPLK---ESES 289
Query: 179 SSPLVIGDVAISSKDN--LQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
S LV+GD +++ + +T M+ P+ +Y++ L ITIG +E +S
Sbjct: 290 SGSLVLGDDTSVYRNSTPIVYTTMVSDPVQGPFYFVNLTGITIGG----------QEVES 339
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGF---DLCYRVPCP 293
G ++VDSGT T L Y+ + + S YP+A GF D C+ +
Sbjct: 340 SA-GKVIVDSGTIITSLVPSVYNAVKAEFLSQFAEYPQAP------GFSILDTCFNL--- 389
Query: 294 NNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGV 353
F + PS+ F F NV + + Y +S S+SS V CL S+ +Y S +
Sbjct: 390 -TGFREVQIPSLKFVFEGNVEVEVDSSGVLYFVS--SDSSQV-CLALASLKS-EYETS-I 443
Query: 354 FGSFQQQNVEVVYDLEKERIGFQPMDC 380
G++QQ+N+ V++D +IGF C
Sbjct: 444 IGNYQQKNLRVIFDTLGSQIGFAQETC 470
>gi|15217764|ref|NP_176663.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|5042418|gb|AAD38257.1|AC006193_13 Hypothetical Protein [Arabidopsis thaliana]
gi|332196174|gb|AEE34295.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 431
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 116/386 (30%), Positives = 175/386 (45%), Gaps = 61/386 (15%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
I DTGSDL W C C DC Y+ + F P SS+ + +C+SS C +
Sbjct: 99 ILAIADTGSDLIWTQCN----PCEDC--YQQTSPL--FDPKESSTYRKVSCSSSQCRALE 150
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
+ CS ++TC S+ TYG+ G + DT+ + GSS
Sbjct: 151 DA----------SCSTD---ENTC-----SYTITYGDNSYTKGDVAVDTVTM-GSSGRRP 191
Query: 122 REIPKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDP 176
+ GC G+ GI G G G+ S+ SQL G FS+C + F ++
Sbjct: 192 VSLRNMIIGCGHENTGTFDPAGSGIIGLGGGSTSLVSQLRKSINGKFSYCLVPF--TSET 249
Query: 177 NISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
++S + G I S D + T M+K YY++ LEAI++G+ ++ + F +
Sbjct: 250 GLTSKINFGTNGIVSGDGVVSTSMVKKDP-ATYYFLNLEAISVGSK---KIQFTSTIFGT 305
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTG-FDLCYRVPCPNN 295
G G +++DSGTT T LP FY +L S++ STI +A+ V++ G LCYR ++
Sbjct: 306 -GEGNIVIDSGTTLTLLPSNFYYELESVVASTI----KAERVQDPDGILSLCYR---DSS 357
Query: 296 TFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFG 355
+F P IT HF + L N F A+S V C F + + +FG
Sbjct: 358 SFK---VPDITVHFKGG-DVKLGNLNTFVAVSED-----VSCFAFAANEQ-----LTIFG 403
Query: 356 SFQQQNVEVVYDLEKERIGFQPMDCA 381
+ Q N V YD + F+ DC+
Sbjct: 404 NLAQMNFLVGYDTVSGTVSFKKTDCS 429
>gi|302763741|ref|XP_002965292.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
gi|300167525|gb|EFJ34130.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
Length = 423
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 101/380 (26%), Positives = 163/380 (42%), Gaps = 52/380 (13%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + DTGSD+ W+ C C C Y + F+PS SS+ TC SS C
Sbjct: 94 VNMVADTGSDVLWLQC----LPCQSC--YGQTDPL--FNPSFSSTFQSITCGSSLC---- 141
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
LL C R + +YG+G G + +TL ++ +
Sbjct: 142 ----------------QQLLIRGCRRNQCLYQVSYGDGSFTVGEFSTETLSFGSNA---V 182
Query: 122 REIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNISS 180
+ C + G+ G G+G LS PSQ+G L FS+C + S
Sbjct: 183 NSVAIGCGHNNQGLFTGAAGLLGLGKGLLSFPSQVGQLYGSVFSYCL----PTRESTGSV 238
Query: 181 PLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNG 240
PL+ G+ A++S N QFT +L +P +YY+ + I +G +S+ SL S GNG
Sbjct: 239 PLIFGNQAVAS--NAQFTTLLTNPKLDTFYYVEMVGIKVGGTSVNIPAGSLSLDSSTGNG 296
Query: 241 GLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDD 300
G+++DSGT T L Y+ + ++ + AK + FD CY + ++
Sbjct: 297 GVILDSGTAVTRLVTSAYNPMRDAFRAGMP--SDAKMTSGFSLFDTCYDLSGRSSI---- 350
Query: 301 LFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQ 360
+ P+++F F ++ LP N + P ++S CL F + + G+ QQQ
Sbjct: 351 MLPAVSFVFNGGATMALPAQN----IMVPVDNSGTYCLAFAPNSEN----FSIIGNIQQQ 402
Query: 361 NVEVVYDLEKERIGFQPMDC 380
+ + +D R+G C
Sbjct: 403 SFRMSFDSTGNRVGIGANQC 422
>gi|357440289|ref|XP_003590422.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355479470|gb|AES60673.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 498
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 98/397 (24%), Positives = 171/397 (43%), Gaps = 69/397 (17%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V +DTGSD+ W+ C C +C + NF + SS++
Sbjct: 99 VQIDTGSDILWINCNT----CSNCPKSSGLGIELNFFDTVGSSTA--------------- 139
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCP---SFAYTYGEGGLVTGILTRDTLK----VHGS 116
PC+ C+ + + C P S+ + Y +G +G+ D + + S
Sbjct: 140 --ALVPCSDPMCASAIQGAAAQCSPQVNQCSYTFQYEDGSGTSGVYVSDAMYFDMILGQS 197
Query: 117 SPGIIREIPKFCFGC-------VGSTYREPIGIAGFGRGALSVPSQL---GFLQKGFSHC 166
+P + FGC + T + GI GFG G LSV SQL G K FSHC
Sbjct: 198 TPANVASSATIVFGCSTYQSGDLTKTDKAVDGILGFGPGELSVVSQLSSRGITPKVFSHC 257
Query: 167 FLAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPN--YYYIGLEAITIGNSSL 224
D N LV+G++ S ++ SP+ P+ +Y + L++I + L
Sbjct: 258 L-----KGDGNGGGILVLGEILEPS--------IVYSPLVPSQPHYNLNLQSIAVNGQVL 304
Query: 225 TEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGF 284
+ P D +G ++DSGTT ++L + Y L++ + + ++ + + +
Sbjct: 305 SINPAVFATSDKRGT---IIDSGTTLSYLVQEAYDPLVNAVDTAVSQFATSFISKGSQ-- 359
Query: 285 DLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMD 344
CY V T DD FP+++F+F S+ L + + + + + + C+ FQ +
Sbjct: 360 --CYLVL----TSIDDSFPTVSFNFEGGASMDL-KPSQYLLNRGFQDGAKMWCIGFQKVQ 412
Query: 345 DGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
+G + G ++ VVYDL +++IG+ DC+
Sbjct: 413 EG----VTILGDLVLKDKIVVYDLARQQIGWTNYDCS 445
>gi|38344196|emb|CAE05761.2| OSJNBa0064G10.12 [Oryza sativa Japonica Group]
Length = 451
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 96/378 (25%), Positives = 151/378 (39%), Gaps = 76/378 (20%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+D+GSD+ WV C C C Y + F P+ SSS S +C S+ C
Sbjct: 147 VDSGSDVIWVQC----RPCEQC--YAQTDPL--FDPAASSSFSGVSCGSAICR------- 191
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
T+SG + C ++ TYG+G G L +TL + G++ ++ +
Sbjct: 192 -----TLSGTGCGGGGDAGKC----DYSVTYGDGSYTKGELALETLTLGGTA---VQGVA 239
Query: 126 KFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNISSPLVI 184
C + G+ G G GA+S+ QLG G FS+C + +++S
Sbjct: 240 IGCGHRNSGLFVGAAGLLGLGWGAMSLVGQLGGAAGGVFSYCLASRGAGGAGSLAS---- 295
Query: 185 GDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPL--SLREFDSQGNGGL 242
++YY+GL I +G L PL SL + G GG+
Sbjct: 296 -----------------------SFYYVGLTGIGVGGERL---PLQDSLFQLTEDGAGGV 329
Query: 243 LVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDLF 302
++D+GT T LP Y+ L + PR+ V D CY + + +
Sbjct: 330 VMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSL---LDTCYDL----SGYASVRV 382
Query: 303 PSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQNV 362
P+++F+F L LP N + AV CL F G + G+ QQ+ +
Sbjct: 383 PTVSFYFDQGAVLTLPARNLLVEVGG-----AVFCLAFAPSSSG----ISILGNIQQEGI 433
Query: 363 EVVYDLEKERIGFQPMDC 380
++ D +GF P C
Sbjct: 434 QITVDSANGYVGFGPNTC 451
>gi|449456068|ref|XP_004145772.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449496218|ref|XP_004160076.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 500
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 108/404 (26%), Positives = 181/404 (44%), Gaps = 65/404 (16%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNF-SPSRSSSSSRDTCASSFC-LNIH 61
V +DTGSD+ WV C + C C ++ NF P S+++S +C+ C L +
Sbjct: 98 VQIDTGSDVLWVSCNS----CNGCPATSGLQIPLNFFDPGSSTTASLVSCSDQICALGVQ 153
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDT--LKVHGSSPG 119
SSD S C +S C ++ + YG+G +G D L V S
Sbjct: 154 SSD--------SAC----FGQSNQC----AYVFQYGDGSGTSGYYVMDMIHLDVVIDSSV 197
Query: 120 IIREIPKFCFGCVGS-------TYREPIGIAGFGRGALSVPSQL---GFLQKGFSHCFLA 169
FGC S + R GI GFG+ LSV SQL G K FSHC
Sbjct: 198 TSNSSASVVFGCSTSQTGDLTKSDRAVDGIFGFGQQDLSVISQLSSRGIAPKVFSHCL-- 255
Query: 170 FKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPL 229
D + LV+G++ + N+ +TP++ P P +Y + L++I++ L P+
Sbjct: 256 ---KGDDSGGGILVLGEIV---EPNVVYTPLV--PSQP-HYNLNLQSISVNGQVL---PI 303
Query: 230 SLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYR 289
S F + + G ++DSGTT +L E Y+ + + + ++ ++ ++ + CY
Sbjct: 304 SPAVFATSSSQGTIIDSGTTLAYLAEEAYNAFVVAVTNIVSQSTQSVVLKG----NRCY- 358
Query: 290 VPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYG 349
++ D+FP ++ +F SLVL ++ ++ ++ V C+ FQ +
Sbjct: 359 ---VTSSSVSDIFPQVSLNFAGGASLVLGAQDYLIQQNSVGGTT-VWCIGFQKIPGQGI- 413
Query: 350 PSGVFGSFQQQNVEVVYDLEKERIGFQPMDCA-----STASAQG 388
+ G ++ +YDL +RIG+ DC+ STA+ G
Sbjct: 414 --TILGDLVLKDKIFIYDLANQRIGWTNYDCSMSVNVSTATKTG 455
>gi|449442281|ref|XP_004138910.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449506266|ref|XP_004162699.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 482
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 122/412 (29%), Positives = 188/412 (45%), Gaps = 83/412 (20%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKL-MSNFSPSRSSSSSRDTCASSFCLNIHS 62
V +DTGSD+ WV C C +C + + +S +SPS SS+S+R TC FC + +
Sbjct: 89 VQVDTGSDILWVNCAG----CTNCPKKSDLGIELSLYSPSSSSTSNRVTCNQDFCTSTY- 143
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTL---KVHG---- 115
D P + GC+ L + + YG+G G RD + +V G
Sbjct: 144 -DGP-----IPGCTPELLCE---------YRVAYGDGSSTAGYFVRDHVVLDRVTGNFQT 188
Query: 116 -SSPGIIREIPKFCFGC-------VGSTYREPIGIAGFGRGALSVPSQL---GFLQKGFS 164
S+ G I FGC +G+T GI GFG+ S+ SQL G +++ F+
Sbjct: 189 TSTNGSI------VFGCGAQQSGQLGATSAALDGILGFGQANSSMISQLASSGKVKRVFA 242
Query: 165 HCFLAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSL 224
HC ++ N IG+V + ++ TP++ + N + ++AI + N L
Sbjct: 243 HCL------DNINGGGIFAIGEVV---QPKVRTTPLVPQQAHYNVF---MKAIEVDNEVL 290
Query: 225 TEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSIL---QSTITYYPRAKEVEER 281
L FD+ G ++DSGTT + P+ Y L+S + QST+ + VEE+
Sbjct: 291 N---LPTDVFDTDLRKGTIIDSGTTLAYFPDVIYEPLISKIFARQSTLKLHT----VEEQ 343
Query: 282 TGFDLCYRVPCPNNTFTDDLFPSITFHFLNNVSL-VLPQGNHFYAMSAPSNSSAV--KCL 338
C+ + DD FP++TFHF +++SL V P H Y SN V +
Sbjct: 344 F---TCFEY----DGNVDDGFPTVTFHFEDSLSLTVYP---HEYLFDIDSNKWCVGWQNS 393
Query: 339 LFQSMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASAQGLH 390
QS D D + G QN V+YDLE + IG+ +C+S+ + H
Sbjct: 394 GAQSRDGKDM---ILLGDLVLQNRLVMYDLENQTIGWTEYNCSSSIKVRDEH 442
>gi|242072067|ref|XP_002451310.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
gi|241937153|gb|EES10298.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
Length = 509
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 110/385 (28%), Positives = 158/385 (41%), Gaps = 67/385 (17%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DT SD+ WV C F C Y ++ + PS+S SS C+S C +
Sbjct: 184 MLLDTASDVAWVQC----FPCPASQCYAQTDVL--YDPSKSRSSESFACSSPTCRQL--- 234
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
P+ +GCS S+ C + Y +G +G L D L + +S +
Sbjct: 235 -GPY----ANGCSSSSNSAGQC-----QYRVRYPDGSTTSGTLVADQLSLSPTS-----Q 279
Query: 124 IPKFCFGCV----GSTYR-EPIGIAGFGRGALSVPSQLGF-LQKGFSHCFLAFKYANDPN 177
+PKF FGC GS R + GI GRG S+ SQ + FS+CF P
Sbjct: 280 VPKFEFGCSHAARGSFSRSKTAGIMALGRGVQSLVSQTSTKYGQVFSYCF-------PPT 332
Query: 178 ISSP--LVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFD 235
S V+G V S TPMLK+PM Y + LEAI + L +VP ++
Sbjct: 333 ASHKGFFVLG-VPRRSSSRYAVTPMLKTPML---YQVRLEAIAVAGQRL-DVPPTVFA-- 385
Query: 236 SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNN 295
G +DS T T LP Y L S + ++ Y + D CY
Sbjct: 386 ----AGAALDSRTVITRLPPTAYQALRSAFRDKMSMY---RPAAANGQLDTCYD------ 432
Query: 296 TFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFG 355
F ++ L +SLV + + PS CL F S GD +G+ G
Sbjct: 433 ------FTGVSSIMLPTISLVFDRTGAGVQLD-PSGVLFGSCLAFASTA-GDDRATGIIG 484
Query: 356 SFQQQNVEVVYDLEKERIGFQPMDC 380
Q Q +EV+Y++ +GF+ C
Sbjct: 485 FLQLQTIEVLYNVAGGSVGFRRGAC 509
>gi|116786826|gb|ABK24255.1| unknown [Picea sitchensis]
gi|148906052|gb|ABR16185.1| unknown [Picea sitchensis]
Length = 485
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 103/378 (27%), Positives = 162/378 (42%), Gaps = 53/378 (14%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DTGSD+TW+ C C DC Y+ + + ++P+ SSS C ++ C +
Sbjct: 160 MVLDTGSDVTWIQCE----PCSDC--YQQSDPI--YNPALSSSYKLVGCQANLCQQLD-- 209
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
+SGCS ++ C + +YG+G G +TL + G+ ++
Sbjct: 210 --------VSGCS-----RNGSCL----YQVSYGDGSYTQGNFATETLTLGGAP---LQN 249
Query: 124 IPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQ-KGFSHCFLAFKYANDPNISSPL 182
+ C + G+ G G G+LS PSQL K FS+C + D SS L
Sbjct: 250 VAIGCGHDNEGLFVGAAGLLGLGGGSLSFPSQLTDENGKIFSYCLVD----RDSESSSTL 305
Query: 183 VIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGGL 242
G A+ + PMLK+ +YY+ L I++G L+ + S+ D+ GNGG+
Sbjct: 306 QFGRAAV--PNGAVLAPMLKNSRLDTFYYVSLSGISVGGKMLS-ISDSVFGIDASGNGGV 362
Query: 243 LVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDLF 302
+VDSGT T L Y L ++ P V + FD CY + +
Sbjct: 363 IVDSGTAVTRLQTAAYDSLRDAFRAGTKNLPSTDGV---SLFDTCYDLSSKESVDV---- 415
Query: 303 PSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQNV 362
P++ FHF S+ LP N+ P +S C F + G+ QQQ +
Sbjct: 416 PTVVFHFSGGGSMSLPAKNYL----VPVDSMGTFCFAFAPTSS----SLSIVGNIQQQGI 467
Query: 363 EVVYDLEKERIGFQPMDC 380
V +D ++GF C
Sbjct: 468 RVSFDRANNQVGFAVNKC 485
>gi|226492391|ref|NP_001140482.1| uncharacterized protein LOC100272542 precursor [Zea mays]
gi|224030447|gb|ACN34299.1| unknown [Zea mays]
gi|414887506|tpg|DAA63520.1| TPA: hypothetical protein ZEAMMB73_432695 [Zea mays]
Length = 512
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 107/397 (26%), Positives = 168/397 (42%), Gaps = 60/397 (15%)
Query: 3 QVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHS 62
Q+ MDTGSDL W+ C C+DC + R F P+ SSS TC C ++
Sbjct: 160 QMIMDTGSDLNWLQCA----PCLDCFEQRGPV----FDPAASSSYRNLTCGDPRCGHV-- 209
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCR-----PCPSFAYTYGEGGLVTGILTRDTLKVHGSS 117
+ CR PCP + Y YG+ TG L ++ V+ ++
Sbjct: 210 -------------APPEAPAPRACRRPGEDPCPYY-YWYGDQSNSTGDLALESFTVNLTA 255
Query: 118 PGIIREIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYAN 174
PG + FGC + G+ G GRG LS SQL + G + + + +
Sbjct: 256 PGASSRVDGVVFGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRAVYGGHTFSYCLVDHGS 315
Query: 175 DPNISSPLVIGD---VAISSKDNLQFTPML-KSPMYPNYYYIGLEAITIGNSSLTEVPLS 230
D ++S +V G+ +A+++ L++T S +YY+ L + +G L +S
Sbjct: 316 D--VASKVVFGEDDALALAAHPRLKYTAFAPASSPADTFYYVRLTGVLVGGELLN---IS 370
Query: 231 LREFDSQ--GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTIT-YYPRAKEVEERTGFDLC 287
+D+ G+GG ++DSGTT ++ EP Y + ++ YP D
Sbjct: 371 SDTWDASEGGSGGTIIDSGTTLSYFVEPAYQVIRRAFIDRMSGSYPPVP--------DFP 422
Query: 288 YRVPCPNNTFTDD-LFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDG 346
PC N + + P ++ F + P N+F + + + CL +
Sbjct: 423 VLSPCYNVSGVERPEVPELSLLFADGAVWDFPAENYFIRL----DPDGIMCLAV--LGTP 476
Query: 347 DYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCAST 383
G S + G+FQQQN V YDL R+GF P CA
Sbjct: 477 RTGMS-IIGNFQQQNFHVAYDLHNNRLGFAPRRCAEV 512
>gi|449441139|ref|XP_004138341.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449477464|ref|XP_004155031.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 336
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 107/382 (28%), Positives = 161/382 (42%), Gaps = 66/382 (17%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRN--NKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+DTGSD+TW+ C+ C ++ F P SSS + +C S C + +
Sbjct: 14 LDTGSDVTWL-------QCLPCAGKNGCYEQITPIFDPELSSSYNPVSCDSEQCQLLDEA 66
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLK-VHGSSPGIIR 122
GC++++ + + YG+G G L +TL VH +S
Sbjct: 67 ----------GCNVNSCI----------YKVEYGDGSFTIGELATETLTFVHSNS----- 101
Query: 123 EIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNIS 179
IP GC + G+ G G GA+S+ SQL FS+C + +I
Sbjct: 102 -IPNISIGCGHDNEGLFVGADGLIGLGGGAISISSQLK--ASSFSYCLV--------DID 150
Query: 180 SP-LVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQG 238
SP D + +P++K+ +P++ Y+ + +++G L + S E D G
Sbjct: 151 SPSFSTLDFNTDPPSDSLISPLVKNDRFPSFRYVKVIGMSVGGKPL-PISSSRFEIDESG 209
Query: 239 NGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFT 298
GG++VDSGTT T LP Y L T P A E+ + FD CY + +N
Sbjct: 210 LGGIIVDSGTTITQLPSDVYEVLREAFLGLTTNLPPAPEI---SPFDTCYDLSSQSNVEV 266
Query: 299 DDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQ 358
P+I F SL LP N + +S+ CL F S P + G+FQ
Sbjct: 267 ----PTIAFILPGENSLQLPAKNCLIQV----DSAGTFCLAFVSAT----FPLSIIGNFQ 314
Query: 359 QQNVEVVYDLEKERIGFQPMDC 380
QQ + V YDL +GF C
Sbjct: 315 QQGIRVSYDLTNSLVGFSTNKC 336
>gi|388505490|gb|AFK40811.1| unknown [Medicago truncatula]
Length = 193
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 65/183 (35%), Positives = 95/183 (51%), Gaps = 17/183 (9%)
Query: 198 TPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPF 257
TP++ +P+ P++YYI LE I++G++ L+ + S E G+GG+++DSGTT T++ E
Sbjct: 25 TPLITNPLQPSFYYISLEVISVGDTKLS-IEQSTFEVSDDGSGGVIIDSGTTITYIEENA 83
Query: 258 YSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVL 317
+ L S T P K TG D+C+ +P T+ P + FHF L L
Sbjct: 84 FDSLKKEFTSQ-TKLPVDK--SGSTGLDVCFSLPSGK---TEVEIPKLVFHFKGG-DLEL 136
Query: 318 PQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQP 377
P N+ A S S V CL G +FG+ QQQN+ V +DL+KE I F P
Sbjct: 137 PGENYMIADS----SLGVACLAM-----GASNGMSIFGNIQQQNILVNHDLQKETITFIP 187
Query: 378 MDC 380
C
Sbjct: 188 TQC 190
>gi|449515033|ref|XP_004164554.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 430
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 109/383 (28%), Positives = 162/383 (42%), Gaps = 72/383 (18%)
Query: 7 DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDNP 66
DTGSDL W C C+ C Y+ ++ + F P +S+S S C S C I S
Sbjct: 110 DTGSDLMWAQC----LPCLKC--YKQSRPI--FDPLKSTSFSHVPCNSQNCKAIDDSH-- 159
Query: 67 FDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIPK 126
C G ++YTYG+ G L + + + SS K
Sbjct: 160 ---CGAQGVC--------------DYSYTYGDQTYTKGDLGFEKITIGSSSV-------K 195
Query: 127 FCFGC---VGSTYREPIGIAGFGRGALSVPSQLGF---LQKGFSHCF-LAFKYANDPNIS 179
GC G + G+ G G G LS+ SQ+ + + FS+C +AN
Sbjct: 196 SVIGCGHESGGGFGFASGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHAN----- 250
Query: 180 SPLVIGDVAISSKDNLQFTPML-KSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQG 238
+ G A+ S + TP++ K+P+ YYY+ LEAI+IGN R S
Sbjct: 251 GKINFGQNAVVSGPGVVSTPLISKNPV--TYYYVTLEAISIGNE---------RHMASAK 299
Query: 239 NGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGF-DLCYRVPCPNNTF 297
G +++DSGTT + LP+ Y ++S L + +AK V++ F DLC+ N
Sbjct: 300 QGNVIIDSGTTLSFLPKELYDGVVSSLLKVV----KAKRVKDPGNFWDLCFDDGI--NVA 353
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSF 357
T P IT F ++ L N F + ++ V CL D G+ G+
Sbjct: 354 TSSGIPIITAQFSGGANVNLLPVNTFQKV-----ANNVNCLTLTPASPTD--EFGIIGNL 406
Query: 358 QQQNVEVVYDLEKERIGFQPMDC 380
N + YDLE +R+ F+P C
Sbjct: 407 ALANFLIGYDLEAKRLSFKPTVC 429
>gi|255544139|ref|XP_002513132.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223548143|gb|EEF49635.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 481
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 99/382 (25%), Positives = 160/382 (41%), Gaps = 62/382 (16%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V +D+GSD+ WV C C C Y + F P+ S+S C+SS C I ++
Sbjct: 157 VVIDSGSDIVWVQCQ----PCTQC--YHQTDPV--FDPADSASFMGVPCSSSVCERIENA 208
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
C GC + YG+G G L +TL + ++R
Sbjct: 209 G-----CHAGGCRYEVM---------------YGDGSYTKGTLALETLTFGRT---VVRN 245
Query: 124 IPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNISSPL 182
+ C + G+ G G G++S+ QLG G FS+C ++ D S
Sbjct: 246 VAIGCGHRNRGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVS--RGTDSAGSLEF 303
Query: 183 VIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLS--LREFDSQGNG 240
G + + + + P++++P P++YYI L + +G +VP+S + + + GNG
Sbjct: 304 GRGAMPVGAA----WIPLIRNPRAPSFYYIRLSGVGVGG---MKVPISEDVFQLNEMGNG 356
Query: 241 GLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDD 300
G+++D+GT T +P Y PRA V FD CY + N F
Sbjct: 357 GVVMDTGTAVTRIPTVAYVAFRDAFIGQTGNLPRASGVSI---FDTCYNL----NGFVSV 409
Query: 301 LFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSG--VFGSFQ 358
P+++F+F L LP N P + C F + PSG + G+ Q
Sbjct: 410 RVPTVSFYFAGGPILTLPARNFLI----PVDDVGTFCFAFAA------SPSGLSIIGNIQ 459
Query: 359 QQNVEVVYDLEKERIGFQPMDC 380
Q+ +++ +D +GF P C
Sbjct: 460 QEGIQISFDGANGFVGFGPNVC 481
>gi|242095602|ref|XP_002438291.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
gi|241916514|gb|EER89658.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
Length = 464
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 109/390 (27%), Positives = 160/390 (41%), Gaps = 75/390 (19%)
Query: 1 VIQVY-MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLN 59
V QV +DTGSD++WV C + C + +KL F P+ S++ S +C S+ C
Sbjct: 140 VTQVMSIDTGSDVSWVQCAPCA--AQSCSS-QKDKL---FDPAMSATYSAFSCGSAQCAQ 193
Query: 60 IHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPG 119
+ N GC LKS C + YG+G G DTL + S
Sbjct: 194 LGDEGN--------GC-----LKSQC-----QYIVKYGDGSNTAGTYGSDTLSLTSSD-- 233
Query: 120 IIREIPKFCFGC---VGSTYREPIGIAGFGRGALSVPSQLGFL-QKGFSHCFLAFKYAND 175
+ F FGC E G+ G G S+ SQ K FS+C
Sbjct: 234 ---AVKSFQFGCSHRAAGFVGELDGLMGLGGDTESLVSQTAATYGKAFSYCL-------- 282
Query: 176 PNISSP----LVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSL 231
P SS L +G +S TPM++ + P +Y + L+ IT+ + L VP S+
Sbjct: 283 PPPSSSGGGFLTLGAAGGASSSRYSHTPMVRFSV-PTFYGVFLQGITVAGTML-NVPASV 340
Query: 232 REFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVP 291
+G +VDSGT T LP Y L + + + YP A V D C+
Sbjct: 341 F------SGASVVDSGTVITQLPPTAYQALRTAFKKEMKAYPSAAPVGS---LDTCFDF- 390
Query: 292 CPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLF-QSMDDGDYGP 350
+ F P++T F ++ L YA CL F + DGD
Sbjct: 391 ---SGFNTITVPTVTLTFSRGAAMDLDISGILYA----------GCLAFTATAHDGD--- 434
Query: 351 SGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
+G+ G+ QQ+ E+++D+ IGF+ C
Sbjct: 435 TGILGNVQQRTFEMLFDVGGRTIGFRSGAC 464
>gi|297734873|emb|CBI17107.3| unnamed protein product [Vitis vinifera]
Length = 484
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 103/393 (26%), Positives = 171/393 (43%), Gaps = 63/393 (16%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNF-SPSRSSSSSRDTCASSFC-LNIH 61
V +DTGSD+ WV CG+ C C + NF P SS++S +C+ C L +
Sbjct: 83 VQIDTGSDVLWVSCGS----CNGCPQSSGLHIPLNFFDPGSSSTASLISCSDQRCSLGVQ 138
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLK---VHGSSP 118
SSD +GCS + C + + YG+G +G D L + GSS
Sbjct: 139 SSD--------AGCSSQ---GNQCI-----YTFQYGDGSGTSGYYVSDLLNFDAIVGSS- 181
Query: 119 GIIREIPKFCFGC-------VGSTYREPIGIAGFGRGALSVPSQL---GFLQKGFSHCFL 168
+ FGC + + R GI GFG+ +SV SQ+ G K FSHC
Sbjct: 182 -VTNSSASIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVISQMSSQGITPKVFSHCLK 240
Query: 169 AFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVP 228
+ +V ++++ ++P++ P P +Y + L++I++ SL P
Sbjct: 241 GDGGGGGILVLGEIV--------EEDIVYSPLV--PSQP-HYNLNLQSISVNGKSLAIDP 289
Query: 229 LSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCY 288
F + N G +VDSGTT +L E Y +S + ++ R + CY
Sbjct: 290 ---EVFATSTNRGTIVDSGTTLAYLAEEAYDPFVSAITEAVSQSVRPLLSKGTQ----CY 342
Query: 289 RVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDY 348
+ + +FP+++ +F VS+ L ++ ++ + +AV C+ FQ +
Sbjct: 343 LI----TSSVKGIFPTVSLNFAGGVSMNLKPEDYLLQQNSIGD-AAVWCIGFQKIQGQGI 397
Query: 349 GPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
+ G ++ VYDL +RIG+ DC+
Sbjct: 398 ---TILGDLVLKDKIFVYDLAGQRIGWANYDCS 427
>gi|357159298|ref|XP_003578403.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 442
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 103/396 (26%), Positives = 168/396 (42%), Gaps = 74/396 (18%)
Query: 3 QVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCA--SSFCLNI 60
+ +DTGSDL W C C+ + + ++ S+SS+ CA + FC
Sbjct: 100 EALIDTGSDLIWTQCAT---TCLPKSCAKQG--LPYYNLSQSSTFVPVPCADKAGFC--- 151
Query: 61 HSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
+ N C + G +C +F +YG G ++ + T G++
Sbjct: 152 --AANGVHLCGLDG---------SC-----TFIASYGAGRVIGSLGTESFAFESGTT--- 192
Query: 121 IREIPKFCFGCVGST------YREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYAN 174
FGCV T + G+ G GRG LS+ SQ+G + FS+C Y +
Sbjct: 193 -----SLAFGCVSLTRITSGALNDASGLIGLGRGRLSLVSQIGATR--FSYCLT--PYFH 243
Query: 175 DPNISSPLVIGDVAISSKDNLQFTPMLKSPM---YPNYYYIGLEAITIGNSSLTEV---P 228
SS L +G A P +KSP Y +YY+ LE IT+G + L V
Sbjct: 244 SSGASSHLFVGASASLGGGGASM-PFVKSPKDYPYSTFYYLPLEGITVGKTRLPAVNSTT 302
Query: 229 LSLRE-FDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTI---TYYPRAKEVEERTGF 284
LR+ F GG+++D+G+ T L Y L + + + + P E +G
Sbjct: 303 FQLRQLFKGYWAGGVIIDTGSPLTQLASHAYEALKEEVAAQLGNGSLVP----APEDSGL 358
Query: 285 DLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMD 344
+LC F + P++ FHF + +P +++ AP + +A ++ +
Sbjct: 359 ELC----VAREGF-QKVVPALVFHFGGGADMAVPAASYW----APVDKAAACMMILEG-- 407
Query: 345 DGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
G + G+FQQQ++ ++YDL + R FQ DC
Sbjct: 408 ----GYDSIIGNFQQQDMHLLYDLRRGRFSFQTADC 439
>gi|297806153|ref|XP_002870960.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
lyrata]
gi|297316797|gb|EFH47219.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
lyrata]
Length = 453
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 108/405 (26%), Positives = 169/405 (41%), Gaps = 56/405 (13%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
I + +DTGS+L+W+ C S N ++NF P+RSSS S C+S C
Sbjct: 86 ISMVIDTGSELSWLRCNRSS----------NPNPVNNFDPTRSSSYSPIPCSSPTC---- 131
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
D + C L +T +Y + G L + S+
Sbjct: 132 -RTRTRDFLIPASCDSDKLCHATL---------SYADASSSEGNLAAEIFHFGNST---- 177
Query: 122 REIPKFCFGCVGSTY-------REPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYAN 174
FGC+GS + G+ G RG+LS SQ+GF + FS+C +
Sbjct: 178 -NDSNLIFGCMGSVSGSDPEEDTKTTGLLGMNRGSLSFISQMGFPK--FSYCI-----SG 229
Query: 175 DPNISSPLVIGDVAISSKDNLQFTPMLK-SPMYPNY----YYIGLEAITIGNSSLTEVPL 229
+ L++GD + L +TP+++ S P + Y + L I + N L +P
Sbjct: 230 TDDFPGFLLLGDSNFTWLTPLNYTPLIRISTPLPYFDRVAYTVQLTGIKV-NGKLLPIPK 288
Query: 230 SLREFDSQGNGGLLVDSGTTYTHLPEPFYSQL----LSILQSTITYYPRAKEVEERTGFD 285
S+ D G G +VDSGT +T L P Y+ L L+ +T Y + V + T D
Sbjct: 289 SVLLPDHTGAGQTMVDSGTQFTFLLGPVYTALRSDFLNQTNGILTVYEDPEFVFQGT-MD 347
Query: 286 LCYRV-PCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMD 344
LCYR+ P T P+++ F V Q + + + +V C F + D
Sbjct: 348 LCYRISPFRIRTGILHRLPTVSLVFEGAEIAVSGQPLLYRVPHLTAGNDSVYCFTFGNSD 407
Query: 345 DGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASAQGL 389
+ V G QQN+ + +DL++ RIG P+ C + G+
Sbjct: 408 LMGM-EAYVIGHHHQQNMWIEFDLQRSRIGLAPVQCDVSGQRLGI 451
>gi|225436397|ref|XP_002272121.1| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
vinifera]
Length = 499
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 103/393 (26%), Positives = 171/393 (43%), Gaps = 63/393 (16%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNF-SPSRSSSSSRDTCASSFC-LNIH 61
V +DTGSD+ WV CG+ C C + NF P SS++S +C+ C L +
Sbjct: 98 VQIDTGSDVLWVSCGS----CNGCPQSSGLHIPLNFFDPGSSSTASLISCSDQRCSLGVQ 153
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLK---VHGSSP 118
SSD +GCS + C + + YG+G +G D L + GSS
Sbjct: 154 SSD--------AGCSSQ---GNQCI-----YTFQYGDGSGTSGYYVSDLLNFDAIVGSS- 196
Query: 119 GIIREIPKFCFGC-------VGSTYREPIGIAGFGRGALSVPSQL---GFLQKGFSHCFL 168
+ FGC + + R GI GFG+ +SV SQ+ G K FSHC
Sbjct: 197 -VTNSSASIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVISQMSSQGITPKVFSHCLK 255
Query: 169 AFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVP 228
+ +V ++++ ++P++ P P +Y + L++I++ SL P
Sbjct: 256 GDGGGGGILVLGEIV--------EEDIVYSPLV--PSQP-HYNLNLQSISVNGKSLAIDP 304
Query: 229 LSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCY 288
F + N G +VDSGTT +L E Y +S + ++ R + CY
Sbjct: 305 ---EVFATSTNRGTIVDSGTTLAYLAEEAYDPFVSAITEAVSQSVRPLLSKGTQ----CY 357
Query: 289 RVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDY 348
+ + +FP+++ +F VS+ L ++ ++ + +AV C+ FQ +
Sbjct: 358 LI----TSSVKGIFPTVSLNFAGGVSMNLKPEDYLLQQNSIGD-AAVWCIGFQKIQGQGI 412
Query: 349 GPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
+ G ++ VYDL +RIG+ DC+
Sbjct: 413 ---TILGDLVLKDKIFVYDLAGQRIGWANYDCS 442
>gi|19424106|gb|AAL87345.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
Length = 500
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 98/379 (25%), Positives = 160/379 (42%), Gaps = 57/379 (15%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DTGSD+ W+ C C DC Y+ + + F+P+ SS+ TC++ C
Sbjct: 177 LVLDTGSDVNWIQCE----PCADC--YQQSDPV--FNPTSSSTYKSLTCSAPQC------ 222
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPS-FAYTYGEGGLVTGILTRDTLKVHGSSPGIIR 122
+LL+++ CR + +YG+G G L DT+ S G I
Sbjct: 223 ---------------SLLETSACRSNKCLYQVSYGDGSFTVGELATDTVTFGNS--GKIN 265
Query: 123 EIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSPL 182
+ C + G+ G G G LS+ +Q+ FS+C + D SS L
Sbjct: 266 NVALGCGHDNEGLFTGAAGLLGLGGGVLSITNQMK--ATSFSYCLVD----RDSGKSSSL 319
Query: 183 VIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGGL 242
V + D P+L++ +YY+GL ++G + +P ++ + D+ G+GG+
Sbjct: 320 DFNSVQLGGGD--ATAPLLRNKKIDTFYYVGLSGFSVGGEKVV-LPDAIFDVDASGSGGV 376
Query: 243 LVDSGTTYTHLPEPFYSQLL-SILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDL 301
++D GT T L Y+ L + L+ T+ + + FD CY ++ +
Sbjct: 377 ILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISL---FDTCYDF----SSLSTVK 429
Query: 302 FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQN 361
P++ FHF SL LP N+ P + S C F + G+ QQQ
Sbjct: 430 VPTVAFHFTGGKSLDLPAKNYLI----PVDDSGTFCFAFAPTS----SSLSIIGNVQQQG 481
Query: 362 VEVVYDLEKERIGFQPMDC 380
+ YDL K IG C
Sbjct: 482 TRITYDLSKNVIGLSGNKC 500
>gi|449462553|ref|XP_004149005.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 418
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 109/382 (28%), Positives = 163/382 (42%), Gaps = 70/382 (18%)
Query: 7 DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDNP 66
DTGSDLTW C C+ C +L F+P +S+S S C + C H+ D+
Sbjct: 98 DTGSDLTWAQC----LPCLKC----YQQLRPIFNPLKSTSFSHVPCNTQTC---HAVDD- 145
Query: 67 FDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIPK 126
C + G ++YTYG+ G L + + + SS K
Sbjct: 146 -GHCGVQGVC--------------DYSYTYGDRTYSKGDLGFEKITIGSSSV-------K 183
Query: 127 FCFGCVGST---YREPIGIAGFGRGALSVPSQLG---FLQKGFSHCF-LAFKYANDPNIS 179
GC ++ + G+ G G G LS+ SQ+ + + FS+C +AN
Sbjct: 184 SVIGCGHASSGGFGFASGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHAN----- 238
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
+ G A+ S + TP++ S YYYI LEAI+IGN F QGN
Sbjct: 239 GKINFGQNAVVSGPGVVSTPLI-SKNTVTYYYITLEAISIGNERH-------MAFAKQGN 290
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGF-DLCYRVPCPNNTFT 298
+++DSGTT + LP+ Y ++S L + +AK V++ F DLC+ N T
Sbjct: 291 --VIIDSGTTLSFLPKELYDGVVSSLLKVV----KAKRVKDPGNFWDLCFDDGI--NVAT 342
Query: 299 DDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQ 358
P IT F ++ L N F + ++ V CL D G+ G+
Sbjct: 343 SSGIPIITAQFSGGANVNLLPVNTFQKV-----ANNVNCLTLTPASPTD--EFGIIGNLA 395
Query: 359 QQNVEVVYDLEKERIGFQPMDC 380
N + YDLE +R+ F+P C
Sbjct: 396 LANFLIGYDLEAKRLSFKPTVC 417
>gi|15229656|ref|NP_188478.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75273882|sp|Q9LS40.1|ASPG1_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 1;
Short=AtASPG1; Flags: Precursor
gi|11994113|dbj|BAB01116.1| CND41, chloroplast nucleoid DNA binding protein-like [Arabidopsis
thaliana]
gi|23297732|gb|AAN13013.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
gi|332642583|gb|AEE76104.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 500
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 98/379 (25%), Positives = 160/379 (42%), Gaps = 57/379 (15%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DTGSD+ W+ C C DC Y+ + + F+P+ SS+ TC++ C
Sbjct: 177 LVLDTGSDVNWIQCE----PCADC--YQQSDPV--FNPTSSSTYKSLTCSAPQC------ 222
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPS-FAYTYGEGGLVTGILTRDTLKVHGSSPGIIR 122
+LL+++ CR + +YG+G G L DT+ S G I
Sbjct: 223 ---------------SLLETSACRSNKCLYQVSYGDGSFTVGELATDTVTFGNS--GKIN 265
Query: 123 EIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSPL 182
+ C + G+ G G G LS+ +Q+ FS+C + D SS L
Sbjct: 266 NVALGCGHDNEGLFTGAAGLLGLGGGVLSITNQMK--ATSFSYCLVD----RDSGKSSSL 319
Query: 183 VIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGGL 242
V + D P+L++ +YY+GL ++G + +P ++ + D+ G+GG+
Sbjct: 320 DFNSVQLGGGD--ATAPLLRNKKIDTFYYVGLSGFSVGGEKVV-LPDAIFDVDASGSGGV 376
Query: 243 LVDSGTTYTHLPEPFYSQLL-SILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDL 301
++D GT T L Y+ L + L+ T+ + + FD CY ++ +
Sbjct: 377 ILDCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISL---FDTCYDF----SSLSTVK 429
Query: 302 FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQN 361
P++ FHF SL LP N+ P + S C F + G+ QQQ
Sbjct: 430 VPTVAFHFTGGKSLDLPAKNYLI----PVDDSGTFCFAFAPTS----SSLSIIGNVQQQG 481
Query: 362 VEVVYDLEKERIGFQPMDC 380
+ YDL K IG C
Sbjct: 482 TRITYDLSKNVIGLSGNKC 500
>gi|4063754|gb|AAC98462.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197956|gb|AAM15330.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
Length = 389
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 97/387 (25%), Positives = 159/387 (41%), Gaps = 78/387 (20%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
I+ +DTGS+ W C C+ C N+ F PS+SS+ C + H
Sbjct: 72 IEAVLDTGSEHIWTQC----LPCVHC----YNQTAPIFDPSKSSTFKEIRC------DTH 117
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
P++ YG G L +T+ +H +S G
Sbjct: 118 DHSCPYE-------------------------LVYGGKSYTKGTLVTETVTIHSTS-GQP 151
Query: 122 REIPKFCFGC-VGSTYREP--IGIAGFGRGALSVPSQLGFLQKGF-SHCFLAFKYANDPN 177
+P+ GC ++ +P G+ G RG S+ +Q+G G S+CF
Sbjct: 152 FVMPETIIGCGRNNSGFKPGFAGVVGLDRGPKSLITQMGGEYPGLMSYCFAG-------K 204
Query: 178 ISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
+S + G AI + D + T + P +YY+ L+A+++GN+ + V
Sbjct: 205 GTSKINFGANAIVAGDGVVSTTVFVKTAKPGFYYLNLDAVSVGNTRIETVGTPFHAL--- 261
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTIT--YYPRAKEVEERTGFDLCYRVPCPNN 295
G +++DSG+T T+ PE + + + ++ +T +PR+ LCY
Sbjct: 262 -KGNIVIDSGSTLTYFPESYCNLVRKAVEQVVTAVRFPRSD--------ILCYY------ 306
Query: 296 TFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFG 355
+ T D+FP IT HF LVL + Y M SN+ V CL + +FG
Sbjct: 307 SKTIDIFPVITMHFSGGADLVLDK----YNMYVASNTGGVFCLAIICNSPIE---EAIFG 359
Query: 356 SFQQQNVEVVYDLEKERIGFQPMDCAS 382
+ Q N V YD + F+P +C++
Sbjct: 360 NRAQNNFLVGYDSSSLLVSFKPTNCSA 386
>gi|224142013|ref|XP_002324355.1| predicted protein [Populus trichocarpa]
gi|222865789|gb|EEF02920.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 105/382 (27%), Positives = 150/382 (39%), Gaps = 70/382 (18%)
Query: 7 DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDNP 66
DTGSDLTW C C + + F P+ S+S +C+S FC I + P
Sbjct: 158 DTGSDLTWTQCEPCLGGCFPQNQPK-------FDPTTSTSYKNVSCSSEFCKLIAEGNYP 210
Query: 67 FDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIPK 126
C + +TC + YG G + G L +TL + S
Sbjct: 211 AQDC----------ISNTCL-----YGIQYGSGYTI-GFLATETLAIASSD-----VFKN 249
Query: 127 FCFGCVGS---TYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNISSPL 182
F FGC T+ G+ G GR +++PSQ K FS+C A P+ + L
Sbjct: 250 FLFGCSEESRGTFNGTTGLLGLGRSPIALPSQTTNKYKNLFSYCLPA-----SPSSTGHL 304
Query: 183 VIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGGL 242
G + + +P LK +Y L V +S+R + NG +
Sbjct: 305 SFGVEVSQAAKSTPISPKLKQ-LY----------------GLNTVGISVRGRELPINGSI 347
Query: 243 ---LVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVP-CPNNTFT 298
++DSGTT+T LP P YS L S + + Y F CY N T T
Sbjct: 348 SRTIIDSGTTFTFLPSPTYSALGSAFREMMANYTLTNGTSS---FQPCYDFSNIGNGTLT 404
Query: 299 DDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQ 358
P I+ F V + + + P N CL F D G +FG++Q
Sbjct: 405 ---IPGISIFFEGGVEVEI----DVSGIMIPVNGLKEVCLAFA--DTGSDSDFAIFGNYQ 455
Query: 359 QQNVEVVYDLEKERIGFQPMDC 380
Q+ EV+YD+ K +GF P C
Sbjct: 456 QKTYEVIYDVAKGMVGFAPKGC 477
>gi|212274713|ref|NP_001130791.1| uncharacterized protein LOC100191895 precursor [Zea mays]
gi|194690124|gb|ACF79146.1| unknown [Zea mays]
gi|194708040|gb|ACF88104.1| unknown [Zea mays]
gi|223950469|gb|ACN29318.1| unknown [Zea mays]
gi|414885521|tpg|DAA61535.1| TPA: hypothetical protein ZEAMMB73_650724 [Zea mays]
Length = 500
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 106/397 (26%), Positives = 161/397 (40%), Gaps = 75/397 (18%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSS----SSSRDTCASSFCLN 59
V +DT S+LTWV C C C D + + SPS ++ S S D
Sbjct: 156 VIVDTASELTWVQCA----PCESCHDQQGPLFDPSSSPSYAAVPCDSPSCDALQQQLATG 211
Query: 60 IHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPG 119
+ P D + CS +A +Y +G G+L D L + G
Sbjct: 212 AGAGAPPCDAGRPAACS---------------YALSYRDGSYSRGVLAHDRLSLAG---- 252
Query: 120 IIREIPKFCFGCVGSTYREPIG----IAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYAN 174
I F FGC S P G + G GR LS+ SQ G FS+C +
Sbjct: 253 --EVIDGFVFGCGTSNQGPPFGGTSGLMGLGRSQLSLVSQTVDQFGGVFSYCL---PLSR 307
Query: 175 DPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPN--------YYYIGLEAITIGNSSLTE 226
+ + S LV+GD + +++ TP++ + M N +Y + L IT+G +
Sbjct: 308 ESDASGSLVLGDDPSAYRNS---TPVVYTSMVSNSDPLLQGPFYLVNLTGITVGGQEVES 364
Query: 227 VPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGF-- 284
S R +VDSGT T L Y+ + + S + YP+A GF
Sbjct: 365 TGFSARA---------IVDSGTVITSLVPSVYNAVRAEFMSQLAEYPQAP------GFSI 409
Query: 285 -DLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSM 343
D C+ + + PS+T F + + G Y +S S+SS V CL S+
Sbjct: 410 LDTCFNM----TGLKEVQVPSLTLVFDGGAEVEVDSGGVLYFVS--SDSSQV-CLAVASL 462
Query: 344 DDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
D + + G++QQ+N+ VV+D ++GF C
Sbjct: 463 KSED--ETSIIGNYQQKNLRVVFDTSASQVGFAQETC 497
>gi|30683732|ref|NP_180371.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|28392898|gb|AAO41885.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|56382011|gb|AAV85724.1| At2g28040 [Arabidopsis thaliana]
gi|330252978|gb|AEC08072.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 395
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 97/387 (25%), Positives = 159/387 (41%), Gaps = 78/387 (20%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
I+ +DTGS+ W C C+ C N+ F PS+SS+ C + H
Sbjct: 78 IEAVLDTGSEHIWTQC----LPCVHC----YNQTAPIFDPSKSSTFKEIRC------DTH 123
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
P++ YG G L +T+ +H +S G
Sbjct: 124 DHSCPYE-------------------------LVYGGKSYTKGTLVTETVTIHSTS-GQP 157
Query: 122 REIPKFCFGC-VGSTYREP--IGIAGFGRGALSVPSQLGFLQKGF-SHCFLAFKYANDPN 177
+P+ GC ++ +P G+ G RG S+ +Q+G G S+CF
Sbjct: 158 FVMPETIIGCGRNNSGFKPGFAGVVGLDRGPKSLITQMGGEYPGLMSYCFAG-------K 210
Query: 178 ISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
+S + G AI + D + T + P +YY+ L+A+++GN+ + V
Sbjct: 211 GTSKINFGANAIVAGDGVVSTTVFVKTAKPGFYYLNLDAVSVGNTRIETVGTPFHAL--- 267
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTIT--YYPRAKEVEERTGFDLCYRVPCPNN 295
G +++DSG+T T+ PE + + + ++ +T +PR+ LCY
Sbjct: 268 -KGNIVIDSGSTLTYFPESYCNLVRKAVEQVVTAVRFPRSD--------ILCYY------ 312
Query: 296 TFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFG 355
+ T D+FP IT HF LVL + Y M SN+ V CL + +FG
Sbjct: 313 SKTIDIFPVITMHFSGGADLVLDK----YNMYVASNTGGVFCLAIICNSPIE---EAIFG 365
Query: 356 SFQQQNVEVVYDLEKERIGFQPMDCAS 382
+ Q N V YD + F+P +C++
Sbjct: 366 NRAQNNFLVGYDSSSLLVSFKPTNCSA 392
>gi|357163818|ref|XP_003579856.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 467
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 106/414 (25%), Positives = 162/414 (39%), Gaps = 88/414 (21%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFC--LNIHSS 63
+DT SDL W C C+ C +L F+P S+S + C S C L+ H
Sbjct: 105 IDTASDLIWTQCQ----PCVKC----YKQLDPVFNPVASTSYAVVPCNSDTCDELDTHRC 156
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
D C + Y+YG GIL D L + + R
Sbjct: 157 ARDGDSDDEDACQ---------------YTYSYGGNATTRGILAVDRLAIGDD---VFRG 198
Query: 124 IPKFCFGCVGSTYREP----IGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNIS 179
+ FGC S+ P G+ G GRGALS+ SQL + F Y P +S
Sbjct: 199 V---VFGCSSSSVGGPPPQVSGVVGLGRGALSLVSQLSVRR---------FMYCLPPPVS 246
Query: 180 SP---LVIGDVAISSKDNLQ---FTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLRE 233
LV+G A ++ N PM YP+YYY+ L+ I+IG+ +++ +
Sbjct: 247 RSAGRLVLGADAAATVRNASERVVVPMSTGSRYPSYYYLNLDGISIGDRAMSFRSRNRMN 306
Query: 234 FDSQGNG------------------------GLLVDSGTTYTHLPEPFYSQLLSILQSTI 269
+ G G+++D +T T L E Y +++ L+ I
Sbjct: 307 ATTPGTAAGAPASPVSGSGDGDGSGTGPDAYGMIIDIASTITFLEESLYEEMVDDLEEEI 366
Query: 270 TYYPRAKEVEERTGFDLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAP 329
PR + G DLC+ +P + P ++ F V L L + F
Sbjct: 367 RL-PRGSGSD--LGLDLCFILP-EGVPMSRVYAPPVSLAF-EGVWLRLDKEQMF----VE 417
Query: 330 SNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCAST 383
+S + CL+ D + G++QQQN++V+Y+L + RI F C S
Sbjct: 418 DRASGMMCLMVGKTDG-----VSILGNYQQQNMQVMYNLRRGRITFIKTACESV 466
>gi|115457772|ref|NP_001052486.1| Os04g0334700 [Oryza sativa Japonica Group]
gi|113564057|dbj|BAF14400.1| Os04g0334700 [Oryza sativa Japonica Group]
Length = 482
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 110/400 (27%), Positives = 169/400 (42%), Gaps = 72/400 (18%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDT-CASSFCLNIHS 62
V +DTGS WV C C + F RSS SS++ C + C
Sbjct: 98 VQLDTGSKAFWVN----GISCKQCPHESDILRKLTFYDPRSSVSSKEVKCDDTIC----- 148
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIR 122
+ P PC M+ CP + Y +GGL GIL D L H G +
Sbjct: 149 TSRP--PCNMT-------------LRCP-YITGYADGGLTMGILFTDLLHYH-QLYGNGQ 191
Query: 123 EIP---KFCFGC----VGSTYREPI---GIAGFGRGALSVPSQL---GFLQKGFSHCFLA 169
P FGC GS + GI GFG + SQL G +K FSHC
Sbjct: 192 TQPTSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCL-- 249
Query: 170 FKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPL 229
+ N IG+V + ++ TP++K+ Y+ + L++I + ++L ++P
Sbjct: 250 ----DSTNGGGIFAIGEVV---EPKVKTTPIVKNN--EVYHLVNLKSINVAGTTL-QLPA 299
Query: 230 SLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDL-CY 288
++ F + G +DSG+T +LPE YS+L+ + + + ++ ++ C+
Sbjct: 300 NI--FGTTKTKGTFIDSGSTLVYLPEIIYSELI------LAVFAKHPDITMGAMYNFQCF 351
Query: 289 RVPCPNNTFTDDLFPSITFHFLNNVSL-VLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGD 347
DD FP ITFHF N+++L V P + Y + N C FQ
Sbjct: 352 HFLGS----VDDKFPKITFHFENDLTLDVYP---YDYLLEYEGNQY---CFGFQDAGIHG 401
Query: 348 YGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASAQ 387
Y + G N VVYD+EK+ IG+ +C+S+ +
Sbjct: 402 YKDMIILGDMVISNKVVVYDMEKQAIGWTEHNCSSSVKIK 441
>gi|361066667|gb|AEW07645.1| Pinus taeda anonymous locus 0_7863_01 genomic sequence
gi|383134456|gb|AFG48207.1| Pinus taeda anonymous locus 0_7863_01 genomic sequence
gi|383134472|gb|AFG48215.1| Pinus taeda anonymous locus 0_7863_01 genomic sequence
gi|383134476|gb|AFG48217.1| Pinus taeda anonymous locus 0_7863_01 genomic sequence
gi|383134478|gb|AFG48218.1| Pinus taeda anonymous locus 0_7863_01 genomic sequence
gi|383134480|gb|AFG48219.1| Pinus taeda anonymous locus 0_7863_01 genomic sequence
gi|383134482|gb|AFG48220.1| Pinus taeda anonymous locus 0_7863_01 genomic sequence
gi|383134484|gb|AFG48221.1| Pinus taeda anonymous locus 0_7863_01 genomic sequence
Length = 136
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 51/111 (45%), Positives = 70/111 (63%), Gaps = 4/111 (3%)
Query: 217 ITIGNSSLTEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAK 276
ITIG L ++P SL FD +GNGGL+VDSGTT+T LPE Y Q+L+ L+S I Y R+
Sbjct: 1 ITIGGQRL-KLPSSLTTFDKEGNGGLIVDSGTTFTMLPESLYRQVLNKLKSAIR-YSRSV 58
Query: 277 EVEERTGFDLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMS 327
+ E G DLCY +P +F + P+ + HF +NV++ LP N+ MS
Sbjct: 59 KYEAALGLDLCYELPSAGGSFP--VLPTFSLHFKDNVTITLPAENYMSMMS 107
>gi|224068901|ref|XP_002326227.1| predicted protein [Populus trichocarpa]
gi|222833420|gb|EEE71897.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 105/390 (26%), Positives = 169/390 (43%), Gaps = 58/390 (14%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V +DTGSD+ WV C + C +C + NF S SSS++ S
Sbjct: 81 VQIDTGSDVLWVCCNS----CNNCPRTSGLGIQLNFFDSSSSSTAGQVRCS--------- 127
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLK---------VH 114
DP S + S+ C S+ + YG+G +G DTL +
Sbjct: 128 ----DPICTSAVQTTATQCSSQTDQC-SYTFQYGDGSGTSGYYVSDTLYFDAILGQSLID 182
Query: 115 GSSPGIIREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQL---GFLQKGFSHCFLAFK 171
SS I+ + G + T + GI GFG+G LSV SQL G + FSHC
Sbjct: 183 NSSALIVFGCSAYQSGDLTKTDKAVDGIFGFGQGELSVISQLSTRGITPRVFSHCL---- 238
Query: 172 YANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSL 231
D + LV+G++ + + ++P++ P P +Y + L +I + L P +
Sbjct: 239 -KGDGSGGGILVLGEIL---EPGIVYSPLV--PSQP-HYNLNLLSIAVNGQLLPIDPAAF 291
Query: 232 REFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVP 291
+SQG +VDSGTT +L Y +S + + ++ P + + + CY V
Sbjct: 292 ATSNSQGT---IVDSGTTLAYLVAEAYDPFVSAVNAIVS--PSVTPITSKG--NQCYLV- 343
Query: 292 CPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPS 351
+T +FP +F+F S+VL ++ + S SA+ C+ FQ +
Sbjct: 344 ---STSVSQMFPLASFNFAGGASMVLKPEDYLIPFGS-SGGSAMWCIGFQKVQG-----V 394
Query: 352 GVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
+ G ++ VYDL ++RIG+ DC+
Sbjct: 395 TILGDLVLKDKIFVYDLVRQRIGWANYDCS 424
>gi|224058947|ref|XP_002299658.1| predicted protein [Populus trichocarpa]
gi|222846916|gb|EEE84463.1| predicted protein [Populus trichocarpa]
Length = 451
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 102/391 (26%), Positives = 171/391 (43%), Gaps = 60/391 (15%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNF-SPSRSSSSSRDTCASSFC-LNIH 61
V +DTGSD+ WV C + C C + NF P S ++S +C+ C L +
Sbjct: 105 VQIDTGSDVLWVSCSS----CNGCPVSSGLHIPLNFFDPGSSPTASLISCSDQRCSLGLQ 160
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLK--------- 112
SSD S+ + C + + YG+G +G D L
Sbjct: 161 SSD-----------SVCAAQNNQC-----GYTFQYGDGSGTSGYYVSDLLHFDTILGGSV 204
Query: 113 VHGSSPGIIREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQL---GFLQKGFSHCFLA 169
+ SS I+ G + R GI GFG+ +SV SQL G + FSHC
Sbjct: 205 MKNSSAPIVFGCSTLQTGDLTKPDRAVDGIFGFGQQDMSVISQLASQGITPRVFSHCL-- 262
Query: 170 FKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPL 229
D + LV+G++ + N+ +TP++ P P +Y + L++I + +L P
Sbjct: 263 ---KGDDSGGGILVLGEIV---EPNIVYTPLV--PSQP-HYNLNLQSIYVNGQTLAIDP- 312
Query: 230 SLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYR 289
F + N G ++DSGTT +L E Y +S + ST++ P + + CY
Sbjct: 313 --SVFATSSNQGTIIDSGTTLAYLTEAAYDPFISAITSTVS--PSVSPYLSKG--NQCYL 366
Query: 290 VPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYG 349
++ +D+FP ++ +F S++L ++ S+ N +A+ C+ FQ + +
Sbjct: 367 ----TSSSINDVFPQVSLNFAGGTSMILIPQDYLIQQSS-INGAALWCVGFQKIQGQEI- 420
Query: 350 PSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
+ G ++ VYD+ +RIG+ DC
Sbjct: 421 --TILGDLVLKDKIFVYDIAGQRIGWANYDC 449
>gi|15228044|ref|NP_181826.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|20197868|gb|AAM15292.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197965|gb|AAD21712.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330255100|gb|AEC10194.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 527
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 112/394 (28%), Positives = 168/394 (42%), Gaps = 59/394 (14%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DTGSDL W+ C C DC + N + + P S+S TC C I S
Sbjct: 175 LILDTGSDLNWLQC----LPCYDC--FHQNGMF--YDPKTSASFKNITCNDPRCSLISSP 226
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVH------GSS 117
D P C S CP F Y YG+ TG +T V+ GSS
Sbjct: 227 DPPVQ-CESDNQS------------CPYF-YWYGDRSNTTGDFAVETFTVNLTTTEGGSS 272
Query: 118 PGIIREIPKFCFGC---VGSTYREPIGIAGFGRGALSVPSQLGFLQ-KGFSHCFLAFKYA 173
++ FGC + G+ G GRG LS SQL L FS+C +
Sbjct: 273 E---YKVGNMMFGCGHWNRGLFSGASGLLGLGRGPLSFSSQLQSLYGHSFSYCLV--DRN 327
Query: 174 NDPNISSPLVIG-DVAISSKDNLQFTPML--KSPMYPNYYYIGLEAITIGNSSLTEVPLS 230
++ N+SS L+ G D + + NL FT + K +YYI +++I +G +L ++P
Sbjct: 328 SNTNVSSKLIFGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILVGGKAL-DIPEE 386
Query: 231 LREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITY-YPRAKEVEERTGFDLCYR 289
S G+GG ++DSGTT ++ EP Y + + + YP ++ D C+
Sbjct: 387 TWNISSDGDGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYPIFRDFPV---LDPCFN 443
Query: 290 VPC--PNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGD 347
V NN P + F++ P N F +S + CL
Sbjct: 444 VSGIEENNIH----LPELGIAFVDGTVWNFPAENSFIWLSED-----LVCLAILGTPKST 494
Query: 348 YGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
+ + G++QQQN ++YD ++ R+GF P CA
Sbjct: 495 FS---IIGNYQQQNFHILYDTKRSRLGFTPTKCA 525
>gi|125540928|gb|EAY87323.1| hypothetical protein OsI_08727 [Oryza sativa Indica Group]
Length = 463
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 100/381 (26%), Positives = 152/381 (39%), Gaps = 63/381 (16%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V +DTGSD++WV C C + + + F P++SS+ +CA++ C +
Sbjct: 142 VTIDTGSDVSWVQCN----PCPNPPCHAQTGAL--FDPAKSSTYRAVSCAAAECAQLEQQ 195
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
N GC + C+ + YG+G G +RDTL + G+S
Sbjct: 196 GN--------GCGATNYE----CQ----YGVQYGDGSTTNGTYSRDTLTLSGAS----DA 235
Query: 124 IPKFCFGC--VGSTYREPI-GIAGFGRGALSVPSQLGFL-QKGFSHCFLAFKYANDPNIS 179
+ F FGC + S + + G+ G G GA S+ SQ FS+C P
Sbjct: 236 VKGFQFGCSHLESGFSDQTDGLMGLGGGAQSLVSQTAAAYGNSFSYCL-------PPTSG 288
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
S + T ML+S P +Y L+ I +G L P S
Sbjct: 289 SSGFLTLGGGGGASGFVTTRMLRSKQIPTFYGARLQDIAVGGKQLGLSP-------SVFA 341
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTD 299
G +VDSGT T LP YS L S ++ + Y + R+ D C+ T
Sbjct: 342 AGSVVDSGTIITRLPPTAYSALSSAFKAGMKQY---RSAPARSILDTCFDFAGQ----TQ 394
Query: 300 DLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQ 359
P++ F ++ L Y CL F + GD G +G+ G+ QQ
Sbjct: 395 ISIPTVALVFSGGAAIDLDPNGIMYG----------NCLAFAAT--GDDGTTGIIGNVQQ 442
Query: 360 QNVEVVYDLEKERIGFQPMDC 380
+ EV+YD+ +GF+ C
Sbjct: 443 RTFEVLYDVGSSTLGFRSGAC 463
>gi|242079449|ref|XP_002444493.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
gi|241940843|gb|EES13988.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
Length = 449
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 98/387 (25%), Positives = 161/387 (41%), Gaps = 49/387 (12%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DTGSDL W C LS + + P RSSS + C+ C S
Sbjct: 99 LIVDTGSDLIWTQCSMLSRRTRTAASASRQR-EPLYEPRRSSSFAYLPCSDRLCQEGQFS 157
Query: 64 DNPFDPCTMSG-CSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIR 122
+ C + C L Y E G G+L +T G+
Sbjct: 158 ---YKNCARNNRCMYDEL-------------YGSAEAG---GVLASETFTF-----GVNA 193
Query: 123 EIP-KFCFGCVGSTYREPIG---IAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNI 178
++ FGC + + +G + G G +S+ SQL + FS+C F
Sbjct: 194 KVSLPLGFGCGALSAGDLVGASGLMGLSPGIMSLVSQLSVPR--FSYCLTPFAERK---- 247
Query: 179 SSPLVIGDVA----ISSKDNLQFTPMLKSP-MYPNYYYIGLEAITIGNSSLTEVPLSLRE 233
+SPL+ G +A + +Q T +L++P M YYY+ L +++G L SL
Sbjct: 248 TSPLLFGAMADLRRYRTTGTVQTTSILRNPAMETAYYYVPLVGLSLGTKRLDVPATSLGM 307
Query: 234 FDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCP 293
G+GG +VDSG+T ++L E + + + + E+ ++LC+ +P
Sbjct: 308 IKPDGSGGTIVDSGSTMSYLEETAFRAVKKAVVEAVRLPVANGTDEDYDDYELCFALPT- 366
Query: 294 NNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGV 353
P + HF ++ LP+ N+F A + CL + DG +G S +
Sbjct: 367 GVAMEAVKTPPLVLHFDGGAAMTLPRDNYFQEPRA-----GLMCLAVGTSPDG-FGVS-I 419
Query: 354 FGSFQQQNVEVVYDLEKERIGFQPMDC 380
G+ QQQN+ V++D+ ++ F P C
Sbjct: 420 IGNVQQQNMHVLFDVRNQKFSFAPTKC 446
>gi|359494621|ref|XP_002265771.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 449
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 98/388 (25%), Positives = 178/388 (45%), Gaps = 54/388 (13%)
Query: 7 DTGSDLTWVPCGNLSFDCM--DCDDYRNNKLMSN--FSPSRSSSSSRDTCASSFCLNIHS 62
DTGSDLTW+ C + C +C + + ++ F + SSS C + C
Sbjct: 101 DTGSDLTWMSC---KYHCRSRNCSNRKARRIRHKRVFHANLSSSFKTIPCLTDMC----- 152
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIR 122
D +++ C T PC + Y Y +G G +T+ V G
Sbjct: 153 KIELMDLFSLTNCP-------TPLTPC-GYDYRYSDGSTALGFFANETVTVE-LKEGRKM 203
Query: 123 EIPKFCFGC----VGSTYREPIGIAGFG--RGALSVPSQLGFLQKGFSHCFLAFKYANDP 176
++ GC G +++ G+ G G + + ++ + F K FS+C + + +
Sbjct: 204 KLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGGK-FSYCLV--DHLSHK 260
Query: 177 NISSPLVIGDVAISSK--DNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREF 234
N+S+ L G +N+ +T ++ M ++Y + + I+IG + L ++P + +
Sbjct: 261 NVSNYLTFGSSRSKEALLNNMTYTELVLG-MVNSFYAVNMMGISIGGAML-KIPSEV--W 316
Query: 235 DSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTG-FDLCYRVPCP 293
D +G GG ++DSG++ T L EP Y +++ L+ ++ + ++VE G + C+
Sbjct: 317 DVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKF---RKVEMDIGPLEYCF----- 368
Query: 294 NNT-FTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSG 352
N+T F + L P + FHF + P + Y +SA + V+CL F S+ + +
Sbjct: 369 NSTGFEESLVPRLVFHFADGAEFEPPVKS--YVISA---ADGVRCLGFVSV---AWPGTS 420
Query: 353 VFGSFQQQNVEVVYDLEKERIGFQPMDC 380
V G+ QQN +DL +++GF P C
Sbjct: 421 VVGNIMQQNHLWEFDLGLKKLGFAPSSC 448
>gi|357130848|ref|XP_003567056.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 448
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 97/382 (25%), Positives = 150/382 (39%), Gaps = 52/382 (13%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DTGSD+ W+ C C+ C YR +L + P SS+ ++ C+ C N +
Sbjct: 114 LVIDTGSDVVWLQCK----PCVHC--YR--QLSPLYDPRGSSTYAQTPCSPPQCRNPQTC 165
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
D T GC + YG+ +G L D L +
Sbjct: 166 DG-----TTGGCG---------------YRIVYGDASSTSGNLATDRLVFSNDT-----S 200
Query: 124 IPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLG-FLQKGFSHCFLAFKYANDPNIS 179
+ GC + G+ G RG S +Q+ + F++C + S
Sbjct: 201 VGNVTLGCGHDNEGLFGSAAGLLGVARGNNSFATQVADSYGRYFAYCL--GDRTRSGSSS 258
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFD-SQG 238
S LV G A ++ FTP+ +P P+ YY+ + ++G +T + D + G
Sbjct: 259 SYLVFGRTAPEPPSSV-FTPLRSNPRRPSLYYVDMVGFSVGGEPVTGFSNASLSLDPATG 317
Query: 239 NGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFT 298
GG++VDSGT+ T Y L + K + FD CY +
Sbjct: 318 RGGVVVDSGTSITRFARDAYGALRDAFDARAAKVGMRKVGRGISVFDACYDL---RGVAV 374
Query: 299 DDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQ 358
D P + HF + LP N+ P S C ++ G G S V G+
Sbjct: 375 ADA-PGVVLHFAGGADVALPPENYL----VPEESGRYHCFALEAA--GHDGLS-VIGNVL 426
Query: 359 QQNVEVVYDLEKERIGFQPMDC 380
QQ VV+D+E ER+GF+P C
Sbjct: 427 QQRFRVVFDVENERVGFEPNGC 448
>gi|147814824|emb|CAN65806.1| hypothetical protein VITISV_015630 [Vitis vinifera]
Length = 449
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 98/388 (25%), Positives = 178/388 (45%), Gaps = 54/388 (13%)
Query: 7 DTGSDLTWVPCGNLSFDCM--DCDDYRNNKLMSN--FSPSRSSSSSRDTCASSFCLNIHS 62
DTGSDLTW+ C + C +C + + ++ F + SSS C + C
Sbjct: 101 DTGSDLTWMSC---KYHCRSRNCSNRKARRIRHKRVFHANLSSSFKTIPCLTDMC----- 152
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIR 122
D +++ C T PC + Y Y +G G +T+ V G
Sbjct: 153 KIELMDLFSLTNCP-------TPLTPC-GYDYRYSDGSTALGFFANETVTVE-LKEGRKM 203
Query: 123 EIPKFCFGC----VGSTYREPIGIAGFG--RGALSVPSQLGFLQKGFSHCFLAFKYANDP 176
++ GC G +++ G+ G G + + ++ + F K FS+C + + +
Sbjct: 204 KLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGGK-FSYCLV--DHLSHK 260
Query: 177 NISSPLVIGDVAISSK--DNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREF 234
N+S+ L G +N+ +T ++ M ++Y + + I+IG + L ++P + +
Sbjct: 261 NVSNYLTFGSSRSKEALLNNMTYTELVLG-MVNSFYAVNMMGISIGGAML-KIPSEV--W 316
Query: 235 DSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTG-FDLCYRVPCP 293
D +G GG ++DSG++ T L EP Y +++ L+ ++ + ++VE G + C+
Sbjct: 317 DVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKF---RKVEMDIGPLEYCF----- 368
Query: 294 NNT-FTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSG 352
N+T F + L P + FHF + P + Y +SA + V+CL F S+ + +
Sbjct: 369 NSTGFEESLVPRLVFHFADGAEFEPPVKS--YVISA---ADGVRCLGFVSV---AWPGTS 420
Query: 353 VFGSFQQQNVEVVYDLEKERIGFQPMDC 380
V G+ QQN +DL +++GF P C
Sbjct: 421 VVGNIMQQNHLWEFDLGLKKLGFAPSSC 448
>gi|15241713|ref|NP_195839.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75181297|sp|Q9LZL3.1|PCS1L_ARATH RecName: Full=Aspartic proteinase PCS1; AltName: Full=Aspartic
protease 38; Short=AtASP38; AltName: Full=Protein EMBRYO
DEFECTIVE 24; AltName: Full=Protein PROMOTION OF CELL
SURVIVAL 1; Flags: Precursor
gi|7340693|emb|CAB82992.1| putative protein [Arabidopsis thaliana]
gi|50897174|gb|AAT85726.1| At5g02190 [Arabidopsis thaliana]
gi|53828617|gb|AAU94418.1| At5g02190 [Arabidopsis thaliana]
gi|110742159|dbj|BAE99007.1| hypothetical protein [Arabidopsis thaliana]
gi|332003059|gb|AED90442.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 453
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 107/405 (26%), Positives = 168/405 (41%), Gaps = 56/405 (13%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
I + +DTGS+L+W+ C S N ++NF P+RSSS S C+S C
Sbjct: 86 ISMVIDTGSELSWLRCNRSS----------NPNPVNNFDPTRSSSYSPIPCSSPTC---- 131
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
D + C L +T +Y + G L + S+
Sbjct: 132 -RTRTRDFLIPASCDSDKLCHATL---------SYADASSSEGNLAAEIFHFGNST---- 177
Query: 122 REIPKFCFGCVGSTY-------REPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYAN 174
FGC+GS + G+ G RG+LS SQ+GF + FS+C +
Sbjct: 178 -NDSNLIFGCMGSVSGSDPEEDTKTTGLLGMNRGSLSFISQMGFPK--FSYCI-----SG 229
Query: 175 DPNISSPLVIGDVAISSKDNLQFTPMLK-SPMYPNY----YYIGLEAITIGNSSLTEVPL 229
+ L++GD + L +TP+++ S P + Y + L I + N L +P
Sbjct: 230 TDDFPGFLLLGDSNFTWLTPLNYTPLIRISTPLPYFDRVAYTVQLTGIKV-NGKLLPIPK 288
Query: 230 SLREFDSQGNGGLLVDSGTTYTHLPEPFYSQL----LSILQSTITYYPRAKEVEERTGFD 285
S+ D G G +VDSGT +T L P Y+ L L+ +T Y V + T D
Sbjct: 289 SVLVPDHTGAGQTMVDSGTQFTFLLGPVYTALRSHFLNRTNGILTVYEDPDFVFQGT-MD 347
Query: 286 LCYRV-PCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMD 344
LCYR+ P + P+++ F V Q + + +V C F + D
Sbjct: 348 LCYRISPVRIRSGILHRLPTVSLVFEGAEIAVSGQPLLYRVPHLTVGNDSVYCFTFGNSD 407
Query: 345 DGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASAQGL 389
+ V G QQN+ + +DL++ RIG P++C + G+
Sbjct: 408 LMGM-EAYVIGHHHQQNMWIEFDLQRSRIGLAPVECDVSGQRLGI 451
>gi|115489316|ref|NP_001067145.1| Os12g0583300 [Oryza sativa Japonica Group]
gi|77556903|gb|ABA99699.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113649652|dbj|BAF30164.1| Os12g0583300 [Oryza sativa Japonica Group]
gi|125537189|gb|EAY83677.1| hypothetical protein OsI_38901 [Oryza sativa Indica Group]
Length = 446
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 100/395 (25%), Positives = 159/395 (40%), Gaps = 68/395 (17%)
Query: 3 QVYMDTGSDLTWVPCGN-LSFDCM-DCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNI 60
+ +DTGSDL W C L C Y N+ S F+P CA+ C
Sbjct: 104 EALIDTGSDLVWTQCSTCLRKVCARQALPYYNSSASSTFAPV--------PCAARIC--- 152
Query: 61 HSSDNPFDPCTMS-GCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPG 119
++D+ C ++ GCS+ YG G+V G L + +
Sbjct: 153 AANDDIIHFCDLAAGCSVIA---------------GYG-AGVVAGTLGTEAFAFQSGTA- 195
Query: 120 IIREIPKFCFGCVGST------YREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYA 173
+ FGCV T G+ G GRG LS+ SQ G + FS+C + +
Sbjct: 196 ------ELAFGCVTFTRIVQGALHGASGLIGLGRGRLSLVSQTGATK--FSYCLTPY-FH 246
Query: 174 NDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLRE 233
N+ V ++ ++ T +K P +YY+ L +T+G T +P+
Sbjct: 247 NNGATGHLFVGASASLGGHGDVMTTQFVKGPKGSPFYYLPLIGLTVGE---TRLPIPATV 303
Query: 234 FDSQG------NGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLC 287
FD + +GG+++DSG+ +T L Y L S L + + A + G
Sbjct: 304 FDLREVAPGLFSGGVIIDSGSPFTSLVHDAYDALASELAARLNGSLVAPPPDADDG---- 359
Query: 288 YRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGD 347
C + P++ FHF + +P +++ AP + +A G
Sbjct: 360 --ALCVARRDVGRVVPAVVFHFRGGADMAVPAESYW----APVDKAAAC---MAIASAGP 410
Query: 348 YGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCAS 382
Y V G++QQQN+ V+YDL FQP DC++
Sbjct: 411 YRRQSVIGNYQQQNMRVLYDLANGDFSFQPADCSA 445
>gi|18390579|ref|NP_563751.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|332189782|gb|AEE27903.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 485
Score = 95.1 bits (235), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 107/404 (26%), Positives = 171/404 (42%), Gaps = 73/404 (18%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKL---MSNFSPSRSSSSSRDTCASSFCLNI 60
V +DTGSD+ WV C C C R + L ++ ++ S S +C FC I
Sbjct: 95 VQVDTGSDIMWVNC----IQCKQCP--RRSTLGIELTLYNIDESDSGKLVSCDDDFCYQI 148
Query: 61 HSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
S P +SGC + CP + YG+G G +D ++ + +
Sbjct: 149 --SGGP-----LSGCKANM--------SCP-YLEIYGDGSSTAGYFVKDVVQYDSVAGDL 192
Query: 121 IREIPK--FCFGC-------VGSTYREPI-GIAGFGRGALSVPSQL---GFLQKGFSHCF 167
+ FGC + S+ E + GI GFG+ S+ SQL G ++K F+HC
Sbjct: 193 KTQTANGSVIFGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCL 252
Query: 168 LAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPN--YYYIGLEAITIGNSSLT 225
+ N IG V + K N+ +P+ PN +Y + + A+ +G LT
Sbjct: 253 ------DGRNGGGIFAIGRV-VQPKVNM-------TPLVPNQPHYNVNMTAVQVGQEFLT 298
Query: 226 EVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFD 285
+P L F G ++DSGTT +LPE Y L+ + S ++ F
Sbjct: 299 -IPADL--FQPGDRKGAIIDSGTTLAYLPEIIYEPLVKKITSQEPALKVHIVDKDYKCFQ 355
Query: 286 LCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQ--SM 343
RV D+ FP++TFHF N+V L + ++ + + C+ +Q +M
Sbjct: 356 YSGRV--------DEGFPNVTFHFENSVFLRVYPHDYLFP------HEGMWCIGWQNSAM 401
Query: 344 DDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASAQ 387
D + G N V+YDLE + IG+ +C+S+ +
Sbjct: 402 QSRDRRNMTLLGDLVLSNKLVLYDLENQLIGWTEYNCSSSIKVK 445
>gi|356546036|ref|XP_003541438.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 486
Score = 95.1 bits (235), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 100/392 (25%), Positives = 168/392 (42%), Gaps = 60/392 (15%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V +DTGSD+ WV C C +C + NF DT SS I S
Sbjct: 93 VQIDTGSDILWVNCNT----CSNCPQSSQLGIELNF---------FDTVGSSTAALIPCS 139
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLK---VHGSSPGI 120
D P + G + + C S+ + YG+G +G D + + G P +
Sbjct: 140 D-PICTSRVQGAAAECSPRVNQC----SYTFQYGDGSGTSGYYVSDAMYFSLIMGQPPAV 194
Query: 121 IREIPKFCFGC-------VGSTYREPIGIAGFGRGALSVPSQL---GFLQKGFSHCFLAF 170
FGC + T + GI GFG G LSV SQL G K FSHC
Sbjct: 195 -NSSATIVFGCSISQSGDLTKTDKAVDGIFGFGPGPLSVVSQLSSRGITPKVFSHCLKGD 253
Query: 171 KYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLS 230
+ ++ + ++ ++P++ P P +Y + L++I + L P++
Sbjct: 254 GDGGGVLVLGEIL--------EPSIVYSPLV--PSQP-HYNLNLQSIAVNGQLL---PIN 299
Query: 231 LREFD-SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYR 289
F S GG +VD GTT +L + Y L++ + + ++ R + + + CY
Sbjct: 300 PAVFSISNNRGGTIVDCGTTLAYLIQEAYDPLVTAINTAVSQSAR----QTNSKGNQCYL 355
Query: 290 VPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYG 349
V +T D+FPS++ +F S+VL + + + + + + C+ FQ +G
Sbjct: 356 V----STSIGDIFPSVSLNFEGGASMVL-KPEQYLMHNGYLDGAEMWCIGFQKFQEG--- 407
Query: 350 PSGVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
+ + G ++ VVYD+ ++RIG+ DC+
Sbjct: 408 -ASILGDLVLKDKIVVYDIAQQRIGWANYDCS 438
>gi|225463766|ref|XP_002267930.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 479
Score = 95.1 bits (235), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 100/378 (26%), Positives = 161/378 (42%), Gaps = 58/378 (15%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+D+GSD+ WV C C C Y + + F P+ S+S + +C+SS C
Sbjct: 157 IDSGSDIVWVQCQ----PCTQC--YHQSDPV--FDPADSASFTGVSCSSSVC-------- 200
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
D +GC + CR + +YG+G G L +TL + ++R +
Sbjct: 201 --DRLENAGC------HAGRCR----YEVSYGDGSYTKGTLALETLTFGRT---MVRSVA 245
Query: 126 KFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNISSPLVI 184
C + G+ G G G++S QLG G FS+C + + + S LV
Sbjct: 246 IGCGHRNRGMFVGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLV----SRGTDSSGSLVF 301
Query: 185 GDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFD--SQGNGGL 242
G A+ + + P++++P P++YYIGL + +G VP+S F G+GG+
Sbjct: 302 GREALPA--GAAWVPLVRNPRAPSFYYIGLAGLGVGG---IRVPISEEVFRLTELGDGGV 356
Query: 243 LVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDLF 302
++D+GT T LP Y + PRA V FD CY + F
Sbjct: 357 VMDTGTAVTRLPTLAYQAFRDAFLAQTANLPRATGVAI---FDTCYDLL----GFVSVRV 409
Query: 303 PSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQNV 362
P+++F+F L LP N P + + C F G + G+ QQ+ +
Sbjct: 410 PTVSFYFSGGPILTLPARNFLI----PMDDAGTFCFAFAPSTSG----LSILGNIQQEGI 461
Query: 363 EVVYDLEKERIGFQPMDC 380
++ +D +GF P C
Sbjct: 462 QISFDGANGYVGFGPNIC 479
>gi|242084332|ref|XP_002442591.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
gi|241943284|gb|EES16429.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
Length = 493
Score = 94.7 bits (234), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 110/388 (28%), Positives = 157/388 (40%), Gaps = 58/388 (14%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
MDTGSD+TW+ C C C Y + + F P R S+S R+ + + D
Sbjct: 151 MDTGSDITWLQCQ----PCRRC--YPQSGPV--FDP-RHSTSYRE-------MGYDAPD- 193
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVT-GILTRDTLKVHGSSPGIIREI 124
C G S K C +A YG+ G T G +TL G ++
Sbjct: 194 ----CQALGRSGGGDAKRMTC----VYAVGYGDDGSTTVGDFIEETLTFAGGV-----QV 240
Query: 125 PKFCFGC----VGSTYREPIGIAGFGRGALSVPSQ---LGFLQKGFSHCFLAFKYANDP- 176
P GC G GI G GRG +S PSQ LG+ FS+C F + + P
Sbjct: 241 PHMSIGCGHDNKGLFAAPAAGILGLGRGQISCPSQIAALGYNVTSFSYCLADF-FLSSPG 299
Query: 177 -NISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYY---IGLEAITIGNSSLTEVPLSLR 232
++SS L IGD A + FTP +++ +YY +G+ + +TE L L
Sbjct: 300 RSVSSTLTIGDGAAAGSPPPSFTPTVQNLNMATFYYVRLVGVSVGGVRVPGVTEDDLKLD 359
Query: 233 EFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPC 292
+ G GG+++DSGT T L Y ++ + FD CY +
Sbjct: 360 PY--TGRGGVILDSGTAVTRLARRAYIAFRDAFRAAAVDLGQVSIGGPSGFFDTCYTMGG 417
Query: 293 PNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSG 352
P+++ HF V L LP N+ P +S C F D
Sbjct: 418 RAMK-----VPTVSMHFAGGVELTLPPKNYLI----PVDSMGTVCFAFAGTGDRSV---S 465
Query: 353 VFGSFQQQNVEVVYDLEKERIGFQPMDC 380
+ G+ QQQ VVY++ R+GF P C
Sbjct: 466 IIGNIQQQGFRVVYNIGGGRVGFAPNSC 493
>gi|222619890|gb|EEE56022.1| hypothetical protein OsJ_04800 [Oryza sativa Japonica Group]
Length = 423
Score = 94.7 bits (234), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 93/380 (24%), Positives = 145/380 (38%), Gaps = 74/380 (19%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V +D +D WVPC C C +FSP++SS+ C S C + S
Sbjct: 117 VAIDPSNDAAWVPCS----ACAGCAASS-----PSFSPTQSSTYRTVPCGSPQCAQVPSP 167
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
P + S+C F TY +L +D+L + +
Sbjct: 168 SCPAG------------VGSSC-----GFNLTYAASTF-QAVLGQDSLALENN------V 203
Query: 124 IPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSPLV 183
+ + FGC+ AG R L+ + +A + P
Sbjct: 204 VVSYTFGCLRVVNGNSRAAAGAHR-----------LRPRAALLLVADQGHLGP------- 245
Query: 184 IGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGGLL 243
I ++ TP+L +P P+ YY+ + I +G S + +VP S F+ G +
Sbjct: 246 -----IGQPKRIKTTPLLYNPHRPSLYYVNMIGIRVG-SKVVQVPQSALAFNPVTGSGTI 299
Query: 244 VDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDLFP 303
+D+GT +T L P Y+ + + + R GFD CY V P
Sbjct: 300 IDAGTMFTRLAAPVYAAVRDAFRGRV----RTPVAPPLGGFDTCYNV--------TVSVP 347
Query: 304 SITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQS-MDDGDYGPSGVFGSFQQQNV 362
++TF F V++ LP+ N S+S V CL + DG V S QQQN
Sbjct: 348 TVTFMFAGAVAVTLPEENVMIH----SSSGGVACLAMAAGPSDGVNAALNVLASMQQQNQ 403
Query: 363 EVVYDLEKERIGFQPMDCAS 382
V++D+ R+GF C +
Sbjct: 404 RVLFDVANGRVGFSRELCTA 423
>gi|356498789|ref|XP_003518231.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 446
Score = 94.7 bits (234), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 103/385 (26%), Positives = 153/385 (39%), Gaps = 63/385 (16%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
MDTGS LTWV C C C + + + F PS+SS+ S +C
Sbjct: 110 MDTGSSLTWVMC----HPCSSC----SQQSVPIFDPSKSSTYSNLSC------------- 148
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
S C+ ++ C ++ Y G GI R+ L + II+ +P
Sbjct: 149 -------SECNKCDVVNGEC-----PYSVEYVGSGSSQGIYAREQLTLETIDESIIK-VP 195
Query: 126 KFCFGC--------VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPN 177
FGC G Y+ G+ G G G S+ G K FS+C + N
Sbjct: 196 SLIFGCGRKFSISSNGYPYQGINGVFGLGSGRFSLLPSFG---KKFSYCIGNLRNTNYK- 251
Query: 178 ISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
+ LV+GD A D+ + YY+ LEAI+IG L P +
Sbjct: 252 -FNRLVLGDKANMQGDSTTLNVI------NGLYYVNLEAISIGGRKLDIDPTLFERSITD 304
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTF 297
N G+++DSG +T L + + L +++ + + ++ + LCY +
Sbjct: 305 NNSGVIIDSGADHTWLTKYGFEVLSFEVENLLEGVLVLAQQDKHNPYTLCY-----SGVV 359
Query: 298 TDDL--FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFG 355
+ DL FP +TFHF L L + F + N + L D DY G
Sbjct: 360 SQDLSGFPLVTFHFAEGAVLDLDVTSMF--IQTTENEFCMAMLPGNYFGD-DYESFSSIG 416
Query: 356 SFQQQNVEVVYDLEKERIGFQPMDC 380
QQN V YDL + R+ FQ +DC
Sbjct: 417 MLAQQNYNVGYDLNRMRVYFQRIDC 441
>gi|302141829|emb|CBI19032.3| unnamed protein product [Vitis vinifera]
Length = 382
Score = 94.7 bits (234), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 72/229 (31%), Positives = 116/229 (50%), Gaps = 24/229 (10%)
Query: 154 SQLGFLQKGFSHCFLAFKYANDPNISSPLVIGDVAISSKD--NLQFTPMLKSPMYPNYYY 211
SQLG + FS+C + N +S L+ G +A S+ + + TP++++P P+YYY
Sbjct: 173 SQLG--TQKFSYCLTSIH----ENKTSSLLFGSLAYSNFNPGKIPRTPLIQNPFLPSYYY 226
Query: 212 IGLEAITIGNSSLTEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITY 271
+ L+ IT+G +L +P + G+GG+++DSGTT T+L E + L + + I+
Sbjct: 227 LALKGITVG-YTLLPIPEFAFQLGKDGSGGMILDSGTTITYLQEDAFDVLKN---AFISQ 282
Query: 272 YPRAKEVEERTGFDLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSN 331
TG DLC+ +P N + P + FHF + L LP N Y +S P
Sbjct: 283 TELQVANSSTTGLDLCFHLPVKNA--AEVKVPKLIFHF-KGLDLALPVEN--YMVSDP-- 335
Query: 332 SSAVKCLLFQSMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
+ CL + G +FG+ QQQN+ V++DL+K + P C
Sbjct: 336 EMGLICLAIDAT-----GSLSIFGNIQQQNMLVLHDLKKSTLSLVPTQC 379
>gi|255553149|ref|XP_002517617.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223543249|gb|EEF44781.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus communis]
Length = 449
Score = 94.7 bits (234), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 115/391 (29%), Positives = 166/391 (42%), Gaps = 62/391 (15%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
I DTGSDL WV C C C Y+ N + F P RSSS C + FC
Sbjct: 106 ILAIADTGSDLIWVQCQ----PCEMC--YKQNSPI--FDPRRSSSYRNVLCGNEFC---- 153
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKV---HGSSP 118
N D S C +K TC + Y+YG+ G L + + + ++
Sbjct: 154 ---NKLDGEARS-CDARGFVK-TC-----GYTYSYGDQSFSDGHLAIERFGIGSTNSNTS 203
Query: 119 GIIREIPKFCFGC---VGSTYREPIGIAGFGRGA-LSVPSQLG-FLQKGFSHCFLAFKYA 173
I + FGC G T+ E G +S+ SQLG L FS+C + +
Sbjct: 204 AAIAYFQEVAFGCGTKNGGTFDELGSGIIGLGGGSMSLVSQLGPKLSGKFSYCLVP--TS 261
Query: 174 NDPNISSPLVIG-DVAISSKD-NLQFTPMLKSPMYPN-YYYIGLEAITIGNSSLTEVPLS 230
N +S + G D+ IS + N+ TP+L P P YYY+ LEAI++ N L L
Sbjct: 262 EQSNYTSKINFGNDINISGSNYNVVSTPLL--PKKPETYYYLTLEAISVENKRLPYTNLW 319
Query: 231 LREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTG-FDLCYR 289
E + G +++DSGTT T L F++ L S ++ + + + V + G F++C++
Sbjct: 320 NGEVEK---GNIIIDSGTTLTFLDSEFFNNLDSAVEEAV----KGERVSDPHGLFNICFK 372
Query: 290 VPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYG 349
P IT HF + L N F + L F + D
Sbjct: 373 DEKAIE------LPIITAHF-TGADVELQPVNTFAKVEE-------DLLCFTMIPSNDIA 418
Query: 350 PSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
+FG+ Q N V YDLEK+ + F P DC
Sbjct: 419 ---IFGNLAQMNFLVGYDLEKKAVSFLPTDC 446
>gi|356539352|ref|XP_003538162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 489
Score = 94.7 bits (234), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 106/392 (27%), Positives = 176/392 (44%), Gaps = 59/392 (15%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSN-FSPSRSSSSSRDTCASSFCLN-IH 61
V +DTGSD+ WV CG+ C C ++ N F P SS+SS +C C + +
Sbjct: 92 VQIDTGSDVLWVSCGS----CNGCPQTSGLQIQLNYFDPGSSSTSSLISCLDRRCRSGVQ 147
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
+SD + CS ++ C ++ + YG+G +G D + G +
Sbjct: 148 TSD--------ASCSG----RNNQC----TYTFQYGDGSGTSGYYVSDLMHFASIFEGTL 191
Query: 122 --REIPKFCFGC-------VGSTYREPIGIAGFGRGALSVPSQL---GFLQKGFSHCFLA 169
FGC + + R GI GFG+ +SV SQL G + FSHC
Sbjct: 192 TTNSSASVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSSQGIAPRVFSHCL-- 249
Query: 170 FKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPL 229
D + LV+G++ + N+ ++P++ P P +Y + L++I++ N + +
Sbjct: 250 ---KGDNSGGGVLVLGEIV---EPNIVYSPLV--PSQP-HYNLNLQSISV-NGQIVRIAP 299
Query: 230 SLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYR 289
S+ F + N G +VDSGTT +L E Y+ + + + I R+ V R + CY
Sbjct: 300 SV--FATSNNRGTIVDSGTTLAYLAEEAYNPFVIAIAAVIPQSVRS--VLSRG--NQCYL 353
Query: 290 VPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYG 349
+ +N D+FP ++ +F SLVL ++ + S V C+ FQ +
Sbjct: 354 ITTSSNV---DIFPQVSLNFAGGASLVLRPQDYLMQQNFIGEGS-VWCIGFQKISGQSI- 408
Query: 350 PSGVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
+ G ++ VYDL +RIG+ DC+
Sbjct: 409 --TILGDLVLKDKIFVYDLAGQRIGWANYDCS 438
>gi|297736090|emb|CBI24128.3| unnamed protein product [Vitis vinifera]
Length = 378
Score = 94.7 bits (234), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 96/387 (24%), Positives = 176/387 (45%), Gaps = 52/387 (13%)
Query: 7 DTGSDLTWVPCGNLSFDCM--DCDDYRNNKLMSN--FSPSRSSSSSRDTCASSFCLNIHS 62
DTGSDLTW+ C + C +C + + ++ F + SSS C + C
Sbjct: 30 DTGSDLTWMSC---KYHCRSRNCSNRKARRIRHKRVFHANLSSSFKTIPCLTDMC----- 81
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIR 122
D +++ C T PC + Y Y +G G +T+ V G
Sbjct: 82 KIELMDLFSLTNCP-------TPLTPC-GYDYRYSDGSTALGFFANETVTVE-LKEGRKM 132
Query: 123 EIPKFCFGCV----GSTYREPIGIAGFG--RGALSVPSQLGFLQKGFSHCFLAFKYANDP 176
++ GC G +++ G+ G G + + ++ + F K FS+C + + +
Sbjct: 133 KLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGGK-FSYCLV--DHLSHK 189
Query: 177 NISSPLVIGDVAISSK--DNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREF 234
N+S+ L G +N+ +T ++ M ++Y + + I+IG + L ++P + +
Sbjct: 190 NVSNYLTFGSSRSKEALLNNMTYTELVLG-MVNSFYAVNMMGISIGGAML-KIPSEV--W 245
Query: 235 DSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTG-FDLCYRVPCP 293
D +G GG ++DSG++ T L EP Y +++ L+ ++ + ++VE G + C+
Sbjct: 246 DVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKF---RKVEMDIGPLEYCFN---- 298
Query: 294 NNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGV 353
+ F + L P + FHF + P + Y +SA + V+CL F S+ + + V
Sbjct: 299 STGFEESLVPRLVFHFADGAEFEPPVKS--YVISA---ADGVRCLGFVSV---AWPGTSV 350
Query: 354 FGSFQQQNVEVVYDLEKERIGFQPMDC 380
G+ QQN +DL +++GF P C
Sbjct: 351 VGNIMQQNHLWEFDLGLKKLGFAPSSC 377
>gi|242081123|ref|XP_002445330.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
gi|241941680|gb|EES14825.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
Length = 543
Score = 94.7 bits (234), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 103/389 (26%), Positives = 159/389 (40%), Gaps = 61/389 (15%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCL-NI 60
+ V +DTGSDLTWV C C C R+ F P+ S++ + C +S C ++
Sbjct: 203 LTVIVDTGSDLTWVQCK----PCSACYAQRDPL----FDPAGSATYAAVRCNASACAASL 254
Query: 61 HSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
++ C C +A YG+G G+L DT+ + G+S
Sbjct: 255 KAATGTPGSCGGG--------NERC-----YYALAYGDGSFSRGVLATDTVALGGAS--- 298
Query: 121 IREIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDP 176
+ F FGC S + G+ G GR LS+ SQ G FS+C A
Sbjct: 299 ---LDGFVFGCGLSNRGLFGGTAGLMGLGRTELSLVSQTALRYGGVFSYCLPA---TTSG 352
Query: 177 NISSPLVIGDVAISSKDN--LQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREF 234
+ S L +G A S ++ + +T M+ P P +Y++ + +G ++L L
Sbjct: 353 DASGSLSLGGDASSYRNTTPVAYTRMIADPAQPPFYFLNVTGAAVGGTALAAQGL----- 407
Query: 235 DSQGNGGLLVDSGTTYTHLPEPFYSQLLS--ILQSTITYYPRAKEVEERTGFDLCYRVPC 292
G +L+DSGT T L Y + + Q YP A D CY +
Sbjct: 408 ---GASNVLIDSGTVITRLAPSVYRGVRAEFTRQFAAAGYPTAPGFSI---LDTCYDL-- 459
Query: 293 PNNTFTDDL-FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPS 351
T D++ P +T + + + + + S V CL S+ D P
Sbjct: 460 ---TGHDEVKVPLLTLRLEGGAEVTVDAAGMLFVVR--KDGSQV-CLAMASLSYEDQTP- 512
Query: 352 GVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
+ G++QQ+N VVYD R+GF DC
Sbjct: 513 -IIGNYQQKNKRVVYDTVGSRLGFADEDC 540
>gi|255565531|ref|XP_002523756.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223537060|gb|EEF38696.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 507
Score = 94.7 bits (234), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 102/397 (25%), Positives = 173/397 (43%), Gaps = 65/397 (16%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKL-MSNFSPSRSSSSSRDTCASSFC-LNIH 61
V +DTGSD+ WV C + C C ++ ++ F P S++++ +C+ C I
Sbjct: 99 VQIDTGSDVLWVSCSS----CNGCPVTSGLQIPLTFFDPGSSTTAALVSCSDQRCTAGIQ 154
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGS--SPG 119
SSD+ L S+ C + + YG+G +G D + + S G
Sbjct: 155 SSDS---------------LCSSRTNQC-GYTFQYGDGSGTSGYYVADLMHLDTLLLSSG 198
Query: 120 IIREI-----PKFCFGC-------VGSTYREPIGIAGFGRGALSVPSQL---GFLQKGFS 164
+ +I F C + + R GI GFG+ +SV SQL G + FS
Sbjct: 199 ELSQICQTYDSSVSFMCSTLQTGDLTKSDRAVDGIFGFGQQEMSVISQLASQGITPRVFS 258
Query: 165 HCFLAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSL 224
HC D + LV+G++ + N+ +TP++ S + N Y L++I++ +L
Sbjct: 259 HCL-----KGDDSGGGVLVLGEIV---EPNIVYTPLVPSQPHYNLY---LQSISVAGQTL 307
Query: 225 TEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGF 284
P F + N G +VDSGTT +L E Y +S + S ++ R +
Sbjct: 308 AIDP---SVFGASSNQGTIVDSGTTLAYLAEGAYDPFVSAITSVVSLNARTYLSKG---- 360
Query: 285 DLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMD 344
+ CY V N D+FP ++ +F SL+L ++ ++ +AV C+ FQ
Sbjct: 361 NQCYLVTSSVN----DVFPQVSLNFAGGASLILNPQDYLLQQNS-VGGAAVWCVGFQKTP 415
Query: 345 DGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
+ G ++ VYD+ +R+G+ DC+
Sbjct: 416 GQQI---TILGDLVLKDKIFVYDIANQRVGWTNYDCS 449
>gi|224109494|ref|XP_002315215.1| predicted protein [Populus trichocarpa]
gi|222864255|gb|EEF01386.1| predicted protein [Populus trichocarpa]
Length = 444
Score = 94.7 bits (234), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 110/408 (26%), Positives = 168/408 (41%), Gaps = 63/408 (15%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
I + +DTGS+L+W+ C + S F+P S + ++ C+S C
Sbjct: 80 ITMVLDTGSELSWLRCK------------KEPNFTSIFNPLASKTYTKIPCSSQTC-KTR 126
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCP--SFAYTYGEGGLVTGILTRDTLKVHGSSPG 119
+SD L C P F +Y + V G L +T + GS
Sbjct: 127 TSD---------------LTLPVTCDPAKLCHFIISYADASSVEGHLAFETFRF-GS--- 167
Query: 120 IIREIPKFCFGCVGSTYR-------EPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKY 172
+ R P FGC+ S + G+ G RG+LS +Q+GF + FS+C
Sbjct: 168 LTR--PATVFGCMDSGSSSNTEEDAKTTGLMGMNRGSLSFVNQMGF--RKFSYCISGL-- 221
Query: 173 ANDPNISSPLVIGDVAISSKDNLQFTPMLK-SPMYPNY----YYIGLEAITIGNSSLTEV 227
+ + L++G+ S L +TP+++ S P + Y + LE I + N L +
Sbjct: 222 ----DSTGFLLLGEARYSWLKPLNYTPLVQISTPLPYFDRVAYSVQLEGIKVNNKVL-PL 276
Query: 228 PLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLS--ILQST-ITYYPRAKEVEERTGF 284
P S+ D G G +VDSGT +T L P YS L +LQ+ + + +
Sbjct: 277 PKSVFVPDHTGAGQTMVDSGTQFTFLLGPVYSALRKEFLLQTAGVLRVLNEPQYVFQGAM 336
Query: 285 DLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMD 344
DLCY + ++T + P + F V Q + +V C F + D
Sbjct: 337 DLCYLIDSTSSTLPN--LPVVKLMFRGAEMSVSGQRLLYRVPGEVRGKDSVWCFTFGNSD 394
Query: 345 DGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASAQGLHKK 392
+ S + G QQQNV + YDLE RIGF + C GL K
Sbjct: 395 ELGIS-SFLIGHHQQQNVWMEYDLENSRIGFAELRCDLAGQRLGLDVK 441
>gi|356542694|ref|XP_003539801.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 489
Score = 94.4 bits (233), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 106/391 (27%), Positives = 175/391 (44%), Gaps = 57/391 (14%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSN-FSPSRSSSSSRDTCASSFCLNIHS 62
V +DTGSD+ WV CG+ C C ++ N F P SS+SS +C+ C
Sbjct: 92 VQIDTGSDVLWVSCGS----CNGCPQTSGLQIQLNYFDPRSSSTSSLISCSDRRC----- 142
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII- 121
SG S S+ C ++ + YG+G +G D + G G +
Sbjct: 143 ---------RSGVQTSDASCSSQNNQC-TYTFQYGDGSGTSGYYVSDLMHFAGIFEGTLT 192
Query: 122 -REIPKFCFGC-------VGSTYREPIGIAGFGRGALSVPSQL---GFLQKGFSHCFLAF 170
FGC + + R GI GFG+ +SV SQL G + FSHC
Sbjct: 193 TNSSASVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSLQGIAPRVFSHCL--- 249
Query: 171 KYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLS 230
D + LV+G++ + N+ ++P+++S +Y + L++I++ N + VP++
Sbjct: 250 --KGDNSGGGVLVLGEIV---EPNIVYSPLVQSQ---PHYNLNLQSISV-NGQI--VPIA 298
Query: 231 LREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRV 290
F + N G +VDSGTT +L E Y+ ++ + + + R+ V R + CY +
Sbjct: 299 PAVFATSNNRGTIVDSGTTLAYLAEEAYNPFVNAITALVPQSVRS--VLSRG--NQCYLI 354
Query: 291 PCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGP 350
+N D+FP ++ +F SLVL ++ + S V C+ FQ +
Sbjct: 355 TTSSNV---DIFPQVSLNFAGGASLVLRPQDYLMQQNYIGEGS-VWCIGFQRIPGQSI-- 408
Query: 351 SGVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
+ G ++ VYDL +RIG+ DC+
Sbjct: 409 -TILGDLVLKDKIFVYDLAGQRIGWANYDCS 438
>gi|30686482|ref|NP_850251.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|122215044|sp|Q3EBM5.1|ASPR1_ARATH RecName: Full=Probable aspartic protease At2g35615; Flags: Precursor
gi|330254036|gb|AEC09130.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 447
Score = 94.4 bits (233), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 118/397 (29%), Positives = 182/397 (45%), Gaps = 63/397 (15%)
Query: 2 IQVYM--DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLN 59
I+V+ DTGSDLTWV C C C Y+ N + F +SS+ + C S C
Sbjct: 96 IKVFAIADTGSDLTWVQCK----PCQQC--YKENGPI--FDKKKSSTYKSEPCDSRNCQA 147
Query: 60 IHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPG 119
+ S++ GC S + C+ + Y+YG+ G + +T+ + +S G
Sbjct: 148 LSSTER--------GCDES----NNICK----YRYSYGDQSFSKGDVATETVSIDSAS-G 190
Query: 120 IIREIPKFCFGC---VGSTYREPIGIAGFGRGA-LSVPSQLGF-LQKGFSHCFLAFKYAN 174
P FGC G T+ E G LS+ SQLG + K FS+C L+ K A
Sbjct: 191 SPVSFPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQLGSSISKKFSYC-LSHKSAT 249
Query: 175 DPNISSPLVIGDVAISS---KDN-LQFTPML-KSPMYPNYYYIGLEAITIGNSSLTEVPL 229
N +S + +G +I S KD+ + TP++ K P+ YYY+ LEAI++G +
Sbjct: 250 -TNGTSVINLGTNSIPSSLSKDSGVVSTPLVDKEPL--TYYYLTLEAISVGKKKIPYTGS 306
Query: 230 SLREFD----SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFD 285
S D S+ +G +++DSGTT T L F+ + S ++ ++T AK V + G
Sbjct: 307 SYNPNDDGILSETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESVT---GAKRVSDPQGL- 362
Query: 286 LCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDD 345
L + C + + P IT HF + L N F +S + CL +
Sbjct: 363 LSH---CFKSGSAEIGLPEITVHF-TGADVRLSPINAFVKLSED-----MVCLSMVPTTE 413
Query: 346 GDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCAS 382
++G+F Q + V YDLE + FQ MDC++
Sbjct: 414 -----VAIYGNFAQMDFLVGYDLETRTVSFQHMDCSA 445
>gi|356542355|ref|XP_003539632.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 444
Score = 94.4 bits (233), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 120/382 (31%), Positives = 176/382 (46%), Gaps = 58/382 (15%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGSD+ W+ C C DC N+ F PS+S + C+S+ C ++ S+
Sbjct: 111 VDTGSDIIWLQCQ----PCEDC----YNQTTPIFDPSQSKTYKTLPCSSNICQSVQSA-- 160
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
+ CS + C + TYG+ G L+ +TL + GS+ G + P
Sbjct: 161 -------ASCSSN---NDEC-----EYTITYGDNSHSQGDLSVETLTL-GSTDGSSVQFP 204
Query: 126 KFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNISS 180
K GC G+ RE GI G G G +S+ SQL G FS+C + N SS
Sbjct: 205 KTVIGCGHNNKGTFQREGSGIVGLGGGPVSLISQLSSSIGGKFSYCLAPL--FSQSNSSS 262
Query: 181 PLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNG 240
L GD A+ S TP++ +Y++ LEA ++G++ + S +GN
Sbjct: 263 KLNFGDEAVVSGRGTVSTPIVPKNGL-GFYFLTLEAFSVGDNRIEFGSSSFESSGGEGN- 320
Query: 241 GLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGF-DLCYRVPCPNNTFTD 299
+++DSGTT T LPE Y L S + I + VE+ + F LCYR T +D
Sbjct: 321 -IIIDSGTTLTILPEDDYLNLESAVADAI----ELERVEDPSKFLRLCYRT-----TSSD 370
Query: 300 DL-FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQ 358
+L P IT HF + L + F + V C F+S GP +FG+
Sbjct: 371 ELNVPVITAHF-KGADVELNPISTFIEVD-----EGVVCFAFRS---SKIGP--IFGNLA 419
Query: 359 QQNVEVVYDLEKERIGFQPMDC 380
QQN+ V YDL K+ + F+P DC
Sbjct: 420 QQNLLVGYDLVKQTVSFKPTDC 441
>gi|357125326|ref|XP_003564345.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 506
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 108/400 (27%), Positives = 161/400 (40%), Gaps = 75/400 (18%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRN-NKLMSNFSPSRSSSSSRDTCASSFCLNI-- 60
V +DTGSD+ WV C C C N + F+P SS+SSR C+ C
Sbjct: 104 VQIDTGSDILWVACS----PCTGCPTSSGLNIQLEFFNPDSSSTSSRIPCSDDRCTAALQ 159
Query: 61 ------HSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTL--- 111
SSD+P PC + +TYG+G +G DT+
Sbjct: 160 TGEAVCQSSDSPSSPC--------------------GYTFTYGDGSGTSGFYVSDTMYFD 199
Query: 112 KVHGSSPGIIREIPKFCFGCVGS-------TYREPIGIAGFGRGALSVPSQL---GFLQK 161
V G+ FGC S T R GI GFG+ LSV SQL G K
Sbjct: 200 TVMGNEQ-TANSSASVVFGCSNSQSGDLMKTDRAVDGIFGFGQHQLSVVSQLYSLGVSPK 258
Query: 162 GFSHCFLAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGN 221
FSHC N LV+G++ + L FTP++ P P +Y + LE+I +
Sbjct: 259 TFSHCL-----KGSDNGGGILVLGEIV---EPGLVFTPLV--PSQP-HYNLNLESIAVSG 307
Query: 222 SSLTEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEER 281
L P+ F + G +VDSGTT +L + Y ++ + + ++ R+ +
Sbjct: 308 QKL---PIDSSLFATSNTQGTIVDSGTTLVYLVDGAYDPFINAIAAAVSPSVRSVVSKGI 364
Query: 282 TGFDLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQ 341
F V D FP+ T +F VS+ + N+ + N + + C+ +Q
Sbjct: 365 QCFVTTSSV--------DSSFPTATLYFKGGVSMTVKPENYLLQQGSVDN-NVLWCIGWQ 415
Query: 342 SMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
+ G ++ VYDL R+G+ DC+
Sbjct: 416 RSQG-----ITILGDLVLKDKIFVYDLANMRMGWADYDCS 450
>gi|125533812|gb|EAY80360.1| hypothetical protein OsI_35532 [Oryza sativa Indica Group]
Length = 428
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 96/385 (24%), Positives = 158/385 (41%), Gaps = 68/385 (17%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCL----N 59
V +DTGS +WV C +CD N F SRS++ ++ +C +S CL +
Sbjct: 97 VEIDTGSSTSWVFC--------ECDGCHTNP--RTFLQSRSTTCAKVSCGTSMCLLGGSD 146
Query: 60 IHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPG 119
H D+ P CP F +Y +G GIL +DTL
Sbjct: 147 PHCQDSENYP------------------DCP-FRVSYQDGSASYGILYQDTLTFSD---- 183
Query: 120 IIREIPKFCFGCV-----GSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYAN 174
+++IP F FGC + + G+ G G G +SV Q GFS+C K
Sbjct: 184 -VQKIPSFTFGCNLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPRFDGFSYCLPLQKSER 242
Query: 175 D--PNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLR 232
+ +G VA ++ ++++T M+ +++ L AI++ L P
Sbjct: 243 GFFSKTTGYFSLGKVA--TRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIF- 299
Query: 233 EFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPC 292
G++ DSG+ +++P+ S L ++ + A+E ER +D+
Sbjct: 300 -----SRKGVVFDSGSELSYIPDRALSVLSQRIRELLLRRGAAEEESERNCYDM------ 348
Query: 293 PNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSG 352
+ + P+I+ HF + L G+H + V CL F +
Sbjct: 349 --RSVDEGDMPAISLHFDDGARFDL--GSHGVFVERSVQEQDVWCLAFAPTE-----SVS 399
Query: 353 VFGSFQQQNVEVVYDLEKERIGFQP 377
+ GS Q + EVVYDL+++ IG P
Sbjct: 400 IIGSLMQTSKEVVYDLKRQLIGIGP 424
>gi|312281631|dbj|BAJ33681.1| unnamed protein product [Thellungiella halophila]
Length = 502
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 101/377 (26%), Positives = 158/377 (41%), Gaps = 53/377 (14%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V +DTGSD+ W+ C C +C Y+ + + F P+ SS+ TC+ C ++
Sbjct: 179 VVLDTGSDVNWIQC----LPCSEC--YQQSDPI--FDPTSSSTFKSLTCSDPKCASLD-- 228
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
+S C +S C + +YG+G G DT+ S G + +
Sbjct: 229 --------VSAC------RSNKCL----YQVSYGDGSFTVGNYATDTVTFGES--GKVND 268
Query: 124 IPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSPLV 183
+ C + G+ G G GALS+ +Q+ K FS+C + D SS L
Sbjct: 269 VALGCGHDNEGLFTGAAGLLGLGGGALSMTNQIK--AKSFSYCLVD----RDSAKSSSLD 322
Query: 184 IGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGGLL 243
V I + D P+L++ +YY+GL ++G ++ +P SL E D+ G GG++
Sbjct: 323 FNSVQIGAGDAT--APLLRNSKMDTFYYVGLSGFSVGGQQVS-IPSSLFEVDASGAGGVI 379
Query: 244 VDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDLFP 303
+D GT T L Y+ L T + K + FD CY ++ + P
Sbjct: 380 LDCGTAVTRLQTQAYNSLRDAFVKLTTDFK--KGTSPISLFDTCYDF----SSLSTVKVP 433
Query: 304 SITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQNVE 363
++TFHF SL LP N+ P + + C F + G+ QQQ
Sbjct: 434 TVTFHFTGGKSLNLPAKNYLI----PIDDAGTFCFAFAPTS----SSLSIIGNVQQQGTR 485
Query: 364 VVYDLEKERIGFQPMDC 380
+ YDL IG C
Sbjct: 486 ITYDLANNLIGLSANKC 502
>gi|297840891|ref|XP_002888327.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297334168|gb|EFH64586.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 473
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 104/403 (25%), Positives = 170/403 (42%), Gaps = 75/403 (18%)
Query: 3 QVYMDTGSDLTWVPCGNLSFDCMDCDDYRN-NKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
V +DTGSD+ WV C C +C N N +S F + SS+S + C FC I
Sbjct: 88 HVQVDTGSDILWVNCK----PCPECPSKTNLNFHLSLFDVNASSTSKKVGCDDDFCSFIS 143
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCP--SFAYTYGEGGLVTGILTRDTLKVHGSSPG 119
SD+ C+P S+ Y + G RD L + + G
Sbjct: 144 QSDS--------------------CQPAVGCSYHIVYADESTSEGNFIRDKLTLEQVT-G 182
Query: 120 IIREIP---KFCFGC-------VGSTYREPIGIAGFGRGALSVPSQL---GFLQKGFSHC 166
++ P + FGC +G + G+ GFG+ SV SQL G ++ FSHC
Sbjct: 183 DLQTGPLGQEVVFGCGSDQSGQLGKSDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHC 242
Query: 167 FLAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTE 226
K I V + ++ TPM+ + M+ N +G++ + ++L
Sbjct: 243 LDNVKGGG---------IFAVGVVDSPKVKTTPMVPNQMHYNVMLMGMD---VDGTALDL 290
Query: 227 VPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDL 286
P +R NGG +VDSGTT + P+ Y S++++ + P + E T F
Sbjct: 291 PPSIMR------NGGTIVDSGTTLAYFPKVLYD---SLIETILARQPVKLHIVEDT-FQ- 339
Query: 287 CYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQS--MD 344
C+ + D FP ++F F ++V L + ++ + + + C +Q+ +
Sbjct: 340 CFSF----SENVDVAFPPVSFEFEDSVKLTVYPHDYLFTL-----EKELYCFGWQAGGLT 390
Query: 345 DGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASAQ 387
G+ + G N VVYDLE E IG+ +C+S+ +
Sbjct: 391 TGERTEVILLGDLVLSNKLVVYDLENEVIGWADHNCSSSIKIK 433
>gi|296084856|emb|CBI28265.3| unnamed protein product [Vitis vinifera]
Length = 446
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 68/196 (34%), Positives = 95/196 (48%), Gaps = 17/196 (8%)
Query: 1 VIQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNI 60
+ + MDTGSDL W PC + + C +C +N + F P SSSS C + C I
Sbjct: 102 TLPLIMDTGSDLVWFPCTH-RYVCRNCSFSTSNPSSNIFIPKSSSSSKVLGCVNPKCGWI 160
Query: 61 HSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
H S S C C + CP + YG G + GI+ +TL + G
Sbjct: 161 HGSK------VQSRCRDCEPTSPNCTQICPPYLVFYGSG-ITGGIMLSETLDLPG----- 208
Query: 121 IREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISS 180
+ +P F GC + +P GI+GFGRG S+PSQLG K FS+C L+ +Y +D SS
Sbjct: 209 -KGVPNFIVGCSVLSTSQPAGISGFGRGPPSLPSQLGL--KKFSYCLLSRRY-DDTTESS 264
Query: 181 PLVIGDVAISSKDNLQ 196
L+ VA + +Q
Sbjct: 265 SLIFELVAAEFEKQVQ 280
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 37/113 (32%), Positives = 50/113 (44%), Gaps = 16/113 (14%)
Query: 274 RAKEVEERTGFDLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSS 333
RA EVE TG C+ + + FP +T F + LP N+ +
Sbjct: 283 RATEVEGITGLRPCFNI----SGLNTPSFPELTLKFRGGAEMELPLANYVAFLGG----D 334
Query: 334 AVKCLLFQSMDDGDYG------PSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
V CL + DG G P+ + G+FQQQN V YDL ER+GF+ C
Sbjct: 335 DVVCLTI--VTDGAAGKEFSGGPAIILGNFQQQNFYVEYDLRNERLGFRQQSC 385
>gi|449486856|ref|XP_004157423.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 491
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 107/377 (28%), Positives = 155/377 (41%), Gaps = 55/377 (14%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DTGSD+ W+ C C DC Y+ + F P SSS + C S C + +S
Sbjct: 170 MVLDTGSDINWLQCQ----PCTDC--YQQTDPI--FDPRSSSSFASLPCESQQCQALETS 221
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
GC S L + +YG+G G +TL S G+I +
Sbjct: 222 ----------GCRASKCL----------YQVSYGDGSFTVGEFVTETLTFGNS--GMIND 259
Query: 124 IPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSPLV 183
+ C + G+ G G G LS+ SQ+ FS+C + D + SS L
Sbjct: 260 VAVGCGHDNEGLFVGSAGLLGLGGGPLSLTSQMK--ASSFSYCLVD----RDSSSSSDLE 313
Query: 184 IGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGGLL 243
A S N P+LKS +YY+GL +++G L +P +L + D G GG++
Sbjct: 314 FNSAAPSDSVN---APLLKSGKVDTFYYVGLTGMSVGGQ-LLSIPPNLFQMDDSGYGGII 369
Query: 244 VDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDLFP 303
VDSGT T L Y+ L ++ P K+ FD CY + + P
Sbjct: 370 VDSGTAITRLQTQAYNTLRDAF---VSRTPYLKKTNGFALFDTCYDLSSQSRV----TIP 422
Query: 304 SITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQNVE 363
+++F F SL LP N+ P +S C F + G+ QQQ
Sbjct: 423 TVSFEFAGGKSLQLPPKNYLI----PVDSVGTFCFAFAPTTSS----LSIIGNVQQQGTR 474
Query: 364 VVYDLEKERIGFQPMDC 380
V YDL +GF P C
Sbjct: 475 VHYDLANSVVGFSPHKC 491
>gi|413943688|gb|AFW76337.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
gi|413943689|gb|AFW76338.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
Length = 499
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 105/386 (27%), Positives = 153/386 (39%), Gaps = 71/386 (18%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V DTGSD TWV C C Y + F P++S++ + +C+SS+C +++
Sbjct: 176 VVFDTGSDTTWV-------QCQPCVAYCYRQKEPLFDPTKSATYANISCSSSYCSDLY-- 226
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
+SGCS L + YG+G G +DTL +
Sbjct: 227 --------VSGCSGGHCL----------YGIQYGDGSYTIGFYAQDTLTL------AYDT 262
Query: 124 IPKFCFGCVGSTYR----EPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNI 178
I F FGC G R G+ G GRG S+P Q G F++C P
Sbjct: 263 IKNFRFGC-GEKNRGLFGRAAGLLGLGRGKTSLPVQAYDKYGGVFAYCL--------PAT 313
Query: 179 SSPLVIGDVAISS-KDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
S+ D+ + N + TPML P +YY+G+ I +G L P+ F +
Sbjct: 314 SAGTGFLDLGPGAPAANARLTPMLVD-RGPTFYYVGMTGIKVGGHVL---PIPGSVFSTA 369
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTIT--YYPRAKEVEERTGFDLCYRVPCPNN 295
G LVDSGT T LP Y+ L S + Y A D CY + +
Sbjct: 370 GT---LVDSGTVITRLPPSAYAPLRSAFSKAMQGLGYSAAPAFSI---LDTCYDL--TGH 421
Query: 296 TFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQ-SMDDGDYGPSGVF 354
P+++ F L + Y + CL F + DD D +
Sbjct: 422 KGGSIALPAVSLVFQGGACLDVDASGILYVADV-----SQACLAFAPNADDTDV---AIV 473
Query: 355 GSFQQQNVEVVYDLEKERIGFQPMDC 380
G+ QQ+ V+YD+ K+ +GF P C
Sbjct: 474 GNTQQKTHGVLYDIGKKIVGFAPGAC 499
>gi|413921849|gb|AFW61781.1| hypothetical protein ZEAMMB73_702843 [Zea mays]
Length = 379
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 88/300 (29%), Positives = 128/300 (42%), Gaps = 50/300 (16%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
MDTGSDL W C C+ C D + F +S++ C SS C ++ S
Sbjct: 106 MDTGSDLIWTQCA----PCLLCAD----QPTPYFDVKKSATYRALPCRSSRCASLSS--- 154
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
+C + + Y YG+ G+L +T ++ +R
Sbjct: 155 -----------------PSCFKKMCVYQYYYGDTASTAGVLANETFTFGAANSTKVRAT- 196
Query: 126 KFCFGCVGSTYREPI----GIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSP 181
FGC GS + G+ GFGRG LS+ SQLG FS+C ++ A S
Sbjct: 197 NIAFGC-GSLNAGDLANSSGMVGFGRGPLSLVSQLG--PSRFSYCLTSYLSAT----PSR 249
Query: 182 LVIGDVAISSKDN------LQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFD 235
L G A S N +Q TP + +P PN Y++ L+AI++G L PL +
Sbjct: 250 LYFGVYANLSSTNTSSGSPVQSTPFVINPALPNMYFLSLKAISLGTKLLPIDPLVF-AIN 308
Query: 236 SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNN 295
G GG+++DSGT+ T L + Y + L S I P + G D C++ P P N
Sbjct: 309 DDGTGGVIIDSGTSITWLQQDAYEAVRRGLVSAI---PLTAMNDTDIGLDTCFQWPPPPN 365
>gi|297846526|ref|XP_002891144.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297336986|gb|EFH67403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 445
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 110/387 (28%), Positives = 171/387 (44%), Gaps = 59/387 (15%)
Query: 7 DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDNP 66
DTGSDLTWV C C C Y+ N + F +SS+ ++C S C + +
Sbjct: 103 DTGSDLTWVQCK----PCQQC--YKQNTPL--FDKKKSSTYKTESCDSITCNALSEHEE- 153
Query: 67 FDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIPK 126
GC S R + Y+YG+ G + +T+ + SS G P
Sbjct: 154 -------GCDES--------RNACKYRYSYGDESFTKGEVATETISIDSSS-GSPVSFPG 197
Query: 127 FCFGC---VGSTYREPIGIAGFGRGA-LSVPSQLGF-LQKGFSHCFLAFKYANDPNISSP 181
FGC G T+ E G LS+ SQLG + K FS+C L+ A N +S
Sbjct: 198 TAFGCGYNNGGTFEETGSGIIGLGGGPLSLVSQLGSSIGKKFSYC-LSHTSAT-TNGTSV 255
Query: 182 LVIGDVAISSKDNLQFTPMLKSPMY----PNYYYIGLEAITIGNSSLTEVP---LSLREF 234
+ +G +++SK + + + +L +P+ YY++ LEAIT+G + L SL
Sbjct: 256 INLGTNSMTSKPS-KDSAILTTPLIQKDPETYYFLTLEAITVGKTKLPYTGGGGYSLNR- 313
Query: 235 DSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPN 294
S+ G +++DSGTT T L FY ++++ ++T AK V + G C
Sbjct: 314 KSKKTGNIIIDSGTTLTLLDSGFYDDFGAVVEESVT---GAKRVSDPQGI----LTHCFK 366
Query: 295 NTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVF 354
+ + P+IT HF + L N F +S + CL + ++
Sbjct: 367 SGDKEIGLPTITMHF-TGADVKLSPINSFVKLSED-----IVCLSMIPTTE-----VAIY 415
Query: 355 GSFQQQNVEVVYDLEKERIGFQPMDCA 381
G+ Q + V YDLE + + FQ MDC+
Sbjct: 416 GNMVQMDFLVGYDLETKTVSFQRMDCS 442
>gi|293332735|ref|NP_001168472.1| uncharacterized protein LOC100382248 [Zea mays]
gi|223948487|gb|ACN28327.1| unknown [Zea mays]
Length = 434
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 106/386 (27%), Positives = 155/386 (40%), Gaps = 71/386 (18%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V DTGSD TWV C C YR + + F P++S++ + +C+SS+C +++
Sbjct: 111 VVFDTGSDTTWVQCQPCVAYC-----YRQKEPL--FDPTKSATYANISCSSSYCSDLY-- 161
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
+SGCS L + YG+G G +DTL +
Sbjct: 162 --------VSGCSGGHCL----------YGIQYGDGSYTIGFYAQDTLTL------AYDT 197
Query: 124 IPKFCFGCVGSTYR----EPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNI 178
I F FGC G R G+ G GRG S+P Q G F++C P
Sbjct: 198 IKNFRFGC-GEKNRGLFGRAAGLLGLGRGKTSLPVQAYDKYGGVFAYCL--------PAT 248
Query: 179 SSPLVIGDVAISS-KDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
S+ D+ + N + TPML P +YY+G+ I +G L P+ F +
Sbjct: 249 SAGTGFLDLGPGAPAANARLTPMLVD-RGPTFYYVGMTGIKVGGHVL---PIPGSVFSTA 304
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTIT--YYPRAKEVEERTGFDLCYRVPCPNN 295
G LVDSGT T LP Y+ L S + Y A D CY + +
Sbjct: 305 GT---LVDSGTVITRLPPSAYAPLRSAFSKAMQGLGYSAAPAFSI---LDTCYDL--TGH 356
Query: 296 TFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQ-SMDDGDYGPSGVF 354
P+++ F L + Y + CL F + DD D +
Sbjct: 357 KGGSIALPAVSLVFQGGACLDVDASGILYVADV-----SQACLAFAPNADDTDV---AIV 408
Query: 355 GSFQQQNVEVVYDLEKERIGFQPMDC 380
G+ QQ+ V+YD+ K+ +GF P C
Sbjct: 409 GNTQQKTHGVLYDIGKKIVGFAPGAC 434
>gi|413925432|gb|AFW65364.1| hypothetical protein ZEAMMB73_378208 [Zea mays]
Length = 418
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 106/391 (27%), Positives = 172/391 (43%), Gaps = 75/391 (19%)
Query: 1 VIQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNI 60
+ DTGSDL W CG C C + +++ P++SSS S+ C+S+ C +
Sbjct: 93 TLSALADTGSDLIWAKCGA----CKRCAP----RGSASYYPTKSSSFSKLPCSSALCRTL 144
Query: 61 HSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGG----LVTGILTRDTLKVHGS 116
S +++ C + + C S+ Y+YG G + +T +
Sbjct: 145 ESQ-------SLATCGGTRARGAVC-----SYRYSYGLSSNPHHYTQGYMGSETFTLGSD 192
Query: 117 SPGIIREIPKFCFGCV---GSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYA 173
+ + FGC Y G+ G GRG LS+ QL FS+C
Sbjct: 193 A------VQGIGFGCTTMSEGGYGSGSGLVGLGRGKLSLVRQLKV--GAFSYCL-----T 239
Query: 174 NDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLRE 233
+DP+ SSPL+ G A++ +Q TP++ +Y + L++I+IG + + P
Sbjct: 240 SDPSTSSPLLFGAGALTGP-GVQSTPLVNLKT-STFYTVNLDSISIGAA---KTP----- 289
Query: 234 FDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCP 293
G G++ DSGTT T L EP Y+ + L S T R + G+++C++
Sbjct: 290 --GTGRHGIIFDSGTTLTFLAEPAYTLAEAGLLSQTTNLTRVPGTD---GYEVCFQ---- 340
Query: 294 NNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPS-- 351
T +FPS+ HF + + L N+F A+ + +V C L Q PS
Sbjct: 341 --TSGGAVFPSMVLHF-DGGDMALKTENYFGAV-----NDSVSCWLVQK------SPSEM 386
Query: 352 GVFGSFQQQNVEVVYDLEKERIGFQPMDCAS 382
+ G+ Q + + YDL+K + FQP +C S
Sbjct: 387 SIVGNIMQMDYHIRYDLDKSVLSFQPTNCDS 417
>gi|42571079|ref|NP_973613.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|110737616|dbj|BAF00749.1| putative protease [Arabidopsis thaliana]
gi|330254187|gb|AEC09281.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 507
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 104/398 (26%), Positives = 174/398 (43%), Gaps = 72/398 (18%)
Query: 3 QVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKL-MSNFSPSRSSSSSRDTCASSFCLNIH 61
V +DTGSD+ WV C + C +C + + F S ++ TC+ C ++
Sbjct: 114 NVQIDTGSDILWVTCSS----CSNCPHSSGLGIDLHFFDAPGSLTAGSVTCSDPICSSVF 169
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTL---KVHGSSP 118
+ T + CS ++ C +++ YG+G +G DT + G S
Sbjct: 170 QT-------TAAQCS-----ENNQC----GYSFRYGDGSGTSGYYMTDTFYFDAILGESL 213
Query: 119 GIIREIPKFCFGCVGSTY---------REPIGIAGFGRGALSVPSQL---GFLQKGFSHC 166
P FGC STY + GI GFG+G LSV SQL G FSHC
Sbjct: 214 VANSSAP-IVFGC--STYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHC 270
Query: 167 FLAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTE 226
D + V+G++ + M+ SP+ P+ + L ++IG +
Sbjct: 271 L-----KGDGSGGGVFVLGEILVPG--------MVYSPLVPSQPHYNLNLLSIGVNG-QM 316
Query: 227 VPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYY--PRAKEVEERTGF 284
+PL F++ G +VD+GTT T+L + Y L+ + ++++ P E+
Sbjct: 317 LPLDAAVFEASNTRGTIVDTGTTLTYLVKEAYDLFLNAISNSVSQLVTPIISNGEQ---- 372
Query: 285 DLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVL-PQGNHFYAMSAPSNSSAVKCLLFQSM 343
CY V +T D+FPS++ +F S++L PQ F+ + +++ C+ FQ
Sbjct: 373 --CYLV----STSISDMFPSVSLNFAGGASMMLRPQDYLFHY--GIYDGASMWCIGFQKA 424
Query: 344 DDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
+ + G ++ VYDL ++RIG+ DC+
Sbjct: 425 PE----EQTILGDLVLKDKVFVYDLARQRIGWASYDCS 458
>gi|413944596|gb|AFW77245.1| hypothetical protein ZEAMMB73_545774 [Zea mays]
gi|414876929|tpg|DAA54060.1| TPA: hypothetical protein ZEAMMB73_875469 [Zea mays]
Length = 459
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 97/385 (25%), Positives = 163/385 (42%), Gaps = 77/385 (20%)
Query: 7 DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDNP 66
DTGSDL W CG C + ++ P+ SS+ ++ C+ C + S
Sbjct: 109 DTGSDLIWAKCGGA------CTTSCEPQGSPSYLPNASSTFAKLPCSDRLCSLLRSDSVA 162
Query: 67 FDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGG----LVTGILTRDTLKVHGSSPGIIR 122
+ C +G + Y+YG G G L R+T + +
Sbjct: 163 W--CAAAGAECD-------------YRYSYGLGDDDHHYTQGFLARETFTLGADA----- 202
Query: 123 EIPKFCFGCVGSTYREPIGIAGFG---RGALSVPSQLGFLQKGFSHCFLAFKYANDPNIS 179
+P FGC ++ +G RG LS+ SQL F +C +D + +
Sbjct: 203 -VPSVRFGCTTASEGGYGSGSGLVGLGRGPLSLVSQLN--ASTFMYCL-----TSDASKA 254
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
SPL+ G +A + +Q T +L S +Y + L +I+IG+++ + G
Sbjct: 255 SPLLFGSLASLTGAQVQSTGLLAST---TFYAVNLRSISIGSAT------------TPGV 299
Query: 240 G---GLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNT 296
G G++ DSGTT T+L EP YS+ + S + +VE+ GF+ C++ P N
Sbjct: 300 GEPEGVVFDSGTTLTYLAEPAYSEAKAAFLSQTSL----DQVEDTDGFEACFQKPA-NGR 354
Query: 297 FTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPS-GVFG 355
++ P++ HF + + LP N+ + V C + Q PS + G
Sbjct: 355 LSNAAVPTMVLHF-DGADMALPVANYVVEV-----EDGVVCWIVQR------SPSLSIIG 402
Query: 356 SFQQQNVEVVYDLEKERIGFQPMDC 380
+ Q N V++D+ + + FQP +C
Sbjct: 403 NIMQVNYLVLHDVHRSVLSFQPANC 427
>gi|147771308|emb|CAN69536.1| hypothetical protein VITISV_043237 [Vitis vinifera]
Length = 372
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 98/379 (25%), Positives = 151/379 (39%), Gaps = 64/379 (16%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
MDT SD+ W+PC C+ C N SP+ ++ S C ++ C +
Sbjct: 53 MDTSSDVAWIPCNG----CLGCSSTLFN------SPASTTYKSLG-CQAAQCKQVP---- 97
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
K TC SF TYG L L++DT+ + + +P
Sbjct: 98 ----------------KPTCGGGVCSFNLTYGGSSLAAN-LSQDTITLATDA------VP 134
Query: 126 KFCFGCV----GSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSP 181
+ FGC+ G + + + Q FS+C +FK N S
Sbjct: 135 GYSFGCIQKATGGSLPAQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLN---FSGS 191
Query: 182 LVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGG 241
L +G V +++TP+LK+P P+ Y++ L A+ +G + P S F+ G
Sbjct: 192 LRLGPVG--QPKRIKYTPLLKNPRRPSLYFVNLMAVRVGRRVVDVPPGSF-TFNPSTGAG 248
Query: 242 LLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDL 301
+ DSGT +T L P Y + ++ + R V GFD CY VP
Sbjct: 249 TIFDSGTVFTRLVTPAYIAVRDAFRNRVG---RNLTVTSLGGFDTCYTVPIAA------- 298
Query: 302 FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQN 361
P+ITF F +++ LP N +A S + CL + D V + QQQN
Sbjct: 299 -PTITFMF-TGMNVTLPPDNLLIHSTAGSTT----CLAMAAAPDNVNSVLNVIANLQQQN 352
Query: 362 VEVVYDLEKERIGFQPMDC 380
++YD+ R+G C
Sbjct: 353 HRLLYDVPNSRLGVARELC 371
>gi|255545620|ref|XP_002513870.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223546956|gb|EEF48453.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 535
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 107/401 (26%), Positives = 164/401 (40%), Gaps = 88/401 (21%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRN------NKLMSNFSPSRSSSSSRDTCASSFC 57
V +D GSDL+WVPC DC+ C ++ +S + PS S++S +C C
Sbjct: 117 VALDAGSDLSWVPC-----DCIQCAPLSASLYKPLDRDLSEYRPSLSTTSRHLSCNHQLC 171
Query: 58 -LNIHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHG- 115
L H + LK PCP A +G L D L +
Sbjct: 172 ELGSHCKN----------------LKD----PCPYIADYADPNTSSSGFLVEDILHLASV 211
Query: 116 ---SSPGIIREIPKFCFGCVGSTY------REPIGIAGFGRGALSVPSQL---GFLQKGF 163
S+ R GC P G+ G G G++SVPS L G ++K F
Sbjct: 212 SDDSNSTQKRVQASVILGCGRKQTGGYLDGAAPDGVMGLGPGSISVPSLLAKAGLIRKSF 271
Query: 164 SHCFLAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSS 223
S CF D N S ++ GD +S+ + TP+L + + Y I +E+ +GNS
Sbjct: 272 SLCF-------DVNGSGTILFGDQGHTSQKS---TPLLPTQGNYDAYLIEVESYCVGNSC 321
Query: 224 LTEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTG 283
L Q LVDSG ++T+LP Y++++ + A+ + + G
Sbjct: 322 L-----------KQSGFKALVDSGASFTYLPIDVYNKIVLEFDKQV----NAQRISSQGG 366
Query: 284 -FDLCYRVPCPNNTFTDDL--FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSS-AVKCLL 339
++ CY NT + L P++ FL N SL++ ++ P N AV CL
Sbjct: 367 PWNYCY------NTSSKQLDNVPAMRLSFLMNQSLLIHNSTYY----VPQNQEFAVFCLT 416
Query: 340 FQSMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
Q D G+ G VV+D+E ++G+ +C
Sbjct: 417 LQPTDLN----YGIIGQNYMTGYRVVFDMENLKLGWSSSNC 453
>gi|296087864|emb|CBI35120.3| unnamed protein product [Vitis vinifera]
Length = 320
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 98/379 (25%), Positives = 152/379 (40%), Gaps = 64/379 (16%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
MDT SD+ W+PC C+ C N SP+ ++ S C ++ C +
Sbjct: 1 MDTSSDVAWIPCNG----CLGCSSTLFN------SPASTTYKSLG-CQAAQCKQVP---- 45
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
K TC SF TYG L L++DT+ + + +P
Sbjct: 46 ----------------KPTCGGGVCSFNLTYGGSSLAAN-LSQDTITLATDA------VP 82
Query: 126 KFCFGCV----GSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSP 181
+ FGC+ G + + + Q FS+C +FK N S
Sbjct: 83 GYSFGCIQKATGGSLPAQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLN---FSGS 139
Query: 182 LVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGG 241
L +G V + +++TP+LK+P P+ Y++ L A+ +G + P S F+ G
Sbjct: 140 LRLGPVGQPKR--IKYTPLLKNPRRPSLYFVNLMAVRVGRRVVDVPPGSF-TFNPSTGAG 196
Query: 242 LLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDL 301
+ DSGT +T L P Y + ++ + R V GFD CY VP
Sbjct: 197 TIFDSGTVFTRLVTPAYIAVRDAFRNRVG---RNLTVTSLGGFDTCYTVPIAA------- 246
Query: 302 FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQN 361
P+ITF F +++ LP N +A S + CL + D V + QQQN
Sbjct: 247 -PTITFMF-TGMNVTLPPDNLLIHSTAGSTT----CLAMAAAPDNVNSVLNVIANLQQQN 300
Query: 362 VEVVYDLEKERIGFQPMDC 380
++YD+ R+G C
Sbjct: 301 HRLLYDVPNSRLGVARELC 319
>gi|326521034|dbj|BAJ92880.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 448
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 106/381 (27%), Positives = 160/381 (41%), Gaps = 62/381 (16%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DT +D W+PC C C + F+P+ S S C S C + N
Sbjct: 125 VDTSNDAAWIPCSG----CAGCPT------TTPFNPAASKSYRAVPCGSPAC---SRAPN 171
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
P CSL+T +C F+ TY + L L++D+L V +
Sbjct: 172 P-------SCSLNT---KSC-----GFSLTYADSSL-EAALSQDSLAVAND------VVK 209
Query: 126 KFCFGCVGS---TYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNISSP 181
+ FGC+ T P G+ G GRG LS SQ + +G FS+C +FK N S
Sbjct: 210 SYTFGCLQKATGTATPPQGLLGLGRGPLSFLSQTKDMYEGTFSYCLPSFKSLN---FSGT 266
Query: 182 LVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGG 241
L +G + ++ TP+L +P + YY+ + I +G + +P + FD G
Sbjct: 267 LRLGRKGQPLR--IKTTPLLVNPHRSSLYYVSMTGIRVGKK-VVPIPPAALAFDPATGAG 323
Query: 242 LLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDL 301
++DSGT +T L P Y + ++ I R + GFD CY N T
Sbjct: 324 TVLDSGTMFTRLVAPAYVAVRDEVRRRI----RGAPLSSLGGFDTCY-----NTTVK--- 371
Query: 302 FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQN 361
+P +TF F + + LP N + S CL + DG V S QQQN
Sbjct: 372 WPPVTFMF-TGMQVTLPADN----LVIHSTYGTTSCLAMAAAPDGVNTVLNVIASMQQQN 426
Query: 362 VEVVYDLEKERIGFQPMDCAS 382
+++D+ R+GF C +
Sbjct: 427 HRILFDVPNGRVGFAREQCTA 447
>gi|308081797|ref|NP_001182920.1| uncharacterized protein LOC100501208 [Zea mays]
gi|238008190|gb|ACR35130.1| unknown [Zea mays]
gi|413922182|gb|AFW62114.1| hypothetical protein ZEAMMB73_927324 [Zea mays]
Length = 269
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 73/263 (27%), Positives = 119/263 (45%), Gaps = 25/263 (9%)
Query: 127 FCFGC---VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSPLV 183
FGC T GI G G LSV QL + FS+C F + +SP++
Sbjct: 24 LTFGCGKLTNGTIAGASGIMGVSPGPLSVLKQLSITK--FSYCLTPFT----DHKTSPVM 77
Query: 184 IGDVAISSK----DNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
G +A K +Q P+LK+P+ YYY+ + I+IG+ L +VP ++ G
Sbjct: 78 FGAMADLGKYKTTGKVQTIPLLKNPVEDIYYYVPMVGISIGSKRL-DVPEAILALRPDGT 136
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTD 299
GG ++DS TT +L EP + +L + + + +++ + +C+ +P +
Sbjct: 137 GGTVLDSATTLAYLVEPAFKELKKAVMEGMKLPAANRSIDD---YPVCFELPR-GMSMEG 192
Query: 300 DLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQ 359
P + HF + + LP+ ++F S + CL M G V G+ QQ
Sbjct: 193 VQVPPLVLHFAGDAEMSLPRDSYFQ-----EPSPGMMCLAV--MQAPFEGAPNVIGNVQQ 245
Query: 360 QNVEVVYDLEKERIGFQPMDCAS 382
QN+ V+YDL + + P C S
Sbjct: 246 QNMHVLYDLGNRKFSYAPTKCDS 268
>gi|242093566|ref|XP_002437273.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
gi|241915496|gb|EER88640.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
Length = 503
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 104/384 (27%), Positives = 151/384 (39%), Gaps = 67/384 (17%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V DTGSD TWV C C Y + F+P++S++ + +C SS+C ++ +
Sbjct: 180 VVFDTGSDTTWV-------QCQPCVAYCYQQKEPLFTPTKSATYANISCTSSYCSDLDTR 232
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
GCS L +A YG+G G +DTL + +
Sbjct: 233 ----------GCSGGHCL----------YAVQYGDGSYTVGFYAQDTLTLGYDT------ 266
Query: 124 IPKFCFGCVGSTYR----EPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNI 178
+ F FGC G R + G+ G GRG SVP Q G F++C P
Sbjct: 267 VKDFRFGC-GEKNRGLFGKAAGLMGLGRGKTSVPVQAYDKYSGVFAYCI--------PAT 317
Query: 179 SSPLVIGD--VAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
SS D + N + TPML P +YY+G+ I +G L +P ++
Sbjct: 318 SSGTGFLDFGPGAPAAANARLTPMLVDNG-PTFYYVGMTGIKVGGH-LLSIPATVFS--- 372
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNT 296
+ G LVDSGT T LP Y L S + K + D CY + +
Sbjct: 373 --DAGALVDSGTVITRLPPSAYEPLRSAFAKGMEGLGY-KTAPAFSILDTCYDLTGYQGS 429
Query: 297 FTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGS 356
P+++ F L + Y CL F + DD + G+
Sbjct: 430 IA---LPAVSLVFQGGACLDVDASGILYVADVSQ-----ACLAFAANDDDT--DMTIVGN 479
Query: 357 FQQQNVEVVYDLEKERIGFQPMDC 380
QQ+ V+YDL K+ +GF P C
Sbjct: 480 TQQKTYSVLYDLGKKVVGFAPGAC 503
>gi|226492633|ref|NP_001149953.1| LOC100283580 precursor [Zea mays]
gi|195635701|gb|ACG37319.1| pepsin A [Zea mays]
Length = 491
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 110/402 (27%), Positives = 176/402 (43%), Gaps = 78/402 (19%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKL-MSNFSPSRSSSSSRDTCASSFCL-NIH 61
V +DTGSD+ WV C C C + ++ + P+ S ++ C FC+ N
Sbjct: 99 VQVDTGSDILWVNC----IRCDGCPTRSGLGIELTQYDPAGSGTTV--GCEQEFCVANSA 152
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPG-- 119
P P T S PC F TYG+G TG D ++ + S
Sbjct: 153 GGVPPTCPSTSS--------------PC-QFRITYGDGSTTTGFYVTDFVQYNQVSGNGQ 197
Query: 120 IIREIPKFCFGC-------VGSTYREPIGIAGFGRGALSVPSQLGF---LQKGFSHCFLA 169
FGC +GS+ + GI GFG+ S+ SQL ++K F+HC
Sbjct: 198 TTTSNASITFGCGAQLGGDLGSSNQALDGILGFGQSDSSMLSQLAAARRVRKIFAHCLDT 257
Query: 170 FKYANDPNISSPLVIGDVAISSKDNLQFTPMLKS-PMYPN--YYYIGLEAITIGNSSLTE 226
+ IG+V P +K+ P+ PN +Y + L+ I++G ++L +
Sbjct: 258 VRGGG------IFAIGNVV---------QPKVKTTPLVPNVTHYNVNLQGISVGGATL-Q 301
Query: 227 VPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFD- 285
+P S FDS + G ++DSGTT +LP Y LL+ + + + +++ D
Sbjct: 302 LPTS--TFDSGDSKGTIIDSGTTLAYLPREVYRTLLAAV------FDKYQDLPLHNYQDF 353
Query: 286 LCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDD 345
+C++ DD FP ITF F +++L + ++ + N + + C+ F +D
Sbjct: 354 VCFQFSGS----IDDGFPVITFSFKGDLTLNVYPDDYLF-----QNRNDLYCMGF--LDG 402
Query: 346 GDYGPSG----VFGSFQQQNVEVVYDLEKERIGFQPMDCAST 383
G G + G N VVYDLEKE IG+ +C+S+
Sbjct: 403 GVQTKDGKDMLLLGDLVLSNKLVVYDLEKEVIGWTDYNCSSS 444
>gi|42569679|ref|NP_181205.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|330254186|gb|AEC09280.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 512
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 104/397 (26%), Positives = 174/397 (43%), Gaps = 72/397 (18%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKL-MSNFSPSRSSSSSRDTCASSFCLNIHS 62
V +DTGSD+ WV C + C +C + + F S ++ TC+ C ++
Sbjct: 120 VQIDTGSDILWVTCSS----CSNCPHSSGLGIDLHFFDAPGSLTAGSVTCSDPICSSVFQ 175
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTL---KVHGSSPG 119
+ T + CS ++ C +++ YG+G +G DT + G S
Sbjct: 176 T-------TAAQCS-----ENNQC----GYSFRYGDGSGTSGYYMTDTFYFDAILGESLV 219
Query: 120 IIREIPKFCFGCVGSTY---------REPIGIAGFGRGALSVPSQL---GFLQKGFSHCF 167
P FGC STY + GI GFG+G LSV SQL G FSHC
Sbjct: 220 ANSSAP-IVFGC--STYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCL 276
Query: 168 LAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEV 227
D + V+G++ + M+ SP+ P+ + L ++IG + +
Sbjct: 277 -----KGDGSGGGVFVLGEILVPG--------MVYSPLVPSQPHYNLNLLSIGVNG-QML 322
Query: 228 PLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYY--PRAKEVEERTGFD 285
PL F++ G +VD+GTT T+L + Y L+ + ++++ P E+
Sbjct: 323 PLDAAVFEASNTRGTIVDTGTTLTYLVKEAYDLFLNAISNSVSQLVTPIISNGEQ----- 377
Query: 286 LCYRVPCPNNTFTDDLFPSITFHFLNNVSLVL-PQGNHFYAMSAPSNSSAVKCLLFQSMD 344
CY V +T D+FPS++ +F S++L PQ F+ + +++ C+ FQ
Sbjct: 378 -CYLV----STSISDMFPSVSLNFAGGASMMLRPQDYLFHY--GIYDGASMWCIGFQKAP 430
Query: 345 DGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
+ + G ++ VYDL ++RIG+ DC+
Sbjct: 431 E----EQTILGDLVLKDKVFVYDLARQRIGWASYDCS 463
>gi|356546378|ref|XP_003541603.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 439
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 110/388 (28%), Positives = 168/388 (43%), Gaps = 72/388 (18%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGSD+ W+ C C DC Y+ + F PS+S + C+S+ C ++ ++
Sbjct: 108 VDTGSDILWLQCE----PCEDC--YKQTTPI--FDPSKSKTYKTLPCSSNTCESLRNT-- 157
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
CS + + ++ YG+G G L+ +TL + GS+ G P
Sbjct: 158 --------ACSSDNVCE---------YSIDYGDGSHSDGDLSVETLTL-GSTDGSSVHFP 199
Query: 126 KFCFGC---VGSTYREPIGIAGFGRGALSVPSQLGFLQKG--FSHCFLAFKYANDPNISS 180
K GC G T++E G G FS+C ++ N SS
Sbjct: 200 KTVIGCGHNNGGTFQEEGSGIVGLGGGPVSLISQLSSSIGGKFSYCLAPI--FSESNSSS 257
Query: 181 PLVIGDVAISSKDNLQFTPMLKSPMYPN-YYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
L GD A+ S TP+ P+ +Y++ LEA ++G++ + E S G+
Sbjct: 258 KLNFGDAAVVSGRGTVSTPL--DPLNGQVFYFLTLEAFSVGDNRI-EFSGSSSSGSGSGD 314
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTD 299
G +++DSGTT T LP+ Y L S + I RA++ + LCY+ T +D
Sbjct: 315 GNIIIDSGTTLTLLPQEDYLNLESAVSDVIKL-ERARDPSKL--LSLCYK------TTSD 365
Query: 300 DL-FPSITFHF------LNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSG 352
+L P IT HF LN +S +P V C F S G
Sbjct: 366 ELDLPVITAHFKGADVELNPISTFVPV------------EKGVVCFAFISSKIG-----A 408
Query: 353 VFGSFQQQNVEVVYDLEKERIGFQPMDC 380
+FG+ QQN+ V YDL K+ + F+P DC
Sbjct: 409 IFGNLAQQNLLVGYDLVKKTVSFKPTDC 436
>gi|363808270|ref|NP_001242239.1| uncharacterized protein LOC100801883 [Glycine max]
gi|255641727|gb|ACU21134.1| unknown [Glycine max]
Length = 475
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 108/403 (26%), Positives = 173/403 (42%), Gaps = 70/403 (17%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKL---MSNFSPSRSSSSSRDTCASSFCLNI 60
V +DTGSD+ WV C +C C R + L ++ + P S +S +C FC
Sbjct: 85 VQVDTGSDILWVNC----VECSRCP--RKSDLGIDLTLYDPKGSETSDVVSCDQDFC--S 136
Query: 61 HSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
+ D P C KS PCP ++ TYG+G TG +D L + G
Sbjct: 137 ATFDGPIPGC-----------KSEI--PCP-YSITYGDGSATTGYYVQDYL-TYNRINGN 181
Query: 121 IREIPK---FCFGC-------VGSTYREPI-GIAGFGRGALSVPSQL---GFLQKGFSHC 166
+R P+ FGC +GS+ E + GI GFG+ SV SQL G ++K FSHC
Sbjct: 182 LRTSPQNSSIIFGCGAVQSGTLGSSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHC 241
Query: 167 FLAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTE 226
++ IG+V P + +Y + L++I + ++ + +
Sbjct: 242 L------DNVRGGGIFAIGEVVEPKVSTTPLVPRMA------HYNVVLKSIEV-DTDILQ 288
Query: 227 VPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDL 286
+P + FDS G ++DSGTT +LP+ Y +L +Q + P K F
Sbjct: 289 LPSDI--FDSVNGKGTVIDSGTTLAYLPDIVYDEL---IQKVLARQPGLKLYLVEQQFR- 342
Query: 287 CYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQ-SMDD 345
C+ D FP + HF +++SL + ++ + + C+ +Q S+
Sbjct: 343 CFLY----TGNVDRGFPVVKLHFKDSLSLTVYPHDYLFQF-----KDGIWCIGWQRSVAQ 393
Query: 346 GDYGPS-GVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASAQ 387
G + G N V+YDLE IG+ +C+S+ +
Sbjct: 394 TKNGKDMTLLGDLVLSNKLVIYDLENMVIGWTDYNCSSSIKVK 436
>gi|413953792|gb|AFW86441.1| hypothetical protein ZEAMMB73_342504 [Zea mays]
Length = 459
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 102/382 (26%), Positives = 160/382 (41%), Gaps = 62/382 (16%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DTGSDL+WV C C Y + F PSRSS+ + C + C ++
Sbjct: 135 LLIDTGSDLSWVQCA----PCNSTTCYPQKDPL--FDPSRSSTYAPIPCNTDACRDLTRD 188
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
D C+ + + C +A TYG+G TG+ + +TL + +PG+
Sbjct: 189 GYGSD------CTSGSGGGAQC-----GYAITYGDGSQTTGVYSNETLTM---APGVT-- 232
Query: 124 IPKFCFGCVGSTYREP----IGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNI 178
+ F FGC G P G+ G G S+ Q + G FS+C A AND
Sbjct: 233 VKDFHFGC-GHDQDGPNDKYDGLLGLGGAPESLVVQTSSVYGGAFSYCLPA---ANDQ-- 286
Query: 179 SSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQG 238
+ L +G ++ FTPM++ +Y + + IT+G + P +
Sbjct: 287 AGFLALG-APVNDASGFVFTPMVREQQ--TFYVVNMTGITVGGEPIDVPPSAF------- 336
Query: 239 NGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFT 298
+GG+++DSGT T L Y+ L + + + YP E D CY +N
Sbjct: 337 SGGMIIDSGTVVTELQHTAYAALQAAFRKAMAAYPLLPNGE----LDTCYNFTGHSNVTV 392
Query: 299 DDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQ 358
+ ++TF V L +P G CL FQ + G G+ G+
Sbjct: 393 PRV--ALTFSGGATVDLDVPDGILLD-----------NCLAFQ--EAGPDNQPGILGNVN 437
Query: 359 QQNVEVVYDLEKERIGFQPMDC 380
Q+ +EV+YD+ R+GF C
Sbjct: 438 QRTLEVLYDVGHGRVGFGADAC 459
>gi|449458736|ref|XP_004147103.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
gi|449518669|ref|XP_004166359.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 482
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 101/381 (26%), Positives = 160/381 (41%), Gaps = 64/381 (16%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+D+GSD+ WV C C C Y+ + + F P+ SSS + +C S C
Sbjct: 160 IDSGSDIVWVQCK----PCSRC--YQQSDPV--FDPADSSSFAGVSCGSDVC-------- 203
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
D +GC+ CR + +YG+G G L +TL V +IR++
Sbjct: 204 --DRLENTGCNAGR------CR----YEVSYGDGSYTKGTLALETLTV---GQVMIRDVA 248
Query: 126 KFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNIS---SP 181
C + G+ G G G++S QLG G FS+C ++ + +
Sbjct: 249 IGCGHTNQGMFIGAAGLLGLGGGSMSFIGQLGGQTGGAFSYCLVSRGTGSTGALEFGRGA 308
Query: 182 LVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGG 241
L +G IS ++++P P++YYIGL I +G ++ VP + G G
Sbjct: 309 LPVGATWIS---------LIRNPRAPSFYYIGLAGIGVGGVRVS-VPEETFQLTEYGTNG 358
Query: 242 LLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDL 301
+++D+GT T P Y + + PRA V FD CY + N F
Sbjct: 359 VVMDTGTAVTRFPTAAYVAFRDSFTAQTSNLPRAPGVSI---FDTCYDL----NGFESVR 411
Query: 302 FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSG--VFGSFQQ 359
P+++F+F + L LP N P + CL F PSG + G+ QQ
Sbjct: 412 VPTVSFYFSDGPVLTLPARNFLI----PVDGGGTFCLAFAP------SPSGLSIIGNIQQ 461
Query: 360 QNVEVVYDLEKERIGFQPMDC 380
+ +++ +D +GF P C
Sbjct: 462 EGIQISFDGANGFVGFGPNIC 482
>gi|357448247|ref|XP_003594399.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355483447|gb|AES64650.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 452
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 99/385 (25%), Positives = 158/385 (41%), Gaps = 63/385 (16%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGS +W+ C + C +D F+PS S + C+SS
Sbjct: 120 VDTGSSFSWLQCQPCTIYCHIQED-------PVFNPSASKTYKTVPCSSS---------- 162
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAY--TYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
S +TL + TC + + Y +YG+ G L++D L + S +
Sbjct: 163 -----QCSSLKSATLNEPTCSKQSNACVYKASYGDSSFSLGYLSQDVLTLTPS-----QT 212
Query: 124 IPKFCFGCVGST---YREPIGIAGFGRGALSVPSQL-GFLQKGFSHCF-LAFKYANDPNI 178
+ F +GC + GI G LS+ SQL G FS+C +F N P
Sbjct: 213 LSSFVYGCGQDNQGLFGRTDGIIGLANNELSMLSQLSGKYGNAFSYCLPTSFSTPNSPK- 271
Query: 179 SSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQG 238
L IG +++ + +FTP+LK+P P+ Y+I LE+IT+ L S +
Sbjct: 272 EGFLSIGTSSLTPSSSYKFTPLLKNPNNPSLYFIDLESITVAGRPLGVAASSYKV----- 326
Query: 239 NGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTG---FDLCYRVPCPNN 295
++DSGT T LP P Y+ L + + ++ K+ ++ G D C++
Sbjct: 327 --PTIIDSGTVITRLPTPVYTTLKNAYVTILS-----KKYQQAPGISLLDTCFKGSLAG- 378
Query: 296 TFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFG 355
++ P I F L L N + + + CL + G
Sbjct: 379 --ISEVAPDIRIIFKGGADLQLKGHNSLVEL-----ETGITCLAMAGSSS-----IAIIG 426
Query: 356 SFQQQNVEVVYDLEKERIGFQPMDC 380
++QQQ V+V YD+ R+GF P C
Sbjct: 427 NYQQQTVKVAYDVGNSRVGFAPGGC 451
>gi|223942467|gb|ACN25317.1| unknown [Zea mays]
gi|413936886|gb|AFW71437.1| pepsin A [Zea mays]
Length = 491
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 111/403 (27%), Positives = 177/403 (43%), Gaps = 80/403 (19%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKL-MSNFSPSRSSSSSRDTCASSFCL-NIH 61
V +DTGSD+ WV C C C + ++ + P+ S ++ C FC+ N
Sbjct: 99 VQVDTGSDILWVNC----IRCDGCPTRSGLGIELTQYDPAGSGTTV--GCEQEFCVANSA 152
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLK---VHGSSP 118
P P T S PC F TYG+G TG D ++ V G+
Sbjct: 153 GGVPPTCPSTSS--------------PC-QFRITYGDGSTTTGFYVTDFVQYNQVSGNGQ 197
Query: 119 GIIREIPKFCFGC-------VGSTYREPIGIAGFGRGALSVPSQLGF---LQKGFSHCFL 168
FGC +GS+ + GI GFG+ S+ SQL ++K F+HC
Sbjct: 198 TTTSN-ASITFGCGAQLGGDLGSSNQALDGILGFGQSDSSMLSQLAAARRVRKIFAHCLD 256
Query: 169 AFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKS-PMYPN--YYYIGLEAITIGNSSLT 225
+ IG+V P +K+ P+ PN +Y + L+ I++G ++L
Sbjct: 257 TVRGGG------IFAIGNVV---------QPKVKTTPLVPNVTHYNVNLQGISVGGATL- 300
Query: 226 EVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFD 285
++P S FDS + G ++DSGTT +LP Y LL+ + + + +++ D
Sbjct: 301 QLPTS--TFDSGDSKGTIIDSGTTLAYLPREVYRTLLAAV------FDKYQDLPLHNYQD 352
Query: 286 -LCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMD 344
+C++ DD FP ITF F +++L + ++ + N + + C+ F +D
Sbjct: 353 FVCFQFSGS----IDDGFPVITFSFEGDLTLNVYPDDYLF-----QNRNDLYCMGF--LD 401
Query: 345 DGDYGPSG----VFGSFQQQNVEVVYDLEKERIGFQPMDCAST 383
G G + G N VVYDLEKE IG+ +C+S+
Sbjct: 402 GGVQTKDGKDMLLLGDLVLSNKLVVYDLEKEVIGWTDYNCSSS 444
>gi|225465837|ref|XP_002264626.1| PREDICTED: aspartic proteinase nepenthesin-1-like isoform 1 [Vitis
vinifera]
Length = 437
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 97/373 (26%), Positives = 150/373 (40%), Gaps = 64/373 (17%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
MDT SD+ W+PC C+ C N SP+ ++ S C ++ C +
Sbjct: 118 MDTSSDVAWIPCNG----CLGCSSTLFN------SPASTTYKSLG-CQAAQCKQVP---- 162
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
K TC SF TYG L L++DT+ + + +P
Sbjct: 163 ----------------KPTCGGGVCSFNLTYGGSSLAAN-LSQDTITLATDA------VP 199
Query: 126 KFCFGCV----GSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSP 181
+ FGC+ G + + + Q FS+C +FK N S
Sbjct: 200 GYSFGCIQKATGGSLPAQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKSLN---FSGS 256
Query: 182 LVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGG 241
L +G V +++TP+LK+P P+ Y++ L A+ +G + P S F+ G
Sbjct: 257 LRLGPVG--QPKRIKYTPLLKNPRRPSLYFVNLMAVRVGRRVVDVPPGSF-TFNPSTGAG 313
Query: 242 LLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDL 301
+ DSGT +T L P Y + ++ + R V GFD CY VP
Sbjct: 314 TIFDSGTVFTRLVTPAYIAVRDAFRNRVG---RNLTVTSLGGFDTCYTVPI--------A 362
Query: 302 FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQN 361
P+ITF F +++ LP N +A S + CL + D V + QQQN
Sbjct: 363 APTITFMF-TGMNVTLPPDNLLIHSTAGSTT----CLAMAAAPDNVNSVLNVIANLQQQN 417
Query: 362 VEVVYDLEKERIG 374
++YD+ R+G
Sbjct: 418 HRLLYDVPNSRLG 430
>gi|115479815|ref|NP_001063501.1| Os09g0482200 [Oryza sativa Japonica Group]
gi|50725878|dbj|BAD33407.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50725881|dbj|BAD33410.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113631734|dbj|BAF25415.1| Os09g0482200 [Oryza sativa Japonica Group]
gi|125606112|gb|EAZ45148.1| hypothetical protein OsJ_29786 [Oryza sativa Japonica Group]
Length = 485
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 101/382 (26%), Positives = 151/382 (39%), Gaps = 64/382 (16%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V DTGSDL+WV C C DC Y + F PS SS+ + C + C + +S
Sbjct: 164 VIFDTGSDLSWVQCK----PCADC--YEQQDPL--FDPSLSSTYAAVACGAPECQELDAS 215
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
GCS + CR + YG+ G L RDTL + S
Sbjct: 216 ----------GCS-----SDSRCR----YEVQYGDQSQTDGNLVRDTLTLSASD-----T 251
Query: 124 IPKFCFGCV---GSTYREPIGIAGFGRGALSVPSQLG-FLQKGFSHCFLAFKYANDPNIS 179
+P F FGC + + G+ G GR +S+PSQ GF++C P+ S
Sbjct: 252 LPGFVFGCGDQNAGLFGQVDGLFGLGREKVSLPSQGAPSYGPGFTYCL--------PSSS 303
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
S + + N QFT L P++YYI L I +G ++ +P +
Sbjct: 304 SGRGYLSLGGAPPANAQFT-ALADGATPSFYYIDLVGIKVGGRAI-RIPATAFAAAGG-- 359
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTD 299
++DSGT T LP Y+ L + ++ Y +A + D CY
Sbjct: 360 --TVIDSGTVITRLPPRAYAPLRAAFARSMAQYKKAPALSI---LDTCYDF----TGHRT 410
Query: 300 DLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQ 359
P++ F ++ L Y + + CL F + D + G+ QQ
Sbjct: 411 AQIPTVELAFAGGATVSLDFTGVLYV-----SKVSQACLAF--APNADDSSIAILGNTQQ 463
Query: 360 QNVEVVYDLEKERIGFQPMDCA 381
+ V YD+ +RIGF C+
Sbjct: 464 KTFAVAYDVANQRIGFGAKGCS 485
>gi|297848856|ref|XP_002892309.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297338151|gb|EFH68568.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 484
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 106/404 (26%), Positives = 170/404 (42%), Gaps = 73/404 (18%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKL---MSNFSPSRSSSSSRDTCASSFCLNI 60
V +DTGSD+ WV C C C R + L ++ ++ S S +C FC I
Sbjct: 95 VQVDTGSDIMWVNC----IQCKQCP--RRSTLGIELTLYNIDESDSGKLVSCDDDFCYQI 148
Query: 61 HSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
S P +SGC + CP + YG+G G +D ++ + +
Sbjct: 149 --SGGP-----LSGCKANM--------SCP-YLEIYGDGSSTAGYFVKDVVQYDSVAGDL 192
Query: 121 IREIPK--FCFGC-------VGSTYREPI-GIAGFGRGALSVPSQL---GFLQKGFSHCF 167
+ FGC + S+ E + GI GFG+ S+ SQL G ++K F+HC
Sbjct: 193 KTQTANGSVIFGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCL 252
Query: 168 LAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPN--YYYIGLEAITIGNSSLT 225
+ N IG V + K N+ +P+ PN +Y + + A+ +G L
Sbjct: 253 ------DGRNGGGIFAIGRV-VQPKVNM-------TPLVPNQPHYNVNMTAVQVGQEFLN 298
Query: 226 EVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFD 285
+P L F G ++DSGTT +LPE Y L+ + S ++ F
Sbjct: 299 -IPADL--FQPGDRKGAIIDSGTTLAYLPEIIYEPLVKKITSQEPALKVHIVDKDYKCFQ 355
Query: 286 LCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQ--SM 343
RV D+ FP++TFHF N+V L + ++ + + C+ +Q +M
Sbjct: 356 YSGRV--------DEGFPNVTFHFENSVFLRVYPHDYLFPY------EGMWCIGWQNSAM 401
Query: 344 DDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASAQ 387
D + G N V+YDLE + IG+ +C+S+ +
Sbjct: 402 QSRDRRNMTLLGDLVLSNKLVLYDLENQLIGWTEYNCSSSIKVK 445
>gi|4415912|gb|AAD20143.1| putative protease [Arabidopsis thaliana]
Length = 469
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 104/396 (26%), Positives = 173/396 (43%), Gaps = 72/396 (18%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKL-MSNFSPSRSSSSSRDTCASSFCLNIHS 62
V +DTGSD+ WV C + C +C + + F S ++ TC+ C ++
Sbjct: 115 VQIDTGSDILWVTCSS----CSNCPHSSGLGIDLHFFDAPGSLTAGSVTCSDPICSSVFQ 170
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTL---KVHGSSPG 119
+ T + CS ++ C +++ YG+G +G DT + G S
Sbjct: 171 T-------TAAQCS-----ENNQC----GYSFRYGDGSGTSGYYMTDTFYFDAILGESLV 214
Query: 120 IIREIPKFCFGCVGSTY---------REPIGIAGFGRGALSVPSQL---GFLQKGFSHCF 167
P FGC STY + GI GFG+G LSV SQL G FSHC
Sbjct: 215 ANSSAP-IVFGC--STYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCL 271
Query: 168 LAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEV 227
D + V+G++ + M+ SP+ P+ + L ++IG + +
Sbjct: 272 -----KGDGSGGGVFVLGEILVPG--------MVYSPLVPSQPHYNLNLLSIGVNG-QML 317
Query: 228 PLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYY--PRAKEVEERTGFD 285
PL F++ G +VD+GTT T+L + Y L+ + ++++ P E+
Sbjct: 318 PLDAAVFEASNTRGTIVDTGTTLTYLVKEAYDLFLNAISNSVSQLVTPIISNGEQ----- 372
Query: 286 LCYRVPCPNNTFTDDLFPSITFHFLNNVSLVL-PQGNHFYAMSAPSNSSAVKCLLFQSMD 344
CY V +T D+FPS++ +F S++L PQ F+ + +++ C+ FQ
Sbjct: 373 -CYLV----STSISDMFPSVSLNFAGGASMMLRPQDYLFHY--GIYDGASMWCIGFQKAP 425
Query: 345 DGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
+ + G ++ VYDL ++RIG+ DC
Sbjct: 426 E----EQTILGDLVLKDKVFVYDLARQRIGWASYDC 457
>gi|300681439|emb|CBH32531.1| hypothetical protein TAA_ctg0091b.00060.1 [Triticum aestivum]
Length = 426
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 83/298 (27%), Positives = 142/298 (47%), Gaps = 30/298 (10%)
Query: 89 CPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIPKFCFGC-VGSTYREPI----GIA 143
CP +AY YG G TG ++ + + G+ + FGC + ST P+ G+
Sbjct: 138 CP-YAYQYGPGISTTGYISAEEVTAVGT-----HITGRALFGCSLASTV--PLDGESGVL 189
Query: 144 GFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKS 203
GF RG S+ SQL + FS+ F+ A+ P+ S L++GD A+ ++ + TP+L++
Sbjct: 190 GFSRGPYSLLSQLKISR--FSY-FMLPDDADKPDSESVLLLGDDAVPQTNSSRSTPLLRN 246
Query: 204 PMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQG-NGGLLVDSGTTYTHLPEPFYSQLL 262
YP+ YY+ L I + + SL+ +P + + G +GG+++ + + T+L Y+ L
Sbjct: 247 EAYPDLYYVKLTGIKVDDKSLSGIPAGTFDLAANGCSGGVVMSTLSPITYLQPAAYNALT 306
Query: 263 SILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDLFPSIT--FHFLNN--VSLVLP 318
L S I P + ++ LCY + + + FP IT FH ++ + L
Sbjct: 307 RALASKIKSQPVRPKADDVADLRLCYNI----QSVANLTFPKITLVFHGVDGRPAPMELT 362
Query: 319 QGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQ 376
++F NS+ ++CL G S V GS Q ++YDL + F+
Sbjct: 363 TAHYFIR----ENSTGLQCLTMLPTPAGS-PVSSVLGSLLQTGTHMIYDLRGGSLTFE 415
>gi|217074470|gb|ACJ85595.1| unknown [Medicago truncatula]
gi|388505166|gb|AFK40649.1| unknown [Medicago truncatula]
Length = 452
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 99/385 (25%), Positives = 158/385 (41%), Gaps = 63/385 (16%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGS +W+ C + C +D F+PS S + C+SS
Sbjct: 120 VDTGSSFSWLQCQPCTIYCHIQED-------PVFNPSASKTYKTVPCSSS---------- 162
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAY--TYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
S +TL + TC + + Y +YG+ G L++D L + S +
Sbjct: 163 -----QCSSLKSATLNEPTCSKQSNACVYKASYGDSSFSLGYLSQDVLTLTPS-----QT 212
Query: 124 IPKFCFGCVGST---YREPIGIAGFGRGALSVPSQL-GFLQKGFSHCF-LAFKYANDPNI 178
+ F +GC + GI G LS+ SQL G FS+C +F N P
Sbjct: 213 LSSFVYGCGQDNQGLFGRTDGIIGLANNELSMLSQLSGKYGNAFSYCLPTSFSTPNSPK- 271
Query: 179 SSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQG 238
L IG +++ + +FTP+LK+P P+ Y+I LE+IT+ L S +
Sbjct: 272 EGFLSIGTSSLTPSSSYKFTPLLKNPNNPSLYFIDLESITVAGRPLGVAASSYKV----- 326
Query: 239 NGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTG---FDLCYRVPCPNN 295
++DSGT T LP P Y+ L + + ++ K+ ++ G D C++
Sbjct: 327 --PTIIDSGTVITRLPTPVYTTLKNAYVTILS-----KKYQQAPGISLLDTCFKGSLAG- 378
Query: 296 TFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFG 355
++ P I F L L N + + + CL + G
Sbjct: 379 --ISEVAPDIRIIFKGGADLQLKGHNSLVEL-----ETGITCLAMAGSSS-----IAIIG 426
Query: 356 SFQQQNVEVVYDLEKERIGFQPMDC 380
++QQQ V+V YD+ R+GF P C
Sbjct: 427 NYQQQTVKVAYDVGNSRVGFAPGGC 451
>gi|326516344|dbj|BAJ92327.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 464
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 100/387 (25%), Positives = 160/387 (41%), Gaps = 61/387 (15%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V +DT S+LTWV C C C D + F PS S S + C SS C +
Sbjct: 126 VIVDTASELTWVQC----EPCDACHDQQEPL----FDPSSSPSYAAVPCNSSSCDALR-- 175
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCP---SFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
++T + C P S+ +Y +G G+L D L + G
Sbjct: 176 -------------VATGMSGQACDDQPAACSYTLSYRDGSYSRGVLAHDRLSLAG----- 217
Query: 121 IREIPKFCFGCVGSTYREPIG----IAGFGRGALSVPSQ-LGFLQKGFSHCFLAFKYAND 175
+I F FGC G++ + P G + G GR LS+ SQ + FS+C +
Sbjct: 218 -EDIQGFVFGC-GTSNQGPFGGTSGLMGLGRSQLSLISQTMDQFGGVFSYCLPP----KE 271
Query: 176 PNISSPLVIGDVAISSKDN--LQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLRE 233
S LV+GD A +++ + +T M+ P+ +Y L IT+G + + P
Sbjct: 272 SGSSGSLVLGDDASVYRNSTPIVYTAMVSDPLQGPFYLANLTGITVGGEDV-QSP----G 326
Query: 234 FDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCP 293
F + G G +VDSGT T L Y+ + + S + YP+A D C+ +
Sbjct: 327 FSAGGGGKAIVDSGTIITSLVPSVYAAVRAEFVSQLAEYPQAAPFSI---LDTCFDL--- 380
Query: 294 NNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGV 353
+ PS+ F + + Y ++ ++ CL S+ P +
Sbjct: 381 -TGLREVQVPSLKLVFDGGAEVEVDSKGVLYVVTGDASQ---VCLALASLKSEYDTP--I 434
Query: 354 FGSFQQQNVEVVYDLEKERIGFQPMDC 380
G++QQ+N+ V++D +IGF C
Sbjct: 435 IGNYQQKNLRVIFDTVGSQIGFAQETC 461
>gi|125564143|gb|EAZ09523.1| hypothetical protein OsI_31798 [Oryza sativa Indica Group]
Length = 485
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 101/382 (26%), Positives = 151/382 (39%), Gaps = 64/382 (16%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V DTGSDL+WV C C DC Y + F PS SS+ + C + C + +S
Sbjct: 164 VIFDTGSDLSWVQCK----PCADC--YEQQDPL--FDPSLSSTYAAVACGAPECQELDAS 215
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
GCS + CR + YG+ G L RDTL + S
Sbjct: 216 ----------GCS-----SDSRCR----YEVQYGDQSQTDGNLVRDTLTLSASD-----T 251
Query: 124 IPKFCFGCV---GSTYREPIGIAGFGRGALSVPSQLG-FLQKGFSHCFLAFKYANDPNIS 179
+P F FGC + + G+ G GR +S+PSQ GF++C P+ S
Sbjct: 252 LPGFVFGCGDQNAGLFGQVDGLFGLGREKVSLPSQGAPSYGPGFTYCL--------PSSS 303
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
S + + N QFT L P++YYI L I +G ++ +P +
Sbjct: 304 SGRGYLSLGGAPPANAQFT-ALADGATPSFYYIDLVGIKVGGRAI-RIPATAFAAAGG-- 359
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTD 299
++DSGT T LP Y+ L + ++ Y +A + D CY
Sbjct: 360 --TVIDSGTVITRLPPRAYAPLRAAFARSMAQYKKAPALSI---LDTCYDF----TGHRT 410
Query: 300 DLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQ 359
P++ F ++ L Y + + CL F + D + G+ QQ
Sbjct: 411 AQIPTVELAFAGGATVSLDFTGVLYV-----SKVSQACLAF--APNADDSSIAILGNTQQ 463
Query: 360 QNVEVVYDLEKERIGFQPMDCA 381
+ V YD+ +RIGF C+
Sbjct: 464 KTFAVTYDVANQRIGFGAKGCS 485
>gi|255566002|ref|XP_002523989.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536716|gb|EEF38357.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 448
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 107/388 (27%), Positives = 151/388 (38%), Gaps = 61/388 (15%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
I D DLTW+PC C DC K F PS SS+ + C S C
Sbjct: 110 ILALADITGDLTWLPCKT----CQDC-----TKDGFTFFPSESSTYTSAACESYQCQ--- 157
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
+G T + C P P + GLV DT+ H SS G
Sbjct: 158 ---------ITNGAVCQTKMCIYLCGPLPQQRSSCTNKGLVA----MDTISFHSSS-GQA 203
Query: 122 REIPKFCFGC---VGSTYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPN 177
P F C + + + GI G GRG S+ SQ+ L G FS C + +
Sbjct: 204 LSYPNTNFICGTFIDNWHYIGAGIVGLGRGLFSMTSQMKHLINGTFSQCLVPYSSKQ--- 260
Query: 178 ISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
SS + G + S + + TP+ Y++ LEA+++G + + F S
Sbjct: 261 -SSKINFGLKGVVSGEGVVSTPIADDGE-SGAYFLFLEAMSVGGNRVAN------NFYSA 312
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTF 297
+ +D TT+T LP FY + + ++ I P E + LCY+ +
Sbjct: 313 PKSNIYIDWRTTFTSLPHDFYENVEAEVRKAINLTPINYNNERK--LSLCYKSESDH--- 367
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPS-----G 352
D P IT HF N + L N F M V C F DG + +
Sbjct: 368 -DFDAPPITMHF-TNADVQLSPLNTFVRMDWN-----VVCFAFL---DGTFNATKRITHA 417
Query: 353 VFGSFQQQNVEVVYDLEKERIGFQPMDC 380
V+GS+QQ N V YDL+ + F+ DC
Sbjct: 418 VYGSWQQMNFIVGYDLKSSTVSFKQADC 445
>gi|15217887|ref|NP_176703.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
gi|118572746|sp|Q9S9K4.2|ASPL2_ARATH RecName: Full=Aspartic proteinase-like protein 2; Flags: Precursor
gi|332196226|gb|AEE34347.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
Length = 475
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 103/399 (25%), Positives = 171/399 (42%), Gaps = 75/399 (18%)
Query: 3 QVYMDTGSDLTWVPCGNLSFDCMDCDDYRN-NKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
V +DTGSD+ W+ C C C N N +S F + SS+S + C FC I
Sbjct: 88 HVQVDTGSDILWINCK----PCPKCPTKTNLNFRLSLFDMNASSTSKKVGCDDDFCSFIS 143
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCP--SFAYTYGEGGLVTGILTRDTLKVHGSSPG 119
SD+ C+P S+ Y + G RD L + + G
Sbjct: 144 QSDS--------------------CQPALGCSYHIVYADESTSDGKFIRDMLTLEQVT-G 182
Query: 120 IIREIP---KFCFGC-------VGSTYREPIGIAGFGRGALSVPSQL---GFLQKGFSHC 166
++ P + FGC +G+ G+ GFG+ SV SQL G ++ FSHC
Sbjct: 183 DLKTGPLGQEVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHC 242
Query: 167 FLAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTE 226
K I V + ++ TPM+ + M+ N +G++ + +SL +
Sbjct: 243 LDNVKGGG---------IFAVGVVDSPKVKTTPMVPNQMHYNVMLMGMD---VDGTSL-D 289
Query: 227 VPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDL 286
+P S+ NGG +VDSGTT + P+ Y S++++ + P + E T F
Sbjct: 290 LPRSIVR-----NGGTIVDSGTTLAYFPKVLYD---SLIETILARQPVKLHIVEET-FQ- 339
Query: 287 CYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQS--MD 344
C+ +T D+ FP ++F F ++V L + ++ + + + C +Q+ +
Sbjct: 340 CFSF----STNVDEAFPPVSFEFEDSVKLTVYPHDYLFTL-----EEELYCFGWQAGGLT 390
Query: 345 DGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCAST 383
+ + G N VVYDL+ E IG+ +C+S+
Sbjct: 391 TDERSEVILLGDLVLSNKLVVYDLDNEVIGWADHNCSSS 429
>gi|383134454|gb|AFG48206.1| Pinus taeda anonymous locus 0_7863_01 genomic sequence
gi|383134458|gb|AFG48208.1| Pinus taeda anonymous locus 0_7863_01 genomic sequence
gi|383134460|gb|AFG48209.1| Pinus taeda anonymous locus 0_7863_01 genomic sequence
gi|383134462|gb|AFG48210.1| Pinus taeda anonymous locus 0_7863_01 genomic sequence
gi|383134464|gb|AFG48211.1| Pinus taeda anonymous locus 0_7863_01 genomic sequence
gi|383134466|gb|AFG48212.1| Pinus taeda anonymous locus 0_7863_01 genomic sequence
gi|383134468|gb|AFG48213.1| Pinus taeda anonymous locus 0_7863_01 genomic sequence
gi|383134470|gb|AFG48214.1| Pinus taeda anonymous locus 0_7863_01 genomic sequence
gi|383134474|gb|AFG48216.1| Pinus taeda anonymous locus 0_7863_01 genomic sequence
gi|383134486|gb|AFG48222.1| Pinus taeda anonymous locus 0_7863_01 genomic sequence
Length = 136
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 49/111 (44%), Positives = 69/111 (62%), Gaps = 4/111 (3%)
Query: 217 ITIGNSSLTEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAK 276
ITIG L ++P SL FD +GNGGL+VDSGTT+T LPE Y ++L+ L+S I Y R+
Sbjct: 1 ITIGGQRL-KLPSSLTTFDKEGNGGLIVDSGTTFTMLPESLYRRVLNKLKSAIR-YSRSV 58
Query: 277 EVEERTGFDLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMS 327
+ E G DLCY +P +F + P+ + HF +N ++ LP N+ MS
Sbjct: 59 KYEAALGLDLCYELPSAGGSFP--VLPTFSLHFKDNATITLPAENYMSMMS 107
>gi|242079451|ref|XP_002444494.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
gi|241940844|gb|EES13989.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
Length = 445
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 100/388 (25%), Positives = 153/388 (39%), Gaps = 60/388 (15%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DTGSDL W C D +R L + P++SSS +
Sbjct: 104 LILDTGSDLIWTQC-----KLFDTRQHREKPL---YDPAKSSSFAAA------------- 142
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
PC C + C R + Y YG G L +T G R
Sbjct: 143 -----PCDGRLCETGSFNTKNCSRNKCIYTYNYGSA-TTKGELASETFTF-----GEHRR 191
Query: 124 IP-KFCFGC---VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNIS 179
+ FGC + GI G LS+ SQL + FS+C F D N +
Sbjct: 192 VSVSLDFGCGKLTSGSLPGASGILGISPDRLSLVSQLQIPR--FSYCLTPFL---DRNTT 246
Query: 180 SPLVIGDVAISSKDN----LQFTPMLKSPMYPNYYY-IGLEAITIGNSSLTEVPLSLREF 234
S + G +A SK +Q T ++ +P NYYY + L I++G L VP+S
Sbjct: 247 SHIFFGAMADLSKYRTTGPIQTTSLVTNPDGSNYYYYVPLIGISVGTKRL-NVPVSSFAI 305
Query: 235 DSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPN 294
G+GG VDSG T LP L + + P + ++LC+++P
Sbjct: 306 GRDGSGGTFVDSGDTTGMLPSVVMEALKEAMVEAVKL-PVVNATDHGYEYELCFQLPRNG 364
Query: 295 NTFTDDLF--PSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSG 352
+ P + +HF +++L + ++ +SA CL+ S G
Sbjct: 365 GGAVETAVQVPPLVYHFDGGAAMLLRRDSYMVEVSA-----GRMCLVISSGARG-----A 414
Query: 353 VFGSFQQQNVEVVYDLEKERIGFQPMDC 380
+ G++QQQN+ V++D+E F P C
Sbjct: 415 IIGNYQQQNMHVLFDVENHEFSFAPTQC 442
>gi|242065058|ref|XP_002453818.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
gi|241933649|gb|EES06794.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
Length = 490
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 110/404 (27%), Positives = 181/404 (44%), Gaps = 74/404 (18%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKL-MSNFSPSRSSSSSRDTCASSFCLNIHS 62
V +DTGSD+ WV C C C + ++ + P+ S ++ C FC + +
Sbjct: 100 VQVDTGSDILWVNC----IRCDGCPTTSGLGIELTQYDPAGSGTTV--GCDQEFC--VAN 151
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIR 122
S N P S S PC F YG+G TG D+++ + S G +
Sbjct: 152 SPNGLPPACPSTSS-----------PC-QFRIAYGDGSSTTGFYVSDSVQYNQVS-GNGQ 198
Query: 123 EIPK---FCFGC-------VGSTYREPIGIAGFGRGALSVPSQLGF---LQKGFSHCFLA 169
P FGC +GS+ + GI GFG+ S+ SQL ++K F+HC
Sbjct: 199 TTPSNASITFGCGAQLGGDLGSSSQALDGILGFGQADSSMLSQLAAARKVRKIFAHCL-- 256
Query: 170 FKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPL 229
+ + IG+V + ++ TP++++ +Y + L+ I++G ++L ++P
Sbjct: 257 ----DTVHGGGIFAIGNVV---QPKVKTTPLVQNV---THYNVNLQGISVGGATL-QLPS 305
Query: 230 SLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFD-LCY 288
S FDS + G ++DSGTT +LP Y LL+ + + + +++ D +C+
Sbjct: 306 S--TFDSGDSKGTIIDSGTTLAYLPREVYRTLLTAV------FDKYQDLALHNYQDFVCF 357
Query: 289 RVPCPNNTFTDDLFPSITFHFLNNVSL-VLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGD 347
+ DD FP +TF F ++L V P F N + + C+ F +D G
Sbjct: 358 QFSGS----IDDGFPVVTFSFEGEITLNVYPHDYLF------QNENDLYCMGF--LDGGV 405
Query: 348 YGPSG----VFGSFQQQNVEVVYDLEKERIGFQPMDCASTASAQ 387
G + G N VVYDLEK+ IG+ +C+S+ Q
Sbjct: 406 QTKDGKDMVLLGDLVLSNKLVVYDLEKQVIGWADYNCSSSIKIQ 449
>gi|297827577|ref|XP_002881671.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297327510|gb|EFH57930.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 438
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 116/395 (29%), Positives = 169/395 (42%), Gaps = 68/395 (17%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
I + +DTGS+L+W+ C ++ L S F+P SS+ S C+S C
Sbjct: 74 ISMVLDTGSELSWLHC------------KKSPNLGSVFNPVSSSTYSPVPCSSPIC-RTR 120
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSF---AYTYGEGGLVTGILTRDTLKVHGSSP 118
+ D P C P F A +Y + + G L DT V GS
Sbjct: 121 TRDLPI---------------PASCDPKTHFCHVAISYADATSIEGNLAHDTF-VIGS-- 162
Query: 119 GIIREIPKFCFGCVGSTYR-------EPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFK 171
+ R P FGC+ S + G+ G RG+LS +QLGF + FS+C
Sbjct: 163 -VTR--PGTLFGCMDSGLSSDSEEDAKSTGLMGMNRGSLSFVNQLGFSK--FSYCI---- 213
Query: 172 YANDPNISSPLVIGDVAISSKDNLQFTPM-LKSPMYPNY----YYIGLEAITIGNSSLTE 226
+ + S L++GD + S +Q+TP+ L++ P + Y + LE I +G S +
Sbjct: 214 --SGSDSSGILLLGDASYSWLGPIQYTPLVLQTTPLPYFDRVAYTVQLEGIRVG-SKILS 270
Query: 227 VPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQL----LSILQSTITYYPRAKEVEERT 282
+P S+ D G G +VDSGT +T L P Y+ L ++ +S + V + T
Sbjct: 271 LPKSVFVPDHTGAGQTMVDSGTQFTFLMGPVYTALKNEFIAQTKSVLRIVDDPNFVFQGT 330
Query: 283 GFDLCYRVPCPNN-TFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSN-SSAVKCLLF 340
DLCYRV FT P I+ F V Q + A S V C F
Sbjct: 331 -MDLCYRVGSSTRPNFTG--LPVISLMFRGAEMSVSGQKLLYRVNGAGSEGKEEVYCFTF 387
Query: 341 QSMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGF 375
+ D + V G QQNV + +DL K R+GF
Sbjct: 388 GNSDLLGI-EAFVIGHHHQQNVWMEFDLAKSRVGF 421
>gi|224126221|ref|XP_002329620.1| predicted protein [Populus trichocarpa]
gi|222870359|gb|EEF07490.1| predicted protein [Populus trichocarpa]
Length = 357
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 99/383 (25%), Positives = 152/383 (39%), Gaps = 62/383 (16%)
Query: 3 QVYM--DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNI 60
Q YM DTGSD+ W+ C C DC Y+ + F P+ SS+ + TC S C ++
Sbjct: 32 QFYMVLDTGSDINWLQCQ----PCTDC--YQQTDPI--FDPTASSTYAPVTCQSQQCSSL 83
Query: 61 HSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
MS C L + YG+G G +++ S G
Sbjct: 84 E----------MSSCRSGQCL----------YQVNYGDGSYTFGDFATESVSFGNS--GS 121
Query: 121 IREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNI-- 178
++ + C + G+ G G G LS+ +QL FS+C + A +
Sbjct: 122 VKNVALGCGHDNEGLFVGAAGLLGLGGGPLSLTNQLK--ATSFSYCLVNRDSAGSSTLDF 179
Query: 179 -SSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
S+ L + V P++K+ +YY+GL +++G + +P S D
Sbjct: 180 NSAQLGVDSVT---------APLMKNRKIDTFYYVGLSGMSVGGQ-MVSIPESTFRLDES 229
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTF 297
GNGG++VD GT T L Y+ L + K FD CY + +
Sbjct: 230 GNGGIIVDCGTAITRLQTQAYNPLRDAF---VRMTQNLKLTSAVALFDTCYDLSGQASV- 285
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSF 357
P+++FHF + S LP N+ P +S+ C F + G+
Sbjct: 286 ---RVPTVSFHFADGKSWNLPAANYLI----PVDSAGTYCFAFAPTTSS----LSIIGNV 334
Query: 358 QQQNVEVVYDLEKERIGFQPMDC 380
QQQ V +DL R+GF P C
Sbjct: 335 QQQGTRVTFDLANNRMGFSPNKC 357
>gi|297823357|ref|XP_002879561.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297325400|gb|EFH55820.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 447
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 114/388 (29%), Positives = 171/388 (44%), Gaps = 57/388 (14%)
Query: 7 DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDNP 66
DTGSDLTWV C C C Y+ N + F +SS+ + C S C + SS+
Sbjct: 103 DTGSDLTWVQCK----PCQQC--YKENGPI--FDKKKSSTYKSEPCDSRNCHALSSSER- 153
Query: 67 FDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIPK 126
GC S K+ C + Y+YG+ G + +T+ + +S G P
Sbjct: 154 -------GCDES---KNVC-----KYRYSYGDQSFSKGDVATETISIDSAS-GSPVSFPG 197
Query: 127 FCFGC---VGSTYREPIGIAGFGRGA-LSVPSQLGF-LQKGFSHCFLAFKYANDPNISSP 181
FGC G T+ E G LS+ SQLG + K FS+C L+ K A N +S
Sbjct: 198 TVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQLGSSISKKFSYC-LSHKSAT-TNGTSV 255
Query: 182 LVIGDVAISS---KDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFD--- 235
+ +G +I S KD+ + L YYY+ LEAI++G + S D
Sbjct: 256 INLGTNSIPSSLSKDSGVISTPLVDKEPRTYYYLTLEAISVGKKKIPYTGSSYNPNDGGI 315
Query: 236 -SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPN 294
S+ +G +++DSGTT T L F+ + + ++ +T AK V + G L + C
Sbjct: 316 FSETSGNIIIDSGTTLTLLDSGFFDKFGAAVEELVT---GAKRVSDPQGL-LSH---CFK 368
Query: 295 NTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVF 354
+ + P IT HF + L N F +S + CL + ++
Sbjct: 369 SGSAEIGLPEITVHF-TGADVRLSPINAFVKVSED-----MVCLSMVPTTE-----VAIY 417
Query: 355 GSFQQQNVEVVYDLEKERIGFQPMDCAS 382
G+F Q + V YDLE + FQ MDC++
Sbjct: 418 GNFAQMDFLVGYDLETRTVSFQRMDCSA 445
>gi|118484458|gb|ABK94105.1| unknown [Populus trichocarpa]
Length = 499
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 99/383 (25%), Positives = 152/383 (39%), Gaps = 62/383 (16%)
Query: 3 QVYM--DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNI 60
Q YM DTGSD+ W+ C C DC Y+ + F P+ SS+ + TC S C ++
Sbjct: 173 QFYMVLDTGSDINWLQCQ----PCTDC--YQQTDPI--FDPTASSTYAPVTCQSQQCSSL 224
Query: 61 HSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
MS C L + YG+G G +++ S G
Sbjct: 225 E----------MSSCRSGQCL----------YQVNYGDGSYTFGDFATESVSFGNS--GS 262
Query: 121 IREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNI-- 178
++ + C + G+ G G G LS+ +QL FS+C + A +
Sbjct: 263 VKNVALGCGHDNEGLFVGAAGLLGLGGGPLSLTNQLK--ATSFSYCLVNRDSAGSSTLDF 320
Query: 179 -SSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
S+ L + V P++K+ +YY+GL +++G + +P S D
Sbjct: 321 NSAQLGVDSVT---------APLMKNRKIDTFYYVGLSGMSVGGQ-MVSIPESTFRLDES 370
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTF 297
GNGG++VD GT T L Y+ L + K FD CY + +
Sbjct: 371 GNGGIIVDCGTAITRLQTQAYNPLRDAF---VRMTQNLKLTSAVALFDTCYDLSGQASV- 426
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSF 357
P+++FHF + S LP N+ P +S+ C F + G+
Sbjct: 427 ---RVPTVSFHFADGKSWNLPAANYLI----PVDSAGTYCFAFAPTTSS----LSIIGNV 475
Query: 358 QQQNVEVVYDLEKERIGFQPMDC 380
QQQ V +DL R+GF P C
Sbjct: 476 QQQGTRVTFDLANNRMGFSPNKC 498
>gi|449439383|ref|XP_004137465.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 491
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 107/377 (28%), Positives = 155/377 (41%), Gaps = 55/377 (14%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DTGSD+ W+ C C DC Y+ + F P SSS + C S C + +S
Sbjct: 170 MVLDTGSDINWLQCQ----PCTDC--YQQTDPI--FDPRSSSSFASLPCESQQCQALETS 221
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
GC S L + +YG+G G +TL S G+I
Sbjct: 222 ----------GCRASKCL----------YQVSYGDGSFTVGEFVIETLTFGNS--GMINN 259
Query: 124 IPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSPLV 183
+ C + G+ G G G+LS+ SQ+ FS+C + D + SS L
Sbjct: 260 VAVGCGHDNEGLFVGSAGLLGLGGGSLSLTSQMK--ASSFSYCLVD----RDSSSSSDLE 313
Query: 184 IGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGGLL 243
A S N P+LKS +YY+GL +++G L +P +L + D G GG++
Sbjct: 314 FNSAAPSDSVN---APLLKSGKVDTFYYVGLTGMSVGGQ-LLSIPPNLFQMDDSGYGGII 369
Query: 244 VDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDLFP 303
VDSGT T L Y+ L ++ P K+ FD CY + + P
Sbjct: 370 VDSGTAITRLQTQAYNTLRDAF---VSRTPYLKKTNGFALFDTCYDLSSQSRV----TIP 422
Query: 304 SITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQNVE 363
+++F F SL LP N+ P +S C F + G+ QQQ
Sbjct: 423 TVSFEFAGGKSLQLPPKNYLI----PVDSVGTFCFAFAPTTSS----LSIIGNVQQQGTR 474
Query: 364 VVYDLEKERIGFQPMDC 380
V YDL +GF P C
Sbjct: 475 VHYDLANSVVGFSPHKC 491
>gi|125547728|gb|EAY93550.1| hypothetical protein OsI_15341 [Oryza sativa Indica Group]
Length = 418
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 104/390 (26%), Positives = 159/390 (40%), Gaps = 86/390 (22%)
Query: 2 IQVYMDTGSDLTWVPCGNL-SFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNI 60
+Q+ +DTGSD+TW C + C N+ + F PS SSS + C+S C
Sbjct: 101 VQLTLDTGSDITWTQCKRCPASACF-------NQTLPLFDPSASSSFASLPCSSPACETT 153
Query: 61 HSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLK-VHGSSPG 119
PC + S RPC +++ +YG+G + G + R+ G+ G
Sbjct: 154 -------PPCGGGNDATS--------RPC-NYSISYGDGSVSRGEIGREVFTFASGTGEG 197
Query: 120 IIREIPKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYAND 175
+P FGC G GIAGFGRG+LS+PSQL FSHCF +
Sbjct: 198 SSAAVPGLVFGCGHANRGVFTSNETGIAGFGRGSLSLPSQLKV--GNFSHCFTTITGSK- 254
Query: 176 PNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFD 235
+S +++G ++ P SP+ G+ P S
Sbjct: 255 ---TSAVLLGLPGVA--------PPSASPL----------GRRRGSYRCRSTPRS----- 288
Query: 236 SQGNGGLLVDSGTTYTHLPEPFYSQLLS--ILQSTITYYPRAKEVEERTGFDLCYRVPCP 293
+SGT+ T LP Y + Q + P + T F R P P
Sbjct: 289 --------SNSGTSITSLPPRTYRAVREEFAAQVKLPVVP-GNATDPFTCFSAPLRGPKP 339
Query: 294 NNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAM---SAPSNSSAVKCLLFQSMDDGDYGP 350
+ P++ HF ++ LPQ N+ + + NSS + CL ++ G+
Sbjct: 340 D-------VPTMALHF-EGATMRLPQENYVFEVVDDDDAGNSSRIICLAV--IEGGEI-- 387
Query: 351 SGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
+ G+ QQQN+ V+YDL+ ++ F P C
Sbjct: 388 --ILGNIQQQNMHVLYDLQNSKLSFVPAQC 415
>gi|357131275|ref|XP_003567264.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like, partial [Brachypodium distachyon]
Length = 364
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 91/317 (28%), Positives = 140/317 (44%), Gaps = 40/317 (12%)
Query: 95 TYGEGGLVTGILTRDTLKVHGSSPGIIREIPKFCFGCVGSTY-REPIGIA-----GFGRG 148
+Y +G G L D V ++P + + FGC+ S + P G+A G RG
Sbjct: 64 SYADGSSSDGALATDVFAVGSATPSL-----RAAFGCMASAFDSSPDGVASAGLLGMNRG 118
Query: 149 ALSVPSQLGFLQKGFSHCFLAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPN 208
ALS SQ G + FS+C +D + + L++G + + L +TP+ + +
Sbjct: 119 ALSFVSQAG--TRRFSYCI------SDRDDAGVLLLGHSDLPNFLPLNYTPLYQPSLPLP 170
Query: 209 Y-----YYIGLEAITIGNSSLTEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLS 263
Y Y + L I +G+ L +P S+ D G G +VDSGT +T L Y+ L +
Sbjct: 171 YFDRVAYSVQLLGILVGSKPL-PIPASVLAPDHTGAGQTMVDSGTQFTFLLGDAYAALKA 229
Query: 264 ILQSTITYYPRAKEVEE---RTGFDLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQG 320
T + RA + + FD C+RVP + L PS+T F N +V+
Sbjct: 230 EFYRQSTPFLRALDEPSFAFQGAFDTCFRVPRGMSPPPGRLLPSVTLRF-NGAEMVVGGD 288
Query: 321 NHFYAM------SAPSNSSAVKCLLFQSMDDGDYGP--SGVFGSFQQQNVEVVYDLEKER 372
Y + A ++ AV CL F + D P + V G Q N+ V YDLE+ R
Sbjct: 289 RLLYKVPGERRGGAGADDDAVWCLTF---GNADMVPIMAYVIGHHHQMNLWVEYDLERGR 345
Query: 373 IGFQPMDCASTASAQGL 389
+G + C + GL
Sbjct: 346 VGLAQVRCDVASQRLGL 362
>gi|326499093|dbj|BAK06037.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 471
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 104/390 (26%), Positives = 156/390 (40%), Gaps = 59/390 (15%)
Query: 7 DTGSDLTWVPC--GNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSD 64
DTGSDL W+ C G D F PS+S++ C S C S+
Sbjct: 118 DTGSDLIWLNCSYGGDGPGLAAARDADAQPPGVQFDPSKSTTFRLVDCDSVAC-----SE 172
Query: 65 NPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVH----GSSPGI 120
P C + CR ++Y+YG+G +G+L+ +T G
Sbjct: 173 LPEASCG----------ADSKCR----YSYSYGDGSHTSGVLSTETFTFADAPGARGDGT 218
Query: 121 IREIPKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGF---LQKGFSHCFLAFKYA 173
+ FGC VGS+ + + G G +L SQLG L + FS+C + +
Sbjct: 219 TTRVANVNFGCSTTFVGSSVGDGLVGLGGGDLSLV--SQLGADTSLGRRFSYCLVPYSV- 275
Query: 174 NDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLRE 233
SS L G A + TP++ S + YY + L ++ +GN +
Sbjct: 276 ---KASSALNFGPRAAVTDPGAVTTPLIPSQVK-AYYIVELRSVKVGN----------KT 321
Query: 234 FDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCP 293
F++ L+VDSGTT T LPE L+ L I P + ER LC+ V
Sbjct: 322 FEAPDRSPLIVDSGTTLTFLPEALVDPLVKELTGRIKLPP--AQSPERL-LPLCFDVSGV 378
Query: 294 NNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGV 353
+ P +T ++ L N F + CL +M + P+ +
Sbjct: 379 REGQVAAMIPDVTVGLGGGAAVTLKAENTFVEVQ-----EGTLCLAVSAMSE--QFPASI 431
Query: 354 FGSFQQQNVEVVYDLEKERIGFQPMDCAST 383
G+ QQN+ V YDL+K + F P CAS+
Sbjct: 432 IGNIAQQNMHVGYDLDKGTVTFAPAACASS 461
>gi|359483137|ref|XP_002272278.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 402
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 84/300 (28%), Positives = 139/300 (46%), Gaps = 36/300 (12%)
Query: 88 PCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIPKFCFGCVGST---YREPIGIAG 144
P ++A YG+G G L + LK G I + F FGC + + G+ G
Sbjct: 131 PICNYAINYGDGSFTRGELGHEKLKF-----GTIL-VKDFIFGCGRNNKGLFGGVSGLMG 184
Query: 145 FGRGALSVPSQL-GFLQKGFSHCFLAFKYANDPNISSPLVIGDVAISSKDN--LQFTPML 201
GR LS+ SQ G FS+C + + S L++G + +++ + + M+
Sbjct: 185 LGRSDLSLISQTSGIFGGVFSYCLPSTERKG----SGSLILGGNSSVYRNSSPISYAKMI 240
Query: 202 KSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQL 261
++P N+Y+I L I+IG +L + P S G +LVDSGT T LP Y L
Sbjct: 241 ENPQLYNFYFINLTGISIGGVAL-QAP-------SVGPSRILVDSGTVITRLPPTIYKAL 292
Query: 262 LSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGN 321
+ T +P A + D C+ + + + + P+I HF N L +
Sbjct: 293 KAEFLKQFTGFPPAPAF---SILDTCFNL----SAYQEVDIPTIKMHFEGNAELTVDVTG 345
Query: 322 HFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
FY + S++S V CL S++ D + G++QQ+N+ V+YD ++ ++GF C+
Sbjct: 346 VFYFVK--SDASQV-CLALASLEYQD--EVAILGNYQQKNLRVIYDTKETKVGFALETCS 400
>gi|15234607|ref|NP_194733.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|4938480|emb|CAB43839.1| putative protein [Arabidopsis thaliana]
gi|7269904|emb|CAB80997.1| putative protein [Arabidopsis thaliana]
gi|67633776|gb|AAY78812.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332660311|gb|AEE85711.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 427
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 102/383 (26%), Positives = 160/383 (41%), Gaps = 66/383 (17%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
++MDT SDL W+ C C++C + + F PSRS + +TC +S
Sbjct: 100 LHMDTASDLLWIQC----LPCINC----YAQSLPIFDPSRSYTHRNETCRTS-------- 143
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHG----SSPG 119
S+ +L + R C ++ Y + GIL R+ L + SS
Sbjct: 144 ----------QYSMPSLKFNANTRSC-EYSMRYVDDTGSKGILAREMLLFNTIYDESSSA 192
Query: 120 IIREIPKFCFGCVGSTYREPI---GIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDP 176
+ ++ FGC Y EP+ GI G G G S+ + G K FS+CF + + P
Sbjct: 193 ALHDV---VFGCGHDNYGEPLVGTGILGLGYGEFSLVHRFG---KKFSYCFGSLDDPSYP 246
Query: 177 NISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
+ + LV+GD + + TP+ ++ +YY+ +EAI++ L P
Sbjct: 247 H--NVLVLGDDGANILGDT--TPL---EIHNGFYYVTIEAISVDGIILPIDPRVFNRNHQ 299
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNT 296
G GG ++D+G + T L E Y L + ++ A +V + D ++ C N
Sbjct: 300 TGLGGTIIDTGNSLTSLVEEAYKPLKNRIEDIFEGRFTAADVSQ----DDMIKMECYNGN 355
Query: 297 FTDDL----FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSG 352
F DL FP +TFHF L L + F +S V CL G
Sbjct: 356 FERDLVESGFPIVTFHFSEGAELSLDVKSLFMKLSP-----NVFCLAVTP------GNLN 404
Query: 353 VFGSFQQQNVEVVYDLEKERIGF 375
G+ QQ+ + YDLE + F
Sbjct: 405 SIGATAQQSYNIGYDLEAMEVSF 427
>gi|15222357|ref|NP_174430.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12322538|gb|AAG51267.1|AC027135_8 chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
gi|67633408|gb|AAY78629.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332193236|gb|AEE31357.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 445
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 110/385 (28%), Positives = 164/385 (42%), Gaps = 55/385 (14%)
Query: 7 DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDNP 66
DTGSDLTWV C C C Y+ N + F +SS+ ++C S C + +
Sbjct: 103 DTGSDLTWVQCK----PCQQC--YKQNSPL--FDKKKSSTYKTESCDSKTCQALSEHEE- 153
Query: 67 FDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIPK 126
GC S K C + Y+YG+ G + +T+ + SS P
Sbjct: 154 -------GCDES---KDIC-----KYRYSYGDNSFTKGDVATETISIDSSSG-SSVSFPG 197
Query: 127 FCFGC---VGSTYREPIGIAGFGRGA-LSVPSQLGF-LQKGFSHCFLAFKYANDPNISSP 181
FGC G T+ E G LS+ SQLG + K FS+C A N +S
Sbjct: 198 TVFGCGYNNGGTFEETGSGIIGLGGGPLSLVSQLGSSIGKKFSYCLS--HTAATTNGTSV 255
Query: 182 LVIGDVAISS---KDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSL--TEVPLSLREFDS 236
+ +G +I S KD+ T L YY++ LEA+T+G + L T L S
Sbjct: 256 INLGTNSIPSNPSKDSATLTTPLIQKDPETYYFLTLEAVTVGKTKLPYTGGGYGLNGKSS 315
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNT 296
+ G +++DSGTT T L FY + ++ ++T AK V + G C +
Sbjct: 316 KRTGNIIIDSGTTLTLLDSGFYDDFGTAVEESVT---GAKRVSDPQGL----LTHCFKSG 368
Query: 297 FTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGS 356
+ P+IT HF N + L N F + N V + + + ++G+
Sbjct: 369 DKEIGLPAITMHF-TNADVKLSPINAFVKL----NEDTVCLSMIPTTE------VAIYGN 417
Query: 357 FQQQNVEVVYDLEKERIGFQPMDCA 381
Q + V YDLE + + FQ MDC+
Sbjct: 418 MVQMDFLVGYDLETKTVSFQRMDCS 442
>gi|15222611|ref|NP_173922.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12321511|gb|AAG50814.1|AC079281_16 hypothetical protein [Arabidopsis thaliana]
gi|20466516|gb|AAM20575.1| unknown protein [Arabidopsis thaliana]
gi|23198172|gb|AAN15613.1| unknown protein [Arabidopsis thaliana]
gi|110736960|dbj|BAF00436.1| hypothetical protein [Arabidopsis thaliana]
gi|332192515|gb|AEE30636.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 483
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 108/384 (28%), Positives = 166/384 (43%), Gaps = 66/384 (17%)
Query: 3 QVYM--DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNI 60
+VYM DTGSD+ W+ C C DC Y + + F PS SSS +C + C
Sbjct: 160 EVYMVLDTGSDVNWLQCT----PCADC--YHQTEPI--FEPSSSSSYEPLSCDTPQC--- 208
Query: 61 HSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
+ +S C +T L + +YG+G G +TL + GS+ +
Sbjct: 209 -------NALEVSECRNATCL----------YEVSYGDGSYTVGDFATETLTI-GST--L 248
Query: 121 IREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISS 180
++ + C + G+ G G G L++PSQL FS+C + D + +S
Sbjct: 249 VQNVAVGCGHSNEGLFVGAAGLLGLGGGLLALPSQLN--TTSFSYCLV----DRDSDSAS 302
Query: 181 PLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNG 240
+ D S + P+L++ +YY+GL I++G L ++P S E D G+G
Sbjct: 303 TV---DFGTSLSPDAVVAPLLRNHQLDTFYYLGLTGISVGG-ELLQIPQSSFEMDESGSG 358
Query: 241 GLLVDSGTTYTHLPEPFYSQLL-SILQSTITYYPRAKEVEERTG---FDLCYRVPCPNNT 296
G+++DSGT T L Y+ L S ++ T+ ++E+ G FD CY +
Sbjct: 359 GIIIDSGTAVTRLQTEIYNSLRDSFVKGTL-------DLEKAAGVAMFDTCYNLSAK--- 408
Query: 297 FTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGS 356
T P++ FHF L LP N+ P +S CL F + G+
Sbjct: 409 -TTVEVPTVAFHFPGGKMLALPAKNYMI----PVDSVGTFCLAFAPTASS----LAIIGN 459
Query: 357 FQQQNVEVVYDLEKERIGFQPMDC 380
QQQ V +DL IGF C
Sbjct: 460 VQQQGTRVTFDLANSLIGFSSNKC 483
>gi|54290725|dbj|BAD62395.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 500
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 99/394 (25%), Positives = 163/394 (41%), Gaps = 67/394 (17%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + DTG ++ V C C +++F PSRSS+ + C S C
Sbjct: 159 LAMAFDTGLGISLV-------RCAACRPGAPCDGLASFDPSRSSTFAPVPCGSPDC---- 207
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
SGCS ST P SF + ++G + +D L + S+
Sbjct: 208 ----------RSGCSSG----STPSCPLTSFPF-------LSGAVAQDVLTLTPSA---- 242
Query: 122 REIPKFCFGCVGSTYREPIGIAGF---GRGALSVPSQLGFLQKG-FSHCFLAFKYANDPN 177
+ F FGCV + EP+G AG R + SV S+L G FS+C L +
Sbjct: 243 -SVDDFTFGCVEGSSGEPLGAAGLLDLSRDSRSVASRLAADAGGTFSYC-LPLSTTSSHG 300
Query: 178 ISSPLVIGDVAISSKDNLQFT---PMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREF 234
L IG+ + + T P++ P +PN+Y I L +++G + P +
Sbjct: 301 F---LAIGEADVPHNRTARVTAVAPLVYDPAFPNHYVIDLAGVSLGGRDIPIPPHA---- 353
Query: 235 DSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPN 294
+ + +++D+ YT++ Y+ L + + YPRA + + D CY
Sbjct: 354 -ATASAAMVLDTALPYTYMKPSMYAPLRDAFRRAMARYPRAPAMGD---LDTCYNF---T 406
Query: 295 NTFTDDLFPSITFHF-----LNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSM-DDGDY 348
+ L P + F ++ + + MS P N +V CL F ++ DGD
Sbjct: 407 GVRHEVLIPLVHLTFRGIGGGGGGQVLGLGADQMFYMSEPGNFFSVTCLAFAALPSDGDA 466
Query: 349 GP--SGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
+ V G+ Q ++EVV+D+ +IGF P C
Sbjct: 467 EAPLAMVMGTLAQSSMEVVHDVPGGKIGFIPGSC 500
>gi|297834758|ref|XP_002885261.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297331101|gb|EFH61520.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 500
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 98/378 (25%), Positives = 159/378 (42%), Gaps = 55/378 (14%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DTGSD+ W+ C C DC Y+ + + F+P+ SS+ TC++ C
Sbjct: 177 LVLDTGSDVNWIQCE----PCSDC--YQQSDPV--FNPTSSSTYKSLTCSAPQC------ 222
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPS-FAYTYGEGGLVTGILTRDTLKVHGSSPGIIR 122
+LL+++ CR + +YG+G G L DT+ S G I
Sbjct: 223 ---------------SLLETSACRSNKCLYQVSYGDGSFTVGELATDTVTFGNS--GKIN 265
Query: 123 EIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSPL 182
++ C + G+ G G GALS+ +Q+ FS+C + D SS L
Sbjct: 266 DVALGCGHDNEGLFTGAAGLLGLGGGALSITNQMK--ATSFSYCLVD----RDSGKSSSL 319
Query: 183 VIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGGL 242
V + S D P+L++ +YY+GL ++G + +P ++ + D+ G+GG+
Sbjct: 320 DFNSVQLGSGDAT--APLLRNQKIDTFYYVGLSGFSVGGQKVM-MPDAIFDVDASGSGGV 376
Query: 243 LVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDLF 302
++D GT T L Y+ L T K + FD CY ++ +
Sbjct: 377 ILDCGTAVTRLQTQAYNSLRDAFLKLTTNLK--KGTSSISLFDTCYDF----SSLSSVKV 430
Query: 303 PSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQNV 362
P++ FHF SL LP N+ P + + C F + G+ QQQ
Sbjct: 431 PTVAFHFTGGKSLDLPAKNYLI----PVDDNGTFCFAFAPTS----SSLSIIGNVQQQGT 482
Query: 363 EVVYDLEKERIGFQPMDC 380
+ YDL + IG C
Sbjct: 483 RITYDLANKIIGLSGNKC 500
>gi|125600538|gb|EAZ40114.1| hypothetical protein OsJ_24557 [Oryza sativa Japonica Group]
Length = 412
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 100/386 (25%), Positives = 152/386 (39%), Gaps = 88/386 (22%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ +DTGSDL W C C + ++P+RS++ + +C S C +
Sbjct: 105 LTAVLDTGSDLIWTQCDAPCRRCFP-------QPAPLYAPARSATYANVSCRSPMCQALQ 157
Query: 62 SSDNPFDPCTM--SGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPG 119
S P+ C+ +GC+ + ++YG+G G+L +T + G
Sbjct: 158 S---PWSRCSPPDTGCA---------------YYFSYGDGTSTDGVLATETFTL-----G 194
Query: 120 IIREIPKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFS-HCFLAFKYAN 174
+ FGC +GST G+ G GRG LS+ SQLG + S A +
Sbjct: 195 SDTAVRGVAFGCGTENLGSTDNSS-GLVGMGRGPLSLVSQLGVTRPRRSCRARAAARGGG 253
Query: 175 DPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREF 234
P +SPL E IT+G++ L P R
Sbjct: 254 APTTTSPL--------------------------------EGITVGDTLLPIDPAVFR-L 280
Query: 235 DSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPN 294
G+GG+++DSGTT+T L E + L L S + P A G LC+ P
Sbjct: 281 TPMGDGGVIIDSGTTFTALEERAFVALARALASRV-RLPLASGAH--LGLSLCFAAASPE 337
Query: 295 NTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVF 354
P + HF + + L + ++ S+ V CL S V
Sbjct: 338 AVE----VPRLVLHF-DGADMELRRESYVVE----DRSAGVACLGMVSARG-----MSVL 383
Query: 355 GSFQQQNVEVVYDLEKERIGFQPMDC 380
GS QQQN ++YDLE+ + F+P C
Sbjct: 384 GSMQQQNTHILYDLERGILSFEPAKC 409
>gi|226532674|ref|NP_001151415.1| pepsin A precursor [Zea mays]
gi|195646632|gb|ACG42784.1| pepsin A [Zea mays]
Length = 492
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 111/403 (27%), Positives = 181/403 (44%), Gaps = 78/403 (19%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKL-MSNFSPSRSSSSSRDTCASSFCLNIHS 62
V +DTGSD+ WV C C + ++ + P+ S ++ C FC+ +S
Sbjct: 100 VQVDTGSDILWVN----GISCDGCPTRSGLGIELTQYDPAGSGTTV--GCEQEFCV-ANS 152
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIR 122
+ + P S S PC F TYG+G TG D ++ + S G +
Sbjct: 153 AASGVPPACPSAAS-----------PC-QFRITYGDGSSTTGFYVTDFVQYNQVS-GNGQ 199
Query: 123 EIPK---FCFGC-------VGSTYREPIGIAGFGRGALSVPSQLGF---LQKGFSHCFLA 169
P FGC +GS+ + GI GFG+ S+ SQL ++K F+HC
Sbjct: 200 TTPSNVSITFGCGAQLGGDLGSSSQALDGILGFGQSDASMLSQLAAARKVRKIFAHCLDT 259
Query: 170 FKYANDPNISSPLVIGDVAISSKDNLQFTPMLKS-PMYPN--YYYIGLEAITIGNSSLTE 226
+ I + N+ P++K+ P+ PN +Y + L+ I++G ++L +
Sbjct: 260 VRGG--------------GIFAIGNVVQPPIVKTTPLVPNATHYNVNLQGISVGGATL-Q 304
Query: 227 VPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFD- 285
+P S FDS + G ++DSGTT +LP Y LL+ + + + ++ R D
Sbjct: 305 LPTS--TFDSGDSKGTIIDSGTTLAYLPREVYRTLLTAV------FDKHPDLAVRNYEDF 356
Query: 286 LCYRVPCPNNTFTDDLFPSITFHFLNNVSL-VLPQGNHFYAMSAPSNSSAVKCLLFQSMD 344
+C++ + D+ FP ITF F +++L V P F N + + C+ F +D
Sbjct: 357 ICFQF----SGSLDEEFPVITFSFEGDLTLNVYPHDYLF------QNGNDLYCMGF--LD 404
Query: 345 DGDYGPSG----VFGSFQQQNVEVVYDLEKERIGFQPMDCAST 383
G G + G N VVYDLEK+ IG+ +C+S+
Sbjct: 405 GGVQTKDGKDMVLLGDLVLSNKLVVYDLEKQVIGWTDYNCSSS 447
>gi|326530426|dbj|BAJ97639.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 479
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 107/382 (28%), Positives = 166/382 (43%), Gaps = 55/382 (14%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +D+GSD+ W+ C C +C Y+ + F P+ S+S + C S C +
Sbjct: 148 LVVDSGSDVIWIQC----RPCAEC--YQQADPL--FDPAASASFTAVPCDSGVCRTL--- 196
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
P SGC+ S CR + +YG+G G+L +TL S+P ++
Sbjct: 197 -----PGGSSGCA-----DSGACR----YQVSYGDGSYTQGVLAMETLTFGDSTP--VQG 240
Query: 124 IPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNISSPL 182
+ C + G+ G G G +S+ QLG G FS+C LA + A D S +
Sbjct: 241 VAIGCGHRNRGLFVGAAGLLGLGWGPMSLVGQLGGAAGGAFSYC-LASRGA-DAGAGSLV 298
Query: 183 VIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFD--SQGNG 240
D A+ + P+L++ P++YY+GL + +G L PL FD G G
Sbjct: 299 FGRDDAMPV--GAVWVPLLRNAQQPSFYYVGLTGLGVGGERL---PLQDGLFDLTEDGGG 353
Query: 241 GLLVDSGTTYTHLPEPFYSQLLSILQSTITY-YPRAKEVEERTGFDLCYRVPCPNNTFTD 299
G+++D+GT T LP Y+ L STI PRA V D CY + + +
Sbjct: 354 GVVMDTGTAVTRLPPDAYAALRDAFASTIGGDLPRAPGVSL---LDTCYDL----SGYAS 406
Query: 300 DLFPSITFHF-LNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQ 358
P++ +F + +L LP N M V CL F + G + G+ Q
Sbjct: 407 VRVPTVALYFGRDGAALTLPARNLLVEMGG-----GVYCLAFAASASG----LSILGNIQ 457
Query: 359 QQNVEVVYDLEKERIGFQPMDC 380
QQ +++ D +GF P C
Sbjct: 458 QQGIQITVDSANGYVGFGPSTC 479
>gi|15234606|ref|NP_194732.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|4938479|emb|CAB43838.1| putative protein [Arabidopsis thaliana]
gi|7269903|emb|CAB80996.1| putative protein [Arabidopsis thaliana]
gi|67633774|gb|AAY78811.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332660310|gb|AEE85710.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 424
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 104/381 (27%), Positives = 158/381 (41%), Gaps = 58/381 (15%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DTGSDLTW+ C+ C Y + + F PSRSS+ +C S+ H+
Sbjct: 93 LLIDTGSDLTWI-------HCLPCKCY--PQTIPFFHPSRSSTYRNASCVSA----PHAM 139
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
F C + Y + GIL + L S G+I +
Sbjct: 140 PQIFRDEKTGNCQ---------------YHLRYRDFSNTRGILAEEKLTFETSDDGLISK 184
Query: 124 IPKFCFGC--VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSP 181
FGC S + + G+ G G G S+ ++ F K FS+CF + P+ +
Sbjct: 185 -QNIVFGCGQDNSGFTKYSGVLGLGPGTFSIVTR-NFGSK-FSYCFGSLTNPTYPH--NI 239
Query: 182 LVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGG 241
L++G+ A D TP+ ++ + YY+ L+AI+ G L P + + + SQG G
Sbjct: 240 LILGNGAKIEGDP---TPL---QIFQDRYYLDLQAISFGEKLLDIEPGTFQRYRSQG--G 291
Query: 242 LLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDL 301
++D+G + T L Y L + + EV R Y PC DL
Sbjct: 292 TVIDTGCSPTILAREAYETLSEEIDFLL------GEVLRRVKDWDQYTTPCYEGNLKLDL 345
Query: 302 --FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQ 359
FP +TFHF L L + F +S+ S S + + DD V G+ Q
Sbjct: 346 YGFPVVTFHFAGGAELALDVESLF--VSSESGDSFCLAMTMNTFDD-----MSVIGAMAQ 398
Query: 360 QNVEVVYDLEKERIGFQPMDC 380
QN V Y+L ++ FQ DC
Sbjct: 399 QNYNVGYNLRTMKVYFQRTDC 419
>gi|414866064|tpg|DAA44621.1| TPA: putative aspartic protease family protein [Zea mays]
Length = 454
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 106/414 (25%), Positives = 168/414 (40%), Gaps = 58/414 (14%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + +DTGS+L+W+ C + + F+ S SS+ + C+S C
Sbjct: 75 VTMVLDTGSELSWLRCNGSRVPSTP-----PPQAPAAFNGSASSTYAAAHCSSPEC-QWR 128
Query: 62 SSDNPFDP-CTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
D P P C S CR + +Y + GIL DT + G+ P
Sbjct: 129 GRDLPVPPFCAGP--------PSNSCR----VSLSYADASSADGILAADTFLLGGAPP-- 174
Query: 121 IREIPKFCFGCV----------GSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAF 170
+R + FGCV S G+ G RG+LS +Q L+ F++C
Sbjct: 175 VRAL----FGCVTSYSSATATNSSDSEAATGLLGMNRGSLSFVTQTATLR--FAYCI--- 225
Query: 171 KYANDPNISSPLVIGDVAISSKDNLQFTPMLK-SPMYPNY----YYIGLEAITIGNSSLT 225
+ P + LV+G + L +TP+++ S P + Y + LE I +G ++L
Sbjct: 226 APGDGPGL---LVLGGDGAALAPQLNYTPLIQISRPLPYFDRVAYSVQLEGIRVG-AALL 281
Query: 226 EVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLS--ILQSTITYYPRAK-EVEERT 282
+P S+ D G G +VDSGT +T L Y+ L + Q++ P + + +
Sbjct: 282 PIPKSVLAPDHTGAGQTMVDSGTQFTFLLADAYAPLKGEFLNQTSALLAPLGESDFVFQG 341
Query: 283 GFDLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAM----SAPSNSSAVKCL 338
FD C+R + P + L + + Y + + AV CL
Sbjct: 342 AFDACFRASEARVAAASQMLPEVGL-VLRGAEVAVGGEKLLYRVPGERRGEGGAEAVWCL 400
Query: 339 LFQSMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASAQGLHKK 392
F + D + V G QQNV V YDL+ R+GF P C + Q L +
Sbjct: 401 TFGNSDMAGMS-AYVIGHHHQQNVWVEYDLQNGRVGFAPARCDLATATQRLRAR 453
>gi|255563739|ref|XP_002522871.1| DNA binding protein, putative [Ricinus communis]
gi|223537955|gb|EEF39569.1| DNA binding protein, putative [Ricinus communis]
Length = 414
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 85/305 (27%), Positives = 133/305 (43%), Gaps = 34/305 (11%)
Query: 91 SFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIPKFCFGCVGSTYREPI--------GI 142
S+ Y +G + TG+ +D L+ GS IP F FGC + G+
Sbjct: 132 SYTRRYDDGSITTGVAAQDILQSEGSE-----RIP-FYFGCSRDNQNFSVFEHTGKSGGV 185
Query: 143 AGFGRGALSVPSQLGFL-QKGFSHCFLAFKYANDPNISSPLVIGDVAISSKDNLQFTPML 201
G +S+ QL + Q+ FS+C +++ ++P SS L G+ + Q TP++
Sbjct: 186 MGLNTSPVSLLQQLSHITQRRFSYCLNPYQHGSEPPPSSLLRFGNDIRKGRRRFQSTPLM 245
Query: 202 KSPMYPNYYYIGLEAITIGNSSLTEVP--LSLREFDSQGNGGLLVDSGTTYTHLPEPFYS 259
SP PN Y++ L +T+ L P +LR+ G GG ++DSGT T + + Y
Sbjct: 246 SSPDRPN-YFLNLLDMTVAGQRLHLPPGTFALRQ---DGTGGTIIDSGTGLTFITQTAYP 301
Query: 260 QLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQ 319
+L+S Q+ + R + FDLCY N+TF D S+TFHF V Q
Sbjct: 302 RLISAFQNYFDH--RGFQRVHIPEFDLCYSFRG-NHTFHDHA--SMTFHFERADFTV--Q 354
Query: 320 GNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMD 379
++ Y P C+ Q V G+ Q N +YD ++ F +
Sbjct: 355 ADYVY---LPMEDDNAFCVALQPTPPQQ---RTVIGAINQGNTRFIYDAAAHQLLFIAEN 408
Query: 380 CASTA 384
C + A
Sbjct: 409 CRNDA 413
>gi|356568907|ref|XP_003552649.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 490
Score = 91.7 bits (226), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 104/400 (26%), Positives = 174/400 (43%), Gaps = 69/400 (17%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKL-MSNFSPSRSSSSSRDTCASSFCLNIHSSD 64
+DTGSD+ WV C C +C N + ++ + SSS C FC I+
Sbjct: 102 VDTGSDIMWVNC----IQCKECPTRSNLGMDLTLYDIKESSSGKFVPCDQEFCKEINGG- 156
Query: 65 NPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREI 124
++GC+ + CP + YG+G G +D + S + +
Sbjct: 157 ------LLTGCTANI--------SCP-YLEIYGDGSSTAGYFVKDIVLYDQVSGDLKTDS 201
Query: 125 PK--FCFGC-------VGSTYREPIG-IAGFGRGALSVPSQL---GFLQKGFSHCFLAFK 171
FGC + S+ E +G I GFG+ S+ SQL G ++K F+HC
Sbjct: 202 ANGSIVFGCGARQSGDLSSSNEEALGGILGFGKANSSMISQLASSGKVKKMFAHCL---- 257
Query: 172 YANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSL 231
N N IG V + + TP+L P P +Y + + A+ +G++ L+ +
Sbjct: 258 --NGVNGGGIFAIGHVV---QPKVNMTPLL--PDQP-HYSVNMTAVQVGHAFLSLSTDTS 309
Query: 232 REFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFD--LCYR 289
+ D +G ++DSGTT +LPE Y L + I+ +P +++ RT D C++
Sbjct: 310 TQGDRKGT---IIDSGTTLAYLPEGIYEPL---VYKIISQHP---DLKVRTLHDEYTCFQ 360
Query: 290 VPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQS--MDDGD 347
+ DD FP++TF+F N +SL + ++ + S C+ +Q+ D
Sbjct: 361 Y----SESVDDGFPAVTFYFENGLSLKVYPHDYLFP------SGDFWCIGWQNSGTQSRD 410
Query: 348 YGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASAQ 387
+ G N V YDLE + IG+ +C+S+ +
Sbjct: 411 SKNMTLLGDLVLSNKLVFYDLENQVIGWTEYNCSSSIKVR 450
>gi|242058093|ref|XP_002458192.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
gi|241930167|gb|EES03312.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
Length = 468
Score = 91.7 bits (226), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 93/381 (24%), Positives = 155/381 (40%), Gaps = 55/381 (14%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DTGSDL+WV C C Y + F PS+SS+ + C + C ++ +
Sbjct: 139 LLIDTGSDLSWVQCQ----PCNSTTCYPQKDPL--FDPSKSSTYAPIPCNTDACRDL--T 190
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
D+ + GC+ S + C FA TYG+G G+ + +TL + +PG+
Sbjct: 191 DDGYG----GGCA-SGDGAAQC-----GFAITYGDGSQTRGVYSNETLAL---APGV--A 235
Query: 124 IPKFCFGC---VGSTYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNIS 179
+ F FGC + G+ G G S+ Q + G FS+C A
Sbjct: 236 VKDFRFGCGHDQDGANDKYDGLLGLGGAPESLVVQTASVYGGAFSYCLPALNNQVGFLAL 295
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
+ + FTPM++ +Y + + IT+G + P + +
Sbjct: 296 GGGGAPSGGVVNTSGFVFTPMIREE--ETFYVVNMTGITVGGEPIDVPPSAF-------S 346
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTD 299
GG+++DSGT T L Y+ L + + + YP + E D CY + +++
Sbjct: 347 GGMIIDSGTVVTELQHTAYNALQAAFRKAMAAYPLVRNGE----LDTCYDF----SGYSN 398
Query: 300 DLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQ 359
P + F G + P+ CL FQ D G+ G+ Q
Sbjct: 399 VTLPKVALTF---------SGGATIDLDVPNGILLDDCLAFQESGPDDQ--PGILGNVNQ 447
Query: 360 QNVEVVYDLEKERIGFQPMDC 380
+ +EV+YD + R+GF+ C
Sbjct: 448 RTLEVLYDAGRGRVGFRAAVC 468
>gi|15226317|ref|NP_180370.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4063755|gb|AAC98463.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197953|gb|AAM15327.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252977|gb|AEC08071.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 392
Score = 91.7 bits (226), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 95/386 (24%), Positives = 158/386 (40%), Gaps = 75/386 (19%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
I+ +DTGSDL W C C +C ++ F PS SS+ C + C
Sbjct: 74 IEAEIDTGSDLIWTQC----MPCTNC----YSQYAPIFDPSNSSTFKEKRCNGNSC---- 121
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
+ Y + G L +T+ +H +S G
Sbjct: 122 -----------------------------HYKIIYADTTYSKGTLATETVTIHSTS-GEP 151
Query: 122 REIPKFCFGC-VGSTYREPI--GIAGFGRGALSVPSQLGFLQKGF-SHCFLAFKYANDPN 177
+P+ GC S++ +P G+ G G S+ +Q+G G S+CF +
Sbjct: 152 FVMPETTIGCGHNSSWFKPTFSGMVGLSWGPSSLITQMGGEYPGLMSYCFAS-------Q 204
Query: 178 ISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
+S + G AI + D + T M + P YY+ L+A+++G++ + + + +
Sbjct: 205 GTSKINFGTNAIVAGDGVVSTTMFLTTAKPGLYYLNLDAVSVGDTHVETMGTTFHALE-- 262
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFD-LCYRVPCPNNT 296
G +++DSGTT T+ P + ++++ + +Y A + TG D LCY T
Sbjct: 263 --GNIIIDSGTTLTYFPVSY----CNLVREAVDHYVTAVRTADPTGNDMLCYY------T 310
Query: 297 FTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGS 356
T D+FP IT HF LVL + Y M + + CL + +FG+
Sbjct: 311 DTIDIFPVITMHFSGGADLVLDK----YNMYIETITRGTFCLAIICNNPPQ---DAIFGN 363
Query: 357 FQQQNVEVVYDLEKERIGFQPMDCAS 382
Q N V YD + F P +C++
Sbjct: 364 RAQNNFLVGYDSSSLLVSFSPTNCSA 389
>gi|242044724|ref|XP_002460233.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
gi|241923610|gb|EER96754.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
Length = 512
Score = 91.7 bits (226), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 104/388 (26%), Positives = 165/388 (42%), Gaps = 53/388 (13%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V +DT S+LTWV C C C D ++ F PS S S + C SS C + +
Sbjct: 166 VIVDTASELTWVQCA----PCESCHDQQDPL----FDPSSSPSYAAVPCNSSSCDALQLA 217
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
SG + + + C S+ +Y +G G+L D L + G
Sbjct: 218 TG-----GTSGGAAACQGQDQSAAAC-SYTLSYRDGSYSRGVLAHDRLSLAGEV------ 265
Query: 124 IPKFCFGCVGSTYREPIG----IAGFGRGALSVPSQ-LGFLQKGFSHCFLAFKYANDPNI 178
I F FGC S P G + G GR LS+ SQ + FS+C L K + +
Sbjct: 266 IDGFVFGCGTSNQGPPFGGTSGLMGLGRSQLSLVSQTMDQFGGVFSYC-LPLK---ESDS 321
Query: 179 SSPLVIGDVAISSKDN--LQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
S LVIGD + +++ + + M+ P+ +Y++ L IT+G + S
Sbjct: 322 SGSLVIGDDSSVYRNSTPIVYASMVSDPLQGPFYFVNLTGITVGGQEVESSGFSSGGGGG 381
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGF---DLCYRVPCP 293
+ ++DSGT T L Y+ + + S YP+A GF D C+ +
Sbjct: 382 KA----IIDSGTVITSLVPSIYNAVKAEFLSQFAEYPQAP------GFSILDTCFNM--- 428
Query: 294 NNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGV 353
+ PS+ F V + + G Y +S S+SS V CL + +Y + +
Sbjct: 429 -TGLREVQVPSLKLVFDGGVEVEVDSGGVLYFVS--SDSSQV-CLAMAPLKS-EY-ETNI 482
Query: 354 FGSFQQQNVEVVYDLEKERIGFQPMDCA 381
G++QQ+N+ V++D ++GF C
Sbjct: 483 IGNYQQKNLRVIFDTSGSQVGFAQETCG 510
>gi|449442641|ref|XP_004139089.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 478
Score = 91.7 bits (226), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 105/398 (26%), Positives = 171/398 (42%), Gaps = 69/398 (17%)
Query: 3 QVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKL-MSNFSPSRSSSSSRDTCASSFCLNIH 61
V +DTGSD+ WV C C +C + + + ++P SS+S+ TC FC +
Sbjct: 87 HVQVDTGSDILWVNC----VGCSNCPKKSDIGVDLQLYNPKSSSTSTLITCDQPFCSATY 142
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGS--SPG 119
+ P GC L + + YG+G G D +++ + +
Sbjct: 143 DAPIP-------GCKPDLLCQ---------YKVIYGDGSATAGYFVNDYIQLQRAVGNHK 186
Query: 120 IIREIPKFCFGC-------VGSTYREPIGIAGFGRGALSVPSQL---GFLQKGFSHCFLA 169
FGC +GS+ GI GFG+ S+ SQL G ++K F+HC
Sbjct: 187 TSETNGSIVFGCGAKQSGELGSSSEALDGILGFGQANSSMISQLAATGKVKKIFAHCL-- 244
Query: 170 FKYANDPNISSPLVIGDVAISSKDNLQFTPMLKS-PMYPN--YYYIGLEAITIGNSSLTE 226
+ + IG+V P LK+ P+ PN +Y + L + +G+++L +
Sbjct: 245 ----DSISGGGIFAIGEVV---------EPKLKTTPVVPNQAHYNVVLNGVKVGDTAL-D 290
Query: 227 VPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDL 286
+PL L F++ G ++DSGTT +LP+ Y L +++ + P K F
Sbjct: 291 LPLGL--FETSYKRGAIIDSGTTLAYLPDSIY---LPLMEKILGAQPDLKLRTVDDQFT- 344
Query: 287 CYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQS--MD 344
C+ + DD FP++TF F SL+L H Y + V C+ +Q+
Sbjct: 345 CFVF----DKNVDDGFPTVTFKF--EESLILTIYPHEYLFQIRDD---VWCVGWQNSGAQ 395
Query: 345 DGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCAS 382
D + G QN V Y+LE + IG+ +C+S
Sbjct: 396 SKDGNEVTLLGDLVLQNKLVYYNLENQTIGWTEYNCSS 433
>gi|326532056|dbj|BAK01404.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 506
Score = 91.7 bits (226), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 104/387 (26%), Positives = 161/387 (41%), Gaps = 53/387 (13%)
Query: 3 QVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHS 62
++ MDTGSDL W+ C C+DC + F P+ S S TC C +
Sbjct: 163 RMIMDTGSDLNWLQCA----PCLDCFEQSG----PIFDPAASISYRNVTCGDDRCRLV-- 212
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIR 122
+P C + PCP + Y YG+ TG L + V+ + G R
Sbjct: 213 --SPPAESAPREC------RRPRSDPCPYY-YWYGDQSNTTGDLALEAFTVNLTQSGT-R 262
Query: 123 EIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGFLQKG--FSHCFLAFKYANDPN 177
+ FGC + G+ G GRG LS SQL + G FS+C + A
Sbjct: 263 RVDGVAFGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRGVYGGHAFSYCLVEHGSA---- 318
Query: 178 ISSPLVIG-DVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
S ++ G D A+ + L +T + +YY+ L++I +G ++ D+
Sbjct: 319 AGSKIIFGHDDALLAHPQLNYTAFAPTTDADTFYYLQLKSILVGGEAVNI------SSDT 372
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTIT-YYPRAKEVEERTGFDLCYRVPCPNN 295
GG ++DSGTT ++ PEP Y + ++ YP GF + PC N
Sbjct: 373 LSAGGTIIDSGTTLSYFPEPAYQAIRQAFIDRMSPSYPLI------LGFPVL--SPCYNV 424
Query: 296 TFTDDL-FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVF 354
+ + + P ++ F + + P N+F + + CL +
Sbjct: 425 SGAEKVEVPELSLVFADGAAWEFPAENYFIRL----EPEGIMCLAVLGTPRSGMS---II 477
Query: 355 GSFQQQNVEVVYDLEKERIGFQPMDCA 381
G++QQQN V+YDLE R+GF P CA
Sbjct: 478 GNYQQQNFHVLYDLEHNRLGFAPRRCA 504
>gi|449476186|ref|XP_004154665.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
2-like [Cucumis sativus]
Length = 478
Score = 91.7 bits (226), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 104/397 (26%), Positives = 169/397 (42%), Gaps = 67/397 (16%)
Query: 3 QVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKL-MSNFSPSRSSSSSRDTCASSFCLNIH 61
V +DTGSD+ WV C C +C + + + ++P SS+S+ TC FC +
Sbjct: 87 HVQVDTGSDILWVNC----VGCSNCPKKSDIGVDLQLYNPKSSSTSTLITCDQPFCSATY 142
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGS--SPG 119
+ P GC L + + YG+G G D +++ + +
Sbjct: 143 DAPIP-------GCKPDLLCQ---------YKVIYGDGSATAGYFVNDYIQLQRAVGNHK 186
Query: 120 IIREIPKFCFGC-------VGSTYREPIGIAGFGRGALSVPSQL---GFLQKGFSHCFLA 169
FGC +GS+ GI GFG+ S+ SQL G ++K F+HC
Sbjct: 187 TSETNGSIVFGCGAKQSGELGSSSEALDGILGFGQANSSMISQLAATGKVKKIFAHCL-- 244
Query: 170 FKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPN--YYYIGLEAITIGNSSLTEV 227
+ + IG+V N +P+ PN +Y + L + +G+++L ++
Sbjct: 245 ----DSISGGGIFAIGEVVEPKLXN--------TPVVPNQAHYNVVLNGVKVGDTAL-DL 291
Query: 228 PLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLC 287
PL L F++ G ++DSGTT +LPE Y L +++ + P K F C
Sbjct: 292 PLGL--FETSYKRGAIIDSGTTLAYLPESIY---LPLMEKILGAQPDLKLRTVDDQFT-C 345
Query: 288 YRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQS--MDD 345
+ + DD FP++TF F SL+L H Y + V C+ +Q+
Sbjct: 346 FVF----DKNVDDGFPTVTFKF--EESLILTIYPHEYLFQIRDD---VWCVGWQNSGAQS 396
Query: 346 GDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCAS 382
D + G QN V Y+LE + IG+ +C+S
Sbjct: 397 KDGNEVTLLGDLVLQNKLVYYNLENQTIGWTEYNCSS 433
>gi|302141912|emb|CBI19115.3| unnamed protein product [Vitis vinifera]
Length = 521
Score = 91.3 bits (225), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 102/405 (25%), Positives = 160/405 (39%), Gaps = 77/405 (19%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDY------RNNKLMSNFSPSRSSSSSRDTCASSFC 57
V +D GSDL WVPC DCM C R + ++ +SPS SS+S +C
Sbjct: 108 VALDAGSDLLWVPC-----DCMQCAPLSASYYDRLGRDLNEYSPSLSSTSKPLSCNDQL- 161
Query: 58 LNIHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKV---- 113
C L + KS+ PCP A Y E +G+L D L +
Sbjct: 162 -----------------CELGSDCKSS-KDPCPYLASYYSENTSSSGLLIEDRLHLAPFS 203
Query: 114 -HGSSPGIIREIPKFCFGCVGSTYRE---PIGIAGFGRGALSVPSQL---GFLQKGFSHC 166
H S + + C + + P G+ G G G LSVPS L G ++ FS C
Sbjct: 204 EHASRSSVWASVIIGCGRKQSGAFSDGAAPDGLMGLGPGDLSVPSLLAKAGLVRNTFSIC 263
Query: 167 FLAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTE 226
F D N S ++ GD + ++ + F P+ + Y I +E +G+SSL
Sbjct: 264 F-------DDNHSGTILFGDQGLVTQKSTSFVPLEGKFV---TYLIEVEGYLVGSSSLKT 313
Query: 227 VPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDL 286
LVDSGT++T LP Y +++ + R+ F
Sbjct: 314 AGFQ-----------ALVDSGTSFTFLPYEIYEKIVVEFDKQVN--------ATRSSFKG 354
Query: 287 CYRVPCPNNTFTDDL-FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDD 345
C N++ + L P++T F N S ++ N + + + V CL Q + +
Sbjct: 355 SPWKYCYNSSSQELLNIPTVTLVFAMNQSFIV--HNPVIKLISENEEFNVFCLPIQPIHE 412
Query: 346 GDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASAQGLH 390
G+ G +V+D E ++G+ +C + +H
Sbjct: 413 ----EFGIIGQNFMWGYRMVFDRENLKLGWSTSNCQDITDGKIMH 453
>gi|359492825|ref|XP_002284255.2| PREDICTED: aspartic proteinase-like protein 1-like [Vitis vinifera]
Length = 531
Score = 91.3 bits (225), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 102/405 (25%), Positives = 160/405 (39%), Gaps = 77/405 (19%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDY------RNNKLMSNFSPSRSSSSSRDTCASSFC 57
V +D GSDL WVPC DCM C R + ++ +SPS SS+S +C
Sbjct: 118 VALDAGSDLLWVPC-----DCMQCAPLSASYYDRLGRDLNEYSPSLSSTSKPLSCNDQL- 171
Query: 58 LNIHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKV---- 113
C L + KS+ PCP A Y E +G+L D L +
Sbjct: 172 -----------------CELGSDCKSS-KDPCPYLASYYSENTSSSGLLIEDRLHLAPFS 213
Query: 114 -HGSSPGIIREIPKFCFGCVGSTYRE---PIGIAGFGRGALSVPSQL---GFLQKGFSHC 166
H S + + C + + P G+ G G G LSVPS L G ++ FS C
Sbjct: 214 EHASRSSVWASVIIGCGRKQSGAFSDGAAPDGLMGLGPGDLSVPSLLAKAGLVRNTFSIC 273
Query: 167 FLAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTE 226
F D N S ++ GD + ++ + F P+ + Y I +E +G+SSL
Sbjct: 274 F-------DDNHSGTILFGDQGLVTQKSTSFVPLEGKFV---TYLIEVEGYLVGSSSLKT 323
Query: 227 VPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDL 286
LVDSGT++T LP Y +++ + R+ F
Sbjct: 324 AGFQ-----------ALVDSGTSFTFLPYEIYEKIVVEFDKQVN--------ATRSSFKG 364
Query: 287 CYRVPCPNNTFTDDL-FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDD 345
C N++ + L P++T F N S ++ N + + + V CL Q + +
Sbjct: 365 SPWKYCYNSSSQELLNIPTVTLVFAMNQSFIV--HNPVIKLISENEEFNVFCLPIQPIHE 422
Query: 346 GDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASAQGLH 390
G+ G +V+D E ++G+ +C + +H
Sbjct: 423 ----EFGIIGQNFMWGYRMVFDRENLKLGWSTSNCQDITDGKIMH 463
>gi|242046812|ref|XP_002461152.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
gi|241924529|gb|EER97673.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
Length = 452
Score = 91.3 bits (225), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 103/381 (27%), Positives = 157/381 (41%), Gaps = 60/381 (15%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DT +D W+PC C C F P+ S+S C S C +
Sbjct: 127 VDTSNDAAWIPCAG----CAGCP----TSSAPPFDPAASTSYRSVPCGSPLC-----AQA 173
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
P C G + C F+ TY + L L++D+L V G + +
Sbjct: 174 PNAACPPGG------------KAC-GFSLTYADSSL-QAALSQDSLAVAGDA------VK 213
Query: 126 KFCFGCVGS---TYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNISSP 181
+ FGC+ T P G+ G GRG LS SQ + +G FS+C +FK N S
Sbjct: 214 TYTFGCLQKATGTAAPPQGLLGLGRGPLSFLSQTRDMYQGTFSYCLPSFKSL---NFSGT 270
Query: 182 LVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGG 241
L +G + ++ TP+L +P + YY+ + I +G + +P FD G
Sbjct: 271 LRLGRNGQPPR--IKTTPLLANPHRSSLYYVNMTGIRVGR-KVVPIPPPALAFDPATGAG 327
Query: 242 LLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDL 301
++DSGT +T L P Y + ++ + V GFD C+ NT T
Sbjct: 328 TVLDSGTMFTRLVAPAYVAVRDEVRRRV-----GAPVSSLGGFDTCF------NT-TAVA 375
Query: 302 FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQN 361
+P +T F + + + LP+ N S + CL + DG V S QQQN
Sbjct: 376 WPPVTLLF-DGMQVTLPEENVVIH----STYGTISCLAMAAAPDGVNTVLNVIASMQQQN 430
Query: 362 VEVVYDLEKERIGFQPMDCAS 382
V++D+ R+GF C +
Sbjct: 431 HRVLFDVPNGRVGFARERCTA 451
>gi|159464048|ref|XP_001690254.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
gi|158284242|gb|EDP09992.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
Length = 485
Score = 91.3 bits (225), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 104/394 (26%), Positives = 159/394 (40%), Gaps = 71/394 (18%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V +DTGS +T++PC DC C + F P +S+++ + C C
Sbjct: 28 VIIDTGSTITYIPCK----DCSHCGKH----TAEWFDPDKSTTAKKLACGDPLC------ 73
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
C C TC ++ TY E G + DT S +
Sbjct: 74 -----NCGTPSC--------TCNNDRCYYSRTYAERSSSEGWMIEDTFGFPDSDSPV--- 117
Query: 124 IPKFCFGC----VGSTYREPI-GIAGFGRGALSVPSQL---GFLQKGFSHCFLAFKYAND 175
+ FGC G YR+ GI G G + SQL ++ FS CF Y D
Sbjct: 118 --RLVFGCENGETGEIYRQMADGIMGMGNNHNAFQSQLVQRKVIEDVFSLCF---GYPKD 172
Query: 176 PNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFD 235
L++GDV + N +TP+L + ++ +YY + ++ IT+ +L FD
Sbjct: 173 ----GILLLGDVTLPEGANTVYTPLL-THLHLHYYNVKMDGITVNGQTLA---FDASVFD 224
Query: 236 SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFD-----LCYRV 290
+G G +L DSGTT+T+LP + + + Y K ++ G D +C++
Sbjct: 225 -RGYGTVL-DSGTTFTYLP----TDAFKAMAKAVGDYVEKKGLQSTPGADPQYNDICWKG 278
Query: 291 PCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGP 350
D FP F F L LP + + +S P A CL D+G+ G
Sbjct: 279 APDQFKDLDKYFPPAEFVFGGGAKLTLPPLRYLF-LSKP----AEYCLGI--FDNGNSG- 330
Query: 351 SGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTA 384
+ G ++V V YD ++GF M CA A
Sbjct: 331 -ALVGGVSVRDVVVTYDRRNSKVGFTTMACADVA 363
>gi|115484725|ref|NP_001067506.1| Os11g0215400 [Oryza sativa Japonica Group]
gi|77549255|gb|ABA92052.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113644728|dbj|BAF27869.1| Os11g0215400 [Oryza sativa Japonica Group]
Length = 428
Score = 90.9 bits (224), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 95/385 (24%), Positives = 157/385 (40%), Gaps = 68/385 (17%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCL----N 59
V +DTGS +WV C +CD N F SRS++ ++ +C +S CL +
Sbjct: 97 VEIDTGSSTSWVFC--------ECDGCHTNP--RTFLQSRSTTCAKVSCGTSMCLLGGSD 146
Query: 60 IHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPG 119
H D+ P CP F +Y +G GIL +DTL
Sbjct: 147 PHCQDSENYP------------------DCP-FRVSYQDGSASYGILYQDTLTFSD---- 183
Query: 120 IIREIPKFCFGCV-----GSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYAN 174
+++IP F FGC + + G+ G G G +SV Q FS+C K
Sbjct: 184 -VQKIPGFSFGCNMDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDCFSYCLPLQKSER 242
Query: 175 D--PNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLR 232
+ +G VA ++ ++++T M+ +++ L AI++ L P
Sbjct: 243 GFFSKTTGYFSLGKVA--TRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVF- 299
Query: 233 EFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPC 292
G++ DSG+ +++P+ S L ++ + A+E ER +D+
Sbjct: 300 -----SRKGVVFDSGSELSYIPDRALSVLSQRIRELLLKRGAAEEESERNCYDM------ 348
Query: 293 PNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSG 352
+ + P+I+ HF + L G+H + V CL F +
Sbjct: 349 --RSVDEGDMPAISLHFDDGARFDL--GSHGVFVERSVQEQDVWCLAFAPTE-----SVS 399
Query: 353 VFGSFQQQNVEVVYDLEKERIGFQP 377
+ GS Q + EVVYDL+++ IG P
Sbjct: 400 IIGSLMQTSKEVVYDLKRQLIGIGP 424
>gi|356535252|ref|XP_003536162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 475
Score = 90.9 bits (224), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 106/403 (26%), Positives = 171/403 (42%), Gaps = 70/403 (17%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKL---MSNFSPSRSSSSSRDTCASSFCLNI 60
V +DTGSD+ WV C C C R + L ++ + P S +S +C FC
Sbjct: 85 VQVDTGSDILWVNC----VKCSRCP--RKSDLGIDLTLYDPKGSETSELISCDQEFCSAT 138
Query: 61 HSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
+ D P C KS PCP ++ TYG+G TG +D L + + +
Sbjct: 139 Y--DGPIPGC-----------KSEI--PCP-YSITYGDGSATTGYYVQDYLTYNHVNDNL 182
Query: 121 IREIPK---FCFGC-------VGSTYREPI-GIAGFGRGALSVPSQL---GFLQKGFSHC 166
R P+ FGC + S+ E + GI GFG+ SV SQL G ++K FSHC
Sbjct: 183 -RTAPQNSSIIFGCGAVQSGTLSSSSEEALDGIIGFGQSNSSVLSQLAASGKVKKIFSHC 241
Query: 167 FLAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTE 226
++ IG+V P + +Y + L++I + ++ + +
Sbjct: 242 L------DNIRGGGIFAIGEVVEPKVSTTPLVPRMA------HYNVVLKSIEV-DTDILQ 288
Query: 227 VPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDL 286
+P + FDS G ++DSGTT +LP Y +L+ + PR K F
Sbjct: 289 LPSDI--FDSGNGKGTIIDSGTTLAYLPAIVYDELIP---KVMARQPRLKLYLVEQQFS- 342
Query: 287 CYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQ-SMDD 345
C++ D FP + HF +++SL + ++ + + C+ +Q S+
Sbjct: 343 CFQY----TGNVDRGFPVVKLHFEDSLSLTVYPHDYLFQF-----KDGIWCIGWQKSVAQ 393
Query: 346 GDYGPS-GVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASAQ 387
G + G N V+YDLE IG+ +C+S+ +
Sbjct: 394 TKNGKDMTLLGDLVLSNKLVIYDLENMAIGWTDYNCSSSIKVK 436
>gi|297845610|ref|XP_002890686.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297336528|gb|EFH66945.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 486
Score = 90.9 bits (224), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 109/384 (28%), Positives = 165/384 (42%), Gaps = 66/384 (17%)
Query: 3 QVYM--DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNI 60
+VYM DTGSD+ W+ C C DC Y + + F PS SSS +C + C
Sbjct: 163 EVYMVLDTGSDVNWLQCT----PCADC--YHQTEPI--FEPSSSSSYEPLSCDTPQC--- 211
Query: 61 HSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
+ +S C +T L + +YG+G G +TL + GS+ +
Sbjct: 212 -------NALEVSECRNATCL----------YEVSYGDGSYTVGDFATETLTI-GST--L 251
Query: 121 IREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISS 180
++ + C + G+ G G G L++PSQL FS+C + D + +S
Sbjct: 252 VQNVAVGCGHSNEGLFVGAAGLLGLGGGLLALPSQLN--TTSFSYCLV----DRDSDSAS 305
Query: 181 PLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNG 240
+ G S + P+L++ +YY+GL I++G L ++P S E D G+G
Sbjct: 306 TVEFG---TSLPPDAVVAPLLRNHQLDTFYYLGLTGISVGG-ELLQIPQSSFEMDESGSG 361
Query: 241 GLLVDSGTTYTHLPEPFYSQLL-SILQSTITYYPRAKEVEERTG---FDLCYRVPCPNNT 296
G+++DSGT T L Y+ L S L+ T ++E+ G FD CY +
Sbjct: 362 GIIIDSGTAVTRLQTGIYNSLRDSFLKGT-------SDLEKAAGVAMFDTCYNLSAK--- 411
Query: 297 FTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGS 356
T P++ FHF L LP N+ P +S CL F + G+
Sbjct: 412 -TTIEVPTVAFHFPGGKMLALPAKNYMI----PVDSVGTFCLAFAPTASS----LAIIGN 462
Query: 357 FQQQNVEVVYDLEKERIGFQPMDC 380
QQQ V +DL IGF C
Sbjct: 463 VQQQGTRVTFDLANSLIGFSSNKC 486
>gi|224080963|ref|XP_002306246.1| predicted protein [Populus trichocarpa]
gi|222855695|gb|EEE93242.1| predicted protein [Populus trichocarpa]
Length = 382
Score = 90.9 bits (224), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 100/380 (26%), Positives = 160/380 (42%), Gaps = 62/380 (16%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+D+GSD+ WV C C C Y + F P+ S+S +C+S+ C
Sbjct: 60 IDSGSDIVWVQCK----PCTQC--YHQTDPL--FDPADSASFMGVSCSSAVC-------- 103
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
D +GC S CR + +YG+G G L +TL + + +++ +
Sbjct: 104 --DQVDNAGC------NSGRCR----YEVSYGDGSSTKGTLALETLTLGRT---VVQNVA 148
Query: 126 KFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQ-KGFSHCFLAFKYANDPNISSPLVI 184
C + G+ G G G++S QL + FS+C ++ N + L
Sbjct: 149 IGCGHMNQGMFVGAAGLLGLGGGSMSFVGQLSRERGNAFSYCLVS----RVTNSNGFLEF 204
Query: 185 GDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLS--LREFDSQGNGGL 242
G A+ + P++++P P+YYYIGL + +G+ +VP+S + E GNGG+
Sbjct: 205 GSEAMPV--GAAWIPLIRNPHSPSYYYIGLSGLGVGD---MKVPISEDIFELTELGNGGV 259
Query: 243 LVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDLF 302
++D+GT T P Y PRA V FD CY + F
Sbjct: 260 VMDTGTAVTRFPTVAYEAFRDAFIDQTGNLPRASGVSI---FDTCYNL----FGFLSVRV 312
Query: 303 PSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSG--VFGSFQQQ 360
P+++F+F L LP N P + + C F PSG + G+ QQ+
Sbjct: 313 PTVSFYFSGGPILTLPANNFLI----PVDDAGTFCFAFAP------SPSGLSILGNIQQE 362
Query: 361 NVEVVYDLEKERIGFQPMDC 380
+++ D E +GF P C
Sbjct: 363 GIQISVDGANEFVGFGPNVC 382
>gi|118482048|gb|ABK92955.1| unknown [Populus trichocarpa]
Length = 425
Score = 90.9 bits (224), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 103/380 (27%), Positives = 159/380 (41%), Gaps = 65/380 (17%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DT +D W+PC C+ C + F+ S++ C + C +
Sbjct: 107 LDTSNDAAWIPCNG----CVGCSS-------TVFNSVTSTTFKTLGCDAPQCKQV----- 150
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
P C S C+ +T TYG G + LTRDT+ + +P
Sbjct: 151 PNPTCGGSTCTWNT---------------TYG-GSTILSNLTRDTIALSTD------IVP 188
Query: 126 KFCFGCVGSTYRE---PIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNISSP 181
+ FGC+ T P G+ G GRG LS SQ L K FS+C +F+ N S
Sbjct: 189 GYTFGCIQKTTGSSVPPQGLLGLGRGPLSFLSQTQDLYKSTFSYCLPSFRTLN---FSGT 245
Query: 182 LVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGG 241
L +G + ++ TP+LK+P + YY+ L I +G + ++P S F+ G
Sbjct: 246 LRLGPAGQPLR--IKTTPLLKNPRRSSLYYVNLIGIRVGRK-IVDIPASALAFNPTTGAG 302
Query: 242 LLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDL 301
+ DSGT +T L P Y+ + + + V GFD CY P +
Sbjct: 303 TIFDSGTVFTRLVAPVYTAVRDEFRKRVG----NAIVSSLGGFDTCYTGPI--------V 350
Query: 302 FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQN 361
P++TF F + +++ LP N +A S S CL + D V + QQQN
Sbjct: 351 APTMTFMF-SGMNVTLPTDNLLIRSTAGSTS----CLAMAAAPDNVNSVLNVIANMQQQN 405
Query: 362 VEVVYDLEKERIGFQPMDCA 381
+++D+ RIG C+
Sbjct: 406 HRILFDVPNSRIGVAREPCS 425
>gi|362799904|dbj|BAL41445.1| aspartyl protease 1 [Linum grandiflorum]
Length = 449
Score = 90.9 bits (224), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 117/403 (29%), Positives = 169/403 (41%), Gaps = 71/403 (17%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
I DTGSDLTW+ S C C + F PS S++ + C ++
Sbjct: 93 ILAIADTGSDLTWLQ----SKPCDQCYPQKG----PIFDPSNSTTFHKLPCTTA------ 138
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
PC S + T C + Y+YG+ TG L DT+ V +S
Sbjct: 139 -------PCNALDESARSCTDPTTC----GYTYSYGDHSYTTGYLASDTVTVGNAS---- 183
Query: 122 REIPKFCFGCV----GSTYREPIGIAGFGRGALSVPSQLG-FLQKGFSHCFLAFK----- 171
+I FGC G+ + GI G G G LS SQLG + K FS+C L +
Sbjct: 184 VQIRNVAFGCGTRNGGNFDEQGSGIVGLGGGNLSFVSQLGDTIGKKFSYCLLPLENEISS 243
Query: 172 YANDPNISSPLVIGDVAI---SSKDNLQF--TPML-KSPMYPNYYYIGLEAITIGNSSLT 225
+D +S +V GD + SS + + F TP++ K P YYY+ +EAIT+G L
Sbjct: 244 QPSDSPATSRIVFGDNPVFSSSSTNGVVFATTPLVNKEP--STYYYLTIEAITVGRKKLL 301
Query: 226 EVPLSLR--EFDSQGN-----GGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEV 278
S + +DS G +++DSGTT T L E FY L + L I R +V
Sbjct: 302 YSSSSSKTASYDSGSKSSVEEGNIIIDSGTTLTFLEEEFYGALEAALVEEIKM-ERVNDV 360
Query: 279 EERTGFDLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCL 338
+ + F LC++ + + P + HF + L N F + C
Sbjct: 361 K-NSMFSLCFK-----SGKEEVELPLMKVHFRGGADVELKPVNTFVRAE-----EGLVCF 409
Query: 339 LFQSMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
+D G++G+ Q N V YDL K + F P DC+
Sbjct: 410 TMLPTND-----VGIYGNLAQMNFVVGYDLGKRTVSFLPADCS 447
>gi|449445106|ref|XP_004140314.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
gi|449479851|ref|XP_004155727.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 523
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 102/410 (24%), Positives = 165/410 (40%), Gaps = 83/410 (20%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNN------KLMSNFSPSRSSSSSRDTCASSFC 57
V +D GSDL WVPC DC+ C N + +S ++P+ SS+S C C
Sbjct: 118 VALDVGSDLLWVPC-----DCIQCAPLSANYYSVLDRDLSEYNPALSSTSKHLFCGHQLC 172
Query: 58 LNIHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKV---- 113
+ + DPCT Y + +G + D L++
Sbjct: 173 AWSTTCKSANDPCTYK-------------------RDYYSDNTSTSGFMIEDKLQLTSFS 213
Query: 114 -HGSSPGIIREIPKFCFGC---VGSTYRE---PIGIAGFGRGALSVP---SQLGFLQKGF 163
HG+ + + FGC +Y + P G+ G G G +SVP +Q G ++ F
Sbjct: 214 KHGTHSLLQASV---VFGCGRKQSGSYLDGAAPDGVMGLGPGNISVPTLLAQEGLVRNTF 270
Query: 164 SHCFLAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSS 223
S CF D N S ++ GD +++ QF P+ Y+IG+E+ +G+S
Sbjct: 271 SLCF-------DNNGSGRILFGDDGPATQQTTQFLPLFGEFA---AYFIGVESFCVGSSC 320
Query: 224 LTEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTG 283
L + LVDSG+++T+LP Y +++ + + V
Sbjct: 321 L-----------QRSGFQALVDSGSSFTYLPAEVYKKIVFEFDKQVKVNA-TRIVLRELP 368
Query: 284 FDLCYRVPCPNNTFTDDLFPSITFHF-LNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQS 342
++ CY + +T PS+ F LN + + P Y + A + V CL +
Sbjct: 369 WNYCYNI----STLVSFNIPSMQLVFPLNQIFIHDP----VYVLPA-NQGYKVFCLTLEE 419
Query: 343 MDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASAQGLHKK 392
D+ DY GV G +V+D E ++G+ C S+ H K
Sbjct: 420 TDE-DY---GVIGQNLMVGYRMVFDRENLKLGWSKSKCLDINSSTTEHAK 465
>gi|32479948|emb|CAE01594.1| OSJNBa0008A08.2 [Oryza sativa Japonica Group]
gi|38347627|emb|CAE05222.2| OSJNBa0011K22.4 [Oryza sativa Japonica Group]
gi|38567678|emb|CAE75961.1| B1159F04.24 [Oryza sativa Japonica Group]
gi|116309512|emb|CAH66578.1| OSIGBa0137O04.4 [Oryza sativa Indica Group]
Length = 431
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 109/397 (27%), Positives = 165/397 (41%), Gaps = 72/397 (18%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDT-CASSFCLNIHS 62
V +DTGS WV C C + F RSS SS++ C + C
Sbjct: 74 VQLDTGSKAFWVN----GISCKQCPHESDILRKLTFYDPRSSVSSKEVKCDDTIC----- 124
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIR 122
+ P PC M+ CP + Y +GGL GIL D L H G +
Sbjct: 125 TSRP--PCNMT-------------LRCP-YITGYADGGLTMGILFTDLLHYH-QLYGNGQ 167
Query: 123 EIP---KFCFGC----VGSTYREPI---GIAGFGRGALSVPSQL---GFLQKGFSHCFLA 169
P FGC GS + GI GFG + SQL G +K FSHC
Sbjct: 168 TQPTSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCL-- 225
Query: 170 FKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPL 229
+ N IG+V + ++ TP++K+ Y+ + L++I + ++L ++P
Sbjct: 226 ----DSTNGGGIFAIGEVV---EPKVKTTPIVKNNEV--YHLVNLKSINVAGTTL-QLPA 275
Query: 230 SLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDL-CY 288
++ F + G +DSG+T +LPE YS+L+ + + + ++ ++ C+
Sbjct: 276 NI--FGTTKTKGTFIDSGSTLVYLPEIIYSELI------LAVFAKHPDITMGAMYNFQCF 327
Query: 289 RVPCPNNTFTDDLFPSITFHFLNNVSL-VLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGD 347
DD FP ITFHF N+++L V P + Y + N C FQ
Sbjct: 328 HFLGS----VDDKFPKITFHFENDLTLDVYP---YDYLLEYEGNQY---CFGFQDAGIHG 377
Query: 348 YGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTA 384
Y + G N VVYD+EK+ IG+ + A
Sbjct: 378 YKDMIILGDMVISNKVVVYDMEKQAIGWTEHNSVEEA 414
>gi|115434870|ref|NP_001042193.1| Os01g0178600 [Oryza sativa Japonica Group]
gi|55296112|dbj|BAD67831.1| putative CDR1 [Oryza sativa Japonica Group]
gi|55296252|dbj|BAD67993.1| putative CDR1 [Oryza sativa Japonica Group]
gi|113531724|dbj|BAF04107.1| Os01g0178600 [Oryza sativa Japonica Group]
gi|125569253|gb|EAZ10768.1| hypothetical protein OsJ_00604 [Oryza sativa Japonica Group]
Length = 454
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 106/390 (27%), Positives = 162/390 (41%), Gaps = 66/390 (16%)
Query: 7 DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDNP 66
DTGSDL WV C + D + F PSRSS+ R +C + C
Sbjct: 119 DTGSDLVWVKCKK-----GNNDTSSAAAPTTQFDPSRSSTYGRVSCQTDAC--------- 164
Query: 67 FDPCTMSGCSLSTLLKSTC--CRPCPSFAYTYGEGGLVTGILTRDTLKVH----GSSPGI 120
L ++TC C ++ Y YG+G TG+L+ +T G SP
Sbjct: 165 -----------EALGRATCDDGSNC-AYLYAYGDGSNTTGVLSTETFTFDDGGSGRSPRQ 212
Query: 121 IREIPKFCFGCVGSTYRE--PIGIAGFGRGALSVPSQLG---FLQKGFSHCFLAFKYAND 175
+R + FGC +T G+ G G GA+S+ +QLG L + FS+C +
Sbjct: 213 VR-VGGVKFGCSTATAGSFPADGLVGLGGGAVSLVTQLGGATSLGRRFSYCLVPHSV--- 268
Query: 176 PNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFD 235
N SS L G +A ++ TP++ + YY + L+++ +GN ++
Sbjct: 269 -NASSALNFGALADVTEPGAASTPLVAGDV-DTYYTVVLDSVKVGNKTVA---------- 316
Query: 236 SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGF-DLCYRVPCPN 294
S + ++VDSGTT T L ++ L IT P V+ G LCY V
Sbjct: 317 SAASSRIIVDSGTTLTFLDPSLLGPIVDELSRRITLPP----VQSPDGLLQLCYNV-AGR 371
Query: 295 NTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVF 354
+ P +T F ++ L N F A+ CL + + P +
Sbjct: 372 EVEAGESIPDLTLEFGGGAAVALKPENAFVAVQ-----EGTLCLAIVATTEQQ--PVSIL 424
Query: 355 GSFQQQNVEVVYDLEKERIGFQPMDCASTA 384
G+ QQN+ V YDL+ + F DCA ++
Sbjct: 425 GNLAQQNIHVGYDLDAGTVTFAGADCAGSS 454
>gi|300681506|emb|CBH32600.1| pepsin A, putative, expressed [Triticum aestivum]
Length = 477
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 92/399 (23%), Positives = 155/399 (38%), Gaps = 57/399 (14%)
Query: 4 VYMDTGSDLTWVPC-----GNLSFDCMDCDDYRNNKLMSNFSPSRSS-SSSRDTCASSFC 57
+ DTGSDLTWV C N S D S + + S + DTC S
Sbjct: 112 LVADTGSDLTWVKCRRPASANSSLSPADSGPGPGRAFRPEDSRTWAPISCASDTCTKSLP 171
Query: 58 LNIHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSS 117
++ + P PC ++ Y Y +G G + ++ + S
Sbjct: 172 FSLATCPTPGSPC--------------------AYDYRYKDGSAARGTVGTESATIALSG 211
Query: 118 PGIIR-EIPKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLA-- 169
+ ++ GC G ++ G+ G +S S G FS+C +
Sbjct: 212 REERKAKLKGLVLGCSSSYTGPSFEASDGVLSLGYSGISFASHAASRFGGRFSYCLVDHL 271
Query: 170 --------FKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGN 221
+ +P +SSP ++ + TP+L +Y + L+AI++
Sbjct: 272 SPRNATSYLTFGPNPAVSSPRASPSSCAAAAPRARQTPLLLDRRMRPFYDVSLKAISVAG 331
Query: 222 SSLTEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEER 281
L ++P ++ +D + GG+++DSGT+ T L +P Y +++ L + PR
Sbjct: 332 EFL-KIPRAV--WDVEAGGGVILDSGTSLTVLAKPAYRAVVAALSKGLAGLPRVTMDP-- 386
Query: 282 TGFDLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQ 341
F+ CY P+ D P + HF L P G + +AP VKC+ Q
Sbjct: 387 --FEYCYNWTSPSGKDADVAVPKMAVHFAGAARLE-PPGKSYVIDAAP----GVKCIGLQ 439
Query: 342 SMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
+G + V G+ QQ +D++ R+ FQ C
Sbjct: 440 ---EGPWPGISVIGNILQQEHLWEFDIKNRRLKFQRSRC 475
>gi|116310064|emb|CAH67085.1| H0818E04.2 [Oryza sativa Indica Group]
gi|116310187|emb|CAH67199.1| OSIGBa0152K17.11 [Oryza sativa Indica Group]
Length = 464
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 101/408 (24%), Positives = 165/408 (40%), Gaps = 82/408 (20%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFC--LNIHSS 63
+DT SDL W C C C +++ F+P SS+ + C+S C L++H
Sbjct: 106 IDTASDLIWTQCQ----PCTGC----YHQVDPMFNPRVSSTYAALPCSSDTCDELDVHRC 157
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
+ D +C + YTY G L D L + +
Sbjct: 158 GHDDD--------------ESC-----QYTYTYSGNATTEGTLAVDKLVIGEDA------ 192
Query: 124 IPKFCFGCV-----GSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNI 178
FGC G+ + G+ G GRG LS+ SQL + F++C I
Sbjct: 193 FRGVAFGCSTSSTGGAPPPQASGVVGLGRGPLSLVSQLSV--RRFAYCL----PPPASRI 246
Query: 179 SSPLVIGDVAISSKD--NLQFTPMLKSPMYPNYYYIGLEAITIGNSSLT----------- 225
LV+G A ++++ N PM + P YP+YYY+ L+ + IG+ +++
Sbjct: 247 PGKLVLGADADAARNATNRIAVPMRRDPRYPSYYYLNLDGLLIGDRTMSLPPTTTTTATA 306
Query: 226 ----------EVPLSLREFDSQGNG-GLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPR 274
P + N G+++D +T T L Y +L++ L+ I PR
Sbjct: 307 TATAPAPAPTPSPNATAVAVGDANRYGMIIDIASTITFLEASLYDELVNDLEVEIR-LPR 365
Query: 275 AKEVEERTGFDLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSA 334
G DLC+ +P F P++ F + L L + F A S
Sbjct: 366 G--TGSSLGLDLCFILP-DGVAFDRVYVPAVALAF-DGRWLRLDKARLF----AEDRESG 417
Query: 335 VKCLLFQSMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCAS 382
+ CL+ + + G + G+FQQQN++V+Y+L + R+ F C +
Sbjct: 418 MMCLM---VGRAEAGSVSILGNFQQQNMQVLYNLRRGRVTFVQSPCGA 462
>gi|449451076|ref|XP_004143288.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 488
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 101/395 (25%), Positives = 166/395 (42%), Gaps = 71/395 (17%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSN-FSPSRSSSSSRDTCASSFCLNIHS 62
V +DTGSD+ WV C C C D + N F ++SSS+ C C + +
Sbjct: 99 VQIDTGSDILWVTCS----PCDGCPDSSGLGIELNLFDTTKSSSARVLPCTDPICAAVST 154
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLK---------V 113
+ D C L ++ C S+++ Y + +G D++ +
Sbjct: 155 TT---DQC---------LTQTDHC----SYSFHYRDRSGTSGFYVTDSMHFDILLGESTI 198
Query: 114 HGSSPGIIREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQL---GFLQKGFSHCFLAF 170
SS I+ + +G + + GI GFG+G SV SQL G K FSHC
Sbjct: 199 ANSSATIVFGCSIYQYGDLTRATKALDGIFGFGQGEFSVISQLSSRGITPKVFSHCL--- 255
Query: 171 KYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPN--YYYIGLEAITIGNSSL---T 225
N LV+G++ S ++ SP+ P+ +Y + L++I + T
Sbjct: 256 --KGGENGGGILVLGEILEPS--------IVYSPLIPSQPHYTLKLQSIALSGQLFPNPT 305
Query: 226 EVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFD 285
P+S G ++DSGTT +L E Y ++S++ S ++ A R
Sbjct: 306 MFPIS-------NAGETIIDSGTTLAYLVEEVYDWIVSVITSAVS--QSATPTISRG--S 354
Query: 286 LCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDD 345
C+RV D+FP + F+F S+V+ + + A+ C+ FQ +D
Sbjct: 355 QCFRVSMS----VADIFPVLRFNFEGIASMVVTP-EEYLQFDSIVREPALWCIGFQKAED 409
Query: 346 GDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
G + G ++ +VYDL ++RIG+ DC
Sbjct: 410 G----LNILGDLVLKDKIIVYDLARQRIGWANYDC 440
>gi|222637611|gb|EEE67743.1| hypothetical protein OsJ_25435 [Oryza sativa Japonica Group]
Length = 396
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 107/381 (28%), Positives = 159/381 (41%), Gaps = 60/381 (15%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DT +D W+PC C C S F+P+ S+S C S C+ + N
Sbjct: 71 VDTSNDAAWIPCSG----CAGCPTS------SPFNPAASASYRPVPCGSPQCV---LAPN 117
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
P S S KS F+ +Y + L L++DTL V G +
Sbjct: 118 P---------SCSPNAKSC------GFSLSYADSSL-QAALSQDTLAVAGD------VVK 155
Query: 126 KFCFGCV---GSTYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNISSP 181
+ FGC+ T P G+ G GRG LS SQ + FS+C +FK N S
Sbjct: 156 AYTFGCLQRATGTAAPPQGLLGLGRGPLSFLSQTKDMYGATFSYCLPSFKSLN---FSGT 212
Query: 182 LVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGG 241
L +G + ++ TP+L +P + YY+ + I +G + +P S FD G
Sbjct: 213 LRLGRNGQPRR--IKTTPLLANPHRSSLYYVNMTGIRVGKK-VVSIPASALAFDPATGAG 269
Query: 242 LLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDL 301
++DSGT +T L P Y L ++ + A V GFD CY T
Sbjct: 270 TVLDSGTMFTRLVAPVYLALRDEVRRRVGA--GAAAVSSLGGFDTCYN--------TTVA 319
Query: 302 FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQN 361
+P +T F + + + LP+ N + + S CL + DG V S QQQN
Sbjct: 320 WPPVTLLF-DGMQVTLPEENVVIHTTYGTTS----CLAMAAAPDGVNTVLNVIASMQQQN 374
Query: 362 VEVVYDLEKERIGFQPMDCAS 382
V++D+ R+GF C +
Sbjct: 375 HRVLFDVPNGRVGFARESCTA 395
>gi|215766660|dbj|BAG98888.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 433
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 108/388 (27%), Positives = 163/388 (42%), Gaps = 72/388 (18%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDT-CASSFCLNIHS 62
V +DTGS WV C C + F RSS SS++ C + C
Sbjct: 98 VQLDTGSKAFWVN----GISCKQCPHESDILRKLTFYDPRSSVSSKEVKCDDTIC----- 148
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIR 122
+ P PC M+ CP + Y +GGL GIL D L H G +
Sbjct: 149 TSRP--PCNMT-------------LRCP-YITGYADGGLTMGILFTDLLHYH-QLYGNGQ 191
Query: 123 EIP---KFCFGC----VGSTYREPI---GIAGFGRGALSVPSQL---GFLQKGFSHCFLA 169
P FGC GS + GI GFG + SQL G +K FSHC
Sbjct: 192 TQPTSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCL-- 249
Query: 170 FKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPL 229
+ N IG+V + ++ TP++K+ Y+ + L++I + ++L ++P
Sbjct: 250 ----DSTNGGGIFAIGEVV---EPKVKTTPIVKNNEV--YHLVNLKSINVAGTTL-QLPA 299
Query: 230 SLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDL-CY 288
++ F + G +DSG+T +LPE YS+L+ + + + ++ ++ C+
Sbjct: 300 NI--FGTTKTKGTFIDSGSTLVYLPEIIYSELI------LAVFAKHPDITMGAMYNFQCF 351
Query: 289 RVPCPNNTFTDDLFPSITFHFLNNVSL-VLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGD 347
DD FP ITFHF N+++L V P + Y + N C FQ
Sbjct: 352 HFLGS----VDDKFPKITFHFENDLTLDVYP---YDYLLEYEGNQ---YCFGFQDAGIHG 401
Query: 348 YGPSGVFGSFQQQNVEVVYDLEKERIGF 375
Y + G N VVYD+EK+ IG+
Sbjct: 402 YKDMIILGDMVISNKVVVYDMEKQAIGW 429
>gi|326523839|dbj|BAJ96930.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 93/382 (24%), Positives = 158/382 (41%), Gaps = 44/382 (11%)
Query: 7 DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDNP 66
DTGSDLTWV C D + ++ F P+ S S + C+S C +
Sbjct: 128 DTGSDLTWVKCRGRRASSPDASPLASPRV---FRPANSKSWAPIPCSSDTCKS------- 177
Query: 67 FDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRD--TLKVHGSSPGIIREI 124
+ P +++ CS T + C + Y Y + G++ D T+ + GS ++
Sbjct: 178 YVPFSLANCSAGTTPPAPC-----GYDYRYKDKSSARGVVGTDAATIALSGSGSDRKAKL 232
Query: 125 PKFCFGCV----GSTYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNIS 179
+ GC G +++ G+ G +S S+ G FS+C + + N +
Sbjct: 233 QEVVLGCTTSYDGQSFQSSDGVLSLGNSNISFASRAAARFGGRFSYCLV--DHLAPRNAT 290
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
S L G V + + TP+L +Y + ++A+++ +L +P + +D + N
Sbjct: 291 SYLTFGPVGAAHSPSR--TPLLLDAQVAPFYAVTVDAVSVAGKAL-NIPAEV--WDVKKN 345
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTD 299
GG ++DSGT+ T L P Y +++ L + PR F+ CY T
Sbjct: 346 GGAILDSGTSLTILATPAYKAVVAALSKQLARVPRVTM----DPFEYCYNW---TATRRP 398
Query: 300 DLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQ 359
P + F + L P + Y + A + VKC+ Q +G + V G+ Q
Sbjct: 399 PAVPRLEVRFAGSARLRPPTKS--YVIDA---APGVKCIGLQ---EGVWPGVSVIGNILQ 450
Query: 360 QNVEVVYDLEKERIGFQPMDCA 381
Q +DL + FQ CA
Sbjct: 451 QEHLWEFDLANRWLRFQESRCA 472
>gi|297825301|ref|XP_002880533.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
lyrata]
gi|297326372|gb|EFH56792.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
lyrata]
Length = 430
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 103/387 (26%), Positives = 159/387 (41%), Gaps = 61/387 (15%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
MDTGS L W+ C C C N+ + F+P+ SS+ +C FC
Sbjct: 85 MDTGSSLLWIQC----HPCKHCSS--NHMIHPVFNPALSSTFVECSCDDRFCRYA----- 133
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
P C+ + C + Y G G G+L ++ L + + P
Sbjct: 134 PNGHCSSNKCVYEQV-------------YISGTGS--KGVLAKERLTFTTPNGNTVVTQP 178
Query: 126 KFCFGCVGSTYREPI-----GIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISS 180
FGC G E + GI G G S+ QLG FS+C AN +
Sbjct: 179 -IAFGC-GHENGEQLESEFTGILGLGAKPTSLAVQLG---SKFSYCIGDL--ANKNYGYN 231
Query: 181 PLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNG 240
LV+G+ A D L ++ YY+ LE I++G+ L P+ + S+
Sbjct: 232 QLVLGEDA----DILGDPTPIEFETENGIYYMNLEGISVGDKQLNIEPVVFKRRGSRT-- 285
Query: 241 GLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFD--LCYRVPCPNNTFT 298
G+++D+GT YT L + Y +L + ++S + P+ ER F LCY +
Sbjct: 286 GVILDTGTLYTWLADIAYRELYNEIKSILD--PKL----ERFWFRDFLCY-----HGRVN 334
Query: 299 DDL--FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDD--GDYGPSGVF 354
++L FP +TFHF L + + FY M+ V C+ + + G+Y
Sbjct: 335 EELIGFPVVTFHFAGGAELAMEATSMFYPMTESDTYHNVFCMSVRPTTEHGGEYKDFTAI 394
Query: 355 GSFQQQNVEVVYDLEKERIGFQPMDCA 381
G QQ + YDL++ I Q +DC
Sbjct: 395 GLMAQQYYNIAYDLKERNIYLQRIDCV 421
>gi|115458646|ref|NP_001052923.1| Os04g0448500 [Oryza sativa Japonica Group]
gi|38344830|emb|CAD40872.2| OSJNBa0064H22.11 [Oryza sativa Japonica Group]
gi|113564494|dbj|BAF14837.1| Os04g0448500 [Oryza sativa Japonica Group]
Length = 464
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 101/408 (24%), Positives = 165/408 (40%), Gaps = 82/408 (20%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFC--LNIHSS 63
+DT SDL W C C C +++ F+P SS+ + C+S C L++H
Sbjct: 106 IDTASDLIWTQCQ----PCTGC----YHQVDPMFNPRVSSTYAALPCSSDTCDELDVHRC 157
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
+ D +C + YTY G L D L + +
Sbjct: 158 GHDDD--------------ESC-----QYTYTYSGNATTEGTLAVDKLVIGEDA------ 192
Query: 124 IPKFCFGCV-----GSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNI 178
FGC G+ + G+ G GRG LS+ SQL + F++C I
Sbjct: 193 FRGVAFGCSTSSTGGAPPPQASGVVGLGRGPLSLVSQLSV--RRFAYCL----PPPASRI 246
Query: 179 SSPLVIGDVAISSKD--NLQFTPMLKSPMYPNYYYIGLEAITIGNSSLT----------- 225
LV+G A ++++ N PM + P YP+YYY+ L+ + IG+ +++
Sbjct: 247 PGKLVLGADADAARNATNRIAVPMRRDPRYPSYYYLNLDGLLIGDRAMSLPPTTTTTATA 306
Query: 226 ----------EVPLSLREFDSQGNG-GLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPR 274
P + N G+++D +T T L Y +L++ L+ I PR
Sbjct: 307 TATAPAPAPTPSPNATAVAVGDANRYGMIIDIASTITFLEASLYDELVNDLEVEIR-LPR 365
Query: 275 AKEVEERTGFDLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSA 334
G DLC+ +P F P++ F + L L + F A S
Sbjct: 366 G--TGSSLGLDLCFILP-DGVAFDRVYVPAVALAF-DGRWLRLDKARLF----AEDRESG 417
Query: 335 VKCLLFQSMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCAS 382
+ CL+ + + G + G+FQQQN++V+Y+L + R+ F C +
Sbjct: 418 MMCLM---VGRAEAGSVSILGNFQQQNMQVLYNLRRGRVTFVQSPCGA 462
>gi|125555054|gb|EAZ00660.1| hypothetical protein OsI_22681 [Oryza sativa Indica Group]
Length = 337
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 93/362 (25%), Positives = 153/362 (42%), Gaps = 67/362 (18%)
Query: 36 MSNFSPSRSSSSSRDTCASSFCLNIHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYT 95
+++F PSRSS+ + C S C SGCS ST P SF +
Sbjct: 26 LASFDPSRSSTFAPVPCGSPDC--------------RSGCSSG----STPSCPLTSFPF- 66
Query: 96 YGEGGLVTGILTRDTLKVHGSSPGIIREIPKFCFGCVGSTYREPIGIAGF---GRGALSV 152
++G + +D L + S+ + F FGCV + EP+G AG R + S+
Sbjct: 67 ------LSGAVAQDVLTLTPSA-----SVDDFTFGCVEGSSGEPLGAAGLLDLSRDSRSL 115
Query: 153 PSQLGFLQKG-FSHCFLAFKYANDPNISSPLVIGDVAISSKDNLQFT---PMLKSPMYPN 208
S+L G FS+C L + LVIG+ + + + T P++ P +PN
Sbjct: 116 ASRLAAGAGGTFSYC-LPLSTTSSHGF---LVIGEADVPHNRSARVTAVAPLVYDPAFPN 171
Query: 209 YYYIGLEAITIGNSSLTEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQST 268
+Y I L +++G R+ + +++D+ YT++ Y+ L +
Sbjct: 172 HYVIDLAGVSLGG----------RDIPIPPHAAMVLDTALPYTYMKPSMYAPLRDAFRRA 221
Query: 269 ITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDLFPSITFHF-------LNNVSLVLPQGN 321
+ YPRA + + D CY + L P + F ++ +
Sbjct: 222 MARYPRAPAMGD---LDTCYNF---TGVRHEVLIPLVHLTFRGISGGGGGEGQVLGLGAD 275
Query: 322 HFYAMSAPSNSSAVKCLLFQSM-DDGDYGP--SGVFGSFQQQNVEVVYDLEKERIGFQPM 378
MS P N +V CL F ++ DGD + V G+ Q ++EVV+D++ +IGF P
Sbjct: 276 QMLYMSEPGNFFSVTCLAFAALPSDGDAAAPLAMVMGTLAQSSMEVVHDVQGGKIGFIPG 335
Query: 379 DC 380
C
Sbjct: 336 SC 337
>gi|67633548|gb|AAY78698.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 392
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 95/386 (24%), Positives = 158/386 (40%), Gaps = 75/386 (19%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
I+ +DTGSDL W C C +C ++ F PS SS+ C + C
Sbjct: 74 IEAEIDTGSDLIWTQC----MPCTNC----YSQYAPIFDPSNSSTFKEKRCNGNSC---- 121
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
+ Y + G L +T+ +H +S G
Sbjct: 122 -----------------------------HYKIIYADTTYSKGTLATETVTIHSTS-GEP 151
Query: 122 REIPKFCFGC-VGSTYREPI--GIAGFGRGALSVPSQLGFLQKGF-SHCFLAFKYANDPN 177
+P+ GC S++ +P G+ G G S+ +Q+G G S+CF +
Sbjct: 152 FVMPETTIGCGHNSSWFKPTFSGMVGLSWGPSSLITQMGGEYPGLMSYCFAS-------Q 204
Query: 178 ISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
+S + G AI + D + T M + P YY+ L+A+++G++ + + + +
Sbjct: 205 GTSKINFGTNAIVAGDGVVSTTMFLTTAKPGLYYLNLDAVSVGDTHVETMGTTFHALE-- 262
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFD-LCYRVPCPNNT 296
G +++DSGTT T+ P + ++++ + +Y A + TG D LCY T
Sbjct: 263 --GNIIIDSGTTLTYFPVSY----CNLVREAVDHYVTAVRTADPTGNDMLCYY------T 310
Query: 297 FTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGS 356
T D+FP IT HF LVL + Y M + + CL + +FG+
Sbjct: 311 DTIDIFPVITMHFSGGADLVLDK----YNMYIETITRGTFCLAIICNNPPQ---DAIFGN 363
Query: 357 FQQQNVEVVYDLEKERIGFQPMDCAS 382
Q N V YD + F P +C++
Sbjct: 364 RAQNNFLVGYDSSSLLVFFSPTNCSA 389
>gi|115473845|ref|NP_001060521.1| Os07g0658600 [Oryza sativa Japonica Group]
gi|22775625|dbj|BAC15479.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
gi|50510141|dbj|BAD31106.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
gi|113612057|dbj|BAF22435.1| Os07g0658600 [Oryza sativa Japonica Group]
Length = 449
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 107/381 (28%), Positives = 161/381 (42%), Gaps = 60/381 (15%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DT +D W+PC C C S F+P+ S+S C S C+ + N
Sbjct: 124 VDTSNDAAWIPCSG----CAGCPTS------SPFNPAASASYRPVPCGSPQCV---LAPN 170
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
P CS + +C F+ +Y + L L++DTL V G +
Sbjct: 171 P-------SCSPN---AKSC-----GFSLSYADSSL-QAALSQDTLAVAGD------VVK 208
Query: 126 KFCFGCV---GSTYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNISSP 181
+ FGC+ T P G+ G GRG LS SQ + FS+C +FK N S
Sbjct: 209 AYTFGCLQRATGTAAPPQGLLGLGRGPLSFLSQTKDMYGATFSYCLPSFKSLN---FSGT 265
Query: 182 LVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGG 241
L +G + ++ TP+L +P + YY+ + I +G + +P S FD G
Sbjct: 266 LRLGRNGQPRR--IKTTPLLANPHRSSLYYVNMTGIRVGKK-VVSIPASALAFDPATGAG 322
Query: 242 LLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDL 301
++DSGT +T L P Y L ++ + A V GFD CY N T
Sbjct: 323 TVLDSGTMFTRLVAPVYLALRDEVRRRVGA--GAAAVSSLGGFDTCY-----NTTVA--- 372
Query: 302 FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQN 361
+P +T F + + + LP+ N + + S CL + DG V S QQQN
Sbjct: 373 WPPVTLLF-DGMQVTLPEENVVIHTTYGTTS----CLAMAAAPDGVNTVLNVIASMQQQN 427
Query: 362 VEVVYDLEKERIGFQPMDCAS 382
V++D+ R+GF C +
Sbjct: 428 HRVLFDVPNGRVGFARESCTA 448
>gi|224057272|ref|XP_002299201.1| predicted protein [Populus trichocarpa]
gi|118483775|gb|ABK93780.1| unknown [Populus trichocarpa]
gi|222846459|gb|EEE84006.1| predicted protein [Populus trichocarpa]
Length = 425
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 103/380 (27%), Positives = 159/380 (41%), Gaps = 65/380 (17%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DT +D W+PC C+ C + F+ S++ C + C +
Sbjct: 107 LDTSNDAAWIPCNG----CVGCSS-------TVFNSVTSTTFKTLGCDAPQCKQV----- 150
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
P C S C+ +T TYG G + LTRDT+ + +P
Sbjct: 151 PNPTCGGSTCTWNT---------------TYG-GSTILSNLTRDTIALSTD------IVP 188
Query: 126 KFCFGCVGSTYRE---PIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNISSP 181
+ FGC+ T P G+ G GRG LS SQ L K FS+C +F+ N S
Sbjct: 189 GYTFGCIQKTTGSSVPPQGLLGLGRGPLSFLSQTQDLYKSTFSYCLPSFRTLN---FSGT 245
Query: 182 LVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGG 241
L +G + ++ TP+LK+P + YY+ L I +G + ++P S F+ G
Sbjct: 246 LRLGPAGQPLR--IKTTPLLKNPRRSSLYYVNLIGIRVGRK-IVDIPASALAFNPTTGAG 302
Query: 242 LLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDL 301
+ DSGT +T L P Y+ + + + V GFD CY P +
Sbjct: 303 TIFDSGTVFTRLVAPVYTAVRDEFRKRVG----NAIVSSLGGFDTCYTGPI--------V 350
Query: 302 FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQN 361
P++TF F + +++ LP N +A S S CL + D V + QQQN
Sbjct: 351 APTMTFMF-SGMNVTLPPDNLLIRSTAGSTS----CLAMAAAPDNVNSVLNVIANMQQQN 405
Query: 362 VEVVYDLEKERIGFQPMDCA 381
+++D+ RIG C+
Sbjct: 406 HRILFDVPNSRIGVAREPCS 425
>gi|356575389|ref|XP_003555824.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 473
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 99/380 (26%), Positives = 156/380 (41%), Gaps = 58/380 (15%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V +D+GSD+ WV C C C Y + + F+P+ SSS + +CAS+ C ++
Sbjct: 149 VVIDSGSDIIWVQCE----PCTQC--YHQSDPV--FNPADSSSYAGVSCASTVCSHV--- 197
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
DN +GC CR + +YG+G G L +TL + +IR
Sbjct: 198 DN-------AGCHEGR------CR----YEVSYGDGSYTKGTLALETLTFGRT---LIRN 237
Query: 124 IPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNISSPL 182
+ C + G+ G G G +S QLG G FS+C ++ + S L
Sbjct: 238 VAIGCGHHNQGMFVGAAGLLGLGSGPMSFVGQLGGQAGGTFSYCLVSRGIQS----SGLL 293
Query: 183 VIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLS--LREFDSQGNG 240
G A+ + P++ +P ++YY ++ VP+S + + G+G
Sbjct: 294 QFGREAVPV--GAAWVPLIHNPRAQSFYY---VGLSGLGVGGLRVPISEDVFKLSELGDG 348
Query: 241 GLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDD 300
G+++D+GT T LP Y + T PRA V FD CY + F
Sbjct: 349 GVVMDTGTAVTRLPTAAYEAFRDAFIAQTTNLPRASGVSI---FDTCYDL----FGFVSV 401
Query: 301 LFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQ 360
P+++F+F L LP N P + C F G + G+ QQ+
Sbjct: 402 RVPTVSFYFSGGPILTLPARNFLI----PVDDVGSFCFAFAPSSSG----LSIIGNIQQE 453
Query: 361 NVEVVYDLEKERIGFQPMDC 380
+E+ D +GF P C
Sbjct: 454 GIEISVDGANGFVGFGPNVC 473
>gi|297828001|ref|XP_002881883.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
lyrata]
gi|297327722|gb|EFH58142.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
lyrata]
Length = 529
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 109/391 (27%), Positives = 165/391 (42%), Gaps = 53/391 (13%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DTGSDL W+ C C DC + N+ + P S+S TC C I S
Sbjct: 177 LILDTGSDLNWLQC----LPCYDC--FHQNEAF--YDPKTSASFKNITCNDPRCSLISSP 228
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVH-GSSPGIIR 122
+ P C S CP F Y YG+ TG +T V+ ++ G
Sbjct: 229 EPPVQ-CKSDNQS------------CPYF-YWYGDRSNTTGDFAVETFTVNLTTTEGRSS 274
Query: 123 E--IPKFCFGC---VGSTYREPIGIAGFGRGALSVPSQLGFLQ-KGFSHCFLAFKYANDP 176
E + FGC + G+ G GRG LS SQL L FS+C + +D
Sbjct: 275 EYKVENMMFGCGHWNRGLFSGASGLLGLGRGPLSFSSQLQSLYGHSFSYCLV--DRNSDT 332
Query: 177 NISSPLVIG-DVAISSKDNLQFTPML--KSPMYPNYYYIGLEAITIGNSSLTEVPLSLRE 233
N+SS L+ G D + + NL FT + K +YYI +++I +G +L ++P
Sbjct: 333 NVSSKLIFGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILVGGEAL-DIPEETWN 391
Query: 234 FDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCP 293
G GG ++DSGTT ++ EP Y I+++ + + R D PC
Sbjct: 392 ISPDGAGGTIIDSGTTLSYFAEPAYE----IIKNKFAEKMKENYLVFR---DFPVLDPCF 444
Query: 294 NNTFTDD---LFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGP 350
N + ++ P + F + P N F +S + CL +
Sbjct: 445 NVSGIEENNIHLPELGIAFADGAVWNFPAENSFIWLSED-----LVCLAILGTPKSTFS- 498
Query: 351 SGVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
+ G++QQQN ++YD + R+GF P CA
Sbjct: 499 --IIGNYQQQNFHILYDTKMSRLGFTPTKCA 527
>gi|326515172|dbj|BAK03499.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 494
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 101/383 (26%), Positives = 152/383 (39%), Gaps = 65/383 (16%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V +DT SD+ WV C L C ++ + P++SS+ + C S C + SS
Sbjct: 171 VVVDTSSDIPWVQC--LPCPIPQCHLQKDPL----YDPAKSSTFAPIPCGSPACKELGSS 224
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
+GCS +T C + YG+G TG DTL + SP I+
Sbjct: 225 YG-------NGCSPTT---DEC-----KYIVNYGDGKATTGTYVTDTLTM---SPTIV-- 264
Query: 124 IPKFCFGCV----GSTYREPIGIAGFGRGALSVPSQLG-FLQKGFSHCFLAFKYANDPNI 178
+ F FGC GS + GI G G S+ Q FS+C A ++
Sbjct: 265 VKDFRFGCSHAVRGSFSNQNAGILALGGGRGSLLEQTADAYGNAFSYCIPKPSSAGFLSL 324
Query: 179 SSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQG 238
P + + +TP++K+ P +Y + LEAI + L P +
Sbjct: 325 GGP-------VEASLKFSYTPLIKNKHAPTFYIVHLEAIIVAGKQLAVPPTAFAT----- 372
Query: 239 NGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYY-PRAKEVEERTGFDLCYRVPCPNNTF 297
G ++DSG T LP Y+ L + +S + Y P A V D CY F
Sbjct: 373 --GAVMDSGAVVTQLPPQVYAALRAAFRSAMAAYGPLAAPVRN---LDTCYDF----TRF 423
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSF 357
D P ++ F +L L P++ CL F + + G G+
Sbjct: 424 PDVKVPKVSLVFAGGATLDL----------EPASIILDGCLAFAATPGEES--VGFIGNV 471
Query: 358 QQQNVEVVYDLEKERIGFQPMDC 380
QQQ EV+YD+ ++GF+ C
Sbjct: 472 QQQTYEVLYDVGGGKVGFRRGAC 494
>gi|218194598|gb|EEC77025.1| hypothetical protein OsI_15381 [Oryza sativa Indica Group]
Length = 422
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 108/388 (27%), Positives = 163/388 (42%), Gaps = 72/388 (18%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDT-CASSFCLNIHS 62
V +DTGS WV C C + F RSS SS++ C + C
Sbjct: 74 VQLDTGSKAFWVN----GISCKQCPHESDILRKLTFYDPRSSVSSKEVKCDDTIC----- 124
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIR 122
+ P PC M+ CP + Y +GGL GIL D L H G +
Sbjct: 125 TSRP--PCNMT-------------LRCP-YITGYADGGLTMGILFTDLLHYH-QLYGNGQ 167
Query: 123 EIP---KFCFGC----VGSTYREPI---GIAGFGRGALSVPSQL---GFLQKGFSHCFLA 169
P FGC GS + GI GFG + SQL G +K FSHC
Sbjct: 168 TQPTSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCL-- 225
Query: 170 FKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPL 229
+ N IG+V + ++ TP++K+ Y+ + L++I + ++L ++P
Sbjct: 226 ----DSTNGGGIFAIGEVV---EPKVKTTPIVKNN--EVYHLVNLKSINVAGTTL-QLPA 275
Query: 230 SLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDL-CY 288
++ F + G +DSG+T +LPE YS+L+ + + + ++ ++ C+
Sbjct: 276 NI--FGTTKTKGTFIDSGSTLVYLPEIIYSELI------LAVFAKHPDITMGAMYNFQCF 327
Query: 289 RVPCPNNTFTDDLFPSITFHFLNNVSL-VLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGD 347
DD FP ITFHF N+++L V P + Y + N C FQ
Sbjct: 328 HFLGS----VDDKFPKITFHFENDLTLDVYP---YDYLLEYEGNQY---CFGFQDAGIHG 377
Query: 348 YGPSGVFGSFQQQNVEVVYDLEKERIGF 375
Y + G N VVYD+EK+ IG+
Sbjct: 378 YKDMIILGDMVISNKVVVYDMEKQAIGW 405
>gi|357140068|ref|XP_003571594.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 533
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 107/387 (27%), Positives = 158/387 (40%), Gaps = 56/387 (14%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ V +DTGSDLTWV C C Y + F P+ S + + C S C
Sbjct: 194 LTVIVDTGSDLTWVQC----EPCPGSSCYAQRDPL--FDPAASPTFAAVPCGSPACA-AS 246
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
D P + C+ S C +A +YG+G G+L +DTL + G
Sbjct: 247 LKDATGAPGS---CARSAGNSEQRCY----YALSYGDGSFSRGVLAQDTLGL-----GTT 294
Query: 122 REIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPN 177
++ F FGC S + G+ G GR LS+ SQ G FS+C A +
Sbjct: 295 TKLDGFVFGCGLSNRGLFGGTAGLMGLGRTDLSLVSQTAARFGGVFSYCLPATTTS---- 350
Query: 178 ISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
+ L +G SS N+ +T M+ P P +Y+I + +G + P
Sbjct: 351 -TGSLSLGPGPSSSFPNMAYTRMIADPTQPPFYFINITGAAVGGGAALTAP-------GF 402
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGF---DLCYRVPCPN 294
G G +LVDSGT T L S+ ++ + R E GF D CY +
Sbjct: 403 GAGNVLVDSGTVITRLAP-------SVYKAVRAEFARRFEYPAAPGFSILDACYDL---- 451
Query: 295 NTFTDDL-FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGV 353
T D++ P +T + + + + + S V CL S+ D P +
Sbjct: 452 -TGRDEVNVPLLTLTLEGGAQVTVDAAGMLFVVR--KDGSQV-CLAMASLPYEDQTP--I 505
Query: 354 FGSFQQQNVEVVYDLEKERIGFQPMDC 380
G++QQ+N VVYD R+GF DC
Sbjct: 506 IGNYQQRNKRVVYDTVGSRLGFADEDC 532
>gi|297798978|ref|XP_002867373.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297313209|gb|EFH43632.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 434
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 105/382 (27%), Positives = 163/382 (42%), Gaps = 60/382 (15%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DTGSDLTW+ C+ C Y + + F PSRSS+ +C S+
Sbjct: 103 LLIDTGSDLTWI-------QCLPCKCY--PQTIPFFHPSRSSTYRNASCESA-------- 145
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
P M K+ CR + Y + GIL ++ L S G+I +
Sbjct: 146 -----PHAMPQIFRDE--KTGNCR----YHLRYRDFSNTRGILAKEKLTFQTSDEGLISK 194
Query: 124 IPKFCFGC--VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSP 181
P FGC S + + G+ G G G S+ ++ F K FS+CF + P+ +
Sbjct: 195 -PNIVFGCGQDNSGFTQYSGVLGLGPGTFSIVTR-NFGSK-FSYCFGSLIDPTYPH--NF 249
Query: 182 LVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGG 241
L++G+ A D TP+ ++ + YY+ L+AI++G L P + + S+G G
Sbjct: 250 LILGNGARIEGDP---TPL---QIFQDRYYLDLQAISLGEKLLDIEPGIFQRYRSKG--G 301
Query: 242 LLVDSGTTYTHLPEPFYSQLLSILQSTI-TYYPRAKEVEERTGFDLCYRVPCPNNTFTDD 300
++D+G + T L Y L + + R K+ E+ T C D
Sbjct: 302 TVIDTGCSPTILAREAYETLSEEIDFLLGEVLRRVKDWEQYTNH-------CYEGNLKLD 354
Query: 301 L--FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQ 358
L FP +TFHF L L + F +S+ S S + + DD V G+
Sbjct: 355 LYGFPVVTFHFAGGAELALDVESLF--VSSESGDSFCLAMTMNTFDD-----MSVIGAMA 407
Query: 359 QQNVEVVYDLEKERIGFQPMDC 380
QQN V Y+L ++ FQ DC
Sbjct: 408 QQNYNVGYNLRTMKVYFQRTDC 429
>gi|357481195|ref|XP_003610883.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512218|gb|AES93841.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 315
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 80/298 (26%), Positives = 137/298 (45%), Gaps = 32/298 (10%)
Query: 91 SFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIPKFCFGC----VGSTYREPIGIAGFG 146
++ Y YG+ L G+L +DT S+ G + + +F FGC G +G+ G G
Sbjct: 41 NYTYGYGDNSLTKGVLAQDT-ATFTSNTGKLVSLSRFLFGCGHNNTGGFNDHEMGLIGLG 99
Query: 147 RGALSVPSQLG--FLQKGFSHCFLAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSP 204
G S+ SQ+G F K FS C + F D ISS + G + D + TP+++
Sbjct: 100 GGPTSLISQIGPLFGGKKFSQCLVPF--LTDIKISSRMSFGKGSQVLGDGVVTTPLVQRE 157
Query: 205 MYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSI 264
Y++ L I++ + T +P++ + G +LVDSGT LP+ Y ++
Sbjct: 158 QDMTSYFVTLLGISVED---TYLPMN----STIEKGNMLVDSGTPPNILPQQLYDRVYVE 210
Query: 265 LQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFY 324
+++ + + G LCYR T T+ P++T+HF L+ P
Sbjct: 211 VKNNVPL--ELITNDPSLGPQLCYR------TQTNLKGPTLTYHFEGANLLLTP----IQ 258
Query: 325 AMSAPS-NSSAVKCLLFQSMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
P+ + V CL + + + GV+G+F Q N + +DL+++ + F+ DC
Sbjct: 259 TFIPPTPETKGVFCLAINNYTNSN---GGVYGNFAQSNYLIGFDLDRQVVSFKATDCT 313
>gi|226530102|ref|NP_001152414.1| PCS1 precursor [Zea mays]
gi|195656033|gb|ACG47484.1| PCS1 [Zea mays]
Length = 452
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 105/414 (25%), Positives = 166/414 (40%), Gaps = 56/414 (13%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + +DTGS+L+W+ C + + + F+ S SS+ + C+S C
Sbjct: 73 VTMVLDTGSELSWLRC-----NGSRVPSTPPPQAPAAFNGSASSTYAAAHCSSPEC-QWR 126
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
D P P S S CR + +Y + GIL DT + G+ P
Sbjct: 127 GRDLPVPPFCAGPPSXS-------CR----VSLSYADASSADGILAADTFLLGGAPPVXA 175
Query: 122 REIPKFCFGCV----------GSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFK 171
FGCV S G+ G RG+LS +Q L+ F++C
Sbjct: 176 ------LFGCVTSYSSATATNSSDSEAATGLLGMNRGSLSFVTQTATLR--FAYCI---A 224
Query: 172 YANDPNISSPLVIGDVAISSKDNLQFTPMLK-SPMYPNY----YYIGLEAITIGNSSLTE 226
+ P + LV+G + L +TP+++ S P + Y + LE I +G ++L
Sbjct: 225 PGDGPGL---LVLGGDGAALAPQLNYTPLIQISRPLPYFDRVAYSVQLEGIRVG-AALLP 280
Query: 227 VPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLS--ILQSTITYYPRAK-EVEERTG 283
+P S+ D G G +VDSGT +T L Y+ L + Q++ P + + +
Sbjct: 281 IPKSVLAPDHTGAGQTMVDSGTQFTFLLADAYAPLKGEFLNQTSALLAPLGESDFVFQGA 340
Query: 284 FDLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAM----SAPSNSSAVKCLL 339
FD C+R + P + L + + Y + + AV CL
Sbjct: 341 FDACFRASEARVAAASXMLPEVGL-VLRGAEVAVGGEKLLYRVPGERRGEGGAEAVWCLT 399
Query: 340 FQSMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASAQGLHKKK 393
F + D + V G QQNV V YDL+ R+GF P C + Q L +
Sbjct: 400 FGNSDMAGMS-AYVIGHHHQQNVWVEYDLQNGRVGFAPARCDLATATQRLRARA 452
>gi|449434646|ref|XP_004135107.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 486
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 103/382 (26%), Positives = 160/382 (41%), Gaps = 64/382 (16%)
Query: 4 VYM--DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
VYM DTGSD++WV C C +C Y + F P+ S+S + +C + C ++
Sbjct: 164 VYMVLDTGSDVSWVQCA----PCAEC--YEQTDPI--FEPTSSASFTSLSCETEQCKSLD 215
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
S+ C T L + +YG+G G +T+ + +S G I
Sbjct: 216 VSE----------CRNGTCL----------YEVSYGDGSYTVGDFVTETVTLGSTSLGNI 255
Query: 122 REIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNI 178
GC + + G+ G G G+LS PSQL FS+C + D +
Sbjct: 256 ------AIGCGHNNEGLFIGAAGLLGLGGGSLSFPSQLN--ASSFSYCLVD----RDSDS 303
Query: 179 SSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQG 238
+S L D + P+ ++P ++Y+GL +++G + L +P + + G
Sbjct: 304 TSTL---DFNSPITPDAVTAPLHRNPNLDTFFYLGLTGMSVGGAVL-PIPETSFQMSEDG 359
Query: 239 NGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFT 298
NGG++VDSGT T L Y+ L + A+ V FD CY + +
Sbjct: 360 NGGIIVDSGTAVTRLQTTVYNVLRDAFVKSTHDLQTARGVAL---FDTCYDLSSKSRV-- 414
Query: 299 DDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQ 358
P+++FHF N L LP N+ P +S C F D + G+ Q
Sbjct: 415 --EVPTVSFHFANGNELPLPAKNYLI----PVDSEGTFCFAFAPTD----STLSILGNAQ 464
Query: 359 QQNVEVVYDLEKERIGFQPMDC 380
QQ V +DL +GF P C
Sbjct: 465 QQGTRVGFDLANSLVGFSPNKC 486
>gi|297834938|ref|XP_002885351.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
gi|297331191|gb|EFH61610.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
Length = 471
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 97/380 (25%), Positives = 157/380 (41%), Gaps = 62/380 (16%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+D+GSD+ WV C C C Y+ + + F P++S S + +C SS C I +S
Sbjct: 149 IDSGSDMVWVQCQ----PCKLC--YKQSDPV--FDPAKSGSYTGVSCGSSVCDRIENSG- 199
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
C GC + YG+G G L +TL + ++R +
Sbjct: 200 ----CHSGGCRYEVM---------------YGDGSYTKGTLALETLTFAKT---VVRNVA 237
Query: 126 KFCFGCVGSTYREPIGIAGFGRGALSVPSQL-GFLQKGFSHCFLAFKYANDPNISSPLVI 184
C + G+ G G G++S QL G F +C ++ + + LV
Sbjct: 238 MGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCLVS----RGTDSTGSLVF 293
Query: 185 GDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFD--SQGNGGL 242
G A+ + P++++P P++YY + +PL FD G+GG+
Sbjct: 294 GREALPV--GASWVPLVRNPRAPSFYY---VGLKGLGVGGVRIPLPDGVFDLTETGDGGV 348
Query: 243 LVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDLF 302
++D+GT T LP Y+ +S PRA V FD CY + + F
Sbjct: 349 VMDTGTAVTRLPTGAYAAFRDGFKSQTANLPRASGVSI---FDTCYDL----SGFVSVRV 401
Query: 303 PSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSG--VFGSFQQQ 360
P+++F+F L LP N P + S C F + P+G + G+ QQ+
Sbjct: 402 PTVSFYFTEGPVLTLPARNFL----MPVDDSGTYCFAFAA------SPTGLSIIGNIQQE 451
Query: 361 NVEVVYDLEKERIGFQPMDC 380
++V +D +GF P C
Sbjct: 452 GIQVSFDGANGFVGFGPNVC 471
>gi|449440161|ref|XP_004137853.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449521209|ref|XP_004167622.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 492
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 107/400 (26%), Positives = 175/400 (43%), Gaps = 67/400 (16%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V +DTGSD+ WV C + C C + NF + SSSSS S
Sbjct: 94 VQIDTGSDILWVNCNS----CNGCPRSSGLGIQLNFFDASSSSSSSLV----------SC 139
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTL---KVHGSSPGI 120
+P + L +S C S+ + YG+G +G +++ V G S I
Sbjct: 140 SDPICNSAFQTTATQCLTQSNQC----SYTFQYGDGSGTSGYYVSESMYFDMVMGQSM-I 194
Query: 121 IREIPKFCFGCVGSTYREPI---------GIAGFGRGALSVPSQL---GFLQKGFSHCFL 168
FGC STY+ GI GFG G LSV SQL G K FSHC
Sbjct: 195 ANSSASVVFGC--STYQSGDLTKSDHAIDGIFGFGPGDLSVISQLSARGITPKVFSHCL- 251
Query: 169 AFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVP 228
+ N LV+G+V + + ++P++ S + N Y L++I++ +L P
Sbjct: 252 ----KGEGNGGGILVLGEVL---EPGIVYSPLVPSQPHYNLY---LQSISVNGQTL---P 298
Query: 229 LSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTI--TYYPRAKEVEERTGFDL 286
+ F + N G ++DSGTT +L E Y+ +S + + + + P + +
Sbjct: 299 IDPSVFATSINRGTIIDSGTTLAYLVEEAYTPFVSAITAAVSQSVTPTISKGNQ------ 352
Query: 287 CYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDG 346
CY V +T ++FP ++ +F + S+VL + + + +A+ C+ FQ + +G
Sbjct: 353 CYLV----STSVGEIFPLVSLNFAGSASMVLKPEEYLMHLGF-YDGAALWCIGFQKVQEG 407
Query: 347 DYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASA 386
+ G ++ VYDL ++RIG+ DC+ +
Sbjct: 408 ----VTILGDLVMKDKIFVYDLARQRIGWASYDCSQAVNV 443
>gi|449530542|ref|XP_004172253.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
CELL 1-like [Cucumis sativus]
Length = 486
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 102/382 (26%), Positives = 160/382 (41%), Gaps = 64/382 (16%)
Query: 4 VYM--DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
VYM DTGSD++WV C C +C + + F P+ S+S + +C + C ++
Sbjct: 164 VYMVLDTGSDVSWVQCA----PCAECYEQTD----PXFEPTSSASFTSLSCETEQCKSLD 215
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
S+ C T L + +YG+G G +T+ + +S G I
Sbjct: 216 VSE----------CRNGTCL----------YEVSYGDGSYTVGDFVTETVTLGSTSLGNI 255
Query: 122 REIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNI 178
GC + + G+ G G G+LS PSQL FS+C + D +
Sbjct: 256 ------AIGCGHNNEGLFIGAAGLLGLGGGSLSFPSQLN--ASSFSYCLVD----RDSDS 303
Query: 179 SSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQG 238
+S L D + P+ ++P ++Y+GL +++G + L +P + + G
Sbjct: 304 TSTL---DFNSPITPDAVTAPLHRNPNLDTFFYLGLTGMSVGGAVL-PIPETSFQMSEDG 359
Query: 239 NGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFT 298
NGG++VDSGT T L Y+ L + A+ V FD CY + +
Sbjct: 360 NGGIIVDSGTAVTRLQTTVYNVLRDAFVKSTHDLQTARGVAL---FDTCYDLSSKSRV-- 414
Query: 299 DDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQ 358
P+++FHF N L LP N+ P +S C F D + G+ Q
Sbjct: 415 --EVPTVSFHFANGNELPLPAKNYLI----PVDSEGTFCFAFAPTD----STLSILGNAQ 464
Query: 359 QQNVEVVYDLEKERIGFQPMDC 380
QQ V +DL +GF P C
Sbjct: 465 QQGTRVGFDLANSLVGFSPNKC 486
>gi|224072755|ref|XP_002303865.1| predicted protein [Populus trichocarpa]
gi|222841297|gb|EEE78844.1| predicted protein [Populus trichocarpa]
Length = 412
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 98/388 (25%), Positives = 167/388 (43%), Gaps = 62/388 (15%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ V +DTGSDL+WV C C C + ++ F+PS+S S C S C ++
Sbjct: 77 MTVIVDTGSDLSWVQCQ----PCNRCYNQQD----PVFNPSKSPSYRTVLCNSLTCRSLQ 128
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCR--PCPSFAYTYGEGGLVTGILTRDTLKVHGSSPG 119
L+T C P ++ YG+G +G + + L + ++
Sbjct: 129 ---------------LATGNSGVCGSNPPTCNYVVNYGDGSYTSGEVGMEHLNLGNTT-- 171
Query: 120 IIREIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYAND 175
+ F FGC + G+ G GR LS+ SQ+ + G FS+C +
Sbjct: 172 ----VNNFIFGCGRKNQGLFGGASGLVGLGRTDLSLISQISPMFGGVFSYCLPT----TE 223
Query: 176 PNISSPLVIGDVAISSKDN--LQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLRE 233
S LV+G + K+ + +T M+ +P+ P +Y++ L IT+G + + P
Sbjct: 224 AEASGSLVMGGNSSVYKNTTPISYTRMIHNPLLP-FYFLNLTGITVGGVEV-QAP----- 276
Query: 234 FDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCP 293
S G +++DSGT + LP Y L + + YP A D C+ +
Sbjct: 277 --SFGKDRMIIDSGTVISRLPPSIYQALKAEFVKQFSGYPSAPSFMI---LDSCFNL--- 328
Query: 294 NNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGV 353
+ + + P I +F + L + FY S +++S V CL S+ D G+
Sbjct: 329 -SGYQEVKIPDIKMYFEGSAELNVDVTGVFY--SVKTDASQV-CLAIASLPYED--EVGI 382
Query: 354 FGSFQQQNVEVVYDLEKERIGFQPMDCA 381
G++QQ+N ++YD + +GF C+
Sbjct: 383 IGNYQQKNQRIIYDTKGSMLGFAEEACS 410
>gi|72384474|gb|AAZ67590.1| 80A08_5 [Brassica rapa subsp. pekinensis]
Length = 632
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 99/398 (24%), Positives = 160/398 (40%), Gaps = 86/398 (21%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRN-------NKLMSNFSPSRSSSSSRDTCASSF 56
V +D+GSDL W+PC +C+ C + K ++ F PS S++S C+
Sbjct: 112 VALDSGSDLLWIPC-----NCVQCAPLSSAYYSSLATKDLNEFDPSASTTSKVFPCSHKL 166
Query: 57 CLNIHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYG-EGGLVTGILTRDTLKVHG 115
C + + ++P + C + TY E +G+L D L +
Sbjct: 167 CESAPACESPKEQCP--------------------YTVTYASENTSSSGLLVEDVLHLAY 206
Query: 116 SSPGIIREIPKFCFGCVGSTYRE------PIGIAGFGRGALSVPSQL---GFLQKGFSHC 166
S+ + GC E P G+ G G G +SVPS L G ++ FS C
Sbjct: 207 SANASSSVKARVVVGCGEKQSGEFLKGIAPDGVMGLGPGEISVPSFLAKAGLMRNSFSMC 266
Query: 167 FLAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPN---YYYIGLEAITIGNSS 223
F D S + GDV S++ + +F P Y N Y++G+E +GNS
Sbjct: 267 F-------DEEDSGRIYFGDVGPSTQQSTRFLP------YKNEFVAYFVGVEVCCVGNSC 313
Query: 224 LTEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTG 283
L Q + L+DSG ++T LPE Y ++ + S I K++E
Sbjct: 314 L-----------KQSSFTTLIDSGQSFTFLPEEIYREVALEIDSHIN--ATVKKIEGGP- 359
Query: 284 FDLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVK-CLLFQS 342
++ CY T + P+I F +N + V+ H + V+ CL +
Sbjct: 360 WEYCY------ETSFEPKVPAIKLKFSSNNTFVI----HKPLFVLQRSEGLVQFCLPISA 409
Query: 343 MDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
++G GV G +V+D E ++G+ C
Sbjct: 410 SEEGT---GGVIGQNYMAGYRIVFDRENMKLGWSASKC 444
>gi|326513976|dbj|BAJ92138.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 342
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 67/245 (27%), Positives = 111/245 (45%), Gaps = 21/245 (8%)
Query: 141 GIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSPLVIGDVAISSKDN----LQ 196
G+ G G +S+ SQL + FS+C F +SP++ G +A K N +Q
Sbjct: 111 GLMGLSPGTMSLISQLSVPR--FSYCLTPFAERK----TSPMLFGAMADLRKYNTTGPIQ 164
Query: 197 FTPMLKSP-MYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGGLLVDSGTTYTHLPE 255
T +L++P M YYY+ L +++G L VP + + G GG +VDSG+T HL
Sbjct: 165 TTAILRNPAMDTFYYYVPLVGLSLGTKRL-RVPAASLAINPDGTGGTIVDSGSTMAHLAG 223
Query: 256 PFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDLFPSITFHFLNNVSL 315
+ + + + VE+ ++LC+ VP P + HF ++
Sbjct: 224 KAFDAVKKAVLEAVKLPVFNGTVED---YELCFAVPS-GVAMAAVKTPPLVLHFDGGAAM 279
Query: 316 VLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGF 375
LP+ N+F A + CL + P + G+ QQQN+ V++D+ ++ F
Sbjct: 280 ALPRDNYFQEPRA-----GLMCLAVARSPEDLGAPISIIGNVQQQNMHVLFDVHNQKFSF 334
Query: 376 QPMDC 380
P C
Sbjct: 335 APTKC 339
>gi|413923780|gb|AFW63712.1| hypothetical protein ZEAMMB73_689747 [Zea mays]
Length = 470
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 101/382 (26%), Positives = 153/382 (40%), Gaps = 72/382 (18%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFC--LNIHSS 63
+DTGSDL+WV C C YR + F P++SSS + C S C L I++S
Sbjct: 154 VDTGSDLSWVQCK----PCAAPSCYRQKDPL--FDPAQSSSYAAVPCGRSACAGLGIYAS 207
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
C+ + C + +YG+G TG+ + DTL + ++
Sbjct: 208 A-----CSAAQCG---------------YVVSYGDGSNTTGVYSSDTLTLAANA-----T 242
Query: 124 IPKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNI 178
+ F FGC G + G+ GFGR S+ Q G FS+C +
Sbjct: 243 VQGFLFGCGHAQSGGLFTGIDGLLGFGREQPSLVQQTAGAYGGVFSYCL-----PTKSST 297
Query: 179 SSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQG 238
+ L +G + T +L SP P YY + L I++G L+ VP S
Sbjct: 298 TGYLTLGGPS-GVAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQPLS-VPASAFA----- 350
Query: 239 NGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFT 298
G +VD+GT T LP Y+ L S +S + YP A + D CY +
Sbjct: 351 -AGTVVDTGTVITRLPPAAYAALRSAFRSGMASYPSAPPIGI---LDTCYSF----AGYG 402
Query: 299 DDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQ 358
S+ F + ++ L + CL F S G G + G+ Q
Sbjct: 403 TVNLTSVALTFSSGATMTLGADGIM----------SFGCLAFAS--SGSDGSMAILGNVQ 450
Query: 359 QQNVEVVYDLEKERIGFQPMDC 380
Q++ EV ++ +GF+P C
Sbjct: 451 QRSFEV--RIDGSSVGFRPSSC 470
>gi|218188634|gb|EEC71061.1| hypothetical protein OsI_02803 [Oryza sativa Indica Group]
Length = 479
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 110/389 (28%), Positives = 159/389 (40%), Gaps = 69/389 (17%)
Query: 3 QVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHS 62
+V +DTGSD++WV C + + + F P+ SS+ + C+++ C +
Sbjct: 149 RVVIDTGSDVSWVQC-----EPCPAPSPCHAHAGALFDPAASSTYAAFNCSAAACAQLGD 203
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIR 122
S +GC KS C + YG+G TG + D L + GS ++R
Sbjct: 204 SGE------ANGCDA----KSRC-----QYIVKYGDGSNTTGTYSSDVLTLSGSD--VVR 246
Query: 123 EIPKFCFGC----VGSTYREPI-GIAGFGRGALSVPSQLGF-LQKGFSHCFLAFKYANDP 176
F FGC +G+ + G+ G G A S+ SQ K FS+C A P
Sbjct: 247 ---GFQFGCSHAELGAGMDDKTDGLIGLGGDAQSLVSQTAARYGKSFSYCLPA-----TP 298
Query: 177 NISSPLVIGDVAISSKD---NLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLRE 233
S L +G A TPML+S P YY+ LE I +G L P
Sbjct: 299 ASSGFLTLGAPASGGGGGASRFATTPMLRSKKVPTYYFAALEDIAVGGKKLGLSP----- 353
Query: 234 FDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCP 293
S G LVDSGT T LP Y+ L S ++ +T Y RA+ + D C+
Sbjct: 354 --SVFAAGSLVDSGTVITRLPPAAYAALSSAFRAGMTRYARAEPLGI---LDTCF----- 403
Query: 294 NNTFTDDL-FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQ-SMDDGDYGPS 351
N T D + P++ F + L H CL F + DD +
Sbjct: 404 NFTGLDKVSIPTVALVFAGGAVVDLDA--HGIVSGG--------CLAFAPTRDDKAF--- 450
Query: 352 GVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
G G+ QQ+ EV+YD+ GF+ C
Sbjct: 451 GTIGNVQQRTFEVLYDVGGGVFGFRAGAC 479
>gi|190896584|gb|ACE96805.1| aspartyl protease [Populus tremula]
gi|190896586|gb|ACE96806.1| aspartyl protease [Populus tremula]
gi|190896588|gb|ACE96807.1| aspartyl protease [Populus tremula]
gi|190896590|gb|ACE96808.1| aspartyl protease [Populus tremula]
gi|190896592|gb|ACE96809.1| aspartyl protease [Populus tremula]
gi|190896594|gb|ACE96810.1| aspartyl protease [Populus tremula]
gi|190896596|gb|ACE96811.1| aspartyl protease [Populus tremula]
gi|190896598|gb|ACE96812.1| aspartyl protease [Populus tremula]
gi|190896600|gb|ACE96813.1| aspartyl protease [Populus tremula]
gi|190896602|gb|ACE96814.1| aspartyl protease [Populus tremula]
gi|190896604|gb|ACE96815.1| aspartyl protease [Populus tremula]
gi|190896606|gb|ACE96816.1| aspartyl protease [Populus tremula]
gi|190896610|gb|ACE96818.1| aspartyl protease [Populus tremula]
gi|190896612|gb|ACE96819.1| aspartyl protease [Populus tremula]
gi|190896614|gb|ACE96820.1| aspartyl protease [Populus tremula]
gi|190896616|gb|ACE96821.1| aspartyl protease [Populus tremula]
gi|190896618|gb|ACE96822.1| aspartyl protease [Populus tremula]
gi|190896620|gb|ACE96823.1| aspartyl protease [Populus tremula]
gi|190896622|gb|ACE96824.1| aspartyl protease [Populus tremula]
gi|190896624|gb|ACE96825.1| aspartyl protease [Populus tremula]
gi|190896626|gb|ACE96826.1| aspartyl protease [Populus tremula]
gi|190896628|gb|ACE96827.1| aspartyl protease [Populus tremula]
gi|190896630|gb|ACE96828.1| aspartyl protease [Populus tremula]
gi|190896632|gb|ACE96829.1| aspartyl protease [Populus tremula]
gi|190896634|gb|ACE96830.1| aspartyl protease [Populus tremula]
gi|190896636|gb|ACE96831.1| aspartyl protease [Populus tremula]
gi|190896638|gb|ACE96832.1| aspartyl protease [Populus tremula]
gi|190896640|gb|ACE96833.1| aspartyl protease [Populus tremula]
gi|190896642|gb|ACE96834.1| aspartyl protease [Populus tremula]
gi|190896644|gb|ACE96835.1| aspartyl protease [Populus tremula]
gi|190896646|gb|ACE96836.1| aspartyl protease [Populus tremula]
gi|190896648|gb|ACE96837.1| aspartyl protease [Populus tremula]
gi|190896650|gb|ACE96838.1| aspartyl protease [Populus tremula]
gi|190896652|gb|ACE96839.1| aspartyl protease [Populus tremula]
gi|190896654|gb|ACE96840.1| aspartyl protease [Populus tremula]
gi|190896656|gb|ACE96841.1| aspartyl protease [Populus tremula]
gi|190896658|gb|ACE96842.1| aspartyl protease [Populus tremula]
Length = 339
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 83/331 (25%), Positives = 136/331 (41%), Gaps = 56/331 (16%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DT +D WVPC C C + F P+ S++ C+ + C +
Sbjct: 60 MVLDTSNDAAWVPCSG----CTGCSS-------TTFLPNASTTLGSLDCSEAQCSQVRGF 108
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
C +G S+ C F +YG + L +D + +
Sbjct: 109 S-----CPATG--------SSACL----FNQSYGGDSSLAATLVQDAITLAND------V 145
Query: 124 IPKFCFGCVGSTYR---EPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNIS 179
IP F FGC+ + P G+ G GRG +S+ SQ G + G FS+C +FK S
Sbjct: 146 IPGFTFGCINAVSGGSIPPQGLLGLGRGPISLISQAGAMYSGVFSYCLPSFK---SYYFS 202
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
L +G V +++ TP+L++P P+ YY+ L +++G + +P FD
Sbjct: 203 GSLKLGPVG--QPKSIRTTPLLRNPHRPSLYYVNLTGVSVGRIKV-PIPSEQLVFDPNTG 259
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTD 299
G ++DSGT T +P Y + + + + FD C+ +
Sbjct: 260 AGTIIDSGTVITRFVQPVYFAIRDEFRKQVN-----GPISSLGAFDTCFAAT------NE 308
Query: 300 DLFPSITFHFLNNVSLVLPQGNHFYAMSAPS 330
P++T HF ++LVLP N S+ S
Sbjct: 309 AEAPAVTLHF-EGLNLVLPMENSLIHSSSGS 338
>gi|302783200|ref|XP_002973373.1| hypothetical protein SELMODRAFT_98841 [Selaginella moellendorffii]
gi|300159126|gb|EFJ25747.1| hypothetical protein SELMODRAFT_98841 [Selaginella moellendorffii]
Length = 389
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 86/332 (25%), Positives = 154/332 (46%), Gaps = 40/332 (12%)
Query: 69 PCTMSGCSLSTLLKSTCCRPCPSFAY--TYG-----EGGLVTGILTRDTLKVHGSSPGII 121
PC CS + + ST C P S +Y +YG G LV+ I T D+++ + +
Sbjct: 53 PCGSPSCSAFSAV-STSCGPSSSCSYNTSYGTNFSSAGDLVSDIATMDSVRNRKVAANLS 111
Query: 122 REIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFL--QKGFSHCFLAFKYANDPNIS 179
+ G + + G GF +G +S QL L + F +C + +
Sbjct: 112 LGCGRDSGGLL--ELLDTSGFVGFDKGNVSFMGQLSALGYRSKFIYCLPSDTFRGK---- 165
Query: 180 SPLVIGDVAI---SSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
LVIG+ + S ++ +TPM+ +P Y+I L I+I + +VP+ + F S
Sbjct: 166 --LVIGNYKLRNASISSSMAYTPMITNPQAAELYFINLSTISIDKNKF-QVPI--QGFLS 220
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQS-TITYYPRAKEVEERTGFDLCYRVPCPNN 295
G GG ++D+ T ++L FY+QL+ +++ T + V + G +LCY + ++
Sbjct: 221 NGTGGTVIDTTTFLSYLTSDFYTQLVQAIKNYTTNLVEVSSSVADALGVELCYNISANSD 280
Query: 296 TFTDDLFP---SITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPS- 351
FP ++T+HFL + + + ++ + + + C+ + GP+
Sbjct: 281 ------FPPPATLTYHFLGGAGV---EVSTWFLLDDSDSVNNTICMAIGRSES--VGPNL 329
Query: 352 GVFGSFQQQNVEVVYDLEKERIGFQPMDCAST 383
V G++QQ ++ V YDLE+ R GF C +T
Sbjct: 330 NVIGTYQQLDLTVEYDLEQMRYGFGAQGCNTT 361
>gi|30682289|ref|NP_187876.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|11994412|dbj|BAB02414.1| chloroplast nucleoid DNA binding protein-like [Arabidopsis
thaliana]
gi|332641715|gb|AEE75236.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 461
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 100/387 (25%), Positives = 158/387 (40%), Gaps = 55/387 (14%)
Query: 3 QVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHS 62
+V +DTGS+LTWV C + D+ R F S S C + C
Sbjct: 120 RVVVDTGSELTWVNC---RYRARGKDNRRV------FRADESKSFKTVGCLTQTC----- 165
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCC----RPCPSFAYTYGEGGLVTGILTRDTLKVHGSSP 118
L L T C PC S+ Y Y +G G+ ++T+ V G +
Sbjct: 166 -----------KVDLMNLFSLTTCPTPSTPC-SYDYRYADGSAAQGVFAKETITV-GLTN 212
Query: 119 GIIREIPKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYA 173
G + +P GC G +++ G+ G S S L FS+C + +
Sbjct: 213 GRMARLPGHLIGCSSSFTGQSFQGADGVLGLAFSDFSFTSTATSLYGAKFSYCLV--DHL 270
Query: 174 NDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLRE 233
++ N+S+ L+ G + S+K + T L P +Y I + I++G L ++P +
Sbjct: 271 SNKNVSNYLIFGS-SRSTKTAFRRTTPLDLTRIPPFYAINVIGISLGYDML-DIPSQV-- 326
Query: 234 FDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCP 293
+D+ GG ++DSGT+ T L + Y Q+++ L + R K E + C+
Sbjct: 327 WDATSGGGTILDSGTSLTLLADAAYKQVVTGLARYLVELKRVK--PEGVPIEYCFSF--- 381
Query: 294 NNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGV 353
+ F P +TFH L + P + +AP VKCL F S + V
Sbjct: 382 TSGFNVSKLPQLTFH-LKGGARFEPHRKSYLVDAAP----GVKCLGFVS---AGTPATNV 433
Query: 354 FGSFQQQNVEVVYDLEKERIGFQPMDC 380
G+ QQN +DL + F P C
Sbjct: 434 IGNIMQQNYLWEFDLMASTLSFAPSAC 460
>gi|449482385|ref|XP_004156266.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 491
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 101/397 (25%), Positives = 170/397 (42%), Gaps = 72/397 (18%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSN-FSPSRSSSSSRDTCASSFCLNIHS 62
V +DTGSD+ WV C C C D + N F ++SSS+ C C + +
Sbjct: 99 VQIDTGSDILWVTCS----PCDGCPDSSGLGIELNLFDTTKSSSARVLPCTDPICAAVST 154
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLK---------V 113
+ D C L ++ C S+++ Y + +G D++ +
Sbjct: 155 TT---DQC---------LTQTDHC----SYSFHYRDRSGTSGFYVTDSMHFDILLGESTI 198
Query: 114 HGSSPGIIREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQL---GFLQKGFSHCFLAF 170
SS I+ + +G + + GI GFG+G SV SQL G K FSHC
Sbjct: 199 ANSSATIVFGCSIYQYGDLTRATKALDGIFGFGQGEFSVISQLSSRGITPKVFSHCL--- 255
Query: 171 KYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPN--YYYIGLEAITIGNSSL---T 225
N LV+G++ S ++ SP+ P+ +Y + L++I + T
Sbjct: 256 --KGGENGGGILVLGEILEPS--------IVYSPLIPSQPHYTLKLQSIALSGQLFPNPT 305
Query: 226 EVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFD 285
P+S G ++DSGTT +L E Y ++S++ S ++ A R
Sbjct: 306 MFPIS-------NAGETIIDSGTTLAYLVEEVYDWIVSVITSAVS--QSATPTISRG--S 354
Query: 286 LCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNH--FYAMSAPSNSSAVKCLLFQSM 343
C+RV D+FP + F+F S+V+ + F ++ + +++ C+ FQ
Sbjct: 355 QCFRVSMS----VADIFPVLRFNFEGIASMVVTPEEYLQFDSIVSCYKFASLWCIGFQKA 410
Query: 344 DDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
+DG + G ++ +VYDL ++RIG+ DC
Sbjct: 411 EDG----LNILGDLVLKDKIIVYDLAQQRIGWANYDC 443
>gi|359478045|ref|XP_002267046.2| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
Length = 502
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 106/398 (26%), Positives = 165/398 (41%), Gaps = 71/398 (17%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKL-MSNFSPSRSSSSSRDTCASSFCLNIHS 62
V +DTGSD+ WV C C +C + + ++ + S + +C FC I+
Sbjct: 113 VQVDTGSDIMWVNC----IQCNECPKKSSLGMELTLYDIKESLTGKLVSCDQDFCYAING 168
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI-- 120
+ MS CS + + Y +G G RD ++ S +
Sbjct: 169 GPPSYCIANMS-CSYTEI---------------YADGSSSFGYFVRDIVQYDQVSGDLET 212
Query: 121 IREIPKFCFGCVG------STYREPIGIAGFGRGALSVPSQL---GFLQKGFSHCFLAFK 171
FGC S+ GI GFG+ S+ SQL G ++K F+HC
Sbjct: 213 TSANGSVIFGCSATQSGDLSSEEALDGILGFGKSNTSMISQLASSGKVRKMFAHCL---- 268
Query: 172 YANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPN--YYYIGLEAITIGNSSLTEVPL 229
+ N IG + + K N +P+ PN +Y + ++A+ +G L L
Sbjct: 269 --DGLNGGGIFAIGHI-VQPKVN-------TTPLVPNQTHYNVNMKAVEVGGYFLN---L 315
Query: 230 SLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFD--LC 287
FD G ++DSGTT +LPE Y QLLS + + +++ T D C
Sbjct: 316 PTDVFDVGDKKGTIIDSGTTLAYLPEVVYDQLLSKI------FSWQSDLKVHTIHDQFTC 369
Query: 288 YRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQS--MDD 345
++ + DD FP++TFHF N SL L H Y S + C+ +Q+ M
Sbjct: 370 FQY----SESLDDGFPAVTFHFEN--SLYLKVHPHEYLFSY----DGLWCIGWQNSGMQS 419
Query: 346 GDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCAST 383
D + G N V+YDLE + IG+ +C+S+
Sbjct: 420 RDRRNITLLGDLALSNKLVLYDLENQVIGWTEYNCSSS 457
>gi|255568540|ref|XP_002525244.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223535541|gb|EEF37210.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 460
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 100/383 (26%), Positives = 159/383 (41%), Gaps = 68/383 (17%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFC--LNIHSS 63
+DTGS L+W+ C C Y ++++ F PS S++ C+SS C L +
Sbjct: 137 LDTGSSLSWL-------QCKPCVVYCHSQVDPLFEPSASNTYRPLYCSSSECSLLKAATL 189
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
++P CT SG + T +YG+ G L+RD L + S +
Sbjct: 190 NDPL--CTASGVCVYTA--------------SYGDASYSMGYLSRDLLTLTPS-----QT 228
Query: 124 IPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISS 180
+P F +GC + + GI G R LS+ +QL + G+ AF Y + SS
Sbjct: 229 LPSFTYGCGQDNEGLFGKAAGIVGLARDKLSMLAQLS-PKYGY-----AFSYCLPTSTSS 282
Query: 181 P---LVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
L IG ++ SS +FTPM+++ P+ Y++ L AIT+ P+ + Q
Sbjct: 283 GGGFLSIGKISPSS---YKFTPMIRNSQNPSLYFLRLAAITVAGR-----PVGVAAAGYQ 334
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTF 297
++DSGT T LP Y+ L ++ R ++ + D C++ + +
Sbjct: 335 VP--TIIDSGTVVTRLPISIYAALREAFVKIMSR--RYEQAPAYSILDTCFKGSLKSMSG 390
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSF 357
P I F L L N + CL F S + + G+
Sbjct: 391 A----PEIRMIFQGGADLSLRAPNILI-----EADKGIACLAFASSNQ-----IAIIGNH 436
Query: 358 QQQNVEVVYDLEKERIGFQPMDC 380
QQQ + YD+ +IGF P C
Sbjct: 437 QQQTYNIAYDVSASKIGFAPGGC 459
>gi|224093400|ref|XP_002309912.1| predicted protein [Populus trichocarpa]
gi|222852815|gb|EEE90362.1| predicted protein [Populus trichocarpa]
Length = 382
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 98/380 (25%), Positives = 161/380 (42%), Gaps = 62/380 (16%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+D+GSD+ WV C C C Y + F P+ S+S +C+S+ C
Sbjct: 60 IDSGSDIVWVQCK----PCTQC--YHQTDPL--FDPADSASFMGVSCSSAVC-------- 103
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
D +GC+ S CR + +YG+G G L +TL + ++R +
Sbjct: 104 --DRVENAGCN------SGRCR----YEVSYGDGSYTKGTLALETLTFGRT---VVRNVA 148
Query: 126 KFCFGCVGSTYREPIGIAGFGRGALSVPSQL-GFLQKGFSHCFLAFKYANDPNISSPLVI 184
C + G+ G G G++S QL G FS+C ++ N + L
Sbjct: 149 IGCGHSNRGMFVGAAGLLGLGGGSMSFMGQLSGQTGNAFSYCLVS----RGTNTNGFLEF 204
Query: 185 GDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLS--LREFDSQGNGGL 242
G A+ + P++++P P++YYI L + +G+ T VP+S + + + G+GG+
Sbjct: 205 GSEAMPV--GAAWIPLVRNPRAPSFYYIRLLGLGVGD---TRVPVSEDVFQLNELGSGGV 259
Query: 243 LVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDLF 302
++D+GT T P Y + PRA V FD CY + F
Sbjct: 260 VMDTGTAVTRFPTVAYEAFRNAFIEQTQNLPRASGVSI---FDTCYNL----FGFLSVRV 312
Query: 303 PSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSG--VFGSFQQQ 360
P+++F+F L +P N P + + C F PSG + G+ QQ+
Sbjct: 313 PTVSFYFSGGPILTIPANNFLI----PVDDAGTFCFAFAP------SPSGLSILGNIQQE 362
Query: 361 NVEVVYDLEKERIGFQPMDC 380
+++ D E +GF P C
Sbjct: 363 GIQISVDEANEFVGFGPNIC 382
>gi|17979392|gb|AAL49921.1| unknown protein [Arabidopsis thaliana]
Length = 439
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 100/387 (25%), Positives = 158/387 (40%), Gaps = 55/387 (14%)
Query: 3 QVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHS 62
+V +DTGS+LTWV C + D+ R F S S C + C
Sbjct: 98 RVVVDTGSELTWVNC---RYRARGKDNRRV------FRADESKSFKTVGCLTQTC----- 143
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCC----RPCPSFAYTYGEGGLVTGILTRDTLKVHGSSP 118
L L T C PC S+ Y Y +G G+ ++T+ V G +
Sbjct: 144 -----------KVDLMNLFSLTTCPTPSTPC-SYDYRYADGSAAQGVFAKETITV-GLTN 190
Query: 119 GIIREIPKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYA 173
G + +P GC G +++ G+ G S S L FS+C + +
Sbjct: 191 GRMARLPGHLIGCSSSFTGQSFQGADGVLGLAFSDFSFTSTATSLYGAKFSYCLV--DHL 248
Query: 174 NDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLRE 233
++ N+S+ L+ G + S+K + T L P +Y I + I++G L ++P +
Sbjct: 249 SNKNVSNYLIFGS-SRSTKTAFRRTTPLDLTRIPPFYAINVIGISLGYDML-DIPSQV-- 304
Query: 234 FDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCP 293
+D+ GG ++DSGT+ T L + Y Q+++ L + R K E + C+
Sbjct: 305 WDATSGGGTILDSGTSLTLLADAAYKQVVTGLARYLVELKRVK--PEGVPIEYCFSF--- 359
Query: 294 NNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGV 353
+ F P +TFH L + P + +AP VKCL F S + V
Sbjct: 360 TSGFNVSKLPQLTFH-LKGGARFEPHRKSYLVDAAP----GVKCLGFVS---AGTPATNV 411
Query: 354 FGSFQQQNVEVVYDLEKERIGFQPMDC 380
G+ QQN +DL + F P C
Sbjct: 412 IGNIMQQNYLWEFDLMASTLSFAPSAC 438
>gi|449509162|ref|XP_004163513.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 417
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 105/387 (27%), Positives = 166/387 (42%), Gaps = 58/387 (14%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DTGSDLTWV C C C Y + + F+PS SSS C S C+ + +
Sbjct: 79 LIVDTGSDLTWVQC----LPCRLC--YNQQEPL--FNPSNSSSFLSLPCNSPTCVALQPT 130
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
C+ ST C + YG+G G L + L + + E
Sbjct: 131 AGSSGLCSNK--------NSTSC----DYQIDYGDGSYSRGELGFEKLTLGKT------E 172
Query: 124 IPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGFL-QKGFSHCFLAFKYANDPNIS 179
I F FGC + + G+ G R LS+ SQ L FS+C + S
Sbjct: 173 IDNFIFGCGRNNKGLFGGASGLMGLARSELSLVSQTSSLFGSVFSYCLPTTGVGS----S 228
Query: 180 SPLVIGDVAISSKDNLQ---FTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
L +G S+ N+ +T M+++P N+Y++ L I+IG +L LS E
Sbjct: 229 GSLTLGGADFSNFKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVPRLSSNE--- 285
Query: 237 QGNGGL-LVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNN 295
G L L+DSGT T L Y + + + Y RT C N
Sbjct: 286 ---GVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGY--------RTTPGFSILNTCFNL 334
Query: 296 TFTDDL-FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVF 354
T +++ P++ F F N +++ FY + S++S + CL F S+ D + +
Sbjct: 335 TGYEEVNIPTVKFIFEGNAEMIVDVEGVFYFV--KSDASQI-CLAFASLGYED--QTMII 389
Query: 355 GSFQQQNVEVVYDLEKERIGFQPMDCA 381
G++QQ+N V+Y+ ++ ++GF C+
Sbjct: 390 GNYQQKNQRVIYNSKESKVGFAGEPCS 416
>gi|54290731|dbj|BAD62401.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 521
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 103/388 (26%), Positives = 153/388 (39%), Gaps = 63/388 (16%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V +DTGSDL+WV C C + Y + F PS SSS + C S C + +
Sbjct: 186 VLIDTGSDLSWVQC----KPCGAGECYAQKDPL--FDPSSSSSYASVPCDSDACRKLAAG 239
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
GC+ + + C + YG TG+ + +TL + PG++
Sbjct: 240 ------AYGHGCTGVSGGAAALCE----YGIEYGNRATTTGVYSTETLTLK---PGVV-- 284
Query: 124 IPKFCFGCVG---STYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLA-------FKY 172
+ F FGC Y + G+ G G S+ SQ G FS+C
Sbjct: 285 VADFGFGCGDHQHGPYEKFDGLLGLGGAPESLVSQTSSQFGGPFSYCLPPTSGGAGFLTL 344
Query: 173 ANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLR 232
PN SS ++ L FTPM + P P +Y + L I++G + L P +
Sbjct: 345 GAPPNSSS--------STAASGLSFTPMRRLPSVPTFYIVTLTGISVGGAPLAIPPSAFS 396
Query: 233 EFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPC 292
G+++DSGT T LP Y+ L S +S ++ Y R D CY
Sbjct: 397 S-------GMVIDSGTVITGLPATAYAALRSAFRSAMSEY-RLLPPSNGGVLDTCYDFTG 448
Query: 293 PNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSG 352
N P+I+ F ++ L +AP+ CL F G G
Sbjct: 449 HANV----TVPTISLTFSGGATIDL---------AAPAGVLVDGCLAFAGA--GTDNAIG 493
Query: 353 VFGSFQQQNVEVVYDLEKERIGFQPMDC 380
+ G+ Q+ EV+YD K +GF+ C
Sbjct: 494 IIGNVNQRTFEVLYDSGKGTVGFRAGAC 521
>gi|413951979|gb|AFW84628.1| putative aspartic protease family protein [Zea mays]
Length = 435
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 108/408 (26%), Positives = 167/408 (40%), Gaps = 68/408 (16%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + +DTGS+L+W+ C D +R P S++ + C S+ C
Sbjct: 74 VTMVLDTGSELSWLLCATGRAAAAAADSFR---------PRASATFAAVPCGSARC---S 121
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
S D P P S S CR + +Y +G G L D V G +P +
Sbjct: 122 SRDLPAPP--------SCDAASRRCR----VSLSYADGSASDGALATDVFAV-GDAPPL- 167
Query: 122 REIPKFCFGCVGSTYREP------IGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYAND 175
+ FGC+ + Y G+ G RGALS +Q + FS+C +D
Sbjct: 168 ----RSAFGCMSAAYDSSPDAVATAGLLGMNRGALSFVTQAS--TRRFSYCI------SD 215
Query: 176 PNISSPLVIGDVAISSKDNLQFTPMLKSPMY------PNY----YYIGLEAITIGNSSLT 225
+ + L++G +L F P+ +P+Y P + Y + L I +G L
Sbjct: 216 RDDAGVLLLG------HSDLPFLPLNYTPLYQPTPPLPYFDRVAYSVQLLGIRVGGKPL- 268
Query: 226 EVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLS-ILQSTITYYPRAKE--VEERT 282
+P S+ D G G +VDSGT +T L YS + + L+ T P ++ +
Sbjct: 269 PIPPSVLAPDHTGAGQTMVDSGTQFTFLLGDAYSAVKAEFLKQTKPLLPALEDPSFAFQE 328
Query: 283 GFDLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSS-AVKCLLFQ 341
FD C+RVP + L P +T F N + + Y + + V CL F
Sbjct: 329 AFDTCFRVPKGRPPPSARL-PPVTLLF-NGAQMSVAGDRLLYKVPGERRGADGVWCLTFG 386
Query: 342 SMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASAQGL 389
+ D + V G Q N+ V YDLE+ R+G P+ C + GL
Sbjct: 387 NADMVPLT-AYVIGHHHQMNLWVEYDLERGRVGLAPVKCDVASERLGL 433
>gi|79407941|ref|NP_188636.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|75273243|sp|Q9LHE3.1|ASPG2_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 2;
Short=AtASPG2; Flags: Precursor
gi|11994777|dbj|BAB03167.1| nucleoid chloroplast DNA-binding protein-like [Arabidopsis
thaliana]
gi|28392860|gb|AAO41867.1| unknown protein [Arabidopsis thaliana]
gi|332642798|gb|AEE76319.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 470
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 97/380 (25%), Positives = 156/380 (41%), Gaps = 62/380 (16%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+D+GSD+ WV C C C Y+ + + F P++S S + +C SS C I +S
Sbjct: 148 IDSGSDMVWVQCQ----PCKLC--YKQSDPV--FDPAKSGSYTGVSCGSSVCDRIENSG- 198
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
C GC + YG+G G L +TL + ++R +
Sbjct: 199 ----CHSGGCRYEVM---------------YGDGSYTKGTLALETLTFAKT---VVRNVA 236
Query: 126 KFCFGCVGSTYREPIGIAGFGRGALSVPSQL-GFLQKGFSHCFLAFKYANDPNISSPLVI 184
C + G+ G G G++S QL G F +C ++ + + LV
Sbjct: 237 MGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLSGQTGGAFGYCLVS----RGTDSTGSLVF 292
Query: 185 GDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFD--SQGNGGL 242
G A+ + P++++P P++YY + +PL FD G+GG+
Sbjct: 293 GREALPV--GASWVPLVRNPRAPSFYY---VGLKGLGVGGVRIPLPDGVFDLTETGDGGV 347
Query: 243 LVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDLF 302
++D+GT T LP Y +S PRA V FD CY + + F
Sbjct: 348 VMDTGTAVTRLPTAAYVAFRDGFKSQTANLPRASGVSI---FDTCYDL----SGFVSVRV 400
Query: 303 PSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSG--VFGSFQQQ 360
P+++F+F L LP N P + S C F + P+G + G+ QQ+
Sbjct: 401 PTVSFYFTEGPVLTLPARNFL----MPVDDSGTYCFAFAA------SPTGLSIIGNIQQE 450
Query: 361 NVEVVYDLEKERIGFQPMDC 380
++V +D +GF P C
Sbjct: 451 GIQVSFDGANGFVGFGPNVC 470
>gi|449436215|ref|XP_004135889.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 496
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 105/387 (27%), Positives = 166/387 (42%), Gaps = 58/387 (14%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DTGSDLTWV C C C Y + + F+PS SSS C S C+ + +
Sbjct: 158 LIVDTGSDLTWVQC----LPCRLC--YNQQEPL--FNPSNSSSFLSLPCNSPTCVALQPT 209
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
C+ ST C + YG+G G L + L + + E
Sbjct: 210 AGSSGLCSNK--------NSTSC----DYQIDYGDGSYSRGELGFEKLTLGKT------E 251
Query: 124 IPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGFL-QKGFSHCFLAFKYANDPNIS 179
I F FGC + + G+ G R LS+ SQ L FS+C + S
Sbjct: 252 IDNFIFGCGRNNKGLFGGASGLMGLARSELSLVSQTSSLFGSVFSYCLPTTGVGS----S 307
Query: 180 SPLVIGDVAISSKDNLQ---FTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
L +G S+ N+ +T M+++P N+Y++ L I+IG +L LS E
Sbjct: 308 GSLTLGGADFSNFKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVPRLSSNE--- 364
Query: 237 QGNGGL-LVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNN 295
G L L+DSGT T L Y + + + Y RT C N
Sbjct: 365 ---GVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGY--------RTTPGFSILNTCFNL 413
Query: 296 TFTDDL-FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVF 354
T +++ P++ F F N +++ FY + S++S + CL F S+ D + +
Sbjct: 414 TGYEEVNIPTVKFIFEGNAEMIVDVEGVFYFV--KSDASQI-CLAFASLGYED--QTMII 468
Query: 355 GSFQQQNVEVVYDLEKERIGFQPMDCA 381
G++QQ+N V+Y+ ++ ++GF C+
Sbjct: 469 GNYQQKNQRVIYNSKESKVGFAGEPCS 495
>gi|242084336|ref|XP_002442593.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
gi|241943286|gb|EES16431.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
Length = 482
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 111/386 (28%), Positives = 154/386 (39%), Gaps = 70/386 (18%)
Query: 7 DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDNP 66
D GSD+TW+ C F C N+L +SSS+S C + C + SS
Sbjct: 148 DMGSDVTWLQC-MPCFRCYHQPGPVYNRL-------KSSSASDVGCYAPACRALGSS--- 196
Query: 67 FDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIPK 126
GC + C + YG+G G +TL PG+ +P
Sbjct: 197 ------GGC---VQFLNEC-----QYKVEYGDGSSSAGDFGVETLTF---PPGV--RVPG 237
Query: 127 FCFGCVGSTYR-----EPIGIAGFGRGALSVPSQL-GFLQKGFSHCFLAFKYANDPNISS 180
GC GS + GI G GRG+LS PSQ+ G + FS+C SS
Sbjct: 238 VAIGC-GSDNQGLFPAPAAGILGLGRGSLSFPSQIAGRYGRSFSYCLAG---QGTGGRSS 293
Query: 181 PLVIGDVAISSKDNLQFTP---MLKSPMYPNYYYIGLEAITIGNSSLTEVPLS-LREFDS 236
L G A ++ ML + +YY+GL I++G + V S LR S
Sbjct: 294 TLTFGSGASATTTTTTPPSFTPMLTNSRMYTFYYVGLVGISVGGVRVRGVTESDLRLDPS 353
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTG---------FDLC 287
G+GG++VDSGT T L P Y+ R V+E FD C
Sbjct: 354 TGHGGVIVDSGTAVTRLSGPAYAAFRDAF--------RVAAVKELGWPSPGGPFAFFDTC 405
Query: 288 YRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGD 347
Y P+++ HF V + LP N Y + SN + C F GD
Sbjct: 406 Y---SSVRGRVMKKVPAVSMHFAGGVEVKLPPQN--YLIPVDSNKGTM-CFAFAG--SGD 457
Query: 348 YGPSGVFGSFQQQNVEVVYDLEKERI 373
G S + G+ Q Q VVYD++ +R+
Sbjct: 458 RGVS-IIGNIQLQGFRVVYDVDGQRV 482
>gi|357502759|ref|XP_003621668.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355496683|gb|AES77886.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 481
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 102/398 (25%), Positives = 167/398 (41%), Gaps = 63/398 (15%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKL-MSNFSPSRSSSSSRDTCASSFCLNIHSSD 64
+DTG+D+ WV C C +C N + ++ ++ SSS C C I+
Sbjct: 90 VDTGTDMMWVNC----IQCKECPTRSNLGMDLTLYNIKESSSGKLVPCDQELCKEINGG- 144
Query: 65 NPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDT---------LKVHG 115
++GC+ S CP + YG+G G +D LK
Sbjct: 145 ------LLTGCT------SKTNDSCP-YLEIYGDGSSTAGYFVKDVVLFDQVSGDLKTAS 191
Query: 116 SSPGIIREIPKFCFGCVGSTYREPI-GIAGFGRGALSVPSQL---GFLQKGFSHCFLAFK 171
++ +I G + + E + GI GFG+ S+ SQL G ++K F+HC
Sbjct: 192 ANGSVIFGCGARQSGDLSYSNEEALDGILGFGKANYSMISQLSSSGKVKKMFAHCL---- 247
Query: 172 YANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSL 231
N N IG V + + TP+L P P +Y + + AI +G++ L +
Sbjct: 248 --NGVNGGGIFAIGHVV---QPTVNTTPLL--PDQP-HYSVNMTAIQVGHTFLNLSTDAS 299
Query: 232 REFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVP 291
+ DS+G ++DSGTT +LP+ Y L+ + S +E T F V
Sbjct: 300 EQRDSKGT---IIDSGTTLAYLPDGIYQPLVYKILSQQPNLKVQTLHDEYTCFQYSGSV- 355
Query: 292 CPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQS--MDDGDYG 349
DD FP++TF+F N +SL + ++ + S + C+ +Q+ D
Sbjct: 356 -------DDGFPNVTFYFENGLSLKVYPHDYLFL------SENLWCIGWQNSGAQSRDSK 402
Query: 350 PSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASAQ 387
+ G N V YDLE + IG+ +C+S+ +
Sbjct: 403 NMTLLGDLVLSNKLVFYDLENQVIGWTEYNCSSSIKVR 440
>gi|190896608|gb|ACE96817.1| aspartyl protease [Populus tremula]
Length = 339
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 83/331 (25%), Positives = 136/331 (41%), Gaps = 56/331 (16%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DT +D WVPC C C + F P+ S++ C+ + C +
Sbjct: 60 MVLDTSNDAAWVPCSG----CTGCSS-------TTFLPNASTTLGSLDCSEAQCSQVRGF 108
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
C +G S+ C F +YG + L +D + +
Sbjct: 109 S-----CPATG--------SSACL----FNQSYGGDSSLAATLVQDAITLAND------V 145
Query: 124 IPKFCFGCVGSTYR---EPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNIS 179
IP F FGC+ + P G+ G GRG +S+ SQ G + G FS+C +FK S
Sbjct: 146 IPGFTFGCINAVSGGSIPPQGLLGLGRGPISLISQAGAMYSGVFSYCLPSFKSYY---FS 202
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
L +G V +++ TP+L++P P+ YY+ L +++G + +P FD
Sbjct: 203 GSLKLGPVG--QPKSIRTTPLLRNPHRPSLYYVNLTGVSVGRIKV-PIPSEQLVFDPNTG 259
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTD 299
G ++DSGT T +P Y + + + + FD C+ +
Sbjct: 260 AGTIIDSGTVITRFVQPVYFAIRDEFRKQVN-----GPISSLGAFDTCFA------ETNE 308
Query: 300 DLFPSITFHFLNNVSLVLPQGNHFYAMSAPS 330
P++T HF ++LVLP N S+ S
Sbjct: 309 AEAPAVTLHF-EGLNLVLPMENSLIHSSSGS 338
>gi|357143657|ref|XP_003573000.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Brachypodium distachyon]
Length = 464
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 107/385 (27%), Positives = 162/385 (42%), Gaps = 74/385 (19%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+++DTGSD++W+ C + +D P SS+ + +C++ C +
Sbjct: 146 MFIDTGSDVSWLRCKSRLYD-----------------PGTSSTYAPFSCSAPACAQLGRR 188
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
+GCS STC ++ YG+G TG DTL + G+S +I
Sbjct: 189 G--------TGCSSG----STCV-----YSVKYGDGSNTTGTYGSDTLTLAGTSEPLIS- 230
Query: 124 IPKFCFGC--VGSTYRE--PIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDP--N 177
F FGC V + E G+ G G A S SQ AF Y P N
Sbjct: 231 --GFQFGCSAVEHGFEEDNTDGLMGLGGDAQSFVSQTAATYGS------AFSYCLPPTWN 282
Query: 178 ISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
S L +G + S+ TPML+S +Y + L I++G +L E+P S+
Sbjct: 283 SSGFLTLGAPSSSTSAAFSTTPMLRSKQAATFYGLLLRGISVGGKTL-EIPSSVF----- 336
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPC--PNN 295
+ G +VDSGT T LP Y L + + + Y + + R D C+ N
Sbjct: 337 -SAGSIVDSGTVITRLPPTAYGALSAAFRDGMARY-QYQPAAPRGLLDTCFDFTGHGEGN 394
Query: 296 TFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFG 355
FT + +V+LVL G P+ CL F + DD G +G+ G
Sbjct: 395 NFT-----------VPSVALVLDGGAVVDLH--PNGIVQDGCLAFAATDDD--GRTGIIG 439
Query: 356 SFQQQNVEVVYDLEKERIGFQPMDC 380
+ QQ+ EV+YD+ + GF+P C
Sbjct: 440 NVQQRTFEVLYDVGQSVFGFRPGAC 464
>gi|148907752|gb|ABR17002.1| unknown [Picea sitchensis]
Length = 454
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 103/400 (25%), Positives = 169/400 (42%), Gaps = 63/400 (15%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNF-SPSRSSSSSRDTCASSFCLNIHS 62
V +DTGSD+ WV C C C + NF P SS++S +C S C+
Sbjct: 56 VQIDTGSDILWVNCK----PCNACPLTSGLGVALNFFDPRGSSTASPLSCIDSKCV---- 107
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCC--RPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
S + + +S C R C +++ YG+G G D +
Sbjct: 108 -------------SSNQISESVCTTDRYC-GYSFEYGDGSGTLGYYVSDEFDYNQYVNQY 153
Query: 121 I--REIPKFCFGC-------VGSTYREPIGIAGFGRGALSVPSQL---GFLQKGFSHCFL 168
+ K FGC + R GI GFG+ LSV SQL G K FSHC
Sbjct: 154 VTNNASAKITFGCSYNQSGDLTKPDRAVDGIFGFGQNDLSVVSQLNSQGLAPKIFSHCL- 212
Query: 169 AFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVP 228
DP LV+G++ ++ + +TP++ P P +Y + L+ I + L+ P
Sbjct: 213 ---EGADPG-GGILVLGEI---TEPGMVYTPIV--PSQP-HYNLNLQGIAVNGQQLSIDP 262
Query: 229 LSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCY 288
+ F + G ++D GTT +L E Y ++ + + ++ + ++ F +
Sbjct: 263 ---QVFATTNTRGTIIDCGTTLAYLAEEAYEPFVNTIIAAVSQSTQPFMLKGNPCFLTVH 319
Query: 289 RVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQS--MDDG 346
+ D++FPS+T +F + L ++ +P +SS V C+ +Q
Sbjct: 320 SI--------DEIFPSVTLYF-EGAPMDLKPKDYLIQQLSP-DSSPVWCIGWQKSGQQAT 369
Query: 347 DYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASA 386
D + G ++ VYDLE +RIG+ DC+ST +
Sbjct: 370 DSSKMTILGDLVLKDKVFVYDLENQRIGWTSFDCSSTVNV 409
>gi|302143530|emb|CBI22091.3| unnamed protein product [Vitis vinifera]
Length = 360
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 90/309 (29%), Positives = 136/309 (44%), Gaps = 36/309 (11%)
Query: 87 RPCPSFAYTYGEGGLVTGILTRDTLKVH---GSSPGIIREIPKFCFGC---VGSTYREPI 140
+ CP + Y YG+ TG +T V+ S +R + FGC +
Sbjct: 72 QTCP-YYYWYGDSSNTTGDFALETFTVNLTMSSGKPELRRVENVMFGCGHWNRGLFHGAA 130
Query: 141 GIAGFGRGALSVPSQLGFLQ-KGFSHCFLAFKYANDPNISSPLVIG-DVAISSKDNLQFT 198
G+ G GRG LS SQL L FS+C + +D N+SS L+ G D + S L FT
Sbjct: 131 GLLGLGRGPLSFSSQLQSLYGHSFSYCLV--DRNSDANVSSKLIFGEDKDLLSHPELNFT 188
Query: 199 PMLKSPMYP--NYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGGLLVDSGTTYTHLPEP 256
++ P +YY+ +++I +G + +P + + G+GG ++DSGTT ++ EP
Sbjct: 189 TLVAGKENPVDTFYYVQIKSIVVG-GEVVNIPEEKWQIATDGSGGTIIDSGTTLSYFAEP 247
Query: 257 FYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTD--DLFPSITFHFLNNVS 314
Y + + + YP K D PC N T + DL P F +
Sbjct: 248 AYQVIKEAFMAKVKGYPVVK--------DFPVLEPCYNVTGVEQPDL-PDFGIVFSDGAV 298
Query: 315 LVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSG--VFGSFQQQNVEVVYDLEKER 372
P N+F + V CL PS + G++QQQN ++YD +K R
Sbjct: 299 WNFPVENYFIEIEP----REVVCLAILGTP-----PSALSIIGNYQQQNFHILYDTKKSR 349
Query: 373 IGFQPMDCA 381
+GF P CA
Sbjct: 350 LGFAPTKCA 358
>gi|222635451|gb|EEE65583.1| hypothetical protein OsJ_21095 [Oryza sativa Japonica Group]
Length = 441
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 103/388 (26%), Positives = 153/388 (39%), Gaps = 63/388 (16%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V +DTGSDL+WV C C + Y + F PS SSS + C S C + +
Sbjct: 106 VLIDTGSDLSWVQC----KPCGAGECYAQKDPL--FDPSSSSSYASVPCDSDACRKLAAG 159
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
GC+ + + C + YG TG+ + +TL + PG++
Sbjct: 160 ------AYGHGCTGVSGGAAALCE----YGIEYGNRATTTGVYSTETLTLK---PGVV-- 204
Query: 124 IPKFCFGCVG---STYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLA-------FKY 172
+ F FGC Y + G+ G G S+ SQ G FS+C
Sbjct: 205 VADFGFGCGDHQHGPYEKFDGLLGLGGAPESLVSQTSSQFGGPFSYCLPPTSGGAGFLTL 264
Query: 173 ANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLR 232
PN SS ++ L FTPM + P P +Y + L I++G + L P +
Sbjct: 265 GAPPNSSSS--------TAASGLSFTPMRRLPSVPTFYIVTLTGISVGGAPLAIPPSAFS 316
Query: 233 EFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPC 292
G+++DSGT T LP Y+ L S +S ++ Y R D CY
Sbjct: 317 S-------GMVIDSGTVITGLPATAYAALRSAFRSAMSEY-RLLPPSNGGVLDTCYDFTG 368
Query: 293 PNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSG 352
N P+I+ F ++ L +AP+ CL F G G
Sbjct: 369 HANV----TVPTISLTFSGGATIDL---------AAPAGVLVDGCLAFAGA--GTDNAIG 413
Query: 353 VFGSFQQQNVEVVYDLEKERIGFQPMDC 380
+ G+ Q+ EV+YD K +GF+ C
Sbjct: 414 IIGNVNQRTFEVLYDSGKGTVGFRAGAC 441
>gi|212275143|ref|NP_001130306.1| uncharacterized protein LOC100191400 precursor [Zea mays]
gi|194688798|gb|ACF78483.1| unknown [Zea mays]
gi|194703430|gb|ACF85799.1| unknown [Zea mays]
gi|194707192|gb|ACF87680.1| unknown [Zea mays]
gi|223944599|gb|ACN26383.1| unknown [Zea mays]
gi|223948667|gb|ACN28417.1| unknown [Zea mays]
gi|414887962|tpg|DAA63976.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 450
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 102/381 (26%), Positives = 157/381 (41%), Gaps = 64/381 (16%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DT +D +W+PC C C + F P+ S+S C S C +
Sbjct: 129 VDTSNDASWIPCAG----CAGCP----TSSAAPFDPASSASYRTVPCGSPLC-----AQA 175
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
P C G + C F+ TY + L L++D+L V G++ +
Sbjct: 176 PNAACPPGG------------KAC-GFSLTYADSSL-QAALSQDSLAVAGNA------VK 215
Query: 126 KFCFGCV---GSTYREPIGIAGFGRGALSVPSQLG-FLQKGFSHCFLAFKYANDPNISSP 181
+ FGC+ T P G+ G GRG LS SQ + FS+C +FK N S
Sbjct: 216 AYTFGCLQRATGTAAPPQGLLGLGRGPLSFLSQTKDMYEATFSYCLPSFKSL---NFSGT 272
Query: 182 LVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGG 241
L +G ++ TP+L +P + YY+ + I +G + +P FD G
Sbjct: 273 LRLGRNG--QPQRIKTTPLLANPHRSSLYYVNMTGIRVGR-KVVPIP----AFDPATGAG 325
Query: 242 LLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDL 301
++DSGT +T L P Y + ++ + V GFD C+ NT T
Sbjct: 326 TVLDSGTMFTRLVAPAYVAVRDEVRRRV-----GAPVSSLGGFDTCF------NT-TAVA 373
Query: 302 FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQN 361
+P +T F + + + LP+ N S + CL + DG V S QQQN
Sbjct: 374 WPPVTLLF-DGMQVTLPEENVVIH----STYGTISCLAMAAAPDGVNTVLNVIASMQQQN 428
Query: 362 VEVVYDLEKERIGFQPMDCAS 382
V++D+ R+GF C +
Sbjct: 429 HRVLFDVPNGRVGFARERCTA 449
>gi|296090291|emb|CBI40110.3| unnamed protein product [Vitis vinifera]
Length = 408
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 109/388 (28%), Positives = 165/388 (42%), Gaps = 69/388 (17%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNI-HS 62
V +DTGSDL WV C C DC +R + + F PS+SS+ + S C N
Sbjct: 74 VGIDTGSDLLWVQCR----PCADC--FRQSTPI--FDPSKSSTYVDLSYDSPICPNSPQK 125
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIR 122
N + C + +Y +G +G L + + S G +
Sbjct: 126 KYNHLNQCI--------------------YNASYADGSTSSGNLATEDIVFETSDQGTV- 164
Query: 123 EIPKFCFGCVGSTYR-----EPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPN 177
+ FGC G + R + GI G G S+ S+LG FS+C DP+
Sbjct: 165 TVSSVVFGC-GHSNRGRFDGQQSGILGLSAGDQSIVSRLG---SRFSYCIGDLF---DPH 217
Query: 178 IS-SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
+ + LV+GD + F + +YY+ LE I++G + L P + +S
Sbjct: 218 YTHNQLVLGDGVKMEGSSTPF------HTFNGFYYVTLEGISVGETRLDINPEVFQRTES 271
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERT--GFDLCYRVPCPN 294
G GG+++DSGTT T L + + L + +Q + + ++V RT G+ LCY+
Sbjct: 272 -GQGGVVMDSGTTATFLAKDGFDPLSNEIQRLVRGH--FQQVIYRTIPGW-LCYK----- 322
Query: 295 NTFTDDL--FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSG 352
+DL FP + FHF LVL + F + V CL + + G
Sbjct: 323 GRVNEDLRGFPELAFHFAEGADLVLDANSLFV-----QKNQDVFCLAVLESNLKNIGS-- 375
Query: 353 VFGSFQQQNVEVVYDLEKERIGFQPMDC 380
V G QQ+ V YDL +R+ FQ DC
Sbjct: 376 VIGIMAQQHYNVAYDLIGKRVYFQRTDC 403
>gi|242092892|ref|XP_002436936.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
gi|241915159|gb|EER88303.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
Length = 469
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 102/386 (26%), Positives = 155/386 (40%), Gaps = 62/386 (16%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DTGSDL+WV C C Y + F PS SS+ + C S C ++
Sbjct: 137 LLIDTGSDLSWVQCQ----PCNSSTCYPQKDPV--FDPSASSTYAPVPCGSEACRDL--- 187
Query: 64 DNPFDPCTMS-GCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIR 122
DP + + GC+ S+ S C + YG G G+ + +TL + SP
Sbjct: 188 ----DPDSYANGCTNSSSGASLC-----QYGIQYGNGDTTVGVYSTETLTL---SPEAAT 235
Query: 123 EIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQL-----GFLQKGFSHCFLAFKYANDPN 177
+ F FGC G + + G P L G FS+C A +
Sbjct: 236 VVNNFSFGC-GLVQKGVFDLFDGLLGLGGAPESLVSQTTGTYGGAFSYCLPAGN-----S 289
Query: 178 ISSPLVIGDVAISSKDN--LQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFD 235
+ L +G A + QFTP+ + +Y + L I++G L P
Sbjct: 290 TAGFLALGAPATGGNNTAGFQFTPLQV--VETTFYLVKLTGISVGGKQLDIEPTVFA--- 344
Query: 236 SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNN 295
GG+++DSGT T LPE YS L + +S ++ YP ++ D CY N
Sbjct: 345 ----GGMIIDSGTIVTGLPETAYSALRTAFRSAMSAYPLLPPNDDED-LDTCYDFTGNTN 399
Query: 296 TFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQS-MDDGDYGPSGVF 354
P++ F V++ L PS CL F + DGD +G+
Sbjct: 400 V----TVPTVALTFEGGVTIDL---------DVPSGVLLDGCLAFVAGASDGD---TGII 443
Query: 355 GSFQQQNVEVVYDLEKERIGFQPMDC 380
G+ Q+ EV+YD + +GF+ C
Sbjct: 444 GNVNQRTFEVLYDSARGHVGFRAGAC 469
>gi|147786881|emb|CAN62311.1| hypothetical protein VITISV_008929 [Vitis vinifera]
Length = 408
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 111/388 (28%), Positives = 166/388 (42%), Gaps = 69/388 (17%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNI-HS 62
V +DTGSDL WV C C DC +R + + F PS+SS+ + S C N
Sbjct: 74 VGIDTGSDLLWVQCR----PCADC--FRQSTPI--FDPSKSSTYVDLSYDSPICPNSPQK 125
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIR 122
N + C + +Y +G +G L + + S G +
Sbjct: 126 KYNHLNQCI--------------------YNASYADGSTSSGNLATEDIVFETSDQGTV- 164
Query: 123 EIPKFCFGCVGSTYR-----EPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPN 177
+ FGC G + R + GI G G S+ S+LG FS+C DP+
Sbjct: 165 TVSSVVFGC-GHSNRGRFDGQQSGILGLSAGDQSIVSRLG---SRFSYCIGDLF---DPH 217
Query: 178 IS-SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
+ + LV+GD K TP + +YY+ LE I++G + L P + +S
Sbjct: 218 YTHNQLVLGD---GVKMEGSSTPF---HTFNGFYYVTLEGISVGETRLDINPEVFQRTES 271
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERT--GFDLCYRVPCPN 294
G GG+++DSGTT T L + + L + +Q + + ++V RT G+ LCY+
Sbjct: 272 -GQGGVVMDSGTTATFLAKDGFDPLSNEIQRLVRGH--FQQVIYRTIPGW-LCYK----- 322
Query: 295 NTFTDDL--FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSG 352
+DL FP + FHF LVL + F + V CL + + G
Sbjct: 323 GRVNEDLRGFPELAFHFAEGADLVLDANSLFV-----QKNQDVFCLAVLESNLKNIGS-- 375
Query: 353 VFGSFQQQNVEVVYDLEKERIGFQPMDC 380
V G QQ+ V YDL +R+ FQ DC
Sbjct: 376 VIGIMAQQHYNVAYDLIGKRVYFQRTDC 403
>gi|356568507|ref|XP_003552452.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 481
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 105/406 (25%), Positives = 168/406 (41%), Gaps = 74/406 (18%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKL-MSNFSPSRSSSSSRDTCASSFCLNIHS 62
V +DTGSD WV C C C + ++ + P+ S +S C FC + +
Sbjct: 89 VQVDTGSDTLWVNC----VGCTACPKKSGLGMDLTLYDPNLSKTSKAVPCDDEFCTSTYD 144
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIR 122
+SGC T CP ++ TYG+G +G +D L G +R
Sbjct: 145 GQ-------ISGC--------TKGMSCP-YSITYGDGSTTSGSYIKDDLTFD-RVVGDLR 187
Query: 123 EIP---KFCFGC-------VGSTYREPI-GIAGFGRGALSVPSQL---GFLQKGFSHCFL 168
+P FGC + ST + GI GFG+ SV SQL G +++ FSHC
Sbjct: 188 TVPDNTSVIFGCGSKQSGTLSSTTDTSLDGIIGFGQANSSVLSQLAAAGKVKRIFSHCL- 246
Query: 169 AFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVP 228
+ + IG+V + ++ TP+L+ + Y + L+ I + + ++P
Sbjct: 247 -----DSISGGGIFAIGEVV---QPKVKTTPLLQGMAH---YNVVLKDIEVAGDPI-QLP 294
Query: 229 LSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCY 288
+ DS G ++DSGTT +LP Y QLL K + +R+G L Y
Sbjct: 295 SDI--LDSSSGRGTIIDSGTTLAYLPVSIYDQLLE------------KILAQRSGMKL-Y 339
Query: 289 RVPCPNNTF-------TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQ 341
V F DDLFP++ F F ++L ++ + + + Q
Sbjct: 340 LVEDQFTCFHYSDEESVDDLFPTVKFTFEEGLTLTTYPRDYLFLFKEDMWCVGWQKSMAQ 399
Query: 342 SMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASAQ 387
+ D + + G N VVYDL+ IG+ +C+S+ +
Sbjct: 400 TKDGKEL---ILLGDLVLANKLVVYDLDNMAIGWADYNCSSSIKVK 442
>gi|18405138|ref|NP_565911.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
gi|13877759|gb|AAK43957.1|AF370142_1 unknown protein [Arabidopsis thaliana]
gi|15293231|gb|AAK93726.1| unknown protein [Arabidopsis thaliana]
gi|20196976|gb|AAB87120.2| expressed protein [Arabidopsis thaliana]
gi|20197046|gb|AAM14894.1| expressed protein [Arabidopsis thaliana]
gi|330254616|gb|AEC09710.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
Length = 442
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 114/395 (28%), Positives = 170/395 (43%), Gaps = 68/395 (17%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
I + +DTGS+L+W+ C ++ L S F+P SS+ S C+S C
Sbjct: 78 ISMVLDTGSELSWLHC------------KKSPNLGSVFNPVSSSTYSPVPCSSPIC-RTR 124
Query: 62 SSDNPF----DPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSS 117
+ D P DP K+ C A +Y + + G L +T V GS
Sbjct: 125 TRDLPIPASCDP------------KTHLCH----VAISYADATSIEGNLAHETF-VIGS- 166
Query: 118 PGIIREIPKFCFGCVGSTYR-------EPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAF 170
+ R P FGC+ S + G+ G RG+LS +QLGF + FS+C
Sbjct: 167 --VTR--PGTLFGCMDSGLSSNSEEDAKSTGLMGMNRGSLSFVNQLGFSK--FSYCI--- 217
Query: 171 KYANDPNISSPLVIGDVAISSKDNLQFTPM-LKSPMYPNY----YYIGLEAITIGNSSLT 225
+ + S L++GD + S +Q+TP+ L+S P + Y + LE I +G S +
Sbjct: 218 ---SGSDSSGFLLLGDASYSWLGPIQYTPLVLQSTPLPYFDRVAYTVQLEGIRVG-SKIL 273
Query: 226 EVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQL----LSILQSTITYYPRAKEVEER 281
+P S+ D G G +VDSGT +T L P Y+ L ++ +S + V +
Sbjct: 274 SLPKSVFVPDHTGAGQTMVDSGTQFTFLMGPVYTALKNEFITQTKSVLRLVDDPDFVFQG 333
Query: 282 TGFDLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSN-SSAVKCLLF 340
T DLCY+V L P ++ F V Q + A S V C F
Sbjct: 334 T-MDLCYKVGSTTRPNFSGL-PMVSLMFRGAEMSVSGQKLLYRVNGAGSEGKEEVYCFTF 391
Query: 341 QSMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGF 375
+ D + V G QQNV + +DL K R+GF
Sbjct: 392 GNSDLLGI-EAFVIGHHHQQNVWMEFDLAKSRVGF 425
>gi|356523724|ref|XP_003530485.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 488
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 104/401 (25%), Positives = 169/401 (42%), Gaps = 71/401 (17%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKL-MSNFSPSRSSSSSRDTCASSFCLNIHSSD 64
+DTGSD+ WV C C +C + + ++ + SSS C FC I+
Sbjct: 100 VDTGSDIMWVNC----IQCKECPTRSSLGMDLTLYDIKESSSGKLVPCDQEFCKEINGG- 154
Query: 65 NPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREI 124
++GC+ + CP + YG+G G +D + S + +
Sbjct: 155 ------LLTGCTANI--------SCP-YLEIYGDGSSTAGYFVKDIVLYDQVSGDLKTDS 199
Query: 125 PK--FCFGC-------VGSTYREPI-GIAGFGRGALSVPSQL---GFLQKGFSHCFLAFK 171
FGC + S+ E + GI GFG+ S+ SQL G ++K F+HC
Sbjct: 200 ANGSIVFGCGARQSGDLSSSNEEALDGILGFGKANSSMISQLASSGKVKKMFAHCL---- 255
Query: 172 YANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSL 231
N N IG V + + TP+L P P +Y + + A+ +G++ L+ +
Sbjct: 256 --NGVNGGGIFAIGHVV---QPKVNMTPLL--PDQP-HYSVNMTAVQVGHTFLSLSTDTS 307
Query: 232 REFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEV---EERTGFDLCY 288
+ D +G ++DSGTT +LPE Y L + I+ +P K +E T F
Sbjct: 308 AQGDRKGT---IIDSGTTLAYLPEGIYEPL---VYKMISQHPDLKVQTLHDEYTCFQYSE 361
Query: 289 RVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQS--MDDG 346
V DD FP++TF F N +SL + ++ + S C+ +Q+
Sbjct: 362 SV--------DDGFPAVTFFFENGLSLKVYPHDYLFP------SVNFWCIGWQNSGTQSR 407
Query: 347 DYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASAQ 387
D + G N V YDLE + IG+ +C+S+ +
Sbjct: 408 DSKNMTLLGDLVLSNKLVFYDLENQAIGWAEYNCSSSIKVR 448
>gi|224142005|ref|XP_002324351.1| predicted protein [Populus trichocarpa]
gi|222865785|gb|EEF02916.1| predicted protein [Populus trichocarpa]
Length = 472
Score = 88.2 bits (217), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 102/379 (26%), Positives = 153/379 (40%), Gaps = 59/379 (15%)
Query: 7 DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDNP 66
DTGSD+TW C C + R N PS S+S +C+S+ C + S
Sbjct: 149 DTGSDITWTQCEPCVKTCYKQKEPRLN-------PSTSTSYKNISCSSALCKLVASGKKF 201
Query: 67 FDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIPK 126
C+ S C + YG+G G +TL + SS + +
Sbjct: 202 SQSCSSSTCL---------------YQVQYGDGSYSIGFFATETLTL--SSSNVFKN--- 241
Query: 127 FCFGC---VGSTYREPIGIAGFGRGALSVPSQLG-FLQKGFSHCFLAFKYANDPNISSPL 182
F FGC + G+ G GR L++PSQ +K FS+C P SS
Sbjct: 242 FLFGCGQQNNGLFGGAAGLLGLGRTKLALPSQTAKTYKKLFSYCL--------PASSSSK 293
Query: 183 VIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGGL 242
+ +++FTP+ +Y + + +++G L S+ E S + G
Sbjct: 294 GYLSLGGQVSKSVKFTPLSADFDSTPFYGLDITGLSVGGRKL-----SIDE--SAFSAGT 346
Query: 243 LVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDLF 302
++DSGT T L YS+L S Q+ +T YP + FD CY + +
Sbjct: 347 VIDSGTVITRLSPTAYSELSSAFQNLMTDYP---STSGYSIFDTCYDF----SKYDTVRI 399
Query: 303 PSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQNV 362
P + F V + + Y P N CL F DD + +FG+ QQ+
Sbjct: 400 PKVGVTFKGGVEMDIDVSGILY----PVNGLKKVCLAFAGNDDDS--DTSIFGNVQQRTY 453
Query: 363 EVVYDLEKERIGFQPMDCA 381
+VVYD K R+GF P C+
Sbjct: 454 QVVYDGAKGRVGFAPGGCS 472
>gi|357137788|ref|XP_003570481.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 455
Score = 88.2 bits (217), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 99/386 (25%), Positives = 155/386 (40%), Gaps = 74/386 (19%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGS LTW+ C C +R + + F P SSS + +C+S C
Sbjct: 134 VDTGSSLTWLQCSPCRVSC-----HRQSGPV--FDPKTSSSYAAVSCSSPQC-------- 178
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPS----FAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
G S +TL + C PS + +YG+ G L++DT+ +S
Sbjct: 179 -------DGLSTATLNPAVCS---PSNVCIYQASYGDSSFSVGYLSKDTVSFGANS---- 224
Query: 122 REIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLG-FLQKGFSHCFLAFKYANDPN 177
+P F +GC + G+ G R LS+ QL L FS+C P+
Sbjct: 225 --VPNFYYGCGQDNEGLFGRSAGLMGLARNKLSLLYQLAPTLGYSFSYCL--------PS 274
Query: 178 ISSPLVIGDVAISSKD--NLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFD 235
SS G ++I S + +TPM+ + + + Y+I L +T+ L +S E+
Sbjct: 275 TSSS---GYLSIGSYNPGGYSYTPMVSNTLDDSLYFISLSGMTVAGKPLA---VSSSEYT 328
Query: 236 SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNN 295
S ++DSGT T LP Y+ L + + + K + D C+
Sbjct: 329 SLPT---IIDSGTVITRLPTSVYTALSKAVAAAMK--GSTKRAAAYSILDTCFE----GQ 379
Query: 296 TFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFG 355
P+++ F +L L GN + A CL F + + G
Sbjct: 380 ASKLRAVPAVSMAFSGGATLKLSAGNLLVDVDG-----ATTCLAFAPARS-----AAIIG 429
Query: 356 SFQQQNVEVVYDLEKERIGFQPMDCA 381
+ QQQ VVYD++ RIGF C+
Sbjct: 430 NTQQQTFSVVYDVKSNRIGFAAAGCS 455
>gi|15238250|ref|NP_196637.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|8979710|emb|CAB96831.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
thaliana]
gi|18176136|gb|AAL59990.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
gi|22136986|gb|AAM91722.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
gi|110740988|dbj|BAE98588.1| nucleoid DNA-binding protein cnd41 - like protein [Arabidopsis
thaliana]
gi|332004210|gb|AED91593.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 464
Score = 88.2 bits (217), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 104/383 (27%), Positives = 162/383 (42%), Gaps = 67/383 (17%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + DTGSDLTW C C + + F+PS SS+ +C+S C +
Sbjct: 145 LSLVFDTGSDLTWTQCEPCLGSCYSQKEPK-------FNPSSSSTYQNVSCSSPMCEDAE 197
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
S C+ S C S + YG+ G L ++ + S ++
Sbjct: 198 S-------CSASNCVYSIV---------------YGDKSFTQGFLAKEKFTLTNSD--VL 233
Query: 122 REIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPN 177
++ FGC + + G+ G G G LS+P+Q FS+C +F N
Sbjct: 234 EDVY---FGCGENNQGLFDGVAGLLGLGPGKLSLPAQTTTTYNNIFSYCLPSFT----SN 286
Query: 178 ISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
+ L G IS ++++FTP+ P NY I + I++G+ L P +S
Sbjct: 287 STGHLTFGSAGIS--ESVKFTPISSFPSAFNYG-IDIIGISVGDKELAITP------NSF 337
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTF 297
G ++DSGT +T LP Y++L S+ + ++ Y K FD CY +
Sbjct: 338 STEGAIIDSGTVFTRLPTKVYAELRSVFKEKMSSY---KSTSGYGLFDTCYDFTGLDTV- 393
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSF 357
+P+I F F + + L +S P S V CL F DD +FG+
Sbjct: 394 ---TYPTIAFSFAGSTVVELDGS----GISLPIKISQV-CLAFAGNDD----LPAIFGNV 441
Query: 358 QQQNVEVVYDLEKERIGFQPMDC 380
QQ ++VVYD+ R+GF P C
Sbjct: 442 QQTTLDVVYDVAGGRVGFAPNGC 464
>gi|359476754|ref|XP_002277058.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 1 [Vitis
vinifera]
Length = 561
Score = 88.2 bits (217), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 110/407 (27%), Positives = 174/407 (42%), Gaps = 80/407 (19%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKL-MSNFSPSRSSSSSRDTCASSFCLNIHS 62
V +DTGSD+ WV C C C + + ++ + S++S C +FC
Sbjct: 170 VQVDTGSDILWVNCAG----CDRCPTKSDLGVDLTLYDMKASTTSDAVGCDDNFC---SL 222
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIR 122
D P C L ++L YG+G TG +D ++ + S G +
Sbjct: 223 YDGPLPGCKPGLQCLYSVL--------------YGDGSSTTGYFVQDFVQYNRIS-GNFQ 267
Query: 123 EIPK---FCFGC-------VGSTYREPIGIAGFGRGALSVPSQL---GFLQKGFSHCFLA 169
P FGC +GS+ GI GFG+ S+ SQL G ++K FSHC
Sbjct: 268 TTPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCL-- 325
Query: 170 FKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPL 229
++ + IG+V + + TP++++ + Y + ++ I +G L +VP
Sbjct: 326 ----DNVDGGGIFAIGEVV---EPKVNITPLVQNQAH---YNVVMKEIEVGGDPL-DVPS 374
Query: 230 SLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYP--RAKEVEER-TGFDL 286
F+S G ++DSGTT + P+ Y + +++ ++ P R VE+ T FD
Sbjct: 375 D--AFESGDRKGTIIDSGTTLAYFPQEVY---VPLIEKILSQQPDLRLHTVEQAFTCFDY 429
Query: 287 CYRVPCPNNTFTDDLFPSITFHFLNNVSLV------LPQGNHFYAMSAPSNSSAVKCLLF 340
V DD FP++T HF ++SL L Q F NS A
Sbjct: 430 TGNV--------DDGFPTVTLHFDKSISLTVYPHEYLFQVKEFEWCIGWQNSGA------ 475
Query: 341 QSMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASAQ 387
Q+ D D + G N VVYDLEK+ IG+ +C+S+ +
Sbjct: 476 QTKDGKDL---TLLGDLVLSNKLVVYDLEKQGIGWVEYNCSSSIKVK 519
>gi|255545932|ref|XP_002514026.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223547112|gb|EEF48609.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 437
Score = 87.8 bits (216), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 98/381 (25%), Positives = 159/381 (41%), Gaps = 60/381 (15%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DT +D WVPC C C + S+ S S ++ T F
Sbjct: 112 MVLDTSNDAAWVPCSG----CTGCSSTTFSTNTSSTYGSLDCSMAQCTQVRGFS------ 161
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
C +G S+C F +YG + L D+L++ +
Sbjct: 162 ------CPATG-------SSSCV-----FNQSYGGDSSFSATLVEDSLRL------VNDV 197
Query: 124 IPKFCFGCVGSTY---REPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNIS 179
IP F FGC+ S P G+ G GRG LS+ +Q G L G FS+C +FK S
Sbjct: 198 IPNFAFGCINSISGGSVPPQGLLGLGRGPLSLIAQSGSLYSGLFSYCLPSFK---SYYFS 254
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
L +G ++++TP+L++P P+ YY+ L +++G +L + L F+
Sbjct: 255 GSLKLGPAG--QPKSIRYTPLLRNPHRPSLYYVNLTGVSVGR-TLVPIAPELLAFNPNTG 311
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTD 299
G ++DSGT T +P Y+ + + + A FD C+ +
Sbjct: 312 AGTIIDSGTVITRFVQPIYTAIRDEFRKQV-----AGPFSSLGAFDTCFAAT------NE 360
Query: 300 DLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQ 359
+ P++T HF ++LVLP N SA S + CL + + V + QQ
Sbjct: 361 AVAPAVTLHF-TGLNLVLPMENSLIHSSAGS----LACLAMAAAPNNVNSVLNVIANLQQ 415
Query: 360 QNVEVVYDLEKERIGFQPMDC 380
QN+ +++D+ R+G C
Sbjct: 416 QNLRLLFDVPNSRLGIARELC 436
>gi|26451756|dbj|BAC42973.1| unknown protein [Arabidopsis thaliana]
Length = 442
Score = 87.8 bits (216), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 114/395 (28%), Positives = 170/395 (43%), Gaps = 68/395 (17%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
I + +DTGS+L+W+ C ++ L S F+P SS+ S C+S C
Sbjct: 78 ISMVLDTGSELSWLHCK------------KSPNLGSVFNPVSSSTYSPVPCSSPIC-RTR 124
Query: 62 SSDNPF----DPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSS 117
+ D P DP K+ C A +Y + + G L +T V GS
Sbjct: 125 TRDLPIPASCDP------------KTHLCH----VAISYADATSIEGNLAHETF-VIGS- 166
Query: 118 PGIIREIPKFCFGCVGSTYR-------EPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAF 170
+ R P FGC+ S + G+ G RG+LS +QLGF + FS+C
Sbjct: 167 --VTR--PGTLFGCMDSGLSSNSEEDAKSTGLMGMNRGSLSFVNQLGFSK--FSYCI--- 217
Query: 171 KYANDPNISSPLVIGDVAISSKDNLQFTPM-LKSPMYPNY----YYIGLEAITIGNSSLT 225
+ + S L++GD + S +Q+TP+ L+S P + Y + LE I +G S +
Sbjct: 218 ---SGSDSSVFLLLGDASYSWLGPIQYTPLVLQSTPLPYFDRVAYTVQLEGIRVG-SKIL 273
Query: 226 EVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQL----LSILQSTITYYPRAKEVEER 281
+P S+ D G G +VDSGT +T L P Y+ L ++ +S + V +
Sbjct: 274 SLPKSVFVPDHTGAGQTMVDSGTQFTFLMGPVYTALKNEFITQTKSVLRLVDDPDFVFQG 333
Query: 282 TGFDLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSN-SSAVKCLLF 340
T DLCY+V L P ++ F V Q + A S V C F
Sbjct: 334 T-MDLCYKVGSTTRPNFSGL-PMVSLMFRGAEMSVSGQKLLYRVNGAGSEGKEEVYCFTF 391
Query: 341 QSMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGF 375
+ D + V G QQNV + +DL K R+GF
Sbjct: 392 GNSDLLGI-EAFVIGHHHQQNVWMEFDLAKSRVGF 425
>gi|225446261|ref|XP_002265547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 440
Score = 87.8 bits (216), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 111/388 (28%), Positives = 166/388 (42%), Gaps = 69/388 (17%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNI-HS 62
V +DTGSDL WV C C DC +R + + F PS+SS+ + S C N
Sbjct: 106 VGIDTGSDLLWVQCR----PCADC--FRQSTPI--FDPSKSSTYVDLSYDSPICPNSPQK 157
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIR 122
N + C + +Y +G +G L + + S G +
Sbjct: 158 KYNHLNQCIYNA--------------------SYADGSTSSGNLATEDIVFETSDQGTV- 196
Query: 123 EIPKFCFGCVGSTYR-----EPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPN 177
+ FGC G + R + GI G G S+ S+LG FS+C DP+
Sbjct: 197 TVSSVVFGC-GHSNRGRFDGQQSGILGLSAGDQSIVSRLG---SRFSYCIGDLF---DPH 249
Query: 178 IS-SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
+ + LV+GD K TP + +YY+ LE I++G + L P + +S
Sbjct: 250 YTHNQLVLGD---GVKMEGSSTPF---HTFNGFYYVTLEGISVGETRLDINPEVFQRTES 303
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERT--GFDLCYRVPCPN 294
G GG+++DSGTT T L + + L + +Q + + ++V RT G+ LCY+
Sbjct: 304 -GQGGVVMDSGTTATFLAKDGFDPLSNEIQRLVRGH--FQQVIYRTIPGW-LCYK----- 354
Query: 295 NTFTDDL--FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSG 352
+DL FP + FHF LVL + F + V CL + + G
Sbjct: 355 GRVNEDLRGFPELAFHFAEGADLVLDANSLFV-----QKNQDVFCLAVLESNLKNIGS-- 407
Query: 353 VFGSFQQQNVEVVYDLEKERIGFQPMDC 380
V G QQ+ V YDL +R+ FQ DC
Sbjct: 408 VIGIMAQQHYNVAYDLIGKRVYFQRTDC 435
>gi|413953788|gb|AFW86437.1| hypothetical protein ZEAMMB73_618532 [Zea mays]
Length = 469
Score = 87.8 bits (216), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 105/385 (27%), Positives = 155/385 (40%), Gaps = 67/385 (17%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYR-NNKLMSNFSPSRSSSSSRDTCASSFCLNIHS 62
+ +DTGS LTWV C C+ + + + F P+ SSS S C S C + +
Sbjct: 144 LILDTGSSLTWV-------QCKPCNSSQCYPQRLPLFDPNTSSSYSPVPCDSQECRALAA 196
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIR 122
+ D CT G C ++ YG G G + D L + PG I
Sbjct: 197 GID-GDGCTSDG-------DWGC-----AYEIHYGSGATPAGEYSTDALTL---GPGAI- 239
Query: 123 EIPKFCFGCVGSTYREPI----GIAGFGRGALSVPSQLGFLQKG--FSHCFLAFKYANDP 176
+ +F FGC R G+ G GR S+ Q + G FSHC P
Sbjct: 240 -VKRFHFGCGHHQQRGKFDMADGVLGLGRLPQSLAWQASARRGGGVFSHCL-------PP 291
Query: 177 NISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
S + A FTP+L P +Y + AI++ L P RE
Sbjct: 292 TGVSTGFLALGAPHDTSAFVFTPLLTMDDQPWFYQLMPTAISVAGQLLDIPPAVFRE--- 348
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNT 296
G++ DSGT + L E Y+ L + +S + YP A V D C+ N T
Sbjct: 349 ----GVITDSGTVLSALQETAYTALRTAFRSAMAEYPLAPPVGH---LDTCF-----NFT 396
Query: 297 FTDDL-FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFG 355
D++ P+++ F +G + A S CL F S D +Y +G+ G
Sbjct: 397 GYDNVTVPTVSLTF---------RGGATVHLDASSGVLMDGCLAFWSSGD-EY--TGLIG 444
Query: 356 SFQQQNVEVVYDLEKERIGFQPMDC 380
S Q+ +EV+YD+ ++GF+ C
Sbjct: 445 SVSQRTIEVLYDMPGRKVGFRTGAC 469
>gi|297827153|ref|XP_002881459.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297327298|gb|EFH57718.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 507
Score = 87.8 bits (216), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 101/396 (25%), Positives = 174/396 (43%), Gaps = 68/396 (17%)
Query: 3 QVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKL-MSNFSPSRSSSSSRDTCASSFCLNIH 61
V +DTGSD+ WV C + C +C + + F S ++ TC+ C ++
Sbjct: 114 NVQIDTGSDILWVTCSS----CSNCPHSSGLGIDLHFFDAPGSFTAGSVTCSDPICSSVF 169
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTL---KVHGSSP 118
+ T + CS ++ C +++ YG+G +G DT + G S
Sbjct: 170 QT-------TAAQCS-----ENNQC----GYSFRYGDGSGTSGYYMTDTFYFDAILGESL 213
Query: 119 GIIREIPKFCFGCVGSTY---------REPIGIAGFGRGALSVPSQL---GFLQKGFSHC 166
P FGC STY + GI GFG+G LSV SQL G FSHC
Sbjct: 214 VANSSAP-IVFGC--STYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHC 270
Query: 167 FLAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTE 226
D + V+G++ + M+ SP+ P+ + L ++IG +
Sbjct: 271 L-----KGDGSGGGVFVLGEILVPG--------MVYSPLLPSQPHYNLNLLSIGVNGQI- 316
Query: 227 VPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDL 286
+P+ F++ G +VD+GTT T+L + Y L+ + ++++ + + G +
Sbjct: 317 LPIDAAVFEASNTRGTIVDTGTTLTYLVKEAYDPFLNAISNSVS---QLVTLIISNG-EQ 372
Query: 287 CYRVPCPNNTFTDDLFPSITFHFLNNVSLVL-PQGNHFYAMSAPSNSSAVKCLLFQSMDD 345
CY V +T D+FP ++ +F S++L PQ F+ + +++ C+ FQ +
Sbjct: 373 CYLV----STSISDMFPPVSLNFAGGASMMLRPQDYLFHY--GFYDGASMWCIGFQKAPE 426
Query: 346 GDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
+ G ++ VYDL ++RIG+ DC+
Sbjct: 427 ----EQTILGDLVLKDKVFVYDLARQRIGWANYDCS 458
>gi|297735249|emb|CBI17611.3| unnamed protein product [Vitis vinifera]
Length = 480
Score = 87.8 bits (216), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 110/407 (27%), Positives = 174/407 (42%), Gaps = 80/407 (19%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKL-MSNFSPSRSSSSSRDTCASSFCLNIHS 62
V +DTGSD+ WV C C C + + ++ + S++S C +FC
Sbjct: 89 VQVDTGSDILWVNCAG----CDRCPTKSDLGVDLTLYDMKASTTSDAVGCDDNFC---SL 141
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIR 122
D P C L ++L YG+G TG +D ++ + S G +
Sbjct: 142 YDGPLPGCKPGLQCLYSVL--------------YGDGSSTTGYFVQDFVQYNRIS-GNFQ 186
Query: 123 EIPK---FCFGC-------VGSTYREPIGIAGFGRGALSVPSQL---GFLQKGFSHCFLA 169
P FGC +GS+ GI GFG+ S+ SQL G ++K FSHC
Sbjct: 187 TTPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCL-- 244
Query: 170 FKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPL 229
++ + IG+V + + TP++++ + Y + ++ I +G L +VP
Sbjct: 245 ----DNVDGGGIFAIGEVV---EPKVNITPLVQNQAH---YNVVMKEIEVGGDPL-DVPS 293
Query: 230 SLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYP--RAKEVEER-TGFDL 286
F+S G ++DSGTT + P+ Y + +++ ++ P R VE+ T FD
Sbjct: 294 D--AFESGDRKGTIIDSGTTLAYFPQEVY---VPLIEKILSQQPDLRLHTVEQAFTCFDY 348
Query: 287 CYRVPCPNNTFTDDLFPSITFHFLNNVSLV------LPQGNHFYAMSAPSNSSAVKCLLF 340
V DD FP++T HF ++SL L Q F NS A
Sbjct: 349 TGNV--------DDGFPTVTLHFDKSISLTVYPHEYLFQVKEFEWCIGWQNSGA------ 394
Query: 341 QSMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASAQ 387
Q+ D D + G N VVYDLEK+ IG+ +C+S+ +
Sbjct: 395 QTKDGKDL---TLLGDLVLSNKLVVYDLEKQGIGWVEYNCSSSIKVK 438
>gi|18400416|ref|NP_565559.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|20197296|gb|AAM15014.1| predicted protein [Arabidopsis thaliana]
gi|330252412|gb|AEC07506.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 458
Score = 87.8 bits (216), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 106/388 (27%), Positives = 166/388 (42%), Gaps = 63/388 (16%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
MDTGS L W+ C C C ++ + F+P+ SS+ +C FC
Sbjct: 113 MDTGSSLLWIQCQ----PCKHCSS--DHMIHPVFNPALSSTFVECSCDDRFC-------- 158
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
+ P G S + C + Y G G+L ++ L + + P
Sbjct: 159 RYAPNGHCGSS------NKCV-----YEQVYISGTGSKGVLAKERLTFTTPNGNTVVTQP 207
Query: 126 KFCFGCVGSTYREPI-----GIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISS 180
FGC G E + GI G G S+ QLG FS+C AN +
Sbjct: 208 -IAFGC-GYENGEQLESHFTGILGLGAKPTSLAVQLG---SKFSYCIGDL--ANKNYGYN 260
Query: 181 PLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQG-N 239
LV+G+ A D L ++ + YY+ LE I++G++ L P+ F +G
Sbjct: 261 QLVLGEDA----DILGDPTPIEFETENSIYYMNLEGISVGDTQLNIEPVV---FKRRGPR 313
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFD--LCYRVPCPNNTF 297
G+++DSGT YT L + Y +L + ++S + P+ ER F LCY +
Sbjct: 314 TGVILDSGTLYTWLADIAYRELYNEIKSILD--PKL----ERFWFRDFLCY-----HGRV 362
Query: 298 TDDL--FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDD--GDYGPSGV 353
+++L FP +TFHF L + + FY +S P N+ V C+ + + G+Y
Sbjct: 363 SEELIGFPVVTFHFAGGAELAMEATSMFYPLSEP-NTFNVFCMSVKPTKEHGGEYKEFTA 421
Query: 354 FGSFQQQNVEVVYDLEKERIGFQPMDCA 381
G QQ + YDL+++ I Q +DC
Sbjct: 422 IGLMAQQYYNIGYDLKEKNIYLQRIDCV 449
>gi|224142007|ref|XP_002324352.1| predicted protein [Populus trichocarpa]
gi|222865786|gb|EEF02917.1| predicted protein [Populus trichocarpa]
Length = 460
Score = 87.8 bits (216), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 102/380 (26%), Positives = 153/380 (40%), Gaps = 59/380 (15%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
DTGSD+TW C C + R N PS S+S +C+S+ C + S
Sbjct: 136 FDTGSDITWTQCEPCVKTCYKQKEPRLN-------PSTSTSYKNISCSSALCKLVASGKK 188
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
C+ S C + YG+G G +TL + SS + +
Sbjct: 189 FSQSCSSSTCL---------------YQVQYGDGSYSIGFFATETLTL--SSSNVFKN-- 229
Query: 126 KFCFGC---VGSTYREPIGIAGFGRGALSVPSQLG-FLQKGFSHCFLAFKYANDPNISSP 181
F FGC + G+ G GR L++PSQ +K FS+C P SS
Sbjct: 230 -FLFGCGQQNNGLFGGAAGLLGLGRTKLALPSQTAKTYKKLFSYCL--------PASSSS 280
Query: 182 LVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGG 241
+ +++FTP+ +Y + + +++G LS+ E S + G
Sbjct: 281 KGYLSLGGQVSKSVKFTPLSADFDSTPFYGLDITGLSVGGR-----KLSIDE--SAFSAG 333
Query: 242 LLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDL 301
++DSGT T L YS+L S Q+ +T YP + FD CY + +
Sbjct: 334 TVIDSGTVITRLSPTAYSELSSAFQNLMTDYP---STSGYSIFDTCYDF----SKYDTVR 386
Query: 302 FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQN 361
P + F V + + Y P N CL F DD + +FG+ QQ+
Sbjct: 387 IPKVGVTFKGGVEMDIDVSGILY----PVNGLKKVCLAFAGNDDDS--DTSIFGNVQQRT 440
Query: 362 VEVVYDLEKERIGFQPMDCA 381
+VVYD K R+GF P C+
Sbjct: 441 YQVVYDGAKGRVGFAPGGCS 460
>gi|413923782|gb|AFW63714.1| hypothetical protein ZEAMMB73_300584 [Zea mays]
Length = 458
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 109/380 (28%), Positives = 165/380 (43%), Gaps = 65/380 (17%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
MDTGSD++WV C C C ++++ S F PS SS+ S +C+S+ C + S
Sbjct: 139 MDTGSDVSWVQCK----PCSQC----HSEVDSLFDPSSSSTYSPFSCSSAPCAQLSQSQE 190
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
+GC + S C + YG+ TG + DTL + S+ +
Sbjct: 191 G------NGC-----MSSQC-----QYIVNYGDSSSTTGTYSSDTLTLGSSA------MT 228
Query: 126 KFCFGCV----GSTYREPIGIAGFGRGALSVPSQ-LGFLQKGFSHCFLAFKYANDPNISS 180
F FGC G + G+ G G GA S+ SQ G FS+C S
Sbjct: 229 DFQFGCSQSESGGFNDQTDGLMGLGGGAQSLASQTAGTFGTAFSYCL-----PPTSGSSG 283
Query: 181 PLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNG 240
L +G + TPML+S P YY + LE+I +G+ L +P S+ +
Sbjct: 284 FLTLG----TGSSGFVKTPMLRSTQIPTYYVVLLESIKVGSQQL-NLPTSVF------SA 332
Query: 241 GLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDD 300
G L+DSGT T LP YS L S ++ + YP A D C+ ++
Sbjct: 333 GSLMDSGTIITRLPPTAYSALSSAFKAGMQQYPPATPSGI---LDTCFDFSGQSSIS--- 386
Query: 301 LFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQ 360
P++T F ++ L + SS+++CL F +GD G+ G+ QQ+
Sbjct: 387 -IPTVTLVFSGGAAVDLAFDGIMLEI-----SSSIRCLAF--TPNGDDSSLGIIGNVQQR 438
Query: 361 NVEVVYDLEKERIGFQPMDC 380
EV+YD+ +GF+ C
Sbjct: 439 TFEVLYDVGGGAVGFKAGAC 458
>gi|224142009|ref|XP_002324353.1| predicted protein [Populus trichocarpa]
gi|222865787|gb|EEF02918.1| predicted protein [Populus trichocarpa]
Length = 412
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 102/380 (26%), Positives = 153/380 (40%), Gaps = 59/380 (15%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
DTGSD+TW C C + R N PS S+S +C+S+ C + S
Sbjct: 88 FDTGSDITWTQCEPCVKTCYKQKEPRLN-------PSTSTSYKNISCSSALCKLVASGKK 140
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
C+ S C + YG+G G +TL + SS + +
Sbjct: 141 FSQSCSSSTCL---------------YQVQYGDGSYSIGFFATETLTL--SSSNVFKN-- 181
Query: 126 KFCFGC---VGSTYREPIGIAGFGRGALSVPSQLG-FLQKGFSHCFLAFKYANDPNISSP 181
F FGC + G+ G GR L++PSQ +K FS+C P SS
Sbjct: 182 -FLFGCGQQNNGLFGGAAGLLGLGRTKLALPSQTAKTYKKLFSYCL--------PASSSS 232
Query: 182 LVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGG 241
+ +++FTP+ +Y + + +++G L S+ E S + G
Sbjct: 233 KGYLSLGGQVSKSVKFTPLSADFDSTPFYGLDITGLSVGGRQL-----SIDE--SAFSAG 285
Query: 242 LLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDL 301
++DSGT T L YS+L S Q+ +T YP + FD CY + +
Sbjct: 286 TVIDSGTVITRLSPTAYSELSSAFQNLMTDYP---STSGYSIFDTCYDF----SKYDTVR 338
Query: 302 FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQN 361
P + F V + + Y P N CL F DD + +FG+ QQ+
Sbjct: 339 IPKVGVTFKGGVEMDIDVSGILY----PVNGLKKVCLAFAGNDDDS--DTSIFGNVQQRT 392
Query: 362 VEVVYDLEKERIGFQPMDCA 381
+VVYD K R+GF P C+
Sbjct: 393 YQVVYDGAKGRVGFAPGGCS 412
>gi|20466302|gb|AAM20468.1| putative aspartyl protease [Arabidopsis thaliana]
gi|23198124|gb|AAN15589.1| putative aspartyl protease [Arabidopsis thaliana]
Length = 320
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 88/306 (28%), Positives = 138/306 (45%), Gaps = 55/306 (17%)
Query: 96 YGEGGLVTGILTRDTLKVH--------GSSPGIIREIPKFCFGC-------VGSTYREPI 140
YG+G G L +D + + GS+ G I FGC +G +
Sbjct: 2 YGDGSSTNGYLVKDVVHLDLVTGNRQTGSTNGTI------IFGCGSKQSGQLGESQAAVD 55
Query: 141 GIAGFGRGALSVPSQL---GFLQKGFSHCFLAFKYANDPNISSPLVIGDVAISSKDNLQF 197
GI GFG+ S SQL G +++ F+HC ++ N IG+V +S K ++
Sbjct: 56 GIMGFGQSNSSFISQLASQGKVKRSFAHCL------DNNNGGGIFAIGEV-VSPK--VKT 106
Query: 198 TPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPF 257
TPML + Y + L AI +GNS L LS FDS + G+++DSGTT +LP+
Sbjct: 107 TPMLSKSAH---YSVNLNAIEVGNSVL---ELSSNAFDSGDDKGVIIDSGTTLVYLPDAV 160
Query: 258 YSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVL 317
Y+ LL+ + +P E T + C + T D FP++TF F +VSL +
Sbjct: 161 YNPLLN---EILASHP------ELTLHTVQESFTCFHYTDKLDRFPTVTFQFDKSVSLAV 211
Query: 318 PQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSG--VFGSFQQQNVEVVYDLEKERIGF 375
+ + + + C +Q+ G + + G N VVYD+E + IG+
Sbjct: 212 YPREYLFQVREDT-----WCFGWQNGGLQTKGGASLTILGDMALSNKLVVYDIENQVIGW 266
Query: 376 QPMDCA 381
+C+
Sbjct: 267 TNHNCS 272
>gi|242041431|ref|XP_002468110.1| hypothetical protein SORBIDRAFT_01g039750 [Sorghum bicolor]
gi|241921964|gb|EER95108.1| hypothetical protein SORBIDRAFT_01g039750 [Sorghum bicolor]
Length = 467
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 108/421 (25%), Positives = 172/421 (40%), Gaps = 78/421 (18%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + +DTGS+L+W+ C N + + P + ++ + ASS H
Sbjct: 72 VTMVLDTGSELSWLLC--------------NGSRVPSTPPQPQAPAAFNGSASSTYAAAH 117
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPS----FAYTYGEGGLVTGILTRDTLKVHGSS 117
S +P C G L + C P PS + +Y + G+L DT + G+
Sbjct: 118 CSSSP--ECQWRGRDLP--VPPFCAGP-PSNSCRVSLSYADASSADGVLAADTFLLGGAP 172
Query: 118 PGIIREIPKFCFGCVGS--------------------TYREPIGIAGFGRGALSVPSQLG 157
P +R + FGC+ S + G+ G RG+LS +Q G
Sbjct: 173 P--VRAL----FGCITSYSSSSTADGNGNGNDASATNSSEAATGLLGMNRGSLSFVTQTG 226
Query: 158 FLQKGFSHCFLAFKYANDPNISSPLVIGD----VAISSKDNLQFTPMLK-SPMYPNY--- 209
L+ F++C + P + LV+G A+S+ L +TP+++ S P +
Sbjct: 227 TLR--FAYCI---APGDGPGL---LVLGGDGDGAALSAAPQLNYTPLIEMSQPLPYFDRV 278
Query: 210 -YYIGLEAITIGNSSLTEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLS--ILQ 266
Y + LE I +G ++L +P S+ D G G +VDSGT +T L Y+ L + Q
Sbjct: 279 AYSVQLEGIRVG-AALLPIPKSVLAPDHTGAGQTMVDSGTQFTFLLADAYAPLKGEFLNQ 337
Query: 267 STITYYPRAK-EVEERTGFDLCYRVPCPN--NTFTDDLFPSITFHFLNNVSLVLPQGNHF 323
++ P + + + FD C+R L P + L + +
Sbjct: 338 TSALLAPLGEPDFVFQGAFDACFRASEARVAAATASQLLPEVGL-VLRGAEVAVGGEKLL 396
Query: 324 YAM----SAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMD 379
Y + S AV CL F + D + V G QQNV V YDL+ R+GF P
Sbjct: 397 YMVPGERRGEGGSEAVWCLTFGNSDMAGMS-AYVIGHHHQQNVWVEYDLQNSRVGFAPAR 455
Query: 380 C 380
C
Sbjct: 456 C 456
>gi|357521081|ref|XP_003630829.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355524851|gb|AET05305.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 526
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 98/383 (25%), Positives = 156/383 (40%), Gaps = 61/383 (15%)
Query: 3 QVYM--DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNI 60
+ YM D +D TW+ C C+ C D + S F PS+SSS + +C + C +
Sbjct: 199 KFYMIFDLQTDFTWLQCQ----PCIKCYDQPD----SIFDPSQSSSYTLLSCETKHCNLL 250
Query: 61 HSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
P C+ G CR + TY +G G+L +T+ S G
Sbjct: 251 -----PNSSCSDDGY----------CR----YNITYKDGTNTEGVLINETVSFESS--GW 289
Query: 121 IREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFK--YANDP-N 177
+ + C + G G GRG+LS PS++ S+C + K Y++
Sbjct: 290 VDRVSLGCSNKNQGPFVGSDGTFGLGRGSLSFPSRIN--ASSMSYCLVESKDGYSSSTLE 347
Query: 178 ISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
+SP G V +L++P N YY+GL+ I +G + +VP S D
Sbjct: 348 FNSPPCSGSVK---------AKLLQNPKAENLYYVGLKGIKVGGEKI-DVPNSTFTIDPY 397
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTF 297
GNGG++V S + T L Y+ + + + R K + FD CY + NNT
Sbjct: 398 GNGGMIVSSSSLITMLENDTYNVVRDAFVAKTQHLERLKAFLQ---FDTCYNLS-SNNTV 453
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSF 357
P + F + S +LP+ ++ YA+ + + C F G + G+
Sbjct: 454 E---LPILEFEVNDGKSWLLPKESYLYAV----DKNGTFCFAFAPSK----GSFSILGTL 502
Query: 358 QQQNVEVVYDLEKERIGFQPMDC 380
QQ V +DL + + C
Sbjct: 503 QQYGTRVTFDLVNSFVYLHTLCC 525
>gi|357166728|ref|XP_003580821.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 479
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 100/389 (25%), Positives = 160/389 (41%), Gaps = 57/389 (14%)
Query: 3 QVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHS 62
+ DTGSD+ WV C C DC Y + F P+ S+S S C S C
Sbjct: 137 HLVADTGSDVIWVQCS----PCSDC--YAQGDPL--FDPANSASFSPVPCNSGVC----- 183
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIR 122
+ + + +YG+ G+L +TL + G +
Sbjct: 184 ----------RAAARYSSSSCGGGGGECEYKVSYGDKSYTNGVLALETLTLDGGT----- 228
Query: 123 EIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQL-GFLQKGFSHCFLAFKYANDPNI 178
E+ GC + E G+ G G G +S+ QL G FS+C LA Y+ + +
Sbjct: 229 EVQGVAMGCGHENRGLFAEAAGLLGLGWGPMSLVGQLGGAAGGAFSYC-LAGYYSGEGSG 287
Query: 179 SSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQG 238
S LV+G ++ + P++++P P++YY+G+ + + L ++ L + G
Sbjct: 288 SGSLVLGR-EDAAPTGAVWVPLVRNPDAPSFYYVGVNGLGVAGERL-QLQDGLFDLGDDG 345
Query: 239 NGGLLVDSGTTYTHLPEPFYSQLLSILQSTITY-YPRAKEVEERTGFDLCYRVPCPNNTF 297
GG+++D+GT T LP Y+ L PRA V FD CY + + +
Sbjct: 346 GGGVVMDTGTAVTRLPAEAYAALRGAFAGAFEEGAPRAPGVSL---FDTCYDL----SGY 398
Query: 298 TDDLFPSITFHF------LNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPS 351
P++ +F SL LP N + P + CL F ++ GPS
Sbjct: 399 ASVRVPTVALYFGGGGQGQEAASLTLPARN----LLVPVDDGGTYCLAFAAVAS---GPS 451
Query: 352 GVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
+ G+ QQQ +E+ D +GF P C
Sbjct: 452 -ILGNIQQQGIEITVDSASGYVGFGPATC 479
>gi|195626958|gb|ACG35309.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 450
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 101/381 (26%), Positives = 157/381 (41%), Gaps = 64/381 (16%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DT +D +W+PC C C + F P+ S+S C S C +
Sbjct: 129 VDTSNDASWIPCAG----CAGCP----TSSAAPFDPAASASYRTVPCGSPLC-----AQA 175
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
P C G + C F+ TY + L L++D+L V G++ +
Sbjct: 176 PNAACPPGG------------KAC-GFSLTYADSSL-QAALSQDSLAVAGNA------VK 215
Query: 126 KFCFGCV---GSTYREPIGIAGFGRGALSVPSQLG-FLQKGFSHCFLAFKYANDPNISSP 181
+ FGC+ T P G+ G GRG LS SQ + FS+C +FK N S
Sbjct: 216 AYTFGCLQRATGTAAPPQGLLGLGRGPLSFLSQTKDMYEATFSYCLPSFKSL---NFSGT 272
Query: 182 LVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGG 241
L +G ++ TP+L +P + YY+ + + +G + +P FD G
Sbjct: 273 LRLGRNG--QPQRIKTTPLLANPHRSSLYYVNMTGVRVGR-KVVPIP----AFDPATGAG 325
Query: 242 LLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDL 301
++DSGT +T L P Y + ++ + V GFD C+ NT T
Sbjct: 326 TVLDSGTMFTRLVAPAYVAVRDEVRRRV-----GAPVSSLGGFDTCF------NT-TAVA 373
Query: 302 FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQN 361
+P +T F + + + LP+ N S + CL + DG V S QQQN
Sbjct: 374 WPPMTLLF-DGMQVTLPEENVVIH----STYGTISCLAMAAAPDGVNTVLNVIASMQQQN 428
Query: 362 VEVVYDLEKERIGFQPMDCAS 382
V++D+ R+GF C +
Sbjct: 429 HRVLFDVPNGRVGFARERCTA 449
>gi|219362525|ref|NP_001136612.1| uncharacterized protein LOC100216735 [Zea mays]
gi|194696366|gb|ACF82267.1| unknown [Zea mays]
gi|413953802|gb|AFW86451.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
Length = 411
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 100/382 (26%), Positives = 152/382 (39%), Gaps = 69/382 (18%)
Query: 4 VYMDTGSDLTWVPCGNLSF-DCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHS 62
V +DTGSD++W+ C S C D + PS SS+ S CAS C + +
Sbjct: 94 VVIDTGSDVSWLQCKPCSSGQCFPQKD-------PLYDPSHSSTYSAVPCASDVCKKLAA 146
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIR 122
SGC T + C FA +Y +G G ++D L + +PG I
Sbjct: 147 D------AYGSGC--------TSGKQC-GFAISYADGTSTVGAYSQDKLTL---APGAI- 187
Query: 123 EIPKFCFGCVGSTYREP---IGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNIS 179
+ F FGC + G+ G GR S+ ++ G + FS+C P++S
Sbjct: 188 -VQNFYFGCGHGKHAVRGLFDGVLGLGRLRESLGARYGGV---FSYCL--------PSVS 235
Query: 180 S-PLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQG 238
S P + A + FTPM P P + + L I +G L P +
Sbjct: 236 SKPGFLALGAGKNPSGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLRPSAF------- 288
Query: 239 NGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFT 298
+GG++VDSGT T L Y L S + + Y + D CY + +
Sbjct: 289 SGGMIVDSGTVITGLQSTAYRALRSAFRKAMEAY----RLLPNGDLDTCYNL----TGYK 340
Query: 299 DDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQ 358
+ + P I F ++ L P+ CL F + G G +GV G+
Sbjct: 341 NVVVPKIALTFTGGATINL---------DVPNGILVNGCLAFA--ESGPDGSAGVLGNVN 389
Query: 359 QQNVEVVYDLEKERIGFQPMDC 380
Q+ EV++D + GF+ C
Sbjct: 390 QRAFEVLFDTSTSKFGFRAKAC 411
>gi|297740344|emb|CBI30526.3| unnamed protein product [Vitis vinifera]
Length = 379
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 78/248 (31%), Positives = 113/248 (45%), Gaps = 20/248 (8%)
Query: 141 GIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSPLVIGDVAISSKDNLQFTPM 200
G+ G RG+LS SQ+ F FS+C +D + S L++GD S L +TP+
Sbjct: 133 GLMGMNRGSLSFVSQMDF--PKFSYCI------SDSDFSGVLLLGDANFSWLMPLNYTPL 184
Query: 201 LK-SPMYPNY----YYIGLEAITIGNSSLTEVPLSLREFDSQGNGGLLVDSGTTYTHLPE 255
++ S P + Y + LE I + +S L +P S+ D G G +VDSGT +T L
Sbjct: 185 IQISTPLPYFDRVAYTVQLEGIKV-SSKLLPLPKSVFVPDHTGAGQTMVDSGTQFTFLLG 243
Query: 256 PFYSQLLSILQSTITYYPRAKEVEE---RTGFDLCYRVPCPNNTFTDDLFPSITFHFLNN 312
P YS L + + + R E + G DLCYRVP + P+++ F
Sbjct: 244 PVYSALRNEFLNQTSQILRVLEDPNYVFQGGMDLCYRVPLSQTSL--PWLPTVSLMFRGA 301
Query: 313 VSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQNVEVVYDLEKER 372
V + S +V C F + D + V G QQNV + +DLEK R
Sbjct: 302 EMKVSGDRLLYRVPGEVRGSDSVYCFTFGNSDLLAV-EAYVIGHHHQQNVWMEFDLEKSR 360
Query: 373 IGFQPMDC 380
IGF + C
Sbjct: 361 IGFAQVQC 368
>gi|357127503|ref|XP_003565419.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
distachyon]
Length = 486
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 110/405 (27%), Positives = 161/405 (39%), Gaps = 72/405 (17%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ DTGSDL WV C D D+ F PS SS+ R C + C +
Sbjct: 123 VLAIADTGSDLVWVKCKG-----KDNDNNSTAPPSVYFVPSASSTYGRVGCDTKACRALS 177
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLK---VHGSSP 118
S+ + C+ G +C + Y+YG+G +G L+ +T + SS
Sbjct: 178 SAAS----CSPDG---------SC-----EYLYSYGDGSRASGQLSTETFTFSTIADSSK 219
Query: 119 GIIR-------------EIPKFCFGC---VGSTYREPIGIAGFGRGALSVPSQLGF---L 159
EI K FGC T+R + G +S+ SQLG L
Sbjct: 220 TNSHGNNNNNSSSHGQVEIAKLDFGCSTTTTGTFRADGLVGLGGG-PVSLASQLGATTSL 278
Query: 160 QKGFSHCFLAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITI 219
+ FS+C YAN N SS L G A+ S+ TP++ + YY I L++I +
Sbjct: 279 GRKFSYCLA--PYAN-TNASSALNFGSRAVVSEPGAASTPLITGEV-ETYYTIALDSINV 334
Query: 220 GNSSLTEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVE 279
+ + + ++VDSGTT T+L + L+ L I PRA+ E
Sbjct: 335 AGT---------KRPTTAAQAHIIVDSGTTLTYLDSALLTPLVKDLTRRIKL-PRAESPE 384
Query: 280 ERTGFDLCYRVPCPNNTFTDDL-FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCL 338
+ DLCY + D L P +T + L N F + V CL
Sbjct: 385 KI--LDLCYDISGVRGE--DALGIPDVTLVLGGGGEVTLKPDNTFVVVQ-----EGVLCL 435
Query: 339 LFQSMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCAST 383
+ + + G+ QQN+ V YDLEK + F DCA +
Sbjct: 436 ALVATSERQ--SVSILGNIAQQNLHVGYDLEKGTVTFAAADCAKS 478
>gi|356553263|ref|XP_003544977.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 445
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 105/391 (26%), Positives = 162/391 (41%), Gaps = 63/391 (16%)
Query: 3 QVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHS 62
Q+ +DTGS L+W+ C N ++F PS SSS C C
Sbjct: 102 QMVLDTGSQLSWIQCHN------------KTPPTASFDPSLSSSFYVLPCTHPLC----- 144
Query: 63 SDNPFDPCTMSGCSLSTLLKSTC--CRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
P P L +TC R C ++Y Y +G G L R+ L S
Sbjct: 145 --KPRVP--------DFTLPTTCDQNRLC-HYSYFYADGTYAEGNLVREKLAFSPS---- 189
Query: 121 IREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNI-S 179
+ P GC S R+ GI G G LS P Q + FS+C + AN+ N +
Sbjct: 190 -QTTPPLILGC-SSESRDARGILGMNLGRLSFPFQAKVTK--FSYCVPTRQPANNNNFPT 245
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSP-------MYPNYYYIGLEAITIGNSSLTEVPLSLR 232
+G+ S++ ++ ML P + P Y + ++ I IG L +P S+
Sbjct: 246 GSFYLGNNPNSAR--FRYVSMLTFPQSQRMPNLDPLAYTVPMQGIRIGGRKL-NIPPSVF 302
Query: 233 EFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGF-DLCYRVP 291
++ G+G +VDSG+ +T L + Y ++ + + PR K+ G D+C+
Sbjct: 303 RPNAGGSGQTMVDSGSEFTFLVDVAYDRVREEIIRVLG--PRVKKGYVYGGVADMCFDG- 359
Query: 292 CPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCL-LFQSMDDGDYGP 350
N L + F F V +V+P+ + V C+ + +S G
Sbjct: 360 --NAMEIGRLLGDVAFEFEKGVEIVVPKERVLADVGG-----GVHCVGIGRSERLG--AA 410
Query: 351 SGVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
S + G+F QQN+ V +DL RIGF DC+
Sbjct: 411 SNIIGNFHQQNLWVEFDLANRRIGFGVADCS 441
>gi|296086208|emb|CBI31649.3| unnamed protein product [Vitis vinifera]
Length = 761
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 80/271 (29%), Positives = 126/271 (46%), Gaps = 28/271 (10%)
Query: 131 CVGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSPLVIGDVAIS 190
C T+ + G+ G RG+LS +Q+G LQK FS+C + + S L+ G+ + S
Sbjct: 431 CRTRTHSKTTGLIGMNRGSLSFVTQMG-LQK-FSYCI------SGQDSSGILLFGESSFS 482
Query: 191 SKDNLQFTPMLK-SPMYPNY----YYIGLEAITIGNSSLTEVPLSLREFDSQGNGGLLVD 245
L++TP+++ S P + Y + LE I + NS L ++P S+ D G G +VD
Sbjct: 483 WLKALKYTPLVQISTPLPYFDRVAYTVQLEGIKVANSML-QLPKSVYAPDHTGAGQTMVD 541
Query: 246 SGTTYTHLPEPFYSQLLS--ILQSTITYYPRAKEVEE-----RTGFDLCYRVPCPNNTFT 298
SGT +T L P Y+ L + + Q+ + K +E+ + DLCYRVP T
Sbjct: 542 SGTQFTFLLGPVYTALKNEFVRQTKASL----KVLEDPNFVFQGAMDLCYRVPLTRRTLP 597
Query: 299 DDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQ 358
P++T F V + + S +V C F + + S + G
Sbjct: 598 P--LPTVTLMFRGAEMSVSAERLMYRVPGVIRGSDSVYCFTFGNSELLGV-ESYIIGHHH 654
Query: 359 QQNVEVVYDLEKERIGFQPMDCASTASAQGL 389
QQNV + +DL K R+GF + C G+
Sbjct: 655 QQNVWMEFDLAKSRVGFAEVRCDLAGQRLGV 685
>gi|24417232|gb|AAN60226.1| unknown [Arabidopsis thaliana]
Length = 464
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 103/383 (26%), Positives = 161/383 (42%), Gaps = 67/383 (17%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + DTGSDLTW C C + + F+PS SS+ +C+S C +
Sbjct: 145 LSLVFDTGSDLTWTQCEPCLGSCYSQKEPK-------FNPSSSSTYQNVSCSSPMCEDAE 197
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
S C+ S C ++ YG+ G L ++ + S ++
Sbjct: 198 S-------CSASNCV---------------YSIGYGDKSFTQGFLAKEKFTLTNSD--VL 233
Query: 122 REIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPN 177
++ FGC + + G+ G G G LS+P+Q FS+C +F N
Sbjct: 234 EDVY---FGCGENNQGLFDGVAGLLGLGPGKLSLPAQTTTTYNNIFSYCLPSFT----SN 286
Query: 178 ISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
+ L G IS ++++FTP+ P NY I + I++G+ L P +S
Sbjct: 287 STGHLTFGSAGIS--ESVKFTPISSFPSAFNYG-IDIIGISVGDKELAITP------NSF 337
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTF 297
G ++DSGT +T LP Y++L S+ + ++ Y K FD CY +
Sbjct: 338 STEGAIIDSGTVFTRLPTKVYAELRSVFKEKMSSY---KSTSGYGLFDTCYDFTGLDTV- 393
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSF 357
+P+I F F + L +S P S V CL F DD +FG+
Sbjct: 394 ---TYPTIAFSFAGGTVVELDGS----GISLPIKISQV-CLAFAGNDD----LPAIFGNV 441
Query: 358 QQQNVEVVYDLEKERIGFQPMDC 380
QQ ++VVYD+ R+GF P C
Sbjct: 442 QQTTLDVVYDVAGGRVGFAPNGC 464
>gi|218184943|gb|EEC67370.1| hypothetical protein OsI_34481 [Oryza sativa Indica Group]
Length = 367
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 92/388 (23%), Positives = 162/388 (41%), Gaps = 72/388 (18%)
Query: 5 YMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSD 64
++D +L W C C+ C ++ + + F P+ SS+ + C + C +I +
Sbjct: 40 FIDLTGELVWTQCSQ----CIHC--FKQD--LPVFVPNASSTFKPEPCGTDVCKSIPTPK 91
Query: 65 NPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREI 124
D C G + G GG GI+ DT + ++P
Sbjct: 92 CASDVCAFDGVT--------------------GLGGHTVGIVATDTFAIGTAAPA----- 126
Query: 125 PKFCFGCVGS----TYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISS 180
FGCV + T P G G GR S+ +Q+ + FS+C +D +S
Sbjct: 127 -SLGFGCVVASDIDTMGGPSGFIGLGRTPWSLVAQMKLTR--FSYCL----APHDTGKNS 179
Query: 181 PLVIGDVAISSKDNLQFTPMLKSPMYPN-----YYYIGLEAITIGNSSLTEVPLSLREFD 235
L +G A + +TP +K+ PN YY I LE I G++++T
Sbjct: 180 RLFLGASAKLAGGG-AWTPFVKT--SPNDGMSQYYPIELEEIKAGDATITM--------- 227
Query: 236 SQGNGGLLVDSGTTYTHL-PEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPN 294
+G +LV + L + Y + + +++ P A V E F++C+ P
Sbjct: 228 PRGRNTVLVQTAVVRVSLLVDSVYQEFKKAVMASVGAAPTATPVGEP--FEVCF----PK 281
Query: 295 NTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVF 354
+ P + F F +L +P N+ + + + +V + ++ D +
Sbjct: 282 AGVSGA--PDLVFTFQAGAALTVPPANYLFDVGNDTVCLSVMSIALLNITALDG--LNIL 337
Query: 355 GSFQQQNVEVVYDLEKERIGFQPMDCAS 382
GSFQQ+NV +++DL+K+ + F+P DC+S
Sbjct: 338 GSFQQENVHLLFDLDKDMLSFEPADCSS 365
>gi|255548660|ref|XP_002515386.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545330|gb|EEF46835.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 387
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 112/389 (28%), Positives = 165/389 (42%), Gaps = 70/389 (17%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + +DTGSD+TW C C YR + + F P R SSS ++ SS I
Sbjct: 58 LSLALDTGSDITWTQCEPCVGSC-----YR--QAQTKFDP-RKSSSYKNVSCSSSSCRII 109
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
+ D GC + STC + YG+G G + L + SP +
Sbjct: 110 T-----DSGGARGC-----VSSTCI-----YKVQYGDGSYSVGFFATEKLTI---SPSDV 151
Query: 122 REIPKFCFGC-------VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYAN 174
I F FGC G G AL + L F++C +F ++
Sbjct: 152 --ISNFLFGCGQQNAGRFGRIAGLLGLGRGKLSLALQTSEKYNNL---FTYCLPSFSSSS 206
Query: 175 DPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPN--YYYIGLEAITIGNSSLTEVPLSLR 232
+++ + G V S K FTP+ SP + N +Y I ++ +++G L P+
Sbjct: 207 TGHLT---LGGQVPKSVK----FTPL--SPAFKNTPFYGIDIKGLSVGGHVL---PIDAS 254
Query: 233 EFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPC 292
F N G ++DSGT T L YS L S Q + YP+ + + D CY
Sbjct: 255 VFS---NAGAIIDSGTVITRLQPTVYSALSSKFQQLMKDYPK---TDGFSILDTCYDF-S 307
Query: 293 PNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQ-SMDDGDYGPS 351
N + + P I+F F V + + F+ + N+ CL F + DDGD+
Sbjct: 308 GNESIS---VPRISFFFKGGVEVDI----KFFGILTVINAWDKVCLAFAPNDDDGDFV-- 358
Query: 352 GVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
VFG+ QQQ +VV+DL K RIGF P C
Sbjct: 359 -VFGNSQQQTYDVVHDLAKGRIGFAPSGC 386
>gi|356531884|ref|XP_003534506.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 482
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 107/405 (26%), Positives = 166/405 (40%), Gaps = 80/405 (19%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKL-MSNFSPSRSSSSSRDTCASSFCLNIHS 62
V +DTGSD WV C C C + ++ + P+ S +S C FC + +
Sbjct: 90 VQVDTGSDTLWVNC----VGCTTCPKKSGLGMELTLYDPNSSKTSKVVPCDDEFCTSTY- 144
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIR 122
D P +SGC CP ++ TYG+G +G +D L G +R
Sbjct: 145 -DGP-----ISGCKKD--------MSCP-YSITYGDGSTTSGSYIKDDLTFD-RVVGDLR 188
Query: 123 EIPK---FCFGC-------VGSTYREPI-GIAGFGRGALSVPSQL---GFLQKGFSHCFL 168
+P FGC + ST + GI GFG+ SV SQL G +++ FSHC
Sbjct: 189 TVPDNTSVIFGCGSKQSGTLSSTTDTSLDGIIGFGQANSSVLSQLAAAGKVKRVFSHCL- 247
Query: 169 AFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKS-PMYPN--YYYIGLEAITIGNSSLT 225
+ N IG+V P +K+ P+ P +Y + L+ I + +
Sbjct: 248 -----DTVNGGGIFAIGEVV---------QPKVKTTPLVPRMAHYNVVLKDIEVAGDPI- 292
Query: 226 EVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFD 285
++P + FDS G ++DSGTT +LP Y QLL K + +R+G +
Sbjct: 293 QLPTDI--FDSTSGRGTIIDSGTTLAYLPVSIYDQLLE------------KTLAQRSGME 338
Query: 286 LCYRVPCPNNTF-------TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCL 338
L Y V F DD FP++ F F ++L ++ + +
Sbjct: 339 L-YLVEDQFTCFHYSDEKSLDDAFPTVKFTFEEGLTLTAYPHDYLFPFKEDMWCIGWQKS 397
Query: 339 LFQSMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCAST 383
Q+ D D + G N +YDL+ IG+ +C+S+
Sbjct: 398 TAQTKDGKDL---ILLGDLVLTNKLFIYDLDNMSIGWTDYNCSSS 439
>gi|194696934|gb|ACF82551.1| unknown [Zea mays]
gi|413936470|gb|AFW71021.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
gi|413953801|gb|AFW86450.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
Length = 445
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 100/382 (26%), Positives = 152/382 (39%), Gaps = 69/382 (18%)
Query: 4 VYMDTGSDLTWVPCGNLSF-DCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHS 62
V +DTGSD++W+ C S C D + PS SS+ S CAS C + +
Sbjct: 128 VVIDTGSDVSWLQCKPCSSGQCFPQKD-------PLYDPSHSSTYSAVPCASDVCKKLAA 180
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIR 122
SGC T + C FA +Y +G G ++D L + +PG I
Sbjct: 181 D------AYGSGC--------TSGKQC-GFAISYADGTSTVGAYSQDKLTL---APGAI- 221
Query: 123 EIPKFCFGCVGSTYREP---IGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNIS 179
+ F FGC + G+ G GR S+ ++ G + FS+C P++S
Sbjct: 222 -VQNFYFGCGHGKHAVRGLFDGVLGLGRLRESLGARYGGV---FSYCL--------PSVS 269
Query: 180 S-PLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQG 238
S P + A + FTPM P P + + L I +G L P +
Sbjct: 270 SKPGFLALGAGKNPSGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLRPSAF------- 322
Query: 239 NGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFT 298
+GG++VDSGT T L Y L S + + Y + D CY + +
Sbjct: 323 SGGMIVDSGTVITGLQSTAYRALRSAFRKAMEAY----RLLPNGDLDTCYNL----TGYK 374
Query: 299 DDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQ 358
+ + P I F ++ L P+ CL F + G G +GV G+
Sbjct: 375 NVVVPKIALTFTGGATINL---------DVPNGILVNGCLAFA--ESGPDGSAGVLGNVN 423
Query: 359 QQNVEVVYDLEKERIGFQPMDC 380
Q+ EV++D + GF+ C
Sbjct: 424 QRAFEVLFDTSTSKFGFRAKAC 445
>gi|297805036|ref|XP_002870402.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316238|gb|EFH46661.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 435
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 109/385 (28%), Positives = 170/385 (44%), Gaps = 63/385 (16%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
I DTGS+L W C C DC ++ F P SS+ +C+SS C +
Sbjct: 107 IMAVADTGSNLIWTQCK----PCDDC----YTQVDPLFDPKASSTYKDVSCSSSQCTALE 158
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
+ + CS TC S+ +Y +G G DTL + GS+
Sbjct: 159 N---------QASCSTE---DKTC-----SYLVSYADGSYTMGKFAVDTLTL-GSTDNRP 200
Query: 122 REIPKFCFGC---VGSTYR-EPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDP 176
++ GC T+R + G+ G G GA+S+ QLG G FS+C + ND
Sbjct: 201 VQLKNIIIGCGQNNAVTFRNKSSGVVGLGGGAVSLIKQLGDSIDGKFSYCLVP---ENDQ 257
Query: 177 NISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
+S + G A+ S TP++ +YY+ L++I++G+ ++ + P DS
Sbjct: 258 --TSKINFGTNAVVSGPGTVSTPLVVKSR-DTFYYLTLKSISVGSKNM-QTP------DS 307
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNT 296
G +++DSGTT T LP +Y ++ + + S I K +ER G LCY N
Sbjct: 308 NIKGNMVIDSGTTLTLLPVKYYIEIENAVASLIN---ADKSKDERIGSSLCY------NA 358
Query: 297 FTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGS 356
D P IT HF + L N F+ ++ + CL F + +G++G+
Sbjct: 359 TADLNIPVITMHF-EGADVKLYPYNSFFKVT-----EDLVCLAFGM----SFYRNGIYGN 408
Query: 357 FQQQNVEVVYDLEKERIGFQPMDCA 381
Q+N V YD + + F+P DCA
Sbjct: 409 VAQKNFLVGYDTASKTMSFKPTDCA 433
>gi|296089645|emb|CBI39464.3| unnamed protein product [Vitis vinifera]
Length = 477
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 105/395 (26%), Positives = 162/395 (41%), Gaps = 71/395 (17%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKL-MSNFSPSRSSSSSRDTCASSFCLNIHS 62
V +DTGSD+ WV C C +C + + ++ + S + +C FC I+
Sbjct: 113 VQVDTGSDIMWVNC----IQCNECPKKSSLGMELTLYDIKESLTGKLVSCDQDFCYAING 168
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI-- 120
+ MS CS + + Y +G G RD ++ S +
Sbjct: 169 GPPSYCIANMS-CSYTEI---------------YADGSSSFGYFVRDIVQYDQVSGDLET 212
Query: 121 IREIPKFCFGCVG------STYREPIGIAGFGRGALSVPSQL---GFLQKGFSHCFLAFK 171
FGC S+ GI GFG+ S+ SQL G ++K F+HC
Sbjct: 213 TSANGSVIFGCSATQSGDLSSEEALDGILGFGKSNTSMISQLASSGKVRKMFAHCL---- 268
Query: 172 YANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPN--YYYIGLEAITIGNSSLTEVPL 229
+ N IG + + K N +P+ PN +Y + ++A+ +G L L
Sbjct: 269 --DGLNGGGIFAIGHI-VQPKVN-------TTPLVPNQTHYNVNMKAVEVGGYFLN---L 315
Query: 230 SLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFD--LC 287
FD G ++DSGTT +LPE Y QLLS + + +++ T D C
Sbjct: 316 PTDVFDVGDKKGTIIDSGTTLAYLPEVVYDQLLSKI------FSWQSDLKVHTIHDQFTC 369
Query: 288 YRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQS--MDD 345
++ + DD FP++TFHF N SL L H Y S + C+ +Q+ M
Sbjct: 370 FQY----SESLDDGFPAVTFHFEN--SLYLKVHPHEYLFSY----DGLWCIGWQNSGMQS 419
Query: 346 GDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
D + G N V+YDLE + IG+ +C
Sbjct: 420 RDRRNITLLGDLALSNKLVLYDLENQVIGWTEYNC 454
>gi|225455876|ref|XP_002275164.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 496
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 100/383 (26%), Positives = 156/383 (40%), Gaps = 67/383 (17%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DTGSD+ W+ C C DC ++ F P+ SSS SR C + C N+
Sbjct: 175 MVIDTGSDVNWLQCK----PCDDC----YQQVDPIFDPASSSSFSRLGCQTPQCRNLDVF 226
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
D C + +YG+G G +T+ S
Sbjct: 227 ACRNDSCL--------------------YQVSYGDGSYTVGDFATETVSFGNSG-----S 261
Query: 124 IPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISS 180
+ K GC + G+ G G G LS+ SQ+ FS+C + N ++ S
Sbjct: 262 VDKVAIGCGHDNEGLFVGAAGLIGLGGGPLSLTSQIK--ASSFSYCLV-----NRDSVDS 314
Query: 181 PLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNG 240
+ + A S D++ P+ K+ +YY+G+ +++G L +P S+ E D G G
Sbjct: 315 STLEFNSAKPS-DSVT-APIFKNSKVDTFYYVGITGMSVGGEKLA-IPPSIFEVDGSGKG 371
Query: 241 GLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTG---FDLCYRVPCPNNTF 297
G++VD GT T L Y+ L T+ K++ +G FD CY + ++
Sbjct: 372 GIIVDCGTAVTRLQTQAYNALRD------TFVKLTKDLPSTSGFALFDTCYNL----SSR 421
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSF 357
T P++ F F SL LP N+ P +S+ CL F + G+
Sbjct: 422 TSVRVPTVAFLFDGGKSLPLPPSNYLI----PVDSAGTFCLAFAPT----TASLSIIGNV 473
Query: 358 QQQNVEVVYDLEKERIGFQPMDC 380
QQQ V YDL ++ F C
Sbjct: 474 QQQGTRVTYDLANSQVSFSSRKC 496
>gi|359476756|ref|XP_002277082.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 2 [Vitis
vinifera]
Length = 560
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 110/406 (27%), Positives = 175/406 (43%), Gaps = 79/406 (19%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKL-MSNFSPSRSSSSSRDTCASSFCLNIHS 62
V +DTGSD+ WV C C C + + ++ + S++S C +FC
Sbjct: 170 VQVDTGSDILWVNCAG----CDRCPTKSDLGVDLTLYDMKASTTSDAVGCDDNFC---SL 222
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIR 122
D P C L ++L YG+G TG +D ++ + S G +
Sbjct: 223 YDGPLPGCKPGLQCLYSVL--------------YGDGSSTTGYFVQDFVQYNRIS-GNFQ 267
Query: 123 EIPK---FCFGC-------VGSTYREPIGIAGFGRGALSVPSQL---GFLQKGFSHCFLA 169
P FGC +GS+ GI GFG+ S+ SQL G ++K FSHC
Sbjct: 268 TTPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCL-- 325
Query: 170 FKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPL 229
++ + IG+V + + TP++++ + Y + ++ I +G L +VP
Sbjct: 326 ----DNVDGGGIFAIGEVV---EPKVNITPLVQNQAH---YNVVMKEIEVGGDPL-DVPS 374
Query: 230 SLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYP--RAKEVEER-TGFDL 286
F+S G ++DSGTT + P+ Y + +++ ++ P R VE+ T FD
Sbjct: 375 D--AFESGDRKGTIIDSGTTLAYFPQEVY---VPLIEKILSQQPDLRLHTVEQAFTCFDY 429
Query: 287 CYRVPCPNNTFTDDLFPSITFHFLNNVSL-VLPQ----GNHFYAMSAPSNSSAVKCLLFQ 341
V DD FP++T HF ++SL V P + F NS A Q
Sbjct: 430 TGNV--------DDGFPTVTLHFDKSISLTVYPHEYLFQHEFEWCIGWQNSGA------Q 475
Query: 342 SMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASAQ 387
+ D D + G N VVYDLEK+ IG+ +C+S+ +
Sbjct: 476 TKDGKDL---TLLGDLVLSNKLVVYDLEKQGIGWVEYNCSSSIKVK 518
>gi|255634819|gb|ACU17770.1| unknown [Glycine max]
Length = 354
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 93/352 (26%), Positives = 157/352 (44%), Gaps = 57/352 (16%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNF-SPSRSSSSSRDTCASSFCLN-IH 61
V +DTGSD+ WV C + C C ++ NF P SS+SS C+ C N I
Sbjct: 40 VQIDTGSDVLWVSCNS----CSGCPQTSGLQIQLNFFDPGSSSTSSMIACSDQRCNNGIQ 95
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
SSD + CS + C S+ + YG+G +G D + ++ G +
Sbjct: 96 SSD--------ATCSSQ---NNQC-----SYTFQYGDGSGTSGYYVSDMMHLNTIFEGSV 139
Query: 122 --REIPKFCFGC-------VGSTYREPIGIAGFGRGALSVPSQL---GFLQKGFSHCFLA 169
FGC + + R GI GFG+ +SV SQL G + FSHC
Sbjct: 140 TTNSTAPVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRVFSHCL-- 197
Query: 170 FKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPL 229
D + LV+G++ + N+ +T ++ P P +Y + L++I + +L +
Sbjct: 198 ---KGDSSGGGILVLGEIV---EPNIVYTSLV--PAQP-HYNLNLQSIAVNGQTL---QI 245
Query: 230 SLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYR 289
F + + G +VDSGTT +L E Y +S + ++I P++ G + CY
Sbjct: 246 DSSVFATSNSRGTIVDSGTTLAYLAEEAYDPFVSAITASI---PQSVHTAVSRG-NQCYL 301
Query: 290 VPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQ 341
+ + ++FP ++ +F S++L ++ ++ +AV C+ FQ
Sbjct: 302 I----TSSVTEVFPQVSLNFAGGASMILRPQDYLIQQNSI-GGAAVWCIGFQ 348
>gi|18412482|ref|NP_565219.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|19699359|gb|AAL91289.1| At1g79720/F19K16_30 [Arabidopsis thaliana]
gi|26450464|dbj|BAC42346.1| unknown protein [Arabidopsis thaliana]
gi|115646741|gb|ABJ17101.1| At1g79720 [Arabidopsis thaliana]
gi|332198170|gb|AEE36291.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 484
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 96/385 (24%), Positives = 166/385 (43%), Gaps = 54/385 (14%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + +DTGSDLTWV C C C N+ + PS SSS C SS C ++
Sbjct: 146 MSLIVDTGSDLTWVQCQ----PCRSC----YNQQGPLYDPSVSSSYKTVFCNSSTCQDLV 197
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
++ + PC + + T PC + +YG+G G L +++ + +
Sbjct: 198 AATSNSGPCGGNNGVVKT--------PCE-YVVSYGDGSYTRGDLASESILLGDT----- 243
Query: 122 REIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQ-LGFLQKGFSHCFLAFKYANDPN 177
++ F FGC + + G+ G GR ++S+ SQ L FS+C + +
Sbjct: 244 -KLENFVFGCGRNNKGLFGGSSGLMGLGRSSVSLVSQTLKTFNGVFSYCLPSL----EDG 298
Query: 178 ISSPLVIGD--VAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFD 235
S L G+ ++ ++ +TP++++P ++Y + L +IG L
Sbjct: 299 ASGSLSFGNDSSVYTNSTSVSYTPLVQNPQLRSFYILNLTGASIGGVELK---------S 349
Query: 236 SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNN 295
S G+L+DSGT T LP Y + + +P A + D C+ +
Sbjct: 350 SSFGRGILIDSGTVITRLPPSIYKAVKIEFLKQFSGFPTAPGY---SILDTCFNL----T 402
Query: 296 TFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFG 355
++ D P I F N L + FY + +++ CL S+ + G+ G
Sbjct: 403 SYEDISIPIIKMIFQGNAELEVDVTGVFYFVKP---DASLVCLALASLSYEN--EVGIIG 457
Query: 356 SFQQQNVEVVYDLEKERIGFQPMDC 380
++QQ+N V+YD +ER+G +C
Sbjct: 458 NYQQKNQRVIYDTTQERLGIVGENC 482
>gi|4646203|gb|AAD26876.1|AC007230_10 Belongs to PF|00026 Eukaryotic aspartyl protease family
[Arabidopsis thaliana]
Length = 449
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 100/391 (25%), Positives = 165/391 (42%), Gaps = 75/391 (19%)
Query: 3 QVYMDTGSDLTWVPCGNLSFDCMDCDDYRN-NKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
V +DTGSD+ W+ C C C N N +S F + SS+S + C FC I
Sbjct: 88 HVQVDTGSDILWINCK----PCPKCPTKTNLNFRLSLFDMNASSTSKKVGCDDDFCSFIS 143
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCP--SFAYTYGEGGLVTGILTRDTLKVHGSSPG 119
SD+ C+P S+ Y + G RD L + + G
Sbjct: 144 QSDS--------------------CQPALGCSYHIVYADESTSDGKFIRDMLTLEQVT-G 182
Query: 120 IIREIP---KFCFGC-------VGSTYREPIGIAGFGRGALSVPSQL---GFLQKGFSHC 166
++ P + FGC +G+ G+ GFG+ SV SQL G ++ FSHC
Sbjct: 183 DLKTGPLGQEVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHC 242
Query: 167 FLAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTE 226
K I V + ++ TPM+ + M+ N +G++ + +SL +
Sbjct: 243 LDNVKGGG---------IFAVGVVDSPKVKTTPMVPNQMHYNVMLMGMD---VDGTSL-D 289
Query: 227 VPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDL 286
+P S+ NGG +VDSGTT + P+ Y S++++ + P + E T
Sbjct: 290 LPRSIVR-----NGGTIVDSGTTLAYFPKVLYD---SLIETILARQPVKLHIVEETF--Q 339
Query: 287 CYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQS--MD 344
C+ +T D+ FP ++F F ++V L + ++ + + + C +Q+ +
Sbjct: 340 CFSF----STNVDEAFPPVSFEFEDSVKLTVYPHDYLFTL-----EEELYCFGWQAGGLT 390
Query: 345 DGDYGPSGVFGSFQQQNVEVVYDLEKERIGF 375
+ + G N VVYDL+ E IG+
Sbjct: 391 TDERSEVILLGDLVLSNKLVVYDLDNEVIGW 421
>gi|224140237|ref|XP_002323490.1| predicted protein [Populus trichocarpa]
gi|222868120|gb|EEF05251.1| predicted protein [Populus trichocarpa]
Length = 478
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 103/396 (26%), Positives = 162/396 (40%), Gaps = 69/396 (17%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V +DTGSD+ WV C + C +C + NF S SSS++ +H S
Sbjct: 81 VQIDTGSDVLWVCCNS----CNNCPRTSGLGIQLNFFDSSSSSTAG---------LVHCS 127
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCP---SFAYTYGEGGLVTGILTRDTLK-------- 112
D P C+ + T C P S+ + Y +G +G DTL
Sbjct: 128 D-PI-------CTSAVQTTVTQCSPQTNQCSYTFQYEDGSGTSGYYVSDTLYFDAILGES 179
Query: 113 -VHGSSPGIIREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQL---GFLQKGFSHCFL 168
V SS I+ F G + T + GI GFG+G LSV SQL G + FSHC
Sbjct: 180 LVVNSSALIVFGCSTFQSGDLTMTDKAVDGIFGFGQGELSVISQLSTHGITPRVFSHCLK 239
Query: 169 AFKYANDPNISSPLVIGDVAISSKDNLQFTP-MLKSPMYPN--YYYIGLEAITIGNSSLT 225
+ ++ P M+ SP+ P+ +Y + L++I + L
Sbjct: 240 GEGIGGGILVLGEIL--------------EPGMVYSPLVPSQPHYNLNLQSIAVNGKLL- 284
Query: 226 EVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFD 285
P+ F + + G +VDSGTT +L Y +S + ++ P + + +
Sbjct: 285 --PIDPSVFATSNSQGTIVDSGTTLAYLVAEAYDPFVSAVNVIVS--PSVTPIISKG--N 338
Query: 286 LCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDD 345
CY V +T +FP +F+F S+VL ++ S + C+ FQ +
Sbjct: 339 QCYLV----STSVSQMFPLASFNFAGGASMVLKPEDYLIPFGPSQGGSVMWCIGFQKVQG 394
Query: 346 GDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
+ G ++ VYDL ++RIG+ DC+
Sbjct: 395 -----VTILGDLVLKDKIFVYDLVRQRIGWANYDCS 425
>gi|449459186|ref|XP_004147327.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 418
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 106/403 (26%), Positives = 159/403 (39%), Gaps = 85/403 (21%)
Query: 7 DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS--- 63
DTGSDLTW+ C C C + P S+ C C+++HSS
Sbjct: 75 DTGSDLTWLQC---DAPCQQCTE--------TLHPLYQPSNDLVPCKDPLCMSLHSSMDH 123
Query: 64 --DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
+NP D C + Y +GG G+L RD ++ ++ I
Sbjct: 124 RCENP-DQC--------------------DYEVEYADGGSSLGVLVRDVFPLNLTNGDPI 162
Query: 122 REIPKFCFGC-----VGSTYREPI-GIAGFGRGALSVPSQL---GFLQKGFSHCFLAFKY 172
R P+ GC GS+ P+ GI G GRGA+S+ SQL G ++ HCF
Sbjct: 163 R--PRLALGCGYDQDPGSSSYHPMDGILGLGRGAVSIVSQLHNQGIVRNVVGHCF----- 215
Query: 173 ANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGL-EAITIGNSSLTEVPLSL 231
+ L GD I L +TPM + YP +Y G E I G S+ L
Sbjct: 216 --NSKGGGYLFFGD-GIYDPYRLVWTPMSRD--YPKHYSPGFGELIFNGRST------GL 264
Query: 232 REFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVP 291
R N ++ DSG++YT+ Y L S+L + P + +++ T LC+R
Sbjct: 265 R------NLFVVFDSGSSYTYFNAQAYQVLTSLLNRELAGKPLREAMDDDT-LPLCWRGR 317
Query: 292 CPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAM-SAPSNSSAV------KCLLFQSMD 344
P + D + ++L G A+ P+ + CL +
Sbjct: 318 KPIKSLRD------VRKYFKPLALSFSSGGRSKAVFEIPTEGYMIISSMGNVCLGILNGT 371
Query: 345 DGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASAQ 387
D S + G Q+ VVY+ EK+ IG+ +C +Q
Sbjct: 372 DVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDRVPKSQ 414
>gi|125528357|gb|EAY76471.1| hypothetical protein OsI_04407 [Oryza sativa Indica Group]
Length = 441
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 108/410 (26%), Positives = 164/410 (40%), Gaps = 70/410 (17%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + +DTGS+L+W+ C + +F P S + + C S+ C
Sbjct: 78 VTMVLDTGSELSWLLCAPGGGGGG------GGRSALSFRPRASLTFASVPCGSAQC---R 128
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
S D P P S CR + +Y +G G L + V G P +
Sbjct: 129 SRDLPSPPACDGA--------SKQCR----VSLSYADGSSSDGALATEVFTV-GQGPPL- 174
Query: 122 REIPKFCFGCVGSTY-REPIGIA-----GFGRGALSVPSQLGFLQKGFSHCFLAFKYAND 175
+ FGC+ + + P G+A G RGALS SQ + FS+C +D
Sbjct: 175 ----RAAFGCMATAFDTSPDGVATAGLLGMNRGALSFVSQAS--TRRFSYCI------SD 222
Query: 176 PNISSPLVIGDVAISSKDNLQFTPMLKSPMY------PNY----YYIGLEAITIGNSSLT 225
+ + L++G +L F P+ +P+Y P + Y + L I +G L
Sbjct: 223 RDDAGVLLLG------HSDLPFLPLNYTPLYQPAMPLPYFDRVAYSVQLLGIRVGGKPL- 275
Query: 226 EVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSIL-QSTITYYPRAKE--VEERT 282
+P S+ D G G +VDSGT +T L YS L + + T + P + +
Sbjct: 276 PIPASVLAPDHTGAGQTMVDSGTQFTFLLGDAYSALKAEFSRQTKPWLPALNDPNFAFQE 335
Query: 283 GFDLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSS-AVKCLLFQ 341
FD C+RVP P++T F N + + Y + V CL F
Sbjct: 336 AFDTCFRVP--QGRAPPARLPAVTLLF-NGAQMTVAGDRLLYKVPGERRGGDGVWCLTF- 391
Query: 342 SMDDGDYGP--SGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASAQGL 389
+ D P + V G Q NV V YDLE+ R+G P+ C + GL
Sbjct: 392 --GNADMVPITAYVIGHHHQMNVWVEYDLERGRVGLAPIRCDVASERLGL 439
>gi|223950045|gb|ACN29106.1| unknown [Zea mays]
Length = 392
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 80/328 (24%), Positives = 148/328 (45%), Gaps = 45/328 (13%)
Query: 74 GCSLSTLL----KSTCCRPCPSFAYTYGEGG--LVTGILTRDTLKVHGSSPGII---REI 124
GC S L K T C ++A YG G+L D L + + + +
Sbjct: 87 GCRRSELKAEAEKETKC----TYAIKYGGNANDSTAGVLYEDKLTIVAVASKAVPGSQSF 142
Query: 125 PKFCFGCVGST---YREP--IGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPN-- 177
+ GC S +++P G+ G GR A S+P QL F + FS+C +++ + P+
Sbjct: 143 EEVAIGCSTSATLKFKDPSIKGVFGLGRSATSLPRQLNFSK--FSYCLSSYQKPDLPSYL 200
Query: 178 -ISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
+++ + A+ + T + + Y Y++ L+ I+IG + L V +
Sbjct: 201 LLTAAPDMATGAVGGAAAVATTALQPNSDYKTRYFVDLQGISIGGTRLPAV-------ST 253
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNT 296
+ G + VD+GT++T L +++L++ L + KE R +CY P +T
Sbjct: 254 KSGGNMFVDTGTSFTRLEGTVFAKLVTELDRIMKERKYVKEQPGRNNGQICY---SPPST 310
Query: 297 FTDD--LFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDY-GPSGV 353
D+ P + HF ++ ++VLP ++ + +++ CL ++D + G V
Sbjct: 311 AADESSKLPDMVLHFADSANMVLPWDSYLW------KTTSKLCL---AIDKSNIKGGISV 361
Query: 354 FGSFQQQNVEVVYDLEKERIGFQPMDCA 381
G+FQ QN ++ D E++ F DC+
Sbjct: 362 LGNFQMQNTHMLLDTGNEKLSFVRADCS 389
>gi|297742733|emb|CBI35367.3| unnamed protein product [Vitis vinifera]
Length = 521
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 95/378 (25%), Positives = 152/378 (40%), Gaps = 77/378 (20%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+D+GSD+ WV C C C Y + + F P+ S+S + +C+SS C
Sbjct: 218 IDSGSDIVWVQCQ----PCTQC--YHQSDPV--FDPADSASFTGVSCSSSVC-------- 261
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
D +GC CR + +YG+G G L +TL + ++R +
Sbjct: 262 --DRLENAGCHAGR------CR----YEVSYGDGSYTKGTLALETLTFGRT---MVRSVA 306
Query: 126 KFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNISSPLVI 184
C + G+ G G G++S QLG G FS+C ++ +
Sbjct: 307 IGCGHRNRGMFVGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVSAAW------------ 354
Query: 185 GDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFD--SQGNGGL 242
P++++P P++YYIGL + +G VP+S F G+GG+
Sbjct: 355 -------------VPLVRNPRAPSFYYIGLAGLGVGG---IRVPISEEVFRLTELGDGGV 398
Query: 243 LVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDLF 302
++D+GT T LP Y + PRA V FD CY + F
Sbjct: 399 VMDTGTAVTRLPTLAYQAFRDAFLAQTANLPRATGVAI---FDTCYDLL----GFVSVRV 451
Query: 303 PSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQNV 362
P+++F+F L LP N P + + C F G + G+ QQ+ +
Sbjct: 452 PTVSFYFSGGPILTLPARNFLI----PMDDAGTFCFAFAPSTSG----LSILGNIQQEGI 503
Query: 363 EVVYDLEKERIGFQPMDC 380
++ +D +GF P C
Sbjct: 504 QISFDGANGYVGFGPNIC 521
>gi|326503794|dbj|BAK02683.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 456
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 105/389 (26%), Positives = 157/389 (40%), Gaps = 81/389 (20%)
Query: 1 VIQVYM-DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLN 59
V Q M DTGSD++WV C + ++ F PS+S++ + +C+S+ C
Sbjct: 140 VTQTMMIDTGSDVSWVRC-------------NSTDGLTLFDPSKSTTYAPFSCSSAACAQ 186
Query: 60 IHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPG 119
+ N D C+ SGC + YG+G TG + DTL + S
Sbjct: 187 L---GNNGDGCSNSGCQ---------------YRVQYGDGSNTTGTYSSDTLALSASD-- 226
Query: 120 IIREIPKFCFGCVGSTYREPI------GIAGFGRGALSVPSQLGFL-QKGFSHCFLAFKY 172
+ F FGC S + E G+ G G A S+ SQ K FS+C
Sbjct: 227 ---TVTDFHFGC--SHHEEDFDGEKIDGLMGLGGDAQSLVSQTAATYGKSFSYCL----- 276
Query: 173 ANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLR 232
N +S + + TPML+ P P Y + L+ I++G + L P L
Sbjct: 277 -PPTNRTSGFLTFGAPNGTSGGFVTTPMLRWPKAPTLYGVLLQDISVGGTPLGIQPSVLS 335
Query: 233 EFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPC 292
G ++DSGT T LP YS L S +S++T R + D CY
Sbjct: 336 N-------GSVMDSGTVITWLPRRAYSALSSAFRSSMTRL-RHQRAAPLGILDTCYD--- 384
Query: 293 PNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVK-CLLFQSMDDGDYGPS 351
F + + VSLVL G + N ++ CL F + GD
Sbjct: 385 ---------FTGLVNVSIPAVSLVLDGGA---VVDLDGNGIMIQDCLAFAAT-SGD---- 427
Query: 352 GVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
+ G+ QQ+ EV++D+ + GF+ C
Sbjct: 428 SIIGNVQQRTFEVLHDVGQGVFGFRSGAC 456
>gi|7715602|gb|AAF68120.1|AC010793_15 F20B17.14 [Arabidopsis thaliana]
gi|12324588|gb|AAG52249.1|AC011717_17 putative aspartyl protease; 105611-106921 [Arabidopsis thaliana]
Length = 436
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 96/385 (24%), Positives = 166/385 (43%), Gaps = 54/385 (14%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + +DTGSDLTWV C C C N+ + PS SSS C SS C ++
Sbjct: 98 MSLIVDTGSDLTWVQCQ----PCRSC----YNQQGPLYDPSVSSSYKTVFCNSSTCQDLV 149
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
++ + PC + + T PC + +YG+G G L +++ + +
Sbjct: 150 AATSNSGPCGGNNGVVKT--------PCE-YVVSYGDGSYTRGDLASESILLGDT----- 195
Query: 122 REIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQ-LGFLQKGFSHCFLAFKYANDPN 177
++ F FGC + + G+ G GR ++S+ SQ L FS+C + +
Sbjct: 196 -KLENFVFGCGRNNKGLFGGSSGLMGLGRSSVSLVSQTLKTFNGVFSYCLPSL----EDG 250
Query: 178 ISSPLVIGD--VAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFD 235
S L G+ ++ ++ +TP++++P ++Y + L +IG L
Sbjct: 251 ASGSLSFGNDSSVYTNSTSVSYTPLVQNPQLRSFYILNLTGASIGGVELKS--------- 301
Query: 236 SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNN 295
S G+L+DSGT T LP Y + + +P A + D C+ +
Sbjct: 302 SSFGRGILIDSGTVITRLPPSIYKAVKIEFLKQFSGFPTAPGY---SILDTCFNL----T 354
Query: 296 TFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFG 355
++ D P I F N L + FY + +++ CL S+ + G+ G
Sbjct: 355 SYEDISIPIIKMIFQGNAELEVDVTGVFYFVKP---DASLVCLALASLSYEN--EVGIIG 409
Query: 356 SFQQQNVEVVYDLEKERIGFQPMDC 380
++QQ+N V+YD +ER+G +C
Sbjct: 410 NYQQKNQRVIYDTTQERLGIVGENC 434
>gi|125527370|gb|EAY75484.1| hypothetical protein OsI_03384 [Oryza sativa Indica Group]
Length = 453
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 104/387 (26%), Positives = 159/387 (41%), Gaps = 62/387 (16%)
Query: 7 DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDNP 66
DTGSDL W CG C C + + ++ P+ SSS++ C C + P
Sbjct: 110 DTGSDLIWTKCGA----CARC----SPRGSPSYYPTSSSSAAFVACGDRTCGEL-----P 156
Query: 67 FDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGG----LVTGILTRDTLKVHGSSPGIIR 122
C S + C S+ Y YG GIL +T +
Sbjct: 157 RPLC--SNVAGGGSGSGNC-----SYHYAYGNARDTHHYTEGILMTETFTFGDDA----A 205
Query: 123 EIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNIS 179
P FGC + + G+ G GRG LS+ +QL G+ + ++D +
Sbjct: 206 AFPGIAFGCTLRSEGGFGTGSGLVGLGRGKLSLVTQLNVEAFGY-------RLSSDLSAP 258
Query: 180 SPLVIG---DVAISSKDNLQFTPMLKSPMYPN--YYYIGLEAITIGNSSLTEVPLSLREF 234
SP+ G DV + D+ TP+L +P+ + +YY+GL I++G L ++P F
Sbjct: 259 SPISFGSLADVTGGNGDSFMSTPLLTNPVVQDLPFYYVGLTGISVGGK-LVQIPSGTFSF 317
Query: 235 D-SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYY--PRAKEVEERTGFDLCYRVP 291
D S G GG++ DSGTT T LP+P Y+ + L S + + P A ++ +C+
Sbjct: 318 DRSTGAGGVIFDSGTTLTMLPDPAYTLVRDELLSQMGFQKPPPAANDDDL----ICFTGG 373
Query: 292 CPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPS 351
T FPS+ HF + L N+ M N +C
Sbjct: 374 SSTTT-----FPSMVLHFDGGADMDLSTENYLPQMQG-QNGETARCWSVVKSSQALT--- 424
Query: 352 GVFGSFQQQNVEVVYDLE-KERIGFQP 377
+ G+ Q + VV+DL R+ FQP
Sbjct: 425 -IIGNIMQMDFHVVFDLSGNARMLFQP 450
>gi|413923876|gb|AFW63808.1| hypothetical protein ZEAMMB73_793799 [Zea mays]
Length = 415
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 80/328 (24%), Positives = 148/328 (45%), Gaps = 45/328 (13%)
Query: 74 GCSLSTLL----KSTCCRPCPSFAYTYGEGG--LVTGILTRDTLKVHGSSPGII---REI 124
GC S L K T C ++A YG G+L D L + + + +
Sbjct: 110 GCRRSELKAEAEKETKC----TYAIKYGGNANDSTAGVLYEDKLTIVAVASKAVPGSQSF 165
Query: 125 PKFCFGCVGST---YREP--IGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPN-- 177
+ GC S +++P G+ G GR A S+P QL F + FS+C +++ + P+
Sbjct: 166 EEVAIGCSTSATLKFKDPSIKGVFGLGRSATSLPRQLNFSK--FSYCLSSYQKPDLPSYL 223
Query: 178 -ISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
+++ + A+ + T + + Y Y++ L+ I+IG + L V +
Sbjct: 224 LLTAAPDMATGAVGGAAAVATTALQPNSDYKTRYFVDLQGISIGGTRLPAV-------ST 276
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNT 296
+ G + VD+GT++T L +++L++ L + KE R +CY P +T
Sbjct: 277 KSGGNMFVDTGTSFTRLEGTVFAKLVTELDRIMKERKYVKEQPGRNNGQICYS---PPST 333
Query: 297 FTDD--LFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDY-GPSGV 353
D+ P + HF ++ ++VLP ++ + +++ CL ++D + G V
Sbjct: 334 AADESSKLPDMVLHFADSANMVLPWDSYLW------KTTSKLCL---AIDKSNIKGGISV 384
Query: 354 FGSFQQQNVEVVYDLEKERIGFQPMDCA 381
G+FQ QN ++ D E++ F DC+
Sbjct: 385 LGNFQMQNTHMLLDTGNEKLSFVRADCS 412
>gi|225427558|ref|XP_002266614.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 444
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 99/391 (25%), Positives = 164/391 (41%), Gaps = 77/391 (19%)
Query: 7 DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDNP 66
DTGSDL W C C +C Y + + F P S + C + FC ++ +
Sbjct: 112 DTGSDLIWRQC----LPCPNC--YEQVEPL--FDPKESETYKTLDCDNEFCQDLGQQGSC 163
Query: 67 FDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIPK 126
D +TC +++Y+YG+ G L+ DTL + GS+ G P
Sbjct: 164 DD-------------DNTC-----TYSYSYGDRSYTRGDLSSDTLTI-GSTEGDPASFPG 204
Query: 127 FCFGC---VGSTYREP-----IGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNI 178
FGC G T+ E G + + S++G FS+C + ++D +
Sbjct: 205 IAFGCGHDNGGTFNEKDGGLIGLGGGPLSLVMQLSSEVG---GQFSYCLVPL--SSDSTV 259
Query: 179 SSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLT--------EVPLS 230
SS + G + S TP++K +YY+ LE +++G+ ++ P +
Sbjct: 260 SSKINFGKSGVVSGSGTVSTPLIKG-TPDTFYYLTLEGLSVGSETVAFKGFSENKSSPAA 318
Query: 231 LREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTG-FDLCYR 289
+ E G +++DSGTT T LP+ FY+ + S L + I + + G F LCY
Sbjct: 319 VEE------GNIIIDSGTTLTLLPQDFYTDVESALTNAI----GGQTTTDPNGIFSLCY- 367
Query: 290 VPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYG 349
++ + P+IT HF + LP N F + + F + +
Sbjct: 368 -----SSVNNLEIPTITAHF-TGADVQLPPLNTFVQVQE-------DLVCFSMIPSSNL- 413
Query: 350 PSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
+FG+ Q N V YDL+ ++ F+ DC
Sbjct: 414 --AIFGNLAQINFLVGYDLKNNKVSFKQTDC 442
>gi|449441618|ref|XP_004138579.1| PREDICTED: uncharacterized protein LOC101220661 [Cucumis sativus]
Length = 2819
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 109/403 (27%), Positives = 170/403 (42%), Gaps = 64/403 (15%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + +DTGS+L+W+ C ++ L S F+P SSS S C+S C
Sbjct: 1013 VTMVLDTGSELSWLHCK------------KSPNLTSVFNPLSSSSYSPIPCSSPIC-RTR 1059
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
+ D P +P T C L + +Y + + G L D ++ S+
Sbjct: 1060 TRDLP-NPVT---CDPKKLCHAIV---------SYADASSLEGNLASDNFRIGSSA---- 1102
Query: 122 REIPKFCFGCVGSTYR-------EPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYAN 174
+P FGC+ S + + G+ G RG+LS +QLG + FS+C +
Sbjct: 1103 --LPGTLFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLGLPK--FSYCI------S 1152
Query: 175 DPNISSPLVIGDVAISSKDNLQFTPMLK-SPMYPNY----YYIGLEAITIGNSSLTEVPL 229
+ S L+ GD+ +S NL +TP+++ S P + Y + L+ I +GN L +P
Sbjct: 1153 GRDSSGVLLFGDLHLSWLGNLTYTPLVQISTPLPYFDRVAYTVQLDGIRVGNKIL-PLPK 1211
Query: 230 SLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLS--ILQSTITYYPRAK-EVEERTGFDL 286
S+ D G G +VDSGT +T L P Y+ L + + Q+ P + DL
Sbjct: 1212 SIFAPDHTGAGQTMVDSGTQFTFLLGPVYTALRNEFLEQTKGVLAPLGDPNFVFQGAMDL 1271
Query: 287 CYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDG 346
CY V T PS++ F +V + + + V CL F + D
Sbjct: 1272 CYSVAAGGKLPT---LPSVSLMFRGAEMVVGGEVLLYRVPEMMKGNEWVYCLTFGNSDLL 1328
Query: 347 DYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASAQGL 389
+ V G QQNV + +DL + F C S AQ L
Sbjct: 1329 GI-EAFVIGHHHQQNVWMEFDL----VAFAADLCGSIDHAQIL 1366
>gi|21595063|gb|AAM66069.1| putative aspartyl protease [Arabidopsis thaliana]
Length = 484
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 96/385 (24%), Positives = 166/385 (43%), Gaps = 54/385 (14%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + +DTGSDLTWV C C C N+ + PS SSS C SS C ++
Sbjct: 146 MSLIVDTGSDLTWVQCQ----PCRSC----YNQQGPLYDPSVSSSYKTVFCNSSTCQDLV 197
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
++ + PC + + T PC + +YG+G G L +++ + +
Sbjct: 198 AATSNSGPCGGNNGVVKT--------PCE-YVVSYGDGSYTRGDLASESILLGDT----- 243
Query: 122 REIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQ-LGFLQKGFSHCFLAFKYANDPN 177
++ F FGC + + G+ G GR ++S+ SQ L FS+C + +
Sbjct: 244 -KLENFVFGCGRNNKGLFGGSSGLMGLGRSSVSLVSQTLKTFNGVFSYCLPSL----EDG 298
Query: 178 ISSPLVIGD--VAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFD 235
S L G+ ++ ++ +TP++++P ++Y + L +IG L
Sbjct: 299 ASGSLSFGNDSSVYTNSTSVSYTPLVQNPQLRSFYILNLTGASIGGVELK---------S 349
Query: 236 SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNN 295
S G+L+DSGT T LP Y + + +P A + D C+ +
Sbjct: 350 SSFGRGILIDSGTVITRLPPSIYKAVKIEFLKQFSGFPTAPGY---SILDTCFNL----T 402
Query: 296 TFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFG 355
++ D P I F N L + FY + +++ CL S+ + G+ G
Sbjct: 403 SYEDISIPIIKMIFQGNAELEVDVTGVFYFVKP---DASLVCLALASLSYEN--EVGIIG 457
Query: 356 SFQQQNVEVVYDLEKERIGFQPMDC 380
++QQ+N V+YD +ER+G +C
Sbjct: 458 NYQQKNQRVIYDSTQERLGIVGENC 482
>gi|125571687|gb|EAZ13202.1| hypothetical protein OsJ_03122 [Oryza sativa Japonica Group]
Length = 453
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 104/387 (26%), Positives = 159/387 (41%), Gaps = 62/387 (16%)
Query: 7 DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDNP 66
DTGSDL W CG C C + + ++ P+ SSS++ C C + P
Sbjct: 110 DTGSDLIWTKCGA----CARC----SPRGSPSYYPTSSSSAAFVACGDRTCGEL-----P 156
Query: 67 FDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGG----LVTGILTRDTLKVHGSSPGIIR 122
C S + C S+ Y YG GIL +T +
Sbjct: 157 RPLC--SNVAGGGSGSGNC-----SYHYAYGNARDTHHYTEGILMTETFTFGDDA----A 205
Query: 123 EIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNIS 179
P FGC + + G+ G GRG LS+ +QL G+ + ++D +
Sbjct: 206 AFPGIAFGCTLRSEGGFGTGSGLVGLGRGKLSLVTQLNVEAFGY-------RLSSDLSAP 258
Query: 180 SPLVIG---DVAISSKDNLQFTPMLKSPMYPN--YYYIGLEAITIGNSSLTEVPLSLREF 234
SP+ G DV + D+ TP+L +P+ + +YY+GL I++G L ++P F
Sbjct: 259 SPISFGSLADVTGGNGDSFMSTPLLTNPVVQDLPFYYVGLTGISVGGK-LVQIPSGTFSF 317
Query: 235 D-SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYY--PRAKEVEERTGFDLCYRVP 291
D S G GG++ DSGTT T LP+P Y+ + L S + + P A ++ +C+
Sbjct: 318 DRSTGAGGVIFDSGTTLTMLPDPAYTLVRDELLSQMGFQKPPPAANDDDL----ICFTGG 373
Query: 292 CPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPS 351
T FPS+ HF + L N+ M N +C
Sbjct: 374 SSTTT-----FPSMVLHFDGGADMDLSTENYLPQMQG-QNGETARCWSVVKSSQALT--- 424
Query: 352 GVFGSFQQQNVEVVYDLE-KERIGFQP 377
+ G+ Q + VV+DL R+ FQP
Sbjct: 425 -IIGNIMQMDFHVVFDLSGNARMLFQP 450
>gi|125555058|gb|EAZ00664.1| hypothetical protein OsI_22685 [Oryza sativa Indica Group]
Length = 465
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 103/387 (26%), Positives = 152/387 (39%), Gaps = 64/387 (16%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V +DTGSDL+WV C C + Y + F PS SSS + C S C + +
Sbjct: 133 VLIDTGSDLSWVQCK----PCGAGECYAQKDPL--FDPSSSSSYASVPCDSDACRKLAAG 186
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
CT +L C + YG TG+ + +TL + PG++
Sbjct: 187 AYGHG-CTSGAAAL--------CE----YGIEYGNRATTTGVYSTETLTLK---PGVV-- 228
Query: 124 IPKFCFGC---VGSTYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCF------LAFKYA 173
+ F FGC Y + G+ G G S+ SQ G FS+C F
Sbjct: 229 VADFGFGCGDHQHGPYEKFDGLLGLGGAPESLVSQTSSQFGGPFSYCLPPTSGGAGFLAL 288
Query: 174 NDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLRE 233
PN SS + ++ FTPM + P P +Y + L I++G + L P +
Sbjct: 289 GAPNSSS-------SSTAAAGFLFTPMRRIPSVPTFYVVTLTGISVGGAPLAVPPSAFSS 341
Query: 234 FDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCP 293
G+++DSGT T LP Y+ L S +S ++ Y R D CY
Sbjct: 342 -------GMVIDSGTVITGLPATAYAALRSAFRSAMSEY-RLLPPSNGAVLDTCYDF--- 390
Query: 294 NNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGV 353
T+ P+I F ++ L + P+ CL F D G+
Sbjct: 391 -TGHTNVTVPTIALTFSGGATIDL---------ATPAGVLVDGCLAFAGAGTDDT--IGI 438
Query: 354 FGSFQQQNVEVVYDLEKERIGFQPMDC 380
G+ Q+ EV+YD K +GF+ C
Sbjct: 439 IGNVNQRTFEVLYDSGKGTVGFRAGAC 465
>gi|115452187|ref|NP_001049694.1| Os03g0271900 [Oryza sativa Japonica Group]
gi|29893618|gb|AAP06872.1| hypothetical protein [Oryza sativa Japonica Group]
gi|108707424|gb|ABF95219.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|108707425|gb|ABF95220.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548165|dbj|BAF11608.1| Os03g0271900 [Oryza sativa Japonica Group]
gi|215715205|dbj|BAG94956.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215737033|dbj|BAG95962.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215740994|dbj|BAG97489.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 447
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 106/407 (26%), Positives = 160/407 (39%), Gaps = 66/407 (16%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + +DTGS+L+W + C+ L F+ S SSS C S+ C
Sbjct: 68 VTMVLDTGSELSW----------LLCNGSYAPPLTPAFNASGSSSYGAVPCPSTAC-EWR 116
Query: 62 SSDNPFDP-CTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
D P P C S CR + +Y + G+L DT + G +P +
Sbjct: 117 GRDLPVPPFCDTP--------PSNACR----VSLSYADASSADGVLATDTFLLTGGAPPV 164
Query: 121 IREIPKFCFGCVGS---------------TYREPIGIAGFGRGALSVPSQLGFLQKGFSH 165
+ + FGC+ S G+ G RG LS +Q G + F++
Sbjct: 165 --AVGAY-FGCITSYSSTTATNSNGTGTDVSEAATGLLGMNRGTLSFVTQTG--TRRFAY 219
Query: 166 CFLAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLK-SPMYPNY----YYIGLEAITIG 220
C P + L++GD L +TP+++ S P + Y + LE I +G
Sbjct: 220 CI---APGEGPGV---LLLGDDG-GVAPPLNYTPLIEISQPLPYFDRVAYSVQLEGIRVG 272
Query: 221 NSSLTEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSIL--QSTITYYPRAKEV 278
+L +P S+ D G G +VDSGT +T L Y+ L + Q+ + P +
Sbjct: 273 -CALLPIPKSVLTPDHTGAGQTMVDSGTQFTFLLADAYAALKAEFTSQARLLLAPLGEPG 331
Query: 279 EERTG-FDLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAM----SAPSNSS 333
G FD C+R P L P + L + + Y + +
Sbjct: 332 FVFQGAFDACFRGPEARVAAASGLLPEVGL-VLRGAEVAVSGEKLLYMVPGERRGEGGAE 390
Query: 334 AVKCLLFQSMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
AV CL F + D + V G QQNV V YDL+ R+GF P C
Sbjct: 391 AVWCLTFGNSDMAGMS-AYVIGHHHQQNVWVEYDLQNGRVGFAPARC 436
>gi|449527149|ref|XP_004170575.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 487
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 107/387 (27%), Positives = 151/387 (39%), Gaps = 78/387 (20%)
Query: 7 DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDNP 66
DTGSD+TW+ C C +S +TC F D
Sbjct: 166 DTGSDVTWL-------QCQPC-------------------ASENTCYKQF-------DPI 192
Query: 67 FDP----------CTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGS 116
FDP C C L L K+ C + YG+G TG L +TL S
Sbjct: 193 FDPKSSSSYSPLSCNSQQCKL--LDKANCNSDTCIYQVHYGDGSFTTGELATETLSFGNS 250
Query: 117 SPGIIREIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYA 173
+ IP GC + G+ G G GA+S+ SQL FS+C +
Sbjct: 251 N-----SIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLK--ASSFSYCLVNL--- 300
Query: 174 NDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLRE 233
D + SS L S D+L +P++K+ + +Y Y+ + I++G +L P E
Sbjct: 301 -DSDSSSTLEFNSYMPS--DSLT-SPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRF-E 355
Query: 234 FDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCP 293
D G GG++VDSGT + LP Y L + A + + FD CY
Sbjct: 356 IDESGLGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGI---SVFDTCYNFSGQ 412
Query: 294 NNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGV 353
+N P+I F SL LP N+ + +++ CL F +
Sbjct: 413 SNVEV----PTIAFVLSEGTSLRLPARNYLIML----DTAGTYCLAFIKTKSS----LSI 460
Query: 354 FGSFQQQNVEVVYDLEKERIGFQPMDC 380
GSFQQQ + V YDL +GF C
Sbjct: 461 IGSFQQQGIRVSYDLTNSIVGFSTNKC 487
>gi|449440933|ref|XP_004138238.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 487
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 107/387 (27%), Positives = 151/387 (39%), Gaps = 78/387 (20%)
Query: 7 DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDNP 66
DTGSD+TW+ C C +S +TC F D
Sbjct: 166 DTGSDVTWL-------QCQPC-------------------ASENTCYKQF-------DPI 192
Query: 67 FDP----------CTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGS 116
FDP C C L L K+ C + YG+G TG L +TL S
Sbjct: 193 FDPKSSSSYSPLSCNSQQCKL--LDKANCNSDTCIYQVHYGDGSFTTGELATETLSFGNS 250
Query: 117 SPGIIREIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYA 173
+ IP GC + G+ G G GA+S+ SQL FS+C +
Sbjct: 251 N-----SIPNLPIGCGHDNEGLFAGGAGLIGLGGGAISLSSQLK--ASSFSYCLVNL--- 300
Query: 174 NDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLRE 233
D + SS L S D+L +P++K+ + +Y Y+ + I++G +L P E
Sbjct: 301 -DSDSSSTLEFNSNMPS--DSLT-SPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRF-E 355
Query: 234 FDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCP 293
D G GG++VDSGT + LP Y L + A + + FD CY
Sbjct: 356 IDESGLGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGI---SVFDTCYNFSGQ 412
Query: 294 NNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGV 353
+N P+I F SL LP N+ + +++ CL F +
Sbjct: 413 SNVEV----PTIAFVLSEGTSLRLPARNYLIML----DTAGTYCLAFIKTKSS----LSI 460
Query: 354 FGSFQQQNVEVVYDLEKERIGFQPMDC 380
GSFQQQ + V YDL +GF C
Sbjct: 461 IGSFQQQGIRVSYDLTNSLVGFSTNKC 487
>gi|242046218|ref|XP_002460980.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
gi|241924357|gb|EER97501.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
Length = 517
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 110/396 (27%), Positives = 165/396 (41%), Gaps = 58/396 (14%)
Query: 3 QVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHS 62
++ MDTGSDL W+ C C+DC D ++ F P+ SSS TC C +
Sbjct: 165 RMIMDTGSDLNWLQCA----PCLDCFD----QVGPVFDPAASSSYRNVTCGDQRCGLVAP 216
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRP----CPSFAYTYGEGGLVTGILTRDTLKVHGSSP 118
+ P C RP CP + Y YG+ TG L ++ V+ ++P
Sbjct: 217 PEPP----------------RACRRPGEDSCPYY-YWYGDQSNTTGDLALESFTVNLTAP 259
Query: 119 GIIREIPKFCFGC---VGSTYREPIGIAGFGRGALSVPSQL-GFLQKGFSHCFLAFKYAN 174
G R + FGC + G+ G GRG LS SQL FS+C + +
Sbjct: 260 GASRRVDDVVFGCGHWNRGLFHGAAGLLGLGRGPLSFASQLRAVYGHTFSYCLVD----H 315
Query: 175 DPNISSPLVIGDVAISSKD----NLQFTPML-KSPMYPNYYYIGLEAITIGNSSLT-EVP 228
+++S +V G+ + L +T S +YY+ L+ + +G L
Sbjct: 316 GSDVASKVVFGEDDALALAAAHPQLNYTAFAPASSPADTFYYVKLKGVLVGGELLNISSD 375
Query: 229 LSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCY 288
G+GG ++DSGTT ++ EP Y I Q+ I R+ + D
Sbjct: 376 TWGVGEGEGGSGGTIIDSGTTLSYFVEPAYQ---VIRQAFIDRMGRSYPLIP----DFPV 428
Query: 289 RVPCPNNTFTDD-LFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGD 347
PC N + D P ++ F + P N+F + + + CL +
Sbjct: 429 LSPCYNVSGVDRPEVPELSLLFADGAVWDFPAENYFIRL----DPDGIMCLAV--LGTPR 482
Query: 348 YGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCAST 383
G S + G+FQQQN VVYDL+ R+GF P CA
Sbjct: 483 TGMS-IIGNFQQQNFHVVYDLKNNRLGFAPRRCAEV 517
>gi|115441003|ref|NP_001044781.1| Os01g0844500 [Oryza sativa Japonica Group]
gi|19571042|dbj|BAB86469.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|20160609|dbj|BAB89555.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|113534312|dbj|BAF06695.1| Os01g0844500 [Oryza sativa Japonica Group]
gi|125572614|gb|EAZ14129.1| hypothetical protein OsJ_04051 [Oryza sativa Japonica Group]
Length = 442
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 108/410 (26%), Positives = 164/410 (40%), Gaps = 70/410 (17%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + +DTGS+L+W+ C + +F P S + + C S+ C
Sbjct: 79 VTMVLDTGSELSWLLCAPGGGGGG------GGRSALSFRPRASLTFASVPCDSAQC---R 129
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
S D P P S CR + +Y +G G L + V G P +
Sbjct: 130 SRDLPSPPACDGA--------SKQCR----VSLSYADGSSSDGALATEVFTV-GQGPPL- 175
Query: 122 REIPKFCFGCVGSTY-REPIGIA-----GFGRGALSVPSQLGFLQKGFSHCFLAFKYAND 175
+ FGC+ + + P G+A G RGALS SQ + FS+C +D
Sbjct: 176 ----RAAFGCMATAFDTSPDGVATAGLLGMNRGALSFVSQAS--TRRFSYCI------SD 223
Query: 176 PNISSPLVIGDVAISSKDNLQFTPMLKSPMY------PNY----YYIGLEAITIGNSSLT 225
+ + L++G +L F P+ +P+Y P + Y + L I +G L
Sbjct: 224 RDDAGVLLLG------HSDLPFLPLNYTPLYQPAMPLPYFDRVAYSVQLLGIRVGGKPL- 276
Query: 226 EVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSIL-QSTITYYPRAKE--VEERT 282
+P S+ D G G +VDSGT +T L YS L + + T + P + +
Sbjct: 277 PIPASVLAPDHTGAGQTMVDSGTQFTFLLGDAYSALKAEFSRQTKPWLPALNDPNFAFQE 336
Query: 283 GFDLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSS-AVKCLLFQ 341
FD C+RVP P++T F N + + Y + V CL F
Sbjct: 337 AFDTCFRVP--QGRAPPARLPAVTLLF-NGAQMTVAGDRLLYKVPGERRGGDGVWCLTF- 392
Query: 342 SMDDGDYGP--SGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASAQGL 389
+ D P + V G Q NV V YDLE+ R+G P+ C + GL
Sbjct: 393 --GNADMVPITAYVIGHHHQMNVWVEYDLERGRVGLAPIRCDVASERLGL 440
>gi|125543284|gb|EAY89423.1| hypothetical protein OsI_10930 [Oryza sativa Indica Group]
Length = 447
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 106/407 (26%), Positives = 160/407 (39%), Gaps = 66/407 (16%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + +DTGS+L+W + C+ L F+ S SSS C S+ C
Sbjct: 68 VTMVLDTGSELSW----------LLCNGSYAPPLTPAFNASGSSSYGAVPCPSTAC-EWR 116
Query: 62 SSDNPFDP-CTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
D P P C S CR + +Y + G+L DT + G +P +
Sbjct: 117 GRDLPVPPFCDTP--------PSNACR----VSLSYADASSADGVLATDTFLLTGGAPPV 164
Query: 121 IREIPKFCFGCVGS---------------TYREPIGIAGFGRGALSVPSQLGFLQKGFSH 165
+ + FGC+ S G+ G RG LS +Q G + F++
Sbjct: 165 --AVGAY-FGCITSYSSTTATNSNGTGTDVSEAATGLLGMNRGTLSFVTQTG--TRRFAY 219
Query: 166 CFLAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLK-SPMYPNY----YYIGLEAITIG 220
C P + L++GD L +TP+++ S P + Y + LE I +G
Sbjct: 220 CI---APGEGPGV---LLLGDDG-GVAPPLNYTPLIEISQPLPYFDRVAYSVQLEGIRVG 272
Query: 221 NSSLTEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSIL--QSTITYYPRAKEV 278
+L +P S+ D G G +VDSGT +T L Y+ L + Q+ + P +
Sbjct: 273 -CALLPIPKSVLTPDHTGAGQTMVDSGTQFTFLLADAYAALKAEFTSQARLLLAPLGEPG 331
Query: 279 EERTG-FDLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAM----SAPSNSS 333
G FD C+R P L P + L + + Y + +
Sbjct: 332 FVFQGAFDACFRGPEARVAAASGLLPVVGL-VLRGAEVAVSGEKLLYMVPGERRGEGGAE 390
Query: 334 AVKCLLFQSMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
AV CL F + D + V G QQNV V YDL+ R+GF P C
Sbjct: 391 AVWCLTFGNSDMAGMS-AYVIGHHHQQNVWVEYDLQNGRVGFAPARC 436
>gi|293333012|ref|NP_001168410.1| uncharacterized protein LOC100382179 precursor [Zea mays]
gi|223948083|gb|ACN28125.1| unknown [Zea mays]
gi|413953783|gb|AFW86432.1| hypothetical protein ZEAMMB73_737575 [Zea mays]
Length = 466
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 108/393 (27%), Positives = 165/393 (41%), Gaps = 80/393 (20%)
Query: 1 VIQVY-MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLN 59
V QV +DTGSD++WV C + C + +KL F P++S++ S +C+S+ C
Sbjct: 141 VTQVMSIDTGSDVSWVQCAPCA--AQSCSS-QKDKL---FDPAKSATYSAFSCSSAQCAQ 194
Query: 60 IHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPG 119
+ N GC L S C + Y + TG DTL + S
Sbjct: 195 LGGEGN--------GC-----LNSHC-----QYIVKYVDHSNTTGTYGSDTLGLTTSD-- 234
Query: 120 IIREIPKFCFGCVGSTYR------EPIGIAGFGRGALSVPSQLGFL-QKGFSHCFLAFKY 172
+ F FGC ++R + G+ G G S+ SQ K FS+C
Sbjct: 235 ---AVKNFQFGC---SHRANGFVGQLDGLMGLGGDTESLVSQTAATYGKAFSYCL----- 283
Query: 173 ANDPNISSP---LVIGDVAI-SSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVP 228
P+ SS L +G A +S TP+++ + P +Y + L+AIT+ + L VP
Sbjct: 284 --PPSSSSAGGFLTLGAAAGGTSSSRYSRTPLVRFNV-PTFYGVFLQAITVAGTKL-NVP 339
Query: 229 LSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCY 288
S+ +G +VDSGT T LP Y L + + + YP A V D C+
Sbjct: 340 ASVF------SGASVVDSGTVITQLPPTAYQALRTAFKKEMKAYPSAAPVGI---LDTCF 390
Query: 289 RVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLF-QSMDDGD 347
+ P +T F + L FYA CL F + DGD
Sbjct: 391 DF----SGIKTVRVPVVTLTFSRGAVMDLDVSGIFYA----------GCLAFTATAQDGD 436
Query: 348 YGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
+G+ G+ QQ+ E+++D+ +GF+P C
Sbjct: 437 ---TGILGNVQQRTFEMLFDVGGSTLGFRPGAC 466
>gi|242095586|ref|XP_002438283.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
gi|241916506|gb|EER89650.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
Length = 470
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 90/385 (23%), Positives = 154/385 (40%), Gaps = 70/385 (18%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DTGS LTW+ C C + ++ + P SS+ + C++S C + ++
Sbjct: 149 MVVDTGSSLTWLQCSPCVVSC-------HRQVGPLYDPRASSTYATVPCSASQCDELQAA 201
Query: 64 D-NPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIR 122
NP S CS+ ++ C + +YG+ G L+RDT+ S
Sbjct: 202 TLNP------SACSV----RNVCI-----YQASYGDSSFSVGYLSRDTVSFGSGS----- 241
Query: 123 EIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLG-FLQKGFSHCFLAFKYANDPNI 178
P F +GC + G+ G R LS+ QL L FS+C +
Sbjct: 242 -YPNFYYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYSFSYC-----------L 289
Query: 179 SSPLVIGDVAIS--SKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
+P G ++I + + +TPM S + + Y++ L +++G S L P +
Sbjct: 290 PTPASTGYLSIGPYTSGHYSYTPMASSSLDASLYFVTLSGMSVGGSPLAVSPAEYSSLPT 349
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNT 296
++DSGT T LP Y+ L + + + + + D C++
Sbjct: 350 ------IIDSGTVITRLPTAVYTALSKAVAAAMV---GVQSAPAFSILDTCFQ-----GQ 395
Query: 297 FTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGS 356
+ P++ F +L L N + + CL F D + + G+
Sbjct: 396 ASQLRVPAVAMAFAGGATLKLATQNVLIDVD-----DSTTCLAFAPTDS-----TTIIGN 445
Query: 357 FQQQNVEVVYDLEKERIGFQPMDCA 381
QQQ VVYD+ + RIGF C+
Sbjct: 446 TQQQTFSVVYDVAQSRIGFAAGGCS 470
>gi|356559244|ref|XP_003547910.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 515
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 106/397 (26%), Positives = 154/397 (38%), Gaps = 87/397 (21%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNF-----SPSRSSSSSRDTCASSFCL 58
V +DTGSDL WVPC DC C ++ S+F +P+ SS+S + TC +S C+
Sbjct: 111 VALDTGSDLFWVPC-----DCTRCAATDSSAFASDFDLNVYNPNGSSTSKKVTCNNSLCM 165
Query: 59 NIHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSP 118
+ S C L TL CP +GIL D L +
Sbjct: 166 H------------RSQC-LGTLSN------CPYMVSYVSAETSTSGILVEDVLHLTQEDN 206
Query: 119 GIIREIPKFCFGC----VGS--TYREPIGIAGFGRGALSVPSQL---GFLQKGFSHCFLA 169
FGC GS P G+ G G +SVPS L GF FS CF
Sbjct: 207 HHDLVEANVIFGCGQIQSGSFLDVAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCF-- 264
Query: 170 FKYANDPNISSPLVIGDVAISSKDNL--QFTPMLKSPMYPNYYYIGLEAITIGNSSLTEV 227
D IG ++ K + TP +P +P Y N ++T+V
Sbjct: 265 ---GRDG-------IGRISFGDKGSFDQDETPFNLNPSHPTY-----------NITVTQV 303
Query: 228 PLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLC 287
+ D + L DSGT++T+L +P Y++L S + R + R F+ C
Sbjct: 304 RVGTTLIDVEFTA--LFDSGTSFTYLVDPTYTRLTESFHSQVQ--DRRHRSDSRIPFEYC 359
Query: 288 YRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYA----MSAPSNSSAVKCLLFQSM 343
Y D+ P + +VSL + G+HF + + S V CL
Sbjct: 360 Y-----------DMSPDANTSLIPSVSLTMGGGSHFAVYDPIIIISTQSELVYCLAVVKT 408
Query: 344 DDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
+ + + G VV+D EK +G++ DC
Sbjct: 409 AELN-----IIGQNFMTGYRVVFDREKLVLGWKKFDC 440
>gi|242066176|ref|XP_002454377.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
gi|241934208|gb|EES07353.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
Length = 474
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 102/383 (26%), Positives = 155/383 (40%), Gaps = 73/383 (19%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFC--LNIHSS 63
+DTGSDL+WV C C Y + F P++SSS + C C L I++S
Sbjct: 157 VDTGSDLSWVQC----TPCAAPACYSQKDPL--FDPAQSSSYAAVPCGGPVCGGLGIYAS 210
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
C+ + C + +YG+G TG+ + DTL + SP
Sbjct: 211 S-----CSAAQCG---------------YVVSYGDGSKTTGVYSSDTLTL---SPN--DA 245
Query: 124 IPKFCFGC--VGSTYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNISS 180
+ F FGC S + G+ G GR S+ Q G FS+C P+ +
Sbjct: 246 VRGFFFGCGHAQSGFTGNDGLLGLGREEASLVEQTAGTYGGVFSYCL-----PTRPSTTG 300
Query: 181 PLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNG 240
L +G + ++ T +L SP YY + L I++G L+ VP S+ G
Sbjct: 301 YLTLGGPSGAAPPGFSTTQLLSSPNAATYYVVMLTGISVGGQQLS-VPSSVFA------G 353
Query: 241 GLLVDSGTTYTHLPEPFYSQLLSILQSTITY--YPRAKEVEERTG-FDLCYRVPCPNNTF 297
G +VD+GT T LP Y+ L S +S + YP A TG D CY + +
Sbjct: 354 GTVVDTGTVITRLPPTAYAALRSAFRSGMASYGYPSAPA----TGILDTCYNF----SGY 405
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSF 357
P++ F ++ L + CL F G G + G+
Sbjct: 406 GTVTLPNVALTFSGGATVTLGADGIL----------SFGCLAF--APSGSDGGMAILGNV 453
Query: 358 QQQNVEVVYDLEKERIGFQPMDC 380
QQ++ EV ++ +GF+P C
Sbjct: 454 QQRSFEV--RIDGTSVGFKPSSC 474
>gi|449516339|ref|XP_004165204.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 456
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 104/399 (26%), Positives = 167/399 (41%), Gaps = 88/399 (22%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V +DTGS L WV C C++C + S F P +S S C I+
Sbjct: 119 VVVDTGSSLLWVQC----LPCINC----FQQSTSWFDPLKSVSFKTLGCGFPGYNYINGY 170
Query: 64 D-NPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIR 122
N F+ + Y G GIL +++L G I+
Sbjct: 171 KCNRFNQA--------------------EYKLRYLGGDSSQGILAKESLLFETLDEGKIK 210
Query: 123 EIPKFCFGCVG---STYREPIGIAGFGRGA---LSVPSQLGFLQKGFSHCFLAFKYANDP 176
+ FGC T + FG GA +++ +QLG FS+C N+P
Sbjct: 211 K-SNITFGCGHMNIKTNNDDAYNGVFGLGAYPHITMATQLG---NKFSYCI---GDINNP 263
Query: 177 NIS-SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFD 235
+ + LV+G + D+ TP+ ++ +YY+ L++I++G+ +L P + +
Sbjct: 264 LYTHNHLVLGQGSYIEGDS---TPL---QIHFGHYYVTLQSISVGSKTLKIDPNAFK-IS 316
Query: 236 SQGNGGLLVDSGTTYTHLP----EPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVP 291
S G+GG+L+DSG TYT L E Y +++ +++ + P ++ E LC++
Sbjct: 317 SDGSGGVLIDSGMTYTKLANGGFELLYDEIVDLMKGLLERIPTQRKFE-----GLCFK-- 369
Query: 292 CPNNTFTDDL--FPSITFHFLNNVSLVLPQGNHFYAMSA--------PSNSSAVKCLLFQ 341
+ DL FP++TFHF LVL G+ F PSNS +
Sbjct: 370 ---GVVSRDLVGFPAVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSELLNL---- 422
Query: 342 SMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
V G QQN V +DLE+ ++ F+ +DC
Sbjct: 423 ----------SVIGILAQQNYNVGFDLEQMKVFFRRIDC 451
>gi|357487593|ref|XP_003614084.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355515419|gb|AES97042.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 412
Score = 85.5 bits (210), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 110/387 (28%), Positives = 160/387 (41%), Gaps = 86/387 (22%)
Query: 3 QVY--MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNI 60
Q+Y +DTG+D W C C C N+ F PS+SS+ C S C N
Sbjct: 102 QLYSLIDTGNDNIWFQCK----PCKPCL----NQTSPMFHPSKSSTYKTIPCTSPICKN- 152
Query: 61 HSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
G G+ DTL ++ S+ G
Sbjct: 153 -------------------------------------ADGHYLGV---DTLTLN-SNNGT 171
Query: 121 IREIPKFCFGCVGSTYREPI-----GIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYAN 174
GC G + P+ G G RG LS SQL G FS+C + +
Sbjct: 172 PISFKNIVIGC-GHRNQGPLEGYVSGNIGLARGPLSFISQLNSSIGGKFSYCLVPL--FS 228
Query: 175 DPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREF 234
N+SS L GD + S TP+ + N Y++ LEA ++G+ + L
Sbjct: 229 KENVSSKLHFGDKSTVSGLGTVSTPIKEE----NGYFVSLEAFSVGDHII-----KLENS 279
Query: 235 DSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPN 294
D++GN ++DSGTT T LP+ YS+L S++ + R K+ ++ F+LCY+
Sbjct: 280 DNRGNS--IIDSGTTMTILPKDVYSRLESVVLDMVKL-KRVKDPSQQ--FNLCYQT-TST 333
Query: 295 NTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVF 354
T L IT HF + + L N FY ++ V C F S G++ +F
Sbjct: 334 TLLTKVLI--ITAHF-SGSEVHLNALNTFYPIT-----DEVICFAFVS--GGNFSSLAIF 383
Query: 355 GSFQQQNVEVVYDLEKERIGFQPMDCA 381
G+ QQN V +DL K+ I F+P DC
Sbjct: 384 GNVVQQNFLVGFDLNKKTISFKPTDCT 410
>gi|297597434|ref|NP_001043968.2| Os01g0696800 [Oryza sativa Japonica Group]
gi|255673588|dbj|BAF05882.2| Os01g0696800 [Oryza sativa Japonica Group]
Length = 334
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 83/303 (27%), Positives = 132/303 (43%), Gaps = 42/303 (13%)
Query: 91 SFAYTYGEG----GLVTGILTRDTLKVHGSSPGIIREIPKFCFGCVGST---YREPIGIA 143
S+ Y YG GIL +T + P FGC + + G+
Sbjct: 55 SYHYAYGNARDTHHYTEGILMTETFTFGDDA----AAFPGIAFGCTLRSEGGFGTGSGLV 110
Query: 144 GFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSPLVIG---DVAISSKDNLQFTPM 200
G GRG LS+ +QL G+ + ++D + SP+ G DV + D+ TP+
Sbjct: 111 GLGRGKLSLVTQLNVEAFGY-------RLSSDLSAPSPISFGSLADVTGGNGDSFMSTPL 163
Query: 201 LKSPMYPN--YYYIGLEAITIGNSSLTEVPLSLREFD-SQGNGGLLVDSGTTYTHLPEPF 257
L +P+ + +YY+GL I++G L ++P FD S G GG++ DSGTT T LP+P
Sbjct: 164 LTNPVVQDLPFYYVGLTGISVGGK-LVQIPSGTFSFDRSTGAGGVIFDSGTTLTMLPDPA 222
Query: 258 YSQLLSILQSTITYY--PRAKEVEERTGFDLCYRVPCPNNTFTDDLFPSITFHFLNNVSL 315
Y+ + L S + + P A ++ +C+ T FPS+ HF +
Sbjct: 223 YTLVRDELLSQMGFQKPPPAANDDDL----ICFTGGSSTTT-----FPSMVLHFDGGADM 273
Query: 316 VLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQNVEVVYDLE-KERIG 374
L N+ M + +A + +S + G+ Q + VV+DL R+
Sbjct: 274 DLSTENYLPQMQGQNGETARCWSVVKSSQ-----ALTIIGNIMQMDFHVVFDLSGNARML 328
Query: 375 FQP 377
FQP
Sbjct: 329 FQP 331
>gi|356546376|ref|XP_003541602.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 450
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 107/381 (28%), Positives = 162/381 (42%), Gaps = 53/381 (13%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGS +TW+ C C DC Y + F PS+S + C+S+ C ++ S+
Sbjct: 114 VDTGSGITWMQCQR----CEDC--YEQTTPI--FDPSKSKTYKTLPCSSNMCQSVIST-- 163
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
P GC + YG+G G L+ +TL + GS+ G + P
Sbjct: 164 PSCSSDKIGCK---------------YTIKYGDGSHSQGDLSVETLTL-GSTNGSSVQFP 207
Query: 126 KFCFGCVGS---TYREPIGIAGFGRGALSVPSQLGFLQKG--FSHCFLAFKYANDPNISS 180
GC + T++ G G FS+C LA ++ N SS
Sbjct: 208 NTVIGCGHNNKGTFQGEGSGVVGLGGGPVSLISQLSSSIGGKFSYC-LAPMFSQS-NSSS 265
Query: 181 PLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNG 240
L GD A+ S TP++ +YY+ LEA ++G+ + V S S G G
Sbjct: 266 KLNFGDAAVVSGLGAVSTPLVSKTGSEVFYYLTLEAFSVGDKRIEFVGGSSSSGSSNGEG 325
Query: 241 GLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGF-DLCYRVPCPNNTFTD 299
+++DSGTT T LP+ YS L S + I +A V + + F LCY+ P+
Sbjct: 326 NIIIDSGTTLTLLPQEDYSNLESAVADAI----QANRVSDPSNFLSLCYQT-TPSGQLD- 379
Query: 300 DLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQ 359
P IT HF + L + F + + V C F S + +FG+ Q
Sbjct: 380 --VPVITAHF-KGADVELNPISTFVQV-----AEGVVCFAFHSSE-----VVSIFGNLAQ 426
Query: 360 QNVEVVYDLEKERIGFQPMDC 380
N+ V YDL ++ + F+P DC
Sbjct: 427 LNLLVGYDLMEQTVSFKPTDC 447
>gi|115457778|ref|NP_001052489.1| Os04g0337000 [Oryza sativa Japonica Group]
gi|113564060|dbj|BAF14403.1| Os04g0337000, partial [Oryza sativa Japonica Group]
Length = 321
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 84/278 (30%), Positives = 128/278 (46%), Gaps = 60/278 (21%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDY-RNNKL---MSNFSPSRSSSSSRDTCASSFCLN 59
V +DTGSD+ WV +C+ CD R + L ++ + P SS+ S+ +C FC
Sbjct: 48 VQVDTGSDILWV-------NCISCDRCPRKSGLGLELTLYDPKDSSTGSKVSCDQGFCAA 100
Query: 60 IHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKV-HGSSP 118
+ P GC+ S PC ++ TYG+G TG D L+ S
Sbjct: 101 TYGGLLP-------GCTTSL--------PC-EYSVTYGDGSSTTGYFVSDLLQFDQVSGD 144
Query: 119 GIIREI-PKFCFGC-------VGSTYREPIGIAGFGRGALSVPSQL---GFLQKGFSHCF 167
G R FGC +GS+ + GI GFG+ S+ SQL G ++K F+HC
Sbjct: 145 GQTRPANSTVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCL 204
Query: 168 LAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKS-PMYPN--YYYIGLEAITIGNSSL 224
+ N IG+V P +K+ P+ PN +Y + L++I +G ++L
Sbjct: 205 ------DTINGGGIFAIGNVV---------QPKVKTTPLVPNMPHYNVNLKSIDVGGTAL 249
Query: 225 TEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLL 262
L FD+ G ++DSGTT T+LPE Y +++
Sbjct: 250 ---KLPSHMFDTGEKKGTIIDSGTTLTYLPEIVYKEIM 284
>gi|357130715|ref|XP_003566992.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 479
Score = 84.7 bits (208), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 107/397 (26%), Positives = 169/397 (42%), Gaps = 56/397 (14%)
Query: 7 DTGSDLTWVPCGNLSFDCMDCDDYRNNKL---MSNFSPSRSSSSSRDTCASSFCLNIHSS 63
DTGSDLTWV C + + F P +S + + CAS C S
Sbjct: 113 DTGSDLTWVKCRPAKAAAASTNSSSSASASSPRRAFRPEKSKTWAPIPCASDTC----SK 168
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRD--TLKVHGSSPGII 121
PF SLST T PC ++ Y Y +G G + + T+ + SS
Sbjct: 169 SLPF--------SLSTC--PTPGSPC-AYDYRYKDGSAARGTVGTESATIALSSSSSSSK 217
Query: 122 REIPK-----FCFGCVGS----TYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFK 171
++ K GC GS ++ G+ G +S S G FS+C +
Sbjct: 218 NKVKKAKLQGLVLGCTGSYTGPSFEASDGVLSLGYSNVSFASHAASRFGGRFSYCLV--D 275
Query: 172 YANDPNISSPLVIG-DVAIS------SKDNLQFTPM-LKSPMYPNYYYIGLEAITIGNSS 223
+ + N +S L G + A+S + + TP+ L S M P +Y + ++AI++ +
Sbjct: 276 HLSPRNATSYLTFGPNSALSGPCPAAAGPGARQTPLVLDSRMRP-FYDVSIKAISV-DGE 333
Query: 224 LTEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTG 283
L ++P + E D G GG++VDSGT+ T L +P Y +++ L + +PR
Sbjct: 334 LLKIPRDVWEVD--GGGGVIVDSGTSLTVLAKPAYRAVVAALGKKLARFPRV----AMDP 387
Query: 284 FDLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSM 343
F+ CY P+ D P + HF + L P + Y + A + VKC+ Q
Sbjct: 388 FEYCYNWTSPSRKDEGDDLPKLAVHFAGSARLEPPSKS--YVIDA---APGVKCIGVQ-- 440
Query: 344 DDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
+G + V G+ QQ +DL+ R+ F+ C
Sbjct: 441 -EGPWPGISVIGNILQQEHLWEFDLKNRRLRFKRSRC 476
>gi|225217039|gb|ACN85323.1| aspartic proteinase nepenthesin-1 precursor [Oryza brachyantha]
Length = 287
Score = 84.7 bits (208), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 89/299 (29%), Positives = 122/299 (40%), Gaps = 44/299 (14%)
Query: 92 FAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIPKFCFGCVGST---YREPIGIAGFGRG 148
+ YG+G G DTL + I F FGC + E G+ G GRG
Sbjct: 23 YGVQYGDGSYTIGFFAMDTLTLSSHD-----AIKGFRFGCGERNEGLFGEAAGLLGLGRG 77
Query: 149 ALSVPSQLGFLQKG-FSHCFLAFK----YANDPNISSPLVIGDVAISSKDNLQFTPMLKS 203
S+P Q G F+HCF A Y SSP A+S+K L TPML
Sbjct: 78 KTSLPVQTYDKYGGVFAHCFPARSSGTGYLEFGPGSSP------AVSAK--LSTTPMLID 129
Query: 204 PMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLS 263
P +YY+G+ I +G L P+ F + G +VDSGT T LP YS L S
Sbjct: 130 TG-PTFYYVGMTGIRVGGKLL---PIPQSVFAAAGT---IVDSGTVITRLPPAAYSSLRS 182
Query: 264 ILQSTITY--YPRAKEVEERTGFDLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGN 321
+++ Y RA + D CY + + P+++ F VSL +
Sbjct: 183 AFAASMAARGYKRAPALSL---LDTCYDLTGASEV----AIPTVSLLFQGGVSLDVDASG 235
Query: 322 HFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
YA S CL F + D + G+ Q + VVYD+ + +GF P C
Sbjct: 236 IIYAASVSQ-----ACLGFAGNEAAD--DVAIVGNTQLKTFGVVYDIASKVVGFCPGAC 287
>gi|147776733|emb|CAN74676.1| hypothetical protein VITISV_038368 [Vitis vinifera]
Length = 389
Score = 84.7 bits (208), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 81/291 (27%), Positives = 134/291 (46%), Gaps = 36/291 (12%)
Query: 88 PCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIPKFCFGCVGST---YREPIGIAG 144
P ++A YG+G G L + LK G I + F FGC + + G+ G
Sbjct: 74 PICNYAINYGDGSFTRGELGHEKLKF-----GTIL-VKDFIFGCGRNNKGLFGGVSGLMG 127
Query: 145 FGRGALSVPSQL-GFLQKGFSHCFLAFKYANDPNISSPLVIGDVAISSKDN--LQFTPML 201
GR LS+ SQ G FS+C + + S L++G + +++ + + M+
Sbjct: 128 LGRSDLSLISQTSGIFGGVFSYCLPS----TERKGSGSLILGGNSSVYRNSSPISYAKMI 183
Query: 202 KSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQL 261
++P N+Y+I L I+IG +L + P S G +LVDSGT T LP Y L
Sbjct: 184 ENPQLYNFYFINLTGISIGGVAL-QAP-------SVGPSRILVDSGTVITRLPPTIYKAL 235
Query: 262 LSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGN 321
+ T +P A + D C+ + + + + P+I HF N L +
Sbjct: 236 KAEFLKQFTGFPPAPAF---SILDTCFNL----SAYQEVDIPTIKMHFEGNAELTVDVTG 288
Query: 322 HFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQNVEVVYDLEKER 372
FY + S++S V CL S++ D + G++QQ+N+ V+YD ++ +
Sbjct: 289 VFYFVK--SDASQV-CLALASLEYQD--EVAILGNYQQKNLRVIYDTKETK 334
>gi|242092902|ref|XP_002436941.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
gi|241915164|gb|EER88308.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
Length = 445
Score = 84.7 bits (208), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 104/380 (27%), Positives = 150/380 (39%), Gaps = 64/380 (16%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V +DTGSDLTW+ C S C ++ F PS SS+ S CAS C + +
Sbjct: 127 VVIDTGSDLTWLQCKPCSSG--QCSPQKDPL----FDPSHSSTYSAVPCASGECKKLAAD 180
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
SGCS +PC FA +Y +G G+ +D L + +PG I
Sbjct: 181 ------AYGSGCSNG--------QPC-GFAISYVDGTSTVGVYGKDKLTL---APGAI-- 220
Query: 124 IPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLG---FLQKGFSHCFLAFKYANDPNISS 180
+ F FGC G + G+ G + LG GFS+C A S
Sbjct: 221 VKDFYFGC-GHSKSSLPGLFDGLLGLGRLSESLGAQYGGGGGFSYCLPAVN-------SK 272
Query: 181 PLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNG 240
P + A + FTPM + P P + + L IT+G L P S +G
Sbjct: 273 PGFLAFGAGRNPSGFVFTPMGRVPGQPTFSTVTLAGITVGGKKLDLRP-------SAFSG 325
Query: 241 GLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDD 300
G++VDSGT T L Y L + + + Y R + T +DL + +
Sbjct: 326 GMIVDSGTVVTVLQSTVYRALRAAFREAMKAY-RLVHGDLDTCYDL--------TGYKNV 376
Query: 301 LFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQ 360
+ P I F G + P+ CL F + G G +GV G+ Q+
Sbjct: 377 VVPKIALTF---------SGGATINLDVPNGILVNGCLAFA--ETGKDGTAGVLGNVNQR 425
Query: 361 NVEVVYDLEKERIGFQPMDC 380
EV++D + GF+ C
Sbjct: 426 TFEVLFDTSASKFGFRAKAC 445
>gi|51536458|gb|AAU05467.1| At5g22850 [Arabidopsis thaliana]
gi|55733777|gb|AAV59285.1| At5g22850 [Arabidopsis thaliana]
Length = 426
Score = 84.7 bits (208), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 91/332 (27%), Positives = 149/332 (44%), Gaps = 59/332 (17%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNF-SPSRSSSSSRDTCASSFC-LNIH 61
V +DTGSD+ WV C + C C ++ NF P S ++S +C+ C I
Sbjct: 96 VQVDTGSDVLWVSCAS----CNGCPQTSGLQIQLNFFDPGSSVTASPISCSDQRCSWGIQ 151
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLK---VHGSSP 118
SSD SGCS+ ++ C ++ + YG+G +G D L+ + GSS
Sbjct: 152 SSD--------SGCSV----QNNLC----AYTFQYGDGSGTSGFYVSDVLQFDMIVGSSL 195
Query: 119 GIIREIPKFCFGCVGS-------TYREPIGIAGFGRGALSVPSQL---GFLQKGFSHCFL 168
+ FGC S + R GI GFG+ +SV SQL G + FSHC
Sbjct: 196 -VPNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCL- 253
Query: 169 AFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVP 228
+ LV+G++ + N+ FTP++ P P +Y + L +I++ +L P
Sbjct: 254 ----KGENGGGGILVLGEIV---EPNMVFTPLV--PSQP-HYNVNLLSISVNGQAL---P 300
Query: 229 LSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCY 288
++ F + G ++D+GTT +L E Y + + + ++ R + + CY
Sbjct: 301 INPSVFSTSNGQGTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPVVSKG----NQCY 356
Query: 289 RVPCPNNTFTDDLFPSITFHFLNNVSLVL-PQ 319
+ T D+FP ++ +F S+ L PQ
Sbjct: 357 VI----TTSVGDIFPPVSLNFAGGASMFLNPQ 384
>gi|356528623|ref|XP_003532899.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 507
Score = 84.7 bits (208), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 99/364 (27%), Positives = 146/364 (40%), Gaps = 54/364 (14%)
Query: 31 RNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDNPFDPCTMSGCSLSTLLKSTCCRPCP 90
++N F P RS S TCAS C LS L + C P P
Sbjct: 183 KSNPCKGVFCPHRSKSFQAVTCASQKC----------------KIDLSQLFSLSLC-PKP 225
Query: 91 S----FAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIPKFCFGCV-----GSTYREPIG 141
S + +Y +G G DT+ V + G ++ GC G + E G
Sbjct: 226 SDPCLYDISYADGSSAKGFFGTDTITVDLKN-GKEGKLNNLTIGCTKSMENGVNFNEDTG 284
Query: 142 -IAGFGRGALSVPSQLGF-LQKGFSHCFLAFKYANDPNISSPLVIGDVAISSKDNLQFTP 199
I G G S + + FS+C + + + N+SS L IG N +
Sbjct: 285 GILGLGFAKDSFIDKAAYEYGAKFSYCLV--DHLSHRNVSSYLTIG-----GHHNAKLLG 337
Query: 200 MLKSP---MYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGGLLVDSGTTYTHLPEP 256
+K ++P +Y + + I+IG L ++P + +F+SQG G L+DSGTT T L P
Sbjct: 338 EIKRTELILFPPFYGVNVVGISIGGQML-KIPPQVWDFNSQG--GTLIDSGTTLTALLVP 394
Query: 257 FYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDLFPSITFHFLNNVSLV 316
Y + L ++T R E+ D C+ F D + P + FHF
Sbjct: 395 AYEPVFEALIKSLTKVKRVTG-EDFGALDFCFDA----EGFDDSVVPRLVFHFAGGARFE 449
Query: 317 LPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQ 376
P ++ + AP VKC+ +D G + V G+ QQN +DL IGF
Sbjct: 450 PPVKSYIIDV-AP----LVKCIGIVPID--GIGGASVIGNIMQQNHLWEFDLSTNTIGFA 502
Query: 377 PMDC 380
P C
Sbjct: 503 PSIC 506
>gi|293332561|ref|NP_001170100.1| uncharacterized protein LOC100384018 precursor [Zea mays]
gi|224033441|gb|ACN35796.1| unknown [Zea mays]
gi|413944035|gb|AFW76684.1| hypothetical protein ZEAMMB73_746438 [Zea mays]
Length = 456
Score = 84.7 bits (208), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 80/316 (25%), Positives = 133/316 (42%), Gaps = 55/316 (17%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + +DTGSDL W C C DC D + + P+ SS+ + C + C +
Sbjct: 99 VALTLDTGSDLVWTQCA----PCRDCFD----QGIPLLDPAASSTYAALPCGAPRCRAL- 149
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKV----HGSS 117
PF C C + Y YG+ + G + D +
Sbjct: 150 ----PFTSCGGRSCV---------------YVYHYGDKSVTVGKIATDRFTFGDNGRRNG 190
Query: 118 PGIIREIPKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYA 173
G + + FGC G GIAGFGRG S+PSQL FS+CF + +
Sbjct: 191 DGSLPATRRLTFGCGHFNKGVFQSNETGIAGFGRGRWSLPSQLN--ATSFSYCFTSMFDS 248
Query: 174 NDPNIS---SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLS 230
++ +P + A S + ++ TP+ K+P P+ Y++ L+ I++G T +P+
Sbjct: 249 KSSIVTLGGAPAALYSHAHSGE--VRTTPLFKNPSQPSLYFLSLKGISVGK---TRLPVP 303
Query: 231 LREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRV 290
+F S ++DSG + T LPE Y + + + + P E + D+C+ +
Sbjct: 304 ETKFRST-----IIDSGASITTLPEEVYEAVKAEFAAQVGLPPSG---VEGSALDVCFAL 355
Query: 291 PCPNNTFTDDLFPSIT 306
P + + PS+T
Sbjct: 356 PV-SALWRRPAVPSLT 370
>gi|21717162|gb|AAM76355.1|AC074196_13 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433286|gb|AAP54824.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 397
Score = 84.7 bits (208), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 91/388 (23%), Positives = 161/388 (41%), Gaps = 72/388 (18%)
Query: 5 YMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSD 64
++D +L W C C+ C ++ + + F P+ SS+ + C + C +I +
Sbjct: 70 FIDLTGELVWTQCSQ----CIHC--FKQD--LPVFVPNASSTFKPEPCGTDVCKSIPTPK 121
Query: 65 NPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREI 124
D C G + G GG GI+ DT + ++P
Sbjct: 122 CASDVCAYDGVT--------------------GLGGHTVGIVATDTFAIGTAAPA----- 156
Query: 125 PKFCFGCVGS----TYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISS 180
FGCV + T P G G GR S+ +Q+ + FS+C +D +S
Sbjct: 157 -SLGFGCVVASDIDTMGGPSGFIGLGRTPWSLVAQMKLTR--FSYCL----APHDTGKNS 209
Query: 181 PLVIGDVAISSKDNLQFTPMLKSPMYPN-----YYYIGLEAITIGNSSLTEVPLSLREFD 235
L +G A + +TP +K+ PN YY I LE I G++++T
Sbjct: 210 RLFLGASAKLAGGG-AWTPFVKT--SPNDGMSQYYPIELEEIKAGDATITM--------- 257
Query: 236 SQGNGGLLVDSGTTYTHL-PEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPN 294
+G +LV + L + Y + + +++ P A V F++C+ P
Sbjct: 258 PRGRNTVLVQTAVVRVSLLVDSVYQEFKKAVMASVGAAPTATPVGAP--FEVCF----PK 311
Query: 295 NTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVF 354
+ P + F F +L +P N+ + + + +V + ++ D +
Sbjct: 312 AGVSGA--PDLVFTFQAGAALTVPPANYLFDVGNDTVCLSVMSIALLNITALDG--LNIL 367
Query: 355 GSFQQQNVEVVYDLEKERIGFQPMDCAS 382
GSFQQ+NV +++DL+K+ + F+P DC+S
Sbjct: 368 GSFQQENVHLLFDLDKDMLSFEPADCSS 395
>gi|224144963|ref|XP_002325476.1| predicted protein [Populus trichocarpa]
gi|222862351|gb|EEE99857.1| predicted protein [Populus trichocarpa]
Length = 372
Score = 84.3 bits (207), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 94/387 (24%), Positives = 165/387 (42%), Gaps = 73/387 (18%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKL-MSNFSPSRSSSSSRDTCASSFCLNIHS 62
V +DTGSD+ WV C C C + + ++ + P+ S S++R +C FC + ++
Sbjct: 42 VQVDTGSDILWVNC----IGCDKCPTKSDLGIKLTLYDPASSVSATRVSCDDDFCTSTYN 97
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCP-SFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
L C + P + YG+G G D ++ + +
Sbjct: 98 G-----------------LLPDCKKELPCQYNVVYGDGSSTAGYFVSDAVQFERVTGNLQ 140
Query: 122 REIPK--FCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNIS 179
+ FGC G+ +G +G L + F+HC ++ N
Sbjct: 141 TGLSNGTVTFGC-GAQQSGGLGTSG---------EALDGILGAFAHCL------DNVNGG 184
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPMYPN--YYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
IG++ +S K N +PM PN +Y + ++ I +G + L E+P + FDS
Sbjct: 185 GIFAIGEL-VSPKVN-------TTPMVPNQAHYNVYMKEIEVGGTVL-ELPTDV--FDSG 233
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYP--RAKEVEERTGFDLCYRVPCPNN 295
G ++DSGTT +LPE Y +++ ++S P VEE+ +C++
Sbjct: 234 DRRGTIIDSGTTLAYLPEVVYDSMMNEIRSQ---QPGLSLHTVEEQF---ICFKYSGN-- 285
Query: 296 TFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQS--MDDGDYGPSGV 353
DD FP I FHF ++++L + ++ + +S + C +Q+ M D +
Sbjct: 286 --VDDGFPDIKFHFKDSLTLTVYPHDYLFQISED-----IWCFGWQNGGMQSKDGRDMTL 338
Query: 354 FGSFQQQNVEVVYDLEKERIGFQPMDC 380
G N V+YD+E + IG+ +C
Sbjct: 339 LGDLVLSNKLVLYDIENQAIGWTEYNC 365
>gi|45735840|dbj|BAD12875.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735966|dbj|BAD12995.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|125583491|gb|EAZ24422.1| hypothetical protein OsJ_08175 [Oryza sativa Japonica Group]
Length = 475
Score = 84.3 bits (207), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 104/379 (27%), Positives = 150/379 (39%), Gaps = 66/379 (17%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGSD++WV C C Y + F P+RSSS S CA++ C + N
Sbjct: 159 VDTGSDVSWVQCK----PCPSPPCYSQRDPL--FDPTRSSSYSAVPCAAASCSQLALYSN 212
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
GCS C + +YG+G TG+ + DTL + GS+ +
Sbjct: 213 --------GCS-----GGQC-----GYVVSYGDGSTTTGVYSSDTLTLTGSN-----ALK 249
Query: 126 KFCFGCVGST---YREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSPL 182
F FGC + + G+ G GR S+ SQ G F Y P +S
Sbjct: 250 GFLFGCGHAQQGLFAGVDGLLGLGRQGQSLVSQASSTYGGV------FSYCLPPTQNSVG 303
Query: 183 VIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGGL 242
I SS TP+L + P YY + L I++G L+ + F S G
Sbjct: 304 YISLGGPSSTAGFSTTPLLTASNDPTYYIVMLAGISVGGQPLS---IDASVFAS----GA 356
Query: 243 LVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTG-FDLCYRVPCPNNTFTDDL 301
+VD+GT T LP YS L S ++ + P TG D CY +
Sbjct: 357 VVDTGTVVTRLPPTAYSALRSAFRAAMA--PYGYPSAPATGILDTCYDF----TRYGTVT 410
Query: 302 FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQN 361
P+I+ F ++ L + CL F + GD S + G+ QQ++
Sbjct: 411 LPTISIAFGGGAAMDLGTSGILTS----------GCLAF-APTGGDSQAS-ILGNVQQRS 458
Query: 362 VEVVYDLEKERIGFQPMDC 380
EV +D +GF P C
Sbjct: 459 FEVRFD--GSTVGFMPASC 475
>gi|357118738|ref|XP_003561107.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 491
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 101/384 (26%), Positives = 147/384 (38%), Gaps = 57/384 (14%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DT SD+ WV C C + ++ + PS+SSSS+ C+S C N+
Sbjct: 158 MVIDTASDVPWVQCA----PCPAPHCHAQTDVL--YDPSKSSSSAAFPCSSPACRNLGPY 211
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
N GC T C + Y +G G D L ++ + P
Sbjct: 212 AN--------GC---TPAGDQC-----QYRVQYPDGSASAGTYISDVLTLNPAKPA--SA 253
Query: 124 IPKFCFGCV------GSTYREPIGIAGFGRGALSVPSQL-GFLQKGFSHCFLAFKYANDP 176
I +F FGC GS + GI GRGA S+P+Q FS+C
Sbjct: 254 ISEFRFGCSHALLQPGSFSNKTSGIMALGRGAQSLPTQTKATYGDVFSYCLPPTP----- 308
Query: 177 NISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
+ S I V + TPML+S P Y + L AI + L P
Sbjct: 309 -VHSGFFILGVPRVAASRYAVTPMLRSKAAPMLYLVRLIAIEVAGKRLPVPPAVF----- 362
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNT 296
G ++DS T T LP Y L + + + Y RA +E D CY
Sbjct: 363 --AAGAVMDSRTIVTRLPPTAYMALRAAFVAEMRAY-RAAAPKEH--LDTCY-------D 410
Query: 297 FTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGS 356
F+ L ++LV N + PS CL F D +G+ G+
Sbjct: 411 FSGAAPGGGGGVKLPKITLVFDGPNGAVELD-PSGVLLDGCLAFAPNTDDQM--TGIIGN 467
Query: 357 FQQQNVEVVYDLEKERIGFQPMDC 380
QQQ +EV+Y+++ +GF+ C
Sbjct: 468 VQQQALEVLYNVDGATVGFRRGAC 491
>gi|56784900|dbj|BAD82194.1| aspartic proteinase nepenthesin I-like [Oryza sativa Japonica
Group]
Length = 260
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 75/265 (28%), Positives = 121/265 (45%), Gaps = 34/265 (12%)
Query: 125 PKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSP 181
P FGC + + G+ G GRG LS+ +QL G+ + ++D + SP
Sbjct: 15 PGIAFGCTLRSEGGFGTGSGLVGLGRGKLSLVTQLNVEAFGY-------RLSSDLSAPSP 67
Query: 182 LVIG---DVAISSKDNLQFTPMLKSPMYPN--YYYIGLEAITIGNSSLTEVPLSLREFD- 235
+ G DV + D+ TP+L +P+ + +YY+GL I++G L ++P FD
Sbjct: 68 ISFGSLADVTGGNGDSFMSTPLLTNPVVQDLPFYYVGLTGISVGGK-LVQIPSGTFSFDR 126
Query: 236 SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYY--PRAKEVEERTGFDLCYRVPCP 293
S G GG++ DSGTT T LP+P Y+ + L S + + P A ++ +C+
Sbjct: 127 STGAGGVIFDSGTTLTMLPDPAYTLVRDELLSQMGFQKPPPAANDDDL----ICFTGGSS 182
Query: 294 NNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGV 353
T FPS+ HF + L N+ M + +A + +S +
Sbjct: 183 TTT-----FPSMVLHFDGGADMDLSTENYLPQMQGQNGETARCWSVVKSSQ-----ALTI 232
Query: 354 FGSFQQQNVEVVYDLE-KERIGFQP 377
G+ Q + VV+DL R+ FQP
Sbjct: 233 IGNIMQMDFHVVFDLSGNARMLFQP 257
>gi|125540927|gb|EAY87322.1| hypothetical protein OsI_08726 [Oryza sativa Indica Group]
Length = 464
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 105/379 (27%), Positives = 150/379 (39%), Gaps = 66/379 (17%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGSD++WV C C Y + F P+RSSS S CA++ C + N
Sbjct: 148 VDTGSDVSWVQCK----PCPSPPCYSQRDPL--FDPTRSSSYSAVPCAAASCSQLALYSN 201
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
GCS C + +YG+G TG+ + DTL + GS+ +
Sbjct: 202 --------GCS-----GGQC-----GYVVSYGDGSTTTGVYSSDTLTLTGSN-----ALK 238
Query: 126 KFCFGCVGST---YREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSPL 182
F FGC + + G+ G GR S+ SQ G F Y P +S
Sbjct: 239 GFLFGCGHAQQGLFAGVDGLLGLGRQGQSLVSQASSTYGGV------FSYCLPPTQNSVG 292
Query: 183 VIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGGL 242
I SS TP+L + P YY + L I++G L+ + F S G
Sbjct: 293 YISLGGPSSTAGFSTTPLLTASNDPTYYIVMLAGISVGGQPLS---IDASVFAS----GA 345
Query: 243 LVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTG-FDLCYRVPCPNNTFTDDL 301
+VD+GT T LP YS L S ++ + P TG D CY +
Sbjct: 346 VVDTGTVVTRLPPTAYSALRSAFRAAMA--PYGYPSAPATGILDTCYDF----TRYGTVT 399
Query: 302 FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQN 361
P+I+ F ++ L S CL F + GD S + G+ QQ++
Sbjct: 400 LPTISIAFGGGAAMDL----------GTSGILTSGCLAF-APTGGDSQAS-ILGNVQQRS 447
Query: 362 VEVVYDLEKERIGFQPMDC 380
EV +D +GF P C
Sbjct: 448 FEVRFD--GSTVGFMPASC 464
>gi|302769978|ref|XP_002968408.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
gi|300164052|gb|EFJ30662.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
Length = 492
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 96/383 (25%), Positives = 153/383 (39%), Gaps = 62/383 (16%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DTGS +T+VPC + C C ++++ + FSP+ SSS C S
Sbjct: 50 LIVDTGSTVTYVPCSS----CTHCGNHQDPR----FSPALSSSYKPLECGSE-------- 93
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
CS + C + Y E +G+L +D + SS
Sbjct: 94 -----------CS------TGFCDGSRKYQRQYAEKSTSSGVLGKDVIGFSNSSD---LG 133
Query: 124 IPKFCFGC----VGSTYREPI-GIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNI 178
+ FGC G Y + GI G GRG LS+ QL ++K + Y
Sbjct: 134 GQRLVFGCETAETGDLYDQTADGIIGLGRGPLSIIDQL--VEKNAMEDVFSLCYGGMDEG 191
Query: 179 SSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQG 238
+++G ++ FT P YY + L+ I +G S PL L+ G
Sbjct: 192 GGAMILG--GFQPPKDMVFTA--SDPHRSPYYNLMLKGIRVGGS-----PLRLKPEVFDG 242
Query: 239 NGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFT 298
G ++DSGTTY + P + S ++ + +E+ D+CY N +
Sbjct: 243 KYGTVLDSGTTYAYFPGAAFQAFKSAVKEQVGSLKEVPGPDEKFK-DICYAGAGTNVSNL 301
Query: 299 DDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCL-LFQSMDDGDYGPSGVFGSF 357
FPS+ F F + S+ L N+ + + S CL +F++ D P+ + G
Sbjct: 302 SQFFPSVDFVFGDGQSVTLSPENYLFRH---TKISGAYCLGVFENGD-----PTTLLGGI 353
Query: 358 QQQNVEVVYDLEKERIGFQPMDC 380
+N+ V Y+ K IGF C
Sbjct: 354 IVRNMLVTYNRGKASIGFLKTKC 376
>gi|296090179|emb|CBI39998.3| unnamed protein product [Vitis vinifera]
Length = 334
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 77/270 (28%), Positives = 131/270 (48%), Gaps = 43/270 (15%)
Query: 124 IPKFCFGC----VGSTYREPIGIAGFGRGALSVPSQ----LGFLQKGFSHCFLAFKYAND 175
I FGC G+ +G+ G G LS+ SQ LG +K FS C + F+ D
Sbjct: 93 ILNIVFGCGHNNSGTFNENEMGLFGTGGRPLSLTSQIMSTLGSGRK-FSQCLVPFR--TD 149
Query: 176 PNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSL---TEVPLSLR 232
P+I+S ++ G A S ++ TP++ P YY++ L+ I++G+ + P++ +
Sbjct: 150 PSITSKIIFGPEAEVSGSDVVSTPLVTKD-DPTYYFVTLDGISVGDKLFPFSSSSPMATK 208
Query: 233 EFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYP-RAKEVEERTGFDLCYRVP 291
G + +D+GT T LP FY++L+ ++ I P + +++ + LCYR
Sbjct: 209 -------GNVFIDAGTPPTLLPRDFYNRLVQGVKEAIPMEPVQDPDLQPQ----LCYR-- 255
Query: 292 CPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPS 351
+ T D P +T HF + + L N F S V C Q +D G +
Sbjct: 256 --SATLIDG--PILTAHF-DGADVQLKPLNTFI-----SPKEGVYCFAMQPID----GDT 301
Query: 352 GVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
G+FG+F Q N + +DL+ +++ F+ +DC
Sbjct: 302 GIFGNFVQMNFLIGFDLDGKKVSFKAVDCT 331
>gi|54290717|dbj|BAD62387.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|215734915|dbj|BAG95637.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 469
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 95/383 (24%), Positives = 151/383 (39%), Gaps = 68/383 (17%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSD- 64
+DTGS LTW+ C S C + + F P S + + C+SS C + ++
Sbjct: 148 VDTGSSLTWLQCSPCSVSC-------HRQAGPVFDPRASGTYAAVQCSSSECGELQAATL 200
Query: 65 NPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREI 124
NP S CS+S + + +YG+ G L++DT+ S
Sbjct: 201 NP------SACSVSNVCI---------YQASYGDSSYSVGYLSKDTVSFGSGS------F 239
Query: 125 PKFCFGCVGST---YREPIGIAGFGRGALSVPSQLG-FLQKGFSHCFLAFKYANDPNISS 180
P F +GC + G+ G + LS+ QL L FS+C +S
Sbjct: 240 PGFYYGCGQDNEGLFGRSAGLIGLAKNKLSLLYQLAPSLGYAFSYCL----------PTS 289
Query: 181 PLVIGDVAISSKDNLQF--TPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQG 238
G ++I S + Q+ TPM S + + Y++ L I++ + L P R +
Sbjct: 290 SAAAGYLSIGSYNPGQYSYTPMASSSLDASLYFVTLSGISVAGAPLAVPPSEYRSLPT-- 347
Query: 239 NGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFT 298
++DSGT T LP Y+ L + + + A + D C+R +
Sbjct: 348 ----IIDSGTVITRLPPNVYTALSRAVAAAMAS--AAPRAPTYSILDTCFR-----GSAA 396
Query: 299 DDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQ 358
P + F +L L GN + + CL F G + + G+ Q
Sbjct: 397 GLRVPRVDMAFAGGATLALSPGNVLIDVD-----DSTTCLAFAPT-----GGTAIIGNTQ 446
Query: 359 QQNVEVVYDLEKERIGFQPMDCA 381
QQ VVYD+ + RIGF C+
Sbjct: 447 QQTFSVVYDVAQSRIGFAAGGCS 469
>gi|225427556|ref|XP_002266575.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 445
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 103/388 (26%), Positives = 160/388 (41%), Gaps = 71/388 (18%)
Query: 7 DTGSDLTWVPCGNLSFDCMDCDD-YRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
DTGSDL W C+ CDD Y+ + + F P +S + C + FC ++ +
Sbjct: 112 DTGSDLIWR-------QCLPCDDCYKQVEPL--FDPKKSKTYKTLGCNNDFCQDLGQQGS 162
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
D T C S +Y+YG+ L+ +T + GS+ G P
Sbjct: 163 CGDDNT-----------------CTS-SYSYGDQSYTRRDLSSETFTI-GSTEGDPASFP 203
Query: 126 KFCFGC---VGSTYREP-----IGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPN 177
FGC G T+ E G + + S++G FS+C + ++D
Sbjct: 204 GLAFGCGHSNGGTFNEKDSGLIGLGGGPLSLVMQLSSKVG---GQFSYCLVPL--SSDST 258
Query: 178 ISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFD-- 235
SS + G A+ S TP++K +YY+ LE +++G+ + S +
Sbjct: 259 ASSKINFGKSAVVSGSGTVSTPLIKG-TPDTFYYLTLEGMSLGSEKVAFKGFSKNKSSPA 317
Query: 236 SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNN 295
+ +++DSGTT T LP FY+ + S L I + R F LCY +
Sbjct: 318 AAEESNIIIDSGTTLTLLPRDFYTDMESALTKVIG---GQTTTDPRGTFSLCY------S 368
Query: 296 TFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPS---G 352
P+IT HF+ + LP N F A + L+ SM PS
Sbjct: 369 GVKKLEIPTITAHFIG-ADVQLPPLNTFV--------QAQEDLVCFSMI-----PSSNLA 414
Query: 353 VFGSFQQQNVEVVYDLEKERIGFQPMDC 380
+FG+ Q N V YDL+ ++ F+P DC
Sbjct: 415 IFGNLSQMNFLVGYDLKNNKVSFKPTDC 442
>gi|242059211|ref|XP_002458751.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
gi|241930726|gb|EES03871.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
Length = 444
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 107/408 (26%), Positives = 166/408 (40%), Gaps = 61/408 (14%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + +DTGS+L+W+ C + +F P S++ + C S+ C
Sbjct: 76 VTMVLDTGSELSWLLCATGRQGSAA--AGAAAAMGESFRPRASATFAAVPCGSTQC---S 130
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
S D P P + G S R C + +Y +G G L D V G +P +
Sbjct: 131 SRDLPAPP-SCDGAS----------RQC-HVSLSYADGSASDGALATDVFAV-GEAPPL- 176
Query: 122 REIPKFCFGCVGSTY-REPIGIA-----GFGRGALSVPSQLGFLQKGFSHCFLAFKYAND 175
+ FGC+ + Y P G+A G RG LS +Q + FS+C +D
Sbjct: 177 ----RSAFGCMSTAYDSSPDGVATAGLLGMNRGTLSFVTQAS--TRRFSYCI------SD 224
Query: 176 PNISSPLVIGDVAISSKDNLQFTPMLKSPMY------PNY----YYIGLEAITIGNSSLT 225
+ + L++G +L F P+ +P+Y P + Y + L I +G +L
Sbjct: 225 RDDAGVLLLG------HSDLPFLPLNYTPLYQPTLPLPYFDRVAYSVQLLGIRVGGKAL- 277
Query: 226 EVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEE---RT 282
+P S+ D G G +VDSGT +T L YS L + RA + +
Sbjct: 278 PIPASVLAPDHTGAGQTMVDSGTQFTFLLGDAYSALKAEFLKQTKPLLRALDDPSFAFQE 337
Query: 283 GFDLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSS-AVKCLLFQ 341
D C+RVP + L P +T F N + + Y + + V CL F
Sbjct: 338 ALDTCFRVPAGRPPPSARL-PPVTLLF-NGAEMSVAGDRLLYKVPGEHRGADGVWCLTFG 395
Query: 342 SMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASAQGL 389
+ D + V G Q N+ V YDLE+ R+G P+ C + GL
Sbjct: 396 NADMVPLT-AYVIGHHHQMNLWVEYDLERGRVGLAPVKCDVASERLGL 442
>gi|326495920|dbj|BAJ90582.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 105/383 (27%), Positives = 159/383 (41%), Gaps = 70/383 (18%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V +DTGSD++WV C S C+ R+ F P++SS+ S C + C +
Sbjct: 158 VEVDTGSDVSWVQCKPCSAPA--CNSQRDQL----FDPAKSSTYSAVPCGADACSELRIY 211
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
+ +GCS S C + +YG+G TG+ DTL + +PG
Sbjct: 212 E--------AGCS-----GSQC-----GYVVSYGDGSNTTGVYGSDTLAL---APG--NT 248
Query: 124 IPKFCFGCVGSTYREPIGIAG---FGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNIS 179
+ F FGC + GI G GR ++S+ SQ G FS+C + + A +
Sbjct: 249 VGTFLFGCGHAQAGMFAGIDGLLALGRQSMSLKSQAAGAYGGVFSYCLPSKQSA-----A 303
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
L +G SS T +L + P +Y + L I++G + VP S
Sbjct: 304 GYLTLG--GPSSASGFATTGLLTAWAAPTFYMVMLTGISVGGQQV-AVPASAFA------ 354
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTIT--YYPRAKEVEERTGFDLCYRVPCPNNTF 297
GG +VD+GT T LP Y+ L S + I YP A D CY + +
Sbjct: 355 GGTVVDTGTVITRLPPTAYAALRSAFRGAIAPCGYPSAP---ANGILDTCYDF----SRY 407
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSF 357
P++ F +L A+ AP S+ CL F +G G + + G+
Sbjct: 408 GVVTLPTVALTFSGGATL---------ALEAPGILSS-GCLAF--APNGGDGDAAILGNV 455
Query: 358 QQQNVEVVYDLEKERIGFQPMDC 380
QQ++ V +D +GF P C
Sbjct: 456 QQRSFAVRFD--GSTVGFMPGAC 476
>gi|125579874|gb|EAZ21020.1| hypothetical protein OsJ_36669 [Oryza sativa Japonica Group]
Length = 382
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 67/250 (26%), Positives = 111/250 (44%), Gaps = 25/250 (10%)
Query: 139 PIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSPLVIGDVAISSKDNLQFT 198
P G+ G GRG LS+ SQ G + FS+C + + N+ V ++ ++ T
Sbjct: 151 PSGLMGLGRGRLSLVSQTGATK--FSYCLTPY-FHNNGATGHLFVGASASLGGHGDVMTT 207
Query: 199 PMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQG------NGGLLVDSGTTYTH 252
+K P +YY+ L +T+G T +P+ FD + +GG+++DSG+ +T
Sbjct: 208 QFVKGPKGSPFYYLPLIGLTVGE---TRLPIPATVFDLREVAPGLFSGGVIIDSGSPFTS 264
Query: 253 LPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDLFPSITFHFLNN 312
L Y L S L + + A + G C + P++ FHF
Sbjct: 265 LVHDAYDALASELAARLNGSLVAPPPDADDG------ALCVARRDVGRVVPAVVFHFRGG 318
Query: 313 VSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQNVEVVYDLEKER 372
+ +P +++ AP + +A G Y V G++QQQN+ V+YDL
Sbjct: 319 ADMAVPAESYW----APVDKAAACM---AIASAGPYRRQSVIGNYQQQNMRVLYDLANGD 371
Query: 373 IGFQPMDCAS 382
FQP DC++
Sbjct: 372 FSFQPADCSA 381
>gi|255585473|ref|XP_002533429.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223526717|gb|EEF28949.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 448
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 107/394 (27%), Positives = 160/394 (40%), Gaps = 85/394 (21%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSD- 64
MDTGS++ WV C C C +N L+ PS+SS+ + C ++ C S+
Sbjct: 116 MDTGSNILWVRCA----PCKRCTQ-QNGPLLD---PSKSSTYASLPCTNTMCHYAPSAYC 167
Query: 65 NPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREI 124
N + C G +LS Y G G+L + L H S G+ +
Sbjct: 168 NRLNQC---GYNLS-----------------YATGLSSAGVLATEQLIFHSSDEGV-NAV 206
Query: 125 PKFCFGCVGSTY----REPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISS 180
P FGC R G+ G G+G S +++G FS+C NI+
Sbjct: 207 PSVVFGCSHENGDYKDRRFTGVFGLGKGITSFVTRMG---SKFSYCL--------GNIAD 255
Query: 181 P------LVIGDVAISSKDNLQ-FTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLRE 233
P LV G+ K N + ++ LK + +YY+ LE I++G L +
Sbjct: 256 PHYGYNQLVFGE-----KANFEGYSTPLK--VVNGHYYVTLEGISVGEKRL---DIDSTA 305
Query: 234 FDSQGN-GGLLVDSGTTYTHLPEPFY----SQLLSILQSTITYYPRAKEVEERTGFDLCY 288
F +GN L+DSGT T L E + +++ +L + + R G CY
Sbjct: 306 FSMKGNEKSALIDSGTALTWLAESAFRALDNEVRQLLDGVLMPFWR--------GSFACY 357
Query: 289 RVPCPNNTFTDDL--FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDG 346
+ T + DL FP +TFHF L L + FY + AV+ S
Sbjct: 358 K-----GTVSQDLIGFPVVTFHFSGGADLDLDTESMFYQATPDILCIAVRQ---ASAYGN 409
Query: 347 DYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
D+ V G QQ + YDL ++ FQ +DC
Sbjct: 410 DFKSFSVIGLMAQQYYNMAYDLNSNKLFFQRIDC 443
>gi|449464952|ref|XP_004150193.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
gi|449526850|ref|XP_004170426.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 476
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 99/378 (26%), Positives = 159/378 (42%), Gaps = 54/378 (14%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V +D+GSD+ WV C C +C Y+ + + F P+ S++ + +C SS C
Sbjct: 152 VVIDSGSDIVWVQCQ----PCSEC--YQQSDPV--FDPAGSATYAGISCDSSVC------ 197
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
D +GC+ CR + +YG+G G L +TL +IR
Sbjct: 198 ----DRLDNAGCNDGR------CR----YEVSYGDGSYTRGTLALETLTFGRV---LIRN 240
Query: 124 IPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNISSPL 182
I C + G+ G G GA+S QLG G FS+C ++ + L
Sbjct: 241 IAIGCGHMNRGMFIGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVS----RGTESTGTL 296
Query: 183 VIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGGL 242
G A+ + P++++P P++YY+GL + +G + +P + E G GG+
Sbjct: 297 EFGRGAMPV--GAAWVPLIRNPRAPSFYYVGLSGLGVGGIRV-PIPEQIFELTDLGYGGV 353
Query: 243 LVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDLF 302
++D+GT T LP P Y PR+ V + FD CY + N F
Sbjct: 354 VMDTGTAVTRLPAPAYEAFRDTFIGQTANLPRSDRV---SIFDTCYNL----NGFVSVRV 406
Query: 303 PSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQNV 362
P+++F+F L LP N P + C F + G + G+ QQ+ +
Sbjct: 407 PTVSFYFSGGPILTLPARNFLI----PVDGEGTFCFAFAASASG----LSIIGNIQQEGI 458
Query: 363 EVVYDLEKERIGFQPMDC 380
++ D +GF P C
Sbjct: 459 QISIDGSNGFVGFGPTIC 476
>gi|125536419|gb|EAY82907.1| hypothetical protein OsI_38120 [Oryza sativa Indica Group]
Length = 448
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 93/353 (26%), Positives = 144/353 (40%), Gaps = 58/353 (16%)
Query: 1 VIQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNI 60
+I +DTGSDL WV C C C N + P+RS SS + C+S C +
Sbjct: 99 LIWAEVDTGSDLMWVKCS----PCNGC----NPPPSPLYDPARSRSSGKLPCSSQLCQAL 150
Query: 61 HSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGG--LVTGILTRDTLKVHGSSP 118
D C+ P + Y YG G G+L +T
Sbjct: 151 GRGRIISDQCSDD-------------PPLCGYHYAYGHSGDHSTQGVLGTETFTF---GD 194
Query: 119 GIIREIPKFCFG----CVGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYAN 174
G + FG GS + G+ G GRG LS+ SQLG + F++C A
Sbjct: 195 GYVAN--NVSFGRSDTIDGSQFGGTAGLVGLGRGHLSLVSQLGAGR--FAYCLAA----- 245
Query: 175 DPNISSPLVIGDVAI--SSKDNLQFTPMLKSPM--YPNYYYIGLEAITIGNSSLTEVPLS 230
DPN+ S ++ G +A +S ++ TP++ +P +YY+ L+ I++G S L P+
Sbjct: 246 DPNVYSTILFGSLAALDTSAGDVSSTPLVTNPKPDRDTHYYVNLQGISVGGSRL---PIK 302
Query: 231 LREF--DSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCY 288
F +S G+GG+ DSG T L + Y + + S I + + G D C+
Sbjct: 303 DGTFAINSDGSGGVFFDSGAIDTSLKDAAYQVVRQAITSEI------QRLGYDAGDDTCF 356
Query: 289 RVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQ 341
N P + HF + + L G ++ S S + C+ +
Sbjct: 357 ---VAANQQAVAQMPPLVLHFDDGADMSL-NGRNYLKTSTKGPSEVLVCMAIK 405
>gi|326490700|dbj|BAJ90017.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326493830|dbj|BAJ85377.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 459
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 98/343 (28%), Positives = 149/343 (43%), Gaps = 51/343 (14%)
Query: 49 RDTCASSFCLNIHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTR 108
R CAS C ++ ++D T C + TC S+ Y G TG L
Sbjct: 125 RVQCASQTCRSLLAND------TTDACGGNPSGDDTC-----SYVNVYAPGSNTTGFLAN 173
Query: 109 DTLKVHGSSPGIIREIPKFCFGCVGSTYREP----IGIAGFGRGALSVPSQLGFLQKGFS 164
+T+ V GS G GC + P +G GF RGALS+ SQL + FS
Sbjct: 174 ETVAV-GSFVG------AAILGCSAANSTGPLVGEVGSFGFNRGALSLVSQLSVSK--FS 224
Query: 165 HCFLAFKYANDPNISSPLVIGDVAI-SSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSS 223
+ +LA A + S +++GD A+ ++ + TP+L+S +P+ YY+ L AI + +
Sbjct: 225 Y-YLAPDEAGSSDSESVVLLGDAAVPQTRGGGRSTPLLRSTAFPDVYYVKLSAIQVDGQA 283
Query: 224 LTEVPLSLREFDSQGNGGLLVDSGTTY--THLPEPFYSQLLSILQSTITYYPRAKEVEER 281
L+ +P + + G+ G +V GT Y T L E Y+ + L S I A+EV
Sbjct: 284 LSGIPAGAFDLAADGSSGGVV-MGTLYPITRLQEDAYNAVRQALVSKI----NAQEVNGS 338
Query: 282 T----GFDLCYRVPCPNNTFTDDLFPSITFHFLNN---VSLVLPQGNHFYAMSAPSNSSA 334
FDLCY + FP IT F +L L ++F+ N +
Sbjct: 339 AFAGGVFDLCYDA----QSVATLTFPKITLVFDGGNAPATLELTTVHYFF----KDNVTG 390
Query: 335 VKCLLFQSMDDGDYGPSG-VFGSFQQQNVEVVYDLEKERIGFQ 376
++C M G P G V GS Q ++YD+ E + +
Sbjct: 391 LQCFTMLPMPVGT--PFGSVLGSMVQAGTNMIYDVGGETLTLE 431
>gi|147809812|emb|CAN71447.1| hypothetical protein VITISV_040904 [Vitis vinifera]
Length = 988
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 100/392 (25%), Positives = 152/392 (38%), Gaps = 77/392 (19%)
Query: 3 QVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHS 62
++ +DTGS +TW C C+ C +
Sbjct: 141 KLILDTGSSITWTQCK----ACVHC--------------------------------LKD 164
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIR 122
S FD S S + + ST ++ TYG+ G DT+ + S
Sbjct: 165 SHRHFDSLASSTYSFGSCIPSTVGN---TYNMTYGDKSTSVGNYGCDTMTLEPSDV---- 217
Query: 123 EIPKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLG-FLQKGFSHCFLAFKYANDPN 177
KF FGC G G+ G G+G LS SQ +K FS+C + N
Sbjct: 218 -FQKFQFGCGRNNEGDFGSGADGMLGLGQGQLSTVSQTASKFKKVFSYCL------PEEN 270
Query: 178 ISSPLVIGDVAISSKDNLQFTPMLKSP-----MYPNYYYIGLEAITIGNSSLTEVPLSLR 232
L+ G+ A S +L+FT ++ P YY++ L I++GN L +P S+
Sbjct: 271 SIGSLLFGEKATSQSSSLKFTSLVNGPGTSGLEESGYYFVKLLDISVGNKRLN-IPSSV- 328
Query: 233 EFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAK-EVEERTGFDLCYRVP 291
F S G ++DSGT T LP+ YS L + + + YP + +E D CY +
Sbjct: 329 -FASPGT---IIDSGTVITRLPQRAYSALKAAFKKAMAKYPLSNGRRKENDMLDTCYNLS 384
Query: 292 CPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPS 351
D L P HF + + L + N ++ CL F P
Sbjct: 385 GRK----DVLLPEXVLHFGDGADVRLNGKRVVWG-----NDASRLCLAFAGNSKSTMNPE 435
Query: 352 -GVFGSFQQQNVEVVYDLEKERIGFQPMDCAS 382
+ G+ QQ ++ V+YD+ RIGF C++
Sbjct: 436 LTIIGNRQQVSLTVLYDIRGRRIGFGGNGCSN 467
>gi|357439021|ref|XP_003589787.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355478835|gb|AES60038.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 456
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 94/381 (24%), Positives = 152/381 (39%), Gaps = 76/381 (19%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+D+GSD+ W+ C CD N + F+P+ S+S C+S+ C
Sbjct: 146 IDSGSDIVWI-------QCEPCDQCYN-QTDPIFNPATSASFIGVACSSNVC-------- 189
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCP-SFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREI 124
+ L CR + YG+G G L +T+ + + +I++
Sbjct: 190 ------------NQLDDDVACRKGRCGYQVAYGDGSYTKGTLALETITIGRT---VIQDT 234
Query: 125 PKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNISSPLV 183
C + G+ G G G +S QLG G F +C +S +
Sbjct: 235 AIGCGHWNEGMFVGAAGLLGLGGGPMSFVGQLGAQTGGAFGYCL----------VSRAMP 284
Query: 184 IGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFD--SQGNGG 241
+G + + P++ +P YP++YY+ L + +G VP+S + F G GG
Sbjct: 285 VGAM---------WVPLIHNPFYPSFYYVSLSGLAVGG---IRVPISEQIFQLTDIGTGG 332
Query: 242 LLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDL 301
+++D+GT T LP Y+ + T PRA V FD CY + N F
Sbjct: 333 VVMDTGTAITRLPTVAYNAFRDAFIAQTTNLPRAPGVSI---FDTCYDL----NGFVTVR 385
Query: 302 FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSG--VFGSFQQ 359
P+++F+F L P N P++ C F PSG + G+ QQ
Sbjct: 386 VPTVSFYFSGGQILTFPARNFL----IPADDVGTFCFAFAP------SPSGLSIIGNIQQ 435
Query: 360 QNVEVVYDLEKERIGFQPMDC 380
+ ++V D +GF P C
Sbjct: 436 EGIQVSIDGTNGFVGFGPNVC 456
>gi|255548664|ref|XP_002515388.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545332|gb|EEF46837.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 494
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 99/383 (25%), Positives = 149/383 (38%), Gaps = 61/383 (15%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ DTGSDLTW C C N+ + F+PS+S+S + +C S+ C ++ S+
Sbjct: 168 LIFDTGSDLTWTQCEPCVKSCY-------NQKEAIFNPSQSTSYANISCGSTLCDSLASA 220
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
C S C + YG+ G ++ L + +
Sbjct: 221 TGNIFNCASSTCV---------------YGIQYGDSSFSIGFFGKEKLSLTATDV----- 260
Query: 124 IPKFCFGCVGSTYREPIGIAGFG----RGALSVPSQLG-FLQKGFSHCFLAFKYANDPNI 178
F FGC G + G A R LS+ SQ K FS+C P+
Sbjct: 261 FNDFYFGC-GQNNKGLFGGAAGLLGLGRDKLSLVSQTAQRYNKIFSYCL--------PSS 311
Query: 179 SSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQG 238
SS S+ + FTP+ ++Y + L I++G L P
Sbjct: 312 SSSTGFLTFGGSTSKSASFTPLATISGGSSFYGLDLTGISVGGRKLAISPSVFS------ 365
Query: 239 NGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFT 298
G ++DSGT T LP YS L S + ++ YP A + D C+ ++T +
Sbjct: 366 TAGTIIDSGTVITRLPPAAYSALSSTFRKLMSQYPAAPALSI---LDTCFDFS-NHDTIS 421
Query: 299 DDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQ 358
P I F V + + + FY N CL F + D +FG+ Q
Sbjct: 422 ---VPKIGLFFSGGVVVDIDKTGIFYV-----NDLTQVCLAFAG--NSDASDVAIFGNVQ 471
Query: 359 QQNVEVVYDLEKERIGFQPMDCA 381
Q+ +EVVYD R+GF P C+
Sbjct: 472 QKTLEVVYDGAAGRVGFAPAGCS 494
>gi|224092218|ref|XP_002309514.1| predicted protein [Populus trichocarpa]
gi|222855490|gb|EEE93037.1| predicted protein [Populus trichocarpa]
Length = 474
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 104/383 (27%), Positives = 157/383 (40%), Gaps = 67/383 (17%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
DTGS +TW C C + + F P++S+S + +C+S+ C
Sbjct: 152 FDTGSGITWTQCQPCLGSCYPQKEQK-------FDPTKSTSYNNVSCSSASC-------- 196
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
P + GCS S STC + YG+ G +TL + S
Sbjct: 197 NLLPTSERGCSAS---NSTCL-----YQIIYGDQSYSQGFFATETLTISSSDV-----FT 243
Query: 126 KFCFGCVGST---YREPIGIAGFGRGALSVPSQLG-FLQKGFSHCFLAFKYANDPNISSP 181
F FGC S + + G+ G ++S+PSQ QK FS+C S+P
Sbjct: 244 NFLFGCGQSNNGLFGQAAGLLGLSSSSVSLPSQTAEKYQKQFSYCLP----------STP 293
Query: 182 LVIGDVAISSK--DNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
G + K FTP+ SP + ++Y I + I++ S L P+ F + G
Sbjct: 294 SSTGYLNFGGKVSQTAGFTPI--SPAFSSFYGIDIVGISVAGSQL---PIDPSIFTTSG- 347
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTD 299
++DSGT T LP Y L ++ YP+ E D CY + +T
Sbjct: 348 --AIIDSGTVITRLPPTAYKALKEAFDEKMSNYPKTNGDEL---LDTCYDF----SNYTT 398
Query: 300 DLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQS-MDDGDYGPSGVFGSFQ 358
FP ++ F V + + Y + N + CL F + DD ++G +FG+ Q
Sbjct: 399 VSFPKVSVSFKGGVEVDIDASGILYLV----NGVKMVCLAFAANKDDSEFG---IFGNHQ 451
Query: 359 QQNVEVVYDLEKERIGFQPMDCA 381
Q+ EVVYD K IGF C+
Sbjct: 452 QKTYEVVYDGAKGMIGFAAGACS 474
>gi|356553832|ref|XP_003545255.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 427
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 105/399 (26%), Positives = 164/399 (41%), Gaps = 74/399 (18%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + +DTGS+L+W+ C L L S F+P SSS + C SS C+
Sbjct: 72 VTMVLDTGSELSWLHCKKLP------------NLNSTFNPLLSSSYTPTPCNSSVCMT-R 118
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAY---TYGEGGLVTGILTRDTLKVHGSS- 117
+ D L C P + +Y + G L +T + G++
Sbjct: 119 TRD---------------LTIPASCDPNNKLCHVIVSYADASSAEGTLAAETFSLAGAAQ 163
Query: 118 PGIIREIPKFCFGCVGST-YREPI-------GIAGFGRGALSVPSQLGFLQKGFSHCFLA 169
PG + FGC+ S Y I G+ G RG+LS+ +Q+ + FS+C
Sbjct: 164 PGTL-------FGCMDSAGYTSDINEDAKTTGLMGMNRGSLSLVTQM--VLPKFSYCI-- 212
Query: 170 FKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNY-----YYIGLEAITIGNSSL 224
+ + L++GD S+ LQ+TP++ + Y Y + LE I + + L
Sbjct: 213 ----SGEDAFGVLLLGD-GPSAPSPLQYTPLVTATTSSPYFDRVAYTVQLEGIKV-SEKL 266
Query: 225 TEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLL-SILQSTITYYPRAKE--VEER 281
++P S+ D G G +VDSGT +T L P Y+ L L+ T R ++
Sbjct: 267 LQLPKSVFVPDHTGAGQTMVDSGTQFTFLLGPVYNSLKDEFLEQTKGVLTRIEDPNFVFE 326
Query: 282 TGFDLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQ 341
DLCY P + P++T F + + + Y +S V C F
Sbjct: 327 GAMDLCYHAPA-----SLAAVPAVTLVF-SGAEMRVSGERLLYRVS--KGRDWVYCFTFG 378
Query: 342 SMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
+ D + V G QQNV + +DL K R+GF C
Sbjct: 379 NSDLLGI-EAYVIGHHHQQNVWMEFDLVKSRVGFTETTC 416
>gi|449508697|ref|XP_004163385.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase Asp1-like
[Cucumis sativus]
Length = 418
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 103/403 (25%), Positives = 156/403 (38%), Gaps = 85/403 (21%)
Query: 7 DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS--- 63
DTGSDLTW+ C C C + P S+ C C+++HSS
Sbjct: 75 DTGSDLTWLQC---DAPCQQCTE--------TLHPLYQPSNDLVPCKDPLCMSLHSSMDH 123
Query: 64 --DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
+NP D C + Y +GG G+L RD ++ ++ I
Sbjct: 124 RCENP-DQC--------------------DYEVEYADGGSSLGVLVRDVFPLNLTNGDPI 162
Query: 122 REIPKFCFGC-----VGSTYREPI-GIAGFGRGALSVPSQL---GFLQKGFSHCFLAFKY 172
R P+ GC GS+ P+ GI G GRGA+S+ SQL G ++ HCF +
Sbjct: 163 R--PRLALGCGYDQDPGSSSYHPMDGILGLGRGAVSIVSQLHNQGIVRNVVGHCFNSKGG 220
Query: 173 ANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGL-EAITIGNSSLTEVPLSL 231
I L +TPM + YP +Y G E I G S+ L
Sbjct: 221 GY--------XFFGDGIYDPYRLVWTPMSRD--YPKHYSPGFGELIFNGRST------GL 264
Query: 232 REFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVP 291
R N ++ DSG++YT+ Y L S+L + P + +++ T LC+R
Sbjct: 265 R------NLFVVFDSGSSYTYFNAQAYQVLTSLLNRELAGKPLREAMDDDT-LPLCWRGR 317
Query: 292 CPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAM-SAPSNSSAV------KCLLFQSMD 344
P + D + ++L G A+ P+ + CL +
Sbjct: 318 KPIKSLRD------VRKYFKPLALSFSSGGRSKAVFEIPTEGYMIISSMGNVCLGILNGT 371
Query: 345 DGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASAQ 387
D S + G Q+ VVY+ EK+ IG+ +C +Q
Sbjct: 372 DVGLENSNIIGDISMQDKMVVYNNEKQAIGWATANCDRVPKSQ 414
>gi|359476204|ref|XP_002262813.2| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Vitis vinifera]
Length = 460
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 101/389 (25%), Positives = 155/389 (39%), Gaps = 79/389 (20%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
I + +DTGS +TW C C++C +
Sbjct: 141 IXLILDTGSSITWTQCK----ACVNC--------------------------------LQ 164
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
S+ FD S S + + ST ++ TYG+ G DT+ + S
Sbjct: 165 DSNRYFDSSASSTYSFGSCIPSTVEN---NYNMTYGDDSTSVGNYGCDTMTLEPSDV--- 218
Query: 122 REIPKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGF-LQKGFSHCFLAFKYANDP 176
KF FGC G G+ G G+G LS SQ K FS+C +
Sbjct: 219 --FQKFQFGCGRNNKGDFGSGVDGMLGLGQGQLSTVSQTASKFNKVFSYCL-----PEED 271
Query: 177 NISSPLVIGDVAISSKDNLQFTPMLKSP---MYPNYYYIGLEAITIGNSSLTEVPLSLRE 233
+I S L+ G+ A S +L+FT ++ P YY++ L I++GN L +P S+
Sbjct: 272 SIGS-LLFGEKATSQSSSLKFTSLVNGPGTLQESGYYFVNLSDISVGNERL-NIPSSV-- 327
Query: 234 FDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTG-FDLCYRVPC 292
F S G ++DS T T LP+ YS L + + + YP + ++ D CY +
Sbjct: 328 FASPGT---IIDSRTVITRLPQRAYSALKAAFKKAMAKYPLSNGRRKKGDILDTCYNL-- 382
Query: 293 PNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSG 352
+ D L P I HF + L N + A + CL F +
Sbjct: 383 --SGRKDVLLPEIVLHFGGGADVRLNGTNIVWGSDA-----SRLCLAFAGTSE-----LT 430
Query: 353 VFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
+ G+ QQ ++ V+YD++ RIGF C+
Sbjct: 431 IIGNRQQLSLTVLYDIQGRRIGFGGNGCS 459
>gi|226494448|ref|NP_001141341.1| uncharacterized protein LOC100273432 precursor [Zea mays]
gi|194704078|gb|ACF86123.1| unknown [Zea mays]
gi|413953775|gb|AFW86424.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 471
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 93/383 (24%), Positives = 150/383 (39%), Gaps = 65/383 (16%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DTGS LTW+ C C + ++ F P SS+ + C++S C + ++
Sbjct: 149 MVVDTGSSLTWLQCSPCVVSC-------HRQVGPLFDPRASSTYTSVRCSASQCDELQAA 201
Query: 64 D-NPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIR 122
NP S CS S + C + +YG+ G L+ DT+ +S
Sbjct: 202 TLNP------SACSASNV----CI-----YQASYGDSSFSVGYLSTDTVSFGSTS----- 241
Query: 123 EIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLG-FLQKGFSHCFLAFKYANDPNI 178
P F +GC + G+ G R LS+ QL L FS+C P
Sbjct: 242 -YPSFYYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYSFSYCL--------PTA 292
Query: 179 SSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQG 238
+S + ++ +TPM S + + Y+I L +++G S L P E+ S
Sbjct: 293 ASTGYLSIGPYNTGHYYSYTPMASSSLDASLYFITLSGMSVGGSPLAVSP---SEYSSLP 349
Query: 239 NGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFT 298
++DSGT T LP ++ L + + RA D C+ +
Sbjct: 350 T---IIDSGTVITRLPTAVHTALSKAVAQAMAGAQRAPAFSI---LDTCFE-----GQAS 398
Query: 299 DDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQ 358
P++ F S+ L N + + CL F D + + G+ Q
Sbjct: 399 QLRVPTVVMAFAGGASMKLTTRNVLIDVD-----DSTTCLAFAPTDS-----TAIIGNTQ 448
Query: 359 QQNVEVVYDLEKERIGFQPMDCA 381
QQ V+YD+ + RIGF C+
Sbjct: 449 QQTFSVIYDVAQSRIGFSAGGCS 471
>gi|125560845|gb|EAZ06293.1| hypothetical protein OsI_28528 [Oryza sativa Indica Group]
Length = 525
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 105/389 (26%), Positives = 156/389 (40%), Gaps = 51/389 (13%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ V +DTGSDLTWV C C C R+ F PS S+S + C +S C
Sbjct: 177 LTVIVDTGSDLTWVQCK----PCSVCYAQRDPL----FDPSGSASYAAVPCNASACEASL 228
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
+ + KS C ++ YG+G G+L DT+ + G+S
Sbjct: 229 KAATGVPGSCATVGGGGGGGKSERCY----YSLAYGDGSFSRGVLATDTVALGGAS---- 280
Query: 122 REIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPN 177
+ F FGC S + G+ G GR LS+ SQ G FS+C A A +
Sbjct: 281 --VDGFVFGCGLSNRGLFGGTAGLMGLGRTELSLVSQTAPRFGGVFSYCLPA---ATSGD 335
Query: 178 ISSPLVIGDVAISSKDN--LQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFD 235
+ L +G S ++ + +T M+ P P +Y++ N + V +
Sbjct: 336 AAGSLSLGGDTSSYRNATPVSYTRMIADPAQPPFYFM--------NVTGASVGGAAVAAA 387
Query: 236 SQGNGGLLVDSGTTYTHLPEPFYSQLLS--ILQSTITYYPRAKEVEERTGFDLCYRVPCP 293
G +L+DSGT T L Y + + Q YP A D CY
Sbjct: 388 GLGAANVLLDSGTVITRLAPSVYRAVRAEFARQFGAERYPAAPPFSL---LDACY----- 439
Query: 294 NNTFTDDL-FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSG 352
N T D++ P +T + + + A + S V CL S+ D P
Sbjct: 440 NLTGHDEVKVPLLTLRLEGGADMTVDAAGMLFM--ARKDGSQV-CLAMASLSFEDQTP-- 494
Query: 353 VFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
+ G++QQ+N VVYD R+GF DC+
Sbjct: 495 IIGNYQQKNKRVVYDTVGSRLGFADEDCS 523
>gi|115475621|ref|NP_001061407.1| Os08g0267300 [Oryza sativa Japonica Group]
gi|37806402|dbj|BAC99940.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|113623376|dbj|BAF23321.1| Os08g0267300 [Oryza sativa Japonica Group]
Length = 524
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 105/389 (26%), Positives = 156/389 (40%), Gaps = 51/389 (13%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ V +DTGSDLTWV C C C R+ F PS S+S + C +S C
Sbjct: 176 LTVIVDTGSDLTWVQCK----PCSVCYAQRDPL----FDPSGSASYAAVPCNASACEASL 227
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
+ + KS C ++ YG+G G+L DT+ + G+S
Sbjct: 228 KAATGVPGSCATVGGGGGGGKSERCY----YSLAYGDGSFSRGVLATDTVALGGAS---- 279
Query: 122 REIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPN 177
+ F FGC S + G+ G GR LS+ SQ G FS+C A A +
Sbjct: 280 --VDGFVFGCGLSNRGLFGGTAGLMGLGRTELSLVSQTAPRFGGVFSYCLPA---ATSGD 334
Query: 178 ISSPLVIGDVAISSKDN--LQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFD 235
+ L +G S ++ + +T M+ P P +Y++ N + V +
Sbjct: 335 AAGSLSLGGDTSSYRNATPVSYTRMIADPAQPPFYFM--------NVTGASVGGAAVAAA 386
Query: 236 SQGNGGLLVDSGTTYTHLPEPFYSQLLS--ILQSTITYYPRAKEVEERTGFDLCYRVPCP 293
G +L+DSGT T L Y + + Q YP A D CY
Sbjct: 387 GLGAANVLLDSGTVITRLAPSVYRAVRAEFARQFGAERYPAAPPFSL---LDACY----- 438
Query: 294 NNTFTDDL-FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSG 352
N T D++ P +T + + + A + S V CL S+ D P
Sbjct: 439 NLTGHDEVKVPLLTLRLEGGADMTVDAAGMLFM--ARKDGSQV-CLAMASLSFEDQTP-- 493
Query: 353 VFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
+ G++QQ+N VVYD R+GF DC+
Sbjct: 494 IIGNYQQKNKRVVYDTVGSRLGFADEDCS 522
>gi|357465299|ref|XP_003602931.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355491979|gb|AES73182.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 438
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 103/382 (26%), Positives = 163/382 (42%), Gaps = 62/382 (16%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DT +D ++P S C+ C + FSP+ S+S C+ C +
Sbjct: 113 MVLDTSTDEAFIP----SSGCIGCS-------ATTFSPNASTSYVPLECSVPQCSQVRGL 161
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
P T SG SF +Y G + L +D+L++
Sbjct: 162 SCP---ATGSGAC--------------SFNKSYA-GSTYSATLVQDSLRLA------TDV 197
Query: 124 IPKFCFGCVGSTYREPI---GIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNIS 179
IP + FG + + I G+ G GRG LS+ SQ G L G FS+C +FK S
Sbjct: 198 IPSYSFGSINAISGSSIPAQGLLGLGRGPLSLLSQTGSLYSGVFSYCLPSFK---SYYFS 254
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
L +G V +++ TP+L++P P+ Y++ L IT+G ++ P L FD
Sbjct: 255 GSLKLGPVG--QPKSIRTTPLLRNPRRPSLYFVNLTGITVGKVNV-PFPKELLAFDVNTG 311
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTD 299
G ++DSGT T EP Y+ + + +T FD C+ +
Sbjct: 312 SGTIIDSGTVITRFVEPVYNAVRDEFRKQVT-----GPFSSLGAFDTCFV------KNYE 360
Query: 300 DLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSM-DDGDYGPSGVFGSFQ 358
L P+IT HF ++ L LP N S+S ++ CL S + +Y V ++Q
Sbjct: 361 TLAPAITLHF-TDLDLKLPLENSLIH----SSSGSLACLAMASTPKNVNYTVLNVIANYQ 415
Query: 359 QQNVEVVYDLEKERIGFQPMDC 380
QQN+ V++D ++G C
Sbjct: 416 QQNLRVLFDTVNNKVGIARELC 437
>gi|242050744|ref|XP_002463116.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
gi|241926493|gb|EER99637.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
Length = 632
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 100/387 (25%), Positives = 158/387 (40%), Gaps = 67/387 (17%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +D+GS +T+VPC + C C ++++ + F P SSS S C N+
Sbjct: 104 LIVDSGSTVTYVPCAS----CEQCGNHQDPR----FQPDLSSSYSPVKC------NVD-- 147
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
CT C K C ++ Y E +G+L D + S +
Sbjct: 148 ------CT---CDSD---KKQC-----TYERQYAEMSSSSGVLGEDIVSFGRESE---LK 187
Query: 124 IPKFCFGCVGSTY-----REPIGIAGFGRGALSVPSQL---GFLQKGFSHCFLAFKYAND 175
+ FGC S + GI G GRG LS+ QL G + FS C+
Sbjct: 188 PQRAVFGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDIGG- 246
Query: 176 PNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFD 235
+V+G V S + L+SP YY I L+ I + +L + R F+
Sbjct: 247 ----GAMVLGGVPAPSDMVFSHSDPLRSP----YYNIELKEIHVAGKALR---VDSRVFN 295
Query: 236 SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNN 295
S+ G ++DSGTTY +LPE + + S + + + + D+C+ N
Sbjct: 296 SKH--GTVLDSGTTYAYLPEQAFVAFKDAVTSKVHSLKKIRGPDPNYK-DICFAGAGRNV 352
Query: 296 TFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCL-LFQSMDDGDYGPSGVF 354
+ ++FP + F N L L N+ + S CL +FQ+ D P+ +
Sbjct: 353 SKLHEVFPDVDMVFGNGQKLSLTPENYLFRH---SKVDGAYCLGVFQNGKD----PTTLL 405
Query: 355 GSFQQQNVEVVYDLEKERIGFQPMDCA 381
G +N V YD E+IGF +C+
Sbjct: 406 GGIIVRNTLVTYDRHNEKIGFWKTNCS 432
>gi|326531454|dbj|BAJ97731.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 102/381 (26%), Positives = 157/381 (41%), Gaps = 66/381 (17%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V +DTGSD++WV C S C+ R+ F P++SS+ S C + C +
Sbjct: 158 VEVDTGSDVSWVQCKPCSAPA--CNSQRDQL----FDPAKSSTYSAVPCGADACSELRIY 211
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
+ +GCS S C + +YG+G TG+ DTL + +PG
Sbjct: 212 E--------AGCS-----GSQC-----GYVVSYGDGSNTTGVYGSDTLAL---APG--NT 248
Query: 124 IPKFCFGCVGSTYREPIGIAG---FGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNIS 179
+ F FGC + GI G GR ++S+ SQ G FS+C + + A +
Sbjct: 249 VGTFLFGCGHAQAGMFAGIDGLLALGRQSMSLKSQAAGAYGGVFSYCLPSKQSA-----A 303
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
L +G +S T +L + P +Y + L I++G + VP S
Sbjct: 304 GYLTLG--GPTSASGFATTGLLTAWAAPTFYMVMLTGISVGGQQV-AVPASAFA------ 354
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTD 299
GG +VD+GT T LP Y+ L S + I Y D CY + +
Sbjct: 355 GGTVVDTGTVITRLPPTAYAALRSAFRGAIAPYGYPSAPANGI-LDTCYDF----SRYGV 409
Query: 300 DLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQ 359
P++ F +L A+ AP S+ CL F +G G + + G+ QQ
Sbjct: 410 VTLPTVALTFSGGATL---------ALEAPGILSS-GCLAF--APNGGDGDAAILGNVQQ 457
Query: 360 QNVEVVYDLEKERIGFQPMDC 380
++ V +D +GF P C
Sbjct: 458 RSFAVRFD--GSTVGFMPGAC 476
>gi|242041951|ref|XP_002468370.1| hypothetical protein SORBIDRAFT_01g044790 [Sorghum bicolor]
gi|241922224|gb|EER95368.1| hypothetical protein SORBIDRAFT_01g044790 [Sorghum bicolor]
Length = 408
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 92/385 (23%), Positives = 140/385 (36%), Gaps = 83/385 (21%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DT +D TW C CD S F P+ SSS + CAS +C
Sbjct: 96 LDTSADATWS-------HCAPCD---TCPAGSRFIPASSSSYASLPCASDWCPLFRRPAV 145
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
P +P + + LL++ P +G+L + R P
Sbjct: 146 PGEPGRVGAAADVRLLQAASRTP-------------RSGVLAATRCGWARTPSPATRSGP 192
Query: 126 KFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSPLVIG 185
GS Y G+ FS+C +++ S L +G
Sbjct: 193 MSLLSQTGSRYN---GV--------------------FSYCLPSYRSYY---FSGSLRLG 226
Query: 186 DVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGGLLVD 245
A N+++TP+L +P P+ YY+ + +++G +L + P FD G ++D
Sbjct: 227 --AAGQPRNVRYTPLLTNPHRPSLYYVNVTGLSVGR-ALVKAPAGSFAFDPSTGAGTVID 283
Query: 246 SGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTG------FDLCYRVPCPNNTFTD 299
SGT T P Y+ L + ++V +G FD C+ TD
Sbjct: 284 SGTVITRWTAPVYAALRDEFR---------RQVAAPSGYTSLGAFDTCFN--------TD 326
Query: 300 DL----FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFG 355
++ P +T H V L LP N SA + + CL V
Sbjct: 327 EVAAGGAPPVTLHMGGGVDLTLPMENTLIHSSA----TPLACLAMAEAPQNVNSVVNVVA 382
Query: 356 SFQQQNVEVVYDLEKERIGFQPMDC 380
+ QQQNV VV D+ R+GF C
Sbjct: 383 NLQQQNVRVVVDVAGSRVGFAREPC 407
>gi|242073262|ref|XP_002446567.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
gi|241937750|gb|EES10895.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
Length = 453
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 108/409 (26%), Positives = 162/409 (39%), Gaps = 94/409 (22%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFC--LNIHSS 63
+DT SDL W+ C C+ C YR +L F+P SSS + C+S C L+ H
Sbjct: 105 IDTASDLVWLQCQ----PCVSC--YR--QLDPIFNPRLSSSYAVVPCSSDTCSQLDGHRC 156
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
D D CR + Y Y + G L D L V G+
Sbjct: 157 DEDDD---------------QACR----YNYKYSGNAVTNGTLAIDKLAVGGNV------ 191
Query: 124 IPKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNIS 179
GC VG + G+ G RG LS+ SQL + F +C P
Sbjct: 192 FHAVVLGCSDSSVGGPPPQASGLVGLARGPLSLLSQLSV--RRFMYCL------PPPMSR 243
Query: 180 SP--LVIG-----DVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLR 232
+P LV+G D + D + T M S YP+YYY+ + + +G+ + P ++R
Sbjct: 244 TPGKLVLGAGAGADAVRNVSDRVTVT-MSSSTRYPSYYYLNFDGLAVGD----QTPGTIR 298
Query: 233 EFDS-----------------QGNG-GLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPR 274
S N G++VD +T + L Y +L L+ I PR
Sbjct: 299 RPTSPPATGGGVGGGGGDGGSGANAYGMIVDVASTISFLEASLYDELADDLEEEIRL-PR 357
Query: 275 AKEVEERTGFDLCYRVPCPNNTFTDDLF-PSITFHFLNNVSLVLPQGNHFYAMSAPSNSS 333
A R G DLC+ +P D ++ P+++ F + L L + F
Sbjct: 358 ATP-STRLGLDLCFILP--EGVGIDRVYVPTVSMSF-DGRWLELERDRLFL------EDG 407
Query: 334 AVKCLLFQSMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCAS 382
+ CL+ G + G++QQQN+ V+Y+L + +I F C S
Sbjct: 408 RMMCLMI-----GRTSGVSILGNYQQQNMHVLYNLRRGKITFAKASCDS 451
>gi|115448471|ref|NP_001048015.1| Os02g0730700 [Oryza sativa Japonica Group]
gi|46390468|dbj|BAD15929.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
Group]
gi|46390864|dbj|BAD16368.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
Group]
gi|113537546|dbj|BAF09929.1| Os02g0730700 [Oryza sativa Japonica Group]
gi|215697021|dbj|BAG91015.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222623612|gb|EEE57744.1| hypothetical protein OsJ_08261 [Oryza sativa Japonica Group]
Length = 573
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 97/403 (24%), Positives = 159/403 (39%), Gaps = 72/403 (17%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGSDLTW+ C +C P + C + + N
Sbjct: 220 VDTGSDLTWIQCDAPCTNCAKGPHPLYKPAKEKIVPPKDL----------LCQELQGNQN 269
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
C+ C + Y + G+L RD + + ++ G RE
Sbjct: 270 ----------------YCETCKQC-DYEIEYADRSSSMGVLARDDMHIITTNGG--REKL 310
Query: 126 KFCFGCV----GSTYREPI---GIAGFGRGALSVPSQL---GFLQKGFSHCFLAFKYAND 175
F FGC G P GI G +S+PSQL G + F HC D
Sbjct: 311 DFVFGCAYDQQGQLLASPAKTDGILGLSSAGISLPSQLANQGIISNVFGHCI-----TRD 365
Query: 176 PNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFD 235
PN + +GD + + + TP+ +P N ++ + + G+ L S+R
Sbjct: 366 PNGGGYMFLGDDYVP-RWGMTSTPIRSAP--DNLFHTEAQKVYYGDQQL-----SMR--G 415
Query: 236 SQGNG-GLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPR-AKEVEERTGFDLCYRVPCP 293
+ GN ++ DSG++YT+LP+ Y L++ ++ YP ++ +RT LC P
Sbjct: 416 ASGNSVQVIFDSGSSYTYLPDEIYKNLIAAIKYA---YPNFVQDSSDRT-LPLCLATDFP 471
Query: 294 NNTFTD--DLFPSITFHFLNNVSLVLPQG-----NHFYAMSAPSNSSAVKCLLFQSMDDG 346
D LF + HF V+P+ +++ +S N CL F + D
Sbjct: 472 VRYLEDVKQLFKPLNLHF-GKRWFVMPRTFTILPDNYLIISDKGNV----CLGFLNGKDI 526
Query: 347 DYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASAQGL 389
D+G + + G + VVYD ++ +IG+ DC + +G
Sbjct: 527 DHGSTVIVGDNALRGKLVVYDNQQRQIGWTNSDCTKPQTQKGF 569
>gi|218191512|gb|EEC73939.1| hypothetical protein OsI_08807 [Oryza sativa Indica Group]
Length = 574
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 97/403 (24%), Positives = 159/403 (39%), Gaps = 72/403 (17%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGSDLTW+ C +C P + C + + N
Sbjct: 221 VDTGSDLTWIQCDAPCTNCAKGPHPLYKPAKEKIVPPKDL----------LCQELQGNQN 270
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
C+ C + Y + G+L RD + + ++ G RE
Sbjct: 271 ----------------YCETCKQC-DYEIEYADRSSSMGVLARDDMHIITTNGG--REKL 311
Query: 126 KFCFGCV----GSTYREPI---GIAGFGRGALSVPSQL---GFLQKGFSHCFLAFKYAND 175
F FGC G P GI G +S+PSQL G + F HC D
Sbjct: 312 DFVFGCAYDQQGQLLASPAKTDGILGLSSAGISLPSQLANQGIISNVFGHCI-----TRD 366
Query: 176 PNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFD 235
PN + +GD + + + TP+ +P N ++ + + G+ L S+R
Sbjct: 367 PNGGGYMFLGDDYVP-RWGMTSTPIRSAP--DNLFHTEAQKVYYGDQQL-----SMR--G 416
Query: 236 SQGNG-GLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPR-AKEVEERTGFDLCYRVPCP 293
+ GN ++ DSG++YT+LP+ Y L++ ++ YP ++ +RT LC P
Sbjct: 417 ASGNSVQVIFDSGSSYTYLPDEIYKNLIAAIKYA---YPNFVQDSSDRT-LPLCLATDFP 472
Query: 294 NNTFTD--DLFPSITFHFLNNVSLVLPQG-----NHFYAMSAPSNSSAVKCLLFQSMDDG 346
D LF + HF V+P+ +++ +S N CL F + D
Sbjct: 473 VRYLEDVKQLFKPLNLHF-GKRWFVMPRTFTILPDNYLIISDKGNV----CLGFLNGKDI 527
Query: 347 DYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASAQGL 389
D+G + + G + VVYD ++ +IG+ DC + +G
Sbjct: 528 DHGSTVIVGDNALRGKLVVYDNQQRQIGWTNSDCTKPQTQKGF 570
>gi|356502091|ref|XP_003519855.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 519
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 105/397 (26%), Positives = 151/397 (38%), Gaps = 87/397 (21%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNF-----SPSRSSSSSRDTCASSFCL 58
V +DTGSDL WVPC DC C + S+F +P+ SS+S + TC +S C
Sbjct: 115 VALDTGSDLFWVPC-----DCTRCAASDSTAFASDFDLNVYNPNGSSTSKKVTCNNSLCT 169
Query: 59 NIHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSP 118
+ S C L T CP +GIL D L +
Sbjct: 170 H------------RSQC-LGTFSN------CPYMVSYVSAETSTSGILVEDVLHLTQEDN 210
Query: 119 GIIREIPKFCFGC----VGS--TYREPIGIAGFGRGALSVPSQL---GFLQKGFSHCFLA 169
FGC GS P G+ G G +SVPS L GF FS CF
Sbjct: 211 HHDLVEANVIFGCGQIQSGSFLDVAAPNGLFGLGMEKISVPSMLSREGFTADSFSMCF-- 268
Query: 170 FKYANDPNISSPLVIGDVAISSKDNL--QFTPMLKSPMYPNYYYIGLEAITIGNSSLTEV 227
D IG ++ K + TP +P +P Y N ++T+V
Sbjct: 269 ---GRDG-------IGRISFGDKGSFDQDETPFNLNPSHPTY-----------NITVTQV 307
Query: 228 PLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLC 287
+ D + L DSGT++T+L +P Y++L S + R + R F+ C
Sbjct: 308 RVGTTVIDVEFTA--LFDSGTSFTYLVDPTYTRLTESFHSQVQ--DRRHRSDSRIPFEYC 363
Query: 288 YRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYA----MSAPSNSSAVKCLLFQSM 343
Y D+ P + +VSL + G+HF + + S V CL
Sbjct: 364 Y-----------DMSPDANTSLIPSVSLTMGGGSHFAVYDPIIIISTQSELVYCLAVVKS 412
Query: 344 DDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
+ + + G VV+D EK +G++ DC
Sbjct: 413 AELN-----IIGQNFMTGYRVVFDREKLVLGWKKFDC 444
>gi|226500708|ref|NP_001149229.1| aspartic proteinase nepenthesin-2 [Zea mays]
gi|195625632|gb|ACG34646.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 471
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 92/399 (23%), Positives = 164/399 (41%), Gaps = 74/399 (18%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
I +DTGSD+ W +C SRS + S C S C
Sbjct: 125 ISAVVDTGSDIFWT----TEKEC-----------------SRSKTRSMLPCCSPKCEQRA 163
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGG--LVTGILTRDTLKVHGSSPG 119
S GC S L ++A YG G++ D L + +
Sbjct: 164 SC----------GCGRSELKAEAEKETKCTYAIIYGGNANDSTAGVMYEDKLTIVAVASK 213
Query: 120 II---REIPKFCFGCVGST---YREP--IGIAGFGRGALSVPSQLGFLQKGFSHCFLAFK 171
+ + + GC S +++P G+ G GR A S+P QL F + FS+C +++
Sbjct: 214 AVPSSQSFKEVAIGCSTSATLKFKDPSIKGVFGLGRSATSLPRQLNFSK--FSYCLSSYQ 271
Query: 172 YANDPNISSPLVIGDV------AISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLT 225
+P++ S L++ A+ + T + + Y Y++ L+ I+IG +
Sbjct: 272 ---EPDLPSYLLLTAAPDMATGAVGGGAAVATTALQPNSDYKTLYFVHLQNISIGGTRFP 328
Query: 226 EVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFD 285
V ++ G + VD+G ++T L +++L++ L + KE R
Sbjct: 329 AV-------STKSGGNMFVDTGASFTRLEGTVFAKLVTELDRIMKERKYVKEQPGRNNGQ 381
Query: 286 LCYRVPCPNNTFTDD--LFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCL-LFQS 342
+CY P +T D+ P + HF ++ ++VLP ++ + +++ CL +++S
Sbjct: 382 ICY---SPPSTAADESSKLPDMVLHFADSANMVLPWDSYLW------KTTSKLCLAIYKS 432
Query: 343 MDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
G V G+FQ QN ++ D E++ F DC+
Sbjct: 433 NIKGGIS---VLGNFQMQNTHMLLDTGNEKLSFVRADCS 468
>gi|357510893|ref|XP_003625735.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355500750|gb|AES81953.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 535
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 105/439 (23%), Positives = 178/439 (40%), Gaps = 106/439 (24%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V +DTGSD+ W+ C C +C + N+ + SSS++ S
Sbjct: 86 VQIDTGSDILWLNCNT----CNNCPKSSGLGIDLNYFDTASSSTAALVSCS--------- 132
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTL---------KVH 114
DP +T S+ C S+ + YG+G +G D +
Sbjct: 133 ----DPVCSYAVQTATSQCSSQANQC-SYTFQYGDGSGTSGYYVYDAMYFDVIMGQSVFS 187
Query: 115 GSSPGIIREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQL---GFLQKGFSHCFLAFK 171
SS ++ + G + T + GI GFG GALSV SQ+ G K FSHC
Sbjct: 188 NSSSTVVFGCSTYQSGDLARTEKAVDGIFGFGPGALSVVSQVSSQGMAPKVFSHCL---- 243
Query: 172 YANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSL 231
+ LV+G++ + N+ +TP++ P+ P +Y + L++I + L P+
Sbjct: 244 -KGQGSGGGILVLGEIL---EPNIVYTPLV--PLQP-HYNLNLQSIAVNGQIL---PIDQ 293
Query: 232 REFDSQGNGGLLVDSGTT---------------------YTHLPEP-------------- 256
F + N G +VDSGTT +TH EP
Sbjct: 294 DVFATGNNRGTIVDSGTTLAYLVQEAYDPFLNAGSPCHFFTHFNEPTNNIKYEDGNNNHQ 353
Query: 257 ------FYSQLL--------SILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDLF 302
+Y ++ +I+ +T++ + +K + + + CY VP T D+F
Sbjct: 354 SRVKRHYYDEVTLRLVLKHSAIITTTVSQF--SKPIISKG--NQCYLVP----TSLGDIF 405
Query: 303 PSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQNV 362
P ++ +F+ S+VL + + + +A+ C+ FQ + G + G ++
Sbjct: 406 PLVSLNFMGGASMVL-KPEQYLIHYGFLDGAAMWCIGFQKVQKG----YTILGDLVLKDK 460
Query: 363 EVVYDLEKERIGFQPMDCA 381
VYDL +RIG+ DC+
Sbjct: 461 IFVYDLANQRIGWTDYDCS 479
>gi|449518783|ref|XP_004166415.1| PREDICTED: aspartic proteinase-like protein 2-like, partial
[Cucumis sativus]
Length = 420
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 88/330 (26%), Positives = 142/330 (43%), Gaps = 65/330 (19%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSP---SRSSSSSRDTCASSFCLNI 60
V +DTGSD+ WV C C +C R + L +P S++ +C FCL +
Sbjct: 102 VQVDTGSDIVWVNC----IQCRECP--RTSSLGMELTPYDLEESTTGKLVSCDEQFCLEV 155
Query: 61 HSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
+ +SGC T CP + YG+G G +D ++ + S +
Sbjct: 156 NGG-------PLSGC--------TTNMSCP-YLQIYGDGSSTAGYFVKDYVQYNRVSGDL 199
Query: 121 IREIPK--FCFGC-------VGSTYREPI-GIAGFGRGALSVPSQLG---FLQKGFSHCF 167
FGC +GS+ E + GI GFG+ S+ SQL ++K F+HC
Sbjct: 200 ETTAANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNSSIISQLASTRKVKKMFAHCL 259
Query: 168 LAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPN--YYYIGLEAITIGNSSLT 225
+ N +G V + K N+ +P+ PN +Y + + + +G+ L
Sbjct: 260 ------DGTNGGGIFAMGHV-VQPKVNM-------TPLVPNQPHYNVNMTGVQVGHIILN 305
Query: 226 EVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFD 285
+S F++ G ++DSGTT +LPE Y L++ + S EV+ G
Sbjct: 306 ---ISADVFEAGDRKGTIIDSGTTLAYLPELIYEPLVAKILSQ----QHNLEVQTIHGEY 358
Query: 286 LCYRVPCPNNTFTDDLFPSITFHFLNNVSL 315
C++ + DD FP + FHF N++ L
Sbjct: 359 KCFQY----SERVDDGFPPVIFHFENSLLL 384
>gi|356537928|ref|XP_003537458.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 445
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 107/404 (26%), Positives = 166/404 (41%), Gaps = 59/404 (14%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + +DTGS+L+W+ C + + S F+P SSS + C S C
Sbjct: 83 VTMVLDTGSELSWLHC------------KKQQNINSVFNPHLSSSYTPIPCMSPIC---- 126
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFA-YTYGEGGLVTGILTRDTLKVHGS-SPG 119
D C + L C S+A +T EG L + DT + GS PG
Sbjct: 127 -KTRTRDFLIPVSCDSNNL-----CHVTVSYADFTSLEGNLAS-----DTFAISGSGQPG 175
Query: 120 IIREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNIS 179
II F + + G+ G RG+LS +Q+GF + FS+C + + S
Sbjct: 176 IIFGSMDSGFSSNANEDSKTTGLMGMNRGSLSFVTQMGFPK--FSYCI------SGKDAS 227
Query: 180 SPLVIGDVAISSKDNLQFTPMLK--SPMYPNY----YYIGLEAITIGNSSLTEVPLSLRE 233
L+ GD L++TP++K +P+ P + Y + L I +G+ L +VP +
Sbjct: 228 GVLLFGDATFKWLGPLKYTPLVKMNTPL-PYFDRVAYTVRLMGIRVGSKPL-QVPKEIFA 285
Query: 234 FDSQGNGGLLVDSGTTYTHLPEPFYSQL----LSILQSTITYYPRAKEVEERTGFDLCYR 289
D G G +VDSGT +T L Y+ L ++ + +T V E DLC+R
Sbjct: 286 PDHTGAGQTMVDSGTRFTFLLGSVYTALRNEFVAQTRGVLTLLEDPNFVFE-GAMDLCFR 344
Query: 290 VPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSA----VKCLLFQSMDD 345
V P++T F + + Y + + + V CL F + D
Sbjct: 345 V---RRGGVVPAVPAVTMVF-EGAEMSVSGERLLYRVGGDGDVAKGNGDVYCLTFGNSDL 400
Query: 346 GDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASAQGL 389
+ V G QQNV + +DL R+GF C + GL
Sbjct: 401 LGI-EAYVIGHHHQQNVWMEFDLVNSRVGFADTKCELASRRLGL 443
>gi|195625122|gb|ACG34391.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 471
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 94/383 (24%), Positives = 151/383 (39%), Gaps = 65/383 (16%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DTGS LTW+ C C + ++ F P SS+ + C++S C + ++
Sbjct: 149 MVVDTGSSLTWLQCSPCVVSC-------HRQVGPLFDPRASSTYASVRCSASQCDELQAA 201
Query: 64 D-NPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIR 122
NP S CS S + C + +YG+ G L+ DT+ GS+
Sbjct: 202 TLNP------SACSASNV----CI-----YQASYGDSSFSVGSLSTDTVSF-GST----- 240
Query: 123 EIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLG-FLQKGFSHCFLAFKYANDPNI 178
P F +GC + G+ G R LS+ QL L FS+C P
Sbjct: 241 RYPSFYYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYSFSYCL--------PTA 292
Query: 179 SSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQG 238
+S + ++ +TPM S + + Y+I L +++G S L P E+ S
Sbjct: 293 ASTGYLSIGPYNTGHYYSYTPMASSSLDASLYFITLSGMSVGGSPLAVSP---SEYSSLP 349
Query: 239 NGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFT 298
++DSGT T LP ++ L + + RA D C+ +
Sbjct: 350 T---IIDSGTVITRLPTAVHTALSKAVAQAMAGAQRAPAFSI---LDTCFE-----GQAS 398
Query: 299 DDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQ 358
P++ F S+ L N + + CL F D + + G+ Q
Sbjct: 399 QLRVPTVAMAFAGGASMKLTTRNVLIDVD-----DSTTCLAFAPTDS-----TAIIGNTQ 448
Query: 359 QQNVEVVYDLEKERIGFQPMDCA 381
QQ V+YD+ + RIGF C+
Sbjct: 449 QQTFSVIYDVAQSRIGFSAGGCS 471
>gi|356499109|ref|XP_003518386.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 428
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 111/412 (26%), Positives = 169/412 (41%), Gaps = 82/412 (19%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFC---- 57
+ + +DTGS+L+W+ C L L S F+P SSS + C SS C
Sbjct: 73 VTMVLDTGSELSWLHCKKLP------------NLNSTFNPLLSSSYTPTPCNSSICTTRT 120
Query: 58 --LNIHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHG 115
L I +S +P + C +Y + G L +T + G
Sbjct: 121 RDLTIPASCDP---------------NNKLCH----VIVSYADASSAEGTLAAETFSLAG 161
Query: 116 SS-PGIIREIPKFCFGCVGST-YREPI-------GIAGFGRGALSVPSQLGFLQKGFSHC 166
++ PG + FGC+ S Y I G+ G RG+LS+ +Q+ + FS+C
Sbjct: 162 AAQPGTL-------FGCMDSAGYTSDINEDSKTTGLMGMNRGSLSLVTQMSLPK--FSYC 212
Query: 167 FLAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNY-----YYIGLEAITIGN 221
+ + L++GD + LQ+TP++ + Y Y + LE I + +
Sbjct: 213 I------SGEDALGVLLLGD-GTDAPSPLQYTPLVTATTSSPYFNRVAYTVQLEGIKV-S 264
Query: 222 SSLTEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLL-SILQSTITYYPRAKE--- 277
L ++P S+ D G G +VDSGT +T L YS L L+ T R ++
Sbjct: 265 EKLLQLPKSVFVPDHTGAGQTMVDSGTQFTFLLGSVYSSLKDEFLEQTKGVLTRIEDPNF 324
Query: 278 VEERTGFDLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKC 337
V E DLCY P +F P++T F + + + Y +S S V C
Sbjct: 325 VFEG-AMDLCYHAPA---SFAA--VPAVTLVF-SGAEMRVSGERLLYRVS--KGSDWVYC 375
Query: 338 LLFQSMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASAQGL 389
F + D + V G QQNV + +DL K R+GF C GL
Sbjct: 376 FTFGNSDLLGI-EAYVIGHHHQQNVWMEFDLLKSRVGFTQTTCDLATQRLGL 426
>gi|255554715|ref|XP_002518395.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223542240|gb|EEF43782.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 489
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 94/387 (24%), Positives = 154/387 (39%), Gaps = 48/387 (12%)
Query: 7 DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPS-RSSSSSRDTCASSF--CLNIHSS 63
DTGSDLTW+ C C + + +N S S R+ S D C ++
Sbjct: 138 DTGSDLTWMNCEYWCKSCPKPNPHPGRVFRANDSSSFRTIPCSSDDCKIELQDYFSLTEC 197
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
NP PC F Y Y G G+ +T+ V + IR
Sbjct: 198 PNPNAPCL--------------------FDYRYLNGPRAIGVFANETVTVGLNDHKKIR- 236
Query: 124 IPKFCFGCVGS---TYREPIGIAGFGRGALSVPSQLG-FLQKGFSHCFLAFKYANDPNIS 179
+ GC S T P G+ G G S+ +L FS+C + + + N
Sbjct: 237 LFDVLIGCTESFNETNGFPDGVMGLGYRKHSLALRLAEIFGNKFSYCLV--DHLSSSNHK 294
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYY-IGLEAITIGNSSLTEVPLSLREFDSQG 238
+ L GD+ +Q T +L Y N +Y + + I++G S L+ +S ++ G
Sbjct: 295 NFLSFGDIPEMKLPKMQHTELLLG--YINAFYPVNVSGISVGGSMLS---ISSDIWNVTG 349
Query: 239 NGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFT 298
GG++VDSGT+ T L Y +++ L+ + + +E + C+ + F
Sbjct: 350 VGGMIVDSGTSLTMLAGEAYDKVVDALKPIFDKHKKVVPIELPELNNFCFE----DKGFD 405
Query: 299 DDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQ 358
P + HF + P ++ ++ +KCL + D+ S + G+
Sbjct: 406 RAAVPRLLIHFADGAIFKPPVKSYIIDVA-----EGIKCL---GIIKADFPGSSILGNVM 457
Query: 359 QQNVEVVYDLEKERIGFQPMDCASTAS 385
QQN YDL + ++GF P C + S
Sbjct: 458 QQNHLWEYDLGRGKLGFGPSSCIMSNS 484
>gi|326523515|dbj|BAJ92928.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 459
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 97/343 (28%), Positives = 149/343 (43%), Gaps = 51/343 (14%)
Query: 49 RDTCASSFCLNIHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTR 108
R CAS C ++ ++D T C + TC S+ Y G TG L
Sbjct: 125 RVQCASQTCRSLLAND------TTDACGGNPSGDDTC-----SYVNVYAPGSNTTGFLAN 173
Query: 109 DTLKVHGSSPGIIREIPKFCFGCVGSTYREP----IGIAGFGRGALSVPSQLGFLQKGFS 164
+T+ V GS G GC + P +G GF RGALS+ SQL + FS
Sbjct: 174 ETVAV-GSFVG------AAILGCSAANSTGPLVGEVGSFGFNRGALSLVSQLSVSK--FS 224
Query: 165 HCFLAFKYANDPNISSPLVIGDVAI-SSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSS 223
+ +LA A + S +++GD A+ ++ + TP+L+S +P+ +Y+ L AI + +
Sbjct: 225 Y-YLAPDEAGSSDSESVVLLGDAAVPQTRGGGRSTPLLRSTAFPDVHYVKLSAIQVDGQA 283
Query: 224 LTEVPLSLREFDSQGNGGLLVDSGTTY--THLPEPFYSQLLSILQSTITYYPRAKEVEER 281
L+ +P + + G+ G +V GT Y T L E Y+ + L S I A+EV
Sbjct: 284 LSGIPAGAFDLAADGSSGGVV-MGTLYPITRLQEDAYNAVRQALVSKI----NAQEVNGS 338
Query: 282 T----GFDLCYRVPCPNNTFTDDLFPSITFHFLNN---VSLVLPQGNHFYAMSAPSNSSA 334
FDLCY + FP IT F +L L ++F+ N +
Sbjct: 339 AFAGGVFDLCYDA----QSVATLTFPKITLVFDGGNAPATLELTTVHYFF----KDNVTG 390
Query: 335 VKCLLFQSMDDGDYGPSG-VFGSFQQQNVEVVYDLEKERIGFQ 376
++C M G P G V GS Q ++YD+ E + +
Sbjct: 391 LQCFTMLPMPVGT--PFGSVLGSMVQAGTNMIYDVGGETLTLE 431
>gi|356540369|ref|XP_003538662.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
At2g35615-like [Glycine max]
Length = 364
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 97/332 (29%), Positives = 151/332 (45%), Gaps = 42/332 (12%)
Query: 57 CLNIHSSDNP-FDPCTMSGCSLSTLLKSTCC--RPCPSFAYTYGEGGLVTGILTRDTLKV 113
C + NP FDP + C+ + +C + C + Y Y + G+L ++
Sbjct: 62 CQGCYKQKNPMFDP--LKECN--SFFDHSCSPEKAC-DYVYAYADDSATKGMLAKEIATF 116
Query: 114 HGSSPGIIREIPKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFL--QKGFSHCF 167
+ I E FGC G +G+ G G G LS+ SQ+G L K FS C
Sbjct: 117 SSTDGKPIVE--SIIFGCGHNNTGVFNENDMGLIGLGGGPLSLVSQMGNLYGSKRFSQCL 174
Query: 168 LAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEV 227
+ F DP+ S + +G+ + S + + TP++ S Y + LE I++G+ T V
Sbjct: 175 VPFH--ADPHTSGTISLGEASDVSGEGVVTTPLV-SEEGQTPYLVTLEGISVGD---TFV 228
Query: 228 PLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLC 287
P + E S+GN +++DSGT T+LP+ FY +L+ L+ I P V+ G LC
Sbjct: 229 PFNSSEMLSKGN--IMIDSGTPETYLPQEFYDRLVEELKVQINLPPI--HVDPDLGTQLC 284
Query: 288 YRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGD 347
Y+ + T+ P +T HF +LP P + V C DG
Sbjct: 285 YK------SETNLEGPILTAHFEGADVKLLP----LQTFIPPKD--GVFCFAMTGTTDGL 332
Query: 348 YGPSGVFGSFQQQNVEVVYDLEKERIGFQPMD 379
Y +FG+F Q NV + +DL+K + F+P D
Sbjct: 333 Y----IFGNFAQSNVLIGFDLDKRIVFFKPTD 360
>gi|326490656|dbj|BAJ89995.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 456
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 106/383 (27%), Positives = 163/383 (42%), Gaps = 74/383 (19%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DTGSD++WV C C C +++ S F PS SS+ S +C S+ C +
Sbjct: 142 MLIDTGSDVSWVQCK----PCSQC----HSQADSLFDPSSSSTYSAFSCTSAACAQLR-- 191
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
GCS S+ C+ + YG+G +G + DTL + S+
Sbjct: 192 --------QRGCS------SSQCQ----YTVKYGDGSTGSGTYSSDTLALGSST------ 227
Query: 124 IPKFCFGCVGST-----YREPIGIAGFGRGALSVPSQ-LGFLQKGFSHCFLAFKYANDPN 177
+ F FGC S + G+ G G GA S+ +Q G K FS+C P
Sbjct: 228 VENFQFGCSQSESGNLLQDQTAGLMGLGGGAESLATQTAGTFGKAFSYCL-----PPTPG 282
Query: 178 ISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
S L +G S+ + TPML+S P+YY + L+AI +G L +P S
Sbjct: 283 SSGFLTLG---ASTSGFVVKTPMLRSTQVPSYYGVLLQAIRVGGRQL-NIPASAF----- 333
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTF 297
+ G ++DSGT T LP YS L S ++ + YP A+ + FD C+ ++
Sbjct: 334 -SAGSIMDSGTIITRLPRTAYSALSSAFKAGMKQYPPAQPMGI---FDTCFDFSGQSSVS 389
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSF 357
P++ F + L CL F + + D G+ G+
Sbjct: 390 ----IPTVALVFSGGAVVDLASDGIILG----------SCLAFAA--NSDDTSLGIIGNV 433
Query: 358 QQQNVEVVYDLEKERIGFQPMDC 380
QQ+ EV+YD+ +GF+ C
Sbjct: 434 QQRTFEVLYDVGGGAVGFKAGAC 456
>gi|79315693|ref|NP_001030891.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332646353|gb|AEE79874.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 499
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 84/306 (27%), Positives = 135/306 (44%), Gaps = 30/306 (9%)
Query: 87 RPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE---IPKFCFGC---VGSTYREPI 140
+ CP + Y YG+ TG +T V+ ++ G E + FGC +
Sbjct: 211 QSCP-YYYWYGDSSNTTGDFAVETFTVNLTTNGGSSELYNVENMMFGCGHWNRGLFHGAA 269
Query: 141 GIAGFGRGALSVPSQLGFLQ-KGFSHCFLAFKYANDPNISSPLVIG-DVAISSKDNLQFT 198
G+ G GRG LS SQL L FS+C + +D N+SS L+ G D + S NL FT
Sbjct: 270 GLLGLGRGPLSFSSQLQSLYGHSFSYCLV--DRNSDTNVSSKLIFGEDKDLLSHPNLNFT 327
Query: 199 PML--KSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGGLLVDSGTTYTHLPEP 256
+ K + +YY+ +++I + L +P S G GG ++DSGTT ++ EP
Sbjct: 328 SFVAGKENLVDTFYYVQIKSILVAGEVLN-IPEETWNISSDGAGGTIIDSGTTLSYFAEP 386
Query: 257 FYSQLLS-ILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDLFPSITFHFLNNVSL 315
Y + + I + YP ++ D C+ V +N P + F +
Sbjct: 387 AYEFIKNKIAEKAKGKYPVYRDFPI---LDPCFNVSGIHNV----QLPELGIAFADGAVW 439
Query: 316 VLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGF 375
P N F ++ + CL + + G++QQQN ++YD ++ R+G+
Sbjct: 440 NFPTENSFIWLNED-----LVCLAMLGTPKSAFS---IIGNYQQQNFHILYDTKRSRLGY 491
Query: 376 QPMDCA 381
P CA
Sbjct: 492 APTKCA 497
>gi|356515690|ref|XP_003526531.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 439
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 103/379 (27%), Positives = 153/379 (40%), Gaps = 60/379 (15%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DT +D +VPC C C D + FSP S+S C+ C +
Sbjct: 116 LDTSTDEAFVPCSG----CTGCSD-------TTFSPKASTSYGPLDCSVPQCGQVR---- 160
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
G S C SF +Y G + L +D L++ IP
Sbjct: 161 --------GLSCPATGTGAC-----SFNQSYA-GSSFSATLVQDALRLA------TDVIP 200
Query: 126 KFCFGCVGSTYREPI---GIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNISSP 181
+ FGCV + + G+ G GRG LS+ SQ G G FS+C +FK S
Sbjct: 201 YYSFGCVNAITGASVPAQGLLGLGRGPLSLLSQSGSNYSGIFSYCLPSFK---SYYFSGS 257
Query: 182 LVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGG 241
L +G V +++ TP+L+SP P+ YY+ I++G L P F+ G
Sbjct: 258 LKLGPVG--QPKSIRTTPLLRSPHRPSLYYVNFTGISVGRV-LVPFPSEYLGFNPNTGSG 314
Query: 242 LLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDL 301
++DSGT T EP Y+ + + + FD C+ T+ + L
Sbjct: 315 TIIDSGTVITRFVEPVYNAVREEFRKQVG----GTTFTSIGAFDTCFV-----KTY-ETL 364
Query: 302 FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQN 361
P IT HF + L LP N SA S + CL + D V +FQQQN
Sbjct: 365 APPITLHF-EGLDLKLPLENSLIHSSAGS----LACLAMAAAPDNVNSVLNVIANFQQQN 419
Query: 362 VEVVYDLEKERIGFQPMDC 380
+ +++D+ ++G C
Sbjct: 420 LRILFDIVNNKVGIAREVC 438
>gi|224082314|ref|XP_002306645.1| predicted protein [Populus trichocarpa]
gi|222856094|gb|EEE93641.1| predicted protein [Populus trichocarpa]
Length = 410
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 96/394 (24%), Positives = 152/394 (38%), Gaps = 79/394 (20%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNN--KLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+DTGSDLTWV C C C R+ K +N P C++S C + +
Sbjct: 71 IDTGSDLTWVQC---DAPCKGCTKPRDKLYKPKNNLVP----------CSNSLCQAVSTG 117
Query: 64 DN-----PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSP 118
+N P D C + Y + G G+L D+ + S+
Sbjct: 118 ENYHCDAPDDQC--------------------DYEIEYADLGSSIGVLLSDSFPLRLSNG 157
Query: 119 GIIREIPKFCFGCVGSTYR-------EPIGIAGFGRGALSVPSQL---GFLQKGFSHCFL 168
+++ PK FGC + GI G GRG +S+ SQL G Q HCF
Sbjct: 158 TLLQ--PKMAFGCGYDQKHLGPHPPPDTAGILGLGRGKVSILSQLRTLGITQNVVGHCFS 215
Query: 169 AFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVP 228
+ L GD S + +TPML+S Y G + G P
Sbjct: 216 RAR-------GGFLFFGDHLFPSS-RITWTPMLRSSS-DTLYSSGPAELLFGGK-----P 261
Query: 229 LSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCY 288
++ L+ DSG++YT+ Y +L++++ + P E+ +C+
Sbjct: 262 TGIKGLQ------LIFDSGSSYTYFNAQVYQSILNLVRKDLAGKPLKDAPEKELA--VCW 313
Query: 289 RVPCPNNTFTD--DLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDG 346
+ P + D F +T F+N ++ L Y + + CL + +
Sbjct: 314 KTAKPIKSILDIKSYFKPLTISFMNAKNVQLQLAPEDYLIITKDGNV---CLGILNGSEQ 370
Query: 347 DYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
G V G Q+ V+YD EK++IG+ P +C
Sbjct: 371 QLGNFNVIGDIFMQDRVVIYDNEKQQIGWFPANC 404
>gi|224066811|ref|XP_002302227.1| predicted protein [Populus trichocarpa]
gi|222843953|gb|EEE81500.1| predicted protein [Populus trichocarpa]
Length = 422
Score = 81.6 bits (200), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 96/389 (24%), Positives = 152/389 (39%), Gaps = 71/389 (18%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS-- 63
+DTGSDLTWV C C C L + P ++R CASS C I ++
Sbjct: 85 IDTGSDLTWVQC---DAPCKGC----TKPLDKLYKPK----NNRVPCASSLCQAIQNNNC 133
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
D P + C + Y + G G+L D + ++ +++
Sbjct: 134 DIPTEQC--------------------DYEVEYADLGSSLGVLLSDYFPLRLNNGSLLQ- 172
Query: 124 IPKFCFGC-VGSTYREP------IGIAGFGRGALSVPSQL---GFLQKGFSHCFLAFKYA 173
P+ FGC Y P GI G GRG S+ SQL G Q HCF
Sbjct: 173 -PRIAFGCGYDQKYLGPHSPPDTAGILGLGRGKASILSQLRTLGITQNVVGHCFSRV--- 228
Query: 174 NDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLRE 233
L GD + + +TPML+S Y G + G P ++
Sbjct: 229 ----TGGFLFFGD-HLLPPSGITWTPMLRSSS-DTLYSSGPAELLFGGK-----PTGIKG 277
Query: 234 FDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCP 293
L+ DSG++YT+ Y +L++++ ++ P K+ E +C++ P
Sbjct: 278 LQ------LIFDSGSSYTYFNAQVYQSILNLVRKDLSGMP-LKDAPEEKALAVCWKTAKP 330
Query: 294 NNTFTD--DLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPS 351
+ D F +T +F+ ++ L Y + + CL + + G
Sbjct: 331 IKSILDIKSFFKPLTINFIKAKNVQLQLAPEDYLIITKDGNV---CLGILNGGEQGLGNL 387
Query: 352 GVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
V G Q+ VVYD E+++IG+ P +C
Sbjct: 388 NVIGDIFMQDRVVVYDNERQQIGWFPTNC 416
>gi|357120129|ref|XP_003561782.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 452
Score = 81.6 bits (200), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 111/413 (26%), Positives = 164/413 (39%), Gaps = 76/413 (18%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + +DTGS+L+W+ C D + F S SSS + C+S C +
Sbjct: 76 VTMVLDTGSELSWLLCNGSRHD-------------APFDASASSSYAPVPCSSPACTWL- 121
Query: 62 SSDNPFDP-CTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
D P P C S C +S +Y + G+L DT + GSSP
Sbjct: 122 GRDLPVRPFCDSSACRVS---------------LSYADASSADGLLAADTFLL-GSSP-- 163
Query: 121 IREIPKFCFGCVGS-------TYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYA 173
+P FGC+ S + P G+ G RG LS +Q + F++C A
Sbjct: 164 ---MPAL-FGCITSYSSSTDPSETPPTGLLGMNRGGLSFVTQTA--TRRFAYCIAA---G 214
Query: 174 NDPNISSPLVIG----DVAISS--KDNLQFTPMLK-SPMYPNY----YYIGLEAITIGNS 222
P I L++G + ++S + L +TP+++ S P + Y + LE I +G S
Sbjct: 215 QGPGI---LLLGGNDTETPLTSPPQQQLNYTPLVEISQPLPYFDRAAYTVQLEGIRVG-S 270
Query: 223 SLTEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTIT------YYPRAK 276
+L +P L D G G +VDSGT +T L Y+ L + + +T P +
Sbjct: 271 ALLAIPKHLLTPDHTGAGQTMVDSGTRFTFLLPDAYAALKAEFANQLTRSLDGGLAPLGE 330
Query: 277 EVEERTG-FDLCYR--VPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAP--SN 331
G FD C+R + L P + +V Y +
Sbjct: 331 PGFVFQGAFDACFRGTEARVSAAAAGGLLPEVGLVLRGAEVVVAGAEKLLYRVPGERRGE 390
Query: 332 SSAVKCLLFQSMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTA 384
V CL F S D + V G QQ+V V YDL R+GF CA A
Sbjct: 391 GEGVWCLTFGSSDMAGVS-AYVIGHHHQQDVWVEYDLRNARLGFAAARCADLA 442
>gi|383125857|gb|AFG43519.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
gi|383125863|gb|AFG43522.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
gi|383125867|gb|AFG43524.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
gi|383125869|gb|AFG43525.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
gi|383125871|gb|AFG43526.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
gi|383125873|gb|AFG43527.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
gi|383125877|gb|AFG43529.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
Length = 134
Score = 81.6 bits (200), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 49/128 (38%), Positives = 70/128 (54%), Gaps = 8/128 (6%)
Query: 174 NDPNISSPLVIGDVAISSKDNLQFTPMLK------SPMYPNYYYIGLEAITIGNSSLTEV 227
++ N S +V+GD A + L +TP L S Y YYYIGL A++IG + ++
Sbjct: 3 DEENQKSLMVLGDKAFPTGIPLNYTPFLTNYRAPPSSQYGVYYYIGLRAVSIGGKRM-KL 61
Query: 228 PLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLC 287
P L FD++GNGG ++DSGTT+T + + + + S I Y RA +VE TG LC
Sbjct: 62 PSKLLRFDTKGNGGTIIDSGTTFTVFHDEIFKHIAAGFASQIE-YRRAVDVEALTGMGLC 120
Query: 288 YRVPCPNN 295
Y V N
Sbjct: 121 YNVSGLEN 128
>gi|218184944|gb|EEC67371.1| hypothetical protein OsI_34484 [Oryza sativa Indica Group]
Length = 396
Score = 81.6 bits (200), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 84/385 (21%), Positives = 153/385 (39%), Gaps = 60/385 (15%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+D +L W C C C ++ + + F P+ SS+ + C ++ C +I +
Sbjct: 62 VDVAGELVWTQCSA----CRRC--FKQD--LPVFVPNASSTFKPEPCGTAVCESIPTRSC 113
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
D C+ G T L+ G +G DT + ++
Sbjct: 114 SGDVCSYKG--PPTQLR-----------------GNTSGFAATDTFAIGTATV------- 147
Query: 126 KFCFGCVGS----TYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSP 181
+ FGCV + T P G G GR S+ +Q+ + FS+C + SS
Sbjct: 148 RLAFGCVVASDIDTMDGPSGFIGLGRTPWSLVAQMKLTR--FSYCL----SPRNTGKSSR 201
Query: 182 LVIGDVA-ISSKDNLQFTPMLKSPM---YPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
L +G A ++ ++ P +K+ NYY + L+AI GN+++ +Q
Sbjct: 202 LFLGSSAKLAGSESTSTAPFIKTSPDDDGSNYYLLSLDAIRAGNTTIAT---------AQ 252
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTF 297
G L++ + + ++ L + Y + + FDLC++ F
Sbjct: 253 SGGILVMHTVSPFSLLVDSAYKAFKKAVTEAVGGAAAPPMATPPQPFDLCFK---KAAGF 309
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSF 357
+ P + F F +L +P + + +++ L ++ V GS
Sbjct: 310 SRATAPDLVFTFQGAAALTVPPAKYLIDVGEEKDTACAAILSMAWLNRTGLEGVSVLGSL 369
Query: 358 QQQNVEVVYDLEKERIGFQPMDCAS 382
QQ++V +YDL+KE + F+P DC+S
Sbjct: 370 QQEDVHFLYDLKKETLSFEPADCSS 394
>gi|255581545|ref|XP_002531578.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223528808|gb|EEF30814.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 81.6 bits (200), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 99/404 (24%), Positives = 162/404 (40%), Gaps = 59/404 (14%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + +DTGS+L+W+ C F L S F+P S + S+ C S C
Sbjct: 82 VTMVLDTGSELSWLHCKKTQF------------LNSVFNPLSSKTYSKVPCLSPTC-KTR 128
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
+ D P + L ++ S Y + + G L +T ++ +
Sbjct: 129 TRDLTI-PVSCDATKLCHVIVS------------YADATSIEGNLAFETFRLGSLTK--- 172
Query: 122 REIPKFCFGCVGSTYR-------EPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYAN 174
P FGC+ S + + G+ G RG+LS +Q+G+ + FS+C F A
Sbjct: 173 ---PATIFGCMDSGFSSNSEEDSKTTGLIGMNRGSLSFVNQMGYPK--FSYCISGFDSAG 227
Query: 175 DPNISSPLVIGDVAISSKDNLQFTPMLK-SPMYPNY----YYIGLEAITIGNSSLTEVPL 229
L++G+ + L +TP+++ S P + Y + LE I + N L+ +P
Sbjct: 228 ------VLLLGNASFPWLKPLSYTPLVQISTPLPYFDRVAYTVQLEGIKVKNKVLS-LPK 280
Query: 230 SLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEE---RTGFDL 286
S+ D G G +VDSGT +T L P Y+ L + S + + + DL
Sbjct: 281 SVFVPDHTGAGQTMVDSGTQFTFLLGPVYTALKNEFLSQTRGILKVLNDDNFVFQGAMDL 340
Query: 287 CYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDG 346
CY + + P ++ F V + + +V C F + D
Sbjct: 341 CYLLDSSRPNLQN--LPVVSLMFQGAEMSVSGERLLYRVPGEVRGRDSVWCFTFGNSDLL 398
Query: 347 DYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASAQGLH 390
+ V G QQNV + +DLEK RIG + C GL+
Sbjct: 399 GV-EAFVIGHHHQQNVWMEFDLEKSRIGLADVRCDVAGQKLGLY 441
>gi|356507997|ref|XP_003522749.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 440
Score = 81.6 bits (200), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 103/381 (27%), Positives = 155/381 (40%), Gaps = 60/381 (15%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DT +D +VPC C C D + FSP S+S C+ C +
Sbjct: 115 MVLDTSTDEAFVPCSG----CTGCSD-------TTFSPKASTSYGPLDCSVPQCGQVRGL 163
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
P +T C SF +Y G + L +D+L++
Sbjct: 164 SCP----------------ATGTGAC-SFNQSYA-GSSFSATLVQDSLRLA------TDV 199
Query: 124 IPKFCFGCVGSTYREPI---GIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNIS 179
IP + FGCV + + G+ G GRG LS+ SQ G G FS+C +FK S
Sbjct: 200 IPNYSFGCVNAITGASVPAQGLLGLGRGPLSLLSQSGSNYSGIFSYCLPSFK---SYYFS 256
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
L +G V +++ TP+L+SP P+ YY+ I++G L P F+
Sbjct: 257 GSLKLGPVG--QPKSIRTTPLLRSPHRPSLYYVNFTGISVGRV-LVPFPSEYLGFNPNTG 313
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTD 299
G ++DSGT T EP Y+ + + + FD C+ T+ +
Sbjct: 314 SGTIIDSGTVITRFVEPVYNAVREEFRKQVG----GTTFTSIGAFDTCFV-----KTY-E 363
Query: 300 DLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQ 359
L P IT HF + L LP N SA S + CL + D V +FQQ
Sbjct: 364 TLAPPITLHF-EGLDLKLPLENSLIHSSAGS----LACLAMAAAPDNVNSVLNVIANFQQ 418
Query: 360 QNVEVVYDLEKERIGFQPMDC 380
QN+ +++D ++G C
Sbjct: 419 QNLRILFDTVNNKVGIAREVC 439
>gi|414587000|tpg|DAA37571.1| TPA: hypothetical protein ZEAMMB73_036171 [Zea mays]
Length = 459
Score = 81.6 bits (200), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 106/412 (25%), Positives = 158/412 (38%), Gaps = 98/412 (23%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNI----- 60
+DT SDL W+ C C+ C YR +L F+P SSS + C S C +
Sbjct: 109 IDTASDLVWMQCQP----CVSC--YR--QLDPVFNPKLSSSYAVVPCTSDTCAQLDGHRC 160
Query: 61 HSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
H D+ C + Y Y G+ G L D L + G
Sbjct: 161 HEDDD-----------------GAC-----QYTYKYSGHGVTKGTLAIDKLAIGGDV--- 195
Query: 121 IREIPKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDP 176
FGC VG + G+ G GRG LS+ SQL H F+
Sbjct: 196 ---FHAVVFGCSDSSVGGPAAQASGLVGLGRGPLSLVSQLSV------HRFMYCLPPPMS 246
Query: 177 NISSPLVIG---DVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLRE 233
S LV+G D + D + T M S YP+YYY+ L+ + +G+ + P + R
Sbjct: 247 RTSGKLVLGAGADAVRNMSDRVTVT-MSSSTRYPSYYYLNLDGLAVGD----QTPGTTRN 301
Query: 234 FDSQGNG----------------------GLLVDSGTTYTHLPEPFYSQLLSILQSTITY 271
S +G G++VD +T + L Y +L L+ I
Sbjct: 302 ATSPPSGGAGGGGGGGGGGIVGAGGANAYGMIVDVASTISFLETSLYDELADDLEEEIR- 360
Query: 272 YPRAKEVEERTGFDLCYRVPCPNNTFTDDLF-PSITFHFLNNVSLVLPQGNHFYAMSAPS 330
PRA R G DLC+ + P D ++ P+++ F + L L + F
Sbjct: 361 LPRATP-SLRLGLDLCFIL--PEGVGMDRVYVPTVSLSF-DGRWLELDRDRLFV------ 410
Query: 331 NSSAVKCLLFQSMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCAS 382
+ CL+ G + G+FQ QN+ V+++L + +I F C S
Sbjct: 411 TDGRMMCLMI-----GRTSGVSILGNFQLQNMRVLFNLRRGKITFAKASCDS 457
>gi|383125861|gb|AFG43521.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
Length = 134
Score = 81.6 bits (200), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 49/128 (38%), Positives = 70/128 (54%), Gaps = 8/128 (6%)
Query: 174 NDPNISSPLVIGDVAISSKDNLQFTPMLK------SPMYPNYYYIGLEAITIGNSSLTEV 227
++ N S +V+GD A + L +TP L S Y YYYIGL A++IG + ++
Sbjct: 3 DEENQKSLMVLGDKAFPNGIPLNYTPFLTNYRAPPSSQYGVYYYIGLRAVSIGGKRM-KL 61
Query: 228 PLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLC 287
P L FD++GNGG ++DSGTT+T + + + + S I Y RA +VE TG LC
Sbjct: 62 PSKLLRFDAKGNGGTIIDSGTTFTVFHDEIFKHIAAGFASQIE-YRRAVDVEALTGMGLC 120
Query: 288 YRVPCPNN 295
Y V N
Sbjct: 121 YNVSGLEN 128
>gi|296087361|emb|CBI33735.3| unnamed protein product [Vitis vinifera]
Length = 633
Score = 81.3 bits (199), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 101/387 (26%), Positives = 154/387 (39%), Gaps = 71/387 (18%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGS LT+VPC C C +++ NF P SS+
Sbjct: 109 VDTGSTLTYVPCST----CEQCGKHQD----PNFQPDWSST------------------- 141
Query: 66 PFDP--CTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
+ P C+M S ++ R Y E +G+L D + S +
Sbjct: 142 -YQPLKCSMECTCDSEMMHCVYDR-------QYAEMSSSSGVLGEDIVSFGKQSE---LK 190
Query: 124 IPKFCFGC----VGSTYREPI-GIAGFGRGALSVPSQL---GFLQKGFSHCFLAFKYAND 175
+ FGC G Y + GI G GRG LS+ QL G + FS C+
Sbjct: 191 PQRTVFGCENVETGDIYSQRADGIMGLGRGDLSIVDQLVEKGVIGNSFSLCYGGMDVGG- 249
Query: 176 PNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFD 235
+V+G IS + FT P YY I L+ I I L P++ FD
Sbjct: 250 ----GAMVLG--GISPPAGMVFTH--SDPARSAYYNIDLKEIHIAGKQL---PINPMVFD 298
Query: 236 SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNN 295
G G ++DSGTTY +LPEP + + + + + +R D+C+ +
Sbjct: 299 --GKYGTILDSGTTYAYLPEPAFKAFKDAIMKELNSL-KLIQGPDRNYNDICFSGVGSDV 355
Query: 296 TFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCL-LFQSMDDGDYGPSGVF 354
+ FP++ F N L L N+ + S + CL +FQ+ +D + +
Sbjct: 356 SQLSKTFPAVDLVFSNGNRLSLSPENYLFQH---SKAHGAYCLGIFQNEND----QTTLL 408
Query: 355 GSFQQQNVEVVYDLEKERIGFQPMDCA 381
G +N V+YD E +IGF +C+
Sbjct: 409 GGIIVRNTLVMYDREHLKIGFWKTNCS 435
>gi|224143371|ref|XP_002324933.1| predicted protein [Populus trichocarpa]
gi|222866367|gb|EEF03498.1| predicted protein [Populus trichocarpa]
Length = 463
Score = 81.3 bits (199), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 96/383 (25%), Positives = 150/383 (39%), Gaps = 73/383 (19%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ DTGS L W C C C + F P++S+S C+S C +I
Sbjct: 147 LIFDTGSGLIWTQCK----PCKAC-----YPKVPVFDPTKSASFKGLPCSSKLCQSI--- 194
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
+ C P ++ Y + TG L +T+ S + +
Sbjct: 195 ------------------RQGCSSPKCTYLTAYVDNSSSTGTLATETI----SFSHLKYD 232
Query: 124 IPKFCFGCVGSTYREPIG---IAGFGRGALSVPSQLG-FLQKGFSHCFLAFKYANDPNIS 179
GC E +G I G R +S+ SQ K FS+C S
Sbjct: 233 FKNILIGCSDQVSGESLGESGIMGLNRSPISLASQTANIYDKLFSYCIP----------S 282
Query: 180 SPLVIGDVAISSK--DNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
+P G + K ++++F+P+ K+ +Y I + I++G L + S + S
Sbjct: 283 TPGSTGHLTFGGKVPNDVRFSPVSKTAPSSDYD-IKMTGISVGGRKLL-IDASAFKIAST 340
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTF 297
+DSG T LP YS L S+ + + YP +++ D CY + +
Sbjct: 341 ------IDSGAVLTRLPPKAYSALRSVFREMMKGYPL---LDQDDFLDTCYDF----SNY 387
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSF 357
+ PSI+ F V + + + + S V CL F +DD +FG+F
Sbjct: 388 STVAIPSISVFFEGGVEMDIDVSGIMWQVPG----SKVYCLAFAELDD----EVSIFGNF 439
Query: 358 QQQNVEVVYDLEKERIGFQPMDC 380
QQ+ VV+D KERIGF P C
Sbjct: 440 QQKTYTVVFDGAKERIGFAPGGC 462
>gi|225438908|ref|XP_002279194.1| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
Length = 634
Score = 81.3 bits (199), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 101/387 (26%), Positives = 154/387 (39%), Gaps = 71/387 (18%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGS LT+VPC C C +++ NF P SS+
Sbjct: 109 VDTGSTLTYVPCST----CEQCGKHQD----PNFQPDWSST------------------- 141
Query: 66 PFDP--CTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
+ P C+M S ++ R Y E +G+L D + S +
Sbjct: 142 -YQPLKCSMECTCDSEMMHCVYDR-------QYAEMSSSSGVLGEDIVSFGKQSE---LK 190
Query: 124 IPKFCFGC----VGSTYREPI-GIAGFGRGALSVPSQL---GFLQKGFSHCFLAFKYAND 175
+ FGC G Y + GI G GRG LS+ QL G + FS C+
Sbjct: 191 PQRTVFGCENVETGDIYSQRADGIMGLGRGDLSIVDQLVEKGVIGNSFSLCYGGMDVGG- 249
Query: 176 PNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFD 235
+V+G IS + FT P YY I L+ I I L P++ FD
Sbjct: 250 ----GAMVLG--GISPPAGMVFTH--SDPARSAYYNIDLKEIHIAGKQL---PINPMVFD 298
Query: 236 SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNN 295
G G ++DSGTTY +LPEP + + + + + +R D+C+ +
Sbjct: 299 --GKYGTILDSGTTYAYLPEPAFKAFKDAIMKELNSL-KLIQGPDRNYNDICFSGVGSDV 355
Query: 296 TFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCL-LFQSMDDGDYGPSGVF 354
+ FP++ F N L L N+ + S + CL +FQ+ +D + +
Sbjct: 356 SQLSKTFPAVDLVFSNGNRLSLSPENYLFQH---SKAHGAYCLGIFQNEND----QTTLL 408
Query: 355 GSFQQQNVEVVYDLEKERIGFQPMDCA 381
G +N V+YD E +IGF +C+
Sbjct: 409 GGIIVRNTLVMYDREHLKIGFWKTNCS 435
>gi|361067987|gb|AEW08305.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
gi|383125859|gb|AFG43520.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
gi|383125865|gb|AFG43523.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
gi|383125875|gb|AFG43528.1| Pinus taeda anonymous locus 2_6033_01 genomic sequence
Length = 134
Score = 81.3 bits (199), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 49/128 (38%), Positives = 70/128 (54%), Gaps = 8/128 (6%)
Query: 174 NDPNISSPLVIGDVAISSKDNLQFTPMLK------SPMYPNYYYIGLEAITIGNSSLTEV 227
++ N S +V+GD A + L +TP L S Y YYYIGL A++IG + ++
Sbjct: 3 DEENQKSLMVLGDKAFPNGIPLNYTPFLTNYRAPPSSQYGVYYYIGLRAVSIGGKRM-KL 61
Query: 228 PLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLC 287
P L FD++GNGG ++DSGTT+T + + + + S I Y RA +VE TG LC
Sbjct: 62 PSKLLRFDTKGNGGTIIDSGTTFTVFHDEIFKHIAAGFASQIE-YRRAVDVEALTGMGLC 120
Query: 288 YRVPCPNN 295
Y V N
Sbjct: 121 YNVSGLEN 128
>gi|357118871|ref|XP_003561171.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 506
Score = 81.3 bits (199), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 105/391 (26%), Positives = 159/391 (40%), Gaps = 74/391 (18%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DT SD+ WV C C Y + ++ + P++S S+ C+S C ++
Sbjct: 176 MVVDTASDVPWVQCA----PCPQPQCYAQSDVL--YDPTKSILSAPFPCSSPQCRSLGRY 229
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
N CT +G + TC + Y +G +G D L ++ G +
Sbjct: 230 ANG---CTGAGNT------GTC-----QYRVLYPDGSGTSGTYVSDLLTLNADPKGAVS- 274
Query: 124 IPKFCFGCV------GSTYREPIGIAGFGRGALSVPSQL-GFLQKG--FSHCFLAFKYAN 174
KF FGC GS + G GRGA S+ SQ G KG FS+C
Sbjct: 275 --KFQFGCSHALLRPGSFNNKTAGFMALGRGAQSLSSQTKGTFSKGNVFSYCL------- 325
Query: 175 DPNISSP--LVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLR 232
P S L +G V + TPMLKS M P Y + L I + L VP ++
Sbjct: 326 PPTGSHKGFLSLG-VPQHAASRYAVTPMLKSKMAPMIYMVRLIGIDVAGQRL-PVPPAVF 383
Query: 233 EFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYR--- 289
++ +DS T T LP Y L + ++ + Y + V + D CY
Sbjct: 384 AANAA------MDSRTIITRLPPTAYMALRAAFRAQMRAY---RAVAPKGQLDTCYDFTG 434
Query: 290 VPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYG 349
VP P +T F N ++ L PS CL F + + D+
Sbjct: 435 VPMVR-------LPKVTLVFDRNAAVEL----------DPSGVMLDSCLAF-APNANDFM 476
Query: 350 PSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
P G+ G+ QQQ +EV+Y+++ +GF+ C
Sbjct: 477 P-GIIGNVQQQTLEVLYNVDGASVGFRRAAC 506
>gi|255583547|ref|XP_002532530.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223527742|gb|EEF29846.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 440
Score = 81.3 bits (199), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 100/391 (25%), Positives = 162/391 (41%), Gaps = 53/391 (13%)
Query: 3 QVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHS 62
Q+ +DTGS L+W+ C + ++F PS SSS S C C
Sbjct: 94 QMVLDTGSQLSWI-------QCHKKSVPKKPPPTTSFDPSLSSSFSVLPCNHPLC----- 141
Query: 63 SDNPFDPCTMSGCSLSTLLKSTC--CRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
P P L +TC R C ++Y Y +G G L R+ + S
Sbjct: 142 --KPRIP--------DFTLPTTCDQNRLC-HYSYFYADGTYAEGSLVREKITFSSS---- 186
Query: 121 IREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISS 180
+ P GC ++ E GI G G S SQ + FS+C + + +
Sbjct: 187 -QSTPPLILGCAEASTDEK-GILGMNLGRRSFASQAKISK--FSYCVPTRQARAGLSSTG 242
Query: 181 PLVIGDVAISSK----DNLQFTPMLKSP-MYPNYYYIGLEAITIGNSSLTEVPLSLREFD 235
+G+ S + + L FTP +SP + P Y I ++ I +GN+ L + +L D
Sbjct: 243 SFYLGNNPNSGRFQYINLLTFTPSQRSPNLDPLAYTIPMQGIRMGNARL-NISATLFRPD 301
Query: 236 SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGF-DLCYRVPCPN 294
G G ++DSG+ +T+L + Y+++ + + P+ K+ G D+C+ N
Sbjct: 302 PSGAGQTIIDSGSEFTYLVDEAYNKVREEVVRLVG--PKLKKGYVYGGVSDMCFD---GN 356
Query: 295 NTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCL-LFQSMDDGDYGPSGV 353
L ++ F F V +V+ + + V C+ + +S G S +
Sbjct: 357 PMEIGRLIGNMVFEFEKGVEIVIDKWRVLADVGG-----GVHCIGIGRSEMLG--AASNI 409
Query: 354 FGSFQQQNVEVVYDLEKERIGFQPMDCASTA 384
G+F QQN+ V YDL RIG DC+ +
Sbjct: 410 IGNFHQQNLWVEYDLANRRIGLGKADCSRSV 440
>gi|54290724|dbj|BAD62394.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 523
Score = 80.9 bits (198), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 91/353 (25%), Positives = 140/353 (39%), Gaps = 57/353 (16%)
Query: 39 FSPSRSSSSSRDTCASSFCLNIHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGE 98
F PSRSSS + C S C CT + C F +G
Sbjct: 217 FEPSRSSSFAAIPCGSPECAV---------ECTGASCP---------------FTIQFGN 252
Query: 99 GGLVTGILTRDTLKVHGSSPGIIREIPKFCFGCV-----GSTYREPIGIAGFGRGALSVP 153
+ G L RDTL + S+ F FGC+ T+ +G+ R + S+
Sbjct: 253 VTVANGTLVRDTLTLPPSA-----TFAGFTFGCIEVGADADTFDGAVGLIDLSRSSHSLA 307
Query: 154 SQLGFLQKGFSHCFLAFKYANDPNISSPLVIGDVAIS------SKDNLQFTPMLKSPMYP 207
S++ + G + AF Y P+ S+ G ++I S ++++ PM +P +P
Sbjct: 308 SRV--ISNGATTSAAAFSYCL-PSSSATSSRGFLSIGASRPEYSGGDIKYAPMSSNPNHP 364
Query: 208 NYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQS 267
N Y++ L I++G L P+ F + G L+++ T +T L Y+ L +
Sbjct: 365 NSYFVDLVGISVGGEDL---PVPPAVFAAHGT---LLEAATEFTFLAPAAYAALRDAFRK 418
Query: 268 TITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMS 327
+ YP A D CY + P++ F L L Y
Sbjct: 419 DMAPYPAAPPFRV---LDTCYNL----TGLASLAVPAVALRFAGGTELELDVRQMMYFAD 471
Query: 328 APSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
S S+V CL F + + S V G+ Q++ EVVYDL R+GF P C
Sbjct: 472 PSSVFSSVACLAFAAAPLPAFPVS-VIGTLAQRSTEVVYDLRGGRVGFIPGRC 523
>gi|358346726|ref|XP_003637416.1| Aspartic proteinase nepenthesin-2, partial [Medicago truncatula]
gi|355503351|gb|AES84554.1| Aspartic proteinase nepenthesin-2, partial [Medicago truncatula]
Length = 165
Score = 80.9 bits (198), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 56/181 (30%), Positives = 77/181 (42%), Gaps = 16/181 (8%)
Query: 200 MLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYS 259
+ ++P YYY+GL I++G L +P + E DS GNGG++VDSGT T L Y+
Sbjct: 1 LRRNPQLDTYYYVGLVGISVGGE-LLAIPETSFEVDSAGNGGIIVDSGTAVTRLQSDVYN 59
Query: 260 QLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQ 319
+ EV FD CY + T P++ FHF LVLP
Sbjct: 60 VVRDAFVKGTKDLLATNEVSL---FDTCYDLSSK----TSVEVPTVAFHFGEGKVLVLPA 112
Query: 320 GNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMD 379
N+ P +S C F + G+ QQQ V +DL +GF P
Sbjct: 113 KNYL----VPVDSVGTFCFAFAPT----MSSLSIIGNIQQQGTRVSFDLANSLVGFSPNR 164
Query: 380 C 380
C
Sbjct: 165 C 165
>gi|357122155|ref|XP_003562781.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 629
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 100/389 (25%), Positives = 161/389 (41%), Gaps = 71/389 (18%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +D+GS +T+VPC + C C ++++ + F P SS+ S C++
Sbjct: 100 LIVDSGSTVTYVPCAS----CEQCGNHQDPR----FQPDLSSTYSPVKCSAD-------- 143
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
CT C KS C ++ Y E +G+L D + S G E
Sbjct: 144 ------CT---CDSD---KSQC-----TYERQYAEMSSSSGVLGEDIV-----SFGTESE 181
Query: 124 IP--KFCFGCVGSTY-----REPIGIAGFGRGALSVPSQL---GFLQKGFSHCFLAFKYA 173
+ + FGC S + GI G GRG LS+ QL G + FS C+
Sbjct: 182 LKPQRAVFGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIG 241
Query: 174 NDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLRE 233
+V+G A+ + ++ F+ P+ YY I L+ I + +L P R
Sbjct: 242 G-----GAMVLG--AMPAPPDMVFS--RSDPVRSPYYNIELKEIHVAGKALRLDP---RI 289
Query: 234 FDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCP 293
FDS+ G ++DSGTTY +LPE + + S + + + + D+C+
Sbjct: 290 FDSKH--GTVLDSGTTYAYLPEQAFVAFKDAVTSKVRPLKKIRGPDPNYK-DICFAGAGR 346
Query: 294 NNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCL-LFQSMDDGDYGPSG 352
N + FP + F + L L N+ + S CL +FQ+ D P+
Sbjct: 347 NVSQLSQAFPDVDMVFGDGQKLSLSPENYLFRH---SKVEGAYCLGVFQNGKD----PTT 399
Query: 353 VFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
+ G +N V YD E+IGF +C+
Sbjct: 400 LLGGIVVRNTLVTYDRHNEKIGFWKTNCS 428
>gi|297720195|ref|NP_001172459.1| Os01g0608366 [Oryza sativa Japonica Group]
gi|53792202|dbj|BAD52835.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
Group]
gi|255673454|dbj|BAH91189.1| Os01g0608366 [Oryza sativa Japonica Group]
Length = 452
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 109/379 (28%), Positives = 154/379 (40%), Gaps = 75/379 (19%)
Query: 3 QVYMDTGSDLTWV---PCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLN 59
+V +DTGSD++WV PC S C + + F P+ SS+ + C+++ C
Sbjct: 122 RVVIDTGSDVSWVQCEPCPAPS----PCHAHAG----ALFDPAASSTYAAFNCSAAACAQ 173
Query: 60 IHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPG 119
+ S +GC KS C + YG+G TG + D L + GS
Sbjct: 174 LGDSGE------ANGCDA----KSRC-----QYIVKYGDGSNTTGTYSSDVLTLSGSD-- 216
Query: 120 IIREIPKFCFGC----VGSTYREPI-GIAGFGRGALSVPSQLGF-LQKGFSHCFLAFKYA 173
++R F FGC +G+ + G+ G G A S SQ K F +C A
Sbjct: 217 VVR---GFQFGCSHAELGAGMDDKTDGLIGLGGDAQSPVSQTAARYGKSFFYCLPA---- 269
Query: 174 NDPNISSPLVIGDVAISSKD---NLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLS 230
P S L +G A TPML+S P YY+ LE I +G L P
Sbjct: 270 -TPASSGFLTLGAPASGGGGGASRFATTPMLRSKKVPTYYFAALEDIAVGGKKLGLSP-- 326
Query: 231 LREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRV 290
S G LVDSGT T LP Y+ L S ++ +T Y RA+ + D C+
Sbjct: 327 -----SVFAAGSLVDSGTVITRLPPAAYAALSSAFRAGMTRYARAEPLGI---LDTCF-- 376
Query: 291 PCPNNTFTDDL-FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQ-SMDDGDY 348
N T D + P++ F V+ H CL F + DD +
Sbjct: 377 ---NFTGLDKVSIPTVALVFAGGA--VVDLDAHGIVSGG--------CLAFAPTRDDKAF 423
Query: 349 GPSGVFGSFQQQNVEVVYD 367
G G+ QQ+ EV+YD
Sbjct: 424 ---GTIGNVQQRTFEVLYD 439
>gi|51038078|gb|AAT93881.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 481
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 103/422 (24%), Positives = 160/422 (37%), Gaps = 73/422 (17%)
Query: 3 QVYMDTGSDLTWVPCGNLSFDCMDCDD----YRNNKLMSNFSPSRSSSSSRDTCASSFCL 58
+ +DTGSDL W C + N NFS SR++ +
Sbjct: 92 EAVVDTGSDLVWTQCSTCRLPAAAAAGGGGCFPQNLPYYNFSLSRTARA----------- 140
Query: 59 NIHSSDNPFDPCTMS----GCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVH 114
+ D+ C ++ GC+ C A +YG G+ G+L D
Sbjct: 141 -VPCDDDDGALCGVAPETAGCARGGGSGDDAC----VVAASYG-AGVALGVLGTDAFTFP 194
Query: 115 GSSPGIIREIPKFCFGCVGSTYREP------IGIAGFGRGALSVPSQLGFLQKGFSHCFL 168
SS + FGCV T P GI G GRGALS+ SQL + FS+C
Sbjct: 195 SSSSVTL------AFGCVSQTRISPGALNGASGIIGLGRGALSLVSQLNATE--FSYCLT 246
Query: 169 AFKYANDPNISSPLVIGD-----------VAISSKDNLQFTPMLKSPM---YPNYYYIGL 214
Y D S L +GD + P K+P + +YY+ L
Sbjct: 247 --PYFRDTVSPSHLFVGDGELAGLSAAAGGGGGGGAPVTTVPFAKNPKDSPFSTFYYLPL 304
Query: 215 EAITIGNS--SLTEVPLSLREFDSQ-GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITY 271
+ GN+ +L LRE + GG L+DSG+ +T L +P + L L +
Sbjct: 305 VGLAAGNATVALPAGAFDLREAAPKVWAGGALIDSGSPFTRLVDPAHRALTKELARQLRG 364
Query: 272 YPRAKEVEERTG--FDLCYRVPCPNNTFTDDLFPSITFHFLNNV----SLVLPQGNHFYA 325
+ G +LC ++ P + F + V LV+P ++
Sbjct: 365 SGSLVPPPAKLGGALELCVEAGDDGDSLAAAAVPPLVLRFDDGVGGGRELVIPAEKYWAR 424
Query: 326 MSAPSNSSAVKCLLFQSMDDGDY----GPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
+ A + C+ S G+ + + G+F QQ++ V+YDL + FQP +C+
Sbjct: 425 VEA-----STWCMAVVSSASGNATLPTNETTIIGNFMQQDMRVLYDLANGLLSFQPANCS 479
Query: 382 ST 383
+
Sbjct: 480 AV 481
>gi|356555248|ref|XP_003545946.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 95/390 (24%), Positives = 150/390 (38%), Gaps = 78/390 (20%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DTGS+ TW+ C SF+ + TCAS C
Sbjct: 128 LVVDTGSEFTWLNCSK-SFEAV-------------------------TCASRKC------ 155
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPS----FAYTYGEGGLVTGILTRDTLKVHGSSPG 119
LS L + C P PS + +Y +G G D++ V G + G
Sbjct: 156 ----------KVDLSELFSLSVC-PKPSDPCLYDISYADGSSAKGFFGTDSITV-GLTNG 203
Query: 120 IIREIPKFCFGCVGSTY------REPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYA 173
++ GC S E GI G G S F+ K + F Y
Sbjct: 204 KQGKLNNLTIGCTKSMLNGVNFNEETGGILGLGFAKDS------FIDKAANKYGAKFSYC 257
Query: 174 NDPNISSPLVIGDVAISSKDNLQFTPMLKSP---MYPNYYYIGLEAITIGNSSLTEVPLS 230
++S V ++ I N + ++ ++P +Y + + I+IG L ++P
Sbjct: 258 LVDHLSHRSVSSNLTIGGHHNAKLLGEIRRTELILFPPFYGVNVVGISIGGQML-KIPPQ 316
Query: 231 LREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRV 290
+ +F+++G G L+DSGTT T L P Y + L ++T R E+ + C+
Sbjct: 317 VWDFNAEG--GTLIDSGTTLTSLLLPAYEAVFEALTKSLTKVKRVTG-EDFDALEFCFDA 373
Query: 291 PCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGP 350
F D + P + FHF P ++ + AP VKC+ +D G
Sbjct: 374 ----EGFDDSVVPRLVFHFAGGARFEPPVKSYIIDV-AP----LVKCIGIVPID--GIGG 422
Query: 351 SGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
+ V G+ QQN +DL +GF P C
Sbjct: 423 ASVIGNIMQQNHLWEFDLSTNTVGFAPSTC 452
>gi|125596976|gb|EAZ36756.1| hypothetical protein OsJ_21092 [Oryza sativa Japonica Group]
Length = 435
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 91/353 (25%), Positives = 140/353 (39%), Gaps = 57/353 (16%)
Query: 39 FSPSRSSSSSRDTCASSFCLNIHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGE 98
F PSRSSS + C S C CT + C F +G
Sbjct: 129 FEPSRSSSFAAIPCGSPECAV---------ECTGASCP---------------FTIQFGN 164
Query: 99 GGLVTGILTRDTLKVHGSSPGIIREIPKFCFGCV-----GSTYREPIGIAGFGRGALSVP 153
+ G L RDTL + S+ F FGC+ T+ +G+ R + S+
Sbjct: 165 VTVANGTLVRDTLTLPPSA-----TFAGFTFGCIEVGADADTFDGAVGLIDLSRSSHSLA 219
Query: 154 SQLGFLQKGFSHCFLAFKYANDPNISSPLVIGDVAIS------SKDNLQFTPMLKSPMYP 207
S++ + G + AF Y P+ S+ G ++I S ++++ PM +P +P
Sbjct: 220 SRV--ISNGATTSAAAFSYCL-PSSSATSSRGFLSIGASRPEYSGGDIKYAPMSSNPNHP 276
Query: 208 NYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQS 267
N Y++ L I++G L P+ F + G L+++ T +T L Y+ L +
Sbjct: 277 NSYFVDLVGISVGGEDL---PVPPAVFAAHGT---LLEAATEFTFLAPAAYAALRDAFRK 330
Query: 268 TITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMS 327
+ YP A D CY + P++ F L L Y
Sbjct: 331 DMAPYPAAPPFRV---LDTCYNL----TGLASLAVPAVALRFAGGTELELDVRQMMYFAD 383
Query: 328 APSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
S S+V CL F + + S V G+ Q++ EVVYDL R+GF P C
Sbjct: 384 PSSVFSSVACLAFAAAPLPAFPVS-VIGTLAQRSTEVVYDLRGGRVGFIPGRC 435
>gi|326511104|dbj|BAK03251.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 631
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 99/392 (25%), Positives = 160/392 (40%), Gaps = 75/392 (19%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +D+GS +T+VPC + C C ++++ + F P SS+ S C N+ +
Sbjct: 103 LIVDSGSTVTYVPCAS----CEQCGNHQDPR----FQPDLSSTYSPVKC------NVDCT 148
Query: 64 -DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIR 122
D+ + CT + Y E +G+L D + S G
Sbjct: 149 CDSDKNQCT--------------------YERQYAEMSSSSGVLGEDIV-----SFGTES 183
Query: 123 EIP--KFCFGCVGSTY-----REPIGIAGFGRGALSVPSQL---GFLQKGFSHCFLAFKY 172
E+ + FGC S + GI G GRG LS+ QL G + FS C+
Sbjct: 184 ELKPQRAVFGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDI 243
Query: 173 ANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLR 232
+V+G + + ++SP YY I L+ + + +L P R
Sbjct: 244 GG-----GAMVLGAMPAPPGMIYTHSNAVRSP----YYNIELKEMHVAGKALRVDP---R 291
Query: 233 EFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGF-DLCYRVP 291
FD G G ++DSGTTY +LPE + + S + +P K + + D+C+
Sbjct: 292 IFD--GKHGTVLDSGTTYAYLPEQAFVAFKDAVSSQV--HPLKKIRGPDSNYKDICFAGA 347
Query: 292 CPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCL-LFQSMDDGDYGP 350
N + ++FP + F N L L N+ + S CL +FQ+ D P
Sbjct: 348 GRNVSQLSEVFPKVDMVFGNGQKLSLSPENYLFRH---SKVEGAYCLGVFQNGKD----P 400
Query: 351 SGVFGSFQQQNVEVVYDLEKERIGFQPMDCAS 382
+ + G +N V YD E+IGF +C+
Sbjct: 401 TTLLGGIVVRNTLVTYDRHNEKIGFWKTNCSE 432
>gi|357124468|ref|XP_003563922.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 450
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 95/388 (24%), Positives = 159/388 (40%), Gaps = 70/388 (18%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +D+GS LTW+ C + C + + + P SS+ + C++ C + ++
Sbjct: 123 MVVDSGSSLTWLQCAPCAVSC-------HPQAGPLYDPRASSTYAAVPCSAPQCAELQAA 175
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
+P + SG S C+ + +YG+G G L++DT+ + S
Sbjct: 176 T--LNPSSCSG--------SGVCQ----YQASYGDGSFSFGYLSKDTVSLSSSG-----S 216
Query: 124 IPKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLG-FLQKGFSHCFLAFKYANDPNI 178
P F +GC VG + G+ G R LS+ SQL + F++C A+ +
Sbjct: 217 FPGFYYGCGQDNVG-LFGRAAGLIGLARNKLSLLSQLAPSVGNSFAYCLPTSAAASAGYL 275
Query: 179 SSPLVIGDVAISSKDN-----LQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLRE 233
S G S+ DN +T M+ S + + Y++ L +++ S L VP S E
Sbjct: 276 S----FG----SNSDNKNPGKYSYTSMVSSSLDASLYFVSLAGMSVAGSPL-AVPSS--E 324
Query: 234 FDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCP 293
+ G+ ++DSGT T LP P Y+ L + A + C++
Sbjct: 325 Y---GSLPTIIDSGTVITRLPTPVYTAL----SKAVGAALAAPSAPAYSILQTCFK---- 373
Query: 294 NNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGV 353
P++ F +L L GN ++ CL F D + +
Sbjct: 374 -GQVAKLPVPAVNMAFAGGATLRLTPGNVLVDVN-----ETTTCLAFAPTDS-----TAI 422
Query: 354 FGSFQQQNVEVVYDLEKERIGFQPMDCA 381
G+ QQQ VVYD++ RIGF C+
Sbjct: 423 IGNTQQQTFSVVYDVKGSRIGFAAGGCS 450
>gi|359476206|ref|XP_002262837.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 462
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 95/377 (25%), Positives = 153/377 (40%), Gaps = 65/377 (17%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + +DTGSD TW+ C + S +C +NK + F+PS SSS S +C S N
Sbjct: 142 LNLIIDTGSDTTWIRCNSCSLG--NC----HNKKIPTFNPSLSSSYSNRSCIPSTKTN-- 193
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
+ Y + G+ D + + P +
Sbjct: 194 ------------------------------YTMNYEDNSYSKGVFVCDEVTL---KPDVF 220
Query: 122 REIPKFCFGCVGSTYREPIGIAGFGRGA-LSVPSQLG-FLQKGFSHCFLAFKYANDPNIS 179
+ C G + G+ G +G S+ SQ +K FS+CF ++ N
Sbjct: 221 PKFQFGCGDSGGGDFGSASGVLGLAQGEQYSLISQTASKFKKKFSYCF-----PHNENTR 275
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
L+ G+ AIS+ +L+FT +L +P + Y++ L I++ L +S F S G
Sbjct: 276 GSLLFGEKAISASPSLKFTRLL-NPSSGSVYFVELIGISVAKKRLN---VSSSLFASPGT 331
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVP-CPNNTFT 298
++DSGT THLP Y L + Q + + P + D CY + C
Sbjct: 332 ---IIDSGTVITHLPTAAYEALRTAFQQEMLHCPSVSPPPQEKPLDTCYNLKGCGGRNIK 388
Query: 299 DDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQ 358
P I HF+ V + L +A + + CL F + + G+ Q
Sbjct: 389 ---LPEIVLHFVGEVDVSLHPSGILWANGDLTQA----CLAFARKSHPSH--VTIIGNRQ 439
Query: 359 QQNVEVVYDLEKERIGF 375
Q +++VVYD+E R+GF
Sbjct: 440 QVSLKVVYDIEGGRLGF 456
>gi|356500756|ref|XP_003519197.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 451
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 99/380 (26%), Positives = 160/380 (42%), Gaps = 59/380 (15%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSS-SSRDTCASSFCLNIHSSD 64
+DT +D WVPC C C + +SP S++ C + C +
Sbjct: 125 LDTSTDEAWVPCTG----CTGCSSSS-----TYYSPQASTTYGGAVACYAPRCAQARGAL 175
Query: 65 NPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREI 124
PC +G S C +F +Y G + L +D+L++ I +
Sbjct: 176 ----PCPYTG--------SKAC----TFNQSYA-GSTFSATLVQDSLRLG------IDTL 212
Query: 125 PKFCFGCVGST--YREPI-GIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNISS 180
P + FGCV S + P G+ G GRG LS+PSQ L G FS+C +F+ + S
Sbjct: 213 PSYAFGCVNSASGWTLPAQGLLGLGRGPLSLPSQSSKLYSGIFSYCLPSFQSSY---FSG 269
Query: 181 PLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNG 240
L +G + ++ TP+L++P P+ YY+ L +T+G + +P+ FD
Sbjct: 270 SLKLGPTGQPRR--IRTTPLLQNPRRPSLYYVNLTGVTVGRVKV-PLPIEYLAFDPNKGS 326
Query: 241 GLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDD 300
G ++DSGT T P YS + ++ + R GFD C+ T+ ++
Sbjct: 327 GTILDSGTVITRFVGPVYSAIRDEFRNQV-----KGPFFSRGGFDTCFV-----KTY-EN 375
Query: 301 LFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQ 360
L P I F + + LP N + + CL + + V ++QQQ
Sbjct: 376 LTPLIKLRF-TGLDVTLPYENTLIHTAY----GGMACLAMAAAPNNVNSVLNVIANYQQQ 430
Query: 361 NVEVVYDLEKERIGFQPMDC 380
N+ V++D R+G C
Sbjct: 431 NLRVLFDTVNNRVGIARELC 450
>gi|356531224|ref|XP_003534178.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 492
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 97/396 (24%), Positives = 150/396 (37%), Gaps = 94/396 (23%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DTGSD+ W+ C C DC Y+ + + F P+ SSS + TC + C ++
Sbjct: 172 MVLDTGSDVNWLQCK----PCSDC--YQQSDPI--FDPTASSSYNPLTCDAQQCQDLE-- 221
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
MS C L + +YG+G G +T+
Sbjct: 222 --------MSACRNGKCL----------YQVSYGDGSFTVGEYVTETVS----------- 252
Query: 124 IPKFCFGCVGSTYREPIGIAGFGRGAL--------------SVPSQLGFLQKGFSHCFLA 169
FG GS R IG G S+ SQ+ FS+C +
Sbjct: 253 -----FG-AGSVNRVAIGCGHDNEGLFVGSAGLLGLGGGPLSLTSQIK--ATSFSYCLVD 304
Query: 170 FKYANDPNISSPLVI-----GDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSL 224
D SS L GD ++ P+LK+ +YY+ L +++G +
Sbjct: 305 ----RDSGKSSTLEFNSPRPGDSVVA--------PLLKNQKVNTFYYVELTGVSVGGEIV 352
Query: 225 TEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGF 284
T VP D G GG++VDSGT T L Y+ + + + A+ V F
Sbjct: 353 T-VPPETFAVDQSGAGGVIVDSGTAITRLRTQAYNSVRDAFKRKTSNLRPAEGVAL---F 408
Query: 285 DLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMD 344
D CY + ++ P+++FHF + + LP N+ P + + C F
Sbjct: 409 DTCYDL----SSLQSVRVPTVSFHFSGDRAWALPAKNYLI----PVDGAGTYCFAFAPTT 460
Query: 345 DGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
+ G+ QQQ V +DL +GF P C
Sbjct: 461 SS----MSIIGNVQQQGTRVSFDLANSLVGFSPNKC 492
>gi|302757345|ref|XP_002962096.1| hypothetical protein SELMODRAFT_403622 [Selaginella moellendorffii]
gi|300170755|gb|EFJ37356.1| hypothetical protein SELMODRAFT_403622 [Selaginella moellendorffii]
Length = 506
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 103/403 (25%), Positives = 162/403 (40%), Gaps = 77/403 (19%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSN---------FSPSRSSSSSRDTC 52
+ +DTGSD+ W C C C +N + S+ + P S ++S TC
Sbjct: 101 LNAIVDTGSDILWFKCKL----CQGCSSKKNVIVCSSIIMQGPITLYDPELSITASPATC 156
Query: 53 ASSFCLNIHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLK 112
+ C S + C ++ +Y + TGI RD +
Sbjct: 157 SDPLCSEGGSCRGNNNSC--------------------AYDISYEDTSSSTGIYFRDVVH 196
Query: 113 V-HGSSPGIIREIPKFCFGCVGS-TYREPI-GIAGFGRGALSVPSQLGFLQKG----FSH 165
+ H +S GC S + P+ GI GFGR +SVP+QL Q G F H
Sbjct: 197 LGHKASLN-----TTMFLGCATSISGLWPVDGIMGFGRSKVSVPNQLA-AQAGSYNIFYH 250
Query: 166 CFLAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPN--YYYIGLEAITIGNSS 223
C K LV+G N +F M+ +PM N Y + L ++++ + +
Sbjct: 251 CLSGEKEGG-----GILVLGK-------NDEFPEMVYTPMLANDIVYNVKLVSLSVNSKA 298
Query: 224 LTEVPLSLREFD---SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEE 280
L P+ EF+ + GNGG ++DSGT+ P + + + T P A E
Sbjct: 299 L---PIEASEFEYNATVGNGGTIIDSGTSSATFPSKALALFVKAVSKFTTAIPTAP--LE 353
Query: 281 RTGFDLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLF 340
+G C+ N+ D FP++T F ++ L N+ A+ + S + F
Sbjct: 354 SSG-SPCFISISDRNSVEVD-FPNVTLKFDGGATMELTAHNYLEAVVSRKLSESTH---F 408
Query: 341 QSMD----DGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMD 379
Q + G S + G ++ VVYD+EK RIG+ D
Sbjct: 409 QGVRLVCISWSVGNSTILGDAILKDKVVVYDMEKSRIGWVKQD 451
>gi|84453222|dbj|BAE71208.1| hypothetical protein [Trifolium pratense]
gi|84453226|dbj|BAE71210.1| hypothetical protein [Trifolium pratense]
Length = 437
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 93/334 (27%), Positives = 149/334 (44%), Gaps = 37/334 (11%)
Query: 53 ASSFCLNIHSSDNPFDPCTMSGCSLSTLLK--STCCRPCPSFAYTYGEGGLVTGILTRDT 110
A++F N+ +S P D C++ C L +T C SF +Y G + L +D+
Sbjct: 134 ATTFYPNVSTSFVPLD-CSVPQCGQVRGLSCPATGSGAC-SFNQSYA-GSTFSATLVQDS 190
Query: 111 LKVHGSSPGIIREIPKFCFGCVGSTYREPI---GIAGFGRGALSVPSQLGFLQKG-FSHC 166
L++ IP + FG + + + G+ G GRG LS+ SQ G + G FS+C
Sbjct: 191 LRLA------TDVIPSYSFGSINAISGSSVPAQGLLGLGRGPLSLLSQSGAIYSGVFSYC 244
Query: 167 FLAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTE 226
+FK S L +G V +++ TP+L +P P+ YY+ L AI++G +
Sbjct: 245 LPSFK---SYYFSGSLKLGPVG--QPKSIRTTPLLHNPHRPSLYYVNLTAISVGRVYV-P 298
Query: 227 VPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDL 286
+P L F+ G ++DSGT T EP Y+ + + +T FD
Sbjct: 299 LPSELLAFNPSTGAGTIIDSGTVITRFVEPIYNAVRDEFRKQVT-----GPFSSLGAFDT 353
Query: 287 CYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDG 346
C+ + L P+IT HF ++ L LP N S+S ++ CL +
Sbjct: 354 CFV------KNYETLAPAITLHF-TDLDLKLPLENSLIH----SSSGSLACLAMAAAPSN 402
Query: 347 DYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
V +FQQQN+ V++D ++G C
Sbjct: 403 VNSVLNVIANFQQQNLRVLFDTVNNKVGIARELC 436
>gi|449457263|ref|XP_004146368.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 469
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 72/254 (28%), Positives = 121/254 (47%), Gaps = 55/254 (21%)
Query: 145 FGRGA---LSVPSQLGFLQKGFSHCFLAFKYANDPNIS-SPLVIGDVAISSKDNLQFTPM 200
FG GA +++ +QLG FS+C N+P + + LV+G + D+ TP+
Sbjct: 248 FGLGAYPHITMATQLG---NKFSYCI---GDINNPLYTHNHLVLGQGSYIEGDS---TPL 298
Query: 201 LKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGGLLVDSGTTYTHLP----EP 256
++ +YY+ L++I++G+ +L P + + S G+GG+L+DSG TYT L E
Sbjct: 299 ---QIHFGHYYVTLQSISVGSKTLKIDPNAFK-ISSDGSGGVLIDSGMTYTKLANGGFEL 354
Query: 257 FYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDL--FPSITFHFLNNVS 314
Y +++ +++ + P ++ E LC++ + DL FP++TFHF
Sbjct: 355 LYDEIVDLMKGLLERIPTQRKFE-----GLCFK-----GVVSRDLVGFPAVTFHFAGGAD 404
Query: 315 LVLPQGNHFYAMSA--------PSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQNVEVVY 366
LVL G+ F PSNS + V G QQN V +
Sbjct: 405 LVLESGSLFRQHGGDRFCLAILPSNSELLNL--------------SVIGILAQQNYNVGF 450
Query: 367 DLEKERIGFQPMDC 380
DLE+ ++ F+ +DC
Sbjct: 451 DLEQMKVFFRRIDC 464
>gi|224124882|ref|XP_002329972.1| predicted protein [Populus trichocarpa]
gi|222871994|gb|EEF09125.1| predicted protein [Populus trichocarpa]
Length = 332
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 96/388 (24%), Positives = 150/388 (38%), Gaps = 72/388 (18%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS-- 63
+DTGS L+W+ C C Y + + + PS S + + +CAS C + ++
Sbjct: 3 LDTGSSLSWL-------QCQPCAVYCHAQADPLYDPSVSKTYKKLSCASVECSRLKAATL 55
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
++P + C + +YG+ G L++D L + S +
Sbjct: 56 NDPLCETDSNACL---------------YTASYGDTSFSIGYLSQDLLTLTSS-----QT 95
Query: 124 IPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGF-LQKGFSHCFLAFKYANDPNIS 179
+P+F +GC + GI G R LS+ +QL FS+C AN +
Sbjct: 96 LPQFTYGCGQDNQGLFGRAAGIIGLARDKLSMLAQLSTKYGHAFSYCL---PTANSGSSG 152
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
+ S + +FTPML P+ Y++ L AIT+ L R
Sbjct: 153 GGFLSIGSI--SPTSYKFTPMLTDSKNPSLYFLRLTAITVSGRPLDLAAAMYRV------ 204
Query: 240 GGLLVDSGTTYTHLPEPFYSQLL-SILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFT 298
L+DSGT T LP Y+ L + ++ T Y +A D C++ + +
Sbjct: 205 -PTLIDSGTVITRLPMSMYAALRQAFVKIMSTKYAKAPAYSI---LDTCFK----GSLKS 256
Query: 299 DDLFPSITFHFLNNVSLVLPQGNHFYAMSAPS----NSSAVKCLLFQSMDDGDYGPS--G 352
P I F QG + APS + CL F G G +
Sbjct: 257 ISAVPEIKMIF---------QGGADLTLRAPSILIEADKGITCLAFA----GSSGTNQIA 303
Query: 353 VFGSFQQQNVEVVYDLEKERIGFQPMDC 380
+ G+ QQQ + YD+ RIGF P C
Sbjct: 304 IIGNRQQQTYNIAYDVSTSRIGFAPGSC 331
>gi|125555051|gb|EAZ00657.1| hypothetical protein OsI_22678 [Oryza sativa Indica Group]
Length = 435
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 91/353 (25%), Positives = 140/353 (39%), Gaps = 57/353 (16%)
Query: 39 FSPSRSSSSSRDTCASSFCLNIHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGE 98
F PSRSSS + C S C CT + C F +G
Sbjct: 129 FEPSRSSSFAAIPCGSPECAV---------ECTGASCP---------------FTIQFGN 164
Query: 99 GGLVTGILTRDTLKVHGSSPGIIREIPKFCFGCV-----GSTYREPIGIAGFGRGALSVP 153
+ G L RDTL + S+ F FGC+ T+ +G+ R + S+
Sbjct: 165 VTVANGTLVRDTLTLPPSA-----TFAGFTFGCIEVGADADTFDGAVGLIDLSRSSHSLA 219
Query: 154 SQLGFLQKGFSHCFLAFKYANDPNISSPLVIGDVAIS------SKDNLQFTPMLKSPMYP 207
S++ + G + AF Y P+ S+ G ++I S ++++ PM +P +P
Sbjct: 220 SRV--ISNGATTSAAAFSYCL-PSSSATSSRGFLSIGASRPEYSGGDIKYAPMSSNPNHP 276
Query: 208 NYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQS 267
N Y++ L I++G L P+ F + G L+++ T +T L Y+ L +
Sbjct: 277 NSYFVELVGISVGGEDL---PVPPAVFAAHGT---LLEAATEFTFLAPAAYAALRDAFRR 330
Query: 268 TITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMS 327
+ YP A D CY + P++ F L L Y
Sbjct: 331 DMAPYPAAPPFRV---LDTCYNL----TGLASLAVPTVALRFAGGTELELDVRQMMYFAD 383
Query: 328 APSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
S S+V CL F + + S V G+ Q++ EVVYDL R+GF P C
Sbjct: 384 PSSVFSSVACLAFAAAPLPAFPVS-VIGTLAQRSTEVVYDLRGGRVGFIPGRC 435
>gi|224083757|ref|XP_002307112.1| predicted protein [Populus trichocarpa]
gi|222856561|gb|EEE94108.1| predicted protein [Populus trichocarpa]
Length = 492
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 105/396 (26%), Positives = 161/396 (40%), Gaps = 80/396 (20%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDC-----DDYRN-NKLMSNFSPSRSSSSSRDTCASSFC 57
V +D+GSDL WVPC DC+ C Y + ++ +S +SPS+SS+S + +C+ C
Sbjct: 113 VALDSGSDLFWVPC-----DCVQCAPLSASHYSSLDRDLSEYSPSQSSTSKQLSCSHRLC 167
Query: 58 LNIHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILT-----RDTLK 112
+ NP C S++ +ST G LV I+ DTL
Sbjct: 168 DMGPNCKNPKQSCPY---SINYYTESTS-----------SSGLLVEDIIHLASGGDDTLN 213
Query: 113 VHGSSPGIIREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQL---GFLQKGFSHCFLA 169
+P II K G + P G+ G G +SVPS L G +Q FS CF
Sbjct: 214 TSVKAPVIIGCGMKQSGGYLDGV--APDGLLGLGLQEISVPSFLAKAGLIQNSFSMCF-- 269
Query: 170 FKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPL 229
+ + S + GD +++ Q P LK Y +G+E +G S L +
Sbjct: 270 -----NEDDSGRIFFGDQGPATQ---QSAPFLKLNGNYTTYIVGVEVCCVGTSCLKQSSF 321
Query: 230 SLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYR 289
S LVDSGT++T LP+ + + + + ++ E + CY+
Sbjct: 322 S-----------ALVDSGTSFTFLPDDVFEMIAEEFDTQVN---ASRSSFEGYSWKYCYK 367
Query: 290 VPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVK-----CLLFQSMD 344
T + DL P I ++ L+ PQ N F + ++ CL Q D
Sbjct: 368 ------TSSQDL-PKIP-----SLRLIFPQNNSFMVQNPVFMIYGIQGVIGFCLAIQPAD 415
Query: 345 DGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
G G G VV+D E ++G+ +C
Sbjct: 416 ----GDIGTIGQNFMMGYRVVFDRENLKLGWSRSNC 447
>gi|357132618|ref|XP_003567926.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 468
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 95/391 (24%), Positives = 157/391 (40%), Gaps = 61/391 (15%)
Query: 7 DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSS-----RDTCASSFCLNIH 61
DTGSDLTWV C + S ++ F P+ S S S DTC S ++
Sbjct: 122 DTGSDLTWVKCSSPSSSSSSPAASPPQRV---FRPAGSKSWSPLPCDSDTCKSYVPFSLA 178
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRD--TLKVHGSSPG 119
+ +P DPC S+ Y Y + G++ D T+ + G+
Sbjct: 179 NCSSPPDPC--------------------SYDYRYKDNSSARGVVGLDSATVSLSGNDGT 218
Query: 120 IIREIPKFCFGCV----GSTYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYAN 174
++ + GC G +++ G+ G +S S+ G FS+C + +
Sbjct: 219 RKAKLQEVVLGCTTSYDGQSFKSSDGVLSLGNSNISFASRAASRFGGRFSYCLV--DHLA 276
Query: 175 DPNISSPLVIGDVAISSKDNL--QFTPM--LKSPMYPNYYYIGLEAITIGNSSLTEVPLS 230
N +S L G+ S D+ + TP+ L+ +Y++ ++A+T+ L +P
Sbjct: 277 PRNATSFLTFGNGDSSPGDDSSSRRTPLVLLEDARTRPFYFVSVDAVTVAGERLEILP-- 334
Query: 231 LREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRV 290
+D + NGG ++DSGT+ T L P Y ++ + PR F+ CY
Sbjct: 335 -DVWDFRKNGGAILDSGTSLTILATPAYDAVVKAISKQFAGVPRVNM----DPFEYCY-- 387
Query: 291 PCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGP 350
N T P + F +L P G + +AP VKC+ + +G +
Sbjct: 388 ---NWTGVSAEIPRMELRFAGAATLA-PPGKSYVIDTAP----GVKCI---GVVEGAWPG 436
Query: 351 SGVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
V G+ QQ +DL + F+ CA
Sbjct: 437 VSVIGNILQQEHLWEFDLANRWLRFKQSRCA 467
>gi|383161173|gb|AFG63169.1| Pinus taeda anonymous locus 0_11073_01 genomic sequence
Length = 133
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 56/140 (40%), Positives = 79/140 (56%), Gaps = 13/140 (9%)
Query: 85 CCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIPKFCFGCVGSTYREPIGIAG 144
C + CP F+ TYG G TG L DTL + G REI F FGC + GIAG
Sbjct: 1 CSKICPHFSLTYGTGN-ATGRLLSDTLTLPLEDGGR-REIKNFAFGC-SVLSSQVAGIAG 57
Query: 145 FGRGALSVPSQLG-FLQKGFSHCFLAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKS 203
FG G LS+PSQL + F++C Y ++ SS +V+G+ A+ L +TP+L +
Sbjct: 58 FGNGGLSMPSQLAPLIGDKFAYC---LDYRSN---SSKIVLGNKAVPRDLPLTYTPLLFN 111
Query: 204 PMYP---NYYYIGLEAITIG 220
P+ P +Y+Y+ LEA++IG
Sbjct: 112 PVNPSVFSYFYLALEAVSIG 131
>gi|242095588|ref|XP_002438284.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
gi|241916507|gb|EER89651.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
Length = 487
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 100/400 (25%), Positives = 155/400 (38%), Gaps = 71/400 (17%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V DTGSDLTWV C C D Y + + F PS+SS+ +++
Sbjct: 137 VLFDTGSDLTWVQC----LPCPDSSCYPQQEPL--FDPSKSSTY----------VDV--- 177
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
PC+ C + + ++ C ++ YG+ G L +T + SP +
Sbjct: 178 -----PCSAPECHIGGVQQTRCGATSCEYSVKYGDESETHGSLAEETFTLSPPSP-LAPA 231
Query: 124 IPKFCFGC-------VGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDP 176
FGC T G+ G GRG S+ SQ ++ + F Y P
Sbjct: 232 ATGVVFGCSHEYISVFNDTGMGVAGLLGLGRGDSSILSQT---RRSINSGGGVFSYCLPP 288
Query: 177 NISSP--LVIGDVAISSKD---NLQFTPMLKS-PMYPNYYYIGLEAITIGNSSLTEVPLS 230
SS L IG A + + NL FTP++ + + Y + L +++ N + ++P S
Sbjct: 289 RGSSTGYLTIGGGAAAPQQQYSNLSFTPLITTISQLRSAYVVNLAGVSV-NGAAVDIPAS 347
Query: 231 LREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRV 290
G ++DSGT TH+P Y L + + Y E + D CY V
Sbjct: 348 AFSL------GAVIDSGTVVTHMPAAAYYPLRDEFRLHMGSYKMLPEGSMKL-LDTCYDV 400
Query: 291 PCPNNTFTDDLFPSITFHF---------LNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQ 341
+ P + F + + LVLP A S + CL F
Sbjct: 401 TGQDVVTA----PRVALEFGGGARIDVDASGILLVLP------AEDGSGQSLTLACLAFL 450
Query: 342 SMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
+ + G+ QQ+ VV+D++ RIGF P C+
Sbjct: 451 PTNSAGLV---IVGNMQQRAYNVVFDVDGGRIGFGPNGCS 487
>gi|359497446|ref|XP_003635520.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
vinifera]
Length = 354
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 103/390 (26%), Positives = 153/390 (39%), Gaps = 81/390 (20%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS-- 63
+DTGS L+W+ C C Y + + F PS S + +C SS C ++ +
Sbjct: 30 VDTGSSLSWL-------QCKPCVVYCHVQADPLFDPSASKTYKSLSCTSSQCSSLVDATL 82
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
+NP C S S C + +YG+ G L++D L + S +
Sbjct: 83 NNPL--CETS---------SNVC----VYTASYGDSSYSMGYLSQDLLTLAPS-----QT 122
Query: 124 IPKFCFGCVGST---YREPIGIAGFGRGALS----VPSQLGFLQKGFSHCFLAFKYANDP 176
+P F +GC + + GI G GR LS V S+ G+ FS+C P
Sbjct: 123 LPGFVYGCGQDSEGLFGRAAGILGLGRNKLSMLGQVSSKFGY---AFSYCL--------P 171
Query: 177 NISSP--LVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREF 234
L IG +++ +FTPM P P+ Y++ L AIT+G +L R
Sbjct: 172 TRGGGGFLSIGKASLAG-SAYKFTPMTTDPGNPSLYFLRLTAITVGGRALGVAAAQYRVP 230
Query: 235 DSQGNGGLLVDSGTTYTHLP----EPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRV 290
++DSGT T LP PF + I+ S Y RA D C++
Sbjct: 231 T-------IIDSGTVITRLPMSVYTPFQQAFVKIMSSK---YARAPGFSI---LDTCFK- 276
Query: 291 PCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGP 350
N P + F L L N + + CL F G+ G
Sbjct: 277 ---GNLKDMQSVPEVRLIFQGGADLNLRPVNVLLQV-----DEGLTCLAFA----GNNGV 324
Query: 351 SGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
+ + G+ QQQ +V +D+ RIGF C
Sbjct: 325 A-IIGNHQQQTFKVAHDISTARIGFATGGC 353
>gi|224079535|ref|XP_002305886.1| predicted protein [Populus trichocarpa]
gi|222848850|gb|EEE86397.1| predicted protein [Populus trichocarpa]
Length = 436
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 100/393 (25%), Positives = 160/393 (40%), Gaps = 58/393 (14%)
Query: 3 QVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHS 62
Q+ +DTGS L+W+ C R + F PS SSS S C C
Sbjct: 91 QMILDTGSQLSWIQCHK--------KVPRKPPPSTVFDPSLSSSFSVLPCNHPLC----- 137
Query: 63 SDNPFDP--CTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
P P + C L+ L C ++Y Y +G L G L R+ + S
Sbjct: 138 --KPRIPDFTLPTSCDLNRL-----CH----YSYFYADGTLAEGNLVREKITFSTS---- 182
Query: 121 IREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISS 180
+ P GC + GI G G LS SQ + FS+C + +
Sbjct: 183 -QSTPPLILGCAEDASDDK-GILGMNLGRLSFASQAKITK--FSYCVPTRQVRPGFTPTG 238
Query: 181 PLVIGDVAISSKDNLQFTPML---KSPMYPNY----YYIGLEAITIGNSSLTEVPLSLRE 233
+G+ S+ Q+ +L +S PN + + L+ I IGN L +P+S
Sbjct: 239 SFYLGENPNSA--GFQYISLLTFSQSQRMPNLDPLAHTVALQGIRIGNKKLN-IPVSAFR 295
Query: 234 FDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGF-DLCYRVPC 292
D G G ++DSG+ +T+L + Y+++ + PR K+ +G D+C+
Sbjct: 296 ADPSGAGQSMIDSGSEFTYLVDVAYNKVRE--EVVRLAGPRLKKGYVYSGVSDMCFD--- 350
Query: 293 PNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCL-LFQSMDDGDYGPS 351
N L ++ F F V +V+ +G + V C+ + +S G S
Sbjct: 351 GNAMEIGRLIGNMVFEFDKGVEIVIEKGRVLADVGG-----GVHCVGIGRSEMLG--AAS 403
Query: 352 GVFGSFQQQNVEVVYDLEKERIGFQPMDCASTA 384
+ G+F QQN+ V +D+ R+GF DC+ +
Sbjct: 404 NIIGNFHQQNLWVEFDIANRRVGFGKADCSRSV 436
>gi|147776519|emb|CAN74010.1| hypothetical protein VITISV_003547 [Vitis vinifera]
Length = 429
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 54/209 (25%), Positives = 87/209 (41%), Gaps = 29/209 (13%)
Query: 172 YANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSL 231
YA+DP + N++ TP+L++P P YY+ L +++G L V L
Sbjct: 249 YASDP------------LGQPKNIRTTPLLRNPHRPTLYYVNLTGVSVGR-VLVPVAPEL 295
Query: 232 REFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVP 291
FD G ++DSGT T EP Y+ + + + FD C+
Sbjct: 296 LAFDPNTGAGTIIDSGTVITRFVEPVYAAIRDEFRKQVK-----GPFATIGAFDTCFAA- 349
Query: 292 CPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPS 351
+D+ P +TFHF + L LP N SA ++ CL + +
Sbjct: 350 -----TNEDIAPPVTFHF-TGMDLKLPLENTLIHSSA----GSLACLAMAAAPNNVNSVL 399
Query: 352 GVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
V + QQQN+ +++D+ R+G C
Sbjct: 400 NVIANLQQQNLRIMFDVTNSRLGIARELC 428
>gi|222631382|gb|EEE63514.1| hypothetical protein OsJ_18330 [Oryza sativa Japonica Group]
Length = 464
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 103/419 (24%), Positives = 160/419 (38%), Gaps = 73/419 (17%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDD----YRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+DTGSDL W C + + N NFS SR++ + +
Sbjct: 78 VDTGSDLVWTQCSTCRLPAVAAAGGGGCFPQNLPYYNFSLSRTARA------------VP 125
Query: 62 SSDNPFDPCTMS----GCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSS 117
D+ C ++ GC+ C A +YG G+ G+L D SS
Sbjct: 126 CDDDDGALCGVAPETAGCARGGGSGDDAC----VVAASYG-AGVALGVLGTDAFTFPSSS 180
Query: 118 PGIIREIPKFCFGCVGSTYREP------IGIAGFGRGALSVPSQLGFLQKGFSHCFLAFK 171
+ FGCV T P GI G GRGALS+ SQL + FS+C
Sbjct: 181 SVTL------AFGCVSQTRISPGALNGASGIIGLGRGALSLVSQLNATE--FSYCLT--P 230
Query: 172 YANDPNISSPLVIGD-----------VAISSKDNLQFTPMLKSPM---YPNYYYIGLEAI 217
Y D S L +GD + P K+P + +YY+ L +
Sbjct: 231 YFRDTVSPSHLFVGDGELAGLRAAAGGGGGGGAPVTTVPFAKNPKDSPFSTFYYLPLVGL 290
Query: 218 TIGNS--SLTEVPLSLREFDSQ-GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPR 274
GN+ +L LRE + GG L+DSG+ +T L +P + L L +
Sbjct: 291 AAGNATVALPAGAFDLREAAPKVWAGGALIDSGSPFTRLVDPAHRALTKELARQLRGSGS 350
Query: 275 AKEVEERTG--FDLCYRVPCPNNTFTDDLFPSITFHFLNNV----SLVLPQGNHFYAMSA 328
+ G +LC ++ P + F + V LV+P ++ + A
Sbjct: 351 LVPPPAKLGGALELCVEAGDDGDSLAAAAVPPLVLRFDDGVGGGRELVIPAEKYWARVEA 410
Query: 329 PSNSSAVKCLLFQSMDDGDY----GPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCAST 383
+ C+ S G+ + + G+F QQ++ V+YDL + FQP +C++
Sbjct: 411 -----STWCMAVVSSASGNATLPTNETTIIGNFMQQDMRVLYDLANGLLSFQPANCSAV 464
>gi|218189440|gb|EEC71867.1| hypothetical protein OsI_04576 [Oryza sativa Indica Group]
Length = 508
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 90/327 (27%), Positives = 136/327 (41%), Gaps = 31/327 (9%)
Query: 70 CTMSGCSLSTLLKSTCCR---PCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIPK 126
C GC L+ TC PC ++Y YG G T T L V + +R
Sbjct: 157 CANRGCQ--RLVPQTCSADDSPC-GYSYVYGGGAANT---TAGLLAVDAFAFATVRA-DG 209
Query: 127 FCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSPLVIGD 186
FGC +T + G+ G GRG LS+ SQ LQ G +LA A D + S ++ D
Sbjct: 210 VIFGCAVATEGDIGGVIGLGRGELSLVSQ---LQIGRFSYYLAPDDAVD--VGSFILFLD 264
Query: 187 VAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGGLLVDS 246
A TP++ + + YY+ L I + L +P + + G+GG+++
Sbjct: 265 DAKPRTSRAVSTPLVANRASRSLYYVELAGIRVDGEDLA-IPRGTFDLQADGSGGVVLSI 323
Query: 247 GTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDLFPSIT 306
T L Y + + S I RA + E G DLCY + + PS+
Sbjct: 324 TIPVTFLDAGAYKVVRQAMASKIGL--RAADGSE-LGLDLCYT----SESLATAKVPSMA 376
Query: 307 FHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQNVEVVY 366
F + L GN+FY S ++ ++CL GD + GS Q ++Y
Sbjct: 377 LVFAGGAVMELEMGNYFYMDS----TTGLECLTILPSPAGD---GSLLGSLIQVGTHMIY 429
Query: 367 DLEKERIGFQPMDCA-STASAQGLHKK 392
D+ R+ F+ ++ A SA GL K
Sbjct: 430 DISGSRLVFESLEQAPPPPSASGLGGK 456
>gi|118487542|gb|ABK95598.1| unknown [Populus trichocarpa]
Length = 471
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 98/386 (25%), Positives = 151/386 (39%), Gaps = 68/386 (17%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGS L+W+ C C Y + + + PS S + + +CAS C + ++
Sbjct: 142 LDTGSSLSWL-------QCQPCAVYCHAQADPLYDPSVSKTYKKLSCASVECSRLKAA-- 192
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
+L+ L T C + +YG+ G L++D L + S + +P
Sbjct: 193 ----------TLNDPLCETDSNAC-LYTASYGDTSFSIGYLSQDLLTLTSS-----QTLP 236
Query: 126 KFCFGCVGST---YREPIGIAGFGRGALSVPSQLGF-LQKGFSHCFLAFKYANDPNISSP 181
+F +GC + GI G R LS+ +QL FS+C AN +
Sbjct: 237 QFTYGCGQDNQGLFGRAAGIIGLARDKLSMLAQLSTKYGHAFSYCL---PTANSGSSGGG 293
Query: 182 LVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGG 241
+ S + +FTPML P+ Y++ L AIT+ L R
Sbjct: 294 FLSIGSI--SPTSYKFTPMLTDSKNPSLYFLRLTAITVSGRPLDLAAAMYRV-------P 344
Query: 242 LLVDSGTTYTHLPEPFYSQLL-SILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDD 300
L+DSGT T LP Y+ L + ++ T Y +A D C++ + +
Sbjct: 345 TLIDSGTVITRLPMSMYAALRQAFVKIMSTKYAKAPAYSI---LDTCFK----GSLKSIS 397
Query: 301 LFPSITFHFLNNVSLVLPQGNHFYAMSAPS----NSSAVKCLLFQSMDDGDYGPS--GVF 354
P I F QG + APS + CL F G G + +
Sbjct: 398 AVPEIKMIF---------QGGADLTLRAPSILIEADKGITCLAFA----GSSGTNQIAII 444
Query: 355 GSFQQQNVEVVYDLEKERIGFQPMDC 380
G+ QQQ + YD+ RIGF P C
Sbjct: 445 GNRQQQTYNIAYDVSTSRIGFAPGSC 470
>gi|21717173|gb|AAM76366.1|AC074196_24 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
gi|31433291|gb|AAP54829.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 413
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 83/385 (21%), Positives = 153/385 (39%), Gaps = 60/385 (15%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+D +L W C C C ++ + + F P+ SS+ + C ++ C +I +
Sbjct: 79 VDVAGELVWTQCSA----CRRC--FKQD--LPVFVPNASSTFKPEPCGTAVCESIPTRSC 130
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
D C+ G T L+ G +G DT + ++
Sbjct: 131 SGDVCSYKG--PPTQLR-----------------GNTSGFAATDTFAIGTATV------- 164
Query: 126 KFCFGCVGS----TYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSP 181
+ FGCV + T P G G GR S+ +Q+ + FS+C + SS
Sbjct: 165 RLAFGCVVASDIDTMDGPSGFIGLGRTPWSLVAQMKLTR--FSYCL----SPRNTGKSSR 218
Query: 182 LVIGDVA-ISSKDNLQFTPMLKSPM---YPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
L +G A ++ ++ P +K+ +YY + L+AI GN+++ +Q
Sbjct: 219 LFLGSSAKLAGGESTSTAPFIKTSPDDDSHHYYLLSLDAIRAGNTTIAT---------AQ 269
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTF 297
G L++ + + ++ L + Y + + FDLC++ F
Sbjct: 270 SGGILVMHTVSPFSLLVDSAYRAFKKAVTEAVGGAAAPPMATPPQPFDLCFK---KAAGF 326
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSF 357
+ P + F F +L +P + + +++ L ++ V GS
Sbjct: 327 SRATAPDLVFTFQGAAALTVPPAKYLIDVGEEKDTACAAILSMAWLNRTGLEGVSVLGSL 386
Query: 358 QQQNVEVVYDLEKERIGFQPMDCAS 382
QQ++V +YDL+KE + F+P DC+S
Sbjct: 387 QQEDVHFLYDLKKETLSFEPADCSS 411
>gi|388516465|gb|AFK46294.1| unknown [Medicago truncatula]
Length = 434
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 102/379 (26%), Positives = 162/379 (42%), Gaps = 62/379 (16%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DT +D ++P S C+ C + FSP+ S+S C+ C +
Sbjct: 113 MVLDTSTDEAFIP----SSGCIGCS-------ATTFSPNASTSYVPLECSVPQCSQVRGL 161
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
P T SG SF +Y G + L +D+L++
Sbjct: 162 SCP---ATGSGAC--------------SFNKSYA-GSTYSATLVQDSLRLA------TDV 197
Query: 124 IPKFCFGCVGSTYREPI---GIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNIS 179
IP + FG + + I G+ G GRG LS+ SQ G L G FS+C +FK S
Sbjct: 198 IPSYSFGSINAISGSSIPAQGLLGLGRGPLSLLSQTGSLYSGVFSYCLPSFK---SYYFS 254
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
L +G V +++ TP+L++P P+ Y++ L IT+G ++ P L FD
Sbjct: 255 GSLKLGPVG--QPKSIRTTPLLRNPRRPSLYFVNLTGITVGKVNV-PFPKELLAFDVNTG 311
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTD 299
G ++DSGT T EP Y+ + + +T FD C+ +
Sbjct: 312 SGTIIDSGTVITRFVEPVYNAVRDEFRKQVT-----GPFSSLGAFDTCFV------KNYE 360
Query: 300 DLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSM-DDGDYGPSGVFGSFQ 358
L P+IT HF ++ L LP N S+S ++ CL S + +Y V ++Q
Sbjct: 361 TLAPAITLHF-TDLDLKLPLENSLIH----SSSGSLACLAMASTPKNVNYTVLNVIANYQ 415
Query: 359 QQNVEVVYDLEKERIGFQP 377
QQN+ V++D + + P
Sbjct: 416 QQNLRVLFDTVNNKGWYCP 434
>gi|238010910|gb|ACR36490.1| unknown [Zea mays]
gi|413942664|gb|AFW75313.1| hypothetical protein ZEAMMB73_520329 [Zea mays]
Length = 481
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 107/387 (27%), Positives = 151/387 (39%), Gaps = 69/387 (17%)
Query: 1 VIQ-VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLN 59
VIQ V +D+ SD+ WV C + C + ++ S + PSRS SS+ +C+S C
Sbjct: 157 VIQTVVLDSASDVPWVQC--VPCPIPPC----HPQVDSFYDPSRSPSSAPFSCSSPTCTA 210
Query: 60 IHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPG 119
+ N GC+ + C + Y +G +G D L + +
Sbjct: 211 LGPYAN--------GCA-----NNQC-----QYLVRYPDGSSTSGAYIADLLTLDAGN-- 250
Query: 120 IIREIPKFCFGCV----GSTYREPIGIAGFGRGALSVPSQLG-FLQKGFSHCFLAFKYAN 174
+ F FGC GS GI G G S+ SQ FS+C A A+
Sbjct: 251 ---AVSGFKFGCSHAEQGSFDARAAGIMALGGGPESLLSQTASRYGNAFSYCIPA--TAS 305
Query: 175 DPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREF 234
D +G V + TPM++ +Y + L IT+G L P
Sbjct: 306 DSGF---FTLG-VPRRASSRYVVTPMVRFRQAATFYGVLLRTITVGGQRLGVAPAVF--- 358
Query: 235 DSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPN 294
G ++DS T T LP Y L S +S++T Y + + D CY
Sbjct: 359 ----AAGSVLDSRTAITRLPPTAYQALRSAFRSSMTMY---RSAPPKGYLDTCYDF---- 407
Query: 295 NTFTDDLFPSITFHFLNNVSLVL-PQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGV 353
+ P I+ F N L L P G F CL F S D D P GV
Sbjct: 408 TGVVNIRLPKISLVFDRNAVLPLDPSGILFN-----------DCLAFTSNAD-DRMP-GV 454
Query: 354 FGSFQQQNVEVVYDLEKERIGFQPMDC 380
GS QQQ +EV+YD+ +GF+ C
Sbjct: 455 LGSVQQQTIEVLYDVGGGAVGFRQGAC 481
>gi|125532796|gb|EAY79361.1| hypothetical protein OsI_34489 [Oryza sativa Indica Group]
Length = 405
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 99/404 (24%), Positives = 154/404 (38%), Gaps = 93/404 (23%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNI- 60
+ +D +L W C C C + + F P++SS+ C S C +I
Sbjct: 70 VSAVVDLTGELVWTQC----TPCQPCFEQD----LPLFDPTKSSTFRGLPCGSHLCESIP 121
Query: 61 HSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
SS N CT C P+ A G G+ DT + G
Sbjct: 122 ESSRN----CT----------SDVCIYEAPTKAGDTG------GMAGTDTFAI-----GA 156
Query: 121 IREIPKFCFGCVGSTYRE------PIGIAGFGRGALSVPSQLGFLQKGFSHCFL------ 168
+E FGCV T + P GI G GR S+ +Q+ FS+C
Sbjct: 157 AKE--TLGFGCVVMTDKRLKTIGGPSGIVGLGRTPWSLVTQMNV--TAFSYCLAGKSSGA 212
Query: 169 ------AFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNS 222
A + A N S+P VI A SS + +P YY + L I G +
Sbjct: 213 LFLGATAKQLAGGKNSSTPFVIKTSAGSSDNG-------SNP----YYMVKLAGIKAGGA 261
Query: 223 SLTEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERT 282
L + S +L+D+ + ++L + Y L L + + P A +
Sbjct: 262 PL--------QAASSSGSTVLLDTVSRASYLADGAYKALKKALTAAVGVQPVASPPKP-- 311
Query: 283 GFDLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQS 342
+DLC+ + P + F F +L +P N+ A + + CL S
Sbjct: 312 -YDLCFSKAVAGDA------PELVFTFDGGAALTVPPANYLLA-----SGNGTVCLTIGS 359
Query: 343 MDD----GDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCAS 382
G+ + + GS QQ+NV V++DL++E + F+P DC+S
Sbjct: 360 SASLNLTGELEGASILGSLQQENVHVLFDLKEETLSFKPADCSS 403
>gi|326501422|dbj|BAK02500.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 631
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 98/392 (25%), Positives = 158/392 (40%), Gaps = 75/392 (19%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +D+GS +T+VPC + C C ++++ + F P SS+ S C N+ +
Sbjct: 103 LIVDSGSTVTYVPCAS----CEQCGNHQDPR----FQPDLSSTYSPVKC------NVDCT 148
Query: 64 -DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIR 122
D+ + CT + Y E +G+L D + G
Sbjct: 149 CDSDKNQCT--------------------YERQYAEMSSSSGVLGEDIVSF-----GTES 183
Query: 123 EIP--KFCFGCVGSTY-----REPIGIAGFGRGALSVPSQL---GFLQKGFSHCFLAFKY 172
E+ + FGC S + GI G GRG LS+ QL G + FS C+
Sbjct: 184 ELKPQRAVFGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDI 243
Query: 173 ANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLR 232
+V+G + + ++SP YY I L+ + + +L P R
Sbjct: 244 GG-----GAMVLGAMPAPPGMIYTHSNAVRSP----YYNIELKEMHVAGKALRVDP---R 291
Query: 233 EFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGF-DLCYRVP 291
FD G G ++DSGTTY +LPE + + S + +P K + D+C+
Sbjct: 292 IFD--GKHGTVLDSGTTYAYLPEQAFVAFKDAVSSQV--HPLKKIRGPDPNYKDICFAGA 347
Query: 292 CPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCL-LFQSMDDGDYGP 350
N + ++FP + F N L L N+ + S CL +FQ+ D P
Sbjct: 348 GRNVSQLSEVFPKVDMVFGNGQKLSLSPENYLFRH---SKVEGAYCLGVFQNGKD----P 400
Query: 351 SGVFGSFQQQNVEVVYDLEKERIGFQPMDCAS 382
+ + G +N V YD E+IGF +C+
Sbjct: 401 TTLLGGIVVRNTLVTYDRHNEKIGFWKTNCSE 432
>gi|224081804|ref|XP_002306494.1| predicted protein [Populus trichocarpa]
gi|222855943|gb|EEE93490.1| predicted protein [Populus trichocarpa]
Length = 564
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 103/388 (26%), Positives = 159/388 (40%), Gaps = 71/388 (18%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGS +T+VPC + C C +++ K F P SS+ C ++ + D
Sbjct: 30 VDTGSSVTYVPCSS----CEQCGRHQDPK----FQPDLSSTYQSVKCN----IDCNCDDE 77
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
K C + Y E +G+L D + G+ + +
Sbjct: 78 ----------------KQQCV-----YERQYAEMSTSSGVLGEDIISF-GNLSALAPQ-- 113
Query: 126 KFCFGC----VGSTYREPI-GIAGFGRGALSVPSQL---GFLQKGFSHCFLAFKYANDPN 177
+ FGC G Y + GI G GRG LS+ L G + FS C Y
Sbjct: 114 RAVFGCENMETGDLYSQHADGIMGMGRGDLSIVDHLVDKGVINDSFSLC-----YGGMGI 168
Query: 178 ISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
+V+G IS N+ F+ P+ YY I L+ I + L PL+ FD
Sbjct: 169 GGGAMVLG--GISPPSNMVFSQ--SDPVRSPYYNIDLKEIHVAGKPL---PLNPTVFD-- 219
Query: 238 GNGGLLVDSGTTYTHLPE-PFYSQLLSILQSTITYYP-RAKEVEERTGFDLCYRVPCPNN 295
G G ++DSGTTY +LPE F S +I++ + P R + D+C+ +
Sbjct: 220 GKHGTILDSGTTYAYLPEAAFVSFKDAIMKELHSLKPIRGPDPNYN---DICFSGAGSDI 276
Query: 296 TFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCL-LFQSMDDGDYGPSGVF 354
+ FP++ F N L+L N+ + S CL +FQ+ D P+ +
Sbjct: 277 SQLSSSFPAVEMVFGNGQKLLLSPENYLFRH---SKVHGAYCLGIFQNGKD----PTTLL 329
Query: 355 GSFQQQNVEVVYDLEKERIGFQPMDCAS 382
G +N V+YD E +IGF +C+
Sbjct: 330 GGIVVRNTLVLYDRENSKIGFWKTNCSE 357
>gi|297720449|ref|NP_001172586.1| Os01g0776900 [Oryza sativa Japonica Group]
gi|255673740|dbj|BAH91316.1| Os01g0776900 [Oryza sativa Japonica Group]
Length = 381
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 81/281 (28%), Positives = 123/281 (43%), Gaps = 47/281 (16%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRN-NKLMSNFSPSRSSSSSRDTCASSFCLNIHS 62
V +DTGSD+ WV C C C N + F+P SS+SS+ C+
Sbjct: 106 VQIDTGSDILWVACS----PCTGCPSSSGLNIQLEFFNPDTSSTSSKIPCSD-------- 153
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTL---KVHGSSPG 119
D CT + + + +++ PC + +TYG+G +G DT+ V G+
Sbjct: 154 -----DRCTAALQTSEAVCQTSDNSPC-GYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQ- 206
Query: 120 IIREIPKFCFGCVGS-------TYREPIGIAGFGRGALSVPSQL---GFLQKGFSHCFLA 169
FGC S T R GI GFG+ LSV SQL G K FSHC
Sbjct: 207 TANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCL-- 264
Query: 170 FKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPL 229
N LV+G++ + L +TP++ P P +Y + LE+I + L P+
Sbjct: 265 ---KGSDNGGGILVLGEIV---EPGLVYTPLV--PSQP-HYNLNLESIVVNGQKL---PI 312
Query: 230 SLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTIT 270
F + G +VDSGTT +L + Y ++ + + ++
Sbjct: 313 DSSLFTTSNTQGTIVDSGTTLAYLADGAYDPFVNAITAAVS 353
>gi|302813128|ref|XP_002988250.1| hypothetical protein SELMODRAFT_427034 [Selaginella moellendorffii]
gi|300143982|gb|EFJ10669.1| hypothetical protein SELMODRAFT_427034 [Selaginella moellendorffii]
Length = 377
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 93/381 (24%), Positives = 150/381 (39%), Gaps = 76/381 (19%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+D ++ +W+PCG S+F P +SS+ S C+S+ C H
Sbjct: 47 VDLNAETSWLPCGK----------------NSSFEPGKSSTFSPLPCSSNACSG-H---- 85
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
C+ + C L K++ P+F T LT T I
Sbjct: 86 ----CSTNKCLLPISPKTSV----PAFQET----------LTGFTAPAGAKGTAI----- 122
Query: 126 KFCFGCVGSTYREPIGIAGFGRGALSVPSQLGF---LQKGFSHCFLAFKYANDPNISSPL 182
FGC + +G+A + +L++P Q+ + + F+ C P+ S L
Sbjct: 123 ---FGCAAG---KSVGVAALSKNSLALPLQIASSFSVPRKFALCL-------SPDSPSSL 169
Query: 183 VIGD------VAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
GD I+ + F P + +P++P+ YY+ L I S L P
Sbjct: 170 FFGDDSSIIIGGINISSLVSFVPFVSNPVFPSRYYLDLRTIQTDFSDLKLDPSLFSINPK 229
Query: 237 QGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNT 296
G GGL + S YT +P P Y+ + + T + + + FDLC+ N
Sbjct: 230 TGIGGLTLSSTNRYTKVPTPVYAAIAQSFKKYATAFNISIVPAQNLPFDLCFNASGMNFN 289
Query: 297 FTDDLFPSITFHFLNNV--SLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVF 354
+FP+I F NN+ +LV + F+ +A+ CL QS GD +
Sbjct: 290 RLGPVFPAIQLIFRNNIPWNLVGSRVIEFF------RGNAIGCLAIQSA--GDPPATSSI 341
Query: 355 GSFQQQNVEVVYDLEKERIGF 375
G F Q + + +DL + R GF
Sbjct: 342 GLFHQFDNLLYFDLAQTRFGF 362
>gi|359476195|ref|XP_002268758.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
gi|296082174|emb|CBI21179.3| unnamed protein product [Vitis vinifera]
Length = 460
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 97/378 (25%), Positives = 154/378 (40%), Gaps = 73/378 (19%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DTGSD TW+ C + S +C + + F+PS SSS S +C I S+
Sbjct: 144 LIIDTGSDTTWIQCNSCSLG--NCHNKKT------FNPSLSSSYSNRSC-------IPST 188
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
D ++ Y + G+ D + + P +
Sbjct: 189 DT-------------------------NYTMKYEDNSYSKGVFVCDEVTL---KPDVF-- 218
Query: 124 IPKFCFGCV---GSTYREPIGIAGFGRGA-LSVPSQLG-FLQKGFSHCFLAFKYANDPNI 178
PKF FGC G + G+ G +G S+ SQ +K FS+CF ++
Sbjct: 219 -PKFQFGCGDSGGGEFGTASGVLGLAKGEQYSLISQTASKFKKKFSYCFPPKEHT----- 272
Query: 179 SSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQG 238
L+ G+ AIS+ +L+FT +L P Y+ + L I++ L +S F S G
Sbjct: 273 LGSLLFGEKAISASPSLKFTQLLNPPSGLGYF-VELIGISVAKKRLN---VSSSLFASPG 328
Query: 239 NGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVP-CPNNTF 297
++DSGT T LP Y L + Q + + P + D CY + C
Sbjct: 329 T---IIDSGTVITRLPTAAYEALRTAFQQEMLHCPSISPPPQEKLLDTCYNLKGCGGRNI 385
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSF 357
P I HF+ V + L +A + + CL F + + + G+
Sbjct: 386 K---LPEIVLHFVGEVDVSLHPSGILWANGDLTQA----CLAFARKSNPSH--VTIIGNR 436
Query: 358 QQQNVEVVYDLEKERIGF 375
QQ +++VVYD+E R+GF
Sbjct: 437 QQVSLKVVYDIEGGRLGF 454
>gi|356539818|ref|XP_003538390.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 457
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 101/400 (25%), Positives = 157/400 (39%), Gaps = 68/400 (17%)
Query: 1 VIQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNI 60
V + +DTGS L+W+ C + ++F PS SS+ S C C
Sbjct: 109 VQPMVLDTGSQLSWIQCHKKA--------PAKPPPTASFDPSLSSTFSTLPCTHPVC--- 157
Query: 61 HSSDNPFDPCTMSGCSLSTLLKSTC--CRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSP 118
P P L ++C R C ++Y Y +G G L R+ S
Sbjct: 158 ----KPRIP--------DFTLPTSCDQNRLC-HYSYFYADGTYAEGNLVREKFTFSRS-- 202
Query: 119 GIIREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFL---------- 168
P GC + +P GI G RG LS SQ + FS+C
Sbjct: 203 ---LFTPPLILGCATES-TDPRGILGMNRGRLSFASQSKITK--FSYCVPTRVTRPGYTP 256
Query: 169 --AFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSP-MYPNYYYIGLEAITIGNSSLT 225
+F ++PN ++ I + L F + P + P Y + L+ I IG L
Sbjct: 257 TGSFYLGHNPNSNTFRYI--------EMLTFARSQRMPNLDPLAYTVALQGIRIGGRKLN 308
Query: 226 EVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGF- 284
P R D+ G+G ++DSG+ +T+L Y ++ + + + PR K+ G
Sbjct: 309 ISPAVFRA-DAGGSGQTMLDSGSEFTYLVNEAYDKVRAEVVRAVG--PRMKKGYVYGGVA 365
Query: 285 DLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMD 344
D+C+ N L + F F V +V+P+ + V C+ + D
Sbjct: 366 DMCFD---GNAIEIGRLIGDMVFEFEKGVQIVVPKERVLATVEG-----GVHCIGIANSD 417
Query: 345 DGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTA 384
S + G+F QQN+ V +DL R+GF DC+ A
Sbjct: 418 KLG-AASNIIGNFHQQNLWVEFDLVNRRMGFGTADCSRLA 456
>gi|56784779|dbj|BAD82000.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
Length = 486
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 85/315 (26%), Positives = 130/315 (41%), Gaps = 30/315 (9%)
Query: 70 CTMSGCSLSTLLKSTCCR---PCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIPK 126
C GC L+ TC PC ++Y YG G T T L V + +R
Sbjct: 157 CANRGCQ--RLVPQTCSADDSPC-GYSYVYGGGAANT---TAGLLAVDAFAFATVRA-DG 209
Query: 127 FCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSPLVIGD 186
FGC +T + G+ G GRG LS SQ LQ G +LA A D + S ++ D
Sbjct: 210 VIFGCAVATEGDIGGVIGLGRGELSPVSQ---LQIGRFSYYLAPDDAVD--VGSFILFLD 264
Query: 187 VAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGGLLVDS 246
A TP++ S + YY+ L I + L +P + + G+GG+++
Sbjct: 265 DAKPRTSRAVSTPLVASRASRSLYYVELAGIRVDGEDLA-IPRGTFDLQADGSGGVVLSI 323
Query: 247 GTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDLFPSIT 306
T L Y + + S I RA + E G DLCY + + PS+
Sbjct: 324 TIPVTFLDAGAYKVVRQAMASKIEL--RAADGSE-LGLDLCYT----SESLATAKVPSMA 376
Query: 307 FHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQNVEVVY 366
F + L GN+FY +++ ++CL GD + GS Q ++Y
Sbjct: 377 LVFAGGAVMELEMGNYFYM----DSTTGLECLTILPSPAGD---GSLLGSLIQVGTHMIY 429
Query: 367 DLEKERIGFQPMDCA 381
D+ R+ F+ ++ A
Sbjct: 430 DISGSRLVFESLEQA 444
>gi|414869114|tpg|DAA47671.1| TPA: hypothetical protein ZEAMMB73_872184 [Zea mays]
Length = 492
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 82/286 (28%), Positives = 124/286 (43%), Gaps = 26/286 (9%)
Query: 101 LVTGILTRDTLKVHGSSPGIIREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQL-GFL 159
V G ++D L V S +++ C S +G R S+PS+L G
Sbjct: 227 FVEGTFSQDVLTVAPSV--AVQDFTFVCLDAGASDGMPEVGTLDLSRDRNSLPSRLAGSA 284
Query: 160 QKGFSHCFLAFKYANDPNISSPLVIGDVAISSKDN-LQFTPMLKS--PMYPNYYYIGLEA 216
FS+C +Y + P L +GD A DN P+L S P N Y+I +
Sbjct: 285 SAAFSYCMP--QYPDSPGF---LSLGDDATVRGDNCTAHAPLLSSDDPDLANMYFIDVVG 339
Query: 217 ITIGNSSLTEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAK 276
+++G+ ++P+ F + N +V++GTT+T L Y+ L + + Y R+
Sbjct: 340 MSLGD---VDLPIPSGTFGN--NASTIVEAGTTFTMLAPDAYTPLRDAFRQAMAQYNRS- 393
Query: 277 EVEERTGFDLCYRVPCPNNTFTDDL-FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSS-A 334
V FD CY N T +L P + F F N SL++ G+ PS
Sbjct: 394 -VPGFYDFDTCY-----NFTGLQELTVPLVEFKFGNGDSLLI-DGDQMLYYDIPSEGPFT 446
Query: 335 VKCLLFQSMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
V CL F ++D D S V G++ EVVYD+ +GF P C
Sbjct: 447 VTCLAFSTLDVDDDDVSAVIGAYSLATTEVVYDVAGGTVGFIPESC 492
>gi|449447285|ref|XP_004141399.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 609
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 97/385 (25%), Positives = 152/385 (39%), Gaps = 67/385 (17%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGS +T+VPC N C+ C ++++ + F P SS+ C N
Sbjct: 106 VDTGSTVTYVPCSN----CVQCGNHQDPR----FQPELSSTYQPVKC------------N 145
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
C +G + + Y E +G+L D + S + +
Sbjct: 146 ADCNCDENGVQCT-------------YERRYAEMSTSSGVLAEDVMSFGKESELVPQ--- 189
Query: 126 KFCFGC----VGSTYREPI-GIAGFGRGALSVPSQL---GFLQKGFSHCFLAFKYANDPN 177
+ FGC G Y + GI G GRG LSV QL G + FS C+
Sbjct: 190 RAVFGCETMESGDLYTQRADGIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGGMDVG---- 245
Query: 178 ISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
+V+G ISS + F+ P YY I L+ I + L P R FD
Sbjct: 246 -GGAMVLG--GISSPPGMVFSH--SDPSRSPYYNIELKEIHVAGKPLKLNP---RTFD-- 295
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTF 297
G G ++DSGTTY + PE Y + I++ + + D+C+ + T
Sbjct: 296 GKYGAILDSGTTYAYFPEKAYYAFKDAIMKKISFLKQISGPDPNFK-DICFSGAGRDVTE 354
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCL-LFQSMDDGDYGPSGVFGS 356
+FP + F N + L N+ + + S CL +F++ +D + + G
Sbjct: 355 LPKVFPEVDMVFANGQKISLSPENYLFRH---TKVSGAYCLGIFKNGND----QTTLLGG 407
Query: 357 FQQQNVEVVYDLEKERIGFQPMDCA 381
+N V Y+ E IGF +C+
Sbjct: 408 IIVRNTLVTYNRENSTIGFWKTNCS 432
>gi|242066168|ref|XP_002454373.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
gi|241934204|gb|EES07349.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
Length = 458
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 96/382 (25%), Positives = 153/382 (40%), Gaps = 63/382 (16%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DTGS LTW+ C C +R + + F+P SSS + +C++ C
Sbjct: 136 MVVDTGSSLTWLQCSPCLVSC-----HRQSGPV--FNPRSSSSYASVSCSAPQC------ 182
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
D T + + ST S C + +YG+ G L++DT+ +S
Sbjct: 183 ----DALTTATLNPSTCSTSNVCI----YQASYGDSSFSVGYLSKDTVSFGSTS------ 228
Query: 124 IPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLG-FLQKGFSHCFLAFKYANDPNIS 179
+P F +GC + + G+ G R LS+ QL + FS+C P S
Sbjct: 229 VPNFYYGCGQDNEGLFGQSAGLIGLARNKLSLLYQLAPSMGYSFSYCL--------PTSS 280
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
S + + +TPM KS + + Y+I + IT+ L+ +S + S
Sbjct: 281 SSSGYLSIGSYNPGQYSYTPMAKSSLDDSLYFIKMTGITVAGKPLS---VSASAYSSLPT 337
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTD 299
++DSGT T LP YS L + + PRA D C++ +
Sbjct: 338 ---IIDSGTVITRLPTDVYSALSKAVAGAMKGTPRASAFSI---LDTCFQ-----GQASR 386
Query: 300 DLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQ 359
P ++ F +L L N + SA CL F + + G+ QQ
Sbjct: 387 LRVPQVSMAFAGGAALKLKATNLLVDVD-----SATTCLAFAPARS-----AAIIGNTQQ 436
Query: 360 QNVEVVYDLEKERIGFQPMDCA 381
Q VVYD++ +IGF C+
Sbjct: 437 QTFSVVYDVKNSKIGFAAGGCS 458
>gi|357137832|ref|XP_003570503.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 564
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 90/402 (22%), Positives = 150/402 (37%), Gaps = 70/402 (17%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGSDLTW+ C +C P R C + N
Sbjct: 211 VDTGSDLTWIQCDAPCTNCAKGPHPLYKPAKEKIVPPRDL----------LCQELQGDQN 260
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
C+ C + Y + G+L +D + + ++ G RE
Sbjct: 261 ----------------YCATCKQC-DYEIEYADRSSSMGVLAKDDMHMIATNGG--REKL 301
Query: 126 KFCFGCV----GSTYREPI---GIAGFGRGALSVPSQL---GFLQKGFSHCFLAFKYAND 175
F FGC G P GI G A+S+PSQL G + F HC +
Sbjct: 302 DFVFGCAYDQQGQLLTSPAKTDGILGLSSAAISLPSQLASQGIISNVFGHCI-----TKE 356
Query: 176 PNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFD 235
PN + +GD + + + + P+ P N Y+ + + G+ L R
Sbjct: 357 PNGGGYMFLGDDYVP-RWGMTWAPIRGGP--DNLYHTEAQKVNYGDQQL-------RMHG 406
Query: 236 SQGNG-GLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPN 294
G+ ++ DSG++YT+LP+ Y +L++ ++ YP + T LC++
Sbjct: 407 QAGSSIQVIFDSGSSYTYLPDEIYKKLVTAIKYD---YPSFVQDTSDTTLPLCWKADFDV 463
Query: 295 NTFTD--DLFPSITFHFLNNVSLVLPQG-----NHFYAMSAPSNSSAVKCLLFQSMDDGD 347
D F + HF N V+P+ + + +S N CL + + D
Sbjct: 464 RYLEDVKQFFKPLNLHF-GNRWFVIPRTFTILPDDYLIISDKGNV----CLGLLNGAEID 518
Query: 348 YGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASAQGL 389
+ + + G + VVYD E+ +IG+ +C +G
Sbjct: 519 HASTLIVGDVSLRGKLVVYDNERRQIGWADSECTKPQPQKGF 560
>gi|449511696|ref|XP_004164029.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 639
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 98/385 (25%), Positives = 153/385 (39%), Gaps = 67/385 (17%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGS +T+VPC N C+ C ++++ + F P SS+ C N
Sbjct: 106 VDTGSTVTYVPCSN----CVQCGNHQDPR----FQPELSSTYQPVKC------------N 145
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
C +G C ++ Y E +G+L D + S + +
Sbjct: 146 ADCNCDENGVQ------------C-TYERRYAEMSTSSGVLAEDVMSFGKESELVPQ--- 189
Query: 126 KFCFGC----VGSTYREPI-GIAGFGRGALSVPSQL---GFLQKGFSHCFLAFKYANDPN 177
+ FGC G Y + GI G GRG LSV QL G + FS C+
Sbjct: 190 RAVFGCETMESGDLYTQRADGIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGGMDVG---- 245
Query: 178 ISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
+V+G ISS + F+ P YY I L+ I + L P R FD
Sbjct: 246 -GGAMVLG--GISSPPGMVFSH--SDPSRSPYYNIELKEIHVAGKPLKLNP---RTFD-- 295
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTF 297
G G ++DSGTTY + PE Y + I++ + + D+C+ + T
Sbjct: 296 GKYGAILDSGTTYAYFPEKAYYAFKDAIMKKISFLKQISGPDPNFK-DICFSGAGRDVTE 354
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCL-LFQSMDDGDYGPSGVFGS 356
+FP + F N + L N+ + + S CL +F++ +D + + G
Sbjct: 355 LPKVFPEVDMVFANGQKISLSPENYLFRH---TKVSGAYCLGIFKNGND----QTTLLGG 407
Query: 357 FQQQNVEVVYDLEKERIGFQPMDCA 381
+N V Y+ E IGF +C+
Sbjct: 408 IIVRNTLVTYNRENSTIGFWKTNCS 432
>gi|115448353|ref|NP_001047956.1| Os02g0720900 [Oryza sativa Japonica Group]
gi|45735844|dbj|BAD12879.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735970|dbj|BAD12999.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|113537487|dbj|BAF09870.1| Os02g0720900 [Oryza sativa Japonica Group]
gi|125540930|gb|EAY87325.1| hypothetical protein OsI_08729 [Oryza sativa Indica Group]
gi|215692622|dbj|BAG88042.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 458
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 91/382 (23%), Positives = 152/382 (39%), Gaps = 64/382 (16%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DTGS LTW+ C C +R + + F+P SS+ + C++ C S
Sbjct: 137 MVVDTGSSLTWLQCSPCLVSC-----HRQSGPV--FNPKSSSTYASVGCSAQQC-----S 184
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
D P S CS S + C + +YG+ G L++DT+ +S
Sbjct: 185 DLPSATLNPSACSSSNV----CI-----YQASYGDSSFSVGYLSKDTVSFGSTS------ 229
Query: 124 IPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLG-FLQKGFSHCFLAFKYANDPNIS 179
+P F +GC + G+ G R LS+ QL L F++C + + ++
Sbjct: 230 LPNFYYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYSFTYCLPSSSSSGYLSLG 289
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
S + +TPM+ S + + Y+I L +T+ + L+ + +
Sbjct: 290 S---------YNPGQYSYTPMVSSSLDDSLYFIKLSGMTVAGNPLSVSSSAYSSLPT--- 337
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTD 299
++DSGT T LP YS L + + + RA D C++ +
Sbjct: 338 ---IIDSGTVITRLPTSVYSALSKAVAAAMKGTSRASAYSI---LDTCFKGQASRVSA-- 389
Query: 300 DLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQ 359
P++T F +L L N + + CL F + + G+ QQ
Sbjct: 390 ---PAVTMSFAGGAALKLSAQNLLVDVD-----DSTTCLAFAPARS-----AAIIGNTQQ 436
Query: 360 QNVEVVYDLEKERIGFQPMDCA 381
Q VVYD++ RIGF C+
Sbjct: 437 QTFSVVYDVKSSRIGFAAGGCS 458
>gi|22165126|gb|AAM93742.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
Japonica Group]
gi|31433307|gb|AAP54836.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
gi|125575547|gb|EAZ16831.1| hypothetical protein OsJ_32302 [Oryza sativa Japonica Group]
Length = 405
Score = 78.6 bits (192), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 101/404 (25%), Positives = 154/404 (38%), Gaps = 93/404 (23%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNI- 60
+ +D +L W C C C + + F P++SS+ C S C +I
Sbjct: 70 VSAVVDLTGELVWTQC----TPCQPCFEQD----LPLFDPTKSSTFRGLPCGSHLCESIP 121
Query: 61 HSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
SS N CT C P+ A G G DT + G
Sbjct: 122 ESSRN----CT----------SDVCIYEAPTKAGDTG------GKAGTDTFAI-----GA 156
Query: 121 IREIPKFCFGCVGSTYRE------PIGIAGFGRGALSVPSQLGFLQKGFSHCFL------ 168
+E FGCV T + P GI G GR S+ +Q+ FS+C
Sbjct: 157 AKET--LGFGCVVMTDKRLKTIGGPSGIVGLGRTPWSLVTQMNV--TAFSYCLAGKSSGA 212
Query: 169 ------AFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNS 222
A + A N S+P VI A SS + +P YY + L I G +
Sbjct: 213 LFLGATAKQLAGGKNSSTPFVIKTSAGSSDNG-------SNP----YYMVKLAGIKTGGA 261
Query: 223 SLTEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERT 282
L + S +L+D+ + ++L + Y L L + + P A +
Sbjct: 262 PL--------QAASSSGSTVLLDTVSRASYLADGAYKALKKALTAAVGVQPVASPPKP-- 311
Query: 283 GFDLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQS 342
+DLC+ P D P + F F +L +P N+ A + + CL S
Sbjct: 312 -YDLCF----PKAVAGDA--PELVFTFDGGAALTVPPANYLLA-----SGNGTVCLTIGS 359
Query: 343 MDD----GDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCAS 382
G+ + + GS QQ+NV V++DL++E + F+P DC+S
Sbjct: 360 SASLNLTGELEGASILGSLQQENVHVLFDLKEETLSFKPADCSS 403
>gi|88174556|gb|ABD39353.1| chloroplast nucleoid DNA-binding protein [Oryza barthii]
Length = 321
Score = 78.2 bits (191), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 85/348 (24%), Positives = 140/348 (40%), Gaps = 63/348 (18%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCL----N 59
V +DTGS +WV C +CD N F SRS++ ++ +C +S CL +
Sbjct: 16 VEIDTGSSTSWVFC--------ECDGCHTNP--RTFLQSRSTTCAKVSCGTSMCLLGGSD 65
Query: 60 IHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPG 119
H D+ P CP F +Y +G GIL +DTL
Sbjct: 66 PHCQDSENYP------------------DCP-FRVSYQDGSASYGILYQDTLTFSD---- 102
Query: 120 IIREIPKFCFGCV-----GSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYAN 174
+++IP F FGC + + G+ G G GA+SV Q FS+C K
Sbjct: 103 -VQKIPGFSFGCNMDSFGANEFGNVDGLLGMGAGAMSVLKQSSPTFDCFSYCLPLQKSER 161
Query: 175 D--PNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLR 232
+ +G VA ++ ++++T M+ +++ L AI++ L P
Sbjct: 162 GFFSKTTGYFSLGKVA--TRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSIF- 218
Query: 233 EFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPC 292
G++ DSG+ +++P+ S L ++ + A+E ER +D+
Sbjct: 219 -----SRKGVVFDSGSELSYIPDRALSVLSQRIRELLLRRGAAEEESERNCYDM------ 267
Query: 293 PNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLF 340
+ + P+I+ HF + L +G F S V CL F
Sbjct: 268 --RSVDEGDMPAISLHFDDGARFDLGRGGVFVERSVQEQD--VWCLAF 311
>gi|449527151|ref|XP_004170576.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 523
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 106/379 (27%), Positives = 152/379 (40%), Gaps = 62/379 (16%)
Query: 7 DTGSDLTWVPCGNLSFDCMDCDDYRN--NKLMSNFSPSRSSSSSRDTCASSFCLNIHSSD 64
DTGSD++W+ C CD ++ F P SSS S +C S C H D
Sbjct: 202 DTGSDVSWL-------QCQPCDGENGCYKQIGPIFDPKSSSSYSPLSCDSEQC---HLLD 251
Query: 65 NPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREI 124
C + C + YG+G G L +T S+ I
Sbjct: 252 EA--ACDANSCI---------------YEVEYGDGSFTVGELATETFSFRHSN-----SI 289
Query: 125 PKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSP 181
P GC + G+ G G GA+S+ SQL FS+C + D SS
Sbjct: 290 PNLPIGCGHDNEGLFVGAAGLIGLGGGAISLSSQLE--ATSFSYCLVDL----DSESSST 343
Query: 182 LVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGG 241
L A D+L +P++K+ +P + Y+ + +++G L + S E D G+GG
Sbjct: 344 LDFN--ADQPSDSLT-SPLVKNDRFPTFRYVKVIGMSVGGKPL-PISSSSFEIDESGSGG 399
Query: 242 LLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDL 301
++VDSGTT T +P Y L P A V + FD CY + +N
Sbjct: 400 IIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGV---SPFDTCYDLSSQSNVEV--- 453
Query: 302 FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQN 361
P+I F SL LP N + + +S+ CL F P + G+ QQQ
Sbjct: 454 -PTIAFILPGENSLQLPAKNCLFQV----DSAGTFCLAFLP----STFPLSIIGNVQQQG 504
Query: 362 VEVVYDLEKERIGFQPMDC 380
+ V YDL +GF C
Sbjct: 505 IRVSYDLANSLVGFSTDKC 523
>gi|357507803|ref|XP_003624190.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355499205|gb|AES80408.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 476
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 108/401 (26%), Positives = 165/401 (41%), Gaps = 67/401 (16%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKL-MSNFSPSRSSSSSRDTCASSFCLNIHS 62
V +DTGSD+ WV C C C + ++ + P+ S +S+ C FC + +S
Sbjct: 87 VQVDTGSDILWVNCAG----CTACPKKSGLGMDLTLYDPNGSKTSNAVPCGDGFCTDTYS 142
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIR 122
+SGC CP ++ TYG+G +G D+L S G +
Sbjct: 143 G-------PISGCKQDM--------SCP-YSITYGDGSTTSGSFVNDSLTFDEVS-GNLH 185
Query: 123 EIP---KFCFGC-------VGSTYREPI-GIAGFGRGALSVPSQL---GFLQKGFSHCFL 168
P FGC + S E + GI GFG+ SV SQL G +++ FSHC
Sbjct: 186 TKPDNSSVIFGCGAKQSGSLSSNSDEALDGIIGFGQANSSVLSQLAASGKVKRIFSHCL- 244
Query: 169 AFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVP 228
+ + IG V + TP++ P +Y I + G L +P
Sbjct: 245 -----DSHHGGGIFSIGQVM---EPKFNTTPLV--PRMAHYNVILKDMDVDGEPIL--LP 292
Query: 229 LSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCY 288
L L FDS G ++DSGTT +LP Y+QL L + P K + F C+
Sbjct: 293 LYL--FDSGSGRGTIIDSGTTLAYLPLSIYNQL---LPKVLGRQPGLKLMIVEDQFT-CF 346
Query: 289 RVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQ--SMDDG 346
+ D+ FP + FHF +SL + ++ + + C+ +Q S
Sbjct: 347 HY----SDKLDEGFPVVKFHF-EGLSLTVHPHDYLFLYKED-----IYCIGWQKSSTQTK 396
Query: 347 DYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASAQ 387
+ + G N VVYDLE IG+ +C+S+ +
Sbjct: 397 EGRDLILIGDLVLSNKLVVYDLENMVIGWTNFNCSSSIKVK 437
>gi|45735845|dbj|BAD12880.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735971|dbj|BAD13000.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
Length = 333
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 91/380 (23%), Positives = 148/380 (38%), Gaps = 64/380 (16%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGS LTW+ C C +R + + F+P SS+ + C++ C SD
Sbjct: 14 VDTGSSLTWLQCSPCLVSC-----HRQSGPV--FNPKSSSTYASVGCSAQQC-----SDL 61
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
P S CS S + + +YG+ G L++DT+ +S +P
Sbjct: 62 PSATLNPSACSSSNVCI---------YQASYGDSSFSVGYLSKDTVSFGSTS------LP 106
Query: 126 KFCFGCVGST---YREPIGIAGFGRGALSVPSQLG-FLQKGFSHCFLAFKYANDPNISSP 181
F +GC + G+ G R LS+ QL L F++C SS
Sbjct: 107 NFYYGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYSFTYCL---------PSSSS 157
Query: 182 LVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGG 241
+ + +TPM+ S + + Y+I L +T+ + L+ + +
Sbjct: 158 SGYLSLGSYNPGQYSYTPMVSSSLDDSLYFIKLSGMTVAGNPLSVSSSAYSSLPT----- 212
Query: 242 LLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDL 301
++DSGT T LP YS L + + + RA D C++ +
Sbjct: 213 -IIDSGTVITRLPTSVYSALSKAVAAAMKGTSRASAYSI---LDTCFKGQASRVSA---- 264
Query: 302 FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQN 361
P++T F +L L N + + CL F + + G+ QQQ
Sbjct: 265 -PAVTMSFAGGAALKLSAQNLLVDVD-----DSTTCLAFAPARS-----AAIIGNTQQQT 313
Query: 362 VEVVYDLEKERIGFQPMDCA 381
VVYD++ RIGF C+
Sbjct: 314 FSVVYDVKSSRIGFAAGGCS 333
>gi|413944387|gb|AFW77036.1| hypothetical protein ZEAMMB73_461996 [Zea mays]
Length = 472
Score = 77.8 bits (190), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 95/384 (24%), Positives = 154/384 (40%), Gaps = 60/384 (15%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V +DTGSDL+WV C C Y + + P+ SS+ + C S C ++
Sbjct: 142 VLIDTGSDLSWVQCK----PCNSSSCYPQKDPL--YDPTASSTYAPVPCDSKACKDL--V 193
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
+ +D GC+ S+ ++ C+ + YG G+ + +TL + SP +
Sbjct: 194 PDAYD----HGCTNSS--GTSLCQ----YGIEYGNRDTTVGVYSTETLTL---SPQV--S 238
Query: 124 IPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSPLV 183
+ F FGC G + + G P L + + AF Y P S+
Sbjct: 239 VKDFGFGC-GLVQQGTFDLFDGLLGLGGAPESL--VSQTAETYGGAFSYCLPPGNST--- 292
Query: 184 IGDVAISSKDN------LQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
G +A+ + N FTP+ P +Y + L +++G L P L
Sbjct: 293 TGFLALGAPTNNNDTAGFLFTPLHSLPEQATFYLVNLTGVSVGGKPLDIPPTVL------ 346
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTF 297
+GG+++DSGT T LP+ YS L + ++ ++ YP + D CY
Sbjct: 347 -SGGMIIDSGTIITGLPDTAYSALRTAFRTAMSAYPLLPPNNDDV-LDTCYNF----TGI 400
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQ-SMDDGDYGPSGVFGS 356
+ P++ F G + PS CL F DGD G+ G+
Sbjct: 401 ANVTVPTVALTF---------DGGATIDLDVPSGVLIQDCLAFAGGASDGDV---GIIGN 448
Query: 357 FQQQNVEVVYDLEKERIGFQPMDC 380
Q+ EV+YD + +GF+P C
Sbjct: 449 VNQRTFEVLYDSGRGHVGFRPGAC 472
>gi|224114179|ref|XP_002332420.1| predicted protein [Populus trichocarpa]
gi|222832373|gb|EEE70850.1| predicted protein [Populus trichocarpa]
Length = 449
Score = 77.8 bits (190), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 102/393 (25%), Positives = 163/393 (41%), Gaps = 70/393 (17%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTG++L+W+ C C + + N + P +SS S+ S N HS
Sbjct: 105 IDTGNELSWI-------QCEGCQN-KGNMCFPHKDPPYTSSQSKSYKPVS--CNQHSFCE 154
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
P + C C+ + TYG G +G L +T + S+ G +
Sbjct: 155 P-NQCKEGLCA---------------YNVTYGPGSYTSGNLANETFTFY-SNHGKHTALK 197
Query: 126 KFCFGCVGSTY---------REPI-GIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYAN 174
FGC + + P+ G+ G G G S +QLG + G FS+C A N
Sbjct: 198 SISFGCSTDSRNMIYAFLLDKNPVSGVLGMGWGPRSFLAQLGSISHGKFSYCITANNTHN 257
Query: 175 DPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNY-YYIGLEAITIGNSSL--TEVPLSL 231
+ L G + SK NLQ T +++ + P+ Y++ L I++ L T+ L++
Sbjct: 258 -----TYLRFGKHVVKSK-NLQTTKIMQ--VKPSAAYHVNLLGISVNGVKLNITKTDLAV 309
Query: 232 REFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKE-VEERTGFDLCYRV 290
R+ G+ G ++D+GT T L +P + L + L + ++ K V + DLCY
Sbjct: 310 RK---DGSRGCIIDAGTLATLLVKPIFDTLHTALSNHLSSNQNLKRWVIHKLHKDLCYE- 365
Query: 291 PCPNNTFTD---DLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGD 347
+D P +TFH N V P+ + N V CL S D
Sbjct: 366 -----QLSDAGRKNLPVVTFHLENADLEVKPEAIFLFREFEGKN---VFCLSMLSDDS-- 415
Query: 348 YGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
+ G++QQ + VYD + + F P DC
Sbjct: 416 ---KTIIGAYQQMKQKFVYDTKARVLSFGPEDC 445
>gi|356548395|ref|XP_003542587.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 525
Score = 77.8 bits (190), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 95/401 (23%), Positives = 155/401 (38%), Gaps = 83/401 (20%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRN------NKLMSNFSPSRSSSSSRDTCASSFC 57
V +D GSD+ WVPC DC++C ++ ++ + PS S++S C C
Sbjct: 120 VALDAGSDMLWVPC-----DCIECASLSAGNYNVLDRDLNQYRPSLSNTSRHLPCGHKLC 174
Query: 58 LNIHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKV---- 113
++HS C S PCP +G + D L +
Sbjct: 175 -DVHSF------CKGSK------------DPCPYEVQYASANTSSSGYVFEDKLHLTSDG 215
Query: 114 -HGSSPGIIREIPKFCFGCVGSTYRE---PIGIAGFGRGALSVPSQL---GFLQKGFSHC 166
H + I C Y P G+ G G G +SVPS L G +Q FS C
Sbjct: 216 KHAEQNSVQASIILGCGRKQTGDYLHGAGPDGVLGLGPGNISVPSLLAKAGLIQNSFSIC 275
Query: 167 FLAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTE 226
D N S ++ GD ++ + F P++ Y +G+E+ +G+
Sbjct: 276 L-------DENESGRIIFGDQGHVTQHSTPFLPIIA-------YMVGVESFCVGS----- 316
Query: 227 VPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDL 286
L L+E Q L+DSG+++T LP Y ++++ + A + ++ ++
Sbjct: 317 --LCLKETRFQA----LIDSGSSFTFLPNEVYQKVVTEFDKQVN----ASRIVLQSSWEY 366
Query: 287 CYRVPCPNNTFTDDL--FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMD 344
CY N + +L P + F N + ++ Q FY ++ + CL
Sbjct: 367 CY------NASSQELVNIPPLKLAFSRNQTFLI-QNPIFYDPASQEQEYTIFCLPVSPSA 419
Query: 345 DGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTAS 385
D DY G +V+D E R G+ +C AS
Sbjct: 420 D-DY---AAIGQNFLMGYRLVFDRENLRFGWSRWNCQDRAS 456
>gi|148907857|gb|ABR17052.1| unknown [Picea sitchensis]
Length = 422
Score = 77.8 bits (190), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 77/287 (26%), Positives = 127/287 (44%), Gaps = 29/287 (10%)
Query: 106 LTRDTLKV---HGSSPGIIREIPKFCFGCVGSTYRE---PIGIAGFGRGALSVPSQL--- 156
L +D L + GS+PG + P+ F C S+ R +G+AG L++PSQL
Sbjct: 128 LAQDVLVLPSSDGSNPGPLARFPQLAFACDLSSNRVISGTVGVAGMTSSTLALPSQLSAA 187
Query: 157 -GFLQKGFSHCFL------AFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNY 209
GF +K F+ C A + ++P + P D++ + TP++K+ +Y +
Sbjct: 188 EGFSRK-FAMCLPSGNAPGALFFGDEPLVFLPPPGRDLS----SQIIRTPLIKNSVYTDV 242
Query: 210 YYIGLEAITIGNSSLTEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTI 269
+Y+G++ I +G ++ LR FD G GG + + YT L P Y+ L + S +
Sbjct: 243 FYLGVQRIEVGGVNVAIDAEKLR-FDKDGRGGTKLSTVVRYTQLASPIYNSLEGVFTS-V 300
Query: 270 TYYPRAKEVEERTGFDLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAP 329
V + F C+ +T P+I N + F A S
Sbjct: 301 AKKMNITRVASVSPFGACFDSSGVGSTRVGPAVPTIDIVLQGNSTTTW---RIFGANSMV 357
Query: 330 SNSSAVKCLLFQSMDDGD-YGPSGVFGSFQQQNVEVVYDLEKERIGF 375
++ V CL F +D GD S V G++Q Q+ + +DL +GF
Sbjct: 358 RVNNKVLCLGF--VDGGDNLQQSIVIGTYQMQDNLLQFDLATSTLGF 402
>gi|115473125|ref|NP_001060161.1| Os07g0592200 [Oryza sativa Japonica Group]
gi|29027762|dbj|BAC65898.1| putative CND41, chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|113611697|dbj|BAF22075.1| Os07g0592200 [Oryza sativa Japonica Group]
Length = 631
Score = 77.8 bits (190), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 93/389 (23%), Positives = 155/389 (39%), Gaps = 69/389 (17%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +D+GS +T+VPC C C ++++ + F P SS+ S C N+ +
Sbjct: 106 LIVDSGSTVTYVPCAT----CEQCGNHQDPR----FQPDLSSTYSPVKC------NVDCT 151
Query: 64 -DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIR 122
DN CT + Y E +G+L D + S
Sbjct: 152 CDNERSQCT--------------------YERQYAEMSSSSGVLGEDIMSFGKESE---L 188
Query: 123 EIPKFCFGCVGSTY-----REPIGIAGFGRGALSVPSQL---GFLQKGFSHCFLAFKYAN 174
+ + FGC + + GI G GRG LS+ QL G + FS C+
Sbjct: 189 KPQRAVFGCENTETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDVG- 247
Query: 175 DPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREF 234
+V+G + + ++SP YY I L+ I + +L L + F
Sbjct: 248 ----GGTMVLGGMPAPPDMVFSHSNPVRSP----YYNIELKEIHVAGKALR---LDPKIF 296
Query: 235 DSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPN 294
+S+ G ++DSGTTY +LPE + + + + + + + D+C+ N
Sbjct: 297 NSKH--GTVLDSGTTYAYLPEQAFVAFKDAVTNKVNSLKKIRGPDPNYK-DICFAGAGRN 353
Query: 295 NTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCL-LFQSMDDGDYGPSGV 353
+ ++FP + F N L L N+ + S CL +FQ+ D P+ +
Sbjct: 354 VSQLSEVFPDVDMVFGNGQKLSLSPENYLFRH---SKVEGAYCLGVFQNGKD----PTTL 406
Query: 354 FGSFQQQNVEVVYDLEKERIGFQPMDCAS 382
G +N V YD E+IGF +C+
Sbjct: 407 LGGIVVRNTLVTYDRHNEKIGFWKTNCSE 435
>gi|359476197|ref|XP_003631803.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 414
Score = 77.8 bits (190), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 89/322 (27%), Positives = 138/322 (42%), Gaps = 42/322 (13%)
Query: 60 IHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPG 119
+ S FDP SL + + ST ++ TYG+ G DT+ + S
Sbjct: 110 LKDSHRHFDPSASLTYSLGSCIPSTVGN---TYNMTYGDKSTSVGNYGCDTMTLEPSD-- 164
Query: 120 IIREIPKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGF-LQKGFSHCFLAFKYAN 174
PKF FGC G G+ G G+G LS SQ +K FS+C
Sbjct: 165 ---VFPKFQFGCGRNNEGDFGSGADGMLGLGQGQLSTVSQTASKFKKVFSYCL-----PE 216
Query: 175 DPNISSPLVIGDVAISSKDNLQFTPMLKSP-----MYPNYYYIGLEAITIGNSSLTEVPL 229
+ +I S L+ G+ A +S+ +L+FT ++ P YY++ L I++GN L VP
Sbjct: 217 EDSIGS-LLFGEKA-TSQSSLKFTSLVNGPGTSGLEESGYYFVKLLDISVGNKRL-NVPS 273
Query: 230 SLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTG-FDLCY 288
S+ F S G ++DSGT T LP+ YS L + + + YP + ++ D CY
Sbjct: 274 SV--FASPGT---IIDSGTVITCLPQRAYSALTAAFKKAMAKYPLSNGRRKKGDILDTCY 328
Query: 289 RVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDY 348
+ + D L P I HF + L + N ++ CL F
Sbjct: 329 NL----SGRKDVLLPEIVLHFGEGADVRLNGKRVIWG-----NDASRLCLAFAGNSKSTM 379
Query: 349 GPS-GVFGSFQQQNVEVVYDLE 369
+ G+ QQ ++ V+YD++
Sbjct: 380 NSELTIIGNRQQVSLTVLYDIQ 401
>gi|357469591|ref|XP_003605080.1| Aspartic proteinase Asp1 [Medicago truncatula]
gi|355506135|gb|AES87277.1| Aspartic proteinase Asp1 [Medicago truncatula]
Length = 425
Score = 77.8 bits (190), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 90/392 (22%), Positives = 154/392 (39%), Gaps = 68/392 (17%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGSDLTWV C C C ++ N C+ C+ S+
Sbjct: 79 IDTGSDLTWVQCDGPDAPCKGCTMPKDKLYKPN-------GKQVVKCSDPICVATQSTH- 130
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYT--YGEGGLVTGILTRDTLKVHGSSPGIIRE 123
+L C + P Y Y + G+L RD + H SP +
Sbjct: 131 --------------VLGQICSKQSPPCVYNVQYADHASTLGVLVRDYM--HIGSPSSSTK 174
Query: 124 IPKFCFGC------VGST--YREPIGIAGFGRGALSVPSQL---GFLQKGFSHCFLAFKY 172
P FGC G T + +P GI G G G S+ SQL GF+ HC A
Sbjct: 175 DPLVAFGCGYEQKFSGPTPPHSKPAGILGLGNGKTSILSQLTSIGFIHNVLGHCLSA--- 231
Query: 173 ANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLR 232
L +GD + S + +TP+++S + +Y N+ ++ + +
Sbjct: 232 ----EGGGYLFLGDKFVPS-SGIVWTPIIQSSLEKHY-----------NTGPVDLFFNGK 275
Query: 233 EFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPC 292
++G ++ DSG++YT+ P Y+ + +++ + + P ++ + +C++
Sbjct: 276 PTPAKGL-QIIFDSGSSYTYFSSPVYTIVANMVNNDLKGKPLSRVKDPS--LPICWKGVK 332
Query: 293 PNNTFTD--DLFPSITFHFL--NNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDY 348
P + + + F +T F N+ LP + CL + ++
Sbjct: 333 PFKSLNEVNNYFKPLTLSFTKSKNLQFQLPPVAYLIITKY-----GNVCLGILNGNEAGL 387
Query: 349 GPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
G V G Q+ VVYD EK++IG+ +C
Sbjct: 388 GNRNVVGDISLQDKVVVYDNEKQQIGWASANC 419
>gi|359476193|ref|XP_003631802.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Vitis vinifera]
Length = 496
Score = 77.8 bits (190), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 97/377 (25%), Positives = 151/377 (40%), Gaps = 81/377 (21%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DTGS +TW C C+ C + +S
Sbjct: 177 LILDTGSSITWTQCK----PCVRC--------------------------------LKAS 200
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
FDP SL + + ST ++ TYG+ G DT+ + S
Sbjct: 201 RRHFDPSASLTYSLGSCIPSTVGN---TYNMTYGDKSTSVGNYGCDTMTLEHSD-----V 252
Query: 124 IPKFCFGC----VGSTYREPIGIAGFGRGALSVPSQLGF-LQKGFSHCFLAFKYANDPNI 178
PKF FGC G G+ G G+G LS SQ +K FS+C + +I
Sbjct: 253 FPKFQFGCGRNNEGDFGSGADGMLGLGQGQLSTVSQTASKFKKVFSYCL-----PEEDSI 307
Query: 179 SSPLVIGDVAISSKDNLQFTPMLKSP-----MYPNYYYIGLEAITIGNSSLTEVPLSLRE 233
S L+ G+ A S +L+FT ++ P YY++ L I++GN L +P S+
Sbjct: 308 GS-LLFGEKATSQSSSLKFTSLVNGPGTSGLEESGYYFVKLLDISVGNKRL-NIPSSV-- 363
Query: 234 FDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTG-FDLCYRVPC 292
F S G ++DSGT T LP+ YS L + + + YP + ++ D CY +
Sbjct: 364 FASPGT---IIDSGTVITRLPQRAYSALKAAFKKAMAKYPLSNGRRKKGDILDTCYNL-- 418
Query: 293 PNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSG 352
+ D L P I HF + L + N ++ CL F +
Sbjct: 419 --SGRKDVLLPEIVLHFGEGADVRLNGKRVIWG-----NDASRLCLAFAGNSE-----LT 466
Query: 353 VFGSFQQQNVEVVYDLE 369
+ G+ QQ ++ V+YD++
Sbjct: 467 IIGNRQQVSLTVLYDIQ 483
>gi|242067691|ref|XP_002449122.1| hypothetical protein SORBIDRAFT_05g005410 [Sorghum bicolor]
gi|241934965|gb|EES08110.1| hypothetical protein SORBIDRAFT_05g005410 [Sorghum bicolor]
Length = 407
Score = 77.8 bits (190), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 98/400 (24%), Positives = 154/400 (38%), Gaps = 88/400 (22%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDD-----YRNNKLMSNFSPSRSSSSSRDTCASSFCLNI 60
+DTGS+LTW+ C C C+ YR KL+ CA C +
Sbjct: 57 IDTGSNLTWIKCHATPGPCKTCNKVPHPLYRPKKLVP--------------CADPLCDAL 102
Query: 61 HSSDNPFDPCTMSGCSLSTLLKSTCCRPCPS---FAYTYGEGGLVTGILTRDTLKVHGSS 117
H L + CR P + Y +G G+L D +
Sbjct: 103 HKD----------------LGTTKDCREEPDQCHYQINYADGTTSLGVLLLDKFSL---P 143
Query: 118 PGIIREIPKFCFGC-----VGSTYREPI-----GIAGFGRGALSVPSQL---GFLQKG-F 163
G R I FGC G + P GI G GRG++ + SQL G + K
Sbjct: 144 TGSARNI---AFGCGYDQMQGPKKKAPEKVPVDGILGLGRGSVDLVSQLKHSGAVSKNVI 200
Query: 164 SHCFLAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSS 223
HC + L IG+ + S +L + PN+Y G + +G +
Sbjct: 201 GHCLSS-------KGGGYLFIGEENVPS-SHLHIIYIYCISREPNHYSPGQATLHLGRN- 251
Query: 224 LTEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTG 283
P+ + F + + DSG+TYT+LPE ++QL+S L++++ + T
Sbjct: 252 ----PIGTKPFKA------IFDSGSTYTYLPENLHAQLVSALKASLIKSSLKLVSDTDTR 301
Query: 284 FDLCYRVPCPNNTFTD---DLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLF 340
LC++ P P T D + +T F + V++ +P N + ++ N+ C
Sbjct: 302 LHLCWKGPKPFKTVHDLPKEFKSLVTLKFDHGVTMTIPPEN-YLIITGHGNA----CFGI 356
Query: 341 QSMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
+ D V G Q V++D EK R+ + P C
Sbjct: 357 LELPGYDL---FVIGGISMQEQLVIHDNEKGRLAWMPSPC 393
>gi|414589629|tpg|DAA40200.1| TPA: hypothetical protein ZEAMMB73_727364, partial [Zea mays]
Length = 201
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 57/190 (30%), Positives = 89/190 (46%), Gaps = 16/190 (8%)
Query: 194 NLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGGLLVDSGTTYTHL 253
+Q TP+L+SP P +YY+ +T+G L +P S G+GG++VDSGT T L
Sbjct: 25 RVQTTPLLQSPQNPTFYYVHFTGLTVGARRL-RIPESAFALRPDGSGGVIVDSGTALTLL 83
Query: 254 PEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCP--NNTFTDDL-FPSITFHFL 310
P ++++ + + P A G +C+ VP ++ T + P + HF
Sbjct: 84 PAAVLAEVVRAFRQQLR-LPFANGGNPEDG--VCFLVPAAWRRSSSTSQMPVPRMVLHF- 139
Query: 311 NNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQNVEVVYDLEK 370
L LP+ N+ + CLL D GD G + G+ QQ++ V+YDLE
Sbjct: 140 QGADLDLPRRNYVL----DDHRRGRLCLLL--ADSGDDGST--IGNLVQQDMRVLYDLEA 191
Query: 371 ERIGFQPMDC 380
E + P C
Sbjct: 192 ETLSIAPARC 201
>gi|222616654|gb|EEE52786.1| hypothetical protein OsJ_35257 [Oryza sativa Japonica Group]
Length = 346
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 96/383 (25%), Positives = 154/383 (40%), Gaps = 56/383 (14%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V +DTGS L+WV C N C D + K F+P SS+ S+ C++ C +H
Sbjct: 14 VTIDTGSTLSWVQCKNCQIKCYD----QAAKAGQIFNPYNSSTYSKVGCSTEACNGMH-- 67
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
D GC TC ++ YG G G L +D L + + R
Sbjct: 68 ---MDLAVEYGCVEE---DDTCI-----YSLRYGSGEYSVGYLGKDRLTLASN-----RS 111
Query: 124 IPKFCFGCVGSTYREPI--GIAGFGRGALSVPSQLGFLQK----GFSHCFLAFKYANDPN 177
I F FGC + GI GFG + S +Q+ Q+ FS+CF D
Sbjct: 112 IDNFIFGCGEDNLYNGVNAGIIGFGTKSYSFFNQV--CQQTDYTAFSYCF-----PRDHE 164
Query: 178 ISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
L IG A NL +T ++ P Y L+ + G + + + +
Sbjct: 165 NEGSLTIGPYA--RDINLMWTKLIYYDHKPAYAIQQLDMMVNGIRLEIDPYIYISKM--- 219
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTF 297
+VDSGT T++ P + L + + + +ER +C+ + +
Sbjct: 220 ----TIVDSGTADTYILSPVFDALDKAMTKEMQAKGYTRGWDERR---ICFISNSGSANW 272
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSF 357
D FP++ + + +L LP N FY +S+ V C F D G G + G+
Sbjct: 273 ND--FPTVEMKLIRS-TLKLPVENAFY-----ESSNNVICSTFLPDDAGVRGVQ-MLGNR 323
Query: 358 QQQNVEVVYDLEKERIGFQPMDC 380
++ ++V+D++ GF+ C
Sbjct: 324 AVRSFKLVFDIQAMNFGFKARAC 346
>gi|297847186|ref|XP_002891474.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297337316|gb|EFH67733.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 578
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 98/400 (24%), Positives = 158/400 (39%), Gaps = 76/400 (19%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGSDLTW+ C C C K + R + R + FC+ + +
Sbjct: 217 IDTGSDLTWIQC---DAPCTSC-----AKGANQLYKPRKDNLVR--SSEPFCVEVQRNQ- 265
Query: 66 PFDPCTMSGCSLSTLLKSTC--CRPCPSFAYTYGEGGLVTGILTRDT--LKVHGSSPGII 121
L C C C + Y + G+LT+D LK+H G +
Sbjct: 266 ---------------LTEHCESCHQC-DYEIEYADHSYSMGVLTKDKFHLKLHN---GSL 306
Query: 122 REIPKFCFGC-------VGSTYREPIGIAGFGRGALSVPSQL---GFLQKGFSHCFLAFK 171
E FGC + +T + GI G R +S+PSQL G + HC
Sbjct: 307 AE-SDIVFGCGYDQQGLLLNTLLKTDGILGLSRAKISLPSQLASRGIISNVVGHCL---- 361
Query: 172 YANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSL 231
A+D N + +G + S + + PML P + Y + + ++ GN+ L+
Sbjct: 362 -ASDLNGEGYIFMGSDLVPSH-GMTWVPMLHHP-HLEVYQMQVTKMSYGNAMLS------ 412
Query: 232 REFDSQGN--GGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYR 289
D + G +L D+G++YT+ P YSQL++ LQ + +E +C+R
Sbjct: 413 --LDGENGRVGKVLFDTGSSYTYFPNQAYSQLVTSLQEVSDLELTRDDSDE--ALPICWR 468
Query: 290 VPC--PNNTFTD--DLFPSITFHFLNNVSLV----LPQGNHFYAMSAPSNSSAVKCLLFQ 341
P ++ +D F IT + ++ L Q + +S N CL
Sbjct: 469 AKTNSPISSLSDVKKFFRPITLQIGSKWLIISKKLLIQPEDYLIISNKGNV----CLGIL 524
Query: 342 SMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
+ G + + G + +VYD K+RIG+ DC
Sbjct: 525 DGSNVHDGSTIIIGDISMRGRLIVYDNVKQRIGWMKSDCV 564
>gi|226491934|ref|NP_001140743.1| uncharacterized protein LOC100272818 [Zea mays]
gi|194700872|gb|ACF84520.1| unknown [Zea mays]
Length = 351
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 105/387 (27%), Positives = 151/387 (39%), Gaps = 69/387 (17%)
Query: 1 VIQ-VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLN 59
VIQ V +D+ SD+ WV C + C + ++ S + PSRS +S+ +C+S C
Sbjct: 27 VIQTVVLDSASDVPWVQC--VPCPIPPC----HPQVDSFYDPSRSPTSAAFSCSSPTCTA 80
Query: 60 IHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPG 119
+ N GC+ + C + Y +G +G D L + +
Sbjct: 81 LGPYAN--------GCA-----NNQC-----QYLVRYPDGSSTSGAYIADLLTLDAGN-- 120
Query: 120 IIREIPKFCFGCV----GSTYREPIGIAGFGRGALSVPSQLG-FLQKGFSHCFLAFKYAN 174
+ F FGC GS GI G G S+ SQ FS+C A A+
Sbjct: 121 ---AVSGFKFGCSHAEQGSFDARAAGIMALGGGPESLLSQTASRYGNAFSYCIPA--TAS 175
Query: 175 DPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREF 234
D +G V + TPM++ +Y + L IT+G L P
Sbjct: 176 DSGF---FTLG-VPRRASSRYVVTPMVRFRQAATFYGVLLRTITVGGQRLGVAPAVF--- 228
Query: 235 DSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPN 294
G ++DS T T LP Y L + +S++T Y + + D CY
Sbjct: 229 ----AAGSVLDSRTAITRLPPTAYQALRAAFRSSMTMY---RSAPPKGYLDTCYDF---- 277
Query: 295 NTFTDDLFPSITFHFLNNVSLVL-PQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGV 353
+ P I+ F N L L P G F CL F S D D P GV
Sbjct: 278 TGVVNIRLPKISLVFDRNAVLPLDPSGILFN-----------DCLAFTSNAD-DRMP-GV 324
Query: 354 FGSFQQQNVEVVYDLEKERIGFQPMDC 380
GS QQQ +EV+YD+ +GF+ C
Sbjct: 325 LGSVQQQTIEVLYDVGGGAVGFRQGAC 351
>gi|357460823|ref|XP_003600693.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355489741|gb|AES70944.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 431
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 95/393 (24%), Positives = 150/393 (38%), Gaps = 70/393 (17%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DTGS L+W+ C ++ ++F PS SS+ S C C
Sbjct: 90 MVLDTGSQLSWIQC------------HKKQPPTASFDPSLSSTFSILPCTHPLC------ 131
Query: 64 DNPFDPCTMSGCSLSTLLKSTC--CRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
P P L ++C R C ++Y Y +G G L R+ S
Sbjct: 132 -KPRIP--------DFTLPTSCDQNRLC-HYSYFYADGTYAEGNLVREKFTFSRSV---- 177
Query: 122 REIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFL------------A 169
P GC + +P GI G G LS Q + FS+C +
Sbjct: 178 -STPPLILGCATES-TDPRGILGMNLGRLSFAKQSKITK--FSYCVPPRQTRPGFTPTGS 233
Query: 170 FKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPL 229
F N+P+ +G + S + F P+ Y I + I I L P
Sbjct: 234 FYLGNNPSSKGFKYVGMMTSSRQRMPNFDPLA--------YTIPMVGIRIAGKKLNISPA 285
Query: 230 SLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGF-DLCY 288
R D+ G+G ++DSG+ +T+L Y ++ + + + PR K+ G D+C+
Sbjct: 286 VFRA-DAGGSGQTMIDSGSEFTYLVSEAYDKVRAQVVRAVG--PRLKKGYVYGGVADMCF 342
Query: 289 RVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDY 348
L + F F V +V+P+ + V C+ S D
Sbjct: 343 D--SVKAVEIGRLIGEMVFEFERGVEVVIPKERVLADVGG-----GVHCVGIGSSDKLG- 394
Query: 349 GPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
S + G+F QQN+ V +DL + R+GF DC+
Sbjct: 395 AASNIIGNFHQQNLWVEFDLVRRRVGFGKADCS 427
>gi|77553049|gb|ABA95845.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
Length = 372
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 96/383 (25%), Positives = 154/383 (40%), Gaps = 56/383 (14%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V +DTGS L+WV C N C D + K F+P SS+ S+ C++ C +H
Sbjct: 40 VTIDTGSTLSWVQCKNCQIKCYD----QAAKAGQIFNPYNSSTYSKVGCSTEACNGMH-- 93
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
D GC TC ++ YG G G L +D L + + R
Sbjct: 94 ---MDLAVEYGCVEE---DDTCI-----YSLRYGSGEYSVGYLGKDRLTLASN-----RS 137
Query: 124 IPKFCFGCVGSTYREPI--GIAGFGRGALSVPSQLGFLQK----GFSHCFLAFKYANDPN 177
I F FGC + GI GFG + S +Q+ Q+ FS+CF D
Sbjct: 138 IDNFIFGCGEDNLYNGVNAGIIGFGTKSYSFFNQV--CQQTDYTAFSYCF-----PRDHE 190
Query: 178 ISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
L IG A NL +T ++ P Y L+ + G + + + +
Sbjct: 191 NEGSLTIGPYA--RDINLMWTKLIYYDHKPAYAIQQLDMMVNGIRLEIDPYIYISKM--- 245
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTF 297
+VDSGT T++ P + L + + + +ER +C+ + +
Sbjct: 246 ----TIVDSGTADTYILSPVFDALDKAMTKEMQAKGYTRGWDERR---ICFISNSGSANW 298
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSF 357
D FP++ + + +L LP N FY +S+ V C F D G G + G+
Sbjct: 299 ND--FPTVEMKLIRS-TLKLPVENAFY-----ESSNNVICSTFLPDDAGVRGVQ-MLGNR 349
Query: 358 QQQNVEVVYDLEKERIGFQPMDC 380
++ ++V+D++ GF+ C
Sbjct: 350 AVRSFKLVFDIQAMNFGFKARAC 372
>gi|218186446|gb|EEC68873.1| hypothetical protein OsI_37494 [Oryza sativa Indica Group]
Length = 353
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 96/383 (25%), Positives = 154/383 (40%), Gaps = 56/383 (14%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V +DTGS L+WV C N C D + K F+P SS+ S+ C++ C +H
Sbjct: 21 VTIDTGSTLSWVQCKNCQIKCYD----QAAKAGQIFNPYNSSTYSKVGCSTEACNGMH-- 74
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
D GC TC ++ YG G G L +D L + + R
Sbjct: 75 ---MDLAVEYGCVEE---DDTCI-----YSLRYGSGEYSVGYLGKDRLTLASN-----RS 118
Query: 124 IPKFCFGCVGSTYREPI--GIAGFGRGALSVPSQLGFLQK----GFSHCFLAFKYANDPN 177
I F FGC + GI GFG + S +Q+ Q+ FS+CF D
Sbjct: 119 IDNFIFGCGEDNLYNGVNAGIIGFGTKSYSFFNQV--CQQTDYTAFSYCF-----PRDHE 171
Query: 178 ISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
L IG A NL +T ++ P Y L+ + G + + + +
Sbjct: 172 NEGSLTIGPYA--RDINLMWTKLIYYDHKPAYAIQQLDMMVNGIRLEIDPYIYISKM--- 226
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTF 297
+VDSGT T++ P + L + + + +ER +C+ + +
Sbjct: 227 ----TIVDSGTADTYILSPVFDALDKAMTKEMQAKGYTRGWDERR---ICFISNSGSANW 279
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSF 357
D FP++ + + +L LP N FY +S+ V C F D G G + G+
Sbjct: 280 ND--FPTVEMKLIRS-TLKLPVENAFY-----ESSNNVICSTFLPDDAGVRGVQ-MLGNR 330
Query: 358 QQQNVEVVYDLEKERIGFQPMDC 380
++ ++V+D++ GF+ C
Sbjct: 331 AVRSFKLVFDIQAMNFGFKARAC 353
>gi|359476191|ref|XP_003631801.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 439
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 88/311 (28%), Positives = 135/311 (43%), Gaps = 39/311 (12%)
Query: 84 TCCRPCP---SFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIPKFCFG----CVGSTY 136
T C+ C ++ TYG+ G DT+ + S KF FG G
Sbjct: 154 TQCKACTVENNYNMTYGDDSTSVGNYGCDTMTLEPSD-----VFQKFQFGRGRNNKGDFG 208
Query: 137 REPIGIAGFGRGALSVPSQLGF-LQKGFSHCFLAFKYANDPNISSPLVIGDVAISSKDNL 195
G+ G G+G LS SQ K FS+C + +I S L+ G+ A S +L
Sbjct: 209 SGVDGMLGLGQGQLSTVSQTASKFNKVFSYCL-----PEEDSIGS-LLFGEKATSQSSSL 262
Query: 196 QFTPMLKSP---MYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGGLLVDSGTTYTH 252
+FT ++ P YY++ L I++GN L +P S+ F S G ++DS T T
Sbjct: 263 KFTSLVNGPGTLQESGYYFVNLSDISVGNERL-NIPSSV--FASPGT---IIDSRTVITR 316
Query: 253 LPEPFYSQLLSILQSTITYYPRAKEVEERTG-FDLCYRVPCPNNTFTDDLFPSITFHFLN 311
LP+ YS L + + + YP + ++ D CY + + D L P I HF
Sbjct: 317 LPQRAYSALKAAFKKAMAKYPLSNGRRKKGDILDTCYNL----SGRKDVLLPEIVLHFGG 372
Query: 312 NVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPS-GVFGSFQQQNVEVVYDLEK 370
+ L N + S+ S + CL F P + G+ QQ ++ V+YD++
Sbjct: 373 GADVRLNGTNIVWG----SDESRL-CLAFAGNSKSTMNPELTIIGNRQQLSLTVLYDIQG 427
Query: 371 ERIGFQPMDCA 381
RIGF+ C+
Sbjct: 428 GRIGFRSNGCS 438
>gi|6850312|gb|AAF29389.1|AC009999_9 Contains similarity to nucellin from Hordeum vulgare gb|U87148.
ESTs gb|T22068, gb|F14251, gb|F14237, gb|F14242 come
from this gene [Arabidopsis thaliana]
Length = 388
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 90/330 (27%), Positives = 143/330 (43%), Gaps = 69/330 (20%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKL---MSNFSPSRSSSSSRDTCASSFCLNI 60
V +DTGSD+ WV C C C R + L ++ ++ S S +C FC I
Sbjct: 95 VQVDTGSDIMWVNC----IQCKQCP--RRSTLGIELTLYNIDESDSGKLVSCDDDFCYQI 148
Query: 61 HSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
S P +SGC + CP + YG+G G +D ++ + +
Sbjct: 149 --SGGP-----LSGCKANM--------SCP-YLEIYGDGSSTAGYFVKDVVQYDSVAGDL 192
Query: 121 IREIPK--FCFGC-------VGSTYREPI-GIAGFGRGALSVPSQL---GFLQKGFSHCF 167
+ FGC + S+ E + GI GFG+ S+ SQL G ++K F+HC
Sbjct: 193 KTQTANGSVIFGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCL 252
Query: 168 LAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPN--YYYIGLEAITIGNSSLT 225
+ N IG V + K N+ +P+ PN +Y + + A+ +G LT
Sbjct: 253 ------DGRNGGGIFAIGRV-VQPKVNM-------TPLVPNQPHYNVNMTAVQVGQEFLT 298
Query: 226 EVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFD 285
+P L F G ++DSGTT +LPE Y L+ + + + K+ +
Sbjct: 299 -IPADL--FQPGDRKGAIIDSGTTLAYLPEIIYEPLVK-KEPALKVHIVDKDYK------ 348
Query: 286 LCYRVPCPNNTFTDDLFPSITFHFLNNVSL 315
C++ + D+ FP++TFHF N+V L
Sbjct: 349 -CFQY----SGRVDEGFPNVTFHFENSVFL 373
>gi|356502456|ref|XP_003520035.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 484
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 96/382 (25%), Positives = 156/382 (40%), Gaps = 66/382 (17%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V +DTGSD++W+ C C +C Y+ + + F P S+S S C + C ++
Sbjct: 164 VVLDTGSDVSWIQCA----PCSEC--YQQSDPI--FDPVSSNSYSPIRCDAPQCKSL--- 212
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
LS TC + +YG+G G +T+ + ++
Sbjct: 213 ------------DLSECRNGTCL-----YEVSYGDGSYTVGEFATETVTLGTAA------ 249
Query: 124 IPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFK--YANDPNI 178
+ GC + + G+ G G G LS P+Q+ FS+C + +
Sbjct: 250 VENVAIGCGHNNEGLFVGAAGLLGLGGGKLSFPAQVN--ATSFSYCLVNRDSDAVSTLEF 307
Query: 179 SSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQG 238
+SPL N+ P+ ++P +YY+GL+ I++G +L +P S+ E D+ G
Sbjct: 308 NSPL---------PRNVVTAPLRRNPELDTFYYLGLKGISVGGEAL-PIPESIFEVDAIG 357
Query: 239 NGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFT 298
GG+++DSGT T L Y L P+A V + FD CY + +
Sbjct: 358 GGGIIIDSGTAVTRLRSEVYDALRDAFVKGAKGIPKANGV---SLFDTCYDLSSRESV-- 412
Query: 299 DDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQ 358
P+++FHF L LP N+ P +S C F + G+ Q
Sbjct: 413 --QVPTVSFHFPEGRELPLPARNYLI----PVDSVGTFCFAFAPTTSS----LSIMGNVQ 462
Query: 359 QQNVEVVYDLEKERIGFQPMDC 380
QQ V +D+ +GF C
Sbjct: 463 QQGTRVGFDIANSLVGFSADSC 484
>gi|88174589|gb|ABD39369.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 84/348 (24%), Positives = 140/348 (40%), Gaps = 63/348 (18%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCL----N 59
V +DTGS +WV C +CD N F SRS++ ++ +C +S CL +
Sbjct: 16 VEIDTGSSASWVFC--------ECDGCHTNP--RTFLQSRSTTCAKVSCGTSMCLLGGSD 65
Query: 60 IHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPG 119
H D+ P CP F +Y +G GIL +DTL
Sbjct: 66 PHCQDSENYP------------------DCP-FRVSYQDGSASYGILYQDTLTFSD---- 102
Query: 120 IIREIPKFCFGCV-----GSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYAN 174
+++IP F FGC + + G+ G G G +SV Q GFS+C K
Sbjct: 103 -VQKIPSFTFGCNLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDGFSYCLPLQKSER 161
Query: 175 D--PNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLR 232
+ +G VA ++ ++++T M+ +++ L AI++ L P
Sbjct: 162 GFFSKTTGYFSLGKVA--TRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIF- 218
Query: 233 EFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPC 292
G++ DSG+ +++P+ S L ++ + A+E ER +D+
Sbjct: 219 -----SRKGVVFDSGSELSYIPDRALSVLSQRIRELLLRRGAAEEESERNCYDM------ 267
Query: 293 PNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLF 340
+ + P+I+ HF + L G+H + V CL F
Sbjct: 268 --RSVDEGDMPAISLHFDDGARFDL--GSHGVFVERSVQEQDVWCLAF 311
>gi|88174575|gb|ABD39362.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
Group]
Length = 321
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 84/348 (24%), Positives = 139/348 (39%), Gaps = 63/348 (18%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCL----N 59
V +DTGS +WV C +CD N F SRS++ ++ +C +S CL +
Sbjct: 16 VEIDTGSSTSWVFC--------ECDGCHTNP--RTFLQSRSTTCAKVSCGTSMCLLGGSD 65
Query: 60 IHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPG 119
H D+ P CP F +Y +G GIL +DTL
Sbjct: 66 PHCQDSENYP------------------DCP-FRVSYQDGSASYGILYQDTLTFSD---- 102
Query: 120 IIREIPKFCFGCV-----GSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYAN 174
+++IP F FGC + + G+ G G G +SV Q GFS+C K
Sbjct: 103 -VQKIPSFTFGCNLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDGFSYCLPLQKSER 161
Query: 175 D--PNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLR 232
+ +G VA ++ ++++T M+ +++ L AI++ L P
Sbjct: 162 GFFSKTTGYFSLGKVA--TRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIF- 218
Query: 233 EFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPC 292
G++ DSG+ +++P+ S L ++ + A+E ER +D+
Sbjct: 219 -----SRKGVVFDSGSELSYIPDRALSVLSQRIRELLLRRGAAEEESERNCYDM------ 267
Query: 293 PNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLF 340
+ + P+I+ HF + L G H + V CL F
Sbjct: 268 --RSVDEGDMPAISLHFDDGARFDL--GRHGVFVERSVQEQDVWCLAF 311
>gi|88174577|gb|ABD39363.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
Group]
Length = 321
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 84/348 (24%), Positives = 139/348 (39%), Gaps = 63/348 (18%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCL----N 59
V +DTGS ++WV C +CD N F SRS++ ++ +C +S CL +
Sbjct: 16 VEIDTGSSISWVFC--------ECDGCHTNP--RTFLQSRSTTCAKVSCGTSMCLLGGSD 65
Query: 60 IHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPG 119
H D+ P CP F +Y +G GIL +DTL
Sbjct: 66 PHCQDSENYP------------------DCP-FRVSYQDGSASYGILYQDTLTFSD---- 102
Query: 120 IIREIPKFCFGCV-----GSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYAN 174
+++IP F FGC + + G+ G G G +SV Q GFS+C K
Sbjct: 103 -VQKIPSFTFGCNLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDGFSYCLPLQKSER 161
Query: 175 D--PNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLR 232
+ +G VA ++ ++++T M+ +++ L AI++ L P
Sbjct: 162 GFFSKTTGYFSLGKVA--TRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIF- 218
Query: 233 EFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPC 292
G++ DSG+ +++P+ S L ++ + A+E ER +D+
Sbjct: 219 -----SRKGVVFDSGSELSYIPDRALSVLSQRIRELLLRRGAAEEESERNCYDM------ 267
Query: 293 PNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLF 340
+ + P+I+ HF + L F S V CL F
Sbjct: 268 --RSVDEGDMPAISLHFDDGARFDLGSSGVFVERSVQEQD--VWCLAF 311
>gi|88174587|gb|ABD39368.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
Group]
Length = 321
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 84/348 (24%), Positives = 140/348 (40%), Gaps = 63/348 (18%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCL----N 59
V +DTGS +WV C +CD N F SRS++ ++ +C +S CL +
Sbjct: 16 VEIDTGSSASWVFC--------ECDGCHTNP--RTFLQSRSTTCAKVSCGTSMCLLGGSD 65
Query: 60 IHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPG 119
H D+ P CP F +Y +G GIL +DTL
Sbjct: 66 PHCQDSENYP------------------DCP-FRVSYQDGSASYGILYQDTLTFSD---- 102
Query: 120 IIREIPKFCFGCV-----GSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYAN 174
+++IP F FGC + + G+ G G G +SV Q GFS+C K
Sbjct: 103 -VQKIPSFTFGCNLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDGFSYCLPLQKSER 161
Query: 175 D--PNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLR 232
+ +G VA ++ ++++T M+ +++ L AI++ L P
Sbjct: 162 GFFSKTTGYFSLGKVA--TRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIF- 218
Query: 233 EFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPC 292
G++ DSG+ +++P+ S L ++ + A+E ER +D+
Sbjct: 219 -----SRKGVVFDSGSELSYIPDRALSVLSQRIRELLLRRGAAEEESERNCYDM------ 267
Query: 293 PNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLF 340
+ + P+I+ HF + L G+H + V CL F
Sbjct: 268 --RSVDEGDMPAISLHFDDGARFDL--GSHGVFVERSVQEQDVWCLAF 311
>gi|88174597|gb|ABD39373.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
gi|88174601|gb|ABD39375.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
gi|88174603|gb|ABD39376.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 84/348 (24%), Positives = 140/348 (40%), Gaps = 63/348 (18%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCL----N 59
V +DTGS +WV C +CD N F SRS++ ++ +C +S CL +
Sbjct: 16 VEIDTGSSTSWVFC--------ECDGCHTNP--RTFLQSRSTTCAKVSCGTSMCLLGGSD 65
Query: 60 IHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPG 119
H D+ P CP F +Y +G GIL +DTL
Sbjct: 66 PHCQDSENYP------------------DCP-FRVSYQDGSASYGILYQDTLTFSD---- 102
Query: 120 IIREIPKFCFGCV-----GSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYAN 174
+++IP F FGC + + G+ G G G +SV Q GFS+C K
Sbjct: 103 -VQKIPSFTFGCNLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDGFSYCLPLQKSER 161
Query: 175 D--PNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLR 232
+ +G VA ++ ++++T M+ +++ L AI++ L P
Sbjct: 162 GFFSKTTGYFSLGKVA--TRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIF- 218
Query: 233 EFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPC 292
G++ DSG+ +++P+ S L ++ + A+E ER +D+
Sbjct: 219 -----SRKGVVFDSGSELSYIPDRALSVLSQRIRELLLRRGAAEEESERNCYDM------ 267
Query: 293 PNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLF 340
+ + P+I+ HF + L G+H + V CL F
Sbjct: 268 --RSVDEGDMPAISLHFDDGARFDL--GSHGVFVERSVQEQDVWCLAF 311
>gi|88174605|gb|ABD39377.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 84/348 (24%), Positives = 140/348 (40%), Gaps = 63/348 (18%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCL----N 59
V +DTGS +WV C +CD N F SRS++ ++ +C +S CL +
Sbjct: 16 VEIDTGSSTSWVFC--------ECDGCHTNP--RTFLQSRSTTCAKVSCGTSMCLLGGSD 65
Query: 60 IHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPG 119
H D+ P CP F +Y +G GIL +DTL
Sbjct: 66 PHCQDSENYP------------------DCP-FRVSYQDGSASYGILYQDTLTFSD---- 102
Query: 120 IIREIPKFCFGCV-----GSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYAN 174
+++IP F FGC + + G+ G G G +SV Q GFS+C K
Sbjct: 103 -VQKIPSFTFGCNLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPRFDGFSYCLPLQKSER 161
Query: 175 D--PNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLR 232
+ +G VA ++ ++++T M+ +++ L AI++ L P
Sbjct: 162 GFFSKTTGYFSLGKVA--TRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIF- 218
Query: 233 EFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPC 292
G++ DSG+ +++P+ S L ++ + A+E ER +D+
Sbjct: 219 -----SRKGVVFDSGSELSYIPDRALSVLSQRIRELLLRRGAAEEESERNCYDM------ 267
Query: 293 PNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLF 340
+ + P+I+ HF + L G+H + V CL F
Sbjct: 268 --RSVDEGDMPAISLHFDDGARFDL--GSHGVFVERSVQEQDVWCLAF 311
>gi|326506682|dbj|BAJ91382.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326525815|dbj|BAJ88954.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 94/380 (24%), Positives = 152/380 (40%), Gaps = 66/380 (17%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSD- 64
+DTGS LTW+ C C +R + + F P SSS + +C++ C ++ ++
Sbjct: 154 VDTGSSLTWLQCSPCRVSC-----HRQSGPV--FDPKTSSSYAAVSCSTPQCNDLSTATL 206
Query: 65 NPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREI 124
NP + CS S + C + +YG+ G L++DT+ +S +
Sbjct: 207 NP------AACSSSDV----CI-----YQASYGDSSFSVGYLSKDTVSFGSNS------V 245
Query: 125 PKFCFGCVGST---YREPIGIAGFGRGALSVPSQLG-FLQKGFSHCFLAFKYANDPNISS 180
P F +GC + G+ G R LS+ QL L FS+C SS
Sbjct: 246 PNFYYGCGQDNEGLFGRSAGLMGLARNKLSLLYQLAPTLGYSFSYCL---------PSSS 296
Query: 181 PLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNG 240
+ + +TPM+ S + + Y+I L +T+ L +S E+ S
Sbjct: 297 SSGYLSIGSYNPGQYSYTPMVSSTLDDSLYFIKLSGMTVAGKPLA---VSSSEYSSLPT- 352
Query: 241 GLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDD 300
++DSGT T LP Y L + + RA + + D C+ +
Sbjct: 353 --IIDSGTVITRLPTTVYDALSKAVAGAMKGTKRA---DAYSILDTCFV-----GQASSL 402
Query: 301 LFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQ 360
P+++ F +L L N + S+ CL F + + G+ QQQ
Sbjct: 403 RVPAVSMAFSGGAALKLSAQNLLVDVD-----SSTTCLAFAPARS-----AAIIGNTQQQ 452
Query: 361 NVEVVYDLEKERIGFQPMDC 380
VVYD++ RIGF C
Sbjct: 453 TFSVVYDVKSNRIGFAAGGC 472
>gi|88174569|gb|ABD39359.1| chloroplast nucleoid DNA-binding protein [Oryza nivara]
Length = 321
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 85/348 (24%), Positives = 138/348 (39%), Gaps = 63/348 (18%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCL----N 59
V +DTGS TWV C +CD N F SRS++ ++ +C +S CL +
Sbjct: 16 VEIDTGSSTTWVFC--------ECDGCHTNP--RTFLQSRSTTCAKVSCGTSMCLLGGSD 65
Query: 60 IHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPG 119
H D+ P CP F +Y +G GIL +DTL
Sbjct: 66 PHCQDSENYP------------------DCP-FRVSYQDGSASYGILYQDTLTFSD---- 102
Query: 120 IIREIPKFCFGCV-----GSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYAN 174
+++IP F FGC + + G+ G G G +SV Q GFS+C K
Sbjct: 103 -VQKIPSFTFGCNLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDGFSYCLPLQKSER 161
Query: 175 D--PNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLR 232
+ +G VA ++ ++++T M+ +++ L AI++ L P
Sbjct: 162 GFFSKTTGYFSLGKVA--TRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIF- 218
Query: 233 EFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPC 292
G++ DSG+ +++P+ S L ++ + A+E ER +D+
Sbjct: 219 -----SRKGVVFDSGSELSYIPDRALSVLSQRIRELLLRRGAAEEESERNCYDM------ 267
Query: 293 PNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLF 340
+ + P+I+ HF + L F S V CL F
Sbjct: 268 --RSVDEGDMPAISLHFDDGARFDLGSRGVFVERSVQEQD--VWCLAF 311
>gi|88174583|gb|ABD39366.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
Group]
Length = 321
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 84/348 (24%), Positives = 140/348 (40%), Gaps = 63/348 (18%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCL----N 59
V +DTGS +WV C +CD N F SRS++ ++ +C +S CL +
Sbjct: 16 VEIDTGSSTSWVFC--------ECDGCHTNP--RTFLQSRSTTCAKVSCGTSMCLLGGSD 65
Query: 60 IHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPG 119
H D+ P CP F +Y +G GIL +DTL
Sbjct: 66 PHCQDSENYP------------------DCP-FRVSYQDGSASYGILYQDTLTFSD---- 102
Query: 120 IIREIPKFCFGCV-----GSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYAN 174
+++IP F FGC + + G+ G G G +SV Q GFS+C K
Sbjct: 103 -VQKIPSFTFGCNLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPRFDGFSYCLPLQKSER 161
Query: 175 D--PNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLR 232
+ +G VA ++ ++++T M+ +++ L AI++ L P
Sbjct: 162 GFFSKTTGYFSLGKVA--TRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIF- 218
Query: 233 EFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPC 292
G++ DSG+ +++P+ S L ++ + A+E ER +D+
Sbjct: 219 -----SRKGVVFDSGSELSYIPDRALSVLSQRIRELLLRRGAAEEESERNCYDM------ 267
Query: 293 PNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLF 340
+ + P+I+ HF + L G+H + V CL F
Sbjct: 268 --RSVDEGDMPAISLHFDDGARFDL--GSHGVFVERSVQEQDVWCLAF 311
>gi|194707292|gb|ACF87730.1| unknown [Zea mays]
Length = 216
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 64/236 (27%), Positives = 98/236 (41%), Gaps = 26/236 (11%)
Query: 150 LSVPSQLGFLQKG-FSHCFLAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPN 208
+S+ SQ G G FS+C +++ S L +G A N+++TP+L +P P+
Sbjct: 1 MSLLSQTGSRYNGVFSYCLPSYR---SYYFSGSLRLG--AAGQPRNVRYTPLLTNPHRPS 55
Query: 209 YYYIGLEAITIGNSSLTEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQST 268
YY+ + +++G + +VP FD G ++DSGT T P Y+ L +
Sbjct: 56 LYYVNVTGLSVGR-TWVKVPAGSFAFDPATGAGTVIDSGTVITRWTAPVYAALREEFRRQ 114
Query: 269 ITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDL----FPSITFHFLNNVSLVLPQGNHFY 324
+ FD C+ TD++ P +T H V L LP N
Sbjct: 115 VA---APSGYTSLGAFDTCFN--------TDEVAAGGAPPVTLHMDGGVDLTLPMENTLI 163
Query: 325 AMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
SA + + CL V + QQQNV VV D+ R+GF C
Sbjct: 164 HSSA----TPLACLAMAEAPQNVNAVVNVVANLQQQNVRVVVDVAGSRVGFAREPC 215
>gi|224065128|ref|XP_002301682.1| predicted protein [Populus trichocarpa]
gi|222843408|gb|EEE80955.1| predicted protein [Populus trichocarpa]
Length = 441
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 99/391 (25%), Positives = 158/391 (40%), Gaps = 54/391 (13%)
Query: 3 QVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHS 62
Q+ +DTGS L+W+ C R S F PS SSS S C C
Sbjct: 96 QMILDTGSQLSWIQCHK--------KVPRKPPPSSVFDPSLSSSFSVLPCNHPLC----- 142
Query: 63 SDNPFDPCTMSGCSLSTLLKSTC--CRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGI 120
P P L ++C R C ++Y Y +G L G L R+ + S
Sbjct: 143 --KPRIP--------DFTLPTSCDQNRLC-HYSYFYADGTLAEGNLVREKITFSRS---- 187
Query: 121 IREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISS 180
+ P GC + + GI G G LS SQ + FS+C + +
Sbjct: 188 -QSTPPLILGCAEES-SDAKGILGMNLGRLSFASQAKLTK--FSYCVPTRQVRPGFTPTG 243
Query: 181 PLVIGDVAISSK----DNLQFTPMLKSP-MYPNYYYIGLEAITIGNSSLTEVPLSLREFD 235
+G+ S + L F+ + P + P Y + ++ I IGN L +P+S D
Sbjct: 244 SFYLGENPNSGGFRYINLLTFSQSQRMPNLDPLAYTVAMQGIRIGNQKLN-IPISAFRPD 302
Query: 236 SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGF-DLCYRVPCPN 294
G G ++DSG+ +T+L + Y+++ + + R K+ G D+C+ N
Sbjct: 303 PSGAGQTMIDSGSEFTYLVDEAYNKVREEVVRLVG--ARLKKGYVYGGVSDMCFN---GN 357
Query: 295 NTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCL-LFQSMDDGDYGPSGV 353
L ++ F F V +V+ + + V C+ + +S G S +
Sbjct: 358 AIEIGRLIGNMVFEFDKGVEIVVEKERVLADVGG-----GVHCVGIGRSEMLG--AASNI 410
Query: 354 FGSFQQQNVEVVYDLEKERIGFQPMDCASTA 384
G+F QQN+ V +DL R+GF DC+ +
Sbjct: 411 IGNFHQQNIWVEFDLANRRVGFGKADCSRSV 441
>gi|88174558|gb|ABD39354.1| chloroplast nucleoid DNA-binding protein [Oryza meridionalis]
Length = 321
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 85/349 (24%), Positives = 142/349 (40%), Gaps = 65/349 (18%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCL----N 59
V +DTGS +WV C +CD N F SRS++ ++ +C +S CL +
Sbjct: 16 VEIDTGSSTSWVFC--------ECDGCHTNP--RTFLQSRSTTCAKVSCGTSMCLLGGSD 65
Query: 60 IHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPG 119
H D+ P CP F +Y +G GIL +DTL
Sbjct: 66 PHCQDSENYP------------------DCP-FRVSYQDGSASYGILYQDTLTFSD---- 102
Query: 120 IIREIPKFCFGCV-----GSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYAN 174
+++IP F FGC + + G+ G G G +SV Q GFS+C L + +
Sbjct: 103 -VQKIPGFTFGCNLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDGFSYC-LPLQMSE 160
Query: 175 DPNISSP---LVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSL 231
S +G VA ++ ++++T M+ +++ L AI++ L P
Sbjct: 161 RGFFSKTTGYFSLGKVA--TRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVF 218
Query: 232 REFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVP 291
G++ DSG+ +++P+ S L ++ + A+E ER +D+
Sbjct: 219 ------SRKGVVFDSGSELSYIPDRALSVLRQRIRELLLKRGAAEEESERNCYDM----- 267
Query: 292 CPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLF 340
+ + P+I+ HF + L G+H + V CL F
Sbjct: 268 ---RSVDEGDMPAISLHFDDGARFDL--GSHGVFVERSVQEQDVWCLAF 311
>gi|449440931|ref|XP_004138237.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 523
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 106/379 (27%), Positives = 151/379 (39%), Gaps = 62/379 (16%)
Query: 7 DTGSDLTWVPCGNLSFDCMDCDDYRN--NKLMSNFSPSRSSSSSRDTCASSFCLNIHSSD 64
DTGSD++W+ C CD ++ F P SSS S +C S C H D
Sbjct: 202 DTGSDVSWL-------QCQPCDGENGCYKQIGPIFDPKSSSSYSPLSCDSEQC---HLLD 251
Query: 65 NPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREI 124
C + C + YG+G G L +T S+ I
Sbjct: 252 EA--ACDANSCI---------------YEVEYGDGSFTVGELATETFSFRHSN-----SI 289
Query: 125 PKFCFGCVGST---YREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSP 181
P GC + G+ G G GA+S+ SQL FS+C + D SS
Sbjct: 290 PNLPIGCGHDNEGLFVGADGLIGLGGGAISLSSQLE--ATSFSYCLVDL----DSESSST 343
Query: 182 LVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGG 241
L A D+L +P++K+ +P + Y+ + +++G L + S E D G+GG
Sbjct: 344 LDFN--ADQPSDSLT-SPLVKNDRFPTFRYVKVIGMSVGGKPL-PISSSSFEIDESGSGG 399
Query: 242 LLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDL 301
++VDSGTT T +P Y L P A V + FD CY + +N
Sbjct: 400 IIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGV---SPFDTCYDLSSQSNVEV--- 453
Query: 302 FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQN 361
P+I F SL LP N + +S+ CL F P + G+ QQQ
Sbjct: 454 -PTIAFILPGENSLQLPAKNCLIQV----DSAGTFCLAFLP----STFPLSIIGNVQQQG 504
Query: 362 VEVVYDLEKERIGFQPMDC 380
+ V YDL +GF C
Sbjct: 505 IRVSYDLANSLVGFSTDKC 523
>gi|42565828|ref|NP_190704.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332645262|gb|AEE78783.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 488
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 106/406 (26%), Positives = 166/406 (40%), Gaps = 87/406 (21%)
Query: 4 VYMDTGSDLTWVPCGNLSFDC---MDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNI 60
V +DTGSDL W+PC N + C M+ D KL + ++PS+S SSS+ TC S+ C
Sbjct: 104 VALDTGSDLFWLPC-NCNSTCVRSMETDQGERIKL-NIYNPSKSKSSSKVTCNSTLC--- 158
Query: 61 HSSDNPFDPCTMSGCSLSTLLKSTCCRP---CPSFAYTYGEGGLVTGILTRDTLKVHGSS 117
L++ C P CP G TG+L D + + +
Sbjct: 159 -------------------ALRNRCISPVSDCPYRIRYLSPGSKSTGVLVEDVIHM-STE 198
Query: 118 PGIIREIPKFCFGCVGST---YREPI--GIAGFGRGALSVPSQL---GFLQKGFSHCFLA 169
G R+ + FGC S ++E GI G ++VP+ L G FS CF
Sbjct: 199 EGEARD-ARITFGCSESQLGLFKEVAVNGIMGLAIADIAVPNMLVKAGVASDSFSMCF-- 255
Query: 170 FKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPL 229
PN + GD S D L+ TP L + P +Y + + +G ++
Sbjct: 256 -----GPNGKGTISFGDKG--SSDQLE-TP-LSGTISPMFYDVSITKFKVGKVTVDT--- 303
Query: 230 SLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYR 289
EF + DSGT T L EP+Y+ L + ++ +K V+ + F+ CY
Sbjct: 304 ---EFTAT------FDSGTAVTWLIEPYYTALTTNFHLSVPDRRLSKSVD--SPFEFCYI 352
Query: 290 VPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAP-------SNSSAVKCLLFQS 342
+ +T +D PS++F +G Y + +P S V CL
Sbjct: 353 I---TSTSDEDKLPSVSFEM---------KGGAAYDVFSPILVFDTSDGSFQVYCLAVLK 400
Query: 343 MDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASAQG 388
+ D+ + G N +V+D E+ +G++ +C T G
Sbjct: 401 QVNADF---SIIGQNFMTNYRIVHDRERRILGWKKSNCNDTNGFTG 443
>gi|297829808|ref|XP_002882786.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
lyrata]
gi|297328626|gb|EFH59045.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
lyrata]
Length = 449
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 98/389 (25%), Positives = 156/389 (40%), Gaps = 53/389 (13%)
Query: 3 QVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFC----L 58
+V +DTGS+LTWV C + +N ++ F S S C + C +
Sbjct: 102 RVVVDTGSELTWVNC---RYRGRGKGKVKNRRV---FRAEESKSFKTVGCFTQTCKVDLM 155
Query: 59 NIHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSP 118
N+ S P T PC S+ Y Y +G G+ ++T+ V G +
Sbjct: 156 NLFSLSTCPTPST----------------PC-SYDYRYADGSAAQGVFAKETITV-GLTN 197
Query: 119 GIIREIPKFCFGCVGSTYREPI----GIAGFGRGALSVPS-QLGFLQKGFSHCFLAFKYA 173
G + GC S + G+ G S S S+C + +
Sbjct: 198 GRKARLRGLLVGCSSSFSGQSFQGADGVLGLAFSDFSFTSTATSLFGAKLSYCLV--DHL 255
Query: 174 NDPNISSPLVIG--DVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSL 231
++ NIS+ L+ G + S+K T L + P +Y I + I+IG+ L ++P +
Sbjct: 256 SNKNISNYLIFGYSSSSTSTKTAPGRTTPLDLTLIPPFYAINIIGISIGDDML-DIPTQV 314
Query: 232 REFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVP 291
+D+ GG ++DSGT+ T L E Y +++ L + R K E + C+
Sbjct: 315 --WDATTGGGTILDSGTSLTLLAEAAYKPVVTGLARYLVELKRVK--PEGIPIEYCF--- 367
Query: 292 CPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPS 351
+ F + P +TFH L + P + +AP VKCL F S +
Sbjct: 368 SSTSGFNESKLPQLTFH-LKGGARFEPHRKSYLVDAAP----GVKCLGFMS---AGTPAT 419
Query: 352 GVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
V G+ QQN +DL + F P C
Sbjct: 420 NVVGNIMQQNYLWEFDLMASTLSFAPSTC 448
>gi|88174567|gb|ABD39358.1| chloroplast nucleoid DNA-binding protein [Oryza glumipatula]
Length = 323
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 83/349 (23%), Positives = 141/349 (40%), Gaps = 63/349 (18%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCL----N 59
V +DTGS +WV C +CD N F SRS++ ++ +C +S CL +
Sbjct: 16 VEIDTGSSTSWVFC--------ECDGCHTNP--RTFLQSRSTTCAKVSCGTSMCLLGGSD 65
Query: 60 IHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPG 119
H D+ P CP F +Y +G GIL +DTL
Sbjct: 66 PHCQDSENYP------------------DCP-FRVSYQDGSASYGILYQDTLTFSD---- 102
Query: 120 IIREIPKFCFGCV-----GSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYAN 174
+++IP F FGC + + G+ G G G +SV Q GFS+C L + +
Sbjct: 103 -VQKIPGFTFGCNMDSFGANEFGNVDGLLGMGAGQMSVLKQSSPTFDGFSYC-LPLQMSE 160
Query: 175 DPNISSP---LVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSL 231
S +G +++ ++++T M+ +++ L AI++ L P
Sbjct: 161 RGFFSKTTGYFSLGGKIAATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSIF 220
Query: 232 REFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVP 291
G++ DSG+ +++P+ S L ++ + A+E ER +D+
Sbjct: 221 ------SRKGVVFDSGSELSYIPDRALSVLSQRIRELLLRRGAAEEESERNCYDM----- 269
Query: 292 CPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLF 340
+ + P+I+ HF + L G+H + V CL F
Sbjct: 270 ---RSVDEGDMPAISLHFDDGARFDL--GSHGVFVERSVQEQDVWCLAF 313
>gi|88174565|gb|ABD39357.1| chloroplast nucleoid DNA-binding protein [Oryza glumipatula]
Length = 323
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 83/349 (23%), Positives = 140/349 (40%), Gaps = 63/349 (18%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCL----N 59
V +DTGS +WV C +CD N F SRS++ ++ +C +S CL +
Sbjct: 16 VEIDTGSSTSWVFC--------ECDGCHTNP--RTFLQSRSTTCAKVSCGTSMCLLGGSD 65
Query: 60 IHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPG 119
H D+ P CP F +Y +G GIL +DTL
Sbjct: 66 PHCQDSENYP------------------DCP-FRVSYQDGSASYGILYQDTLTFSD---- 102
Query: 120 IIREIPKFCFGCV-----GSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYAN 174
+++IP F FGC + + G+ G G G +SV Q GFS+C L + +
Sbjct: 103 -VQKIPGFTFGCNMDSFGANEFGNVDGLLGMGAGQMSVLKQSSPTFDGFSYC-LPLQMSE 160
Query: 175 DPNISSP---LVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSL 231
S +G +++ ++++T M+ +++ L AI++ L P
Sbjct: 161 RGFFSKTTGYFSLGGKIAATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSIF 220
Query: 232 REFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVP 291
G++ DSG+ +++P+ S L ++ + A+E ER +D+
Sbjct: 221 ------SRKGVVFDSGSELSYIPDRALSVLSQRIRELLLRRGAAEEESERNCYDM----- 269
Query: 292 CPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLF 340
+ + P+I+ HF + L G H + V CL F
Sbjct: 270 ---RSVDEGDMPAISLHFDDGARFDL--GRHGVFVERSVQEQDVWCLAF 313
>gi|195607464|gb|ACG25562.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 478
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 99/380 (26%), Positives = 143/380 (37%), Gaps = 63/380 (16%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGSDL+WV C S C ++ F P++SSS + C C +
Sbjct: 157 VDTGSDLSWVQCKPCS-AAPSCYSQKDPL----FDPAQSSSYAAVPCGGPVCAGLGIYAA 211
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
G + +YG+G TG+ + DTL + SS +
Sbjct: 212 SACSAAQCG-----------------YVVSYGDGSNTTGVYSSDTLTLSASS-----AVQ 249
Query: 126 KFCFGC---VGSTYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNISSP 181
F FGC + G+ G GR S+ Q G FS+C P+ +
Sbjct: 250 GFFFGCGHAQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFSYCL-----PTKPSTAGY 304
Query: 182 LVIGDVAIS-SKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNG 240
L +G S + T +L SP P YY + L I++G L+ VP S G
Sbjct: 305 LTLGLGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLS-VPASAFA------G 357
Query: 241 GLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDD 300
G +VD+GT T LP Y+ L S +S + Y D CY
Sbjct: 358 GTVVDTGTVITRLPPTAYAALRSAFRSGMASYGYPTAPSNGI-LDTCYN----------- 405
Query: 301 LFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQ 360
F L NV+L G+ M + CL F G G + G+ QQ+
Sbjct: 406 -FAGYGTVTLPNVALTF--GSGATVMLGADGILSFGCLAFA--PSGSDGGMAILGNVQQR 460
Query: 361 NVEVVYDLEKERIGFQPMDC 380
+ EV ++ +GF+P C
Sbjct: 461 SFEV--RIDGTSVGFKPSSC 478
>gi|293334661|ref|NP_001168795.1| uncharacterized protein LOC100382594 precursor [Zea mays]
gi|223973065|gb|ACN30720.1| unknown [Zea mays]
Length = 631
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 99/389 (25%), Positives = 157/389 (40%), Gaps = 69/389 (17%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +D+GS +T+VPC + C C ++++ + F P SSS S C N+
Sbjct: 103 LIVDSGSTVTYVPCSS----CEQCGNHQDPR----FQPDLSSSYSPVKC------NVD-- 146
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
CT C K C ++ Y E +G+L D + S
Sbjct: 147 ------CT---CDSD---KKQC-----TYERQYAEMSSSSGVLGEDIVSFGRES----EL 185
Query: 124 IPKFC-FGCVGSTY-----REPIGIAGFGRGALSVPSQL---GFLQKGFSHCFLAFKYAN 174
P+ FGC S + GI G GRG LS+ QL G + FS C+
Sbjct: 186 KPQHAIFGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDIGG 245
Query: 175 DPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREF 234
+V+G + + L+SP YY I L+ I + +L + R F
Sbjct: 246 -----GAMVLGGMLAPPDMIFSNSDPLRSP----YYNIELKEIHVAGKALR---VESRIF 293
Query: 235 DSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPN 294
+S+ G ++DSGTTY +LPE + + S + + + + D+C+ N
Sbjct: 294 NSKH--GTVLDSGTTYAYLPEQAFVAFKEAVTSKVHSLKKIRGPDPSYK-DICFAGAGRN 350
Query: 295 NTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCL-LFQSMDDGDYGPSGV 353
+ ++FP + F N L L N+ + S CL +FQ+ D P+ +
Sbjct: 351 VSKLHEVFPDVDMVFGNGQKLSLTPENYLFRH---SKVDGAYCLGVFQNGKD----PTTL 403
Query: 354 FGSFQQQNVEVVYDLEKERIGFQPMDCAS 382
G +N V YD E+IGF +C+
Sbjct: 404 LGGIIVRNTLVTYDRHNEKIGFWKTNCSE 432
>gi|88174581|gb|ABD39365.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
Group]
Length = 321
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 84/348 (24%), Positives = 139/348 (39%), Gaps = 63/348 (18%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCL----N 59
V +DTGS +WV C +CD N F SRS++ ++ +C +S CL +
Sbjct: 16 VEIDTGSSTSWVFC--------ECDGCHTNP--RTFLQSRSTTCAKVSCGTSMCLLGGSD 65
Query: 60 IHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPG 119
H D+ P CP F +Y +G GIL +DTL
Sbjct: 66 PHCQDSENYP------------------DCP-FRVSYQDGSASYGILYQDTLTFSD---- 102
Query: 120 IIREIPKFCFGCV-----GSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYAN 174
+++IP F FGC + + G+ G G G +SV Q GFS+C K
Sbjct: 103 -VQKIPSFTFGCNLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPRFDGFSYCLPLQKSER 161
Query: 175 D--PNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLR 232
+ +G VA ++ ++++T M+ +++ L AI++ L P
Sbjct: 162 GFFSKTTGYFSLGKVA--TRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIF- 218
Query: 233 EFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPC 292
G++ DSG+ +++P+ S L ++ + A+E ER +D+
Sbjct: 219 -----SRKGVVFDSGSELSYIPDRALSVLSQRIRELLLRRGAAEEESERNCYDM------ 267
Query: 293 PNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLF 340
+ + P+I+ HF + L + F S V CL F
Sbjct: 268 --RSVDEGDMPAISLHFDDGARFDLGRRGVFVERSVQEQD--VWCLAF 311
>gi|297795137|ref|XP_002865453.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297311288|gb|EFH41712.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 665
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 95/389 (24%), Positives = 155/389 (39%), Gaps = 72/389 (18%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
+ +DTGS +T+VPC C C +++ K F P SSS C
Sbjct: 95 LIVDTGSTVTYVPCST----CKQCGKHQDPK----FQPELSSSYKALKC----------- 135
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
NP C G C + Y E +G+L+ D + S +
Sbjct: 136 -NPDCNCDDEG--------KLCV-----YERRYAEMSSSSGVLSEDLISFGNESQLTPQ- 180
Query: 124 IPKFCFGC----VGSTYREPI-GIAGFGRGALSVPSQL---GFLQKGFSHCFLAFKYAND 175
+ FGC G + + GI G GRG LSV QL G ++ FS C+ +
Sbjct: 181 --RAVFGCENVETGDLFSQRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGMEVGG- 237
Query: 176 PNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFD 235
+V+G ++ + + +SP YY I L+ + + SL L+ + F+
Sbjct: 238 ----GAMVLGKISPPAGMVFSHSDPFRSP----YYNIDLKQMHVAGKSLK---LNPKVFN 286
Query: 236 SQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVE--ERTGFDLCYRVPCP 293
G G ++DSGTTY + P+ + ++I + I P K + + D+C+
Sbjct: 287 --GKHGTVLDSGTTYAYFPKEAF---IAIKDAIIKEIPSLKRIHGPDPNYDDVCFSGAGR 341
Query: 294 NNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCL-LFQSMDDGDYGPSG 352
+ + FP I F N L+L N+ + + CL +F D +
Sbjct: 342 DVAEIHNFFPEIDMEFGNGQKLILSPENYLFRH---TKVRGAYCLGIFPDRDS-----TT 393
Query: 353 VFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
+ G +N V YD E +++GF +C+
Sbjct: 394 LLGGIVVRNTLVTYDRENDKLGFLKTNCS 422
>gi|222635172|gb|EEE65304.1| hypothetical protein OsJ_20543 [Oryza sativa Japonica Group]
Length = 274
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 79/302 (26%), Positives = 117/302 (38%), Gaps = 80/302 (26%)
Query: 97 GEGGLVTGILTRDTLKVHGSSPGIIREIPKFCFGC----VGSTYREPIGIAGFGRGALSV 152
G G + IL D+ G + FGC G GIAGFGRG S+
Sbjct: 40 GRGLAMPEILATDSFTFGGDDNAGGLAARRVTFGCGHINKGIFQANETGIAGFGRGRWSL 99
Query: 153 PSQLGFLQKGFSHCFLAFKYANDPNISSPLVIGDVA--------ISSKDNLQFTPMLKSP 204
PSQL FS+CF + D SS + +G A + +++ T ++K+P
Sbjct: 100 PSQLNV--TSFSYCFTSMF---DTKSSSVVTLGAAAAELLHTHHAAHTGDVRTTRLIKNP 154
Query: 205 MYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSI 264
P+ Y++ L I++G + + VP +S+ ++DSG + T LPE Y ++
Sbjct: 155 SQPSLYFVPLRGISVGGARVA-VP------ESRLRSSTIIDSGASITTLPEDVYE---AV 204
Query: 265 LQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFY 324
++ PR V E D RV C +VL
Sbjct: 205 KAEFVSQLPRGNYVFE----DYAARVLC----------------------VVL------- 231
Query: 325 AMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTA 384
D G V G++QQQN VVYDLE + + F P C A
Sbjct: 232 --------------------DAAAGEQVVIGNYQQQNTHVVYDLENDVLSFAPARCDKLA 271
Query: 385 SA 386
++
Sbjct: 272 AS 273
>gi|242082978|ref|XP_002441914.1| hypothetical protein SORBIDRAFT_08g004800 [Sorghum bicolor]
gi|241942607|gb|EES15752.1| hypothetical protein SORBIDRAFT_08g004800 [Sorghum bicolor]
Length = 429
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 97/398 (24%), Positives = 161/398 (40%), Gaps = 86/398 (21%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS-- 63
+DTGSDLTW+ C C C NK+ R + + C C ++H+
Sbjct: 83 VDTGSDLTWLQC---DAPCRSC-----NKVPHPL--YRPTKNKLVPCVDQLCASLHNGLN 132
Query: 64 -----DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSP 118
D+P++ C + Y + G TG+L D+ + ++
Sbjct: 133 RKHKCDSPYEQC--------------------DYVIKYADQGSSTGVLVNDSFALRLANG 172
Query: 119 GIIREIPKFCFGC-----VGSTYREPI-GIAGFGRGALSVPSQL---GFLQKGFSHCFLA 169
++R P FGC V S P G+ G G G++S+ SQ G + HC L+
Sbjct: 173 SVVR--PSLAFGCGYDQQVSSGEMSPTDGVLGLGTGSVSLLSQFKQHGVTKNVVGHC-LS 229
Query: 170 FKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPL 229
+ L GD + + + +TPM++SP+ NYY G ++ G+ SL +
Sbjct: 230 LRGGGF------LFFGDDLVPYQ-RVTWTPMVRSPLR-NYYSPGSASLYFGDQSLR---V 278
Query: 230 SLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYR 289
L E ++ DSG+++T+ Y L++ L+ ++ KEV + + LC++
Sbjct: 279 KLTE--------VVFDSGSSFTYFAAQPYQALVTALKGDLSR--TLKEVSDPS-LPLCWK 327
Query: 290 VPCPNNTFTD--DLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAV-----KCLLFQS 342
P + D F S+ +F N GN + P N V CL +
Sbjct: 328 GKKPFKSVLDVKKEFKSLVLNFGN--------GNKAFMEIPPQNYLIVTKYGNACLGILN 379
Query: 343 MDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
+ + G Q+ V+YD EK +IG+ C
Sbjct: 380 GSEVGLKDLSILGDITMQDQMVIYDNEKGQIGWIRAPC 417
>gi|356536463|ref|XP_003536757.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 475
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 102/380 (26%), Positives = 158/380 (41%), Gaps = 58/380 (15%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V MD+GSD+ WV C C C Y + + F+P+ SSS S +CAS+ C ++
Sbjct: 151 VVMDSGSDIIWVQCE----PCTQC--YHQSDPV--FNPADSSSFSGVSCASTVCSHV--- 199
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIRE 123
DN C C + +YG+G G L +T+ + +IR
Sbjct: 200 DNA--ACHEGRC---------------RYEVSYGDGSYTKGTLALETITFGRT---LIRN 239
Query: 124 IPKFCFGCVGSTYREPIGIAGFGRGALSVPSQLGFLQKG-FSHCFLAFKYANDPNISSPL 182
+ C + G+ G G G +S QLG G FS+C ++ + S L
Sbjct: 240 VAIGCGHHNQGMFVGAAGLLGLGGGPMSFVGQLGGQTGGAFSYCLVSRGIES----SGLL 295
Query: 183 VIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNS--SLTEVPLSLREFDSQGNG 240
G A+ + P++ +P ++YYIGL + +G S++E L E G+G
Sbjct: 296 EFGREAMPV--GAAWVPLIHNPRAQSFYYIGLSGLGVGGLRVSISEDVFKLSEL---GDG 350
Query: 241 GLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDD 300
G+++D+GT T LP Y + T PRA V + FD CY + F
Sbjct: 351 GVVMDTGTAVTRLPTVAYEAFRDGFIAQTTNLPRASGV---SIFDTCYDL----FGFVSV 403
Query: 301 LFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQ 360
P+++F+F L LP N P + C F G + G+ QQ+
Sbjct: 404 RVPTVSFYFSGGPILTLPARNFLI----PVDDVGTFCFAFAPSSSG----LSIIGNIQQE 455
Query: 361 NVEVVYDLEKERIGFQPMDC 380
+++ D +GF P C
Sbjct: 456 GIQISVDGANGFVGFGPNVC 475
>gi|297740191|emb|CBI30373.3| unnamed protein product [Vitis vinifera]
Length = 218
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 63/227 (27%), Positives = 104/227 (45%), Gaps = 19/227 (8%)
Query: 161 KGFSHCFLAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYY-IGLEAITI 219
K F++C + Y +D S L++ D L +TP LKSP +YY +G++ I I
Sbjct: 4 KKFAYCLNSHDY-DDTRNSGKLIL-DYRDGKTKGLSYTPFLKSPPASAFYYHLGVKDIKI 61
Query: 220 GNSSLTEVPLSLREFDSQGNGGLLVDSGTTYT-HLPEPFYSQLLSILQSTITYYPRAKEV 278
GN L +P S G G+++DSG ++ P + + + L+ ++ Y R+ E
Sbjct: 62 GNK-LLRIPSKYLAPGSDGRSGVIIDSGYGGAGYMTGPVFKIVTNELKKQMSKYRRSLEA 120
Query: 279 EERTGFDLCYRVPCPNNTFTDDL-FPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKC 337
E +TG PC N T + P + + F ++V+P N+F S ++ C
Sbjct: 121 ETQTGL-----TPCYNFTGHKSIKIPPLIYQFRGGANMVVPGKNYF----GISPQESLAC 171
Query: 338 LLFQSMDDGDY----GPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
L + PS + G+ Q + V YDL+ +R GF+ C
Sbjct: 172 FLMDTNGTNALEITPDPSIILGNSQHVDYYVEYDLKNDRFGFRRQTC 218
>gi|297819836|ref|XP_002877801.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297323639|gb|EFH54060.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 513
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 99/411 (24%), Positives = 159/411 (38%), Gaps = 91/411 (22%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSN--------FSPSRSSSSSRDTCASS 55
V +DTGSDL W+PC N + C+ + + N ++PS S+SSS+ TC S+
Sbjct: 126 VALDTGSDLFWLPC-NCNSTCVRSMETDQGETHMNAQRIRLNIYNPSISTSSSKVTCNST 184
Query: 56 FCLNIHSSDNPFDPCTMSGCSLSTLLKSTCCRP---CPSFAYTYGEGGLVTGILTRDTLK 112
C L++ C P CP G TG+L D +
Sbjct: 185 LC----------------------ALRNRCISPLSDCPYRIRYLSPGSKSTGVLVEDVIH 222
Query: 113 VHGSSPGIIREIPKFCFGCVGST---YREPI--GIAGFGRGALSVPSQL---GFLQKGFS 164
+ + G R+ + FGC + ++E GI G ++VP+ L G FS
Sbjct: 223 M-STEEGEARD-ARITFGCSETQLGLFQEVAVNGIMGLAMADIAVPNMLVKAGVASDSFS 280
Query: 165 HCFLAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSL 224
CF PN + GD S + TP L + P +Y + + +G ++
Sbjct: 281 MCF-------GPNGKGTISFGDKGSSDQHE---TP-LGGTISPLFYDVSITKFKVGKVTV 329
Query: 225 TEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGF 284
E S + DSGT T L +P+Y+ L + ++ R + F
Sbjct: 330 -ETKFS-----------AIFDSGTAVTWLLDPYYTALTTNFHLSVP--DRRLPANVDSTF 375
Query: 285 DLCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAP-------SNSSAVKC 337
+ CY + +T ++ PSI+F +G Y + +P S V C
Sbjct: 376 EFCYII---TSTSDEEKLPSISFEM---------KGGAAYDVFSPILVFDTSDGSFQVYC 423
Query: 338 LLFQSMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDCASTASAQG 388
L D D+ + G N +V+D E+ +G++ +C T G
Sbjct: 424 LAVLKQDKADF---NIIGQNFMTNYRIVHDRERMILGWKKSNCNDTNGFTG 471
>gi|381148024|gb|AFF60302.1| xyloglucanase-specific endoglucanase inhibitor [Solanum tuberosum]
Length = 438
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 86/304 (28%), Positives = 131/304 (43%), Gaps = 43/304 (14%)
Query: 94 YTYGEGGLVTGILTRDTLKVHGSSPGIIREIPKFCFGCVGSTYREPI-----GIAGFGRG 148
YT+G L D L + G+SP ++ PKF F CV S + + GIAGFG
Sbjct: 136 YTFGAE------LAEDVLAI-GTSPIVLVSQPKFIFTCVESYIMKRLAKGVTGIAGFGHN 188
Query: 149 A-LSVPSQLGFLQKGFSHCF---LAFKYANDPNI---SSPLVIGDVAISSKDNLQFTPML 201
+ +S+P+QL L F+ F L+ + I SSP + + I NL +TP++
Sbjct: 189 STISIPNQLASLDSKFTRKFGICLSSSTRSSGVIFIGSSPYYVYNPMIDISKNLIYTPLV 248
Query: 202 KSPM---YPNYYYIGLEAITIGNSSLTEVPL--SLREFDSQGNGGLLVDSGTTYTHLPEP 256
+PM P Y++ + +I I +VPL +L + QG+GG + + +T L
Sbjct: 249 GNPMDWLTPMEYHVNVSSIRIAGK---DVPLNKTLLSINDQGHGGTRISTTIPFTILHTS 305
Query: 257 FY----SQLLSILQSTITYY-PRAKEVEERTGFDLCYRVPCPNNTFTDDLFPSITFHFLN 311
Y + ++ L +T P K F C+ T P I F F
Sbjct: 306 IYEVVKTAFINALPKNVTMVDPPMKR------FGACFSSKNIRITNVGPDVPVIDFVFHK 359
Query: 312 NVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQNVEVVYDLEKE 371
+ G A S S + CL F D + PS V G +Q + +V+DL +
Sbjct: 360 KSAFWRIYG----ANSVVQVSKDIMCLAFVGRDQ-TWEPSIVIGGYQLEENLLVFDLPHK 414
Query: 372 RIGF 375
+IGF
Sbjct: 415 KIGF 418
>gi|297723027|ref|NP_001173877.1| Os04g0336942 [Oryza sativa Japonica Group]
gi|255675342|dbj|BAH92605.1| Os04g0336942 [Oryza sativa Japonica Group]
Length = 388
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 90/327 (27%), Positives = 139/327 (42%), Gaps = 65/327 (19%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDT-CASSFCLNIHS 62
V +DTGS WV C C + F RSS SS++ C + C
Sbjct: 98 VQLDTGSKAFWVN----GISCKQCPHESDILRKLTFYDPRSSVSSKEVKCDDTIC----- 148
Query: 63 SDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIR 122
+ P PC M+ CP + Y +GGL GIL D L H G +
Sbjct: 149 TSRP--PCNMT-------------LRCP-YITGYADGGLTMGILFTDLLHYH-QLYGNGQ 191
Query: 123 EIP---KFCFGC----VGSTYREPI---GIAGFGRGALSVPSQL---GFLQKGFSHCFLA 169
P FGC GS + GI GFG + SQL G +K FSHC
Sbjct: 192 TQPTSTSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCL-- 249
Query: 170 FKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPL 229
+ N IG+V + ++ TP++K+ Y+ + L++I + ++L ++P
Sbjct: 250 ----DSTNGGGIFAIGEVV---EPKVKTTPIVKNNEV--YHLVNLKSINVAGTTL-QLPA 299
Query: 230 SLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDL-CY 288
++ F + G +DSG+T +LPE YS+L+ + + + ++ ++ C+
Sbjct: 300 NI--FGTTKTKGTFIDSGSTLVYLPEIIYSELI------LAVFAKHPDITMGAMYNFQCF 351
Query: 289 RVPCPNNTFTDDLFPSITFHFLNNVSL 315
DD FP ITFHF N+++L
Sbjct: 352 HFLGS----VDDKFPKITFHFENDLTL 374
>gi|88174554|gb|ABD39352.1| chloroplast nucleoid DNA-binding protein [Oryza barthii]
Length = 321
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 84/348 (24%), Positives = 140/348 (40%), Gaps = 63/348 (18%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCL----N 59
V +DTGS +WV C +CD N F SRS++ ++ +C +S CL +
Sbjct: 16 VEIDTGSSTSWVFC--------ECDGCHTNP--RTFLQSRSTTCAKVSCGTSMCLLGGSD 65
Query: 60 IHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPG 119
H D+ P CP F +Y +G GIL +DTL
Sbjct: 66 PHCQDSENYP------------------DCP-FRVSYQDGSASYGILYQDTLTFSD---- 102
Query: 120 IIREIPKFCFGCV-----GSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYAN 174
+++IP F FGC + + G+ G G GA+SV Q FS+C K
Sbjct: 103 -VQKIPGFSFGCNMDSFGANEFGNVDGLLGMGAGAMSVLKQSSPTFDCFSYCLPLQKSER 161
Query: 175 D--PNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLR 232
+ +G VA ++ ++++T M+ +++ L AI++ L P
Sbjct: 162 GFFSKTTGYFSLGKVA--TRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSIF- 218
Query: 233 EFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPC 292
G++ DSG+ +++P+ S L ++ + A+E ER +D+
Sbjct: 219 -----SRKGVVFDSGSELSYIPDRALSVLSQRIRELLLRRGAAEEESERNCYDM------ 267
Query: 293 PNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLF 340
+ + P+I+ HF + L G+H + V CL F
Sbjct: 268 --RSVDEGDMPAISLHFDDGARFDL--GSHGVFVERSVQEQDVWCLAF 311
>gi|383161172|gb|AFG63168.1| Pinus taeda anonymous locus 0_11073_01 genomic sequence
gi|383161174|gb|AFG63170.1| Pinus taeda anonymous locus 0_11073_01 genomic sequence
gi|383161175|gb|AFG63171.1| Pinus taeda anonymous locus 0_11073_01 genomic sequence
Length = 133
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 54/140 (38%), Positives = 77/140 (55%), Gaps = 13/140 (9%)
Query: 85 CCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIPKFCFGCVGSTYREPIGIAG 144
C + CP F+ TYG G TG L DTL + G REI F GC + GIAG
Sbjct: 1 CSKICPHFSLTYGTGN-ATGRLLSDTLTLPLEDGGR-REIKNFATGCA-VVSSQVAGIAG 57
Query: 145 FGRGALSVPSQLG-FLQKGFSHCFLAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKS 203
FG G LS+PSQL + F++C Y ++ SS +V+G+ A+ L +TP+L +
Sbjct: 58 FGNGGLSMPSQLAPLIGDKFAYC---LDYRSN---SSKIVLGNKAVPRDLPLTYTPLLFN 111
Query: 204 PMYP---NYYYIGLEAITIG 220
P+ P +Y+Y+ LE ++IG
Sbjct: 112 PVNPSVFSYFYLALETVSIG 131
>gi|242051593|ref|XP_002454942.1| hypothetical protein SORBIDRAFT_03g001790 [Sorghum bicolor]
gi|241926917|gb|EES00062.1| hypothetical protein SORBIDRAFT_03g001790 [Sorghum bicolor]
Length = 431
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 101/377 (26%), Positives = 157/377 (41%), Gaps = 60/377 (15%)
Query: 7 DTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDNP 66
DT SDLTW C NL D ++ F P++SSS + TC+S C + DNP
Sbjct: 109 DTASDLTWTQC-NLFNDTA-------KQVEPLFDPAKSSSFAFVTCSSKLC----TEDNP 156
Query: 67 FDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIPK 126
CS T CR + Y Y G+L ++ + ++ I
Sbjct: 157 ----GTKRCSNKT------CR----YVYPYVSVE-AAGVLAYESFTLSDNNQHICMS--- 198
Query: 127 FCFGCVGSTYREPIG---IAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSPLV 183
F FGC T +G I G LS+ SQL + FS+C + SSPL
Sbjct: 199 FGFGCGALTDGNLLGASGILGMSPAILSMVSQLAIPK--FSYCLTPYT----DRKSSPLF 252
Query: 184 IGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGGLL 243
G A + P+ KS + YYY+ L +++G L +VP + GG +
Sbjct: 253 FGAWADLGRYKTT-GPIQKSLTF--YYYVPLVGLSLGTRRL-DVPAATFALK---QGGTV 305
Query: 244 VDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDLFP 303
VD G T L EP ++ L + T+ + V++ + +C+ +P P
Sbjct: 306 VDLGCTVGQLAEPAFTALKEAVLHTLNLPLTNRTVKD---YKVCFALPS-GVAMGAVQTP 361
Query: 304 SITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQNVE 363
+ +F +VLP+ N+F +A + CL ++ G G + G+ QQQN
Sbjct: 362 PLVLYFDGGADMVLPRDNYFQEPTA-----GLMCL---ALVPG--GGMSIIGNVQQQNFH 411
Query: 364 VVYDLEKERIGFQPMDC 380
+++D+ + F P C
Sbjct: 412 LLFDVHDSKFLFAPTIC 428
>gi|224142001|ref|XP_002324349.1| predicted protein [Populus trichocarpa]
gi|222865783|gb|EEF02914.1| predicted protein [Populus trichocarpa]
Length = 490
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 99/384 (25%), Positives = 159/384 (41%), Gaps = 59/384 (15%)
Query: 2 IQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIH 61
+ + DTGSD+TW C + C Y+ + + F PS+S+S + +C+SS C ++
Sbjct: 162 LSLIFDTGSDITWTQCQPCARSC-----YKQKEQI--FDPSQSTSYTNISCSSSICNSLT 214
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
S+ C S C + YG+ G + L + +
Sbjct: 215 SATGNTPGCASSACV---------------YGIQYGDSSFSVGFFGTEKLTLTSTDA--- 256
Query: 122 REIPKFCFGCVGST---YREPIGIAGFGRGALSVPSQLG-FLQKGFSHCFLAFKYANDPN 177
FGC + + G+ G GR LSV SQ K FS+C P+
Sbjct: 257 --FNNIYFGCGQNNQGLFGGSAGLLGLGRDKLSVVSQTAQKYNKIFSYCL--------PS 306
Query: 178 ISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
SS S+ N +FTP+ P++Y + I++G L +S F +
Sbjct: 307 SSSSTGFLTFGGSASKNAKFTPLSTISAGPSFYGLDFTGISVGGKKL---AISASVFST- 362
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTF 297
G ++DSGT T LP YS L + ++ ++ YP K + D CY +++
Sbjct: 363 --AGAIIDSGTVITRLPPAAYSALRASFRNLMSKYPMTKALSI---LDTCYDF----SSY 413
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSF 357
T P I F F + + + + YA +S + CL F + D +FG+
Sbjct: 414 TTISVPKIGFSFSSGIEVDIDATGILYA-----SSLSQVCLAF--AGNSDATDVFIFGNV 466
Query: 358 QQQNVEVVYDLEKERIGFQPMDCA 381
QQ+ +EV YD ++GF P C+
Sbjct: 467 QQKTLEVFYDGSAGKVGFAPGGCS 490
>gi|88174591|gb|ABD39370.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 84/348 (24%), Positives = 139/348 (39%), Gaps = 63/348 (18%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCL----N 59
V +DTGS +WV C +CD N F SRS++ ++ +C +S CL +
Sbjct: 16 VEIDTGSSTSWVFC--------ECDGCHTNP--RTFLQSRSTTCAKVSCGTSMCLLGGSD 65
Query: 60 IHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPG 119
H D+ P CP F +Y +G GIL +DTL
Sbjct: 66 PHCQDSENYP------------------DCP-FRVSYQDGSASYGILYQDTLTFSD---- 102
Query: 120 IIREIPKFCFGCV-----GSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYAN 174
+++IP F FGC + + G+ G G G +SV Q GFS+C K
Sbjct: 103 -VQKIPSFTFGCNLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDGFSYCLPLQKSER 161
Query: 175 D--PNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLR 232
+ +G VA ++ ++++T M+ +++ L AI++ L P
Sbjct: 162 GFFSKTTGYFSLGKVA--TRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIF- 218
Query: 233 EFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPC 292
G++ DSG+ +++P+ S L ++ + A+E ER +D+
Sbjct: 219 -----SRKGVVFDSGSELSYIPDRALSVLSQRIRELLLRRGAAEEESERNCYDM------ 267
Query: 293 PNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLF 340
+ + P+I+ HF + L G H + V CL F
Sbjct: 268 --RSVDEGDMPAISLHFDDGARFDL--GIHGVFVERSVQEQDVWCLAF 311
>gi|302783204|ref|XP_002973375.1| hypothetical protein SELMODRAFT_413680 [Selaginella moellendorffii]
gi|300159128|gb|EFJ25749.1| hypothetical protein SELMODRAFT_413680 [Selaginella moellendorffii]
Length = 407
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 69/248 (27%), Positives = 117/248 (47%), Gaps = 30/248 (12%)
Query: 141 GIAGFGRGALSVPSQLGFLQ--KGFSHCFLAFKYANDPNISSPLVIGDVAISSKDNLQFT 198
G+ GF + S QL + F +C A S +V G+ ISS +L +T
Sbjct: 144 GLVGFAKTNKSFIGQLAEMDYTGKFIYC------APSDTFSGKIVFGNYKISSNSSLSYT 197
Query: 199 PMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFY 258
PM+ +P+ YYIGL +I+I N LT + ++ + G GG ++DS +++ Y
Sbjct: 198 PMIVNPISTALYYIGLRSISI-NDMLTFL---VQGILADGTGGTIIDSTFAFSYFTPDSY 253
Query: 259 SQLLSILQSTITYYPR--AKEVEERTGFDLCYRVPCPNNTFTDDLFP---SITFHFLNNV 313
+ L+ +Q+ + + + + G D+CY V +T P ++T+HF N
Sbjct: 254 TPLVQAIQNLNSNLTKVSSNKTAALLGNDICYNVSVNGDT------PPPQTLTYHFENGT 307
Query: 314 SLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPS-GVFGSFQQQNVEVVYDLEKER 372
+ + ++ + + ++ V CL D G S V G++QQ +V V +DLEK+
Sbjct: 308 QV---EFRTWFLLDDDAENATV-CLAVG--DSQKVGFSLNVIGTYQQLDVAVEFDLEKQE 361
Query: 373 IGFQPMDC 380
IGF C
Sbjct: 362 IGFGTAGC 369
>gi|242050026|ref|XP_002462757.1| hypothetical protein SORBIDRAFT_02g031460 [Sorghum bicolor]
gi|241926134|gb|EER99278.1| hypothetical protein SORBIDRAFT_02g031460 [Sorghum bicolor]
Length = 523
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 96/396 (24%), Positives = 167/396 (42%), Gaps = 67/396 (16%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCD-----DYRNNKLMSNFSPSRSSSSSRDTCASSFC- 57
V +DTGSDL WVPC DC++C +YR+ K +SP +SS+S + C+S+ C
Sbjct: 119 VALDTGSDLFWVPC-----DCINCAPLVSPNYRDLKF-DTYSPQKSSTSRKVPCSSNLCD 172
Query: 58 -LNIHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGS 116
+ S + P ++ S +T YG+ +VT +T ++
Sbjct: 173 LQSACRSASSSCPYSIEYLSDNTSSTGVLVEDVLYLITEYGQPKIVTAPITFGCGRIQTG 232
Query: 117 SPGIIREIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQL---GFLQKGFSHCFLAFKYA 173
S +GS P G+ G G ++SVPS L G FS CF
Sbjct: 233 S-------------FLGSA--APNGLLGLGMDSISVPSLLASEGVAANSFSMCF------ 271
Query: 174 NDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLRE 233
+ + GD S + Q TP+ P YY I + +G+ S
Sbjct: 272 -GDDGRGRINFGDTGSSDQ---QETPLNIYKQNP-YYNISITGAMVGSKS---------- 316
Query: 234 FDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCP 293
F++ N +VDSGT++T L +P YS++ S S + P +++ F+ CY + P
Sbjct: 317 FNTNFNA--IVDSGTSFTALSDPMYSEITSSFNSQVQDKP--TQLDSSLPFEFCYSI-SP 371
Query: 294 NNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGV 353
+ P+I+ + + P + ++ +++ CL + + +
Sbjct: 372 KGSVNP---PNIS--LMAKGGSIFPVNDPIITITDDASNPMAYCLAVMKSEGVN-----L 421
Query: 354 FGSFQQQNVEVVYDLEKERIGFQPMDCASTASAQGL 389
G ++VV+D E++ +G++ +C S ++ L
Sbjct: 422 IGENFMSGLKVVFDRERKVLGWKKFNCYSVDNSSNL 457
>gi|224096686|ref|XP_002310698.1| predicted protein [Populus trichocarpa]
gi|222853601|gb|EEE91148.1| predicted protein [Populus trichocarpa]
Length = 441
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 98/399 (24%), Positives = 153/399 (38%), Gaps = 91/399 (22%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDC-----DDYRNNKLMSNFSPSRSSSSSRDTCASSFCL 58
V +DTGSDL WVPC DC C Y ++ +S ++P SS+S + TC + C
Sbjct: 112 VALDTGSDLFWVPC-----DCSRCAPTHGASYASDFELSIYNPRESSTSKKVTCNNDMCA 166
Query: 59 NIHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSP 118
+ F CP +GIL +D L +
Sbjct: 167 QRNRCLGTFS-------------------SCPYIVSYVSAQTSTSGILVKDVLHLTTEDG 207
Query: 119 GIIREIPK--FCFGC----VGS--TYREPIGIAGFGRGALSVPSQL---GFLQKGFSHCF 167
G RE + FGC GS P G+ G G +SVPS L G + FS CF
Sbjct: 208 G--REFVEAYVTFGCGQVQSGSFLDIAAPNGLFGLGMEKISVPSVLSREGLIADSFSMCF 265
Query: 168 LAFKYANDPNISSPLVIGDVAISSKD--NLQFTPMLKSPMYPNYYYIGLEAITIGNSSLT 225
+D IG ++ K + + TP +P +P Y N ++T
Sbjct: 266 -----GHDG-------IGRISFGDKGSPDQEETPFNVNPAHPTY-----------NVTVT 302
Query: 226 EVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFD 285
+ + D + L DSGT++T++ +P YS++ S + + + R F+
Sbjct: 303 QARVGTMLIDVEFTA--LFDSGTSFTYMVDPAYSRVSEKFHSLAR--DKRRPPDPRIPFE 358
Query: 286 LCYRVPCPNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYA----MSAPSNSSAVKCLLFQ 341
CY D+ P + ++SL + G HF + + + V CL
Sbjct: 359 YCY-----------DMSPDANASLVPSMSLTMKGGRHFTVYDPIIVISTQNEIVYCLAVV 407
Query: 342 SMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
+ + + G VV+D EK +G++ DC
Sbjct: 408 KSTELN-----IIGQNFMTGYRVVFDREKLVLGWKKFDC 441
>gi|88174571|gb|ABD39360.1| chloroplast nucleoid DNA-binding protein [Oryza nivara]
Length = 321
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 84/348 (24%), Positives = 138/348 (39%), Gaps = 63/348 (18%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCL----N 59
V +DTGS +WV C +CD N F SRS++ ++ +C +S CL +
Sbjct: 16 VEIDTGSSTSWVFC--------ECDGCHTNP--RTFLQSRSTTCAKVSCGTSMCLLGGSD 65
Query: 60 IHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPG 119
H D+ P CP F +Y +G GIL +DTL
Sbjct: 66 PHCQDSENYP------------------DCP-FRVSYQDGSASYGILYQDTLTFSD---- 102
Query: 120 IIREIPKFCFGCV-----GSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYAN 174
+++IP F FGC + + G+ G G G +SV Q GFS+C K
Sbjct: 103 -VQKIPSFTFGCNLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPRFDGFSYCLPLQKSER 161
Query: 175 D--PNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLR 232
+ +G VA ++ ++++T M+ +++ L AI++ L P
Sbjct: 162 GFFSKTTGYFSLGKVA--TRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIF- 218
Query: 233 EFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPC 292
G++ DSG+ +++P+ S L ++ + A+E ER +D+
Sbjct: 219 -----SRKGVVFDSGSELSYIPDRALSVLSQRIRELLLRRGAAEEESERNCYDM------ 267
Query: 293 PNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLF 340
+ + P+I+ HF + L F S V CL F
Sbjct: 268 --RSVDEGDMPAISLHFDDGARFDLGSKGVFVERSVQEQD--VWCLAF 311
>gi|88174573|gb|ABD39361.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
Group]
Length = 321
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 84/348 (24%), Positives = 138/348 (39%), Gaps = 63/348 (18%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCL----N 59
V +DTGS +WV C +CD N F SRS++ ++ +C +S CL +
Sbjct: 16 VEIDTGSSTSWVFC--------ECDGCHTNP--RTFLQSRSTTCAKVSCGTSMCLLGGSD 65
Query: 60 IHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPG 119
H D+ P CP F +Y +G GIL +DTL
Sbjct: 66 PHCQDSENYP------------------DCP-FRVSYQDGSASYGILYQDTLTFSD---- 102
Query: 120 IIREIPKFCFGCV-----GSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYAN 174
+++IP F FGC + + G+ G G G +SV Q GFS+C K
Sbjct: 103 -VQKIPSFTFGCNLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDGFSYCLPLQKSER 161
Query: 175 D--PNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLR 232
+ +G VA ++ ++++T M+ +++ L AI++ L P
Sbjct: 162 GFFSKTTGYFSLGKVA--TRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIF- 218
Query: 233 EFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPC 292
G++ DSG+ +++P+ S L ++ + A+E ER +D+
Sbjct: 219 -----SRKGVVFDSGSELSYIPDRALSVLSQRIRELLLRRGAAEEESERNCYDM------ 267
Query: 293 PNNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLF 340
+ + P+I+ HF + L F S V CL F
Sbjct: 268 --RSVDEGDMPAISLHFDDGARFDLGSRGVFVERSVQEQD--VWCLAF 311
>gi|367068392|gb|AEX13220.1| hypothetical protein CL3308Contig1_01 [Pinus taeda]
gi|367068394|gb|AEX13221.1| hypothetical protein CL3308Contig1_01 [Pinus taeda]
gi|367068396|gb|AEX13222.1| hypothetical protein CL3308Contig1_01 [Pinus taeda]
gi|367068398|gb|AEX13223.1| hypothetical protein CL3308Contig1_01 [Pinus taeda]
gi|367068402|gb|AEX13225.1| hypothetical protein CL3308Contig1_01 [Pinus taeda]
gi|367068404|gb|AEX13226.1| hypothetical protein CL3308Contig1_01 [Pinus taeda]
gi|367068406|gb|AEX13227.1| hypothetical protein CL3308Contig1_01 [Pinus taeda]
gi|367068408|gb|AEX13228.1| hypothetical protein CL3308Contig1_01 [Pinus taeda]
gi|367068410|gb|AEX13229.1| hypothetical protein CL3308Contig1_01 [Pinus taeda]
Length = 77
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 36/78 (46%), Positives = 56/78 (71%), Gaps = 1/78 (1%)
Query: 177 NISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDS 236
N S +V+G+ A+ L + P+L +P+YP++YY+GLEA++IG LT +P +L FDS
Sbjct: 1 NNGSKIVLGNKAVPRDIALTYIPLLINPIYPDFYYLGLEAVSIGAKRLT-LPSNLLSFDS 59
Query: 237 QGNGGLLVDSGTTYTHLP 254
Q NGG ++DSGT++T+ P
Sbjct: 60 QRNGGTIIDSGTSFTNFP 77
>gi|242044812|ref|XP_002460277.1| hypothetical protein SORBIDRAFT_02g025885 [Sorghum bicolor]
gi|241923654|gb|EER96798.1| hypothetical protein SORBIDRAFT_02g025885 [Sorghum bicolor]
Length = 369
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 52/190 (27%), Positives = 83/190 (43%), Gaps = 18/190 (9%)
Query: 193 DNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGGLLVDSGTTYTH 252
++ TP+L +P + YY+ + I +G + P +L FD G ++DSGT +T
Sbjct: 197 QRIKTTPLLANPHRSSLYYVNMTGIRVGRKVVPIPPPAL-AFDPATGAGTVLDSGTMFTR 255
Query: 253 LPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDLFPSITFHFLNN 312
L P Y + ++ + V GFD C+ NT T +P +T F +
Sbjct: 256 LVAPAYVAVRDEVRRRV-----GAPVSSLGGFDTCF------NT-TAVAWPPVTLLF-DG 302
Query: 313 VSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQNVEVVYDLEKER 372
+ + LP+ N S + CL + DG V S QQQN V++D+ R
Sbjct: 303 MQVTLPEENVVIH----STYGTISCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGR 358
Query: 373 IGFQPMDCAS 382
+GF C +
Sbjct: 359 VGFARERCTA 368
>gi|224030719|gb|ACN34435.1| unknown [Zea mays]
Length = 216
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 64/236 (27%), Positives = 97/236 (41%), Gaps = 26/236 (11%)
Query: 150 LSVPSQLGFLQKG-FSHCFLAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPN 208
+S+ SQ G G FS+C +++ S L +G A N++ TP+L +P P+
Sbjct: 1 MSLLSQTGSRYNGVFSYCLPSYR---SYYFSGSLRLG--AAGQPRNVRHTPLLTNPHRPS 55
Query: 209 YYYIGLEAITIGNSSLTEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQST 268
YY+ + +++G + +VP FD G ++DSGT T P Y+ L +
Sbjct: 56 LYYVNVTGLSVGRT-WVKVPAGSFAFDPATGAGTVIDSGTVITRWTAPVYAALREEFRRQ 114
Query: 269 ITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDLF----PSITFHFLNNVSLVLPQGNHFY 324
+ FD C+ TD++ P +T H V L LP N
Sbjct: 115 VA---APSGYTSLGAFDTCFN--------TDEVAAGGAPPVTLHMDGGVDLTLPMENTLI 163
Query: 325 AMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
SA + + CL V + QQQNV VV D+ R+GF C
Sbjct: 164 HSSA----TPLACLAMAEAPQNVNAVVNVVANLQQQNVRVVVDVAGSRVGFAREPC 215
>gi|88174561|gb|ABD39355.1| chloroplast nucleoid DNA-binding protein [Oryza longistaminata]
Length = 321
Score = 75.5 bits (184), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 84/347 (24%), Positives = 141/347 (40%), Gaps = 65/347 (18%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCL----NIH 61
+DTGS +WV C +CD N F SRS++ ++ +C +S CL + H
Sbjct: 18 IDTGSSTSWVFC--------ECDGCHTNP--RTFLQSRSTTCAKVSCGTSMCLLGGSDPH 67
Query: 62 SSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGII 121
D+ P CP F +Y +G GIL +DTL +
Sbjct: 68 CQDSENYP------------------DCP-FRVSYQDGSASYGILYQDTLTFSD-----V 103
Query: 122 REIPKFCFGCV-----GSTYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDP 176
++IP F FGC + + G+ G G G +SV Q GFS+C L + +
Sbjct: 104 QKIPSFSFGCNMDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDGFSYC-LPLQMSERG 162
Query: 177 NISSP---LVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLRE 233
S +G VA ++ ++++T M+ +++ L AI++ L P
Sbjct: 163 FFSKTTGYFSLGKVA--TRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSIF-- 218
Query: 234 FDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCP 293
G++ DSG+ +++P+ S L ++ + A+E ER +D+
Sbjct: 219 ----SRKGVVFDSGSELSYIPDRALSVLSQRIRELLLRRGAAEEESERNCYDM------- 267
Query: 294 NNTFTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLF 340
+ + P+I+ HF + L G+H + V CL F
Sbjct: 268 -RSVDEGDMPAISLHFDDGARFDL--GSHGVFVERSVQEQDVWCLAF 311
>gi|225438629|ref|XP_002281243.1| PREDICTED: aspartic proteinase-like protein 1-like [Vitis vinifera]
Length = 511
Score = 75.5 bits (184), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 99/402 (24%), Positives = 159/402 (39%), Gaps = 92/402 (22%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRN------NKLMSNFSPSRSSSSSRDTCASSFC 57
V +D GSDL W+PC DC+ C ++ ++ +SPS SS+S +C+ C
Sbjct: 96 VALDAGSDLLWIPC-----DCIQCAPLSASYYGSLDRDLNQYSPSGSSTSKHLSCSHQLC 150
Query: 58 LNIHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGS- 116
+ + D+P + CP Y E +G+L D L +
Sbjct: 151 ESSPNCDSP-------------------KQLCPYTINYYSENTSSSGLLIEDILHLTSGI 191
Query: 117 ---------SPGII----REIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQL---GFLQ 160
+P II R+ + G P G+ G G G +SVPS L G ++
Sbjct: 192 DDASNSSVRAPVIIGCGMRQTGGYLDGVA------PDGLMGLGLGEISVPSFLSKAGLVK 245
Query: 161 KGFSHCFLAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIG 220
FS CF ND + S + GD ++++ F P S Y +G+EA IG
Sbjct: 246 NSFSLCF------NDDD-SGRIFFGDQGLATQQTTLFLP---SDGKYETYIVGVEACCIG 295
Query: 221 NSSLTEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEE 280
+S + + S R LVDSG ++T LP+ Y ++ + + E
Sbjct: 296 SSCIKQT--SFRA---------LVDSGASFTFLPDESYRNVVDEFDKQVN---ATRFSFE 341
Query: 281 RTGFDLCYRVPCPNNTFTDDLF--PSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCL 338
++ CY+ + + +L PS+ F N S V+ N + + CL
Sbjct: 342 GYPWEYCYK------SSSKELLKNPSVILKFALNNSFVV--HNPVFVVHGYQGVVGF-CL 392
Query: 339 LFQSMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
Q D G G+ G +V+D E ++G+ +C
Sbjct: 393 AIQPAD----GDIGILGQNFMTGYRMVFDRENLKLGWSRSNC 430
>gi|296082464|emb|CBI21469.3| unnamed protein product [Vitis vinifera]
Length = 530
Score = 75.5 bits (184), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 99/402 (24%), Positives = 159/402 (39%), Gaps = 92/402 (22%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRN------NKLMSNFSPSRSSSSSRDTCASSFC 57
V +D GSDL W+PC DC+ C ++ ++ +SPS SS+S +C+ C
Sbjct: 115 VALDAGSDLLWIPC-----DCIQCAPLSASYYGSLDRDLNQYSPSGSSTSKHLSCSHQLC 169
Query: 58 LNIHSSDNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGS- 116
+ + D+P + CP Y E +G+L D L +
Sbjct: 170 ESSPNCDSP-------------------KQLCPYTINYYSENTSSSGLLIEDILHLTSGI 210
Query: 117 ---------SPGII----REIPKFCFGCVGSTYREPIGIAGFGRGALSVPSQL---GFLQ 160
+P II R+ + G P G+ G G G +SVPS L G ++
Sbjct: 211 DDASNSSVRAPVIIGCGMRQTGGYLDGVA------PDGLMGLGLGEISVPSFLSKAGLVK 264
Query: 161 KGFSHCFLAFKYANDPNISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIG 220
FS CF ND + S + GD ++++ F P S Y +G+EA IG
Sbjct: 265 NSFSLCF------NDDD-SGRIFFGDQGLATQQTTLFLP---SDGKYETYIVGVEACCIG 314
Query: 221 NSSLTEVPLSLREFDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEE 280
+S + + S R LVDSG ++T LP+ Y ++ + + E
Sbjct: 315 SSCIKQT--SFRA---------LVDSGASFTFLPDESYRNVVDEFDKQVN---ATRFSFE 360
Query: 281 RTGFDLCYRVPCPNNTFTDDLF--PSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCL 338
++ CY+ + + +L PS+ F N S V+ N + + CL
Sbjct: 361 GYPWEYCYK------SSSKELLKNPSVILKFALNNSFVV--HNPVFVVHGYQGVVGF-CL 411
Query: 339 LFQSMDDGDYGPSGVFGSFQQQNVEVVYDLEKERIGFQPMDC 380
Q D G G+ G +V+D E ++G+ +C
Sbjct: 412 AIQPAD----GDIGILGQNFMTGYRMVFDRENLKLGWSRSNC 449
>gi|413953772|gb|AFW86421.1| hypothetical protein ZEAMMB73_098827 [Zea mays]
Length = 482
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 99/390 (25%), Positives = 159/390 (40%), Gaps = 60/390 (15%)
Query: 4 VYMDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSS 63
V DTGSDLTWV C + C Y+ + + F PS+SS+ +++
Sbjct: 141 VLFDTGSDLTWVQCKPCTDSC-----YQQQEPL--FDPSKSSTY----------VDV--- 180
Query: 64 DNPFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSP---GI 120
PC C + TC ++ YG+ + G L ++ + S+P G+
Sbjct: 181 -----PCGTPQCKIGGGQDLTCGGTTCEYSVKYGDQSVTRGNLAQEAFTLSPSAPPAAGV 235
Query: 121 IREIP-KFCFGCVGSTYREPI-GIAGFGRGALSVPSQLGFLQKG--FSHCFLAFKYANDP 176
+ ++ G G+ + G+ G GRG S+ SQ G FS+C P
Sbjct: 236 VFGCSHEYSSGVKGAEEEMSVAGLLGLGRGDSSILSQTRRGNSGDVFSYCL-------PP 288
Query: 177 NISSP--LVIGDVAISSKDNLQFTPML-KSPMYPNYYYIGLEAITIGNSSLTEVPLSLRE 233
SS L IG A + NL FTP++ + + Y + L I++ ++L P+
Sbjct: 289 RGSSAGYLTIG-AAAPPQSNLSFTPLVTDNSQLSSVYVVNLVGISVSGAAL---PIDASA 344
Query: 234 FDSQGNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCP 293
F G ++DSGT TH+P Y L + + Y E + D CY V
Sbjct: 345 FYI----GTVIDSGTVITHMPAAAYYVLRDEFRRHMGGYTMLPEGHVES-LDTCYDV-TG 398
Query: 294 NNTFTDDLFPSITFHFLNNVSLVLPQGNHF--YAMSAPSNSSAVKCLLFQSMDDGDYGPS 351
++ T P + F + + +A+ A S + CL F + +
Sbjct: 399 HDVVTA---PPVALEFGGGARIDVDASGILLVFAVDASGQSLTLACLAFVPTNLPGF--- 452
Query: 352 GVFGSFQQQNVEVVYDLEKERIGFQPMDCA 381
+ G+ QQ+ VV+D+E RIGF C+
Sbjct: 453 VIIGNMQQRAYNVVFDVEGRRIGFGANGCS 482
>gi|255538124|ref|XP_002510127.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223550828|gb|EEF52314.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 641
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 97/386 (25%), Positives = 152/386 (39%), Gaps = 67/386 (17%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGS +T+VPC C C +++ + F P SS+ C N
Sbjct: 105 VDTGSTVTYVPCST----CEQCGKHQDPR----FQPESSSTYKPMQC------------N 144
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
P C G + C ++ Y E +G+L D L S +
Sbjct: 145 PSCNCDDEG------------KQC-TYERRYAEMSSSSGLLAEDVLSFGNESELTPQ--- 188
Query: 126 KFCFGC----VGSTYREPI-GIAGFGRGALSVPSQLGFLQ---KGFSHCFLAFKYANDPN 177
+ FGC G + + GI G GRG LSV QL + FS C Y
Sbjct: 189 RAIFGCETVETGELFSQRADGIMGLGRGPLSVVDQLVIKEVVGNSFSLC-----YGGMDV 243
Query: 178 ISSPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQ 237
+ +V+G+ I ++ F P YY I L+ + + L L+ R FD
Sbjct: 244 VGGAMVLGN--IPPPPDMVFA--HSDPYRSAYYNIELKELHVAGKRLK---LNPRVFD-- 294
Query: 238 GNGGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTF 297
G G ++DSGTTY +LPE + + I + + + D+C+ + +
Sbjct: 295 GKHGTVLDSGTTYAYLPEEAFVAFKDAIIKEIKFLKQIHGPDPSYN-DICFSGAGRDVSQ 353
Query: 298 TDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCL-LFQSMDDGDYGPSGVFGS 356
+FP + F N L L N+ + + S CL +FQ+ D P+ + G
Sbjct: 354 LSKIFPEVNMVFGNGQKLSLSPENYLFRH---TKVSGAYCLGIFQNGKD----PTTLLGG 406
Query: 357 FQQQNVEVVYDLEKERIGFQPMDCAS 382
+N V YD + ++IGF +C+
Sbjct: 407 IVVRNTLVTYDRDNDKIGFWKTNCSE 432
>gi|356555807|ref|XP_003546221.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 457
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 100/384 (26%), Positives = 153/384 (39%), Gaps = 60/384 (15%)
Query: 6 MDTGSDLTWVPCGNLSFDCMDCDDYRNNKLMSNFSPSRSSSSSRDTCASSFCLNIHSSDN 65
+DTGS L+W+ C C Y + ++ F+PS S T + C + S
Sbjct: 124 VDTGSSLSWL-------QCQPCVIYCHVQVDPIFTPSVS-----KTYKALSCSSSQCSSL 171
Query: 66 PFDPCTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIP 125
GCS +T C + +YG+ G L++D L + S+
Sbjct: 172 KSSTLNAPGCSNAT---GACV-----YKASYGDTSFSIGYLSQDVLTLTPSA----APSS 219
Query: 126 KFCFGCVGST---YREPIGIAGFGRGALSVPSQL-GFLQKGFSHCFLAFKYANDPN--IS 179
F +GC + GI G LS+ QL FS+C L ++ PN +S
Sbjct: 220 GFVYGCGQDNQGLFGRSAGIIGLANDKLSMLGQLSNKYGNAFSYC-LPSSFSAQPNSSVS 278
Query: 180 SPLVIGDVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGN 239
L IG ++SS +FTP++K+P P+ Y++GL IT+ PL + S N
Sbjct: 279 GFLSIGASSLSSSP-YKFTPLVKNPKIPSLYFLGLTTITVAGK-----PLGVSA--SSYN 330
Query: 240 GGLLVDSGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGF---DLCYRVPCPNNT 296
++DSGT T LP Y+ L ++ K+ + GF D C++ +
Sbjct: 331 VPTIIDSGTVITRLPVAIYNALKKSFVMIMS-----KKYAQAPGFSILDTCFK----GSV 381
Query: 297 FTDDLFPSITFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGS 356
P I F L L N + CL + + P + G+
Sbjct: 382 KEMSTVPEIRIIFRGGAGLELKVHNSLVEI-----EKGTTCLAIAASSN----PISIIGN 432
Query: 357 FQQQNVEVVYDLEKERIGFQPMDC 380
+QQQ V YD+ +IGF P C
Sbjct: 433 YQQQTFTVAYDVANSKIGFAPGGC 456
>gi|413936472|gb|AFW71023.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
Length = 289
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 80/315 (25%), Positives = 125/315 (39%), Gaps = 47/315 (14%)
Query: 70 CTMSGCSLSTLLKSTCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIPKFCF 129
C S + T T + C FA +Y +G G ++D L + +PG I + F F
Sbjct: 18 CARSSPPMRTAAAVTSGKQC-GFAISYADGTSTVGAYSQDKLTL---APGAI--VQNFYF 71
Query: 130 GCVGSTYREPI---GIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISS-PLVIG 185
GC + G+ G GR S+ ++ G + FS+C P++SS P +
Sbjct: 72 GCGHGKHAVRGLFDGVLGLGRLRESLGARYGGV---FSYCL--------PSVSSKPGFLA 120
Query: 186 DVAISSKDNLQFTPMLKSPMYPNYYYIGLEAITIGNSSLTEVPLSLREFDSQGNGGLLVD 245
A + FTPM P P + + L I +G L P + +GG++VD
Sbjct: 121 LGAGKNPSGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLRPSAF-------SGGMIVD 173
Query: 246 SGTTYTHLPEPFYSQLLSILQSTITYYPRAKEVEERTGFDLCYRVPCPNNTFTDDLFPSI 305
SGT T L Y L S + + Y + D CY + + + + P I
Sbjct: 174 SGTVITGLQSTAYRALRSAFRKAMEAY----RLLPNGDLDTCYNL----TGYKNVVVPKI 225
Query: 306 TFHFLNNVSLVLPQGNHFYAMSAPSNSSAVKCLLFQSMDDGDYGPSGVFGSFQQQNVEVV 365
F ++ L P+ CL F + G G +GV G+ Q+ EV+
Sbjct: 226 ALTFTGGATINL---------DVPNGILVNGCLAFA--ESGPDGSAGVLGNVNQRAFEVL 274
Query: 366 YDLEKERIGFQPMDC 380
+D + GF+ C
Sbjct: 275 FDTSTSKFGFRAKAC 289
>gi|222624645|gb|EEE58777.1| hypothetical protein OsJ_10300 [Oryza sativa Japonica Group]
Length = 431
Score = 74.7 bits (182), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 86/325 (26%), Positives = 129/325 (39%), Gaps = 46/325 (14%)
Query: 83 STCCRPCPSFAYTYGEGGLVTGILTRDTLKVHGSSPGIIREIPKFCFGCVGS-------- 134
S CR + +Y + G+L DT + G +P + FGC+ S
Sbjct: 115 SNACR----VSLSYADASSADGVLATDTFLLTGGAPPVAV---GAYFGCITSYSSTTATN 167
Query: 135 -------TYREPIGIAGFGRGALSVPSQLGFLQKGFSHCFLAFKYANDPNISSPLVIGDV 187
G+ G RG LS +Q G + F++C P + L++GD
Sbjct: 168 SNGTGTDVSEAATGLLGMNRGTLSFVTQTG--TRRFAYCI---APGEGPGV---LLLGDD 219
Query: 188 AISSKDNLQFTPMLK-SPMYPNY----YYIGLEAITIGNSSLTEVPLSLREFDSQGNGGL 242
L +TP+++ S P + Y + LE I +G +L +P S+ D G G
Sbjct: 220 G-GVAPPLNYTPLIEISQPLPYFDRVAYSVQLEGIRVG-CALLPIPKSVLTPDHTGAGQT 277
Query: 243 LVDSGTTYTHLPEPFYSQLLSIL--QSTITYYPRAKEVEERTG-FDLCYRVPCPNNTFTD 299
+VDSGT +T L Y+ L + Q+ + P + G FD C+R P
Sbjct: 278 MVDSGTQFTFLLADAYAALKAEFTSQARLLLAPLGEPGFVFQGAFDACFRGPEARVAAAS 337
Query: 300 DLFPSITFHFLNNVSLVLPQGNHFYAM----SAPSNSSAVKCLLFQSMDDGDYGPSGVFG 355
L P + L + + Y + + AV CL F + D + V G
Sbjct: 338 GLLPEVGL-VLRGAEVAVSGEKLLYMVPGERRGEGGAEAVWCLTFGNSDMAGMS-AYVIG 395
Query: 356 SFQQQNVEVVYDLEKERIGFQPMDC 380
QQNV V YDL+ R+GF P C
Sbjct: 396 HHHQQNVWVEYDLQNGRVGFAPARC 420
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.320 0.136 0.423
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,601,557,083
Number of Sequences: 23463169
Number of extensions: 292194172
Number of successful extensions: 561475
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 300
Number of HSP's successfully gapped in prelim test: 1684
Number of HSP's that attempted gapping in prelim test: 555253
Number of HSP's gapped (non-prelim): 2420
length of query: 394
length of database: 8,064,228,071
effective HSP length: 144
effective length of query: 250
effective length of database: 8,980,499,031
effective search space: 2245124757750
effective search space used: 2245124757750
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 78 (34.7 bits)