BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 013377
(444 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255586856|ref|XP_002534038.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223525945|gb|EEF28342.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 533
Score = 568 bits (1465), Expect = e-159, Method: Compositional matrix adjust.
Identities = 277/407 (68%), Positives = 326/407 (80%), Gaps = 6/407 (1%)
Query: 25 FGFGTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDK 84
+GFGTFGFD HHRYSDPVKG+L+VDDLP+KGS YY+++AHRD + GR L + N
Sbjct: 36 YGFGTFGFDLHHRYSDPVKGMLSVDDLPEKGSLHYYASMAHRD--ILIHGRKLVSD-NTS 92
Query: 85 TPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVS--CVHGL 142
TPLTF +GN+TYR +SLGFLHY NVS+G P+LS++VALDTGSDLFWLPCDC + CV GL
Sbjct: 93 TPLTFFSGNETYRFSSLGFLHYANVSIGTPSLSYLVALDTGSDLFWLPCDCTNSGCVQGL 152
Query: 143 NSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGF 202
SG+ IDFNIY PN SSTS +PCN+TLC Q +CPSA S CPYQV+YLS+GT STG
Sbjct: 153 QFPSGEQIDFNIYRPNASSTSQTIPCNNTLCSRQSRCPSAQSTCPYQVQYLSNGTSSTGV 212
Query: 203 LVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQ 262
LVED+LHL TD+ QS+++D++I FGCGRVQTGSFLDGAAPNGLFGLGM SVPS LA +
Sbjct: 213 LVEDLLHLTTDDAQSRALDAKIIFGCGRVQTGSFLDGAAPNGLFGLGMTNISVPSTLARE 272
Query: 263 GLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE 322
G NSFSMCFG DG GRISFGD GS GQGETPF+LRQ HPTYN++IT+++VGG + E
Sbjct: 273 GYTSNSFSMCFGRDGIGRISFGDTGSSGQGETPFNLRQLHPTYNVSITKINVGGRDADLE 332
Query: 323 FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
FSAIFDSGTSFTYLNDPAYT ISE+FN AKEKR +S SD+PFEYCY +S NQTN E P
Sbjct: 333 FSAIFDSGTSFTYLNDPAYTLISESFNIGAKEKRYSSISDIPFEYCYEMSSNQTNLEIPT 392
Query: 383 VNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREY 429
VNL M+GG F V DPIVIV + G +YCL +VKS +VNIIG+ +
Sbjct: 393 VNLVMQGGSQFNVTDPIVIVILQ-GGASIYCLAIVKSGDVNIIGQNF 438
>gi|356540838|ref|XP_003538891.1| PREDICTED: peroxidase [Glycine max]
Length = 829
Score = 526 bits (1355), Expect = e-147, Method: Compositional matrix adjust.
Identities = 263/408 (64%), Positives = 314/408 (76%), Gaps = 10/408 (2%)
Query: 24 CFGFGTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGND 83
C +FGFD HHR+SDPVK IL V DLP KG+ YY +AHRDR FR GR LAA +
Sbjct: 24 CHALNSFGFDIHHRFSDPVKEILGVHDLPDKGTRLYYVVMAHRDRIFR--GRRLAAAVH- 80
Query: 84 KTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLN 143
+PLTF N+TY++ + GFLH+ NVSVG P LSF+VALDTGSDLFWLPC+C CV G+
Sbjct: 81 HSPLTFVPANETYQIGAFGFLHFANVSVGTPPLSFLVALDTGSDLFWLPCNCTKCVRGV- 139
Query: 144 SSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFL 203
S+G+ I FNIY SSTS V CNS LCELQ+QCPS+ S CPY+V YLS+GT +TGFL
Sbjct: 140 ESNGEKIAFNIYDLKGSSTSQTVLCNSNLCELQRQCPSSDSICPYEVNYLSNGTSTTGFL 199
Query: 204 VEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQG 263
VEDVLHL TD+ ++K D+RI+FGCG+VQTG+FLDGAAPNGLFGLGM SVPSILA +G
Sbjct: 200 VEDVLHLITDDDETKDADTRITFGCGQVQTGAFLDGAAPNGLFGLGMGNESVPSILAKEG 259
Query: 264 LIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEF 323
L NSFSMCFGSDG GRI+FGD S QG+TPF+LR HPTYNIT+TQ+ VGGNA + EF
Sbjct: 260 LTSNSFSMCFGSDGLGRITFGDNSSLVQGKTPFNLRALHPTYNITVTQIIVGGNAADLEF 319
Query: 324 SAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTS--DLPFEYCYVLSPNQTNFEYP 381
AIFDSGTSFT+LNDPAY QI+ +FNS K +R +S+S +LPFEYCY LS N+T E P
Sbjct: 320 HAIFDSGTSFTHLNDPAYKQITNSFNSAIKLQRYSSSSSDELPFEYCYDLSSNKT-VELP 378
Query: 382 VVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREY 429
+NLTMKGG + V DPIV +S E G+ L CLGV+KS+NVNIIG+ +
Sbjct: 379 -INLTMKGGDNYLVTDPIVTISGE--GVNLLCLGVLKSNNVNIIGQNF 423
>gi|357517935|ref|XP_003629256.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355523278|gb|AET03732.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 544
Score = 524 bits (1350), Expect = e-146, Method: Compositional matrix adjust.
Identities = 261/412 (63%), Positives = 318/412 (77%), Gaps = 13/412 (3%)
Query: 24 CFGFGTFGFDFHHRYSDPVKGILAVDD---LPKKGSFAYYSALAHRDRYFRLRGRGLAAQ 80
C+ G FG D HHR+SDPV IL + + LP KG+ YY+A+ HRDR F GR LA
Sbjct: 33 CYSLGKFGLDIHHRFSDPVTEILGIGNDELLPHKGTPQYYAAMVHRDRVFH--GRRLA-- 88
Query: 81 GNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVH 140
+ TP+TF+AGN+T+++ + GFLH+ NVSVG P L F+VALDTGSDLFWLPC+C SCV
Sbjct: 89 DDRDTPITFAAGNETHQIAAFGFLHFANVSVGTPPLWFLVALDTGSDLFWLPCNCTSCVR 148
Query: 141 GLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMST 200
GL + +G+VID NIY + SST VPCNS +C+ Q QC S+GS+C Y+V YLS+ T S+
Sbjct: 149 GLKTQNGKVIDLNIYELDKSSTRKNVPCNSNMCK-QTQCHSSGSSCRYEVEYLSNDTSSS 207
Query: 201 GFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILA 260
GFLVEDVLHL TD Q+K +D++I+ GCG+VQTG FL+GAAPNGLFGLGM+ SVPSILA
Sbjct: 208 GFLVEDVLHLITDNDQTKDIDTQITIGCGQVQTGVFLNGAAPNGLFGLGMENVSVPSILA 267
Query: 261 NQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVN 320
+GLI +SFSMCFGSDG+GRI+FGD GS QG+TPF+LR++HPTYN+TITQ+ VGG A +
Sbjct: 268 QKGLISDSFSMCFGSDGSGRITFGDTGSSDQGKTPFNLRESHPTYNVTITQIIVGGYAAD 327
Query: 321 FEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRE---TSTSDLPFEYCYVLSPNQTN 377
EF AIFDSGTSFTYLNDPAYT ISE FNSL K R + SDLPFEYCY +SP+QT
Sbjct: 328 HEFHAIFDSGTSFTYLNDPAYTLISEKFNSLVKANRHSPLSPDSDLPFEYCYDMSPDQT- 386
Query: 378 FEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREY 429
E P +NLTMKGG ++V DPIV VSSE +G L CLG+ KSDN+NIIGREY
Sbjct: 387 IEVPFLNLTMKGGDDYYVTDPIVPVSSEVEG-NLLCLGIQKSDNLNIIGREY 437
>gi|356559246|ref|XP_003547911.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 516
Score = 524 bits (1349), Expect = e-146, Method: Compositional matrix adjust.
Identities = 257/412 (62%), Positives = 322/412 (78%), Gaps = 10/412 (2%)
Query: 23 CCFGFGTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGN 82
CC+G TFGFD HHR+SD +KG+L +DD+P+KG+ YY+ +AHRDR FR GR LA +
Sbjct: 26 CCYGLSTFGFDIHHRFSDQIKGMLGIDDVPQKGTPQYYAVMAHRDRVFR--GRRLAG-AD 82
Query: 83 DKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHG- 141
+PLTF+AGNDT+++ S GFLH+ NVSVG P L F+VALDTGSDLFWLPCDC+SCVHG
Sbjct: 83 HHSPLTFAAGNDTHQIASSGFLHFANVSVGTPPLWFLVALDTGSDLFWLPCDCISCVHGG 142
Query: 142 LNSSSGQVIDFNIYSPNTSSTSSKVPCN-STLCELQKQCPSAGSNCPYQVRYLSDGTMST 200
L + +G+++ FN Y + SSTS++V CN ST C ++QCPSAGS C YQV YLS+ T S
Sbjct: 143 LRTRTGKILKFNTYDLDKSSTSNEVSCNNSTFCRQRQQCPSAGSTCRYQVDYLSNDTSSR 202
Query: 201 GFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILA 260
GF+VEDVLHL TD+ Q+K D+RI+FGCG+VQTG FL+GAAPNGLFGLGMD SVPSILA
Sbjct: 203 GFVVEDVLHLITDDDQTKDADTRIAFGCGQVQTGVFLNGAAPNGLFGLGMDNISVPSILA 262
Query: 261 NQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVN 320
+GLI NSFSMCFGSD GRI+FGD GSP Q +TPF++R+ HPTYNITIT++ V + +
Sbjct: 263 REGLISNSFSMCFGSDSAGRITFGDTGSPDQRKTPFNVRKLHPTYNITITKIIVEDSVAD 322
Query: 321 FEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETST---SDLPFEYCYVLSPNQTN 377
EF AIFDSGTSFTY+NDPAYT+I E +NS K KR +S S++PF+YCY +S +QT
Sbjct: 323 LEFHAIFDSGTSFTYINDPAYTRIGEMYNSKVKAKRHSSQSPDSNIPFDYCYDISISQT- 381
Query: 378 FEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREY 429
E P +NLTMKGG ++V DPI+ VSSE +G L CLG+ KSD+VNIIG+ +
Sbjct: 382 IEVPFLNLTMKGGDDYYVMDPIIQVSSEEEG-DLLCLGIQKSDSVNIIGQNF 432
>gi|312282765|dbj|BAJ34248.1| unnamed protein product [Thellungiella halophila]
Length = 515
Score = 514 bits (1323), Expect = e-143, Method: Compositional matrix adjust.
Identities = 250/419 (59%), Positives = 316/419 (75%), Gaps = 7/419 (1%)
Query: 12 VLLILLSCCAGCCFGFGTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFR 71
+L+++ S C G G FGF+FHHR+SD V G+L D LP + S YY +AHRDR
Sbjct: 15 ILMLVSSWVLDRCEGLGEFGFEFHHRFSDQVVGVLPGDGLPNRDSSKYYRVMAHRDRL-- 72
Query: 72 LRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWL 131
+RGR LA++ D++ +TF+ GN+T R+N+LGFLHY NV+VG P+ F+VALDTGSDLFWL
Sbjct: 73 IRGRRLASE--DQSLVTFADGNETIRVNALGFLHYANVTVGTPSDWFLVALDTGSDLFWL 130
Query: 132 PCDC-VSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQV 190
PCDC +CV L + G +D NIYSPN SSTSSKVPCNSTLC +C S S+CPYQ+
Sbjct: 131 PCDCSTNCVRELKAPGGSSLDLNIYSPNASSTSSKVPCNSTLCTRVDRCASPLSDCPYQI 190
Query: 191 RYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGM 250
RYLS+GT STG LVEDVLHL + EK SK + +RI+ GCG VQTG F DGAAPNGLFGLG+
Sbjct: 191 RYLSNGTSSTGVLVEDVLHLVSMEKNSKPIRARITLGCGLVQTGVFHDGAAPNGLFGLGL 250
Query: 251 DKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITIT 310
+ SVPS+LA +G+ NSFSMCFG DG GRISFGDKGS Q ETP ++RQ HPTYN+T+T
Sbjct: 251 EDISVPSVLAKEGIAANSFSMCFGDDGAGRISFGDKGSVDQRETPLNIRQPHPTYNVTVT 310
Query: 311 QVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYV 370
Q+SVGGN + EF A+FD+GTSFTYL D YT ISE+FNSLA +KR + S+LPFEYCY
Sbjct: 311 QISVGGNTGDLEFDAVFDTGTSFTYLTDAPYTLISESFNSLALDKRYQTDSELPFEYCYA 370
Query: 371 LSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREY 429
+SPN+ +FEYP VNLTMKGG + V P+++V E +YCL ++KS++++IIG+ +
Sbjct: 371 VSPNKKSFEYPDVNLTMKGGSSYPVYHPLIVVPIEDT--VVYCLAIMKSEDISIIGQNF 427
>gi|356496606|ref|XP_003517157.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 508
Score = 511 bits (1316), Expect = e-142, Method: Compositional matrix adjust.
Identities = 263/408 (64%), Positives = 313/408 (76%), Gaps = 10/408 (2%)
Query: 24 CFGFGTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGND 83
C +FGFD HHR+SDPVK IL V DLP KG+ YY A+AHRDR FR GR LAA
Sbjct: 24 CHALHSFGFDIHHRFSDPVKEILGVHDLPDKGTRQYYVAMAHRDRIFR--GRRLAA--GY 79
Query: 84 KTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLN 143
+PLTF N+TY++ + GFLH+ NVSVG P LSF+VALDTGSDLFWLPC+C CVHG+
Sbjct: 80 HSPLTFIPSNETYQIEAFGFLHFANVSVGTPPLSFLVALDTGSDLFWLPCNCTKCVHGIG 139
Query: 144 SSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFL 203
S+G+ I FNIY SSTS V CNS+LCELQ+QCPS+ + CPY+V YLS+GT +TGFL
Sbjct: 140 LSNGEKIAFNIYDLKGSSTSQPVLCNSSLCELQRQCPSSDTICPYEVNYLSNGTSTTGFL 199
Query: 204 VEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQG 263
VEDVLHL TD+ ++K D+RI+FGCG+VQTG+FLDGAAPNGLFGLGM SVPSILA +G
Sbjct: 200 VEDVLHLITDDDKTKDADTRITFGCGQVQTGAFLDGAAPNGLFGLGMSNESVPSILAKEG 259
Query: 264 LIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEF 323
L NSFSMCFGSDG GRI+FGD S QG+TPF+LR HPTYNIT+TQ+ VG + EF
Sbjct: 260 LTSNSFSMCFGSDGLGRITFGDNSSLVQGKTPFNLRALHPTYNITVTQIIVGEKVDDLEF 319
Query: 324 SAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTS--DLPFEYCYVLSPNQTNFEYP 381
AIFDSGTSFTYLNDPAY QI+ +FNS K +R +++S +LPFEYCY LSPNQT E
Sbjct: 320 HAIFDSGTSFTYLNDPAYKQITNSFNSEIKLQRHSTSSSNELPFEYCYELSPNQT-VELS 378
Query: 382 VVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREY 429
+NLTMKGG + V DPIV VS E G+ L CLGV+KS+NVNIIG+ +
Sbjct: 379 -INLTMKGGDNYLVTDPIVTVSGE--GINLLCLGVLKSNNVNIIGQNF 423
>gi|30680102|ref|NP_849967.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|17978947|gb|AAL47439.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
gi|22655368|gb|AAM98276.1| At2g17760/At2g17760 [Arabidopsis thaliana]
gi|330251585|gb|AEC06679.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 513
Score = 501 bits (1289), Expect = e-139, Method: Compositional matrix adjust.
Identities = 241/396 (60%), Positives = 308/396 (77%), Gaps = 7/396 (1%)
Query: 35 HHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGND 94
HHR+SD V G+L D LP + S YY +AHRDR +RGR LA + D++ +TFS GN+
Sbjct: 38 HHRFSDQVVGVLPGDGLPNRDSSKYYRVMAHRDRL--IRGRRLANE--DQSLVTFSDGNE 93
Query: 95 TYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNI 154
T R+++LGFLHY NV+VG P+ F+VALDTGSDLFWLPCDC +CV L + G +D NI
Sbjct: 94 TVRVDALGFLHYANVTVGTPSDWFMVALDTGSDLFWLPCDCTNCVRELKAPGGSSLDLNI 153
Query: 155 YSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
YSPN SSTS+KVPCNSTLC +C S S+CPYQ+RYLS+GT STG LVEDVLHL +++
Sbjct: 154 YSPNASSTSTKVPCNSTLCTRGDRCASPESDCPYQIRYLSNGTSSTGVLVEDVLHLVSND 213
Query: 215 KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG 274
K SK++ +R++FGCG+VQTG F DGAAPNGLFGLG++ SVPS+LA +G+ NSFSMCFG
Sbjct: 214 KSSKAIPARVTFGCGQVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSFSMCFG 273
Query: 275 SDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIFDSGTSFT 334
+DG GRISFGDKGS Q ETP ++RQ HPTYNIT+T++SVGGN + EF A+FDSGTSFT
Sbjct: 274 NDGAGRISFGDKGSVDQRETPLNIRQPHPTYNITVTKISVGGNTGDLEFDAVFDSGTSFT 333
Query: 335 YLNDPAYTQISETFNSLAKEKR-ETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPF 393
YL D AYT ISE+FNSLA +KR +T+ S+LPFEYCY LSPN+ +F+YP VNLTMKGG +
Sbjct: 334 YLTDAAYTLISESFNSLALDKRYQTTDSELPFEYCYALSPNKDSFQYPAVNLTMKGGSSY 393
Query: 394 FVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREY 429
V P+V++ K +YCL ++K ++++IIG+ +
Sbjct: 394 PVYHPLVVIPM--KDTDVYCLAIMKIEDISIIGQNF 427
>gi|297832400|ref|XP_002884082.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297329922|gb|EFH60341.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 513
Score = 498 bits (1281), Expect = e-138, Method: Compositional matrix adjust.
Identities = 240/396 (60%), Positives = 306/396 (77%), Gaps = 7/396 (1%)
Query: 35 HHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGND 94
HHR+SD V G+L D LP + S YY +AHRDR +RGR LA + D++ +TFS GN+
Sbjct: 38 HHRFSDQVVGVLPGDGLPNRDSSKYYRVMAHRDRL--IRGRRLANE--DQSLVTFSDGNE 93
Query: 95 TYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNI 154
T R+++LGFLHY NV+VG P+ F+VALDTGSDLFWLPCDC +CV L + G +D NI
Sbjct: 94 TIRVDALGFLHYANVTVGTPSDWFLVALDTGSDLFWLPCDCTNCVRELKAPGGSSLDLNI 153
Query: 155 YSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
YSPN SSTS+KVPCNSTLC +C S SNCPYQ+RYLS+GT STG LVEDVLHL +++
Sbjct: 154 YSPNASSTSTKVPCNSTLCTRGDRCASPESNCPYQIRYLSNGTSSTGVLVEDVLHLVSND 213
Query: 215 KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG 274
K SK++ +R++ GCG+VQTG F DGAAPNGLFGLG++ SVPS+LA +G+ NSFSMCFG
Sbjct: 214 KSSKAIPARVTLGCGQVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSFSMCFG 273
Query: 275 SDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIFDSGTSFT 334
+DG GRISFGDKGS Q ETP ++RQ HPTYNIT+T++SV GN + EF A+FDSGTSFT
Sbjct: 274 NDGAGRISFGDKGSVDQRETPLNIRQPHPTYNITVTKISVEGNTGDLEFDAVFDSGTSFT 333
Query: 335 YLNDPAYTQISETFNSLAKEKR-ETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPF 393
YL D AYT ISE+FNSLA +KR +T+ S+LPFEYCY LSPN+ +F+YP VNLTMKGG +
Sbjct: 334 YLTDAAYTLISESFNSLALDKRYQTTDSELPFEYCYALSPNKDSFQYPAVNLTMKGGSSY 393
Query: 394 FVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREY 429
V P+V++ K +YCL ++K ++++IIG+ +
Sbjct: 394 PVYHPLVVIPM--KDTDVYCLAILKIEDISIIGQNF 427
>gi|357483911|ref|XP_003612242.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355513577|gb|AES95200.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 527
Score = 497 bits (1279), Expect = e-138, Method: Compositional matrix adjust.
Identities = 250/412 (60%), Positives = 310/412 (75%), Gaps = 11/412 (2%)
Query: 24 CFGFGTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGN- 82
C+G +FGFD HHR+SDPVKGIL +D++P KGS YY A+AHRDR FR GR LA G+
Sbjct: 33 CYGSSSFGFDIHHRFSDPVKGILGIDNIPDKGSREYYVAMAHRDRVFR--GRRLADGGDV 90
Query: 83 DKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGL 142
D+ LTFS N TY+++ G+LH+ NVSVG PA S++VALDTGSDLFWLPC+C CVHG+
Sbjct: 91 DQKLLTFSPDNTTYQISLFGYLHFANVSVGTPASSYLVALDTGSDLFWLPCNCTKCVHGI 150
Query: 143 NSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQC-PSAGSNCPYQVRYLSDGTMSTG 201
S+GQ I FNIY SSTS V CNS+LCE + QC S+G CPYQV YLS+ T +TG
Sbjct: 151 QLSTGQKIAFNIYDNKESSTSKNVACNSSLCEQKTQCSSSSGGTCPYQVEYLSENTSTTG 210
Query: 202 FLVEDVLHLATD-EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILA 260
FLVEDVLHL TD + Q++ + I+FGCG+VQTG+FLDGAAPNGLFGLGM SVPSILA
Sbjct: 211 FLVEDVLHLITDNDDQTQHANPLITFGCGQVQTGAFLDGAAPNGLFGLGMSDVSVPSILA 270
Query: 261 NQGLIPNSFSMCFGSDGTGRISFGDKGSP-GQGETPFSLRQTHPTYNITITQVSVGGNAV 319
QGL NSFSMCF +DG GRI+FGD S QG+TPF++R +H TYNIT+TQ+ VGGN+
Sbjct: 271 KQGLTSNSFSMCFAADGLGRITFGDNNSSLDQGKTPFNIRPSHSTYNITVTQIIVGGNSA 330
Query: 320 NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRE--TSTSDLPFEYCYVLSPNQTN 377
+ EF+AIFD+GTSFTYLN+PAY QI+++F+S K +R +++ DLPFEYCY L NQT
Sbjct: 331 DLEFNAIFDTGTSFTYLNNPAYKQITQSFDSKIKLQRHSFSNSDDLPFEYCYDLRTNQT- 389
Query: 378 FEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREY 429
E P +NLTMKGG +FV DPI+ G + CL V+KS+NVNIIG+ +
Sbjct: 390 IEVPNINLTMKGGDNYFVMDPIITSGGGNNG--VLCLAVLKSNNVNIIGQNF 439
>gi|449456843|ref|XP_004146158.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 547
Score = 485 bits (1248), Expect = e-134, Method: Compositional matrix adjust.
Identities = 232/403 (57%), Positives = 299/403 (74%), Gaps = 6/403 (1%)
Query: 27 FGTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTP 86
FG+F F+ HH YS V+ IL P +G+ YY+A+ D + R G Q D P
Sbjct: 55 FGSFTFNIHHLYSPAVRQILPFHSFPDEGTLDYYAAMVRTDHFVHSRRLG---QVQDHRP 111
Query: 87 LTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSS 146
LTF +GN+T R++ LGFL+Y V+VG P + ++VALDTGSDLFWLPCDCV+C+ GLN++
Sbjct: 112 LTFLSGNETLRISPLGFLYYAEVTVGTPGVPYLVALDTGSDLFWLPCDCVNCITGLNTTQ 171
Query: 147 GQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVED 206
G V +FNIYSPN SSTS +V C+S+LC QC S CPYQV YLSD T STG+LVED
Sbjct: 172 GPV-NFNIYSPNNSSTSKEVQCSSSLCSHLDQCSSPSDTCPYQVSYLSDNTSSTGYLVED 230
Query: 207 VLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIP 266
+LHL T++ QSK V++RI+ GCG+ Q+G+FL AAPNGLFGLG++ SVPSILAN GLI
Sbjct: 231 ILHLTTNDVQSKPVNARITLGCGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLIS 290
Query: 267 NSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAI 326
NSFS+CFG GRI FGDKGSPGQ ETPF+L + HPTYN++ITQ+ VGG+ + + + I
Sbjct: 291 NSFSLCFGPARMGRIEFGDKGSPGQNETPFNLGRRHPTYNVSITQIGVGGHISDLDVAVI 350
Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLT 386
FDSGTSFTYLNDPAY+ ++ F S+ +EK+ T SD+PFE CY LSPNQT F YP++NLT
Sbjct: 351 FDSGTSFTYLNDPAYSLFADKFASMVEEKQFTMNSDIPFENCYELSPNQTTFTYPLMNLT 410
Query: 387 MKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREY 429
MKGGG F +N PIV++S+E K L+CL + +SD++NIIG+ +
Sbjct: 411 MKGGGHFVINHPIVLISTESK--RLFCLAIARSDSINIIGQNF 451
>gi|449495082|ref|XP_004159729.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
1-like [Cucumis sativus]
Length = 524
Score = 483 bits (1244), Expect = e-134, Method: Compositional matrix adjust.
Identities = 232/403 (57%), Positives = 299/403 (74%), Gaps = 6/403 (1%)
Query: 27 FGTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTP 86
FG+F F+ HH YS V+ IL P +G+ YY+A+ D + R G Q D P
Sbjct: 32 FGSFTFNIHHLYSPAVRQILPFHSFPDEGTLDYYAAMVRTDXFVHSRRLG---QVQDHRP 88
Query: 87 LTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSS 146
LTF +GN+T R++ LGFL+Y V+VG P + ++VALDTGSDLFWLPCDCV+C+ GLN++
Sbjct: 89 LTFLSGNETLRISPLGFLYYAEVTVGTPGVPYLVALDTGSDLFWLPCDCVNCITGLNTTQ 148
Query: 147 GQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVED 206
G V +FNIYSPN SSTS +V C+S+LC QC S CPYQV YLSD T STG+LVED
Sbjct: 149 GPV-NFNIYSPNNSSTSKEVQCSSSLCSHLDQCSSPSDTCPYQVSYLSDNTSSTGYLVED 207
Query: 207 VLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIP 266
+LHL T++ QSK V++RI+ GCG+ Q+G+FL AAPNGLFGLG++ SVPSILAN GLI
Sbjct: 208 ILHLTTNDVQSKPVNARITLGCGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLIS 267
Query: 267 NSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAI 326
NSFS+CFG GRI FGDKGSPGQ ETPF+L + HPTYN++ITQ+ VGG+ + + + I
Sbjct: 268 NSFSLCFGPARMGRIEFGDKGSPGQNETPFNLGRRHPTYNVSITQIGVGGHISDLDVAVI 327
Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLT 386
FDSGTSFTYLNDPAY+ ++ F S+ +EK+ T SD+PFE CY LSPNQT F YP++NLT
Sbjct: 328 FDSGTSFTYLNDPAYSLFADKFASMVEEKQFTMNSDIPFENCYELSPNQTTFTYPLMNLT 387
Query: 387 MKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREY 429
MKGGG F +N PIV++S+E K L+CL + +SD++NIIG+ +
Sbjct: 388 MKGGGHFVINHPIVLISTESK--RLFCLAIARSDSINIIGQNF 428
>gi|25347778|pir||B84556 hypothetical protein At2g17760 [imported] - Arabidopsis thaliana
Length = 473
Score = 452 bits (1164), Expect = e-124, Method: Compositional matrix adjust.
Identities = 226/393 (57%), Positives = 290/393 (73%), Gaps = 32/393 (8%)
Query: 63 LAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGF----------------LHY 106
+AHRDR +RGR LA + D++ +TFS GN+T R+++LGF LHY
Sbjct: 1 MAHRDRL--IRGRRLANE--DQSLVTFSDGNETVRVDALGFFKVNVFMETCELFMRDLHY 56
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
NV+VG P+ F+VALDTGSDLFWLPCDC +CV L + G +D NIYSPN SSTS+KV
Sbjct: 57 ANVTVGTPSDWFMVALDTGSDLFWLPCDCTNCVRELKAPGGSSLDLNIYSPNASSTSTKV 116
Query: 167 PCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISF 226
PCNSTLC +C S S+CPYQ+RYLS+GT STG LVEDVLHL +++K SK++ +R++F
Sbjct: 117 PCNSTLCTRGDRCASPESDCPYQIRYLSNGTSSTGVLVEDVLHLVSNDKSSKAIPARVTF 176
Query: 227 GCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDK 286
GCG+VQTG F DGAAPNGLFGLG++ SVPS+LA +G+ NSFSMCFG+DG GRISFGDK
Sbjct: 177 GCGQVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSFSMCFGNDGAGRISFGDK 236
Query: 287 GSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISE 346
GS Q ETP ++RQ HPTYNIT+T++SVGGN + EF A+FDSGTSFTYL D AYT ISE
Sbjct: 237 GSVDQRETPLNIRQPHPTYNITVTKISVGGNTGDLEFDAVFDSGTSFTYLTDAAYTLISE 296
Query: 347 TFNSLAKEKR-ETSTSDLPFEYCYVLS---------PNQTNFEYPVVNLTMKGGGPFFVN 396
+FNSLA +KR +T+ S+LPFEYCY L PN+ +F+YP VNLTMKGG + V
Sbjct: 297 SFNSLALDKRYQTTDSELPFEYCYALRLPLYSGHHHPNKDSFQYPAVNLTMKGGSSYPVY 356
Query: 397 DPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREY 429
P+V++ K +YCL ++K ++++IIG+ +
Sbjct: 357 HPLVVIPM--KDTDVYCLAIMKIEDISIIGQNF 387
>gi|224096686|ref|XP_002310698.1| predicted protein [Populus trichocarpa]
gi|222853601|gb|EEE91148.1| predicted protein [Populus trichocarpa]
Length = 441
Score = 441 bits (1133), Expect = e-121, Method: Compositional matrix adjust.
Identities = 220/418 (52%), Positives = 294/418 (70%), Gaps = 9/418 (2%)
Query: 13 LLILLSCCAGCCFGFGTFGFDFHHRYSDPVKGILAV-DDLPKKGSFAYYSALAHRDRYFR 71
LLI + + C G F F HHR+SD K + + P+KGSF YY+ALAHRD+
Sbjct: 10 LLITIWVFSKTCKG-RVFTFKMHHRFSDSFKNWSGLTRNWPEKGSFEYYAALAHRDQM-- 66
Query: 72 LRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWL 131
LRGR L+ + L FS GN T+R++SLGFLHYT V +G P + F+VALDTGSDLFW+
Sbjct: 67 LRGRRLS---DADASLAFSDGNSTFRISSLGFLHYTTVELGTPGVKFMVALDTGSDLFWV 123
Query: 132 PCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVR 191
PCDC C +S + +IY+P SSTS KV CN+ +C + +C S+CPY V
Sbjct: 124 PCDCSRCAPTHGASYASDFELSIYNPRESSTSKKVTCNNDMCAQRNRCLGTFSSCPYIVS 183
Query: 192 YLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMD 251
Y+S T ++G LV+DVLHL T++ + V++ ++FGCG+VQ+GSFLD AAPNGLFGLGM+
Sbjct: 184 YVSAQTSTSGILVKDVLHLTTEDGGREFVEAYVTFGCGQVQSGSFLDIAAPNGLFGLGME 243
Query: 252 KTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQ 311
K SVPS+L+ +GLI +SFSMCFG DG GRISFGDKGSP Q ETPF++ HPTYN+T+TQ
Sbjct: 244 KISVPSVLSREGLIADSFSMCFGHDGIGRISFGDKGSPDQEETPFNVNPAHPTYNVTVTQ 303
Query: 312 VSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVL 371
VG ++ EF+A+FDSGTSFTY+ DPAY+++SE F+SLA++KR +PFEYCY +
Sbjct: 304 ARVGTMLIDVEFTALFDSGTSFTYMVDPAYSRVSEKFHSLARDKRRPPDPRIPFEYCYDM 363
Query: 372 SPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREY 429
SP+ P ++LTMKGG F V DPI+++S++ + +YCL VVKS +NIIG+ +
Sbjct: 364 SPDANASLVPSMSLTMKGGRHFTVYDPIIVISTQNE--IVYCLAVVKSTELNIIGQNF 419
>gi|356502091|ref|XP_003519855.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 519
Score = 439 bits (1128), Expect = e-120, Method: Compositional matrix adjust.
Identities = 220/425 (51%), Positives = 290/425 (68%), Gaps = 15/425 (3%)
Query: 10 VCVLLILLSCCAGCCFGFGTFGFDFHHRYSDPVK-----GILAVDDLPKKGSFAYYSALA 64
+ LL L CC C G + F HHR+S+PV+ + P++G+ YY+ LA
Sbjct: 8 IVSLLSLWECCQ--CHGH-VYTFTMHHRHSEPVRKWSHSAAAGIPAPPEEGTVEYYAELA 64
Query: 65 HRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDT 124
RDR LRGR L+ L FS GN T+R++SLGFLHYT V +G P + F+VALDT
Sbjct: 65 DRDRL--LRGRKLS---QIDAGLAFSDGNSTFRISSLGFLHYTTVQIGTPGVKFMVALDT 119
Query: 125 GSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGS 184
GSDLFW+PCDC C +++ D N+Y+PN SSTS KV CN++LC + QC S
Sbjct: 120 GSDLFWVPCDCTRCAASDSTAFASDFDLNVYNPNGSSTSKKVTCNNSLCTHRSQCLGTFS 179
Query: 185 NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNG 244
NCPY V Y+S T ++G LVEDVLHL ++ V++ + FGCG++Q+GSFLD AAPNG
Sbjct: 180 NCPYMVSYVSAETSTSGILVEDVLHLTQEDNHHDLVEANVIFGCGQIQSGSFLDVAAPNG 239
Query: 245 LFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPT 304
LFGLGM+K SVPS+L+ +G +SFSMCFG DG GRISFGDKGS Q ETPF+L +HPT
Sbjct: 240 LFGLGMEKISVPSMLSREGFTADSFSMCFGRDGIGRISFGDKGSFDQDETPFNLNPSHPT 299
Query: 305 YNITITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP 364
YNIT+TQV VG ++ EF+A+FDSGTSFTYL DP YT+++E+F+S +++R S S +P
Sbjct: 300 YNITVTQVRVGTTVIDVEFTALFDSGTSFTYLVDPTYTRLTESFHSQVQDRRHRSDSRIP 359
Query: 365 FEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNI 424
FEYCY +SP+ P V+LTM GG F V DPI+I+S++ + +YCL VVKS +NI
Sbjct: 360 FEYCYDMSPDANTSLIPSVSLTMGGGSHFAVYDPIIIISTQSE--LVYCLAVVKSAELNI 417
Query: 425 IGREY 429
IG+ +
Sbjct: 418 IGQNF 422
>gi|225431324|ref|XP_002269880.1| PREDICTED: aspartic proteinase-like protein 1 [Vitis vinifera]
gi|297739017|emb|CBI28369.3| unnamed protein product [Vitis vinifera]
Length = 518
Score = 437 bits (1124), Expect = e-120, Method: Compositional matrix adjust.
Identities = 221/424 (52%), Positives = 291/424 (68%), Gaps = 14/424 (3%)
Query: 13 LLILLSCCAGCCFGFGTFGFDFHHRYSDPVKGI-------LAVDDLPKKGSFAYYSALAH 65
++ILLS F F HHR+S+PVK + P KGSF YY+ LAH
Sbjct: 9 IVILLSILGFRSCHARIFSFQMHHRFSEPVKKWSEGAGNGFPAGNWPAKGSFEYYAELAH 68
Query: 66 RDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTG 125
RDR LRGR L+ + LTFS GN T+R++SLGFLHYT VS+G P F+VALDTG
Sbjct: 69 RDR--ALRGRRLS---DIDGLLTFSDGNSTFRISSLGFLHYTTVSLGTPGKKFLVALDTG 123
Query: 126 SDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSN 185
SDLFW+PCDC C ++ + +IY+P SSTS KV C+++LC + +C SN
Sbjct: 124 SDLFWVPCDCSRCAPTEGTTYASDFELSIYNPKGSSTSRKVTCDNSLCAHRNRCLGTFSN 183
Query: 186 CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGL 245
CPY V Y+S T ++G LVEDVLHL T++ + + V++ ++FGCG+VQTGSFLD AAPNGL
Sbjct: 184 CPYMVSYVSAETSTSGILVEDVLHLTTEDNRQEFVEAYVTFGCGQVQTGSFLDIAAPNGL 243
Query: 246 FGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTY 305
FGLG++K SVPSIL+ +G +SFSMCFG DG GRISFGDKGSP Q ETPF+L HPTY
Sbjct: 244 FGLGLEKISVPSILSKEGFTADSFSMCFGPDGIGRISFGDKGSPDQEETPFNLNALHPTY 303
Query: 306 NITITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPF 365
NIT+TQV VG ++ +F+A+FDSGTSFTYL DP YT + ++F+S A++ R S +PF
Sbjct: 304 NITVTQVRVGTTLIDLDFTALFDSGTSFTYLVDPIYTNVLKSFHSQAQDSRRPPDSRIPF 363
Query: 366 EYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNII 425
E+CY +SP + P ++LTMKGG F V DPI+I+SS+ + +YC+ VV+S +NII
Sbjct: 364 EFCYDMSPGENTSLIPSMSLTMKGGSQFPVYDPIIIISSQSE--LIYCMAVVRSAELNII 421
Query: 426 GREY 429
G+ +
Sbjct: 422 GQNF 425
>gi|356559244|ref|XP_003547910.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 515
Score = 437 bits (1123), Expect = e-120, Method: Compositional matrix adjust.
Identities = 223/425 (52%), Positives = 293/425 (68%), Gaps = 19/425 (4%)
Query: 13 LLILLSCCAGCCFGFGTFGFDFHHRYSDPVK--------GILAVDDLPKKGSFAYYSALA 64
+ I+ S C G + F HHR+S+PV+ GI A P+KG+ YY+ LA
Sbjct: 5 VFIIASLFLSLCHGH-VYTFTMHHRHSEPVRKWSHSTASGIPAP---PEKGTVEYYAELA 60
Query: 65 HRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDT 124
RDR LRGR L+ Q +D L FS GN T+R++SLGFLHYT V +G P + F+VALDT
Sbjct: 61 DRDRL--LRGRKLS-QIDDG--LAFSDGNSTFRISSLGFLHYTTVQIGTPGVKFMVALDT 115
Query: 125 GSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGS 184
GSDLFW+PCDC C +S+ D N+Y+PN SSTS KV CN++LC + QC S
Sbjct: 116 GSDLFWVPCDCTRCAATDSSAFASDFDLNVYNPNGSSTSKKVTCNNSLCMHRSQCLGTLS 175
Query: 185 NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNG 244
NCPY V Y+S T ++G LVEDVLHL ++ V++ + FGCG++Q+GSFLD AAPNG
Sbjct: 176 NCPYMVSYVSAETSTSGILVEDVLHLTQEDNHHDLVEANVIFGCGQIQSGSFLDVAAPNG 235
Query: 245 LFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPT 304
LFGLGM+K SVPS+L+ +G +SFSMCFG DG GRISFGDKGS Q ETPF+L +HPT
Sbjct: 236 LFGLGMEKISVPSMLSREGFTADSFSMCFGRDGIGRISFGDKGSFDQDETPFNLNPSHPT 295
Query: 305 YNITITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP 364
YNIT+TQV VG ++ EF+A+FDSGTSFTYL DP YT+++E+F+S +++R S S +P
Sbjct: 296 YNITVTQVRVGTTLIDVEFTALFDSGTSFTYLVDPTYTRLTESFHSQVQDRRHRSDSRIP 355
Query: 365 FEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNI 424
FEYCY +SP+ P V+LTM GG F V DPI+I+S++ + +YCL VVK+ +NI
Sbjct: 356 FEYCYDMSPDANTSLIPSVSLTMGGGSHFAVYDPIIIISTQSE--LVYCLAVVKTAELNI 413
Query: 425 IGREY 429
IG+ +
Sbjct: 414 IGQNF 418
>gi|42567433|ref|NP_195313.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|190576481|gb|ACE79041.1| At4g35880 [Arabidopsis thaliana]
gi|222423134|dbj|BAH19546.1| AT4G35880 [Arabidopsis thaliana]
gi|332661184|gb|AEE86584.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 524
Score = 434 bits (1115), Expect = e-119, Method: Compositional matrix adjust.
Identities = 220/436 (50%), Positives = 300/436 (68%), Gaps = 14/436 (3%)
Query: 1 MASSYRNSPVCVLLILLSCCAGCCFGFGTFGFDFHHRYSDPVK------GILAVDDLPKK 54
M+ + + + ++ IL+ G C G F F+ HHR+SD VK G A P K
Sbjct: 1 MSCCFFKTTLFLIPILMLLSFGSCNG-RIFTFEMHHRFSDEVKQWSDSTGRFA--KFPPK 57
Query: 55 GSFAYYSALAHRDRYFRLRGRGLAAQGNDKTP-LTFSAGNDTYRLNSLGFLHYTNVSVGQ 113
GSF Y++AL RD + +RGR L+ ++ LTFS GN T R++SLGFLHYT V +G
Sbjct: 58 GSFEYFNALVLRD--WLIRGRRLSESESESESSLTFSDGNSTSRISSLGFLHYTTVKLGT 115
Query: 114 PALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC 173
P + F+VALDTGSDLFW+PCDC C ++ + +IY+P S+T+ KV CN++LC
Sbjct: 116 PGMRFMVALDTGSDLFWVPCDCGKCAPTEGATYASEFELSIYNPKVSTTNKKVTCNNSLC 175
Query: 174 ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQT 233
+ QC S CPY V Y+S T ++G L+EDV+HL T++K + V++ ++FGCG+VQ+
Sbjct: 176 AQRNQCLGTFSTCPYMVSYVSAQTSTSGILMEDVMHLTTEDKNPERVEAYVTFGCGQVQS 235
Query: 234 GSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGE 293
GSFLD AAPNGLFGLGM+K SVPS+LA +GL+ +SFSMCFG DG GRISFGDKGS Q E
Sbjct: 236 GSFLDIAAPNGLFGLGMEKISVPSVLAREGLVADSFSMCFGHDGVGRISFGDKGSSDQEE 295
Query: 294 TPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAK 353
TPF+L +HP YNIT+T+V VG ++ EF+A+FD+GTSFTYL DP YT +SE+F+S A+
Sbjct: 296 TPFNLNPSHPNYNITVTRVRVGTTLIDDEFTALFDTGTSFTYLVDPMYTTVSESFHSQAQ 355
Query: 354 EKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYC 413
+KR + S +PFEYCY +S + P ++LTMKG F +NDPI+++S+E G +YC
Sbjct: 356 DKRHSPDSRIPFEYCYDMSNDANASLIPSLSLTMKGNSHFTINDPIIVISTE--GELVYC 413
Query: 414 LGVVKSDNVNIIGREY 429
L +VKS +NIIG+ Y
Sbjct: 414 LAIVKSSELNIIGQNY 429
>gi|255586860|ref|XP_002534040.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223525947|gb|EEF28344.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 518
Score = 429 bits (1103), Expect = e-117, Method: Compositional matrix adjust.
Identities = 210/403 (52%), Positives = 281/403 (69%), Gaps = 10/403 (2%)
Query: 30 FGFDFHHRYSDPVKGI---LAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTP 86
F F HHR+SD +K + + P KGSF YY+ LAHRD+ LRGR L N + P
Sbjct: 28 FTFKMHHRFSDMLKDLSDSTTSRNFPSKGSFEYYAELAHRDQM--LRGRKLY---NVEAP 82
Query: 87 LTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSS 146
L FS GN T+R++SLGFLHYT V +G P + F+VALDTGSDLFW+PCDC C +
Sbjct: 83 LAFSDGNSTFRISSLGFLHYTTVELGTPGMKFMVALDTGSDLFWVPCDCSKCAPTQGVAY 142
Query: 147 GQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVED 206
+ +IY P SSTS KV CN+ LC + +C S+CPY V Y+S T ++G LVED
Sbjct: 143 ASDFELSIYDPKQSSTSKKVTCNNNLCAHRNRCLGTFSSCPYMVSYVSAQTSTSGILVED 202
Query: 207 VLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIP 266
VLHL +++ +S+ + ++FGCG+VQ+GSFL+ AAPNGLFGLGMD+ SVPSIL+ +GL
Sbjct: 203 VLHLTSEDSNQESIKAYVTFGCGQVQSGSFLNTAAPNGLFGLGMDQISVPSILSREGLTA 262
Query: 267 NSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAI 326
+SFSMCFG DG GRISFGDKGSP Q ETPF+ +HP+YNI++TQV VG V+ +F+A+
Sbjct: 263 DSFSMCFGHDGVGRISFGDKGSPDQEETPFNSNPSHPSYNISVTQVRVGTTLVDVDFTAL 322
Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLT 386
FDSGTSFTYL +P Y +SE F++ A++KR +PFEYCY +SP + P ++LT
Sbjct: 323 FDSGTSFTYLINPIYAMVSENFHAQAQDKRRPPDPRIPFEYCYDMSPGANSSLIPSMSLT 382
Query: 387 MKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREY 429
MKG G F V DPI++++++ + +YCL +VKS +NIIG+ +
Sbjct: 383 MKGRGHFTVFDPIIVITTQNE--LVYCLAIVKSTELNIIGQNF 423
>gi|297802338|ref|XP_002869053.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297314889|gb|EFH45312.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 522
Score = 426 bits (1096), Expect = e-117, Method: Compositional matrix adjust.
Identities = 215/412 (52%), Positives = 289/412 (70%), Gaps = 10/412 (2%)
Query: 22 GCCFGFGTFGFDFHHRYSDPVKGILAVD----DLPKKGSFAYYSALAHRDRYFRLRGRGL 77
G C G F F+ HHR+SD VK P KGSF Y++AL RD + +RGR L
Sbjct: 22 GSCNG-RIFTFEMHHRFSDEVKQWSDSTGRFVKFPPKGSFEYFNALVLRD--WLIRGRRL 78
Query: 78 AAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVS 137
+ ++ + LTFS GN T R++SLGFLHYT V +G P + F+VALDTGSDLFW+PCDC
Sbjct: 79 SDSESESS-LTFSDGNSTSRISSLGFLHYTTVKLGTPGMRFMVALDTGSDLFWVPCDCGK 137
Query: 138 CVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGT 197
C ++ + +IY+P S+T+ KV CN++LC + QC S CPY V Y+S T
Sbjct: 138 CAPTEGATYASEFELSIYNPKISTTNKKVTCNNSLCAQRNQCLGTFSTCPYMVSYVSAQT 197
Query: 198 MSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPS 257
++G L+EDV+HL T++K + V++ ++FGCG+VQ+GSFLD AAPNGLFGLGM+K SVPS
Sbjct: 198 STSGILMEDVMHLTTEDKNPERVEAYVTFGCGQVQSGSFLDIAAPNGLFGLGMEKISVPS 257
Query: 258 ILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGN 317
+LA +GL+ +SFSMCFG DG GRISFGDKGS Q ETPF+L +HP YNIT+T+V VG
Sbjct: 258 VLAREGLVADSFSMCFGHDGVGRISFGDKGSSDQEETPFNLNPSHPNYNITVTRVRVGTT 317
Query: 318 AVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTN 377
++ EF+A+FD+GTSFTYL DP YT +SE+F+S A++KR + S +PFEYCY +S +
Sbjct: 318 LIDDEFTALFDTGTSFTYLVDPMYTTVSESFHSQAQDKRHSPDSRIPFEYCYDMSNDANA 377
Query: 378 FEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREY 429
P ++LTMKG F +NDPI+++S+E G +YCL +VKS +NIIG+ Y
Sbjct: 378 SLIPSLSLTMKGNSHFTINDPIIVISTE--GELVYCLAIVKSSELNIIGQNY 427
>gi|224133616|ref|XP_002327639.1| predicted protein [Populus trichocarpa]
gi|222836724|gb|EEE75117.1| predicted protein [Populus trichocarpa]
Length = 484
Score = 421 bits (1081), Expect = e-115, Method: Compositional matrix adjust.
Identities = 231/411 (56%), Positives = 278/411 (67%), Gaps = 43/411 (10%)
Query: 63 LAHRDRYFRLRGRGLAA-----QGNDKTPLTFSAGNDTYRLNSLGF-------------- 103
+A RDR + GR LA N+KT LTF GN+TYR++ LG
Sbjct: 1 MAQRDRV--IHGRRLATSTGGDNKNNKTLLTFYYGNETYRIDGLGLRNSCVSLYSNGLFG 58
Query: 104 --LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSS 161
LHY NVSVG P++SF+VALDTGS+L WLPCDC SCVH L S SG V D NIYSPNTSS
Sbjct: 59 YILHYANVSVGTPSVSFLVALDTGSNLLWLPCDCSSCVHSLRSPSGTV-DLNIYSPNTSS 117
Query: 162 TSSKVPCNSTLCELQKQ--CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
TS KVPCNSTLC ++ CPS SNCPYQV YLS+GT +TG++V+D+LHL +D+ QSK+
Sbjct: 118 TSEKVPCNSTLCSQTQRDRCPSDQSNCPYQVVYLSNGTSTTGYIVQDLLHLISDDSQSKA 177
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTG 279
VD++I+FGCG+VQTGSFL G APNGLFGLGM SVPS LA+ G SFSMCF +G G
Sbjct: 178 VDAKITFGCGKVQTGSFLTGGAPNGLFGLGMSNISVPSTLAHNGYTSGSFSMCFSPNGIG 237
Query: 280 RISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSAIFDSGTSFTYLND 338
RISFGDKGS GQGET F+ Q + YNI+ITQ S+GG A + +SAIFDSGTSFTYLND
Sbjct: 238 RISFGDKGSTGQGETSFNQGQPRSSLYNISITQTSIGGQASDLVYSAIFDSGTSFTYLND 297
Query: 339 PAYTQISETFNSLAKEKRETSTSDLPFEYCYVLS--------------PNQTNFEYPVVN 384
PAYT I+E+FN L KE R +ST +PF+YCY + NQT P V
Sbjct: 298 PAYTLIAESFNKLVKETRRSST-QVPFDYCYDIRSFISAQILPFSCAYANQTEPTIPAVT 356
Query: 385 LTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREYPIANNI 435
L M GG F V DPIV+V G +YCLG++KS +VNIIG+ + + I
Sbjct: 357 LVMSGGDYFNVTDPIVLVQLA-DGSAVYCLGMIKSGDVNIIGQNFMTGHRI 406
>gi|449434466|ref|XP_004135017.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 525
Score = 410 bits (1053), Expect = e-112, Method: Compositional matrix adjust.
Identities = 207/414 (50%), Positives = 275/414 (66%), Gaps = 21/414 (5%)
Query: 30 FGFDFHHRYSDPVKGILAV-------DDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGN 82
F F HHR+SD +K V D P KG+ YY+ LA RDR+FR G+ L+
Sbjct: 28 FSFKMHHRFSDQLKNWSGVSGKFTLPDSWPVKGTIEYYAQLAFRDRFFR--GQRLSEFDG 85
Query: 83 DKTPLTFSAGNDTYRLNSLGFLH-------YTNVSVGQPALSFIVALDTGSDLFWLPCDC 135
PL FS GN ++R++SLGF YT V +G P F+VALDTGSDLFW+PCDC
Sbjct: 86 ---PLAFSDGNSSFRISSLGFALFDVFFFFYTTVQLGTPGTKFMVALDTGSDLFWVPCDC 142
Query: 136 VSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSD 195
C S + ++YSP SSTS VPCN+ LC + QC A NCPY V Y+S
Sbjct: 143 SRCAPTEGSPYASDFELSVYSPKKSSTSKTVPCNNNLCAQRDQCTEAFGNCPYVVSYVSA 202
Query: 196 GTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSV 255
T +TG L+ED+LHL T+ K S+ + + I+FGCG+VQ+GSFLD AAPNGLFGLGM++ SV
Sbjct: 203 ETSTTGILIEDLLHLKTEHKHSEPIQAYITFGCGQVQSGSFLDVAAPNGLFGLGMEQISV 262
Query: 256 PSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVG 315
PSIL+ +GL+ NSFSMCF DG GRI+FGDKGS Q ETPF+L Q HP YNIT+T + VG
Sbjct: 263 PSILSREGLMANSFSMCFSDDGVGRINFGDKGSLEQEETPFNLNQLHPNYNITVTSIRVG 322
Query: 316 GNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQ 375
++ + +A+FDSGTSF+Y DP Y+++S +F++ ++ R +PFEYCY +SP+
Sbjct: 323 TTLIDADITALFDSGTSFSYFTDPIYSKLSASFHAQTRDGRHPPNPRIPFEYCYNMSPDA 382
Query: 376 TNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREY 429
P ++LTMKGGGPF V DPI+++S++ + +YCL VVKS +NIIG+ +
Sbjct: 383 NASLTPGISLTMKGGGPFPVYDPIIVISTQNE--LIYCLAVVKSAELNIIGQNF 434
>gi|449434468|ref|XP_004135018.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 568
Score = 406 bits (1043), Expect = e-110, Method: Compositional matrix adjust.
Identities = 217/448 (48%), Positives = 289/448 (64%), Gaps = 17/448 (3%)
Query: 1 MASSYRNSPVCVLLILLSCCAGCCFG--FGTFGFDFHHRYSDPVKGILAVDDLPKKGSFA 58
MAS++ + +L++ + AG +F FD HHR+SD +KGI + LP+K +
Sbjct: 1 MASTFSSGAQMLLVLSVFILAGSLRSGDAASFKFDIHHRFSDSIKGIFHSEGLPEKHTPG 60
Query: 59 YYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSF 118
YY+ + HRDR +RGR LAA D T LTF+ GNDT + LGFL+Y NVSVG P+L F
Sbjct: 61 YYATMVHRDRL--VRGRRLAASDVD-TQLTFAYGNDTAFIPDLGFLYYANVSVGTPSLDF 117
Query: 119 IVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQ 178
+VALDTGSDLFWLPC+C SC LN+S+G N YSPN S+TSS VPC S+LC +
Sbjct: 118 LVALDTGSDLFWLPCECSSCFTYLNTSNGGKFMLNHYSPNDSTTSSTVPCTSSLC---NR 174
Query: 179 CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLD 238
C S + CPY++RYLS T S G+LVEDVLHLATD+ K V+++I+FGCG VQTG F
Sbjct: 175 CTSNQNVCPYEMRYLSANTSSIGYLVEDVLHLATDDSLLKPVEAKITFGCGTVQTGIFAT 234
Query: 239 GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSL 298
AAPNGL GLGM+K SVPS LA+QGL NSFSMCFG+DG GRI FGD G Q +TPF+
Sbjct: 235 TAAPNGLIGLGMEKISVPSFLADQGLTSNSFSMCFGADGYGRIDFGDTGPADQKQTPFNT 294
Query: 299 RQTHPTYNITITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRET 358
+ +YN+T ++VGG + F+AIFDSGTSFTYL +PAY+ I++ ++ K KR +
Sbjct: 295 MLEYQSYNVTFNVINVGGEPNDVPFTAIFDSGTSFTYLTEPAYSTITKQMDAGMKLKRYS 354
Query: 359 STS-DLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGL-------- 409
+ PFEYCY + P F+Y +N TMKGG F D V + + +
Sbjct: 355 LFGPNFPFEYCYEIPPGAKEFQYLTLNFTMKGGDEFTPTDIFVFLPVDVSTMNIIFEETT 414
Query: 410 YLYCLGVVKSDNVNIIGREYPIANNISL 437
++ CL + KS ++++IG+ + I+
Sbjct: 415 HVACLAIAKSTDIDLIGQNFMTGYRITF 442
>gi|357517921|ref|XP_003629249.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355523271|gb|AET03725.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 553
Score = 405 bits (1042), Expect = e-110, Method: Compositional matrix adjust.
Identities = 215/441 (48%), Positives = 279/441 (63%), Gaps = 42/441 (9%)
Query: 24 CFGFGTFGFDFHHRYSDPVKGIL-----AVDDLPKKGSFAYYSALAHRDRYFRLRGRGLA 78
C F F HHRYS+PVK P+KGS YY+ LA RDR+ LRGR L+
Sbjct: 20 CCNAHIFTFTMHHRYSEPVKKWSHSAPSPSHRWPEKGSVEYYAELADRDRF--LRGRRLS 77
Query: 79 AQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSC 138
L FS GN T+R++SLGFLHYT + +G P + F+VALDTGSDLFW+PCDC C
Sbjct: 78 ---QFDAGLAFSDGNSTFRISSLGFLHYTTIELGTPGVKFMVALDTGSDLFWVPCDCTRC 134
Query: 139 ----VHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLS 194
S+ D ++Y+PN SSTS KV CN++LC + QC SNCPY V Y+S
Sbjct: 135 SATRSSAFASALASDFDLSVYNPNGSSTSKKVTCNNSLCTHRNQCLGTFSNCPYMVSYVS 194
Query: 195 DGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTS 254
T ++G LVEDVLHL + V++ + FGCG+VQ+GSFLD AAPNGLFGLGM+K S
Sbjct: 195 AETSTSGILVEDVLHLTQPDDNHDLVEANVIFGCGQVQSGSFLDVAAPNGLFGLGMEKIS 254
Query: 255 VPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSV 314
VPS+L+ +G +SFSMCFG DG GRISFGDKGS Q ETPF++ +HPTYNITI QV V
Sbjct: 255 VPSMLSREGFTADSFSMCFGRDGIGRISFGDKGSLDQDETPFNVNPSHPTYNITINQVRV 314
Query: 315 GGNAVNFEFSAIFDSGTSFTYLNDPAYTQISET--------------------------F 348
G ++ EF+A+FDSGTSFTYL DP Y+++SE+ F
Sbjct: 315 GTTLIDVEFTALFDSGTSFTYLVDPTYSRLSESVSDKICFHLARCYLKIKVTIEVFMLQF 374
Query: 349 NSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKG 408
+S +++R S +PF+YCY +SP+ P ++LTM GG F V DPI+I+S++ +
Sbjct: 375 HSQVEDRRRPPDSRIPFDYCYDMSPDSNTSLIPSMSLTMGGGSRFVVYDPIIIISTQSE- 433
Query: 409 LYLYCLGVVKSDNVNIIGREY 429
+YCL VVKS +NIIG+ +
Sbjct: 434 -LVYCLAVVKSAELNIIGQNF 453
>gi|357159746|ref|XP_003578546.1| PREDICTED: aspartic proteinase-like protein 1-like [Brachypodium
distachyon]
Length = 530
Score = 399 bits (1025), Expect = e-108, Method: Compositional matrix adjust.
Identities = 205/399 (51%), Positives = 269/399 (67%), Gaps = 5/399 (1%)
Query: 32 FDFHHRYSDPVKGILA-VDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFS 90
D HHRYS V+G+ + P G+ YY+ALA D R R AA G L F+
Sbjct: 27 LDVHHRYSAAVRGLAGHLRAPPPAGTAEYYAALAGHD--LRRRSLAAAAGGGGAGNLAFA 84
Query: 91 AGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVI 150
GNDTYRLN GFLHY V++G P ++F+VALDTGSDLFW+PCDC+ C + G +
Sbjct: 85 DGNDTYRLNDFGFLHYAVVALGTPNVTFLVALDTGSDLFWVPCDCIKCAPLASPDYGD-L 143
Query: 151 DFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL 210
F++YSP SSTS KVPC+S+LC+ Q C +A ++CPY ++YLS+ T S G LVEDVL+L
Sbjct: 144 KFDMYSPRKSSTSRKVPCSSSLCDPQADCSAASNSCPYSIQYLSENTSSKGVLVEDVLYL 203
Query: 211 ATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFS 270
T+ QSK + I+FGCG+VQ+GSFL AAPNGL GLGMD SVPS+LA++G+ NSFS
Sbjct: 204 TTESGQSKITQAPITFGCGQVQSGSFLGSAAPNGLLGLGMDSKSVPSLLASKGIAANSFS 263
Query: 271 MCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIFDSG 330
MCFG DG GRI+FGD GS Q ETP ++ + +P YNI+IT VGG + + +FSA+ DSG
Sbjct: 264 MCFGEDGHGRINFGDTGSSDQLETPLNIYKQNPYYNISITGAMVGGKSFDTKFSAVVDSG 323
Query: 331 TSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGG 390
TSFT L+DP YT+I+ TFN+ KE R+ + +PFEYCY +S Q P ++LT KGG
Sbjct: 324 TSFTALSDPMYTEITSTFNAQVKESRKHLDASMPFEYCYSIS-AQGAVNPPNISLTAKGG 382
Query: 391 GPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREY 429
F VN PI+ ++ YCL ++KS+ VN+IG +
Sbjct: 383 SIFPVNGPIITITDTSSRPIAYCLAIMKSEGVNLIGENF 421
>gi|326505434|dbj|BAJ95388.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 529
Score = 398 bits (1022), Expect = e-108, Method: Compositional matrix adjust.
Identities = 207/405 (51%), Positives = 267/405 (65%), Gaps = 15/405 (3%)
Query: 32 FDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSA 91
D HHRYS V+G + P G+ YY+ALA D LR R L+
Sbjct: 34 LDVHHRYSATVRGWAGLRRGPSPGTAEYYAALAGHDD---LRRRSLSLAAAPAPGAGGPF 90
Query: 92 ----GNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSG 147
GNDTYRLN GFLHY V++G P ++F+VALDTGSDLFW+PCDC+ C L+S
Sbjct: 91 AFVDGNDTYRLNQFGFLHYAVVALGTPNVTFLVALDTGSDLFWVPCDCLKCAP-LSSPDY 149
Query: 148 QVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDV 207
+ F++YSP SSTS KVPC+S +C+LQ +C +A ++CPY++ YLSD T S G LVEDV
Sbjct: 150 GNLKFDVYSPRKSSTSRKVPCSSNMCDLQTECSAASNSCPYKIEYLSDNTSSKGVLVEDV 209
Query: 208 LHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPN 267
++LAT+ SK + I+FGCG+VQTGSFL AAPNGL GLGMD SVPS+LA+QG+ N
Sbjct: 210 MYLATESGHSKITQAPITFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASQGVAAN 269
Query: 268 SFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIF 327
SFSMCFG DG GRI+FGD GS Q ETP ++ + +P YNI+I GG + +FSA+
Sbjct: 270 SFSMCFGEDGHGRINFGDTGSADQLETPLNIYKHNPYYNISIVGAMAGGKTFSTKFSAVV 329
Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTM 387
DSGTSFT L+DP YT+I+ F+ KEKR + S LPFEYCY +S ++ P ++LT
Sbjct: 330 DSGTSFTALSDPMYTEITSAFDKQVKEKRNPADSSLPFEYCYTIS-SKGAVSPPNISLTA 388
Query: 388 KGGGPFFVNDPIVI---VSSEPKGLYLYCLGVVKSDNVNIIGREY 429
KGG F V DPI+ +SS P G YCL ++KS+ VN+IG +
Sbjct: 389 KGGSVFPVKDPIITITDISSSPVG---YCLAIMKSEGVNLIGENF 430
>gi|218202547|gb|EEC84974.1| hypothetical protein OsI_32231 [Oryza sativa Indica Group]
Length = 513
Score = 397 bits (1020), Expect = e-108, Method: Compositional matrix adjust.
Identities = 209/399 (52%), Positives = 267/399 (66%), Gaps = 8/399 (2%)
Query: 32 FDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDK-TPLTFS 90
D HHRYS A P G+ YY+ALA D LR R L G F+
Sbjct: 29 LDVHHRYSA-AVRRWAAAAAPPHGTAEYYAALAGHDG---LRRRSLGVGGGGGGAEFAFA 84
Query: 91 AGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVI 150
GNDTYRLN GFLHY V++G P ++F+VALDTGSDLFW+PCDC+ C L S + +
Sbjct: 85 DGNDTYRLNDFGFLHYAVVALGTPNVTFLVALDTGSDLFWVPCDCLKCAP-LQSPNYGSL 143
Query: 151 DFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL 210
F++YSP S+TS KVPC+S LC+LQ C S ++CPY ++YLSD T S+G LVEDVL+L
Sbjct: 144 KFDVYSPAQSTTSRKVPCSSNLCDLQNACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYL 203
Query: 211 ATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFS 270
+D QSK V + I FGCG+VQTGSFL AAPNGL GLGMD SVPS+LA++GL NSFS
Sbjct: 204 TSDSAQSKIVTAPIMFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFS 263
Query: 271 MCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIFDSG 330
MCFG DG GRI+FGD GS Q ETP ++ + +P YNITIT ++VG +++ EFSAI DSG
Sbjct: 264 MCFGDDGHGRINFGDTGSSDQKETPLNVYKQNPYYNITITGITVGSKSISTEFSAIVDSG 323
Query: 331 TSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGG 390
TSFT L+DP YTQI+ +F++ + R S +PFE+CY +S N +P V+LT KGG
Sbjct: 324 TSFTALSDPMYTQITSSFDAQIRSSRNMLDSSMPFEFCYSVSAN--GIVHPNVSLTAKGG 381
Query: 391 GPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREY 429
F VNDPI+ ++ YCL ++KS+ VN+IG +
Sbjct: 382 SIFPVNDPIITITDNAFNPVGYCLAIMKSEGVNLIGENF 420
>gi|32526671|dbj|BAC79194.1| chloroplast nucleoid DNA-binding protein -like protein [Oryza
sativa Japonica Group]
Length = 732
Score = 396 bits (1017), Expect = e-107, Method: Compositional matrix adjust.
Identities = 208/399 (52%), Positives = 267/399 (66%), Gaps = 8/399 (2%)
Query: 32 FDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDK-TPLTFS 90
D HHRYS A P G+ YY+ALA D LR R L G F+
Sbjct: 29 LDVHHRYSA-AVRRWAAAAAPPHGTAEYYAALAGHDG---LRRRSLGVGGGGGGAEFAFA 84
Query: 91 AGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVI 150
GNDTYRLN GFLHY V++G P ++F+VALDTGSDLFW+PCDC+ C + + G +
Sbjct: 85 DGNDTYRLNDFGFLHYAVVALGTPNVTFLVALDTGSDLFWVPCDCLKCAPFQSPNYGS-L 143
Query: 151 DFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL 210
F++YSP S+TS KVPC+S LC+LQ C S ++CPY ++YLSD T S+G LVEDVL+L
Sbjct: 144 KFDVYSPAQSTTSRKVPCSSNLCDLQNACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYL 203
Query: 211 ATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFS 270
+D QSK V + I FGCG+VQTGSFL AAPNGL GLGMD SVPS+LA++GL NSFS
Sbjct: 204 TSDSAQSKIVTAPIMFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFS 263
Query: 271 MCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIFDSG 330
MCFG DG GRI+FGD GS Q ETP ++ + +P YNITIT ++VG +++ EFSAI DSG
Sbjct: 264 MCFGDDGHGRINFGDTGSSDQKETPLNVYKQNPYYNITITGITVGSKSISTEFSAIVDSG 323
Query: 331 TSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGG 390
TSFT L+DP YTQI+ +F++ + R S +PFE+CY +S N +P V+LT KGG
Sbjct: 324 TSFTALSDPMYTQITSSFDAQIRSSRNMLDSSMPFEFCYSVSAN--GIVHPNVSLTAKGG 381
Query: 391 GPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREY 429
F VNDPI+ ++ YCL ++KS+ VN+IG +
Sbjct: 382 SIFPVNDPIITITDNAFNPVGYCLAIMKSEGVNLIGENF 420
>gi|326499199|dbj|BAK06090.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 505
Score = 395 bits (1014), Expect = e-107, Method: Compositional matrix adjust.
Identities = 210/408 (51%), Positives = 272/408 (66%), Gaps = 16/408 (3%)
Query: 28 GTFGFDFHHRYSDPVK------GILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQG 81
G +FHHR+S V+ G P G FAY +ALA DR+ R L+A G
Sbjct: 21 GAPSLEFHHRFSARVRRWADSRGHELPGGWPSPGGFAYVAALAGHDRH-----RALSAAG 75
Query: 82 NDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHG 141
+ PLTFS GN T ++++LGFLHY V+VG P +F+VALDTGSDLFWLPC C C
Sbjct: 76 G-RPPLTFSEGNATLKVSNLGFLHYALVTVGTPGHTFMVALDTGSDLFWLPCQCDGCTPP 134
Query: 142 LNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTG 201
+S++ F Y P+ SSTS VPCNS C L+K+C S S+CPY++ Y+S T S+G
Sbjct: 135 PSSAASAPASF--YIPSLSSTSQAVPCNSDFCGLRKEC-SKTSSCPYKMVYVSADTSSSG 191
Query: 202 FLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILAN 261
FLVEDVL+L+T++ + + ++I FGCG VQTGSFLD AAPNGLFGLG+D SVPSILA
Sbjct: 192 FLVEDVLYLSTEDTHPQFLKAQIMFGCGEVQTGSFLDAAAPNGLFGLGVDMISVPSILAQ 251
Query: 262 QGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF 321
+GL NSFSMCFG DG GRISFGD+GS Q ETP + Q HPTY ITIT ++VG N ++
Sbjct: 252 KGLTSNSFSMCFGRDGIGRISFGDQGSSDQEETPLDINQKHPTYAITITGIAVGNNLMDL 311
Query: 322 EFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYP 381
E S IFD+GTSFTYL DPAYT I++ F+S + R + S +PFEYCY LS ++ + P
Sbjct: 312 EVSTIFDTGTSFTYLADPAYTYITDGFHSQVQANRHAADSRIPFEYCYDLSSSEARIQTP 371
Query: 382 VVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREY 429
++L GG F DP ++S + + Y+YCL +VKS +NIIG+ +
Sbjct: 372 SISLRTVGGSLFPAIDPGQVISIQ-QHEYVYCLAIVKSTKLNIIGQNF 418
>gi|326500240|dbj|BAK06209.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 505
Score = 395 bits (1014), Expect = e-107, Method: Compositional matrix adjust.
Identities = 210/408 (51%), Positives = 272/408 (66%), Gaps = 16/408 (3%)
Query: 28 GTFGFDFHHRYSDPVK------GILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQG 81
G +FHHR+S V+ G P G FAY +ALA DR+ R L+A G
Sbjct: 21 GAPSLEFHHRFSARVRRWADSRGHELPGGWPSPGGFAYVAALAGHDRH-----RALSAAG 75
Query: 82 NDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHG 141
+ PLTFS GN T ++++LGFLHY V+VG P +F+VALDTGSDLFWLPC C C
Sbjct: 76 G-RPPLTFSEGNATLKVSNLGFLHYALVTVGTPGHTFMVALDTGSDLFWLPCQCDGCTPP 134
Query: 142 LNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTG 201
+S++ F Y P+ SSTS VPCNS C L+K+C S S+CPY++ Y+S T S+G
Sbjct: 135 PSSAASAPASF--YIPSLSSTSQAVPCNSDFCGLRKEC-SKTSSCPYKMVYVSADTSSSG 191
Query: 202 FLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILAN 261
FLVEDVL+L+T++ + + ++I FGCG VQTGSFLD AAPNGLFGLG+D SVPSILA
Sbjct: 192 FLVEDVLYLSTEDTHPQFLKAQIMFGCGEVQTGSFLDAAAPNGLFGLGVDMISVPSILAQ 251
Query: 262 QGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF 321
+GL NSFSMCFG DG GRISFGD+GS Q ETP + Q HPTY ITIT ++VG N ++
Sbjct: 252 KGLTSNSFSMCFGRDGIGRISFGDQGSSDQEETPLDINQKHPTYAITITGIAVGNNLMDL 311
Query: 322 EFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYP 381
E S IFD+GTSFTYL DPAYT I++ F+S + R + S +PFEYCY LS ++ + P
Sbjct: 312 EVSTIFDTGTSFTYLADPAYTYITDGFHSQVQANRHAADSRIPFEYCYDLSSSEARIQTP 371
Query: 382 VVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREY 429
++L GG F DP ++S + + Y+YCL +VKS +NIIG+ +
Sbjct: 372 SISLRTVGGSLFPAIDPGQVISIQ-QHEYVYCLAIVKSTKLNIIGQNF 418
>gi|242050026|ref|XP_002462757.1| hypothetical protein SORBIDRAFT_02g031460 [Sorghum bicolor]
gi|241926134|gb|EER99278.1| hypothetical protein SORBIDRAFT_02g031460 [Sorghum bicolor]
Length = 523
Score = 392 bits (1006), Expect = e-106, Method: Compositional matrix adjust.
Identities = 210/400 (52%), Positives = 269/400 (67%), Gaps = 2/400 (0%)
Query: 30 FGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTF 89
D HHRYS V+ P G+ YY+ALA D R G AA G + F
Sbjct: 29 LSLDVHHRYSATVREWAGHHRAPPAGTAEYYAALARHDLRRRSLAAGPAAGGGGGGEVAF 88
Query: 90 SAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQV 149
+ GNDTYRLN LGFLHY V++G P ++F+VALDTGSDLFW+PCDC++C L S + +
Sbjct: 89 ADGNDTYRLNELGFLHYAVVALGTPNVTFLVALDTGSDLFWVPCDCINCAP-LVSPNYRD 147
Query: 150 IDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLH 209
+ F+ YSP SSTS KVPC+S LC+LQ C SA S+CPY + YLSD T STG LVEDVL+
Sbjct: 148 LKFDTYSPQKSSTSRKVPCSSNLCDLQSACRSASSSCPYSIEYLSDNTSSTGVLVEDVLY 207
Query: 210 LATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSF 269
L T+ Q K V + I+FGCGR+QTGSFL AAPNGL GLGMD SVPS+LA++G+ NSF
Sbjct: 208 LITEYGQPKIVTAPITFGCGRIQTGSFLGSAAPNGLLGLGMDSISVPSLLASEGVAANSF 267
Query: 270 SMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIFDS 329
SMCFG DG GRI+FGD GS Q ETP ++ + +P YNI+IT VG + N F+AI DS
Sbjct: 268 SMCFGDDGRGRINFGDTGSSDQQETPLNIYKQNPYYNISITGAMVGSKSFNTNFNAIVDS 327
Query: 330 GTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKG 389
GTSFT L+DP Y++I+ +FNS ++K S LPFE+CY +SP + + P ++L KG
Sbjct: 328 GTSFTALSDPMYSEITSSFNSQVQDKPTQLDSSLPFEFCYSISP-KGSVNPPNISLMAKG 386
Query: 390 GGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREY 429
G F VNDPI+ ++ + YCL V+KS+ VN+IG +
Sbjct: 387 GSIFPVNDPIITITDDASNPMAYCLAVMKSEGVNLIGENF 426
>gi|449434470|ref|XP_004135019.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
gi|449517144|ref|XP_004165606.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 508
Score = 390 bits (1002), Expect = e-106, Method: Compositional matrix adjust.
Identities = 208/402 (51%), Positives = 261/402 (64%), Gaps = 8/402 (1%)
Query: 28 GTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPL 87
+F F HHR+SD +K I + LP+K + YY+A+ HRDR L GR LA D TPL
Sbjct: 30 ASFKFTIHHRFSDSIKEIFGSEGLPEKHTPGYYAAMVHRDRL--LHGRNLATTNGD-TPL 86
Query: 88 TFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSG 147
FS GN+TY L+ LG L+Y NVS+G P L F+VALDTGSDLFWLPC+C C L
Sbjct: 87 MFSYGNETYELSGLGNLYYANVSIGTPGLYFLVALDTGSDLFWLPCECTKCPTYLTKRDN 146
Query: 148 QVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDV 207
N YS N SSTS +VPC+S+LCEL QC S S+CPYQ YLS+ + S G+LV+D+
Sbjct: 147 GKFWLNHYSSNASSTSIRVPCSSSLCELANQCSSNKSSCPYQTHYLSENSSSAGYLVQDI 206
Query: 208 LHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPN 267
LH+ATD+ Q K VD +++ GCG+VQTG F + APNGL GLGM K SVPS LA+QGL +
Sbjct: 207 LHMATDDSQLKPVDVKVTLGCGKVQTGKFSNVTAPNGLIGLGMGKVSVPSFLASQGLTTD 266
Query: 268 SFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIF 327
SFSMCFG G GRI FGD G GQ ETPF+ +YN+TI Q+ V N +AI
Sbjct: 267 SFSMCFGYYGYGRIDFGDIGPVGQRETPFN--PASLSYNVTILQIIVTNRPTNVHLTAII 324
Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTM 387
DSG SFTYL DP Y+ I+E ++ + +R S SD PFEYCY LS T F+ P +N TM
Sbjct: 325 DSGASFTYLTDPFYSIITENMDAAMELERIKSDSDFPFEYCYRLSL-ATIFQQPNLNFTM 383
Query: 388 KGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREY 429
+GG F V V V ++ G L CL +VKS ++N+IG +
Sbjct: 384 EGGRKFDVITSYVSVDTD-DGPAL-CLAIVKSTDINVIGHNF 423
>gi|194700652|gb|ACF84410.1| unknown [Zea mays]
gi|414587775|tpg|DAA38346.1| TPA: hypothetical protein ZEAMMB73_272638 [Zea mays]
Length = 500
Score = 390 bits (1002), Expect = e-106, Method: Compositional matrix adjust.
Identities = 202/407 (49%), Positives = 267/407 (65%), Gaps = 19/407 (4%)
Query: 32 FDFHHRYSDPVK------GILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQG---N 82
+FHHR+S P++ G P GS AY +ALA DR+ R ++A G +
Sbjct: 32 LEFHHRFSAPLRRWVEARGRALPGGWPAPGSAAYVAALAGHDRH-----RAVSAAGGSSS 86
Query: 83 DKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGL 142
D PLTF+ GN T ++++LGFLHY V+VG P +F+VALDTGSDLFWLPC C C
Sbjct: 87 DAPPLTFAEGNATLKVSNLGFLHYALVTVGTPGQTFMVALDTGSDLFWLPCQCDGCTPPA 146
Query: 143 NSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGF 202
++SG Y P SSTS VPCNS C+LQK+C S CPY++ Y+S GT S+GF
Sbjct: 147 TAASGSA---TFYIPGMSSTSKAVPCNSNFCDLQKEC-STALQCPYKMVYVSAGTSSSGF 202
Query: 203 LVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQ 262
LVEDVL+L+T+ + + ++I GCG+ QTGSFLD AAPNGLFGLG+D+ SVPSILA +
Sbjct: 203 LVEDVLYLSTENAHPQILKAQIMLGCGQTQTGSFLDAAAPNGLFGLGIDEVSVPSILAQK 262
Query: 263 GLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE 322
GL NSFSMCFG DG GRISFGD+ S Q ETP + + HPTY ITI+ ++VG + +
Sbjct: 263 GLTSNSFSMCFGRDGIGRISFGDQESSDQEETPLDINRQHPTYAITISGITVGNKPTDMD 322
Query: 323 FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
F IFD+GTSFTYL DPAYT I+++F++ + R + S +PFEYCY LS ++ F P
Sbjct: 323 FITIFDTGTSFTYLADPAYTYITQSFHAQVQANRHAADSRIPFEYCYDLSSSEARFPIPD 382
Query: 383 VNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREY 429
+ L G F V DP ++S + + Y+YCL +VKS +NIIG+ +
Sbjct: 383 IILRTVTGSMFPVIDPGQVISIQ-EHEYVYCLAIVKSMKLNIIGQNF 428
>gi|414587774|tpg|DAA38345.1| TPA: hypothetical protein ZEAMMB73_272638 [Zea mays]
Length = 520
Score = 389 bits (1000), Expect = e-105, Method: Compositional matrix adjust.
Identities = 202/407 (49%), Positives = 267/407 (65%), Gaps = 17/407 (4%)
Query: 32 FDFHHRYSDPVK------GILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQG---N 82
+FHHR+S P++ G P GS AY +ALA DR+ R ++A G +
Sbjct: 32 LEFHHRFSAPLRRWVEARGRALPGGWPAPGSAAYVAALAGHDRH-----RAVSAAGGSSS 86
Query: 83 DKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGL 142
D PLTF+ GN T ++++LGFLHY V+VG P +F+VALDTGSDLFWLPC C C
Sbjct: 87 DAPPLTFAEGNATLKVSNLGFLHYALVTVGTPGQTFMVALDTGSDLFWLPCQCDGCTPPA 146
Query: 143 NSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGF 202
++SG Y P SSTS VPCNS C+LQK+C S CPY++ Y+S GT S+GF
Sbjct: 147 TAASGS-FQATFYIPGMSSTSKAVPCNSNFCDLQKEC-STALQCPYKMVYVSAGTSSSGF 204
Query: 203 LVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQ 262
LVEDVL+L+T+ + + ++I GCG+ QTGSFLD AAPNGLFGLG+D+ SVPSILA +
Sbjct: 205 LVEDVLYLSTENAHPQILKAQIMLGCGQTQTGSFLDAAAPNGLFGLGIDEVSVPSILAQK 264
Query: 263 GLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE 322
GL NSFSMCFG DG GRISFGD+ S Q ETP + + HPTY ITI+ ++VG + +
Sbjct: 265 GLTSNSFSMCFGRDGIGRISFGDQESSDQEETPLDINRQHPTYAITISGITVGNKPTDMD 324
Query: 323 FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
F IFD+GTSFTYL DPAYT I+++F++ + R + S +PFEYCY LS ++ F P
Sbjct: 325 FITIFDTGTSFTYLADPAYTYITQSFHAQVQANRHAADSRIPFEYCYDLSSSEARFPIPD 384
Query: 383 VNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREY 429
+ L G F V DP ++S + + Y+YCL +VKS +NIIG+ +
Sbjct: 385 IILRTVTGSMFPVIDPGQVISIQ-EHEYVYCLAIVKSMKLNIIGQNF 430
>gi|242072510|ref|XP_002446191.1| hypothetical protein SORBIDRAFT_06g003200 [Sorghum bicolor]
gi|241937374|gb|EES10519.1| hypothetical protein SORBIDRAFT_06g003200 [Sorghum bicolor]
Length = 499
Score = 389 bits (999), Expect = e-105, Method: Compositional matrix adjust.
Identities = 203/405 (50%), Positives = 266/405 (65%), Gaps = 15/405 (3%)
Query: 32 FDFHHRYSDPVK------GILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKT 85
+FHHR+S P++ G P GS AY +ALA DR+ R A G T
Sbjct: 31 LEFHHRFSAPLRRWAEARGRALPGGWPAPGSAAYVAALAGHDRH---RAVSAAGGGGSGT 87
Query: 86 P-LTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNS 144
P LTF+ GN T ++++LGFLHY V+VG P +F+VALDTGSDLFWLPC C C +
Sbjct: 88 PPLTFAEGNATLKVSNLGFLHYALVTVGTPGQTFMVALDTGSDLFWLPCQCDGCTPPATA 147
Query: 145 SSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLV 204
+SG Y P SSTS VPCNS C+LQK+C S CPY++ Y+S GT S+GFLV
Sbjct: 148 ASGSA---TFYIPGMSSTSKAVPCNSNFCDLQKEC-STALQCPYKMVYVSAGTSSSGFLV 203
Query: 205 EDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGL 264
EDVL+L+T+ + + ++I GCG+ QTGSFLD AAPNGLFGLG+D+ SVPSILA +GL
Sbjct: 204 EDVLYLSTENAHPQILKAQIMLGCGQTQTGSFLDAAAPNGLFGLGIDEVSVPSILAQKGL 263
Query: 265 IPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS 324
NSFSMCFG DG GRISFGD+GS Q ETP ++ Q HPTY ITI+ +++G + +F
Sbjct: 264 TSNSFSMCFGRDGIGRISFGDQGSSDQEETPLNINQQHPTYAITISGITIGNKPTDLDFI 323
Query: 325 AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVN 384
IFD+GTSFTYL DPAYT I+++F++ + R + S +PFEYCY LS ++ F P +
Sbjct: 324 TIFDTGTSFTYLADPAYTYITQSFHAQVQANRHAADSRIPFEYCYDLSSSEARFPIPDII 383
Query: 385 LTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREY 429
L G F V DP ++S + + Y+YCL +VKS +NIIG+ +
Sbjct: 384 LRTVSGSLFPVIDPGQVISIQ-EHEYVYCLAIVKSRKLNIIGQNF 427
>gi|195647908|gb|ACG43422.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
gi|414587776|tpg|DAA38347.1| TPA: aspartic-type endopeptidase/ pepsin A [Zea mays]
Length = 498
Score = 385 bits (990), Expect = e-104, Method: Compositional matrix adjust.
Identities = 202/407 (49%), Positives = 266/407 (65%), Gaps = 21/407 (5%)
Query: 32 FDFHHRYSDPVK------GILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQG---N 82
+FHHR+S P++ G P GS AY +ALA DR+ R ++A G +
Sbjct: 32 LEFHHRFSAPLRRWVEARGRALPGGWPAPGSAAYVAALAGHDRH-----RAVSAAGGSSS 86
Query: 83 DKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGL 142
D PLTF+ GN T ++++LGFLHY V+VG P +F+VALDTGSDLFWLPC C C
Sbjct: 87 DAPPLTFAEGNATLKVSNLGFLHYALVTVGTPGQTFMVALDTGSDLFWLPCQCDGCTPPA 146
Query: 143 NSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGF 202
++SG Y P SSTS VPCNS C+LQK+C S CPY++ Y+S GT S+GF
Sbjct: 147 TAASGSA---TFYIPGMSSTSKAVPCNSNFCDLQKEC-STALQCPYKMVYVSAGTSSSGF 202
Query: 203 LVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQ 262
LVEDVL+L+T+ + + ++I GCG+ QTGSFLD AAPNGLFGLG+D+ SVPSILA +
Sbjct: 203 LVEDVLYLSTENAHPQILKAQIMLGCGQTQTGSFLDAAAPNGLFGLGIDEVSVPSILAQK 262
Query: 263 GLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE 322
GL NSFSMCFG DG GRISFGD+ S Q ETP + + HPTY ITI+ ++VG + +
Sbjct: 263 GLTSNSFSMCFGRDGIGRISFGDQESSDQEETPLDINRQHPTYAITISGITVGNKPTDMD 322
Query: 323 FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
F IFD+GTSFTYL DPAYT I+++F++ + R + S +PFEYCY LS + F P
Sbjct: 323 FITIFDTGTSFTYLADPAYTYITQSFHAQVQANRHAADSRIPFEYCYDLS--EARFPIPD 380
Query: 383 VNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREY 429
+ L G F V DP ++S + + Y+YCL +VKS +NIIG+ +
Sbjct: 381 IILRTVTGSMFPVIDPGQVISIQ-EHEYVYCLAIVKSMKLNIIGQNF 426
>gi|357168101|ref|XP_003581483.1| PREDICTED: aspartic proteinase-like protein 1-like [Brachypodium
distachyon]
Length = 510
Score = 381 bits (979), Expect = e-103, Method: Compositional matrix adjust.
Identities = 203/404 (50%), Positives = 270/404 (66%), Gaps = 18/404 (4%)
Query: 32 FDFHHRYSDPVKGILAVD------DLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKT 85
+FHHR+S ++G P G AY +ALA DR+ R LAA D
Sbjct: 30 LEFHHRFSARLRGWADARGHELPGGWPPPGGAAYVAALAGHDRH-----RALAAA--DHP 82
Query: 86 PLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSS 145
PLTFS GN T ++++LGFLHY V+VG P +F+VALDTGSDLFWLPC C C + +
Sbjct: 83 PLTFSEGNATLKVSNLGFLHYALVTVGTPGHTFMVALDTGSDLFWLPCQCDGCPPPASGA 142
Query: 146 SGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVE 205
SG + Y P+ SSTS VPCNS C+ +K C S S+CPY++ Y+S T S+GFLVE
Sbjct: 143 SGSA---SFYIPSMSSTSQAVPCNSDFCDHRKDC-STTSSCPYKMVYVSADTSSSGFLVE 198
Query: 206 DVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLI 265
DVL+L+T++ + + ++I FGCG+VQTGSFLD AAPNGLFGLG+D SVPSILA++GL
Sbjct: 199 DVLYLSTEDNHPQILKAQIMFGCGQVQTGSFLDAAAPNGLFGLGIDMISVPSILAHKGLT 258
Query: 266 PNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA 325
+SFSMCFG DG GRISFGD+GS Q ETP + Q HPTY ITIT ++VG ++ EFS
Sbjct: 259 SDSFSMCFGRDGIGRISFGDQGSSDQEETPLDINQKHPTYAITITGITVGTEPMDLEFST 318
Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNL 385
IFD+GT+FTYL DPAYT I+++F++ + R + + +PFEYCY LS ++ + P V+
Sbjct: 319 IFDTGTTFTYLADPAYTYITQSFHTQVRANRHAADTRIPFEYCYDLSSSEARIQTPGVSF 378
Query: 386 TMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREY 429
GG F V D ++S + + Y+YCL +VKS +NIIG+ +
Sbjct: 379 RTVGGSLFPVIDLGQVISIQ-QHEYVYCLAIVKSTKLNIIGQNF 421
>gi|222642011|gb|EEE70143.1| hypothetical protein OsJ_30189 [Oryza sativa Japonica Group]
Length = 671
Score = 380 bits (976), Expect = e-103, Method: Compositional matrix adjust.
Identities = 188/338 (55%), Positives = 244/338 (72%), Gaps = 3/338 (0%)
Query: 89 FSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQ 148
F+ GNDTYRLN GFLHY V++G P ++F+VALDTGSDLFW+PCDC+ C + + G
Sbjct: 19 FADGNDTYRLNDFGFLHYAVVALGTPNVTFLVALDTGSDLFWVPCDCLKCAPFQSPNYGS 78
Query: 149 VIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVL 208
+ F++YSP S+TS KVPC+S LC+LQ C S ++CPY ++YLSD T S+G LVEDVL
Sbjct: 79 -LKFDVYSPAQSTTSRKVPCSSNLCDLQNACRSKSNSCPYSIQYLSDNTSSSGVLVEDVL 137
Query: 209 HLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNS 268
+L +D QSK V + I FGCG+VQTGSFL AAPNGL GLGMD SVPS+LA++GL NS
Sbjct: 138 YLTSDSAQSKIVTAPIMFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANS 197
Query: 269 FSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIFD 328
FSMCFG DG GRI+FGD GS Q ETP ++ + +P YNITIT ++VG +++ EFSAI D
Sbjct: 198 FSMCFGDDGHGRINFGDTGSSDQKETPLNVYKQNPYYNITITGITVGSKSISTEFSAIVD 257
Query: 329 SGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMK 388
SGTSFT L+DP YTQI+ +F++ + R S +PFE+CY +S N +P V+LT K
Sbjct: 258 SGTSFTALSDPMYTQITSSFDAQIRSSRNMLDSSMPFEFCYSVSAN--GIVHPNVSLTAK 315
Query: 389 GGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
GG F VNDPI+ ++ YCL ++KS+ VN+IG
Sbjct: 316 GGSIFPVNDPIITITDNAFNPVGYCLAIMKSEGVNLIG 353
>gi|326504502|dbj|BAJ91083.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 537
Score = 379 bits (972), Expect = e-102, Method: Compositional matrix adjust.
Identities = 209/419 (49%), Positives = 267/419 (63%), Gaps = 23/419 (5%)
Query: 30 FGFDFHHRYSDPVKGILAVDDLP-------KKGSFAYYSALAHRDRYFRLRGRGLAAQGN 82
GFD HHR S V+ P +G+ YY+AL DR R RGLA +G+
Sbjct: 29 IGFDLHHRSSPVVRRWAEARGHPGAAWWAEAEGTPEYYAALHRHDRAHLAR-RGLA-EGD 86
Query: 83 DKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGL 142
+ LTF++GN T+RL G LHY V+VG P +F+VALDTGSDLFW+PCDC C
Sbjct: 87 GEGLLTFASGNLTFRLE--GSLHYAEVAVGTPNATFLVALDTGSDLFWVPCDCKQCAPIA 144
Query: 143 NSSSGQV-IDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAG---SNCPYQVRYLSDGTM 198
N+S + D YSP SSTS V C LCE C +AG ++CPY VRY+S T
Sbjct: 145 NASDLRGGPDLRPYSPGKSSTSKAVTCEHALCERPNACAAAGNSSTSCPYTVRYVSANTS 204
Query: 199 STGFLVEDVLHLATDEK--QSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVP 256
S+G LVEDVLHL+ + S +V + + GCG+VQTG+FLDGAA +GL GLGMDK SVP
Sbjct: 205 SSGVLVEDVLHLSREAAGGASTAVTAPVVLGCGQVQTGAFLDGAAVDGLLGLGMDKVSVP 264
Query: 257 SILANQGLIP-NSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVG 315
S+L GL+ +SFSMCF DG GRI+FGD G GQ ETPF++R THPTYNI++T +SV
Sbjct: 265 SVLHAAGLVASDSFSMCFSPDGFGRINFGDSGRRGQAETPFTVRNTHPTYNISVTAMSVS 324
Query: 316 GNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQ 375
G V EF+AI DSGTSFTYLNDPAYT+++ FNS +E+R ++ +PFEYCY L Q
Sbjct: 325 GKEVAAEFAAIVDSGTSFTYLNDPAYTELATGFNSEVRERRANLSASIPFEYCYELGRGQ 384
Query: 376 TNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYL----YCLGVVKSD-NVNIIGREY 429
T P V+LT +GG F V PIV++ E + YCL V+K+D ++IIG+ +
Sbjct: 385 TELFVPEVSLTTRGGAVFPVTRPIVVIYGETSDGRIVAAGYCLAVLKNDITIDIIGQNF 443
>gi|116308959|emb|CAH66084.1| H0209A05.1 [Oryza sativa Indica Group]
Length = 530
Score = 378 bits (970), Expect = e-102, Method: Compositional matrix adjust.
Identities = 200/406 (49%), Positives = 269/406 (66%), Gaps = 13/406 (3%)
Query: 32 FDFHHRYSDPV------KGILAVDDLPKKGSFAYYSALAHRDRYFRLRG--RGLAAQGND 83
+FHHR+S PV +G + P+ GS Y +AL DR L G+
Sbjct: 35 LEFHHRFSSPVQRWAEARGHVLPGGWPEHGSADYVAALNGHDRRRALSAAGGDGGGGGDK 94
Query: 84 KTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLN 143
PLTFS GN T ++++LGFLHY V+VG P +F+VALDTGSDLFWLPC C C +
Sbjct: 95 PPPLTFSEGNATLKVSNLGFLHYALVTVGTPGQTFMVALDTGSDLFWLPCQCDGCTPPAS 154
Query: 144 SSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFL 203
++SG + Y P+ SSTS VPCNS CEL+K+C S S CPY++ Y+S T S+GFL
Sbjct: 155 AASGSA---SFYIPSMSSTSQAVPCNSQFCELRKEC-STTSQCPYKMVYVSADTSSSGFL 210
Query: 204 VEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQG 263
VEDVL+L+T++ + + ++I FGCG+VQTGSFLD AAPNGLFGLG+D S+PSILA +G
Sbjct: 211 VEDVLYLSTEDAIPQILKAQILFGCGQVQTGSFLDAAAPNGLFGLGIDMISIPSILAQKG 270
Query: 264 LIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEF 323
L NSF+MCF DG GRISFGD+GS Q ETP + HPTY I+I++++VG + + EF
Sbjct: 271 LTSNSFAMCFSRDGIGRISFGDQGSSDQEETPLDVNPQHPTYTISISEITVGNSLTDLEF 330
Query: 324 SAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
S IFD+GTSFTYL DPAYT I+++F++ R + S +PFEYCY LS ++ + P +
Sbjct: 331 STIFDTGTSFTYLADPAYTYITQSFHAQVHANRHAADSRIPFEYCYDLSSSEDRIQTPSI 390
Query: 384 NLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREY 429
+L GG F V D ++S + + Y+YCL +VKS +NIIG+ +
Sbjct: 391 SLRTVGGSVFPVIDEGQVISIQ-QHEYVYCLAIVKSAKLNIIGQNF 435
>gi|115457374|ref|NP_001052287.1| Os04g0228000 [Oryza sativa Japonica Group]
gi|38346027|emb|CAE01958.2| OSJNBb0071D01.4 [Oryza sativa Japonica Group]
gi|113563858|dbj|BAF14201.1| Os04g0228000 [Oryza sativa Japonica Group]
gi|215740420|dbj|BAG97076.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222626225|gb|EEE60357.1| hypothetical protein OsJ_13479 [Oryza sativa Japonica Group]
Length = 530
Score = 378 bits (970), Expect = e-102, Method: Compositional matrix adjust.
Identities = 200/406 (49%), Positives = 269/406 (66%), Gaps = 13/406 (3%)
Query: 32 FDFHHRYSDPV------KGILAVDDLPKKGSFAYYSALAHRDRYFRLRG--RGLAAQGND 83
+FHHR+S PV +G + P+ GS Y +AL DR L G+
Sbjct: 35 LEFHHRFSSPVQRWAEARGHVLPGGWPEHGSADYVAALNGHDRRRALSAAGGDGGGGGDK 94
Query: 84 KTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLN 143
PLTFS GN T ++++LGFLHY V+VG P +F+VALDTGSDLFWLPC C C +
Sbjct: 95 PPPLTFSEGNATLKVSNLGFLHYALVTVGTPGQTFMVALDTGSDLFWLPCQCDGCTPPAS 154
Query: 144 SSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFL 203
++SG + Y P+ SSTS VPCNS CEL+K+C S S CPY++ Y+S T S+GFL
Sbjct: 155 AASGSA---SFYIPSMSSTSQAVPCNSQFCELRKEC-STTSQCPYKMVYVSADTSSSGFL 210
Query: 204 VEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQG 263
VEDVL+L+T++ + + ++I FGCG+VQTGSFLD AAPNGLFGLG+D S+PSILA +G
Sbjct: 211 VEDVLYLSTEDAIPQILKAQILFGCGQVQTGSFLDAAAPNGLFGLGIDMISIPSILAQKG 270
Query: 264 LIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEF 323
L NSF+MCF DG GRISFGD+GS Q ETP + HPTY I+I++++VG + + EF
Sbjct: 271 LTSNSFAMCFSRDGIGRISFGDQGSSDQEETPLDVNPQHPTYTISISEITVGNSLTDLEF 330
Query: 324 SAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
S IFD+GTSFTYL DPAYT I+++F++ R + S +PFEYCY LS ++ + P +
Sbjct: 331 STIFDTGTSFTYLADPAYTYITQSFHAQVHANRHAADSRIPFEYCYDLSSSEDRIQTPSI 390
Query: 384 NLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREY 429
+L GG F V D ++S + + Y+YCL +VKS +NIIG+ +
Sbjct: 391 SLRTVGGSVFPVIDEGQVISIQ-QHEYVYCLAIVKSAKLNIIGQNF 435
>gi|125546587|gb|EAY92726.1| hypothetical protein OsI_14476 [Oryza sativa Indica Group]
Length = 530
Score = 377 bits (968), Expect = e-102, Method: Compositional matrix adjust.
Identities = 200/406 (49%), Positives = 269/406 (66%), Gaps = 13/406 (3%)
Query: 32 FDFHHRYSDPV------KGILAVDDLPKKGSFAYYSALAHRDRYFRLRG--RGLAAQGND 83
+FHHR+S PV +G + P+ GS Y +AL DR L G+
Sbjct: 35 LEFHHRFSSPVQRWAEARGHVLPGGWPEHGSADYVAALNGHDRRRALSAAGGDGGGGGDK 94
Query: 84 KTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLN 143
PLTFS GN T ++++LGFLHY V+VG P +F+VALDTGSDLFWLPC C C +
Sbjct: 95 PPPLTFSEGNATLKVSNLGFLHYALVTVGTPGQTFMVALDTGSDLFWLPCQCDGCTPPAS 154
Query: 144 SSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFL 203
++SG + Y P+ SSTS VPCNS CEL+K+C S S CPY++ Y+S T S+GFL
Sbjct: 155 AASGSA---SFYIPSMSSTSQAVPCNSQFCELRKEC-STTSQCPYKMVYVSADTSSSGFL 210
Query: 204 VEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQG 263
VEDVL+L+T++ + + ++I FGCG+VQTGSFLD AAPNGLFGLG+D S+PSILA +G
Sbjct: 211 VEDVLYLSTEDAIPQILKAQILFGCGQVQTGSFLDAAAPNGLFGLGIDMISIPSILAQKG 270
Query: 264 LIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEF 323
L NSF+MCF DG GRISFGD+GS Q ETP + HPTY I+I++++VG + + EF
Sbjct: 271 LTSNSFAMCFSRDGIGRISFGDQGSSDQEETPLDVNPQHPTYTISISEMTVGNSLTDLEF 330
Query: 324 SAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
S IFD+GTSFTYL DPAYT I+++F++ R + S +PFEYCY LS ++ + P +
Sbjct: 331 STIFDTGTSFTYLADPAYTYITQSFHAQVHANRHAADSRIPFEYCYDLSSSEDRIQTPSI 390
Query: 384 NLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREY 429
+L GG F V D ++S + + Y+YCL +VKS +NIIG+ +
Sbjct: 391 SLRTVGGSVFPVIDEGQVISIQ-QHEYVYCLAIVKSAKLNIIGQNF 435
>gi|3805854|emb|CAA21474.1| putative protein [Arabidopsis thaliana]
gi|7270540|emb|CAB81497.1| putative protein [Arabidopsis thaliana]
Length = 455
Score = 375 bits (963), Expect = e-101, Method: Compositional matrix adjust.
Identities = 192/376 (51%), Positives = 258/376 (68%), Gaps = 16/376 (4%)
Query: 1 MASSYRNSPVCVLLILLSCCAGCCFGFGTFGFDFHHRYSDPVK------GILAVDDLPKK 54
M+ + + + ++ IL+ G C G F F+ HHR+SD VK G A P K
Sbjct: 1 MSCCFFKTTLFLIPILMLLSFGSCNG-RIFTFEMHHRFSDEVKQWSDSTGRFA--KFPPK 57
Query: 55 GSFAYYSALAHRDRYFRLRGRGLAAQGNDKTP-LTFSAGNDTYRLNSLGFLHYTNVSVGQ 113
GSF Y++AL RD + +RGR L+ ++ LTFS GN T R++SLGFLHYT V +G
Sbjct: 58 GSFEYFNALVLRD--WLIRGRRLSESESESESSLTFSDGNSTSRISSLGFLHYTTVKLGT 115
Query: 114 PALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC 173
P + F+VALDTGSDLFW+PCDC C ++ + +IY+P S+T+ KV CN++LC
Sbjct: 116 PGMRFMVALDTGSDLFWVPCDCGKCAPTEGATYASEFELSIYNPKVSTTNKKVTCNNSLC 175
Query: 174 ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQT 233
+ QC S CPY V Y+S T ++G L+EDV+HL T++K + V++ ++FGCG+VQ+
Sbjct: 176 AQRNQCLGTFSTCPYMVSYVSAQTSTSGILMEDVMHLTTEDKNPERVEAYVTFGCGQVQS 235
Query: 234 GSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGE 293
GSFLD AAPNGLFGLGM+K SVPS+LA +GL+ +SFSMCFG DG GRISFGDKGS Q E
Sbjct: 236 GSFLDIAAPNGLFGLGMEKISVPSVLAREGLVADSFSMCFGHDGVGRISFGDKGSSDQEE 295
Query: 294 TPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAK 353
TPF+L +HP YNIT+T+V VG ++ EF+A+FD+GTSFTYL DP YT +SE+ A+
Sbjct: 296 TPFNLNPSHPNYNITVTRVRVGTTLIDDEFTALFDTGTSFTYLVDPMYTTVSES----AQ 351
Query: 354 EKRETSTSDLPFEYCY 369
+KR + S +PFEYCY
Sbjct: 352 DKRHSPDSRIPFEYCY 367
>gi|18409320|ref|NP_566948.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|27754243|gb|AAO22575.1| unknown protein [Arabidopsis thaliana]
gi|332645259|gb|AEE78780.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 529
Score = 370 bits (949), Expect = e-100, Method: Compositional matrix adjust.
Identities = 207/413 (50%), Positives = 272/413 (65%), Gaps = 11/413 (2%)
Query: 24 CFGFGTFGFDFHHRYSDPVKGILAVDDL-PKKGSFAYYSALAHRDRYFRLRGRGLAAQGN 82
C G F F+ HH +SD VK L +DDL P+KGS Y+ LA RDR +RGRGLA+ N
Sbjct: 23 CEASGKFSFEVHHMFSDRVKQSLGLDDLVPEKGSLEYFKVLAQRDRL--IRGRGLASN-N 79
Query: 83 DKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVS-CVHG 141
++TP+TF GN T ++ LGFLHY NVSVG PA F+VALDTGSDLFWLPC+C S C+
Sbjct: 80 EETPITFMRGNRTISIDLLGFLHYANVSVGTPATWFLVALDTGSDLFWLPCNCGSTCIRD 139
Query: 142 LNSSS-GQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMST 200
L Q N+YSPNTSSTSS + C+ C +C S S+CPYQ++YLS T +T
Sbjct: 140 LKEVGLSQSRPLNLYSPNTSSTSSSIRCSDDRCFGSSRCSSPASSCPYQIQYLSKDTFTT 199
Query: 201 GFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILA 260
G L EDVLHL T+++ + V + I+ GCG+ QTG AA NGL GLG+ SVPSILA
Sbjct: 200 GTLFEDVLHLVTEDEGLEPVKANITLGCGKNQTGFLQSSAAVNGLLGLGLKDYSVPSILA 259
Query: 261 NQGLIPNSFSMCFGS--DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNA 318
+ NSFSMCFG+ D GRISFGDKG Q ETP + PTY +++T+VSVGG+A
Sbjct: 260 KAKITANSFSMCFGNIIDVVGRISFGDKGYTDQMETPLLPTEPSPTYAVSVTEVSVGGDA 319
Query: 319 VNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNF 378
V + A+FD+GTSFT+L +P Y I++ F+ +KR +LPFE+CY LSPN+T
Sbjct: 320 VGVQLLALFDTGTSFTHLLEPEYGLITKAFDDHVTDKRRPIDPELPFEFCYDLSPNKTTI 379
Query: 379 EYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDN--VNIIGREY 429
+P V +T +GG F+ +P+ IV +E +YCLG++KS + +NIIG+ +
Sbjct: 380 LFPRVAMTFEGGSQMFLRNPLFIVWNEDNS-AMYCLGILKSVDFKINIIGQNF 431
>gi|449529194|ref|XP_004171586.1| PREDICTED: aspartic proteinase-like protein 1-like, partial
[Cucumis sativus]
Length = 417
Score = 369 bits (946), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 174/326 (53%), Positives = 232/326 (71%), Gaps = 2/326 (0%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
LHYT V +G P F+VALDTGSDLFW+PCDC C S + ++YSP SSTS
Sbjct: 3 LHYTTVQLGTPGTKFMVALDTGSDLFWVPCDCSRCAPTEGSPYASDFELSVYSPKKSSTS 62
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
VPCN++LC + QC A NCPY V Y+S T +TG L+ED+LHL T+ K S+ + +
Sbjct: 63 KTVPCNNSLCAQRDQCTEAFGNCPYVVSYVSAETSTTGILIEDLLHLKTENKHSEPIQAY 122
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISF 283
I+FGCG+VQ+GSFLD AAPNGLFGLGM++ SVPSIL+ +GL+ NSFSMCF DG GRI+F
Sbjct: 123 ITFGCGQVQSGSFLDVAAPNGLFGLGMEQISVPSILSREGLMANSFSMCFSDDGVGRINF 182
Query: 284 GDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQ 343
GDKGS Q ETPF+L Q HP YNIT+T + VG ++ + +A+FDSGTSF+Y DP Y++
Sbjct: 183 GDKGSLEQEETPFNLNQLHPNYNITVTSIRVGTTLIDADITALFDSGTSFSYFTDPIYSK 242
Query: 344 ISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVS 403
+S +F++ ++ R +PFEYCY +SP+ P ++LTMKGGGPF V DPI+++S
Sbjct: 243 LSASFHAQTRDGRHPPNPRIPFEYCYNMSPDANASLTPGISLTMKGGGPFPVYDPIIVIS 302
Query: 404 SEPKGLYLYCLGVVKSDNVNIIGREY 429
++ + +YCL VVKS +NIIG+ +
Sbjct: 303 TQNE--LIYCLAVVKSAELNIIGQNF 326
>gi|115480451|ref|NP_001063819.1| Os09g0542100 [Oryza sativa Japonica Group]
gi|113632052|dbj|BAF25733.1| Os09g0542100, partial [Oryza sativa Japonica Group]
Length = 490
Score = 367 bits (942), Expect = 7e-99, Method: Compositional matrix adjust.
Identities = 182/333 (54%), Positives = 238/333 (71%), Gaps = 3/333 (0%)
Query: 97 RLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYS 156
RLN GFLHY V++G P ++F+VALDTGSDLFW+PCDC+ C + + G + F++YS
Sbjct: 68 RLNDFGFLHYAVVALGTPNVTFLVALDTGSDLFWVPCDCLKCAPFQSPNYGS-LKFDVYS 126
Query: 157 PNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
P S+TS KVPC+S LC+LQ C S ++CPY ++YLSD T S+G LVEDVL+L +D Q
Sbjct: 127 PAQSTTSRKVPCSSNLCDLQNACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDSAQ 186
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD 276
SK V + I FGCG+VQTGSFL AAPNGL GLGMD SVPS+LA++GL NSFSMCFG D
Sbjct: 187 SKIVTAPIMFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCFGDD 246
Query: 277 GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIFDSGTSFTYL 336
G GRI+FGD GS Q ETP ++ + +P YNITIT ++VG +++ EFSAI DSGTSFT L
Sbjct: 247 GHGRINFGDTGSSDQKETPLNVYKQNPYYNITITGITVGSKSISTEFSAIVDSGTSFTAL 306
Query: 337 NDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVN 396
+DP YTQI+ +F++ + R S +PFE+CY +S N +P V+LT KGG F VN
Sbjct: 307 SDPMYTQITSSFDAQIRSSRNMLDSSMPFEFCYSVSAN--GIVHPNVSLTAKGGSIFPVN 364
Query: 397 DPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREY 429
DPI+ ++ YCL ++KS+ VN+IG +
Sbjct: 365 DPIITITDNAFNPVGYCLAIMKSEGVNLIGENF 397
>gi|224033419|gb|ACN35785.1| unknown [Zea mays]
gi|413934980|gb|AFW69531.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
Length = 543
Score = 366 bits (940), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 200/425 (47%), Positives = 262/425 (61%), Gaps = 32/425 (7%)
Query: 29 TFGFDFHHRYSDPVK-------GILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQG 81
+FGFD HHR+S V+ G LA D P +G+ YYSAL+ DR R A G
Sbjct: 33 SFGFDLHHRFSPVVRRWAEARGGPLAADQWPARGTPEYYSALSRHDRARRA-----LAGG 87
Query: 82 NDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSC--V 139
D LTF+AGNDTY+ G L+Y V +G P +F+VALDTGSDLFW+PCDC C +
Sbjct: 88 ADDGLLTFAAGNDTYQS---GTLYYAEVELGTPNATFLVALDTGSDLFWVPCDCRQCATI 144
Query: 140 HGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSA-GSNCPYQVRYLSDGTM 198
N + YSP SSTS +V C++ LC + C +A +CPY+V+Y+S T
Sbjct: 145 PSANGTGQDAPSLRPYSPRRSSTSKQVACDNPLCGQRNGCSAATNGSCPYEVQYVSANTS 204
Query: 199 STGFLVEDVLHLATDE----KQSKSVDSRISFGCGRVQTGSFLDGA--APNGLFGLGMDK 252
S+G LV+DVLHL + +++ + + FGCG+VQTG+FLDG A +GL GLGM K
Sbjct: 205 SSGVLVQDVLHLTRERPGPGAAGEALQAPVVFGCGQVQTGAFLDGGGGAVDGLMGLGMGK 264
Query: 253 TSVPSILANQGLIP-NSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQ 311
SVPS LA GL+ +SFSMCFG DG GR++FGD GS GQ ETPF++R +PTYN++ T
Sbjct: 265 VSVPSALAASGLVASDSFSMCFGDDGVGRVNFGDAGSRGQAETPFTVRSLNPTYNVSFTS 324
Query: 312 VSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKR----ETSTSDLPFEY 367
+ VG +V EF+A+ DSGTSFTYL+DP YTQ++ FNS E+R S PFEY
Sbjct: 325 IGVGSESVAAEFAAVMDSGTSFTYLSDPEYTQLATKFNSQVSERRVNFSSGSADPFPFEY 384
Query: 368 CYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD---NVNI 424
CY LSPNQT P V+LT KGG F V P + V YCL ++++D ++I
Sbjct: 385 CYRLSPNQTEVAMPDVSLTAKGGALFPVTQPFIPVGDTTGRAVGYCLAIMRNDMAIGIDI 444
Query: 425 IGREY 429
IG+ +
Sbjct: 445 IGQNF 449
>gi|52076082|dbj|BAD46595.1| aspartic proteinase nepenthesin II -like [Oryza sativa Japonica
Group]
Length = 476
Score = 366 bits (939), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 182/335 (54%), Positives = 238/335 (71%), Gaps = 3/335 (0%)
Query: 95 TYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNI 154
T LN GFLHY V++G P ++F+VALDTGSDLFW+PCDC+ C + + G + F++
Sbjct: 52 TADLNDFGFLHYAVVALGTPNVTFLVALDTGSDLFWVPCDCLKCAPFQSPNYGS-LKFDV 110
Query: 155 YSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
YSP S+TS KVPC+S LC+LQ C S ++CPY ++YLSD T S+G LVEDVL+L +D
Sbjct: 111 YSPAQSTTSRKVPCSSNLCDLQNACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDS 170
Query: 215 KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG 274
QSK V + I FGCG+VQTGSFL AAPNGL GLGMD SVPS+LA++GL NSFSMCFG
Sbjct: 171 AQSKIVTAPIMFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCFG 230
Query: 275 SDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIFDSGTSFT 334
DG GRI+FGD GS Q ETP ++ + +P YNITIT ++VG +++ EFSAI DSGTSFT
Sbjct: 231 DDGHGRINFGDTGSSDQKETPLNVYKQNPYYNITITGITVGSKSISTEFSAIVDSGTSFT 290
Query: 335 YLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFF 394
L+DP YTQI+ +F++ + R S +PFE+CY +S N +P V+LT KGG F
Sbjct: 291 ALSDPMYTQITSSFDAQIRSSRNMLDSSMPFEFCYSVSAN--GIVHPNVSLTAKGGSIFP 348
Query: 395 VNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREY 429
VNDPI+ ++ YCL ++KS+ VN+IG +
Sbjct: 349 VNDPIITITDNAFNPVGYCLAIMKSEGVNLIGENF 383
>gi|147839328|emb|CAN63378.1| hypothetical protein VITISV_015700 [Vitis vinifera]
Length = 585
Score = 365 bits (936), Expect = 3e-98, Method: Compositional matrix adjust.
Identities = 185/339 (54%), Positives = 233/339 (68%), Gaps = 12/339 (3%)
Query: 13 LLILLSCCAGCCFGFGTFGFDFHHRYSDPVKGI-------LAVDDLPKKGSFAYYSALAH 65
++ILLS F F HHR+S+PVK + P KGSF YY+ LAH
Sbjct: 9 IVILLSILGFRSCHARIFSFQMHHRFSEPVKKWSEGAGNGFPAGNWPAKGSFEYYAELAH 68
Query: 66 RDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTG 125
RDR LRGR L+ + LTFS GN T+R++SLGFLHYT VS+G P F+VALDTG
Sbjct: 69 RDR--ALRGRRLS---DIDGLLTFSDGNSTFRISSLGFLHYTTVSLGTPGKKFLVALDTG 123
Query: 126 SDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSN 185
SDLFW+PCDC C ++ + +IY+P SSTS KV CN++LC + +C SN
Sbjct: 124 SDLFWVPCDCSRCAPTEGTTYASDFELSIYNPKGSSTSRKVTCNNSLCAHRNRCLGTFSN 183
Query: 186 CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGL 245
CPY V Y+S T ++G LVEDVLHL T++ + + V++ ++FGCG+VQTGSFLD AAPNGL
Sbjct: 184 CPYMVSYVSAETSTSGILVEDVLHLTTEDNRQEFVEAYVTFGCGQVQTGSFLDIAAPNGL 243
Query: 246 FGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTY 305
FGLG++K SVPSIL+ +G +SFSMCFG DG GRISFGDKG P Q ETPF+L HPTY
Sbjct: 244 FGLGLEKISVPSILSKEGFTADSFSMCFGPDGIGRISFGDKGGPDQEETPFNLNALHPTY 303
Query: 306 NITITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQI 344
NIT+TQV VG ++ +F+A+FDSGTSFTYL DP YT +
Sbjct: 304 NITVTQVRVGTTLIDLDFTALFDSGTSFTYLVDPIYTNV 342
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 55/97 (56%), Positives = 70/97 (72%), Gaps = 3/97 (3%)
Query: 7 NSPVCVLLILLS-CCAGCCFGFGTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAH 65
NS ++++L+S + C+G GTFGFD HHR+SDPVKGIL VDDLP+K S YY A+AH
Sbjct: 491 NSXWVLVVVLISGWVSQICYGLGTFGFDMHHRFSDPVKGILDVDDLPEKLSLQYYKAMAH 550
Query: 66 RDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLG 102
RD + + GR L+ K PLTFS GN+TYRL+SLG
Sbjct: 551 RD--WVIHGRRLSTSDEVKPPLTFSDGNETYRLSSLG 585
>gi|226499286|ref|NP_001147826.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
gi|195613980|gb|ACG28820.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
Length = 545
Score = 363 bits (933), Expect = 7e-98, Method: Compositional matrix adjust.
Identities = 199/424 (46%), Positives = 262/424 (61%), Gaps = 32/424 (7%)
Query: 30 FGFDFHHRYSDPVK-------GILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGN 82
FGFD HHR+S V+ G LA D P +G+ YYSAL+ DR R A G
Sbjct: 36 FGFDLHHRFSPVVRRWAEARGGPLAADRWPARGTPEYYSALSRHDR-----ARRALAGGA 90
Query: 83 DKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSC--VH 140
D LTF+AGNDTY+ G L+Y V +G P +F+VALDTGSDLFW+PCDC C +
Sbjct: 91 DDGLLTFAAGNDTYQS---GTLYYAEVELGTPNATFLVALDTGSDLFWVPCDCRQCATIP 147
Query: 141 GLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSA-GSNCPYQVRYLSDGTMS 199
N++ YSP SSTS +V C++ LC + C +A +CPY+V+Y+S T S
Sbjct: 148 SANATGPDAPPLRPYSPRRSSTSEQVACDNPLCGRRNGCSAATNGSCPYEVQYVSANTSS 207
Query: 200 TGFLVEDVLHLATDE----KQSKSVDSRISFGCGRVQTGSFLD--GAAPNGLFGLGMDKT 253
+G LV+DVLHL + +++ + + FGCG+VQTG+FLD G A +GL GLGM K
Sbjct: 208 SGVLVQDVLHLTRERPGPGAAGEALQAPVVFGCGQVQTGAFLDDGGGAVDGLMGLGMGKV 267
Query: 254 SVPSILANQGLIP-NSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQV 312
SVPS LA GL+ +SFSMCFG DG GR++FGD GS GQ ETPF++R +PTYN++ T +
Sbjct: 268 SVPSALAASGLVASDSFSMCFGDDGVGRVNFGDAGSRGQAETPFTVRSLNPTYNVSFTSI 327
Query: 313 SVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKR----ETSTSDLPFEYC 368
+G +V EF+A+ DSGTSFTYL+DP YTQ++ FNS E+R S PFEYC
Sbjct: 328 GIGSESVAAEFAAVMDSGTSFTYLSDPEYTQLATKFNSQVSERRVNFSSGSADPFPFEYC 387
Query: 369 YVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD---NVNII 425
Y LSPNQT P V+LT KGG F V P + V YCL ++++D ++II
Sbjct: 388 YRLSPNQTEVAMPDVSLTAKGGALFPVTQPFIPVGDTTGRAIGYCLAIMRNDMAIGIDII 447
Query: 426 GREY 429
G+ +
Sbjct: 448 GQNF 451
>gi|242094226|ref|XP_002437603.1| hypothetical protein SORBIDRAFT_10g030330 [Sorghum bicolor]
gi|241915826|gb|EER88970.1| hypothetical protein SORBIDRAFT_10g030330 [Sorghum bicolor]
Length = 541
Score = 359 bits (921), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 203/422 (48%), Positives = 267/422 (63%), Gaps = 29/422 (6%)
Query: 29 TFGFDFHHRYSDPVKGI-------LAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQG 81
+ GFD HHR+S V+ A D P +GS YYSAL+ DR R R LA G
Sbjct: 33 SVGFDLHHRFSPVVRQWAEARGHPFAAQDWPARGSPEYYSALSRHDRAVLSR-RALA-DG 90
Query: 82 NDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHG 141
D +TF+AGNDT L +G L+Y V VG P +F+VALDTGSDLFW+PCDC C
Sbjct: 91 ADGL-VTFAAGNDT--LQYIGSLYYAVVEVGTPNATFLVALDTGSDLFWVPCDCKQCASI 147
Query: 142 LNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSA-GSNCPYQVRYLSDGTMST 200
N + YSP SSTS +V C++ LC+ C +A +CPY+V+YLS T ++
Sbjct: 148 ANVTGQPATALRPYSPRESSTSKQVTCDNALCDRPNGCSAATNGSCPYEVQYLSANTSTS 207
Query: 201 GFLVEDVLHLATDE-----KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSV 255
G LV+DVLHL + + +++ + + FGCG+VQTG+FLDGAA +GL GLG + SV
Sbjct: 208 GVLVQDVLHLTRERPGAAAEAGEALQAPVVFGCGQVQTGTFLDGAAFDGLMGLGRENVSV 267
Query: 256 PSILANQGLIP-NSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSV 314
PS+LA+ GL+ +SFSMCFG DG GRI+FGD GS GQGETPF+ R+T YN++ T V+V
Sbjct: 268 PSVLASSGLVASDSFSMCFGDDGVGRINFGDSGSSGQGETPFTGRRT--LYNVSFTAVNV 325
Query: 315 GGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRET----STSDLPFEYCYV 370
+V EF+A+ DSGTSFTYL DP YT+++ FNSL +E+R S PFEYCY
Sbjct: 326 ETKSVAAEFAAVIDSGTSFTYLADPEYTELATNFNSLVRERRTNFSSGSADPFPFEYCYA 385
Query: 371 LSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD---NVNIIGR 427
L PNQT P V+LT KGG F V P++ V+S + + YCL ++K+D N NIIG+
Sbjct: 386 LGPNQTEALIPDVSLTTKGGARFPVTQPVIGVASG-RTVVGYCLAIMKNDLGVNFNIIGQ 444
Query: 428 EY 429
+
Sbjct: 445 NF 446
>gi|297819834|ref|XP_002877800.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297323638|gb|EFH54059.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 531
Score = 357 bits (915), Expect = 8e-96, Method: Compositional matrix adjust.
Identities = 193/416 (46%), Positives = 264/416 (63%), Gaps = 14/416 (3%)
Query: 24 CFGFGTFGFDFHHRYSDPVKGILAVDDL-PKKGSFAYYSALAHRDRYFRLRGRGLAAQGN 82
C G FGF+ HH +SD VK L +DDL P++GS Y+ LAHRDR +RGRGLA+ N
Sbjct: 23 CEASGKFGFEVHHIFSDAVKQSLGLDDLVPEQGSLEYFKVLAHRDRL--IRGRGLASN-N 79
Query: 83 DKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDC-VSCVHG 141
+ TP+TF GN T + LG L+Y NVSVG P SF+VALDTGSDLFWLPC+C +C+
Sbjct: 80 EDTPVTFDGGNLTVSIKLLGSLYYANVSVGTPPSSFLVALDTGSDLFWLPCNCGTTCIRD 139
Query: 142 LNS-SSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMST 200
L Q + N+Y+PN S+TSS + C+ C K+C S S CPYQ+ Y S+ T +T
Sbjct: 140 LEDIGVPQSVPLNLYTPNASTTSSSIRCSDKRCFGSKKCSSPKSICPYQISY-SNSTGTT 198
Query: 201 GFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILA 260
G L++DVLHLAT+++ V + ++ GCG+ QTG F + NG+ GLG+ SVPS+LA
Sbjct: 199 GTLLQDVLHLATEDENLTPVKTNVTLGCGQKQTGLFQRNNSVNGVLGLGIKGYSVPSLLA 258
Query: 261 NQGLIPNSFSMCFGS--DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNA 318
+ +SFSMCFG GRISFGDKG Q ETPF Y + +T VSVGG+
Sbjct: 259 KANITADSFSMCFGRVIGNVGRISFGDKGYTDQEETPFISVAPSTAYGLNVTGVSVGGDP 318
Query: 319 VNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNF 378
V A FD+G+SFT+L +PAY ++++F+ L ++KR +LPFE+CY LSPN T+
Sbjct: 319 VGTRLFAKFDTGSSFTHLMEPAYGVLTKSFDDLVEDKRRPVDPELPFEFCYDLSPNATSI 378
Query: 379 EYPVVNLTMKGGGPFFVNDPIVIVSSEPK---GLYLYCLGVVKSD--NVNIIGREY 429
E+P V +T GG +N+P ++ + G +YCLGV+KS +N+IG+ +
Sbjct: 379 EFPFVEMTFVGGSKIILNNPFFTARTQARHGEGNVMYCLGVLKSVGLKINVIGQNF 434
>gi|6562285|emb|CAB62655.1| putative protein [Arabidopsis thaliana]
Length = 519
Score = 356 bits (913), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 206/413 (49%), Positives = 269/413 (65%), Gaps = 21/413 (5%)
Query: 24 CFGFGTFGFDFHHRYSDPVKGILAVDDL-PKKGSFAYYSALAHRDRYFRLRGRGLAAQGN 82
C G F F+ HH +SD VK L +DDL P+KGS Y+ LA RDR +RGRGLA+ N
Sbjct: 23 CEASGKFSFEVHHMFSDRVKQSLGLDDLVPEKGSLEYFKVLAQRDRL--IRGRGLASN-N 79
Query: 83 DKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVS-CVHG 141
++TP+TF GN T ++ LGFLHY NVSVG PA F+VALDTGSDLFWLPC+C S C+
Sbjct: 80 EETPITFMRGNRTISIDLLGFLHYANVSVGTPATWFLVALDTGSDLFWLPCNCGSTCIRD 139
Query: 142 LNSSS-GQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMST 200
L Q N+YSPNTSSTSS + C+ C +C S S+CPYQ++YLS T +T
Sbjct: 140 LKEVGLSQSRPLNLYSPNTSSTSSSIRCSDDRCFGSSRCSSPASSCPYQIQYLSKDTFTT 199
Query: 201 GFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILA 260
G L EDVLHL T+++ + V + I+ GCG+ QTG AA NGL GLG+ SVPSILA
Sbjct: 200 GTLFEDVLHLVTEDEGLEPVKANITLGCGKNQTGFLQSSAAVNGLLGLGLKDYSVPSILA 259
Query: 261 NQGLIPNSFSMCFGS--DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNA 318
+ NSFSMCFG+ D GRISFGDKG Q ETP L T P ++T+VSVGG+A
Sbjct: 260 KAKITANSFSMCFGNIIDVVGRISFGDKGYTDQMETP--LLPTEP----SVTEVSVGGDA 313
Query: 319 VNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNF 378
V + A+FD+GTSFT+L +P Y I++ F+ +KR +LPFE+CY LSPN+T
Sbjct: 314 VGVQLLALFDTGTSFTHLLEPEYGLITKAFDDHVTDKRRPIDPELPFEFCYDLSPNKTTI 373
Query: 379 EYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDN--VNIIGREY 429
+P V +T +GG F+ +P+ I +S +YCLG++KS + +NIIG+ +
Sbjct: 374 LFPRVAMTFEGGSQMFLRNPLFIDNSA-----MYCLGILKSVDFKINIIGQNF 421
>gi|42565826|ref|NP_190703.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332645261|gb|AEE78782.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 528
Score = 355 bits (911), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 191/419 (45%), Positives = 266/419 (63%), Gaps = 18/419 (4%)
Query: 24 CFGF------GTFGFDFHHRYSDPVKGILAVDDL-PKKGSFAYYSALAHRDRYFRLRGRG 76
C+GF G FGF+ HH +SD VK L + DL P++GS Y+ LAHRDR +RGRG
Sbjct: 17 CWGFERCEATGKFGFEVHHIFSDSVKQSLGLGDLVPEQGSLEYFKVLAHRDRL--IRGRG 74
Query: 77 LAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDC- 135
LA+ ND+TP+TF GN T + LG L+Y NVSVG P SF+VALDTGSDLFWLPC+C
Sbjct: 75 LASN-NDETPITFDGGNLTVSVKLLGSLYYANVSVGTPPSSFLVALDTGSDLFWLPCNCG 133
Query: 136 VSCVHGLNS-SSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLS 194
+C+ L Q + N+Y+PN S+TSS + C+ C K+C S S CPYQ+ Y S
Sbjct: 134 TTCIRDLEDIGVPQSVPLNLYTPNASTTSSSIRCSDKRCFGSKKCSSPSSICPYQISY-S 192
Query: 195 DGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTS 254
+ T + G L++DVLHLAT+++ V + ++ GCG+ QTG F + NG+ GLG+ S
Sbjct: 193 NSTGTKGTLLQDVLHLATEDENLTPVKANVTLGCGQKQTGLFQRNNSVNGVLGLGIKGYS 252
Query: 255 VPSILANQGLIPNSFSMCFGS--DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQV 312
VPS+LA + NSFSMCFG GRISFGD+G Q ETPF Y + I+ V
Sbjct: 253 VPSLLAKANITANSFSMCFGRVIGNVGRISFGDRGYTDQEETPFISVAPSTAYGVNISGV 312
Query: 313 SVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLS 372
SV G+ V+ A FD+G+SFT+L +PAY ++++F+ L +++R +LPFE+CY LS
Sbjct: 313 SVAGDPVDIRLFAKFDTGSSFTHLREPAYGVLTKSFDELVEDRRRPVDPELPFEFCYDLS 372
Query: 373 PNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD--NVNIIGREY 429
PN T ++P+V +T GG +N+P ++ +G +YCLGV+KS +N+IG+ +
Sbjct: 373 PNATTIQFPLVEMTFIGGSKIILNNPFFTARTQ-EGNVMYCLGVLKSVGLKINVIGQNF 430
>gi|297819828|ref|XP_002877797.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297323635|gb|EFH54056.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 530
Score = 355 bits (910), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 209/413 (50%), Positives = 268/413 (64%), Gaps = 11/413 (2%)
Query: 24 CFGFGTFGFDFHHRYSDPVKGILAVDDL-PKKGSFAYYSALAHRDRYFRLRGRGLAAQGN 82
C G F F+ HH +SD VK L +DDL P+KGS Y+ LA RDR +RGRGLA+ N
Sbjct: 24 CEASGKFSFEVHHMFSDRVKQTLGLDDLVPEKGSLEYFKVLAQRDRL--IRGRGLASN-N 80
Query: 83 DKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVS-CVHG 141
++TP+TF GN T ++ LGFLHY NVSVG PA F+VALDTGS+LFWLPC+C S C+
Sbjct: 81 EETPITFMRGNRTVSIDFLGFLHYANVSVGTPATWFLVALDTGSNLFWLPCNCGSTCIRD 140
Query: 142 LNSSS-GQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMST 200
L Q N+YSPNTSSTSS + CN C QC S S+CPYQ++YLS T +T
Sbjct: 141 LKDIGLSQSRPLNLYSPNTSSTSSSIRCNDDRCFGSSQCSSPASSCPYQIQYLSKDTFTT 200
Query: 201 GFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILA 260
G L EDVLHL T++ K V + I+ GCGR QTG AA NGL GLGM SVPSILA
Sbjct: 201 GTLFEDVLHLVTEDVDLKPVKANITLGCGRNQTGFLQSSAAINGLLGLGMKDYSVPSILA 260
Query: 261 NQGLIPNSFSMCFGS--DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNA 318
+ NSFSMCFG+ D GRISFGDKG Q ETP + PTY + +T+VSVGG+
Sbjct: 261 KAKITANSFSMCFGNIIDVIGRISFGDKGYTDQMETPLLPTEPSPTYAVNVTEVSVGGDV 320
Query: 319 VNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNF 378
V + A+FD+GTSFT+L +P Y I++ F+ +KR ++PFE+CY LSPN T
Sbjct: 321 VGVQLLALFDTGTSFTHLLEPEYGLITKAFDDHVTDKRRPIDPEIPFEFCYDLSPNSTTI 380
Query: 379 EYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDN--VNIIGREY 429
+P V +T +GG F+ +P+ IV +E +YCLG++KS + +NIIG+ +
Sbjct: 381 LFPRVAMTFEGGSLMFLRNPLFIVWNE-DNTAMYCLGILKSVDFKINIIGQNF 432
>gi|42565828|ref|NP_190704.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332645262|gb|AEE78783.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 488
Score = 354 bits (908), Expect = 5e-95, Method: Compositional matrix adjust.
Identities = 185/416 (44%), Positives = 266/416 (63%), Gaps = 16/416 (3%)
Query: 28 GTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPL 87
G+ F+ HHR+S+ VK +L LP+ GS YY AL HRDR GR L + N++T +
Sbjct: 20 GSLSFEIHHRFSEQVKTVLGGHGLPEMGSLDYYKALVHRDR-----GRQLTSNNNNQTTI 74
Query: 88 TFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVS-CVHGLNSSS 146
+F+ GN T + FLHY NV++G PA F+VALDTGSDLFWLPC+C S CV + +
Sbjct: 75 SFAQGNST---EEISFLHYANVTIGTPAQWFLVALDTGSDLFWLPCNCNSTCVRSMETDQ 131
Query: 147 GQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVED 206
G+ I NIY+P+ S +SSKV CNSTLC L+ +C S S+CPY++RYLS G+ STG LVED
Sbjct: 132 GERIKLNIYNPSKSKSSSKVTCNSTLCALRNRCISPVSDCPYRIRYLSPGSKSTGVLVED 191
Query: 207 VLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIP 266
V+H++T+E +++ D+RI+FGC Q G F + A NG+ GL + +VP++L G+
Sbjct: 192 VIHMSTEEGEAR--DARITFGCSESQLGLFKE-VAVNGIMGLAIADIAVPNMLVKAGVAS 248
Query: 267 NSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAI 326
+SFSMCFG +G G ISFGDKGS Q ETP S + Y+++IT+ VG V+ EF+A
Sbjct: 249 DSFSMCFGPNGKGTISFGDKGSSDQLETPLSGTISPMFYDVSITKFKVGKVTVDTEFTAT 308
Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLT 386
FDSGT+ T+L +P YT ++ F+ ++R + + D PFE+CY+++ + P V+
Sbjct: 309 FDSGTAVTWLIEPYYTALTTNFHLSVPDRRLSKSVDSPFEFCYIITSTSDEDKLPSVSFE 368
Query: 387 MKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVN--IIGREYPIANNISLFHN 440
MKGG + V PI++ + +YCL V+K N + IIG+ + N + H+
Sbjct: 369 MKGGAAYDVFSPILVFDTSDGSFQVYCLAVLKQVNADFSIIGQNF--MTNYRIVHD 422
>gi|297819836|ref|XP_002877801.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297323639|gb|EFH54060.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 513
Score = 348 bits (893), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 186/432 (43%), Positives = 266/432 (61%), Gaps = 30/432 (6%)
Query: 28 GTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPL 87
G+ F+ HHR+S+ VK +L LP+ GS YY AL HRDR GR L + N++T +
Sbjct: 30 GSLSFEIHHRFSEQVKTVLGGHGLPEMGSLDYYKALVHRDR-----GRRLTSN-NNQTTI 83
Query: 88 TFSAGNDTYRLNS----------LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVS 137
+F+ GN T ++ +LHY NV++G PA F+VALDTGSDLFWLPC+C S
Sbjct: 84 SFAQGNSTEEISLYDQNLAPPLFFNYLHYANVTIGTPAQWFLVALDTGSDLFWLPCNCNS 143
Query: 138 -CVHGLNSSSG------QVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQV 190
CV + + G Q I NIY+P+ S++SSKV CNSTLC L+ +C S S+CPY++
Sbjct: 144 TCVRSMETDQGETHMNAQRIRLNIYNPSISTSSSKVTCNSTLCALRNRCISPLSDCPYRI 203
Query: 191 RYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGM 250
RYLS G+ STG LVEDV+H++T+E +++ D+RI+FGC Q G F + A NG+ GL M
Sbjct: 204 RYLSPGSKSTGVLVEDVIHMSTEEGEAR--DARITFGCSETQLGLFQE-VAVNGIMGLAM 260
Query: 251 DKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITIT 310
+VP++L G+ +SFSMCFG +G G ISFGDKGS Q ETP + Y+++IT
Sbjct: 261 ADIAVPNMLVKAGVASDSFSMCFGPNGKGTISFGDKGSSDQHETPLGGTISPLFYDVSIT 320
Query: 311 QVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYV 370
+ VG V +FSAIFDSGT+ T+L DP YT ++ F+ ++R + D FE+CY+
Sbjct: 321 KFKVGKVTVETKFSAIFDSGTAVTWLLDPYYTALTTNFHLSVPDRRLPANVDSTFEFCYI 380
Query: 371 LSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD--NVNIIGRE 428
++ + P ++ MKGG + V PI++ + +YCL V+K D + NIIG+
Sbjct: 381 ITSTSDEEKLPSISFEMKGGAAYDVFSPILVFDTSDGSFQVYCLAVLKQDKADFNIIGQN 440
Query: 429 YPIANNISLFHN 440
+ N + H+
Sbjct: 441 F--MTNYRIVHD 450
>gi|357117138|ref|XP_003560331.1| PREDICTED: aspartic proteinase-like protein 1-like, partial
[Brachypodium distachyon]
Length = 509
Score = 346 bits (888), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 200/425 (47%), Positives = 260/425 (61%), Gaps = 34/425 (8%)
Query: 28 GTFGFDFHHRYSDPVK------GILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQG 81
G GFD HHR+S VK G A +GS YYSAL+ DR R + A G
Sbjct: 7 GGVGFDLHHRFSPVVKRWAESRGRPAAAAWWPEGSPEYYSALSAHDR-----ARRVLAGG 61
Query: 82 NDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHG 141
++ L+F+ GN T R G LHY V++G P +F+VALDTGSDLFW+PCDC C
Sbjct: 62 KGESLLSFADGNSTTR--HAGSLHYAKVALGTPNATFVVALDTGSDLFWVPCDCKRCAPI 119
Query: 142 LNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTG 201
N+S YSP SSTS V C+ +LC+ C + +CPY V+Y+S T S+G
Sbjct: 120 ANTSE----LLKPYSPRQSSTSKPVTCSHSLCDRPNACGNGNGSCPYTVKYVSANTSSSG 175
Query: 202 FLVEDVLHLATDEKQS---------KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDK 252
LVEDVL++ S ++V +R+ FGCG+ QTG+FLDGAA GL GLGMD+
Sbjct: 176 VLVEDVLYMTRQSSSSRSGNGGNVGEAVGARVVFGCGQEQTGAFLDGAAMEGLLGLGMDR 235
Query: 253 TSVPSILANQGLI-PNSFSMCFGSDGTGRISFGDKGSPG-QGETPFSLRQTHPTYNITIT 310
SVPS+LA GL+ +SFSMCF DG GRI+FG+ G Q ETPF + +T PTYNI++T
Sbjct: 236 VSVPSLLAAAGLVGSDSFSMCFSPDGNGRINFGEPSDAGAQNETPFIVSKTRPTYNISVT 295
Query: 311 QVSVGGN-AVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
V+V G A+ EF+A+ DSGTSFTYLNDPAY+ ++ +FNS +EKR ++ +PFEYCY
Sbjct: 296 AVNVKGKGAMAAEFAAVVDSGTSFTYLNDPAYSLLATSFNSQVREKRANLSASIPFEYCY 355
Query: 370 VLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYL----YCLGVVKSD-NVNI 424
LS QT P V+LT +GG F V P VIV+ E + YCL V KSD ++I
Sbjct: 356 ALSRGQTEVLMPEVSLTTRGGAVFPVTRPFVIVAGETTDGQVHAVGYCLAVFKSDIPIDI 415
Query: 425 IGREY 429
IG+ +
Sbjct: 416 IGQNF 420
>gi|186510920|ref|NP_190702.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332645260|gb|AEE78781.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 530
Score = 343 bits (880), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 192/427 (44%), Positives = 259/427 (60%), Gaps = 16/427 (3%)
Query: 13 LLILLSCCAGCCFGFGTFGFDFHHRYSDPVKGILAVDDL-PKKGSFAYYSALAHRDRYFR 71
+L+L+ C G F F+ HH +SD VK L DDL P+ GS Y+ LAHRDR+
Sbjct: 13 MLVLIFWGLERCEASGKFSFEVHHMFSDVVKQTLGFDDLVPENGSLEYFKVLAHRDRF-- 70
Query: 72 LRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWL 131
+RGRGLA+ N++TPLT N T LN LGFLHY NVS+G PA F+VALDTGSDLFWL
Sbjct: 71 IRGRGLASN-NEETPLTSIGSNLTLALNFLGFLHYANVSLGTPATWFLVALDTGSDLFWL 129
Query: 132 PCDC-VSCVHGLNSSS-GQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQ 189
PC+C +C+H L + + + N+Y+PN S+TSS + C+ C +C S S CPYQ
Sbjct: 130 PCNCGTTCIHDLKDARFSESVPLNLYTPNASTTSSSIRCSDKRCFGSGKCSSPESICPYQ 189
Query: 190 VRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLG 249
+ LS T++TG L++DVLHL T+++ K V++ ++ GCG+ QTG+F A NG+ GL
Sbjct: 190 IA-LSSNTVTTGTLLQDVLHLVTEDEDLKPVNANVTLGCGQNQTGAFQTDIAVNGVLGLS 248
Query: 250 MDKTSVPSILANQGLIPNSFSMCFGS--DGTGRISFGDKGSPGQGETPFSLRQTHPTYNI 307
M + SVPS+LA + NSFSMCFG GRISFGDKG Q ETP +T Y +
Sbjct: 249 MKEYSVPSLLAKANITANSFSMCFGRIISVVGRISFGDKGYTDQEETPLVSLETSTAYGV 308
Query: 308 TITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEY 367
+T VSVGG V+ A+FD+G+SFT L + AY ++ F+ L ++KR D PFE+
Sbjct: 309 NVTGVSVGGVPVDVPLFALFDTGSSFTLLLESAYGVFTKAFDDLMEDKRRPVDPDFPFEF 368
Query: 368 CYVLSPNQTNFEYPVVNLTMKGGGP-------FFVNDPIVIVSSEPKGLYLYCLGVVKSD 420
CY L N + ++ K P ND VS +G +YCLG++KS
Sbjct: 369 CYDLREEHLNSDARPRHMQSKCYNPCRDDFRWRIQNDSQESVSYSNEGTKMYCLGILKSI 428
Query: 421 NVNIIGR 427
N+NIIG+
Sbjct: 429 NLNIIGQ 435
>gi|6562286|emb|CAB62656.1| putative protein [Arabidopsis thaliana]
Length = 518
Score = 343 bits (880), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 192/427 (44%), Positives = 259/427 (60%), Gaps = 16/427 (3%)
Query: 13 LLILLSCCAGCCFGFGTFGFDFHHRYSDPVKGILAVDDL-PKKGSFAYYSALAHRDRYFR 71
+L+L+ C G F F+ HH +SD VK L DDL P+ GS Y+ LAHRDR+
Sbjct: 1 MLVLIFWGLERCEASGKFSFEVHHMFSDVVKQTLGFDDLVPENGSLEYFKVLAHRDRF-- 58
Query: 72 LRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWL 131
+RGRGLA+ N++TPLT N T LN LGFLHY NVS+G PA F+VALDTGSDLFWL
Sbjct: 59 IRGRGLASN-NEETPLTSIGSNLTLALNFLGFLHYANVSLGTPATWFLVALDTGSDLFWL 117
Query: 132 PCDC-VSCVHGLNSSS-GQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQ 189
PC+C +C+H L + + + N+Y+PN S+TSS + C+ C +C S S CPYQ
Sbjct: 118 PCNCGTTCIHDLKDARFSESVPLNLYTPNASTTSSSIRCSDKRCFGSGKCSSPESICPYQ 177
Query: 190 VRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLG 249
+ LS T++TG L++DVLHL T+++ K V++ ++ GCG+ QTG+F A NG+ GL
Sbjct: 178 IA-LSSNTVTTGTLLQDVLHLVTEDEDLKPVNANVTLGCGQNQTGAFQTDIAVNGVLGLS 236
Query: 250 MDKTSVPSILANQGLIPNSFSMCFGS--DGTGRISFGDKGSPGQGETPFSLRQTHPTYNI 307
M + SVPS+LA + NSFSMCFG GRISFGDKG Q ETP +T Y +
Sbjct: 237 MKEYSVPSLLAKANITANSFSMCFGRIISVVGRISFGDKGYTDQEETPLVSLETSTAYGV 296
Query: 308 TITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEY 367
+T VSVGG V+ A+FD+G+SFT L + AY ++ F+ L ++KR D PFE+
Sbjct: 297 NVTGVSVGGVPVDVPLFALFDTGSSFTLLLESAYGVFTKAFDDLMEDKRRPVDPDFPFEF 356
Query: 368 CYVLSPNQTNFEYPVVNLTMKGGGP-------FFVNDPIVIVSSEPKGLYLYCLGVVKSD 420
CY L N + ++ K P ND VS +G +YCLG++KS
Sbjct: 357 CYDLREEHLNSDARPRHMQSKCYNPCRDDFRWRIQNDSQESVSYSNEGTKMYCLGILKSI 416
Query: 421 NVNIIGR 427
N+NIIG+
Sbjct: 417 NLNIIGQ 423
>gi|226501154|ref|NP_001146408.1| uncharacterized protein LOC100279988 [Zea mays]
gi|219887047|gb|ACL53898.1| unknown [Zea mays]
gi|414587777|tpg|DAA38348.1| TPA: hypothetical protein ZEAMMB73_272638 [Zea mays]
Length = 416
Score = 342 bits (877), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 171/326 (52%), Positives = 222/326 (68%), Gaps = 5/326 (1%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
LHY V+VG P +F+VALDTGSDLFWLPC C C ++SG Y P SSTS
Sbjct: 6 LHYALVTVGTPGQTFMVALDTGSDLFWLPCQCDGCTPPATAASGSA---TFYIPGMSSTS 62
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
VPCNS C+LQK+C S CPY++ Y+S GT S+GFLVEDVL+L+T+ + + ++
Sbjct: 63 KAVPCNSNFCDLQKEC-STALQCPYKMVYVSAGTSSSGFLVEDVLYLSTENAHPQILKAQ 121
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISF 283
I GCG+ QTGSFLD AAPNGLFGLG+D+ SVPSILA +GL NSFSMCFG DG GRISF
Sbjct: 122 IMLGCGQTQTGSFLDAAAPNGLFGLGIDEVSVPSILAQKGLTSNSFSMCFGRDGIGRISF 181
Query: 284 GDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQ 343
GD+ S Q ETP + + HPTY ITI+ ++VG + +F IFD+GTSFTYL DPAYT
Sbjct: 182 GDQESSDQEETPLDINRQHPTYAITISGITVGNKPTDMDFITIFDTGTSFTYLADPAYTY 241
Query: 344 ISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVS 403
I+++F++ + R + S +PFEYCY LS ++ F P + L G F V DP ++S
Sbjct: 242 ITQSFHAQVQANRHAADSRIPFEYCYDLSSSEARFPIPDIILRTVTGSMFPVIDPGQVIS 301
Query: 404 SEPKGLYLYCLGVVKSDNVNIIGREY 429
+ + Y+YCL +VKS +NIIG+ +
Sbjct: 302 IQ-EHEYVYCLAIVKSMKLNIIGQNF 326
>gi|242094534|ref|XP_002437757.1| hypothetical protein SORBIDRAFT_10g002060 [Sorghum bicolor]
gi|241915980|gb|EER89124.1| hypothetical protein SORBIDRAFT_10g002060 [Sorghum bicolor]
Length = 575
Score = 338 bits (867), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 202/425 (47%), Positives = 261/425 (61%), Gaps = 45/425 (10%)
Query: 28 GTFGFDFHHRYSDPVK----------------GILAVDDLPKKGSFAYYSALAHRDRYFR 71
G GF+ HHR+S V+ L ++ P GS YYSAL DR
Sbjct: 28 GGIGFNLHHRFSPVVRQWMVDARGGGHGVPGSSWLLPEEAPAVGSPEYYSALLRHDRALF 87
Query: 72 LRGRGLAAQGNDK-TPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFW 130
R RGLA+ + + T LTF+ GN T RL++ +LHY V VG P+ F+VALDTGSDLFW
Sbjct: 88 TRRRGLASAADGQSTTLTFADGNAT-RLDTYEYLHYAEVEVGTPSSKFLVALDTGSDLFW 146
Query: 131 LPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAG---SNCP 187
LPC+C C N S+ +YSP+ SSTS VPC LCE C +AG S+CP
Sbjct: 147 LPCECKLCAK--NGST-------MYSPSLSSTSKTVPCGHPLCERPDACATAGKSSSSCP 197
Query: 188 YQVRYLSDGTMSTGFLVEDVLHLA--TDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGL 245
Y+V+Y+S T S+G LVEDVLHL K+V + I FGCG+VQTG+FL GAA GL
Sbjct: 198 YEVKYVSANTGSSGVLVEDVLHLVDGGGGGGGKAVQAPIVFGCGQVQTGAFLRGAAAGGL 257
Query: 246 FGLGMDKTSVPSILANQGLIP-NSFSMCFGSDGTGRISFGDKGSPGQGETPF----SLRQ 300
GLG+DK SVPS LA+ GL+ +SFSMCF DG GRI+FGD GSP Q ETP SL+
Sbjct: 258 MGLGLDKVSVPSALASSGLVASDSFSMCFSRDGVGRINFGDAGSPDQAETPLIAAGSLQP 317
Query: 301 THPTYNITITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETST 360
++ YNI++ ++V A+ EF+A+ DSGTSFTYL+DPAYT ++ FNS E ET
Sbjct: 318 SY--YNISVGAITVDSKAMAVEFTAVVDSGTSFTYLDDPAYTFLTTNFNSRVSEASETYG 375
Query: 361 SDL-PFEYCYVLSPNQTNFE-YPVVNLTMKGGGPFFVNDPIV-IVSSEPKGLYL---YCL 414
S FE+CY LSP QT+ + P ++LT KGG F + PI+ +++S G Y YCL
Sbjct: 376 SGYEKFEFCYRLSPGQTSMKRLPAMSLTTKGGAVFPITWPIIPVLASTNGGPYHPIGYCL 435
Query: 415 GVVKS 419
G++K+
Sbjct: 436 GIIKT 440
>gi|18855042|gb|AAL79734.1|AC091774_25 putative chloroplast nucleoid DNA-binding protein [Oryza sativa
Japonica Group]
gi|54291046|dbj|BAD61723.1| aspartic proteinase nepenthesin II-like [Oryza sativa Japonica
Group]
gi|125598520|gb|EAZ38300.1| hypothetical protein OsJ_22678 [Oryza sativa Japonica Group]
Length = 551
Score = 338 bits (866), Expect = 4e-90, Method: Compositional matrix adjust.
Identities = 208/428 (48%), Positives = 263/428 (61%), Gaps = 34/428 (7%)
Query: 30 FGFDFHHRYSDPVK---------GILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQ 80
GFD HHRYS V+ G+ GS YYSAL+ D R RGLA Q
Sbjct: 27 LGFDLHHRYSPIVQRWAEERGHAGVSWPAGAEVIGSPEYYSALSRHDHALFAR-RGLA-Q 84
Query: 81 GNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVH 140
G+ +TF+ GN T RL+ G LHY V+VG P +F+VALDTGSDLFW+PCDC C
Sbjct: 85 GDGL--VTFADGNITLRLD--GSLHYAEVAVGTPNTTFLVALDTGSDLFWVPCDCKQCAP 140
Query: 141 GLNSSS---GQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGT 197
N ++ G + YSP+ SSTS V C S LC+ C +A S+CPY VRY T
Sbjct: 141 LGNLTAVDGGGGPELRQYSPSKSSTSKTVTCASNLCDQPNACATATSSCPYAVRYAMANT 200
Query: 198 MSTGFLVEDVLHLATDEKQSKS-----VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDK 252
S+G LVEDVL+L ++ + + V + + FGCG+VQTGSFLDGAA +GL GLGM+K
Sbjct: 201 SSSGELVEDVLYLTREKGAAAAAAGAAVRTPVVFGCGQVQTGSFLDGAAADGLMGLGMEK 260
Query: 253 TSVPSILANQGLIP-NSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQ 311
SVPSILA+ G++ NSFSMCF DG GRI+FGD GS Q ETPF ++ TH YNI+IT
Sbjct: 261 VSVPSILASTGVVKSNSFSMCFSKDGLGRINFGDTGSADQSETPFIVKSTHSYYNISITS 320
Query: 312 VSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKR-----ETSTSDLPFE 366
+SVG + F AI DSGTSFTYLNDPAYT + FN+ E+R T + PFE
Sbjct: 321 MSVGDKNLPLGFYAIADSGTSFTYLNDPAYTAYTTNFNAQISERRANFSGSTRSGPFPFE 380
Query: 367 YCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKG----LYLYCLGVVKSD-N 421
YCY LSP+QT E PVV+LT GG F V P+ ++++ + YCL V+KSD
Sbjct: 381 YCYSLSPDQTTVELPVVSLTTNGGAVFPVTSPVYPIAAQMTNGEIRIIGYCLAVIKSDLP 440
Query: 422 VNIIGREY 429
++IIG+ +
Sbjct: 441 IDIIGQNF 448
>gi|125556778|gb|EAZ02384.1| hypothetical protein OsI_24487 [Oryza sativa Indica Group]
Length = 551
Score = 337 bits (864), Expect = 7e-90, Method: Compositional matrix adjust.
Identities = 207/428 (48%), Positives = 263/428 (61%), Gaps = 34/428 (7%)
Query: 30 FGFDFHHRYSDPVK---------GILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQ 80
GFD HHRYS V+ G+ GS YYSAL+ D R RGLA Q
Sbjct: 27 LGFDLHHRYSPIVQRWAEERGHAGVSWPAGAEVIGSPEYYSALSRHDHALFAR-RGLA-Q 84
Query: 81 GNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVH 140
G+ +TF+ GN T RL+ G LHY V+VG P +F+VALDTGSDLFW+PCDC C
Sbjct: 85 GDGL--VTFADGNITLRLD--GSLHYAEVAVGTPNTTFLVALDTGSDLFWVPCDCKQCAP 140
Query: 141 GLNSSS---GQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGT 197
N ++ G + YSP+ SSTS V C S LC+ C +A S+CPY VRY T
Sbjct: 141 LGNLTAVDGGGGPELRQYSPSKSSTSKTVTCASNLCDQPNACATATSSCPYAVRYAMANT 200
Query: 198 MSTGFLVEDVLHLATDEKQSKS-----VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDK 252
S+G LVEDVL+L ++ + + V + + FGCG+VQTGSFLDGAA +GL GLGM+K
Sbjct: 201 SSSGELVEDVLYLTREKGAAAAAAGAAVRTPVVFGCGQVQTGSFLDGAAADGLMGLGMEK 260
Query: 253 TSVPSILANQGLIP-NSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQ 311
SVPSILA+ G++ NSFSMCF DG GRI+FGD GS Q ETPF ++ TH YNI+IT
Sbjct: 261 VSVPSILASTGVVKSNSFSMCFSKDGLGRINFGDTGSADQSETPFIVKSTHSYYNISITS 320
Query: 312 VSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKR-----ETSTSDLPFE 366
+SVG + F AI DSGTSFTYLNDPAYT + FN+ E+R T + PFE
Sbjct: 321 MSVGDKNLPLGFYAIADSGTSFTYLNDPAYTAYTTNFNAQISERRANFSGSTRSGPFPFE 380
Query: 367 YCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKG----LYLYCLGVVKSD-N 421
YCY LSP+QT E P+V+LT GG F V P+ ++++ + YCL V+KSD
Sbjct: 381 YCYSLSPDQTTVELPIVSLTTNGGAVFPVTSPVYPIAAQMTNGEIRIIGYCLAVIKSDLP 440
Query: 422 VNIIGREY 429
++IIG+ +
Sbjct: 441 IDIIGQNF 448
>gi|414888271|tpg|DAA64285.1| TPA: hypothetical protein ZEAMMB73_923514, partial [Zea mays]
Length = 335
Score = 322 bits (824), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 178/317 (56%), Positives = 223/317 (70%), Gaps = 12/317 (3%)
Query: 33 DFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAG 92
D HHRYS V+ A P G+ YY+ALA D LR R LA G + F+ G
Sbjct: 25 DVHHRYSATVRE-WAGHRAPPAGTAEYYAALAGHD----LRRRSLAGGGE----VAFADG 75
Query: 93 NDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDF 152
NDTYRLN LGFLHY V++G P ++F+VALDTGSDLFW+PCDC++C L S + + + F
Sbjct: 76 NDTYRLNELGFLHYAVVALGTPNVTFLVALDTGSDLFWVPCDCINCAP-LVSPNYRDLKF 134
Query: 153 NIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLAT 212
+ YSP SSTS KVPC+S LC+ Q C SA S+CPY ++YLSD T STG LVEDVL+L T
Sbjct: 135 DTYSPQKSSTSRKVPCSSNLCDEQSACRSASSSCPYSIQYLSDNTSSTGVLVEDVLYLVT 194
Query: 213 DE-KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIP-NSFS 270
+ +Q K V + I+FGCGR QTGSFL AAPNGL GLGMD SVPS+LA+QG+ NSFS
Sbjct: 195 EYGRQPKIVTAPITFGCGRTQTGSFLGTAAPNGLLGLGMDTISVPSLLASQGVAAANSFS 254
Query: 271 MCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIFDSG 330
MCF DG GRI+FGD GS Q ETP ++ + +P YNI+IT +VG +++ +F+AI DSG
Sbjct: 255 MCFAQDGHGRINFGDTGSSDQQETPLNMYKQNPYYNISITGATVGSKSIHTKFNAIVDSG 314
Query: 331 TSFTYLNDPAYTQISET 347
TSFT L+DP YTQI+ +
Sbjct: 315 TSFTALSDPMYTQITSS 331
>gi|3036792|emb|CAA18482.1| putative protein (fragment) [Arabidopsis thaliana]
Length = 335
Score = 301 bits (772), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 141/251 (56%), Positives = 187/251 (74%), Gaps = 4/251 (1%)
Query: 119 IVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQ 178
+VALDTGSDLFW+PCDC C ++ + +IY+P S+T+ KV CN++LC + Q
Sbjct: 1 MVALDTGSDLFWVPCDCGKCAPTEGATYASEFELSIYNPKVSTTNKKVTCNNSLCAQRNQ 60
Query: 179 CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLD 238
C S CPY V Y+S T ++G L+EDV+HL T++K + V++ ++FGCG+VQ+GSFLD
Sbjct: 61 CLGTFSTCPYMVSYVSAQTSTSGILMEDVMHLTTEDKNPERVEAYVTFGCGQVQSGSFLD 120
Query: 239 GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSL 298
AAPNGLFGLGM+K SVPS+LA +GL+ +SFSMCFG DG GRISFGDKGS Q ETPF+L
Sbjct: 121 IAAPNGLFGLGMEKISVPSVLAREGLVADSFSMCFGHDGVGRISFGDKGSSDQEETPFNL 180
Query: 299 RQTHPTYNITITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRET 358
+HP YNIT+T+V VG ++ EF+A+FD+GTSFTYL DP YT +SE+ A++KR +
Sbjct: 181 NPSHPNYNITVTRVRVGTTLIDDEFTALFDTGTSFTYLVDPMYTTVSES----AQDKRHS 236
Query: 359 STSDLPFEYCY 369
S +PFEYCY
Sbjct: 237 PDSRIPFEYCY 247
>gi|359496966|ref|XP_002269916.2| PREDICTED: aspartic proteinase-like protein 1-like, partial [Vitis
vinifera]
Length = 294
Score = 286 bits (732), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 141/202 (69%), Positives = 166/202 (82%), Gaps = 2/202 (0%)
Query: 228 CGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKG 287
CG+VQTGSFL+GAAPNGLFGLGM SVPSILA +GL+ +SFSMCFG+DGTGRISFGD+G
Sbjct: 1 CGKVQTGSFLEGAAPNGLFGLGMGSISVPSILAKEGLVADSFSMCFGNDGTGRISFGDEG 60
Query: 288 SPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISET 347
S GQ ETPF+ ++ YNI+ITQ+SVGG + + F AIFDSGTSFTYLNDPAYT ISE+
Sbjct: 61 SSGQEETPFNPSKSQLLYNISITQISVGGTSADLNFDAIFDSGTSFTYLNDPAYTSISES 120
Query: 348 FNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPK 407
FN AK+KR +S SDLPFEYCY +S QT EYP+VNLTMKGG FFV DPIVIVS +
Sbjct: 121 FNLRAKDKRSSSDSDLPFEYCYDISEQQTTVEYPIVNLTMKGGDNFFVTDPIVIVSIQ-- 178
Query: 408 GLYLYCLGVVKSDNVNIIGREY 429
G Y+YCLGVVKS ++NIIG+ +
Sbjct: 179 GGYVYCLGVVKSGDINIIGQNF 200
>gi|296084698|emb|CBI25840.3| unnamed protein product [Vitis vinifera]
Length = 306
Score = 286 bits (731), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 141/202 (69%), Positives = 166/202 (82%), Gaps = 2/202 (0%)
Query: 228 CGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKG 287
CG+VQTGSFL+GAAPNGLFGLGM SVPSILA +GL+ +SFSMCFG+DGTGRISFGD+G
Sbjct: 13 CGKVQTGSFLEGAAPNGLFGLGMGSISVPSILAKEGLVADSFSMCFGNDGTGRISFGDEG 72
Query: 288 SPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISET 347
S GQ ETPF+ ++ YNI+ITQ+SVGG + + F AIFDSGTSFTYLNDPAYT ISE+
Sbjct: 73 SSGQEETPFNPSKSQLLYNISITQISVGGTSADLNFDAIFDSGTSFTYLNDPAYTSISES 132
Query: 348 FNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPK 407
FN AK+KR +S SDLPFEYCY +S QT EYP+VNLTMKGG FFV DPIVIVS +
Sbjct: 133 FNLRAKDKRSSSDSDLPFEYCYDISEQQTTVEYPIVNLTMKGGDNFFVTDPIVIVSIQ-- 190
Query: 408 GLYLYCLGVVKSDNVNIIGREY 429
G Y+YCLGVVKS ++NIIG+ +
Sbjct: 191 GGYVYCLGVVKSGDINIIGQNF 212
>gi|6580159|emb|CAB62657.2| putative protein [Arabidopsis thaliana]
Length = 475
Score = 280 bits (715), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 170/416 (40%), Positives = 227/416 (54%), Gaps = 73/416 (17%)
Query: 24 CFGF------GTFGFDFHHRYSDPVKGILAVDDL-PKKGSFAYYSALAHRDRYFRLRGRG 76
C+GF G FGF+ HH +SD VK L + DL P++GS Y+ LAHRDR +RGRG
Sbjct: 17 CWGFERCEATGKFGFEVHHIFSDSVKQSLGLGDLVPEQGSLEYFKVLAHRDRL--IRGRG 74
Query: 77 LAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDC- 135
LA+ ND+TP+TF GN T + LG L+Y NVSVG P SF+VALDTGSDLFWLPC+C
Sbjct: 75 LASN-NDETPITFDGGNLTVSVKLLGSLYYANVSVGTPPSSFLVALDTGSDLFWLPCNCG 133
Query: 136 VSCVHGLNS-SSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLS 194
+C+ L Q + N+Y+PN S+TSS + C+ C K+C S S CPYQ+ Y S
Sbjct: 134 TTCIRDLEDIGVPQSVPLNLYTPNASTTSSSIRCSDKRCFGSKKCSSPSSICPYQISY-S 192
Query: 195 DGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTS 254
+ T + G L++DVLHLAT+++ V + ++ GCG+ QTG F + NG+ GLG+ S
Sbjct: 193 NSTGTKGTLLQDVLHLATEDENLTPVKANVTLGCGQKQTGLFQRNNSVNGVLGLGIKGYS 252
Query: 255 VPSILANQGLIPNSFSMCFGS--DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQV 312
VPS+LA + NSFSMCFG GRISFG
Sbjct: 253 VPSLLAKANITANSFSMCFGRVIGNVGRISFG---------------------------- 284
Query: 313 SVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISET-FNSLAKEKRETSTSDLPFEYCYVL 371
D YT ET F S+A +R +LPFE+CY L
Sbjct: 285 -------------------------DRGYTDQEETPFISVAPRRRPVD-PELPFEFCYDL 318
Query: 372 SPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPK---GLYLYCLGVVKSDNVNI 424
SPN T ++P+V +T GG +N+P ++ + G +YCLGV+KS + I
Sbjct: 319 SPNATTIQFPLVEMTFIGGSKIILNNPFFTARTQARHGEGNVMYCLGVLKSVGLKI 374
>gi|297819832|ref|XP_002877799.1| hypothetical protein ARALYDRAFT_906483 [Arabidopsis lyrata subsp.
lyrata]
gi|297323637|gb|EFH54058.1| hypothetical protein ARALYDRAFT_906483 [Arabidopsis lyrata subsp.
lyrata]
Length = 414
Score = 274 bits (700), Expect = 8e-71, Method: Compositional matrix adjust.
Identities = 160/404 (39%), Positives = 223/404 (55%), Gaps = 59/404 (14%)
Query: 10 VCVLLILLSCCAGC--CFGFGTFGFDFHHRYSDPVKGILAVDDL-PKKGSFAYYSALAHR 66
V VLL +L C G C G F F+ HH +SD VK L DL P+KGS Y+ LA R
Sbjct: 7 VFVLLSVLVACWGLQRCESAGKFSFEVHHMFSDTVKQNLGFGDLVPEKGSLEYFKLLAQR 66
Query: 67 DRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGS 126
DR +RGRGL++ N++ P+TF GN T ++ L GS
Sbjct: 67 DRL--IRGRGLSSN-NEEAPVTFILGNRTVSIDFL-----------------------GS 100
Query: 127 DLFWLPCDC-VSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSN 185
DLFWLPC+C +C+ L D + Q C S S
Sbjct: 101 DLFWLPCNCGTTCIRDLE-------DIGLS--------------------QGGCSSPASV 133
Query: 186 CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGL 245
CPYQ+ YL + T + G L EDVLHL T+++ + V + I+ GCG+ QTG + A NGL
Sbjct: 134 CPYQIPYLFNTTSTRGTLFEDVLHLVTEDEGLEPVKANITLGCGQNQTGLYRKSLAVNGL 193
Query: 246 FGLGMDKTSVPSILANQGLIPNSFSMCFGS--DGTGRISFGDKGSPGQGETPFSLRQTHP 303
GLGM SVPS+LA + + NSFSMCFG+ D GRISFGD+G Q +TP + +P
Sbjct: 194 LGLGMKDYSVPSVLAKENITANSFSMCFGNIIDFIGRISFGDRGHTDQLQTPLVPIEPNP 253
Query: 304 TYNITITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDL 363
TY + +T+V+VGG+ + + A+FD+GTSFT+L +PAY +++ F+ +KR ++
Sbjct: 254 TYAVNVTEVTVGGDILEIQMLALFDTGTSFTHLLEPAYGLLTKAFDDHVTDKRRPIDPEI 313
Query: 364 PFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPK 407
PFE+CY SPN +F++P VN+T GG + DP+ V +E +
Sbjct: 314 PFEFCYDTSPNIKSFKFPRVNMTFVGGSKLTLRDPLFTVWNEAR 357
>gi|449517142|ref|XP_004165605.1| PREDICTED: aspartic proteinase-like protein 1-like, partial
[Cucumis sativus]
Length = 430
Score = 271 bits (692), Expect = 7e-70, Method: Compositional matrix adjust.
Identities = 141/287 (49%), Positives = 188/287 (65%), Gaps = 12/287 (4%)
Query: 152 FNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLA 211
N YSPN S+TSS VPC S+LC +C S + CPY++RYLS T S G+LVEDVLHLA
Sbjct: 3 LNHYSPNDSTTSSTVPCTSSLC---NRCTSNQNVCPYEMRYLSANTSSIGYLVEDVLHLA 59
Query: 212 TDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSM 271
TD+ K V+++I+FGCG VQTG F AAPNGL GLGM+K SVPS LA+QGL NSFSM
Sbjct: 60 TDDSLLKPVEAKITFGCGTVQTGIFATTAAPNGLIGLGMEKISVPSFLADQGLTSNSFSM 119
Query: 272 CFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIFDSGT 331
CFG+DG GRI FGD G Q +TPF+ + +YN+T ++VGG + F+AIFDSGT
Sbjct: 120 CFGADGYGRIDFGDTGPADQKQTPFNTMLEYQSYNVTFNVINVGGEPNDVPFTAIFDSGT 179
Query: 332 SFTYLNDPAYTQISETFNSLAKEKRETS-TSDLPFEYCYVLSPNQTNFEYPVVNLTMKGG 390
SFTYL +PAY+ I++ ++ K KR + + PFEYCY + P F+Y +N TMKGG
Sbjct: 180 SFTYLTEPAYSTITKQMDAGMKLKRYSLFGPNFPFEYCYEIPPGAKEFQYLTLNFTMKGG 239
Query: 391 GPFFVNDPIVIVSSEPKGL--------YLYCLGVVKSDNVNIIGREY 429
F D V + + + ++ CL + KS ++++IG+ +
Sbjct: 240 DEFTPTDIFVFLPVDVSTMNIIFEETTHVACLAIAKSTDIDLIGQNF 286
>gi|413924530|gb|AFW64462.1| hypothetical protein ZEAMMB73_591827, partial [Zea mays]
Length = 469
Score = 258 bits (659), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 157/405 (38%), Positives = 213/405 (52%), Gaps = 16/405 (3%)
Query: 36 HRYSDPVKGILA--VDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGN 93
HR SD + + V P++GS YY AL D + + R LA K TFS GN
Sbjct: 33 HRLSDEARLEVGPRVGWWPQRGSGEYYRALVRSD--IQRQKRRLAVLSLSKGGSTFSPGN 90
Query: 94 DTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFN 153
D LG+L+Y V VG PA SF+VALDTGSDLFW+PCDC+ C D
Sbjct: 91 D------LGWLYYAWVDVGTPATSFLVALDTGSDLFWVPCDCIQCAPLSGYRGNLDRDLR 144
Query: 154 IYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
IY P S+TS +PC+ LC+ C + CPY + Y S+ T S+G L+ED LHL
Sbjct: 145 IYRPAESTTSRHLPCSHELCQSVPGCTNPKQPCPYNIDYFSENTTSSGLLIEDTLHLNYR 204
Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
E V++ + GCG+ Q+G +LDG AP+GL GLGM SVPS LA GL+ NSFSMCF
Sbjct: 205 EDHVP-VNASVIIGCGQKQSGDYLDGIAPDGLLGLGMADISVPSFLARAGLVQNSFSMCF 263
Query: 274 GSDGTGRISFGDKGSPGQGETPF-SLRQTHPTYNITITQVSVGGNAVN-FEFSAIFDSGT 331
D +GRI FGD+G P Q TPF L TY + + + +G + F A+ DSGT
Sbjct: 264 KEDSSGRIFFGDQGVPSQQSTPFVPLYGKLQTYAVNVDKSCIGHKCLEGTSFKALVDSGT 323
Query: 332 SFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGG 391
SFT L Y + F+ R D ++YCY SP + + P + LT
Sbjct: 324 SFTSLPFDVYKAFTMEFDKQMNATR-VPYEDTTWKYCYSASPLEMP-DVPTITLTFAADK 381
Query: 392 PFFVNDPIVIVSSEPKGLYLYCLGVVKS-DNVNIIGREYPIANNI 435
+PI+ + + L +CL V+ S + + II + + + ++
Sbjct: 382 SLQAVNPILPFNDKQGALAGFCLAVLPSTEPIGIIAQNFLVGYHV 426
>gi|226495123|ref|NP_001141522.1| uncharacterized protein LOC100273634 precursor [Zea mays]
gi|194704920|gb|ACF86544.1| unknown [Zea mays]
gi|223949445|gb|ACN28806.1| unknown [Zea mays]
gi|413924531|gb|AFW64463.1| pepsin A [Zea mays]
Length = 515
Score = 258 bits (658), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 157/405 (38%), Positives = 213/405 (52%), Gaps = 16/405 (3%)
Query: 36 HRYSDPVKGILA--VDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGN 93
HR SD + + V P++GS YY AL D + + R LA K TFS GN
Sbjct: 33 HRLSDEARLEVGPRVGWWPQRGSGEYYRALVRSD--IQRQKRRLAVLSLSKGGSTFSPGN 90
Query: 94 DTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFN 153
D LG+L+Y V VG PA SF+VALDTGSDLFW+PCDC+ C D
Sbjct: 91 D------LGWLYYAWVDVGTPATSFLVALDTGSDLFWVPCDCIQCAPLSGYRGNLDRDLR 144
Query: 154 IYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
IY P S+TS +PC+ LC+ C + CPY + Y S+ T S+G L+ED LHL
Sbjct: 145 IYRPAESTTSRHLPCSHELCQSVPGCTNPKQPCPYNIDYFSENTTSSGLLIEDTLHLNYR 204
Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
E V++ + GCG+ Q+G +LDG AP+GL GLGM SVPS LA GL+ NSFSMCF
Sbjct: 205 EDHVP-VNASVIIGCGQKQSGDYLDGIAPDGLLGLGMADISVPSFLARAGLVQNSFSMCF 263
Query: 274 GSDGTGRISFGDKGSPGQGETPF-SLRQTHPTYNITITQVSVGGNAVN-FEFSAIFDSGT 331
D +GRI FGD+G P Q TPF L TY + + + +G + F A+ DSGT
Sbjct: 264 KEDSSGRIFFGDQGVPSQQSTPFVPLYGKLQTYAVNVDKSCIGHKCLEGTSFKALVDSGT 323
Query: 332 SFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGG 391
SFT L Y + F+ R D ++YCY SP + + P + LT
Sbjct: 324 SFTSLPFDVYKAFTMEFDKQMNATR-VPYEDTTWKYCYSASPLEMP-DVPTITLTFAADK 381
Query: 392 PFFVNDPIVIVSSEPKGLYLYCLGVVKS-DNVNIIGREYPIANNI 435
+PI+ + + L +CL V+ S + + II + + + ++
Sbjct: 382 SLQAVNPILPFNDKQGALAGFCLAVLPSTEPIGIIAQNFLVGYHV 426
>gi|195619700|gb|ACG31680.1| pepsin A [Zea mays]
Length = 485
Score = 258 bits (658), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 157/405 (38%), Positives = 213/405 (52%), Gaps = 16/405 (3%)
Query: 36 HRYSDPVKGILA--VDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGN 93
HR SD + + V P++GS YY AL D + + R LA K TFS GN
Sbjct: 3 HRLSDEARLEVGPRVGWWPQRGSGEYYRALVRSD--IQRQKRRLAVLSLSKGGSTFSPGN 60
Query: 94 DTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFN 153
D LG+L+Y V VG PA SF+VALDTGSDLFW+PCDC+ C D
Sbjct: 61 D------LGWLYYAWVDVGTPATSFLVALDTGSDLFWVPCDCIQCAPLSGYRGNLDRDLR 114
Query: 154 IYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
IY P S+TS +PC+ LC+ C + CPY + Y S+ T S+G L+ED LHL
Sbjct: 115 IYRPAESTTSRHLPCSHELCQSVPGCTNPKQPCPYNIDYFSENTTSSGLLIEDTLHLNYR 174
Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
E V++ + GCG+ Q+G +LDG AP+GL GLGM SVPS LA GL+ NSFSMCF
Sbjct: 175 EDHVP-VNASVIIGCGQKQSGDYLDGIAPDGLLGLGMADISVPSFLARAGLVQNSFSMCF 233
Query: 274 GSDGTGRISFGDKGSPGQGETPF-SLRQTHPTYNITITQVSVGGNAVN-FEFSAIFDSGT 331
D +GRI FGD+G P Q TPF L TY + + + +G + F A+ DSGT
Sbjct: 234 KEDSSGRIFFGDQGVPSQQSTPFVPLYGKLQTYAVNVDKSCIGHKCLEGTSFKALVDSGT 293
Query: 332 SFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGG 391
SFT L Y + F+ R D ++YCY SP + + P + LT
Sbjct: 294 SFTSLPLDVYKAFTMEFDKQMNATR-VPYEDTTWKYCYSASPLEMP-DVPTITLTFAADK 351
Query: 392 PFFVNDPIVIVSSEPKGLYLYCLGVVKS-DNVNIIGREYPIANNI 435
+PI+ + + L +CL V+ S + + II + + + ++
Sbjct: 352 SLQAVNPILPFNDKQGALAGFCLAVLPSTEPIGIIAQNFLVGYHV 396
>gi|255576176|ref|XP_002528982.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223531572|gb|EEF33401.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 542
Score = 256 bits (655), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 164/415 (39%), Positives = 233/415 (56%), Gaps = 23/415 (5%)
Query: 29 TFGFDFHHRYSDPVKGILA--VDDL----PKKGSFAYYSALAHRDRYFRLRGRGLAAQGN 82
TF HR+SD VK + D L P+K S YY L + D F+ + L Q
Sbjct: 35 TFSSRLIHRFSDEVKALRVSRKDSLSYSWPEKKSMDYYQILVNSD--FQRQKMKLGPQYQ 92
Query: 83 DKTPLTFSAGNDTYRL-NSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHG 141
P S G+ T L + G+LHYT + +G P +SF+VALD GSDL W+PCDC+ C
Sbjct: 93 FLFP---SQGSKTMSLGDDFGWLHYTWIDIGTPHVSFLVALDAGSDLLWVPCDCLQCAP- 148
Query: 142 LNSS--SGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMS 199
L++S S D N YSP+ SSTS + C+ LCEL C S CPY + Y ++ T S
Sbjct: 149 LSASYYSSLDRDLNEYSPSHSSTSKHLSCSHQLCELGPNCNSPKQPCPYSMDYYTENTSS 208
Query: 200 TGFLVEDVLHLAT--DEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPS 257
+G LVED+LHLA+ D S SV + + GCG Q+G +LDG AP+GL GLG+ + SVPS
Sbjct: 209 SGLLVEDILHLASNGDNALSYSVRAPVVIGCGMKQSGGYLDGVAPDGLMGLGLAEISVPS 268
Query: 258 ILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPF-SLRQTHPTYNITITQVSVGG 316
LA GLI NSFSMCF D +GRI FGD+G Q TPF +L + TY + + VG
Sbjct: 269 FLAKAGLIRNSFSMCFDEDDSGRIFFGDQGPTTQQSTPFLTLDGNYTTYVVGVEGFCVGS 328
Query: 317 NAV-NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQ 375
+ + F A+ D+GTSFT+L + Y +I+E F+ +S + P++YCY S N
Sbjct: 329 SCLKQTSFRALVDTGTSFTFLPNGVYERITEEFDRQVNATI-SSFNGYPWKYCYKSSSNH 387
Query: 376 TNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD-NVNIIGREY 429
+ P V L F +++P+ ++ +G+ +CL + ++ ++ IG+ +
Sbjct: 388 LT-KVPSVKLIFPLNNSFVIHNPVFMIYGI-QGITGFCLAIQPTEGDIGTIGQNF 440
>gi|374255989|gb|AEZ00856.1| putative peptidase A1 protein, partial [Elaeis guineensis]
Length = 263
Score = 256 bits (655), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 126/220 (57%), Positives = 159/220 (72%), Gaps = 2/220 (0%)
Query: 210 LATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSF 269
T+E K V + I FGCG+VQTG+FLD AAPNGLFGLGMDK SVPS+LA++G NSF
Sbjct: 1 FKTEETIPKVVKAPIVFGCGQVQTGAFLDSAAPNGLFGLGMDKVSVPSVLASKGYASNSF 60
Query: 270 SMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIFDS 329
SMCFGSDG GRI FGD GS QGETPF + +HPTYNI++ + VG ++++ SAI DS
Sbjct: 61 SMCFGSDGMGRIYFGDTGSSDQGETPFDVNHSHPTYNISLIGMEVGNSSIDVNSSAIVDS 120
Query: 330 GTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKG 389
GTSFT L DP YT++SE+F++ +E R S +PFEYCY LS NQ + P +NLT KG
Sbjct: 121 GTSFTCLADPMYTKLSESFHAQVRENRHESDPGIPFEYCYGLSRNQNSILLPKINLTTKG 180
Query: 390 GGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREY 429
G F +NDPI+++SSE YCLG+VKS +NIIG+ +
Sbjct: 181 GSQFPINDPIIVISSEQSS--FYCLGIVKSSQLNIIGQNF 218
>gi|219887985|gb|ACL54367.1| unknown [Zea mays]
Length = 515
Score = 256 bits (653), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 156/405 (38%), Positives = 212/405 (52%), Gaps = 16/405 (3%)
Query: 36 HRYSDPVKGILA--VDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGN 93
HR SD + + V P++GS YY AL D + + R LA K TFS GN
Sbjct: 33 HRLSDEARLEVGPRVGWWPQRGSGEYYRALVRSD--IQRQKRRLAVLSLSKGGSTFSPGN 90
Query: 94 DTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFN 153
D LG+L+Y V VG PA SF+VALDTGSDLFW+PCDC+ C D
Sbjct: 91 D------LGWLYYAWVDVGTPATSFLVALDTGSDLFWVPCDCIQCAPLSGYRGNLDRDLR 144
Query: 154 IYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
IY P S+TS +PC+ LC+ C + CPY + Y S+ T S+G L+ED LHL
Sbjct: 145 IYRPAESTTSRHLPCSHELCQSVPGCTNPKQPCPYNIDYFSENTTSSGLLIEDTLHLNYR 204
Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
E V++ + GCG+ Q+G +LDG AP+GL LGM SVPS LA GL+ NSFSMCF
Sbjct: 205 EDHVP-VNASVIIGCGQKQSGDYLDGIAPDGLLALGMADISVPSFLARAGLVQNSFSMCF 263
Query: 274 GSDGTGRISFGDKGSPGQGETPF-SLRQTHPTYNITITQVSVGGNAVN-FEFSAIFDSGT 331
D +GRI FGD+G P Q TPF L TY + + + +G + F A+ DSGT
Sbjct: 264 KEDSSGRIFFGDQGVPSQQSTPFVPLYGKLQTYAVNVDKSCIGHKCLEGTSFKALVDSGT 323
Query: 332 SFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGG 391
SFT L Y + F+ R D ++YCY SP + + P + LT
Sbjct: 324 SFTSLPFDVYKAFTMEFDKQMNATR-VPYEDTTWKYCYSASPLEMP-DVPTITLTFAADK 381
Query: 392 PFFVNDPIVIVSSEPKGLYLYCLGVVKS-DNVNIIGREYPIANNI 435
+PI+ + + L +CL V+ S + + II + + + ++
Sbjct: 382 SLQAVNPILPFNDKQGALAGFCLAVLPSTEPIGIIAQNFLVGYHV 426
>gi|224083757|ref|XP_002307112.1| predicted protein [Populus trichocarpa]
gi|222856561|gb|EEE94108.1| predicted protein [Populus trichocarpa]
Length = 492
Score = 254 bits (649), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 161/433 (37%), Positives = 239/433 (55%), Gaps = 24/433 (5%)
Query: 12 VLLILLSCCAGCCFGFGTFGFDFHHRYSDPVK-------GILAVDDLPKKGSFAYYSALA 64
++L++ S TF HR+S K G + P+K S YY L
Sbjct: 2 LILVMSSFLVQNTVELATFSSRLIHRFSKEYKEVSVSRGGDVNGTWWPEKKSKEYYQILV 61
Query: 65 HRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRL-NSLGFLHYTNVSVGQPALSFIVALD 123
D L+ + L G L S G+ T L N G+LHYT + +G P +SF+VALD
Sbjct: 62 SSD----LKRQKLKL-GPHYQLLFPSQGSKTMSLGNDFGWLHYTWIDIGTPHVSFMVALD 116
Query: 124 TGSDLFWLPCDCVSCVHGLNSSSGQVIDFNI--YSPNTSSTSSKVPCNSTLCELQKQCPS 181
+GSDLFW+PCDCV C L++S +D ++ YSP+ SSTS ++ C+ LC++ C +
Sbjct: 117 SGSDLFWVPCDCVQCAP-LSASHYSSLDRDLSEYSPSQSSTSKQLSCSHRLCDMGPNCKN 175
Query: 182 AGSNCPYQVRYLSDGTMSTGFLVEDVLHLAT--DEKQSKSVDSRISFGCGRVQTGSFLDG 239
+CPY + Y ++ T S+G LVED++HLA+ D+ + SV + + GCG Q+G +LDG
Sbjct: 176 PKQSCPYSINYYTESTSSSGLLVEDIIHLASGGDDTLNTSVKAPVIIGCGMKQSGGYLDG 235
Query: 240 AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPF-SL 298
AP+GL GLG+ + SVPS LA GLI NSFSMCF D +GRI FGD+G Q PF L
Sbjct: 236 VAPDGLLGLGLQEISVPSFLAKAGLIQNSFSMCFNEDDSGRIFFGDQGPATQQSAPFLKL 295
Query: 299 RQTHPTYNITITQVSVGGNAV-NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRE 357
+ TY + + VG + + FSA+ DSGTSFT+L D + I+E F++ R
Sbjct: 296 NGNYTTYIVGVEVCCVGTSCLKQSSFSALVDSGTSFTFLPDDVFEMIAEEFDTQVNASR- 354
Query: 358 TSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVV 417
+S ++YCY S +Q + P + L F V +P+ ++ +G+ +CL +
Sbjct: 355 SSFEGYSWKYCYKTS-SQDLPKIPSLRLIFPQNNSFMVQNPVFMIYGI-QGVIGFCLAIQ 412
Query: 418 KSD-NVNIIGREY 429
+D ++ IG+ +
Sbjct: 413 PADGDIGTIGQNF 425
>gi|296082464|emb|CBI21469.3| unnamed protein product [Vitis vinifera]
Length = 530
Score = 254 bits (648), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 160/441 (36%), Positives = 230/441 (52%), Gaps = 26/441 (5%)
Query: 1 MASSYRNSPVCVLLILLSCCAGCCFGFGTFGFDFHHRYSDPVKGILAVDD-----LPKKG 55
MA+ + + V+L++ SC A F HR+SD VK A P+
Sbjct: 1 MAARFLVAMSVVVLLIESCMAA------MFSARLIHRFSDEVKAFRAARSGLSGSWPEWR 54
Query: 56 SFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRL-NSLGFLHYTNVSVGQP 114
+ YY L D R G+ L S G+ T N G+LHYT + +G P
Sbjct: 55 TMEYYKMLVRSDW-----ERQKVMLGSKYQFLFPSEGSKTMSFGNDYGWLHYTWIDIGTP 109
Query: 115 ALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQV-IDFNIYSPNTSSTSSKVPCNSTLC 173
+SF+VALD GSDL W+PCDC+ C S G + D N YSP+ SSTS + C+ LC
Sbjct: 110 NISFLVALDAGSDLLWIPCDCIQCAPLSASYYGSLDRDLNQYSPSGSSTSKHLSCSHQLC 169
Query: 174 ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLAT--DEKQSKSVDSRISFGCGRV 231
E C S CPY + Y S+ T S+G L+ED+LHL + D+ + SV + + GCG
Sbjct: 170 ESSPNCDSPKQLCPYTINYYSENTSSSGLLIEDILHLTSGIDDASNSSVRAPVIIGCGMR 229
Query: 232 QTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQ 291
QTG +LDG AP+GL GLG+ + SVPS L+ GL+ NSFS+CF D +GRI FGD+G Q
Sbjct: 230 QTGGYLDGVAPDGLMGLGLGEISVPSFLSKAGLVKNSFSLCFNDDDSGRIFFGDQGLATQ 289
Query: 292 GETPFSLRQ-THPTYNITITQVSVGGNAV-NFEFSAIFDSGTSFTYLNDPAYTQISETFN 349
T F + TY + + +G + + F A+ DSG SFT+L D +Y + + F+
Sbjct: 290 QTTLFLPSDGKYETYIVGVEACCIGSSCIKQTSFRALVDSGASFTFLPDESYRNVVDEFD 349
Query: 350 SLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGL 409
R S P+EYCY S + + P V L F V++P+ +V +G+
Sbjct: 350 KQVNATR-FSFEGYPWEYCYKSSSKEL-LKNPSVILKFALNNSFVVHNPVFVVHGY-QGV 406
Query: 410 YLYCLGVVKSD-NVNIIGREY 429
+CL + +D ++ I+G+ +
Sbjct: 407 VGFCLAIQPADGDIGILGQNF 427
>gi|302141912|emb|CBI19115.3| unnamed protein product [Vitis vinifera]
Length = 521
Score = 254 bits (648), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 164/415 (39%), Positives = 223/415 (53%), Gaps = 21/415 (5%)
Query: 29 TFGFDFHHRYSDPVK-------GILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQG 81
TF HR+SD K G + D PKK SF YY L D L+ + L G
Sbjct: 14 TFSSKLIHRFSDEAKAFFVSRNGNIFADSWPKKRSFDYYRLLLSSD----LKRQKLKL-G 68
Query: 82 NDKTPLTFSAGNDTYRL-NSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVH 140
+ L S G+D L N G+LHYT + +G P +SF+VALD GSDL W+PCDC+ C
Sbjct: 69 AEYQLLFPSEGSDALFLGNEFGWLHYTWIDIGTPNVSFLVALDAGSDLLWVPCDCMQCAP 128
Query: 141 GLNSSSGQV-IDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMS 199
S ++ D N YSP+ SSTS + CN LCEL C S+ CPY Y S+ T S
Sbjct: 129 LSASYYDRLGRDLNEYSPSLSSTSKPLSCNDQLCELGSDCKSSKDPCPYLASYYSENTSS 188
Query: 200 TGFLVEDVLHLA--TDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPS 257
+G L+ED LHLA ++ SV + + GCGR Q+G+F DGAAP+GL GLG SVPS
Sbjct: 189 SGLLIEDRLHLAPFSEHASRSSVWASVIIGCGRKQSGAFSDGAAPDGLMGLGPGDLSVPS 248
Query: 258 ILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPF-SLRQTHPTYNITITQVSVGG 316
+LA GL+ N+FS+CF + +G I FGD+G Q T F L TY I + VG
Sbjct: 249 LLAKAGLVRNTFSICFDDNHSGTILFGDQGLVTQKSTSFVPLEGKFVTYLIEVEGYLVGS 308
Query: 317 NAVNFE-FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQ 375
+++ F A+ DSGTSFT+L Y +I F+ R +S P++YCY S +Q
Sbjct: 309 SSLKTAGFQALVDSGTSFTFLPYEIYEKIVVEFDKQVNATR-SSFKGSPWKYCYN-SSSQ 366
Query: 376 TNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVK-SDNVNIIGREY 429
P V L F V++P++ + SE + ++CL + + IIG+ +
Sbjct: 367 ELLNIPTVTLVFAMNQSFIVHNPVIKLISENEEFNVFCLPIQPIHEEFGIIGQNF 421
>gi|326532354|dbj|BAK05106.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 564
Score = 254 bits (648), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 161/401 (40%), Positives = 208/401 (51%), Gaps = 21/401 (5%)
Query: 36 HRYSDPVKGILAVD----DLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSA 91
HR SD + LA P+ GS YY AL D + R L + FS
Sbjct: 80 HRLSDEAR--LAAGPHGARWPRHGSGGYYRALVRSDLQRQKRKHQLLSVSEAGG--IFSP 135
Query: 92 GNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVID 151
GND G+L+YT V VG P SF+VALDTGSDLFW+PCDC+ C D
Sbjct: 136 GND------FGWLYYTWVDVGTPNTSFMVALDTGSDLFWVPCDCIECAPLAGYRETLDRD 189
Query: 152 FNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLA 211
IY P S+TS +PC+ LC C S CPY YL + T S+G L+ED+LHL
Sbjct: 190 LGIYKPAESTTSRHLPCSHELCPPGSGCSSPKQPCPYSTDYLQENTTSSGLLIEDILHLD 249
Query: 212 TDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSM 271
+ E + V + + GCGR Q+GS+LDG AP+GL GLGM SVPS LA GL+ NSFSM
Sbjct: 250 SRESHAP-VKASVVIGCGRKQSGSYLDGIAPDGLLGLGMADISVPSFLARAGLVRNSFSM 308
Query: 272 CFGSDGTGRISFGDKGSPGQGETPF-SLRQTHPTYNITITQVSVGGNAVNF-EFSAIFDS 329
CF D +GRI FGD+G Q TPF L + TY + + + VG F A+ DS
Sbjct: 309 CFKED-SGRIFFGDQGVSIQQSTPFVPLYGKYQTYAVNVDKSCVGHKCFEATSFEALVDS 367
Query: 330 GTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKG 389
GTSFT L Y ++ F+ R T D FEYCY SP + + P V LT
Sbjct: 368 GTSFTALPLNVYKAVAVEFDKQVHAPRITQ-EDASFEYCYSASPLKMP-DVPTVTLTFAA 425
Query: 390 GGPFFVNDPIVIVSSEPKGLYLYCLGVVKS-DNVNIIGREY 429
F +P +++ + +CL + KS + + IIG+ +
Sbjct: 426 NKSFQAVNPTIVLKDGEGSVAGFCLALQKSPEPIGIIGQNF 466
>gi|359492825|ref|XP_002284255.2| PREDICTED: aspartic proteinase-like protein 1-like [Vitis vinifera]
Length = 531
Score = 253 bits (647), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 164/415 (39%), Positives = 223/415 (53%), Gaps = 21/415 (5%)
Query: 29 TFGFDFHHRYSDPVK-------GILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQG 81
TF HR+SD K G + D PKK SF YY L D L+ + L G
Sbjct: 24 TFSSKLIHRFSDEAKAFFVSRNGNIFADSWPKKRSFDYYRLLLSSD----LKRQKLKL-G 78
Query: 82 NDKTPLTFSAGNDTYRL-NSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVH 140
+ L S G+D L N G+LHYT + +G P +SF+VALD GSDL W+PCDC+ C
Sbjct: 79 AEYQLLFPSEGSDALFLGNEFGWLHYTWIDIGTPNVSFLVALDAGSDLLWVPCDCMQCAP 138
Query: 141 GLNSSSGQV-IDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMS 199
S ++ D N YSP+ SSTS + CN LCEL C S+ CPY Y S+ T S
Sbjct: 139 LSASYYDRLGRDLNEYSPSLSSTSKPLSCNDQLCELGSDCKSSKDPCPYLASYYSENTSS 198
Query: 200 TGFLVEDVLHLA--TDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPS 257
+G L+ED LHLA ++ SV + + GCGR Q+G+F DGAAP+GL GLG SVPS
Sbjct: 199 SGLLIEDRLHLAPFSEHASRSSVWASVIIGCGRKQSGAFSDGAAPDGLMGLGPGDLSVPS 258
Query: 258 ILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPF-SLRQTHPTYNITITQVSVGG 316
+LA GL+ N+FS+CF + +G I FGD+G Q T F L TY I + VG
Sbjct: 259 LLAKAGLVRNTFSICFDDNHSGTILFGDQGLVTQKSTSFVPLEGKFVTYLIEVEGYLVGS 318
Query: 317 NAVNFE-FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQ 375
+++ F A+ DSGTSFT+L Y +I F+ R +S P++YCY S +Q
Sbjct: 319 SSLKTAGFQALVDSGTSFTFLPYEIYEKIVVEFDKQVNATR-SSFKGSPWKYCYN-SSSQ 376
Query: 376 TNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVK-SDNVNIIGREY 429
P V L F V++P++ + SE + ++CL + + IIG+ +
Sbjct: 377 ELLNIPTVTLVFAMNQSFIVHNPVIKLISENEEFNVFCLPIQPIHEEFGIIGQNF 431
>gi|212722898|ref|NP_001132197.1| pepsin A precursor [Zea mays]
gi|194693730|gb|ACF80949.1| unknown [Zea mays]
gi|195605492|gb|ACG24576.1| pepsin A [Zea mays]
gi|413938914|gb|AFW73465.1| pepsin A [Zea mays]
Length = 519
Score = 253 bits (646), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 163/422 (38%), Positives = 221/422 (52%), Gaps = 32/422 (7%)
Query: 29 TFGFDFHHRYSD-------PVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQG 81
TF HR SD P G+ P++GS YY AL D + + R LA +
Sbjct: 26 TFSSRMVHRLSDEARLEAGPRMGLW-----PQRGSGGYYRALLRSD--LQRQKRRLAGKN 78
Query: 82 N----DKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVS 137
K TFS GND LG+L+Y V VG P SF+VALDTGSDLFW+PCDC+
Sbjct: 79 QLLSLSKGGSTFSPGND------LGWLYYAWVDVGTPTTSFLVALDTGSDLFWVPCDCIQ 132
Query: 138 CVHGLNSSSGQV-IDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDG 196
C L+S G + D IY P S+TS +PC+ LC+ C + C Y + Y S+
Sbjct: 133 CAP-LSSYRGNLDRDLGIYKPAESTTSRHLPCSHELCQPGSGCTNPKQPCTYNIDYFSEN 191
Query: 197 TMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVP 256
T S+G L+ED LHL + E + V++ + GCGR Q+G +LDG AP+GL GLGM SVP
Sbjct: 192 TTSSGLLIEDSLHLNSREGHAP-VNASVIIGCGRKQSGDYLDGIAPDGLLGLGMADISVP 250
Query: 257 SILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPF-SLRQTHPTYNITITQVSVG 315
S LA GL+ NSFSMCF D +GRI FGD+G Q TPF L TY + + + +G
Sbjct: 251 SFLARAGLVRNSFSMCFKEDSSGRIFFGDQGVSSQQSTPFVPLYGKLQTYAVNVDKSCIG 310
Query: 316 GNAVN-FEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPN 374
+ F A+ DSGTSFT L Y + F+ R D ++YCY SP
Sbjct: 311 HKCLEGSSFQALVDSGTSFTSLPPDVYKAFTTEFDKQINASR-VPYEDSTWKYCYSASPL 369
Query: 375 QTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKS-DNVNIIGREYPIAN 433
+ + P + L F +PI+ + E L +CL V+ S + + IIG+ + +
Sbjct: 370 EMP-DVPTIILAFAANKSFQAVNPILPFNDEQGALARFCLAVLPSTEPIGIIGQNFLVGY 428
Query: 434 NI 435
++
Sbjct: 429 HV 430
>gi|115448709|ref|NP_001048134.1| Os02g0751100 [Oryza sativa Japonica Group]
gi|46390211|dbj|BAD15642.1| aspartyl protease-like [Oryza sativa Japonica Group]
gi|113537665|dbj|BAF10048.1| Os02g0751100 [Oryza sativa Japonica Group]
gi|222623681|gb|EEE57813.1| hypothetical protein OsJ_08401 [Oryza sativa Japonica Group]
Length = 520
Score = 251 bits (642), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 152/382 (39%), Positives = 211/382 (55%), Gaps = 14/382 (3%)
Query: 59 YYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRL-NSLGFLHYTNVSVGQPALS 117
Y+ AL D + R G Q L+ S G + N LG+L+YT V VG P S
Sbjct: 60 YFRALVRSDLQRQKRRVGGKYQL-----LSLSQGGSIFPSGNDLGWLYYTWVDVGTPNTS 114
Query: 118 FIVALDTGSDLFWLPCDCVSCVHGLNSSSGQV-IDFNIYSPNTSSTSSKVPCNSTLCELQ 176
F+VALDTGSDLFW+PCDC+ C L+S G + D IY P+ S+TS +PC+ LC
Sbjct: 115 FLVALDTGSDLFWVPCDCIQCAP-LSSYHGSLDRDLGIYKPSESTTSRHLPCSHELCSPA 173
Query: 177 KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSF 236
C + CPY + Y S+ T S+G L+ED+LHL + E + V++ + GCG+ Q+GS+
Sbjct: 174 SGCTNPKQPCPYNIDYFSENTTSSGLLIEDMLHLDSREGHAP-VNASVIIGCGKKQSGSY 232
Query: 237 LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPF 296
L+G AP+GL GLGM SVPS LA GL+ NSFSMCF D +GRI FGD+G P Q TPF
Sbjct: 233 LEGIAPDGLLGLGMADISVPSFLARAGLVRNSFSMCFKKDDSGRIFFGDQGVPTQQSTPF 292
Query: 297 -SLRQTHPTYNITITQVSVGGNAVNFE-FSAIFDSGTSFTYLNDPAYTQISETFNSLAKE 354
+ TY + + + +G F A+ D+GTSFT L AY I+ F+
Sbjct: 293 VPMNGKLQTYAVNVDKYCIGHKCTEGAGFQALVDTGTSFTSLPLDAYKSITMEFDKQINA 352
Query: 355 KRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCL 414
R S+ D FEYCY P + + P + LT F +PI+ + ++CL
Sbjct: 353 SR-ASSDDYSFEYCYSTGPLEMP-DVPTITLTFAENKSFQAVNPILPFNDRQGEFAVFCL 410
Query: 415 GVVKS-DNVNIIGREYPIANNI 435
V+ S + V IIG+ + + ++
Sbjct: 411 AVLPSPEPVGIIGQNFMVGYHV 432
>gi|218191589|gb|EEC74016.1| hypothetical protein OsI_08957 [Oryza sativa Indica Group]
Length = 520
Score = 251 bits (642), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 152/382 (39%), Positives = 211/382 (55%), Gaps = 14/382 (3%)
Query: 59 YYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRL-NSLGFLHYTNVSVGQPALS 117
Y+ AL D + R G Q L+ S G + N LG+L+YT V VG P S
Sbjct: 60 YFRALVRSDLQRQKRRVGGKYQL-----LSLSQGGSIFPSGNDLGWLYYTWVDVGTPNTS 114
Query: 118 FIVALDTGSDLFWLPCDCVSCVHGLNSSSGQV-IDFNIYSPNTSSTSSKVPCNSTLCELQ 176
F+VALDTGSDLFW+PCDC+ C L+S G + D IY P+ S+TS +PC+ LC
Sbjct: 115 FLVALDTGSDLFWVPCDCIQCAP-LSSYHGSLDRDLGIYKPSESTTSRHLPCSHELCSPA 173
Query: 177 KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSF 236
C + CPY + Y S+ T S+G L+ED+LHL + E + V++ + GCG+ Q+GS+
Sbjct: 174 SGCTNPKQPCPYNIDYFSENTTSSGLLIEDMLHLDSREGHAP-VNASVIIGCGKKQSGSY 232
Query: 237 LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPF 296
L+G AP+GL GLGM SVPS LA GL+ NSFSMCF D +GRI FGD+G P Q TPF
Sbjct: 233 LEGIAPDGLLGLGMADISVPSFLARAGLVRNSFSMCFKKDDSGRIFFGDQGVPTQQSTPF 292
Query: 297 -SLRQTHPTYNITITQVSVGGNAVNFE-FSAIFDSGTSFTYLNDPAYTQISETFNSLAKE 354
+ TY + + + +G F A+ D+GTSFT L AY I+ F+
Sbjct: 293 VPMNGKLQTYAVNVDKYCIGHKCTEGAGFQALVDTGTSFTSLPLDAYKSITMEFDKQINA 352
Query: 355 KRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCL 414
R S+ D FEYCY P + + P + LT F +PI+ + ++CL
Sbjct: 353 SR-ASSDDYSFEYCYSTGPLEMP-DVPTITLTFAENKSFQAVNPILPFNDRQGEFAVFCL 410
Query: 415 GVVKS-DNVNIIGREYPIANNI 435
V+ S + V IIG+ + + ++
Sbjct: 411 AVLPSPEPVGIIGQNFMVGYHV 432
>gi|357143901|ref|XP_003573095.1| PREDICTED: aspartic proteinase-like protein 1-like [Brachypodium
distachyon]
Length = 627
Score = 251 bits (642), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 160/392 (40%), Positives = 215/392 (54%), Gaps = 20/392 (5%)
Query: 52 PKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSA-GNDTYRLNSLGFLHYTNVS 110
P++GS YY +L D + R G G L+FS G N G+L+YT V
Sbjct: 158 PRRGSGDYYRSLVRSDLQRQKRRLG----GGKHQLLSFSKDGGIIPTGNDFGWLYYTWVD 213
Query: 111 VGQPALSFIVALDTGSDLFWLPCDCVSC--VHGLNSSSGQVIDFNIYSPNTSSTSSKVPC 168
VG P SF+VALDTGSDLFW+PCDC+ C + G + S + D IY P S+TS +PC
Sbjct: 214 VGTPNTSFMVALDTGSDLFWIPCDCIECAPLSGYHGSLDR--DLGIYKPAESTTSRHLPC 271
Query: 169 NSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGC 228
+ LC L C + CPY +YL + T S+G LVED+LHL + E + V + + GC
Sbjct: 272 SHELCLLGSDCTNQKQPCPYNTKYLQENTTSSGLLVEDILHLDSRESHAP-VKASVIIGC 330
Query: 229 GRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGS 288
GR Q+GS+LDG AP+GL GLGM SVPS LA GL+ NSFSMCF D +GRI FGD+G
Sbjct: 331 GRKQSGSYLDGIAPDGLLGLGMADISVPSFLARAGLVRNSFSMCFTKD-SGRIFFGDQGV 389
Query: 289 PGQGETPF-SLRQTHPTYNITITQVSVGGNAV-NFEFSAIFDSGTSFTYLNDPAYTQISE 346
Q TPF L TY + + + VG + F AI DSGTSFT L Y ++
Sbjct: 390 STQQSTPFVPLYGKLQTYTVNVDKSCVGHKCFESTSFQAIVDSGTSFTALPLDIYKAVAI 449
Query: 347 TFNSLAKEKR--ETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSS 404
F+ R + +TS F+YCY SP + P V LT G F +P ++
Sbjct: 450 EFDKQVNASRLPQEATS---FDYCYSASP-LVMPDVPTVTLTFAGNKSFQPVNPTFLLHD 505
Query: 405 EPKGLYLYCLGVVKS-DNVNIIGREYPIANNI 435
E + +CL VV+S + + II + + + ++
Sbjct: 506 EEGAVAGFCLAVVQSPEPIGIIAQNFLLGYHV 537
>gi|217426809|gb|ACK44517.1| AT5G10080-like protein [Arabidopsis arenosa]
Length = 506
Score = 250 bits (638), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 153/416 (36%), Positives = 222/416 (53%), Gaps = 39/416 (9%)
Query: 36 HRYSDPVKGILAV----DDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTP----L 87
HR+SD + + + LP+K S YY LA D FR + L A+ P
Sbjct: 31 HRFSDEGRASIRTPSSSESLPEKQSLEYYRLLAKSD--FRRQRMNLGAKFQSLVPSEGSK 88
Query: 88 TFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNS--S 145
T S+GND G+LHYT + +G P++SF+VALDTGSDL W+PC+CV C ++ S
Sbjct: 89 TISSGND------FGWLHYTWIDIGTPSVSFLVALDTGSDLLWIPCNCVQCAPLTSTYYS 142
Query: 146 SGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVE 205
S D N Y+P++SSTS C+ LC+ C S CPY V YLS T S+G LVE
Sbjct: 143 SLATKDLNEYNPSSSSTSKVFLCSHKLCDSASDCESPKEQCPYTVNYLSGNTSSSGLLVE 202
Query: 206 DVLHLATDEKQ-----SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILA 260
D+LHL + S SV +R+ GCG+ Q+G +LDG AP+GL GLG + SVPS L+
Sbjct: 203 DILHLTYNTNNRLMNGSSSVKARVVIGCGKKQSGDYLDGVAPDGLMGLGPAEISVPSFLS 262
Query: 261 NQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAV- 319
GL+ NSFS+CF + +GRI FGD G Q TPF + + Y + + +G + +
Sbjct: 263 KAGLMRNSFSLCFDEEDSGRIYFGDMGPSIQQSTPFLQLENNSGYIVGVEACCIGNSCLK 322
Query: 320 NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSD----LPFEYCYVLSPNQ 375
F+ DSG SFTYL + Y ++ +L ++ +TS + +EYCY +
Sbjct: 323 QTSFTTFIDSGQSFTYLPEEIYRKV-----ALEIDRHINATSKSFEGVSWEYCY---ESS 374
Query: 376 TNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKS--DNVNIIGREY 429
+ P + L F ++ P+ + + +GL +CL + S + + IG+ Y
Sbjct: 375 VEPKVPAIKLKFSHNNTFVIHKPLFVF-QQSQGLVQFCLPISPSGQEGIGSIGQNY 429
>gi|357463449|ref|XP_003602006.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355491054|gb|AES72257.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 529
Score = 249 bits (636), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 159/412 (38%), Positives = 227/412 (55%), Gaps = 20/412 (4%)
Query: 29 TFGFDFHHRYSDPVKGI-LAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPL 87
TF HR+S+ +K + + D P + + Y+ L R+ + R + G + L
Sbjct: 26 TFSVKLFHRFSEEMKPVQVQTGDWPDRRTLHYHEKLL-RNDFLRHK----INLGGARHKL 80
Query: 88 TF-SAGNDTYRL-NSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSS 145
F S G+ T N G+LHYT + +G P+ SF+VALD GSDL W+PCDC+ C L++S
Sbjct: 81 LFPSQGSKTMSFGNDFGWLHYTWIDIGTPSTSFLVALDAGSDLLWVPCDCIHCA-PLSAS 139
Query: 146 --SGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCP-SAGSNCPYQVRYLSDGTMSTGF 202
S D N YSP+ S +S + C+ LC++ C S CPY + YLSD T S+G
Sbjct: 140 FYSNLDRDLNEYSPSRSLSSKHLSCSHRLCDMGSNCKTSKQQQCPYTINYLSDNTSSSGL 199
Query: 203 LVEDVLHLATDE--KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILA 260
LVED+ HL + + + SV + + GCG Q+G +LDG AP+GL GLG ++SVPS LA
Sbjct: 200 LVEDIFHLQSGDGSTSNSSVQAPVVVGCGMKQSGGYLDGTAPDGLIGLGPGESSVPSFLA 259
Query: 261 NQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQ-THPTYNITITQVSVGGNAV 319
GLI +SFS+CF D +GR+ FGD+GS Q TPF L TY + + +G +
Sbjct: 260 KSGLIRDSFSLCFNEDDSGRLFFGDQGSTVQQSTPFLLVDGMFSTYIVGVETCCIGNSCP 319
Query: 320 NF-EFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNF 378
F+A FDSGTSFT+L AY I+E F+ R T P+EYCYV S Q
Sbjct: 320 KVTSFNAQFDSGTSFTFLPGHAYGAIAEEFDKQVNATRSTFQGS-PWEYCYVPSSQQLP- 377
Query: 379 EYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD-NVNIIGREY 429
+ P + L + F V +P V VS +G+ +CL + ++ + IG+ +
Sbjct: 378 KIPTLTLMFQQNNSFVVYNP-VFVSYNEQGVDGFCLAIQPTEGGMGTIGQNF 428
>gi|225438629|ref|XP_002281243.1| PREDICTED: aspartic proteinase-like protein 1-like [Vitis vinifera]
Length = 511
Score = 248 bits (633), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 152/406 (37%), Positives = 216/406 (53%), Gaps = 20/406 (4%)
Query: 36 HRYSDPVKGILAVDD-----LPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFS 90
HR+SD VK A P+ + YY L D R G+ L S
Sbjct: 11 HRFSDEVKAFRAARSGLSGSWPEWRTMEYYKMLVRSDWE-----RQKVMLGSKYQFLFPS 65
Query: 91 AGNDTYRL-NSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQV 149
G+ T N G+LHYT + +G P +SF+VALD GSDL W+PCDC+ C S G +
Sbjct: 66 EGSKTMSFGNDYGWLHYTWIDIGTPNISFLVALDAGSDLLWIPCDCIQCAPLSASYYGSL 125
Query: 150 -IDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVL 208
D N YSP+ SSTS + C+ LCE C S CPY + Y S+ T S+G L+ED+L
Sbjct: 126 DRDLNQYSPSGSSTSKHLSCSHQLCESSPNCDSPKQLCPYTINYYSENTSSSGLLIEDIL 185
Query: 209 HLAT--DEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIP 266
HL + D+ + SV + + GCG QTG +LDG AP+GL GLG+ + SVPS L+ GL+
Sbjct: 186 HLTSGIDDASNSSVRAPVIIGCGMRQTGGYLDGVAPDGLMGLGLGEISVPSFLSKAGLVK 245
Query: 267 NSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQ-THPTYNITITQVSVGGNAV-NFEFS 324
NSFS+CF D +GRI FGD+G Q T F + TY + + +G + + F
Sbjct: 246 NSFSLCFNDDDSGRIFFGDQGLATQQTTLFLPSDGKYETYIVGVEACCIGSSCIKQTSFR 305
Query: 325 AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVN 384
A+ DSG SFT+L D +Y + + F+ R S P+EYCY S + + P V
Sbjct: 306 ALVDSGASFTFLPDESYRNVVDEFDKQVNATR-FSFEGYPWEYCYKSSSKEL-LKNPSVI 363
Query: 385 LTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD-NVNIIGREY 429
L F V++P+ +V +G+ +CL + +D ++ I+G+ +
Sbjct: 364 LKFALNNSFVVHNPVFVVHGY-QGVVGFCLAIQPADGDIGILGQNF 408
>gi|72384474|gb|AAZ67590.1| 80A08_5 [Brassica rapa subsp. pekinensis]
Length = 632
Score = 248 bits (632), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 158/428 (36%), Positives = 222/428 (51%), Gaps = 23/428 (5%)
Query: 12 VLLILLSCCAGCCFGFGTFGFDFHHRYSDPVKG-ILAVDDLPKKGSFAYYSALAHRDRYF 70
+LL +LS + F HR+SD + I + P+K SF YY L D
Sbjct: 8 ILLFILSLVSEKSLA-SLFSSRLIHRFSDEGRASIKSPGSFPEKRSFEYYRLLTSIDS-- 64
Query: 71 RLRGRGLAAQGNDKTPLTFSAGNDTYRL-NSLGFLHYTNVSVGQPALSFIVALDTGSDLF 129
R + L A+ P S G+ T N G+LHYT + +G P++SF+VALD+GSDL
Sbjct: 65 RRQKMNLGAKFQSLVP---SEGSKTISPGNYFGWLHYTWIDIGTPSVSFLVALDSGSDLL 121
Query: 130 WLPCDCVSC--VHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCP 187
W+PC+CV C + SS D N + P+ S+TS PC+ LCE C S CP
Sbjct: 122 WIPCNCVQCAPLSSAYYSSLATKDLNEFDPSASTTSKVFPCSHKLCESAPACESPKEQCP 181
Query: 188 YQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFG 247
Y V Y S+ T S+G LVEDVLHLA S SV +R+ GCG Q+G FL G AP+G+ G
Sbjct: 182 YTVTYASENTSSSGLLVEDVLHLAYSANASSSVKARVVVGCGEKQSGEFLKGIAPDGVMG 241
Query: 248 LGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPF-SLRQTHPTYN 306
LG + SVPS LA GL+ NSFSMCF + +GRI FGD G Q T F + Y
Sbjct: 242 LGPGEISVPSFLAKAGLMRNSFSMCFDEEDSGRIYFGDVGPSTQQSTRFLPYKNEFVAYF 301
Query: 307 ITITQVSVGGNAV-NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPF 365
+ + VG + + F+ + DSG SFT+L + Y +++ +S + P+
Sbjct: 302 VGVEVCCVGNSCLKQSSFTTLIDSGQSFTFLPEEIYREVALEIDSHINATVKKIEGG-PW 360
Query: 366 EYCYVLSPNQTNFE--YPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVN 423
EYCY +T+FE P + L F ++ P+ ++ +GL +CL + S+
Sbjct: 361 EYCY-----ETSFEPKVPAIKLKFSSNNTFVIHKPLFVLQRS-EGLVQFCLPISASEEGT 414
Query: 424 --IIGREY 429
+IG+ Y
Sbjct: 415 GGVIGQNY 422
>gi|356567798|ref|XP_003552102.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 520
Score = 248 bits (632), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 161/409 (39%), Positives = 222/409 (54%), Gaps = 16/409 (3%)
Query: 29 TFGFDFHHRYSDPVKGILAVDD-LPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPL 87
TF HR++D +K + P + S YY L D + R + G L
Sbjct: 22 TFSARLVHRFADEMKPVRPPTGYWPDRWSMGYYRMLLTGD----ILRRKIKVGGARYQLL 77
Query: 88 TFSAGNDTYRL-NSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSS- 145
S G+ T L N G+LHYT + +G P+ SF+VALD GSDL W+PCDCV C L+SS
Sbjct: 78 FPSHGSKTMSLGNDFGWLHYTWIDIGTPSTSFLVALDAGSDLLWIPCDCVQCAP-LSSSY 136
Query: 146 -SGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLV 204
S D N YSP+ S +S + C+ LC+ C S+ CPY V YLS+ T S+G LV
Sbjct: 137 YSNLDRDLNEYSPSRSLSSKHLSCSHQLCDKGSNCKSSQQQCPYMVSYLSENTSSSGLLV 196
Query: 205 EDVLHLATDEKQSKS-VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQG 263
ED+LHL + S S V + + GCG Q+G +LDG AP+GL GLG ++SVPS LA G
Sbjct: 197 EDILHLQSGGSLSNSSVQAPVVLGCGMKQSGGYLDGVAPDGLLGLGPGESSVPSFLAKSG 256
Query: 264 LIPNSFSMCFGSDGTGRISFGDKGSPGQGETPF-SLRQTHPTYNITITQVSVGGNAVNF- 321
LI +SFS+CF D +GRI FGD+G Q T F L + TY I + VG + +
Sbjct: 257 LIHDSFSLCFNEDDSGRIFFGDQGPTIQQSTSFLPLDGLYSTYIIGVESCCVGNSCLKMT 316
Query: 322 EFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYP 381
F DSGTSFT+L Y I+E F+ R +S P+EYCYV S +Q + P
Sbjct: 317 SFKVQVDSGTSFTFLPGHVYGAIAEEFDQQVNGSR-SSFEGSPWEYCYVPS-SQELPKVP 374
Query: 382 VVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD-NVNIIGREY 429
+ LT + F V DP+ + +G+ +CL + ++ ++ IG+ +
Sbjct: 375 SLTLTFQQNNSFVVYDPVFVFYGN-EGVIGFCLAIQPTEGDMGTIGQNF 422
>gi|357489329|ref|XP_003614952.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355516287|gb|AES97910.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 530
Score = 246 bits (629), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 162/417 (38%), Positives = 216/417 (51%), Gaps = 23/417 (5%)
Query: 29 TFGFDFHHRYSDPVKGILA------VDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGN 82
TF HR+SD K L V PK+GS Y+ L + D + L +Q
Sbjct: 24 TFSSRIIHRFSDEAKVHLRNNGGENVQSWPKRGSSEYFRLLLNSD--LTRQKMKLGSQDQ 81
Query: 83 DKTPLTFSAGNDTYRL-NSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHG 141
P S G+ T N +LHYT + +G P +SF+VALDTGSD+FW+PCDC+ C
Sbjct: 82 SFYP---SEGSKTLSFGNDFVWLHYTWIDIGTPNVSFLVALDTGSDMFWVPCDCIECAP- 137
Query: 142 LNSSSGQVID--FNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMS 199
L+++ +D N YSP+ SS+S +PC LC C CPY Y SD T S
Sbjct: 138 LSAAFYNALDRDLNQYSPSLSSSSRHLPCGHQLCNQNSNCKGFKDRCPYIKEYTSDNTSS 197
Query: 200 TGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSIL 259
+GFL+ED LHLA++ S+ + + GCGR Q+G FL+GAAPNG+ GLG SVP++L
Sbjct: 198 SGFLIEDKLHLASNNATKNSIQASVILGCGRKQSGYFLEGAAPNGMLGLGPGSISVPALL 257
Query: 260 ANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGE-TPFSLRQTH-PTYNITITQVSVGGN 317
A GLI NS S+C G+GRI FGD+G Q TPF L Y + + + VG
Sbjct: 258 AKAGLIRNSISICLNEKGSGRILFGDQGHATQRRSTPFLLDDGELLNYFVGVERFCVGSF 317
Query: 318 AVN-FEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQT 376
EF A D+GTSFTYL Y + F R TS F CY S ++
Sbjct: 318 CYKETEFKAFIDTGTSFTYLPKGVYETVVAEFEKQVHATRITSQIQSDFNCCYNASSRES 377
Query: 377 NFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNI-IGREYPIA 432
N +P + T F + +P + + E + CL VV+SD+ I IGR+Y IA
Sbjct: 378 N-NFPPMKFTFSKNQSFIIQNPFISMDQEDTTI---CLAVVQSDDELITIGRKYTIA 430
>gi|449451627|ref|XP_004143563.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 532
Score = 244 bits (624), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 152/416 (36%), Positives = 229/416 (55%), Gaps = 22/416 (5%)
Query: 29 TFGFDFHHRYSDPVKGILAVDD--------LPKKGSFAYYSALAHRDRYFRLRGRGLAAQ 80
TF HR+S+ +K + A P+KGS YY L D FR + L ++
Sbjct: 23 TFTSRILHRFSEEMKALRASGSTNTSVRVSWPEKGSMEYYQELVSGD--FRRQKMKLGSR 80
Query: 81 GNDKTPLTFSAGNDTYRL-NSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCV 139
P S G+ T L N G+LHYT + +G P++SF+VALD GSDL W+PC+C+ C
Sbjct: 81 FQLLFP---SEGSKTIALGNDFGWLHYTWIDIGTPSVSFLVALDAGSDLLWVPCNCIQCA 137
Query: 140 HGLNSSSGQV-IDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTM 198
S G + D N Y P++SSTS + C+ LC+ + C S +CPY + Y+++ T
Sbjct: 138 PLSASYYGSLDKDLNEYRPSSSSTSKHISCSHNLCDSGQSCQSPKQSCPYVIDYITENTS 197
Query: 199 STGFLVEDVLHLATDEKQSK--SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVP 256
S+G L++DVLHL++ + S ++ + + GCG Q+G +L G AP+GLFGLG+ + SV
Sbjct: 198 SSGLLIQDVLHLSSGCENSSNCTIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVL 257
Query: 257 SILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPF-SLRQTHPTYNITITQVSVG 315
S LA + L+ NSFS+CF DG+GRI FGD+G Q T F L + TY + + +
Sbjct: 258 SSLAKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQTTSFVPLDGKYETYIVGVEACCIE 317
Query: 316 GNAV-NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPN 374
+ + F A+ DSGTSFTYL + AY I F+ S P++YCY +S +
Sbjct: 318 NSCLKQTSFKALIDSGTSFTYLPEEAYENIVIEFDKRLNTTSAVSFKGYPWKYCYKISAD 377
Query: 375 QTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD-NVNIIGREY 429
+ P V L F V+DP+ + + +GL +C ++ +D ++ I+G+ Y
Sbjct: 378 AMP-KVPSVTLLFPLNNSFVVHDPVFPIYGD-QGLAGFCFAILPADGDIGILGQNY 431
>gi|356538031|ref|XP_003537508.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 521
Score = 244 bits (624), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 159/409 (38%), Positives = 221/409 (54%), Gaps = 16/409 (3%)
Query: 29 TFGFDFHHRYSDPVKGILA-VDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPL 87
TF HR++D +K + P + S YY L D + R + G L
Sbjct: 23 TFSARLVHRFADEMKPVRPPTGYWPDQRSMRYYQMLLTGD----ILRRKIKVGGTRYQLL 78
Query: 88 TFSAGNDTYRL-NSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSS- 145
S G+ T L N G+LHYT + +G P+ SF+VALD GSDL W+PCDCV C L+SS
Sbjct: 79 FPSHGSKTMSLGNDFGWLHYTWIDIGTPSTSFLVALDAGSDLLWIPCDCVQCAP-LSSSY 137
Query: 146 -SGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLV 204
S D N YSP+ S +S + C+ LC+ C S+ CPY V YLS+ T S+G LV
Sbjct: 138 YSNLDRDLNEYSPSRSLSSKHLSCSHRLCDKGSNCKSSQQQCPYMVSYLSENTSSSGLLV 197
Query: 205 EDVLHLATDEKQSKS-VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQG 263
ED+LHL + S S V + + GCG Q+G +LDG AP+GL GLG ++SVPS LA G
Sbjct: 198 EDILHLQSGGTLSNSSVQAPVVLGCGMKQSGGYLDGVAPDGLLGLGPGESSVPSFLAKSG 257
Query: 264 LIPNSFSMCFGSDGTGRISFGDKGSPGQGETPF-SLRQTHPTYNITITQVSVGGNAVNF- 321
LI SFS+CF D +GR+ FGD+G Q T F L + TY I + +G + +
Sbjct: 258 LIHYSFSLCFNEDDSGRMFFGDQGPTSQQSTSFLPLDGLYSTYIIGVESCCIGNSCLKMT 317
Query: 322 EFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYP 381
F A DSGTSFT+L Y I+E F+ R +S P+EYCYV S +Q + P
Sbjct: 318 SFKAQVDSGTSFTFLPGHVYGAITEEFDQQVNGSR-SSFEGSPWEYCYVPS-SQDLPKVP 375
Query: 382 VVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD-NVNIIGREY 429
L + F V DP+ + +G+ +CL ++ ++ ++ IG+ +
Sbjct: 376 SFTLMFQRNNSFVVYDPVFVFYGN-EGVIGFCLAILPTEGDMGTIGQNF 423
>gi|15238055|ref|NP_196570.1| aspartic proteinase-like protein 1 [Arabidopsis thaliana]
gi|75180764|sp|Q9LX20.1|ASPL1_ARATH RecName: Full=Aspartic proteinase-like protein 1; Flags: Precursor
gi|7960727|emb|CAB92049.1| putative protein [Arabidopsis thaliana]
gi|332004108|gb|AED91491.1| aspartic proteinase-like protein 1 [Arabidopsis thaliana]
Length = 528
Score = 244 bits (624), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 156/424 (36%), Positives = 222/424 (52%), Gaps = 41/424 (9%)
Query: 30 FGFDFHHRYSDP----VKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKT 85
F HR+SD +K + D LP K S YY LA D FR + L A+
Sbjct: 25 FSSRLIHRFSDEGRASIKTPSSSDSLPNKQSLEYYRLLAESD--FRRQRMNLGAKVQSLV 82
Query: 86 P----LTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHG 141
P T S+GND G+LHYT + +G P++SF+VALDTGS+L W+PC+CV C
Sbjct: 83 PSEGSKTISSGND------FGWLHYTWIDIGTPSVSFLVALDTGSNLLWIPCNCVQCAPL 136
Query: 142 LNS--SSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMS 199
++ SS D N Y+P++SSTS C+ LC+ C S CPY V YLS T S
Sbjct: 137 TSTYYSSLATKDLNEYNPSSSSTSKVFLCSHKLCDSASDCESPKEQCPYTVNYLSGNTSS 196
Query: 200 TGFLVEDVLHLATDEKQ-----SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTS 254
+G LVED+LHL + S SV +R+ GCG+ Q+G +LDG AP+GL GLG + S
Sbjct: 197 SGLLVEDILHLTYNTNNRLMNGSSSVKARVVIGCGKKQSGDYLDGVAPDGLMGLGPAEIS 256
Query: 255 VPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSL--RQTHPTYNITITQV 312
VPS L+ GL+ NSFS+CF + +GRI FGD G Q TPF + Y + +
Sbjct: 257 VPSFLSKAGLMRNSFSLCFDEEDSGRIYFGDMGPSIQQSTPFLQLDNNKYSGYIVGVEAC 316
Query: 313 SVGGNAV-NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSD----LPFEY 367
+G + + F+ DSG SFTYL + Y ++ +L ++ +TS + +EY
Sbjct: 317 CIGNSCLKQTSFTTFIDSGQSFTYLPEEIYRKV-----ALEIDRHINATSKNFEGVSWEY 371
Query: 368 CYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKS--DNVNII 425
CY S + P + L F ++ P+ + + +GL +CL + S + + I
Sbjct: 372 CYESSAEP---KVPAIKLKFSHNNTFVIHKPLFVF-QQSQGLVQFCLPISPSGQEGIGSI 427
Query: 426 GREY 429
G+ Y
Sbjct: 428 GQNY 431
>gi|224063191|ref|XP_002301033.1| predicted protein [Populus trichocarpa]
gi|222842759|gb|EEE80306.1| predicted protein [Populus trichocarpa]
Length = 536
Score = 244 bits (622), Expect = 9e-62, Method: Compositional matrix adjust.
Identities = 160/421 (38%), Positives = 221/421 (52%), Gaps = 27/421 (6%)
Query: 29 TFGFDFHHRYSDPVKGI-------LAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQG 81
TF HR+SD K I + D PK+ SF Y+ L D R G
Sbjct: 27 TFSSKLIHRFSDEAKSISISRKGNASGDLWPKRYSFEYFQLLLGNDL-----KRQRMKLG 81
Query: 82 NDKTPLTF-SAGNDTYRL-NSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCV 139
+ K L F S G+ N L +LHYT + +G P +SF+VALD GSDL W+PCDC+ C
Sbjct: 82 SQKNQLLFPSQGSQALFFGNELDWLHYTWIDIGTPNVSFLVALDAGSDLLWVPCDCIQCA 141
Query: 140 HGLNSSSGQV---IDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLS-D 195
L++S + D + YSP+ SSTS + C+ LCE C + CPY Y +
Sbjct: 142 -PLSASYYNISLDRDLSEYSPSLSSTSRHLSCDHQLCEWGSNCKNPKDPCPYIFNYDDFE 200
Query: 196 GTMSTGFLVEDVLHLAT--DEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKT 253
T S GFLVED LHLA+ D K + + + GCGR Q GSF DGAAP+G+ GLG
Sbjct: 201 NTTSAGFLVEDKLHLASVGDHTARKMLQASVVLGCGRKQGGSFFDGAAPDGVMGLGPGDI 260
Query: 254 SVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPF-SLRQTHPTYNITITQV 312
SVPS+LA GLI N FS+CF + +GRI FGD+G Q TPF ++ T+ Y + +
Sbjct: 261 SVPSLLAKAGLIQNCFSLCFDENDSGRILFGDRGHASQQSTPFLPIQGTYVAYFVGVESY 320
Query: 313 SVGGNAVNFE-FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVL 371
VG + + F A+ DSG+SFTYL Y ++ F+ KR S D ++YCY
Sbjct: 321 CVGNSCLKRSGFKALVDSGSSFTYLPSEVYNELVSEFDKQVNAKR-ISFQDGLWDYCYNA 379
Query: 372 SPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD-NVNIIGREYP 430
S +Q + P + L F V++P + +G ++CL + +D + IIG+ +
Sbjct: 380 S-SQELHDIPAIQLKFPRNQNFVVHNPTYSIPHH-QGFTMFCLSLQPTDGSYGIIGQNFM 437
Query: 431 I 431
I
Sbjct: 438 I 438
>gi|6562288|emb|CAB62658.1| putative protein [Arabidopsis thaliana]
Length = 426
Score = 243 bits (619), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 148/403 (36%), Positives = 222/403 (55%), Gaps = 52/403 (12%)
Query: 28 GTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPL 87
G+ F+ HHR+S+ VK +L LP+ GS YY AL HRDR GR L + N++T +
Sbjct: 20 GSLSFEIHHRFSEQVKTVLGGHGLPEMGSLDYYKALVHRDR-----GRQLTSNNNNQTTI 74
Query: 88 TFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSG 147
+F+ GN T ++ L+ N++ P L F + V C L
Sbjct: 75 SFAQGNSTEEIS----LYDKNLA---PPLYFHLT------------QAVICFGYL----- 110
Query: 148 QVIDFNIYSPNTSSTSSKVPCNSTLCELQK-QCPSAGSNCPYQVRYLSDGTMSTGFLVED 206
+ +P + L K +C S S+CPY++RYLS G+ STG LVED
Sbjct: 111 ---------------AIAIPLVYGVWRLTKARCISPVSDCPYRIRYLSPGSKSTGVLVED 155
Query: 207 VLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIP 266
V+H++T+E +++ D+RI+FG Q G F + A NG+ GL + +VP++L G+
Sbjct: 156 VIHMSTEEGEAR--DARITFG--ESQLGLFKE-VAVNGIMGLAIADIAVPNMLVKAGVAS 210
Query: 267 NSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAI 326
+SFSMCFG +G G ISFGDKGS Q ETP S + Y+++IT+ VG V+ EF+A
Sbjct: 211 DSFSMCFGPNGKGTISFGDKGSSDQLETPLSGTISPMFYDVSITKFKVGKVTVDTEFTAT 270
Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLT 386
FDSGT+ T+L +P YT ++ F+ ++R + + D PFE+CY+++ + P V+
Sbjct: 271 FDSGTAVTWLIEPYYTALTTNFHLSVPDRRLSKSVDSPFEFCYIITSTSDEDKLPSVSFE 330
Query: 387 MKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVN--IIGR 427
MKGG + V PI++ + +YCL V+K N + IIGR
Sbjct: 331 MKGGAAYDVFSPILVFDTSDGSFQVYCLAVLKQVNADFSIIGR 373
>gi|255545620|ref|XP_002513870.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223546956|gb|EEF48453.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 535
Score = 241 bits (616), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 160/423 (37%), Positives = 221/423 (52%), Gaps = 36/423 (8%)
Query: 29 TFGFDFHHRYSDPVKGIL-------AVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQG 81
TF HR+S+ K +L + P K SF Y L D + + L AQ
Sbjct: 23 TFSSKLIHRFSEEAKSLLISGNDNVSSQTWPNKNSFQYLQLLLDND--LKRQKMKLGAQN 80
Query: 82 NDKTPLTFSAGNDTYRL-NSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVH 140
P S G+ T+ N L +LHYT + +G P +SF+VALD GSDL W+PCDC+ C
Sbjct: 81 QLLFP---SLGSHTFFYGNDLDWLHYTWIDIGTPNVSFLVALDAGSDLSWVPCDCIQCA- 136
Query: 141 GLNSSSGQVIDFNI--YSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTM 198
L++S + +D ++ Y P+ S+TS + CN LCEL C + CPY Y T
Sbjct: 137 PLSASLYKPLDRDLSEYRPSLSTTSRHLSCNHQLCELGSHCKNLKDPCPYIADYADPNTS 196
Query: 199 STGFLVEDVLHLATDEKQSKSVDSRIS----FGCGRVQTGSFLDGAAPNGLFGLGMDKTS 254
S+GFLVED+LHLA+ S S R+ GCGR QTG +LDGAAP+G+ GLG S
Sbjct: 197 SSGFLVEDILHLASVSDDSNSTQKRVQASVILGCGRKQTGGYLDGAAPDGVMGLGPGSIS 256
Query: 255 VPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQ-THPTYNITITQVS 313
VPS+LA GLI SFS+CF +G+G I FGD+G Q TP Q + Y I +
Sbjct: 257 VPSLLAKAGLIRKSFSLCFDVNGSGTILFGDQGHTSQKSTPLLPTQGNYDAYLIEVESYC 316
Query: 314 VGGNAVNFE-FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLS 372
VG + + F A+ DSG SFTYL Y +I F+ +R +S P+ YCY S
Sbjct: 317 VGNSCLKQSGFKALVDSGASFTYLPIDVYNKIVLEFDKQVNAQRISSQGG-PWNYCYNTS 375
Query: 373 PNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSS-----EPKGLYLYCLGVVKSD-NVNIIG 426
Q + P + L+ F +N ++I +S + + ++CL + +D N IIG
Sbjct: 376 SKQLD-NVPAMRLS------FLMNQSLLIHNSTYYVPQNQEFAVFCLTLQPTDLNYGIIG 428
Query: 427 REY 429
+ Y
Sbjct: 429 QNY 431
>gi|297807039|ref|XP_002871403.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297317240|gb|EFH47662.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 529
Score = 240 bits (612), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 153/416 (36%), Positives = 222/416 (53%), Gaps = 39/416 (9%)
Query: 36 HRYSDP----VKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTP----L 87
HR+SD +K + + LP+K S AYY LA D FR + L A+ P
Sbjct: 31 HRFSDEGRASIKTPSSSESLPEKQSLAYYRLLAKSD--FRRQRMNLGAKFQSLVPSEGSK 88
Query: 88 TFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNS--S 145
T S+GND G+LHYT + +G P++SF+VALDTGSDL W+PC+CV C ++ S
Sbjct: 89 TISSGND------FGWLHYTWIDIGTPSVSFLVALDTGSDLLWIPCNCVQCAPLTSTYYS 142
Query: 146 SGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVE 205
S D N Y+P++SS+S C+ LC C S C Y V+YLS T S+G LVE
Sbjct: 143 SLATKDLNEYNPSSSSSSKVFLCSHKLCGSASDCDSPKEQCTYTVKYLSGNTSSSGLLVE 202
Query: 206 DVLHLATDEKQ-----SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILA 260
D+LHL + S SV +R+ GCG+ Q+G +LDG AP+GL GLG + SVPS L+
Sbjct: 203 DILHLTYNTNNRLMNGSSSVKARVVVGCGKKQSGDYLDGVAPDGLMGLGPAEISVPSFLS 262
Query: 261 NQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAV- 319
GL+ NSFS+CF + +GRI FGD G Q PF + + Y + + +G + +
Sbjct: 263 KAGLMRNSFSLCFDEEDSGRIYFGDMGPSIQQSAPFLQLENNSGYIVGVEACCIGNSCLK 322
Query: 320 NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSD----LPFEYCYVLSPNQ 375
F+ DSG SFTYL + Y ++ +L ++ +TS + +EYCY +
Sbjct: 323 QTSFTTFIDSGQSFTYLPEEIYRKV-----ALEIDRHINATSKSFEGVSWEYCY---ESS 374
Query: 376 TNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNI--IGREY 429
+ P + L F ++ P+ + + +GL +CL + S+ I IG+ Y
Sbjct: 375 VEPKVPAIKLKFSHNNTFVIHKPLFVF-QQSQGLVQFCLPISPSEQEGIGSIGQNY 429
>gi|449533544|ref|XP_004173734.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
1-like, partial [Cucumis sativus]
Length = 408
Score = 236 bits (602), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 145/385 (37%), Positives = 211/385 (54%), Gaps = 20/385 (5%)
Query: 29 TFGFDFHHRYSDPVKGILAVDD--------LPKKGSFAYYSALAHRDRYFRLRGRGLAAQ 80
TF HR+S+ +K + A P+KGS YY L D FR + L ++
Sbjct: 23 TFTSRILHRFSEEMKALRASGSTNTSVRVSWPEKGSMEYYQELVSGD--FRRQKMKLGSR 80
Query: 81 GNDKTPLTFSAGNDTYRL-NSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCV 139
P S G+ T L N G+LHYT + +G P++SF+VALD GSDL W+PC+C+ C
Sbjct: 81 FQLLFP---SEGSXTIALGNDFGWLHYTWIDIGTPSVSFLVALDAGSDLLWVPCNCIQCA 137
Query: 140 HGLNSSSGQV-IDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTM 198
S G + D N Y P++SSTS + C+ LC+ + C S +CPY + Y+++ T
Sbjct: 138 PLSASYYGSLDKDLNEYRPSSSSTSKHISCSHNLCDSGQSCQSPKQSCPYVIDYITENTS 197
Query: 199 STGFLVEDVLHLATDEKQSK--SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVP 256
S+G L++DVLHL++ + S ++ + + GCG Q+G +L G AP+GLFGLG+ + SV
Sbjct: 198 SSGLLIQDVLHLSSGCENSSNCTIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVL 257
Query: 257 SILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPF-SLRQTHPTYNITITQVSVG 315
S LA + L+ NSFS+CF DG+GRI FGD+G Q T F L + TY + + +
Sbjct: 258 SSLAKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQTTSFVPLDGKYETYIVGVEACCIE 317
Query: 316 GNAV-NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPN 374
+ + F A+ DSGTSFTYL + AY I F+ S P++YCY +S +
Sbjct: 318 NSCLKQTSFKALIDSGTSFTYLPEEAYENIVIEFDKRLNTTSAVSFKGYPWKYCYKISAD 377
Query: 375 QTNFEYPVVNLTMKGGGPFFVNDPI 399
+ P V L F V+DP+
Sbjct: 378 AMP-KVPSVTLLFPLNNSFVVHDPV 401
>gi|356548395|ref|XP_003542587.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 525
Score = 236 bits (601), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 152/419 (36%), Positives = 219/419 (52%), Gaps = 29/419 (6%)
Query: 28 GTFGFDFHHRYSDPVKGILA---------VDDLPKKGSFAYYSALAHRDRYFRLRGRGLA 78
TF HR+S+ K LA + P++ S Y+ L D R R R
Sbjct: 23 ATFSSRLIHRFSEEAKAHLASRGNKSSVLLQAWPQRNSSEYFRLLLRSD-VARQRMR--- 78
Query: 79 AQGNDKTPLTFSAGNDTYRL-NSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVS 137
G+ L S G T+ N+L +LHYT + +G P +SF+VALD GSD+ W+PCDC+
Sbjct: 79 -LGSQYETLYPSEGGQTFFFGNALYWLHYTWIDIGTPNVSFLVALDAGSDMLWVPCDCIE 137
Query: 138 CVHGLNSSSGQVID--FNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSD 195
C L++ + V+D N Y P+ S+TS +PC LC++ C + CPY+V+Y S
Sbjct: 138 CA-SLSAGNYNVLDRDLNQYRPSLSNTSRHLPCGHKLCDVHSFCKGSKDPCPYEVQYASA 196
Query: 196 GTMSTGFLVEDVLHLATDEKQSK--SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKT 253
T S+G++ ED LHL +D K ++ SV + I GCGR QTG +L GA P+G+ GLG
Sbjct: 197 NTSSSGYVFEDKLHLTSDGKHAEQNSVQASIILGCGRKQTGDYLHGAGPDGVLGLGPGNI 256
Query: 254 SVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVS 313
SVPS+LA GLI NSFS+C + +GRI FGD+G Q TPF Y + +
Sbjct: 257 SVPSLLAKAGLIQNSFSICLDENESGRIIFGDQGHVTQHSTPF---LPIIAYMVGVESFC 313
Query: 314 VGGNAVN-FEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLS 372
VG + F A+ DSG+SFT+L + Y ++ F+ R S +EYCY S
Sbjct: 314 VGSLCLKETRFQALIDSGSSFTFLPNEVYQKVVTEFDKQVNASRIVLQSS--WEYCYNAS 371
Query: 373 PNQTNFEYPVVNLTMKGGGPFFVNDPIVI-VSSEPKGLYLYCLGVVKS-DNVNIIGREY 429
+Q P + L F + +PI +S+ + ++CL V S D+ IG+ +
Sbjct: 372 -SQELVNIPPLKLAFSRNQTFLIQNPIFYDPASQEQEYTIFCLPVSPSADDYAAIGQNF 429
>gi|168000300|ref|XP_001752854.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696017|gb|EDQ82358.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 525
Score = 234 bits (598), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 151/456 (33%), Positives = 229/456 (50%), Gaps = 27/456 (5%)
Query: 1 MASSYRNSPVCVLLILLSCCAGCCFG---FGTFGFDFHHRYSDPV-------KGILAVDD 50
MA++ R+ V L+++ CC D H++S G+ D
Sbjct: 1 MATTVRSRGV---LVMVHCCVLWMLATTFANALRMDLFHKFSKQAIEAMRSRNGMDYAQD 57
Query: 51 LPKKGSFAYYSALAHRD--RYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTN 108
P +G+ + + L D R+ R R LAA D+ L GN T +L G LHY+
Sbjct: 58 WPTEGTIEFQTMLRDHDVARHTRTARRILAASSMDQYVLI--QGNATEQLFG-GGLHYSY 114
Query: 109 VSVGQPALSFIVALDTGSDLFWLPCDCVSCVH-GLNSSSGQVIDFNIYSPNTSSTSSKVP 167
+ +G P + F+V LDTGSDL W+PC+C SC S + N Y+P+ SST+ V
Sbjct: 115 IDIGTPNVQFLVVLDTGSDLLWIPCECESCAPLSAESKDPRTSQLNPYTPSLSSTAKPVL 174
Query: 168 CNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFG 227
C+ LCE+ C + CPY++ Y+S T ++G L ED ++ E V + G
Sbjct: 175 CSDPLCEMSSTCMAPTDQCPYEINYVSANTSTSGALYEDYMYF-MRESGGNPVKLPVYLG 233
Query: 228 CGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKG 287
CG+VQTGS L GAAPNGL GLG SVP+ LA+ G + +SFS+C G+G ++FGD+G
Sbjct: 234 CGKVQTGSLLKGAAPNGLMGLGTTDISVPNKLASTGQLADSFSLCISPGGSGTLTFGDEG 293
Query: 288 SPGQGETPFSLRQTH--PTYNITITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQIS 345
Q TP + TY + I ++VG + A+FD+GTSFTYL+ Y Q
Sbjct: 294 PAAQRTTPIIPKSVSMLDTYIVEIDSITVGNTNLLMASHALFDTGTSFTYLSKTVYPQFV 353
Query: 346 ETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSE 405
+ +++ + ++ CY S TNF+ PVV+L + GG V + + +
Sbjct: 354 QAYDAQMSLPKWNDPRFSKWDLCYQTS--NTNFQVPVVSLALSGGNSLDVVSGLKSIVDD 411
Query: 406 PKGLYLYCLGVVKS-DNVNIIGREYPIANNISLFHN 440
+ C+ V+ S ++IIG+ + N S+ +N
Sbjct: 412 NNAMIAVCVTVMDSGAGLSIIGQNF--MTNYSITYN 445
>gi|356551638|ref|XP_003544181.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 880
Score = 233 bits (594), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 151/411 (36%), Positives = 218/411 (53%), Gaps = 30/411 (7%)
Query: 29 TFGFDFHHRYSDPVKGILA---------VDDLPKKGSFAYYSALAHRDRYFRLRGRGLAA 79
TF HR+S+ K LA + P++ S Y+ L D R R R L +
Sbjct: 24 TFSSRLIHRFSEEAKAHLASRGSDGSVLLQAWPERNSSEYFRLLLRSD-VTRQRMR-LGS 81
Query: 80 QGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCV 139
Q P F G N+L +LHYT + +G P +SF+VALD GSD+ W+PCDC+ C
Sbjct: 82 QYEMLYP--FEGGQTFLFGNALYWLHYTWIDIGTPNVSFLVALDAGSDMLWVPCDCIECA 139
Query: 140 HGLNSSSGQVID--FNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGT 197
L++ + V+D N Y P+ S+TS +PC LC++ C + CPY V+Y S T
Sbjct: 140 -SLSAGNYNVLDRDLNQYRPSLSNTSRHLPCGHKLCDVHSVCKGSKDPCPYAVQYSSANT 198
Query: 198 MSTGFLVEDVLHLATDEKQSK--SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSV 255
S+G++ ED LHL ++ K ++ SV + I GCGR QTG +L GA P+G+ GLG SV
Sbjct: 199 SSSGYVFEDKLHLTSNGKHAEQNSVQASIILGCGRKQTGEYLRGAGPDGVLGLGPGNISV 258
Query: 256 PSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPF-SLRQTHPTYNITITQVSV 314
PS+LA GLI NSFS+CF + +GRI FGD+G Q TPF + Y + + V
Sbjct: 259 PSLLAKAGLIQNSFSICFEENESGRIIFGDQGHVTQHSTPFLPIDGKFNAYIVGVESFCV 318
Query: 315 GGNAVN-FEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDL---PFEYCYV 370
G + F A+ DSG+SFT+L + Y ++ F+ K+ +TS + +EYCY
Sbjct: 319 GSLCLKETRFQALIDSGSSFTFLPNEVYQKVVIEFD-----KQVNATSIVLQNSWEYCYN 373
Query: 371 LSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDN 421
S +Q P +NL + + +PI I + ++CL V SD+
Sbjct: 374 AS-SQELISIPPLNLAFSRNQTYLIQNPIFI-DPASQEYTIFCLPVSPSDD 422
>gi|449445106|ref|XP_004140314.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
gi|449479851|ref|XP_004155727.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 523
Score = 229 bits (585), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 153/436 (35%), Positives = 227/436 (52%), Gaps = 29/436 (6%)
Query: 11 CVLLILL--SCCAGCCFGFGTFGFDFHHRYSDPVK--------GILAVDDLPKKGSFAYY 60
C LL+L S C T + HR+SD K G ++ P S Y+
Sbjct: 4 CALLLLFIASLFVNCSLAL-TLSLNLVHRFSDEAKSLWESRRTGNVSAKFWPPTNSLKYF 62
Query: 61 SALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRL-NSLGFLHYTNVSVGQPALSFI 119
L D L+ R L G+ L S G+ N +LHYT + +G P++ F+
Sbjct: 63 QMLMDYD----LKRRRLNI-GSKYDVLFPSEGSQVIFFGNEFNWLHYTWIDLGTPSVPFL 117
Query: 120 VALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNI--YSPNTSSTSSKVPCNSTLCELQK 177
VALD GSDL W+PCDC+ C L+++ V+D ++ Y+P SSTS + C LC
Sbjct: 118 VALDVGSDLLWVPCDCIQCA-PLSANYYSVLDRDLSEYNPALSSTSKHLFCGHQLCAWST 176
Query: 178 QCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS--VDSRISFGCGRVQTGS 235
C SA C Y+ Y SD T ++GF++ED L L + K + + + FGCGR Q+GS
Sbjct: 177 TCKSANDPCTYKRDYYSDNTSTSGFMIEDKLQLTSFSKHGTHSLLQASVVFGCGRKQSGS 236
Query: 236 FLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETP 295
+LDGAAP+G+ GLG SVP++LA +GL+ N+FS+CF ++G+GRI FGD G Q T
Sbjct: 237 YLDGAAPDGVMGLGPGNISVPTLLAQEGLVRNTFSLCFDNNGSGRILFGDDGPATQQTTQ 296
Query: 296 F-SLRQTHPTYNITITQVSVGGNAVNFE-FSAIFDSGTSFTYLNDPAYTQISETFNSLAK 353
F L Y I + VG + + F A+ DSG+SFTYL Y +I F+ K
Sbjct: 297 FLPLFGEFAAYFIGVESFCVGSSCLQRSGFQALVDSGSSFTYLPAEVYKKIVFEFDKQVK 356
Query: 354 -EKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLY 412
+LP+ YCY +S +F P + L F++DP+ ++ + +G ++
Sbjct: 357 VNATRIVLRELPWNYCYNIS-TLVSFNIPSMQLVFPLNQ-IFIHDPVYVLPAN-QGYKVF 413
Query: 413 CLGVVKSD-NVNIIGR 427
CL + ++D + +IG+
Sbjct: 414 CLTLEETDEDYGVIGQ 429
>gi|351722911|ref|NP_001237772.1| uncharacterized protein LOC100500675 [Glycine max]
gi|255630909|gb|ACU15817.1| unknown [Glycine max]
Length = 244
Score = 192 bits (487), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 99/168 (58%), Positives = 123/168 (73%), Gaps = 5/168 (2%)
Query: 271 MCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIFDSG 330
MCFG DG GRI+FGD GSP Q +TPF++R+ HPTYNITITQ+ V + + EF AIFDSG
Sbjct: 1 MCFGPDGAGRITFGDTGSPDQRKTPFNVRKLHPTYNITITQIVVEDSVADLEFHAIFDSG 60
Query: 331 TSFTYLNDPAYTQISETFNSLAKEKRETST---SDLPFEYCYVLSPNQTNFEYPVVNLTM 387
TSFTY+NDPAYT++ E +NS K R +S S++PFEYCY +S NQT E P +NLTM
Sbjct: 61 TSFTYINDPAYTRLGEMYNSKVKANRHSSQSPDSNIPFEYCYDISINQT-IEVPFLNLTM 119
Query: 388 KGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREYPIANNI 435
KGG ++V DPIV V SE +G L CLG+ KSD+VNIIG+ + I I
Sbjct: 120 KGGDDYYVMDPIVQVFSEEEG-DLLCLGIQKSDSVNIIGQNFMIGYKI 166
>gi|359496801|ref|XP_003635339.1| PREDICTED: aspartic proteinase-like protein 1-like [Vitis vinifera]
Length = 151
Score = 186 bits (473), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 88/143 (61%), Positives = 110/143 (76%), Gaps = 2/143 (1%)
Query: 10 VCVLLILLSCCAGCCFGFGTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRY 69
V V++++ + C+G GTFGFD HHR+SDPVKGIL VDDLP+K S YY A+AHRD
Sbjct: 10 VLVVVLISGWVSQICYGLGTFGFDMHHRFSDPVKGILDVDDLPEKLSLQYYKAMAHRD-- 67
Query: 70 FRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLF 129
+ + GR L+ K PLTFS GN+TYRL+SLG+LHY NVS+G P+L F+VALDTGSDLF
Sbjct: 68 WVIHGRRLSTSDEVKPPLTFSDGNETYRLSSLGYLHYANVSLGTPSLWFLVALDTGSDLF 127
Query: 130 WLPCDCVSCVHGLNSSSGQVIDF 152
WLPCDC SC+ GLN++SG+V F
Sbjct: 128 WLPCDCTSCIKGLNTTSGKVCYF 150
>gi|297739018|emb|CBI28370.3| unnamed protein product [Vitis vinifera]
Length = 150
Score = 186 bits (472), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 88/143 (61%), Positives = 110/143 (76%), Gaps = 2/143 (1%)
Query: 10 VCVLLILLSCCAGCCFGFGTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRY 69
V V++++ + C+G GTFGFD HHR+SDPVKGIL VDDLP+K S YY A+AHRD
Sbjct: 10 VLVVVLISGWVSQICYGLGTFGFDMHHRFSDPVKGILDVDDLPEKLSLQYYKAMAHRD-- 67
Query: 70 FRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLF 129
+ + GR L+ K PLTFS GN+TYRL+SLG+LHY NVS+G P+L F+VALDTGSDLF
Sbjct: 68 WVIHGRRLSTSDEVKPPLTFSDGNETYRLSSLGYLHYANVSLGTPSLWFLVALDTGSDLF 127
Query: 130 WLPCDCVSCVHGLNSSSGQVIDF 152
WLPCDC SC+ GLN++SG+V F
Sbjct: 128 WLPCDCTSCIKGLNTTSGKVCYF 150
>gi|115469998|ref|NP_001058598.1| Os06g0717900 [Oryza sativa Japonica Group]
gi|54291047|dbj|BAD61724.1| aspartic proteinase nepenthesin II-like [Oryza sativa Japonica
Group]
gi|113596638|dbj|BAF20512.1| Os06g0717900 [Oryza sativa Japonica Group]
Length = 307
Score = 185 bits (469), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 101/197 (51%), Positives = 128/197 (64%), Gaps = 11/197 (5%)
Query: 244 GLFGLGMDKTSVPSILANQGLIP-NSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTH 302
L GLGM+K SVPSILA+ G++ NSFSMCF DG GRI+FGD GS Q ETPF ++ TH
Sbjct: 8 ALMGLGMEKVSVPSILASTGVVKSNSFSMCFSKDGLGRINFGDTGSADQSETPFIVKSTH 67
Query: 303 PTYNITITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKR-----E 357
YNI+IT +SVG + F AI DSGTSFTYLNDPAYT + FN+ E+R
Sbjct: 68 SYYNISITSMSVGDKNLPLGFYAIADSGTSFTYLNDPAYTAYTTNFNAQISERRANFSGS 127
Query: 358 TSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKG----LYLYC 413
T + PFEYCY LSP+QT E PVV+LT GG F V P+ ++++ + YC
Sbjct: 128 TRSGPFPFEYCYSLSPDQTTVELPVVSLTTNGGAVFPVTSPVYPIAAQMTNGEIRIIGYC 187
Query: 414 LGVVKSD-NVNIIGREY 429
L V+KSD ++IIG+ +
Sbjct: 188 LAVIKSDLPIDIIGQNF 204
>gi|223946655|gb|ACN27411.1| unknown [Zea mays]
Length = 378
Score = 182 bits (463), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 108/291 (37%), Positives = 153/291 (52%), Gaps = 6/291 (2%)
Query: 148 QVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDV 207
Q D IY P S+TS +PC+ LC+ C + CPY + Y S+ T S+G L+ED
Sbjct: 2 QDRDLRIYRPAESTTSRHLPCSHELCQSVPGCTNPKQPCPYNIDYFSENTTSSGLLIEDT 61
Query: 208 LHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPN 267
LHL E V++ + GCG+ Q+G +LDG AP+GL GLGM SVPS LA GL+ N
Sbjct: 62 LHLNYREDHVP-VNASVIIGCGQKQSGDYLDGIAPDGLLGLGMADISVPSFLARAGLVQN 120
Query: 268 SFSMCFGSDGTGRISFGDKGSPGQGETPF-SLRQTHPTYNITITQVSVGGNAVN-FEFSA 325
SFSMCF D +GRI FGD+G P Q TPF L TY + + + +G + F A
Sbjct: 121 SFSMCFKEDSSGRIFFGDQGVPSQQSTPFVPLYGKLQTYAVNVDKSCIGHKCLEGTSFKA 180
Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNL 385
+ DSGTSFT L Y + F+ R D ++YCY SP + + P + L
Sbjct: 181 LVDSGTSFTSLPFDVYKAFTMEFDKQMNATR-VPYEDTTWKYCYSASPLEMP-DVPTITL 238
Query: 386 TMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKS-DNVNIIGREYPIANNI 435
T +PI+ + + L +CL V+ S + + II + + + ++
Sbjct: 239 TFAADKSLQAVNPILPFNDKQGALAGFCLAVLPSTEPIGIIAQNFLVGYHV 289
>gi|15010764|gb|AAK74041.1| AT3g51330/F24M12_370 [Arabidopsis thaliana]
gi|23505835|gb|AAN28777.1| At3g51330/F24M12_370 [Arabidopsis thaliana]
Length = 260
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 75/163 (46%), Positives = 108/163 (66%), Gaps = 5/163 (3%)
Query: 271 MCFGS--DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIFD 328
MCFG+ D GRISFGDKG Q ETP + PTY +++T+VSVGG+AV + A+FD
Sbjct: 1 MCFGNIIDVVGRISFGDKGYTDQMETPLLPTEPSPTYAVSVTEVSVGGDAVGVQLLALFD 60
Query: 329 SGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMK 388
+GTSFT+L +P Y I++ F+ +KR +LPFE+CY LSPN+T +P V +T +
Sbjct: 61 TGTSFTHLLEPEYGLITKAFDDHVTDKRRPIDPELPFEFCYDLSPNKTTILFPRVAMTFE 120
Query: 389 GGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDN--VNIIGREY 429
GG F+ +P+ IV +E +YCLG++KS + +NIIG+ +
Sbjct: 121 GGSQMFLRNPLFIVWNEDNSA-MYCLGILKSVDFKINIIGQNF 162
>gi|356543524|ref|XP_003540210.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 493
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 116/388 (29%), Positives = 181/388 (46%), Gaps = 38/388 (9%)
Query: 61 SALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIV 120
S L RD LR R + + + D +++ L+YT V +G P + F V
Sbjct: 41 SQLRARDE---LRHRRMLQSSSGVVDFSVQGTFDPFQVG----LYYTKVQLGTPPVEFNV 93
Query: 121 ALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQ-- 178
+DTGSD+ W+ C+ SC +G +SG I N + P +SSTSS + C+ C KQ
Sbjct: 94 QIDTGSDVLWVSCN--SC-NGCPQTSGLQIQLNFFDPGSSSTSSMIACSDQRCNNGKQSS 150
Query: 179 ---CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR--ISFGCGRVQT 233
C S + C Y +Y DG+ ++G+ V D++HL T + S + +S + FGC QT
Sbjct: 151 DATCSSQNNQCSYTFQY-GDGSGTSGYYVSDMMHLNTIFEGSMTTNSTAPVVFGCSNQQT 209
Query: 234 GSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD--GTGRISFGDKGSPG 290
G A +G+FG G + SV S L++QG+ P FS C D G G + G+ P
Sbjct: 210 GDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRIFSHCLKGDSSGGGILVLGEIVEPN 269
Query: 291 QGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA---------IFDSGTSFTYLNDPAY 341
T SL P YN+ + +SV G + + S I DSGT+ YL + AY
Sbjct: 270 IVYT--SLVPAQPHYNLNLQSISVNGQTLQIDSSVFATSNSRGTIVDSGTTLAYLAEEAY 327
Query: 342 TQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVN-DPIV 400
+ + T S CY+++ + T+ +P V+L GG + +
Sbjct: 328 DPFVSAITAAIPQSVRTVVSR--GNQCYLITSSVTDV-FPQVSLNFAGGASMILRPQDYL 384
Query: 401 IVSSEPKGLYLYCLGV--VKSDNVNIIG 426
I + G ++C+G ++ + I+G
Sbjct: 385 IQQNSIGGAAVWCIGFQKIQGQGITILG 412
>gi|356564743|ref|XP_003550608.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 490
Score = 140 bits (352), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 114/388 (29%), Positives = 177/388 (45%), Gaps = 38/388 (9%)
Query: 61 SALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIV 120
S L RD LR R + N + D +++ L+YT V +G P + F V
Sbjct: 38 SQLRARDA---LRHRRMLQSSNGVVDFSVQGTFDPFQVG----LYYTKVQLGTPPVEFNV 90
Query: 121 ALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQ-- 178
+DTGSD+ W+ C+ S G +SG I N + P +SSTSS + C+ C Q
Sbjct: 91 QIDTGSDVLWVSCNSCS---GCPQTSGLQIQLNFFDPGSSSTSSMIACSDQRCNNGIQSS 147
Query: 179 ---CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR--ISFGCGRVQT 233
C S + C Y +Y DG+ ++G+ V D++HL T + S + +S + FGC QT
Sbjct: 148 DATCSSQNNQCSYTFQY-GDGSGTSGYYVSDMMHLNTIFEGSVTTNSTAPVVFGCSNQQT 206
Query: 234 GSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD--GTGRISFGDKGSPG 290
G A +G+FG G + SV S L++QG+ P FS C D G G + G+ P
Sbjct: 207 GDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRVFSHCLKGDSSGGGILVLGEIVEPN 266
Query: 291 QGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA---------IFDSGTSFTYLNDPAY 341
T SL P YN+ + ++V G + + S I DSGT+ YL + AY
Sbjct: 267 IVYT--SLVPAQPHYNLNLQSIAVNGQTLQIDSSVFATSNSRGTIVDSGTTLAYLAEEAY 324
Query: 342 TQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVN-DPIV 400
+ + T S CY+++ + T +P V+L GG + +
Sbjct: 325 DPFVSAITASIPQSVHTVVSR--GNQCYLITSSVTEV-FPQVSLNFAGGASMILRPQDYL 381
Query: 401 IVSSEPKGLYLYCLGV--VKSDNVNIIG 426
I + G ++C+G ++ + I+G
Sbjct: 382 IQQNSIGGAAVWCIGFQKIQGQGITILG 409
>gi|224058947|ref|XP_002299658.1| predicted protein [Populus trichocarpa]
gi|222846916|gb|EEE84463.1| predicted protein [Populus trichocarpa]
Length = 451
Score = 138 bits (348), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 124/406 (30%), Positives = 176/406 (43%), Gaps = 36/406 (8%)
Query: 61 SALAHRDRYFRLRGRGLAAQGNDKTPL--TFSAGNDTYRLNSLGFLHYTNVSVGQPALSF 118
S L RDR R + G P+ TF + S L+YT + +G P F
Sbjct: 44 SQLKERDRVRHSRMLQSSGGGVVDFPVQGTFDPFLVGFYFGSFCRLYYTRLQLGSPPRDF 103
Query: 119 IVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQ 178
V +DTGSD+ W+ C S +G SSG I N + P +S T+S + C+ C L Q
Sbjct: 104 YVQIDTGSDVLWVSC---SSCNGCPVSSGLHIPLNFFDPGSSPTASLISCSDQRCSLGLQ 160
Query: 179 -----CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS--KSVDSRISFGCGRV 231
C + + C Y +Y DG+ ++G+ V D+LH T S K+ + I FGC +
Sbjct: 161 SSDSVCAAQNNQCGYTFQY-GDGSGTSGYYVSDLLHFDTILGGSVMKNSSAPIVFGCSTL 219
Query: 232 QTGSFLD-GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD--GTGRISFGDKGS 288
QTG A +G+FG G SV S LA+QG+ P FS C D G G + G+
Sbjct: 220 QTGDLTKPDRAVDGIFGFGQQDMSVISQLASQGITPRVFSHCLKGDDSGGGILVLGEIVE 279
Query: 289 PGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA---------IFDSGTSFTYLNDP 339
P TP L + P YN+ + + V G + + S I DSGT+ YL +
Sbjct: 280 PNIVYTP--LVPSQPHYNLNLQSIYVNGQTLAIDPSVFATSSNQGTIIDSGTTLAYLTEA 337
Query: 340 AYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGG-GPFFVNDP 398
AY S S CY L+ + N +P V+L GG +
Sbjct: 338 AYDPFISAITSTVSPSVSPYLSK--GNQCY-LTSSSINDVFPQVSLNFAGGTSMILIPQD 394
Query: 399 IVIVSSEPKGLYLYCLGVVKSDNVNIIGREYPIANNISLFHNCYSY 444
+I S G L+C+G K I G+E I ++ L + Y
Sbjct: 395 YLIQQSSINGAALWCVGFQK-----IQGQEITILGDLVLKDKIFVY 435
>gi|255634819|gb|ACU17770.1| unknown [Glycine max]
Length = 354
Score = 138 bits (348), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 105/336 (31%), Positives = 160/336 (47%), Gaps = 29/336 (8%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
L+YT V +G P + F V +DTGSD+ W+ C+ S G +SG I N + P +SSTS
Sbjct: 24 LYYTKVQLGTPPVEFNVQIDTGSDVLWVSCNSCS---GCPQTSGLQIQLNFFDPGSSSTS 80
Query: 164 SKVPCNSTLCELQKQ-----CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
S + C+ C Q C S + C Y +Y DG+ ++G+ V D++HL T + S
Sbjct: 81 SMIACSDQRCNNGIQSSDATCSSQNNQCSYTFQY-GDGSGTSGYYVSDMMHLNTIFEGSV 139
Query: 219 SVDSR--ISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
+ +S + FGC QTG A +G+FG G + SV S L++QG+ P FS C
Sbjct: 140 TTNSTAPVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRVFSHCLKG 199
Query: 276 D--GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA-------- 325
D G G + G+ P T SL P YN+ + ++V G + + S
Sbjct: 200 DSSGGGILVLGEIVEPNIVYT--SLVPAQPHYNLNLQSIAVNGQTLQIDSSVFATSNSRG 257
Query: 326 -IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVN 384
I DSGT+ YL + AY + + T+ S CY+++ + T +P V+
Sbjct: 258 TIVDSGTTLAYLAEEAYDPFVSAITASIPQSVHTAVSR--GNQCYLITSSVTEV-FPQVS 314
Query: 385 LTMKGGGPFFVN-DPIVIVSSEPKGLYLYCLGVVKS 419
L GG + +I + G ++C+G KS
Sbjct: 315 LNFAGGASMILRPQDYLIQQNSIGGAAVWCIGFQKS 350
>gi|297812425|ref|XP_002874096.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297319933|gb|EFH50355.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 493
Score = 137 bits (346), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 107/346 (30%), Positives = 166/346 (47%), Gaps = 33/346 (9%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
L+YT + +G P F V +DTGSD+ W+ C SC +G +SG I N + P +S T+
Sbjct: 80 LYYTKIRLGSPPRDFYVQVDTGSDVLWV--SCASC-NGCPQTSGLQIQLNFFDPGSSVTA 136
Query: 164 SKVPCNSTLCELQKQCPSAGSN-----CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
+ V C+ C Q +G + C Y +Y DG+ ++GF V DVL S
Sbjct: 137 TPVSCSDQRCSWGIQSSDSGCSVQNNLCAYTFQY-GDGSGTSGFYVSDVLQFDMIVGSSL 195
Query: 219 SVDSR--ISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
+S + FGC QTG + A +G+FG G SV S LA+QGL P FS C
Sbjct: 196 VPNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGLAPRVFSHCLKG 255
Query: 276 D--GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA-------- 325
+ G G + G+ P TP L + P YN+ + +SV G A+ S
Sbjct: 256 ENGGGGILVLGEIVEPNMVFTP--LVPSQPHYNVNLLSISVNGQALPINPSVFSTSNGQG 313
Query: 326 -IFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
I D+GT+ YL++ AY E N++++ R + CYV++ + + +P V
Sbjct: 314 TIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPVVSKG---NQCYVIATSVADI-FPPV 369
Query: 384 NLTMKGGGPFFVN-DPIVIVSSEPKGLYLYCLGV--VKSDNVNIIG 426
+L GG F+N +I + G ++C+G +++ + I+G
Sbjct: 370 SLNFAGGASMFLNPQDYLIQQNNVGGTAVWCIGFQRIQNQGITILG 415
>gi|224104765|ref|XP_002313558.1| predicted protein [Populus trichocarpa]
gi|222849966|gb|EEE87513.1| predicted protein [Populus trichocarpa]
Length = 468
Score = 137 bits (346), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 119/388 (30%), Positives = 175/388 (45%), Gaps = 37/388 (9%)
Query: 61 SALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIV 120
S L RDR GR L + G D + + L+YT + +G P F V
Sbjct: 14 SKLKERDRV--RHGRMLQSSGVGVVDFPVQGTFDPFLVG----LYYTRLQLGTPPRDFYV 67
Query: 121 ALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQ-- 178
+DTGSD+ W+ C SC +G +SG I N + P +S T+S + C+ C L Q
Sbjct: 68 QIDTGSDVLWVSCG--SC-NGCPVNSGLHIPLNFFDPGSSPTASLISCSDQRCSLGLQSS 124
Query: 179 ---CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR--ISFGCGRVQT 233
C + + C Y +Y DG+ ++G+ V D+LH T S +S I FGC +QT
Sbjct: 125 DSVCSAQNNLCGYNFQY-GDGSGTSGYYVSDLLHFDTVLGGSVMNNSSAPIVFGCSALQT 183
Query: 234 GSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD--GTGRISFGDKGSPG 290
G A +G+FG G SV S LA+QG+ P +FS C D G G + G+ P
Sbjct: 184 GDLTKSDRAVDGIFGFGQQDMSVVSQLASQGISPRAFSHCLKGDDSGGGILVLGEIVEPN 243
Query: 291 QGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA---------IFDSGTSFTYLNDPAY 341
TP L + P YN+ + +SV G + + S I DSGT+ YL + AY
Sbjct: 244 IVYTP--LVPSQPHYNLNMQSISVNGQTLAIDPSVFGTSSSQGTIIDSGTTLAYLAEAAY 301
Query: 342 TQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPF-FVNDPIV 400
S+ S +CY++S + N +P V+L GG + +
Sbjct: 302 DPFISAITSIVSPSVRPYLSK--GNHCYLIS-SSINDIFPQVSLNFAGGASMILIPQDYL 358
Query: 401 IVSSEPKGLYLYCLGV--VKSDNVNIIG 426
I S G L+C+G ++ + I+G
Sbjct: 359 IQQSSIGGAALWCIGFQKIQGQGITILG 386
>gi|10177232|dbj|BAB10606.1| protease-like protein [Arabidopsis thaliana]
Length = 539
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 106/346 (30%), Positives = 166/346 (47%), Gaps = 33/346 (9%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
L+YT + +G P F V +DTGSD+ W+ C SC +G +SG I N + P +S T+
Sbjct: 80 LYYTKLRLGTPPRDFYVQVDTGSDVLWV--SCASC-NGCPQTSGLQIQLNFFDPGSSVTA 136
Query: 164 SKVPCNSTLCELQKQCPSAGSN-----CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
S + C+ C Q +G + C Y +Y DG+ ++GF V DVL S
Sbjct: 137 SPISCSDQRCSWGIQSSDSGCSVQNNLCAYTFQY-GDGSGTSGFYVSDVLQFDMIVGSSL 195
Query: 219 SVDSR--ISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
+S + FGC QTG + A +G+FG G SV S LA+QG+ P FS C
Sbjct: 196 VPNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLKG 255
Query: 276 D--GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA-------- 325
+ G G + G+ P TP L + P YN+ + +SV G A+ S
Sbjct: 256 ENGGGGILVLGEIVEPNMVFTP--LVPSQPHYNVNLLSISVNGQALPINPSVFSTSNGQG 313
Query: 326 -IFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
I D+GT+ YL++ AY E N++++ R + CYV++ + + +P V
Sbjct: 314 TIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPVVSKG---NQCYVITTSVGDI-FPPV 369
Query: 384 NLTMKGGGPFFVN-DPIVIVSSEPKGLYLYCLGV--VKSDNVNIIG 426
+L GG F+N +I + G ++C+G +++ + I+G
Sbjct: 370 SLNFAGGASMFLNPQDYLIQQNNVGGTAVWCIGFQRIQNQGITILG 415
>gi|30688682|ref|NP_197676.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|110736370|dbj|BAF00154.1| protease-like protein [Arabidopsis thaliana]
gi|332005704|gb|AED93087.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 493
Score = 135 bits (340), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 106/346 (30%), Positives = 166/346 (47%), Gaps = 33/346 (9%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
L+YT + +G P F V +DTGSD+ W+ C SC +G +SG I N + P +S T+
Sbjct: 80 LYYTKLRLGTPPRDFYVQVDTGSDVLWV--SCASC-NGCPQTSGLQIQLNFFDPGSSVTA 136
Query: 164 SKVPCNSTLCELQKQCPSAGSN-----CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
S + C+ C Q +G + C Y +Y DG+ ++GF V DVL S
Sbjct: 137 SPISCSDQRCSWGIQSSDSGCSVQNNLCAYTFQY-GDGSGTSGFYVSDVLQFDMIVGSSL 195
Query: 219 SVDSR--ISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
+S + FGC QTG + A +G+FG G SV S LA+QG+ P FS C
Sbjct: 196 VPNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLKG 255
Query: 276 D--GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA-------- 325
+ G G + G+ P TP L + P YN+ + +SV G A+ S
Sbjct: 256 ENGGGGILVLGEIVEPNMVFTP--LVPSQPHYNVNLLSISVNGQALPINPSVFSTSNGQG 313
Query: 326 -IFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
I D+GT+ YL++ AY E N++++ R + CYV++ + + +P V
Sbjct: 314 TIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPVVSKG---NQCYVITTSVGDI-FPPV 369
Query: 384 NLTMKGGGPFFVN-DPIVIVSSEPKGLYLYCLGV--VKSDNVNIIG 426
+L GG F+N +I + G ++C+G +++ + I+G
Sbjct: 370 SLNFAGGASMFLNPQDYLIQQNNVGGTAVWCIGFQRIQNQGITILG 415
>gi|449456068|ref|XP_004145772.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449496218|ref|XP_004160076.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 500
Score = 135 bits (339), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 115/377 (30%), Positives = 179/377 (47%), Gaps = 35/377 (9%)
Query: 63 LAHRDRYFRLR-GRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVA 121
+AH R+R GR L + G + FS + TY +G L+YT V +G P F V
Sbjct: 45 IAHLRSRDRVRHGRMLQSSGG---VIDFSV-SGTYDPFLVG-LYYTRVQLGNPPKDFYVQ 99
Query: 122 LDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQ--- 178
+DTGSD+ W+ C+ SC +G ++SG I N + P +S+T+S V C+ +C L Q
Sbjct: 100 IDTGSDVLWVSCN--SC-NGCPATSGLQIPLNFFDPGSSTTASLVSCSDQICALGVQSSD 156
Query: 179 --CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQSKSVDSRISFGCGRVQTG 234
C + C Y +Y DG+ ++G+ V D++HL D + + + + FGC QTG
Sbjct: 157 SACFGQSNQCAYVFQY-GDGSGTSGYYVMDMIHLDVVIDSSVTSNSSASVVFGCSTSQTG 215
Query: 235 SFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD--GTGRISFGDKGSPGQ 291
A +G+FG G SV S L+++G+ P FS C D G G + G+ P
Sbjct: 216 DLTKSDRAVDGIFGFGQQDLSVISQLSSRGIAPKVFSHCLKGDDSGGGILVLGEIVEPNV 275
Query: 292 GETPFSLRQTHPTYNITITQVSVGGN---------AVNFEFSAIFDSGTSFTYLNDPAYT 342
TP L + P YN+ + +SV G A + I DSGT+ YL + AY
Sbjct: 276 VYTP--LVPSQPHYNLNLQSISVNGQVLPISPAVFATSSSQGTIIDSGTTLAYLAEEAYN 333
Query: 343 QISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVN-DPIVI 401
++ + ++ L CYV S + ++ +P V+L GG + +I
Sbjct: 334 AFVVAVTNIVSQSTQSVV--LKGNRCYVTSSSVSDI-FPQVSLNFAGGASLVLGAQDYLI 390
Query: 402 VSSEPKGLYLYCLGVVK 418
+ G ++C+G K
Sbjct: 391 QQNSVGGTTVWCIGFQK 407
>gi|357125326|ref|XP_003564345.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 506
Score = 134 bits (337), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 121/405 (29%), Positives = 185/405 (45%), Gaps = 47/405 (11%)
Query: 51 LPKKGSFAYYSALAHRDRYFRLRGRGL-----AAQGNDKTPLTFSAGNDTYRLNSLGFLH 105
LP KG + L RD R RGL A G P+ SA + Y + L+
Sbjct: 38 LPHKGVPVEH--LKERDGAHHARRRGLLGGAPAVAGVVDFPVEGSA--NPYMVG----LY 89
Query: 106 YTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+T V +G PA + V +DTGSD+ W+ C C C +SSG I ++P++SSTSS
Sbjct: 90 FTRVKLGNPAKEYFVQIDTGSDILWVACSPCTGC----PTSSGLNIQLEFFNPDSSSTSS 145
Query: 165 KVPCNSTLCELQKQCPSA--------GSNCPYQVRYLSDGTMSTGFLVEDVLHLAT--DE 214
++PC+ C Q A S C Y Y DG+ ++GF V D ++ T
Sbjct: 146 RIPCSDDRCTAALQTGEAVCQSSDSPSSPCGYTFTY-GDGSGTSGFYVSDTMYFDTVMGN 204
Query: 215 KQSKSVDSRISFGCGRVQTGSFLD-GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
+Q+ + + + FGC Q+G + A +G+FG G + SV S L + G+ P +FS C
Sbjct: 205 EQTANSSASVVFGCSNSQSGDLMKTDRAVDGIFGFGQHQLSVVSQLYSLGVSPKTFSHCL 264
Query: 274 -GSD-GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS------- 324
GSD G G + G+ PG TP L + P YN+ + ++V G + + S
Sbjct: 265 KGSDNGGGILVLGEIVEPGLVFTP--LVPSQPHYNLNLESIAVSGQKLPIDSSLFATSNT 322
Query: 325 --AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
I DSGT+ YL D AY N++A + S + ++ + + +P
Sbjct: 323 QGTIVDSGTTLVYLVDGAYDPF---INAIAAAVSPSVRSVVSKGIQCFVTTSSVDSSFPT 379
Query: 383 VNLTMKGGGPFFVN-DPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
L KGG V + ++ L+C+G +S + I+G
Sbjct: 380 ATLYFKGGVSMTVKPENYLLQQGSVDNNVLWCIGWQRSQGITILG 424
>gi|413924529|gb|AFW64461.1| hypothetical protein ZEAMMB73_591827 [Zea mays]
Length = 217
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 77/183 (42%), Positives = 97/183 (53%), Gaps = 10/183 (5%)
Query: 36 HRYSDPVKGILA--VDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGN 93
HR SD + + V P++GS YY AL D + + R LA K TFS GN
Sbjct: 33 HRLSDEARLEVGPRVGWWPQRGSGEYYRALVRSD--IQRQKRRLAVLSLSKGGSTFSPGN 90
Query: 94 DTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFN 153
D LG+L+Y V VG PA SF+VALDTGSDLFW+PCDC+ C D
Sbjct: 91 D------LGWLYYAWVDVGTPATSFLVALDTGSDLFWVPCDCIQCAPLSGYRGNLDRDLR 144
Query: 154 IYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
IY P S+TS +PC+ LC+ C + CPY + Y S+ T S+G L+ED LHL
Sbjct: 145 IYRPAESTTSRHLPCSHELCQSVPGCTNPKQPCPYNIDYFSENTTSSGLLIEDTLHLNYR 204
Query: 214 EKQ 216
E
Sbjct: 205 EDH 207
>gi|255565531|ref|XP_002523756.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223537060|gb|EEF38696.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 507
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 108/366 (29%), Positives = 169/366 (46%), Gaps = 39/366 (10%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
L++T V +G P F V +DTGSD+ W+ C S +G +SG I + P +S+T+
Sbjct: 83 LYFTRVQLGSPPKDFYVQIDTGSDVLWVSC---SSCNGCPVTSGLQIPLTFFDPGSSTTA 139
Query: 164 SKVPCNSTLCELQKQ-----CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS- 217
+ V C+ C Q C S + C Y +Y DG+ ++G+ V D++HL T S
Sbjct: 140 ALVSCSDQRCTAGIQSSDSLCSSRTNQCGYTFQY-GDGSGTSGYYVADLMHLDTLLLSSG 198
Query: 218 ------KSVDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFS 270
++ DS +SF C +QTG A +G+FG G + SV S LA+QG+ P FS
Sbjct: 199 ELSQICQTYDSSVSFMCSTLQTGDLTKSDRAVDGIFGFGQQEMSVISQLASQGITPRVFS 258
Query: 271 MCFGSD--GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA--- 325
C D G G + G+ P TP L + P YN+ + +SV G + + S
Sbjct: 259 HCLKGDDSGGGVLVLGEIVEPNIVYTP--LVPSQPHYNLYLQSISVAGQTLAIDPSVFGA 316
Query: 326 ------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFE 379
I DSGT+ YL + AY S+ T S CY+++ + N
Sbjct: 317 SSNQGTIVDSGTTLAYLAEGAYDPFVSAITSVVSLNARTYLSK--GNQCYLVT-SSVNDV 373
Query: 380 YPVVNLTMKGGGPFFVN-DPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREYPIANNISLF 438
+P V+L GG +N ++ + G ++C+G K+ G++ I ++ L
Sbjct: 374 FPQVSLNFAGGASLILNPQDYLLQQNSVGGAAVWCVGFQKTP-----GQQITILGDLVLK 428
Query: 439 HNCYSY 444
+ Y
Sbjct: 429 DKIFVY 434
>gi|168064205|ref|XP_001784055.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664441|gb|EDQ51161.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 459
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 115/381 (30%), Positives = 176/381 (46%), Gaps = 47/381 (12%)
Query: 59 YYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSF 118
YY L D+ RLR R L + S +DT+ L+YT + +G P F
Sbjct: 12 YYRTLREHDQR-RLR-RILP----EVVAFPISGDDDTFTTG----LYYTRIYLGTPPQQF 61
Query: 119 IVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCEL--Q 176
V +DTGSD+ W+ +CV C + +S + +I+ P S++ + + C C L
Sbjct: 62 YVHVDTGSDVAWV--NCVPCTN-CKRASNVALPISIFDPEKSTSKTSISCTDEECYLASN 118
Query: 177 KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVL---HLATDEKQSKSVDSRISFGCGRVQT 233
+C +CPY Y DG+ + G+L+ DVL + + + S +R++FGCG QT
Sbjct: 119 SKCSFNSMSCPYSTLY-GDGSSTAGYLINDVLSFNQVPSGNSTATSGTARLTFGCGSNQT 177
Query: 234 GSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD--GTGRISFGDKGSPGQ 291
G++L +GL G G + S+PS L+ Q + N F+ C D G+G + G PG
Sbjct: 178 GTWLT----DGLVGFGQAEVSLPSQLSKQNVSVNIFAHCLQGDNKGSGTLVIGHIREPGL 233
Query: 292 GETPFSLRQTHPTYNITITQVSVGGNAVN----FEFS----AIFDSGTSFTYLNDPAYTQ 343
TP +Q+H YN+ + + V G V F+ S I DSGT+ TYL PAY Q
Sbjct: 234 VYTPIVPKQSH--YNVELLNIGVSGTNVTTPTAFDLSNSGGVIMDSGTTLTYLVQPAYDQ 291
Query: 344 ISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVS 403
AK + + LP + + + +P V L GG ++ P +
Sbjct: 292 FQ------AKVRDCMRSGVLPVAFQFFCT---IEGYFPNVTLYFAGGAAMLLS-PSSYLY 341
Query: 404 SE--PKGLYLYCLGVVKSDNV 422
E GL YC ++S +V
Sbjct: 342 KEMLTTGLSAYCFSWLESTSV 362
>gi|51536458|gb|AAU05467.1| At5g22850 [Arabidopsis thaliana]
gi|55733777|gb|AAV59285.1| At5g22850 [Arabidopsis thaliana]
Length = 426
Score = 131 bits (329), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 100/313 (31%), Positives = 150/313 (47%), Gaps = 30/313 (9%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
L+YT + +G P F V +DTGSD+ W+ C SC +G +SG I N + P +S T+
Sbjct: 80 LYYTKLRLGTPPRDFYVQVDTGSDVLWV--SCASC-NGCPQTSGLQIQLNFFDPGSSVTA 136
Query: 164 SKVPCNSTLCELQKQCPSAGSN-----CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
S + C+ C Q +G + C Y +Y DG+ ++GF V DVL S
Sbjct: 137 SPISCSDQRCSWGIQSSDSGCSVQNNLCAYTFQY-GDGSGTSGFYVSDVLQFDMIVGSSL 195
Query: 219 SVDSR--ISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
+S + FGC QTG + A +G+FG G SV S LA+QG+ P FS C
Sbjct: 196 VPNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLKG 255
Query: 276 D--GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA-------- 325
+ G G + G+ P TP L + P YN+ + +SV G A+ S
Sbjct: 256 ENGGGGILVLGEIVEPNMVFTP--LVPSQPHYNVNLLSISVNGQALPINPSVFSTSNGQG 313
Query: 326 -IFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
I D+GT+ YL++ AY E N++++ R + CYV++ + + +P V
Sbjct: 314 TIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPVVSKG---NQCYVITTSVGDI-FPPV 369
Query: 384 NLTMKGGGPFFVN 396
+L GG F+N
Sbjct: 370 SLNFAGGASMFLN 382
>gi|224068901|ref|XP_002326227.1| predicted protein [Populus trichocarpa]
gi|222833420|gb|EEE71897.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 128 bits (321), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 112/409 (27%), Positives = 180/409 (44%), Gaps = 52/409 (12%)
Query: 56 SFAYYSALAHRDRYFRLRGRGLAA---QGNDK--------------TPLTFSAGNDTYRL 98
S Y ++L H +R F L GL + D+ + +D Y +
Sbjct: 4 SAVYCASLLHLERAFPLNNHGLELHQLRARDRLRHARLLQGFVGGVVDFSVQGSSDPYLV 63
Query: 99 NSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSP 157
L++T V +G P F V +DTGSD+ W+ C+ C +C +SG I N +
Sbjct: 64 G----LYFTKVKLGSPPREFNVQIDTGSDVLWVCCNSCNNCPR----TSGLGIQLNFFDS 115
Query: 158 NTSSTSSKVPCNSTLCE-----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLAT 212
++SST+ +V C+ +C QC S C Y +Y DG+ ++G+ V D L+
Sbjct: 116 SSSSTAGQVRCSDPICTSAVQTTATQCSSQTDQCSYTFQY-GDGSGTSGYYVSDTLYFDA 174
Query: 213 DEKQS--KSVDSRISFGCGRVQTGSFLD-GAAPNGLFGLGMDKTSVPSILANQGLIPNSF 269
QS + + I FGC Q+G A +G+FG G + SV S L+ +G+ P F
Sbjct: 175 ILGQSLIDNSSALIVFGCSAYQSGDLTKTDKAVDGIFGFGQGELSVISQLSTRGITPRVF 234
Query: 270 SMCFGSDGT--GRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA-- 325
S C DG+ G + G+ PG +P L + P YN+ + ++V G + + +A
Sbjct: 235 SHCLKGDGSGGGILVLGEILEPGIVYSP--LVPSQPHYNLNLLSIAVNGQLLPIDPAAFA 292
Query: 326 -------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNF 378
I DSGT+ YL AY N++ TS CY++S + +
Sbjct: 293 TSNSQGTIVDSGTTLAYLVAEAYDPFVSAVNAIVSPSVTPITSK--GNQCYLVSTSVSQM 350
Query: 379 EYPVVNLTMKGGGPFFVN-DPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
+P+ + GG + + +I G ++C+G K V I+G
Sbjct: 351 -FPLASFNFAGGASMVLKPEDYLIPFGSSGGSAMWCIGFQKVQGVTILG 398
>gi|168042409|ref|XP_001773681.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675069|gb|EDQ61569.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 399
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 102/338 (30%), Positives = 149/338 (44%), Gaps = 37/338 (10%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L+YT + +G P + + V +DTGSD+ WL C C SCV S I Y P+ SST
Sbjct: 36 LYYTKIYLGTPPVGYYVQVDTGSDVTWLNCAPCTSCVTETQLPS---IKLTTYDPSRSST 92
Query: 163 SSKVPCNSTLCEL-----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
+ C + C + C SAG C Y Y DG+ + G+ ++DV+ +
Sbjct: 93 DGALSCRDSNCGAALGSNEVSCTSAGY-CAYSTTY-GDGSSTQGYFIQDVMTFQEIHNNT 150
Query: 218 K-SVDSRISFGCGRVQTGSFL-DGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
+ + + + FGCG Q+G+ L A +GL G G S+PS LA+ G + N F+ C
Sbjct: 151 QVNGTASVYFGCGTTQSGNLLMSSRALDGLIGFGQAAVSIPSQLASMGKVGNRFAHCLQG 210
Query: 276 D--GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS--------- 324
D G G I G P TP R Y + + ++V G V S
Sbjct: 211 DNQGGGTIVIGSVSEPNISYTPIVSRN---HYAVGMQNIAVNGRNVTTPASFDTTSTSAG 267
Query: 325 -AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
I DSGT+ YL DPAYTQ ++ + + L +C + + ++P V
Sbjct: 268 GVIMDSGTTLAYLVDPAYTQFVNAVSTFESSMFSSHSQCLQLAWCSLQA------DFPTV 321
Query: 384 NLTMKGGGPFFVNDPIVIVSSEP--KGLYLYCLGVVKS 419
L G + P + S+P G YC+G KS
Sbjct: 322 KLFFDAGAVMNLT-PRNYLYSQPLQNGQAAYCMGWQKS 358
>gi|53791672|dbj|BAD53242.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
Length = 504
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 108/363 (29%), Positives = 165/363 (45%), Gaps = 37/363 (10%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L++T V +G P + V +DTGSD+ W+ C C C SSSG I ++P+TSST
Sbjct: 90 LYFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGC----PSSSGLNIQLEFFNPDTSST 145
Query: 163 SSKVPCNSTLCELQKQCPSA------GSNCPYQVRYLSDGTMSTGFLVEDVLHLAT--DE 214
SSK+PC+ C Q A S C Y Y DG+ ++G+ V D ++ T
Sbjct: 146 SSKIPCSDDRCTAALQTSEAVCQTSDNSPCGYTFTY-GDGSGTSGYYVSDTMYFDTVMGN 204
Query: 215 KQSKSVDSRISFGCGRVQTGSFLD-GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
+Q+ + + I FGC Q+G A +G+FG G + SV S L + G+ P FS C
Sbjct: 205 EQTANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCL 264
Query: 274 -GSD-GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS------- 324
GSD G G + G+ PG TP L + P YN+ + + V G + + S
Sbjct: 265 KGSDNGGGILVLGEIVEPGLVYTP--LVPSQPHYNLNLESIVVNGQKLPIDSSLFTTSNT 322
Query: 325 --AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
I DSGT+ YL D AY + + S C+V S + + +P
Sbjct: 323 QGTIVDSGTTLAYLADGAYDPFVNAITAAVSPSVRSLVSK--GNQCFVTS-SSVDSSFPT 379
Query: 383 VNLTMKGGGPFFVN-DPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREYPIANNISLFHNC 441
V+L GG V + ++ + L+C+G ++ G++ I ++ L
Sbjct: 380 VSLYFMGGVAMTVKPENYLLQQASIDNNVLWCIGWQRNQ-----GQQITILGDLVLKDKI 434
Query: 442 YSY 444
+ Y
Sbjct: 435 FVY 437
>gi|15232960|ref|NP_186923.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|6728988|gb|AAF26986.1|AC018363_31 putative aspartyl protease [Arabidopsis thaliana]
gi|21593593|gb|AAM65560.1| putative aspartyl protease [Arabidopsis thaliana]
gi|332640332|gb|AEE73853.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 488
Score = 124 bits (312), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 81/254 (31%), Positives = 128/254 (50%), Gaps = 24/254 (9%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L++ + +G P+ F V +DTGSD+ W+ C C+ C +++ Y + SST
Sbjct: 84 LYFAKIGLGTPSRDFHVQVDTGSDILWVNCAGCIRC-----PRKSDLVELTPYDVDASST 138
Query: 163 SSKVPCNSTLCEL--QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLH--LATDEKQSK 218
+ V C+ C Q+ +GS C Y + Y DG+ + G+LV+DV+H L T +Q+
Sbjct: 139 AKSVSCSDNFCSYVNQRSECHSGSTCQYVIMY-GDGSSTNGYLVKDVVHLDLVTGNRQTG 197
Query: 219 SVDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-GSD 276
S + I FGCG Q+G + AA +G+ G G +S S LA+QG + SF+ C ++
Sbjct: 198 STNGTIIFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKVKRSFAHCLDNNN 257
Query: 277 GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA---------IF 327
G G + G+ SP TP + H Y++ + + VG + + +A I
Sbjct: 258 GGGIFAIGEVVSPKVKTTPMLSKSAH--YSVNLNAIEVGNSVLELSSNAFDSGDDKGVII 315
Query: 328 DSGTSFTYLNDPAY 341
DSGT+ YL D Y
Sbjct: 316 DSGTTLVYLPDAVY 329
>gi|218189149|gb|EEC71576.1| hypothetical protein OsI_03949 [Oryza sativa Indica Group]
Length = 504
Score = 124 bits (312), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 107/363 (29%), Positives = 164/363 (45%), Gaps = 37/363 (10%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L++T V +G P + V +DTGSD+ W+ C C C SSSG I ++P+TSST
Sbjct: 90 LYFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGC----PSSSGLNIQLEFFNPDTSST 145
Query: 163 SSKVPCNSTLCELQKQCPSA------GSNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDE 214
SSK+PC+ C Q A S C Y Y DG+ ++G+ V D ++
Sbjct: 146 SSKIPCSDDRCTAALQTSEAVCQTSDNSPCGYTFTY-GDGSGTSGYYVSDTMYFDSVMGN 204
Query: 215 KQSKSVDSRISFGCGRVQTGSFLD-GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
+Q+ + + I FGC Q+G A +G+FG G + SV S L + G+ P FS C
Sbjct: 205 EQTANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCL 264
Query: 274 -GSD-GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS------- 324
GSD G G + G+ PG TP L + P YN+ + + V G + + S
Sbjct: 265 KGSDNGGGILVLGEIVEPGLVYTP--LVPSQPHYNLNLESIVVNGQKLPIDSSLFTTSNT 322
Query: 325 --AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
I DSGT+ YL D AY + + S C+V S + + +P
Sbjct: 323 QGTIVDSGTTLAYLADGAYDPFVNAITAAVSPSVRSLVSK--GNQCFVTS-SSVDSSFPT 379
Query: 383 VNLTMKGGGPFFVN-DPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREYPIANNISLFHNC 441
V+L GG V + ++ + L+C+G ++ G++ I ++ L
Sbjct: 380 VSLYFMGGVAMTVKPENYLLQQASIDNNVLWCIGWQRNQ-----GQQITILGDLVLKDKI 434
Query: 442 YSY 444
+ Y
Sbjct: 435 FVY 437
>gi|297828736|ref|XP_002882250.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297328090|gb|EFH58509.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 488
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 81/254 (31%), Positives = 126/254 (49%), Gaps = 24/254 (9%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L++ + +G P+ F V +DTGSD+ W+ C C+ C +++ Y + SST
Sbjct: 84 LYFAKIGLGTPSRDFHVQVDTGSDILWVNCAGCIRC-----PRKSDLVELTPYDADASST 138
Query: 163 SSKVPCNSTLCEL--QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLH--LATDEKQSK 218
+ V C+ C Q+ +GS C Y + Y DG+ + G+LV DV+H L T +Q+
Sbjct: 139 AKSVSCSDNFCSYVNQRSECHSGSTCQYVILY-GDGSSTNGYLVRDVVHLDLVTGNRQTG 197
Query: 219 SVDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-GSD 276
S + I FGCG Q+G + AA +G+ G G +S S LA+QG + SF+ C ++
Sbjct: 198 STNGTIIFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKVKRSFAHCLDNNN 257
Query: 277 GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA---------IF 327
G G + G+ SP TP + H Y++ + + VG + + A I
Sbjct: 258 GGGIFAIGEVVSPKVKTTPMLSKSAH--YSVNLNAIEVGNSVLQLSSDAFDSGDDKGVII 315
Query: 328 DSGTSFTYLNDPAY 341
DSGT+ YL D Y
Sbjct: 316 DSGTTLVYLPDAVY 329
>gi|222619345|gb|EEE55477.1| hypothetical protein OsJ_03658 [Oryza sativa Japonica Group]
Length = 530
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 107/362 (29%), Positives = 164/362 (45%), Gaps = 37/362 (10%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++T V +G P + V +DTGSD+ W+ C C C SSSG I ++P+TSSTS
Sbjct: 117 YFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGC----PSSSGLNIQLEFFNPDTSSTS 172
Query: 164 SKVPCNSTLCELQKQCPSA------GSNCPYQVRYLSDGTMSTGFLVEDVLHLAT--DEK 215
SK+PC+ C Q A S C Y Y DG+ ++G+ V D ++ T +
Sbjct: 173 SKIPCSDDRCTAALQTSEAVCQTSDNSPCGYTFTY-GDGSGTSGYYVSDTMYFDTVMGNE 231
Query: 216 QSKSVDSRISFGCGRVQTGSFLD-GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF- 273
Q+ + + I FGC Q+G A +G+FG G + SV S L + G+ P FS C
Sbjct: 232 QTANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLK 291
Query: 274 GSD-GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS-------- 324
GSD G G + G+ PG TP L + P YN+ + + V G + + S
Sbjct: 292 GSDNGGGILVLGEIVEPGLVYTP--LVPSQPHYNLNLESIVVNGQKLPIDSSLFTTSNTQ 349
Query: 325 -AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
I DSGT+ YL D AY + + S C+V S + + +P V
Sbjct: 350 GTIVDSGTTLAYLADGAYDPFVNAITAAVSPSVRSLVSK--GNQCFVTS-SSVDSSFPTV 406
Query: 384 NLTMKGGGPFFVN-DPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREYPIANNISLFHNCY 442
+L GG V + ++ + L+C+G ++ G++ I ++ L +
Sbjct: 407 SLYFMGGVAMTVKPENYLLQQASIDNNVLWCIGWQRNQ-----GQQITILGDLVLKDKIF 461
Query: 443 SY 444
Y
Sbjct: 462 VY 463
>gi|356570798|ref|XP_003553571.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 500
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 108/382 (28%), Positives = 169/382 (44%), Gaps = 38/382 (9%)
Query: 62 ALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVA 121
AL RDR GR L + +D Y + L++T V +G PA F V
Sbjct: 46 ALRARDR--ARHGRILQGVVGGVVDFSVQGTSDPYFVG----LYFTKVKLGSPAKEFYVQ 99
Query: 122 LDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQ-- 178
+DTGSD+ W+ C C +C H SSG I+ + + SST++ V C +C Q
Sbjct: 100 IDTGSDILWINCITCSNCPH----SSGLGIELDFFDTAGSSTAALVSCGDPICSYAVQTA 155
Query: 179 ---CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLAT---DEKQSKSVDSRISFGCGRVQ 232
C S + C Y +Y DG+ +TG+ V D ++ T + + S I FGC Q
Sbjct: 156 TSECSSQANQCSYTFQY-GDGSGTTGYYVSDTMYFDTVLLGQSVVANSSSTIIFGCSTYQ 214
Query: 233 TGSFLD-GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--GSDGTGRISFGDKGSP 289
+G A +G+FG G SV S L+++G+ P FS C G +G G + G+ P
Sbjct: 215 SGDLTKTDKAVDGIFGFGPGALSVISQLSSRGVTPKVFSHCLKGGENGGGVLVLGEILEP 274
Query: 290 GQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA---------IFDSGTSFTYLNDPA 340
+P L + P YN+ + ++V G + + + I DSGT+ YL A
Sbjct: 275 SIVYSP--LVPSQPHYNLNLQSIAVNGQLLPIDSNVFATTNNQGTIVDSGTTLAYLVQEA 332
Query: 341 YTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVN-DPI 399
Y + + + + S CY++S N +P V+L GG +N +
Sbjct: 333 YNPFVKAITAAVSQFSKPIISK--GNQCYLVS-NSVGDIFPQVSLNFMGGASMVLNPEHY 389
Query: 400 VIVSSEPKGLYLYCLGVVKSDN 421
++ G ++C+G K +
Sbjct: 390 LMHYGFLDGAAMWCIGFQKVEQ 411
>gi|168022164|ref|XP_001763610.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685103|gb|EDQ71500.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 308
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 93/298 (31%), Positives = 140/298 (46%), Gaps = 37/298 (12%)
Query: 67 DRYFRLRG---RGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALD 123
D Y LR R L + S ND + + L+YT +S+G P F V +D
Sbjct: 4 DHYHTLRKHDQRRLRRMLPEVVSFPISGDNDIFAMG----LYYTRISLGTPPQQFYVDVD 59
Query: 124 TGSDLFWLPCD-CVSCVHGLNSSSGQV-IDFNIYSPNTSSTSSKVPCNSTLCEL---QKQ 178
TGS++ W+ C C C H SG V + + + P S+T + C C + + Q
Sbjct: 60 TGSNVAWVKCAPCTGCEH-----SGDVPVPMSTFDPRKSTTKISISCTDAECGVLNKKLQ 114
Query: 179 CPSAGSNCPYQVRYLSDGTMSTGFLVEDVL---HLATDEKQSKSVDSRISFGCGRVQTGS 235
C +CPY + Y DG+ + G+ + DV + +D +KS +R+ FGCG QTGS
Sbjct: 115 CSPERLSCPYSLLY-GDGSSTAGYYLNDVFTFNQVPSDNSTAKSGTARLVFGCGGTQTGS 173
Query: 236 FLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGR--ISFGDKGSPGQGE 293
+ + +GL G G S+P+ LA Q + N F+ C D +GR + G P
Sbjct: 174 W----SVDGLLGFGPTTVSLPNQLAQQNISVNIFAHCLQGDVSGRGSLVIGTIREPDLVY 229
Query: 294 TPFSLRQTHPTYNITITQVSVGGNAV------NFEFS--AIFDSGTSFTYLNDPAYTQ 343
TP + H YN+ + + + G V + E++ I DSGT+ TYL PAY +
Sbjct: 230 TPMVFGEDH--YNVQLLNIGISGRNVTTPASFDLEYTGGVIIDSGTTLTYLVQPAYDE 285
>gi|42569679|ref|NP_181205.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|330254186|gb|AEC09280.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 512
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 116/383 (30%), Positives = 177/383 (46%), Gaps = 47/383 (12%)
Query: 41 PVKGILAVDDLPKKGSFAYYSALAHRDRYFRLR-----GRGLAAQGNDKTPLTFSAGNDT 95
P++ +D+L + S L RDR R GR + G P+ S+ D
Sbjct: 43 PLQRAFPLDELVE------LSELRARDRVRHARILLGGGRQSSVGGVVDFPVQGSS--DP 94
Query: 96 YRLNS-LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFN 153
Y + S + L++T V +G P F V +DTGSD+ W+ C C +C H SSG ID +
Sbjct: 95 YLVGSKMTMLYFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPH----SSGLGIDLH 150
Query: 154 IYSPNTSSTSSKVPCNSTLCE-----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVL 208
+ S T+ V C+ +C QC S + C Y RY DG+ ++G+ + D
Sbjct: 151 FFDAPGSLTAGSVTCSDPICSSVFQTTAAQC-SENNQCGYSFRY-GDGSGTSGYYMTDTF 208
Query: 209 HLATDEKQSKSVDSR--ISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLI 265
+ +S +S I FGC Q+G A +G+FG G K SV S L+++G+
Sbjct: 209 YFDAILGESLVANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGIT 268
Query: 266 PNSFSMCFGSDGTGRISF--GDKGSPGQGETPFSLRQTHPTYNITITQVSVGG-----NA 318
P FS C DG+G F G+ PG +P L + P YN+ + + V G +A
Sbjct: 269 PPVFSHCLKGDGSGGGVFVLGEILVPGMVYSP--LVPSQPHYNLNLLSIGVNGQMLPLDA 326
Query: 319 VNFEFS----AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP-FEYCYVLSP 373
FE S I D+GT+ TYL AY N+++ + T + E CY++S
Sbjct: 327 AVFEASNTRGTIVDTGTTLTYLVKEAYDLF---LNAISNSVSQLVTPIISNGEQCYLVST 383
Query: 374 NQTNFEYPVVNLTMKGGGPFFVN 396
+ ++ +P V+L GG +
Sbjct: 384 SISDM-FPSVSLNFAGGASMMLR 405
>gi|357507803|ref|XP_003624190.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355499205|gb|AES80408.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 476
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 104/300 (34%), Positives = 135/300 (45%), Gaps = 38/300 (12%)
Query: 64 AHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALD 123
AH DR RGR LAA PL GN L S L+YT V +G PA F V +D
Sbjct: 43 AHDDRR---RGRFLAAI---DVPL---GGNG---LPSSTGLYYTKVGLGSPAKEFYVQVD 90
Query: 124 TGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSA 182
TGSD+ W+ C C +C SG +D +Y PN S TS+ VPC C P +
Sbjct: 91 TGSDILWVNCAGCTAC----PKKSGLGMDLTLYDPNGSKTSNAVPCGDGFCTDTYSGPIS 146
Query: 183 G----SNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQSKSVDSRISFGCGRVQTGSF 236
G +CPY + Y DG+ ++G V D L + +K +S + FGCG Q+GS
Sbjct: 147 GCKQDMSCPYSITY-GDGSTTSGSFVNDSLTFDEVSGNLHTKPDNSSVIFGCGAKQSGSL 205
Query: 237 LDGA--APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGTGRISFGDKGSPGQGE 293
+ A +G+ G G +SV S LA G + FS C S G G S G P
Sbjct: 206 SSNSDEALDGIIGFGQANSSVLSQLAASGKVKRIFSHCLDSHHGGGIFSIGQVMEPKFNT 265
Query: 294 TPFSLRQTHPTYNITITQVSVGGNAVNFEF---------SAIFDSGTSFTYLNDPAYTQI 344
TP R H YN+ + + V G + I DSGT+ YL Y Q+
Sbjct: 266 TPLVPRMAH--YNVILKDMDVDGEPILLPLYLFDSGSGRGTIIDSGTTLAYLPLSIYNQL 323
>gi|255556768|ref|XP_002519417.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223541280|gb|EEF42831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 494
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 102/345 (29%), Positives = 162/345 (46%), Gaps = 32/345 (9%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L++T V +G P F V +DTGSD+ W+ C C +C +SG I N + +SST
Sbjct: 80 LYFTRVKLGTPPREFNVQIDTGSDVLWVTCSSCSNCPQ----TSGLGIQLNYFDTTSSST 135
Query: 163 SSKVPCNSTLCELQ-----KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
+ VPC+ +C Q QCP + C Y +Y DG+ ++G+ V D + +S
Sbjct: 136 ARLVPCSHPICTSQIQTTATQCPPQSNQCSYAFQY-GDGSGTSGYYVSDTFYFDAVLGES 194
Query: 218 KSVDSR--ISFGCGRVQTGSFLD-GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF- 273
+S I FGC Q+G A +G+FG G + SV S L++ G+ P FS C
Sbjct: 195 LIANSSAAIVFGCSTYQSGDLTKTDKAVDGIFGFGQGELSVISQLSSHGITPRVFSHCLK 254
Query: 274 GSD-GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA------- 325
G D G G + G+ PG +P L + P YN+ + ++V G + + +A
Sbjct: 255 GEDSGGGILVLGEILEPGIVYSP--LVPSQPHYNLDLQSIAVSGQLLPIDPAAFATSSNR 312
Query: 326 --IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
I D+GT+ YL + AY + A + T T + CY++S N + +P V
Sbjct: 313 GTIIDTGTTLAYLVEEAYDPFVSAITA-AVSQLATPTIN-KGNQCYLVS-NSVSEVFPPV 369
Query: 384 NLTMKGGGPFFVN-DPIVIVSSEPKGLYLYCLGVVK-SDNVNIIG 426
+ GG + + ++ + G L+C+G K + I+G
Sbjct: 370 SFNFAGGATMLLKPEEYLMYLTNYAGAALWCIGFQKIQGGITILG 414
>gi|297720449|ref|NP_001172586.1| Os01g0776900 [Oryza sativa Japonica Group]
gi|255673740|dbj|BAH91316.1| Os01g0776900 [Oryza sativa Japonica Group]
Length = 381
Score = 122 bits (306), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 91/259 (35%), Positives = 128/259 (49%), Gaps = 28/259 (10%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L++T V +G P + V +DTGSD+ W+ C C C SSSG I ++P+TSST
Sbjct: 90 LYFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGC----PSSSGLNIQLEFFNPDTSST 145
Query: 163 SSKVPCNSTLCELQKQCPSA------GSNCPYQVRYLSDGTMSTGFLVEDVLHLAT--DE 214
SSK+PC+ C Q A S C Y Y DG+ ++G+ V D ++ T
Sbjct: 146 SSKIPCSDDRCTAALQTSEAVCQTSDNSPCGYTFTY-GDGSGTSGYYVSDTMYFDTVMGN 204
Query: 215 KQSKSVDSRISFGCGRVQTGSFLD-GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
+Q+ + + I FGC Q+G A +G+FG G + SV S L + G+ P FS C
Sbjct: 205 EQTANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCL 264
Query: 274 -GSD-GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS------- 324
GSD G G + G+ PG TP L + P YN+ + + V G + + S
Sbjct: 265 KGSDNGGGILVLGEIVEPGLVYTP--LVPSQPHYNLNLESIVVNGQKLPIDSSLFTTSNT 322
Query: 325 --AIFDSGTSFTYLNDPAY 341
I DSGT+ YL D AY
Sbjct: 323 QGTIVDSGTTLAYLADGAY 341
>gi|356529585|ref|XP_003533370.1| PREDICTED: lysine-specific histone demethylase 1 homolog 1-like
[Glycine max]
Length = 1388
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 103/352 (29%), Positives = 159/352 (45%), Gaps = 54/352 (15%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
L++T + VG P S+ + +DTGSDL W+ CD C+SC G + +Y P S+
Sbjct: 191 LYFTILRVGNPPKSYFLDVDTGSDLTWMQCDAPCISCGKGAHV---------LYKPTRSN 241
Query: 162 TSSKVPCNSTLC-ELQKQCPSAGSN-----CPYQVRYLSDGTMSTGFLVEDVLHLATDEK 215
S V LC ++QK + + C Y+++Y +D + S G LV D LHL T
Sbjct: 242 VVSSV---DALCLDVQKNQKNGHHDESLLQCDYEIQY-ADHSSSLGVLVRDELHLVTTNG 297
Query: 216 QSKSVDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG 274
++ + FGCG Q G L+ +G+ GL K S+P LA++GLI N C
Sbjct: 298 SKTKLN--VVFGCGYDQAGLLLNTLGKTDGIMGLSRAKVSLPYQLASKGLIKNVVGHCLS 355
Query: 275 SDGT--GRISFGDKGSP--GQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA----- 325
+DG G + GD P G P + T Y I ++ G + F+ +
Sbjct: 356 NDGAGGGYMFLGDDFVPYWGMNWVPMAYTLTTDLYQTEILGINYGNRQLRFDGQSKVGKM 415
Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVN- 384
+FDSG+S+TY AY + + N ++ SD C+ Q NF V
Sbjct: 416 VFDSGSSYTYFPKEAYLDLVASLNEVSGLGLVQDDSDTTLPICW-----QANFPIKSVKD 470
Query: 385 -------LTMKGGGPFFVNDPIVIVSSEPKGLYL------YCLGVVKSDNVN 423
LT++ G +++ + +S P+G + CLG++ NVN
Sbjct: 471 VKDYFKTLTLRFGSKWWILSTLFQIS--PEGYLIISNKGHVCLGILDGSNVN 520
>gi|356505293|ref|XP_003521426.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 499
Score = 122 bits (305), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 104/356 (29%), Positives = 158/356 (44%), Gaps = 37/356 (10%)
Query: 62 ALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVA 121
AL RDR GR L + +D Y + L++T V +G PA F V
Sbjct: 46 ALRARDR--ARHGRILQGVVGGVVDFSVQGTSDPYFVG----LYFTKVKLGSPAKDFYVQ 99
Query: 122 LDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQ-- 178
+DTGSD+ W+ C C +C H SSG I+ + + SST++ V C +C Q
Sbjct: 100 IDTGSDILWINCITCSNCPH----SSGLGIELDFFDTAGSSTAALVSCADPICSYAVQTA 155
Query: 179 ---CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLAT---DEKQSKSVDSRISFGCGRVQ 232
C S + C Y +Y DG+ +TG+ V D ++ T + + S I FGC Q
Sbjct: 156 TSGCSSQANQCSYTFQY-GDGSGTTGYYVSDTMYFDTVLLGQSMVANSSSTIVFGCSTYQ 214
Query: 233 TGSFLD-GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--GSDGTGRISFGDKGSP 289
+G A +G+FG G SV S L+++G+ P FS C G +G G + G+ P
Sbjct: 215 SGDLTKTDKAVDGIFGFGPGALSVISQLSSRGVTPKVFSHCLKGGENGGGVLVLGEILEP 274
Query: 290 GQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA---------IFDSGTSFTYLNDPA 340
+P L + P YN+ + ++V G + + + I DSGT+ YL A
Sbjct: 275 SIVYSP--LVPSLPHYNLNLQSIAVNGQLLPIDSNVFATTNNQGTIVDSGTTLAYLVQEA 332
Query: 341 YTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVN 396
Y + + + + S CY++S N +P V+L GG +N
Sbjct: 333 YNPFVDAITAAVSQFSKPIISK--GNQCYLVS-NSVGDIFPQVSLNFMGGASMVLN 385
>gi|226508052|ref|NP_001150337.1| LOC100283967 precursor [Zea mays]
gi|195638522|gb|ACG38729.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 507
Score = 121 bits (303), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 107/373 (28%), Positives = 168/373 (45%), Gaps = 55/373 (14%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L++T V +G PA F V +DTGSD+ W+ C C C +SSG I ++P++SST
Sbjct: 88 LYFTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGC----PTSSGLNIQLESFNPDSSST 143
Query: 163 SSKVPCNSTLCELQKQCPSA--------GSNCPYQVRYLSDGTMSTGFLVEDVLHLAT-- 212
+S++ C+ C Q A S C Y Y DG+ ++G+ V D + T
Sbjct: 144 ASRITCSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTY-GDGSGTSGYYVSDTMFFETVM 202
Query: 213 DEKQSKSVDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSM 271
+Q+ + + I FGC Q+G A +G+FG G + SV S L + G+ P FS
Sbjct: 203 GNEQTANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSH 262
Query: 272 CF-GSD-GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS----- 324
C GSD G G + G+ PG TP L + P YN+ + ++V G + + S
Sbjct: 263 CLKGSDNGGGILVLGEIVEPGLVYTP--LVPSQPHYNLNLESIAVNGQKLPIDSSLFTTS 320
Query: 325 ----AIFDSGTSFTYLNDPAY--------TQISETFNSLAKEKRETSTSDLPFEYCYVLS 372
I DSGT+ YL D AY +S + SL + + C++ S
Sbjct: 321 NTQGTIVDSGTTLAYLADGAYDPFVSAIAAAVSPSVRSLVSKGSQ----------CFITS 370
Query: 373 PNQTNFEYPVVNLTMKGGGPFFVN-DPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREYPI 431
+ + +P V L GG V + ++ + L+C+G ++ G+E I
Sbjct: 371 -SSVDSSFPTVTLYFMGGVAMSVKPENYLLQQASVDNSVLWCIGWQRNQ-----GQEITI 424
Query: 432 ANNISLFHNCYSY 444
++ L + Y
Sbjct: 425 LGDLVLKDKIFVY 437
>gi|413952263|gb|AFW84912.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 509
Score = 120 bits (302), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 107/373 (28%), Positives = 168/373 (45%), Gaps = 55/373 (14%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L++T V +G PA F V +DTGSD+ W+ C C C +SSG I ++P++SST
Sbjct: 90 LYFTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGC----PTSSGLNIQLESFNPDSSST 145
Query: 163 SSKVPCNSTLCELQKQCPSA--------GSNCPYQVRYLSDGTMSTGFLVEDVLHLAT-- 212
+S++ C+ C Q A S C Y Y DG+ ++G+ V D + T
Sbjct: 146 ASRITCSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTY-GDGSGTSGYYVSDTMFFETVM 204
Query: 213 DEKQSKSVDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSM 271
+Q+ + + I FGC Q+G A +G+FG G + SV S L + G+ P FS
Sbjct: 205 GNEQTANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSH 264
Query: 272 CF-GSD-GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS----- 324
C GSD G G + G+ PG TP L + P YN+ + ++V G + + S
Sbjct: 265 CLKGSDNGGGILVLGEIVEPGLVYTP--LVPSQPHYNLNLESIAVNGQKLPIDSSLFTTS 322
Query: 325 ----AIFDSGTSFTYLNDPAY--------TQISETFNSLAKEKRETSTSDLPFEYCYVLS 372
I DSGT+ YL D AY +S + SL + + C++ S
Sbjct: 323 NTQGTIVDSGTTLAYLADGAYDPFVSAIAAAVSPSVRSLVSKGSQ----------CFITS 372
Query: 373 PNQTNFEYPVVNLTMKGGGPFFVN-DPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREYPI 431
+ + +P V L GG V + ++ + L+C+G ++ G+E I
Sbjct: 373 -SSVDSSFPTVTLYFMGGVAMSVKPENYLLQQASVDNSVLWCIGWQRNQ-----GQEITI 426
Query: 432 ANNISLFHNCYSY 444
++ L + Y
Sbjct: 427 LGDLVLKDKIFVY 439
>gi|194707632|gb|ACF87900.1| unknown [Zea mays]
Length = 423
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 107/373 (28%), Positives = 170/373 (45%), Gaps = 55/373 (14%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L++T V +G PA F V +DTGSD+ W+ C C C +SSG I ++P++SST
Sbjct: 4 LYFTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGC----PTSSGLNIQLESFNPDSSST 59
Query: 163 SSKVPCNSTLCELQKQ-----CPSAGSN---CPYQVRYLSDGTMSTGFLVEDVLHLAT-- 212
+S++ C+ C Q C ++ S C Y Y DG+ ++G+ V D + T
Sbjct: 60 ASRITCSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTY-GDGSGTSGYYVSDTMFFETVM 118
Query: 213 DEKQSKSVDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSM 271
+Q+ + + I FGC Q+G A +G+FG G + SV S L + G+ P FS
Sbjct: 119 GNEQTANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSH 178
Query: 272 CF-GSD-GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS----- 324
C GSD G G + G+ PG TP L + P YN+ + ++V G + + S
Sbjct: 179 CLKGSDNGGGILVLGEIVEPGLVYTP--LVPSQPHYNLNLESIAVNGQKLPIDSSLFTTS 236
Query: 325 ----AIFDSGTSFTYLNDPAY--------TQISETFNSLAKEKRETSTSDLPFEYCYVLS 372
I DSGT+ YL D AY +S + SL + + C++ S
Sbjct: 237 NTQGTIVDSGTTLAYLADGAYDPFVSAIAAAVSPSVRSLVSKGSQ----------CFITS 286
Query: 373 PNQTNFEYPVVNLTMKGGGPFFVN-DPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREYPI 431
+ + +P V L GG V + ++ + L+C+G ++ G+E I
Sbjct: 287 -SSVDSSFPTVTLYFMGGVAMSVKPENYLLQQASVDNSVLWCIGWQRNQ-----GQEITI 340
Query: 432 ANNISLFHNCYSY 444
++ L + Y
Sbjct: 341 LGDLVLKDKIFVY 353
>gi|449442641|ref|XP_004139089.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 478
Score = 120 bits (301), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 114/382 (29%), Positives = 165/382 (43%), Gaps = 54/382 (14%)
Query: 10 VCVLLILLSCCAGCCFGFGTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRY 69
V V L+LLS C GF F+ H++ KG +AL D
Sbjct: 7 VLVGLLLLSFCLP---GFCNLVFEVQHKF---------------KGRERSLNALKSHD-- 46
Query: 70 FRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLF 129
R GR L+ + G + + + L+Y + +G P F V +DTGSD+
Sbjct: 47 VRRHGRLLSV-------IDLELGGNGHPAET--GLYYARIGIGSPPNDFHVQVDTGSDIL 97
Query: 130 WLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSN---- 185
W+ +CV C + S V D +Y+P +SSTS+ + C+ C P G
Sbjct: 98 WV--NCVGCSNCPKKSDIGV-DLQLYNPKSSSTSTLITCDQPFCSATYDAPIPGCKPDLL 154
Query: 186 CPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQSKSVDSRISFGCGRVQTGSFLDGA-AP 242
C Y+V Y DG+ + G+ V D + L A ++ + I FGCG Q+G + A
Sbjct: 155 CQYKVIY-GDGSATAGYFVNDYIQLQRAVGNHKTSETNGSIVFGCGAKQSGELGSSSEAL 213
Query: 243 NGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGTGRISFGDKGSPGQGETPFSLRQT 301
+G+ G G +S+ S LA G + F+ C S G G + G+ P TP Q
Sbjct: 214 DGILGFGQANSSMISQLAATGKVKKIFAHCLDSISGGGIFAIGEVVEPKLKTTPVVPNQA 273
Query: 302 HPTYNITITQVSVGGNAVN-----FEFS----AIFDSGTSFTYLNDPAYTQISETFNSLA 352
H YN+ + V VG A++ FE S AI DSGT+ YL D Y + E A
Sbjct: 274 H--YNVVLNGVKVGDTALDLPLGLFETSYKRGAIIDSGTTLAYLPDSIYLPLMEKILG-A 330
Query: 353 KEKRETSTSDLPFEYCYVLSPN 374
+ + T D F C+V N
Sbjct: 331 QPDLKLRTVDDQFT-CFVFDKN 351
>gi|4415912|gb|AAD20143.1| putative protease [Arabidopsis thaliana]
Length = 469
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 121/428 (28%), Positives = 193/428 (45%), Gaps = 57/428 (13%)
Query: 41 PVKGILAVDDLPKKGSFAYYSALAHRDRYFRLR-----GRGLAAQGNDKTPLTFSAGNDT 95
P++ +D+L + S L RDR R GR + G P+ S+ D
Sbjct: 43 PLQRAFPLDELVE------LSELRARDRVRHARILLGGGRQSSVGGVVDFPVQGSS--DP 94
Query: 96 YRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNI 154
Y + L++T V +G P F V +DTGSD+ W+ C C +C H SSG ID +
Sbjct: 95 YLVG----LYFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPH----SSGLGIDLHF 146
Query: 155 YSPNTSSTSSKVPCNSTLCE-----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLH 209
+ S T+ V C+ +C QC S + C Y RY DG+ ++G+ + D +
Sbjct: 147 FDAPGSLTAGSVTCSDPICSSVFQTTAAQC-SENNQCGYSFRY-GDGSGTSGYYMTDTFY 204
Query: 210 LATDEKQSKSVDSR--ISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIP 266
+S +S I FGC Q+G A +G+FG G K SV S L+++G+ P
Sbjct: 205 FDAILGESLVANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITP 264
Query: 267 NSFSMCFGSDGTGRISF--GDKGSPGQGETPFSLRQTHPTYNITITQVSVGG-----NAV 319
FS C DG+G F G+ PG +P L + P YN+ + + V G +A
Sbjct: 265 PVFSHCLKGDGSGGGVFVLGEILVPGMVYSP--LVPSQPHYNLNLLSIGVNGQMLPLDAA 322
Query: 320 NFEFS----AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP-FEYCYVLSPN 374
FE S I D+GT+ TYL AY N+++ + T + E CY++S +
Sbjct: 323 VFEASNTRGTIVDTGTTLTYLVKEAYDLF---LNAISNSVSQLVTPIISNGEQCYLVSTS 379
Query: 375 QTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLY----LYCLGVVKSDNVNIIGREYP 430
++ +P V+L GG + + G+Y ++C+G K+ I +
Sbjct: 380 ISDM-FPSVSLNFAGGASMMLRPQDYLFH---YGIYDGASMWCIGFQKAPEEQTILGDLV 435
Query: 431 IANNISLF 438
+ + + ++
Sbjct: 436 LKDKVFVY 443
>gi|42571079|ref|NP_973613.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|110737616|dbj|BAF00749.1| putative protease [Arabidopsis thaliana]
gi|330254187|gb|AEC09281.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 507
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 115/382 (30%), Positives = 175/382 (45%), Gaps = 50/382 (13%)
Query: 41 PVKGILAVDDLPKKGSFAYYSALAHRDRYFRLR-----GRGLAAQGNDKTPLTFSAGNDT 95
P++ +D+L + S L RDR R GR + G P+ S+ D
Sbjct: 43 PLQRAFPLDELVE------LSELRARDRVRHARILLGGGRQSSVGGVVDFPVQGSS--DP 94
Query: 96 YRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNI 154
Y + L++T V +G P F V +DTGSD+ W+ C C +C H SSG ID +
Sbjct: 95 YLVG----LYFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPH----SSGLGIDLHF 146
Query: 155 YSPNTSSTSSKVPCNSTLCE-----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLH 209
+ S T+ V C+ +C QC S + C Y RY DG+ ++G+ + D +
Sbjct: 147 FDAPGSLTAGSVTCSDPICSSVFQTTAAQC-SENNQCGYSFRY-GDGSGTSGYYMTDTFY 204
Query: 210 LATDEKQSKSVDSR--ISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIP 266
+S +S I FGC Q+G A +G+FG G K SV S L+++G+ P
Sbjct: 205 FDAILGESLVANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITP 264
Query: 267 NSFSMCFGSDGTGRISF--GDKGSPGQGETPFSLRQTHPTYNITITQVSVGG-----NAV 319
FS C DG+G F G+ PG +P L + P YN+ + + V G +A
Sbjct: 265 PVFSHCLKGDGSGGGVFVLGEILVPGMVYSP--LVPSQPHYNLNLLSIGVNGQMLPLDAA 322
Query: 320 NFEFS----AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP-FEYCYVLSPN 374
FE S I D+GT+ TYL AY N+++ + T + E CY++S +
Sbjct: 323 VFEASNTRGTIVDTGTTLTYLVKEAYDLF---LNAISNSVSQLVTPIISNGEQCYLVSTS 379
Query: 375 QTNFEYPVVNLTMKGGGPFFVN 396
++ +P V+L GG +
Sbjct: 380 ISDM-FPSVSLNFAGGASMMLR 400
>gi|297827153|ref|XP_002881459.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297327298|gb|EFH57718.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 507
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 112/362 (30%), Positives = 167/362 (46%), Gaps = 44/362 (12%)
Query: 61 SALAHRDRYFRLR-----GRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPA 115
S L RDR R GR + G P+ S+ D Y + L++T V +G P
Sbjct: 57 SELRARDRVRHARILLGGGRQSSVGGVVDFPVQGSS--DPYLVG----LYFTKVKLGSPP 110
Query: 116 LSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE 174
F V +DTGSD+ W+ C C +C H SSG ID + + S T+ V C+ +C
Sbjct: 111 TEFNVQIDTGSDILWVTCSSCSNCPH----SSGLGIDLHFFDAPGSFTAGSVTCSDPICS 166
Query: 175 -----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR--ISFG 227
QC S + C Y RY DG+ ++G+ + D + +S +S I FG
Sbjct: 167 SVFQTTAAQC-SENNQCGYSFRY-GDGSGTSGYYMTDTFYFDAILGESLVANSSAPIVFG 224
Query: 228 CGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISF--G 284
C Q+G A +G+FG G K SV S L+++G+ P FS C DG+G F G
Sbjct: 225 CSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGDGSGGGVFVLG 284
Query: 285 DKGSPGQGETPFSLRQTHPTYNITITQVSVGG-----NAVNFEFS----AIFDSGTSFTY 335
+ PG +P L + P YN+ + + V G +A FE S I D+GT+ TY
Sbjct: 285 EILVPGMVYSP--LLPSQPHYNLNLLSIGVNGQILPIDAAVFEASNTRGTIVDTGTTLTY 342
Query: 336 LNDPAYTQISETFNSLAKEKRETSTSDLP-FEYCYVLSPNQTNFEYPVVNLTMKGGGPFF 394
L AY N+++ + T + E CY++S + ++ +P V+L GG
Sbjct: 343 LVKEAYDPF---LNAISNSVSQLVTLIISNGEQCYLVSTSISDM-FPPVSLNFAGGASMM 398
Query: 395 VN 396
+
Sbjct: 399 LR 400
>gi|449451076|ref|XP_004143288.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 488
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 103/351 (29%), Positives = 158/351 (45%), Gaps = 46/351 (13%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWL---PCDCVSCVHGLNSSSGQVIDFNIYSPNTS 160
L++T V +G PA F V +DTGSD+ W+ PCD G SSG I+ N++ S
Sbjct: 83 LYFTKVKLGNPAREFNVQIDTGSDILWVTCSPCD------GCPDSSGLGIELNLFDTTKS 136
Query: 161 STSSKVPCNSTLCEL----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLH--LATDE 214
S++ +PC +C QC + +C Y Y D + ++GF V D +H + E
Sbjct: 137 SSARVLPCTDPICAAVSTTTDQCLTQTDHCSYSFHY-RDRSGTSGFYVTDSMHFDILLGE 195
Query: 215 KQSKSVDSRISFGCGRVQTGSFLDGA-APNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
+ + I FGC Q G A +G+FG G + SV S L+++G+ P FS C
Sbjct: 196 STIANSSATIVFGCSIYQYGDLTRATKALDGIFGFGQGEFSVISQLSSRGITPKVFSHCL 255
Query: 274 --GSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGG----NAVNFEFS--- 324
G +G G + G+ P +P L + P Y + + +++ G N F S
Sbjct: 256 KGGENGGGILVLGEILEPSIVYSP--LIPSQPHYTLKLQSIALSGQLFPNPTMFPISNAG 313
Query: 325 -AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
I DSGT+ YL + Y I S + + S C+ +S + + +PV+
Sbjct: 314 ETIIDSGTTLAYLVEEVYDWIVSVITSAVSQSATPTISR--GSQCFRVSMSVADI-FPVL 370
Query: 384 NLTMKGGGPFFVN-------DPIVIVSSEPKGLYLYCLGVVKS-DNVNIIG 426
+G V D IV EP L+C+G K+ D +NI+G
Sbjct: 371 RFNFEGIASMVVTPEEYLQFDSIV---REPA---LWCIGFQKAEDGLNILG 415
>gi|449476186|ref|XP_004154665.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
2-like [Cucumis sativus]
Length = 478
Score = 119 bits (297), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 113/382 (29%), Positives = 165/382 (43%), Gaps = 54/382 (14%)
Query: 10 VCVLLILLSCCAGCCFGFGTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRY 69
V V L+LLS C GF F+ H++ KG +AL D
Sbjct: 7 VLVGLLLLSFCLP---GFCNLVFEVQHKF---------------KGRERSLNALKSHD-- 46
Query: 70 FRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLF 129
R GR L+ + G + + + L+Y + +G P F V +DTGSD+
Sbjct: 47 VRRHGRLLSV-------IDLELGGNGHPAET--GLYYARIGIGSPPNDFHVQVDTGSDIL 97
Query: 130 WLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSN---- 185
W+ +CV C + S V D +Y+P +SSTS+ + C+ C P G
Sbjct: 98 WV--NCVGCSNCPKKSDIGV-DLQLYNPKSSSTSTLITCDQPFCSATYDAPIPGCKPDLL 154
Query: 186 CPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQSKSVDSRISFGCGRVQTGSFLDGA-AP 242
C Y+V Y DG+ + G+ V D + L A ++ + I FGCG Q+G + A
Sbjct: 155 CQYKVIY-GDGSATAGYFVNDYIQLQRAVGNHKTSETNGSIVFGCGAKQSGELGSSSEAL 213
Query: 243 NGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGTGRISFGDKGSPGQGETPFSLRQT 301
+G+ G G +S+ S LA G + F+ C S G G + G+ P TP Q
Sbjct: 214 DGILGFGQANSSMISQLAATGKVKKIFAHCLDSISGGGIFAIGEVVEPKLXNTPVVPNQA 273
Query: 302 HPTYNITITQVSVGGNAVN-----FEFS----AIFDSGTSFTYLNDPAYTQISETFNSLA 352
H YN+ + V VG A++ FE S AI DSGT+ YL + Y + E A
Sbjct: 274 H--YNVVLNGVKVGDTALDLPLGLFETSYKRGAIIDSGTTLAYLPESIYLPLMEKILG-A 330
Query: 353 KEKRETSTSDLPFEYCYVLSPN 374
+ + T D F C+V N
Sbjct: 331 QPDLKLRTVDDQFT-CFVFDKN 351
>gi|357510893|ref|XP_003625735.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355500750|gb|AES81953.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 535
Score = 119 bits (297), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 95/301 (31%), Positives = 139/301 (46%), Gaps = 32/301 (10%)
Query: 61 SALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIV 120
+ L RDR R GR L G + +D Y + L++T V +G PA F V
Sbjct: 32 TTLKARDRA-RHGGRILQDGGGGILDFSVQGTSDPYLVG----LYFTKVKMGSPAKEFYV 86
Query: 121 ALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQ--- 176
+DTGSD+ WL C+ C +C SSG ID N + +SST++ V C+ +C
Sbjct: 87 QIDTGSDILWLNCNTCNNC----PKSSGLGIDLNYFDTASSSTAALVSCSDPVCSYAVQT 142
Query: 177 --KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS--KSVDSRISFGCGRVQ 232
QC S + C Y +Y DG+ ++G+ V D ++ QS + S + FGC Q
Sbjct: 143 ATSQCSSQANQCSYTFQY-GDGSGTSGYYVYDAMYFDVIMGQSVFSNSSSTVVFGCSTYQ 201
Query: 233 TGSFL-DGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT--GRISFGDKGSP 289
+G A +G+FG G SV S +++QG+ P FS C G+ G + G+ P
Sbjct: 202 SGDLARTEKAVDGIFGFGPGALSVVSQVSSQGMAPKVFSHCLKGQGSGGGILVLGEILEP 261
Query: 290 GQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA---------IFDSGTSFTYLNDPA 340
TP Q H YN+ + ++V G + + I DSGT+ YL A
Sbjct: 262 NIVYTPLVPLQPH--YNLNLQSIAVNGQILPIDQDVFATGNNRGTIVDSGTTLAYLVQEA 319
Query: 341 Y 341
Y
Sbjct: 320 Y 320
>gi|356542694|ref|XP_003539801.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 489
Score = 119 bits (297), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 98/311 (31%), Positives = 146/311 (46%), Gaps = 27/311 (8%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
L+YT V +G P F V +DTGSD+ W+ C SC +G +SG I N + P +SSTS
Sbjct: 76 LYYTKVKLGTPPREFYVQIDTGSDVLWVSCG--SC-NGCPQTSGLQIQLNYFDPRSSSTS 132
Query: 164 SKVP-----CNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
S + C S + C S + C Y +Y DG+ ++G+ V D++H A + +
Sbjct: 133 SLISCSDRRCRSGVQTSDASCSSQNNQCTYTFQY-GDGSGTSGYYVSDLMHFAGIFEGTL 191
Query: 219 SVDSRIS--FGCGRVQTGSFLDGA-APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
+ +S S FGC +QTG A +G+FG G SV S L+ QG+ P FS C
Sbjct: 192 TTNSSASVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSLQGIAPRVFSHCLKG 251
Query: 276 D--GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA-------- 325
D G G + G+ P +P L Q+ P YN+ + +SV G V +
Sbjct: 252 DNSGGGVLVLGEIVEPNIVYSP--LVQSQPHYNLNLQSISVNGQIVPIAPAVFATSNNRG 309
Query: 326 -IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVN 384
I DSGT+ YL + AY +L + + S CY+++ + +P V+
Sbjct: 310 TIVDSGTTLAYLAEEAYNPFVNAITALVPQSVRSVLSR--GNQCYLITTSSNVDIFPQVS 367
Query: 385 LTMKGGGPFFV 395
L GG +
Sbjct: 368 LNFAGGASLVL 378
>gi|115457778|ref|NP_001052489.1| Os04g0337000 [Oryza sativa Japonica Group]
gi|113564060|dbj|BAF14403.1| Os04g0337000, partial [Oryza sativa Japonica Group]
Length = 321
Score = 119 bits (297), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 83/259 (32%), Positives = 127/259 (49%), Gaps = 25/259 (9%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
L+YT + +G P + V +DTGSD+ W+ +C+SC SG ++ +Y P SST
Sbjct: 32 LYYTEIGIGTPTKRYYVQVDTGSDILWV--NCISCDR-CPRKSGLGLELTLYDPKDSSTG 88
Query: 164 SKVPCNSTLCE-----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQ 216
SKV C+ C L C ++ C Y V Y DG+ +TG+ V D+L + + Q
Sbjct: 89 SKVSCDQGFCAATYGGLLPGCTTS-LPCEYSVTY-GDGSSTTGYFVSDLLQFDQVSGDGQ 146
Query: 217 SKSVDSRISFGCGRVQTGSF-LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
++ +S ++FGCG Q G A +G+ G G TS+ S L+ G + F+ C +
Sbjct: 147 TRPANSTVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCLDT 206
Query: 276 -DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF---------EFSA 325
+G G + G+ P TP L P YN+ + + VGG A+ +
Sbjct: 207 INGGGIFAIGNVVQPKVKTTP--LVPNMPHYNVNLKSIDVGGTALKLPSHMFDTGEKKGT 264
Query: 326 IFDSGTSFTYLNDPAYTQI 344
I DSGT+ TYL + Y +I
Sbjct: 265 IIDSGTTLTYLPEIVYKEI 283
>gi|38344991|emb|CAE01597.2| OSJNBa0008A08.5 [Oryza sativa Japonica Group]
gi|116309515|emb|CAH66581.1| OSIGBa0137O04.7 [Oryza sativa Indica Group]
gi|222628622|gb|EEE60754.1| hypothetical protein OsJ_14310 [Oryza sativa Japonica Group]
Length = 494
Score = 118 bits (296), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 87/273 (31%), Positives = 131/273 (47%), Gaps = 27/273 (9%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
L+YT + +G P + V +DTGSD+ W+ +C+SC SG ++ +Y P SST
Sbjct: 88 LYYTEIGIGTPTKRYYVQVDTGSDILWV--NCISCDR-CPRKSGLGLELTLYDPKDSSTG 144
Query: 164 SKVPCNSTLCE-----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQ 216
SKV C+ C L C ++ C Y V Y DG+ +TG+ V D+L + + Q
Sbjct: 145 SKVSCDQGFCAATYGGLLPGCTTS-LPCEYSVTY-GDGSSTTGYFVSDLLQFDQVSGDGQ 202
Query: 217 SKSVDSRISFGCGRVQTGSF-LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
++ +S ++FGCG Q G A +G+ G G TS+ S L+ G + F+ C +
Sbjct: 203 TRPANSTVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCLDT 262
Query: 276 -DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF---------EFSA 325
+G G + G+ P TP L P YN+ + + VGG A+ +
Sbjct: 263 INGGGIFAIGNVVQPKVKTTP--LVPNMPHYNVNLKSIDVGGTALKLPSHMFDTGEKKGT 320
Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRET 358
I DSGT+ TYL + Y +I AK K T
Sbjct: 321 IIDSGTTLTYLPEIVYKEI--MLAVFAKHKDIT 351
>gi|218194599|gb|EEC77026.1| hypothetical protein OsI_15382 [Oryza sativa Indica Group]
Length = 409
Score = 118 bits (296), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 87/273 (31%), Positives = 131/273 (47%), Gaps = 27/273 (9%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
L+YT + +G P + V +DTGSD+ W+ +C+SC SG ++ +Y P SST
Sbjct: 3 LYYTEIGIGTPTKRYYVQVDTGSDILWV--NCISCDR-CPRKSGLGLELTLYDPKDSSTG 59
Query: 164 SKVPCNSTLCE-----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQ 216
SKV C+ C L C ++ C Y V Y DG+ +TG+ V D+L + + Q
Sbjct: 60 SKVSCDQGFCAATYGGLLPGCTTS-LPCEYSVTY-GDGSSTTGYFVSDLLQFDQVSGDGQ 117
Query: 217 SKSVDSRISFGCGRVQTGSF-LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
++ +S ++FGCG Q G A +G+ G G TS+ S L+ G + F+ C +
Sbjct: 118 TRPANSTVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCLDT 177
Query: 276 -DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF---------EFSA 325
+G G + G+ P TP L P YN+ + + VGG A+ +
Sbjct: 178 INGGGIFAIGNVVQPKVKTTP--LVPNMPHYNVNLKSIDVGGTALKLPSHMFDTGEKKGT 235
Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRET 358
I DSGT+ TYL + Y +I AK K T
Sbjct: 236 IIDSGTTLTYLPEIVYKEI--MLAVFAKHKDIT 266
>gi|449482385|ref|XP_004156266.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 491
Score = 117 bits (294), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 103/351 (29%), Positives = 158/351 (45%), Gaps = 43/351 (12%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWL---PCDCVSCVHGLNSSSGQVIDFNIYSPNTS 160
L++T V +G PA F V +DTGSD+ W+ PCD G SSG I+ N++ S
Sbjct: 83 LYFTKVKLGNPAREFNVQIDTGSDILWVTCSPCD------GCPDSSGLGIELNLFDTTKS 136
Query: 161 STSSKVPCNSTLCEL----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLH--LATDE 214
S++ +PC +C QC + +C Y Y D + ++GF V D +H + E
Sbjct: 137 SSARVLPCTDPICAAVSTTTDQCLTQTDHCSYSFHY-RDRSGTSGFYVTDSMHFDILLGE 195
Query: 215 KQSKSVDSRISFGCGRVQTGSFLDGA-APNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
+ + I FGC Q G A +G+FG G + SV S L+++G+ P FS C
Sbjct: 196 STIANSSATIVFGCSIYQYGDLTRATKALDGIFGFGQGEFSVISQLSSRGITPKVFSHCL 255
Query: 274 --GSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGG----NAVNFEFS--- 324
G +G G + G+ P +P L + P Y + + +++ G N F S
Sbjct: 256 KGGENGGGILVLGEILEPSIVYSP--LIPSQPHYTLKLQSIALSGQLFPNPTMFPISNAG 313
Query: 325 -AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
I DSGT+ YL + Y I S + + S C+ +S + + +PV+
Sbjct: 314 ETIIDSGTTLAYLVEEVYDWIVSVITSAVSQSATPTISR--GSQCFRVSMSVADI-FPVL 370
Query: 384 NLTMKGGGPFFVN-------DPIVIVSSEPKGLYLYCLGVVKS-DNVNIIG 426
+G V D IV S K L+C+G K+ D +NI+G
Sbjct: 371 RFNFEGIASMVVTPEEYLQFDSIV---SCYKFASLWCIGFQKAEDGLNILG 418
>gi|293332531|ref|NP_001169558.1| uncharacterized protein LOC100383437 precursor [Zea mays]
gi|224030089|gb|ACN34120.1| unknown [Zea mays]
gi|413925069|gb|AFW65001.1| hypothetical protein ZEAMMB73_160528 [Zea mays]
Length = 491
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 83/259 (32%), Positives = 119/259 (45%), Gaps = 25/259 (9%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L+YT + +G P + V +DTGSD+ W+ C C C H SG +D +Y P SST
Sbjct: 85 LYYTEIKLGTPPKHYYVQVDTGSDILWVNCITCEQCPH----KSGLGLDLTLYDPKASST 140
Query: 163 SSKVPCNSTLCE--LQKQCPSAGSN--CPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQ 216
S V C+ C + P G+N C Y V Y DG+ + G V D L T + Q
Sbjct: 141 GSMVMCDQAFCAATFGGKLPKCGANVPCEYSVTY-GDGSSTIGSFVTDALQFDQVTRDGQ 199
Query: 217 SKSVDSRISFGCGRVQTGSF-LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
++ ++ + FGCG Q G A +G+ G G TS+ S L G + F+ C +
Sbjct: 200 TQPANASVIFGCGAQQGGDLGSSNQALDGILGFGEANTSMLSQLTTAGKVKKIFAHCLDT 259
Query: 276 -DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF---------EFSA 325
G G S GD P TP L P YN+ + + VGG + +
Sbjct: 260 IKGGGIFSIGDVVQPKVKTTP--LVADKPHYNVNLKTIDVGGTTLQLPAHIFEPGEKKGT 317
Query: 326 IFDSGTSFTYLNDPAYTQI 344
I DSGT+ TYL + + ++
Sbjct: 318 IIDSGTTLTYLPELVFKEV 336
>gi|302817726|ref|XP_002990538.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
gi|300141706|gb|EFJ08415.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
Length = 434
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 88/261 (33%), Positives = 124/261 (47%), Gaps = 30/261 (11%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L++T V +G P ++ + +DTGSDL W+ C C+ C + S I Y S++
Sbjct: 35 LYFTQVQLGTPPRTYNLQVDTGSDLLWVNCHPCIGC----PAFSDLKIPIVPYDVKASAS 90
Query: 163 SSKVPCNSTLCELQKQCPSAGSN----CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
SSKVPC+ C L Q +G N C Y +Y DG+ + G+LVEDVLH + +
Sbjct: 91 SSKVPCSDPSCTLITQISESGCNDQNQCGYSFQY-GDGSGTLGYLVEDVLHYMVNATAT- 148
Query: 219 SVDSRISFGCGRVQTGSF-LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--GS 275
+ FGCG Q+G A +G+ G G S S LA QG PN F+ C G
Sbjct: 149 -----VIFGCGFKQSGDLSTSERALDGIIGFGASDLSFNSQLAKQGKTPNVFAHCLDGGE 203
Query: 276 DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE---FS------AI 326
G G + G+ P TP +H YN+ + +SV + + FS I
Sbjct: 204 RGGGILVLGNVIEPDIQYTPLVPYMSH--YNVVLQSISVNNANLTIDPKLFSNDVMQGTI 261
Query: 327 FDSGTSFTYLNDPAYTQISET 347
FDSGT+ YL D AY ++
Sbjct: 262 FDSGTTLAYLPDEAYQAFTQA 282
>gi|356539352|ref|XP_003538162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 489
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 105/361 (29%), Positives = 163/361 (45%), Gaps = 33/361 (9%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
L+YT V +G P V +DTGSD+ W+ C SC +G +SG I N + P +SSTS
Sbjct: 76 LYYTKVKLGTPPRELYVQIDTGSDVLWVSCG--SC-NGCPQTSGLQIQLNYFDPGSSSTS 132
Query: 164 SKVPCNSTLCELQKQ-----CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
S + C C Q C + C Y +Y DG+ ++G+ V D++H A+ + +
Sbjct: 133 SLISCLDRRCRSGVQTSDASCSGRNNQCTYTFQY-GDGSGTSGYYVSDLMHFASIFEGTL 191
Query: 219 SVDSRIS--FGCGRVQTGSFLDGA-APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
+ +S S FGC +QTG A +G+FG G SV S L++QG+ P FS C
Sbjct: 192 TTNSSASVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSSQGIAPRVFSHCLKG 251
Query: 276 D--GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA-------- 325
D G G + G+ P +P L + P YN+ + +SV G V S
Sbjct: 252 DNSGGGVLVLGEIVEPNIVYSP--LVPSQPHYNLNLQSISVNGQIVRIAPSVFATSNNRG 309
Query: 326 -IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVN 384
I DSGT+ YL + AY ++ + + S CY+++ + +P V+
Sbjct: 310 TIVDSGTTLAYLAEEAYNPFVIAIAAVIPQSVRSVLSR--GNQCYLITTSSNVDIFPQVS 367
Query: 385 LTMKGGGPFFVNDPIVIVSSEPKGL-YLYCLGVVKSDNVNIIGREYPIANNISLFHNCYS 443
L GG + ++ G ++C+G K I G+ I ++ L +
Sbjct: 368 LNFAGGASLVLRPQDYLMQQNFIGEGSVWCIGFQK-----ISGQSITILGDLVLKDKIFV 422
Query: 444 Y 444
Y
Sbjct: 423 Y 423
>gi|242079765|ref|XP_002444651.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
gi|241941001|gb|EES14146.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
Length = 493
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 90/283 (31%), Positives = 126/283 (44%), Gaps = 26/283 (9%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L+YT V +G P F V +DTGSD+ W+ C C C H SG +D +Y P SST
Sbjct: 87 LYYTEVRLGTPPKRFYVQVDTGSDILWVNCITCDQCPH----KSGLGLDLTLYDPKASST 142
Query: 163 SSKVPCNSTLCE--LQKQCPSAGSN--CPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQ 216
S V C+ C + P +N C Y V Y DG+ + G V D L T + Q
Sbjct: 143 GSTVMCDQGFCADTFGGRLPKCSANVPCEYSVTY-GDGSSTVGSFVNDALQFDQVTGDGQ 201
Query: 217 SKSVDSRISFGCGRVQTGSFLDGA-APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
++ ++ + FGCG Q G + A +G+ G G TS+ S LA G + F+ C +
Sbjct: 202 TQPANASVIFGCGAQQGGDLGSSSQALDGILGFGEANTSMLSQLATAGKVKKIFAHCLDT 261
Query: 276 -DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF---------EFSA 325
G G + GD P TP L P YN+ + + VGG + +
Sbjct: 262 IKGGGIFAIGDVVQPKVKTTP--LVADKPHYNVNLKTIDVGGTTLELPADIFKPGEKRGT 319
Query: 326 IFDSGTSFTYLNDPAYTQIS-ETFNSLAKEKRETSTSDLPFEY 367
I DSGT+ TYL + + ++ FN L FEY
Sbjct: 320 IIDSGTTLTYLPELVFKKVMLAVFNKHQDITFHDVQDFLCFEY 362
>gi|110741881|dbj|BAE98882.1| predicted GPI-anchored protein [Arabidopsis thaliana]
Length = 313
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 75/222 (33%), Positives = 116/222 (52%), Gaps = 18/222 (8%)
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD 276
S SV +R+ GCG+ Q+G +LDG AP+GL GLG + SVPS L+ GL+ NSFS+CF +
Sbjct: 4 SSSVKARVVIGCGKKQSGDYLDGVAPDGLMGLGPAEISVPSFLSKAGLMRNSFSLCFDEE 63
Query: 277 GTGRISFGDKGSPGQGETPFSL--RQTHPTYNITITQVSVGGNAV-NFEFSAIFDSGTSF 333
+GRI FGD G Q TPF + Y + + +G + + F+ DSG SF
Sbjct: 64 DSGRIYFGDMGPSIQQSTPFLQLDNNKYSGYIVGVEACCIGNSCLKQTSFTTFIDSGQSF 123
Query: 334 TYLNDPAYTQISETFNSLAKEKRETSTSD----LPFEYCYVLSPNQTNFEYPVVNLTMKG 389
TYL + Y ++ +L ++ +TS + +EYCY S + P + L
Sbjct: 124 TYLPEEIYRKV-----ALEIDRHINATSKNFEGVSWEYCYESSAEP---KVPAIKLKFSH 175
Query: 390 GGPFFVNDPIVIVSSEPKGLYLYCLGVVKS--DNVNIIGREY 429
F ++ P+ + + +GL +CL + S + + IG+ Y
Sbjct: 176 NNTFVIHKPLFVF-QQSQGLVQFCLPISPSGQEGIGSIGQNY 216
>gi|356522749|ref|XP_003530008.1| PREDICTED: lysine-specific histone demethylase 1 homolog 1-like
[Glycine max]
Length = 1336
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 100/351 (28%), Positives = 156/351 (44%), Gaps = 52/351 (14%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
L++T + VG P S+ + +DTGSDL W+ CD C SC G + Y P S+
Sbjct: 193 LYFTILRVGNPPKSYFLDVDTGSDLTWMQCDAPCRSCGKGAHVQ---------YKPTRSN 243
Query: 162 TSSKVPCNSTLCELQKQCPSAGSN-----CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
S V +S ++QK + + C Y+++Y +D + S G LV D LHL T
Sbjct: 244 VVSSV--DSLCLDVQKNQKNGHHDESLLQCDYEIQY-ADHSSSLGVLVRDELHLVTTNGS 300
Query: 217 SKSVDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
++ + FGCG Q G L+ A +G+ GL K S+P LA++GLI N C +
Sbjct: 301 KTKLN--VVFGCGYDQEGLILNTLAKTDGIMGLSRAKVSLPYQLASKGLIKNVVGHCLSN 358
Query: 276 DGT--GRISFGDKGSP--GQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA-----I 326
DG G + GD P G P + T Y I ++ G + F+ +
Sbjct: 359 DGAGGGYMFLGDDFVPYWGMNWVPMAYTLTTDLYQTEILGINYGNRQLKFDGQSKVGKVF 418
Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVN-- 384
FDSG+S+TY AY + + N ++ SD C+ Q NF+ +
Sbjct: 419 FDSGSSYTYFPKEAYLDLVASLNEVSGLGLVQDDSDTTLPICW-----QANFQIRSIKDV 473
Query: 385 ------LTMKGGGPFFVNDPIVIVSSEPKGLYL------YCLGVVKSDNVN 423
LT++ G +++ + + P+G + CLG++ VN
Sbjct: 474 KDYFKTLTLRFGSKWWILSTLFQIP--PEGYLIISNKGHVCLGILDGSKVN 522
>gi|168063189|ref|XP_001783556.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664943|gb|EDQ51645.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 414
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 91/285 (31%), Positives = 127/285 (44%), Gaps = 34/285 (11%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
L+Y + +G PA + + +DTGSDL WL CD C SC G + +Y P +
Sbjct: 30 LYYMAMRIGNPAKLYYLDMDTGSDLTWLQCDAPCRSCAVGPH---------GLYDPKRAR 80
Query: 162 TSSKVPCNSTLC-ELQK----QCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
V C C ++Q+ C C Y+V Y+ DG+ + G LVED + L
Sbjct: 81 V---VDCRRPTCAQVQRGGQFTCSGDVRQCDYEVDYV-DGSSTMGILVEDTITLVL--TN 134
Query: 217 SKSVDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-- 273
+R GCG Q G+ A +G+ GL K S+PS LA +G+ N C
Sbjct: 135 GTRFQTRAVIGCGYDQQGTLAKAPAVTDGVIGLSSSKISLPSQLAAKGIANNVIGHCLAG 194
Query: 274 GSDGTGRISFGDKGSPGQG--ETPFSLRQTHPTYNITITQVSVGGNAVNFEFS------A 325
GS+G G + FGD P G TP R Y + + GG + E + A
Sbjct: 195 GSNGGGYLFFGDTLVPALGMTWTPMIGRPLVEGYQARLRSIKYGGEVLELEGTTDDVGGA 254
Query: 326 IFDSGTSFTYLNDPAYTQI-SETFNSLAKEKRETSTSDLPFEYCY 369
+FDSGTSFTYL AYT + S + E +D +C+
Sbjct: 255 MFDSGTSFTYLVPNAYTAVLSAVVRQAQRSGLERIKTDTTLPFCW 299
>gi|356546036|ref|XP_003541438.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 486
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 118/401 (29%), Positives = 190/401 (47%), Gaps = 43/401 (10%)
Query: 51 LPKKGSFAYYSALAHRDRYFRLRG-RGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNV 109
+P G +AL RDR R RG+A D FS T NS+G L+YT V
Sbjct: 30 IPPTGHRVEVAALKARDRARHARMLRGVAGGVVD-----FSV-QGTSDPNSVG-LYYTKV 82
Query: 110 SVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPC 168
+G P F V +DTGSD+ W+ C+ C +C SS I+ N + SST++ +PC
Sbjct: 83 KMGTPPKEFNVQIDTGSDILWVNCNTCSNCPQ----SSQLGIELNFFDTVGSSTAALIPC 138
Query: 169 NSTLCELQKQ-----CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
+ +C + Q C + C Y +Y DG+ ++G+ V D ++ + Q +V+S
Sbjct: 139 SDPICTSRVQGAAAECSPRVNQCSYTFQY-GDGSGTSGYYVSDAMYFSLIMGQPPAVNSS 197
Query: 224 --ISFGCGRVQTGSFLD-GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGT 278
I FGC Q+G A +G+FG G SV S L+++G+ P FS C DG
Sbjct: 198 ATIVFGCSISQSGDLTKTDKAVDGIFGFGPGPLSVVSQLSSRGITPKVFSHCLKGDGDGG 257
Query: 279 GRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGG-----NAVNFEFS-----AIFD 328
G + G+ P +P L + P YN+ + ++V G N F S I D
Sbjct: 258 GVLVLGEILEPSIVYSP--LVPSQPHYNLNLQSIAVNGQLLPINPAVFSISNNRGGTIVD 315
Query: 329 SGTSFTYLNDPAYTQISETFN-SLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTM 387
GT+ YL AY + N ++++ R+T++ CY++S + + +P V+L
Sbjct: 316 CGTTLAYLIQEAYDPLVTAINTAVSQSARQTNSKG---NQCYLVSTSIGDI-FPSVSLNF 371
Query: 388 KGGGPFFVN-DPIVIVSSEPKGLYLYCLGVVK-SDNVNIIG 426
+GG + + ++ + G ++C+G K + +I+G
Sbjct: 372 EGGASMVLKPEQYLMHNGYLDGAEMWCIGFQKFQEGASILG 412
>gi|302803839|ref|XP_002983672.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
gi|300148509|gb|EFJ15168.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
Length = 388
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 88/261 (33%), Positives = 123/261 (47%), Gaps = 30/261 (11%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L++T V +G P ++ + +DTGSDL W+ C C+ C + S I Y S++
Sbjct: 35 LYFTQVQLGTPPRTYNLQVDTGSDLLWVNCHPCIGC----PAFSDLKIPIVPYDVKASAS 90
Query: 163 SSKVPCNSTLCELQKQCPSAGSN----CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
SSKVPC+ C L Q +G N C Y +Y DG+ + G+LVEDVLH + +
Sbjct: 91 SSKVPCSDPSCTLITQISESGCNDQNQCGYSFQY-GDGSGTLGYLVEDVLHYMVNATAT- 148
Query: 219 SVDSRISFGCGRVQTGSF-LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--GS 275
+ FGCG Q+G A +G+ G G S S LA QG PN F+ C G
Sbjct: 149 -----VIFGCGFKQSGDLSTSERALDGIIGFGASDLSFNSQLAKQGKTPNVFAHCLDGGE 203
Query: 276 DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE---FS------AI 326
G G + G+ P TP H YN+ + +SV + + FS I
Sbjct: 204 RGGGILVLGNVIEPDIQYTPLVPYMYH--YNVVLQSISVNNANLTIDPKLFSNDVMQGTI 261
Query: 327 FDSGTSFTYLNDPAYTQISET 347
FDSGT+ YL D AY ++
Sbjct: 262 FDSGTTLAYLPDEAYQAFTQA 282
>gi|297805182|ref|XP_002870475.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316311|gb|EFH46734.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 480
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 109/374 (29%), Positives = 166/374 (44%), Gaps = 53/374 (14%)
Query: 28 GTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPL 87
G F F+ H+++ K + +L SF + LA+ D PL
Sbjct: 26 GNFVFNVTHKFAGKEK---QLSELKSHDSFRHARMLANID-----------------LPL 65
Query: 88 TFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSS 146
G D+ R +S+G L++T + +G P + V +DTGSD+ W+ C C C + +
Sbjct: 66 ----GGDS-RADSIG-LYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKC--PVKTDL 117
Query: 147 GQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSN--CPYQVRYLSDGTMSTGFLV 204
G I ++Y SSTS V C C Q + G+ C Y V Y DG+ S G V
Sbjct: 118 G--IPLSLYDSKASSTSKNVGCEDAFCSFIMQSETCGAKKPCSYHVVY-GDGSTSDGDFV 174
Query: 205 EDVLHL--ATDEKQSKSVDSRISFGCGRVQTGSF-LDGAAPNGLFGLGMDKTSVPSILAN 261
+D + L T ++ + + FGCG+ Q+G +A +G+ G G TSV S LA
Sbjct: 175 KDNITLDQVTGNLRTAPLAQEVVFGCGKNQSGQLGQTESAVDGIMGFGQSNTSVISQLAA 234
Query: 262 QGLIPNSFSMCFGS-DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVN 320
G + FS C + +G G + G+ SP TP Q H YN+ + + V G ++
Sbjct: 235 GGSVKRIFSHCLDNMNGGGIFAIGEVESPVVKTTPLVPNQVH--YNVILKGMDVDGEPID 292
Query: 321 F---------EFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVL 371
+ I DSGT+ YL Y + E AK++ + F C+
Sbjct: 293 LPPSLASTNGDGGTIIDSGTTLAYLPQNLYNSLIEKIT--AKQQVKLHMVQETFA-CFSF 349
Query: 372 SPNQTNFEYPVVNL 385
+ N T+ +PVVNL
Sbjct: 350 TSN-TDKAFPVVNL 362
>gi|357148754|ref|XP_003574882.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 488
Score = 115 bits (288), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 97/316 (30%), Positives = 137/316 (43%), Gaps = 42/316 (13%)
Query: 51 LPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGF-----LH 105
P+ GS AH RGR LAA PL LG L+
Sbjct: 38 FPRLGSKGGGDITAHLTHDSNRRGRLLAAA---DVPL-----------GGLGLPTDTGLY 83
Query: 106 YTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
YT + +G P + V +DTGSD+ W+ +C+SC + S ID +Y P SS+ S
Sbjct: 84 YTEIEIGTPPKQYHVQVDTGSDILWV--NCISC-NKCPRKSDLGIDLRLYDPKGSSSGST 140
Query: 166 VPCNSTLCELQ--KQCPSAGSN--CPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQSKS 219
V C+ C + P N C Y V Y DG+ +TG+ V D L + + Q++
Sbjct: 141 VSCDQKFCAATYGGKLPGCAKNIPCEYSVMY-GDGSSTTGYFVSDSLQYNQVSGDGQTRH 199
Query: 220 VDSRISFGCGRVQTGSF-LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DG 277
++ + FGCG Q G A +G+ G G TS+ S LA G + FS C + G
Sbjct: 200 ANASVIFGCGAQQGGDLGSTNQALDGIIGFGQSNTSMLSQLAAAGEVKKIFSHCLDTIKG 259
Query: 278 TGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF---------EFSAIFD 328
G + GD P TP L P YN+ + ++VGG + + I D
Sbjct: 260 GGIFAIGDVVQPKVKSTP--LVPDMPHYNVNLESINVGGTTLQLPSHMFETGEKKGTIID 317
Query: 329 SGTSFTYLNDPAYTQI 344
SGT+ TYL + Y +
Sbjct: 318 SGTTLTYLPELVYKDV 333
>gi|357133002|ref|XP_003568117.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 497
Score = 115 bits (287), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 92/283 (32%), Positives = 130/283 (45%), Gaps = 26/283 (9%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+YT + +G P F V +DTGSD+ W+ +CVSC + SG ID +Y P SS+ S
Sbjct: 87 YYTKIEIGTPPKPFHVQVDTGSDILWV--NCVSC-DKCPTKSGLGIDLALYDPKGSSSGS 143
Query: 165 KVPCNSTLCELQ----KQCP--SAGSNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQ 216
V C++ C ++ P +AG C Y+ Y DG+ + G V D L + Q
Sbjct: 144 AVSCDNKFCAATYGSGEKLPGCTAGKPCEYRAEY-GDGSSTAGSFVSDSLQYNQLSGNAQ 202
Query: 217 SKSVDSRISFGCGRVQTGSFLD-GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
++ + + FGCG Q G A +G+ G G TS S LA+ G + FS C +
Sbjct: 203 TRHAKANVIFGCGAQQGGDLESTNQALDGIIGFGQSNTSTLSQLASAGEVKKIFSHCLDT 262
Query: 276 -DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVN-----FEFS----A 325
G G + G+ P TP +H YN+ + + V GNA+ FE S
Sbjct: 263 IKGGGIFAIGEVVQPKVKSTPLLPNMSH--YNVNLQSIDVAGNALQLPPHIFETSEKRGT 320
Query: 326 IFDSGTSFTYLNDPAYTQI-SETFNSLAKEKRETSTSDLPFEY 367
I DSGT+ TYL + Y I + F T L FEY
Sbjct: 321 IIDSGTTLTYLPELVYKDILAAVFQKHQDITFRTIQGFLCFEY 363
>gi|30692930|ref|NP_198475.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|66792626|gb|AAY56415.1| At5g36260 [Arabidopsis thaliana]
gi|332006680|gb|AED94063.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 482
Score = 115 bits (287), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 109/374 (29%), Positives = 166/374 (44%), Gaps = 53/374 (14%)
Query: 28 GTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPL 87
G F F+ H+++ K + +L SF + LA+ D PL
Sbjct: 27 GNFVFNVTHKFAGKEK---QLSELKSHDSFRHARMLANID-----------------LPL 66
Query: 88 TFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSS 146
G D+ R +S+G L++T + +G P + V +DTGSD+ W+ C C C + +
Sbjct: 67 ----GGDS-RADSIG-LYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKC--PVKTDL 118
Query: 147 GQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSN--CPYQVRYLSDGTMSTGFLV 204
G I ++Y TSSTS V C C Q + G+ C Y V Y DG+ S G +
Sbjct: 119 G--IPLSLYDSKTSSTSKNVGCEDDFCSFIMQSETCGAKKPCSYHVVY-GDGSTSDGDFI 175
Query: 205 ED--VLHLATDEKQSKSVDSRISFGCGRVQTGSF-LDGAAPNGLFGLGMDKTSVPSILAN 261
+D L T ++ + + FGCG+ Q+G +A +G+ G G TS+ S LA
Sbjct: 176 KDNITLEQVTGNLRTAPLAQEVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAA 235
Query: 262 QGLIPNSFSMCFGS-DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAV- 319
G FS C + +G G + G+ SP TP Q H YN+ + + V G+ +
Sbjct: 236 GGSTKRIFSHCLDNMNGGGIFAVGEVESPVVKTTPIVPNQVH--YNVILKGMDVDGDPID 293
Query: 320 --------NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVL 371
N + I DSGT+ YL Y + E AK++ + F C+
Sbjct: 294 LPPSLASTNGDGGTIIDSGTTLAYLPQNLYNSLIEKIT--AKQQVKLHMVQETFA-CFSF 350
Query: 372 SPNQTNFEYPVVNL 385
+ N T+ +PVVNL
Sbjct: 351 TSN-TDKAFPVVNL 363
>gi|9759039|dbj|BAB09366.1| aspartyl protease-like [Arabidopsis thaliana]
Length = 478
Score = 115 bits (287), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 109/374 (29%), Positives = 166/374 (44%), Gaps = 53/374 (14%)
Query: 28 GTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPL 87
G F F+ H+++ K + +L SF + LA+ D PL
Sbjct: 23 GNFVFNVTHKFAGKEK---QLSELKSHDSFRHARMLANID-----------------LPL 62
Query: 88 TFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSS 146
G D+ R +S+G L++T + +G P + V +DTGSD+ W+ C C C + +
Sbjct: 63 ----GGDS-RADSIG-LYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKC--PVKTDL 114
Query: 147 GQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSN--CPYQVRYLSDGTMSTGFLV 204
G I ++Y TSSTS V C C Q + G+ C Y V Y DG+ S G +
Sbjct: 115 G--IPLSLYDSKTSSTSKNVGCEDDFCSFIMQSETCGAKKPCSYHVVY-GDGSTSDGDFI 171
Query: 205 ED--VLHLATDEKQSKSVDSRISFGCGRVQTGSF-LDGAAPNGLFGLGMDKTSVPSILAN 261
+D L T ++ + + FGCG+ Q+G +A +G+ G G TS+ S LA
Sbjct: 172 KDNITLEQVTGNLRTAPLAQEVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAA 231
Query: 262 QGLIPNSFSMCFGS-DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAV- 319
G FS C + +G G + G+ SP TP Q H YN+ + + V G+ +
Sbjct: 232 GGSTKRIFSHCLDNMNGGGIFAVGEVESPVVKTTPIVPNQVH--YNVILKGMDVDGDPID 289
Query: 320 --------NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVL 371
N + I DSGT+ YL Y + E AK++ + F C+
Sbjct: 290 LPPSLASTNGDGGTIIDSGTTLAYLPQNLYNSLIEKIT--AKQQVKLHMVQETFA-CFSF 346
Query: 372 SPNQTNFEYPVVNL 385
+ N T+ +PVVNL
Sbjct: 347 TSN-TDKAFPVVNL 359
>gi|225436397|ref|XP_002272121.1| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
vinifera]
Length = 499
Score = 114 bits (286), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 102/315 (32%), Positives = 148/315 (46%), Gaps = 31/315 (9%)
Query: 94 DTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFN 153
D YR+ L++T V +G P F V +DTGSD+ W+ C SC +G SSG I N
Sbjct: 76 DPYRVG----LYFTRVLLGSPPKEFYVQIDTGSDVLWVSCG--SC-NGCPQSSGLHIPLN 128
Query: 154 IYSPNTSSTSSKVPCNSTLCELQKQ-----CPSAGSNCPYQVRYLSDGTMSTGFLVEDVL 208
+ P +SST+S + C+ C L Q C S G+ C Y +Y DG+ ++G+ V D+L
Sbjct: 129 FFDPGSSSTASLISCSDQRCSLGVQSSDAGCSSQGNQCIYTFQY-GDGSGTSGYYVSDLL 187
Query: 209 HL-ATDEKQSKSVDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIP 266
+ A + + I FGC QTG A +G+FG G SV S +++QG+ P
Sbjct: 188 NFDAIVGSSVTNSSASIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVISQMSSQGITP 247
Query: 267 NSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGN--AVNFEFS 324
FS C DG G + L + P YN+ + +SV G A++ E
Sbjct: 248 KVFSHCLKGDGGGGGILVLGEIVEEDIVYSPLVPSQPHYNLNLQSISVNGKSLAIDPEVF 307
Query: 325 A-------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFE--YCYVLSPNQ 375
A I DSGT+ YL + AY + F S E S L + CY+++ +
Sbjct: 308 ATSTNRGTIVDSGTTLAYLAEEAY----DPFVSAITEAVSQSVRPLLSKGTQCYLITSSV 363
Query: 376 TNFEYPVVNLTMKGG 390
+P V+L GG
Sbjct: 364 KGI-FPTVSLNFAGG 377
>gi|297734873|emb|CBI17107.3| unnamed protein product [Vitis vinifera]
Length = 484
Score = 114 bits (286), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 102/315 (32%), Positives = 148/315 (46%), Gaps = 31/315 (9%)
Query: 94 DTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFN 153
D YR+ L++T V +G P F V +DTGSD+ W+ C SC +G SSG I N
Sbjct: 61 DPYRVG----LYFTRVLLGSPPKEFYVQIDTGSDVLWVSCG--SC-NGCPQSSGLHIPLN 113
Query: 154 IYSPNTSSTSSKVPCNSTLCELQKQ-----CPSAGSNCPYQVRYLSDGTMSTGFLVEDVL 208
+ P +SST+S + C+ C L Q C S G+ C Y +Y DG+ ++G+ V D+L
Sbjct: 114 FFDPGSSSTASLISCSDQRCSLGVQSSDAGCSSQGNQCIYTFQY-GDGSGTSGYYVSDLL 172
Query: 209 HL-ATDEKQSKSVDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIP 266
+ A + + I FGC QTG A +G+FG G SV S +++QG+ P
Sbjct: 173 NFDAIVGSSVTNSSASIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVISQMSSQGITP 232
Query: 267 NSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGN--AVNFEFS 324
FS C DG G + L + P YN+ + +SV G A++ E
Sbjct: 233 KVFSHCLKGDGGGGGILVLGEIVEEDIVYSPLVPSQPHYNLNLQSISVNGKSLAIDPEVF 292
Query: 325 A-------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFE--YCYVLSPNQ 375
A I DSGT+ YL + AY + F S E S L + CY+++ +
Sbjct: 293 ATSTNRGTIVDSGTTLAYLAEEAY----DPFVSAITEAVSQSVRPLLSKGTQCYLITSSV 348
Query: 376 TNFEYPVVNLTMKGG 390
+P V+L GG
Sbjct: 349 KGI-FPTVSLNFAGG 362
>gi|168060150|ref|XP_001782061.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666472|gb|EDQ53125.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 423
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 111/378 (29%), Positives = 172/378 (45%), Gaps = 60/378 (15%)
Query: 90 SAGNDTYRLNSLG-----FLHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGL 142
S GN + R + G L+Y + +G P + + +DTGSDL W CD C +C G
Sbjct: 20 SVGNHSVRFHVGGNIYPDGLYYMALLLGSPPKLYFLDMDTGSDLTWAQCDAPCRNCAIGP 79
Query: 143 NSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC-ELQK----QCPSAGSNCPYQVRYLSDGT 197
+ +Y+P + V C+ +C ++Q+ +C S C Y+V Y +DG+
Sbjct: 80 H---------GLYNPKKAKV---VDCHLPVCAQIQQGGSYECNSDVKQCDYEVEY-ADGS 126
Query: 198 MSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVP 256
+ G LVED L + + ++ GCG Q G+ A+ +G+ GL K ++P
Sbjct: 127 STMGVLVEDTLTVRL--TNGTLIQTKAIIGCGYDQQGTLAKSPASTDGVIGLSSSKVALP 184
Query: 257 SILANQGLIPNSFSMCF--GSDGTGRISFGDKGSPGQG--ETPFSLRQTHPTYNITITQV 312
+ LA +G+I N C GS+G G + FGD+ P G TP + Y + +
Sbjct: 185 AQLAEKGIIKNVLGHCLADGSNGGGYLFFGDELVPSWGMTWTPMMGKPEMLGYQARLQSI 244
Query: 313 SVGGNAVNFE---------FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDL 363
GG+++ S +FDSGTSFTYL AY + + R S + L
Sbjct: 245 RYGGDSLVLNNDEDLTRSTSSVMFDSGTSFTYLVPQAYASVLSAVTKQSGLLRVKSDTTL 304
Query: 364 PFEYCYV-LSPNQ--TNFEYPVVNLTMK-GGGPFFVNDPIVIVSSEPKGLYL------YC 413
P YC+ SP Q T+ LT+ GG +F D + +S P+G + C
Sbjct: 305 P--YCWRGPSPFQSITDVHQYFKTLTLDFGGRNWFATDSTLDLS--PQGYLIVSTQGNVC 360
Query: 414 LGVVKS-----DNVNIIG 426
LG++ + + NIIG
Sbjct: 361 LGILDASGASLEVTNIIG 378
>gi|297848856|ref|XP_002892309.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297338151|gb|EFH68568.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 484
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 99/310 (31%), Positives = 149/310 (48%), Gaps = 37/310 (11%)
Query: 68 RYFRLRGRGLA-AQGNDKTPLTFSAGND-----TYRLNSLGFLHYTNVSVGQPALSFIVA 121
RY RL+G A + +D+ LT AG D T R + G L+Y + +G PA S+ V
Sbjct: 38 RYPRLQGSLSALKEHDDRRQLTILAGIDLPLGGTGRPDIPG-LYYAKIGIGTPAKSYYVQ 96
Query: 122 LDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCP 180
+DTGSD+ W+ C C C S+ G I+ +Y+ + S + V C+ C P
Sbjct: 97 VDTGSDIMWVNCIQCKQCPR--RSTLG--IELTLYNIDESDSGKLVSCDDDFCYQISGGP 152
Query: 181 SAG----SNCPYQVRYLSDGTMSTGFLVEDVLH---LATDEKQSKSVDSRISFGCGRVQT 233
+G +CPY Y DG+ + G+ V+DV+ +A D K +++ + + FGCG Q+
Sbjct: 153 LSGCKANMSCPYLEIY-GDGSSTAGYFVKDVVQYDSVAGDLK-TQTANGSVIFGCGARQS 210
Query: 234 GSFLDGA---APNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-GSDGTGRISFGDKGSP 289
G LD + A +G+ G G +S+ S LA+ G + F+ C G +G G + G P
Sbjct: 211 GD-LDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCLDGRNGGGIFAIGRVVQP 269
Query: 290 GQGETPFSLRQTHPTYNITITQVSVGGNAVNF---------EFSAIFDSGTSFTYLNDPA 340
TP Q H YN+ +T V VG +N AI DSGT+ YL +
Sbjct: 270 KVNMTPLVPNQPH--YNVNMTAVQVGQEFLNIPADLFQPGDRKGAIIDSGTTLAYLPEII 327
Query: 341 YTQISETFNS 350
Y + + S
Sbjct: 328 YEPLVKKITS 337
>gi|357440289|ref|XP_003590422.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355479470|gb|AES60673.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 498
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 106/395 (26%), Positives = 173/395 (43%), Gaps = 43/395 (10%)
Query: 63 LAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRL------NSLGF-LHYTNVSVGQPA 115
L HR LR R G + G +R+ ++LG+ L+ T V +G P
Sbjct: 37 LNHRVEIDTLRARDRVRHG--RILRASVGGVVDFRVQGSSDPSTLGYGLYTTKVKMGTPP 94
Query: 116 LSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE 174
F V +DTGSD+ W+ C+ C +C SSG I+ N + SST++ VPC+ +C
Sbjct: 95 REFTVQIDTGSDILWINCNTCSNC----PKSSGLGIELNFFDTVGSSTAALVPCSDPMCA 150
Query: 175 -----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD----SRIS 225
QC + C Y +Y DG+ ++G V D ++ QS + + I
Sbjct: 151 SAIQGAAAQCSPQVNQCSYTFQY-EDGSGTSGVYVSDAMYFDMILGQSTPANVASSATIV 209
Query: 226 FGCGRVQTGSFLD-GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGR--IS 282
FGC Q+G A +G+ G G + SV S L+++G+ P FS C DG G +
Sbjct: 210 FGCSTYQSGDLTKTDKAVDGILGFGPGELSVVSQLSSRGITPKVFSHCLKGDGNGGGILV 269
Query: 283 FGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF---------EFSAIFDSGTSF 333
G+ P +P L + P YN+ + ++V G ++ + I DSGT+
Sbjct: 270 LGEILEPSIVYSP--LVPSQPHYNLNLQSIAVNGQVLSINPAVFATSDKRGTIIDSGTTL 327
Query: 334 TYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPF 393
+YL AY + ++ + + S CY L + +P V+ +GG
Sbjct: 328 SYLVQEAYDPLVNAVDTAVSQFATSFISK--GSQCY-LVLTSIDDSFPTVSFNFEGGASM 384
Query: 394 FVNDPIVIVSSE-PKGLYLYCLGVVK-SDNVNIIG 426
+ +++ G ++C+G K + V I+G
Sbjct: 385 DLKPSQYLLNRGFQDGAKMWCIGFQKVQEGVTILG 419
>gi|224092220|ref|XP_002309515.1| predicted protein [Populus trichocarpa]
gi|222855491|gb|EEE93038.1| predicted protein [Populus trichocarpa]
Length = 473
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 98/298 (32%), Positives = 140/298 (46%), Gaps = 32/298 (10%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+ V +G P F + DTGSDL W C+ C +D P S++
Sbjct: 133 YAVTVGLGTPKKEFTLIFDTGSDLTWTQCE--PCAKTCYKQKEPRLD-----PTKSTSYK 185
Query: 165 KVPCNSTLCELQ-----KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
+ C+S C+L + C S C YQV+Y DG+ S GF + L L+ S +
Sbjct: 186 NISCSSAFCKLLDTEGGESCSSP--TCLYQVQY-GDGSYSIGFFATETLTLS-----SSN 237
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTG 279
V FGCG+ +G F GAA GL GLG K S+PS A + S+ + S G
Sbjct: 238 VFKNFLFGCGQQNSGLF-RGAA--GLLGLGRTKLSLPSQTAQKYKKLFSYCLPASSSSKG 294
Query: 280 RISFGDKGSPGQGETPFSLR-QTHPTYNITITQVSVGGNAVNFEFS------AIFDSGTS 332
+SFG + S TP S ++ P Y + IT++SVGGN ++ + S + DSGT
Sbjct: 295 YLSFGGQVSKTVKFTPLSEDFKSTPFYGLDITELSVGGNKLSIDASIFSTSGTVIDSGTV 354
Query: 333 FTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGG 390
T L AY+ +S F L + T + F+ CY S N+T + P V ++ KGG
Sbjct: 355 ITRLPSTAYSALSSAFQKLMTDYPSTDGYSI-FDTCYDFSKNET-IKIPKVGVSFKGG 410
>gi|145324889|ref|NP_001077691.1| aspartyl protease [Arabidopsis thaliana]
gi|332194268|gb|AEE32389.1| aspartyl protease [Arabidopsis thaliana]
Length = 410
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 96/357 (26%), Positives = 158/357 (44%), Gaps = 52/357 (14%)
Query: 100 SLGFLHYTNVSVGQP--ALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIY 155
+G L+YT + VG+P + + +DTGS+L W+ CD C SC G N +Y
Sbjct: 25 QMGMLYYTRILVGKPEDGQYYHLDIDTGSELTWIQCDAPCTSCAKGAN---------QLY 75
Query: 156 SP---NTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLAT 212
P N +S +L + C + C Y++ Y +D + S G L +D HL
Sbjct: 76 KPRKDNLVRSSEAFCVEVQRNQLTEHCENC-HQCDYEIEY-ADHSYSMGVLTKDKFHLKL 133
Query: 213 DEKQSKSVDSRISFGCGRVQTGSFLDGA-APNGLFGLGMDKTSVPSILANQGLIPNSFSM 271
+S I FGCG Q G L+ +G+ GL K S+PS LA++G+I N
Sbjct: 134 --HNGSLAESDIVFGCGYDQQGLLLNTLLKTDGILGLSRAKISLPSQLASRGIISNVVGH 191
Query: 272 CFGSD--GTGRISFGDKGSPGQGET--PFSLRQTHPTYNITITQVSVGGNAVNFEFS--- 324
C SD G G I G P G T P Y + +T++S G ++ +
Sbjct: 192 CLASDLNGEGYIFMGSDLVPSHGMTWVPMLHDSRLDAYQMQVTKMSYGQGMLSLDGENGR 251
Query: 325 ---AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYP 381
+FD+G+S+TY + AY+Q+ + ++ + SD C+ +TNF +
Sbjct: 252 VGKVLFDTGSSYTYFPNQAYSQLVTSLQEVSGLELTRDDSDETLPICW---RAKTNFPFS 308
Query: 382 VVN--------LTMKGGGPFFVNDPIVIVSSEPKGLYL-------YCLGVVKSDNVN 423
++ +T++ G + + +++ E YL CLG++ +V+
Sbjct: 309 SLSDVKKFFRPITLQIGSKWLIISRKLLIQPED---YLIISNKGNVCLGILDGSSVH 362
>gi|297847186|ref|XP_002891474.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297337316|gb|EFH67733.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 578
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 99/354 (27%), Positives = 154/354 (43%), Gaps = 54/354 (15%)
Query: 104 LHYTNVSVGQP--ALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSP-- 157
L+YT + VG+P + + +DTGSDL W+ CD C SC G N +Y P
Sbjct: 197 LYYTRILVGKPEDGQYYHLDIDTGSDLTWIQCDAPCTSCAKGAN---------QLYKPRK 247
Query: 158 -NTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
N +S +L + C S C Y++ Y +D + S G L +D HL
Sbjct: 248 DNLVRSSEPFCVEVQRNQLTEHCESC-HQCDYEIEY-ADHSYSMGVLTKDKFHLKL--HN 303
Query: 217 SKSVDSRISFGCGRVQTGSFLDGA-APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
+S I FGCG Q G L+ +G+ GL K S+PS LA++G+I N C S
Sbjct: 304 GSLAESDIVFGCGYDQQGLLLNTLLKTDGILGLSRAKISLPSQLASRGIISNVVGHCLAS 363
Query: 276 D--GTGRISFGDKGSPGQGETPFSLRQTHP---TYNITITQVSVGGNAVNFEFS------ 324
D G G I G P G T + HP Y + +T++S G ++ +
Sbjct: 364 DLNGEGYIFMGSDLVPSHGMTWVPMLH-HPHLEVYQMQVTKMSYGNAMLSLDGENGRVGK 422
Query: 325 AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQ--------T 376
+FD+G+S+TY + AY+Q+ + ++ + SD C+ N
Sbjct: 423 VLFDTGSSYTYFPNQAYSQLVTSLQEVSDLELTRDDSDEALPICWRAKTNSPISSLSDVK 482
Query: 377 NFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYL-------YCLGVVKSDNVN 423
F P+ T++ G + + +++ E YL CLG++ NV+
Sbjct: 483 KFFRPI---TLQIGSKWLIISKKLLIQPED---YLIISNKGNVCLGILDGSNVH 530
>gi|449440161|ref|XP_004137853.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449521209|ref|XP_004167622.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 492
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 102/344 (29%), Positives = 160/344 (46%), Gaps = 30/344 (8%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
L++T V +G P + F V +DTGSD+ W+ C+ SC +G SSG I N + ++SS+S
Sbjct: 78 LYFTKVKLGTPPMEFTVQIDTGSDILWVNCN--SC-NGCPRSSGLGIQLNFFDASSSSSS 134
Query: 164 SKVP-----CNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
S V CNS QC + + C Y +Y DG+ ++G+ V + ++ QS
Sbjct: 135 SLVSCSDPICNSAFQTTATQCLTQSNQCSYTFQY-GDGSGTSGYYVSESMYFDMVMGQSM 193
Query: 219 SVDSRIS--FGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
+S S FGC Q+G A +G+FG G SV S L+ +G+ P FS C
Sbjct: 194 IANSSASVVFGCSTYQSGDLTKSDHAIDGIFGFGPGDLSVISQLSARGITPKVFSHCLKG 253
Query: 276 DGT--GRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGN---------AVNFEFS 324
+G G + G+ PG +P L + P YN+ + +SV G A +
Sbjct: 254 EGNGGGILVLGEVLEPGIVYSP--LVPSQPHYNLYLQSISVNGQTLPIDPSVFATSINRG 311
Query: 325 AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVN 384
I DSGT+ YL + AYT + + + S CY++S + +P+V+
Sbjct: 312 TIIDSGTTLAYLVEEAYTPFVSAITAAVSQSVTPTISK--GNQCYLVSTSVGEI-FPLVS 368
Query: 385 LTMKGGGPFFVN-DPIVIVSSEPKGLYLYCLGVVK-SDNVNIIG 426
L G + + ++ G L+C+G K + V I+G
Sbjct: 369 LNFAGSASMVLKPEEYLMHLGFYDGAALWCIGFQKVQEGVTILG 412
>gi|224140237|ref|XP_002323490.1| predicted protein [Populus trichocarpa]
gi|222868120|gb|EEF05251.1| predicted protein [Populus trichocarpa]
Length = 478
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 98/345 (28%), Positives = 156/345 (45%), Gaps = 32/345 (9%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L++T V +G P F V +DTGSD+ W+ C+ C +C +SG I N + ++SST
Sbjct: 65 LYFTKVKLGSPPREFNVQIDTGSDVLWVCCNSCNNCPR----TSGLGIQLNFFDSSSSST 120
Query: 163 SSKVPCNSTLC-----ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
+ V C+ +C QC + C Y +Y DG+ ++G+ V D L+ +S
Sbjct: 121 AGLVHCSDPICTSAVQTTVTQCSPQTNQCSYTFQY-EDGSGTSGYYVSDTLYFDAILGES 179
Query: 218 KSVDSR--ISFGCGRVQTGSF-LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG 274
V+S I FGC Q+G + A +G+FG G + SV S L+ G+ P FS C
Sbjct: 180 LVVNSSALIVFGCSTFQSGDLTMTDKAVDGIFGFGQGELSVISQLSTHGITPRVFSHCLK 239
Query: 275 SD--GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA------- 325
+ G G + G+ PG +P L + P YN+ + ++V G + + S
Sbjct: 240 GEGIGGGILVLGEILEPGMVYSP--LVPSQPHYNLNLQSIAVNGKLLPIDPSVFATSNSQ 297
Query: 326 --IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
I DSGT+ YL AY N + S CY++S + + +P+
Sbjct: 298 GTIVDSGTTLAYLVAEAYDPFVSAVNVIVSPSVTPIISK--GNQCYLVSTSVSQM-FPLA 354
Query: 384 NLTMKGGGPFFVN--DPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
+ GG + D ++ G ++C+G K V I+G
Sbjct: 355 SFNFAGGASMVLKPEDYLIPFGPSQGGSVMWCIGFQKVQGVTILG 399
>gi|242057809|ref|XP_002458050.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
gi|241930025|gb|EES03170.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
Length = 489
Score = 112 bits (280), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 82/265 (30%), Positives = 124/265 (46%), Gaps = 24/265 (9%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
L++T + +G P + V +DTGSD+ W+ +C+SC SG +D Y P SS+
Sbjct: 83 LYFTEIKLGTPPKRYYVQVDTGSDILWV--NCISC-EKCPRKSGLGLDLTFYDPKASSSG 139
Query: 164 SKVPCNSTLCELQ--KQCPSAGSN--CPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQS 217
S V C+ C + P +N C Y V Y DG+ +TGF V D L T + Q+
Sbjct: 140 STVSCDQGFCAATYGGKLPGCTANVPCEYSVMY-GDGSSTTGFFVTDALQFDQVTGDGQT 198
Query: 218 KSVDSRISFGCGRVQTGSF-LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS- 275
+ ++ ++FGCG Q G A +G+ G G TS+ S LA G + F+ C +
Sbjct: 199 QPGNATVTFGCGAQQGGDLGSSNQALDGILGFGQANTSMLSQLAAAGKVKKIFAHCLDTI 258
Query: 276 DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF---------EFSAI 326
G G + G+ P TP L P YN+ + + VGG + I
Sbjct: 259 KGGGIFAIGNVVQPKVKTTP--LVADMPHYNVNLKSIDVGGTTLQLPAHVFETGERKGTI 316
Query: 327 FDSGTSFTYLNDPAYTQI-SETFNS 350
DSGT+ TYL + + ++ + FN
Sbjct: 317 IDSGTTLTYLPELVFKEVMAAIFNK 341
>gi|148907752|gb|ABR17002.1| unknown [Picea sitchensis]
Length = 454
Score = 112 bits (279), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 108/405 (26%), Positives = 172/405 (42%), Gaps = 39/405 (9%)
Query: 59 YYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSF 118
++ L DR GR L N T D Y + L+YT + +G P F
Sbjct: 5 HFEMLKAHDR--ARHGRSL----NTIVDFTLQGTADPY----VAGLYYTRIELGTPPRPF 54
Query: 119 IVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC---- 173
V +DTGSD+ W+ C C +C +SG + N + P SST+S + C + C
Sbjct: 55 YVQIDTGSDILWVNCKPCNACPL----TSGLGVALNFFDPRGSSTASPLSCIDSKCVSSN 110
Query: 174 ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVL--HLATDEKQSKSVDSRISFGCGRV 231
++ + + C Y Y DG+ + G+ V D + ++ + + ++I+FGC
Sbjct: 111 QISESVCTTDRYCGYSFEY-GDGSGTLGYYVSDEFDYNQYVNQYVTNNASAKITFGCSYN 169
Query: 232 QTGSFLD-GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-GSD-GTGRISFGDKGS 288
Q+G A +G+FG G + SV S L +QGL P FS C G+D G G + G+
Sbjct: 170 QSGDLTKPDRAVDGIFGFGQNDLSVVSQLNSQGLAPKIFSHCLEGADPGGGILVLGEITE 229
Query: 289 PGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE---------FSAIFDSGTSFTYLNDP 339
PG TP Q H YN+ + ++V G ++ + I D GT+ YL +
Sbjct: 230 PGMVYTPIVPSQPH--YNLNLQGIAVNGQQLSIDPQVFATTNTRGTIIDCGTTLAYLAEE 287
Query: 340 AYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPI 399
AY T +A + T L C+ L+ + + +P V L +G
Sbjct: 288 AYEPFVNTI--IAAVSQSTQPFMLKGNPCF-LTVHSIDEIFPSVTLYFEGAPMDLKPKDY 344
Query: 400 VIVSSEPKGLYLYCLGVVKSDNVNIIGREYPIANNISLFHNCYSY 444
+I P ++C+G KS + I ++ L + Y
Sbjct: 345 LIQQLSPDSSPVWCIGWQKSGQQATDSSKMTILGDLVLKDKVFVY 389
>gi|413953655|gb|AFW86304.1| hypothetical protein ZEAMMB73_151223 [Zea mays]
Length = 535
Score = 111 bits (278), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 79/278 (28%), Positives = 132/278 (47%), Gaps = 30/278 (10%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD---CVSCVHGLNSSSGQVIDFNIYSPNTS 160
L+YT +S+G P + + +DTGS W+ CD C SC G + +Y P +
Sbjct: 159 LYYTAISLGSPPRPYFLDVDTGSHTTWVQCDAPPCASCAKGAHP---------LYRP--A 207
Query: 161 STSSKVPCNSTLCE-LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
T+ +P + LCE Q + P + C Y++ Y +DG+ S G V D + ++ + ++
Sbjct: 208 RTADALPASDPLCEGAQHENP---NQCDYEISY-ADGSSSMGVYVRDSMQFVGEDGEREN 263
Query: 220 VDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT 278
D I FGCG Q G L+ +G+ GL S+P+ LA++G+I N+F C +D +
Sbjct: 264 AD--IVFGCGYDQQGVLLNALETTDGVLGLTNKALSLPTQLASRGIISNAFGHCMSTDPS 321
Query: 279 GR---ISFGDKGSPGQGETPFSLRQ--THPTYNITITQVSVGGNAVNFE---FSAIFDSG 330
G + GD P G T +R + Q++ G +N + +FD+G
Sbjct: 322 GAGGYLFLGDDYIPRWGMTWVPIRDGPADDVRRAQVKQINHGDQQLNAQGKLTQVVFDTG 381
Query: 331 TSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYC 368
+++TY D A T++ + A + SD +C
Sbjct: 382 STYTYFPDEALTRLISSLKEAASPRFVQDDSDKTLPFC 419
>gi|18390579|ref|NP_563751.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|332189782|gb|AEE27903.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 485
Score = 111 bits (278), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 98/310 (31%), Positives = 148/310 (47%), Gaps = 37/310 (11%)
Query: 68 RYFRLRGRGLA-AQGNDKTPLTFSAGND-----TYRLNSLGFLHYTNVSVGQPALSFIVA 121
RY RL+G A + +D+ LT AG D T R + G L+Y + +G PA S+ V
Sbjct: 38 RYPRLQGSLTALKEHDDRRQLTILAGIDLPLGGTGRPDIPG-LYYAKIGIGTPAKSYYVQ 96
Query: 122 LDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCP 180
+DTGSD+ W+ C C C S+ G I+ +Y+ + S + V C+ C P
Sbjct: 97 VDTGSDIMWVNCIQCKQCPR--RSTLG--IELTLYNIDESDSGKLVSCDDDFCYQISGGP 152
Query: 181 SAG----SNCPYQVRYLSDGTMSTGFLVEDVLH---LATDEKQSKSVDSRISFGCGRVQT 233
+G +CPY Y DG+ + G+ V+DV+ +A D K +++ + + FGCG Q+
Sbjct: 153 LSGCKANMSCPYLEIY-GDGSSTAGYFVKDVVQYDSVAGDLK-TQTANGSVIFGCGARQS 210
Query: 234 GSFLDGA---APNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-GSDGTGRISFGDKGSP 289
G LD + A +G+ G G +S+ S LA+ G + F+ C G +G G + G P
Sbjct: 211 GD-LDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCLDGRNGGGIFAIGRVVQP 269
Query: 290 GQGETPFSLRQTHPTYNITITQVSVGGNAVNF---------EFSAIFDSGTSFTYLNDPA 340
TP Q H YN+ +T V VG + AI DSGT+ YL +
Sbjct: 270 KVNMTPLVPNQPH--YNVNMTAVQVGQEFLTIPADLFQPGDRKGAIIDSGTTLAYLPEII 327
Query: 341 YTQISETFNS 350
Y + + S
Sbjct: 328 YEPLVKKITS 337
>gi|255547548|ref|XP_002514831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223545882|gb|EEF47385.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 488
Score = 111 bits (277), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 107/460 (23%), Positives = 185/460 (40%), Gaps = 63/460 (13%)
Query: 6 RNSPVCVLLILLSCCAGCCFGFGTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAH 65
R + V L++++ C G + F+ H+++ + + A+ + SA+
Sbjct: 9 RLATVLSLVVIVELGFVVCLSNGNYVFNVQHKFAGKERSLSALKQHDARRHRRILSAVD- 67
Query: 66 RDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTG 125
L G G A+ L++ + +G P + V +DTG
Sbjct: 68 ----LPLGGNGHPAEAG---------------------LYFAKIGLGNPPKDYYVQVDTG 102
Query: 126 SDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGS 184
SD+ W+ C +C C + S + +Y P +S++++++ C+ C G
Sbjct: 103 SDILWVNCANCDKC----PTKSDLGVKLTLYDPQSSTSATRIYCDDDFCAATYNGVLQGC 158
Query: 185 N----CPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQSKSVDSRISFGCGRVQTGSF-L 237
C Y V Y DG+ + GF V+D L T Q+ S + + FGCG Q+G
Sbjct: 159 TKDLPCQYSVVY-GDGSSTAGFFVKDNLQFDRVTGNLQTSSANGSVIFGCGAKQSGELGT 217
Query: 238 DGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGTGRISFGDKGSPGQGETPF 296
A +G+ G G +S+ S LA G + F+ C + G G + G+ SP TP
Sbjct: 218 SSEALDGILGFGQANSSMISQLAAAGKVKRVFAHCLDNVKGGGIFAIGEVVSPKVNTTPM 277
Query: 297 SLRQTHPTYNITITQVSVGGNAVNF---------EFSAIFDSGTSFTYLNDPAYTQISET 347
Q H YN+ + ++ VGGN + I DSGT+ YL + Y +
Sbjct: 278 VPNQPH--YNVVMKEIEVGGNVLELPTDIFDTGDRRGTIIDSGTTLAYLPEVVYESM--- 332
Query: 348 FNSLAKEKRETSTSDLPFEY-CYVLSPNQTNFEYPVVNLTMKGGGPFFVN--DPIVIVSS 404
+ E+ + ++ C+ + N N +PVV G VN D + +
Sbjct: 333 MTKIVSEQPGLKLHTVEEQFTCFQYTGN-VNEGFPVVKFHFNGSLSLTVNPHDYLFQIHE 391
Query: 405 EPKGLYLYCLGVVKSDNVNIIGREYPIANNISLFHNCYSY 444
E ++C G S + GR+ + ++ L + Y
Sbjct: 392 E-----VWCFGWQNSGMQSKDGRDMTLLGDLVLSNKLVLY 426
>gi|297734190|emb|CBI15437.3| unnamed protein product [Vitis vinifera]
Length = 473
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 85/283 (30%), Positives = 127/283 (44%), Gaps = 32/283 (11%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
L++T++ VG P + + +DTGSDL W+ CD C SC G N +Y P +
Sbjct: 100 LYFTHIFVGSPPRRYFLDMDTGSDLTWIQCDAPCTSCAKGPNP---------LYKPKKGN 150
Query: 162 TSSKVPCNSTLC-ELQKQCPSA----GSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
VP +LC E+Q+ + C Y++ Y +D + S G L D LHL
Sbjct: 151 L---VPLKDSLCVEVQRNLKTGYCETCEQCDYEIEY-ADHSSSMGVLASDDLHLMLANGS 206
Query: 217 SKSVDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
+ I FGC Q G L+ A +G+ GL K S+PS LA+Q +I N C S
Sbjct: 207 LTKLG--IMFGCAYDQQGLLLNSLAKTDGILGLSKAKVSLPSQLASQRIINNVLGHCLTS 264
Query: 276 DGT--GRISFGDKGSPGQGETPFSLRQTH-PTYNITITQVSVGGNAVNF------EFSAI 326
D T G + GD P G + +H P Y+ I ++S G ++ +
Sbjct: 265 DATGGGYMFLGDDFVPYWGMAWVPMLNSHSPNYHSQIMKISHGSRQLSLGRQDGRTERVV 324
Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
FD+G+S+TY AY + + ++ E SD C+
Sbjct: 325 FDTGSSYTYFPKEAYYALVASLKDVSDEGLIQDGSDPTLPVCW 367
>gi|388518257|gb|AFK47190.1| unknown [Lotus japonicus]
Length = 478
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 91/303 (30%), Positives = 138/303 (45%), Gaps = 37/303 (12%)
Query: 73 RGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLP 132
RGR L+A + F+ G + L ++ L++T + +G P+ + V +DTGSD+ W+
Sbjct: 46 RGRILSA-------VDFNLGGNG--LPTVTGLYFTKIGLGSPSKDYYVQVDTGSDILWVN 96
Query: 133 C-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC----ELQKQCPSAGSNCP 187
C +C C S I +Y P S TS V C C E + A + CP
Sbjct: 97 CVECTRCPR----KSDIGIGLTLYDPKRSKTSEFVSCEHNFCSSTYEGRILGCKAENPCP 152
Query: 188 YQVRYLSDGTMSTGFLVEDVL--HLATDEKQSKSVDSRISFGCGRVQTGSFLDGA--APN 243
Y + Y DG+ +TG+ V+D L + + + +S I FGCG Q+G+F + A +
Sbjct: 153 YSISY-GDGSATTGYYVQDYLTFNRVNGNPHTATQNSSIIFGCGAAQSGTFASSSEEALD 211
Query: 244 GLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD-GTGRISFGDKGSPGQGETPFSLRQTH 302
G+ G G +SV S LA G + FS C ++ G G S G+ P TP H
Sbjct: 212 GIIGFGQANSSVLSQLAASGKVKKIFSHCLDTNVGGGIFSIGEVVEPKVKTTPLVPNMAH 271
Query: 303 PTYNITITQVSVGGNAVNFEFS---------AIFDSGTSFTYLNDPAYTQISETFNSLAK 353
YN+ + + V G+ + + DSGT+ YL Y Q+ LAK
Sbjct: 272 --YNVILKNIEVDGDILQLPSDTFDSENGKGTVIDSGTTLAYLPRIVYDQLMSKV--LAK 327
Query: 354 EKR 356
+ R
Sbjct: 328 QPR 330
>gi|224128838|ref|XP_002320434.1| predicted protein [Populus trichocarpa]
gi|222861207|gb|EEE98749.1| predicted protein [Populus trichocarpa]
Length = 485
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 104/332 (31%), Positives = 155/332 (46%), Gaps = 38/332 (11%)
Query: 82 NDKTPLTFSAGND-----TYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPC-DC 135
+D+ L AG D R + LG L+Y + +G P + V +DTGSD+ W+ C C
Sbjct: 51 DDQRQLRILAGVDLPLGGIGRPDILG-LYYAKIGIGTPTKDYYVQVDTGSDIMWVNCIQC 109
Query: 136 VSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC-ELQ-KQCP--SAGSNCPYQVR 191
C SS G ID +Y+ N S T VPC+ C E+ Q P +A +CPY
Sbjct: 110 RECPK--TSSLG--IDLTLYNINESDTGKLVPCDQEFCYEINGGQLPGCTANMSCPYLEI 165
Query: 192 YLSDGTMSTGFLVEDVLHLA--TDEKQSKSVDSRISFGCGRVQTGSF--LDGAAPNGLFG 247
Y DG+ + G+ V+DV+ A + + ++ + + + FGCG Q+G + A +G+ G
Sbjct: 166 Y-GDGSSTAGYFVKDVVQYARVSGDLKTTAANGSVIFGCGARQSGDLGSSNEEALDGILG 224
Query: 248 LGMDKTSVPSILANQGLIPNSFSMCF-GSDGTGRISFGDKGSPGQGETPFSLRQTHPTYN 306
G +S+ S LA G + F+ C G++G G G P TP Q H YN
Sbjct: 225 FGKSNSSMISQLAVTGKVKKIFAHCLDGTNGGGIFVIGHVVQPKVNMTPLIPNQPH--YN 282
Query: 307 ITITQVSVGGNAVN-----FEF----SAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRE 357
+ +T V VG ++ FE AI DSGT+ YL + Y + S + +
Sbjct: 283 VNMTAVQVGHEFLSLPTDVFEAGDRKGAIIDSGTTLAYLPEMVYKPLVSKIISQQPDLKV 342
Query: 358 TSTSD--LPFEYCYVLS---PNQT-NFEYPVV 383
+ D F+Y L PN T +FE V+
Sbjct: 343 HTVRDEYTCFQYSDSLDDGFPNVTFHFENSVI 374
>gi|6850312|gb|AAF29389.1|AC009999_9 Contains similarity to nucellin from Hordeum vulgare gb|U87148.
ESTs gb|T22068, gb|F14251, gb|F14237, gb|F14242 come
from this gene [Arabidopsis thaliana]
Length = 388
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 98/311 (31%), Positives = 149/311 (47%), Gaps = 37/311 (11%)
Query: 68 RYFRLRGRGLA-AQGNDKTPLTFSAGND-----TYRLNSLGFLHYTNVSVGQPALSFIVA 121
RY RL+G A + +D+ LT AG D T R + G L+Y + +G PA S+ V
Sbjct: 38 RYPRLQGSLTALKEHDDRRQLTILAGIDLPLGGTGRPDIPG-LYYAKIGIGTPAKSYYVQ 96
Query: 122 LDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCP 180
+DTGSD+ W+ C C C S+ G I+ +Y+ + S + V C+ C P
Sbjct: 97 VDTGSDIMWVNCIQCKQCPR--RSTLG--IELTLYNIDESDSGKLVSCDDDFCYQISGGP 152
Query: 181 SAG----SNCPYQVRYLSDGTMSTGFLVEDVLH---LATDEKQSKSVDSRISFGCGRVQT 233
+G +CPY Y DG+ + G+ V+DV+ +A D K +++ + + FGCG Q+
Sbjct: 153 LSGCKANMSCPYLEIY-GDGSSTAGYFVKDVVQYDSVAGDLK-TQTANGSVIFGCGARQS 210
Query: 234 GSFLDGA---APNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-GSDGTGRISFGDKGSP 289
G LD + A +G+ G G +S+ S LA+ G + F+ C G +G G + G P
Sbjct: 211 GD-LDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCLDGRNGGGIFAIGRVVQP 269
Query: 290 GQGETPFSLRQTHPTYNITITQVSVGGNAVNF---------EFSAIFDSGTSFTYLNDPA 340
TP Q H YN+ +T V VG + AI DSGT+ YL +
Sbjct: 270 KVNMTPLVPNQPH--YNVNMTAVQVGQEFLTIPADLFQPGDRKGAIIDSGTTLAYLPEII 327
Query: 341 YTQISETFNSL 351
Y + + +L
Sbjct: 328 YEPLVKKEPAL 338
>gi|225455900|ref|XP_002275943.1| PREDICTED: aspartic proteinase Asp1-like [Vitis vinifera]
Length = 686
Score = 110 bits (275), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 85/283 (30%), Positives = 127/283 (44%), Gaps = 32/283 (11%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
L++T++ VG P + + +DTGSDL W+ CD C SC G N +Y P +
Sbjct: 313 LYFTHIFVGSPPRRYFLDMDTGSDLTWIQCDAPCTSCAKGPNP---------LYKPKKGN 363
Query: 162 TSSKVPCNSTLC-ELQKQCPSA----GSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
VP +LC E+Q+ + C Y++ Y +D + S G L D LHL
Sbjct: 364 L---VPLKDSLCVEVQRNLKTGYCETCEQCDYEIEY-ADHSSSMGVLASDDLHLMLANGS 419
Query: 217 SKSVDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
+ I FGC Q G L+ A +G+ GL K S+PS LA+Q +I N C S
Sbjct: 420 LTKLG--IMFGCAYDQQGLLLNSLAKTDGILGLSKAKVSLPSQLASQRIINNVLGHCLTS 477
Query: 276 DGT--GRISFGDKGSPGQGETPFSLRQTH-PTYNITITQVSVGGNAVNF------EFSAI 326
D T G + GD P G + +H P Y+ I ++S G ++ +
Sbjct: 478 DATGGGYMFLGDDFVPYWGMAWVPMLNSHSPNYHSQIMKISHGSRQLSLGRQDGRTERVV 537
Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
FD+G+S+TY AY + + ++ E SD C+
Sbjct: 538 FDTGSSYTYFPKEAYYALVASLKDVSDEGLIQDGSDPTLPVCW 580
>gi|296089645|emb|CBI39464.3| unnamed protein product [Vitis vinifera]
Length = 477
Score = 110 bits (275), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 94/317 (29%), Positives = 142/317 (44%), Gaps = 32/317 (10%)
Query: 70 FRLRGRGLAA-QGNDKT-PLTFSAGND-----TYRLNSLGFLHYTNVSVGQPALSFIVAL 122
F + R LAA + +D + L AG D T R ++G L+Y + +G PA + V +
Sbjct: 57 FAGQKRSLAALKAHDNSRQLRILAGVDLPLGGTGRPEAVG-LYYAKIGIGTPARDYYVQV 115
Query: 123 DTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPS 181
DTGSD+ W+ C C C SS G ++ +Y S T V C+ C P
Sbjct: 116 DTGSDIMWVNCIQCNECPK--KSSLG--MELTLYDIKESLTGKLVSCDQDFCYAINGGPP 171
Query: 182 ----AGSNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQSKSVDSRISFGCGRVQTGS 235
A +C Y Y +DG+ S G+ V D++ + + ++ S + + FGC Q+G
Sbjct: 172 SYCIANMSCSYTEIY-ADGSSSFGYFVRDIVQYDQVSGDLETTSANGSVIFGCSATQSGD 230
Query: 236 FLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-GSDGTGRISFGDKGSPGQGET 294
A +G+ G G TS+ S LA+ G + F+ C G +G G + G P T
Sbjct: 231 LSSEEALDGILGFGKSNTSMISQLASSGKVRKMFAHCLDGLNGGGIFAIGHIVQPKVNTT 290
Query: 295 PFSLRQTHPTYNITITQVSVGGNAVNF---------EFSAIFDSGTSFTYLNDPAYTQIS 345
P QTH YN+ + V VGG +N + I DSGT+ YL + Y Q+
Sbjct: 291 PLVPNQTH--YNVNMKAVEVGGYFLNLPTDVFDVGDKKGTIIDSGTTLAYLPEVVYDQLL 348
Query: 346 ETFNSLAKEKRETSTSD 362
S + + + D
Sbjct: 349 SKIFSWQSDLKVHTIHD 365
>gi|356535252|ref|XP_003536162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 475
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 79/260 (30%), Positives = 117/260 (45%), Gaps = 26/260 (10%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L++T + +G P + V +DTGSD+ W+ C C C S ID +Y P S T
Sbjct: 69 LYFTKLGLGSPPKDYYVQVDTGSDILWVNCVKCSRCPR----KSDLGIDLTLYDPKGSET 124
Query: 163 SSKVPCNSTLCELQKQCPSAGSN----CPYQVRYLSDGTMSTGFLVEDVL--HLATDEKQ 216
S + C+ C P G CPY + Y DG+ +TG+ V+D L + D +
Sbjct: 125 SELISCDQEFCSATYDGPIPGCKSEIPCPYSITY-GDGSATTGYYVQDYLTYNHVNDNLR 183
Query: 217 SKSVDSRISFGCGRVQTGSFLDGA--APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG 274
+ +S I FGCG VQ+G+ + A +G+ G G +SV S LA G + FS C
Sbjct: 184 TAPQNSSIIFGCGAVQSGTLSSSSEEALDGIIGFGQSNSSVLSQLAASGKVKKIFSHCLD 243
Query: 275 S-DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS--------- 324
+ G G + G+ P TP R H YN+ + + V + +
Sbjct: 244 NIRGGGIFAIGEVVEPKVSTTPLVPRMAH--YNVVLKSIEVDTDILQLPSDIFDSGNGKG 301
Query: 325 AIFDSGTSFTYLNDPAYTQI 344
I DSGT+ YL Y ++
Sbjct: 302 TIIDSGTTLAYLPAIVYDEL 321
>gi|302756119|ref|XP_002961483.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
gi|300170142|gb|EFJ36743.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
Length = 388
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 104/340 (30%), Positives = 150/340 (44%), Gaps = 45/340 (13%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
L++T V +G P +IV +DTGSD+ W+ C S G S I +Y P SST+
Sbjct: 1 LYFTQVGLGNPVKHYIVQVDTGSDVLWVNCRPCS---GCPRKSALNIPLTMYDPRESSTT 57
Query: 164 SKVPCNSTLCE-----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL-ATDEKQS 217
S V C+ LC + QC A +NC Y Y DG+ S G+ V D +
Sbjct: 58 SLVSCSDPLCVRGRRFAEAQCSQATNNCEYIFSY-GDGSTSEGYYVRDAMQYNVISSNGL 116
Query: 218 KSVDSRISFGCGRVQTGSF-LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD 276
+ S++ FGC QTG A +G+ G G + SVP+ LA Q IP FS C +
Sbjct: 117 ANTTSQVLFGCSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIPRVFSHCL--E 174
Query: 277 GTGR----ISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF---EFSA---- 325
G R + G PG TP H YN+ + +SV N + +FS+
Sbjct: 175 GEKRGGGILVIGGIAEPGMTYTPLVPDSVH--YNVVLRGISVNSNRLPIDAEDFSSTNDT 232
Query: 326 --IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEY------CYVLSPNQTN 377
I DSGT+ Y AY N + RE +TS P C+++S ++
Sbjct: 233 GVIMDSGTTLAYFPSGAY-------NVFVQAIRE-ATSATPVRVQGMDTQCFLVSGRLSD 284
Query: 378 FEYPVVNLTMKGGGPFFVNDPIVIV-SSEPKGLY-LYCLG 415
+P V L +GG D ++ + P G ++C+G
Sbjct: 285 L-FPNVTLNFEGGAMELQPDNYLMWGGTAPTGTTDVWCIG 323
>gi|4646203|gb|AAD26876.1|AC007230_10 Belongs to PF|00026 Eukaryotic aspartyl protease family
[Arabidopsis thaliana]
Length = 449
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 98/334 (29%), Positives = 152/334 (45%), Gaps = 32/334 (9%)
Query: 97 RLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIY 155
R++S+G L++T + +G P + V +DTGSD+ W+ C C C N + +++
Sbjct: 67 RVDSVG-LYFTKIKLGSPPKEYHVQVDTGSDILWINCKPCPKCPTKTNLN----FRLSLF 121
Query: 156 SPNTSSTSSKVPCNSTLCELQKQCPSA--GSNCPYQVRYLSDGTMSTGFLVEDVLHL--A 211
N SSTS KV C+ C Q S C Y + Y +D + S G + D+L L
Sbjct: 122 DMNASSTSKKVGCDDDFCSFISQSDSCQPALGCSYHIVY-ADESTSDGKFIRDMLTLEQV 180
Query: 212 TDEKQSKSVDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFS 270
T + ++ + + FGCG Q+G +G +A +G+ G G TSV S LA G FS
Sbjct: 181 TGDLKTGPLGQEVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFS 240
Query: 271 MCFGS-DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS----- 324
C + G G + G SP TP Q H YN+ + + V G +++ S
Sbjct: 241 HCLDNVKGGGIFAVGVVDSPKVKTTPMVPNQMH--YNVMLMGMDVDGTSLDLPRSIVRNG 298
Query: 325 -AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
I DSGT+ Y Y + ET LA++ + + F+ C+ S N + +P V
Sbjct: 299 GTIVDSGTTLAYFPKVLYDSLIETI--LARQPVKLHIVEETFQ-CFSFSTN-VDEAFPPV 354
Query: 384 NLTMKGGGPFFV--NDPIVIVSSEPKGLYLYCLG 415
+ + V +D + + E LYC G
Sbjct: 355 SFEFEDSVKLTVYPHDYLFTLEEE-----LYCFG 383
>gi|255541790|ref|XP_002511959.1| protein with unknown function [Ricinus communis]
gi|223549139|gb|EEF50628.1| protein with unknown function [Ricinus communis]
Length = 583
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 85/285 (29%), Positives = 129/285 (45%), Gaps = 35/285 (12%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
L++T + VG P + + +DT SDL W+ CD C SC G N+ +Y P +
Sbjct: 207 LYFTYILVGNPPRPYYLDIDTASDLTWIQCDAPCTSCAKGANA---------LYKPRRDN 257
Query: 162 TSSKVPCNSTLCELQKQCPSAG-----SNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
+ P +S EL + AG C Y++ Y +D + S G L D LHL
Sbjct: 258 IVT--PKDSLCVELHRN-QKAGYCETCQQCDYEIEY-ADHSSSMGVLARDELHLTM--AN 311
Query: 217 SKSVDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
S + + +FGC Q G L+ +G+ GL K S+PS LAN+G+I N C +
Sbjct: 312 GSSTNLKFNFGCAYDQQGLLLNTLVKTDGILGLSKAKVSLPSQLANRGIINNVVGHCLAN 371
Query: 276 D--GTGRISFGDKGSP--GQGETPFSLRQTHPTYNITITQ-------VSVGGNAVNFEFS 324
D G G + GD P G P + +Y I + +S+GG
Sbjct: 372 DVVGGGYMFLGDDFVPRWGMSWVPMLDSPSIDSYQTQIMKLNYGSGPLSLGGQERRVR-R 430
Query: 325 AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
+FDSG+S+TY AY+++ + ++ E TSD +C+
Sbjct: 431 IVFDSGSSYTYFTKEAYSELVASLKQVSGEALIQDTSDPTLPFCW 475
>gi|326498555|dbj|BAJ98705.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 508
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 83/274 (30%), Positives = 132/274 (48%), Gaps = 25/274 (9%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
+YT++++G PA + + +DTGS L W+ CD C +C G + + NI P S
Sbjct: 129 YYTSINIGNPARPYFLDVDTGSALTWIQCDAPCTNCTKGPHPLYKPAKE-NIVPPRDSHC 187
Query: 163 SSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
++ N C+ KQ C Y++ Y +D + S G L D + L T + + +++D
Sbjct: 188 -QELQGNQNYCDTCKQ-------CDYEIAY-ADRSSSAGVLARDNMELITADGERENMD- 237
Query: 223 RISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD--GTG 279
+ FGC Q G L A+ +G+ GL S+P+ LA QG+I N F C +D G+
Sbjct: 238 -LVFGCAHDQQGKLLGSPASSDGILGLSNGAMSLPTQLAKQGIISNVFGHCIATDPSGSA 296
Query: 280 RISFGDKGSPGQGETPFSLRQ-THPTYNITITQVSVGGNAVNFEFSA------IFDSGTS 332
+ GD P G T +R Y+ + +V+ G +N A IFDSG+S
Sbjct: 297 YMFLGDDYVPRWGMTWVPVRNGPEDVYSTVVQKVNYGCQELNVREQAGKLTQVIFDSGSS 356
Query: 333 FTYLNDPAYTQISETFNSLAKE-KRETSTSDLPF 365
+TY YT + + +++ R+ S LPF
Sbjct: 357 YTYFPHEIYTSLITSLEAVSPGFVRDESDQTLPF 390
>gi|302776054|ref|XP_002971323.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
gi|300161305|gb|EFJ27921.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
Length = 395
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 115/372 (30%), Positives = 163/372 (43%), Gaps = 55/372 (14%)
Query: 73 RGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLP 132
RGR LA +G D FS G L+ G L++T V +G P +IV +DTGSD+ W+
Sbjct: 5 RGRFLA-EGVD-----FSLGGTADPLS--GGLYFTQVGLGNPVKHYIVQVDTGSDVLWVN 56
Query: 133 CD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE-----LQKQCPSAGSNC 186
C C C S I +Y P SST+S V C+ LC + QC +NC
Sbjct: 57 CRPCSGCPR----KSALNIPLTMYDPRESSTTSLVSCSDPLCVRGRRFAEAQCSQTTNNC 112
Query: 187 PYQVRYLSDGTMSTGFLVEDVLHL-ATDEKQSKSVDSRISFGCGRVQTGSF-LDGAAPNG 244
Y Y DG+ S G+ V D + + S++ FGC QTG A +G
Sbjct: 113 EYIFSY-GDGSTSEGYYVRDAMQYNVISSNGLANTTSQVLFGCSIRQTGDLSTSQQAVDG 171
Query: 245 LFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGR----ISFGDKGSPGQGETPFSLRQ 300
+ G G + SVP+ LA Q IP FS C +G R + G PG TP
Sbjct: 172 IIGFGQLELSVPNQLAAQQNIPRVFSHCL--EGEKRGGGILVIGGIAEPGMTYTPLVPDS 229
Query: 301 THPTYNITITQVSVGGNAVNF---EFSA------IFDSGTSFTYLNDPAYTQISETFNSL 351
H YN+ + +SV N + +FS+ I DSGT+ Y AY N
Sbjct: 230 VH--YNVVLRGISVNSNRLPIDAEDFSSTNDTGVIMDSGTTLAYFPSGAY-------NVF 280
Query: 352 AKEKRETSTSDLPFEY------CYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIV-SS 404
+ RE +TS P C+++S ++ +P V L +GG D ++ +
Sbjct: 281 VQAIRE-ATSATPVRVQGMDTQCFLVSGRLSDL-FPNVTLNFEGGAMELQPDNYLMWGGT 338
Query: 405 EPKGLY-LYCLG 415
P G ++C+G
Sbjct: 339 APTGTTDVWCIG 350
>gi|359478045|ref|XP_002267046.2| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
Length = 502
Score = 109 bits (273), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 96/314 (30%), Positives = 144/314 (45%), Gaps = 33/314 (10%)
Query: 70 FRLRGRGLAA-QGNDKT-PLTFSAGND-----TYRLNSLGFLHYTNVSVGQPALSFIVAL 122
F + R LAA + +D + L AG D T R ++G L+Y + +G PA + V +
Sbjct: 57 FAGQKRSLAALKAHDNSRQLRILAGVDLPLGGTGRPEAVG-LYYAKIGIGTPARDYYVQV 115
Query: 123 DTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPS 181
DTGSD+ W+ C C C SS G ++ +Y S T V C+ C P
Sbjct: 116 DTGSDIMWVNCIQCNECPK--KSSLG--MELTLYDIKESLTGKLVSCDQDFCYAINGGPP 171
Query: 182 ----AGSNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQSKSVDSRISFGCGRVQTGS 235
A +C Y Y +DG+ S G+ V D++ + + ++ S + + FGC Q+G
Sbjct: 172 SYCIANMSCSYTEIY-ADGSSSFGYFVRDIVQYDQVSGDLETTSANGSVIFGCSATQSGD 230
Query: 236 FLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-GSDGTGRISFGDKGSPGQGET 294
A +G+ G G TS+ S LA+ G + F+ C G +G G + G P T
Sbjct: 231 LSSEEALDGILGFGKSNTSMISQLASSGKVRKMFAHCLDGLNGGGIFAIGHIVQPKVNTT 290
Query: 295 PFSLRQTHPTYNITITQVSVGGNAVNF---------EFSAIFDSGTSFTYLNDPAYTQ-I 344
P QTH YN+ + V VGG +N + I DSGT+ YL + Y Q +
Sbjct: 291 PLVPNQTH--YNVNMKAVEVGGYFLNLPTDVFDVGDKKGTIIDSGTTLAYLPEVVYDQLL 348
Query: 345 SETFNSLAKEKRET 358
S+ F+ + K T
Sbjct: 349 SKIFSWQSDLKVHT 362
>gi|363808270|ref|NP_001242239.1| uncharacterized protein LOC100801883 [Glycine max]
gi|255641727|gb|ACU21134.1| unknown [Glycine max]
Length = 475
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 89/305 (29%), Positives = 138/305 (45%), Gaps = 31/305 (10%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L++T + +G P + V +DTGSD+ W+ C +C C S ID +Y P S T
Sbjct: 69 LYFTKLGLGSPPRDYYVQVDTGSDILWVNCVECSRCPR----KSDLGIDLTLYDPKGSET 124
Query: 163 SSKVPCNSTLCELQKQCPSAGSN----CPYQVRYLSDGTMSTGFLVEDVL--HLATDEKQ 216
S V C+ C P G CPY + Y DG+ +TG+ V+D L + +
Sbjct: 125 SDVVSCDQDFCSATFDGPIPGCKSEIPCPYSITY-GDGSATTGYYVQDYLTYNRINGNLR 183
Query: 217 SKSVDSRISFGCGRVQTGSFLDGA--APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG 274
+ +S I FGCG VQ+G+ + A +G+ G G +SV S LA G + FS C
Sbjct: 184 TSPQNSSIIFGCGAVQSGTLGSSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCLD 243
Query: 275 S-DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS--------- 324
+ G G + G+ P TP R H YN+ + + V + +
Sbjct: 244 NVRGGGIFAIGEVVEPKVSTTPLVPRMAH--YNVVLKSIEVDTDILQLPSDIFDSVNGKG 301
Query: 325 AIFDSGTSFTYLNDPAYTQISETFNSLAKEK-RETSTSDLPFEYCYVLSPNQTNFEYPVV 383
+ DSGT+ YL D Y ++ + LA++ + + F C++ + N + +PVV
Sbjct: 302 TVIDSGTTLAYLPDIVYDELIQKV--LARQPGLKLYLVEQQFR-CFLYTGN-VDRGFPVV 357
Query: 384 NLTMK 388
L K
Sbjct: 358 KLHFK 362
>gi|15217887|ref|NP_176703.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
gi|118572746|sp|Q9S9K4.2|ASPL2_ARATH RecName: Full=Aspartic proteinase-like protein 2; Flags: Precursor
gi|332196226|gb|AEE34347.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
Length = 475
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 98/334 (29%), Positives = 152/334 (45%), Gaps = 32/334 (9%)
Query: 97 RLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIY 155
R++S+G L++T + +G P + V +DTGSD+ W+ C C C N + +++
Sbjct: 67 RVDSVG-LYFTKIKLGSPPKEYHVQVDTGSDILWINCKPCPKCPTKTNLN----FRLSLF 121
Query: 156 SPNTSSTSSKVPCNSTLCELQKQCPSA--GSNCPYQVRYLSDGTMSTGFLVEDVLHL--A 211
N SSTS KV C+ C Q S C Y + Y +D + S G + D+L L
Sbjct: 122 DMNASSTSKKVGCDDDFCSFISQSDSCQPALGCSYHIVY-ADESTSDGKFIRDMLTLEQV 180
Query: 212 TDEKQSKSVDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFS 270
T + ++ + + FGCG Q+G +G +A +G+ G G TSV S LA G FS
Sbjct: 181 TGDLKTGPLGQEVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFS 240
Query: 271 MCFGS-DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS----- 324
C + G G + G SP TP Q H YN+ + + V G +++ S
Sbjct: 241 HCLDNVKGGGIFAVGVVDSPKVKTTPMVPNQMH--YNVMLMGMDVDGTSLDLPRSIVRNG 298
Query: 325 -AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
I DSGT+ Y Y + ET LA++ + + F+ C+ S N + +P V
Sbjct: 299 GTIVDSGTTLAYFPKVLYDSLIETI--LARQPVKLHIVEETFQ-CFSFSTN-VDEAFPPV 354
Query: 384 NLTMKGGGPFFV--NDPIVIVSSEPKGLYLYCLG 415
+ + V +D + + E LYC G
Sbjct: 355 SFEFEDSVKLTVYPHDYLFTLEEE-----LYCFG 383
>gi|297849132|ref|XP_002892447.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297338289|gb|EFH68706.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 493
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 102/343 (29%), Positives = 156/343 (45%), Gaps = 36/343 (10%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
L+YT V +G P F V +DTGSD+ W+ C SC +G +S I + + P SS++
Sbjct: 83 LYYTKVKLGTPPREFNVQIDTGSDVLWV--SCTSC-NGCPKTSELQIQLSFFDPGVSSSA 139
Query: 164 SKVPCNSTLCELQKQCPSAGSN---CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
S V C+ C Q S S C Y +Y DG+ ++GF + D + T + ++
Sbjct: 140 SLVSCSDRRCYSNFQTESGCSPNNLCSYSFKY-GDGSGTSGFYISDFMSFDTVITSTLAI 198
Query: 221 DSR--ISFGCGRVQTGSFLD-GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD- 276
+S FGC +QTG A +G+FGLG SV S LA QGL P FS C D
Sbjct: 199 NSSAPFVFGCSNLQTGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCLKGDK 258
Query: 277 -GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA---------I 326
G G + G P TP L + P YN+ + ++V G + + S I
Sbjct: 259 SGGGIMVLGQIKRPDTVYTP--LVPSQPHYNVNLQSIAVNGQILPIDPSVFTIATGDGTI 316
Query: 327 FDSGTSFTYLNDPAYTQ-ISETFNSLAKEKRETSTSDLPFEY----CYVLSPNQTNFEYP 381
D+GT+ YL D AY+ I N++++ R P Y C+ ++ + +P
Sbjct: 317 IDTGTTLAYLPDEAYSPFIQAIANAVSQYGR-------PITYESYQCFEITAGDVDV-FP 368
Query: 382 VVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNI 424
V+L+ GG + + G ++C+G + + I
Sbjct: 369 EVSLSFAGGASMVLRPHAYLQIFSSSGSSIWCIGFQRMSHRRI 411
>gi|297840891|ref|XP_002888327.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297334168|gb|EFH64586.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 473
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 113/407 (27%), Positives = 170/407 (41%), Gaps = 59/407 (14%)
Query: 25 FGFGTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAH-RDRYFRLRGRGLAAQGND 83
F G F F H+++ K L H + R R LA+
Sbjct: 20 FASGNFVFKVQHKFAGKEK------------------KLEHFKSHDTRRHSRMLAS---- 57
Query: 84 KTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGL 142
+ G D+ R++S+G L++T + +G P + V +DTGSD+ W+ C C C
Sbjct: 58 ---IDLPLGGDS-RVDSVG-LYFTKIKLGSPPKEYHVQVDTGSDILWVNCKPCPECPSKT 112
Query: 143 NSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSA--GSNCPYQVRYLSDGTMST 200
N + +++ N SSTS KV C+ C Q S C Y + Y +D + S
Sbjct: 113 NLN----FHLSLFDVNASSTSKKVGCDDDFCSFISQSDSCQPAVGCSYHIVY-ADESTSE 167
Query: 201 GFLVEDVLHL--ATDEKQSKSVDSRISFGCGRVQTGSF-LDGAAPNGLFGLGMDKTSVPS 257
G + D L L T + Q+ + + FGCG Q+G +A +G+ G G TSV S
Sbjct: 168 GNFIRDKLTLEQVTGDLQTGPLGQEVVFGCGSDQSGQLGKSDSAVDGVMGFGQSNTSVLS 227
Query: 258 ILANQGLIPNSFSMCFGS-DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGG 316
LA G FS C + G G + G SP TP Q H YN+ + + V G
Sbjct: 228 QLAATGDAKRVFSHCLDNVKGGGIFAVGVVDSPKVKTTPMVPNQMH--YNVMLMGMDVDG 285
Query: 317 NAVNFEFS------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYV 370
A++ S I DSGT+ Y Y + ET LA++ + + F+ C+
Sbjct: 286 TALDLPPSIMRNGGTIVDSGTTLAYFPKVLYDSLIETI--LARQPVKLHIVEDTFQ-CFS 342
Query: 371 LSPNQTNFEYPVVNLTMKGGGPFFV--NDPIVIVSSEPKGLYLYCLG 415
S N + +P V+ + V +D + + E LYC G
Sbjct: 343 FSEN-VDVAFPPVSFEFEDSVKLTVYPHDYLFTLEKE-----LYCFG 383
>gi|125554848|gb|EAZ00454.1| hypothetical protein OsI_22475 [Oryza sativa Indica Group]
Length = 538
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 85/277 (30%), Positives = 131/277 (47%), Gaps = 31/277 (11%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
+YT++ +G P + + +DTGSDL W+ CD C +C G + +Y P +
Sbjct: 159 YYTSMYIGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHP---------LYKPEKPNV 209
Query: 163 SSKVPCNSTLC-ELQ--KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
VP + C ELQ + C Y++ Y +D + S G L D + L T + + ++
Sbjct: 210 ---VPPRDSYCQELQGNQNYGDTSKQCDYEITY-ADRSSSMGILARDNMQLITADGEREN 265
Query: 220 VDSRISFGCGRVQTGSFLDGAA-PNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT 278
+D FGCG Q G+ L A +G+ GL S+P+ LA+QG+I N F C +D +
Sbjct: 266 LD--FVFGCGYDQQGNLLSSPANTDGILGLSNAAISLPTQLASQGIISNVFGHCIAADPS 323
Query: 279 --GRISFGDKGSPGQGETPFSLRQ-THPTYNITITQVSVGGNAVNFEFSA------IFDS 329
G + GD P G T +R Y+ + +V+ G +N A IFDS
Sbjct: 324 NGGYMFLGDDYVPRWGMTWMPIRNGPENLYSTEVQKVNYGDQQLNVRRKAGKLTQVIFDS 383
Query: 330 GTSFTYLNDPAYTQ-ISETFNSLAKEKRETSTSDLPF 365
G+S+TYL YT I+ + ++ S LPF
Sbjct: 384 GSSYTYLPHDDYTNLIASLKSLSPSLLQDESDRTLPF 420
>gi|357469591|ref|XP_003605080.1| Aspartic proteinase Asp1 [Medicago truncatula]
gi|355506135|gb|AES87277.1| Aspartic proteinase Asp1 [Medicago truncatula]
Length = 425
Score = 108 bits (271), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 82/283 (28%), Positives = 123/283 (43%), Gaps = 32/283 (11%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD-----CVSCVHGLNSSSGQVIDFNIYSPN 158
L+ ++++G P + + +DTGSDL W+ CD C C + +Y PN
Sbjct: 61 LYTVSINIGNPPKPYELDIDTGSDLTWVQCDGPDAPCKGCTMPKD---------KLYKPN 111
Query: 159 TSSTSSKVPCNSTLCE-------LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLA 211
V C+ +C L + C C Y V+Y +D + G LV D +H+
Sbjct: 112 GKQV---VKCSDPICVATQSTHVLGQICSKQSPPCVYNVQY-ADHASTLGVLVRDYMHIG 167
Query: 212 TDEKQSKSVDSRISFGCGRVQ--TGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSF 269
+ +K D ++FGCG Q +G + P G+ GLG KTS+ S L + G I N
Sbjct: 168 SPSSSTK--DPLVAFGCGYEQKFSGPTPPHSKPAGILGLGNGKTSILSQLTSIGFIHNVL 225
Query: 270 SMCFGSDGTGRISFGDKGSPGQG--ETPFSLRQTHPTYNITITQVSVGGNAVNFE-FSAI 326
C ++G G + GDK P G TP YN + G + I
Sbjct: 226 GHCLSAEGGGYLFLGDKFVPSSGIVWTPIIQSSLEKHYNTGPVDLFFNGKPTPAKGLQII 285
Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
FDSG+S+TY + P YT ++ N+ K K + D C+
Sbjct: 286 FDSGSSYTYFSSPVYTIVANMVNNDLKGKPLSRVKDPSLPICW 328
>gi|115467508|ref|NP_001057353.1| Os06g0268700 [Oryza sativa Japonica Group]
gi|53791766|dbj|BAD53531.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
Group]
gi|53793187|dbj|BAD54393.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
Group]
gi|113595393|dbj|BAF19267.1| Os06g0268700 [Oryza sativa Japonica Group]
gi|125596798|gb|EAZ36578.1| hypothetical protein OsJ_20919 [Oryza sativa Japonica Group]
gi|215767941|dbj|BAH00170.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 538
Score = 108 bits (271), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 85/277 (30%), Positives = 131/277 (47%), Gaps = 31/277 (11%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
+YT++ +G P + + +DTGSDL W+ CD C +C G + +Y P +
Sbjct: 159 YYTSMYIGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHP---------LYKPEKPNV 209
Query: 163 SSKVPCNSTLC-ELQ--KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
VP + C ELQ + C Y++ Y +D + S G L D + L T + + ++
Sbjct: 210 ---VPPRDSYCQELQGNQNYGDTSKQCDYEITY-ADRSSSMGILARDNMQLITADGEREN 265
Query: 220 VDSRISFGCGRVQTGSFLDGAA-PNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT 278
+D FGCG Q G+ L A +G+ GL S+P+ LA+QG+I N F C +D +
Sbjct: 266 LD--FVFGCGYDQQGNLLSSPANTDGILGLSNAAISLPTQLASQGIISNVFGHCIAADPS 323
Query: 279 --GRISFGDKGSPGQGETPFSLRQ-THPTYNITITQVSVGGNAVNFEFSA------IFDS 329
G + GD P G T +R Y+ + +V+ G +N A IFDS
Sbjct: 324 NGGYMFLGDDYVPRWGMTWMPIRNGPENLYSTEVQKVNYGDQQLNVRRKAGKLTQVIFDS 383
Query: 330 GTSFTYLNDPAYTQ-ISETFNSLAKEKRETSTSDLPF 365
G+S+TYL YT I+ + ++ S LPF
Sbjct: 384 GSSYTYLPHDDYTNLIASLKSLSPSLLQDESDRTLPF 420
>gi|356537015|ref|XP_003537027.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 476
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 95/339 (28%), Positives = 160/339 (47%), Gaps = 35/339 (10%)
Query: 112 GQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNS 170
G F V +DTGSD+ W+ C+ C +C SS I+ N + SST++ +PC+
Sbjct: 75 GXXXXXFNVQIDTGSDILWVNCNTCSNCPQ----SSQLGIELNFFDTVGSSTAALIPCSD 130
Query: 171 TLCE-----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR-- 223
+C +C + C Y +Y DG+ ++G+ V D ++ Q +V+S
Sbjct: 131 LICTSGVQGAAAECSPRVNQCSYTFQY-GDGSGTSGYYVSDAMYFNLIMGQPPAVNSTAT 189
Query: 224 ISFGCGRVQTGSFLD-GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT--GR 280
I FGC Q+G A +G+FG G SV S L++QG+ P FS C DG G
Sbjct: 190 IVFGCSISQSGDLTKTDKAVDGIFGFGPGPLSVVSQLSSQGITPKVFSHCLKGDGNGGGI 249
Query: 281 ISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGG-----NAVNFEFS-----AIFDSG 330
+ G+ P +P L + P YN+ + ++V G N F S I D G
Sbjct: 250 LVLGEILEPSIVYSP--LVPSQPHYNLNLQSIAVNGQPLPINPAVFSISNNRGGTIVDCG 307
Query: 331 TSFTYLNDPAYTQISETFN-SLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKG 389
T+ YL AY + N ++++ R+T++ CY++S + + +P+V+L +G
Sbjct: 308 TTLAYLIQEAYDPLVTAINTAVSQSARQTNSKG---NQCYLVSTSIGDI-FPLVSLNFEG 363
Query: 390 GGPFFVN-DPIVIVSSEPKGLYLYCLGVVK-SDNVNIIG 426
G + + ++ + G ++C+G K + +I+G
Sbjct: 364 GASMVLKPEQYLMHNGYLDGAEMWCVGFQKLQEGASILG 402
>gi|159464048|ref|XP_001690254.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
gi|158284242|gb|EDP09992.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
Length = 485
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 88/344 (25%), Positives = 154/344 (44%), Gaps = 41/344 (11%)
Query: 101 LGFLH------YTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFN 153
LG+ H YT + +G P +F V +DTGS + ++PC DC C G +++
Sbjct: 3 LGYRHTRHSYFYTTLKLGTPERTFSVIIDTGSTITYIPCKDCSHC--GKHTA-------E 53
Query: 154 IYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
+ P+ S+T+ K+ C LC + ++ Y R ++ + S G+++ED
Sbjct: 54 WFDPDKSTTAKKLACGDPLCNCGTPSCTCNNDRCYYSRTYAERSSSEGWMIEDTFGFPDS 113
Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
+ R+ FGC +TG A +G+ G+G + + S L + +I + FS+CF
Sbjct: 114 DSPV-----RLVFGCENGETGEIYRQMA-DGIMGMGNNHNAFQSQLVQRKVIEDVFSLCF 167
Query: 274 GSDGTGRISFGDKGSPGQGETPFSLRQTH---PTYNITITQVSVGGNAVNFE-------F 323
G G + GD P T ++ TH YN+ + ++V G + F+ +
Sbjct: 168 GYPKDGILLLGDVTLPEGANTVYTPLLTHLHLHYYNVKMDGITVNGQTLAFDASVFDRGY 227
Query: 324 SAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEY---CYVLSPNQ---TN 377
+ DSGT+FTYL A+ +++ ++K ST +Y C+ +P+Q +
Sbjct: 228 GTVLDSGTTFTYLPTDAFKAMAKAVGDYVEKKGLQSTPGADPQYNDICWKGAPDQFKDLD 287
Query: 378 FEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDN 421
+P GG + + S+P YCLG+ + N
Sbjct: 288 KYFPPAEFVFGGGAKLTLPPLRYLFLSKPAE---YCLGIFDNGN 328
>gi|4490316|emb|CAB38807.1| nucellin-like protein [Arabidopsis thaliana]
gi|7270297|emb|CAB80066.1| nucellin-like protein [Arabidopsis thaliana]
Length = 420
Score = 107 bits (268), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 89/282 (31%), Positives = 132/282 (46%), Gaps = 35/282 (12%)
Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPN 158
LG+ + T +++GQP + + LDTGSDL WL CD CV C+ + +Y P
Sbjct: 35 LGYYNVT-INIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPHP---------LYQP- 83
Query: 159 TSSTSSKVPCNSTLCEL-----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
+S +PCN LC+ ++C + C Y+V Y +DG S G LV DV + +
Sbjct: 84 ---SSDLIPCNDPLCKALHLNSNQRCETP-EQCDYEVEY-ADGGSSLGVLVRDVF--SMN 136
Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
Q + R++ GCG Q +G+ GLG K S+ S L +QG + N C
Sbjct: 137 YTQGLRLTPRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCL 196
Query: 274 GSDGTGRISFGDK--GSPGQGETPFSLRQTHPTYNITI-TQVSVGGNAVNFE-FSAIFDS 329
S G G + FGD S TP S R+ Y+ + ++ GG + +FDS
Sbjct: 197 SSLGGGILFFGDDLYDSSRVSWTPMS-REYSKHYSPAMGGELLFGGRTTGLKNLLTVFDS 255
Query: 330 GTSFTYLNDPAYTQIS-----ETFNSLAKEKRETSTSDLPFE 366
G+S+TY N AY ++ E KE R+ T L ++
Sbjct: 256 GSSYTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQ 297
>gi|79495937|ref|NP_567922.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332660833|gb|AEE86233.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 401
Score = 107 bits (268), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 89/282 (31%), Positives = 132/282 (46%), Gaps = 35/282 (12%)
Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPN 158
LG+ + T +++GQP + + LDTGSDL WL CD CV C+ + +Y P
Sbjct: 54 LGYYNVT-INIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPHP---------LYQP- 102
Query: 159 TSSTSSKVPCNSTLCEL-----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
+S +PCN LC+ ++C + C Y+V Y +DG S G LV DV + +
Sbjct: 103 ---SSDLIPCNDPLCKALHLNSNQRCETP-EQCDYEVEY-ADGGSSLGVLVRDVFSM--N 155
Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
Q + R++ GCG Q +G+ GLG K S+ S L +QG + N C
Sbjct: 156 YTQGLRLTPRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCL 215
Query: 274 GSDGTGRISFGDK--GSPGQGETPFSLRQTHPTYNITI-TQVSVGGNAVNFE-FSAIFDS 329
S G G + FGD S TP S R+ Y+ + ++ GG + +FDS
Sbjct: 216 SSLGGGILFFGDDLYDSSRVSWTPMS-REYSKHYSPAMGGELLFGGRTTGLKNLLTVFDS 274
Query: 330 GTSFTYLNDPAYTQIS-----ETFNSLAKEKRETSTSDLPFE 366
G+S+TY N AY ++ E KE R+ T L ++
Sbjct: 275 GSSYTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQ 316
>gi|334187133|ref|NP_001190905.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|21592493|gb|AAM64443.1| nucellin-like protein [Arabidopsis thaliana]
gi|332660834|gb|AEE86234.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 425
Score = 107 bits (268), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 89/282 (31%), Positives = 132/282 (46%), Gaps = 35/282 (12%)
Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPN 158
LG+ + T +++GQP + + LDTGSDL WL CD CV C+ + +Y P
Sbjct: 57 LGYYNVT-INIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPHP---------LYQP- 105
Query: 159 TSSTSSKVPCNSTLCEL-----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
+S +PCN LC+ ++C + C Y+V Y +DG S G LV DV + +
Sbjct: 106 ---SSDLIPCNDPLCKALHLNSNQRCETP-EQCDYEVEY-ADGGSSLGVLVRDVFSM--N 158
Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
Q + R++ GCG Q +G+ GLG K S+ S L +QG + N C
Sbjct: 159 YTQGLRLTPRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCL 218
Query: 274 GSDGTGRISFGDK--GSPGQGETPFSLRQTHPTYNITI-TQVSVGGNAVNFE-FSAIFDS 329
S G G + FGD S TP S R+ Y+ + ++ GG + +FDS
Sbjct: 219 SSLGGGILFFGDDLYDSSRVSWTPMS-REYSKHYSPAMGGELLFGGRTTGLKNLLTVFDS 277
Query: 330 GTSFTYLNDPAYTQIS-----ETFNSLAKEKRETSTSDLPFE 366
G+S+TY N AY ++ E KE R+ T L ++
Sbjct: 278 GSSYTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQ 319
>gi|26452545|dbj|BAC43357.1| putative nucellin [Arabidopsis thaliana]
Length = 413
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 89/282 (31%), Positives = 132/282 (46%), Gaps = 35/282 (12%)
Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPN 158
LG+ + T +++GQP + + LDTGSDL WL CD CV C+ + +Y P
Sbjct: 45 LGYYNVT-INIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPHP---------LYQP- 93
Query: 159 TSSTSSKVPCNSTLCEL-----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
+S +PCN LC+ ++C + C Y+V Y +DG S G LV DV + +
Sbjct: 94 ---SSDLIPCNDPLCKALHLNSNQRCETP-EQCDYEVEY-ADGGSSLGVLVRDVFSM--N 146
Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
Q + R++ GCG Q +G+ GLG K S+ S L +QG + N C
Sbjct: 147 YTQGLRLTPRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCL 206
Query: 274 GSDGTGRISFGDK--GSPGQGETPFSLRQTHPTYNITI-TQVSVGGNAVNFE-FSAIFDS 329
S G G + FGD S TP S R+ Y+ + ++ GG + +FDS
Sbjct: 207 SSLGGGILFFGDDLYDSSRVSWTPMS-REYSKHYSPAMGGELLFGGRTTGLKNLLTVFDS 265
Query: 330 GTSFTYLNDPAYTQIS-----ETFNSLAKEKRETSTSDLPFE 366
G+S+TY N AY ++ E KE R+ T L ++
Sbjct: 266 GSSYTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQ 307
>gi|18402471|ref|NP_564539.1| aspartyl protease [Arabidopsis thaliana]
gi|7770346|gb|AAF69716.1|AC016041_21 F27J15.15 [Arabidopsis thaliana]
gi|13430540|gb|AAK25892.1|AF360182_1 unknown protein [Arabidopsis thaliana]
gi|14532748|gb|AAK64075.1| unknown protein [Arabidopsis thaliana]
gi|332194267|gb|AEE32388.1| aspartyl protease [Arabidopsis thaliana]
Length = 583
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 95/353 (26%), Positives = 156/353 (44%), Gaps = 52/353 (14%)
Query: 104 LHYTNVSVGQP--ALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSP-- 157
L+YT + VG+P + + +DTGS+L W+ CD C SC G N +Y P
Sbjct: 202 LYYTRILVGKPEDGQYYHLDIDTGSELTWIQCDAPCTSCAKGAN---------QLYKPRK 252
Query: 158 -NTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
N +S +L + C + C Y++ Y +D + S G L +D HL
Sbjct: 253 DNLVRSSEAFCVEVQRNQLTEHCENC-HQCDYEIEY-ADHSYSMGVLTKDKFHLKL--HN 308
Query: 217 SKSVDSRISFGCGRVQTGSFLDGA-APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
+S I FGCG Q G L+ +G+ GL K S+PS LA++G+I N C S
Sbjct: 309 GSLAESDIVFGCGYDQQGLLLNTLLKTDGILGLSRAKISLPSQLASRGIISNVVGHCLAS 368
Query: 276 D--GTGRISFGDKGSPGQGET--PFSLRQTHPTYNITITQVSVGGNAVNFEFS------A 325
D G G I G P G T P Y + +T++S G ++ +
Sbjct: 369 DLNGEGYIFMGSDLVPSHGMTWVPMLHDSRLDAYQMQVTKMSYGQGMLSLDGENGRVGKV 428
Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVN- 384
+FD+G+S+TY + AY+Q+ + ++ + SD C+ +TNF + ++
Sbjct: 429 LFDTGSSYTYFPNQAYSQLVTSLQEVSGLELTRDDSDETLPICW---RAKTNFPFSSLSD 485
Query: 385 -------LTMKGGGPFFVNDPIVIVSSEPKGLYL-------YCLGVVKSDNVN 423
+T++ G + + +++ E YL CLG++ +V+
Sbjct: 486 VKKFFRPITLQIGSKWLIISRKLLIQPED---YLIISNKGNVCLGILDGSSVH 535
>gi|359476756|ref|XP_002277082.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 2 [Vitis
vinifera]
Length = 560
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 79/270 (29%), Positives = 123/270 (45%), Gaps = 24/270 (8%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L++ + +G P+ + V +DTGSD+ W+ C C C + S +D +Y S+T
Sbjct: 154 LYFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRC----PTKSDLGVDLTLYDMKASTT 209
Query: 163 SSKVPCNSTLCEL-QKQCP--SAGSNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQS 217
S V C+ C L P G C Y V Y DG+ +TG+ V+D + + Q+
Sbjct: 210 SDAVGCDDNFCSLYDGPLPGCKPGLQCLYSVLY-GDGSSTTGYFVQDFVQYNRISGNFQT 268
Query: 218 KSVDSRISFGCGRVQTGSFLDGA-APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS- 275
+ + FGCG Q+G + A +G+ G G +S+ S LA+ G + FS C +
Sbjct: 269 TPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCLDNV 328
Query: 276 DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA---------I 326
DG G + G+ P TP Q H YN+ + ++ VGG+ ++ A I
Sbjct: 329 DGGGIFAIGEVVEPKVNITPLVQNQAH--YNVVMKEIEVGGDPLDVPSDAFESGDRKGTI 386
Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKR 356
DSGT+ Y Y + E S + R
Sbjct: 387 IDSGTTLAYFPQEVYVPLIEKILSQQPDLR 416
>gi|359476754|ref|XP_002277058.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 1 [Vitis
vinifera]
Length = 561
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 79/270 (29%), Positives = 123/270 (45%), Gaps = 24/270 (8%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L++ + +G P+ + V +DTGSD+ W+ C C C + S +D +Y S+T
Sbjct: 154 LYFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRC----PTKSDLGVDLTLYDMKASTT 209
Query: 163 SSKVPCNSTLCEL-QKQCP--SAGSNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQS 217
S V C+ C L P G C Y V Y DG+ +TG+ V+D + + Q+
Sbjct: 210 SDAVGCDDNFCSLYDGPLPGCKPGLQCLYSVLY-GDGSSTTGYFVQDFVQYNRISGNFQT 268
Query: 218 KSVDSRISFGCGRVQTGSFLDGA-APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS- 275
+ + FGCG Q+G + A +G+ G G +S+ S LA+ G + FS C +
Sbjct: 269 TPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCLDNV 328
Query: 276 DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA---------I 326
DG G + G+ P TP Q H YN+ + ++ VGG+ ++ A I
Sbjct: 329 DGGGIFAIGEVVEPKVNITPLVQNQAH--YNVVMKEIEVGGDPLDVPSDAFESGDRKGTI 386
Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKR 356
DSGT+ Y Y + E S + R
Sbjct: 387 IDSGTTLAYFPQEVYVPLIEKILSQQPDLR 416
>gi|297735249|emb|CBI17611.3| unnamed protein product [Vitis vinifera]
Length = 480
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 79/270 (29%), Positives = 123/270 (45%), Gaps = 24/270 (8%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L++ + +G P+ + V +DTGSD+ W+ C C C + S +D +Y S+T
Sbjct: 73 LYFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRC----PTKSDLGVDLTLYDMKASTT 128
Query: 163 SSKVPCNSTLCEL-QKQCP--SAGSNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQS 217
S V C+ C L P G C Y V Y DG+ +TG+ V+D + + Q+
Sbjct: 129 SDAVGCDDNFCSLYDGPLPGCKPGLQCLYSVLY-GDGSSTTGYFVQDFVQYNRISGNFQT 187
Query: 218 KSVDSRISFGCGRVQTGSFLDGA-APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS- 275
+ + FGCG Q+G + A +G+ G G +S+ S LA+ G + FS C +
Sbjct: 188 TPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCLDNV 247
Query: 276 DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA---------I 326
DG G + G+ P TP Q H YN+ + ++ VGG+ ++ A I
Sbjct: 248 DGGGIFAIGEVVEPKVNITPLVQNQAH--YNVVMKEIEVGGDPLDVPSDAFESGDRKGTI 305
Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKR 356
DSGT+ Y Y + E S + R
Sbjct: 306 IDSGTTLAYFPQEVYVPLIEKILSQQPDLR 335
>gi|359482287|ref|XP_002263129.2| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
vinifera]
gi|297740017|emb|CBI30199.3| unnamed protein product [Vitis vinifera]
Length = 502
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 104/348 (29%), Positives = 158/348 (45%), Gaps = 38/348 (10%)
Query: 66 RDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTG 125
R R GR L T +D Y + L++T V +G P F V +DTG
Sbjct: 51 RARDQARHGRLLRGVVGGVVDFTVYGTSDPYLVG----LYFTKVKLGSPPREFNVQIDTG 106
Query: 126 SDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP-----CNSTLCELQKQC 179
SD+ W+ C+ C C +SG I+ + + P++SST+S V C S + +C
Sbjct: 107 SDILWVTCNSCNDCPR----TSGLGIELSFFDPSSSSTTSLVSCSHPICTSLVQTTAAEC 162
Query: 180 PSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS--RISFGCGRVQTGSFL 237
+ C Y Y DG+ +TG+ V D+L+ T S +S I FGC Q+G
Sbjct: 163 SPQSNQCSYSFHY-GDGSGTTGYYVSDMLYFDTVLGDSLIANSSASIVFGCSTYQSGDLT 221
Query: 238 D-GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISFGDKGSPGQGET 294
A +G+FG G SV S L++ G+ P FS C DG G++ G+ P +
Sbjct: 222 KVDKAIDGIFGFGQQDLSVVSQLSSLGITPKVFSHCLKGEGDGGGKLVLGEILEPNIIYS 281
Query: 295 PFSLRQTHPTYNITITQVSVGGNAVNFEFSA---------IFDSGTSFTYLNDPAYTQIS 345
P Q+H YN+ + +SV G + + + I DSGT+ TYL + AY
Sbjct: 282 PLVPSQSH--YNLNLQSISVNGQLLPIDPAVFATSNNQGTIVDSGTTLTYLVETAY---- 335
Query: 346 ETFNSLAKEKRETSTSDLPFE--YCYVLSPNQTNFEYPVVNLTMKGGG 391
+ F S +ST+ + + CY++S + +P V+L GG
Sbjct: 336 DPFVSAITATVSSSTTPVLSKGNQCYLVSTSVDEI-FPPVSLNFAGGA 382
>gi|356531884|ref|XP_003534506.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 482
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 113/463 (24%), Positives = 187/463 (40%), Gaps = 68/463 (14%)
Query: 6 RNSPVCVLLILLSCCAGCCFGFGTFGFDFHHRYSDPVKGILAVD--DLPKKGSFAYYSAL 63
R V +++ L CC F ++ P + + A+ D ++G F L
Sbjct: 4 RERLVRLVVSLFVVVQLCCHANANMVFPVVRKFKGPAENLAAIKAHDAGRRGRFLSVVDL 63
Query: 64 AHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALD 123
A L G G R S G L+YT + +G + V +D
Sbjct: 64 A-------LGGNG--------------------RPTSTG-LYYTKIGLGPN--DYYVQVD 93
Query: 124 TGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSA 182
TGSD W+ C C +C SG ++ +Y PN+S TS VPC+ C P +
Sbjct: 94 TGSDTLWVNCVGCTTC----PKKSGLGMELTLYDPNSSKTSKVVPCDDEFCTSTYDGPIS 149
Query: 183 G----SNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV--DSRISFGCGRVQTGSF 236
G +CPY + Y DG+ ++G ++D L ++V ++ + FGCG Q+G+
Sbjct: 150 GCKKDMSCPYSITY-GDGSTTSGSYIKDDLTFDRVVGDLRTVPDNTSVIFGCGSKQSGTL 208
Query: 237 --LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGTGRISFGDKGSPGQGE 293
+ +G+ G G +SV S LA G + FS C + +G G + G+ P
Sbjct: 209 SSTTDTSLDGIIGFGQANSSVLSQLAAAGKVKRVFSHCLDTVNGGGIFAIGEVVQPKVKT 268
Query: 294 TPFSLRQTHPTYNITITQVSVGGNAVNFEFS---------AIFDSGTSFTYLNDPAYTQI 344
TP R H YN+ + + V G+ + I DSGT+ YL Y Q+
Sbjct: 269 TPLVPRMAH--YNVVLKDIEVAGDPIQLPTDIFDSTSGRGTIIDSGTTLAYLPVSIYDQL 326
Query: 345 SETFNSLAKEK-RETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPF--FVNDPIVI 401
E +LA+ E + F + + +P V T + G + +D +
Sbjct: 327 LE--KTLAQRSGMELYLVEDQFTCFHYSDEKSLDDAFPTVKFTFEEGLTLTAYPHDYLFP 384
Query: 402 VSSEPKGLYLYCLGVVKSDNVNIIGREYPIANNISLFHNCYSY 444
+ ++C+G KS G++ + ++ L + + Y
Sbjct: 385 FKED-----MWCIGWQKSTAQTKDGKDLILLGDLVLTNKLFIY 422
>gi|357142295|ref|XP_003572524.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 494
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 83/285 (29%), Positives = 129/285 (45%), Gaps = 28/285 (9%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L++T + +G P+ + V +DTGSD+ W+ C C SC SG ID +Y P S++
Sbjct: 88 LYFTQIGIGTPSKGYYVQVDTGSDILWVNCISCDSCPR----KSGLGIDLTLYDPTASAS 143
Query: 163 SSKVPCNSTLCELQKQC---PSAGSN--CPYQVRYLSDGTMSTGFLVEDVLHL--ATDEK 215
S V C C PS +N C Y + Y DG+ +TGF V D L + +
Sbjct: 144 SKTVTCGQEFCATATNGGVPPSCAANSPCQYSITY-GDGSSTTGFFVADFLQYDQVSGDG 202
Query: 216 QSKSVDSRISFGCGRVQTGSF-LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG 274
Q+ ++ ++FGCG G+ A +G+ G G +S+ S L + G + FS C
Sbjct: 203 QTNLANASVTFGCGAKIGGALGSSNVALDGILGFGQANSSMLSQLTSAGKVTKIFSHCLD 262
Query: 275 S-DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF----------EF 323
+ +G G + G+ P TP L P YN+ + + VGG+ +
Sbjct: 263 TVNGGGIFAIGNVVQPKVKTTP--LVPGMPHYNVVLKTIDVGGSTLQLPTNIFDIGGGSR 320
Query: 324 SAIFDSGTSFTYLNDPAYTQI-SETFNSLAKEKRETSTSDLPFEY 367
I DSGT+ YL + Y + S F++ + L F+Y
Sbjct: 321 GTIIDSGTTLAYLPEVVYKAVLSAVFSNHPDVTLKNVQDFLCFQY 365
>gi|18390865|ref|NP_563808.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|11993877|gb|AAG42922.1|AF329505_1 unknown protein [Arabidopsis thaliana]
gi|20260142|gb|AAM12969.1| unknown protein [Arabidopsis thaliana]
gi|22136092|gb|AAM91124.1| unknown protein [Arabidopsis thaliana]
gi|332190140|gb|AEE28261.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 492
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 100/343 (29%), Positives = 156/343 (45%), Gaps = 36/343 (10%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
L+YT V +G P F V +DTGSD+ W+ C SC +G +S I + + P SS++
Sbjct: 83 LYYTKVKLGTPPREFNVQIDTGSDVLWV--SCTSC-NGCPKTSELQIQLSFFDPGVSSSA 139
Query: 164 SKVPCNSTLCELQKQCPSAGSN---CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
S V C+ C Q S S C Y +Y DG+ ++G+ + D + T + ++
Sbjct: 140 SLVSCSDRRCYSNFQTESGCSPNNLCSYSFKY-GDGSGTSGYYISDFMSFDTVITSTLAI 198
Query: 221 DSR--ISFGCGRVQTGSFLD-GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD- 276
+S FGC +Q+G A +G+FGLG SV S LA QGL P FS C D
Sbjct: 199 NSSAPFVFGCSNLQSGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCLKGDK 258
Query: 277 -GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA---------I 326
G G + G P TP L + P YN+ + ++V G + + S I
Sbjct: 259 SGGGIMVLGQIKRPDTVYTP--LVPSQPHYNVNLQSIAVNGQILPIDPSVFTIATGDGTI 316
Query: 327 FDSGTSFTYLNDPAYTQ-ISETFNSLAKEKRETSTSDLPFEY----CYVLSPNQTNFEYP 381
D+GT+ YL D AY+ I N++++ R P Y C+ ++ + +P
Sbjct: 317 IDTGTTLAYLPDEAYSPFIQAVANAVSQYGR-------PITYESYQCFEITAGDVDV-FP 368
Query: 382 VVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNI 424
V+L+ GG + + G ++C+G + + I
Sbjct: 369 QVSLSFAGGASMVLGPRAYLQIFSSSGSSIWCIGFQRMSHRRI 411
>gi|218190722|gb|EEC73149.1| hypothetical protein OsI_07179 [Oryza sativa Indica Group]
Length = 494
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 109/383 (28%), Positives = 162/383 (42%), Gaps = 51/383 (13%)
Query: 55 GSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQP 114
G + SAL D R GR LAA PL S L + L++T + +G P
Sbjct: 51 GGEGHLSALREHDG--RRHGRLLAAI---DLPLGGSG------LATETGLYFTRIGIGTP 99
Query: 115 ALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC- 173
A + V +DTGSD+ W+ +CVSC G S I+ +Y P S + V C+ C
Sbjct: 100 AKRYYVQVDTGSDILWV--NCVSC-DGCPRKSNLGIELTMYDPRGSQSGELVTCDQQFCV 156
Query: 174 ----ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQSKSVDSRISFG 227
+ C S S C Y + Y DG+ + GF V D L + + Q+ ++ +SFG
Sbjct: 157 ANYGGVLPSCTST-SPCEYSISY-GDGSSTAGFFVTDFLQYNQVSGDGQTTPANASVSFG 214
Query: 228 CGRVQTGSF-LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGTGRISFGD 285
CG G A +G+ G G +S+ S LA G + F+ C + +G G + G+
Sbjct: 215 CGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCLDTVNGGGIFAIGN 274
Query: 286 KGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF---------EFSAIFDSGTSFTYL 336
P TP L P YN+ + + VGG A+ I DSGT+ Y+
Sbjct: 275 VVQPKVKTTP--LVSDMPHYNVILKGIDVGGTALGLPTNIFDSGNSKGTIIDSGTTLAYV 332
Query: 337 NDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVN 396
+ Y + F + + ++ S L C+ S + +P V +G
Sbjct: 333 PEGVYKAL---FAMVFDKHQDISVQTLQDFSCFQYS-GSVDDGFPEVTFHFEG------- 381
Query: 397 DPIVIVSSE----PKGLYLYCLG 415
D +IVS G LYC+G
Sbjct: 382 DVSLIVSPHDYLFQNGKNLYCMG 404
>gi|297798582|ref|XP_002867175.1| hypothetical protein ARALYDRAFT_328390 [Arabidopsis lyrata subsp.
lyrata]
gi|297313011|gb|EFH43434.1| hypothetical protein ARALYDRAFT_328390 [Arabidopsis lyrata subsp.
lyrata]
Length = 425
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 88/282 (31%), Positives = 132/282 (46%), Gaps = 35/282 (12%)
Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPN 158
LG+ + T +++GQP + + LDTGSDL WL CD CV C+ + +Y P
Sbjct: 57 LGYYNVT-INIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPHP---------LYQP- 105
Query: 159 TSSTSSKVPCNSTLCEL-----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
+S +PCN LC+ ++C + C Y+V Y +DG S G LV DV + +
Sbjct: 106 ---SSDLIPCNDPLCKALHLNSNQRCETP-EQCDYEVEY-ADGGSSLGVLVRDVFSM--N 158
Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
+ + R++ GCG Q +G+ GLG K S+ S L +QG + N C
Sbjct: 159 YTKGLRLTPRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCL 218
Query: 274 GSDGTGRISFGDK--GSPGQGETPFSLRQTHPTYNITI-TQVSVGGNAVNFE-FSAIFDS 329
S G G + FGD S TP S R+ Y+ + ++ GG + +FDS
Sbjct: 219 SSLGGGILFFGDDLYDSSRVSWTPMS-REYSKHYSPAMGGELLFGGRTTGLKNLLTVFDS 277
Query: 330 GTSFTYLNDPAYTQIS-----ETFNSLAKEKRETSTSDLPFE 366
G+S+TY N AY ++ E KE R+ T L ++
Sbjct: 278 GSSYTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQ 319
>gi|357502759|ref|XP_003621668.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355496683|gb|AES77886.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 481
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 81/268 (30%), Positives = 131/268 (48%), Gaps = 29/268 (10%)
Query: 95 TYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFN 153
T R +S+G L+Y + +G P+ + + +DTG+D+ W+ C C C + S +D
Sbjct: 64 TGRPDSVG-LYYAKIGIGTPSKDYYLQVDTGTDMMWVNCIQCKEC----PTRSNLGMDLT 118
Query: 154 IYSPNTSSTSSKVPCNSTLCE-----LQKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDV 207
+Y+ SS+ VPC+ LC+ L C S ++ CPY Y DG+ + G+ V+DV
Sbjct: 119 LYNIKESSSGKLVPCDQELCKEINGGLLTGCTSKTNDSCPYLEIY-GDGSSTAGYFVKDV 177
Query: 208 LHL--ATDEKQSKSVDSRISFGCGRVQTG--SFLDGAAPNGLFGLGMDKTSVPSILANQG 263
+ + + ++ S + + FGCG Q+G S+ + A +G+ G G S+ S L++ G
Sbjct: 178 VLFDQVSGDLKTASANGSVIFGCGARQSGDLSYSNEEALDGILGFGKANYSMISQLSSSG 237
Query: 264 LIPNSFSMCF-GSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE 322
+ F+ C G +G G + G P TP L P Y++ +T + VG +N
Sbjct: 238 KVKKMFAHCLNGVNGGGIFAIGHVVQPTVNTTP--LLPDQPHYSVNMTAIQVGHTFLNLS 295
Query: 323 FSA---------IFDSGTSFTYLNDPAY 341
A I DSGT+ YL D Y
Sbjct: 296 TDASEQRDSKGTIIDSGTTLAYLPDGIY 323
>gi|449511696|ref|XP_004164029.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 639
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 92/329 (27%), Positives = 145/329 (44%), Gaps = 34/329 (10%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
T + +G P F + +DTGS + ++PC +CV C + + + P SST
Sbjct: 91 TRLWIGSPPQEFALIVDTGSTVTYVPCSNCVQCGNHQDPR---------FQPELSSTYQP 141
Query: 166 VPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
V CN+ C C G C Y+ RY ++ + S+G L EDV+ K+S+ V R
Sbjct: 142 VKCNAD-C----NCDENGVQCTYERRY-AEMSTSSGVLAEDVMSFG---KESELVPQRAV 192
Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISF 283
FGC +++G A +G+ GLG SV L +G++ NSFS+C+G G G +
Sbjct: 193 FGCETMESGDLYTQRA-DGIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGGMDVGGGAMVL 251
Query: 284 GDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-------EFSAIFDSGTSFTYL 336
G SP S P YNI + ++ V G + ++ AI DSGT++ Y
Sbjct: 252 GGISSPPGMVFSHSDPSRSPYYNIELKEIHVAGKPLKLNPRTFDGKYGAILDSGTTYAYF 311
Query: 337 NDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFE----YPVVNLTMKGGGP 392
+ AY + ++ S D F+ + E +P V++ G
Sbjct: 312 PEKAYYAFKDAIMKKISFLKQISGPDPNFKDICFSGAGRDVTELPKVFPEVDMVFANGQK 371
Query: 393 FFVNDPIVIVSSEPKGLYLYCLGVVKSDN 421
++ P + K YCLG+ K+ N
Sbjct: 372 ISLS-PENYLFRHTKVSGAYCLGIFKNGN 399
>gi|449447285|ref|XP_004141399.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 609
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 91/329 (27%), Positives = 144/329 (43%), Gaps = 34/329 (10%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
T + +G P F + +DTGS + ++PC +CV C + + + P SST
Sbjct: 91 TRLWIGSPPQEFALIVDTGSTVTYVPCSNCVQCGNHQDPR---------FQPELSSTYQP 141
Query: 166 VPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
V CN+ C G C Y+ RY ++ + S+G L EDV+ K+S+ V R
Sbjct: 142 VKCNADC-----NCDENGVQCTYERRY-AEMSTSSGVLAEDVMSFG---KESELVPQRAV 192
Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISF 283
FGC +++G A +G+ GLG SV L +G++ NSFS+C+G G G +
Sbjct: 193 FGCETMESGDLYTQRA-DGIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGGMDVGGGAMVL 251
Query: 284 GDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-------EFSAIFDSGTSFTYL 336
G SP S P YNI + ++ V G + ++ AI DSGT++ Y
Sbjct: 252 GGISSPPGMVFSHSDPSRSPYYNIELKEIHVAGKPLKLNPRTFDGKYGAILDSGTTYAYF 311
Query: 337 NDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFE----YPVVNLTMKGGGP 392
+ AY + ++ S D F+ + E +P V++ G
Sbjct: 312 PEKAYYAFKDAIMKKISFLKQISGPDPNFKDICFSGAGRDVTELPKVFPEVDMVFANGQK 371
Query: 393 FFVNDPIVIVSSEPKGLYLYCLGVVKSDN 421
++ P + K YCLG+ K+ N
Sbjct: 372 ISLS-PENYLFRHTKVSGAYCLGIFKNGN 399
>gi|222622847|gb|EEE56979.1| hypothetical protein OsJ_06707 [Oryza sativa Japonica Group]
Length = 494
Score = 105 bits (263), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 109/383 (28%), Positives = 162/383 (42%), Gaps = 51/383 (13%)
Query: 55 GSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQP 114
G + SAL D R GR LAA PL S L + L++T + +G P
Sbjct: 51 GGEGHLSALREHDG--RRHGRLLAAI---DLPLGGSG------LATETGLYFTRIGIGTP 99
Query: 115 ALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC- 173
A + V +DTGSD+ W+ +CVSC G S I+ +Y P S + V C+ C
Sbjct: 100 AKRYYVQVDTGSDILWV--NCVSC-DGCPRKSNLGIELTMYDPRGSQSGELVTCDQQFCV 156
Query: 174 ----ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQSKSVDSRISFG 227
+ C S S C Y + Y DG+ + GF V D L + + Q+ ++ +SFG
Sbjct: 157 ANYGGVLPSCTST-SPCEYSISY-GDGSSTAGFFVTDFLQYNQVSGDGQTTPANASVSFG 214
Query: 228 CGRVQTGSF-LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGTGRISFGD 285
CG G A +G+ G G +S+ S LA G + F+ C + +G G + G+
Sbjct: 215 CGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCLDTVNGGGIFAIGN 274
Query: 286 KGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF---------EFSAIFDSGTSFTYL 336
P TP L P YN+ + + VGG A+ I DSGT+ Y+
Sbjct: 275 VVQPKVKTTP--LVPDMPHYNVILKGIDVGGTALGLPTNIFDSGNSKGTIIDSGTTLAYV 332
Query: 337 NDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVN 396
+ Y + F + + ++ S L C+ S + +P V +G
Sbjct: 333 PEGVYKAL---FAMVFDKHQDISVQTLQDFSCFQYS-GSVDDGFPEVTFHFEG------- 381
Query: 397 DPIVIVSSE----PKGLYLYCLG 415
D +IVS G LYC+G
Sbjct: 382 DVSLIVSPHDYLFQNGKNLYCMG 404
>gi|413953656|gb|AFW86305.1| hypothetical protein ZEAMMB73_151223 [Zea mays]
Length = 406
Score = 105 bits (263), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 75/254 (29%), Positives = 125/254 (49%), Gaps = 30/254 (11%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD---CVSCVHGLNSSSGQVIDFNIYSPNTS 160
L+YT +S+G P + + +DTGS W+ CD C SC G + +Y P +
Sbjct: 159 LYYTAISLGSPPRPYFLDVDTGSHTTWVQCDAPPCASCAKGAHP---------LYRP--A 207
Query: 161 STSSKVPCNSTLCE-LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
T+ +P + LCE Q + P + C Y++ Y +DG+ S G V D + ++ + ++
Sbjct: 208 RTADALPASDPLCEGAQHENP---NQCDYEISY-ADGSSSMGVYVRDSMQFVGEDGEREN 263
Query: 220 VDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT 278
D I FGCG Q G L+ +G+ GL S+P+ LA++G+I N+F C +D +
Sbjct: 264 AD--IVFGCGYDQQGVLLNALETTDGVLGLTNKALSLPTQLASRGIISNAFGHCMSTDPS 321
Query: 279 GR---ISFGDKGSPGQGETPFSLRQ--THPTYNITITQVSVGGNAVNFE---FSAIFDSG 330
G + GD P G T +R + Q++ G +N + +FD+G
Sbjct: 322 GAGGYLFLGDDYIPRWGMTWVPIRDGPADDVRRAQVKQINHGDQQLNAQGKLTQVVFDTG 381
Query: 331 TSFTYLNDPAYTQI 344
+++TY D A T++
Sbjct: 382 STYTYFPDEALTRL 395
>gi|225438361|ref|XP_002273988.1| PREDICTED: aspartic proteinase Asp1 [Vitis vinifera]
gi|296082608|emb|CBI21613.3| unnamed protein product [Vitis vinifera]
Length = 426
Score = 105 bits (263), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 88/261 (33%), Positives = 123/261 (47%), Gaps = 23/261 (8%)
Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPN 158
LG+ +Y ++S+GQP + + DTGSDL WL CD CV C + +Y PN
Sbjct: 64 LGY-YYVSLSIGQPPKPYFLDPDTGSDLSWLQCDAPCVRCTKAPHP---------LYRPN 113
Query: 159 TSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
+ K P ++L +C C Y+V Y +DG S G LV+DV L +
Sbjct: 114 NNLVICKDPMCASLHPPGYKCEHP-EQCDYEVEY-ADGGSSLGVLVKDVFPL--NFTNGL 169
Query: 219 SVDSRISFGCGRVQTGSFLDGAAP-NGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDG 277
+ R++ GCG Q P +G+ GLG K+S+ S L +QG+I N C S G
Sbjct: 170 RLAPRLALGCGYDQIPG--QSYHPLDGVLGLGKGKSSIVSQLHSQGVIRNVVGHCVSSRG 227
Query: 278 TGRISFGDK--GSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE-FSAIFDSGTSFT 334
G + FGD S TP LR H Y+ ++ +GG F+ FDSG+S+T
Sbjct: 228 GGFLFFGDDLYDSSRVVWTPM-LRDQHTHYSSGYAELILGGKTTVFKNLLVTFDSGSSYT 286
Query: 335 YLNDPAYTQISETFNSLAKEK 355
YLN AY + EK
Sbjct: 287 YLNSLAYQALVHLVRKELSEK 307
>gi|168027607|ref|XP_001766321.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162682535|gb|EDQ68953.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 381
Score = 105 bits (263), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 91/282 (32%), Positives = 129/282 (45%), Gaps = 43/282 (15%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
L+Y + +G PA + + +DTGSDL WL CD C SC G + +Y P +
Sbjct: 22 LYYMAMLIGAPAKLYYLDMDTGSDLTWLQCDAPCRSCASGPH---------GLYDPKKAR 72
Query: 162 TSSKVPCNSTLCELQKQ-----CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLH-LATDEK 215
V C LC L +Q C C Y V Y +DG+ + G L+ED + L T+
Sbjct: 73 L---VDCRVPLCALVQQGGSYACGGPVRQCDYDVEY-ADGSSTMGVLMEDTITLLLTNGT 128
Query: 216 QSKSVDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF- 273
+SK+ GCG Q G+ A+ +G+ GL K S+PS LA +G++ N C
Sbjct: 129 RSKTT---AIIGCGYDQQGTLAQTPASTDGVMGLSSAKISLPSQLAKKGIVRNVIGHCLA 185
Query: 274 -GSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS------AI 326
GS+G G + FGD P G T + T NI GG + + + +
Sbjct: 186 GGSNGGGYLFFGDSLVPALGMTWTPIMGKSITGNI-------GGKSGDADDKTGDIGGVM 238
Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEK---RETSTSDLPF 365
FDSGTSFTYL AY + ++ R + + LPF
Sbjct: 239 FDSGTSFTYLVPEAYNAVLSAMEMQVEKSGLVRIKTDNTLPF 280
>gi|115446115|ref|NP_001046837.1| Os02g0473200 [Oryza sativa Japonica Group]
gi|47497549|dbj|BAD19621.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
Group]
gi|47847591|dbj|BAD21978.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
Group]
gi|113536368|dbj|BAF08751.1| Os02g0473200 [Oryza sativa Japonica Group]
Length = 494
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 109/383 (28%), Positives = 162/383 (42%), Gaps = 51/383 (13%)
Query: 55 GSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQP 114
G + SAL D R GR LAA PL S L + L++T + +G P
Sbjct: 51 GGEGHLSALREHDG--RRHGRLLAAI---DLPLGGSG------LATETGLYFTRIGIGTP 99
Query: 115 ALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC- 173
A + V +DTGSD+ W+ +CVSC G S I+ +Y P S + V C+ C
Sbjct: 100 AKRYYVQVDTGSDILWV--NCVSC-DGCPRKSNLGIELTMYDPRGSQSGELVTCDQQFCV 156
Query: 174 ----ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQSKSVDSRISFG 227
+ C S S C Y + Y DG+ + GF V D L + + Q+ ++ +SFG
Sbjct: 157 ANYGGVLPSCTST-SPCEYSISY-GDGSSTAGFFVTDFLQYNQVSGDGQTTPANASVSFG 214
Query: 228 CGRVQTGSF-LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGTGRISFGD 285
CG G A +G+ G G +S+ S LA G + F+ C + +G G + G+
Sbjct: 215 CGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCLDTVNGGGIFAIGN 274
Query: 286 KGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF---------EFSAIFDSGTSFTYL 336
P TP L P YN+ + + VGG A+ I DSGT+ Y+
Sbjct: 275 VVQPKVKTTP--LVPDMPHYNVILKGIDVGGTALGLPTNIFDSGNSKGTIIDSGTTLAYV 332
Query: 337 NDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVN 396
+ Y + F + + ++ S L C+ S + +P V +G
Sbjct: 333 PEGVYKAL---FAMVFDKHQDISVQTLQDFSCFQYS-GSVDDGFPEVTFHFEG------- 381
Query: 397 DPIVIVSSE----PKGLYLYCLG 415
D +IVS G LYC+G
Sbjct: 382 DVSLIVSPHDYLFQNGKNLYCMG 404
>gi|414881575|tpg|DAA58706.1| TPA: hypothetical protein ZEAMMB73_168363 [Zea mays]
Length = 506
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 86/296 (29%), Positives = 133/296 (44%), Gaps = 36/296 (12%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
L++T + +G P + V +DTGSD+ W+ +C+SC SG +D Y P SS+
Sbjct: 86 LYFTEIKLGTPPKRYYVQVDTGSDILWV--NCISCSK-CPRKSGLGLDLTFYDPKASSSG 142
Query: 164 SKVPCNSTLCELQ--KQCPSAGSN--CPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQS 217
S V C+ C + P +N C Y V Y DG+ +TGF + D L T + Q+
Sbjct: 143 STVSCDQGFCAATYGGKLPGCTANVPCEYSVMY-GDGSSTTGFFITDALQFDQVTGDGQT 201
Query: 218 KSVDSRISFGCGRVQTGSFLD-GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS- 275
+ ++ I+FGCG Q G + A +G+ G G TS+ S LA G F+ C +
Sbjct: 202 QPGNATITFGCGAQQGGDLGNSNQALDGILGFGQANTSMLSQLAAAGKAKKIFAHCLDTI 261
Query: 276 DGTGRISFGDKGSP----------GQGETPFSL----RQTHPTYNITITQVSVGGNAVNF 321
G G + G+ P G P L + P YN+ + + VGG +
Sbjct: 262 KGGGIFAIGNVVQPKCYFVFFFAHGLLNIPLFLLVMILLSRPHYNVNLKSIDVGGTTLQL 321
Query: 322 ---------EFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSD-LPFEY 367
+ I DSGT+ TYL + + Q+ + S ++ + D L F+Y
Sbjct: 322 PAHVFETGEKKGTIIDSGTTLTYLPELVFKQVMDVVFSKHRDIAFHNLQDFLCFQY 377
>gi|326512066|dbj|BAJ96014.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 485
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 95/331 (28%), Positives = 149/331 (45%), Gaps = 35/331 (10%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L++T + +G PA S+ V +DTGSD+ W+ C C +C SG I+ +Y P+ SS+
Sbjct: 80 LYFTQIGIGTPAKSYYVQVDTGSDILWVNCVFCDTCPR----KSGLGIELTLYDPSGSSS 135
Query: 163 SSKVPCNSTLCELQKQ--CPSA--GSNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQ 216
+ V C C PS + C Y + Y DG+ +TGF V D L + Q
Sbjct: 136 GTGVTCGQDFCVATHGGVIPSCVPAAPCQYSISY-GDGSSTTGFFVTDFLQYNQVSGNSQ 194
Query: 217 SKSVDSRISFGCGRVQTGSFLDGA-APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
+ ++ I+FGCG G + A +G+ G G +S+ S LA G + F+ C +
Sbjct: 195 TTLANTSITFGCGAKIGGDLGSSSQALDGILGFGQSNSSMLSQLAAAGKVRKVFAHCLDT 254
Query: 276 -DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS---------A 325
+G G + GD P TP L P YN+ + + VGG + +
Sbjct: 255 INGGGIFAIGDVVQPKVSTTP--LVPGMPHYNVNLEAIDVGGVKLQLPTNIFDIGESKGT 312
Query: 326 IFDSGTSFTYLNDPAYTQI-SETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVN 384
I DSGT+ YL Y I S+ F A+ +D F+ C+ S + +P++
Sbjct: 313 IIDSGTTLAYLPGVVYNAIMSKVF---AQYGDMPLKNDQDFQ-CFRYS-GSVDDGFPIIT 367
Query: 385 LTMKGGGPFFVNDPIVIVSSEPKGLYLYCLG 415
+GG P ++ + + LYC+G
Sbjct: 368 FHFEGGLPLNIHPHDYLFQNGE----LYCMG 394
>gi|255549236|ref|XP_002515672.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223545215|gb|EEF46724.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 492
Score = 104 bits (260), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 87/284 (30%), Positives = 134/284 (47%), Gaps = 32/284 (11%)
Query: 82 NDKTPLTFSAGND-----TYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPC-DC 135
+D+ L AG D + R +++G L+Y V +G P+ + V +DTGSD+ W+ C C
Sbjct: 59 DDRRQLRILAGVDLPLGGSGRPDTVG-LYYAKVGIGTPSKDYYVQVDTGSDIMWVNCIQC 117
Query: 136 VSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCP----SAGSNCPYQVR 191
C SS G ++ +Y+ S + VPC+ C P +A +CPY
Sbjct: 118 RECPR--TSSLG--MELTLYNIKDSVSGKLVPCDEEFCYEVNGGPLSGCTANMSCPYLEI 173
Query: 192 YLSDGTMSTGFLVEDVLHL--ATDEKQSKSVDSRISFGCGRVQTGSF--LDGAAPNGLFG 247
Y DG+ + G+ V+DV+ + + Q+ S + + FGCG Q+G A +G+ G
Sbjct: 174 Y-GDGSSTAGYFVKDVVQYDRVSGDLQTTSSNGSVIFGCGARQSGDLGPTSEEALDGILG 232
Query: 248 LGMDKTSVPSILANQGLIPNSFSMCF-GSDGTGRISFGDKGSPGQGETPFSLRQTHPTYN 306
G +S+ S LA + F+ C G +G G + G P TP Q H YN
Sbjct: 233 FGKSNSSMISQLAATRKVKKIFAHCLDGINGGGIFAIGHVVQPKVNMTPLIPNQPH--YN 290
Query: 307 ITITQVSVGGNAVNF---EFS------AIFDSGTSFTYLNDPAY 341
+ +T V VG + ++ EF AI DSGT+ YL + Y
Sbjct: 291 VNMTAVQVGEDFLHLPTEEFEAGDRKGAIIDSGTTLAYLPEIVY 334
>gi|307103543|gb|EFN51802.1| hypothetical protein CHLNCDRAFT_59135 [Chlorella variabilis]
Length = 746
Score = 104 bits (260), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 86/290 (29%), Positives = 131/290 (45%), Gaps = 29/290 (10%)
Query: 106 YTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
Y + +G PA F V +DTGS + ++PC G N + P SST+S+
Sbjct: 79 YATLYLGTPAKKFAVIVDTGSTMTYVPCSSCGSGCGPNHQDA------AFDPEASSTASR 132
Query: 166 VPCNSTLCEL-QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRI 224
+ C S C +C + C Y R ++ + S+G L+EDVL L + I
Sbjct: 133 ISCTSPKCSCGSPRCGCSTQQCTY-TRSYAEQSSSSGILLEDVLAL-----HDGLPGAPI 186
Query: 225 SFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGTGRISF 283
FGC +TG A +GLFGLG SV + L G+I + FS+CFG +G G +
Sbjct: 187 IFGCETRETGEIFRQRA-DGLFGLGNSDASVVNQLVKAGVIDDVFSLCFGMVEGDGALLL 245
Query: 284 GDKGSPGQ---GETPFSLRQTHP-TYNITITQVSVGGNAVNFE-------FSAIFDSGTS 332
GD PG TP THP YN+ + ++V G + + + DSGT+
Sbjct: 246 GDAEVPGSISLQYTPLLTSTTHPFYYNVKMLSLAVEGQLLPVSQSLFDQGYGTVLDSGTT 305
Query: 333 FTYLNDPAYTQISETFN--SLAKEKRETSTSDLPF-EYCYVLSPNQTNFE 379
FTY+ P + + +L+ + D F + C+ +P+ + E
Sbjct: 306 FTYMPSPVFKAFAGAVEKYALSHGLKRVPGPDPQFDDICFGQAPSHDDLE 355
>gi|356568507|ref|XP_003552452.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 481
Score = 104 bits (260), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 95/362 (26%), Positives = 153/362 (42%), Gaps = 36/362 (9%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L+YT + +G + V +DTGSD W+ C C +C SG +D +Y PN S T
Sbjct: 75 LYYTKIGLGPK--DYYVQVDTGSDTLWVNCVGCTAC----PKKSGLGMDLTLYDPNLSKT 128
Query: 163 SSKVPCNSTLC----ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
S VPC+ C + Q + G +CPY + Y DG+ ++G ++D L +
Sbjct: 129 SKAVPCDDEFCTSTYDGQISGCTKGMSCPYSITY-GDGSTTSGSYIKDDLTFDRVVGDLR 187
Query: 219 SV--DSRISFGCGRVQTGSF--LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG 274
+V ++ + FGCG Q+G+ + +G+ G G +SV S LA G + FS C
Sbjct: 188 TVPDNTSVIFGCGSKQSGTLSSTTDTSLDGIIGFGQANSSVLSQLAAAGKVKRIFSHCLD 247
Query: 275 S-DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS--------- 324
S G G + G+ P TP L Q YN+ + + V G+ +
Sbjct: 248 SISGGGIFAIGEVVQPKVKTTP--LLQGMAHYNVVLKDIEVAGDPIQLPSDILDSSSGRG 305
Query: 325 AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVN 384
I DSGT+ YL Y Q+ E + + D F + + +P V
Sbjct: 306 TIIDSGTTLAYLPVSIYDQLLEKILAQRSGMKLYLVED-QFTCFHYSDEESVDDLFPTVK 364
Query: 385 LTMKGGGPF--FVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREYPIANNISLFHNCY 442
T + G + D + + + ++C+G KS G+E + ++ L +
Sbjct: 365 FTFEEGLTLTTYPRDYLFLFKED-----MWCVGWQKSMAQTKDGKELILLGDLVLANKLV 419
Query: 443 SY 444
Y
Sbjct: 420 VY 421
>gi|226492633|ref|NP_001149953.1| LOC100283580 precursor [Zea mays]
gi|195635701|gb|ACG37319.1| pepsin A [Zea mays]
Length = 491
Score = 104 bits (259), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 113/406 (27%), Positives = 168/406 (41%), Gaps = 72/406 (17%)
Query: 12 VLLILLSCCAGCCFGFGTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRD--RY 69
+L++L + GC G F R P G +G + +AL D R+
Sbjct: 14 LLVLLFALSVGCASATGVF----QVRRKFPRHG--------GRGVAEHLAALRRHDANRH 61
Query: 70 FRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLF 129
RL G A G P DT L+YT + +G P + V +DTGSD+
Sbjct: 62 GRLLGAVDLALGGVGLP------TDT-------GLYYTRIEIGSPPKGYYVQVDTGSDIL 108
Query: 130 WLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQ------CPSAG 183
W+ +C+ C G + SG I+ Y P S T+ V C C CPS
Sbjct: 109 WV--NCIRC-DGCPTRSGLGIELTQYDPAGSGTT--VGCEQEFCVANSAGGVPPTCPSTS 163
Query: 184 SNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQSKSVDSRISFGCGRVQTGSFLDGA- 240
S C +++ Y DG+ +TGF V D + + Q+ + ++ I+FGCG Q G L +
Sbjct: 164 SPCQFRITY-GDGSTTTGFYVTDFVQYNQVSGNGQTTTSNASITFGCG-AQLGGDLGSSN 221
Query: 241 -APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGTGRISFGDKGSPGQGETPFSL 298
A +G+ G G +S+ S LA + F+ C + G G + G+ P TP
Sbjct: 222 QALDGILGFGQSDSSMLSQLAAARRVRKIFAHCLDTVRGGGIFAIGNVVQPKVKTTPLVP 281
Query: 299 RQTHPTYNITITQVSVGGNAVNFEFS---------AIFDSGTSFTYLNDPAY-TQISETF 348
TH YN+ + +SVGG + S I DSGT+ YL Y T ++ F
Sbjct: 282 NVTH--YNVNLQGISVGGATLQLPTSTFDSGDSKGTIIDSGTTLAYLPREVYRTLLAAVF 339
Query: 349 NSLAKEKRETSTSDLPFE-----YCYVLSPNQTNFEYPVVNLTMKG 389
+ DLP C+ S + +PV+ + KG
Sbjct: 340 DKY---------QDLPLHNYQDFVCFQFS-GSIDDGFPVITFSFKG 375
>gi|222616728|gb|EEE52860.1| hypothetical protein OsJ_35411 [Oryza sativa Japonica Group]
Length = 395
Score = 104 bits (259), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 82/276 (29%), Positives = 120/276 (43%), Gaps = 29/276 (10%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
L+Y +S+G P + + +DTGSDL WL CD CVSC + +Y P +
Sbjct: 57 LYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSC---------SKVPHPLYRPTKNK 107
Query: 162 TSSKVPCNSTLCEL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
VPC +C + +C S C Y+++Y G+ S G LV D L
Sbjct: 108 L---VPCVDQMCAALHGGLTGRHKCDSPKQQCDYEIKYADQGS-SLGVLVTDSFALRL-- 161
Query: 215 KQSKSVDSRISFGCG-RVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
S V ++FGCG Q GS + +A +G+ GLG S+ S L G+ N C
Sbjct: 162 ANSSIVRPGLAFGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCL 221
Query: 274 GSDGTGRISFGDKGSPGQGET--PFSLRQTHPTYNITITQVSVGGNAVNFE-FSAIFDSG 330
+ G G + FGD P T P + + Y+ + GG + +FDSG
Sbjct: 222 STRGGGFLFFGDDIVPYSRATWAPMARSTSRNYYSPGSANLYFGGRPLGVRPMEVVFDSG 281
Query: 331 TSFTYLNDPAYTQISETFN-SLAKEKRETSTSDLPF 365
+SFTY + Y + + L+K +E LP
Sbjct: 282 SSFTYFSAQPYQALVDAIKGDLSKNLKEVPDHSLPL 317
>gi|218186522|gb|EEC68949.1| hypothetical protein OsI_37668 [Oryza sativa Indica Group]
Length = 421
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 82/276 (29%), Positives = 120/276 (43%), Gaps = 29/276 (10%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
L+Y +S+G P + + +DTGSDL WL CD CVSC + +Y P +
Sbjct: 57 LYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCSK---------VPHPLYRPTKNK 107
Query: 162 TSSKVPCNSTLCEL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
VPC +C + +C S C Y+++Y G+ S G LV D L
Sbjct: 108 L---VPCVDQMCAALHGGLTGRHKCDSPKQQCDYEIKYADQGS-SLGVLVTDSFALRL-- 161
Query: 215 KQSKSVDSRISFGCG-RVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
S V ++FGCG Q GS + +A +G+ GLG S+ S L G+ N C
Sbjct: 162 ANSSIVRPGLAFGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCL 221
Query: 274 GSDGTGRISFGDKGSPGQGET--PFSLRQTHPTYNITITQVSVGGNAVNFE-FSAIFDSG 330
+ G G + FGD P T P + + Y+ + GG + +FDSG
Sbjct: 222 STRGGGFLFFGDDIVPYSRATWAPMARSTSRNYYSPGSANLYFGGRPLGVRPMEVVFDSG 281
Query: 331 TSFTYLNDPAYTQISETFN-SLAKEKRETSTSDLPF 365
+SFTY + Y + + L+K +E LP
Sbjct: 282 SSFTYFSAQPYQALVDAIKGDLSKNLKEVPDHSLPL 317
>gi|115487628|ref|NP_001066301.1| Os12g0177500 [Oryza sativa Japonica Group]
gi|108862256|gb|ABA96613.2| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113648808|dbj|BAF29320.1| Os12g0177500 [Oryza sativa Japonica Group]
gi|215693997|dbj|BAG89196.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 421
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 82/276 (29%), Positives = 120/276 (43%), Gaps = 29/276 (10%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
L+Y +S+G P + + +DTGSDL WL CD CVSC + +Y P +
Sbjct: 57 LYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCSK---------VPHPLYRPTKNK 107
Query: 162 TSSKVPCNSTLCEL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
VPC +C + +C S C Y+++Y G+ S G LV D L
Sbjct: 108 L---VPCVDQMCAALHGGLTGRHKCDSPKQQCDYEIKYADQGS-SLGVLVTDSFALRL-- 161
Query: 215 KQSKSVDSRISFGCG-RVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
S V ++FGCG Q GS + +A +G+ GLG S+ S L G+ N C
Sbjct: 162 ANSSIVRPGLAFGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCL 221
Query: 274 GSDGTGRISFGDKGSPGQGET--PFSLRQTHPTYNITITQVSVGGNAVNFE-FSAIFDSG 330
+ G G + FGD P T P + + Y+ + GG + +FDSG
Sbjct: 222 STRGGGFLFFGDDIVPYSRATWAPMARSTSRNYYSPGSANLYFGGRPLGVRPMEVVFDSG 281
Query: 331 TSFTYLNDPAYTQISETFN-SLAKEKRETSTSDLPF 365
+SFTY + Y + + L+K +E LP
Sbjct: 282 SSFTYFSAQPYQALVDAIKGDLSKNLKEVPDHSLPL 317
>gi|108862257|gb|ABA96612.2| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
Length = 451
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 82/276 (29%), Positives = 120/276 (43%), Gaps = 29/276 (10%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
L+Y +S+G P + + +DTGSDL WL CD CVSC + +Y P +
Sbjct: 57 LYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCSK---------VPHPLYRPTKNK 107
Query: 162 TSSKVPCNSTLCEL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
VPC +C + +C S C Y+++Y G+ S G LV D L
Sbjct: 108 L---VPCVDQMCAALHGGLTGRHKCDSPKQQCDYEIKYADQGS-SLGVLVTDSFALRL-- 161
Query: 215 KQSKSVDSRISFGCG-RVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
S V ++FGCG Q GS + +A +G+ GLG S+ S L G+ N C
Sbjct: 162 ANSSIVRPGLAFGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCL 221
Query: 274 GSDGTGRISFGDKGSPGQGET--PFSLRQTHPTYNITITQVSVGGNAVNFE-FSAIFDSG 330
+ G G + FGD P T P + + Y+ + GG + +FDSG
Sbjct: 222 STRGGGFLFFGDDIVPYSRATWAPMARSTSRNYYSPGSANLYFGGRPLGVRPMEVVFDSG 281
Query: 331 TSFTYLNDPAYTQISETFN-SLAKEKRETSTSDLPF 365
+SFTY + Y + + L+K +E LP
Sbjct: 282 SSFTYFSAQPYQALVDAIKGDLSKNLKEVPDHSLPL 317
>gi|147802609|emb|CAN73001.1| hypothetical protein VITISV_037997 [Vitis vinifera]
Length = 424
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 86/260 (33%), Positives = 120/260 (46%), Gaps = 21/260 (8%)
Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPN 158
LG+ +Y ++S+GQP + + TGSDL WL CD CV C + +Y PN
Sbjct: 64 LGY-YYVSLSIGQPPXPYFLDPXTGSDLSWLQCDAPCVRCTKAXHX---------LYRPN 113
Query: 159 TSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
+ K P + L +C C Y+V Y +DG S G LV+DV L +
Sbjct: 114 NNLVICKDPMCAXLHPPGYKCEHP-EQCDYEVEY-ADGGSSLGVLVKDVFPL--NFTNGL 169
Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT 278
+ R++ GCG Q +G+ GLG K+S+ S L +QG+I N C S G
Sbjct: 170 RLAPRLALGCGYDQIPG-XSYHPLDGVLGLGKGKSSIVSQLHSQGVIRNVVGHCVSSHGG 228
Query: 279 GRISFGDK--GSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE-FSAIFDSGTSFTY 335
G + FGD S TP LR H Y+ ++ +GG F+ FDSG+S+TY
Sbjct: 229 GFLFFGDDLYDSSRVVWTPM-LRDQHTHYSSGYAELILGGKTTVFKNLLVTFDSGSSYTY 287
Query: 336 LNDPAYTQISETFNSLAKEK 355
LN AY + EK
Sbjct: 288 LNSLAYQALVHLVRKELSEK 307
>gi|115448471|ref|NP_001048015.1| Os02g0730700 [Oryza sativa Japonica Group]
gi|46390468|dbj|BAD15929.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
Group]
gi|46390864|dbj|BAD16368.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
Group]
gi|113537546|dbj|BAF09929.1| Os02g0730700 [Oryza sativa Japonica Group]
gi|215697021|dbj|BAG91015.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222623612|gb|EEE57744.1| hypothetical protein OsJ_08261 [Oryza sativa Japonica Group]
Length = 573
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 82/260 (31%), Positives = 117/260 (45%), Gaps = 40/260 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
+YT++ VG P + + +DTGSDL W+ CD C +C G + +Y P +
Sbjct: 203 YYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHP---------LYKP---AK 250
Query: 163 SSKVPCNSTLCEL----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
VP LC+ Q C + C Y++ Y +D + S G L D +H+ T +
Sbjct: 251 EKIVPPKDLLCQELQGNQNYCETC-KQCDYEIEY-ADRSSSMGVLARDDMHIITTNGGRE 308
Query: 219 SVDSRISFGCGRVQTGSFLDGAAP-NGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD- 276
+D FGC Q G L A +G+ GL S+PS LANQG+I N F C D
Sbjct: 309 KLD--FVFGCAYDQQGQLLASPAKTDGILGLSSAGISLPSQLANQGIISNVFGHCITRDP 366
Query: 277 -GTGRISFGDKGSPGQGETPFSLR---------QTHPTY--NITITQVSVGGNAVNFEFS 324
G G + GD P G T +R + Y + ++ GN+V
Sbjct: 367 NGGGYMFLGDDYVPRWGMTSTPIRSAPDNLFHTEAQKVYYGDQQLSMRGASGNSVQ---- 422
Query: 325 AIFDSGTSFTYLNDPAYTQI 344
IFDSG+S+TYL D Y +
Sbjct: 423 VIFDSGSSYTYLPDEIYKNL 442
>gi|218191512|gb|EEC73939.1| hypothetical protein OsI_08807 [Oryza sativa Indica Group]
Length = 574
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 82/260 (31%), Positives = 117/260 (45%), Gaps = 40/260 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
+YT++ VG P + + +DTGSDL W+ CD C +C G + +Y P +
Sbjct: 204 YYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHP---------LYKP---AK 251
Query: 163 SSKVPCNSTLCEL----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
VP LC+ Q C + C Y++ Y +D + S G L D +H+ T +
Sbjct: 252 EKIVPPKDLLCQELQGNQNYCETC-KQCDYEIEY-ADRSSSMGVLARDDMHIITTNGGRE 309
Query: 219 SVDSRISFGCGRVQTGSFLDGAAP-NGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD- 276
+D FGC Q G L A +G+ GL S+PS LANQG+I N F C D
Sbjct: 310 KLD--FVFGCAYDQQGQLLASPAKTDGILGLSSAGISLPSQLANQGIISNVFGHCITRDP 367
Query: 277 -GTGRISFGDKGSPGQGETPFSLR---------QTHPTY--NITITQVSVGGNAVNFEFS 324
G G + GD P G T +R + Y + ++ GN+V
Sbjct: 368 NGGGYMFLGDDYVPRWGMTSTPIRSAPDNLFHTEAQKVYYGDQQLSMRGASGNSVQ---- 423
Query: 325 AIFDSGTSFTYLNDPAYTQI 344
IFDSG+S+TYL D Y +
Sbjct: 424 VIFDSGSSYTYLPDEIYKNL 443
>gi|357124567|ref|XP_003563970.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 395
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 81/278 (29%), Positives = 125/278 (44%), Gaps = 33/278 (11%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHG----LNSSSGQVIDFNIYSPN 158
+YT++++G P + + +DTGSD W+ CD C +C G + G+++
Sbjct: 16 YYTSINIGNPPRPYFLDIDTGSDFTWIHCDAPCTNCTKGPHPVYKPTEGKIVH------P 69
Query: 159 TSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
++ N CE KQ C Y++ Y +D + S G L D + L T + + K
Sbjct: 70 RDPLCEELQGNQNYCETCKQ-------CDYEITY-ADRSSSKGVLARDNMQLTTADGEMK 121
Query: 219 SVDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDG 277
+VD FGC Q G LD + +G+ GL S+ + LAN G+I N F C +D
Sbjct: 122 NVD--FVFGCAHNQQGKLLDSPTSTDGILGLSNGAISLSTQLANSGIISNVFGHCMATDP 179
Query: 278 T--GRISFGDKGSPGQGETPFSLRQ-THPTYNITITQVSVGGNAVNFEFSA------IFD 328
+ G + GD P G T +R Y+ + +V+ G +N A IFD
Sbjct: 180 SSGGYMFLGDDYVPRWGMTWVPIRNGPGNVYSTEVPKVNYGAQELNLRGQAGKLTQVIFD 239
Query: 329 SGTSFTYLNDPAYTQISETFNSLAKE-KRETSTSDLPF 365
SG+S+TY YT + + R+ S LPF
Sbjct: 240 SGSSYTYFPHEIYTNLIALLEDASPGFVRDESDQTLPF 277
>gi|312282457|dbj|BAJ34094.1| unnamed protein product [Thellungiella halophila]
Length = 424
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 87/280 (31%), Positives = 131/280 (46%), Gaps = 31/280 (11%)
Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTS 160
LG+ + T +++GQP + + LDTGSDL WL CD CVH L + +Y P
Sbjct: 54 LGYYNVT-INIGQPPRPYYLDLDTGSDLTWLQCD-APCVHCLEAPH------PLYQP--- 102
Query: 161 STSSKVPCNSTLCEL-----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEK 215
++ +PCN LC+ +C + C Y+V Y +DG S G LV DV L +
Sbjct: 103 -SNDLIPCNDPLCKALHFNGNHRCETP-EQCDYEVEY-ADGGSSLGVLVRDVFSL--NYT 157
Query: 216 QSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
+ + R++ GCG Q +G+ GLG K S+ S L +QG + N C S
Sbjct: 158 KGLRLTPRLALGCGYDQIPGASGHHPLDGVLGLGRGKVSILSQLHSQGYVKNVVGHCLSS 217
Query: 276 DGTGRISFGDK--GSPGQGETPFSLRQTHPTYNITI-TQVSVGGNAVNFE-FSAIFDSGT 331
G G + FG+ S TP + R+ Y+ + ++ GG + +FDSG+
Sbjct: 218 LGGGILFFGNDLYDSSRVSWTPMA-RENSKHYSPAMGGELLFGGRTTGLKNLLTVFDSGS 276
Query: 332 SFTYLNDPAYTQIS-----ETFNSLAKEKRETSTSDLPFE 366
S+TY N AY ++ E KE R+ T L ++
Sbjct: 277 SYTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQ 316
>gi|242065058|ref|XP_002453818.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
gi|241933649|gb|EES06794.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
Length = 490
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 99/365 (27%), Positives = 159/365 (43%), Gaps = 44/365 (12%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
L+YT + +G P+ + V +DTGSD+ W+ +C+ C G ++SG I+ Y P S T+
Sbjct: 84 LYYTQIEIGSPSKGYYVQVDTGSDILWV--NCIRC-DGCPTTSGLGIELTQYDPAGSGTT 140
Query: 164 SKVPCNSTLC------ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEK 215
V C+ C L CPS S C +++ Y DG+ +TGF V D + +
Sbjct: 141 --VGCDQEFCVANSPNGLPPACPSTSSPCQFRIAY-GDGSSTTGFYVSDSVQYNQVSGNG 197
Query: 216 QSKSVDSRISFGCGRVQTGSFLDGA--APNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
Q+ ++ I+FGCG Q G L + A +G+ G G +S+ S LA + F+ C
Sbjct: 198 QTTPSNASITFGCG-AQLGGDLGSSSQALDGILGFGQADSSMLSQLAAARKVRKIFAHCL 256
Query: 274 GS-DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS-------- 324
+ G G + G+ P TP TH YN+ + +SVGG + S
Sbjct: 257 DTVHGGGIFAIGNVVQPKVKTTPLVQNVTH--YNVNLQGISVGGATLQLPSSTFDSGDSK 314
Query: 325 -AIFDSGTSFTYLNDPAY----TQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFE 379
I DSGT+ YL Y T + + + LA + C+ S +
Sbjct: 315 GTIIDSGTTLAYLPREVYRTLLTAVFDKYQDLALHNYQDFV-------CFQFS-GSIDDG 366
Query: 380 YPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREYPIANNISLFH 439
+PVV + +G V + +E LYC+G + G++ + ++ L +
Sbjct: 367 FPVVTFSFEGEITLNVYPHDYLFQNEND---LYCMGFLDGGVQTKDGKDMVLLGDLVLSN 423
Query: 440 NCYSY 444
Y
Sbjct: 424 KLVVY 428
>gi|297723027|ref|NP_001173877.1| Os04g0336942 [Oryza sativa Japonica Group]
gi|255675342|dbj|BAH92605.1| Os04g0336942 [Oryza sativa Japonica Group]
Length = 388
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 73/255 (28%), Positives = 120/255 (47%), Gaps = 21/255 (8%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLP-CDCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L+YT++ +G PA+ + V LDTGS FW+ C C H S + Y P +S +
Sbjct: 82 LYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPH----ESDILRKLTFYDPRSSVS 137
Query: 163 SSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLAT--DEKQSKSV 220
S +V C+ T+C + C + CPY Y +DG ++ G L D+LH Q++
Sbjct: 138 SKEVKCDDTICTSRPPC-NMTLRCPYITGY-ADGGLTMGILFTDLLHYHQLYGNGQTQPT 195
Query: 221 DSRISFGCGRVQTGSFLDGA-APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGT 278
+ ++FGCG Q+GS + A A +G+ G G + S LA G FS C S +G
Sbjct: 196 STSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTNGG 255
Query: 279 GRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS---------AIFDS 329
G + G+ P TP ++ + + + ++V G + + DS
Sbjct: 256 GIFAIGEVVEPKVKTTPI-VKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKTKGTFIDS 314
Query: 330 GTSFTYLNDPAYTQI 344
G++ YL + Y+++
Sbjct: 315 GSTLVYLPEIIYSEL 329
>gi|218194598|gb|EEC77025.1| hypothetical protein OsI_15381 [Oryza sativa Indica Group]
Length = 422
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 73/255 (28%), Positives = 120/255 (47%), Gaps = 21/255 (8%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLP-CDCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L+YT++ +G PA+ + V LDTGS FW+ C C H S + Y P +S +
Sbjct: 58 LYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPH----ESDILRKLTFYDPRSSVS 113
Query: 163 SSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLAT--DEKQSKSV 220
S +V C+ T+C + C + CPY Y +DG ++ G L D+LH Q++
Sbjct: 114 SKEVKCDDTICTSRPPC-NMTLRCPYITGY-ADGGLTMGILFTDLLHYHQLYGNGQTQPT 171
Query: 221 DSRISFGCGRVQTGSFLDGA-APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGT 278
+ ++FGCG Q+GS + A A +G+ G G + S LA G FS C S +G
Sbjct: 172 STSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTNGG 231
Query: 279 GRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS---------AIFDS 329
G + G+ P TP ++ + + + ++V G + + DS
Sbjct: 232 GIFAIGEVVEPKVKTTPI-VKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKTKGTFIDS 290
Query: 330 GTSFTYLNDPAYTQI 344
G++ YL + Y+++
Sbjct: 291 GSTLVYLPEIIYSEL 305
>gi|223942467|gb|ACN25317.1| unknown [Zea mays]
gi|413936886|gb|AFW71437.1| pepsin A [Zea mays]
Length = 491
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 112/406 (27%), Positives = 168/406 (41%), Gaps = 72/406 (17%)
Query: 12 VLLILLSCCAGCCFGFGTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRD--RY 69
+L++L + GC G F R P G +G + +AL D R+
Sbjct: 14 LLVLLFALSVGCASATGVF----QVRRKFPRHG--------GRGVAEHLAALRRHDANRH 61
Query: 70 FRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLF 129
RL G A G P DT L+YT + +G P + V +DTGSD+
Sbjct: 62 GRLLGAVDLALGGVGLP------TDT-------GLYYTRIEIGSPPKGYYVQVDTGSDIL 108
Query: 130 WLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQ------CPSAG 183
W+ +C+ C G + SG I+ Y P S T+ V C C CPS
Sbjct: 109 WV--NCIRC-DGCPTRSGLGIELTQYDPAGSGTT--VGCEQEFCVANSAGGVPPTCPSTS 163
Query: 184 SNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQSKSVDSRISFGCGRVQTGSFLDGA- 240
S C +++ Y DG+ +TGF V D + + Q+ + ++ I+FGCG Q G L +
Sbjct: 164 SPCQFRITY-GDGSTTTGFYVTDFVQYNQVSGNGQTTTSNASITFGCG-AQLGGDLGSSN 221
Query: 241 -APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGTGRISFGDKGSPGQGETPFSL 298
A +G+ G G +S+ S LA + F+ C + G G + G+ P TP
Sbjct: 222 QALDGILGFGQSDSSMLSQLAAARRVRKIFAHCLDTVRGGGIFAIGNVVQPKVKTTPLVP 281
Query: 299 RQTHPTYNITITQVSVGGNAVNFEFS---------AIFDSGTSFTYLNDPAY-TQISETF 348
TH YN+ + +SVGG + S I DSGT+ YL Y T ++ F
Sbjct: 282 NVTH--YNVNLQGISVGGATLQLPTSTFDSGDSKGTIIDSGTTLAYLPREVYRTLLAAVF 339
Query: 349 NSLAKEKRETSTSDLPFE-----YCYVLSPNQTNFEYPVVNLTMKG 389
+ DLP C+ S + +PV+ + +G
Sbjct: 340 DKY---------QDLPLHNYQDFVCFQFS-GSIDDGFPVITFSFEG 375
>gi|115457772|ref|NP_001052486.1| Os04g0334700 [Oryza sativa Japonica Group]
gi|113564057|dbj|BAF14400.1| Os04g0334700 [Oryza sativa Japonica Group]
Length = 482
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 73/255 (28%), Positives = 120/255 (47%), Gaps = 21/255 (8%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLP-CDCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L+YT++ +G PA+ + V LDTGS FW+ C C H S + Y P +S +
Sbjct: 82 LYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPH----ESDILRKLTFYDPRSSVS 137
Query: 163 SSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLAT--DEKQSKSV 220
S +V C+ T+C + C + CPY Y +DG ++ G L D+LH Q++
Sbjct: 138 SKEVKCDDTICTSRPPC-NMTLRCPYITGY-ADGGLTMGILFTDLLHYHQLYGNGQTQPT 195
Query: 221 DSRISFGCGRVQTGSFLDGA-APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGT 278
+ ++FGCG Q+GS + A A +G+ G G + S LA G FS C S +G
Sbjct: 196 STSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTNGG 255
Query: 279 GRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS---------AIFDS 329
G + G+ P TP ++ + + + ++V G + + DS
Sbjct: 256 GIFAIGEVVEPKVKTTPI-VKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKTKGTFIDS 314
Query: 330 GTSFTYLNDPAYTQI 344
G++ YL + Y+++
Sbjct: 315 GSTLVYLPEIIYSEL 329
>gi|215766660|dbj|BAG98888.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 433
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 73/255 (28%), Positives = 120/255 (47%), Gaps = 21/255 (8%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLP-CDCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L+YT++ +G PA+ + V LDTGS FW+ C C H S + Y P +S +
Sbjct: 82 LYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPH----ESDILRKLTFYDPRSSVS 137
Query: 163 SSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLAT--DEKQSKSV 220
S +V C+ T+C + C + CPY Y +DG ++ G L D+LH Q++
Sbjct: 138 SKEVKCDDTICTSRPPC-NMTLRCPYITGY-ADGGLTMGILFTDLLHYHQLYGNGQTQPT 195
Query: 221 DSRISFGCGRVQTGSFLDGA-APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGT 278
+ ++FGCG Q+GS + A A +G+ G G + S LA G FS C S +G
Sbjct: 196 STSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTNGG 255
Query: 279 GRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS---------AIFDS 329
G + G+ P TP ++ + + + ++V G + + DS
Sbjct: 256 GIFAIGEVVEPKVKTTPI-VKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKTKGTFIDS 314
Query: 330 GTSFTYLNDPAYTQI 344
G++ YL + Y+++
Sbjct: 315 GSTLVYLPEIIYSEL 329
>gi|32479948|emb|CAE01594.1| OSJNBa0008A08.2 [Oryza sativa Japonica Group]
gi|38347627|emb|CAE05222.2| OSJNBa0011K22.4 [Oryza sativa Japonica Group]
gi|38567678|emb|CAE75961.1| B1159F04.24 [Oryza sativa Japonica Group]
gi|116309512|emb|CAH66578.1| OSIGBa0137O04.4 [Oryza sativa Indica Group]
Length = 431
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 73/255 (28%), Positives = 120/255 (47%), Gaps = 21/255 (8%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWL-PCDCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L+YT++ +G PA+ + V LDTGS FW+ C C H S + Y P +S +
Sbjct: 58 LYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPH----ESDILRKLTFYDPRSSVS 113
Query: 163 SSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLAT--DEKQSKSV 220
S +V C+ T+C + C + CPY Y +DG ++ G L D+LH Q++
Sbjct: 114 SKEVKCDDTICTSRPPC-NMTLRCPYITGY-ADGGLTMGILFTDLLHYHQLYGNGQTQPT 171
Query: 221 DSRISFGCGRVQTGSFLDGA-APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGT 278
+ ++FGCG Q+GS + A A +G+ G G + S LA G FS C S +G
Sbjct: 172 STSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTNGG 231
Query: 279 GRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS---------AIFDS 329
G + G+ P TP ++ + + + ++V G + + DS
Sbjct: 232 GIFAIGEVVEPKVKTTPI-VKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKTKGTFIDS 290
Query: 330 GTSFTYLNDPAYTQI 344
G++ YL + Y+++
Sbjct: 291 GSTLVYLPEIIYSEL 305
>gi|356523724|ref|XP_003530485.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 488
Score = 102 bits (253), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 87/296 (29%), Positives = 142/296 (47%), Gaps = 32/296 (10%)
Query: 71 RLRGRGLAA-QGND-KTPLTFSAGND-----TYRLNSLGFLHYTNVSVGQPALSFIVALD 123
+ + R L+A + +D + L+ AG D + R +++G L+Y + +G P ++ + +D
Sbjct: 43 KYQDRSLSALKAHDYRRQLSLLAGVDLPLGGSGRPDAVG-LYYAKIGIGTPPKNYYLQVD 101
Query: 124 TGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE-----LQK 177
TGSD+ W+ C C C + S +D +Y SS+ VPC+ C+ L
Sbjct: 102 TGSDIMWVNCIQCKEC----PTRSSLGMDLTLYDIKESSSGKLVPCDQEFCKEINGGLLT 157
Query: 178 QCPSAGSNCPYQVRYLSDGTMSTGFLVEDVL--HLATDEKQSKSVDSRISFGCGRVQTG- 234
C +A +CPY Y DG+ + G+ V+D++ + + ++ S + I FGCG Q+G
Sbjct: 158 GC-TANISCPYLEIY-GDGSSTAGYFVKDIVLYDQVSGDLKTDSANGSIVFGCGARQSGD 215
Query: 235 -SFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-GSDGTGRISFGDKGSPGQG 292
S + A +G+ G G +S+ S LA+ G + F+ C G +G G + G P
Sbjct: 216 LSSSNEEALDGILGFGKANSSMISQLASSGKVKKMFAHCLNGVNGGGIFAIGHVVQPKVN 275
Query: 293 ETPFSLRQTHPTYNITITQV-------SVGGNAVNFEFSAIFDSGTSFTYLNDPAY 341
TP Q H + N+T QV S +A I DSGT+ YL + Y
Sbjct: 276 MTPLLPDQPHYSVNMTAVQVGHTFLSLSTDTSAQGDRKGTIIDSGTTLAYLPEGIY 331
>gi|115484503|ref|NP_001065913.1| Os11g0183900 [Oryza sativa Japonica Group]
gi|62954888|gb|AAY23257.1| nucellin-like aspartic protease [Oryza sativa Japonica Group]
gi|77549017|gb|ABA91814.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113644617|dbj|BAF27758.1| Os11g0183900 [Oryza sativa Japonica Group]
gi|222615638|gb|EEE51770.1| hypothetical protein OsJ_33210 [Oryza sativa Japonica Group]
Length = 418
Score = 101 bits (252), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 90/284 (31%), Positives = 127/284 (44%), Gaps = 47/284 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
+Y +++G PA + + +DTGSDL WL CD C SC + +Y P +
Sbjct: 57 YYVTMNIGDPAKPYFLDVDTGSDLTWLQCDAPCQSC---------NKVPHPLYRPTKNKL 107
Query: 163 SSKVPCNSTLCEL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEK 215
VPC +++C K+C + C YQ++Y +D S G LV D L K
Sbjct: 108 ---VPCANSICTALHSGSSPNKKC-TTQQQCDYQIKY-TDKASSLGVLVMDSFSLPLRNK 162
Query: 216 QSKSVDSRISFGCG-RVQTGSFLDGAAP---NGLFGLGMDKTSVPSILANQGLIPNSFSM 271
+V +SFGCG Q G +GAAP +GL GLG S+ S L QG+ N
Sbjct: 163 S--NVRPSLSFGCGYDQQVGK--NGAAPATTDGLLGLGRGSVSLLSQLKQQGITKNVLGH 218
Query: 272 CFGSDGTGRISFGDKGSPGQGETPFSL-RQTHPTYNITITQVSVGGNAVNFE-------- 322
C + G G + FGD P T S+ R T Y S G + F+
Sbjct: 219 CLSTSGGGFLFFGDDMVPTSRVTWVSMVRSTSGNY------YSPGSATLYFDRRSLSTKP 272
Query: 323 FSAIFDSGTSFTYLN-DPAYTQISETFNSLAKEKRETSTSDLPF 365
+FDSG+++TY + P IS SL+K ++ S LP
Sbjct: 273 MEVVFDSGSTYTYFSAQPYQATISAIKGSLSKSLKQVSDPSLPL 316
>gi|225458774|ref|XP_002283258.1| PREDICTED: aspartic proteinase-like protein 2 [Vitis vinifera]
gi|302142232|emb|CBI19435.3| unnamed protein product [Vitis vinifera]
Length = 659
Score = 101 bits (252), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 89/335 (26%), Positives = 152/335 (45%), Gaps = 35/335 (10%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
T + +G P F + +DTGS + ++PC DC C G+ D + P+ SST
Sbjct: 90 TRLWIGTPPQEFALIVDTGSTVTYVPCSDCEHC--------GKHQDPR-FQPDESSTYHP 140
Query: 166 VPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
V CN + C G NC Y+ RY ++ + S+G L ED++ QS+ V R
Sbjct: 141 VKCN-----MDCNCDHDGVNCVYERRY-AEMSSSSGVLGEDIISFGN---QSEVVPQRAV 191
Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGD 285
FGC V+TG A +G+ GLG + S+ L ++ +I +SFS+C+G G +
Sbjct: 192 FGCENVETGDLYSQRA-DGIMGLGRGQLSIVDQLVDKNVINDSFSLCYGGMHVGGGAMVL 250
Query: 286 KGSPGQGETPFSLRQTH--PTYNITITQVSVGGNAVNF-------EFSAIFDSGTSFTYL 336
G P + FS + P YNI + ++ V G + + + DSGT++ YL
Sbjct: 251 GGIPPPPDMVFSRSDPYRSPYYNIELKEIHVAGKPLKLSPSTFDRKHGTVLDSGTTYAYL 310
Query: 337 NDPAYTQISETFNSLAKEKRETSTSDLPF-EYCYV---LSPNQTNFEYPVVNLTMKGGGP 392
+ A+ + + ++ D + + C+ +Q + +P V++ G
Sbjct: 311 PEEAFVAFRDAIIKKSHNLKQIHGPDPNYNDICFSGAGRDVSQLSKAFPEVDMVFSNGQK 370
Query: 393 FFVNDPIVIVSSEPKGLYLYCLGVVKS-DNVNIIG 426
+ P + K YCLG+ ++ D+ ++G
Sbjct: 371 LSLT-PENYLFQHTKVHGAYCLGIFRNGDSTTLLG 404
>gi|218199944|gb|EEC82371.1| hypothetical protein OsI_26705 [Oryza sativa Indica Group]
Length = 642
Score = 101 bits (252), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 76/262 (29%), Positives = 126/262 (48%), Gaps = 25/262 (9%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFN--IYSPNTSSTSS 164
T + +G P+ F + +D+GS + ++PC S S +I+ + + P+ SST S
Sbjct: 94 TRLYIGTPSQEFALIVDSGSTVTYVPCATCEQCGNHQSESPNIIEAHDPRFQPDLSSTYS 153
Query: 165 KVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRI 224
V CN + C + S C Y+ +Y ++ + S+G L ED++ K+S+ R
Sbjct: 154 PVKCN-----VDCTCDNERSQCTYERQY-AEMSSSSGVLGEDIMSFG---KESELKPQRA 204
Query: 225 SFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFG 284
FGC +TG A +G+ GLG + S+ L +G+I +SFS+C+G G +
Sbjct: 205 VFGCENTETGDLFSQHA-DGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDVGGGTMV 263
Query: 285 DKGSPGQGETPFSLRQ--THPTYNITITQVSVGGNAV-------NFEFSAIFDSGTSFTY 335
G P + FS P YNI + ++ V G A+ N + + DSGT++ Y
Sbjct: 264 LGGMPAPPDMVFSHSNPVRSPYYNIELKEIHVAGKALRLDPKIFNSKHGTVLDSGTTYAY 323
Query: 336 LNDPAYT----QISETFNSLAK 353
L + A+ ++ NSL K
Sbjct: 324 LPEQAFVAFKDAVTNKVNSLKK 345
>gi|222637379|gb|EEE67511.1| hypothetical protein OsJ_24961 [Oryza sativa Japonica Group]
Length = 641
Score = 101 bits (252), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 76/262 (29%), Positives = 126/262 (48%), Gaps = 25/262 (9%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFN--IYSPNTSSTSS 164
T + +G P+ F + +D+GS + ++PC S S +I+ + + P+ SST S
Sbjct: 93 TRLYIGTPSQEFALIVDSGSTVTYVPCATCEQCGNHQSESPNIIEAHDPRFQPDLSSTYS 152
Query: 165 KVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRI 224
V CN + C + S C Y+ +Y ++ + S+G L ED++ K+S+ R
Sbjct: 153 PVKCN-----VDCTCDNERSQCTYERQY-AEMSSSSGVLGEDIMSFG---KESELKPQRA 203
Query: 225 SFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFG 284
FGC +TG A +G+ GLG + S+ L +G+I +SFS+C+G G +
Sbjct: 204 VFGCENTETGDLFSQHA-DGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDVGGGTMV 262
Query: 285 DKGSPGQGETPFSLRQ--THPTYNITITQVSVGGNAV-------NFEFSAIFDSGTSFTY 335
G P + FS P YNI + ++ V G A+ N + + DSGT++ Y
Sbjct: 263 LGGMPAPPDMVFSHSNPVRSPYYNIELKEIHVAGKALRLDPKIFNSKHGTVLDSGTTYAY 322
Query: 336 LNDPAYT----QISETFNSLAK 353
L + A+ ++ NSL K
Sbjct: 323 LPEQAFVAFKDAVTNKVNSLKK 344
>gi|357137832|ref|XP_003570503.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 564
Score = 101 bits (252), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 83/275 (30%), Positives = 124/275 (45%), Gaps = 36/275 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
+YT++ VG P + + +DTGSDL W+ CD C +C G + +Y P +
Sbjct: 194 YYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHP---------LYKP---AK 241
Query: 163 SSKVPCNSTLCEL----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
VP LC+ Q C + C Y++ Y +D + S G L +D +H+ +
Sbjct: 242 EKIVPPRDLLCQELQGDQNYCATC-KQCDYEIEY-ADRSSSMGVLAKDDMHMIATNGGRE 299
Query: 219 SVDSRISFGCGRVQTGSFLDGAAP-NGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD- 276
+D FGC Q G L A +G+ GL S+PS LA+QG+I N F C +
Sbjct: 300 KLD--FVFGCAYDQQGQLLTSPAKTDGILGLSSAAISLPSQLASQGIISNVFGHCITKEP 357
Query: 277 -GTGRISFGDKGSPGQGETPFSLR-QTHPTYNITITQVSVGGNAVNFEFSA------IFD 328
G G + GD P G T +R Y+ +V+ G + A IFD
Sbjct: 358 NGGGYMFLGDDYVPRWGMTWAPIRGGPDNLYHTEAQKVNYGDQQLRMHGQAGSSIQVIFD 417
Query: 329 SGTSFTYLNDPAY----TQISETFNSLAKEKRETS 359
SG+S+TYL D Y T I + S ++ +T+
Sbjct: 418 SGSSYTYLPDEIYKKLVTAIKYDYPSFVQDTSDTT 452
>gi|242056193|ref|XP_002457242.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
gi|241929217|gb|EES02362.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
Length = 457
Score = 101 bits (251), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 114/427 (26%), Positives = 170/427 (39%), Gaps = 40/427 (9%)
Query: 18 SCCAGCCFGFGTFGFDFHHRYS--DPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGR 75
+C A G G F DF HR S P + P A A A R + GR
Sbjct: 21 TCTASAAAGEGGFSVDFIHRDSARSPYR-------HPALSPHARALAAARRSLRGEVLGR 73
Query: 76 GLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDC 135
+ P++ + G ++ + F + V+VG P + DTGSDL W+ C
Sbjct: 74 SYSGASPAAAPVSAADGGVESKIITRSFEYLMYVNVGTPPTQLLAIADTGSDLVWVNC-- 131
Query: 136 VSCVHGLNSSSGQVIDFN-----IYSPNTSSTSSKVPCNSTLCELQKQCP-SAGSNCPYQ 189
+SS G + D + ++ P SST S++ C S C+ Q A S C YQ
Sbjct: 132 -------SSSGGGLADADAGGNVVFQPTRSSTYSQLSCQSNACQALSQASCDADSECQYQ 184
Query: 190 VRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLG 249
Y DG+ + G L + + + R++FGC G+F +GL GLG
Sbjct: 185 YSY-GDGSRTIGVLSTETFSFVDGGGKGQVRVPRVNFGCSTASAGTFRS----DGLVGLG 239
Query: 250 MDKTSVPSILANQGLIPNSFSMC----FGSDGTGRISFGDKG---SPGQGETPFSLRQTH 302
S+ S L I S C + ++ + ++FG + PG TP
Sbjct: 240 AGAFSLVSQLGATTHIDRKLSYCLIPSYDANSSSTLNFGSRAVVSEPGAASTPLVPSDVD 299
Query: 303 PTYNITITQVSVGGNAVNFEFSAIF-DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTS 361
Y + + V+VGG V S I DSGT+ T+L+ + K +R
Sbjct: 300 SYYTVALESVAVGGQEVATHDSRIIVDSGTTLTFLDPALLGPLVTELERRIKLQRVQPPE 359
Query: 362 DLPFEYCY-VLSPNQT-NFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKS 419
L + CY V ++T NF P V L GG + + L L + V +S
Sbjct: 360 QL-LQLCYDVQGKSETDNFGIPDVTLRFGGGAAVTLRPENTFSLLQEGTLCLVLVPVSES 418
Query: 420 DNVNIIG 426
V+I+G
Sbjct: 419 QPVSILG 425
>gi|242062640|ref|XP_002452609.1| hypothetical protein SORBIDRAFT_04g028990 [Sorghum bicolor]
gi|241932440|gb|EES05585.1| hypothetical protein SORBIDRAFT_04g028990 [Sorghum bicolor]
Length = 557
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 80/256 (31%), Positives = 112/256 (43%), Gaps = 32/256 (12%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
+YT++ VG P + + +DTGSDL W+ CD C +C G + +Y P
Sbjct: 187 YYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHP---------LYKPTKEKI 237
Query: 163 SSKVPCNSTLCEL----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
VP LC+ Q C + C Y++ Y +D + S G L D +HL +
Sbjct: 238 ---VPPRDLLCQELQGNQNYCETC-KQCDYEIEY-ADQSSSMGVLARDDMHLIATNGGRE 292
Query: 219 SVDSRISFGCGRVQTGSFLDGAAP-NGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD- 276
+D FGC Q G L A +G+ GL S+PS LA+ G+I N F C +
Sbjct: 293 KLD--FVFGCAYDQQGQLLSSPAKTDGILGLSNAAISLPSQLASHGIISNIFGHCITREQ 350
Query: 277 -GTGRISFGDKGSPGQGETPFSLRQ-THPTYNITITQVSVGGNAVNFEFSA------IFD 328
G G + GD P G T S+R Y+ V G + A IFD
Sbjct: 351 GGGGYMFLGDDYVPRWGITWTSIRSGPDNLYHTEAHHVKYGDQQLRMREQAGNTVQVIFD 410
Query: 329 SGTSFTYLNDPAYTQI 344
SG+S+TYL D Y +
Sbjct: 411 SGSSYTYLPDEIYENL 426
>gi|357464807|ref|XP_003602685.1| Aspartic proteinase Asp1 [Medicago truncatula]
gi|355491733|gb|AES72936.1| Aspartic proteinase Asp1 [Medicago truncatula]
Length = 440
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 89/272 (32%), Positives = 122/272 (44%), Gaps = 28/272 (10%)
Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPN 158
+GF + T +++G P + + +DTGSDL WL CD C C + +Y P
Sbjct: 82 VGFYNVT-INIGYPPRPYFLDIDTGSDLTWLQCDAPCSRCSQTPHP---------LYRP- 130
Query: 159 TSSTSSKVPCNSTLCELQKQCPS----AGSNCPYQVRYLSDGTMSTGFLVEDVLHLA-TD 213
++ VPC LC Q + C Y+V Y +D S G LV DV L T+
Sbjct: 131 ---SNDLVPCRHPLCASVHQTDNYECEVEHQCDYEVEY-ADHYSSLGVLVNDVYVLNFTN 186
Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
Q K R++ GCG Q +G+ GLG K+S+ S L QGL+ N C
Sbjct: 187 GVQLKV---RMALGCGYDQIFPDSSYHPVDGMLGLGRGKSSLISQLNGQGLVRNVVGHCL 243
Query: 274 GSDGTGRISFGD-KGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-EFSAIFDSGT 331
+ G G I FGD S TP S R + Y+ ++ +GG F A+FD+G+
Sbjct: 244 SAQGGGYIFFGDVYDSSRLAWTPMSSRD-YKHYSAGAAELVLGGKRTGFGNLLAVFDAGS 302
Query: 332 SFTYLNDPAYTQISETFNSLAKEKRETSTSDL 363
S+TY N AY E KE E T L
Sbjct: 303 SYTYFNSNAYQLTKELAGKPIKEAPEDQTLPL 334
>gi|218185380|gb|EEC67807.1| hypothetical protein OsI_35373 [Oryza sativa Indica Group]
Length = 418
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 89/284 (31%), Positives = 126/284 (44%), Gaps = 47/284 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
+Y +++G PA + + +DTGSDL WL CD C SC + +Y P +
Sbjct: 57 YYVTMNIGDPAKPYFLDVDTGSDLTWLQCDAPCQSC---------NKVPHPLYRPTKNKL 107
Query: 163 SSKVPCNSTLCEL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEK 215
VPC +++C K+C + C YQ++Y +D S G LV D L K
Sbjct: 108 ---VPCANSICTALHSGSSPNKKC-TTQQQCDYQIKY-TDKASSLGVLVTDSFSLPLRNK 162
Query: 216 QSKSVDSRISFGCG-RVQTGSFLDGAAP---NGLFGLGMDKTSVPSILANQGLIPNSFSM 271
+V +SFGCG Q G +GAAP +GL GLG S+ S L QG+ N
Sbjct: 163 S--NVRPSLSFGCGYDQQVGK--NGAAPATTDGLLGLGRGSVSLLSQLKQQGITKNVLGH 218
Query: 272 CFGSDGTGRISFGDKGSPGQGETPFSL-RQTHPTYNITITQVSVGGNAVNFE-------- 322
C + G G + FGD P T + R T Y S G + F+
Sbjct: 219 CLSTSGGGFLFFGDDMVPTSRVTWVPMVRSTSGNY------YSPGSATLYFDRRSLSTKP 272
Query: 323 FSAIFDSGTSFTYLN-DPAYTQISETFNSLAKEKRETSTSDLPF 365
+FDSG+++TY + P IS SL+K ++ S LP
Sbjct: 273 MEVVFDSGSTYTYFSAQPYQATISAIKGSLSKSLKQVSDPSLPL 316
>gi|296087361|emb|CBI33735.3| unnamed protein product [Vitis vinifera]
Length = 633
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 88/329 (26%), Positives = 147/329 (44%), Gaps = 34/329 (10%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
T + +G P +F + +DTGS L ++PC C C G+ D N + P+ SST
Sbjct: 94 TRIWIGTPPQTFALIVDTGSTLTYVPCSTCEQC--------GKHQDPN-FQPDWSSTYQP 144
Query: 166 VPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
+ C+ ++ C S +C Y +Y ++ + S+G L ED++ KQS+ R
Sbjct: 145 LKCS-----MECTCDSEMMHCVYDRQY-AEMSSSSGVLGEDIVSFG---KQSELKPQRTV 195
Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISF 283
FGC V+TG A +G+ GLG S+ L +G+I NSFS+C+G G G +
Sbjct: 196 FGCENVETGDIYSQRA-DGIMGLGRGDLSIVDQLVEKGVIGNSFSLCYGGMDVGGGAMVL 254
Query: 284 GDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-------EFSAIFDSGTSFTYL 336
G P S YNI + ++ + G + ++ I DSGT++ YL
Sbjct: 255 GGISPPAGMVFTHSDPARSAYYNIDLKEIHIAGKQLPINPMVFDGKYGTILDSGTTYAYL 314
Query: 337 NDPAYTQISETFNSLAKEKRETSTSDLPF-EYCYV---LSPNQTNFEYPVVNLTMKGGGP 392
+PA+ + + D + + C+ +Q + +P V+L G
Sbjct: 315 PEPAFKAFKDAIMKELNSLKLIQGPDRNYNDICFSGVGSDVSQLSKTFPAVDLVFSNGNR 374
Query: 393 FFVNDPIVIVSSEPKGLYLYCLGVVKSDN 421
++ P + K YCLG+ +++N
Sbjct: 375 LSLS-PENYLFQHSKAHGAYCLGIFQNEN 402
>gi|225438908|ref|XP_002279194.1| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
Length = 634
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 88/329 (26%), Positives = 147/329 (44%), Gaps = 34/329 (10%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
T + +G P +F + +DTGS L ++PC C C G+ D N + P+ SST
Sbjct: 94 TRIWIGTPPQTFALIVDTGSTLTYVPCSTCEQC--------GKHQDPN-FQPDWSSTYQP 144
Query: 166 VPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
+ C+ ++ C S +C Y +Y ++ + S+G L ED++ KQS+ R
Sbjct: 145 LKCS-----MECTCDSEMMHCVYDRQY-AEMSSSSGVLGEDIVSFG---KQSELKPQRTV 195
Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISF 283
FGC V+TG A +G+ GLG S+ L +G+I NSFS+C+G G G +
Sbjct: 196 FGCENVETGDIYSQRA-DGIMGLGRGDLSIVDQLVEKGVIGNSFSLCYGGMDVGGGAMVL 254
Query: 284 GDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-------EFSAIFDSGTSFTYL 336
G P S YNI + ++ + G + ++ I DSGT++ YL
Sbjct: 255 GGISPPAGMVFTHSDPARSAYYNIDLKEIHIAGKQLPINPMVFDGKYGTILDSGTTYAYL 314
Query: 337 NDPAYTQISETFNSLAKEKRETSTSDLPF-EYCYV---LSPNQTNFEYPVVNLTMKGGGP 392
+PA+ + + D + + C+ +Q + +P V+L G
Sbjct: 315 PEPAFKAFKDAIMKELNSLKLIQGPDRNYNDICFSGVGSDVSQLSKTFPAVDLVFSNGNR 374
Query: 393 FFVNDPIVIVSSEPKGLYLYCLGVVKSDN 421
++ P + K YCLG+ +++N
Sbjct: 375 LSLS-PENYLFQHSKAHGAYCLGIFQNEN 402
>gi|168013126|ref|XP_001759252.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689565|gb|EDQ75936.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 385
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 99/306 (32%), Positives = 138/306 (45%), Gaps = 37/306 (12%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ VSVG P + +DTGSD+ WL C CVSC H + ++ P SST
Sbjct: 37 YFIRVSVGTPPRGMYLVMDTGSDILWLQCAPCVSCYHQCD---------EVFDPYKSSTY 87
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
S + CNS C G+ C YQV Y DG+ STG D + L + + V ++
Sbjct: 88 STLGCNSRQCLNLDVGGCVGNKCLYQVDY-GDGSFSTGEFATDAVSLNSTSGGGQVVLNK 146
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGTGR 280
I GCG G F+ A GL S P+ + ++ FS C +D T R
Sbjct: 147 IPLGCGHDNEGYFVGAAGLLGLG---KGPLSFPNQINSEN--GGRFSYCLTGRDTDSTER 201
Query: 281 IS--FGDKGSPGQGE--TPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA---------- 325
S FGD P G TP + T Y + +T +SVGG+ + SA
Sbjct: 202 SSLIFGDAAVPPAGVRFTPQASNLRVSTFYYLKMTGISVGGSILTIPTSAFQLDSLGNGG 261
Query: 326 -IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVN 384
I DSGTS T L + AY + E F + + T+ L F+ CY LS + ++ + P V
Sbjct: 262 VIIDSGTSVTRLQNAAYASLREAFRAGTSDLVLTTEFSL-FDTCYNLS-DLSSVDVPTVT 319
Query: 385 LTMKGG 390
L +GG
Sbjct: 320 LHFQGG 325
>gi|242067689|ref|XP_002449121.1| hypothetical protein SORBIDRAFT_05g005400 [Sorghum bicolor]
gi|241934964|gb|EES08109.1| hypothetical protein SORBIDRAFT_05g005400 [Sorghum bicolor]
Length = 358
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 82/281 (29%), Positives = 122/281 (43%), Gaps = 43/281 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
+Y +++G PA + + +DTGSDL WL CD C SC + +Y P +S
Sbjct: 54 YYVTMNIGNPAKPYFLDVDTGSDLTWLQCDAPCRSC---------NKVPHPLYRPTANSL 104
Query: 163 SSKVPCNSTLCEL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEK 215
VPC + LC +CPS C YQ++Y +D S G L+ D L
Sbjct: 105 ---VPCANALCTALHSGHGSNNKCPSP-KQCDYQIKY-TDSASSQGVLINDNFSLPM--- 156
Query: 216 QSKSVDSRISFGCGRVQT--GSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
+S ++ ++FGCG Q + AA +G+ GLG S+ S L QG+ N C
Sbjct: 157 RSSNIRPGLTFGCGYDQQVGKNGAVQAATDGMLGLGRGSVSLVSQLKQQGITKNVLGHCL 216
Query: 274 GSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE--------FSA 325
++G G + FGD P S P I+ S G + F+
Sbjct: 217 STNGGGFLFFGDD------IVPTSRVTWVPMAKISGNYYSPGSGTLYFDRRSLGVKPMEV 270
Query: 326 IFDSGTSFTYLNDPAYTQISETFNS-LAKEKRETSTSDLPF 365
+FDSG+++TY Y + S L+K ++ S LP
Sbjct: 271 VFDSGSTYTYFTAQPYQAVVSALKSGLSKSLKQVSDPSLPL 311
>gi|356568907|ref|XP_003552649.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 490
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 76/256 (29%), Positives = 122/256 (47%), Gaps = 24/256 (9%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L+Y + +G P ++ + +DTGSD+ W+ C C C + S +D +Y SS+
Sbjct: 84 LYYAKIGIGTPPKNYYLQVDTGSDIMWVNCIQCKEC----PTRSNLGMDLTLYDIKESSS 139
Query: 163 SSKVPCNSTLCE-----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVL--HLATDEK 215
VPC+ C+ L C +A +CPY Y DG+ + G+ V+D++ + +
Sbjct: 140 GKFVPCDQEFCKEINGGLLTGC-TANISCPYLEIY-GDGSSTAGYFVKDIVLYDQVSGDL 197
Query: 216 QSKSVDSRISFGCGRVQTG--SFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
++ S + I FGCG Q+G S + A G+ G G +S+ S LA+ G + F+ C
Sbjct: 198 KTDSANGSIVFGCGARQSGDLSSSNEEALGGILGFGKANSSMISQLASSGKVKKMFAHCL 257
Query: 274 -GSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS-------A 325
G +G G + G P TP Q H + N+T QV +++ + S
Sbjct: 258 NGVNGGGIFAIGHVVQPKVNMTPLLPDQPHYSVNMTAVQVGHAFLSLSTDTSTQGDRKGT 317
Query: 326 IFDSGTSFTYLNDPAY 341
I DSGT+ YL + Y
Sbjct: 318 IIDSGTTLAYLPEGIY 333
>gi|326503488|dbj|BAJ86250.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 551
Score = 99.0 bits (245), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 94/339 (27%), Positives = 149/339 (43%), Gaps = 44/339 (12%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
+YT++ VG P + + +DTGSDL W+ CD C +C G + +Y P +
Sbjct: 191 YYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHP---------LYKP---AK 238
Query: 163 SSKVPCNSTLCEL----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
VP +LC+ Q C + C Y++ Y +D + S G L +D +HL +
Sbjct: 239 EKIVPPRDSLCQELQGDQNYCETC-KQCDYEIEY-ADRSSSMGVLAKDDMHLIATNGGRE 296
Query: 219 SVDSRISFGCGRVQTGSFLDGAAP-NGLFGLGMDKTSVPSILANQGLIPNSFSMCF--GS 275
+D FGC Q G L A +G+ GL S+PS LA++G+I N F C +
Sbjct: 297 KLD--FVFGCAYDQQGQLLSSPAKTDGILGLSSAAISLPSQLASKGIISNVFGHCITRET 354
Query: 276 DGTGRISFGDKGSPGQGETPFSLR-QTHPTYNITITQVSVGGNAVNF--EFSAIFDSGTS 332
+G G + GD P G T +R Y+ +V+ G ++ IFDSG+S
Sbjct: 355 NGGGYMFLGDDYVPRWGMTWAPIRGGPDNLYHTEAQKVNYGDQELHAGNSVQVIFDSGSS 414
Query: 333 FTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGP 392
+TYL + Y + + + + S SD C+ + +F P L + G
Sbjct: 415 YTYLPEEMYKNLIDAIKEDSPSFVQDS-SDTTLPLCWKADFSVRSFFKP---LNLHFGRR 470
Query: 393 FF--------VNDPIVIVSSEPKGLYLYCLGVVKSDNVN 423
+F V D +I+S + CLG++ +N
Sbjct: 471 WFVVPKTFTIVPDDYLIISDKGN----VCLGLLNGTEIN 505
>gi|413947545|gb|AFW80194.1| hypothetical protein ZEAMMB73_386053 [Zea mays]
Length = 456
Score = 99.0 bits (245), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 110/414 (26%), Positives = 168/414 (40%), Gaps = 32/414 (7%)
Query: 28 GTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPL 87
G F DF HR D + A LP + + R GR + P+
Sbjct: 28 GGFSVDFIHR--DSARSPFAQPSLPPHARALAAARRSLRGAAL---GRYVGGASPAPGPV 82
Query: 88 TFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSG 147
+ G ++ + F + V+VG P + DTGSDL W+ +C S G +S G
Sbjct: 83 PEADGGVESKIITRSFEYLMYVNVGTPPAQMLAIADTGSDLVWV--NCSSNGGGGGASDG 140
Query: 148 QVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCP-SAGSNCPYQVRYLSDGTMSTGFLVED 206
V ++ P+ S+T S + C S C+ Q A S C YQ Y DG+ + G L +
Sbjct: 141 AV----VFHPSRSTTYSLLSCQSAACQALSQASCDADSECQYQYAY-GDGSRTIGVLSTE 195
Query: 207 VLHL--ATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGL 264
A + + R+SFGC GSF +GL GLG S+ S L
Sbjct: 196 TFSFAAAGGGGEGQVRVPRVSFGCSTGSAGSFRS----DGLVGLGAGALSLVSQLGAAAR 251
Query: 265 IPNSFSMCF-----GSDGTGRISFGDKG---SPGQGETPFSLRQTHPTYNITITQVSVGG 316
I FS C ++ + +SFG + PG TP + Y + + V+V G
Sbjct: 252 IARRFSYCLVPPYAAANSSSTLSFGARAVVSDPGAASTPLVPSEVDSYYTVALESVAVAG 311
Query: 317 NAVNFEFSA--IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPN 374
V S+ I DSGT+ T+L+ + + R L + CY +
Sbjct: 312 QDVASANSSRIIVDSGTTLTFLDPALLRPLVAELERRIRLPRAQPPEQL-LQLCYDVQGK 370
Query: 375 QTNFEYPVVNLTMK-GGGPFFVNDPIVIVSSEPKG-LYLYCLGVVKSDNVNIIG 426
++ + ++T++ GGG P S +G L L + V +S V+I+G
Sbjct: 371 SQAEDFGIPDVTLRFGGGASVTLRPENTFSLLEEGTLCLVLVPVSESQPVSILG 424
>gi|449459186|ref|XP_004147327.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 418
Score = 99.0 bits (245), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 83/278 (29%), Positives = 121/278 (43%), Gaps = 28/278 (10%)
Query: 102 GFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNT 159
GF + T + VGQP + + DTGSDL WL CD C C L+ +Y P
Sbjct: 55 GFYNVT-LYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHP---------LYQP-- 102
Query: 160 SSTSSKVPCNSTLC-----ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
++ VPC LC + +C + C Y+V Y +DG S G LV DV L +
Sbjct: 103 --SNDLVPCKDPLCMSLHSSMDHRCENP-DQCDYEVEY-ADGGSSLGVLVRDVFPL--NL 156
Query: 215 KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG 274
+ R++ GCG Q +G+ GLG S+ S L NQG++ N CF
Sbjct: 157 TNGDPIRPRLALGCGYDQDPGSSSYHPMDGILGLGRGAVSIVSQLHNQGIVRNVVGHCFN 216
Query: 275 SDGTGRISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFE-FSAIFDSGTS 332
S G G + FGD + + +P Y+ ++ G + +FDSG+S
Sbjct: 217 SKGGGYLFFGDGIYDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSS 276
Query: 333 FTYLNDPAYTQISETFN-SLAKEKRETSTSDLPFEYCY 369
+TY N AY ++ N LA + + D C+
Sbjct: 277 YTYFNAQAYQVLTSLLNRELAGKPLREAMDDDTLPLCW 314
>gi|449442281|ref|XP_004138910.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449506266|ref|XP_004162699.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 482
Score = 98.6 bits (244), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 89/284 (31%), Positives = 131/284 (46%), Gaps = 27/284 (9%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L++ + +G P + V +DTGSD+ W+ C C +C S I+ ++YSP++SST
Sbjct: 73 LYFAKIGLGTPVQDYYVQVDTGSDILWVNCAGCTNC----PKKSDLGIELSLYSPSSSST 128
Query: 163 SSKVPCNSTLCELQKQCPSAGSN----CPYQVRYLSDGTMSTGFLVED--VLHLATDEKQ 216
S++V CN C P G C Y+V Y DG+ + G+ V D VL T Q
Sbjct: 129 SNRVTCNQDFCTSTYDGPIPGCTPELLCEYRVAY-GDGSSTAGYFVRDHVVLDRVTGNFQ 187
Query: 217 SKSVDSRISFGCGRVQTGSF-LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
+ S + I FGCG Q+G AA +G+ G G +S+ S LA+ G + F+ C +
Sbjct: 188 TTSTNGSIVFGCGAQQSGQLGATSAALDGILGFGQANSSMISQLASSGKVKRVFAHCLDN 247
Query: 276 -DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVN---------FEFSA 325
+G G + G+ P TP +Q H YN+ + + V +N
Sbjct: 248 INGGGIFAIGEVVQPKVRTTPLVPQQAH--YNVFMKAIEVDNEVLNLPTDVFDTDLRKGT 305
Query: 326 IFDSGTSFTYLNDPAYT-QISETFNSLAKEKRETSTSDLP-FEY 367
I DSGT+ Y D Y IS+ F + K T FEY
Sbjct: 306 IIDSGTTLAYFPDVIYEPLISKIFARQSTLKLHTVEEQFTCFEY 349
>gi|115479815|ref|NP_001063501.1| Os09g0482200 [Oryza sativa Japonica Group]
gi|50725878|dbj|BAD33407.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50725881|dbj|BAD33410.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113631734|dbj|BAF25415.1| Os09g0482200 [Oryza sativa Japonica Group]
gi|125606112|gb|EAZ45148.1| hypothetical protein OsJ_29786 [Oryza sativa Japonica Group]
Length = 485
Score = 98.6 bits (244), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 99/310 (31%), Positives = 139/310 (44%), Gaps = 42/310 (13%)
Query: 97 RLNSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNI 154
R SLG +Y +V +G PA + V DTGSDL W+ C C C + +
Sbjct: 140 RGISLGTGNYVVSVGLGTPAKQYAVIFDTGSDLSWVQCKPCADCYEQQDP---------L 190
Query: 155 YSPNTSSTSSKVPCNSTLC-ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
+ P+ SST + V C + C EL S+ S C Y+V+Y D + + G LV D L L+
Sbjct: 191 FDPSLSSTYAAVACGAPECQELDASGCSSDSRCRYEVQY-GDQSQTDGNLVRDTLTLSAS 249
Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPN---SFS 270
+ V FGCG G F +GLFGLG +K S+PS QG P+ F+
Sbjct: 250 DTLPGFV-----FGCGDQNAGLF---GQVDGLFGLGREKVSLPS----QG-APSYGPGFT 296
Query: 271 MCFGSDGTGR--ISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF------- 321
C S +GR +S G T + T Y I + + VGG A+
Sbjct: 297 YCLPSSSSGRGYLSLGGAPPANAQFTALADGATPSFYYIDLVGIKVGGRAIRIPATAFAA 356
Query: 322 EFSAIFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEYCYVLSPNQTNFEY 380
+ DSGT T L AY + F S+A+ K+ + S L + CY + ++T +
Sbjct: 357 AGGTVIDSGTVITRLPPRAYAPLRAAFARSMAQYKKAPALSIL--DTCYDFTGHRTA-QI 413
Query: 381 PVVNLTMKGG 390
P V L GG
Sbjct: 414 PTVELAFAGG 423
>gi|449518783|ref|XP_004166415.1| PREDICTED: aspartic proteinase-like protein 2-like, partial
[Cucumis sativus]
Length = 420
Score = 98.6 bits (244), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 90/312 (28%), Positives = 142/312 (45%), Gaps = 39/312 (12%)
Query: 55 GSFAYYSALAHRDRYFR-LRGRGLAAQGNDKTPLTFSAGND-----TYRLNSLGFLHYTN 108
G F+ A R+R L+ ++ Q L F AG D + R +++G L+Y
Sbjct: 38 GVFSVKYKYAGRERSLSTLKAHDISRQ------LRFLAGVDIPLGGSGRPDAVG-LYYAK 90
Query: 109 VSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
+ +G P+ + V +DTGSD+ W+ C C C SS G ++ Y S+T V
Sbjct: 91 IGIGTPSKDYYVQVDTGSDIVWVNCIQCRECPR--TSSLG--MELTPYDLEESTTGKLVS 146
Query: 168 CNSTLCELQKQCPSAG----SNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQSKSVD 221
C+ C P +G +CPY ++ DG+ + G+ V+D + + + ++ + +
Sbjct: 147 CDEQFCLEVNGGPLSGCTTNMSCPY-LQIYGDGSSTAGYFVKDYVQYNRVSGDLETTAAN 205
Query: 222 SRISFGCGRVQTGSFLDGA--APNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-GSDGT 278
I FGCG Q+G A +G+ G G +S+ S LA+ + F+ C G++G
Sbjct: 206 GSIKFGCGARQSGDLGSSGEEALDGILGFGKSNSSIISQLASTRKVKKMFAHCLDGTNGG 265
Query: 279 GRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE---FSA------IFDS 329
G + G P TP Q H YN+ +T V VG +N F A I DS
Sbjct: 266 GIFAMGHVVQPKVNMTPLVPNQPH--YNVNMTGVQVGHIILNISADVFEAGDRKGTIIDS 323
Query: 330 GTSFTYLNDPAY 341
GT+ YL + Y
Sbjct: 324 GTTLAYLPELIY 335
>gi|125564143|gb|EAZ09523.1| hypothetical protein OsI_31798 [Oryza sativa Indica Group]
Length = 485
Score = 98.6 bits (244), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 98/307 (31%), Positives = 138/307 (44%), Gaps = 42/307 (13%)
Query: 100 SLGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSP 157
SLG +Y +V +G PA + V DTGSDL W+ C C C + ++ P
Sbjct: 143 SLGTGNYVVSVGLGTPAKQYAVIFDTGSDLSWVQCKPCADCYEQQDP---------LFDP 193
Query: 158 NTSSTSSKVPCNSTLC-ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
+ SST + V C + C EL S+ S C Y+V+Y D + + G LV D L L+ +
Sbjct: 194 SLSSTYAAVACGAPECQELDASGCSSDSRCRYEVQY-GDQSQTDGNLVRDTLTLSASDTL 252
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPN---SFSMCF 273
V FGCG G F +GLFGLG +K S+PS QG P+ F+ C
Sbjct: 253 PGFV-----FGCGDQNAGLF---GQVDGLFGLGREKVSLPS----QG-APSYGPGFTYCL 299
Query: 274 GSDGTGR--ISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-------EFS 324
S +GR +S G T + T Y I + + VGG A+
Sbjct: 300 PSSSSGRGYLSLGGAPPANAQFTALADGATPSFYYIDLVGIKVGGRAIRIPATAFAAAGG 359
Query: 325 AIFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
+ DSGT T L AY + F S+A+ K+ + S L + CY + ++T + P V
Sbjct: 360 TVIDSGTVITRLPPRAYAPLRAAFARSMAQYKKAPALSIL--DTCYDFTGHRTA-QIPTV 416
Query: 384 NLTMKGG 390
L GG
Sbjct: 417 ELAFAGG 423
>gi|449529533|ref|XP_004171754.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 437
Score = 98.6 bits (244), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 80/269 (29%), Positives = 121/269 (44%), Gaps = 17/269 (6%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
++++G+ +F +D+GSDL W+ CD C H +Y PN ++ +
Sbjct: 57 VSINIGKGDEAFEFDIDSGSDLTWVQCD-APCTHCTKPRE------QLYKPNNNALNCFE 109
Query: 167 P-CNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
P C S C SA C Y++ Y G+ S G LV D H+ RI+
Sbjct: 110 PLCTSLHPITNHHCKSADDQCQYEIEYADHGS-SLGVLVND--HVPLKLTNGSLAAPRIA 166
Query: 226 FGCGRVQTGSFLDGAAPN-GLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFG 284
FGCG S D + P G+ GLG + S S L++ G++ N C +G G + FG
Sbjct: 167 FGCGYDHKYSVPDSSPPTAGVLGLGNGEVSFISQLSSMGVVRNVVGHCLSDEG-GFLFFG 225
Query: 285 DKGSPGQGETPFSLRQTH--PTYNITITQVSVGGNAVNF-EFSAIFDSGTSFTYLNDPAY 341
D+ P G T S+ Y+ +V GG A + + +FDSG+S+TY N AY
Sbjct: 226 DEFVPSSGVTWTSMSHESIGSYYSSGPAEVYFGGKATGIKDLTLVFDSGSSYTYFNSQAY 285
Query: 342 TQI-SETFNSLAKEKRETSTSDLPFEYCY 369
I + N+L + E + D C+
Sbjct: 286 NSILALVKNNLRGKPLEDAPEDKSLPVCW 314
>gi|326514838|dbj|BAJ99780.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 430
Score = 98.6 bits (244), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 84/278 (30%), Positives = 125/278 (44%), Gaps = 39/278 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
+Y +++G PA + + +DTGSDL WL CD C SC + Y P +
Sbjct: 73 YYVTMNIGDPAKPYFLDVDTGSDLTWLQCDAPCQSC---------NKVPHPWYKPTKNKI 123
Query: 163 SSKVPCNSTLCE---LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
VPC ++LC K+C + C YQ++Y +D S G L+ D L+ + S +
Sbjct: 124 ---VPCAASLCTSLTPNKKC-AVPQQCDYQIKY-TDKASSLGVLIADNFTLSL--RNSST 176
Query: 220 VDSRISFGCGRVQT--GSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDG 277
V + ++FGCG Q + AA +GL GLG S+ S L QG+ N CF ++G
Sbjct: 177 VRANLTFGCGYDQQVGKNGAVQAATDGLLGLGKGAVSLLSQLKQQGVTKNVLGHCFSTNG 236
Query: 278 TGRISFGDKGSPGQGETPFSL-RQTHPTYNITITQVSVGGNAVNFE--------FSAIFD 328
G + FGD P T + R T Y S G + F+ +FD
Sbjct: 237 GGFLFFGDDIVPTSRVTWVPMARTTSGNY------YSPGSGTLYFDRRSLGMKPMEVVFD 290
Query: 329 SGTSFTYL-NDPAYTQISETFNSLAKEKRETSTSDLPF 365
SG+++ Y +P +S L+K +E S LP
Sbjct: 291 SGSTYAYFAAEPYQATVSALKAGLSKSLKEVSDVSLPL 328
>gi|242050426|ref|XP_002462957.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
gi|241926334|gb|EER99478.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
Length = 452
Score = 98.6 bits (244), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 109/360 (30%), Positives = 155/360 (43%), Gaps = 58/360 (16%)
Query: 99 NSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSP 157
N G H +SVG P L+F +DTGSDL W C C + + +Y P
Sbjct: 91 NGAGAYHMI-LSVGTPPLAFPAIIDTGSDLTWTQCAPCTTACFAQPTP--------LYDP 141
Query: 158 NTSSTSSKVPCNSTLCELQKQCPSA-----GSNCPYQVRYLSDGTMSTGFLVEDVLHLAT 212
SST SK+PC S LC+ PSA + C Y RY + G+L D L +
Sbjct: 142 ARSSTFSKLPCASPLCQ---ALPSAFRACNATGCVYDYRYAVG--FTAGYLAADTLAIGD 196
Query: 213 DEKQSKSVDS--RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFS 270
+ + S ++FGC G +DGA +G+ GLG S S+L+ G+ FS
Sbjct: 197 GDGDGDASSSFAGVAFGCSTANGGD-MDGA--SGIVGLGR---SALSLLSQIGV--GRFS 248
Query: 271 MCFGSD---GTGRISF-------GDK-GSPGQGETPFSLRQTHPTYNITITQVSVGGNAV 319
C SD G I F GDK S P + R+ P Y + +T ++VG +
Sbjct: 249 YCLRSDADAGASPILFGALANVTGDKVQSTALLRNPVAARRRAPYYYVNLTGIAVGSTDL 308
Query: 320 -----NFEFSA------IFDSGTSFTYLNDPAYTQISETFNS-LAKEKRETSTSDLPFEY 367
F F+A I DSGT+FTYL + YT + + F S A S + F+
Sbjct: 309 PVTSSTFGFTAAGAGGVIVDSGTTFTYLAEAGYTMLRQAFLSQTAGLLTRVSGAQFDFDL 368
Query: 368 CYVLSPNQTNFEYPVVNLTMK-GGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
C+ T PV L + GG + + +G + CL V+ + V++IG
Sbjct: 369 CFEAGAADT----PVPRLVFRFAGGAEYAVPRQSYFDAVDEGGRVACLLVLPTRGVSVIG 424
>gi|225217022|gb|ACN85307.1| aspartic proteinase nepenthesin-1 precursor [Oryza ridleyi]
Length = 525
Score = 98.2 bits (243), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 101/347 (29%), Positives = 151/347 (43%), Gaps = 44/347 (12%)
Query: 99 NSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYS 156
+LG +Y + +G PA + V DTGSD W+ C+ CV + ++
Sbjct: 179 RALGTGNYVVTIGLGTPAGRYTVVFDTGSDTTWVQCEPCVVVCYEQQE--------KLFD 230
Query: 157 PNTSSTSSKVPCNSTLCE--LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
P SST + + C + C K C +G +C Y V+Y DG+ S GF D L L+
Sbjct: 231 PARSSTDANISCAAPACSDLYTKGC--SGGHCLYGVQY-GDGSYSIGFFAMDTLTLS--- 284
Query: 215 KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF- 273
S FGCG G F + A GL GLG KTS+P ++ F+ CF
Sbjct: 285 --SYDAIKGFRFGCGERNEGLFGEAA---GLLGLGRGKTSLPVQAYDK--YGGVFAHCFP 337
Query: 274 -GSDGTGRISFGDKGSPG---QGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA---- 325
S GTG + FG SP + TP + Y + +T + VGG ++ S
Sbjct: 338 ARSSGTGYLDFGPGSSPAVSTKLTTPMLVDNGLTFYYVGLTGIRVGGKLLSIPPSVFTTA 397
Query: 326 --IFDSGTSFTYLNDPAYTQISETFNS-LAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
I DSGT T L AY+ + F S +A + + + + CY + + P
Sbjct: 398 GTIVDSGTVITRLPPAAYSSLRSAFASAIAARGYKKAPALSLLDTCYDFT-GMSQVAIPT 456
Query: 383 VNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVV---KSDNVNIIG 426
V+L +GG V+ +I ++ + CLG + D+V I+G
Sbjct: 457 VSLLFQGGASLDVDASGIIYAAS---VSQACLGFAANEEDDDVGIVG 500
>gi|357160697|ref|XP_003578847.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 421
Score = 98.2 bits (243), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 82/276 (29%), Positives = 115/276 (41%), Gaps = 29/276 (10%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
L+Y +S+G P + + +DTGSDL WL CD CVSC + +Y P +
Sbjct: 57 LYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSC---------NKVPHPLYRPTKNK 107
Query: 162 TSSKVPCNSTLCEL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
VPC LC + +C S C Y+++Y G+ S G L+ D A
Sbjct: 108 I---VPCVDQLCSSLHGGLSGKHKCDSPKQQCDYEIKYADQGS-SLGVLLTD--SFAVRL 161
Query: 215 KQSKSVDSRISFGCG-RVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
S V ++FGCG Q GS + A +G+ GLG S+ S L G+ N C
Sbjct: 162 ANSSIVRPSLAFGCGYDQQVGSSTEVAPTDGVLGLGSGSISLLSQLKQHGITKNVVGHCL 221
Query: 274 GSDGTGRISFGDKGSPGQGET--PFSLRQTHPTYNITITQVSVGGNAVNFE-FSAIFDSG 330
G G + FGD P T P Y+ + GG ++ + DSG
Sbjct: 222 SIRGGGFLFFGDNLVPYSRATWVPMVRSAFKNYYSPGTASLYFGGRSLGVRPMEVVLDSG 281
Query: 331 TSFTYLNDPAYTQISETFNS-LAKEKRETSTSDLPF 365
+SFTY Y + S L+K +E LP
Sbjct: 282 SSFTYFGAQPYQALVTALKSDLSKTLKEVFDPSLPL 317
>gi|225217008|gb|ACN85295.1| aspartic proteinase nepenthesin-1 precursor [Oryza coarctata]
Length = 500
Score = 98.2 bits (243), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 97/342 (28%), Positives = 145/342 (42%), Gaps = 34/342 (9%)
Query: 99 NSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYS 156
++LG +Y + +G PA + V DTGSD W+ C+ CV + ++
Sbjct: 154 SALGTGNYVVTIGLGTPAGRYTVVFDTGSDTTWVQCEPCVVVCYKQQE--------KLFD 205
Query: 157 PNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
P SST + + C + C +G +C Y V+Y DG+ S GF D L L+
Sbjct: 206 PARSSTYANISCAAPACSDLYIKGCSGGHCLYGVQY-GDGSYSIGFFAMDTLTLS----- 259
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--G 274
S FGCG G + + A GL GLG KTS+P ++ F+ CF
Sbjct: 260 SYDAIKGFRFGCGERNEGLYGEAA---GLLGLGRGKTSLPVQAYDK--YGGVFAHCFPAR 314
Query: 275 SDGTGRISFGDKGSP---GQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA------ 325
S GTG + FG P + TP + Y + +T + VGG ++ S
Sbjct: 315 SSGTGYLDFGPGSLPAVSAKLTTPMLVDNGPTFYYVGLTGIRVGGKLLSIPQSVFTTSGT 374
Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP-FEYCYVLSPNQTNFEYPVVN 384
I DSGT T L AY+ + F S E+ L + CY + + P V+
Sbjct: 375 IVDSGTVITRLPPAAYSSLRSAFASAMAERGYKKAPALSLLDTCYDFT-GMSEVAIPTVS 433
Query: 385 LTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
L +GG V+ +I ++ L G + D+V I+G
Sbjct: 434 LLFQGGASLDVHASGIIYAASVSQACLGFAGNKEDDDVGIVG 475
>gi|242062704|ref|XP_002452641.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
gi|241932472|gb|EES05617.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
Length = 466
Score = 98.2 bits (243), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 94/307 (30%), Positives = 142/307 (46%), Gaps = 38/307 (12%)
Query: 99 NSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYS 156
SL L Y V +G PA++ +++DTGSD+ W+ C C C ++S ++
Sbjct: 124 TSLSTLEYVITVGIGSPAVTQTMSMDTGSDVSWVQCKPCSQCHSEVDS---------LFD 174
Query: 157 PNTSSTSSKVPCNSTLC----ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLAT 212
P+ SST S C+S C + Q+ + S C Y V Y+ DG+ +TG D L L +
Sbjct: 175 PSASSTYSPFSCSSAACVQLSQSQQGNGCSSSQCQYIVSYV-DGSSTTGTYSSDTLTLGS 233
Query: 213 DEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC 272
+ + FGC + ++G F D +GL GLG D S+ S A G +FS C
Sbjct: 234 NAIKG------FQFGCSQSESGGFSD--QTDGLMGLGGDAQSLVSQTA--GTFGKAFSYC 283
Query: 273 F--GSDGTGRISFGDKGSPGQGETPFSLRQTH-PT-YNITITQVSVGGNAVN-----FEF 323
+G ++ G G +TP LR T PT Y + + + VGG +N F
Sbjct: 284 LPPTPGSSGFLTLGAASRSGFVKTPM-LRSTQIPTYYGVLLEAIRVGGQQLNIPTSVFSA 342
Query: 324 SAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
++ DSGT T L AY+ +S F + K+ S + + C+ S Q++ P V
Sbjct: 343 GSVMDSGTVITRLPPTAYSALSSAFKAGMKKYPPAQPSGI-LDTCFDFS-GQSSVSIPSV 400
Query: 384 NLTMKGG 390
L GG
Sbjct: 401 ALVFSGG 407
>gi|449446233|ref|XP_004140876.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 498
Score = 98.2 bits (243), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 90/312 (28%), Positives = 142/312 (45%), Gaps = 39/312 (12%)
Query: 55 GSFAYYSALAHRDRYFR-LRGRGLAAQGNDKTPLTFSAGND-----TYRLNSLGFLHYTN 108
G F+ A R+R L+ ++ Q L F AG D + R +++G L+Y
Sbjct: 38 GIFSVKYKYAGRERSLSTLKAHDISRQ------LRFLAGIDIPLGGSGRPDAVG-LYYAK 90
Query: 109 VSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
+ +G P+ + V +DTGSD+ W+ C C C SS G ++ Y S+T V
Sbjct: 91 IGIGTPSKDYYVQVDTGSDIVWVNCIQCRECPR--TSSLG--MELTPYDLEESTTGKLVS 146
Query: 168 CNSTLCELQKQCPSAG----SNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQSKSVD 221
C+ C P +G +CPY ++ DG+ + G+ V+D + + + ++ + +
Sbjct: 147 CDEQFCLEVNGGPLSGCTTNMSCPY-LQIYGDGSSTAGYFVKDYVQYNRVSGDLETTAAN 205
Query: 222 SRISFGCGRVQTGSFLDGA--APNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-GSDGT 278
I FGCG Q+G A +G+ G G +S+ S LA+ + F+ C G++G
Sbjct: 206 GSIKFGCGARQSGDLGSSGEEALDGILGFGKSNSSIISQLASTRKVKKMFAHCLDGTNGG 265
Query: 279 GRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE---FSA------IFDS 329
G + G P TP Q H YN+ +T V VG +N F A I DS
Sbjct: 266 GIFAMGHVVQPKVNMTPLVPNQPH--YNVNMTGVQVGHIILNISADVFEAGDRKGTIIDS 323
Query: 330 GTSFTYLNDPAY 341
GT+ YL + Y
Sbjct: 324 GTTLAYLPELIY 335
>gi|356565521|ref|XP_003550988.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 634
Score = 97.8 bits (242), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 87/330 (26%), Positives = 152/330 (46%), Gaps = 36/330 (10%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
T + +G P F + +DTGS + ++PC C C G+ D + P +SST
Sbjct: 86 TRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQC--------GRHQDPK-FQPESSSTYQP 136
Query: 166 VPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
V C + C S C Y+ +Y ++ + S+G L ED++ QS+ R
Sbjct: 137 VKCT-----IDCNCDSDRMQCVYERQY-AEMSTSSGVLGEDLISFGN---QSELAPQRAV 187
Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISF 283
FGC V+TG A +G+ GLG S+ L ++ +I +SFS+C+G G G +
Sbjct: 188 FGCENVETGDLYSQHA-DGIMGLGRGDLSIMDQLVDKNVISDSFSLCYGGMDVGGGAMVL 246
Query: 284 GDKGSPGQGETPFSLRQTHPTYNITITQVSVGG-------NAVNFEFSAIFDSGTSFTYL 336
G P +S P YNI + ++ V G N + + + DSGT++ YL
Sbjct: 247 GGISPPSDMAFAYSDPVRSPYYNIDLKEIHVAGKRLPLNANVFDGKHGTVLDSGTTYAYL 306
Query: 337 NDPAYTQISETFNSLAKEKRETSTSDLPF-EYCYV---LSPNQTNFEYPVVNLTMKGGGP 392
+ A+ + + ++ S D + + C+ + +Q + +PVV++ + G
Sbjct: 307 PEAAFLAFKDAIVKELQSLKKISGPDPNYNDICFSGAGIDVSQLSKSFPVVDMVFENGQK 366
Query: 393 FFVN-DPIVIVSSEPKGLYLYCLGVVKSDN 421
+ ++ + + S+ +G YCLGV ++ N
Sbjct: 367 YTLSPENYMFRHSKVRG--AYCLGVFQNGN 394
>gi|115473125|ref|NP_001060161.1| Os07g0592200 [Oryza sativa Japonica Group]
gi|29027762|dbj|BAC65898.1| putative CND41, chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|113611697|dbj|BAF22075.1| Os07g0592200 [Oryza sativa Japonica Group]
Length = 631
Score = 97.8 bits (242), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 75/261 (28%), Positives = 124/261 (47%), Gaps = 33/261 (12%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
T + +G P+ F + +D+GS + ++PC C C + + + P+ SST S
Sbjct: 93 TRLYIGTPSQEFALIVDSGSTVTYVPCATCEQCGNHQDPR---------FQPDLSSTYSP 143
Query: 166 VPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
V CN + C + S C Y+ +Y ++ + S+G L ED++ K+S+ R
Sbjct: 144 VKCN-----VDCTCDNERSQCTYERQY-AEMSSSSGVLGEDIMSFG---KESELKPQRAV 194
Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGD 285
FGC +TG A +G+ GLG + S+ L +G+I +SFS+C+G G +
Sbjct: 195 FGCENTETGDLFSQHA-DGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDVGGGTMVL 253
Query: 286 KGSPGQGETPFSLRQ--THPTYNITITQVSVGGNAV-------NFEFSAIFDSGTSFTYL 336
G P + FS P YNI + ++ V G A+ N + + DSGT++ YL
Sbjct: 254 GGMPAPPDMVFSHSNPVRSPYYNIELKEIHVAGKALRLDPKIFNSKHGTVLDSGTTYAYL 313
Query: 337 NDPAYT----QISETFNSLAK 353
+ A+ ++ NSL K
Sbjct: 314 PEQAFVAFKDAVTNKVNSLKK 334
>gi|297805186|ref|XP_002870477.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316313|gb|EFH46736.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 287
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 68/222 (30%), Positives = 106/222 (47%), Gaps = 18/222 (8%)
Query: 99 NSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSP 157
N + ++YT + +G P F V +DTGSD+ W+ C CV C + + + P
Sbjct: 76 NPISRIYYTTLQIGTPPREFNVVIDTGSDVLWVSCISCVGC---------PLQNVTFFDP 126
Query: 158 NTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
SS++ K+ C+ C S S Y+V Y SDG+ ++G+ + D++ T +
Sbjct: 127 GASSSAVKLACSDKRCFSDLHKKSGCSPLEYKVEY-SDGSFTSGYYISDLISFETVMSSN 185
Query: 218 KSVDSR--ISFGCGRVQTGSF-LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF- 273
+V S FGC + G L + +G+ GLG + V S L++Q L P FS+C
Sbjct: 186 LTVKSSAPFVFGCSNLHAGLISLPETSIHGIVGLGKGRLLVVSQLSSQRLAPEVFSLCLS 245
Query: 274 -GSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSV 314
G +G G I G+ P TP QTH YN+ + +V
Sbjct: 246 GGQEGGGVIILGENRLPNTVYTPLVRSQTH--YNVNLKTFAV 285
>gi|255637574|gb|ACU19113.1| unknown [Glycine max]
Length = 290
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 76/221 (34%), Positives = 112/221 (50%), Gaps = 16/221 (7%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
L+YT V +G P V +DTGSD+ W+ C SC +G +SG I N + P +SSTS
Sbjct: 76 LYYTKVKLGTPPRELYVQIDTGSDVLWVSCG--SC-NGCPQTSGLQIQLNYFDPGSSSTS 132
Query: 164 SKVPCNSTLCELQKQ-----CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
S + C C Q C + C Y +Y DG+ ++G+ V D++H A+ + +
Sbjct: 133 SLISCLDRRCRSGVQTSDASCSGRNNQCTYTFQY-GDGSGTSGYYVSDLMHFASIFEGTL 191
Query: 219 SVDSRIS--FGCGRVQTGSFLDGA-APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
+ +S S FGC +QTG A +G+FG G SV S L++QG+ P FS C
Sbjct: 192 TTNSSASVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSSQGIAPRVFSHCLKG 251
Query: 276 D--GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSV 314
D G G + G+ P +P L + P YN+ + +SV
Sbjct: 252 DNSGGGVLVLGEIVEPNIVYSP--LVPSQPHYNLNLQSISV 290
>gi|357122155|ref|XP_003562781.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 629
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 89/335 (26%), Positives = 152/335 (45%), Gaps = 34/335 (10%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
T + +G P F + +D+GS + ++PC SC N + + P+ SST S V
Sbjct: 87 TRLYIGTPPQEFALIVDSGSTVTYVPC--ASCEQCGNHQDPR------FQPDLSSTYSPV 138
Query: 167 PCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISF 226
C++ C S S C Y+ +Y ++ + S+G L ED++ T +S+ R F
Sbjct: 139 KCSADCT-----CDSDKSQCTYERQY-AEMSSSSGVLGEDIVSFGT---ESELKPQRAVF 189
Query: 227 GCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISFG 284
GC +TG A +G+ GLG + S+ L ++G+I +SFSMC+G G G + G
Sbjct: 190 GCENSETGDLFSQHA-DGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIGGGAMVLG 248
Query: 285 DKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE-------FSAIFDSGTSFTYLN 337
+P S P YNI + ++ V G A+ + + DSGT++ YL
Sbjct: 249 AMPAPPDMVFSRSDPVRSPYYNIELKEIHVAGKALRLDPRIFDSKHGTVLDSGTTYAYLP 308
Query: 338 DPAYTQISETFNSLAKEKRETSTSDLPF-EYCYV---LSPNQTNFEYPVVNLTMKGGGPF 393
+ A+ + S + ++ D + + C+ + +Q + +P V++ G G
Sbjct: 309 EQAFVAFKDAVTSKVRPLKKIRGPDPNYKDICFAGAGRNVSQLSQAFPDVDMVF-GDGQK 367
Query: 394 FVNDPIVIVSSEPKGLYLYCLGVVKS--DNVNIIG 426
P + K YCLGV ++ D ++G
Sbjct: 368 LSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLG 402
>gi|225217056|gb|ACN85339.1| aspartic proteinase nepenthesin-1 precursor [Oryza granulata]
Length = 521
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 103/346 (29%), Positives = 151/346 (43%), Gaps = 42/346 (12%)
Query: 99 NSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYS 156
+LG +Y + +G PA + V DTGSD W+ C CV + ++
Sbjct: 175 RALGTGNYVVTIGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYKQQE--------KLFD 226
Query: 157 PNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
P SST + V C + C +G +C Y V+Y DG+ S GF D L L++ +
Sbjct: 227 PARSSTYANVSCAAPACSDLYTRGCSGGHCLYSVQY-GDGSYSIGFFAMDTLTLSSYDAV 285
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--G 274
FGCG G F + A GL GLG KTS+P ++ F+ C
Sbjct: 286 KG-----FRFGCGERNEGLFGEAA---GLLGLGRGKTSLPVQTYDK--YGGVFAHCLPAR 335
Query: 275 SDGTGRISFGDKGSP---GQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA----- 325
S GTG + FG GSP G +T L PT Y + +T + VGG ++ S
Sbjct: 336 SSGTGYLDFG-PGSPAAVGARQTTPMLTDNGPTFYYVGMTGIRVGGQLLSIPQSVFSTAG 394
Query: 326 -IFDSGTSFTYLNDPAYTQISETFNS-LAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
I DSGT T L AY+ + F S +A + + + + CY + + P V
Sbjct: 395 TIVDSGTVITRLPPAAYSSLRSAFASAMAARGYKKAPALSLLDTCYDFT-GMSEVAIPKV 453
Query: 384 NLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKS---DNVNIIG 426
+L +GG VN ++ ++ L CLG + D+V I+G
Sbjct: 454 SLLFQGGAYLDVNASGIMYAAS---LSQVCLGFAANEDDDDVGIVG 496
>gi|302780643|ref|XP_002972096.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
gi|300160395|gb|EFJ27013.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
Length = 393
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 95/329 (28%), Positives = 142/329 (43%), Gaps = 34/329 (10%)
Query: 73 RGRGLAAQGNDKTPLTFSAGNDTYR-LNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWL 131
R R +AA+ N + + + D L+ G + ++SVG P F DTGSDL W+
Sbjct: 22 RVRWMAARANSSSWSSMAGTTDVESPLHPDGGGYVMDISVGTPGKRFRAIADTGSDLVWV 81
Query: 132 PCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC-ELQKQCPSAGSNCPYQ 189
+ C C G I+ P SST ++ C+S LC EL C S C Y
Sbjct: 82 QSEPCTGCSGG-----------TIFDPRQSSTFREMDCSSQLCAELPGSCEPGSSTCSYS 130
Query: 190 VRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLG 249
Y S T G D + L T S+ S + GCG V +G DG +GL GLG
Sbjct: 131 YEYGSGET--EGEFARDTISLGTTSDGSQKFPS-FAVGCGMVNSG--FDGV--DGLVGLG 183
Query: 250 MDKTSVPSILANQGLIPNSFSMCF----GSDGTGRISFGDK----GSPGQGETPFSLRQT 301
S+ S L+ I + FS C + + FG G+ Q T
Sbjct: 184 QGPVSLTSQLS--AAIDSKFSYCLVDINSQSESSPLLFGPSAALHGTGIQSTKITPPSDT 241
Query: 302 HPTYNI-TITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETST 360
+PTY + T+ ++V G + + I DSGT+ TY+ Y ++ S+ R
Sbjct: 242 YPTYYLLTVNGIAVAGQTMGSPGTTIIDSGTTLTYVPSGVYGRVLSRMESMVTLPR-VDG 300
Query: 361 SDLPFEYCYVLSPNQTNFEYPVVNLTMKG 389
S + + CY S N+ N+++P + + + G
Sbjct: 301 SSMGLDLCYDRSSNR-NYKFPALTIRLAG 328
>gi|168025534|ref|XP_001765289.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162683608|gb|EDQ70017.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 372
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 90/297 (30%), Positives = 138/297 (46%), Gaps = 47/297 (15%)
Query: 93 NDTYRL-NSLGFLHY-TNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQV 149
N++Y S G+ + + +G P +V +DTGSDL W+ + C +C +
Sbjct: 11 NESYEFPESAGYGEFLVPIYLGTPPQKAVVIIDTGSDLTWIQSEPCRACFEQADP----- 65
Query: 150 IDFNIYSPNTSSTSSKVPCNSTLCE--LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDV 207
I+ P+ SST +K+ C+S+ C L Q SA +NC Y Y DG+++ G+ ++
Sbjct: 66 ----IFDPSKSSTYNKIACSSSACADLLGTQTCSAAANCIYAYGY-GDGSVTRGYFSKET 120
Query: 208 LHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPN 267
+ ATD + + FG TG+F D G+ GLG S+PS L + ++ N
Sbjct: 121 I-TATD-----TAGEEVKFGASVYNTGTFGDTGG-EGILGLGQGPVSMPSQLGS--VLGN 171
Query: 268 SFSMCF------GSDGTGRISFGDKGSPGQGE---TPFSLRQTHPT-YNITITQVSVGGN 317
FS C GS+ T + FGD P GE TP HPT Y I + +SVGG+
Sbjct: 172 KFSYCLVDWLSAGSE-TSTMYFGDAAVP-SGEVQYTPIVPNADHPTYYYIAVQGISVGGS 229
Query: 318 AVNFEFSA-----------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDL 363
++ + S I DSGT+ TYL + + + S + TS + L
Sbjct: 230 LLDIDQSVYEIDSGGSGGTIIDSGTTITYLQQEVFNALVAAYTSQVRYPTTTSATGL 286
>gi|302789618|ref|XP_002976577.1| hypothetical protein SELMODRAFT_416663 [Selaginella moellendorffii]
gi|300155615|gb|EFJ22246.1| hypothetical protein SELMODRAFT_416663 [Selaginella moellendorffii]
Length = 437
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 102/345 (29%), Positives = 151/345 (43%), Gaps = 36/345 (10%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L+YT + +G P V +DTGSD+ W+ C C SC+ S + +IY+ + SST
Sbjct: 82 LYYTEIGLGNPVQKLKVIVDTGSDILWVKCSPCRSCL----SKQDIIPPLSIYNLSASST 137
Query: 163 SSKVPCNSTLCELQK-QCPSAGSN--CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
SS C+ LC ++ C +G+N C Y Y D + S G V D +H + +
Sbjct: 138 SSVSSCSDPLCTGEEVVCSRSGNNSACAYVSSY-QDKSASVGAYVRDDMHYVLHGGNATT 196
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD--G 277
SRI FGC TGS+ +G+ G G+ +VP+ +A Q + FS C G + G
Sbjct: 197 --SRIFFGCATNITGSW----PVDGIMGFGLISKTVPNQIATQRNMSRVFSHCLGGEKHG 250
Query: 278 TGRISFGDKGSPGQGETPFS-LRQTHPTYNITITQVSVGGNAVNF---EFS--------- 324
G + FG+ +P E F+ L YN+ + +SV + EFS
Sbjct: 251 GGILEFGE--APNTTEMVFTPLLNVTTHYNVDLLSISVNSKVLPIDPKEFSYVRNSTNNT 308
Query: 325 -AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
I DSGT+F L A + + SL K L E Y+ S +P V
Sbjct: 309 GVIIDSGTTFVLLTTKANRMLFQEIKSLTTAKLGPKLEGL--ECFYLKSGLTMETSFPNV 366
Query: 384 NLTMKGGGPFFVN-DPIVIVSSEPKGLYLYCLGVVKSDNVNIIGR 427
LT GG + D ++++ K YC +D + I G
Sbjct: 367 TLTFSGGSTMKLKPDNYLVMAEYKKKRNGYCYAWSSADGLTIFGE 411
>gi|224136884|ref|XP_002326969.1| predicted protein [Populus trichocarpa]
gi|222835284|gb|EEE73719.1| predicted protein [Populus trichocarpa]
Length = 626
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 87/329 (26%), Positives = 145/329 (44%), Gaps = 34/329 (10%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
T + +G P F + +DTGS + ++PC C C G+ D + P+ SST
Sbjct: 79 TRLFIGTPPQEFALIVDTGSTVTYVPCSSCEQC--------GKHQDPR-FQPDLSSTYRP 129
Query: 166 VPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
V CN + C C G C Y+ RY ++ + S+G + EDV+ + S+ R
Sbjct: 130 VKCNPS-C----NCDDEGKQCTYERRY-AEMSSSSGVIAEDVVSFGNE---SELKPQRAV 180
Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISF 283
FGC V+TG A +G+ GLG + SV L ++G+I +SFS+C+G G G +
Sbjct: 181 FGCENVETGDLYSQRA-DGIMGLGRGRLSVVDQLVDKGVIGDSFSLCYGGMDVGGGAMVL 239
Query: 284 GDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-------EFSAIFDSGTSFTYL 336
G P S P YNI + ++ V G + + + DSGT++ Y
Sbjct: 240 GQISPPPNMVFSHSNPYRSPYYNIELKELHVAGKPLKLKPKVFDEKHGTVLDSGTTYAYF 299
Query: 337 NDPAYTQISETFNSLAKEKRETSTSDLPF-EYCYVLSPNQTNF---EYPVVNLTMKGGGP 392
+ A+ + + + ++ D + + C+ + + + +P VN+ G G
Sbjct: 300 PEAAFHALKDAIMKEIRHLKQIPGPDPNYHDICFSGAGREVSHLSKVFPEVNMVF-GSGQ 358
Query: 393 FFVNDPIVIVSSEPKGLYLYCLGVVKSDN 421
P + K YCLG+ ++ N
Sbjct: 359 KLSLSPENYLFRHTKVSGAYCLGIFQNGN 387
>gi|242058537|ref|XP_002458414.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
gi|241930389|gb|EES03534.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
Length = 448
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 100/357 (28%), Positives = 152/357 (42%), Gaps = 57/357 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
++ ++VG P +V +DTGSDL WL CV C H + +Y P +SST
Sbjct: 88 YFAVINVGDPPTRALVVIDTGSDLIWL--QCVPCRHCYRQVT------PLYDPRSSSTHR 139
Query: 165 KVPCNSTLCE---LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
++PC S C C + C Y V Y DG+ S+G L D L D
Sbjct: 140 RIPCASPRCRDVLRYPGCDARTGGCVYMVVY-GDGSASSGDLATDRLVFPDDTHVHN--- 195
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG------S 275
++ GCG G L+ AA GL G+G + S P+ LA + FS C G
Sbjct: 196 --VTLGCGHDNVG-LLESAA--GLLGVGRGQLSFPTQLAPA--YGHVFSYCLGDRLSRAQ 248
Query: 276 DGTGRISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGGNAVNFEFSA------ 325
+G+ + FG +P T F+ +T+P Y + + SVGG V +A
Sbjct: 249 NGSSYLVFGR--TPEPPSTAFTPLRTNPRRPSLYYVDMVGFSVGGERVTGFSNASLALNP 306
Query: 326 -------IFDSGTSFTYLNDPAYTQISETFNSLAKEK---RETSTSDLPFEYCYVLSPN- 374
+ DSGT+ + AY + + F+S A R+ +T F+ CY L N
Sbjct: 307 ATGRGGIVVDSGTAISRFARDAYAAVRDAFDSHAAAAGTMRKLATKFSVFDACYDLRGNG 366
Query: 375 --QTNFEYPVVNLTMKGGGPFFVNDP--IVIVSSEPKGLYLYCLGVVKSDN-VNIIG 426
P + L GG + ++ V + Y +CLG+ +D+ +N++G
Sbjct: 367 APAAAVRVPSIVLHFAGGADMALPQANYLIPVQGGDRRTY-FCLGLQAADDGLNVLG 422
>gi|224066811|ref|XP_002302227.1| predicted protein [Populus trichocarpa]
gi|222843953|gb|EEE81500.1| predicted protein [Populus trichocarpa]
Length = 422
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 84/273 (30%), Positives = 120/273 (43%), Gaps = 32/273 (11%)
Query: 105 HYTNV-SVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
HY+ + ++G P +F + +DTGSDL W+ CD C C L+ +Y P
Sbjct: 67 HYSVILNIGNPPKAFDLDIDTGSDLTWVQCDAPCKGCTKPLDK---------LYKPK--- 114
Query: 162 TSSKVPCNSTLCEL--QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
+++VPC S+LC+ C C Y+V Y G+ S G L+ D L +
Sbjct: 115 -NNRVPCASSLCQAIQNNNCDIPTEQCDYEVEYADLGS-SLGVLLSDYFPLRLNN--GSL 170
Query: 220 VDSRISFGCGRVQTGSFLDGAAP---NGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD 276
+ RI+FGCG Q +L +P G+ GLG K S+ S L G+ N CF
Sbjct: 171 LQPRIAFGCGYDQ--KYLGPHSPPDTAGILGLGRGKASILSQLRTLGITQNVVGHCFSRV 228
Query: 277 GTGRISFGDKGSPGQG--ETPFSLRQTHPTYNITITQVSVGGNAVNFE-FSAIFDSGTSF 333
G + FGD P G TP + Y+ ++ GG + IFDSG+S+
Sbjct: 229 TGGFLFFGDHLLPPSGITWTPMLRSSSDTLYSSGPAELLFGGKPTGIKGLQLIFDSGSSY 288
Query: 334 TYLNDPAYTQISETFNSLAKEKRETSTSDLPFE 366
TY N Y I N + K+ D P E
Sbjct: 289 TYFNAQVYQSI---LNLVRKDLSGMPLKDAPEE 318
>gi|356500374|ref|XP_003519007.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase Asp1-like
[Glycine max]
Length = 454
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 90/309 (29%), Positives = 134/309 (43%), Gaps = 38/309 (12%)
Query: 78 AAQGNDKTPLTFSAGNDTYRLNSLGFL----------HYT-NVSVGQPALSFIVALDTGS 126
A N K P T + N+ +RL+S HYT ++++G P + + +D+GS
Sbjct: 26 AQPRNAKKPKTPYSDNNHHRLSSSAVFKLQGNVYPLGHYTVSLNIGYPPKLYDLDIDSGS 85
Query: 127 DLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE-----LQKQC 179
DL W+ CD C C + +Y PN + V C LC + C
Sbjct: 86 DLTWVQCDAPCKGCTKPRD---------QLYKPN----HNLVQCVDQLCSEVHLSMAYNC 132
Query: 180 PSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDG 239
PS C Y+V Y G+ S G LV D ++ V R++FGCG Q S +
Sbjct: 133 PSPDDPCDYEVEYADHGS-SLGVLVRD--YIPFQFTNGSVVRPRVAFGCGYDQKYSGSNS 189
Query: 240 A-APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSL 298
A +G+ GLG + S+ S L + GLI N C + G G + FGD P G S+
Sbjct: 190 PPATSGVLGLGNGRASILSQLHSLGLIRNVVGHCLSAQGGGFLFFGDDFIPSSGIVWTSM 249
Query: 299 RQTHPTYNITI--TQVSVGGNAVNFE-FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEK 355
+ + + ++ G A + IFDSG+S+TY N AY + + K K
Sbjct: 250 LSSSSEKHYSSGPAELVFNGKATAVKGLELIFDSGSSYTYFNSQAYQAVVDLVTKDLKGK 309
Query: 356 RETSTSDLP 364
+ +D P
Sbjct: 310 QLKRATDDP 318
>gi|357487631|ref|XP_003614103.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355515438|gb|AES97061.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 431
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 85/298 (28%), Positives = 128/298 (42%), Gaps = 35/298 (11%)
Query: 110 SVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPC 168
SVG P + +DTGSD+ WL C C C I++P+ SS+ +PC
Sbjct: 92 SVGTPPFNVYGVVDTGSDIVWLQCKPCEQCYKQTTP---------IFNPSKSSSYKNIPC 142
Query: 169 NSTLCELQKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFG 227
+S LC+ + N C Y + + SD + S G L + L L + S S + G
Sbjct: 143 SSNLCQSVRYTSCNKQNSCEYTINF-SDQSYSQGELSVETLTLDSTTGHSVSFPKTV-IG 200
Query: 228 CGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-----GSDGTGRIS 282
CG G F +G+ GLG+ S+ + L + I FS C S+ T +++
Sbjct: 201 CGHNNRGMF--QGETSGIVGLGIGPVSLTTQLKSS--IGGKFSYCLLPLLVDSNKTSKLN 256
Query: 283 FGDKG---SPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEF-------SAIFDSGTS 332
FGD G TPF + Y +T+ SVG + FE + I DSGT+
Sbjct: 257 FGDAAVVSGDGVVSTPFVKKDPQAFYYLTLEAFSVGNKRIEFEVLDDSEEGNIILDSGTT 316
Query: 333 FTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGG 390
T L YT + L K R + L CY ++ +Q +++P++ KG
Sbjct: 317 LTLLPSHVYTNLESAVAQLVKLDRVDDPNQL-LNLCYSITSDQ--YDFPIITAHFKGA 371
>gi|302781726|ref|XP_002972637.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
gi|300160104|gb|EFJ26723.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
Length = 393
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 95/329 (28%), Positives = 142/329 (43%), Gaps = 34/329 (10%)
Query: 73 RGRGLAAQGNDKTPLTFSAGNDTYR-LNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWL 131
R R +AA+ N + + + D L+ G + ++SVG P F DTGSDL W+
Sbjct: 22 RVRWMAARANSSSWSSMAGTTDVESPLHPDGGGYVMDISVGTPGKRFRAIADTGSDLVWV 81
Query: 132 PCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC-ELQKQCPSAGSNCPYQ 189
+ C C G I+ P SST ++ C+S LC EL C S C Y
Sbjct: 82 QSEPCTGCSGG-----------TIFDPRQSSTFREMDCSSQLCTELPGSCEPGSSACSYS 130
Query: 190 VRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLG 249
Y S T G D + L T S+ S + GCG V +G DG +GL GLG
Sbjct: 131 YEYGSGET--EGEFARDTISLGTTSGGSQKFPS-FAVGCGMVNSG--FDGV--DGLVGLG 183
Query: 250 MDKTSVPSILANQGLIPNSFSMCF----GSDGTGRISFGDK----GSPGQGETPFSLRQT 301
S+ S L+ I + FS C + + FG G+ Q T
Sbjct: 184 QGPVSLTSQLS--AAIDSKFSYCLVDINSQSESSPLLFGPSAALHGTGIQSTKITPPSDT 241
Query: 302 HPTYNI-TITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETST 360
+PTY + T+ ++V G + + I DSGT+ TY+ Y ++ S+ R
Sbjct: 242 YPTYYLLTVNGIAVAGQTMGSPGTTIIDSGTTLTYVPSGVYGRVLSRMESMVTLPR-VDG 300
Query: 361 SDLPFEYCYVLSPNQTNFEYPVVNLTMKG 389
S + + CY S N+ N+++P + + + G
Sbjct: 301 SSMGLDLCYDRSSNR-NYKFPALTIRLAG 328
>gi|115468912|ref|NP_001058055.1| Os06g0610800 [Oryza sativa Japonica Group]
gi|51091964|dbj|BAD35493.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|113596095|dbj|BAF19969.1| Os06g0610800 [Oryza sativa Japonica Group]
gi|218198535|gb|EEC80962.1| hypothetical protein OsI_23679 [Oryza sativa Indica Group]
Length = 519
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 100/341 (29%), Positives = 146/341 (42%), Gaps = 35/341 (10%)
Query: 99 NSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYS 156
+LG +Y V +G PA + V DTGSD W+ C CV + ++
Sbjct: 176 RALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQRE--------KLFD 227
Query: 157 PNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
P +SST + V C + C +G +C Y V+Y DG+ S GF D L L++ +
Sbjct: 228 PASSSTYANVSCAAPACSDLDVSGCSGGHCLYGVQY-GDGSYSIGFFAMDTLTLSSYDAV 286
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--G 274
FGCG G F + A GL GLG KTS+P + G F+ C
Sbjct: 287 KG-----FRFGCGERNDGLFGEAA---GLLGLGRGKTSLP--VQTYGKYGGVFAHCLPAR 336
Query: 275 SDGTGRISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFE---FSA---IF 327
S GTG + FG P TP L PT Y + +T + VGG + F+A I
Sbjct: 337 STGTGYLDFGAGSPPATTTTPM-LTGNGPTFYYVGMTGIRVGGRLLPIAPSVFAAAGTIV 395
Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEK--RETSTSDLPFEYCYVLSPNQTNFEYPVVNL 385
DSGT T L AY+ + F + + R+ + L + CY + + P V+L
Sbjct: 396 DSGTVITRLPPAAYSSLRSAFAAAMAARGYRKAAAVSL-LDTCYDFT-GMSQVAIPTVSL 453
Query: 386 TMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
+GG V+ ++ + + L G +V I+G
Sbjct: 454 LFQGGAALDVDASGIMYTVSASQVCLAFAGNEDGGDVGIVG 494
>gi|326520736|dbj|BAJ92731.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 499
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 102/338 (30%), Positives = 145/338 (42%), Gaps = 42/338 (12%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+ V +G PA + V DTGSD W+ C CV G + D P SST +
Sbjct: 163 YVVTVGLGTPASKYTVVFDTGSDTTWV--QCRPCVVKCYKQKGPLFD-----PAKSSTYA 215
Query: 165 KVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRI 224
V C + C G +C Y V+Y DG+ + GF +D L +A D +
Sbjct: 216 NVSCTDSACADLDTNGCTGGHCLYAVQY-GDGSYTVGFFAQDTLTIAHDAIKG------F 268
Query: 225 SFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS--DGTGRIS 282
FGCG G F A GL GLG KTS+ N+ +F+ C + GTG +
Sbjct: 269 RFGCGEKNNGLFGKTA---GLMGLGRGKTSLTVQAYNK--YGGAFAYCLPALTTGTGYLD 323
Query: 283 FGDKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFSA------IFDSGTSFT 334
FG GS G TP + Y + +T + VGG V S + DSGT T
Sbjct: 324 FG-PGSAGNNARLTPMLTDKGQTFYYVGMTGIRVGGQQVPVAESVFSTAGTLVDSGTVIT 382
Query: 335 YLNDPAYTQISETFNS--LAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGP 392
L AYT +S F+ LA+ ++ + + CY + ++ E P V+L +GG
Sbjct: 383 RLPATAYTALSSAFDKVMLARGYKKAPGYSI-LDTCYDFT-GLSDVELPTVSLVFQGGAC 440
Query: 393 FFVN-DPIVIVSSEPKGLYLYCLGVVKS---DNVNIIG 426
V+ IV SE + CL + ++V I+G
Sbjct: 441 LDVDVSGIVYAISEAQ----VCLAFASNGDDESVAIVG 474
>gi|326531454|dbj|BAJ97731.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 101/316 (31%), Positives = 136/316 (43%), Gaps = 36/316 (11%)
Query: 103 FLHYTNVSVGQPALSFIVALDTGSDLFWL---PCDCVSCVHGLNSSSGQVIDFNIYSPNT 159
F + VS+G P +S V +DTGSD+ W+ PC +C NS Q+ D P
Sbjct: 141 FQYVVTVSLGTPGVSQTVEVDTGSDVSWVQCKPCSAPAC----NSQRDQLFD-----PAK 191
Query: 160 SSTSSKVPCNSTLC-ELQ-KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
SST S VPC + C EL+ + +GS C Y V Y DG+ +TG D L LA
Sbjct: 192 SSTYSAVPCGADACSELRIYEAGCSGSQCGYVVSY-GDGSNTTGVYGSDTLALAPGNTVG 250
Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD- 276
+ FGCG Q G F A +GL LG S+ S A G FS C S
Sbjct: 251 TFL-----FGCGHAQAGMF---AGIDGLLALGRQSMSLKSQAA--GAYGGVFSYCLPSKQ 300
Query: 277 -GTGRISFGDKGSP-GQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA-----IFD 328
G ++ G S G T PT Y + +T +SVGG V SA + D
Sbjct: 301 SAAGYLTLGGPTSASGFATTGLLTAWAAPTFYMVMLTGISVGGQQVAVPASAFAGGTVVD 360
Query: 329 SGTSFTYLNDPAYTQISETFN-SLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTM 387
+GT T L AY + F ++A ++ ++ + CY S P V LT
Sbjct: 361 TGTVITRLPPTAYAALRSAFRGAIAPYGYPSAPANGILDTCYDFS-RYGVVTLPTVALTF 419
Query: 388 KGGGPFFVNDPIVIVS 403
GG + P ++ S
Sbjct: 420 SGGATLALEAPGILSS 435
>gi|357131735|ref|XP_003567490.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 458
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 103/366 (28%), Positives = 150/366 (40%), Gaps = 57/366 (15%)
Query: 87 LTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSS 145
+ +AG R S + +G P + +VA+D +D W+PC C+ C G +S
Sbjct: 86 VPIAAGRQILRTPS----YVARARLGTPPQTLLVAIDPSNDAAWVPCSACLGCAPGASSP 141
Query: 146 SGQVIDFNIYSPNTSSTSSKVPCNSTLCE----LQKQCPSA-GSNCPYQVRYLSDGTMST 200
S + P SST V C + C CP+ G++C + + Y S +
Sbjct: 142 S--------FDPTQSSTYRPVRCGAPQCAQVPPATPSCPAGPGASCAFNLSYASSTLHA- 192
Query: 201 GFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTS-VPSIL 259
L +D L L +D + D +FGC RV TGS P GL G G S +
Sbjct: 193 -VLGQDALSL-SDSNGAAVPDDHYTFGCLRVVTGSG-GSVPPQGLVGFGRGPLSFLSQTK 249
Query: 260 ANQGLIPNSFSMCF----GSDGTGRISFGDKGSPGQGETPFSLRQTH-PT-YNITITQVS 313
A G I FS C S+ +G + G G P + +T L H P+ Y + + V
Sbjct: 250 ATYGSI---FSYCLPSYKSSNFSGTLRLGPAGQPRRIKTTPLLSNPHRPSLYYVAMVGVR 306
Query: 314 VGGNAVNFEFSA------------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTS 361
V G AV SA I D+GT FT L+ PAY + F +R S
Sbjct: 307 VNGKAVPIPASALALDAATGRGGTIVDAGTMFTRLSPPAYAALRNAF------RRGVSAP 360
Query: 362 DLP----FEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVV 417
P F+ CY ++ ++ P V GG + + V++SS G+ +
Sbjct: 361 AAPALGGFDTCYYVNGTKS---VPAVAFVFAGGARVTLPEENVVISSTSGGVACLAMAAG 417
Query: 418 KSDNVN 423
SD VN
Sbjct: 418 PSDGVN 423
>gi|449464178|ref|XP_004149806.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 437
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 79/269 (29%), Positives = 120/269 (44%), Gaps = 17/269 (6%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
++++G+ +F +D+GSDL W+ CD C H +Y PN ++ +
Sbjct: 57 VSINIGKGDEAFEFDIDSGSDLTWVQCD-APCTHCTKPRE------QLYKPNNNALNCFE 109
Query: 167 P-CNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
P C S C SA C Y++ Y G+ S G LV D H+ RI+
Sbjct: 110 PLCTSLHPITNHHCKSADDQCQYEIEYADHGS-SLGVLVND--HVPLKLTNGSLAAPRIA 166
Query: 226 FGCGRVQTGSFLDGAAPN-GLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFG 284
FGCG S D + P G+ GLG + S S L++ G++ N C +G G + FG
Sbjct: 167 FGCGYDHKYSVPDSSPPTAGVLGLGNGEVSFISQLSSMGVVRNVVGHCLSDEG-GFLFFG 225
Query: 285 DKGSPGQGETPFSLRQTH--PTYNITITQVSVGGNAVNF-EFSAIFDSGTSFTYLNDPAY 341
D+ P G T S+ Y+ +V G A + + +FDSG+S+TY N AY
Sbjct: 226 DEFVPSSGVTWTSMSHESIGSYYSSGPAEVYFSGKATGIKDLTLVFDSGSSYTYFNSQAY 285
Query: 342 TQI-SETFNSLAKEKRETSTSDLPFEYCY 369
I + N+L + E + D C+
Sbjct: 286 NSILALVKNNLRGKPLEDAPEDKSLPVCW 314
>gi|326495920|dbj|BAJ90582.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 101/316 (31%), Positives = 136/316 (43%), Gaps = 36/316 (11%)
Query: 103 FLHYTNVSVGQPALSFIVALDTGSDLFWL---PCDCVSCVHGLNSSSGQVIDFNIYSPNT 159
F + VS+G P +S V +DTGSD+ W+ PC +C NS Q+ D P
Sbjct: 141 FQYVVTVSLGTPGVSQTVEVDTGSDVSWVQCKPCSAPAC----NSQRDQLFD-----PAK 191
Query: 160 SSTSSKVPCNSTLC-ELQ-KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
SST S VPC + C EL+ + +GS C Y V Y DG+ +TG D L LA
Sbjct: 192 SSTYSAVPCGADACSELRIYEAGCSGSQCGYVVSY-GDGSNTTGVYGSDTLALAPGNTVG 250
Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD- 276
+ FGCG Q G F A +GL LG S+ S A G FS C S
Sbjct: 251 TFL-----FGCGHAQAGMF---AGIDGLLALGRQSMSLKSQAA--GAYGGVFSYCLPSKQ 300
Query: 277 -GTGRISFGDKGSP-GQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA-----IFD 328
G ++ G S G T PT Y + +T +SVGG V SA + D
Sbjct: 301 SAAGYLTLGGPSSASGFATTGLLTAWAAPTFYMVMLTGISVGGQQVAVPASAFAGGTVVD 360
Query: 329 SGTSFTYLNDPAYTQISETFN-SLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTM 387
+GT T L AY + F ++A ++ ++ + CY S P V LT
Sbjct: 361 TGTVITRLPPTAYAALRSAFRGAIAPCGYPSAPANGILDTCYDFS-RYGVVTLPTVALTF 419
Query: 388 KGGGPFFVNDPIVIVS 403
GG + P ++ S
Sbjct: 420 SGGATLALEAPGILSS 435
>gi|225216879|gb|ACN85177.1| aspartic proteinase nepenthesin-1 precursor [Oryza nivara]
gi|225216896|gb|ACN85193.1| aspartic proteinase nepenthesin-1 precursor [Oryza rufipogon]
Length = 515
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 100/341 (29%), Positives = 146/341 (42%), Gaps = 35/341 (10%)
Query: 99 NSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYS 156
+LG +Y V +G PA + V DTGSD W+ C CV + ++
Sbjct: 172 RALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQRE--------KLFD 223
Query: 157 PNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
P +SST + V C + C +G +C Y V+Y DG+ S GF D L L++ +
Sbjct: 224 PASSSTYANVSCAAPACSDLDVSGCSGGHCLYGVQY-GDGSYSIGFFAMDTLTLSSYDAV 282
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--G 274
FGCG G F + A GL GLG KTS+P + G F+ C
Sbjct: 283 KG-----FRFGCGERNDGLFGEAA---GLLGLGRGKTSLP--VQTYGKYGGVFAHCLPAR 332
Query: 275 SDGTGRISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFE---FSA---IF 327
S GTG + FG P TP L PT Y + +T + VGG + F+A I
Sbjct: 333 STGTGYLDFGAGSPPATTTTPM-LTGNGPTFYYVGMTGIRVGGRLLPIAPSVFAAAGTIV 391
Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEK--RETSTSDLPFEYCYVLSPNQTNFEYPVVNL 385
DSGT T L AY+ + F + + R+ + L + CY + + P V+L
Sbjct: 392 DSGTVITRLPPAAYSSLRSAFAAAMAARGYRKAAAVSL-LDTCYDFT-GMSQVAIPTVSL 449
Query: 386 TMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
+GG V+ ++ + + L G +V I+G
Sbjct: 450 LFQGGAALDVDASGIMYTVSASQVCLAFAGNEDGGDVGIVG 490
>gi|225216914|gb|ACN85210.1| aspartic proteinase nepenthesin-1 precursor [Oryza glaberrima]
Length = 516
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 100/341 (29%), Positives = 146/341 (42%), Gaps = 35/341 (10%)
Query: 99 NSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYS 156
+LG +Y V +G PA + V DTGSD W+ C CV + ++
Sbjct: 173 RALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQRE--------KLFD 224
Query: 157 PNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
P +SST + V C + C +G +C Y V+Y DG+ S GF D L L++ +
Sbjct: 225 PASSSTYANVSCAAPACSDLDVSGCSGGHCLYGVQY-GDGSYSIGFFAMDTLTLSSYDAV 283
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--G 274
FGCG G F + A GL GLG KTS+P + G F+ C
Sbjct: 284 KG-----FRFGCGERNDGLFGEAA---GLLGLGRGKTSLP--VQTYGKYGGVFAHCLPPR 333
Query: 275 SDGTGRISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFE---FSA---IF 327
S GTG + FG P TP L PT Y + +T + VGG + F+A I
Sbjct: 334 STGTGYLDFGAGSPPATTTTPM-LTGNGPTFYYVGMTGIRVGGRLLPIAPSVFAAAGTIV 392
Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEK--RETSTSDLPFEYCYVLSPNQTNFEYPVVNL 385
DSGT T L AY+ + F + + R+ + L + CY + + P V+L
Sbjct: 393 DSGTVITRLPPAAYSSLRSAFAAAMAARGYRKAAAVSL-LDTCYDFT-GMSQVAIPTVSL 450
Query: 386 TMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
+GG V+ ++ + + L G +V I+G
Sbjct: 451 LFQGGAALDVDASGIMYTVSASQVCLAFAGNEDGGDVGIVG 491
>gi|168030587|ref|XP_001767804.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162680886|gb|EDQ67318.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 399
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 84/293 (28%), Positives = 131/293 (44%), Gaps = 30/293 (10%)
Query: 67 DRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTN-VSVGQPALSFIVALDTG 125
DR F RGR L + T + L +YT+ V +G P F + +DTG
Sbjct: 12 DRRFERRGRKLE-----------ESARMTLHDDLLTKGYYTSRVFIGTPPNEFALIVDTG 60
Query: 126 SDLFWLPCD-CVSCVHGLNSSSGQVIDFN--IYSPNTSSTSSKVPCNSTLCELQKQCPSA 182
S + ++PC C C H S S + + P SS+ K+ C S+ C + C S
Sbjct: 61 STVTYVPCSSCTHCGHHQASFSTHRLFCRDPRFKPENSSSYQKIGCRSSDC-ITGLCDSN 119
Query: 183 GSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAP 242
C Y+ R ++ + S G L +D+L + + +SFGC ++G A
Sbjct: 120 SHQCKYE-RMYAEMSTSKGVLGKDLLDFGPASRLQSQL---LSFGCETAESGDLYLQVA- 174
Query: 243 NGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISFGDKGSPGQGETPFSLRQ 300
+G+ GLG S+ L G I +SFS+C+G +G G + G +P S +
Sbjct: 175 DGIMGLGRGPLSIVDQLVGNGAIEDSFSLCYGGMDEGGGSMVLGAIPAPSGMVFAKSDPR 234
Query: 301 THPTYNITITQVSVGG-------NAVNFEFSAIFDSGTSFTYLNDPAYTQISE 346
YN+ +T++ V G N N +F I DSGT++ YL D A+ ++
Sbjct: 235 RSNYYNLELTEIQVQGASLKLDSNVFNGKFGTILDSGTTYAYLPDRAFEAFTD 287
>gi|449463971|ref|XP_004149703.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 641
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 87/330 (26%), Positives = 147/330 (44%), Gaps = 36/330 (10%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
T + +G P F + +DTGS + ++PC C C + + P +SST
Sbjct: 85 TRLWIGTPPQQFALIVDTGSTVTYVPCSTCEQCGRHQDPK---------FDPESSSTYKP 135
Query: 166 VPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
+ CN + C S G C Y+ +Y ++ + S+G L EDV+ QS+ + R
Sbjct: 136 IKCN-----IDCICDSDGVQCVYERQY-AEMSTSSGVLGEDVISFGN---QSELIPQRAV 186
Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISF 283
FGC ++TG A +G+ GLG S+ L +G I +SFS+C+G G G +
Sbjct: 187 FGCENMETGDLFSQRA-DGIMGLGTGDLSLVDQLVEKGAINDSFSLCYGGMDIGGGAMVL 245
Query: 284 GDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-------EFSAIFDSGTSFTYL 336
G P +S P YN+ + ++ V G + + A+ DSGT++ YL
Sbjct: 246 GGISPPSDMIFTYSDPVRSPYYNVDLKEIHVAGKKLPLSSGIFDGRYGAVLDSGTTYAYL 305
Query: 337 NDPAYT----QISETFNSLAK-EKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGG 391
A++ I + +SL K + + + D+ F + +N ++P V++ + G
Sbjct: 306 PAEAFSAFKDAIMDEIHSLKKIDGPDPNFKDICFSGAGSDAAELSN-KFPTVDMVFENGQ 364
Query: 392 PFFVNDPIVIVSSEPKGLYLYCLGVVKSDN 421
+ P K YCLG+ ++ N
Sbjct: 365 KLSLT-PENYFFRHSKVHGAYCLGIFENGN 393
>gi|242066140|ref|XP_002454359.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
gi|241934190|gb|EES07335.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
Length = 460
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 107/374 (28%), Positives = 166/374 (44%), Gaps = 47/374 (12%)
Query: 34 FHHRYSD----PVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTF 89
HHR+ P K + +++D + +A R ++ G A G +++ +T
Sbjct: 61 LHHRHGPCSPLPTKKMPSLEDRLHRDQL--RAAYIKRKFSGDVKKDGQGAGGVEQSHVTV 118
Query: 90 SAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQ 148
T LN+L +L V +G PA + V +D+GSD+ W+ C C+ C ++
Sbjct: 119 PTTLGT-SLNTLEYL--ITVRLGSPAKTQTVLIDSGSDVSWVQCKPCLQCHSQVDP---- 171
Query: 149 VIDFNIYSPNTSSTSSKVPCNSTLCELQKQ----CPSAGSNCPYQVRYLSDGTMSTGFLV 204
++ P+ SST S C+S C Q C S+ S C Y VRY +DG+ +TG
Sbjct: 172 -----LFDPSLSSTYSPFSCSSAACAQLGQDGNGC-SSSSQCQYIVRY-ADGSSTTGTYS 224
Query: 205 EDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGL 264
D L L ++ S FGC V++G F D +GL GLG S+ S A G
Sbjct: 225 SDTLALGSN------TISNFQFGCSHVESG-FND--LTDGLMGLGGGAPSLASQTA--GT 273
Query: 265 IPNSFSMCF--GSDGTGRISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVN- 320
+FS C +G ++ G G+ G +TP PT Y + + + VGG ++
Sbjct: 274 FGTAFSYCLPPTPSSSGFLTLG-AGTSGFVKTPMLRSSPVPTFYGVRLEAIRVGGTQLSI 332
Query: 321 ----FEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQT 376
F + DSGT T L AY+ +S F + K+ R + + C+ S Q+
Sbjct: 333 PTSVFSAGMVMDSGTIITRLPRTAYSALSSAFKAGMKQYRPAPPRSI-MDTCFDFS-GQS 390
Query: 377 NFEYPVVNLTMKGG 390
+ P V L GG
Sbjct: 391 SVRLPSVALVFSGG 404
>gi|312283333|dbj|BAJ34532.1| unnamed protein product [Thellungiella halophila]
Length = 428
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 92/308 (29%), Positives = 138/308 (44%), Gaps = 55/308 (17%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ ++G PA +VALDT +D W+PC CV C + ++ P+ SS+S
Sbjct: 91 YIVRANIGTPAQPMLVALDTSNDAAWVPCSGCVGCASSV-----------LFDPSKSSSS 139
Query: 164 SKVPCNSTLCELQKQCP----SAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
+ C++ C KQ P +AG +C + + Y G+ L +D L LA D +S
Sbjct: 140 RNLQCDAPQC---KQAPNPTCTAGKSCGFNMTY--GGSTIEASLTQDTLTLANDVIKS-- 192
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GS 275
+FGC TG+ L GL GLG S+ I Q L ++FS C S
Sbjct: 193 ----YTFGCISKATGTSLPA---QGLMGLGRGPLSL--ISQTQNLYMSTFSYCLPNSKSS 243
Query: 276 DGTGRISFGDKGSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEFSA-------- 325
+ +G + G K P + +T L+ + Y + + + VG V+ SA
Sbjct: 244 NFSGSLRLGPKYQPVRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDASTG 303
Query: 326 ---IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY---VLSPNQTNFE 379
IFDSGT FT L +PAY + F K TS F+ CY V+ P+ T F
Sbjct: 304 AGTIFDSGTVFTRLVEPAYVAVRNEFRRRIKNANATSLGG--FDTCYSGSVVYPSVT-FM 360
Query: 380 YPVVNLTM 387
+ +N+T+
Sbjct: 361 FAGMNVTL 368
>gi|147819672|emb|CAN76394.1| hypothetical protein VITISV_020864 [Vitis vinifera]
Length = 507
Score = 95.5 bits (236), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 77/279 (27%), Positives = 123/279 (44%), Gaps = 34/279 (12%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L++ + +G P+ + V +DTGSD+ W+ C C C + S +D +Y S+T
Sbjct: 77 LYFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRC----PTKSDLGVDLTLYDMKASTT 132
Query: 163 SSKVPCNSTLCEL-QKQCP--SAGSNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQS 217
S V C+ C L P G C Y V Y DG+ +TG+ V+D + + Q+
Sbjct: 133 SDAVGCDDNFCSLYDGPLPGCKPGLQCLYSVLY-GDGSSTTGYFVQDFVQYNRISGNFQT 191
Query: 218 KSVDSRISFGCGRVQTGSFLDGA-APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS- 275
+ + FGCG Q+G + A +G+ G G +S+ S LA+ G + FS C +
Sbjct: 192 TPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCLDNV 251
Query: 276 DGTGRISFGDKGSPGQGETPFSLRQ---------THPTYNITITQVSVGGNAVNFEFSA- 325
DG G + G+ P + F L + YN+ + ++ VGG+ ++ A
Sbjct: 252 DGGGIFAIGEVVEP---KVRFLLMNSVMIVVLFLSRAHYNVVMKEIEVGGDPLDVPSDAF 308
Query: 326 --------IFDSGTSFTYLNDPAYTQISETFNSLAKEKR 356
I DSGT+ Y Y + E S + R
Sbjct: 309 ESGDRKGTIIDSGTTLAYFPQEVYVPLIEKILSQQPDLR 347
>gi|449508297|ref|XP_004163275.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 641
Score = 95.5 bits (236), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 87/330 (26%), Positives = 147/330 (44%), Gaps = 36/330 (10%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
T + +G P F + +DTGS + ++PC C C + + P +SST
Sbjct: 85 TRLWIGTPPQQFALIVDTGSTVTYVPCSTCEQCGRHQDPK---------FDPESSSTYKP 135
Query: 166 VPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
+ CN + C S G C Y+ +Y ++ + S+G L EDV+ QS+ + R
Sbjct: 136 IKCN-----IDCICDSDGVQCVYERQY-AEMSTSSGVLGEDVISFGN---QSELIPQRAV 186
Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISF 283
FGC ++TG A +G+ GLG S+ L +G I +SFS+C+G G G +
Sbjct: 187 FGCENMETGDLFSQRA-DGIMGLGTGDLSLVDQLVEKGAINDSFSLCYGGMDIGGGAMVL 245
Query: 284 GDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-------EFSAIFDSGTSFTYL 336
G P +S P YN+ + ++ V G + + A+ DSGT++ YL
Sbjct: 246 GGISPPSDMIFTYSDPVRSPYYNVDLKEIHVAGKKLPLSSGIFDGRYGAVLDSGTTYAYL 305
Query: 337 NDPAYT----QISETFNSLAK-EKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGG 391
A++ I + +SL K + + + D+ F + +N ++P V++ + G
Sbjct: 306 PAEAFSAFKDAIMDEIHSLKKIDGPDPNFKDICFSGAGSDAAELSN-KFPTVDMVFENGQ 364
Query: 392 PFFVNDPIVIVSSEPKGLYLYCLGVVKSDN 421
+ P K YCLG+ ++ N
Sbjct: 365 KLSLT-PENYFFRHSKVHGAYCLGIFENGN 393
>gi|326501422|dbj|BAK02500.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 631
Score = 95.5 bits (236), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 89/335 (26%), Positives = 152/335 (45%), Gaps = 34/335 (10%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
T + +G P F + +D+GS + ++PC SC N + + P+ SST S V
Sbjct: 90 TRLHIGTPPQEFALIVDSGSTVTYVPC--ASCEQCGNHQDPR------FQPDLSSTYSPV 141
Query: 167 PCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISF 226
CN + C S + C Y+ +Y ++ + S+G L ED++ T +S+ R F
Sbjct: 142 KCN-----VDCTCDSDKNQCTYERQY-AEMSSSSGVLGEDIVSFGT---ESELKPQRAVF 192
Query: 227 GCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISFG 284
GC +TG A +G+ GLG + S+ L ++G+I +SFSMC+G G G + G
Sbjct: 193 GCENSETGDLFSQHA-DGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIGGGAMVLG 251
Query: 285 DKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE-------FSAIFDSGTSFTYLN 337
+P S P YNI + ++ V G A+ + + DSGT++ YL
Sbjct: 252 AMPAPPGMIYTHSNAVRSPYYNIELKEMHVAGKALRVDPRIFDGKHGTVLDSGTTYAYLP 311
Query: 338 DPAYTQISETFNSLAKEKRETSTSDLPF-EYCYV---LSPNQTNFEYPVVNLTMKGGGPF 393
+ A+ + +S ++ D + + C+ + +Q + +P V++ G G
Sbjct: 312 EQAFVAFKDAVSSQVHPLKKIRGPDPNYKDICFAGAGRNVSQLSEVFPKVDMVF-GNGQK 370
Query: 394 FVNDPIVIVSSEPKGLYLYCLGVVKS--DNVNIIG 426
P + K YCLGV ++ D ++G
Sbjct: 371 LSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLG 405
>gi|356514298|ref|XP_003525843.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 663
Score = 95.5 bits (236), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 88/329 (26%), Positives = 146/329 (44%), Gaps = 34/329 (10%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
T + +G P F + +DTGS + ++PC C C G+ D + P +SST
Sbjct: 114 TRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQC--------GRHQDPK-FQPESSSTYQP 164
Query: 166 VPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
V C + C C Y+ +Y ++ + S+G L EDV+ QS+ R
Sbjct: 165 VKCT-----IDCNCDGDRMQCVYERQY-AEMSTSSGVLGEDVISFG---NQSELAPQRAV 215
Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISF 283
FGC V+TG A +G+ GLG S+ L ++ +I +SFS+C+G G G +
Sbjct: 216 FGCENVETGDLYSQHA-DGIMGLGRGDLSIMDQLVDKKVISDSFSLCYGGMDVGGGAMVL 274
Query: 284 GDKGSPGQGETPFSLRQTHPTYNITITQVSVGG-------NAVNFEFSAIFDSGTSFTYL 336
G P +S P YNI + ++ V G N + + + DSGT++ YL
Sbjct: 275 GGISPPSDMTFAYSDPDRSPYYNIDLKEMHVAGKRLPLNANVFDGKHGTVLDSGTTYAYL 334
Query: 337 NDPAYTQISETFNSLAKEKRETSTSDLPF-EYCYVLSPN---QTNFEYPVVNLTMKGGGP 392
+ A+ + + ++ S D + + C+ + N Q + +PVV++ G G
Sbjct: 335 PEAAFLAFKDAIVKELQSLKQISGPDPNYNDICFSGAGNDVSQLSKSFPVVDMVF-GNGH 393
Query: 393 FFVNDPIVIVSSEPKGLYLYCLGVVKSDN 421
+ P + K YCLG+ ++ N
Sbjct: 394 KYSLSPENYMFRHSKVRGAYCLGIFQNGN 422
>gi|224031303|gb|ACN34727.1| unknown [Zea mays]
gi|413923868|gb|AFW63800.1| hypothetical protein ZEAMMB73_012138 [Zea mays]
Length = 557
Score = 95.5 bits (236), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 77/256 (30%), Positives = 112/256 (43%), Gaps = 32/256 (12%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
+YT++ +G P + + +DTGSDL W+ CD C +C G + +Y P +
Sbjct: 187 YYTSIFIGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHP---------LYKP---AK 234
Query: 163 SSKVPCNSTLCEL----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
VP LC+ Q C + C Y++ Y +D + S G L D +H+ +
Sbjct: 235 EKIVPPRDLLCQELQGNQNYCETC-KQCDYEIEY-ADQSSSMGVLARDDMHMIATNGGRE 292
Query: 219 SVDSRISFGCGRVQTGSFLDGAAP-NGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD- 276
+D FGC Q G L A +G+ GL S PS LA+ G+I N F C +
Sbjct: 293 KLD--FVFGCAYDQQGQLLSSPAKTDGILGLSSAAISFPSQLASHGIIANVFGHCITREQ 350
Query: 277 -GTGRISFGDKGSPGQGETPFSLRQ-THPTYNITITQVSVGGNAVNFEFSA------IFD 328
G G + GD P G T S+R Y+ V G + A IFD
Sbjct: 351 GGGGYMFLGDDYVPRWGVTWTSIRSGPDNLYHTQAHHVKYGDQQLRRPEQAGSTVQVIFD 410
Query: 329 SGTSFTYLNDPAYTQI 344
SG+S+TYL + Y +
Sbjct: 411 SGSSYTYLPNEIYENL 426
>gi|326511104|dbj|BAK03251.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 631
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 89/335 (26%), Positives = 152/335 (45%), Gaps = 34/335 (10%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
T + +G P F + +D+GS + ++PC SC N + + P+ SST S V
Sbjct: 90 TRLHIGTPPQEFALIVDSGSTVTYVPC--ASCEQCGNHQDPR------FQPDLSSTYSPV 141
Query: 167 PCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISF 226
CN + C S + C Y+ +Y ++ + S+G L ED++ T +S+ R F
Sbjct: 142 KCN-----VDCTCDSDKNQCTYERQY-AEMSSSSGVLGEDIVSFGT---ESELKPQRAVF 192
Query: 227 GCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISFG 284
GC +TG A +G+ GLG + S+ L ++G+I +SFSMC+G G G + G
Sbjct: 193 GCENSETGDLFSQHA-DGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIGGGAMVLG 251
Query: 285 DKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE-------FSAIFDSGTSFTYLN 337
+P S P YNI + ++ V G A+ + + DSGT++ YL
Sbjct: 252 AMPAPPGMIYTHSNAVRSPYYNIELKEMHVAGKALRVDPRIFDGKHGTVLDSGTTYAYLP 311
Query: 338 DPAYTQISETFNSLAKEKRETSTSDLPF-EYCYV---LSPNQTNFEYPVVNLTMKGGGPF 393
+ A+ + +S ++ D + + C+ + +Q + +P V++ G G
Sbjct: 312 EQAFVAFKDAVSSQVHPLKKIRGPDSNYKDICFAGAGRNVSQLSEVFPKVDMVF-GNGQK 370
Query: 394 FVNDPIVIVSSEPKGLYLYCLGVVKS--DNVNIIG 426
P + K YCLGV ++ D ++G
Sbjct: 371 LSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLG 405
>gi|225216995|gb|ACN85284.1| aspartic proteinase nepenthesin-1 precursor [Oryza australiensis]
Length = 519
Score = 95.1 bits (235), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 101/347 (29%), Positives = 150/347 (43%), Gaps = 44/347 (12%)
Query: 99 NSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYS 156
+LG +Y V +G PA + V DTGSD W+ C CV + ++
Sbjct: 173 RALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQRE--------KLFD 224
Query: 157 PNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
P SST + + C + C +G NC Y V+Y DG+ S GF D L L++ +
Sbjct: 225 PARSSTYANISCAAPACSDLDTRGCSGGNCLYGVQY-GDGSYSIGFFAMDTLTLSSYDAV 283
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--G 274
FGCG G F + A GL GLG KTS+P ++ F+ C
Sbjct: 284 KG-----FRFGCGERNEGLFGEAA---GLLGLGRGKTSLPVQTYDK--YGGVFAHCLPAR 333
Query: 275 SDGTGRISFGDKGSPGQG----ETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA---- 325
S GTG + FG GSP TP L PT Y + +T + VGG ++ S
Sbjct: 334 SSGTGYLDFG-PGSPAAAGARLTTPM-LTDNGPTFYYVGMTGIRVGGQLLSIPQSVFTTA 391
Query: 326 --IFDSGTSFTYLNDPAYTQISETFNS-LAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
I DSGT T L AY+ + F S +A + + + + CY + + P
Sbjct: 392 GTIVDSGTVITRLPPAAYSSLRSAFASAMAARGYKKAPAVSLLDTCYDFT-GMSQVAIPT 450
Query: 383 VNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD---NVNIIG 426
V+L +GG V+ ++ ++ + CLG ++ +V I+G
Sbjct: 451 VSLLFQGGARLDVDASGIMYAAS---VSQVCLGFAANEDGGDVGIVG 494
>gi|242050744|ref|XP_002463116.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
gi|241926493|gb|EER99637.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
Length = 632
Score = 95.1 bits (235), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 87/311 (27%), Positives = 138/311 (44%), Gaps = 43/311 (13%)
Query: 51 LPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVS 110
LP S+ S LA R RG G A N + L + Y + T +
Sbjct: 47 LPLTRSYPNASRLAASSR----RGLGDGAHPNARMRLHDDLLTNGY--------YTTRLY 94
Query: 111 VGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNS 170
+G P F + +D+GS + ++PC SC N + + P+ SS+ S V CN
Sbjct: 95 IGTPPQEFALIVDSGSTVTYVPC--ASCEQCGNHQDPR------FQPDLSSSYSPVKCN- 145
Query: 171 TLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGR 230
+ C S C Y+ +Y ++ + S+G L ED++ ++S+ R FGC
Sbjct: 146 ----VDCTCDSDKKQCTYERQY-AEMSSSSGVLGEDIVSFG---RESELKPQRAVFGCEN 197
Query: 231 VQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPG 290
+TG A +G+ GLG + S+ L +G+I +SFS+C+G G + G P
Sbjct: 198 SETGDLFSQHA-DGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDIGGGAMVLGGVPA 256
Query: 291 QGETPFS----LRQTHPTYNITITQVSVGGNAV-------NFEFSAIFDSGTSFTYLNDP 339
+ FS LR P YNI + ++ V G A+ N + + DSGT++ YL +
Sbjct: 257 PSDMVFSHSDPLRS--PYYNIELKEIHVAGKALRVDSRVFNSKHGTVLDSGTTYAYLPEQ 314
Query: 340 AYTQISETFNS 350
A+ + S
Sbjct: 315 AFVAFKDAVTS 325
>gi|219886219|gb|ACL53484.1| unknown [Zea mays]
gi|219888509|gb|ACL54629.1| unknown [Zea mays]
gi|414588374|tpg|DAA38945.1| TPA: nucellin-like aspartic protease [Zea mays]
Length = 415
Score = 95.1 bits (235), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 81/283 (28%), Positives = 122/283 (43%), Gaps = 46/283 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
+Y +++G PA + + +DTGSDL WL CD C SC + +Y P +
Sbjct: 53 YYVTMNIGNPAKPYFLDVDTGSDLTWLQCDAPCRSC---------NKVPHPLYRPTANRL 103
Query: 163 SSKVPCNSTLCEL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEK 215
VPC + LC +CPS C YQ++Y +D S G L+ D L
Sbjct: 104 ---VPCANALCTALHSGQGSNNKCPSP-KQCDYQIKY-TDSASSQGVLINDSFSLPM--- 155
Query: 216 QSKSVDSRISFGCGRVQT--GSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
+S ++ ++FGCG Q + AA +G+ GLG S+ S L QG+ N C
Sbjct: 156 RSSNIRPGLTFGCGYDQQVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVGHCL 215
Query: 274 GSDGTGRISFGDKGSPGQGET--PFSLRQTHPTYNITITQVSVGGNAVNFE--------F 323
++G G + FGD P T P + R + Y S G + F+
Sbjct: 216 STNGGGFLFFGDDVVPSSRVTWVPMAQRTSGNYY-------SPGSGTLYFDRRSLGVKPM 268
Query: 324 SAIFDSGTSFTYLNDPAYTQISETFN-SLAKEKRETSTSDLPF 365
+FDSG+++TY Y + L+K ++ S LP
Sbjct: 269 EVVFDSGSTYTYFTAQPYQAVVSALKGGLSKSLKQVSDPTLPL 311
>gi|195645150|gb|ACG42043.1| aspartic proteinase Asp1 precursor [Zea mays]
Length = 415
Score = 95.1 bits (235), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 81/283 (28%), Positives = 122/283 (43%), Gaps = 46/283 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
+Y +++G PA + + +DTGSDL WL CD C SC + +Y P +
Sbjct: 53 YYVTMNIGNPAKPYFLDVDTGSDLTWLQCDAPCRSC---------NKVPHPLYRPTANRL 103
Query: 163 SSKVPCNSTLCEL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEK 215
VPC + LC +CPS C YQ++Y +D S G L+ D L
Sbjct: 104 ---VPCANALCTALHSGQGSNNKCPSP-KQCDYQIKY-TDSASSQGVLINDSFSLPM--- 155
Query: 216 QSKSVDSRISFGCGRVQT--GSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
+S ++ ++FGCG Q + AA +G+ GLG S+ S L QG+ N C
Sbjct: 156 RSSNIRPGLTFGCGYDQQVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVGHCL 215
Query: 274 GSDGTGRISFGDKGSPGQGET--PFSLRQTHPTYNITITQVSVGGNAVNFE--------F 323
++G G + FGD P T P + R + Y S G + F+
Sbjct: 216 STNGGGFLFFGDDVVPSSRVTWVPMAQRTSGNYY-------SPGSGTLYFDRRSLGVKPM 268
Query: 324 SAIFDSGTSFTYLNDPAYTQISETFN-SLAKEKRETSTSDLPF 365
+FDSG+++TY Y + L+K ++ S LP
Sbjct: 269 EVVFDSGSTYTYFTAQPYQAVVSALKGGLSKSLKQVSDPTLPL 311
>gi|115479485|ref|NP_001063336.1| Os09g0452400 [Oryza sativa Japonica Group]
gi|51535935|dbj|BAD38017.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113631569|dbj|BAF25250.1| Os09g0452400 [Oryza sativa Japonica Group]
gi|215693279|dbj|BAG88661.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 441
Score = 95.1 bits (235), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 94/350 (26%), Positives = 145/350 (41%), Gaps = 52/350 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ +V +G P F +DTGSDL W C C+ CV Q + + P S++
Sbjct: 88 YLMDVGIGSPPRYFSAMIDTGSDLIWTQCAPCLLCVE-------QPTPY--FEPAKSTSY 138
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
+ +PC+S +C + C YQ Y D S G L + T+ ++ R
Sbjct: 139 ASLPCSSAMCNALYSPLCFQNACVYQAFY-GDSASSAGVLANETFTFGTNS--TRVAVPR 195
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDGTGR 280
+SFGCG + G+ +G +G+ G G S+ S L + FS C F S T R
Sbjct: 196 VSFGCGNMNAGTLFNG---SGMVGFGRGALSLVSQLGSP-----RFSYCLTSFMSPATSR 247
Query: 281 ISFGDKGS----------PGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA---- 325
+ FG + P Q TPF + PT Y + +T +SV G+ + + S
Sbjct: 248 LYFGAYATLNSTNTSSSGPVQ-STPFIVNPALPTMYFLNMTGISVAGDLLPIDPSVFAIN 306
Query: 326 --------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTN 377
I DSGT+ T+L PAY + F + R +T F+ C+ P
Sbjct: 307 ETDGTGGVIIDSGTTVTFLAQPAYAMVQGAFVAWVGLPRANATPSDTFDTCFKWPPPPRR 366
Query: 378 F-EYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
P + L G + +++ L CL ++ SD+ +IIG
Sbjct: 367 MVTLPEMVLHFDGADMELPLENYMVMDGGTGNL---CLAMLPSDDGSIIG 413
>gi|125563957|gb|EAZ09337.1| hypothetical protein OsI_31609 [Oryza sativa Indica Group]
gi|125605916|gb|EAZ44952.1| hypothetical protein OsJ_29595 [Oryza sativa Japonica Group]
Length = 438
Score = 95.1 bits (235), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 94/350 (26%), Positives = 145/350 (41%), Gaps = 52/350 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ +V +G P F +DTGSDL W C C+ CV Q + + P S++
Sbjct: 85 YLMDVGIGSPPRYFSAMIDTGSDLIWTQCAPCLLCVE-------QPTPY--FEPAKSTSY 135
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
+ +PC+S +C + C YQ Y D S G L + T+ ++ R
Sbjct: 136 ASLPCSSAMCNALYSPLCFQNACVYQAFY-GDSASSAGVLANETFTFGTNS--TRVAVPR 192
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDGTGR 280
+SFGCG + G+ +G +G+ G G S+ S L + FS C F S T R
Sbjct: 193 VSFGCGNMNAGTLFNG---SGMVGFGRGALSLVSQLGSP-----RFSYCLTSFMSPATSR 244
Query: 281 ISFGDKGS----------PGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA---- 325
+ FG + P Q TPF + PT Y + +T +SV G+ + + S
Sbjct: 245 LYFGAYATLNSTNTSSSGPVQ-STPFIVNPALPTMYFLNMTGISVAGDLLPIDPSVFAIN 303
Query: 326 --------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTN 377
I DSGT+ T+L PAY + F + R +T F+ C+ P
Sbjct: 304 ETDGTGGVIIDSGTTVTFLAQPAYAMVQGAFVAWVGLPRANATPSDTFDTCFKWPPPPRR 363
Query: 378 F-EYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
P + L G + +++ L CL ++ SD+ +IIG
Sbjct: 364 MVTLPEMVLHFDGADMELPLENYMVMDGGTGNL---CLAMLPSDDGSIIG 410
>gi|302811785|ref|XP_002987581.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
gi|300144735|gb|EFJ11417.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
Length = 511
Score = 94.7 bits (234), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 104/371 (28%), Positives = 160/371 (43%), Gaps = 61/371 (16%)
Query: 98 LNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYS 156
L G +Y + VG PA+ ++ +DTGSD+ W+ C C CV L ++
Sbjct: 132 LGQAGLEYYVPLQVGTPAVEVVLIMDTGSDVSWIQCVPCKDCVPALRPP---------FN 182
Query: 157 PNTSSTSSKVPCNSTLCE-----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLA 211
P SS+ K+PC S+ C ++ C +G C + ++Y DG++S+G L + +
Sbjct: 183 PRHSSSFFKLPCASSTCTNVYQGVKPFCSPSGRTCLFSIQY-GDGSLSSGLLAMETIAGN 241
Query: 212 T----DEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPN 267
T D + K S I+ GC + GA+ GL G+ S PS L+++
Sbjct: 242 TPNFGDGEPVKL--SNITLGCADIDREGLPTGAS--GLLGMDRRPISFPSQLSSR--YAR 295
Query: 268 SFSMCFGS-----DGTGRISFG--DKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAV- 319
FS CF + +G + FG D SP TP P+ ++ V + G +V
Sbjct: 296 KFSHCFPDKIAHLNSSGLVFFGESDIISPYLRYTPLVQNPAVPSASLDYYYVGLVGISVD 355
Query: 320 ---------NFEFS-------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDL 363
NF+ I DSGT+FTYL PA+ + F LA+ D
Sbjct: 356 ESRLPLSHKNFDIDKVTGSGGTIIDSGTAFTYLKKPAFQAMRREF--LARTSHLAKVDDN 413
Query: 364 P-FEYCYVLSPNQTNFE---YPVVNLTMKGGGPFFV--NDPIVIVSSEPKGLYLYCLGVV 417
F CY ++ E P + L +GG + N ++ VSS + L CL +
Sbjct: 414 SGFTPCYNITSGTAALESTILPSITLHFRGGLDVVLPKNSILIPVSSSEEQTTL-CLAFL 472
Query: 418 KSDNV--NIIG 426
S ++ NIIG
Sbjct: 473 MSGDIPFNIIG 483
>gi|357492389|ref|XP_003616483.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517818|gb|AES99441.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 94.7 bits (234), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 89/300 (29%), Positives = 129/300 (43%), Gaps = 37/300 (12%)
Query: 110 SVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPC 168
SVG P DTGSD+ WL C+ C C + I++P+ SS+ +PC
Sbjct: 92 SVGTPPTKIYGIADTGSDIVWLQCEPCEQCYNQTTP---------IFNPSKSSSYKNIPC 142
Query: 169 NSTLCELQKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFG 227
+S LC + + N C Y++ Y D + S G L D L L + S +I G
Sbjct: 143 SSKLCHSVRDTSCSDQNSCQYKISY-GDSSHSQGDLSVDTLSLESTSGSPVSF-PKIVIG 200
Query: 228 CGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF------GSDGTGRI 281
CG G+F G A +G+ GLG S+ + L + I FS C S+ + +
Sbjct: 201 CGTDNAGTF--GGASSGIVGLGGGPVSLITQLGSS--IGGKFSYCLVPLLNKESNASSIL 256
Query: 282 SFGDKG-SPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNF---------EFSAIFDSG 330
SFGD G G L + P Y +T+ SVG V F E + I DSG
Sbjct: 257 SFGDAAVVSGDGVVSTPLIKKDPVFYFLTLQAFSVGNKRVEFGGSSEGGDDEGNIIIDSG 316
Query: 331 TSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGG 390
T+ T + YT + L K R + F CY L N+ +++P++ + KG
Sbjct: 317 TTLTLIPSDVYTNLESAVVDLVKLDR-VDDPNQQFSLCYSLKSNE--YDFPIITVHFKGA 373
>gi|302783112|ref|XP_002973329.1| hypothetical protein SELMODRAFT_413603 [Selaginella moellendorffii]
gi|300159082|gb|EFJ25703.1| hypothetical protein SELMODRAFT_413603 [Selaginella moellendorffii]
Length = 437
Score = 94.7 bits (234), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 103/346 (29%), Positives = 149/346 (43%), Gaps = 38/346 (10%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L+YT + +G P V +DTGSD+ W+ C C SC+ S + +IY+ + SST
Sbjct: 82 LYYTEIGLGNPVQKLKVIVDTGSDILWVKCSPCRSCL----SKQDIIPPLSIYNLSASST 137
Query: 163 SSKVPCNSTLCE-LQKQCPSAGSN--CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
SS C+ LC Q C +GSN C Y + Y D + S G V+D +H + +
Sbjct: 138 SSVSSCSDPLCTGEQAVCSRSGSNSACAYGISY-QDKSTSIGAYVKDDMHYVL--QGGNA 194
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD--G 277
S I FGC TGS+ +G+ G G +VP+ +A Q + FS C G + G
Sbjct: 195 TTSHIFFGCAINITGSW----PADGIMGFGQISKTVPNQIATQRNMSRVFSHCLGGEKHG 250
Query: 278 TGRISFGDKGSPGQGETPFS-LRQTHPTYNITITQVSVGGNAV---NFEFS--------- 324
G + FG++ P E F+ L YN+ + +SV + + EFS
Sbjct: 251 GGILEFGEE--PNTTEMVFTPLLNVTTHYNVDLLSISVNSKVLPIDSKEFSYVSNSTNET 308
Query: 325 -AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQT-NFEYPV 382
I DSGTSF L A + +L K L C+ L T +P
Sbjct: 309 GVIIDSGTSFALLATKANRILFSEIKNLTTAKLGPKLEGLQ---CFYLKSGLTVETSFPN 365
Query: 383 VNLTMKGGGPFFVN-DPIVIVSSEPKGLYLYCLGVVKSDNVNIIGR 427
V LT GG + D +++ K YC +D + I G
Sbjct: 366 VTLTFSGGSTMKLKPDNYLVMVELKKKRNGYCYAWSSADGLTIFGE 411
>gi|226532674|ref|NP_001151415.1| pepsin A precursor [Zea mays]
gi|195646632|gb|ACG42784.1| pepsin A [Zea mays]
Length = 492
Score = 94.7 bits (234), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 87/308 (28%), Positives = 137/308 (44%), Gaps = 35/308 (11%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
L+YT + +G P + V +DTGSD+ W+ + +SC G + SG I+ Y P S T+
Sbjct: 84 LYYTRIEIGSPPKGYYVQVDTGSDILWV--NGISC-DGCPTRSGLGIELTQYDPAGSGTT 140
Query: 164 SKVPCNSTLC-------ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDE 214
V C C + CPSA S C +++ Y DG+ +TGF V D + +
Sbjct: 141 --VGCEQEFCVANSAASGVPPACPSAASPCQFRITY-GDGSSTTGFYVTDFVQYNQVSGN 197
Query: 215 KQSKSVDSRISFGCGRVQTGSFLDGA--APNGLFGLGMDKTSVPSILANQGLIPNSFSMC 272
Q+ + I+FGCG Q G L + A +G+ G G S+ S LA + F+ C
Sbjct: 198 GQTTPSNVSITFGCG-AQLGGDLGSSSQALDGILGFGQSDASMLSQLAAARKVRKIFAHC 256
Query: 273 FGS-DGTGRISFGDKGSPG-QGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS------ 324
+ G G + G+ P TP TH YN+ + +SVGG + S
Sbjct: 257 LDTVRGGGIFAIGNVVQPPIVKTTPLVPNATH--YNVNLQGISVGGATLQLPTSTFDSGD 314
Query: 325 ---AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYP 381
I DSGT+ YL Y + ++ + + + + C+ S + E+P
Sbjct: 315 SKGTIIDSGTTLAYLPREVYRTL---LTAVFDKHPDLAVRNYEDFICFQFS-GSLDEEFP 370
Query: 382 VVNLTMKG 389
V+ + +G
Sbjct: 371 VITFSFEG 378
>gi|225216988|gb|ACN85278.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
Length = 518
Score = 94.4 bits (233), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 101/347 (29%), Positives = 152/347 (43%), Gaps = 44/347 (12%)
Query: 99 NSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYS 156
+LG +Y V +G PA + V DTGSD W+ C CV + ++
Sbjct: 172 RALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQQE--------KLFD 223
Query: 157 PNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
P SST + V C + C +G +C Y V+Y DG+ S GF D L L++ +
Sbjct: 224 PARSSTYANVSCAAPACFDLDTRGCSGGHCLYGVQY-GDGSYSIGFFAMDTLTLSSYDAV 282
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--G 274
FGCG G F + A GL GLG KTS+P ++ F+ C
Sbjct: 283 KG-----FRFGCGERNEGLFGEAA---GLLGLGRGKTSLPVQTYDK--YGGVFAHCLPAR 332
Query: 275 SDGTGRISFGDKGSPGQG----ETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA---- 325
S GTG + FG GSP TP L PT Y + +T + VGG ++ S
Sbjct: 333 SSGTGYLDFG-PGSPAAAGARLTTPM-LTDNGPTFYYVGMTGIRVGGQLLSIPQSVFATA 390
Query: 326 --IFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
I DSGT T L PAY+ + F +++A + + + + CY + + P
Sbjct: 391 GTIVDSGTVITRLPPPAYSSLRSAFVSAMAARGYKKAPAVSLLDTCYDFT-GMSQVAIPT 449
Query: 383 VNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD---NVNIIG 426
V+L +GG V+ ++ ++ + CLG ++ +V I+G
Sbjct: 450 VSLLFQGGAILDVDASGIMYAAS---VSQVCLGFAANEDGGDVGIVG 493
>gi|226504334|ref|NP_001141706.1| uncharacterized protein LOC100273835 precursor [Zea mays]
gi|194705620|gb|ACF86894.1| unknown [Zea mays]
gi|414885968|tpg|DAA61982.1| TPA: hypothetical protein ZEAMMB73_231717 [Zea mays]
Length = 477
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 89/304 (29%), Positives = 136/304 (44%), Gaps = 39/304 (12%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++T++ +G PA +V LDTGSD W+ C C C + ++ P+ SST
Sbjct: 134 YFTSLRLGTPATDLLVELDTGSDQSWIQCKPCPDCYEQHEA---------LFDPSKSSTY 184
Query: 164 SKVPCNSTLCEL----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
S + C+S C+ K S+ CPY++ Y +D + + G L D L L+ +
Sbjct: 185 SDITCSSRECQELGSSHKHNCSSDKKCPYEITY-ADDSYTVGNLARDTLTLSPTDAVPGF 243
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS--DG 277
V FGCG GSF +GL GLG K S+ S +A + FS C S
Sbjct: 244 V-----FGCGHNNAGSF---GEIDGLLGLGRGKASLSSQVAAR--YGAGFSYCLPSSPSA 293
Query: 278 TGRISFG--DKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA-------IF 327
TG +SF +P + + HP+ Y + +T ++V G A+ S I
Sbjct: 294 TGYLSFSGAAAAAPTNAQFTEMVAGQHPSFYYLNLTGITVAGRAIKVPPSVFATAAGTII 353
Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTM 387
DSGT+F+ L AY + + S + +S + F+ CY L+ ++T P V L
Sbjct: 354 DSGTAFSCLPPSAYAALRSSVRSAMGRYKRAPSSTI-FDTCYDLTGHET-VRIPSVALVF 411
Query: 388 KGGG 391
G
Sbjct: 412 ADGA 415
>gi|86438622|emb|CAJ26370.1| chloroplast nucleoid binding protein [Brachypodium sylvaticum]
Length = 443
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 115/389 (29%), Positives = 157/389 (40%), Gaps = 53/389 (13%)
Query: 36 HRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDT 95
HR+ P + DD P + A D R+ A G D ++ A
Sbjct: 24 HRHG-PCSPLQTPDDAPSDADLLEHDQ-ARVDSIHRMIANETAVVGQD---VSLPA---- 74
Query: 96 YRLNSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWL---PCDCVSCVHGLNSSSGQVID 151
R S+G +Y +V +G PA V DTGSDL W+ PC C H +
Sbjct: 75 ERGISVGTGNYVVSVGLGTPARDLTVVFDTGSDLSWVQCGPCSSGGCYHQQDP------- 127
Query: 152 FNIYSPNTSSTSSKVPCNSTLCELQKQ-CPSAGSN--CPYQVRYLSDGTMSTGFLVEDVL 208
+++P++SST S V C C +Q C S+ + CPY+V Y D + + G L D L
Sbjct: 128 --LFAPSSSSTFSAVRCGEPECPRARQSCSSSPGDDRCPYEVVY-GDKSRTVGHLGNDTL 184
Query: 209 HLATDEKQSKSVDSR-----ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQG 263
L T + S ++ FGCG TG F +GLFGLG K S+ S A G
Sbjct: 185 TLGTTPSTNASENNSNKLPGFVFGCGENNTGLF---GKADGLFGLGRGKVSLSSQAA--G 239
Query: 264 LIPNSFSMCF---GSDGTGRISFGDKG-SPGQGE-TPFSLRQTHPT-YNITITQVSVGGN 317
FS C S+ G +S G +P TP R P+ Y + + + V G
Sbjct: 240 KYGEGFSYCLPSSSSNAHGYLSLGTPAPAPAHARFTPMLNRSNTPSFYYVKLVGIRVAGR 299
Query: 318 AVN-------FEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKE---KRETSTSDLPFEY 367
A+ + I DSGT T L AY+ + F S + KR S L Y
Sbjct: 300 AIKVSSRPALWPAGLIVDSGTVITRLAPRAYSALRTAFLSAMGKYGYKRAPRLSILDTCY 359
Query: 368 CYVLSPNQTNFEYPVVNLTMKGGGPFFVN 396
+ N T P V L GG V+
Sbjct: 360 DFTAHANAT-VSIPAVALVFAGGATISVD 387
>gi|302790323|ref|XP_002976929.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
gi|300155407|gb|EFJ22039.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
Length = 373
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 98/369 (26%), Positives = 157/369 (42%), Gaps = 53/369 (14%)
Query: 108 NVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
+G P ++ +DT S+L W+ SC N S +V FN P SS+ P
Sbjct: 2 QTKIGTPPREVLLLVDTASELTWV--QGTSCT---NCSPTKVPPFN---PGLSSSFISEP 53
Query: 168 CNSTLC------ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
C S++C Q C + +C +QV YL DG+ + G + ++ L + + + ++
Sbjct: 54 CTSSVCLGRSKLGFQSACNRSTGSCSFQVAYL-DGSEAYGVIAREIFSLQSWDGAASTLG 112
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGL--IPNSFSMCFGS---- 275
I FGC +D ++ G GL S P+ + ++ + + FS CF +
Sbjct: 113 DVI-FGCASKDLQRPVDFSS--GTLGLNRGSFSFPAQIGSRSKSGLSDRFSYCFPNRAEH 169
Query: 276 -DGTGRISFGDKGSPGQGETPFSLRQTHPT------YNITITQVSVGGNAVNFEFSAI-- 326
+ +G I FGD G P SL Q P Y + + +SVGG ++ SA
Sbjct: 170 LNSSGVIIFGDSGIPAHHFQYLSLEQEPPIASIVDFYYVGLQGISVGGELLHIPRSAFKI 229
Query: 327 ---------FDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTN 377
FDSGT+ ++L +PA+T + E F TS SD E CY ++
Sbjct: 230 DRLGNGGTYFDSGTTVSFLVEPAHTALVEAFGRRVLHLNRTSGSDFTKELCYDVAAGDAR 289
Query: 378 F-EYPVVNLTMKGGGPFFVNDPIVIVS-SEPKGLYLYCL-----GVVKSDNVNIIG---- 426
P+V L K + + V V + + CL G V VN+IG
Sbjct: 290 LPTAPLVTLHFKNNVDMELREASVWVPLARTPQVVTICLAFVNAGAVAQGGVNVIGNYQQ 349
Query: 427 REYPIANNI 435
++Y I +++
Sbjct: 350 QDYLIEHDL 358
>gi|414885970|tpg|DAA61984.1| TPA: hypothetical protein ZEAMMB73_915310 [Zea mays]
Length = 523
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 94/308 (30%), Positives = 132/308 (42%), Gaps = 37/308 (12%)
Query: 96 YRLNSLGFLHY-TNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFN 153
+R LG +Y +V +G P +V DTGSDL W+ C C +C +
Sbjct: 178 HRGLRLGTANYIVSVGLGTPRRDLLVVFDTGSDLSWVQCKPCNNCYKQHDP--------- 228
Query: 154 IYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
++ P+ S+T S VPC + C C S C Y+V Y D + + G L D L L
Sbjct: 229 LFDPSQSTTYSAVPCGAQECLDSGTCSSG--KCRYEVVY-GDMSQTDGNLARDTLTLGPS 285
Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
Q + FGCG TG F +GLFGLG D+ S+ S A + FS C
Sbjct: 286 SDQLQG----FVFGCGDDDTGLF---GRADGLFGLGRDRVSLASQAAAR--YGAGFSYCL 336
Query: 274 GSD--GTGRISFGDKGSPGQGE-TPFSLRQTHPT-YNITITQVSVGGNAVNFE---FSA- 325
S G +S G +P + T R P+ Y + + + V G V F A
Sbjct: 337 PSSWRAEGYLSLGSAAAPPHAQFTAMVTRSDTPSFYYLDLVGIKVAGRTVRVAPAVFKAP 396
Query: 326 --IFDSGTSFTYLNDPAYTQISETFNSLAKE-KRETSTSDLPFEYCYVLSPNQTNFEYPV 382
+ DSGT T L AY+ + +F + KR + S L + CY + +T + P
Sbjct: 397 GTVIDSGTVITRLPSRAYSALRSSFAGFMRRYKRAPALSIL--DTCYDFT-GRTKVQIPS 453
Query: 383 VNLTMKGG 390
V L GG
Sbjct: 454 VALLFDGG 461
>gi|225437854|ref|XP_002264056.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
Length = 436
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 119/462 (25%), Positives = 188/462 (40%), Gaps = 64/462 (13%)
Query: 1 MASSYRNSPVCVLLILLSCCAGCCFGFGTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYY 60
MASS + + +LL+L + F + R + +++ + G++ +
Sbjct: 1 MASSASHMIIVILLVL--AVSSALFSPAASTWRSLDRRPEKNGFRVSLRHVDSGGNYTKF 58
Query: 61 SALAHRDRYFRLRGRGLAAQGNDKTPLT---FSAGNDTYRLNSLGFLHYTNVSVGQPALS 117
L + RLR + L+A+ P AGN + +N +++G PA +
Sbjct: 59 ERLQRAVKRGRLRLQRLSAKTASFEPSVEAPVHAGNGEFLMN---------LAIGTPAET 109
Query: 118 FIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQ 176
+ +DTGSDL W C C C I+ P SS+ SK+PC+S LC +
Sbjct: 110 YSAIMDTGSDLIWTQCKPCKVCFDQPTP---------IFDPEKSSSFSKLPCSSDLC-VA 159
Query: 177 KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTG-S 235
S C Y+ Y D + + G L + SV S+I FGCG G +
Sbjct: 160 LPISSCSDGCEYRYSY-GDHSSTQGVLATETFTFG-----DASV-SKIGFGCGEDNRGRA 212
Query: 236 FLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GSDGTGRISFGDKGSPGQ 291
+ GA GL GLG S+ S L +P FS C S G + G + +
Sbjct: 213 YSQGA---GLVGLGRGPLSLISQLG----VPK-FSYCLTSIDDSKGISTLLVGSEATVKS 264
Query: 292 G-ETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA-----------IFDSGTSFTYLND 338
TP + P+ Y +++ +SVG + E S I DSGT+ TYL D
Sbjct: 265 AIPTPLIQNPSRPSFYYLSLEGISVGDTLLPIEKSTFSIQDDGSGGLIIDSGTTITYLKD 324
Query: 339 PAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDP 398
A+ + + F S K + S S E C+ L P+ + + P + +G +
Sbjct: 325 SAFAALKKEFISQMKLDVDASGST-ELELCFTLPPDGSPVDVPQLVFHFEGVDLKLPKEN 383
Query: 399 IVIVSSEPKGLYLYCLGVVKSDNVNIIGREYPIANNISLFHN 440
+I E L + CL + S ++I G NI + H+
Sbjct: 384 YII---EDSALRVICLTMGSSSGMSIFGNFQ--QQNIVVLHD 420
>gi|356554625|ref|XP_003545645.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 452
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 81/272 (29%), Positives = 121/272 (44%), Gaps = 28/272 (10%)
Query: 105 HYT-NVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
HYT ++++G P + + +D+GSDL W+ CD C C + +Y PN
Sbjct: 63 HYTVSLNIGYPPKLYDLDIDSGSDLTWVQCDAPCKGCTKPRD---------QLYKPN--- 110
Query: 162 TSSKVPCNSTLCE-----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
+ V C LC ++ C S C Y+V Y G+ S G LV D ++
Sbjct: 111 -HNLVQCVDQLCSEVQLSMEYTCASPDDQCDYEVEYADHGS-SLGVLVRD--YIPFQFTN 166
Query: 217 SKSVDSRISFGCGRVQTGSFLDGA-APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
V R++FGCG Q S + A +G+ GLG + S+ S L + GLI N C +
Sbjct: 167 GSVVRPRVAFGCGYDQKYSGSNSPPATSGVLGLGNGRASILSQLHSLGLIHNVVGHCLSA 226
Query: 276 DGTGRISFGDKGSPGQGETPFSL--RQTHPTYNITITQVSVGGNAVNFE-FSAIFDSGTS 332
G G + FGD P G S+ + Y+ ++ G A + IFDSG+S
Sbjct: 227 RGGGFLFFGDDFIPSSGIVWTSMLPSSSEKHYSSGPAELVFNGKATVVKGLELIFDSGSS 286
Query: 333 FTYLNDPAYTQISETFNSLAKEKRETSTSDLP 364
+TY N AY + + K K+ +D P
Sbjct: 287 YTYFNSQAYQAVVDLVTQDLKGKQLKRATDDP 318
>gi|297820186|ref|XP_002877976.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
lyrata]
gi|297323814|gb|EFH54235.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
lyrata]
Length = 425
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 89/308 (28%), Positives = 138/308 (44%), Gaps = 55/308 (17%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ ++G PA + +VALDT +D W+PC CV C + ++ P+ SS+S
Sbjct: 88 YIVRANIGTPAQAMLVALDTSNDAAWIPCSGCVGCSSSV-----------LFDPSKSSSS 136
Query: 164 SKVPCNSTLCELQKQCP----SAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
+ C + C KQ P + +C + + Y G+ +L +D L LATD
Sbjct: 137 RTLQCEAPQC---KQAPNPSCTVSKSCGFNMTY--GGSAIEAYLTQDTLTLATD------ 185
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GS 275
V +FGC +G+ L GL GLG S+ I +Q L ++FS C S
Sbjct: 186 VIPNYTFGCINKASGTSLPA---QGLMGLGRGPLSL--ISQSQNLYQSTFSYCLPNSKSS 240
Query: 276 DGTGRISFGDKGSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEFSA-------- 325
+ +G + G K P + +T L+ + Y + + + VG V+ SA
Sbjct: 241 NFSGSLRLGPKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATG 300
Query: 326 ---IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY---VLSPNQTNFE 379
IFDSGT +T L +PAY + F K TS F+ CY V+ P+ T F
Sbjct: 301 AGTIFDSGTVYTRLVEPAYVAMRNEFRRRVKNANATSLGG--FDTCYSGSVVFPSVT-FM 357
Query: 380 YPVVNLTM 387
+ +N+T+
Sbjct: 358 FAGMNVTL 365
>gi|242073260|ref|XP_002446566.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
gi|241937749|gb|EES10894.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
Length = 452
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 112/393 (28%), Positives = 164/393 (41%), Gaps = 67/393 (17%)
Query: 65 HRDRYFRLRGRGL-AAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALD 123
HR R G+ A G + AGN + ++ V++G PALS+ +D
Sbjct: 68 HRMSRLVARATGVKAVAGGGDLQVPVHAGNGEFLMD---------VAIGTPALSYAAIVD 118
Query: 124 TGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC-ELQKQCPS 181
TGSDL W C CV C ++ P++SST + VPC+S LC +L +
Sbjct: 119 TGSDLVWTQCKPCVDCFKQSTP---------VFDPSSSSTYATVPCSSALCSDLPTSTCT 169
Query: 182 AGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGS-FLDGA 240
+ S C Y Y D + + G L + L ++K+ V +FGCG G F GA
Sbjct: 170 SASKCGYTYTY-GDASSTQGVLASETFTLGKEKKKLPGV----AFGCGDTNEGDGFTQGA 224
Query: 241 APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS--DGTGRISFGDKG----------- 287
GL GLG S+ S L GL + FS C S DG G+ G
Sbjct: 225 ---GLVGLGRGPLSLVSQL---GL--DKFSYCLTSLDDGDGKSPLLLGGSAAAISESAAT 276
Query: 288 SPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA-----------IFDSGTSFTY 335
+P Q TP + P+ Y +++T ++VG + SA I DSGTS TY
Sbjct: 277 APVQ-TTPLVKNPSQPSFYYVSLTGLTVGSTRITLPASAFAIQDDGTGGVIVDSGTSITY 335
Query: 336 LNDPAYTQISETFNSLAKEKRET-STSDLPFEYCYVLSPNQTN-FEYPVVNLTMKGGGPF 393
L Y + + F +A+ T S++ + C+ + + P + L GG
Sbjct: 336 LELQGYRALKKAF--VAQMALPTVDGSEIGLDLCFQGPAKGVDEVQVPKLVLHFDGGADL 393
Query: 394 FVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
+ +V G CL V S ++IIG
Sbjct: 394 DLPAENYMVLDSASG--ALCLTVAPSRGLSIIG 424
>gi|147862576|emb|CAN79341.1| hypothetical protein VITISV_006338 [Vitis vinifera]
Length = 436
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 111/408 (27%), Positives = 168/408 (41%), Gaps = 62/408 (15%)
Query: 55 GSFAYYSALAHRDRYFRLRGRGLAAQGNDKTP---LTFSAGNDTYRLNSLGFLHYTNVSV 111
G++ + L + RLR + L+A+ P AGN + +N +++
Sbjct: 53 GNYTKFERLQRAVKRGRLRLQRLSAKTASFEPSVEAPVHAGNGEFLMN---------LAI 103
Query: 112 GQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNS 170
G PA ++ +DTGSDL W C C C I+ P SS+ SK+PC+S
Sbjct: 104 GTPAETYSAIMDTGSDLIWTQCKPCKVCFDQPTP---------IFDPEKSSSFSKLPCSS 154
Query: 171 TLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGR 230
LC + S C Y+ Y D + + G L + SV S+I FGCG
Sbjct: 155 DLC-VALPISSCSDGCEYRYSY-GDHSSTQGVLATETFTFG-----DASV-SKIGFGCGE 206
Query: 231 VQTG-SFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GSDGTGRISFGD 285
G ++ GA GL GLG S+ S L +P FS C S G + G
Sbjct: 207 DNRGRAYSQGA---GLVGLGRGPLSLISQLG----VPK-FSYCLTSIDDSKGISTLLVGS 258
Query: 286 KGSPGQG-ETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA-----------IFDSGTS 332
+ + TP + P+ Y +++ +SVG + E S I DSGT+
Sbjct: 259 EATVKSAIPTPLIQNPSRPSFYYLSLEGISVGDTLLPIEKSTFSIQDDGSGGLIIDSGTT 318
Query: 333 FTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGP 392
TYL D A+ + + F S K + S S E C+ L P+ + E P + +G
Sbjct: 319 ITYLKDNAFAALKKEFISQMKLDVDASGST-ELELCFTLPPDGSPVEVPQLVFHFEGVDL 377
Query: 393 FFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREYPIANNISLFHN 440
+ +I E L + CL + S ++I G NI + H+
Sbjct: 378 KLPKENYII---EDSALRVICLTMGSSSGMSIFGNFQ--QQNIVVLHD 420
>gi|356518800|ref|XP_003528065.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 438
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 105/367 (28%), Positives = 156/367 (42%), Gaps = 52/367 (14%)
Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPN 158
+GF + T +++GQP + + +DTGSDL WL CD C C + +Y P
Sbjct: 74 VGFYNVT-LNIGQPPRPYFLDIDTGSDLTWLQCDAPCSRCSQTPHP---------LYRP- 122
Query: 159 TSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRY---LSDGTMSTGFLVEDVLHLA-TDE 214
++ VPC +LC + P+Q Y +D S G L+ DV L T+
Sbjct: 123 ---SNDFVPCRHSLCASLHHSDNYDCEVPHQCDYEVQYADHYSSLGVLLHDVYTLNFTNG 179
Query: 215 KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG 274
Q K R++ GCG Q +G+ GLG KTS+ S L +QGL+ N C
Sbjct: 180 VQLKV---RMALGCGYDQIFPDPSHHPLDGMLGLGRGKTSLTSQLNSQGLVRNVIGHCLS 236
Query: 275 SDGTGRISFGD-KGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-EFSAIFDSGTS 332
+ G G I FGD S TP S R ++ GG A+FD+G+S
Sbjct: 237 AQGGGYIFFGDVYDSSRLTWTPMSSRDYKHYSAAGAAELLFGGKKSGIGSLHAVFDTGSS 296
Query: 333 FTYLNDPAYTQI-----SETFNSLAKEKRETSTSDL------PFEYCYVLSPNQTNFEYP 381
+TY N AY + E+ KE + T L PF Y + + F+
Sbjct: 297 YTYFNPYAYQALISWLGKESGGKPLKEAHDDQTLPLCWRGRRPFRSIYEV---RKYFKPI 353
Query: 382 VVNLTMKGGGPFFVNDP---IVIVSSEPKGLYLYCLGVVKSDNV-----NIIGREYPIAN 433
V++ T G P +I+S+ + CLG++ V N+IG + + N
Sbjct: 354 VLSFTSNGRSKAQFEMPPEAYLIISN----MGNVCLGILNGSEVGMGDLNLIG-DISMLN 408
Query: 434 NISLFHN 440
+ +F N
Sbjct: 409 KVMVFDN 415
>gi|449439393|ref|XP_004137470.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
gi|449486840|ref|XP_004157418.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 570
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 78/268 (29%), Positives = 124/268 (46%), Gaps = 35/268 (13%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
L+YT + VG+P + + +DTGSDL W+ CD C SC G + +Y P +
Sbjct: 198 LYYTYIMVGEPPRPYFLDIDTGSDLTWVQCDAPCSSCGKGRSP---------LYKPRREN 248
Query: 162 TSSKVPCNSTLC-ELQK-----QCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEK 215
V +LC E+Q+ QC +A C Y+V+Y +D + S G LV+D L
Sbjct: 249 V---VSFKDSLCMEVQRNYDGDQC-AACQQCNYEVQY-ADQSSSLGVLVKDEFTLRFSNG 303
Query: 216 QSKSVDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG 274
+++ FGC Q G L+ + +G+ GL K S+PS LA++G+I N C
Sbjct: 304 SLTKLNA--IFGCAYDQQGLLLNTLSKTDGILGLSRAKVSLPSQLASRGIINNVVGHCLT 361
Query: 275 SD--GTGRISFGDKGSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEF------S 324
D G G + GD P G ++ + Y + ++ G ++ +
Sbjct: 362 GDPAGGGYLFLGDDFVPQWGMAWVAMLDSPSIDFYQTKVVRIDYGSIPLSLDTWGSSREQ 421
Query: 325 AIFDSGTSFTYLNDPAYTQISETFNSLA 352
+FDSG+S+TY AY Q+ ++
Sbjct: 422 VVFDSGSSYTYFTKEAYYQLVANLEEVS 449
>gi|12323376|gb|AAG51657.1|AC010704_1 nucellin-like protein; 27671-25467 [Arabidopsis thaliana]
Length = 427
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 80/277 (28%), Positives = 115/277 (41%), Gaps = 41/277 (14%)
Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPN 158
LG+ +Y +++G P F + +DTGSDL W+ CD C C Y PN
Sbjct: 64 LGY-YYVLLNIGNPPKLFDLDIDTGSDLTWVQCDAPCNGCTK--------------YKPN 108
Query: 159 TSSTSSKVPCNSTLCE-----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
++ +PC+ LC + C C Y++ Y SD S G LV D + L
Sbjct: 109 HNT----LPCSHILCSGLDLPQDRPCADPEDQCDYEIGY-SDHASSIGALVTDEVPLKL- 162
Query: 214 EKQSKSVDSRISFGCG-RVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC 272
++ R++FGCG Q G+ GLG K + + L + G+ N C
Sbjct: 163 -ANGSIMNLRLTFGCGYDQQNPGPHPPPPTAGILGLGRGKVGLSTQLKSLGITKNVIVHC 221
Query: 273 FGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYN-------ITITQVSVGGNAVNFEFSA 325
G G +S GD+ P G T SL P+ N + + G +N
Sbjct: 222 LSHTGKGFLSIGDELVPSSGVTWTSLATNSPSKNYMAGPAELLFNDKTTGVKGIN----V 277
Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSD 362
+FDSG+S+TY N AY I + K T T D
Sbjct: 278 VFDSGSSYTYFNAEAYQAILDLIRKDLNGKPLTDTKD 314
>gi|384252236|gb|EIE25712.1| acid protease [Coccomyxa subellipsoidea C-169]
Length = 599
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 97/348 (27%), Positives = 157/348 (45%), Gaps = 59/348 (16%)
Query: 106 YTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
Y + +G PA F V +DTGS + ++PC SC + G + P +SS+S+
Sbjct: 63 YATLHLGTPARQFAVIVDTGSTITYVPC--ASC----GRNCGPHHKDAAFDPASSSSSAV 116
Query: 166 VPCNSTLCELQKQCPSAG----SNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
+ C+S C + P G C YQ Y ++ + S G LV D L L + +V+
Sbjct: 117 IGCDSDKCICGR--PPCGCSEKRECTYQRTY-AEQSSSAGLLVSDQLQL-----RDGAVE 168
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGTGR 280
+ FGC +TG + A +G+ GLG + S+ + LA G+I + F++CFGS +G G
Sbjct: 169 --VVFGCETKETGEIYNQEA-DGILGLGNSEVSLVNQLAGSGVIDDVFALCFGSVEGDGA 225
Query: 281 ISFGDKGSPGQGETPFSLRQT-------HPT-YNITITQVSVGGNAVNFE-------FSA 325
+ GD + E +L+ T HP Y++ + + VGG + + +
Sbjct: 226 LMLGDVDA---AEYDVALQYTALLSSLAHPHYYSVQLEALWVGGQQLPVKPERYEEGYGT 282
Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEK----------RETSTSDLPFEYCYVLSP-- 373
+ DSGT+FTYL A+ E ++ A E +E S + + C+ +P
Sbjct: 283 VLDSGTTFTYLPSEAFQLFKEAVSAYALEHGLNSVKGPDPKEKSFAQF-HDICFGGAPHA 341
Query: 374 ---NQTNFE--YPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGV 416
+Q+ E +PV L G P+ + + YCLGV
Sbjct: 342 GHADQSKLEKVFPVFELQF-ADGVRLRTGPLNYLFMHTGEMGAYCLGV 388
>gi|297819684|ref|XP_002877725.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297323563|gb|EFH53984.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 633
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 93/337 (27%), Positives = 147/337 (43%), Gaps = 37/337 (10%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
T + +G P F + +D+GS + ++PC DC C G+ D + P SST
Sbjct: 96 TRLWIGTPPQMFALIVDSGSTVTYVPCSDCEQC--------GKHQDPK-FQPELSSTYQP 146
Query: 166 VPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
V CN + C C Y+ Y ++ + S G L ED++ +S+ R
Sbjct: 147 VKCN-----MDCNCDDDKEQCVYEREY-AEHSSSKGVLGEDLISFGN---ESQLTPQRAV 197
Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISF 283
FGC V+TG A +G+ GLG S+ L ++GLI NSF +C+G G G +
Sbjct: 198 FGCETVETGDLYSQRA-DGIIGLGQGDLSLVDQLVDKGLISNSFGLCYGGMDVGGGSMIL 256
Query: 284 GDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-------EFSAIFDSGTSFTYL 336
G P S P YNI +T + V G ++ E A+ DSGT++ YL
Sbjct: 257 GGFDYPSDMIFTDSDPDRSPYYNIDLTGIRVAGKKLSLNSRVFDGEHGAVLDSGTTYAYL 316
Query: 337 NDPAYTQISETFNSLAKEKRETSTSDLPF-EYCYVLSPNQTNFE----YPVVNLTMKGGG 391
D A+ E ++ D F + C++++ + E +P V + K G
Sbjct: 317 PDAAFAAFEEAVMREVSPLKQIDGPDPNFKDTCFLVAASNDVSELSKIFPSVEMIFKSGQ 376
Query: 392 PFFVNDPIVIVSSEPKGLYLYCLGVVKS--DNVNIIG 426
+ ++ P + K YCLGV + D+ ++G
Sbjct: 377 SWLLS-PENYMFRHSKVHGAYCLGVFPNGKDHTTLLG 412
>gi|297795137|ref|XP_002865453.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297311288|gb|EFH41712.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 665
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 78/257 (30%), Positives = 118/257 (45%), Gaps = 39/257 (15%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
T + +G P F + +DTGS + ++PC C C G+ D + P SS+
Sbjct: 82 TRLWIGTPPQEFALIVDTGSTVTYVPCSTCKQC--------GKHQDPK-FQPELSSSYKA 132
Query: 166 VPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
+ CN C C G C Y+ RY ++ + S+G L ED++ +S+ R
Sbjct: 133 LKCNPD-C----NCDDEGKLCVYERRY-AEMSSSSGVLSEDLISFGN---ESQLTPQRAV 183
Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISF 283
FGC V+TG A +G+ GLG K SV L ++G+I + FS+C+G G G +
Sbjct: 184 FGCENVETGDLFSQRA-DGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGMEVGGGAMVL 242
Query: 284 GDKGSPGQGET-----PFSLRQTHPTYNITITQVSVGGNAV-------NFEFSAIFDSGT 331
G K SP G PF P YNI + Q+ V G ++ N + + DSGT
Sbjct: 243 G-KISPPAGMVFSHSDPFR----SPYYNIDLKQMHVAGKSLKLNPKVFNGKHGTVLDSGT 297
Query: 332 SFTYLNDPAYTQISETF 348
++ Y A+ I +
Sbjct: 298 TYAYFPKEAFIAIKDAI 314
>gi|326491519|dbj|BAJ94237.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326524456|dbj|BAK00611.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 499
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 100/336 (29%), Positives = 144/336 (42%), Gaps = 44/336 (13%)
Query: 108 NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
V +G PA + V DTGSD W+ C CV + ++ P SST + V
Sbjct: 166 TVGLGTPASKYTVVFDTGSDTTWVQCRPCVVKCYKQKEP--------LFDPAKSSTYANV 217
Query: 167 PCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISF 226
C + C G +C Y V+Y DG+ + GF +D L +A D + F
Sbjct: 218 SCTDSACADLDTNGCTGGHCLYAVQY-GDGSYTVGFFAQDTLTIAHDAIKG------FRF 270
Query: 227 GCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS--DGTGRISFG 284
GCG G F A GL GLG KTS+ N+ +F+ C + GTG + FG
Sbjct: 271 GCGEKNNGLFGKTA---GLMGLGRGKTSLTVQAYNK--YGGAFAYCLPALTTGTGYLDFG 325
Query: 285 DKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFSA------IFDSGTSFTYL 336
GS G TP + Y + +T + VGG V S + DSGT T L
Sbjct: 326 -PGSAGNNARLTPMLTDKGQTFYYVGMTGIRVGGQQVPVAESVFSTAGTLVDSGTVITRL 384
Query: 337 NDPAYTQISETFNS--LAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFF 394
AYT +S F+ LA+ ++ + + CY + ++ E P V+L +GG
Sbjct: 385 PATAYTALSSAFDKVMLARGYKKAPGYSI-LDTCYDFT-GLSDVELPTVSLVFQGGACLD 442
Query: 395 VN-DPIVIVSSEPKGLYLYCLGVVKS---DNVNIIG 426
V+ IV SE + CL + ++V I+G
Sbjct: 443 VDVSGIVYAISEAQ----VCLAFASNGDDESVAIVG 474
>gi|242082978|ref|XP_002441914.1| hypothetical protein SORBIDRAFT_08g004800 [Sorghum bicolor]
gi|241942607|gb|EES15752.1| hypothetical protein SORBIDRAFT_08g004800 [Sorghum bicolor]
Length = 429
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 78/275 (28%), Positives = 114/275 (41%), Gaps = 28/275 (10%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
L+Y +++G P + + +DTGSDL WL CD C SC + +Y P +
Sbjct: 65 LYYVAMNIGNPPKPYFLDVDTGSDLTWLQCDAPCRSC---------NKVPHPLYRPTKNK 115
Query: 162 TSSKVPCNSTLC-------ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
VPC LC + +C S C Y ++Y G+ STG LV D L
Sbjct: 116 L---VPCVDQLCASLHNGLNRKHKCDSPYEQCDYVIKYADQGS-STGVLVNDSFALRL-- 169
Query: 215 KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG 274
V ++FGCG Q S + + +G+ GLG S+ S G+ N C
Sbjct: 170 ANGSVVRPSLAFGCGYDQQVSSGEMSPTDGVLGLGTGSVSLLSQFKQHGVTKNVVGHCLS 229
Query: 275 SDGTGRISFGDKGSPGQ--GETPFSLRQTHPTYNITITQVSVGGNAVNFEFS-AIFDSGT 331
G G + FGD P Q TP Y+ + G ++ + + +FDSG+
Sbjct: 230 LRGGGFLFFGDDLVPYQRVTWTPMVRSPLRNYYSPGSASLYFGDQSLRVKLTEVVFDSGS 289
Query: 332 SFTYLNDPAYTQISETFN-SLAKEKRETSTSDLPF 365
SFTY Y + L++ +E S LP
Sbjct: 290 SFTYFAAQPYQALVTALKGDLSRTLKEVSDPSLPL 324
>gi|224130234|ref|XP_002328687.1| predicted protein [Populus trichocarpa]
gi|222838863|gb|EEE77214.1| predicted protein [Populus trichocarpa]
Length = 603
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 78/274 (28%), Positives = 117/274 (42%), Gaps = 33/274 (12%)
Query: 114 PALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNST 171
P + + DTGSDL W+ CD C SC G N+ Y P + VP
Sbjct: 199 PPQPYYLDFDTGSDLTWIQCDAPCTSCAKGANA---------WYKPRRGNI---VPPKDL 246
Query: 172 LCELQKQCPSAG-----SNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISF 226
LC ++ AG C Y++ Y +D + S G L D L L ++ F
Sbjct: 247 LCMEVQRNQKAGYCETCDQCDYEIEY-ADHSSSMGVLATDKLLLMVANGSLTKLN--FIF 303
Query: 227 GCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD--GTGRISF 283
GC Q G L +G+ GL K S+PS LA+QG+I N C +D G G +
Sbjct: 304 GCAYDQQGLLLKTLVKTDGILGLSRAKVSLPSQLASQGIINNVIGHCLTTDLGGGGYMFL 363
Query: 284 GDKGSP--GQGETPFSLRQTHPTYNITITQVSVGGNAVNF------EFSAIFDSGTSFTY 335
GD P G P + Y+ + +++ G + ++ +FDSG+S+TY
Sbjct: 364 GDDFVPRWGMAWVPMLDSPSMEFYHTEVVKLNYGSSPLSLGGMESRVKHILFDSGSSYTY 423
Query: 336 LNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
AY+++ + N ++ STSD C+
Sbjct: 424 FPKEAYSELVASLNEVSGAGLVQSTSDTTLPLCW 457
>gi|30699261|ref|NP_850981.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|17065172|gb|AAL32740.1| nucellin-like protein [Arabidopsis thaliana]
gi|24899795|gb|AAN65112.1| nucellin-like protein [Arabidopsis thaliana]
gi|332197863|gb|AEE35984.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 466
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 80/277 (28%), Positives = 115/277 (41%), Gaps = 36/277 (12%)
Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPN 158
LG+ +Y +++G P F + +DTGSDL W+ CD C C Y PN
Sbjct: 64 LGY-YYVLLNIGNPPKLFDLDIDTGSDLTWVQCDAPCNGCTKPRAKQ---------YKPN 113
Query: 159 TSSTSSKVPCNSTLCE-----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
++ +PC+ LC + C C Y++ Y SD S G LV D + L
Sbjct: 114 HNT----LPCSHILCSGLDLPQDRPCADPEDQCDYEIGY-SDHASSIGALVTDEVPLKL- 167
Query: 214 EKQSKSVDSRISFGCG-RVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC 272
++ R++FGCG Q G+ GLG K + + L + G+ N C
Sbjct: 168 -ANGSIMNLRLTFGCGYDQQNPGPHPPPPTAGILGLGRGKVGLSTQLKSLGITKNVIVHC 226
Query: 273 FGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYN-------ITITQVSVGGNAVNFEFSA 325
G G +S GD+ P G T SL P+ N + + G +N
Sbjct: 227 LSHTGKGFLSIGDELVPSSGVTWTSLATNSPSKNYMAGPAELLFNDKTTGVKGIN----V 282
Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSD 362
+FDSG+S+TY N AY I + K T T D
Sbjct: 283 VFDSGSSYTYFNAEAYQAILDLIRKDLNGKPLTDTKD 319
>gi|242093566|ref|XP_002437273.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
gi|241915496|gb|EER88640.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
Length = 503
Score = 93.2 bits (230), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 86/300 (28%), Positives = 129/300 (43%), Gaps = 32/300 (10%)
Query: 109 VSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
+ +G PA F V DTGSD W+ C CV+ + +++P S+T + +
Sbjct: 169 IRLGTPAARFTVVFDTGSDTTWVQCQPCVAYCYQQKEP--------LFTPTKSATYANIS 220
Query: 168 CNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFG 227
C S+ C +G +C Y V+Y DG+ + GF +D L L D + FG
Sbjct: 221 CTSSYCSDLDTRGCSGGHCLYAVQY-GDGSYTVGFYAQDTLTLGYDTVKD------FRFG 273
Query: 228 CGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--GSDGTGRISFGD 285
CG G F A GL GLG KTSVP ++ F+ C S GTG + FG
Sbjct: 274 CGEKNRGLFGKAA---GLMGLGRGKTSVPVQAYDK--YSGVFAYCIPATSSGTGFLDFGP 328
Query: 286 KGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNF------EFSAIFDSGTSFTYLN 337
TP + Y + +T + VGG+ ++ + A+ DSGT T L
Sbjct: 329 GAPAAANARLTPMLVDNGPTFYYVGMTGIKVGGHLLSIPATVFSDAGALVDSGTVITRLP 388
Query: 338 DPAYTQISETF-NSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVN 396
AY + F + +T+ + + CY L+ Q + P V+L +GG V+
Sbjct: 389 PSAYEPLRSAFAKGMEGLGYKTAPAFSILDTCYDLTGYQGSIALPAVSLVFQGGACLDVD 448
>gi|302757235|ref|XP_002962041.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
gi|300170700|gb|EFJ37301.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
Length = 367
Score = 93.2 bits (230), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 97/320 (30%), Positives = 145/320 (45%), Gaps = 59/320 (18%)
Query: 108 NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
+ +G P F +DTGSDL W+ C C C + IY P+ SST +K
Sbjct: 7 EIELGSPPKKFNAIVDTGSDLVWIQCKPCSQCYSQSDP---------IYDPSASSTFAKT 57
Query: 167 PCNSTLCEL--QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRI 224
C+++ C+ C S+ C Y +Y D + + G + L L + SK+
Sbjct: 58 SCSTSSCQSLPASGCSSSAKTCIYGYQY-GDSSSTQGDFALETLTLRSSGGSSKAFP-NF 115
Query: 225 SFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-----GSDGTG 279
FGCGR+ +GSF GAA G+ GLG K S+ + L + I N FS C S T
Sbjct: 116 QFGCGRLNSGSF-GGAA--GIVGLGQGKISLSTQLGSA--INNKFSYCLVDFDDDSSKTS 170
Query: 280 RISFGDKGSPGQGE-----TPFSLRQTHPTYNITITQVSVGGNAVNFEFSA--------- 325
+ FG S G G P S R T+ Y + + +SVGG ++ A
Sbjct: 171 PLIFGSSASTGSGAISTPIIPNSGRSTY--YFVGLEGISVGGKQLSLATRAIDFLSVRSK 228
Query: 326 ---------------IFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEYCY 369
IFDSGT+ T L+D Y+++ F +S++ + S+S F+ CY
Sbjct: 229 KKLRVRALEVNSGGTIFDSGTTLTLLDDAVYSKVKSAFASSVSLPTVDASSSG--FDLCY 286
Query: 370 VLSPNQTNFEYPVVNLTMKG 389
+S ++ NF++P + L KG
Sbjct: 287 DVSKSK-NFKFPALTLAFKG 305
>gi|255558640|ref|XP_002520345.1| nucellin, putative [Ricinus communis]
gi|223540564|gb|EEF42131.1| nucellin, putative [Ricinus communis]
Length = 424
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 99/346 (28%), Positives = 139/346 (40%), Gaps = 39/346 (11%)
Query: 104 LHYTNVS--VGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNT 159
L Y +VS +G P F + +DTGSDL W+ CD C C L+ ++Y P
Sbjct: 64 LGYYSVSLYIGNPPKLFELDIDTGSDLTWVQCDAPCTGCTKPLH---------HLYKPRN 114
Query: 160 SSTSSKVP-CNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
+ S P C++ QC SA C Y+++Y +G+ S G LV D L
Sbjct: 115 NLLSCIDPLCSAVQNSGTYQCQSATDQCDYEIQYADEGS-SLGVLVTDYFPLRL--MNGS 171
Query: 219 SVDSRISFGCGRVQTGSFLDGAAPN-GLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDG 277
+ +++FGCG Q P G+ GLG KTS+ S L G++ N C G
Sbjct: 172 FLRPKMTFGCGYDQKSPGPVAPPPTTGVLGLGNGKTSIISQLQALGVMGNVIGHCLSRKG 231
Query: 278 TGRISFGDKGSP--GQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA-IFDSGTSFT 334
G + FG P G P S + Y ++ GG + IFDSG+S+T
Sbjct: 232 GGFLFFGQDPVPSFGISWAPMSQKSLDKYYASGPAELLYGGKPTGTKAEEFIFDSGSSYT 291
Query: 335 YLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPF- 393
Y N Y T N + KE D P E + T + VN PF
Sbjct: 292 YFNAQVY---QSTLNLIRKELSGKPLRDAPEEKALAICWKGTK-RFKSVNEVKSYFKPFA 347
Query: 394 --FVNDPIVIVSSEPKGLYL------YCLGVVKSD-----NVNIIG 426
F V + P+ + CLG++ N N+IG
Sbjct: 348 LSFTKAKSVQLQIPPEDYLIVTNDGNVCLGILNGSEVGLGNFNVIG 393
>gi|30699263|ref|NP_177872.3| aspartyl protease-like protein [Arabidopsis thaliana]
gi|332197862|gb|AEE35983.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 432
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 80/277 (28%), Positives = 115/277 (41%), Gaps = 36/277 (12%)
Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPN 158
LG+ +Y +++G P F + +DTGSDL W+ CD C C Y PN
Sbjct: 64 LGY-YYVLLNIGNPPKLFDLDIDTGSDLTWVQCDAPCNGCTKPRAKQ---------YKPN 113
Query: 159 TSSTSSKVPCNSTLCE-----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
++ +PC+ LC + C C Y++ Y SD S G LV D + L
Sbjct: 114 HNT----LPCSHILCSGLDLPQDRPCADPEDQCDYEIGY-SDHASSIGALVTDEVPLKL- 167
Query: 214 EKQSKSVDSRISFGCG-RVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC 272
++ R++FGCG Q G+ GLG K + + L + G+ N C
Sbjct: 168 -ANGSIMNLRLTFGCGYDQQNPGPHPPPPTAGILGLGRGKVGLSTQLKSLGITKNVIVHC 226
Query: 273 FGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYN-------ITITQVSVGGNAVNFEFSA 325
G G +S GD+ P G T SL P+ N + + G +N
Sbjct: 227 LSHTGKGFLSIGDELVPSSGVTWTSLATNSPSKNYMAGPAELLFNDKTTGVKGIN----V 282
Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSD 362
+FDSG+S+TY N AY I + K T T D
Sbjct: 283 VFDSGSSYTYFNAEAYQAILDLIRKDLNGKPLTDTKD 319
>gi|414887402|tpg|DAA63416.1| TPA: hypothetical protein ZEAMMB73_414910 [Zea mays]
Length = 407
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 99/391 (25%), Positives = 167/391 (42%), Gaps = 46/391 (11%)
Query: 51 LPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVS 110
LP S+ S LA R RG G A N + L + Y + T +
Sbjct: 47 LPLTRSYPNASRLAASLR----RGLGDGAHPNARMRLHDDLLTNGY--------YTTRLY 94
Query: 111 VGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNS 170
+G P F + +D+GS + ++PC SC N + + P+ SS+ S V CN
Sbjct: 95 IGTPPQEFALIVDSGSTVTYVPC--ASCEQCGNHQDPR------FQPDLSSSYSPVKCN- 145
Query: 171 TLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGR 230
+ C S C Y+ +Y ++ + S+G L ED++ ++S+ R FGC
Sbjct: 146 ----VDCTCDSDKKQCTYERQY-AEMSSSSGVLGEDIVSFG---RESELKAQRAVFGCEN 197
Query: 231 VQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPG 290
+TG A +G+ GLG + S+ L +G+I +SFS+C+G G + G P
Sbjct: 198 SETGDLFSQHA-DGIMGLGRGQLSIMDQLVEKGVINDSFSLCYGGMDIGGGAMVLGGVPT 256
Query: 291 QGETPFSLRQ--THPTYNITITQVSVGGNAVNFE-------FSAIFDSGTSFTYLNDPAY 341
+ FS P YNI + ++ V G A+ + + DSGT++ YL + A+
Sbjct: 257 PSDMVFSRSDPLRSPYYNIELKEIHVAGKALRVDSRIFDSKHGTVLDSGTTYAYLPEQAF 316
Query: 342 TQISETFNSLAKEKRETSTSDLPF-EYCYV---LSPNQTNFEYPVVNLTMKGGGPFFVND 397
+ S ++ D + + C+ + ++ + +P V++ G G
Sbjct: 317 MAFKDAVTSKVHSLKKIRGPDPSYKDICFAGARRNVSKLHEVFPDVDMVF-GNGQKLSLT 375
Query: 398 PIVIVSSEPKGLYLYCLGVVKS--DNVNIIG 426
P + K YCLGV ++ D ++G
Sbjct: 376 PENYLFRHSKVDGAYCLGVFQNGKDPTTLLG 406
>gi|294461400|gb|ADE76261.1| unknown [Picea sitchensis]
Length = 165
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 53/127 (41%), Positives = 69/127 (54%), Gaps = 12/127 (9%)
Query: 29 TFGFDFHHRYSDPVKGI------LAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGN 82
++ +H++S+ VK L D P +GS YY AL H D GR LA
Sbjct: 27 SYSLQMYHKFSNEVKEWMTWRHGLDTDGWPVEGSNEYYKALYHHDS--ARHGRKLA---- 80
Query: 83 DKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGL 142
D LTF GN+T + LGFL Y+ V VG P ++ VALDTGSD+FW+PCDC +C
Sbjct: 81 DHPSLTFLEGNETVEIPQLGFLFYSMVQVGTPNVTLFVALDTGSDVFWVPCDCQACAPTS 140
Query: 143 NSSSGQV 149
+S G V
Sbjct: 141 AASYGLV 147
>gi|148910602|gb|ABR18371.1| unknown [Picea sitchensis]
Length = 446
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 84/288 (29%), Positives = 125/288 (43%), Gaps = 38/288 (13%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
L+Y + VG P+ + + +D+GS+L W+ CD C+SC G + +Y S
Sbjct: 78 LYYVTMLVGNPSKPYFLDVDSGSELTWIQCDAPCISCAKGPHP---------LYKLKKGS 128
Query: 162 TSSKVPCNSTLCELQK-------QCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
VP LC + A C Y V Y +D S GFLV D +
Sbjct: 129 L---VPSKDPLCAAVQAGSGHYHNHKEASQRCDYDVAY-ADHGYSEGFLVRDSVRALLTN 184
Query: 215 KQSKSVDSRISFGCGRVQTGSF-LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
K + +S FGCG Q S + A +G+ GLG S+PS A QGLI N C
Sbjct: 185 KTVLTANS--VFGCGYNQRESLPVSDARTDGILGLGSGMASLPSQWAKQGLIKNVIGHCI 242
Query: 274 ---GSDGTGRISFGDK--GSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA--- 325
G DG G + FGD + P R + Y + Q++ G ++ +
Sbjct: 243 FGAGRDG-GYMFFGDDLVSTSAMTWVPMLGRPSIKHYYVGAAQMNFGNKPLDKDGDGKKL 301
Query: 326 ---IFDSGTSFTYLNDPAYTQ-ISETFNSLAKEKRETSTSDLPFEYCY 369
IFDSG+++TY + AY +S +L+ ++ E +SD C+
Sbjct: 302 GGIIFDSGSTYTYFTNQAYGAFLSVVKENLSGKQLEQDSSDSFLSLCW 349
>gi|302822373|ref|XP_002992845.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
gi|300139393|gb|EFJ06135.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
Length = 510
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 103/371 (27%), Positives = 159/371 (42%), Gaps = 61/371 (16%)
Query: 98 LNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYS 156
L G +Y + +G PA+ ++ +DTGSD+ W+ C C CV L ++
Sbjct: 131 LGQAGLEYYVPLQLGTPAVEVVLIMDTGSDVSWIQCVPCKDCVPALRPP---------FN 181
Query: 157 PNTSSTSSKVPCNSTLCE-----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLA 211
P SS+ K+PC S+ C ++ C +G C + ++Y DG++S+G L + +
Sbjct: 182 PRHSSSFFKLPCASSTCTNVYQGVKPFCSPSGRTCLFSIQY-GDGSLSSGLLAMETIAGN 240
Query: 212 T----DEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPN 267
T D + K S I+ GC + GA+ GL G+ S PS L+++
Sbjct: 241 TPNFGDGEPVKL--SNITLGCADIDREGLPTGAS--GLLGMDRRPISFPSQLSSR--YAR 294
Query: 268 SFSMCFGS-----DGTGRISFG--DKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAV- 319
FS CF + +G + FG D SP TP P+ ++ V + G +V
Sbjct: 295 KFSHCFPDKIAHLNSSGLVFFGESDIISPYLRYTPLVQNPAVPSASLDYYYVGLVGISVD 354
Query: 320 ---------NFEFS-------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDL 363
NF+ I DSGT+FTYL PA+ + F LA+ D
Sbjct: 355 ESRLPLSHKNFDIDKVTGSGGTIIDSGTAFTYLKKPAFQAMRREF--LARTSHLAKVDDN 412
Query: 364 P-FEYCYVLSPNQTNFE---YPVVNLTMKGGGPFFV--NDPIVIVSSEPKGLYLYCLGVV 417
F CY ++ E P + L +GG + N ++ VSS + L CL
Sbjct: 413 SGFTPCYNITSGTAALESTILPSITLHFRGGLDVVLPKNSILIPVSSSEEQTTL-CLAFQ 471
Query: 418 KSDNV--NIIG 426
S ++ NIIG
Sbjct: 472 MSGDIPFNIIG 482
>gi|357154085|ref|XP_003576664.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 509
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 99/314 (31%), Positives = 134/314 (42%), Gaps = 36/314 (11%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFN-IYSPNTSSTS 163
+ +V +G PA V DTGSDL W+ C G SS G + +++P+ SST
Sbjct: 154 YVVSVGLGTPARDLTVVFDTGSDLSWVQC-------GPCSSGGCYKQQDPLFAPSDSSTF 206
Query: 164 SKVPCNSTLCELQKQCPSAGSN--CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV- 220
S V C + C ++ C + + CPY+V Y D + + G L D L L T + S
Sbjct: 207 SAVRCGARECRARQSCGGSPGDDRCPYEVVY-GDKSRTQGHLGNDTLTLGTMAPANASAE 265
Query: 221 -DSRIS---FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--- 273
D+++ FGCG TG F +GLFGLG K S+ S A G FS C
Sbjct: 266 NDNKLPGFVFGCGENNTGLF---GQADGLFGLGRGKVSLSSQAA--GKFGEGFSYCLPSS 320
Query: 274 GSDGTGRISFGDK-GSPGQGE-TPFSLRQTHPT-YNITITQVSVGGNAVNFE-----FSA 325
S G +S G +P + TP R T P+ Y + + + V G A+
Sbjct: 321 SSSAPGYLSLGTPVPAPAHAQFTPMLNRTTTPSFYYVKLVGIRVAGRAIRVSSPRVALPL 380
Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKE---KRETSTSDLPFEYCYVLSPNQTNFEYPV 382
I DSGT T L AY + F S + KR S L Y + N T P
Sbjct: 381 IVDSGTVITRLAPRAYRALRAAFLSAMGKYGYKRAPRLSILDTCYDFTAHANAT-VSIPA 439
Query: 383 VNLTMKGGGPFFVN 396
V L GG V+
Sbjct: 440 VALVFAGGATISVD 453
>gi|15229663|ref|NP_190574.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|6522926|emb|CAB62113.1| putative protein [Arabidopsis thaliana]
gi|53828539|gb|AAU94379.1| At3g50050 [Arabidopsis thaliana]
gi|55733749|gb|AAV59271.1| At3g50050 [Arabidopsis thaliana]
gi|332645100|gb|AEE78621.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 632
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 93/337 (27%), Positives = 146/337 (43%), Gaps = 37/337 (10%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
T + +G P F + +D+GS + ++PC DC C G+ D + P SST
Sbjct: 95 TRLWIGTPPQMFALIVDSGSTVTYVPCSDCEQC--------GKHQDPK-FQPEMSSTYQP 145
Query: 166 VPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
V CN + C C Y+ Y ++ + S G L ED++ +S+ R
Sbjct: 146 VKCN-----MDCNCDDDREQCVYEREY-AEHSSSKGVLGEDLISFGN---ESQLTPQRAV 196
Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISF 283
FGC V+TG A +G+ GLG S+ L ++GLI NSF +C+G G G +
Sbjct: 197 FGCETVETGDLYSQRA-DGIIGLGQGDLSLVDQLVDKGLISNSFGLCYGGMDVGGGSMIL 255
Query: 284 GDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-------EFSAIFDSGTSFTYL 336
G P S P YNI +T + V G ++ E A+ DSGT++ YL
Sbjct: 256 GGFDYPSDMVFTDSDPDRSPYYNIDLTGIRVAGKQLSLHSRVFDGEHGAVLDSGTTYAYL 315
Query: 337 NDPAYTQISETFNSLAKEKRETSTSDLPF-EYCYVLSPNQTNFE----YPVVNLTMKGGG 391
D A+ E ++ D F + C+ ++ + E +P V + K G
Sbjct: 316 PDAAFAAFEEAVMREVSTLKQIDGPDPNFKDTCFQVAASNYVSELSKIFPSVEMVFKSGQ 375
Query: 392 PFFVNDPIVIVSSEPKGLYLYCLGVVKS--DNVNIIG 426
+ ++ P + K YCLGV + D+ ++G
Sbjct: 376 SWLLS-PENYMFRHSKVHGAYCLGVFPNGKDHTTLLG 411
>gi|15232503|ref|NP_191008.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|7288018|emb|CAB81805.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|17979257|gb|AAL49945.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
gi|21700851|gb|AAM70549.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
gi|332645705|gb|AEE79226.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 425
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 88/308 (28%), Positives = 137/308 (44%), Gaps = 55/308 (17%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ ++G PA +VALDT +D W+PC CV C + ++ P+ SS+S
Sbjct: 88 YIVRANIGTPAQPMLVALDTSNDAAWIPCSGCVGCSSSV-----------LFDPSKSSSS 136
Query: 164 SKVPCNSTLCELQKQCP----SAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
+ C + C KQ P + +C + + Y G+ +L +D L LA+D
Sbjct: 137 RTLQCEAPQC---KQAPNPSCTVSKSCGFNMTY--GGSTIEAYLTQDTLTLASD------ 185
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GS 275
V +FGC +G+ L GL GLG S+ I +Q L ++FS C S
Sbjct: 186 VIPNYTFGCINKASGTSLPA---QGLMGLGRGPLSL--ISQSQNLYQSTFSYCLPNSKSS 240
Query: 276 DGTGRISFGDKGSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEFSA-------- 325
+ +G + G K P + +T L+ + Y + + + VG V+ SA
Sbjct: 241 NFSGSLRLGPKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATG 300
Query: 326 ---IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY---VLSPNQTNFE 379
IFDSGT +T L +PAY + F K TS F+ CY V+ P+ T F
Sbjct: 301 AGTIFDSGTVYTRLVEPAYVAVRNEFRRRVKNANATSLGG--FDTCYSGSVVFPSVT-FM 357
Query: 380 YPVVNLTM 387
+ +N+T+
Sbjct: 358 FAGMNVTL 365
>gi|21617933|gb|AAM66983.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
Length = 425
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 88/308 (28%), Positives = 137/308 (44%), Gaps = 55/308 (17%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ ++G PA +VALDT +D W+PC CV C + ++ P+ SS+S
Sbjct: 88 YIVRANIGTPAQPMLVALDTSNDAAWIPCSGCVGCSSSV-----------LFDPSKSSSS 136
Query: 164 SKVPCNSTLCELQKQCP----SAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
+ C + C KQ P + +C + + Y G+ +L +D L LA+D
Sbjct: 137 RTLQCEAPQC---KQAPNPSCTVSKSCGFNMTY--GGSTIEAYLTQDTLTLASD------ 185
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GS 275
V +FGC +G+ L GL GLG S+ I +Q L ++FS C S
Sbjct: 186 VIPNYTFGCINKASGTSLPA---QGLMGLGRGPLSL--ISQSQNLYQSTFSYCLPNSKSS 240
Query: 276 DGTGRISFGDKGSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEFSA-------- 325
+ +G + G K P + +T L+ + Y + + + VG V+ SA
Sbjct: 241 NFSGSLRLGPKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATG 300
Query: 326 ---IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY---VLSPNQTNFE 379
IFDSGT +T L +PAY + F K TS F+ CY V+ P+ T F
Sbjct: 301 AGTIFDSGTVYTRLVEPAYVAVRNEFRRRVKNANATSLGG--FDTCYSGSVVFPSVT-FM 357
Query: 380 YPVVNLTM 387
+ +N+T+
Sbjct: 358 FAGMNVTL 365
>gi|255585473|ref|XP_002533429.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223526717|gb|EEF28949.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 448
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 96/341 (28%), Positives = 152/341 (44%), Gaps = 59/341 (17%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L N S+GQPA + +DTGS++ W+ C C C +G ++D P+ SST
Sbjct: 98 LFLVNFSMGQPATPQLAIMDTGSNILWVRCAPCKRCTQ----QNGPLLD-----PSKSST 148
Query: 163 SSKVPCNSTLCELQKQCPSAGSN----CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
+ +PC +T+C PSA N C Y + Y + G S G L + L + ++
Sbjct: 149 YASLPCTNTMCHY---APSAYCNRLNQCGYNLSY-ATGLSSAGVLATEQLIFHSSDEGVN 204
Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD-- 276
+V S + FGC + G + D G+FGLG TS + + ++ FS C G+
Sbjct: 205 AVPS-VVFGCSH-ENGDYKDRRF-TGVFGLGKGITSFVTRMGSK------FSYCLGNIAD 255
Query: 277 ---GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF----------EF 323
G ++ FG+K + TP + H Y +T+ +SVG ++ E
Sbjct: 256 PHYGYNQLVFGEKANFEGYSTPLKVVNGH--YYVTLEGISVGEKRLDIDSTAFSMKGNEK 313
Query: 324 SAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDL-PFEY----CYVLSPNQTNF 378
SA+ DSGT+ T+L + A F +L E R+ L PF CY + +Q
Sbjct: 314 SALIDSGTALTWLAESA-------FRALDNEVRQLLDGVLMPFWRGSFACYKGTVSQDLI 366
Query: 379 EYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKS 419
+PVV GG ++ + + P L C+ V ++
Sbjct: 367 GFPVVTFHFSGGADLDLDTESMFYQATPDIL---CIAVRQA 404
>gi|357167693|ref|XP_003581287.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 468
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 101/346 (29%), Positives = 148/346 (42%), Gaps = 53/346 (15%)
Query: 108 NVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
++S+G PAL++ +DTGSDL W C CV N S+ ++ P++SST S +P
Sbjct: 121 DMSIGTPALAYAAIVDTGSDLVW--TQCKPCVECFNQST------PVFDPSSSSTYSTLP 172
Query: 168 CNSTLCE--LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
C+S+LC C SA +C Y Y D + + G L + LA K+ ++
Sbjct: 173 CSSSLCSDLPTSTCTSAAKDCGYTYTY-GDASSTQGVLAAETFTLA------KTKLPGVA 225
Query: 226 FGCGRVQTGS-FLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGTGR--- 280
FGCG G F GA GL GLG S+ S L GL FS C S D T +
Sbjct: 226 FGCGDTNEGDGFTQGA---GLVGLGRGPLSLVSQL---GL--GKFSYCLTSLDDTSKSPL 277
Query: 281 -------ISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA------- 325
IS + TP + P+ Y +T+ ++VG + SA
Sbjct: 278 LLGSLAAISTDTASAAAIQTTPLIKNPSQPSFYYVTLKALTVGSTRIPLPGSAFAVQDDG 337
Query: 326 ----IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQT-NFEY 380
I DSGTS TYL Y + + F + K ++ + + C+ + + E
Sbjct: 338 TGGVIVDSGTSITYLELQGYRPLKKAFAAQMKLPVADGSA-VGLDLCFKAPASGVDDVEV 396
Query: 381 PVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
P + L GG + +V G CL V+ S ++IIG
Sbjct: 397 PKLVLHFDGGADLDLPAENYMVLDSASG--ALCLTVMGSRGLSIIG 440
>gi|297825301|ref|XP_002880533.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
lyrata]
gi|297326372|gb|EFH56792.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
lyrata]
Length = 430
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 86/305 (28%), Positives = 127/305 (41%), Gaps = 37/305 (12%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L + N SVGQP + +DTGS L W+ C C C SS +I +++P SST
Sbjct: 67 LFFVNFSVGQPPVPQFTIMDTGSSLLWIQCHPCKHC------SSNHMIH-PVFNPALSST 119
Query: 163 SSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
+ C+ C + + C Y+ Y+S GT S G L ++ L T + V
Sbjct: 120 FVECSCDDRFCRYAPNGHCSSNKCVYEQVYIS-GTGSKGVLAKERLTFTTPNGNT-VVTQ 177
Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG-----SDG 277
I+FGCG + G L+ G+ GLG TS+ L ++ FS C G + G
Sbjct: 178 PIAFGCGH-ENGEQLESEF-TGILGLGAKPTSLAVQLGSK------FSYCIGDLANKNYG 229
Query: 278 TGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE----------FSAIF 327
++ G+ TP + Y + + +SVG +N E I
Sbjct: 230 YNQLVLGEDADILGDPTPIEFETENGIYYMNLEGISVGDKQLNIEPVVFKRRGSRTGVIL 289
Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETST-SDLPFEYCYVLSPNQTNFEYPVVNLT 386
D+GT +T+L D AY ++ S+ K E D CY N+ +PVV
Sbjct: 290 DTGTLYTWLADIAYRELYNEIKSILDPKLERFWFRDF---LCYHGRVNEELIGFPVVTFH 346
Query: 387 MKGGG 391
GG
Sbjct: 347 FAGGA 351
>gi|224142007|ref|XP_002324352.1| predicted protein [Populus trichocarpa]
gi|222865786|gb|EEF02917.1| predicted protein [Populus trichocarpa]
Length = 460
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 104/335 (31%), Positives = 149/335 (44%), Gaps = 43/335 (12%)
Query: 71 RLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYT-NVSVGQPALSFIVALDTGSDLF 129
RL RG+ + T L +G S+G Y V +G P F + DTGSD+
Sbjct: 91 RLSSRGMFPE-KQATTLPVQSGA------SIGAGDYVVTVGLGTPKKEFTLIFDTGSDIT 143
Query: 130 WLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCEL-------QKQCPS 181
W C+ CV + +P+TS++ + C+S LC+L + C S
Sbjct: 144 WTQCEPCVKTCYKQKEPR--------LNPSTSTSYKNISCSSALCKLVASGKKFSQSCSS 195
Query: 182 AGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAA 241
S C YQV+Y DG+ S GF + L L+ S +V FGCG+ G F A
Sbjct: 196 --STCLYQVQY-GDGSYSIGFFATETLTLS-----SSNVFKNFLFGCGQQNNGLFGGAAG 247
Query: 242 PNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLR-Q 300
GL K ++PS A S+ + S G +S G + S TP S
Sbjct: 248 LLGLG---RTKLALPSQTAKTYKKLFSYCLPASSSSKGYLSLGGQVSKSVKFTPLSADFD 304
Query: 301 THPTYNITITQVSVGGNAVNFEFSA-----IFDSGTSFTYLNDPAYTQISETFNSLAKEK 355
+ P Y + IT +SVGG ++ + SA + DSGT T L+ AY+++S F +L +
Sbjct: 305 STPFYGLDITGLSVGGRKLSIDESAFSAGTVIDSGTVITRLSPTAYSELSSAFQNLMTDY 364
Query: 356 RETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGG 390
TS + F+ CY S T P V +T KGG
Sbjct: 365 PSTSGYSI-FDTCYDFSKYDT-VRIPKVGVTFKGG 397
>gi|242045120|ref|XP_002460431.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
gi|241923808|gb|EER96952.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
Length = 481
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 91/305 (29%), Positives = 125/305 (40%), Gaps = 33/305 (10%)
Query: 101 LGFLHY-TNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPN 158
LG +Y +V +G P +V DTGSDL W+ C C C + ++ P+
Sbjct: 133 LGTANYIVSVGLGTPKRDLLVVFDTGSDLSWVQCKPCDGCYQQHDP---------LFDPS 183
Query: 159 TSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
S+T S VPC + C + C Y+V Y D + + G L D L L S
Sbjct: 184 QSTTYSAVPCGAQECRRLDSGSCSSGKCRYEVVY-GDMSQTDGNLARDTLTLGPSSSSSS 242
Query: 219 SVD-SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDG 277
S FGCG TG F +GLFGLG D+ S+ S A + FS C S
Sbjct: 243 SDQLQEFVFGCGDDDTGLF---GKADGLFGLGRDRVSLASQAAAK--YGAGFSYCLPSSS 297
Query: 278 T--GRISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA------IFD 328
T G +S G P T R P+ Y + + + V G V + + D
Sbjct: 298 TAEGYLSLGSAAPPNARFTAMVTRSDTPSFYYLNLVGIKVAGRTVRVSPAVFRTPGTVID 357
Query: 329 SGTSFTYLNDPAYTQISETFNSLAKE---KRETSTSDLPFEYCYVLSPNQTNFEYPVVNL 385
SGT T L AY + +F L + KR + S L + CY + + + P V L
Sbjct: 358 SGTVITRLPSRAYAALRSSFAGLMRRYSYKRAPALSIL--DTCYDFT-GRNKVQIPSVAL 414
Query: 386 TMKGG 390
GG
Sbjct: 415 LFDGG 419
>gi|224142005|ref|XP_002324351.1| predicted protein [Populus trichocarpa]
gi|222865785|gb|EEF02916.1| predicted protein [Populus trichocarpa]
Length = 472
Score = 92.0 bits (227), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 104/337 (30%), Positives = 150/337 (44%), Gaps = 43/337 (12%)
Query: 69 YFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYT-NVSVGQPALSFIVALDTGSD 127
+ RL RG+ + T L +G S+G Y V +G P F + DTGSD
Sbjct: 101 HARLSSRGMFPE-KQATTLPVQSGA------SIGAGDYVVTVGLGTPKKEFTLIFDTGSD 153
Query: 128 LFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCEL-------QKQC 179
+ W C+ CV + +P+TS++ + C+S LC+L + C
Sbjct: 154 ITWTQCEPCVKTCYKQKEPR--------LNPSTSTSYKNISCSSALCKLVASGKKFSQSC 205
Query: 180 PSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDG 239
S S C YQV+Y DG+ S GF + L L+ S +V FGCG+ G F
Sbjct: 206 SS--STCLYQVQY-GDGSYSIGFFATETLTLS-----SSNVFKNFLFGCGQQNNGLFGGA 257
Query: 240 AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLR 299
A GL K ++PS A S+ + S G +S G + S TP S
Sbjct: 258 AGLLGLG---RTKLALPSQTAKTYKKLFSYCLPASSSSKGYLSLGGQVSKSVKFTPLSAD 314
Query: 300 -QTHPTYNITITQVSVGGNAVNFEFSA-----IFDSGTSFTYLNDPAYTQISETFNSLAK 353
+ P Y + IT +SVGG ++ + SA + DSGT T L+ AY+++S F +L
Sbjct: 315 FDSTPFYGLDITGLSVGGRKLSIDESAFSAGTVIDSGTVITRLSPTAYSELSSAFQNLMT 374
Query: 354 EKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGG 390
+ TS + F+ CY S T P V +T KGG
Sbjct: 375 DYPSTSGYSI-FDTCYDFSKYDT-VRIPKVGVTFKGG 409
>gi|357520119|ref|XP_003630348.1| Aspartic proteinase Asp1 [Medicago truncatula]
gi|355524370|gb|AET04824.1| Aspartic proteinase Asp1 [Medicago truncatula]
Length = 435
Score = 91.7 bits (226), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 99/361 (27%), Positives = 159/361 (44%), Gaps = 40/361 (11%)
Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPN 158
+GF + T +++GQP + + +DTGS+L WL CD C C + +Y P+
Sbjct: 71 VGFYNVT-LNIGQPPRPYFLDVDTGSELTWLQCDAPCSQCSETPHP---------LYKPS 120
Query: 159 TSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLA-TDEKQS 217
K P ++L + C Y+++Y +D + G L+ DV L T+ Q
Sbjct: 121 NDFIPCKDPLCASLQPTDDYTCEDPNQCDYEIKY-ADQYSTLGVLLNDVYLLNFTNGVQL 179
Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDG 277
K R++ GCG Q S +G+ GLG K S+ S L +QGL+ N C S G
Sbjct: 180 KV---RMALGCGYDQIFSPSTYHPLDGILGLGRGKASLISQLNSQGLVRNVMGHCLSSRG 236
Query: 278 TGRISFGDK-GSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-EFSAIFDSGTSFTY 335
G I FG+ S TP S + Y+ ++ GG + IFD+G+S+TY
Sbjct: 237 GGYIFFGNVYDSSRMSWTPISSIDSGKHYSAGPAELVFGGRKTGVGSLNIIFDTGSSYTY 296
Query: 336 LNDPAYTQISETFN-SLAKEKRETSTSDLPFEYCY-----VLSPNQTNFEYPVVNLTMKG 389
N AY + N L ++ + + D C+ S N+ + + L+
Sbjct: 297 FNSQAYQAMISLLNKELHRKPIKAAPDDQTLPMCWHGKRPFRSINEVKKYFKPLTLSFTN 356
Query: 390 GG---PFFVNDP--IVIVSSEPKGLYLYCLGVVKSDNV-----NIIGREYPIANNISLFH 439
GG P F P +I+S+ + CLG++ V N+IG + + + + +F
Sbjct: 357 GGRVKPQFEIPPEAYLIISN----MGNVCLGILNGPEVGLGELNLIG-DISMLDKVMVFD 411
Query: 440 N 440
N
Sbjct: 412 N 412
>gi|21805926|gb|AAM76716.1| nucellin-like aspartic protease [Zea mays]
Length = 357
Score = 91.7 bits (226), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 78/270 (28%), Positives = 119/270 (44%), Gaps = 32/270 (11%)
Query: 111 VGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPC 168
+G PA + + +DTGSDL WL CD C SC + +Y P + VPC
Sbjct: 1 IGNPAKPYFLDVDTGSDLTWLQCDAPCRSC---------NKVPHPLYRPTANRL---VPC 48
Query: 169 NSTLCEL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
+ LC +CPS C YQ++Y +D S G L+ D L +S ++
Sbjct: 49 ANALCTALHSGQGSNNKCPSP-KQCDYQIKY-TDSASSQGVLINDSFSLPM---RSSNIR 103
Query: 222 SRISFGCGRVQT--GSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTG 279
++FGCG Q + AA +G+ GLG S+ S L QG+ N C ++G G
Sbjct: 104 PGLTFGCGYDQQVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVGHCLSTNGGG 163
Query: 280 RISFGDKGSPGQGET--PFSLRQTHPTYNITITQVSVGGNAVNFE-FSAIFDSGTSFTYL 336
+ FGD P T P + R + Y+ + ++ + +FDSG+++TY
Sbjct: 164 FLFFGDDVVPSSRVTWVPMAQRTSGNYYSPGSGTLYFDRRSLGVKPMEVVFDSGSTYTYF 223
Query: 337 N-DPAYTQISETFNSLAKEKRETSTSDLPF 365
P +S L+K ++ S LP
Sbjct: 224 TAQPYQAVVSALKGGLSKSLKQVSDPTLPL 253
>gi|326533540|dbj|BAK05301.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 410
Score = 91.7 bits (226), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 72/251 (28%), Positives = 121/251 (48%), Gaps = 25/251 (9%)
Query: 106 YTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
Y +++G+PA + + +DTGS+L WL +C VHG + Y+P + + K
Sbjct: 39 YATLNIGEPAKPYFLDVDTGSNLTWL--ECHHPVHGCKGCHPRP-PHPYYTP--ADGNLK 93
Query: 166 VPCNSTLCELQKQ----CPSAGSN----CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
V C S LC ++ P N C Y+++Y++ S G L D++ + +K+
Sbjct: 94 VVCGSPLCVAVRRDVPGIPECSRNDPHRCHYEIQYVTG--KSEGDLATDIISVNGRDKK- 150
Query: 218 KSVDSRISFGCGRVQTGSFLDGAAP-NGLFGLGMDKTSVPSILANQGLIP-NSFSMCFGS 275
RI+FGCG Q +P +G+ GLGM K + + L +I N C S
Sbjct: 151 -----RIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGLAAQLKGHKMIKENVIGHCLSS 205
Query: 276 DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAV--NFEFSAIFDSGTSF 333
G G + GD P +G T +R++ Y+ + +V + + N F A+FDSG+++
Sbjct: 206 KGKGVLYVGDFNPPTRGVTWAPMRESLFYYSPGLAEVFIDKQPIRGNPTFEAVFDSGSTY 265
Query: 334 TYLNDPAYTQI 344
T++ Y +I
Sbjct: 266 THVPAQIYNEI 276
>gi|17979392|gb|AAL49921.1| unknown protein [Arabidopsis thaliana]
Length = 439
Score = 91.7 bits (226), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 86/355 (24%), Positives = 141/355 (39%), Gaps = 58/355 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
++T + VG PA F V +DTGS+L W V+C + + ++ + S +
Sbjct: 84 YFTEIRVGTPAKKFRVVVDTGSELTW-----VNCRYRARGKDNR----RVFRADESKSFK 134
Query: 165 KVPCNSTLCELQ-------KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
V C + C++ CP+ + C Y RY +DG+ + G ++ + + +
Sbjct: 135 TVGCLTQTCKVDLMNLFSLTTCPTPSTPCSYDYRY-ADGSAAQGVFAKETITVGLTNGRM 193
Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC----- 272
+ + GC TG GA +G+ GL S S + L FS C
Sbjct: 194 ARLPGHL-IGCSSSFTGQSFQGA--DGVLGLAFSDFSFTSTATS--LYGAKFSYCLVDHL 248
Query: 273 ----------FGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE 322
FGS + + +F + TP L + P Y I + +S+G + ++
Sbjct: 249 SNKNVSNYLIFGSSRSTKTAF-------RRTTPLDLTRIPPFYAINVIGISLGYDMLDIP 301
Query: 323 FSA---------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSP 373
I DSGTS T L D AY Q+ E + +P EYC+ +
Sbjct: 302 SQVWDATSGGGTILDSGTSLTLLADAAYKQVVTGLARYLVELKRVKPEGVPIEYCFSFTS 361
Query: 374 NQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDN--VNIIG 426
+ P + +KGG F + +V + P + CLG V + N+IG
Sbjct: 362 GFNVSKLPQLTFHLKGGARFEPHRKSYLVDAAPG---VKCLGFVSAGTPATNVIG 413
>gi|224142009|ref|XP_002324353.1| predicted protein [Populus trichocarpa]
gi|222865787|gb|EEF02918.1| predicted protein [Populus trichocarpa]
Length = 412
Score = 91.7 bits (226), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 104/335 (31%), Positives = 149/335 (44%), Gaps = 43/335 (12%)
Query: 71 RLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYT-NVSVGQPALSFIVALDTGSDLF 129
RL RG+ + T L +G S+G Y V +G P F + DTGSD+
Sbjct: 43 RLSSRGMFPE-KQATTLPVQSGA------SIGAGDYVVTVGLGTPKKEFTLIFDTGSDIT 95
Query: 130 WLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCEL-------QKQCPS 181
W C+ CV + +P+TS++ + C+S LC+L + C S
Sbjct: 96 WTQCEPCVKTCYKQKEPR--------LNPSTSTSYKNISCSSALCKLVASGKKFSQSCSS 147
Query: 182 AGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAA 241
S C YQV+Y DG+ S GF + L L+ S +V FGCG+ G F A
Sbjct: 148 --STCLYQVQY-GDGSYSIGFFATETLTLS-----SSNVFKNFLFGCGQQNNGLFGGAAG 199
Query: 242 PNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLR-Q 300
GL K ++PS A S+ + S G +S G + S TP S
Sbjct: 200 LLGLG---RTKLALPSQTAKTYKKLFSYCLPASSSSKGYLSLGGQVSKSVKFTPLSADFD 256
Query: 301 THPTYNITITQVSVGGNAVNFEFSA-----IFDSGTSFTYLNDPAYTQISETFNSLAKEK 355
+ P Y + IT +SVGG ++ + SA + DSGT T L+ AY+++S F +L +
Sbjct: 257 STPFYGLDITGLSVGGRQLSIDESAFSAGTVIDSGTVITRLSPTAYSELSSAFQNLMTDY 316
Query: 356 RETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGG 390
TS + F+ CY S T P V +T KGG
Sbjct: 317 PSTSGYSI-FDTCYDFSKYDT-VRIPKVGVTFKGG 349
>gi|242045118|ref|XP_002460430.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
gi|241923807|gb|EER96951.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
Length = 488
Score = 91.3 bits (225), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 103/381 (27%), Positives = 167/381 (43%), Gaps = 41/381 (10%)
Query: 66 RDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTG 125
+DR +R + A+ K ++ A N L++ ++ ++ +G PA +V LDTG
Sbjct: 103 QDRVDAIRRKVTASSNKPKGGVSLLA-NWGKSLSTTNYV--ASLRLGTPATELVVELDTG 159
Query: 126 SDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE-------LQK 177
SD W+ C C C + ++ P SST S VPC + C+ +
Sbjct: 160 SDQSWVQCKPCADCYEQRDP---------VFDPTASSTYSAVPCGARECQELASSSSSRN 210
Query: 178 QCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS-VDSRISFGCGRVQTGSF 236
NCPY+V Y D + + G L D L L+ S + FGCG G+F
Sbjct: 211 CSSDNNKNCPYEVSY-DDDSHTVGDLARDTLTLSPSPSPSPADTVPGFVFGCGHSNAGTF 269
Query: 237 LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS--DGTGRISFGDKGSPGQGE- 293
+ +GL GLG+ K S+PS +A + +FS C S G +SFG + +
Sbjct: 270 GE---VDGLLGLGLGKASLPSQVAAR--YGAAFSYCLPSSPSAAGYLSFGGAAARANAQF 324
Query: 294 TPFSLRQTHPTYNITITQVSVGGNAVNFEFSA-------IFDSGTSFTYLNDPAYTQISE 346
T Q +Y + +T + V G A+ SA I DSGT+F+ L AY +
Sbjct: 325 TEMVTGQDPTSYYLNLTGIVVAGRAIKVPASAFATAAGTIIDSGTAFSRLPPSAYAALRS 384
Query: 347 TFNS-LAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSE 405
+F S + + + + + S F+ CY + ++T P V L G ++ V+ +
Sbjct: 385 SFRSAMGRYRYKRAPSSPIFDTCYDFTGHET-VRIPAVELVFADGATVHLHPSGVLYTW- 442
Query: 406 PKGLYLYCLGVVKSDNVNIIG 426
+ CL V + ++ I+G
Sbjct: 443 -NDVAQTCLAFVPNHDLGILG 462
>gi|357507805|ref|XP_003624191.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355499206|gb|AES80409.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 406
Score = 91.3 bits (225), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 74/235 (31%), Positives = 101/235 (42%), Gaps = 25/235 (10%)
Query: 128 LFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAG---- 183
+F L C +C SG +D +Y PN S TS+ VPC C P +G
Sbjct: 26 VFLLQLGCTAC----PKKSGLGMDLTLYDPNGSKTSNAVPCGDGFCTDTYSGPISGCKQD 81
Query: 184 SNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQSKSVDSRISFGCGRVQTGSFLDGA- 240
+CPY + Y DG+ ++G V D L + +K +S + FGCG Q+GS +
Sbjct: 82 MSCPYSITY-GDGSTTSGSFVNDSLTFDEVSGNLHTKPDNSSVIFGCGAKQSGSLSSNSD 140
Query: 241 -APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGTGRISFGDKGSPGQGETPFSL 298
A +G+ G G +SV S LA G + FS C S G G S G P TP
Sbjct: 141 EALDGIIGFGQANSSVLSQLAASGKVKRIFSHCLDSHHGGGIFSIGQVMEPKFNTTPLVP 200
Query: 299 RQTHPTYNITITQVSVGGNAVNFEF---------SAIFDSGTSFTYLNDPAYTQI 344
R H YN+ + + V G + I DSGT+ YL Y Q+
Sbjct: 201 RMAH--YNVILKDMDVDGEPILLPLYLFDSGSGRGTIIDSGTTLAYLPLSIYNQL 253
>gi|255568540|ref|XP_002525244.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223535541|gb|EEF37210.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 460
Score = 91.3 bits (225), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 96/353 (27%), Positives = 155/353 (43%), Gaps = 45/353 (12%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+Y + +G P + + LDTGS L WL C CV H +D ++ P+ S+T
Sbjct: 120 YYLKLGLGSPPKYYTMILDTGSSLSWLQCKPCVVYCH-------SQVD-PLFEPSASNTY 171
Query: 164 SKVPCNSTLCELQKQ-------CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
+ C+S+ C L K C ++G C Y Y D + S G+L D+L L
Sbjct: 172 RPLYCSSSECSLLKAATLNDPLCTASGV-CVYTASY-GDASYSMGYLSRDLLTLTP---- 225
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--- 273
S+++ S ++GCG+ G F A G+ GL DK S+ + L+ + +FS C
Sbjct: 226 SQTLPS-FTYGCGQDNEGLFGKAA---GIVGLARDKLSMLAQLSPK--YGYAFSYCLPTS 279
Query: 274 GSDGTGRISFGDKGSPGQGETPFSLRQTHPTY------NITITQVSVGGNAVNFEFSAIF 327
S G G +S G TP +P+ IT+ VG A ++ I
Sbjct: 280 TSSGGGFLSIGKISPSSYKFTPMIRNSQNPSLYFLRLAAITVAGRPVGVAAAGYQVPTII 339
Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTM 387
DSGT T L Y + E F + + E + + + C+ S + P + +
Sbjct: 340 DSGTVVTRLPISIYAALREAFVKIMSRRYEQAPAYSILDTCFKGSLKSMSGA-PEIRMIF 398
Query: 388 KGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG----REYPIANNIS 436
+GG + P +++ ++ KG + CL S+ + IIG + Y IA ++S
Sbjct: 399 QGGADLSLRAPNILIEAD-KG--IACLAFASSNQIAIIGNHQQQTYNIAYDVS 448
>gi|6579210|gb|AAF18253.1|AC011438_15 T23G18.7 [Arabidopsis thaliana]
Length = 566
Score = 91.3 bits (225), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 82/254 (32%), Positives = 115/254 (45%), Gaps = 40/254 (15%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
L+YT V +G P F V +DTGSD+ W+ C SC +G +S I + + P SS++
Sbjct: 131 LYYTKVKLGTPPREFNVQIDTGSDVLWV--SCTSC-NGCPKTSELQIQLSFFDPGVSSSA 187
Query: 164 SKVPCNSTLCELQKQCPSAGSN---CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
S V C+ C Q S S C Y +Y DG+ ++G+ + D
Sbjct: 188 SLVSCSDRRCYSNFQTESGCSPNNLCSYSFKY-GDGSGTSGYYISD-------------- 232
Query: 221 DSRISFGCGRVQTGSFLD-GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD--G 277
F C +Q+G A +G+FGLG SV S LA QGL P FS C D G
Sbjct: 233 -----FMCSNLQSGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCLKGDKSG 287
Query: 278 TGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA---------IFD 328
G + G P TP L + P YN+ + ++V G + + S I D
Sbjct: 288 GGIMVLGQIKRPDTVYTP--LVPSQPHYNVNLQSIAVNGQILPIDPSVFTIATGDGTIID 345
Query: 329 SGTSFTYLNDPAYT 342
+GT+ YL D AY+
Sbjct: 346 TGTTLAYLPDEAYS 359
>gi|226495667|ref|NP_001146721.1| uncharacterized protein LOC100280323 [Zea mays]
gi|219888491|gb|ACL54620.1| unknown [Zea mays]
Length = 557
Score = 91.3 bits (225), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 76/256 (29%), Positives = 111/256 (43%), Gaps = 32/256 (12%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
+YT++ +G P + + +DTGSDL W+ CD C + G + +Y P +
Sbjct: 187 YYTSIFIGNPPRPYFLDVDTGSDLTWIQCDAPCTNFAKGPHP---------LYKP---AK 234
Query: 163 SSKVPCNSTLCEL----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
VP LC+ Q C + C Y++ Y +D + S G L D +H+ +
Sbjct: 235 EKIVPPRDLLCQELQGNQNYCETC-KQCDYEIEY-ADQSSSMGVLARDDMHMIATNGGRE 292
Query: 219 SVDSRISFGCGRVQTGSFLDGAAP-NGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD- 276
+D FGC Q G L A +G+ GL S PS LA+ G+I N F C +
Sbjct: 293 KLD--FVFGCAYDQQGQLLSSPAKTDGILGLSSAAISFPSQLASHGIIANVFGHCITREQ 350
Query: 277 -GTGRISFGDKGSPGQGETPFSLRQ-THPTYNITITQVSVGGNAVNFEFSA------IFD 328
G G + GD P G T S+R Y+ V G + A IFD
Sbjct: 351 GGGGYMFLGDDYVPRWGVTWTSIRSGPDNLYHTQAHHVKYGDQQLRRPEQAGSTVQVIFD 410
Query: 329 SGTSFTYLNDPAYTQI 344
SG+S+TYL + Y +
Sbjct: 411 SGSSYTYLPNEIYENL 426
>gi|356507437|ref|XP_003522473.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 440
Score = 91.3 bits (225), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 105/367 (28%), Positives = 155/367 (42%), Gaps = 52/367 (14%)
Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPN 158
+GF + T +++GQP + + +DTGSDL WL CD C C + +Y P
Sbjct: 76 VGFYNVT-LNIGQPPRPYFLDIDTGSDLTWLQCDAPCSRCSQTPHP---------LYRP- 124
Query: 159 TSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRY---LSDGTMSTGFLVEDVLHLA-TDE 214
++ VPC LC + P+Q Y +D S G L+ DV L T+
Sbjct: 125 ---SNDLVPCRHALCASLHLSDNYDCEVPHQCDYEVQYADHYSSLGVLLHDVYTLNFTNG 181
Query: 215 KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG 274
Q K R++ GCG Q +G+ GLG KTS+ S L +QGL+ N C
Sbjct: 182 VQLKV---RMALGCGYDQIFPDPSHHPLDGMLGLGRGKTSLTSQLNSQGLVRNVIGHCLS 238
Query: 275 SDGTGRISFGD-KGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-EFSAIFDSGTS 332
+ G G I FGD S TP S R ++ GG A+FD+G+S
Sbjct: 239 AQGGGYIFFGDVYDSFRLTWTPMSSRDYKHYSVAGAAELLFGGKKSGVGNLHAVFDTGSS 298
Query: 333 FTYLNDPAYTQI-----SETFNSLAKEKRETSTSDL------PFEYCYVLSPNQTNFEYP 381
+TY N AY + E+ KE + T L PF Y + + F+
Sbjct: 299 YTYFNSYAYQVLISWLKKESGGKPLKEAHDDQTLPLCWRGRRPFRSIYEV---RKYFKPI 355
Query: 382 VVNLTMKGGGPF---FVNDPIVIVSSEPKGLYLYCLGVVKSDNV-----NIIGREYPIAN 433
V++ T G + + +IVS+ CLG++ V N+IG + + N
Sbjct: 356 VLSFTSNGRSKAQFEMLPEAYLIVSNMGN----VCLGILNGSEVGMGDLNLIG-DISMLN 410
Query: 434 NISLFHN 440
+ +F N
Sbjct: 411 KVMVFDN 417
>gi|18400416|ref|NP_565559.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|20197296|gb|AAM15014.1| predicted protein [Arabidopsis thaliana]
gi|330252412|gb|AEC07506.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 458
Score = 91.3 bits (225), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 93/332 (28%), Positives = 137/332 (41%), Gaps = 39/332 (11%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L N SVGQP + + +DTGS L W+ C C C SS +I +++P SST
Sbjct: 95 LFLVNFSVGQPPVPQLTIMDTGSSLLWIQCQPCKHC------SSDHMIH-PVFNPALSST 147
Query: 163 SSKVPCNSTLCELQKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
+ C+ C SN C Y+ Y+S GT S G L ++ L T + V
Sbjct: 148 FVECSCDDRFCRYAPNGHCGSSNKCVYEQVYIS-GTGSKGVLAKERLTFTTPNGNT-VVT 205
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG-----SD 276
I+FGCG + G L+ G+ GLG TS+ L ++ FS C G +
Sbjct: 206 QPIAFGCG-YENGEQLESHF-TGILGLGAKPTSLAVQLGSK------FSYCIGDLANKNY 257
Query: 277 GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE----------FSAI 326
G ++ G+ TP + Y + + +SVG +N E I
Sbjct: 258 GYNQLVLGEDADILGDPTPIEFETENSIYYMNLEGISVGDTQLNIEPVVFKRRGPRTGVI 317
Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKRETST-SDLPFEYCYVLSPNQTNFEYPVVNL 385
DSGT +T+L D AY ++ S+ K E D CY ++ +PVV
Sbjct: 318 LDSGTLYTWLADIAYRELYNEIKSILDPKLERFWFRDF---LCYHGRVSEELIGFPVVTF 374
Query: 386 TMKGGGPFFVNDPIVIVS-SEPKGLYLYCLGV 416
GG + + SEP ++C+ V
Sbjct: 375 HFAGGAELAMEATSMFYPLSEPNTFNVFCMSV 406
>gi|413916291|gb|AFW56223.1| hypothetical protein ZEAMMB73_420944 [Zea mays]
Length = 383
Score = 91.3 bits (225), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 80/278 (28%), Positives = 118/278 (42%), Gaps = 33/278 (11%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
L+Y +++G P + + +D+GSDL WL CD C SC + +Y P S
Sbjct: 65 LYYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPCRSC---------NEVPHPLYRPTKSK 115
Query: 162 TSSKVPCNSTLCEL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVED--VLHLAT 212
VPC LC + +C S C Y ++Y G+ STG L+ D L L
Sbjct: 116 L---VPCVHRLCASLHNGLTGKHRCDSPHEQCDYVIKYADQGS-STGVLINDSFALRLTN 171
Query: 213 DEKQSKSVDSRISFGCGRVQTGSFLDGAAP-NGLFGLGMDKTSVPSILANQGLIPNSFSM 271
SV +FGCG Q D ++P +G+ GLG S+ S L +G+ N
Sbjct: 172 GSVARPSV----AFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLKQRGVTKNVVGH 227
Query: 272 CFGSDGTGRISFGDKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFS-AIFD 328
C G G + FGD P Q TP + Y+ + G ++ + +FD
Sbjct: 228 CLSLRGGGFLFFGDDLVPYQRATWTPMARSAFRNYYSPGSASLYFGDRSLGVRLAKVVFD 287
Query: 329 SGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPF 365
SG+SFTY Y + + L++ E + LP
Sbjct: 288 SGSSFTYFAAKPYQALVTALKDGLSRTLEEEPDTSLPL 325
>gi|212721496|ref|NP_001131929.1| uncharacterized protein LOC100193320 precursor [Zea mays]
gi|194692946|gb|ACF80557.1| unknown [Zea mays]
Length = 424
Score = 91.3 bits (225), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 80/278 (28%), Positives = 118/278 (42%), Gaps = 33/278 (11%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
L+Y +++G P + + +D+GSDL WL CD C SC + +Y P S
Sbjct: 56 LYYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPCRSC---------NEVPHPLYRPTKSK 106
Query: 162 TSSKVPCNSTLCEL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVED--VLHLAT 212
VPC LC + +C S C Y ++Y G+ STG L+ D L L
Sbjct: 107 L---VPCVHRLCASLHNGLTGKHRCDSPHEQCDYVIKYADQGS-STGVLINDSFALRLTN 162
Query: 213 DEKQSKSVDSRISFGCGRVQTGSFLDGAAP-NGLFGLGMDKTSVPSILANQGLIPNSFSM 271
SV +FGCG Q D ++P +G+ GLG S+ S L +G+ N
Sbjct: 163 GSVARPSV----AFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLKQRGVTKNVVGH 218
Query: 272 CFGSDGTGRISFGDKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFS-AIFD 328
C G G + FGD P Q TP + Y+ + G ++ + +FD
Sbjct: 219 CLSLRGGGFLFFGDDLVPYQRATWTPMARSAFRNYYSPGSASLYFGDRSLGVRLAKVVFD 278
Query: 329 SGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPF 365
SG+SFTY Y + + L++ E + LP
Sbjct: 279 SGSSFTYFAAKPYQALVTALKDGLSRTLEEEPDTSLPL 316
>gi|2290202|gb|AAB96882.1| nucellin [Hordeum vulgare subsp. vulgare]
gi|2290204|gb|AAB96883.1| nucellin [Hordeum vulgare subsp. vulgare]
gi|45357050|gb|AAS58479.1| nucellin [Hordeum vulgare subsp. vulgare]
Length = 410
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 74/276 (26%), Positives = 123/276 (44%), Gaps = 25/276 (9%)
Query: 106 YTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
Y +++G+PA + + +DTGS+L WL +C VHG + Y+P + + K
Sbjct: 39 YATLNIGEPAKPYFLDVDTGSNLTWL--ECHHPVHGCKGCHPRP-PHPYYTP--ADGNLK 93
Query: 166 VPCNSTLCELQKQ----CPSAGSN----CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
V C S LC ++ P N C Y+++Y++ S G L D++ + +K+
Sbjct: 94 VVCGSPLCVAVRRDVPGIPECSRNDPHRCHYEIQYVTG--KSEGDLATDIISVNGRDKK- 150
Query: 218 KSVDSRISFGCGRVQTGSFLDGAAP-NGLFGLGMDKTSVPSILANQGLIP-NSFSMCFGS 275
RI+FGCG Q +P +G+ GLGM K + L +I N C S
Sbjct: 151 -----RIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGHKMIKENVIGHCLSS 205
Query: 276 DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAV--NFEFSAIFDSGTSF 333
G G + GD P +G T +R++ Y+ + +V + + N F A+FDSG+++
Sbjct: 206 KGKGVLYVGDFNPPTRGVTWAPMRESLFYYSPGLAEVFIDKQPIRGNPTFEAVFDSGSTY 265
Query: 334 TYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
T++ Y +I E C+
Sbjct: 266 THVPAQIYNEIVSKVRGTLSESSLEEVKGRALPLCW 301
>gi|357476337|ref|XP_003608454.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355509509|gb|AES90651.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 683
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 88/336 (26%), Positives = 148/336 (44%), Gaps = 36/336 (10%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
T + +G P F + +DTGS + ++PC C C G+ D + P+ SST
Sbjct: 83 TRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQC--------GRHQDPK-FQPDLSSTYQP 133
Query: 166 VPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
V C L C + C Y+ +Y ++ + S+G L EDV+ QS+ R
Sbjct: 134 VKCT-----LDCNCDNDRMQCVYERQY-AEMSTSSGVLGEDVVSFGN---QSELAPQRAV 184
Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISF 283
FGC V+TG A +G+ GLG S+ L ++ ++ +SFS+C+G G G +
Sbjct: 185 FGCENVETGDLYSQHA-DGIMGLGRGDLSIMDQLVDKNVVSDSFSLCYGGMDVGGGAMVL 243
Query: 284 GDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-------EFSAIFDSGTSFTYL 336
G P S P YNI + ++ V G + + ++ DSGT++ YL
Sbjct: 244 GGISPPSDMVFAQSDPVRSPYYNIDLKEIHVAGKRLPLNPSVFDGKHGSVLDSGTTYAYL 303
Query: 337 NDPAYTQISETFNSLAKEKRETSTSDLPF-EYCYV---LSPNQTNFEYPVVNLTMKGGGP 392
+ A+ E + + S D + + C+ + +Q + +PVV++ G G
Sbjct: 304 PEEAFLAFKEAIVKELQSFSQISGPDPNYNDLCFSGAGIDVSQLSKTFPVVDMIF-GNGH 362
Query: 393 FFVNDPIVIVSSEPKGLYLYCLGVVKS--DNVNIIG 426
+ P + K YCLG+ ++ D ++G
Sbjct: 363 KYSLSPENYMFRHSKVRGAYCLGIFQNGKDPTTLLG 398
>gi|226509616|ref|NP_001152116.1| aspartic proteinase Asp1 precursor [Zea mays]
gi|195652765|gb|ACG45850.1| aspartic proteinase Asp1 precursor [Zea mays]
Length = 432
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 81/279 (29%), Positives = 118/279 (42%), Gaps = 34/279 (12%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
L+Y +++G P + + +D+GSDL WL CD C SC + +Y P S
Sbjct: 63 LYYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPCRSC---------NEVPHPLYRPTKSK 113
Query: 162 TSSKVPCNSTLCEL--------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVED--VLHLA 211
VPC LC + +C S C Y ++Y G+ STG LV D L L
Sbjct: 114 L---VPCVHRLCASLHNALTGGKHRCESPHEQCDYVIKYADQGS-STGVLVNDSFALRLT 169
Query: 212 TDEKQSKSVDSRISFGCGRVQTGSFLDGAAP-NGLFGLGMDKTSVPSILANQGLIPNSFS 270
SV +FGCG Q D ++P +G+ GLG S+ S L +G+ N
Sbjct: 170 NGSVARPSV----AFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLKQRGVTKNVVG 225
Query: 271 MCFGSDGTGRISFGDKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFS-AIF 327
C G G + FGD P Q TP + Y+ + G ++ + +F
Sbjct: 226 HCLSLRGGGFLFFGDDLVPYQRATWTPMARSAFRNYYSPGSASLYFGDRSLGVRLAKVVF 285
Query: 328 DSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPF 365
DSG+SFTY Y + + L++ E + LP
Sbjct: 286 DSGSSFTYFAAKPYQALVTALKDGLSRTLEEEPDTSLPL 324
>gi|357492401|ref|XP_003616489.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517824|gb|AES99447.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 88/300 (29%), Positives = 126/300 (42%), Gaps = 37/300 (12%)
Query: 110 SVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPC 168
SVG P DTGSD+ WL C+ C C + I++P+ SS+ +PC
Sbjct: 92 SVGTPPTKIYGIADTGSDIVWLQCEPCEQCYNQTTP---------IFNPSKSSSYKNIPC 142
Query: 169 NSTLCELQKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFG 227
S LC + + N C Y++ Y D + S G L D L L + S + G
Sbjct: 143 LSKLCHSVRDTSCSDQNSCQYKISY-GDSSHSQGDLSVDTLSLESTSGSPVSFPKTV-IG 200
Query: 228 CGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF------GSDGTGRI 281
CG G+F G A +G+ GLG S+ + L + I FS C S+ + +
Sbjct: 201 CGTDNAGTF--GGASSGIVGLGGGPVSLITQLGSS--IGGKFSYCLVPLLNKESNASSIL 256
Query: 282 SFGDKG-SPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNF---------EFSAIFDSG 330
SFGD G G L + P Y +T+ SVG V F E + I DSG
Sbjct: 257 SFGDAAVVSGDGVVSTPLIKKDPVFYFLTLQAFSVGNKRVEFGGSSEGGDDEGNIIIDSG 316
Query: 331 TSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGG 390
T+ T + YT + L K R + F CY L N+ +++P++ KG
Sbjct: 317 TTLTLIPSDVYTNLESAVVDLVKLDR-VDDPNQQFSLCYSLKSNE--YDFPIITAHFKGA 373
>gi|30682289|ref|NP_187876.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|11994412|dbj|BAB02414.1| chloroplast nucleoid DNA binding protein-like [Arabidopsis
thaliana]
gi|332641715|gb|AEE75236.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 461
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 86/355 (24%), Positives = 141/355 (39%), Gaps = 58/355 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
++T + VG PA F V +DTGS+L W V+C + + ++ + S +
Sbjct: 106 YFTEIRVGTPAKKFRVVVDTGSELTW-----VNCRYRARGKDNR----RVFRADESKSFK 156
Query: 165 KVPCNSTLCELQ-------KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
V C + C++ CP+ + C Y RY +DG+ + G ++ + + +
Sbjct: 157 TVGCLTQTCKVDLMNLFSLTTCPTPSTPCSYDYRY-ADGSAAQGVFAKETITVGLTNGRM 215
Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC----- 272
+ + GC TG GA +G+ GL S S + L FS C
Sbjct: 216 ARLPGHL-IGCSSSFTGQSFQGA--DGVLGLAFSDFSFTSTATS--LYGAKFSYCLVDHL 270
Query: 273 ----------FGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE 322
FGS + + +F + TP L + P Y I + +S+G + ++
Sbjct: 271 SNKNVSNYLIFGSSRSTKTAF-------RRTTPLDLTRIPPFYAINVIGISLGYDMLDIP 323
Query: 323 FSA---------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSP 373
I DSGTS T L D AY Q+ E + +P EYC+ +
Sbjct: 324 SQVWDATSGGGTILDSGTSLTLLADAAYKQVVTGLARYLVELKRVKPEGVPIEYCFSFTS 383
Query: 374 NQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDN--VNIIG 426
+ P + +KGG F + +V + P + CLG V + N+IG
Sbjct: 384 GFNVSKLPQLTFHLKGGARFEPHRKSYLVDAAPG---VKCLGFVSAGTPATNVIG 435
>gi|225216930|gb|ACN85225.1| aspartic proteinase nepenthesin-1 precursor [Oryza punctata]
gi|225216938|gb|ACN85232.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
Length = 516
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 99/362 (27%), Positives = 149/362 (41%), Gaps = 44/362 (12%)
Query: 99 NSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYS 156
+LG +Y V +G PA + V DTGSD W+ C CV + ++
Sbjct: 172 RALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQRE--------KLFD 223
Query: 157 PNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
P SST + V C + C +G +C Y V+Y DG+ S GF D L L++ +
Sbjct: 224 PARSSTYANVSCAAPACSDLDTRGCSGGHCLYGVQY-GDGSYSIGFFAMDTLTLSSYDAV 282
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--G 274
FGCG G F + A GL GLG KTS+P ++ F+ C
Sbjct: 283 KG-----FRFGCGERNEGLFGEAA---GLLGLGRGKTSLPVQTYDK--YGGVFAHCLPAR 332
Query: 275 SDGTGRISFGDKGSPGQ--GETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA------I 326
S GTG + FG GSP TP + Y + +T + VGG + S I
Sbjct: 333 STGTGYLDFG-AGSPAARLTTTPMLVDNGPTFYYVGLTGIRVGGRLLYIPQSVFATAGTI 391
Query: 327 FDSGTSFTYLNDPAYTQISETFNSL--AKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVN 384
DSGT T L AY+ + F + A+ ++ L + CY + + P V+
Sbjct: 392 VDSGTVITRLPPAAYSSLRSAFAAAMSARGYKKAPAVSL-LDTCYDFA-GMSQVAIPTVS 449
Query: 385 LTMKGGGPFFVNDPIVIVSSEPKGLYLYC--------LGVVKSDNVNIIGREYPIANNIS 436
L +GG V+ ++ ++ + L +G+V + + G Y I +
Sbjct: 450 LLFQGGARLDVDASGIMYAASASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVV 509
Query: 437 LF 438
F
Sbjct: 510 SF 511
>gi|357157325|ref|XP_003577760.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 413
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 82/282 (29%), Positives = 122/282 (43%), Gaps = 43/282 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
+Y +++G PA + + +DTGSDL WL CD C SC + +Y P +
Sbjct: 52 YYVTMNIGDPAKPYFLDIDTGSDLTWLQCDAPCQSC---------NKVPHPLYKP---TK 99
Query: 163 SSKVPCNSTLCEL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEK 215
+ VPC +++C K+C + C YQ++Y +D S G LV D L +
Sbjct: 100 NKLVPCAASICTTLHSAQSPNKKC-AVPQQCDYQIKY-TDSASSLGVLVTDNFTLPL--R 155
Query: 216 QSKSVDSRISFGCGRVQT--GSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
S SV +FGCG Q + + A +GL GLG S+ S L G+ N C
Sbjct: 156 NSSSVRPSFTFGCGYDQQVGKNGVVQATTDGLLGLGKGSVSLVSQLKVLGITKNVLGHCL 215
Query: 274 GSDGTGRISFGDKGSPGQGETPFSL-RQTHPTYNITITQVSVGGNAVNFE--------FS 324
++G G + FGD P T + R T Y S G + F+
Sbjct: 216 STNGGGFLFFGDNVVPTSRATWVPMVRSTSGNY------YSPGSGTLYFDRRSLGVKPME 269
Query: 325 AIFDSGTSFTYL-NDPAYTQISETFNSLAKEKRETSTSDLPF 365
+FDSG+++TY P +S L+K ++ S LP
Sbjct: 270 VVFDSGSTYTYFAAQPYQATVSALKAGLSKSLQQVSDPSLPL 311
>gi|238006986|gb|ACR34528.1| unknown [Zea mays]
gi|413916290|gb|AFW56222.1| aspartic proteinase Asp1 [Zea mays]
Length = 433
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 80/278 (28%), Positives = 118/278 (42%), Gaps = 33/278 (11%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
L+Y +++G P + + +D+GSDL WL CD C SC + +Y P S
Sbjct: 65 LYYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPCRSC---------NEVPHPLYRPTKSK 115
Query: 162 TSSKVPCNSTLCEL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVED--VLHLAT 212
VPC LC + +C S C Y ++Y G+ STG L+ D L L
Sbjct: 116 L---VPCVHRLCASLHNGLTGKHRCDSPHEQCDYVIKYADQGS-STGVLINDSFALRLTN 171
Query: 213 DEKQSKSVDSRISFGCGRVQTGSFLDGAAP-NGLFGLGMDKTSVPSILANQGLIPNSFSM 271
SV +FGCG Q D ++P +G+ GLG S+ S L +G+ N
Sbjct: 172 GSVARPSV----AFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLKQRGVTKNVVGH 227
Query: 272 CFGSDGTGRISFGDKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFS-AIFD 328
C G G + FGD P Q TP + Y+ + G ++ + +FD
Sbjct: 228 CLSLRGGGFLFFGDDLVPYQRATWTPMARSAFRNYYSPGSASLYFGDRSLGVRLAKVVFD 287
Query: 329 SGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPF 365
SG+SFTY Y + + L++ E + LP
Sbjct: 288 SGSSFTYFAAKPYQALVTALKDGLSRTLEEEPDTSLPL 325
>gi|413944392|gb|AFW77041.1| hypothetical protein ZEAMMB73_800604 [Zea mays]
Length = 476
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 91/334 (27%), Positives = 142/334 (42%), Gaps = 38/334 (11%)
Query: 98 LNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFN--IY 155
L++L F+ V G PA ++ V DTGSD+ W+ C+ C SG + I+
Sbjct: 130 LDTLEFV--VTVGFGTPAQTYTVIFDTGSDVSWI--QCLPC-------SGHCYKQHDPIF 178
Query: 156 SPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEK 215
P S+T S VPC C + C Y+V Y DG+ S G L + L L +
Sbjct: 179 DPTKSATYSVVPCGHPQCAAADGSKCSNGTCLYKVEY-GDGSSSAGVLSHETLSLTSTRA 237
Query: 216 QSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
+FGCG+ G F D +GL GLG + S+ S A +FS C S
Sbjct: 238 LPG-----FAFGCGQTNLGDFGD---VDGLIGLGRGQLSLSSQAAAS--FGGTFSYCLPS 287
Query: 276 DGT--GRISFGDKGSPGQGETPFSL---RQTHPT-YNITITQVSVGGNAVNF------EF 323
D T G ++ G + ++ +Q +P+ Y + + + +GG + +
Sbjct: 288 DNTTHGYLTIGPTTPASNDDVQYTAMVQKQDYPSFYFVELVSIDIGGYILPVPPTLFTDD 347
Query: 324 SAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
DSGT TYL AYT + + F + + D PF+ CY + Q+ P V
Sbjct: 348 GTFLDSGTILTYLPPEAYTALRDRFKFTMTQYKPAPAYD-PFDTCYDFT-GQSAIFIPAV 405
Query: 384 NLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVV 417
+ G F ++ +++ + + CLG V
Sbjct: 406 SFKFSDGSVFDLSFFGILIFPDDTAPAIGCLGFV 439
>gi|255543963|ref|XP_002513044.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223548055|gb|EEF49547.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 431
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 97/343 (28%), Positives = 150/343 (43%), Gaps = 60/343 (17%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ +G PA + ++A+DT +D W+PC CV C +++ S+T
Sbjct: 96 YIVRAKIGTPAQTMLLAMDTSNDAAWIPCSGCVGC------------SSTVFNNVKSTTF 143
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
V C + C+ GS C + + Y S + L +DV+ LATD S
Sbjct: 144 KTVGCEAPQCKQVPNSKCGGSACAFNMTYGSSSIAAN--LSQDVVTLATDSIPS------ 195
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS----DGTG 279
+FGC TGS + P GL GLG S+ S Q L ++FS C S + +G
Sbjct: 196 YTFGCLTEATGSSIP---PQGLLGLGRGPMSLLS--QTQNLYQSTFSYCLPSFRSLNFSG 250
Query: 280 RISFGDKGSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEFSA-----------I 326
+ G G P + +T L+ + Y + + + VG V+ SA I
Sbjct: 251 SLRLGPVGQPKRIKTTPLLKNPRRSSLYYVNLMAIRVGRRVVDIPPSALAFNPTTGAGTI 310
Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY---VLSPNQTNFEYPVV 383
FDSGT FT L PAYT + + F + T TS F+ CY +++P T F + +
Sbjct: 311 FDSGTVFTRLVAPAYTAVRDAFRK--RVGNATVTSLGGFDTCYTSPIVAPTIT-FMFSGM 367
Query: 384 NLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKS-DNVNII 425
N+T+ D ++I S+ + CL + + DNVN +
Sbjct: 368 NVTLPP-------DNLLIHSTASS---ITCLAMAAAPDNVNSV 400
>gi|168041176|ref|XP_001773068.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675615|gb|EDQ62108.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 365
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 97/346 (28%), Positives = 143/346 (41%), Gaps = 46/346 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+ V +G P F V +DTGSDL W+ C + N + ++ PNTS++ +
Sbjct: 13 YLATVRLGTPERVFSVIVDTGSDLTWVQCSPCGKCYSQNDA--------LFLPNTSTSFT 64
Query: 165 KVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRI 224
K+ C S LC + C Y Y DG+++TG V D + + Q + V
Sbjct: 65 KLACGSALCNGLPFPMCNQTTCVYWYSY-GDGSLTTGDFVYDTITMDGINGQKQQV-PNF 122
Query: 225 SFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-----GSDGTG 279
+FGCG GSF A +G+ GLG S S L + + FS C T
Sbjct: 123 AFGCGHDNEGSF---AGADGILGLGQGPLSFHSQL--KSVYNGKFSYCLVDWLAPPTQTS 177
Query: 280 RISFGDKGSPGQGET---PFSLRQTHPT-YNITITQVSVGGNAVNFEFSA---------- 325
+ FGD P + P PT Y + + +SVG N +N +
Sbjct: 178 PLLFGDAAVPILPDVKYLPILANPKVPTYYYVKLNGISVGDNLLNISSTVFDIDSVGGAG 237
Query: 326 -IFDSGTSFTYLNDPAYTQISETFNS--LAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
IFDSGT+ T L + AY ++ N+ +A ++ S L + C P P
Sbjct: 238 TIFDSGTTVTQLAEAAYKEVLAAMNASTMAYSRKIDDISRL--DLCLSGFPKDQLPTVPA 295
Query: 383 VNLTMKGGGPFF--VNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
+ +GG N I + SS+ YC + S +VNIIG
Sbjct: 296 MTFHFEGGDMVLPPSNYFIYLESSQS-----YCFAMTSSPDVNIIG 336
>gi|2541876|dbj|BAA22813.1| CND41, chloroplast nucleoid DNA binding protein [Nicotiana tabacum]
Length = 502
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 96/344 (27%), Positives = 140/344 (40%), Gaps = 43/344 (12%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ NV +G P + DTGSDL W C CV + I+ P+TS T
Sbjct: 154 YIVNVGLGTPKKDLSLIFDTGSDLTWTQCQPCVKSCYAQQQP--------IFDPSTSKTY 205
Query: 164 SKVPCNSTLCELQKQCPS-----AGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
S + C S C K + SNC Y ++Y D + + GF +D L L ++
Sbjct: 206 SNISCTSAACSSLKSATGNSPGCSSSNCVYGIQY-GDSSFTIGFFAKDKLTLTQND---- 260
Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD-- 276
V FGCG+ G F A GL GLG D S+ A + FS C +
Sbjct: 261 -VFDGFMFGCGQNNKGLFGKTA---GLIGLGRDPLSIVQQTAQK--FGKYFSYCLPTSRG 314
Query: 277 GTGRISFGD----KGSP----GQGETPFSLRQTHPTYNITITQVSVGGNAVNFE------ 322
G ++FG+ K S G TPF+ Q Y I + +SVGG A++
Sbjct: 315 SNGHLTFGNGNGVKASKAVKNGITFTPFASSQGTAYYFIDVLGISVGGKALSISPMLFQN 374
Query: 323 FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
I DSGT T L AY + F K T+ + + CY LS N T+ P
Sbjct: 375 AGTIIDSGTVITRLPSTAYGSLKSAFKQFM-SKYPTAPALSLLDTCYDLS-NYTSISIPK 432
Query: 383 VNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
++ G ++ +++++ + L G D++ I G
Sbjct: 433 ISFNFNGNANVELDPNGILITNGASQVCLAFAGNGDDDSIGIFG 476
>gi|168014188|ref|XP_001759635.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689174|gb|EDQ75547.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 485
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 94/373 (25%), Positives = 154/373 (41%), Gaps = 37/373 (9%)
Query: 67 DRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGS 126
DR F RGRGL +D L + G+ + + V +G PA F + +DTGS
Sbjct: 71 DRRFERRGRGLVEDAR------MVLHDD---LLTKGY-YTSRVFIGTPAQEFALIVDTGS 120
Query: 127 DLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSN 185
+ ++PC C C H Q + P+ SS+ V CNS C + K C +
Sbjct: 121 TVTYVPCSSCTHCGHH------QACFDPRFKPDNSSSYQTVSCNSPDC-ITKMCDARVHQ 173
Query: 186 CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGL 245
C Y+ R ++ + S G L +D+L S+ + FGC +TG A +G+
Sbjct: 174 CKYE-RVYAEMSSSKGVLGKDLLGFGNG---SRLQPHPLLFGCETAETGDLYLQHA-DGI 228
Query: 246 FGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISFGDKGSPGQGETPFSLRQTHP 303
GLG S+ L G + +SFS+C+G +G G + G P S
Sbjct: 229 MGLGRGPLSIVDQLVGTGAMEDSFSLCYGGMDEGGGSMVLGAIPPPPAMVFAKSDPNRSN 288
Query: 304 TYNITITQVSVGGNAVNF-------EFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKR 356
YN+ ++++ V G ++N + DSGT++ YL D A+ + +
Sbjct: 289 YYNLELSEIQVQGVSLNVPSEVFNGRLGTVLDSGTTYAYLPDKAFDAFKDAITQQLGSLQ 348
Query: 357 ETSTSDLPF-EYCYVLSPNQTNF---EYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLY 412
D + + C+ + + + +P V+ G F+ P + K Y
Sbjct: 349 AVPGPDPSYPDVCFAGAGSDSKALGKHFPPVDFVFSGNQKVFLA-PENYLFKHTKVPGAY 407
Query: 413 CLGVVKSDNVNII 425
CLG K+ + +
Sbjct: 408 CLGFFKNQDATTL 420
>gi|293332735|ref|NP_001168472.1| uncharacterized protein LOC100382248 [Zea mays]
gi|223948487|gb|ACN28327.1| unknown [Zea mays]
Length = 434
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 87/298 (29%), Positives = 131/298 (43%), Gaps = 40/298 (13%)
Query: 109 VSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
V +G PA F V DTGSD W+ C CV+ + ++ P S+T + +
Sbjct: 100 VRLGTPAERFTVVFDTGSDTTWVQCQPCVAYCYRQKEP--------LFDPTKSATYANIS 151
Query: 168 CNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFG 227
C+S+ C +G +C Y ++Y DG+ + GF +D L LA D ++ FG
Sbjct: 152 CSSSYCSDLYVSGCSGGHCLYGIQY-GDGSYTIGFYAQDTLTLAYDTIKN------FRFG 204
Query: 228 CGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--GSDGTGRISFGD 285
CG G F A GL GLG KTS+P ++ F+ C S GTG + G
Sbjct: 205 CGEKNRGLFGRAA---GLLGLGRGKTSLPVQAYDK--YGGVFAYCLPATSAGTGFLDLG- 258
Query: 286 KGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFSA------IFDSGTSFTYLN 337
G+P TP + + Y + +T + VGG+ + S + DSGT T L
Sbjct: 259 PGAPAANARLTPMLVDRGPTFYYVGMTGIKVGGHVLPIPGSVFSTAGTLVDSGTVITRLP 318
Query: 338 DPAYTQISETFNSLAKEKRETSTSDLP----FEYCYVLSPNQ-TNFEYPVVNLTMKGG 390
AY + F+ K + S P + CY L+ ++ + P V+L +GG
Sbjct: 319 PSAYAPLRSAFS---KAMQGLGYSAAPAFSILDTCYDLTGHKGGSIALPAVSLVFQGG 373
>gi|356528627|ref|XP_003532901.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 430
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 98/338 (28%), Positives = 144/338 (42%), Gaps = 49/338 (14%)
Query: 110 SVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPC 168
S+G P++ + DTGSDL WL C C +C + ++ P SST VPC
Sbjct: 93 SLGTPSVERLAIFDTGSDLSWLQCTPCKTCYPQ---------EAPLFDPTQSSTYVDVPC 143
Query: 169 NSTLCEL----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL-ATDEKQSKSVDSR 223
S C L Q++C S+ C Y +Y +D + + G L D + +T Q + +
Sbjct: 144 ESQPCTLFPQNQRECGSS-KQCIYLHQYGTD-SFTIGRLGYDTISFSSTGMGQGGATFPK 201
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDGTGR 280
FGC +F NG GLG S+ S L +Q I + FS C F S TG+
Sbjct: 202 SVFGCAFYSNFTFKISTKANGFVGLGPGPLSLASQLGDQ--IGHKFSYCMVPFSSTSTGK 259
Query: 281 ISFGDKGSPGQ-GETPFSLRQTHPTYNI-TITQVSVGGNAV---NFEFSAIFDSGTSFTY 335
+ FG + TPF + ++P+Y + + ++VG V + I DS T+
Sbjct: 260 LKFGSMAPTNEVVSTPFMINPSYPSYYVLNLEGITVGQKKVLTGQIGGNIIIDSVPILTH 319
Query: 336 LNDPAYTQ-ISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFF 394
L YT IS ++ E E + + PFEYC N TN +P G
Sbjct: 320 LEQGIYTDFISSVKEAINVEVAEDAPT--PFEYCVR---NPTNLNFPEFVFHFTGAD--- 371
Query: 395 VNDPIVIVSSEPKGLY------LYCLGVVKSDNVNIIG 426
V PK ++ L C+ VV S ++I G
Sbjct: 372 -------VVLGPKNMFIALDNNLVCMTVVPSKGISIFG 402
>gi|168064509|ref|XP_001784204.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664276|gb|EDQ51002.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 367
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 91/310 (29%), Positives = 134/310 (43%), Gaps = 43/310 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ V VG P + +DTGSD+ WL C C +C ++ +++P++SS+
Sbjct: 16 YFAVVGVGTPRRDMYLVVDTGSDITWLQCAPCTNCYKQKDA---------LFNPSSSSSF 66
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
+ C+S+LC + C YQ Y DG+ + G LV D + L + V +
Sbjct: 67 KVLDCSSSLCLNLDVMGCLSNKCLYQADY-GDGSFTMGELVTDNVVLDDAFGPGQVVLTN 125
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGTGR 280
I GCG G+F A G+ GLG S P+ L N FS C SD +
Sbjct: 126 IPLGCGHDNEGTFGTAA---GILGLGRGPLSFPNNL--DASTRNIFSYCLPDRESDPNHK 180
Query: 281 --ISFGDKGSP--GQGETPFSLRQTHPT----YNITITQVSVGGNAVN------FEFSA- 325
+ FGD P G F + +P Y + IT +SVGGN + F+ +
Sbjct: 181 STLVFGDAAIPHTATGSVKFIPQLRNPRVATYYYVQITGISVGGNLLTNIPASVFQLDSH 240
Query: 326 -----IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP-FEYCYVLSPNQTNFE 379
IFDSGT+ T L AYT + + F A TS +D F+ CY + +
Sbjct: 241 GNGGTIFDSGTTITRLEARAYTAVRDAFR--AATMHLTSAADFKIFDTCYDFT-GMNSIS 297
Query: 380 YPVVNLTMKG 389
P V +G
Sbjct: 298 VPTVTFHFQG 307
>gi|293334661|ref|NP_001168795.1| uncharacterized protein LOC100382594 precursor [Zea mays]
gi|223973065|gb|ACN30720.1| unknown [Zea mays]
Length = 631
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 72/265 (27%), Positives = 119/265 (44%), Gaps = 27/265 (10%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
T + +G P F + +D+GS + ++PC SC N + + P+ SS+ S V
Sbjct: 90 TRLYIGTPPQEFALIVDSGSTVTYVPCS--SCEQCGNHQDPR------FQPDLSSSYSPV 141
Query: 167 PCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISF 226
CN + C S C Y+ +Y ++ + S+G L ED++ ++S+ F
Sbjct: 142 KCN-----VDCTCDSDKKQCTYERQY-AEMSSSSGVLGEDIVSFG---RESELKPQHAIF 192
Query: 227 GCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISFG 284
GC +TG A +G+ GLG + S+ L +G+I +SFS+C+G G G + G
Sbjct: 193 GCENSETGDLFSQHA-DGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDIGGGAMVLG 251
Query: 285 DKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE-------FSAIFDSGTSFTYLN 337
+P S P YNI + ++ V G A+ E + DSGT++ YL
Sbjct: 252 GMLAPPDMIFSNSDPLRSPYYNIELKEIHVAGKALRVESRIFNSKHGTVLDSGTTYAYLP 311
Query: 338 DPAYTQISETFNSLAKEKRETSTSD 362
+ A+ E S ++ D
Sbjct: 312 EQAFVAFKEAVTSKVHSLKKIRGPD 336
>gi|255538124|ref|XP_002510127.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223550828|gb|EEF52314.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 641
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 87/327 (26%), Positives = 140/327 (42%), Gaps = 34/327 (10%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
T + +G P F + +DTGS + ++PC C C G+ D + P +SST
Sbjct: 90 TRLFIGTPPQEFALIVDTGSTVTYVPCSTCEQC--------GKHQDPR-FQPESSSTYKP 140
Query: 166 VPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
+ CN + C C G C Y+ RY ++ + S+G L EDVL +S+ R
Sbjct: 141 MQCNPS-C----NCDDEGKQCTYERRY-AEMSSSSGLLAEDVLSFGN---ESELTPQRAI 191
Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT--GRISF 283
FGC V+TG A +G+ GLG SV L + ++ NSFS+C+G G +
Sbjct: 192 FGCETVETGELFSQRA-DGIMGLGRGPLSVVDQLVIKEVVGNSFSLCYGGMDVVGGAMVL 250
Query: 284 GDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-------EFSAIFDSGTSFTYL 336
G+ P S YNI + ++ V G + + + DSGT++ YL
Sbjct: 251 GNIPPPPDMVFAHSDPYRSAYYNIELKELHVAGKRLKLNPRVFDGKHGTVLDSGTTYAYL 310
Query: 337 NDPAYTQISETFNSLAKEKRETSTSDLPF-EYCYV---LSPNQTNFEYPVVNLTMKGGGP 392
+ A+ + K ++ D + + C+ +Q + +P VN+ G G
Sbjct: 311 PEEAFVAFKDAIIKEIKFLKQIHGPDPSYNDICFSGAGRDVSQLSKIFPEVNMVF-GNGQ 369
Query: 393 FFVNDPIVIVSSEPKGLYLYCLGVVKS 419
P + K YCLG+ ++
Sbjct: 370 KLSLSPENYLFRHTKVSGAYCLGIFQN 396
>gi|255079464|ref|XP_002503312.1| predicted protein [Micromonas sp. RCC299]
gi|226518578|gb|ACO64570.1| predicted protein [Micromonas sp. RCC299]
Length = 649
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 77/261 (29%), Positives = 121/261 (46%), Gaps = 50/261 (19%)
Query: 105 HYTNVSVGQPA-LSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+Y N+++G P+ +F V +DTGS L ++PC +C + G D
Sbjct: 112 YYANIALGDPSPRTFQVIVDTGSTLTYVPC--ATCAKCGTHTGGTRFD------------ 157
Query: 164 SKVPCNSTLCELQKQCPSAG-------------SNCPYQVRYLSDGTMSTGFLVEDVLHL 210
P L +KQC +AG + C Y R ++G+ +G LV D +H
Sbjct: 158 ---PTGKWLTCQEKQCKAAGGPGICAGGRGAAANRCTYS-RTYAEGSGVSGDLVRDKMHF 213
Query: 211 ATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDK-TSVPSILANQGLIPNSF 269
D + + + FGC ++G+ D A +GL GLG ++ S+P+ LA+ +P F
Sbjct: 214 GGDIAPATNGTLDVVFGCTNAESGTIHDQEA-DGLIGLGNNQFASIPNQLADTHGLPRVF 272
Query: 270 SMCFGS-DGTGRISFGDKGSPGQGETP------FSLRQTHPTYNITIT-QVSVGGNAV-- 319
S+CFGS +G G +SFG P TP + + HP Y + T + +G AV
Sbjct: 273 SLCFGSFEGGGALSFGRL--PATPHTPPLVYTDMRVNEAHPAYYVVSTAAMKIGDVAVAT 330
Query: 320 ----NFEFSAIFDSGTSFTYL 336
+ + DSGT+FTY+
Sbjct: 331 PSDLAVGYGTVMDSGTTFTYV 351
>gi|115476828|ref|NP_001062010.1| Os08g0469000 [Oryza sativa Japonica Group]
gi|42407407|dbj|BAD09565.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113623979|dbj|BAF23924.1| Os08g0469000 [Oryza sativa Japonica Group]
gi|125603713|gb|EAZ43038.1| hypothetical protein OsJ_27627 [Oryza sativa Japonica Group]
Length = 448
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 95/351 (27%), Positives = 147/351 (41%), Gaps = 51/351 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ ++++G P L + +DTGSDL W C CV C + Q + + P S+T
Sbjct: 92 YLMDLAIGTPPLRYTAMVDTGSDLIWTQCAPCVLC-------ADQPTPY--FRPARSATY 142
Query: 164 SKVPCNSTLC-ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
VPC S LC L S C YQ Y D + G L + SK + S
Sbjct: 143 RLVPCRSPLCAALPYPACFQRSVCVYQY-YYGDEASTAGVLASETFTFGA-ANSSKVMVS 200
Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDGTG 279
++FGCG + +G + +G+ GLG S+ S L P+ FS C F S
Sbjct: 201 DVAFGCGNINSGQLANS---SGMVGLGRGPLSLVSQLG-----PSRFSYCLTSFLSPEPS 252
Query: 280 RISFG-----------DKGSPGQGETPFSLRQTHPT-YNITITQVSVGGN---------A 318
R++FG GSP Q TP + P+ Y +++ +S+G A
Sbjct: 253 RLNFGVFATLNGTNASSSGSPVQ-STPLVVNAALPSLYFMSLKGISLGQKRLPIDPLVFA 311
Query: 319 VNFEFS--AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQT 376
+N + + DSGTS T+L AY + S+ + T+ +++ E C+ P +
Sbjct: 312 INDDGTGGVFIDSGTSLTWLQQDAYDAVRRELVSVLRPLPPTNDTEIGLETCFPWPPPPS 371
Query: 377 -NFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
P + L GG V ++ G CL +++S + IIG
Sbjct: 372 VAVTVPDMELHFDGGANMTVPPENYMLIDGATG--FLCLAMIRSGDATIIG 420
>gi|409179880|gb|AFV26025.1| aspartic proteinase nepenthesin 2 [Nepenthes mirabilis]
Length = 437
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 109/424 (25%), Positives = 172/424 (40%), Gaps = 70/424 (16%)
Query: 31 GFDFHHRYSDPVKGILAVDDLPKKG-SFAYYS----ALAHRDRYFRLRGRGLAAQGNDKT 85
G HH P G+ V + G + Y A+ +R R L + +T
Sbjct: 28 GTLLHHGQKRPQPGLRVVLEQVDSGMNLTKYELIKRAIKRGERRMRSINAMLQSSSGIET 87
Query: 86 PLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNS 144
P+ AG+ Y +N V++G PA S +DTGSDL W C+ C C
Sbjct: 88 PVY--AGSGEYLMN---------VAIGTPASSLSAIMDTGSDLIWTQCEPCTQCFSQPTP 136
Query: 145 SSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAG--SNCPYQVRYLSDGTMSTGF 202
I++P SS+ S +PC S C+ PS ++C Y Y DG+ + G+
Sbjct: 137 ---------IFNPQDSSSFSTLPCESQYCQ---DLPSESCYNDCQYTYGY-GDGSSTQGY 183
Query: 203 LVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILA-- 260
+ + T S I+FGCG G F G GL G+G S+PS L
Sbjct: 184 MATETFTFET------SSVPNIAFGCGEDNQG-FGQGNGA-GLIGMGWGPLSLPSQLGVG 235
Query: 261 ------NQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSV 314
+ ++ GS +G +GSP SL T+ Y IT+ ++V
Sbjct: 236 QFSYCMTSSGSSSPSTLALGSAASGV----PEGSPSTTLIHSSLNPTY--YYITLQGITV 289
Query: 315 GGNAVNFEFSA-----------IFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSD 362
GG+ + S I DSGT+ TYL AY +++ F + + + S+S
Sbjct: 290 GGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQINLSPVDESSSG 349
Query: 363 LPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNV 422
L C+ L + + + P +++ GG + ++I +E G+ +G +
Sbjct: 350 L--STCFQLPSDGSTVQVPEISMQFDGGVLNLGEENVLISPAE--GVICLAMGSSSQQGI 405
Query: 423 NIIG 426
+I G
Sbjct: 406 SIFG 409
>gi|125561848|gb|EAZ07296.1| hypothetical protein OsI_29544 [Oryza sativa Indica Group]
Length = 448
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 95/351 (27%), Positives = 147/351 (41%), Gaps = 51/351 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ ++++G P L + +DTGSDL W C CV C + Q + + P S+T
Sbjct: 92 YLMDLAIGTPPLRYTAMVDTGSDLIWTQCAPCVLC-------ADQPTPY--FRPARSATY 142
Query: 164 SKVPCNSTLC-ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
VPC S LC L S C YQ Y D + G L + SK + S
Sbjct: 143 RLVPCRSPLCAALPYPACFQRSVCVYQY-YYGDEASTAGVLASETFTFGA-ANSSKVMVS 200
Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDGTG 279
++FGCG + +G + +G+ GLG S+ S L P+ FS C F S
Sbjct: 201 DVAFGCGNINSGQLANS---SGMVGLGRGPLSLVSQLG-----PSRFSYCLTSFLSPEPS 252
Query: 280 RISFG-----------DKGSPGQGETPFSLRQTHPT-YNITITQVSVGGN---------A 318
R++FG GSP Q TP + P+ Y +++ +S+G A
Sbjct: 253 RLNFGVFATLNGTNASSSGSPVQ-STPLVVNAALPSLYFMSLKGISLGQKRLPIDPLVFA 311
Query: 319 VNFEFS--AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQT 376
+N + + DSGTS T+L AY + S+ + T+ +++ E C+ P +
Sbjct: 312 INDDGTGGVFIDSGTSLTWLQQDAYDAVRHELVSVLRPLPPTNDTEIGLETCFPWPPPPS 371
Query: 377 -NFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
P + L GG V ++ G CL +++S + IIG
Sbjct: 372 VAVTVPDMELHFDGGANMTVPPENYMLIDGATG--FLCLAMIRSGDATIIG 420
>gi|326503794|dbj|BAK02683.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 456
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 98/368 (26%), Positives = 154/368 (41%), Gaps = 58/368 (15%)
Query: 86 PLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSS 145
P T + DT + V +G PA++ + +DTGSD+ W+ C NS+
Sbjct: 117 PTTLGSALDTME-------YVITVGIGSPAVTQTMMIDTGSDVSWVRC---------NST 160
Query: 146 SGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSN--CPYQVRYLSDGTMSTGFL 203
G ++ P+ S+T + C+S C SN C Y+V+Y DG+ +TG
Sbjct: 161 DG----LTLFDPSKSTTYAPFSCSSAACAQLGNNGDGCSNSGCQYRVQY-GDGSNTTGTY 215
Query: 204 VEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQG 263
D L L+ + + FGC + DG +GL GLG D S+ S A
Sbjct: 216 SSDTLALSASDTVTD-----FHFGCSHHEED--FDGEKIDGLMGLGGDAQSLVSQTA--A 266
Query: 264 LIPNSFSMCF--GSDGTGRISFGDKGSPGQG--ETPFSLRQTHPT-YNITITQVSVGGNA 318
SFS C + +G ++FG G TP PT Y + + +SVGG
Sbjct: 267 TYGKSFSYCLPPTNRTSGFLTFGAPNGTSGGFVTTPMLRWPKAPTLYGVLLQDISVGGTP 326
Query: 319 VNFEFS-----AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP-FEYCYVLS 372
+ + S ++ DSGT T+L AY+ +S F S R + L + CY +
Sbjct: 327 LGIQPSVLSNGSVMDSGTVITWLPRRAYSALSSAFRSSMTRLRHQRAAPLGILDTCYDFT 386
Query: 373 PNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLY-CLGVVKSDNVNIIG----R 427
N P V+L + GG +V + G+ + CL + +IIG R
Sbjct: 387 -GLVNVSIPAVSLVLDGG---------AVVDLDGNGIMIQDCLAFAATSGDSIIGNVQQR 436
Query: 428 EYPIANNI 435
+ + +++
Sbjct: 437 TFEVLHDV 444
>gi|61214232|sp|Q766C2.1|NEP2_NEPGR RecName: Full=Aspartic proteinase nepenthesin-2; AltName:
Full=Nepenthesin-II; Flags: Precursor
gi|41016423|dbj|BAD07475.1| aspartic proteinase nepenthesin II [Nepenthes gracilis]
Length = 438
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 107/420 (25%), Positives = 169/420 (40%), Gaps = 67/420 (15%)
Query: 34 FHHRYSDPVKGI-LAVDDLPKKGSFAYYS----ALAHRDRYFRLRGRGLAAQGNDKTPLT 88
HH P G+ + ++ + + Y A+ +R R L + +TP+
Sbjct: 31 LHHGQKRPQPGLRVDLEQVDSGKNLTKYELIKRAIKRGERRMRSINAMLQSSSGIETPVY 90
Query: 89 FSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSG 147
AG+ Y +N V++G P SF +DTGSDL W C+ C C
Sbjct: 91 --AGDGEYLMN---------VAIGTPDSSFSAIMDTGSDLIWTQCEPCTQCFSQPTP--- 136
Query: 148 QVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDV 207
I++P SS+ S +PC S C+ + C Y Y DG+ + G++ +
Sbjct: 137 ------IFNPQDSSSFSTLPCESQYCQDLPSETCNNNECQYTYGY-GDGSTTQGYMATET 189
Query: 208 LHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPN 267
T S I+FGCG G F G GL G+G S+PS L
Sbjct: 190 FTFET------SSVPNIAFGCGEDNQG-FGQGNGA-GLIGMGWGPLSLPSQLGV-----G 236
Query: 268 SFSMC---FGSDGTGRISFGD------KGSPGQGETPFSLRQTHPTYNITITQVSVGGNA 318
FS C +GS ++ G +GSP SL T+ Y IT+ ++VGG+
Sbjct: 237 QFSYCMTSYGSSSPSTLALGSAASGVPEGSPSTTLIHSSLNPTY--YYITLQGITVGGDN 294
Query: 319 VNFEFSA-----------IFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFE 366
+ S I DSGT+ TYL AY +++ F + + + S+S L
Sbjct: 295 LGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQINLPTVDESSSGL--S 352
Query: 367 YCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
C+ + + + P +++ GG I+I +E G+ +G ++I G
Sbjct: 353 TCFQQPSDGSTVQVPEISMQFDGGVLNLGEQNILISPAE--GVICLAMGSSSQLGISIFG 410
>gi|147814824|emb|CAN65806.1| hypothetical protein VITISV_015630 [Vitis vinifera]
Length = 449
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 76/307 (24%), Positives = 127/307 (41%), Gaps = 39/307 (12%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
++ VG P+ F++ DTGSDL W+ C C S + N + ++ ++ N SS+
Sbjct: 83 YFVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRS-RNCSNRKARRIRHKRVFHANLSSS 141
Query: 163 SSKVPCNSTLCELQ-------KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEK 215
+PC + +C+++ CP+ + C Y RY SDG+ + GF + + + E
Sbjct: 142 FKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRY-SDGSTALGFFANETVTVELKEG 200
Query: 216 QSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-- 273
+ + + + GC G A +G+ GLG K S A + FS C
Sbjct: 201 RKMKLHN-VLIGCSESFQGQSFQAA--DGVMGLGYSKYSFAIKAAEK--FGGKFSYCLVD 255
Query: 274 ---GSDGTGRISFGDKGSP-----GQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA 325
+ + ++FG S T L + Y + + +S+GG +
Sbjct: 256 HLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAMLKIPSEV 315
Query: 326 ---------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQT 376
I DSG+S T+L +PAY + + R+ P EYC+ N T
Sbjct: 316 WDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLEYCF----NST 371
Query: 377 NFEYPVV 383
FE +V
Sbjct: 372 GFEESLV 378
>gi|357158688|ref|XP_003578209.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 443
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 92/347 (26%), Positives = 145/347 (41%), Gaps = 45/347 (12%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ ++ +G P + LDTGSDL W C C+ CV Q F + P S +
Sbjct: 89 YLMSMGIGTPPRYYSAILDTGSDLIWTQCAPCMLCVD-------QPTPF--FDPAQSPSY 139
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
+K+PCNS +C + C YQ Y D + G L + T++ ++ R
Sbjct: 140 AKLPCNSPMCNALYYPLCYRNVCVYQYFY-GDSANTAGVLSNETFTFGTND--TRVTVPR 196
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQG--------LIPNSFSMCFGS 275
I+FGCG + GS +G+ G+ G G S+ S L + + P + FG+
Sbjct: 197 IAFGCGNLNAGSLFNGS---GMVGFGRGPLSLVSQLGSPRFSYCLTSFMSPVPSRLYFGA 253
Query: 276 DGTGRISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFS---------- 324
T + G P Q TPF + PT Y + +T +SVGG + + S
Sbjct: 254 YATLNSTSASTGEPVQ-STPFIVNPGLPTMYYLNMTGISVGGELLPIDPSVFAINDADGT 312
Query: 325 --AIFDSGTSFTYLNDPAYTQISETFNSLA--KEKRETSTSDLPFEYCYVLSPNQTNF-E 379
I DSG++ TYL AY + + F TS +D+ + C+V P
Sbjct: 313 GGVIIDSGSTITYLARAAYDMVHQAFADQVGLPLTNATSLADV-LDTCFVWPPPPRKIVT 371
Query: 380 YPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
P + +G + +++ + L CL + SD+ +IIG
Sbjct: 372 MPELAFHFEGANMELPLENYMLIDGDTGNL---CLAIAASDDGSIIG 415
>gi|357124468|ref|XP_003563922.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 450
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 101/348 (29%), Positives = 151/348 (43%), Gaps = 46/348 (13%)
Query: 100 SLGFLHY-TNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPN 158
S+G +Y T + +G P ++++ +D+GS L WL C C + +G +Y P
Sbjct: 102 SVGVGNYITRLGLGTPTTTYVMVVDSGSSLTWL--QCAPCAVSCHPQAGP-----LYDPR 154
Query: 159 TSSTSSKVPCNSTLC-ELQKQC--PSAGSN---CPYQVRYLSDGTMSTGFLVEDVLHLAT 212
SST + VPC++ C ELQ PS+ S C YQ Y DG+ S G+L +D + L+
Sbjct: 155 ASSTYAAVPCSAPQCAELQAATLNPSSCSGSGVCQYQASY-GDGSFSFGYLSKDTVSLS- 212
Query: 213 DEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC 272
S +GCG+ G F A GL GL +K S+ S LA + NSF+ C
Sbjct: 213 ----SSGSFPGFYYGCGQDNVGLFGRAA---GLIGLARNKLSLLSQLAPS--VGNSFAYC 263
Query: 273 F---GSDGTGRISFG---DKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFS 324
+ G +SFG D +PG+ + S Y +++ +SV G+ + S
Sbjct: 264 LPTSAAASAGYLSFGSNSDNKNPGKYSYTSMVSSSLDASLYFVSLAGMSVAGSPLAVPSS 323
Query: 325 ------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNF 378
I DSGT T L P YT +S+ + + S L + C+
Sbjct: 324 EYGSLPTIIDSGTVITRLPTPVYTALSKAVGAALAAPSAPAYSIL--QTCF--KGQVAKL 379
Query: 379 EYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
P VN+ GG + V+V CL +D+ IIG
Sbjct: 380 PVPAVNMAFAGGATLRLTPGNVLVDVNET---TTCLAFAPTDSTAIIG 424
>gi|9757837|dbj|BAB08274.1| unnamed protein product [Arabidopsis thaliana]
Length = 586
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 75/256 (29%), Positives = 116/256 (45%), Gaps = 39/256 (15%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
T + +G P F + +DTGS + ++PC C C G+ D + P S++
Sbjct: 78 TRLWIGTPPQEFALIVDTGSTVTYVPCSTCKQC--------GKHQDPK-FQPELSTSYQA 128
Query: 166 VPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
+ CN C G C Y+ RY ++ + S+G L ED++ + + S R
Sbjct: 129 LKCNPDC-----NCDDEGKLCVYERRY-AEMSSSSGVLSEDLISFGNESQLSPQ---RAV 179
Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISF 283
FGC +TG A +G+ GLG K SV L ++G+I + FS+C+G G G +
Sbjct: 180 FGCENEETGDLFSQRA-DGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGMEVGGGAMVL 238
Query: 284 GDKGSPGQG-----ETPFSLRQTHPTYNITITQVSVGGNAV-------NFEFSAIFDSGT 331
G K SP G PF P YNI + Q+ V G ++ N + + DSGT
Sbjct: 239 G-KISPPPGMVFSHSDPFR----SPYYNIDLKQMHVAGKSLKLNPKVFNGKHGTVLDSGT 293
Query: 332 SFTYLNDPAYTQISET 347
++ Y A+ I +
Sbjct: 294 TYAYFPKEAFIAIKDA 309
>gi|218185383|gb|EEC67810.1| hypothetical protein OsI_35379 [Oryza sativa Indica Group]
Length = 423
Score = 89.0 bits (219), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 78/282 (27%), Positives = 124/282 (43%), Gaps = 23/282 (8%)
Query: 106 YTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSC--VHGL--NSSSGQVIDFNIYSPNT 159
+ +++G PA + + +DTGS L WL CD C++C H L G + +Y P
Sbjct: 39 FVTMNIGDPAKPYFLDIDTGSTLTWLQCDYPCINCNKAHSLFYPRLIGSFVPHGLYKPEL 98
Query: 160 --SSTSSKVPCNSTLCELQKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
+ ++ C +L+K N C Y ++Y+ G S G L+ D L
Sbjct: 99 KYAVKCTEQRCADLYADLRKPMKCGPKNQCHYGIQYV--GGSSIGVLIVDSFSLPASNGT 156
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAP-NGLFGLGMDKTSVPSILANQGLI-PNSFSMCFG 274
+ + I+FGCG Q + + P NG+ GLG K ++ S L +QG+I + C
Sbjct: 157 N---PTSIAFGCGYNQGKNNHNVPTPVNGILGLGRGKVTLLSQLKSQGVITKHVLGHCIS 213
Query: 275 SDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE---FSAIFDSGT 331
S G G + FGD P G T + + H Y+ + N+ IFDSG
Sbjct: 214 SKGKGFLFFGDAKVPTSGVTWSPMNREHKHYSPRQGTLQFNSNSKPISAAPMEVIFDSGA 273
Query: 332 SFTYLN-DPAYTQISETFNSLAKEKR---ETSTSDLPFEYCY 369
++TY P + +S ++L+KE + E D C+
Sbjct: 274 TYTYFALQPYHATLSVVKSTLSKECKFLTEVKEKDRALTVCW 315
>gi|42568291|ref|NP_199124.3| aspartyl protease family protein [Arabidopsis thaliana]
gi|332007527|gb|AED94910.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 631
Score = 89.0 bits (219), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 76/256 (29%), Positives = 117/256 (45%), Gaps = 39/256 (15%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
T + +G P F + +DTGS + ++PC C C G+ D + P S++
Sbjct: 78 TRLWIGTPPQEFALIVDTGSTVTYVPCSTCKQC--------GKHQDPK-FQPELSTSYQA 128
Query: 166 VPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
+ CN C C G C Y+ RY ++ + S+G L ED++ + + S R
Sbjct: 129 LKCNPD-C----NCDDEGKLCVYERRY-AEMSSSSGVLSEDLISFGNESQLSPQ---RAV 179
Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISF 283
FGC +TG A +G+ GLG K SV L ++G+I + FS+C+G G G +
Sbjct: 180 FGCENEETGDLFSQRA-DGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGMEVGGGAMVL 238
Query: 284 GDKGSPGQG-----ETPFSLRQTHPTYNITITQVSVGGNAV-------NFEFSAIFDSGT 331
G K SP G PF P YNI + Q+ V G ++ N + + DSGT
Sbjct: 239 G-KISPPPGMVFSHSDPFR----SPYYNIDLKQMHVAGKSLKLNPKVFNGKHGTVLDSGT 293
Query: 332 SFTYLNDPAYTQISET 347
++ Y A+ I +
Sbjct: 294 TYAYFPKEAFIAIKDA 309
>gi|326517992|dbj|BAK07248.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 500
Score = 89.0 bits (219), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 94/340 (27%), Positives = 146/340 (42%), Gaps = 45/340 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+++ V VG+PA + LDTGSD+ WL C C C + +Y P+ S++
Sbjct: 163 YFSRVGVGRPARQLYMVLDTGSDVTWLQCQPCADCYAQSDP---------VYDPSVSTSY 213
Query: 164 SKVPCNSTLCE--LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
+ V C+S C C ++ +C Y+V Y DG+ + G + L L S
Sbjct: 214 ATVGCDSPRCRDLDAAACRNSTGSCLYEVAY-GDGSYTVGDFATETLTLGDSAPVSN--- 269
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGT 278
++ GCG G F+ A L G + S PS ++ +FS C S +
Sbjct: 270 --VAIGCGHDNEGLFVGAAGLLALGGGPL---SFPSQISA-----TTFSYCLVDRDSPSS 319
Query: 279 GRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA-----------IF 327
+ FGD P +T+ Y + ++ +SVGG A++ SA I
Sbjct: 320 STLQFGDSEQPAVTAPLIRSPRTNTFYYVALSGISVGGEALSIPSSAFAMDDAGSGGVIV 379
Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTM 387
DSGT+ T L AY + E F + S L F+ CY L+ +++ + P V L
Sbjct: 380 DSGTAVTRLQSGAYGALREAFVQGTQSLPRASGVSL-FDTCYDLA-GRSSVQVPAVALWF 437
Query: 388 KGGGPFFVNDPIVIVSSEPKGLYLYCLGVV-KSDNVNIIG 426
+GGG + ++ + G YCL S V+IIG
Sbjct: 438 EGGGELKLPAKNYLIPVDAAG--TYCLAFAGTSGPVSIIG 475
>gi|413943688|gb|AFW76337.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
gi|413943689|gb|AFW76338.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
Length = 499
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 87/298 (29%), Positives = 131/298 (43%), Gaps = 40/298 (13%)
Query: 109 VSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
V +G PA F V DTGSD W+ C CV+ + ++ P S+T + +
Sbjct: 165 VRLGTPAERFTVVFDTGSDTTWVQCQPCVAYCYRQKEP--------LFDPTKSATYANIS 216
Query: 168 CNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFG 227
C+S+ C +G +C Y ++Y DG+ + GF +D L LA D ++ FG
Sbjct: 217 CSSSYCSDLYVSGCSGGHCLYGIQY-GDGSYTIGFYAQDTLTLAYDTIKN------FRFG 269
Query: 228 CGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--GSDGTGRISFGD 285
CG G F A GL GLG KTS+P ++ F+ C S GTG + G
Sbjct: 270 CGEKNRGLFGRAA---GLLGLGRGKTSLPVQAYDK--YGGVFAYCLPATSAGTGFLDLG- 323
Query: 286 KGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFSA------IFDSGTSFTYLN 337
G+P TP + + Y + +T + VGG+ + S + DSGT T L
Sbjct: 324 PGAPAANARLTPMLVDRGPTFYYVGMTGIKVGGHVLPIPGSVFSTAGTLVDSGTVITRLP 383
Query: 338 DPAYTQISETFNSLAKEKRETSTSDLP----FEYCYVLSPNQ-TNFEYPVVNLTMKGG 390
AY + F+ K + S P + CY L+ ++ + P V+L +GG
Sbjct: 384 PSAYAPLRSAFS---KAMQGLGYSAAPAFSILDTCYDLTGHKGGSIALPAVSLVFQGG 438
>gi|2570402|gb|AAB97155.1| EEA1 [Hordeum vulgare subsp. vulgare]
Length = 410
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 73/251 (29%), Positives = 119/251 (47%), Gaps = 25/251 (9%)
Query: 106 YTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
Y +++G+PA + + +DTGS+L WL +C VHG + Y+P + K
Sbjct: 39 YATLNIGEPAKPYFLDVDTGSNLTWL--ECHPPVHGCKGCHPRP-PHPYYTP--ADGKLK 93
Query: 166 VPCNSTLCELQKQ----CPSAGSN----CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
V C S LC ++ P N C Y+++Y++ S G L D++ + +K+
Sbjct: 94 VVCGSPLCVAVRRDVPGIPECSRNDPHRCHYEIQYVTG--KSEGDLATDIISVNGRDKK- 150
Query: 218 KSVDSRISFGCGRVQTGSFLDGAAP-NGLFGLGMDKTSVPSILANQGLIP-NSFSMCFGS 275
RI+FGCG Q +P NG+ GLGM K + L +I N C S
Sbjct: 151 -----RIAFGCGYKQEEPPDSPPSPVNGILGLGMGKAGFAAQLKGLKMIKENVIGHCLSS 205
Query: 276 DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAV--NFEFSAIFDSGTSF 333
G G + GD P +G T +R++ Y+ + +V + + N F A+FDSG+++
Sbjct: 206 KGKGVLYVGDFNPPTRGVTWAPMRESLFYYSPGLAEVFIDKQPIRGNPTFEAVFDSGSTY 265
Query: 334 TYLNDPAYTQI 344
T++ Y +I
Sbjct: 266 THVPAQIYNEI 276
>gi|326512608|dbj|BAJ99659.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 484
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 99/324 (30%), Positives = 132/324 (40%), Gaps = 38/324 (11%)
Query: 80 QGNDKTPLTFSAGNDTYRLNSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCD-CVS 137
Q K +T A R SLG +Y ++ +G PA V DTGSDL W+ C C
Sbjct: 124 QARGKKGVTLPA----QRGISLGTGNYVVSMGLGTPARDMTVVFDTGSDLSWVQCTPCSD 179
Query: 138 CVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE-LQKQCPSAGSNCPYQVRYLSDG 196
C + ++ P SST S VPC S C+ L + S C Y+V Y D
Sbjct: 180 CYEQKDP---------LFDPARSSTYSAVPCASPECQGLDSRSCSRDKKCRYEVVY-GDQ 229
Query: 197 TMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVP 256
+ + G L D L L + V FGCG TG F G A +GL GLG +K S+
Sbjct: 230 SQTDGALARDTLTLTQSD-----VLPGFVFGCGEQDTGLF--GRA-DGLVGLGREKVSLS 281
Query: 257 SILANQGLIPNSFSMCFGS--DGTGRISFGDKGSPGQGETPFSLRQTHPT-YNITITQVS 313
S A++ FS C S G +S G T R P+ Y + + V
Sbjct: 282 SQAASK--YGAGFSYCLPSSPSAAGYLSLGGPAPANARFTAMETRHDSPSFYYVRLVGVK 339
Query: 314 VGGNAVNFE---FSA---IFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFE 366
V G V FSA + DSGT T L Y + F S+ + + + + +
Sbjct: 340 VAGRTVRVSPIVFSAAGTVIDSGTVITRLPPRVYAALRSAFARSMGRYGYKRAPALSILD 399
Query: 367 YCYVLSPNQTNFEYPVVNLTMKGG 390
CY + T P V L GG
Sbjct: 400 TCYDFT-GHTTVRIPSVALVFAGG 422
>gi|225438315|ref|XP_002272802.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 436
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 108/408 (26%), Positives = 163/408 (39%), Gaps = 62/408 (15%)
Query: 55 GSFAYYSALAHRDRYFRLRGRGLAAQG---NDKTPLTFSAGNDTYRLNSLGFLHYTNVSV 111
G++ + L + +LR + L+A+ AGN + + +++
Sbjct: 53 GNYTKFERLQRAMKRGKLRLQRLSAKTASFESSVEAPVHAGNGEFLMK---------LAI 103
Query: 112 GQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNS 170
G PA ++ +DTGSDL W C C C I+ P SS+ SK+PC+S
Sbjct: 104 GTPAETYSAIMDTGSDLIWTQCKPCKDCFDQPTP---------IFDPKKSSSFSKLPCSS 154
Query: 171 TLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGR 230
LC S C Y Y D + + G L + SV S+I FGCG
Sbjct: 155 DLCA-ALPISSCSDGCEYLYSY-GDYSSTQGVLATETFAFG-----DASV-SKIGFGCGE 206
Query: 231 VQTGS-FLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GSDGTGRISFGD 285
GS F GA GL GLG S+ S L FS C S G + G
Sbjct: 207 DNDGSGFSQGA---GLVGLGRGPLSLISQLGEP-----KFSYCLTSMDDSKGISSLLVGS 258
Query: 286 KGSPGQG-ETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA-----------IFDSGTS 332
+ + TP + P+ Y +++ +SVG + E S I DSGT+
Sbjct: 259 EATMKNAITTPLIQNPSQPSFYYLSLEGISVGDTLLPIEKSTFSIQNDGSGGLIIDSGTT 318
Query: 333 FTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGP 392
TYL D A+ + + F S K + S S + C+ L P+ + + P + +G
Sbjct: 319 ITYLEDSAFAALKKEFISQLKLDVDESGST-GLDLCFTLPPDASTVDVPQLVFHFEGADL 377
Query: 393 FFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREYPIANNISLFHN 440
+ +I S GL + CL + S ++I G NI + H+
Sbjct: 378 KLPAENYIIADS---GLGVICLTMGSSSGMSIFGNFQ--QQNIVVLHD 420
>gi|414876414|tpg|DAA53545.1| TPA: hypothetical protein ZEAMMB73_483039 [Zea mays]
Length = 506
Score = 88.2 bits (217), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 94/343 (27%), Positives = 148/343 (43%), Gaps = 48/343 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+++ V +G PA + LDTGSD+ W+ C C C + ++ P+ S++
Sbjct: 166 YFSRVGIGSPARQLYMVLDTGSDVTWVQCQPCADCYQQSDP---------VFDPSLSASY 216
Query: 164 SKVPCNSTLCE--LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
+ V C+S C C +A C Y+V Y DG+ + G + L L
Sbjct: 217 AAVSCDSQRCRDLDTAACRNATGACLYEVAY-GDGSYTVGDFATETLTLGDSTPVGN--- 272
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGT 278
++ GCG G F+ A L G + S PS ++ ++FS C S
Sbjct: 273 --VAIGCGHDNEGLFVGAAGLLALGGGPL---SFPSQISA-----STFSYCLVDRDSPAA 322
Query: 279 GRISFGDKGSPGQGETPFSLR--QTHPTYNITITQVSVGGNAVNFEFSA----------- 325
+ FGD + T +R +T Y + ++ +SVGG ++ SA
Sbjct: 323 STLQFGDGAAEAGTVTAPLVRSPRTSTFYYVALSGISVGGQPLSIPASAFAMDATSGSGG 382
Query: 326 -IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVN 384
I DSGT+ T L AY + + F A TS L F+ CY LS ++T+ E P V+
Sbjct: 383 VIVDSGTAVTRLQSAAYAALRDAFVQGAPSLPRTSGVSL-FDTCYDLS-DRTSVEVPAVS 440
Query: 385 LTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD-NVNIIG 426
L +GGG + ++ + G YCL ++ V+IIG
Sbjct: 441 LRFEGGGALRLPAKNYLIPVDGAG--TYCLAFAPTNAAVSIIG 481
>gi|24430421|dbj|BAC22609.1| 41 kD chloroplast nucleoid DNA binding protein (CND41) [Nicotiana
sylvestris]
Length = 502
Score = 88.2 bits (217), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 93/344 (27%), Positives = 136/344 (39%), Gaps = 43/344 (12%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ NV +G P + DTGSDL W C CV + I+ P+ S T
Sbjct: 154 YIVNVGLGTPKKDLSLIFDTGSDLTWTQCQPCVKSCYAQQQP--------IFDPSASKTY 205
Query: 164 SKVPCNSTLCELQKQCPS-----AGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
S + C ST C K + SNC Y ++Y D + + GF +D L L ++
Sbjct: 206 SNISCTSTACSGLKSATGNSPGCSSSNCVYGIQY-GDSSFTVGFFAKDTLTLTQND---- 260
Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD-- 276
V FGCG+ G F A GL GLG D S+ A + FS C +
Sbjct: 261 -VFDGFMFGCGQNNRGLFGKTA---GLIGLGRDPLSIVQQTAQK--FGKYFSYCLPTSRG 314
Query: 277 GTGRISFGDKGSPGQGE--------TPFSLRQTHPTYNITITQVSVGGNAVNFE------ 322
G ++FG+ + TPF+ Q Y I + +SVGG A++
Sbjct: 315 SNGHLTFGNGNGVKTSKAVKNGITFTPFASSQGATFYFIDVLGISVGGKALSISPMLFQN 374
Query: 323 FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
I DSGT T L Y + TF K T+ + + CY LS N T+ P
Sbjct: 375 AGTIIDSGTVITRLPSTVYGSLKSTFKQFM-SKYPTAPALSLLDTCYDLS-NYTSISIPK 432
Query: 383 VNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
++ G + +++++ + L G D + I G
Sbjct: 433 ISFNFNGNANVDLEPNGILITNGASQVCLAFAGNGDDDTIGIFG 476
>gi|15219354|ref|NP_175079.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12320825|gb|AAG50556.1|AC074228_11 nucellin, putative [Arabidopsis thaliana]
gi|332193902|gb|AEE32023.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 405
Score = 88.2 bits (217), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 79/273 (28%), Positives = 113/273 (41%), Gaps = 30/273 (10%)
Query: 109 VSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
+ +G P +F +DTGSDL W+ CD C C N Y P + +
Sbjct: 53 MQIGSPPKAFQFDIDTGSDLTWVQCDAPCSGCTLPPNLQ---------YKPK----GNII 99
Query: 167 PCNSTLCEL-----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
PC++ +C + CP+ C Y+V+Y G+ S G LV D L +
Sbjct: 100 PCSNPICTALHWPNKPHCPNPQEQCDYEVKYADQGS-SMGALVTDQFPLKL--VNGSFMQ 156
Query: 222 SRISFGCGRVQT-GSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGR 280
++FGCG Q+ S A G+ GLG K + + L + GL N C S G G
Sbjct: 157 PPVAFGCGYDQSYPSAHPPPATAGVLGLGRGKIGLLTQLVSAGLTRNVVGHCLSSKGGGF 216
Query: 281 ISFGDKGSP--GQGETPFSLRQTHPTYNITITQVSVGGNAVNFE-FSAIFDSGTSFTYLN 337
+ FGD P G TP + H Y + G + IFD+G+S+TY N
Sbjct: 217 LFFGDNLVPSIGVAWTPLLSQDNH--YTTGPADLLFNGKPTGLKGLKLIFDTGSSYTYFN 274
Query: 338 DPAY-TQISETFNSLAKEKRETSTSDLPFEYCY 369
AY T I+ N L + + D C+
Sbjct: 275 SKAYQTIINLIGNDLKVSPLKVAKEDKTLPICW 307
>gi|224142013|ref|XP_002324355.1| predicted protein [Populus trichocarpa]
gi|222865789|gb|EEF02920.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 88.2 bits (217), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 95/341 (27%), Positives = 142/341 (41%), Gaps = 41/341 (12%)
Query: 41 PVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNS 100
PV G ++P F L R + F++R + G K T + +
Sbjct: 82 PVTGAPKTINVPSTAEFLLQDQL--RVKSFQVRLSMNPSSGVFKEMQTTIPAS----IVP 135
Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTS 160
G + V +G P F ++ DTGSDL W C+ C+ G + D P TS
Sbjct: 136 TGGAYVVTVGLGTPKKDFTLSFDTGSDLTWTQCE--PCLGGCFPQNQPKFD-----PTTS 188
Query: 161 STSSKVPCNSTLCEL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
++ V C+S C+L + C S + C Y ++Y S T+ GFL + L +A+
Sbjct: 189 TSYKNVSCSSEFCKLIAEGNYPAQDCIS--NTCLYGIQYGSGYTI--GFLATETLAIASS 244
Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
+ V FGC G+F GL GLG ++PS N+ N FS C
Sbjct: 245 D-----VFKNFLFGCSEESRGTF---NGTTGLLGLGRSPIALPSQTTNK--YKNLFSYCL 294
Query: 274 GS--DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS---AIFD 328
+ TG +SFG + S TP S + Y + +SV G + S I D
Sbjct: 295 PASPSSTGHLSFGVEVSQAAKSTPISPKLKQ-LYGLNTVGISVRGRELPINGSISRTIID 353
Query: 329 SGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
SGT+FT+L P Y+ + F + T+ + F+ CY
Sbjct: 354 SGTTFTFLPSPTYSALGSAFREMMANYTLTNGTS-SFQPCY 393
>gi|255563835|ref|XP_002522918.1| nucellin, putative [Ricinus communis]
gi|223537845|gb|EEF39461.1| nucellin, putative [Ricinus communis]
Length = 433
Score = 88.2 bits (217), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 87/280 (31%), Positives = 130/280 (46%), Gaps = 38/280 (13%)
Query: 106 YTNV--SVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
Y NV S+GQPA + + +DTGSDL WL CD C C+ + +Y P
Sbjct: 70 YYNVTLSIGQPAKPYFLDVDTGSDLTWLQCDAPCRQCIEAPHP---------LYRP---- 116
Query: 162 TSSKVPCNSTLCELQKQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
+++ V C LC Q P + C Y+V Y +DG S G LV+DV L +
Sbjct: 117 SNNLVICEDPLCA-SLQPPGVHNCQDPDQCDYEVEY-ADGGSSLGVLVKDVFVL--NFTN 172
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAP---NGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
K ++ ++ GCG Q L G + +G+ GLG +S+PS L++QGL+ N C
Sbjct: 173 GKRLNPLLALGCGYDQ----LPGRSNHPLDGILGLGRGISSIPSQLSSQGLVSNVIGHCL 228
Query: 274 GSDGTGRISFGDK--GSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE-FSAIFDSG 330
G G + FG+ S G TP S R Y+ ++ G + +FDSG
Sbjct: 229 SGRGGGFLFFGEDIYDSSGVTWTPMS-RDHLKHYSPGFAELIFDGKSTGIRNLLVVFDSG 287
Query: 331 TSFTYLNDPAYTQISETFN-SLAKEKRETSTSDLPFEYCY 369
+S+TYLN AY + + L+++ + D C+
Sbjct: 288 SSYTYLNAQAYQHLVFSLKRELSRKPISEALDDQTLPLCW 327
>gi|357137345|ref|XP_003570261.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 458
Score = 88.2 bits (217), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 95/310 (30%), Positives = 136/310 (43%), Gaps = 43/310 (13%)
Query: 98 LNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSP 157
L++L ++ VS+G PA++ V +DTGSD+ W+ C + +G + F+ P
Sbjct: 120 LDTLAYV--ITVSIGTPAMTQAVMIDTGSDVSWVHCHA-------RAGAGSSLFFD---P 167
Query: 158 NTSSTSSKVPCNSTLC---ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
SST + C+S C E + S S C Y VRY DG+ +TG D L L + E
Sbjct: 168 GKSSTYTPFSCSSAACTRLEGRDNGCSLNSTCQYTVRY-GDGSNTTGTYGSDTLALNSTE 226
Query: 215 KQSKSVDSRISFGCGRVQT-GSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNS-FSMC 272
K FGC G LD +GL GLG PS+++ S FS C
Sbjct: 227 KVEN-----FQFGCSETSDPGEGLDEDQTDGLMGLG---GGAPSLVSQTAATYGSAFSYC 278
Query: 273 F--GSDGTGRISFG-DKGSPGQGETPFSLRQTHPTYNITITQ-VSVGGNAVN-----FEF 323
+ +G ++ G G+ G TP + PT+ I Q ++VGG+ V F
Sbjct: 279 LPATTRSSGFLTLGASTGTSGFVTTPMFRSRRAPTFYFVILQGINVGGDPVAISPTVFAA 338
Query: 324 SAIFDSGTSFTYLNDPAYTQISETFNSLAK---EKRETSTSDLPFEYCYVLSPNQTNFEY 380
+I DSGT T L AY+ +S F + + R S D F++ Q N
Sbjct: 339 GSIMDSGTIITRLPPRAYSALSAAFRAGMRRYPRARAFSILDTCFDFT-----GQDNVSI 393
Query: 381 PVVNLTMKGG 390
P V L GG
Sbjct: 394 PAVELVFSGG 403
>gi|356528671|ref|XP_003532923.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 440
Score = 88.2 bits (217), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 88/322 (27%), Positives = 136/322 (42%), Gaps = 45/322 (13%)
Query: 69 YFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDL 128
+ R + R +Q +D++P T + ++ + F +G P + DTGSDL
Sbjct: 62 FARSKRRLRLSQNDDRSPGTITIPDEPITEYLMRFY------IGTPPVERFAIADTGSDL 115
Query: 129 FWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCEL----QKQCPSAG 183
W+ C C CV + ++ P SST VPC+S C L Q+ C
Sbjct: 116 IWVQCAPCEKCVPQ---------NAPLFDPRKSSTFKTVPCDSQPCTLLPPSQRACVGKS 166
Query: 184 SNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPN 243
C YQ Y D T+ +G L + ++ + K +++FGC + +
Sbjct: 167 GQCYYQYIY-GDHTLVSGILGFESINFGSKNNAIKF--PKLTFGCTFSNNDTVDESKRNM 223
Query: 244 GLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGTGRISFGD----KGSPGQGETPF 296
GL GLG+ S+ S L Q I FS CF S+ T ++ FG+ K G TP
Sbjct: 224 GLVGLGVGPLSLISQLGYQ--IGRKFSYCFPPLSSNSTSKMRFGNDAIVKQIKGVVSTPL 281
Query: 297 SLRQTHPT-YNITITQVSVGGNAVNFEFSA-----IFDSGTSFTYLNDPAYTQISETFNS 350
++ P+ Y + + VS+G V S + DSGTSFT L Y + F +
Sbjct: 282 IIKSIGPSYYYLNLEGVSIGNKKVKTSESQTDGNILIDSGTSFTILKQSFYNK----FVA 337
Query: 351 LAKEKRETSTSDLP---FEYCY 369
L KE +P + +C+
Sbjct: 338 LVKEVYGVEAVKIPPLVYNFCF 359
>gi|168059885|ref|XP_001781930.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666576|gb|EDQ53226.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 355
Score = 88.2 bits (217), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 94/344 (27%), Positives = 137/344 (39%), Gaps = 42/344 (12%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+ V +G P F V +DTGSDL W+ C + N S ++ PNTS++ +
Sbjct: 3 YLATVRLGTPERVFSVIVDTGSDLTWVQCSPCGTCYSQNDS--------LFIPNTSTSFT 54
Query: 165 KVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRI 224
K+ C + LC + C Y Y DG++STG V D + + Q + V
Sbjct: 55 KLACGTELCNGLPYPMCNQTTCVYWYSY-GDGSLSTGDFVYDTITMDGINGQKQQV-PNF 112
Query: 225 SFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-----DGTG 279
+FGCG GSF A +G+ GLG S PS L + FS C T
Sbjct: 113 AFGCGHDNEGSF---AGADGILGLGQGPLSFPSQLKT--VFNGKFSYCLVDWLAPPTQTS 167
Query: 280 RISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGGNAVNFEFSA---------- 325
+ FGD P + T+P Y + + +SVGG +N +A
Sbjct: 168 PLLFGDAAVPTFPGVKYISLLTNPKVPTYYYVKLNGISVGGKLLNISSTAFDIDSVGRAG 227
Query: 326 -IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVN 384
IFDSGT+ T L + ++ N+ + S + C P +
Sbjct: 228 TIFDSGTTVTQLAGEVHQEVLAAMNASTMDYPRKSDDSSGLDLCLGGFAEGQLPTVPSMT 287
Query: 385 LTMKGGGPFF--VNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
+GG N I + SS+ YC +V S +V IIG
Sbjct: 288 FHFEGGDMELPPSNYFIFLESSQS-----YCFSMVSSPDVTIIG 326
>gi|359494621|ref|XP_002265771.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 449
Score = 88.2 bits (217), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 76/302 (25%), Positives = 125/302 (41%), Gaps = 39/302 (12%)
Query: 110 SVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
VG P+ F++ DTGSDL W+ C C S + N + ++ ++ N SS+ +P
Sbjct: 88 KVGTPSQKFMLVADTGSDLTWMSCKYHCRS-RNCSNRKARRIRHKRVFHANLSSSFKTIP 146
Query: 168 CNSTLCELQ-------KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
C + +C+++ CP+ + C Y RY SDG+ + GF + + + E + +
Sbjct: 147 CLTDMCKIELMDLFSLTNCPTPLTPCGYDYRY-SDGSTALGFFANETVTVELKEGRKMKL 205
Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-----GS 275
+ + GC G A +G+ GLG K S A + FS C
Sbjct: 206 HN-VLIGCSESFQGQSFQAA--DGVMGLGYSKYSFAIKAAEK--FGGKFSYCLVDHLSHK 260
Query: 276 DGTGRISFGDKGSP-----GQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA----- 325
+ + ++FG S T L + Y + + +S+GG +
Sbjct: 261 NVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAMLKIPSEVWDVKG 320
Query: 326 ----IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYP 381
I DSG+S T+L +PAY + + R+ P EYC+ N T FE
Sbjct: 321 AGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLEYCF----NSTGFEES 376
Query: 382 VV 383
+V
Sbjct: 377 LV 378
>gi|359497446|ref|XP_003635520.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
vinifera]
Length = 354
Score = 87.8 bits (216), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 95/354 (26%), Positives = 154/354 (43%), Gaps = 46/354 (12%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+Y V +G PA + + +DTGS L WL C CV H V ++ P+ S T
Sbjct: 13 YYVKVGLGSPARYYSMIVDTGSSLSWLQCKPCVVYCH--------VQADPLFDPSASKTY 64
Query: 164 SKVPCNSTLCEL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
+ C S+ C C ++ + C Y Y D + S G+L +D+L LA +
Sbjct: 65 KSLSCTSSQCSSLVDATLNNPLCETSSNVCVYTASY-GDSSYSMGYLSQDLLTLAPSQTL 123
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD 276
V +GCG+ G F A G+ GLG +K S+ ++++ +FS C +
Sbjct: 124 PGFV-----YGCGQDSEGLFGRAA---GILGLGRNKLSMLGQVSSK--FGYAFSYCLPTR 173
Query: 277 GTGR-ISFGDKGSPGQGE--TPFSLRQTHPT-YNITITQVSVGGNAV-----NFEFSAIF 327
G G +S G G TP + +P+ Y + +T ++VGG A+ + I
Sbjct: 174 GGGGFLSIGKASLAGSAYKFTPMTTDPGNPSLYFLRLTAITVGGRALGVAAAQYRVPTII 233
Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFE-YPVVNLT 386
DSGT T L YT + F + K + + C+ N + + P V L
Sbjct: 234 DSGTVITRLPMSVYTPFQQAFVKIMSSKYARAPGFSILDTCF--KGNLKDMQSVPEVRLI 291
Query: 387 MKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG----REYPIANNIS 436
+GG + P+ ++ +G L CL ++ V IIG + + +A++IS
Sbjct: 292 FQGGADLNLR-PVNVLLQVDEG--LTCLAFAGNNGVAIIGNHQQQTFKVAHDIS 342
>gi|219362525|ref|NP_001136612.1| uncharacterized protein LOC100216735 [Zea mays]
gi|194696366|gb|ACF82267.1| unknown [Zea mays]
gi|413953802|gb|AFW86451.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
Length = 411
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 94/311 (30%), Positives = 136/311 (43%), Gaps = 44/311 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFN--IYSPNTSST 162
+ VS G PA+ +V +DTGSD+ WL C SSGQ +Y P+ SST
Sbjct: 79 YVVRVSFGTPAVPQVVVIDTGSDVSWLQCK--------PCSSGQCFPQKDPLYDPSHSST 130
Query: 163 SSKVPCNSTLCELQKQ------CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
S VPC S +C+ C ++G C + + Y +DGT + G +D L LA
Sbjct: 131 YSAVPCASDVCKKLAADAYGSGC-TSGKQCGFAISY-ADGTSTVGAYSQDKLTLAPG--- 185
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD 276
++ FGCG G +G+ GLG + S+ A G + FS C S
Sbjct: 186 --AIVQNFYFGCGH---GKHAVRGLFDGVLGLGRLRE---SLGARYGGV---FSYCLPSV 234
Query: 277 GT--GRISFGDKGSP-GQGETPFSLRQTHPTYN-ITITQVSVGGNAVNFEFSA-----IF 327
+ G ++ G +P G TP PT++ +T+ ++VGG ++ SA I
Sbjct: 235 SSKPGFLALGAGKNPSGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLRPSAFSGGMIV 294
Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTM 387
DSGT T L AY + F + R DL + CY L+ N P + LT
Sbjct: 295 DSGTVITGLQSTAYRALRSAFRKAMEAYRLLPNGDL--DTCYNLT-GYKNVVVPKIALTF 351
Query: 388 KGGGPFFVNDP 398
GG ++ P
Sbjct: 352 TGGATINLDVP 362
>gi|168008816|ref|XP_001757102.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162691600|gb|EDQ77961.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 406
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 93/306 (30%), Positives = 133/306 (43%), Gaps = 37/306 (12%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ +SVG P + +DTGSD+ WL C CV+C H ++ I+ P SST
Sbjct: 58 YFIRISVGTPPRRMYLVMDTGSDILWLQCAPCVNCYHQSDA---------IFDPYKSSTY 108
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
S + C++ C + C YQV Y DG+ +TG D + L + + V ++
Sbjct: 109 STLGCSTRQCLNLDIGTCQANKCLYQVDY-GDGSFTTGEFGTDDVSLNSTSGVGQVVLNK 167
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT----- 278
I GCG G F+ A L GLG S P+ + Q FS C T
Sbjct: 168 IPLGCGHDNEGYFVGAAG---LLGLGKGPLSFPNQVDPQN--GGRFSYCLTDRETDSTEG 222
Query: 279 GRISFGDKGSPGQGE--TPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA---------- 325
+ FG+ P G TP PT Y + +T +SVGG + SA
Sbjct: 223 SSLVFGEAAVPPAGARFTPQDSNMRVPTFYYLKMTGISVGGTILTIPTSAFQLDSLGNGG 282
Query: 326 -IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVN 384
I DSGTS T L + AY + + F + + T+ L F+ CY LS + + P V
Sbjct: 283 VIIDSGTSVTRLQNAAYASLRDAFRAGTSDLAPTAGFSL-FDTCYDLS-GLASVDVPTVT 340
Query: 385 LTMKGG 390
L +GG
Sbjct: 341 LHFQGG 346
>gi|297842525|ref|XP_002889144.1| hypothetical protein ARALYDRAFT_476912 [Arabidopsis lyrata subsp.
lyrata]
gi|297334985|gb|EFH65403.1| hypothetical protein ARALYDRAFT_476912 [Arabidopsis lyrata subsp.
lyrata]
Length = 467
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 78/277 (28%), Positives = 112/277 (40%), Gaps = 36/277 (12%)
Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPN 158
LG+ +Y +++G P F + +DTGSDL W+ CD C C Y PN
Sbjct: 65 LGY-YYVLLNIGNPPKLFDLDIDTGSDLTWVQCDAPCNGCTKPRAKQ---------YKPN 114
Query: 159 TSSTSSKVPCNSTLCE-----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
++ +PC+ LC + C C Y++ Y SD S G LV D L
Sbjct: 115 HNT----LPCSHLLCSGLDLTQNRPCDDPEDQCDYEIGY-SDHASSIGALVTDEFPLKL- 168
Query: 214 EKQSKSVDSRISFGCG-RVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC 272
++ ++FGCG Q G+ GLG K + + L + G+ N C
Sbjct: 169 -ANGSIMNPHLTFGCGYDQQNPGPHPPPPTAGILGLGRGKVGISTQLKSLGITKNVIVHC 227
Query: 273 FGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYN-------ITITQVSVGGNAVNFEFSA 325
G G +S GD+ P G T SL + N + + G +N
Sbjct: 228 LSHTGKGFLSIGDELVPSSGVTWTSLATNSASKNYMTGPAELLFNDKTTGVKGIN----V 283
Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSD 362
+FDSG+S+TY N AY I + K T T D
Sbjct: 284 VFDSGSSYTYFNAEAYQAILDLIRKDLNGKPLTDTKD 320
>gi|297736090|emb|CBI24128.3| unnamed protein product [Vitis vinifera]
Length = 378
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 76/301 (25%), Positives = 125/301 (41%), Gaps = 39/301 (12%)
Query: 111 VGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPC 168
VG P+ F++ DTGSDL W+ C C S + N + ++ ++ N SS+ +PC
Sbjct: 18 VGTPSQKFMLVADTGSDLTWMSCKYHCRS-RNCSNRKARRIRHKRVFHANLSSSFKTIPC 76
Query: 169 NSTLCELQ-------KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
+ +C+++ CP+ + C Y RY SDG+ + GF + + + E + +
Sbjct: 77 LTDMCKIELMDLFSLTNCPTPLTPCGYDYRY-SDGSTALGFFANETVTVELKEGRKMKLH 135
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-----GSD 276
+ + GC G A +G+ GLG K S A + FS C +
Sbjct: 136 N-VLIGCSESFQGQSFQAA--DGVMGLGYSKYSFAIKAAEK--FGGKFSYCLVDHLSHKN 190
Query: 277 GTGRISFGDKGSP-----GQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA------ 325
+ ++FG S T L + Y + + +S+GG +
Sbjct: 191 VSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAMLKIPSEVWDVKGA 250
Query: 326 ---IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
I DSG+S T+L +PAY + + R+ P EYC+ N T FE +
Sbjct: 251 GGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLEYCF----NSTGFEESL 306
Query: 383 V 383
V
Sbjct: 307 V 307
>gi|242056497|ref|XP_002457394.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
gi|241929369|gb|EES02514.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
Length = 509
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 94/343 (27%), Positives = 149/343 (43%), Gaps = 48/343 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+++ V +G PA + LDTGSD+ W+ C C C + ++ P+ S++
Sbjct: 169 YFSRVGIGSPARELYMVLDTGSDVTWVQCQPCADCYQQSDP---------VFDPSLSASY 219
Query: 164 SKVPCNSTLCE--LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
+ V C+S C C +A C Y+V Y DG+ + G + L L +
Sbjct: 220 AAVSCDSPRCRDLDTAACRNATGACLYEVAY-GDGSYTVGDFATETLTLGDSTPVTN--- 275
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGT 278
++ GCG G F+ A L G + S PS ++ ++FS C S
Sbjct: 276 --VAIGCGHDNEGLFVGAAGLLALGGGPL---SFPSQISA-----STFSYCLVDRDSPAA 325
Query: 279 GRISFGDKGSPGQGETPFSLR--QTHPTYNITITQVSVGGNAVNFEFSA----------- 325
+ FG G+ T +R +T Y + ++ +SVGG A++ SA
Sbjct: 326 STLQFGADGAEADTVTAPLVRSPRTGTFYYVALSGISVGGQALSIPSSAFAMDATSGSGG 385
Query: 326 -IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVN 384
I DSGT+ T L AY + + F TS L F+ CY LS ++T+ E P V+
Sbjct: 386 VIVDSGTAVTRLQSSAYAALRDAFVRGTPSLPRTSGVSL-FDTCYDLS-DRTSVEVPAVS 443
Query: 385 LTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD-NVNIIG 426
L +GGG + ++ + G YCL ++ V+IIG
Sbjct: 444 LRFEGGGALRLPAKNYLIPVDGAG--TYCLAFAPTNAAVSIIG 484
>gi|194696934|gb|ACF82551.1| unknown [Zea mays]
gi|413936470|gb|AFW71021.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
gi|413953801|gb|AFW86450.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
Length = 445
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 94/311 (30%), Positives = 136/311 (43%), Gaps = 44/311 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFN--IYSPNTSST 162
+ VS G PA+ +V +DTGSD+ WL C SSGQ +Y P+ SST
Sbjct: 113 YVVRVSFGTPAVPQVVVIDTGSDVSWLQCK--------PCSSGQCFPQKDPLYDPSHSST 164
Query: 163 SSKVPCNSTLCELQKQ------CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
S VPC S +C+ C ++G C + + Y +DGT + G +D L LA
Sbjct: 165 YSAVPCASDVCKKLAADAYGSGC-TSGKQCGFAISY-ADGTSTVGAYSQDKLTLAPG--- 219
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD 276
++ FGCG G +G+ GLG + S+ A G + FS C S
Sbjct: 220 --AIVQNFYFGCGH---GKHAVRGLFDGVLGLGRLRE---SLGARYGGV---FSYCLPSV 268
Query: 277 GT--GRISFGDKGSP-GQGETPFSLRQTHPTYN-ITITQVSVGGNAVNFEFSA-----IF 327
+ G ++ G +P G TP PT++ +T+ ++VGG ++ SA I
Sbjct: 269 SSKPGFLALGAGKNPSGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLRPSAFSGGMIV 328
Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTM 387
DSGT T L AY + F + R DL + CY L+ N P + LT
Sbjct: 329 DSGTVITGLQSTAYRALRSAFRKAMEAYRLLPNGDL--DTCYNLT-GYKNVVVPKIALTF 385
Query: 388 KGGGPFFVNDP 398
GG ++ P
Sbjct: 386 TGGATINLDVP 396
>gi|409179878|gb|AFV26024.1| aspartic proteinase nepenthesin 1 [Nepenthes mirabilis]
Length = 437
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 91/307 (29%), Positives = 130/307 (42%), Gaps = 46/307 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+ N+S+G PA F +DTGSDL W C C N S+ I++P SS+ S
Sbjct: 95 YLMNLSIGTPAQPFSAIMDTGSDLIW--TQCQPCTQCFNQST------PIFNPQGSSSFS 146
Query: 165 KVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRI 224
+PC+S LC+ + + ++C Y Y DG+ + G + + L S S+ I
Sbjct: 147 TLPCSSQLCQALQSPTCSNNSCQYTYGY-GDGSETQGSMGTETLTFG-----SVSIP-NI 199
Query: 225 SFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGTGRI 281
+FGCG G F G GL G+G S+PS L FS C GS + +
Sbjct: 200 TFGCGENNQG-FGQGNGA-GLVGMGRGPLSLPSQLD-----VTKFSYCMTPIGSSTSSTL 252
Query: 282 SFGD------KGSPGQGETPFSLRQTHPTYNITITQVSVGG------------NAVNFEF 323
G GSP T Q Y IT+ +SVG N+ N
Sbjct: 253 LLGSLANSVTAGSP--NTTLIESSQIPTFYYITLNGLSVGSTPLPIDPSVFKLNSNNGTG 310
Query: 324 SAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
I DSGT+ TY D AY + + F S +S F+ C+ + +Q+N + P
Sbjct: 311 GIIIDSGTTLTYFADNAYQAVRQAFISQMNLSVVNGSSS-GFDLCFQMPSDQSNLQIPTF 369
Query: 384 NLTMKGG 390
+ GG
Sbjct: 370 VMHFDGG 376
>gi|255567949|ref|XP_002524952.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223535787|gb|EEF37449.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 394
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 70/256 (27%), Positives = 115/256 (44%), Gaps = 37/256 (14%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
T + +G P +F + +DTGS + ++PC C C + + P SST
Sbjct: 92 TRIWIGTPPQTFALIVDTGSTVTYVPCSTCEQCGRHQDPK---------FEPELSSTYQP 142
Query: 166 VPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
V CN + C + C Y+ +Y ++ + S+G L ED++ QS+ V R
Sbjct: 143 VSCN-----IDCTCDNERKQCVYERQY-AEMSSSSGVLGEDIISFG---NQSELVPQRAI 193
Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISF 283
FGC +TG A +G+ GLG S+ L +G+I +SFS+C+G G G +
Sbjct: 194 FGCENQETGDLYSQRA-DGIMGLGRGDLSIVDQLVEKGVISDSFSLCYGGMDIGGGAMIL 252
Query: 284 GDKGSPGQGETPFSLRQTHPT----YNITITQVSVGGNAVNFEFS-------AIFDSGTS 332
G P + ++ P YNI + + V G ++ + S + DSGT+
Sbjct: 253 GGISPP----SGMVFAESDPVRSQYYNIDLKAIHVAGKQLHLDPSIFDGKHGTVLDSGTT 308
Query: 333 FTYLNDPAYTQISETF 348
+ YL + A+T +
Sbjct: 309 YAYLPEAAFTAFKDAM 324
>gi|165292434|dbj|BAF98915.1| aspartic proteinase nepenthesin I [Nepenthes alata]
Length = 437
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 90/305 (29%), Positives = 131/305 (42%), Gaps = 42/305 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+ N+S+G PA F +DTGSDL W C C N S+ I++P SS+ S
Sbjct: 95 YLMNLSIGTPAQPFSAIMDTGSDLIW--TQCQPCTQCFNQST------PIFNPQGSSSFS 146
Query: 165 KVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRI 224
+PC+S LC+ + + ++C Y Y DG+ + G + + L S S+ I
Sbjct: 147 TLPCSSQLCQALQSPTCSNNSCQYTYGY-GDGSETQGSMGTETLTFG-----SVSIP-NI 199
Query: 225 SFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGTGRI 281
+FGCG G F G GL G+G S+PS L FS C GS + +
Sbjct: 200 TFGCGENNQG-FGQGNGA-GLVGMGRGPLSLPSQLD-----VTKFSYCMTPIGSSNSSTL 252
Query: 282 ---SFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGG------------NAVNFEFSA 325
S + + G T PT Y IT+ +SVG N+ N
Sbjct: 253 LLGSLANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVGSTPLPIDPSVFKLNSNNGTGGI 312
Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNL 385
I DSGT+ TY D AY + + F S +S F+ C+ + +Q+N + P +
Sbjct: 313 IIDSGTTLTYFVDNAYQAVRQAFISQMNLSVVNGSSS-GFDLCFQMPSDQSNLQIPTFVM 371
Query: 386 TMKGG 390
GG
Sbjct: 372 HFDGG 376
>gi|224142011|ref|XP_002324354.1| predicted protein [Populus trichocarpa]
gi|222865788|gb|EEF02919.1| predicted protein [Populus trichocarpa]
Length = 471
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 101/376 (26%), Positives = 156/376 (41%), Gaps = 31/376 (8%)
Query: 56 SFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPA 115
+F + + RD+ R++ N T F+ G + V +G P
Sbjct: 84 TFPSAAEILRRDQ-LRVKSIRAKHSMNSSTTGVFNEMKTRVPTTHFGGGYAVTVGLGTPK 142
Query: 116 LSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCEL 175
F + DTGSDL W C+ C G + + D + + + S PC S E
Sbjct: 143 KDFSLLFDTGSDLTWTQCE--PCSGGCFPQNDEKFDPTKSTSYKNLSCSSEPCKSIGKES 200
Query: 176 QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGS 235
+ C S+ S C Y V+Y + T+ GFL + L + + V GCG G
Sbjct: 201 AQGCSSSNS-CLYGVKYGTGYTV--GFLATETLTITPSD-----VFENFVIGCGERNGGR 252
Query: 236 FLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--GSDGTGRISFGDKGSPGQGE 293
F A GL GLG ++PS ++ N FS C S TG +SFG S
Sbjct: 253 FSGTA---GLLGLGRSPVALPSQTSST--YKNLFSYCLPASSSSTGHLSFGGGVSQAAKF 307
Query: 294 TPFSLRQTHPTYNITITQVSVGGNAVNFEFSA------IFDSGTSFTYLNDPAYTQISET 347
TP + + Y + ++ +SVGG + + S I DSGT+ TYL A++ +S
Sbjct: 308 TPIT-SKIPELYGLDVSGISVGGRKLPIDPSVFRTAGTIIDSGTTLTYLPSTAHSALSSA 366
Query: 348 FNSLAKEKRETS-TSDLPFEYCYVLSPNQT-NFEYPVVNLTMKGGGPFFVNDPIVIVSSE 405
F + T TS L + CY S + N P +++ +GG ++D + +++
Sbjct: 367 FQEMMTNYTLTKGTSGL--QPCYDFSKHANDNITIPQISIFFEGGVEVDIDDSGIFIAAN 424
Query: 406 PKGLYLYCLGVVKSDN 421
GL CL + N
Sbjct: 425 --GLEEVCLAFKDNGN 438
>gi|356515904|ref|XP_003526637.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 421
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 86/288 (29%), Positives = 128/288 (44%), Gaps = 43/288 (14%)
Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPN 158
LG+ + ++++G P + + +DTGSDL W+ CD C C N +Y PN
Sbjct: 61 LGY-YTVSLAIGNPPKVYDLDIDTGSDLTWVQCDAPCQGCTIPRN---------RLYKPN 110
Query: 159 TSSTSSKVPCNSTLCELQKQCPS---AGSN--CPYQVRYLSDGTMSTGFLVEDVLHLA-T 212
+ V C LC+ + P+ AG N C Y+V Y G+ S G L+ D + L T
Sbjct: 111 ----GNLVKCGDPLCKAIQSAPNHHCAGPNEQCDYEVEYADQGS-SLGVLLRDNIPLKFT 165
Query: 213 DEKQSKSVDSRISFGCGRVQTGSFLD-GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSM 271
+ ++ + ++FGCG Q + A+ G+ GLG KTS+ S L + GLI N
Sbjct: 166 NGSLARPI---LAFGCGYDQKHVGHNPSASTAGVLGLGNGKTSILSQLHSLGLIRNVVGH 222
Query: 272 CFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITI---------TQVSVGGNAVNFE 322
C G G + FGD+ P G L Q+ T + SV G
Sbjct: 223 CLSERGGGFLFFGDQLVPQSGVVWTPLLQSSSTQHYKTGPADLFFDRKPTSVKG------ 276
Query: 323 FSAIFDSGTSFTYLNDPAYTQ-ISETFNSLAKEKRETSTSDLPFEYCY 369
IFDSG+S+TY N A+ ++ N L + +T D C+
Sbjct: 277 LQLIFDSGSSYTYFNSKAHKALVNLVTNDLRGKPLSRATEDSSLPICW 324
>gi|359492489|ref|XP_002285867.2| PREDICTED: aspartic proteinase Asp1-like [Vitis vinifera]
Length = 453
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 82/272 (30%), Positives = 121/272 (44%), Gaps = 32/272 (11%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
++ +G P + + +D+GSDL WL CD CVSC + Y PN
Sbjct: 70 VSLRIGNPPKPYTLDIDSGSDLTWLQCDAPCVSCTKAPHPP---------YKPN----KG 116
Query: 165 KVPCNSTLCEL-----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
+ CN +C + C ++ C Y+V Y G+ S G LV D+ L +
Sbjct: 117 PITCNDPMCSALHWPSKPPCKASHEQCDYEVSYADHGS-SLGVLVHDIFSLQLTNGTLAA 175
Query: 220 VDSRISFGCGRVQTGSFLDGAAP---NGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD 276
R++FGCG Q S+ AP +G+ GLG K+S+ + L + GLI + C
Sbjct: 176 --PRLAFGCGYDQ--SYPGPNAPPFVDGVLGLGYGKSSIVTQLRSLGLIRSIVGHCLSGR 231
Query: 277 GTGRISFGDKGS--PGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE-FSAIFDSGTSF 333
G G + GD S PG TP S + Y + + G + +FDSG+S+
Sbjct: 232 GGGFLFLGDGLSTTPGIIWTPMSRKSGESAYALGPADLLFNGQNSGVKGLRLVFDSGSSY 291
Query: 334 TYLNDPAY-TQISETFNSLAKEKRETSTSDLP 364
TY N AY T +S L + +ET+ LP
Sbjct: 292 TYFNAQAYKTTLSLVRKYLNGKLKETADESLP 323
>gi|348690234|gb|EGZ30048.1| pepsin-like aspartic protease A1 [Phytophthora sojae]
Length = 654
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 103/386 (26%), Positives = 156/386 (40%), Gaps = 58/386 (15%)
Query: 65 HRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDT 124
H DRY R + + PL G HYT V G P V DT
Sbjct: 38 HPDRYARRLN--IEEDAPEIVPLHLGLGT-----------HYTWVYAGTPPQRASVIADT 84
Query: 125 GSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQ-KQCPSAG 183
GS L PC S G S + Q + + SST V C+ Q K+C
Sbjct: 85 GSGLMAFPC---SGCDGCGSHTDQP-----FQADNSSTLIHVTCSQQQSHFQCKECTEKS 136
Query: 184 SNCPYQVRYLSDGTMSTGFLVEDVLHLA-----TDEKQSKSVDSRISFGCGRVQTGSFLD 238
C Y+ +G+ +VEDV++L DE + FGC +TG F+
Sbjct: 137 DTCAISQSYM-EGSSWKASVVEDVVYLGGESSFHDEAMRDRYGTHFQFGCQSSETGLFVT 195
Query: 239 GAAPNGLFGLGMDKTSVPSILANQGLIP-NSFSMCFGSDGTGRISFGDKGSPG-QGETPF 296
A +G+ GL T + + L + IP N FS+CF +G G +S G+ + +GE +
Sbjct: 196 QVA-DGIMGLSNSDTHIVAKLHRENKIPSNLFSLCFTENG-GTMSVGEPNTKAHRGEISY 253
Query: 297 SL----RQTHPTYNITITQVSVGGNAVNFEFSA------IFDSGTSFTYLNDPAYTQISE 346
+ R YN+ + + +GG ++N + A I DSGT+ +YL + +
Sbjct: 254 AKVIKDRSAGHFYNVNMKDIRIGGKSINAKEEAYTRGHYIVDSGTTDSYLPRAMKNEFLQ 313
Query: 347 TFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEP 406
F +A + TS C+ + N+ P + L M+ G + VI+ P
Sbjct: 314 VFKEVAGRDYQVGTS------CHGYT-NEDLASLPKIQLVMEAYGD---ENGEVIIDIPP 363
Query: 407 KGLYL-----YCLGVVKSDNV-NIIG 426
+ L YC + S+N +IG
Sbjct: 364 EQYLLHNDNSYCGSIYLSENAGGVIG 389
>gi|413953792|gb|AFW86441.1| hypothetical protein ZEAMMB73_342504 [Zea mays]
Length = 459
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 98/362 (27%), Positives = 152/362 (41%), Gaps = 57/362 (15%)
Query: 62 ALAHRDRYFRLRGRGL---AAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSF 118
+L+ R R R R + + A++ N P D+ + V +G PA+S
Sbjct: 81 SLSERLRRSRARSKYIMSRASKSNVSIPTHLGGSVDSLE-------YVVTVGLGTPAVSQ 133
Query: 119 IVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC-ELQK 177
++ +DTGSDL W+ C C NS++ ++ P+ SST + +PCN+ C +L +
Sbjct: 134 VLLIDTGSDLSWV--QCAPC----NSTTCYPQKDPLFDPSRSSTYAPIPCNTDACRDLTR 187
Query: 178 -----QCPS---AGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCG 229
C S G+ C Y + Y DG+ +TG + L +A FGCG
Sbjct: 188 DGYGSDCTSGSGGGAQCGYAITY-GDGSQTTGVYSNETLTMAPGVTVKD-----FHFGCG 241
Query: 230 RVQTGSFLDGAAPN----GLFGLGMDKTSVPSILANQGLIPNSFSMCF--GSDGTGRISF 283
Q G PN GL GLG S+ ++ + +FS C +D G ++
Sbjct: 242 HDQDG-------PNDKYDGLLGLGGAPESL--VVQTSSVYGGAFSYCLPAANDQAGFLAL 292
Query: 284 GDKGSPGQG--ETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA-----IFDSGTSFTYL 336
G + G TP +R+ Y + +T ++VGG ++ SA I DSGT T L
Sbjct: 293 GAPVNDASGFVFTPM-VREQQTFYVVNMTGITVGGEPIDVPPSAFSGGMIIDSGTVVTEL 351
Query: 337 NDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVN 396
AY + F +L + CY + +N P V LT GG ++
Sbjct: 352 QHTAYAALQAAFRKAMAAYPLLPNGEL--DTCYNFT-GHSNVTVPRVALTFSGGATVDLD 408
Query: 397 DP 398
P
Sbjct: 409 VP 410
>gi|158513711|sp|A2ZC67.2|ASP1_ORYSI RecName: Full=Aspartic proteinase Asp1; Short=OSAP1; Short=OsAsp1;
AltName: Full=Nucellin-like protein; Flags: Precursor
Length = 410
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 75/278 (26%), Positives = 121/278 (43%), Gaps = 28/278 (10%)
Query: 106 YTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNT--SS 161
+ +++G PA + + +DTGS L WL CD C++C + +Y P +
Sbjct: 39 FVTMNIGDPAKPYFLDIDTGSTLTWLQCDYPCINC---------NKVPHGLYKPELKYAV 89
Query: 162 TSSKVPCNSTLCELQKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
++ C +L+K N C Y ++Y+ G S G L+ D L +
Sbjct: 90 KCTEQRCADLYADLRKPMKCGPKNQCHYGIQYV--GGSSIGVLIVDSFSLPASNGTN--- 144
Query: 221 DSRISFGCGRVQTGSFLDGAAP-NGLFGLGMDKTSVPSILANQGLI-PNSFSMCFGSDGT 278
+ I+FGCG Q + + P NG+ GLG K ++ S L +QG+I + C S G
Sbjct: 145 PTSIAFGCGYNQGKNNHNVPTPVNGILGLGRGKVTLLSQLKSQGVITKHVLGHCISSKGK 204
Query: 279 GRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE---FSAIFDSGTSFTY 335
G + FGD P G T + + H Y+ + N+ IFDSG ++TY
Sbjct: 205 GFLFFGDAKVPTSGVTWSPMNREHKHYSPRQGTLQFNSNSKPISAAPMEVIFDSGATYTY 264
Query: 336 LN-DPAYTQISETFNSLAKEKR---ETSTSDLPFEYCY 369
P + +S ++L+KE + E D C+
Sbjct: 265 FALQPYHATLSVVKSTLSKECKFLTEVKEKDRALTVCW 302
>gi|224144963|ref|XP_002325476.1| predicted protein [Populus trichocarpa]
gi|222862351|gb|EEE99857.1| predicted protein [Populus trichocarpa]
Length = 372
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 70/269 (26%), Positives = 115/269 (42%), Gaps = 53/269 (19%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L++ + +G P+ + V +DTGSD+ W+ C C C + S I +Y P +S +
Sbjct: 26 LYFAKIGLGNPSKDYYVQVDTGSDILWVNCIGCDKC----PTKSDLGIKLTLYDPASSVS 81
Query: 163 SSKVPCNSTLC---------ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL--A 211
+++V C+ C + +K+ P C Y V Y DG+ + G+ V D +
Sbjct: 82 ATRVSCDDDFCTSTYNGLLPDCKKELP-----CQYNVVY-GDGSSTAGYFVSDAVQFERV 135
Query: 212 TDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSM 271
T Q+ + ++FGCG Q+G GLG ++ IL +F+
Sbjct: 136 TGNLQTGLSNGTVTFGCGAQQSG------------GLGTSGEALDGILG-------AFAH 176
Query: 272 CFGS-DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF--------- 321
C + +G G + G+ SP TP Q H YN+ + ++ VGG +
Sbjct: 177 CLDNVNGGGIFAIGELVSPKVNTTPMVPNQAH--YNVYMKEIEVGGTVLELPTDVFDSGD 234
Query: 322 EFSAIFDSGTSFTYLNDPAYTQISETFNS 350
I DSGT+ YL + Y + S
Sbjct: 235 RRGTIIDSGTTLAYLPEVVYDSMMNEIRS 263
>gi|255548662|ref|XP_002515387.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545331|gb|EEF46836.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 463
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 104/357 (29%), Positives = 158/357 (44%), Gaps = 53/357 (14%)
Query: 100 SLGFLHY-TNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPN 158
SLG +Y ++ +G P ++ DTGSDL W C S+ + D P
Sbjct: 128 SLGTGNYIVSIGLGSPKKDLMLIFDTGSDLTWARC-----------SAAETFD-----PT 171
Query: 159 TSSTSSKVPCNSTLCELQKQC---PS--AGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
S++ + V C++ LC PS A S C Y ++Y DG+ S GFL ++ L + +
Sbjct: 172 KSTSYANVSCSTPLCSSVISATGNPSRCAASTCVYGIQY-GDGSYSIGFLGKERLTIGST 230
Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
+ + + FGCG+ G F A GL GLG DK SV S A + FS C
Sbjct: 231 D-----IFNNFYFGCGQDVDGLFGKAA---GLLGLGRDKLSVVSQTAPK--YNQLFSYCL 280
Query: 274 -GSDGTGRISFGDKGSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEFSA----- 325
S TG +SFG S TP S + P+ YN+ +T ++VGG + S
Sbjct: 281 PSSSSTGFLSFGSSQSKSAKFTPLS---SGPSSFYNLDLTGITVGGQKLAIPLSVFSTAG 337
Query: 326 -IFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
I DSGT T L AY+ + F ++A S L + CY S +T + P +
Sbjct: 338 TIIDSGTVVTRLPPAAYSALRSAFRKAMASYPMGKPLSIL--DTCYDFSKYKT-IKVPKI 394
Query: 384 NLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG----REYPIANNIS 436
++ GG V+ + V++ K + L G + + I G R + + ++S
Sbjct: 395 VISFSGGVDVDVDQAGIFVANGLKQVCLAFAGNTGARDTAIFGNTQQRNFEVVYDVS 451
>gi|56692305|dbj|BAD80835.1| nucellin-like protein [Daucus carota]
Length = 426
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 77/258 (29%), Positives = 114/258 (44%), Gaps = 30/258 (11%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
++ ++GQP + + DTGSDL WL CD C+ C + +Y P
Sbjct: 67 YHVQFNIGQPPKPYFLDPDTGSDLTWLQCDAPCIQCTPAPHP---------LYQPTNDLV 117
Query: 163 SSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
K P ++L +C C Y+V Y +DG S G LV D+ +
Sbjct: 118 VCKDPICASLHPDNYRCDDP-DQCDYEVEY-ADGGSSIGVLVNDLF--PVNLTSGMRARP 173
Query: 223 RISFGCGRVQTGSFLDGAA---PNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTG 279
R++ GCG Q L G A +G+ GLG +S+ + L++QGL+ N CF G G
Sbjct: 174 RLTIGCGYDQ----LPGIAYHPLDGVLGLGRGSSSIVAQLSSQGLVRNVVGHCFSRRGGG 229
Query: 280 RISFGDK--GSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE-FSAIFDSGTSFTYL 336
+ FGD S TP S R Y ++ + G + + +FDSG+S+TY
Sbjct: 230 YLFFGDDIYDSSKVIWTPMS-RDYLKHYTPGFAELILNGRSSGLKNLLVVFDSGSSYTYF 288
Query: 337 NDPAYTQISETFNSLAKE 354
N TQ +T S K+
Sbjct: 289 N----TQTYQTLLSFIKK 302
>gi|242041575|ref|XP_002468182.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
gi|241922036|gb|EER95180.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
Length = 490
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 97/351 (27%), Positives = 143/351 (40%), Gaps = 51/351 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ ++VG P + ++ALDT SDL WL C C C SG V D P S++
Sbjct: 138 YIAKIAVGTPGVEALLALDTASDLTWLQCQPCRRCY----PQSGPVFD-----PRHSTSY 188
Query: 164 SKVPCNSTLCELQKQCPSAGSN---CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
++ N+ C+ + + C Y V Y DG+ + G +E+ L A +
Sbjct: 189 REMSFNAADCQALGRSGGGDAKRGTCVYTVGY-GDGSTTVGDFIEETLTFAGGVRL---- 243
Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS--DGT 278
RIS GCG G F GA G+ GLG S P+ + + G +FS C G
Sbjct: 244 -PRISIGCGHDNKGLF--GAPAAGILGLGRGLMSFPNQIDHNG----TFSYCLVDFLSGP 296
Query: 279 GRIS----FGDKG---SPGQGETPFSLRQTHPT-YNITITQVSVGGNAV----------- 319
G +S FG SP TP L PT Y + +T +SVGG V
Sbjct: 297 GSLSSTLTFGAGAVDTSPPVSFTPTVLNLNMPTFYYVRLTGISVGGVRVPGVTERDLQLD 356
Query: 320 --NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP--FEYCYVLSPNQ 375
I DSGT+ T L PAYT + F ++A + + S F+ CY +
Sbjct: 357 PYTGRGGVIVDSGTAVTRLARPAYTAFRDAFRAVAVDLGQVSIGGPSGFFDTCYTVGGRG 416
Query: 376 TNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
+ P V++ G + ++ + G + +V+IIG
Sbjct: 417 MK-KVPTVSMHFAGSVEVKLQPKNYLIPVDSMGTVCFAFAATGDHSVSIIG 466
>gi|302141796|emb|CBI18999.3| unnamed protein product [Vitis vinifera]
Length = 390
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 81/268 (30%), Positives = 121/268 (45%), Gaps = 24/268 (8%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
++ +G P + + +D+GSDL WL CD CVSC + Y PN +
Sbjct: 37 VSLRIGNPPKPYTLDIDSGSDLTWLQCDAPCVSCTKAPHPP---------YKPNKGPITC 87
Query: 165 KVP-CNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
P C++ + C ++ C Y+V Y G+ S G LV D+ L + R
Sbjct: 88 NDPMCSALHWPSKPPCKASHEQCDYEVSYADHGS-SLGVLVHDIFSLQLTNGTLAA--PR 144
Query: 224 ISFGCGRVQTGSFLDGAAP---NGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGR 280
++FGCG Q S+ AP +G+ GLG K+S+ + L + GLI + C G G
Sbjct: 145 LAFGCGYDQ--SYPGPNAPPFVDGVLGLGYGKSSIVTQLRSLGLIRSIVGHCLSGRGGGF 202
Query: 281 ISFGDKGS--PGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE-FSAIFDSGTSFTYLN 337
+ GD S PG TP S + Y + + G + +FDSG+S+TY N
Sbjct: 203 LFLGDGLSTTPGIIWTPMSRKSGESAYALGPADLLFNGQNSGVKGLRLVFDSGSSYTYFN 262
Query: 338 DPAY-TQISETFNSLAKEKRETSTSDLP 364
AY T +S L + +ET+ LP
Sbjct: 263 AQAYKTTLSLVRKYLNGKLKETADESLP 290
>gi|449515031|ref|XP_004164553.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 430
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 82/265 (30%), Positives = 117/265 (44%), Gaps = 35/265 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ +VS+G P + ++ DTGSDL W C C+ C L I++P S++
Sbjct: 92 YLMSVSIGTPPVDYLGIADTGSDLTWAQCLPCLKCYQQLRP---------IFNPLKSTSF 142
Query: 164 SKVPCNSTLCEL--QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
S VPCN+ C C G C Y Y D T S G L + + + S SV
Sbjct: 143 SHVPCNTQTCHAVDDGHCGVQGV-CDYSYTY-GDRTYSKGDLGFEKITIG-----SSSVK 195
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG---SDGT 278
S I GCG +G F +G+ GLG + S+ S ++ I FS C S
Sbjct: 196 SVI--GCGHASSGGF---GFASGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHAN 250
Query: 279 GRISFGDKG---SPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS----AIFDSGT 331
G+I+FG+ PG TP + T Y IT+ +S+ GN + F+ I DSGT
Sbjct: 251 GKINFGENAVVSGPGVVSTPLISKNTVTYYYITLEAISI-GNERHMAFAKQGNVIIDSGT 309
Query: 332 SFTYLNDPAYTQISETFNSLAKEKR 356
+ T L Y + + + K KR
Sbjct: 310 TLTILPKELYDGVVSSLLKVVKAKR 334
>gi|357118064|ref|XP_003560779.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 472
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 109/389 (28%), Positives = 159/389 (40%), Gaps = 65/389 (16%)
Query: 43 KGILAVDDLPKKGSFA--YYSALAHRDRYFR-LRGRGLAAQGNDKTPLTFSAGNDTYRLN 99
KG A D KK SFA S A D R GR + ++G + T+ G ++
Sbjct: 68 KGSSATDK--KKPSFAERLRSDRARADHILRKASGRRMMSEGGGASIPTYLGG----FVD 121
Query: 100 SLGFLHYTNVSVGQPALSFIVALDTGSDLFWL---PCDCVSCVHGLNSSSGQVIDFNIYS 156
SL ++ + +G PA+ V +DTGSDL W+ PC+ C + ++
Sbjct: 122 SLEYV--VTLGIGTPAVQQTVLIDTGSDLSWVQCKPCNASDCYPQKDP---------LFD 170
Query: 157 PNTSSTSSKVPCNSTLCELQKQCPSAG-------------SNCPYQVRYLSDGTMSTGFL 203
P+ SST + +PC S C KQ P G C Y + Y +G ++ G
Sbjct: 171 PSKSSTFATIPCASDAC---KQLPVDGYDNGCTNNTSGMPPQCGYAIEY-GNGAITEGVY 226
Query: 204 VEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQG 263
+ L L S +V FGCG Q G + +GL GLG S+ S A+
Sbjct: 227 STETLALG-----SSAVVKSFRFGCGSDQHGPYDKF---DGLLGLGGAPESLVSQTAS-- 276
Query: 264 LIPNSFSMCFG--SDGTGRISFGDKGSPGQGETPFSLRQTHP-------TYNITITQVSV 314
+ +FS C + G G ++ G S + F H Y +T+T +SV
Sbjct: 277 VYGGAFSYCLPPLNSGAGFLTLGAPNSTNNSNSGFVFTPMHAFSPKIATFYVVTLTGISV 336
Query: 315 GGNAVN-----FEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
GG A++ F I DSGT T + AY + F S E +D + CY
Sbjct: 337 GGKALDIPPAVFAKGNIVDSGTVITGIPTTAYKALRTAFRSAMAEYPLLPPADSALDTCY 396
Query: 370 VLSPNQTNFEYPVVNLTMKGGGPFFVNDP 398
+ + T P V LT GG ++ P
Sbjct: 397 NFTGHGT-VTVPKVALTFVGGATVDLDVP 424
>gi|357469587|ref|XP_003605078.1| Aspartic proteinase Asp1 [Medicago truncatula]
gi|355506133|gb|AES87275.1| Aspartic proteinase Asp1 [Medicago truncatula]
Length = 418
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 79/282 (28%), Positives = 118/282 (41%), Gaps = 36/282 (12%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD-----CVSCVHGLNSSSGQVIDFNIYSPN 158
++ ++++G P + + +DTGSDL W+ CD C C + +Y PN
Sbjct: 61 IYTVSINIGNPPNPYELDIDTGSDLTWVQCDGPDAPCKGCT---------LPKDKLYKPN 111
Query: 159 TSSTSSKVPCNSTLCE--------LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL 210
+ V C+ +C ++C C Y+V Y +D STG L D +H+
Sbjct: 112 GNQL---VKCSDPICAAVQPPFSTFGQKCAKPIPPCVYKVEY-ADNAESTGALARDYMHI 167
Query: 211 ATDEKQSKSVDSRISFGCGRVQT-GSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSF 269
+ S S + FGCG Q + G+ GLG K S+ S L + G I N
Sbjct: 168 GS---PSGSNVPLVVFGCGYEQKFSGPTPPPSTPGVLGLGNGKISILSQLHSMGFIHNVL 224
Query: 270 SMCFGSDGTGRISFGDKGSPGQG--ETPFSLRQTHPTYNITITQVSVGGNAVNFE-FSAI 326
C ++G G + GDK P G TP Y+ + G + I
Sbjct: 225 GHCLSAEGGGYLFLGDKFIPSSGIFWTPIIQSSLEKHYSTGPVDLFFNGKPTPAKGLQII 284
Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEK---RETSTSDLPF 365
FDSG+S+TY + YT ++ N+ K K RET LP
Sbjct: 285 FDSGSSYTYFSPRVYTIVANMVNNDLKGKPLRRETKDPSLPI 326
>gi|242044886|ref|XP_002460314.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
gi|241923691|gb|EER96835.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
Length = 444
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 99/351 (28%), Positives = 144/351 (41%), Gaps = 55/351 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ + +G PA + LDTGSDL W C C+ CV Q + + P SST
Sbjct: 92 YLMEMGIGTPARFYSAILDTGSDLIWTQCAPCLLCVD-------QPTPY--FDPANSSTY 142
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
+ C++ C C YQ Y D + G L + T++ ++ R
Sbjct: 143 RSLGCSAPACNALYYPLCYQKTCVYQYFY-GDSASTAGVLANETFTFGTND--TRVTLPR 199
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDGTGR 280
ISFGCG + GS +G+ G+ G G S+ S L + FS C F S R
Sbjct: 200 ISFGCGNLNAGSLANGS---GMVGFGRGSLSLVSQLGSP-----RFSYCLTSFLSPVRSR 251
Query: 281 ISFG------DKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA-------- 325
+ FG + TPF + PT Y + +T +SVGGN + + +
Sbjct: 252 LYFGAYATLNSTNASTVQSTPFIINPALPTMYFLNMTGISVGGNRLPIDPAVLAINDTDG 311
Query: 326 ----IFDSGTSFTYLNDPAYTQISETF----NSLAK--EKRETSTSDLPFEYCYVLSPNQ 375
I DSGT+ TYL +PAY + E F NS + ETS D F++ P +
Sbjct: 312 TGGTIIDSGTTITYLAEPAYYAVREAFVLYLNSTLPLLDVTETSVLDTCFQWP---PPPR 368
Query: 376 TNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
+ P + L G ++V GL CL + S + +IIG
Sbjct: 369 QSVTLPQLVLHFDGADWELPLQNYMLVDPSTGGL---CLAMATSSDGSIIG 416
>gi|224143371|ref|XP_002324933.1| predicted protein [Populus trichocarpa]
gi|222866367|gb|EEF03498.1| predicted protein [Populus trichocarpa]
Length = 463
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 92/335 (27%), Positives = 141/335 (42%), Gaps = 42/335 (12%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ NV +G P + DTGS L W C C +C + ++ P S++
Sbjct: 132 YIVNVGIGTPKKEMPLIFDTGSGLIWTQCKPCKACYPKV----------PVFDPTKSASF 181
Query: 164 SKVPCNSTLCELQKQ-CPSAGSNCPYQVRYLSDGTMSTGFLVEDVL---HLATDEKQSKS 219
+PC+S LC+ +Q C S C Y Y+ D + STG L + + HL D K
Sbjct: 182 KGLPCSSKLCQSIRQGCSSP--KCTYLTAYV-DNSSSTGTLATETISFSHLKYDFKN--- 235
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD--G 277
I GC +G L +G+ GL S+ S AN + FS C S
Sbjct: 236 ----ILIGCSDQVSGESL---GESGIMGLNRSPISLASQTAN--IYDKLFSYCIPSTPGS 286
Query: 278 TGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGG-----NAVNFEFSAIFDSGTS 332
TG ++FG K +P S Y+I +T +SVGG +A F+ ++ DSG
Sbjct: 287 TGHLTFGGKVPNDVRFSPVSKTAPSSDYDIKMTGISVGGRKLLIDASAFKIASTIDSGAV 346
Query: 333 FTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGP 392
T L AY+ + F + K D + CY S N + P +++ +GG
Sbjct: 347 LTRLPPKAYSALRSVFREMMKGYPLLDQDDF-LDTCYDFS-NYSTVAIPSISVFFEGGVE 404
Query: 393 FFVNDPIVIVSSEPKGLYLYCLGVVK-SDNVNIIG 426
++ + + + G +YCL + D V+I G
Sbjct: 405 MDID--VSGIMWQVPGSKVYCLAFAELDDEVSIFG 437
>gi|449449755|ref|XP_004142630.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
gi|449500674|ref|XP_004161165.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 413
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 100/357 (28%), Positives = 152/357 (42%), Gaps = 38/357 (10%)
Query: 105 HYTNV-SVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
H+T + ++G P+ F + +DTGSDL W+ CD C+ C + +Y P+ ++
Sbjct: 52 HFTVLLNIGNPSKVFELDIDTGSDLTWVQCDVECIGCTLPRDM---------LYRPHNNA 102
Query: 162 TSSKVPCNSTLCELQKQC-PSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
S + P + L L K + C Y+V Y G+ S G LV+D++ + K +
Sbjct: 103 VSREDPLCAALSSLGKFIFKNPNDQCAYEVEYADHGS-SVGVLVKDLVPMRL--TNGKRI 159
Query: 221 DSRISFGCGRVQ-TGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTG 279
+ FGCG Q G + G+ GL K ++ S L++ G + N C G G
Sbjct: 160 SPNLGFGCGYDQENGDLQQPPSIAGVLGLSSSKATIVSQLSDLGHVSNVVGHCLTGRGGG 219
Query: 280 RISFGDKGSP--GQGETPFSLRQTHPTYNITITQVSVGGNAVNF-EFSAIFDSGTSFTYL 336
+ FG P G TP LR + Y+ +V G AV + FDSG+S+TY
Sbjct: 220 FLFFGGDVVPSSGMSWTPI-LRNSEGKYSSGPAEVYFNGRAVGIGGLTLTFDSGSSYTYF 278
Query: 337 NDPAYTQISETF-NSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV-VNLTMKGGGPFF 394
N Y I + N L + ++ D E C+ FE V V K F
Sbjct: 279 NSQVYRAIEKLLKNDLKGNPLKLASDDKTLELCW---KGPKPFESVVDVRNFFKPLAMSF 335
Query: 395 VNDPIVIVSSEPKGLYL------YCLGVVKSD-----NVNIIGREYPIANNISLFHN 440
N V P+ + CLG++ NVNIIG + + N I ++ N
Sbjct: 336 KNSKNVQFQIPPEAYLIISEFGNVCLGILDGSKEGMGNVNIIG-DISMLNKIVVYDN 391
>gi|449508697|ref|XP_004163385.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase Asp1-like
[Cucumis sativus]
Length = 418
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 83/278 (29%), Positives = 120/278 (43%), Gaps = 28/278 (10%)
Query: 102 GFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNT 159
GF + T + VGQP + + DTGSDL WL CD C C L+ +Y P
Sbjct: 55 GFYNVT-LYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHP---------LYQP-- 102
Query: 160 SSTSSKVPCNSTLC-----ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
++ VPC LC + +C + C Y+V Y +DG S G LV DV L +
Sbjct: 103 --SNDLVPCKDPLCMSLHSSMDHRCENP-DQCDYEVEY-ADGGSSLGVLVRDVFPL--NL 156
Query: 215 KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG 274
+ R++ GCG Q +G+ GLG S+ S L NQG++ N CF
Sbjct: 157 TNGDPIRPRLALGCGYDQDPGSSSYHPMDGILGLGRGAVSIVSQLHNQGIVRNVVGHCFN 216
Query: 275 SDGTGRISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFE-FSAIFDSGTS 332
S G G FGD + + +P Y+ ++ G + +FDSG+S
Sbjct: 217 SKGGGYXFFGDGIYDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSS 276
Query: 333 FTYLNDPAYTQISETFN-SLAKEKRETSTSDLPFEYCY 369
+TY N AY ++ N LA + + D C+
Sbjct: 277 YTYFNAQAYQVLTSLLNRELAGKPLREAMDDDTLPLCW 314
>gi|326523839|dbj|BAJ96930.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 99/377 (26%), Positives = 148/377 (39%), Gaps = 59/377 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDF---NIYSPNTSS 161
++ VG PA F++ DTGSDL W+ C G +SS ++ P S
Sbjct: 110 YFVQFRVGTPAQPFVLVADTGSDLTWVKC------RGRRASSPDASPLASPRVFRPANSK 163
Query: 162 TSSKVPCNSTLCELQ-----KQCPSAGSN----CPYQVRYLSDGTMSTGFLVEDVLHLAT 212
+ + +PC+S C+ C SAG+ C Y RY D + + G + D +A
Sbjct: 164 SWAPIPCSSDTCKSYVPFSLANC-SAGTTPPAPCGYDYRY-KDKSSARGVVGTDAATIAL 221
Query: 213 DEKQS--KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFS 270
S K+ + GC G + +G+ LG S S A + FS
Sbjct: 222 SGSGSDRKAKLQEVVLGCTTSYDGQSFQSS--DGVLSLGNSNISFASRAAAR--FGGRFS 277
Query: 271 MCF-----GSDGTGRISFGDKGSP-GQGETPFSL-RQTHPTYNITITQVSVGGNAVNFEF 323
C + T ++FG G+ TP L Q P Y +T+ VSV G A+N
Sbjct: 278 YCLVDHLAPRNATSYLTFGPVGAAHSPSRTPLLLDAQVAPFYAVTVDAVSVAGKALNIPA 337
Query: 324 S---------AIFDSGTSFTYLNDPAYTQISETFNS-LAKEKRETSTSDLPFEYCYVLSP 373
AI DSGTS T L PAY + + LA+ R T PFEYCY +
Sbjct: 338 EVWDVKKNGGAILDSGTSLTILATPAYKAVVAALSKQLARVPRVTMD---PFEYCYNWTA 394
Query: 374 NQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKS--DNVNIIGR---- 427
+ P + + G ++ + P + C+G+ + V++IG
Sbjct: 395 TRRPPAVPRLEVRFAGSARLRPPTKSYVIDAAPG---VKCIGLQEGVWPGVSVIGNILQQ 451
Query: 428 ----EYPIANNISLFHN 440
E+ +AN F
Sbjct: 452 EHLWEFDLANRWLRFQE 468
>gi|297841447|ref|XP_002888605.1| hypothetical protein ARALYDRAFT_475850 [Arabidopsis lyrata subsp.
lyrata]
gi|297334446|gb|EFH64864.1| hypothetical protein ARALYDRAFT_475850 [Arabidopsis lyrata subsp.
lyrata]
Length = 410
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 77/273 (28%), Positives = 110/273 (40%), Gaps = 30/273 (10%)
Query: 109 VSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
+ +G P +F +DTGSD+ W+ CD C C + Y P ++ V
Sbjct: 58 LQIGNPPKAFEFDIDTGSDITWVQCDAPCTGC---------NLPPKLQYKPKGNT----V 104
Query: 167 PCNSTLCEL-----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
PC+ +C QCP+ C Y+V Y G+ S G LV D ++
Sbjct: 105 PCSDPICLALHFPNNPQCPNPKEQCDYEVNYADQGS-SMGALVID--QFPFKLLNGSAMQ 161
Query: 222 SRISFGCGRVQT-GSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGR 280
R++FGCG Q+ S A G+ GLG K + + L + GL N C S G G
Sbjct: 162 PRLAFGCGYDQSYPSAHPPPATAGVLGLGRGKIGLLTQLVSAGLTRNVVGHCLSSKGGGY 221
Query: 281 ISFGDKGSP--GQGETPFSLRQTHPTYNITITQVSVGGNAVNFE-FSAIFDSGTSFTYLN 337
+ FGD P G TP H Y ++ G + IFD+G+S+TY N
Sbjct: 222 LFFGDTLIPSLGVAWTPLLPPDNH--YTTGPAELLFNGKPTGLKGLKLIFDTGSSYTYFN 279
Query: 338 DPAYTQISETF-NSLAKEKRETSTSDLPFEYCY 369
Y I N L + + D C+
Sbjct: 280 SKTYQTIVNLIGNDLKVSPLKVAKEDKTLPICW 312
>gi|242095586|ref|XP_002438283.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
gi|241916506|gb|EER89650.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
Length = 470
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 101/343 (29%), Positives = 152/343 (44%), Gaps = 40/343 (11%)
Query: 99 NSLGFLHY-TNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSP 157
S+G +Y T + +G PA S+ + +DTGS L WL C CV + G +Y P
Sbjct: 127 TSVGVGNYVTELGLGTPATSYAMVVDTGSSLTWL--QCSPCVVSCHRQVGP-----LYDP 179
Query: 158 NTSSTSSKVPCNSTLC-ELQKQC--PSAGS---NCPYQVRYLSDGTMSTGFLVEDVLHLA 211
SST + VPC+++ C ELQ PSA S C YQ Y D + S G+L D +
Sbjct: 180 RASSTYATVPCSASQCDELQAATLNPSACSVRNVCIYQASY-GDSSFSVGYLSRDTVSFG 238
Query: 212 TDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSM 271
+ +GCG+ G F A GL GL +K S+ LA + SFS
Sbjct: 239 SGSYP------NFYYGCGQDNEGLFGRSA---GLIGLARNKLSLLYQLAPS--LGYSFSY 287
Query: 272 CFGSDG-TGRISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNF---EFSA- 325
C + TG +S G S TP + + Y +T++ +SVGG+ + E+S+
Sbjct: 288 CLPTPASTGYLSIGPYTSGHYSYTPMASSSLDASLYFVTLSGMSVGGSPLAVSPAEYSSL 347
Query: 326 --IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
I DSGT T L YT +S+ + A +++ + + C+ +Q P V
Sbjct: 348 PTIIDSGTVITRLPTAVYTALSKAVAA-AMVGVQSAPAFSILDTCFQGQASQ--LRVPAV 404
Query: 384 NLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
+ GG + V++ + CL +D+ IIG
Sbjct: 405 AMAFAGGATLKLATQNVLIDVDDS---TTCLAFAPTDSTTIIG 444
>gi|413924528|gb|AFW64460.1| hypothetical protein ZEAMMB73_591827 [Zea mays]
Length = 146
Score = 85.9 bits (211), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 50/105 (47%), Positives = 61/105 (58%), Gaps = 10/105 (9%)
Query: 36 HRYSDPVKGILA--VDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGN 93
HR SD + + V P++GS YY AL D + + R LA K TFS GN
Sbjct: 33 HRLSDEARLEVGPRVGWWPQRGSGEYYRALVRSD--IQRQKRRLAVLSLSKGGSTFSPGN 90
Query: 94 DTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSC 138
D LG+L+Y V VG PA SF+VALDTGSDLFW+PCDC+ C
Sbjct: 91 D------LGWLYYAWVDVGTPATSFLVALDTGSDLFWVPCDCIQC 129
>gi|356537728|ref|XP_003537377.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 543
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 99/371 (26%), Positives = 147/371 (39%), Gaps = 75/371 (20%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ ++ VG P + LDTGSDL W+ CD C C S Y P SST
Sbjct: 171 YFLDMFVGTPPKHVWLILDTGSDLSWIQCDPCYDCFEQNGSH---------YYPKDSSTY 221
Query: 164 SKVPCNSTLCEL------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLAT----- 212
+ C C+L + C + CPY Y +DG+ +TG + +
Sbjct: 222 RNISCYDPRCQLVSSSDPLQHCKAENQTCPYFYDY-ADGSNTTGDFASETFTVNLTWPNG 280
Query: 213 DEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC 272
EK + VD + FGCG G F GA+ GL GLG S PS + Q + +SFS C
Sbjct: 281 KEKFKQVVD--VMFGCGHWNKG-FFYGAS--GLLGLGRGPISFPSQI--QSIYGHSFSYC 333
Query: 273 F-----GSDGTGRISFGDKGS-------------PGQGETPFSLRQTHPTYNITITQVSV 314
+ + ++ FG+ G+ ETP Y + I + V
Sbjct: 334 LTDLFSNTSVSSKLIFGEDKELLNNHNLNFTTLLAGE-ETP-----DETFYYLQIKSIMV 387
Query: 315 GGNAVN-----FEFSA-----------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRET 358
GG ++ + +S+ I DSG++ T+ D AY I E F K ++
Sbjct: 388 GGEVLDISEQTWHWSSEGAAADAGGGTIIDSGSTLTFFPDSAYDIIKEAFEKKIK-LQQI 446
Query: 359 STSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVK 418
+ D CY +S E P + GG + EP + CL ++K
Sbjct: 447 AADDFVMSPCYNVSGAMMQVELPDFGIHFADGGVWNFPAENYFYQYEPDE--VICLAIMK 504
Query: 419 SDN---VNIIG 426
+ N + IIG
Sbjct: 505 TPNHSHLTIIG 515
>gi|242089623|ref|XP_002440644.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
gi|241945929|gb|EES19074.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
Length = 469
Score = 85.5 bits (210), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 96/369 (26%), Positives = 143/369 (38%), Gaps = 60/369 (16%)
Query: 77 LAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCV 136
L+ D PL G Y + S+G P DTGSDL W CD
Sbjct: 81 LSNNDTDTVPLRMDGGGGAYDME---------FSIGTPPQKLTALADTGSDLIWTKCD-- 129
Query: 137 SCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQK-----QCPSAGSNCPYQVR 191
+ Y PN SST +++PC+ LC + +C + G+ C Y+
Sbjct: 130 ------AGGGAAWGGSSSYHPNASSTFTRLPCSDRLCAALRSYSLARCAAGGAECDYKYA 183
Query: 192 YL--SDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLG 249
Y D + GFL + L D + FGC G + +GA GL GLG
Sbjct: 184 YGLGDDPDFTQGFLGSETFTLGGDAVPG------VGFGCTTALEGDYGEGA---GLVGLG 234
Query: 250 MDKTSVPSILANQGLIPNSFSMCFGSDGTGR--ISFGDKGS---PGQGETPFSLRQTHPT 304
P L +Q L +F C +D + + FG + G G L +
Sbjct: 235 RG----PLSLVSQ-LDAGTFMYCLTADASKASPLLFGALATMTGAGAGVQSTGLLASTTF 289
Query: 305 YNITITQVSVGGNAV---NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTS 361
Y + + +++G +FDSGT+ TYL +PAYT+ F S + TS +
Sbjct: 290 YAVNLRSITIGSATTAGVGGPGGVVFDSGTTLTYLAEPAYTEAKAAFLS-----QTTSLT 344
Query: 362 DLP----FEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVV 417
+ FE CY P+ P + L GG + +V + + C V
Sbjct: 345 PVEGRYGFEACYE-KPDSARL-IPAMVLHFDGGADMALPVANYVVEVDDG---VVCWVVQ 399
Query: 418 KSDNVNIIG 426
+S +++IIG
Sbjct: 400 RSPSLSIIG 408
>gi|255554715|ref|XP_002518395.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223542240|gb|EEF43782.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 489
Score = 85.5 bits (210), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 97/351 (27%), Positives = 150/351 (42%), Gaps = 58/351 (16%)
Query: 105 HYTNVSVGQPA-LSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
++ ++ +G P FI+ DTGSDL W+ C+ C SC N G+V + N SS
Sbjct: 119 YFVSIRIGTPRPQKFILVTDTGSDLTWMNCEYWCKSCPKP-NPHPGRV-----FRANDSS 172
Query: 162 TSSKVPCNSTLCELQKQ-------CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
+ +PC+S C+++ Q CP+ + C + RYL+ F E V D
Sbjct: 173 SFRTIPCSSDDCKIELQDYFSLTECPNPNAPCLFDYRYLNGPRAIGVFANETVTVGLNDH 232
Query: 215 KQSKSVDSRISFGCGRVQTGSFLD-GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
K+ + D I GC T SF + P+G+ GLG K S+ LA + N FS C
Sbjct: 233 KKIRLFDVLI--GC----TESFNETNGFPDGVMGLGYRKHSLALRLAE--IFGNKFSYCL 284
Query: 274 -----GSDGTGRISFGDKGS---PGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS- 324
S+ +SFGD P T L + Y + ++ +SVGG+ ++
Sbjct: 285 VDHLSSSNHKNFLSFGDIPEMKLPKMQHTELLLGYINAFYPVNVSGISVGGSMLSISSDI 344
Query: 325 --------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPF--EYCYVLSPN 374
I DSGTS T L AY ++ + + + ++ +LP +C+
Sbjct: 345 WNVTGVGGMIVDSGTSLTMLAGEAYDKVVDALKPIFDKHKKVVPIELPELNNFCF----E 400
Query: 375 QTNFEYPVVN--LTMKGGGPFF---VNDPIVIVSSEPKGLYLYCLGVVKSD 420
F+ V L G F V I+ V+ K CLG++K+D
Sbjct: 401 DKGFDRAAVPRLLIHFADGAIFKPPVKSYIIDVAEGIK-----CLGIIKAD 446
>gi|212274713|ref|NP_001130791.1| uncharacterized protein LOC100191895 precursor [Zea mays]
gi|194690124|gb|ACF79146.1| unknown [Zea mays]
gi|194708040|gb|ACF88104.1| unknown [Zea mays]
gi|223950469|gb|ACN29318.1| unknown [Zea mays]
gi|414885521|tpg|DAA61535.1| TPA: hypothetical protein ZEAMMB73_650724 [Zea mays]
Length = 500
Score = 85.5 bits (210), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 112/426 (26%), Positives = 172/426 (40%), Gaps = 61/426 (14%)
Query: 35 HHRYS-DPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQ---GNDKTPLTFS 90
HH +S P D A S+L R ++RL +A+ K + S
Sbjct: 74 HHSFSPAPANSREEEADALLSTDAARVSSLQGRIEHYRLTTTSSSAEVAVTASKAQVPVS 133
Query: 91 AGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQV 149
+G RL +L ++ + G+ V +DT S+L W+ C C SC + G +
Sbjct: 134 SGA---RLRTLNYVATVGLGGGEA----TVIVDTASELTWVQCAPCESC----HDQQGPL 182
Query: 150 IDFNIYSPNTSSTSSKVPCNSTLCE-LQKQCPSAG------------SNCPYQVRYLSDG 196
D P++S + + VPC+S C+ LQ+Q + + C Y + Y DG
Sbjct: 183 FD-----PSSSPSYAAVPCDSPSCDALQQQLATGAGAGAPPCDAGRPAACSYALSY-RDG 236
Query: 197 TMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVP 256
+ S G L D L LA + +D + FGCG G G +GL GLG + S+
Sbjct: 237 SYSRGVLAHDRLSLA-----GEVIDGFV-FGCGTSNQGPPFGGT--SGLMGLGRSQLSLV 288
Query: 257 SILANQ--GLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQ---------THPTY 305
S +Q G+ + SD +G + GD S + TP P Y
Sbjct: 289 SQTVDQFGGVFSYCLPLSRESDASGSLVLGDDPSAYRNSTPVVYTSMVSNSDPLLQGPFY 348
Query: 306 NITITQVSVGGNAVN---FEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSD 362
+ +T ++VGG V F AI DSGT T L Y + F S E +
Sbjct: 349 LVNLTGITVGGQEVESTGFSARAIVDSGTVITSLVPSVYNAVRAEFMSQLAEYPQAPGFS 408
Query: 363 LPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVI--VSSEPKGLYLYCLGVVKSD 420
+ + C+ ++ + P + L GG V+ V+ VSS+ + L + D
Sbjct: 409 I-LDTCFNMT-GLKEVQVPSLTLVFDGGAEVEVDSGGVLYFVSSDSSQVCLAVASLKSED 466
Query: 421 NVNIIG 426
+IIG
Sbjct: 467 ETSIIG 472
>gi|255541796|ref|XP_002511962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223549142|gb|EEF50631.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 495
Score = 85.1 bits (209), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 92/338 (27%), Positives = 149/338 (44%), Gaps = 42/338 (12%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++T V VG PA S+ + LDTGSD+ W+ C C C + I++P SS+
Sbjct: 159 YFTRVGVGNPAKSYYMVLDTGSDINWIQCQPCSDCYQQSDP---------IFTPAASSSY 209
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
S + C+S C + C YQV Y DG+ + G V + + S +V+S
Sbjct: 210 SPLTCDSQQCNSLQMSSCRNGQCRYQVNY-GDGSFTFGDFVTETMSFGG----SGTVNS- 263
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISF 283
I+ GCG G F+ A + P L +Q L SFS C + + S
Sbjct: 264 IALGCGHDNEGLFVGAAG-------LLGLGGGPLSLTSQ-LKATSFSYCLVNRDSAASST 315
Query: 284 GDKGSPGQGETPFS--LRQTHPT--YNITITQVSVGGNAVNF-----------EFSAIFD 328
D S G++ + L+ + Y + ++ +SVGG + + I D
Sbjct: 316 LDFNSAPVGDSVIAPLLKSSKIDTFYYVGLSGMSVGGELLRIPQEVFKLDDSGDGGVIVD 375
Query: 329 SGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMK 388
GT+ T L AY + ++F S+++ R TS L F+ CY LS Q++ + P V+
Sbjct: 376 CGTAITRLQSEAYNSLRDSFVSMSRHLRSTSGVAL-FDTCYDLS-GQSSVKVPTVSFHFD 433
Query: 389 GGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
GG + + ++ + G Y + S +++IIG
Sbjct: 434 GGKSWDLPAANYLIPVDSAGTYCFAFAPTTS-SLSIIG 470
>gi|212722026|ref|NP_001131674.1| uncharacterized protein LOC100193034 precursor [Zea mays]
gi|194692214|gb|ACF80191.1| unknown [Zea mays]
gi|413946454|gb|AFW79103.1| hypothetical protein ZEAMMB73_752316 [Zea mays]
Length = 441
Score = 85.1 bits (209), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 88/349 (25%), Positives = 135/349 (38%), Gaps = 52/349 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
++ V VG PA F + DTGS+L W+ C + GL ++ P S + +
Sbjct: 91 YFVKVLVGTPAQEFTLVADTGSELTWVKCAGGASPPGL-----------VFRPEASKSWA 139
Query: 165 KVPCNSTLCELQ-----KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
VPC+S C+L C S+ S C Y RY + G + D +A +
Sbjct: 140 PVPCSSDTCKLDVPFSLANCSSSASPCSYDYRYKEGSAGALGVVGTDSATIALPGGKVAQ 199
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-----G 274
+ + GC G +G+ LG K S S A + SFS C
Sbjct: 200 LQD-VVLGCSSTHDGQSFKSV--DGVLSLGNAKISFASRAAAR--FGGSFSYCLVDHLAP 254
Query: 275 SDGTGRISFGDKGSPGQ------GETPFSLRQTHPTYNITITQVSVGGNAVNFEFS---- 324
+ TG ++FG PGQ +T L P Y + + V V G A++
Sbjct: 255 RNATGYLAFG----PGQVPRTPATQTKLFLDPAMPFYGVKVDAVHVAGQALDIPAEVWDP 310
Query: 325 ----AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY-VLSPNQTNFE 379
I DSGT+ T L PAY + L + PFE+CY +P E
Sbjct: 311 KSGGVILDSGTTLTVLATPAYKAVVAALTKLLAGVPKVDFP--PFEHCYNWTAPRPGAPE 368
Query: 380 YPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD--NVNIIG 426
P + + G ++ +P + C+G+ + + V++IG
Sbjct: 369 IPKLAVQFTGCARLEPPAKSYVIDVKPG---VKCIGLQEGEWPGVSVIG 414
>gi|47497551|dbj|BAD19623.1| nucellin-like aspartic protease-like [Oryza sativa Japonica Group]
gi|47847593|dbj|BAD21980.1| nucellin-like aspartic protease-like [Oryza sativa Japonica Group]
Length = 297
Score = 85.1 bits (209), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 77/251 (30%), Positives = 112/251 (44%), Gaps = 25/251 (9%)
Query: 55 GSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQP 114
G + SAL D R GR LAA PL S L + L++T + +G P
Sbjct: 51 GGEGHLSALREHDG--RRHGRLLAAI---DLPLGGSG------LATETGLYFTRIGIGTP 99
Query: 115 ALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC- 173
A + V +DTGSD+ W+ +CVSC G S I+ +Y P S + V C+ C
Sbjct: 100 AKRYYVQVDTGSDILWV--NCVSC-DGCPRKSNLGIELTMYDPRGSQSGELVTCDQQFCV 156
Query: 174 ----ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQSKSVDSRISFG 227
+ C S S C Y + Y DG+ + GF V D L + + Q+ ++ +SFG
Sbjct: 157 ANYGGVLPSCTST-SPCEYSISY-GDGSSTAGFFVTDFLQYNQVSGDGQTTPANASVSFG 214
Query: 228 CGRVQTGSF-LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGTGRISFGD 285
CG G A +G+ G G +S+ S LA G + F+ C + +G G + G+
Sbjct: 215 CGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCLDTVNGGGIFAIGN 274
Query: 286 KGSPGQGETPF 296
P TP
Sbjct: 275 VVQPKVKTTPL 285
>gi|413923780|gb|AFW63712.1| hypothetical protein ZEAMMB73_689747 [Zea mays]
Length = 470
Score = 85.1 bits (209), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 84/297 (28%), Positives = 124/297 (41%), Gaps = 40/297 (13%)
Query: 71 RLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFW 130
R+ GRG + K N Y + + ++ S+G P ++ + +DTGSDL W
Sbjct: 105 RVSGRGAPQLWDYKAAAATVPANWGYDIGTSNYV--VTASLGTPGMAQTLEVDTGSDLSW 162
Query: 131 L---PCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE----LQKQCPSAG 183
+ PC SC + ++ P SS+ + VPC + C C +A
Sbjct: 163 VQCKPCAAPSCYRQKDP---------LFDPAQSSSYAAVPCGRSACAGLGIYASACSAA- 212
Query: 184 SNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPN 243
C Y V Y DG+ +TG D L LA + + FGCG Q+G G +
Sbjct: 213 -QCGYVVSY-GDGSNTTGVYSSDTLTLAANATVQGFL-----FGCGHAQSGGLFTGI--D 263
Query: 244 GLFGLGMDKTSVPSILANQGLIPNSFSMCF--GSDGTGRISFGDKG--SPGQGETPFSLR 299
GL G G ++ S+ + G FS C S TG ++ G +PG T
Sbjct: 264 GLLGFGREQPSL--VQQTAGAYGGVFSYCLPTKSSTTGYLTLGGPSGVAPGFSTTQLLPS 321
Query: 300 QTHPTYNIT-ITQVSVGGNAVNFEFSA-----IFDSGTSFTYLNDPAYTQISETFNS 350
PTY + +T +SVGG ++ SA + D+GT T L AY + F S
Sbjct: 322 PNAPTYYVVMLTGISVGGQPLSVPASAFAAGTVVDTGTVITRLPPAAYAALRSAFRS 378
>gi|242067691|ref|XP_002449122.1| hypothetical protein SORBIDRAFT_05g005410 [Sorghum bicolor]
gi|241934965|gb|EES08110.1| hypothetical protein SORBIDRAFT_05g005410 [Sorghum bicolor]
Length = 407
Score = 84.7 bits (208), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 79/306 (25%), Positives = 125/306 (40%), Gaps = 48/306 (15%)
Query: 87 LTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-----CVSCVHG 141
+ F G D + Y +++G+PA + + +DTGS+L W+ C C +C
Sbjct: 26 MVFKLGGDVHPTGHF----YVTMNIGEPAKPYFLDIDTGSNLTWIKCHATPGPCKTC--- 78
Query: 142 LNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCEL-------QKQCPSAGSNCPYQVRYLS 194
+ +Y P VPC LC+ K C C YQ+ Y +
Sbjct: 79 ------NKVPHPLYRPK-----KLVPCADPLCDALHKDLGTTKDCREEPDQCHYQINY-A 126
Query: 195 DGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAP----NGLFGLGM 250
DGT S G L+ D L T ++ I+FGCG Q A +G+ GLG
Sbjct: 127 DGTTSLGVLLLDKFSLPTGSARN------IAFGCGYDQMQGPKKKAPEKVPVDGILGLGR 180
Query: 251 DKTSVPSILANQGLI-PNSFSMCFGSDGTGRISFGDKGSPGQGET---PFSLRQTHPTYN 306
+ S L + G + N C S G G + G++ P + + + Y+
Sbjct: 181 GSVDLVSQLKHSGAVSKNVIGHCLSSKGGGYLFIGEENVPSSHLHIIYIYCISREPNHYS 240
Query: 307 ITITQVSVGGNAVNFE-FSAIFDSGTSFTYLNDPAYTQISETFN-SLAKEK-RETSTSDL 363
+ +G N + + F AIFDSG+++TYL + + Q+ SL K + S +D
Sbjct: 241 PGQATLHLGRNPIGTKPFKAIFDSGSTYTYLPENLHAQLVSALKASLIKSSLKLVSDTDT 300
Query: 364 PFEYCY 369
C+
Sbjct: 301 RLHLCW 306
>gi|302753526|ref|XP_002960187.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
gi|300171126|gb|EFJ37726.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
Length = 398
Score = 84.7 bits (208), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 85/305 (27%), Positives = 130/305 (42%), Gaps = 47/305 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ T +S+G PA F V DTGSDL W+ C C +C + + I+ P SS+
Sbjct: 40 YVTTISLGTPAKVFSVIADTGSDLIWIQCKPCQACFNQKDP---------IFDPEGSSSY 90
Query: 164 SKVPCNSTLCEL--QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
+ + C TLC+ +K C NC Y Y DG+ + G L + + L + + + K
Sbjct: 91 TTMSCGDTLCDSLPRKSC---SPNCDYSYGY-GDGSGTRGTLSSETVTLTSTQGE-KLAA 145
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-----GSD 276
I+FGCG + GSF D + GL GLG S S L + L + FS C
Sbjct: 146 KNIAFGCGHLNRGSFNDAS---GLVGLGRGNLSFVSQLGD--LFGHKFSYCLVPWRDAPS 200
Query: 277 GTGRISFGDKGSPGQG----ETPFSLRQTHPT----YNITITQVSVGGNAVNFEFSA--- 325
T + FGD+ S F+ +P Y + + +S+ G A+ +
Sbjct: 201 KTSPMFFGDESSSHSSGKKLHYAFTPMIHNPAMESFYYVKLKDISIAGRALRIPAGSFDI 260
Query: 326 --------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTN 377
IFDSGT+ T L D Y + S E S + CY +S ++ +
Sbjct: 261 KPDGSGGMIFDSGTTLTLLPDAPYQIVLRALRSKVSFP-EIDGSSAGLDLCYDVSGSKAS 319
Query: 378 FEYPV 382
++ +
Sbjct: 320 YKKKI 324
>gi|242095588|ref|XP_002438284.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
gi|241916507|gb|EER89651.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
Length = 487
Score = 84.7 bits (208), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 112/403 (27%), Positives = 171/403 (42%), Gaps = 58/403 (14%)
Query: 59 YYSALAHRDRYFRLRG--RGLAAQGNDKTPLTFSAGNDTYRLNSLGF--LHYT-NVSVGQ 113
+Y+ + RDR+ R+R R L A T T A RL L F L Y + +G
Sbjct: 78 HYTGILRRDRH-RVRSIYRRLTAAETTTTTTTIPA-----RLG-LAFQSLEYVVTIGIGT 130
Query: 114 PALSFIVALDTGSDLFW---LPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNS 170
P +F V DTGSDL W LPC SC ++ P+ SST VPC++
Sbjct: 131 PPRNFTVLFDTGSDLTWVQCLPCPDSSCYPQQEP---------LFDPSKSSTYVDVPCSA 181
Query: 171 TLCELQ--KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGC 228
C + +Q ++C Y V+Y D + + G L E+ L+ + + + + FGC
Sbjct: 182 PECHIGGVQQTRCGATSCEYSVKY-GDESETHGSLAEETFTLSPPSPLAPAA-TGVVFGC 239
Query: 229 GRVQTGSFLD-GAAPNGLFGLGMDKTSVPSILANQGLIPNS----FSMCFGSDG--TGRI 281
F D G GL GLG + SIL+ NS FS C G TG +
Sbjct: 240 SHEYISVFNDTGMGVAGLLGLGRGDS---SILSQTRRSINSGGGVFSYCLPPRGSSTGYL 296
Query: 282 SFGDKGSPGQGE------TPF--SLRQTHPTYNITITQVSVGGNAVN-----FEFSAIFD 328
+ G + Q + TP ++ Q Y + + VSV G AV+ F A+ D
Sbjct: 297 TIGGGAAAPQQQYSNLSFTPLITTISQLRSAYVVNLAGVSVNGAAVDIPASAFSLGAVID 356
Query: 329 SGTSFTYLNDPAYTQISETFN-SLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTM 387
SGT T++ AY + + F + K S + CY ++ Q P V L
Sbjct: 357 SGTVVTHMPAAAYYPLRDEFRLHMGSYKMLPEGSMKLLDTCYDVT-GQDVVTAPRVALEF 415
Query: 388 KGGGPFFVNDP--IVIVSSEP---KGLYLYCLGVVKSDNVNII 425
GG V+ ++++ +E + L L CL + +++ ++
Sbjct: 416 GGGARIDVDASGILLVLPAEDGSGQSLTLACLAFLPTNSAGLV 458
>gi|356527089|ref|XP_003532146.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 488
Score = 84.7 bits (208), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 89/305 (29%), Positives = 129/305 (42%), Gaps = 42/305 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
++ V +G P + DTGSDL W C+ SC ++ I+ P+ S++
Sbjct: 145 YFVVVGLGTPKRDLSLIFDTGSDLTWTQCEPCARSCYKQQDA---------IFDPSKSTS 195
Query: 163 SSKVPCNSTLCEL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL-ATDE 214
S + C STLC + C ++ C Y ++Y D + S G+ + L + ATD
Sbjct: 196 YSNITCTSTLCTQLSTATGNEPGCSASTKACIYGIQY-GDSSFSVGYFSRERLSVTATD- 253
Query: 215 KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF- 273
+ FGCG+ G F A GL GLG S + + FS C
Sbjct: 254 -----IVDNFLFGCGQNNQGLFGGSA---GLIGLGRHPISF--VQQTAAVYRKIFSYCLP 303
Query: 274 -GSDGTGRISFGDKGSPGQGETPFS-LRQTHPTYNITITQVSVGGNAVNFEFS------A 325
S TGR+SFG + TPFS + + Y + IT +SVGG + S A
Sbjct: 304 ATSSSTGRLSFGTTTTSYVKYTPFSTISRGSSFYGLDITGISVGGAKLPVSSSTFSTGGA 363
Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNL 385
I DSGT T L AYT + F K ++ + CY LS + F P ++
Sbjct: 364 IIDSGTVITRLPPTAYTALRSAFRQ-GMSKYPSAGELSILDTCYDLSGYEV-FSIPKIDF 421
Query: 386 TMKGG 390
+ GG
Sbjct: 422 SFAGG 426
>gi|413936885|gb|AFW71436.1| hypothetical protein ZEAMMB73_738128, partial [Zea mays]
Length = 320
Score = 84.7 bits (208), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 85/298 (28%), Positives = 128/298 (42%), Gaps = 45/298 (15%)
Query: 12 VLLILLSCCAGCCFGFGTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRD--RY 69
+L++L + GC G F R P G +G + +AL D R+
Sbjct: 14 LLVLLFALSVGCASATGVF----QVRRKFPRHG--------GRGVAEHLAALRRHDANRH 61
Query: 70 FRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLF 129
RL G A G P DT L+YT + +G P + V +DTGSD+
Sbjct: 62 GRLLGAVDLALGGVGLP------TDTG-------LYYTRIEIGSPPKGYYVQVDTGSDIL 108
Query: 130 WLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC------ELQKQCPSAG 183
W+ +C+ C G + SG I+ Y P S T+ V C C + CPS
Sbjct: 109 WV--NCIRC-DGCPTRSGLGIELTQYDPAGSGTT--VGCEQEFCVANSAGGVPPTCPSTS 163
Query: 184 SNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQSKSVDSRISFGCGRVQTGSFLDGA- 240
S C +++ Y DG+ +TGF V D + + Q+ + ++ I+FGCG Q G L +
Sbjct: 164 SPCQFRITY-GDGSTTTGFYVTDFVQYNQVSGNGQTTTSNASITFGCG-AQLGGDLGSSN 221
Query: 241 -APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGTGRISFGDKGSPGQGETPF 296
A +G+ G G +S+ S LA + F+ C + G G + G+ P TP
Sbjct: 222 QALDGILGFGQSDSSMLSQLAAARRVRKIFAHCLDTVRGGGIFAIGNVVQPKVKTTPL 279
>gi|414880708|tpg|DAA57839.1| TPA: hypothetical protein ZEAMMB73_997765 [Zea mays]
Length = 452
Score = 84.7 bits (208), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 92/357 (25%), Positives = 140/357 (39%), Gaps = 59/357 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ + VG P +V +DTGSDL WL C C C + +Y P S T
Sbjct: 92 YFAVIGVGDPPTHALVVIDTGSDLIWLQCLPCRRCYRQVTP---------LYDPRNSKTH 142
Query: 164 SKVPCNSTLCE---LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
++PC S C C + C Y V Y DG+ S+G L D L L D +
Sbjct: 143 RRIPCASPQCRGVLRYPGCDARTGGCVYMVVY-GDGSASSGDLATDTLVLPDDTRVHN-- 199
Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG------ 274
++ GCG G A GL G G + S P+ LA + FS C G
Sbjct: 200 ---VTLGCGHDNEGLLASAA---GLLGAGRGQLSFPTQLAPA--YGHVFSYCLGDRMSRA 251
Query: 275 SDGTGRISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGG-------------N 317
+ + + FG +P T F+ +T+P Y + + SVGG N
Sbjct: 252 RNSSSYLVFGR--TPELPSTAFTPLRTNPRRPSLYYVDMVGFSVGGERVAGFSNASLALN 309
Query: 318 AVNFEFSAIFDSGTSFTYLNDPAYTQISETF--NSLAKEKRETSTSDLPFEYCYVLSPN- 374
+ DSGT+ + AY + + F ++ A R F+ CY + N
Sbjct: 310 PATGRGGVVVDSGTAISRFTRDAYAAVRDAFVSHAAAAGMRRLRNKFSVFDTCYDVHGNG 369
Query: 375 -QTNFEYPVVNLTMKGGGPFFV---NDPIVIVSSEPKGLYLYCLGVVKSDN-VNIIG 426
T P + L + N I +V + + +CLG+ +D+ +N++G
Sbjct: 370 PGTGVRVPSIVLHFAAAADMALPQANYLIPVVGGDRR--TYFCLGLQAADDGLNVLG 424
>gi|37542275|gb|AAK81698.1| aspartyl proteinase [Oryza sativa]
Length = 410
Score = 84.7 bits (208), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 74/278 (26%), Positives = 120/278 (43%), Gaps = 28/278 (10%)
Query: 106 YTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNT--SS 161
+ +++ PA + + +DTGS L WL CD C++C + +Y P +
Sbjct: 39 FVTMNISDPAKPYFLDIDTGSTLTWLQCDYPCINC---------NKVPHGLYKPELKYAV 89
Query: 162 TSSKVPCNSTLCELQKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
++ C +L+K N C Y ++Y+ G S G L+ D L +
Sbjct: 90 KCTEQRCADLYADLRKPMKCGPKNQCHYGIQYV--GGSSIGVLIVDSFSLPASNGTN--- 144
Query: 221 DSRISFGCGRVQTGSFLDGAAP-NGLFGLGMDKTSVPSILANQGLI-PNSFSMCFGSDGT 278
+ I+FGCG Q + + P NG+ GLG K ++ S L +QG+I + C S G
Sbjct: 145 PTSIAFGCGYNQGKNNHNVPTPVNGILGLGRGKVTLLSQLKSQGVITKHVLGHCISSKGK 204
Query: 279 GRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE---FSAIFDSGTSFTY 335
G + FGD P G T + + H Y+ + N+ IFDSG ++TY
Sbjct: 205 GFLFFGDAKVPTSGVTWSPMNREHKHYSPRQGTLHFNSNSKPISAAPMEVIFDSGATYTY 264
Query: 336 LN-DPAYTQISETFNSLAKEKR---ETSTSDLPFEYCY 369
P + +S ++L+KE + E D C+
Sbjct: 265 FALQPYHATLSVVKSTLSKECKFLTEVKEKDRALTVCW 302
>gi|225216949|gb|ACN85242.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
Length = 519
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 100/366 (27%), Positives = 153/366 (41%), Gaps = 48/366 (13%)
Query: 99 NSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYS 156
+LG +Y V +G PA + V DTGSD W+ C CV + ++
Sbjct: 173 RALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQRE--------KLFD 224
Query: 157 PNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
P SST + V C + C +G +C Y V+Y DG+ S GF D L L++ +
Sbjct: 225 PARSSTYANVSCAAPACSDLNIHGCSGGHCLYGVQY-GDGSYSIGFFAMDTLTLSSYDAV 283
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--G 274
FGCG G F + A GL GLG KTS+P ++ F+ C
Sbjct: 284 KG-----FRFGCGERNEGLFGEAA---GLLGLGRGKTSLPVQTYDK--YGGVFAHCLPAR 333
Query: 275 SDGTGRISFGD---KGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA----- 325
S GTG + FG + + TP L + PT Y + +T + VGG ++ S
Sbjct: 334 STGTGYLDFGAGSLAAARARLTTPM-LTENGPTFYYVGMTGIRVGGQLLSIPQSVFATAG 392
Query: 326 -IFDSGTSFTYLNDPAYTQISETFNSLAKE---KRETSTSDLPFEYCYVLSPNQTNFEYP 381
I DSGT T L AY+ + F + K+ + S L + CY + + P
Sbjct: 393 TIVDSGTVITRLPPAAYSSLRYAFAAAMAARGYKKAPAVSLL--DTCYDFT-GMSQVAIP 449
Query: 382 VVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYC--------LGVVKSDNVNIIGREYPIAN 433
V+L +GG V+ ++ ++ + L +G+V + + G Y I
Sbjct: 450 TVSLLFQGGARLDVDASGIMYAASASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGK 509
Query: 434 NISLFH 439
+ F+
Sbjct: 510 KVVGFY 515
>gi|88174567|gb|ABD39358.1| chloroplast nucleoid DNA-binding protein [Oryza glumipatula]
Length = 323
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 87/346 (25%), Positives = 147/346 (42%), Gaps = 47/346 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+ +V +G PA + IV +DTGS W+ C+C C H + + + S+T +
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCECDGC-H---------TNPRTFLQSRSTTCA 50
Query: 165 KVPCNSTLCELQKQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
KV C +++C L P +CP++V Y DG+ S G L +D L + +K
Sbjct: 51 KVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSY-QDGSASYGILYQDTLTFSDVQKIPG- 108
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC------- 272
+FGC G+ G +GL G+G + SV L + FS C
Sbjct: 109 ----FTFGCNMDSFGANEFGNV-DGLLGMGAGQMSV---LKQSSPTFDGFSYCLPLQMSE 160
Query: 273 --FGSDGTGRISFGDKGSPGQGETPF----SLRQTHPTYNITITQVSVGGNAVNFEFS-- 324
F S TG S G K + + + + + R+ + + +T +SV G + S
Sbjct: 161 RGFFSKTTGYFSLGGKIAATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSIF 220
Query: 325 ----AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEY 380
+FDSG+ +Y+ D A + +S+ L R + + CY + +
Sbjct: 221 SRKGVVFDSGSELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRSVDEG-DM 277
Query: 381 PVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
P ++L G F + V V + ++CL +++V+IIG
Sbjct: 278 PAISLHFDDGARFDLGSHGVFVERSVQEQDVWCLAFAPTESVSIIG 323
>gi|125527257|gb|EAY75371.1| hypothetical protein OsI_03267 [Oryza sativa Indica Group]
Length = 484
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 102/397 (25%), Positives = 154/397 (38%), Gaps = 75/397 (18%)
Query: 63 LAHRDR----YFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSF 118
LA DR + RGR AA+ + S+G T ++ VG PA F
Sbjct: 46 LARMDRERMAFISSRGRRRAAETASAFAMPLSSGAYTGTGQ-----YFVRFRVGTPAQPF 100
Query: 119 IVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDF-------NIYSPNTSSTSSKVPCNST 171
++ DTGSDL W+ C + + + + + P+ S T + +PC+S
Sbjct: 101 LLVADTGSDLTWVKCHRAAAAASASPRNASSLPAPAPASPRRTFRPDKSRTWAPIPCSSA 160
Query: 172 LCELQ-----KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR-IS 225
C C + + C Y RY DG+ + G + D +A + ++ R +
Sbjct: 161 TCRESLPFSLAACATPANPCAYDYRY-KDGSAARGTVGVDSATIALSGRAARKAKLRGVV 219
Query: 226 FGCGRVQTG-SFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-----GSDGTG 279
GC G SFL A +G+ LG S S A++ FS C + T
Sbjct: 220 LGCTTSYNGQSFL---ASDGVLSLGYSNISFASRAASR--FGGRFSYCLVDHLAPRNATS 274
Query: 280 RISFGDKGS-----PGQG---------------------ETPFSL-RQTHPTYNITITQV 312
++FG + P +G +TP L +T P Y +T+ V
Sbjct: 275 YLTFGPNPAFSSRRPSEGIASCKPAPAPTPAPAGAPGARQTPLVLDHRTRPFYAVTVKGV 334
Query: 313 SVGGNAVNFEFS---------AIFDSGTSFTYLNDPAYTQISETFNS-LAKEKRETSTSD 362
SV G + + AI DSGTS T L PAY + + LA R T
Sbjct: 335 SVAGELLKIPRAVWDVEQGGGAILDSGTSLTMLAKPAYRAVVAALSKRLAGLPRVTMD-- 392
Query: 363 LPFEYCY-VLSPNQTNFEYPVVNLTMKGGGPFFVNDP 398
PF+YCY SP+ ++ P+ L + G + P
Sbjct: 393 -PFDYCYNWTSPSGSDVAAPLPMLAVHFAGSARLEPP 428
>gi|357143657|ref|XP_003573000.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Brachypodium distachyon]
Length = 464
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 100/351 (28%), Positives = 152/351 (43%), Gaps = 59/351 (16%)
Query: 98 LNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSP 157
LN+L ++ VS+G PA++ + +DTGSD+ WL C +Y P
Sbjct: 126 LNTLEYV--ITVSIGSPAVAXTMFIDTGSDVSWLRCKS-----------------RLYDP 166
Query: 158 NTSSTSSKVPCNSTLC-ELQKQCP--SAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
TSST + C++ C +L ++ S+GS C Y V+Y DG+ +TG D L LA
Sbjct: 167 GTSSTYAPFSCSAPACAQLGRRGTGCSSGSTCVYSVKY-GDGSNTTGTYGSDTLTLA--- 222
Query: 215 KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTS-VPSILANQGLIPNSFSMCF 273
S+ + S FGC V+ G D +GL GLG D S V A G ++FS C
Sbjct: 223 GTSEPLISGFQFGCSAVEHGFEEDNT--DGLMGLGGDAQSFVSQTAATYG---SAFSYCL 277
Query: 274 GS--DGTGRISFGDKGSPGQGETP----FSLRQTHPTYNITITQVSVGGNAVN-----FE 322
+ +G ++ G S +Q Y + + +SVGG + F
Sbjct: 278 PPTWNSSGFLTLGAPSSSTSAAFSTTPMLRSKQAATFYGLLLRGISVGGKTLEIPSSVFS 337
Query: 323 FSAIFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEYCYVLSPN--QTNFE 379
+I DSGT T L AY +S F + +A+ + + + + C+ + + NF
Sbjct: 338 AGSIVDSGTVITRLPPTAYGALSAAFRDGMARYQYQPAAPRGLLDTCFDFTGHGEGNNFT 397
Query: 380 YPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYL-YCLGVVKSDN---VNIIG 426
P V L + GG +V P G+ CL +D+ IIG
Sbjct: 398 VPSVALVLDGG---------AVVDLHPNGIVQDGCLAFAATDDDGRTGIIG 439
>gi|242092898|ref|XP_002436939.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
gi|241915162|gb|EER88306.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
Length = 469
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 120/470 (25%), Positives = 184/470 (39%), Gaps = 79/470 (16%)
Query: 8 SPVCVLLILLSCCAGCCFGFGTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYS-ALAHR 66
SP+ +L++L S C G GF R S + + + + + S A S +L HR
Sbjct: 3 SPLLLLVVLCSYCCYIALGGNEHGFAVVQRRSYDSETVCSASKVNLEPSSATVSMSLVHR 62
Query: 67 D--------------------RYFRLRGRGLAAQGNDKTPLTFSAGND------TYRLNS 100
R R R + +Q + + ++ D T
Sbjct: 63 YGPCAPSQYSNVPTPSISETLRRSRARTNYIMSQASKSMGMGMASTPDDDDAAVTIPTRL 122
Query: 101 LGFL----HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYS 156
GF+ + + G P++ ++ +DTGSD+ W+ C C NS+ ++
Sbjct: 123 GGFVDSLEYVVTLGFGTPSVPQVLLMDTGSDVSWV--QCTPC----NSTKCYPQKDPLFD 176
Query: 157 PNTSSTSSKVPCNSTLCE-----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLA 211
P+ SST + + CN+ C C S G+ C Y V Y +DG+ S G + L LA
Sbjct: 177 PSKSSTYAPIACNTDACRKLGDHYHNGCTSGGTQCGYSVEY-ADGSHSRGVYSNETLTLA 235
Query: 212 TDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSM 271
FGCGR Q G +GL GLG S+ ++ + +FS
Sbjct: 236 PGITVED-----FHFGCGRDQRGP---SDKYDGLLGLGGAPVSL--VVQTSSVYGGAFSY 285
Query: 272 CFGSDGTGRISFGDKGSPGQGE------TPFSLRQTHPT-YNITITQVSVGGNAVNFEFS 324
C + + F GSP G TP + T Y +T+T +SVGG ++ S
Sbjct: 286 CLPALNS-EAGFLVLGSPPSGNKSAFVFTPMRHLPGYATFYMVTMTGISVGGKPLHIPQS 344
Query: 325 A-----IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFE 379
A I DSGT T L + AY + K + D F+ CY + +N
Sbjct: 345 AFRGGMIIDSGTVDTELPETAYNALEAALRKALKAYPLVPSDD--FDTCYNFT-GYSNIT 401
Query: 380 YPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKS---DNVNIIG 426
P V T GG ++ P I+ ++ CL +S D + IIG
Sbjct: 402 VPRVAFTFSGGATIDLDVPNGILVND-------CLAFQESGPDDGLGIIG 444
>gi|255576511|ref|XP_002529147.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223531426|gb|EEF33260.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 479
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 90/302 (29%), Positives = 135/302 (44%), Gaps = 42/302 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+++ V +G+P+ + LDTGSD+ W+ C C C H + I+ P +S++
Sbjct: 144 YFSRVGIGKPSSPVYMVLDTGSDVNWIQCAPCADCYHQADP---------IFEPASSTSY 194
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
S + C++ C+ + C Y+V Y DG+ + G V + + L S SVD+
Sbjct: 195 SPLSCDTKQCQSLDVSECRNNTCLYEVSY-GDGSYTVGDFVTETITLG-----SASVDN- 247
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGTGR 280
++ GCG G F+ A L GLG K S PS + +SFS C SD
Sbjct: 248 VAIGCGHNNEGLFIGAAG---LLGLGGGKLSFPSQIN-----ASSFSYCLVDRDSDSAST 299
Query: 281 ISFGDKGSPGQGETP-FSLRQTHPTYNITITQVSVGGNAVN-----FEFSA------IFD 328
+ F P P R+ Y + +T +SVGG ++ FE I D
Sbjct: 300 LEFNSALLPHAITAPLLRNRELDTFYYVGMTGLSVGGELLSIPESMFEMDESGNGGIIID 359
Query: 329 SGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMK 388
SGT+ T L AY + + F K+ TS L F+ CY LS +T+ E P V +
Sbjct: 360 SGTAVTRLQTAAYNALRDAFVKGTKDLPVTSEVAL-FDTCYDLS-RKTSVEVPTVTFHLA 417
Query: 389 GG 390
GG
Sbjct: 418 GG 419
>gi|297814776|ref|XP_002875271.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
lyrata]
gi|297321109|gb|EFH51530.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
lyrata]
Length = 451
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 75/271 (27%), Positives = 115/271 (42%), Gaps = 43/271 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ ++ +GQP S ++ DTGSDL W+ C C +C H ++ ++ P SST
Sbjct: 83 YFVDLRIGQPPQSLLLIADTGSDLVWVKCSACRNCSHHSPAT--------VFFPRHSSTF 134
Query: 164 SKVPCNSTLCELQKQCPSA--------GSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEK 215
S C +C L + A S CPY+ Y +DG++++G + L T
Sbjct: 135 SPAHCYDPVCRLVPKPGRAPRCNHTRIHSTCPYEYGY-ADGSLTSGLFARETTSLKTSSG 193
Query: 216 QSKSVDSRISFGCGRVQTGSFLDGAA---PNGLFGLGMDKTSVPSILANQGLIPNSFSMC 272
+ + S ++FGCG +G + G + NG+ GLG S S L + N FS C
Sbjct: 194 KEAKLKS-VAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQLGRR--FGNKFSYC 250
Query: 273 -----FGSDGTGRISFGDKGSPGQGE--TPFSLRQTHPT-YNITITQVSVGGNAVNFEFS 324
T + GD G TP PT Y + + V V G + + S
Sbjct: 251 LMDYTLSPPPTSYLIIGDGGDAVSKLFFTPLLTNPLSPTFYYVKLKSVFVNGAKLRIDPS 310
Query: 325 -----------AIFDSGTSFTYLNDPAYTQI 344
+ DSGT+ +L DPAY +
Sbjct: 311 IWEIDDSGNGGTVMDSGTTLAFLADPAYRLV 341
>gi|118486628|gb|ABK95151.1| unknown [Populus trichocarpa]
Length = 393
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 88/280 (31%), Positives = 128/280 (45%), Gaps = 39/280 (13%)
Query: 106 YTNVS--VGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
Y NV+ +GQP+ + + +DTGSDL WL CD CV C + P
Sbjct: 33 YYNVTLNIGQPSKPYFLDVDTGSDLTWLQCDAPCVQCTEAPH-------------PYYRP 79
Query: 162 TSSKVPCNSTLCEL-----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLA-TDEK 215
++ VPC +C+ +C + G C Y+V Y +DG S G LV D +L T EK
Sbjct: 80 RNNLVPCMDPICQSLHSNGDHRCENPG-QCDYEVEY-ADGGSSFGVLVTDTFNLNFTSEK 137
Query: 216 QSKSVDSRISFGCGRVQTGSFLDGAAP--NGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
+ + ++ GCG Q F G+ +G+ GLG K+S+ S L++ GL+ N C
Sbjct: 138 RHSPL---LALGCGYDQ---FPGGSHHPIDGVLGLGKGKSSIVSQLSSLGLVRNVIGHCL 191
Query: 274 GSDGTGRISFGDK--GSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE-FSAIFDSG 330
G G + FGD S TP S H Y+ + +++ G F+ FDSG
Sbjct: 192 SGHGGGFLFFGDDLYDSSRVAWTPMSPDAKH--YSPGLAELTFDGKTTGFKNLLTTFDSG 249
Query: 331 TSFTYLNDPAYT-QISETFNSLAKEKRETSTSDLPFEYCY 369
S+TYLN AY IS L+ + + D C+
Sbjct: 250 ASYTYLNSQAYQGLISLLKKELSGKPLREALDDQTLPLCW 289
>gi|302757345|ref|XP_002962096.1| hypothetical protein SELMODRAFT_403622 [Selaginella moellendorffii]
gi|300170755|gb|EFJ37356.1| hypothetical protein SELMODRAFT_403622 [Selaginella moellendorffii]
Length = 506
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 85/313 (27%), Positives = 121/313 (38%), Gaps = 42/313 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLN----SSSGQVIDFNIYSPNT 159
+Y + VG P +DTGSD+ W C C C N SS +Y P
Sbjct: 88 YYAQIGVGHPVQFLNAIVDTGSDILWFKCKLCQGCSSKKNVIVCSSIIMQGPITLYDPEL 147
Query: 160 SSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
S T+S C+ LC C ++C Y + Y D + STG DV+HL S
Sbjct: 148 SITASPATCSDPLCSEGGSCRGNNNSCAYDISY-EDTSSSTGIYFRDVVHLG----HKAS 202
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDG 277
+++ + GC +G + +G+ G G K SVP+ LA Q N F C +G
Sbjct: 203 LNTTMFLGCATSISGLW----PVDGIMGFGRSKVSVPNQLAAQAGSYNIFYHCLSGEKEG 258
Query: 278 TGRISFGDKGS-PGQGETPFSLRQTHPTYNITITQVSVGGNAV-----NFEFSA------ 325
G + G P TP + YN+ + +SV A+ FE++A
Sbjct: 259 GGILVLGKNDEFPEMVYTP--MLANDIVYNVKLVSLSVNSKALPIEASEFEYNATVGNGG 316
Query: 326 -IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFE------YCYVLSPNQTNF 378
I DSGTS A + A K T+ P E + + N
Sbjct: 317 TIIDSGTSSATFPSKALALFVK-----AVSKFTTAIPTAPLESSGSPCFISISDRNSVEV 371
Query: 379 EYPVVNLTMKGGG 391
++P V L GG
Sbjct: 372 DFPNVTLKFDGGA 384
>gi|238011160|gb|ACR36615.1| unknown [Zea mays]
Length = 461
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 98/392 (25%), Positives = 149/392 (38%), Gaps = 82/392 (20%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-------CVSCVHGLNSSSGQ--------- 148
++ VG PA F++ DTGSDL W+ C + G N G
Sbjct: 55 YFVRFRVGTPARPFLLVADTGSDLTWVKCRRHAAPAPAPAPAPGYNYGYGAPASNDSSSV 114
Query: 149 ----VIDFNIYSPNTSSTSSKVPCNSTLCELQ-----KQCPSAGSNCPYQVRYLSDGTMS 199
++ P+ S T + +PC+S C CP+ GS C Y+ RY DG+ +
Sbjct: 115 SAAASSPARVFRPDRSRTWAPIPCSSDTCTASLPFSLAACPTPGSPCAYEYRY-KDGSAA 173
Query: 200 TGFLVEDVLHLA-----TDEKQSKSVDSRISFGCGRVQTG-SFLDGAAPNGLFGLGMDKT 253
G + D +A +KQ ++ + GC TG SFL A +G+ LG
Sbjct: 174 RGTVGTDSATIALSGRRAGKKQRRAKLRGVVLGCTTSYTGESFL---ASDGVLSLGYSNV 230
Query: 254 SVPSILANQGLIPNSFSMCF-----GSDGTGRISF-----------------GDKGSPGQ 291
S S A + FS C + T ++F G +PG
Sbjct: 231 SFASRAAAR--FGGRFSYCLVDHLAPRNATSYLTFGPNPAVSSASASRTACAGSAAAPGA 288
Query: 292 GETPFSL-RQTHPTYNITITQVSVGGNAVNFEF---------SAIFDSGTSFTYLNDPAY 341
+TP L + P Y + + VSV G + AI DSGTS T L PAY
Sbjct: 289 RQTPLLLDHRMRPFYAVAVNGVSVDGELLRIPRLVWDVQKGGGAILDSGTSLTVLVSPAY 348
Query: 342 TQISETFNSLAKEKRETSTSDL-PFEYCY----VLSPNQTNFEYPVVNLTMKGGGPFFVN 396
+ +L K+ + PF+YCY L+ P + + G
Sbjct: 349 RAV---VAALGKKLVGLPRVAMDPFDYCYNWTSPLTGEDLAVAVPALAVHFAGSARLQPP 405
Query: 397 DPIVIVSSEPKGLYLYCLGVVKSD--NVNIIG 426
++ + P + C+G+ + D V++IG
Sbjct: 406 PKSYVIDAAPG---VKCIGLQEGDWPGVSVIG 434
>gi|297829808|ref|XP_002882786.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
lyrata]
gi|297328626|gb|EFH59045.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
lyrata]
Length = 449
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 84/352 (23%), Positives = 140/352 (39%), Gaps = 46/352 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
++T V VG PA F V +DTGS+L W+ C G+V + ++ S +
Sbjct: 88 YFTEVRVGTPAKKFRVVVDTGSELTWVNC------RYRGRGKGKVKNRRVFRAEESKSFK 141
Query: 165 KVPCNSTLCELQ-------KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
V C + C++ CP+ + C Y RY +DG+ + G ++ + + +
Sbjct: 142 TVGCFTQTCKVDLMNLFSLSTCPTPSTPCSYDYRY-ADGSAAQGVFAKETITVGLTNGRK 200
Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDG 277
+ + GC +G GA +G+ GL S S + L S C
Sbjct: 201 ARLRGLL-VGCSSSFSGQSFQGA--DGVLGLAFSDFSFTSTATS--LFGAKLSYCLVDHL 255
Query: 278 TGR-----ISFG-------DKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA 325
+ + + FG K +PG+ TP L P Y I I +S+G + ++
Sbjct: 256 SNKNISNYLIFGYSSSSTSTKTAPGR-TTPLDLTLIPPFYAINIIGISIGDDMLDIPTQV 314
Query: 326 ---------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQT 376
I DSGTS T L + AY + E + +P EYC+ +
Sbjct: 315 WDATTGGGTILDSGTSLTLLAEAAYKPVVTGLARYLVELKRVKPEGIPIEYCFSSTSGFN 374
Query: 377 NFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDN--VNIIG 426
+ P + +KGG F + +V + P + CLG + + N++G
Sbjct: 375 ESKLPQLTFHLKGGARFEPHRKSYLVDAAPG---VKCLGFMSAGTPATNVVG 423
>gi|413953782|gb|AFW86431.1| hypothetical protein ZEAMMB73_038825 [Zea mays]
Length = 462
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 88/309 (28%), Positives = 129/309 (41%), Gaps = 34/309 (11%)
Query: 99 NSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFN--IY 155
SLG L + V G PA ++ + DTGSD+ W+ C+ C SG + I+
Sbjct: 113 TSLGTLEFVVTVGFGTPAQTYTLMFDTGSDVSWI--QCLPC-------SGHCYKQHDPIF 163
Query: 156 SPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEK 215
P S+T S VPC C S+ C Y+V+Y DG+ + G L + L L
Sbjct: 164 DPTKSATYSAVPCGHPQCAAAGGKCSSNGTCLYKVQY-GDGSSTAGVLSHETLSLT---- 218
Query: 216 QSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
S +FGCG G F D +GL GLG + S+ S A S+ + +
Sbjct: 219 -SARALPGFAFGCGETNLGDFGDV---DGLIGLGRGQLSLSSQAAASFGAAFSYCLPSYN 274
Query: 276 DGTGRISFGD----KGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNF------EFS 324
G ++ G GS G T +Q +P+ Y + + + VGG +
Sbjct: 275 TSHGYLTIGTTTPASGSDGVRYTAMIQKQDYPSFYFVDLVSIVVGGFVLPVPPILFTRDG 334
Query: 325 AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVN 384
+ DSGT TYL AYT + + F + + D PF+ CY + Q P+V+
Sbjct: 335 TLLDSGTVLTYLPPEAYTALRDRFKFTMTQYKPAPAYD-PFDTCYDFA-GQNAIFMPLVS 392
Query: 385 LTMKGGGPF 393
G F
Sbjct: 393 FKFSDGSSF 401
>gi|357130715|ref|XP_003566992.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 479
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 83/305 (27%), Positives = 121/305 (39%), Gaps = 47/305 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD---CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
++ VG PA F++ DTGSDL W+ C + +SS+ + P S
Sbjct: 95 YFVRFRVGTPAQPFLLVADTGSDLTWVKCRPAKAAAASTNSSSSASASSPRRAFRPEKSK 154
Query: 162 TSSKVPCNSTLCELQ-----KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLA----- 211
T + +PC S C CP+ GS C Y RY DG+ + G + + +A
Sbjct: 155 TWAPIPCASDTCSKSLPFSLSTCPTPGSPCAYDYRY-KDGSAARGTVGTESATIALSSSS 213
Query: 212 --TDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSF 269
+ K K+ + GC TG + A +G+ LG S S A++ F
Sbjct: 214 SSSKNKVKKAKLQGLVLGCTGSYTGPSFE--ASDGVLSLGYSNVSFASHAASR--FGGRF 269
Query: 270 SMCF-----GSDGTGRISFGDKGS----------PGQGETPFSL-RQTHPTYNITITQVS 313
S C + T ++FG + PG +TP L + P Y+++I +S
Sbjct: 270 SYCLVDHLSPRNATSYLTFGPNSALSGPCPAAAGPGARQTPLVLDSRMRPFYDVSIKAIS 329
Query: 314 VGGNAVNFEFS---------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP 364
V G + I DSGTS T L PAY + K R + P
Sbjct: 330 VDGELLKIPRDVWEVDGGGGVIVDSGTSLTVLAKPAYRAVVAALGK--KLARFPRVAMDP 387
Query: 365 FEYCY 369
FEYCY
Sbjct: 388 FEYCY 392
>gi|302768196|ref|XP_002967518.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
gi|300165509|gb|EFJ32117.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
Length = 398
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 85/306 (27%), Positives = 136/306 (44%), Gaps = 49/306 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ T +S+G PA F V DTGSDL W+ C C +C + + I+ P SS+
Sbjct: 40 YVTTISLGTPAKVFSVIADTGSDLIWIQCKPCQACFNQKDP---------IFDPEGSSSY 90
Query: 164 SKVPCNSTLCEL--QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
+ + C TLC+ +K C +C Y Y DG+ + G L + + L + + + K
Sbjct: 91 TTMSCGDTLCDSLPRKSC---SPDCDYSYGY-GDGSGTRGTLSSETVTLTSTQGE-KLAA 145
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-----GSD 276
I+FGCG + GSF D + GL GLG S S L + L + FS C
Sbjct: 146 KNIAFGCGHLNRGSFNDAS---GLVGLGRGNLSFVSQLGD--LFGHKFSYCLVPWRDAPS 200
Query: 277 GTGRISFGDKGSPGQG----ETPFSLRQTHPT----YNITITQVSVGGNAVNFEFSA--- 325
T + FGD+ S F+ +P Y + + +S+ G A+ +
Sbjct: 201 KTSPMFFGDESSSHSSGKKLHYAFTPMIHNPAMESFYYVKLKDISIAGRALRIPAGSFDI 260
Query: 326 --------IFDSGTSFTYLNDPAYTQISETFNS-LAKEKRETSTSDLPFEYCYVLSPNQT 376
IFDSGT+ T L D Y + S ++ K + S++ L + CY +S ++
Sbjct: 261 KPDGSGGMIFDSGTTLTLLPDAPYQIVLRALRSKISFPKIDGSSAGL--DLCYDVSGSKA 318
Query: 377 NFEYPV 382
+++ +
Sbjct: 319 SYKMKI 324
>gi|242084330|ref|XP_002442590.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
gi|241943283|gb|EES16428.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
Length = 494
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 103/357 (28%), Positives = 152/357 (42%), Gaps = 56/357 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ ++VG PA+ ++ALDT SDL WL C C C SG V D P S++
Sbjct: 134 YMAKIAVGTPAVQALLALDTASDLTWLQCQPCRRCY----PQSGPVFD-----PRHSTSY 184
Query: 164 SKVPCNSTLCELQKQCPSAGSN---CPYQVRYLSDG----TMSTGFLVEDVLHLATDEKQ 216
++ ++ C+ + + C Y V+Y DG + S G LVE+ L A +Q
Sbjct: 185 GEMNYDAPDCQALGRSGGGDAKRGTCIYTVQY-GDGHGSTSTSVGDLVEETLTFAGGVRQ 243
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--- 273
+ +S GCG G F GA G+ GLG + S+P +A G SFS C
Sbjct: 244 AY-----LSIGCGHDNKGLF--GAPAAGILGLGRGQISIPHQIAFLGY-NASFSYCLVDF 295
Query: 274 ----GSDGTGRISFGDKG---SPGQGETPFSLRQTHPT-YNITITQVSVGGNAV------ 319
GS + ++FG SP TP L Q PT Y + + VSVGG V
Sbjct: 296 ISGPGSP-SSTLTFGAGAVDTSPPASFTPTVLNQNMPTFYYVRLIGVSVGGVRVPGVTER 354
Query: 320 -------NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETST---SDLPFEYCY 369
I DSGT+ T L PAY + F + A + ST S L F+ CY
Sbjct: 355 DLQLDPYTGRGGVILDSGTTVTRLARPAYVAFRDAFRAAATSLGQVSTGGPSGL-FDTCY 413
Query: 370 VLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
+ + + P V++ GG + ++ + +G + +V++IG
Sbjct: 414 TVG-GRAGVKVPAVSMHFAGGVEVSLQPKNYLIPVDSRGTVCFAFAGTGDRSVSVIG 469
>gi|343198386|gb|AEM05966.1| nodulin 41 [Phaseolus vulgaris]
Length = 437
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 84/296 (28%), Positives = 121/296 (40%), Gaps = 33/296 (11%)
Query: 111 VGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCN 169
+G P + + DTGSDL W+ C C SC ++ P SST C
Sbjct: 96 IGTPPVERLATADTGSDLIWVQCSPCASCFPQSTP---------LFQPLKSSTFMPTTCR 146
Query: 170 STLCEL----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
S C L QK C +G C Y +Y + S G L + L +
Sbjct: 147 SQPCTLLLPEQKGCGKSG-ECIYTYKYGDQYSFSEGLLSTETLRFDSQGGVQTVAFPNSF 205
Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDGTGRIS 282
FGCG + G+ GLG S+ S + +Q I + FS C GS T ++
Sbjct: 206 FGCGLYNNITVFPSYKLTGIMGLGAGPLSLVSQIGDQ--IGHKFSYCLLPLGSTSTSKLK 263
Query: 283 FGDKG---SPGQGETPFSLRQTHPTYN-ITITQVSVGGNAV---NFEFSAIFDSGTSFTY 335
FG++ G TP ++ PTY + + V+V V + + + I DSGT TY
Sbjct: 264 FGNESIITGEGVVSTPMIIKPWLPTYYFLNLEAVTVAQKTVPTGSTDGNVIIDSGTLLTY 323
Query: 336 LNDPAYTQISETFN-SLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGG 390
L + Y + + SLA E + S LPF C+ P + NF +P + G
Sbjct: 324 LGESFYYNFAASLQESLAVELVQDVLSPLPF--CF---PYRDNFVFPEIAFQFTGA 374
>gi|297811185|ref|XP_002873476.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
lyrata]
gi|297319313|gb|EFH49735.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
lyrata]
Length = 475
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 96/303 (31%), Positives = 131/303 (43%), Gaps = 39/303 (12%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ V +G P + DTGSDL W C CV + I++P+ S++
Sbjct: 133 YIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQKEP--------IFNPSKSTSY 184
Query: 164 SKVPCNSTLC-ELQKQCPSAG----SNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
V C+S C L +AG SNC Y ++Y D + S GFL +D L + +
Sbjct: 185 YNVSCSSAACGSLSSATGNAGSCSASNCIYGIQY-GDQSFSVGFLAKDKFTLTSSD---- 239
Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDG- 277
V + FGCG G F A GL GLG DK S PS A FS C S
Sbjct: 240 -VFDGVYFGCGENNQGLFTGVA---GLLGLGRDKLSFPSQTATA--YNKIFSYCLPSSAS 293
Query: 278 -TGRISFGDKG-SPGQGETPFS-LRQTHPTYNITITQVSVGGNAVNFE---FS---AIFD 328
TG ++FG G S TP S + Y + I ++VGG + FS A+ D
Sbjct: 294 YTGHLTFGSAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVFSTPGALID 353
Query: 329 SGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP-FEYCYVLSPNQTNFEYPVVNLTM 387
SGT T L AY + +F AK + +TS + + C+ LS +T P V +
Sbjct: 354 SGTVITRLPPKAYAALRSSFK--AKMSKYPTTSGVSILDTCFDLSGFKT-VTIPKVAFSF 410
Query: 388 KGG 390
GG
Sbjct: 411 SGG 413
>gi|168054484|ref|XP_001779661.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162668975|gb|EDQ55572.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 419
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 94/324 (29%), Positives = 137/324 (42%), Gaps = 67/324 (20%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ + +G P F + +D+GSDL W+ C C+ C D +Y+P+ SST
Sbjct: 65 YFVDFFLGTPPQKFSLIVDSGSDLLWVQCAPCLQCY---------AQDTPLYAPSNSSTF 115
Query: 164 SKVPCNSTLCELQKQC---------PSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
+ VPC S C L P A C Y+ RY +D ++S G
Sbjct: 116 NPVPCLSPECLLIPATEGFPCDFHYPGA---CAYEYRY-ADTSLSKGVFA---------- 161
Query: 215 KQSKSVDS----RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFS 270
+S +VD +++FGCGR GSF AA G+ GLG S S + N F+
Sbjct: 162 YESATVDDVRIDKVAFGCGRDNQGSF---AAAGGVLGLGQGPLSFGSQVGYA--YGNKFA 216
Query: 271 MCF-----GSDGTGRISFGDKGSPGQGE---TPFSLRQTHPT-YNITITQVSVGGNAVNF 321
C + + + FGD+ + TP +PT Y + I +V VGG ++
Sbjct: 217 YCLVNYLDPTSVSSWLIFGDELISTIHDLQFTPIVSNSRNPTLYYVQIEKVMVGGESLPI 276
Query: 322 EFSA-----------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY- 369
SA IFDSGT+ TY PAY I F+ + R S L + C
Sbjct: 277 SHSAWSLDFLGNGGSIFDSGTTVTYWLPPAYRNILAAFDKNVRYPRAASVQGL--DLCVD 334
Query: 370 VLSPNQTNFEYPVVNLTMKGGGPF 393
V +Q +F P + + GG F
Sbjct: 335 VTGVDQPSF--PSFTIVLGGGAVF 356
>gi|449462553|ref|XP_004149005.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 418
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 94/346 (27%), Positives = 143/346 (41%), Gaps = 48/346 (13%)
Query: 60 YSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFI 119
+S+L+H DR R L+ + R + G + + +G P + ++
Sbjct: 46 FSSLSHYDRLANAFRRSLS-----------RSAALLNRAATSGAVGLQSSIIGTPPVDYL 94
Query: 120 VALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCEL--Q 176
DTGSDL W C C+ C L I++P S++ S VPCN+ C
Sbjct: 95 GIADTGSDLTWAQCLPCLKCYQQLRP---------IFNPLKSTSFSHVPCNTQTCHAVDD 145
Query: 177 KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSF 236
C G C Y Y D T S G L + + + S SV S I GCG +G F
Sbjct: 146 GHCGVQGV-CDYSYTY-GDRTYSKGDLGFEKITIG-----SSSVKSVI--GCGHASSGGF 196
Query: 237 LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG---SDGTGRISFGDKG---SPG 290
+G+ GLG + S+ S ++ I FS C S G+I+FG PG
Sbjct: 197 ---GFASGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHANGKINFGQNAVVSGPG 253
Query: 291 QGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS----AIFDSGTSFTYLNDPAYTQISE 346
TP + T Y IT+ +S+ GN + F+ I DSGT+ ++L Y +
Sbjct: 254 VVSTPLISKNTVTYYYITLEAISI-GNERHMAFAKQGNVIIDSGTTLSFLPKELYDGVVS 312
Query: 347 TFNSLAKEKRETSTSDLPFEYCYVLSPN-QTNFEYPVVNLTMKGGG 391
+ + K KR + ++ C+ N T+ P++ GG
Sbjct: 313 SLLKVVKAKRVKDPGNF-WDLCFDDGINVATSSGIPIITAQFSGGA 357
>gi|115484513|ref|NP_001065918.1| Os11g0184800 [Oryza sativa Japonica Group]
gi|122221757|sp|Q0IU52.1|ASP1_ORYSJ RecName: Full=Aspartic proteinase Asp1; Short=OSAP1; Short=OsAsp1;
AltName: Full=Nucellin-like protein; Flags: Precursor
gi|33340111|gb|AAQ14543.1|AF308691_1 nucellin-like protein [Oryza sativa Japonica Group]
gi|33340113|gb|AAQ14544.1|AF308692_1 nucellin-like protein [Oryza sativa Japonica Group]
gi|62954898|gb|AAY23267.1| nucellin-like protein [Oryza sativa Japonica Group]
gi|77548967|gb|ABA91764.1| Aspartic proteinase Asp1 precursor, putative, expressed [Oryza
sativa Japonica Group]
gi|113644622|dbj|BAF27763.1| Os11g0184800 [Oryza sativa Japonica Group]
gi|215766817|dbj|BAG99045.1| unnamed protein product [Oryza sativa Japonica Group]
gi|385717694|gb|AFI71282.1| aspartic proteinase [Oryza sativa Japonica Group]
Length = 410
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 76/282 (26%), Positives = 118/282 (41%), Gaps = 36/282 (12%)
Query: 106 YTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ +++G PA S+ + +DTGS L WL CD C +C ++ +Y P
Sbjct: 39 FITMNIGDPAKSYFLDIDTGSTLTWLQCDAPCTNC---------NIVPHVLYKPTPKKL- 88
Query: 164 SKVPCNSTLCE-------LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
V C +LC K+C S C Y ++Y+ +M G LV D L+
Sbjct: 89 --VTCADSLCTDLYTDLGKPKRCGSQ-KQCDYVIQYVDSSSM--GVLVIDRFSLSASNGT 143
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAP-NGLFGLGMDKTSVPSILANQGLI-PNSFSMCFG 274
+ + I+FGCG Q + P + + GL K ++ S L +QG+I + C
Sbjct: 144 NPTT---IAFGCGYDQGKKNRNVPIPVDSILGLSRGKVTLLSQLKSQGVITKHVLGHCIS 200
Query: 275 SDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE---FSAIFDSGT 331
S G G + FGD P G T + + H Y+ + N+ + IFDSG
Sbjct: 201 SKGGGFLFFGDAQVPTSGVTWTPMNREHKYYSPGHGTLHFDSNSKAISAAPMAVIFDSGA 260
Query: 332 SFTYLNDPAY----TQISETFNSLAKEKRETSTSDLPFEYCY 369
++TY Y + + T NS K E + D C+
Sbjct: 261 TYTYFAAQPYQATLSVVKSTLNSECKFLTEVTEKDRALTVCW 302
>gi|326506192|dbj|BAJ86414.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326516358|dbj|BAJ92334.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 492
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 95/351 (27%), Positives = 144/351 (41%), Gaps = 52/351 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++T + VG PA ++ LDTGSD+ WL C C C SGQV D P S +
Sbjct: 140 YFTKIGVGTPATPALMVLDTGSDVVWLQCAPCRRCYE----QSGQVFD-----PRRSRSY 190
Query: 164 SKVPCNSTLCELQKQ--CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
+ V C + LC C S C YQV Y DG+++ G + L A +
Sbjct: 191 NAVGCAAPLCRRLDSGGCDLRRSACLYQVAY-GDGSVTAGDFATETLTFAGGARV----- 244
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-------- 273
+R++ GCG G F+ A GL S P+ ++ + SFS C
Sbjct: 245 ARVALGCGHDNEGLFVAAAGLLGLG---RGSLSFPTQISRR--YGRSFSYCLVDRTSSAN 299
Query: 274 GSDGTGRISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGGNAV---------- 319
+ + ++FG + F+ +P Y + + +SVGG V
Sbjct: 300 TASRSSTVTFGSGAVGSTVASSFTPMVKNPRMETFYYVQLIGISVGGARVPGVANSDLRL 359
Query: 320 ---NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQT 376
+ I DSGTS T L PAY+ + + F A R + F+ CY LS +
Sbjct: 360 DPSSGRGGVIVDSGTSVTRLARPAYSALRDAFRGAAAGLRLSPGGFSLFDTCYDLSGRKV 419
Query: 377 NFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD-NVNIIG 426
+ P V++ GG + ++ + KG +C +D V+IIG
Sbjct: 420 -VKVPTVSMHFAGGAEAALPPENYLIPVDSKG--TFCFAFAGTDGGVSIIG 467
>gi|388502342|gb|AFK39237.1| unknown [Lotus japonicus]
Length = 440
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 80/284 (28%), Positives = 123/284 (43%), Gaps = 39/284 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWL---PCDCVSCVHGLNSSSGQVIDFNIYSPNTSS 161
+ + +G P++ + DTGSDL W+ PCD C + +Y P SS
Sbjct: 96 YLMRIYIGTPSVERLAIADTGSDLTWVQCSPCDNTKCF---------AQNTPLYDPLNSS 146
Query: 162 TSSKVPCNSTLCEL----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
T + +PC+S C Q C G +C Y Y D + S G L D + L +
Sbjct: 147 TFTLLPCDSQPCTQLPYSQYVCSDYG-DCIYAYTY-GDNSYSYGGLSSDSIRLMLLQLH- 203
Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FG 274
+S+I FGCG + G+ GLG S+ S L ++ I + FS C F
Sbjct: 204 --YNSKICFGCGFQNKFTADKSGKTTGIVGLGAGPLSLVSQLGDE--IGHKFSYCLLPFS 259
Query: 275 SDGTGRISFGDKG---SPGQGETPFSLRQTHPTYNITITQVSVGGNAV---NFEFSAIFD 328
S+ ++ FG+ G TP ++ P Y + + ++VG V + + I D
Sbjct: 260 SNSNSKLKFGEAAIVQGNGVVSTPLIIKPDLPFYYLNLEGITVGAKTVKTGQTDGNIIID 319
Query: 329 SGTSFTYLNDPAYTQISETFNSLAKEK---RETSTSDLPFEYCY 369
SG++ TYL + Y + F SL KE E PF++C+
Sbjct: 320 SGSTLTYLEESFYNE----FVSLVKETVAVEEDQYIPYPFDFCF 359
>gi|357130848|ref|XP_003567056.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 448
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 88/346 (25%), Positives = 135/346 (39%), Gaps = 45/346 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ +V VG P ++ +DTGSD+ WL C CV C L+ +Y P SST
Sbjct: 99 YFASVGVGTPPTPALLVIDTGSDVVWLQCKPCVHCYRQLSP---------LYDPRGSSTY 149
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
++ PC+ C + C C Y++ Y D + ++G L D L + D
Sbjct: 150 AQTPCSPPQCRNPQTCDGTTGGCGYRIVY-GDASSTSGNLATDRLVFSNDTSVGN----- 203
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-----DGT 278
++ GCG G F A GL G+ S + +A+ F+ C G +
Sbjct: 204 VTLGCGHDNEGLFGSAA---GLLGVARGNNSFATQVADS--YGRYFAYCLGDRTRSGSSS 258
Query: 279 GRISFGDKG--SPGQGETPFSLRQTHPT-YNITITQVSVGGNAV----NFEFS------- 324
+ FG P TP P+ Y + + SVGG V N S
Sbjct: 259 SYLVFGRTAPEPPSSVFTPLRSNPRRPSLYYVDMVGFSVGGEPVTGFSNASLSLDPATGR 318
Query: 325 --AIFDSGTSFTYLNDPAYTQISETFNSLAKE--KRETSTSDLPFEYCYVLSPNQTNFEY 380
+ DSGTS T AY + + F++ A + R+ F+ CY L +
Sbjct: 319 GGVVVDSGTSITRFARDAYGALRDAFDARAAKVGMRKVGRGISVFDACYDLR-GVAVADA 377
Query: 381 PVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
P V L GG + +V E + + L D +++IG
Sbjct: 378 PGVVLHFAGGADVALPPENYLVPEESGRYHCFALEAAGHDGLSVIG 423
>gi|159463556|ref|XP_001690008.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
gi|158283996|gb|EDP09746.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
Length = 547
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 63/196 (32%), Positives = 94/196 (47%), Gaps = 17/196 (8%)
Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTS 160
LG+ +YT +++G P + LDTGS L PC C S +G ++ P S
Sbjct: 78 LGY-YYTYLTIGTPGQTVSGILDTGSTLPAFPCS--GCTRCGPSKTG------MFKPELS 128
Query: 161 STSSKVPCNSTLCEL-QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
STSS C+ C C C Y +RYL +G+ ++GFL ED+L + +
Sbjct: 129 STSSTFGCSDARCFCGANSCSCNNEQCGYSIRYL-EGSSTSGFLAEDMLAVGDGGPAANF 187
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTG 279
V FGC + ++G L +G+FG+G S+ L QG+I ++FSMCFG+ G
Sbjct: 188 V-----FGCAQSESG-LLYSQIADGVFGMGRTPASLYGQLVQQGVIDDAFSMCFGAPREG 241
Query: 280 RISFGDKGSPGQGETP 295
+ G+ P P
Sbjct: 242 VLLLGNVALPADAPAP 257
>gi|222615640|gb|EEE51772.1| hypothetical protein OsJ_33215 [Oryza sativa Japonica Group]
Length = 775
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 76/282 (26%), Positives = 119/282 (42%), Gaps = 36/282 (12%)
Query: 106 YTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ +++G PA S+ + +DTGS L WL CD C +C ++ +Y P +
Sbjct: 404 FITMNIGDPAKSYFLDIDTGSTLTWLQCDAPCTNC---------NIVPHVLYKP---TPK 451
Query: 164 SKVPCNSTLCE-------LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
V C +LC K+C S C Y ++Y+ +M G LV D L+
Sbjct: 452 KLVTCADSLCTDLYTDLGKPKRCGSQ-KQCDYVIQYVDSSSM--GVLVIDRFSLSASNGT 508
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAP-NGLFGLGMDKTSVPSILANQGLI-PNSFSMCFG 274
+ + I+FGCG Q + P + + GL K ++ S L +QG+I + C
Sbjct: 509 NPTT---IAFGCGYDQGKKNRNVPIPVDSILGLSRGKVTLLSQLKSQGVITKHVLGHCIS 565
Query: 275 SDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE---FSAIFDSGT 331
S G G + FGD P G T + + H Y+ + N+ + IFDSG
Sbjct: 566 SKGGGFLFFGDAQVPTSGVTWTPMNREHKYYSPGHGTLHFDSNSKAISAAPMAVIFDSGA 625
Query: 332 SFTYLNDPAY----TQISETFNSLAKEKRETSTSDLPFEYCY 369
++TY Y + + T NS K E + D C+
Sbjct: 626 TYTYFAAQPYQATLSVVKSTLNSECKFLTEVTEKDRALTVCW 667
Score = 42.0 bits (97), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 47/167 (28%), Positives = 71/167 (42%), Gaps = 27/167 (16%)
Query: 184 SNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQ-TGSFLDGAAP 242
+ C Y+++Y +DG + G L+ D L + + FGCG Q G +P
Sbjct: 27 TQCDYEIKY-ADGASTIGALIVDQFSLP-----RIATRPNLPFGCGYNQGIGENFQQTSP 80
Query: 243 -NGLFGLGMDKTSVPSILANQGLI-PNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQ 300
NG+ GL K S S L G+I + C S G G + GD G G +L
Sbjct: 81 VNGILGLDRGKVSFVSQLKMLGIITKHVVGHCLSSGGGGLLFVGD----GDG----NLVL 132
Query: 301 THPTY------NITITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAY 341
H Y + + S+G N ++ +FDSG+++TY Y
Sbjct: 133 LHANYYSPGSATLYFDRHSLGMNPMD----VVFDSGSTYTYFTAQPY 175
>gi|357481199|ref|XP_003610885.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512220|gb|AES93843.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 416
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 91/337 (27%), Positives = 138/337 (40%), Gaps = 43/337 (12%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
H + +G P + +DTGSDL W+ C C+ C + ++ P SST
Sbjct: 68 HLMEIYIGTPPIKITGLVDTGSDLIWIQCAPCLGCYKQIKP---------MFDPLKSSTY 118
Query: 164 SKVPCNSTLC-ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
+ + C+S LC +L S C Y Y D +++ G L +D ++ + S+ S
Sbjct: 119 NNISCDSPLCHKLDTGVCSPEKRCNYTYGY-GDNSLTKGVLAQDTATFTSNTGKPVSL-S 176
Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILA--------NQGLIPNSFSMCFG 274
R FGCG TG F D GL GLG TS+ S + +Q L+P +
Sbjct: 177 RFLFGCGHNNTGGFNDHEM--GLIGLGGGPTSLISQIGPLFGGKKFSQCLVPFLTDIKIS 234
Query: 275 SDGTGRISFGDKGSPGQGE----TPFSLRQTHPTYNITITQVSVGG-----NAVNFEFSA 325
S R+SFG KGS G TP R+ +Y +T+ +SV N+ + +
Sbjct: 235 S----RMSFG-KGSQVLGNGVVTTPLVPREKDTSYFVTLLGISVEDTYFPMNSTIGKANM 289
Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNL 385
+ DSGT L Y ++ + K T L + CY QTN + P +
Sbjct: 290 LVDSGTPPILLPQQLYDKVFAEVRNKVALKPITDDPSLGTQLCYR---TQTNLKGPTLTF 346
Query: 386 TMKGGGPFFVNDPI-VIVSSEPKGLYLYCLGVVKSDN 421
G PI + P+ ++CL + N
Sbjct: 347 HFVGANVLLT--PIQTFIPPTPQTKGIFCLAIYNRTN 381
>gi|356532674|ref|XP_003534896.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 446
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 89/332 (26%), Positives = 143/332 (43%), Gaps = 58/332 (17%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSS- 164
N+S+GQP++ +V +DTGSD+ W+ C+ C +C + L ++ P+ SST S
Sbjct: 103 VNLSIGQPSIPQLVVMDTGSDILWIMCNPCTNCDNHLGL---------LFDPSMSSTFSP 153
Query: 165 --KVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
K PC C+ P+ + Y+ + + S F + ++ TDE S+ D
Sbjct: 154 LCKTPCGFKGCKCDP--------IPFTISYVDNSSASGTFGRDILVFETTDEGTSQISD- 204
Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT---- 278
+ GCG F NG+ GL + P+ LA Q I FS C G+
Sbjct: 205 -VIIGCG--HNIGFNSDPGYNGILGL----NNGPNSLATQ--IGRKFSYCIGNLADPYYN 255
Query: 279 -GRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVN-----FEFS------AI 326
++ G+ TPF + H Y +T+ +SVG ++ FE I
Sbjct: 256 YNQLRLGEGADLEGYSTPFEVY--HGFYYVTMEGISVGEKRLDIALETFEMKRNGTGGVI 313
Query: 327 FDSGTSFTYLNDPAYTQI-SETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV-- 383
DSGT+ TYL D A+ + +E N L R+ + P++ CY ++ +PVV
Sbjct: 314 LDSGTTITYLVDSAHKLLYNEVRNLLKWSFRQVIFENAPWKLCYYGIISRDLVGFPVVTF 373
Query: 384 ------NLTMKGGGPFFVNDPIVIVSSEPKGL 409
+L + G F D I ++ P +
Sbjct: 374 HFVDGADLALDTGSFFSQRDDIFCMTVSPASI 405
>gi|37542277|gb|AAK81699.1| aspartyl proteinase [Oryza sativa]
Length = 411
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 74/279 (26%), Positives = 120/279 (43%), Gaps = 29/279 (10%)
Query: 106 YTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNT--SS 161
+ +++ PA + + +DTGS L WL CD C++C + +Y P +
Sbjct: 39 FVTMNISDPAKPYFLDIDTGSTLTWLQCDYPCINC---------NKVPHGLYKPELKYAV 89
Query: 162 TSSKVPCNSTLCELQKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
++ C +L+K N C Y ++Y+ G S G L+ D L +
Sbjct: 90 KCTEQRCADLYADLRKPMKCGPKNQCHYGIQYV--GGSSIGVLIVDSFSLPASNGTN--- 144
Query: 221 DSRISFGCGRVQTGSFLDGAAP-NGLFGLGMDKTSVPSILANQGLI-PNSFSMCFGSDGT 278
+ I+FGCG Q + + P NG+ GLG K ++ S L +QG+I + C S G
Sbjct: 145 PTSIAFGCGYNQGKNNHNVPTPVNGILGLGRGKVTLLSQLKSQGVITKHVLGHCISSKGK 204
Query: 279 GRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVN----FEFSAIFDSGTSFT 334
G + FGD P G T + + H Y+ + N + IFDSG ++T
Sbjct: 205 GFLFFGDAKVPTSGVTWSPMNREHKHYSPRQGTLHFNSNKQSPISAAPMEVIFDSGATYT 264
Query: 335 YLN-DPAYTQISETFNSLAKEKR---ETSTSDLPFEYCY 369
Y P + +S ++L+KE + E D C+
Sbjct: 265 YFALQPYHATLSVVKSTLSKECKFLTEVKEKDRALTVCW 303
>gi|356516413|ref|XP_003526889.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 115/427 (26%), Positives = 167/427 (39%), Gaps = 74/427 (17%)
Query: 29 TFGFDFHHRYSDPVKGILAVDDLP---KKGSFAYYSALAHRDRYFRLRGRGLAA----QG 81
T GF R+ D K + ++ + K+G + R +L LAA
Sbjct: 44 TNGFRVMLRHVDSGKNLTKLERVQHGIKRG----------KSRLQKLNAMVLAASSTPDS 93
Query: 82 NDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVH 140
D+ AGN Y + +++G P +S+ LDTGSDL W C C C
Sbjct: 94 EDQLEAPIHAGNGEYLIE---------LAIGTPPVSYPAVLDTGSDLIWTQCKPCTRCYK 144
Query: 141 GLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSA--GSNCPYQVRYLSDGTM 198
I+ P SS+ SKV C S+LC PS+ C Y Y D +M
Sbjct: 145 QPTP---------IFDPKKSSSFSKVSCGSSLCS---ALPSSTCSDGCEYVYSY-GDYSM 191
Query: 199 STGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSI 258
+ G L + + ++K I FGCG G + A+ GL GLG S+ S
Sbjct: 192 TQGVLATETFTFG--KSKNKVSVHNIGFGCGEDNEGDGFEQAS--GLVGLGRGPLSLVSQ 247
Query: 259 LANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGE-------TPFSLRQTHPT-YNITIT 310
L Q FS C + S GS G+ + TP P+ Y +++
Sbjct: 248 LKEQ-----RFSYCLTPIDDTKESVLLLGSLGKVKDAKEVVTTPLLKNPLQPSFYYLSLE 302
Query: 311 QVSVGGNAVNFEFS-----------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETS 359
+SVG ++ E S I DSGT+ TY+ AY + + F S K +
Sbjct: 303 AISVGDTRLSIEKSTFEVGDDGNGGVIIDSGTTITYVQQKAYEALKKEFISQTKLALD-K 361
Query: 360 TSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKS 419
TS + C+ L T E P + KGG + +I S L + CL + S
Sbjct: 362 TSSTGLDLCFSLPSGSTQVEIPKLVFHFKGGDLELPAENYMIGDSN---LGVACLAMGAS 418
Query: 420 DNVNIIG 426
++I G
Sbjct: 419 SGMSIFG 425
>gi|24417232|gb|AAN60226.1| unknown [Arabidopsis thaliana]
Length = 464
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 90/297 (30%), Positives = 127/297 (42%), Gaps = 34/297 (11%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ + +G P + DTGSDL W C+ C+ + S + FN P++SST
Sbjct: 132 YIVTIGIGTPKHDLSLVFDTGSDLTWTQCEPCLGSCY-----SQKEPKFN---PSSSSTY 183
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
V C+S +CE + C + SNC Y + Y D + + GFL ++ L + V
Sbjct: 184 QNVSCSSPMCEDAESC--SASNCVYSIGY-GDKSFTQGFLAKEKFTLTNSD-----VLED 235
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDGTGR 280
+ FGCG G F A GL + + + N N FS C F S+ TG
Sbjct: 236 VYFGCGENNQGLFDGVAGLLGLGPGKLSLPAQTTTTYN-----NIFSYCLPSFTSNSTGH 290
Query: 281 ISFGDKG-SPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF---EFS---AIFDSGTSF 333
++FG G S TP S + Y I I +SVG + FS AI DSGT F
Sbjct: 291 LTFGSAGISESVKFTPISSFPSAFNYGIDIIGISVGDKELAITPNSFSTEGAIIDSGTVF 350
Query: 334 TYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGG 390
T L Y ++ F + TS L F+ CY + T YP + + GG
Sbjct: 351 TRLPTKVYAELRSVFKEKMSSYKSTSGYGL-FDTCYDFTGLDT-VTYPTIAFSFAGG 405
>gi|125524353|gb|EAY72467.1| hypothetical protein OsI_00323 [Oryza sativa Indica Group]
Length = 500
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 91/340 (26%), Positives = 144/340 (42%), Gaps = 45/340 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+++ V VG PA + LDTGSD+ W+ C C C + ++ P+ S++
Sbjct: 163 YFSRVGVGSPARQLYMVLDTGSDVTWVQCQPCADCYQQSDP---------VFDPSLSTSY 213
Query: 164 SKVPCNSTLCE--LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
+ V C++ C C ++ C Y+V Y DG+ + G + L L S
Sbjct: 214 ASVACDNPRCHDLDAAACRNSTGACLYEVAY-GDGSYTVGDFATETLTLGDSAPVSS--- 269
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGT 278
++ GCG G F+ A L G + S PS ++ +FS C S +
Sbjct: 270 --VAIGCGHDNEGLFVGAAGLLALGGGPL---SFPSQISA-----TTFSYCLVDRDSPSS 319
Query: 279 GRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA-----------IF 327
+ FGD +T Y + ++ +SVGG ++ SA I
Sbjct: 320 STLQFGDAADAEVTAPLIRSPRTSTFYYVGLSGISVGGQILSIPPSAFAMDGTGAGGVIV 379
Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTM 387
DSGT+ T L AY + + F + TS L F+ CY LS ++T+ E P V+L
Sbjct: 380 DSGTAVTRLQSSAYAALRDAFVRGTQSLPRTSGVSL-FDTCYDLS-DRTSVEVPAVSLRF 437
Query: 388 KGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD-NVNIIG 426
GGG + ++ + G YCL ++ V+IIG
Sbjct: 438 AGGGELRLPAKNYLIPVDGAG--TYCLAFAPTNAAVSIIG 475
>gi|242066142|ref|XP_002454360.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
gi|241934191|gb|EES07336.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
Length = 466
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 105/345 (30%), Positives = 151/345 (43%), Gaps = 46/345 (13%)
Query: 99 NSLGFLHY-TNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSP 157
SL L Y V +G P S + +DTGSD+ W+ C S H ++ P
Sbjct: 126 TSLDTLEYLITVRLGSPGKSQTMLIDTGSDVSWVQCKPCSQCHSQADP--------LFDP 177
Query: 158 NTSSTSSKVPCNSTLCELQKQ----CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
++SST S C+S C Q C S S C Y V Y DG+ +TG D L L ++
Sbjct: 178 SSSSTYSPFSCSSAACAQLGQEGNGCSS--SQCQYTVTY-GDGSSTTGTYSSDTLALGSN 234
Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
+ + FGC V++G F D +GL GLG S+ S A G +FS C
Sbjct: 235 AVR------KFQFGCSNVESG-FND--QTDGLMGLGGGAQSLVSQTA--GTFGAAFSYCL 283
Query: 274 --GSDGTGRISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVN-----FEFSA 325
S +G ++ G G+ G +TP PT Y + I + VGG ++ F
Sbjct: 284 PATSSSSGFLTLG-AGTSGFVKTPMLRSSQVPTFYGVRIQAIRVGGRQLSIPTSVFSAGT 342
Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNL 385
I DSGT T L AY+ +S F + K+ S + + C+ S Q++ P V L
Sbjct: 343 IMDSGTVLTRLPPTAYSALSSAFKAGMKQYPSAPPSGI-LDTCFDFS-GQSSVSIPTVAL 400
Query: 386 TMKGGGPF-FVNDPIVIVSSEPKGLYLYCLG-VVKSDN--VNIIG 426
GG +D I++ +S + CL SD+ + IIG
Sbjct: 401 VFSGGAVVDIASDGIMLQTSNS----ILCLAFAANSDDSSLGIIG 441
>gi|326494754|dbj|BAJ94496.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326514480|dbj|BAJ96227.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 449
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 96/344 (27%), Positives = 145/344 (42%), Gaps = 52/344 (15%)
Query: 108 NVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
++S+G PA+++ +DTGSDL W C CV N S+ ++ P++SST + +P
Sbjct: 105 DMSIGTPAVAYAAIIDTGSDLVWTQCK--PCVECFNQST------PVFDPSSSSTYAALP 156
Query: 168 CNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFG 227
C+STLC + C Y Y D + + G L + LA K+ ++FG
Sbjct: 157 CSSTLCSDLPSSKCTSAKCGYTYTY-GDSSSTQGVLAAETFTLA------KTKLPDVAFG 209
Query: 228 CGRVQTGS-FLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGTGR----- 280
CG G F GA GL GLG S+ S L GL N FS C S D T +
Sbjct: 210 CGDTNEGDGFTQGA---GLVGLGRGPLSLVSQL---GL--NKFSYCLTSLDDTSKSPLLL 261
Query: 281 -----ISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA--------- 325
IS + TP + P+ Y + + ++VG + SA
Sbjct: 262 GSLATISESAAAASSVQTTPLIRNPSQPSFYYVNLKGLTVGSTHITLPSSAFAVQDDGTG 321
Query: 326 --IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTN-FEYPV 382
I DSGTS TYL Y + + F + K S + + C+ + + E P
Sbjct: 322 GVIVDSGTSITYLELQGYRALKKAFAAQMKLPAADG-SGIGLDTCFEAPASGVDQVEVPK 380
Query: 383 VNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
+ + G + +++ S L CL V+ S ++IIG
Sbjct: 381 LVFHLDGADLDLPAENYMVLDSGSGAL---CLTVMGSRGLSIIG 421
>gi|224092218|ref|XP_002309514.1| predicted protein [Populus trichocarpa]
gi|222855490|gb|EEE93037.1| predicted protein [Populus trichocarpa]
Length = 474
Score = 83.2 bits (204), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 100/341 (29%), Positives = 144/341 (42%), Gaps = 48/341 (14%)
Query: 66 RDRYFRLRGRGLAAQGNDKTP----LTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVA 121
+ R ++ G G+ + K P + GN + V +G P F +
Sbjct: 103 QARLSKISGHGIFEEMVTKLPAQSGIAIGTGN-----------YVVTVGLGTPKEDFTLV 151
Query: 122 LDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCEL----QK 177
DTGS + W C C+ Q D P S++ + V C+S C L ++
Sbjct: 152 FDTGSGITWTQCQ--PCLGSCYPQKEQKFD-----PTKSTSYNNVSCSSASCNLLPTSER 204
Query: 178 QCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFL 237
C ++ S C YQ+ Y D + S GF + L ++ S V + FGCG+ G F
Sbjct: 205 GCSASNSTCLYQIIY-GDQSYSQGFFATETLTIS-----SSDVFTNFLFGCGQSNNGLFG 258
Query: 238 DGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS--DGTGRISFGDKGSPGQGETP 295
A GL GL S+PS A + FS C S TG ++FG K S G TP
Sbjct: 259 QAA---GLLGLSSSSVSLPSQTAEK--YQKQFSYCLPSTPSSTGYLNFGGKVSQTAGFTP 313
Query: 296 FSLRQTHPTYNITITQVSVGGNAVNFEFS------AIFDSGTSFTYLNDPAYTQISETFN 349
S Y I I +SV G+ + + S AI DSGT T L AY + E F+
Sbjct: 314 IS-PAFSSFYGIDIVGISVAGSQLPIDPSIFTTSGAIIDSGTVITRLPPTAYKALKEAFD 372
Query: 350 SLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGG 390
+T+ +L + CY S N T +P V+++ KGG
Sbjct: 373 EKMSNYPKTNGDEL-LDTCYDFS-NYTTVSFPKVSVSFKGG 411
>gi|242081123|ref|XP_002445330.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
gi|241941680|gb|EES14825.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
Length = 543
Score = 83.2 bits (204), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 110/424 (25%), Positives = 167/424 (39%), Gaps = 73/424 (17%)
Query: 35 HHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLR---GRGLAAQ---GNDKTPLT 88
H RY ++ +LA D+ R F+LR R AA G+ + PLT
Sbjct: 133 HDRY---LRRLLAADE--------------SRANSFQLRIRNDRAAAASTQSGSAEVPLT 175
Query: 89 FSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSG 147
+G LN + + S G PA + V +DTGSDL W+ C C +C +
Sbjct: 176 --SGIRFQTLNYVTTIALGGGSSGSPAANLTVIVDTGSDLTWVQCKPCSACYAQRDP--- 230
Query: 148 QVIDFNIYSPNTSSTSSKVPCNSTLCELQKQ--------CPSAGSNCPYQVRYLSDGTMS 199
++ P S+T + V CN++ C + C C Y + Y DG+ S
Sbjct: 231 ------LFDPAGSATYAAVRCNASACAASLKAATGTPGSCGGGNERCYYALAY-GDGSFS 283
Query: 200 TGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSIL 259
G L D + L S+D + FGCG G F GL GLG + S+ S
Sbjct: 284 RGVLATDTVALG-----GASLDGFV-FGCGLSNRGLF---GGTAGLMGLGRTELSLVSQT 334
Query: 260 ANQGLIPNSFSMCF----GSDGTGRISFGDKGSPGQGETPFSLRQT------HPTYNITI 309
A + FS C D +G +S G S + TP + + P Y + +
Sbjct: 335 ALR--YGGVFSYCLPATTSGDASGSLSLGGDASSYRNTTPVAYTRMIADPAQPPFYFLNV 392
Query: 310 TQVSVGGNAVNFE----FSAIFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLP 364
T +VGG A+ + + + DSGT T L Y + F A T+
Sbjct: 393 TGAAVGGTALAAQGLGASNVLIDSGTVITRLAPSVYRGVRAEFTRQFAAAGYPTAPGFSI 452
Query: 365 FEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDP--IVIVSSEPKGLYLYCLGVVKSDNV 422
+ CY L+ + P++ L ++GG V+ + +V + + L + D
Sbjct: 453 LDTCYDLT-GHDEVKVPLLTLRLEGGAEVTVDAAGMLFVVRKDGSQVCLAMASLSYEDQT 511
Query: 423 NIIG 426
IIG
Sbjct: 512 PIIG 515
>gi|88174565|gb|ABD39357.1| chloroplast nucleoid DNA-binding protein [Oryza glumipatula]
Length = 323
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 86/346 (24%), Positives = 147/346 (42%), Gaps = 47/346 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+ +V +G P+ + IV +DTGS W+ C+C C H + + + S+T +
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSTSWVFCECDGC-H---------TNPRTFLQSRSTTCA 50
Query: 165 KVPCNSTLCELQKQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
KV C +++C L P +CP++V Y DG+ S G L +D L + +K
Sbjct: 51 KVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSY-QDGSASYGILYQDTLTFSDVQKIPG- 108
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC------- 272
+FGC G+ G +GL G+G + SV L + FS C
Sbjct: 109 ----FTFGCNMDSFGANEFGNV-DGLLGMGAGQMSV---LKQSSPTFDGFSYCLPLQMSE 160
Query: 273 --FGSDGTGRISFGDKGSPGQGETPF----SLRQTHPTYNITITQVSVGGNAVNFEFS-- 324
F S TG S G K + + + + + R+ + + +T +SV G + S
Sbjct: 161 RGFFSKTTGYFSLGGKIAATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSIF 220
Query: 325 ----AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEY 380
+FDSG+ +Y+ D A + +S+ L R + + CY + +
Sbjct: 221 SRKGVVFDSGSELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRSVDEG-DM 277
Query: 381 PVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
P ++L G F + V V + ++CL +++V+IIG
Sbjct: 278 PAISLHFDDGARFDLGRHGVFVERSVQEQDVWCLAFAPTESVSIIG 323
>gi|242091327|ref|XP_002441496.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
gi|241946781|gb|EES19926.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
Length = 466
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 101/401 (25%), Positives = 149/401 (37%), Gaps = 60/401 (14%)
Query: 65 HRDRYFR------LRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSF 118
HR Y R RGR A G + S+G T ++ VG PA F
Sbjct: 60 HRHAYIRSQLASSRRGRRAAEVGASAFAMPLSSGAYTGTGQ-----YFVRFRVGTPAQPF 114
Query: 119 IVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQ-- 176
++ DTGSDL W+ C G + S ++ S + + + C+S C
Sbjct: 115 VLVADTGSDLTWVKCRGAGAAAGTGAGSPA----RVFRTAASKSWAPIACSSDTCTSYVP 170
Query: 177 ---KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR---------- 223
C S S C Y RY DG+ + G + D +A +
Sbjct: 171 FSLANCSSPASPCAYDYRY-RDGSAARGVVGTDSATIALSSGSGRGGGDSSGGRRAKLQG 229
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-----GSDGT 278
+ GC G + +G+ LG S S A + FS C + T
Sbjct: 230 VVLGCAATYDGQSFQSS--DGVLSLGNSNISFASRAAAR--FGGRFSYCLVDHLAPRNAT 285
Query: 279 GRISFGDKGSPGQGETPFSL-RQTHPTYNITITQVSVGGNAVNFEFS---------AIFD 328
++FG + +TP L R+ P Y +T+ V V G A++ AI D
Sbjct: 286 SYLTFGPGATAPAAQTPLLLDRRMTPFYAVTVDAVYVAGEALDIPADVWDVDRNGGAILD 345
Query: 329 SGTSFTYLNDPAYTQISETFNS-LAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTM 387
SGTS T L PAY + + LA R T PFEYCY + + E P + +
Sbjct: 346 SGTSLTILATPAYRAVVTALSKHLAGLPRVTMD---PFEYCYNWT-DAGALEIPKMEVHF 401
Query: 388 KGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD--NVNIIG 426
G ++ + P + C+GV + V++IG
Sbjct: 402 AGSARLEPPAKSYVIDAAPG---VKCIGVQEGSWPGVSVIG 439
>gi|300681506|emb|CBH32600.1| pepsin A, putative, expressed [Triticum aestivum]
Length = 477
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 92/365 (25%), Positives = 148/365 (40%), Gaps = 54/365 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ VG PA F++ DTGSDL W+ C S L+ + + P S T
Sbjct: 97 YFVRFRVGTPAQPFLLVADTGSDLTWVKCRRPASANSSLSPADSGPGPGRAFRPEDSRTW 156
Query: 164 SKVPCNSTLCELQ-----KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
+ + C S C CP+ GS C Y RY DG+ + G + + +A ++ +
Sbjct: 157 APISCASDTCTKSLPFSLATCPTPGSPCAYDYRY-KDGSAARGTVGTESATIALSGREER 215
Query: 219 SVDSR-ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---- 273
+ + GC TG + A +G+ LG S S A++ FS C
Sbjct: 216 KAKLKGLVLGCSSSYTGPSFE--ASDGVLSLGYSGISFASHAASR--FGGRFSYCLVDHL 271
Query: 274 -GSDGTGRISFGDK---GSPGQG------------ETPFSL-RQTHPTYNITITQVSVGG 316
+ T ++FG SP +TP L R+ P Y++++ +SV G
Sbjct: 272 SPRNATSYLTFGPNPAVSSPRASPSSCAAAAPRARQTPLLLDRRMRPFYDVSLKAISVAG 331
Query: 317 NAVNFEFS---------AIFDSGTSFTYLNDPAYTQISETFNS-LAKEKRETSTSDLPFE 366
+ + I DSGTS T L PAY + + LA R T PFE
Sbjct: 332 EFLKIPRAVWDVEAGGGVILDSGTSLTVLAKPAYRAVVAALSKGLAGLPRVTMD---PFE 388
Query: 367 YCY-VLSPNQTNFEYPVVNLTMKGGGPFFVNDP--IVIVSSEPKGLYLYCLGVVKS--DN 421
YCY SP+ + + V + + G + P ++ + P + C+G+ +
Sbjct: 389 YCYNWTSPSGKDADVAVPKMAVHFAGAARLEPPGKSYVIDAAPG---VKCIGLQEGPWPG 445
Query: 422 VNIIG 426
+++IG
Sbjct: 446 ISVIG 450
>gi|357162717|ref|XP_003579500.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 488
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 96/326 (29%), Positives = 129/326 (39%), Gaps = 55/326 (16%)
Query: 108 NVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
+VS+G P V LDTGS L W+PC +SS + ++ P SS+S V
Sbjct: 94 SVSLGTPPQPLPVLLDTGSHLSWVPCTSSYQCRNCSSSPSAMSAMAVFHPKNSSSSRLVG 153
Query: 168 CNSTLCEL-----QKQCPSAGSN------CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
C + C C S G+N PY V Y S T +G L+ D L L+
Sbjct: 154 CRNPACRWIHSKSPSTCGSTGNNGNGDVCPPYLVVYGSGST--SGLLISDTLRLSPSSSS 211
Query: 217 SKSVDSR-ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-- 273
S R + GC V P+GL G G SVPS L +P FS C
Sbjct: 212 SAPAPFRNFAIGCSIVSVHQ-----PPSGLAGFGRGAPSVPSQLK----VPK-FSYCLLS 261
Query: 274 -----GSDGTGRISFGDKGSP-GQGETPFSL------RQTHPTYNI----TITQVSVGGN 317
S +G + GD P G+ +T + P Y++ +T +SVGG
Sbjct: 262 RRFDDNSAVSGELVLGDAMVPAGKKKTTMQYVPLLNNAASKPPYSVYYYLALTGISVGGK 321
Query: 318 AVNFEFS---------AIFDSGTSFTYLNDPAYTQISETFNSL--AKEKRETSTSD-LPF 365
VN AI DSGT+FTYL+ + ++ S + R D L
Sbjct: 322 PVNLPSRAFVPSSGGGAIIDSGTTFTYLDPTVFKPVAAAMESAVGGRYNRSRPVEDALGL 381
Query: 366 EYCYVLSPNQTN-FEYPVVNLTMKGG 390
C+ L P E P + L KGG
Sbjct: 382 RPCFALPPGPGGAMELPDLELKFKGG 407
>gi|326490656|dbj|BAJ89995.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 456
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 106/393 (26%), Positives = 165/393 (41%), Gaps = 54/393 (13%)
Query: 28 GTFGFDFHHRY----SDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGND 83
G HHR+ + P ++D+ ++ +A R +Y + G +G+D
Sbjct: 55 GVVTVPLHHRHGPCSTVPSTNAPTLEDMLRRDQL--RAAYITR-KYSGVNGSAGDVEGSD 111
Query: 84 KT-PLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGL 142
T P T DT + V +G PA++ + +DTGSD+ W+ C S H
Sbjct: 112 VTVPTTLGTSLDTLE-------YLITVGMGSPAVAQTMLIDTGSDVSWVQCKPCSQCHSQ 164
Query: 143 NSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGF 202
S ++ P++SST S C S C +Q + S C Y V+Y DG+ +G
Sbjct: 165 ADS--------LFDPSSSSTYSAFSCTSAACAQLRQRGCSSSQCQYTVKY-GDGSTGSGT 215
Query: 203 LVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQ 262
D L L + S FGC + ++G+ L + G ++ LA Q
Sbjct: 216 YSSDTLALGS------STVENFQFGCSQSESGNLLQDQTAGLMGLGGGAES-----LATQ 264
Query: 263 --GLIPNSFSMCF----GSDGTGRISFGDKGSPGQGETPFSLRQTH-PT-YNITITQVSV 314
G +FS C GS +G ++ G S +TP LR T P+ Y + + + V
Sbjct: 265 TAGTFGKAFSYCLPPTPGS--SGFLTLGASTSGFVVKTPM-LRSTQVPSYYGVLLQAIRV 321
Query: 315 GGNAVNFEFSA-----IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
GG +N SA I DSGT T L AY+ +S F + K+ + F+ C+
Sbjct: 322 GGRQLNIPASAFSAGSIMDSGTIITRLPRTAYSALSSAFKAGMKQYPPAQPMGI-FDTCF 380
Query: 370 VLSPNQTNFEYPVVNLTMKGGGPF-FVNDPIVI 401
S Q++ P V L GG +D I++
Sbjct: 381 DFS-GQSSVSIPTVALVFSGGAVVDLASDGIIL 412
>gi|115434442|ref|NP_001041979.1| Os01g0140100 [Oryza sativa Japonica Group]
gi|12328547|dbj|BAB21205.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113531510|dbj|BAF03893.1| Os01g0140100 [Oryza sativa Japonica Group]
gi|125568961|gb|EAZ10476.1| hypothetical protein OsJ_00309 [Oryza sativa Japonica Group]
gi|215697206|dbj|BAG91200.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 504
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 91/340 (26%), Positives = 144/340 (42%), Gaps = 45/340 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+++ V VG PA + LDTGSD+ W+ C C C + ++ P+ S++
Sbjct: 167 YFSRVGVGSPARQLYMVLDTGSDVTWVQCQPCADCYQQSDP---------VFDPSLSTSY 217
Query: 164 SKVPCNSTLCE--LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
+ V C++ C C ++ C Y+V Y DG+ + G + L L S
Sbjct: 218 ASVACDNPRCHDLDAAACRNSTGACLYEVAY-GDGSYTVGDFATETLTLGDSAPVSS--- 273
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGT 278
++ GCG G F+ A L G + S PS ++ +FS C S +
Sbjct: 274 --VAIGCGHDNEGLFVGAAGLLALGGGPL---SFPSQISA-----TTFSYCLVDRDSPSS 323
Query: 279 GRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA-----------IF 327
+ FGD +T Y + ++ +SVGG ++ SA I
Sbjct: 324 STLQFGDAADAEVTAPLIRSPRTSTFYYVGLSGLSVGGQILSIPPSAFAMDSTGAGGVIV 383
Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTM 387
DSGT+ T L AY + + F + TS L F+ CY LS ++T+ E P V+L
Sbjct: 384 DSGTAVTRLQSSAYAALRDAFVRGTQSLPRTSGVSL-FDTCYDLS-DRTSVEVPAVSLRF 441
Query: 388 KGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD-NVNIIG 426
GGG + ++ + G YCL ++ V+IIG
Sbjct: 442 AGGGELRLPAKNYLIPVDGAG--TYCLAFAPTNAAVSIIG 479
>gi|356541713|ref|XP_003539318.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 640
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 87/328 (26%), Positives = 137/328 (41%), Gaps = 32/328 (9%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
T + +G P F + +DTGS + ++PC +C H S Q F P S T V
Sbjct: 95 TRLWIGTPPQRFALIVDTGSTVTYVPCS--TCKH---CGSHQDPKFR---PEASETYQPV 146
Query: 167 PCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISF 226
C Q C C Y+ RY ++ + S+G L EDV+ QS+ R F
Sbjct: 147 KCT-----WQCNCDDDRKQCTYERRY-AEMSTSSGVLGEDVVSFGN---QSELSPQRAIF 197
Query: 227 GCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISFG 284
GC +TG + A +G+ GLG S+ L + +I ++FS+C+G G G + G
Sbjct: 198 GCENDETGDIYNQRA-DGIMGLGRGDLSIMDQLVEKKVISDAFSLCYGGMGVGGGAMVLG 256
Query: 285 DKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-------EFSAIFDSGTSFTYLN 337
P S P YNI + ++ V G ++ + + DSGT++ YL
Sbjct: 257 GISPPADMVFTHSDPVRSPYYNIDLKEIHVAGKRLHLNPKVFDGKHGTVLDSGTTYAYLP 316
Query: 338 DPAYTQISETFNSLAKEKRETSTSDLPF-EYCY---VLSPNQTNFEYPVVNLTMKGGGPF 393
+ A+ + S D + + C+ ++ +Q + +PVV + G G
Sbjct: 317 ESAFLAFKHAIMKETHSLKRISGPDPHYNDICFSGAEINVSQLSKSFPVVEMVF-GNGHK 375
Query: 394 FVNDPIVIVSSEPKGLYLYCLGVVKSDN 421
P + K YCLGV + N
Sbjct: 376 LSLSPENYLFRHSKVRGAYCLGVFSNGN 403
>gi|148905906|gb|ABR16115.1| unknown [Picea sitchensis]
Length = 482
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 101/343 (29%), Positives = 142/343 (41%), Gaps = 59/343 (17%)
Query: 112 GQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNS 170
G PA + ++ +DTGSDL W+ C C C +++ I+ P SS+ +PC S
Sbjct: 144 GTPAKNSLLIIDTGSDLTWIQCKPCADCYSQVDA---------IFEPKQSSSYKTLPCLS 194
Query: 171 TLC-EL----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
C EL P C Y++ Y DG+ S G ++ L L +D Q+ +
Sbjct: 195 ATCTELITSESNPTPCLLGGCVYEINY-GDGSSSQGDFSQETLTLGSDSFQN------FA 247
Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GSDGTGRI 281
FGCG TG F +GL GLG + S PS ++ F+ C S TG
Sbjct: 248 FGCGHTNTGLF---KGSSGLLGLGQNSLSFPS--QSKSKYGGQFAYCLPDFGSSTSTGSF 302
Query: 282 SFGDKGSPGQGE-TPFSLRQTHPT-YNITITQVSVGGN------AVNFEFSAIFDSGTSF 333
S G P TP +PT Y + + +SVGG+ AV S I DSGT
Sbjct: 303 SVGKGSIPASAVFTPLVSNFMYPTFYFVGLNGISVGGDRLSIPPAVLGRGSTIVDSGTVI 362
Query: 334 TYLNDPAYTQISETFNSLAKEKRETSTSDLP-------FEYCYVLSPNQTNFEYPVVNLT 386
T L AY + +F S T DLP + CY LS + P +
Sbjct: 363 TRLLPQAYNALKTSFRS--------KTRDLPSAKPFSILDTCYDLS-RHSQVRIPTITFH 413
Query: 387 MKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKS---DNVNIIG 426
+ V+D ++V + G + CL + D NIIG
Sbjct: 414 FQNNADVAVSDVGILVPVQNGGSQV-CLAFASASQMDGFNIIG 455
>gi|47777372|gb|AAT38006.1| unknow protein [Oryza sativa Japonica Group]
Length = 475
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 96/351 (27%), Positives = 146/351 (41%), Gaps = 51/351 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
++ V VG PA + ++ LDTGSD+ WL C C H + SG+V D P S + +
Sbjct: 122 YFAQVGVGTPATTALMVLDTGSDVVWL--QCAPCRH-CYAQSGRVFD-----PRRSRSYA 173
Query: 165 KVPCNSTLCEL--QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
V C + +C C ++C YQV Y DG+++ G + L A +
Sbjct: 174 AVDCVAPICRRLDSAGCDRRRNSCLYQVAY-GDGSVTAGDFASETLTFARGARV-----Q 227
Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--------- 273
R++ GCG G F+ A +GL GLG + S PS +A SFS C
Sbjct: 228 RVAIGCGHDNEGLFI---AASGLLGLGRGRLSFPSQIARS--FGRSFSYCLVDRTSSVRP 282
Query: 274 GSDGTGRISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGG------------- 316
S + ++FG F+ +P Y + + SVGG
Sbjct: 283 SSTRSSTVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSDLRL 342
Query: 317 NAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQT 376
N I DSGTS T L P Y + + F + A R + F+ CY LS +
Sbjct: 343 NPTTGRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCYNLSGRRV 402
Query: 377 NFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD-NVNIIG 426
+ P V++ + GG + ++ + G +C + +D V+IIG
Sbjct: 403 -VKVPTVSMHLAGGASVALPPENYLIPVDTSG--TFCFAMAGTDGGVSIIG 450
>gi|225216973|gb|ACN85264.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
Length = 517
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 102/366 (27%), Positives = 151/366 (41%), Gaps = 48/366 (13%)
Query: 99 NSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYS 156
+LG +Y V +G PA + V DTGSD W+ C CV + ++
Sbjct: 171 RALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQQE--------KLFD 222
Query: 157 PNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
P SST + V C + C +G +C Y V+Y DG+ S GF D L L++ +
Sbjct: 223 PVRSSTYANVSCAAPACSDLNIHGCSGGHCLYGVQY-GDGSYSIGFFAMDTLTLSSYDAV 281
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--G 274
FGCG G F + A GL GLG KTS+P ++ F+ C
Sbjct: 282 KG-----FRFGCGERNEGLFGEAA---GLLGLGRGKTSLPVQTYDK--YGGVFAHCLPAR 331
Query: 275 SDGTGRISFGDKGSPGQG---ETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA----- 325
S GTG + FG TP L PT Y I +T + VGG ++ S
Sbjct: 332 STGTGYLDFGAGSPAAASARLTTPM-LTDNGPTFYYIGMTGIRVGGQLLSIPQSVFATAG 390
Query: 326 -IFDSGTSFTYLNDPAYTQISETFNSLAKE---KRETSTSDLPFEYCYVLSPNQTNFEYP 381
I DSGT T L PAY+ + F + K+ + S L + CY + + P
Sbjct: 391 TIVDSGTVITRLPPPAYSSLRYAFAAAMAARGYKKAPAVSLL--DTCYDFT-GMSQVAIP 447
Query: 382 VVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYC--------LGVVKSDNVNIIGREYPIAN 433
V+L +GG V+ ++ ++ + L +G+V + + G Y I
Sbjct: 448 TVSLLFQGGARLDVDASGIMYAASASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGK 507
Query: 434 NISLFH 439
+ F+
Sbjct: 508 KVVGFY 513
>gi|18461217|dbj|BAB84414.1| chloroplast nucleoid DNA-binding protein cnd41-like [Oryza sativa
Japonica Group]
Length = 446
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 87/269 (32%), Positives = 119/269 (44%), Gaps = 52/269 (19%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ V VG P+ ++ +DTGSDL WL C C C + GQV D P SST
Sbjct: 86 YFALVGVGTPSTKAMLVIDTGSDLVWLQCSPCRRCY----AQRGQVFD-----PRRSSTY 136
Query: 164 SKVPCNSTLCELQK--QCPS---AGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
+VPC+S C + C S AG C Y V Y DG+ STG L D L A D
Sbjct: 137 RRVPCSSPQCRALRFPGCDSGGAAGGGCRYMVAY-GDGSSSTGDLATDKLAFAND----- 190
Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT 278
+ + ++ GCGR G F D AA GL G+G K S+ + +A + F C G D T
Sbjct: 191 TYVNNVTLGCGRDNEGLF-DSAA--GLLGVGRGKISISTQVAPA--YGSVFEYCLG-DRT 244
Query: 279 GR------ISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGGNAVNFEFSA--- 325
R + FG +P T F+ ++P Y + + SVGG V +A
Sbjct: 245 SRSTRSSYLVFGR--TPEPPSTAFTALLSNPRRPSLYYVDMAGFSVGGERVTGFSNASLA 302
Query: 326 ----------IFDSGTSFTYLNDPAYTQI 344
+ DSGT+ + AY +
Sbjct: 303 LDTATGRGGVVVDSGTAISRFARDAYAAL 331
>gi|357135412|ref|XP_003569303.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 494
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 95/351 (27%), Positives = 143/351 (40%), Gaps = 52/351 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++T + VG PA ++ LDTGSD+ WL C C C SGQV D P S +
Sbjct: 142 YFTKIGVGTPATPALMVLDTGSDVVWLQCAPCRRCYD----QSGQVFD-----PRRSRSY 192
Query: 164 SKVPCNSTLCELQKQ--CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
V C++ LC C C YQV Y DG+++ G + L A +
Sbjct: 193 GAVGCSAPLCRRLDSGGCDLRRKACLYQVAY-GDGSVTAGDFATETLTFAGGARV----- 246
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-------- 273
+RI+ GCG G F+ A GL S P+ ++ + SFS C
Sbjct: 247 ARIALGCGHDNEGLFVAAAGLLGLG---RGSLSFPAQISRR--YGRSFSYCLVDRTSSAN 301
Query: 274 GSDGTGRISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGGNAV---------- 319
+ + ++FG F+ +P Y + + +SVGG V
Sbjct: 302 PASHSSTVTFGSGAVGSTVAASFTPMVKNPRMETFYYVQLVGISVGGARVSGVADSDLRL 361
Query: 320 ---NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQT 376
+ I DSGTS T L PAY+ + + F + A R + F+ CY LS +
Sbjct: 362 DPSSGRGGVIVDSGTSVTRLARPAYSALRDAFRAAAAGLRLSPGGFSLFDTCYDLSGRKV 421
Query: 377 NFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD-NVNIIG 426
+ P V++ GG + ++ + KG +C +D V+IIG
Sbjct: 422 -VKVPTVSMHFAGGAEAALPPENYLIPVDSKG--TFCFAFAGTDGGVSIIG 469
>gi|449486856|ref|XP_004157423.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 491
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 97/363 (26%), Positives = 152/363 (41%), Gaps = 52/363 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+++ V VGQPA F + LDTGSD+ WL C C C + I+ P +SS+
Sbjct: 155 YFSRVGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDP---------IFDPRSSSSF 205
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
+ +PC S C+ + S C YQV Y DG+ + G V + L + + +
Sbjct: 206 ASLPCESQQCQALETSGCRASKCLYQVSY-GDGSFTVGEFVTETLTFG-----NSGMIND 259
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGTGR 280
++ GCG G F+ A GL G + TS + +SFS C S +
Sbjct: 260 VAVGCGHDNEGLFVGSAGLLGLGGGPLSLTS--------QMKASSFSYCLVDRDSSSSSD 311
Query: 281 ISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVN-----FEFS------AIFD 328
+ F P T Y + +T +SVGG ++ F+ I D
Sbjct: 312 LEFNSAAPSDSVNAPLLKSGKVDTFYYVGLTGMSVGGQLLSIPPNLFQMDDSGYGGIIVD 371
Query: 329 SGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMK 388
SGT+ T L AY + + F S ++T+ L F+ CY LS +Q+ P V+
Sbjct: 372 SGTAITRLQTQAYNTLRDAFVSRTPYLKKTNGFAL-FDTCYDLS-SQSRVTIPTVSFEFA 429
Query: 389 GGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGR--------EYPIANNISLF-- 438
GG + ++ + G + + S +++IIG Y +AN++ F
Sbjct: 430 GGKSLQLPPKNYLIPVDSVGTFCFAFAPTTS-SLSIIGNVQQQGTRVHYDLANSVVGFSP 488
Query: 439 HNC 441
H C
Sbjct: 489 HKC 491
>gi|356508918|ref|XP_003523200.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 112/425 (26%), Positives = 165/425 (38%), Gaps = 67/425 (15%)
Query: 27 FGTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAA---QGND 83
+ T GF R+ D K + ++ + + + R RL LAA D
Sbjct: 43 YPTKGFRVMLRHVDSGKNLTKLERV-------QHGIKRGKSRLQRLNAMVLAASTLDSED 95
Query: 84 KTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGL 142
+ AGN Y + +++G P +S+ LDTGSDL W C C C
Sbjct: 96 QLEAPIHAGNGEYLME---------LAIGTPPVSYPAVLDTGSDLIWTQCKPCTQCYKQP 146
Query: 143 NSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSA--GSNCPYQVRYLSDGTMST 200
I+ P SS+ SKV C S+LC PS+ C Y Y D +M+
Sbjct: 147 TP---------IFDPKKSSSFSKVSCGSSLCS---AVPSSTCSDGCEYVYSY-GDYSMTQ 193
Query: 201 GFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILA 260
G L + + ++K I FGCG G + A+ GL GLG S+ S L
Sbjct: 194 GVLATETFTFG--KSKNKVSVHNIGFGCGEDNEGDGFEQAS--GLVGLGRGPLSLVSQLK 249
Query: 261 NQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGE-------TPFSLRQTHPT-YNITITQV 312
FS C + S GS G+ + TP P+ Y +++ +
Sbjct: 250 EP-----RFSYCLTPMDDTKESILLLGSLGKVKDAKEVVTTPLLKNPLQPSFYYLSLEGI 304
Query: 313 SVGGNAVNFEFS-----------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTS 361
SVG ++ E S I DSGT+ TY+ A+ + + F S K + TS
Sbjct: 305 SVGDTRLSIEKSTFEVGDDGNGGVIIDSGTTITYIEQKAFEALKKEFISQTKLPLD-KTS 363
Query: 362 DLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDN 421
+ C+ L T E P + KGG + +I S L + CL + S
Sbjct: 364 STGLDLCFSLPSGSTQVEIPKIVFHFKGGDLELPAENYMIGDSN---LGVACLAMGASSG 420
Query: 422 VNIIG 426
++I G
Sbjct: 421 MSIFG 425
>gi|20466302|gb|AAM20468.1| putative aspartyl protease [Arabidopsis thaliana]
gi|23198124|gb|AAN15589.1| putative aspartyl protease [Arabidopsis thaliana]
Length = 320
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 54/160 (33%), Positives = 83/160 (51%), Gaps = 15/160 (9%)
Query: 195 DGTMSTGFLVEDVLHL--ATDEKQSKSVDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMD 251
DG+ + G+LV+DV+HL T +Q+ S + I FGCG Q+G + AA +G+ G G
Sbjct: 4 DGSSTNGYLVKDVVHLDLVTGNRQTGSTNGTIIFGCGSKQSGQLGESQAAVDGIMGFGQS 63
Query: 252 KTSVPSILANQGLIPNSFSMCF-GSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITIT 310
+S S LA+QG + SF+ C ++G G + G+ SP TP + H Y++ +
Sbjct: 64 NSSFISQLASQGKVKRSFAHCLDNNNGGGIFAIGEVVSPKVKTTPMLSKSAH--YSVNLN 121
Query: 311 QVSVGGNAVNFEFSA---------IFDSGTSFTYLNDPAY 341
+ VG + + +A I DSGT+ YL D Y
Sbjct: 122 AIEVGNSVLELSSNAFDSGDDKGVIIDSGTTLVYLPDAVY 161
>gi|125540927|gb|EAY87322.1| hypothetical protein OsI_08726 [Oryza sativa Indica Group]
Length = 464
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 98/345 (28%), Positives = 140/345 (40%), Gaps = 42/345 (12%)
Query: 65 HRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLN-SLGFLHYT-NVSVGQPALSFIVAL 122
R Y + R G AA A L S+G L Y VS+G PA++ + +
Sbjct: 89 RRAEYIQRRVSGAAAAAPGMQLAGSKAATVPANLGFSIGTLQYVVTVSLGTPAVAQTLEV 148
Query: 123 DTGSDLFWL---PCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE----L 175
DTGSD+ W+ PC C + ++ P SS+ S VPC + C
Sbjct: 149 DTGSDVSWVQCKPCPSPPCYSQRDP---------LFDPTRSSSYSAVPCAAASCSQLALY 199
Query: 176 QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGS 235
C +G C Y V Y DG+ +TG D L L + FGCG Q G
Sbjct: 200 SNGC--SGGQCGYVVSY-GDGSTTTGVYSSDTLTLTGSNALKGFL-----FGCGHAQQGL 251
Query: 236 FLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--GSDGTGRISFGDKGS-PGQG 292
F A +GL GLG S+ S ++ FS C + G IS G S G
Sbjct: 252 F---AGVDGLLGLGRQGQSLVSQASST--YGGVFSYCLPPTQNSVGYISLGGPSSTAGFS 306
Query: 293 ETPFSLRQTHPTYNIT-ITQVSVGGNAVNFEFS-----AIFDSGTSFTYLNDPAYTQISE 346
TP PTY I + +SVGG ++ + S A+ D+GT T L AY+ +
Sbjct: 307 TTPLLTASNDPTYYIVMLAGISVGGQPLSIDASVFASGAVVDTGTVVTRLPPTAYSALRS 366
Query: 347 TFN-SLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGG 390
F ++A ++ + + CY + T P +++ GG
Sbjct: 367 AFRAAMAPYGYPSAPATGILDTCYDFTRYGT-VTLPTISIAFGGG 410
>gi|302776610|ref|XP_002971459.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
gi|300160591|gb|EFJ27208.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
Length = 357
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 92/346 (26%), Positives = 147/346 (42%), Gaps = 51/346 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ V +G P + +DTGSD+ W+ C C SC ++ ++ P SS+
Sbjct: 14 YFVRVGIGSPTKLQYLVMDTGSDVPWIQCSPCKSCYKQNDA---------VFDPRASSSF 64
Query: 164 SKVPCNSTLCELQ--KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
++ C++ C+L K C S + C YQV Y DG+ + G L D + S+
Sbjct: 65 RRLSCSTPQCKLLDVKACASTDNRCLYQVSY-GDGSFTVGDLASDSFSV------SRGRT 117
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-----GSD 276
S + FGCG G F+ A GLG K S PS L+++ FS C G
Sbjct: 118 SPVVFGCGHDNEGLFVGAAGLL---GLGAGKLSFPSQLSSR-----KFSYCLVSRDNGVR 169
Query: 277 GTGRISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGGNAVNFEFSA------- 325
+ + FGD P ++ +P Y ++ +S+GG ++ +A
Sbjct: 170 ASSALLFGDSALPTSASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAFKLSSST 229
Query: 326 -----IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEY 380
I DSGTS T L AYT + + F S ++ + L F+ CY S T+
Sbjct: 230 GRGGVIIDSGTSVTRLPTYAYTVMRDAFRSATQKLPRAADFSL-FDTCYDFSA-LTSVTI 287
Query: 381 PVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
P V+ +GG + +V + G + + D ++IIG
Sbjct: 288 PTVSFHFEGGASVQLPPSNYLVPVDTSGTFCFAFSKTSLD-LSIIG 332
>gi|8979711|emb|CAB96832.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
thaliana]
Length = 446
Score = 82.4 bits (202), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 95/303 (31%), Positives = 130/303 (42%), Gaps = 39/303 (12%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ V +G P + DTGSDL W C CV + I++P+ S++
Sbjct: 104 YIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQKEP--------IFNPSKSTSY 155
Query: 164 SKVPCNSTLC-ELQKQCPSAG----SNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
V C+S C L +AG SNC Y ++Y D + S GFL ++ L +
Sbjct: 156 YNVSCSSAACGSLSSATGNAGSCSASNCIYGIQY-GDQSFSVGFLAKEKFTLTNSD---- 210
Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDG- 277
V + FGCG G F A GL GLG DK S PS A FS C S
Sbjct: 211 -VFDGVYFGCGENNQGLFTGVA---GLLGLGRDKLSFPSQTATA--YNKIFSYCLPSSAS 264
Query: 278 -TGRISFGDKG-SPGQGETPFS-LRQTHPTYNITITQVSVGGNAVNFE---FS---AIFD 328
TG ++FG G S TP S + Y + I ++VGG + FS A+ D
Sbjct: 265 YTGHLTFGSAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVFSTPGALID 324
Query: 329 SGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP-FEYCYVLSPNQTNFEYPVVNLTM 387
SGT T L AY + +F AK + +TS + + C+ LS +T P V +
Sbjct: 325 SGTVITRLPPKAYAALRSSFK--AKMSKYPTTSGVSILDTCFDLSGFKT-VTIPKVAFSF 381
Query: 388 KGG 390
GG
Sbjct: 382 SGG 384
>gi|414881704|tpg|DAA58835.1| TPA: hypothetical protein ZEAMMB73_701358 [Zea mays]
Length = 485
Score = 82.4 bits (202), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 96/356 (26%), Positives = 143/356 (40%), Gaps = 58/356 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++T + VG PA ++ LDTGSD+ W+ C C C SG V D P SS+
Sbjct: 129 YFTKIGVGTPATQALMVLDTGSDVVWVQCAPCRRCYE----QSGPVFD-----PRRSSSY 179
Query: 164 SKVPCNSTLCELQKQ--CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
V C + LC C C YQV Y DG+++ G V + L A +
Sbjct: 180 GAVGCGAALCRRLDSGGCDLRRGACMYQVAY-GDGSVTAGDFVTETLTFAGGARV----- 233
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-------- 273
+R++ GCG G F+ A GL S P+ ++ + SFS C
Sbjct: 234 ARVALGCGHDNEGLFVAAAGLLGLG---RGGLSFPTQISRR--YGRSFSYCLVDRTSSGA 288
Query: 274 ----GSDGTGRISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGGNAV------ 319
GS + +SFG GS G F+ +P Y + + +SVGG V
Sbjct: 289 GAAPGSHRSSTVSFG-AGSVGASSASFTPMVRNPRMETFYYVQLVGISVGGARVPGVAES 347
Query: 320 -------NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP-FEYCYVL 371
I DSGTS T L +Y+ + + F + A S F+ CY L
Sbjct: 348 DLRLDPSTGRGGVIVDSGTSVTRLARASYSALRDAFRAAAAGGLRLSPGGFSLFDTCYDL 407
Query: 372 SPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD-NVNIIG 426
+ + P V++ GG + ++ + +G +C +D V+IIG
Sbjct: 408 GGRRV-VKVPTVSMHFAGGAEAALPPENYLIPVDSRG--TFCFAFAGTDGGVSIIG 460
>gi|115465771|ref|NP_001056485.1| Os05g0590000 [Oryza sativa Japonica Group]
gi|49328116|gb|AAT58814.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113580036|dbj|BAF18399.1| Os05g0590000 [Oryza sativa Japonica Group]
Length = 481
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 96/351 (27%), Positives = 146/351 (41%), Gaps = 51/351 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
++ V VG PA + ++ LDTGSD+ WL C C H + SG+V D P S + +
Sbjct: 128 YFAQVGVGTPATTALMVLDTGSDVVWL--QCAPCRH-CYAQSGRVFD-----PRRSRSYA 179
Query: 165 KVPCNSTLCEL--QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
V C + +C C ++C YQV Y DG+++ G + L A +
Sbjct: 180 AVDCVAPICRRLDSAGCDRRRNSCLYQVAY-GDGSVTAGDFASETLTFARGARV-----Q 233
Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--------- 273
R++ GCG G F+ A +GL GLG + S PS +A SFS C
Sbjct: 234 RVAIGCGHDNEGLFI---AASGLLGLGRGRLSFPSQIARS--FGRSFSYCLVDRTSSVRP 288
Query: 274 GSDGTGRISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGG------------- 316
S + ++FG F+ +P Y + + SVGG
Sbjct: 289 SSTRSSTVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSDLRL 348
Query: 317 NAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQT 376
N I DSGTS T L P Y + + F + A R + F+ CY LS +
Sbjct: 349 NPTTGRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCYNLSGRRV 408
Query: 377 NFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD-NVNIIG 426
+ P V++ + GG + ++ + G +C + +D V+IIG
Sbjct: 409 -VKVPTVSMHLAGGASVALPPENYLIPVDTSG--TFCFAMAGTDGGVSIIG 456
>gi|45735840|dbj|BAD12875.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735966|dbj|BAD12995.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|125583491|gb|EAZ24422.1| hypothetical protein OsJ_08175 [Oryza sativa Japonica Group]
Length = 475
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 98/345 (28%), Positives = 140/345 (40%), Gaps = 42/345 (12%)
Query: 65 HRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLN-SLGFLHYT-NVSVGQPALSFIVAL 122
R Y + R G AA A L S+G L Y VS+G PA++ + +
Sbjct: 100 RRAEYIQRRVSGAAAAAPGMQLAGSKAATVPANLGFSIGTLQYVVTVSLGTPAVAQTLEV 159
Query: 123 DTGSDLFWL---PCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE----L 175
DTGSD+ W+ PC C + ++ P SS+ S VPC + C
Sbjct: 160 DTGSDVSWVQCKPCPSPPCYSQRDP---------LFDPTRSSSYSAVPCAAASCSQLALY 210
Query: 176 QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGS 235
C +G C Y V Y DG+ +TG D L L + FGCG Q G
Sbjct: 211 SNGC--SGGQCGYVVSY-GDGSTTTGVYSSDTLTLTGSNALKGFL-----FGCGHAQQGL 262
Query: 236 FLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--GSDGTGRISFGDKGS-PGQG 292
F A +GL GLG S+ S ++ FS C + G IS G S G
Sbjct: 263 F---AGVDGLLGLGRQGQSLVSQASST--YGGVFSYCLPPTQNSVGYISLGGPSSTAGFS 317
Query: 293 ETPFSLRQTHPTYNIT-ITQVSVGGNAVNFEFS-----AIFDSGTSFTYLNDPAYTQISE 346
TP PTY I + +SVGG ++ + S A+ D+GT T L AY+ +
Sbjct: 318 TTPLLTASNDPTYYIVMLAGISVGGQPLSIDASVFASGAVVDTGTVVTRLPPTAYSALRS 377
Query: 347 TFN-SLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGG 390
F ++A ++ + + CY + T P +++ GG
Sbjct: 378 AFRAAMAPYGYPSAPATGILDTCYDFTRYGT-VTLPTISIAFGGG 421
>gi|343172996|gb|AEL99201.1| aspartyl protease family protein, partial [Silene latifolia]
Length = 584
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 68/249 (27%), Positives = 108/249 (43%), Gaps = 27/249 (10%)
Query: 111 VGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNS 170
+G P F + +DTGS + ++PC+ SC N + + P+ S T V CN
Sbjct: 2 IGTPPQEFALIVDTGSTVTYVPCN--SCDQCGNHQDPK------FQPDLSDTYHPVKCNP 53
Query: 171 TLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGR 230
C + C Y+ +Y ++ + S+G L ED++ S+ R FGC
Sbjct: 54 DCT-----CDTENDQCTYERQY-AEMSSSSGILGEDLVSFG---NMSELKPQRAVFGCEN 104
Query: 231 VQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISFGDKGS 288
+TG A +G+ GLG S+ L +G+I +SFS+C+G G G + G
Sbjct: 105 AETGDLFSQHA-DGIMGLGRGDLSIVDQLVEKGVINDSFSLCYGGMEVGGGAMVLGQISP 163
Query: 289 PGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-------EFSAIFDSGTSFTYLNDPAY 341
P S P YNI + + V G ++ + I DSGT++ YL + A+
Sbjct: 164 PSDMVFSHSDPDRSPYYNIELRGLHVAGKKLDINPQVFDGKHGTILDSGTTYAYLPEAAF 223
Query: 342 TQISETFNS 350
+ S
Sbjct: 224 LPFIQAITS 232
>gi|356531224|ref|XP_003534178.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 492
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 94/338 (27%), Positives = 143/338 (42%), Gaps = 43/338 (12%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+++ V VGQP+ F + LDTGSD+ WL C C C + I+ P SS+
Sbjct: 157 YFSRVGVGQPSKPFYMVLDTGSDVNWLQCKPCSDCYQQSDP---------IFDPTASSSY 207
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
+ + C++ C+ + C YQV Y DG+ + G V + + + SV+ R
Sbjct: 208 NPLTCDAQQCQDLEMSACRNGKCLYQVSY-GDGSFTVGEYVTETVSFG-----AGSVN-R 260
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISF 283
++ GCG G F+ A GL G + TS + SFS C +G+ S
Sbjct: 261 VAIGCGHDNEGLFVGSAGLLGLGGGPLSLTS--------QIKATSFSYCLVDRDSGKSST 312
Query: 284 GDKGSPGQGET---PFSLRQTHPT-YNITITQVSVGGNAVNF---EFS--------AIFD 328
+ SP G++ P Q T Y + +T VSVGG V F+ I D
Sbjct: 313 LEFNSPRPGDSVVAPLLKNQKVNTFYYVELTGVSVGGEIVTVPPETFAVDQSGAGGVIVD 372
Query: 329 SGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMK 388
SGT+ T L AY + + F R L F+ CY LS Q+ P V+
Sbjct: 373 SGTAITRLRTQAYNSVRDAFKRKTSNLRPAEGVAL-FDTCYDLSSLQS-VRVPTVSFHFS 430
Query: 389 GGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
G + + ++ + G Y + S +++IIG
Sbjct: 431 GDRAWALPAKNYLIPVDGAGTYCFAFAPTTS-SMSIIG 467
>gi|343172998|gb|AEL99202.1| aspartyl protease family protein, partial [Silene latifolia]
Length = 584
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 68/249 (27%), Positives = 108/249 (43%), Gaps = 27/249 (10%)
Query: 111 VGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNS 170
+G P F + +DTGS + ++PC+ SC N + + P+ S T V CN
Sbjct: 2 IGTPPQEFALIVDTGSTVTYVPCN--SCDQCGNHQDPK------FQPDLSDTYHPVKCNP 53
Query: 171 TLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGR 230
C + C Y+ +Y ++ + S+G L ED++ S+ R FGC
Sbjct: 54 DCT-----CDTENDQCTYERQY-AEMSSSSGILGEDLVSFG---NMSELKPQRAVFGCEN 104
Query: 231 VQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISFGDKGS 288
+TG A +G+ GLG S+ L +G+I +SFS+C+G G G + G
Sbjct: 105 AETGDLFSQHA-DGIMGLGRGDLSIVDQLVEKGVINDSFSLCYGGMEVGGGAMVLGQISP 163
Query: 289 PGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-------EFSAIFDSGTSFTYLNDPAY 341
P S P YNI + + V G ++ + I DSGT++ YL + A+
Sbjct: 164 PSDMVFSHSDPDRSPYYNIELRGLHVAGKKLDINPQVFDGKHGTILDSGTTYAYLPEAAF 223
Query: 342 TQISETFNS 350
+ S
Sbjct: 224 LPFIQAITS 232
>gi|125558631|gb|EAZ04167.1| hypothetical protein OsI_26309 [Oryza sativa Indica Group]
Length = 441
Score = 82.0 bits (201), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 85/350 (24%), Positives = 140/350 (40%), Gaps = 56/350 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
+ ++++G P L LDTGSDL W CD C C +Y+P S+T
Sbjct: 92 YLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQ---------PAPLYAPARSAT 142
Query: 163 SSKVPCNSTLCE-LQ---KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
+ V C S +C+ LQ +C + C Y Y DGT + G L + L +D
Sbjct: 143 YANVSCRSPMCQALQSPWSRCSPPDTGCAYYFSY-GDGTSTDGVLATETFTLGSDTAVRG 201
Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS--- 275
++FGCG GS + +GL G+G S+ S L FS CF
Sbjct: 202 -----VAFGCGTENLGSTDNS---SGLVGMGRGPLSLVSQLGV-----TRFSYCFTPFNA 248
Query: 276 --------DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS--- 324
+ R+S K +P R+ Y +++ ++VG + + +
Sbjct: 249 TAASPLFLGSSARLSSAAKTTPFVPSPSGGARRRSSYYYLSLEGITVGDTLLPIDPAVFR 308
Query: 325 --------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQT 376
I DSGT+FT L + A+ ++ S + S + L C+ + +
Sbjct: 309 LTPMGDGGVIIDSGTTFTALEESAFVALARALASRVRLPL-ASGAHLGLSLCFAAASPEA 367
Query: 377 NFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
E P + L G + V+ E + + CLG+V + ++++G
Sbjct: 368 -VEVPRLVLHFDGADMELRRESYVV---EDRSAGVACLGMVSARGMSVLG 413
>gi|115472517|ref|NP_001059857.1| Os07g0533600 [Oryza sativa Japonica Group]
gi|22831047|dbj|BAC15910.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50508280|dbj|BAD32129.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113611393|dbj|BAF21771.1| Os07g0533600 [Oryza sativa Japonica Group]
gi|215766673|dbj|BAG98901.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 441
Score = 82.0 bits (201), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 85/350 (24%), Positives = 140/350 (40%), Gaps = 56/350 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
+ ++++G P L LDTGSDL W CD C C +Y+P S+T
Sbjct: 92 YLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQ---------PAPLYAPARSAT 142
Query: 163 SSKVPCNSTLCE-LQ---KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
+ V C S +C+ LQ +C + C Y Y DGT + G L + L +D
Sbjct: 143 YANVSCRSPMCQALQSPWSRCSPPDTGCAYYFSY-GDGTSTDGVLATETFTLGSDTAVRG 201
Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS--- 275
++FGCG GS + +GL G+G S+ S L FS CF
Sbjct: 202 -----VAFGCGTENLGSTDNS---SGLVGMGRGPLSLVSQLGV-----TRFSYCFTPFNA 248
Query: 276 --------DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS--- 324
+ R+S K +P R+ Y +++ ++VG + + +
Sbjct: 249 TAASPLFLGSSARLSSAAKTTPFVPSPSGGARRRSSYYYLSLEGITVGDTLLPIDPAVFR 308
Query: 325 --------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQT 376
I DSGT+FT L + A+ ++ S + S + L C+ + +
Sbjct: 309 LTPMGDGGVIIDSGTTFTALEERAFVALARALASRVRLPL-ASGAHLGLSLCFAAASPEA 367
Query: 377 NFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
E P + L G + V+ E + + CLG+V + ++++G
Sbjct: 368 -VEVPRLVLHFDGADMELRRESYVV---EDRSAGVACLGMVSARGMSVLG 413
>gi|449439383|ref|XP_004137465.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 491
Score = 81.6 bits (200), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 97/363 (26%), Positives = 152/363 (41%), Gaps = 52/363 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+++ V VGQPA F + LDTGSD+ WL C C C + I+ P +SS+
Sbjct: 155 YFSRVGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDP---------IFDPRSSSSF 205
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
+ +PC S C+ + S C YQV Y DG+ + G V + L + + +
Sbjct: 206 ASLPCESQQCQALETSGCRASKCLYQVSY-GDGSFTVGEFVIETLTFG-----NSGMINN 259
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGTGR 280
++ GCG G F+ A GL G + TS + +SFS C S +
Sbjct: 260 VAVGCGHDNEGLFVGSAGLLGLGGGSLSLTS--------QMKASSFSYCLVDRDSSSSSD 311
Query: 281 ISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVN-----FEFS------AIFD 328
+ F P T Y + +T +SVGG ++ F+ I D
Sbjct: 312 LEFNSAAPSDSVNAPLLKSGKVDTFYYVGLTGMSVGGQLLSIPPNLFQMDDSGYGGIIVD 371
Query: 329 SGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMK 388
SGT+ T L AY + + F S ++T+ L F+ CY LS +Q+ P V+
Sbjct: 372 SGTAITRLQTQAYNTLRDAFVSRTPYLKKTNGFAL-FDTCYDLS-SQSRVTIPTVSFEFA 429
Query: 389 GGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGR--------EYPIANNISLF-- 438
GG + ++ + G + + S +++IIG Y +AN++ F
Sbjct: 430 GGKSLQLPPKNYLIPVDSVGTFCFAFAPTTS-SLSIIGNVQQQGTRVHYDLANSVVGFSP 488
Query: 439 HNC 441
H C
Sbjct: 489 HKC 491
>gi|359492937|ref|XP_002283889.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 439
Score = 81.6 bits (200), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 87/271 (32%), Positives = 110/271 (40%), Gaps = 47/271 (17%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ V +G P + LDT D W+PC DC C +SPNTSST
Sbjct: 99 YVVRVKLGTPGQLMFMVLDTSRDAAWVPCADCAGCSS------------PTFSPNTSSTY 146
Query: 164 SKVPCNSTLCELQK--QCPSAG-SNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
+ + C+ C + CP+ G + C + Y D + S L +D L LA D S
Sbjct: 147 ASLQCSVPQCTQVRGLSCPTTGTAACFFNQTYGGDSSFS-AMLSQDSLGLAVDTLPS--- 202
Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQG-LIPNSFSMCFGSDG-- 277
SFGC +GS L P GL GLG S+L+ G L FS CF S
Sbjct: 203 ---YSFGCVNAVSGSTLP---PQGLLGLGRGPM---SLLSQSGSLYSGVFSYCFPSFKSY 253
Query: 278 --TGRISFGDKGSPGQGETPFSLRQTH-PT-YNITITQVSVGGNAV-----------NFE 322
+G + G G P T LR H PT Y + +T VSVG V N
Sbjct: 254 YFSGSLRLGPLGQPKNIRTTPLLRNPHRPTLYYVNLTGVSVGRVLVPVAPELLAFDPNTG 313
Query: 323 FSAIFDSGTSFTYLNDPAYTQISETFNSLAK 353
I DSGT T +P Y I + F K
Sbjct: 314 AGTIIDSGTVITRFVEPVYAAIRDEFRKQVK 344
>gi|88174563|gb|ABD39356.1| chloroplast nucleoid DNA-binding protein [Oryza longistaminata]
Length = 323
Score = 81.6 bits (200), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 86/346 (24%), Positives = 146/346 (42%), Gaps = 47/346 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+ +V +G P+ + I+ +DTGS W+ C+C C H + + + S+T +
Sbjct: 1 YVISVGLGTPSKTQILEIDTGSSTSWVFCECDGC-H---------TNPRTFLQSRSTTCA 50
Query: 165 KVPCNSTLCELQKQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
KV C +++C L P +CP++V Y DG+ S G L +D L + +K
Sbjct: 51 KVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSY-QDGSASYGILYQDTLTFSDVQKIPG- 108
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC------- 272
SFGC G+ G +GL G+G SV L + FS C
Sbjct: 109 ----FSFGCNMDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPTFDGFSYCLPLQMSE 160
Query: 273 --FGSDGTGRISFGDKGSPGQGETPF----SLRQTHPTYNITITQVSVGGNAVNFEFS-- 324
F S TG S G K + + + + + R+ + + +T +SV G + S
Sbjct: 161 RGFFSKTTGYFSLGGKIAATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSIF 220
Query: 325 ----AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEY 380
+FDSG+ +Y+ D A + +S+ L R + + CY + +
Sbjct: 221 SRKGVVFDSGSELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRSVDEG-DM 277
Query: 381 PVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
P ++L G F + V V + ++CL +++V+IIG
Sbjct: 278 PAISLHFDDGARFDLGSHGVFVERSVQEQDVWCLAFAPTESVSIIG 323
>gi|302765224|ref|XP_002966033.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
gi|300166847|gb|EFJ33453.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
Length = 357
Score = 81.6 bits (200), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 93/346 (26%), Positives = 149/346 (43%), Gaps = 51/346 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ V +G P + +DTGSD+ W+ C C SC ++ ++ P SS+
Sbjct: 14 YFVRVGIGSPTKLQYLVMDTGSDVPWIQCSPCKSCYKQNDA---------VFDPRASSSF 64
Query: 164 SKVPCNSTLCELQ--KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
++ C++ C+L K C S + C YQV Y DG+ + G L D + S+
Sbjct: 65 RRLSCSTPQCKLLDVKACASTDNRCLYQVSY-GDGSFTVGDLASDSFLV------SRGRT 117
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-----GSD 276
S + FGCG G F+ A GLG K S PS L+++ FS C G
Sbjct: 118 SPVVFGCGHDNEGLFVGAAGLL---GLGAGKLSFPSQLSSR-----KFSYCLVSRDNGVR 169
Query: 277 GTGRISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGGNAVN-----FEFSA-- 325
+ + FGD P ++ +P Y ++ +S+GG ++ F+ S+
Sbjct: 170 ASSALLFGDSALPTSASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAFKLSSST 229
Query: 326 -----IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEY 380
I DSGTS T L AYT + + F S ++ + L F+ CY S T+
Sbjct: 230 GRGGVIIDSGTSVTRLPTYAYTVMRDAFRSATQKLPRAADFSL-FDTCYDFSA-LTSVTI 287
Query: 381 PVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
P V+ +GG + +V + G + + D ++IIG
Sbjct: 288 PTVSFHFEGGASVQLPPSNYLVPVDTSGTFCFAFSKTSLD-LSIIG 332
>gi|115448349|ref|NP_001047954.1| Os02g0720600 [Oryza sativa Japonica Group]
gi|45735841|dbj|BAD12876.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
[Oryza sativa Japonica Group]
gi|45735967|dbj|BAD12996.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
[Oryza sativa Japonica Group]
gi|113537485|dbj|BAF09868.1| Os02g0720600 [Oryza sativa Japonica Group]
gi|125583492|gb|EAZ24423.1| hypothetical protein OsJ_08176 [Oryza sativa Japonica Group]
gi|215740989|dbj|BAG97484.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 463
Score = 81.6 bits (200), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 93/309 (30%), Positives = 132/309 (42%), Gaps = 38/309 (12%)
Query: 99 NSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWL---PCDCVSCVHGLNSSSGQVIDFNI 154
+SL L Y +V +G PA++ V +DTGSD+ W+ PC C + +G + D
Sbjct: 120 SSLDTLEYVISVGLGTPAVTQTVTIDTGSDVSWVQCNPCPNPPCY----AQTGALFD--- 172
Query: 155 YSPNTSSTSSKVPCNSTLC-ELQKQ---CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL 210
P SST V C + C +L++Q C + C Y V+Y DG+ + G D L L
Sbjct: 173 --PAKSSTYRAVSCAAAECAQLEQQGNGCGATNYECQYGVQY-GDGSTTNGTYSRDTLTL 229
Query: 211 ATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFS 270
+ K FGC V++G F D +GL GLG S+ S A NSFS
Sbjct: 230 SGASDAVKG----FQFGCSHVESG-FSD--QTDGLMGLGGGAQSLVSQTA--AAYGNSFS 280
Query: 271 MCF----GSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVN-----F 321
C GS G + G S RQ Y + ++VGG + F
Sbjct: 281 YCLPPTSGSSGFLTLGGGGGVSGFVTTRMLRSRQIPTFYGARLQDIAVGGKQLGLSPSVF 340
Query: 322 EFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYP 381
++ DSGT T L AY+ +S F + K+ R + + C+ + QT P
Sbjct: 341 AAGSVVDSGTIITRLPPTAYSALSSAFKAGMKQYRSAPARSI-LDTCFDFA-GQTQISIP 398
Query: 382 VVNLTMKGG 390
V L GG
Sbjct: 399 TVALVFSGG 407
>gi|125555058|gb|EAZ00664.1| hypothetical protein OsI_22685 [Oryza sativa Indica Group]
Length = 465
Score = 81.6 bits (200), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 104/360 (28%), Positives = 151/360 (41%), Gaps = 54/360 (15%)
Query: 95 TYRLNSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWL---PCDCVSCVHGLNSSSGQVI 150
T+ +S+ L Y + +G PA+ IV +DTGSDL W+ PC C +
Sbjct: 107 TFLGDSVDSLEYVVTLGIGTPAVQQIVLIDTGSDLSWVQCKPCGAGECYAQKDP------ 160
Query: 151 DFNIYSPNTSSTSSKVPCNSTLCELQKQ------CPS-AGSNCPYQVRYLSDGTMSTGFL 203
++ P++SS+ + VPC+S C C S A + C Y + Y + T +TG
Sbjct: 161 ---LFDPSSSSSYASVPCDSDACRKLAAGAYGHGCTSGAAALCEYGIEYGNRAT-TTGVY 216
Query: 204 VEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQG 263
+ L L + V + FGCG Q G + +GL GLG S+ S ++Q
Sbjct: 217 STETLTL-----KPGVVVADFGFGCGDHQHGPYEKF---DGLLGLGGAPESLVSQTSSQF 268
Query: 264 LIPNSFSMCFGSDGTGRISFG-------DKGSPGQGETPFSLRQTHPT-YNITITQVSVG 315
P S+ + S G G ++ G + G TP + PT Y +T+T +SVG
Sbjct: 269 GGPFSYCLPPTSGGAGFLALGAPNSSSSSTAAAGFLFTPMRRIPSVPTFYVVTLTGISVG 328
Query: 316 GNAVNFEFSA-----IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSD-LPFEYCY 369
G + SA + DSGT T L AY + F S E R S+ + CY
Sbjct: 329 GAPLAVPPSAFSSGMVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNGAVLDTCY 388
Query: 370 VLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCL---GVVKSDNVNIIG 426
+ TN P + LT GG + P + L CL G D + IIG
Sbjct: 389 DFT-GHTNVTVPTIALTFSGGATIDLATPAGV-------LVDGCLAFAGAGTDDTIGIIG 440
>gi|125553531|gb|EAY99240.1| hypothetical protein OsI_21202 [Oryza sativa Indica Group]
Length = 475
Score = 81.6 bits (200), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 95/351 (27%), Positives = 146/351 (41%), Gaps = 51/351 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
++ V VG PA + ++ LDTGSD+ WL C C H + SG+V D P S + +
Sbjct: 122 YFAQVGVGTPATTALMVLDTGSDVVWL--QCAPCRH-CYAQSGRVFD-----PRRSRSYA 173
Query: 165 KVPCNSTLCEL--QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
V C + +C C ++C YQV Y DG+++ G + L A +
Sbjct: 174 AVDCVAPICRRLDSAGCDRRRNSCLYQVAY-GDGSVTAGDFASETLTFARGARV-----Q 227
Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--------- 273
R++ GCG G F+ A +GL GLG + S P+ +A SFS C
Sbjct: 228 RVAIGCGHDNEGLFI---AASGLLGLGRGRLSFPTQIARS--FGRSFSYCLVDRTSSVRP 282
Query: 274 GSDGTGRISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGG------------- 316
S + ++FG F+ +P Y + + SVGG
Sbjct: 283 SSTRSSTVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSDLRL 342
Query: 317 NAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQT 376
N I DSGTS T L P Y + + F + A R + F+ CY LS +
Sbjct: 343 NPTTGRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCYNLSGRRV 402
Query: 377 NFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD-NVNIIG 426
+ P V++ + GG + ++ + G +C + +D V+IIG
Sbjct: 403 -VKVPTVSMHLAGGASVALPPENYLIPVDTSG--TFCFAMAGTDGGVSIIG 450
>gi|356524287|ref|XP_003530761.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 481
Score = 81.6 bits (200), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 101/352 (28%), Positives = 148/352 (42%), Gaps = 54/352 (15%)
Query: 65 HRDRYFRLR-----GRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFI 119
R +Y + R GR + + D T L +G+ N ++ V +G P
Sbjct: 96 ERVKYIQSRLSKNLGRENSVKELDSTTLPAKSGSLIGSAN-----YFVVVGLGTPKRDLS 150
Query: 120 VALDTGSDLFWLPCD-CV-SCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE--- 174
+ DTGSDL W C+ C SC ++ I+ P+ SS+ + C S+LC
Sbjct: 151 LVFDTGSDLTWTQCEPCAGSCYKQQDA---------IFDPSKSSSYINITCTSSLCTQLT 201
Query: 175 ---LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL-ATDEKQSKSVDSRISFGCGR 230
++ +C S+ + C Y ++Y D + S GFL ++ L + ATD VD + FGCG+
Sbjct: 202 SAGIKSRCSSSTTACIYGIQY-GDKSTSVGFLSQERLTITATD-----IVDDFL-FGCGQ 254
Query: 231 VQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT--GRISFGDKGS 288
G F A GL GLG S + + FS C S + G ++FG +
Sbjct: 255 DNEGLFSGSA---GLIGLGRHPISF--VQQTSSIYNKIFSYCLPSTSSSLGHLTFGASAA 309
Query: 289 PGQG--ETPFSLRQTHPT-YNITITQVSVGGNAV----NFEFSA---IFDSGTSFTYLND 338
TP S T Y + I +SVGG + + FSA I DSGT T L
Sbjct: 310 TNANLKYTPLSTISGDNTFYGLDIVGISVGGTKLPAVSSSTFSAGGSIIDSGTVITRLAP 369
Query: 339 PAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGG 390
AY + F EK + D F+ CY S + P ++ GG
Sbjct: 370 TAYAALRSAFRQ-GMEKYPVANEDGLFDTCYDFSGYK-EISVPKIDFEFAGG 419
>gi|88174577|gb|ABD39363.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
Group]
Length = 321
Score = 81.6 bits (200), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 86/344 (25%), Positives = 143/344 (41%), Gaps = 45/344 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+ T+V +G PA + IV +DTGS + W+ C+C C H + + + S+T +
Sbjct: 1 YVTSVGLGTPAKTQIVEIDTGSSISWVFCECDGC-H---------TNPRTFLQSRSTTCA 50
Query: 165 KVPCNSTLCELQKQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
KV C +++C L P +CP++V Y DG+ S G L +D L + +K
Sbjct: 51 KVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSY-QDGSASYGILYQDTLTFSDVQKIPS- 108
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC------- 272
+FGC G+ G +GL G+G SV L + FS C
Sbjct: 109 ----FTFGCNLDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPTFDGFSYCLPLQKSE 160
Query: 273 --FGSDGTGRISFGDKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFS---- 324
F S TG S G + + R+ + + + +SV G + S
Sbjct: 161 RGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSR 220
Query: 325 --AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
+FDSG+ +Y+ D A + +S+ L R + + CY + + P
Sbjct: 221 KGVVFDSGSELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRSVDEG-DMPA 277
Query: 383 VNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
++L G F + V V + ++CL +++V+IIG
Sbjct: 278 ISLHFDDGARFDLGSSGVFVERSVQEQDVWCLAFAPTESVSIIG 321
>gi|242070719|ref|XP_002450636.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
gi|241936479|gb|EES09624.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
Length = 410
Score = 81.3 bits (199), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 98/332 (29%), Positives = 138/332 (41%), Gaps = 64/332 (19%)
Query: 78 AAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CV 136
AA G+ +TPL +G Y + S+G P DTGSDL W C C
Sbjct: 64 AASGSAQTPLQLDSGGGAYDMT---------FSIGTPPQELSALADTGSDLIWAKCGACT 114
Query: 137 SCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE--LQKQCPSAGSNCPYQVRY-- 192
CV + S Y PN SS+ SK+PC+ +LC QC + G+ C Y+ Y
Sbjct: 115 RCVPQGSPS---------YYPNKSSSFSKLPCSGSLCSDLPSSQCSAGGAECDYKYSYGL 165
Query: 193 LSDGTMST-GFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMD 251
SD T G+L + L +D I FGC + G + G+ +
Sbjct: 166 ASDPHHYTQGYLGSETFTLGSDAVPG------IGFGCTTMSEGGYGSGSG-------LVG 212
Query: 252 KTSVPSILANQGLIPNSFSMCFGSDG--TGRISFGDKGSPGQG--ETPFSLRQTHPTYNI 307
P L +Q L +FS C SD T + FG G G TP LR + Y +
Sbjct: 213 LGRGPLSLVSQ-LNVGAFSYCLTSDAAKTSPLLFGSGALTGAGVQSTPL-LRTSTYYYTV 270
Query: 308 TITQVSVGGNAVNFEFSA--IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP- 364
+ +S+G S+ IFDSGT+ +L +PAYT LAKE + T++L
Sbjct: 271 NLESISIGAATTAGTGSSGIIFDSGTTVAFLAEPAYT--------LAKEAVLSQTTNLTM 322
Query: 365 ------FEYCYVLSPNQTNFEYPVVNLTMKGG 390
+E C+ + +P + L GG
Sbjct: 323 ASGRDGYEVCF----QTSGAVFPSMVLHFDGG 350
>gi|22326716|ref|NP_196638.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|18700103|gb|AAL77663.1| AT5g10770/T30N20_40 [Arabidopsis thaliana]
gi|24111269|gb|AAN46758.1| At5g10770/T30N20_40 [Arabidopsis thaliana]
gi|332004211|gb|AED91594.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 474
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 95/303 (31%), Positives = 130/303 (42%), Gaps = 39/303 (12%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ V +G P + DTGSDL W C CV + I++P+ S++
Sbjct: 132 YIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQKEP--------IFNPSKSTSY 183
Query: 164 SKVPCNSTLC-ELQKQCPSAG----SNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
V C+S C L +AG SNC Y ++Y D + S GFL ++ L +
Sbjct: 184 YNVSCSSAACGSLSSATGNAGSCSASNCIYGIQY-GDQSFSVGFLAKEKFTLTNSD---- 238
Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDG- 277
V + FGCG G F A GL GLG DK S PS A FS C S
Sbjct: 239 -VFDGVYFGCGENNQGLFTGVA---GLLGLGRDKLSFPSQTATA--YNKIFSYCLPSSAS 292
Query: 278 -TGRISFGDKG-SPGQGETPFS-LRQTHPTYNITITQVSVGGNAVNFE---FS---AIFD 328
TG ++FG G S TP S + Y + I ++VGG + FS A+ D
Sbjct: 293 YTGHLTFGSAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVFSTPGALID 352
Query: 329 SGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP-FEYCYVLSPNQTNFEYPVVNLTM 387
SGT T L AY + +F AK + +TS + + C+ LS +T P V +
Sbjct: 353 SGTVITRLPPKAYAALRSSFK--AKMSKYPTTSGVSILDTCFDLSGFKT-VTIPKVAFSF 409
Query: 388 KGG 390
GG
Sbjct: 410 SGG 412
>gi|224082314|ref|XP_002306645.1| predicted protein [Populus trichocarpa]
gi|222856094|gb|EEE93641.1| predicted protein [Populus trichocarpa]
Length = 410
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 75/269 (27%), Positives = 111/269 (41%), Gaps = 34/269 (12%)
Query: 109 VSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
+++G P +F +DTGSDL W+ CD C C + +Y P ++ V
Sbjct: 58 LNIGNPPKAFDFDIDTGSDLTWVQCDAPCKGCTKPRD---------KLYKPK----NNLV 104
Query: 167 PCNSTLCEL-----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
PC+++LC+ C + C Y++ Y G+ S G L+ D L +
Sbjct: 105 PCSNSLCQAVSTGENYHCDAPDDQCDYEIEYADLGS-SIGVLLSDSFPLRL--SNGTLLQ 161
Query: 222 SRISFGCGRVQTGSFLDGAAP---NGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT 278
+++FGCG Q L P G+ GLG K S+ S L G+ N CF
Sbjct: 162 PKMAFGCGYDQ--KHLGPHPPPDTAGILGLGRGKVSILSQLRTLGITQNVVGHCFSRARG 219
Query: 279 GRISFGDKGSPGQ--GETPFSLRQTHPTYNITITQVSVGGNAVNFE-FSAIFDSGTSFTY 335
G + FGD P TP + Y+ ++ GG + IFDSG+S+TY
Sbjct: 220 GFLFFGDHLFPSSRITWTPMLRSSSDTLYSSGPAELLFGGKPTGIKGLQLIFDSGSSYTY 279
Query: 336 LNDPAYTQISETFNSLAKEKRETSTSDLP 364
N Y I N + K+ D P
Sbjct: 280 FNAQVYQSI---LNLVRKDLAGKPLKDAP 305
>gi|357482719|ref|XP_003611646.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355512981|gb|AES94604.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 640
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 82/324 (25%), Positives = 140/324 (43%), Gaps = 34/324 (10%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
T + +G P F + +DTGS + ++PC +C H G+ D + P+ S T V
Sbjct: 91 TRLWIGTPPQRFALIVDTGSTVTYVPCS--TCEH-----CGRHQDPK-FQPDLSETYQPV 142
Query: 167 PCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISF 226
C C C + C Y +Y ++ + S+G L EDV+ S+ R F
Sbjct: 143 KCTPD-C----NCDGDTNQCMYDRQY-AEMSSSSGVLGEDVVSFG---NLSELAPQRAVF 193
Query: 227 GCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISFG 284
GC +TG A +G+ GLG S+ L ++ +I +SFS+C+G G G + G
Sbjct: 194 GCENDETGDLYSQRA-DGIMGLGRGDLSIMDQLVDKKVISDSFSLCYGGMDVGGGAMILG 252
Query: 285 DKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-------EFSAIFDSGTSFTYLN 337
P S P YNI + ++ V G + + + DSGT++ YL
Sbjct: 253 GISPPEDMVFTHSDPDRSPYYNINLKEMHVAGKKLQLNPKVFDGKHGTVLDSGTTYAYLP 312
Query: 338 DPAYTQISETFNSLAKEKRETSTSDLPF-EYCYV---LSPNQTNFEYPVVNLTMKGGGPF 393
+ A+ ++ + D + + C+ + +Q +PVV++ + G
Sbjct: 313 ETAFLAFKRAIMKERNSLKQINGPDPNYKDICFTGAGIDVSQLAKSFPVVDMVFENGHKL 372
Query: 394 FVN-DPIVIVSSEPKGLYLYCLGV 416
++ + + S+ +G YCLGV
Sbjct: 373 SLSPENYLFRHSKVRG--AYCLGV 394
>gi|15238250|ref|NP_196637.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|8979710|emb|CAB96831.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
thaliana]
gi|18176136|gb|AAL59990.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
gi|22136986|gb|AAM91722.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
gi|110740988|dbj|BAE98588.1| nucleoid DNA-binding protein cnd41 - like protein [Arabidopsis
thaliana]
gi|332004210|gb|AED91593.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 464
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 89/296 (30%), Positives = 126/296 (42%), Gaps = 34/296 (11%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ + +G P + DTGSDL W C+ C+ + S + FN P++SST
Sbjct: 132 YIVTIGIGTPKHDLSLVFDTGSDLTWTQCEPCLGSCY-----SQKEPKFN---PSSSSTY 183
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
V C+S +CE + C + SNC Y + Y D + + GFL ++ L + V
Sbjct: 184 QNVSCSSPMCEDAESC--SASNCVYSIVY-GDKSFTQGFLAKEKFTLTNSD-----VLED 235
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDGTGR 280
+ FGCG G F A GL + + + N N FS C F S+ TG
Sbjct: 236 VYFGCGENNQGLFDGVAGLLGLGPGKLSLPAQTTTTYN-----NIFSYCLPSFTSNSTGH 290
Query: 281 ISFGDKG-SPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF---EFS---AIFDSGTSF 333
++FG G S TP S + Y I I +SVG + FS AI DSGT F
Sbjct: 291 LTFGSAGISESVKFTPISSFPSAFNYGIDIIGISVGDKELAITPNSFSTEGAIIDSGTVF 350
Query: 334 TYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKG 389
T L Y ++ F + TS L F+ CY + T YP + + G
Sbjct: 351 TRLPTKVYAELRSVFKEKMSSYKSTSGYGL-FDTCYDFTGLDT-VTYPTIAFSFAG 404
>gi|145356007|ref|XP_001422234.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144582474|gb|ABP00551.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 488
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 83/341 (24%), Positives = 148/341 (43%), Gaps = 54/341 (15%)
Query: 117 SFIVALDTGSDLFWLPCD-CVSC---VHGLNSSSGQVIDFNIYSPNTSSTSSKVPC---- 168
++ + +DTGS ++PC C C HG Y + S ++ C
Sbjct: 50 TYDLIVDTGSARTYVPCKGCARCGEHAHGY------------YDYDRSMEFERLDCGEAS 97
Query: 169 NSTLCE--LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISF 226
++TLCE ++ C S G C Y V Y ++G+ S G++V D + L ++ + ++F
Sbjct: 98 DATLCEETMKGTCQSDG-RCSYVVSY-AEGSSSRGYVVRDRVRLG-----EGTLSAMLAF 150
Query: 227 GCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDG----TG 279
GC +T + + A +GLFG G +V + LA+ GLI N FS C FG++G G
Sbjct: 151 GCEEAETNAIYEQKA-DGLFGFGRGTATVHAQLASAGLIENVFSFCVEGFGANGGVLTLG 209
Query: 280 RISFGDKGSPGQGETPFSLRQTHPTY-NITITQVSVGGNAVNF--EFSAIFDSGTSFTYL 336
R FG +P TP +P + N+ + +G + + ++ DSGT+FT++
Sbjct: 210 RFDFG-ADAPALARTPLVADPANPAFHNVRTSSWKLGDSLIEHLNSYTTTLDSGTTFTFV 268
Query: 337 NDPAYTQISETFNSLAKEKRETSTSDLPFEY---CYVLSPNQTNFE---------YPVVN 384
+ ++ A + + +Y CY +S N +P +
Sbjct: 269 PRSVWVSFKTRLDTQATQAGLEIVAGPDPQYDDVCYGVSAAAMNMTLSQSTVSEWFPPLT 328
Query: 385 LTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNII 425
+ +GG + + + E +C+G+ + N I+
Sbjct: 329 IAYEGGVSLTLGPENYLFAHETNSA-AFCVGIFANPNNQIL 368
>gi|195658449|gb|ACG48692.1| hypothetical protein [Zea mays]
gi|413938915|gb|AFW73466.1| hypothetical protein ZEAMMB73_105703 [Zea mays]
Length = 149
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 55/133 (41%), Positives = 69/133 (51%), Gaps = 25/133 (18%)
Query: 29 TFGFDFHHRYSD-------PVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQG 81
TF HR SD P G+ P++GS YY AL D + + R LA +
Sbjct: 26 TFSSRMVHRLSDEARLEAGPRMGLW-----PQRGSGGYYRALLRSD--LQRQKRRLAGKN 78
Query: 82 N----DKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVS 137
K TFS GND LG+L+Y V VG P SF+VALDTGSDLFW+PCDC+
Sbjct: 79 QLLSLSKGGSTFSPGND------LGWLYYAWVDVGTPTTSFLVALDTGSDLFWVPCDCIQ 132
Query: 138 CVHGLNSSSGQVI 150
C L+S G ++
Sbjct: 133 CAP-LSSYRGNLV 144
>gi|302758750|ref|XP_002962798.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
gi|300169659|gb|EFJ36261.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
Length = 427
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 92/373 (24%), Positives = 164/373 (43%), Gaps = 57/373 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
++ + VG PA F + +DTGSDL W+ C+ + NSSS Y ++SS+
Sbjct: 59 YFVELRVGTPAKKFPLIVDTGSDLTWIQCNPPNTT--ANSSSPPA---PWYDKSSSSSYR 113
Query: 165 KVPCNSTLCE-----LQKQCP-SAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
++PC C+ + C ++ S C Y Y SD + +TG L + + + + ++ K
Sbjct: 114 EIPCTDDECQFLPAPIGSSCSITSPSPCDYTYGY-SDQSRTTGILAYETISMKSRKRSGK 172
Query: 219 SVDSR---------ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSF 269
+ ++ GC R G+ GA+ G+ GLG S+ + + L F
Sbjct: 173 RAGNHKTRRIRIKNVALGCSRESVGASFLGAS--GVLGLGQGPISLATQTRHTAL-GGIF 229
Query: 270 SMCF-----GSDGTGRISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEF 323
S C GS+ + + G TP + Y + +T V+V G V+
Sbjct: 230 SYCLVDYLRGSNASSFLVMGRTHWRKLAHTPIVRNPAAQSFYYVNVTGVAVDGKPVDGIA 289
Query: 324 SA------------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP--FEYCY 369
S+ IFDSGT+ +YL +PAY+++ N+ R ++P FE CY
Sbjct: 290 SSDWGIDGDGNKGTIFDSGTTLSYLREPAYSKVLGALNASIYLPR---AQEIPEGFELCY 346
Query: 370 VLSPNQTNFE--YPVVNLTMKGGGPFFV--NDPIVIVSSEPKGLYLYCLGVVKSDNV--N 423
N T E P + + +GG + N+ +V+V+ + + L + N+ N
Sbjct: 347 ----NVTRMEKGMPKLGVEFQGGAVMELPWNNYMVLVAENVQCVALQKVTTTNGSNILGN 402
Query: 424 IIGREYPIANNIS 436
++ +++ I +++
Sbjct: 403 LLQQDHHIEYDLA 415
>gi|15217764|ref|NP_176663.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|5042418|gb|AAD38257.1|AC006193_13 Hypothetical Protein [Arabidopsis thaliana]
gi|332196174|gb|AEE34295.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 431
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 92/341 (26%), Positives = 147/341 (43%), Gaps = 42/341 (12%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ N+S+G P + + DTGSDL W C+ C C + ++ P SST
Sbjct: 86 YLMNISIGTPPVPILAIADTGSDLIWTQCNPCEDCYQQTSP---------LFDPKESSTY 136
Query: 164 SKVPCNSTLCELQKQ--CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
KV C+S+ C + C + + C Y + Y D + + G + D + + + ++ S+
Sbjct: 137 RKVSCSSSQCRALEDASCSTDENTCSYTITY-GDNSYTKGDVAVDTVTMGSSGRRPVSLR 195
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDG- 277
+ I GCG TG+F A +G+ GLG TS+ S L I FS C F S+
Sbjct: 196 NMI-IGCGHENTGTF--DPAGSGIIGLGGGSTSLVSQLRKS--INGKFSYCLVPFTSETG 250
Query: 278 -TGRISFGDKG-SPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNF--------EFSA 325
T +I+FG G G G S+ + P Y + + +SVG + F E +
Sbjct: 251 LTSKINFGTNGIVSGDGVVSTSMVKKDPATYYFLNLEAISVGSKKIQFTSTIFGTGEGNI 310
Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNL 385
+ DSGT+ T L Y ++ S K +R D CY + ++F+ P + +
Sbjct: 311 VIDSGTTLTLLPSNFYYELESVVASTIKAER-VQDPDGILSLCY---RDSSSFKVPDITV 366
Query: 386 TMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
KGG N + SE + C ++ + I G
Sbjct: 367 HFKGGDVKLGNLNTFVAVSED----VSCFAFAANEQLTIFG 403
>gi|302769978|ref|XP_002968408.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
gi|300164052|gb|EFJ30662.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
Length = 492
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 92/345 (26%), Positives = 147/345 (42%), Gaps = 50/345 (14%)
Query: 105 HYTN-VSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+YT+ V +G P F + +DTGS + ++PC SC H N + +SP SS+
Sbjct: 34 YYTSRVKIGTPPHEFSLIVDTGSTVTYVPCS--SCTHCGNHQDPR------FSPALSSSY 85
Query: 164 SKVPCNST----LCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
+ C S C+ ++ YQ +Y T S+G L +DV+ + S
Sbjct: 86 KPLECGSECSTGFCDGSRK---------YQRQYAEKST-SSGVLGKDVIGFS---NSSDL 132
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDG 277
R+ FGC +TG D A +G+ GLG S+ L + + + FS+C+G +G
Sbjct: 133 GGQRLVFGCETAETGDLYDQTA-DGIIGLGRGPLSIIDQLVEKNAMEDVFSLCYGGMDEG 191
Query: 278 TGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-------EFSAIFDSG 330
G + G P S P YN+ + + VGG+ + ++ + DSG
Sbjct: 192 GGAMILGGFQPPKDMVFTASDPHRSPYYNLMLKGIRVGGSPLRLKPEVFDGKYGTVLDSG 251
Query: 331 TSFTYLNDPAYTQISETFNSLAKEK----RETSTSDLPF-EYCYV-LSPNQTNFE--YPV 382
T++ Y A+ + F S KE+ +E D F + CY N +N +P
Sbjct: 252 TTYAYFPGAAF----QAFKSAVKEQVGSLKEVPGPDEKFKDICYAGAGTNVSNLSQFFPS 307
Query: 383 VNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKS-DNVNIIG 426
V+ G G P + K YCLGV ++ D ++G
Sbjct: 308 VDFVF-GDGQSVTLSPENYLFRHTKISGAYCLGVFENGDPTTLLG 351
>gi|225430551|ref|XP_002283470.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 490
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 94/337 (27%), Positives = 137/337 (40%), Gaps = 34/337 (10%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ V +G P DTGSDL W C+ CV + +I+ P+TS +
Sbjct: 147 YVVTVGLGSPKRDLTFIFDTGSDLTWTQCEPCVGYCYQQRE--------HIFDPSTSLSY 198
Query: 164 SKVPCNSTLCELQKQCPS-----AGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
S V C+S CE + + S C Y +RY DG+ S GF + L L S
Sbjct: 199 SNVSCDSPSCEKLESATGNSPGCSSSTCLYGIRY-GDGSYSIGFFAREKLSLT-----ST 252
Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT 278
V + FGCG+ G F A GL GL + S+ S A + S+ + S T
Sbjct: 253 DVFNNFQFGCGQNNRGLFGGTA---GLLGLARNPLSLVSQTAQKYGKVFSYCLPSSSSST 309
Query: 279 GRISF--GDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA------IFDS 329
G +SF GD S TP + +P+ Y + + +SVG + S I DS
Sbjct: 310 GYLSFGSGDGDSKAVKFTPSEVNSDYPSFYFLDMVGISVGERKLPIPKSVFSTAGTIIDS 369
Query: 330 GTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKG 389
GT + L Y+ + + F L + + + CY LS +T + P + L G
Sbjct: 370 GTVISRLPPTVYSSVQKVFRELMSDYPRVKGVSI-LDTCYDLSKYKT-VKVPKIILYFSG 427
Query: 390 GGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
G + +I + + L G D V IIG
Sbjct: 428 GAEMDLAPEGIIYVLKVSQVCLAFAGNSDDDEVAIIG 464
>gi|255566008|ref|XP_002523992.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536719|gb|EEF38360.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 81/299 (27%), Positives = 124/299 (41%), Gaps = 35/299 (11%)
Query: 108 NVSVGQPALSFIVALDTGSDLFWLPCDCVS-CVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
+S+G P +V + TGSDL W+PC C H D + P SST V
Sbjct: 101 KISIGIPPTELLVNVATGSDLVWIPCLSFKPCTHN--------CDLRFFDPMESSTYKNV 152
Query: 167 PCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISF 226
PC+S C++ S+C Y + G L D L L + +S + F
Sbjct: 153 PCDSYRCQITNAATCQFSDCFYSCDPRHQDSCPDGDLAMDTLTLNSTTGKSFML-PNTGF 211
Query: 227 GCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDGTGRISF 283
CG G + G+ GLG S+ + +++ LI FS C + S+ T ++SF
Sbjct: 212 ICGNRIGGDY----PGVGILGLGHGSLSLLNRISH--LIDGKFSHCIVPYSSNQTSKLSF 265
Query: 284 GDKGSPGQGETPFSLR--QTHPTYNITIT---------QVSVGGNAVNFEFSAI-FDSGT 331
GDK G FS R T Y+ T++ +S GG ++ + + DSGT
Sbjct: 266 GDKAVV-SGSAMFSTRLDMTGGPYSYTLSFYGISVGNKSISAGGIGSDYYMNGLGMDSGT 324
Query: 332 SFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGG 390
FTY + Y+Q+ +++ CY SP +F P + + +GG
Sbjct: 325 MFTYFPEYFYSQLEYDVRYAIQQEPLYPDPTRRLRLCYRYSP---DFSPPTITMHFEGG 380
>gi|125527523|gb|EAY75637.1| hypothetical protein OsI_03542 [Oryza sativa Indica Group]
Length = 446
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 86/269 (31%), Positives = 118/269 (43%), Gaps = 52/269 (19%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ V VG P+ ++ +DTGSDL WL C C C + GQV D P SST
Sbjct: 86 YFALVGVGTPSTKAMLVIDTGSDLVWLQCSPCRRCY----AQRGQVFD-----PRRSSTY 136
Query: 164 SKVPCNSTLCELQK--QCPS---AGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
+VPC+S C + C S AG C Y V Y DG+ STG L D L A D
Sbjct: 137 RRVPCSSPQCRALRFPGCDSGGAAGGGCRYMVAY-GDGSSSTGELATDKLAFAND----- 190
Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT 278
+ + ++ GCGR G F D AA GL G+ K S+ + +A + F C G D T
Sbjct: 191 TYVNNVTLGCGRDNEGLF-DSAA--GLLGVARGKISISTQVAPA--YGSVFEYCLG-DRT 244
Query: 279 GR------ISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGGNAVNFEFSA--- 325
R + FG +P T F+ ++P Y + + SVGG V +A
Sbjct: 245 SRSTRSSYLVFGR--TPEPPSTAFTALLSNPRRPSLYYVDMAGFSVGGERVTGFSNASLA 302
Query: 326 ----------IFDSGTSFTYLNDPAYTQI 344
+ DSGT+ + AY +
Sbjct: 303 LDTATGRGGVVVDSGTAISRFARDAYAAL 331
>gi|168000296|ref|XP_001752852.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696015|gb|EDQ82356.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 384
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 98/349 (28%), Positives = 147/349 (42%), Gaps = 55/349 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ +++G P SF V +DTGSDL W+ C C C G D P+ S +
Sbjct: 39 YLMTLTLGSPPQSFDVIVDTGSDLNWVQCLPCRVCYQ----QPGPKFD-----PSKSRSF 89
Query: 164 SKVPCNSTLCELQ----KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
K C LC + K C A + C YQ Y D + + G L + + L + ++S
Sbjct: 90 RKAACTDNLCNVSALPLKAC--AANVCQYQYTY-GDQSNTNGDLAFETISL-NNGAGTQS 145
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSD 276
V + +FGCG G+F A GL GLG S+ S L++ N FS C S
Sbjct: 146 VPN-FAFGCGTQNLGTFAGAA---GLVGLGQGPLSLNSQLSHT--FANKFSYCLVSLNSL 199
Query: 277 GTGRISFGDKGSPGQGE-TPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA--------- 325
++FG + + T + HPT Y + + + VGG +N S
Sbjct: 200 SASPLTFGSIAAAANIQYTSIVVNARHPTYYYVQLNSIEVGGQPLNLAPSVFAIDQSTGR 259
Query: 326 ---IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTS---DLPFEYCYVLSPNQTN-- 377
I DSGT+ T L PAY+ + + S R ++ DL F V +P+ +
Sbjct: 260 GGTIIDSGTTITMLTLPAYSAVLRAYESFVNYPRLDGSAYGLDLCFNIAGVSNPSVPDMV 319
Query: 378 FEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
F++ + M+G F V+V + L CL + S +IIG
Sbjct: 320 FKFQGADFQMRGENLF------VLVDTSATTL---CLAMGGSQGFSIIG 359
>gi|145693992|gb|ABP93696.1| unknown protein isoform 1 [Lemna minor]
Length = 350
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 89/309 (28%), Positives = 126/309 (40%), Gaps = 41/309 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
+ V G P + V DTGS++ W+ C VSC ++ P SST
Sbjct: 16 YVITVGFGTPKKNQTVIFDTGSNVNWIQCKPCVVSCYPQQEP---------LFDPTLSST 66
Query: 163 SSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
+ C S C +GS C Y V Y DG+ + GFL + LA + +V +
Sbjct: 67 YRNISCTSAACTGLSSRGCSGSTCVYGVTY-GDGSSTVGFLATETFTLA-----AGNVFN 120
Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--GSDGTGR 280
FGCG+ G F A GL GLG S+ S LA + N FS C S TG
Sbjct: 121 NFIFGCGQNNQGLFTGAA---GLIGLGRSPYSLNSQLATS--LGNIFSYCLPSTSSATGY 175
Query: 281 ISFGDK-GSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNF------EFSAIFDSGTS 332
++ G+ +P G T PT Y I + +SVGG + I DSGT
Sbjct: 176 LNIGNPLRTP--GYTAMLTNSRAPTLYFIDLIGISVGGTRLALSSTVFQSVGTIIDSGTV 233
Query: 333 FTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQT------NFEYPVVNLT 386
T L AY + F + + + + + + CY S T Y +++T
Sbjct: 234 ITRLPPTAYGALRTAFRAAMTQYTRAAAASI-LDTCYDFSRTTTVTFPTIKLHYTGLDVT 292
Query: 387 MKGGGPFFV 395
+ G G F+V
Sbjct: 293 IPGAGVFYV 301
>gi|294461757|gb|ADE76437.1| unknown [Picea sitchensis]
Length = 325
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 79/277 (28%), Positives = 125/277 (45%), Gaps = 28/277 (10%)
Query: 122 LDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC-ELQKQC 179
+DTGSD+ W+ CD C C +S ++ P S+T +PCNST+C +LQ
Sbjct: 5 IDTGSDITWIQCDPCPQCYKQQDS---------LFQPAGSATYKPLPCNSTMCQQLQSFS 55
Query: 180 PSA-GSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLD 238
S S+C Y V Y D + + G + L L +D+ SV +FGCG G F +
Sbjct: 56 HSCLNSSCNYMVSY-GDKSTTRGDFALETLTLRSDDTILVSV-PNFAFGCGHANKGLF-N 112
Query: 239 GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDG----TGRISFGDKGSPGQGE- 293
GAA GL GLG P+ FS C S +G + FG+
Sbjct: 113 GAA--GLMGLGKSSIGFPA--QTSVAFGKVFSYCLPSVSSTIPSGILHFGEAAMLDYDVR 168
Query: 294 -TPFSLRQTHPT-YNITITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSL 351
TP + P+ Y +++T ++VG + + + DSGT + AY ++ + F +
Sbjct: 169 FTPLVDSSSGPSQYFVSMTGINVGDELLPISATVMVDSGTVISRFEQSAYERLRDAFTQI 228
Query: 352 AKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMK 388
+T+ S PF+ C+ +S + P++ L +
Sbjct: 229 LP-GLQTAVSVAPFDTCFRVS-TVDDINIPLITLHFR 263
>gi|224126221|ref|XP_002329620.1| predicted protein [Populus trichocarpa]
gi|222870359|gb|EEF07490.1| predicted protein [Populus trichocarpa]
Length = 357
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 92/339 (27%), Positives = 136/339 (40%), Gaps = 43/339 (12%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++T V VG PA F + LDTGSD+ WL C C C + I+ P SST
Sbjct: 20 YFTRVGVGNPARQFYMVLDTGSDINWLQCQPCTDCYQQTDP---------IFDPTASSTY 70
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
+ V C S C + C YQV Y DG+ + G + + S SV +
Sbjct: 71 APVTCQSQQCSSLEMSSCRSGQCLYQVNY-GDGSYTFGDFATESVSFG----NSGSVKN- 124
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISF 283
++ GCG G F+ A + P L NQ L SFS C + + S
Sbjct: 125 VALGCGHDNEGLFVGAAG-------LLGLGGGPLSLTNQ-LKATSFSYCLVNRDSAGSST 176
Query: 284 GDKGSPGQGETPFSL-----RQTHPTYNITITQVSVGGNAVNFEFSA-----------IF 327
D S G + R+ Y + ++ +SVGG V+ S I
Sbjct: 177 LDFNSAQLGVDSVTAPLMKNRKIDTFYYVGLSGMSVGGQMVSIPESTFRLDESGNGGIIV 236
Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTM 387
D GT+ T L AY + + F + + + TS L F+ CY LS Q + P V+
Sbjct: 237 DCGTAITRLQTQAYNPLRDAFVRMTQNLKLTSAVAL-FDTCYDLS-GQASVRVPTVSFHF 294
Query: 388 KGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
G + + ++ + G Y + S +++IIG
Sbjct: 295 ADGKSWNLPAANYLIPVDSAGTYCFAFAPTTS-SLSIIG 332
>gi|356495754|ref|XP_003516738.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 420
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 90/306 (29%), Positives = 130/306 (42%), Gaps = 38/306 (12%)
Query: 105 HY-TNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
HY +S+G P DTGSDL W C C +C N ++ P S+T
Sbjct: 71 HYLMELSIGTPPFKIYGIADTGSDLTWTSCVPCNNCYKQRNP---------MFDPQKSTT 121
Query: 163 SSKVPCNSTLC-ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
+ C+S LC +L S C Y Y S ++ G L ++ + L++ + +S +
Sbjct: 122 YRNISCDSKLCHKLDTGVCSPQKRCNYTYAYAS-AAITRGVLAQETITLSSTKGKSVPLK 180
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSD-- 276
I FGCG TG F D G+ GLG S+ S + + FS C F +D
Sbjct: 181 G-IVFGCGHNNTGGFNDHEM--GIIGLGGGPVSLISQMGSS-FGGKRFSQCLVPFHTDVS 236
Query: 277 GTGRISFGDKGSPGQGE----TPFSLRQTHPTYNITITQVSV-------GGNAVNFEFSA 325
+ ++SFG KGS G+ TP +Q Y +T+ +SV G++ N E
Sbjct: 237 VSSKMSFG-KGSKVSGKGVVSTPLVAKQDKTPYFVTLLGISVENTYLHFNGSSQNVEKGN 295
Query: 326 IF-DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVN 384
+F DSGT T L Y Q+ S K T DL + CY + N PV+
Sbjct: 296 MFLDSGTPPTILPTQLYDQVVAQVRSEVAMKPVTDDPDLGPQLCYR---TKNNLRGPVLT 352
Query: 385 LTMKGG 390
+G
Sbjct: 353 AHFEGA 358
>gi|255566835|ref|XP_002524401.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223536362|gb|EEF38012.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 455
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 89/326 (27%), Positives = 135/326 (41%), Gaps = 54/326 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ ++ +G P + ++ DTGSDL W+ C C +C H S+ + S+T
Sbjct: 86 YFVSLRIGTPPQTLLLVADTGSDLIWVKCSPCRNCSHRSPGSA--------FFARHSTTY 137
Query: 164 SKVPCNSTLCELQKQCPSAGSN-------CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
S + C S C+L N C YQ Y +D + +TGF ++ L L T +
Sbjct: 138 SAIHCYSPQCQLVPHPHPNPCNRTRLHSPCRYQYTY-ADSSTTTGFFSKEALTLNTSTGK 196
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAA---PNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
K ++ +SFGCG +G L GA+ G+ GLG S S L + + FS C
Sbjct: 197 VKKLNG-LSFGCGFRISGPSLTGASFEGAQGVMGLGRAPISFSSQLGRR--FGSKFSYCL 253
Query: 274 GS-------------DGTGRISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAV 319
G ++ KG TP + PT Y I I V V G +
Sbjct: 254 MDYTLSPPPTSFLTIGGAQNVAVSKKGI--MSFTPLLINPLSPTFYYIAIKGVYVNGVKL 311
Query: 320 NFEFSA-----------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP-FEY 367
S I DSGT+ T++ +PAYT+I + F + K + P F+
Sbjct: 312 PINPSVWSIDDLGNGGTIIDSGTTLTFITEPAYTEILKAFKK--RVKLPSPAEPTPGFDL 369
Query: 368 CYVLSPNQTNFEYPVVNLTMKGGGPF 393
C +S T P ++ + GG F
Sbjct: 370 CMNVS-GVTRPALPRMSFNLAGGSVF 394
>gi|356532386|ref|XP_003534754.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 463
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 101/364 (27%), Positives = 159/364 (43%), Gaps = 51/364 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+Y + +G PA F + +DTGS L WL C CV H V I++P+TS T
Sbjct: 113 YYVKIGLGTPAKYFSMIVDTGSSLSWLQCQPCVIYCH--------VQVDPIFTPSTSKTY 164
Query: 164 SKVPCNSTLCELQKQ-------CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
+PC+S+ C K C +A C Y+ Y D + S G+L +DVL L E
Sbjct: 165 KALPCSSSQCSSLKSSTLNAPGCSNATGACVYKASY-GDTSFSIGYLSQDVLTLTPSEAP 223
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD 276
S S +GCG+ G F +G+ GL DK S+ L+ + N+FS C S
Sbjct: 224 S----SGFVYGCGQDNQGLF---GRSSGIIGLANDKISMLGQLSKK--YGNAFSYCLPSS 274
Query: 277 G--------TGRISFGDKG--SPGQGETPFSLRQTHPT-YNITITQVSVGG-----NAVN 320
+G +S G S TP Q P+ Y + +T ++V G +A +
Sbjct: 275 FSAPNSSSLSGFLSIGASSLTSSPYKFTPLVKNQKIPSLYFLDLTTITVAGKPLGVSASS 334
Query: 321 FEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEY 380
+ I DSGT T L Y + ++F + +K + + C+ S + +
Sbjct: 335 YNVPTIIDSGTVITRLPVAVYNALKKSFVLIMSKKYAQAPGFSILDTCFKGSVKEMS-TV 393
Query: 381 PVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDN-VNIIG----REYPIANNI 435
P + + +GG + +V E KG CL + S N ++IIG + + +A ++
Sbjct: 394 PEIQIIFRGGAGLELKAHNSLVEIE-KG--TTCLAIAASSNPISIIGNYQQQTFKVAYDV 450
Query: 436 SLFH 439
+ F
Sbjct: 451 ANFK 454
>gi|217074470|gb|ACJ85595.1| unknown [Medicago truncatula]
gi|388505166|gb|AFK40649.1| unknown [Medicago truncatula]
Length = 452
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 89/354 (25%), Positives = 146/354 (41%), Gaps = 62/354 (17%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+Y + +G P + + +DTGS WL C C H + + +++P+ S T
Sbjct: 103 YYVKMGLGSPTKYYTMIVDTGSSFSWLQCQPCTIYCH--------IQEDPVFNPSASKTY 154
Query: 164 SKVPCN---------STLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
VPC+ +TL E C + C Y+ Y D + S G+L +DVL L +
Sbjct: 155 KTVPCSSSQCSSLKSATLNE--PTCSKQSNACVYKASY-GDSSFSLGYLSQDVLTLTPSQ 211
Query: 215 KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG 274
S V +GCG+ G F +G+ GL ++ S+ S L+ G N+FS C
Sbjct: 212 TLSSFV-----YGCGQDNQGLF---GRTDGIIGLANNELSMLSQLS--GKYGNAFSYCLP 261
Query: 275 SDGTGRISFGDKGSPGQGE----------------TPFSLRQTHPT-YNITITQVSVGGN 317
+ SF SP +G TP +P+ Y I + ++V G
Sbjct: 262 T------SFSTPNSPKEGFLSIGTSSLTPSSSYKFTPLLKNPNNPSLYFIDLESITVAGR 315
Query: 318 -----AVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLS 372
A +++ I DSGT T L P YT + + ++ +K + + + C+ S
Sbjct: 316 PLGVAASSYKVPTIIDSGTVITRLPTPVYTTLKNAYVTILSKKYQQAPGISLLDTCFKGS 375
Query: 373 PNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
+ P + + KGG + +V E + CL + S ++ IIG
Sbjct: 376 LAGISEVAPDIRIIFKGGADLQLKGHNSLVELETG---ITCLAMAGSSSIAIIG 426
>gi|118484458|gb|ABK94105.1| unknown [Populus trichocarpa]
Length = 499
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 92/339 (27%), Positives = 136/339 (40%), Gaps = 43/339 (12%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++T V VG PA F + LDTGSD+ WL C C C + I+ P SST
Sbjct: 161 YFTRVGVGNPARQFYMVLDTGSDINWLQCQPCTDCYQQTDP---------IFDPTASSTY 211
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
+ V C S C + C YQV Y DG+ + G + + S SV +
Sbjct: 212 APVTCQSQQCSSLEMSSCRSGQCLYQVNY-GDGSYTFGDFATESVSFG----NSGSVKN- 265
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISF 283
++ GCG G F+ A + P L NQ L SFS C + + S
Sbjct: 266 VALGCGHDNEGLFVGAAG-------LLGLGGGPLSLTNQ-LKATSFSYCLVNRDSAGSST 317
Query: 284 GDKGSPGQGETPFSL-----RQTHPTYNITITQVSVGGNAVNFEFSA-----------IF 327
D S G + R+ Y + ++ +SVGG V+ S I
Sbjct: 318 LDFNSAQLGVDSVTAPLMKNRKIDTFYYVGLSGMSVGGQMVSIPESTFRLDESGNGGIIV 377
Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTM 387
D GT+ T L AY + + F + + + TS L F+ CY LS Q + P V+
Sbjct: 378 DCGTAITRLQTQAYNPLRDAFVRMTQNLKLTSAVAL-FDTCYDLS-GQASVRVPTVSFHF 435
Query: 388 KGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
G + + ++ + G Y + S +++IIG
Sbjct: 436 ADGKSWNLPAANYLIPVDSAGTYCFAFAPTTS-SLSIIG 473
>gi|116831531|gb|ABK28718.1| unknown [Arabidopsis thaliana]
Length = 438
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 84/313 (26%), Positives = 126/313 (40%), Gaps = 52/313 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC----DCVSCVHGLNSSSGQVIDFNIYSPNTS 160
+ NVS+G P + DTGSDL W C DC + V L + P TS
Sbjct: 90 YLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPL------------FDPKTS 137
Query: 161 STSSKVPCNSTLC---ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
ST V C+S+ C E Q C + + C Y + Y D + + G + D L L + + +
Sbjct: 138 STYKDVSCSSSQCTALENQASCSTNDNTCSYSLSY-GDNSYTKGNIAVDTLTLGSSDTRP 196
Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQ--GLIPNSFSMCF-- 273
+ I GCG G+F N + P L Q I FS C
Sbjct: 197 MQL-KNIIIGCGHNNAGTF------NKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVP 249
Query: 274 ---GSDGTGRISFGDKG---SPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA- 325
D T +I+FG G TP + + T Y +T+ +SVG + + S
Sbjct: 250 LTSKKDQTSKINFGTNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQYSGSDS 309
Query: 326 -------IFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEYCYVLSPNQTN 377
I DSGT+ T L Y+++ + +S+ EK++ S L CY + +
Sbjct: 310 ESSEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSL--CYSAT---GD 364
Query: 378 FEYPVVNLTMKGG 390
+ PV+ + G
Sbjct: 365 LKVPVITMHFDGA 377
>gi|357448247|ref|XP_003594399.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355483447|gb|AES64650.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 452
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 89/354 (25%), Positives = 146/354 (41%), Gaps = 62/354 (17%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+Y + +G P + + +DTGS WL C C H + + +++P+ S T
Sbjct: 103 YYVKMGLGSPTKYYTMIVDTGSSFSWLQCQPCTIYCH--------IQEDPVFNPSASKTY 154
Query: 164 SKVPCN---------STLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
VPC+ +TL E C + C Y+ Y D + S G+L +DVL L +
Sbjct: 155 KTVPCSSSQCSSLKSATLNE--PTCSKQSNACVYKASY-GDSSFSLGYLSQDVLTLTPSQ 211
Query: 215 KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG 274
S V +GCG+ G F +G+ GL ++ S+ S L+ G N+FS C
Sbjct: 212 TLSSFV-----YGCGQDNQGLF---GRTDGIIGLANNELSMLSQLS--GKYGNAFSYCLP 261
Query: 275 SDGTGRISFGDKGSPGQGE----------------TPFSLRQTHPT-YNITITQVSVGGN 317
+ SF SP +G TP +P+ Y I + ++V G
Sbjct: 262 T------SFSTPNSPKEGFLSIGTSSLTPSSSYKFTPLLKNPNNPSLYFIDLESITVAGR 315
Query: 318 -----AVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLS 372
A +++ I DSGT T L P YT + + ++ +K + + + C+ S
Sbjct: 316 PLGVAASSYKVPTIIDSGTVITRLPTPVYTTLKNAYVTILSKKYQQAPGISLLDTCFKGS 375
Query: 373 PNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
+ P + + KGG + +V E + CL + S ++ IIG
Sbjct: 376 LAGISEVAPDIRIIFKGGADLQLKGHNSLVELETG---ITCLAMAGSSSIAIIG 426
>gi|15242803|ref|NP_198319.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75130158|sp|Q6XBF8.1|CDR1_ARATH RecName: Full=Aspartic proteinase CDR1; AltName: Full=Protein
CONSTITUTIVE DISEASE RESISTANCE 1; Flags: Precursor
gi|37935737|gb|AAP72988.1| CDR1 [Arabidopsis thaliana]
gi|91806924|gb|ABE66189.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|109946613|gb|ABG48485.1| At5g33340 [Arabidopsis thaliana]
gi|332006513|gb|AED93896.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 437
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 84/313 (26%), Positives = 126/313 (40%), Gaps = 52/313 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC----DCVSCVHGLNSSSGQVIDFNIYSPNTS 160
+ NVS+G P + DTGSDL W C DC + V L + P TS
Sbjct: 90 YLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPL------------FDPKTS 137
Query: 161 STSSKVPCNSTLC---ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
ST V C+S+ C E Q C + + C Y + Y D + + G + D L L + + +
Sbjct: 138 STYKDVSCSSSQCTALENQASCSTNDNTCSYSLSY-GDNSYTKGNIAVDTLTLGSSDTRP 196
Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQ--GLIPNSFSMCF-- 273
+ I GCG G+F N + P L Q I FS C
Sbjct: 197 MQL-KNIIIGCGHNNAGTF------NKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVP 249
Query: 274 ---GSDGTGRISFGDKG---SPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA- 325
D T +I+FG G TP + + T Y +T+ +SVG + + S
Sbjct: 250 LTSKKDQTSKINFGTNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQYSGSDS 309
Query: 326 -------IFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEYCYVLSPNQTN 377
I DSGT+ T L Y+++ + +S+ EK++ S L CY + +
Sbjct: 310 ESSEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSL--CYSAT---GD 364
Query: 378 FEYPVVNLTMKGG 390
+ PV+ + G
Sbjct: 365 LKVPVITMHFDGA 377
>gi|242091325|ref|XP_002441495.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
gi|241946780|gb|EES19925.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
Length = 466
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 84/354 (23%), Positives = 139/354 (39%), Gaps = 62/354 (17%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
++ + VG P F + DTGSDL W+ C +G ++ P TS + +
Sbjct: 116 YFVKLRVGTPVQEFTLVADTGSDLTWVKC------------AGASPPGRVFRPKTSRSWA 163
Query: 165 KVPCNSTLCELQ-----KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
+PC+S C+L C S S C Y RY + G + + +A +
Sbjct: 164 PIPCSSDTCKLDVPFTLANCSSPASPCTYDYRYKEGSAGARGIVGTESATIALPGGKVAQ 223
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-----G 274
+ + GC G A +G+ LG K S + A + SFS C
Sbjct: 224 LKD-VVLGCSSSHDGQSFRSA--DGVLSLGNAKISFATQAAAR--FGGSFSYCLVDHLAP 278
Query: 275 SDGTGRISFGDKGSPGQ------GETPFSLRQTHPTYNITITQVSVGGNAVNFEFS---- 324
+ TG ++FG PGQ +T L P Y + + + V G A++
Sbjct: 279 RNATGYLAFG----PGQVPRTPATQTKLFLDPEMPFYGVKVDAIHVAGKALDIPAEVWDA 334
Query: 325 ----AIFDSGTSFTYLNDPAYTQI----SETFNSLAKEKRETSTSDLPFEYCYVLSPNQT 376
I DSG + T L PAY + S+ + + K S PFE+CY + +
Sbjct: 335 KSGGVILDSGNTLTVLAAPAYKAVVAALSKHLDGVPK------VSFPPFEHCYNWTARRP 388
Query: 377 NFEYPVVNLTMKGGGPFFVNDPI--VIVSSEPKGLYLYCLGVVKSD--NVNIIG 426
+ L ++ G + P ++ +P + C+GV + + +++IG
Sbjct: 389 GAPEIIPKLAVQFAGSARLEPPAKSYVIDVKPG---VKCIGVQEGEWPGLSVIG 439
>gi|224083514|ref|XP_002307058.1| predicted protein [Populus trichocarpa]
gi|222856507|gb|EEE94054.1| predicted protein [Populus trichocarpa]
Length = 376
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 88/281 (31%), Positives = 128/281 (45%), Gaps = 40/281 (14%)
Query: 106 YTNVS--VGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
Y NV+ +GQP+ + + +DTGSDL WL CD CV C + P
Sbjct: 19 YYNVTLNIGQPSKPYFLDVDTGSDLTWLQCDAPCVQCTEAPH-------------PYYRP 65
Query: 162 TSSKVPCNSTLCEL-----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLA-TDEK 215
++ VPC +C+ +C + G C Y+V Y +DG S G LV D +L T EK
Sbjct: 66 RNNLVPCMDPICQSLHSNGDHRCENPG-QCDYEVEY-ADGGSSFGVLVRDTFNLNFTSEK 123
Query: 216 QSKSVDSRISFG-CGRVQTGSFLDGAAP--NGLFGLGMDKTSVPSILANQGLIPNSFSMC 272
+ + ++ G CG Q F G+ +G+ GLG K+S+ S L++ GL+ N C
Sbjct: 124 RHSPL---LALGLCGYDQ---FPGGSHHPIDGVLGLGKGKSSIVSQLSSLGLVRNVIGHC 177
Query: 273 FGSDGTGRISFGDK--GSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE-FSAIFDS 329
G G + FGD S TP S H Y+ + +++ G F+ FDS
Sbjct: 178 LSGHGGGFLFFGDDLYDSSRVAWTPMSPDAKH--YSPGLAELTFDGKTTGFKNLLTTFDS 235
Query: 330 GTSFTYLNDPAYT-QISETFNSLAKEKRETSTSDLPFEYCY 369
G S+TYLN AY IS L+ + + D C+
Sbjct: 236 GASYTYLNSQAYQGLISLLKKELSGKPLREALDDQTLPLCW 276
>gi|449443786|ref|XP_004139658.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449475416|ref|XP_004154449.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 453
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 87/307 (28%), Positives = 127/307 (41%), Gaps = 44/307 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++T + VG P + LDTGSD+ WL C C C + I++P S +
Sbjct: 110 YFTRLGVGTPPRYLYMVLDTGSDVVWLQCSPCRKCYSQSDP---------IFNPYKSKSF 160
Query: 164 SKVPCNSTLCELQKQ--CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
+ +PC+S LC C + C YQV Y DG+ +TG + L ++
Sbjct: 161 AGIPCSSPLCRRLDSSGCSTRRHTCLYQVSY-GDGSFTTGDFATETLTFRGNKI------ 213
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GSDG 277
++++ GCG G F+ A GL + S I N + FS C S
Sbjct: 214 AKVALGCGHHNEGLFVGAAGLLGLGRGRLSFPSQTGIRFN-----HKFSYCLVDRSASSK 268
Query: 278 TGRISFGDKGSPGQGE-TPFSLRQTHPT-YNITITQVSVGGNAVN------FEFSA---- 325
+ FGD TP T Y + + +SVGG V F+ +
Sbjct: 269 PSSMVFGDAAISRLARFTPLIRNPKLDTFYYVGLIGISVGGVRVRGVSPSLFKLDSAGNG 328
Query: 326 --IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
I DSGTS T L PAYT + + F A+ + L F+ CY LS Q++ + P V
Sbjct: 329 GVIIDSGTSVTRLTRPAYTALRDAFRVGARHLKRGPEFSL-FDTCYDLS-GQSSVKVPTV 386
Query: 384 NLTMKGG 390
L +G
Sbjct: 387 VLHFRGA 393
>gi|356546370|ref|XP_003541599.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 434
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 78/266 (29%), Positives = 115/266 (43%), Gaps = 32/266 (12%)
Query: 110 SVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPC 168
SVG P +DTGSD+ WL C C C + I+ P+ S+T +P
Sbjct: 91 SVGIPPFQLYGIIDTGSDMIWLQCKPCEKCYNQTT---------RIFDPSKSNTYKILPF 141
Query: 169 NSTLCELQKQCPSAGSN---CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
+ST C+ + + N C Y + Y DG+ S G L + L L + S R
Sbjct: 142 SSTTCQSVEDTSCSSDNRKMCEYTI-YYGDGSYSQGDLSVETLTLGSTNGSSVKF-RRTV 199
Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQ-GLIPNSFSMCFG--SDGTGRIS 282
GCGR T SF +G + +G+ GLG S+ + L + I FS C S+ + +++
Sbjct: 200 IGCGRNNTVSF-EGKS-SGIVGLGNGPVSLINQLRRRSSSIGRKFSYCLASMSNISSKLN 257
Query: 283 FGDKG---SPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA---------IFDSG 330
FGD G TP Y +T+ SVG N + F S+ I DSG
Sbjct: 258 FGDAAVVSGDGTVSTPIVTHDPKVFYYLTLEAFSVGNNRIEFTSSSFRFGEKGNIIIDSG 317
Query: 331 TSFTYLNDPAYTQISETFNSLAKEKR 356
T+ T L + Y+++ L + R
Sbjct: 318 TTLTLLPNDIYSKLESAVADLVELDR 343
>gi|356540371|ref|XP_003538663.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 374
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 89/307 (28%), Positives = 129/307 (42%), Gaps = 39/307 (12%)
Query: 105 HY-TNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
HY VS+G P DTGSDL W C C C N I+ P S++
Sbjct: 24 HYLMEVSIGTPPFKIYGIADTGSDLTWTSCVPCNKCYKQRNP---------IFDPQKSTS 74
Query: 163 SSKVPCNSTLC-ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
+ C+S LC +L S +C Y Y S ++ G L ++ + L++ + +S +
Sbjct: 75 YRNISCDSKLCHKLDTGVCSPQKHCNYTYAYAS-AAITQGVLAQETITLSSTKGESVPLK 133
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSD-- 276
I FGCG TG F D G+ GLG S S + + FS C F +D
Sbjct: 134 G-IVFGCGHNNTGGFNDREM--GIIGLGGGPVSFISQIGSS-FGGKRFSQCLVPFHTDVS 189
Query: 277 GTGRISFGDKGSPGQGE----TPFSLRQTHPTYNITITQVSVGGNAVNFEFSA------- 325
+ ++S G KGS G+ TP +Q Y +T+ +SVG ++F S+
Sbjct: 190 VSSKMSLG-KGSEVSGKGVVSTPLVAKQDKTPYFVTLLGISVGNTYLHFNGSSSQSVEKG 248
Query: 326 --IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
DSGT T L Y ++ S K T+ DL + CY + N PV+
Sbjct: 249 NVFLDSGTPPTILPTQLYDRLVAQVRSEVAMKPVTNDLDLGPQLCYR---TKNNLRGPVL 305
Query: 384 NLTMKGG 390
+GG
Sbjct: 306 TAHFEGG 312
>gi|242066176|ref|XP_002454377.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
gi|241934208|gb|EES07353.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
Length = 474
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 97/350 (27%), Positives = 146/350 (41%), Gaps = 51/350 (14%)
Query: 65 HRDRYF--RLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYT-NVSVGQPALSFIVA 121
R Y R+ GRG + K + + N +G L+Y VS+G P ++ +
Sbjct: 98 RRAEYILRRVSGRGTPQLWDSKAEAATATVPANWGFN-IGTLNYVVTVSLGTPGVAQTLE 156
Query: 122 LDTGSDLFWL---PCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE---- 174
+DTGSDL W+ PC +C + ++ P SS+ + VPC +C
Sbjct: 157 VDTGSDLSWVQCTPCAAPACYSQKDP---------LFDPAQSSSYAAVPCGGPVCGGLGI 207
Query: 175 LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTG 234
C +A C Y V Y DG+ +TG D L L+ ++ FGCG Q+G
Sbjct: 208 YASSCSAA--QCGYVVSY-GDGSKTTGVYSSDTLTLSPNDAVRG-----FFFGCGHAQSG 259
Query: 235 SFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS--DGTGRISFGDKGSPGQG 292
+ +GL GLG ++ S+ + G FS C + TG ++ G G G
Sbjct: 260 FTGN----DGLLGLGREEASL--VEQTAGTYGGVFSYCLPTRPSTTGYLTLG--GPSGAA 311
Query: 293 ETPFSLRQ--THPT----YNITITQVSVGGNAVN-----FEFSAIFDSGTSFTYLNDPAY 341
FS Q + P Y + +T +SVGG ++ F + D+GT T L AY
Sbjct: 312 PPGFSTTQLLSSPNAATYYVVMLTGISVGGQQLSVPSSVFAGGTVVDTGTVITRLPPTAY 371
Query: 342 TQISETFNS-LAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGG 390
+ F S +A ++ + + CY S T P V LT GG
Sbjct: 372 AALRSAFRSGMASYGYPSAPATGILDTCYNFSGYGT-VTLPNVALTFSGG 420
>gi|61214233|sp|Q766C3.1|NEP1_NEPGR RecName: Full=Aspartic proteinase nepenthesin-1; AltName:
Full=Nepenthesin-I; Flags: Precursor
gi|41016421|dbj|BAD07474.1| aspartic proteinase nepenthesin I [Nepenthes gracilis]
Length = 437
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 100/379 (26%), Positives = 151/379 (39%), Gaps = 54/379 (14%)
Query: 34 FHHRYSDPVKGI-LAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFS-- 90
+HR+ V G + ++ + + + L R + L A N + + S
Sbjct: 30 LNHRHEAKVTGFQIMLEHVDSGKNLTKFQLLERAIERGSRRLQRLEAMLNGPSGVETSVY 89
Query: 91 AGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVI 150
AG+ Y +N +S+G PA F +DTGSDL W C C N S+
Sbjct: 90 AGDGEYLMN---------LSIGTPAQPFSAIMDTGSDLIW--TQCQPCTQCFNQST---- 134
Query: 151 DFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL 210
I++P SS+ S +PC+S LC+ + + C Y Y DG+ + G + + L
Sbjct: 135 --PIFNPQGSSSFSTLPCSSQLCQALSSPTCSNNFCQYTYGY-GDGSETQGSMGTETLTF 191
Query: 211 ATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFS 270
S S+ I+FGCG G F G GL G+G S+PS L FS
Sbjct: 192 G-----SVSIP-NITFGCGENNQG-FGQGNGA-GLVGMGRGPLSLPSQLD-----VTKFS 238
Query: 271 MCFGSDGTGRI------SFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEF 323
C G+ S + + G T PT Y IT+ +SVG + +
Sbjct: 239 YCMTPIGSSTPSNLLLGSLANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVGSTRLPIDP 298
Query: 324 SA------------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVL 371
SA I DSGT+ TY + AY + + F S +S F+ C+
Sbjct: 299 SAFALNSNNGTGGIIIDSGTTLTYFVNNAYQSVRQEFISQINLPVVNGSSS-GFDLCFQT 357
Query: 372 SPNQTNFEYPVVNLTMKGG 390
+ +N + P + GG
Sbjct: 358 PSDPSNLQIPTFVMHFDGG 376
>gi|50508279|dbj|BAD32128.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|125600536|gb|EAZ40112.1| hypothetical protein OsJ_24555 [Oryza sativa Japonica Group]
Length = 455
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 98/359 (27%), Positives = 152/359 (42%), Gaps = 65/359 (18%)
Query: 108 NVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
N+S+G P L F V +DTGS+L W C C C + + P SST S++
Sbjct: 94 NISLGTPPLDFPVIVDTGSNLIWAQCAPCTRCFP-------RPTPAPVLQPARSSTFSRL 146
Query: 167 PCNSTLCEL-----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
PCN + C+ + + +A + C Y Y S T G+L + L +
Sbjct: 147 PCNGSFCQYLPTSSRPRTCNATAACAYNYTYGSGYT--AGYLATETLTVG------DGTF 198
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD----G 277
+++FGC T + +D ++ G+ GLG S+ S LA FS C SD G
Sbjct: 199 PKVAFGC---STENGVDNSS--GIVGLGRGPLSLVSQLAV-----GRFSYCLRSDMADGG 248
Query: 278 TGRISFGDKGSPGQG---------ETPFSLRQTHPTYNIT-----ITQVSVGGNAVNFEF 323
I FG +G + P+ R TH N+T T++ V G+ F
Sbjct: 249 ASPILFGSLAKLTEGSVVQSTPLLKNPYLQRSTHYYVNLTGIAVDSTELPVTGSTFGFTQ 308
Query: 324 SA-----IFDSGTSFTYLNDPAYTQISETFNS-LAKEKRETSTSDLPF--EYCYVLSPNQ 375
+ I DSGT+ TYL Y + + F S +A + T S P+ + CY S
Sbjct: 309 TGLGGGTIVDSGTTLTYLAKDGYAMVKQAFQSQMANLNQTTPASGAPYDLDLCYKPSAGG 368
Query: 376 TNFEYPVVNLTMKGGGPFFVNDPI----VIVSSEPKG-LYLYCLGVVKSDN---VNIIG 426
V L ++ G N P+ V ++ +G + + CL V+ + + ++IIG
Sbjct: 369 GGKAVRVPRLALRFAGGAKYNVPVQNYFAGVEADSQGRVTVACLLVLPATDDLPISIIG 427
>gi|88174575|gb|ABD39362.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
Group]
Length = 321
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 86/344 (25%), Positives = 142/344 (41%), Gaps = 45/344 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+ T+V +G PA + IV +DTGS W+ C+C C H + + + S+T +
Sbjct: 1 YVTSVGLGTPAKTQIVEIDTGSSTSWVFCECDGC-H---------TNPRTFLQSRSTTCA 50
Query: 165 KVPCNSTLCELQKQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
KV C +++C L P +CP++V Y DG+ S G L +D L + +K
Sbjct: 51 KVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSY-QDGSASYGILYQDTLTFSDVQKIPS- 108
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC------- 272
+FGC G+ G +GL G+G SV L + FS C
Sbjct: 109 ----FTFGCNLDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPTFDGFSYCLPLQKSE 160
Query: 273 --FGSDGTGRISFGDKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFS---- 324
F S TG S G + + R+ + + + +SV G + S
Sbjct: 161 RGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSR 220
Query: 325 --AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
+FDSG+ +Y+ D A + +S+ L R + + CY + + P
Sbjct: 221 KGVVFDSGSELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRSVDEG-DMPA 277
Query: 383 VNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
++L G F + V V + ++CL +++V+IIG
Sbjct: 278 ISLHFDDGARFDLGRHGVFVERSVQEQDVWCLAFAPTESVSIIG 321
>gi|224081804|ref|XP_002306494.1| predicted protein [Populus trichocarpa]
gi|222855943|gb|EEE93490.1| predicted protein [Populus trichocarpa]
Length = 564
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 67/252 (26%), Positives = 109/252 (43%), Gaps = 29/252 (11%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
T + +G P F + +DTGS + ++PC C C + + P+ SST
Sbjct: 15 TRLWIGTPPQRFALIVDTGSSVTYVPCSSCEQCGRHQDPK---------FQPDLSSTYQS 65
Query: 166 VPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
V CN + C C Y+ +Y ++ + S+G L ED++ S R
Sbjct: 66 VKCN-----IDCNCDDEKQQCVYERQY-AEMSTSSGVLGEDIISFGN---LSALAPQRAV 116
Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGD 285
FGC ++TG A +G+ G+G S+ L ++G+I +SFS+C+G G G +
Sbjct: 117 FGCENMETGDLYSQHA-DGIMGMGRGDLSIVDHLVDKGVINDSFSLCYGGMGIGGGAMVL 175
Query: 286 KGSPGQGETPFSLRQ--THPTYNITITQVSVGGNAVNF-------EFSAIFDSGTSFTYL 336
G FS P YNI + ++ V G + + I DSGT++ YL
Sbjct: 176 GGISPPSNMVFSQSDPVRSPYYNIDLKEIHVAGKPLPLNPTVFDGKHGTILDSGTTYAYL 235
Query: 337 NDPAYTQISETF 348
+ A+ +
Sbjct: 236 PEAAFVSFKDAI 247
>gi|115484725|ref|NP_001067506.1| Os11g0215400 [Oryza sativa Japonica Group]
gi|77549255|gb|ABA92052.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113644728|dbj|BAF27869.1| Os11g0215400 [Oryza sativa Japonica Group]
Length = 428
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 88/345 (25%), Positives = 145/345 (42%), Gaps = 45/345 (13%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
L+ +V +G PA + IV +DTGS W+ C+C C H + + + S+T
Sbjct: 81 LYVISVGLGTPAKTQIVEIDTGSSTSWVFCECDGC-H---------TNPRTFLQSRSTTC 130
Query: 164 SKVPCNSTLCELQKQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
+KV C +++C L P +CP++V Y DG+ S G L +D L + +K
Sbjct: 131 AKVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSY-QDGSASYGILYQDTLTFSDVQKIPG 189
Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC------ 272
SFGC G+ G +GL G+G SV L + FS C
Sbjct: 190 -----FSFGCNMDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPTFDCFSYCLPLQKS 240
Query: 273 ---FGSDGTGRISFGDKGSPGQGE-TPFSLRQTH-PTYNITITQVSVGGNAVNFEFS--- 324
F S TG S G + T R+ + + + +T +SV G + S
Sbjct: 241 ERGFFSKTTGYFSLGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVFS 300
Query: 325 ---AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYP 381
+FDSG+ +Y+ D A + +S+ L ++ + + CY + + P
Sbjct: 301 RKGVVFDSGSELSYIPDRALSVLSQRIRELLLKR--GAAEEESERNCYDMRSVDEG-DMP 357
Query: 382 VVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
++L G F + V V + ++CL +++V+IIG
Sbjct: 358 AISLHFDDGARFDLGSHGVFVERSVQEQDVWCLAFAPTESVSIIG 402
>gi|356495496|ref|XP_003516613.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 645
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 85/324 (26%), Positives = 136/324 (41%), Gaps = 32/324 (9%)
Query: 111 VGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNS 170
+G P F + +DTGS + ++PC +C H S Q F P S T V C
Sbjct: 99 IGTPPQRFALIVDTGSTVTYVPCS--TCRH---CGSHQDPKFR---PEDSETYQPVKCT- 149
Query: 171 TLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGR 230
Q C + C Y+ RY ++ + S+G L EDV+ Q++ R FGC
Sbjct: 150 ----WQCNCDNDRKQCTYERRY-AEMSTSSGALGEDVVSFGN---QTELSPQRAIFGCEN 201
Query: 231 VQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPG 290
+TG + A +G+ GLG S+ L + +I +SFS+C+G G G + G
Sbjct: 202 DETGDIYNQRA-DGIMGLGRGDLSIMDQLVEKKVISDSFSLCYGGMGVGGGAMVLGGISP 260
Query: 291 QGETPFSLRQ--THPTYNITITQVSVGGNAVNF-------EFSAIFDSGTSFTYLNDPAY 341
+ F+ P YNI + ++ V G ++ + + DSGT++ YL + A+
Sbjct: 261 PADMVFTRSDPVRSPYYNIDLKEIHVAGKRLHLNPKVFDGKHGTVLDSGTTYAYLPESAF 320
Query: 342 TQISETFNSLAKEKRETSTSDLPF-EYCY---VLSPNQTNFEYPVVNLTMKGGGPFFVND 397
+ S D + + C+ + +Q + +PVV + G G
Sbjct: 321 LAFKHAIMKETHSLKRISGPDPRYNDICFSGAEIDVSQISKSFPVVEMVF-GNGHKLSLS 379
Query: 398 PIVIVSSEPKGLYLYCLGVVKSDN 421
P + K YCLGV + N
Sbjct: 380 PENYLFRHSKVRGAYCLGVFSNGN 403
>gi|308813706|ref|XP_003084159.1| Aspartyl protease (ISS) [Ostreococcus tauri]
gi|116056042|emb|CAL58575.1| Aspartyl protease (ISS) [Ostreococcus tauri]
Length = 478
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 88/345 (25%), Positives = 153/345 (44%), Gaps = 42/345 (12%)
Query: 115 ALSFIVALDTGSDLFWLPCD-CVSC-VHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTL 172
A +F + +DTGS +LPC C SC H +G+ D++ S+ S+V C S
Sbjct: 44 AQTFELIVDTGSSRTYLPCKGCASCGAH----EAGRYYDYD-----ASADFSRVEC-SAC 93
Query: 173 CELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQ 232
+ +C ++G C Y V YL +G+ S G+LV DV+ L ++ + FGC +
Sbjct: 94 AGIGGKCGTSGV-CRYDVHYL-EGSGSEGYLVRDVVSLG-----GSVGNATVVFGCEERE 146
Query: 233 TGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-------GSDGTGRISFGD 285
GS +A +GLFG G ++ + LA+ +I + FSMC G G ++ G+
Sbjct: 147 LGSIKQQSA-DGLFGFGRQAYALRAQLASASVIDDLFSMCVEGYEKLSGEHVGGLLTLGN 205
Query: 286 ----KGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA--IFDSGTSFTYLNDP 339
+P TP + + Y +T T ++G + V I DSGTS+TY+
Sbjct: 206 FDFGADAPALVYTP--MVSSAMYYQVTTTSWTLGNSVVEGSRGVLTIIDSGTSYTYVPGN 263
Query: 340 AYTQISETFNSLAKEKRETSTS------DLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPF 393
+ + + A+E + DL F L + + +P + + G
Sbjct: 264 MHARFLQLAEDAARESGLEKVAPPEDYPDLCFGNSGGLGWSTVSEYFPALKIEYHGSARL 323
Query: 394 FVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREYPIANNISLF 438
++ P + K +C+G+++ D+ I+ + + N + F
Sbjct: 324 TLS-PETYLYWHQKNASAFCVGILEHDDNRILLGQITMRNTFTEF 367
>gi|115467014|ref|NP_001057106.1| Os06g0209100 [Oryza sativa Japonica Group]
gi|51091210|dbj|BAD35903.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113595146|dbj|BAF19020.1| Os06g0209100 [Oryza sativa Japonica Group]
gi|125554496|gb|EAZ00102.1| hypothetical protein OsI_22105 [Oryza sativa Indica Group]
Length = 454
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 76/272 (27%), Positives = 110/272 (40%), Gaps = 41/272 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+ +VSVG P + LDTGSDL W C C+ + V+D P SST +
Sbjct: 90 YLMHVSVGTPPRPVALTLDTGSDLVW--TQCAPCLDCFEQGAAPVLD-----PAASSTHA 142
Query: 165 KVPCNSTLCELQKQCPSAG-----SNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
+PC++ LC G +C Y Y D +++ G L D D+
Sbjct: 143 ALPCDAPLCRALPFTSCGGRSWGDRSCVYVYHY-GDRSLTVGQLATDSFTFGGDDNAGGL 201
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS---- 275
R++FGCG + G F A G+ G G + S+PS L SFS CF S
Sbjct: 202 AARRVTFGCGHINKGIF--QANETGIAGFGRGRWSLPSQLNV-----TSFSYCFTSMFDT 254
Query: 276 DGTGRISFGDKGSP----------GQGETPFSLRQ-THPT-YNITITQVSVGGNAV---- 319
+ ++ G + G T ++ + P+ Y + + +SVGG V
Sbjct: 255 KSSSVVTLGAAAAELLHTHHAAHTGDVRTTRLIKNPSQPSLYFVPLRGISVGGARVAVPE 314
Query: 320 -NFEFSAIFDSGTSFTYLNDPAYTQISETFNS 350
S I DSG S T L + Y + F S
Sbjct: 315 SRLRSSTIIDSGASITTLPEDVYEAVKAEFVS 346
>gi|242092892|ref|XP_002436936.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
gi|241915159|gb|EER88303.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
Length = 469
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 87/303 (28%), Positives = 130/303 (42%), Gaps = 43/303 (14%)
Query: 112 GQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNST 171
G PA+ ++ +DTGSDL W+ C NSS+ ++ P+ SST + VPC S
Sbjct: 129 GTPAVPQVLLIDTGSDLSWVQC------QPCNSSTCYPQKDPVFDPSASSTYAPVPCGSE 182
Query: 172 LCE------LQKQC---PSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
C C S S C Y ++Y +G + G + L L+ ++ +V +
Sbjct: 183 ACRDLDPDSYANGCTNSSSGASLCQYGIQY-GNGDTTVGVYSTETLTLS---PEAATVVN 238
Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQ--GLIPNSFSMCF--GSDGT 278
SFGCG VQ G F + P L +Q G +FS C G+
Sbjct: 239 NFSFGCGLVQKGVFDLFDG-------LLGLGGAPESLVSQTTGTYGGAFSYCLPAGNSTA 291
Query: 279 GRISFGDKGSPGQGE-----TPFSLRQTHPTYNITITQVSVGGNAVNFEFSA-----IFD 328
G ++ G + G TP + +T Y + +T +SVGG ++ E + I D
Sbjct: 292 GFLALGAPATGGNNTAGFQFTPLQVVETT-FYLVKLTGISVGGKQLDIEPTVFAGGMIID 350
Query: 329 SGTSFTYLNDPAYTQISETFNS-LAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTM 387
SGT T L + AY+ + F S ++ D + CY + N TN P V LT
Sbjct: 351 SGTIVTGLPETAYSALRTAFRSAMSAYPLLPPNDDEDLDTCYDFTGN-TNVTVPTVALTF 409
Query: 388 KGG 390
+GG
Sbjct: 410 EGG 412
>gi|125533812|gb|EAY80360.1| hypothetical protein OsI_35532 [Oryza sativa Indica Group]
Length = 428
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 93/376 (24%), Positives = 154/376 (40%), Gaps = 56/376 (14%)
Query: 84 KTPLTFSAGNDTYRLNS----LGF-------LHYTNVSVGQPALSFIVALDTGSDLFWLP 132
+ PL N T RL++ +G+ L+ +V +G PA + IV +DTGS W+
Sbjct: 50 RIPLFRYISNKTSRLSTQAVQVGWDRGLQTSLYVISVGLGTPAKTQIVEIDTGSSTSWVF 109
Query: 133 CDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGS-----NCP 187
C+C C H + + + S+T +KV C +++C L P +CP
Sbjct: 110 CECDGC-H---------TNPRTFLQSRSTTCAKVSCGTSMCLLGGSDPHCQDSENYPDCP 159
Query: 188 YQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFG 247
++V Y DG+ S G L +D L + +K +FGC G+ G +GL G
Sbjct: 160 FRVSY-QDGSASYGILYQDTLTFSDVQKIPS-----FTFGCNLDSFGANEFGNV-DGLLG 212
Query: 248 LGMDKTSVPSILANQGLIPNSFSMC---------FGSDGTGRISFGDKGSPGQGE--TPF 296
+G SV L + FS C F S TG S G +
Sbjct: 213 MGAGPMSV---LKQSSPRFDGFSYCLPLQKSERGFFSKTTGYFSLGKVATRTDVRYTKMV 269
Query: 297 SLRQTHPTYNITITQVSVGGNAVNFEFS------AIFDSGTSFTYLNDPAYTQISETFNS 350
+ R+ + + + +SV G + S +FDSG+ +Y+ D A + +S+
Sbjct: 270 ARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDSGSELSYIPDRALSVLSQRIRE 329
Query: 351 LAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLY 410
L R + + CY + + P ++L G F + V V +
Sbjct: 330 LL--LRRGAAEEESERNCYDMRSVDEG-DMPAISLHFDDGARFDLGSHGVFVERSVQEQD 386
Query: 411 LYCLGVVKSDNVNIIG 426
++CL +++V+IIG
Sbjct: 387 VWCLAFAPTESVSIIG 402
>gi|449451908|ref|XP_004143702.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
gi|449529900|ref|XP_004171936.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 459
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 95/360 (26%), Positives = 144/360 (40%), Gaps = 54/360 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ ++ +G P S ++ DTGSDL W+ C C +C H SS+ + P SS+
Sbjct: 88 YFVDIRLGTPPQSLLLVADTGSDLVWVKCSACRNCSHHPPSSA--------FLPRHSSSF 139
Query: 164 SKVPCNSTLCELQKQCPSAGSN-------CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
S C C L P N C + Y +DG++S+GF ++ L +
Sbjct: 140 SPFHCFDPHCRLLPHAPHHLCNHTRLHSPCRFLYSY-ADGSLSSGFFSKETTTLKSLSGS 198
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPN---GLFGLGMDKTSVPSILANQGLIPNSFSMC- 272
+ +SFGCG +G + GA N G+ GLG S S L + N FS C
Sbjct: 199 EIHLKG-LSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFSSQLGRR--FGNKFSYCL 255
Query: 273 -----------FGSDGTGRISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGG---- 316
F G G S + TP + PT Y ITI +++ G
Sbjct: 256 MDYTLSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFYYITIHSITIDGVKLP 315
Query: 317 -NAVNFEFS------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP--FEY 367
N +E + DSGT+ TYL AY E S+ + + + ++L F+
Sbjct: 316 INPAVWEIDEQGNGGTVVDSGTTLTYLTKTAY---EEVLKSVRRRVKLPNAAELTPGFDL 372
Query: 368 CYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDN-VNIIG 426
C S P + + GGG F P +G+ + V+S N ++IG
Sbjct: 373 CVNASGESRRPSLPRLRFRL-GGGAVFAPPPRNYFLETEEGVMCLAIRAVESGNGFSVIG 431
>gi|302758122|ref|XP_002962484.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
gi|300169345|gb|EFJ35947.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
Length = 395
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 92/373 (24%), Positives = 162/373 (43%), Gaps = 57/373 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
++ + VG PA F + +DTGSDL W+ C+ + NSSS Y ++SS+
Sbjct: 27 YFVELRVGTPAKKFPLIIDTGSDLTWIQCNPPNTT--ANSSSPPA---PWYDKSSSSSYR 81
Query: 165 KVPCNSTLC-----ELQKQCP-SAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
++PC C + C + S C Y Y SD + +TG L + + + + ++ K
Sbjct: 82 EIPCTDDECLFLPAPIGSSCSIKSPSPCDYTYGY-SDQSRTTGILAYETISMKSRKRSGK 140
Query: 219 SVDSR---------ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSF 269
+ ++ GC R G+ GA+ G+ GLG S+ + + L F
Sbjct: 141 RAGNHKTRTIRIKNVALGCSRESVGASFLGAS--GVLGLGQGPISLATQTRHTAL-GGIF 197
Query: 270 SMCF-----GSDGTGRISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEF 323
S C GS+ + + G TP + Y + +T V+V G V+
Sbjct: 198 SYCLVDYLRGSNASSFLVMGRTRWRKLAHTPIVRNPAAQSFYYVNVTGVAVDGKPVDGIA 257
Query: 324 SA------------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP--FEYCY 369
S+ IFDSGT+ +YL +PAY+++ N+ R ++P FE CY
Sbjct: 258 SSDWGIDGDGNKGTIFDSGTTLSYLREPAYSKVLGALNASIYLPR---AQEIPEGFELCY 314
Query: 370 VLSPNQTNFE--YPVVNLTMKGGGPFFV--NDPIVIVSSEPKGLYLYCLGVVKSDNV--N 423
N T E P + + +GG + N+ +V+V+ + + L + N+ N
Sbjct: 315 ----NVTRMEKGMPKLGVEFQGGAVMELPWNNYMVLVAENVQCVALQKVTTTNGSNILGN 370
Query: 424 IIGREYPIANNIS 436
++ +++ I +++
Sbjct: 371 LLQQDHHIEYDLA 383
>gi|356532672|ref|XP_003534895.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 449
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 92/329 (27%), Positives = 140/329 (42%), Gaps = 58/329 (17%)
Query: 108 NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSS-- 164
N+S+GQP + +V +DTGSD+ W+ C C +C + L ++ P+ SST S
Sbjct: 104 NISIGQPPIPQLVVMDTGSDILWVMCTPCTNCDNHLGL---------LFDPSMSSTFSPL 154
Query: 165 -KVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
K PC+ C S P+ V Y + T S F + V+ TDE S+ D
Sbjct: 155 CKTPCDFKGC-------SRCDPIPFTVTYADNSTASGMFGRDTVVFETTDEGTSRIPD-- 205
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT----- 278
+ FGCG G D NG+ GL + P LA + I FS C G
Sbjct: 206 VLFGCGH-NIGQDTD-PGHNGILGL----NNGPDSLATK--IGQKFSYCIGDLADPYYNY 257
Query: 279 GRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVN-----FEFS------AIF 327
++ G+ TPF + Y +T+ +SVG ++ FE I
Sbjct: 258 HQLILGEGADLEGYSTPFEVHNGF--YYVTMEGISVGEKRLDIAPETFEMKKNRTGGVII 315
Query: 328 DSGTSFTYLNDPAYTQIS-ETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLT 386
D+G++ T+L D + +S E N L R+T+ P+ C+ S ++ +PVV
Sbjct: 316 DTGSTITFLVDSVHRLLSKEVRNLLGWSFRQTTIEKSPWMQCFYGSISRDLVGFPVVTFH 375
Query: 387 MKGG-------GPFF--VNDPIVIVSSEP 406
G G FF +ND + ++ P
Sbjct: 376 FADGADLALDSGSFFNQLNDNVFCMTVGP 404
>gi|414584783|tpg|DAA35354.1| TPA: hypothetical protein ZEAMMB73_186928 [Zea mays]
Length = 464
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 78/303 (25%), Positives = 124/303 (40%), Gaps = 43/303 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ V +G P + +D+GSD+ W+ C C+ C + ++ P TS+T
Sbjct: 127 YFVRVGIGSPPTEQYLVVDSGSDVIWVQCKPCLECYAQADP---------LFDPATSATF 177
Query: 164 SKVPCNSTLCELQKQ--CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
S VPC S +C + C +G C Y+V Y DG+ + G L + L L +
Sbjct: 178 SAVPCGSAVCRTLRTSGCGDSG-GCDYEVSY-GDGSYTKGALALETLTLGGTAVEG---- 231
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRI 281
++ GCG G F+ A GL GLG S+ L +FS C S G G +
Sbjct: 232 --VAIGCGHRNRGLFVGAA---GLLGLGWGPMSLVGQLGGA--AGGAFSYCLASRGAGSL 284
Query: 282 SFGDKGSPGQGE--TPFSLRQTHPT-YNITITQVSVGGNAVNFEF-----------SAIF 327
G + +G P P+ Y + ++ + VG + + +
Sbjct: 285 VLGRSEAVPEGAVWVPLVRNPQAPSFYYVGLSGIGVGDERLPLQEDLFQLTEDGAGGVVM 344
Query: 328 DSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLT 386
D+GT+ T L AY + + F ++ R S L + CY LS T+ P V+
Sbjct: 345 DTGTAVTRLPQEAYAALRDAFVAAVGALPRAPGVSLL--DTCYDLS-GYTSVRVPTVSFY 401
Query: 387 MKG 389
G
Sbjct: 402 FDG 404
>gi|356525748|ref|XP_003531485.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 436
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 89/335 (26%), Positives = 137/335 (40%), Gaps = 59/335 (17%)
Query: 111 VGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCN 169
+G P + + DT SDL W+ C C +C D ++ P+ SST + + C+
Sbjct: 96 IGTPPVERLAIADTASDLIWVQCSPCETCFPQ---------DTPLFEPHKSSTFANLSCD 146
Query: 170 STLCELQK--QCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFG 227
S C CP G+ C Y Y DG+ + G L + +H + Q+ + I FG
Sbjct: 147 SQPCTSSNIYYCPLVGNLCLYTNTY-GDGSSTKGVLCTESIHFGS---QTVTFPKTI-FG 201
Query: 228 CGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDGTGRISFG 284
CG G+ GLG S+ S L +Q I + FS C F S T ++ FG
Sbjct: 202 CGSNNDFMHQISNKVTGIVGLGAGPLSLVSQLGDQ--IGHKFSYCLLPFTSTSTIKLKFG 259
Query: 285 -DKGSPGQG--ETPFSLRQTHPTYN-ITITQVSVGGNAVNFEFS------AIFDSGTSFT 334
D G G TP + +P+Y + + +++G + + I D GT T
Sbjct: 260 NDTTITGNGVVSTPLIIDPHYPSYYFLHLVGITIGQKMLQVRTTDHTNGNIIIDLGTVLT 319
Query: 335 YLNDPAYTQISETFNSLAKEKRETSTSD----LPFEYCYVLSPNQTNFEYPVVNLTMKGG 390
YL Y F +L +E S + PF++C+ PNQ N +P + G
Sbjct: 320 YLEVNFY----HNFVTLLREALGISETKDDIPYPFDFCF---PNQANITFPKIVFQFTGA 372
Query: 391 GPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNII 425
F PK L+ + D++N+I
Sbjct: 373 KVFL----------SPKNLFF------RFDDLNMI 391
>gi|255548660|ref|XP_002515386.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545330|gb|EEF46835.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 387
Score = 79.3 bits (194), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 89/302 (29%), Positives = 132/302 (43%), Gaps = 38/302 (12%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CV-SCVHGLNSSSGQVIDFNIYSPNTSST 162
+ +++G P LS +ALDTGSD+ W C+ CV SC + + P SS+
Sbjct: 45 YLVKMALGTPKLSLSLALDTGSDITWTQCEPCVGSCYRQAQTK---------FDPRKSSS 95
Query: 163 SSKVPCNSTLCELQKQCPSA----GSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
V C+S+ C + A S C Y+V+Y DG+ S GF + L ++ +
Sbjct: 96 YKNVSCSSSSCRIITDSGGARGCVSSTCIYKVQY-GDGSYSVGFFATEKLTISPSD---- 150
Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGS 275
V S FGCG+ G F A G+ + + L N F+ C F S
Sbjct: 151 -VISNFLFGCGQQNAGRFGRIAGLL-----GLGRGKLSLALQTSEKYNNLFTYCLPSFSS 204
Query: 276 DGTGRISFGDKGSPGQGETPFS-LRQTHPTYNITITQVSVGGNAVNFEFS------AIFD 328
TG ++ G + TP S + P Y I I +SVGG+ + + S AI D
Sbjct: 205 SSTGHLTLGGQVPKSVKFTPLSPAFKNTPFYGIDIKGLSVGGHVLPIDASVFSNAGAIID 264
Query: 329 SGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMK 388
SGT T L Y+ +S F L K+ +T + + CY S N++ P ++ K
Sbjct: 265 SGTVITRLQPTVYSALSSKFQQLMKDYPKTDGFSI-LDTCYDFSGNES-ISVPRISFFFK 322
Query: 389 GG 390
GG
Sbjct: 323 GG 324
>gi|148908573|gb|ABR17396.1| unknown [Picea sitchensis]
Length = 350
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 88/341 (25%), Positives = 142/341 (41%), Gaps = 43/341 (12%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++T + +G P + LDTGSD+ W+ C+ C C + I++P++S +
Sbjct: 8 YFTRIGIGTPTREQYMVLDTGSDVVWIQCEPCRECYSQADP---------IFNPSSSVSF 58
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
S V C+S +C G C Y+V Y DG+ + G + L T Q+
Sbjct: 59 STVGCDSAVCSQLDANDCHGGGCLYEVSY-GDGSYTVGSYATETLTFGTTSIQN------ 111
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGTGR 280
++ GCG G F+ A GL + S P+ L Q +FS C S+ +G
Sbjct: 112 VAIGCGHDNVGLFVGAAGLLGLGAGSL---SFPAQLGTQ--TGRAFSYCLVDRDSESSGT 166
Query: 281 ISFGDKGSP-GQGETPFSLRQTHPT-YNITITQVSVGGNAVN------FEFSA------- 325
+ FG + P G TP PT Y +++ +SVGG ++ F
Sbjct: 167 LEFGPESVPIGSIFTPLVANPFLPTFYYLSMVAISVGGVILDSVPSEAFRIDETTGRGGI 226
Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNL 385
I DSGT+ T L AY + + F + + + F+ CY LS Q+ P V
Sbjct: 227 IIDSGTAVTRLQTSAYDALRDAFIAGTQHLPRADGISI-FDTCYDLSALQS-VSIPAVGF 284
Query: 386 TMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
G F + ++ + G + + S N++I+G
Sbjct: 285 HFSNGAGFILPAKNCLIPMDSMGTFCFAFAPADS-NLSIMG 324
>gi|226494448|ref|NP_001141341.1| uncharacterized protein LOC100273432 precursor [Zea mays]
gi|194704078|gb|ACF86123.1| unknown [Zea mays]
gi|413953775|gb|AFW86424.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 471
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 103/346 (29%), Positives = 155/346 (44%), Gaps = 45/346 (13%)
Query: 99 NSLGFLHY-TNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSP 157
S+G +Y T + +G P+ S+ + +DTGS L WL C CV + G + D P
Sbjct: 127 TSVGVGNYVTQLGLGTPSTSYAMVVDTGSSLTWL--QCSPCVVSCHRQVGPLFD-----P 179
Query: 158 NTSSTSSKVPCNSTLC-ELQKQC--PSAGSN---CPYQVRYLSDGTMSTGFLVEDVLHLA 211
SST + V C+++ C ELQ PSA S C YQ Y D + S G+L D +
Sbjct: 180 RASSTYTSVRCSASQCDELQAATLNPSACSASNVCIYQASY-GDSSFSVGYLSTDTVSFG 238
Query: 212 TDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSM 271
+ S +GCG+ G F A GL GL +K S+ LA + SFS
Sbjct: 239 STSYPS------FYYGCGQDNEGLFGRSA---GLIGLARNKLSLLYQLAPS--LGYSFSY 287
Query: 272 CFGSDG-TGRISFGDKGSPGQ--GETPFSLRQTHPT-YNITITQVSVGGNAVNF---EFS 324
C + TG +S G + G TP + + Y IT++ +SVGG+ + E+S
Sbjct: 288 CLPTAASTGYLSIGPYNT-GHYYSYTPMASSSLDASLYFITLSGMSVGGSPLAVSPSEYS 346
Query: 325 A---IFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEYCYVLSPNQTNFEY 380
+ I DSGT T L +T +S+ ++A +R + S L + C+ +Q
Sbjct: 347 SLPTIIDSGTVITRLPTAVHTALSKAVAQAMAGAQRAPAFSIL--DTCFEGQASQ--LRV 402
Query: 381 PVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
P V + GG + V++ + CL +D+ IIG
Sbjct: 403 PTVVMAFAGGASMKLTTRNVLIDVDDS---TTCLAFAPTDSTAIIG 445
>gi|297798978|ref|XP_002867373.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297313209|gb|EFH43632.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 434
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 95/348 (27%), Positives = 149/348 (42%), Gaps = 47/348 (13%)
Query: 99 NSLGFLHYTNVSVGQPALSFIVALDTGSDLFW---LPCDCVSCVHGLNSSSGQVIDFNIY 155
N FL N+S+G P + ++ +DTGSDL W LPC C Q I F +
Sbjct: 84 NPAAFL--ANISIGDPPVPQLLLIDTGSDLTWIQCLPCKCYP----------QTIPF--F 129
Query: 156 SPNTSSTSSKVPCNSTLCELQKQCPSAGS-NCPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
P+ SST C S + + + NC Y +RY D + + G L ++ L T +
Sbjct: 130 HPSRSSTYRNASCESAPHAMPQIFRDEKTGNCRYHLRY-RDFSNTRGILAKEKLTFQTSD 188
Query: 215 KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG 274
+ S I FGCG+ +G +G+ GLG S+ + N G + FS CFG
Sbjct: 189 EGLIS-KPNIVFGCGQDNSGF----TQYSGVLGLGPGTFSI--VTRNFG---SKFSYCFG 238
Query: 275 S--DGTGRISFGDKGSPGQGE---TPFSLRQTHPTYNITITQVSVGGNAVNFE------- 322
S D T +F G+ + E TP + Q Y + + +S+G ++ E
Sbjct: 239 SLIDPTYPHNFLILGNGARIEGDPTPLQIFQDR--YYLDLQAISLGEKLLDIEPGIFQRY 296
Query: 323 ---FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEK-RETSTSDLPFEYCYVLSPNQTNF 378
+ D+G S T L AY +SE + L E R + +CY + +
Sbjct: 297 RSKGGTVIDTGCSPTILAREAYETLSEEIDFLLGEVLRRVKDWEQYTNHCYEGNLKLDLY 356
Query: 379 EYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
+PVV GG ++ + VSSE + + + D++++IG
Sbjct: 357 GFPVVTFHFAGGAELALDVESLFVSSESGDSFCLAMTMNTFDDMSVIG 404
>gi|88174591|gb|ABD39370.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 86/344 (25%), Positives = 142/344 (41%), Gaps = 45/344 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+ T+V +G PA + IV +DTGS W+ C+C C H + + + S+T +
Sbjct: 1 YVTSVGLGTPAKTQIVEIDTGSSTSWVFCECDGC-H---------TNPRTFLQSRSTTCA 50
Query: 165 KVPCNSTLCELQKQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
KV C +++C L P +CP++V Y DG+ S G L +D L + +K
Sbjct: 51 KVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSY-QDGSASYGILYQDTLTFSDVQKIPS- 108
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC------- 272
+FGC G+ G +GL G+G SV L + FS C
Sbjct: 109 ----FTFGCNLDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPTFDGFSYCLPLQKSE 160
Query: 273 --FGSDGTGRISFGDKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFS---- 324
F S TG S G + + R+ + + + +SV G + S
Sbjct: 161 RGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSR 220
Query: 325 --AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
+FDSG+ +Y+ D A + +S+ L R + + CY + + P
Sbjct: 221 KGVVFDSGSELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRSVDEG-DMPA 277
Query: 383 VNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
++L G F + V V + ++CL +++V+IIG
Sbjct: 278 ISLHFDDGARFDLGIHGVFVERSVQEQDVWCLAFAPTESVSIIG 321
>gi|242053991|ref|XP_002456141.1| hypothetical protein SORBIDRAFT_03g031170 [Sorghum bicolor]
gi|241928116|gb|EES01261.1| hypothetical protein SORBIDRAFT_03g031170 [Sorghum bicolor]
Length = 519
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 102/400 (25%), Positives = 154/400 (38%), Gaps = 92/400 (23%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-----DCVSCVHGLNSSS------------- 146
++ VG PA F++ DTGSDL W+ C D + +G + +
Sbjct: 107 YFVRFRVGTPARPFLLVADTGSDLTWVKCHRHDHDAPAPGYGYAAPASNDSSTSSLSAAA 166
Query: 147 -GQVIDFNIYSPNTSSTSSKVPCNSTLCELQ-----KQCPSAGSNCPYQVRYLSDGTMST 200
++ P+ S T + +PC+S C CP+ GS C Y RY DG+ +
Sbjct: 167 ASSSSHARVFRPDRSRTWAPIPCSSDTCTASLPFSLAACPTPGSPCAYDYRY-KDGSAAR 225
Query: 201 GFLVEDVLHLA-----TDEKQSKSVDSRISFGCGRVQTG-SFLDGAAPNGLFGLGMDKTS 254
G + D +A +KQ ++ + GC TG SFL A +G+ LG S
Sbjct: 226 GTVGTDSATIALSGRGAKKKQRQAKLRGVVLGCTTSYTGDSFL---ASDGVLSLGYSNIS 282
Query: 255 VPSILANQGLIPNSFSMCF-----GSDGTGRISFGDKGSPGQGETPFS------------ 297
S A + FS C + T ++FG +P +P S
Sbjct: 283 FASRAAAR--FGGRFSYCLVDHLAPRNATSYLTFGP--NPAVSSSPPSKTACAGGGSPAA 338
Query: 298 -------LRQT--------HPTYNITITQVSVGGNAVNFEF---------SAIFDSGTSF 333
RQT P Y +T+ +SV G + AI DSGTS
Sbjct: 339 APPGPGGARQTPLLLDHRMRPFYAVTVNGISVDGELLRIPRLVWDVAKGGGAILDSGTSL 398
Query: 334 TYLNDPAYTQISETFN-SLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV--NLTMKGG 390
T L PAY + N LA R T PF+YCY + T + V L +
Sbjct: 399 TVLVSPAYRAVVAALNKKLAGLPRVTMD---PFDYCYNWTSPSTGEDLTVAMPELAVHFA 455
Query: 391 GPFFVNDPI--VIVSSEPKGLYLYCLGVVKSD--NVNIIG 426
G + P ++ + P + C+G+ + + V++IG
Sbjct: 456 GSARLQPPAKSYVIDAAPG---VKCIGLQEGEWPGVSVIG 492
>gi|116787367|gb|ABK24480.1| unknown [Picea sitchensis]
Length = 496
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 88/341 (25%), Positives = 142/341 (41%), Gaps = 43/341 (12%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++T + +G P + LDTGSD+ W+ C+ C C + I++P++S +
Sbjct: 154 YFTRIGIGTPTREQYMVLDTGSDVVWIQCEPCRECYSQADP---------IFNPSSSVSF 204
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
S V C+S +C G C Y+V Y DG+ + G + L T Q+
Sbjct: 205 STVGCDSAVCSQLDANDCHGGGCLYEVSY-GDGSYTVGSYATETLTFGTTSIQN------ 257
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGTGR 280
++ GCG G F+ A GL + S P+ L Q +FS C S+ +G
Sbjct: 258 VAIGCGHDNVGLFVGAAGLLGLGAGSL---SFPAQLGTQ--TGRAFSYCLVDRDSESSGT 312
Query: 281 ISFGDKGSP-GQGETPFSLRQTHPT-YNITITQVSVGGNAVN------FEFSA------- 325
+ FG + P G TP PT Y +++ +SVGG ++ F
Sbjct: 313 LEFGPESVPIGSIFTPLVANPFLPTFYYLSMVAISVGGVILDSVPSEAFRIDETTGRGGI 372
Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNL 385
I DSGT+ T L AY + + F + + + F+ CY LS Q+ P V
Sbjct: 373 IIDSGTAVTRLQTSAYDALRDAFIAGTQHLPRADGISI-FDTCYDLSALQS-VSIPAVGF 430
Query: 386 TMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
G F + ++ + G + + S N++I+G
Sbjct: 431 HFSNGAGFILPAKNCLIPMDSMGTFCFAFAPADS-NLSIMG 470
>gi|242035209|ref|XP_002464999.1| hypothetical protein SORBIDRAFT_01g030210 [Sorghum bicolor]
gi|241918853|gb|EER91997.1| hypothetical protein SORBIDRAFT_01g030210 [Sorghum bicolor]
Length = 107
Score = 79.0 bits (193), Expect = 5e-12, Method: Composition-based stats.
Identities = 42/70 (60%), Positives = 47/70 (67%), Gaps = 3/70 (4%)
Query: 228 CGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIP-NSFSMCFGSDGTGRISFGDK 286
CG TGSFLDG A NGL GLG +K SV +L GL+ +SFSMCF D GRI+FGD
Sbjct: 20 CG--PTGSFLDGGAFNGLMGLGKEKVSVAGMLTASGLVASDSFSMCFSEDVVGRINFGDA 77
Query: 287 GSPGQGETPF 296
G GQGE PF
Sbjct: 78 GIRGQGEMPF 87
>gi|125540928|gb|EAY87323.1| hypothetical protein OsI_08727 [Oryza sativa Indica Group]
Length = 463
Score = 79.0 bits (193), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 91/309 (29%), Positives = 133/309 (43%), Gaps = 38/309 (12%)
Query: 99 NSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWL---PCDCVSCVHGLNSSSGQVIDFNI 154
+SL L Y +V +G PA++ V +DTGSD+ W+ PC C ++ +G + D
Sbjct: 120 SSLDTLEYVISVGLGTPAVTQTVTIDTGSDVSWVQCNPCPNPPC----HAQTGALFD--- 172
Query: 155 YSPNTSSTSSKVPCNSTLC-ELQKQ---CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL 210
P SST V C + C +L++Q C + C Y V+Y DG+ + G D L L
Sbjct: 173 --PAKSSTYRAVSCAAAECAQLEQQGNGCGATNYECQYGVQY-GDGSTTNGTYSRDTLTL 229
Query: 211 ATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFS 270
+ K FGC +++G F D +GL GLG S+ S A NSFS
Sbjct: 230 SGASDAVKG----FQFGCSHLESG-FSD--QTDGLMGLGGGAQSLVSQTA--AAYGNSFS 280
Query: 271 MCF----GSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVN-----F 321
C GS G + G S +Q Y + ++VGG + F
Sbjct: 281 YCLPPTSGSSGFLTLGGGGGASGFVTTRMLRSKQIPTFYGARLQDIAVGGKQLGLSPSVF 340
Query: 322 EFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYP 381
++ DSGT T L AY+ +S F + K+ R + + C+ + QT P
Sbjct: 341 AAGSVVDSGTIITRLPPTAYSALSSAFKAGMKQYRSAPARSI-LDTCFDFA-GQTQISIP 398
Query: 382 VVNLTMKGG 390
V L GG
Sbjct: 399 TVALVFSGG 407
>gi|413953788|gb|AFW86437.1| hypothetical protein ZEAMMB73_618532 [Zea mays]
Length = 469
Score = 79.0 bits (193), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 91/312 (29%), Positives = 124/312 (39%), Gaps = 41/312 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+ V +G PA+ + LDTGS L W+ C C NSS ++ PNTSS+ S
Sbjct: 129 YVATVGLGTPAVPQTLILDTGSSLTWV--QCKPC----NSSQCYPQRLPLFDPNTSSSYS 182
Query: 165 KVPCNSTLCELQKQ------CPSAGS-NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
VPC+S C C S G C Y++ Y S G G D L L
Sbjct: 183 PVPCDSQECRALAAGIDGDGCTSDGDWGCAYEIHYGS-GATPAGEYSTDALTLG-----P 236
Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNS---FSMCFG 274
++ R FGCG Q D A +G+ GLG +P LA Q FS C
Sbjct: 237 GAIVKRFHFGCGHHQQRGKFDMA--DGVLGLG----RLPQSLAWQASARRGGGVFSHCLP 290
Query: 275 SDGTGRISFGDKGSPGQGE----TPFSLRQTHPT-YNITITQVSVGGNAVN-----FEFS 324
G F G+P TP P Y + T +SV G ++ F
Sbjct: 291 PTGVS-TGFLALGAPHDTSAFVFTPLLTMDDQPWFYQLMPTAISVAGQLLDIPPAVFREG 349
Query: 325 AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVN 384
I DSGT + L + AYT + F S A + + + C+ + N P V+
Sbjct: 350 VITDSGTVLSALQETAYTALRTAFRS-AMAEYPLAPPVGHLDTCFNFT-GYDNVTVPTVS 407
Query: 385 LTMKGGGPFFVN 396
LT +GG ++
Sbjct: 408 LTFRGGATVHLD 419
>gi|357143850|ref|XP_003573078.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 431
Score = 79.0 bits (193), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 90/329 (27%), Positives = 136/329 (41%), Gaps = 45/329 (13%)
Query: 45 ILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGND--TYRLNSLG 102
++ + L S Y AL H D L L + ++ L +G D + RL+S+
Sbjct: 15 LVLLTSLAVSASSGYRLALTHVDSKIGLTKTELMRRAAHRSRLRALSGYDANSPRLHSVQ 74
Query: 103 FLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
+ +++G P + F+ DTGSDL W C C C D +Y P+ SS
Sbjct: 75 VEYLMELAIGTPPVPFVALADTGSDLTWTQCQPCKLCFPQ---------DTPVYDPSASS 125
Query: 162 TSSKVPCNSTLC---ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
T S VPC+S C + C + S C Y Y SDG S G L + L L +
Sbjct: 126 TFSPVPCSSATCLPVLRSRNCSTPSSLCRYGYSY-SDGAYSAGILGTETLTLGSSVPGQA 184
Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC----FG 274
S ++FGCG G L+ G GLG S+LA G+ FS C F
Sbjct: 185 VSVSDVAFGCGTDNGGDSLNS---TGTVGLGRGTL---SLLAQLGV--GKFSYCLTDFFN 236
Query: 275 SDGTGRISFGDKG--SPGQG---ETPFSLRQTHPT-YNITITQVSVGGNAV-----NFEF 323
S G +PG G TP +P+ Y +++ +++G + F+
Sbjct: 237 STLDSPFLLGTLAELAPGPGAVQSTPLLQSPLNPSRYVVSLQGITLGDVRLPIPNKTFDL 296
Query: 324 SA------IFDSGTSFTYLNDPAYTQISE 346
A + DSGT+F+ L + + + +
Sbjct: 297 HANSTGGMVVDSGTTFSILPESGFRVVVD 325
>gi|15230868|ref|NP_189198.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|11994761|dbj|BAB03090.1| chloroplast nucleoid DNA binding protein-like; nucellin-like
protein [Arabidopsis thaliana]
gi|189339286|gb|ACD89063.1| At3g25700 [Arabidopsis thaliana]
gi|332643533|gb|AEE77054.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 452
Score = 79.0 bits (193), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 74/290 (25%), Positives = 116/290 (40%), Gaps = 63/290 (21%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ ++ +GQP S ++ DTGSDL W+ C C +C H ++ ++ P SST
Sbjct: 84 YFVDLRIGQPPQSLLLIADTGSDLVWVKCSACRNCSHHSPAT--------VFFPRHSSTF 135
Query: 164 SKVPCNSTLCELQKQCPSA--------GSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEK 215
S C +C L + A S C Y+ Y +DG++++G + L T
Sbjct: 136 SPAHCYDPVCRLVPKPDRAPICNHTRIHSTCHYEYGY-ADGSLTSGLFARETTSLKTSSG 194
Query: 216 QSKSVDSRISFGCGRVQTGSFLDGAA---PNGLFGLGMDKTSVPSILANQGLIPNSFSMC 272
+ + S ++FGCG +G + G + NG+ GLG S S L + N FS C
Sbjct: 195 KEARLKS-VAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQLGRR--FGNKFSYC 251
Query: 273 F-----------------GSDGTGRISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSV 314
G DG ++ F TP PT Y + + V V
Sbjct: 252 LMDYTLSPPPTSYLIIGNGGDGISKLFF----------TPLLTNPLSPTFYYVKLKSVFV 301
Query: 315 GGNAVNFEFS-----------AIFDSGTSFTYLNDPAYTQISETFNSLAK 353
G + + S + DSGT+ +L +PAY + K
Sbjct: 302 NGAKLRIDPSIWEIDDSGNGGTVVDSGTTLAFLAEPAYRSVIAAVRRRVK 351
>gi|148906646|gb|ABR16474.1| unknown [Picea sitchensis]
Length = 538
Score = 79.0 bits (193), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 81/287 (28%), Positives = 127/287 (44%), Gaps = 41/287 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++T + VG P + LDTGSD+ W+ C+ C C ++ I++P+ S++
Sbjct: 197 YFTRIGVGTPMREQYMVLDTGSDVVWIQCEPCSKCYSQVDP---------IFNPSLSASF 247
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
S + CNS +C G C Y+V Y DG+ + G ++L T ++
Sbjct: 248 STLGCNSAVCSYLDAYNCHGGGCLYKVSY-GDGSYTIGSFATEMLTFGTTSVRN------ 300
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG---SDGTGR 280
++ GCG G F+ A L GLG S PS L Q +FS C S+ +G
Sbjct: 301 VAIGCGHDNAGLFVGAAG---LLGLGAGLLSFPSQLGTQ--TGRAFSYCLVDRFSESSGT 355
Query: 281 ISFGDKGSP-GQGETPFSLRQTHPT-YNITITQVSVGGNAVN------FEFSA------- 325
+ FG + P G TP + PT Y + + +SVGG ++ F
Sbjct: 356 LEFGPESVPLGSILTPLLTNPSLPTFYYVPLISISVGGALLDSVPPDVFRIDETSGRGGF 415
Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLS 372
I DSGT+ T L P Y + + F + ++ + + F+ CY LS
Sbjct: 416 IVDSGTAVTRLQTPVYDAVRDAFVAGTRQLPKAEGVSI-FDTCYDLS 461
>gi|115434870|ref|NP_001042193.1| Os01g0178600 [Oryza sativa Japonica Group]
gi|55296112|dbj|BAD67831.1| putative CDR1 [Oryza sativa Japonica Group]
gi|55296252|dbj|BAD67993.1| putative CDR1 [Oryza sativa Japonica Group]
gi|113531724|dbj|BAF04107.1| Os01g0178600 [Oryza sativa Japonica Group]
gi|125569253|gb|EAZ10768.1| hypothetical protein OsJ_00604 [Oryza sativa Japonica Group]
Length = 454
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 93/346 (26%), Positives = 142/346 (41%), Gaps = 29/346 (8%)
Query: 97 RLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYS 156
++ S F + V++G P S + DTGSDL W V C G N +S +
Sbjct: 93 KVVSRSFEYLMTVNLGSPPRSMLAIADTGSDLVW-----VKCKKGNNDTSSAAAPTTQFD 147
Query: 157 PNTSSTSSKVPCNSTLCE-LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEK 215
P+ SST +V C + CE L + GSNC Y Y DG+ +TG L +
Sbjct: 148 PSRSSTYGRVSCQTDACEALGRATCDDGSNCAYLYAY-GDGSNTTGVLSTETFTFDDGGS 206
Query: 216 QSKSVDSR---ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC 272
R + FGC GSF +GL GLG S+ + L + FS C
Sbjct: 207 GRSPRQVRVGGVKFGCSTATAGSF----PADGLVGLGGGAVSLVTQLGGATSLGRRFSYC 262
Query: 273 F---GSDGTGRISFG---DKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA- 325
+ + ++FG D PG TP Y + + V VG V S+
Sbjct: 263 LVPHSVNASSALNFGALADVTEPGAASTPLVAGDVDTYYTVVLDSVKVGNKTVASAASSR 322
Query: 326 -IFDSGTSFTYLNDPAYTQISETFNSLAKEKR--ETSTSDLPFEYCYVLSPNQTNFEYPV 382
I DSGT+ T+L DP+ + + L++ + D + CY ++ + +
Sbjct: 323 IIVDSGTTLTFL-DPSL--LGPIVDELSRRITLPPVQSPDGLLQLCYNVAGREVEAGESI 379
Query: 383 VNLTMK-GGGPFFVNDPI-VIVSSEPKGLYLYCLGVVKSDNVNIIG 426
+LT++ GGG P V+ + L L + + V+I+G
Sbjct: 380 PDLTLEFGGGAAVALKPENAFVAVQEGTLCLAIVATTEQQPVSILG 425
>gi|357515001|ref|XP_003627789.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355521811|gb|AET02265.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 415
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 84/299 (28%), Positives = 123/299 (41%), Gaps = 54/299 (18%)
Query: 110 SVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPC 168
S+G P +DTGSDL WL C+ C C + I+ P+ SS+ +PC
Sbjct: 93 SIGTPPFKVFGFVDTGSDLVWLQCEPCKQCYPQITP---------IFDPSLSSSYQNIPC 143
Query: 169 NSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGC 228
S C + ++C VR G+L + L L + S S + GC
Sbjct: 144 LSDTCHSMRT-----TSC--DVR---------GYLSVETLTLDSTTGYSVSF-PKTMIGC 186
Query: 229 GRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG---SDGTGRISFGD 285
G TG+F +G+ GLG S+PS L I FS C G + T +++FGD
Sbjct: 187 GYRNTGTF--HGPSSGIVGLGSGPMSLPSQLGTS--IGGKFSYCLGPWLPNSTSKLNFGD 242
Query: 286 KG---SPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF--------EFSAIFDSGTSFT 334
G TP + Y +T+ SVG + F E + + DSGT+FT
Sbjct: 243 AAIVYGDGAMTTPIVKKDAQSGYYLTLEAFSVGNKLIEFGGPTYGGNEGNILIDSGTTFT 302
Query: 335 YLNDPAYTQISETFNSLAKEKRETSTSDLP---FEYCYVLSPNQTNFEYPVVNLTMKGG 390
+L Y + F S E + P F+ CY ++ + FE P++ KG
Sbjct: 303 FLPYDVYYR----FESAVAEYINLEHVEDPNGTFKLCYNVAYH--GFEAPLITAHFKGA 355
>gi|302817380|ref|XP_002990366.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
gi|300141928|gb|EFJ08635.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
Length = 420
Score = 78.6 bits (192), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 96/341 (28%), Positives = 142/341 (41%), Gaps = 45/341 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ + VG PA S + DTGSD+ WL C C C + I++P+ SS+
Sbjct: 81 YFARIGVGTPARSVYMVADTGSDVSWLQCSPCRKCYRQQDP---------IFNPSLSSSF 131
Query: 164 SKVPCNSTLC-ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
+ C S++C +L+ + S + C YQV Y DG+ + G + L +S
Sbjct: 132 KPLACASSICGKLKIKGCSRKNECMYQVSY-GDGSFTVGDFSTETLSFGEHAVRS----- 185
Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGTG 279
++ GCGR G F A L GLG S PS + FS C S
Sbjct: 186 -VAMGCGRNNQGLFHGAAG---LLGLGRGPLSFPSQTGTS--YASVFSYCLPRRESAIAA 239
Query: 280 RISFGDKGSPGQGETPFSL--RQTHPTYNITITQVSVGGNAVNFEFSA-----------I 326
+ FG P + L R+ Y + + ++ V G+ VN A I
Sbjct: 240 SLVFGPSAVPEKARFTKLLPNRRLDTYYYVGLARIRVAGSPVNIPPDAFAMGSRGTGGVI 299
Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLT 386
DSGT+ + L PAYT + + F SL S F+ CY LS +T P V L
Sbjct: 300 VDSGTAISRLTTPAYTALRDAFRSLVTFPSAPGIS--LFDTCYDLSSMKTA-TLPAVVLD 356
Query: 387 MKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNV-NIIG 426
GG + ++V+ + +G YCL + +IIG
Sbjct: 357 FDGGASMPLPADGILVNVDDEG--TYCLAFAPEEEAFSIIG 395
>gi|326501496|dbj|BAK02537.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 78.6 bits (192), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 89/371 (23%), Positives = 140/371 (37%), Gaps = 64/371 (17%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
++ VG PA F++ DTGSDL W+ C + NSS + P S T +
Sbjct: 94 YFVRFRVGTPAQPFLLVADTGSDLTWVKCRRPAA----NSSESGSGSGRAFRPEDSRTWA 149
Query: 165 KVPCNSTLCELQ-----KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
+ C S C CP+ GS C Y RY DG+ + G + + +A + +
Sbjct: 150 PISCASDTCTKSLPFSLATCPTPGSPCAYDYRY-KDGSAARGTVGTESATIALSGRGREE 208
Query: 220 VDSRIS---FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--- 273
+++ GC TG + +G+ LG S S A++ FS C
Sbjct: 209 RKAKLKGLVLGCTSSYTGPSFE--VSDGVLSLGYSDVSFASHAASR--FAGRFSYCLVDH 264
Query: 274 --GSDGTGRISFGDK-----------------------GSPGQGETPFSL-RQTHPTYNI 307
+ T ++FG P +TP L R+ P Y++
Sbjct: 265 LSPRNATSYLTFGPNPAVASSSSPSSPAPASCTAAAPRPRPRARQTPLLLDRRMRPFYDV 324
Query: 308 TITQVSVGGNAVNFEFS---------AIFDSGTSFTYLNDPAYTQISETFNS-LAKEKRE 357
+ VSV G + + I DSGTS T L PAY + + LA R
Sbjct: 325 AVKAVSVAGQFLKIPRAVWDVDAGGGVILDSGTSLTVLAKPAYRAVVAALSEGLAGLPRV 384
Query: 358 TSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVV 417
T PFEYCY + + P + + G ++ + P + C+G+
Sbjct: 385 TMD---PFEYCYNWTSPSGDVTLPKMAVHFAGAARLEPPGKSYVIDAAPG---VKCIGLQ 438
Query: 418 KS--DNVNIIG 426
+ +++IG
Sbjct: 439 EGPWPGISVIG 449
>gi|88174573|gb|ABD39361.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
Group]
Length = 321
Score = 78.6 bits (192), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 85/344 (24%), Positives = 142/344 (41%), Gaps = 45/344 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+ T+V +G P+ + IV +DTGS W+ C+C C H + + + S+T +
Sbjct: 1 YVTSVGLGTPSKTQIVEIDTGSSTSWVFCECDGC-H---------TNPRTFLQSRSTTCA 50
Query: 165 KVPCNSTLCELQKQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
KV C +++C L P +CP++V Y DG+ S G L +D L + +K
Sbjct: 51 KVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSY-QDGSASYGILYQDTLTFSDVQKIPS- 108
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC------- 272
+FGC G+ G +GL G+G SV L + FS C
Sbjct: 109 ----FTFGCNLDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPTFDGFSYCLPLQKSE 160
Query: 273 --FGSDGTGRISFGDKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFS---- 324
F S TG S G + + R+ + + + +SV G + S
Sbjct: 161 RGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSR 220
Query: 325 --AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
+FDSG+ +Y+ D A + +S+ L R + + CY + + P
Sbjct: 221 KGVVFDSGSELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRSVDEG-DMPA 277
Query: 383 VNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
++L G F + V V + ++CL +++V+IIG
Sbjct: 278 ISLHFDDGARFDLGSRGVFVERSVQEQDVWCLAFAPTESVSIIG 321
>gi|449454652|ref|XP_004145068.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449470630|ref|XP_004153019.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 435
Score = 78.6 bits (192), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 78/318 (24%), Positives = 139/318 (43%), Gaps = 39/318 (12%)
Query: 109 VSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
+SVG P I DTGSD+ W C+ C +C D +++P+ S+T KV
Sbjct: 89 LSVGTPPFPIIAVADTGSDIIWTQCEPCTNCYQQ---------DLPMFNPSKSTTYRKVS 139
Query: 168 CNSTLCEL--QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
C+S +C + S +C Y + Y D + S G D L + + + + R +
Sbjct: 140 CSSPVCSFTGEDNSCSFKPDCTYSISY-GDNSHSQGDFAVDTLTMGSTSGRVVAF-PRTA 197
Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSD--GTGR 280
GCG GSF A +G+ GLG+ S+ + + + FS C G+D G+ +
Sbjct: 198 IGCGHDNAGSF--DANVSGIVGLGLGPASLIKQMGSA--VGGKFSYCLTPIGNDDGGSNK 253
Query: 281 ISFGDKGS---PGQGETPFSLRQTHPT-YNITITQVSVGGNAVNF---------EFSAIF 327
++FG + G TP + + Y++ + VSVG N + + + I
Sbjct: 254 LNFGSNANVSGSGAVSTPIYISDKFKSFYSLKLKAVSVGRNNTFYSTANSILGGKANIII 313
Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTM 387
DSGT+ T L Y ++ ++ +R + EYC+ + + +++ P + +
Sbjct: 314 DSGTTLTLLPVDLYHNFAKAISNSINLQRTDDPNQF-LEYCFETTTD--DYKVPFIAMHF 370
Query: 388 KGGGPFFVNDPIVIVSSE 405
+G + ++I S+
Sbjct: 371 EGANLRLQRENVLIRVSD 388
>gi|326516344|dbj|BAJ92327.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 464
Score = 78.6 bits (192), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 91/342 (26%), Positives = 137/342 (40%), Gaps = 55/342 (16%)
Query: 84 KTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLN 143
+ P+T A RL +L ++ + G+ V +DT S+L W+ C+ H
Sbjct: 99 QVPVTSGA-----RLRTLNYVATVGIGGGEAT----VIVDTASELTWVQCEPCDACHDQQ 149
Query: 144 SSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQK--------QCPSAGSNCPYQVRYLSD 195
++ P++S + + VPCNS+ C+ + C + C Y + Y D
Sbjct: 150 EP--------LFDPSSSPSYAAVPCNSSSCDALRVATGMSGQACDDQPAACSYTLSY-RD 200
Query: 196 GTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSV 255
G+ S G L D L LA ++ Q FGCG G F +GL GLG + S+
Sbjct: 201 GSYSRGVLAHDRLSLAGEDIQG------FVFGCGTSNQGPF---GGTSGLMGLGRSQLSL 251
Query: 256 PSILANQGLIPNSFSMCF---GSDGTGRISFGDKGSPGQGETPF------SLRQTHPTYN 306
S +Q FS C S +G + GD S + TP S P Y
Sbjct: 252 ISQTMDQ--FGGVFSYCLPPKESGSSGSLVLGDDASVYRNSTPIVYTAMVSDPLQGPFYL 309
Query: 307 ITITQVSVGGNAVNFE-FS------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETS 359
+T ++VGG V FS AI DSGT T L Y + F S E + +
Sbjct: 310 ANLTGITVGGEDVQSPGFSAGGGGKAIVDSGTIITSLVPSVYAAVRAEFVSQLAEYPQAA 369
Query: 360 TSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVI 401
+ + C+ L+ + P + L GG V+ V+
Sbjct: 370 PFSI-LDTCFDLT-GLREVQVPSLKLVFDGGAEVEVDSKGVL 409
>gi|297805036|ref|XP_002870402.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316238|gb|EFH46661.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 435
Score = 78.6 bits (192), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 91/359 (25%), Positives = 142/359 (39%), Gaps = 41/359 (11%)
Query: 50 DLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNV 109
D PK + + R R R + D + + S + + G + N+
Sbjct: 39 DSPKSPFYNPAETPSQRIRNAIHRSFNRVSHFTDLSEMDASLNSPQTDITPCGGEYLMNL 98
Query: 110 SVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPC 168
S+G P + DTGS+L W C C C ++ ++ P SST V C
Sbjct: 99 SLGTPPSPIMAVADTGSNLIWTQCKPCDDCYTQVDP---------LFDPKASSTYKDVSC 149
Query: 169 NSTLC---ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
+S+ C E Q C + C Y V Y +DG+ + G D L L + + + + + I
Sbjct: 150 SSSQCTALENQASCSTEDKTCSYLVSY-ADGSYTMGKFAVDTLTLGSTDNRPVQLKNII- 207
Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQG-LIPNSFSMCF--GSDGTGRIS 282
GCG+ +F N G+ S++ G I FS C +D T +I+
Sbjct: 208 IGCGQNNAVTFR-----NKSSGVVGLGGGAVSLIKQLGDSIDGKFSYCLVPENDQTSKIN 262
Query: 283 FGDKG---SPGQGETPFSLRQTHPTYNITITQVSVGGNAV-----NFEFSAIFDSGTSFT 334
FG PG TP ++ Y +T+ +SVG + N + + + DSGT+ T
Sbjct: 263 FGTNAVVSGPGTVSTPLVVKSRDTFYYLTLKSISVGSKNMQTPDSNIKGNMVIDSGTTLT 322
Query: 335 YLNDPAYTQISETFNSLA---KEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGG 390
L Y +I SL K K E S L CY + + PV+ + +G
Sbjct: 323 LLPVKYYIEIENAVASLINADKSKDERIGSSL----CYNAT---ADLNIPVITMHFEGA 374
>gi|357514989|ref|XP_003627783.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521805|gb|AET02259.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 416
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 82/297 (27%), Positives = 123/297 (41%), Gaps = 54/297 (18%)
Query: 110 SVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCN 169
SVG P DTGSD+ WL C+ C N ++ + + P+ SST +PC+
Sbjct: 92 SVGTPPFKLYGIADTGSDIVWLQCE--PCKECYNQTTPK------FKPSKSSTYKNIPCS 143
Query: 170 STLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCG 229
S LC+ +Q G L D L L + S + GCG
Sbjct: 144 SDLCKSGQQ----------------------GNLSVDTLTLESSTGHPISFPKTV-IGCG 180
Query: 230 RVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-----GSDGTGRISFG 284
T SF +GA+ +G+ GLG S+ + L + I FS C S+ T +++FG
Sbjct: 181 TDNTVSF-EGAS-SGIVGLGGGPASLITQLGSS--IDAKFSYCLLPNPVESNTTSKLNFG 236
Query: 285 DKG---SPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA--------IFDSGTSF 333
D G TP + Y +T+ SVG + FE S+ I DSGT+
Sbjct: 237 DTAVVSGDGVVSTPIVKKDPIVFYYLTLEAFSVGNKRIEFEGSSNGGHEGNIIIDSGTTL 296
Query: 334 TYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGG 390
T + Y + L K KR + L F CY ++ + +++P++ KG
Sbjct: 297 TVIPTDVYNNLESAVLELVKLKRVNDPTRL-FNLCYSVTSD--GYDFPIITTHFKGA 350
>gi|302795261|ref|XP_002979394.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
gi|300153162|gb|EFJ19802.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
Length = 353
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 96/341 (28%), Positives = 142/341 (41%), Gaps = 45/341 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ + VG PA S + DTGSD+ WL C C C + I++P+ SS+
Sbjct: 14 YFARIGVGTPARSVYMVADTGSDVSWLQCSPCRKCYRQQDP---------IFNPSLSSSF 64
Query: 164 SKVPCNSTLC-ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
+ C S++C +L+ + S + C YQV Y DG+ + G + L +S
Sbjct: 65 KPLACASSICGKLKIKGCSRKNKCMYQVSY-GDGSFTVGDFSTETLSFGEHAVRS----- 118
Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGTG 279
++ GCGR G F A L GLG S PS + FS C S
Sbjct: 119 -VAMGCGRNNQGLFHGAAG---LLGLGRGPLSFPSQTGTS--YASVFSYCLPRRESAIAA 172
Query: 280 RISFGDKGSPGQGETPFSL--RQTHPTYNITITQVSVGGNAVNFEFSA-----------I 326
+ FG P + L R+ Y + + ++ V G+ VN A I
Sbjct: 173 SLVFGPSAVPEKARFTKLLPNRRLDTYYYVGLARIRVAGSPVNIPPDAFAMGSRGTGGVI 232
Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLT 386
DSGT+ + L PAYT + + F SL S F+ CY LS +T P V L
Sbjct: 233 VDSGTAISRLTTPAYTALRDAFRSLVTFPSAPGISL--FDTCYDLSSMKTA-TLPAVVLD 289
Query: 387 MKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNV-NIIG 426
GG + ++V+ + +G YCL + +IIG
Sbjct: 290 FDGGASMPLPADGILVNVDDEG--TYCLAFAPEEEAFSIIG 328
>gi|88174569|gb|ABD39359.1| chloroplast nucleoid DNA-binding protein [Oryza nivara]
Length = 321
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 85/344 (24%), Positives = 141/344 (40%), Gaps = 45/344 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+ +V +G PA + IV +DTGS W+ C+C C H + + + S+T +
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTTWVFCECDGC-H---------TNPRTFLQSRSTTCA 50
Query: 165 KVPCNSTLCELQKQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
KV C +++C L P +CP++V Y DG+ S G L +D L + +K
Sbjct: 51 KVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSY-QDGSASYGILYQDTLTFSDVQKIPS- 108
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC------- 272
+FGC G+ G +GL G+G SV L + FS C
Sbjct: 109 ----FTFGCNLDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPTFDGFSYCLPLQKSE 160
Query: 273 --FGSDGTGRISFGDKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFS---- 324
F S TG S G + + R+ + + + +SV G + S
Sbjct: 161 RGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSR 220
Query: 325 --AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
+FDSG+ +Y+ D A + +S+ L R + + CY + + P
Sbjct: 221 KGVVFDSGSELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRSVDEG-DMPA 277
Query: 383 VNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
++L G F + V V + ++CL +++V+IIG
Sbjct: 278 ISLHFDDGARFDLGSRGVFVERSVQEQDVWCLAFAPTESVSIIG 321
>gi|88174589|gb|ABD39369.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 85/344 (24%), Positives = 141/344 (40%), Gaps = 45/344 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+ +V +G PA + IV +DTGS W+ C+C C H + + + S+T +
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSASWVFCECDGC-H---------TNPRTFLQSRSTTCA 50
Query: 165 KVPCNSTLCELQKQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
KV C +++C L P +CP++V Y DG+ S G L +D L + +K
Sbjct: 51 KVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSY-QDGSASYGILYQDTLTFSDVQKIPS- 108
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC------- 272
+FGC G+ G +GL G+G SV L + FS C
Sbjct: 109 ----FTFGCNLDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPTFDGFSYCLPLQKSE 160
Query: 273 --FGSDGTGRISFGDKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFS---- 324
F S TG S G + + R+ + + + +SV G + S
Sbjct: 161 RGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSR 220
Query: 325 --AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
+FDSG+ +Y+ D A + +S+ L R + + CY + + P
Sbjct: 221 KGVVFDSGSELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRSVDEG-DMPA 277
Query: 383 VNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
++L G F + V V + ++CL +++V+IIG
Sbjct: 278 ISLHFDDGARFDLGSHGVFVERSVQEQDVWCLAFAPTESVSIIG 321
>gi|242086412|ref|XP_002443631.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
gi|241944324|gb|EES17469.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
Length = 507
Score = 78.2 bits (191), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 103/362 (28%), Positives = 152/362 (41%), Gaps = 60/362 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ ++VG PA+ ++ALDT SDL WL C C C SG V D P S++
Sbjct: 141 YIAKIAVGTPAVEALLALDTASDLTWLQCQPCRRCY----PQSGPVFD-----PRHSTSY 191
Query: 164 SKVPCNSTLCELQKQCPSAGSN---CPYQVRYLSDG------TMSTGFLVEDVLHLATDE 214
++ ++ C+ + + C Y V Y DG + S G LVE+ L A
Sbjct: 192 GEMNYDAPDCQALGRSGGGDAKRGTCIYTVLY-GDGDGHGSTSTSVGDLVEETLTFAGGV 250
Query: 215 KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF- 273
+Q+ +S GCG G F GA G+ GL + S+P +A G SFS C
Sbjct: 251 RQAY-----LSIGCGHDNKGLF--GAPAAGILGLSRGQISIPHQIAFLGY-NASFSYCLV 302
Query: 274 ------GSDGTGRISFGDKG---SPGQGETPFSLRQTHPT-YNITITQVSVGGNAV---- 319
GS + ++FG SP TP L Q PT Y + + VSVGG V
Sbjct: 303 DFISGPGSP-SSTLTFGAGAVDTSPPASFTPTVLNQNMPTFYYVRLIGVSVGGVRVPGVT 361
Query: 320 ---------NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETST---SDLPFEY 367
I DSGT+ T L PAYT + F + A + ST S L F+
Sbjct: 362 ERDLQLDPYTGHGGVILDSGTTVTRLARPAYTAFRDAFRAAATGLGQVSTGGPSGL-FDT 420
Query: 368 CYVLSPN---QTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNI 424
CY + + + P V++ GG + +++ + +G + +V++
Sbjct: 421 CYTVGGRAGLRHCVKVPAVSMHFAGGVELSLQPKNYLITVDSRGTVCFAFAGTGDRSVSV 480
Query: 425 IG 426
IG
Sbjct: 481 IG 482
>gi|224096119|ref|XP_002310541.1| predicted protein [Populus trichocarpa]
gi|222853444|gb|EEE90991.1| predicted protein [Populus trichocarpa]
Length = 379
Score = 78.2 bits (191), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 83/276 (30%), Positives = 123/276 (44%), Gaps = 44/276 (15%)
Query: 102 GFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-----CVSCVHGLNSSSGQVIDFNIYS 156
GF + T +++GQP+ + + +DTGSDL WL CD C H Y
Sbjct: 18 GFYNVT-LNIGQPSKPYFLDVDTGSDLTWLQCDVPRAQCTEAPH------------PYYK 64
Query: 157 PNTSSTSSKVP-CNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLA-TDE 214
P+ + + K P C S ++C + G C Y+V Y +DG S G LV+D +L T E
Sbjct: 65 PSNNLVACKDPICQSLHTGGDQRCENPG-QCDYEVEY-ADGGSSLGVLVKDAFNLNFTSE 122
Query: 215 KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG 274
K+ + + G ++ G++ +G+ GLG K S+ S L+ GL+ N C
Sbjct: 123 KRQSPLLALGLCGYDQLPGGTY---HPIDGVLGLGRGKPSIVSQLSGLGLVRNVIGHCL- 178
Query: 275 SDGTGRISFGDK------GSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE-FSAIF 327
+GR S TP S H Y+ +++ G F+ F
Sbjct: 179 ---SGRGGGFLFFGDDLYDSSRVAWTPMSPNAKH--YSPGFAELTFDGKTTGFKNLIVAF 233
Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDL 363
DSG S+TYLN +Q+ + SL KRE ST L
Sbjct: 234 DSGASYTYLN----SQVYQGLISLI--KRELSTKPL 263
>gi|88174597|gb|ABD39373.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
gi|88174601|gb|ABD39375.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
gi|88174603|gb|ABD39376.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 78.2 bits (191), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 85/344 (24%), Positives = 141/344 (40%), Gaps = 45/344 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+ +V +G PA + IV +DTGS W+ C+C C H + + + S+T +
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCECDGC-H---------TNPRTFLQSRSTTCA 50
Query: 165 KVPCNSTLCELQKQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
KV C +++C L P +CP++V Y DG+ S G L +D L + +K
Sbjct: 51 KVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSY-QDGSASYGILYQDTLTFSDVQKIPS- 108
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC------- 272
+FGC G+ G +GL G+G SV L + FS C
Sbjct: 109 ----FTFGCNLDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPTFDGFSYCLPLQKSE 160
Query: 273 --FGSDGTGRISFGDKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFS---- 324
F S TG S G + + R+ + + + +SV G + S
Sbjct: 161 RGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSR 220
Query: 325 --AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
+FDSG+ +Y+ D A + +S+ L R + + CY + + P
Sbjct: 221 KGVVFDSGSELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRSVDEG-DMPA 277
Query: 383 VNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
++L G F + V V + ++CL +++V+IIG
Sbjct: 278 ISLHFDDGARFDLGSHGVFVERSVQEQDVWCLAFAPTESVSIIG 321
>gi|115438214|ref|NP_001043485.1| Os01g0598600 [Oryza sativa Japonica Group]
gi|15290061|dbj|BAB63755.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
Group]
gi|113533016|dbj|BAF05399.1| Os01g0598600 [Oryza sativa Japonica Group]
gi|125526702|gb|EAY74816.1| hypothetical protein OsI_02707 [Oryza sativa Indica Group]
Length = 500
Score = 78.2 bits (191), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 92/352 (26%), Positives = 139/352 (39%), Gaps = 53/352 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++T + VG P ++ LDTGSD+ WL C C C SGQ+ D P S +
Sbjct: 147 YFTKIGVGTPVTPALMVLDTGSDVVWLQCAPCRRCY----DQSGQMFD-----PRASHSY 197
Query: 164 SKVPCNSTLCELQKQ--CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
V C + LC C C YQV Y DG+++ G + L A+ +
Sbjct: 198 GAVDCAAPLCRRLDSGGCDLRRKACLYQVAY-GDGSVTAGDFATETLTFASGARV----- 251
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-------- 273
R++ GCG G F+ A GL S PS ++ + SFS C
Sbjct: 252 PRVALGCGHDNEGLFVAAAGLLGLG---RGSLSFPSQISRR--FGRSFSYCLVDRTSSSA 306
Query: 274 -GSDGTGRISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGGNAV--------- 319
+ + ++FG F+ +P Y + + +SVGG V
Sbjct: 307 SATSRSSTVTFGSGAVGPSAAASFTPMVKNPRMETFYYVQLMGISVGGARVPGVAVSDLR 366
Query: 320 ----NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQ 375
I DSGTS T L PAY + + F + A R + F+ CY LS +
Sbjct: 367 LDPSTGRGGVIVDSGTSVTRLARPAYAALRDAFRAAAAGLRLSPGGFSLFDTCYDLSGLK 426
Query: 376 TNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD-NVNIIG 426
+ P V++ GG + ++ + +G +C +D V+IIG
Sbjct: 427 V-VKVPTVSMHFAGGAEAALPPENYLIPVDSRG--TFCFAFAGTDGGVSIIG 475
>gi|222635451|gb|EEE65583.1| hypothetical protein OsJ_21095 [Oryza sativa Japonica Group]
Length = 441
Score = 78.2 bits (191), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 108/401 (26%), Positives = 161/401 (40%), Gaps = 62/401 (15%)
Query: 62 ALAHRDRYFRLRGRGLAAQGND----KTPLTFSAGNDT----YRLNSLGFLHYT-NVSVG 112
+LA R R R R + + T L+ +AG T + +S+ L Y + +G
Sbjct: 39 SLAERLRRDRARTNYIVTKATGGRTAATALSDAAGGGTSIPTFLGDSVNSLEYVVTLGIG 98
Query: 113 QPALSFIVALDTGSDLFWL---PCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCN 169
PA+ V +DTGSDL W+ PC C + ++ P++SS+ + VPC+
Sbjct: 99 TPAVQQTVLIDTGSDLSWVQCKPCGAGECYAQKDP---------LFDPSSSSSYASVPCD 149
Query: 170 STLCELQKQCP----------SAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
S C A + C Y + Y + T +TG + L L +
Sbjct: 150 SDACRKLAAGAYGHGCTGVSGGAAALCEYGIEYGNRAT-TTGVYSTETLTL-----KPGV 203
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTG 279
V + FGCG Q G + +GL GLG S+ S ++Q P S+ + S G G
Sbjct: 204 VVADFGFGCGDHQHGPYEKF---DGLLGLGGAPESLVSQTSSQFGGPFSYCLPPTSGGAG 260
Query: 280 RISFG-------DKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA-----I 326
++ G + G TP + PT Y +T+T +SVGG + SA +
Sbjct: 261 FLTLGAPPNSSSSTAASGLSFTPMRRLPSVPTFYIVTLTGISVGGAPLAIPPSAFSSGMV 320
Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP-FEYCYVLSPNQTNFEYPVVNL 385
DSGT T L AY + F S E R S+ + CY + N P ++L
Sbjct: 321 IDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNGGVLDTCYDFT-GHANVTVPTISL 379
Query: 386 TMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
T GG + P + L CL + N IG
Sbjct: 380 TFSGGATIDLAAPAGV-------LVDGCLAFAGAGTDNAIG 413
>gi|116787398|gb|ABK24493.1| unknown [Picea sitchensis]
Length = 479
Score = 78.2 bits (191), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 104/375 (27%), Positives = 148/375 (39%), Gaps = 57/375 (15%)
Query: 82 NDKTPLTFSAGNDTYRLNS---------LGFLHY-TNVSVGQPALSFIVALDTGSDLFWL 131
ND+ +S N TY S +G +Y G PA + ++ +DTGSD+ W+
Sbjct: 105 NDRLNTIWSKNNGTYSTMSNLPLQPGSKVGTGNYIVTAGFGTPAKNSLLIIDTGSDVTWI 164
Query: 132 PCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC-ELQKQCPSAGSNCPYQ 189
C C C ++ I+ P SS+ + C S+ C EL C Y+
Sbjct: 165 QCKPCSDCYSQVDP---------IFEPQQSSSYKHLSCLSSACTELTTMNHCRLGGCVYE 215
Query: 190 VRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLG 249
+ Y DG+ S G ++ L L +D S +FGCG TG F A GL GLG
Sbjct: 216 INY-GDGSRSQGDFSQETLTLGSDSFPS------FAFGCGHTNTGLFKGSA---GLLGLG 265
Query: 250 MDKTSVPSILANQGLIPNSFSMCF----GSDGTGRISFGDKGSPGQGE-TPFSLRQTHPT 304
S PS + FS C S TG S G P P +P+
Sbjct: 266 RTALSFPS--QTKSKYGGQFSYCLPDFVSSTSTGSFSVGQGSIPATATFVPLVSNSNYPS 323
Query: 305 -YNITITQVSVGGN------AVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRE 357
Y + + +SVGG AV I DSGT T L AY + +F S K
Sbjct: 324 FYFVGLNGISVGGERLSIPPAVLGRGGTIVDSGTVITRLVPQAYDALKTSFRS----KTR 379
Query: 358 TSTSDLPF---EYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCL 414
S PF + CY LS + + P + + V+ ++ + + G + CL
Sbjct: 380 NLPSAKPFSILDTCYDLS-SYSQVRIPTITFHFQNNADVAVSAVGILFTIQSDGSQV-CL 437
Query: 415 GVV---KSDNVNIIG 426
+S + NIIG
Sbjct: 438 AFASASQSISTNIIG 452
>gi|88174554|gb|ABD39352.1| chloroplast nucleoid DNA-binding protein [Oryza barthii]
Length = 321
Score = 78.2 bits (191), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 85/344 (24%), Positives = 142/344 (41%), Gaps = 45/344 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+ +V +G P+ + IV +DTGS W+ C+C C H + + + S+T +
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSTSWVFCECDGC-H---------TNPRTFLQSRSTTCA 50
Query: 165 KVPCNSTLCELQKQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
KV C +++C L P +CP++V Y DG+ S G L +D L + +K
Sbjct: 51 KVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSY-QDGSASYGILYQDTLTFSDVQKIPG- 108
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC------- 272
SFGC G+ G +GL G+G SV L + FS C
Sbjct: 109 ----FSFGCNMDSFGANEFGNV-DGLLGMGAGAMSV---LKQSSPTFDCFSYCLPLQKSE 160
Query: 273 --FGSDGTGRISFGDKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFS---- 324
F S TG S G + + ++ + + +T +SV G + S
Sbjct: 161 RGFFSKTTGYFSLGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSIFSR 220
Query: 325 --AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
+FDSG+ +Y+ D A + +S+ L R + + CY + + P
Sbjct: 221 KGVVFDSGSELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRSVDEG-DMPA 277
Query: 383 VNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
++L G F + V V + ++CL +++V+IIG
Sbjct: 278 ISLHFDDGARFDLGSHGVFVERSVQEQDVWCLAFAPTESVSIIG 321
>gi|118487542|gb|ABK95598.1| unknown [Populus trichocarpa]
Length = 471
Score = 78.2 bits (191), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 86/355 (24%), Positives = 147/355 (41%), Gaps = 43/355 (12%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+Y + +G P + + LDTGS L WL C C H +Y P+ S T
Sbjct: 125 YYVKLGLGTPPKYYAMILDTGSSLSWLQCQPCAVYCHAQADP--------LYDPSVSKTY 176
Query: 164 SKVPCNSTLCELQKQ-------CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
K+ C S C K C + + C Y Y D + S G+L +D+L L + +
Sbjct: 177 KKLSCASVECSRLKAATLNDPLCETDSNACLYTASY-GDTSFSIGYLSQDLLTLTSSQTL 235
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD 276
+ ++GCG+ G F A G+ GL DK S+ + L+ + ++FS C +
Sbjct: 236 -----PQFTYGCGQDNQGLFGRAA---GIIGLARDKLSMLAQLSTK--YGHAFSYCLPTA 285
Query: 277 GTGRISFGDKG----SPGQGE-TPFSLRQTHPT-YNITITQVSVGGN-----AVNFEFSA 325
+G G SP + TP +P+ Y + +T ++V G A +
Sbjct: 286 NSGSSGGGFLSIGSISPTSYKFTPMLTDSKNPSLYFLRLTAITVSGRPLDLAAAMYRVPT 345
Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNL 385
+ DSGT T L Y + + F + K + + + C+ S + P + +
Sbjct: 346 LIDSGTVITRLPMSMYAALRQAFVKIMSTKYAKAPAYSILDTCFKGSLKSIS-AVPEIKM 404
Query: 386 TMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG----REYPIANNIS 436
+GG + P +++ ++ L G ++ + IIG + Y IA ++S
Sbjct: 405 IFQGGADLTLRAPSILIEADKGITCLAFAGSSGTNQIAIIGNRQQQTYNIAYDVS 459
>gi|413923782|gb|AFW63714.1| hypothetical protein ZEAMMB73_300584 [Zea mays]
Length = 458
Score = 78.2 bits (191), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 107/374 (28%), Positives = 158/374 (42%), Gaps = 52/374 (13%)
Query: 34 FHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQ--GNDKTPLTFSA 91
HHRY DP + P K L R R +LR + + G + +A
Sbjct: 59 LHHRY-DPCSPV------PSK----KVPTLEERLRRDQLRAAYIKRKFSGAGDIEQSDAA 107
Query: 92 GNDTYRLNSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQV 149
T SL L Y V +G PA++ +++DTGSD+ W+ C C C ++S
Sbjct: 108 TVPTTLGTSLSTLEYVITVGIGSPAVTQTMSMDTGSDVSWVQCKPCSQCHSEVDS----- 162
Query: 150 IDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSA----GSNCPYQVRYLSDGTMSTGFLVE 205
++ P++SST S C+S C Q S C Y V Y G S+
Sbjct: 163 ----LFDPSSSSTYSPFSCSSAPCAQLSQSQEGNGCMSSQCQYIVNY---GDSSSTTGTY 215
Query: 206 DVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLI 265
L S + FGC + ++G F D +GL GLG S+ S A G
Sbjct: 216 SSDTLTL----GSSAMTDFQFGCSQSESGGFNDQT--DGLMGLGGGAQSLASQTA--GTF 267
Query: 266 PNSFSMCF--GSDGTGRISFGDKGSPGQGETPFSLRQTH-PTYNITITQ-VSVGGNAVN- 320
+FS C S +G ++ G GS G +TP LR T PTY + + + + VG +N
Sbjct: 268 GTAFSYCLPPTSGSSGFLTLG-TGSSGFVKTPM-LRSTQIPTYYVVLLESIKVGSQQLNL 325
Query: 321 ----FEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQT 376
F ++ DSGT T L AY+ +S F + ++ + S + + C+ S Q+
Sbjct: 326 PTSVFSAGSLMDSGTIITRLPPTAYSALSSAFKAGMQQYPPATPSGI-LDTCFDFS-GQS 383
Query: 377 NFEYPVVNLTMKGG 390
+ P V L GG
Sbjct: 384 SISIPTVTLVFSGG 397
>gi|449523529|ref|XP_004168776.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 461
Score = 77.8 bits (190), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 102/424 (24%), Positives = 163/424 (38%), Gaps = 64/424 (15%)
Query: 29 TFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQG---NDKT 85
+ GF ++ D VK + + L + ++R RL LAA D+
Sbjct: 48 SHGFRVRLKHVDHVKNLTRFERLRR-------GVARGKNRLHRLNAMVLAAANATVGDQV 100
Query: 86 PLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSS 145
AGN + + +++G P SF +DTGSDL W C C + S
Sbjct: 101 KAPVVAGNGEFLMK---------LAIGSPPRSFSAIMDTGSDLIW--TQCKPCQQCFDQS 149
Query: 146 SGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVE 205
+ I+ P SS+ K+ C+S LC + C Y Y D + + G L
Sbjct: 150 T------PIFDPKQSSSFYKISCSSELCGALPTSTCSSDGCEYLYTY-GDSSSTQGVLAF 202
Query: 206 DVLHLATDEKQSKSVDSRISFGCGRVQTGS-FLDGAAPNGLFGLGMDKTSVPSILANQGL 264
+ + S+ + FGCG G F GA GL GLG S+ S L Q
Sbjct: 203 ETFTFGDSTEDQISIPG-LGFGCGNDNNGDGFSQGA---GLVGLGRGPLSLVSQLKEQKF 258
Query: 265 I----------PNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPT-YNITITQVS 313
P+S + ++ T + S + + TP + P+ Y +++ +S
Sbjct: 259 AYCLTAIDDSKPSSLLLGSLANITPKTSKDEMKT-----TPLIKNPSQPSFYYLSLQGIS 313
Query: 314 VGGNAVN-----FEF------SAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSD 362
VGG ++ FE I DSGT+ TY+ + A+T + F + + S +
Sbjct: 314 VGGTQLSIPKSTFELHDDGSGGVIIDSGTTITYVENSAFTSLKNEFIAQMNLPVDDSGTG 373
Query: 363 LPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNV 422
+ C+ L E P + KG + +I S+ L CL + S +
Sbjct: 374 -GLDLCFNLPAGTNQVEVPKLTFHFKGADLELPGENYMIGDSKAG---LLCLAIGSSRGM 429
Query: 423 NIIG 426
+I G
Sbjct: 430 SIFG 433
>gi|357132618|ref|XP_003567926.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 468
Score = 77.8 bits (190), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 96/379 (25%), Positives = 153/379 (40%), Gaps = 62/379 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
++ + VG PA F++ DTGSDL W+ C S ++S ++ P S + S
Sbjct: 104 YFVRLRVGTPAQPFVLVADTGSDLTWVKCSSPSSSSSSPAASPPQ---RVFRPAGSKSWS 160
Query: 165 KVPCNSTLCELQ-----KQCPSAGSNCPYQVRYLSDGTMSTGFLVED--VLHLATDEKQS 217
+PC+S C+ C S C Y RY D + + G + D + L+ ++
Sbjct: 161 PLPCDSDTCKSYVPFSLANCSSPPDPCSYDYRY-KDNSSARGVVGLDSATVSLSGNDGTR 219
Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---- 273
K+ + GC G + +G+ LG S S A++ FS C
Sbjct: 220 KAKLQEVVLGCTTSYDGQSFKSS--DGVLSLGNSNISFASRAASR--FGGRFSYCLVDHL 275
Query: 274 -GSDGTGRISFGDKGSPGQG-----ETPFSL---RQTHPTYNITITQVSVGGNAVN---- 320
+ T ++FG+ S TP L +T P Y +++ V+V G +
Sbjct: 276 APRNATSFLTFGNGDSSPGDDSSSRRTPLVLLEDARTRPFYFVSVDAVTVAGERLEILPD 335
Query: 321 -FEFS----AIFDSGTSFTYLNDPAY----TQISETFNSLAKEKRETSTSDLPFEYCYVL 371
++F AI DSGTS T L PAY IS+ F + + + PFEYCY
Sbjct: 336 VWDFRKNGGAILDSGTSLTILATPAYDAVVKAISKQFAGVPRVNMD------PFEYCYNW 389
Query: 372 SPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD--NVNIIGR-- 427
+ + E P + L G ++ + P + C+GVV+ V++IG
Sbjct: 390 T--GVSAEIPRMELRFAGAATLAPPGKSYVIDTAPG---VKCIGVVEGAWPGVSVIGNIL 444
Query: 428 ------EYPIANNISLFHN 440
E+ +AN F
Sbjct: 445 QQEHLWEFDLANRWLRFKQ 463
>gi|88174571|gb|ABD39360.1| chloroplast nucleoid DNA-binding protein [Oryza nivara]
Length = 321
Score = 77.8 bits (190), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 85/344 (24%), Positives = 141/344 (40%), Gaps = 45/344 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+ +V +G PA + IV +DTGS W+ C+C C H + + + S+T +
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCECDGC-H---------TNPRTFLQSRSTTCA 50
Query: 165 KVPCNSTLCELQKQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
KV C +++C L P +CP++V Y DG+ S G L +D L + +K
Sbjct: 51 KVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSY-QDGSASYGILYQDTLTFSDVQKIPS- 108
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC------- 272
+FGC G+ G +GL G+G SV L + FS C
Sbjct: 109 ----FTFGCNLDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPRFDGFSYCLPLQKSE 160
Query: 273 --FGSDGTGRISFGDKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFS---- 324
F S TG S G + + R+ + + + +SV G + S
Sbjct: 161 RGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSR 220
Query: 325 --AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
+FDSG+ +Y+ D A + +S+ L R + + CY + + P
Sbjct: 221 KGVVFDSGSELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRSVDEG-DMPA 277
Query: 383 VNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
++L G F + V V + ++CL +++V+IIG
Sbjct: 278 ISLHFDDGARFDLGSKGVFVERSVQEQDVWCLAFAPTESVSIIG 321
>gi|195625122|gb|ACG34391.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 471
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 103/346 (29%), Positives = 154/346 (44%), Gaps = 45/346 (13%)
Query: 99 NSLGFLHY-TNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSP 157
S+G +Y T + +G P+ S+ + +DTGS L WL C CV + G + D P
Sbjct: 127 TSVGVGNYVTQLGLGTPSTSYAMVVDTGSSLTWL--QCSPCVVSCHRQVGPLFD-----P 179
Query: 158 NTSSTSSKVPCNSTLC-ELQKQC--PSAGSN---CPYQVRYLSDGTMSTGFLVEDVLHLA 211
SST + V C+++ C ELQ PSA S C YQ Y D + S G L D +
Sbjct: 180 RASSTYASVRCSASQCDELQAATLNPSACSASNVCIYQASY-GDSSFSVGSLSTDTVSFG 238
Query: 212 TDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSM 271
+ S +GCG+ G F A GL GL +K S+ LA + SFS
Sbjct: 239 STRYPS------FYYGCGQDNEGLFGRSA---GLIGLARNKLSLLYQLAPS--LGYSFSY 287
Query: 272 CFGSDG-TGRISFGDKGSPGQ--GETPFSLRQTHPT-YNITITQVSVGGNAVNF---EFS 324
C + TG +S G + G TP + + Y IT++ +SVGG+ + E+S
Sbjct: 288 CLPTAASTGYLSIGPYNT-GHYYSYTPMASSSLDASLYFITLSGMSVGGSPLAVSPSEYS 346
Query: 325 A---IFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEYCYVLSPNQTNFEY 380
+ I DSGT T L +T +S+ ++A +R + S L + C+ +Q
Sbjct: 347 SLPTIIDSGTVITRLPTAVHTALSKAVAQAMAGAQRAPAFSIL--DTCFEGQASQ--LRV 402
Query: 381 PVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
P V + GG + V++ + CL +D+ IIG
Sbjct: 403 PTVAMAFAGGASMKLTTRNVLIDVDDS---TTCLAFAPTDSTAIIG 445
>gi|77553049|gb|ABA95845.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
Length = 372
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 84/316 (26%), Positives = 135/316 (42%), Gaps = 43/316 (13%)
Query: 92 GNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVI 150
G+D+ R N ++ +S+G P + +V +DTGS L W+ C +C + + +GQ
Sbjct: 16 GDDSMRKNK----YFMGISLGTPPVFNLVTIDTGSTLSWVQCKNCQIKCYDQAAKAGQ-- 69
Query: 151 DFNIYSPNTSSTSSKVPCNSTLCE-------LQKQCPSAGSNCPYQVRYLSDGTMSTGFL 203
I++P SST SKV C++ C ++ C C Y +RY S G S G+L
Sbjct: 70 ---IFNPYNSSTYSKVGCSTEACNGMHMDLAVEYGCVEEDDTCIYSLRYGS-GEYSVGYL 125
Query: 204 VEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQG 263
+D L LA++ +S+D+ I FGCG L G+ G G S + + Q
Sbjct: 126 GKDRLTLASN----RSIDNFI-FGCGEDN----LYNGVNAGIIGFGTKSYSFFNQVCQQT 176
Query: 264 LIPNSFSMCFGSD--GTGRISFGDKGSP-GQGETPFSLRQTHPTYNITITQVSVGGNAVN 320
+FS CF D G ++ G T P Y I Q+ + N +
Sbjct: 177 DY-TAFSYCFPRDHENEGSLTIGPYARDINLMWTKLIYYDHKPAY--AIQQLDMMVNGIR 233
Query: 321 FEFS--------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLS 372
E I DSGT+ TY+ P + + + + K T D C++ +
Sbjct: 234 LEIDPYIYISKMTIVDSGTADTYILSPVFDALDKAMTKEMQAKGYTRGWD-ERRICFISN 292
Query: 373 PNQTNF-EYPVVNLTM 387
N+ ++P V + +
Sbjct: 293 SGSANWNDFPTVEMKL 308
>gi|168040957|ref|XP_001772959.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675692|gb|EDQ62184.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 351
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 93/317 (29%), Positives = 134/317 (42%), Gaps = 66/317 (20%)
Query: 108 NVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
+S+G P F +DTGSDL W+ C C C + ++ P SS+ S
Sbjct: 11 QISLGTPPQQFSAIVDTGSDLCWVQCAPCARCFEQPDP---------LFIPLASSSYSNA 61
Query: 167 PCNSTLCE-LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
C +LC+ L + S + C Y Y DG+ + G + + L + S +RI
Sbjct: 62 SCTDSLCDALPRPTCSMRNTCTYSYSY-GDGSNTRGDFAFETVTL------NGSTLARIG 114
Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT----GRI 281
FGCG Q G+F A +GL GLG S+PS L + + FS C T I
Sbjct: 115 FGCGHNQEGTF---AGADGLIGLGQGPLSLPSQLNSS--FTHIFSYCLVDQSTTGTFSPI 169
Query: 282 SFGDKGSPGQGE-TPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA-----------IFD 328
+FG+ + TP + +P+ Y + + +SVG V SA I D
Sbjct: 170 TFGNAAENSRASFTPLLQNEDNPSYYYVGVESISVGNRRVPTPPSAFRIDANGVGGVILD 229
Query: 329 SGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDL-PFEY----CYVLS---------PN 374
SGT+ TY A+ I LA+ +R+ S + P Y CY +S P+
Sbjct: 230 SGTTITYWRLAAFIPI------LAELRRQISYPEADPTPYGLNLCYDISSVSASSLTLPS 283
Query: 375 QT------NFEYPVVNL 385
T +FE PV NL
Sbjct: 284 MTVHLTNVDFEIPVSNL 300
>gi|88174581|gb|ABD39365.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
Group]
Length = 321
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 85/344 (24%), Positives = 141/344 (40%), Gaps = 45/344 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+ +V +G PA + IV +DTGS W+ C+C C H + + + S+T +
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCECDGC-H---------TNPRTFLQSRSTTCA 50
Query: 165 KVPCNSTLCELQKQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
KV C +++C L P +CP++V Y DG+ S G L +D L + +K
Sbjct: 51 KVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSY-QDGSASYGILYQDTLTFSDVQKIPS- 108
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC------- 272
+FGC G+ G +GL G+G SV L + FS C
Sbjct: 109 ----FTFGCNLDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPRFDGFSYCLPLQKSE 160
Query: 273 --FGSDGTGRISFGDKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFS---- 324
F S TG S G + + R+ + + + +SV G + S
Sbjct: 161 RGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSR 220
Query: 325 --AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
+FDSG+ +Y+ D A + +S+ L R + + CY + + P
Sbjct: 221 KGVVFDSGSELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRSVDEG-DMPA 277
Query: 383 VNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
++L G F + V V + ++CL +++V+IIG
Sbjct: 278 ISLHFDDGARFDLGRRGVFVERSVQEQDVWCLAFAPTESVSIIG 321
>gi|293336306|ref|NP_001168599.1| uncharacterized protein LOC100382383 [Zea mays]
gi|223949441|gb|ACN28804.1| unknown [Zea mays]
Length = 326
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 90/328 (27%), Positives = 140/328 (42%), Gaps = 48/328 (14%)
Query: 120 VALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE--LQ 176
+ LDTGSD+ W+ C C C + ++ P+ S++ + V C+S C
Sbjct: 1 MVLDTGSDVTWVQCQPCADCYQQSDP---------VFDPSLSASYAAVSCDSQRCRDLDT 51
Query: 177 KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSF 236
C +A C Y+V Y DG+ + G + L L ++ GCG G F
Sbjct: 52 AACRNATGACLYEVAY-GDGSYTVGDFATETLTLGDSTPVGN-----VAIGCGHDNEGLF 105
Query: 237 LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGTGRISFGDKGSPGQGE 293
+ A L G + S PS ++ ++FS C S + FGD +
Sbjct: 106 VGAAGLLALGGGPL---SFPSQISA-----STFSYCLVDRDSPAASTLQFGDGAAEAGTV 157
Query: 294 TPFSLR--QTHPTYNITITQVSVGGNAVNFEFSA------------IFDSGTSFTYLNDP 339
T +R +T Y + ++ +SVGG ++ SA I DSGT+ T L
Sbjct: 158 TAPLVRSPRTSTFYYVALSGISVGGQPLSIPASAFAMDATSGSGGVIVDSGTAVTRLQSA 217
Query: 340 AYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPI 399
AY + + F A TS L F+ CY LS ++T+ E P V+L +GGG +
Sbjct: 218 AYAALRDAFVQGAPSLPRTSGVSL-FDTCYDLS-DRTSVEVPAVSLRFEGGGALRLPAKN 275
Query: 400 VIVSSEPKGLYLYCLGVVKSD-NVNIIG 426
++ + G YCL ++ V+IIG
Sbjct: 276 YLIPVDGAG--TYCLAFAPTNAAVSIIG 301
>gi|88174556|gb|ABD39353.1| chloroplast nucleoid DNA-binding protein [Oryza barthii]
Length = 321
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 85/344 (24%), Positives = 142/344 (41%), Gaps = 45/344 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+ +V +G P+ + IV +DTGS W+ C+C C H + + + S+T +
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSTSWVFCECDGC-H---------TNPRTFLQSRSTTCA 50
Query: 165 KVPCNSTLCELQKQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
KV C +++C L P +CP++V Y DG+ S G L +D L + +K
Sbjct: 51 KVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSY-QDGSASYGILYQDTLTFSDVQKIPG- 108
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC------- 272
SFGC G+ G +GL G+G SV L + FS C
Sbjct: 109 ----FSFGCNMDSFGANEFGNV-DGLLGMGAGAMSV---LKQSSPTFDCFSYCLPLQKSE 160
Query: 273 --FGSDGTGRISFGDKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFS---- 324
F S TG S G + + ++ + + +T +SV G + S
Sbjct: 161 RGFFSKTTGYFSLGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSIFSR 220
Query: 325 --AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
+FDSG+ +Y+ D A + +S+ L R + + CY + + P
Sbjct: 221 KGVVFDSGSELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRSVDEG-DMPA 277
Query: 383 VNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
++L G F + V V + ++CL +++V+IIG
Sbjct: 278 ISLHFDDGARFDLGRGGVFVERSVQEQDVWCLAFAPTESVSIIG 321
>gi|88174579|gb|ABD39364.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
Group]
gi|88174585|gb|ABD39367.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
Group]
gi|88174595|gb|ABD39372.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
gi|88174599|gb|ABD39374.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
gi|88174607|gb|ABD39378.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 88/350 (25%), Positives = 145/350 (41%), Gaps = 57/350 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+ +V +G PA + IV +DTGS W+ C+C C H + + + S+T +
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCECDGC-H---------TNPRTFLQSRSTTCA 50
Query: 165 KVPCNSTLCELQKQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
KV C +++C L P +CP++V Y DG+ S G L +D L + +K
Sbjct: 51 KVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSY-QDGSASYGILYQDTLTFSDVQKIPG- 108
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC------- 272
SFGC G+ G +GL G+G SV L + FS C
Sbjct: 109 ----FSFGCNMDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPTFDCFSYCLPLQKSE 160
Query: 273 --FGSDGTGRISFGDKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFS---- 324
F S TG S G + + ++ + + +T +SV G + S
Sbjct: 161 RGFFSKTTGYFSLGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVFSR 220
Query: 325 --AIFDSGTSFTYLNDPAYTQISETFNSL------AKEKRETSTSDLPFEYCYVLSPNQT 376
+FDSG+ +Y+ D A + +S+ L A+E+ E + CY +
Sbjct: 221 KGVVFDSGSELSYIPDRALSVLSQRIRELLLKRGAAEEESERN--------CYDMRSVDE 272
Query: 377 NFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
+ P ++L G F + V V + ++CL +++V+IIG
Sbjct: 273 G-DMPAISLHFDDGARFDLGSHGVFVERSVQEQDVWCLAFAPTESVSIIG 321
>gi|449527515|ref|XP_004170756.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial [Cucumis
sativus]
Length = 364
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 84/299 (28%), Positives = 133/299 (44%), Gaps = 49/299 (16%)
Query: 111 VGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCN 169
+G PA + ++ALDT +D W+PC C+ C ++S + SS+ +PC
Sbjct: 32 IGTPAQTLLLALDTSNDAAWIPCSGCIGCPST-----------TVFSSDKSSSFRPLPCQ 80
Query: 170 STLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCG 229
S C +GS C + + Y S + LV+D L LATD S +FGC
Sbjct: 81 SPQCNQVPNPSCSGSACGFNLTYGSSTVAAD--LVQDNLTLATDSVPS------YTFGCI 132
Query: 230 RVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS----DGTGRISFGD 285
R TGS + LG+ + + + +Q L ++FS C S + +G + G
Sbjct: 133 RKATGSSVPPQG-----LLGLGRGPLSLLGQSQSLYQSTFSYCLPSFKSVNFSGSLRLGP 187
Query: 286 KGSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEFSA-----------IFDSGTS 332
P + + LR + Y + + + VG V+ SA + DSGT+
Sbjct: 188 VAQPIRIKYTPLLRNPRRSSLYYVNLISIRVGRKIVDIPPSALAFNSATGAGTVIDSGTT 247
Query: 333 FTYLNDPAYTQISETFNSLAKEKRETSTSDL-PFEYCY---VLSPNQTNFEYPVVNLTM 387
FT L PAYT + + F + R + S L F+ CY ++SP T F + +N+T+
Sbjct: 248 FTRLVAPAYTAVRDEFRR--RVGRNVTVSSLGGFDTCYTVPIISPTIT-FMFAGMNVTL 303
>gi|54290731|dbj|BAD62401.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 521
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 108/401 (26%), Positives = 161/401 (40%), Gaps = 62/401 (15%)
Query: 62 ALAHRDRYFRLRGRGLAAQGND----KTPLTFSAGNDT----YRLNSLGFLHYT-NVSVG 112
+LA R R R R + + T L+ +AG T + +S+ L Y + +G
Sbjct: 119 SLAERLRRDRARTNYIVTKATGGRTAATALSDAAGGGTSIPTFLGDSVNSLEYVVTLGIG 178
Query: 113 QPALSFIVALDTGSDLFWL---PCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCN 169
PA+ V +DTGSDL W+ PC C + ++ P++SS+ + VPC+
Sbjct: 179 TPAVQQTVLIDTGSDLSWVQCKPCGAGECYAQKDP---------LFDPSSSSSYASVPCD 229
Query: 170 STLCELQKQCP----------SAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
S C A + C Y + Y + T +TG + L L +
Sbjct: 230 SDACRKLAAGAYGHGCTGVSGGAAALCEYGIEYGNRAT-TTGVYSTETLTL-----KPGV 283
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTG 279
V + FGCG Q G + +GL GLG S+ S ++Q P S+ + S G G
Sbjct: 284 VVADFGFGCGDHQHGPYEKF---DGLLGLGGAPESLVSQTSSQFGGPFSYCLPPTSGGAG 340
Query: 280 RISFG-------DKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA-----I 326
++ G + G TP + PT Y +T+T +SVGG + SA +
Sbjct: 341 FLTLGAPPNSSSSTAASGLSFTPMRRLPSVPTFYIVTLTGISVGGAPLAIPPSAFSSGMV 400
Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP-FEYCYVLSPNQTNFEYPVVNL 385
DSGT T L AY + F S E R S+ + CY + N P ++L
Sbjct: 401 IDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNGGVLDTCYDFT-GHANVTVPTISL 459
Query: 386 TMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
T GG + P + L CL + N IG
Sbjct: 460 TFSGGATIDLAAPAGV-------LVDGCLAFAGAGTDNAIG 493
>gi|356498789|ref|XP_003518231.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 446
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 87/310 (28%), Positives = 132/310 (42%), Gaps = 54/310 (17%)
Query: 108 NVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
N S+G+P + + +DTGS L W+ C H +S S Q + I+ P+ SST S +
Sbjct: 96 NFSIGEPPIPQLAVMDTGSSLTWVMC------HPCSSCSQQSVP--IFDPSKSSTYSNLS 147
Query: 168 CNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFG 227
C+ +C CPY V Y+ G+ S G + L L T ++ V S I FG
Sbjct: 148 CSEC-----NKCDVVNGECPYSVEYVGSGS-SQGIYAREQLTLETIDESIIKVPSLI-FG 200
Query: 228 CGRVQTGSFLDGAAP----NGLFGLGMDKTSVPSILANQGLIPN---SFSMCFGSDGT-- 278
CGR S P NG+FGLG + S L+P+ FS C G+
Sbjct: 201 CGR--KFSISSNGYPYQGINGVFGLGSGRFS---------LLPSFGKKFSYCIGNLRNTN 249
Query: 279 ---GRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVN-----FEFS------ 324
R+ GDK + QG++ +L + Y + + +S+GG ++ FE S
Sbjct: 250 YKFNRLVLGDKANM-QGDST-TLNVINGLYYVNLEAISIGGRKLDIDPTLFERSITDNNS 307
Query: 325 -AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDL--PFEYCYVLSPNQTNFEYP 381
I DSG T+L + +S +L + + D P+ CY +Q +P
Sbjct: 308 GVIIDSGADHTWLTKYGFEVLSFEVENLLEGVLVLAQQDKHNPYTLCYSGVVSQDLSGFP 367
Query: 382 VVNLTMKGGG 391
+V G
Sbjct: 368 LVTFHFAEGA 377
>gi|88174605|gb|ABD39377.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 85/344 (24%), Positives = 141/344 (40%), Gaps = 45/344 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+ +V +G PA + IV +DTGS W+ C+C C H + + + S+T +
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCECDGC-H---------TNPRTFLQSRSTTCA 50
Query: 165 KVPCNSTLCELQKQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
KV C +++C L P +CP++V Y DG+ S G L +D L + +K
Sbjct: 51 KVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSY-QDGSASYGILYQDTLTFSDVQKIPS- 108
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC------- 272
+FGC G+ G +GL G+G SV L + FS C
Sbjct: 109 ----FTFGCNLDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPRFDGFSYCLPLQKSE 160
Query: 273 --FGSDGTGRISFGDKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFS---- 324
F S TG S G + + R+ + + + +SV G + S
Sbjct: 161 RGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSR 220
Query: 325 --AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
+FDSG+ +Y+ D A + +S+ L R + + CY + + P
Sbjct: 221 KGVVFDSGSELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRSVDEG-DMPA 277
Query: 383 VNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
++L G F + V V + ++CL +++V+IIG
Sbjct: 278 ISLHFDDGARFDLGSHGVFVERSVQEQDVWCLAFAPTESVSIIG 321
>gi|195607464|gb|ACG25562.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 478
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 92/323 (28%), Positives = 136/323 (42%), Gaps = 35/323 (10%)
Query: 65 HRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYT-NVSVGQPALSFIVALD 123
R Y R G A Q D +A +G L+Y S+G P ++ + +D
Sbjct: 99 RRAEYILRRVSGRAPQLWDSKAAAAAATVPASWGYDIGTLNYVVTASLGTPGVAQTMEVD 158
Query: 124 TGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE---LQKQCP 180
TGSDL W+ C S S + D P SS+ + VPC +C +
Sbjct: 159 TGSDLSWVQCKPCSAAPSCYSQKDPLFD-----PAQSSSYAAVPCGGPVCAGLGIYAASA 213
Query: 181 SAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGA 240
+ + C Y V Y DG+ +TG D L L+ + S FGCG Q+G F +G
Sbjct: 214 CSAAQCGYVVSY-GDGSNTTGVYSSDTLTLS-----ASSAVQGFFFGCGHAQSGLF-NGV 266
Query: 241 APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT--GRISFGDKGSPGQGETPFSL 298
+GL GLG ++ S+ + G FS C + + G ++ G G P FS
Sbjct: 267 --DGLLGLGREQPSL--VEQTAGTYGGVFSYCLPTKPSTAGYLTLG-LGGPSGAAPGFST 321
Query: 299 RQTHPT------YNITITQVSVGGNAVNFEFSA-----IFDSGTSFTYLNDPAYTQISET 347
Q P+ Y + +T +SVGG ++ SA + D+GT T L AY +
Sbjct: 322 TQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTVVDTGTVITRLPPTAYAALRSA 381
Query: 348 FNS-LAKEKRETSTSDLPFEYCY 369
F S +A T+ S+ + CY
Sbjct: 382 FRSGMASYGYPTAPSNGILDTCY 404
>gi|225430555|ref|XP_002285593.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 481
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 85/305 (27%), Positives = 132/305 (43%), Gaps = 40/305 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVS-CVHGLNSSSGQVIDFNIYSPNTSST 162
+ V +G P DTGSDL W C+ C C H I++P+ S++
Sbjct: 138 YVVTVGLGTPKRDLTFIFDTGSDLTWTQCEPCARYCYHQQEP---------IFNPSKSTS 188
Query: 163 SSKVPCNSTLCELQK----QCPS-AGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
+ + C+S C+ K PS + S C Y ++Y D + S GF +D L L + +
Sbjct: 189 YTNISCSSPTCDELKSGTGNSPSCSASTCVYGIQY-GDQSYSVGFFAQDKLALTSTD--- 244
Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--GS 275
V + FGCG+ G F+ A GL GLG + S+ S A + FS C S
Sbjct: 245 --VFNNFLFGCGQNNRGLFVGVA---GLIGLGRNALSLVSQTAQK--YGKLFSYCLPSTS 297
Query: 276 DGTGRISFGDKGSPGQGE--TPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA------I 326
TG ++FG G + TP + P+ Y + + +SVGG ++ S I
Sbjct: 298 SSTGYLTFGSGGGTSKAVKFTPSLVNSQGPSFYFLNLIAISVGGRKLSTSASVFSTAGTI 357
Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLT 386
DSGT + L AY+ + +F + + + + + + CY S T + P +NL
Sbjct: 358 IDSGTVISRLPPTAYSDLRASFQQQMSKYPKAAPASI-LDTCYDFSQYDT-VDVPKINLY 415
Query: 387 MKGGG 391
G
Sbjct: 416 FSDGA 420
>gi|125536419|gb|EAY82907.1| hypothetical protein OsI_38120 [Oryza sativa Indica Group]
Length = 448
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 100/357 (28%), Positives = 140/357 (39%), Gaps = 48/357 (13%)
Query: 68 RYFRLRGRGLAAQGNDKTPLTFSAGNDT---YRLNSLGFLHYTNVSVGQPALSFIVALDT 124
R R LAA+ + + +++G T + G + S+G+P L +DT
Sbjct: 47 RTAESRNLSLAAERSRRRLSVYTSGTGTKAPVTKSQKGGKYIMQFSIGEPPLLIWAEVDT 106
Query: 125 GSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE-------LQK 177
GSDL W+ C S +G N +Y P S +S K+PC+S LC+ +
Sbjct: 107 GSDLMWVKC---SPCNGCNPPPSP-----LYDPARSRSSGKLPCSSQLCQALGRGRIISD 158
Query: 178 QCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFL 237
QC C Y Y G ST + VL T V + +SFG GS
Sbjct: 159 QCSDDPPLCGYHYAYGHSGDHST----QGVLGTETFTFGDGYVANNVSFGRSDTIDGSQF 214
Query: 238 DGAAPNGLFGLGMDKTSVPSILANQGLI------PNSFS-MCFGSDGTGRISFGDKGSPG 290
G A GL GLG S+ S L PN +S + FGS S GD S
Sbjct: 215 GGTA--GLVGLGRGHLSLVSQLGAGRFAYCLAADPNVYSTILFGSLAALDTSAGDVSSTP 272
Query: 291 QGETPFSLRQTHPTYNITITQVSVGGN---------AVNFEFSA--IFDSGTSFTYLNDP 339
P R TH Y + + +SVGG+ A+N + S FDSG T L D
Sbjct: 273 LVTNPKPDRDTH--YYVNLQGISVGGSRLPIKDGTFAINSDGSGGVFFDSGAIDTSLKDA 330
Query: 340 AYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVN 396
AY + + S + + D C+V + Q + P + L G +N
Sbjct: 331 AYQVVRQAITSEIQRLGYDAGDDT----CFVAANQQAVAQMPPLVLHFDDGADMSLN 383
>gi|449453902|ref|XP_004144695.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-1-like, partial [Cucumis sativus]
Length = 716
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 103/424 (24%), Positives = 160/424 (37%), Gaps = 64/424 (15%)
Query: 29 TFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQG---NDKT 85
+ GF ++ D VK + + L + ++R RL LAA D+
Sbjct: 303 SHGFRVRLKHVDHVKNLTRFERLRR-------GVARGKNRLHRLNAMVLAAANATVGDQV 355
Query: 86 PLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSS 145
AGN + + +++G P SF +DTGSDL W C C + S
Sbjct: 356 KAPVVAGNGEFLMK---------LAIGSPPRSFSAIMDTGSDLIW--TQCKPCQQCFDQS 404
Query: 146 SGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVE 205
+ I+ P SS+ K+ C+S LC + C Y Y D + + G L
Sbjct: 405 T------PIFDPKQSSSFYKISCSSELCGALPTSTCSSDGCEYLYTY-GDSSSTQGVLAF 457
Query: 206 DVLHLATDEKQSKSVDSRISFGCGRVQTGS-FLDGAAPNGLFGLGMDKTSVPSILANQGL 264
+ + S+ + FGCG G F GA GL GLG S+ S L Q
Sbjct: 458 ETFTFGDSTEDQISIPG-LGFGCGNDNNGDGFSQGA---GLVGLGRGPLSLVSQLKEQ-- 511
Query: 265 IPNSFSMCFGSDGTGRISFGDKGSPG----------QGETPFSLRQTHPT-YNITITQVS 313
F+ C + + S GS TP + P+ Y +++ +S
Sbjct: 512 ---KFAYCLTAIDDSKPSSLLLGSLANITPKTSKDEMKTTPLIKNPSQPSFYYLSLQGIS 568
Query: 314 VGGNAVN-----FEFS------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSD 362
VGG ++ FE I DSGT+ TY+ + A+T + F + + S +
Sbjct: 569 VGGTQLSIPKSTFELHDDGSGGVIIDSGTTITYVENSAFTSLKNEFIAQMNLPVDDSGTG 628
Query: 363 LPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNV 422
+ C+ L E P + KG + +I S+ L CL + S +
Sbjct: 629 -GLDLCFNLPAGTNQVEVPKLTFHFKGADLELPGENYMIGDSKAG---LLCLAIGSSRGM 684
Query: 423 NIIG 426
+I G
Sbjct: 685 SIFG 688
>gi|225427552|ref|XP_002266498.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 441
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 92/351 (26%), Positives = 148/351 (42%), Gaps = 58/351 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ N+S+G P + I +DTGSDL W C C C QV+ F + P SST
Sbjct: 92 YIMNLSIGTPPVPVIAIVDTGSDLTWTQCRPCTHCYK-------QVVPF--FDPKNSSTY 142
Query: 164 SKVPCNSTLCEL---QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
C ++ C + C + G C + Y +DG+ + G L + L +A+ + S
Sbjct: 143 RDSSCGTSFCLALGNDRSCRN-GKKCTFMYSY-ADGSFTGGNLAVETLTVASTAGKPVSF 200
Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-----GS 275
+FGC G F + ++ G+ GLG+ + S+ S L + I FS C S
Sbjct: 201 PG-FAFGCVHRSGGIFDEHSS--GIVGLGVAELSMISQL--KSTINGRFSYCLLPVFTDS 255
Query: 276 DGTGRISFGDKG---SPGQGETPFSLRQTHPTYN-ITITQVSVGGNAVNF---------- 321
+ RI+FG G G TP ++ Y IT+ SVG +++
Sbjct: 256 SMSSRINFGRSGIVSGAGTVSTPLVMKGPDTYYYLITLEGFSVGKKRLSYKGFSKKAEVE 315
Query: 322 EFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYP 381
E + I DSGT++TYL Y ++ E+ K KR + + CY + +Q + P
Sbjct: 316 EGNIIVDSGTTYTYLPLEFYVKLEESVAHSIKGKRVRDPNGIS-SLCYNTTVDQ--IDAP 372
Query: 382 VVNLTMKGGGPFFVNDPIVIVSSEPKGLY------LYCLGVVKSDNVNIIG 426
++ K V +P + L C V+ + ++ I+G
Sbjct: 373 IITAHFKDAN----------VELQPWNTFLRMQEDLVCFTVLPTSDIGILG 413
>gi|357127503|ref|XP_003565419.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
distachyon]
Length = 486
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 88/356 (24%), Positives = 142/356 (39%), Gaps = 45/356 (12%)
Query: 103 FLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIY-SPNTSS 161
F + + VG P + + DTGSDL W+ C G ++ + ++Y P+ SS
Sbjct: 108 FEYLMAIEVGTPPVRVLAIADTGSDLVWVKC------KGKDNDNNSTAPPSVYFVPSASS 161
Query: 162 TSSKVPCNSTLCEL---QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
T +V C++ C C GS C Y Y DG+ ++G L + +T SK
Sbjct: 162 TYGRVGCDTKACRALSSAASCSPDGS-CEYLYSY-GDGSRASGQLSTETFTFSTIADSSK 219
Query: 219 SVD----------------SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQ 262
+ +++ FGC TG+F +GL GLG S+ S L
Sbjct: 220 TNSHGNNNNNSSSHGQVEIAKLDFGCSTTTTGTF----RADGLVGLGGGPVSLASQLGAT 275
Query: 263 GLIPNSFSMCFG----SDGTGRISFGDKG---SPGQGETPFSLRQTHPTYNITITQVSVG 315
+ FS C ++ + ++FG + PG TP + Y I + ++V
Sbjct: 276 TSLGRKFSYCLAPYANTNASSALNFGSRAVVSEPGAASTPLITGEVETYYTIALDSINVA 335
Query: 316 GN---AVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLS 372
G + I DSGT+ TYL+ T + + K R S + + CY +S
Sbjct: 336 GTKRPTTAAQAHIIVDSGTTLTYLDSALLTPLVKDLTRRIKLPRAESPEKI-LDLCYDIS 394
Query: 373 --PNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
+ P V L + GGG + V + L L + + +V+I+G
Sbjct: 395 GVRGEDALGIPDVTLVLGGGGEVTLKPDNTFVVVQEGVLCLALVATSERQSVSILG 450
>gi|297803034|ref|XP_002869401.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
gi|297315237|gb|EFH45660.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
Length = 438
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 94/359 (26%), Positives = 145/359 (40%), Gaps = 59/359 (16%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
N+S+G P ++ ++ +DT SDL WL C C++C I+ P+ S T
Sbjct: 87 VNISIGSPPVTQLLHMDTASDLLWLQCRPCINCY---------AQSLPIFDPSRSYTHRN 137
Query: 166 VPCNSTLCELQKQCPSAGSN-----CPYQVRYLSDGTMSTGFLVEDVLHLAT--DEKQSK 218
C ++ Q PS N C Y +RY+ DGT S G L +++L T DE S
Sbjct: 138 ESCRTS----QYSMPSLRFNAKTRSCEYSMRYM-DGTGSKGILAKEMLMFNTIYDESSSA 192
Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT 278
++ + FGCG G L G G+ GLG + S+ + FS CFGS
Sbjct: 193 ALHD-VVFGCGHDNYGEPLVGT---GILGLGYGEFSLVHRFGTK------FSYCFGSLDD 242
Query: 279 -----GRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE----------- 322
+ GD G+ G+T L + Y +TI +SV G + +
Sbjct: 243 PSYPHNVLVLGDDGANILGDTT-PLEIYNGFYYVTIEAISVDGIILPIDPWVFNRNHQTG 301
Query: 323 -FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEK---RETSTSDLPFEYCY--VLSPNQT 376
I D+G S T L + AY + + + + + D+ CY L +
Sbjct: 302 LGGTIIDTGNSLTSLVEEAYKPLKNKIEDYFEGRFTAADVNQDDMFKVECYNGNLERDLV 361
Query: 377 NFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREYPIANNI 435
+P+V G ++ V + P ++CL V N+N IG + NI
Sbjct: 362 ESGFPIVTFHFSDGAELSLDVKSVFMKLSPN---VFCLAVTPG-NMNSIGATAQQSYNI 416
>gi|115442309|ref|NP_001045434.1| Os01g0954900 [Oryza sativa Japonica Group]
gi|20161865|dbj|BAB90778.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
gi|113534965|dbj|BAF07348.1| Os01g0954900 [Oryza sativa Japonica Group]
gi|215766867|dbj|BAG99095.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 445
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 87/340 (25%), Positives = 129/340 (37%), Gaps = 51/340 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ +G PA + +VA+D +D W+PC C C S +SP SST
Sbjct: 102 YIARAGLGTPAQTLLVAIDPSNDAAWVPCSACAGCAASSPS----------FSPTQSSTY 151
Query: 164 SKVPCNSTLCEL--QKQCPS-AGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
VPC S C CP+ GS+C + + Y + + L +D L L + V
Sbjct: 152 RTVPCGSPQCAQVPSPSCPAGVGSSCGFNLTYAASTFQA--VLGQDSLALENN------V 203
Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GSD 276
+FGC RV +G + P GL G G S + + + FS C S+
Sbjct: 204 VVSYTFGCLRVVSG---NSVPPQGLIGFGRGPLSF--LSQTKDTYGSVFSYCLPNYRSSN 258
Query: 277 GTGRISFGDKGSPGQGETPFSLRQTH-PT-YNITITQVSVGGNAVNFEFSA--------- 325
+G + G G P + +T L H P+ Y + + + VG V SA
Sbjct: 259 FSGTLKLGPIGQPKRIKTTPLLYNPHRPSLYYVNMIGIRVGSKVVQVPQSALAFNPVTGS 318
Query: 326 --IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
I D+GT FT L P Y + + F + F+ CY P V
Sbjct: 319 GTIIDAGTMFTRLAAPVYAAVRDAFRGRVRTPVAPPLGG--FDTCY-----NVTVSVPTV 371
Query: 384 NLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVN 423
G + + V++ S G+ + SD VN
Sbjct: 372 TFMFAGAVAVTLPEENVMIHSSSGGVACLAMAAGPSDGVN 411
>gi|356558300|ref|XP_003547445.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 447
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 88/329 (26%), Positives = 138/329 (41%), Gaps = 59/329 (17%)
Query: 108 NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSS-- 164
N+S+GQP + +V +DTGSD+ W+ C C +C + L ++ P+ SST S
Sbjct: 104 NISIGQPPIPQLVVMDTGSDILWVMCTPCTNCDNDLGL---------LFDPSKSSTFSPL 154
Query: 165 -KVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
K PC+ C P+ V Y + T S F + V+ TDE S+ D
Sbjct: 155 CKTPCDFEGCRCDP--------IPFTVTYADNSTASGTFGRDTVVFETTDEGTSRISD-- 204
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT----- 278
+ FGCG G D NG+ GL S+ + L + FS C G+
Sbjct: 205 VLFGCGH-NIGHDTD-PGHNGILGLNNGPDSLVTKLGQK------FSYCIGNLADPYYNY 256
Query: 279 GRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVN-----FEFS------AIF 327
++ G+ TPF + Y +T+ +SVG ++ FE I
Sbjct: 257 HQLILGEGADLEGYSTPFEVYNGF--YYVTMEGISVGEKRLDIAPETFEMKENRAGGVII 314
Query: 328 DSGTSFTYLNDPAYTQIS-ETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLT 386
D+G++ T+L D + +S E N L R+ + P+ C+ S ++ +PVV
Sbjct: 315 DTGSTITFLVDSVHKLLSKEVRNLLGWSFRQATIEKSPWMQCFYGSISRDLVGFPVVTFH 374
Query: 387 MKGG-------GPFF--VNDPIVIVSSEP 406
G G FF +ND + ++ P
Sbjct: 375 FSDGADLALDSGSFFNQLNDNVFCMTVGP 403
>gi|328875414|gb|EGG23778.1| putative aspartyl protease [Dictyostelium fasciculatum]
Length = 507
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 93/344 (27%), Positives = 154/344 (44%), Gaps = 55/344 (15%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
T + VG +F+V +DTGS L +P + C +CV +Y P SSTS+K
Sbjct: 124 TQIIVGNT--TFLVQVDTGSLLMAIPLEGCNTCVESR----------PVYHP--SSTSTK 169
Query: 166 VPCNSTLCELQKQCP------SAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
V C+S C+ P S+G +C +Q+RY DG+ +G++ EDV++LA
Sbjct: 170 VACSSDQCKGSGSTPPSCSRTSSGESCDFQIRY-GDGSHVSGYIYEDVVNLA-------G 221
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTS-VP----SILANQGLIPNSFSMCFG 274
+ + +FG +TG F + +G+ G G +S VP S++++ GL N F M
Sbjct: 222 LQGKANFGANDEETGDF-EYPRADGIIGFGRTCSSCVPTVWDSLVSDLGL-KNQFGMLLN 279
Query: 275 SDGTGRISFGDKGSP-----------GQGETPF-SLRQTHPTYNITITQVSVGGNAVNFE 322
+G G +S G+ + Q TPF S++ T I I ++ G+ + E
Sbjct: 280 YEGGGSLSLGEINTSYYTGDIRYTPLVQKNTPFYSVKST----GIRINDYTIPGSKLGQE 335
Query: 323 FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
I DSG++ L AY Q+ F + + + F+ S + ++P
Sbjct: 336 --VIVDSGSTALSLASGAYDQLRNYFQTHYCSIQGVCENPNIFQGSICYSSDDVLSKFPT 393
Query: 383 VNLTMKGGGPFFVNDPIVIVSSE-PKGLYLYCLGVVKSDNVNII 425
+ T GG + +V + G Y YC + ++D+ I
Sbjct: 394 LYFTFDGGVQVAIPPKNYLVKAPLTNGKYGYCFMIERADSTMTI 437
>gi|224060469|ref|XP_002300215.1| predicted protein [Populus trichocarpa]
gi|222847473|gb|EEE85020.1| predicted protein [Populus trichocarpa]
Length = 439
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 104/397 (26%), Positives = 160/397 (40%), Gaps = 62/397 (15%)
Query: 66 RDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTG 125
R+R RL+ L A + + GN + + +++G P ++ LDTG
Sbjct: 67 RNRLQRLQAMALVASSSSEIEAPVLPGNGEFLMK---------LAIGTPPETYSAILDTG 117
Query: 126 SDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGS 184
SDL W C C C H I+ P SS+ SK+ C+S LCE Q S +
Sbjct: 118 SDLIWTQCKPCTQCFHQSTP---------IFDPKKSSSFSKLSCSSQLCEALPQS-SCNN 167
Query: 185 NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGS-FLDGAAPN 243
C Y Y D + + G L + L K+ ++FGCG GS F GA
Sbjct: 168 GCEYLYSY-GDYSSTQGILASETLTFG------KASVPNVAFGCGADNEGSGFSQGA--- 217
Query: 244 GLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGT-------GRISFGDKGSPGQGETP 295
GL GLG S+ S L FS C + D T G ++ + S TP
Sbjct: 218 GLVGLGRGPLSLVSQLKEP-----KFSYCLTTVDDTKTSTLLMGSLASVNASSSAIKTTP 272
Query: 296 FSLRQTHPT-YNITITQVSVGGNAVNFEFSA-----------IFDSGTSFTYLNDPAYTQ 343
HP+ Y +++ +SVG + + S I DSGT+ TYL + A+
Sbjct: 273 LIHSPAHPSFYYLSLEGISVGDTRLPIKKSTFSLQDDGSGGLIIDSGTTITYLEESAFNL 332
Query: 344 ISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVS 403
+++ F + ++S S + C+ L TN E P + G + +I
Sbjct: 333 VAKEFTAKINLPVDSSGST-GLDVCFTLPSGSTNIEVPKLVFHFDGADLELPAENYMIGD 391
Query: 404 SEPKGLYLYCLGVVKSDNVNIIGREYPIANNISLFHN 440
S + + CL + S ++I G N+ + H+
Sbjct: 392 SS---MGVACLAMGSSSGMSIFGNVQ--QQNMLVLHD 423
>gi|357137788|ref|XP_003570481.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 455
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 93/343 (27%), Positives = 143/343 (41%), Gaps = 38/343 (11%)
Query: 99 NSLGFLHY-TNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSP 157
S+G +Y T + +G PA +I+ +DTGS L WL C C + SG V D P
Sbjct: 110 TSVGVGNYVTRMGLGTPAKPYIMVVDTGSSLTWL--QCSPCRVSCHRQSGPVFD-----P 162
Query: 158 NTSSTSSKVPCNSTLCE------LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLA 211
TSS+ + V C+S C+ L S + C YQ Y D + S G+L +D +
Sbjct: 163 KTSSSYAAVSCSSPQCDGLSTATLNPAVCSPSNVCIYQASY-GDSSFSVGYLSKDTVSFG 221
Query: 212 TDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSM 271
+ +GCG+ G F A GL GL +K S+ LA + SFS
Sbjct: 222 ANSVP------NFYYGCGQDNEGLFGRSA---GLMGLARNKLSLLYQLAPT--LGYSFSY 270
Query: 272 CFGS-DGTGRISFGDKGSPGQGETPF-SLRQTHPTYNITITQVSVGGNAVNFEFS----- 324
C S +G +S G G TP S Y I+++ ++V G + S
Sbjct: 271 CLPSTSSSGYLSIGSYNPGGYSYTPMVSNTLDDSLYFISLSGMTVAGKPLAVSSSEYTSL 330
Query: 325 -AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
I DSGT T L YT +S+ + K + + + + C+ ++ P V
Sbjct: 331 PTIIDSGTVITRLPTSVYTALSKAVAAAMKGSTKRAAAYSILDTCFEGQASKLR-AVPAV 389
Query: 384 NLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
++ GG ++ ++V + CL + + IIG
Sbjct: 390 SMAFSGGATLKLSAGNLLVDVDGA---TTCLAFAPARSAAIIG 429
>gi|242053503|ref|XP_002455897.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
gi|241927872|gb|EES01017.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
Length = 493
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 91/352 (25%), Positives = 139/352 (39%), Gaps = 53/352 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++T + VG P+ ++ LDTGSD+ WL C C C SG V D P SS+
Sbjct: 140 YFTKIGVGTPSTPALMVLDTGSDVVWLQCAPCRRCY----DQSGPVFD-----PRRSSSY 190
Query: 164 SKVPCNSTLCELQKQ--CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
V C + LC C C YQV Y DG+++ G + L A +
Sbjct: 191 GAVDCAAPLCRRLDSGGCDLRRRACLYQVAY-GDGSVTAGDFATETLTFAGGARV----- 244
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-------- 273
+R++ GCG G F+ A GL S P+ ++ + SFS C
Sbjct: 245 ARVALGCGHDNEGLFVAAAGLLGLG---RGSLSFPTQISRR--YGKSFSYCLVDRTSSSS 299
Query: 274 ----GSDGTGRISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAV--------- 319
+ ++FG + TP T Y + + +SVGG V
Sbjct: 300 SGAASRSRSSTVTFGPPSASAASFTPMVRNPRMETFYYVQLVGISVGGARVPGVAESDLR 359
Query: 320 ----NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQ 375
I DSGTS T L P+Y+ + + F + A R + F+ CY L +
Sbjct: 360 LDPSTGRGGVIVDSGTSVTRLARPSYSALRDAFRAAAAGLRLSPGGFSLFDTCYDLGGRK 419
Query: 376 TNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD-NVNIIG 426
+ P V++ GG + ++ + +G +C +D V+IIG
Sbjct: 420 V-VKVPTVSMHFAGGAEAALPPENYLIPVDSRG--TFCFAFAGTDGGVSIIG 468
>gi|225216960|gb|ACN85252.1| aspartic proteinase nepenthesin-1 precursor [Oryza officinalis]
Length = 519
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 99/366 (27%), Positives = 149/366 (40%), Gaps = 48/366 (13%)
Query: 99 NSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYS 156
+LG +Y V +G P + V DTGSD W+ C CV + ++
Sbjct: 173 RALGTGNYVVTVGLGTPVSRYTVVFDTGSDTTWVQCQPCVVVCYEQRE--------KLFD 224
Query: 157 PNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
P SST + V C + C +G +C Y V+Y DG+ S GF D L L++ +
Sbjct: 225 PARSSTYANVSCAAPACSDLNIHGCSGGHCLYGVQY-GDGSYSIGFFAMDTLTLSSYDAV 283
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--G 274
FGCG G F + A GL GLG KTS+P ++ F+ C
Sbjct: 284 KG-----FRFGCGERNEGLFGEAA---GLLGLGRGKTSLPVQTYDK--YGGVFAHCLPAR 333
Query: 275 SDGTGRISFGDKGSPGQG---ETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA----- 325
S GTG + FG TP L PT Y + +T + VGG ++ S
Sbjct: 334 STGTGYLDFGAGSLAAASARLTTPM-LTDNGPTFYYVGMTGIRVGGQLLSIPQSVFATAG 392
Query: 326 -IFDSGTSFTYLNDPAYTQISETFNSLAKE---KRETSTSDLPFEYCYVLSPNQTNFEYP 381
I DSGT T L AY+ + F + K+ + S L + CY + + P
Sbjct: 393 TIVDSGTVITRLPPAAYSSLRYAFAAAMAARGYKKAPAVSLL--DTCYDFT-GMSQVAIP 449
Query: 382 VVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYC--------LGVVKSDNVNIIGREYPIAN 433
V+L +GG V+ ++ ++ + L +G+V + + G Y I
Sbjct: 450 TVSLLFQGGARLDVDASGIMYAASASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGK 509
Query: 434 NISLFH 439
+ F+
Sbjct: 510 KVVGFY 515
>gi|356557014|ref|XP_003546813.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 435
Score = 77.4 bits (189), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 89/332 (26%), Positives = 136/332 (40%), Gaps = 41/332 (12%)
Query: 111 VGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCN 169
+G P + + +DTGS L WL C C +C + ++ P SST C+
Sbjct: 95 IGSPPVERLAMVDTGSSLIWLQCSPCHNCFPQ---------ETPLFEPLKSSTYKYATCD 145
Query: 170 STLCEL----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL-ATDEKQSKSVDSRI 224
S C L Q+ C G C Y + Y D + S G L + L +T Q+ S + I
Sbjct: 146 SQPCTLLQPSQRDCGKLG-QCIYGIMY-GDKSFSVGILGTETLSFGSTGGAQTVSFPNTI 203
Query: 225 SFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDGTGRI 281
FGCG + G+ GLG S+ S L Q I + FS C + S T ++
Sbjct: 204 -FGCGVDNNFTIYTSNKVMGIAGLGAGPLSLVSQLGAQ--IGHKFSYCLLPYDSTSTSKL 260
Query: 282 SFGDKG---SPGQGETPFSLRQTHPTYN-ITITQVSVGGNAV---NFEFSAIFDSGTSFT 334
FG + + G TP ++ + PTY + + V++G V + + + DSGT T
Sbjct: 261 KFGSEAIITTNGVVSTPLIIKPSLPTYYFLNLEAVTIGQKVVSTGQTDGNIVIDSGTPLT 320
Query: 335 YLNDPAYTQISETFNSLAKEKRETSTSDL--PFEYCYVLSPNQTNFEYPVVNLTMKGGGP 392
YL + Y + K DL P + C+ PN+ N P + G
Sbjct: 321 YLENTFYNNFVASLQETLGVKL---LQDLPSPLKTCF---PNRANLAIPDIAFQFTGASV 374
Query: 393 FFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNI 424
++I ++ + CL VV S + I
Sbjct: 375 ALRPKNVLIPLTDSN---ILCLAVVPSSGIGI 403
>gi|449449334|ref|XP_004142420.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 441
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 84/299 (28%), Positives = 133/299 (44%), Gaps = 49/299 (16%)
Query: 111 VGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCN 169
+G PA + ++ALDT +D W+PC C+ C ++S + SS+ +PC
Sbjct: 109 IGTPAQTLLLALDTSNDAAWIPCSGCIGCPST-----------TVFSSDKSSSFRPLPCQ 157
Query: 170 STLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCG 229
S C +GS C + + Y S + LV+D L LATD S +FGC
Sbjct: 158 SPQCNQVPNPSCSGSACGFNLTYGSSTVAAD--LVQDNLTLATDSVPS------YTFGCI 209
Query: 230 RVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS----DGTGRISFGD 285
R TGS + LG+ + + + +Q L ++FS C S + +G + G
Sbjct: 210 RKATGSSVPPQG-----LLGLGRGPLSLLGQSQSLYQSTFSYCLPSFKSVNFSGSLRLGP 264
Query: 286 KGSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEFSA-----------IFDSGTS 332
P + + LR + Y + + + VG V+ SA + DSGT+
Sbjct: 265 VAQPIRIKYTPLLRNPRRSSLYYVNLISIRVGRKIVDIPPSALAFNSATGAGTVIDSGTT 324
Query: 333 FTYLNDPAYTQISETFNSLAKEKRETSTSDL-PFEYCY---VLSPNQTNFEYPVVNLTM 387
FT L PAYT + + F + R + S L F+ CY ++SP T F + +N+T+
Sbjct: 325 FTRLVAPAYTAVRDEFRR--RVGRNVTVSSLGGFDTCYTVPIISPTIT-FMFAGMNVTL 380
>gi|293329689|dbj|BAJ04354.1| pollen allergen CPA63 [Cryptomeria japonica]
Length = 472
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 89/295 (30%), Positives = 131/295 (44%), Gaps = 46/295 (15%)
Query: 112 GQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNS 170
G P SF LDTGS++ W+PC+ C C SS Q + P+ SST + + C S
Sbjct: 131 GTPPQSFYTVLDTGSNIAWIPCNPCSGC------SSKQ----QPFEPSKSSTYNYLTCAS 180
Query: 171 TLCELQKQCPSAGS--NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGC 228
C+L + C + + NC RY G S V+++L T S+ V++ + FGC
Sbjct: 181 QQCQLLRVCTKSDNSVNCSLTQRY---GDQSE---VDEILSSETLSVGSQQVENFV-FGC 233
Query: 229 GRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC----FGSDGTGRISFG 284
G L P+ L G G + S S A L ++FS C F S TG + G
Sbjct: 234 SNAARG--LIQRTPS-LVGFGRNPLSFVSQTAT--LYDSTFSYCLPSLFSSAFTGSLLLG 288
Query: 285 DKGSPGQGE--TPFSLRQTHPT-YNITITQVSVGGNAVNFEF-----------SAIFDSG 330
+ QG TP +P+ Y + + +SVG V+ I DSG
Sbjct: 289 KEALSAQGLKFTPLLSNSRYPSFYYVGLNGISVGEELVSIPAGTLSLDESTGRGTIIDSG 348
Query: 331 TSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNL 385
T T L +PAY + ++F S S +DL F+ CY + + E+P++ L
Sbjct: 349 TVITRLVEPAYNAMRDSFRSQLSNLTMASPTDL-FDTCY--NRPSGDVEFPLITL 400
>gi|356509401|ref|XP_003523438.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 407
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 94/355 (26%), Positives = 148/355 (41%), Gaps = 52/355 (14%)
Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPN 158
LG+ + N+++G P ++ + +DTGSDL W+ CD C C + Y P+
Sbjct: 45 LGY-YSVNLAIGNPPKAYELDIDTGSDLTWVQCDAPCKGCTLPRDRQ---------YKPH 94
Query: 159 TSSTSSKVPCNSTLCELQKQCPSA-----GSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
+ V C LC + P+ C Y+V Y G+ S G LV D++ L
Sbjct: 95 ----GNLVKCVDPLCAAIQSAPNPPCVNPNEQCDYEVEYADQGS-SLGVLVRDIIPLKL- 148
Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAP----NGLFGLGMDKTSVPSILANQGLIPNSF 269
S ++FGCG QT G P G+ GLG + S+ S L ++GLI N
Sbjct: 149 -TNGTLTHSMLAFGCGYDQTHV---GHNPPPSAAGVLGLGNGRASILSQLNSKGLIRNVV 204
Query: 270 SMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGGNAVNFE-FS 324
C G G + FGD+ P G + Q+ + Y + G A + +
Sbjct: 205 GHCLSGTGGGFLFFGDQLIPQSGVVWTPILQSSSSLLKHYKTGPADMFFNGKATSVKGLE 264
Query: 325 AIFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEYCYV-------LSPNQT 376
FDSG+S+TY N A+ + + N + + +T D C+ L +
Sbjct: 265 LTFDSGSSYTYFNSLAHKALVDLITNDIKGKPLSRATEDPSLPICWKGPKPFKSLHDVTS 324
Query: 377 NFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD-----NVNIIG 426
NF+ V++ T F V ++ ++ + CLG++ N NIIG
Sbjct: 325 NFKPLVLSFTKSKNSLFQVPPEAYLIVTKHGNV---CLGILDGTEIGLGNTNIIG 376
>gi|125529158|gb|EAY77272.1| hypothetical protein OsI_05246 [Oryza sativa Indica Group]
Length = 426
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 87/340 (25%), Positives = 129/340 (37%), Gaps = 51/340 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ +G PA + +VA+D +D W+PC C C S +SP SST
Sbjct: 83 YIARAGLGTPAQTLLVAIDPSNDAAWVPCSACAGCAASSPS----------FSPTQSSTY 132
Query: 164 SKVPCNSTLCEL--QKQCPS-AGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
VPC S C CP+ GS+C + + Y + + L +D L L + V
Sbjct: 133 RTVPCGSPQCAQVPSPSCPAGVGSSCGFNLTYAASTFQA--VLGQDSLALENN------V 184
Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GSD 276
+FGC RV +G + P GL G G S + + + FS C S+
Sbjct: 185 VVSYTFGCLRVVSG---NSVPPQGLIGFGRGPLSF--LSQTKDTYGSVFSYCLPNYRSSN 239
Query: 277 GTGRISFGDKGSPGQGETPFSLRQTH-PT-YNITITQVSVGGNAVNFEFSA--------- 325
+G + G G P + +T L H P+ Y + + + VG V SA
Sbjct: 240 FSGTLKLGPIGQPKRIKTTPLLYNPHRPSLYYVNMIGIRVGSKVVQVPQSALAFNPVTGS 299
Query: 326 --IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
I D+GT FT L P Y + + F + F+ CY P V
Sbjct: 300 GTIIDAGTMFTRLAAPVYAAVRDAFRGRVRTPVAPPLGG--FDTCY-----NVTVSVPTV 352
Query: 384 NLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVN 423
G + + V++ S G+ + SD VN
Sbjct: 353 TFMFAGAVAVTLPEENVMIHSSSGGVACLAMAAGPSDGVN 392
>gi|317106730|dbj|BAJ53226.1| JHL06P13.6 [Jatropha curcas]
Length = 445
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 83/336 (24%), Positives = 138/336 (41%), Gaps = 52/336 (15%)
Query: 79 AQGNDKTPLTFSAGND-TYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CV 136
++ N TP + SA Y + G ++ +S+G P + +V DTGSDL W+ C C
Sbjct: 67 SRANRFTPNSVSAAKTLEYDIIPGGGEYFMRISIGTPPIEVLVIADTGSDLIWVQCQPCQ 126
Query: 137 SCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCEL----QKQCPSAG--SNCPYQV 190
C + I++P SST +V C + C + C + G C Y
Sbjct: 127 ECYKQKSP---------IFNPKQSSTYRRVLCETRYCNALNSDMRACSAHGFFKACGYSY 177
Query: 191 RYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGM 250
Y D + + G+L + + + + ++FGCG G+F + + G+
Sbjct: 178 SY-GDHSFTMGYLATERFIIGSTNNSIQ----ELAFGCGNSNGGNFDEVGS-----GIVG 227
Query: 251 DKTSVPSILANQGL-IPNSFSMCF------GSDGTGRISFGDK----GSPGQGETPFSLR 299
S+++ G I N FS C + G+I FGD GS TP +
Sbjct: 228 LGGGSLSLISQLGTKIDNKFSYCLVPILEKSNFSLGKIVFGDNSFISGSDTYVSTPLVSK 287
Query: 300 QTHPTYNITITQVSVGGNAVNFEFS----------AIFDSGTSFTYLNDPAYTQISETFN 349
+ Y +T+ +SVG + +E S I DSGT+ T+L+ Y ++ E
Sbjct: 288 EPETFYYLTLEAISVGNERLAYENSRNDGNVEKGNIIIDSGTTLTFLDSKLYNKL-ELVL 346
Query: 350 SLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNL 385
A E S + F C+ ++ E P++ +
Sbjct: 347 EKAVEGERVSDPNGIFSICF---RDKIGIELPIITV 379
>gi|125558629|gb|EAZ04165.1| hypothetical protein OsI_26307 [Oryza sativa Indica Group]
Length = 455
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 97/359 (27%), Positives = 151/359 (42%), Gaps = 65/359 (18%)
Query: 108 NVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
N+S+G P L F V +DTGS+L W C C C + + P SST S++
Sbjct: 94 NISLGTPPLDFPVIVDTGSNLIWAQCAPCTRCFP-------RPTPAPVLQPARSSTFSRL 146
Query: 167 PCNSTLCEL-----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
PCN + C+ + + +A + C Y Y S T G+L + L +
Sbjct: 147 PCNGSFCQYLPTSSRPRTCNATAACAYNYTYGSGYT--AGYLATETLTVG------DGTF 198
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD----G 277
+++FGC T + +D ++ G+ GLG S+ S LA FS C SD G
Sbjct: 199 PKVAFGC---STENGVDNSS--GIVGLGRGPLSLVSQLAV-----GRFSYCLRSDMADGG 248
Query: 278 TGRISFGDKGSPGQG---------ETPFSLRQTHPTYNIT-----ITQVSVGGNAVNFEF 323
I FG + + P+ R TH N+T T++ V G+ F
Sbjct: 249 ASPILFGSLAKLTERSVVQSTPLLKNPYLQRSTHYYVNLTGIAVDSTELPVTGSTFGFTQ 308
Query: 324 SA-----IFDSGTSFTYLNDPAYTQISETFNS-LAKEKRETSTSDLPF--EYCYVLSPNQ 375
+ I DSGT+ TYL Y + + F S +A + T S P+ + CY S
Sbjct: 309 TGLGGGTIVDSGTTLTYLAKDGYAMVKQAFQSQMANLNQTTPASGAPYDLDLCYKPSAGG 368
Query: 376 TNFEYPVVNLTMKGGGPFFVNDPI----VIVSSEPKG-LYLYCLGVVKSDN---VNIIG 426
V L ++ G N P+ V ++ +G + + CL V+ + + ++IIG
Sbjct: 369 GGKAVRVPRLALRFAGGAKYNVPVQNYFAGVEADSQGRVTVACLLVLPATDDLPISIIG 427
>gi|88174561|gb|ABD39355.1| chloroplast nucleoid DNA-binding protein [Oryza longistaminata]
Length = 321
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 84/344 (24%), Positives = 142/344 (41%), Gaps = 45/344 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+ +V +G P+ + I+ +DTGS W+ C+C C H + + + S+T +
Sbjct: 1 YVISVGLGTPSKTQILEIDTGSSTSWVFCECDGC-H---------TNPRTFLQSRSTTCA 50
Query: 165 KVPCNSTLCELQKQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
KV C +++C L P +CP++V Y DG+ S G L +D L + +K
Sbjct: 51 KVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSY-QDGSASYGILYQDTLTFSDVQKIPS- 108
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC------- 272
SFGC G+ G +GL G+G SV L + FS C
Sbjct: 109 ----FSFGCNMDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPTFDGFSYCLPLQMSE 160
Query: 273 --FGSDGTGRISFGDKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFS---- 324
F S TG S G + + ++ + + +T +SV G + S
Sbjct: 161 RGFFSKTTGYFSLGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSIFSR 220
Query: 325 --AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
+FDSG+ +Y+ D A + +S+ L R + + CY + + P
Sbjct: 221 KGVVFDSGSELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRSVDEG-DMPA 277
Query: 383 VNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
++L G F + V V + ++CL +++V+IIG
Sbjct: 278 ISLHFDDGARFDLGSHGVFVERSVQEQDVWCLAFAPTESVSIIG 321
>gi|88174587|gb|ABD39368.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
Group]
Length = 321
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 84/344 (24%), Positives = 141/344 (40%), Gaps = 45/344 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+ +V +G P+ + IV +DTGS W+ C+C C H + + + S+T +
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSASWVFCECDGC-H---------TNPRTFLQSRSTTCA 50
Query: 165 KVPCNSTLCELQKQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
KV C +++C L P +CP++V Y DG+ S G L +D L + +K
Sbjct: 51 KVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSY-QDGSASYGILYQDTLTFSDVQKIPS- 108
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC------- 272
+FGC G+ G +GL G+G SV L + FS C
Sbjct: 109 ----FTFGCNLDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPTFDGFSYCLPLQKSE 160
Query: 273 --FGSDGTGRISFGDKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFS---- 324
F S TG S G + + R+ + + + +SV G + S
Sbjct: 161 RGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSR 220
Query: 325 --AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
+FDSG+ +Y+ D A + +S+ L R + + CY + + P
Sbjct: 221 KGVVFDSGSELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRSVDEG-DMPA 277
Query: 383 VNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
++L G F + V V + ++CL +++V+IIG
Sbjct: 278 ISLHFDDGARFDLGSHGVFVERSVQEQDVWCLAFAPTESVSIIG 321
>gi|302763741|ref|XP_002965292.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
gi|300167525|gb|EFJ34130.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
Length = 423
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 98/346 (28%), Positives = 153/346 (44%), Gaps = 53/346 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ ++ VG P + + DTGSD+ WL C C SC GQ +++P+ SST
Sbjct: 81 YFVSLGVGTPPRTVNMVADTGSDVLWLQCLPCQSCY-------GQTDP--LFNPSFSSTF 131
Query: 164 SKVPCNSTLCE--LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
+ C S+LC+ L + C + C YQV Y DG+ + G + L ++ S
Sbjct: 132 QSITCGSSLCQQLLIRGCRR--NQCLYQVSY-GDGSFTVGEFSTETLSFGSNAVNS---- 184
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGT 278
++ GCG G F A L GLG S PS + L + FS C S G+
Sbjct: 185 --VAIGCGHNNQGLFTGAAG---LLGLGKGLLSFPSQVGQ--LYGSVFSYCLPTRESTGS 237
Query: 279 GRISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGGNAVNFEFSA--------- 325
+ FG++ + F+ T+P Y + + + VGG +VN +
Sbjct: 238 VPLIFGNQAVASNAQ--FTTLLTNPKLDTFYYVEMVGIKVGGTSVNIPAGSLSLDSSTGN 295
Query: 326 ---IFDSGTSFTYLNDPAYTQISETFNS-LAKEKRETSTSDLPFEYCYVLSPNQTNFEYP 381
I DSGT+ T L AY + + F + + + + TS L F+ CY LS +++ P
Sbjct: 296 GGVILDSGTAVTRLVTSAYNPMRDAFRAGMPSDAKMTSGFSL-FDTCYDLS-GRSSIMLP 353
Query: 382 VVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVV-KSDNVNIIG 426
V+ GG + ++V + G YCL S+N +IIG
Sbjct: 354 AVSFVFNGGATMALPAQNIMVPVDNSG--TYCLAFAPNSENFSIIG 397
>gi|449499012|ref|XP_004160696.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 435
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 78/318 (24%), Positives = 138/318 (43%), Gaps = 39/318 (12%)
Query: 109 VSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
+SVG P I DTGSD+ W C C +C D +++P+ S+T KV
Sbjct: 89 LSVGTPPFPIIAVADTGSDIIWTQCVPCTNCYQQ---------DLPMFNPSKSTTYRKVS 139
Query: 168 CNSTLCEL--QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
C+S +C + S +C Y + Y D + S G D L + + + + R +
Sbjct: 140 CSSPVCSFTGEDNSCSFKPDCTYSISY-GDNSHSQGDFAVDTLTMGSTSGRVVAF-PRTA 197
Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSD--GTGR 280
GCG GSF A +G+ GLG+ S+ + + + FS C G+D G+ +
Sbjct: 198 IGCGHDNAGSF--DANVSGIVGLGLGPASLIKQMGSA--VGGKFSYCLTPIGNDDGGSNK 253
Query: 281 ISFGDKGS---PGQGETPFSLRQTHPT-YNITITQVSVGGNAVNF---------EFSAIF 327
++FG + G TP + + Y++ + VSVG N + + + I
Sbjct: 254 LNFGSNANVSGSGAVSTPIYISDKFKSFYSLKLKAVSVGRNNTFYSTANSILGGKANIII 313
Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTM 387
DSGT+ T L Y ++ ++ +R + EYC+ + + +++ P + +
Sbjct: 314 DSGTTLTLLPVDLYHNFAKAISNSINLQRTDDPNQF-LEYCFETTTD--DYKVPFIAMHF 370
Query: 388 KGGGPFFVNDPIVIVSSE 405
+G + ++I S+
Sbjct: 371 EGANLRLQRENVLIRVSD 388
>gi|147786881|emb|CAN62311.1| hypothetical protein VITISV_008929 [Vitis vinifera]
Length = 408
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 86/337 (25%), Positives = 140/337 (41%), Gaps = 43/337 (12%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
N SVG+P + +V +DTGSDL W+ C C C I+ P+ SST
Sbjct: 61 VNFSVGRPPVPQLVGIDTGSDLLWVQCRPCADCFRQSTP---------IFDPSKSSTYVD 111
Query: 166 VPCNSTLCELQKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRI 224
+ +S +C Q N C Y Y +DG+ S+G L + + T ++ + +V S +
Sbjct: 112 LSYDSPICPNSPQKKYNHLNQCIYNASY-ADGSTSSGNLATEDIVFETSDQGTVTVSS-V 169
Query: 225 SFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-----DGTG 279
FGCG G F DG +G+ GL S+ S L ++ FS C G
Sbjct: 170 VFGCGHSNRGRF-DGQQ-SGILGLSAGDQSIVSRLGSR------FSYCIGDLFDPHYTHN 221
Query: 280 RISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-----------EFSAIFD 328
++ GD TPF + Y +T+ +SVG ++ + + D
Sbjct: 222 QLVLGDGVKMEGSSTPF--HTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMD 279
Query: 329 SGTSFTYLNDPAYTQISETFNSLAKEK-RETSTSDLPFEYCYVLSPNQTNFEYPVVNLTM 387
SGT+ T+L + +S L + ++ +P CY N+ +P +
Sbjct: 280 SGTTATFLAKDGFDPLSNEIQRLVRGHFQQVIYRTIPGWLCYKGRVNEDLRGFPELAFHF 339
Query: 388 KGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNI 424
G ++ + V K ++CL V++S+ NI
Sbjct: 340 AEGADLVLDANSLFVQ---KNQDVFCLAVLESNLKNI 373
>gi|297834758|ref|XP_002885261.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297331101|gb|EFH61520.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 500
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 88/356 (24%), Positives = 143/356 (40%), Gaps = 50/356 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+++ + VG PA + LDTGSD+ W+ C+ C C + +++P +SST
Sbjct: 162 YFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCSDCYQQSDP---------VFNPTSSSTY 212
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
+ C++ C L + + C YQV Y DG+ + G L D + K +
Sbjct: 213 KSLTCSAPQCSLLETSACRSNKCLYQVSY-GDGSFTVGELATDTVTFGNSGKIND----- 266
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISF 283
++ GCG G F A + + NQ + SFS C +G+ S
Sbjct: 267 VALGCGHDNEGLFTGAAG-------LLGLGGGALSITNQ-MKATSFSYCLVDRDSGKSSS 318
Query: 284 GDKGS----PGQGETPFSLRQTHPT-YNITITQVSVGGNAVN-----FEFSA------IF 327
D S G P Q T Y + ++ SVGG V F+ A I
Sbjct: 319 LDFNSVQLGSGDATAPLLRNQKIDTFYYVGLSGFSVGGQKVMMPDAIFDVDASGSGGVIL 378
Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTM 387
D GT+ T L AY + + F L ++ ++S F+ CY S + ++ + P V
Sbjct: 379 DCGTAVTRLQTQAYNSLRDAFLKLTTNLKKGTSSISLFDTCYDFS-SLSSVKVPTVAFHF 437
Query: 388 KGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGR--------EYPIANNI 435
GG + ++ + G + + S +++IIG Y +AN I
Sbjct: 438 TGGKSLDLPAKNYLIPVDDNGTFCFAFAPTSS-SLSIIGNVQQQGTRITYDLANKI 492
>gi|413953772|gb|AFW86421.1| hypothetical protein ZEAMMB73_098827 [Zea mays]
Length = 482
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 103/410 (25%), Positives = 168/410 (40%), Gaps = 61/410 (14%)
Query: 59 YYSALAHRDRYFRLRG--RGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPAL 116
+Y+ + RD + R+R R L G+ + S G + L + + +G PA
Sbjct: 84 HYTGILRRD-HNRVRSIHRRLTGAGDTAATIPASLGLAFHSLE-----YVVTIGIGTPAR 137
Query: 117 SFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCEL 175
+F V DTGSDL W+ C C + ++ P+ SST VPC + C++
Sbjct: 138 NFTVLFDTGSDLTWVQCKPCTDSCYQQQEP--------LFDPSKSSTYVDVPCGTPQCKI 189
Query: 176 --QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQT 233
+ G+ C Y V+Y D +++ G L ++ L+ + V FGC +
Sbjct: 190 GGGQDLTCGGTTCEYSVKY-GDQSVTRGNLAQEAFTLSPSAPPAAGV----VFGCSH-EY 243
Query: 234 GSFLDGAAPN----GLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT--GRISFGDKG 287
S + GA GL GLG +S+ S +G + FS C G+ G ++ G
Sbjct: 244 SSGVKGAEEEMSVAGLLGLGRGDSSILS-QTRRGNSGDVFSYCLPPRGSSAGYLTIG-AA 301
Query: 288 SPGQGETPFSL-----RQTHPTYNITITQVSVGGNAVNFEFSA-----IFDSGTSFTYLN 337
+P Q F+ Q Y + + +SV G A+ + SA + DSGT T++
Sbjct: 302 APPQSNLSFTPLVTDNSQLSSVYVVNLVGISVSGAALPIDASAFYIGTVIDSGTVITHMP 361
Query: 338 DPAYTQISETFNSLAKEKRETSTSDLP------FEYCYVLSPNQTNFEYPVVNLTMKGGG 391
AY + + F + + LP + CY ++ + P V L GG
Sbjct: 362 AAAYYVLRDEF-----RRHMGGYTMLPEGHVESLDTCYDVTGHDV-VTAPPVALEFGGGA 415
Query: 392 PFFVNDPIVI----VSSEPKGLYLYCLGVVKSD--NVNIIGREYPIANNI 435
V+ ++ V + + L L CL V ++ IIG A N+
Sbjct: 416 RIDVDASGILLVFAVDASGQSLTLACLAFVPTNLPGFVIIGNMQQRAYNV 465
>gi|296090291|emb|CBI40110.3| unnamed protein product [Vitis vinifera]
Length = 408
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 86/337 (25%), Positives = 140/337 (41%), Gaps = 43/337 (12%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
N SVG+P + +V +DTGSDL W+ C C C I+ P+ SST
Sbjct: 61 VNFSVGRPPVPQLVGIDTGSDLLWVQCRPCADCFRQSTP---------IFDPSKSSTYVD 111
Query: 166 VPCNSTLCELQKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRI 224
+ +S +C Q N C Y Y +DG+ S+G L + + T ++ + +V S +
Sbjct: 112 LSYDSPICPNSPQKKYNHLNQCIYNASY-ADGSTSSGNLATEDIVFETSDQGTVTVSS-V 169
Query: 225 SFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-----DGTG 279
FGCG G F DG +G+ GL S+ S L ++ FS C G
Sbjct: 170 VFGCGHSNRGRF-DGQQ-SGILGLSAGDQSIVSRLGSR------FSYCIGDLFDPHYTHN 221
Query: 280 RISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-----------EFSAIFD 328
++ GD TPF + Y +T+ +SVG ++ + + D
Sbjct: 222 QLVLGDGVKMEGSSTPF--HTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMD 279
Query: 329 SGTSFTYLNDPAYTQISETFNSLAKEK-RETSTSDLPFEYCYVLSPNQTNFEYPVVNLTM 387
SGT+ T+L + +S L + ++ +P CY N+ +P +
Sbjct: 280 SGTTATFLAKDGFDPLSNEIQRLVRGHFQQVIYRTIPGWLCYKGRVNEDLRGFPELAFHF 339
Query: 388 KGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNI 424
G ++ + V K ++CL V++S+ NI
Sbjct: 340 AEGADLVLDANSLFVQ---KNQDVFCLAVLESNLKNI 373
>gi|414886964|tpg|DAA62978.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 452
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 93/349 (26%), Positives = 148/349 (42%), Gaps = 51/349 (14%)
Query: 109 VSVGQPALSFIVALDTGSDLFWLPC-DCVS-CVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
+++G P L + DTGSDL W C C S C +Y+P++S+T + +
Sbjct: 96 LAIGTPPLPYQAIADTGSDLIWTQCAPCTSQCFRQPTP---------LYNPSSSTTFAVL 146
Query: 167 PCNSTLC------ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
PCNS+L P G C Y V Y S T + F + + V
Sbjct: 147 PCNSSLSVCAAALAGTGTAPPPGCACTYNVTYGSGWT--SVFQGSETFTFGSTPAGHARV 204
Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GSD 276
I+FGC +G + ++ +GL GLG + S L +Q +P FS C ++
Sbjct: 205 PG-IAFGCSTASSG--FNASSASGLVGLGRGRLS----LVSQLGVPK-FSYCLTPYQDTN 256
Query: 277 GTGRISFGDK----GSPGQGETPF-SLRQTHPT---YNITITQVSVGGNAVN-----FEF 323
T + G G+ G TPF + T P Y + +T +S+G A++ F
Sbjct: 257 STSTLLLGPSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALSIPPDAFSL 316
Query: 324 SA------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTN 377
+A I DSGT+ T L + AY Q+ SL ++D + C++L P+ T+
Sbjct: 317 NADGTGGLIIDSGTTITLLGNTAYQQVRAAVVSLVTLPTTDGSADTGLDLCFML-PSSTS 375
Query: 378 FEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
+ ++T+ G V + S+ GL+ + VNI+G
Sbjct: 376 APPAMPSMTLHFNGADMVLPADSYMMSDDSGLWCLAMQNQTDGEVNILG 424
>gi|357143854|ref|XP_003573079.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 417
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 93/331 (28%), Positives = 138/331 (41%), Gaps = 47/331 (14%)
Query: 59 YYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGND--TYRLNSLGFLHYTNVSVGQPAL 116
Y AL H D L + ++ L +G D + RL+S+ + +++G P +
Sbjct: 18 YRLALTHVDSKIGFTKTELMRRAAHRSRLQALSGYDANSPRLHSVQVEYLMELAIGTPPV 77
Query: 117 SFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC-- 173
F+ DTGSDL W C C C D +Y P+ SST S VPC+S C
Sbjct: 78 PFVALADTGSDLTWTQCQPCKLCFPQ---------DTPVYDPSASSTFSPVPCSSATCLP 128
Query: 174 -ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD-EKQSKSVDSRISFGCGRV 231
+ C + S C Y Y SDG S G L + L + + Q+ SV S ++FGCG
Sbjct: 129 TWRSRNCSNPSSPCRYIYSY-SDGAYSVGILGTETLTIGSSVPGQTVSVGS-VAFGCGTD 186
Query: 232 QTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC----FGSDGTGRISFGDKG 287
G L+ G GLG S+LA G+ FS C F S G
Sbjct: 187 NGGDSLNS---TGTVGLGRGTL---SLLAQLGV--GKFSYCLTDFFNSTMDSPFFLGTLA 238
Query: 288 --SPGQG---ETPFSLRQTHPT-YNITITQVSVGGNAV-----NFEFSA------IFDSG 330
+PG G TP +P+ Y + + +S+G + F+ A + DSG
Sbjct: 239 ELAPGPGTVQSTPLLQSPLNPSRYFVNLQGISLGDVRLPIPNGTFDLRADGNGGMMVDSG 298
Query: 331 TSFTYLNDPAYTQISETFNSLAKEKRETSTS 361
T+FT L + ++ + L + ++S
Sbjct: 299 TTFTILAKSGFREVVDRVAQLLGQPPVNASS 329
>gi|325183199|emb|CCA17657.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 873
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 72/264 (27%), Positives = 115/264 (43%), Gaps = 33/264 (12%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
HY + +G P V LDTGS L PCD CV C G D P +T
Sbjct: 46 HYAELYIGIPPQRASVILDTGSGLTAFPCDKCVDC--------GTHTD-----PKFDATK 92
Query: 164 SKVPCNSTLCELQKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLA-TDEKQSKSVD 221
S N C+ ++ C + N C RY S+G+M +++D++ + D +++ +
Sbjct: 93 S-TSINFVQCKYEEGCDTCRDNLCVIHQRY-SEGSMWEAVVMQDLIWVGNVDSDRAEMIM 150
Query: 222 S----RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLI-PNSFSMCFGSD 276
R FGC +TG F+ NG+ GLG+ + ++ + + + + F++CFG
Sbjct: 151 RRYGIRFKFGCQTRETGLFI-TQVENGIMGLGIGRNNIATEMYKAKRVEEHKFALCFGQK 209
Query: 277 GTGRISFGDKGSPGQGETPFSLRQTHPTYN--ITITQVSVGGNAVNFEFS-------AIF 327
G + G S + ++ H T N I + V +GG ++ + AI
Sbjct: 210 GGSFVIGGVDYSHHTTKIAYTPLAKHGTSNYPIEVKDVRIGGISLQVDAEHFKSGRGAIV 269
Query: 328 DSGTSFTYLNDPAYTQISETFNSL 351
DSGT+ TY A T E F +
Sbjct: 270 DSGTTDTYFPSAAATPFQEAFKRI 293
>gi|225446261|ref|XP_002265547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 440
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 86/337 (25%), Positives = 140/337 (41%), Gaps = 43/337 (12%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
N SVG+P + +V +DTGSDL W+ C C C I+ P+ SST
Sbjct: 93 VNFSVGRPPVPQLVGIDTGSDLLWVQCRPCADCFRQSTP---------IFDPSKSSTYVD 143
Query: 166 VPCNSTLCELQKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRI 224
+ +S +C Q N C Y Y +DG+ S+G L + + T ++ + +V S +
Sbjct: 144 LSYDSPICPNSPQKKYNHLNQCIYNASY-ADGSTSSGNLATEDIVFETSDQGTVTVSS-V 201
Query: 225 SFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-----DGTG 279
FGCG G F DG +G+ GL S+ S L ++ FS C G
Sbjct: 202 VFGCGHSNRGRF-DGQQ-SGILGLSAGDQSIVSRLGSR------FSYCIGDLFDPHYTHN 253
Query: 280 RISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-----------EFSAIFD 328
++ GD TPF + Y +T+ +SVG ++ + + D
Sbjct: 254 QLVLGDGVKMEGSSTPF--HTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMD 311
Query: 329 SGTSFTYLNDPAYTQISETFNSLAKEK-RETSTSDLPFEYCYVLSPNQTNFEYPVVNLTM 387
SGT+ T+L + +S L + ++ +P CY N+ +P +
Sbjct: 312 SGTTATFLAKDGFDPLSNEIQRLVRGHFQQVIYRTIPGWLCYKGRVNEDLRGFPELAFHF 371
Query: 388 KGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNI 424
G ++ + V K ++CL V++S+ NI
Sbjct: 372 AEGADLVLDANSLFVQ---KNQDVFCLAVLESNLKNI 405
>gi|357153697|ref|XP_003576537.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 474
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 97/355 (27%), Positives = 146/355 (41%), Gaps = 52/355 (14%)
Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNT 159
L L+Y +VG A V +DT S+L W+ C C SC + ++ P++
Sbjct: 115 LRTLNYV-ATVGLGAAEATVVVDTASELTWVQCQPCESCHDQQDP---------LFDPSS 164
Query: 160 SSTSSKVPCNSTLCELQKQCPSAGSN-----------CPYQVRYLSDGTMSTGFLVEDVL 208
S + + VPCNS+ C+ + +AG++ C Y + Y DG+ S G L D L
Sbjct: 165 SPSYAAVPCNSSSCDALRVAMAAGTSPCADDNEQQPACSYALSY-RDGSYSRGVLARDKL 223
Query: 209 HLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNS 268
LA + + FGCG G+ G +GL GLG S+ S +Q
Sbjct: 224 RLAGQDIEG------FVFGCGTSNQGAPFGGT--SGLMGLGRSHVSLVSQTMDQ--FGGV 273
Query: 269 FSMCF---GSDGTGRISFGDKGSPGQGETPF--------SLRQTHPTYNITITQVSVGGN 317
FS C S +G + GD S + TP S P Y + +T ++VGG
Sbjct: 274 FSYCLPMRESGSSGSLVLGDDSSAYRNSTPIVYTAMVSDSGPLQGPFYFLNLTGITVGGQ 333
Query: 318 AVNFE-FSA---IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSP 373
V FSA I DSGT T L Y + F S E + + + C+ L+
Sbjct: 334 EVESPWFSAGRVIIDSGTIITTLVPSVYNAVRAEFLSQLAEYPQAPAFSI-LDTCFNLT- 391
Query: 374 NQTNFEYPVVNLTMKGGGPFFVNDPIVI--VSSEPKGLYLYCLGVVKSDNVNIIG 426
+ P + +G V+ V+ VSS+ + L + + +IIG
Sbjct: 392 GLKEVQVPSLKFVFEGSVEVEVDSKGVLYFVSSDASQVCLALASLKSEYDTSIIG 446
>gi|116786826|gb|ABK24255.1| unknown [Picea sitchensis]
gi|148906052|gb|ABR16185.1| unknown [Picea sitchensis]
Length = 485
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 91/341 (26%), Positives = 144/341 (42%), Gaps = 44/341 (12%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+++ + VG P ++ LDTGSD+ W+ C+ C C + IY+P SS+
Sbjct: 145 YFSRIGVGAPRRDQLMVLDTGSDVTWIQCEPCSDCYQQSDP---------IYNPALSSSY 195
Query: 164 SKVPCNSTLC-ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
V C + LC +L S +C YQV Y DG+ + G + L L Q+
Sbjct: 196 KLVGCQANLCQQLDVSGCSRNGSCLYQVSY-GDGSYTQGNFATETLTLGGAPLQN----- 249
Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQ-GLIPNSFSMCF---GSDGT 278
++ GCG G F+ A GL S PS L ++ G I FS C S+ +
Sbjct: 250 -VAIGCGHDNEGLFVGAAGLLGLG---GGSLSFPSQLTDENGKI---FSYCLVDRDSESS 302
Query: 279 GRISFGDKGSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEFS-----------A 325
+ FG P L+ + Y ++++ +SVGG ++ S
Sbjct: 303 STLQFGRAAVPNGAVLAPMLKNSRLDTFYYVSLSGISVGGKMLSISDSVFGIDASGNGGV 362
Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNL 385
I DSGT+ T L AY + + F + K T L F+ CY LS ++ + P V
Sbjct: 363 IVDSGTAVTRLQTAAYDSLRDAFRAGTKNLPSTDGVSL-FDTCYDLSSKES-VDVPTVVF 420
Query: 386 TMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
GGG + +V + G + + S +++I+G
Sbjct: 421 HFSGGGSMSLPAKNYLVPVDSMGTFCFAFAPTSS-SLSIVG 460
>gi|356555248|ref|XP_003545946.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 104/410 (25%), Positives = 159/410 (38%), Gaps = 72/410 (17%)
Query: 40 DPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLN 99
+ VKG + D L ++ + +++ D R +G TP + R +
Sbjct: 56 EAVKGFVKRDKLRRQRMNQRWGVVSNYDS----RRKGFEMT---TTPAEVEMPMHSGRDD 108
Query: 100 SLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNT 159
+LG ++ V VG P F + +DTGS+ WL C S S + +
Sbjct: 109 ALG-EYFAEVKVGSPGQRFWLVVDTGSEFTWLNC----------SKSFEAV--------- 148
Query: 160 SSTSSKVPCNSTLCEL--QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLA-TDEKQ 216
T + C L EL CP C Y + Y +DG+ + GF D + + T+ KQ
Sbjct: 149 --TCASRKCKVDLSELFSLSVCPKPSDPCLYDISY-ADGSSAKGFFGTDSITVGLTNGKQ 205
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPN----GLFGLGMDKTSVPSILANQGLIPNSFSMC 272
K + ++ GC T S L+G N G+ GLG K S AN+ FS C
Sbjct: 206 GKL--NNLTIGC----TKSMLNGVNFNEETGGILGLGFAKDSFIDKAANK--YGAKFSYC 257
Query: 273 FGSDGTGRISFGDKGSPGQGETPF--SLRQTH-----PTYNITITQVSVGGNAV------ 319
+ R + G +R+T P Y + + +S+GG +
Sbjct: 258 LVDHLSHRSVSSNLTIGGHHNAKLLGEIRRTELILFPPFYGVNVVGISIGGQMLKIPPQV 317
Query: 320 ---NFEFSAIFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEYCYVLSPNQ 375
N E + DSGT+ T L PAY + E SL K KR T E+C+ +
Sbjct: 318 WDFNAEGGTLIDSGTTLTSLLLPAYEAVFEALTKSLTKVKRVTGEDFDALEFCF----DA 373
Query: 376 TNFE---YPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNV 422
F+ P + GG F I+ P + C+G+V D +
Sbjct: 374 EGFDDSVVPRLVFHFAGGARFEPPVKSYIIDVAP---LVKCIGIVPIDGI 420
>gi|226493786|ref|NP_001142400.1| uncharacterized protein LOC100274575 [Zea mays]
gi|194708650|gb|ACF88409.1| unknown [Zea mays]
Length = 392
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 93/350 (26%), Positives = 148/350 (42%), Gaps = 51/350 (14%)
Query: 109 VSVGQPALSFIVALDTGSDLFWLPC-DCVS-CVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
+++G P L + DTGSDL W C C S C +Y+P++S+T + +
Sbjct: 36 LAIGTPPLPYQAIADTGSDLIWTQCAPCTSQCFRQPTP---------LYNPSSSTTFAVL 86
Query: 167 PCNSTLC------ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
PCNS+L P G C Y V Y S T + F + + V
Sbjct: 87 PCNSSLSVCAAALAGTGTAPPPGCACTYNVTYGSGWT--SVFQGSETFTFGSTPAGHARV 144
Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GSD 276
I+FGC +G + ++ +GL GLG + S+ S L +P FS C ++
Sbjct: 145 PG-IAFGCSTASSG--FNASSASGLVGLGRGRLSLVSQLG----VPK-FSYCLTPYQDTN 196
Query: 277 GTGRISFGDK----GSPGQGETPF-SLRQTHPT---YNITITQVSVGGNAVN-----FEF 323
T + G G+ G TPF + T P Y + +T +S+G A++ F
Sbjct: 197 STSTLLLGPSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALSIPPDAFSL 256
Query: 324 SA------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTN 377
+A I DSGT+ T L + AY Q+ SL ++D + C++L P+ T+
Sbjct: 257 NADGTGGLIIDSGTTITLLGNTAYQQVRAAVVSLVTLPTTDGSADTGLDLCFML-PSSTS 315
Query: 378 FEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGR 427
+ ++T+ G V + S+ GL+ + VNI+G
Sbjct: 316 APPAMPSMTLHFNGADMVLPADSYMMSDDSGLWCLAMQNQTDGEVNILGN 365
>gi|88174583|gb|ABD39366.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
Group]
Length = 321
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 84/344 (24%), Positives = 141/344 (40%), Gaps = 45/344 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+ +V +G P+ + IV +DTGS W+ C+C C H + + + S+T +
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSTSWVFCECDGC-H---------TNPRTFLQSRSTTCA 50
Query: 165 KVPCNSTLCELQKQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
KV C +++C L P +CP++V Y DG+ S G L +D L + +K
Sbjct: 51 KVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSY-QDGSASYGILYQDTLTFSDVQKIPS- 108
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC------- 272
+FGC G+ G +GL G+G SV L + FS C
Sbjct: 109 ----FTFGCNLDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPRFDGFSYCLPLQKSE 160
Query: 273 --FGSDGTGRISFGDKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFS---- 324
F S TG S G + + R+ + + + +SV G + S
Sbjct: 161 RGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSR 220
Query: 325 --AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
+FDSG+ +Y+ D A + +S+ L R + + CY + + P
Sbjct: 221 KGVVFDSGSELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRSVDEG-DMPA 277
Query: 383 VNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
++L G F + V V + ++CL +++V+IIG
Sbjct: 278 ISLHFDDGARFDLGSHGVFVERSVQEQDVWCLAFAPTESVSIIG 321
>gi|255544139|ref|XP_002513132.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223548143|gb|EEF49635.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 481
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 80/305 (26%), Positives = 123/305 (40%), Gaps = 44/305 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ + VG P V +D+GSD+ W+ C C C H + ++ P S++
Sbjct: 142 YFIRIGVGSPPREQYVVIDSGSDIVWVQCQPCTQCYHQTDP---------VFDPADSASF 192
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
VPC+S++CE + C Y+V Y DG+ + G L + L ++V
Sbjct: 193 MGVPCSSSVCERIENAGCHAGGCRYEVMY-GDGSYTKGTLALETLTFG------RTVVRN 245
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQ--GLIPNSFSMCF---GSDGT 278
++ GCG G F+ A GL G M L Q G +FS C G+D
Sbjct: 246 VAIGCGHRNRGMFVGAAGLLGLGGGSMS-------LVGQLGGQTGGAFSYCLVSRGTDSA 298
Query: 279 GRISFGDKGSP-GQGETPFSLRQTHPT-YNITITQVSVGGNAVN-----FEFS------A 325
G + FG P G P P+ Y I ++ V VGG V F+ +
Sbjct: 299 GSLEFGRGAMPVGAAWIPLIRNPRAPSFYYIRLSGVGVGGMKVPISEDVFQLNEMGNGGV 358
Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNL 385
+ D+GT+ T + AY + F S + F+ CY L+ + P V+
Sbjct: 359 VMDTGTAVTRIPTVAYVAFRDAFIGQTGNLPRASGVSI-FDTCYNLN-GFVSVRVPTVSF 416
Query: 386 TMKGG 390
GG
Sbjct: 417 YFAGG 421
>gi|449454654|ref|XP_004145069.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449470632|ref|XP_004153020.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449499016|ref|XP_004160697.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 434
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 82/308 (26%), Positives = 127/308 (41%), Gaps = 40/308 (12%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+ +SVG P S + DTGSD+ W C S + N+ ++ P+ S+T
Sbjct: 83 YLVEISVGTPPFSIVAVADTGSDVIWTQCKPCSNCYQQNAP--------MFDPSKSTTYK 134
Query: 165 KVPCNSTLCELQKQCPSAG--SNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
V C+S +C S S C Y + Y D + S G L D + + + + +
Sbjct: 135 NVACSSPVCSYSGDGSSCSDDSECLYSIAY-GDDSHSQGNLAVDTVTMQSTSGRPVAF-P 192
Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTG--- 279
R GCG G+F A +G+ GLG S+ + L FS C GTG
Sbjct: 193 RTVIGCGHDNAGTF--NANVSGIVGLGRGPASLVTQLGPA--TGGKFSYCLIPIGTGSTN 248
Query: 280 ---RISFGDKGS---PGQGETP-FSLRQTHPTYNITITQVSVGGNAVNF---------EF 323
+++FG + G TP +S Q Y++ + VSVG NF E
Sbjct: 249 DSTKLNFGSNANVSGSGTVSTPIYSSAQYKTFYSLKLEAVSVGDTKFNFPEGASKLGGES 308
Query: 324 SAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYC-YVLSPNQTNFEYPV 382
+ I DSGT+ TYL + + +F S + + P E+ Y + ++E P
Sbjct: 309 NIIIDSGTTLTYLP----SALLNSFGSAISQSMSLPHAQDPSEFLDYCFATTTDDYEMPP 364
Query: 383 VNLTMKGG 390
V + +G
Sbjct: 365 VTMHFEGA 372
>gi|148907930|gb|ABR17085.1| unknown [Picea sitchensis]
Length = 498
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 91/341 (26%), Positives = 140/341 (41%), Gaps = 43/341 (12%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++T + VG P + LDTGSD+ W+ C+ C C + I++P+ S++
Sbjct: 157 YFTRIGVGTPTREQYMVLDTGSDVAWIQCEPCRECYSQADP---------IFNPSYSASF 207
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
S V C+S +C C Y+ Y DG+ STG + L T +
Sbjct: 208 STVGCDSAVCSQLDAYDCHSGGCLYEASY-GDGSYSTGSFATETLTFGTTSV------AN 260
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGTGR 280
++ GCG G F+ A GLG S P+ + Q ++FS C SD +G
Sbjct: 261 VAIGCGHKNVGLFIGAAGLL---GLGAGALSFPNQIGTQ--TGHTFSYCLVDRESDSSGP 315
Query: 281 ISFGDKGSP-GQGETPFSLRQTHPT-YNITITQVSVGGNAVN------FEFSA------- 325
+ FG K P G TP PT Y +++T +SVGG ++ F
Sbjct: 316 LQFGPKSVPVGSIFTPLEKNPHLPTFYYLSVTAISVGGALLDSIPPEVFRIDETSGHGGF 375
Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNL 385
I DSGT T L AY + + F + + T + F+ CY LS Q P V
Sbjct: 376 IIDSGTVVTRLVTSAYDAVRDAFVAGTGQLPRTDAVSI-FDTCYDLSGLQF-VSVPTVGF 433
Query: 386 TMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
G + ++ + G + + S +V+I+G
Sbjct: 434 HFSNGASLILPAKNYLIPMDTVGTFCFAFAPAAS-SVSIMG 473
>gi|15234606|ref|NP_194732.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|4938479|emb|CAB43838.1| putative protein [Arabidopsis thaliana]
gi|7269903|emb|CAB80996.1| putative protein [Arabidopsis thaliana]
gi|67633774|gb|AAY78811.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332660310|gb|AEE85710.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 424
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 96/348 (27%), Positives = 143/348 (41%), Gaps = 47/348 (13%)
Query: 99 NSLGFLHYTNVSVGQPALSFIVALDTGSDLFW---LPCDCVSCVHGLNSSSGQVIDFNIY 155
N FL N+S+G P + ++ +DTGSDL W LPC C Q I F +
Sbjct: 74 NPAAFL--ANISIGNPPVPQLLLIDTGSDLTWIHCLPCKCYP----------QTIPF--F 119
Query: 156 SPNTSSTSSKVPCNSTLCEL-QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
P+ SST C S + Q NC Y +RY D + + G L E+ L T +
Sbjct: 120 HPSRSSTYRNASCVSAPHAMPQIFRDEKTGNCQYHLRY-RDFSNTRGILAEEKLTFETSD 178
Query: 215 KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG 274
S I FGCG+ +G +G+ GLG S+ + N G + FS CFG
Sbjct: 179 DGLIS-KQNIVFGCGQDNSGF----TKYSGVLGLGPGTFSI--VTRNFG---SKFSYCFG 228
Query: 275 SDGT----GRISFGDKGSPGQGE-TPFSLRQTHPTYNITITQVSVGGNAVNFE------- 322
S I G+ +G+ TP + Q Y + + +S G ++ E
Sbjct: 229 SLTNPTYPHNILILGNGAKIEGDPTPLQIFQDR--YYLDLQAISFGEKLLDIEPGTFQRY 286
Query: 323 ---FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEK-RETSTSDLPFEYCYVLSPNQTNF 378
+ D+G S T L AY +SE + L E R D CY + +
Sbjct: 287 RSQGGTVIDTGCSPTILAREAYETLSEEIDFLLGEVLRRVKDWDQYTTPCYEGNLKLDLY 346
Query: 379 EYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
+PVV GG ++ + VSSE + + + D++++IG
Sbjct: 347 GFPVVTFHFAGGAELALDVESLFVSSESGDSFCLAMTMNTFDDMSVIG 394
>gi|356573235|ref|XP_003554768.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 472
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 86/307 (28%), Positives = 122/307 (39%), Gaps = 44/307 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++T + VG PA + LDTGSD+ WL C C C + ++ P S T
Sbjct: 129 YFTRIGVGTPARYVYMVLDTGSDVVWLQCAPCRKCYTQADP---------VFDPTKSRTY 179
Query: 164 SKVPCNSTLCELQKQ--CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
+ +PC + LC C + C YQV Y DG+ + G + L ++
Sbjct: 180 AGIPCGAPLCRRLDSPGCNNKNKVCQYQVSY-GDGSFTFGDFSTETLTF------RRTRV 232
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GSDG 277
+R++ GCG G F+ A GLG + S P + FS C S
Sbjct: 233 TRVALGCGHDNEGLFIGAAGLL---GLGRGRLSFPVQTGRR--FNQKFSYCLVDRSASAK 287
Query: 278 TGRISFGDKGSPGQGE-TPFSLRQTHPT-YNITITQVSVGGNAVN------FEFSA---- 325
+ FGD TP T Y + + +SVGG+ V F A
Sbjct: 288 PSSVVFGDSAVSRTARFTPLIKNPKLDTFYYLELLGISVGGSPVRGLSASLFRLDAAGNG 347
Query: 326 --IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
I DSGTS T L PAY + + F A + + L F+ C+ LS T + P V
Sbjct: 348 GVIIDSGTSVTRLTRPAYIALRDAFRVGASHLKRAAEFSL-FDTCFDLS-GLTEVKVPTV 405
Query: 384 NLTMKGG 390
L +G
Sbjct: 406 VLHFRGA 412
>gi|357168204|ref|XP_003581534.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
2-like [Brachypodium distachyon]
Length = 436
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 75/271 (27%), Positives = 117/271 (43%), Gaps = 33/271 (12%)
Query: 96 YRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNI 154
+ L ++ L+ V +G P+ + +A TGSD+ W+PC C C + ++
Sbjct: 67 FVLEAMPGLYCITVKLGNPSRHYYLAFHTGSDVMWVPCSSCTDC----PTPDDIGFSLDL 122
Query: 155 YSPNTSSTSSKVP-----CNSTLCELQKQC---PSAGSNCPYQVRYLSDGTMSTGFLVED 206
Y P SSTSS++ C L C S+G C Y Y +TG+ V D
Sbjct: 123 YDPKNSSTSSEISCSDDRCADALKTGHAICHTSHSSGDQCGYNQIYADGVLATTGYYVSD 182
Query: 207 VLH--LATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGL 264
+H + + S + + FGC + ++G +G+ G G D S+ S L +QG
Sbjct: 183 DIHFDIFMGNESFASSSASVIFGCSKSRSGHL----QADGVIGFGKDAPSLISQLNSQG- 237
Query: 265 IPNSFSMCF--GSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE 322
+ ++FS C DG G + + G PG T SL + P YN+ + ++V V +
Sbjct: 238 VSHAFSRCLDDSDDGGGVLILDEVGEPGLEFT--SLVASRPCYNLNMKSIAVNNQNVPID 295
Query: 323 FS---------AIFDSGTSFTYLNDPAYTQI 344
S DSGTS Y D Y +
Sbjct: 296 SSLFTTSSTQGTFLDSGTSLAYFPDGVYDPV 326
>gi|225455876|ref|XP_002275164.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 496
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 84/302 (27%), Positives = 128/302 (42%), Gaps = 41/302 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ V +G+P+ +F + +DTGSD+ WL C C C ++ I+ P +SS+
Sbjct: 160 YFLRVGIGRPSKTFYMVIDTGSDVNWLQCKPCDDCYQQVDP---------IFDPASSSSF 210
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
S++ C + C +C YQV Y DG+ + G + + S SVD +
Sbjct: 211 SRLGCQTPQCRNLDVFACRNDSCLYQVSY-GDGSYTVGDFATETVSFG----NSGSVD-K 264
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGTGR 280
++ GCG G F+ A + P L +Q + +SFS C S +
Sbjct: 265 VAIGCGHDNEGLFVGAAG-------LIGLGGGPLSLTSQ-IKASSFSYCLVNRDSVDSST 316
Query: 281 ISFGDKGSPGQGETP-FSLRQTHPTYNITITQVSVGGNAVN-----FEFSA------IFD 328
+ F P F + Y + IT +SVGG + FE I D
Sbjct: 317 LEFNSAKPSDSVTAPIFKNSKVDTFYYVGITGMSVGGEKLAIPPSIFEVDGSGKGGIIVD 376
Query: 329 SGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMK 388
GT+ T L AY + +TF L K+ TS L F+ CY LS ++T+ P V
Sbjct: 377 CGTAVTRLQTQAYNALRDTFVKLTKDLPSTSGFAL-FDTCYNLS-SRTSVRVPTVAFLFD 434
Query: 389 GG 390
GG
Sbjct: 435 GG 436
>gi|125548488|gb|EAY94310.1| hypothetical protein OsI_16079 [Oryza sativa Indica Group]
Length = 423
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 100/347 (28%), Positives = 141/347 (40%), Gaps = 56/347 (16%)
Query: 108 NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
+VS+G PAL++ +DTGSDL W C CV C ++ P++SST + V
Sbjct: 77 DVSIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTP---------VFDPSSSSTYATV 127
Query: 167 PCNSTLCE--LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRI 224
PC+S C +C SA S C Y Y D + + G L + LA KS +
Sbjct: 128 PCSSASCSDLPTSKCTSA-SKCGYTYTY-GDSSSTQGVLATETFTLA------KSKLPGV 179
Query: 225 SFGCGRVQTGS-FLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGTGR-- 280
FGCG G F GA GL GLG S+ S L GL + FS C S D T
Sbjct: 180 VFGCGDTNEGDGFSQGA---GLVGLGRGPLSLVSQL---GL--DKFSYCLTSLDDTNNSP 231
Query: 281 --------ISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA------ 325
IS + TP + P+ Y +++ ++VG ++ SA
Sbjct: 232 LLLGSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDD 291
Query: 326 -----IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTN-FE 379
I DSGTS TYL Y + + F + S + + C+ + E
Sbjct: 292 GTGGVIVDSGTSITYLEVQGYRALKKAFAA-QMALPAADGSGVGLDLCFRAPAKGVDQVE 350
Query: 380 YPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
P + GG + +V G CL V+ S ++IIG
Sbjct: 351 VPRLVFHFDGGADLDLPAENYMVLDGGSG--ALCLTVMGSRGLSIIG 395
>gi|38344829|emb|CAD40873.2| OSJNBa0064H22.10 [Oryza sativa Japonica Group]
gi|116310063|emb|CAH67084.1| H0818E04.1 [Oryza sativa Indica Group]
gi|116310186|emb|CAH67198.1| OSIGBa0152K17.10 [Oryza sativa Indica Group]
Length = 444
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 100/347 (28%), Positives = 141/347 (40%), Gaps = 56/347 (16%)
Query: 108 NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
+VS+G PAL++ +DTGSDL W C CV C ++ P++SST + V
Sbjct: 98 DVSIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTP---------VFDPSSSSTYATV 148
Query: 167 PCNSTLCE--LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRI 224
PC+S C +C SA S C Y Y D + + G L + LA KS +
Sbjct: 149 PCSSASCSDLPTSKCTSA-SKCGYTYTY-GDSSSTQGVLATETFTLA------KSKLPGV 200
Query: 225 SFGCGRVQTGS-FLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGTGR-- 280
FGCG G F GA GL GLG S+ S L GL + FS C S D T
Sbjct: 201 VFGCGDTNEGDGFSQGA---GLVGLGRGPLSLVSQL---GL--DKFSYCLTSLDDTNNSP 252
Query: 281 --------ISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA------ 325
IS + TP + P+ Y +++ ++VG ++ SA
Sbjct: 253 LLLGSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDD 312
Query: 326 -----IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTN-FE 379
I DSGTS TYL Y + + F + S + + C+ + E
Sbjct: 313 GTGGVIVDSGTSITYLEVQGYRALKKAFAA-QMALPAADGSGVGLDLCFRAPAKGVDQVE 371
Query: 380 YPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
P + GG + +V G CL V+ S ++IIG
Sbjct: 372 VPRLVFHFDGGADLDLPAENYMVLDGGSG--ALCLTVMGSRGLSIIG 416
>gi|301119611|ref|XP_002907533.1| aspartyl protease family A01B, putative [Phytophthora infestans
T30-4]
gi|262106045|gb|EEY64097.1| aspartyl protease family A01B, putative [Phytophthora infestans
T30-4]
Length = 681
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 86/340 (25%), Positives = 136/340 (40%), Gaps = 45/340 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
HYT V G P V DTGS L PC C C H + + SST
Sbjct: 67 HYTWVYAGTPPQRASVIADTGSALMAFPCSGCDGCGHHTDQP---------FQAANSSTL 117
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLA-----TDEKQSK 218
+ C K+C C Y+ +G+ +VED+++L D++
Sbjct: 118 VHITCAQKSLFQCKECHVQSDTCGISQSYM-EGSSWKASVVEDIVYLGGESSFDDKEMRN 176
Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIP-NSFSMCFGSDG 277
+ FGC + G F+ A +G+ GL + + + L + I N FS+CF +G
Sbjct: 177 RYGTHFQFGCQSSEKGLFVTQVA-DGIMGLSNTENHIIAKLHRENKIASNLFSLCFTENG 235
Query: 278 TGRISFGD-KGSPGQGETPF----SLRQTHPTYNITITQVSVGGNAVNFEFSA------I 326
G +S G + +GE + + R YN+ + + +GG ++N + A I
Sbjct: 236 -GTMSVGQPHKAAHRGEISYVKVIADRSAGHFYNVHMKDIRIGGKSINAKEEAYTRGHYI 294
Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLT 386
DSGT+ +YL T+ + F +A + S F N+ P + L
Sbjct: 295 VDSGTTDSYLPRALKTEFLQMFKEIAGRDYQVGNSCKGF-------TNKDLASLPTIQLV 347
Query: 387 MKGGGPFFVNDPIVIVSSEPKGLYL-----YCLGVVKSDN 421
M+ G + VI+ P+ L YC G+ S+N
Sbjct: 348 MEAYGD---ENAEVILDVPPEQYLLESNGAYCGGIYLSEN 384
>gi|449515033|ref|XP_004164554.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 430
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 97/348 (27%), Positives = 146/348 (41%), Gaps = 40/348 (11%)
Query: 60 YSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSL---GFLHY-TNVSVGQPA 115
+S+L+H DR R L+ T L +A N L + G Y +VS+G P
Sbjct: 46 FSSLSHYDRLTNAFRRSLS---RSATLLNRAATNGALDLQAPLTPGSGEYLMSVSIGTPP 102
Query: 116 LSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCEL 175
+ +I DTGSDL W C+ C+ S I+ P S++ S VPCNS C+
Sbjct: 103 VDYIGMADTGSDLMW--AQCLPCLKCYKQSR------PIFDPLKSTSFSHVPCNSQNCKA 154
Query: 176 --QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQT 233
C + G C Y Y D T + G L + + + S SV S I GCG
Sbjct: 155 IDDSHCGAQGV-CDYSYTY-GDQTYTKGDLGFEKITIG-----SSSVKSVI--GCGHESG 205
Query: 234 GSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG---SDGTGRISFGDKG--- 287
G F + + GLG + S+ S ++ I FS C S G+I+FG
Sbjct: 206 GGFGFASG---VIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHANGKINFGQNAVVS 262
Query: 288 SPGQGETPFSLRQTHPTYNITITQVSVGGN---AVNFEFSAIFDSGTSFTYLNDPAYTQI 344
PG TP + Y +T+ +S+G A + + I DSGT+ ++L Y +
Sbjct: 263 GPGVVSTPLISKNPVTYYYVTLEAISIGNERHMASAKQGNVIIDSGTTLSFLPKELYDGV 322
Query: 345 SETFNSLAKEKRETSTSDLPFEYCYVLSPN-QTNFEYPVVNLTMKGGG 391
+ + K KR + ++ C+ N T+ P++ GG
Sbjct: 323 VSSLLKVVKAKRVKDPGNF-WDLCFDDGINVATSSGIPIITAQFSGGA 369
>gi|115458644|ref|NP_001052922.1| Os04g0448300 [Oryza sativa Japonica Group]
gi|113564493|dbj|BAF14836.1| Os04g0448300 [Oryza sativa Japonica Group]
gi|215766465|dbj|BAG98773.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767943|dbj|BAH00172.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 454
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 100/347 (28%), Positives = 141/347 (40%), Gaps = 56/347 (16%)
Query: 108 NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
+VS+G PAL++ +DTGSDL W C CV C ++ P++SST + V
Sbjct: 108 DVSIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTP---------VFDPSSSSTYATV 158
Query: 167 PCNSTLCE--LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRI 224
PC+S C +C SA S C Y Y D + + G L + LA KS +
Sbjct: 159 PCSSASCSDLPTSKCTSA-SKCGYTYTY-GDSSSTQGVLATETFTLA------KSKLPGV 210
Query: 225 SFGCGRVQTGS-FLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGTGR-- 280
FGCG G F GA GL GLG S+ S L GL + FS C S D T
Sbjct: 211 VFGCGDTNEGDGFSQGA---GLVGLGRGPLSLVSQL---GL--DKFSYCLTSLDDTNNSP 262
Query: 281 --------ISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA------ 325
IS + TP + P+ Y +++ ++VG ++ SA
Sbjct: 263 LLLGSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDD 322
Query: 326 -----IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTN-FE 379
I DSGTS TYL Y + + F + S + + C+ + E
Sbjct: 323 GTGGVIVDSGTSITYLEVQGYRALKKAFAA-QMALPAADGSGVGLDLCFRAPAKGVDQVE 381
Query: 380 YPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
P + GG + +V G CL V+ S ++IIG
Sbjct: 382 VPRLVFHFDGGADLDLPAENYMVLDGGSG--ALCLTVMGSRGLSIIG 426
>gi|15228618|ref|NP_191741.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|6850873|emb|CAB71112.1| putative protein [Arabidopsis thaliana]
gi|332646739|gb|AEE80260.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 483
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 94/315 (29%), Positives = 130/315 (41%), Gaps = 54/315 (17%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ + VG PA + + LDTGSD+ WL C C +C + ++ I+ P S T
Sbjct: 135 YFMRLGVGTPATNVYMVLDTGSDVVWLQCSPCKACYNQTDA---------IFDPKKSKTF 185
Query: 164 SKVPCNSTLCEL---QKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
+ VPC S LC +C + S C YQV Y DG+ + G + L
Sbjct: 186 ATVPCGSRLCRRLDDSSECVTRRSKTCLYQVSY-GDGSFTEGDFSTETLTF-----HGAR 239
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF------ 273
VD + GCG G F+ A GLG S PS N+ FS C
Sbjct: 240 VD-HVPLGCGHDNEGLFVGAAGLL---GLGRGGLSFPSQTKNR--YNGKFSYCLVDRTSS 293
Query: 274 --GSDGTGRISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGGNAV------NF 321
S I FG+ P + F+ T+P Y + + +SVGG+ V F
Sbjct: 294 GSSSKPPSTIVFGNAAVP--KTSVFTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSESQF 351
Query: 322 EFSA------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQ 375
+ A I DSGTS T L PAY + + F L K + + S F+ C+ LS
Sbjct: 352 KLDATGNGGVIIDSGTSVTRLTQPAYVALRDAFR-LGATKLKRAPSYSLFDTCFDLS-GM 409
Query: 376 TNFEYPVVNLTMKGG 390
T + P V GG
Sbjct: 410 TTVKVPTVVFHFGGG 424
>gi|449432044|ref|XP_004133810.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 471
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 88/311 (28%), Positives = 130/311 (41%), Gaps = 53/311 (17%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++T + VG P + LDTGSD+ WL C C +C + +++P S +
Sbjct: 129 YFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDP---------VFNPVKSGSF 179
Query: 164 SKVPCNSTLCELQKQCPSAGSN----CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
+KV C + LC ++ S G N C YQV Y DG+ +TG V + L + +
Sbjct: 180 AKVLCRTPLC---RRLESPGCNQRQTCLYQVSY-GDGSYTTGEFVTETLTFRRTKVE--- 232
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GS 275
+++ GCG G F+ A GL G+ S NQ FS C S
Sbjct: 233 ---QVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRTFNQ-----KFSYCLVDRSAS 284
Query: 276 DGTGRISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGGNAV------NFEFS- 324
+ FG+ F+ T+P Y + + +SVGG V +F+
Sbjct: 285 SKPSSVVFGNSAVSRTAR--FTPLLTNPRLDTFYYVELLGISVGGTPVSGITASHFKLDR 342
Query: 325 -----AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFE 379
I D GTS T LN PAY + + F + A + L F+ CY LS +T +
Sbjct: 343 TGNGGVIIDCGTSVTRLNKPAYIALRDAFRAGASSLKSAPEFSL-FDTCYDLS-GKTTVK 400
Query: 380 YPVVNLTMKGG 390
P V L +G
Sbjct: 401 VPTVVLHFRGA 411
>gi|359473000|ref|XP_002278677.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 458
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 89/324 (27%), Positives = 137/324 (42%), Gaps = 56/324 (17%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ ++ +G P ++ DTGSDL W+ C C +C S+ +SPN
Sbjct: 89 YFVDLRLGTPPQKLLLVADTGSDLVWVKCSACRNCTRHTPGSAFLARHSTTFSPNH---- 144
Query: 164 SKVPCNSTLCEL-----QKQCPSA--GSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
C + C+L +C A S C Y+ Y DG+ ++GF ++ L T +
Sbjct: 145 ----CYDSACQLVPLPKHHRCNHARLHSPCRYEYSY-GDGSKTSGFFSKETTTLNTSSGR 199
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPN---GLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
+ I+FGC +G + GA+ N G+ GLG S+ S L ++ N FS C
Sbjct: 200 EAKLKG-IAFGCAFRISGPSVSGASFNGAHGVMGLGRGPISLSSQLGHR--FGNKFSYCL 256
Query: 274 -----GSDGTGRISFG---DKGSPGQGE---TPFSLRQTHPT-YNITITQVSVGGNAVNF 321
T + G + +PG+ TP + PT Y I I VSV G +
Sbjct: 257 MDHDISPSPTSYLLIGSTQNDVAPGKRRMRFTPLHINPLSPTFYYIGIESVSVDGIKLPI 316
Query: 322 EFSA-----------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP--FEYC 368
S I DSGT+ T+L +PAY QI + + R S ++ F+ C
Sbjct: 317 NPSVWALDELGNGGTIVDSGTTLTFLPEPAYLQI---LTVIKRRVRLPSPAEPTPGFDLC 373
Query: 369 YVLSPNQTNFEYPVV-NLTMKGGG 391
N + E+P + L+ K GG
Sbjct: 374 V----NVSEIEHPRLPKLSFKLGG 393
>gi|226530755|ref|NP_001150335.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195638492|gb|ACG38714.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 461
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 96/321 (29%), Positives = 141/321 (43%), Gaps = 40/321 (12%)
Query: 98 LNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSP 157
LN+L +L V +G PA S + +DTGSD+ W+ C S H ++ P
Sbjct: 123 LNTLEYL--ITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADP--------LFDP 172
Query: 158 NTSSTSSKVPCNSTLCELQKQ----CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
++SST S C S C Q C S+ S C Y V Y DG+ +TG D L L +
Sbjct: 173 SSSSTYSPFSCGSAACAQLGQEGNGC-SSSSQCQYIVTY-GDGSSTTGTYSSDTLALGSS 230
Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
+S FGC V++G F D +GL GLG S+ S A G + +FS C
Sbjct: 231 AVKS------FQFGCSNVESG-FNDQT--DGLMGLGGGAQSLVSQTA--GTLGRAFSYCL 279
Query: 274 --GSDGTGRISFGDKGSPGQG---ETPFSLRQTHPT-YNITITQVSVGGNAVN-----FE 322
+G ++ G G G +TP PT Y + + + VGG ++ F
Sbjct: 280 PPTPSSSGFLTLGAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFS 339
Query: 323 FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
+ DSGT T L AY+ +S F + K+ S + + C+ S Q++ P
Sbjct: 340 AGTVMDSGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSGI-LDTCFDFS-GQSSVSIPS 397
Query: 383 VNLTMKGGGPFFVNDPIVIVS 403
V L GG ++ +I+S
Sbjct: 398 VALVFSGGAVVSLDASGIILS 418
>gi|15229656|ref|NP_188478.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75273882|sp|Q9LS40.1|ASPG1_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 1;
Short=AtASPG1; Flags: Precursor
gi|11994113|dbj|BAB01116.1| CND41, chloroplast nucleoid DNA binding protein-like [Arabidopsis
thaliana]
gi|23297732|gb|AAN13013.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
gi|332642583|gb|AEE76104.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 500
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 88/339 (25%), Positives = 139/339 (41%), Gaps = 42/339 (12%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+++ + VG PA + LDTGSD+ W+ C+ C C + +++P +SST
Sbjct: 162 YFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCADCYQQSDP---------VFNPTSSSTY 212
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
+ C++ C L + + C YQV Y DG+ + G L D + K +
Sbjct: 213 KSLTCSAPQCSLLETSACRSNKCLYQVSY-GDGSFTVGELATDTVTFGNSGKINN----- 266
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISF 283
++ GCG G F A GL G + + NQ + SFS C +G+ S
Sbjct: 267 VALGCGHDNEGLFTGAAGLLGLGGGVLS-------ITNQ-MKATSFSYCLVDRDSGKSSS 318
Query: 284 GDKGSP----GQGETPFSLRQTHPT-YNITITQVSVGGNAVN-----FEFSA------IF 327
D S G P + T Y + ++ SVGG V F+ A I
Sbjct: 319 LDFNSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVIL 378
Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTM 387
D GT+ T L AY + + F L ++ S+S F+ CY S T + P V
Sbjct: 379 DCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLST-VKVPTVAFHF 437
Query: 388 KGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
GG + ++ + G + + S +++IIG
Sbjct: 438 TGGKSLDLPAKNYLIPVDDSGTFCFAFAPTSS-SLSIIG 475
>gi|255548664|ref|XP_002515388.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545332|gb|EEF46837.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 494
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 93/315 (29%), Positives = 135/315 (42%), Gaps = 42/315 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ V +G P F + DTGSDL W C+ CV + + I++P+ S++
Sbjct: 153 YFVTVGLGTPKKDFSLIFDTGSDLTWTQCEPCVKSCYNQKEA--------IFNPSQSTSY 204
Query: 164 SKVPCNSTLCELQKQCPS-----AGSNCPYQVRYLSDGTMSTGFLVEDVLHL-ATDEKQS 217
+ + C STLC+ A S C Y ++Y D + S GF ++ L L ATD
Sbjct: 205 ANISCGSTLCDSLASATGNIFNCASSTCVYGIQY-GDSSFSIGFFGKEKLSLTATD---- 259
Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDG 277
V + FGCG+ G F A GLG DK S+ S A + S+ + S
Sbjct: 260 --VFNDFYFGCGQNNKGLFGGAAGLL---GLGRDKLSLVSQTAQRYNKIFSYCLPSSSSS 314
Query: 278 TGRISFGDKGSPGQGETPF-SLRQTHPTYNITITQVSVGGNAVNFEFSA------IFDSG 330
TG ++FG S TP ++ Y + +T +SVGG + S I DSG
Sbjct: 315 TGFLTFGGSTSKSASFTPLATISGGSSFYGLDLTGISVGGRKLAISPSVFSTAGTIIDSG 374
Query: 331 TSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGG 390
T T L AY+ +S TF L + + + C+ S N P + L GG
Sbjct: 375 TVITRLPPAAYSALSSTFRKLMSQYPAAPALSI-LDTCFDFS-NHDTISVPKIGLFFSGG 432
Query: 391 --------GPFFVND 397
G F+VND
Sbjct: 433 VVVDIDKTGIFYVND 447
>gi|15234607|ref|NP_194733.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|4938480|emb|CAB43839.1| putative protein [Arabidopsis thaliana]
gi|7269904|emb|CAB80997.1| putative protein [Arabidopsis thaliana]
gi|67633776|gb|AAY78812.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332660311|gb|AEE85711.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 427
Score = 75.5 bits (184), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 92/356 (25%), Positives = 142/356 (39%), Gaps = 71/356 (19%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
N+S+G P ++ ++ +DT SDL W+ C C++C I+ P+ S T
Sbjct: 87 VNISIGSPPITQLLHMDTASDLLWIQCLPCINC---------YAQSLPIFDPSRSYTHRN 137
Query: 166 VPCNSTLCELQKQCPSAGSN-----CPYQVRYLSDGTMSTGFLVEDVLHLAT--DEKQSK 218
C ++ Q PS N C Y +RY+ D T S G L ++L T DE S
Sbjct: 138 ETCRTS----QYSMPSLKFNANTRSCEYSMRYVDD-TGSKGILAREMLLFNTIYDESSSA 192
Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT 278
++ + FGCG G L G G+ GLG + S+ + FS CFGS
Sbjct: 193 ALHD-VVFGCGHDNYGEPLVGT---GILGLGYGEFSLVHRFGKK------FSYCFGSLDD 242
Query: 279 -----GRISFGDKGSPGQGE-TPFSLRQTHPTYNITITQVSVGGNAVNFEFSA------- 325
+ GD G+ G+ TP + Y +TI +SV G + +
Sbjct: 243 PSYPHNVLVLGDDGANILGDTTPLEIHNGF--YYVTIEAISVDGIILPIDPRVFNRNHQT 300
Query: 326 -----IFDSGTSFTYLNDPAYTQISETFNSLAKEK---RETSTSDLPFEYCYVLSPNQTN 377
I D+G S T L + AY + + + + + S D+ CY N
Sbjct: 301 GLGGTIIDTGNSLTSLVEEAYKPLKNRIEDIFEGRFTAADVSQDDMIKMECY-----NGN 355
Query: 378 FE-------YPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
FE +P+V G ++ + + P ++CL V N+N IG
Sbjct: 356 FERDLVESGFPIVTFHFSEGAELSLDVKSLFMKLSPN---VFCLAVTPG-NLNSIG 407
>gi|297826117|ref|XP_002880941.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297326780|gb|EFH57200.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 397
Score = 75.5 bits (184), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 69/248 (27%), Positives = 111/248 (44%), Gaps = 42/248 (16%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
++ + +G P + +DTGSDL W C C +C I+ P+ SST
Sbjct: 60 IYLMRLQLGTPPFEIVAEIDTGSDLIWTQCMPCPNCYTQFAP---------IFDPSKSST 110
Query: 163 SSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
+ C+ G++CPY++ Y +D + STG L + + + + + V +
Sbjct: 111 FKEKRCH-------------GNSCPYEIIY-ADESYSTGILATETVTIQSTSGE-PFVMA 155
Query: 223 RISFGCGRVQTGSFLDG--AAPNGLFGLGMDKTSVPSILANQGL-IPNSFSMCFGSDGTG 279
S GCG + G A+ +G+ GL M + S+++ L IP S CF S GT
Sbjct: 156 ETSIGCGLNNSNLMTPGYAASSSGIVGLNMGPS---SLISQMDLPIPGLISYCFSSQGTS 212
Query: 280 RISFGDK---GSPGQGETPFSLRQTHPTYNITITQVSVG-------GNAVNFEFSAIF-D 328
+I+FG G +++ P Y + + VSVG G + + IF D
Sbjct: 213 KINFGTNAVVAGDGTVAADMFIKKDQPFYYLNLDAVSVGDKRIETLGTPFHAQDGNIFID 272
Query: 329 SGTSFTYL 336
SGT++TYL
Sbjct: 273 SGTTYTYL 280
>gi|449529638|ref|XP_004171805.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like, partial
[Cucumis sativus]
Length = 384
Score = 75.5 bits (184), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 88/310 (28%), Positives = 130/310 (41%), Gaps = 53/310 (17%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++T + VG P + LDTGSD+ WL C C +C + +++P S +
Sbjct: 42 YFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDP---------VFNPVKSGSF 92
Query: 164 SKVPCNSTLCELQKQCPSAGSN----CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
+KV C + LC ++ S G N C YQV Y DG+ +TG V + L + +
Sbjct: 93 AKVLCRTPLC---RRLESPGCNQRQTCLYQVSY-GDGSYTTGEFVTETLTFRRTKVE--- 145
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GS 275
+++ GCG G F+ A GL G+ S NQ FS C S
Sbjct: 146 ---QVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRTFNQ-----KFSYCLVDRSAS 197
Query: 276 DGTGRISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGGNAV------NFEFS- 324
+ FG+ F+ T+P Y + + +SVGG V +F+
Sbjct: 198 SKPSSVVFGNSAVSRTAR--FTPLLTNPRLDTFYYVELLGISVGGTPVSGITASHFKLDR 255
Query: 325 -----AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFE 379
I D GTS T LN PAY + + F + A + L F+ CY LS +T +
Sbjct: 256 TGNGGVIIDCGTSVTRLNKPAYIALRDAFRAGASSLKSAPEFSL-FDTCYDLS-GKTTVK 313
Query: 380 YPVVNLTMKG 389
P V L +G
Sbjct: 314 VPTVVLHFRG 323
>gi|357457681|ref|XP_003599121.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355488169|gb|AES69372.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 439
Score = 75.5 bits (184), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 83/303 (27%), Positives = 129/303 (42%), Gaps = 41/303 (13%)
Query: 110 SVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPC 168
S+G P +DTGSD W C C C LN +S I++P+ SST + C
Sbjct: 95 SIGTPPFQLYGVVDTGSDGIWFQCKPCKPC---LNQTSP------IFNPSKSSTYKNIRC 145
Query: 169 NSTLCEL--QKQCPS-AGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
+S +C+ + +C S C Y++ YL D + S G + +D L L +++ S +I
Sbjct: 146 SSPICKRGEKTRCSSNRKRKCEYEITYL-DRSGSQGDISKDTLTLNSNDGSPISF-PKIV 203
Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-----DGTGR 280
GCG S +G+ G G S+ S L + I FS C S + + +
Sbjct: 204 IGCG--HKNSLTTEGLASGIIGFGRGNFSIVSQLGSS--IGGKFSYCLASLFSKANISSK 259
Query: 281 ISFGDKGS-PGQGETPFSLRQTH--PTYNITITQVSVGGNAVNF---------EFSAIFD 328
+ FGD G G L Q+ Y + SVG + + E +A+ D
Sbjct: 260 LYFGDMAVVSGHGVVSTPLIQSFYVGNYFTNLEAFSVGDHIIKLKDSSLIPDNEGNAVID 319
Query: 329 SGTSFTYLNDPAYTQISETFNSLAKEKR-ETSTSDLPFEYCYVLSPNQTNFEYPVVNLTM 387
SG++ T L + Y+Q+ S+ K KR + T L Y L +E P++
Sbjct: 320 SGSTITQLPNDVYSQLETAVISMVKLKRVKDPTQQLSLCYKTTLK----KYEVPIITAHF 375
Query: 388 KGG 390
+G
Sbjct: 376 RGA 378
>gi|242044888|ref|XP_002460315.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
gi|241923692|gb|EER96836.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
Length = 456
Score = 75.5 bits (184), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 84/314 (26%), Positives = 124/314 (39%), Gaps = 56/314 (17%)
Query: 71 RLRGRGLAAQGNDKTPLTFSAGNDTYR--------LNSLGFLHYT-NVSVGQPALSFIVA 121
+ R L+A N FS ND R + G L Y ++++G P
Sbjct: 59 KARAAALSAVRNRAASARFSGKNDDQRTTPPTGVSVRPSGDLEYVVDLAIGTPPQPVSAL 118
Query: 122 LDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE--LQKQ 178
LDTGSDL W C C SC+ + +++P S++ + C LC L
Sbjct: 119 LDTGSDLIWTQCAPCASCLAQPDP---------LFAPGESASYEPMRCAGQLCSDILHHG 169
Query: 179 CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLD 238
C C Y+ Y DGTM+ G + T + + + FGCG + GS +
Sbjct: 170 C-EMPDTCTYRYNY-GDGTMTMGVYATERFTF-TSSGGDRLMTVPLGFGCGSMNVGSLNN 226
Query: 239 GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRIS-----------FGDKG 287
G +G+ G G + S+ S L+ + FS C S G+GR S +GD
Sbjct: 227 G---SGIVGFGRNPLSLVSQLSIR-----RFSYCLTSYGSGRKSTLLFGSLSGGVYGDAT 278
Query: 288 SPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA-----------IFDSGTSFTY 335
P Q TP +PT Y + + ++VG + SA I DSGT+ T
Sbjct: 279 GPVQ-TTPLLQSLQNPTFYYVHLAGLTVGARRLRIPESAFALRPDGSGGVIVDSGTALTL 337
Query: 336 LNDPAYTQISETFN 349
L ++ F
Sbjct: 338 LPGAVLAEVVRAFR 351
>gi|125563959|gb|EAZ09339.1| hypothetical protein OsI_31611 [Oryza sativa Indica Group]
Length = 453
Score = 75.5 bits (184), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 78/281 (27%), Positives = 115/281 (40%), Gaps = 50/281 (17%)
Query: 98 LNSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIY 155
+ + G L Y +++VG P LDTGSDL W CD C +C+ + ++
Sbjct: 90 VRASGDLEYVLDLAVGTPPQPITALLDTGSDLIWTQCDTCTACLRQPDP---------LF 140
Query: 156 SPNTSSTSSKVPCNSTLCE--LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
SP SS+ + C LC L C C Y+ Y DGT + G+ + A+
Sbjct: 141 SPRMSSSYEPMRCAGQLCGDILHHSCVRP-DTCTYRYSY-GDGTTTLGYYATERFTFASS 198
Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC- 272
+++SV + FGCG + GS + +G+ G G D S+ S L+ + FS C
Sbjct: 199 SGETQSVP--LGFGCGTMNVGSLNNA---SGIVGFGRDPLSLVSQLSIR-----RFSYCL 248
Query: 273 --FGSDGTGRISFG---------DKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVN 320
+ S + FG D P Q TP +PT Y + T V+VG +
Sbjct: 249 TPYASSRKSTLQFGSLADVGLYDDATGPVQ-TTPILQSAQNPTFYYVAFTGVTVGARRLR 307
Query: 321 FEFSA-----------IFDSGTSFTYLNDPAYTQISETFNS 350
SA I DSGT+ T ++ F S
Sbjct: 308 IPASAFALRPDGSGGVIIDSGTALTLFPAAVLAEVVRAFRS 348
>gi|357514995|ref|XP_003627786.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521808|gb|AET02262.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 436
Score = 75.5 bits (184), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 90/339 (26%), Positives = 135/339 (39%), Gaps = 44/339 (12%)
Query: 110 SVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPC 168
SVG P +DTGSD+ WL C+ C C + +++P+ SS+ +PC
Sbjct: 92 SVGTPPFKLYGIVDTGSDIVWLQCEPCQECYNQTTP---------MFNPSKSSSYKNIPC 142
Query: 169 NSTLCELQKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFG 227
S LC+ + N C Y Y D + S G L D L L + + S I G
Sbjct: 143 PSKLCQSMEDTSCNDKNYCEYST-YYGDNSHSGGDLSVDTLTLESTNGLTVSF-PNIVIG 200
Query: 228 CGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---------GSDGT 278
CG S+ +GA+ +G+ G G S + L + FS C S+ T
Sbjct: 201 CGTNNILSY-EGAS-SGIVGFGSGPASFITQLGSS--TGGKFSYCLTPLFSVTNIQSNAT 256
Query: 279 GRISFGDKGS---PGQGETPFSLRQTHPTYNITITQVSVGGNAVNF--------EFSAIF 327
+++FGD + G TP + Y +T+ SVG V E + I
Sbjct: 257 SKLNFGDAATVSGDGVVTTPILKKDPETFYYLTLEAFSVGNRRVEIGGVPNGDNEGNIII 316
Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTM 387
DSGT+ T L Y+ + L K +R + CY S +++P++ +
Sbjct: 317 DSGTTLTSLTKDDYSFLESAVVDLVKLERVDDPTQT-LNLCY--SVKAEGYDFPIITMHF 373
Query: 388 KGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
KG PI S G ++CL S + I G
Sbjct: 374 KGADVDL--HPISTFVSVADG--VFCLAFESSQDHAIFG 408
>gi|293335955|ref|NP_001168399.1| uncharacterized protein LOC100382168 precursor [Zea mays]
gi|223948009|gb|ACN28088.1| unknown [Zea mays]
gi|413922066|gb|AFW61998.1| hypothetical protein ZEAMMB73_694403 [Zea mays]
Length = 507
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 95/370 (25%), Positives = 150/370 (40%), Gaps = 68/370 (18%)
Query: 97 RLNSLGFLHYTNV--SVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFN 153
RL +L ++ ++ S G PA + V +DTGSDL W+ C C +C +
Sbjct: 138 RLQTLNYVTTISLGGSSGSPAANLTVIVDTGSDLTWVQCKPCSACYAQRDP--------- 188
Query: 154 IYSPNTSSTSSKVPCNSTLCELQKQ--------CPSAGS---NCPYQVRYLSDGTMSTGF 202
++ P S+T + V CN++ C + C S G+ C Y + Y DG+ S G
Sbjct: 189 LFDPAGSATYAAVRCNASACADSLRAATGTPGSCGSTGAGSEKCYYALAY-GDGSFSRGV 247
Query: 203 LVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQ 262
L D + L S+ + FGCG G F GL GLG + S+ S A++
Sbjct: 248 LATDTVALG-----GASLGGFV-FGCGLSNRGLF---GGTAGLMGLGRTELSLVSQTASR 298
Query: 263 GLIPNSFSMCF----GSDGTGRISFG---DKGSPGQGETPFSLRQT------HPTYNITI 309
FS C D +G +S G D S + TP + + P Y + +
Sbjct: 299 --YGGVFSYCLPAATSGDASGSLSLGGGDDAASSYRNTTPVAYTRMIADPAQPPFYFLNV 356
Query: 310 TQVSVGGNAVNFE----FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP- 364
T +VGG A+ + + + DSGT T L Y + F R+ + P
Sbjct: 357 TGAAVGGTALAAQGLGASNVLIDSGTVITRLAPSVYRAVRAEF------MRQFGAAGYPA 410
Query: 365 ------FEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDP--IVIVSSEPKGLYLYCLGV 416
+ CY L+ + P++ L ++GG V+ + +V + + L +
Sbjct: 411 APGFSILDTCYDLT-GHDEVKVPLLTLRLEGGADVTVDAAGMLFVVRKDGSQVCLAMASL 469
Query: 417 VKSDNVNIIG 426
D IIG
Sbjct: 470 SYEDETPIIG 479
>gi|356522504|ref|XP_003529886.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 473
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 90/307 (29%), Positives = 123/307 (40%), Gaps = 44/307 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++T + VG P + LDTGSD+ WL C C C S + Q+ D P+ S +
Sbjct: 130 YFTRLGVGTPPKYLYMVLDTGSDVVWLQCKPCTKCY----SQTDQIFD-----PSKSKSF 180
Query: 164 SKVPCNSTLCELQKQ--CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
+ +PC S LC C + C YQV Y DG+ + G + L ++
Sbjct: 181 AGIPCYSPLCRRLDSPGCSLKNNLCQYQVSY-GDGSFTFGDFSTETLTF------RRAAV 233
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GSDG 277
R++ GCG G F+ A L GLG S P+ + N FS C S
Sbjct: 234 PRVAIGCGHDNEGLFVGAAG---LLGLGRGGLSFPTQTGTR--FNNKFSYCLTDRTASAK 288
Query: 278 TGRISFGDKGSPGQGE-TPFSLRQTHPT-YNITITQVSVGGNAVN------FEFSA---- 325
I FGD TP T Y + + +SVGG V F +
Sbjct: 289 PSSIVFGDSAVSRTARFTPLVKNPKLDTFYYVELLGISVGGAPVRGISASFFRLDSTGNG 348
Query: 326 --IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
I DSGTS T L PAY + + F A + L F+ CY LS + + P V
Sbjct: 349 GVIIDSGTSVTRLTRPAYVSLRDAFRVGASHLKRAPEFSL-FDTCYDLS-GLSEVKVPTV 406
Query: 384 NLTMKGG 390
L +G
Sbjct: 407 VLHFRGA 413
>gi|449466304|ref|XP_004150866.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449485213|ref|XP_004157102.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 473
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 108/377 (28%), Positives = 151/377 (40%), Gaps = 58/377 (15%)
Query: 71 RLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFW 130
RLRG A + K+ T +GN + +V +G P + DTGSDL W
Sbjct: 109 RLRGSK-ATKIPAKSGATIGSGN-----------YIVSVGLGTPKKYLSLIFDTGSDLTW 156
Query: 131 LPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCEL-------QKQCPSA 182
C C + ++ P+ S+T S + C+S C Q C SA
Sbjct: 157 TQCQPCARYCYNQKDP--------VFVPSQSTTYSNISCSSPDCSQLESGTGNQPGC-SA 207
Query: 183 GSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAP 242
C Y ++Y D + S G+ ++ L L S V FGCG+ G F A
Sbjct: 208 ARACIYGIQY-GDQSFSVGYFAKETLTLT-----STDVIENFLFGCGQNNRGLFGSAA-- 259
Query: 243 NGLFGLGMDKTSVPSILANQGLIPNSFSMCF--GSDGTGRISFGDKGSPGQGE-TPFSLR 299
GL GLG DK S+ A + FS C S TG ++FG G G + TP +
Sbjct: 260 -GLIGLGQDKISIVKQTAQK--YGQVFSYCLPKTSSSTGYLTFGGGGGGGALKYTP--IT 314
Query: 300 QTHPT---YNITITQVSVGGNAVNFEFS------AIFDSGTSFTYLNDPAYTQISETFNS 350
+ H Y + I + VGG + S AI DSGT T L AY+ + F
Sbjct: 315 KAHGVANFYGVDIVGMKVGGTQIPISSSVFSTSGAIIDSGTVITRLPPDAYSALKSAFEK 374
Query: 351 -LAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGL 409
+AK + S L + CY LS T + P V KGG ++ ++ + +
Sbjct: 375 GMAKYPKAPELSIL--DTCYDLSKYST-IQIPKVGFVFKGGEELDLDGIGIMYGASTSQV 431
Query: 410 YLYCLGVVKSDNVNIIG 426
L G V IIG
Sbjct: 432 CLAFAGNQDPSTVAIIG 448
>gi|116790042|gb|ABK25480.1| unknown [Picea sitchensis]
Length = 460
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 94/347 (27%), Positives = 144/347 (41%), Gaps = 60/347 (17%)
Query: 108 NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
+++G P+LSF LDTGSDL W C C C IY P+ SST SKV
Sbjct: 118 KMAIGTPSLSFSAILDTGSDLTWTQCKPCTDCYPQPTP---------IYDPSQSSTYSKV 168
Query: 167 PCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISF 226
PC+S++C+ +G+NC Y Y D + + G L + L S+S+ I+F
Sbjct: 169 PCSSSMCQALPMYSCSGANCEYLYSY-GDQSSTQGILSYESFTLT-----SQSL-PHIAF 221
Query: 227 GCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-----DGTGRI 281
GCG Q + GL G G S+ S L + N FS C S T +
Sbjct: 222 GCG--QENEGGGFSQGGGLVGFGRGPLSLISQLGQS--LGNKFSYCLVSITDSPSKTSPL 277
Query: 282 SFGDKGSPGQ---GETPFSLRQTHPT-YNITITQVSVGGNAVN-----FEFS------AI 326
G S TP ++ PT Y +++ +SVGG ++ F+ I
Sbjct: 278 FIGKTASLNAKTVSSTPLVQSRSRPTFYYLSLEGISVGGQLLDIADGTFDLQLDGTGGVI 337
Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLT 386
DSGT+ TYL Y + + S + + S++ + C+ + +P +
Sbjct: 338 IDSGTTVTYLEQSGYDVVKKAVIS-SINLPQVDGSNIGLDLCFEPQSGSSTSHFPTITFH 396
Query: 387 MKGGGPFFVNDPIVIVSSEPKGLYLY-------CLGVVKSDNVNIIG 426
+G + PK Y+Y CL ++ S+ ++I G
Sbjct: 397 FEGAD-----------FNLPKENYIYTDSSGIACLAMLPSNGMSIFG 432
>gi|19424106|gb|AAL87345.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
Length = 500
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 88/339 (25%), Positives = 139/339 (41%), Gaps = 42/339 (12%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+++ + VG PA + LDTGSD+ W+ C+ C C + +++P +SST
Sbjct: 162 YFSRIGVGTPAKDMYLVLDTGSDVNWIQCEPCADCYQQSDP---------VFNPTSSSTY 212
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
+ C++ C L + + C YQV Y DG+ + G L D + K +
Sbjct: 213 KSLTCSAPQCSLLETSACRSNKCLYQVSY-GDGSFTVGELATDTVTFGNSGKINN----- 266
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISF 283
++ GCG G F A GL G + + NQ + SFS C +G+ S
Sbjct: 267 VALGCGHDNEGLFTGAAGLLGLGGGVLS-------ITNQ-MKATSFSYCLVDRDSGKSSS 318
Query: 284 GDKGSP----GQGETPFSLRQTHPT-YNITITQVSVGGNAVN-----FEFSA------IF 327
D S G P + T Y + ++ SVGG V F+ A I
Sbjct: 319 LDFNSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVIL 378
Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTM 387
D GT+ T L AY + + F L ++ S+S F+ CY S T + P V
Sbjct: 379 DCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLST-VKVPTVAFHF 437
Query: 388 KGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
GG + ++ + G + + S +++IIG
Sbjct: 438 TGGKSLDLPAKNYLIPVDDSGTFCFAFAPTSS-SLSIIG 475
>gi|218186446|gb|EEC68873.1| hypothetical protein OsI_37494 [Oryza sativa Indica Group]
Length = 353
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 80/303 (26%), Positives = 129/303 (42%), Gaps = 39/303 (12%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ +S+G P + +V +DTGS L W+ C +C + + +GQ I++P SST
Sbjct: 6 YFMGISLGTPPVFNLVTIDTGSTLSWVQCKNCQIKCYDQAAKAGQ-----IFNPYNSSTY 60
Query: 164 SKVPCNSTLCE-------LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
SKV C++ C ++ C C Y +RY S G S G+L +D L LA++
Sbjct: 61 SKVGCSTEACNGMHMDLAVEYGCVEEDDTCIYSLRYGS-GEYSVGYLGKDRLTLASN--- 116
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD 276
+S+D+ I FGCG L G+ G G S + + Q +FS CF D
Sbjct: 117 -RSIDNFI-FGCGEDN----LYNGVNAGIIGFGTKSYSFFNQVCQQTDY-TAFSYCFPRD 169
Query: 277 --GTGRISFGDKGSP-GQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS--------A 325
G ++ G T P Y I Q+ + N + E
Sbjct: 170 HENEGSLTIGPYARDINLMWTKLIYYDHKPAY--AIQQLDMMVNGIRLEIDPYIYISKMT 227
Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNF-EYPVVN 384
I DSGT+ TY+ P + + + + K T D C++ + N+ ++P V
Sbjct: 228 IVDSGTADTYILSPVFDALDKAMTKEMQAKGYTRGWD-ERRICFISNSGSANWNDFPTVE 286
Query: 385 LTM 387
+ +
Sbjct: 287 MKL 289
>gi|242092878|ref|XP_002436929.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
gi|241915152|gb|EER88296.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
Length = 505
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 90/348 (25%), Positives = 143/348 (41%), Gaps = 42/348 (12%)
Query: 98 LNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSP 157
L++L F+ V G PA ++ +++DTGSD+ W+ C+ C V D P
Sbjct: 156 LDTLEFV--VTVGFGSPAQNYTLSIDTGSDVSWI--QCLPCSGHCYKQHDPVFD-----P 206
Query: 158 NTSSTSSKVPCNSTLCELQ-KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
S+T S VPC C +C ++G+ C Y+V Y DG+ + G L + L L++
Sbjct: 207 TKSATYSAVPCGHPQCAAAGGKCSNSGT-CLYKVTY-GDGSSTAGVLSHETLSLSSTRDL 264
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD 276
+FGCG+ G F GL + S+PS A +FS C S
Sbjct: 265 PG-----FAFGCGQTNLGEFGGVDGLVGLGRGAL---SLPSQAA--ATFGATFSYCLPSY 314
Query: 277 GT--GRISFGDKGSPGQGE------TPFSLRQTHPT-YNITITQVSVGG------NAVNF 321
T G ++ G + T ++ +P+ Y + + + +GG V
Sbjct: 315 DTTHGYLTMGSTTPAASNDDDDVQYTAMIQKEDYPSLYFVEVVSIDIGGYILPVPPTVFT 374
Query: 322 EFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYP 381
+FDSGT TYL AY + + F + + D PF+ CY + + F P
Sbjct: 375 RDGTLFDSGTILTYLPPEAYASLRDRFKFTMTQYKPAPAYD-PFDTCYDFTGHNAIF-MP 432
Query: 382 VVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNV---NIIG 426
V G F ++ +++ + CL V + NIIG
Sbjct: 433 AVAFKFSDGAVFDLSPVAILIYPDDTAPATGCLAFVPRPSTMPFNIIG 480
>gi|224099307|ref|XP_002311432.1| predicted protein [Populus trichocarpa]
gi|222851252|gb|EEE88799.1| predicted protein [Populus trichocarpa]
Length = 458
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 81/286 (28%), Positives = 123/286 (43%), Gaps = 51/286 (17%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSC---VHGLNSSSGQVIDFNIYSPNTSS 161
++ ++ +G P + ++ DTGSDL W+ C +H S+ + S+
Sbjct: 83 YFVSIRLGSPPQTLLLVADTGSDLTWVRCSACKTNCSIHPPGST---------FLARHST 133
Query: 162 TSSKVPCNSTLCELQKQCPSAG--------SNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
T S C S+LC+L Q P+ S C Y+ Y SDG+ ++GF ++ L T
Sbjct: 134 TFSPTHCFSSLCQLVPQ-PNPNPCNHTRLHSTCRYEYVY-SDGSKTSGFFSKETTTLNTS 191
Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPN---GLFGLGMDKTSVPSILANQGLIPNSFS 270
+ + S I+FGCG +G L G++ N G+ GLG S S L + SFS
Sbjct: 192 SGREMKLKS-IAFGCGFHASGPSLIGSSFNGASGVMGLGRGPISFASQLGRR--FGRSFS 248
Query: 271 MC-----FGSDGTGRISFGDKGSPGQGE------TPFSLRQTHPT-YNITITQVSVGGNA 318
C T + GD S + TP + PT Y I+I V V G
Sbjct: 249 YCLLDYTLSPPPTSYLMIGDVVSTKKDNKSMMSFTPLLINPEAPTFYYISIKGVFVDGVK 308
Query: 319 VNFEFSA-----------IFDSGTSFTYLNDPAYTQISETFNSLAK 353
++ + S + DSGT+ T+L +PAY +I F K
Sbjct: 309 LHIDPSVWSLDELGNGGTVIDSGTTLTFLTEPAYREILSAFKREVK 354
>gi|224072901|ref|XP_002303933.1| predicted protein [Populus trichocarpa]
gi|222841365|gb|EEE78912.1| predicted protein [Populus trichocarpa]
Length = 370
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 85/306 (27%), Positives = 122/306 (39%), Gaps = 53/306 (17%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ VG P + ++ALD D W+PC CV C +++ S+T
Sbjct: 35 YIVKAKVGTPPQTLLMALDNSYDAAWIPCKGCVGC------------SSTVFNTVKSTTF 82
Query: 164 SKVPCNSTLCELQKQCPS---AGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
+ C + C KQ P+ GS C + Y S +S L D + L+ D
Sbjct: 83 KTLGCGAPQC---KQVPNPICGGSTCTWNTTYGSSTILSN--LTRDTIALSMDPV----- 132
Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT-- 278
+FGC + TGS P GL G G S S Q L ++FS C S T
Sbjct: 133 -PYYAFGCIQKATGS---SVPPQGLLGFGRGPLSFLS--QTQNLYKSTFSYCLPSFRTLN 186
Query: 279 --GRISFGDKGSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEFSA--------- 325
G + G G P + +T L+ + Y + + + VG V+ SA
Sbjct: 187 FSGSLRLGPVGQPPRIKTTPLLKNPRRSSLYYVKLNGIRVGRKIVDIPRSALAFNPTTGA 246
Query: 326 --IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYV--LSPNQTNFEYP 381
IFDSGT FT L PAY + F + T +S F+ CY + P F +
Sbjct: 247 GTIFDSGTVFTRLVAPAYIAVRNEFRK--RVGNATVSSLGGFDTCYSVPIVPPTITFMFS 304
Query: 382 VVNLTM 387
+N+TM
Sbjct: 305 GMNVTM 310
>gi|356546378|ref|XP_003541603.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 439
Score = 75.1 bits (183), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 80/302 (26%), Positives = 122/302 (40%), Gaps = 39/302 (12%)
Query: 110 SVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPC 168
SVG P + +DTGSD+ WL C+ C C I+ P+ S T +PC
Sbjct: 96 SVGSPPFQVLGIVDTGSDILWLQCEPCEDCYKQTTP---------IFDPSKSKTYKTLPC 146
Query: 169 NSTLCE-LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFG 227
+S CE L+ S+ + C Y + Y DG+ S G L + L L + + S + G
Sbjct: 147 SSNTCESLRNTACSSDNVCEYSIDY-GDGSHSDGDLSVETLTLGSTDGSSVHFPKTV-IG 204
Query: 228 CGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG-----SDGTGRIS 282
CG G+F + + +G+ V I I FS C S+ + +++
Sbjct: 205 CGHNNGGTFQEEGSGI----VGLGGGPVSLISQLSSSIGGKFSYCLAPIFSESNSSSKLN 260
Query: 283 FGDKG---SPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-----------EFSAIFD 328
FGD G TP Y +T+ SVG N + F + + I D
Sbjct: 261 FGDAAVVSGRGTVSTPLDPLNGQVFYFLTLEAFSVGDNRIEFSGSSSSGSGSGDGNIIID 320
Query: 329 SGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMK 388
SGT+ T L Y + + + K +R S L CY + ++ + PV+ K
Sbjct: 321 SGTTLTLLPQEDYLNLESAVSDVIKLERARDPSKL-LSLCYKTTSDE--LDLPVITAHFK 377
Query: 389 GG 390
G
Sbjct: 378 GA 379
>gi|357521081|ref|XP_003630829.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355524851|gb|AET05305.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 526
Score = 75.1 bits (183), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 83/324 (25%), Positives = 127/324 (39%), Gaps = 53/324 (16%)
Query: 109 VSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
+ VG P F + D +D WL C C+ C +S I+ P+ SS+ + +
Sbjct: 191 IGVGGPPQKFYMIFDLQTDFTWLQCQPCIKCYDQPDS---------IFDPSQSSSYTLLS 241
Query: 168 CNSTLCEL--QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
C + C L C G C Y + Y DGT + G L+ + + + S VD R+S
Sbjct: 242 CETKHCNLLPNSSCSDDGY-CRYNITY-KDGTNTEGVLINETVSF----ESSGWVD-RVS 294
Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGD 285
GC G F+ +G FGLG S PS + + S+ + DG +
Sbjct: 295 LGCSNKNQGPFV---GSDGTFGLGRGSLSFPSRINASSM---SYCLVESKDGYSSSTLEF 348
Query: 286 KGSPGQGETPFSLRQ---THPTYNITITQVSVGGNAVNFEFSA-----------IFDSGT 331
P G L Q Y + + + VGG ++ S I S +
Sbjct: 349 NSPPCSGSVKAKLLQNPKAENLYYVGLKGIKVGGEKIDVPNSTFTIDPYGNGGMIVSSSS 408
Query: 332 SFTYLNDPAYTQISETFNSLAKEKR-ETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGG 390
T L + Y + + F +AK + E + L F+ CY LS N T E P++ + G
Sbjct: 409 LITMLENDTYNVVRDAF--VAKTQHLERLKAFLQFDTCYNLSSNNT-VELPILEFEVNDG 465
Query: 391 GPFFVNDPIVIVSSEPKGLYLYCL 414
+ + PK YLY +
Sbjct: 466 KSWLL----------PKESYLYAV 479
>gi|359806832|ref|NP_001241567.1| uncharacterized protein LOC100819698 precursor [Glycine max]
gi|255638149|gb|ACU19388.1| unknown [Glycine max]
Length = 437
Score = 75.1 bits (183), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 83/306 (27%), Positives = 131/306 (42%), Gaps = 49/306 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ +G PA + ++A+DT +D W+PC CV C ++P S+T
Sbjct: 98 YIVKAKIGTPAQTLLLAMDTSNDASWVPCTACVGCSTT-----------TPFAPAKSTTF 146
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGF-LVEDVLHLATDEKQSKSVDS 222
KV C ++ C+ + GS C + Y GT S LV+D + LATD +
Sbjct: 147 KKVGCGASQCKQVRNPTCDGSACAFNFTY---GTSSVAASLVQDTVTLATDPVPA----- 198
Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT---- 278
+FGC + TGS + GL + + Q L ++FS C S T
Sbjct: 199 -YAFGCIQKVTGSSVPPQGLLGLGRGPLSLLA-----QTQKLYQSTFSYCLPSFKTLNFS 252
Query: 279 GRISFGDKGSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEFSA----------- 325
G + G P + + L+ + Y + + + VG V+ A
Sbjct: 253 GSLRLGPVAQPKRIKFTPLLKNPRRSSLYYVNLVAIRVGRRIVDIPPEALAFNANTGAGT 312
Query: 326 IFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEYCY---VLSPNQTNFEYP 381
+FDSGT FT L +PAY + F +A K+ T TS F+ CY +++P T F +
Sbjct: 313 VFDSGTVFTRLVEPAYNAVRNEFRRRIAVHKKLTVTSLGGFDTCYTAPIVAPTIT-FMFS 371
Query: 382 VVNLTM 387
+N+T+
Sbjct: 372 GMNVTL 377
>gi|413923784|gb|AFW63716.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 531
Score = 75.1 bits (183), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 96/321 (29%), Positives = 141/321 (43%), Gaps = 40/321 (12%)
Query: 98 LNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSP 157
LN+L +L V +G PA S + +DTGSD+ W+ C S H ++ P
Sbjct: 193 LNTLEYL--ITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADP--------LFDP 242
Query: 158 NTSSTSSKVPCNSTLCELQKQ----CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
++SST S C S C Q C S+ S C Y V Y DG+ +TG D L L +
Sbjct: 243 SSSSTYSPFSCGSADCAQLGQEGNGC-SSSSQCQYIVTY-GDGSSTTGTYSSDTLALGSS 300
Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
+S FGC V++G F D +GL GLG S+ S A G + +FS C
Sbjct: 301 AVRS------FQFGCSNVESG-FNDQT--DGLMGLGGGAQSLVSQTA--GTLGRAFSYCL 349
Query: 274 --GSDGTGRISFGDKGSPGQG---ETPFSLRQTHPT-YNITITQVSVGGNAVN-----FE 322
+G ++ G G G +TP PT Y + + + VGG ++ F
Sbjct: 350 PPTPSSSGFLTLGAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFS 409
Query: 323 FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
+ DSGT T L AY+ +S F + K+ S + + C+ S Q++ P
Sbjct: 410 AGTVMDSGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSGI-LDTCFDFS-GQSSVSIPS 467
Query: 383 VNLTMKGGGPFFVNDPIVIVS 403
V L GG ++ +I+S
Sbjct: 468 VALVFSGGAVVSLDASGIILS 488
>gi|15222611|ref|NP_173922.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12321511|gb|AAG50814.1|AC079281_16 hypothetical protein [Arabidopsis thaliana]
gi|20466516|gb|AAM20575.1| unknown protein [Arabidopsis thaliana]
gi|23198172|gb|AAN15613.1| unknown protein [Arabidopsis thaliana]
gi|110736960|dbj|BAF00436.1| hypothetical protein [Arabidopsis thaliana]
gi|332192515|gb|AEE30636.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 483
Score = 75.1 bits (183), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 89/305 (29%), Positives = 130/305 (42%), Gaps = 48/305 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++T V +G+PA + LDTGSD+ WL C C C H I+ P++SS+
Sbjct: 148 YFTRVGIGKPAREVYMVLDTGSDVNWLQCTPCADCYHQTEP---------IFEPSSSSSY 198
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
+ C++ C + + C Y+V Y DG+ + G + L + + Q+
Sbjct: 199 EPLSCDTPQCNALEVSECRNATCLYEVSY-GDGSYTVGDFATETLTIGSTLVQN------ 251
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGTGR 280
++ GCG G F+ A GLG ++PS L SFS C SD
Sbjct: 252 VAVGCGHSNEGLFVGAAGLL---GLGGGLLALPSQLNT-----TSFSYCLVDRDSDSAST 303
Query: 281 ISFGDKGSPGQGETPFSLR--QTHPTYNITITQVSVGGNAV-----NFEFSA------IF 327
+ FG SP P LR Q Y + +T +SVGG + +FE I
Sbjct: 304 VDFGTSLSPDAVVAPL-LRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGIII 362
Query: 328 DSGTSFTYLNDPAYTQISETF--NSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNL 385
DSGT+ T L Y + ++F +L EK + F+ CY LS +T E P V
Sbjct: 363 DSGTAVTRLQTEIYNSLRDSFVKGTLDLEK---AAGVAMFDTCYNLSA-KTTVEVPTVAF 418
Query: 386 TMKGG 390
GG
Sbjct: 419 HFPGG 423
>gi|115479489|ref|NP_001063338.1| Os09g0452800 [Oryza sativa Japonica Group]
gi|51535938|dbj|BAD38020.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113631571|dbj|BAF25252.1| Os09g0452800 [Oryza sativa Japonica Group]
gi|125605918|gb|EAZ44954.1| hypothetical protein OsJ_29597 [Oryza sativa Japonica Group]
gi|215740967|dbj|BAG97462.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 453
Score = 74.7 bits (182), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 78/277 (28%), Positives = 113/277 (40%), Gaps = 50/277 (18%)
Query: 102 GFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNT 159
G L Y +++VG P LDTGSDL W CD C +C+ + ++SP
Sbjct: 94 GDLEYVLDLAVGTPPQPITALLDTGSDLIWTQCDTCTACLRQPDP---------LFSPRM 144
Query: 160 SSTSSKVPCNSTLCE--LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
SS+ + C LC L C C Y+ Y DGT + G+ + A+ ++
Sbjct: 145 SSSYEPMRCAGQLCGDILHHSCVRP-DTCTYRYSY-GDGTTTLGYYATERFTFASSSGET 202
Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FG 274
+SV + FGCG + GS + +G+ G G D S+ S L+ + FS C +
Sbjct: 203 QSVP--LGFGCGTMNVGSLNNA---SGIVGFGRDPLSLVSQLSIR-----RFSYCLTPYA 252
Query: 275 SDGTGRISFG---------DKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFS 324
S + FG D P Q TP +PT Y + T V+VG + S
Sbjct: 253 SSRKSTLQFGSLADVGLYDDATGPVQ-TTPILQSAQNPTFYYVAFTGVTVGARRLRIPAS 311
Query: 325 A-----------IFDSGTSFTYLNDPAYTQISETFNS 350
A I DSGT+ T ++ F S
Sbjct: 312 AFALRPDGSGGVIIDSGTALTLFPVAVLAEVVRAFRS 348
>gi|413918484|gb|AFW58416.1| hypothetical protein ZEAMMB73_998053 [Zea mays]
Length = 475
Score = 74.7 bits (182), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 101/355 (28%), Positives = 139/355 (39%), Gaps = 62/355 (17%)
Query: 108 NVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
++SVG PAL + +DTGSDL W C CV N ++ ++ P SST + +P
Sbjct: 119 DLSVGTPALPYAAIVDTGSDLVW--TQCKPCVECFNQTT------PVFDPAASSTYAALP 170
Query: 168 CNSTLCE--------LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
C+S LC SA S C Y Y D + + G L + LA +
Sbjct: 171 CSSALCADLPTSTCASSSSSSSASSPCGYTYTY-GDASSTQGVLATETFTLARQKVPG-- 227
Query: 220 VDSRISFGCGRVQTGS-FLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS--D 276
++FGCG G F GA GL GLG S+ S L + FS C S D
Sbjct: 228 ----VAFGCGDTNEGDGFTQGA---GLVGLGRGPLSLVSQLGI-----DRFSYCLTSLDD 275
Query: 277 GTGRISF----------GDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA 325
GR +P Q TP + P+ Y +++T ++VG + SA
Sbjct: 276 AAGRSPLLLGSAAGISASAATAPAQ-TTPLVKNPSQPSFYYVSLTGLTVGSTRLALPSSA 334
Query: 326 -----------IFDSGTSFTYLNDPAYTQISETF---NSLAKEKRETSTSDLPFEYCYVL 371
I DSGTS TYL AY + + F SL DL F+
Sbjct: 335 FAIQDDGTGGVIVDSGTSITYLELRAYRALRKAFVAHMSLPTVDASEIGLDLCFQGPAGA 394
Query: 372 SPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
+ P + L GG + +V G CL V+ S ++IIG
Sbjct: 395 VDQDVQVQVPKLVLHFDGGADLDLPAENYMVLDSASG--ALCLTVMASRGLSIIG 447
>gi|302809855|ref|XP_002986620.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
gi|300145803|gb|EFJ12477.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
Length = 423
Score = 74.7 bits (182), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 97/346 (28%), Positives = 153/346 (44%), Gaps = 53/346 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ ++ VG P + + DTGSD+ WL C C SC GQ +++P+ SST
Sbjct: 81 YFVSLGVGTPPRTVNMVADTGSDVLWLQCLPCQSCY-------GQTDP--LFNPSFSSTF 131
Query: 164 SKVPCNSTLCE--LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
+ C S+LC+ L + C + C YQV Y DG+ + G + L ++ S
Sbjct: 132 QSITCGSSLCQQLLIRGCRR--NQCLYQVSY-GDGSFTVGEFSTETLSFGSNAVNS---- 184
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGT 278
++ GCG G F A L GLG S PS + L + FS C S G+
Sbjct: 185 --VAIGCGHNNQGLFTGAAG---LLGLGKGLLSFPSQVGQ--LYGSVFSYCLPTRESTGS 237
Query: 279 GRISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGGNAVNFEFSA--------- 325
+ FG++ + F+ T+P Y + + + VGG +V+ +
Sbjct: 238 VPLIFGNQAVASNAQ--FTTLLTNPKLDTFYYVEMVGIKVGGTSVSIPAGSLSLDSSTGN 295
Query: 326 ---IFDSGTSFTYLNDPAYTQISETFNS-LAKEKRETSTSDLPFEYCYVLSPNQTNFEYP 381
I DSGT+ T L AY + + F + + + + TS L F+ CY LS +++ P
Sbjct: 296 GGVILDSGTAVTRLVTSAYNPMRDAFRAGMPSDAKMTSGFSL-FDTCYDLS-GRSSIMLP 353
Query: 382 VVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVV-KSDNVNIIG 426
V+ GG + ++V + G YCL S+N +IIG
Sbjct: 354 AVSFVFNGGATMALPAQNIMVPVDNSG--TYCLAFAPNSENFSIIG 397
>gi|357481205|ref|XP_003610888.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512223|gb|AES93846.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 413
Score = 74.7 bits (182), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 90/333 (27%), Positives = 143/333 (42%), Gaps = 46/333 (13%)
Query: 111 VGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCN 169
+G P + +DTGSDL W+ C C+ C + +N ++ P SST + + C+
Sbjct: 70 IGTPPIKISGTVDTGSDLIWVQCVPCLGCYNQINP---------MFDPLKSSTYTNISCD 120
Query: 170 STLC--ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFG 227
S LC +C S C Y Y +D +++ G L ++ + L ++ + S+ I FG
Sbjct: 121 SPLCYKPYIGEC-SPEKRCDYTYGY-ADSSLTKGVLAQETVTLTSNTGKPISLQG-ILFG 177
Query: 228 CGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILA--------NQGLIPNSFSMCFGSDGTG 279
CG TG+F D GL GLG TS+ S + +Q L+P + S
Sbjct: 178 CGHNNTGNFNDHEM--GLIGLGGGPTSLVSQIGPLFGGKKFSQCLVPFLTDITISS---- 231
Query: 280 RISFGDKGSPGQGE----TPFSLR-QTHPTYNITITQVSVGG-----NAVNFEFSAIFDS 329
++SFG KGS GE TP R Q +Y +T+ +SV N+ + + + DS
Sbjct: 232 QMSFG-KGSEVLGEGVVTTPLVQREQDMTSYYVTLLGISVEDTYLPMNSTIEKGNMLVDS 290
Query: 330 GTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKG 389
GT L Y ++ + + T L + CY QTN + P + +G
Sbjct: 291 GTPPNILPQQLYDRVYVEVKNKVPLEPITDDPSLGPQLCYR---TQTNLKGPTLTYHFEG 347
Query: 390 GGPFFVNDPI-VIVSSEPKGLYLYCLGVVKSDN 421
PI + P+ ++CL + N
Sbjct: 348 ANLLLT--PIQTFIPPTPETKGVFCLAITNCAN 378
>gi|413944596|gb|AFW77245.1| hypothetical protein ZEAMMB73_545774 [Zea mays]
gi|414876929|tpg|DAA54060.1| TPA: hypothetical protein ZEAMMB73_875469 [Zea mays]
Length = 459
Score = 74.7 bits (182), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 94/349 (26%), Positives = 138/349 (39%), Gaps = 48/349 (13%)
Query: 97 RLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPC--DCVSCVHGLNSSSGQVIDFNI 154
R++ G + S+G P DTGSDL W C C + S S
Sbjct: 83 RMDDSGGAYDMEFSMGTPPQKLTALADTGSDLIWAKCGGACTTSCEPQGSPS-------- 134
Query: 155 YSPNTSSTSSKVPCNSTLCELQKQ-----CPSAGSNCPYQVRY---LSDGTMSTGFLVED 206
Y PN SST +K+PC+ LC L + C +AG+ C Y+ Y D + GFL +
Sbjct: 135 YLPNASSTFAKLPCSDRLCSLLRSDSVAWCAAAGAECDYRYSYGLGDDDHHYTQGFLARE 194
Query: 207 VLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIP 266
L D S + FGC G + G+ + P L +Q L
Sbjct: 195 TFTLGADAVPS------VRFGCTTASEGGYGSGSG-------LVGLGRGPLSLVSQ-LNA 240
Query: 267 NSFSMCFGSDGTGR--ISFGDKGS-PGQGETPFSLRQTHPTYNITITQVSVGGNA---VN 320
++F C SD + + FG S G L + Y + + +S+G V
Sbjct: 241 STFMYCLTSDASKASPLLFGSLASLTGAQVQSTGLLASTTFYAVNLRSISIGSATTPGVG 300
Query: 321 FEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPN--QTNF 378
+FDSGT+ TYL +PAY++ F S + T FE C+ N +N
Sbjct: 301 EPEGVVFDSGTTLTYLAEPAYSEAKAAFLSQTSLDQVEDTDG--FEACFQKPANGRLSNA 358
Query: 379 EYPVVNLTMKGGGPFF-VNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
P + L G V + +V V + C V +S +++IIG
Sbjct: 359 AVPTMVLHFDGADMALPVANYVVEVEDG-----VVCWIVQRSPSLSIIG 402
>gi|224130548|ref|XP_002320868.1| predicted protein [Populus trichocarpa]
gi|222861641|gb|EEE99183.1| predicted protein [Populus trichocarpa]
Length = 488
Score = 74.7 bits (182), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 87/309 (28%), Positives = 127/309 (41%), Gaps = 48/309 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++T + VG PA + LDTGSD+ W+ C C+ C + ++ P S +
Sbjct: 145 YFTRLGVGTPARYVYMVLDTGSDIVWIQCAPCIKCYSQTDP---------VFDPTKSRSF 195
Query: 164 SKVPCNSTLCELQKQ--CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
+ +PC S LC C + C YQV Y DG+ + G + L
Sbjct: 196 ANIPCGSPLCRRLDYPGCSTKKQICLYQVSY-GDGSFTVGEFSTETLTFRGTRV------ 248
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG----SDG 277
R+ GCG G F+ A GLG + S PS + + + FS C G S
Sbjct: 249 GRVVLGCGHDNEGLFVGAAGLL---GLGRGRLSFPSQIGRR--FNSKFSYCLGDRSASSR 303
Query: 278 TGRISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGGNAVN------FEFSA-- 325
I FGD S T F+ ++P Y + + +SVGG V+ F+ +
Sbjct: 304 PSSIVFGD--SAISRTTRFTPLLSNPKLDTFYYVELLGISVGGTRVSGISASLFKLDSTG 361
Query: 326 ----IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYP 381
I DSGTS T L AY + + F A + L F+ C+ LS +T + P
Sbjct: 362 NGGVIIDSGTSVTRLTRAAYVALRDAFLVGASNLKRAPEFSL-FDTCFDLS-GKTEVKVP 419
Query: 382 VVNLTMKGG 390
V L +G
Sbjct: 420 TVVLHFRGA 428
>gi|225217039|gb|ACN85323.1| aspartic proteinase nepenthesin-1 precursor [Oryza brachyantha]
Length = 287
Score = 74.7 bits (182), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 80/259 (30%), Positives = 117/259 (45%), Gaps = 27/259 (10%)
Query: 182 AGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAA 241
+G +C Y V+Y DG+ + GF D L L++ + FGCG G F + A
Sbjct: 17 SGGHCLYGVQY-GDGSYTIGFFAMDTLTLSSHDAIKG-----FRFGCGERNEGLFGEAA- 69
Query: 242 PNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--GSDGTGRISFGDKGSPGQGE----TP 295
GL GLG KTS+P ++ F+ CF S GTG + FG SP TP
Sbjct: 70 --GLLGLGRGKTSLPVQTYDK--YGGVFAHCFPARSSGTGYLEFGPGSSPAVSAKLSTTP 125
Query: 296 FSLRQTHPT-YNITITQVSVGGNAVNFE---FSA---IFDSGTSFTYLNDPAYTQISETF 348
L T PT Y + +T + VGG + F+A I DSGT T L AY+ + F
Sbjct: 126 M-LIDTGPTFYYVGMTGIRVGGKLLPIPQSVFAAAGTIVDSGTVITRLPPAAYSSLRSAF 184
Query: 349 -NSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPK 407
S+A + + + + CY L+ + P V+L +GG V+ +I ++
Sbjct: 185 AASMAARGYKRAPALSLLDTCYDLT-GASEVAIPTVSLLFQGGVSLDVDASGIIYAASVS 243
Query: 408 GLYLYCLGVVKSDNVNIIG 426
L G +D+V I+G
Sbjct: 244 QACLGFAGNEAADDVAIVG 262
>gi|297828001|ref|XP_002881883.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
lyrata]
gi|297327722|gb|EFH58142.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
lyrata]
Length = 529
Score = 74.7 bits (182), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 80/282 (28%), Positives = 110/282 (39%), Gaps = 46/282 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ +V VG P F + LDTGSDL WL C C C H + Y P TS++
Sbjct: 162 YFMDVLVGTPPKHFSLILDTGSDLNWLQCLPCYDCFHQNEA---------FYDPKTSASF 212
Query: 164 SKVPCNSTLCELQK------QCPSAGSNCPYQVRYLSDGTMSTGFLVED-VLHLATDEKQ 216
+ CN C L QC S +CPY Y + F VE ++L T E +
Sbjct: 213 KNITCNDPRCSLISSPEPPVQCKSDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTEGR 272
Query: 217 SKSVD-SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-- 273
S + FGCG G F + GL + +S Q L +SFS C
Sbjct: 273 SSEYKVENMMFGCGHWNRGLFSGASGLLGLGRGPLSFSS-----QLQSLYGHSFSYCLVD 327
Query: 274 ---GSDGTGRISFG-DKGSPGQGETPFS------LRQTHPTYNITITQVSVGGNAVNFEF 323
++ + ++ FG DK F+ Y I I + VGG A++
Sbjct: 328 RNSDTNVSSKLIFGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILVGGEALDIPE 387
Query: 324 S-----------AIFDSGTSFTYLNDPAYTQISETFNSLAKE 354
I DSGT+ +Y +PAY I F KE
Sbjct: 388 ETWNISPDGAGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKE 429
>gi|225436202|ref|XP_002271145.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 440
Score = 74.7 bits (182), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 105/414 (25%), Positives = 157/414 (37%), Gaps = 61/414 (14%)
Query: 28 GTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHR-DRYFRLRGRGLAAQGNDKTP 86
G F D HR D PK + A R DR+FR A + TP
Sbjct: 33 GRFSIDLIHR------------DSPKSPLYNPSETPAERLDRFFRRFMSFSEASISPNTP 80
Query: 87 LT-FSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNS 144
S+ N Y + +S+G P DTGSDL W C C+SC N
Sbjct: 81 EPPVSSNNGEYLMK---------ISIGTPPFDVYGIYDTGSDLMWTQCLPCLSCYKQKNP 131
Query: 145 SSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQK--QCPSAGSNCPYQVRYLSDGTMSTGF 202
++ P+ S++ +V C S C L C C + Y DG+++ G
Sbjct: 132 ---------MFDPSKSTSFKEVSCESQQCRLLDTVSCSQPQKLCDFSYGY-GDGSLAQGV 181
Query: 203 LVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQ 262
+ + L L ++ Q S+ I FGCG +G+F + GLFG G S+ S + +
Sbjct: 182 IATETLTLNSNSGQPTSI-LNIVFGCGHNNSGTFNENEM--GLFGTGGRPLSLTSQIMST 238
Query: 263 GLIPNSFSMC---FGSDG--TGRISFGDKGSPGQGE---TPFSLRQTHPTYNITITQVSV 314
FS C F +D T +I FG + + TP + Y +T+ +SV
Sbjct: 239 LGSGRKFSQCLVPFRTDPSITSKIIFGPEAEVSGSDVVSTPLVTKDDPTYYFVTLDGISV 298
Query: 315 GGNAVNFEFSA--------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFE 366
G F S+ D+GT T L Y ++ + A DL +
Sbjct: 299 GDKLFPFSSSSPMATKGNVFIDAGTPPTLLPRDFYNRLVQGVKE-AIPMEPVQDPDLQPQ 357
Query: 367 YCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD 420
CY + T + P+ LT G P+ S +G+Y + + + D
Sbjct: 358 LCYR---SATLIDGPI--LTAHFDGADVQLKPLNTFISPKEGVYCFAMQPIDGD 406
>gi|224057272|ref|XP_002299201.1| predicted protein [Populus trichocarpa]
gi|118483775|gb|ABK93780.1| unknown [Populus trichocarpa]
gi|222846459|gb|EEE84006.1| predicted protein [Populus trichocarpa]
Length = 425
Score = 74.7 bits (182), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 86/304 (28%), Positives = 131/304 (43%), Gaps = 49/304 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ +VG PA +F++ALDT +D W+PC+ CV C +++ TS+T
Sbjct: 90 YIVKANVGTPAQTFLMALDTSNDAAWIPCNGCVGCSS------------TVFNSVTSTTF 137
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
+ C++ C+ GS C + Y +S L D + L+TD +
Sbjct: 138 KTLGCDAPQCKQVPNPTCGGSTCTWNTTYGGSTILSN--LTRDTIALSTD------IVPG 189
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT----G 279
+FGC + TGS P GL GLG S S Q L ++FS C S T G
Sbjct: 190 YTFGCIQKTTGS---SVPPQGLLGLGRGPLSFLS--QTQDLYKSTFSYCLPSFRTLNFSG 244
Query: 280 RISFGDKGSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEFSA-----------I 326
+ G G P + +T L+ + Y + + + VG V+ SA I
Sbjct: 245 TLRLGPAGQPLRIKTTPLLKNPRRSSLYYVNLIGIRVGRKIVDIPASALAFNPTTGAGTI 304
Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY---VLSPNQTNFEYPVV 383
FDSGT FT L P YT + + F +S F+ CY +++P T F + +
Sbjct: 305 FDSGTVFTRLVAPVYTAVRDEFRKRVGNAIVSSLGG--FDTCYTGPIVAPTMT-FMFSGM 361
Query: 384 NLTM 387
N+T+
Sbjct: 362 NVTL 365
>gi|194707866|gb|ACF88017.1| unknown [Zea mays]
Length = 461
Score = 74.7 bits (182), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 96/321 (29%), Positives = 141/321 (43%), Gaps = 40/321 (12%)
Query: 98 LNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSP 157
LN+L +L V +G PA S + +DTGSD+ W+ C S H ++ P
Sbjct: 123 LNTLEYL--ITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADP--------LFDP 172
Query: 158 NTSSTSSKVPCNSTLCELQKQ----CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
++SST S C S C Q C S+ S C Y V Y DG+ +TG D L L +
Sbjct: 173 SSSSTYSPFSCGSADCAQLGQEGNGC-SSSSQCQYIVTY-GDGSSTTGTYSSDTLALGSS 230
Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
+S FGC V++G F D +GL GLG S+ S A G + +FS C
Sbjct: 231 AVRS------FQFGCSNVESG-FNDQT--DGLMGLGGGAQSLVSQTA--GTLGRAFSYCL 279
Query: 274 --GSDGTGRISFGDKGSPGQG---ETPFSLRQTHPT-YNITITQVSVGGNAVN-----FE 322
+G ++ G G G +TP PT Y + + + VGG ++ F
Sbjct: 280 PPTPSSSGFLTLGAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFS 339
Query: 323 FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
+ DSGT T L AY+ +S F + K+ S + + C+ S Q++ P
Sbjct: 340 AGTVMDSGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSGI-LDTCFDFS-GQSSVSIPS 397
Query: 383 VNLTMKGGGPFFVNDPIVIVS 403
V L GG ++ +I+S
Sbjct: 398 VALVFSGGAVVSLDASGIILS 418
>gi|147811402|emb|CAN61225.1| hypothetical protein VITISV_006732 [Vitis vinifera]
Length = 440
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 105/414 (25%), Positives = 156/414 (37%), Gaps = 61/414 (14%)
Query: 28 GTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHR-DRYFRLRGRGLAAQGNDKTP 86
G F D HR D PK + A R DR+FR A + TP
Sbjct: 33 GRFSIDLIHR------------DSPKSPLYNPSETPAERLDRFFRRFMSFSEASISPNTP 80
Query: 87 LT-FSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNS 144
S+ N Y + +S+G P DTGSDL W C C+SC N
Sbjct: 81 EPPVSSNNGEYLMK---------ISIGTPPFDVYGIYDTGSDLMWTQCLPCLSCYKQKNP 131
Query: 145 SSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQK--QCPSAGSNCPYQVRYLSDGTMSTGF 202
++ P+ S++ +V C S C L C C + Y DG+++ G
Sbjct: 132 ---------MFDPSKSTSFKEVSCESQQCRLLDTVSCSQPQKLCDFSYGY-GDGSLAQGV 181
Query: 203 LVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQ 262
+ + L L ++ Q S+ I FGCG +G+F + GLFG G S+ S + +
Sbjct: 182 IATETLTLNSNSGQPXSI-XNIVFGCGHNNSGTFNENEM--GLFGTGGRPLSLTSQIMST 238
Query: 263 GLIPNSFSMC---FGSDG--TGRISFGDKGSPGQG---ETPFSLRQTHPTYNITITQVSV 314
FS C F +D T +I FG + TP + Y +T+ +SV
Sbjct: 239 LGSGRKFSQCLVPFRTDPSITSKIIFGPEAEVSGSXVVSTPLVTKDDPTYYFVTLDGISV 298
Query: 315 GGNAVNFEFSA--------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFE 366
G F S+ D+GT T L Y ++ + A DL +
Sbjct: 299 GDKLFPFSSSSPMATKGNVFIDAGTPPTLLPRDFYNRLVQGVKE-AIPMEPVQDPDLQPQ 357
Query: 367 YCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD 420
CY + T + P+ LT G P+ S +G+Y + + + D
Sbjct: 358 LCYR---SATLIDGPI--LTAHFDGADVQLKPLNTFISPKEGVYCFAMQPIDGD 406
>gi|88174558|gb|ABD39354.1| chloroplast nucleoid DNA-binding protein [Oryza meridionalis]
Length = 321
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 86/350 (24%), Positives = 143/350 (40%), Gaps = 57/350 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+ +V +G PA + IV +DTGS W+ C+C C H + + + S+T +
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCECDGC-H---------TNPRTFLQSRSTTCA 50
Query: 165 KVPCNSTLCELQKQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
KV C +++C L P +CP++V Y DG+ S G L +D L + +K
Sbjct: 51 KVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSY-QDGSASYGILYQDTLTFSDVQKIPG- 108
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC------- 272
+FGC G+ G +GL G+G SV L + FS C
Sbjct: 109 ----FTFGCNLDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPTFDGFSYCLPLQMSE 160
Query: 273 --FGSDGTGRISFGDKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFS---- 324
F S TG S G + + ++ + + +T +SV G + S
Sbjct: 161 RGFFSKTTGYFSLGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVFSR 220
Query: 325 --AIFDSGTSFTYLNDPAYTQISETFNSL------AKEKRETSTSDLPFEYCYVLSPNQT 376
+FDSG+ +Y+ D A + + + L A+E+ E + CY +
Sbjct: 221 KGVVFDSGSELSYIPDRALSVLRQRIRELLLKRGAAEEESERN--------CYDMRSVDE 272
Query: 377 NFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
+ P ++L G F + V V + ++CL + +V+IIG
Sbjct: 273 G-DMPAISLHFDDGARFDLGSHGVFVERSVQEQDVWCLAFAPTKSVSIIG 321
>gi|242092902|ref|XP_002436941.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
gi|241915164|gb|EER88308.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
Length = 445
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 89/311 (28%), Positives = 127/311 (40%), Gaps = 43/311 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFN--IYSPNTSST 162
+ VS G PA+ +V +DTGSDL WL C SSGQ ++ P+ SST
Sbjct: 112 YVATVSFGTPAVPQVVVIDTGSDLTWLQCK--------PCSSGQCSPQKDPLFDPSHSST 163
Query: 163 SSKVPCNSTLCELQKQ------CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
S VPC S C+ C S G C + + Y+ DGT + G +D L LA
Sbjct: 164 YSAVPCASGECKKLAADAYGSGC-SNGQPCGFAISYV-DGTSTVGVYGKDKLTLAPG--- 218
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD 276
++ FGCG ++ + + L Q FS C +
Sbjct: 219 --AIVKDFYFGCGHSKSSLPGLFDG-------LLGLGRLSESLGAQYGGGGGFSYCLPAV 269
Query: 277 GT--GRISFGDKGSP-GQGETPFSLRQTHPTYN-ITITQVSVGGNAVNFEFSA-----IF 327
+ G ++FG +P G TP PT++ +T+ ++VGG ++ SA I
Sbjct: 270 NSKPGFLAFGAGRNPSGFVFTPMGRVPGQPTFSTVTLAGITVGGKKLDLRPSAFSGGMIV 329
Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTM 387
DSGT T L Y + F K R DL + CY L+ N P + LT
Sbjct: 330 DSGTVVTVLQSTVYRALRAAFREAMKAYRLVH-GDL--DTCYDLT-GYKNVVVPKIALTF 385
Query: 388 KGGGPFFVNDP 398
GG ++ P
Sbjct: 386 SGGATINLDVP 396
>gi|414589630|tpg|DAA40201.1| TPA: hypothetical protein ZEAMMB73_629620 [Zea mays]
Length = 443
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 79/271 (29%), Positives = 108/271 (39%), Gaps = 45/271 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ + +G P + LDTGSDL W C C+ CV Q + + P S+T
Sbjct: 90 YLMEMGIGTPTRYYSAILDTGSDLIWTQCAPCLLCVD-------QPTPY--FDPARSATY 140
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
+ C S C C YQ Y D + G L + T+E +
Sbjct: 141 RSLGCASPACNALYYPLCYQKVCVYQYFY-GDSASTAGVLANETFTFGTNETRVSL--PG 197
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDGTGR 280
ISFGCG + GS +G +G+ G G S+ S L + FS C F S R
Sbjct: 198 ISFGCGNLNAGSLANG---SGMVGFGRGSLSLVSQLGSP-----RFSYCLTSFLSPVPSR 249
Query: 281 ISFG--------DKGSPGQGETPFSLRQTHPT-YNITITQVSVGG------------NAV 319
+ FG + S TPF + PT Y + +T +SVGG N
Sbjct: 250 LYFGVYATLNSTNASSEPVQSTPFVVNPALPTMYFLNMTGISVGGYLLPIDPAVFAINDT 309
Query: 320 NFEFSAIFDSGTSFTYLNDPAYTQISETFNS 350
+ I DSGT+ TYL +PAY + F S
Sbjct: 310 DGTGGTIIDSGTTITYLAEPAYDAVRAAFAS 340
>gi|118482048|gb|ABK92955.1| unknown [Populus trichocarpa]
Length = 425
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 86/304 (28%), Positives = 131/304 (43%), Gaps = 49/304 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ +VG PA +F++ALDT +D W+PC+ CV C +++ TS+T
Sbjct: 90 YIVKANVGTPAQTFLMALDTSNDAAWIPCNGCVGCSS------------TVFNSVTSTTF 137
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
+ C++ C+ GS C + Y +S L D + L+TD +
Sbjct: 138 KTLGCDAPQCKQVPNPTCGGSTCTWNTTYGGSTILSN--LTRDTIALSTD------IVPG 189
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT----G 279
+FGC + TGS P GL GLG S S Q L ++FS C S T G
Sbjct: 190 YTFGCIQKTTGS---SVPPQGLLGLGRGPLSFLS--QTQDLYKSTFSYCLPSFRTLNFSG 244
Query: 280 RISFGDKGSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEFSA-----------I 326
+ G G P + +T L+ + Y + + + VG V+ SA I
Sbjct: 245 TLRLGPAGQPLRIKTTPLLKNPRRSSLYYVNLIGIRVGRKIVDIPASALAFNPTTGAGTI 304
Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY---VLSPNQTNFEYPVV 383
FDSGT FT L P YT + + F +S F+ CY +++P T F + +
Sbjct: 305 FDSGTVFTRLVAPVYTAVRDEFRKRVGNAIVSSLGG--FDTCYTGPIVAPTMT-FMFSGM 361
Query: 384 NLTM 387
N+T+
Sbjct: 362 NVTL 365
>gi|297805038|ref|XP_002870403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316239|gb|EFH46662.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 440
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 78/307 (25%), Positives = 126/307 (41%), Gaps = 41/307 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ N+S+G P + DTGSDL W C C C ++ ++ P SST
Sbjct: 94 YLMNISLGTPPFPIMAIADTGSDLLWTQCKPCDDCYTQVDP---------LFDPKASSTY 144
Query: 164 SKVPCNSTLC---ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
V C+S+ C E Q C + + C Y Y D + + G + D L L + + + +
Sbjct: 145 KDVSCSSSQCTALENQASCSTEDNTCSYSTSY-GDRSYTKGNIAVDTLTLGSTDTRPVQL 203
Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-----GS 275
+ I GCG G+F G +G+ +V I I FS C +
Sbjct: 204 KNII-IGCGHNNAGTF----NKKGSGIVGLGGGAVSLITQLGDSIDGKFSYCLVPLTSEN 258
Query: 276 DGTGRISFGDKG---SPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF--------EFS 324
D T +I+FG G TP + Y +T+ +SVG V + E +
Sbjct: 259 DRTSKINFGTNAVVSGTGVVSTPLIAKSQETFYYLTLKSISVGSKEVQYPGSDSGSGEGN 318
Query: 325 AIFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
I DSGT+ T L Y+++ + +S+ EK++ + L CY + + + P +
Sbjct: 319 IIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQTGLSL--CYSAT---GDLKVPAI 373
Query: 384 NLTMKGG 390
+ G
Sbjct: 374 TMHFDGA 380
>gi|226508080|ref|NP_001150678.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
gi|195641018|gb|ACG39977.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 450
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 94/349 (26%), Positives = 147/349 (42%), Gaps = 51/349 (14%)
Query: 109 VSVGQPALSFIVALDTGSDLFWLPC-DCVS-CVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
+++G P L + DTGSDL W C C S C +Y+P++S+T + +
Sbjct: 94 LAIGTPPLPYQAIADTGSDLIWTQCAPCTSQCFRQPTP---------LYNPSSSTTFAVL 144
Query: 167 PCNSTLC------ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
PCNS+L P G C Y V Y S G S E +T QS+
Sbjct: 145 PCNSSLSVCAAALAGTGTAPPPGCACTYNVTYGS-GWTSVFQGSETFTFGSTPAGQSRV- 202
Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GSD 276
I+FGC +G + ++ +GL GLG + S L +Q +P FS C ++
Sbjct: 203 -PGIAFGCSTASSG--FNASSASGLVGLGRGRLS----LVSQLGVPK-FSYCLTPYQDTN 254
Query: 277 GTGRISFGDK----GSPGQGETPF-SLRQTHPT---YNITITQVSVGGNAVNFEFSA--- 325
T + G G+ G TPF + T P Y + +T +S+G A++ A
Sbjct: 255 STSTLLLGPSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALSIPPDAFLL 314
Query: 326 --------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTN 377
I DSGT+ T L + AY Q+ SL ++ + C++L P+ T+
Sbjct: 315 NADGTGGLIIDSGTTITLLGNTAYQQVRAAVVSLVTLPTTDGSAATGLDLCFML-PSSTS 373
Query: 378 FEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
+ ++T+ G V + S+ GL+ + VNI+G
Sbjct: 374 APPAMPSMTLHFNGADMVLPADSYMMSDDSGLWCLAMQNQTDGEVNILG 422
>gi|224130878|ref|XP_002320947.1| predicted protein [Populus trichocarpa]
gi|222861720|gb|EEE99262.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 71/283 (25%), Positives = 118/283 (41%), Gaps = 34/283 (12%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ ++S+G P + DTGSDL W C C C ++ ++ P +S T
Sbjct: 95 YLMSLSLGTPPFKIMGIADTGSDLIWTQCKPCERCYKQVDP---------LFDPKSSKTY 145
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
C++ C L Q +G+ C YQ Y D + + G + D + L + S
Sbjct: 146 RDFSCDARQCSLLDQSTCSGNICQYQYSY-GDRSYTMGNVASDTITLDSTTGSPVSFPKT 204
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-----GSDGT 278
+ GCG G+F D + G+ GLG S+ S + + + FS C + +
Sbjct: 205 V-IGCGHENDGTFSDKGS--GIVGLGAGPLSLISQMGSS--VGGKFSYCLVPLSSRAGNS 259
Query: 279 GRISFGDKG---SPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNF--------EFSAI 326
+++FG PG TP +T + Y +T+ +SVG + F E + I
Sbjct: 260 SKLNFGSNAVVSGPGVQSTPLLSSETMSSFYFLTLEAMSVGNERIKFGDSSLGTGEGNII 319
Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
DSGT+ T + D ++ +S + + +R S CY
Sbjct: 320 IDSGTTLTIVPDDFFSNLSTAVGNQVEGRRAEDPSGF-LSVCY 361
>gi|242044724|ref|XP_002460233.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
gi|241923610|gb|EER96754.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
Length = 512
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 115/429 (26%), Positives = 170/429 (39%), Gaps = 81/429 (18%)
Query: 40 DPVKGILAVDDLPKKGSFAYYSALAHR-DRYFRLR------GRGLAAQGNDKTPLTFSAG 92
+ V G+L+ D A S+L R DRY RL A + P+T A
Sbjct: 95 EEVDGLLSTD-------AARVSSLQRRIDRYRRLMITSSAEVAVAVAASKAQVPVTSGA- 146
Query: 93 NDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVID 151
+L +L ++ + G+ V +DT S+L W+ C C SC +
Sbjct: 147 ----KLRTLNYVATVGLGGGEAT----VIVDTASELTWVQCAPCESCHDQQDP------- 191
Query: 152 FNIYSPNTSSTSSKVPCNSTLCEL---------------QKQCPSAGSNCPYQVRYLSDG 196
++ P++S + + VPCNS+ C+ Q Q SA + C Y + Y DG
Sbjct: 192 --LFDPSSSPSYAAVPCNSSSCDALQLATGGTSGGAAACQGQDQSAAA-CSYTLSY-RDG 247
Query: 197 TMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVP 256
+ S G L D L LA + +D + FGCG G G + GL GLG + S+
Sbjct: 248 SYSRGVLAHDRLSLA-----GEVIDGFV-FGCGTSNQGPPFGGTS--GLMGLGRSQLSLV 299
Query: 257 SILANQGLIPNSFSMCF---GSDGTGRISFGDKGSPGQGETPF------SLRQTHPTYNI 307
S +Q FS C SD +G + GD S + TP S P Y +
Sbjct: 300 SQTMDQ--FGGVFSYCLPLKESDSSGSLVIGDDSSVYRNSTPIVYASMVSDPLQGPFYFV 357
Query: 308 TITQVSVGGNAVN--------FEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETS 359
+T ++VGG V AI DSGT T L Y + F S E +
Sbjct: 358 NLTGITVGGQEVESSGFSSGGGGGKAIIDSGTVITSLVPSIYNAVKAEFLSQFAEYPQAP 417
Query: 360 TSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVI--VSSEPKGLYLYCLGVV 417
+ + C+ ++ + P + L GG V+ V+ VSS+ + L +
Sbjct: 418 GFSI-LDTCFNMT-GLREVQVPSLKLVFDGGVEVEVDSGGVLYFVSSDSSQVCLAMAPLK 475
Query: 418 KSDNVNIIG 426
NIIG
Sbjct: 476 SEYETNIIG 484
>gi|222616654|gb|EEE52786.1| hypothetical protein OsJ_35257 [Oryza sativa Japonica Group]
Length = 346
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 80/299 (26%), Positives = 127/299 (42%), Gaps = 39/299 (13%)
Query: 109 VSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
+S+G P + +V +DTGS L W+ C +C + + +GQ I++P SST SKV
Sbjct: 3 ISLGTPPVFNLVTIDTGSTLSWVQCKNCQIKCYDQAAKAGQ-----IFNPYNSSTYSKVG 57
Query: 168 CNSTLCE-------LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
C++ C ++ C C Y +RY S G S G+L +D L LA++ +S+
Sbjct: 58 CSTEACNGMHMDLAVEYGCVEEDDTCIYSLRYGS-GEYSVGYLGKDRLTLASN----RSI 112
Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD--GT 278
D+ I FGCG L G+ G G S + + Q +FS CF D
Sbjct: 113 DNFI-FGCGEDN----LYNGVNAGIIGFGTKSYSFFNQVCQQTDY-TAFSYCFPRDHENE 166
Query: 279 GRISFGDKGSP-GQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS--------AIFDS 329
G ++ G T P Y I Q+ + N + E I DS
Sbjct: 167 GSLTIGPYARDINLMWTKLIYYDHKPAY--AIQQLDMMVNGIRLEIDPYIYISKMTIVDS 224
Query: 330 GTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNF-EYPVVNLTM 387
GT+ TY+ P + + + + K T D C++ + N+ ++P V + +
Sbjct: 225 GTADTYILSPVFDALDKAMTKEMQAKGYTRGWD-ERRICFISNSGSANWNDFPTVEMKL 282
>gi|125543638|gb|EAY89777.1| hypothetical protein OsI_11319 [Oryza sativa Indica Group]
Length = 390
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 94/354 (26%), Positives = 142/354 (40%), Gaps = 56/354 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ ++++G P + LDTGSDL W C CVSC Q + + + + SST+
Sbjct: 35 YLVHLAIGTPPQPVQLTLDTGSDLIWTQCKPCVSCFD-------QPLPY--FDTSRSSTN 85
Query: 164 SKVPCNSTLCELQK------QCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
+ +PC ST C+L + C Y Y D +++ G L D
Sbjct: 86 ALLPCESTQCKLDPTVTVCVKLNQTVQTCAYYTSY-GDNSVTIGLLAADKFTFVAGTSLP 144
Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDG 277
++FGCG TG F + G+ G G S+PS L +FS CF +
Sbjct: 145 G-----VTFGCGLNNTGVF--NSNETGIAGFGRGPLSLPSQLKV-----GNFSHCFTTI- 191
Query: 278 TGRISF-------GDKGSPGQGE---TP---FSLRQTHPT-YNITITQVSVGGNAVNFEF 323
TG I D S GQG TP ++ + +PT Y +++ ++VG +
Sbjct: 192 TGAIPSTVLLDLPADLFSNGQGAVQTTPLIQYAKNEANPTLYYLSLKGITVGSTRLPVPE 251
Query: 324 SA----------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSP 373
SA I DSGTS T L Y + + F A+ K + Y +P
Sbjct: 252 SAFALTNGTGGTIIDSGTSITSLPPQVYQVVRDEF--AAQIKLPVVPGNATGHYTCFSAP 309
Query: 374 NQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGR 427
+Q + P + L +G + V + G + CL + K D IIG
Sbjct: 310 SQAKPDVPKLVLHFEGATMDLPRENYVFEVPDDAGNSIICLAINKGDETTIIGN 363
>gi|225427550|ref|XP_002266461.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 439
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 79/287 (27%), Positives = 125/287 (43%), Gaps = 39/287 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ N+ +G P + I +DTGSDL W C C C QV+ ++ P SST
Sbjct: 92 YLMNLYIGTPPVPVIAIVDTGSDLTWTQCRPCTHCYK-------QVVP--LFDPKNSSTY 142
Query: 164 SKVPCNSTLCEL--QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
C ++ C + + S C ++ Y +DG+ + G L + L + D K V
Sbjct: 143 RDSSCGTSFCLALGKDRSCSKEKKCTFRYSY-ADGSFTGGNLASETLTV--DSTAGKPVS 199
Query: 222 -SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-----GS 275
+FGCG G F + +G+ GLG + S+ S L + I FS C S
Sbjct: 200 FPGFAFGCGHSSGGIF--DKSSSGIVGLGGGELSLISQL--KSTINGLFSYCLLPVSTDS 255
Query: 276 DGTGRISFGDKGSP---GQGETPFSLRQTHPTYNITITQVSVGGNAVNF----------E 322
+ RI+FG G G TP + Y +T+ +SVG + + E
Sbjct: 256 SISSRINFGASGRVSGYGTVSTPLVQKSPDTFYYLTLEGISVGKKRLPYKGYSKKTEVEE 315
Query: 323 FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
+ I DSGT++T+L Y+++ ++ + K KR + + F CY
Sbjct: 316 GNIIVDSGTTYTFLPQEFYSKLEKSVANSIKGKRVRDPNGI-FSLCY 361
>gi|255543383|ref|XP_002512754.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223547765|gb|EEF49257.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 414
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 82/273 (30%), Positives = 120/273 (43%), Gaps = 46/273 (16%)
Query: 97 RLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIY 155
RL +L ++ V +G ++ IV DTGSDL W+ C C C + + ++
Sbjct: 61 RLQTLNYI--VTVEIGGRNMTVIV--DTGSDLTWVQCQPCRLCYNQQDP---------LF 107
Query: 156 SPNTSSTSSKVPCNSTLCE-LQKQ------CPSAGSNCPYQVRYLSDGTMSTGFLVEDVL 208
+P+ S + + CNS+ C+ LQ C S C Y V Y DG+ + G L + L
Sbjct: 108 NPSGSPSYQTILCNSSTCQSLQYATGNLGVCGSNTPTCNYVVNY-GDGSYTRGDLGMEQL 166
Query: 209 HLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNS 268
+L T S FGCGR G F +GL GLG K+ + + +
Sbjct: 167 NLGTTHV------SNFIFGCGRNNKGLF---GGASGLMGLG--KSDLSLVSQTSAIFEGV 215
Query: 269 FSMCF---GSDGTGRISFGDKGSPGQGETPFSLRQT-----HPT-YNITITQVSVGGNAV 319
FS C +D +G + G S + TP S + PT Y + +T +S+GG A+
Sbjct: 216 FSYCLPTTAADASGSLILGGNSSVYKNTTPISYTRMIANPQLPTFYFLNLTGISIGGVAL 275
Query: 320 ---NFEFSAIF-DSGTSFTYLNDPAYTQISETF 348
N+ S I DSGT T L P Y + F
Sbjct: 276 QAPNYRQSGILIDSGTVITRLPPPVYRDLKAEF 308
>gi|356504173|ref|XP_003520873.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 461
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 87/307 (28%), Positives = 121/307 (39%), Gaps = 44/307 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++T + VG PA + LDTGSD+ WL C C C + +++ P S T
Sbjct: 118 YFTRIGVGTPARYVYMVLDTGSDVVWLQCAPCRKCYTQTD---------HVFDPTKSRTY 168
Query: 164 SKVPCNSTLCELQKQ--CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
+ +PC + LC C + C YQV Y DG+ + G + L +
Sbjct: 169 AGIPCGAPLCRRLDSPGCSNKNKVCQYQVSY-GDGSFTFGDFSTETLTFRRNRV------ 221
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GSDG 277
+R++ GCG G F GL GLG + S P + + FS C S
Sbjct: 222 TRVALGCGHDNEGLF---TGAAGLLGLGRGRLSFPVQTGRR--FNHKFSYCLVDRSASAK 276
Query: 278 TGRISFGDKGSPGQGE-TPFSLRQTHPT-YNITITQVSVGGNAVN------FEFSA---- 325
+ FGD TP T Y + + +SVGG V F A
Sbjct: 277 PSSVIFGDSAVSRTAHFTPLIKNPKLDTFYYLELLGISVGGAPVRGLSASLFRLDAAGNG 336
Query: 326 --IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
I DSGTS T L PAY + + F A + L F+ C+ LS T + P V
Sbjct: 337 GVIIDSGTSVTRLTRPAYIALRDAFRIGASHLKRAPEFSL-FDTCFDLS-GLTEVKVPTV 394
Query: 384 NLTMKGG 390
L +G
Sbjct: 395 VLHFRGA 401
>gi|224067990|ref|XP_002302634.1| predicted protein [Populus trichocarpa]
gi|222844360|gb|EEE81907.1| predicted protein [Populus trichocarpa]
Length = 490
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 87/307 (28%), Positives = 123/307 (40%), Gaps = 44/307 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++T + VG PA + LDTGSD+ W+ C C C + +++P S +
Sbjct: 147 YFTRLGVGTPARYVFMVLDTGSDVVWIQCAPCKKCYSQTDP---------VFNPTKSRSF 197
Query: 164 SKVPCNSTLCELQKQ--CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
+ +PC S LC C + C YQV Y DG+ + G + L
Sbjct: 198 ANIPCGSPLCRRLDSPGCSTKKHICLYQVSY-GDGSFTYGEFSTETLTFRGTRV------ 250
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GSDG 277
R++ GCG G F+ A L GLG + S PS + + FS C S
Sbjct: 251 GRVALGCGHDNEGLFIGAAG---LLGLGRGRLSFPSQIGRR--FSRKFSYCLVDRSASSK 305
Query: 278 TGRISFGDKGSPGQGE-TPFSLRQTHPT-YNITITQVSVGGNAVN------FEFSA---- 325
+ FGD TP T Y + + VSVGG V F+ +
Sbjct: 306 PSYMVFGDSAISRTARFTPLVSNPKLDTFYYVELLGVSVGGTRVPGITASLFKLDSTGNG 365
Query: 326 --IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
I DSGTS T L PAY + + F A + L F+ C+ LS +T + P V
Sbjct: 366 GVIIDSGTSVTRLTRPAYVALRDAFRVGASNLKRAPEFSL-FDTCFDLS-GKTEVKVPTV 423
Query: 384 NLTMKGG 390
L +G
Sbjct: 424 VLHFRGA 430
>gi|242089069|ref|XP_002440367.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
gi|241945652|gb|EES18797.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
Length = 462
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 87/330 (26%), Positives = 131/330 (39%), Gaps = 42/330 (12%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ +V VG P ++ LDTGSD+ WL C C C + SG+V D +
Sbjct: 142 YFASVGVGTPPTPALLVLDTGSDVVWLQCAPCRQCY----AQSGRVFDPRRSRSYAAVRC 197
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
PC C C YQV Y DG+++ G L + L A + R
Sbjct: 198 GAPPCRGLDAGGGGGCDRRRGTCLYQVAY-GDGSVTAGDLATETLWFARGARVP-----R 251
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-GSDGTGRIS 282
++ GCG G F+ A GL + S+P+ A + FS CF GSD R
Sbjct: 252 VAVGCGHDNEGLFVAAAGLLGLG---RGRLSLPTQTARR--YGRRFSYCFQGSDLDHRTI 306
Query: 283 FGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS-----AIFDSGTSFTYLN 337
+R H + VG ++ + S I DSGTS T L
Sbjct: 307 ---------------IRTVHQHVGGARVR-GVGERSLRLDPSTGRGGVILDSGTSVTRLA 350
Query: 338 DPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVND 397
P Y + E F + A R F+ CY L + + P V++ + GG +
Sbjct: 351 RPVYVAVREAFRAAAGGLRLAPGGFSLFDTCYDLRGRRV-VKVPTVSVHLAGGAEVALPP 409
Query: 398 PIVIVSSEPKGLYLYCLGVVKSD-NVNIIG 426
++ + +G +CL + +D V+I+G
Sbjct: 410 ENYLIPVDTRG--TFCLALAGTDGGVSIVG 437
>gi|302781476|ref|XP_002972512.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
gi|300159979|gb|EFJ26598.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
Length = 496
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 90/310 (29%), Positives = 130/310 (41%), Gaps = 53/310 (17%)
Query: 113 QPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTL 172
Q LS I+ DTGS+ + C S S V D P S + +VPC S L
Sbjct: 110 QKNLSAII--DTGSEAVLVQC---------GSRSRPVFD-----PAASQSYRQVPCISQL 153
Query: 173 C-ELQKQ--------CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
C +Q+Q C ++ + C Y + Y D STG +DV+ L + ++V R
Sbjct: 154 CLAVQQQTSNGSSQPCVNSSATCTYSLSY-GDSRNSTGDFSQDVIFLNSTNSSGQAVQFR 212
Query: 224 -ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD-----G 277
++FGC G FL G+ G S+PS L ++ L + FS CF S
Sbjct: 213 DVAFGCAHSPQG-FLVDLGSLGIVGFNRGNLSLPSQLKDR-LGGSKFSYCFPSQPWQPRA 270
Query: 278 TGRISFGDKG--SPGQGETPFSLRQTHPT----YNITITQVSVGGNAVNFEFSA------ 325
TG I GD G G TP P Y + +T +SV G + SA
Sbjct: 271 TGVIFLGDSGLSKSKVGYTPLLDNPVTPARSQLYYVGLTSISVDGKTLAIPESAFKLDPS 330
Query: 326 ------IFDSGTSFTYLNDPAYTQISETFNSLAKEK-RETSTSDLPFEYCYVLSPNQTNF 378
+ DSGT+FT + D AYT F + + R+ + F+ CY +S +
Sbjct: 331 TGDGGTVLDSGTTFTRVVDDAYTAFRNAFAASNRSGLRKKVGAAAGFDDCYNISAGSSLP 390
Query: 379 EYPVVNLTMK 388
P V L+++
Sbjct: 391 GVPEVRLSLQ 400
>gi|359496797|ref|XP_002277380.2| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
vinifera]
Length = 358
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 74/259 (28%), Positives = 113/259 (43%), Gaps = 36/259 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+Y V G PA + + +DTGS L WL C CV H V ++ P+ S T
Sbjct: 118 YYVKVGFGSPARYYSMIVDTGSSLSWLQCKPCVVYCH--------VQADPLFDPSASKTY 169
Query: 164 SKVPCNSTLCEL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
+ C S+ C C ++ + C Y Y D + S G+L +D+L LA +
Sbjct: 170 KSLSCTSSQCSSLVDATLNNPLCETSSNVCVYTASY-GDSSYSMGYLSQDLLTLAPSQTL 228
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD 276
V +GCG+ G F A G+ GLG +K S+ ++++ +FS C +
Sbjct: 229 PGFV-----YGCGQDSDGLFGRAA---GILGLGRNKLSMLGQVSSK--FGYAFSYCLPTR 278
Query: 277 GTGR-ISFGDKGSPGQGE--TPFSLRQTHPT-YNITITQVSVGGNAV-----NFEFSAIF 327
G G +S G G TP + +P+ Y + +T ++VGG A+ + I
Sbjct: 279 GGGGFLSIGKASLAGSAYKFTPMTTDPGNPSLYFLRLTAITVGGRALGVAAAQYRVPTII 338
Query: 328 DSGTSFTYLNDPAYTQISE 346
DSGT T L YT +
Sbjct: 339 DSGTVITRLPMSVYTPFQQ 357
>gi|115465373|ref|NP_001056286.1| Os05g0557100 [Oryza sativa Japonica Group]
gi|113579837|dbj|BAF18200.1| Os05g0557100 [Oryza sativa Japonica Group]
gi|125553268|gb|EAY98977.1| hypothetical protein OsI_20935 [Oryza sativa Indica Group]
Length = 494
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 83/310 (26%), Positives = 122/310 (39%), Gaps = 56/310 (18%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVH------GLNSSSGQVIDFNIYSP 157
++ VG PA F++ DTGSDL W+ C S H + S V ++ P
Sbjct: 110 YFVRFRVGTPAQPFVLIADTGSDLTWVKCRGAASPSHATATASPAAAPSPAVAPPRVFRP 169
Query: 158 NTSSTSSKVPCNSTLCELQ-----KQCPSAGSNCPYQVRYLSDGTMSTGFLVED--VLHL 210
S T S +PC+S C+ C S+ + C Y RY +D + + G + D + L
Sbjct: 170 GDSKTWSPIPCSSETCKSTIPFSLANCSSSTAACSYDYRY-NDNSAARGVVGTDSATVAL 228
Query: 211 ATDEKQSKSVDSR-----ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLI 265
+ D + + GC G + A +G+ LG S S A++
Sbjct: 229 SGGRGGGGGGDRKAKLQGVVLGCTTAHAGQGFE--ASDGVLSLGYSNISFASRAASR--F 284
Query: 266 PNSFSMCF-----GSDGTGRISFG------DKGSPGQG-ETPFSL-RQTHPTYNITITQV 312
FS C + T ++FG +P G TP L + P Y + + V
Sbjct: 285 GGRFSYCLVDHLAPRNATSYLTFGAGPDAASSSAPAPGSRTPLLLDARVRPFYAVAVDSV 344
Query: 313 SVGGNAVNFEFSA---------IFDSGTSFTYLNDPAYTQI----SETFNSLAKEKRETS 359
SV G A++ I DSGTS T L PAY + SE L + +
Sbjct: 345 SVDGVALDIPAEVWDVGSNGGTIIDSGTSLTVLATPAYKAVVAALSEQLAGLPRVAMD-- 402
Query: 360 TSDLPFEYCY 369
PF+YCY
Sbjct: 403 ----PFDYCY 408
>gi|356556809|ref|XP_003546713.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-1-like [Glycine max]
Length = 444
Score = 73.9 bits (180), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 91/334 (27%), Positives = 134/334 (40%), Gaps = 53/334 (15%)
Query: 112 GQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNS 170
G PA + ++A+DT +D W+PC CV C ++P S+T KV C +
Sbjct: 113 GTPAQTLLLAMDTSNDAAWVPCTACVGCSTT-----------TPFAPPKSTTFKKVGCGA 161
Query: 171 TLCELQKQCPSAGSNCPYQVRYLSDGTMSTGF-LVEDVLHLATDEKQSKSVDSRISFGCG 229
+ C+ + GS C + Y GT S LV+D + LATD + +FGC
Sbjct: 162 SQCKQVRNPTCDGSACAFNFTY---GTSSVAASLVQDTVTLATDPVPA------YTFGCI 212
Query: 230 RVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT----GRISFGD 285
+ TGS L GL + + Q L ++FS C S T G
Sbjct: 213 QKATGSSLPPQGLLGLGRGPLSLLA-----QTQKLYQSTFSYCLPSFKTLNFSGHXDLXP 267
Query: 286 KGSPGQGETP-FSLRQTHPTYNITITQVSVGGNAVNFEFSA-----------IFDSGTSF 333
P P F + Y + + + VG V+ A +FDSGT F
Sbjct: 268 VAQPRDQVYPSFKNPRRSSLYYVNLVAIRVGRRIVDIPPEALAFNPXTGAGTVFDSGTVF 327
Query: 334 TYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGP 392
T L +PAYT + F ++ K+ T TS F+ CY + P + G
Sbjct: 328 TRLVEPAYTAVRNEFRRRVSVHKKLTVTSLGGFDTCYTVP-----IVAPTITFMFSGMNV 382
Query: 393 FFVNDPIVIVSSEPKGLYLYCLGVVKS-DNVNII 425
D I+I S+ + CL + + DNVN +
Sbjct: 383 TLPPDNILIHST---AGSVTCLAMAPAPDNVNSV 413
>gi|88174593|gb|ABD39371.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 73.9 bits (180), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 86/350 (24%), Positives = 143/350 (40%), Gaps = 57/350 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+ +V +G PA + IV +DTGS W+ C+C C H + + + S+T +
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCECDGC-H---------TNPRTFLQSRSTTCA 50
Query: 165 KVPCNSTLCELQKQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
KV C +++C L P +CP++V Y DG+ S G L +D L + +K
Sbjct: 51 KVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSY-QDGSASYGILYQDTLTFSDVQKIPG- 108
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC------- 272
SFGC G+ G +GL G+G SV L + FS C
Sbjct: 109 ----FSFGCNMDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPTFDCFSYCLPLQKSE 160
Query: 273 --FGSDGTGRISFGDKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFS---- 324
F S TG S G + + ++ + + + +SV G + S
Sbjct: 161 RGFFSKTTGYFSLGKVATRTDVRYTKMVARKKNTELFFVDLIAISVDGERLGLSPSVFSR 220
Query: 325 --AIFDSGTSFTYLNDPAYTQISETFNSL------AKEKRETSTSDLPFEYCYVLSPNQT 376
+FDSG+ +Y+ D A + +S+ L A+E+ E + CY +
Sbjct: 221 KGVVFDSGSELSYIPDRALSVLSQRIRELLLKRGAAEEESERN--------CYDMRSVDE 272
Query: 377 NFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
+ P ++L F + V V + ++CL +++V+IIG
Sbjct: 273 G-DMPAISLHFDDAARFDLGSHGVFVERSVQEQDVWCLAFAPTESVSIIG 321
>gi|357486591|ref|XP_003613583.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355514918|gb|AES96541.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 437
Score = 73.9 bits (180), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 84/299 (28%), Positives = 126/299 (42%), Gaps = 37/299 (12%)
Query: 111 VGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCN 169
+G P +DT +D W C+ C C N++S ++ P+ SST +PC+
Sbjct: 95 IGTPPFQLYGVMDTANDNIWFQCNPCKPC---FNTTSP------MFDPSKSSTYKTIPCS 145
Query: 170 STLCE--LQKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISF 226
S C+ C S C Y Y + S G L D L L ++ S + I
Sbjct: 146 SPKCKNVENTHCSSDDKKVCEYSFTYGGEA-YSQGDLSIDTLTLNSNNDTPISFKN-IVI 203
Query: 227 GCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC----FGSDG-TGRI 281
GCG G L+G +G GLG S S L + I FS C F ++G +G++
Sbjct: 204 GCGHRNKGP-LEGYV-SGNIGLGRGPLSFISQLNSS--IGGKFSYCLVPLFSNEGISGKL 259
Query: 282 SFGDKG-SPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS---------AIFDSGT 331
FGDK G G + Y+ T+ +SVG + + FE S I DSGT
Sbjct: 260 HFGDKSVVSGVGTVSTPITAGEIGYSTTLNALSVGDHIIKFENSTSKNDNLGNTIIDSGT 319
Query: 332 SFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGG 390
+ T L + Y+++ S+ K +R S + F+ CY N + P++ G
Sbjct: 320 TLTILPENVYSRLESIVTSMVKLERAKSPNQ-QFKLCY--KATLKNLDVPIITAHFNGA 375
>gi|125600538|gb|EAZ40114.1| hypothetical protein OsJ_24557 [Oryza sativa Japonica Group]
Length = 412
Score = 73.9 bits (180), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 83/329 (25%), Positives = 131/329 (39%), Gaps = 43/329 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
+ ++++G P L LDTGSDL W CD C C +Y+P S+T
Sbjct: 92 YLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQ---------PAPLYAPARSAT 142
Query: 163 SSKVPCNSTLCE-LQ---KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
+ V C S +C+ LQ +C + C Y Y DGT + G L + L +D
Sbjct: 143 YANVSCRSPMCQALQSPWSRCSPPDTGCAYYFSY-GDGTSTDGVLATETFTLGSDTAVRG 201
Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT 278
++FGCG GS + +GL G+G P L +Q + C
Sbjct: 202 -----VAFGCGTENLGSTDNS---SGLVGMGRG----PLSLVSQLGVTRPRRSCRARAAA 249
Query: 279 GRISFGDKGSPGQGETPF-SLRQTHPTYNITITQVSVGGNAVNFEFSAIFDSGTSFTYLN 337
SP +G T +L P +T + GG I DSGT+FT L
Sbjct: 250 RGGGAPTTTSPLEGITVGDTLLPIDPAV-FRLTPMGDGG--------VIIDSGTTFTALE 300
Query: 338 DPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVND 397
+ A+ ++ S + S + L C+ + + E P + L G +
Sbjct: 301 ERAFVALARALASRVRLPL-ASGAHLGLSLCFAAASPEA-VEVPRLVLHFDGADMELRRE 358
Query: 398 PIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
V+ E + + CLG+V + ++++G
Sbjct: 359 SYVV---EDRSAGVACLGMVSARGMSVLG 384
>gi|242092900|ref|XP_002436940.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
gi|241915163|gb|EER88307.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
Length = 465
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 85/351 (24%), Positives = 142/351 (40%), Gaps = 41/351 (11%)
Query: 66 RDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTG 125
R Y + R A D +T + ++SL ++ + G P++ ++ +DTG
Sbjct: 89 RTNYIKSRASTGMASTPDDAAVTVPTRLGGF-VDSLEYM--VTLGFGTPSVPQVLLMDTG 145
Query: 126 SDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC-----ELQKQCP 180
SD+ W+ C C NS+ ++ P+ SST + + C + C + C
Sbjct: 146 SDVSWV--QCAPC----NSTECYPQKDPLFDPSKSSTYAPIACGADACNKLGDHYRNGCT 199
Query: 181 SAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGA 240
S G+ C Y+V Y DG+ + G + + A FGCG Q G
Sbjct: 200 SGGTQCGYRVEY-GDGSSTRGVYSNETITFAPGITVKD-----FHFGCGHDQRGP---SD 250
Query: 241 APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS--DGTGRISFGDKGSPGQGETPF-- 296
+GL GLG S+ ++ + +FS C + G ++ G + S + F
Sbjct: 251 KFDGLLGLGGAPESL--VVQTASVYGGAFSYCLPALNSEAGFLALGVRPSAATNTSAFVF 308
Query: 297 ----SLRQTHPTYNITITQVSVGGNAVNFEFSA-----IFDSGTSFTYLNDPAYTQISET 347
L +Y + +T +SVGG ++ SA + DSGT T L + AY ++
Sbjct: 309 TPMWHLPMDATSYMVNMTGISVGGKPLDIPRSAFRGGMLIDSGTIVTELPETAYNALNAA 368
Query: 348 FNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDP 398
++ D F+ CY + +N P V LT GG ++ P
Sbjct: 369 LRKAFAAYPMVASED--FDTCYNFT-GYSNVTVPRVALTFSGGATIDLDVP 416
>gi|168005153|ref|XP_001755275.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693403|gb|EDQ79755.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 429
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 94/345 (27%), Positives = 140/345 (40%), Gaps = 58/345 (16%)
Query: 66 RDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTG 125
+R RL LA +TP+ ++GN Y ++ +S G P +DTG
Sbjct: 62 HERRARLAKHVLAGDQLFETPV--ASGNGEYLID---------ISYGNPPQKSTAIVDTG 110
Query: 126 SDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE-LQKQCPSAG 183
SDL W+ C C SC L++ + P+ S++ + C S C+ L Q S
Sbjct: 111 SDLNWVQCLPCKSCYETLSAK---------FDPSKSASYKTLGCGSNFCQDLPFQ--SCA 159
Query: 184 SNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPN 243
++C Y Y DG+ ++G L D + + T + + ++FGCG G+F
Sbjct: 160 ASCQYDYMY-GDGSSTSGALSTDDVTIGTGKIPN------VAFGCGNSNLGTFAGAGG-- 210
Query: 244 GLFGLGMDKTSVPSILANQ--GLIPNSFSMC---FGSDGTGRISFGDKG-SPGQGETPFS 297
+ P L +Q G FS C GS T + GD + G TP
Sbjct: 211 -----LVGLGKGPLSLVSQLGGTATKKFSYCLVPLGSTKTSPLYIGDSTLAGGVAYTPML 265
Query: 298 LRQTHPT-YNITITQVSVGGNAVN-----FEFSA------IFDSGTSFTYLNDPAYTQIS 345
+PT Y + +SV G AVN F+ +A I DSGT+ TYL+ A+ +
Sbjct: 266 TNNNYPTFYYAELQGISVEGKAVNYPANTFDIAATGRGGLILDSGTTLTYLDVDAFNPMV 325
Query: 346 ETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGG 390
+ A E S EYC+ + N YP V G
Sbjct: 326 AALKA-ALPYPEADGSFYGLEYCFS-TAGVANPTYPTVVFHFNGA 368
>gi|242050430|ref|XP_002462959.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
gi|241926336|gb|EER99480.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
Length = 448
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 110/447 (24%), Positives = 163/447 (36%), Gaps = 70/447 (15%)
Query: 1 MASSYRNSPVCVLLILLSCCAGCCFGFGTFGFDFHHRYSDP-VKGILAVDDLPKKGSFAY 59
M+SS +L+ L CA G + +SDP + V D ++
Sbjct: 1 MSSSTSQMASLAVLVFLVVCATLASGAASVRVGLTRIHSDPDITAPEFVRDALRRD---- 56
Query: 60 YSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFI 119
HR + L GR LA +D T ++ D G + +S+G P LS+
Sbjct: 57 ----MHRQQSRSLFGRELAE--SDGTTVSARTRKDLPN----GGEYLMTLSIGTPPLSYP 106
Query: 120 VALDTGSDLFW---LPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE-- 174
DTGSDL W PC C +Y+P +S+T +PCNS+L
Sbjct: 107 AIADTGSDLIWTQCAPCSGDQCF---------AQPAPLYNPASSTTFGVLPCNSSLSMCA 157
Query: 175 --LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQ 232
L + P G C Y Y + T G + + V I+FGC
Sbjct: 158 GVLAGKAPPPGCACMYNQTYGTGWT--AGVQGSETFTFGSAAADQARVPG-IAFGCSNAS 214
Query: 233 TGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GSDGTGRISFGDKGS 288
+ + +G+A GL GLG S+ S L FS C ++ T + G +
Sbjct: 215 SSDW-NGSA--GLVGLGRGSLSLVSQLGA-----GRFSYCLTPFQDTNSTSTLLLGPSAA 266
Query: 289 ---PGQGETPFSLRQTHPT----YNITITQVSVGGNAVNFEFSA-----------IFDSG 330
G TPF Y + +T +S+G A++ A I DSG
Sbjct: 267 LNGTGVRSTPFVASPAKAPMSTYYYLNLTGISLGAKALSISPDAFSLKADGTGGLIIDSG 326
Query: 331 TSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVL-SPNQTNFEYPVVNLTMKG 389
T+ T L + AY Q+ SL + + CY L +P P + L G
Sbjct: 327 TTITSLVNAAYQQVRAAVQSLVTLPAIDGSDSTGLDLCYALPTPTSAPPAMPSMTLHFDG 386
Query: 390 GGPFFVNDPIVIVSSEPKGLYLYCLGV 416
D +I G ++CL +
Sbjct: 387 ADMVLPADSYMI-----SGSGVWCLAM 408
>gi|356546372|ref|XP_003541600.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 433
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 79/312 (25%), Positives = 131/312 (41%), Gaps = 32/312 (10%)
Query: 94 DTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDF 152
+T +++LG + + SVG P+L LDTGSD+ WL C C C
Sbjct: 79 ETTVISALG-EYLISYSVGTPSLQVFGILDTGSDIIWLQCQPCKKCYEQTTP-------- 129
Query: 153 NIYSPNTSSTSSKVPCNSTLCE-LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLA 211
I+ + S T +PC S C+ +Q S+ +C Y + Y+ DG+ S G L + L L
Sbjct: 130 -IFDSSKSQTYKTLPCPSNTCQSVQGTFCSSRKHCLYSIHYV-DGSQSLGDLSVETLTLG 187
Query: 212 TDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSM 271
+ + GCGR + + G+ GLG S+ + L+ FS
Sbjct: 188 STNGSPVQFPGTV-IGCGRYNAIGIEEKNS--GIVGLGRGPMSLITQLSPS--TGGKFSY 242
Query: 272 CFG---SDGTGRISFGDKG---SPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF---- 321
C S + +++FG+ G TP + Y +T+ SVG N + F
Sbjct: 243 CLVPGLSTASSKLNFGNAAVVSGRGTVSTPLFSKNGLVFYFLTLEAFSVGRNRIEFGSPG 302
Query: 322 ---EFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNF 378
+ + I DSGT+ T L + Y+++ +R + + CY ++P++ +
Sbjct: 303 SGGKGNIIIDSGTTLTALPNGVYSKLEAAVAKTVILQRVRDPNQV-LGLCYKVTPDKLDA 361
Query: 379 EYPVVNLTMKGG 390
PV+ G
Sbjct: 362 SVPVITAHFSGA 373
>gi|54290717|dbj|BAD62387.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|215734915|dbj|BAG95637.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 469
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 104/381 (27%), Positives = 154/381 (40%), Gaps = 50/381 (13%)
Query: 63 LAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHY-TNVSVGQPALSFIVA 121
L H R + G G + + PLT A S+ +Y T + +G PA S+++
Sbjct: 96 LLHGHRKKKAGGVGGSQASSSSVPLTPGA--------SVAVGNYVTRLGLGTPATSYVMV 147
Query: 122 LDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC-ELQKQC- 179
+DTGS L WL C C + +G V D P S T + V C+S+ C ELQ
Sbjct: 148 VDTGSSLTWL--QCSPCSVSCHRQAGPVFD-----PRASGTYAAVQCSSSECGELQAATL 200
Query: 180 -PSAGS---NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGS 235
PSA S C YQ Y D + S G+L +D + + +GCG+ G
Sbjct: 201 NPSACSVSNVCIYQASY-GDSSYSVGYLSKDTVSFGSGSFPG------FYYGCGQDNEGL 253
Query: 236 FLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--GSDGTGRISFGDKGSPGQ-G 292
F A GL GL +K S+ LA + +FS C S G +S G +PGQ
Sbjct: 254 FGRSA---GLIGLAKNKLSLLYQLAPS--LGYAFSYCLPTSSAAAGYLSIGSY-NPGQYS 307
Query: 293 ETPFSLRQTHPT-YNITITQVSVGGNAVNF------EFSAIFDSGTSFTYLNDPAYTQIS 345
TP + + Y +T++ +SV G + I DSGT T L YT +S
Sbjct: 308 YTPMASSSLDASLYFVTLSGISVAGAPLAVPPSEYRSLPTIIDSGTVITRLPPNVYTALS 367
Query: 346 ETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSE 405
+ + + + C+ S P V++ GG ++ V++ +
Sbjct: 368 RAVAAAMASAAPRAPTYSILDTCFRGS--AAGLRVPRVDMAFAGGATLALSPGNVLIDVD 425
Query: 406 PKGLYLYCLGVVKSDNVNIIG 426
CL + IIG
Sbjct: 426 DS---TTCLAFAPTGGTAIIG 443
>gi|223950123|gb|ACN29145.1| unknown [Zea mays]
gi|413923785|gb|AFW63717.1| hypothetical protein ZEAMMB73_445506 [Zea mays]
Length = 385
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 96/321 (29%), Positives = 141/321 (43%), Gaps = 40/321 (12%)
Query: 98 LNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSP 157
LN+L +L V +G PA S + +DTGSD+ W+ C S H ++ P
Sbjct: 47 LNTLEYL--ITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADP--------LFDP 96
Query: 158 NTSSTSSKVPCNSTLCELQKQ----CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
++SST S C S C Q C S+ S C Y V Y DG+ +TG D L L +
Sbjct: 97 SSSSTYSPFSCGSADCAQLGQEGNGC-SSSSQCQYIVTY-GDGSSTTGTYSSDTLALGSS 154
Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
+S FGC V++G F D +GL GLG S+ S A G + +FS C
Sbjct: 155 AVRS------FQFGCSNVESG-FND--QTDGLMGLGGGAQSLVSQTA--GTLGRAFSYCL 203
Query: 274 --GSDGTGRISFGDKGSPGQG---ETPFSLRQTHPT-YNITITQVSVGGNAVN-----FE 322
+G ++ G G G +TP PT Y + + + VGG ++ F
Sbjct: 204 PPTPSSSGFLTLGAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFS 263
Query: 323 FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
+ DSGT T L AY+ +S F + K+ S + + C+ S Q++ P
Sbjct: 264 AGTVMDSGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSGI-LDTCFDFS-GQSSVSIPS 321
Query: 383 VNLTMKGGGPFFVNDPIVIVS 403
V L GG ++ +I+S
Sbjct: 322 VALVFSGGAVVSLDASGIILS 342
>gi|302762735|ref|XP_002964789.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
gi|300167022|gb|EFJ33627.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
Length = 390
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 89/313 (28%), Positives = 133/313 (42%), Gaps = 61/313 (19%)
Query: 65 HRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDT 124
HR R RGR L + L+ +G ++ + +G P S+ + LDT
Sbjct: 20 HRHR----RGRSLLQTAQVSSGLSLGSGE-----------YFARMGIGSPQRSYYLELDT 64
Query: 125 GSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAG 183
GSD+ W+ C C SC ++ IY P+ SS+ +V C S LC+ G
Sbjct: 65 GSDVTWIQCAPCSSCYSQVDP---------IYDPSNSSSYRRVYCGSALCQALDYSACQG 115
Query: 184 SNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPN 243
C Y+V Y D + S+G L + +L + S + I+FGCG +G F A
Sbjct: 116 MGCSYRVVY-GDSSASSGDLGIESFYLGPN---SSTAMRNIAFGCGHSNSGLFRGEAGLL 171
Query: 244 GLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRIS-FGDKGSP---GQGETPFSLR 299
G+ G + S I A+ G +FS C R S + SP G+ PF+ R
Sbjct: 172 GMGGGTLSFFS--QIAASIG---PAFSYCL----VDRYSQLQSRSSPLIFGRTAIPFAAR 222
Query: 300 QT----HPT----YNITITQVSVGGNAV-----------NFEFSAIFDSGTSFTYLNDPA 340
T +P Y +T +SVGG A+ N AI DSGTS T + A
Sbjct: 223 FTPLLKNPRIDTFYYAILTGISVGGTALPIPPAQFALTGNGTGGAILDSGTSVTRVVPAA 282
Query: 341 YTQISETFNSLAK 353
Y + + + + ++
Sbjct: 283 YAVLRDAYRAASR 295
>gi|359485189|ref|XP_002279141.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 546
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 90/310 (29%), Positives = 126/310 (40%), Gaps = 53/310 (17%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
++ +V VG P F + LDTGSDL W+ CV C + Y P SS+
Sbjct: 181 YFIDVFVGTPPKHFSLILDTGSDLNWI--QCVPCYECFEQNGPH------YDPGQSSSYR 232
Query: 165 KVPCNSTLCEL------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDV---LHLATDEK 215
+ C+ + C L + C + CPY Y + F +E L +++ +
Sbjct: 233 NIGCHDSRCHLVSSPDPPQPCKAENQTCPYYYWYGDSSNTTGDFALETFTVNLTMSSGKP 292
Query: 216 QSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-- 273
+ + V++ + FGCG G F A L GLG S S L Q L +SFS C
Sbjct: 293 ELRRVEN-VMFGCGHWNRGLFHGAAG---LLGLGRGPLSFSSQL--QSLYGHSFSYCLVD 346
Query: 274 -GSDG--TGRISFGDK----GSPGQGETPFSLRQTHPT---YNITITQVSVGGNAVNFEF 323
SD + ++ FG+ P T + +P Y + I + VGG VN
Sbjct: 347 RNSDANVSSKLIFGEDKDLLSHPELNFTTLVAGKENPVDTFYYVQIKSIVVGGEVVNIPE 406
Query: 324 S-----------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP-FEYCYVL 371
I DSGT+ +Y +PAY I E F +AK K D P E CY
Sbjct: 407 EKWQIATDGSGGTIIDSGTTLSYFAEPAYQVIKEAF--MAKVKGYPVVKDFPVLEPCY-- 462
Query: 372 SPNQTNFEYP 381
N T E P
Sbjct: 463 --NVTGVEQP 470
>gi|357124861|ref|XP_003564115.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 477
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 84/299 (28%), Positives = 121/299 (40%), Gaps = 44/299 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+ ++SVG P + LDTGSDL W C C++ + + V+D P SST +
Sbjct: 94 YLVHLSVGTPPRPVALTLDTGSDLVW--TQCAPCLNCFDQGAIPVLD-----PAASSTHA 146
Query: 165 KVPCNSTLCELQ--KQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLAT-DEKQ 216
V C++ +C C GS +C Y Y D +++ G L D D
Sbjct: 147 AVRCDAPVCRALPFTSCGRGGSSWGERSCVYVYHY-GDKSITVGKLASDRFTFGPGDNAD 205
Query: 217 SKSV-DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
V + R++FGCG G F A G+ G G + S+PS L SFS CF S
Sbjct: 206 GGGVSERRLTFGCGHFNKGIFQ--ANETGIAGFGRGRWSLPSQLGV-----TSFSYCFTS 258
Query: 276 DGTGRISFGDKG-SPGQ-------GETPFSLRQTHPT-YNITITQVSVGGNAVNF----- 321
S G +P + TP + P+ Y +++ ++VG +
Sbjct: 259 MFESTSSLVTLGVAPAELHLTGQVQSTPLLRDPSQPSLYFLSLKAITVGATRIPIPERRQ 318
Query: 322 ---EFSAIFDSGTSFTYLNDPAYTQISETFNS---LAKEKRETSTSDLPFEYCYVLSPN 374
E SAI DSG S T L + Y + F + L E S DL F +P
Sbjct: 319 RLREASAIIDSGASITTLPEDVYEAVKAEFVAQVGLPVSAVEGSALDLCFALPSAAAPK 377
>gi|326506682|dbj|BAJ91382.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326525815|dbj|BAJ88954.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 93/337 (27%), Positives = 142/337 (42%), Gaps = 41/337 (12%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+ T + +G PA +I+ +DTGS L WL C C + SG V D P TSS+ +
Sbjct: 137 YVTRMGLGTPAKPYIMVVDTGSSLTWL--QCSPCRVSCHRQSGPVFD-----PKTSSSYA 189
Query: 165 KVPCNSTLCE------LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
V C++ C L S+ C YQ Y D + S G+L +D + ++ +
Sbjct: 190 AVSCSTPQCNDLSTATLNPAACSSSDVCIYQASY-GDSSFSVGYLSKDTVSFGSNSVPN- 247
Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT 278
+GCG+ G F A GL GL +K S+ LA + SFS C S +
Sbjct: 248 -----FYYGCGQDNEGLFGRSA---GLMGLARNKLSLLYQLAPT--LGYSFSYCLPSSSS 297
Query: 279 GRISFGDKGSPGQ-GETPF-SLRQTHPTYNITITQVSVGGNAVNF---EFSA---IFDSG 330
+PGQ TP S Y I ++ ++V G + E+S+ I DSG
Sbjct: 298 SGYLSIGSYNPGQYSYTPMVSSTLDDSLYFIKLSGMTVAGKPLAVSSSEYSSLPTIIDSG 357
Query: 331 TSFTYLNDPAYTQISETFNSLAK-EKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKG 389
T T L Y +S+ K KR + S L + C+V ++ P V++ G
Sbjct: 358 TVITRLPTTVYDALSKAVAGAMKGTKRADAYSIL--DTCFV--GQASSLRVPAVSMAFSG 413
Query: 390 GGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
G ++ ++V + CL + + IIG
Sbjct: 414 GAALKLSAQNLLVDVDSSTT---CLAFAPARSAAIIG 447
>gi|297817972|ref|XP_002876869.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297322707|gb|EFH53128.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 462
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 90/346 (26%), Positives = 137/346 (39%), Gaps = 49/346 (14%)
Query: 108 NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
+S+G PA+ + +DTGSDL W C C C I+ P SS+ SKV
Sbjct: 111 ELSIGNPAVKYAAIVDTGSDLIWTQCKPCTECFDQPTP---------IFDPEKSSSYSKV 161
Query: 167 PCNSTLCEL--QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRI 224
C+S LC + C +C Y Y D + + G L + + ++ S I
Sbjct: 162 GCSSGLCNALPRSNCNEDKDSCEYLYTY-GDYSSTRGLLATETFTF-----EDENSISGI 215
Query: 225 SFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGL---------IPNSFSMCFG 274
FGCG G DG + +GL GLG S+ S L S S+ G
Sbjct: 216 GFGCGVENEG---DGFSQGSGLVGLGRGPLSLISQLKETKFSYCLTSIEDSEASSSLFIG 272
Query: 275 SDGTGRISFGDKGSPGQGETPFSLRQT--HPT-YNITITQVSVGGNAVNFEFSA------ 325
S +G ++ G+ SL + P+ Y + + ++VG ++ E S
Sbjct: 273 SLASGIVNKTGANLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELSED 332
Query: 326 -----IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEY 380
I DSGT+ TYL + A+ + E F S + S S + C+ L N
Sbjct: 333 GTGGMIIDSGTTITYLEETAFKVLKEEFTSRMSLPVDDSGS-TGLDLCFKLPNAAKNIAV 391
Query: 381 PVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
P + KG + ++ S L CL + S+ ++I G
Sbjct: 392 PKLIFHFKGADLELPGENYMVADSSTGVL---CLAMGSSNGMSIFG 434
>gi|218188634|gb|EEC71061.1| hypothetical protein OsI_02803 [Oryza sativa Indica Group]
Length = 479
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 80/272 (29%), Positives = 121/272 (44%), Gaps = 34/272 (12%)
Query: 99 NSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSP 157
+SL L Y +V +G PA++ V +DTGSD+ W+ C+ ++ +G + D P
Sbjct: 128 SSLDTLEYVISVGLGSPAMTQRVVIDTGSDVSWVQCEPCPAPSPCHAHAGALFD-----P 182
Query: 158 NTSSTSSKVPCNSTLCEL-----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLAT 212
SST + C++ C + A S C Y V+Y DG+ +TG DVL L+
Sbjct: 183 AASSTYAAFNCSAAACAQLGDSGEANGCDAKSRCQYIVKY-GDGSNTTGTYSSDVLTLSG 241
Query: 213 DEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC 272
+ V FGC + G+ +D +GL GLG D S+ S A + SFS C
Sbjct: 242 SD-----VVRGFQFGCSHAELGAGMDDKT-DGLIGLGGDAQSLVSQTAAR--YGKSFSYC 293
Query: 273 FGSD--GTGRISFGDKGSPGQ------GETPFSLRQTHPTYNI-TITQVSVGGNAVN--- 320
+ +G ++ G S G TP + PTY + ++VGG +
Sbjct: 294 LPATPASSGFLTLGAPASGGGGGASRFATTPMLRSKKVPTYYFAALEDIAVGGKKLGLSP 353
Query: 321 --FEFSAIFDSGTSFTYLNDPAYTQISETFNS 350
F ++ DSGT T L AY +S F +
Sbjct: 354 SVFAAGSLVDSGTVITRLPPAAYAALSSAFRA 385
>gi|424513106|emb|CCO66690.1| predicted protein [Bathycoccus prasinos]
Length = 802
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 91/362 (25%), Positives = 146/362 (40%), Gaps = 75/362 (20%)
Query: 106 YTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSC-VHGLNSSSGQVIDFNIYSPNTSSTS 163
Y V +G P F V +DTGS ++ C C SC HG N+ Y SS+
Sbjct: 139 YATVLIGTPGHQFEVIVDTGSTYTFVTCYPCASCGQHGSNAP---------YDAAKSSSY 189
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
+VPC S + C ++G C Y ++ D + G +V DV+ + R
Sbjct: 190 ERVPCGSGC--IFGACRASGL-CEYDEKFSEDSQVG-GHVVSDVIDVG-----GSLGTPR 240
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNS----FSMCFGS-DGT 278
I FGC ++T + L NG+ LG + + L + P S F +C GS +G
Sbjct: 241 IHFGCNSLET-NMLKTQKANGMIALGRAEAGLHRQLKKKAYPPGSYDGTFGLCLGSFEGG 299
Query: 279 GRISFGDKGSPGQGETPFSLRQTHPT------------YNITITQVSVGG---------- 316
G +S G P Q F R+TH + YN+ + ++ V
Sbjct: 300 GVLSLGK--LPEQHYANFVTRKTHTSTVKLVKGSKSQYYNVEVHRMFVRNTELKKPSGAE 357
Query: 317 --NAVNFEFSAIFDSGTSFTYLNDPAYTQ-ISETFNSLAKE------KRETSTSDLPFEY 367
A + + DSGT++TYL++ + ISE + + + + + P +
Sbjct: 358 LMEAFRAGYGTVLDSGTTYTYLHEDVFIPFISEIEDKVVNDHGANFFRVRGGDPNYPNDV 417
Query: 368 CY-------VLSPNQTNFEYPVVNLTMKGGG------PFFVNDPIVIVSSEPKGLYLYCL 414
C+ LS + N+ +P NLT G F + + + +EP +C+
Sbjct: 418 CWRSLNENKQLSESNVNYLFPTFNLTFIGVNEEELPIEFLPENYLFVHPNEPNA---FCV 474
Query: 415 GV 416
GV
Sbjct: 475 GV 476
>gi|449516339|ref|XP_004165204.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 456
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 95/364 (26%), Positives = 150/364 (41%), Gaps = 72/364 (19%)
Query: 102 GFLHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTS 160
GFL N+S+G P ++ +V +DTGS L W+ C C++C S + P S
Sbjct: 103 GFL--VNLSIGSPPVTQLVVVDTGSSLLWVQCLPCINCFQQSTS---------WFDPLKS 151
Query: 161 STSSKVPC--------NSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLAT 212
+ + C N C Q Y++RYL G S G L ++ L T
Sbjct: 152 VSFKTLGCGFPGYNYINGYKCNRFNQ-------AEYKLRYLG-GDSSQGILAKESLLFET 203
Query: 213 -DEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSI-LANQGLIPNSFS 270
DE + K S I+FGCG + + D A NG+FGLG + P I +A Q + N FS
Sbjct: 204 LDEGKIKK--SNITFGCGHMNIKTNNDDAY-NGVFGLG----AYPHITMATQ--LGNKFS 254
Query: 271 MCFGSDGT-----GRISFGDKGSPGQGE-TPFSLRQTHPTYNITITQVSVGGNAVNFEFS 324
C G + G +GS +G+ TP + H Y +T+ +SVG + + +
Sbjct: 255 YCIGDINNPLYTHNHLVLG-QGSYIEGDSTPLQIHFGH--YYVTLQSISVGSKTLKIDPN 311
Query: 325 A-----------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFE-YCYVLS 372
A + DSG ++T L + + + + L K E + FE C+
Sbjct: 312 AFKISSDGSGGVLIDSGMTYTKLANGGFELLYDEIVDLMKGLLERIPTQRKFEGLCFKGV 371
Query: 373 PNQTNFEYPVVNLTMKGGGPFFVNDP------------IVIVSSEPKGLYLYCLGVVKSD 420
++ +P V GG + + I+ S + L L +G++
Sbjct: 372 VSRDLVGFPAVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSELLNLSVIGILAQQ 431
Query: 421 NVNI 424
N N+
Sbjct: 432 NYNV 435
>gi|413946455|gb|AFW79104.1| hypothetical protein ZEAMMB73_209101 [Zea mays]
Length = 480
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 94/367 (25%), Positives = 140/367 (38%), Gaps = 70/367 (19%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
++ VG PA F++ DTGSDL W+ C G +G ++ S + +
Sbjct: 112 YFVRFRVGTPAQPFVLVADTGSDLTWVKCS------GAGDGTGDA-PRRVFRAAASRSWA 164
Query: 165 KVPCNSTLCELQ-----KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
+ C+S C C S S C Y RY +DG+ + G + D +A +S+
Sbjct: 165 PIACSSDTCTSYVPFSLANCSSPASPCAYDYRY-NDGSAARGVVGTDSATIALSGSESRD 223
Query: 220 VDSR------ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
R + GC G + +G+ LG S S A + FS C
Sbjct: 224 GGGRRAKLQGVVLGCTASYDGQSFQSS--DGVLSLGNSNISFASRAAAR--FGGRFSYCL 279
Query: 274 -----GSDGTGRISFGDKGSPG-----------QGETPFSL-RQTHPTYNITITQVSVGG 316
+ T ++FG G G TP L R+ P Y + + V V G
Sbjct: 280 VDHLAPRNATSYLTFGPPGPEGGAAASSSSSSAAARTPLLLDRRMSPFYAVAVDAVHVAG 339
Query: 317 NAVNFEFS---------AIFDSGTSFTYLNDPAYTQI----SETFNSLAKEKRETSTSDL 363
A++ AI DSGTS T L PAY + SE L + +
Sbjct: 340 EALDIPADVWDVARGGGAILDSGTSLTVLATPAYRAVVAALSERLAGLPRVSMD------ 393
Query: 364 PFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPI--VIVSSEPKGLYLYCLGVVKS-- 419
PFEYCY N T + L ++ G + P +V + P + C+GV +
Sbjct: 394 PFEYCY----NWTAAALEIPGLEVRFAGSARLQPPAKSYVVDAAPG---VKCIGVQEGAW 446
Query: 420 DNVNIIG 426
V++IG
Sbjct: 447 PGVSVIG 453
>gi|357127293|ref|XP_003565317.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial
[Brachypodium distachyon]
Length = 540
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 87/345 (25%), Positives = 143/345 (41%), Gaps = 48/345 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+++ + +G PA + LDTGSD+ WL C C C + ++ P SS+
Sbjct: 196 YFSRIGIGSPARQLYMVLDTGSDVTWLQCAPCADCYAQSDP---------LFDPALSSSY 246
Query: 164 SKVPCNSTLCEL------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
+ VPC+S C + S+C Y+V Y DG+ + G + L L D
Sbjct: 247 ATVPCDSPHCRALDASACHNNAANGNSSCVYEVAY-GDGSYTVGDFATETLTLGGD---G 302
Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---G 274
+ ++ GCG G F+ A L G + S PS ++ FS C
Sbjct: 303 SAAVHDVAIGCGHDNEGLFVGAAGLLALGGGPL---SFPSQIS-----ATEFSYCLVDRD 354
Query: 275 SDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGN----------AVNFEFS 324
S + FG S +++ Y + + +SVGG A++ + S
Sbjct: 355 SPSASTLQFGASDSSTVTAPLMRSPRSNTFYYVALNGISVGGETLSDIPPAAFAMDEQGS 414
Query: 325 A--IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
I DSGT+ T L AY+ + + F + S L F+ CY L+ +++ + P
Sbjct: 415 GGVIVDSGTAVTRLQSSAYSALRDAFVRGTQALPRASGVSL-FDTCYDLA-GRSSVQVPA 472
Query: 383 VNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDN-VNIIG 426
V+L +GGG + ++ + G YCL + V+I+G
Sbjct: 473 VSLRFEGGGELKLPAKNYLIPVDGAG--TYCLAFAATGGAVSIVG 515
>gi|356509399|ref|XP_003523437.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 421
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 96/354 (27%), Positives = 150/354 (42%), Gaps = 52/354 (14%)
Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPN 158
LG+ + ++++G P + + +DTGSDL W+ CD C C N +Y P+
Sbjct: 61 LGY-YTVSLAIGNPPKVYDLDIDTGSDLTWVQCDAPCKGCTLPRN---------RLYKPH 110
Query: 159 TSSTSSKVPCNSTLCELQKQCPS---AGSN--CPYQVRYLSDGTMSTGFLVEDVLHLA-T 212
V C LC + P+ AG N C Y+V Y G+ S G L+ D + L T
Sbjct: 111 ----GDLVKCVDPLCAAIQSAPNHHCAGPNEQCDYEVEYADQGS-SLGVLLRDNIPLKFT 165
Query: 213 DEKQSKSVDSRISFGCGRVQTGSFLDGAAP----NGLFGLGMDKTSVPSILANQGLIPNS 268
+ ++ + ++FGCG QT G P G+ GLG +TS+ S L + GLI N
Sbjct: 166 NGSLARPM---LAFGCGYDQTHH---GQNPPPSTAGVLGLGNGRTSILSQLHSLGLIRNV 219
Query: 269 FSMCFGSDGTGRISFGDKGSPGQG--ETPFSLRQTHPTYNITITQVSVGGNAVNFE-FSA 325
C G G + FGD+ P G TP + Y + + +
Sbjct: 220 VGHCLSGRGGGFLFFGDQLIPPSGVVWTPLLQSSSAQHYKTGPADLFFDRKTTSVKGLEL 279
Query: 326 IFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEYCYV-------LSPNQTN 377
IFDSG+S+TY N A+ + N L + +T D C+ L +N
Sbjct: 280 IFDSGSSYTYFNSQAHKALVNLIANDLRGKPLSRATGDPSLPICWKGPKPFKSLHDVTSN 339
Query: 378 FEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD-----NVNIIG 426
F+ +++ T P + ++ ++ + CLG++ N NIIG
Sbjct: 340 FKPLLLSFTKSKNSPLQLPPEAYLIVTKHGNV---CLGILDGTEIGLGNTNIIG 390
>gi|30678047|ref|NP_565298.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|30102688|gb|AAP21262.1| At2g03200 [Arabidopsis thaliana]
gi|110736021|dbj|BAE99983.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330250580|gb|AEC05674.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 461
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 92/360 (25%), Positives = 141/360 (39%), Gaps = 51/360 (14%)
Query: 108 NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
+S+G PA+ + +DTGSDL W C C C I+ P SS+ SKV
Sbjct: 110 ELSIGNPAVKYSAIVDTGSDLIWTQCKPCTECFDQPTP---------IFDPEKSSSYSKV 160
Query: 167 PCNSTLCEL--QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRI 224
C+S LC + C C Y Y D + + G L + + ++ S I
Sbjct: 161 GCSSGLCNALPRSNCNEDKDACEYLYTY-GDYSSTRGLLATETFTF-----EDENSISGI 214
Query: 225 SFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGL---------IPNSFSMCFG 274
FGCG G DG + +GL GLG S+ S L S S+ G
Sbjct: 215 GFGCGVENEG---DGFSQGSGLVGLGRGPLSLISQLKETKFSYCLTSIEDSEASSSLFIG 271
Query: 275 SDGTGRISFGDKGSPGQGETPFSLRQT--HPT-YNITITQVSVGGNAVNFEFSA------ 325
S +G ++ G+ SL + P+ Y + + ++VG ++ E S
Sbjct: 272 SLASGIVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELAED 331
Query: 326 -----IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEY 380
I DSGT+ TYL + A+ + E F S + S S + C+ L N
Sbjct: 332 GTGGMIIDSGTTITYLEETAFKVLKEEFTSRMSLPVDDSGS-TGLDLCFKLPDAAKNIAV 390
Query: 381 PVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREYPIANNISLFHN 440
P + KG + ++ S L CL + S+ ++I G N ++ H+
Sbjct: 391 PKMIFHFKGADLELPGENYMVADSSTGVL---CLAMGSSNGMSIFGNVQ--QQNFNVLHD 445
>gi|297852200|ref|XP_002893981.1| hypothetical protein ARALYDRAFT_314121 [Arabidopsis lyrata subsp.
lyrata]
gi|297339823|gb|EFH70240.1| hypothetical protein ARALYDRAFT_314121 [Arabidopsis lyrata subsp.
lyrata]
Length = 354
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 62/195 (31%), Positives = 81/195 (41%), Gaps = 30/195 (15%)
Query: 109 VSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
+ +G P +F +DTGSDL W+ CD C C + Y P ++ V
Sbjct: 58 LQIGTPPKAFEFDIDTGSDLTWVQCDAPCTGCT---------LPPIRQYKPKGNT----V 104
Query: 167 PCNSTLCEL-----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
PC +C + QCP+ C Y+V Y G+ S G LV D L ++
Sbjct: 105 PCLDPICLALHFPNKPQCPNPKEQCDYEVNYADQGS-SMGALVIDQFPLKL--LNGSAMQ 161
Query: 222 SRISFGCGRVQTGSFLDGAAP----NGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDG 277
R++FGCG Q L A P G+ GLG K V L GL N C S G
Sbjct: 162 PRLAFGCGYDQ---ILPKAHPPPATAGVLGLGRGKIGVLPQLVAAGLTRNVVGHCLSSKG 218
Query: 278 TGRISFGDKGSPGQG 292
G + FGD P G
Sbjct: 219 GGYLFFGDTLIPTLG 233
>gi|242074844|ref|XP_002447358.1| hypothetical protein SORBIDRAFT_06g033560 [Sorghum bicolor]
gi|241938541|gb|EES11686.1| hypothetical protein SORBIDRAFT_06g033560 [Sorghum bicolor]
Length = 497
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 103/377 (27%), Positives = 140/377 (37%), Gaps = 79/377 (20%)
Query: 63 LAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVAL 122
L R R +G ++ G+ P T + +Y G +T S+G P V L
Sbjct: 67 LKRRGRASHHSQKGSSSGGHKSIPATAALYPHSY-----GGYAFT-ASLGTPPQPLPVLL 120
Query: 123 DTGSDLFWLPC----DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC----- 173
DTGS L W+PC DC +C SS ++ P SS+S V C + C
Sbjct: 121 DTGSQLTWVPCTSNYDCRNC------SSPFAAAVPVFHPKNSSSSRLVGCRNPSCLWVHS 174
Query: 174 -ELQKQCP---SAGSNC--------PYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
E +C S G+NC PY V Y S T G L+ D L +
Sbjct: 175 AEHVAKCRAPCSRGANCTPASNVCPPYAVVYGSGST--AGLLIADTL------RAPGRAV 226
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILA----NQGLIPNSF-------- 269
S GC V P+GL G G SVP+ L + L+ F
Sbjct: 227 SGFVLGCSLVSVHQ-----PPSGLAGFGRGAPSVPAQLGLSKFSYCLLSRRFDDNAAVSG 281
Query: 270 SMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE------- 322
S+ G D G S + P+++ Y + ++ V+VGG AV
Sbjct: 282 SLVLGGDNDGMQYVPLVKSAAGDKQPYAV-----YYYLALSGVTVGGKAVRLPARAFAAN 336
Query: 323 ----FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSD----LPFEYCYVLSPN 374
AI DSGT+FTYL DP Q A R + D L C+ L
Sbjct: 337 AAGSGGAIVDSGTTFTYL-DPTVFQPVADAVVAAVGGRYKRSKDVEEGLGLHPCFALPQG 395
Query: 375 QTNFEYPVVNLTMKGGG 391
+ P ++L KGG
Sbjct: 396 AKSMALPELSLHFKGGA 412
>gi|414584780|tpg|DAA35351.1| TPA: hypothetical protein ZEAMMB73_696016 [Zea mays]
Length = 524
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 85/353 (24%), Positives = 137/353 (38%), Gaps = 56/353 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ VSVG P + +D+GSD+ W+ C C+ C V ++ P TS+T
Sbjct: 171 YLVRVSVGSPPTEQYLVVDSGSDVMWVQCKPCLECY---------VQADPLFDPATSATF 221
Query: 164 SKVPCNSTLCELQKQCPSAGSN---CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
S V C S +C + C Y+V Y +DG+ + G L + L L +
Sbjct: 222 SGVSCGSAICRILPTSACGDGELGGCEYEVSY-ADGSYTKGALALETLTLGGTAVEG--- 277
Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDG--- 277
+ GCG G F+ A GL GLG S+ L + + +FS C S G
Sbjct: 278 ---VVIGCGHRNRGLFVGAA---GLMGLGWGPMSLVGQLGGE--VGGAFSYCLASRGGYG 329
Query: 278 -------TGRISFGDKGSPGQGE--TPFSLRQTHPT-YNITITQVSVGGNAVNFEFS--- 324
G + G + +G P P+ Y + ++ + VG + +
Sbjct: 330 SGAADDDAGWLVLGRSEAVPEGAVWVPLVRNPRAPSFYYVGLSGIEVGDERLPLQAGLFQ 389
Query: 325 --------AIFDSGTSFTYLNDPAYTQISETF-NSLAKE-KRETSTSDLPFEYCYVLSPN 374
+ D+GT+ T L AY + + F +LA R S + CY LS
Sbjct: 390 LTEDGAGDVVMDTGTTVTRLPQEAYAALRDAFVGALAGAVPRAQGVSSSVLDTCYDLS-G 448
Query: 375 QTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVV-KSDNVNIIG 426
+ P V+ G + V++ + + +YCL S ++I+G
Sbjct: 449 YASVRVPTVSFCFDGDARLILAARNVLLEVD---MGIYCLAFAPSSSGLSIMG 498
>gi|167997964|ref|XP_001751688.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696786|gb|EDQ83123.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 418
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 84/285 (29%), Positives = 117/285 (41%), Gaps = 46/285 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ + +G P F + +D+GSDL W+ C C C D +Y P+ SST
Sbjct: 64 YFVDFFLGTPPQKFSLIVDSGSDLLWVQCSPCRQCY---------AQDSPLYVPSNSSTF 114
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDV-LHLATDEKQSKSVD- 221
S VPC S+ C L A P RY G + +L D +S +VD
Sbjct: 115 SPVPCLSSDCLLIP----ATEGFPCDFRY--PGACAYEYLYADTSSSKGVFAYESATVDG 168
Query: 222 ---SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----- 273
+++FGCG GSF AA G+ GLG S S + N F+ C
Sbjct: 169 VRIDKVAFGCGSDNQGSF---AAAGGVLGLGQGPLSFGSQVGYA--YGNKFAYCLVNYLD 223
Query: 274 GSDGTGRISFGDK---GSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA---- 325
+ + + FGD+ TP PT Y + I +V+VGG ++ SA
Sbjct: 224 PTSVSSSLIFGDELISTIHDMQYTPIVSNPKSPTLYYVQIEKVTVGGKSLPISDSAWEID 283
Query: 326 -------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDL 363
IFDSGT+ TY AY+ I F+S R S L
Sbjct: 284 LLGNGGSIFDSGTTLTYWFPSAYSHILAAFDSGVHYPRAESVQGL 328
>gi|356524289|ref|XP_003530762.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Glycine max]
Length = 392
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 102/354 (28%), Positives = 150/354 (42%), Gaps = 57/354 (16%)
Query: 65 HRDRYFRLR-----GRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFI 119
R +Y + R GR + D T L +G+ N + V +G P
Sbjct: 6 ERVKYIQSRLSKNLGRENTVKDLDSTTLPAESGSLIGSAN-----YVVVVGLGTPKRDLS 60
Query: 120 VALDTGSDLFWLPCD-CV-SCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE--- 174
+ DTGSDL W C+ C SC ++ I+ P+ SS+ + + C S+LC
Sbjct: 61 LVFDTGSDLTWTQCEPCAGSCYKQQDA---------IFDPSKSSSYTNITCTSSLCTQLT 111
Query: 175 ---LQKQCPSA-GSNCPYQVRYLSDGTMSTGFLVEDVLHL-ATDEKQSKSVDSRISFGCG 229
++ +C S+ ++C Y +Y D + S GFL ++ L + ATD VD + FGCG
Sbjct: 112 SDGIKSECSSSTDASCIYDAKY-GDNSTSVGFLSQERLTITATD-----IVDDFL-FGCG 164
Query: 230 RVQTGSFLDGAAPNGLFGLGMDKTS-VPSILANQGLIPNSFSMCF--GSDGTGRISFGDK 286
+ G F +G+A GL GLG S V +N I FS C S G ++FG
Sbjct: 165 QDNEGLF-NGSA--GLMGLGRHPISIVQQTSSNYNKI---FSYCLPATSSSLGHLTFGAS 218
Query: 287 GSPGQG--ETPFS-LRQTHPTYNITITQVSVGGNAV----NFEFSA---IFDSGTSFTYL 336
+ TP S + + Y + I +SVGG + + FSA I DSGT T L
Sbjct: 219 AATNASLIYTPLSTISGDNSFYGLDIVSISVGGTKLPAVSSSTFSAGGSIIDSGTVITRL 278
Query: 337 NDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGG 390
Y + F EK + + CY LS + P ++ GG
Sbjct: 279 APTVYAALRSAFRR-XMEKYPVANEAGLLDTCYDLSGYK-EISVPRIDFEFSGG 330
>gi|15228044|ref|NP_181826.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|20197868|gb|AAM15292.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197965|gb|AAD21712.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330255100|gb|AEC10194.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 527
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 81/282 (28%), Positives = 111/282 (39%), Gaps = 46/282 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ +V VG P F + LDTGSDL WL C C C H +G Y P TS++
Sbjct: 160 YFMDVLVGTPPKHFSLILDTGSDLNWLQCLPCYDCFH----QNGM-----FYDPKTSASF 210
Query: 164 SKVPCNSTLCELQK------QCPSAGSNCPYQVRYLSDGTMSTGFLVED-VLHLATDEKQ 216
+ CN C L QC S +CPY Y + F VE ++L T E
Sbjct: 211 KNITCNDPRCSLISSPDPPVQCESDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTEGG 270
Query: 217 SKSVD-SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-- 273
S + FGCG G F + GL + +S Q L +SFS C
Sbjct: 271 SSEYKVGNMMFGCGHWNRGLFSGASGLLGLGRGPLSFSS-----QLQSLYGHSFSYCLVD 325
Query: 274 ---GSDGTGRISFG-DKGSPGQGETPFS------LRQTHPTYNITITQVSVGGNAVNF-- 321
++ + ++ FG DK F+ Y I I + VGG A++
Sbjct: 326 RNSNTNVSSKLIFGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILVGGKALDIPE 385
Query: 322 ---------EFSAIFDSGTSFTYLNDPAYTQISETFNSLAKE 354
+ I DSGT+ +Y +PAY I F KE
Sbjct: 386 ETWNISSDGDGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKE 427
>gi|225463766|ref|XP_002267930.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 479
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 95/386 (24%), Positives = 150/386 (38%), Gaps = 66/386 (17%)
Query: 46 LAVDDLPKKGSFAYYSALAHRDRY---------FRLRGRGLAAQGNDKTPLTF---SAGN 93
L V + ++G + + HRD+ RL GR L L S G
Sbjct: 59 LEVSEDHEEGGEKWMMKVVHRDQLSFGNSDDHRHRLDGR-LKRDAKRVASLIRRLSSGGG 117
Query: 94 DTYRLNSLGF-----------LHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHG 141
+YR++ G ++ + VG P S + +D+GSD+ W+ C C C H
Sbjct: 118 GSYRVDDFGTDVISGMEQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCTQCYHQ 177
Query: 142 LNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTG 201
+ ++ P S++ + V C+S++C+ + C Y+V Y DG+ + G
Sbjct: 178 SDP---------VFDPADSASFTGVSCSSSVCDRLENAGCHAGRCRYEVSY-GDGSYTKG 227
Query: 202 FLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILAN 261
L + L +++ ++ GCG G F+ A GL G M S L
Sbjct: 228 TLALETLTFG------RTMVRSVAIGCGHRNRGMFVGAAGLLGLGGGSM---SFVGQLGG 278
Query: 262 QGLIPNSFSMCF---GSDGTGRISFGDKGSP-GQGETPFSLRQTHPT-YNITITQVSVGG 316
Q +FS C G+D +G + FG + P G P P+ Y I + + VGG
Sbjct: 279 Q--TGGAFSYCLVSRGTDSSGSLVFGREALPAGAAWVPLVRNPRAPSFYYIGLAGLGVGG 336
Query: 317 NAVNF-----------EFSAIFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLP 364
V + + D+GT+ T L AY + F A R T +
Sbjct: 337 IRVPISEEVFRLTELGDGGVVMDTGTAVTRLPTLAYQAFRDAFLAQTANLPRATGVA--I 394
Query: 365 FEYCYVLSPNQTNFEYPVVNLTMKGG 390
F+ CY L + P V+ GG
Sbjct: 395 FDTCYDLL-GFVSVRVPTVSFYFSGG 419
>gi|297845610|ref|XP_002890686.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297336528|gb|EFH66945.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 486
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 87/303 (28%), Positives = 126/303 (41%), Gaps = 44/303 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++T V +G PA + LDTGSD+ WL C C C H I+ P++SS+
Sbjct: 151 YFTRVGIGNPAREVYMVLDTGSDVNWLQCTPCADCYHQTEP---------IFEPSSSSSY 201
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
+ C++ C + + C Y+V Y DG+ + G + L + + Q+
Sbjct: 202 EPLSCDTPQCNALEVSECRNATCLYEVSY-GDGSYTVGDFATETLTIGSTLVQN------ 254
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGTGR 280
++ GCG G F+ A L GLG ++PS L SFS C SD
Sbjct: 255 VAVGCGHSNEGLFVGAAG---LLGLGGGLLALPSQLNT-----TSFSYCLVDRDSDSAST 306
Query: 281 ISFGDKGSPGQGETPFSLR--QTHPTYNITITQVSVGGNAV-----NFEFSA------IF 327
+ FG P P LR Q Y + +T +SVGG + +FE I
Sbjct: 307 VEFGTSLPPDAVVAPL-LRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGIII 365
Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTM 387
DSGT+ T L Y + ++F E + F+ CY LS +T E P V
Sbjct: 366 DSGTAVTRLQTGIYNSLRDSFLK-GTSDLEKAAGVAMFDTCYNLSA-KTTIEVPTVAFHF 423
Query: 388 KGG 390
GG
Sbjct: 424 PGG 426
>gi|145693994|gb|ABP93697.1| unknown protein isoform 2 [Lemna minor]
Length = 351
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 89/298 (29%), Positives = 122/298 (40%), Gaps = 37/298 (12%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
+ V G P + V DTGSD+ WL C V C ++ P+ SST
Sbjct: 16 YVITVGFGTPTRTQTVVFDTGSDVNWLQCKPCAVRCYAQQEP---------LFDPSLSST 66
Query: 163 SSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
V C C + S C Y V Y DG+ + GFL D L +K +
Sbjct: 67 YRNVSCTEPACVGLSTRGCSSSTCLYGVFY-GDGSSTIGFLAMDTFMLTPAQKFKNFI-- 123
Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKT-SVPSILANQGLIPNSFSMCF--GSDGTG 279
FGCG+ TG F A GL GLG T S+ S +A + N FS C S TG
Sbjct: 124 ---FGCGQNNTGLFQGTA---GLVGLGRSSTYSLNSQVAPS--LGNVFSYCLPSTSSATG 175
Query: 280 RISFGD-KGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNF------EFSAIFDSGT 331
++ G+ + +PG R PT Y I + +SVGG ++ I DSGT
Sbjct: 176 YLNIGNPQNTPGYTAMLTDTRV--PTLYFIDLIGISVGGTRLSLSSTVFQSVGTIIDSGT 233
Query: 332 SFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKG 389
T L AY+ + + A + + + + CY S T+ YPV+ L G
Sbjct: 234 VITRLPPTAYSALKTAVRA-AMTQYTLAPAVTILDTCYDFS-RTTSVVYPVIVLHFAG 289
>gi|356511197|ref|XP_003524315.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 431
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 83/289 (28%), Positives = 125/289 (43%), Gaps = 49/289 (16%)
Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTS 160
+GF + T +++GQPA + + +DTGSDL WL CD C H + P
Sbjct: 68 VGFYNVT-LNIGQPARPYFLDVDTGSDLTWLQCD-APCTHCSETPH----------PLHR 115
Query: 161 STSSKVPCNSTLC-ELQKQCPSAGSNCP------YQVRYLSDGTMSTGFLVEDVLHLATD 213
++ VPC LC LQ P+ NC Y++ Y +D + G L+ DV L +
Sbjct: 116 PSNDFVPCRDPLCASLQ---PTEDYNCEHPDQCDYEINY-ADQYSTYGVLLNDVYLLNSS 171
Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
V R++ GCG Q S +GL GLG K S+ S L +QGL+ N C
Sbjct: 172 NGVQLKV--RMALGCGYDQVFSPSSYHPLDGLLGLGRGKASLISQLNSQGLVRNVIGHCL 229
Query: 274 GSDGTG-----------RISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF- 321
S G G R+++ TP S + Y+ ++ GG
Sbjct: 230 SSQGGGYIFFGNAYDSARVTW----------TPISSVDSK-HYSAGPAELVFGGRKTGVG 278
Query: 322 EFSAIFDSGTSFTYLNDPAYTQISETFN-SLAKEKRETSTSDLPFEYCY 369
+A+FD+G+S+TY N AY + N L+ + + + D C+
Sbjct: 279 SLTAVFDTGSSYTYFNSHAYQALLSWLNKELSGKPLKVAPDDQTLSLCW 327
>gi|302821814|ref|XP_002992568.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
gi|300139637|gb|EFJ06374.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
Length = 368
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 90/310 (29%), Positives = 130/310 (41%), Gaps = 53/310 (17%)
Query: 113 QPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTL 172
Q LS I+ DTGS+ + C S S V D P S + +VPC S L
Sbjct: 9 QKNLSAII--DTGSEAVLVQC---------GSRSRPVFD-----PAASQSYRQVPCISQL 52
Query: 173 C-ELQKQ--------CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
C +Q+Q C ++ + C Y + Y D STG +DV+ L + S++V R
Sbjct: 53 CLAVQQQTSNGSSQPCVNSSAACTYSLSY-GDSRNSTGDFSQDVIFLNSTNSSSQAVQFR 111
Query: 224 -ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD-----G 277
++FGC G FL G+ G S+PS L ++ L + FS CF S
Sbjct: 112 DVAFGCAHSPQG-FLVDLGSLGIVGFNRGNLSLPSQLKDR-LGGSKFSYCFPSQPWQPRA 169
Query: 278 TGRISFGDKG--SPGQGETPFSLRQTHPT----YNITITQVSVGGNAVNFEFSA------ 325
TG I GD G TP P Y + +T +SV G + SA
Sbjct: 170 TGVIFLGDSGLSKSKVSYTPLLDNPVTPARSQLYYVGLTSISVDGKTLAIPESAFKLDPS 229
Query: 326 ------IFDSGTSFTYLNDPAYTQISETFNSLAKEK-RETSTSDLPFEYCYVLSPNQTNF 378
+ DSGT+FT + D AYT F + + R+ + F+ CY +S +
Sbjct: 230 TGDGGTVLDSGTTFTRVVDDAYTAFRNAFAASNRSGLRKKVGAAAGFDDCYNISAGSSLP 289
Query: 379 EYPVVNLTMK 388
P V L+++
Sbjct: 290 GVPEVRLSLQ 299
>gi|242046812|ref|XP_002461152.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
gi|241924529|gb|EER97673.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
Length = 452
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 100/384 (26%), Positives = 156/384 (40%), Gaps = 59/384 (15%)
Query: 64 AHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALD 123
A RD L LAA+G + ++G + + + +G P ++A+D
Sbjct: 73 ASRDASRLLYLDSLAARGKARAYAPIASGRQLLQTPT----YVVRARLGTPPQQLLLAVD 128
Query: 124 TGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCEL--QKQCP 180
T +D W+PC C C +SS D P S++ VPC S LC CP
Sbjct: 129 TSNDAAWIPCAGCAGC----PTSSAPPFD-----PAASTSYRSVPCGSPLCAQAPNAACP 179
Query: 181 SAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGA 240
G C + + Y +D ++ L +D L +A D ++ +FGC + TG+ A
Sbjct: 180 PGGKACGFSLTY-ADSSLQAA-LSQDSLAVAGDAVKT------YTFGCLQKATGT---AA 228
Query: 241 APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS----DGTGRISFGDKGSPGQGETPF 296
P GL GLG S + + + +FS C S + +G + G G P + +T
Sbjct: 229 PPQGLLGLGRGPLSF--LSQTRDMYQGTFSYCLPSFKSLNFSGTLRLGRNGQPPRIKTTP 286
Query: 297 SLRQTHPT--YNITITQVSVGGNAVNFEFSA-----------IFDSGTSFTYLNDPAYTQ 343
L H + Y + +T + VG V A + DSGT FT L PAY
Sbjct: 287 LLANPHRSSLYYVNMTGIRVGRKVVPIPPPALAFDPATGAGTVLDSGTMFTRLVAPAYVA 346
Query: 344 ISETFNSLAKEKRETSTSDL-PFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIV 402
+ + + + S L F+ C+ N T +P V L G + +VI
Sbjct: 347 VRDEV----RRRVGAPVSSLGGFDTCF----NTTAVAWPPVTLLFDGMQVTLPEENVVIH 398
Query: 403 SSEPKGLYLYCLGVVKS-DNVNII 425
S+ + CL + + D VN +
Sbjct: 399 STYGT---ISCLAMAAAPDGVNTV 419
>gi|326532056|dbj|BAK01404.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 506
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 100/348 (28%), Positives = 144/348 (41%), Gaps = 44/348 (12%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ +V +G P F + +DTGSDL WL C C+ C SG + D P S +
Sbjct: 149 YLVDVYLGTPPRRFRMIMDTGSDLNWLQCAPCLDCFE----QSGPIFD-----PAASISY 199
Query: 164 SKVPCNSTLCEL--------QKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
V C C L ++C S+ CPY Y D + +TG L + + +
Sbjct: 200 RNVTCGDDRCRLVSPPAESAPRECRRPRSDPCPYYYWY-GDQSNTTGDLALEAFTVNLTQ 258
Query: 215 KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLI-PNSFSMCF 273
++ VD ++FGCG G F A L GLG S S L +G+ ++FS C
Sbjct: 259 SGTRRVDG-VAFGCGHRNRGLFHGAAG---LLGLGRGPLSFASQL--RGVYGGHAFSYCL 312
Query: 274 ---GSDGTGRISFGDKGS----PGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFE--- 322
GS +I FG + P T F+ T Y + + + VGG AVN
Sbjct: 313 VEHGSAAGSKIIFGHDDALLAHPQLNYTAFAPTTDADTFYYLQLKSILVGGEAVNISSDT 372
Query: 323 FSA---IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFE 379
SA I DSGT+ +Y +PAY I + F CY +S E
Sbjct: 373 LSAGGTIIDSGTTLSYFPEPAYQAIRQAFIDRMSPSYPLILGFPVLSPCYNVS-GAEKVE 431
Query: 380 YPVVNLTMKGGGPFFVNDPIVIVSSEPKGLY-LYCLGVVKSDNVNIIG 426
P ++L G + + EP+G+ L LG +S ++IIG
Sbjct: 432 VPELSLVFADGAAWEFPAENYFIRLEPEGIMCLAVLGTPRS-GMSIIG 478
>gi|293335828|ref|NP_001170221.1| uncharacterized protein LOC100384173 precursor [Zea mays]
gi|224034427|gb|ACN36289.1| unknown [Zea mays]
Length = 443
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 78/271 (28%), Positives = 107/271 (39%), Gaps = 45/271 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ + +G P + LDTGSDL W C C+ CV Q + + P S+T
Sbjct: 90 YLMEMGIGTPTRYYSAILDTGSDLIWTQCAPCLLCVD-------QPTPY--FDPARSATY 140
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
+ C S C C YQ Y D + G L + T+E +
Sbjct: 141 RSLGCASPACNALYYPLCYQKVCVYQYFY-GDSASTAGVLANETFTFGTNETRVSL--PG 197
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDGTGR 280
ISFGCG + G +G +G+ G G S+ S L + FS C F S R
Sbjct: 198 ISFGCGNLNAGLLANG---SGMVGFGRGSLSLVSQLGSP-----RFSYCLTSFLSPVPSR 249
Query: 281 ISFG--------DKGSPGQGETPFSLRQTHPT-YNITITQVSVGG------------NAV 319
+ FG + S TPF + PT Y + +T +SVGG N
Sbjct: 250 LYFGVYATLNSTNASSEPVQSTPFVVNPALPTMYFLNMTGISVGGYLLPIDPAVFAINDT 309
Query: 320 NFEFSAIFDSGTSFTYLNDPAYTQISETFNS 350
+ I DSGT+ TYL +PAY + F S
Sbjct: 310 DGTGGTIIDSGTTITYLAEPAYDAVRAAFAS 340
>gi|20197342|gb|AAC34482.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
Length = 353
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 92/360 (25%), Positives = 141/360 (39%), Gaps = 51/360 (14%)
Query: 108 NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
+S+G PA+ + +DTGSDL W C C C I+ P SS+ SKV
Sbjct: 2 ELSIGNPAVKYSAIVDTGSDLIWTQCKPCTECFDQPTP---------IFDPEKSSSYSKV 52
Query: 167 PCNSTLCEL--QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRI 224
C+S LC + C C Y Y D + + G L + + ++ S I
Sbjct: 53 GCSSGLCNALPRSNCNEDKDACEYLYTY-GDYSSTRGLLATETFTF-----EDENSISGI 106
Query: 225 SFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGL---------IPNSFSMCFG 274
FGCG G DG + +GL GLG S+ S L S S+ G
Sbjct: 107 GFGCGVENEG---DGFSQGSGLVGLGRGPLSLISQLKETKFSYCLTSIEDSEASSSLFIG 163
Query: 275 SDGTGRISFGDKGSPGQGETPFSLRQT--HPT-YNITITQVSVGGNAVNFEFSA------ 325
S +G ++ G+ SL + P+ Y + + ++VG ++ E S
Sbjct: 164 SLASGIVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELAED 223
Query: 326 -----IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEY 380
I DSGT+ TYL + A+ + E F S + S S + C+ L N
Sbjct: 224 GTGGMIIDSGTTITYLEETAFKVLKEEFTSRMSLPVDDSGS-TGLDLCFKLPDAAKNIAV 282
Query: 381 PVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREYPIANNISLFHN 440
P + KG + ++ S L CL + S+ ++I G N ++ H+
Sbjct: 283 PKMIFHFKGADLELPGENYMVADSSTGVL---CLAMGSSNGMSIFGNVQ--QQNFNVLHD 337
>gi|325183198|emb|CCA17656.1| aspartyl protease family A01B putative [Albugo laibachii Nc14]
Length = 656
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 84/333 (25%), Positives = 142/333 (42%), Gaps = 38/333 (11%)
Query: 74 GRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPC 133
R L + L S N+ LN HY + VG P + +DTGS + PC
Sbjct: 64 ARTLQIAKTYRRSLFTSDQNEVVPLNLGMGTHYAWIYVGTPPQRVSIIIDTGSGMTAFPC 123
Query: 134 D-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRY 192
C C + + I FN N SS+ + CN C + C R
Sbjct: 124 SGCDQCGNHTD------IPFNT---NLSSSIQPISCNHRTYFSCAYCTNPTEPC----RT 170
Query: 193 LSDGTMSTGFLVEDVLHL-----ATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFG 247
+G+ + ++ED+++L A D S +R FGC +TG F+ A +G+ G
Sbjct: 171 YMEGSSWSAKVMEDIVYLGDVASAKDTNLHHSYSTRYMFGCQNKETGLFIPQVA-DGIMG 229
Query: 248 LGMDKTSVPSILANQGLIP-NSFSMCFGSDGTGRISFGDKG-SPGQGETPFSLRQT---H 302
+ + + + L + IP N+F++CF G G + G S GE ++
Sbjct: 230 IHNNGNDIVTKLFREKKIPSNTFTLCFSPRG-GYFALGAMDTSRHAGEVTYARINDAYGE 288
Query: 303 PTYNITITQVSVGGNAVNFEFSA------IFDSGTSFTYLNDPAYTQISETFNSLAKEKR 356
Y + +T + VGG++++ + A I DSGT+ + ++ A + + + +L K
Sbjct: 289 NYYAVFMTDIRVGGHSIDIDMKATNSYRYIVDSGTTNSIISGRAGQALMDLYRNLTHLKN 348
Query: 357 ETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKG 389
+ +D C +LSP+Q + P + M+G
Sbjct: 349 PLNDND-----CILLSPSQIE-QLPTLQFVMEG 375
>gi|125590542|gb|EAZ30892.1| hypothetical protein OsJ_14967 [Oryza sativa Japonica Group]
Length = 516
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 98/344 (28%), Positives = 138/344 (40%), Gaps = 56/344 (16%)
Query: 111 VGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCN 169
+G PAL++ +DTGSDL W C CV C ++ P++SST + VPC+
Sbjct: 173 IGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTP---------VFDPSSSSTYATVPCS 223
Query: 170 STLCE--LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFG 227
S C +C SA S C Y Y D + + G L + LA KS + FG
Sbjct: 224 SASCSDLPTSKCTSA-SKCGYTYTY-GDSSSTQGVLATETFTLA------KSKLPGVVFG 275
Query: 228 CGRVQTGS-FLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGTGR----- 280
CG G F GA GL GLG S+ S L GL + FS C S D T
Sbjct: 276 CGDTNEGDGFSQGA---GLVGLGRGPLSLVSQL---GL--DKFSYCLTSLDDTNNSPLLL 327
Query: 281 -----ISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA--------- 325
IS + TP + P+ Y +++ ++VG ++ SA
Sbjct: 328 GSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTG 387
Query: 326 --IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTN-FEYPV 382
I DSGTS TYL Y + + F + S + + C+ + E P
Sbjct: 388 GVIVDSGTSITYLEVQGYRALKKAFAAQMALP-AADGSGVGLDLCFRAPAKGVDQVEVPR 446
Query: 383 VNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
+ GG + +V G CL V+ S ++IIG
Sbjct: 447 LVFHFDGGADLDLPAENYMVLDGGSG--ALCLTVMGSRGLSIIG 488
>gi|190896584|gb|ACE96805.1| aspartyl protease [Populus tremula]
gi|190896586|gb|ACE96806.1| aspartyl protease [Populus tremula]
gi|190896588|gb|ACE96807.1| aspartyl protease [Populus tremula]
gi|190896590|gb|ACE96808.1| aspartyl protease [Populus tremula]
gi|190896592|gb|ACE96809.1| aspartyl protease [Populus tremula]
gi|190896594|gb|ACE96810.1| aspartyl protease [Populus tremula]
gi|190896596|gb|ACE96811.1| aspartyl protease [Populus tremula]
gi|190896598|gb|ACE96812.1| aspartyl protease [Populus tremula]
gi|190896600|gb|ACE96813.1| aspartyl protease [Populus tremula]
gi|190896602|gb|ACE96814.1| aspartyl protease [Populus tremula]
gi|190896604|gb|ACE96815.1| aspartyl protease [Populus tremula]
gi|190896606|gb|ACE96816.1| aspartyl protease [Populus tremula]
gi|190896610|gb|ACE96818.1| aspartyl protease [Populus tremula]
gi|190896612|gb|ACE96819.1| aspartyl protease [Populus tremula]
gi|190896614|gb|ACE96820.1| aspartyl protease [Populus tremula]
gi|190896616|gb|ACE96821.1| aspartyl protease [Populus tremula]
gi|190896618|gb|ACE96822.1| aspartyl protease [Populus tremula]
gi|190896620|gb|ACE96823.1| aspartyl protease [Populus tremula]
gi|190896622|gb|ACE96824.1| aspartyl protease [Populus tremula]
gi|190896624|gb|ACE96825.1| aspartyl protease [Populus tremula]
gi|190896626|gb|ACE96826.1| aspartyl protease [Populus tremula]
gi|190896628|gb|ACE96827.1| aspartyl protease [Populus tremula]
gi|190896630|gb|ACE96828.1| aspartyl protease [Populus tremula]
gi|190896632|gb|ACE96829.1| aspartyl protease [Populus tremula]
gi|190896634|gb|ACE96830.1| aspartyl protease [Populus tremula]
gi|190896636|gb|ACE96831.1| aspartyl protease [Populus tremula]
gi|190896638|gb|ACE96832.1| aspartyl protease [Populus tremula]
gi|190896640|gb|ACE96833.1| aspartyl protease [Populus tremula]
gi|190896642|gb|ACE96834.1| aspartyl protease [Populus tremula]
gi|190896644|gb|ACE96835.1| aspartyl protease [Populus tremula]
gi|190896646|gb|ACE96836.1| aspartyl protease [Populus tremula]
gi|190896648|gb|ACE96837.1| aspartyl protease [Populus tremula]
gi|190896650|gb|ACE96838.1| aspartyl protease [Populus tremula]
gi|190896652|gb|ACE96839.1| aspartyl protease [Populus tremula]
gi|190896654|gb|ACE96840.1| aspartyl protease [Populus tremula]
gi|190896656|gb|ACE96841.1| aspartyl protease [Populus tremula]
gi|190896658|gb|ACE96842.1| aspartyl protease [Populus tremula]
Length = 339
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 84/306 (27%), Positives = 119/306 (38%), Gaps = 51/306 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ V +G P + LDT +D W+PC C C + PN S+T
Sbjct: 45 YVVRVKLGTPGQQMFMVLDTSNDAAWVPCSGCTGC------------SSTTFLPNASTTL 92
Query: 164 SKVPCNSTLCELQK--QCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
+ C+ C + CP+ GS+ C + Y D +++ LV+D + LA D V
Sbjct: 93 GSLDCSEAQCSQVRGFSCPATGSSACLFNQSYGGDSSLA-ATLVQDAITLAND------V 145
Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDG--- 277
+FGC +G + P GL GLG S+ I + FS C S
Sbjct: 146 IPGFTFGCINAVSGGSIP---PQGLLGLGRGPISL--ISQAGAMYSGVFSYCLPSFKSYY 200
Query: 278 -TGRISFGDKGSPGQGETPFSLRQTH-PT-YNITITQVSVGGNAV-----------NFEF 323
+G + G G P T LR H P+ Y + +T VSVG V N
Sbjct: 201 FSGSLKLGPVGQPKSIRTTPLLRNPHRPSLYYVNLTGVSVGRIKVPIPSEQLVFDPNTGA 260
Query: 324 SAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
I DSGT T P Y I + F K+ +S F+ C+ + E P V
Sbjct: 261 GTIIDSGTVITRFVQPVYFAIRDEFR---KQVNGPISSLGAFDTCFAAT---NEAEAPAV 314
Query: 384 NLTMKG 389
L +G
Sbjct: 315 TLHFEG 320
>gi|110738505|dbj|BAF01178.1| hypothetical protein [Arabidopsis thaliana]
Length = 284
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 54/174 (31%), Positives = 81/174 (46%), Gaps = 20/174 (11%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
T + +G P F + +D+GS + ++PC DC C G+ D + P SST
Sbjct: 95 TRLWIGTPPQMFALIVDSGSTVTYVPCSDCEQC--------GKHQDPK-FQPEMSSTYQP 145
Query: 166 VPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
V CN + C C Y+ Y ++ + S G L ED++ +S+ R
Sbjct: 146 VKCN-----MDCNCDDDREQCVYEREY-AEHSSSKGVLGEDLISFG---NESQLTPQRAV 196
Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTG 279
FGC V+TG A +G+ GLG S+ L ++GLI NSF +C+G G
Sbjct: 197 FGCETVETGDLYSQRA-DGIIGLGQGDLSLVDQLVDKGLISNSFGLCYGGMDVG 249
>gi|224067042|ref|XP_002302336.1| predicted protein [Populus trichocarpa]
gi|222844062|gb|EEE81609.1| predicted protein [Populus trichocarpa]
Length = 438
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 84/306 (27%), Positives = 119/306 (38%), Gaps = 51/306 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ V +G P + LDT +D W+PC C C + PN S+T
Sbjct: 98 YVVRVKLGTPGQQMFMVLDTSNDAAWVPCSGCTGCSS------------TTFLPNASTTL 145
Query: 164 SKVPCNSTLCELQK--QCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
+ C+ C + CP+ GS+ C + Y D ++ T LV+D + LA D V
Sbjct: 146 GSLDCSGAQCSQVRGFSCPATGSSACLFNQSYGGDSSL-TATLVQDAITLAND------V 198
Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDG--- 277
+FGC +G + P GL GLG S+ I + FS C S
Sbjct: 199 IPGFTFGCINAVSGGSI---PPQGLLGLGRGPISL--ISQAGAMYSGVFSYCLPSFKSYY 253
Query: 278 -TGRISFGDKGSPGQGETPFSLRQTH-PT-YNITITQVSVGGNAV-----------NFEF 323
+G + G G P T LR H P+ Y + +T VSVG V N
Sbjct: 254 FSGSLKLGPVGQPKSIRTTPLLRNPHRPSLYYVNLTGVSVGRIKVPIPSEQLVFDPNTGA 313
Query: 324 SAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
I DSGT T P Y I + F K+ +S F+ C+ + E P +
Sbjct: 314 GTIIDSGTVITRFVQPVYFAIRDEFR---KQVNGPISSLGAFDTCFAAT---NEAEAPAI 367
Query: 384 NLTMKG 389
L +G
Sbjct: 368 TLHFEG 373
>gi|4063754|gb|AAC98462.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197956|gb|AAM15330.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
Length = 389
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 77/328 (23%), Positives = 138/328 (42%), Gaps = 49/328 (14%)
Query: 103 FLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
+ + + +G P LDTGS+ W C+ CVH N ++ I+ P+ SST
Sbjct: 57 YEYLMKLQIGTPPFEIEAVLDTGSEHIW--TQCLPCVHCYNQTA------PIFDPSKSST 108
Query: 163 SSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
++ +C + +CPY++ Y + + G LV + + + + Q +
Sbjct: 109 FKEI-----------RCDTHDHSCPYELVY-GGKSYTKGTLVTETVTIHSTSGQPFVMPE 156
Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRIS 282
I GCGR +G F G A G+ +G+D+ I G P S CF GT +I+
Sbjct: 157 TI-IGCGRNNSG-FKPGFA--GV--VGLDRGPKSLITQMGGEYPGLMSYCFAGKGTSKIN 210
Query: 283 FGDK---GSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVN--------FEFSAIFDSG 330
FG G T ++ P Y + + VSVG + + + + DSG
Sbjct: 211 FGANAIVAGDGVVSTTVFVKTAKPGFYYLNLDAVSVGNTRIETVGTPFHALKGNIVIDSG 270
Query: 331 TSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFE-YPVVNLTMKG 389
++ TY E++ +L ++ E + + F +L + +PV+ + G
Sbjct: 271 STLTYF--------PESYCNLVRKAVEQVVTAVRFPRSDILCYYSKTIDIFPVITMHFSG 322
Query: 390 GGPFFVNDPIVIVSSEPKGLYLYCLGVV 417
G ++ + V+S G ++CL ++
Sbjct: 323 GADLVLDKYNMYVASNTGG--VFCLAII 348
>gi|190896608|gb|ACE96817.1| aspartyl protease [Populus tremula]
Length = 339
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 86/307 (28%), Positives = 121/307 (39%), Gaps = 53/307 (17%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ V +G P + LDT +D W+PC C C + PN S+T
Sbjct: 45 YVVRVKLGTPGQQMFMVLDTSNDAAWVPCSGCTGC------------SSTTFLPNASTTL 92
Query: 164 SKVPCNSTLCELQK--QCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
+ C+ C + CP+ GS+ C + Y D +++ LV+D + LA D V
Sbjct: 93 GSLDCSEAQCSQVRGFSCPATGSSACLFNQSYGGDSSLA-ATLVQDAITLAND------V 145
Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDG--- 277
+FGC +G + P GL GLG S+ I + FS C S
Sbjct: 146 IPGFTFGCINAVSGGSIP---PQGLLGLGRGPISL--ISQAGAMYSGVFSYCLPSFKSYY 200
Query: 278 -TGRISFGDKGSPGQGETPFSLRQTH-PT-YNITITQVSVGGNAV-----------NFEF 323
+G + G G P T LR H P+ Y + +T VSVG V N
Sbjct: 201 FSGSLKLGPVGQPKSIRTTPLLRNPHRPSLYYVNLTGVSVGRIKVPIPSEQLVFDPNTGA 260
Query: 324 SAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTN-FEYPV 382
I DSGT T P Y I + F K+ +S F+ C+ +TN E P
Sbjct: 261 GTIIDSGTVITRFVQPVYFAIRDEFR---KQVNGPISSLGAFDTCFA----ETNEAEAPA 313
Query: 383 VNLTMKG 389
V L +G
Sbjct: 314 VTLHFEG 320
>gi|224124882|ref|XP_002329972.1| predicted protein [Populus trichocarpa]
gi|222871994|gb|EEF09125.1| predicted protein [Populus trichocarpa]
Length = 332
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 83/338 (24%), Positives = 139/338 (41%), Gaps = 43/338 (12%)
Query: 122 LDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQ-- 178
LDTGS L WL C C H +Y P+ S T K+ C S C K
Sbjct: 3 LDTGSSLSWLQCQPCAVYCHAQADP--------LYDPSVSKTYKKLSCASVECSRLKAAT 54
Query: 179 -----CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQT 233
C + + C Y Y D + S G+L +D+L L + + + ++GCG+
Sbjct: 55 LNDPLCETDSNACLYTASY-GDTSFSIGYLSQDLLTLTSSQTLPQ-----FTYGCGQDNQ 108
Query: 234 GSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKG----SP 289
G F A G+ GL DK S+ + L+ + ++FS C + +G G SP
Sbjct: 109 GLFGRAA---GIIGLARDKLSMLAQLSTK--YGHAFSYCLPTANSGSSGGGFLSIGSISP 163
Query: 290 GQGE-TPFSLRQTHPT-YNITITQVSVGGN-----AVNFEFSAIFDSGTSFTYLNDPAYT 342
+ TP +P+ Y + +T ++V G A + + DSGT T L Y
Sbjct: 164 TSYKFTPMLTDSKNPSLYFLRLTAITVSGRPLDLAAAMYRVPTLIDSGTVITRLPMSMYA 223
Query: 343 QISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIV 402
+ + F + K + + + C+ S + P + + +GG + P +++
Sbjct: 224 ALRQAFVKIMSTKYAKAPAYSILDTCFKGSLKSIS-AVPEIKMIFQGGADLTLRAPSILI 282
Query: 403 SSEPKGLYLYCLGVVKSDNVNIIG----REYPIANNIS 436
++ L G ++ + IIG + Y IA ++S
Sbjct: 283 EADKGITCLAFAGSSGTNQIAIIGNRQQQTYNIAYDVS 320
>gi|449527151|ref|XP_004170576.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 523
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 79/300 (26%), Positives = 124/300 (41%), Gaps = 36/300 (12%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
++ + VGQP S+ DTGSD+ WL C +G G + D P +SS+ S
Sbjct: 184 YFARIGVGQPVQSYFFVPDTGSDVSWLQCQPCDGENGCYKQIGPIFD-----PKSSSSYS 238
Query: 165 KVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRI 224
+ C+S C L + ++C Y+V Y DG+ + G L + + S S+ +
Sbjct: 239 PLSCDSEQCHLLDEAACDANSCIYEVEY-GDGSFTVGELATETFSF----RHSNSI-PNL 292
Query: 225 SFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDGTGRI 281
GCG G F+ A + L++Q L SFS C S+ + +
Sbjct: 293 PIGCGHDNEGLFVGAAG-------LIGLGGGAISLSSQ-LEATSFSYCLVDLDSESSSTL 344
Query: 282 SFGDKGSPGQGETPFSLRQTHPTYN-ITITQVSVGGNAV-----NFEFSA------IFDS 329
F +P PT+ + + +SVGG + +FE I DS
Sbjct: 345 DFNADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDESGSGGIIVDS 404
Query: 330 GTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKG 389
GT+ T + Y + + F L K + PF+ CY LS +Q+N E P + + G
Sbjct: 405 GTTITEIPSDVYDVLRDAFVGLTK-NLPPAPGVSPFDTCYDLS-SQSNVEVPTIAFILPG 462
>gi|449440931|ref|XP_004138237.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 523
Score = 72.0 bits (175), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 78/300 (26%), Positives = 122/300 (40%), Gaps = 36/300 (12%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
++ + VGQP S+ DTGSD+ WL C +G G + D P +SS+ S
Sbjct: 184 YFARIGVGQPVQSYFFVPDTGSDVSWLQCQPCDGENGCYKQIGPIFD-----PKSSSSYS 238
Query: 165 KVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRI 224
+ C+S C L + ++C Y+V Y DG+ + G L + + S S+ +
Sbjct: 239 PLSCDSEQCHLLDEAACDANSCIYEVEY-GDGSFTVGELATETFSF----RHSNSI-PNL 292
Query: 225 SFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDGTGRI 281
GCG G F+ + L++Q L SFS C S+ + +
Sbjct: 293 PIGCGHDNEGLFVGADG-------LIGLGGGAISLSSQ-LEATSFSYCLVDLDSESSSTL 344
Query: 282 SFGDKGSPGQGETPFSLRQTHPTYN-ITITQVSVGGNAV-----NFEFSA------IFDS 329
F +P PT+ + + +SVGG + +FE I DS
Sbjct: 345 DFNADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDESGSGGIIVDS 404
Query: 330 GTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKG 389
GT+ T + Y + + F L K PF+ CY LS +Q+N E P + + G
Sbjct: 405 GTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVS-PFDTCYDLS-SQSNVEVPTIAFILPG 462
>gi|125558622|gb|EAZ04158.1| hypothetical protein OsI_26300 [Oryza sativa Indica Group]
Length = 435
Score = 72.0 bits (175), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 98/352 (27%), Positives = 144/352 (40%), Gaps = 66/352 (18%)
Query: 108 NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
N+SVG P L+F V DTGSDL W C C C + P +SST SK+
Sbjct: 89 NISVGTPLLTFPVVADTGSDLIWTQCAPCTKCFQQPAPP---------FQPASSSTFSKL 139
Query: 167 PCNSTLCELQ----KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
PC S+ C+ + C + G C Y +Y S T G+L + L + S
Sbjct: 140 PCTSSFCQFLPNSIRTCNATG--CVYNYKYGSGYT--AGYLATETLKVGDASFPS----- 190
Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPN----SFSMCFGSD-- 276
++FGC + G G + +G+ GLG S LIP FS C S
Sbjct: 191 -VAFGC-STENGV---GNSTSGIAGLGRGALS---------LIPQLGVGRFSYCLRSGSA 236
Query: 277 -GTGRISFGDKGSPGQG---ETPF-SLRQTHPT-YNITITQVSVGGNAV-----NFEFS- 324
G I FG + G TPF + HP+ Y + +T ++VG + F F+
Sbjct: 237 AGASPILFGSLANLTDGNVQSTPFVNNPAVHPSYYYVNLTGITVGETDLPVTTSTFGFTQ 296
Query: 325 ------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNF 378
I DSGT+ TYL Y + + F S T + C+ +
Sbjct: 297 NGLGGGTIVDSGTTLTYLAKDGYEMVKQAFLS-QTANVTTVNGTRGLDLCFKSTGGGGGI 355
Query: 379 EYPVVNLTMKGGGPFFVNDPIVIVSSEPKG-LYLYCLGVV--KSDN-VNIIG 426
P + L GG + V V ++ +G + + CL ++ K D +++IG
Sbjct: 356 AVPSLVLRFDGGAEYAVPTYFAGVETDSQGSVTVACLMMLPAKGDQPMSVIG 407
>gi|302756591|ref|XP_002961719.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
gi|300170378|gb|EFJ36979.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
Length = 357
Score = 72.0 bits (175), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 80/274 (29%), Positives = 121/274 (44%), Gaps = 46/274 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ + +G P S+ + LDTGSD+ W+ C C SC ++ IY P+ SS+
Sbjct: 12 YFARMGIGNPQRSYYLELDTGSDVTWIQCAPCSSCYSQVDP---------IYDPSNSSSY 62
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
+V C S LC+ G C Y+V Y D + S+G L + +L + S +
Sbjct: 63 RRVYCGSALCQALDYSACQGMGCSYRVVY-GDSSASSGDLGIESFYLGPN---SSTAMRN 118
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRIS- 282
I+FGCG +G F A G+ G + S I A+ G +FS C R S
Sbjct: 119 IAFGCGHSNSGLFRGEAGLLGMGGGTLSFFS--QIAASIG---PAFSYCL----VDRYSQ 169
Query: 283 FGDKGSP---GQGETPFSLRQT----HPTYNI----TITQVSVGGNAV-----------N 320
+ SP G+ PF+ R T +P N +T +SVGG + N
Sbjct: 170 LQSRSSPLIFGRTAIPFAARFTPLLKNPRINTFYYAVLTGISVGGTPLPIPPAQFALTGN 229
Query: 321 FEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKE 354
AI DSGTS T + PAY + + + + ++
Sbjct: 230 GTGGAILDSGTSVTRVVPPAYAVLRDAYRAASRN 263
>gi|356567196|ref|XP_003551807.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 490
Score = 72.0 bits (175), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 89/323 (27%), Positives = 130/323 (40%), Gaps = 48/323 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
++ V +G P + DTGSDL W C+ SC + I+ P+ S++
Sbjct: 146 YFVVVGLGTPKRDLSLIFDTGSDLTWTQCEPCARSCYKQQDV---------IFDPSKSTS 196
Query: 163 SSKVPCNSTLCELQKQ-------CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL-ATDE 214
S + C S LC C ++ C Y ++Y D + S G+ + L + ATD
Sbjct: 197 YSNITCTSALCTQLSTATGNDPGCSASTKACIYGIQY-GDSSFSVGYFSRERLTVTATD- 254
Query: 215 KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG 274
V FGCG+ G F A GL GLG S A + S+ +
Sbjct: 255 -----VVDNFLFGCGQNNQGLFGGSA---GLIGLGRHPISFVQQTAAKYRKIFSYCLPST 306
Query: 275 SDGTGRISFGDKGSPGQGE-TPFS-LRQTHPTYNITITQVSVGGNAVNFEFS------AI 326
S TG +SFG + + TPFS + + Y + IT ++VGG + S AI
Sbjct: 307 SSSTGHLSFGPAATGRYLKYTPFSTISRGSSFYGLDITAIAVGGVKLPVSSSTFSTGGAI 366
Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLT 386
DSGT T L AY + F K ++ + CY LS + F P + +
Sbjct: 367 IDSGTVITRLPPTAYGALRSAFRQ-GMSKYPSAGELSILDTCYDLSGYKV-FSIPTIEFS 424
Query: 387 MKGGGPFFVNDPIVIVSSEPKGL 409
GG V V P+G+
Sbjct: 425 FAGG---------VTVKLPPQGI 438
>gi|326524580|dbj|BAK00673.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 461
Score = 72.0 bits (175), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 78/274 (28%), Positives = 115/274 (41%), Gaps = 42/274 (15%)
Query: 102 GFLHY-TNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNT 159
G L Y ++++G P LDTGSDL W C C SC+ + +++P
Sbjct: 99 GDLEYLIDLAIGTPPQPVSALLDTGSDLIWTQCAPCASCLAQPDP---------LFAPAA 149
Query: 160 SSTSSKVPCNSTLCE--LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
SS+ + C+ LC L C C Y+ Y DGT + G + A+ +
Sbjct: 150 SSSYVPMRCSGQLCNDILHHSCQRP-DTCTYRYNY-GDGTTTLGVYATERFTFASSSGEK 207
Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQG----LIP----NSF 269
SV + FGCG + GS +G +G+ G G D S+ S L+ + L P
Sbjct: 208 LSVP--LGFGCGTMNVGSLNNG---SGIVGFGRDPLSLVSQLSIRRFSYCLTPYTSTRKS 262
Query: 270 SMCFGSDGTGRISFGDKGSPGQGETPFSL--RQTHPTYNITITQVSVGGNAVNFEFSA-- 325
++ FGS G + GD + GQ +T L RQ Y + T V+VG + SA
Sbjct: 263 TLMFGSLSDG-VFEGDDAATGQVQTTRLLQSRQNPTFYYVPFTGVTVGTRRLRIPLSAFA 321
Query: 326 ---------IFDSGTSFTYLNDPAYTQISETFNS 350
I DSGT+ T T++ F +
Sbjct: 322 LRPDGSGGVIVDSGTALTLFPAAVLTEVLRAFRA 355
>gi|30683732|ref|NP_180371.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|28392898|gb|AAO41885.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|56382011|gb|AAV85724.1| At2g28040 [Arabidopsis thaliana]
gi|330252978|gb|AEC08072.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 395
Score = 72.0 bits (175), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 77/328 (23%), Positives = 138/328 (42%), Gaps = 49/328 (14%)
Query: 103 FLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
+ + + +G P LDTGS+ W C+ CVH N ++ I+ P+ SST
Sbjct: 63 YEYLMKLQIGTPPFEIEAVLDTGSEHIW--TQCLPCVHCYNQTA------PIFDPSKSST 114
Query: 163 SSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
++ +C + +CPY++ Y + + G LV + + + + Q +
Sbjct: 115 FKEI-----------RCDTHDHSCPYELVY-GGKSYTKGTLVTETVTIHSTSGQPFVMPE 162
Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRIS 282
I GCGR +G F G A G+ +G+D+ I G P S CF GT +I+
Sbjct: 163 TI-IGCGRNNSG-FKPGFA--GV--VGLDRGPKSLITQMGGEYPGLMSYCFAGKGTSKIN 216
Query: 283 FGDK---GSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVN--------FEFSAIFDSG 330
FG G T ++ P Y + + VSVG + + + + DSG
Sbjct: 217 FGANAIVAGDGVVSTTVFVKTAKPGFYYLNLDAVSVGNTRIETVGTPFHALKGNIVIDSG 276
Query: 331 TSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFE-YPVVNLTMKG 389
++ TY E++ +L ++ E + + F +L + +PV+ + G
Sbjct: 277 STLTYF--------PESYCNLVRKAVEQVVTAVRFPRSDILCYYSKTIDIFPVITMHFSG 328
Query: 390 GGPFFVNDPIVIVSSEPKGLYLYCLGVV 417
G ++ + V+S G ++CL ++
Sbjct: 329 GADLVLDKYNMYVASNTGG--VFCLAII 354
>gi|118486912|gb|ABK95290.1| unknown [Populus trichocarpa]
Length = 438
Score = 72.0 bits (175), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 85/305 (27%), Positives = 123/305 (40%), Gaps = 49/305 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+ V +G P + LDT +D W+PC S G +S++ + PN S+T
Sbjct: 98 YVVRVKLGTPGQQMFMVLDTSNDAAWVPC---SGCTGFSSTT--------FLPNASTTLG 146
Query: 165 KVPCNSTLCELQK--QCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
+ C+ C + CP+ GS+ C + Y D ++ T LV+D + LA D V
Sbjct: 147 SLDCSGAQCSQVRGFSCPATGSSACLFNQSYGGDSSL-TATLVQDAITLAND------VI 199
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDG---- 277
+FGC +G + P GL GLG S+ I + FS C S
Sbjct: 200 PGFTFGCINAVSGGSI---PPQGLLGLGRGPISL--ISQAGAMYSGVFSYCLPSFKSYYF 254
Query: 278 TGRISFGDKGSPGQGETPFSLRQTH-PT-YNITITQVSVGGNAV-----------NFEFS 324
+G + G G P T LR H P+ Y + +T VSVG V N
Sbjct: 255 SGSLKLGPVGQPKSIRTTPLLRNPHRPSLYYVNLTGVSVGRIKVPIPSEQLVFDPNTGAG 314
Query: 325 AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVN 384
I DSGT T P Y I + F K+ +S F+ C+ + E P +
Sbjct: 315 TIIDSGTVITRFVQPVYFAIRDEFR---KQVNGPISSLGAFDTCFAAT---NEAEAPAIT 368
Query: 385 LTMKG 389
L +G
Sbjct: 369 LHFEG 373
>gi|226508202|ref|NP_001141111.1| hypothetical protein precursor [Zea mays]
gi|194702684|gb|ACF85426.1| unknown [Zea mays]
gi|414590469|tpg|DAA41040.1| TPA: hypothetical protein ZEAMMB73_571218 [Zea mays]
Length = 439
Score = 71.6 bits (174), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 92/364 (25%), Positives = 145/364 (39%), Gaps = 58/364 (15%)
Query: 52 PKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKT---PLTFSAGNDTYRLNSLGFLHYTN 108
P + + A HRD + R R LAA +D T P++ + + +
Sbjct: 39 PSVTASQFVRAALHRDMH-RHNARKLAASSSDGTVSAPVSPTTVPGEFLMT--------- 88
Query: 109 VSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVID--FNIYSPNTSSTSSKV 166
+++G P L F+ DTGSDL W C C S Q +Y+P++S+T S +
Sbjct: 89 LAIGTPPLPFLAIADTGSDLIW--TQCAPC-------SRQCFQQPTPLYNPSSSTTFSAL 139
Query: 167 PCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISF 226
PCNS+L C C Y + Y S T F + + + I+F
Sbjct: 140 PCNSSLGLCAPAC-----ACMYNMTYGSGWTYV--FQGTETFTFGSSTPADQVRVPGIAF 192
Query: 227 GCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GSDGTGRIS 282
GC +G + ++ +GL GLG S+ S L FS C ++ T +
Sbjct: 193 GCSNASSG--FNASSASGLVGLGRGSLSLVSQLGAP-----KFSYCLTPYQDTNSTSTLL 245
Query: 283 FGDKGSPGQ----GETPFSLRQTHPTYNITITQVSVGGNAV-----NFEFSA------IF 327
G S TPF + Y + +T +S+G A+ F A I
Sbjct: 246 LGPSASLNDTGVVSSTPFVASPSSIYYYLNLTGISLGTTALPIPPNAFSLKADGTGGLII 305
Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTM 387
DSGT+ T L + AY Q+ SL ++ + C+ L P+ T+ + ++T+
Sbjct: 306 DSGTTITMLGNTAYQQVRAAVLSLVTLPTTDGSAATGLDLCFEL-PSSTSAPPSMPSMTL 364
Query: 388 KGGG 391
G
Sbjct: 365 HFDG 368
>gi|413951280|gb|AFW83929.1| hypothetical protein ZEAMMB73_279135 [Zea mays]
Length = 451
Score = 71.6 bits (174), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 85/340 (25%), Positives = 130/340 (38%), Gaps = 50/340 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+ +G PA + +VA+D +D W+PC + S + P SST
Sbjct: 107 YVARARLGTPAQALLVAIDPSNDAAWVPCAACAGCARAPS----------FDPTRSSTYR 156
Query: 165 KVPCNSTLCELQKQCPSA----GSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
V C + C Q PS GS+C + + Y + + L +D L L D +
Sbjct: 157 PVRCGAPQCS-QAPAPSCPGGLGSSCAFNLSYAA--STFQALLGQDALALHDDVDAVAA- 212
Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GSD 276
+FGC V TG + P GL G G S PS + + + FS C S+
Sbjct: 213 ---YTFGCLHVVTGGSVP---PQGLVGFGRGPLSFPS--QTKDVYGSVFSYCLPSYKSSN 264
Query: 277 GTGRISFGDKGSPGQGETPFSLRQTH-PT-YNITITQVSVGGNAVNFEFSA--------- 325
+G + G G P + +T L H P+ Y + + + VGG V SA
Sbjct: 265 FSGTLRLGPAGQPKRIKTTPLLSNPHRPSLYYVNMVGIRVGGRPVPVPASALAFDPTSGR 324
Query: 326 --IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
I D+GT FT L+ P Y + + F S + F+ CY P V
Sbjct: 325 GTIVDAGTMFTRLSAPVYAAVRDVFRSRVRAPVAGPLGG--FDTCY-----NVTISVPTV 377
Query: 384 NLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVN 423
+ G + + V++ S G+ + D V+
Sbjct: 378 TFSFDGRVSVTLPEENVVIRSSSGGIACLAMAAGPPDGVD 417
>gi|224091849|ref|XP_002309371.1| predicted protein [Populus trichocarpa]
gi|222855347|gb|EEE92894.1| predicted protein [Populus trichocarpa]
Length = 438
Score = 71.6 bits (174), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 76/266 (28%), Positives = 122/266 (45%), Gaps = 39/266 (14%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L N S+GQP + + +DTGS L W+ C C SC S Q+I ++ P+ SST
Sbjct: 101 LFLVNFSMGQPPVPQLAIMDTGSSLLWIQCAPCKSC-------SQQIIG-PMFDPSISST 152
Query: 163 SSKVPCNSTLCELQK--QCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
+ C + +C +C S+ S C Y Y+ +G S G + + L + ++ +V
Sbjct: 153 YDSLSCKNIICRYAPSGECDSS-SQCVYNQTYV-EGLPSVGVIATEQLIFGSSDEGRNAV 210
Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGR 280
++ + FGC + G++ D G+FGLG TSV NQ + + FS C G+
Sbjct: 211 NN-VLFGCSH-RNGNYKDRRF-TGVFGLGSGITSV----VNQ--MGSKFSYCIGNIADPD 261
Query: 281 ISFGD----KGSPGQG-ETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA---------- 325
S+ +G +G TP + H Y + + +SVG + + SA
Sbjct: 262 YSYNQLVLSEGVNMEGYSTPLDVVDGH--YQVILEGISVGETRLVIDPSAFKRTEKQRRV 319
Query: 326 IFDSGTSFTYLNDPAYTQISETFNSL 351
I DSGT+ T+L + Y + +L
Sbjct: 320 IIDSGTAPTWLAENEYRALEREVRNL 345
>gi|302781668|ref|XP_002972608.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
gi|300160075|gb|EFJ26694.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
Length = 430
Score = 71.6 bits (174), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 83/304 (27%), Positives = 122/304 (40%), Gaps = 40/304 (13%)
Query: 101 LGFLHY-TNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNT 159
LG Y +++ G P ++ DTGSDL WL C + + +
Sbjct: 49 LGLGQYLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKA-CSRRPAFVASK 107
Query: 160 SSTSSKVPCNSTLCELQKQ--------CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLA 211
S+T S VPC++ C L P+A C Y Y +DG+ +TGFL D ++
Sbjct: 108 SATLSVVPCSAAQCLLVPAPRGHGPSCSPAAPVPCGYAYDY-ADGSSTTGFLARDTATIS 166
Query: 212 TDEKQSKSVDSRISFGCG-RVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFS 270
+V ++FGCG R Q GSF + G+ GLG + S P+ + L +FS
Sbjct: 167 NGTSGGAAVRG-VAFGCGTRNQGGSF---SGTGGVIGLGQGQLSFPA--QSGSLFAQTFS 220
Query: 271 MCFGSDGTGRI----SFGDKGSPGQ----GETPFSLRQTHPT-YNITITQVSVGGNAVNF 321
C GR SF G P + TP PT Y + + + VG +
Sbjct: 221 YCLLDLEGGRRGRSSSFLFLGRPERRAAFAYTPLVSNPLAPTFYYVGVVAIRVGNRVLPV 280
Query: 322 EFS-----------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDL--PFEYC 368
S + DSG++ TYL AY + F + R S++ E C
Sbjct: 281 PGSEWAIDVLGNGGTVIDSGSTLTYLRLGAYLHLVSAFAASVHLPRIPSSATFFQGLELC 340
Query: 369 YVLS 372
Y +S
Sbjct: 341 YNVS 344
>gi|242095602|ref|XP_002438291.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
gi|241916514|gb|EER89658.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
Length = 464
Score = 71.6 bits (174), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 94/315 (29%), Positives = 135/315 (42%), Gaps = 53/315 (16%)
Query: 100 SLGFLHYT-NVSVGQPALSFIVALDTGSDLFWL---PCDCVSCVHGLNSSSGQVIDFNIY 155
SLG Y V++G PA++ ++++DTGSD+ W+ PC SC + ++
Sbjct: 123 SLGTTEYVITVTIGTPAVTQVMSIDTGSDVSWVQCAPCAAQSCSSQKD---------KLF 173
Query: 156 SPNTSSTSSKVPCNSTLCELQKQCPSAG-----SNCPYQVRYLSDGTMSTGFLVEDVLHL 210
P S+T S C S C Q G S C Y V+Y DG+ + G D L L
Sbjct: 174 DPAMSATYSAFSCGSAQC---AQLGDEGNGCLKSQCQYIVKY-GDGSNTAGTYGSDTLSL 229
Query: 211 ATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFS 270
+ S +V S FGC G F+ +GL GLG D S+ S A +FS
Sbjct: 230 TS----SDAVKS-FQFGCSHRAAG-FV--GELDGLMGLGGDTESLVSQTA--ATYGKAFS 279
Query: 271 MCF---GSDGTGRISFGDKG---SPGQGETPFSLRQTHPT-YNITITQVSVGGNAVN--- 320
C S G G ++ G G S TP +R + PT Y + + ++V G +N
Sbjct: 280 YCLPPPSSSGGGFLTLGAAGGASSSRYSHTPM-VRFSVPTFYGVFLQGITVAGTMLNVPA 338
Query: 321 --FEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP---FEYCYVLSPNQ 375
F +++ DSGT T L AY + F K++ + S P + C+ S
Sbjct: 339 SVFSGASVVDSGTVITQLPPTAYQALRTAF----KKEMKAYPSAAPVGSLDTCFDFSGFN 394
Query: 376 TNFEYPVVNLTMKGG 390
T P V LT G
Sbjct: 395 T-ITVPTVTLTFSRG 408
>gi|148910443|gb|ABR18297.1| unknown [Picea sitchensis]
Length = 452
Score = 71.6 bits (174), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 88/378 (23%), Positives = 143/378 (37%), Gaps = 54/378 (14%)
Query: 64 AHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALD 123
A+R R+ + R N P+ +G + V G P S +D
Sbjct: 85 ANRLRFLKRTSRSSKQDANANVPVRSGSGE-----------YIIQVDFGTPKQSMYTLID 133
Query: 124 TGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC-ELQKQCPSA 182
TGSD+ W+PC H I+ P SS+ C+S C E+ C
Sbjct: 134 TGSDVAWIPCKQCQGCHSTAP---------IFDPAKSSSYKPFACDSQPCQEISGNC-GG 183
Query: 183 GSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAP 242
S C ++V Y DGT G L D + L + + SFGC S + +P
Sbjct: 184 NSKCQFEVSY-GDGTQVDGTLASDAITLGSQYLPN------FSFGCAE----SLSEDTSP 232
Query: 243 NGLFGLGMDKTSVPSILA-NQGLIPNSFSMCF--GSDGTGRISFGDKGSPGQGETPFSLR 299
+ + A L +FS C S +G + G + + F+
Sbjct: 233 SPGLMGLGGGSLSLLTQAPTAELFGGTFSYCLPSSSTSSGSLVLGKEAAVSSSSLKFTTL 292
Query: 300 QTHPT----YNITITQVSVGGNAVNFEFS-------AIFDSGTSFTYLNDPAYTQISETF 348
P+ Y +T+ +SVG ++ + I DSGT+ T+L AYT + + F
Sbjct: 293 IKDPSIPTFYFVTLKAISVGNTRISVPGTNIASGGGTIIDSGTTITHLVPSAYTALRDAF 352
Query: 349 NSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKG 408
+ T D+ + CY LS ++ + P + L + + ++++ E
Sbjct: 353 RQQLSSLQPTPVEDM--DTCYDLS--SSSVDVPTITLHLDRNVDLVLPKENILITQESG- 407
Query: 409 LYLYCLGVVKSDNVNIIG 426
L CL +D+ +IIG
Sbjct: 408 --LACLAFSSTDSRSIIG 423
>gi|362799904|dbj|BAL41445.1| aspartyl protease 1 [Linum grandiflorum]
Length = 449
Score = 71.6 bits (174), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 93/368 (25%), Positives = 145/368 (39%), Gaps = 72/368 (19%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWL---PCDCVSCVHGLNSSSGQVIDFNIYSPNTSS 161
+ N+S+G P + DTGSDL WL PCD G I+ P+ S+
Sbjct: 80 YMMNLSIGTPPFPILAIADTGSDLTWLQSKPCDQCYPQKG-----------PIFDPSNST 128
Query: 162 TSSKVPCNSTLC----ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
T K+PC + C E + C + + C Y Y D + +TG+L D + + Q
Sbjct: 129 TFHKLPCTTAPCNALDESARSC-TDPTTCGYTYSY-GDHSYTTGYLASDTVTVGNASVQI 186
Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---- 273
++V +FGCG G+F + + G+ GLG S S L + I FS C
Sbjct: 187 RNV----AFGCGTRNGGNFDEQGS--GIVGLGGGNLSFVSQLGDT--IGKKFSYCLLPLE 238
Query: 274 --------GSDGTGRISFGDK----GSPGQG----ETPFSLRQTHPTYNITITQVSVGGN 317
S T RI FGD S G TP ++ Y +TI ++VG
Sbjct: 239 NEISSQPSDSPATSRIVFGDNPVFSSSSTNGVVFATTPLVNKEPSTYYYLTIEAITVGRK 298
Query: 318 AVNF-------------------EFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRET 358
+ + E + I DSGT+ T+L + Y + K +R
Sbjct: 299 KLLYSSSSSKTASYDSGSKSSVEEGNIIIDSGTTLTFLEEEFYGALEAALVEEIKMERVN 358
Query: 359 STSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVK 418
+ F C+ + E P++ + +GG + V +E L C ++
Sbjct: 359 DVKNSMFSLCF--KSGKEEVELPLMKVHFRGGADVELKPVNTFVRAEEG---LVCFTMLP 413
Query: 419 SDNVNIIG 426
+++V I G
Sbjct: 414 TNDVGIYG 421
>gi|115448353|ref|NP_001047956.1| Os02g0720900 [Oryza sativa Japonica Group]
gi|45735844|dbj|BAD12879.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735970|dbj|BAD12999.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|113537487|dbj|BAF09870.1| Os02g0720900 [Oryza sativa Japonica Group]
gi|125540930|gb|EAY87325.1| hypothetical protein OsI_08729 [Oryza sativa Indica Group]
gi|215692622|dbj|BAG88042.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 458
Score = 71.6 bits (174), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 88/342 (25%), Positives = 141/342 (41%), Gaps = 40/342 (11%)
Query: 100 SLGFLHY-TNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPN 158
S+G +Y T + +G PA +++ +DTGS L WL C C+ + SG V ++P
Sbjct: 116 SVGVGNYVTRMGLGTPATQYVMVVDTGSSLTWL--QCSPCLVSCHRQSGPV-----FNPK 168
Query: 159 TSSTSSKVPCNSTLCE------LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLAT 212
+SST + V C++ C L S+ + C YQ Y D + S G+L +D + +
Sbjct: 169 SSSTYASVGCSAQQCSDLPSATLNPSACSSSNVCIYQASY-GDSSFSVGYLSKDTVSFGS 227
Query: 213 DEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC 272
+GCG+ G F A GL GL +K S+ LA + SF+ C
Sbjct: 228 TSLP------NFYYGCGQDNEGLFGRSA---GLIGLARNKLSLLYQLAPS--LGYSFTYC 276
Query: 273 FGSDGTGRISFGDKGSPGQ-GETPF-SLRQTHPTYNITITQVSVGGNAV------NFEFS 324
S + +PGQ TP S Y I ++ ++V GN +
Sbjct: 277 LPSSSSSGYLSLGSYNPGQYSYTPMVSSSLDDSLYFIKLSGMTVAGNPLSVSSSAYSSLP 336
Query: 325 AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVN 384
I DSGT T L Y+ +S+ + K S + + C+ + P V
Sbjct: 337 TIIDSGTVITRLPTSVYSALSKAVAAAMKGTSRASAYSI-LDTCF--KGQASRVSAPAVT 393
Query: 385 LTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
++ GG ++ ++V + CL + + IIG
Sbjct: 394 MSFAGGAALKLSAQNLLVDVDDS---TTCLAFAPARSAAIIG 432
>gi|302824729|ref|XP_002994005.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
gi|300138167|gb|EFJ04945.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
Length = 462
Score = 71.6 bits (174), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 74/263 (28%), Positives = 113/263 (42%), Gaps = 33/263 (12%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+YT++ +G P I+ +DTGS+L WL C C C +++ IY S++
Sbjct: 100 YYTSIKLGSPGQEAILIVDTGSELTWLQCLPCKVCAPSVDT---------IYDAARSASY 150
Query: 164 SKVPC-NSTLCELQKQCPSA----GSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
V C NS LC Q A GS C + Y DG+ S G L D L + T
Sbjct: 151 RPVTCNNSQLCSNSSQGTYAYCARGSQCQFAAFY-GDGSFSYGSLSTDTLIMETVVGGKP 209
Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS--- 275
+FGC + GA+ G+ GL K ++P L + FS CF
Sbjct: 210 VTVQDFAFGCAQGDLELVPTGAS--GILGLNAGKMALPMQLGQR--FGWKFSHCFPDRSS 265
Query: 276 --DGTGRISFGDKGSPGQGETPFSLRQTHPT-----YNITITQVSVGGNAVNF---EFSA 325
+ TG + FG+ P + S+ T+ Y++ + VS+ + + F
Sbjct: 266 HLNSTGVVFFGNAELPHEQVQYTSVALTNSELQRKFYHVALKGVSINSHELVFLPRGSVV 325
Query: 326 IFDSGTSFTYLNDPAYTQISETF 348
I DSG+SF+ P ++Q+ E F
Sbjct: 326 ILDSGSSFSSFVRPFHSQLREAF 348
>gi|297810815|ref|XP_002873291.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
lyrata]
gi|297319128|gb|EFH49550.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
lyrata]
Length = 439
Score = 71.6 bits (174), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 80/305 (26%), Positives = 125/305 (40%), Gaps = 46/305 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ V +G PA ++A+DT SD+ W+PC CV C +SP S++
Sbjct: 99 YIVKVLIGTPAQPLLLAMDTSSDVAWIPCSGCVGCPSN-----------TAFSPAKSTSF 147
Query: 164 SKVPCNSTLCELQKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
V C++ C+ Q P+ G+ C + + Y S + L +D + LA D ++
Sbjct: 148 KNVSCSAPQCK-QVPNPACGARACSFNLTYGSSSIAAN--LSQDTIRLAADPIKA----- 199
Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS----DGT 278
+FGC G G P LG+ + + + Q + ++FS C S +
Sbjct: 200 -FTFGCVNKVAG---GGTIPPPQGLLGLGRGPLSLMSQAQSVYKSTFSYCLPSFRSLTFS 255
Query: 279 GRISFGDKGSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEFSA----------- 325
G + G P + + LR + Y + + + VG V+ +A
Sbjct: 256 GSLRLGPTSQPQRVKYTQLLRNPRRSSLYYVNLVAIRVGRKVVDLPPAAIAFNPSTGAGT 315
Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY---VLSPNQTNFEYPV 382
IFDSGT +T L P Y + F K TS F+ CY V P T F +
Sbjct: 316 IFDSGTVYTRLAKPVYEAVRNEFRKRVKPPTAVVTSLGGFDTCYSGQVKVPTIT-FMFKG 374
Query: 383 VNLTM 387
VN+TM
Sbjct: 375 VNMTM 379
>gi|449449906|ref|XP_004142705.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
gi|449500739|ref|XP_004161182.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 410
Score = 71.6 bits (174), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 77/274 (28%), Positives = 114/274 (41%), Gaps = 22/274 (8%)
Query: 105 HYT-NVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
H+T +V++G P F + +DTGSDL W+ CD C C + +Y P+ +
Sbjct: 54 HFTVSVTIGNPPKVFELDIDTGSDLTWVQCDAPCTGCT---------LPHDRLYKPHNNV 104
Query: 162 TSSKVP-CNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
P C++ + C + C Y+V Y G+ S G LV+D + L +
Sbjct: 105 VRCGEPLCSALFSASKSPCKNPNDQCDYEVEYADHGS-SIGVLVKDPVPLRL--TNGTIL 161
Query: 221 DSRISFGCGRVQT--GSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT 278
+ FGCG Q GS L G+ GLG K ++ + L+ + N CF G
Sbjct: 162 APNLGFGCGYDQHNGGSQLPPLT-AGVLGLGNSKATMATQLSALSHVRNVLGHCFSGQGG 220
Query: 279 GRISFGDKGSPGQGETPFS-LRQTHPTYNITITQVSVGGNAVNFE-FSAIFDSGTSFTYL 336
G + FG P G + LR Y+ +V GGN V FDSG+S+TY
Sbjct: 221 GFLFFGGDLVPSSGMSWMPILRTPGGKYSAGPAEVYFGGNPVGIRGLILTFDSGSSYTYF 280
Query: 337 NDPAYTQISETF-NSLAKEKRETSTSDLPFEYCY 369
N Y + N L + + D C+
Sbjct: 281 NSQVYGAVLNLLRNGLKGQPLRDAPEDKTLPICW 314
>gi|302780575|ref|XP_002972062.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
gi|300160361|gb|EFJ26979.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
Length = 429
Score = 71.6 bits (174), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 82/301 (27%), Positives = 120/301 (39%), Gaps = 40/301 (13%)
Query: 101 LGFLHY-TNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNT 159
LG Y +++ G P ++ DTGSDL WL C + + +
Sbjct: 48 LGLGQYLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKA-CSRRPAFVASK 106
Query: 160 SSTSSKVPCNSTLCELQKQ--------CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLA 211
S+T S VPC++ C L P+A C Y Y +DG+ +TGFL D ++
Sbjct: 107 SATLSVVPCSAAQCLLVPAPRGHGPACSPAAPVPCGYAYDY-ADGSSTTGFLARDTATIS 165
Query: 212 TDEKQSKSVDSRISFGCG-RVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFS 270
+V ++FGCG R Q GSF + G+ GLG + S P+ + L +FS
Sbjct: 166 NGTSGGAAVRG-VAFGCGTRNQGGSF---SGTGGVIGLGQGQLSFPA--QSGSLFAQTFS 219
Query: 271 MCFGSDGTGRI----SFGDKGSPGQ----GETPFSLRQTHPT-YNITITQVSVGGNAVNF 321
C GR SF G P + TP PT Y + + + VG +
Sbjct: 220 YCLLDLEGGRRGRSSSFLFLGRPERRAAFAYTPLVSNPLAPTFYYVGVVAIRVGNRVLPV 279
Query: 322 EFS-----------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDL--PFEYC 368
S + DSG++ TYL AY + F + R S++ E C
Sbjct: 280 PGSEWAIDVLGNGGTVIDSGSTLTYLRLGAYLHLVSAFAASVHLPRIPSSATFFQGLELC 339
Query: 369 Y 369
Y
Sbjct: 340 Y 340
>gi|125586057|gb|EAZ26721.1| hypothetical protein OsJ_10629 [Oryza sativa Japonica Group]
Length = 474
Score = 71.6 bits (174), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 94/393 (23%), Positives = 156/393 (39%), Gaps = 61/393 (15%)
Query: 75 RGLAAQGNDKTPLTFSAGNDTYRLNSLGFL-------HYTNVSVGQPALSFIVALDTGSD 127
R +AA+ ++ S + R++ + + ++++G P + LDTGSD
Sbjct: 74 RRMAARSKARSARLLSGRAASARMDPGSYTDGVPDTEYLVHMAIGTPPQPVQLILDTGSD 133
Query: 128 LFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSN- 185
L W C CVSC ++P+ S T S +PC+ +C S G
Sbjct: 134 LTWTQCAPCVSCFRQ---------SLPRFNPSRSMTFSVLPCDLRICR-DLTWSSCGEQS 183
Query: 186 -----CPYQVRYLSDGTMSTGFLVEDVLHLAT-DEKQSKSVDSRISFGCGRVQTGSFLDG 239
C Y Y +D +++TG L D A+ D + ++FGCG G F+
Sbjct: 184 WGNGICVYAYAY-ADHSITTGHLDSDTFSFASADHAIGGASVPDLTFGCGLFNNGIFVSN 242
Query: 240 AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GSD------GTGRISFGDKGSP 289
G+ G S+P+ L ++FS CF GS+ G + D
Sbjct: 243 E--TGIAGFSRGALSMPAQLK-----VDNFSYCFTAITGSEPSPVFLGVPPNLYSDAAGG 295
Query: 290 GQGETP-FSLRQTHPT----YNITITQVSVGGNAVNFEFSA-----------IFDSGTSF 333
G G +L + H + Y I++ V+VG + S I DSGT
Sbjct: 296 GHGVVQSTALIRYHSSQLKAYYISLKGVTVGTTRLPIPESVFALKEDGTGGTIVDSGTGM 355
Query: 334 TYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPF 393
T L + Y + + F + K STS L + C+ + P + P + L +G
Sbjct: 356 TMLPEAVYNLVCDAFVAQTKLTVHNSTSSLS-QLCFSVPPGAKP-DVPALVLHFEGATLD 413
Query: 394 FVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
+ + E G+ L CL + +++++IG
Sbjct: 414 LPRENYMFEIEEAGGIRLTCLAINAGEDLSVIG 446
>gi|224111722|ref|XP_002315953.1| predicted protein [Populus trichocarpa]
gi|222864993|gb|EEF02124.1| predicted protein [Populus trichocarpa]
Length = 484
Score = 71.6 bits (174), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 92/339 (27%), Positives = 142/339 (41%), Gaps = 45/339 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+++ V +G+P + LDTGSD+ W+ C C C + I+ P +S++
Sbjct: 149 YFSRVGIGKPPSQAYLILDTGSDVNWVQCAPCADCYQQADP---------IFEPASSASF 199
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
S + CN+ C C Y+V Y DG+ + G V + + L S VD+
Sbjct: 200 STLSCNTRQCRSLDVSECRNDTCLYEVSY-GDGSYTVGDFVTETITLG-----SAPVDN- 252
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGTGR 280
++ GCG G F+ A L GLG S PS + SFS C S+
Sbjct: 253 VAIGCGHNNEGLFVGAAG---LLGLGGGSLSFPSQIN-----ATSFSYCLVDRDSESAST 304
Query: 281 ISFGDKGSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEFSA-----------IF 327
+ F P P LR H Y + +T +SVGG V+ SA I
Sbjct: 305 LEFNSTLPPNAVSAPL-LRNHHLDTFYYVGLTGLSVGGELVSIPESAFQIDESGNGGVIV 363
Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTM 387
DSGT+ T L Y + + F ++ T+ L F+ CY LS ++ N E P V+
Sbjct: 364 DSGTAITRLQTDVYNSLRDAFVKRTRDLPSTNGIAL-FDTCYDLS-SKGNVEVPTVSFHF 421
Query: 388 KGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
G + +V + +G + + S +++IIG
Sbjct: 422 PDGKELPLPAKNYLVPLDSEGTFCFAFAPTAS-SLSIIG 459
>gi|297603570|ref|NP_001054261.2| Os04g0677100 [Oryza sativa Japonica Group]
gi|255675885|dbj|BAF16175.2| Os04g0677100 [Oryza sativa Japonica Group]
Length = 464
Score = 71.2 bits (173), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 75/288 (26%), Positives = 114/288 (39%), Gaps = 49/288 (17%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ V VG P + +D+GSD+ W+ C C C + ++ P SS+
Sbjct: 130 YFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDP---------LFDPAASSSF 180
Query: 164 SKVPCNSTLCE----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
S V C S +C C Y V Y DG+ + G L + L L Q
Sbjct: 181 SGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTY-GDGSYTKGELALETLTLGGTAVQG-- 237
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSD 276
++ GCG +G F+ A GL GLG S+ L G FS C G+
Sbjct: 238 ----VAIGCGHRNSGLFVGAA---GLLGLGWGAMSLVGQLG--GAAGGVFSYCLASRGAG 288
Query: 277 GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS-----------A 325
G G + G + +G R+ Y + +T + VGG + + S
Sbjct: 289 GAGSLVLGRTEAVPRG------RRASSFYYVGLTGIGVGGERLPLQDSLFQLTEDGAGGV 342
Query: 326 IFDSGTSFTYLNDPAYTQISETFN-SLAKEKRETSTSDLPFEYCYVLS 372
+ D+GT+ T L AY + F+ ++ R + S L + CY LS
Sbjct: 343 VMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLL--DTCYDLS 388
>gi|357127505|ref|XP_003565420.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
distachyon]
Length = 466
Score = 71.2 bits (173), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 79/304 (25%), Positives = 116/304 (38%), Gaps = 49/304 (16%)
Query: 103 FLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGL--------------NSSSGQ 148
F + V+VG P + F+ DTGSDL WL C+ +G+
Sbjct: 80 FEYLAAVNVGTPPVRFLAVADTGSDLVWLKCNTTQNNNGIVSSDSGNNSNSSPPPPPPEA 139
Query: 149 VIDFNIYSPNTSSTSSKVPCNSTLC---ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVE 205
V+ FN P SS+ S+V C+ C C C ++ Y DG +TG L
Sbjct: 140 VVYFN---PFDSSSYSRVGCDGPSCLALATNASCNGDSHACDFRYSY-RDGASATGLLAA 195
Query: 206 DVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLI 265
D + + + I FGC G +G+ GLG S+ S L +
Sbjct: 196 DTFTFGGNINNDTTSTASIDFGCATGTAGREFQA---DGMVGLGAGPLSLASQLGRK--- 249
Query: 266 PNSFSMCFGS----DGTGRISFGDKG---SPGQGETPFSLRQTHPT--YNITITQVSVGG 316
FS C + D + ++FG + PG TP ++ Y I+I + V G
Sbjct: 250 ---FSFCLTAYDIDDASSILNFGARAVVSDPGAATTPLIASSSNAAAYYAISIDSLKVAG 306
Query: 317 NAVNFEFS---AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP-----FEYC 368
V S I D+GT T+L+ A ++ SLA+ P E C
Sbjct: 307 QPVPGTTSVSKVIVDTGTVLTFLDRAAL--LAPLTESLARVMDGAGLPRAPPPDETLELC 364
Query: 369 YVLS 372
Y +S
Sbjct: 365 YDVS 368
>gi|224286159|gb|ACN40790.1| unknown [Picea sitchensis]
Length = 452
Score = 71.2 bits (173), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 88/378 (23%), Positives = 142/378 (37%), Gaps = 54/378 (14%)
Query: 64 AHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALD 123
A+R R+ + R N P+ +G + V G P S +D
Sbjct: 85 ANRLRFLKRTSRSSKEDANANVPVRSGSGE-----------YIIQVDFGTPKQSMYTLID 133
Query: 124 TGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC-ELQKQCPSA 182
TGSD+ W+PC H I+ P SS+ C+S C E+ C
Sbjct: 134 TGSDVAWIPCKQCQGCHSTAP---------IFDPAKSSSYKPFACDSQPCQEISGNC-GG 183
Query: 183 GSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGR-VQTGSFLDGAA 241
S C ++V Y DGT G L D + L + + SFGC + ++
Sbjct: 184 NSKCQFEVLY-GDGTQVDGTLASDAITLGSQYLPN------FSFGCAESLSEDTYSSPGL 236
Query: 242 PNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--GSDGTGRISFGDKGSPGQGETPFSLR 299
G T P+ L +FS C S +G + G + + F+
Sbjct: 237 MGLGGGSLSLLTQAPT----AELFGGTFSYCLPSSSTSSGSLVLGKEAAVSSSSLKFTTL 292
Query: 300 QTHPT----YNITITQVSVGGNAVNFEFS-------AIFDSGTSFTYLNDPAYTQISETF 348
P+ Y +T+ +SVG ++ + I DSGT+ TYL AY + + F
Sbjct: 293 IKDPSFPTFYFVTLKAISVGNTRISVPATNIASGGGTIIDSGTTITYLVPSAYKDLRDAF 352
Query: 349 NSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKG 408
+ T D+ + CY LS ++ + P + L + + ++++ E
Sbjct: 353 RQQLSSLQPTPVEDM--DTCYDLS--SSSVDVPTITLHLDRNVDLVLPKENILITQESG- 407
Query: 409 LYLYCLGVVKSDNVNIIG 426
L CL +D+ +IIG
Sbjct: 408 --LSCLAFSSTDSRSIIG 423
>gi|125543634|gb|EAY89773.1| hypothetical protein OsI_11315 [Oryza sativa Indica Group]
Length = 474
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 89/356 (25%), Positives = 143/356 (40%), Gaps = 54/356 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ ++++G P + LDTGSDL W C CVSC ++P+ S T
Sbjct: 111 YLVHMAIGTPPQPVQLILDTGSDLTWTQCAPCVSCFRQ---------SLPRFNPSRSMTF 161
Query: 164 SKVPCNSTLCELQKQCPSAGSN------CPYQVRYLSDGTMSTGFLVEDVLHLAT-DEKQ 216
S +PC+ +C S G C Y Y +D +++TG L D A+ D
Sbjct: 162 SVLPCDLRICR-DLTWSSCGEQSWGNGICVYAYAY-ADHSITTGHLDSDTFSFASADHAI 219
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--- 273
+ ++FGCG G F+ G+ G S+P+ L ++FS CF
Sbjct: 220 GGASVPDLTFGCGLFNNGIFVSNE--TGIAGFSRGALSMPAQLK-----VDNFSYCFTAI 272
Query: 274 -GSD------GTGRISFGDKGSPGQGETP-FSLRQTHPT----YNITITQVSVGGNAVNF 321
GS+ G + D G G +L + H + Y I++ V+VG +
Sbjct: 273 TGSEPSPVFLGVPPNLYSDAAGGGHGVVQSTALIRYHSSQLKAYYISLKGVTVGTTRLPI 332
Query: 322 EFSA-----------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYV 370
S I DSGT T L + Y + + F + K STS L + C+
Sbjct: 333 PESVFALKEDGTGGTIVDSGTGMTMLPEAVYNLVCDAFVAQTKLTVHNSTSSLS-QLCFS 391
Query: 371 LSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
+ P + P + L +G + + E G+ L CL + +++++IG
Sbjct: 392 VPPGAKP-DVPALVLHFEGATLDLPRENYMFEIEEAGGIRLTCLAINAGEDLSVIG 446
>gi|125571841|gb|EAZ13356.1| hypothetical protein OsJ_03278 [Oryza sativa Japonica Group]
Length = 447
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 58/149 (38%), Positives = 73/149 (48%), Gaps = 24/149 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ V VG P+ ++ +DTGSDL WL C C C + GQV D P SST
Sbjct: 86 YFALVGVGTPSTKAMLVIDTGSDLVWLQCSPCRRCY----AQRGQVFD-----PRRSSTY 136
Query: 164 SKVPCNSTLCELQK--QCPS---AGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
+VPC+S C + C S AG C Y V Y DG+ STG L D L A D
Sbjct: 137 RRVPCSSPQCRALRFPGCDSGGAAGGGCRYMVAY-GDGSSSTGDLATDKLAFAND----- 190
Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFG 247
+ + ++ GCGR G F D AA GL G
Sbjct: 191 TYVNNVTLGCGRDNEGLF-DSAA--GLLG 216
>gi|312282359|dbj|BAJ34045.1| unnamed protein product [Thellungiella halophila]
Length = 484
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 91/315 (28%), Positives = 126/315 (40%), Gaps = 54/315 (17%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ + VG PA + + LDTGSD+ WL C C C + + +++P S T
Sbjct: 136 YFMRLGVGTPATNMYMVLDTGSDVVWLQCSPCKVCYNQSDP---------VFNPAKSKTF 186
Query: 164 SKVPCNSTLCEL---QKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
+ VPC S LC +C S S C YQV Y DG+ + G + L
Sbjct: 187 ATVPCGSRLCRRLDDSSECVSRRSKACLYQVSY-GDGSFTVGDFSTETLTF-----HGAR 240
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF------ 273
VD ++ GCG G F+ A GL S PS N+ FS C
Sbjct: 241 VD-HVALGCGHDNEGLFVGAAGLLGLG---RGGLSFPSQTKNR--YNGKFSYCLVDRTSS 294
Query: 274 --GSDGTGRISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGGNAV------NF 321
S I FG+ P F+ T+P Y + + +SVGG+ V F
Sbjct: 295 GSSSKPPSTIVFGNGAVPKTAV--FTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSESQF 352
Query: 322 EFSA------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQ 375
+ A I DSGTS T L AY + + F A + + L F+ C+ LS
Sbjct: 353 KLDATGNGGVIIDSGTSVTRLTQSAYVALRDAFRLGATRLKRAPSYSL-FDTCFDLS-GM 410
Query: 376 TNFEYPVVNLTMKGG 390
T + P V GG
Sbjct: 411 TTVKVPTVVFHFTGG 425
>gi|115452683|ref|NP_001049942.1| Os03g0317300 [Oryza sativa Japonica Group]
gi|108707833|gb|ABF95628.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548413|dbj|BAF11856.1| Os03g0317300 [Oryza sativa Japonica Group]
Length = 448
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 94/393 (23%), Positives = 156/393 (39%), Gaps = 61/393 (15%)
Query: 75 RGLAAQGNDKTPLTFSAGNDTYRLNSLGFL-------HYTNVSVGQPALSFIVALDTGSD 127
R +AA+ ++ S + R++ + + ++++G P + LDTGSD
Sbjct: 48 RRMAARSKARSARLLSGRAASARMDPGSYTDGVPDTEYLVHMAIGTPPQPVQLILDTGSD 107
Query: 128 LFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSN- 185
L W C CVSC ++P+ S T S +PC+ +C S G
Sbjct: 108 LTWTQCAPCVSCFRQ---------SLPRFNPSRSMTFSVLPCDLRICR-DLTWSSCGEQS 157
Query: 186 -----CPYQVRYLSDGTMSTGFLVEDVLHLAT-DEKQSKSVDSRISFGCGRVQTGSFLDG 239
C Y Y +D +++TG L D A+ D + ++FGCG G F+
Sbjct: 158 WGNGICVYAYAY-ADHSITTGHLDSDTFSFASADHAIGGASVPDLTFGCGLFNNGIFVSN 216
Query: 240 AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GSD------GTGRISFGDKGSP 289
G+ G S+P+ L ++FS CF GS+ G + D
Sbjct: 217 E--TGIAGFSRGALSMPAQLKV-----DNFSYCFTAITGSEPSPVFLGVPPNLYSDAAGG 269
Query: 290 GQGETP-FSLRQTHPT----YNITITQVSVGGNAVNFEFSA-----------IFDSGTSF 333
G G +L + H + Y I++ V+VG + S I DSGT
Sbjct: 270 GHGVVQSTALIRYHSSQLKAYYISLKGVTVGTTRLPIPESVFALKEDGTGGTIVDSGTGM 329
Query: 334 TYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPF 393
T L + Y + + F + K STS L + C+ + P + P + L +G
Sbjct: 330 TMLPEAVYNLVCDAFVAQTKLTVHNSTSSLS-QLCFSVPPGAKP-DVPALVLHFEGATLD 387
Query: 394 FVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
+ + E G+ L CL + +++++IG
Sbjct: 388 LPRENYMFEIEEAGGIRLTCLAINAGEDLSVIG 420
>gi|326520291|dbj|BAK07404.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 464
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 82/324 (25%), Positives = 126/324 (38%), Gaps = 59/324 (18%)
Query: 105 HYTNVSVGQPALSFIVA-LDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ ++ +G P +V LDTGSDL W C C C ++ + S T
Sbjct: 94 YLIHLGIGTPRPQRVVLHLDTGSDLVWTQCACTVCFDQ---------PVPVFRASVSHTF 144
Query: 164 SKVPCNSTLCELQKQCPSAG-----SNCPYQVRYLSDGTMSTGFLVEDVLHL-ATDEKQS 217
S+VPC+ LC P +G +C Y Y+ D +++TG + ED A D +
Sbjct: 145 SRVPCSDPLCGHAVYLPLSGCAARDRSCFYAYGYM-DHSITTGKMAEDTFTFKAPDRADT 203
Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPN--GLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
+ I FGCG + G F PN G+ G G S+PS L + FS CF +
Sbjct: 204 AAAVPNIRFGCGMMNYGLF----TPNQSGIAGFGTGPLSLPSQLKVR-----RFSYCFTA 254
Query: 276 DGTGRIS---FGDKGSPGQGE---------TPFSLRQ------THPTYNITITQVSVGGN 317
R+S G G P E TPF+ + P Y +++ V+VG
Sbjct: 255 MEESRVSPVILG--GEPENIEAHATGPIQSTPFAPGPAGAPVGSQPFYFLSLRGVTVGET 312
Query: 318 AVNFEFS-----------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFE 366
+ F S DSGT+ T+ + + E F + +D
Sbjct: 313 RLPFNASTFALKGDGSGGTFIDSGTAITFFPQAVFRSLREAFVAQVPLPVAKGYTDPDNL 372
Query: 367 YCYVLSPNQTNFEYPVVNLTMKGG 390
C+ + + P + L ++G
Sbjct: 373 LCFSVPAKKKAPAVPKLILHLEGA 396
>gi|449441139|ref|XP_004138341.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449477464|ref|XP_004155031.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 336
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 85/296 (28%), Positives = 122/296 (41%), Gaps = 36/296 (12%)
Query: 109 VSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPC 168
+ VGQP LDTGSD+ WL C+ C G N Q+ I+ P SS+ + V C
Sbjct: 1 MRVGQPQQPSFFVLDTGSDVTWL--QCLPCA-GKNGCYEQITP--IFDPELSSSYNPVSC 55
Query: 169 NSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGC 228
+S C+L + ++C Y+V Y DG+ + G L + L S S+ IS GC
Sbjct: 56 DSEQCQLLDEAGCNVNSCIYKVEY-GDGSFTIGELATETLTFV----HSNSI-PNISIGC 109
Query: 229 GRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGTGRISFGD 285
G G F+ GL G + +S L +SFS C S + F
Sbjct: 110 GHDNEGLFVGADGLIGLGGGAISISS--------QLKASSFSYCLVDIDSPSFSTLDFNT 161
Query: 286 KGSPGQGETPFSLRQTHPTYN-ITITQVSVGGNAV-----NFEFSA------IFDSGTSF 333
+P P++ + + +SVGG + FE I DSGT+
Sbjct: 162 DPPSDSLISPLVKNDRFPSFRYVKVIGMSVGGKPLPISSSRFEIDESGLGGIIVDSGTTI 221
Query: 334 TYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKG 389
T L Y + E F L + PF+ CY LS +Q+N E P + + G
Sbjct: 222 TQLPSDVYEVLREAFLGLTT-NLPPAPEISPFDTCYDLS-SQSNVEVPTIAFILPG 275
>gi|125561849|gb|EAZ07297.1| hypothetical protein OsI_29545 [Oryza sativa Indica Group]
Length = 451
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 91/360 (25%), Positives = 145/360 (40%), Gaps = 65/360 (18%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFN---IYSPNTSS 161
H V +G P + +DTGSDL W C L+SS+ +Y P SS
Sbjct: 91 HSLTVGIGTPPQPRKLIVDTGSDLIWTQCK-------LSSSTAVAARHGSPPVYDPGESS 143
Query: 162 TSSKVPCNSTLCELQ----KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
T + +PC+ LC+ K C S + C Y+ Y S + G L +
Sbjct: 144 TFAFLPCSDRLCQEGQFSFKNCTSK-NRCVYEDVYGS--AAAVGVLASETFTFGA----R 196
Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FG 274
++V R+ FGCG + GS + G+ GL + S+ + L Q FS C F
Sbjct: 197 RAVSLRLGFGCGALSAGSLIGA---TGILGLSPESLSLITQLKIQ-----RFSYCLTPFA 248
Query: 275 SDGTGRISFGDKGSPGQGETPFSLRQT----HPT----YNITITQVSVGGNAVNFEFSA- 325
T + FG + +T ++ T +P Y + + +S+G + ++
Sbjct: 249 DKKTSPLLFGAMADLSRHKTTRPIQTTAIVSNPVKTVYYYVPLVGISLGHKRLAVPAASL 308
Query: 326 ----------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQ 375
I DSG++ YL + A+ + E + + T + +E C+VL P +
Sbjct: 309 AMRPDGGGGTIVDSGSTVAYLVEAAFEAVKEAVMDVVRLPVANRTVE-DYELCFVL-PRR 366
Query: 376 T------NFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDN---VNIIG 426
T + P + L GG + P EP+ L CL V K+ + V+IIG
Sbjct: 367 TAAAAMEAVQVPPLVLHFDGGAAMVL--PRDNYFQEPRA-GLMCLAVGKTTDGSGVSIIG 423
>gi|356555042|ref|XP_003545848.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 431
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 79/300 (26%), Positives = 122/300 (40%), Gaps = 40/300 (13%)
Query: 110 SVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPC 168
S+G P +DT SD+ W+ C C +C + + ++ P+ S T +PC
Sbjct: 93 SLGTPPFPVYGIVDTASDIIWVQCQLCETCYNDTSP---------MFDPSYSKTYKNLPC 143
Query: 169 NSTLCELQKQCPSAGSN----CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRI 224
+ST C+ Q S S+ C + V Y DG+ S G L+ + + L + R
Sbjct: 144 SSTTCK-SVQGTSCSSDERKICEHTVNY-KDGSHSQGDLIVETVTLGSYNDPFVHF-PRT 200
Query: 225 SFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRIS 282
GC R SF G+ GLG S+ L++ I FS C SD + ++
Sbjct: 201 VIGCIRNTNVSF----DSIGIVGLGGGPVSLVPQLSSS--ISKKFSYCLAPISDRSSKLK 254
Query: 283 FGDKG---SPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEF---------SAIFDSG 330
FGD G T + Y +T+ SVG N + F + I DSG
Sbjct: 255 FGDAAMVSGDGTVSTRIVFKDWKKFYYLTLEAFSVGNNRIEFRSSSSRSSGKGNIIIDSG 314
Query: 331 TSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGG 390
T+FT L D Y+++ + K +R F CY + ++ + PV+ G
Sbjct: 315 TTFTVLPDDVYSKLESAVADVVKLERAEDPLK-QFSLCYKSTYDKV--DVPVITAHFSGA 371
>gi|212275143|ref|NP_001130306.1| uncharacterized protein LOC100191400 precursor [Zea mays]
gi|194688798|gb|ACF78483.1| unknown [Zea mays]
gi|194703430|gb|ACF85799.1| unknown [Zea mays]
gi|194707192|gb|ACF87680.1| unknown [Zea mays]
gi|223944599|gb|ACN26383.1| unknown [Zea mays]
gi|223948667|gb|ACN28417.1| unknown [Zea mays]
gi|414887962|tpg|DAA63976.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 450
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 98/380 (25%), Positives = 156/380 (41%), Gaps = 55/380 (14%)
Query: 64 AHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALD 123
A RD L LA +G + ++G + + + S+G P ++A+D
Sbjct: 75 ASRDASRLLYLDSLAVRGRARAYAPIASGRQLLQTPT----YVVRASLGTPPQQLLLAVD 130
Query: 124 TGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCEL--QKQCP 180
T +D W+PC C C +SS D P +S++ VPC S LC CP
Sbjct: 131 TSNDASWIPCAGCAGC----PTSSAAPFD-----PASSASYRTVPCGSPLCAQAPNAACP 181
Query: 181 SAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGA 240
G C + + Y +D ++ L +D L +A + ++ +FGC + TG+ A
Sbjct: 182 PGGKACGFSLTY-ADSSLQAA-LSQDSLAVAGNAVKA------YTFGCLQRATGT---AA 230
Query: 241 APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS----DGTGRISFGDKGSPGQGETPF 296
P GL GLG S + + + +FS C S + +G + G G P + +T
Sbjct: 231 PPQGLLGLGRGPLSF--LSQTKDMYEATFSYCLPSFKSLNFSGTLRLGRNGQPQRIKTTP 288
Query: 297 SLRQTHPT--YNITITQVSVGGNAVNFE-------FSAIFDSGTSFTYLNDPAYTQISET 347
L H + Y + +T + VG V + DSGT FT L PAY + +
Sbjct: 289 LLANPHRSSLYYVNMTGIRVGRKVVPIPAFDPATGAGTVLDSGTMFTRLVAPAYVAVRDE 348
Query: 348 FNSLAKEKRETSTSDL-PFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEP 406
+ + S L F+ C+ N T +P V L G + +VI S+
Sbjct: 349 V----RRRVGAPVSSLGGFDTCF----NTTAVAWPPVTLLFDGMQVTLPEENVVIHSTYG 400
Query: 407 KGLYLYCLGVVKS-DNVNII 425
+ CL + + D VN +
Sbjct: 401 T---ISCLAMAAAPDGVNTV 417
>gi|302794412|ref|XP_002978970.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
gi|300153288|gb|EFJ19927.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
Length = 462
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 73/263 (27%), Positives = 109/263 (41%), Gaps = 33/263 (12%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+YT++ +G P I+ +DTGS+L WL C C C +++ IY S +
Sbjct: 100 YYTSIKLGSPGQEAILIVDTGSELTWLKCLPCKVCAPSVDT---------IYDAARSVSY 150
Query: 164 SKVPC-NSTLCELQKQCPSA----GSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
V C NS LC Q A GS C + Y DG+ S G L D L + T
Sbjct: 151 KPVTCNNSQLCSNSSQGTYAYCARGSQCQFAAFY-GDGSFSYGSLSTDTLIMETVVGGKP 209
Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS--- 275
+FGC + GA+ G+ GL K ++P L + FS CF
Sbjct: 210 VTVQDFAFGCAQGDLELVPTGAS--GILGLNAGKMALPMQLGQR--FGWKFSHCFPDRSS 265
Query: 276 --DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE--------FSA 325
+ TG + FG+ P + S+ T+ V++ G ++N
Sbjct: 266 HLNSTGVVFFGNAELPHEQVQYTSVALTNSELQRKFYHVALKGVSINSHELVLLPRGSVV 325
Query: 326 IFDSGTSFTYLNDPAYTQISETF 348
I DSG+SF+ P ++Q+ E F
Sbjct: 326 ILDSGSSFSSFVRPFHSQLREAF 348
>gi|9759559|dbj|BAB11161.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|21553652|gb|AAM62745.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|109134179|gb|ABG25087.1| At5g07030 [Arabidopsis thaliana]
Length = 439
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 79/299 (26%), Positives = 123/299 (41%), Gaps = 46/299 (15%)
Query: 111 VGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCN 169
+G PA ++A+DT SD+ W+PC CV C +SP S++ V C+
Sbjct: 105 IGTPAQPLLLAMDTSSDVAWIPCSGCVGCPSN-----------TAFSPAKSTSFKNVSCS 153
Query: 170 STLCELQKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGC 228
+ C+ Q P+ G+ C + + Y S + L +D + LA D ++ +FGC
Sbjct: 154 APQCK-QVPNPTCGARACSFNLTYGSSSIAAN--LSQDTIRLAADPIKA------FTFGC 204
Query: 229 GRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS----DGTGRISFG 284
G G P LG+ + + + Q + ++FS C S +G + G
Sbjct: 205 VNKVAG---GGTIPPPQGLLGLGRGPLSLMSQAQSIYKSTFSYCLPSFRSLTFSGSLRLG 261
Query: 285 DKGSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEFSA-----------IFDSGT 331
P + + LR + Y + + + VG V+ +A IFDSGT
Sbjct: 262 PTSQPQRVKYTQLLRNPRRSSLYYVNLVAIRVGRKVVDLPPAAIAFNPSTGAGTIFDSGT 321
Query: 332 SFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY---VLSPNQTNFEYPVVNLTM 387
+T L P Y + F K TS F+ CY V P T F + VN+TM
Sbjct: 322 VYTRLAKPVYEAVRNEFRKRVKPTTAVVTSLGGFDTCYSGQVKVPTIT-FMFKGVNMTM 379
>gi|168043550|ref|XP_001774247.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674374|gb|EDQ60883.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 419
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 102/378 (26%), Positives = 147/378 (38%), Gaps = 57/378 (15%)
Query: 67 DRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGS 126
+R +L LA TP+ ++GN Y ++ +S G P V +DTGS
Sbjct: 53 ERRAQLSKHILAEGRLFSTPV--ASGNGEYLID---------ISFGSPPQKASVIVDTGS 101
Query: 127 DLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC-ELQKQCPSAGS 184
DL W C C +C N+++ + D P SST V C S C L Q S +
Sbjct: 102 DLIWTQCLPCETC----NAAASVIFD-----PVKSSTYDTVSCASNFCSSLPFQ--SCTT 150
Query: 185 NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNG 244
+C Y Y DG+ ++G L ++FGCG GSF A G
Sbjct: 151 SCKYDYMY-GDGSSTSGALS------TETVTVGTGTIPNVAFGCGHTNLGSFAGAA---G 200
Query: 245 LFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDGTGRISFGDKGSP-GQGETPFSLRQ 300
+ GLG S+ I + FS C GS T + GD + G T
Sbjct: 201 IVGLGQGPLSL--ISQASSITSKKFSYCLVPLGSTKTSPMLIGDSAAAGGVAYTALLTNT 258
Query: 301 THPT-YNITITQVSVGGNAVNFEFSA-----------IFDSGTSFTYLNDPAYTQISETF 348
+PT Y +T +SV G AV + I DSGT+ TYL A+ +
Sbjct: 259 ANPTFYYADLTGISVSGKAVTYPVGTFSIDASGQGGFILDSGTTLTYLETGAFNALVAAL 318
Query: 349 NSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKG 408
+ E S +YC+ + N YP + KG + + V V+ + G
Sbjct: 319 KAEVPFP-EADGSLYGLDYCFS-TAGVANPTYPTMTFHFKGAD-YELPPENVFVALDTGG 375
Query: 409 LYLYCLGVVKSDNVNIIG 426
CL + S +I+G
Sbjct: 376 --SICLAMAASTGFSIMG 391
>gi|115472515|ref|NP_001059856.1| Os07g0532800 [Oryza sativa Japonica Group]
gi|50508274|dbj|BAD32123.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113611392|dbj|BAF21770.1| Os07g0532800 [Oryza sativa Japonica Group]
Length = 436
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 83/272 (30%), Positives = 114/272 (41%), Gaps = 61/272 (22%)
Query: 108 NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
N+SVG P L+F V DTGSDL W C C C + P +SST SK+
Sbjct: 89 NISVGTPLLTFSVVADTGSDLIWTQCAPCTKCFQQPAPP---------FQPASSSTFSKL 139
Query: 167 PCNSTLCELQ----KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
PC S+ C+ + C + G C Y +Y S T G+L + L + S
Sbjct: 140 PCTSSFCQFLPNSIRTCNATG--CVYNYKYGSGYT--AGYLATETLKVGDASFPS----- 190
Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPN----SFSMCFGSD-- 276
++FGC + G G + +G+ GLG S LIP FS C S
Sbjct: 191 -VAFGC-STENGV---GNSTSGIAGLGRGALS---------LIPQLGVGRFSYCLRSGSA 236
Query: 277 -GTGRISFGDKGSPGQG---ETPF-SLRQTHPT-YNITITQVSVGGNAV-----NFEFS- 324
G I FG + G TPF + HP+ Y + +T ++VG + F F+
Sbjct: 237 AGASPILFGSLANLTDGNVQSTPFVNNPAVHPSYYYVNLTGITVGETDLPVTTSTFGFTQ 296
Query: 325 ------AIFDSGTSFTYLNDPAYTQISETFNS 350
I DSGT+ TYL Y + + F S
Sbjct: 297 NGLGGGTIVDSGTTLTYLAKDGYEMVKQAFLS 328
>gi|195626958|gb|ACG35309.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 450
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 100/380 (26%), Positives = 157/380 (41%), Gaps = 55/380 (14%)
Query: 64 AHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALD 123
A RD L LA +G + ++G L +L ++ S+G P ++A+D
Sbjct: 75 ASRDASRLLYLDSLAVRGRARAYAPIASGRQL--LQTLTYV--VRASLGTPPQQLLLAVD 130
Query: 124 TGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCEL--QKQCP 180
T +D W+PC C C +SS D P S++ VPC S LC CP
Sbjct: 131 TSNDASWIPCAGCAGC----PTSSAAPFD-----PAASASYRTVPCGSPLCAQAPNAACP 181
Query: 181 SAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGA 240
G C + + Y +D ++ L +D L +A + ++ +FGC + TG+ A
Sbjct: 182 PGGKACGFSLTY-ADSSLQAA-LSQDSLAVAGNAVKA------YTFGCLQRATGT---AA 230
Query: 241 APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS----DGTGRISFGDKGSPGQGETPF 296
P GL GLG S + + + +FS C S + +G + G G P + +T
Sbjct: 231 PPQGLLGLGRGPLSF--LSQTKDMYEATFSYCLPSFKSLNFSGTLRLGRNGQPQRIKTTP 288
Query: 297 SLRQTHPT--YNITITQVSVGGNAVNFE-------FSAIFDSGTSFTYLNDPAYTQISET 347
L H + Y + +T V VG V + DSGT FT L PAY + +
Sbjct: 289 LLANPHRSSLYYVNMTGVRVGRKVVPIPAFDPATGAGTVLDSGTMFTRLVAPAYVAVRDE 348
Query: 348 FNSLAKEKRETSTSDL-PFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEP 406
+ + S L F+ C+ N T +P + L G + +VI S+
Sbjct: 349 V----RRRVGAPVSSLGGFDTCF----NTTAVAWPPMTLLFDGMQVTLPEENVVIHSTYG 400
Query: 407 KGLYLYCLGVVKS-DNVNII 425
+ CL + + D VN +
Sbjct: 401 T---ISCLAMAAAPDGVNTV 417
>gi|357158691|ref|XP_003578210.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 459
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 77/277 (27%), Positives = 111/277 (40%), Gaps = 47/277 (16%)
Query: 102 GFLHY-TNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNT 159
G L Y +++VG P LDTGSDL W C C SC+ + I+SP
Sbjct: 100 GDLEYLVDLAVGTPPQPVSALLDTGSDLIWTQCAPCASCLPQPDP---------IFSPGA 150
Query: 160 SSTSSKVPCNSTLCE--LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVED--VLHLATDEK 215
SS+ + C LC L C C Y+ Y DGT + G + ++
Sbjct: 151 SSSYEPMRCAGELCNDILHHSCQRP-DTCTYRYSY-GDGTTTRGVYATERFTFSSSSSGG 208
Query: 216 QSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
++ + + + FGCG + GS +G +G+ G G S+ S LA + FS C
Sbjct: 209 ETTKLSAPLGFGCGTMNKGSLNNG---SGIVGFGRAPLSLVSQLAIR-----RFSYCLTP 260
Query: 276 DGTGRIS---FG-------DKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFS 324
+GR S FG D + T + +PT Y + T V+VG + S
Sbjct: 261 YASGRKSTLLFGSLRGGVYDAATATVQTTRLLRSRQNPTFYYVPFTGVTVGARRLRIPIS 320
Query: 325 -----------AIFDSGTSFTYLNDPAYTQISETFNS 350
AI DSGT+ T P ++ F S
Sbjct: 321 AFALRPDGSGGAIVDSGTALTLFPAPVLAEVVRAFRS 357
>gi|308044575|ref|NP_001183392.1| uncharacterized protein LOC100501808 [Zea mays]
gi|238011188|gb|ACR36629.1| unknown [Zea mays]
Length = 342
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 91/341 (26%), Positives = 134/341 (39%), Gaps = 58/341 (17%)
Query: 120 VALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCEL--Q 176
+ LDTGSD+ W+ C C C SG V D P SS+ V C + LC
Sbjct: 1 MVLDTGSDVVWVQCAPCRRCYE----QSGPVFD-----PRRSSSYGAVGCGAALCRRLDS 51
Query: 177 KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSF 236
C C YQV Y DG+++ G V + L A + +R++ GCG G F
Sbjct: 52 GGCDLRRGACMYQVAY-GDGSVTAGDFVTETLTFAGGARV-----ARVALGCGHDNEGLF 105
Query: 237 LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF------------GSDGTGRISFG 284
+ A GL S P+ ++ + SFS C GS + +SFG
Sbjct: 106 VAAAGLLGLG---RGGLSFPTQISRR--YGRSFSYCLVDRTSSGAGAAPGSHRSSTVSFG 160
Query: 285 DKGSPGQGETPFSLRQTHPT----YNITITQVSVGGNAV-------------NFEFSAIF 327
GS G F+ +P Y + + +SVGG V I
Sbjct: 161 -AGSVGASSASFTPMVRNPRMETFYYVQLVGISVGGARVPGVAESDLRLDPSTGRGGVIV 219
Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP-FEYCYVLSPNQTNFEYPVVNLT 386
DSGTS T L +Y+ + + F + A S F+ CY L + + P V++
Sbjct: 220 DSGTSVTRLARASYSALRDAFRAAAAGGLRLSPGGFSLFDTCYDLGGRRV-VKVPTVSMH 278
Query: 387 MKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD-NVNIIG 426
GG + ++ + +G +C +D V+IIG
Sbjct: 279 FAGGAEAALPPENYLIPVDSRG--TFCFAFAGTDGGVSIIG 317
>gi|79507883|ref|NP_196320.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332003717|gb|AED91100.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 455
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 79/299 (26%), Positives = 123/299 (41%), Gaps = 46/299 (15%)
Query: 111 VGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCN 169
+G PA ++A+DT SD+ W+PC CV C +SP S++ V C+
Sbjct: 121 IGTPAQPLLLAMDTSSDVAWIPCSGCVGCPSN-----------TAFSPAKSTSFKNVSCS 169
Query: 170 STLCELQKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGC 228
+ C+ Q P+ G+ C + + Y S + L +D + LA D ++ +FGC
Sbjct: 170 APQCK-QVPNPTCGARACSFNLTYGSSSIAAN--LSQDTIRLAADPIKA------FTFGC 220
Query: 229 GRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS----DGTGRISFG 284
G G P LG+ + + + Q + ++FS C S +G + G
Sbjct: 221 VNKVAG---GGTIPPPQGLLGLGRGPLSLMSQAQSIYKSTFSYCLPSFRSLTFSGSLRLG 277
Query: 285 DKGSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEFSA-----------IFDSGT 331
P + + LR + Y + + + VG V+ +A IFDSGT
Sbjct: 278 PTSQPQRVKYTQLLRNPRRSSLYYVNLVAIRVGRKVVDLPPAAIAFNPSTGAGTIFDSGT 337
Query: 332 SFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY---VLSPNQTNFEYPVVNLTM 387
+T L P Y + F K TS F+ CY V P T F + VN+TM
Sbjct: 338 VYTRLAKPVYEAVRNEFRKRVKPTTAVVTSLGGFDTCYSGQVKVPTIT-FMFKGVNMTM 395
>gi|77808087|gb|AAS48510.2| aspartic protease [Fagopyrum esculentum]
gi|82780908|gb|ABB88696.2| aspartic proteinase-like protein [Fagopyrum esculentum]
Length = 447
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 89/321 (27%), Positives = 130/321 (40%), Gaps = 61/321 (19%)
Query: 110 SVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVID---FNIYSPNTSSTSSKV 166
S+G P + LDTGS L W PC + + + + +D IY+ N SST +
Sbjct: 79 SLGTPPQKVSLVLDTGSSLVWTPCTIPTATYTCQNCTFSGVDPTKIPIYARNKSSTVQSL 138
Query: 167 PCNSTLCEL----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
PC S C C S CPY G+ +TG LV DVL L+ K ++ D
Sbjct: 139 PCRSPKCNWVFGSDLNC-STTKRCPYYGLEYGLGS-TTGQLVSDVLGLS---KLNRIPD- 192
Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS---DGT- 278
FGC S + P G+ G G S+P+ L GL FS C S D T
Sbjct: 193 -FLFGC------SLVSNRQPEGIAGFGRGLASIPAQL---GL--TKFSYCLVSHRFDDTP 240
Query: 279 ---------GRISFGDKGSPGQGETPFS----LRQTHPTYNITITQVSVGGNAVNF---- 321
GR D + G PF+ L Y I+++++ VGG V
Sbjct: 241 QSGDLVLHRGR-RHADAAANGVAYAPFTKSPALSPYSEYYYISLSKILVGGKDVPIPPRY 299
Query: 322 -------EFSAIFDSGTSFTYLN----DPAYTQISETFNSLAKEKRETSTSDLPFEYCYV 370
+ I DSG++FT++ DP ++ + + K +S L CY
Sbjct: 300 LVPSKEGDGGMIVDSGSTFTFMERIIFDPVARELEKHMTKYKRAKEIEDSSGL--GPCYN 357
Query: 371 LSPNQTNFEYPVVNLTMKGGG 391
++ Q+ + P + + KGG
Sbjct: 358 IT-GQSEVDVPKLTFSFKGGA 377
>gi|242067693|ref|XP_002449123.1| hypothetical protein SORBIDRAFT_05g005430 [Sorghum bicolor]
gi|241934966|gb|EES08111.1| hypothetical protein SORBIDRAFT_05g005430 [Sorghum bicolor]
Length = 408
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 75/284 (26%), Positives = 116/284 (40%), Gaps = 36/284 (12%)
Query: 106 YTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFN-IYSPNTSSTSS 164
Y +++G+PA + + +DTGS WL C ++ G N + P T
Sbjct: 40 YVTMNIGEPAEPYFLDIDTGSSFTWLEC---------HAKDGPCKTCNKVPHPLYRLTRK 90
Query: 165 K-VPCNSTLCEL-------QKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEK 215
K VPC LC+ K+C N C Y+V+Y DG S G L+ D L T
Sbjct: 91 KLVPCADPLCDALHKDLGTTKKCTDVRKNQCDYKVKY-QDGLSSLGVLLLDKFSLPTGGA 149
Query: 216 QSKSVDSRISFGCGRVQTGSFLDGAAP----NGLFGLGMDKTSVPSILANQGLI-PNSFS 270
++ I+FGCG Q A +G+ GLG + S L + G + N
Sbjct: 150 RN------IAFGCGYDQMKGSKKKAPEKVPVDGILGLGRGSVDLASQLKHSGAVSKNVIG 203
Query: 271 MCFGSDGTGRISFGDKGSPGQGETPFSLRQTHP----TYNITITQVSVGGNAVNFE-FSA 325
C S G G + G++ P T + T P Y+ + + N + + A
Sbjct: 204 HCLSSKGGGYLFIGEENVPSSHVTWVPMAPTTPGEPNHYSPGQATLHLDSNPIGTKPLKA 263
Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
IFDSG+++TYL + + Q+ + + SD C+
Sbjct: 264 IFDSGSTYTYLPENLHAQLVSALKASLSKSSLKQVSDPALPLCW 307
>gi|356555807|ref|XP_003546221.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 457
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 97/345 (28%), Positives = 149/345 (43%), Gaps = 43/345 (12%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+Y + VG PA F + +DTGS L WL C CV H V I++P+ S T
Sbjct: 107 YYVKIGVGTPAKYFSMIVDTGSSLSWLQCQPCVIYCH--------VQVDPIFTPSVSKTY 158
Query: 164 SKVPCNSTLCELQKQ-------CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
+ C+S+ C K C +A C Y+ Y D + S G+L +DVL L
Sbjct: 159 KALSCSSSQCSSLKSSTLNAPGCSNATGACVYKASY-GDTSFSIGYLSQDVLTLT----P 213
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQ------GLIPNSFS 270
S + S +GCG+ G F A G+ GL DK S+ L+N+ +P+SFS
Sbjct: 214 SAAPSSGFVYGCGQDNQGLFGRSA---GIIGLANDKLSMLGQLSNKYGNAFSYCLPSSFS 270
Query: 271 MCFGSDGTGRISFGDKGSPGQ--GETPFSLRQTHPT-YNITITQVSVGG-----NAVNFE 322
S +G +S G TP P+ Y + +T ++V G +A ++
Sbjct: 271 AQPNSSVSGFLSIGASSLSSSPYKFTPLVKNPKIPSLYFLGLTTITVAGKPLGVSASSYN 330
Query: 323 FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
I DSGT T L Y + ++F + +K + + C+ S + + P
Sbjct: 331 VPTIIDSGTVITRLPVAIYNALKKSFVMIMSKKYAQAPGFSILDTCFKGSVKEMS-TVPE 389
Query: 383 VNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDN-VNIIG 426
+ + +GG + +V E KG CL + S N ++IIG
Sbjct: 390 IRIIFRGGAGLELKVHNSLVEIE-KG--TTCLAIAASSNPISIIG 431
>gi|326525377|dbj|BAK07958.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 463
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 84/335 (25%), Positives = 126/335 (37%), Gaps = 38/335 (11%)
Query: 117 SFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQ 176
++ +ALD G L W+ C+ C H L S ++ P S T S +P ++T+
Sbjct: 110 NYQLALDMGGGLSWM--QCLPCRHCLLQMS------PVFDPTKSPTFSNIPAHNTVWCRP 161
Query: 177 KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSF 236
P A C + + Y D T ++G+L D + S I FGC QT F
Sbjct: 162 PYQPLANGACGFDIAY-RDNTHASGYLARDTFSFPAGNDDFVPL-SAIVFGCAH-QTEHF 218
Query: 237 LDGAAPNGLFGLGMDKTSVPSILANQGLIP---NSFSMCFGSDGTGRISFGDKGSPGQGE 293
+ A G+ GLGM P + ++P FS C G S+ GS
Sbjct: 219 KNQRAVAGILGLGMGPAGKPPTAFTKQVLPAHGGRFSYCPFVPGMSMYSYLRFGSDIPSH 278
Query: 294 TPFSL-RQTHPT---------YNITITQVSVGGNAVNFEFSAIF------------DSGT 331
P ++ RQ+ P Y + + VSVG N ++ A+F D GT
Sbjct: 279 PPPNVHRQSTPVLAPAHNSEAYFVKLAGVSVGANRLSGVTPAMFRRNAHGAGGCVVDIGT 338
Query: 332 SFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGG 391
T AY I ++R + C V P + P + L + G
Sbjct: 339 RMTAFIHSAYVHIDHAVRQ-HLQRRGAHIVVVRGNTC-VQQPAPHHDVLPSMTLHFENGA 396
Query: 392 PFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
V V + G + C G V S ++ +IG
Sbjct: 397 WLRVMPEHVFMPFVVGGHHYQCFGFVSSTDLTVIG 431
>gi|293333012|ref|NP_001168410.1| uncharacterized protein LOC100382179 precursor [Zea mays]
gi|223948083|gb|ACN28125.1| unknown [Zea mays]
gi|413953783|gb|AFW86432.1| hypothetical protein ZEAMMB73_737575 [Zea mays]
Length = 466
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 91/316 (28%), Positives = 133/316 (42%), Gaps = 54/316 (17%)
Query: 100 SLGFLHYT-NVSVGQPALSFIVALDTGSDLFWL---PCDCVSCVHGLNSSSGQVIDFNIY 155
SLG Y VS+G PA++ ++++DTGSD+ W+ PC SC + ++
Sbjct: 124 SLGTPEYVITVSLGTPAVTQVMSIDTGSDVSWVQCAPCAAQSCSSQKD---------KLF 174
Query: 156 SPNTSSTSSKVPCNSTLCELQKQCPSAG-----SNCPYQVRYLSDGTMSTGFLVEDVLHL 210
P S+T S C+S C Q G S+C Y V+Y+ D + +TG D L L
Sbjct: 175 DPAKSATYSAFSCSSAQC---AQLGGEGNGCLNSHCQYIVKYV-DHSNTTGTYGSDTLGL 230
Query: 211 ATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFS 270
T + FGC G F+ +GL GLG D S+ S A +FS
Sbjct: 231 TTSDAVKN-----FQFGCSHRANG-FV--GQLDGLMGLGGDTESLVSQTA--ATYGKAFS 280
Query: 271 MCFGSDGTGRISFGDKGSPGQG-------ETPFSLRQTHPT-YNITITQVSVGGNAVN-- 320
C + F G+ G TP +R PT Y + + ++V G +N
Sbjct: 281 YCLPPSSSSAGGFLTLGAAAGGTSSSRYSRTPL-VRFNVPTFYGVFLQAITVAGTKLNVP 339
Query: 321 ---FEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP---FEYCYVLSPN 374
F +++ DSGT T L AY + F K++ + S P + C+ S
Sbjct: 340 ASVFSGASVVDSGTVITQLPPTAYQALRTAF----KKEMKAYPSAAPVGILDTCFDFSGI 395
Query: 375 QTNFEYPVVNLTMKGG 390
+T PVV LT G
Sbjct: 396 KT-VRVPVVTLTFSRG 410
>gi|357481191|ref|XP_003610881.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512216|gb|AES93839.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 427
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 80/313 (25%), Positives = 129/313 (41%), Gaps = 39/313 (12%)
Query: 97 RLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIY 155
R+ S + +++G P + +DTGSDL W C C C + ++
Sbjct: 74 RVTSNNGDYLMKLTLGSPPVDIYGLVDTGSDLVWAQCTPCGGCYRQKSP---------MF 124
Query: 156 SPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEK 215
P S T S +PC S C S C Y Y +D +++ G L + + ++ +
Sbjct: 125 EPLRSKTYSPIPCESEQCSFFGYSCSPQKMCAYSYSY-ADSSVTKGVLAREAITFSSTDG 183
Query: 216 QSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQ-GLIPNS--FSMC 272
V I FGCG +G+F + + P L +Q G + S FS C
Sbjct: 184 DPVVV-GDIIFGCGHSNSGTFNENDM------GIIGMGGGPLSLVSQIGTLYGSKRFSQC 236
Query: 273 ---FGSDG--TGRISFGDKGS-PGQG--ETPFSLRQTHPTYNITITQVSVGGNAVNFEFS 324
F +D +G I+FG++ G+G TP + + +Y +T+ +SVG V F S
Sbjct: 237 LVPFHTDAHTSGTINFGEESDVSGEGVVTTPLASEEGQTSYLVTLEGISVGDTFVRFNSS 296
Query: 325 A-------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTN 377
+ DSGT TY+ Y ++ E + DL + CY ++TN
Sbjct: 297 ETLSKGNIMIDSGTPATYIPQEFYERLVEELKVQSSLLPIEDDPDLGTQLCYR---SETN 353
Query: 378 FEYPVVNLTMKGG 390
E P++ +G
Sbjct: 354 LEGPILTAHFEGA 366
>gi|297822477|ref|XP_002879121.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297324960|gb|EFH55380.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 711
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 81/333 (24%), Positives = 131/333 (39%), Gaps = 60/333 (18%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ + VG P +DTGS++ W C+ CVH ++ I+ P+ SST
Sbjct: 379 VYLMKLQVGTPPFEIEAVIDTGSEITW--TQCLPCVHCYKQNAP------IFDPSKSSTF 430
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
+ C+ +CPY+V Y D T + G L D + + + + +
Sbjct: 431 KEKRCHD-------------HSCPYEVDYF-DKTYTKGTLATDTVTIHSTSGEPFVMAET 476
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISF 283
I GCGR S+ + G GL S+ I G P S CF +GT +I+F
Sbjct: 477 I-IGCGR--NNSWFRPSF-EGFVGLNWGPLSL--ITQMGGEYPGLMSYCFAGNGTSKINF 530
Query: 284 GDKGSPGQG----ETPFSLRQTHPTYNITITQVSVGGNAVN--------FEFSAIFDSGT 331
G G G T F Y + + VSVG + E + + DSGT
Sbjct: 531 GTNAIVGGGGVVSTTMFVTTARPGFYYLNLDAVSVGDTRIETLGTPFHALEGNIVIDSGT 590
Query: 332 SFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFE-------YCYVLSPNQTNFEYPVVN 384
+ TY E++ +L ++ E +P CY + T +PV+
Sbjct: 591 TLTYF--------PESYCNLVRQAVEHVVPAVPAADPTGNDLLCYY---SNTTEIFPVIT 639
Query: 385 LTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVV 417
+ GG ++ + + S G L+CL ++
Sbjct: 640 MHFSGGADLVLDKYNMFMESYSGG--LFCLAII 670
Score = 45.1 bits (105), Expect = 0.078, Method: Compositional matrix adjust.
Identities = 71/325 (21%), Positives = 123/325 (37%), Gaps = 62/325 (19%)
Query: 103 FLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
+ + + +G P LDTGS+L W C+ C+H + + I+ P+ SST
Sbjct: 63 YEYLMKLQIGTPPFEVEAVLDTGSELIW--TQCLPCLHCYDQKAP------IFDPSKSST 114
Query: 163 SSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
+ CN + +CPY++ Y D + + G L + + + + +
Sbjct: 115 FKETRCN-----------TPDHSCPYKLVY-DDKSYTQGTLATETVTIHSTSGVPFVMPE 162
Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRIS 282
I GC R +GS G P+ +G+ + S+ I G P G G +S
Sbjct: 163 TI-IGCSRNNSGS---GFRPSSSGIVGLSRGSLSLISQMGGAYP----------GDGVVS 208
Query: 283 FGDKGSPGQGETPFSLRQTHPTYNITITQVSVGG---NAVNFEFSA-----IFDSGTSFT 334
T F+ Y + + VSVG V F A + DSGT T
Sbjct: 209 ----------TTMFAKTAKRGQYYLNLDAVSVGDTRIETVGTPFHALNGNIVIDSGTPLT 258
Query: 335 YLNDPAYTQISETFNSLAKEKR--ETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGP 392
Y + + + R + S +D+ CY + T +PV+ + GG
Sbjct: 259 YFPVSYCNLVRKAVERVVTADRVVDPSRNDM---LCYY---SNTIEIFPVITVHFSGGAD 312
Query: 393 FFVNDPIVIVSSEPKGLYLYCLGVV 417
++ + + G ++CL ++
Sbjct: 313 LVLDKYNMYMELNRGG--VFCLAII 335
>gi|15223368|ref|NP_171637.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|9665144|gb|AAF97328.1|AC023628_9 Unknown protein [Arabidopsis thaliana]
gi|22135930|gb|AAM91547.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
gi|30387595|gb|AAP31963.1| At1g01300 [Arabidopsis thaliana]
gi|332189147|gb|AEE27268.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 485
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 89/310 (28%), Positives = 127/310 (40%), Gaps = 50/310 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++T + VG PA + LDTGSD+ WL C C C + I+ P S T
Sbjct: 142 YFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDP---------IFDPRKSKTY 192
Query: 164 SKVPCNSTLCELQKQCPSAGSN-----CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
+ +PC+S C ++ SAG N C YQV Y DG+ + G + L + +
Sbjct: 193 ATIPCSSPHC---RRLDSAGCNTRRKTCLYQVSY-GDGSFTVGDFSTETLTFRRNRVKG- 247
Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----G 274
++ GCG G F+ A L GLG K S P ++ FS C
Sbjct: 248 -----VALGCGHDNEGLFVGAAG---LLGLGKGKLSFPGQTGHR--FNQKFSYCLVDRSA 297
Query: 275 SDGTGRISFGDKGSPGQGE-TP-FSLRQTHPTYNITITQVSVGGNAVNFEFSAIF----- 327
S + FG+ TP S + Y + + +SVGG V +++F
Sbjct: 298 SSKPSSVVFGNAAVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQI 357
Query: 328 -------DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEY 380
DSGTS T L PAY + + F AK + L F+ C+ LS N +
Sbjct: 358 GNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSL-FDTCFDLS-NMNEVKV 415
Query: 381 PVVNLTMKGG 390
P V L +G
Sbjct: 416 PTVVLHFRGA 425
>gi|449457263|ref|XP_004146368.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 469
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 96/375 (25%), Positives = 151/375 (40%), Gaps = 81/375 (21%)
Query: 102 GFLHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTS 160
GFL N+S+G P ++ +V +DTGS L W+ C C++C S + P S
Sbjct: 103 GFL--VNLSIGSPPVTQLVVVDTGSSLLWVQCLPCINCFQQSTS---------WFDPLKS 151
Query: 161 STSSKVPC--------NSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLAT 212
+ + C N C Q Y++RYL G S G L ++ L T
Sbjct: 152 VSFKTLGCGFPGYNYINGYKCNRFNQ-------AEYKLRYLG-GDSSQGILAKESLLFET 203
Query: 213 -DEKQ-----------SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSI-L 259
DE + SK S I+FGCG + + D A NG+FGLG + P I +
Sbjct: 204 LDEGRVFQYNAISTQISKIKKSNITFGCGHMNIKTNNDDAY-NGVFGLG----AYPHITM 258
Query: 260 ANQGLIPNSFSMCFGSDGT-----GRISFGDKGSPGQGE-TPFSLRQTHPTYNITITQVS 313
A Q + N FS C G + G +GS +G+ TP + H Y +T+ +S
Sbjct: 259 ATQ--LGNKFSYCIGDINNPLYTHNHLVLG-QGSYIEGDSTPLQIHFGH--YYVTLQSIS 313
Query: 314 VGGNAVNFEFSA-----------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSD 362
VG + + +A + DSG ++T L + + + + L K E +
Sbjct: 314 VGSKTLKIDPNAFKISSDGSGGVLIDSGMTYTKLANGGFELLYDEIVDLMKGLLERIPTQ 373
Query: 363 LPFE-YCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDP------------IVIVSSEPKGL 409
FE C+ ++ +P V GG + + I+ S + L
Sbjct: 374 RKFEGLCFKGVVSRDLVGFPAVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSELL 433
Query: 410 YLYCLGVVKSDNVNI 424
L +G++ N N+
Sbjct: 434 NLSVIGILAQQNYNV 448
>gi|297848386|ref|XP_002892074.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297337916|gb|EFH68333.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 485
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 89/310 (28%), Positives = 127/310 (40%), Gaps = 50/310 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++T + VG PA + LDTGSD+ WL C C C + I+ P S T
Sbjct: 142 YFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDP---------IFDPRKSKTY 192
Query: 164 SKVPCNSTLCELQKQCPSAGSN-----CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
+ +PC+S C ++ SAG N C YQV Y DG+ + G + L + +
Sbjct: 193 ATIPCSSPHC---RRLDSAGCNTRRKTCLYQVSY-GDGSFTVGDFSTETLTFRRNRVKG- 247
Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----G 274
++ GCG G F+ A L GLG K S P ++ FS C
Sbjct: 248 -----VALGCGHDNEGLFVGAAG---LLGLGKGKLSFPGQTGHR--FNQKFSYCLVDRSA 297
Query: 275 SDGTGRISFGDKGSPGQGE-TP-FSLRQTHPTYNITITQVSVGGNAVNFEFSAIF----- 327
S + FG+ TP S + Y + + +SVGG V +++F
Sbjct: 298 SSKPSSVVFGNAAVSRIARFTPLLSNPKLDTFYYVELLGISVGGTRVPGVAASLFKLDQI 357
Query: 328 -------DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEY 380
DSGTS T L PAY + + F AK + L F+ C+ LS N +
Sbjct: 358 GNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKALKRAPDFSL-FDTCFDLS-NMNEVKV 415
Query: 381 PVVNLTMKGG 390
P V L +G
Sbjct: 416 PTVVLHFRGA 425
>gi|356575389|ref|XP_003555824.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 473
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 73/303 (24%), Positives = 119/303 (39%), Gaps = 40/303 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ + VG P + V +D+GSD+ W+ C+ C C H + +++P SS+
Sbjct: 134 YFVRIGVGSPPRNQYVVIDSGSDIIWVQCEPCTQCYHQSDP---------VFNPADSSSY 184
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
+ V C ST+C C Y+V Y DG+ + G L + L +++
Sbjct: 185 AGVSCASTVCSHVDNAGCHEGRCRYEVSY-GDGSYTKGTLALETLTFG------RTLIRN 237
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDG---TGR 280
++ GCG G F+ A GL GLG S L Q +FS C S G +G
Sbjct: 238 VAIGCGHHNQGMFVGAA---GLLGLGSGPMSFVGQLGGQA--GGTFSYCLVSRGIQSSGL 292
Query: 281 ISFGDKGSP-GQGETPFSLRQTHPTY--------NITITQVSVGGNAVNF----EFSAIF 327
+ FG + P G P ++ + +V + + + +
Sbjct: 293 LQFGREAVPVGAAWVPLIHNPRAQSFYYVGLSGLGVGGLRVPISEDVFKLSELGDGGVVM 352
Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTM 387
D+GT+ T L AY + F + S + F+ CY L + P V+
Sbjct: 353 DTGTAVTRLPTAAYEAFRDAFIAQTTNLPRASGVSI-FDTCYDLF-GFVSVRVPTVSFYF 410
Query: 388 KGG 390
GG
Sbjct: 411 SGG 413
>gi|242079447|ref|XP_002444492.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
gi|241940842|gb|EES13987.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
Length = 441
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 77/295 (26%), Positives = 117/295 (39%), Gaps = 47/295 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ ++++G P L + +DTGSDL W C C+ C + Q + + S+T
Sbjct: 89 YLVDLAIGTPPLYYTAIMDTGSDLIWTQCAPCLLC-------AAQPTPY--FDVKRSATY 139
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
+PC S+ C C YQ Y D + G L + +K +
Sbjct: 140 RALPCRSSRCAALSSPSCFKKMCVYQY-YYGDTASTAGVLANETFTFGA-ASSTKVRAAN 197
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDGTGR 280
ISFGCG + G + +G+ G G S+ S L P+ FS C + S R
Sbjct: 198 ISFGCGSLNAGELANS---SGMVGFGRGPLSLVSQLG-----PSRFSYCLTSYLSPTPSR 249
Query: 281 ISFG----------DKGSPGQGETPFSLRQTHP-TYNITITQVSVGGN---------AVN 320
+ FG GSP Q TPF + P Y +++ +S+G A+N
Sbjct: 250 LYFGVFANLNSTNTSSGSPVQ-STPFVINPALPNMYFLSVKGISLGTKRLPIDPLVFAIN 308
Query: 321 FEFS--AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSP 373
+ + I DSGTS T+L AY + S T D+ + C+ P
Sbjct: 309 DDGTGGVIIDSGTSITWLQQDAYEAVRRGLASTIPLPAMNDT-DIGLDTCFQWPP 362
>gi|357127507|ref|XP_003565421.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
distachyon]
Length = 438
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 77/311 (24%), Positives = 120/311 (38%), Gaps = 55/311 (17%)
Query: 103 FLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
F + + V P + + DTGS L WL C + ++P SS+
Sbjct: 74 FEYLMALDVSTPPVRMLALADTGSSLVWLKCKLPAA----------------HTP-ASSS 116
Query: 163 SSKVPCNSTLCEL---QKQCPSAGSN---CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
+++PC++ C+ C + GS C Y+ + +DG+ + G + D +T
Sbjct: 117 YARLPCDAFACKALGDAASCRATGSGNNICVYRYAF-ADGSCTAGPVTVDAFTFST---- 171
Query: 217 SKSVDSRISFGCG-RVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-- 273
R+ FGC R + S D +GL GL S+ S L+ + + FS C
Sbjct: 172 ------RLDFGCATRTEGLSVPD----DGLVGLANGPISLVSQLSAKTPFAHKFSYCLVP 221
Query: 274 ---GSDGTGRISFGDKG----SPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA- 325
+ ++FG SPG TP + Y I + + V G V + +
Sbjct: 222 YSSSETVSSSLNFGSHAIVSSSPGAATTPLVAGRNKSFYTIALDSIKVAGKPVPLQTTTT 281
Query: 326 --IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVL---SPNQTNFEY 380
I DSGT TYL + + K R S L + CY + +P
Sbjct: 282 KLIVDSGTMLTYLPKAVLDPLVAALTAAIKLPRVKSPETL-YAVCYDVRRRAPEDVGKSI 340
Query: 381 PVVNLTMKGGG 391
P V L + GGG
Sbjct: 341 PDVTLVLGGGG 351
>gi|15242307|ref|NP_199325.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|9758987|dbj|BAB09497.1| chloroplast nucleoid DNA-binding protein-like [Arabidopsis
thaliana]
gi|332007824|gb|AED95207.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 491
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 86/322 (26%), Positives = 132/322 (40%), Gaps = 70/322 (21%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-----DCVSCVHGLNSSSGQVIDFNIYSPNT 159
+ +++G P + V LDTGSDL W+PC DC+ C N+ + +++SP
Sbjct: 83 YLITLNIGTPPQAVQVYLDTGSDLTWVPCGNLSFDCIECYDLKNN---DLKSPSVFSPLH 139
Query: 160 SSTSSKVPCNSTLC-ELQKQ------CPSAGSN------------CPYQVRYLSDGTMST 200
SSTS + C S+ C E+ C AG + CP +G + +
Sbjct: 140 SSTSFRDSCASSFCVEIHSSDNPFDPCAVAGCSVSMLLKSTCVRPCPSFAYTYGEGGLIS 199
Query: 201 GFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILA 260
G L D+L T + R SFGC T ++ + P G+ G G S+PS L
Sbjct: 200 GILTRDILKARTRDV------PRFSFGC---VTSTYRE---PIGIAGFGRGLLSLPSQL- 246
Query: 261 NQGLIPNSFSMCF-------GSDGTGRISFGDKG-----SPGQGETPFSLRQTHP-TYNI 307
G + FS CF + + + G + TP +P +Y I
Sbjct: 247 --GFLEKGFSHCFLPFKFVNNPNISSPLILGASALSINLTDSLQFTPMLNTPMYPNSYYI 304
Query: 308 TITQVSVGGNAVNFEFS-------------AIFDSGTSFTYLNDPAYTQISETFNSLAKE 354
+ +++G N + + DSGT++T+L +P Y+Q+ T S
Sbjct: 305 GLESITIGTNITPTQVPLTLRQFDSQGNGGMLVDSGTTYTHLPEPFYSQLLTTLQSTITY 364
Query: 355 KRETST-SDLPFEYCY-VLSPN 374
R T T S F+ CY V PN
Sbjct: 365 PRATETESRTGFDLCYKVPCPN 386
>gi|125592062|gb|EAZ32412.1| hypothetical protein OsJ_16623 [Oryza sativa Japonica Group]
Length = 473
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 76/291 (26%), Positives = 113/291 (38%), Gaps = 46/291 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ V VG P + +D+GSD+ W+ C C C + ++ P SS+
Sbjct: 130 YFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDP---------LFDPAASSSF 180
Query: 164 SKVPCNSTLCE----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
S V C S +C C Y V Y DG+ + G L + L L Q
Sbjct: 181 SGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTY-GDGSYTKGELALETLTLGGTAVQG-- 237
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSD 276
++ GCG +G F+ A GL GLG S+ L G FS C G+
Sbjct: 238 ----VAIGCGHRNSGLFVGAA---GLLGLGWGAMSLVGQLG--GAAGGVFSYCLASRGAG 288
Query: 277 GTGRISFGDKGSPGQGETPFSL---RQTHPTYNITITQVSVGGNAVNFEFS--------- 324
G G + G + G L Q Y + +T + VGG + + S
Sbjct: 289 GAGSLVLGRTEAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDSLFQLTEDGA 348
Query: 325 --AIFDSGTSFTYLNDPAYTQISETFN-SLAKEKRETSTSDLPFEYCYVLS 372
+ D+GT+ T L AY + F+ ++ R + S L + CY LS
Sbjct: 349 GGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLL--DTCYDLS 397
>gi|242077672|ref|XP_002448772.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
gi|241939955|gb|EES13100.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
Length = 471
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 83/356 (23%), Positives = 137/356 (38%), Gaps = 61/356 (17%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ V +G P + +D+GSD+ W+ C C+ C + ++ P +S+T
Sbjct: 125 YFVRVGIGSPPTEQYLVVDSGSDVIWVQCKPCLECYAQADP---------LFDPASSATF 175
Query: 164 SKVPCNSTLCELQKQ--CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
S V C S +C + C +G C Y+V Y DG+ + G L + L L +
Sbjct: 176 SAVSCGSAICRTLRTSGCGDSG-GCEYEVSY-GDGSYTKGTLALETLTLGGTAVEG---- 229
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS------ 275
++ GCG G F+ A GL GLG S+ L +FS C S
Sbjct: 230 --VAIGCGHRNRGLFVGAA---GLLGLGWGPMSLVGQLGGA--AGGAFSYCLASRGGSGS 282
Query: 276 ---DGTGRISFGDKGSPGQGE--TPFSLRQTHPT-YNITITQVSVGGNAVNFE------- 322
D G + G + +G P P+ Y + ++ + VG + +
Sbjct: 283 GAADAAGSLVLGRSEAVPEGAVWVPLVRNPQAPSFYYVGVSGIGVGDERLPLQDGLFQLT 342
Query: 323 ----FSAIFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEYCYVLSPNQTN 377
+ D+GT+ T L AY + + F ++ R S L + CY LS T+
Sbjct: 343 EDGGGGVVMDTGTAVTRLPQEAYAALRDAFVGAVGALPRAPGVSLL--DTCYDLS-GYTS 399
Query: 378 FEYPVVNLTMKGGGPFF---------VNDPIVIVSSEPKGLYLYCLGVVKSDNVNI 424
P V+ G V+ I ++ P L LG ++ + + I
Sbjct: 400 VRVPTVSFYFDGAATLTLPARNLLLEVDGGIYCLAFAPSSSGLSILGNIQQEGIQI 455
>gi|242058093|ref|XP_002458192.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
gi|241930167|gb|EES03312.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
Length = 468
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 93/360 (25%), Positives = 147/360 (40%), Gaps = 57/360 (15%)
Query: 66 RDRYFRLR-GRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDT 124
R +Y R +G+ D + T G+ ++SL ++ V +G P++S ++ +DT
Sbjct: 90 RSKYIMSRVSKGMMGDDADVSIPTHLGGS----VDSLEYV--VTVGLGTPSVSQVLLIDT 143
Query: 125 GSDLFWL---PCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQ--- 178
GSDL W+ PC+ +C + ++ P+ SST + +PCN+ C
Sbjct: 144 GSDLSWVQCQPCNSTTCYPQKDP---------LFDPSKSSTYAPIPCNTDACRDLTDDGY 194
Query: 179 ---CPS--AGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQT 233
C S + C + + Y DG+ + G + L LA FGCG Q
Sbjct: 195 GGGCASGDGAAQCGFAITY-GDGSQTRGVYSNETLALAPGVAVKD-----FRFGCGHDQD 248
Query: 234 GSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS----------DGTGRISF 283
G+ +GL GLG S+ ++ + +FS C + G G S
Sbjct: 249 GA---NDKYDGLLGLGGAPESL--VVQTASVYGGAFSYCLPALNNQVGFLALGGGGAPSG 303
Query: 284 GDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA-----IFDSGTSFTYLND 338
G + G TP +R+ Y + +T ++VGG ++ SA I DSGT T L
Sbjct: 304 GVVNTSGFVFTPM-IREEETFYVVNMTGITVGGEPIDVPPSAFSGGMIIDSGTVVTELQH 362
Query: 339 PAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDP 398
AY + F +L + CY S +N P V LT GG ++ P
Sbjct: 363 TAYNALQAAFRKAMAAYPLVRNGEL--DTCYDFS-GYSNVTLPKVALTFSGGATIDLDVP 419
>gi|45735845|dbj|BAD12880.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735971|dbj|BAD13000.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
Length = 333
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 84/330 (25%), Positives = 134/330 (40%), Gaps = 39/330 (11%)
Query: 111 VGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNS 170
+G PA +++ +DTGS L WL C C+ + SG V ++P +SST + V C++
Sbjct: 3 LGTPATQYVMVVDTGSSLTWLQCS--PCLVSCHRQSGPV-----FNPKSSSTYASVGCSA 55
Query: 171 TLCE------LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRI 224
C L S+ + C YQ Y D + S G+L +D + +
Sbjct: 56 QQCSDLPSATLNPSACSSSNVCIYQASY-GDSSFSVGYLSKDTVSFGSTSLP------NF 108
Query: 225 SFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFG 284
+GCG+ G F A GL GL +K S+ LA + SF+ C S +
Sbjct: 109 YYGCGQDNEGLFGRSA---GLIGLARNKLSLLYQLAPS--LGYSFTYCLPSSSSSGYLSL 163
Query: 285 DKGSPGQ-GETPF-SLRQTHPTYNITITQVSVGGNAV------NFEFSAIFDSGTSFTYL 336
+PGQ TP S Y I ++ ++V GN + I DSGT T L
Sbjct: 164 GSYNPGQYSYTPMVSSSLDDSLYFIKLSGMTVAGNPLSVSSSAYSSLPTIIDSGTVITRL 223
Query: 337 NDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVN 396
Y+ +S+ + K S + + C+ + P V ++ GG ++
Sbjct: 224 PTSVYSALSKAVAAAMKGTSRASAYSI-LDTCF--KGQASRVSAPAVTMSFAGGAALKLS 280
Query: 397 DPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
++V + CL + + IIG
Sbjct: 281 AQNLLVDVDDS---TTCLAFAPARSAAIIG 307
>gi|357122568|ref|XP_003562987.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 455
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 95/383 (24%), Positives = 142/383 (37%), Gaps = 49/383 (12%)
Query: 44 GILAVDDLPKKGSFAYYSALAHRD--RYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSL 101
G+ + P+ + + RD R+ R LA LT A N
Sbjct: 26 GLTRIHADPEVTASEFVRGALRRDMHRHARFAREQLAPSSAAAAGLTVGAPTQKDLRN-- 83
Query: 102 GFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFN--IYSPNT 159
G + +S+G P LS+ DTGSDL W C C + + Q + +Y+P++
Sbjct: 84 GGEYIMTLSIGTPPLSYRAIADTGSDLIW--TQCAPCGDTVTDTDNQCFKQSGCLYNPSS 141
Query: 160 SSTSSKVPCNSTL---CELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
S+T +PCNS L + P G C Y Y + T G + +
Sbjct: 142 STTFGVLPCNSPLSMCAAMAGPSPPPGCACMYNQTYGTGWT--AGVQSVETFTFGSSSTP 199
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--- 273
I+FGC + + +G+A GL GLG S+ S L +FS C
Sbjct: 200 PAVRVPNIAFGCSNASSNDW-NGSA--GLVGLGRGSMSLVSQLGA-----GAFSYCLTPF 251
Query: 274 -GSDGTGRISFGD------KGSPGQGETPF----SLRQTHPTYNITITQVSVGGNAVNFE 322
++ T + G KG+ TPF S Y + +T +SVG A+
Sbjct: 252 QDANSTSTLLLGPSAAAALKGTGPVRSTPFVAGPSKAPMSTYYYLNLTGISVGETALAIP 311
Query: 323 FSA-----------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETS---TSDLPFEYC 368
A I DSGT+ T L D AY Q+ SL + + + C
Sbjct: 312 PDAFSLRADGTGGLIIDSGTTITTLVDSAYQQVRAAVRSLLVTRLPLAHGPDHSTGLDLC 371
Query: 369 YVLSPNQTNFEYPVVNLTMKGGG 391
+ L + P + L +GG
Sbjct: 372 FALKASTPPPAMPSMTLHFEGGA 394
>gi|255588450|ref|XP_002534607.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223524923|gb|EEF27776.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 260
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 51/167 (30%), Positives = 82/167 (49%), Gaps = 17/167 (10%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
T + +G P F + +DTGS++ ++PC C G G+ D T S+S+
Sbjct: 52 TKLYIGTPPQEFTLVVDTGSNMTFVPC----C--GSEEYCGKHEDPAF---QTESSSTYQ 102
Query: 167 PCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISF 226
P N C C S C Y++ Y DG+ S G L ED++ +S+ R+ F
Sbjct: 103 PVN---CHPSCDCDYLRSQCSYKMHY-GDGSYSRGVLAEDIISFG---NESEFAPQRLVF 155
Query: 227 GCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
GC GS A +G+ GLG ++++ L ++G+I +SFS+C+
Sbjct: 156 GCELDAIGSLYSLRA-DGIIGLGRGRSTIVDQLVDKGVISDSFSLCY 201
>gi|296085498|emb|CBI29230.3| unnamed protein product [Vitis vinifera]
Length = 542
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 78/275 (28%), Positives = 120/275 (43%), Gaps = 36/275 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ N+ +G P + I +DTGSDL W C C C QV+ ++ P SST
Sbjct: 92 YLMNLYIGTPPVPVIAIVDTGSDLTWTQCRPCTHCYK-------QVVP--LFDPKNSSTY 142
Query: 164 SKVPCNSTLCEL--QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
C ++ C + + S C ++ Y +DG+ + G L + L D K V
Sbjct: 143 RDSSCGTSFCLALGKDRSCSKEKKCTFRYSY-ADGSFTGGNLASET--LTVDSTAGKPVS 199
Query: 222 -SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-----GS 275
+FGCG G F + +G+ GLG + S+ S L + I FS C S
Sbjct: 200 FPGFAFGCGHSSGGIF--DKSSSGIVGLGGGELSLISQL--KSTINGLFSYCLLPVSTDS 255
Query: 276 DGTGRISFGDKGS-PGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIFDSGTSFT 334
+ RI+FG G G G LR + Y+ T+V G + I DSGT++T
Sbjct: 256 SISSRINFGASGRVSGYGTVSTPLRLPYKGYS-KKTEVEEG--------NIIVDSGTTYT 306
Query: 335 YLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
+L Y+++ ++ + K KR + + F CY
Sbjct: 307 FLPQEFYSKLEKSVANSIKGKRVRDPNGI-FSLCY 340
>gi|226503109|ref|NP_001147206.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195608496|gb|ACG26078.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|413921850|gb|AFW61782.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 441
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 78/295 (26%), Positives = 120/295 (40%), Gaps = 47/295 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ ++++G P L + +DTGSDL W C C+ C + Q + + S+T
Sbjct: 89 YLVDLAIGTPPLYYTAIMDTGSDLIWTQCAPCLLC-------ADQPTPY--FDVKKSATY 139
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
+PC S+ C C YQ Y D + G L + +K +
Sbjct: 140 RALPCRSSRCASLSSPSCFKKMCVYQ-YYYGDTASTAGVLANETFTFGA-ANSTKVRATN 197
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDGTGR 280
I+FGCG + G D A +G+ G G S+ S L P+ FS C + S R
Sbjct: 198 IAFGCGSLNAG---DLANSSGMVGFGRGPLSLVSQLG-----PSRFSYCLTSYLSATPSR 249
Query: 281 ISFG----------DKGSPGQGETPFSLRQTHP-TYNITITQVSVGGN---------AVN 320
+ FG GSP Q TPF + P Y +++ +S+G A+N
Sbjct: 250 LYFGVYANLSSTNTSSGSPVQ-STPFVINPALPNMYFLSLKAISLGTKLLPIDPLVFAIN 308
Query: 321 FEFS--AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSP 373
+ + I DSGTS T+L AY + S A + +D+ + C+ P
Sbjct: 309 DDGTGGVIIDSGTSITWLQQDAYEAVRRGLVS-AIPLPAMNDTDIGLDTCFQWPP 362
>gi|125589909|gb|EAZ30259.1| hypothetical protein OsJ_14308 [Oryza sativa Japonica Group]
Length = 178
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 41/127 (32%), Positives = 65/127 (51%), Gaps = 9/127 (7%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLP-CDCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L+YT++ +G PA+ + V LDTGS FW+ C C H S + Y P +S +
Sbjct: 58 LYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPH----ESDILRKLTFYDPRSSVS 113
Query: 163 SSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLAT--DEKQSKSV 220
S +V C+ T+C + C + CPY Y +DG ++ G L D+LH Q++
Sbjct: 114 SKEVKCDDTICTSRPPC-NMTLRCPYITGY-ADGGLTMGILFTDLLHYHQLYGNGQTQPT 171
Query: 221 DSRISFG 227
+ ++FG
Sbjct: 172 STSVTFG 178
>gi|255564685|ref|XP_002523337.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223537425|gb|EEF39053.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 469
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 86/307 (28%), Positives = 122/307 (39%), Gaps = 44/307 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++T + VG P + LDTGSD+ W+ C C C + S V D P S +
Sbjct: 126 YFTRIGVGTPPRYVYMVLDTGSDIVWIQCAPCKRCY----AQSDPVFD-----PRKSRSF 176
Query: 164 SKVPCNSTLCELQKQ--CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
+ + C S LC C + C YQV Y DG+ + G + L ++
Sbjct: 177 ASIACRSPLCHRLDSPGCNTQKQTCMYQVSY-GDGSFTFGDFSTETLTF------RRTRV 229
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GSDG 277
+R++ GCG G F+ A GLG + S PS + + FS C S
Sbjct: 230 ARVALGCGHDNEGLFVGAAGLL---GLGRGRLSFPSQTGRR--FNHKFSYCLVDRSASSK 284
Query: 278 TGRISFGDKGSPGQGE-TPFSLRQTHPT-YNITITQVSVGGNAVN------FEFS----- 324
+ FGD TP T Y + + +SVGG V F+
Sbjct: 285 PSSMVFGDSAVSRTARFTPLVSNPKLDTFYYVELLGISVGGTRVPGITASLFKLDQTGNG 344
Query: 325 -AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
I DSGTS T L PAY + F + A + L F+ C+ LS +T + P V
Sbjct: 345 GVIIDSGTSVTRLTRPAYIAFRDAFRAGASNLKRAPQFSL-FDTCFDLS-GKTEVKVPTV 402
Query: 384 NLTMKGG 390
L +G
Sbjct: 403 VLHFRGA 409
>gi|413925432|gb|AFW65364.1| hypothetical protein ZEAMMB73_378208 [Zea mays]
Length = 418
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 93/304 (30%), Positives = 131/304 (43%), Gaps = 53/304 (17%)
Query: 60 YSALAHR--DRYFRLRGR-GLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPAL 116
++ AHR +R L R G A+ G+ ++PL +G Y + S+G P
Sbjct: 42 FTRAAHRSRERLSILATRLGAASAGSAQSPLQMDSGGGAYDMT---------FSMGTPPQ 92
Query: 117 SFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE- 174
+ DTGSDL W C C C ++S Y P SS+ SK+PC+S LC
Sbjct: 93 TLSALADTGSDLIWAKCGACKRCAPRGSAS---------YYPTKSSSFSKLPCSSALCRT 143
Query: 175 LQKQ-------CPSAGSNCPYQVRY-LSDGTM--STGFLVEDVLHLATDEKQSKSVDSRI 224
L+ Q + G+ C Y+ Y LS + G++ + L +D Q I
Sbjct: 144 LESQSLATCGGTRARGAVCSYRYSYGLSSNPHHYTQGYMGSETFTLGSDAVQG------I 197
Query: 225 SFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD--GTGRIS 282
FGC T S + +GL GLG K S L Q L +FS C SD + +
Sbjct: 198 GFGC---TTMSEGGYGSGSGLVGLGRGKLS----LVRQ-LKVGAFSYCLTSDPSTSSPLL 249
Query: 283 FGDKG--SPGQGETPFSLRQTHPTYNITITQVSVGGNAV--NFEFSAIFDSGTSFTYLND 338
FG PG TP +T Y + + +S+G IFDSGT+ T+L +
Sbjct: 250 FGAGALTGPGVQSTPLVNLKTSTFYTVNLDSISIGAAKTPGTGRHGIIFDSGTTLTFLAE 309
Query: 339 PAYT 342
PAYT
Sbjct: 310 PAYT 313
>gi|242059211|ref|XP_002458751.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
gi|241930726|gb|EES03871.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
Length = 444
Score = 69.3 bits (168), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 83/315 (26%), Positives = 122/315 (38%), Gaps = 57/315 (18%)
Query: 64 AHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLH----YTNVSVGQPALSFI 119
A + R F LR R + A + P + L F H +++VG P +
Sbjct: 30 AAKPRAFPLRARQVPAGALPRPP------------SKLRFHHNVSLTVSLAVGTPPQNVT 77
Query: 120 VALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQ--- 176
+ LDTGS+L WL C + G ++ + P S+T + VPC ST C +
Sbjct: 78 MVLDTGSELSWL--LCATGRQGSAAAGAAAAMGESFRPRASATFAAVPCGSTQCSSRDLP 135
Query: 177 --KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTG 234
C A C + Y +DG+ S G L DV + ++ R +FGC
Sbjct: 136 APPSCDGASRQCHVSLSY-ADGSASDGALATDVFAVG------EAPPLRSAFGCMSTAYD 188
Query: 235 SFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG-SDGTGRISFGDKGSP--GQ 291
S DG A GL G+ S + + + FS C D G + G P
Sbjct: 189 SSPDGVATAGLLGMNRGTLSFVTQASTR-----RFSYCISDRDDAGVLLLGHSDLPFLPL 243
Query: 292 GETPFSLRQTHP-------TYNITITQVSVGGNAVNFEFSAI-----------FDSGTSF 333
TP + T P Y++ + + VGG A+ S + DSGT F
Sbjct: 244 NYTPL-YQPTLPLPYFDRVAYSVQLLGIRVGGKALPIPASVLAPDHTGAGQTMVDSGTQF 302
Query: 334 TYLNDPAYTQISETF 348
T+L AY+ + F
Sbjct: 303 TFLLGDAYSALKAEF 317
>gi|50508275|dbj|BAD32124.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
Length = 451
Score = 69.3 bits (168), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 95/357 (26%), Positives = 144/357 (40%), Gaps = 64/357 (17%)
Query: 108 NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
N+S+G P ++F V DTGS L W C C C + P +SST SK+
Sbjct: 93 NLSIGTPPVTFSVLADTGSSLIWTQCAPCTECAARPAPP---------FQPASSSTFSKL 143
Query: 167 PCNSTLCELQKQCPSAGSNCPYQVRYLSDGT-MSTGFLVEDVLHLATDEKQSKSVDSRIS 225
PC S+LC+ P N V Y G + G+L + LH+ ++
Sbjct: 144 PCASSLCQFLTS-PYLTCNATGCVYYYPYGMGFTAGYLATETLHVGGASFPG------VA 196
Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD---GTGRIS 282
FGC + G G + +G+ GLG S+++ G+ FS C SD G I
Sbjct: 197 FGC-STENGV---GNSSSGIVGLGRSPL---SLVSQVGV--GRFSYCLRSDADAGDSPIL 247
Query: 283 FGDKGSPGQG---ETPFSLRQTHPT---YNITITQVSVGG-----NAVNFEFS------- 324
FG G TP P+ Y + +T ++VG + F F+
Sbjct: 248 FGSLAKVTGGNVQSTPLLENPEMPSSSYYYVNLTGITVGATDLPVTSTTFGFTRGAGAGL 307
Query: 325 ---AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETST---SDLPFEYCYVLSPNQTNF 378
I DSGT+ TYL Y + F S T+T + F+ C+ +
Sbjct: 308 VGGTIVDSGTTLTYLVKEGYAMVKRAFLSQMATANLTTTVNGTRFGFDLCFDATAAGGGS 367
Query: 379 EYPVVNLTMK--GGGPFFVNDP----IVIVSSEPKGLYLYCLGVVKSD---NVNIIG 426
PV L ++ GG + V +V V S+ + + CL V+ + +++IIG
Sbjct: 368 GVPVPTLVLRFAGGAEYAVRRRSYVGVVAVDSQGRAA-VECLLVLPASEKLSISIIG 423
>gi|297720195|ref|NP_001172459.1| Os01g0608366 [Oryza sativa Japonica Group]
gi|53792202|dbj|BAD52835.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
Group]
gi|255673454|dbj|BAH91189.1| Os01g0608366 [Oryza sativa Japonica Group]
Length = 452
Score = 69.3 bits (168), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 80/273 (29%), Positives = 119/273 (43%), Gaps = 36/273 (13%)
Query: 99 NSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSP 157
+SL L Y +V +G PA++ V +DTGSD+ W+ C+ ++ +G + D P
Sbjct: 101 SSLDTLEYVISVGLGSPAVTQRVVIDTGSDVSWVQCEPCPAPSPCHAHAGALFD-----P 155
Query: 158 NTSSTSSKVPCNSTLCEL-----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLAT 212
SST + C++ C + A S C Y V+Y DG+ +TG DVL L+
Sbjct: 156 AASSTYAAFNCSAAACAQLGDSGEANGCDAKSRCQYIVKY-GDGSNTTGTYSSDVLTLSG 214
Query: 213 DEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTS-VPSILANQGLIPNSFSM 271
+ V FGC + G+ +D +GL GLG D S V A G SF
Sbjct: 215 SD-----VVRGFQFGCSHAELGAGMDDKT-DGLIGLGGDAQSPVSQTAARYG---KSFFY 265
Query: 272 CFGSD--GTGRISFGDKGSPGQ------GETPFSLRQTHPTYNI-TITQVSVGGNAVN-- 320
C + +G ++ G S G TP + PTY + ++VGG +
Sbjct: 266 CLPATPASSGFLTLGAPASGGGGGASRFATTPMLRSKKVPTYYFAALEDIAVGGKKLGLS 325
Query: 321 ---FEFSAIFDSGTSFTYLNDPAYTQISETFNS 350
F ++ DSGT T L AY +S F +
Sbjct: 326 PSVFAAGSLVDSGTVITRLPPAAYAALSSAFRA 358
>gi|413921849|gb|AFW61781.1| hypothetical protein ZEAMMB73_702843 [Zea mays]
Length = 379
Score = 69.3 bits (168), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 78/295 (26%), Positives = 120/295 (40%), Gaps = 47/295 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ ++++G P L + +DTGSDL W C C+ C + Q + + S+T
Sbjct: 89 YLVDLAIGTPPLYYTAIMDTGSDLIWTQCAPCLLC-------ADQPTPY--FDVKKSATY 139
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
+PC S+ C C YQ Y D + G L + +K +
Sbjct: 140 RALPCRSSRCASLSSPSCFKKMCVYQY-YYGDTASTAGVLANETFTFGA-ANSTKVRATN 197
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDGTGR 280
I+FGCG + G D A +G+ G G S+ S L P+ FS C + S R
Sbjct: 198 IAFGCGSLNAG---DLANSSGMVGFGRGPLSLVSQLG-----PSRFSYCLTSYLSATPSR 249
Query: 281 ISFG----------DKGSPGQGETPFSLRQTHP-TYNITITQVSVGGN---------AVN 320
+ FG GSP Q TPF + P Y +++ +S+G A+N
Sbjct: 250 LYFGVYANLSSTNTSSGSPVQ-STPFVINPALPNMYFLSLKAISLGTKLLPIDPLVFAIN 308
Query: 321 FEFS--AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSP 373
+ + I DSGTS T+L AY + S A + +D+ + C+ P
Sbjct: 309 DDGTGGVIIDSGTSITWLQQDAYEAVRRGLVS-AIPLTAMNDTDIGLDTCFQWPP 362
>gi|357163818|ref|XP_003579856.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 467
Score = 69.3 bits (168), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 93/386 (24%), Positives = 152/386 (39%), Gaps = 89/386 (23%)
Query: 100 SLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPN 158
S G + + +G P F A+DT SDL W C CV C L+ +++P
Sbjct: 83 SAGGEYLVKLGLGTPQHCFTAAIDTASDLIWTQCQPCVKCYKQLDP---------VFNPV 133
Query: 159 TSSTSSKVPCNSTLC-ELQ-KQCPSAGSN-----CPYQVRYLSDGTMSTGFLVEDVLHLA 211
S++ + VPCNS C EL +C G + C Y Y + T + G L D L +
Sbjct: 134 ASTSYAVVPCNSDTCDELDTHRCARDGDSDDEDACQYTYSYGGNAT-TRGILAVDRLAIG 192
Query: 212 TDEKQSKSVDSRISFGCGRVQTGSFLDGAAPN--GLFGLGMDKTSVPSILANQGLI---- 265
D V + FGC + S + G P G+ GLG S+ S L+ + +
Sbjct: 193 DD------VFRGVVFGC----SSSSVGGPPPQVSGVVGLGRGALSLVSQLSVRRFMYCLP 242
Query: 266 -PNSFS---MCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVN 320
P S S + G+D + + + + P S +P+ Y + + +S+G A++
Sbjct: 243 PPVSRSAGRLVLGADAAATV----RNASERVVVPMSTGSRYPSYYYLNLDGISIGDRAMS 298
Query: 321 FE------------------------------------FSAIFDSGTSFTYLNDPAYTQI 344
F + I D ++ T+L + Y
Sbjct: 299 FRSRNRMNATTPGTAAGAPASPVSGSGDGDGSGTGPDAYGMIIDIASTITFLEESLY--- 355
Query: 345 SETFNSLAKEKR--ETSTSDLPFEYCYVLSPN--QTNFEYPVVNLTMKGGGPFFVNDPIV 400
E + L +E R S SDL + C++L + P V+L +G + +
Sbjct: 356 EEMVDDLEEEIRLPRGSGSDLGLDLCFILPEGVPMSRVYAPPVSLAFEGVWLRLDKEQMF 415
Query: 401 IVSSEPKGLYLYCLGVVKSDNVNIIG 426
+ E + + CL V K+D V+I+G
Sbjct: 416 V---EDRASGMMCLMVGKTDGVSILG 438
>gi|242041115|ref|XP_002467952.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
gi|241921806|gb|EER94950.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
Length = 774
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 86/353 (24%), Positives = 138/353 (39%), Gaps = 52/353 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+ ++++G P + LDTGSDL W C C + + G + P+ SST
Sbjct: 415 YLVHLAIGTPPQPVQLILDTGSDLVW--TQCRPCPVCFSRALGPL------DPSNSSTFD 466
Query: 165 KVPCNSTLCELQKQCPSAGSN-----CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
+PC+S +C+ N C Y Y +DG+++TG L + A + ++
Sbjct: 467 VLPCSSPVCDNLTWSSCGKHNWGNQTCVYVYAY-ADGSITTGHLDAETFTFAAADGTGQA 525
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTG 279
++FGCG G F G+ G G S+PS L ++FS CF +
Sbjct: 526 TVPDLAFGCGLFNNGIFTSNE--TGIAGFGRGALSLPSQLKV-----DNFSHCFTAITGS 578
Query: 280 RISFGDKGSPGQ---------GETPF-----SLRQTHPTYNITITQVSVGGNAVNFEFS- 324
S G P TP SLR Y +++ ++VG + S
Sbjct: 579 EPSSVLLGLPANLYSDADGAVQSTPLVQNFSSLR----AYYLSLKGITVGSTRLPIPEST 634
Query: 325 ----------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLS-P 373
I DSGT T L AY + + F + + + +TS C+ S P
Sbjct: 635 FALKQDGTGGTIIDSGTGMTTLPQDAYKLVHDAFTAQVRLPVDNATSSSLSRLCFSFSVP 694
Query: 374 NQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
+ + P + L +G + + E G + CL + D++ IIG
Sbjct: 695 RRAKPDVPKLVLHFEGATLDLPRENYMF-EFEDAGGSVTCLAINAGDDLTIIG 746
>gi|297741705|emb|CBI32837.3| unnamed protein product [Vitis vinifera]
Length = 455
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 107/431 (24%), Positives = 174/431 (40%), Gaps = 68/431 (15%)
Query: 36 HRYSDPVKGILAVDDLPKKGSFAYYSALAH---RDRYFRLRGRGLAAQGNDKTPLTFSAG 92
H S P K + K S A +AL R Y R R + A Q D P
Sbjct: 51 HSPSSPYKNV-------KAESLAKDTALESTLSRHAYLRARQQK-ALQPADFVPPPLIRD 102
Query: 93 NDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVID 151
+ N+S+G P + V LDTGSDLFW+ C+ C C +
Sbjct: 103 KSAF---------LANLSIGNPPTNVYVVLDTGSDLFWIQCEPCDVCYKQKDP------- 146
Query: 152 FNIYSPNTSSTSSKVPCNSTLC---ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVL 208
IY+ S + +++ CN C + QC +GS C YQ Y +DG+ ++G L + +
Sbjct: 147 --IYNRTKSDSYTEMLCNEPPCLSLGREGQCSDSGS-CLYQTSY-ADGSRTSGLLSYEKV 202
Query: 209 HLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNS 268
T + +++ FGCG +Q +F+ + G+ GLG S+ S L+ G + S
Sbjct: 203 AF-TSHYSDEDKTAQVGFGCG-LQNLNFVTSSRDGGVLGLGPGLVSLVSQLSAIGKVSKS 260
Query: 269 FSMCFGS----DGTGRISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGGNAVN 320
F+ CFG+ + G + FGD TP + + + + + + + N+ +
Sbjct: 261 FAYCFGNLSNPNAGGFLVFGDATYLNGDMTPMVIAEFYYVNLLGIGLGVEEPRLDINSSS 320
Query: 321 FEFS------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRE----TSTSDLPFEYCYV 370
FE I DSG++ + Y + K+ TS+ D C+
Sbjct: 321 FERKPDGSGGVIIDSGSTLSIFPPEVYEVVRNAVVDKLKKGYNISPLTSSPD-----CFE 375
Query: 371 LSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG---- 426
+ +P + L ++ G +ND I L+CLG + ++IIG
Sbjct: 376 GKIGRDLPLFPTLVLYLESTG--ILNDRWSIFLQRYDE--LFCLGFTSGEGLSIIGTLAQ 431
Query: 427 REYPIANNISL 437
+ Y N+ L
Sbjct: 432 QSYKFGYNLEL 442
>gi|357450863|ref|XP_003595708.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355484756|gb|AES65959.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 407
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 87/312 (27%), Positives = 131/312 (41%), Gaps = 51/312 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ +S+G P + DTGSDL W C C C N ++ P +SS+
Sbjct: 60 YLMELSIGTPPIKIYAEADTGSDLVWFQCIPCTKCYKQQNP---------MFDPRSSSSY 110
Query: 164 SKVPCNSTLCEL--QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
+ + C + C C + C Y Y +D +++ G L ++ L L + + +
Sbjct: 111 TNITCGTESCNKLDSSLCSTDQKTCNYTYSY-ADNSITQGVLAQETLTLTSTTGEPVAFQ 169
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPS-ILANQGLIPNSFSMC---FGSDG 277
I FGCG +G F D GL GLG S+ S I ++ G N FS C F +D
Sbjct: 170 GII-FGCGHNNSG-FNDREM--GLIGLGRGPLSLISQIGSSLGAGGNMFSQCLVPFNTDP 225
Query: 278 --TGRISFGDKGSP----GQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS------- 324
T +++FG KGS G TP + + Y T+ +SV +N FS
Sbjct: 226 SITSQMNFG-KGSEVLGNGTVSTPL-ISKDGTGYFATLLGISV--EDINLPFSNGSSLGT 281
Query: 325 -----AIFDSGTSFTYLNDPAYTQ-ISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNF 378
+ DSGT+ TYL + Y + I + N +A E +E CY TN
Sbjct: 282 ITKGNILIDSGTTITYLPEEFYHRLIEQVRNKVALEPFRIDG----YELCY---QTPTNL 334
Query: 379 EYPVVNLTMKGG 390
P + + +GG
Sbjct: 335 NGPTLTIHFEGG 346
>gi|242084332|ref|XP_002442591.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
gi|241943284|gb|EES16429.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
Length = 493
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 87/353 (24%), Positives = 138/353 (39%), Gaps = 49/353 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ ++VG PA+ ++A+DTGSD+ WL C C C SG V D P S++
Sbjct: 134 YMAKIAVGTPAVEALLAMDTGSDITWLQCQPCRRCY----PQSGPVFD-----PRHSTSY 184
Query: 164 SKVPCNSTLCELQKQCPSAGSN---CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
++ ++ C+ + + C Y V Y DG+ + G +E+ L A +
Sbjct: 185 REMGYDAPDCQALGRSGGGDAKRMTCVYAVGYGDDGSTTVGDFIEETLTFAGGVQV---- 240
Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF------- 273
+S GCG G F AA G+ GLG + S PS +A G SFS C
Sbjct: 241 -PHMSIGCGHDNKGLFAAPAA--GILGLGRGQISCPSQIAALGYNVTSFSYCLADFFLSS 297
Query: 274 -GSDGTGRISFGD---KGSPGQGETPFSLRQTHPTY--------------NITITQVSVG 315
G + ++ GD GSP TP T+ +T+ +
Sbjct: 298 PGRSVSSTLTIGDGAAAGSPPPSFTPTVQNLNMATFYYVRLVGVSVGGVRVPGVTEDDLK 357
Query: 316 GNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP--FEYCYVLSP 373
+ I DSGT+ T L AY + F + A + + S F+ CY +
Sbjct: 358 LDPYTGRGGVILDSGTAVTRLARRAYIAFRDAFRAAAVDLGQVSIGGPSGFFDTCYTM-- 415
Query: 374 NQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
+ P V++ GG + ++ + G + +V+IIG
Sbjct: 416 GGRAMKVPTVSMHFAGGVELTLPPKNYLIPVDSMGTVCFAFAGTGDRSVSIIG 468
>gi|312281631|dbj|BAJ33681.1| unnamed protein product [Thellungiella halophila]
Length = 502
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 91/356 (25%), Positives = 138/356 (38%), Gaps = 50/356 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+++ + VG PA V LDTGSD+ W+ C C C + I+ P +SST
Sbjct: 164 YFSRIGVGTPAKEMYVVLDTGSDVNWIQCLPCSECYQQSDP---------IFDPTSSSTF 214
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
+ C+ C + C YQV Y DG+ + G D + K +
Sbjct: 215 KSLTCSDPKCASLDVSACRSNKCLYQVSY-GDGSFTVGNYATDTVTFGESGKVND----- 268
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISF 283
++ GCG G F A GL G + T NQ + SFS C + + S
Sbjct: 269 VALGCGHDNEGLFTGAAGLLGLGGGALSMT-------NQ-IKAKSFSYCLVDRDSAKSSS 320
Query: 284 GDKGS----PGQGETPFSLRQTHPT-YNITITQVSVGGNAVN-----FEFSA------IF 327
D S G P T Y + ++ SVGG V+ FE A I
Sbjct: 321 LDFNSVQIGAGDATAPLLRNSKMDTFYYVGLSGFSVGGQQVSIPSSLFEVDASGAGGVIL 380
Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTM 387
D GT+ T L AY + + F L + ++ ++ F+ CY S T + P V
Sbjct: 381 DCGTAVTRLQTQAYNSLRDAFVKLTTDFKKGTSPISLFDTCYDFSSLST-VKVPTVTFHF 439
Query: 388 KGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGR--------EYPIANNI 435
GG + ++ + G + + S +++IIG Y +ANN+
Sbjct: 440 TGGKSLNLPAKNYLIPIDDAGTFCFAFAPTSS-SLSIIGNVQQQGTRITYDLANNL 494
>gi|449440933|ref|XP_004138238.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 487
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 97/349 (27%), Positives = 141/349 (40%), Gaps = 55/349 (15%)
Query: 102 GFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVS---CVHGLNSSSGQVIDFNIYSP 157
G + + VGQP F + DTGSD+ WL C C S C + I+ P
Sbjct: 145 GAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCASENTCYKQFDP---------IFDP 195
Query: 158 NTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
+SS+ S + CNS C+L + C YQV Y DG+ +TG L + L S
Sbjct: 196 KSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHY-GDGSFTTGELATETLSFG----NS 250
Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FG 274
S+ + + GCG G F GA GL G + +S L +SFS C
Sbjct: 251 NSIPN-LPIGCGHDNEGLFAGGAGLIGLGGGAISLSS--------QLKASSFSYCLVNLD 301
Query: 275 SDGTGRISFGDKGSPGQGETPFSLRQTHPTYN-ITITQVSVGGNAV-----NFEFSA--- 325
SD + + F +P +Y + + +SVGG + FE
Sbjct: 302 SDSSSTLEFNSNMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESGL 361
Query: 326 ---IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP----FEYCYVLSPNQTNF 378
I DSGT + L Y + E F L +S S P F+ CY S Q+N
Sbjct: 362 GGIIVDSGTIISRLPSDVYESLREAFVKLT-----SSLSPAPGISVFDTCYNFS-GQSNV 415
Query: 379 EYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD-NVNIIG 426
E P + + G + ++ + G YCL +K+ +++IIG
Sbjct: 416 EVPTIAFVLSEGTSLRLPARNYLIMLDTAG--TYCLAFIKTKSSLSIIG 462
>gi|357118074|ref|XP_003560784.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial
[Brachypodium distachyon]
Length = 452
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 86/303 (28%), Positives = 124/303 (40%), Gaps = 39/303 (12%)
Query: 109 VSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPC 168
V G PA + DTGSDL W+ C C V D P SS+ + VPC
Sbjct: 116 VGFGSPAQTSATMFDTGSDLSWI--QCQPCSGHCYKQHDPVFD-----PAKSSSYAVVPC 168
Query: 169 NSTLCELQ-KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFG 227
+T C +C G+ C Y V Y DG+ +TG L + L ++ + + + FG
Sbjct: 169 GTTECAAAGGEC--NGTTCVYGVEY-GDGSSTTGVLARETLTFSSSSEFTGFI-----FG 220
Query: 228 CGRVQTGSF--LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT--GRISF 283
CG G F +DG G L + + P+ G I FS C S T G +S
Sbjct: 221 CGETNLGDFGEVDGLLGLGRGSLSLSSQAAPAF----GGI---FSYCLPSYNTTPGYLSI 273
Query: 284 GDKGSPGQGETPFSLRQTHPTYN----ITITQVSVGGNAVNF---EFS---AIFDSGTSF 333
G GQ ++ P Y I + +++GG + EF+ + DSGT
Sbjct: 274 GATPVTGQIPVQYTAMVNKPDYPSFYFIELVSINIGGYVLPVPPSEFTKTGTLLDSGTIL 333
Query: 334 TYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPF 393
TYL PAYT + + F + + D + CY + Q+ P V+ G F
Sbjct: 334 TYLPPPAYTALRDRFKFTMQGSKPAPPYDE-LDTCYDFT-GQSGILIPGVSFNFSDGAVF 391
Query: 394 FVN 396
+N
Sbjct: 392 NLN 394
>gi|297821064|ref|XP_002878415.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297324253|gb|EFH54674.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 486
Score = 68.6 bits (166), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 92/315 (29%), Positives = 127/315 (40%), Gaps = 54/315 (17%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ + VG PA + + LDTGSD+ WL C C +C + + I+ P S T
Sbjct: 138 YFMRLGVGTPATNVYMVLDTGSDVVWLQCSPCKACYNQSDV---------IFDPKKSKTF 188
Query: 164 SKVPCNSTLCEL---QKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
+ VPC S LC +C + S C YQV Y DG+ + G + L
Sbjct: 189 ATVPCGSRLCRRLDDSSECVTRRSKTCLYQVSY-GDGSFTEGDFSTETLTF-----HGAR 242
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF------ 273
VD + GCG G F+ A GLG S PS + FS C
Sbjct: 243 VD-HVPLGCGHDNEGLFVGAAGLL---GLGRGGLSFPS--QTKSRYNGKFSYCLVDRTSS 296
Query: 274 --GSDGTGRISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGGNAV------NF 321
S I FG+ P + F+ T+P Y + + +SVGG+ V F
Sbjct: 297 GSSSKPPSTIVFGNDAVP--KTSVFTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSESQF 354
Query: 322 EFSA------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQ 375
+ A I DSGTS T L AY + + F L K + + S F+ C+ LS
Sbjct: 355 KLDATGNGGVIIDSGTSVTRLTQSAYVALRDAFR-LGATKLKRAPSYSLFDTCFDLS-GM 412
Query: 376 TNFEYPVVNLTMKGG 390
T + P V GG
Sbjct: 413 TTVKVPTVVFHFGGG 427
>gi|357450869|ref|XP_003595711.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355484759|gb|AES65962.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 443
Score = 68.6 bits (166), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 79/286 (27%), Positives = 116/286 (40%), Gaps = 42/286 (14%)
Query: 109 VSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
+S+G P + +DTGSDL WL C C +C LN ++ P +SST S +
Sbjct: 63 LSIGTPPVKTYAQVDTGSDLIWLQCIPCTNCYKQLNP---------MFDPQSSSTYSNIA 113
Query: 168 CNSTLCE--LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
S C C +NC Y Y D +++ G L ++ L L + + ++ I
Sbjct: 114 YGSESCSKLYSTSCSPDQNNCNYTYSY-EDDSITEGVLAQETLTLTSTTGKPVALKGVI- 171
Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-----GSDGTGR 280
FGCG G F D G+ GLG S+ S + + FS C T
Sbjct: 172 FGCGHNNNGVFNDKEM--GIIGLGRGPLSLVSQIGS-SFGGKMFSQCLVPFHTNPSITSP 228
Query: 281 ISFGDKGSPGQGE----TPFSLRQTHPT-YNITITQVSVGGNAVNFEF------------ 323
+SFG KGS G TP + TH Y +T+ +SV +N F
Sbjct: 229 MSFG-KGSEVLGNGVVSTPLVSKNTHQAFYFVTLLGISV--EDINLPFNDGSSLEPITKG 285
Query: 324 SAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
+ + DSGT T L + Y ++ E + L ++ CY
Sbjct: 286 NMVIDSGTPTTLLPEDFYHRLVEEVRNKVALDPIPIDPTLGYQLCY 331
>gi|294463081|gb|ADE77078.1| unknown [Picea sitchensis]
Length = 370
Score = 68.6 bits (166), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 80/316 (25%), Positives = 122/316 (38%), Gaps = 64/316 (20%)
Query: 122 LDTGSDLFWLPC----DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE--- 174
+DTGSDL W+PC C++C ++S+G ++ P SS+ V C + C+
Sbjct: 1 MDTGSDLVWVPCTRNYSCINCPED-SASNG------VFLPRMSSSLHLVTCADSNCKTLY 53
Query: 175 ------LQKQCPSAGSNC-----PYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
L + C + NC PY ++Y T G L+ + L+L + + +
Sbjct: 54 GNNTELLCQSCAGSLKNCSETCPPYGIQYGRGST--AGLLLTETLNLPLENGEGARAITH 111
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS------DG 277
+ GC S + P+G+ G G S+PS L + + F+ C S +
Sbjct: 112 FAVGC------SIVSSQQPSGIAGFGRGALSMPSQLGEH-IGKDRFAYCLQSHRFDEENK 164
Query: 278 TGRISFGDKGSPGQ---GETPFSLRQTHPT-------YNITITQVSVGGNAVN------F 321
+ GDK P TPF P Y I + VS+GG +
Sbjct: 165 KSLMVLGDKALPNNIPLNYTPFLTNSRAPPSSQYGVYYYIGLRGVSIGGKRLKQLPSKLL 224
Query: 322 EFS------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP-FEYCYVLSPN 374
F I DSGT+FT +D + I+ F S +R D CY ++
Sbjct: 225 RFDTKGNGGTIIDSGTTFTVFSDEIFKHIAAGFASQIGYRRAGEVEDKTGMGLCYDVT-G 283
Query: 375 QTNFEYPVVNLTMKGG 390
N P KGG
Sbjct: 284 LENIVLPEFAFHFKGG 299
>gi|255563741|ref|XP_002522872.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223537956|gb|EEF39570.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 448
Score = 68.6 bits (166), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 100/425 (23%), Positives = 153/425 (36%), Gaps = 62/425 (14%)
Query: 30 FGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTF 89
F + HRYS + G+ Y + ++R LA T F
Sbjct: 28 FSLEIVHRYSR--------ESPFYPGNITDYERITRLVELSKIRAHNLAI----TTSSGF 75
Query: 90 SAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQ 148
S R++ + V +G P + + DTGS LFW C+ C L
Sbjct: 76 SPEAFRLRISQDDTCYLVKVIIGSPGVPLYLVPDTGSGLFWTQCEPCTRRFRQLPP---- 131
Query: 149 VIDFNIYSPNTSSTSSKVPCNSTLCELQK---QCPSAGSNCPYQVRYLSDGTMSTGFLVE 205
I++ S T +PC C + QC C Y++ Y + G+ + G +
Sbjct: 132 -----IFNSTASRTYRDLPCQHQFCTNNQNVFQC--RDDKCVYRIAY-AGGSATAGVAAQ 183
Query: 206 DVLHLATDEKQSKSVDSRISFGCGRVQTG--SFLDGAAPNGLFGLGMDKTSVPSILANQG 263
D+L A +++ FGC R +F G+ GL M S+ +
Sbjct: 184 DILQSAENDRIP------FYFGCSRDNQNFSTFESSGKGGGIIGLNMSPVSL--LQQMNH 235
Query: 264 LIPNSFSMCFG-------SDGTGRISFGD---KGSPGQGETPFSLRQTHPTYNITITQVS 313
+ N FS C S T + FG+ K TPF + P Y + + VS
Sbjct: 236 ITKNRFSYCLNLFDLSSPSHATSLLRFGNDIRKSRRKYLSTPFVSPRGMPNYFLNLIDVS 295
Query: 314 VGGNAVNF---EFS--------AIFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTS 361
V GN + F+ I DSGT+ TY++ AY + F N + +
Sbjct: 296 VAGNRMQIPPGTFALKPDGTGGTIIDSGTAVTYISQTAYFPVITAFKNYFDQHGFQRVNI 355
Query: 362 DLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDN 421
L CY T YP + +G FFV V ++ + +G + L +
Sbjct: 356 QLSGYICYK-QQGHTFHNYPSMAFHFQGAD-FFVEPEYVYLTVQDRGAFCVALQPISPQQ 413
Query: 422 VNIIG 426
IIG
Sbjct: 414 RTIIG 418
>gi|356527532|ref|XP_003532363.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 429
Score = 68.6 bits (166), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 85/290 (29%), Positives = 129/290 (44%), Gaps = 51/290 (17%)
Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTS 160
+GF + T +++GQPA + + +DTGSDL WL CD C H + +Y P
Sbjct: 66 VGFYNVT-LNIGQPARPYFLDVDTGSDLTWLQCD-APCTHCSETP------HPLYRP--- 114
Query: 161 STSSKVPCNSTLC-ELQKQCPSAGSNCP------YQVRYLSDGTMSTGFLVEDVLHLA-T 212
++ VPC LC LQ P+ NC Y++ Y +D + G L+ DV L T
Sbjct: 115 -SNDFVPCRDPLCASLQ---PTEDYNCEHPDQCDYEINY-ADQYSTFGVLLNDVYLLNFT 169
Query: 213 DEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC 272
+ Q K R++ GCG Q S +GL GLG K S+ S L +QGL+ N C
Sbjct: 170 NGVQLKV---RMALGCGYDQVFSPSSYHPLDGLLGLGRGKASLISQLNSQGLVRNVIGHC 226
Query: 273 FGSDGTG-----------RISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF 321
+ G G R+++ TP S + Y+ ++ GG
Sbjct: 227 LSAQGGGYIFFGNAYDSARVTW----------TPISSVDSK-HYSAGPAELVFGGRKTGV 275
Query: 322 -EFSAIFDSGTSFTYLNDPAYTQ-ISETFNSLAKEKRETSTSDLPFEYCY 369
+A+FD+G+S+TY N AY +S L+ + + + D C+
Sbjct: 276 GSLTAVFDTGSSYTYFNSHAYQALLSWLKKELSGKPLKVAPDDQTLPLCW 325
>gi|242050432|ref|XP_002462960.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
gi|241926337|gb|EER99481.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
Length = 445
Score = 68.6 bits (166), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 99/395 (25%), Positives = 163/395 (41%), Gaps = 59/395 (14%)
Query: 64 AHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALD 123
A R R R LAA ++ T T SA +++ + +++G P +S+ D
Sbjct: 50 ALRRDMHRHNARQLAASSSNGT--TVSAPT---QISPTAGEYLMTLAIGTPPVSYQAIAD 104
Query: 124 TGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTL----CELQKQC 179
TGSDL W C C SS +Y+P++S+T + +PCNS+L L
Sbjct: 105 TGSDLIW--TQCAPC-----SSQCFQQPTPLYNPSSSTTFAVLPCNSSLSMCAAALAGTT 157
Query: 180 PSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDG 239
P G C Y + Y S T + + + + +++ I+FGC G +
Sbjct: 158 PPPGCTCMYNMTYGSGWT--SVYQGSETFTFGSSTPANQTGVPGIAFGCSNASGG--FNT 213
Query: 240 AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GSDGTGRISFGDKGS----PGQ 291
++ +GL GLG S+ S L +P FS C ++ T + G S G
Sbjct: 214 SSASGLVGLGRGSLSLVSQLG----VPK-FSYCLTPYQDTNSTSTLLLGPSASLNDTGGV 268
Query: 292 GETPFSLRQTHPT----YNITITQVSVGGNAVNFEFSA-----------IFDSGTSFTYL 336
TPF + Y + +T +S+G A++ +A I DSGT+ T L
Sbjct: 269 SSTPFVASPSDAPMSTYYYLNLTGISLGTTALSIPTTALSLKADGTGGFIIDSGTTITLL 328
Query: 337 NDPAYTQISETFNSLAK-EKRETSTSDLPFEYCYVLSPNQTNF--EYPVVNLTMKGGGPF 393
+ AY Q+ SL + ++ + C+ L P+ T+ P + L G
Sbjct: 329 GNTAYQQVRAAVVSLVTLPTTDGGSAATGLDLCFEL-PSSTSAPPTMPSMTLHFDGADMV 387
Query: 394 FVNDPIVIVSSEPKGLYLYCLGVVKSDN--VNIIG 426
D +++ S L+CL + + V+I+G
Sbjct: 388 LPADSYMMLDSN-----LWCLAMQNQTDGGVSILG 417
>gi|297842769|ref|XP_002889266.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297335107|gb|EFH65525.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 489
Score = 68.6 bits (166), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 88/311 (28%), Positives = 132/311 (42%), Gaps = 56/311 (18%)
Query: 66 RDRYFRLRGRGLAA----QGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVA 121
R + +LR + + + Q +T + ++G +L +L ++ V +G +S IV
Sbjct: 100 RVQSLQLRIKAMTSSTTEQSVSETQIPLTSG---IKLETLNYI--VTVELGGKNMSLIV- 153
Query: 122 LDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE-----L 175
DTGSDL W+ C C SC + +Y P+ SS+ V CNS+ C+
Sbjct: 154 -DTGSDLTWVQCQPCRSCYNQQGP---------LYDPSVSSSYKTVFCNSSTCQDLVAAT 203
Query: 176 QKQCPSAGSN------CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCG 229
P G N C Y V Y DG+ + G L + + L + ++ + FGCG
Sbjct: 204 GNSGPCGGFNGVVKTTCEYVVSY-GDGSYTRGDLASESIVLGDTKLEN------LVFGCG 256
Query: 230 RVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS--DG-TGRISFGD- 285
R G F +GL GLG ++SV + FS C S DG +G +SFG+
Sbjct: 257 RNNKGLF---GGASGLMGLG--RSSVSLVSQTLKTFNGVFSYCLPSLEDGASGTLSFGND 311
Query: 286 ----KGSPGQGETPFSLR-QTHPTYNITITQVSVGG---NAVNFEFSAIFDSGTSFTYLN 337
K S TP Q Y + +T S+GG ++F + DSGT T L
Sbjct: 312 FSVYKNSTSVFYTPLVQNPQLRSFYILNLTGASIGGVELKTLSFGRGILIDSGTVITRLP 371
Query: 338 DPAYTQISETF 348
Y + F
Sbjct: 372 PSIYKAVKTEF 382
>gi|90399033|emb|CAJ86229.1| H0402C08.5 [Oryza sativa Indica Group]
gi|125550227|gb|EAY96049.1| hypothetical protein OsI_17922 [Oryza sativa Indica Group]
Length = 473
Score = 68.6 bits (166), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 75/291 (25%), Positives = 112/291 (38%), Gaps = 46/291 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ V VG P + +D+GSD+ W+ C C C + ++ P SS+
Sbjct: 130 YFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDP---------LFDPAASSSF 180
Query: 164 SKVPCNSTLCE----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
S V C S +C C Y V Y DG+ + G L + L L Q
Sbjct: 181 SGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTY-GDGSYTKGELALETLTLGGTAVQG-- 237
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSD 276
++ GCG +G F+ A GL GLG S+ L G FS C G+
Sbjct: 238 ----VAIGCGHRNSGLFVGAA---GLLGLGWGAMSLIGQLG--GAAGGVFSYCLASRGAG 288
Query: 277 GTGRISFGDKGSPGQGETPFSL---RQTHPTYNITITQVSVGGNAVNFE----------- 322
G G + G + G L Q Y + +T + VGG + +
Sbjct: 289 GAGSLVLGRTEAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDGLFQLTEDGA 348
Query: 323 FSAIFDSGTSFTYLNDPAYTQISETFN-SLAKEKRETSTSDLPFEYCYVLS 372
+ D+GT+ T L AY + F+ ++ R + S L + CY LS
Sbjct: 349 GGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLL--DTCYDLS 397
>gi|326503602|dbj|BAJ86307.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 461
Score = 68.6 bits (166), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 81/315 (25%), Positives = 117/315 (37%), Gaps = 64/315 (20%)
Query: 70 FRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLH----YTNVSVGQPALSFIVALDTG 125
F LR R + A+ + P + L F H +++VG P + + LDTG
Sbjct: 58 FALRARQMPARALPRQP------------SKLRFHHNVSLTVSLAVGTPPQNVTMVLDTG 105
Query: 126 SDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQ-----KQCP 180
S+L WL C + ++ S + P SST + VPC S C + C
Sbjct: 106 SELSWLLCAPAGARNKFSAMS--------FRPRASSTFAAVPCASAQCRSRDLPSPPACD 157
Query: 181 SAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGA 240
A S C + Y +DG+ S G L DV + + R +FGC S DG
Sbjct: 158 GASSRCSVSLSY-ADGSSSDGALATDVFAVGSGPPL------RAAFGCMSSAFDSSPDGV 210
Query: 241 APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG-SDGTGRISFGDKGSPG--------- 290
A GL G+ S S + + FS C D G + G P
Sbjct: 211 ASAGLLGMNRGALSFVSQASTR-----RFSYCISDRDDAGVLLLGHSDLPTFLPLNYTPM 265
Query: 291 -QGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAI-----------FDSGTSFTYLND 338
Q P Y++ + + VGG + S + DSGT FT+L
Sbjct: 266 YQPALPLPYFD-RVAYSVQLLGIRVGGKHLPIPASVLAPDHTGAGQTMVDSGTQFTFLLG 324
Query: 339 PAYTQISETFNSLAK 353
AY+ + F A+
Sbjct: 325 DAYSALKAEFTRQAR 339
>gi|356569916|ref|XP_003553140.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 560
Score = 68.6 bits (166), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 80/276 (28%), Positives = 113/276 (40%), Gaps = 46/276 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ +V VG P F + LDTGSDL W+ C C +C + Y P SS+
Sbjct: 195 YFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCYACFEQ---------NGPYYDPKDSSSF 245
Query: 164 SKVPCNSTLCEL------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVED-VLHLATDE-K 215
+ C+ C+L + C +CPY Y + F +E ++L T E K
Sbjct: 246 KNITCHDPRCQLVSSPDPPQPCKGETQSCPYFYWYGDSSNTTGDFALETFTVNLTTPEGK 305
Query: 216 QSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-- 273
+ + FGCG G F A L GLG S + L Q L +SFS C
Sbjct: 306 PELKIVENVMFGCGHWNRGLFHGAAG---LLGLGRGPLSFATQL--QSLYGHSFSYCLVD 360
Query: 274 ---GSDGTGRISFGDKGS----PGQGETPFSLRQTHPT---YNITITQVSVGGNAVN--- 320
S + ++ FG+ P T F + +P Y + I + VGG +
Sbjct: 361 RNSNSSVSSKLIFGEDKELLSHPNLNFTSFVGGKENPVDTFYYVLIKSIMVGGEVLKIPE 420
Query: 321 --FEFSA------IFDSGTSFTYLNDPAYTQISETF 348
+ SA I DSGT+ TY +PAY I E F
Sbjct: 421 ETWHLSAQGGGGTIIDSGTTLTYFAEPAYEIIKEAF 456
>gi|358345197|ref|XP_003636668.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355502603|gb|AES83806.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 294
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 71/234 (30%), Positives = 102/234 (43%), Gaps = 30/234 (12%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ +S+G P + DTGSDL WL C C +C LN ++ +SST
Sbjct: 59 YLMELSIGTPPVKIYAQADTGSDLIWLQCIPCTNCYKQLNP---------MFDSQSSSTF 109
Query: 164 SKVPCNSTLCE--LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
S + C S C C NC Y Y+ DG+ + G L ++ L L + + +
Sbjct: 110 SNIACGSESCSKLYSTSCSPDQINCKYNYSYV-DGSETQGVLAQETLTLTSTTGEPVAFK 168
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTG-- 279
I FGCG G+F D G+ GLG S+ S + + L N FS C T
Sbjct: 169 GVI-FGCGHNNNGAFNDKEM--GIIGLGRGPLSLVSQIGSS-LGGNMFSQCLVPFNTNPS 224
Query: 280 ---RISFGDKGSPGQGE----TPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA 325
+SFG KGS G TP + T+ + Y +T+ +SV +N F+A
Sbjct: 225 ISSPMSFG-KGSEVLGNGVVSTPLVSKTTYQSFYFVTLLGISV--EDINLPFNA 275
>gi|116787333|gb|ABK24467.1| unknown [Picea sitchensis]
Length = 497
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 89/310 (28%), Positives = 123/310 (39%), Gaps = 49/310 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++T + VG P + LDTGSD+ W+ C C C + +++P SST
Sbjct: 153 YFTRLGVGTPPRYTYMVLDTGSDIMWIQCLPCAKCYGQTDP---------LFNPAASSTY 203
Query: 164 SKVPCNSTLCELQKQCPSAGSN----CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
KVPC + LC K+ +G C YQV Y DG+ + G + L
Sbjct: 204 RKVPCATPLC---KKLDISGCRNKRYCEYQVSY-GDGSFTVGDFSTETLTF------RGQ 253
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GS 275
V R++ GCG G F+ A GLG S PS Q FS C S
Sbjct: 254 VIRRVALGCGHDNEGLFIGAAGLL---GLGRGSLSFPSQTGAQ--FSKRFSYCLVDRSAS 308
Query: 276 DGTGRISFGDKGSPGQGE-TP-FSLRQTHPTYNITITQVSVGGNAVN------FEFSA-- 325
+ FG P TP S + Y + + +SVGG + F A
Sbjct: 309 GTASSLIFGKAAIPKSAIFTPLLSNPKLDTFYYVELVGISVGGRRLTSIPASVFRMDATG 368
Query: 326 ----IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYP 381
I DSGTS T L D AY+ + + F + L F+ CY LS +T + P
Sbjct: 369 NGGVIIDSGTSVTRLVDSAYSTMRDAFRVGTGNLKSAGGFSL-FDTCYDLSGLKT-VKVP 426
Query: 382 VVNLTMKGGG 391
+ +GG
Sbjct: 427 TLVFHFQGGA 436
>gi|21594980|gb|AAM66061.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
Length = 485
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 88/308 (28%), Positives = 126/308 (40%), Gaps = 50/308 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++T + VG PA + LDTGSD+ WL C C C + I+ P S T
Sbjct: 142 YFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDP---------IFDPRKSKTY 192
Query: 164 SKVPCNSTLCELQKQCPSAGSN-----CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
+ +PC+S C ++ SAG N C YQV Y DG+ + G + L + +
Sbjct: 193 ATIPCSSPHC---RRLDSAGCNTRRKTCLYQVSY-GDGSFTVGDFSTETLTFRRNRVKG- 247
Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----G 274
++ GCG G F+ A L GLG K S P ++ FS C
Sbjct: 248 -----VALGCGHDNEGLFVGAAG---LLGLGKGKLSFPGQTGHR--FNQKFSYCLVDRSA 297
Query: 275 SDGTGRISFGDKGSPGQGE-TP-FSLRQTHPTYNITITQVSVGGNAVNFEFSAIF----- 327
S + FG+ TP S + Y + + +SVGG V +++F
Sbjct: 298 SSKPSSVVFGNAAVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQI 357
Query: 328 -------DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEY 380
DSGTS T L PAY + + F AK + L F+ C+ LS N +
Sbjct: 358 GNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPNFSL-FDTCFDLS-NMNEVKV 415
Query: 381 PVVNLTMK 388
P V L +
Sbjct: 416 PTVVLHFR 423
>gi|125563764|gb|EAZ09144.1| hypothetical protein OsI_31414 [Oryza sativa Indica Group]
Length = 472
Score = 68.2 bits (165), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 88/293 (30%), Positives = 124/293 (42%), Gaps = 54/293 (18%)
Query: 84 KTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGL 142
+ P+T A RL +L ++ + G+ V +DT S+L W+ C C SC
Sbjct: 112 RVPVTSGA-----RLRTLNYVATVGLGGGEAT----VIVDTASELTWVQCAPCASC---- 158
Query: 143 NSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE-LQKQCPSAGS--------NCPYQVRYL 193
+ G + D P +S + + +PCNS+ C+ LQ SA +C Y + Y
Sbjct: 159 HDQQGPLFD-----PASSPSYAVLPCNSSSCDALQVATGSAAGACGGGEQPSCSYTLSY- 212
Query: 194 SDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKT 253
DG+ S G L D L LA + V FGCG G F +GL GLG +
Sbjct: 213 RDGSYSQGVLAHDKLSLAGE------VIDGFVFGCGTSNQGPF---GGTSGLMGLGRSQL 263
Query: 254 SVPSILANQGLIPNSFSMCF---GSDGTGRISFGDKGSPGQGETPF------SLRQTHPT 304
S+ S +Q FS C S+ +G + GD S + TP S P
Sbjct: 264 SLISQTMDQ--FGGVFSYCLPLKESESSGSLVLGDDTSVYRNSTPIVYTTMVSDPVQGPF 321
Query: 305 YNITITQVSVGGNAVNFEFSA---IFDSGTSFTYLNDPAYTQISETFNSLAKE 354
Y + +T +++GG V E SA I DSGT T L Y + F S E
Sbjct: 322 YFVNLTGITIGGQEV--ESSAGKVIVDSGTIITSLVPSVYNAVKAEFLSQFAE 372
>gi|340810987|gb|AEK75420.1| S5 [Oryza rufipogon]
gi|340810989|gb|AEK75421.1| S5 [Oryza rufipogon]
gi|340810991|gb|AEK75422.1| S5 [Oryza rufipogon]
gi|340811001|gb|AEK75427.1| S5 [Oryza rufipogon]
gi|340811019|gb|AEK75436.1| S5 [Oryza rufipogon]
gi|340811104|gb|AEK75478.1| S5 [Oryza rufipogon]
gi|340811124|gb|AEK75488.1| S5 [Oryza rufipogon]
Length = 472
Score = 68.2 bits (165), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 74/263 (28%), Positives = 109/263 (41%), Gaps = 34/263 (12%)
Query: 103 FLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
FL VS+G+P + +VA+DTGS L W+ C C H ++ +G + D P S
Sbjct: 112 FLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFD-----PGRSY 166
Query: 162 TSSKVPCNSTLC-EL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
TS +V C+S C EL Q C ++C Y V Y + S G +V D L +
Sbjct: 167 TSRRVRCSSVKCGELRYDLRLQQANCMEKENSCTYSVTYGNGWAYSVGKMVTDTLRIGDS 226
Query: 214 EKQSKSVDSRISFGCGR-VQTGSFLDGAAPNGLFGLGMDK--TSVPSILANQGLIPNSFS 270
+ FGC V+ F G G + P IL+ + +FS
Sbjct: 227 FMD-------LMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYK-----AFS 274
Query: 271 MCFGSDGT--GRISFG--DKGSPGQGETPFSLRQTHPTYNITITQ-VSVGGNAVNFEFSA 325
C +D T G + G D+ + G TP PTY++T+ ++ G V
Sbjct: 275 YCLPTDETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRLVTSSSEM 334
Query: 326 IFDSGTSFTYLNDPAYTQISETF 348
I DSG T L + + +T
Sbjct: 335 IVDSGAQRTSLWPSTFALLDKTI 357
>gi|449527149|ref|XP_004170575.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 487
Score = 68.2 bits (165), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 97/349 (27%), Positives = 141/349 (40%), Gaps = 55/349 (15%)
Query: 102 GFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVS---CVHGLNSSSGQVIDFNIYSP 157
G + + VGQP F + DTGSD+ WL C C S C + I+ P
Sbjct: 145 GAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCASENTCYKQFDP---------IFDP 195
Query: 158 NTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
+SS+ S + CNS C+L + C YQV Y DG+ +TG L + L S
Sbjct: 196 KSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHY-GDGSFTTGELATETLSFG----NS 250
Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FG 274
S+ + + GCG G F GA GL G + +S L +SFS C
Sbjct: 251 NSIPN-LPIGCGHDNEGLFAGGAGLIGLGGGAISLSS--------QLKASSFSYCLVNLD 301
Query: 275 SDGTGRISFGDKGSPGQGETPFSLRQTHPTYN-ITITQVSVGGNAV-----NFEFSA--- 325
SD + + F +P +Y + + +SVGG + FE
Sbjct: 302 SDSSSTLEFNSYMPSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESGL 361
Query: 326 ---IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP----FEYCYVLSPNQTNF 378
I DSGT + L Y + E F L +S S P F+ CY S Q+N
Sbjct: 362 GGIIVDSGTIISRLPSDVYESLREAFVKLT-----SSLSPAPGISVFDTCYNFS-GQSNV 415
Query: 379 EYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD-NVNIIG 426
E P + + G + ++ + G YCL +K+ +++IIG
Sbjct: 416 EVPTIAFVLSEGTSLRLPARNYLIMLDTAG--TYCLAFIKTKSSLSIIG 462
>gi|224053042|ref|XP_002297678.1| predicted protein [Populus trichocarpa]
gi|222844936|gb|EEE82483.1| predicted protein [Populus trichocarpa]
Length = 482
Score = 68.2 bits (165), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 90/331 (27%), Positives = 142/331 (42%), Gaps = 57/331 (17%)
Query: 38 YSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYR 97
++ +K L +DD + +L R + + GR + + PLT R
Sbjct: 83 WNKKLKKHLIMDDFQLR-------SLQSRMKSI-ISGRNIDDSVDAPIPLT-----SGIR 129
Query: 98 LNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYS 156
L +L ++ V +G ++ IV DTGSDL W+ C C C + + +++
Sbjct: 130 LQTLNYI--VTVELGGRKMTVIV--DTGSDLSWVQCQPCKRCYNQQDP---------VFN 176
Query: 157 PNTSSTSSKVPCNSTLCE-LQKQ------CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLH 209
P+TS + V C+S C+ LQ C S +C Y V Y DG+ + G L + L
Sbjct: 177 PSTSPSYRTVLCSSPTCQSLQSATGNLGVCGSNPPSCNYVVNY-GDGSYTRGELGTEHLD 235
Query: 210 LATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSF 269
L S +V++ I FGCGR G F +GL GLG ++S+ I + F
Sbjct: 236 LGN----STAVNNFI-FGCGRNNQGLF---GGASGLVGLG--RSSLSLISQTSAMFGGVF 285
Query: 270 SMCF---GSDGTGRISFGDKGSPGQGETPFSLRQTHPT-----YNITITQVSVGGNAVNF 321
S C ++ +G + G S + TP S + P Y + +T ++VG AV
Sbjct: 286 SYCLPITETEASGSLVMGGNSSVYKNTTPISYTRMIPNPQLPFYFLNLTGITVGSVAVQA 345
Query: 322 ----EFSAIFDSGTSFTYLNDPAYTQISETF 348
+ + DSGT T L Y + + F
Sbjct: 346 PSFGKDGMMIDSGTVITRLPPSIYQALKDEF 376
>gi|356569424|ref|XP_003552901.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 536
Score = 68.2 bits (165), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 100/360 (27%), Positives = 145/360 (40%), Gaps = 59/360 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ ++ VG P + LDTGSDL W+ CD C C Y+PN SS+
Sbjct: 170 YFIDMFVGTPPKHVWLILDTGSDLSWIQCDPCYDCFEQNGPH---------YNPNESSSY 220
Query: 164 SKVPCNSTLCEL------QKQCPSAGSNCPYQVRYLSDGTMSTG-FLVEDVLHLAT---- 212
+ C C+L + C + CPY Y +DG+ +TG F +E T
Sbjct: 221 RNISCYDPRCQLVSSPDPLQHCKTENQTCPYFYDY-ADGSNTTGDFALETFTVNLTWPNG 279
Query: 213 DEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC 272
EK VD + FGCG G F GL GLG S PS L Q + +SFS C
Sbjct: 280 KEKFKHVVD--VMFGCGHWNKGFF---HGAGGLLGLGRGPLSFPSQL--QSIYGHSFSYC 332
Query: 273 F-----GSDGTGRISFG-DKGSPGQGETPFS-LRQTHPT-----YNITITQVSVGGNAVN 320
+ + ++ FG DK F+ L T Y + I + VGG ++
Sbjct: 333 LTDLFSNTSVSSKLIFGEDKELLNHHNLNFTKLLAGEETPDDTFYYLQIKSIVVGGEVLD 392
Query: 321 -----FEFSA------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
+ +S+ I DSG++ T+ D AY I E F K ++ + D CY
Sbjct: 393 IPEKTWHWSSEGVGGTIIDSGSTLTFFPDSAYDVIKEAFEKKIK-LQQIAADDFIMSPCY 451
Query: 370 VLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDN---VNIIG 426
+S E P + G + EP + CL ++K+ N + IIG
Sbjct: 452 NVS-GAMQVELPDYGIHFADGAVWNFPAENYFYQYEPDE--VICLAILKTPNHSHLTIIG 508
>gi|356508308|ref|XP_003522900.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 439
Score = 68.2 bits (165), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 94/344 (27%), Positives = 141/344 (40%), Gaps = 57/344 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ +G P + ++A+DT +D W+PC C C L ++P S+T
Sbjct: 98 YIVRAKIGSPPQTLLLAMDTSNDAAWIPCTACDGCTSTL------------FAPEKSTTF 145
Query: 164 SKVPCNSTLCELQKQCPSAG-SNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
V C S C Q PS G S C + + Y S + +V+D + LATD
Sbjct: 146 KNVSCGSPQCN-QVPNPSCGTSACTFNLTYGSSSIAAN--VVQDTVTLATDPIPD----- 197
Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS----DGT 278
+FGC TG+ A P GL GLG S+ S Q L ++FS C S + +
Sbjct: 198 -YTFGCVAKTTGA---SAPPQGLLGLGRGPLSLLS--QTQNLYQSTFSYCLPSFKSLNFS 251
Query: 279 GRISFGDKGSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEFSA----------- 325
G + G P + + L+ + Y + + + VG V+ A
Sbjct: 252 GSLRLGPVAQPIRIKYTPLLKNPRRSSLYYVNLVAIRVGRKVVDIPPEALAFNAATGAGT 311
Query: 326 IFDSGTSFTYLNDPAYTQISETFN---SLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPV 382
+FDSGT FT L PAYT + + F ++A + T TS F+ CY + P
Sbjct: 312 VFDSGTVFTRLVAPAYTAVRDEFQRRVAIAAKANLTVTSLGGFDTCYTVP-----IVAPT 366
Query: 383 VNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKS-DNVNII 425
+ G D I+I S+ CL + + DNVN +
Sbjct: 367 ITFMFSGMNVTLPEDNILIHSTAGSTT---CLAMASAPDNVNSV 407
>gi|115479237|ref|NP_001063212.1| Os09g0423500 [Oryza sativa Japonica Group]
gi|50725896|dbj|BAD33424.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|50726136|dbj|BAD33657.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|113631445|dbj|BAF25126.1| Os09g0423500 [Oryza sativa Japonica Group]
gi|215767614|dbj|BAG99842.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222641596|gb|EEE69728.1| hypothetical protein OsJ_29412 [Oryza sativa Japonica Group]
Length = 473
Score = 68.2 bits (165), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 88/293 (30%), Positives = 124/293 (42%), Gaps = 54/293 (18%)
Query: 84 KTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGL 142
+ P+T A RL +L ++ + G+ V +DT S+L W+ C C SC
Sbjct: 113 RVPVTSGA-----RLRTLNYVATVGLGGGEAT----VIVDTASELTWVQCAPCASC---- 159
Query: 143 NSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE-LQKQCPSAGS--------NCPYQVRYL 193
+ G + D P +S + + +PCNS+ C+ LQ SA +C Y + Y
Sbjct: 160 HDQQGPLFD-----PASSPSYAVLPCNSSSCDALQVATGSAAGACGGGEQPSCSYTLSY- 213
Query: 194 SDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKT 253
DG+ S G L D L LA + V FGCG G F +GL GLG +
Sbjct: 214 RDGSYSQGVLAHDKLSLAGE------VIDGFVFGCGTSNQGPF---GGTSGLMGLGRSQL 264
Query: 254 SVPSILANQGLIPNSFSMCF---GSDGTGRISFGDKGSPGQGETPF------SLRQTHPT 304
S+ S +Q FS C S+ +G + GD S + TP S P
Sbjct: 265 SLISQTMDQ--FGGVFSYCLPLKESESSGSLVLGDDTSVYRNSTPIVYTTMVSDPVQGPF 322
Query: 305 YNITITQVSVGGNAVNFEFSA---IFDSGTSFTYLNDPAYTQISETFNSLAKE 354
Y + +T +++GG V E SA I DSGT T L Y + F S E
Sbjct: 323 YFVNLTGITIGGQEV--ESSAGKVIVDSGTIITSLVPSVYNAVKAEFLSQFAE 373
>gi|297794789|ref|XP_002865279.1| hypothetical protein ARALYDRAFT_494467 [Arabidopsis lyrata subsp.
lyrata]
gi|297311114|gb|EFH41538.1| hypothetical protein ARALYDRAFT_494467 [Arabidopsis lyrata subsp.
lyrata]
Length = 419
Score = 68.2 bits (165), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 84/322 (26%), Positives = 131/322 (40%), Gaps = 70/322 (21%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-----DCVSCVHGLNSSSGQVIDFNIYSPNT 159
+ +++G P + V +DTGSDL W+PC DC+ C + S + +I+SP
Sbjct: 11 YLITLNIGTPPQAVQVYMDTGSDLTWVPCGNLSFDCIDCN---DLKSNNLKSSSIFSPLH 67
Query: 160 SSTSSKVPCNSTLC-ELQKQ------CPSAGSN------------CPYQVRYLSDGTMST 200
SS+S + C S+ C E+ C AG + CP +G + +
Sbjct: 68 SSSSFRASCASSFCAEIHSSDNPFDPCAIAGCSVSMLLKSTCIRPCPSFAYTYGEGGLVS 127
Query: 201 GFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILA 260
G L D+L T + R SFGC T ++ + P G+ G G S+PS L
Sbjct: 128 GILTRDILKARTRDV------PRFSFGC---VTSTYHE---PIGIAGFGRGLLSLPSQL- 174
Query: 261 NQGLIPNSFSMCF-------GSDGTGRISFGDKG-----SPGQGETPFSLRQTHP-TYNI 307
G + FS CF + + + G + TP +P +Y I
Sbjct: 175 --GFLEKGFSHCFLPFKFVNNPNISSPLILGASALSINLTDSLQFTPMLNTPVYPNSYYI 232
Query: 308 TITQVSVGGNAVNFEF-------------SAIFDSGTSFTYLNDPAYTQISETFNSLAKE 354
+ +++G N + + DSGT++T+L +P Y+Q+ S
Sbjct: 233 GLESITIGTNITPTQVPLTLRQFDSQGNGGMLVDSGTTYTHLPNPFYSQLLTILQSTITY 292
Query: 355 KRETST-SDLPFEYCY-VLSPN 374
R T T S F+ CY V PN
Sbjct: 293 PRATETESRTGFDLCYKVPCPN 314
>gi|242084336|ref|XP_002442593.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
gi|241943286|gb|EES16431.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
Length = 482
Score = 68.2 bits (165), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 83/322 (25%), Positives = 124/322 (38%), Gaps = 55/322 (17%)
Query: 105 HYTNVSVGQP-----ALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPN 158
+ ++VG P + +++ D GSD+ WL C C C H +Y+
Sbjct: 125 YIAKITVGTPYENDSSFEALLSPDMGSDVTWLQCMPCFRCYHQPGP---------VYNRL 175
Query: 159 TSSTSSKVPCNSTLCEL---QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEK 215
SS++S V C + C C + C Y+V Y DG+ S G + L +
Sbjct: 176 KSSSASDVGCYAPACRALGSSGGCVQFLNECQYKVEY-GDGSSSAGDFGVETLTFPPGVR 234
Query: 216 QSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
++ GCG G F AA G+ GLG S PS +A G SFS C
Sbjct: 235 VPG-----VAIGCGSDNQGLFPAPAA--GILGLGRGSLSFPSQIA--GRYGRSFSYCLAG 285
Query: 276 DGTG----RISFGDKGSPGQGETP-------FSLRQTHPTYNITITQVSVGGNAVNF--- 321
GTG ++FG S T + + + Y + + +SVGG V
Sbjct: 286 QGTGGRSSTLTFGSGASATTTTTTPPSFTPMLTNSRMYTFYYVGLVGISVGGVRVRGVTE 345
Query: 322 ----------EFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEY---C 368
I DSGT+ T L+ PAY + F A ++ + PF + C
Sbjct: 346 SDLRLDPSTGHGGVIVDSGTAVTRLSGPAYAAFRDAFRVAAVKELGWPSPGGPFAFFDTC 405
Query: 369 YVLSPNQTNFEYPVVNLTMKGG 390
Y + + P V++ GG
Sbjct: 406 YSSVRGRVMKKVPAVSMHFAGG 427
>gi|224029721|gb|ACN33936.1| unknown [Zea mays]
gi|413946782|gb|AFW79431.1| pepsin A [Zea mays]
Length = 534
Score = 68.2 bits (165), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 77/309 (24%), Positives = 119/309 (38%), Gaps = 57/309 (18%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDC--VSCVHGLNSSSGQVIDF--------- 152
++ +V +G PAL + + LDT +DL W+ C H S+GQ +
Sbjct: 124 MYLVSVRIGTPALPYNLVLDTATDLTWINCRLRRRKGKHYGRQSTGQTMSMGGEGAKEAS 183
Query: 153 -NIYSPNTSSTSSKVPCNSTLCELQK----QCPSAGSNCPYQVRYLSDGTMSTGFLVEDV 207
N Y P SS+ ++ C+ C + Q PS +C Y + DGT++ G ++
Sbjct: 184 KNWYRPAKSSSWRRIRCSQKECAVLPYNTCQSPSKAESCSY-FQKTQDGTVTIGIYGKEK 242
Query: 208 LHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPN 267
+ + + + I GC ++ G +D A +G+ LG S A +
Sbjct: 243 ATVTVSDGRMAKLPGLI-LGCSVLEAGGSVD--AHDGVLSLGNGDMSFAVHAAKR--FGQ 297
Query: 268 SFSMCFGS-----DGTGRISFGDKGS---PGQGETPFSLR-QTHPTYNITITQVSVGGNA 318
FS C S D + ++FG + PG ET P Y +T V VGG
Sbjct: 298 RFSFCLLSANSSRDASSYLTFGPNPAVMGPGTMETDILYNVDVKPAYGAQVTGVLVGGER 357
Query: 319 VNFEFS-----------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP--- 364
++ I D+ TS T L AY ++ + S LP
Sbjct: 358 LDIPDEVWDAERFVGGGVILDTSTSVTSLVPEAYAPVTAALDR--------HLSHLPRVY 409
Query: 365 ----FEYCY 369
FEYCY
Sbjct: 410 ELEGFEYCY 418
>gi|356557203|ref|XP_003546907.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 470
Score = 68.2 bits (165), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 87/335 (25%), Positives = 140/335 (41%), Gaps = 60/335 (17%)
Query: 36 HRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDT 95
R D V+ L +D L + ++ + R R +A + PLT
Sbjct: 68 ERKGDWVEKQLVLDGL-------HVRSIQNHIRK-RTSSSQIADSSETQVPLT-----SG 114
Query: 96 YRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNI 154
+ +L ++ V++G + + V +DTGSDL W+ C+ C SC + + +
Sbjct: 115 IKFQTLNYI----VTMGLGSQNMSVIVDTGSDLTWVQCEPCRSCYNQ---------NGPL 161
Query: 155 YSPNTSSTSSKVPCNSTLCELQK--QC---PSAGSNCPYQVRYLSDGTMSTGFLVEDVLH 209
+ P+TS + + CNST C+ + C PS + C Y V Y DG+ ++G L + L
Sbjct: 162 FKPSTSPSYQPILCNSTTCQSLELGACGSDPSTSATCDYVVNY-GDGSYTSGELGIEKLG 220
Query: 210 LATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSF 269
SV S FGCGR G F +GL GLG + S+ I F
Sbjct: 221 FG-----GISV-SNFVFGCGRNNKGLF---GGASGLMGLGRSELSM--ISQTNATFGGVF 269
Query: 270 SMCFGSD----GTGRISFGDKGSPGQGETPFSLRQTHPT------YNITITQVSVGGNAV 319
S C S +G + G++ + TP + + P Y + +T + VGG ++
Sbjct: 270 SYCLPSTDQAGASGSLVMGNQSGVFKNVTPIAYTRMLPNLQLSNFYILNLTGIDVGGVSL 329
Query: 320 NFEFSA------IFDSGTSFTYLNDPAYTQISETF 348
+ + S+ I DSGT + L Y + F
Sbjct: 330 HVQASSFGNGGVILDSGTVISRLAPSVYKALKAKF 364
>gi|357456413|ref|XP_003598487.1| Peptidase A1, putative [Medicago truncatula]
gi|355487535|gb|AES68738.1| Peptidase A1, putative [Medicago truncatula]
Length = 414
Score = 68.2 bits (165), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 94/342 (27%), Positives = 137/342 (40%), Gaps = 57/342 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ +G P + ++A+DT +D W+PC C C L ++P S+T
Sbjct: 78 YIVRAKIGTPPQTLLLAMDTSNDAAWIPCTACDGCASTL------------FAPEKSTTF 125
Query: 164 SKVPCNSTLCELQKQCPSAG---SNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
V C + C KQ P+ G S+C + + Y S + LV+D + LATD S
Sbjct: 126 KNVSCAAPEC---KQVPNPGCGVSSCNFNLTYGSSSIAAN--LVQDTITLATDPVPS--- 177
Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS----D 276
+FGC TG+ A P GL GLG S+ S Q L ++FS C S +
Sbjct: 178 ---YTFGCVSKTTGT---SAPPQGLLGLGRGPLSLLS--QTQNLYQSTFSYCLPSFKSLN 229
Query: 277 GTGRISFGDKGSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEFSA--------- 325
+G + G P + + L+ + Y + + + VG V+ +A
Sbjct: 230 FSGSLRLGPVAQPKRIKYTPLLKNPRRSSLYYVNLEAIRVGRKVVDIPPAALAFNPTTGA 289
Query: 326 --IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
IFDSGT FT L P Y + + F K T TS F+ CY P +
Sbjct: 290 GTIFDSGTVFTRLVAPVYVAVRDEFRRRVGPKL-TVTSLGGFDTCY-----NVPIVVPTI 343
Query: 384 NLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNII 425
G D I+I S+ L G DNVN +
Sbjct: 344 TFIFTGMNVTLPQDNILIHSTAGSTTCLAMAGA--PDNVNSV 383
>gi|196212952|gb|ACG76112.1| S5 [Oryza sativa Indica Group]
gi|338809989|gb|AEJ08560.1| S5 [Oryza barthii]
gi|340810883|gb|AEK75368.1| S5 [Oryza sativa]
gi|340810885|gb|AEK75369.1| S5 [Oryza sativa]
gi|340810889|gb|AEK75371.1| S5 [Oryza sativa]
gi|340810895|gb|AEK75374.1| S5 [Oryza sativa]
gi|340810897|gb|AEK75375.1| S5 [Oryza sativa]
gi|340810905|gb|AEK75379.1| S5 [Oryza sativa]
gi|340810909|gb|AEK75381.1| S5 [Oryza sativa]
gi|340810911|gb|AEK75382.1| S5 [Oryza sativa]
gi|340810913|gb|AEK75383.1| S5 [Oryza sativa]
gi|340810923|gb|AEK75388.1| S5 [Oryza sativa]
gi|340810925|gb|AEK75389.1| S5 [Oryza sativa]
gi|340810929|gb|AEK75391.1| S5 [Oryza sativa]
gi|340810935|gb|AEK75394.1| S5 [Oryza sativa]
gi|340810937|gb|AEK75395.1| S5 [Oryza sativa]
gi|340810939|gb|AEK75396.1| S5 [Oryza sativa]
gi|340810941|gb|AEK75397.1| S5 [Oryza sativa]
gi|340810943|gb|AEK75398.1| S5 [Oryza sativa]
gi|340810951|gb|AEK75402.1| S5 [Oryza sativa]
gi|340810953|gb|AEK75403.1| S5 [Oryza sativa]
gi|340810963|gb|AEK75408.1| S5 [Oryza sativa]
gi|340810965|gb|AEK75409.1| S5 [Oryza sativa]
gi|340810973|gb|AEK75413.1| S5 [Oryza nivara]
gi|340811003|gb|AEK75428.1| S5 [Oryza rufipogon]
gi|340811005|gb|AEK75429.1| S5 [Oryza rufipogon]
gi|340811009|gb|AEK75431.1| S5 [Oryza rufipogon]
gi|340811023|gb|AEK75438.1| S5 [Oryza rufipogon]
gi|340811025|gb|AEK75439.1| S5 [Oryza nivara]
gi|340811031|gb|AEK75442.1| S5 [Oryza rufipogon]
gi|340811033|gb|AEK75443.1| S5 [Oryza rufipogon]
gi|340811035|gb|AEK75444.1| S5 [Oryza nivara]
gi|340811039|gb|AEK75446.1| S5 [Oryza rufipogon]
gi|340811049|gb|AEK75451.1| S5 [Oryza nivara]
gi|340811053|gb|AEK75453.1| S5 [Oryza rufipogon]
gi|340811055|gb|AEK75454.1| S5 [Oryza nivara]
gi|340811057|gb|AEK75455.1| S5 [Oryza rufipogon]
gi|340811059|gb|AEK75456.1| S5 [Oryza rufipogon]
gi|340811061|gb|AEK75457.1| S5 [Oryza rufipogon]
gi|340811065|gb|AEK75459.1| S5 [Oryza nivara]
gi|340811067|gb|AEK75460.1| S5 [Oryza nivara]
gi|340811069|gb|AEK75461.1| S5 [Oryza nivara]
gi|340811071|gb|AEK75462.1| S5 [Oryza rufipogon]
gi|340811081|gb|AEK75467.1| S5 [Oryza nivara]
gi|340811083|gb|AEK75468.1| S5 [Oryza nivara]
gi|340811087|gb|AEK75470.1| S5 [Oryza nivara]
gi|340811092|gb|AEK75472.1| S5 [Oryza nivara]
gi|340811102|gb|AEK75477.1| S5 [Oryza rufipogon]
gi|340811106|gb|AEK75479.1| S5 [Oryza rufipogon]
gi|340811108|gb|AEK75480.1| S5 [Oryza rufipogon]
gi|340811110|gb|AEK75481.1| S5 [Oryza rufipogon]
gi|340811112|gb|AEK75482.1| S5 [Oryza rufipogon]
gi|340811118|gb|AEK75485.1| S5 [Oryza nivara]
gi|340811120|gb|AEK75486.1| S5 [Oryza rufipogon]
Length = 472
Score = 68.2 bits (165), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 74/263 (28%), Positives = 108/263 (41%), Gaps = 34/263 (12%)
Query: 103 FLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
FL VS+G+P + +VA+DTGS L W+ C C H ++ +G + D P S
Sbjct: 112 FLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFD-----PGRSY 166
Query: 162 TSSKVPCNSTLC-EL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
TS +V C+S C EL Q C +C Y V Y + S G +V D L +
Sbjct: 167 TSRRVRCSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDS 226
Query: 214 EKQSKSVDSRISFGCGR-VQTGSFLDGAAPNGLFGLGMDK--TSVPSILANQGLIPNSFS 270
+ FGC V+ F G G + P IL+ + +FS
Sbjct: 227 FMD-------LMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYK-----AFS 274
Query: 271 MCFGSDGT--GRISFG--DKGSPGQGETPFSLRQTHPTYNITITQ-VSVGGNAVNFEFSA 325
C +D T G + G D+ + G TP PTY++T+ ++ G V
Sbjct: 275 YCLPTDETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRLVTSSSEM 334
Query: 326 IFDSGTSFTYLNDPAYTQISETF 348
I DSG T L + + +T
Sbjct: 335 IVDSGAQRTSLWPSTFALLDKTI 357
>gi|340810907|gb|AEK75380.1| S5 [Oryza sativa]
Length = 472
Score = 67.8 bits (164), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 74/263 (28%), Positives = 108/263 (41%), Gaps = 34/263 (12%)
Query: 103 FLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
FL VS+G+P + +VA+DTGS L W+ C C H ++ +G + D P S
Sbjct: 112 FLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFD-----PGRSY 166
Query: 162 TSSKVPCNSTLC-EL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
TS +V C+S C EL Q C +C Y V Y + S G +V D L +
Sbjct: 167 TSRRVRCSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDS 226
Query: 214 EKQSKSVDSRISFGCGR-VQTGSFLDGAAPNGLFGLGMDK--TSVPSILANQGLIPNSFS 270
+ FGC V+ F G G + P IL+ + +FS
Sbjct: 227 FMD-------LMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYK-----AFS 274
Query: 271 MCFGSDGT--GRISFG--DKGSPGQGETPFSLRQTHPTYNITITQ-VSVGGNAVNFEFSA 325
C +D T G + G D+ + G TP PTY++T+ ++ G V
Sbjct: 275 YCLPTDETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRLVTSSSEM 334
Query: 326 IFDSGTSFTYLNDPAYTQISETF 348
I DSG T L + + +T
Sbjct: 335 IVDSGAQRTSLWPSTFALLDKTI 357
>gi|226530663|ref|NP_001146528.1| uncharacterized protein LOC100280120 [Zea mays]
gi|219887685|gb|ACL54217.1| unknown [Zea mays]
Length = 292
Score = 67.8 bits (164), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 47/177 (26%), Positives = 81/177 (45%), Gaps = 11/177 (6%)
Query: 201 GFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSIL 259
G V D + ++ + ++ D I FGCG Q G L+ +G+ GL S+P+ L
Sbjct: 2 GVYVRDSMQFVGEDGERENAD--IVFGCGYDQQGVLLNALETTDGVLGLTNKALSLPTQL 59
Query: 260 ANQGLIPNSFSMCFGSDGTGR---ISFGDKGSPGQGETPFSLRQ--THPTYNITITQVSV 314
A++G+I N+F C +D +G + GD P G T +R + Q++
Sbjct: 60 ASRGIISNAFGHCMSTDPSGAGGYLFLGDDYIPRWGMTWVPIRDGPADDVRRAQVKQINH 119
Query: 315 GGNAVNFE---FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYC 368
G +N + +FD+G+++TY D A T++ + A + SD +C
Sbjct: 120 GDQQLNAQGKLTQVVFDTGSTYTYFPDEALTRLISSLKEAASPRFVQDDSDKTLPFC 176
>gi|38344196|emb|CAE05761.2| OSJNBa0064G10.12 [Oryza sativa Japonica Group]
Length = 451
Score = 67.8 bits (164), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 75/285 (26%), Positives = 109/285 (38%), Gaps = 56/285 (19%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ V VG P + +D+GSD+ W+ C C C + ++ P SS+
Sbjct: 130 YFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDP---------LFDPAASSSF 180
Query: 164 SKVPCNSTLCE----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
S V C S +C C Y V Y DG+ + G L + L L Q
Sbjct: 181 SGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTY-GDGSYTKGELALETLTLGGTAVQG-- 237
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTG 279
++ GCG +G F+ A GL GLG S+ L G FS C S G G
Sbjct: 238 ----VAIGCGHRNSGLFVGAA---GLLGLGWGAMSLVGQLG--GAAGGVFSYCLASRGAG 288
Query: 280 RISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS-----------AIFD 328
G G S Y + +T + VGG + + S + D
Sbjct: 289 ----------GAGSLASSF------YYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVVMD 332
Query: 329 SGTSFTYLNDPAYTQISETFN-SLAKEKRETSTSDLPFEYCYVLS 372
+GT+ T L AY + F+ ++ R + S L + CY LS
Sbjct: 333 TGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLL--DTCYDLS 375
>gi|356527091|ref|XP_003532147.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 482
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 86/307 (28%), Positives = 128/307 (41%), Gaps = 47/307 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CV-SCVHGLNSSSGQVIDFNIYSPNTSST 162
+Y V +G P + DTGS L W C+ C SC + I+ P+ SS+
Sbjct: 140 YYVVVGLGTPKRDLSLIFDTGSYLTWTQCEPCAGSCYKQQDP---------IFDPSKSSS 190
Query: 163 SSKVPCNSTLCELQKQCPSAG------SNCPYQVRYLSDGTMSTGFLVEDVLHL-ATDEK 215
+ + C S+LC Q SAG ++C Y V+Y D ++S GFL ++ L + ATD
Sbjct: 191 YTNIKCTSSLC---TQFRSAGCSSSTDASCIYDVKY-GDNSISRGFLSQERLTITATD-- 244
Query: 216 QSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
+ FGCG+ G F A GL +G+ + + + + FS C S
Sbjct: 245 ----IVHDFLFGCGQDNEGLFRGTA---GL--MGLSRHPISFVQQTSSIYNKIFSYCLPS 295
Query: 276 --DGTGRISFGDKGSPGQG--ETPFS-LRQTHPTYNITITQVSVGGNAV----NFEFSA- 325
G ++FG + TPFS + + Y + I +SVGG + + FSA
Sbjct: 296 TPSSLGHLTFGASAATNANLKYTPFSTISGENSFYGLDIVGISVGGTKLPAVSSSTFSAG 355
Query: 326 --IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
I DSGT T L AY + F K + + CY S + P +
Sbjct: 356 GSIIDSGTVITRLPPTAYAALRSAFRQFMM-KYPVAYGTRLLDTCYDFSGYK-EISVPRI 413
Query: 384 NLTMKGG 390
+ GG
Sbjct: 414 DFEFAGG 420
>gi|147858841|emb|CAN78694.1| hypothetical protein VITISV_037475 [Vitis vinifera]
Length = 442
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 109/434 (25%), Positives = 173/434 (39%), Gaps = 74/434 (17%)
Query: 36 HRYSDPVKGILAVDDLPKKGSFAYYSALAH---RDRYFRLRGRGLAAQGNDKTPLTFSAG 92
H S P K + K S A +AL R Y R R + A Q D P
Sbjct: 38 HSPSSPYKNV-------KAESLAKDTALESTLSRHAYLRARQQK-ALQPADFVPPPLIRD 89
Query: 93 NDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVID 151
+ N+S+G P + V LDTGSDLFW+ C+ C C +
Sbjct: 90 KSAF---------LANLSIGNPPTNVYVVLDTGSDLFWIQCEPCDVCYKQKDP------- 133
Query: 152 FNIYSPNTSSTSSKVPCNSTLCE---LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVL 208
IY+ S + +++ CN C + QC +GS C YQ Y +DG ++G L + +
Sbjct: 134 --IYNRTKSDSYTEMLCNEPPCVSLGREGQCSDSGS-CLYQTAY-ADGARTSGLLSYEKV 189
Query: 209 HLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNS 268
T + +++ FGCG +Q +F+ G+ GLG S+ S L+ G + S
Sbjct: 190 AF-TSHYSDEDKTAQVGFGCG-LQNLNFITSNRDGGVLGLGPGLVSLVSQLSAIGKVSKS 247
Query: 269 FSMCFGS----DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGG-------N 317
F+ CFG+ + G + FGD TP + + Y + + + +G N
Sbjct: 248 FAYCFGNISNPNAGGFLVFGDATYLNGDMTPMVIAE---FYYVNLLGIGLGVGEPRLDIN 304
Query: 318 AVNFEFS------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRE----TSTSDLPFEY 367
+ +FE I DSG++ + Y + K+ TS+ D
Sbjct: 305 SSSFERKPDGSGGVIIDSGSTLSVFPPEVYEVVRNAVVDKLKKGYNISPLTSSPD----- 359
Query: 368 CYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG- 426
C+ + +P + L ++ G +ND I L+CLG + ++IIG
Sbjct: 360 CFEGKIERDLPLFPTLVLYLESTG--ILNDRWSIFLQRYDE--LFCLGFTSGEGLSIIGT 415
Query: 427 ---REYPIANNISL 437
+ Y N+ L
Sbjct: 416 LAQQSYKFGYNLEL 429
>gi|357123876|ref|XP_003563633.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 503
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 103/364 (28%), Positives = 151/364 (41%), Gaps = 55/364 (15%)
Query: 100 SLGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYS 156
SLG +Y + +G P F V DTGSD W+ C VSC + ++
Sbjct: 157 SLGTANYVVPIGLGTPPSRFTVVFDTGSDTTWVQCRPCVVSCYKQKD---------RLFD 207
Query: 157 PNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
P SST + V C C +C Y ++Y DG+ + GF +D L +A D +
Sbjct: 208 PAKSSTYANVSCADPACADLDASGCNAGHCLYGIQY-GDGSYTVGFFAKDTLAVAQDAIK 266
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--G 274
FGCG G F A GL GLG TS+ ++ A + SFS C
Sbjct: 267 G------FKFGCGEKNRGLFGQTA---GLLGLGRGPTSI-TVQAYE-KYGGSFSYCLPAS 315
Query: 275 SDGTGRISF---GDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSAIF--- 327
S TG + F S +T L PT Y + +T + VGG + ++F
Sbjct: 316 SAATGYLEFGPLSPSSSGSNAKTTPMLTDKGPTFYYVGLTGIRVGGKQLGAIPESVFSNS 375
Query: 328 ----DSGTSFTYLNDPAYTQISETFNSLAKE---KRETSTSDLPFEYCYVLSPNQTNFEY 380
DSGT T L D AY +S F + K+ + S L + CY + +
Sbjct: 376 GTLVDSGTVITRLPDTAYAALSSAFAAAMAASGYKKAAAYSIL--DTCYDFT-GLSQVSL 432
Query: 381 PVVNLTMKGGGPFFVN-DPIVIVSSEPKGLYLYCLGVVKS---DNVNIIG----REYPIA 432
P V+L +GG ++ IV S+ + CLG + ++V I+G R Y +
Sbjct: 433 PTVSLVFQGGACLDLDASGIVYAISQSQ----VCLGFASNGDDESVGIVGNTQQRTYGVL 488
Query: 433 NNIS 436
++S
Sbjct: 489 YDVS 492
>gi|340810931|gb|AEK75392.1| S5 [Oryza sativa]
gi|340810983|gb|AEK75418.1| S5 [Oryza nivara]
gi|340810985|gb|AEK75419.1| S5 [Oryza nivara]
gi|340810997|gb|AEK75425.1| S5 [Oryza nivara]
gi|340811011|gb|AEK75432.1| S5 [Oryza nivara]
gi|340811013|gb|AEK75433.1| S5 [Oryza nivara]
gi|340811041|gb|AEK75447.1| S5 [Oryza nivara]
gi|340811043|gb|AEK75448.1| S5 [Oryza nivara]
Length = 474
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 74/263 (28%), Positives = 108/263 (41%), Gaps = 34/263 (12%)
Query: 103 FLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
FL VS+G+P + +VA+DTGS L W+ C C H ++ +G + D P S
Sbjct: 114 FLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFD-----PGRSY 168
Query: 162 TSSKVPCNSTLC-EL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
TS +V C+S C EL Q C +C Y V Y + S G +V D L +
Sbjct: 169 TSRRVRCSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDS 228
Query: 214 EKQSKSVDSRISFGCGR-VQTGSFLDGAAPNGLFGLGMDK--TSVPSILANQGLIPNSFS 270
+ FGC V+ F G G + P IL+ + +FS
Sbjct: 229 FMD-------LMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYK-----AFS 276
Query: 271 MCFGSDGT--GRISFG--DKGSPGQGETPFSLRQTHPTYNITITQ-VSVGGNAVNFEFSA 325
C +D T G + G D+ + G TP PTY++T+ ++ G V
Sbjct: 277 YCLPTDETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRLVTSSSEM 336
Query: 326 IFDSGTSFTYLNDPAYTQISETF 348
I DSG T L + + +T
Sbjct: 337 IVDSGAQRTSLWPSTFALLDKTI 359
>gi|413952262|gb|AFW84911.1| hypothetical protein ZEAMMB73_904583 [Zea mays]
Length = 312
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 67/251 (26%), Positives = 108/251 (43%), Gaps = 39/251 (15%)
Query: 215 KQSKSVDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
+Q+ + + I FGC Q+G A +G+FG G + SV S L + G+ P FS C
Sbjct: 10 EQTANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCL 69
Query: 274 -GSD-GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS------- 324
GSD G G + G+ PG TP L + P YN+ + ++V G + + S
Sbjct: 70 KGSDNGGGILVLGEIVEPGLVYTP--LVPSQPHYNLNLESIAVNGQKLPIDSSLFTTSNT 127
Query: 325 --AIFDSGTSFTYLNDPAY--------TQISETFNSLAKEKRETSTSDLPFEYCYVLSPN 374
I DSGT+ YL D AY +S + SL + + C++ S +
Sbjct: 128 QGTIVDSGTTLAYLADGAYDPFVSAIAAAVSPSVRSLVSKGSQ----------CFITS-S 176
Query: 375 QTNFEYPVVNLTMKGGGPFFVN-DPIVIVSSEPKGLYLYCLGVVKSDNVNIIGREYPIAN 433
+ +P V L GG V + ++ + L+C+G ++ G+E I
Sbjct: 177 SVDSSFPTVTLYFMGGVAMSVKPENYLLQQASVDNSVLWCIGWQRNQ-----GQEITILG 231
Query: 434 NISLFHNCYSY 444
++ L + Y
Sbjct: 232 DLVLKDKIFVY 242
>gi|449453872|ref|XP_004144680.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
CELL 1-like [Cucumis sativus]
Length = 757
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 89/308 (28%), Positives = 123/308 (39%), Gaps = 49/308 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
++ +V +G P F + LDTGSDL W+ CV C + Y P S +
Sbjct: 196 YFIDVFIGSPPKHFSLILDTGSDLNWI--QCVPCFDCFEQNGP------YYDPKDSISFR 247
Query: 165 KVPCNSTLCEL------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
+ CN C+L + C +CPY Y + F +E T K
Sbjct: 248 NITCNDPRCQLVSSPDPPRPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSTTGK 307
Query: 219 SVDSRIS---FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-- 273
S R+ FGCG G F GL GLG S S L Q L +SFS C
Sbjct: 308 SEFRRVENVMFGCGHWNRGLF---HGAAGLLGLGRGPLSFSSQL--QSLYGHSFSYCLVD 362
Query: 274 ---GSDGTGRISFGDKGS----PGQGETPFSLRQTHPT---YNITITQVSVGGNAV---- 319
+ + ++ FG+ P T + +P Y + I + VGG +
Sbjct: 363 RDSDTSVSSKLIFGEDKDLLTHPELNFTSLIAGKENPVDTFYYLQIKSIFVGGEKLQIPE 422
Query: 320 -NFEFSA------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEY-CYVL 371
N+ SA I DSGT+ +Y +DPAY I E F L K K D P + CY +
Sbjct: 423 ENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAF--LRKVKGYKLVEDFPILHPCYNV 480
Query: 372 S-PNQTNF 378
S ++ NF
Sbjct: 481 SGTDELNF 488
>gi|449519146|ref|XP_004166596.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 752
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 89/308 (28%), Positives = 123/308 (39%), Gaps = 49/308 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
++ +V +G P F + LDTGSDL W+ CV C + Y P S +
Sbjct: 196 YFIDVFIGSPPKHFSLILDTGSDLNWI--QCVPCFDCFEQNGP------YYDPKDSISFR 247
Query: 165 KVPCNSTLCEL------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
+ CN C+L + C +CPY Y + F +E T K
Sbjct: 248 NITCNDPRCQLVSSPDPPRPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSTTGK 307
Query: 219 SVDSRIS---FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-- 273
S R+ FGCG G F GL GLG S S L Q L +SFS C
Sbjct: 308 SEFRRVENVMFGCGHWNRGLF---HGAAGLLGLGRGPLSFSSQL--QSLYGHSFSYCLVD 362
Query: 274 ---GSDGTGRISFGDKGS----PGQGETPFSLRQTHPT---YNITITQVSVGGNAV---- 319
+ + ++ FG+ P T + +P Y + I + VGG +
Sbjct: 363 RDSDTSVSSKLIFGEDKDLLTHPELNFTSLIAGKENPVDTFYYLQIKSIFVGGEKLQIPE 422
Query: 320 -NFEFSA------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEY-CYVL 371
N+ SA I DSGT+ +Y +DPAY I E F L K K D P + CY +
Sbjct: 423 ENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAF--LRKVKGYKLVEDFPILHPCYNV 480
Query: 372 S-PNQTNF 378
S ++ NF
Sbjct: 481 SGTDELNF 488
>gi|414589628|tpg|DAA40199.1| TPA: hypothetical protein ZEAMMB73_627989 [Zea mays]
Length = 452
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 75/276 (27%), Positives = 111/276 (40%), Gaps = 48/276 (17%)
Query: 102 GFLHYT-NVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNT 159
G L Y ++++G P LDTGSDL W C C SC+ + +++P
Sbjct: 92 GDLEYVVDLAIGTPPQPVSALLDTGSDLIWTQCAPCASCLSQPDP---------LFAPGQ 142
Query: 160 SSTSSKVPCNSTLCE--LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
S++ + C TLC L C C Y+ Y DGTM+ G + A+
Sbjct: 143 SASYEPMRCAGTLCSDILHHSCERP-DTCTYRYNY-GDGTMTVGVYATERFTFASSGGGG 200
Query: 218 KSVDS-RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD 276
+ + + FGCG V GS +G +G+ G G + S+ S L+ + FS C S
Sbjct: 201 LTTTTVPLGFGCGSVNVGSLNNG---SGIVGFGRNPLSLVSQLSIR-----RFSYCLTSY 252
Query: 277 GTGRIS-----------FGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFS 324
+ R S +GD Q TP +PT Y + T ++VG + S
Sbjct: 253 ASRRQSTLLFGSLSDGVYGDATGRVQ-TTPLLQSPQNPTFYYVHFTGLTVGARRLRIPES 311
Query: 325 A-----------IFDSGTSFTYLNDPAYTQISETFN 349
A I DSGT+ T L ++ F
Sbjct: 312 AFALRPDGSGGVIVDSGTALTLLPAAVLAEVVRAFR 347
>gi|297820902|ref|XP_002878334.1| hypothetical protein ARALYDRAFT_907565 [Arabidopsis lyrata subsp.
lyrata]
gi|297324172|gb|EFH54593.1| hypothetical protein ARALYDRAFT_907565 [Arabidopsis lyrata subsp.
lyrata]
Length = 362
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 56/190 (29%), Positives = 84/190 (44%), Gaps = 32/190 (16%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHG---LNSSSGQVI--------DFNI 154
T + +G P F + +D+GS + ++PC DC C L+S Q++ F I
Sbjct: 94 TRLWIGTPPQMFALIVDSGSTVTYVPCSDCEQCGKHQVMLSSPKDQILCLVSCKVQIFKI 153
Query: 155 ----------YSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLV 204
+ P SST V CN + C C Y+ Y ++ + S G L
Sbjct: 154 SYGLFDEDPKFQPELSSTYQPVKCN-----MDCNCDDDKEQCVYEREY-AEHSSSKGVLG 207
Query: 205 EDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGL 264
ED++ +S R FGC V+TG A +G+ GLG S+ L ++GL
Sbjct: 208 EDLISFGN---ESHLTPQRAVFGCKTVETGDLYSQRA-DGIIGLGQGDLSLVGQLVDKGL 263
Query: 265 IPNSFSMCFG 274
I NSF +C+G
Sbjct: 264 ISNSFGLCYG 273
>gi|224074591|ref|XP_002304395.1| predicted protein [Populus trichocarpa]
gi|222841827|gb|EEE79374.1| predicted protein [Populus trichocarpa]
Length = 443
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 70/290 (24%), Positives = 116/290 (40%), Gaps = 43/290 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ +S+G P + IV DTGSDL W+ C C C + ++ P+ SS+
Sbjct: 94 YFMKMSIGTPLVEVIVIADTGSDLTWVQCLPCDPCYRQKSP---------LFDPSRSSSY 144
Query: 164 SKVPCNSTLCEL----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
+ C S C ++ C + C Y Y D + + G L + + + +
Sbjct: 145 RHMLCGSRFCNALDVSEQACTMDTNICEYHYSY-GDKSYTNGNLATEKFTIGSTSSRPVH 203
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQ--GLIPNSFSMCF---- 273
+ S I FGCG G+F + L + L +Q +I FS C
Sbjct: 204 L-SPIVFGCGTGNGGTF------DELGSGIVGLGGGALSLVSQLSSIIKGKFSYCLVPLS 256
Query: 274 -GSDGTGRISFGDK---GSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-------- 321
S+ T +I FG P TP +Q Y +T+ +SVG + +
Sbjct: 257 EQSNVTSKIKFGTDSVISGPQVVSTPLVSKQPDTYYYVTLEAISVGNKRLPYTNGLLNGN 316
Query: 322 --EFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
+ + I DSGT+ T+L+ +T++ K +R + L F C+
Sbjct: 317 VEKGNVIIDSGTTLTFLDSEFFTELERVLEETVKAERVSDPRGL-FSVCF 365
>gi|115476830|ref|NP_001062011.1| Os08g0469100 [Oryza sativa Japonica Group]
gi|42407408|dbj|BAD09566.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113623980|dbj|BAF23925.1| Os08g0469100 [Oryza sativa Japonica Group]
Length = 373
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 91/360 (25%), Positives = 144/360 (40%), Gaps = 68/360 (18%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFN---IYSPNTSS 161
H V + QP + DTGSDL W C L+SS+ +Y P SS
Sbjct: 16 HSLTVGIVQPRKLIV---DTGSDLIWTQCK-------LSSSTAAAARHGSPPVYDPGESS 65
Query: 162 TSSKVPCNSTLCELQ----KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
T + +PC+ LC+ K C S + C Y+ Y S + G L +
Sbjct: 66 TFAFLPCSDRLCQEGQFSFKNCTSK-NRCVYEDVYGS--AAAVGVLASETFTFGA----R 118
Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FG 274
++V R+ FGCG + GS + G+ GL + S+ + L Q FS C F
Sbjct: 119 RAVSLRLGFGCGALSAGSLIGA---TGILGLSPESLSLITQLKIQ-----RFSYCLTPFA 170
Query: 275 SDGTGRISFGDKGSPGQGETPFSLRQT----HPT----YNITITQVSVGGNAVNFEFSA- 325
T + FG + +T ++ T +P Y + + +S+G + ++
Sbjct: 171 DKKTSPLLFGAMADLSRHKTTRPIQTTAIVSNPVETVYYYVPLVGISLGHKRLAVPAASL 230
Query: 326 ----------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQ 375
I DSG++ YL + A+ + E + + T + +E C+VL P +
Sbjct: 231 AMRPDGGGGTIVDSGSTVAYLVEAAFEAVKEAVMDVVRLPVANRTVE-DYELCFVL-PRR 288
Query: 376 T------NFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDN---VNIIG 426
T + P + L GG + P EP+ L CL V K+ + V+IIG
Sbjct: 289 TAAAAMEAVQVPPLVLHFDGGAAMVL--PRDNYFQEPRA-GLMCLAVGKTTDGSGVSIIG 345
>gi|413944387|gb|AFW77036.1| hypothetical protein ZEAMMB73_461996 [Zea mays]
Length = 472
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 98/357 (27%), Positives = 142/357 (39%), Gaps = 51/357 (14%)
Query: 65 HRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDT 124
H R + GR + P + A D+ + + +G PA+ V +DT
Sbjct: 95 HITRKAKASGR-TTTLSDVSIPTSLGAAVDSLE-------YVVTLGIGTPAVQQTVLIDT 146
Query: 125 GSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE------LQKQ 178
GSDL W+ C C NSSS +Y P SST + VPC+S C+
Sbjct: 147 GSDLSWV--QCKPC----NSSSCYPQKDPLYDPTASSTYAPVPCDSKACKDLVPDAYDHG 200
Query: 179 C--PSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSF 236
C S S C Y + Y + T + G + L L+ Q D FGCG VQ G+F
Sbjct: 201 CTNSSGTSLCQYGIEYGNRDT-TVGVYSTETLTLS---PQVSVKD--FGFGCGLVQQGTF 254
Query: 237 LDGAAPNGLFGLGMDKTSVPSILANQGLIP--NSFSMCF--GSDGTGRISFG----DKGS 288
+ P L +Q +FS C G+ TG ++ G + +
Sbjct: 255 DLFDG-------LLGLGGAPESLVSQTAETYGGAFSYCLPPGNSTTGFLALGAPTNNNDT 307
Query: 289 PGQGETPF-SLRQTHPTYNITITQVSVGGNAVNFEFSA-----IFDSGTSFTYLNDPAYT 342
G TP SL + Y + +T VSVGG ++ + I DSGT T L D AY+
Sbjct: 308 AGFLFTPLHSLPEQATFYLVNLTGVSVGGKPLDIPPTVLSGGMIIDSGTIITGLPDTAYS 367
Query: 343 QISETF-NSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDP 398
+ F +++ +D + CY + N P V LT GG ++ P
Sbjct: 368 ALRTAFRTAMSAYPLLPPNNDDVLDTCYNFT-GIANVTVPTVALTFDGGATIDLDVP 423
>gi|357515189|ref|XP_003627883.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521905|gb|AET02359.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 89/330 (26%), Positives = 127/330 (38%), Gaps = 50/330 (15%)
Query: 112 GQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNS 170
G P + ++ALDT SD W+PC CV C S+S ++P S++ V C S
Sbjct: 104 GTPPQTLLLALDTSSDAAWIPCSGCVGC-----STSKP------FAPIKSTSFRNVSCGS 152
Query: 171 TLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGR 230
C+ GS C + Y S ++ +V+D L LATD +FGC
Sbjct: 153 PHCKQVPNPTCGGSACAFNFTYGSSSIAAS--VVQDTLTLATDPIPG------YTFGCVN 204
Query: 231 VQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS----DGTGRISFGDK 286
TGS +AP + +Q L ++FS C S + +G + G
Sbjct: 205 KTTGS----SAPQQGLLGLGRGPLS-LLSQSQNLYKSTFSYCLPSFKSINFSGSLRLGPV 259
Query: 287 GSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEFSA-----------IFDSGTSF 333
P + + LR + Y + + + VG V+ +A IFDSGT F
Sbjct: 260 YQPKRIKYTPLLRNPRRSSLYYVNLVAIKVGRKIVDIPPAALAFNPTTGAGTIFDSGTVF 319
Query: 334 TYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPF 393
T L +P YT + F K +T F+ CY P + G
Sbjct: 320 TRLAEPVYTAVRNEFRRRVGPKLPVTTLG-GFDTCY-----NVPIVVPTITFLFSGMNVT 373
Query: 394 FVNDPIVIVSSEPKGLYLYCLGVVKSDNVN 423
D IVI S+ L G DNVN
Sbjct: 374 LPPDNIVIHSTAGSTTCLAMAGA--PDNVN 401
>gi|225464832|ref|XP_002272243.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 467
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 87/329 (26%), Positives = 132/329 (40%), Gaps = 71/329 (21%)
Query: 109 VSVGQPALSFIVALDTGSDLFWLPCD----CVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+S G P + + +DTGSDL W PC C +C ++ S NI+ P +SS+S
Sbjct: 94 LSFGTPPQTLPLIMDTGSDLVWFPCTHRYVCRNCSFSTSNPSS-----NIFIPKSSSSSK 148
Query: 165 KVPCNSTLC------ELQKQC-------PSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLA 211
+ C + C ++Q +C P+ CP + + G ++ G ++ + L L
Sbjct: 149 VLGCVNPKCGWIHGSKVQSRCRDCEPTSPNCTQICPPYLVFYGSG-ITGGIMLSETLDLP 207
Query: 212 TDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSM 271
K V + I GC S L + P G+ G G S+PS L GL FS
Sbjct: 208 -----GKGVPNFI-VGC------SVLSTSQPAGISGFGRGPPSLPSQL---GL--KKFSY 250
Query: 272 CFGS----DGTGRISFGDKGSPGQGE-------TPF----SLRQTHP---TYNITITQVS 313
C S D T S G GE TPF + H Y + + ++
Sbjct: 251 CLLSRRYDDTTESSSLVLDGESDSGEKTAGLSYTPFVQNPKVAGKHAFSVYYYLGLRHIT 310
Query: 314 VGGNAVNFEFS-----------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSD 362
VGG V + I DSGT+FTY+ + ++ F + KR T
Sbjct: 311 VGGKHVKIPYKYLIPGADGDGGTIIDSGTTFTYMKGEIFELVAAEFEKQVQSKRATEVEG 370
Query: 363 LP-FEYCYVLSPNQTNFEYPVVNLTMKGG 390
+ C+ +S T +P + L +GG
Sbjct: 371 ITGLRPCFNISGLNTP-SFPELTLKFRGG 398
>gi|302806531|ref|XP_002985015.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
gi|300147225|gb|EFJ13890.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
Length = 533
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 96/357 (26%), Positives = 146/357 (40%), Gaps = 57/357 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ +V VG P F++ +DTGSDL WL C C +C SG V D P+ S++
Sbjct: 171 YFMDVFVGNPPRHFLLIIDTGSDLTWLQCKPCKACF----DQSGPVFD-----PSQSTSF 221
Query: 164 SKVPCNSTLCEL--QKQCPSAGSNC-PYQVRYL---SDGTMSTGFLVEDVLHLATDEKQS 217
+PCN+ C+L +C S P +Y D + ++G L + L ++ + S
Sbjct: 222 KIIPCNAAACDLVVHDECRDNSSKTSPKTCKYFYWYGDSSRTSGDLALESLSVSLSDHPS 281
Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDG 277
+ GCG G F L GLG S PS L + I SFS C D
Sbjct: 282 SLEIRDMVIGCGHSNKGLFQGAGG---LLGLGQGALSFPSQLRSSP-IGQSFSYCL-VDR 336
Query: 278 TGRISFGDKGSPGQG-----------ETPFSLRQTHPTYN--------ITITQVSVGGNA 318
T +S S G G TPF +R + I I Q + A
Sbjct: 337 TNNLSVSSAISFGAGFALSRHFDQMRFTPF-VRTNNSVETFYYLGIQGIKIDQELLPIPA 395
Query: 319 VNFEFS------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFE---YCY 369
F + I DSGT+ TYLN AY + F + R PF+ CY
Sbjct: 396 ERFAIAPNGSGGTIIDSGTTLTYLNRDAYRAVESAFLARISYPRAD-----PFDILGICY 450
Query: 370 VLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
+ +T +P +++ + G + + +P+ +CL ++ +D ++IIG
Sbjct: 451 NAT-GRTAVPFPTLSIVFQNGAELDLPQENYFIQPDPQEAK-HCLAILPTDGMSIIG 505
>gi|302853254|ref|XP_002958143.1| hypothetical protein VOLCADRAFT_99354 [Volvox carteri f.
nagariensis]
gi|300256504|gb|EFJ40768.1| hypothetical protein VOLCADRAFT_99354 [Volvox carteri f.
nagariensis]
Length = 475
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 59/205 (28%), Positives = 94/205 (45%), Gaps = 21/205 (10%)
Query: 188 YQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFG 247
Y R ++ + S G++VED D+ R+ FGC +TG A +G+ G
Sbjct: 8 YYSRTYAERSSSEGWMVEDAFGFPDDQPPV-----RMVFGCENGETGEIYRQLA-DGIMG 61
Query: 248 LGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFS--LRQTH-PT 304
+G + + S L +G+I + FS+CFG G + GD P T ++ L H
Sbjct: 62 MGNNHNAFQSQLVARGVIEDVFSLCFGYPKDGILLLGDVPMPKGANTVYTPLLNNLHLHY 121
Query: 305 YNITITQVSVGG-----NAVNFE--FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRE 357
YN+ + ++V G NA F + + DSGT+FTYL A+ ++ S A
Sbjct: 122 YNVRMDGIAVNGVELSLNARIFTRGYGVVLDSGTTFTYLPTEAFNAMAAAIGSYALSHGL 181
Query: 358 TSTSDLPFEY---CYVLSPNQTNFE 379
ST +Y C+ +P+ NF+
Sbjct: 182 QSTPGADPQYNDICWKGAPD--NFQ 204
>gi|357118076|ref|XP_003560785.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 477
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 86/313 (27%), Positives = 127/313 (40%), Gaps = 45/313 (14%)
Query: 98 LNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVS--CVHGLNSSSGQVIDFNIY 155
L++L F+ V G PA + + LDTGSDL W+ C S C + DF+
Sbjct: 132 LDTLEFV--VVVGFGTPAQTAAIILDTGSDLSWIQCKPCSGHCYRQHDP------DFD-- 181
Query: 156 SPNTSSTSSKVPCNSTLCELQ-KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
P SS+ + VPC + +C C G+ C Y V+Y DG+ +TG L D L +
Sbjct: 182 -PAKSSSYAAVPCGTPVCAAAGGMC--NGTTCLYGVQY-GDGSSTTGVLSRDTLTFNSSS 237
Query: 215 KQSKSVDSRISFGCGRVQTGSF--LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC 272
K + +FGCG G F +DG G L + + PS FS C
Sbjct: 238 KFTG-----FTFGCGEKNIGDFGEVDGLLGLGRGKLSLPSQAAPSFG-------GVFSYC 285
Query: 273 FGSDGT--GRISFGDKGSPGQGETPFSLRQTHPTYN----ITITQVSVGG------NAVN 320
S T G ++ G ++ P Y I + +++GG +V
Sbjct: 286 LPSYNTTPGYLNIGATKPTSTVPVQYTAMIKKPQYPSFYFIELVSINIGGYILPVPPSVF 345
Query: 321 FEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEY 380
+ + DSGT TYL PAYT + + F + + + P + CY + Q
Sbjct: 346 TKTGTLLDSGTILTYLPPPAYTSLRDRFKFTMQGNKPAPPYE-PLDTCYDFT-GQGAIVI 403
Query: 381 PVVNLTMKGGGPF 393
P V+ G F
Sbjct: 404 PAVSFNFSDGAVF 416
>gi|293332561|ref|NP_001170100.1| uncharacterized protein LOC100384018 precursor [Zea mays]
gi|224033441|gb|ACN35796.1| unknown [Zea mays]
gi|413944035|gb|AFW76684.1| hypothetical protein ZEAMMB73_746438 [Zea mays]
Length = 456
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 71/269 (26%), Positives = 107/269 (39%), Gaps = 42/269 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ +++VG P + LDTGSDL W C C C + P SST
Sbjct: 86 YLVHLAVGTPPRPVALTLDTGSDLVWTQCAPCRDCFDQ---------GIPLLDPAASSTY 136
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ----SKS 219
+ +PC + C G +C Y Y D +++ G + D + ++ S
Sbjct: 137 AALPCGAPRCRALPFTSCGGRSCVYVYHY-GDKSVTVGKIATDRFTFGDNGRRNGDGSLP 195
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS---D 276
R++FGCG G F G+ G G + S+PS L SFS CF S
Sbjct: 196 ATRRLTFGCGHFNKGVFQSNE--TGIAGFGRGRWSLPSQLNA-----TSFSYCFTSMFDS 248
Query: 277 GTGRISFGDKGSPG-------QGE---TPFSLRQTHPT-YNITITQVSVGGNAVNFE--- 322
+ ++ G G+P GE TP + P+ Y +++ +SVG +
Sbjct: 249 KSSIVTLG--GAPAALYSHAHSGEVRTTPLFKNPSQPSLYFLSLKGISVGKTRLPVPETK 306
Query: 323 -FSAIFDSGTSFTYLNDPAYTQISETFNS 350
S I DSG S T L + Y + F +
Sbjct: 307 FRSTIIDSGASITTLPEEVYEAVKAEFAA 335
>gi|413952718|gb|AFW85367.1| hypothetical protein ZEAMMB73_231535 [Zea mays]
Length = 443
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 83/308 (26%), Positives = 122/308 (39%), Gaps = 53/308 (17%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ ++VG P + LDTGSDL W C C C D + P SST
Sbjct: 84 YLVRLAVGTPRRPVALTLDTGSDLVWTQCAPCRDCFDQ---------DLPVLDPAASSTY 134
Query: 164 SKVPCNSTLCELQK------QCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
+ +PC + C + +C Y Y D +++ G + D
Sbjct: 135 AALPCGAARCRALPFTSCGVRTLGNHRSCIYAYHY-GDKSLTVGEIATDRFTFGDSGGSG 193
Query: 218 KSVDS-RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS- 275
+S+ + R++FGCG + G F G+ G G + S+PS L SFS CF S
Sbjct: 194 ESLHTRRLTFGCGHLNKGVFQSNE--TGIAGFGRGRWSLPSQLNV-----TSFSYCFTSM 246
Query: 276 --DGTGRISFGDKGSPG-------QGE---TPFSLRQTHPT-YNITITQVSVGGNAV--- 319
+ ++ G GSP GE TP + P+ Y +++ +SVG +
Sbjct: 247 FESKSSLVTLG--GSPAALYSHAHSGEVRTTPILKNPSQPSLYFLSLKGISVGKTRLPVP 304
Query: 320 NFEF-SAIFDSGTSFTYLNDPAYTQISETFNS---LAKEKRETSTSDLPFEYCYVLSPNQ 375
+F S I DSG S T L + Y + F + L E S DL C+ L P
Sbjct: 305 ETKFRSTIIDSGASITTLPEEVYEAVKAEFAAQVGLPPSGVEGSALDL----CFAL-PVT 359
Query: 376 TNFEYPVV 383
+ P V
Sbjct: 360 ALWRRPAV 367
>gi|242092368|ref|XP_002436674.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
gi|241914897|gb|EER88041.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
Length = 461
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 79/318 (24%), Positives = 119/318 (37%), Gaps = 54/318 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ +++VG P + LDTGSDL W C C C H + P SST
Sbjct: 92 YLVHLAVGTPPRPVALTLDTGSDLVWTQCAPCRDCFHQ---------GLPLLDPAASSTY 142
Query: 164 SKVPCNSTLCELQ--KQCPSAG--------SNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
+ +PC + C C G +C Y Y D +++ G + D D
Sbjct: 143 AALPCGAPRCRALPFTSCGGGGRSSWGNGNRSCAYIYHY-GDKSVTVGEIATDRFTFGGD 201
Query: 214 --EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSM 271
+ S+ R++FGCG G F G+ G G + S+PS L +FS
Sbjct: 202 NGDGDSRLPTRRLTFGCGHFNKGVFQSNE--TGIAGFGRGRWSLPSQLNV-----TTFSY 254
Query: 272 CFGS---DGTGRISFGDKGSPG-----------QGE---TPFSLRQTHPT-YNITITQVS 313
CF S + ++ G G+P GE TP + P+ Y +++ +S
Sbjct: 255 CFTSMFESKSSLVTLG--GAPAAALLYSHAAHISGEVRTTPLLKNPSQPSLYFLSLKGIS 312
Query: 314 VGGNAVNFE----FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
VG + S I DSG S T L + Y + F + + C+
Sbjct: 313 VGKTRLAVPEAKLRSTIIDSGASITTLPEAVYEAVKAEFAAQVGLPPTGVVEGSALDLCF 372
Query: 370 VLSPNQTNFEYPVVNLTM 387
L PV +LT+
Sbjct: 373 ALPVTALWRRPPVPSLTL 390
>gi|449437856|ref|XP_004136706.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 457
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 101/381 (26%), Positives = 149/381 (39%), Gaps = 76/381 (19%)
Query: 84 KTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD----CVSCV 139
KTP + S +S G + T +S G P + + DTGS L W PC C C
Sbjct: 61 KTPKSNSVFKSPLSPHSYG-AYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECS 119
Query: 140 HGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC------ELQKQCPSAG-------SNC 186
+G + P SS+S V C + C +++ QC S C
Sbjct: 120 FPKIDPTG----IPRFVPKLSSSSKLVGCQNPKCSWIFGPDVKSQCRSCNPKTENCTQTC 175
Query: 187 P-YQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGL 245
P Y V+Y S T G L+ + L D+K V GC SFL P+G+
Sbjct: 176 PAYVVQYGSGST--AGLLLSETLDFP-DKKIPNFV-----VGC------SFLSIHQPSGI 221
Query: 246 FGLGMDKTSVPSILANQGLIPNSFSMCFGS--------------DGTGRISFGDKGSPGQ 291
G G S+PS + GL F+ C S D TG S G +P +
Sbjct: 222 AGFGRGSESLPSQM---GL--KKFAYCLASRKFDDSPHSGQLILDSTGVKSSGLTYTPFR 276
Query: 292 GETPFSLRQTHPTYNITITQVSVGGNAVNFEFS-----------AIFDSGTSFTYLNDPA 340
S Y + I ++ VG AV + +I DSG++FT+++ P
Sbjct: 277 QNPSVSNNAYKEYYYLNIRKIIVGNQAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPV 336
Query: 341 YTQISETF-NSLAKEKRETSTSDLP-FEYCYVLSPNQTNFEYPVVNLTMKGGGPFF--VN 396
++ F LA R T L C+ +S + + ++P + KGG + +N
Sbjct: 337 LEVVAREFEKQLANWTRATDVETLTGLRPCFDIS-KEKSVKFPELIFQFKGGAKWALPLN 395
Query: 397 DPIVIVSSEPKGLYLYCLGVV 417
+ +VSS + CL VV
Sbjct: 396 NYFALVSSS----GVACLTVV 412
>gi|326507654|dbj|BAK03220.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 442
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 94/357 (26%), Positives = 143/357 (40%), Gaps = 54/357 (15%)
Query: 44 GILAVDDLPKKGSFAYYSALAHRDRYFRLR-GRGLAAQGNDKTPLTFSAGNDTYRLNSLG 102
G+ + P + + RD + R R LA+ G D+T T + G
Sbjct: 32 GLTRIHSNPDVSATEFVRDALRRDMHRHARFTRELASSG-DRT-----VAAPTRKDLPNG 85
Query: 103 FLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
+ +++G P LS+ DTGSDL W C C +GQ Y+P++S+T
Sbjct: 86 GEYIMTLAIGTPPLSYPAIADTGSDLIW--TQCAPCGSQCFKQAGQP-----YNPSSSTT 138
Query: 163 SSKVPCNSTL---CELQKQCPSAGSNCPYQVRYLSDGTMSTGFL--VEDVLHLATDEKQS 217
+PCNS++ L P G +C Y Y GT T + VE +T Q+
Sbjct: 139 FGVLPCNSSVSMCAALAGPSPPPGCSCMYNQTY---GTGWTAGIQSVETFTFGSTPADQT 195
Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---- 273
+ I+FGC + + +G+A GL GLG S+ S L FS C
Sbjct: 196 RV--PGIAFGCSNASSDDW-NGSA--GLVGLGRGSMSLVSQLGA-----GMFSYCLTPFQ 245
Query: 274 GSDGTGRISFGDKGS---PGQGETPF----SLRQTHPTYNITITQVSVGGNAVNFEFSA- 325
++ T + G + G TPF S Y + +T +S+G A++ +A
Sbjct: 246 DANSTSTLLLGPSAALNGTGVLTTPFVASPSKAPMSTYYYLNLTGISIGTTALSIPPNAF 305
Query: 326 ----------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLS 372
I DSGT+ T L D AY Q+ SL + + C+ L+
Sbjct: 306 ALRTDGTGGLIIDSGTTITSLVDAAYQQVRAAIESLVTLPVADGSDSTGLDLCFALT 362
>gi|51091919|dbj|BAD35188.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|125596474|gb|EAZ36254.1| hypothetical protein OsJ_20576 [Oryza sativa Japonica Group]
gi|196212950|gb|ACG76111.1| S5 [Oryza sativa Japonica Group]
gi|340810891|gb|AEK75372.1| S5 [Oryza sativa]
gi|340810893|gb|AEK75373.1| S5 [Oryza sativa]
gi|340810899|gb|AEK75376.1| S5 [Oryza sativa]
gi|340810901|gb|AEK75377.1| S5 [Oryza sativa]
gi|340810933|gb|AEK75393.1| S5 [Oryza sativa]
gi|340810947|gb|AEK75400.1| S5 [Oryza sativa]
gi|340810949|gb|AEK75401.1| S5 [Oryza sativa]
gi|340810967|gb|AEK75410.1| S5 [Oryza sativa]
gi|340810969|gb|AEK75411.1| S5 [Oryza sativa]
gi|340810999|gb|AEK75426.1| S5 [Oryza rufipogon]
gi|340811017|gb|AEK75435.1| S5 [Oryza rufipogon]
gi|340811029|gb|AEK75441.1| S5 [Oryza nivara]
gi|340811051|gb|AEK75452.1| S5 [Oryza nivara]
gi|340811075|gb|AEK75464.1| S5 [Oryza nivara]
gi|340811077|gb|AEK75465.1| S5 [Oryza rufipogon]
gi|340811085|gb|AEK75469.1| S5 [Oryza nivara]
gi|340811096|gb|AEK75474.1| S5 [Oryza rufipogon]
gi|340811100|gb|AEK75476.1| S5 [Oryza rufipogon]
gi|340811114|gb|AEK75483.1| S5 [Oryza nivara]
Length = 472
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 74/263 (28%), Positives = 107/263 (40%), Gaps = 34/263 (12%)
Query: 103 FLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
FL VS+G+P + +VA+DTGS L W+ C C H ++ +G + D P S
Sbjct: 112 FLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFD-----PGRSY 166
Query: 162 TSSKVPCNSTLC-EL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
TS +V C+S C EL Q C +C Y V Y + S G +V D L +
Sbjct: 167 TSRRVRCSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDS 226
Query: 214 EKQSKSVDSRISFGCGR-VQTGSFLDGAAPNGLFGLGMDK--TSVPSILANQGLIPNSFS 270
+ FGC V+ F G G + P IL+ + L S
Sbjct: 227 FMD-------LMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKAL-----S 274
Query: 271 MCFGSDGT--GRISFG--DKGSPGQGETPFSLRQTHPTYNITITQ-VSVGGNAVNFEFSA 325
C +D T G + G D+ + G TP PTY++T+ ++ G V
Sbjct: 275 YCLPTDETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRLVTSSSEM 334
Query: 326 IFDSGTSFTYLNDPAYTQISETF 348
I DSG T L + + +T
Sbjct: 335 IVDSGAQRTSLWPSTFALLDKTI 357
>gi|340810993|gb|AEK75423.1| S5 [Oryza rufipogon]
gi|340811015|gb|AEK75434.1| S5 [Oryza nivara]
gi|340811021|gb|AEK75437.1| S5 [Oryza nivara]
Length = 474
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 74/263 (28%), Positives = 107/263 (40%), Gaps = 34/263 (12%)
Query: 103 FLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
FL VS+G+P + +VA+DTGS L W+ C C H ++ +G + D P S
Sbjct: 114 FLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFD-----PGRSY 168
Query: 162 TSSKVPCNSTLC-EL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
TS +V C+S C EL Q C +C Y V Y + S G +V D L +
Sbjct: 169 TSRRVRCSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDS 228
Query: 214 EKQSKSVDSRISFGCGR-VQTGSFLDGAAPNGLFGLGMDK--TSVPSILANQGLIPNSFS 270
+ FGC V+ F G G + P IL+ + L S
Sbjct: 229 FMD-------LMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKAL-----S 276
Query: 271 MCFGSDGT--GRISFG--DKGSPGQGETPFSLRQTHPTYNITITQ-VSVGGNAVNFEFSA 325
C +D T G + G D+ + G TP PTY++T+ ++ G V
Sbjct: 277 YCLPTDETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRLVTSSSEM 336
Query: 326 IFDSGTSFTYLNDPAYTQISETF 348
I DSG T L + + +T
Sbjct: 337 IVDSGAQRTSLWPSTFALLDKTI 359
>gi|356557010|ref|XP_003546811.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 437
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 94/332 (28%), Positives = 134/332 (40%), Gaps = 52/332 (15%)
Query: 111 VGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCN 169
+G P + + DTGSDL W+ C C +C D ++ P SST C+
Sbjct: 98 IGTPPVERLAIADTGSDLIWVQCSPCQNCFPQ---------DTPLFEPLKSSTFKAATCD 148
Query: 170 STLCE----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL-ATDEKQSKSVDSRI 224
S C Q+QC G C Y Y D + + G + + L +T + Q+ S S I
Sbjct: 149 SQPCTSVPPSQRQCGKVG-QCIYSYSY-GDKSFTVGVVGTETLSFGSTGDAQTVSFPSSI 206
Query: 225 SFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDGTGRI 281
FGCG +F GL GLG S+ S L Q I FS C F S+ T ++
Sbjct: 207 -FGCGVYNNFTFHTSDKVTGLVGLGGGPLSLVSQLGPQ--IGYKFSYCLLPFSSNSTSKL 263
Query: 282 SFGDKG---SPGQGETPFSLRQTHPT-YNITITQVSVGGNAV---NFEFSAIFDSGTSFT 334
FG + + G TP ++ P+ Y + + V++G V + + I DSGT T
Sbjct: 264 KFGSEAIVTTNGVVSTPLIIKPLFPSFYFLNLEAVTIGQKVVPTGRTDGNIIIDSGTVLT 323
Query: 335 YLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFF 394
YL Y SL + S DLPF + + + PV+ G
Sbjct: 324 YLEQTFYNNF---VASLQEVLSVESAQDLPFPFKFCFP--YRDMTIPVIAFQFTGAS--- 375
Query: 395 VNDPIVIVSSEPKGLY-------LYCLGVVKS 419
V+ +PK L + CL VV S
Sbjct: 376 -------VALQPKNLLIKLQDRNMLCLAVVPS 400
>gi|356495752|ref|XP_003516737.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 396
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 80/313 (25%), Positives = 129/313 (41%), Gaps = 38/313 (12%)
Query: 97 RLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIY 155
R+ S + +++G P + +DTGSDL W C C C + ++
Sbjct: 42 RVTSNNGDYLMKLTLGTPPVDVYGLVDTGSDLVWAQCTPCQGCYRQKSP---------MF 92
Query: 156 SPNTSSTSSKVPCNSTLCE-LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
P S+T + +PC+S C L S C Y Y +D +++ G L + + ++ +
Sbjct: 93 EPLRSNTYTPIPCDSEECNSLFGHSCSPQKLCAYSYAY-ADSSVTKGVLARETVTFSSTD 151
Query: 215 KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNS--FSMC 272
+ V I FGCG +G+F + G+ S+++ G + S FS C
Sbjct: 152 GEPVVV-GDIVFGCGHSNSGTFNEND-----MGIIGLGGGPLSLVSQFGNLYGSKRFSQC 205
Query: 273 ---FGSD--GTGRISFGDKGS---PGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS 324
F +D G ISFGD G TP + Y +T+ +SVG V+F S
Sbjct: 206 LVPFHADPHTLGTISFGDASDVSGEGVAATPLVSEEGQTPYLVTLEGISVGDTFVSFNSS 265
Query: 325 AIF-------DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTN 377
+ DSGT TYL Y ++ + + DL + CY ++TN
Sbjct: 266 EMLSKGNIMIDSGTPATYLPQEFYDRLVKELKVQSNMLPIDDDPDLGTQLCYR---SETN 322
Query: 378 FEYPVVNLTMKGG 390
E P++ +G
Sbjct: 323 LEGPILIAHFEGA 335
>gi|242079451|ref|XP_002444494.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
gi|241940844|gb|EES13989.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
Length = 445
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 86/315 (27%), Positives = 129/315 (40%), Gaps = 50/315 (15%)
Query: 98 LNSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYS 156
+ G LH+T VS+G P + LDTGSDL W C + Q + +Y
Sbjct: 81 IRPFGRLHHTLTVSIGTPPQPRTLILDTGSDLIWTQCKLF--------DTRQHREKPLYD 132
Query: 157 PNTSSTSSKVPCNSTLCEL----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLAT 212
P SS+ + PC+ LCE K C + + C Y Y S T G L +
Sbjct: 133 PAKSSSFAAAPCDGRLCETGSFNTKNC--SRNKCIYTYNYGSATT--KGELASETFTFGE 188
Query: 213 DEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC 272
+ S S+D FGCG++ +GS L GA+ G+ G+ D+ S L +Q IP FS C
Sbjct: 189 HRRVSVSLD----FGCGKLTSGS-LPGAS--GILGISPDRLS----LVSQLQIPR-FSYC 236
Query: 273 ----FGSDGTGRISFGDKGSPGQGETPFSLRQTHPT---------YNITITQVSVGGNAV 319
+ T I FG + T ++ T Y + + +SVG +
Sbjct: 237 LTPFLDRNTTSHIFFGAMADLSKYRTTGPIQTTSLVTNPDGSNYYYYVPLIGISVGTKRL 296
Query: 320 NFEFS--AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTN 377
N S AI G+ T+++ T + + A ++ LP V++
Sbjct: 297 NVPVSSFAIGRDGSGGTFVDSGDTTGMLPSVVMEALKEAMVEAVKLP-----VVNATDHG 351
Query: 378 FEYPV-VNLTMKGGG 391
+EY + L GGG
Sbjct: 352 YEYELCFQLPRNGGG 366
>gi|168014386|ref|XP_001759733.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689272|gb|EDQ75645.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 392
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 90/361 (24%), Positives = 142/361 (39%), Gaps = 70/361 (19%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ + S+G P F + +DTGSDL ++ C C C D +Y P+ SST
Sbjct: 34 YFVDFSLGTPEQKFHLIVDTGSDLAFVQCAPCDLCYEQ---------DGPLYQPSNSSTF 84
Query: 164 SKVPCNSTLCEL------------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLA 211
+ VPC+S C L + P G+ C Y+ RY D + + G + +
Sbjct: 85 TPVPCDSAECLLIPAPVGAPCSSSYPESPPQGA-CSYEYRY-GDNSSTVGVFAYETATVG 142
Query: 212 TDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSM 271
+ ++FGCG GSF+ G+ GLG S S N F+
Sbjct: 143 GIRV------NHVAFGCGNRNQGSFVSAG---GVLGLGQGALSFTSQAGYA--FENKFAY 191
Query: 272 CFGSDGT-----GRISFGDKGSPGQGETPFSLRQTHP----TYNITITQVSVGGNAVNFE 322
C S + + FGD + F+ ++P Y + I ++ GG +
Sbjct: 192 CLTSYLSPTSVFSSLIFGDDMMSTIHDLQFTPLVSNPLNPSVYYVQIVRICFGGETLLIP 251
Query: 323 FSA-----------IFDSGTSFTYLNDPAYTQISETFN-SLAKEKRETSTSDLPFEYCYV 370
SA IFDSGT+ TY + AY +I F S+ + S LP
Sbjct: 252 DSAWKIDSVGNGGTIFDSGTTVTYWSPQAYARIIAAFEKSVPYPRAPPSPQGLP------ 305
Query: 371 LSPNQTNFEYPV---VNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVK--SDNVNII 425
L N + ++P+ + G + N + P + CL +++ SD N+I
Sbjct: 306 LCVNVSGIDHPIYPSFTIEFDQGATYRPNQGNYFIEVSPN---IDCLAMLESSSDGFNVI 362
Query: 426 G 426
G
Sbjct: 363 G 363
>gi|108707838|gb|ABF95633.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 391
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 100/355 (28%), Positives = 145/355 (40%), Gaps = 57/355 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ ++++G P + LDTGSDL W C C +C Q + + + P+TSST
Sbjct: 35 YLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFD-------QALPY--FDPSTSSTL 85
Query: 164 SKVPCNSTLCELQKQCPSAGS-------NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
S C+STLC+ S GS C Y Y D +++TGFL D
Sbjct: 86 SLTSCDSTLCQ-GLPVASCGSPKFWPNQTCVYTYSY-GDKSVTTGFLEVDKFTFV---GA 140
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD 276
SV ++FGCG G F G+ G G S+PS L +FS CF +
Sbjct: 141 GASVPG-VAFGCGLFNNGVFKSNE--TGIAGFGRGPLSLPSQLKV-----GNFSHCFTTI 192
Query: 277 GTGRISF-------GDKGSPGQGE---TP---FSLRQTHPT-YNITITQVSVGGNAVNFE 322
TG I D S GQG TP ++ + +PT Y +++ ++VG +
Sbjct: 193 -TGAIPSTVLLDLPADLFSNGQGAVQTTPLIQYAKNEANPTLYYLSLKGITVGSTRLPVP 251
Query: 323 FSA----------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLS 372
SA I DSGTS T L Y + + F A+ K + Y +
Sbjct: 252 ESAFALTNGTGGTIIDSGTSITSLPPQVYQVVRDEF--AAQIKLPVVPGNATGHYTCFSA 309
Query: 373 PNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIGR 427
P+Q + P + L +G + V + G + CL + K D IIG
Sbjct: 310 PSQAKPDVPKLVLHFEGATMDLPRENYVFEVPDDAGNSIICLAINKGDETTIIGN 364
>gi|449462551|ref|XP_004149004.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449515029|ref|XP_004164552.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 434
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 84/304 (27%), Positives = 122/304 (40%), Gaps = 40/304 (13%)
Query: 108 NVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
++ +G P ++ I DTGSDL W C+ C N S I++P SS+ KV
Sbjct: 93 SIFIGTPPVNVIAIADTGSDLTW--TQCLPCRECFNQSQ------PIFNPRRSSSYRKVS 144
Query: 168 CNSTLCELQK--QCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
C S C + C +C Y Y D + + G L D + + + K K+V
Sbjct: 145 CASDTCRSLESYHCGPDLQSCSYGYSY-GDRSFTYGDLASDQITIGS-FKLPKTV----- 197
Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-----GSDGTGR 280
GCG G+F G + G + V + G+ P FS C ++ TG
Sbjct: 198 IGCGHQNGGTF-GGVTSGIIGLGGGSLSLVSQMRTIAGVKPR-FSYCLPTFFSNANITGT 255
Query: 281 ISFGDKGSPGQGE---TPFSLRQTHPTYNITITQVSVGG---------NAVNFEFSAIFD 328
ISFG K + TP R Y +T+ +SVG +A+ + I D
Sbjct: 256 ISFGRKAVVSGRQVVSTPLVPRSPDTFYFLTLEAISVGKKRFKAANGISAMTNHGNIIID 315
Query: 329 SGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQT-NFEYPVVNLTM 387
SGT+ T L Y + T + K KR S + E CY S Q + P++
Sbjct: 316 SGTTLTLLPRSLYYGVFSTLARVIKAKRVDDPSGI-LELCY--SAGQVDDLNIPIITAHF 372
Query: 388 KGGG 391
GG
Sbjct: 373 AGGA 376
>gi|218187618|gb|EEC70045.1| hypothetical protein OsI_00635 [Oryza sativa Indica Group]
Length = 570
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 92/346 (26%), Positives = 138/346 (39%), Gaps = 46/346 (13%)
Query: 97 RLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYS 156
++ S F + V++G P S + DTGSDL W V C G N +S +
Sbjct: 93 KVVSRSFEYLMTVNLGSPPRSMLAIADTGSDLVW-----VKCKKGNNDTSSAAAPTTQFD 147
Query: 157 PNTSSTSSKVPCNSTLCE-LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEK 215
P+ SST +V C + CE L + GSNC Y Y DG+ +TG L +
Sbjct: 148 PSRSSTYGRVSCQTDACEALGRATCDDGSNCAYLYAY-GDGSNTTGVLSTETFTFDDGGA 206
Query: 216 QSKSVDSRI---SFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC 272
RI FGC GSF +GL GLG S+ + L + FS C
Sbjct: 207 GRSPRQVRIGGVKFGCSTATAGSF----PADGLVGLGGGAVSLVTQLGGATSLGRRFSYC 262
Query: 273 F---GSDGTGRISFG---DKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA- 325
+ + ++FG D PG TP VG V S+
Sbjct: 263 LVPHSVNASSALNFGALADVTEPGAASTPL-----------------VGNKTVASAASSR 305
Query: 326 -IFDSGTSFTYLNDPAYTQISETFNSLAKEKR--ETSTSDLPFEYCYVLSPNQTNFEYPV 382
I DSGT+ T+L DP+ + + L++ + D + CY ++ + +
Sbjct: 306 IIVDSGTTLTFL-DPSL--LGPIVDELSRRITLPPVQSPDGLLQLCYNVAGREVEAGESI 362
Query: 383 VNLTMK-GGGPFFVNDPI-VIVSSEPKGLYLYCLGVVKSDNVNIIG 426
+LT++ GGG P V+ + L L + + V+I+G
Sbjct: 363 PDLTLEFGGGAAVALKPENAFVAVQEGTLCLAIVATTEQQPVSILG 408
>gi|302809015|ref|XP_002986201.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
gi|300146060|gb|EFJ12732.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
Length = 449
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 94/357 (26%), Positives = 143/357 (40%), Gaps = 57/357 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ +V VG P F++ +DTGSDL WL C C +C SG V D P+ S++
Sbjct: 87 YFMDVFVGNPPRHFLLIIDTGSDLTWLQCKPCKACF----DQSGPVFD-----PSQSTSF 137
Query: 164 SKVPCNSTLCEL--QKQCPSAGSN-----CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
+PCN+ C+L +C S C Y Y D + ++G L + L ++ +
Sbjct: 138 KIIPCNAAACDLVVHDECRDNSSKTSPKTCKYFYWY-GDSSRTSGDLALESLSVSLSDHP 196
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD 276
S + GCG G GL GLG S PS L + I SFS C D
Sbjct: 197 SSLEIRDMVIGCGHSNKGL---FQGAGGLLGLGQGALSFPSQLRSSP-IGQSFSYCL-VD 251
Query: 277 GTGRISFGDKGSPGQG-----------ETPF--SLRQTHPTYNITITQVSVGGN------ 317
T +S S G G TPF + Y + I + +
Sbjct: 252 RTNNLSVSSAISFGAGFALSRHFDQMKFTPFVRTNNSVETFYYLGIQGIKIDQELLPIPA 311
Query: 318 -----AVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFE---YCY 369
A N I DSGT+ TYLN AY + F + R PF+ CY
Sbjct: 312 ERFAIATNGSGGTIIDSGTTLTYLNRDAYRAVESAFLARISYPRAD-----PFDILGICY 366
Query: 370 VLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
+ + +P +++ + G + + +P+ +CL ++ +D ++IIG
Sbjct: 367 NAT-GRAAVPFPALSIVFQNGAELDLPQENYFIQPDPQEAK-HCLAILPTDGMSIIG 421
>gi|242066168|ref|XP_002454373.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
gi|241934204|gb|EES07349.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
Length = 458
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 104/397 (26%), Positives = 163/397 (41%), Gaps = 37/397 (9%)
Query: 41 PVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNS 100
P +L D + S A A R +LR RG ++ + ++ + G T S
Sbjct: 62 PFSAVL-THDHARIASLAARLAKTPSSRPTKLR-RGSSSSPDAESLASVPLGPGT----S 115
Query: 101 LGFLHY-TNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNT 159
+G +Y T + +G PA S+++ +DTGS L WL C C+ + SG V + S
Sbjct: 116 VGVGNYVTRMGLGTPAKSYVMVVDTGSSLTWL--QCSPCLVSCHRQSGPVFNPRSSSSYA 173
Query: 160 SSTSSKVPCNS-TLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
S + S C++ T L S + C YQ Y D + S G+L +D + S
Sbjct: 174 SVSCSAPQCDALTTATLNPSTCSTSNVCIYQASY-GDSSFSVGYLSKDTVSFG-----ST 227
Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT 278
SV + +GCG+ G F A GL GL +K S+ LA + SFS C + +
Sbjct: 228 SVPN-FYYGCGQDNEGLFGQSA---GLIGLARNKLSLLYQLAPS--MGYSFSYCLPTSSS 281
Query: 279 GRISFGDKG-SPGQ-GETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA------IFDS 329
+PGQ TP + + Y I +T ++V G ++ SA I DS
Sbjct: 282 SSGYLSIGSYNPGQYSYTPMAKSSLDDSLYFIKMTGITVAGKPLSVSASAYSSLPTIIDS 341
Query: 330 GTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKG 389
GT T L Y+ +S+ K S + + C+ + P V++ G
Sbjct: 342 GTVITRLPTDVYSALSKAVAGAMKGTPRASAFSI-LDTCF--QGQASRLRVPQVSMAFAG 398
Query: 390 GGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
G + ++V + CL + + IIG
Sbjct: 399 GAALKLKATNLLVDVDSA---TTCLAFAPARSAAIIG 432
>gi|414887401|tpg|DAA63415.1| TPA: hypothetical protein ZEAMMB73_414910 [Zea mays]
Length = 242
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 61/236 (25%), Positives = 105/236 (44%), Gaps = 22/236 (9%)
Query: 199 STGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSI 258
S+G L ED++ ++S+ R FGC +TG A +G+ GLG + S+
Sbjct: 4 SSGVLGEDIVSFG---RESELKAQRAVFGCENSETGDLFSQHA-DGIMGLGRGQLSIMDQ 59
Query: 259 LANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFS----LRQTHPTYNITITQVSV 314
L +G+I +SFS+C+G G + G P + FS LR P YNI + ++ V
Sbjct: 60 LVEKGVINDSFSLCYGGMDIGGGAMVLGGVPTPSDMVFSRSDPLRS--PYYNIELKEIHV 117
Query: 315 GGNAVNFE-------FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPF-E 366
G A+ + + DSGT++ YL + A+ + S ++ D + +
Sbjct: 118 AGKALRVDSRIFDSKHGTVLDSGTTYAYLPEQAFMAFKDAVTSKVHSLKKIRGPDPSYKD 177
Query: 367 YCYV---LSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKS 419
C+ + ++ + +P V++ G G P + K YCLGV ++
Sbjct: 178 ICFAGARRNVSKLHEVFPDVDMVF-GNGQKLSLTPENYLFRHSKVDGAYCLGVFQN 232
>gi|224126751|ref|XP_002329464.1| predicted protein [Populus trichocarpa]
gi|222870144|gb|EEF07275.1| predicted protein [Populus trichocarpa]
Length = 439
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 97/383 (25%), Positives = 146/383 (38%), Gaps = 60/383 (15%)
Query: 66 RDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTG 125
R R R + L A N + GN + + +++G P ++ +DTG
Sbjct: 67 RHRLQRFKAMALVASSNSEIDAPVLPGNGEFLMK---------LAIGTPPETYSAIMDTG 117
Query: 126 SDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGS 184
SDL W C C C I+ P SS+ SK+ C+S LCE Q +
Sbjct: 118 SDLIWTQCKPCTQCFDQPTP---------IFDPKKSSSFSKLSCSSKLCEALPQS-TCSD 167
Query: 185 NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGS-FLDGAAPN 243
C Y Y D + + G L + L K ++FGCG GS F G+
Sbjct: 168 GCEYLYGY-GDYSSTQGMLASETLTFG------KVSVPEVAFGCGEDNEGSGFSQGS--- 217
Query: 244 GLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGE--------TP 295
GL GLG S+ S L FS C S + S GS + TP
Sbjct: 218 GLVGLGRGPLSLVSQLKEP-----KFSYCLTSVDDTKASTLLMGSLASVKASDSEIKTTP 272
Query: 296 FSLRQTHPT-YNITITQVSVGGNAVNFEFSA-----------IFDSGTSFTYLNDPAYTQ 343
P+ Y +++ +SVG ++ + S I DSGT+ TYL A+
Sbjct: 273 LIQNSAQPSFYYLSLEGISVGDTSLPIKKSTFSLQEDGSGGLIIDSGTTITYLEQSAFDL 332
Query: 344 ISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVS 403
+++ F S + S S E C+ L T+ E P + G + +I
Sbjct: 333 VAKEFTSQINLPVDNSGST-GLEVCFTLPSGSTDIEVPKLVFHFDGADLELPAENYMIAD 391
Query: 404 SEPKGLYLYCLGVVKSDNVNIIG 426
+ + + CL + S ++I G
Sbjct: 392 AS---MGVACLAMGSSSGMSIFG 411
>gi|357118738|ref|XP_003561107.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 491
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 70/274 (25%), Positives = 108/274 (39%), Gaps = 40/274 (14%)
Query: 120 VALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE----L 175
+ +DT SD+ W+ C H + +Y P+ SS+S+ PC+S C
Sbjct: 158 MVIDTASDVPWVQCAPCPAPHCHAQTD------VLYDPSKSSSSAAFPCSSPACRNLGPY 211
Query: 176 QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGR--VQT 233
C AG C Y+V+Y DG+ S G + DVL L + + S S FGC +Q
Sbjct: 212 ANGCTPAGDQCQYRVQY-PDGSASAGTYISDVLTL--NPAKPASAISEFRFGCSHALLQP 268
Query: 234 GSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD---------GTGRISFG 284
GSF + +G+ LG S+P+ + + FS C G R++
Sbjct: 269 GSFSNKT--SGIMALGRGAQSLPT--QTKATYGDVFSYCLPPTPVHSGFFILGVPRVAAS 324
Query: 285 DKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVN-----FEFSAIFDSGTSFTYLND 338
TP + P Y + + + V G + F A+ DS T T L
Sbjct: 325 RYAV-----TPMLRSKAAPMLYLVRLIAIEVAGKRLPVPPAVFAAGAVMDSRTIVTRLPP 379
Query: 339 PAYTQISETFNSLAKEKRETSTSDLPFEYCYVLS 372
AY + F + + R + + + CY S
Sbjct: 380 TAYMALRAAFVAEMRAYRAAAPKEH-LDTCYDFS 412
>gi|238007638|gb|ACR34854.1| unknown [Zea mays]
gi|413948713|gb|AFW81362.1| pepsin A [Zea mays]
Length = 538
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 78/330 (23%), Positives = 128/330 (38%), Gaps = 49/330 (14%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPC-----------DCVSCVHGLN-SSSGQVID 151
++ +V G PAL + + LDT +DL W+ C +S G + +++ +
Sbjct: 126 MYLVSVRFGTPALPYNLVLDTANDLTWINCRLRRRKGKHYGRTMSVGAGDDGAAAKEARR 185
Query: 152 FNIYSPNTSSTSSKVPCNSTLCELQK----QCPSAGSNCPYQVRYLSDGTMSTGFLVEDV 207
N Y P SS+ ++ C+ C L Q PS +C Y + + DGT++ G ++
Sbjct: 186 KNWYRPAKSSSWRRIRCSQKECALLPYNTCQSPSKAESCSY-YQQMQDGTLTMGIYGKEK 244
Query: 208 LHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPN 267
+ + + + I GC ++ G +D A +G+ LG + S A +
Sbjct: 245 ATVTVSDGRMAKLPGLI-LGCSVLEAGGSVD--AHDGVLSLGNGEMSFAVHAAKR--FGQ 299
Query: 268 SFSMCFGS-----DGTGRISFGDKGS---PGQGETPFSLR-QTHPTYNITITQVSVGGNA 318
FS C S D + ++FG + PG ET P Y +T + VGG
Sbjct: 300 RFSFCLLSANSSRDASSYLTFGPNPAVMGPGTMETDIVYNVDVKPAYGPLVTGIFVGGER 359
Query: 319 VNFEFS-----------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEY 367
++ I D+ TS T L AY ++ + D FEY
Sbjct: 360 LDIPQEIWDAEKVVGGGVILDTSTSVTSLVPEAYAAVTSALDRHLSHLPRVYELD-GFEY 418
Query: 368 CYVLS------PNQTNFEYPVVNLTMKGGG 391
CY + N P + + M GG
Sbjct: 419 CYRWTFAGDGVDLAHNVTVPRLTVEMAGGA 448
>gi|348690233|gb|EGZ30047.1| hypothetical protein PHYSODRAFT_474645 [Phytophthora sojae]
Length = 642
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 81/308 (26%), Positives = 132/308 (42%), Gaps = 52/308 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
HY + +G PA V +DTGS L LPC C C GQ D ++ + S+T+
Sbjct: 95 HYAEIYLGIPAQRASVIVDTGSHLTALPCSTCQGC--------GQHTD-PLFDVSKSTTA 145
Query: 164 SKVPCNS----TLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLA-----TDE 214
+ C+ CE Q +C Y + +G+M +V++++ + DE
Sbjct: 146 KYLACHDFDSCRSCE-QDRC--------YISQSYMEGSMWEAVMVDELVWVGGFSSPADE 196
Query: 215 KQS--KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLI-PNSFSM 271
+ K+ R GC +TG F+ NG+ GLG +++V S + N G + N F++
Sbjct: 197 MEGVLKTFGFRFPVGCQTKETGLFIT-QKENGIMGLGRHRSTVMSYMLNAGRVTQNLFTL 255
Query: 272 CFGSDGTGRISFG----DKGSPGQGETPFSLRQT--HPTY--NITITQVSVGGN--AVNF 321
CF DG G + FG + G TP ++ +P + +I + VS+G + +N
Sbjct: 256 CFAGDG-GELVFGGVDYSHHTSDVGYTPLLSDKSAYYPVHVKDILLNGVSLGIDTGTINS 314
Query: 322 EFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYP 381
I DSGT+ T+ + F+ A S L E L P
Sbjct: 315 GRGVIVDSGTTDTFFDGKGKRAFMSAFSKAAGRDYSESRMKLTSEELAAL---------P 365
Query: 382 VVNLTMKG 389
V+++ + G
Sbjct: 366 VISIILSG 373
>gi|449464952|ref|XP_004150193.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
gi|449526850|ref|XP_004170426.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 476
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 78/304 (25%), Positives = 122/304 (40%), Gaps = 42/304 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ + VG P S V +D+GSD+ W+ C C C + ++ P S+T
Sbjct: 137 YFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSDP---------VFDPAGSATY 187
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
+ + C+S++C+ C Y+V Y DG+ + G L + L + +
Sbjct: 188 AGISCDSSVCDRLDNAGCNDGRCRYEVSY-GDGSYTRGTLALETLTFG------RVLIRN 240
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGTGR 280
I+ GCG + G F+ A GL G M S L Q +FS C G++ TG
Sbjct: 241 IAIGCGHMNRGMFIGAAGLLGLGGGAM---SFVGQLGGQ--TGGAFSYCLVSRGTESTGT 295
Query: 281 ISFGDKGSP-GQGETPFSLRQTHPTY------NITITQVSVGGNAVNFEFS------AIF 327
+ FG P G P P++ + + + V FE + +
Sbjct: 296 LEFGRGAMPVGAAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVM 355
Query: 328 DSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLT 386
D+GT+ T L PAY +TF A R S F+ CY L+ + P V+
Sbjct: 356 DTGTAVTRLPAPAYEAFRDTFIGQTANLPRSDRVSI--FDTCYNLN-GFVSVRVPTVSFY 412
Query: 387 MKGG 390
GG
Sbjct: 413 FSGG 416
>gi|196212948|gb|ACG76110.1| S5 [Oryza sativa Japonica Group]
gi|340810887|gb|AEK75370.1| S5 [Oryza sativa]
gi|340810903|gb|AEK75378.1| S5 [Oryza sativa]
gi|340810921|gb|AEK75387.1| S5 [Oryza sativa]
gi|340810955|gb|AEK75404.1| S5 [Oryza sativa]
gi|340811079|gb|AEK75466.1| S5 [Oryza nivara]
gi|340811090|gb|AEK75471.1| S5 [Oryza rufipogon]
gi|340811116|gb|AEK75484.1| S5 [Oryza nivara]
Length = 357
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 84/327 (25%), Positives = 125/327 (38%), Gaps = 54/327 (16%)
Query: 109 VSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
VS+G+P + +VA+DTGS L W+ C C H ++ +G + D P S TS +V
Sbjct: 3 VSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFD-----PGRSYTSRRVR 57
Query: 168 CNSTLC-EL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
C+S C EL Q C +C Y V Y + S G +V D L +
Sbjct: 58 CSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDSFMD--- 114
Query: 220 VDSRISFGCGR-VQTGSFLDGAAPNGLFGLGMDK--TSVPSILANQGLIPNSFSMCFGSD 276
+ FGC V+ F G G + P IL+ + +FS C +D
Sbjct: 115 ----LMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYK-----AFSYCLPTD 165
Query: 277 GT--GRISFG--DKGSPGQGETPFSLRQTHPTYNITITQ-VSVGGNAVNFEFSAIFDSGT 331
T G + G D+ + G TP PTY++T+ ++ G V I DSG
Sbjct: 166 ETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRLVTSSSEMIVDSGA 225
Query: 332 SFTYLNDPAYTQISETFNSLAKE--KRETSTSDLPFEYCYV-----------LSPNQTNF 378
T L + + +T TS + CY+ ++P
Sbjct: 226 QRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNGTITPFSNWS 285
Query: 379 EYPVVNLTMKGGGPF-------FVNDP 398
P++ + GG F NDP
Sbjct: 286 ALPLLEIGFAGGAALALPPRNVFYNDP 312
>gi|147859621|emb|CAN83119.1| hypothetical protein VITISV_043393 [Vitis vinifera]
Length = 431
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 80/323 (24%), Positives = 121/323 (37%), Gaps = 68/323 (21%)
Query: 70 FRLRGRGLAA-QGNDKT-PLTFSAGND-----TYRLNSLGFLHYTNVSVGQPALSFIVAL 122
F + R LAA + +D + L AG D T R ++G L+Y + +G PA + V +
Sbjct: 57 FAGQKRSLAALKAHDNSRQLRILAGVDLPLGGTGRPEAVG-LYYAKIGIGTPARDYYVQM 115
Query: 123 DTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPS- 181
+ +Y S T V C+ C P
Sbjct: 116 E----------------------------LTLYDIKESLTGKLVSCDQDFCYAINGGPPS 147
Query: 182 ---AGSNCPYQVRYLSDGTMSTGFLVE---------DVLHLATDEKQSKSVDSRISFGCG 229
A +C Y Y +DG+ S G+ V+ + HL + + C
Sbjct: 148 YCIANMSCSYTEIY-ADGSSSFGYFVKGYCTASKYNSIPHLNNNPLL------EVPLRCS 200
Query: 230 RVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-GSDGTGRISFGDKGS 288
Q+G A +G+ G G TS+ S LA+ G + F+ C G +G G + G
Sbjct: 201 ATQSGDLSSEEALDGILGFGKSNTSMISQLASSGKVRKMFAHCLDGLNGGGIFAIGHIVQ 260
Query: 289 PGQGETPFSLRQTHPTYNITITQVSVGGNAVNF---------EFSAIFDSGTSFTYLNDP 339
P TP QTH YN+ + V VGG +N + I DSGT+ YL +
Sbjct: 261 PKVNTTPLVPNQTH--YNVNMKAVEVGGYFLNLPTDVFDVGDKKGTIIDSGTTLAYLPEV 318
Query: 340 AYTQISETFNSLAKEKRETSTSD 362
Y Q+ S + + + D
Sbjct: 319 VYDQLLSKIFSWQSDLKVHTIHD 341
>gi|15226358|ref|NP_180389.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4803959|gb|AAD29831.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252998|gb|AEC08092.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 756
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 80/334 (23%), Positives = 137/334 (41%), Gaps = 58/334 (17%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
++ + VG P +DTGSDL W C C C + I+ P+ SST
Sbjct: 81 IYLMKLQVGTPPFEIAAEIDTGSDLIWTQCMPCPDCYSQFDP---------IFDPSKSST 131
Query: 163 SSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
++ C+ G +C Y++ Y D T S G L + + + + + V +
Sbjct: 132 FNEQRCH-------------GKSCHYEIIY-EDNTYSKGILATETVTIHSTSGE-PFVMA 176
Query: 223 RISFGCGRVQTGSFLD----GAAPNGLFGLGMDKTSVPSI--LANQGLIPNSFSMCFGSD 276
+ GCG T LD ++ +G+ GL M S+ S L GLI S CF
Sbjct: 177 ETTIGCGLHNTD--LDNSGFASSSSGIVGLNMGPRSLISQMDLPYPGLI----SYCFSGQ 230
Query: 277 GTGRISFGDK---GSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF--------EFSA 325
GT +I+FG G +++ +P Y + + VSV N + + +
Sbjct: 231 GTSKINFGTNAIVAGDGTVAADMFIKKDNPFYYLNLDAVSVEDNRIETLGTPFHAEDGNI 290
Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKR--ETSTSDLPFEYCYVLSPNQTNFEYPVV 383
+ DSG++ TY + + + R + S +D+ CY ++T +PV+
Sbjct: 291 VIDSGSTVTYFPVSYCNLVRKAVEQVVTAVRVPDPSGNDM---LCYF---SETIDIFPVI 344
Query: 384 NLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVV 417
+ GG ++ + + S G L+CL ++
Sbjct: 345 TMHFSGGADLVLDKYNMYMESNSGG--LFCLAII 376
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 69/249 (27%), Positives = 104/249 (41%), Gaps = 44/249 (17%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
++ + VG P + +DTGSD+ W C C +C I+ P+ SST
Sbjct: 420 IYLMKLQVGTPPFEIVAEIDTGSDIIWTQCMPCPNCYSQFAP---------IFDPSKSST 470
Query: 163 SSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
+ CN G++C Y++ Y +D T S G L + + + + + V +
Sbjct: 471 FREQRCN-------------GNSCHYEIIY-ADKTYSKGILATETVTIPSTSGE-PFVMA 515
Query: 223 RISFGCGRVQTGSFLDGAA--PNGLFGLGMDKTSVPSI--LANQGLIPNSFSMCFGSDGT 278
GCG T G A +G+ GL M S+ S L GLI S CF GT
Sbjct: 516 ETKIGCGLDNTNLQYSGFASSSSGIVGLNMGPLSLISQMDLPYPGLI----SYCFSGQGT 571
Query: 279 GRISFGDK---GSPGQGETPFSLRQTHPTYNITITQVSVGGNAV-------NFEFSAIF- 327
+I+FG G +++ +P Y + + VSV N + + E IF
Sbjct: 572 SKINFGTNAIVAGDGTVAADMFIKKDNPFYYLNLDAVSVEDNLIATLGTPFHAEDGNIFI 631
Query: 328 DSGTSFTYL 336
DSGT+ TY
Sbjct: 632 DSGTTLTYF 640
>gi|297817208|ref|XP_002876487.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
lyrata]
gi|297322325|gb|EFH52746.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
lyrata]
Length = 520
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 80/283 (28%), Positives = 111/283 (39%), Gaps = 46/283 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ +V VG P F + LDTGSDL W+ C C C + Y P S++
Sbjct: 155 YFMDVLVGSPPKHFSLILDTGSDLNWIQCLPCHDCFQQNGA---------FYDPKASASY 205
Query: 164 SKVPCNSTLCEL------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVED-VLHLATDEKQ 216
+ CN C L K C S +CPY Y + F VE ++L T
Sbjct: 206 KNITCNDPRCNLVSPPDPPKPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTSGGS 265
Query: 217 SKSVD-SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-- 273
S+ + + FGCG G F A L GLG S S L Q L +SFS C
Sbjct: 266 SELYNVENMMFGCGHWNRGLFHGAAG---LLGLGRGPLSFSSQL--QSLYGHSFSYCLVD 320
Query: 274 ---GSDGTGRISFGDKGS----PGQGETPFSLRQTHPT---YNITITQVSVGGNAVNFEF 323
++ + ++ FG+ P T F R+ + Y + I + V G +N
Sbjct: 321 RNSDTNVSSKLIFGEDKDLLSHPNLNFTSFVARKENLVDTFYYVQIKSIIVAGEVLNIPE 380
Query: 324 S-----------AIFDSGTSFTYLNDPAYTQISETFNSLAKEK 355
I DSGT+ +Y +PAY I AK K
Sbjct: 381 ETWNISSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGK 423
>gi|222632750|gb|EEE64882.1| hypothetical protein OsJ_19741 [Oryza sativa Japonica Group]
Length = 456
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 89/342 (26%), Positives = 142/342 (41%), Gaps = 52/342 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCV-SCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ V VG PA + ++ LDTGSD+ W P + + + S +T +
Sbjct: 122 YFAQVGVGTPATTALMVLDTGSDVVWAPVRALPPLLRAVRQGS-----------STGAAP 170
Query: 164 SKVPCNSTLCELQKQCPSAG-----SNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
+ P + + + ++ SAG ++C YQV Y DG+++ G + L A +
Sbjct: 171 APTPRWNCVAPICRRLDSAGCDRRRNSCLYQVAY-GDGSVTAGDFASETLTFARGARV-- 227
Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT 278
R++ GCG G F+ A +GL GLG + S PS +A SFS C +
Sbjct: 228 ---QRVAIGCGHDNEGLFI---AASGLLGLGRGRLSFPSQIARS--FGRSFSYCLVDRTS 279
Query: 279 GRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGG-------------NAVNFEFSA 325
S + S G TP + Y + + SVGG N
Sbjct: 280 ---SRRARPSRRWGGTP----RMATFYYVHLLGFSVGGARVKGVSQSDLRLNPTTGRGGV 332
Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNL 385
I DSGTS T L P Y + + F + A R + F+ CY LS + + P V++
Sbjct: 333 ILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCYNLSGRRV-VKVPTVSM 391
Query: 386 TMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSD-NVNIIG 426
+ GG + ++ + G +C + +D V+IIG
Sbjct: 392 HLAGGASVALPPENYLIPVDTSG--TFCFAMAGTDGGVSIIG 431
>gi|224072755|ref|XP_002303865.1| predicted protein [Populus trichocarpa]
gi|222841297|gb|EEE78844.1| predicted protein [Populus trichocarpa]
Length = 412
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 98/355 (27%), Positives = 152/355 (42%), Gaps = 55/355 (15%)
Query: 97 RLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIY 155
RL SL ++ V +G ++ IV DTGSDL W+ C C C + + ++
Sbjct: 60 RLQSLNYI--VTVELGGRKMTVIV--DTGSDLSWVQCQPCNRCYNQQDP---------VF 106
Query: 156 SPNTSSTSSKVPCNSTLCE-LQKQ------CPSAGSNCPYQVRYLSDGTMSTGFLVEDVL 208
+P+ S + V CNS C LQ C S C Y V Y DG+ ++G + + L
Sbjct: 107 NPSKSPSYRTVLCNSLTCRSLQLATGNSGVCGSNPPTCNYVVNY-GDGSYTSGEVGMEHL 165
Query: 209 HLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNS 268
+L + +V++ I FGCGR G F +GL GLG S+ S ++ +
Sbjct: 166 NLG-----NTTVNNFI-FGCGRKNQGLF---GGASGLVGLGRTDLSLISQISP--MFGGV 214
Query: 269 FSMCF---GSDGTGRISFGDKGSPGQGETPFS-LRQTH----PTYNITITQVSVGGNAVN 320
FS C ++ +G + G S + TP S R H P Y + +T ++VGG V
Sbjct: 215 FSYCLPTTEAEASGSLVMGGNSSVYKNTTPISYTRMIHNPLLPFYFLNLTGITVGGVEVQ 274
Query: 321 F----EFSAIFDSGTSFTYLNDPAYTQISETFNSLAKE--KRETSTSDLPFEYCYVLSPN 374
+ I DSGT + L Y + F K+ ++ S + + C+ LS
Sbjct: 275 APSFGKDRMIIDSGTVISRLPPSIYQALKAEF---VKQFSGYPSAPSFMILDSCFNLSGY 331
Query: 375 QTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVK---SDNVNIIG 426
Q + P + + +G V+ V S + + CL + D V IIG
Sbjct: 332 Q-EVKIPDIKMYFEGSAELNVDVTGVFYSVKTDASQV-CLAIASLPYEDEVGIIG 384
>gi|226492334|ref|NP_001147965.1| pepsin A precursor [Zea mays]
gi|195614874|gb|ACG29267.1| pepsin A [Zea mays]
Length = 538
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 80/336 (23%), Positives = 130/336 (38%), Gaps = 63/336 (18%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPC-----------DCVSCVHGLN-SSSGQVID 151
++ +V G PAL + + LDT +DL W+ C +S G + +++ +
Sbjct: 126 MYLVSVRFGTPALPYNLVLDTANDLTWINCRLRRRKGKHYGRTMSVGAGDDGAAAKEARR 185
Query: 152 FNIYSPNTSSTSSKVPCNSTLCELQK----QCPSAGSNCPYQVRYLSDGTMSTGFLVEDV 207
N Y P SS+ ++ C+ C L Q PS +C Y + + DGT++ G ++
Sbjct: 186 KNWYRPAKSSSWRRIRCSQKECALLPYNTCQSPSKAESCSY-YQQMQDGTLTMGIYGKEK 244
Query: 208 LHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPN 267
+ + + + I GC ++ G +D A +G+ LG + S A +
Sbjct: 245 ATVTVSDGRMAKLPGLI-LGCSVLEAGGSVD--AHDGVLSLGNGEMSFAVHAAKR--FGQ 299
Query: 268 SFSMCFGS-----DGTGRISFGDKGS---PGQGETPFSLR-QTHPTYNITITQVSVGGNA 318
FS C S D + ++FG + PG ET P Y +T + VGG
Sbjct: 300 RFSFCLLSANSSRDASSYLTFGPNPAVMGPGTMETDIVYNVDVKPAYGPLVTGIFVGGER 359
Query: 319 VNFEFS-----------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP--- 364
++ I D+ TS T L AY ++ + S LP
Sbjct: 360 LDIPQEIWDAEKVVGGGVILDTSTSVTSLVPEAYAAVTSALDR--------HLSHLPRVY 411
Query: 365 ----FEYCYVLS------PNQTNFEYPVVNLTMKGG 390
FEYCY + N P + + M GG
Sbjct: 412 ELDGFEYCYRWTFAGDGVDLTHNVTVPRLTVEMAGG 447
>gi|388495448|gb|AFK35790.1| unknown [Medicago truncatula]
Length = 434
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 88/330 (26%), Positives = 126/330 (38%), Gaps = 50/330 (15%)
Query: 112 GQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNS 170
G P + ++ALDT SD W+PC CV C S+S ++P S++ V C S
Sbjct: 104 GTPPQTLLLALDTSSDAAWIPCSGCVGC-----STSKP------FAPIKSTSFRNVSCGS 152
Query: 171 TLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGR 230
C+ GS C + Y S ++ +V+D L LA D +FGC
Sbjct: 153 PHCKQVPNPTCGGSACAFNFTYGSSSIAAS--VVQDTLTLAADPIPG------YTFGCVN 204
Query: 231 VQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS----DGTGRISFGDK 286
TGS +AP + +Q L ++FS C S + +G + G
Sbjct: 205 KTTGS----SAPQQGLLGLGRGPLS-LLSQSQNLYKSTFSYCLPSFKSINFSGSLRLGPV 259
Query: 287 GSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEFSA-----------IFDSGTSF 333
P + + LR + Y + + + VG V+ +A IFDSGT F
Sbjct: 260 YQPKRIKYTPLLRNPRRSSLYYVNLVAIKVGRKIVDIPPAALAFNPTTGAGTIFDSGTVF 319
Query: 334 TYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPF 393
T L +P YT + F K +T F+ CY P + G
Sbjct: 320 TRLAEPVYTAVRNEFRRRVGPKLPVTTLG-GFDTCY-----NVPIVVPTITFLFSGMNVA 373
Query: 394 FVNDPIVIVSSEPKGLYLYCLGVVKSDNVN 423
D IVI S+ L G DNVN
Sbjct: 374 LPPDNIVIHSTAGSTTCLAMAGA--PDNVN 401
>gi|413938607|gb|AFW73158.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 478
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 92/323 (28%), Positives = 135/323 (41%), Gaps = 35/323 (10%)
Query: 65 HRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYT-NVSVGQPALSFIVALD 123
R Y R G A Q D +A +G L+Y S+G P ++ + +D
Sbjct: 99 RRAEYILRRVSGRAPQLWDSKAAAAAATVPASWGYDIGTLNYVVTASLGTPGVAQTMEVD 158
Query: 124 TGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE---LQKQCP 180
TGSDL W+ C S S + D P SS+ + VPC +C +
Sbjct: 159 TGSDLSWVQCKPCSAAPSCYSQKDPLFD-----PAQSSSYAAVPCGGPVCAGLGIYAASA 213
Query: 181 SAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGA 240
+ + C Y V Y DG+ +TG D L L+ + S FGCG Q+G F +G
Sbjct: 214 CSAAQCGYVVSY-GDGSNTTGVYSSDTLTLS-----ASSAVQGFFFGCGHAQSGLF-NGV 266
Query: 241 APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT--GRISFGDKGSPGQGETPFSL 298
+GL GLG ++ S+ + G FS C + + G ++ G G P FS
Sbjct: 267 --DGLLGLGREQPSL--VEQTAGTYGGVFSYCLPTKPSTAGYLTLG-VGGPSGAAPGFST 321
Query: 299 RQTHPT------YNITITQVSVGGNAVNFEFSAI-----FDSGTSFTYLNDPAYTQISET 347
Q P+ Y + +T +SVGG ++ SA D+GT T L AY +
Sbjct: 322 TQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTVVDTGTVVTRLPPTAYAALRSA 381
Query: 348 FNS-LAKEKRETSTSDLPFEYCY 369
F S +A T+ S+ + CY
Sbjct: 382 FRSGMASYGYPTAPSNGILDTCY 404
>gi|226509408|ref|NP_001141440.1| uncharacterized protein LOC100273550 precursor [Zea mays]
gi|194704586|gb|ACF86377.1| unknown [Zea mays]
gi|413938617|gb|AFW73168.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 478
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 91/323 (28%), Positives = 134/323 (41%), Gaps = 35/323 (10%)
Query: 65 HRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYT-NVSVGQPALSFIVALD 123
R Y R G A Q D A +G L+Y S+G P ++ + +D
Sbjct: 99 RRAEYILRRVSGRAPQLWDSKAAAAVATVPASWGYDIGTLNYVVTASLGTPGVAQTMEVD 158
Query: 124 TGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE---LQKQCP 180
TGSDL W+ C + S + D P SS+ + VPC +C +
Sbjct: 159 TGSDLSWVQCKPCAAAPSCYSQKDPLFD-----PAQSSSYAAVPCGGPVCAGLGIYAASA 213
Query: 181 SAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGA 240
+ + C Y V Y DG+ +TG D L L+ + S FGCG Q+G F +G
Sbjct: 214 CSAAQCGYVVSY-GDGSNTTGVYSSDTLTLS-----ASSAVQGFFFGCGHAQSGLF-NGV 266
Query: 241 APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT--GRISFGDKGSPGQGETPFSL 298
+GL GLG ++ S+ + G FS C + + G ++ G G P FS
Sbjct: 267 --DGLLGLGREQPSL--VEQTAGTYGGVFSYCLPTKPSTAGYLTLG-VGGPSGAAPGFST 321
Query: 299 RQTHPT------YNITITQVSVGGNAVNFEFSAI-----FDSGTSFTYLNDPAYTQISET 347
Q P+ Y + +T +SVGG ++ SA D+GT T L AY +
Sbjct: 322 TQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTVVDTGTVVTRLPPTAYAALRSA 381
Query: 348 FNS-LAKEKRETSTSDLPFEYCY 369
F S +A T+ S+ + CY
Sbjct: 382 FRSGMASYGYPTAPSNGILDTCY 404
>gi|115473845|ref|NP_001060521.1| Os07g0658600 [Oryza sativa Japonica Group]
gi|22775625|dbj|BAC15479.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
gi|50510141|dbj|BAD31106.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
gi|113612057|dbj|BAF22435.1| Os07g0658600 [Oryza sativa Japonica Group]
Length = 449
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 90/358 (25%), Positives = 141/358 (39%), Gaps = 55/358 (15%)
Query: 64 AHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALD 123
A RD L LA +G P+ ++G + + + +G PA ++A+D
Sbjct: 72 AARDASRLLYLDSLAVKGRAYAPI--ASGRQLLQTPT----YVVRARLGTPAQQLLLAVD 125
Query: 124 TGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCEL--QKQCP 180
T +D W+PC C C + ++P S++ VPC S C L C
Sbjct: 126 TSNDAAWIPCSGCAGCPTS-----------SPFNPAASASYRPVPCGSPQCVLAPNPSCS 174
Query: 181 SAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGA 240
+C + + Y +D ++ L +D L +A D V +FGC + TG+ A
Sbjct: 175 PNAKSCGFSLSY-ADSSLQAA-LSQDTLAVAGD------VVKAYTFGCLQRATGT---AA 223
Query: 241 APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS----DGTGRISFGDKGSPGQGETPF 296
P GL GLG S + + + +FS C S + +G + G G P + +T
Sbjct: 224 PPQGLLGLGRGPLSF--LSQTKDMYGATFSYCLPSFKSLNFSGTLRLGRNGQPRRIKTTP 281
Query: 297 SLRQTHPT--YNITITQVSVGGNAVNFEFSA-----------IFDSGTSFTYLNDPAYTQ 343
L H + Y + +T + VG V+ SA + DSGT FT L P Y
Sbjct: 282 LLANPHRSSLYYVNMTGIRVGKKVVSIPASALAFDPATGAGTVLDSGTMFTRLVAPVYLA 341
Query: 344 ISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVI 401
+ + +S F+ CY T +P V L G + +VI
Sbjct: 342 LRDEVRRRVGAGAAAVSSLGGFDTCY-----NTTVAWPPVTLLFDGMQVTLPEENVVI 394
>gi|449522369|ref|XP_004168199.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Cucumis sativus]
Length = 457
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 99/381 (25%), Positives = 148/381 (38%), Gaps = 76/381 (19%)
Query: 84 KTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD----CVSCV 139
KTP + S +S G + T +S G P + + DTGS L W PC C C
Sbjct: 61 KTPKSNSVFKSPLSPHSYG-AYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECS 119
Query: 140 HGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC------ELQKQCPSAG-------SNC 186
+G + P SS+S V C + C +++ QC S C
Sbjct: 120 FPKIDPTG----IPRFVPKLSSSSKLVGCQNPKCSWIFGPDVKSQCRSCNPKTENCTQTC 175
Query: 187 P-YQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGL 245
P Y V+Y S T G L+ + L K + + + GC SFL P+G+
Sbjct: 176 PAYVVQYGSGST--AGLLLSETLDFP-----DKXIPNFV-VGC------SFLSIHQPSGI 221
Query: 246 FGLGMDKTSVPSILANQGLIPNSFSMCFGS--------------DGTGRISFGDKGSPGQ 291
G G S+PS GL F+ C S D TG S G +P +
Sbjct: 222 AGFGRGSESLPS---QMGL--KKFAYCLASRKFDDSPHSGQLILDSTGVKSSGLTYTPFR 276
Query: 292 GETPFSLRQTHPTYNITITQVSVGGNAVNFEFS-----------AIFDSGTSFTYLNDPA 340
S Y + I ++ VG AV + +I DSG++FT+++ P
Sbjct: 277 QNPSVSNNAYKEYYYLNIRKIIVGNQAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPV 336
Query: 341 YTQISETF-NSLAKEKRETSTSDLP-FEYCYVLSPNQTNFEYPVVNLTMKGGGPFF--VN 396
++ F LA R T L C+ +S + + ++P + KGG + +N
Sbjct: 337 LEVVAREFEKQLANWTRATDVETLTGLRPCFDIS-KEKSVKFPELIFQFKGGAKWALPLN 395
Query: 397 DPIVIVSSEPKGLYLYCLGVV 417
+ +VSS + CL VV
Sbjct: 396 NYFALVSSS----GVACLTVV 412
>gi|330794218|ref|XP_003285177.1| hypothetical protein DICPUDRAFT_96947 [Dictyostelium purpureum]
gi|325084898|gb|EGC38316.1| hypothetical protein DICPUDRAFT_96947 [Dictyostelium purpureum]
Length = 817
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 99/366 (27%), Positives = 154/366 (42%), Gaps = 71/366 (19%)
Query: 103 FLHYTNVSVGQPALSFIVALDTGSDLFWLP-CDCVSCVHGLNSSSGQVIDFNI---YSPN 158
F ++ + VG P F V +DTGS +P +C +S D N+ YS
Sbjct: 203 FEYFIPILVGTPPQMFTVQVDTGSTSLAVPGSNCYLYKSQSIKTSCSCSDGNLDGLYSLE 262
Query: 159 TSSTSSKVPCNSTL-CELQKQCPSAGSN--CPYQVRYLSDGTMSTGFLVEDVLHLATDEK 215
S +S+++ C+ T C C + SN CP+ ++Y DG+ G LV D + +
Sbjct: 263 ESISSNQLNCSDTSNC---NTCKNNKSNKPCPFVLKY-GDGSFIAGSLVIDHVTIGDFTV 318
Query: 216 QSKSVDSRISFGCGRVQTGSFLDGAAP---------NGLFGLGMDKTS------VPSILA 260
+K FG + ++ SF P +G+ GL + + S +
Sbjct: 319 PAK-------FGNIQKESLSFSQLTCPSTQRSQAVRDGILGLSFQQLDPDNGDDIFSKIV 371
Query: 261 NQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETP--FSLRQTHPTYNITITQVSVGGNA 318
IPN FSMC G DG G ++ G ETP + +H Y+IT+T + VG ++
Sbjct: 372 AHYNIPNVFSMCLGKDG-GLLTIGGTNDHITQETPKYTPIFDSH-YYSITVTNIYVGNDS 429
Query: 319 VNFE----FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTS-----DLPFEY-- 367
+N ++I DSGT+ Y +D E F S+ + E + PF
Sbjct: 430 LNLAPPDLSTSIVDSGTTLLYFSD-------EIFYSIVRNLEEKHCELPGICNDPFWEGN 482
Query: 368 CYVLSPNQTNFEYPVVNLTMKG--GGPFFVNDPIVIVSSEPKGLY------LYCLGVVKS 419
C+ L + EYP + L MKG G P F + P LY LYC G+
Sbjct: 483 CHHLEEKLIS-EYPTIYLEMKGMNGEPSFKLEV-------PPDLYFLNINGLYCFGISHM 534
Query: 420 DNVNII 425
++++
Sbjct: 535 KEISVL 540
>gi|302774174|ref|XP_002970504.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
gi|300162020|gb|EFJ28634.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
Length = 483
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 47/131 (35%), Positives = 62/131 (47%), Gaps = 21/131 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ + VG PA S + +DTGSDL WL C C SC + I+ P SS+
Sbjct: 129 YFVRLGVGTPARSLFMVVDTGSDLPWLQCQPCKSCYKQADP---------IFDPRNSSSF 179
Query: 164 SKVPCNSTLC---ELQKQCPSAG--SNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
++PC S LC E+ S G S C YQV Y DG+ S G D+ L T K
Sbjct: 180 QRIPCLSPLCKALEIHSCSGSRGATSRCSYQVAY-GDGSFSVGDFSSDLFTLGTGSKAMS 238
Query: 219 SVDSRISFGCG 229
++FGCG
Sbjct: 239 -----VAFGCG 244
>gi|357143680|ref|XP_003573011.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 510
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 80/272 (29%), Positives = 112/272 (41%), Gaps = 44/272 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ V VG P F + +DTGSDL WL C C+ C G V D P S++
Sbjct: 150 YLVEVYVGTPPRRFQMIMDTGSDLNWLQCAPCLDCF----DQRGPVFD-----PMASTSY 200
Query: 164 SKVPCNSTLCEL------QKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
V C T C L + C S+ S+ CPY Y D + +TG L + +
Sbjct: 201 RNVTCGDTRCGLVSPPAAPRTCRSSRSDPCPYYYWY-GDQSNTTGDLALEAFTVNLTASS 259
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD 276
S+ VD + GCG G F A L GLG S S L + + ++FS C
Sbjct: 260 SRRVDG-VVLGCGHRNRGLFHGAAG---LLGLGRGPLSFASQL--RAVYGHAFSYCLVDH 313
Query: 277 GTG---RISFGDK----GSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFS---- 324
G+ +I FGD P T F+ T Y + + + VGG ++ +
Sbjct: 314 GSAVGSKIVFGDDNVLLSHPQLNYTAFAPSAAENTFYYVQLKGILVGGEMLDIPSNTWGV 373
Query: 325 --------AIFDSGTSFTYLNDPAYTQISETF 348
I DSGT+ +Y +PAY I + F
Sbjct: 374 SKEDGSGGTIIDSGTTLSYFPEPAYKAIRQAF 405
>gi|224085379|ref|XP_002307559.1| predicted protein [Populus trichocarpa]
gi|222857008|gb|EEE94555.1| predicted protein [Populus trichocarpa]
Length = 455
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 88/301 (29%), Positives = 125/301 (41%), Gaps = 49/301 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
++ +V +G P + + LDTGSDL W+ CV C H +G Y P SS+
Sbjct: 90 YFMDVFIGTPPKHYSLILDTGSDLNWI--QCVPC-HDCFEQNGPY-----YDPKESSSFR 141
Query: 165 KVPCNSTLCELQKQ------CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
+ C+ C L C + CPY Y D + +TG + + K
Sbjct: 142 NIGCHDPRCHLVSSPDPPLPCKAENQTCPYFYWY-GDSSNTTGDFATETFTVNLTSPTGK 200
Query: 219 SVDSRIS---FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-- 273
S R+ FGCG G F GA+ GL GLG S S L Q L +SFS C
Sbjct: 201 SEFKRVENVMFGCGHWNRGLF-HGAS--GLLGLGRGPLSFSSQL--QSLYGHSFSYCLVD 255
Query: 274 ---GSDGTGRISFG-DKGSPGQGETPFSLR---QTHPT---YNITITQVSVGGNAVNFEF 323
++ + ++ FG DK E F+ + +P Y + I + VGG +N
Sbjct: 256 RNSDTNVSSKLIFGEDKDLLNHPELNFTTLVGGKENPVDTFYYVQIKSIMVGGEVLNIPE 315
Query: 324 S-----------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP-FEYCYVL 371
S I DSGT+ +Y +PAY I + F + K K D P + CY +
Sbjct: 316 STWNMTSDGVGGTIVDSGTTLSYFTEPAYQIIKDAF--VKKVKGYPIVQDFPILDPCYNV 373
Query: 372 S 372
S
Sbjct: 374 S 374
>gi|297827577|ref|XP_002881671.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297327510|gb|EFH57930.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 438
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 89/336 (26%), Positives = 136/336 (40%), Gaps = 72/336 (21%)
Query: 99 NSLGFLH----YTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNI 154
+ L F H ++VG P + + LDTGS+L WL C + ++
Sbjct: 51 DKLSFRHNVTLTVTLAVGSPPQNISMVLDTGSELSWLHCKKSPNLG------------SV 98
Query: 155 YSPNTSSTSSKVPCNSTLCELQKQ-------CPSAGSNCPYQVRYLSDGTMSTGFLVEDV 207
++P +SST S VPC+S +C + + C C + Y +D T G L D
Sbjct: 99 FNPVSSSTYSPVPCSSPICRTRTRDLPIPASCDPKTHFCHVAISY-ADATSIEGNLAHDT 157
Query: 208 LHLATDEKQSKSVDSRISFGCGRVQTGSFLDG---AAPNGLFGLGMDKTSVPSILANQGL 264
+ + + FGC + +G D A GL +GM++ S+ S + G
Sbjct: 158 FVIGSVTRPGT------LFGC--MDSGLSSDSEEDAKSTGL--MGMNRGSL-SFVNQLGF 206
Query: 265 IPNSFSMCF-GSDGTGRISFGDKGSPGQGE---TPFSLRQT------HPTYNITITQVSV 314
+ FS C GSD +G + GD G TP L+ T Y + + + V
Sbjct: 207 --SKFSYCISGSDSSGILLLGDASYSWLGPIQYTPLVLQTTPLPYFDRVAYTVQLEGIRV 264
Query: 315 GGNAVNFEFSA-----------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDL 363
G ++ S + DSGT FT+L P YT + F +A+ K D
Sbjct: 265 GSKILSLPKSVFVPDHTGAGQTMVDSGTQFTFLMGPVYTALKNEF--IAQTKSVLRIVDD 322
Query: 364 P-------FEYCY-VLSPNQTNFE-YPVVNLTMKGG 390
P + CY V S + NF PV++L +G
Sbjct: 323 PNFVFQGTMDLCYRVGSSTRPNFTGLPVISLMFRGA 358
>gi|356546376|ref|XP_003541602.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 450
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 85/342 (24%), Positives = 131/342 (38%), Gaps = 46/342 (13%)
Query: 110 SVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPC 168
SVG P + +DTGS + W+ C C C I+ P+ S T +PC
Sbjct: 102 SVGTPPFEILGVVDTGSGITWMQCQRCEDCYEQTTP---------IFDPSKSKTYKTLPC 152
Query: 169 NSTLCELQKQCPSAGSN---CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
+S +C+ PS S+ C Y ++Y DG+ S G L + L L + S + +
Sbjct: 153 SSNMCQSVISTPSCSSDKIGCKYTIKY-GDGSHSQGDLSVETLTLGSTNGSSVQFPNTV- 210
Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG-----SDGTGR 280
GCG G+F + G G + G FS C S+ + +
Sbjct: 211 IGCGHNNKGTFQGEGSGVVGLGGGPVSLISQLSSSIGG----KFSYCLAPMFSQSNSSSK 266
Query: 281 ISFGDKG---SPGQGETPF-SLRQTHPTYNITITQVSVGGNAVNF------------EFS 324
++FGD G TP S + Y +T+ SVG + F E +
Sbjct: 267 LNFGDAAVVSGLGAVSTPLVSKTGSEVFYYLTLEAFSVGDKRIEFVGGSSSSGSSNGEGN 326
Query: 325 AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVN 384
I DSGT+ T L Y+ + + R + S+ CY +P+ + PV+
Sbjct: 327 IIIDSGTTLTLLPQEDYSNLESAVADAIQANRVSDPSNF-LSLCYQTTPS-GQLDVPVIT 384
Query: 385 LTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
KG +PI +G + C S+ V+I G
Sbjct: 385 AHFKGADVEL--NPISTFVQVAEG--VVCFAFHSSEVVSIFG 422
>gi|302793638|ref|XP_002978584.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
gi|300153933|gb|EFJ20570.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
Length = 407
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 53/171 (30%), Positives = 76/171 (44%), Gaps = 22/171 (12%)
Query: 65 HRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDT 124
R R+ + + LA + D+ T G T L ++ + +G PA S + +DT
Sbjct: 15 RRVRWIESKAK-LAGKKKDEASSTDLNGPVTSGLLYGSGEYFVRLGLGTPARSLFMVVDT 73
Query: 125 GSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAG 183
GSDL WL C C SC + I+ P SS+ ++PC S LC+ + +G
Sbjct: 74 GSDLPWLQCQPCKSCYKQADP---------IFDPRNSSSFQRIPCLSPLCKALEVHSCSG 124
Query: 184 -----SNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCG 229
S C YQV Y DG+ S G D+ L T K ++FGCG
Sbjct: 125 SRGATSRCSYQVAY-GDGSFSVGDFSSDLFTLGTGSKAMS-----VAFGCG 169
>gi|340810981|gb|AEK75417.1| S5 [Oryza rufipogon]
Length = 357
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 84/327 (25%), Positives = 124/327 (37%), Gaps = 54/327 (16%)
Query: 109 VSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
VS+G+P + +VA+DTGS L W+ C C H ++ +G + D P S TS +V
Sbjct: 3 VSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFD-----PGRSYTSRRVR 57
Query: 168 CNSTLC-EL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
C+S C EL Q C +C Y V Y + S G +V D L +
Sbjct: 58 CSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDSFMD--- 114
Query: 220 VDSRISFGCGR-VQTGSFLDGAAPNGLFGLGMDK--TSVPSILANQGLIPNSFSMCFGSD 276
+ FGC V+ F G G + P IL+ + L S C +D
Sbjct: 115 ----LMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKAL-----SYCLPTD 165
Query: 277 GT--GRISFG--DKGSPGQGETPFSLRQTHPTYNITITQ-VSVGGNAVNFEFSAIFDSGT 331
T G + G D+ + G TP PTY++T+ ++ G V I DSG
Sbjct: 166 ETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRLVTSSSEMIVDSGA 225
Query: 332 SFTYLNDPAYTQISETFNSLAKE--KRETSTSDLPFEYCYV-----------LSPNQTNF 378
T L + + +T TS + CY+ ++P
Sbjct: 226 QRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNGTITPFSNWS 285
Query: 379 EYPVVNLTMKGGGPF-------FVNDP 398
P++ + GG F NDP
Sbjct: 286 ALPLLEIGFAGGAALALSPRNVFYNDP 312
>gi|340810961|gb|AEK75407.1| S5 [Oryza sativa]
gi|340811037|gb|AEK75445.1| S5 [Oryza rufipogon]
Length = 357
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 84/327 (25%), Positives = 124/327 (37%), Gaps = 54/327 (16%)
Query: 109 VSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
VS+G+P + +VA+DTGS L W+ C C H ++ +G + D P S TS +V
Sbjct: 3 VSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFD-----PGRSYTSRRVR 57
Query: 168 CNSTLC-EL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
C+S C EL Q C +C Y V Y + S G +V D L +
Sbjct: 58 CSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDSFMD--- 114
Query: 220 VDSRISFGCGR-VQTGSFLDGAAPNGLFGLGMDK--TSVPSILANQGLIPNSFSMCFGSD 276
+ FGC V+ F G G + P IL+ + L S C +D
Sbjct: 115 ----LMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKAL-----SYCLPTD 165
Query: 277 GT--GRISFG--DKGSPGQGETPFSLRQTHPTYNITITQ-VSVGGNAVNFEFSAIFDSGT 331
T G + G D+ + G TP PTY++T+ ++ G V I DSG
Sbjct: 166 ETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRLVTSSSEMIVDSGA 225
Query: 332 SFTYLNDPAYTQISETFNSLAKE--KRETSTSDLPFEYCYV-----------LSPNQTNF 378
T L + + +T TS + CY+ ++P
Sbjct: 226 QRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNGTITPFSNWS 285
Query: 379 EYPVVNLTMKGGGPF-------FVNDP 398
P++ + GG F NDP
Sbjct: 286 ALPLLEIGFAGGAALALPPRNVFYNDP 312
>gi|449461377|ref|XP_004148418.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
gi|449518059|ref|XP_004166061.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 436
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 86/307 (28%), Positives = 123/307 (40%), Gaps = 52/307 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ V +G P + + LDT +D W PC C+ C SS+ +S SST
Sbjct: 95 YVVRVQLGTPGQTMYMVLDTSNDAAWAPCSGCIGC-----SST------TTFSAQNSSTF 143
Query: 164 SKVPCNSTLCELQK--QCPSAGS-NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
+ + C+ C + CP+ G+ +C + Y D T S LV+D LHL + V
Sbjct: 144 ATLDCSKPECTQARGLSCPTTGNVDCLFNQTYGGDSTFS-ATLVQDSLHLGPN------V 196
Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDG--- 277
SFGC +GS + P GL GLG S+ I + L FS C S
Sbjct: 197 IPNFSFGCISSASGSSI---PPQGLMGLGRGPLSL--ISQSGSLYSGLFSYCLPSFKSYY 251
Query: 278 -TGRISFGDKGSPGQGETPFSLRQTH-PT-YNITITQVSVGGNAV-----------NFEF 323
+G + G G P T L H P+ Y + +T +SVG V N
Sbjct: 252 FSGSLKLGPVGQPKAIRTTPLLHNPHRPSLYYVNLTGISVGRVLVPISPELLAFDPNTGA 311
Query: 324 SAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDL-PFEYCYVLSPNQTNFEYPV 382
I DSGT T YT + + F +++ S S L F+ C+ P
Sbjct: 312 GTIIDSGTVITRFVPAIYTAVRDEF----RKQVGGSFSPLGAFDTCFA---TNNEVSAPA 364
Query: 383 VNLTMKG 389
+ L + G
Sbjct: 365 ITLHLSG 371
>gi|340811098|gb|AEK75475.1| S5 [Oryza nivara]
Length = 357
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 84/327 (25%), Positives = 124/327 (37%), Gaps = 54/327 (16%)
Query: 109 VSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
VS+G+P + +VA+DTGS L W+ C C H ++ +G + D P S TS +V
Sbjct: 3 VSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFD-----PGRSYTSRRVR 57
Query: 168 CNSTLC-EL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
C+S C EL Q C +C Y V Y + S G +V D L +
Sbjct: 58 CSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDSFMD--- 114
Query: 220 VDSRISFGCGR-VQTGSFLDGAAPNGLFGLGMDK--TSVPSILANQGLIPNSFSMCFGSD 276
+ FGC V+ F G G + P IL+ + +FS C +D
Sbjct: 115 ----LMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYK-----AFSYCLPTD 165
Query: 277 GT--GRISFG--DKGSPGQGETPFSLRQTHPTYNITITQ-VSVGGNAVNFEFSAIFDSGT 331
T G + G D+ + G TP PTY++T ++ G V I DSG
Sbjct: 166 ETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTTEMLIANGQRLVTSSSEMIVDSGA 225
Query: 332 SFTYLNDPAYTQISETFNSLAKE--KRETSTSDLPFEYCYV-----------LSPNQTNF 378
T L + + +T TS + CY+ ++P
Sbjct: 226 QRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNGTITPFSNWS 285
Query: 379 EYPVVNLTMKGGGPF-------FVNDP 398
P++ + GG F NDP
Sbjct: 286 ALPLLEIGFAGGAALALSPRNVFYNDP 312
>gi|224080963|ref|XP_002306246.1| predicted protein [Populus trichocarpa]
gi|222855695|gb|EEE93242.1| predicted protein [Populus trichocarpa]
Length = 382
Score = 65.5 bits (158), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 78/287 (27%), Positives = 122/287 (42%), Gaps = 45/287 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ + VG P S + +D+GSD+ W+ C C C H + ++ P S++
Sbjct: 43 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCKPCTQCYHQTDP---------LFDPADSASF 93
Query: 164 SKVPCNSTLCELQKQCPSAGSN---CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
V C+S +C+ Q +AG N C Y+V Y DG+ + G L + L L ++V
Sbjct: 94 MGVSCSSAVCD---QVDNAGCNSGRCRYEVSY-GDGSSTKGTLALETLTLG------RTV 143
Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT-- 278
++ GCG + G F+ A GL G M + V + +G N+FS C S T
Sbjct: 144 VQNVAIGCGHMNQGMFVGAAGLLGLGGGSM--SFVGQLSRERG---NAFSYCLVSRVTNS 198
Query: 279 -GRISFGDKGSP-GQGETPFSLRQTHPT-YNITITQVSVGGNAVN-----FEFS------ 324
G + FG + P G P P+ Y I ++ + VG V FE +
Sbjct: 199 NGFLEFGSEAMPVGAAWIPLIRNPHSPSYYYIGLSGLGVGDMKVPISEDIFELTELGNGG 258
Query: 325 AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVL 371
+ D+GT+ T AY + F S + F+ CY L
Sbjct: 259 VVMDTGTAVTRFPTVAYEAFRDAFIDQTGNLPRASGVSI-FDTCYNL 304
>gi|226491620|ref|NP_001149154.1| pepsin A precursor [Zea mays]
gi|195625132|gb|ACG34396.1| pepsin A [Zea mays]
Length = 537
Score = 65.5 bits (158), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 77/313 (24%), Positives = 118/313 (37%), Gaps = 61/313 (19%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDC--VSCVHGLNSSSGQVIDF--------- 152
++ +V +G PAL + + LDT +DL W+ C H S GQ +
Sbjct: 123 MYLVSVRIGTPALPYNLVLDTATDLTWINCRLRRRKGKHYGRQSMGQTMSVGGEGATAAK 182
Query: 153 -----NIYSPNTSSTSSKVPCNSTLCELQK----QCPSAGSNCPYQVRYLSDGTMSTGFL 203
N Y P SS+ ++ C+ C + Q PS +C Y + DGT++ G
Sbjct: 183 KEASKNWYRPAKSSSWRRIRCSQKECAVLPYNTCQSPSKAESCSY-FQKTQDGTVTIGIY 241
Query: 204 VEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQG 263
++ + + + + I GC ++ G +D A +G+ LG S A +
Sbjct: 242 GKEKATVTVSDGRMAKLPGLI-LGCSVLEAGGSVD--AHDGVLSLGNGDMSFAVHAAKR- 297
Query: 264 LIPNSFSMCFGS-----DGTGRISFGDKGS---PGQGETPFSLR-QTHPTYNITITQVSV 314
FS C S D + ++FG + PG ET P Y +T V V
Sbjct: 298 -FGQRFSFCLLSANSSRDASSYLTFGPNPAVMGPGTMETDILYNVDVKPAYGAKVTGVLV 356
Query: 315 GGNAVNFEFS-----------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDL 363
GG ++ I D+ TS T L AY ++ + S L
Sbjct: 357 GGERLDIPDEVWDAERFVGGGVILDTSTSVTSLVPEAYAPVTAALDR--------HLSHL 408
Query: 364 P-------FEYCY 369
P FEYCY
Sbjct: 409 PRVYELEGFEYCY 421
>gi|22165126|gb|AAM93742.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
Japonica Group]
gi|31433307|gb|AAP54836.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
gi|125575547|gb|EAZ16831.1| hypothetical protein OsJ_32302 [Oryza sativa Japonica Group]
Length = 405
Score = 65.5 bits (158), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 91/377 (24%), Positives = 143/377 (37%), Gaps = 57/377 (15%)
Query: 73 RGRGLAAQGNDKTPLTFSAGNDT---YRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLF 129
RGR LA G D TP +AG L+S G L+ N ++G P +D +L
Sbjct: 27 RGRLLA--GVDATPP--AAGGAVAVPIYLSSQG-LYVANFTIGTPPQPVSAVVDLTGELV 81
Query: 130 WLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPY 188
W C C C D ++ P SST +PC S LCE P + NC
Sbjct: 82 WTQCTPCQPCFEQ---------DLPLFDPTKSSTFRGLPCGSHLCE---SIPESSRNCTS 129
Query: 189 QVRYLSDGTMS--TGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLF 246
V T + TG TD + + FGC + P+G+
Sbjct: 130 DVCIYEAPTKAGDTGG------KAGTDTFAIGAAKETLGFGCVVMTDKRLKTIGGPSGIV 183
Query: 247 GLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQG----ETPFSLRQ-- 300
GLG P L Q + +FS C +G + G G TPF ++
Sbjct: 184 GLGR----TPWSLVTQMNV-TAFSYCLAGKSSGALFLGATAKQLAGGKNSSTPFVIKTSA 238
Query: 301 ------THPTYNITITQVSVGGNAVNFEFSA----IFDSGTSFTYLNDPAYTQISETFNS 350
++P Y + + + GG + S+ + D+ + +YL D AY + + +
Sbjct: 239 GSSDNGSNPYYMVKLAGIKTGGAPLQAASSSGSTVLLDTVSRASYLADGAYKALKKALTA 298
Query: 351 LAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLY 410
A + ++ P++ C+ P + P + T GG V +++S G
Sbjct: 299 -AVGVQPVASPPKPYDLCF---PKAVAGDAPELVFTFDGGAALTVPPANYLLAS---GNG 351
Query: 411 LYCLGVVKSDNVNIIGR 427
CL + S ++N+ G
Sbjct: 352 TVCLTIGSSASLNLTGE 368
>gi|115475621|ref|NP_001061407.1| Os08g0267300 [Oryza sativa Japonica Group]
gi|37806402|dbj|BAC99940.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|113623376|dbj|BAF23321.1| Os08g0267300 [Oryza sativa Japonica Group]
Length = 524
Score = 65.5 bits (158), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 84/343 (24%), Positives = 128/343 (37%), Gaps = 56/343 (16%)
Query: 118 FIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQK 177
V +DTGSDL W+ C S + ++ P+ S++ + VPCN++ CE
Sbjct: 176 LTVIVDTGSDLTWVQCKPCSVCYAQRDP--------LFDPSGSASYAAVPCNASACEASL 227
Query: 178 QCPSA----------------GSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
+ + C Y + Y DG+ S G L D + L SVD
Sbjct: 228 KAATGVPGSCATVGGGGGGGKSERCYYSLAY-GDGSFSRGVLATDTVALG-----GASVD 281
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GSDG 277
+ FGCG G F GL GLG + S+ S A + FS C D
Sbjct: 282 GFV-FGCGLSNRGLF---GGTAGLMGLGRTELSLVSQTAPR--FGGVFSYCLPAATSGDA 335
Query: 278 TGRISFGDKGSPGQGETPFSLRQT------HPTYNITIT----QVSVGGNAVNFEFSAIF 327
G +S G S + TP S + P Y + +T + A + +
Sbjct: 336 AGSLSLGGDTSSYRNATPVSYTRMIADPAQPPFYFMNVTGASVGGAAVAAAGLGAANVLL 395
Query: 328 DSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLT 386
DSGT T L Y + F E+ + + CY L+ + P++ L
Sbjct: 396 DSGTVITRLAPSVYRAVRAEFARQFGAERYPAAPPFSLLDACYNLT-GHDEVKVPLLTLR 454
Query: 387 MKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVK---SDNVNIIG 426
++GG V+ ++ + G + CL + D IIG
Sbjct: 455 LEGGADMTVDAAGMLFMARKDGSQV-CLAMASLSFEDQTPIIG 496
>gi|388502484|gb|AFK39308.1| unknown [Medicago truncatula]
Length = 425
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 93/340 (27%), Positives = 135/340 (39%), Gaps = 53/340 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ +G P + ++A+DT +D W+PC C C L ++P S+T
Sbjct: 93 YIVRAKIGTPPQTLLLAMDTSNDAAWIPCTACDGCASTL------------FAPEKSTTF 140
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGF-LVEDVLHLATDEKQSKSVDS 222
V C + C KQ P+ G + L+ G+ S LV+D + LATD S
Sbjct: 141 KNVSCAAPEC---KQVPNPGCGVSSRNFNLTYGSSSIAANLVQDTITLATDPVPS----- 192
Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS----DGT 278
+FGC TG+ A P GL GLG S+ S Q L ++FS C S + +
Sbjct: 193 -YTFGCVSKTTGT---SAPPQGLLGLGRGPLSLLS--QTQNLYQSTFSYCLPSFKSLNFS 246
Query: 279 GRISFGDKGSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEFSA----------- 325
G + G P + + L+ + Y + + + VG V+ +A
Sbjct: 247 GSLRLGPVAQPKRIKYTPLLKNPRRSSLYYVNLEAIRVGRKVVDIPPAALAFNPTTGAGT 306
Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNL 385
IFDSGT FT L P Y + + F K T TS F+ CY P +
Sbjct: 307 IFDSGTVFTRLVAPVYVAVRDEFRRRVGPKL-TVTSLGGFDTCY-----NVPIVVPTITF 360
Query: 386 TMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNII 425
G D I+I S+ L G DNVN +
Sbjct: 361 IFTGMNVTLPQDNILIHSTAGSTTCLAMAGA--PDNVNSV 398
>gi|125560845|gb|EAZ06293.1| hypothetical protein OsI_28528 [Oryza sativa Indica Group]
Length = 525
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 84/343 (24%), Positives = 128/343 (37%), Gaps = 56/343 (16%)
Query: 118 FIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQK 177
V +DTGSDL W+ C S + ++ P+ S++ + VPCN++ CE
Sbjct: 177 LTVIVDTGSDLTWVQCKPCSVCYAQRDP--------LFDPSGSASYAAVPCNASACEASL 228
Query: 178 QCPSA----------------GSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
+ + C Y + Y DG+ S G L D + L SVD
Sbjct: 229 KAATGVPGSCATVGGGGGGGKSERCYYSLAY-GDGSFSRGVLATDTVALG-----GASVD 282
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GSDG 277
+ FGCG G F GL GLG + S+ S A + FS C D
Sbjct: 283 GFV-FGCGLSNRGLF---GGTAGLMGLGRTELSLVSQTAPR--FGGVFSYCLPAATSGDA 336
Query: 278 TGRISFGDKGSPGQGETPFSLRQT------HPTYNITIT----QVSVGGNAVNFEFSAIF 327
G +S G S + TP S + P Y + +T + A + +
Sbjct: 337 AGSLSLGGDTSSYRNATPVSYTRMIADPAQPPFYFMNVTGASVGGAAVAAAGLGAANVLL 396
Query: 328 DSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLT 386
DSGT T L Y + F E+ + + CY L+ + P++ L
Sbjct: 397 DSGTVITRLAPSVYRAVRAEFARQFGAERYPAAPPFSLLDACYNLT-GHDEVKVPLLTLR 455
Query: 387 MKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVK---SDNVNIIG 426
++GG V+ ++ + G + CL + D IIG
Sbjct: 456 LEGGADMTVDAAGMLFMARKDGSQV-CLAMASLSFEDQTPIIG 497
>gi|340810915|gb|AEK75384.1| S5 [Oryza sativa]
gi|340810917|gb|AEK75385.1| S5 [Oryza sativa]
gi|340810919|gb|AEK75386.1| S5 [Oryza sativa]
gi|340810927|gb|AEK75390.1| S5 [Oryza sativa]
gi|340810975|gb|AEK75414.1| S5 [Oryza nivara]
gi|340810979|gb|AEK75416.1| S5 [Oryza nivara]
gi|340810995|gb|AEK75424.1| S5 [Oryza nivara]
gi|340811027|gb|AEK75440.1| S5 [Oryza nivara]
gi|340811063|gb|AEK75458.1| S5 [Oryza nivara]
Length = 357
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 83/327 (25%), Positives = 125/327 (38%), Gaps = 54/327 (16%)
Query: 109 VSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
VS+G+P + +VA+DTGS L W+ C C H ++ +G + D P S TS +V
Sbjct: 3 VSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFD-----PGRSYTSRRVR 57
Query: 168 CNSTLC-------ELQK-QCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
C+S C LQ+ C +C Y V Y + S G +V D L +
Sbjct: 58 CSSVKCGEPRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDSFMD--- 114
Query: 220 VDSRISFGCGR-VQTGSFLDGAAPNGLFGLGMDK--TSVPSILANQGLIPNSFSMCFGSD 276
+ FGC V+ F G G + P IL+ + +FS C +D
Sbjct: 115 ----LMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYK-----AFSYCLPTD 165
Query: 277 GT--GRISFG--DKGSPGQGETPFSLRQTHPTYNITITQ-VSVGGNAVNFEFSAIFDSGT 331
T G + G D+ + G TP PTY++T+ ++ G V I DSG
Sbjct: 166 ETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRLVTSSSEMIVDSGA 225
Query: 332 SFTYLNDPAYTQISETFNSLAKE--KRETSTSDLPFEYCYV-----------LSPNQTNF 378
T L + + +T TS + CY+ ++P
Sbjct: 226 QRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNGTITPFSNWS 285
Query: 379 EYPVVNLTMKGGGPF-------FVNDP 398
P++ + GG F NDP
Sbjct: 286 ALPLLEIGFAGGAALALSPRNVFYNDP 312
>gi|223949775|gb|ACN28971.1| unknown [Zea mays]
gi|414590177|tpg|DAA40748.1| TPA: hypothetical protein ZEAMMB73_257146 [Zea mays]
Length = 510
Score = 65.1 bits (157), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 95/351 (27%), Positives = 135/351 (38%), Gaps = 46/351 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ +V VG P F + +DTGSDL WL C C+ C G V D P SS+
Sbjct: 149 YLIDVYVGTPPRRFRMIMDTGSDLNWLQCAPCLDCFE----QRGPVFD-----PAASSSY 199
Query: 164 SKVPCNSTLC------ELQKQCPS-AGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
V C C E + C A +CPY Y + +E T
Sbjct: 200 RNVTCGDQRCGLVAPPEAPRACRRPAEDSCPYYYWYGDQSNTTGDLALESFTVNLTAPGA 259
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--- 273
S+ VD + FGCG G F A L GLG S S L + + ++FS C
Sbjct: 260 SRRVDG-VVFGCGHRNRGLFHGAAG---LLGLGRGPLSFASQL--RAVYGHTFSYCLVEH 313
Query: 274 GSDGTGRISFGDK----GSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEFS--- 324
GSD ++ FG+ P T F+ + Y + + V VGG+ +N
Sbjct: 314 GSDAGSKVVFGEDYLVLAHPQLKYTAFAPTSSPADTFYYVKLKGVLVGGDLLNISSDTWD 373
Query: 325 --------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP-FEYCYVLSPNQ 375
I DSGT+ +Y +PAY I + F L + D P CY +S +
Sbjct: 374 VGKDGSGGTIIDSGTTLSYFVEPAYQVIRQAFVDLMS-RLYPLIPDFPVLNPCYNVSGVE 432
Query: 376 TNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
E P ++L G + V +P G+ + ++IIG
Sbjct: 433 RP-EVPELSLLFADGAVWDFPAENYFVRLDPDGIMCLAVRGTPRTGMSIIG 482
>gi|224286173|gb|ACN40797.1| unknown [Picea sitchensis]
Length = 383
Score = 65.1 bits (157), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 85/303 (28%), Positives = 127/303 (41%), Gaps = 55/303 (18%)
Query: 84 KTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGL 142
+TP+T G+ Y + +++G PALS +DTGSDL W C+ C C
Sbjct: 30 ETPVTPDIGSGEYLIQ---------MAIGTPALSLSAIMDTGSDLVWTKCNPCTDCSTSS 80
Query: 143 NSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQK--QCPSAGSNCPYQVRYLSDGTMST 200
++SST SKV C S+LC+ C + G +C Y Y D + ++
Sbjct: 81 IYDP-----------SSSSTYSKVLCQSSLCQPPSIFSCNNDG-DCEYVYPY-GDRSSTS 127
Query: 201 GFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILA 260
G L ++ ++ S+S+ I+FGCG G D GL G G S+ S L
Sbjct: 128 GILSDETFSIS-----SQSL-PNITFGCGHDNQG--FDKVG--GLVGFGRGSLSLVSQLG 177
Query: 261 NQGLIPNSFSMCF----GSDGTGRISFGDKGS---PGQGETPFSLRQTHPTYNITITQVS 313
+ N FS C S T + G+ S G TP + Y +++ +S
Sbjct: 178 PS--MGNKFSYCLVSRTDSSKTSPLFIGNTASLEATTVGSTPLVQSSSTNHYYLSLEGIS 235
Query: 314 VGGNAV-----NFEFSA------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSD 362
VGG ++ F+ + I DSGT+ T+L AY + E S + D
Sbjct: 236 VGGQSLAIPTGTFDIQSDGSGGLIIDSGTTLTFLQQTAYDAVKEAMVSSINLPQADGQLD 295
Query: 363 LPF 365
L F
Sbjct: 296 LCF 298
>gi|340810959|gb|AEK75406.1| S5 [Oryza sativa]
gi|340810971|gb|AEK75412.1| S5 [Oryza rufipogon]
Length = 357
Score = 65.1 bits (157), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 83/327 (25%), Positives = 125/327 (38%), Gaps = 54/327 (16%)
Query: 109 VSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
VS+G+P + +VA+DTGS L W+ C C H ++ +G + D P S TS +V
Sbjct: 3 VSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFD-----PGRSYTSRRVR 57
Query: 168 CNSTLC-------ELQK-QCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
C+S C LQ+ C +C Y V Y + S G +V D L +
Sbjct: 58 CSSVKCGEPRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDSFMD--- 114
Query: 220 VDSRISFGCGR-VQTGSFLDGAAPNGLFGLGMDK--TSVPSILANQGLIPNSFSMCFGSD 276
+ FGC V+ F G G + P IL+ + +FS C +D
Sbjct: 115 ----LMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYK-----AFSYCLPTD 165
Query: 277 GT--GRISFG--DKGSPGQGETPFSLRQTHPTYNITITQ-VSVGGNAVNFEFSAIFDSGT 331
T G + G D+ + G TP PTY++T+ ++ G V I DSG
Sbjct: 166 ETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRLVTSSSEMIVDSGA 225
Query: 332 SFTYLNDPAYTQISETFNSLAKE--KRETSTSDLPFEYCYV-----------LSPNQTNF 378
T L + + +T TS + CY+ ++P
Sbjct: 226 QRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNGTITPFSNWS 285
Query: 379 EYPVVNLTMKGGGPF-------FVNDP 398
P++ + GG F NDP
Sbjct: 286 ALPLLEIGFAGGAALALPPRNVFYNDP 312
>gi|224093400|ref|XP_002309912.1| predicted protein [Populus trichocarpa]
gi|222852815|gb|EEE90362.1| predicted protein [Populus trichocarpa]
Length = 382
Score = 65.1 bits (157), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 72/284 (25%), Positives = 113/284 (39%), Gaps = 39/284 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ + +G P S + +D+GSD+ W+ C C C H + ++ P S++
Sbjct: 43 YFVRIGLGSPPRSQYMVIDSGSDIVWVQCKPCTQCYHQTDP---------LFDPADSASF 93
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
V C+S +C+ + C Y+V Y DG+ + G L + L ++V
Sbjct: 94 MGVSCSSAVCDRVENAGCNSGRCRYEVSY-GDGSYTKGTLALETLTFG------RTVVRN 146
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT---GR 280
++ GCG G F+ A GL G M S G N+FS C S GT G
Sbjct: 147 VAIGCGHSNRGMFVGAAGLLGLGGGSMSFMGQLS-----GQTGNAFSYCLVSRGTNTNGF 201
Query: 281 ISFGDKGSP-GQGETPFSLRQTHPT-YNITITQVSVGGNAVN-----FEFS------AIF 327
+ FG + P G P P+ Y I + + VG V F+ + +
Sbjct: 202 LEFGSEAMPVGAAWIPLVRNPRAPSFYYIRLLGLGVGDTRVPVSEDVFQLNELGSGGVVM 261
Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVL 371
D+GT+ T AY F + S + F+ CY L
Sbjct: 262 DTGTAVTRFPTVAYEAFRNAFIEQTQNLPRASGVSI-FDTCYNL 304
>gi|226491934|ref|NP_001140743.1| uncharacterized protein LOC100272818 [Zea mays]
gi|194700872|gb|ACF84520.1| unknown [Zea mays]
Length = 351
Score = 65.1 bits (157), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 78/287 (27%), Positives = 118/287 (41%), Gaps = 36/287 (12%)
Query: 114 PALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC 173
P + V LD+ SD+ W+ CV C + QV F Y P+ S TS+ C+S C
Sbjct: 25 PGVIQTVVLDSASDVPWV--QCVPCP--IPPCHPQVDSF--YDPSRSPTSAAFSCSSPTC 78
Query: 174 EL----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCG 229
C A + C Y VRY DG+ ++G + D+L L + + S FGC
Sbjct: 79 TALGPYANGC--ANNQCQYLVRY-PDGSSTSGAYIADLLTL-----DAGNAVSGFKFGCS 130
Query: 230 RVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSP 289
+ GSF AA G+ LG S+ S A++ N+FS C + + F G P
Sbjct: 131 HAEQGSFDARAA--GIMALGGGPESLLSQTASR--YGNAFSYCIPATASDS-GFFTLGVP 185
Query: 290 GQGETPF------SLRQTHPTYNITITQVSVGGNAVN-----FEFSAIFDSGTSFTYLND 338
+ + + RQ Y + + ++VGG + F ++ DS T+ T L
Sbjct: 186 RRASSRYVVTPMVRFRQAATFYGVLLRTITVGGQRLGVAPAVFAAGSVLDSRTAITRLPP 245
Query: 339 PAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNL 385
AY + F S R + CY + N P ++L
Sbjct: 246 TAYQALRAAFRSSMTMYRSAPPKGY-LDTCYDFT-GVVNIRLPKISL 290
>gi|255563827|ref|XP_002522914.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223537841|gb|EEF39457.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 65.1 bits (157), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 99/381 (25%), Positives = 150/381 (39%), Gaps = 60/381 (15%)
Query: 68 RYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSD 127
R RL LAA N + +GN + +N +++G P ++ +DTGSD
Sbjct: 72 RLERLNAMVLAASSNAEINSPVLSGNGEFLMN---------LAIGTPPETYSAIMDTGSD 122
Query: 128 LFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNC 186
L W C C C + I+ P SS+ SK+ C+S LC+ Q S +C
Sbjct: 123 LIWTQCKPCTQCFDQPSP---------IFDPKKSSSFSKLSCSSQLCKALPQS-SCSDSC 172
Query: 187 PYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGS-FLDGAAPNGL 245
Y Y D + + G + + K + FGCG G F G+ GL
Sbjct: 173 EYLYTY-GDYSSTQGTMATETFTFG------KVSIPNVGFGCGEDNEGDGFTQGS---GL 222
Query: 246 FGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGT-------GRISFGDKGSPGQGETPFS 297
GLG S+ S L FS C S D T G ++ + S TP
Sbjct: 223 VGLGRGPLSLVSQLKEA-----KFSYCLTSIDDTKTSTLLMGSLASVNGTSAAIRTTPLI 277
Query: 298 LRQTHPT-YNITITQVSVGGNAVNFEFSA-----------IFDSGTSFTYLNDPAYTQIS 345
P+ Y +++ +SVGG + + S I DSGT+ TYL + A+ +
Sbjct: 278 QNPLQPSFYYLSLEGISVGGTRLPIKESTFQLQDDGTGGLIIDSGTTITYLEESAFDLVK 337
Query: 346 ETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSE 405
+ F S + S + E CY L + + E P + L G + +I S
Sbjct: 338 KEFTSQMGLPVDNSGAT-GLELCYNLPSDTSELEVPKLVLHFTGADLELPGENYMIADSS 396
Query: 406 PKGLYLYCLGVVKSDNVNIIG 426
+ + CL + S ++I G
Sbjct: 397 ---MGVICLAMGSSGGMSIFG 414
>gi|388498308|gb|AFK37220.1| unknown [Lotus japonicus]
Length = 363
Score = 65.1 bits (157), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 65/213 (30%), Positives = 95/213 (44%), Gaps = 38/213 (17%)
Query: 120 VALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE-LQ- 176
V +DTGSDL W+ C+ C+SC + ++ P+TSS+ +PCNS+ C+ LQ
Sbjct: 158 VIIDTGSDLTWVQCEPCMSCYNQQGP---------VFKPSTSSSYQSIPCNSSTCQSLQL 208
Query: 177 -----KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRV 231
C S SNC Y V Y DG+ + G L + L SV S FGCG+
Sbjct: 209 TTGNAGACESNPSNCSYAVNY-GDGSYTNGELGAEHLSFG-----GISV-SNFVFGCGKN 261
Query: 232 QTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGTGRISFGDKGS 288
G F +GL GLG S+ I FS C + +G ++ G++ S
Sbjct: 262 NKGLF---GGVSGLMGLGRSNLSL--ISQTNSTFGGVFSYCLPPTDAGASGSLAMGNESS 316
Query: 289 PGQGETPFSLRQTHPT------YNITITQVSVG 315
+ TP + + P Y + +T + VG
Sbjct: 317 VFKNLTPIAYTRMVPNPQLSNFYMLNLTGIDVG 349
>gi|67633548|gb|AAY78698.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 392
Score = 65.1 bits (157), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 81/329 (24%), Positives = 128/329 (38%), Gaps = 52/329 (15%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
++ + VG P +DTGSDL W C C +C I+ P+ SST
Sbjct: 60 IYLMKLQVGTPPFEIEAEIDTGSDLIWTQCMPCTNCYSQYAP---------IFDPSNSST 110
Query: 163 SSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
+ CN G++C Y++ Y +D T S G L + + + + + V
Sbjct: 111 FKEKRCN-------------GNSCHYKIIY-ADTTYSKGTLATETVTIHSTSGE-PFVMP 155
Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRIS 282
+ GCG S +G+ GL +S+ I G P S CF S GT +I+
Sbjct: 156 ETTIGCGH---NSSWFKPTFSGMVGLSWGPSSL--ITQMGGEYPGLMSYCFASQGTSKIN 210
Query: 283 FGDK---GSPGQGETPFSLRQTHP-TYNITITQVSVGGNAVN--------FEFSAIFDSG 330
FG G T L P Y + + VSVG V E + I DSG
Sbjct: 211 FGTNAIVAGDGVVSTTMFLTTAKPGLYYLNLDAVSVGDTHVETMGTTFHALEGNIIIDSG 270
Query: 331 TSFTYLNDPAYTQISETFNSLAKEKR--ETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMK 388
T+ TY + E + R + + +D+ CY T +PV+ +
Sbjct: 271 TTLTYFPVSYCNLVREAVDHYVTAVRTADPTGNDM---LCYY---TDTIDIFPVITMHFS 324
Query: 389 GGGPFFVNDPIVIVSSEPKGLYLYCLGVV 417
GG ++ + + + +G +CL ++
Sbjct: 325 GGADLVLDKYNMYIETITRG--TFCLAII 351
>gi|7548466|gb|AAA34371.2| secreted aspartyl proteinase 1 [Candida albicans]
Length = 391
Score = 64.7 bits (156), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 71/281 (25%), Positives = 110/281 (39%), Gaps = 56/281 (19%)
Query: 98 LNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDF----N 153
LN+ + ++++G F V +DTGS W+P V+C GQ DF
Sbjct: 57 LNNEHVSYAADITIGSNKQKFNVIVDTGSSDLWVPDASVTCDK---PRPGQSADFCKGKG 113
Query: 154 IYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
IY+P +S+TS + P+ + Y DG+ S G L +D
Sbjct: 114 IYTPKSSTTSQNL------------------GTPFNIGY-GDGSSSQGTLYKDT------ 148
Query: 214 EKQSKSVDSRISFGCGRVQTGSFLD---GAAPNGLFGLGM-------DKTSVPSILANQG 263
+ FG + F D + P G+ G+G D +VP L NQG
Sbjct: 149 ----------VGFGGASITKQVFADITKTSIPQGILGIGYKTNEAAGDYDNVPVTLKNQG 198
Query: 264 LI-PNSFSMCFGS--DGTGRISFGDKGSPGQGETPFSLRQTHP-TYNITITQVSVGGNAV 319
+I N++S+ S TG+I FG + ++ T IT+ + G +
Sbjct: 199 VIAKNAYSLYLNSPNAATGQIIFGGVDKAKYSGSLIAVPVTSDRELRITLNSLKAVGKNI 258
Query: 320 NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETST 360
N + DSGT+ TYL I + F + K + T
Sbjct: 259 NGNIDVLLDSGTTITYLQQDVAQDIIDAFQAELKSDGQGHT 299
>gi|15226317|ref|NP_180370.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4063755|gb|AAC98463.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197953|gb|AAM15327.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252977|gb|AEC08071.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 392
Score = 64.7 bits (156), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 81/329 (24%), Positives = 128/329 (38%), Gaps = 52/329 (15%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
++ + VG P +DTGSDL W C C +C I+ P+ SST
Sbjct: 60 IYLMKLQVGTPPFEIEAEIDTGSDLIWTQCMPCTNCYSQYAP---------IFDPSNSST 110
Query: 163 SSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
+ CN G++C Y++ Y +D T S G L + + + + + V
Sbjct: 111 FKEKRCN-------------GNSCHYKIIY-ADTTYSKGTLATETVTIHSTSGE-PFVMP 155
Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRIS 282
+ GCG S +G+ GL +S+ I G P S CF S GT +I+
Sbjct: 156 ETTIGCGH---NSSWFKPTFSGMVGLSWGPSSL--ITQMGGEYPGLMSYCFASQGTSKIN 210
Query: 283 FGDK---GSPGQGETPFSLRQTHP-TYNITITQVSVGGNAVN--------FEFSAIFDSG 330
FG G T L P Y + + VSVG V E + I DSG
Sbjct: 211 FGTNAIVAGDGVVSTTMFLTTAKPGLYYLNLDAVSVGDTHVETMGTTFHALEGNIIIDSG 270
Query: 331 TSFTYLNDPAYTQISETFNSLAKEKR--ETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMK 388
T+ TY + E + R + + +D+ CY T +PV+ +
Sbjct: 271 TTLTYFPVSYCNLVREAVDHYVTAVRTADPTGNDM---LCYY---TDTIDIFPVITMHFS 324
Query: 389 GGGPFFVNDPIVIVSSEPKGLYLYCLGVV 417
GG ++ + + + +G +CL ++
Sbjct: 325 GGADLVLDKYNMYIETITRG--TFCLAII 351
>gi|222637611|gb|EEE67743.1| hypothetical protein OsJ_25435 [Oryza sativa Japonica Group]
Length = 396
Score = 64.7 bits (156), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 81/317 (25%), Positives = 126/317 (39%), Gaps = 49/317 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ +G PA ++A+DT +D W+PC C C + ++P S++
Sbjct: 54 YVVRARLGTPAQQLLLAVDTSNDAAWIPCSGCAGCPTS-----------SPFNPAASASY 102
Query: 164 SKVPCNSTLCEL--QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
VPC S C L C +C + + Y +D ++ L +D L +A D V
Sbjct: 103 RPVPCGSPQCVLAPNPSCSPNAKSCGFSLSY-ADSSLQAA-LSQDTLAVAGD------VV 154
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS----DG 277
+FGC + TG+ A P GL GLG S + + + +FS C S +
Sbjct: 155 KAYTFGCLQRATGT---AAPPQGLLGLGRGPLSF--LSQTKDMYGATFSYCLPSFKSLNF 209
Query: 278 TGRISFGDKGSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEFSA---------- 325
+G + G G P + +T L H + Y + +T + VG V+ SA
Sbjct: 210 SGTLRLGRNGQPRRIKTTPLLANPHRSSLYYVNMTGIRVGKKVVSIPASALAFDPATGAG 269
Query: 326 -IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVN 384
+ DSGT FT L P Y + + +S F+ CY T +P V
Sbjct: 270 TVLDSGTMFTRLVAPVYLALRDEVRRRVGAGAAAVSSLGGFDTCY-----NTTVAWPPVT 324
Query: 385 LTMKGGGPFFVNDPIVI 401
L G + +VI
Sbjct: 325 LLFDGMQVTLPEENVVI 341
>gi|90399145|emb|CAJ86169.1| H0913C04.10 [Oryza sativa Indica Group]
gi|125550292|gb|EAY96114.1| hypothetical protein OsI_17992 [Oryza sativa Indica Group]
Length = 491
Score = 64.7 bits (156), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 94/381 (24%), Positives = 136/381 (35%), Gaps = 103/381 (27%)
Query: 108 NVSVGQPALSFIVALDTGSDLFWLPC----DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
VS+G P V LDTGS L W+PC C +C SS +++ P SS+S
Sbjct: 92 TVSLGTPPQPLPVLLDTGSHLSWVPCTSSYQCRNC-----SSLSAASPLHVFHPKNSSSS 146
Query: 164 SKVPCNS------------TLCELQKQCPSAGSNC------------PYQVRYLSDGTMS 199
+ C + + C CP G+NC PY V Y S T
Sbjct: 147 RLIGCRNPSCLWIHSPDHLSDCRAASSCP--GANCTPRNANANNVCPPYLVVYGSGST-- 202
Query: 200 TGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSIL 259
G L+ D L ++V + + GC P+GL G G SVPS L
Sbjct: 203 AGLLISDTL-----RTPGRAVRNFV-IGCSLASVHQ-----PPSGLAGFGRGAPSVPSQL 251
Query: 260 ANQGLIPNSFSMCFGS---DGTGRIS------------------FGDKGSPGQGETPFSL 298
GL FS C S D +S + P+S+
Sbjct: 252 ---GL--TKFSYCLLSRRFDDNAAVSGELILGGAGGKDGGVGMQYAPLARSASARPPYSV 306
Query: 299 RQTHPTYNITITQVSVGGNAVNFEFSA----------IFDSGTSFTYLNDPAYTQISETF 348
Y + +T ++VGG +V A I DSGT+F+Y + + ++
Sbjct: 307 -----YYYLALTAITVGGKSVQLPERAFVAGGAGGGAIVDSGTTFSYFDRTVFEPVAAAV 361
Query: 349 NSLAK---EKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPI----VI 401
+ + + L C+ + P E P ++L KGG +N P+ V+
Sbjct: 362 VAAVGGRYSRSKVVEEGLGLSPCFAMPPGTKTMELPEMSLHFKGGS--VMNLPVENYFVV 419
Query: 402 VSSEPKG-----LYLYCLGVV 417
P G CL VV
Sbjct: 420 AGPAPSGGAPAMAEAICLAVV 440
>gi|21717169|gb|AAM76362.1|AC074196_20 putative nucleoid DNA binding protein, 3'-partial [Oryza sativa
Japonica Group]
Length = 377
Score = 64.7 bits (156), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 89/375 (23%), Positives = 142/375 (37%), Gaps = 53/375 (14%)
Query: 73 RGRGLAAQGNDKTPLTFSAGNDT---YRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLF 129
RGR LA G D TP +AG L+S G L+ N ++G P +D +L
Sbjct: 27 RGRLLA--GVDATPP--AAGGAVAVPIYLSSQG-LYVANFTIGTPPQPVSAVVDLTGELV 81
Query: 130 WLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPY 188
W C C C D ++ P SST +PC S LCE P + NC
Sbjct: 82 WTQCTPCQPCFEQ---------DLPLFDPTKSSTFRGLPCGSHLCE---SIPESSRNCTS 129
Query: 189 QVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGL 248
V T + + TD + + FGC + P+G+ GL
Sbjct: 130 DVCIYEAPTKAG----DTGGKAGTDTFAIGAAKETLGFGCVVMTDKRLKTIGGPSGIVGL 185
Query: 249 GMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQG----ETPFSLRQ---- 300
G P L Q + +FS C +G + G G TPF ++
Sbjct: 186 GR----TPWSLVTQMNV-TAFSYCLAGKSSGALFLGATAKQLAGGKNSSTPFVIKTSAGS 240
Query: 301 ----THPTYNITITQVSVGGNAVNFEFSA----IFDSGTSFTYLNDPAYTQISETFNSLA 352
++P Y + + + GG + S+ + D+ + +YL D AY + + + A
Sbjct: 241 SDNGSNPYYMVKLAGIKTGGAPLQAASSSGSTVLLDTVSRASYLADGAYKALKKALTA-A 299
Query: 353 KEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIVSSEPKGLYLY 412
+ ++ P++ C+ P + P + T GG V +++S G
Sbjct: 300 VGVQPVASPPKPYDLCF---PKAVAGDAPELVFTFDGGAALTVPPANYLLAS---GNGTV 353
Query: 413 CLGVVKSDNVNIIGR 427
CL + S ++N+ G
Sbjct: 354 CLTIGSSASLNLTGE 368
>gi|413938615|gb|AFW73166.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 386
Score = 64.7 bits (156), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 83/287 (28%), Positives = 126/287 (43%), Gaps = 35/287 (12%)
Query: 101 LGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNT 159
+G L+Y S+G P ++ + +DTGSDL W+ C + S + D P
Sbjct: 43 IGTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCAAAPSCYSQKDPLFD-----PAQ 97
Query: 160 SSTSSKVPCNSTLCE---LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
SS+ + VPC +C + + + C Y V Y DG+ +TG D L L+
Sbjct: 98 SSSYAAVPCGGPVCAGLGIYAASACSAAQCGYVVSY-GDGSNTTGVYSSDTLTLS----- 151
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD 276
+ S FGCG Q+G F +G +GL GLG ++ S+ + G FS C +
Sbjct: 152 ASSAVQGFFFGCGHAQSGLF-NGV--DGLLGLGREQPSL--VEQTAGTYGGVFSYCLPTK 206
Query: 277 GT--GRISFGDKGSPGQGETPFSLRQTHPT------YNITITQVSVGGNAVNFEFSAI-- 326
+ G ++ G G P FS Q P+ Y + +T +SVGG ++ SA
Sbjct: 207 PSTAGYLTLG-VGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAG 265
Query: 327 ---FDSGTSFTYLNDPAYTQISETFNS-LAKEKRETSTSDLPFEYCY 369
D+GT T L AY + F S +A T+ S+ + CY
Sbjct: 266 GTVVDTGTVVTRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCY 312
>gi|224079535|ref|XP_002305886.1| predicted protein [Populus trichocarpa]
gi|222848850|gb|EEE86397.1| predicted protein [Populus trichocarpa]
Length = 436
Score = 64.7 bits (156), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 79/306 (25%), Positives = 118/306 (38%), Gaps = 65/306 (21%)
Query: 84 KTPLTFSAGND-TYRLN-SLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHG 141
KTP SA + YR + ++ +G P S + LDTGS L W+ C
Sbjct: 54 KTPALKSAASPYNYRSRFKYSMILLVSLPIGTPPQSQQMILDTGSQLSWIQCH------- 106
Query: 142 LNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE-------LQKQCPSAGSNCPYQVRYLS 194
+ ++ P+ SS+ S +PCN LC+ L C C Y Y +
Sbjct: 107 -KKVPRKPPPSTVFDPSLSSSFSVLPCNHPLCKPRIPDFTLPTSC-DLNRLCHYSYFY-A 163
Query: 195 DGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTS 254
DGT++ G LV + + +T + + GC D + G+ G+ + + S
Sbjct: 164 DGTLAEGNLVREKITFSTSQSTPPLI-----LGCAE-------DASDDKGILGMNLGRLS 211
Query: 255 VPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETP------------FSLRQTH 302
A+Q I FS C + R F GS GE P FS Q
Sbjct: 212 ----FASQAKI-TKFSYCVPTRQV-RPGFTPTGSFYLGENPNSAGFQYISLLTFSQSQRM 265
Query: 303 P-----TYNITITQVSVGGNAVNFEFSA-----------IFDSGTSFTYLNDPAYTQISE 346
P + + + + +G +N SA + DSG+ FTYL D AY ++ E
Sbjct: 266 PNLDPLAHTVALQGIRIGNKKLNIPVSAFRADPSGAGQSMIDSGSEFTYLVDVAYNKVRE 325
Query: 347 TFNSLA 352
LA
Sbjct: 326 EVVRLA 331
>gi|449436215|ref|XP_004135889.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 496
Score = 64.7 bits (156), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 97/363 (26%), Positives = 143/363 (39%), Gaps = 65/363 (17%)
Query: 97 RLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIY 155
RL +L ++ V+VG + + +DTGSDL W+ C C C + ++
Sbjct: 139 RLQTLNYI----VTVGIGGQNSTLIVDTGSDLTWVQCLPCRLCYNQQEP---------LF 185
Query: 156 SPNTSSTSSKVPCNSTLC-ELQKQCPSAG-------SNCPYQVRYLSDGTMSTGFLVEDV 207
+P+ SS+ +PCNS C LQ S+G ++C YQ+ Y DG+ S G L +
Sbjct: 186 NPSNSSSFLSLPCNSPTCVALQPTAGSSGLCSNKNSTSCDYQIDY-GDGSYSRGELGFEK 244
Query: 208 LHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPN 267
L L E +D+ I FGCGR G F +GL GL + S+ S L +
Sbjct: 245 LTLGKTE-----IDNFI-FGCGRNNKGLF---GGASGLMGLARSELSLVS--QTSSLFGS 293
Query: 268 SFSMCFGSDGTGRISFGDKGSPGQGETPFS-LRQTHPT--------------YNITITQV 312
FS C + G G GS G FS + P Y + +T +
Sbjct: 294 VFSYCLPTTGV-----GSSGSLTLGGADFSNFKNISPISYTRMIQNPQMSNFYFLNLTGI 348
Query: 313 SVGGNAVNFE-------FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPF 365
S+GG +N ++ DSGT T L+ Y F R T +
Sbjct: 349 SIGGVNLNVPRLSSNEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSI-L 407
Query: 366 EYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIV--IVSSEPKGLYLYCLGVVKSDNVN 423
C+ L+ + P V +G V+ V V S+ + L + D
Sbjct: 408 NTCFNLTGYE-EVNIPTVKFIFEGNAEMIVDVEGVFYFVKSDASQICLAFASLGYEDQTM 466
Query: 424 IIG 426
IIG
Sbjct: 467 IIG 469
>gi|224126551|ref|XP_002329582.1| predicted protein [Populus trichocarpa]
gi|222870291|gb|EEF07422.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 64.7 bits (156), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 86/366 (23%), Positives = 139/366 (37%), Gaps = 41/366 (11%)
Query: 46 LAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGN-DKTPLTFSAGNDTYRLNSLGFL 104
L D PK S Y S H R+ + R ++ + +T T S + + G
Sbjct: 35 LVHRDSPK--SPLYNSQQTHLQRWNKAMRRSVSRVHHFQRTAATVSPKEVESEIIANGGE 92
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ ++S+G P + DTGSDL W C C C + ++ P +S T
Sbjct: 93 YLMSLSLGTPPFEILAIADTGSDLIWTQCTPCDKCYKQIAP---------LFDPKSSKTY 143
Query: 164 SKVPCNSTLCELQKQCPSAGSN--CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
+ C++ C+ + S S C Y Y D + + G L D + L +
Sbjct: 144 RDLSCDTRQCQNLGESSSCSSEQLCQYSY-YYGDRSFTNGNLAVDTVTLPSTNGGPVYFP 202
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDGT 278
+ GCGR G+F +G+ GLG S+ S + + + FS C F S+
Sbjct: 203 KTV-IGCGRRNNGTF--DKKDSGIIGLGGGPMSLISQMGSS--VGGKFSYCLVPFSSESA 257
Query: 279 G---RISFGDKG---SPGQGETPFSLRQTHPTYNITITQVSVGGNAV--------NFEFS 324
G ++ FG G TP + Y +T+ +SVG + E +
Sbjct: 258 GNSSKLHFGRNAVVSGSGVQSTPLISKNPDTFYYLTLEAMSVGDKKIEFGGSSFGGSEGN 317
Query: 325 AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVN 384
I DSGTS T +T+ + + T + +CY +P + + PV+
Sbjct: 318 IIIDSGTSLTLFPVNFFTEFATAVENAVINGERTQDASGLLSHCYRPTP---DLKVPVIT 374
Query: 385 LTMKGG 390
G
Sbjct: 375 AHFNGA 380
>gi|125554529|gb|EAZ00135.1| hypothetical protein OsI_22138 [Oryza sativa Indica Group]
Length = 472
Score = 64.7 bits (156), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 73/263 (27%), Positives = 107/263 (40%), Gaps = 34/263 (12%)
Query: 103 FLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
FL VS+G+P + +VA+DTGS L W+ C C H ++ +G + D P S
Sbjct: 112 FLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFD-----PGRSY 166
Query: 162 TSSKVPCNSTLC-EL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
TS +V C+S C EL Q C +C Y V Y + S G +V D L +
Sbjct: 167 TSRRVRCSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDS 226
Query: 214 EKQSKSVDSRISFGCGR-VQTGSFLDGAAPNGLFGLGMDK--TSVPSILANQGLIPNSFS 270
+ FGC V+ F G G + P IL+ + +FS
Sbjct: 227 FMD-------LMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYK-----AFS 274
Query: 271 MCFGSDGT--GRISFG--DKGSPGQGETPFSLRQTHPTYNITITQ-VSVGGNAVNFEFSA 325
C +D T G + G D+ + G T PTY++T+ ++ G V
Sbjct: 275 YCLPTDETKPGYMILGRYDRAAMDGGYTSLFRSINRPTYSLTMEMLIANGQRLVTSSSEM 334
Query: 326 IFDSGTSFTYLNDPAYTQISETF 348
I DSG T L + + +T
Sbjct: 335 IVDSGAQRTSLWPSTFALLDKTI 357
>gi|326515172|dbj|BAK03499.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 494
Score = 64.7 bits (156), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 90/349 (25%), Positives = 130/349 (37%), Gaps = 42/349 (12%)
Query: 74 GRGLAAQGNDKTPL-TFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFW-- 130
R D TP T S G ++ ++ T + Q A+S V +DT SD+ W
Sbjct: 124 ARSTTVSNRDYTPSSTASVGTNSGTSKTIEKSDQTATNEHQDAVSQTVVVDTSSDIPWVQ 183
Query: 131 -LPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC-ELQKQ----CPSAGS 184
LPC C + +Y P SST + +PC S C EL C
Sbjct: 184 CLPCPIPQC---------HLQKDPLYDPAKSSTFAPIPCGSPACKELGSSYGNGCSPTTD 234
Query: 185 NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNG 244
C Y V Y DG +TG V D L ++ V FGC GSF + A G
Sbjct: 235 ECKYIVNY-GDGKATTGTYVTDTLTMS-----PTIVVKDFRFGCSHAVRGSFSNQNA--G 286
Query: 245 LFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFS----LRQ 300
+ LG + S+ A+ N+FS C + F G P + FS ++
Sbjct: 287 ILALGGGRGSLLEQTADA--YGNAFSYCIPKPSSA--GFLSLGGPVEASLKFSYTPLIKN 342
Query: 301 TH-PT-YNITITQVSVGGNAV-----NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAK 353
H PT Y + + + V G + F A+ DSG T L Y + F S
Sbjct: 343 KHAPTFYIVHLEAIIVAGKQLAVPPTAFATGAVMDSGAVVTQLPPQVYAALRAAFRSAMA 402
Query: 354 EKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIVIV 402
+ + CY + + + P V+L GG + +I+
Sbjct: 403 AYGPLAAPVRNLDTCYDFT-RFPDVKVPKVSLVFAGGATLDLEPASIIL 450
>gi|116789442|gb|ABK25248.1| unknown [Picea sitchensis]
Length = 366
Score = 64.7 bits (156), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 65/225 (28%), Positives = 99/225 (44%), Gaps = 27/225 (12%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++T + VG P + LDTGSD+ W+ C+ C C + I++P+ S++
Sbjct: 157 YFTRIGVGTPTREQYMVLDTGSDVAWIQCEPCRECYSQADP---------IFNPSYSASF 207
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
S V C+S +C C Y+ Y DG+ STG + L T +
Sbjct: 208 STVGCDSAVCSQLDAYDCHSGGCLYEASY-GDGSYSTGSFATETLTFGTTSV------AN 260
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGTGR 280
++ GCG G F+ A L GLG S P+ + Q ++FS C SD +G
Sbjct: 261 VAIGCGHKNVGLFIGAAG---LLGLGAGALSFPNQIGTQ--TGHTFSYCLVDRESDSSGP 315
Query: 281 ISFGDKGSP-GQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEF 323
+ FG K P G TP PT Y +++T +S+ A + F
Sbjct: 316 LQFGPKSVPVGSIFTPLEKNPHLPTFYYLSVTAISISAIACVWSF 360
>gi|449509162|ref|XP_004163513.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 417
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 97/363 (26%), Positives = 143/363 (39%), Gaps = 65/363 (17%)
Query: 97 RLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIY 155
RL +L ++ V+VG + + +DTGSDL W+ C C C + ++
Sbjct: 60 RLQTLNYI----VTVGIGGQNSTLIVDTGSDLTWVQCLPCRLCYNQQEP---------LF 106
Query: 156 SPNTSSTSSKVPCNSTLC-ELQKQCPSAG-------SNCPYQVRYLSDGTMSTGFLVEDV 207
+P+ SS+ +PCNS C LQ S+G ++C YQ+ Y DG+ S G L +
Sbjct: 107 NPSNSSSFLSLPCNSPTCVALQPTAGSSGLCSNKNSTSCDYQIDY-GDGSYSRGELGFEK 165
Query: 208 LHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPN 267
L L E +D+ I FGCGR G F +GL GL + S+ S L +
Sbjct: 166 LTLGKTE-----IDNFI-FGCGRNNKGLF---GGASGLMGLARSELSLVS--QTSSLFGS 214
Query: 268 SFSMCFGSDGTGRISFGDKGSPGQGETPFS-LRQTHPT--------------YNITITQV 312
FS C + G G GS G FS + P Y + +T +
Sbjct: 215 VFSYCLPTTGV-----GSSGSLTLGGADFSNFKNISPISYTRMIQNPQMSNFYFLNLTGI 269
Query: 313 SVGGNAVNFE-------FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPF 365
S+GG +N ++ DSGT T L+ Y F R T +
Sbjct: 270 SIGGVNLNVPRLSSNEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSI-L 328
Query: 366 EYCYVLSPNQTNFEYPVVNLTMKGGGPFFVNDPIV--IVSSEPKGLYLYCLGVVKSDNVN 423
C+ L+ + P V +G V+ V V S+ + L + D
Sbjct: 329 NTCFNLTGYE-EVNIPTVKFIFEGNAEMIVDVEGVFYFVKSDASQICLAFASLGYEDQTM 387
Query: 424 IIG 426
IIG
Sbjct: 388 IIG 390
>gi|357140068|ref|XP_003571594.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 533
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 98/343 (28%), Positives = 131/343 (38%), Gaps = 51/343 (14%)
Query: 112 GQPALSFIVALDTGSDLFWL---PCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPC 168
G A + V +DTGSDL W+ PC SC + ++ P S T + VPC
Sbjct: 188 GGGAKNLTVIVDTGSDLTWVQCEPCPGSSCYAQRDP---------LFDPAASPTFAAVPC 238
Query: 169 NSTLC--ELQKQCPSAGS----------NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
S C L+ + GS C Y + Y DG+ S G L +D L L T K
Sbjct: 239 GSPACAASLKDATGAPGSCARSAGNSEQRCYYALSY-GDGSFSRGVLAQDTLGLGTTTKL 297
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--G 274
V FGCG G F A GL GLG S+ S A + FS C
Sbjct: 298 DGFV-----FGCGLSNRGLFGGTA---GLMGLGRTDLSLVSQTAAR--FGGVFSYCLPAT 347
Query: 275 SDGTGRISFGDKGS---PGQGETPFSLRQTHPTY---NITITQVSVGGNAVNFEFSA--- 325
+ TG +S G S P T T P + NIT V G F A
Sbjct: 348 TTSTGSLSLGPGPSSSFPNMAYTRMIADPTQPPFYFINITGAAVGGGAALTAPGFGAGNV 407
Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNL 385
+ DSGT T L Y + F + S L + CY L+ + P++ L
Sbjct: 408 LVDSGTVITRLAPSVYKAVRAEFARRFEYPAAPGFSIL--DACYDLT-GRDEVNVPLLTL 464
Query: 386 TMKGGGPFFVNDP--IVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
T++GG V+ + +V + + L + D IIG
Sbjct: 465 TLEGGAQVTVDAAGMLFVVRKDGSQVCLAMASLPYEDQTPIIG 507
>gi|54290728|dbj|BAD62398.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 486
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 81/313 (25%), Positives = 122/313 (38%), Gaps = 35/313 (11%)
Query: 98 LNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYS 156
L++L F+ V +G PA + DTGSDL W+ C C S H ++
Sbjct: 139 LDTLEFV--VAVGLGTPAQPSALIFDTGSDLSWVQCQPCGSSGHCHPQQD------PLFD 190
Query: 157 PNTSSTSSKVPCNSTLCELQKQ-CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEK 215
P+ SST + V C C C + C Y VRY DG+ +TG L D L L +
Sbjct: 191 PSKSSTYAAVHCGEPQCAAAGDLCSEDNTTCLYLVRY-GDGSSTTGVLSRDTLALTSSRA 249
Query: 216 QSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
+ FGCG G F G L + + A+ G + FS C S
Sbjct: 250 LTG-----FPFGCGTRNLGDF--GRVDGLLGLGRGELSLPSQAAASFGAV---FSYCLPS 299
Query: 276 DG--TGRISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGG------NAVNFEF 323
TG ++ G + G ++ P Y + + + +GG AV
Sbjct: 300 SNSTTGYLTIGATPATDTGAAQYTAMLRKPQFPSFYFVELVSIDIGGYVLPVPPAVFTRG 359
Query: 324 SAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVV 383
+ DSGT TYL AY + + F L E+ + + + CY + ++ P V
Sbjct: 360 GTLLDSGTVLTYLPAQAYALLRDRFR-LTMERYTPAPPNDVLDACYDFA-GESEVVVPAV 417
Query: 384 NLTMKGGGPFFVN 396
+ G F ++
Sbjct: 418 SFRFGDGAVFELD 430
>gi|302783208|ref|XP_002973377.1| hypothetical protein SELMODRAFT_413681 [Selaginella moellendorffii]
gi|300159130|gb|EFJ25751.1| hypothetical protein SELMODRAFT_413681 [Selaginella moellendorffii]
Length = 472
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 102/408 (25%), Positives = 161/408 (39%), Gaps = 54/408 (13%)
Query: 6 RNSPVCVLLILLSCCAGCCFGFGTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAH 65
R S +C LL L+ C + F H + PV G+ V+ + +K +AL
Sbjct: 4 RRSVLCFLLALV------CI----WEFSRPHVEAAPVSGL--VNAIARK---VLPAALKE 48
Query: 66 RDRYFRLRGRGLAAQGNDKTPLTFSAGNDT--YRLNSLGFLHYTNVSVGQPALSFIVALD 123
+ R LA D + G +T Y N L F N+++G P + +
Sbjct: 49 GGAIVWKQRRTLANITTDFSVRGGDKGLETSFYVDNGLNFAM--NLNLGTPPVQHNFTMA 106
Query: 124 TGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQ-----K 177
S+ FW C CV C N ++S +S++ +++PC S C
Sbjct: 107 LNSEFFWAACSPCVDCNVSTNDP--------LFSSASSTSYTRIPCTSPFCSTSPGFSTN 158
Query: 178 QCPSAG---SNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTG 234
C S+ + C Y Y +D + S G + DV+ + T K + R+S GCGR T
Sbjct: 159 ACGSSAVGSTTCLYNFSYSTDYS-SAGEMASDVVAMKTPRKTRGNKSLRMSLGCGREST- 216
Query: 235 SFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDG-TGRISFGDKGSPGQGE 293
+ L +GL G S LA + F C SD +G+I G+
Sbjct: 217 TLLGILNTSGLVGFAKTDKSFIGQLAEMDYT-SKFIYCVPSDTFSGKIVLGNYKISSHSS 275
Query: 294 ---TPFSLRQTHPTY----NITITQV---SVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQ 343
TP + T Y +I+IT V G + I DS +F+Y +YT
Sbjct: 276 LSYTPMIVNSTALYYIGLRSISITDTLTFPVQGILADGTGGTIIDSTFAFSYFTPDSYTP 335
Query: 344 ISETFNSLAKEKRETSTSD----LPFEYCYVLSPNQTNFEYPVVNLTM 387
+ + +L + S+++ L + CY +S N + E V L +
Sbjct: 336 LVQAIQNLNSNLTKVSSNETAALLGNDICYNVSVNDDDAENATVCLAV 383
>gi|225427558|ref|XP_002266614.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 444
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 96/395 (24%), Positives = 142/395 (35%), Gaps = 61/395 (15%)
Query: 60 YSALAHRDRYFRLRG---RGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPAL 116
Y L R LRG R + A ND S G + N+S+G P +
Sbjct: 56 YQRLQKAFRRSILRGNHFRAMRASPNDIQSDVISGGG----------AYLMNISLGTPPV 105
Query: 117 SFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCEL 175
+ DTGSDL W C C +C + ++ P S T + C++ C+
Sbjct: 106 PMLGIADTGSDLIWRQCLPCPNCYEQVEP---------LFDPKESETYKTLDCDNEFCQD 156
Query: 176 QKQCPSAGSN--CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQT 233
Q S + C Y Y D + + G L D L + + E S I+FGCG
Sbjct: 157 LGQQGSCDDDNTCTYSYSY-GDRSYTRGDLSSDTLTIGSTEGDPASFPG-IAFGCGHDNG 214
Query: 234 GSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDGT--GRISFGDKG- 287
G+F + +G+ + ++ + FS C SD T +I+FG G
Sbjct: 215 GTFNE----KDGGLIGLGGGPLSLVMQLSSEVGGQFSYCLVPLSSDSTVSSKINFGKSGV 270
Query: 288 --SPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF--------------EFSAIFDSGT 331
G TP Y +T+ +SVG V F E + I DSGT
Sbjct: 271 VSGSGTVSTPLIKGTPDTFYYLTLEGLSVGSETVAFKGFSENKSSPAAVEEGNIIIDSGT 330
Query: 332 SFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTMKGGG 391
+ T L YT + + + T + + F CY + N E P + G
Sbjct: 331 TLTLLPQDFYTDVESALTNAIGGQTTTDPNGI-FSLCY---SSVNNLEIPTITAHFTGAD 386
Query: 392 PFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVNIIG 426
+ E L C ++ S N+ I G
Sbjct: 387 VQLPPLNTFVQVQED----LVCFSMIPSSNLAIFG 417
>gi|356528623|ref|XP_003532899.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 507
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 97/454 (21%), Positives = 165/454 (36%), Gaps = 87/454 (19%)
Query: 35 HHRYS---------DPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKT 85
H R+S + VKG + D L ++ + +++ DR R +GL +
Sbjct: 42 HERFSGGGGDVDQVEAVKGFVNRDGLRRQRMNQRW-GVSNYDR----RRKGLETTTTTEV 96
Query: 86 PLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSS 145
+ AG D ++LG ++T V VG P F +A DTGS+ W C + +
Sbjct: 97 EMPMRAGRD----DALG-EYFTEVKVGSPGQRFWLAADTGSEFTWFNCVMRNATTTATTK 151
Query: 146 SGQVIDF------------------------------NIYSPNTSSTSSKVPCNSTLCEL 175
+ ++ P+ S + V C S C++
Sbjct: 152 KTRKNKTKKKHHHHSKRNRTRTTRRTKKKKAKSNPCKGVFCPHRSKSFQAVTCASQKCKI 211
Query: 176 Q-------KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGC 228
CP C Y + Y +DG+ + GF D + + + +++ ++ GC
Sbjct: 212 DLSQLFSLSLCPKPSDPCLYDISY-ADGSSAKGFFGTDTITVDLKNGKEGKLNN-LTIGC 269
Query: 229 GRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGR--ISFGDK 286
+ G+ GLG K S A + FS C + R S+
Sbjct: 270 TKSMENGVNFNEDTGGILGLGFAKDSFIDKAAYE--YGAKFSYCLVDHLSHRNVSSYLTI 327
Query: 287 GSPGQGETPFSLRQTH-----PTYNITITQVSVGGNAV---------NFEFSAIFDSGTS 332
G + +++T P Y + + +S+GG + N + + DSGT+
Sbjct: 328 GGHHNAKLLGEIKRTELILFPPFYGVNVVGISIGGQMLKIPPQVWDFNSQGGTLIDSGTT 387
Query: 333 FTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEYCYVLSPNQTNFE---YPVVNLTMK 388
T L PAY + E SL K KR T ++C+ + F+ P +
Sbjct: 388 LTALLVPAYEPVFEALIKSLTKVKRVTGEDFGALDFCF----DAEGFDDSVVPRLVFHFA 443
Query: 389 GGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNV 422
GG F I+ P + C+G+V D +
Sbjct: 444 GGARFEPPVKSYIIDVAP---LVKCIGIVPIDGI 474
>gi|357143328|ref|XP_003572882.1| PREDICTED: uncharacterized protein LOC100846829 [Brachypodium
distachyon]
Length = 836
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 81/256 (31%), Positives = 109/256 (42%), Gaps = 38/256 (14%)
Query: 99 NSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSP 157
+S+G L Y VS+G P ++ V +DTGSD+ W+ C + ++ P
Sbjct: 493 HSIGTLQYVVTVSLGTPGVAQTVEVDTGSDVSWVQCAPCAAPACYAQKD------QLFDP 546
Query: 158 NTSSTSSKVPCNSTLC-ELQ---KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
SS+ S VPC + C EL C +AGS C Y V Y DG+ +TG D L L
Sbjct: 547 AKSSSYSAVPCAADACSELSTYGHGC-AAGSQCGYVVSY-GDGSNTTGVYGSDTLTLTDA 604
Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGL---GMDKTSVPSILANQGLIPNSFS 270
+ + + FGCG Q G F A +GL L GM TS S G+ FS
Sbjct: 605 DAVTGFL-----FGCGHAQAGLF---AGIDGLLALGRKGMSLTSQTSGAYGGGV----FS 652
Query: 271 MCF--GSDGTGRISFGDKGSP-GQGETPFSLRQTHPT-YNITITQVSVGGN------AVN 320
C TG ++ G S G T PT Y + +T + VGG A
Sbjct: 653 YCLPPSPSSTGFLTLGGPSSASGFATTGLLTAWDVPTFYMVMLTGIGVGGQQLSGVPASA 712
Query: 321 FEFSAIFDSGTSFTYL 336
F + D+GT T L
Sbjct: 713 FAGGTVVDTGTVITRL 728
>gi|326530426|dbj|BAJ97639.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 479
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 82/317 (25%), Positives = 117/317 (36%), Gaps = 71/317 (22%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ V VG P + +D+GSD+ W+ C C C + ++ P S++
Sbjct: 133 YFVRVGVGSPPTEQYLVVDSGSDVIWIQCRPCAECYQQADP---------LFDPAASASF 183
Query: 164 SKVPCNSTLCEL----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
+ VPC+S +C C +G+ C YQV Y DG+ + G L + L S
Sbjct: 184 TAVPCDSGVCRTLPGGSSGCADSGA-CRYQVSY-GDGSYTQGVLAMETLTFG----DSTP 237
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD--- 276
V ++ GCG G F+ A GL GLG S+ L +FS C S
Sbjct: 238 VQG-VAIGCGHRNRGLFVGAA---GLLGLGWGPMSLVGQLGGA--AGGAFSYCLASRGAD 291
Query: 277 -GTGRISFG-DKGSP-GQGETPFSLRQTHPTYNITITQVSV------------------G 315
G G + FG D P G P P++ G
Sbjct: 292 AGAGSLVFGRDDAMPVGAVWVPLLRNAQQPSFYYVGLTGLGVGGERLPLQDGLFDLTEDG 351
Query: 316 GNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP-------FEYC 368
G V + D+GT+ T L AY + + F S T DLP + C
Sbjct: 352 GGGV------VMDTGTAVTRLPPDAYAALRDAFAS-------TIGGDLPRAPGVSLLDTC 398
Query: 369 YVLSPNQTNFEYPVVNL 385
Y LS + P V L
Sbjct: 399 YDLS-GYASVRVPTVAL 414
>gi|68475693|ref|XP_718053.1| secretory aspartyl proteinase SAP1p [Candida albicans SC5314]
gi|68475828|ref|XP_717987.1| secretory aspartyl proteinase SAP1p [Candida albicans SC5314]
gi|7548425|gb|AAA34368.2| secreted aspartyl proteinase 1 [Candida albicans]
gi|7548465|gb|AAA34370.2| secreted aspartyl proteinase 1 [Candida albicans]
gi|46439729|gb|EAK99043.1| secretory aspartyl proteinase SAP1p [Candida albicans SC5314]
gi|46439804|gb|EAK99117.1| secretory aspartyl proteinase SAP1p [Candida albicans SC5314]
Length = 391
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 71/281 (25%), Positives = 110/281 (39%), Gaps = 56/281 (19%)
Query: 98 LNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDF----N 153
LN+ + ++++G F V +DTGS W+P V+C GQ DF
Sbjct: 57 LNNEHVSYAADITIGSNKQKFNVIVDTGSSDLWVPDASVTCDK---PRPGQSADFCKGKG 113
Query: 154 IYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
IY+P +S+TS + P+ + Y DG+ S G L +D
Sbjct: 114 IYTPKSSTTSQNL------------------GTPFYIGY-GDGSSSQGTLYKDT------ 148
Query: 214 EKQSKSVDSRISFGCGRVQTGSFLD---GAAPNGLFGLGM-------DKTSVPSILANQG 263
+ FG + F D + P G+ G+G D +VP L NQG
Sbjct: 149 ----------VGFGGASITKQVFADITKTSIPQGILGIGYKTNEAAGDYDNVPVTLKNQG 198
Query: 264 LI-PNSFSMCFGS--DGTGRISFGDKGSPGQGETPFSLRQTHP-TYNITITQVSVGGNAV 319
+I N++S+ S TG+I FG + ++ T IT+ + G +
Sbjct: 199 VIAKNAYSLYLNSPNAATGQIIFGGVDKAKYSGSLIAVPVTSDRELRITLNSLKAVGKNI 258
Query: 320 NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETST 360
N + DSGT+ TYL I + F + K + T
Sbjct: 259 NGNIDVLLDSGTTITYLQQDVAQDIIDAFQAELKSDGQGHT 299
>gi|242081367|ref|XP_002445452.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
gi|241941802|gb|EES14947.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
Length = 459
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 75/270 (27%), Positives = 110/270 (40%), Gaps = 54/270 (20%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
H V VG P V LD GSDL W C V ++ Q+ ++ SS+ S
Sbjct: 107 HSLTVGVGTPPQPSKVILDLGSDLLWTQCSLV------GPTAKQLEP--VFDAARSSSFS 158
Query: 165 KVPCNSTLCEL----QKQCPSAGSNCPYQVRYLSDGTM-STGFLVEDVLHLATDEKQSKS 219
+PC+S LCE K C C Y+ Y G M +TG L +
Sbjct: 159 VLPCDSKLCEAGTFTNKTC--TDRKCAYENDY---GIMTATGVLATETFTFGAHH----G 209
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSD 276
V + ++FGCG++ G+ A +G+ GL S+ LA FS C F
Sbjct: 210 VSANLTFGCGKLANGTI---AEASGILGLSPGPLSMLKQLAI-----TKFSYCLTPFADR 261
Query: 277 GTGRISFGDKGSPGQGETPFSLRQTHPT---------YNITITQVSVGGNAVNF--EFSA 325
T + FG G+ +T + QT P Y + + +SVG ++ E A
Sbjct: 262 KTSPVMFGAMADLGKYKTTGKV-QTIPLLKNPVEDIYYYVPMVGMSVGSKRLDVPQETLA 320
Query: 326 I---------FDSGTSFTYLNDPAYTQISE 346
I DS T+ YL +PA+T++ +
Sbjct: 321 IKPDGTGGTVLDSATTLAYLVEPAFTELKK 350
>gi|193885194|pdb|2QZW|A Chain A, Secreted Aspartic Proteinase (Sap) 1 From Candida Albicans
gi|193885195|pdb|2QZW|B Chain B, Secreted Aspartic Proteinase (Sap) 1 From Candida Albicans
Length = 341
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 71/281 (25%), Positives = 110/281 (39%), Gaps = 56/281 (19%)
Query: 98 LNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDF----N 153
LN+ + ++++G F V +DTGS W+P V+C GQ DF
Sbjct: 7 LNNEHVSYAADITIGSNKQKFNVIVDTGSSDLWVPDASVTCDK---PRPGQSADFCKGKG 63
Query: 154 IYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
IY+P +S+TS + P+ + Y DG+ S G L +D
Sbjct: 64 IYTPKSSTTSQNL------------------GTPFYIGY-GDGSSSQGTLYKDT------ 98
Query: 214 EKQSKSVDSRISFGCGRVQTGSFLD---GAAPNGLFGLGM-------DKTSVPSILANQG 263
+ FG + F D + P G+ G+G D +VP L NQG
Sbjct: 99 ----------VGFGGASITKQVFADITKTSIPQGILGIGYKTNEAAGDYDNVPVTLKNQG 148
Query: 264 LI-PNSFSMCFGS--DGTGRISFGDKGSPGQGETPFSLRQTHP-TYNITITQVSVGGNAV 319
+I N++S+ S TG+I FG + ++ T IT+ + G +
Sbjct: 149 VIAKNAYSLYLNSPNAATGQIIFGGVDKAKYSGSLIAVPVTSDRELRITLNSLKAVGKNI 208
Query: 320 NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETST 360
N + DSGT+ TYL I + F + K + T
Sbjct: 209 NGNIDVLLDSGTTITYLQQDVAQDIIDAFQAELKSDGQGHT 249
>gi|353678009|sp|C4YSF6.1|CARP1_CANAW RecName: Full=Candidapepsin-1; AltName: Full=ACP 1; AltName:
Full=Aspartate protease 1; AltName: Full=Secreted
aspartic protease 1; Flags: Precursor
gi|238883021|gb|EEQ46659.1| conserved hypothetical protein [Candida albicans WO-1]
Length = 391
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 71/281 (25%), Positives = 110/281 (39%), Gaps = 56/281 (19%)
Query: 98 LNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDF----N 153
LN+ + ++++G F V +DTGS W+P V+C GQ DF
Sbjct: 57 LNNEHVSYAADITIGSNKQKFNVIVDTGSSDLWVPDASVTCDK---PRPGQSADFCKGKG 113
Query: 154 IYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
IY+P +S+TS + P+ + Y DG+ S G L +D
Sbjct: 114 IYTPKSSTTSQNL------------------GTPFYIGY-GDGSSSQGTLYKDT------ 148
Query: 214 EKQSKSVDSRISFGCGRVQTGSFLD---GAAPNGLFGLGM-------DKTSVPSILANQG 263
+ FG + F D + P G+ G+G D +VP L NQG
Sbjct: 149 ----------VGFGGASITKQVFADITKTSIPQGILGIGYKTNEAAGDYDNVPVTLKNQG 198
Query: 264 LI-PNSFSMCFGS--DGTGRISFGDKGSPGQGETPFSLRQTHP-TYNITITQVSVGGNAV 319
+I N++S+ S TG+I FG + ++ T IT+ + G +
Sbjct: 199 VIAKNAYSLYLNSPNAATGQIIFGGVDKAKYSGSLIAVPVTSDRELRITLNSLKAVGKNI 258
Query: 320 NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETST 360
N + DSGT+ TYL I + F + K + T
Sbjct: 259 NGNIDVLLDSGTTITYLQQDVAQDIIDAFQAELKSDGQGHT 299
>gi|353678008|sp|P0CY27.1|CARP1_CANAL RecName: Full=Candidapepsin-1; AltName: Full=ACP 1; AltName:
Full=Aspartate protease 1; AltName: Full=Secreted
aspartic protease 1; Flags: Precursor
gi|7548436|gb|AAA34369.2| secreted aspartyl proteinase 1 [Candida albicans]
Length = 391
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 71/281 (25%), Positives = 110/281 (39%), Gaps = 56/281 (19%)
Query: 98 LNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDF----N 153
LN+ + ++++G F V +DTGS W+P V+C GQ DF
Sbjct: 57 LNNEHVSYAADITIGSNKQKFNVIVDTGSSDLWVPDASVTCDK---PRPGQSADFCKGKG 113
Query: 154 IYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
IY+P +S+TS + P+ + Y DG+ S G L +D
Sbjct: 114 IYTPKSSTTSQNL------------------GTPFYIGY-GDGSSSQGTLYKDT------ 148
Query: 214 EKQSKSVDSRISFGCGRVQTGSFLD---GAAPNGLFGLGM-------DKTSVPSILANQG 263
+ FG + F D + P G+ G+G D +VP L NQG
Sbjct: 149 ----------VGFGGASITKQVFADITKTSIPQGILGIGYKTNEAAGDYDNVPVTLKNQG 198
Query: 264 LI-PNSFSMCFGS--DGTGRISFGDKGSPGQGETPFSLRQTHP-TYNITITQVSVGGNAV 319
+I N++S+ S TG+I FG + ++ T IT+ + G +
Sbjct: 199 VIAKNAYSLYLNSPNAATGQIIFGGVDKAKYSGSLIAVPVTSDRELRITLNSLKAVGKNI 258
Query: 320 NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETST 360
N + DSGT+ TYL I + F + K + T
Sbjct: 259 NGNIDVLLDSGTTITYLQQDVAQDIIDAFQAELKSDGQGHT 299
>gi|340810977|gb|AEK75415.1| S5 [Oryza rufipogon]
Length = 357
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 83/327 (25%), Positives = 124/327 (37%), Gaps = 54/327 (16%)
Query: 109 VSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
VS+G+P + +VA+DTGS L W+ C C H ++ +G + D P S TS +V
Sbjct: 3 VSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFD-----PGRSYTSRRVR 57
Query: 168 CNSTLC-------ELQK-QCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
C+S C LQ+ C +C Y V Y + S G +V D L +
Sbjct: 58 CSSVKCGEPRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDSFMD--- 114
Query: 220 VDSRISFGCGR-VQTGSFLDGAAPNGLFGLGMDK--TSVPSILANQGLIPNSFSMCFGSD 276
+ FGC V+ F G G + P IL+ + L S C +D
Sbjct: 115 ----LMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKAL-----SYCLPTD 165
Query: 277 GT--GRISFG--DKGSPGQGETPFSLRQTHPTYNITITQ-VSVGGNAVNFEFSAIFDSGT 331
T G + G D+ + G TP PTY++T+ ++ G V I DSG
Sbjct: 166 ETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRLVTSSSEMIVDSGA 225
Query: 332 SFTYLNDPAYTQISETFNSLAKE--KRETSTSDLPFEYCYV-----------LSPNQTNF 378
T L + + +T TS + CY+ ++P
Sbjct: 226 QRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNGTITPFSNWS 285
Query: 379 EYPVVNLTMKGGGPF-------FVNDP 398
P++ + GG F NDP
Sbjct: 286 ALPLLEIGFAGGAALALPPRNVFYNDP 312
>gi|340810945|gb|AEK75399.1| S5 [Oryza sativa]
gi|340810957|gb|AEK75405.1| S5 [Oryza sativa]
gi|340811007|gb|AEK75430.1| S5 [Oryza nivara]
gi|340811073|gb|AEK75463.1| S5 [Oryza rufipogon]
gi|340811094|gb|AEK75473.1| S5 [Oryza rufipogon]
Length = 357
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 83/327 (25%), Positives = 124/327 (37%), Gaps = 54/327 (16%)
Query: 109 VSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
VS+G+P + +VA+DTGS L W+ C C H ++ +G + D P S TS +V
Sbjct: 3 VSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFD-----PGRSYTSRRVR 57
Query: 168 CNSTLC-------ELQK-QCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
C+S C LQ+ C +C Y V Y + S G +V D L +
Sbjct: 58 CSSVKCGEPRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDSFMD--- 114
Query: 220 VDSRISFGCGR-VQTGSFLDGAAPNGLFGLGMDK--TSVPSILANQGLIPNSFSMCFGSD 276
+ FGC V+ F G G + P IL+ + +FS C +D
Sbjct: 115 ----LMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYK-----AFSYCLPTD 165
Query: 277 GT--GRISFG--DKGSPGQGETPFSLRQTHPTYNITITQ-VSVGGNAVNFEFSAIFDSGT 331
T G + G D+ + G TP PTY++T ++ G V I DSG
Sbjct: 166 ETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTTEMLIANGQRLVTSSSEMIVDSGA 225
Query: 332 SFTYLNDPAYTQISETFNSLAKE--KRETSTSDLPFEYCYV-----------LSPNQTNF 378
T L + + +T TS + CY+ ++P
Sbjct: 226 QRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYICYLSEHDYSGWNGTITPFSNWS 285
Query: 379 EYPVVNLTMKGGGPF-------FVNDP 398
P++ + GG F NDP
Sbjct: 286 ALPLLEIGFAGGAALALSPRNVFYNDP 312
>gi|222637180|gb|EEE67312.1| hypothetical protein OsJ_24552 [Oryza sativa Japonica Group]
Length = 420
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 78/270 (28%), Positives = 108/270 (40%), Gaps = 73/270 (27%)
Query: 108 NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
N+SVG P L+F V DTGSDL W C C C + P +SST SK+
Sbjct: 89 NISVGTPLLTFSVVADTGSDLIWTQCAPCTKCFQQPAPP---------FQPASSSTFSKL 139
Query: 167 PCNSTLCELQ----KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
PC S+ C+ + C + G C Y +Y S T G+L + L + S
Sbjct: 140 PCTSSFCQFLPNSIRTCNATG--CVYNYKYGSGYT--AGYLATETLKVGDASFPS----- 190
Query: 223 RISFGCGRVQTGSFLDGAAPNGL--FGLGMDKTSVPSILANQGLIPNSFSMCFGSD---G 277
++FGC + NGL LG+ + FS C S G
Sbjct: 191 -VAFGC-----------STENGLGQLDLGVGR----------------FSYCLRSGSAAG 222
Query: 278 TGRISFGDKGSPGQG---ETPF-SLRQTHPT-YNITITQVSVGGNAV-----NFEFS--- 324
I FG + G TPF + HP+ Y + +T ++VG + F F+
Sbjct: 223 ASPILFGSLANLTDGNVQSTPFVNNPAVHPSYYYVNLTGITVGETDLPVTTSTFGFTQNG 282
Query: 325 ----AIFDSGTSFTYLNDPAYTQISETFNS 350
I DSGT+ TYL Y + + F S
Sbjct: 283 LGGGTIVDSGTTLTYLAKDGYEMVKQAFLS 312
>gi|18391062|ref|NP_563851.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|2160166|gb|AAB60729.1| F21M12.13 gene product [Arabidopsis thaliana]
gi|21593996|gb|AAM65914.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|26983826|gb|AAN86165.1| unknown protein [Arabidopsis thaliana]
gi|332190367|gb|AEE28488.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 449
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 95/330 (28%), Positives = 129/330 (39%), Gaps = 58/330 (17%)
Query: 64 AHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALD 123
+HR Y L A T + ++GN + N + +G P + LD
Sbjct: 72 SHRLTYLS----SLVAGKPKPTSVPVASGNQLHIGN-----YVVRAKLGTPPQLMFMVLD 122
Query: 124 TGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQK--QCP 180
T +D WLPC C C + S + +SST S V C++ C + CP
Sbjct: 123 TSNDAVWLPCSGCSGCSNASTSFNTN----------SSSTYSTVSCSTAQCTQARGLTCP 172
Query: 181 SAG---SNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFL 237
S+ S C + Y D + S LV+D L LA D V SFGC +G+ L
Sbjct: 173 SSSPQPSVCSFNQSYGGDSSFSAS-LVQDTLTLAPD------VIPNFSFGCINSASGNSL 225
Query: 238 DGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDG----TGRISFGDKGSPGQGE 293
P GL GLG S+ + L FS C S +G + G G P
Sbjct: 226 ---PPQGLMGLGRGPMSL--VSQTTSLYSGVFSYCLPSFRSFYFSGSLKLGLLGQPKSIR 280
Query: 294 -TPFSLRQTHPT-YNITITQVSVGG-----NAVNFEFSA------IFDSGTSFTYLNDPA 340
TP P+ Y + +T VSVG + V F A I DSGT T P
Sbjct: 281 YTPLLRNPRRPSLYYVNLTGVSVGSVQVPVDPVYLTFDANSGAGTIIDSGTVITRFAQPV 340
Query: 341 YTQISETFNSLAKEKRETSTSDL-PFEYCY 369
Y I + F K+ +S S L F+ C+
Sbjct: 341 YEAIRDEFR---KQVNVSSFSTLGAFDTCF 367
>gi|238010910|gb|ACR36490.1| unknown [Zea mays]
gi|413942664|gb|AFW75313.1| hypothetical protein ZEAMMB73_520329 [Zea mays]
Length = 481
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 77/287 (26%), Positives = 118/287 (41%), Gaps = 36/287 (12%)
Query: 114 PALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC 173
P + V LD+ SD+ W+ CV C + QV F Y P+ S +S+ C+S C
Sbjct: 155 PGVIQTVVLDSASDVPWV--QCVPCP--IPPCHPQVDSF--YDPSRSPSSAPFSCSSPTC 208
Query: 174 EL----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCG 229
C A + C Y VRY DG+ ++G + D+L L + + S FGC
Sbjct: 209 TALGPYANGC--ANNQCQYLVRY-PDGSSTSGAYIADLLTL-----DAGNAVSGFKFGCS 260
Query: 230 RVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSP 289
+ GSF AA G+ LG S+ S A++ N+FS C + + F G P
Sbjct: 261 HAEQGSFDARAA--GIMALGGGPESLLSQTASR--YGNAFSYCIPATASDS-GFFTLGVP 315
Query: 290 GQGETPF------SLRQTHPTYNITITQVSVGGNAVN-----FEFSAIFDSGTSFTYLND 338
+ + + RQ Y + + ++VGG + F ++ DS T+ T L
Sbjct: 316 RRASSRYVVTPMVRFRQAATFYGVLLRTITVGGQRLGVAPAVFAAGSVLDSRTAITRLPP 375
Query: 339 PAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNL 385
AY + F S R + CY + N P ++L
Sbjct: 376 TAYQALRSAFRSSMTMYRSAPPKGY-LDTCYDFT-GVVNIRLPKISL 420
>gi|357143847|ref|XP_003573077.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 420
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 85/323 (26%), Positives = 126/323 (39%), Gaps = 58/323 (17%)
Query: 46 LAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLH 105
L + + KG + + RLR A G D T + RL+S+ +
Sbjct: 25 LVLTHVDSKGGYTKTELMRRAVHRSRLR----ALSGYDAT---------SPRLHSVQVEY 71
Query: 106 YTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+++G+P + F+ DTGSDL W C C C D +Y P+ SST S
Sbjct: 72 LMELAIGKPPVPFVALADTGSDLTWTQCQPCKLCFPQ---------DTPVYDPSASSTFS 122
Query: 165 KVPCNSTLCE--LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
+PC+S C + C + S C Y+ Y DG S G L + L L
Sbjct: 123 PLPCSSATCLPIWSRNC-TPSSLCRYRYAY-GDGAYSAGILGTETLTLGPSSAPVSV--G 178
Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC----FGSDGT 278
++FGCG G L+ G GLG S+LA G+ FS C F S
Sbjct: 179 GVAFGCGTDNGGDSLNS---TGTVGLGRGTL---SLLAQLGV--GKFSYCLTDFFNSALD 230
Query: 279 GRISFGDKGSPGQG-----ETPFSLRQTHPT-YNITITQVSVGGNAV-----NFEFSA-- 325
G G TP +P+ Y +++ +S+G + F+
Sbjct: 231 SPFLLGTLAELAPGPSTVQSTPLLQSPQNPSRYFVSLQGISLGDVRLPIPNGTFDLRGDG 290
Query: 326 ----IFDSGTSFTYLNDPAYTQI 344
I DSGT+FT L + + ++
Sbjct: 291 TGGMIVDSGTTFTILAESGFREV 313
>gi|222619890|gb|EEE56022.1| hypothetical protein OsJ_04800 [Oryza sativa Japonica Group]
Length = 423
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 81/336 (24%), Positives = 121/336 (36%), Gaps = 65/336 (19%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ +G PA + +VA+D +D W+PC C C S +SP SST
Sbjct: 102 YIARAGLGTPAQTLLVAIDPSNDAAWVPCSACAGCAASSPS----------FSPTQSSTY 151
Query: 164 SKVPCNSTLCEL--QKQCPS-AGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
VPC S C CP+ GS+C + + Y + + L +D L L + V
Sbjct: 152 RTVPCGSPQCAQVPSPSCPAGVGSSCGFNLTYAASTFQA--VLGQDSLALENN------V 203
Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGR 280
+FGC RV G+ A + L + ++A+QG
Sbjct: 204 VVSYTFGCLRVVNGNSRAAAGAHRL-----RPRAALLLVADQG----------------- 241
Query: 281 ISFGDKGSPGQGETPFSLRQTH-PT-YNITITQVSVGGNAVNFEFSA-----------IF 327
G G P + +T L H P+ Y + + + VG V SA I
Sbjct: 242 -HLGPIGQPKRIKTTPLLYNPHRPSLYYVNMIGIRVGSKVVQVPQSALAFNPVTGSGTII 300
Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLSPNQTNFEYPVVNLTM 387
D+GT FT L P Y + + F + F+ CY P V
Sbjct: 301 DAGTMFTRLAAPVYAAVRDAFRGRVRTPVAPPLGG--FDTCY-----NVTVSVPTVTFMF 353
Query: 388 KGGGPFFVNDPIVIVSSEPKGLYLYCLGVVKSDNVN 423
G + + V++ S G+ + SD VN
Sbjct: 354 AGAVAVTLPEENVMIHSSSGGVACLAMAAGPSDGVN 389
>gi|326513755|dbj|BAJ87896.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 442
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 84/313 (26%), Positives = 119/313 (38%), Gaps = 49/313 (15%)
Query: 68 RYFRLRGRGLAAQ-----GNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQP-ALSFIVA 121
R +R R AA G P T G +NS +H +S+G P + ++
Sbjct: 53 RRMVVRSRARAANLCPYSGATARPATAPVGRANTDVNSEYLIH---LSIGAPRSQPVVLT 109
Query: 122 LDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCP 180
LDTGSD+ W C+ C C + + F+ + NT + V C+ LC +
Sbjct: 110 LDTGSDVVWTQCEPCAECF------TQPLPRFDTAASNTVRS---VACSDPLCNAHSEHG 160
Query: 181 SAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGA 240
C Y Y DG++S G + D + K I FGCG G FL
Sbjct: 161 CFLHGCTYVSGY-GDGSLSFGHFLRDSFTFDDGKGGGKVTVPDIGFGCGMYNAGRFLQ-- 217
Query: 241 APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRIS------FGDKGSPGQG-- 292
G+ G G S+PS L + FS CF + + S GD + G
Sbjct: 218 TETGIAGFGRGPLSLPSQLKVR-----QFSYCFTTRFEAKSSPVFLGGAGDLKAHATGPI 272
Query: 293 -ETPFSLRQTHP-----TYNITITQVSVGGNAVNF-EFSA------IFDSGTSFTYLNDP 339
TPF +R P Y ++ V+VG + E A DSGT T D
Sbjct: 273 LSTPF-VRSLPPGTDNSHYVLSFKGVTVGKTRLPVPEIKADGSGATFIDSGTDITTFPDA 331
Query: 340 AYTQISETFNSLA 352
+ Q+ F + A
Sbjct: 332 VFRQLKSAFIAQA 344
>gi|449440014|ref|XP_004137780.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
gi|449483406|ref|XP_004156582.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 449
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 87/336 (25%), Positives = 139/336 (41%), Gaps = 56/336 (16%)
Query: 108 NVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
+++VG P + + +DTGS+L WL C N+S + ++P SS+ S +P
Sbjct: 76 SLTVGTPPQNVTMVIDTGSELSWLHC---------NTSQNSSSSSSTFNPVWSSSYSPIP 126
Query: 168 CNSTLCELQKQ----CPSAGSN--CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
C+S+ C Q + PS SN C + Y +D + S G L D ++ + S
Sbjct: 127 CSSSTCTDQTRDFPIRPSCDSNQFCHATLSY-ADASSSEGNLATDTFYIGS------SGI 179
Query: 222 SRISFGC-GRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGTG 279
+ FGC + + + + + GL +GM++ S+ S ++ G FS C D +G
Sbjct: 180 PNVVFGCMDSIFSSNSEEDSKNTGL--MGMNRGSL-SFVSQMGF--PKFSYCISEYDFSG 234
Query: 280 RISFGDKG----SPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEF------------ 323
+ GD +P + P ++ V + G V +
Sbjct: 235 LLLLGDANFSWLAPLNYTPLIEMSTPLPYFDRVAYTVQLEGIKVAHKLLPIPESVFEPDH 294
Query: 324 ----SAIFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFE----YCYVLSPN 374
+ DSGT FT+L PAYT + + F N A R S+ F+ CY + N
Sbjct: 295 TGAGQTMVDSGTQFTFLLGPAYTALRDHFLNKTAGSLRVYEDSNFVFQGAMDLCYRVPTN 354
Query: 375 QTNF-EYPVVNLTMKGGGPFFVNDPIVI-VSSEPKG 408
QT P V L +G D I+ V E +G
Sbjct: 355 QTRLPPLPSVTLVFRGAEMTVTGDRILYRVPGERRG 390
>gi|302786560|ref|XP_002975051.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
gi|300157210|gb|EFJ23836.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
Length = 359
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 74/272 (27%), Positives = 108/272 (39%), Gaps = 42/272 (15%)
Query: 108 NVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
+S+G P +DTGSDL WL CD +C H G+ I F+ + SS+ K+P
Sbjct: 8 ELSIGTPPQLIPAMIDTGSDLVWLKCD--NCDHCDLDHHGETIFFS----DASSSYKKLP 61
Query: 168 CNSTLCELQKQC---PSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD--EKQSKSVDS 222
CNST C P C Y+ Y DG+ ++G + D + + + +S
Sbjct: 62 CNSTHCSGMSSAGIGPRCEETCKYKYEY-GDGSRTSGDVGSDRISFRSHGAGEDHRSFFD 120
Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT--GR 280
FGCGR G D GL GLG S+ L ++ + FS C S +
Sbjct: 121 GFLFGCGRKLKG---DWNFTQGLIGLGQKSHSLIQQLGDK--LGYKFSYCLVSYDSPPSA 175
Query: 281 ISFGDKGSPG--QGETPFSLRQTH------PTYNITITQVSVGGNAVN------------ 320
SF GS +G S H Y + + ++VGG V
Sbjct: 176 KSFLFLGSSAALRGHDVVSTPILHGDHLDQTLYYVDLQSITVGGVPVVVYDKESGHNTSV 235
Query: 321 ---FEFSAIFDSGTSFTYLNDPAYTQISETFN 349
+ DSGT++T L P Y + ++
Sbjct: 236 GPFLANKTVIDSGTTYTLLTPPVYEAMRKSIE 267
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.320 0.137 0.421
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 7,403,271,145
Number of Sequences: 23463169
Number of extensions: 335123496
Number of successful extensions: 674075
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 316
Number of HSP's successfully gapped in prelim test: 2285
Number of HSP's that attempted gapping in prelim test: 669863
Number of HSP's gapped (non-prelim): 2957
length of query: 444
length of database: 8,064,228,071
effective HSP length: 146
effective length of query: 298
effective length of database: 8,933,572,693
effective search space: 2662204662514
effective search space used: 2662204662514
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 78 (34.7 bits)