BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 016583
(387 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255586856|ref|XP_002534038.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223525945|gb|EEF28342.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 533
Score = 510 bits (1313), Expect = e-142, Method: Compositional matrix adjust.
Identities = 245/351 (69%), Positives = 287/351 (81%), Gaps = 5/351 (1%)
Query: 25 FGFGTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDK 84
+GFGTFGFD HHRYSDPVKG+L+VDDLP+KGS YY+++AHRD + GR L + N
Sbjct: 36 YGFGTFGFDLHHRYSDPVKGMLSVDDLPEKGSLHYYASMAHRD--ILIHGRKLVSD-NTS 92
Query: 85 TPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVS--CVHGL 142
TPLTF +GN+TYR +SLGFLHY NVS+G P+LS++VALDTGSDLFWLPCDC + CV GL
Sbjct: 93 TPLTFFSGNETYRFSSLGFLHYANVSIGTPSLSYLVALDTGSDLFWLPCDCTNSGCVQGL 152
Query: 143 NSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGF 202
SG+ IDFNIY PN SSTS +PCN+TLC Q +CPSA S CPYQV+YLS+GT STG
Sbjct: 153 QFPSGEQIDFNIYRPNASSTSQTIPCNNTLCSRQSRCPSAQSTCPYQVQYLSNGTSSTGV 212
Query: 203 LVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQ 262
LVED+LHL TD+ QS+++D++I FGCGRVQTGSFLDGAAPNGLFGLGM SVPS LA +
Sbjct: 213 LVEDLLHLTTDDAQSRALDAKIIFGCGRVQTGSFLDGAAPNGLFGLGMTNISVPSTLARE 272
Query: 263 GLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE 322
G NSFSMCFG DG GRISFGD GS GQGETPF+LRQ HPTYN++IT+++VGG + E
Sbjct: 273 GYTSNSFSMCFGRDGIGRISFGDTGSSGQGETPFNLRQLHPTYNVSITKINVGGRDADLE 332
Query: 323 FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRS 373
FSAIFDSGTSFTYLNDPAYT ISE+FN AKEKR +S SD+PFEYCY + S
Sbjct: 333 FSAIFDSGTSFTYLNDPAYTLISESFNIGAKEKRYSSISDIPFEYCYEMSS 383
>gi|356540838|ref|XP_003538891.1| PREDICTED: peroxidase [Glycine max]
Length = 829
Score = 472 bits (1215), Expect = e-130, Method: Compositional matrix adjust.
Identities = 235/366 (64%), Positives = 277/366 (75%), Gaps = 10/366 (2%)
Query: 24 CFGFGTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGND 83
C +FGFD HHR+SDPVK IL V DLP KG+ YY +AHRDR FR GR LAA +
Sbjct: 24 CHALNSFGFDIHHRFSDPVKEILGVHDLPDKGTRLYYVVMAHRDRIFR--GRRLAAAVH- 80
Query: 84 KTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLN 143
+PLTF N+TY++ + GFLH+ NVSVG P LSF+VALDTGSDLFWLPC+C CV G+
Sbjct: 81 HSPLTFVPANETYQIGAFGFLHFANVSVGTPPLSFLVALDTGSDLFWLPCNCTKCVRGV- 139
Query: 144 SSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFL 203
S+G+ I FNIY SSTS V CNS LCELQ+QCPS+ S CPY+V YLS+GT +TGFL
Sbjct: 140 ESNGEKIAFNIYDLKGSSTSQTVLCNSNLCELQRQCPSSDSICPYEVNYLSNGTSTTGFL 199
Query: 204 VEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQG 263
VEDVLHL TD+ ++K D+RI+FGCG+VQTG+FLDGAAPNGLFGLGM SVPSILA +G
Sbjct: 200 VEDVLHLITDDDETKDADTRITFGCGQVQTGAFLDGAAPNGLFGLGMGNESVPSILAKEG 259
Query: 264 LIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEF 323
L NSFSMCFGSDG GRI+FGD S QG+TPF+LR HPTYNIT+TQ+ VGGNA + EF
Sbjct: 260 LTSNSFSMCFGSDGLGRITFGDNSSLVQGKTPFNLRALHPTYNITVTQIIVGGNAADLEF 319
Query: 324 SAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTS--DLPFEYCYVLRSFLHLQALV 381
AIFDSGTSFT+LNDPAY QI+ +FNS K +R +S+S +LPFEYCY L S V
Sbjct: 320 HAIFDSGTSFTHLNDPAYKQITNSFNSAIKLQRYSSSSSDELPFEYCYDLSS----NKTV 375
Query: 382 VLPFPL 387
LP L
Sbjct: 376 ELPINL 381
>gi|356559246|ref|XP_003547911.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 516
Score = 467 bits (1202), Expect = e-129, Method: Compositional matrix adjust.
Identities = 224/352 (63%), Positives = 278/352 (78%), Gaps = 8/352 (2%)
Query: 23 CCFGFGTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGN 82
CC+G TFGFD HHR+SD +KG+L +DD+P+KG+ YY+ +AHRDR FR GR LA +
Sbjct: 26 CCYGLSTFGFDIHHRFSDQIKGMLGIDDVPQKGTPQYYAVMAHRDRVFR--GRRLAG-AD 82
Query: 83 DKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHG- 141
+PLTF+AGNDT+++ S GFLH+ NVSVG P L F+VALDTGSDLFWLPCDC+SCVHG
Sbjct: 83 HHSPLTFAAGNDTHQIASSGFLHFANVSVGTPPLWFLVALDTGSDLFWLPCDCISCVHGG 142
Query: 142 LNSSSGQVIDFNIYSPNTSSTSSKVPCN-STLCELQKQCPSAGSNCPYQVRYLSDGTMST 200
L + +G+++ FN Y + SSTS++V CN ST C ++QCPSAGS C YQV YLS+ T S
Sbjct: 143 LRTRTGKILKFNTYDLDKSSTSNEVSCNNSTFCRQRQQCPSAGSTCRYQVDYLSNDTSSR 202
Query: 201 GFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILA 260
GF+VEDVLHL TD+ Q+K D+RI+FGCG+VQTG FL+GAAPNGLFGLGMD SVPSILA
Sbjct: 203 GFVVEDVLHLITDDDQTKDADTRIAFGCGQVQTGVFLNGAAPNGLFGLGMDNISVPSILA 262
Query: 261 NQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVN 320
+GLI NSFSMCFGSD GRI+FGD GSP Q +TPF++R+ HPTYNITIT++ V + +
Sbjct: 263 REGLISNSFSMCFGSDSAGRITFGDTGSPDQRKTPFNVRKLHPTYNITITKIIVEDSVAD 322
Query: 321 FEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETST---SDLPFEYCY 369
EF AIFDSGTSFTY+NDPAYT+I E +NS K KR +S S++PF+YCY
Sbjct: 323 LEFHAIFDSGTSFTYINDPAYTRIGEMYNSKVKAKRHSSQSPDSNIPFDYCY 374
>gi|312282765|dbj|BAJ34248.1| unnamed protein product [Thellungiella halophila]
Length = 515
Score = 459 bits (1181), Expect = e-126, Method: Compositional matrix adjust.
Identities = 223/361 (61%), Positives = 274/361 (75%), Gaps = 5/361 (1%)
Query: 12 VLLILLSCCAGCCFGFGTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFR 71
+L+++ S C G G FGF+FHHR+SD V G+L D LP + S YY +AHRDR
Sbjct: 15 ILMLVSSWVLDRCEGLGEFGFEFHHRFSDQVVGVLPGDGLPNRDSSKYYRVMAHRDRL-- 72
Query: 72 LRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWL 131
+RGR LA++ D++ +TF+ GN+T R+N+LGFLHY NV+VG P+ F+VALDTGSDLFWL
Sbjct: 73 IRGRRLASE--DQSLVTFADGNETIRVNALGFLHYANVTVGTPSDWFLVALDTGSDLFWL 130
Query: 132 PCDC-VSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQV 190
PCDC +CV L + G +D NIYSPN SSTSSKVPCNSTLC +C S S+CPYQ+
Sbjct: 131 PCDCSTNCVRELKAPGGSSLDLNIYSPNASSTSSKVPCNSTLCTRVDRCASPLSDCPYQI 190
Query: 191 RYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGM 250
RYLS+GT STG LVEDVLHL + EK SK + +RI+ GCG VQTG F DGAAPNGLFGLG+
Sbjct: 191 RYLSNGTSSTGVLVEDVLHLVSMEKNSKPIRARITLGCGLVQTGVFHDGAAPNGLFGLGL 250
Query: 251 DKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITIT 310
+ SVPS+LA +G+ NSFSMCFG DG GRISFGDKGS Q ETP ++RQ HPTYN+T+T
Sbjct: 251 EDISVPSVLAKEGIAANSFSMCFGDDGAGRISFGDKGSVDQRETPLNIRQPHPTYNVTVT 310
Query: 311 QVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYV 370
Q+SVGGN + EF A+FD+GTSFTYL D YT ISE+FNSLA +KR + S+LPFEYCY
Sbjct: 311 QISVGGNTGDLEFDAVFDTGTSFTYLTDAPYTLISESFNSLALDKRYQTDSELPFEYCYA 370
Query: 371 L 371
+
Sbjct: 371 V 371
>gi|357517935|ref|XP_003629256.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355523278|gb|AET03732.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 544
Score = 457 bits (1175), Expect = e-126, Method: Compositional matrix adjust.
Identities = 223/352 (63%), Positives = 272/352 (77%), Gaps = 11/352 (3%)
Query: 24 CFGFGTFGFDFHHRYSDPVKGILAVDD---LPKKGSFAYYSALAHRDRYFRLRGRGLAAQ 80
C+ G FG D HHR+SDPV IL + + LP KG+ YY+A+ HRDR F GR LA
Sbjct: 33 CYSLGKFGLDIHHRFSDPVTEILGIGNDELLPHKGTPQYYAAMVHRDRVFH--GRRLA-- 88
Query: 81 GNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVH 140
+ TP+TF+AGN+T+++ + GFLH+ NVSVG P L F+VALDTGSDLFWLPC+C SCV
Sbjct: 89 DDRDTPITFAAGNETHQIAAFGFLHFANVSVGTPPLWFLVALDTGSDLFWLPCNCTSCVR 148
Query: 141 GLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMST 200
GL + +G+VID NIY + SST VPCNS +C+ Q QC S+GS+C Y+V YLS+ T S+
Sbjct: 149 GLKTQNGKVIDLNIYELDKSSTRKNVPCNSNMCK-QTQCHSSGSSCRYEVEYLSNDTSSS 207
Query: 201 GFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILA 260
GFLVEDVLHL TD Q+K +D++I+ GCG+VQTG FL+GAAPNGLFGLGM+ SVPSILA
Sbjct: 208 GFLVEDVLHLITDNDQTKDIDTQITIGCGQVQTGVFLNGAAPNGLFGLGMENVSVPSILA 267
Query: 261 NQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVN 320
+GLI +SFSMCFGSDG+GRI+FGD GS QG+TPF+LR++HPTYN+TITQ+ VGG A +
Sbjct: 268 QKGLISDSFSMCFGSDGSGRITFGDTGSSDQGKTPFNLRESHPTYNVTITQIIVGGYAAD 327
Query: 321 FEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRE---TSTSDLPFEYCY 369
EF AIFDSGTSFTYLNDPAYT ISE FNSL K R + SDLPFEYCY
Sbjct: 328 HEFHAIFDSGTSFTYLNDPAYTLISEKFNSLVKANRHSPLSPDSDLPFEYCY 379
>gi|356496606|ref|XP_003517157.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 508
Score = 455 bits (1170), Expect = e-125, Method: Compositional matrix adjust.
Identities = 228/350 (65%), Positives = 271/350 (77%), Gaps = 6/350 (1%)
Query: 24 CFGFGTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGND 83
C +FGFD HHR+SDPVK IL V DLP KG+ YY A+AHRDR FR GR LAA
Sbjct: 24 CHALHSFGFDIHHRFSDPVKEILGVHDLPDKGTRQYYVAMAHRDRIFR--GRRLAA--GY 79
Query: 84 KTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLN 143
+PLTF N+TY++ + GFLH+ NVSVG P LSF+VALDTGSDLFWLPC+C CVHG+
Sbjct: 80 HSPLTFIPSNETYQIEAFGFLHFANVSVGTPPLSFLVALDTGSDLFWLPCNCTKCVHGIG 139
Query: 144 SSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFL 203
S+G+ I FNIY SSTS V CNS+LCELQ+QCPS+ + CPY+V YLS+GT +TGFL
Sbjct: 140 LSNGEKIAFNIYDLKGSSTSQPVLCNSSLCELQRQCPSSDTICPYEVNYLSNGTSTTGFL 199
Query: 204 VEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQG 263
VEDVLHL TD+ ++K D+RI+FGCG+VQTG+FLDGAAPNGLFGLGM SVPSILA +G
Sbjct: 200 VEDVLHLITDDDKTKDADTRITFGCGQVQTGAFLDGAAPNGLFGLGMSNESVPSILAKEG 259
Query: 264 LIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEF 323
L NSFSMCFGSDG GRI+FGD S QG+TPF+LR HPTYNIT+TQ+ VG + EF
Sbjct: 260 LTSNSFSMCFGSDGLGRITFGDNSSLVQGKTPFNLRALHPTYNITVTQIIVGEKVDDLEF 319
Query: 324 SAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTS--DLPFEYCYVL 371
AIFDSGTSFTYLNDPAY QI+ +FNS K +R +++S +LPFEYCY L
Sbjct: 320 HAIFDSGTSFTYLNDPAYKQITNSFNSEIKLQRHSTSSSNELPFEYCYEL 369
>gi|357483911|ref|XP_003612242.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355513577|gb|AES95200.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 527
Score = 451 bits (1161), Expect = e-124, Method: Compositional matrix adjust.
Identities = 222/356 (62%), Positives = 275/356 (77%), Gaps = 8/356 (2%)
Query: 24 CFGFGTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGN- 82
C+G +FGFD HHR+SDPVKGIL +D++P KGS YY A+AHRDR FR GR LA G+
Sbjct: 33 CYGSSSFGFDIHHRFSDPVKGILGIDNIPDKGSREYYVAMAHRDRVFR--GRRLADGGDV 90
Query: 83 DKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGL 142
D+ LTFS N TY+++ G+LH+ NVSVG PA S++VALDTGSDLFWLPC+C CVHG+
Sbjct: 91 DQKLLTFSPDNTTYQISLFGYLHFANVSVGTPASSYLVALDTGSDLFWLPCNCTKCVHGI 150
Query: 143 NSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQC-PSAGSNCPYQVRYLSDGTMSTG 201
S+GQ I FNIY SSTS V CNS+LCE + QC S+G CPYQV YLS+ T +TG
Sbjct: 151 QLSTGQKIAFNIYDNKESSTSKNVACNSSLCEQKTQCSSSSGGTCPYQVEYLSENTSTTG 210
Query: 202 FLVEDVLHLATD-EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILA 260
FLVEDVLHL TD + Q++ + I+FGCG+VQTG+FLDGAAPNGLFGLGM SVPSILA
Sbjct: 211 FLVEDVLHLITDNDDQTQHANPLITFGCGQVQTGAFLDGAAPNGLFGLGMSDVSVPSILA 270
Query: 261 NQGLIPNSFSMCFGSDGTGRISFGDKGSP-GQGETPFSLRQTHPTYNITITQVSVGGNAV 319
QGL NSFSMCF +DG GRI+FGD S QG+TPF++R +H TYNIT+TQ+ VGGN+
Sbjct: 271 KQGLTSNSFSMCFAADGLGRITFGDNNSSLDQGKTPFNIRPSHSTYNITVTQIIVGGNSA 330
Query: 320 NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRE--TSTSDLPFEYCYVLRS 373
+ EF+AIFD+GTSFTYLN+PAY QI+++F+S K +R +++ DLPFEYCY LR+
Sbjct: 331 DLEFNAIFDTGTSFTYLNNPAYKQITQSFDSKIKLQRHSFSNSDDLPFEYCYDLRT 386
>gi|30680102|ref|NP_849967.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|17978947|gb|AAL47439.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
gi|22655368|gb|AAM98276.1| At2g17760/At2g17760 [Arabidopsis thaliana]
gi|330251585|gb|AEC06679.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 513
Score = 449 bits (1155), Expect = e-123, Method: Compositional matrix adjust.
Identities = 216/338 (63%), Positives = 267/338 (78%), Gaps = 5/338 (1%)
Query: 35 HHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGND 94
HHR+SD V G+L D LP + S YY +AHRDR +RGR LA + D++ +TFS GN+
Sbjct: 38 HHRFSDQVVGVLPGDGLPNRDSSKYYRVMAHRDRL--IRGRRLANE--DQSLVTFSDGNE 93
Query: 95 TYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNI 154
T R+++LGFLHY NV+VG P+ F+VALDTGSDLFWLPCDC +CV L + G +D NI
Sbjct: 94 TVRVDALGFLHYANVTVGTPSDWFMVALDTGSDLFWLPCDCTNCVRELKAPGGSSLDLNI 153
Query: 155 YSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
YSPN SSTS+KVPCNSTLC +C S S+CPYQ+RYLS+GT STG LVEDVLHL +++
Sbjct: 154 YSPNASSTSTKVPCNSTLCTRGDRCASPESDCPYQIRYLSNGTSSTGVLVEDVLHLVSND 213
Query: 215 KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG 274
K SK++ +R++FGCG+VQTG F DGAAPNGLFGLG++ SVPS+LA +G+ NSFSMCFG
Sbjct: 214 KSSKAIPARVTFGCGQVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSFSMCFG 273
Query: 275 SDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIFDSGTSFT 334
+DG GRISFGDKGS Q ETP ++RQ HPTYNIT+T++SVGGN + EF A+FDSGTSFT
Sbjct: 274 NDGAGRISFGDKGSVDQRETPLNIRQPHPTYNITVTKISVGGNTGDLEFDAVFDSGTSFT 333
Query: 335 YLNDPAYTQISETFNSLAKEKR-ETSTSDLPFEYCYVL 371
YL D AYT ISE+FNSLA +KR +T+ S+LPFEYCY L
Sbjct: 334 YLTDAAYTLISESFNSLALDKRYQTTDSELPFEYCYAL 371
>gi|297832400|ref|XP_002884082.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297329922|gb|EFH60341.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 513
Score = 446 bits (1147), Expect = e-123, Method: Compositional matrix adjust.
Identities = 215/338 (63%), Positives = 265/338 (78%), Gaps = 5/338 (1%)
Query: 35 HHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGND 94
HHR+SD V G+L D LP + S YY +AHRDR +RGR LA + D++ +TFS GN+
Sbjct: 38 HHRFSDQVVGVLPGDGLPNRDSSKYYRVMAHRDRL--IRGRRLANE--DQSLVTFSDGNE 93
Query: 95 TYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNI 154
T R+++LGFLHY NV+VG P+ F+VALDTGSDLFWLPCDC +CV L + G +D NI
Sbjct: 94 TIRVDALGFLHYANVTVGTPSDWFLVALDTGSDLFWLPCDCTNCVRELKAPGGSSLDLNI 153
Query: 155 YSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
YSPN SSTS+KVPCNSTLC +C S SNCPYQ+RYLS+GT STG LVEDVLHL +++
Sbjct: 154 YSPNASSTSTKVPCNSTLCTRGDRCASPESNCPYQIRYLSNGTSSTGVLVEDVLHLVSND 213
Query: 215 KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG 274
K SK++ +R++ GCG+VQTG F DGAAPNGLFGLG++ SVPS+LA +G+ NSFSMCFG
Sbjct: 214 KSSKAIPARVTLGCGQVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSFSMCFG 273
Query: 275 SDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIFDSGTSFT 334
+DG GRISFGDKGS Q ETP ++RQ HPTYNIT+T++SV GN + EF A+FDSGTSFT
Sbjct: 274 NDGAGRISFGDKGSVDQRETPLNIRQPHPTYNITVTKISVEGNTGDLEFDAVFDSGTSFT 333
Query: 335 YLNDPAYTQISETFNSLAKEKR-ETSTSDLPFEYCYVL 371
YL D AYT ISE+FNSLA +KR +T+ S+LPFEYCY L
Sbjct: 334 YLTDAAYTLISESFNSLALDKRYQTTDSELPFEYCYAL 371
>gi|449456843|ref|XP_004146158.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 547
Score = 414 bits (1065), Expect = e-113, Method: Compositional matrix adjust.
Identities = 199/345 (57%), Positives = 253/345 (73%), Gaps = 4/345 (1%)
Query: 27 FGTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTP 86
FG+F F+ HH YS V+ IL P +G+ YY+A+ D + R G Q D P
Sbjct: 55 FGSFTFNIHHLYSPAVRQILPFHSFPDEGTLDYYAAMVRTDHFVHSRRLG---QVQDHRP 111
Query: 87 LTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSS 146
LTF +GN+T R++ LGFL+Y V+VG P + ++VALDTGSDLFWLPCDCV+C+ GLN++
Sbjct: 112 LTFLSGNETLRISPLGFLYYAEVTVGTPGVPYLVALDTGSDLFWLPCDCVNCITGLNTTQ 171
Query: 147 GQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVED 206
G V +FNIYSPN SSTS +V C+S+LC QC S CPYQV YLSD T STG+LVED
Sbjct: 172 GPV-NFNIYSPNNSSTSKEVQCSSSLCSHLDQCSSPSDTCPYQVSYLSDNTSSTGYLVED 230
Query: 207 VLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIP 266
+LHL T++ QSK V++RI+ GCG+ Q+G+FL AAPNGLFGLG++ SVPSILAN GLI
Sbjct: 231 ILHLTTNDVQSKPVNARITLGCGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLIS 290
Query: 267 NSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAI 326
NSFS+CFG GRI FGDKGSPGQ ETPF+L + HPTYN++ITQ+ VGG+ + + + I
Sbjct: 291 NSFSLCFGPARMGRIEFGDKGSPGQNETPFNLGRRHPTYNVSITQIGVGGHISDLDVAVI 350
Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVL 371
FDSGTSFTYLNDPAY+ ++ F S+ +EK+ T SD+PFE CY L
Sbjct: 351 FDSGTSFTYLNDPAYSLFADKFASMVEEKQFTMNSDIPFENCYEL 395
>gi|449495082|ref|XP_004159729.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
1-like [Cucumis sativus]
Length = 524
Score = 414 bits (1063), Expect = e-113, Method: Compositional matrix adjust.
Identities = 199/345 (57%), Positives = 253/345 (73%), Gaps = 4/345 (1%)
Query: 27 FGTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTP 86
FG+F F+ HH YS V+ IL P +G+ YY+A+ D + R G Q D P
Sbjct: 32 FGSFTFNIHHLYSPAVRQILPFHSFPDEGTLDYYAAMVRTDXFVHSRRLG---QVQDHRP 88
Query: 87 LTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSS 146
LTF +GN+T R++ LGFL+Y V+VG P + ++VALDTGSDLFWLPCDCV+C+ GLN++
Sbjct: 89 LTFLSGNETLRISPLGFLYYAEVTVGTPGVPYLVALDTGSDLFWLPCDCVNCITGLNTTQ 148
Query: 147 GQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVED 206
G V +FNIYSPN SSTS +V C+S+LC QC S CPYQV YLSD T STG+LVED
Sbjct: 149 GPV-NFNIYSPNNSSTSKEVQCSSSLCSHLDQCSSPSDTCPYQVSYLSDNTSSTGYLVED 207
Query: 207 VLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIP 266
+LHL T++ QSK V++RI+ GCG+ Q+G+FL AAPNGLFGLG++ SVPSILAN GLI
Sbjct: 208 ILHLTTNDVQSKPVNARITLGCGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLIS 267
Query: 267 NSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAI 326
NSFS+CFG GRI FGDKGSPGQ ETPF+L + HPTYN++ITQ+ VGG+ + + + I
Sbjct: 268 NSFSLCFGPARMGRIEFGDKGSPGQNETPFNLGRRHPTYNVSITQIGVGGHISDLDVAVI 327
Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVL 371
FDSGTSFTYLNDPAY+ ++ F S+ +EK+ T SD+PFE CY L
Sbjct: 328 FDSGTSFTYLNDPAYSLFADKFASMVEEKQFTMNSDIPFENCYEL 372
>gi|25347778|pir||B84556 hypothetical protein At2g17760 [imported] - Arabidopsis thaliana
Length = 473
Score = 412 bits (1060), Expect = e-112, Method: Compositional matrix adjust.
Identities = 203/327 (62%), Positives = 251/327 (76%), Gaps = 21/327 (6%)
Query: 63 LAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGF----------------LHY 106
+AHRDR +RGR LA + D++ +TFS GN+T R+++LGF LHY
Sbjct: 1 MAHRDRL--IRGRRLANE--DQSLVTFSDGNETVRVDALGFFKVNVFMETCELFMRDLHY 56
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
NV+VG P+ F+VALDTGSDLFWLPCDC +CV L + G +D NIYSPN SSTS+KV
Sbjct: 57 ANVTVGTPSDWFMVALDTGSDLFWLPCDCTNCVRELKAPGGSSLDLNIYSPNASSTSTKV 116
Query: 167 PCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISF 226
PCNSTLC +C S S+CPYQ+RYLS+GT STG LVEDVLHL +++K SK++ +R++F
Sbjct: 117 PCNSTLCTRGDRCASPESDCPYQIRYLSNGTSSTGVLVEDVLHLVSNDKSSKAIPARVTF 176
Query: 227 GCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDK 286
GCG+VQTG F DGAAPNGLFGLG++ SVPS+LA +G+ NSFSMCFG+DG GRISFGDK
Sbjct: 177 GCGQVQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSFSMCFGNDGAGRISFGDK 236
Query: 287 GSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISE 346
GS Q ETP ++RQ HPTYNIT+T++SVGGN + EF A+FDSGTSFTYL D AYT ISE
Sbjct: 237 GSVDQRETPLNIRQPHPTYNITVTKISVGGNTGDLEFDAVFDSGTSFTYLTDAAYTLISE 296
Query: 347 TFNSLAKEKR-ETSTSDLPFEYCYVLR 372
+FNSLA +KR +T+ S+LPFEYCY LR
Sbjct: 297 SFNSLALDKRYQTTDSELPFEYCYALR 323
>gi|224133616|ref|XP_002327639.1| predicted protein [Populus trichocarpa]
gi|222836724|gb|EEE75117.1| predicted protein [Populus trichocarpa]
Length = 484
Score = 393 bits (1010), Expect = e-107, Method: Compositional matrix adjust.
Identities = 209/347 (60%), Positives = 250/347 (72%), Gaps = 31/347 (8%)
Query: 63 LAHRDRYFRLRGRGLAA-----QGNDKTPLTFSAGNDTYRLNSLGF-------------- 103
+A RDR + GR LA N+KT LTF GN+TYR++ LG
Sbjct: 1 MAQRDRV--IHGRRLATSTGGDNKNNKTLLTFYYGNETYRIDGLGLRNSCVSLYSNGLFG 58
Query: 104 --LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSS 161
LHY NVSVG P++SF+VALDTGS+L WLPCDC SCVH L S SG V D NIYSPNTSS
Sbjct: 59 YILHYANVSVGTPSVSFLVALDTGSNLLWLPCDCSSCVHSLRSPSGTV-DLNIYSPNTSS 117
Query: 162 TSSKVPCNSTLCELQKQ--CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
TS KVPCNSTLC ++ CPS SNCPYQV YLS+GT +TG++V+D+LHL +D+ QSK+
Sbjct: 118 TSEKVPCNSTLCSQTQRDRCPSDQSNCPYQVVYLSNGTSTTGYIVQDLLHLISDDSQSKA 177
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTG 279
VD++I+FGCG+VQTGSFL G APNGLFGLGM SVPS LA+ G SFSMCF +G G
Sbjct: 178 VDAKITFGCGKVQTGSFLTGGAPNGLFGLGMSNISVPSTLAHNGYTSGSFSMCFSPNGIG 237
Query: 280 RISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSAIFDSGTSFTYLND 338
RISFGDKGS GQGET F+ Q + YNI+ITQ S+GG A + +SAIFDSGTSFTYLND
Sbjct: 238 RISFGDKGSTGQGETSFNQGQPRSSLYNISITQTSIGGQASDLVYSAIFDSGTSFTYLND 297
Query: 339 PAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRSFLHLQALVVLPF 385
PAYT I+E+FN L KE R +ST +PF+YCY +RSF+ Q +LPF
Sbjct: 298 PAYTLIAESFNKLVKETRRSST-QVPFDYCYDIRSFISAQ---ILPF 340
>gi|224096686|ref|XP_002310698.1| predicted protein [Populus trichocarpa]
gi|222853601|gb|EEE91148.1| predicted protein [Populus trichocarpa]
Length = 441
Score = 393 bits (1009), Expect = e-106, Method: Compositional matrix adjust.
Identities = 194/358 (54%), Positives = 254/358 (70%), Gaps = 7/358 (1%)
Query: 13 LLILLSCCAGCCFGFGTFGFDFHHRYSDPVKGILAV-DDLPKKGSFAYYSALAHRDRYFR 71
LLI + + C G F F HHR+SD K + + P+KGSF YY+ALAHRD+
Sbjct: 10 LLITIWVFSKTCKG-RVFTFKMHHRFSDSFKNWSGLTRNWPEKGSFEYYAALAHRDQM-- 66
Query: 72 LRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWL 131
LRGR L+ + L FS GN T+R++SLGFLHYT V +G P + F+VALDTGSDLFW+
Sbjct: 67 LRGRRLS---DADASLAFSDGNSTFRISSLGFLHYTTVELGTPGVKFMVALDTGSDLFWV 123
Query: 132 PCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVR 191
PCDC C +S + +IY+P SSTS KV CN+ +C + +C S+CPY V
Sbjct: 124 PCDCSRCAPTHGASYASDFELSIYNPRESSTSKKVTCNNDMCAQRNRCLGTFSSCPYIVS 183
Query: 192 YLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMD 251
Y+S T ++G LV+DVLHL T++ + V++ ++FGCG+VQ+GSFLD AAPNGLFGLGM+
Sbjct: 184 YVSAQTSTSGILVKDVLHLTTEDGGREFVEAYVTFGCGQVQSGSFLDIAAPNGLFGLGME 243
Query: 252 KTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQ 311
K SVPS+L+ +GLI +SFSMCFG DG GRISFGDKGSP Q ETPF++ HPTYN+T+TQ
Sbjct: 244 KISVPSVLSREGLIADSFSMCFGHDGIGRISFGDKGSPDQEETPFNVNPAHPTYNVTVTQ 303
Query: 312 VSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
VG ++ EF+A+FDSGTSFTY+ DPAY+++SE F+SLA++KR +PFEYCY
Sbjct: 304 ARVGTMLIDVEFTALFDSGTSFTYMVDPAYSRVSEKFHSLARDKRRPPDPRIPFEYCY 361
>gi|356502091|ref|XP_003519855.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 519
Score = 390 bits (1003), Expect = e-106, Method: Compositional matrix adjust.
Identities = 193/365 (52%), Positives = 251/365 (68%), Gaps = 13/365 (3%)
Query: 10 VCVLLILLSCCAGCCFGFGTFGFDFHHRYSDPVK-----GILAVDDLPKKGSFAYYSALA 64
+ LL L CC C G + F HHR+S+PV+ + P++G+ YY+ LA
Sbjct: 8 IVSLLSLWECCQ--CHGH-VYTFTMHHRHSEPVRKWSHSAAAGIPAPPEEGTVEYYAELA 64
Query: 65 HRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDT 124
RDR LRGR L+ L FS GN T+R++SLGFLHYT V +G P + F+VALDT
Sbjct: 65 DRDRL--LRGRKLS---QIDAGLAFSDGNSTFRISSLGFLHYTTVQIGTPGVKFMVALDT 119
Query: 125 GSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGS 184
GSDLFW+PCDC C +++ D N+Y+PN SSTS KV CN++LC + QC S
Sbjct: 120 GSDLFWVPCDCTRCAASDSTAFASDFDLNVYNPNGSSTSKKVTCNNSLCTHRSQCLGTFS 179
Query: 185 NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNG 244
NCPY V Y+S T ++G LVEDVLHL ++ V++ + FGCG++Q+GSFLD AAPNG
Sbjct: 180 NCPYMVSYVSAETSTSGILVEDVLHLTQEDNHHDLVEANVIFGCGQIQSGSFLDVAAPNG 239
Query: 245 LFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPT 304
LFGLGM+K SVPS+L+ +G +SFSMCFG DG GRISFGDKGS Q ETPF+L +HPT
Sbjct: 240 LFGLGMEKISVPSMLSREGFTADSFSMCFGRDGIGRISFGDKGSFDQDETPFNLNPSHPT 299
Query: 305 YNITITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP 364
YNIT+TQV VG ++ EF+A+FDSGTSFTYL DP YT+++E+F+S +++R S S +P
Sbjct: 300 YNITVTQVRVGTTVIDVEFTALFDSGTSFTYLVDPTYTRLTESFHSQVQDRRHRSDSRIP 359
Query: 365 FEYCY 369
FEYCY
Sbjct: 360 FEYCY 364
>gi|356559244|ref|XP_003547910.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 515
Score = 389 bits (998), Expect = e-105, Method: Compositional matrix adjust.
Identities = 197/365 (53%), Positives = 254/365 (69%), Gaps = 17/365 (4%)
Query: 13 LLILLSCCAGCCFGFGTFGFDFHHRYSDPVK--------GILAVDDLPKKGSFAYYSALA 64
+ I+ S C G + F HHR+S+PV+ GI A P+KG+ YY+ LA
Sbjct: 5 VFIIASLFLSLCHGH-VYTFTMHHRHSEPVRKWSHSTASGIPAP---PEKGTVEYYAELA 60
Query: 65 HRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDT 124
RDR LRGR L+ Q +D L FS GN T+R++SLGFLHYT V +G P + F+VALDT
Sbjct: 61 DRDRL--LRGRKLS-QIDDG--LAFSDGNSTFRISSLGFLHYTTVQIGTPGVKFMVALDT 115
Query: 125 GSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGS 184
GSDLFW+PCDC C +S+ D N+Y+PN SSTS KV CN++LC + QC S
Sbjct: 116 GSDLFWVPCDCTRCAATDSSAFASDFDLNVYNPNGSSTSKKVTCNNSLCMHRSQCLGTLS 175
Query: 185 NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNG 244
NCPY V Y+S T ++G LVEDVLHL ++ V++ + FGCG++Q+GSFLD AAPNG
Sbjct: 176 NCPYMVSYVSAETSTSGILVEDVLHLTQEDNHHDLVEANVIFGCGQIQSGSFLDVAAPNG 235
Query: 245 LFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPT 304
LFGLGM+K SVPS+L+ +G +SFSMCFG DG GRISFGDKGS Q ETPF+L +HPT
Sbjct: 236 LFGLGMEKISVPSMLSREGFTADSFSMCFGRDGIGRISFGDKGSFDQDETPFNLNPSHPT 295
Query: 305 YNITITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP 364
YNIT+TQV VG ++ EF+A+FDSGTSFTYL DP YT+++E+F+S +++R S S +P
Sbjct: 296 YNITVTQVRVGTTLIDVEFTALFDSGTSFTYLVDPTYTRLTESFHSQVQDRRHRSDSRIP 355
Query: 365 FEYCY 369
FEYCY
Sbjct: 356 FEYCY 360
>gi|225431324|ref|XP_002269880.1| PREDICTED: aspartic proteinase-like protein 1 [Vitis vinifera]
gi|297739017|emb|CBI28369.3| unnamed protein product [Vitis vinifera]
Length = 518
Score = 387 bits (994), Expect = e-105, Method: Compositional matrix adjust.
Identities = 195/364 (53%), Positives = 251/364 (68%), Gaps = 12/364 (3%)
Query: 13 LLILLSCCAGCCFGFGTFGFDFHHRYSDPVKGI-------LAVDDLPKKGSFAYYSALAH 65
++ILLS F F HHR+S+PVK + P KGSF YY+ LAH
Sbjct: 9 IVILLSILGFRSCHARIFSFQMHHRFSEPVKKWSEGAGNGFPAGNWPAKGSFEYYAELAH 68
Query: 66 RDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTG 125
RDR LRGR L+ + LTFS GN T+R++SLGFLHYT VS+G P F+VALDTG
Sbjct: 69 RDR--ALRGRRLS---DIDGLLTFSDGNSTFRISSLGFLHYTTVSLGTPGKKFLVALDTG 123
Query: 126 SDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSN 185
SDLFW+PCDC C ++ + +IY+P SSTS KV C+++LC + +C SN
Sbjct: 124 SDLFWVPCDCSRCAPTEGTTYASDFELSIYNPKGSSTSRKVTCDNSLCAHRNRCLGTFSN 183
Query: 186 CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGL 245
CPY V Y+S T ++G LVEDVLHL T++ + + V++ ++FGCG+VQTGSFLD AAPNGL
Sbjct: 184 CPYMVSYVSAETSTSGILVEDVLHLTTEDNRQEFVEAYVTFGCGQVQTGSFLDIAAPNGL 243
Query: 246 FGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTY 305
FGLG++K SVPSIL+ +G +SFSMCFG DG GRISFGDKGSP Q ETPF+L HPTY
Sbjct: 244 FGLGLEKISVPSILSKEGFTADSFSMCFGPDGIGRISFGDKGSPDQEETPFNLNALHPTY 303
Query: 306 NITITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPF 365
NIT+TQV VG ++ +F+A+FDSGTSFTYL DP YT + ++F+S A++ R S +PF
Sbjct: 304 NITVTQVRVGTTLIDLDFTALFDSGTSFTYLVDPIYTNVLKSFHSQAQDSRRPPDSRIPF 363
Query: 366 EYCY 369
E+CY
Sbjct: 364 EFCY 367
>gi|42567433|ref|NP_195313.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|190576481|gb|ACE79041.1| At4g35880 [Arabidopsis thaliana]
gi|222423134|dbj|BAH19546.1| AT4G35880 [Arabidopsis thaliana]
gi|332661184|gb|AEE86584.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 524
Score = 385 bits (989), Expect = e-104, Method: Compositional matrix adjust.
Identities = 194/376 (51%), Positives = 261/376 (69%), Gaps = 12/376 (3%)
Query: 1 MASSYRNSPVCVLLILLSCCAGCCFGFGTFGFDFHHRYSDPVK------GILAVDDLPKK 54
M+ + + + ++ IL+ G C G F F+ HHR+SD VK G A P K
Sbjct: 1 MSCCFFKTTLFLIPILMLLSFGSCNG-RIFTFEMHHRFSDEVKQWSDSTGRFA--KFPPK 57
Query: 55 GSFAYYSALAHRDRYFRLRGRGLAAQGNDKTP-LTFSAGNDTYRLNSLGFLHYTNVSVGQ 113
GSF Y++AL RD + +RGR L+ ++ LTFS GN T R++SLGFLHYT V +G
Sbjct: 58 GSFEYFNALVLRD--WLIRGRRLSESESESESSLTFSDGNSTSRISSLGFLHYTTVKLGT 115
Query: 114 PALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC 173
P + F+VALDTGSDLFW+PCDC C ++ + +IY+P S+T+ KV CN++LC
Sbjct: 116 PGMRFMVALDTGSDLFWVPCDCGKCAPTEGATYASEFELSIYNPKVSTTNKKVTCNNSLC 175
Query: 174 ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQT 233
+ QC S CPY V Y+S T ++G L+EDV+HL T++K + V++ ++FGCG+VQ+
Sbjct: 176 AQRNQCLGTFSTCPYMVSYVSAQTSTSGILMEDVMHLTTEDKNPERVEAYVTFGCGQVQS 235
Query: 234 GSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGE 293
GSFLD AAPNGLFGLGM+K SVPS+LA +GL+ +SFSMCFG DG GRISFGDKGS Q E
Sbjct: 236 GSFLDIAAPNGLFGLGMEKISVPSVLAREGLVADSFSMCFGHDGVGRISFGDKGSSDQEE 295
Query: 294 TPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAK 353
TPF+L +HP YNIT+T+V VG ++ EF+A+FD+GTSFTYL DP YT +SE+F+S A+
Sbjct: 296 TPFNLNPSHPNYNITVTRVRVGTTLIDDEFTALFDTGTSFTYLVDPMYTTVSESFHSQAQ 355
Query: 354 EKRETSTSDLPFEYCY 369
+KR + S +PFEYCY
Sbjct: 356 DKRHSPDSRIPFEYCY 371
>gi|255586860|ref|XP_002534040.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223525947|gb|EEF28344.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 518
Score = 384 bits (985), Expect = e-104, Method: Compositional matrix adjust.
Identities = 186/343 (54%), Positives = 241/343 (70%), Gaps = 8/343 (2%)
Query: 30 FGFDFHHRYSDPVKGI---LAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTP 86
F F HHR+SD +K + + P KGSF YY+ LAHRD+ LRGR L N + P
Sbjct: 28 FTFKMHHRFSDMLKDLSDSTTSRNFPSKGSFEYYAELAHRDQM--LRGRKLY---NVEAP 82
Query: 87 LTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSS 146
L FS GN T+R++SLGFLHYT V +G P + F+VALDTGSDLFW+PCDC C +
Sbjct: 83 LAFSDGNSTFRISSLGFLHYTTVELGTPGMKFMVALDTGSDLFWVPCDCSKCAPTQGVAY 142
Query: 147 GQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVED 206
+ +IY P SSTS KV CN+ LC + +C S+CPY V Y+S T ++G LVED
Sbjct: 143 ASDFELSIYDPKQSSTSKKVTCNNNLCAHRNRCLGTFSSCPYMVSYVSAQTSTSGILVED 202
Query: 207 VLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIP 266
VLHL +++ +S+ + ++FGCG+VQ+GSFL+ AAPNGLFGLGMD+ SVPSIL+ +GL
Sbjct: 203 VLHLTSEDSNQESIKAYVTFGCGQVQSGSFLNTAAPNGLFGLGMDQISVPSILSREGLTA 262
Query: 267 NSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAI 326
+SFSMCFG DG GRISFGDKGSP Q ETPF+ +HP+YNI++TQV VG V+ +F+A+
Sbjct: 263 DSFSMCFGHDGVGRISFGDKGSPDQEETPFNSNPSHPSYNISVTQVRVGTTLVDVDFTAL 322
Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
FDSGTSFTYL +P Y +SE F++ A++KR +PFEYCY
Sbjct: 323 FDSGTSFTYLINPIYAMVSENFHAQAQDKRRPPDPRIPFEYCY 365
>gi|3805854|emb|CAA21474.1| putative protein [Arabidopsis thaliana]
gi|7270540|emb|CAB81497.1| putative protein [Arabidopsis thaliana]
Length = 455
Score = 379 bits (972), Expect = e-102, Method: Compositional matrix adjust.
Identities = 195/384 (50%), Positives = 262/384 (68%), Gaps = 16/384 (4%)
Query: 1 MASSYRNSPVCVLLILLSCCAGCCFGFGTFGFDFHHRYSDPVK------GILAVDDLPKK 54
M+ + + + ++ IL+ G C G F F+ HHR+SD VK G A P K
Sbjct: 1 MSCCFFKTTLFLIPILMLLSFGSCNG-RIFTFEMHHRFSDEVKQWSDSTGRFA--KFPPK 57
Query: 55 GSFAYYSALAHRDRYFRLRGRGLAAQGNDKTP-LTFSAGNDTYRLNSLGFLHYTNVSVGQ 113
GSF Y++AL RD + +RGR L+ ++ LTFS GN T R++SLGFLHYT V +G
Sbjct: 58 GSFEYFNALVLRD--WLIRGRRLSESESESESSLTFSDGNSTSRISSLGFLHYTTVKLGT 115
Query: 114 PALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC 173
P + F+VALDTGSDLFW+PCDC C ++ + +IY+P S+T+ KV CN++LC
Sbjct: 116 PGMRFMVALDTGSDLFWVPCDCGKCAPTEGATYASEFELSIYNPKVSTTNKKVTCNNSLC 175
Query: 174 ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQT 233
+ QC S CPY V Y+S T ++G L+EDV+HL T++K + V++ ++FGCG+VQ+
Sbjct: 176 AQRNQCLGTFSTCPYMVSYVSAQTSTSGILMEDVMHLTTEDKNPERVEAYVTFGCGQVQS 235
Query: 234 GSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGE 293
GSFLD AAPNGLFGLGM+K SVPS+LA +GL+ +SFSMCFG DG GRISFGDKGS Q E
Sbjct: 236 GSFLDIAAPNGLFGLGMEKISVPSVLAREGLVADSFSMCFGHDGVGRISFGDKGSSDQEE 295
Query: 294 TPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAK 353
TPF+L +HP YNIT+T+V VG ++ EF+A+FD+GTSFTYL DP YT +SE+ A+
Sbjct: 296 TPFNLNPSHPNYNITVTRVRVGTTLIDDEFTALFDTGTSFTYLVDPMYTTVSES----AQ 351
Query: 354 EKRETSTSDLPFEYCYVLRSFLHL 377
+KR + S +PFEYCY +R L L
Sbjct: 352 DKRHSPDSRIPFEYCYDMREKLVL 375
>gi|297802338|ref|XP_002869053.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297314889|gb|EFH45312.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 522
Score = 377 bits (968), Expect = e-102, Method: Compositional matrix adjust.
Identities = 189/352 (53%), Positives = 250/352 (71%), Gaps = 8/352 (2%)
Query: 22 GCCFGFGTFGFDFHHRYSDPVK----GILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGL 77
G C G F F+ HHR+SD VK P KGSF Y++AL RD + +RGR L
Sbjct: 22 GSCNG-RIFTFEMHHRFSDEVKQWSDSTGRFVKFPPKGSFEYFNALVLRD--WLIRGRRL 78
Query: 78 AAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVS 137
+ ++ + LTFS GN T R++SLGFLHYT V +G P + F+VALDTGSDLFW+PCDC
Sbjct: 79 SDSESESS-LTFSDGNSTSRISSLGFLHYTTVKLGTPGMRFMVALDTGSDLFWVPCDCGK 137
Query: 138 CVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGT 197
C ++ + +IY+P S+T+ KV CN++LC + QC S CPY V Y+S T
Sbjct: 138 CAPTEGATYASEFELSIYNPKISTTNKKVTCNNSLCAQRNQCLGTFSTCPYMVSYVSAQT 197
Query: 198 MSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPS 257
++G L+EDV+HL T++K + V++ ++FGCG+VQ+GSFLD AAPNGLFGLGM+K SVPS
Sbjct: 198 STSGILMEDVMHLTTEDKNPERVEAYVTFGCGQVQSGSFLDIAAPNGLFGLGMEKISVPS 257
Query: 258 ILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGN 317
+LA +GL+ +SFSMCFG DG GRISFGDKGS Q ETPF+L +HP YNIT+T+V VG
Sbjct: 258 VLAREGLVADSFSMCFGHDGVGRISFGDKGSSDQEETPFNLNPSHPNYNITVTRVRVGTT 317
Query: 318 AVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
++ EF+A+FD+GTSFTYL DP YT +SE+F+S A++KR + S +PFEYCY
Sbjct: 318 LIDDEFTALFDTGTSFTYLVDPMYTTVSESFHSQAQDKRHSPDSRIPFEYCY 369
>gi|449434468|ref|XP_004135018.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 568
Score = 374 bits (961), Expect = e-101, Method: Compositional matrix adjust.
Identities = 198/374 (52%), Positives = 255/374 (68%), Gaps = 9/374 (2%)
Query: 1 MASSYRNSPVCVLLILLSCCAGCCFG--FGTFGFDFHHRYSDPVKGILAVDDLPKKGSFA 58
MAS++ + +L++ + AG +F FD HHR+SD +KGI + LP+K +
Sbjct: 1 MASTFSSGAQMLLVLSVFILAGSLRSGDAASFKFDIHHRFSDSIKGIFHSEGLPEKHTPG 60
Query: 59 YYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSF 118
YY+ + HRDR +RGR LAA D T LTF+ GNDT + LGFL+Y NVSVG P+L F
Sbjct: 61 YYATMVHRDRL--VRGRRLAASDVD-TQLTFAYGNDTAFIPDLGFLYYANVSVGTPSLDF 117
Query: 119 IVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQ 178
+VALDTGSDLFWLPC+C SC LN+S+G N YSPN S+TSS VPC S+LC +
Sbjct: 118 LVALDTGSDLFWLPCECSSCFTYLNTSNGGKFMLNHYSPNDSTTSSTVPCTSSLC---NR 174
Query: 179 CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLD 238
C S + CPY++RYLS T S G+LVEDVLHLATD+ K V+++I+FGCG VQTG F
Sbjct: 175 CTSNQNVCPYEMRYLSANTSSIGYLVEDVLHLATDDSLLKPVEAKITFGCGTVQTGIFAT 234
Query: 239 GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSL 298
AAPNGL GLGM+K SVPS LA+QGL NSFSMCFG+DG GRI FGD G Q +TPF+
Sbjct: 235 TAAPNGLIGLGMEKISVPSFLADQGLTSNSFSMCFGADGYGRIDFGDTGPADQKQTPFNT 294
Query: 299 RQTHPTYNITITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRET 358
+ +YN+T ++VGG + F+AIFDSGTSFTYL +PAY+ I++ ++ K KR +
Sbjct: 295 MLEYQSYNVTFNVINVGGEPNDVPFTAIFDSGTSFTYLTEPAYSTITKQMDAGMKLKRYS 354
Query: 359 S-TSDLPFEYCYVL 371
+ PFEYCY +
Sbjct: 355 LFGPNFPFEYCYEI 368
>gi|147839328|emb|CAN63378.1| hypothetical protein VITISV_015700 [Vitis vinifera]
Length = 585
Score = 365 bits (937), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 181/322 (56%), Positives = 227/322 (70%), Gaps = 12/322 (3%)
Query: 30 FGFDFHHRYSDPVKGI-------LAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGN 82
F F HHR+S+PVK + P KGSF YY+ LAHRDR LRGR L+ +
Sbjct: 26 FSFQMHHRFSEPVKKWSEGAGNGFPAGNWPAKGSFEYYAELAHRDR--ALRGRRLS---D 80
Query: 83 DKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGL 142
LTFS GN T+R++SLGFLHYT VS+G P F+VALDTGSDLFW+PCDC C
Sbjct: 81 IDGLLTFSDGNSTFRISSLGFLHYTTVSLGTPGKKFLVALDTGSDLFWVPCDCSRCAPTE 140
Query: 143 NSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGF 202
++ + +IY+P SSTS KV CN++LC + +C SNCPY V Y+S T ++G
Sbjct: 141 GTTYASDFELSIYNPKGSSTSRKVTCNNSLCAHRNRCLGTFSNCPYMVSYVSAETSTSGI 200
Query: 203 LVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQ 262
LVEDVLHL T++ + + V++ ++FGCG+VQTGSFLD AAPNGLFGLG++K SVPSIL+ +
Sbjct: 201 LVEDVLHLTTEDNRQEFVEAYVTFGCGQVQTGSFLDIAAPNGLFGLGLEKISVPSILSKE 260
Query: 263 GLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE 322
G +SFSMCFG DG GRISFGDKG P Q ETPF+L HPTYNIT+TQV VG ++ +
Sbjct: 261 GFTADSFSMCFGPDGIGRISFGDKGGPDQEETPFNLNALHPTYNITVTQVRVGTTLIDLD 320
Query: 323 FSAIFDSGTSFTYLNDPAYTQI 344
F+A+FDSGTSFTYL DP YT +
Sbjct: 321 FTALFDSGTSFTYLVDPIYTNV 342
Score = 110 bits (276), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 55/97 (56%), Positives = 70/97 (72%), Gaps = 3/97 (3%)
Query: 7 NSPVCVLLILLS-CCAGCCFGFGTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAH 65
NS ++++L+S + C+G GTFGFD HHR+SDPVKGIL VDDLP+K S YY A+AH
Sbjct: 491 NSXWVLVVVLISGWVSQICYGLGTFGFDMHHRFSDPVKGILDVDDLPEKLSLQYYKAMAH 550
Query: 66 RDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLG 102
RD + + GR L+ K PLTFS GN+TYRL+SLG
Sbjct: 551 RD--WVIHGRRLSTSDEVKPPLTFSDGNETYRLSSLG 585
>gi|449434470|ref|XP_004135019.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
gi|449517144|ref|XP_004165606.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 508
Score = 364 bits (935), Expect = 3e-98, Method: Compositional matrix adjust.
Identities = 185/344 (53%), Positives = 228/344 (66%), Gaps = 5/344 (1%)
Query: 28 GTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPL 87
+F F HHR+SD +K I + LP+K + YY+A+ HRDR L GR LA D TPL
Sbjct: 30 ASFKFTIHHRFSDSIKEIFGSEGLPEKHTPGYYAAMVHRDRL--LHGRNLATTNGD-TPL 86
Query: 88 TFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSG 147
FS GN+TY L+ LG L+Y NVS+G P L F+VALDTGSDLFWLPC+C C L
Sbjct: 87 MFSYGNETYELSGLGNLYYANVSIGTPGLYFLVALDTGSDLFWLPCECTKCPTYLTKRDN 146
Query: 148 QVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDV 207
N YS N SSTS +VPC+S+LCEL QC S S+CPYQ YLS+ + S G+LV+D+
Sbjct: 147 GKFWLNHYSSNASSTSIRVPCSSSLCELANQCSSNKSSCPYQTHYLSENSSSAGYLVQDI 206
Query: 208 LHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPN 267
LH+ATD+ Q K VD +++ GCG+VQTG F + APNGL GLGM K SVPS LA+QGL +
Sbjct: 207 LHMATDDSQLKPVDVKVTLGCGKVQTGKFSNVTAPNGLIGLGMGKVSVPSFLASQGLTTD 266
Query: 268 SFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIF 327
SFSMCFG G GRI FGD G GQ ETPF+ +YN+TI Q+ V N +AI
Sbjct: 267 SFSMCFGYYGYGRIDFGDIGPVGQRETPFN--PASLSYNVTILQIIVTNRPTNVHLTAII 324
Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVL 371
DSG SFTYL DP Y+ I+E ++ + +R S SD PFEYCY L
Sbjct: 325 DSGASFTYLTDPFYSIITENMDAAMELERIKSDSDFPFEYCYRL 368
>gi|326505434|dbj|BAJ95388.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 529
Score = 362 bits (930), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 183/346 (52%), Positives = 232/346 (67%), Gaps = 8/346 (2%)
Query: 32 FDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSA 91
D HHRYS V+G + P G+ YY+ALA D LR R L+
Sbjct: 34 LDVHHRYSATVRGWAGLRRGPSPGTAEYYAALAGHDD---LRRRSLSLAAAPAPGAGGPF 90
Query: 92 ----GNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSG 147
GNDTYRLN GFLHY V++G P ++F+VALDTGSDLFW+PCDC+ C L+S
Sbjct: 91 AFVDGNDTYRLNQFGFLHYAVVALGTPNVTFLVALDTGSDLFWVPCDCLKCAP-LSSPDY 149
Query: 148 QVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDV 207
+ F++YSP SSTS KVPC+S +C+LQ +C +A ++CPY++ YLSD T S G LVEDV
Sbjct: 150 GNLKFDVYSPRKSSTSRKVPCSSNMCDLQTECSAASNSCPYKIEYLSDNTSSKGVLVEDV 209
Query: 208 LHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPN 267
++LAT+ SK + I+FGCG+VQTGSFL AAPNGL GLGMD SVPS+LA+QG+ N
Sbjct: 210 MYLATESGHSKITQAPITFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASQGVAAN 269
Query: 268 SFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIF 327
SFSMCFG DG GRI+FGD GS Q ETP ++ + +P YNI+I GG + +FSA+
Sbjct: 270 SFSMCFGEDGHGRINFGDTGSADQLETPLNIYKHNPYYNISIVGAMAGGKTFSTKFSAVV 329
Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRS 373
DSGTSFT L+DP YT+I+ F+ KEKR + S LPFEYCY + S
Sbjct: 330 DSGTSFTALSDPMYTEITSAFDKQVKEKRNPADSSLPFEYCYTISS 375
>gi|357159746|ref|XP_003578546.1| PREDICTED: aspartic proteinase-like protein 1-like [Brachypodium
distachyon]
Length = 530
Score = 362 bits (930), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 183/343 (53%), Positives = 238/343 (69%), Gaps = 4/343 (1%)
Query: 32 FDFHHRYSDPVKGILA-VDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFS 90
D HHRYS V+G+ + P G+ YY+ALA D R R AA G L F+
Sbjct: 27 LDVHHRYSAAVRGLAGHLRAPPPAGTAEYYAALAGHD--LRRRSLAAAAGGGGAGNLAFA 84
Query: 91 AGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVI 150
GNDTYRLN GFLHY V++G P ++F+VALDTGSDLFW+PCDC+ C + G +
Sbjct: 85 DGNDTYRLNDFGFLHYAVVALGTPNVTFLVALDTGSDLFWVPCDCIKCAPLASPDYGD-L 143
Query: 151 DFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL 210
F++YSP SSTS KVPC+S+LC+ Q C +A ++CPY ++YLS+ T S G LVEDVL+L
Sbjct: 144 KFDMYSPRKSSTSRKVPCSSSLCDPQADCSAASNSCPYSIQYLSENTSSKGVLVEDVLYL 203
Query: 211 ATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFS 270
T+ QSK + I+FGCG+VQ+GSFL AAPNGL GLGMD SVPS+LA++G+ NSFS
Sbjct: 204 TTESGQSKITQAPITFGCGQVQSGSFLGSAAPNGLLGLGMDSKSVPSLLASKGIAANSFS 263
Query: 271 MCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIFDSG 330
MCFG DG GRI+FGD GS Q ETP ++ + +P YNI+IT VGG + + +FSA+ DSG
Sbjct: 264 MCFGEDGHGRINFGDTGSSDQLETPLNIYKQNPYYNISITGAMVGGKSFDTKFSAVVDSG 323
Query: 331 TSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRS 373
TSFT L+DP YT+I+ TFN+ KE R+ + +PFEYCY + +
Sbjct: 324 TSFTALSDPMYTEITSTFNAQVKESRKHLDASMPFEYCYSISA 366
>gi|326500240|dbj|BAK06209.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 505
Score = 360 bits (923), Expect = 9e-97, Method: Compositional matrix adjust.
Identities = 191/352 (54%), Positives = 239/352 (67%), Gaps = 15/352 (4%)
Query: 28 GTFGFDFHHRYSDPVK------GILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQG 81
G +FHHR+S V+ G P G FAY +ALA DR+ R L+A G
Sbjct: 21 GAPSLEFHHRFSARVRRWADSRGHELPGGWPSPGGFAYVAALAGHDRH-----RALSAAG 75
Query: 82 NDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHG 141
+ PLTFS GN T ++++LGFLHY V+VG P +F+VALDTGSDLFWLPC C C
Sbjct: 76 G-RPPLTFSEGNATLKVSNLGFLHYALVTVGTPGHTFMVALDTGSDLFWLPCQCDGCTPP 134
Query: 142 LNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTG 201
+S++ F Y P+ SSTS VPCNS C L+K+C S S+CPY++ Y+S T S+G
Sbjct: 135 PSSAASAPASF--YIPSLSSTSQAVPCNSDFCGLRKEC-SKTSSCPYKMVYVSADTSSSG 191
Query: 202 FLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILAN 261
FLVEDVL+L+T++ + + ++I FGCG VQTGSFLD AAPNGLFGLG+D SVPSILA
Sbjct: 192 FLVEDVLYLSTEDTHPQFLKAQIMFGCGEVQTGSFLDAAAPNGLFGLGVDMISVPSILAQ 251
Query: 262 QGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF 321
+GL NSFSMCFG DG GRISFGD+GS Q ETP + Q HPTY ITIT ++VG N ++
Sbjct: 252 KGLTSNSFSMCFGRDGIGRISFGDQGSSDQEETPLDINQKHPTYAITITGIAVGNNLMDL 311
Query: 322 EFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRS 373
E S IFD+GTSFTYL DPAYT I++ F+S + R + S +PFEYCY L S
Sbjct: 312 EVSTIFDTGTSFTYLADPAYTYITDGFHSQVQANRHAADSRIPFEYCYDLSS 363
>gi|326499199|dbj|BAK06090.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 505
Score = 360 bits (923), Expect = 9e-97, Method: Compositional matrix adjust.
Identities = 191/352 (54%), Positives = 239/352 (67%), Gaps = 15/352 (4%)
Query: 28 GTFGFDFHHRYSDPVK------GILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQG 81
G +FHHR+S V+ G P G FAY +ALA DR+ R L+A G
Sbjct: 21 GAPSLEFHHRFSARVRRWADSRGHELPGGWPSPGGFAYVAALAGHDRH-----RALSAAG 75
Query: 82 NDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHG 141
+ PLTFS GN T ++++LGFLHY V+VG P +F+VALDTGSDLFWLPC C C
Sbjct: 76 G-RPPLTFSEGNATLKVSNLGFLHYALVTVGTPGHTFMVALDTGSDLFWLPCQCDGCTPP 134
Query: 142 LNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTG 201
+S++ F Y P+ SSTS VPCNS C L+K+C S S+CPY++ Y+S T S+G
Sbjct: 135 PSSAASAPASF--YIPSLSSTSQAVPCNSDFCGLRKEC-SKTSSCPYKMVYVSADTSSSG 191
Query: 202 FLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILAN 261
FLVEDVL+L+T++ + + ++I FGCG VQTGSFLD AAPNGLFGLG+D SVPSILA
Sbjct: 192 FLVEDVLYLSTEDTHPQFLKAQIMFGCGEVQTGSFLDAAAPNGLFGLGVDMISVPSILAQ 251
Query: 262 QGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF 321
+GL NSFSMCFG DG GRISFGD+GS Q ETP + Q HPTY ITIT ++VG N ++
Sbjct: 252 KGLTSNSFSMCFGRDGIGRISFGDQGSSDQEETPLDINQKHPTYAITITGIAVGNNLMDL 311
Query: 322 EFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRS 373
E S IFD+GTSFTYL DPAYT I++ F+S + R + S +PFEYCY L S
Sbjct: 312 EVSTIFDTGTSFTYLADPAYTYITDGFHSQVQANRHAADSRIPFEYCYDLSS 363
>gi|357517921|ref|XP_003629249.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355523271|gb|AET03725.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 553
Score = 358 bits (918), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 189/381 (49%), Positives = 240/381 (62%), Gaps = 40/381 (10%)
Query: 24 CFGFGTFGFDFHHRYSDPVKGIL-----AVDDLPKKGSFAYYSALAHRDRYFRLRGRGLA 78
C F F HHRYS+PVK P+KGS YY+ LA RDR+ LRGR L+
Sbjct: 20 CCNAHIFTFTMHHRYSEPVKKWSHSAPSPSHRWPEKGSVEYYAELADRDRF--LRGRRLS 77
Query: 79 AQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSC 138
L FS GN T+R++SLGFLHYT + +G P + F+VALDTGSDLFW+PCDC C
Sbjct: 78 QF---DAGLAFSDGNSTFRISSLGFLHYTTIELGTPGVKFMVALDTGSDLFWVPCDCTRC 134
Query: 139 ----VHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLS 194
S+ D ++Y+PN SSTS KV CN++LC + QC SNCPY V Y+S
Sbjct: 135 SATRSSAFASALASDFDLSVYNPNGSSTSKKVTCNNSLCTHRNQCLGTFSNCPYMVSYVS 194
Query: 195 DGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTS 254
T ++G LVEDVLHL + V++ + FGCG+VQ+GSFLD AAPNGLFGLGM+K S
Sbjct: 195 AETSTSGILVEDVLHLTQPDDNHDLVEANVIFGCGQVQSGSFLDVAAPNGLFGLGMEKIS 254
Query: 255 VPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSV 314
VPS+L+ +G +SFSMCFG DG GRISFGDKGS Q ETPF++ +HPTYNITI QV V
Sbjct: 255 VPSMLSREGFTADSFSMCFGRDGIGRISFGDKGSLDQDETPFNVNPSHPTYNITINQVRV 314
Query: 315 GGNAVNFEFSAIFDSGTSFTYLNDPAYTQISET--------------------------F 348
G ++ EF+A+FDSGTSFTYL DP Y+++SE+ F
Sbjct: 315 GTTLIDVEFTALFDSGTSFTYLVDPTYSRLSESVSDKICFHLARCYLKIKVTIEVFMLQF 374
Query: 349 NSLAKEKRETSTSDLPFEYCY 369
+S +++R S +PF+YCY
Sbjct: 375 HSQVEDRRRPPDSRIPFDYCY 395
>gi|449434466|ref|XP_004135017.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 525
Score = 357 bits (917), Expect = 4e-96, Method: Compositional matrix adjust.
Identities = 179/354 (50%), Positives = 233/354 (65%), Gaps = 19/354 (5%)
Query: 30 FGFDFHHRYSDPVKGILAV-------DDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGN 82
F F HHR+SD +K V D P KG+ YY+ LA RDR+FR G+ L+
Sbjct: 28 FSFKMHHRFSDQLKNWSGVSGKFTLPDSWPVKGTIEYYAQLAFRDRFFR--GQRLSEFDG 85
Query: 83 DKTPLTFSAGNDTYRLNSLGFLH-------YTNVSVGQPALSFIVALDTGSDLFWLPCDC 135
PL FS GN ++R++SLGF YT V +G P F+VALDTGSDLFW+PCDC
Sbjct: 86 ---PLAFSDGNSSFRISSLGFALFDVFFFFYTTVQLGTPGTKFMVALDTGSDLFWVPCDC 142
Query: 136 VSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSD 195
C S + ++YSP SSTS VPCN+ LC + QC A NCPY V Y+S
Sbjct: 143 SRCAPTEGSPYASDFELSVYSPKKSSTSKTVPCNNNLCAQRDQCTEAFGNCPYVVSYVSA 202
Query: 196 GTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSV 255
T +TG L+ED+LHL T+ K S+ + + I+FGCG+VQ+GSFLD AAPNGLFGLGM++ SV
Sbjct: 203 ETSTTGILIEDLLHLKTEHKHSEPIQAYITFGCGQVQSGSFLDVAAPNGLFGLGMEQISV 262
Query: 256 PSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVG 315
PSIL+ +GL+ NSFSMCF DG GRI+FGDKGS Q ETPF+L Q HP YNIT+T + VG
Sbjct: 263 PSILSREGLMANSFSMCFSDDGVGRINFGDKGSLEQEETPFNLNQLHPNYNITVTSIRVG 322
Query: 316 GNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
++ + +A+FDSGTSF+Y DP Y+++S +F++ ++ R +PFEYCY
Sbjct: 323 TTLIDADITALFDSGTSFSYFTDPIYSKLSASFHAQTRDGRHPPNPRIPFEYCY 376
>gi|218202547|gb|EEC84974.1| hypothetical protein OsI_32231 [Oryza sativa Indica Group]
Length = 513
Score = 357 bits (916), Expect = 5e-96, Method: Compositional matrix adjust.
Identities = 185/339 (54%), Positives = 232/339 (68%), Gaps = 6/339 (1%)
Query: 32 FDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDK-TPLTFS 90
D HHRYS A P G+ YY+ALA D LR R L G F+
Sbjct: 29 LDVHHRYSA-AVRRWAAAAAPPHGTAEYYAALAGHDG---LRRRSLGVGGGGGGAEFAFA 84
Query: 91 AGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVI 150
GNDTYRLN GFLHY V++G P ++F+VALDTGSDLFW+PCDC+ C L S + +
Sbjct: 85 DGNDTYRLNDFGFLHYAVVALGTPNVTFLVALDTGSDLFWVPCDCLKCAP-LQSPNYGSL 143
Query: 151 DFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL 210
F++YSP S+TS KVPC+S LC+LQ C S ++CPY ++YLSD T S+G LVEDVL+L
Sbjct: 144 KFDVYSPAQSTTSRKVPCSSNLCDLQNACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYL 203
Query: 211 ATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFS 270
+D QSK V + I FGCG+VQTGSFL AAPNGL GLGMD SVPS+LA++GL NSFS
Sbjct: 204 TSDSAQSKIVTAPIMFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFS 263
Query: 271 MCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIFDSG 330
MCFG DG GRI+FGD GS Q ETP ++ + +P YNITIT ++VG +++ EFSAI DSG
Sbjct: 264 MCFGDDGHGRINFGDTGSSDQKETPLNVYKQNPYYNITITGITVGSKSISTEFSAIVDSG 323
Query: 331 TSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
TSFT L+DP YTQI+ +F++ + R S +PFE+CY
Sbjct: 324 TSFTALSDPMYTQITSSFDAQIRSSRNMLDSSMPFEFCY 362
>gi|194700652|gb|ACF84410.1| unknown [Zea mays]
gi|414587775|tpg|DAA38346.1| TPA: hypothetical protein ZEAMMB73_272638 [Zea mays]
Length = 500
Score = 357 bits (916), Expect = 5e-96, Method: Compositional matrix adjust.
Identities = 182/351 (51%), Positives = 235/351 (66%), Gaps = 18/351 (5%)
Query: 32 FDFHHRYSDPVK------GILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQG---N 82
+FHHR+S P++ G P GS AY +ALA DR+ R ++A G +
Sbjct: 32 LEFHHRFSAPLRRWVEARGRALPGGWPAPGSAAYVAALAGHDRH-----RAVSAAGGSSS 86
Query: 83 DKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGL 142
D PLTF+ GN T ++++LGFLHY V+VG P +F+VALDTGSDLFWLPC C C
Sbjct: 87 DAPPLTFAEGNATLKVSNLGFLHYALVTVGTPGQTFMVALDTGSDLFWLPCQCDGCTPPA 146
Query: 143 NSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGF 202
++SG Y P SSTS VPCNS C+LQK+C S CPY++ Y+S GT S+GF
Sbjct: 147 TAASGSA---TFYIPGMSSTSKAVPCNSNFCDLQKEC-STALQCPYKMVYVSAGTSSSGF 202
Query: 203 LVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQ 262
LVEDVL+L+T+ + + ++I GCG+ QTGSFLD AAPNGLFGLG+D+ SVPSILA +
Sbjct: 203 LVEDVLYLSTENAHPQILKAQIMLGCGQTQTGSFLDAAAPNGLFGLGIDEVSVPSILAQK 262
Query: 263 GLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE 322
GL NSFSMCFG DG GRISFGD+ S Q ETP + + HPTY ITI+ ++VG + +
Sbjct: 263 GLTSNSFSMCFGRDGIGRISFGDQESSDQEETPLDINRQHPTYAITISGITVGNKPTDMD 322
Query: 323 FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRS 373
F IFD+GTSFTYL DPAYT I+++F++ + R + S +PFEYCY L S
Sbjct: 323 FITIFDTGTSFTYLADPAYTYITQSFHAQVQANRHAADSRIPFEYCYDLSS 373
>gi|414587774|tpg|DAA38345.1| TPA: hypothetical protein ZEAMMB73_272638 [Zea mays]
Length = 520
Score = 356 bits (914), Expect = 9e-96, Method: Compositional matrix adjust.
Identities = 182/351 (51%), Positives = 235/351 (66%), Gaps = 16/351 (4%)
Query: 32 FDFHHRYSDPVK------GILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQG---N 82
+FHHR+S P++ G P GS AY +ALA DR+ R ++A G +
Sbjct: 32 LEFHHRFSAPLRRWVEARGRALPGGWPAPGSAAYVAALAGHDRH-----RAVSAAGGSSS 86
Query: 83 DKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGL 142
D PLTF+ GN T ++++LGFLHY V+VG P +F+VALDTGSDLFWLPC C C
Sbjct: 87 DAPPLTFAEGNATLKVSNLGFLHYALVTVGTPGQTFMVALDTGSDLFWLPCQCDGCTPPA 146
Query: 143 NSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGF 202
++SG Y P SSTS VPCNS C+LQK+C S CPY++ Y+S GT S+GF
Sbjct: 147 TAASGS-FQATFYIPGMSSTSKAVPCNSNFCDLQKEC-STALQCPYKMVYVSAGTSSSGF 204
Query: 203 LVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQ 262
LVEDVL+L+T+ + + ++I GCG+ QTGSFLD AAPNGLFGLG+D+ SVPSILA +
Sbjct: 205 LVEDVLYLSTENAHPQILKAQIMLGCGQTQTGSFLDAAAPNGLFGLGIDEVSVPSILAQK 264
Query: 263 GLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE 322
GL NSFSMCFG DG GRISFGD+ S Q ETP + + HPTY ITI+ ++VG + +
Sbjct: 265 GLTSNSFSMCFGRDGIGRISFGDQESSDQEETPLDINRQHPTYAITISGITVGNKPTDMD 324
Query: 323 FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRS 373
F IFD+GTSFTYL DPAYT I+++F++ + R + S +PFEYCY L S
Sbjct: 325 FITIFDTGTSFTYLADPAYTYITQSFHAQVQANRHAADSRIPFEYCYDLSS 375
>gi|195647908|gb|ACG43422.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
gi|414587776|tpg|DAA38347.1| TPA: aspartic-type endopeptidase/ pepsin A [Zea mays]
Length = 498
Score = 356 bits (913), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 181/349 (51%), Positives = 234/349 (67%), Gaps = 18/349 (5%)
Query: 32 FDFHHRYSDPVK------GILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQG---N 82
+FHHR+S P++ G P GS AY +ALA DR+ R ++A G +
Sbjct: 32 LEFHHRFSAPLRRWVEARGRALPGGWPAPGSAAYVAALAGHDRH-----RAVSAAGGSSS 86
Query: 83 DKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGL 142
D PLTF+ GN T ++++LGFLHY V+VG P +F+VALDTGSDLFWLPC C C
Sbjct: 87 DAPPLTFAEGNATLKVSNLGFLHYALVTVGTPGQTFMVALDTGSDLFWLPCQCDGCTPPA 146
Query: 143 NSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGF 202
++SG Y P SSTS VPCNS C+LQK+C S CPY++ Y+S GT S+GF
Sbjct: 147 TAASGSA---TFYIPGMSSTSKAVPCNSNFCDLQKEC-STALQCPYKMVYVSAGTSSSGF 202
Query: 203 LVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQ 262
LVEDVL+L+T+ + + ++I GCG+ QTGSFLD AAPNGLFGLG+D+ SVPSILA +
Sbjct: 203 LVEDVLYLSTENAHPQILKAQIMLGCGQTQTGSFLDAAAPNGLFGLGIDEVSVPSILAQK 262
Query: 263 GLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE 322
GL NSFSMCFG DG GRISFGD+ S Q ETP + + HPTY ITI+ ++VG + +
Sbjct: 263 GLTSNSFSMCFGRDGIGRISFGDQESSDQEETPLDINRQHPTYAITISGITVGNKPTDMD 322
Query: 323 FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVL 371
F IFD+GTSFTYL DPAYT I+++F++ + R + S +PFEYCY L
Sbjct: 323 FITIFDTGTSFTYLADPAYTYITQSFHAQVQANRHAADSRIPFEYCYDL 371
>gi|32526671|dbj|BAC79194.1| chloroplast nucleoid DNA-binding protein -like protein [Oryza
sativa Japonica Group]
Length = 732
Score = 356 bits (913), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 184/339 (54%), Positives = 232/339 (68%), Gaps = 6/339 (1%)
Query: 32 FDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDK-TPLTFS 90
D HHRYS A P G+ YY+ALA D LR R L G F+
Sbjct: 29 LDVHHRYSA-AVRRWAAAAAPPHGTAEYYAALAGHDG---LRRRSLGVGGGGGGAEFAFA 84
Query: 91 AGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVI 150
GNDTYRLN GFLHY V++G P ++F+VALDTGSDLFW+PCDC+ C + + G +
Sbjct: 85 DGNDTYRLNDFGFLHYAVVALGTPNVTFLVALDTGSDLFWVPCDCLKCAPFQSPNYGS-L 143
Query: 151 DFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL 210
F++YSP S+TS KVPC+S LC+LQ C S ++CPY ++YLSD T S+G LVEDVL+L
Sbjct: 144 KFDVYSPAQSTTSRKVPCSSNLCDLQNACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYL 203
Query: 211 ATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFS 270
+D QSK V + I FGCG+VQTGSFL AAPNGL GLGMD SVPS+LA++GL NSFS
Sbjct: 204 TSDSAQSKIVTAPIMFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFS 263
Query: 271 MCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIFDSG 330
MCFG DG GRI+FGD GS Q ETP ++ + +P YNITIT ++VG +++ EFSAI DSG
Sbjct: 264 MCFGDDGHGRINFGDTGSSDQKETPLNVYKQNPYYNITITGITVGSKSISTEFSAIVDSG 323
Query: 331 TSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
TSFT L+DP YTQI+ +F++ + R S +PFE+CY
Sbjct: 324 TSFTALSDPMYTQITSSFDAQIRSSRNMLDSSMPFEFCY 362
>gi|242072510|ref|XP_002446191.1| hypothetical protein SORBIDRAFT_06g003200 [Sorghum bicolor]
gi|241937374|gb|EES10519.1| hypothetical protein SORBIDRAFT_06g003200 [Sorghum bicolor]
Length = 499
Score = 355 bits (910), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 183/349 (52%), Positives = 234/349 (67%), Gaps = 14/349 (4%)
Query: 32 FDFHHRYSDPVK------GILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKT 85
+FHHR+S P++ G P GS AY +ALA DR+ R A G T
Sbjct: 31 LEFHHRFSAPLRRWAEARGRALPGGWPAPGSAAYVAALAGHDRH---RAVSAAGGGGSGT 87
Query: 86 P-LTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNS 144
P LTF+ GN T ++++LGFLHY V+VG P +F+VALDTGSDLFWLPC C C +
Sbjct: 88 PPLTFAEGNATLKVSNLGFLHYALVTVGTPGQTFMVALDTGSDLFWLPCQCDGCTPPATA 147
Query: 145 SSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLV 204
+SG Y P SSTS VPCNS C+LQK+C S CPY++ Y+S GT S+GFLV
Sbjct: 148 ASGSA---TFYIPGMSSTSKAVPCNSNFCDLQKEC-STALQCPYKMVYVSAGTSSSGFLV 203
Query: 205 EDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGL 264
EDVL+L+T+ + + ++I GCG+ QTGSFLD AAPNGLFGLG+D+ SVPSILA +GL
Sbjct: 204 EDVLYLSTENAHPQILKAQIMLGCGQTQTGSFLDAAAPNGLFGLGIDEVSVPSILAQKGL 263
Query: 265 IPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS 324
NSFSMCFG DG GRISFGD+GS Q ETP ++ Q HPTY ITI+ +++G + +F
Sbjct: 264 TSNSFSMCFGRDGIGRISFGDQGSSDQEETPLNINQQHPTYAITISGITIGNKPTDLDFI 323
Query: 325 AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRS 373
IFD+GTSFTYL DPAYT I+++F++ + R + S +PFEYCY L S
Sbjct: 324 TIFDTGTSFTYLADPAYTYITQSFHAQVQANRHAADSRIPFEYCYDLSS 372
>gi|242050026|ref|XP_002462757.1| hypothetical protein SORBIDRAFT_02g031460 [Sorghum bicolor]
gi|241926134|gb|EER99278.1| hypothetical protein SORBIDRAFT_02g031460 [Sorghum bicolor]
Length = 523
Score = 350 bits (898), Expect = 6e-94, Method: Compositional matrix adjust.
Identities = 187/342 (54%), Positives = 234/342 (68%), Gaps = 1/342 (0%)
Query: 30 FGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTF 89
D HHRYS V+ P G+ YY+ALA D R G AA G + F
Sbjct: 29 LSLDVHHRYSATVREWAGHHRAPPAGTAEYYAALARHDLRRRSLAAGPAAGGGGGGEVAF 88
Query: 90 SAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQV 149
+ GNDTYRLN LGFLHY V++G P ++F+VALDTGSDLFW+PCDC++C L S + +
Sbjct: 89 ADGNDTYRLNELGFLHYAVVALGTPNVTFLVALDTGSDLFWVPCDCINCAP-LVSPNYRD 147
Query: 150 IDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLH 209
+ F+ YSP SSTS KVPC+S LC+LQ C SA S+CPY + YLSD T STG LVEDVL+
Sbjct: 148 LKFDTYSPQKSSTSRKVPCSSNLCDLQSACRSASSSCPYSIEYLSDNTSSTGVLVEDVLY 207
Query: 210 LATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSF 269
L T+ Q K V + I+FGCGR+QTGSFL AAPNGL GLGMD SVPS+LA++G+ NSF
Sbjct: 208 LITEYGQPKIVTAPITFGCGRIQTGSFLGSAAPNGLLGLGMDSISVPSLLASEGVAANSF 267
Query: 270 SMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIFDS 329
SMCFG DG GRI+FGD GS Q ETP ++ + +P YNI+IT VG + N F+AI DS
Sbjct: 268 SMCFGDDGRGRINFGDTGSSDQQETPLNIYKQNPYYNISITGAMVGSKSFNTNFNAIVDS 327
Query: 330 GTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVL 371
GTSFT L+DP Y++I+ +FNS ++K S LPFE+CY +
Sbjct: 328 GTSFTALSDPMYSEITSSFNSQVQDKPTQLDSSLPFEFCYSI 369
>gi|357168101|ref|XP_003581483.1| PREDICTED: aspartic proteinase-like protein 1-like [Brachypodium
distachyon]
Length = 510
Score = 350 bits (897), Expect = 8e-94, Method: Compositional matrix adjust.
Identities = 184/348 (52%), Positives = 238/348 (68%), Gaps = 17/348 (4%)
Query: 32 FDFHHRYSDPVKGILAVD------DLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKT 85
+FHHR+S ++G P G AY +ALA DR+ R LAA D
Sbjct: 30 LEFHHRFSARLRGWADARGHELPGGWPPPGGAAYVAALAGHDRH-----RALAAA--DHP 82
Query: 86 PLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSS 145
PLTFS GN T ++++LGFLHY V+VG P +F+VALDTGSDLFWLPC C C + +
Sbjct: 83 PLTFSEGNATLKVSNLGFLHYALVTVGTPGHTFMVALDTGSDLFWLPCQCDGCPPPASGA 142
Query: 146 SGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVE 205
SG + Y P+ SSTS VPCNS C+ +K C S S+CPY++ Y+S T S+GFLVE
Sbjct: 143 SGSA---SFYIPSMSSTSQAVPCNSDFCDHRKDC-STTSSCPYKMVYVSADTSSSGFLVE 198
Query: 206 DVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLI 265
DVL+L+T++ + + ++I FGCG+VQTGSFLD AAPNGLFGLG+D SVPSILA++GL
Sbjct: 199 DVLYLSTEDNHPQILKAQIMFGCGQVQTGSFLDAAAPNGLFGLGIDMISVPSILAHKGLT 258
Query: 266 PNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA 325
+SFSMCFG DG GRISFGD+GS Q ETP + Q HPTY ITIT ++VG ++ EFS
Sbjct: 259 SDSFSMCFGRDGIGRISFGDQGSSDQEETPLDINQKHPTYAITITGITVGTEPMDLEFST 318
Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRS 373
IFD+GT+FTYL DPAYT I+++F++ + R + + +PFEYCY L S
Sbjct: 319 IFDTGTTFTYLADPAYTYITQSFHTQVRANRHAADTRIPFEYCYDLSS 366
>gi|116308959|emb|CAH66084.1| H0209A05.1 [Oryza sativa Indica Group]
Length = 530
Score = 345 bits (884), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 181/350 (51%), Positives = 236/350 (67%), Gaps = 12/350 (3%)
Query: 32 FDFHHRYSDPV------KGILAVDDLPKKGSFAYYSALAHRDRYFRLRG--RGLAAQGND 83
+FHHR+S PV +G + P+ GS Y +AL DR L G+
Sbjct: 35 LEFHHRFSSPVQRWAEARGHVLPGGWPEHGSADYVAALNGHDRRRALSAAGGDGGGGGDK 94
Query: 84 KTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLN 143
PLTFS GN T ++++LGFLHY V+VG P +F+VALDTGSDLFWLPC C C +
Sbjct: 95 PPPLTFSEGNATLKVSNLGFLHYALVTVGTPGQTFMVALDTGSDLFWLPCQCDGCTPPAS 154
Query: 144 SSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFL 203
++SG + Y P+ SSTS VPCNS CEL+K+C S S CPY++ Y+S T S+GFL
Sbjct: 155 AASGSA---SFYIPSMSSTSQAVPCNSQFCELRKEC-STTSQCPYKMVYVSADTSSSGFL 210
Query: 204 VEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQG 263
VEDVL+L+T++ + + ++I FGCG+VQTGSFLD AAPNGLFGLG+D S+PSILA +G
Sbjct: 211 VEDVLYLSTEDAIPQILKAQILFGCGQVQTGSFLDAAAPNGLFGLGIDMISIPSILAQKG 270
Query: 264 LIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEF 323
L NSF+MCF DG GRISFGD+GS Q ETP + HPTY I+I++++VG + + EF
Sbjct: 271 LTSNSFAMCFSRDGIGRISFGDQGSSDQEETPLDVNPQHPTYTISISEITVGNSLTDLEF 330
Query: 324 SAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRS 373
S IFD+GTSFTYL DPAYT I+++F++ R + S +PFEYCY L S
Sbjct: 331 STIFDTGTSFTYLADPAYTYITQSFHAQVHANRHAADSRIPFEYCYDLSS 380
>gi|115457374|ref|NP_001052287.1| Os04g0228000 [Oryza sativa Japonica Group]
gi|38346027|emb|CAE01958.2| OSJNBb0071D01.4 [Oryza sativa Japonica Group]
gi|113563858|dbj|BAF14201.1| Os04g0228000 [Oryza sativa Japonica Group]
gi|215740420|dbj|BAG97076.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222626225|gb|EEE60357.1| hypothetical protein OsJ_13479 [Oryza sativa Japonica Group]
Length = 530
Score = 345 bits (884), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 181/350 (51%), Positives = 236/350 (67%), Gaps = 12/350 (3%)
Query: 32 FDFHHRYSDPV------KGILAVDDLPKKGSFAYYSALAHRDRYFRLRG--RGLAAQGND 83
+FHHR+S PV +G + P+ GS Y +AL DR L G+
Sbjct: 35 LEFHHRFSSPVQRWAEARGHVLPGGWPEHGSADYVAALNGHDRRRALSAAGGDGGGGGDK 94
Query: 84 KTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLN 143
PLTFS GN T ++++LGFLHY V+VG P +F+VALDTGSDLFWLPC C C +
Sbjct: 95 PPPLTFSEGNATLKVSNLGFLHYALVTVGTPGQTFMVALDTGSDLFWLPCQCDGCTPPAS 154
Query: 144 SSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFL 203
++SG + Y P+ SSTS VPCNS CEL+K+C S S CPY++ Y+S T S+GFL
Sbjct: 155 AASGSA---SFYIPSMSSTSQAVPCNSQFCELRKEC-STTSQCPYKMVYVSADTSSSGFL 210
Query: 204 VEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQG 263
VEDVL+L+T++ + + ++I FGCG+VQTGSFLD AAPNGLFGLG+D S+PSILA +G
Sbjct: 211 VEDVLYLSTEDAIPQILKAQILFGCGQVQTGSFLDAAAPNGLFGLGIDMISIPSILAQKG 270
Query: 264 LIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEF 323
L NSF+MCF DG GRISFGD+GS Q ETP + HPTY I+I++++VG + + EF
Sbjct: 271 LTSNSFAMCFSRDGIGRISFGDQGSSDQEETPLDVNPQHPTYTISISEITVGNSLTDLEF 330
Query: 324 SAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRS 373
S IFD+GTSFTYL DPAYT I+++F++ R + S +PFEYCY L S
Sbjct: 331 STIFDTGTSFTYLADPAYTYITQSFHAQVHANRHAADSRIPFEYCYDLSS 380
>gi|125546587|gb|EAY92726.1| hypothetical protein OsI_14476 [Oryza sativa Indica Group]
Length = 530
Score = 344 bits (882), Expect = 5e-92, Method: Compositional matrix adjust.
Identities = 181/350 (51%), Positives = 236/350 (67%), Gaps = 12/350 (3%)
Query: 32 FDFHHRYSDPV------KGILAVDDLPKKGSFAYYSALAHRDRYFRLRG--RGLAAQGND 83
+FHHR+S PV +G + P+ GS Y +AL DR L G+
Sbjct: 35 LEFHHRFSSPVQRWAEARGHVLPGGWPEHGSADYVAALNGHDRRRALSAAGGDGGGGGDK 94
Query: 84 KTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLN 143
PLTFS GN T ++++LGFLHY V+VG P +F+VALDTGSDLFWLPC C C +
Sbjct: 95 PPPLTFSEGNATLKVSNLGFLHYALVTVGTPGQTFMVALDTGSDLFWLPCQCDGCTPPAS 154
Query: 144 SSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFL 203
++SG + Y P+ SSTS VPCNS CEL+K+C S S CPY++ Y+S T S+GFL
Sbjct: 155 AASGSA---SFYIPSMSSTSQAVPCNSQFCELRKEC-STTSQCPYKMVYVSADTSSSGFL 210
Query: 204 VEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQG 263
VEDVL+L+T++ + + ++I FGCG+VQTGSFLD AAPNGLFGLG+D S+PSILA +G
Sbjct: 211 VEDVLYLSTEDAIPQILKAQILFGCGQVQTGSFLDAAAPNGLFGLGIDMISIPSILAQKG 270
Query: 264 LIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEF 323
L NSF+MCF DG GRISFGD+GS Q ETP + HPTY I+I++++VG + + EF
Sbjct: 271 LTSNSFAMCFSRDGIGRISFGDQGSSDQEETPLDVNPQHPTYTISISEMTVGNSLTDLEF 330
Query: 324 SAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRS 373
S IFD+GTSFTYL DPAYT I+++F++ R + S +PFEYCY L S
Sbjct: 331 STIFDTGTSFTYLADPAYTYITQSFHAQVHANRHAADSRIPFEYCYDLSS 380
>gi|326504502|dbj|BAJ91083.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 537
Score = 343 bits (880), Expect = 7e-92, Method: Compositional matrix adjust.
Identities = 186/356 (52%), Positives = 233/356 (65%), Gaps = 18/356 (5%)
Query: 30 FGFDFHHRYSDPVKGILAVDDLP-------KKGSFAYYSALAHRDRYFRLRGRGLAAQGN 82
GFD HHR S V+ P +G+ YY+AL DR R RGLA +G+
Sbjct: 29 IGFDLHHRSSPVVRRWAEARGHPGAAWWAEAEGTPEYYAALHRHDRAHLAR-RGLA-EGD 86
Query: 83 DKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGL 142
+ LTF++GN T+RL G LHY V+VG P +F+VALDTGSDLFW+PCDC C
Sbjct: 87 GEGLLTFASGNLTFRLE--GSLHYAEVAVGTPNATFLVALDTGSDLFWVPCDCKQCAPIA 144
Query: 143 NSSSGQV-IDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAG---SNCPYQVRYLSDGTM 198
N+S + D YSP SSTS V C LCE C +AG ++CPY VRY+S T
Sbjct: 145 NASDLRGGPDLRPYSPGKSSTSKAVTCEHALCERPNACAAAGNSSTSCPYTVRYVSANTS 204
Query: 199 STGFLVEDVLHLATDEK--QSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVP 256
S+G LVEDVLHL+ + S +V + + GCG+VQTG+FLDGAA +GL GLGMDK SVP
Sbjct: 205 SSGVLVEDVLHLSREAAGGASTAVTAPVVLGCGQVQTGAFLDGAAVDGLLGLGMDKVSVP 264
Query: 257 SILANQGLIP-NSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVG 315
S+L GL+ +SFSMCF DG GRI+FGD G GQ ETPF++R THPTYNI++T +SV
Sbjct: 265 SVLHAAGLVASDSFSMCFSPDGFGRINFGDSGRRGQAETPFTVRNTHPTYNISVTAMSVS 324
Query: 316 GNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVL 371
G V EF+AI DSGTSFTYLNDPAYT+++ FNS +E+R ++ +PFEYCY L
Sbjct: 325 GKEVAAEFAAIVDSGTSFTYLNDPAYTELATGFNSEVRERRANLSASIPFEYCYEL 380
>gi|222642011|gb|EEE70143.1| hypothetical protein OsJ_30189 [Oryza sativa Japonica Group]
Length = 671
Score = 341 bits (875), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 164/281 (58%), Positives = 210/281 (74%), Gaps = 1/281 (0%)
Query: 89 FSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQ 148
F+ GNDTYRLN GFLHY V++G P ++F+VALDTGSDLFW+PCDC+ C + + G
Sbjct: 19 FADGNDTYRLNDFGFLHYAVVALGTPNVTFLVALDTGSDLFWVPCDCLKCAPFQSPNYGS 78
Query: 149 VIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVL 208
+ F++YSP S+TS KVPC+S LC+LQ C S ++CPY ++YLSD T S+G LVEDVL
Sbjct: 79 -LKFDVYSPAQSTTSRKVPCSSNLCDLQNACRSKSNSCPYSIQYLSDNTSSSGVLVEDVL 137
Query: 209 HLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNS 268
+L +D QSK V + I FGCG+VQTGSFL AAPNGL GLGMD SVPS+LA++GL NS
Sbjct: 138 YLTSDSAQSKIVTAPIMFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANS 197
Query: 269 FSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIFD 328
FSMCFG DG GRI+FGD GS Q ETP ++ + +P YNITIT ++VG +++ EFSAI D
Sbjct: 198 FSMCFGDDGHGRINFGDTGSSDQKETPLNVYKQNPYYNITITGITVGSKSISTEFSAIVD 257
Query: 329 SGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
SGTSFT L+DP YTQI+ +F++ + R S +PFE+CY
Sbjct: 258 SGTSFTALSDPMYTQITSSFDAQIRSSRNMLDSSMPFEFCY 298
>gi|115480451|ref|NP_001063819.1| Os09g0542100 [Oryza sativa Japonica Group]
gi|113632052|dbj|BAF25733.1| Os09g0542100, partial [Oryza sativa Japonica Group]
Length = 490
Score = 327 bits (839), Expect = 5e-87, Method: Compositional matrix adjust.
Identities = 158/273 (57%), Positives = 203/273 (74%), Gaps = 1/273 (0%)
Query: 97 RLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYS 156
RLN GFLHY V++G P ++F+VALDTGSDLFW+PCDC+ C + + G + F++YS
Sbjct: 68 RLNDFGFLHYAVVALGTPNVTFLVALDTGSDLFWVPCDCLKCAPFQSPNYGS-LKFDVYS 126
Query: 157 PNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
P S+TS KVPC+S LC+LQ C S ++CPY ++YLSD T S+G LVEDVL+L +D Q
Sbjct: 127 PAQSTTSRKVPCSSNLCDLQNACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDSAQ 186
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD 276
SK V + I FGCG+VQTGSFL AAPNGL GLGMD SVPS+LA++GL NSFSMCFG D
Sbjct: 187 SKIVTAPIMFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCFGDD 246
Query: 277 GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIFDSGTSFTYL 336
G GRI+FGD GS Q ETP ++ + +P YNITIT ++VG +++ EFSAI DSGTSFT L
Sbjct: 247 GHGRINFGDTGSSDQKETPLNVYKQNPYYNITITGITVGSKSISTEFSAIVDSGTSFTAL 306
Query: 337 NDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
+DP YTQI+ +F++ + R S +PFE+CY
Sbjct: 307 SDPMYTQITSSFDAQIRSSRNMLDSSMPFEFCY 339
>gi|52076082|dbj|BAD46595.1| aspartic proteinase nepenthesin II -like [Oryza sativa Japonica
Group]
Length = 476
Score = 326 bits (835), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 158/275 (57%), Positives = 203/275 (73%), Gaps = 1/275 (0%)
Query: 95 TYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNI 154
T LN GFLHY V++G P ++F+VALDTGSDLFW+PCDC+ C + + G + F++
Sbjct: 52 TADLNDFGFLHYAVVALGTPNVTFLVALDTGSDLFWVPCDCLKCAPFQSPNYGS-LKFDV 110
Query: 155 YSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
YSP S+TS KVPC+S LC+LQ C S ++CPY ++YLSD T S+G LVEDVL+L +D
Sbjct: 111 YSPAQSTTSRKVPCSSNLCDLQNACRSKSNSCPYSIQYLSDNTSSSGVLVEDVLYLTSDS 170
Query: 215 KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG 274
QSK V + I FGCG+VQTGSFL AAPNGL GLGMD SVPS+LA++GL NSFSMCFG
Sbjct: 171 AQSKIVTAPIMFGCGQVQTGSFLGSAAPNGLLGLGMDSKSVPSLLASKGLAANSFSMCFG 230
Query: 275 SDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIFDSGTSFT 334
DG GRI+FGD GS Q ETP ++ + +P YNITIT ++VG +++ EFSAI DSGTSFT
Sbjct: 231 DDGHGRINFGDTGSSDQKETPLNVYKQNPYYNITITGITVGSKSISTEFSAIVDSGTSFT 290
Query: 335 YLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
L+DP YTQI+ +F++ + R S +PFE+CY
Sbjct: 291 ALSDPMYTQITSSFDAQIRSSRNMLDSSMPFEFCY 325
>gi|224033419|gb|ACN35785.1| unknown [Zea mays]
gi|413934980|gb|AFW69531.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
Length = 543
Score = 326 bits (835), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 177/364 (48%), Positives = 229/364 (62%), Gaps = 29/364 (7%)
Query: 29 TFGFDFHHRYSDPVK-------GILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQG 81
+FGFD HHR+S V+ G LA D P +G+ YYSAL+ DR R A G
Sbjct: 33 SFGFDLHHRFSPVVRRWAEARGGPLAADQWPARGTPEYYSALSRHDRARRA-----LAGG 87
Query: 82 NDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSC--V 139
D LTF+AGNDTY+ G L+Y V +G P +F+VALDTGSDLFW+PCDC C +
Sbjct: 88 ADDGLLTFAAGNDTYQS---GTLYYAEVELGTPNATFLVALDTGSDLFWVPCDCRQCATI 144
Query: 140 HGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSA-GSNCPYQVRYLSDGTM 198
N + YSP SSTS +V C++ LC + C +A +CPY+V+Y+S T
Sbjct: 145 PSANGTGQDAPSLRPYSPRRSSTSKQVACDNPLCGQRNGCSAATNGSCPYEVQYVSANTS 204
Query: 199 STGFLVEDVLHLATDE----KQSKSVDSRISFGCGRVQTGSFLDGA--APNGLFGLGMDK 252
S+G LV+DVLHL + +++ + + FGCG+VQTG+FLDG A +GL GLGM K
Sbjct: 205 SSGVLVQDVLHLTRERPGPGAAGEALQAPVVFGCGQVQTGAFLDGGGGAVDGLMGLGMGK 264
Query: 253 TSVPSILANQGLIP-NSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQ 311
SVPS LA GL+ +SFSMCFG DG GR++FGD GS GQ ETPF++R +PTYN++ T
Sbjct: 265 VSVPSALAASGLVASDSFSMCFGDDGVGRVNFGDAGSRGQAETPFTVRSLNPTYNVSFTS 324
Query: 312 VSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKR----ETSTSDLPFEY 367
+ VG +V EF+A+ DSGTSFTYL+DP YTQ++ FNS E+R S PFEY
Sbjct: 325 IGVGSESVAAEFAAVMDSGTSFTYLSDPEYTQLATKFNSQVSERRVNFSSGSADPFPFEY 384
Query: 368 CYVL 371
CY L
Sbjct: 385 CYRL 388
>gi|18409320|ref|NP_566948.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|27754243|gb|AAO22575.1| unknown protein [Arabidopsis thaliana]
gi|332645259|gb|AEE78780.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 529
Score = 326 bits (835), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 183/353 (51%), Positives = 233/353 (66%), Gaps = 8/353 (2%)
Query: 24 CFGFGTFGFDFHHRYSDPVKGILAVDDL-PKKGSFAYYSALAHRDRYFRLRGRGLAAQGN 82
C G F F+ HH +SD VK L +DDL P+KGS Y+ LA RDR +RGRGLA+ N
Sbjct: 23 CEASGKFSFEVHHMFSDRVKQSLGLDDLVPEKGSLEYFKVLAQRDRL--IRGRGLASN-N 79
Query: 83 DKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVS-CVHG 141
++TP+TF GN T ++ LGFLHY NVSVG PA F+VALDTGSDLFWLPC+C S C+
Sbjct: 80 EETPITFMRGNRTISIDLLGFLHYANVSVGTPATWFLVALDTGSDLFWLPCNCGSTCIRD 139
Query: 142 LNSSS-GQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMST 200
L Q N+YSPNTSSTSS + C+ C +C S S+CPYQ++YLS T +T
Sbjct: 140 LKEVGLSQSRPLNLYSPNTSSTSSSIRCSDDRCFGSSRCSSPASSCPYQIQYLSKDTFTT 199
Query: 201 GFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILA 260
G L EDVLHL T+++ + V + I+ GCG+ QTG AA NGL GLG+ SVPSILA
Sbjct: 200 GTLFEDVLHLVTEDEGLEPVKANITLGCGKNQTGFLQSSAAVNGLLGLGLKDYSVPSILA 259
Query: 261 NQGLIPNSFSMCFGS--DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNA 318
+ NSFSMCFG+ D GRISFGDKG Q ETP + PTY +++T+VSVGG+A
Sbjct: 260 KAKITANSFSMCFGNIIDVVGRISFGDKGYTDQMETPLLPTEPSPTYAVSVTEVSVGGDA 319
Query: 319 VNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVL 371
V + A+FD+GTSFT+L +P Y I++ F+ +KR +LPFE+CY L
Sbjct: 320 VGVQLLALFDTGTSFTHLLEPEYGLITKAFDDHVTDKRRPIDPELPFEFCYDL 372
>gi|42565828|ref|NP_190704.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332645262|gb|AEE78783.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 488
Score = 325 bits (832), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 166/347 (47%), Positives = 233/347 (67%), Gaps = 12/347 (3%)
Query: 28 GTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPL 87
G+ F+ HHR+S+ VK +L LP+ GS YY AL HRDR GR L + N++T +
Sbjct: 20 GSLSFEIHHRFSEQVKTVLGGHGLPEMGSLDYYKALVHRDR-----GRQLTSNNNNQTTI 74
Query: 88 TFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVS-CVHGLNSSS 146
+F+ GN T + FLHY NV++G PA F+VALDTGSDLFWLPC+C S CV + +
Sbjct: 75 SFAQGNST---EEISFLHYANVTIGTPAQWFLVALDTGSDLFWLPCNCNSTCVRSMETDQ 131
Query: 147 GQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVED 206
G+ I NIY+P+ S +SSKV CNSTLC L+ +C S S+CPY++RYLS G+ STG LVED
Sbjct: 132 GERIKLNIYNPSKSKSSSKVTCNSTLCALRNRCISPVSDCPYRIRYLSPGSKSTGVLVED 191
Query: 207 VLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIP 266
V+H++T+E +++ D+RI+FGC Q G F + A NG+ GL + +VP++L G+
Sbjct: 192 VIHMSTEEGEAR--DARITFGCSESQLGLFKE-VAVNGIMGLAIADIAVPNMLVKAGVAS 248
Query: 267 NSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAI 326
+SFSMCFG +G G ISFGDKGS Q ETP S + Y+++IT+ VG V+ EF+A
Sbjct: 249 DSFSMCFGPNGKGTISFGDKGSSDQLETPLSGTISPMFYDVSITKFKVGKVTVDTEFTAT 308
Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRS 373
FDSGT+ T+L +P YT ++ F+ ++R + + D PFE+CY++ S
Sbjct: 309 FDSGTAVTWLIEPYYTALTTNFHLSVPDRRLSKSVDSPFEFCYIITS 355
>gi|186510920|ref|NP_190702.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332645260|gb|AEE78781.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 530
Score = 324 bits (830), Expect = 5e-86, Method: Compositional matrix adjust.
Identities = 174/366 (47%), Positives = 232/366 (63%), Gaps = 9/366 (2%)
Query: 13 LLILLSCCAGCCFGFGTFGFDFHHRYSDPVKGILAVDDL-PKKGSFAYYSALAHRDRYFR 71
+L+L+ C G F F+ HH +SD VK L DDL P+ GS Y+ LAHRDR+
Sbjct: 13 MLVLIFWGLERCEASGKFSFEVHHMFSDVVKQTLGFDDLVPENGSLEYFKVLAHRDRF-- 70
Query: 72 LRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWL 131
+RGRGLA+ N++TPLT N T LN LGFLHY NVS+G PA F+VALDTGSDLFWL
Sbjct: 71 IRGRGLASN-NEETPLTSIGSNLTLALNFLGFLHYANVSLGTPATWFLVALDTGSDLFWL 129
Query: 132 PCDC-VSCVHGLNSSS-GQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQ 189
PC+C +C+H L + + + N+Y+PN S+TSS + C+ C +C S S CPYQ
Sbjct: 130 PCNCGTTCIHDLKDARFSESVPLNLYTPNASTTSSSIRCSDKRCFGSGKCSSPESICPYQ 189
Query: 190 VRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLG 249
+ LS T++TG L++DVLHL T+++ K V++ ++ GCG+ QTG+F A NG+ GL
Sbjct: 190 IA-LSSNTVTTGTLLQDVLHLVTEDEDLKPVNANVTLGCGQNQTGAFQTDIAVNGVLGLS 248
Query: 250 MDKTSVPSILANQGLIPNSFSMCFGS--DGTGRISFGDKGSPGQGETPFSLRQTHPTYNI 307
M + SVPS+LA + NSFSMCFG GRISFGDKG Q ETP +T Y +
Sbjct: 249 MKEYSVPSLLAKANITANSFSMCFGRIISVVGRISFGDKGYTDQEETPLVSLETSTAYGV 308
Query: 308 TITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEY 367
+T VSVGG V+ A+FD+G+SFT L + AY ++ F+ L ++KR D PFE+
Sbjct: 309 NVTGVSVGGVPVDVPLFALFDTGSSFTLLLESAYGVFTKAFDDLMEDKRRPVDPDFPFEF 368
Query: 368 CYVLRS 373
CY LR
Sbjct: 369 CYDLRE 374
>gi|226499286|ref|NP_001147826.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
gi|195613980|gb|ACG28820.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
Length = 545
Score = 324 bits (830), Expect = 6e-86, Method: Compositional matrix adjust.
Identities = 176/363 (48%), Positives = 229/363 (63%), Gaps = 29/363 (7%)
Query: 30 FGFDFHHRYSDPVK-------GILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGN 82
FGFD HHR+S V+ G LA D P +G+ YYSAL+ DR R A G
Sbjct: 36 FGFDLHHRFSPVVRRWAEARGGPLAADRWPARGTPEYYSALSRHDR-----ARRALAGGA 90
Query: 83 DKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSC--VH 140
D LTF+AGNDTY+ G L+Y V +G P +F+VALDTGSDLFW+PCDC C +
Sbjct: 91 DDGLLTFAAGNDTYQS---GTLYYAEVELGTPNATFLVALDTGSDLFWVPCDCRQCATIP 147
Query: 141 GLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSA-GSNCPYQVRYLSDGTMS 199
N++ YSP SSTS +V C++ LC + C +A +CPY+V+Y+S T S
Sbjct: 148 SANATGPDAPPLRPYSPRRSSTSEQVACDNPLCGRRNGCSAATNGSCPYEVQYVSANTSS 207
Query: 200 TGFLVEDVLHLATDE----KQSKSVDSRISFGCGRVQTGSFLD--GAAPNGLFGLGMDKT 253
+G LV+DVLHL + +++ + + FGCG+VQTG+FLD G A +GL GLGM K
Sbjct: 208 SGVLVQDVLHLTRERPGPGAAGEALQAPVVFGCGQVQTGAFLDDGGGAVDGLMGLGMGKV 267
Query: 254 SVPSILANQGLIP-NSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQV 312
SVPS LA GL+ +SFSMCFG DG GR++FGD GS GQ ETPF++R +PTYN++ T +
Sbjct: 268 SVPSALAASGLVASDSFSMCFGDDGVGRVNFGDAGSRGQAETPFTVRSLNPTYNVSFTSI 327
Query: 313 SVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKR----ETSTSDLPFEYC 368
+G +V EF+A+ DSGTSFTYL+DP YTQ++ FNS E+R S PFEYC
Sbjct: 328 GIGSESVAAEFAAVMDSGTSFTYLSDPEYTQLATKFNSQVSERRVNFSSGSADPFPFEYC 387
Query: 369 YVL 371
Y L
Sbjct: 388 YRL 390
>gi|6562286|emb|CAB62656.1| putative protein [Arabidopsis thaliana]
Length = 518
Score = 323 bits (829), Expect = 7e-86, Method: Compositional matrix adjust.
Identities = 174/366 (47%), Positives = 232/366 (63%), Gaps = 9/366 (2%)
Query: 13 LLILLSCCAGCCFGFGTFGFDFHHRYSDPVKGILAVDDL-PKKGSFAYYSALAHRDRYFR 71
+L+L+ C G F F+ HH +SD VK L DDL P+ GS Y+ LAHRDR+
Sbjct: 1 MLVLIFWGLERCEASGKFSFEVHHMFSDVVKQTLGFDDLVPENGSLEYFKVLAHRDRF-- 58
Query: 72 LRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWL 131
+RGRGLA+ N++TPLT N T LN LGFLHY NVS+G PA F+VALDTGSDLFWL
Sbjct: 59 IRGRGLASN-NEETPLTSIGSNLTLALNFLGFLHYANVSLGTPATWFLVALDTGSDLFWL 117
Query: 132 PCDC-VSCVHGLNSSS-GQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQ 189
PC+C +C+H L + + + N+Y+PN S+TSS + C+ C +C S S CPYQ
Sbjct: 118 PCNCGTTCIHDLKDARFSESVPLNLYTPNASTTSSSIRCSDKRCFGSGKCSSPESICPYQ 177
Query: 190 VRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLG 249
+ LS T++TG L++DVLHL T+++ K V++ ++ GCG+ QTG+F A NG+ GL
Sbjct: 178 IA-LSSNTVTTGTLLQDVLHLVTEDEDLKPVNANVTLGCGQNQTGAFQTDIAVNGVLGLS 236
Query: 250 MDKTSVPSILANQGLIPNSFSMCFGS--DGTGRISFGDKGSPGQGETPFSLRQTHPTYNI 307
M + SVPS+LA + NSFSMCFG GRISFGDKG Q ETP +T Y +
Sbjct: 237 MKEYSVPSLLAKANITANSFSMCFGRIISVVGRISFGDKGYTDQEETPLVSLETSTAYGV 296
Query: 308 TITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEY 367
+T VSVGG V+ A+FD+G+SFT L + AY ++ F+ L ++KR D PFE+
Sbjct: 297 NVTGVSVGGVPVDVPLFALFDTGSSFTLLLESAYGVFTKAFDDLMEDKRRPVDPDFPFEF 356
Query: 368 CYVLRS 373
CY LR
Sbjct: 357 CYDLRE 362
>gi|414888271|tpg|DAA64285.1| TPA: hypothetical protein ZEAMMB73_923514, partial [Zea mays]
Length = 335
Score = 322 bits (824), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 178/317 (56%), Positives = 223/317 (70%), Gaps = 12/317 (3%)
Query: 33 DFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAG 92
D HHRYS V+ A P G+ YY+ALA D LR R LA G + F+ G
Sbjct: 25 DVHHRYSATVRE-WAGHRAPPAGTAEYYAALAGHD----LRRRSLAGGGE----VAFADG 75
Query: 93 NDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDF 152
NDTYRLN LGFLHY V++G P ++F+VALDTGSDLFW+PCDC++C L S + + + F
Sbjct: 76 NDTYRLNELGFLHYAVVALGTPNVTFLVALDTGSDLFWVPCDCINCAP-LVSPNYRDLKF 134
Query: 153 NIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLAT 212
+ YSP SSTS KVPC+S LC+ Q C SA S+CPY ++YLSD T STG LVEDVL+L T
Sbjct: 135 DTYSPQKSSTSRKVPCSSNLCDEQSACRSASSSCPYSIQYLSDNTSSTGVLVEDVLYLVT 194
Query: 213 DE-KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIP-NSFS 270
+ +Q K V + I+FGCGR QTGSFL AAPNGL GLGMD SVPS+LA+QG+ NSFS
Sbjct: 195 EYGRQPKIVTAPITFGCGRTQTGSFLGTAAPNGLLGLGMDTISVPSLLASQGVAAANSFS 254
Query: 271 MCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIFDSG 330
MCF DG GRI+FGD GS Q ETP ++ + +P YNI+IT +VG +++ +F+AI DSG
Sbjct: 255 MCFAQDGHGRINFGDTGSSDQQETPLNMYKQNPYYNISITGATVGSKSIHTKFNAIVDSG 314
Query: 331 TSFTYLNDPAYTQISET 347
TSFT L+DP YTQI+ +
Sbjct: 315 TSFTALSDPMYTQITSS 331
>gi|242094226|ref|XP_002437603.1| hypothetical protein SORBIDRAFT_10g030330 [Sorghum bicolor]
gi|241915826|gb|EER88970.1| hypothetical protein SORBIDRAFT_10g030330 [Sorghum bicolor]
Length = 541
Score = 317 bits (812), Expect = 7e-84, Method: Compositional matrix adjust.
Identities = 177/361 (49%), Positives = 230/361 (63%), Gaps = 25/361 (6%)
Query: 29 TFGFDFHHRYSDPVKGI-------LAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQG 81
+ GFD HHR+S V+ A D P +GS YYSAL+ DR R R LA G
Sbjct: 33 SVGFDLHHRFSPVVRQWAEARGHPFAAQDWPARGSPEYYSALSRHDRAVLSR-RALA-DG 90
Query: 82 NDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHG 141
D +TF+AGNDT L +G L+Y V VG P +F+VALDTGSDLFW+PCDC C
Sbjct: 91 ADGL-VTFAAGNDT--LQYIGSLYYAVVEVGTPNATFLVALDTGSDLFWVPCDCKQCASI 147
Query: 142 LNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSA-GSNCPYQVRYLSDGTMST 200
N + YSP SSTS +V C++ LC+ C +A +CPY+V+YLS T ++
Sbjct: 148 ANVTGQPATALRPYSPRESSTSKQVTCDNALCDRPNGCSAATNGSCPYEVQYLSANTSTS 207
Query: 201 GFLVEDVLHLATDE-----KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSV 255
G LV+DVLHL + + +++ + + FGCG+VQTG+FLDGAA +GL GLG + SV
Sbjct: 208 GVLVQDVLHLTRERPGAAAEAGEALQAPVVFGCGQVQTGTFLDGAAFDGLMGLGRENVSV 267
Query: 256 PSILANQGLIP-NSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSV 314
PS+LA+ GL+ +SFSMCFG DG GRI+FGD GS GQGETPF+ R+T YN++ T V+V
Sbjct: 268 PSVLASSGLVASDSFSMCFGDDGVGRINFGDSGSSGQGETPFTGRRT--LYNVSFTAVNV 325
Query: 315 GGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRET----STSDLPFEYCYV 370
+V EF+A+ DSGTSFTYL DP YT+++ FNSL +E+R S PFEYCY
Sbjct: 326 ETKSVAAEFAAVIDSGTSFTYLADPEYTELATNFNSLVRERRTNFSSGSADPFPFEYCYA 385
Query: 371 L 371
L
Sbjct: 386 L 386
>gi|297819836|ref|XP_002877801.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297323639|gb|EFH54060.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 513
Score = 317 bits (812), Expect = 7e-84, Method: Compositional matrix adjust.
Identities = 169/377 (44%), Positives = 237/377 (62%), Gaps = 26/377 (6%)
Query: 28 GTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPL 87
G+ F+ HHR+S+ VK +L LP+ GS YY AL HRDR GR L + N++T +
Sbjct: 30 GSLSFEIHHRFSEQVKTVLGGHGLPEMGSLDYYKALVHRDR-----GRRLTSN-NNQTTI 83
Query: 88 TFSAGNDTYRLNS----------LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVS 137
+F+ GN T ++ +LHY NV++G PA F+VALDTGSDLFWLPC+C S
Sbjct: 84 SFAQGNSTEEISLYDQNLAPPLFFNYLHYANVTIGTPAQWFLVALDTGSDLFWLPCNCNS 143
Query: 138 -CVHGLNSSSG------QVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQV 190
CV + + G Q I NIY+P+ S++SSKV CNSTLC L+ +C S S+CPY++
Sbjct: 144 TCVRSMETDQGETHMNAQRIRLNIYNPSISTSSSKVTCNSTLCALRNRCISPLSDCPYRI 203
Query: 191 RYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGM 250
RYLS G+ STG LVEDV+H++T+E +++ D+RI+FGC Q G F + A NG+ GL M
Sbjct: 204 RYLSPGSKSTGVLVEDVIHMSTEEGEAR--DARITFGCSETQLGLFQE-VAVNGIMGLAM 260
Query: 251 DKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITIT 310
+VP++L G+ +SFSMCFG +G G ISFGDKGS Q ETP + Y+++IT
Sbjct: 261 ADIAVPNMLVKAGVASDSFSMCFGPNGKGTISFGDKGSSDQHETPLGGTISPLFYDVSIT 320
Query: 311 QVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYV 370
+ VG V +FSAIFDSGT+ T+L DP YT ++ F+ ++R + D FE+CY+
Sbjct: 321 KFKVGKVTVETKFSAIFDSGTAVTWLLDPYYTALTTNFHLSVPDRRLPANVDSTFEFCYI 380
Query: 371 LRSFLHLQALVVLPFPL 387
+ S + L + F +
Sbjct: 381 ITSTSDEEKLPSISFEM 397
>gi|449529194|ref|XP_004171586.1| PREDICTED: aspartic proteinase-like protein 1-like, partial
[Cucumis sativus]
Length = 417
Score = 316 bits (810), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 146/266 (54%), Positives = 190/266 (71%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
LHYT V +G P F+VALDTGSDLFW+PCDC C S + ++YSP SSTS
Sbjct: 3 LHYTTVQLGTPGTKFMVALDTGSDLFWVPCDCSRCAPTEGSPYASDFELSVYSPKKSSTS 62
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
VPCN++LC + QC A NCPY V Y+S T +TG L+ED+LHL T+ K S+ + +
Sbjct: 63 KTVPCNNSLCAQRDQCTEAFGNCPYVVSYVSAETSTTGILIEDLLHLKTENKHSEPIQAY 122
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISF 283
I+FGCG+VQ+GSFLD AAPNGLFGLGM++ SVPSIL+ +GL+ NSFSMCF DG GRI+F
Sbjct: 123 ITFGCGQVQSGSFLDVAAPNGLFGLGMEQISVPSILSREGLMANSFSMCFSDDGVGRINF 182
Query: 284 GDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQ 343
GDKGS Q ETPF+L Q HP YNIT+T + VG ++ + +A+FDSGTSF+Y DP Y++
Sbjct: 183 GDKGSLEQEETPFNLNQLHPNYNITVTSIRVGTTLIDADITALFDSGTSFSYFTDPIYSK 242
Query: 344 ISETFNSLAKEKRETSTSDLPFEYCY 369
+S +F++ ++ R +PFEYCY
Sbjct: 243 LSASFHAQTRDGRHPPNPRIPFEYCY 268
>gi|297819834|ref|XP_002877800.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297323638|gb|EFH54059.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 531
Score = 315 bits (808), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 170/353 (48%), Positives = 227/353 (64%), Gaps = 9/353 (2%)
Query: 24 CFGFGTFGFDFHHRYSDPVKGILAVDDL-PKKGSFAYYSALAHRDRYFRLRGRGLAAQGN 82
C G FGF+ HH +SD VK L +DDL P++GS Y+ LAHRDR +RGRGLA+ N
Sbjct: 23 CEASGKFGFEVHHIFSDAVKQSLGLDDLVPEQGSLEYFKVLAHRDRL--IRGRGLASN-N 79
Query: 83 DKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDC-VSCVHG 141
+ TP+TF GN T + LG L+Y NVSVG P SF+VALDTGSDLFWLPC+C +C+
Sbjct: 80 EDTPVTFDGGNLTVSIKLLGSLYYANVSVGTPPSSFLVALDTGSDLFWLPCNCGTTCIRD 139
Query: 142 LNS-SSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMST 200
L Q + N+Y+PN S+TSS + C+ C K+C S S CPYQ+ Y S+ T +T
Sbjct: 140 LEDIGVPQSVPLNLYTPNASTTSSSIRCSDKRCFGSKKCSSPKSICPYQISY-SNSTGTT 198
Query: 201 GFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILA 260
G L++DVLHLAT+++ V + ++ GCG+ QTG F + NG+ GLG+ SVPS+LA
Sbjct: 199 GTLLQDVLHLATEDENLTPVKTNVTLGCGQKQTGLFQRNNSVNGVLGLGIKGYSVPSLLA 258
Query: 261 NQGLIPNSFSMCFGS--DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNA 318
+ +SFSMCFG GRISFGDKG Q ETPF Y + +T VSVGG+
Sbjct: 259 KANITADSFSMCFGRVIGNVGRISFGDKGYTDQEETPFISVAPSTAYGLNVTGVSVGGDP 318
Query: 319 VNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVL 371
V A FD+G+SFT+L +PAY ++++F+ L ++KR +LPFE+CY L
Sbjct: 319 VGTRLFAKFDTGSSFTHLMEPAYGVLTKSFDDLVEDKRRPVDPELPFEFCYDL 371
>gi|6562285|emb|CAB62655.1| putative protein [Arabidopsis thaliana]
Length = 519
Score = 315 bits (806), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 183/353 (51%), Positives = 231/353 (65%), Gaps = 14/353 (3%)
Query: 24 CFGFGTFGFDFHHRYSDPVKGILAVDDL-PKKGSFAYYSALAHRDRYFRLRGRGLAAQGN 82
C G F F+ HH +SD VK L +DDL P+KGS Y+ LA RDR +RGRGLA+ N
Sbjct: 23 CEASGKFSFEVHHMFSDRVKQSLGLDDLVPEKGSLEYFKVLAQRDRL--IRGRGLASN-N 79
Query: 83 DKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVS-CVHG 141
++TP+TF GN T ++ LGFLHY NVSVG PA F+VALDTGSDLFWLPC+C S C+
Sbjct: 80 EETPITFMRGNRTISIDLLGFLHYANVSVGTPATWFLVALDTGSDLFWLPCNCGSTCIRD 139
Query: 142 LNSSS-GQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMST 200
L Q N+YSPNTSSTSS + C+ C +C S S+CPYQ++YLS T +T
Sbjct: 140 LKEVGLSQSRPLNLYSPNTSSTSSSIRCSDDRCFGSSRCSSPASSCPYQIQYLSKDTFTT 199
Query: 201 GFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILA 260
G L EDVLHL T+++ + V + I+ GCG+ QTG AA NGL GLG+ SVPSILA
Sbjct: 200 GTLFEDVLHLVTEDEGLEPVKANITLGCGKNQTGFLQSSAAVNGLLGLGLKDYSVPSILA 259
Query: 261 NQGLIPNSFSMCFGS--DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNA 318
+ NSFSMCFG+ D GRISFGDKG Q ETP L T P ++T+VSVGG+A
Sbjct: 260 KAKITANSFSMCFGNIIDVVGRISFGDKGYTDQMETP--LLPTEP----SVTEVSVGGDA 313
Query: 319 VNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVL 371
V + A+FD+GTSFT+L +P Y I++ F+ +KR +LPFE+CY L
Sbjct: 314 VGVQLLALFDTGTSFTHLLEPEYGLITKAFDDHVTDKRRPIDPELPFEFCYDL 366
>gi|42565826|ref|NP_190703.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332645261|gb|AEE78782.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 528
Score = 314 bits (804), Expect = 6e-83, Method: Compositional matrix adjust.
Identities = 169/359 (47%), Positives = 229/359 (63%), Gaps = 15/359 (4%)
Query: 24 CFGF------GTFGFDFHHRYSDPVKGILAVDDL-PKKGSFAYYSALAHRDRYFRLRGRG 76
C+GF G FGF+ HH +SD VK L + DL P++GS Y+ LAHRDR +RGRG
Sbjct: 17 CWGFERCEATGKFGFEVHHIFSDSVKQSLGLGDLVPEQGSLEYFKVLAHRDRL--IRGRG 74
Query: 77 LAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDC- 135
LA+ ND+TP+TF GN T + LG L+Y NVSVG P SF+VALDTGSDLFWLPC+C
Sbjct: 75 LASN-NDETPITFDGGNLTVSVKLLGSLYYANVSVGTPPSSFLVALDTGSDLFWLPCNCG 133
Query: 136 VSCVHGLNS-SSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLS 194
+C+ L Q + N+Y+PN S+TSS + C+ C K+C S S CPYQ+ Y S
Sbjct: 134 TTCIRDLEDIGVPQSVPLNLYTPNASTTSSSIRCSDKRCFGSKKCSSPSSICPYQISY-S 192
Query: 195 DGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTS 254
+ T + G L++DVLHLAT+++ V + ++ GCG+ QTG F + NG+ GLG+ S
Sbjct: 193 NSTGTKGTLLQDVLHLATEDENLTPVKANVTLGCGQKQTGLFQRNNSVNGVLGLGIKGYS 252
Query: 255 VPSILANQGLIPNSFSMCFGS--DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQV 312
VPS+LA + NSFSMCFG GRISFGD+G Q ETPF Y + I+ V
Sbjct: 253 VPSLLAKANITANSFSMCFGRVIGNVGRISFGDRGYTDQEETPFISVAPSTAYGVNISGV 312
Query: 313 SVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVL 371
SV G+ V+ A FD+G+SFT+L +PAY ++++F+ L +++R +LPFE+CY L
Sbjct: 313 SVAGDPVDIRLFAKFDTGSSFTHLREPAYGVLTKSFDELVEDRRRPVDPELPFEFCYDL 371
>gi|297819828|ref|XP_002877797.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297323635|gb|EFH54056.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 530
Score = 312 bits (799), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 185/353 (52%), Positives = 230/353 (65%), Gaps = 8/353 (2%)
Query: 24 CFGFGTFGFDFHHRYSDPVKGILAVDDL-PKKGSFAYYSALAHRDRYFRLRGRGLAAQGN 82
C G F F+ HH +SD VK L +DDL P+KGS Y+ LA RDR +RGRGLA+ N
Sbjct: 24 CEASGKFSFEVHHMFSDRVKQTLGLDDLVPEKGSLEYFKVLAQRDRL--IRGRGLASN-N 80
Query: 83 DKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVS-CVHG 141
++TP+TF GN T ++ LGFLHY NVSVG PA F+VALDTGS+LFWLPC+C S C+
Sbjct: 81 EETPITFMRGNRTVSIDFLGFLHYANVSVGTPATWFLVALDTGSNLFWLPCNCGSTCIRD 140
Query: 142 LNSSS-GQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMST 200
L Q N+YSPNTSSTSS + CN C QC S S+CPYQ++YLS T +T
Sbjct: 141 LKDIGLSQSRPLNLYSPNTSSTSSSIRCNDDRCFGSSQCSSPASSCPYQIQYLSKDTFTT 200
Query: 201 GFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILA 260
G L EDVLHL T++ K V + I+ GCGR QTG AA NGL GLGM SVPSILA
Sbjct: 201 GTLFEDVLHLVTEDVDLKPVKANITLGCGRNQTGFLQSSAAINGLLGLGMKDYSVPSILA 260
Query: 261 NQGLIPNSFSMCFGS--DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNA 318
+ NSFSMCFG+ D GRISFGDKG Q ETP + PTY + +T+VSVGG+
Sbjct: 261 KAKITANSFSMCFGNIIDVIGRISFGDKGYTDQMETPLLPTEPSPTYAVNVTEVSVGGDV 320
Query: 319 VNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVL 371
V + A+FD+GTSFT+L +P Y I++ F+ +KR ++PFE+CY L
Sbjct: 321 VGVQLLALFDTGTSFTHLLEPEYGLITKAFDDHVTDKRRPIDPEIPFEFCYDL 373
>gi|357117138|ref|XP_003560331.1| PREDICTED: aspartic proteinase-like protein 1-like, partial
[Brachypodium distachyon]
Length = 509
Score = 311 bits (796), Expect = 5e-82, Method: Compositional matrix adjust.
Identities = 177/375 (47%), Positives = 231/375 (61%), Gaps = 32/375 (8%)
Query: 28 GTFGFDFHHRYSDPVK------GILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQG 81
G GFD HHR+S VK G A +GS YYSAL+ DR R + A G
Sbjct: 7 GGVGFDLHHRFSPVVKRWAESRGRPAAAAWWPEGSPEYYSALSAHDR-----ARRVLAGG 61
Query: 82 NDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHG 141
++ L+F+ GN T R G LHY V++G P +F+VALDTGSDLFW+PCDC C
Sbjct: 62 KGESLLSFADGNSTTR--HAGSLHYAKVALGTPNATFVVALDTGSDLFWVPCDCKRCAPI 119
Query: 142 LNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTG 201
N+S YSP SSTS V C+ +LC+ C + +CPY V+Y+S T S+G
Sbjct: 120 ANTSE----LLKPYSPRQSSTSKPVTCSHSLCDRPNACGNGNGSCPYTVKYVSANTSSSG 175
Query: 202 FLVEDVLHLATDEKQS---------KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDK 252
LVEDVL++ S ++V +R+ FGCG+ QTG+FLDGAA GL GLGMD+
Sbjct: 176 VLVEDVLYMTRQSSSSRSGNGGNVGEAVGARVVFGCGQEQTGAFLDGAAMEGLLGLGMDR 235
Query: 253 TSVPSILANQGLI-PNSFSMCFGSDGTGRISFGDKGSPG-QGETPFSLRQTHPTYNITIT 310
SVPS+LA GL+ +SFSMCF DG GRI+FG+ G Q ETPF + +T PTYNI++T
Sbjct: 236 VSVPSLLAAAGLVGSDSFSMCFSPDGNGRINFGEPSDAGAQNETPFIVSKTRPTYNISVT 295
Query: 311 QVSVGGN-AVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
V+V G A+ EF+A+ DSGTSFTYLNDPAY+ ++ +FNS +EKR ++ +PFEYCY
Sbjct: 296 AVNVKGKGAMAAEFAAVVDSGTSFTYLNDPAYSLLATSFNSQVREKRANLSASIPFEYCY 355
Query: 370 VLRSFLHLQALVVLP 384
L Q V++P
Sbjct: 356 ALS---RGQTEVLMP 367
>gi|226501154|ref|NP_001146408.1| uncharacterized protein LOC100279988 [Zea mays]
gi|219887047|gb|ACL53898.1| unknown [Zea mays]
gi|414587777|tpg|DAA38348.1| TPA: hypothetical protein ZEAMMB73_272638 [Zea mays]
Length = 416
Score = 309 bits (791), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 151/270 (55%), Positives = 190/270 (70%), Gaps = 4/270 (1%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
LHY V+VG P +F+VALDTGSDLFWLPC C C ++SG Y P SSTS
Sbjct: 6 LHYALVTVGTPGQTFMVALDTGSDLFWLPCQCDGCTPPATAASGSA---TFYIPGMSSTS 62
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
VPCNS C+LQK+C S CPY++ Y+S GT S+GFLVEDVL+L+T+ + + ++
Sbjct: 63 KAVPCNSNFCDLQKEC-STALQCPYKMVYVSAGTSSSGFLVEDVLYLSTENAHPQILKAQ 121
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISF 283
I GCG+ QTGSFLD AAPNGLFGLG+D+ SVPSILA +GL NSFSMCFG DG GRISF
Sbjct: 122 IMLGCGQTQTGSFLDAAAPNGLFGLGIDEVSVPSILAQKGLTSNSFSMCFGRDGIGRISF 181
Query: 284 GDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQ 343
GD+ S Q ETP + + HPTY ITI+ ++VG + +F IFD+GTSFTYL DPAYT
Sbjct: 182 GDQESSDQEETPLDINRQHPTYAITISGITVGNKPTDMDFITIFDTGTSFTYLADPAYTY 241
Query: 344 ISETFNSLAKEKRETSTSDLPFEYCYVLRS 373
I+++F++ + R + S +PFEYCY L S
Sbjct: 242 ITQSFHAQVQANRHAADSRIPFEYCYDLSS 271
>gi|242094534|ref|XP_002437757.1| hypothetical protein SORBIDRAFT_10g002060 [Sorghum bicolor]
gi|241915980|gb|EER89124.1| hypothetical protein SORBIDRAFT_10g002060 [Sorghum bicolor]
Length = 575
Score = 307 bits (787), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 181/372 (48%), Positives = 228/372 (61%), Gaps = 40/372 (10%)
Query: 28 GTFGFDFHHRYSDPVK----------------GILAVDDLPKKGSFAYYSALAHRDRYFR 71
G GF+ HHR+S V+ L ++ P GS YYSAL DR
Sbjct: 28 GGIGFNLHHRFSPVVRQWMVDARGGGHGVPGSSWLLPEEAPAVGSPEYYSALLRHDRALF 87
Query: 72 LRGRGLAAQGNDK-TPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFW 130
R RGLA+ + + T LTF+ GN T RL++ +LHY V VG P+ F+VALDTGSDLFW
Sbjct: 88 TRRRGLASAADGQSTTLTFADGNAT-RLDTYEYLHYAEVEVGTPSSKFLVALDTGSDLFW 146
Query: 131 LPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAG---SNCP 187
LPC+C C N S+ +YSP+ SSTS VPC LCE C +AG S+CP
Sbjct: 147 LPCECKLCAK--NGST-------MYSPSLSSTSKTVPCGHPLCERPDACATAGKSSSSCP 197
Query: 188 YQVRYLSDGTMSTGFLVEDVLHLA--TDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGL 245
Y+V+Y+S T S+G LVEDVLHL K+V + I FGCG+VQTG+FL GAA GL
Sbjct: 198 YEVKYVSANTGSSGVLVEDVLHLVDGGGGGGGKAVQAPIVFGCGQVQTGAFLRGAAAGGL 257
Query: 246 FGLGMDKTSVPSILANQGLIP-NSFSMCFGSDGTGRISFGDKGSPGQGETPF----SLRQ 300
GLG+DK SVPS LA+ GL+ +SFSMCF DG GRI+FGD GSP Q ETP SL+
Sbjct: 258 MGLGLDKVSVPSALASSGLVASDSFSMCFSRDGVGRINFGDAGSPDQAETPLIAAGSLQP 317
Query: 301 THPTYNITITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETST 360
++ YNI++ ++V A+ EF+A+ DSGTSFTYL+DPAYT ++ FNS E ET
Sbjct: 318 SY--YNISVGAITVDSKAMAVEFTAVVDSGTSFTYLDDPAYTFLTTNFNSRVSEASETYG 375
Query: 361 SDL-PFEYCYVL 371
S FE+CY L
Sbjct: 376 SGYEKFEFCYRL 387
>gi|3036792|emb|CAA18482.1| putative protein (fragment) [Arabidopsis thaliana]
Length = 335
Score = 306 bits (783), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 144/259 (55%), Positives = 191/259 (73%), Gaps = 4/259 (1%)
Query: 119 IVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQ 178
+VALDTGSDLFW+PCDC C ++ + +IY+P S+T+ KV CN++LC + Q
Sbjct: 1 MVALDTGSDLFWVPCDCGKCAPTEGATYASEFELSIYNPKVSTTNKKVTCNNSLCAQRNQ 60
Query: 179 CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLD 238
C S CPY V Y+S T ++G L+EDV+HL T++K + V++ ++FGCG+VQ+GSFLD
Sbjct: 61 CLGTFSTCPYMVSYVSAQTSTSGILMEDVMHLTTEDKNPERVEAYVTFGCGQVQSGSFLD 120
Query: 239 GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSL 298
AAPNGLFGLGM+K SVPS+LA +GL+ +SFSMCFG DG GRISFGDKGS Q ETPF+L
Sbjct: 121 IAAPNGLFGLGMEKISVPSVLAREGLVADSFSMCFGHDGVGRISFGDKGSSDQEETPFNL 180
Query: 299 RQTHPTYNITITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRET 358
+HP YNIT+T+V VG ++ EF+A+FD+GTSFTYL DP YT +SE+ A++KR +
Sbjct: 181 NPSHPNYNITVTRVRVGTTLIDDEFTALFDTGTSFTYLVDPMYTTVSES----AQDKRHS 236
Query: 359 STSDLPFEYCYVLRSFLHL 377
S +PFEYCY +R L L
Sbjct: 237 PDSRIPFEYCYDMREKLVL 255
>gi|125556778|gb|EAZ02384.1| hypothetical protein OsI_24487 [Oryza sativa Indica Group]
Length = 551
Score = 296 bits (758), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 183/365 (50%), Positives = 225/365 (61%), Gaps = 29/365 (7%)
Query: 30 FGFDFHHRYSDPVK---------GILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQ 80
GFD HHRYS V+ G+ GS YYSAL+ D R RGLA Q
Sbjct: 27 LGFDLHHRYSPIVQRWAEERGHAGVSWPAGAEVIGSPEYYSALSRHDHALFAR-RGLA-Q 84
Query: 81 GNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVH 140
G+ +TF+ GN T RL+ G LHY V+VG P +F+VALDTGSDLFW+PCDC C
Sbjct: 85 GDGL--VTFADGNITLRLD--GSLHYAEVAVGTPNTTFLVALDTGSDLFWVPCDCKQCAP 140
Query: 141 GLNSSS---GQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGT 197
N ++ G + YSP+ SSTS V C S LC+ C +A S+CPY VRY T
Sbjct: 141 LGNLTAVDGGGGPELRQYSPSKSSTSKTVTCASNLCDQPNACATATSSCPYAVRYAMANT 200
Query: 198 MSTGFLVEDVLHLATDEKQSKS-----VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDK 252
S+G LVEDVL+L ++ + + V + + FGCG+VQTGSFLDGAA +GL GLGM+K
Sbjct: 201 SSSGELVEDVLYLTREKGAAAAAAGAAVRTPVVFGCGQVQTGSFLDGAAADGLMGLGMEK 260
Query: 253 TSVPSILANQGLIP-NSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQ 311
SVPSILA+ G++ NSFSMCF DG GRI+FGD GS Q ETPF ++ TH YNI+IT
Sbjct: 261 VSVPSILASTGVVKSNSFSMCFSKDGLGRINFGDTGSADQSETPFIVKSTHSYYNISITS 320
Query: 312 VSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKR-----ETSTSDLPFE 366
+SVG + F AI DSGTSFTYLNDPAYT + FN+ E+R T + PFE
Sbjct: 321 MSVGDKNLPLGFYAIADSGTSFTYLNDPAYTAYTTNFNAQISERRANFSGSTRSGPFPFE 380
Query: 367 YCYVL 371
YCY L
Sbjct: 381 YCYSL 385
>gi|18855042|gb|AAL79734.1|AC091774_25 putative chloroplast nucleoid DNA-binding protein [Oryza sativa
Japonica Group]
gi|54291046|dbj|BAD61723.1| aspartic proteinase nepenthesin II-like [Oryza sativa Japonica
Group]
gi|125598520|gb|EAZ38300.1| hypothetical protein OsJ_22678 [Oryza sativa Japonica Group]
Length = 551
Score = 296 bits (757), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 183/365 (50%), Positives = 225/365 (61%), Gaps = 29/365 (7%)
Query: 30 FGFDFHHRYSDPVK---------GILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQ 80
GFD HHRYS V+ G+ GS YYSAL+ D R RGLA Q
Sbjct: 27 LGFDLHHRYSPIVQRWAEERGHAGVSWPAGAEVIGSPEYYSALSRHDHALFAR-RGLA-Q 84
Query: 81 GNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVH 140
G+ +TF+ GN T RL+ G LHY V+VG P +F+VALDTGSDLFW+PCDC C
Sbjct: 85 GDGL--VTFADGNITLRLD--GSLHYAEVAVGTPNTTFLVALDTGSDLFWVPCDCKQCAP 140
Query: 141 GLNSSS---GQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGT 197
N ++ G + YSP+ SSTS V C S LC+ C +A S+CPY VRY T
Sbjct: 141 LGNLTAVDGGGGPELRQYSPSKSSTSKTVTCASNLCDQPNACATATSSCPYAVRYAMANT 200
Query: 198 MSTGFLVEDVLHLATDEKQSKS-----VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDK 252
S+G LVEDVL+L ++ + + V + + FGCG+VQTGSFLDGAA +GL GLGM+K
Sbjct: 201 SSSGELVEDVLYLTREKGAAAAAAGAAVRTPVVFGCGQVQTGSFLDGAAADGLMGLGMEK 260
Query: 253 TSVPSILANQGLIP-NSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQ 311
SVPSILA+ G++ NSFSMCF DG GRI+FGD GS Q ETPF ++ TH YNI+IT
Sbjct: 261 VSVPSILASTGVVKSNSFSMCFSKDGLGRINFGDTGSADQSETPFIVKSTHSYYNISITS 320
Query: 312 VSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKR-----ETSTSDLPFE 366
+SVG + F AI DSGTSFTYLNDPAYT + FN+ E+R T + PFE
Sbjct: 321 MSVGDKNLPLGFYAIADSGTSFTYLNDPAYTAYTTNFNAQISERRANFSGSTRSGPFPFE 380
Query: 367 YCYVL 371
YCY L
Sbjct: 381 YCYSL 385
>gi|6580159|emb|CAB62657.2| putative protein [Arabidopsis thaliana]
Length = 475
Score = 257 bits (657), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 151/340 (44%), Positives = 200/340 (58%), Gaps = 21/340 (6%)
Query: 24 CFGF------GTFGFDFHHRYSDPVKGILAVDDL-PKKGSFAYYSALAHRDRYFRLRGRG 76
C+GF G FGF+ HH +SD VK L + DL P++GS Y+ LAHRDR +RGRG
Sbjct: 17 CWGFERCEATGKFGFEVHHIFSDSVKQSLGLGDLVPEQGSLEYFKVLAHRDRL--IRGRG 74
Query: 77 LAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDC- 135
LA+ ND+TP+TF GN T + LG L+Y NVSVG P SF+VALDTGSDLFWLPC+C
Sbjct: 75 LASN-NDETPITFDGGNLTVSVKLLGSLYYANVSVGTPPSSFLVALDTGSDLFWLPCNCG 133
Query: 136 VSCVHGLNS-SSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLS 194
+C+ L Q + N+Y+PN S+TSS + C+ C K+C S S CPYQ+ Y S
Sbjct: 134 TTCIRDLEDIGVPQSVPLNLYTPNASTTSSSIRCSDKRCFGSKKCSSPSSICPYQISY-S 192
Query: 195 DGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTS 254
+ T + G L++DVLHLAT+++ V + ++ GCG+ QTG F + NG+ GLG+ S
Sbjct: 193 NSTGTKGTLLQDVLHLATEDENLTPVKANVTLGCGQKQTGLFQRNNSVNGVLGLGIKGYS 252
Query: 255 VPSILANQGLIPNSFSMCFGS--DGTGRISFGDKGSPGQGETPF-----SLRQTHPTYNI 307
VPS+LA + NSFSMCFG GRISFGD+G Q ETPF R P
Sbjct: 253 VPSLLAKANITANSFSMCFGRVIGNVGRISFGDRGYTDQEETPFISVAPRRRPVDPELPF 312
Query: 308 TIT-QVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISE 346
+S + F + G S LN+P +T ++
Sbjct: 313 EFCYDLSPNATTIQFPLVEMTFIGGSKIILNNPFFTARTQ 352
>gi|297819832|ref|XP_002877799.1| hypothetical protein ARALYDRAFT_906483 [Arabidopsis lyrata subsp.
lyrata]
gi|297323637|gb|EFH54058.1| hypothetical protein ARALYDRAFT_906483 [Arabidopsis lyrata subsp.
lyrata]
Length = 414
Score = 247 bits (630), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 146/366 (39%), Positives = 201/366 (54%), Gaps = 59/366 (16%)
Query: 10 VCVLLILLSCCAGC--CFGFGTFGFDFHHRYSDPVKGILAVDDL-PKKGSFAYYSALAHR 66
V VLL +L C G C G F F+ HH +SD VK L DL P+KGS Y+ LA R
Sbjct: 7 VFVLLSVLVACWGLQRCESAGKFSFEVHHMFSDTVKQNLGFGDLVPEKGSLEYFKLLAQR 66
Query: 67 DRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGS 126
DR +RGRGL++ N++ P+TF GN T ++ L GS
Sbjct: 67 DRL--IRGRGLSSN-NEEAPVTFILGNRTVSIDFL-----------------------GS 100
Query: 127 DLFWLPCDC-VSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSN 185
DLFWLPC+C +C+ L D + Q C S S
Sbjct: 101 DLFWLPCNCGTTCIRDLE-------DIGLS--------------------QGGCSSPASV 133
Query: 186 CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGL 245
CPYQ+ YL + T + G L EDVLHL T+++ + V + I+ GCG+ QTG + A NGL
Sbjct: 134 CPYQIPYLFNTTSTRGTLFEDVLHLVTEDEGLEPVKANITLGCGQNQTGLYRKSLAVNGL 193
Query: 246 FGLGMDKTSVPSILANQGLIPNSFSMCFGS--DGTGRISFGDKGSPGQGETPFSLRQTHP 303
GLGM SVPS+LA + + NSFSMCFG+ D GRISFGD+G Q +TP + +P
Sbjct: 194 LGLGMKDYSVPSVLAKENITANSFSMCFGNIIDFIGRISFGDRGHTDQLQTPLVPIEPNP 253
Query: 304 TYNITITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDL 363
TY + +T+V+VGG+ + + A+FD+GTSFT+L +PAY +++ F+ +KR ++
Sbjct: 254 TYAVNVTEVTVGGDILEIQMLALFDTGTSFTHLLEPAYGLLTKAFDDHVTDKRRPIDPEI 313
Query: 364 PFEYCY 369
PFE+CY
Sbjct: 314 PFEFCY 319
>gi|255576176|ref|XP_002528982.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223531572|gb|EEF33401.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 542
Score = 246 bits (628), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 160/374 (42%), Positives = 213/374 (56%), Gaps = 23/374 (6%)
Query: 29 TFGFDFHHRYSDPVKGILA--VDDL----PKKGSFAYYSALAHRDRYFRLRGRGLAAQGN 82
TF HR+SD VK + D L P+K S YY L + D F+ + L Q
Sbjct: 35 TFSSRLIHRFSDEVKALRVSRKDSLSYSWPEKKSMDYYQILVNSD--FQRQKMKLGPQYQ 92
Query: 83 DKTPLTFSAGNDTYRL-NSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHG 141
P S G+ T L + G+LHYT + +G P +SF+VALD GSDL W+PCDC+ C
Sbjct: 93 FLFP---SQGSKTMSLGDDFGWLHYTWIDIGTPHVSFLVALDAGSDLLWVPCDCLQCAP- 148
Query: 142 LNSS--SGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMS 199
L++S S D N YSP+ SSTS + C+ LCEL C S CPY + Y ++ T S
Sbjct: 149 LSASYYSSLDRDLNEYSPSHSSTSKHLSCSHQLCELGPNCNSPKQPCPYSMDYYTENTSS 208
Query: 200 TGFLVEDVLHLAT--DEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPS 257
+G LVED+LHLA+ D S SV + + GCG Q+G +LDG AP+GL GLG+ + SVPS
Sbjct: 209 SGLLVEDILHLASNGDNALSYSVRAPVVIGCGMKQSGGYLDGVAPDGLMGLGLAEISVPS 268
Query: 258 ILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPF-SLRQTHPTYNITITQVSVGG 316
LA GLI NSFSMCF D +GRI FGD+G Q TPF +L + TY + + VG
Sbjct: 269 FLAKAGLIRNSFSMCFDEDDSGRIFFGDQGPTTQQSTPFLTLDGNYTTYVVGVEGFCVGS 328
Query: 317 NAVN-FEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRSFL 375
+ + F A+ D+GTSFT+L + Y +I+E F+ +S + P++YCY S
Sbjct: 329 SCLKQTSFRALVDTGTSFTFLPNGVYERITEEFDRQVNATI-SSFNGYPWKYCYKSSSN- 386
Query: 376 HLQAL--VVLPFPL 387
HL + V L FPL
Sbjct: 387 HLTKVPSVKLIFPL 400
>gi|413924530|gb|AFW64462.1| hypothetical protein ZEAMMB73_591827, partial [Zea mays]
Length = 469
Score = 243 bits (620), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 143/338 (42%), Positives = 183/338 (54%), Gaps = 14/338 (4%)
Query: 36 HRYSDPVKGILA--VDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGN 93
HR SD + + V P++GS YY AL D + + R LA K TFS GN
Sbjct: 33 HRLSDEARLEVGPRVGWWPQRGSGEYYRALVRSD--IQRQKRRLAVLSLSKGGSTFSPGN 90
Query: 94 DTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFN 153
D LG+L+Y V VG PA SF+VALDTGSDLFW+PCDC+ C D
Sbjct: 91 D------LGWLYYAWVDVGTPATSFLVALDTGSDLFWVPCDCIQCAPLSGYRGNLDRDLR 144
Query: 154 IYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
IY P S+TS +PC+ LC+ C + CPY + Y S+ T S+G L+ED LHL
Sbjct: 145 IYRPAESTTSRHLPCSHELCQSVPGCTNPKQPCPYNIDYFSENTTSSGLLIEDTLHLNYR 204
Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
E V++ + GCG+ Q+G +LDG AP+GL GLGM SVPS LA GL+ NSFSMCF
Sbjct: 205 EDHVP-VNASVIIGCGQKQSGDYLDGIAPDGLLGLGMADISVPSFLARAGLVQNSFSMCF 263
Query: 274 GSDGTGRISFGDKGSPGQGETPF-SLRQTHPTYNITITQVSVGGNAVN-FEFSAIFDSGT 331
D +GRI FGD+G P Q TPF L TY + + + +G + F A+ DSGT
Sbjct: 264 KEDSSGRIFFGDQGVPSQQSTPFVPLYGKLQTYAVNVDKSCIGHKCLEGTSFKALVDSGT 323
Query: 332 SFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
SFT L Y + F+ R D ++YCY
Sbjct: 324 SFTSLPFDVYKAFTMEFDKQMNATR-VPYEDTTWKYCY 360
>gi|226495123|ref|NP_001141522.1| uncharacterized protein LOC100273634 precursor [Zea mays]
gi|194704920|gb|ACF86544.1| unknown [Zea mays]
gi|223949445|gb|ACN28806.1| unknown [Zea mays]
gi|413924531|gb|AFW64463.1| pepsin A [Zea mays]
Length = 515
Score = 243 bits (620), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 143/338 (42%), Positives = 183/338 (54%), Gaps = 14/338 (4%)
Query: 36 HRYSDPVKGILA--VDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGN 93
HR SD + + V P++GS YY AL D + + R LA K TFS GN
Sbjct: 33 HRLSDEARLEVGPRVGWWPQRGSGEYYRALVRSD--IQRQKRRLAVLSLSKGGSTFSPGN 90
Query: 94 DTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFN 153
D LG+L+Y V VG PA SF+VALDTGSDLFW+PCDC+ C D
Sbjct: 91 D------LGWLYYAWVDVGTPATSFLVALDTGSDLFWVPCDCIQCAPLSGYRGNLDRDLR 144
Query: 154 IYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
IY P S+TS +PC+ LC+ C + CPY + Y S+ T S+G L+ED LHL
Sbjct: 145 IYRPAESTTSRHLPCSHELCQSVPGCTNPKQPCPYNIDYFSENTTSSGLLIEDTLHLNYR 204
Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
E V++ + GCG+ Q+G +LDG AP+GL GLGM SVPS LA GL+ NSFSMCF
Sbjct: 205 EDHVP-VNASVIIGCGQKQSGDYLDGIAPDGLLGLGMADISVPSFLARAGLVQNSFSMCF 263
Query: 274 GSDGTGRISFGDKGSPGQGETPF-SLRQTHPTYNITITQVSVGGNAVN-FEFSAIFDSGT 331
D +GRI FGD+G P Q TPF L TY + + + +G + F A+ DSGT
Sbjct: 264 KEDSSGRIFFGDQGVPSQQSTPFVPLYGKLQTYAVNVDKSCIGHKCLEGTSFKALVDSGT 323
Query: 332 SFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
SFT L Y + F+ R D ++YCY
Sbjct: 324 SFTSLPFDVYKAFTMEFDKQMNATR-VPYEDTTWKYCY 360
>gi|195619700|gb|ACG31680.1| pepsin A [Zea mays]
Length = 485
Score = 243 bits (619), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 143/338 (42%), Positives = 183/338 (54%), Gaps = 14/338 (4%)
Query: 36 HRYSDPVKGILA--VDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGN 93
HR SD + + V P++GS YY AL D + + R LA K TFS GN
Sbjct: 3 HRLSDEARLEVGPRVGWWPQRGSGEYYRALVRSD--IQRQKRRLAVLSLSKGGSTFSPGN 60
Query: 94 DTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFN 153
D LG+L+Y V VG PA SF+VALDTGSDLFW+PCDC+ C D
Sbjct: 61 D------LGWLYYAWVDVGTPATSFLVALDTGSDLFWVPCDCIQCAPLSGYRGNLDRDLR 114
Query: 154 IYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
IY P S+TS +PC+ LC+ C + CPY + Y S+ T S+G L+ED LHL
Sbjct: 115 IYRPAESTTSRHLPCSHELCQSVPGCTNPKQPCPYNIDYFSENTTSSGLLIEDTLHLNYR 174
Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
E V++ + GCG+ Q+G +LDG AP+GL GLGM SVPS LA GL+ NSFSMCF
Sbjct: 175 EDHVP-VNASVIIGCGQKQSGDYLDGIAPDGLLGLGMADISVPSFLARAGLVQNSFSMCF 233
Query: 274 GSDGTGRISFGDKGSPGQGETPF-SLRQTHPTYNITITQVSVGGNAVN-FEFSAIFDSGT 331
D +GRI FGD+G P Q TPF L TY + + + +G + F A+ DSGT
Sbjct: 234 KEDSSGRIFFGDQGVPSQQSTPFVPLYGKLQTYAVNVDKSCIGHKCLEGTSFKALVDSGT 293
Query: 332 SFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
SFT L Y + F+ R D ++YCY
Sbjct: 294 SFTSLPLDVYKAFTMEFDKQMNATR-VPYEDTTWKYCY 330
>gi|224083757|ref|XP_002307112.1| predicted protein [Populus trichocarpa]
gi|222856561|gb|EEE94108.1| predicted protein [Populus trichocarpa]
Length = 492
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 148/372 (39%), Positives = 210/372 (56%), Gaps = 21/372 (5%)
Query: 12 VLLILLSCCAGCCFGFGTFGFDFHHRYSDPVK-------GILAVDDLPKKGSFAYYSALA 64
++L++ S TF HR+S K G + P+K S YY L
Sbjct: 2 LILVMSSFLVQNTVELATFSSRLIHRFSKEYKEVSVSRGGDVNGTWWPEKKSKEYYQILV 61
Query: 65 HRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRL-NSLGFLHYTNVSVGQPALSFIVALD 123
D L+ + L G L S G+ T L N G+LHYT + +G P +SF+VALD
Sbjct: 62 SSD----LKRQKLKL-GPHYQLLFPSQGSKTMSLGNDFGWLHYTWIDIGTPHVSFMVALD 116
Query: 124 TGSDLFWLPCDCVSCVHGLNSSSGQVIDFNI--YSPNTSSTSSKVPCNSTLCELQKQCPS 181
+GSDLFW+PCDCV C L++S +D ++ YSP+ SSTS ++ C+ LC++ C +
Sbjct: 117 SGSDLFWVPCDCVQCAP-LSASHYSSLDRDLSEYSPSQSSTSKQLSCSHRLCDMGPNCKN 175
Query: 182 AGSNCPYQVRYLSDGTMSTGFLVEDVLHLAT--DEKQSKSVDSRISFGCGRVQTGSFLDG 239
+CPY + Y ++ T S+G LVED++HLA+ D+ + SV + + GCG Q+G +LDG
Sbjct: 176 PKQSCPYSINYYTESTSSSGLLVEDIIHLASGGDDTLNTSVKAPVIIGCGMKQSGGYLDG 235
Query: 240 AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPF-SL 298
AP+GL GLG+ + SVPS LA GLI NSFSMCF D +GRI FGD+G Q PF L
Sbjct: 236 VAPDGLLGLGLQEISVPSFLAKAGLIQNSFSMCFNEDDSGRIFFGDQGPATQQSAPFLKL 295
Query: 299 RQTHPTYNITITQVSVGGNAV-NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRE 357
+ TY + + VG + + FSA+ DSGTSFT+L D + I+E F++ R
Sbjct: 296 NGNYTTYIVGVEVCCVGTSCLKQSSFSALVDSGTSFTFLPDDVFEMIAEEFDTQVNASR- 354
Query: 358 TSTSDLPFEYCY 369
+S ++YCY
Sbjct: 355 SSFEGYSWKYCY 366
>gi|219887985|gb|ACL54367.1| unknown [Zea mays]
Length = 515
Score = 241 bits (615), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 142/338 (42%), Positives = 182/338 (53%), Gaps = 14/338 (4%)
Query: 36 HRYSDPVKGILA--VDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGN 93
HR SD + + V P++GS YY AL D + + R LA K TFS GN
Sbjct: 33 HRLSDEARLEVGPRVGWWPQRGSGEYYRALVRSD--IQRQKRRLAVLSLSKGGSTFSPGN 90
Query: 94 DTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFN 153
D LG+L+Y V VG PA SF+VALDTGSDLFW+PCDC+ C D
Sbjct: 91 D------LGWLYYAWVDVGTPATSFLVALDTGSDLFWVPCDCIQCAPLSGYRGNLDRDLR 144
Query: 154 IYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
IY P S+TS +PC+ LC+ C + CPY + Y S+ T S+G L+ED LHL
Sbjct: 145 IYRPAESTTSRHLPCSHELCQSVPGCTNPKQPCPYNIDYFSENTTSSGLLIEDTLHLNYR 204
Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
E V++ + GCG+ Q+G +LDG AP+GL LGM SVPS LA GL+ NSFSMCF
Sbjct: 205 EDHVP-VNASVIIGCGQKQSGDYLDGIAPDGLLALGMADISVPSFLARAGLVQNSFSMCF 263
Query: 274 GSDGTGRISFGDKGSPGQGETPF-SLRQTHPTYNITITQVSVGGNAVN-FEFSAIFDSGT 331
D +GRI FGD+G P Q TPF L TY + + + +G + F A+ DSGT
Sbjct: 264 KEDSSGRIFFGDQGVPSQQSTPFVPLYGKLQTYAVNVDKSCIGHKCLEGTSFKALVDSGT 323
Query: 332 SFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
SFT L Y + F+ R D ++YCY
Sbjct: 324 SFTSLPFDVYKAFTMEFDKQMNATR-VPYEDTTWKYCY 360
>gi|449517142|ref|XP_004165605.1| PREDICTED: aspartic proteinase-like protein 1-like, partial
[Cucumis sativus]
Length = 430
Score = 240 bits (613), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 123/221 (55%), Positives = 156/221 (70%), Gaps = 4/221 (1%)
Query: 152 FNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLA 211
N YSPN S+TSS VPC S+LC +C S + CPY++RYLS T S G+LVEDVLHLA
Sbjct: 3 LNHYSPNDSTTSSTVPCTSSLC---NRCTSNQNVCPYEMRYLSANTSSIGYLVEDVLHLA 59
Query: 212 TDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSM 271
TD+ K V+++I+FGCG VQTG F AAPNGL GLGM+K SVPS LA+QGL NSFSM
Sbjct: 60 TDDSLLKPVEAKITFGCGTVQTGIFATTAAPNGLIGLGMEKISVPSFLADQGLTSNSFSM 119
Query: 272 CFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIFDSGT 331
CFG+DG GRI FGD G Q +TPF+ + +YN+T ++VGG + F+AIFDSGT
Sbjct: 120 CFGADGYGRIDFGDTGPADQKQTPFNTMLEYQSYNVTFNVINVGGEPNDVPFTAIFDSGT 179
Query: 332 SFTYLNDPAYTQISETFNSLAKEKRETS-TSDLPFEYCYVL 371
SFTYL +PAY+ I++ ++ K KR + + PFEYCY +
Sbjct: 180 SFTYLTEPAYSTITKQMDAGMKLKRYSLFGPNFPFEYCYEI 220
>gi|296082464|emb|CBI21469.3| unnamed protein product [Vitis vinifera]
Length = 530
Score = 239 bits (611), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 152/399 (38%), Positives = 208/399 (52%), Gaps = 24/399 (6%)
Query: 1 MASSYRNSPVCVLLILLSCCAGCCFGFGTFGFDFHHRYSDPVKGILAVDD-----LPKKG 55
MA+ + + V+L++ SC A F HR+SD VK A P+
Sbjct: 1 MAARFLVAMSVVVLLIESCMAA------MFSARLIHRFSDEVKAFRAARSGLSGSWPEWR 54
Query: 56 SFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRL-NSLGFLHYTNVSVGQP 114
+ YY L D R G+ L S G+ T N G+LHYT + +G P
Sbjct: 55 TMEYYKMLVRSDW-----ERQKVMLGSKYQFLFPSEGSKTMSFGNDYGWLHYTWIDIGTP 109
Query: 115 ALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQV-IDFNIYSPNTSSTSSKVPCNSTLC 173
+SF+VALD GSDL W+PCDC+ C S G + D N YSP+ SSTS + C+ LC
Sbjct: 110 NISFLVALDAGSDLLWIPCDCIQCAPLSASYYGSLDRDLNQYSPSGSSTSKHLSCSHQLC 169
Query: 174 ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLAT--DEKQSKSVDSRISFGCGRV 231
E C S CPY + Y S+ T S+G L+ED+LHL + D+ + SV + + GCG
Sbjct: 170 ESSPNCDSPKQLCPYTINYYSENTSSSGLLIEDILHLTSGIDDASNSSVRAPVIIGCGMR 229
Query: 232 QTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQ 291
QTG +LDG AP+GL GLG+ + SVPS L+ GL+ NSFS+CF D +GRI FGD+G Q
Sbjct: 230 QTGGYLDGVAPDGLMGLGLGEISVPSFLSKAGLVKNSFSLCFNDDDSGRIFFGDQGLATQ 289
Query: 292 GETPFSLRQ-THPTYNITITQVSVGGNAVN-FEFSAIFDSGTSFTYLNDPAYTQISETFN 349
T F + TY + + +G + + F A+ DSG SFT+L D +Y + + F+
Sbjct: 290 QTTLFLPSDGKYETYIVGVEACCIGSSCIKQTSFRALVDSGASFTFLPDESYRNVVDEFD 349
Query: 350 SLAKEKRETSTSDLPFEYCYVLRSFLHLQ-ALVVLPFPL 387
R S P+EYCY S L+ V+L F L
Sbjct: 350 KQVNATR-FSFEGYPWEYCYKSSSKELLKNPSVILKFAL 387
>gi|217426809|gb|ACK44517.1| AT5G10080-like protein [Arabidopsis arenosa]
Length = 506
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 141/354 (39%), Positives = 197/354 (55%), Gaps = 33/354 (9%)
Query: 36 HRYSDPVKGIL----AVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTP----L 87
HR+SD + + + + LP+K S YY LA D FR + L A+ P
Sbjct: 31 HRFSDEGRASIRTPSSSESLPEKQSLEYYRLLAKSD--FRRQRMNLGAKFQSLVPSEGSK 88
Query: 88 TFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNS--S 145
T S+GND G+LHYT + +G P++SF+VALDTGSDL W+PC+CV C ++ S
Sbjct: 89 TISSGND------FGWLHYTWIDIGTPSVSFLVALDTGSDLLWIPCNCVQCAPLTSTYYS 142
Query: 146 SGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVE 205
S D N Y+P++SSTS C+ LC+ C S CPY V YLS T S+G LVE
Sbjct: 143 SLATKDLNEYNPSSSSTSKVFLCSHKLCDSASDCESPKEQCPYTVNYLSGNTSSSGLLVE 202
Query: 206 DVLHLATDEKQ-----SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILA 260
D+LHL + S SV +R+ GCG+ Q+G +LDG AP+GL GLG + SVPS L+
Sbjct: 203 DILHLTYNTNNRLMNGSSSVKARVVIGCGKKQSGDYLDGVAPDGLMGLGPAEISVPSFLS 262
Query: 261 NQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVN 320
GL+ NSFS+CF + +GRI FGD G Q TPF + + Y + + +G + +
Sbjct: 263 KAGLMRNSFSLCFDEEDSGRIYFGDMGPSIQQSTPFLQLENNSGYIVGVEACCIGNSCLK 322
Query: 321 -FEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSD----LPFEYCY 369
F+ DSG SFTYL + Y ++ +L ++ +TS + +EYCY
Sbjct: 323 QTSFTTFIDSGQSFTYLPEEIYRKV-----ALEIDRHINATSKSFEGVSWEYCY 371
>gi|326532354|dbj|BAK05106.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 564
Score = 236 bits (603), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 146/340 (42%), Positives = 180/340 (52%), Gaps = 19/340 (5%)
Query: 36 HRYSDPVKGILAVDD----LPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSA 91
HR SD + LA P+ GS YY AL D + R L + FS
Sbjct: 80 HRLSDEAR--LAAGPHGARWPRHGSGGYYRALVRSDLQRQKRKHQLLSVSEAGG--IFSP 135
Query: 92 GNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVID 151
GND G+L+YT V VG P SF+VALDTGSDLFW+PCDC+ C D
Sbjct: 136 GND------FGWLYYTWVDVGTPNTSFMVALDTGSDLFWVPCDCIECAPLAGYRETLDRD 189
Query: 152 FNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLA 211
IY P S+TS +PC+ LC C S CPY YL + T S+G L+ED+LHL
Sbjct: 190 LGIYKPAESTTSRHLPCSHELCPPGSGCSSPKQPCPYSTDYLQENTTSSGLLIEDILHLD 249
Query: 212 TDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSM 271
+ E + V + + GCGR Q+GS+LDG AP+GL GLGM SVPS LA GL+ NSFSM
Sbjct: 250 SRESHAP-VKASVVIGCGRKQSGSYLDGIAPDGLLGLGMADISVPSFLARAGLVRNSFSM 308
Query: 272 CFGSDGTGRISFGDKGSPGQGETPF-SLRQTHPTYNITITQVSVGGNAVNF-EFSAIFDS 329
CF D +GRI FGD+G Q TPF L + TY + + + VG F A+ DS
Sbjct: 309 CFKED-SGRIFFGDQGVSIQQSTPFVPLYGKYQTYAVNVDKSCVGHKCFEATSFEALVDS 367
Query: 330 GTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
GTSFT L Y ++ F+ R T D FEYCY
Sbjct: 368 GTSFTALPLNVYKAVAVEFDKQVHAPRITQ-EDASFEYCY 406
>gi|359492825|ref|XP_002284255.2| PREDICTED: aspartic proteinase-like protein 1-like [Vitis vinifera]
Length = 531
Score = 236 bits (602), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 150/355 (42%), Positives = 197/355 (55%), Gaps = 21/355 (5%)
Query: 29 TFGFDFHHRYSDPVK-------GILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQG 81
TF HR+SD K G + D PKK SF YY L D L+ + L G
Sbjct: 24 TFSSKLIHRFSDEAKAFFVSRNGNIFADSWPKKRSFDYYRLLLSSD----LKRQKLKL-G 78
Query: 82 NDKTPLTFSAGNDTYRL-NSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVH 140
+ L S G+D L N G+LHYT + +G P +SF+VALD GSDL W+PCDC+ C
Sbjct: 79 AEYQLLFPSEGSDALFLGNEFGWLHYTWIDIGTPNVSFLVALDAGSDLLWVPCDCMQCA- 137
Query: 141 GLNSSSGQVI--DFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTM 198
L++S + D N YSP+ SSTS + CN LCEL C S+ CPY Y S+ T
Sbjct: 138 PLSASYYDRLGRDLNEYSPSLSSTSKPLSCNDQLCELGSDCKSSKDPCPYLASYYSENTS 197
Query: 199 STGFLVEDVLHLA--TDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVP 256
S+G L+ED LHLA ++ SV + + GCGR Q+G+F DGAAP+GL GLG SVP
Sbjct: 198 SSGLLIEDRLHLAPFSEHASRSSVWASVIIGCGRKQSGAFSDGAAPDGLMGLGPGDLSVP 257
Query: 257 SILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPF-SLRQTHPTYNITITQVSVG 315
S+LA GL+ N+FS+CF + +G I FGD+G Q T F L TY I + VG
Sbjct: 258 SLLAKAGLVRNTFSICFDDNHSGTILFGDQGLVTQKSTSFVPLEGKFVTYLIEVEGYLVG 317
Query: 316 GNAVNFE-FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
+++ F A+ DSGTSFT+L Y +I F+ R +S P++YCY
Sbjct: 318 SSSLKTAGFQALVDSGTSFTFLPYEIYEKIVVEFDKQVNATR-SSFKGSPWKYCY 371
>gi|357463449|ref|XP_003602006.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355491054|gb|AES72257.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 529
Score = 236 bits (601), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 144/352 (40%), Positives = 199/352 (56%), Gaps = 17/352 (4%)
Query: 29 TFGFDFHHRYSDPVKGI-LAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPL 87
TF HR+S+ +K + + D P + + Y+ L R+ + R + G + L
Sbjct: 26 TFSVKLFHRFSEEMKPVQVQTGDWPDRRTLHYHEKLL-RNDFLRHK----INLGGARHKL 80
Query: 88 TF-SAGNDTYRL-NSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSS 145
F S G+ T N G+LHYT + +G P+ SF+VALD GSDL W+PCDC+ C L++S
Sbjct: 81 LFPSQGSKTMSFGNDFGWLHYTWIDIGTPSTSFLVALDAGSDLLWVPCDCIHCA-PLSAS 139
Query: 146 --SGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCP-SAGSNCPYQVRYLSDGTMSTGF 202
S D N YSP+ S +S + C+ LC++ C S CPY + YLSD T S+G
Sbjct: 140 FYSNLDRDLNEYSPSRSLSSKHLSCSHRLCDMGSNCKTSKQQQCPYTINYLSDNTSSSGL 199
Query: 203 LVEDVLHLATDE--KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILA 260
LVED+ HL + + + SV + + GCG Q+G +LDG AP+GL GLG ++SVPS LA
Sbjct: 200 LVEDIFHLQSGDGSTSNSSVQAPVVVGCGMKQSGGYLDGTAPDGLIGLGPGESSVPSFLA 259
Query: 261 NQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQ-THPTYNITITQVSVGGNAV 319
GLI +SFS+CF D +GR+ FGD+GS Q TPF L TY + + +G +
Sbjct: 260 KSGLIRDSFSLCFNEDDSGRLFFGDQGSTVQQSTPFLLVDGMFSTYIVGVETCCIGNSCP 319
Query: 320 NF-EFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYV 370
F+A FDSGTSFT+L AY I+E F+ R T P+EYCYV
Sbjct: 320 KVTSFNAQFDSGTSFTFLPGHAYGAIAEEFDKQVNATRSTFQGS-PWEYCYV 370
>gi|302141912|emb|CBI19115.3| unnamed protein product [Vitis vinifera]
Length = 521
Score = 236 bits (601), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 150/355 (42%), Positives = 197/355 (55%), Gaps = 21/355 (5%)
Query: 29 TFGFDFHHRYSDPVK-------GILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQG 81
TF HR+SD K G + D PKK SF YY L D L+ + L G
Sbjct: 14 TFSSKLIHRFSDEAKAFFVSRNGNIFADSWPKKRSFDYYRLLLSSD----LKRQKLKL-G 68
Query: 82 NDKTPLTFSAGNDTYRL-NSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVH 140
+ L S G+D L N G+LHYT + +G P +SF+VALD GSDL W+PCDC+ C
Sbjct: 69 AEYQLLFPSEGSDALFLGNEFGWLHYTWIDIGTPNVSFLVALDAGSDLLWVPCDCMQCA- 127
Query: 141 GLNSSSGQVI--DFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTM 198
L++S + D N YSP+ SSTS + CN LCEL C S+ CPY Y S+ T
Sbjct: 128 PLSASYYDRLGRDLNEYSPSLSSTSKPLSCNDQLCELGSDCKSSKDPCPYLASYYSENTS 187
Query: 199 STGFLVEDVLHLA--TDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVP 256
S+G L+ED LHLA ++ SV + + GCGR Q+G+F DGAAP+GL GLG SVP
Sbjct: 188 SSGLLIEDRLHLAPFSEHASRSSVWASVIIGCGRKQSGAFSDGAAPDGLMGLGPGDLSVP 247
Query: 257 SILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPF-SLRQTHPTYNITITQVSVG 315
S+LA GL+ N+FS+CF + +G I FGD+G Q T F L TY I + VG
Sbjct: 248 SLLAKAGLVRNTFSICFDDNHSGTILFGDQGLVTQKSTSFVPLEGKFVTYLIEVEGYLVG 307
Query: 316 GNAVNFE-FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
+++ F A+ DSGTSFT+L Y +I F+ R +S P++YCY
Sbjct: 308 SSSLKTAGFQALVDSGTSFTFLPYEIYEKIVVEFDKQVNATR-SSFKGSPWKYCY 361
>gi|218191589|gb|EEC74016.1| hypothetical protein OsI_08957 [Oryza sativa Indica Group]
Length = 520
Score = 235 bits (600), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 137/315 (43%), Positives = 181/315 (57%), Gaps = 12/315 (3%)
Query: 59 YYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRL-NSLGFLHYTNVSVGQPALS 117
Y+ AL D + R G Q L+ S G + N LG+L+YT V VG P S
Sbjct: 60 YFRALVRSDLQRQKRRVGGKYQL-----LSLSQGGSIFPSGNDLGWLYYTWVDVGTPNTS 114
Query: 118 FIVALDTGSDLFWLPCDCVSCVHGLNSSSGQV-IDFNIYSPNTSSTSSKVPCNSTLCELQ 176
F+VALDTGSDLFW+PCDC+ C L+S G + D IY P+ S+TS +PC+ LC
Sbjct: 115 FLVALDTGSDLFWVPCDCIQCAP-LSSYHGSLDRDLGIYKPSESTTSRHLPCSHELCSPA 173
Query: 177 KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSF 236
C + CPY + Y S+ T S+G L+ED+LHL + E + V++ + GCG+ Q+GS+
Sbjct: 174 SGCTNPKQPCPYNIDYFSENTTSSGLLIEDMLHLDSREGHAP-VNASVIIGCGKKQSGSY 232
Query: 237 LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPF 296
L+G AP+GL GLGM SVPS LA GL+ NSFSMCF D +GRI FGD+G P Q TPF
Sbjct: 233 LEGIAPDGLLGLGMADISVPSFLARAGLVRNSFSMCFKKDDSGRIFFGDQGVPTQQSTPF 292
Query: 297 -SLRQTHPTYNITITQVSVGGNAVNFE-FSAIFDSGTSFTYLNDPAYTQISETFNSLAKE 354
+ TY + + + +G F A+ D+GTSFT L AY I+ F+
Sbjct: 293 VPMNGKLQTYAVNVDKYCIGHKCTEGAGFQALVDTGTSFTSLPLDAYKSITMEFDKQINA 352
Query: 355 KRETSTSDLPFEYCY 369
R S+ D FEYCY
Sbjct: 353 SR-ASSDDYSFEYCY 366
>gi|115448709|ref|NP_001048134.1| Os02g0751100 [Oryza sativa Japonica Group]
gi|46390211|dbj|BAD15642.1| aspartyl protease-like [Oryza sativa Japonica Group]
gi|113537665|dbj|BAF10048.1| Os02g0751100 [Oryza sativa Japonica Group]
gi|222623681|gb|EEE57813.1| hypothetical protein OsJ_08401 [Oryza sativa Japonica Group]
Length = 520
Score = 235 bits (599), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 137/315 (43%), Positives = 181/315 (57%), Gaps = 12/315 (3%)
Query: 59 YYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRL-NSLGFLHYTNVSVGQPALS 117
Y+ AL D + R G Q L+ S G + N LG+L+YT V VG P S
Sbjct: 60 YFRALVRSDLQRQKRRVGGKYQL-----LSLSQGGSIFPSGNDLGWLYYTWVDVGTPNTS 114
Query: 118 FIVALDTGSDLFWLPCDCVSCVHGLNSSSGQV-IDFNIYSPNTSSTSSKVPCNSTLCELQ 176
F+VALDTGSDLFW+PCDC+ C L+S G + D IY P+ S+TS +PC+ LC
Sbjct: 115 FLVALDTGSDLFWVPCDCIQCAP-LSSYHGSLDRDLGIYKPSESTTSRHLPCSHELCSPA 173
Query: 177 KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSF 236
C + CPY + Y S+ T S+G L+ED+LHL + E + V++ + GCG+ Q+GS+
Sbjct: 174 SGCTNPKQPCPYNIDYFSENTTSSGLLIEDMLHLDSREGHAP-VNASVIIGCGKKQSGSY 232
Query: 237 LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPF 296
L+G AP+GL GLGM SVPS LA GL+ NSFSMCF D +GRI FGD+G P Q TPF
Sbjct: 233 LEGIAPDGLLGLGMADISVPSFLARAGLVRNSFSMCFKKDDSGRIFFGDQGVPTQQSTPF 292
Query: 297 -SLRQTHPTYNITITQVSVGGNAVNFE-FSAIFDSGTSFTYLNDPAYTQISETFNSLAKE 354
+ TY + + + +G F A+ D+GTSFT L AY I+ F+
Sbjct: 293 VPMNGKLQTYAVNVDKYCIGHKCTEGAGFQALVDTGTSFTSLPLDAYKSITMEFDKQINA 352
Query: 355 KRETSTSDLPFEYCY 369
R S+ D FEYCY
Sbjct: 353 SR-ASSDDYSFEYCY 366
>gi|212722898|ref|NP_001132197.1| pepsin A precursor [Zea mays]
gi|194693730|gb|ACF80949.1| unknown [Zea mays]
gi|195605492|gb|ACG24576.1| pepsin A [Zea mays]
gi|413938914|gb|AFW73465.1| pepsin A [Zea mays]
Length = 519
Score = 234 bits (597), Expect = 6e-59, Method: Compositional matrix adjust.
Identities = 147/355 (41%), Positives = 190/355 (53%), Gaps = 30/355 (8%)
Query: 29 TFGFDFHHRYSD-------PVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQG 81
TF HR SD P G+ P++GS YY AL D + + R LA +
Sbjct: 26 TFSSRMVHRLSDEARLEAGPRMGLW-----PQRGSGGYYRALLRSD--LQRQKRRLAGKN 78
Query: 82 N----DKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVS 137
K TFS GND LG+L+Y V VG P SF+VALDTGSDLFW+PCDC+
Sbjct: 79 QLLSLSKGGSTFSPGND------LGWLYYAWVDVGTPTTSFLVALDTGSDLFWVPCDCIQ 132
Query: 138 CVHGLNSSSGQV-IDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDG 196
C L+S G + D IY P S+TS +PC+ LC+ C + C Y + Y S+
Sbjct: 133 CAP-LSSYRGNLDRDLGIYKPAESTTSRHLPCSHELCQPGSGCTNPKQPCTYNIDYFSEN 191
Query: 197 TMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVP 256
T S+G L+ED LHL + E + V++ + GCGR Q+G +LDG AP+GL GLGM SVP
Sbjct: 192 TTSSGLLIEDSLHLNSREGHAP-VNASVIIGCGRKQSGDYLDGIAPDGLLGLGMADISVP 250
Query: 257 SILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPF-SLRQTHPTYNITITQVSVG 315
S LA GL+ NSFSMCF D +GRI FGD+G Q TPF L TY + + + +G
Sbjct: 251 SFLARAGLVRNSFSMCFKEDSSGRIFFGDQGVSSQQSTPFVPLYGKLQTYAVNVDKSCIG 310
Query: 316 GNAVN-FEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
+ F A+ DSGTSFT L Y + F+ R D ++YCY
Sbjct: 311 HKCLEGSSFQALVDSGTSFTSLPPDVYKAFTTEFDKQINASR-VPYEDSTWKYCY 364
>gi|225438629|ref|XP_002281243.1| PREDICTED: aspartic proteinase-like protein 1-like [Vitis vinifera]
Length = 511
Score = 234 bits (596), Expect = 6e-59, Method: Compositional matrix adjust.
Identities = 144/364 (39%), Positives = 194/364 (53%), Gaps = 18/364 (4%)
Query: 36 HRYSDPVKGILAVDD-----LPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFS 90
HR+SD VK A P+ + YY L D R G+ L S
Sbjct: 11 HRFSDEVKAFRAARSGLSGSWPEWRTMEYYKMLVRSDWE-----RQKVMLGSKYQFLFPS 65
Query: 91 AGNDTYRL-NSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQV 149
G+ T N G+LHYT + +G P +SF+VALD GSDL W+PCDC+ C S G +
Sbjct: 66 EGSKTMSFGNDYGWLHYTWIDIGTPNISFLVALDAGSDLLWIPCDCIQCAPLSASYYGSL 125
Query: 150 -IDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVL 208
D N YSP+ SSTS + C+ LCE C S CPY + Y S+ T S+G L+ED+L
Sbjct: 126 DRDLNQYSPSGSSTSKHLSCSHQLCESSPNCDSPKQLCPYTINYYSENTSSSGLLIEDIL 185
Query: 209 HLAT--DEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIP 266
HL + D+ + SV + + GCG QTG +LDG AP+GL GLG+ + SVPS L+ GL+
Sbjct: 186 HLTSGIDDASNSSVRAPVIIGCGMRQTGGYLDGVAPDGLMGLGLGEISVPSFLSKAGLVK 245
Query: 267 NSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQ-THPTYNITITQVSVGGNAV-NFEFS 324
NSFS+CF D +GRI FGD+G Q T F + TY + + +G + + F
Sbjct: 246 NSFSLCFNDDDSGRIFFGDQGLATQQTTLFLPSDGKYETYIVGVEACCIGSSCIKQTSFR 305
Query: 325 AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRSFLHLQ-ALVVL 383
A+ DSG SFT+L D +Y + + F+ R S P+EYCY S L+ V+L
Sbjct: 306 ALVDSGASFTFLPDESYRNVVDEFDKQVNATR-FSFEGYPWEYCYKSSSKELLKNPSVIL 364
Query: 384 PFPL 387
F L
Sbjct: 365 KFAL 368
>gi|356567798|ref|XP_003552102.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 520
Score = 234 bits (596), Expect = 7e-59, Method: Compositional matrix adjust.
Identities = 147/349 (42%), Positives = 192/349 (55%), Gaps = 13/349 (3%)
Query: 29 TFGFDFHHRYSDPVKGILA-VDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPL 87
TF HR++D +K + P + S YY L D + R + G L
Sbjct: 22 TFSARLVHRFADEMKPVRPPTGYWPDRWSMGYYRMLLTGD----ILRRKIKVGGARYQLL 77
Query: 88 TFSAGNDTYRL-NSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSS- 145
S G+ T L N G+LHYT + +G P+ SF+VALD GSDL W+PCDCV C L+SS
Sbjct: 78 FPSHGSKTMSLGNDFGWLHYTWIDIGTPSTSFLVALDAGSDLLWIPCDCVQCAP-LSSSY 136
Query: 146 -SGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLV 204
S D N YSP+ S +S + C+ LC+ C S+ CPY V YLS+ T S+G LV
Sbjct: 137 YSNLDRDLNEYSPSRSLSSKHLSCSHQLCDKGSNCKSSQQQCPYMVSYLSENTSSSGLLV 196
Query: 205 EDVLHLATDEKQSKS-VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQG 263
ED+LHL + S S V + + GCG Q+G +LDG AP+GL GLG ++SVPS LA G
Sbjct: 197 EDILHLQSGGSLSNSSVQAPVVLGCGMKQSGGYLDGVAPDGLLGLGPGESSVPSFLAKSG 256
Query: 264 LIPNSFSMCFGSDGTGRISFGDKGSPGQGETPF-SLRQTHPTYNITITQVSVGGNAVNF- 321
LI +SFS+CF D +GRI FGD+G Q T F L + TY I + VG + +
Sbjct: 257 LIHDSFSLCFNEDDSGRIFFGDQGPTIQQSTSFLPLDGLYSTYIIGVESCCVGNSCLKMT 316
Query: 322 EFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYV 370
F DSGTSFT+L Y I+E F+ R +S P+EYCYV
Sbjct: 317 SFKVQVDSGTSFTFLPGHVYGAIAEEFDQQVNGSR-SSFEGSPWEYCYV 364
>gi|15238055|ref|NP_196570.1| aspartic proteinase-like protein 1 [Arabidopsis thaliana]
gi|75180764|sp|Q9LX20.1|ASPL1_ARATH RecName: Full=Aspartic proteinase-like protein 1; Flags: Precursor
gi|7960727|emb|CAB92049.1| putative protein [Arabidopsis thaliana]
gi|332004108|gb|AED91491.1| aspartic proteinase-like protein 1 [Arabidopsis thaliana]
Length = 528
Score = 233 bits (593), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 142/356 (39%), Positives = 195/356 (54%), Gaps = 35/356 (9%)
Query: 36 HRYSDP----VKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTP----L 87
HR+SD +K + D LP K S YY LA D FR + L A+ P
Sbjct: 31 HRFSDEGRASIKTPSSSDSLPNKQSLEYYRLLAESD--FRRQRMNLGAKVQSLVPSEGSK 88
Query: 88 TFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNS--S 145
T S+GND G+LHYT + +G P++SF+VALDTGS+L W+PC+CV C ++ S
Sbjct: 89 TISSGND------FGWLHYTWIDIGTPSVSFLVALDTGSNLLWIPCNCVQCAPLTSTYYS 142
Query: 146 SGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVE 205
S D N Y+P++SSTS C+ LC+ C S CPY V YLS T S+G LVE
Sbjct: 143 SLATKDLNEYNPSSSSTSKVFLCSHKLCDSASDCESPKEQCPYTVNYLSGNTSSSGLLVE 202
Query: 206 DVLHLATDEKQ-----SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILA 260
D+LHL + S SV +R+ GCG+ Q+G +LDG AP+GL GLG + SVPS L+
Sbjct: 203 DILHLTYNTNNRLMNGSSSVKARVVIGCGKKQSGDYLDGVAPDGLMGLGPAEISVPSFLS 262
Query: 261 NQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSL--RQTHPTYNITITQVSVGGNA 318
GL+ NSFS+CF + +GRI FGD G Q TPF + Y + + +G +
Sbjct: 263 KAGLMRNSFSLCFDEEDSGRIYFGDMGPSIQQSTPFLQLDNNKYSGYIVGVEACCIGNSC 322
Query: 319 VN-FEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSD----LPFEYCY 369
+ F+ DSG SFTYL + Y ++ +L ++ +TS + +EYCY
Sbjct: 323 LKQTSFTTFIDSGQSFTYLPEEIYRKV-----ALEIDRHINATSKNFEGVSWEYCY 373
>gi|356538031|ref|XP_003537508.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 521
Score = 233 bits (593), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 146/349 (41%), Positives = 192/349 (55%), Gaps = 13/349 (3%)
Query: 29 TFGFDFHHRYSDPVKGILA-VDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPL 87
TF HR++D +K + P + S YY L D + R + G L
Sbjct: 23 TFSARLVHRFADEMKPVRPPTGYWPDQRSMRYYQMLLTGD----ILRRKIKVGGTRYQLL 78
Query: 88 TFSAGNDTYRL-NSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSS- 145
S G+ T L N G+LHYT + +G P+ SF+VALD GSDL W+PCDCV C L+SS
Sbjct: 79 FPSHGSKTMSLGNDFGWLHYTWIDIGTPSTSFLVALDAGSDLLWIPCDCVQCAP-LSSSY 137
Query: 146 -SGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLV 204
S D N YSP+ S +S + C+ LC+ C S+ CPY V YLS+ T S+G LV
Sbjct: 138 YSNLDRDLNEYSPSRSLSSKHLSCSHRLCDKGSNCKSSQQQCPYMVSYLSENTSSSGLLV 197
Query: 205 EDVLHLATDEKQSKS-VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQG 263
ED+LHL + S S V + + GCG Q+G +LDG AP+GL GLG ++SVPS LA G
Sbjct: 198 EDILHLQSGGTLSNSSVQAPVVLGCGMKQSGGYLDGVAPDGLLGLGPGESSVPSFLAKSG 257
Query: 264 LIPNSFSMCFGSDGTGRISFGDKGSPGQGETPF-SLRQTHPTYNITITQVSVGGNAVNF- 321
LI SFS+CF D +GR+ FGD+G Q T F L + TY I + +G + +
Sbjct: 258 LIHYSFSLCFNEDDSGRMFFGDQGPTSQQSTSFLPLDGLYSTYIIGVESCCIGNSCLKMT 317
Query: 322 EFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYV 370
F A DSGTSFT+L Y I+E F+ R +S P+EYCYV
Sbjct: 318 SFKAQVDSGTSFTFLPGHVYGAITEEFDQQVNGSR-SSFEGSPWEYCYV 365
>gi|72384474|gb|AAZ67590.1| 80A08_5 [Brassica rapa subsp. pekinensis]
Length = 632
Score = 233 bits (593), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 143/364 (39%), Positives = 193/364 (53%), Gaps = 13/364 (3%)
Query: 12 VLLILLSCCAGCCFGFGTFGFDFHHRYSDPVKG-ILAVDDLPKKGSFAYYSALAHRDRYF 70
+LL +LS + F HR+SD + I + P+K SF YY L D
Sbjct: 8 ILLFILSLVSEKSLA-SLFSSRLIHRFSDEGRASIKSPGSFPEKRSFEYYRLLTSIDS-- 64
Query: 71 RLRGRGLAAQGNDKTPLTFSAGNDTYRL-NSLGFLHYTNVSVGQPALSFIVALDTGSDLF 129
R + L A+ P S G+ T N G+LHYT + +G P++SF+VALD+GSDL
Sbjct: 65 RRQKMNLGAKFQSLVP---SEGSKTISPGNYFGWLHYTWIDIGTPSVSFLVALDSGSDLL 121
Query: 130 WLPCDCVSC--VHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCP 187
W+PC+CV C + SS D N + P+ S+TS PC+ LCE C S CP
Sbjct: 122 WIPCNCVQCAPLSSAYYSSLATKDLNEFDPSASTTSKVFPCSHKLCESAPACESPKEQCP 181
Query: 188 YQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFG 247
Y V Y S+ T S+G LVEDVLHLA S SV +R+ GCG Q+G FL G AP+G+ G
Sbjct: 182 YTVTYASENTSSSGLLVEDVLHLAYSANASSSVKARVVVGCGEKQSGEFLKGIAPDGVMG 241
Query: 248 LGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPF-SLRQTHPTYN 306
LG + SVPS LA GL+ NSFSMCF + +GRI FGD G Q T F + Y
Sbjct: 242 LGPGEISVPSFLAKAGLMRNSFSMCFDEEDSGRIYFGDVGPSTQQSTRFLPYKNEFVAYF 301
Query: 307 ITITQVSVGGNAVN-FEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPF 365
+ + VG + + F+ + DSG SFT+L + Y +++ +S + P+
Sbjct: 302 VGVEVCCVGNSCLKQSSFTTLIDSGQSFTFLPEEIYREVALEIDSHINATVKKIEGG-PW 360
Query: 366 EYCY 369
EYCY
Sbjct: 361 EYCY 364
>gi|357143901|ref|XP_003573095.1| PREDICTED: aspartic proteinase-like protein 1-like [Brachypodium
distachyon]
Length = 627
Score = 232 bits (592), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 145/327 (44%), Positives = 185/327 (56%), Gaps = 22/327 (6%)
Query: 52 PKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSA-GNDTYRLNSLGFLHYTNVS 110
P++GS YY +L D + R G G L+FS G N G+L+YT V
Sbjct: 158 PRRGSGDYYRSLVRSDLQRQKRRLG----GGKHQLLSFSKDGGIIPTGNDFGWLYYTWVD 213
Query: 111 VGQPALSFIVALDTGSDLFWLPCDCVSC--VHGLNSSSGQVIDFNIYSPNTSSTSSKVPC 168
VG P SF+VALDTGSDLFW+PCDC+ C + G + S + D IY P S+TS +PC
Sbjct: 214 VGTPNTSFMVALDTGSDLFWIPCDCIECAPLSGYHGSLDR--DLGIYKPAESTTSRHLPC 271
Query: 169 NSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGC 228
+ LC L C + CPY +YL + T S+G LVED+LHL + E + V + + GC
Sbjct: 272 SHELCLLGSDCTNQKQPCPYNTKYLQENTTSSGLLVEDILHLDSRESHAP-VKASVIIGC 330
Query: 229 GRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGS 288
GR Q+GS+LDG AP+GL GLGM SVPS LA GL+ NSFSMCF D +GRI FGD+G
Sbjct: 331 GRKQSGSYLDGIAPDGLLGLGMADISVPSFLARAGLVRNSFSMCFTKD-SGRIFFGDQGV 389
Query: 289 PGQGETPF-SLRQTHPTYNITITQVSVGGNAVNFE---FSAIFDSGTSFTYLNDPAYTQI 344
Q TPF L TY + + + VG FE F AI DSGTSFT L Y +
Sbjct: 390 STQQSTPFVPLYGKLQTYTVNVDKSCVGHKC--FESTSFQAIVDSGTSFTALPLDIYKAV 447
Query: 345 SETFNSLAKEKR--ETSTSDLPFEYCY 369
+ F+ R + +TS F+YCY
Sbjct: 448 AIEFDKQVNASRLPQEATS---FDYCY 471
>gi|255545620|ref|XP_002513870.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223546956|gb|EEF48453.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 535
Score = 232 bits (592), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 144/357 (40%), Positives = 191/357 (53%), Gaps = 23/357 (6%)
Query: 29 TFGFDFHHRYSDPVKGIL-------AVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQG 81
TF HR+S+ K +L + P K SF Y L D + + L AQ
Sbjct: 23 TFSSKLIHRFSEEAKSLLISGNDNVSSQTWPNKNSFQYLQLLLDND--LKRQKMKLGAQN 80
Query: 82 NDKTPLTFSAGNDTYRL-NSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVH 140
P S G+ T+ N L +LHYT + +G P +SF+VALD GSDL W+PCDC+ C
Sbjct: 81 QLLFP---SLGSHTFFYGNDLDWLHYTWIDIGTPNVSFLVALDAGSDLSWVPCDCIQCA- 136
Query: 141 GLNSSSGQVIDFNI--YSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTM 198
L++S + +D ++ Y P+ S+TS + CN LCEL C + CPY Y T
Sbjct: 137 PLSASLYKPLDRDLSEYRPSLSTTSRHLSCNHQLCELGSHCKNLKDPCPYIADYADPNTS 196
Query: 199 STGFLVEDVLHLATDEKQSKSVDSRIS----FGCGRVQTGSFLDGAAPNGLFGLGMDKTS 254
S+GFLVED+LHLA+ S S R+ GCGR QTG +LDGAAP+G+ GLG S
Sbjct: 197 SSGFLVEDILHLASVSDDSNSTQKRVQASVILGCGRKQTGGYLDGAAPDGVMGLGPGSIS 256
Query: 255 VPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQ-THPTYNITITQVS 313
VPS+LA GLI SFS+CF +G+G I FGD+G Q TP Q + Y I +
Sbjct: 257 VPSLLAKAGLIRKSFSLCFDVNGSGTILFGDQGHTSQKSTPLLPTQGNYDAYLIEVESYC 316
Query: 314 VGGNAVNFE-FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
VG + + F A+ DSG SFTYL Y +I F+ +R +S P+ YCY
Sbjct: 317 VGNSCLKQSGFKALVDSGASFTYLPIDVYNKIVLEFDKQVNAQRISSQGG-PWNYCY 372
>gi|224063191|ref|XP_002301033.1| predicted protein [Populus trichocarpa]
gi|222842759|gb|EEE80306.1| predicted protein [Populus trichocarpa]
Length = 536
Score = 232 bits (591), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 151/376 (40%), Positives = 199/376 (52%), Gaps = 25/376 (6%)
Query: 29 TFGFDFHHRYSDPVKGI-------LAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQG 81
TF HR+SD K I + D PK+ SF Y+ L D R G
Sbjct: 27 TFSSKLIHRFSDEAKSISISRKGNASGDLWPKRYSFEYFQLLLGNDL-----KRQRMKLG 81
Query: 82 NDKTPLTF-SAGNDTYRL-NSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCV 139
+ K L F S G+ N L +LHYT + +G P +SF+VALD GSDL W+PCDC+ C
Sbjct: 82 SQKNQLLFPSQGSQALFFGNELDWLHYTWIDIGTPNVSFLVALDAGSDLLWVPCDCIQCA 141
Query: 140 HGLNSSSGQV---IDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLS-D 195
L++S + D + YSP+ SSTS + C+ LCE C + CPY Y +
Sbjct: 142 -PLSASYYNISLDRDLSEYSPSLSSTSRHLSCDHQLCEWGSNCKNPKDPCPYIFNYDDFE 200
Query: 196 GTMSTGFLVEDVLHLAT--DEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKT 253
T S GFLVED LHLA+ D K + + + GCGR Q GSF DGAAP+G+ GLG
Sbjct: 201 NTTSAGFLVEDKLHLASVGDHTARKMLQASVVLGCGRKQGGSFFDGAAPDGVMGLGPGDI 260
Query: 254 SVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPF-SLRQTHPTYNITITQV 312
SVPS+LA GLI N FS+CF + +GRI FGD+G Q TPF ++ T+ Y + +
Sbjct: 261 SVPSLLAKAGLIQNCFSLCFDENDSGRILFGDRGHASQQSTPFLPIQGTYVAYFVGVESY 320
Query: 313 SVGGNAVNFE-FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVL 371
VG + + F A+ DSG+SFTYL Y ++ F+ KR S D ++YCY
Sbjct: 321 CVGNSCLKRSGFKALVDSGSSFTYLPSEVYNELVSEFDKQVNAKR-ISFQDGLWDYCYNA 379
Query: 372 RS-FLHLQALVVLPFP 386
S LH + L FP
Sbjct: 380 SSQELHDIPAIQLKFP 395
>gi|449533544|ref|XP_004173734.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
1-like, partial [Cucumis sativus]
Length = 408
Score = 228 bits (581), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 142/374 (37%), Positives = 206/374 (55%), Gaps = 20/374 (5%)
Query: 29 TFGFDFHHRYSDPVKGILAVDD--------LPKKGSFAYYSALAHRDRYFRLRGRGLAAQ 80
TF HR+S+ +K + A P+KGS YY L D FR + L ++
Sbjct: 23 TFTSRILHRFSEEMKALRASGSTNTSVRVSWPEKGSMEYYQELVSGD--FRRQKMKLGSR 80
Query: 81 GNDKTPLTFSAGNDTYRL-NSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCV 139
P S G+ T L N G+LHYT + +G P++SF+VALD GSDL W+PC+C+ C
Sbjct: 81 FQLLFP---SEGSXTIALGNDFGWLHYTWIDIGTPSVSFLVALDAGSDLLWVPCNCIQCA 137
Query: 140 HGLNSSSGQV-IDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTM 198
S G + D N Y P++SSTS + C+ LC+ + C S +CPY + Y+++ T
Sbjct: 138 PLSASYYGSLDKDLNEYRPSSSSTSKHISCSHNLCDSGQSCQSPKQSCPYVIDYITENTS 197
Query: 199 STGFLVEDVLHLATDEKQSK--SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVP 256
S+G L++DVLHL++ + S ++ + + GCG Q+G +L G AP+GLFGLG+ + SV
Sbjct: 198 SSGLLIQDVLHLSSGCENSSNCTIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVL 257
Query: 257 SILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPF-SLRQTHPTYNITITQVSVG 315
S LA + L+ NSFS+CF DG+GRI FGD+G Q T F L + TY + + +
Sbjct: 258 SSLAKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQTTSFVPLDGKYETYIVGVEACCIE 317
Query: 316 GNAV-NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRS- 373
+ + F A+ DSGTSFTYL + AY I F+ S P++YCY + +
Sbjct: 318 NSCLKQTSFKALIDSGTSFTYLPEEAYENIVIEFDKRLNTTSAVSFKGYPWKYCYKISAD 377
Query: 374 FLHLQALVVLPFPL 387
+ V L FPL
Sbjct: 378 AMPKVPSVTLLFPL 391
>gi|449451627|ref|XP_004143563.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 532
Score = 228 bits (580), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 142/374 (37%), Positives = 206/374 (55%), Gaps = 20/374 (5%)
Query: 29 TFGFDFHHRYSDPVKGILAVDD--------LPKKGSFAYYSALAHRDRYFRLRGRGLAAQ 80
TF HR+S+ +K + A P+KGS YY L D FR + L ++
Sbjct: 23 TFTSRILHRFSEEMKALRASGSTNTSVRVSWPEKGSMEYYQELVSGD--FRRQKMKLGSR 80
Query: 81 GNDKTPLTFSAGNDTYRL-NSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCV 139
P S G+ T L N G+LHYT + +G P++SF+VALD GSDL W+PC+C+ C
Sbjct: 81 FQLLFP---SEGSKTIALGNDFGWLHYTWIDIGTPSVSFLVALDAGSDLLWVPCNCIQCA 137
Query: 140 HGLNSSSGQV-IDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTM 198
S G + D N Y P++SSTS + C+ LC+ + C S +CPY + Y+++ T
Sbjct: 138 PLSASYYGSLDKDLNEYRPSSSSTSKHISCSHNLCDSGQSCQSPKQSCPYVIDYITENTS 197
Query: 199 STGFLVEDVLHLATDEKQSK--SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVP 256
S+G L++DVLHL++ + S ++ + + GCG Q+G +L G AP+GLFGLG+ + SV
Sbjct: 198 SSGLLIQDVLHLSSGCENSSNCTIQAPVILGCGMKQSGGYLSGVAPDGLFGLGLGEISVL 257
Query: 257 SILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPF-SLRQTHPTYNITITQVSVG 315
S LA + L+ NSFS+CF DG+GRI FGD+G Q T F L + TY + + +
Sbjct: 258 SSLAKEELVQNSFSLCFNEDGSGRIFFGDEGPASQQTTSFVPLDGKYETYIVGVEACCIE 317
Query: 316 GNAV-NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRS- 373
+ + F A+ DSGTSFTYL + AY I F+ S P++YCY + +
Sbjct: 318 NSCLKQTSFKALIDSGTSFTYLPEEAYENIVIEFDKRLNTTSAVSFKGYPWKYCYKISAD 377
Query: 374 FLHLQALVVLPFPL 387
+ V L FPL
Sbjct: 378 AMPKVPSVTLLFPL 391
>gi|297807039|ref|XP_002871403.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297317240|gb|EFH47662.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 529
Score = 227 bits (578), Expect = 9e-57, Method: Compositional matrix adjust.
Identities = 140/354 (39%), Positives = 196/354 (55%), Gaps = 33/354 (9%)
Query: 36 HRYSDP----VKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTP----L 87
HR+SD +K + + LP+K S AYY LA D FR + L A+ P
Sbjct: 31 HRFSDEGRASIKTPSSSESLPEKQSLAYYRLLAKSD--FRRQRMNLGAKFQSLVPSEGSK 88
Query: 88 TFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNS--S 145
T S+GND G+LHYT + +G P++SF+VALDTGSDL W+PC+CV C ++ S
Sbjct: 89 TISSGND------FGWLHYTWIDIGTPSVSFLVALDTGSDLLWIPCNCVQCAPLTSTYYS 142
Query: 146 SGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVE 205
S D N Y+P++SS+S C+ LC C S C Y V+YLS T S+G LVE
Sbjct: 143 SLATKDLNEYNPSSSSSSKVFLCSHKLCGSASDCDSPKEQCTYTVKYLSGNTSSSGLLVE 202
Query: 206 DVLHLATDEKQ-----SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILA 260
D+LHL + S SV +R+ GCG+ Q+G +LDG AP+GL GLG + SVPS L+
Sbjct: 203 DILHLTYNTNNRLMNGSSSVKARVVVGCGKKQSGDYLDGVAPDGLMGLGPAEISVPSFLS 262
Query: 261 NQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVN 320
GL+ NSFS+CF + +GRI FGD G Q PF + + Y + + +G + +
Sbjct: 263 KAGLMRNSFSLCFDEEDSGRIYFGDMGPSIQQSAPFLQLENNSGYIVGVEACCIGNSCLK 322
Query: 321 -FEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSD----LPFEYCY 369
F+ DSG SFTYL + Y ++ +L ++ +TS + +EYCY
Sbjct: 323 QTSFTTFIDSGQSFTYLPEEIYRKV-----ALEIDRHINATSKSFEGVSWEYCY 371
>gi|356548395|ref|XP_003542587.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 525
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 142/372 (38%), Positives = 198/372 (53%), Gaps = 30/372 (8%)
Query: 28 GTFGFDFHHRYSDPVKGILA---------VDDLPKKGSFAYYSALAHRDRYFRLRGRGLA 78
TF HR+S+ K LA + P++ S Y+ L D R R R
Sbjct: 23 ATFSSRLIHRFSEEAKAHLASRGNKSSVLLQAWPQRNSSEYFRLLLRSD-VARQRMR--- 78
Query: 79 AQGNDKTPLTFSAGNDTYRL-NSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVS 137
G+ L S G T+ N+L +LHYT + +G P +SF+VALD GSD+ W+PCDC+
Sbjct: 79 -LGSQYETLYPSEGGQTFFFGNALYWLHYTWIDIGTPNVSFLVALDAGSDMLWVPCDCIE 137
Query: 138 CVHGLNSSSGQVID--FNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSD 195
C L++ + V+D N Y P+ S+TS +PC LC++ C + CPY+V+Y S
Sbjct: 138 CA-SLSAGNYNVLDRDLNQYRPSLSNTSRHLPCGHKLCDVHSFCKGSKDPCPYEVQYASA 196
Query: 196 GTMSTGFLVEDVLHLATDEKQSK--SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKT 253
T S+G++ ED LHL +D K ++ SV + I GCGR QTG +L GA P+G+ GLG
Sbjct: 197 NTSSSGYVFEDKLHLTSDGKHAEQNSVQASIILGCGRKQTGDYLHGAGPDGVLGLGPGNI 256
Query: 254 SVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVS 313
SVPS+LA GLI NSFS+C + +GRI FGD+G Q TPF Y + +
Sbjct: 257 SVPSLLAKAGLIQNSFSICLDENESGRIIFGDQGHVTQHSTPF---LPIIAYMVGVESFC 313
Query: 314 VGGNAV-NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLR 372
VG + F A+ DSG+SFT+L + Y ++ F+ R S +EYCY
Sbjct: 314 VGSLCLKETRFQALIDSGSSFTFLPNEVYQKVVTEFDKQVNASRIVLQSS--WEYCYNAS 371
Query: 373 SFLHLQALVVLP 384
S Q LV +P
Sbjct: 372 S----QELVNIP 379
>gi|356551638|ref|XP_003544181.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 880
Score = 223 bits (567), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 142/374 (37%), Positives = 202/374 (54%), Gaps = 32/374 (8%)
Query: 29 TFGFDFHHRYSDPVKGILA---------VDDLPKKGSFAYYSALAHRDRYFRLRGRGLAA 79
TF HR+S+ K LA + P++ S Y+ L D R R R L +
Sbjct: 24 TFSSRLIHRFSEEAKAHLASRGSDGSVLLQAWPERNSSEYFRLLLRSD-VTRQRMR-LGS 81
Query: 80 QGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCV 139
Q P F G N+L +LHYT + +G P +SF+VALD GSD+ W+PCDC+ C
Sbjct: 82 QYEMLYP--FEGGQTFLFGNALYWLHYTWIDIGTPNVSFLVALDAGSDMLWVPCDCIECA 139
Query: 140 HGLNSSSGQVID--FNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGT 197
L++ + V+D N Y P+ S+TS +PC LC++ C + CPY V+Y S T
Sbjct: 140 -SLSAGNYNVLDRDLNQYRPSLSNTSRHLPCGHKLCDVHSVCKGSKDPCPYAVQYSSANT 198
Query: 198 MSTGFLVEDVLHLATDEKQSK--SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSV 255
S+G++ ED LHL ++ K ++ SV + I GCGR QTG +L GA P+G+ GLG SV
Sbjct: 199 SSSGYVFEDKLHLTSNGKHAEQNSVQASIILGCGRKQTGEYLRGAGPDGVLGLGPGNISV 258
Query: 256 PSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPF-SLRQTHPTYNITITQVSV 314
PS+LA GLI NSFS+CF + +GRI FGD+G Q TPF + Y + + V
Sbjct: 259 PSLLAKAGLIQNSFSICFEENESGRIIFGDQGHVTQHSTPFLPIDGKFNAYIVGVESFCV 318
Query: 315 GGNAV-NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDL---PFEYCYV 370
G + F A+ DSG+SFT+L + Y ++ F+ K+ +TS + +EYCY
Sbjct: 319 GSLCLKETRFQALIDSGSSFTFLPNEVYQKVVIEFD-----KQVNATSIVLQNSWEYCYN 373
Query: 371 LRSFLHLQALVVLP 384
S Q L+ +P
Sbjct: 374 ASS----QELISIP 383
>gi|357489329|ref|XP_003614952.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355516287|gb|AES97910.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 530
Score = 221 bits (563), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 142/353 (40%), Positives = 184/353 (52%), Gaps = 18/353 (5%)
Query: 29 TFGFDFHHRYSDPVK------GILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGN 82
TF HR+SD K G V PK+GS Y+ L + D + L +Q
Sbjct: 24 TFSSRIIHRFSDEAKVHLRNNGGENVQSWPKRGSSEYFRLLLNSD--LTRQKMKLGSQDQ 81
Query: 83 DKTPLTFSAGNDTYRL-NSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHG 141
P S G+ T N +LHYT + +G P +SF+VALDTGSD+FW+PCDC+ C
Sbjct: 82 SFYP---SEGSKTLSFGNDFVWLHYTWIDIGTPNVSFLVALDTGSDMFWVPCDCIECAP- 137
Query: 142 LNSSSGQVID--FNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMS 199
L+++ +D N YSP+ SS+S +PC LC C CPY Y SD T S
Sbjct: 138 LSAAFYNALDRDLNQYSPSLSSSSRHLPCGHQLCNQNSNCKGFKDRCPYIKEYTSDNTSS 197
Query: 200 TGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSIL 259
+GFL+ED LHLA++ S+ + + GCGR Q+G FL+GAAPNG+ GLG SVP++L
Sbjct: 198 SGFLIEDKLHLASNNATKNSIQASVILGCGRKQSGYFLEGAAPNGMLGLGPGSISVPALL 257
Query: 260 ANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGE-TPFSLRQTH-PTYNITITQVSVGGN 317
A GLI NS S+C G+GRI FGD+G Q TPF L Y + + + VG
Sbjct: 258 AKAGLIRNSISICLNEKGSGRILFGDQGHATQRRSTPFLLDDGELLNYFVGVERFCVGSF 317
Query: 318 AVN-FEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
EF A D+GTSFTYL Y + F R TS F CY
Sbjct: 318 CYKETEFKAFIDTGTSFTYLPKGVYETVVAEFEKQVHATRITSQIQSDFNCCY 370
>gi|449445106|ref|XP_004140314.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
gi|449479851|ref|XP_004155727.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 523
Score = 218 bits (555), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 144/396 (36%), Positives = 204/396 (51%), Gaps = 26/396 (6%)
Query: 11 CVLLILL--SCCAGCCFGFGTFGFDFHHRYSDPVK--------GILAVDDLPKKGSFAYY 60
C LL+L S C T + HR+SD K G ++ P S Y+
Sbjct: 4 CALLLLFIASLFVNCSLAL-TLSLNLVHRFSDEAKSLWESRRTGNVSAKFWPPTNSLKYF 62
Query: 61 SALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRL-NSLGFLHYTNVSVGQPALSFI 119
L D L+ R L G+ L S G+ N +LHYT + +G P++ F+
Sbjct: 63 QMLMDYD----LKRRRLNI-GSKYDVLFPSEGSQVIFFGNEFNWLHYTWIDLGTPSVPFL 117
Query: 120 VALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNI--YSPNTSSTSSKVPCNSTLCELQK 177
VALD GSDL W+PCDC+ C L+++ V+D ++ Y+P SSTS + C LC
Sbjct: 118 VALDVGSDLLWVPCDCIQCA-PLSANYYSVLDRDLSEYNPALSSTSKHLFCGHQLCAWST 176
Query: 178 QCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS--VDSRISFGCGRVQTGS 235
C SA C Y+ Y SD T ++GF++ED L L + K + + + FGCGR Q+GS
Sbjct: 177 TCKSANDPCTYKRDYYSDNTSTSGFMIEDKLQLTSFSKHGTHSLLQASVVFGCGRKQSGS 236
Query: 236 FLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETP 295
+LDGAAP+G+ GLG SVP++LA +GL+ N+FS+CF ++G+GRI FGD G Q T
Sbjct: 237 YLDGAAPDGVMGLGPGNISVPTLLAQEGLVRNTFSLCFDNNGSGRILFGDDGPATQQTTQ 296
Query: 296 F-SLRQTHPTYNITITQVSVGGNAVNFE-FSAIFDSGTSFTYLNDPAYTQISETFNSLAK 353
F L Y I + VG + + F A+ DSG+SFTYL Y +I F+ K
Sbjct: 297 FLPLFGEFAAYFIGVESFCVGSSCLQRSGFQALVDSGSSFTYLPAEVYKKIVFEFDKQVK 356
Query: 354 -EKRETSTSDLPFEYCYVLRSFLHLQA-LVVLPFPL 387
+LP+ YCY + + + + L FPL
Sbjct: 357 VNATRIVLRELPWNYCYNISTLVSFNIPSMQLVFPL 392
>gi|359496966|ref|XP_002269916.2| PREDICTED: aspartic proteinase-like protein 1-like, partial [Vitis
vinifera]
Length = 294
Score = 215 bits (548), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 103/142 (72%), Positives = 120/142 (84%)
Query: 228 CGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKG 287
CG+VQTGSFL+GAAPNGLFGLGM SVPSILA +GL+ +SFSMCFG+DGTGRISFGD+G
Sbjct: 1 CGKVQTGSFLEGAAPNGLFGLGMGSISVPSILAKEGLVADSFSMCFGNDGTGRISFGDEG 60
Query: 288 SPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISET 347
S GQ ETPF+ ++ YNI+ITQ+SVGG + + F AIFDSGTSFTYLNDPAYT ISE+
Sbjct: 61 SSGQEETPFNPSKSQLLYNISITQISVGGTSADLNFDAIFDSGTSFTYLNDPAYTSISES 120
Query: 348 FNSLAKEKRETSTSDLPFEYCY 369
FN AK+KR +S SDLPFEYCY
Sbjct: 121 FNLRAKDKRSSSDSDLPFEYCY 142
>gi|168000300|ref|XP_001752854.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696017|gb|EDQ82358.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 525
Score = 215 bits (548), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 131/384 (34%), Positives = 194/384 (50%), Gaps = 22/384 (5%)
Query: 1 MASSYRNSPVCVLLILLSCCAGCCFG---FGTFGFDFHHRYSDPV-------KGILAVDD 50
MA++ R+ V L+++ CC D H++S G+ D
Sbjct: 1 MATTVRSRGV---LVMVHCCVLWMLATTFANALRMDLFHKFSKQAIEAMRSRNGMDYAQD 57
Query: 51 LPKKGSFAYYSALAHRD--RYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTN 108
P +G+ + + L D R+ R R LAA D+ L GN T +L G LHY+
Sbjct: 58 WPTEGTIEFQTMLRDHDVARHTRTARRILAASSMDQYVLI--QGNATEQLFG-GGLHYSY 114
Query: 109 VSVGQPALSFIVALDTGSDLFWLPCDCVSCVH-GLNSSSGQVIDFNIYSPNTSSTSSKVP 167
+ +G P + F+V LDTGSDL W+PC+C SC S + N Y+P+ SST+ V
Sbjct: 115 IDIGTPNVQFLVVLDTGSDLLWIPCECESCAPLSAESKDPRTSQLNPYTPSLSSTAKPVL 174
Query: 168 CNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFG 227
C+ LCE+ C + CPY++ Y+S T ++G L ED ++ E V + G
Sbjct: 175 CSDPLCEMSSTCMAPTDQCPYEINYVSANTSTSGALYEDYMYFM-RESGGNPVKLPVYLG 233
Query: 228 CGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKG 287
CG+VQTGS L GAAPNGL GLG SVP+ LA+ G + +SFS+C G+G ++FGD+G
Sbjct: 234 CGKVQTGSLLKGAAPNGLMGLGTTDISVPNKLASTGQLADSFSLCISPGGSGTLTFGDEG 293
Query: 288 SPGQGETPFSLRQTH--PTYNITITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQIS 345
Q TP + TY + I ++VG + A+FD+GTSFTYL+ Y Q
Sbjct: 294 PAAQRTTPIIPKSVSMLDTYIVEIDSITVGNTNLLMASHALFDTGTSFTYLSKTVYPQFV 353
Query: 346 ETFNSLAKEKRETSTSDLPFEYCY 369
+ +++ + ++ CY
Sbjct: 354 QAYDAQMSLPKWNDPRFSKWDLCY 377
>gi|296084698|emb|CBI25840.3| unnamed protein product [Vitis vinifera]
Length = 306
Score = 214 bits (546), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 103/142 (72%), Positives = 120/142 (84%)
Query: 228 CGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKG 287
CG+VQTGSFL+GAAPNGLFGLGM SVPSILA +GL+ +SFSMCFG+DGTGRISFGD+G
Sbjct: 13 CGKVQTGSFLEGAAPNGLFGLGMGSISVPSILAKEGLVADSFSMCFGNDGTGRISFGDEG 72
Query: 288 SPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISET 347
S GQ ETPF+ ++ YNI+ITQ+SVGG + + F AIFDSGTSFTYLNDPAYT ISE+
Sbjct: 73 SSGQEETPFNPSKSQLLYNISITQISVGGTSADLNFDAIFDSGTSFTYLNDPAYTSISES 132
Query: 348 FNSLAKEKRETSTSDLPFEYCY 369
FN AK+KR +S SDLPFEYCY
Sbjct: 133 FNLRAKDKRSSSDSDLPFEYCY 154
>gi|6562288|emb|CAB62658.1| putative protein [Arabidopsis thaliana]
Length = 426
Score = 214 bits (544), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 130/347 (37%), Positives = 194/347 (55%), Gaps = 50/347 (14%)
Query: 28 GTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPL 87
G+ F+ HHR+S+ VK +L LP+ GS YY AL HRDR GR L + N++T +
Sbjct: 20 GSLSFEIHHRFSEQVKTVLGGHGLPEMGSLDYYKALVHRDR-----GRQLTSNNNNQTTI 74
Query: 88 TFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSG 147
+F+ GN T ++ L+ N++ P L F + V C L
Sbjct: 75 SFAQGNSTEEIS----LYDKNLA---PPLYFHLT------------QAVICFGYL----- 110
Query: 148 QVIDFNIYSPNTSSTSSKVPCNSTLCELQK-QCPSAGSNCPYQVRYLSDGTMSTGFLVED 206
+ +P + L K +C S S+CPY++RYLS G+ STG LVED
Sbjct: 111 ---------------AIAIPLVYGVWRLTKARCISPVSDCPYRIRYLSPGSKSTGVLVED 155
Query: 207 VLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIP 266
V+H++T+E +++ D+RI+FG Q G F + A NG+ GL + +VP++L G+
Sbjct: 156 VIHMSTEEGEAR--DARITFG--ESQLGLFKE-VAVNGIMGLAIADIAVPNMLVKAGVAS 210
Query: 267 NSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAI 326
+SFSMCFG +G G ISFGDKGS Q ETP S + Y+++IT+ VG V+ EF+A
Sbjct: 211 DSFSMCFGPNGKGTISFGDKGSSDQLETPLSGTISPMFYDVSITKFKVGKVTVDTEFTAT 270
Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRS 373
FDSGT+ T+L +P YT ++ F+ ++R + + D PFE+CY++ S
Sbjct: 271 FDSGTAVTWLIEPYYTALTTNFHLSVPDRRLSKSVDSPFEFCYIITS 317
>gi|374255989|gb|AEZ00856.1| putative peptidase A1 protein, partial [Elaeis guineensis]
Length = 263
Score = 201 bits (510), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 97/162 (59%), Positives = 120/162 (74%)
Query: 210 LATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSF 269
T+E K V + I FGCG+VQTG+FLD AAPNGLFGLGMDK SVPS+LA++G NSF
Sbjct: 1 FKTEETIPKVVKAPIVFGCGQVQTGAFLDSAAPNGLFGLGMDKVSVPSVLASKGYASNSF 60
Query: 270 SMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIFDS 329
SMCFGSDG GRI FGD GS QGETPF + +HPTYNI++ + VG ++++ SAI DS
Sbjct: 61 SMCFGSDGMGRIYFGDTGSSDQGETPFDVNHSHPTYNISLIGMEVGNSSIDVNSSAIVDS 120
Query: 330 GTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVL 371
GTSFT L DP YT++SE+F++ +E R S +PFEYCY L
Sbjct: 121 GTSFTCLADPMYTKLSESFHAQVRENRHESDPGIPFEYCYGL 162
>gi|297739018|emb|CBI28370.3| unnamed protein product [Vitis vinifera]
Length = 150
Score = 186 bits (472), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 88/143 (61%), Positives = 110/143 (76%), Gaps = 2/143 (1%)
Query: 10 VCVLLILLSCCAGCCFGFGTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRY 69
V V++++ + C+G GTFGFD HHR+SDPVKGIL VDDLP+K S YY A+AHRD
Sbjct: 10 VLVVVLISGWVSQICYGLGTFGFDMHHRFSDPVKGILDVDDLPEKLSLQYYKAMAHRD-- 67
Query: 70 FRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLF 129
+ + GR L+ K PLTFS GN+TYRL+SLG+LHY NVS+G P+L F+VALDTGSDLF
Sbjct: 68 WVIHGRRLSTSDEVKPPLTFSDGNETYRLSSLGYLHYANVSLGTPSLWFLVALDTGSDLF 127
Query: 130 WLPCDCVSCVHGLNSSSGQVIDF 152
WLPCDC SC+ GLN++SG+V F
Sbjct: 128 WLPCDCTSCIKGLNTTSGKVCYF 150
>gi|359496801|ref|XP_003635339.1| PREDICTED: aspartic proteinase-like protein 1-like [Vitis vinifera]
Length = 151
Score = 186 bits (472), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 88/143 (61%), Positives = 110/143 (76%), Gaps = 2/143 (1%)
Query: 10 VCVLLILLSCCAGCCFGFGTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRY 69
V V++++ + C+G GTFGFD HHR+SDPVKGIL VDDLP+K S YY A+AHRD
Sbjct: 10 VLVVVLISGWVSQICYGLGTFGFDMHHRFSDPVKGILDVDDLPEKLSLQYYKAMAHRD-- 67
Query: 70 FRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLF 129
+ + GR L+ K PLTFS GN+TYRL+SLG+LHY NVS+G P+L F+VALDTGSDLF
Sbjct: 68 WVIHGRRLSTSDEVKPPLTFSDGNETYRLSSLGYLHYANVSLGTPSLWFLVALDTGSDLF 127
Query: 130 WLPCDCVSCVHGLNSSSGQVIDF 152
WLPCDC SC+ GLN++SG+V F
Sbjct: 128 WLPCDCTSCIKGLNTTSGKVCYF 150
>gi|223946655|gb|ACN27411.1| unknown [Zea mays]
Length = 378
Score = 168 bits (426), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 93/221 (42%), Positives = 122/221 (55%), Gaps = 4/221 (1%)
Query: 151 DFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL 210
D IY P S+TS +PC+ LC+ C + CPY + Y S+ T S+G L+ED LHL
Sbjct: 5 DLRIYRPAESTTSRHLPCSHELCQSVPGCTNPKQPCPYNIDYFSENTTSSGLLIEDTLHL 64
Query: 211 ATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFS 270
E V++ + GCG+ Q+G +LDG AP+GL GLGM SVPS LA GL+ NSFS
Sbjct: 65 NYREDHVP-VNASVIIGCGQKQSGDYLDGIAPDGLLGLGMADISVPSFLARAGLVQNSFS 123
Query: 271 MCFGSDGTGRISFGDKGSPGQGETPF-SLRQTHPTYNITITQVSVGGNAVN-FEFSAIFD 328
MCF D +GRI FGD+G P Q TPF L TY + + + +G + F A+ D
Sbjct: 124 MCFKEDSSGRIFFGDQGVPSQQSTPFVPLYGKLQTYAVNVDKSCIGHKCLEGTSFKALVD 183
Query: 329 SGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
SGTSFT L Y + F+ R D ++YCY
Sbjct: 184 SGTSFTSLPFDVYKAFTMEFDKQMNATR-VPYEDTTWKYCY 223
>gi|115469998|ref|NP_001058598.1| Os06g0717900 [Oryza sativa Japonica Group]
gi|54291047|dbj|BAD61724.1| aspartic proteinase nepenthesin II-like [Oryza sativa Japonica
Group]
gi|113596638|dbj|BAF20512.1| Os06g0717900 [Oryza sativa Japonica Group]
Length = 307
Score = 144 bits (363), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 76/134 (56%), Positives = 90/134 (67%), Gaps = 6/134 (4%)
Query: 244 GLFGLGMDKTSVPSILANQGLIP-NSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTH 302
L GLGM+K SVPSILA+ G++ NSFSMCF DG GRI+FGD GS Q ETPF ++ TH
Sbjct: 8 ALMGLGMEKVSVPSILASTGVVKSNSFSMCFSKDGLGRINFGDTGSADQSETPFIVKSTH 67
Query: 303 PTYNITITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKR-----E 357
YNI+IT +SVG + F AI DSGTSFTYLNDPAYT + FN+ E+R
Sbjct: 68 SYYNISITSMSVGDKNLPLGFYAIADSGTSFTYLNDPAYTAYTTNFNAQISERRANFSGS 127
Query: 358 TSTSDLPFEYCYVL 371
T + PFEYCY L
Sbjct: 128 TRSGPFPFEYCYSL 141
>gi|351722911|ref|NP_001237772.1| uncharacterized protein LOC100500675 [Glycine max]
gi|255630909|gb|ACU15817.1| unknown [Glycine max]
Length = 244
Score = 138 bits (347), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 63/102 (61%), Positives = 78/102 (76%), Gaps = 3/102 (2%)
Query: 271 MCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIFDSG 330
MCFG DG GRI+FGD GSP Q +TPF++R+ HPTYNITITQ+ V + + EF AIFDSG
Sbjct: 1 MCFGPDGAGRITFGDTGSPDQRKTPFNVRKLHPTYNITITQIVVEDSVADLEFHAIFDSG 60
Query: 331 TSFTYLNDPAYTQISETFNSLAKEKRETST---SDLPFEYCY 369
TSFTY+NDPAYT++ E +NS K R +S S++PFEYCY
Sbjct: 61 TSFTYINDPAYTRLGEMYNSKVKANRHSSQSPDSNIPFEYCY 102
>gi|413924529|gb|AFW64461.1| hypothetical protein ZEAMMB73_591827 [Zea mays]
Length = 217
Score = 134 bits (336), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 77/183 (42%), Positives = 97/183 (53%), Gaps = 10/183 (5%)
Query: 36 HRYSDPVKGILA--VDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGN 93
HR SD + + V P++GS YY AL D + + R LA K TFS GN
Sbjct: 33 HRLSDEARLEVGPRVGWWPQRGSGEYYRALVRSD--IQRQKRRLAVLSLSKGGSTFSPGN 90
Query: 94 DTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFN 153
D LG+L+Y V VG PA SF+VALDTGSDLFW+PCDC+ C D
Sbjct: 91 D------LGWLYYAWVDVGTPATSFLVALDTGSDLFWVPCDCIQCAPLSGYRGNLDRDLR 144
Query: 154 IYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
IY P S+TS +PC+ LC+ C + CPY + Y S+ T S+G L+ED LHL
Sbjct: 145 IYRPAESTTSRHLPCSHELCQSVPGCTNPKQPCPYNIDYFSENTTSSGLLIEDTLHLNYR 204
Query: 214 EKQ 216
E
Sbjct: 205 EDH 207
>gi|356543524|ref|XP_003540210.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 493
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 92/257 (35%), Positives = 132/257 (51%), Gaps = 25/257 (9%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
L+YT V +G P + F V +DTGSD+ W+ C+ SC +G +SG I N + P +SSTS
Sbjct: 77 LYYTKVQLGTPPVEFNVQIDTGSDVLWVSCN--SC-NGCPQTSGLQIQLNFFDPGSSSTS 133
Query: 164 SKVPCNSTLCELQKQ-----CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
S + C+ C KQ C S + C Y +Y DG+ ++G+ V D++HL T + S
Sbjct: 134 SMIACSDQRCNNGKQSSDATCSSQNNQCSYTFQY-GDGSGTSGYYVSDMMHLNTIFEGSM 192
Query: 219 SVDSR--ISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
+ +S + FGC QTG A +G+FG G + SV S L++QG+ P FS C
Sbjct: 193 TTNSTAPVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRIFSHCLKG 252
Query: 276 D--GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA-------- 325
D G G + G+ P T SL P YN+ + +SV G + + S
Sbjct: 253 DSSGGGILVLGEIVEPNIVYT--SLVPAQPHYNLNLQSISVNGQTLQIDSSVFATSNSRG 310
Query: 326 -IFDSGTSFTYLNDPAY 341
I DSGT+ YL + AY
Sbjct: 311 TIVDSGTTLAYLAEEAY 327
>gi|168064205|ref|XP_001784055.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664441|gb|EDQ51161.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 459
Score = 130 bits (328), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 98/300 (32%), Positives = 147/300 (49%), Gaps = 35/300 (11%)
Query: 59 YYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSF 118
YY L D+ RLR R L + S +DT+ L+YT + +G P F
Sbjct: 12 YYRTLREHDQR-RLR-RILP----EVVAFPISGDDDTFTTG----LYYTRIYLGTPPQQF 61
Query: 119 IVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCEL--Q 176
V +DTGSD+ W+ +CV C + +S + +I+ P S++ + + C C L
Sbjct: 62 YVHVDTGSDVAWV--NCVPCTN-CKRASNVALPISIFDPEKSTSKTSISCTDEECYLASN 118
Query: 177 KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVL---HLATDEKQSKSVDSRISFGCGRVQT 233
+C +CPY Y DG+ + G+L+ DVL + + + S +R++FGCG QT
Sbjct: 119 SKCSFNSMSCPYSTLY-GDGSSTAGYLINDVLSFNQVPSGNSTATSGTARLTFGCGSNQT 177
Query: 234 GSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD--GTGRISFGDKGSPGQ 291
G++L +GL G G + S+PS L+ Q + N F+ C D G+G + G PG
Sbjct: 178 GTWLT----DGLVGFGQAEVSLPSQLSKQNVSVNIFAHCLQGDNKGSGTLVIGHIREPGL 233
Query: 292 GETPFSLRQTHPTYNITITQVSVGGNAVN----FEFS----AIFDSGTSFTYLNDPAYTQ 343
TP +Q+H YN+ + + V G V F+ S I DSGT+ TYL PAY Q
Sbjct: 234 VYTPIVPKQSH--YNVELLNIGVSGTNVTTPTAFDLSNSGGVIMDSGTTLTYLVQPAYDQ 291
>gi|356564743|ref|XP_003550608.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 490
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 98/300 (32%), Positives = 143/300 (47%), Gaps = 32/300 (10%)
Query: 61 SALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIV 120
S L RD LR R + N + D +++ L+YT V +G P + F V
Sbjct: 38 SQLRARDA---LRHRRMLQSSNGVVDFSVQGTFDPFQVG----LYYTKVQLGTPPVEFNV 90
Query: 121 ALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQ-- 178
+DTGSD+ W+ C+ S G +SG I N + P +SSTSS + C+ C Q
Sbjct: 91 QIDTGSDVLWVSCNSCS---GCPQTSGLQIQLNFFDPGSSSTSSMIACSDQRCNNGIQSS 147
Query: 179 ---CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR--ISFGCGRVQT 233
C S + C Y +Y DG+ ++G+ V D++HL T + S + +S + FGC QT
Sbjct: 148 DATCSSQNNQCSYTFQY-GDGSGTSGYYVSDMMHLNTIFEGSVTTNSTAPVVFGCSNQQT 206
Query: 234 GSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD--GTGRISFGDKGSPG 290
G A +G+FG G + SV S L++QG+ P FS C D G G + G+ P
Sbjct: 207 GDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRVFSHCLKGDSSGGGILVLGEIVEPN 266
Query: 291 QGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA---------IFDSGTSFTYLNDPAY 341
T SL P YN+ + ++V G + + S I DSGT+ YL + AY
Sbjct: 267 IVYT--SLVPAQPHYNLNLQSIAVNGQTLQIDSSVFATSNSRGTIVDSGTTLAYLAEEAY 324
>gi|255634819|gb|ACU17770.1| unknown [Glycine max]
Length = 354
Score = 128 bits (321), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 94/289 (32%), Positives = 139/289 (48%), Gaps = 27/289 (9%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
L+YT V +G P + F V +DTGSD+ W+ C+ S G +SG I N + P +SSTS
Sbjct: 24 LYYTKVQLGTPPVEFNVQIDTGSDVLWVSCNSCS---GCPQTSGLQIQLNFFDPGSSSTS 80
Query: 164 SKVPCNSTLCELQKQ-----CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
S + C+ C Q C S + C Y +Y DG+ ++G+ V D++HL T + S
Sbjct: 81 SMIACSDQRCNNGIQSSDATCSSQNNQCSYTFQY-GDGSGTSGYYVSDMMHLNTIFEGSV 139
Query: 219 SVDSR--ISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
+ +S + FGC QTG A +G+FG G + SV S L++QG+ P FS C
Sbjct: 140 TTNSTAPVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRVFSHCLKG 199
Query: 276 D--GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA-------- 325
D G G + G+ P T SL P YN+ + ++V G + + S
Sbjct: 200 DSSGGGILVLGEIVEPNIVYT--SLVPAQPHYNLNLQSIAVNGQTLQIDSSVFATSNSRG 257
Query: 326 -IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRS 373
I DSGT+ YL + AY + + T+ S CY++ S
Sbjct: 258 TIVDSGTTLAYLAEEAYDPFVSAITASIPQSVHTAVSR--GNQCYLITS 304
>gi|357125326|ref|XP_003564345.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 506
Score = 128 bits (321), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 106/319 (33%), Positives = 153/319 (47%), Gaps = 43/319 (13%)
Query: 51 LPKKGSFAYYSALAHRDRYFRLRGRGL-----AAQGNDKTPLTFSAGNDTYRLNSLGFLH 105
LP KG + L RD R RGL A G P+ SA + Y + L+
Sbjct: 38 LPHKGVPVEH--LKERDGAHHARRRGLLGGAPAVAGVVDFPVEGSA--NPYMVG----LY 89
Query: 106 YTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+T V +G PA + V +DTGSD+ W+ C C C +SSG I ++P++SSTSS
Sbjct: 90 FTRVKLGNPAKEYFVQIDTGSDILWVACSPCTGC----PTSSGLNIQLEFFNPDSSSTSS 145
Query: 165 KVPCNSTLCELQKQCPSA--------GSNCPYQVRYLSDGTMSTGFLVEDVLHLAT--DE 214
++PC+ C Q A S C Y Y DG+ ++GF V D ++ T
Sbjct: 146 RIPCSDDRCTAALQTGEAVCQSSDSPSSPCGYTFTY-GDGSGTSGFYVSDTMYFDTVMGN 204
Query: 215 KQSKSVDSRISFGCGRVQTGSFLD-GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
+Q+ + + + FGC Q+G + A +G+FG G + SV S L + G+ P +FS C
Sbjct: 205 EQTANSSASVVFGCSNSQSGDLMKTDRAVDGIFGFGQHQLSVVSQLYSLGVSPKTFSHCL 264
Query: 274 -GSD-GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS------- 324
GSD G G + G+ PG TP L + P YN+ + ++V G + + S
Sbjct: 265 KGSDNGGGILVLGEIVEPGLVFTP--LVPSQPHYNLNLESIAVSGQKLPIDSSLFATSNT 322
Query: 325 --AIFDSGTSFTYLNDPAY 341
I DSGT+ YL D AY
Sbjct: 323 QGTIVDSGTTLVYLVDGAY 341
>gi|224058947|ref|XP_002299658.1| predicted protein [Populus trichocarpa]
gi|222846916|gb|EEE84463.1| predicted protein [Populus trichocarpa]
Length = 451
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 100/302 (33%), Positives = 140/302 (46%), Gaps = 27/302 (8%)
Query: 61 SALAHRDRYFRLRGRGLAAQGNDKTPL--TFSAGNDTYRLNSLGFLHYTNVSVGQPALSF 118
S L RDR R + G P+ TF + S L+YT + +G P F
Sbjct: 44 SQLKERDRVRHSRMLQSSGGGVVDFPVQGTFDPFLVGFYFGSFCRLYYTRLQLGSPPRDF 103
Query: 119 IVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQ 178
V +DTGSD+ W+ C S +G SSG I N + P +S T+S + C+ C L Q
Sbjct: 104 YVQIDTGSDVLWVSC---SSCNGCPVSSGLHIPLNFFDPGSSPTASLISCSDQRCSLGLQ 160
Query: 179 -----CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS--KSVDSRISFGCGRV 231
C + + C Y +Y DG+ ++G+ V D+LH T S K+ + I FGC +
Sbjct: 161 SSDSVCAAQNNQCGYTFQY-GDGSGTSGYYVSDLLHFDTILGGSVMKNSSAPIVFGCSTL 219
Query: 232 QTGSFLD-GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD--GTGRISFGDKGS 288
QTG A +G+FG G SV S LA+QG+ P FS C D G G + G+
Sbjct: 220 QTGDLTKPDRAVDGIFGFGQQDMSVISQLASQGITPRVFSHCLKGDDSGGGILVLGEIVE 279
Query: 289 PGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA---------IFDSGTSFTYLNDP 339
P TP L + P YN+ + + V G + + S I DSGT+ YL +
Sbjct: 280 PNIVYTP--LVPSQPHYNLNLQSIYVNGQTLAIDPSVFATSSNQGTIIDSGTTLAYLTEA 337
Query: 340 AY 341
AY
Sbjct: 338 AY 339
>gi|449456068|ref|XP_004145772.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449496218|ref|XP_004160076.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 500
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 105/331 (31%), Positives = 158/331 (47%), Gaps = 33/331 (9%)
Query: 63 LAHRDRYFRLR-GRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVA 121
+AH R+R GR L + G + FS + TY +G L+YT V +G P F V
Sbjct: 45 IAHLRSRDRVRHGRMLQSSGG---VIDFSV-SGTYDPFLVG-LYYTRVQLGNPPKDFYVQ 99
Query: 122 LDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQ--- 178
+DTGSD+ W+ C+ SC +G ++SG I N + P +S+T+S V C+ +C L Q
Sbjct: 100 IDTGSDVLWVSCN--SC-NGCPATSGLQIPLNFFDPGSSTTASLVSCSDQICALGVQSSD 156
Query: 179 --CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQSKSVDSRISFGCGRVQTG 234
C + C Y +Y DG+ ++G+ V D++HL D + + + + FGC QTG
Sbjct: 157 SACFGQSNQCAYVFQY-GDGSGTSGYYVMDMIHLDVVIDSSVTSNSSASVVFGCSTSQTG 215
Query: 235 SFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD--GTGRISFGDKGSPGQ 291
A +G+FG G SV S L+++G+ P FS C D G G + G+ P
Sbjct: 216 DLTKSDRAVDGIFGFGQQDLSVISQLSSRGIAPKVFSHCLKGDDSGGGILVLGEIVEPNV 275
Query: 292 GETPFSLRQTHPTYNITITQVSVGGN---------AVNFEFSAIFDSGTSFTYLNDPAYT 342
TP L + P YN+ + +SV G A + I DSGT+ YL + AY
Sbjct: 276 VYTP--LVPSQPHYNLNLQSISVNGQVLPISPAVFATSSSQGTIIDSGTTLAYLAEEAYN 333
Query: 343 QISETFNSLAKEKRETSTSDLPFEYCYVLRS 373
++ + ++ L CYV S
Sbjct: 334 AFVVAVTNIVSQSTQSVV--LKGNRCYVTSS 362
>gi|224104765|ref|XP_002313558.1| predicted protein [Populus trichocarpa]
gi|222849966|gb|EEE87513.1| predicted protein [Populus trichocarpa]
Length = 468
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 105/335 (31%), Positives = 152/335 (45%), Gaps = 33/335 (9%)
Query: 61 SALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIV 120
S L RDR GR L + G D + + L+YT + +G P F V
Sbjct: 14 SKLKERDRV--RHGRMLQSSGVGVVDFPVQGTFDPFLVG----LYYTRLQLGTPPRDFYV 67
Query: 121 ALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQ-- 178
+DTGSD+ W+ C SC +G +SG I N + P +S T+S + C+ C L Q
Sbjct: 68 QIDTGSDVLWVSCG--SC-NGCPVNSGLHIPLNFFDPGSSPTASLISCSDQRCSLGLQSS 124
Query: 179 ---CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR--ISFGCGRVQT 233
C + + C Y +Y DG+ ++G+ V D+LH T S +S I FGC +QT
Sbjct: 125 DSVCSAQNNLCGYNFQY-GDGSGTSGYYVSDLLHFDTVLGGSVMNNSSAPIVFGCSALQT 183
Query: 234 GSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD--GTGRISFGDKGSPG 290
G A +G+FG G SV S LA+QG+ P +FS C D G G + G+ P
Sbjct: 184 GDLTKSDRAVDGIFGFGQQDMSVVSQLASQGISPRAFSHCLKGDDSGGGILVLGEIVEPN 243
Query: 291 QGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA---------IFDSGTSFTYLNDPAY 341
TP L + P YN+ + +SV G + + S I DSGT+ YL + AY
Sbjct: 244 IVYTP--LVPSQPHYNLNMQSISVNGQTLAIDPSVFGTSSSQGTIIDSGTTLAYLAEAAY 301
Query: 342 TQISETFNSLAKEKRETSTSDLPFEYCYVLRSFLH 376
S+ S +CY++ S ++
Sbjct: 302 DPFISAITSIVSPSVRPYLSK--GNHCYLISSSIN 334
>gi|15232960|ref|NP_186923.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|6728988|gb|AAF26986.1|AC018363_31 putative aspartyl protease [Arabidopsis thaliana]
gi|21593593|gb|AAM65560.1| putative aspartyl protease [Arabidopsis thaliana]
gi|332640332|gb|AEE73853.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 488
Score = 125 bits (314), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 81/254 (31%), Positives = 128/254 (50%), Gaps = 24/254 (9%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L++ + +G P+ F V +DTGSD+ W+ C C+ C +++ Y + SST
Sbjct: 84 LYFAKIGLGTPSRDFHVQVDTGSDILWVNCAGCIRC-----PRKSDLVELTPYDVDASST 138
Query: 163 SSKVPCNSTLCEL--QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLH--LATDEKQSK 218
+ V C+ C Q+ +GS C Y + Y DG+ + G+LV+DV+H L T +Q+
Sbjct: 139 AKSVSCSDNFCSYVNQRSECHSGSTCQYVIMY-GDGSSTNGYLVKDVVHLDLVTGNRQTG 197
Query: 219 SVDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-GSD 276
S + I FGCG Q+G + AA +G+ G G +S S LA+QG + SF+ C ++
Sbjct: 198 STNGTIIFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKVKRSFAHCLDNNN 257
Query: 277 GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA---------IF 327
G G + G+ SP TP + H Y++ + + VG + + +A I
Sbjct: 258 GGGIFAIGEVVSPKVKTTPMLSKSAH--YSVNLNAIEVGNSVLELSSNAFDSGDDKGVII 315
Query: 328 DSGTSFTYLNDPAY 341
DSGT+ YL D Y
Sbjct: 316 DSGTTLVYLPDAVY 329
>gi|297828736|ref|XP_002882250.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297328090|gb|EFH58509.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 488
Score = 125 bits (313), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 81/254 (31%), Positives = 126/254 (49%), Gaps = 24/254 (9%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L++ + +G P+ F V +DTGSD+ W+ C C+ C +++ Y + SST
Sbjct: 84 LYFAKIGLGTPSRDFHVQVDTGSDILWVNCAGCIRC-----PRKSDLVELTPYDADASST 138
Query: 163 SSKVPCNSTLCEL--QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLH--LATDEKQSK 218
+ V C+ C Q+ +GS C Y + Y DG+ + G+LV DV+H L T +Q+
Sbjct: 139 AKSVSCSDNFCSYVNQRSECHSGSTCQYVILY-GDGSSTNGYLVRDVVHLDLVTGNRQTG 197
Query: 219 SVDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-GSD 276
S + I FGCG Q+G + AA +G+ G G +S S LA+QG + SF+ C ++
Sbjct: 198 STNGTIIFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKVKRSFAHCLDNNN 257
Query: 277 GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA---------IF 327
G G + G+ SP TP + H Y++ + + VG + + A I
Sbjct: 258 GGGIFAIGEVVSPKVKTTPMLSKSAH--YSVNLNAIEVGNSVLQLSSDAFDSGDDKGVII 315
Query: 328 DSGTSFTYLNDPAY 341
DSGT+ YL D Y
Sbjct: 316 DSGTTLVYLPDAVY 329
>gi|297812425|ref|XP_002874096.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297319933|gb|EFH50355.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 493
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 94/290 (32%), Positives = 138/290 (47%), Gaps = 29/290 (10%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
L+YT + +G P F V +DTGSD+ W+ C SC +G +SG I N + P +S T+
Sbjct: 80 LYYTKIRLGSPPRDFYVQVDTGSDVLWV--SCASC-NGCPQTSGLQIQLNFFDPGSSVTA 136
Query: 164 SKVPCNSTLCELQKQCPSAGSN-----CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
+ V C+ C Q +G + C Y +Y DG+ ++GF V DVL S
Sbjct: 137 TPVSCSDQRCSWGIQSSDSGCSVQNNLCAYTFQY-GDGSGTSGFYVSDVLQFDMIVGSSL 195
Query: 219 SVDSR--ISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
+S + FGC QTG + A +G+FG G SV S LA+QGL P FS C
Sbjct: 196 VPNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGLAPRVFSHCLKG 255
Query: 276 D--GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA-------- 325
+ G G + G+ P TP L + P YN+ + +SV G A+ S
Sbjct: 256 ENGGGGILVLGEIVEPNMVFTP--LVPSQPHYNVNLLSISVNGQALPINPSVFSTSNGQG 313
Query: 326 -IFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEYCYVLRS 373
I D+GT+ YL++ AY E N++++ R + CYV+ +
Sbjct: 314 TIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPVVSKG---NQCYVIAT 360
>gi|168022164|ref|XP_001763610.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685103|gb|EDQ71500.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 308
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 93/298 (31%), Positives = 140/298 (46%), Gaps = 37/298 (12%)
Query: 67 DRYFRLRG---RGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALD 123
D Y LR R L + S ND + + L+YT +S+G P F V +D
Sbjct: 4 DHYHTLRKHDQRRLRRMLPEVVSFPISGDNDIFAMG----LYYTRISLGTPPQQFYVDVD 59
Query: 124 TGSDLFWLPC-DCVSCVHGLNSSSGQV-IDFNIYSPNTSSTSSKVPCNSTLCEL---QKQ 178
TGS++ W+ C C C H SG V + + + P S+T + C C + + Q
Sbjct: 60 TGSNVAWVKCAPCTGCEH-----SGDVPVPMSTFDPRKSTTKISISCTDAECGVLNKKLQ 114
Query: 179 CPSAGSNCPYQVRYLSDGTMSTGFLVEDVL---HLATDEKQSKSVDSRISFGCGRVQTGS 235
C +CPY + Y DG+ + G+ + DV + +D +KS +R+ FGCG QTGS
Sbjct: 115 CSPERLSCPYSLLY-GDGSSTAGYYLNDVFTFNQVPSDNSTAKSGTARLVFGCGGTQTGS 173
Query: 236 FLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGR--ISFGDKGSPGQGE 293
+ + +GL G G S+P+ LA Q + N F+ C D +GR + G P
Sbjct: 174 W----SVDGLLGFGPTTVSLPNQLAQQNISVNIFAHCLQGDVSGRGSLVIGTIREPDLVY 229
Query: 294 TPFSLRQTHPTYNITITQVSVGGNAV------NFEFS--AIFDSGTSFTYLNDPAYTQ 343
TP + H YN+ + + + G V + E++ I DSGT+ TYL PAY +
Sbjct: 230 TPMVFGEDH--YNVQLLNIGISGRNVTTPASFDLEYTGGVIIDSGTTLTYLVQPAYDE 285
>gi|10177232|dbj|BAB10606.1| protease-like protein [Arabidopsis thaliana]
Length = 539
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 93/290 (32%), Positives = 138/290 (47%), Gaps = 29/290 (10%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
L+YT + +G P F V +DTGSD+ W+ C SC +G +SG I N + P +S T+
Sbjct: 80 LYYTKLRLGTPPRDFYVQVDTGSDVLWV--SCASC-NGCPQTSGLQIQLNFFDPGSSVTA 136
Query: 164 SKVPCNSTLCELQKQCPSAGSN-----CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
S + C+ C Q +G + C Y +Y DG+ ++GF V DVL S
Sbjct: 137 SPISCSDQRCSWGIQSSDSGCSVQNNLCAYTFQY-GDGSGTSGFYVSDVLQFDMIVGSSL 195
Query: 219 SVDSR--ISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
+S + FGC QTG + A +G+FG G SV S LA+QG+ P FS C
Sbjct: 196 VPNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLKG 255
Query: 276 D--GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA-------- 325
+ G G + G+ P TP L + P YN+ + +SV G A+ S
Sbjct: 256 ENGGGGILVLGEIVEPNMVFTP--LVPSQPHYNVNLLSISVNGQALPINPSVFSTSNGQG 313
Query: 326 -IFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEYCYVLRS 373
I D+GT+ YL++ AY E N++++ R + CYV+ +
Sbjct: 314 TIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPVVSKG---NQCYVITT 360
>gi|357507803|ref|XP_003624190.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355499205|gb|AES80408.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 476
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 104/300 (34%), Positives = 135/300 (45%), Gaps = 38/300 (12%)
Query: 64 AHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALD 123
AH DR RGR LAA PL GN L S L+YT V +G PA F V +D
Sbjct: 43 AHDDRR---RGRFLAAI---DVPL---GGNG---LPSSTGLYYTKVGLGSPAKEFYVQVD 90
Query: 124 TGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSA 182
TGSD+ W+ C C +C SG +D +Y PN S TS+ VPC C P +
Sbjct: 91 TGSDILWVNCAGCTAC----PKKSGLGMDLTLYDPNGSKTSNAVPCGDGFCTDTYSGPIS 146
Query: 183 G----SNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQSKSVDSRISFGCGRVQTGSF 236
G +CPY + Y DG+ ++G V D L + +K +S + FGCG Q+GS
Sbjct: 147 GCKQDMSCPYSITY-GDGSTTSGSFVNDSLTFDEVSGNLHTKPDNSSVIFGCGAKQSGSL 205
Query: 237 LDGA--APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGTGRISFGDKGSPGQGE 293
+ A +G+ G G +SV S LA G + FS C S G G S G P
Sbjct: 206 SSNSDEALDGIIGFGQANSSVLSQLAASGKVKRIFSHCLDSHHGGGIFSIGQVMEPKFNT 265
Query: 294 TPFSLRQTHPTYNITITQVSVGGNAVNFEF---------SAIFDSGTSFTYLNDPAYTQI 344
TP R H YN+ + + V G + I DSGT+ YL Y Q+
Sbjct: 266 TPLVPRMAH--YNVILKDMDVDGEPILLPLYLFDSGSGRGTIIDSGTTLAYLPLSIYNQL 323
>gi|51536458|gb|AAU05467.1| At5g22850 [Arabidopsis thaliana]
gi|55733777|gb|AAV59285.1| At5g22850 [Arabidopsis thaliana]
Length = 426
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 93/290 (32%), Positives = 138/290 (47%), Gaps = 29/290 (10%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
L+YT + +G P F V +DTGSD+ W+ C SC +G +SG I N + P +S T+
Sbjct: 80 LYYTKLRLGTPPRDFYVQVDTGSDVLWV--SCASC-NGCPQTSGLQIQLNFFDPGSSVTA 136
Query: 164 SKVPCNSTLCELQKQCPSAGSN-----CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
S + C+ C Q +G + C Y +Y DG+ ++GF V DVL S
Sbjct: 137 SPISCSDQRCSWGIQSSDSGCSVQNNLCAYTFQY-GDGSGTSGFYVSDVLQFDMIVGSSL 195
Query: 219 SVDSR--ISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
+S + FGC QTG + A +G+FG G SV S LA+QG+ P FS C
Sbjct: 196 VPNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLKG 255
Query: 276 D--GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA-------- 325
+ G G + G+ P TP L + P YN+ + +SV G A+ S
Sbjct: 256 ENGGGGILVLGEIVEPNMVFTP--LVPSQPHYNVNLLSISVNGQALPINPSVFSTSNGQG 313
Query: 326 -IFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEYCYVLRS 373
I D+GT+ YL++ AY E N++++ R + CYV+ +
Sbjct: 314 TIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPVVSKG---NQCYVITT 360
>gi|30688682|ref|NP_197676.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|110736370|dbj|BAF00154.1| protease-like protein [Arabidopsis thaliana]
gi|332005704|gb|AED93087.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 493
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 93/290 (32%), Positives = 138/290 (47%), Gaps = 29/290 (10%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
L+YT + +G P F V +DTGSD+ W+ C SC +G +SG I N + P +S T+
Sbjct: 80 LYYTKLRLGTPPRDFYVQVDTGSDVLWV--SCASC-NGCPQTSGLQIQLNFFDPGSSVTA 136
Query: 164 SKVPCNSTLCELQKQCPSAGSN-----CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
S + C+ C Q +G + C Y +Y DG+ ++GF V DVL S
Sbjct: 137 SPISCSDQRCSWGIQSSDSGCSVQNNLCAYTFQY-GDGSGTSGFYVSDVLQFDMIVGSSL 195
Query: 219 SVDSR--ISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
+S + FGC QTG + A +G+FG G SV S LA+QG+ P FS C
Sbjct: 196 VPNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLKG 255
Query: 276 D--GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA-------- 325
+ G G + G+ P TP L + P YN+ + +SV G A+ S
Sbjct: 256 ENGGGGILVLGEIVEPNMVFTP--LVPSQPHYNVNLLSISVNGQALPINPSVFSTSNGQG 313
Query: 326 -IFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEYCYVLRS 373
I D+GT+ YL++ AY E N++++ R + CYV+ +
Sbjct: 314 TIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPVVSKG---NQCYVITT 360
>gi|53791672|dbj|BAD53242.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
Length = 504
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 91/259 (35%), Positives = 128/259 (49%), Gaps = 28/259 (10%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L++T V +G P + V +DTGSD+ W+ C C C SSSG I ++P+TSST
Sbjct: 90 LYFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGC----PSSSGLNIQLEFFNPDTSST 145
Query: 163 SSKVPCNSTLCELQKQCPSA------GSNCPYQVRYLSDGTMSTGFLVEDVLHLAT--DE 214
SSK+PC+ C Q A S C Y Y DG+ ++G+ V D ++ T
Sbjct: 146 SSKIPCSDDRCTAALQTSEAVCQTSDNSPCGYTFTY-GDGSGTSGYYVSDTMYFDTVMGN 204
Query: 215 KQSKSVDSRISFGCGRVQTGSFLD-GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
+Q+ + + I FGC Q+G A +G+FG G + SV S L + G+ P FS C
Sbjct: 205 EQTANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCL 264
Query: 274 -GSD-GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS------- 324
GSD G G + G+ PG TP L + P YN+ + + V G + + S
Sbjct: 265 KGSDNGGGILVLGEIVEPGLVYTP--LVPSQPHYNLNLESIVVNGQKLPIDSSLFTTSNT 322
Query: 325 --AIFDSGTSFTYLNDPAY 341
I DSGT+ YL D AY
Sbjct: 323 QGTIVDSGTTLAYLADGAY 341
>gi|297720449|ref|NP_001172586.1| Os01g0776900 [Oryza sativa Japonica Group]
gi|255673740|dbj|BAH91316.1| Os01g0776900 [Oryza sativa Japonica Group]
Length = 381
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 91/259 (35%), Positives = 128/259 (49%), Gaps = 28/259 (10%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L++T V +G P + V +DTGSD+ W+ C C C SSSG I ++P+TSST
Sbjct: 90 LYFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGC----PSSSGLNIQLEFFNPDTSST 145
Query: 163 SSKVPCNSTLCELQKQCPSA------GSNCPYQVRYLSDGTMSTGFLVEDVLHLAT--DE 214
SSK+PC+ C Q A S C Y Y DG+ ++G+ V D ++ T
Sbjct: 146 SSKIPCSDDRCTAALQTSEAVCQTSDNSPCGYTFTY-GDGSGTSGYYVSDTMYFDTVMGN 204
Query: 215 KQSKSVDSRISFGCGRVQTGSFLD-GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
+Q+ + + I FGC Q+G A +G+FG G + SV S L + G+ P FS C
Sbjct: 205 EQTANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCL 264
Query: 274 -GSD-GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS------- 324
GSD G G + G+ PG TP L + P YN+ + + V G + + S
Sbjct: 265 KGSDNGGGILVLGEIVEPGLVYTP--LVPSQPHYNLNLESIVVNGQKLPIDSSLFTTSNT 322
Query: 325 --AIFDSGTSFTYLNDPAY 341
I DSGT+ YL D AY
Sbjct: 323 QGTIVDSGTTLAYLADGAY 341
>gi|255565531|ref|XP_002523756.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223537060|gb|EEF38696.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 507
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 94/294 (31%), Positives = 138/294 (46%), Gaps = 32/294 (10%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
L++T V +G P F V +DTGSD+ W+ C S +G +SG I + P +S+T+
Sbjct: 83 LYFTRVQLGSPPKDFYVQIDTGSDVLWVSC---SSCNGCPVTSGLQIPLTFFDPGSSTTA 139
Query: 164 SKVPCNSTLCELQKQ-----CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS- 217
+ V C+ C Q C S + C Y +Y DG+ ++G+ V D++HL T S
Sbjct: 140 ALVSCSDQRCTAGIQSSDSLCSSRTNQCGYTFQY-GDGSGTSGYYVADLMHLDTLLLSSG 198
Query: 218 ------KSVDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFS 270
++ DS +SF C +QTG A +G+FG G + SV S LA+QG+ P FS
Sbjct: 199 ELSQICQTYDSSVSFMCSTLQTGDLTKSDRAVDGIFGFGQQEMSVISQLASQGITPRVFS 258
Query: 271 MCFGSD--GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA--- 325
C D G G + G+ P TP L + P YN+ + +SV G + + S
Sbjct: 259 HCLKGDDSGGGVLVLGEIVEPNIVYTP--LVPSQPHYNLYLQSISVAGQTLAIDPSVFGA 316
Query: 326 ------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRS 373
I DSGT+ YL + AY S+ T S CY++ S
Sbjct: 317 SSNQGTIVDSGTTLAYLAEGAYDPFVSAITSVVSLNARTYLSK--GNQCYLVTS 368
>gi|218189149|gb|EEC71576.1| hypothetical protein OsI_03949 [Oryza sativa Indica Group]
Length = 504
Score = 121 bits (303), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 90/259 (34%), Positives = 127/259 (49%), Gaps = 28/259 (10%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L++T V +G P + V +DTGSD+ W+ C C C SSSG I ++P+TSST
Sbjct: 90 LYFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGC----PSSSGLNIQLEFFNPDTSST 145
Query: 163 SSKVPCNSTLCELQKQCPSA------GSNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDE 214
SSK+PC+ C Q A S C Y Y DG+ ++G+ V D ++
Sbjct: 146 SSKIPCSDDRCTAALQTSEAVCQTSDNSPCGYTFTY-GDGSGTSGYYVSDTMYFDSVMGN 204
Query: 215 KQSKSVDSRISFGCGRVQTGSFLD-GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
+Q+ + + I FGC Q+G A +G+FG G + SV S L + G+ P FS C
Sbjct: 205 EQTANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCL 264
Query: 274 -GSD-GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS------- 324
GSD G G + G+ PG TP L + P YN+ + + V G + + S
Sbjct: 265 KGSDNGGGILVLGEIVEPGLVYTP--LVPSQPHYNLNLESIVVNGQKLPIDSSLFTTSNT 322
Query: 325 --AIFDSGTSFTYLNDPAY 341
I DSGT+ YL D AY
Sbjct: 323 QGTIVDSGTTLAYLADGAY 341
>gi|222619345|gb|EEE55477.1| hypothetical protein OsJ_03658 [Oryza sativa Japonica Group]
Length = 530
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 90/258 (34%), Positives = 126/258 (48%), Gaps = 28/258 (10%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++T V +G P + V +DTGSD+ W+ C C C SSSG I ++P+TSSTS
Sbjct: 117 YFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGC----PSSSGLNIQLEFFNPDTSSTS 172
Query: 164 SKVPCNSTLCELQKQCPSA------GSNCPYQVRYLSDGTMSTGFLVEDVLHLAT--DEK 215
SK+PC+ C Q A S C Y Y DG+ ++G+ V D ++ T +
Sbjct: 173 SKIPCSDDRCTAALQTSEAVCQTSDNSPCGYTFTY-GDGSGTSGYYVSDTMYFDTVMGNE 231
Query: 216 QSKSVDSRISFGCGRVQTGSFLD-GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF- 273
Q+ + + I FGC Q+G A +G+FG G + SV S L + G+ P FS C
Sbjct: 232 QTANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLK 291
Query: 274 GSD-GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS-------- 324
GSD G G + G+ PG TP Q H YN+ + + V G + + S
Sbjct: 292 GSDNGGGILVLGEIVEPGLVYTPLVPSQPH--YNLNLESIVVNGQKLPIDSSLFTTSNTQ 349
Query: 325 -AIFDSGTSFTYLNDPAY 341
I DSGT+ YL D AY
Sbjct: 350 GTIVDSGTTLAYLADGAY 367
>gi|168042409|ref|XP_001773681.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675069|gb|EDQ61569.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 399
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 90/291 (30%), Positives = 130/291 (44%), Gaps = 28/291 (9%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L+YT + +G P + + V +DTGSD+ WL C C SCV S I Y P+ SST
Sbjct: 36 LYYTKIYLGTPPVGYYVQVDTGSDVTWLNCAPCTSCVTETQLPS---IKLTTYDPSRSST 92
Query: 163 SSKVPCNSTLCEL-----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
+ C + C + C SAG C Y Y DG+ + G+ ++DV+ +
Sbjct: 93 DGALSCRDSNCGAALGSNEVSCTSAGY-CAYSTTY-GDGSSTQGYFIQDVMTFQEIHNNT 150
Query: 218 K-SVDSRISFGCGRVQTGSFL-DGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
+ + + + FGCG Q+G+ L A +GL G G S+PS LA+ G + N F+ C
Sbjct: 151 QVNGTASVYFGCGTTQSGNLLMSSRALDGLIGFGQAAVSIPSQLASMGKVGNRFAHCLQG 210
Query: 276 D--GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS--------- 324
D G G I G P TP R Y + + ++V G V S
Sbjct: 211 DNQGGGTIVIGSVSEPNISYTPIVSRN---HYAVGMQNIAVNGRNVTTPASFDTTSTSAG 267
Query: 325 -AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRSF 374
I DSGT+ YL DPAYTQ ++ + + L +C + F
Sbjct: 268 GVIMDSGTTLAYLVDPAYTQFVNAVSTFESSMFSSHSQCLQLAWCSLQADF 318
>gi|449442641|ref|XP_004139089.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 478
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 107/356 (30%), Positives = 155/356 (43%), Gaps = 52/356 (14%)
Query: 10 VCVLLILLSCCAGCCFGFGTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRY 69
V V L+LLS C GF F+ H++ KG +AL D
Sbjct: 7 VLVGLLLLSFCLP---GFCNLVFEVQHKF---------------KGRERSLNALKSHD-- 46
Query: 70 FRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLF 129
R GR L+ + G + + + L+Y + +G P F V +DTGSD+
Sbjct: 47 VRRHGRLLSV-------IDLELGGNGHPAET--GLYYARIGIGSPPNDFHVQVDTGSDIL 97
Query: 130 WLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSN---- 185
W+ +CV C + S V D +Y+P +SSTS+ + C+ C P G
Sbjct: 98 WV--NCVGCSNCPKKSDIGV-DLQLYNPKSSSTSTLITCDQPFCSATYDAPIPGCKPDLL 154
Query: 186 CPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQSKSVDSRISFGCGRVQTGSFLDGA-AP 242
C Y+V Y DG+ + G+ V D + L A ++ + I FGCG Q+G + A
Sbjct: 155 CQYKVIY-GDGSATAGYFVNDYIQLQRAVGNHKTSETNGSIVFGCGAKQSGELGSSSEAL 213
Query: 243 NGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGTGRISFGDKGSPGQGETPFSLRQT 301
+G+ G G +S+ S LA G + F+ C S G G + G+ P TP Q
Sbjct: 214 DGILGFGQANSSMISQLAATGKVKKIFAHCLDSISGGGIFAIGEVVEPKLKTTPVVPNQA 273
Query: 302 HPTYNITITQVSVGGNAVN-----FEFS----AIFDSGTSFTYLNDPAYTQISETF 348
H YN+ + V VG A++ FE S AI DSGT+ YL D Y + E
Sbjct: 274 H--YNVVLNGVKVGDTALDLPLGLFETSYKRGAIIDSGTTLAYLPDSIYLPLMEKI 327
>gi|115457778|ref|NP_001052489.1| Os04g0337000 [Oryza sativa Japonica Group]
gi|113564060|dbj|BAF14403.1| Os04g0337000, partial [Oryza sativa Japonica Group]
Length = 321
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 83/259 (32%), Positives = 127/259 (49%), Gaps = 25/259 (9%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
L+YT + +G P + V +DTGSD+ W+ +C+SC SG ++ +Y P SST
Sbjct: 32 LYYTEIGIGTPTKRYYVQVDTGSDILWV--NCISCDR-CPRKSGLGLELTLYDPKDSSTG 88
Query: 164 SKVPCNSTLCE-----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQ 216
SKV C+ C L C ++ C Y V Y DG+ +TG+ V D+L + + Q
Sbjct: 89 SKVSCDQGFCAATYGGLLPGCTTS-LPCEYSVTY-GDGSSTTGYFVSDLLQFDQVSGDGQ 146
Query: 217 SKSVDSRISFGCGRVQTGSF-LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
++ +S ++FGCG Q G A +G+ G G TS+ S L+ G + F+ C +
Sbjct: 147 TRPANSTVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCLDT 206
Query: 276 -DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF---------EFSA 325
+G G + G+ P TP L P YN+ + + VGG A+ +
Sbjct: 207 INGGGIFAIGNVVQPKVKTTP--LVPNMPHYNVNLKSIDVGGTALKLPSHMFDTGEKKGT 264
Query: 326 IFDSGTSFTYLNDPAYTQI 344
I DSGT+ TYL + Y +I
Sbjct: 265 IIDSGTTLTYLPEIVYKEI 283
>gi|357510893|ref|XP_003625735.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355500750|gb|AES81953.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 535
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 95/301 (31%), Positives = 139/301 (46%), Gaps = 32/301 (10%)
Query: 61 SALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIV 120
+ L RDR R GR L G + +D Y + L++T V +G PA F V
Sbjct: 32 TTLKARDRA-RHGGRILQDGGGGILDFSVQGTSDPYLVG----LYFTKVKMGSPAKEFYV 86
Query: 121 ALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQ--- 176
+DTGSD+ WL C+ C +C SSG ID N + +SST++ V C+ +C
Sbjct: 87 QIDTGSDILWLNCNTCNNC----PKSSGLGIDLNYFDTASSSTAALVSCSDPVCSYAVQT 142
Query: 177 --KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS--KSVDSRISFGCGRVQ 232
QC S + C Y +Y DG+ ++G+ V D ++ QS + S + FGC Q
Sbjct: 143 ATSQCSSQANQCSYTFQY-GDGSGTSGYYVYDAMYFDVIMGQSVFSNSSSTVVFGCSTYQ 201
Query: 233 TGSFL-DGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT--GRISFGDKGSP 289
+G A +G+FG G SV S +++QG+ P FS C G+ G + G+ P
Sbjct: 202 SGDLARTEKAVDGIFGFGPGALSVVSQVSSQGMAPKVFSHCLKGQGSGGGILVLGEILEP 261
Query: 290 GQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA---------IFDSGTSFTYLNDPA 340
TP Q H YN+ + ++V G + + I DSGT+ YL A
Sbjct: 262 NIVYTPLVPLQPH--YNLNLQSIAVNGQILPIDQDVFATGNNRGTIVDSGTTLAYLVQEA 319
Query: 341 Y 341
Y
Sbjct: 320 Y 320
>gi|38344991|emb|CAE01597.2| OSJNBa0008A08.5 [Oryza sativa Japonica Group]
gi|116309515|emb|CAH66581.1| OSIGBa0137O04.7 [Oryza sativa Indica Group]
gi|222628622|gb|EEE60754.1| hypothetical protein OsJ_14310 [Oryza sativa Japonica Group]
Length = 494
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 83/259 (32%), Positives = 127/259 (49%), Gaps = 25/259 (9%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
L+YT + +G P + V +DTGSD+ W+ +C+SC SG ++ +Y P SST
Sbjct: 88 LYYTEIGIGTPTKRYYVQVDTGSDILWV--NCISCDR-CPRKSGLGLELTLYDPKDSSTG 144
Query: 164 SKVPCNSTLCE-----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQ 216
SKV C+ C L C ++ C Y V Y DG+ +TG+ V D+L + + Q
Sbjct: 145 SKVSCDQGFCAATYGGLLPGCTTS-LPCEYSVTY-GDGSSTTGYFVSDLLQFDQVSGDGQ 202
Query: 217 SKSVDSRISFGCGRVQTGSF-LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
++ +S ++FGCG Q G A +G+ G G TS+ S L+ G + F+ C +
Sbjct: 203 TRPANSTVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCLDT 262
Query: 276 -DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF---------EFSA 325
+G G + G+ P TP L P YN+ + + VGG A+ +
Sbjct: 263 INGGGIFAIGNVVQPKVKTTP--LVPNMPHYNVNLKSIDVGGTALKLPSHMFDTGEKKGT 320
Query: 326 IFDSGTSFTYLNDPAYTQI 344
I DSGT+ TYL + Y +I
Sbjct: 321 IIDSGTTLTYLPEIVYKEI 339
>gi|449476186|ref|XP_004154665.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
2-like [Cucumis sativus]
Length = 478
Score = 118 bits (295), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 106/356 (29%), Positives = 155/356 (43%), Gaps = 52/356 (14%)
Query: 10 VCVLLILLSCCAGCCFGFGTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRY 69
V V L+LLS C GF F+ H++ KG +AL D
Sbjct: 7 VLVGLLLLSFCLP---GFCNLVFEVQHKF---------------KGRERSLNALKSHD-- 46
Query: 70 FRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLF 129
R GR L+ + G + + + L+Y + +G P F V +DTGSD+
Sbjct: 47 VRRHGRLLSV-------IDLELGGNGHPAET--GLYYARIGIGSPPNDFHVQVDTGSDIL 97
Query: 130 WLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSN---- 185
W+ +CV C + S V D +Y+P +SSTS+ + C+ C P G
Sbjct: 98 WV--NCVGCSNCPKKSDIGV-DLQLYNPKSSSTSTLITCDQPFCSATYDAPIPGCKPDLL 154
Query: 186 CPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQSKSVDSRISFGCGRVQTGSFLDGA-AP 242
C Y+V Y DG+ + G+ V D + L A ++ + I FGCG Q+G + A
Sbjct: 155 CQYKVIY-GDGSATAGYFVNDYIQLQRAVGNHKTSETNGSIVFGCGAKQSGELGSSSEAL 213
Query: 243 NGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGTGRISFGDKGSPGQGETPFSLRQT 301
+G+ G G +S+ S LA G + F+ C S G G + G+ P TP Q
Sbjct: 214 DGILGFGQANSSMISQLAATGKVKKIFAHCLDSISGGGIFAIGEVVEPKLXNTPVVPNQA 273
Query: 302 HPTYNITITQVSVGGNAVN-----FEFS----AIFDSGTSFTYLNDPAYTQISETF 348
H YN+ + V VG A++ FE S AI DSGT+ YL + Y + E
Sbjct: 274 H--YNVVLNGVKVGDTALDLPLGLFETSYKRGAIIDSGTTLAYLPESIYLPLMEKI 327
>gi|218194599|gb|EEC77026.1| hypothetical protein OsI_15382 [Oryza sativa Indica Group]
Length = 409
Score = 118 bits (295), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 83/259 (32%), Positives = 127/259 (49%), Gaps = 25/259 (9%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
L+YT + +G P + V +DTGSD+ W+ +C+SC SG ++ +Y P SST
Sbjct: 3 LYYTEIGIGTPTKRYYVQVDTGSDILWV--NCISCDR-CPRKSGLGLELTLYDPKDSSTG 59
Query: 164 SKVPCNSTLCE-----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQ 216
SKV C+ C L C ++ C Y V Y DG+ +TG+ V D+L + + Q
Sbjct: 60 SKVSCDQGFCAATYGGLLPGCTTS-LPCEYSVTY-GDGSSTTGYFVSDLLQFDQVSGDGQ 117
Query: 217 SKSVDSRISFGCGRVQTGSF-LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
++ +S ++FGCG Q G A +G+ G G TS+ S L+ G + F+ C +
Sbjct: 118 TRPANSTVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCLDT 177
Query: 276 -DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF---------EFSA 325
+G G + G+ P TP L P YN+ + + VGG A+ +
Sbjct: 178 INGGGIFAIGNVVQPKVKTTP--LVPNMPHYNVNLKSIDVGGTALKLPSHMFDTGEKKGT 235
Query: 326 IFDSGTSFTYLNDPAYTQI 344
I DSGT+ TYL + Y +I
Sbjct: 236 IIDSGTTLTYLPEIVYKEI 254
>gi|302817726|ref|XP_002990538.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
gi|300141706|gb|EFJ08415.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
Length = 434
Score = 117 bits (294), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 88/260 (33%), Positives = 124/260 (47%), Gaps = 30/260 (11%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L++T V +G P ++ + +DTGSDL W+ C C+ C + S I Y S++
Sbjct: 35 LYFTQVQLGTPPRTYNLQVDTGSDLLWVNCHPCIGC----PAFSDLKIPIVPYDVKASAS 90
Query: 163 SSKVPCNSTLCELQKQCPSAGSN----CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
SSKVPC+ C L Q +G N C Y +Y DG+ + G+LVEDVLH + +
Sbjct: 91 SSKVPCSDPSCTLITQISESGCNDQNQCGYSFQY-GDGSGTLGYLVEDVLHYMVNATAT- 148
Query: 219 SVDSRISFGCGRVQTGSF-LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--GS 275
+ FGCG Q+G A +G+ G G S S LA QG PN F+ C G
Sbjct: 149 -----VIFGCGFKQSGDLSTSERALDGIIGFGASDLSFNSQLAKQGKTPNVFAHCLDGGE 203
Query: 276 DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE---FS------AI 326
G G + G+ P TP +H YN+ + +SV + + FS I
Sbjct: 204 RGGGILVLGNVIEPDIQYTPLVPYMSH--YNVVLQSISVNNANLTIDPKLFSNDVMQGTI 261
Query: 327 FDSGTSFTYLNDPAYTQISE 346
FDSGT+ YL D AY ++
Sbjct: 262 FDSGTTLAYLPDEAYQAFTQ 281
>gi|293332531|ref|NP_001169558.1| uncharacterized protein LOC100383437 precursor [Zea mays]
gi|224030089|gb|ACN34120.1| unknown [Zea mays]
gi|413925069|gb|AFW65001.1| hypothetical protein ZEAMMB73_160528 [Zea mays]
Length = 491
Score = 117 bits (293), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 83/259 (32%), Positives = 119/259 (45%), Gaps = 25/259 (9%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L+YT + +G P + V +DTGSD+ W+ C C C H SG +D +Y P SST
Sbjct: 85 LYYTEIKLGTPPKHYYVQVDTGSDILWVNCITCEQCPH----KSGLGLDLTLYDPKASST 140
Query: 163 SSKVPCNSTLCE--LQKQCPSAGSN--CPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQ 216
S V C+ C + P G+N C Y V Y DG+ + G V D L T + Q
Sbjct: 141 GSMVMCDQAFCAATFGGKLPKCGANVPCEYSVTY-GDGSSTIGSFVTDALQFDQVTRDGQ 199
Query: 217 SKSVDSRISFGCGRVQTGSF-LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
++ ++ + FGCG Q G A +G+ G G TS+ S L G + F+ C +
Sbjct: 200 TQPANASVIFGCGAQQGGDLGSSNQALDGILGFGEANTSMLSQLTTAGKVKKIFAHCLDT 259
Query: 276 -DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF---------EFSA 325
G G S GD P TP L P YN+ + + VGG + +
Sbjct: 260 IKGGGIFSIGDVVQPKVKTTP--LVADKPHYNVNLKTIDVGGTTLQLPAHIFEPGEKKGT 317
Query: 326 IFDSGTSFTYLNDPAYTQI 344
I DSGT+ TYL + + ++
Sbjct: 318 IIDSGTTLTYLPELVFKEV 336
>gi|42569679|ref|NP_181205.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|330254186|gb|AEC09280.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 512
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 105/327 (32%), Positives = 153/327 (46%), Gaps = 42/327 (12%)
Query: 41 PVKGILAVDDLPKKGSFAYYSALAHRDRYFRLR-----GRGLAAQGNDKTPLTFSAGNDT 95
P++ +D+L + S L RDR R GR + G P+ S+ D
Sbjct: 43 PLQRAFPLDELVE------LSELRARDRVRHARILLGGGRQSSVGGVVDFPVQGSS--DP 94
Query: 96 YRLNS-LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFN 153
Y + S + L++T V +G P F V +DTGSD+ W+ C C +C H SSG ID +
Sbjct: 95 YLVGSKMTMLYFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPH----SSGLGIDLH 150
Query: 154 IYSPNTSSTSSKVPCNSTLCE-----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVL 208
+ S T+ V C+ +C QC S + C Y RY DG+ ++G+ + D
Sbjct: 151 FFDAPGSLTAGSVTCSDPICSSVFQTTAAQC-SENNQCGYSFRY-GDGSGTSGYYMTDTF 208
Query: 209 HLATDEKQSKSVDSR--ISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLI 265
+ +S +S I FGC Q+G A +G+FG G K SV S L+++G+
Sbjct: 209 YFDAILGESLVANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGIT 268
Query: 266 PNSFSMCFGSDGTGRISF--GDKGSPGQGETPFSLRQTHPTYNITITQVSVGG-----NA 318
P FS C DG+G F G+ PG +P L + P YN+ + + V G +A
Sbjct: 269 PPVFSHCLKGDGSGGGVFVLGEILVPGMVYSP--LVPSQPHYNLNLLSIGVNGQMLPLDA 326
Query: 319 VNFEFS----AIFDSGTSFTYLNDPAY 341
FE S I D+GT+ TYL AY
Sbjct: 327 AVFEASNTRGTIVDTGTTLTYLVKEAY 353
>gi|226508052|ref|NP_001150337.1| LOC100283967 precursor [Zea mays]
gi|195638522|gb|ACG38729.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 507
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 88/261 (33%), Positives = 128/261 (49%), Gaps = 30/261 (11%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L++T V +G PA F V +DTGSD+ W+ C C C +SSG I ++P++SST
Sbjct: 88 LYFTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGC----PTSSGLNIQLESFNPDSSST 143
Query: 163 SSKVPCNSTLCELQKQCPSA--------GSNCPYQVRYLSDGTMSTGFLVEDVLHLAT-- 212
+S++ C+ C Q A S C Y Y DG+ ++G+ V D + T
Sbjct: 144 ASRITCSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTY-GDGSGTSGYYVSDTMFFETVM 202
Query: 213 DEKQSKSVDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSM 271
+Q+ + + I FGC Q+G A +G+FG G + SV S L + G+ P FS
Sbjct: 203 GNEQTANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSH 262
Query: 272 CF-GSD-GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS----- 324
C GSD G G + G+ PG TP L + P YN+ + ++V G + + S
Sbjct: 263 CLKGSDNGGGILVLGEIVEPGLVYTP--LVPSQPHYNLNLESIAVNGQKLPIDSSLFTTS 320
Query: 325 ----AIFDSGTSFTYLNDPAY 341
I DSGT+ YL D AY
Sbjct: 321 NTQGTIVDSGTTLAYLADGAY 341
>gi|413952263|gb|AFW84912.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 509
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 88/261 (33%), Positives = 128/261 (49%), Gaps = 30/261 (11%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L++T V +G PA F V +DTGSD+ W+ C C C +SSG I ++P++SST
Sbjct: 90 LYFTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGC----PTSSGLNIQLESFNPDSSST 145
Query: 163 SSKVPCNSTLCELQKQCPSA--------GSNCPYQVRYLSDGTMSTGFLVEDVLHLAT-- 212
+S++ C+ C Q A S C Y Y DG+ ++G+ V D + T
Sbjct: 146 ASRITCSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTY-GDGSGTSGYYVSDTMFFETVM 204
Query: 213 DEKQSKSVDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSM 271
+Q+ + + I FGC Q+G A +G+FG G + SV S L + G+ P FS
Sbjct: 205 GNEQTANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSH 264
Query: 272 CF-GSD-GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS----- 324
C GSD G G + G+ PG TP L + P YN+ + ++V G + + S
Sbjct: 265 CLKGSDNGGGILVLGEIVEPGLVYTP--LVPSQPHYNLNLESIAVNGQKLPIDSSLFTTS 322
Query: 325 ----AIFDSGTSFTYLNDPAY 341
I DSGT+ YL D AY
Sbjct: 323 NTQGTIVDSGTTLAYLADGAY 343
>gi|242079765|ref|XP_002444651.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
gi|241941001|gb|EES14146.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
Length = 493
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 90/283 (31%), Positives = 126/283 (44%), Gaps = 26/283 (9%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L+YT V +G P F V +DTGSD+ W+ C C C H SG +D +Y P SST
Sbjct: 87 LYYTEVRLGTPPKRFYVQVDTGSDILWVNCITCDQCPH----KSGLGLDLTLYDPKASST 142
Query: 163 SSKVPCNSTLCE--LQKQCPSAGSN--CPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQ 216
S V C+ C + P +N C Y V Y DG+ + G V D L T + Q
Sbjct: 143 GSTVMCDQGFCADTFGGRLPKCSANVPCEYSVTY-GDGSSTVGSFVNDALQFDQVTGDGQ 201
Query: 217 SKSVDSRISFGCGRVQTGSFLDGA-APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
++ ++ + FGCG Q G + A +G+ G G TS+ S LA G + F+ C +
Sbjct: 202 TQPANASVIFGCGAQQGGDLGSSSQALDGILGFGEANTSMLSQLATAGKVKKIFAHCLDT 261
Query: 276 -DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF---------EFSA 325
G G + GD P TP L P YN+ + + VGG + +
Sbjct: 262 IKGGGIFAIGDVVQPKVKTTP--LVADKPHYNVNLKTIDVGGTTLELPADIFKPGEKRGT 319
Query: 326 IFDSGTSFTYLNDPAYTQIS-ETFNSLAKEKRETSTSDLPFEY 367
I DSGT+ TYL + + ++ FN L FEY
Sbjct: 320 IIDSGTTLTYLPELVFKKVMLAVFNKHQDITFHDVQDFLCFEY 362
>gi|224068901|ref|XP_002326227.1| predicted protein [Populus trichocarpa]
gi|222833420|gb|EEE71897.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 101/343 (29%), Positives = 153/343 (44%), Gaps = 48/343 (13%)
Query: 56 SFAYYSALAHRDRYFRLRGRGLAA---------------QGNDKTPLTFSA--GNDTYRL 98
S Y ++L H +R F L GL QG + FS +D Y +
Sbjct: 4 SAVYCASLLHLERAFPLNNHGLELHQLRARDRLRHARLLQGFVGGVVDFSVQGSSDPYLV 63
Query: 99 NSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSP 157
L++T V +G P F V +DTGSD+ W+ C+ C +C +SG I N +
Sbjct: 64 G----LYFTKVKLGSPPREFNVQIDTGSDVLWVCCNSCNNCPR----TSGLGIQLNFFDS 115
Query: 158 NTSSTSSKVPCNSTLCE-----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLAT 212
++SST+ +V C+ +C QC S C Y +Y DG+ ++G+ V D L+
Sbjct: 116 SSSSTAGQVRCSDPICTSAVQTTATQCSSQTDQCSYTFQY-GDGSGTSGYYVSDTLYFDA 174
Query: 213 DEKQS--KSVDSRISFGCGRVQTGSFLD-GAAPNGLFGLGMDKTSVPSILANQGLIPNSF 269
QS + + I FGC Q+G A +G+FG G + SV S L+ +G+ P F
Sbjct: 175 ILGQSLIDNSSALIVFGCSAYQSGDLTKTDKAVDGIFGFGQGELSVISQLSTRGITPRVF 234
Query: 270 SMCFGSDGT--GRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA-- 325
S C DG+ G + G+ PG +P L + P YN+ + ++V G + + +A
Sbjct: 235 SHCLKGDGSGGGILVLGEILEPGIVYSP--LVPSQPHYNLNLLSIAVNGQLLPIDPAAFA 292
Query: 326 -------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTS 361
I DSGT+ YL AY N++ TS
Sbjct: 293 TSNSQGTIVDSGTTLAYLVAEAYDPFVSAVNAIVSPSVTPITS 335
>gi|356570798|ref|XP_003553571.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 500
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 93/301 (30%), Positives = 138/301 (45%), Gaps = 34/301 (11%)
Query: 62 ALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVA 121
AL RDR GR L + +D Y + L++T V +G PA F V
Sbjct: 46 ALRARDR--ARHGRILQGVVGGVVDFSVQGTSDPYFVG----LYFTKVKLGSPAKEFYVQ 99
Query: 122 LDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQ-- 178
+DTGSD+ W+ C C +C H SSG I+ + + SST++ V C +C Q
Sbjct: 100 IDTGSDILWINCITCSNCPH----SSGLGIELDFFDTAGSSTAALVSCGDPICSYAVQTA 155
Query: 179 ---CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLAT---DEKQSKSVDSRISFGCGRVQ 232
C S + C Y +Y DG+ +TG+ V D ++ T + + S I FGC Q
Sbjct: 156 TSECSSQANQCSYTFQY-GDGSGTTGYYVSDTMYFDTVLLGQSVVANSSSTIIFGCSTYQ 214
Query: 233 TGSFLD-GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--GSDGTGRISFGDKGSP 289
+G A +G+FG G SV S L+++G+ P FS C G +G G + G+ P
Sbjct: 215 SGDLTKTDKAVDGIFGFGPGALSVISQLSSRGVTPKVFSHCLKGGENGGGVLVLGEILEP 274
Query: 290 GQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA---------IFDSGTSFTYLNDPA 340
+P L + P YN+ + ++V G + + + I DSGT+ YL A
Sbjct: 275 SIVYSP--LVPSQPHYNLNLQSIAVNGQLLPIDSNVFATTNNQGTIVDSGTTLAYLVQEA 332
Query: 341 Y 341
Y
Sbjct: 333 Y 333
>gi|302803839|ref|XP_002983672.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
gi|300148509|gb|EFJ15168.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
Length = 388
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 88/260 (33%), Positives = 123/260 (47%), Gaps = 30/260 (11%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L++T V +G P ++ + +DTGSDL W+ C C+ C + S I Y S++
Sbjct: 35 LYFTQVQLGTPPRTYNLQVDTGSDLLWVNCHPCIGC----PAFSDLKIPIVPYDVKASAS 90
Query: 163 SSKVPCNSTLCELQKQCPSAGSN----CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
SSKVPC+ C L Q +G N C Y +Y DG+ + G+LVEDVLH + +
Sbjct: 91 SSKVPCSDPSCTLITQISESGCNDQNQCGYSFQY-GDGSGTLGYLVEDVLHYMVNATAT- 148
Query: 219 SVDSRISFGCGRVQTGSF-LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--GS 275
+ FGCG Q+G A +G+ G G S S LA QG PN F+ C G
Sbjct: 149 -----VIFGCGFKQSGDLSTSERALDGIIGFGASDLSFNSQLAKQGKTPNVFAHCLDGGE 203
Query: 276 DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE---FS------AI 326
G G + G+ P TP H YN+ + +SV + + FS I
Sbjct: 204 RGGGILVLGNVIEPDIQYTPLVPYMYH--YNVVLQSISVNNANLTIDPKLFSNDVMQGTI 261
Query: 327 FDSGTSFTYLNDPAYTQISE 346
FDSGT+ YL D AY ++
Sbjct: 262 FDSGTTLAYLPDEAYQAFTQ 281
>gi|194707632|gb|ACF87900.1| unknown [Zea mays]
Length = 423
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 88/261 (33%), Positives = 130/261 (49%), Gaps = 30/261 (11%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L++T V +G PA F V +DTGSD+ W+ C C C +SSG I ++P++SST
Sbjct: 4 LYFTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGC----PTSSGLNIQLESFNPDSSST 59
Query: 163 SSKVPCNSTLCELQKQ-----CPSAGSN---CPYQVRYLSDGTMSTGFLVEDVLHLAT-- 212
+S++ C+ C Q C ++ S C Y Y DG+ ++G+ V D + T
Sbjct: 60 ASRITCSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTY-GDGSGTSGYYVSDTMFFETVM 118
Query: 213 DEKQSKSVDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSM 271
+Q+ + + I FGC Q+G A +G+FG G + SV S L + G+ P FS
Sbjct: 119 GNEQTANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSH 178
Query: 272 CF-GSD-GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS----- 324
C GSD G G + G+ PG TP L + P YN+ + ++V G + + S
Sbjct: 179 CLKGSDNGGGILVLGEIVEPGLVYTP--LVPSQPHYNLNLESIAVNGQKLPIDSSLFTTS 236
Query: 325 ----AIFDSGTSFTYLNDPAY 341
I DSGT+ YL D AY
Sbjct: 237 NTQGTIVDSGTTLAYLADGAY 257
>gi|255556768|ref|XP_002519417.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223541280|gb|EEF42831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 494
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 84/258 (32%), Positives = 127/258 (49%), Gaps = 27/258 (10%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L++T V +G P F V +DTGSD+ W+ C C +C +SG I N + +SST
Sbjct: 80 LYFTRVKLGTPPREFNVQIDTGSDVLWVTCSSCSNCPQ----TSGLGIQLNYFDTTSSST 135
Query: 163 SSKVPCNSTLCELQ-----KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
+ VPC+ +C Q QCP + C Y +Y DG+ ++G+ V D + +S
Sbjct: 136 ARLVPCSHPICTSQIQTTATQCPPQSNQCSYAFQY-GDGSGTSGYYVSDTFYFDAVLGES 194
Query: 218 KSVDSR--ISFGCGRVQTGSFLD-GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF- 273
+S I FGC Q+G A +G+FG G + SV S L++ G+ P FS C
Sbjct: 195 LIANSSAAIVFGCSTYQSGDLTKTDKAVDGIFGFGQGELSVISQLSSHGITPRVFSHCLK 254
Query: 274 GSD-GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA------- 325
G D G G + G+ PG +P L + P YN+ + ++V G + + +A
Sbjct: 255 GEDSGGGILVLGEILEPGIVYSP--LVPSQPHYNLDLQSIAVSGQLLPIDPAAFATSSNR 312
Query: 326 --IFDSGTSFTYLNDPAY 341
I D+GT+ YL + AY
Sbjct: 313 GTIIDTGTTLAYLVEEAY 330
>gi|168063189|ref|XP_001783556.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664943|gb|EDQ51645.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 414
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 91/285 (31%), Positives = 127/285 (44%), Gaps = 34/285 (11%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
L+Y + +G PA + + +DTGSDL WL CD C SC G + +Y P +
Sbjct: 30 LYYMAMRIGNPAKLYYLDMDTGSDLTWLQCDAPCRSCAVGPH---------GLYDPKRAR 80
Query: 162 TSSKVPCNSTLC-ELQK----QCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
V C C ++Q+ C C Y+V Y+ DG+ + G LVED + L
Sbjct: 81 V---VDCRRPTCAQVQRGGQFTCSGDVRQCDYEVDYV-DGSSTMGILVEDTITLVL--TN 134
Query: 217 SKSVDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-- 273
+R GCG Q G+ A +G+ GL K S+PS LA +G+ N C
Sbjct: 135 GTRFQTRAVIGCGYDQQGTLAKAPAVTDGVIGLSSSKISLPSQLAAKGIANNVIGHCLAG 194
Query: 274 GSDGTGRISFGDKGSPGQG--ETPFSLRQTHPTYNITITQVSVGGNAVNFEFS------A 325
GS+G G + FGD P G TP R Y + + GG + E + A
Sbjct: 195 GSNGGGYLFFGDTLVPALGMTWTPMIGRPLVEGYQARLRSIKYGGEVLELEGTTDDVGGA 254
Query: 326 IFDSGTSFTYLNDPAYTQI-SETFNSLAKEKRETSTSDLPFEYCY 369
+FDSGTSFTYL AYT + S + E +D +C+
Sbjct: 255 MFDSGTSFTYLVPNAYTAVLSAVVRQAQRSGLERIKTDTTLPFCW 299
>gi|356505293|ref|XP_003521426.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 499
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 93/301 (30%), Positives = 138/301 (45%), Gaps = 34/301 (11%)
Query: 62 ALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVA 121
AL RDR GR L + +D Y + L++T V +G PA F V
Sbjct: 46 ALRARDR--ARHGRILQGVVGGVVDFSVQGTSDPYFVG----LYFTKVKLGSPAKDFYVQ 99
Query: 122 LDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQ-- 178
+DTGSD+ W+ C C +C H SSG I+ + + SST++ V C +C Q
Sbjct: 100 IDTGSDILWINCITCSNCPH----SSGLGIELDFFDTAGSSTAALVSCADPICSYAVQTA 155
Query: 179 ---CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLAT---DEKQSKSVDSRISFGCGRVQ 232
C S + C Y +Y DG+ +TG+ V D ++ T + + S I FGC Q
Sbjct: 156 TSGCSSQANQCSYTFQY-GDGSGTTGYYVSDTMYFDTVLLGQSMVANSSSTIVFGCSTYQ 214
Query: 233 TGSFLD-GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--GSDGTGRISFGDKGSP 289
+G A +G+FG G SV S L+++G+ P FS C G +G G + G+ P
Sbjct: 215 SGDLTKTDKAVDGIFGFGPGALSVISQLSSRGVTPKVFSHCLKGGENGGGVLVLGEILEP 274
Query: 290 GQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA---------IFDSGTSFTYLNDPA 340
+P L + P YN+ + ++V G + + + I DSGT+ YL A
Sbjct: 275 SIVYSP--LVPSLPHYNLNLQSIAVNGQLLPIDSNVFATTNNQGTIVDSGTTLAYLVQEA 332
Query: 341 Y 341
Y
Sbjct: 333 Y 333
>gi|357148754|ref|XP_003574882.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 488
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 97/316 (30%), Positives = 137/316 (43%), Gaps = 42/316 (13%)
Query: 51 LPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGF-----LH 105
P+ GS AH RGR LAA PL LG L+
Sbjct: 38 FPRLGSKGGGDITAHLTHDSNRRGRLLAAA---DVPL-----------GGLGLPTDTGLY 83
Query: 106 YTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
YT + +G P + V +DTGSD+ W+ +C+SC + S ID +Y P SS+ S
Sbjct: 84 YTEIEIGTPPKQYHVQVDTGSDILWV--NCISC-NKCPRKSDLGIDLRLYDPKGSSSGST 140
Query: 166 VPCNSTLCELQ--KQCPSAGSN--CPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQSKS 219
V C+ C + P N C Y V Y DG+ +TG+ V D L + + Q++
Sbjct: 141 VSCDQKFCAATYGGKLPGCAKNIPCEYSVMY-GDGSSTTGYFVSDSLQYNQVSGDGQTRH 199
Query: 220 VDSRISFGCGRVQTGSF-LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DG 277
++ + FGCG Q G A +G+ G G TS+ S LA G + FS C + G
Sbjct: 200 ANASVIFGCGAQQGGDLGSTNQALDGIIGFGQSNTSMLSQLAAAGEVKKIFSHCLDTIKG 259
Query: 278 TGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF---------EFSAIFD 328
G + GD P TP L P YN+ + ++VGG + + I D
Sbjct: 260 GGIFAIGDVVQPKVKSTP--LVPDMPHYNVNLESINVGGTTLQLPSHMFETGEKKGTIID 317
Query: 329 SGTSFTYLNDPAYTQI 344
SGT+ TYL + Y +
Sbjct: 318 SGTTLTYLPELVYKDV 333
>gi|357133002|ref|XP_003568117.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 497
Score = 115 bits (287), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 92/283 (32%), Positives = 130/283 (45%), Gaps = 26/283 (9%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+YT + +G P F V +DTGSD+ W+ +CVSC + SG ID +Y P SS+ S
Sbjct: 87 YYTKIEIGTPPKPFHVQVDTGSDILWV--NCVSC-DKCPTKSGLGIDLALYDPKGSSSGS 143
Query: 165 KVPCNSTLCELQ----KQCP--SAGSNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQ 216
V C++ C ++ P +AG C Y+ Y DG+ + G V D L + Q
Sbjct: 144 AVSCDNKFCAATYGSGEKLPGCTAGKPCEYRAEY-GDGSSTAGSFVSDSLQYNQLSGNAQ 202
Query: 217 SKSVDSRISFGCGRVQTGSFLD-GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
++ + + FGCG Q G A +G+ G G TS S LA+ G + FS C +
Sbjct: 203 TRHAKANVIFGCGAQQGGDLESTNQALDGIIGFGQSNTSTLSQLASAGEVKKIFSHCLDT 262
Query: 276 -DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVN-----FEFS----A 325
G G + G+ P TP +H YN+ + + V GNA+ FE S
Sbjct: 263 IKGGGIFAIGEVVQPKVKSTPLLPNMSH--YNVNLQSIDVAGNALQLPPHIFETSEKRGT 320
Query: 326 IFDSGTSFTYLNDPAYTQI-SETFNSLAKEKRETSTSDLPFEY 367
I DSGT+ TYL + Y I + F T L FEY
Sbjct: 321 IIDSGTTLTYLPELVYKDILAAVFQKHQDITFRTIQGFLCFEY 363
>gi|4415912|gb|AAD20143.1| putative protease [Arabidopsis thaliana]
Length = 469
Score = 114 bits (286), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 104/326 (31%), Positives = 151/326 (46%), Gaps = 45/326 (13%)
Query: 41 PVKGILAVDDLPKKGSFAYYSALAHRDRYFRLR-----GRGLAAQGNDKTPLTFSAGNDT 95
P++ +D+L + S L RDR R GR + G P+ S+ D
Sbjct: 43 PLQRAFPLDELVE------LSELRARDRVRHARILLGGGRQSSVGGVVDFPVQGSS--DP 94
Query: 96 YRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNI 154
Y + L++T V +G P F V +DTGSD+ W+ C C +C H SSG ID +
Sbjct: 95 YLVG----LYFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPH----SSGLGIDLHF 146
Query: 155 YSPNTSSTSSKVPCNSTLCE-----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLH 209
+ S T+ V C+ +C QC S + C Y RY DG+ ++G+ + D +
Sbjct: 147 FDAPGSLTAGSVTCSDPICSSVFQTTAAQC-SENNQCGYSFRY-GDGSGTSGYYMTDTFY 204
Query: 210 LATDEKQSKSVDSR--ISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIP 266
+S +S I FGC Q+G A +G+FG G K SV S L+++G+ P
Sbjct: 205 FDAILGESLVANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITP 264
Query: 267 NSFSMCFGSDGTGRISF--GDKGSPGQGETPFSLRQTHPTYNITITQVSVGG-----NAV 319
FS C DG+G F G+ PG +P L + P YN+ + + V G +A
Sbjct: 265 PVFSHCLKGDGSGGGVFVLGEILVPGMVYSP--LVPSQPHYNLNLLSIGVNGQMLPLDAA 322
Query: 320 NFEFS----AIFDSGTSFTYLNDPAY 341
FE S I D+GT+ TYL AY
Sbjct: 323 VFEASNTRGTIVDTGTTLTYLVKEAY 348
>gi|42571079|ref|NP_973613.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|110737616|dbj|BAF00749.1| putative protease [Arabidopsis thaliana]
gi|330254187|gb|AEC09281.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 507
Score = 114 bits (285), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 104/326 (31%), Positives = 151/326 (46%), Gaps = 45/326 (13%)
Query: 41 PVKGILAVDDLPKKGSFAYYSALAHRDRYFRLR-----GRGLAAQGNDKTPLTFSAGNDT 95
P++ +D+L + S L RDR R GR + G P+ S+ D
Sbjct: 43 PLQRAFPLDELVE------LSELRARDRVRHARILLGGGRQSSVGGVVDFPVQGSS--DP 94
Query: 96 YRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNI 154
Y + L++T V +G P F V +DTGSD+ W+ C C +C H SSG ID +
Sbjct: 95 YLVG----LYFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPH----SSGLGIDLHF 146
Query: 155 YSPNTSSTSSKVPCNSTLCE-----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLH 209
+ S T+ V C+ +C QC S + C Y RY DG+ ++G+ + D +
Sbjct: 147 FDAPGSLTAGSVTCSDPICSSVFQTTAAQC-SENNQCGYSFRY-GDGSGTSGYYMTDTFY 204
Query: 210 LATDEKQSKSVDSR--ISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIP 266
+S +S I FGC Q+G A +G+FG G K SV S L+++G+ P
Sbjct: 205 FDAILGESLVANSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITP 264
Query: 267 NSFSMCFGSDGTGRISF--GDKGSPGQGETPFSLRQTHPTYNITITQVSVGG-----NAV 319
FS C DG+G F G+ PG +P L + P YN+ + + V G +A
Sbjct: 265 PVFSHCLKGDGSGGGVFVLGEILVPGMVYSP--LVPSQPHYNLNLLSIGVNGQMLPLDAA 322
Query: 320 NFEFS----AIFDSGTSFTYLNDPAY 341
FE S I D+GT+ TYL AY
Sbjct: 323 VFEASNTRGTIVDTGTTLTYLVKEAY 348
>gi|356529585|ref|XP_003533370.1| PREDICTED: lysine-specific histone demethylase 1 homolog 1-like
[Glycine max]
Length = 1388
Score = 114 bits (285), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 87/284 (30%), Positives = 132/284 (46%), Gaps = 33/284 (11%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
L++T + VG P S+ + +DTGSDL W+ CD C+SC G + +Y P S+
Sbjct: 191 LYFTILRVGNPPKSYFLDVDTGSDLTWMQCDAPCISCGKGAHV---------LYKPTRSN 241
Query: 162 TSSKVPCNSTLC-ELQKQCPSAGSN-----CPYQVRYLSDGTMSTGFLVEDVLHLATDEK 215
S V LC ++QK + + C Y+++Y +D + S G LV D LHL T
Sbjct: 242 VVSSV---DALCLDVQKNQKNGHHDESLLQCDYEIQY-ADHSSSLGVLVRDELHLVTTNG 297
Query: 216 QSKSVDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG 274
++ + FGCG Q G L+ +G+ GL K S+P LA++GLI N C
Sbjct: 298 SKTKLN--VVFGCGYDQAGLLLNTLGKTDGIMGLSRAKVSLPYQLASKGLIKNVVGHCLS 355
Query: 275 SDGT--GRISFGDKGSP--GQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA----- 325
+DG G + GD P G P + T Y I ++ G + F+ +
Sbjct: 356 NDGAGGGYMFLGDDFVPYWGMNWVPMAYTLTTDLYQTEILGINYGNRQLRFDGQSKVGKM 415
Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
+FDSG+S+TY AY + + N ++ SD C+
Sbjct: 416 VFDSGSSYTYFPKEAYLDLVASLNEVSGLGLVQDDSDTTLPICW 459
>gi|297827153|ref|XP_002881459.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297327298|gb|EFH57718.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 507
Score = 114 bits (284), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 101/306 (33%), Positives = 143/306 (46%), Gaps = 39/306 (12%)
Query: 61 SALAHRDRYFRLR-----GRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPA 115
S L RDR R GR + G P+ S+ D Y + L++T V +G P
Sbjct: 57 SELRARDRVRHARILLGGGRQSSVGGVVDFPVQGSS--DPYLVG----LYFTKVKLGSPP 110
Query: 116 LSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE 174
F V +DTGSD+ W+ C C +C H SSG ID + + S T+ V C+ +C
Sbjct: 111 TEFNVQIDTGSDILWVTCSSCSNCPH----SSGLGIDLHFFDAPGSFTAGSVTCSDPICS 166
Query: 175 -----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR--ISFG 227
QC S + C Y RY DG+ ++G+ + D + +S +S I FG
Sbjct: 167 SVFQTTAAQC-SENNQCGYSFRY-GDGSGTSGYYMTDTFYFDAILGESLVANSSAPIVFG 224
Query: 228 CGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISF--G 284
C Q+G A +G+FG G K SV S L+++G+ P FS C DG+G F G
Sbjct: 225 CSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGDGSGGGVFVLG 284
Query: 285 DKGSPGQGETPFSLRQTHPTYNITITQVSVGG-----NAVNFEFS----AIFDSGTSFTY 335
+ PG +P L + P YN+ + + V G +A FE S I D+GT+ TY
Sbjct: 285 EILVPGMVYSP--LLPSQPHYNLNLLSIGVNGQILPIDAAVFEASNTRGTIVDTGTTLTY 342
Query: 336 LNDPAY 341
L AY
Sbjct: 343 LVKEAY 348
>gi|297848856|ref|XP_002892309.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297338151|gb|EFH68568.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 484
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 99/310 (31%), Positives = 149/310 (48%), Gaps = 37/310 (11%)
Query: 68 RYFRLRGRGLA-AQGNDKTPLTFSAGND-----TYRLNSLGFLHYTNVSVGQPALSFIVA 121
RY RL+G A + +D+ LT AG D T R + G L+Y + +G PA S+ V
Sbjct: 38 RYPRLQGSLSALKEHDDRRQLTILAGIDLPLGGTGRPDIPG-LYYAKIGIGTPAKSYYVQ 96
Query: 122 LDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCP 180
+DTGSD+ W+ C C C S+ G I+ +Y+ + S + V C+ C P
Sbjct: 97 VDTGSDIMWVNCIQCKQCPR--RSTLG--IELTLYNIDESDSGKLVSCDDDFCYQISGGP 152
Query: 181 SAG----SNCPYQVRYLSDGTMSTGFLVEDVLH---LATDEKQSKSVDSRISFGCGRVQT 233
+G +CPY Y DG+ + G+ V+DV+ +A D K +++ + + FGCG Q+
Sbjct: 153 LSGCKANMSCPYLEIY-GDGSSTAGYFVKDVVQYDSVAGDLK-TQTANGSVIFGCGARQS 210
Query: 234 GSFLDGA---APNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-GSDGTGRISFGDKGSP 289
G LD + A +G+ G G +S+ S LA+ G + F+ C G +G G + G P
Sbjct: 211 GD-LDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCLDGRNGGGIFAIGRVVQP 269
Query: 290 GQGETPFSLRQTHPTYNITITQVSVGGNAVNF---------EFSAIFDSGTSFTYLNDPA 340
TP Q H YN+ +T V VG +N AI DSGT+ YL +
Sbjct: 270 KVNMTPLVPNQPH--YNVNMTAVQVGQEFLNIPADLFQPGDRKGAIIDSGTTLAYLPEII 327
Query: 341 YTQISETFNS 350
Y + + S
Sbjct: 328 YEPLVKKITS 337
>gi|356542694|ref|XP_003539801.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 489
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 90/270 (33%), Positives = 129/270 (47%), Gaps = 25/270 (9%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
L+YT V +G P F V +DTGSD+ W+ C SC +G +SG I N + P +SSTS
Sbjct: 76 LYYTKVKLGTPPREFYVQIDTGSDVLWVSCG--SC-NGCPQTSGLQIQLNYFDPRSSSTS 132
Query: 164 SKVP-----CNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
S + C S + C S + C Y +Y DG+ ++G+ V D++H A + +
Sbjct: 133 SLISCSDRRCRSGVQTSDASCSSQNNQCTYTFQY-GDGSGTSGYYVSDLMHFAGIFEGTL 191
Query: 219 SVDSRIS--FGCGRVQTGSFLDGA-APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
+ +S S FGC +QTG A +G+FG G SV S L+ QG+ P FS C
Sbjct: 192 TTNSSASVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSLQGIAPRVFSHCLKG 251
Query: 276 D--GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF---------EFS 324
D G G + G+ P +P L Q+ P YN+ + +SV G V
Sbjct: 252 DNSGGGVLVLGEIVEPNIVYSP--LVQSQPHYNLNLQSISVNGQIVPIAPAVFATSNNRG 309
Query: 325 AIFDSGTSFTYLNDPAYTQISETFNSLAKE 354
I DSGT+ YL + AY +L +
Sbjct: 310 TIVDSGTTLAYLAEEAYNPFVNAITALVPQ 339
>gi|356539352|ref|XP_003538162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 489
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 88/257 (34%), Positives = 126/257 (49%), Gaps = 25/257 (9%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
L+YT V +G P V +DTGSD+ W+ C SC +G +SG I N + P +SSTS
Sbjct: 76 LYYTKVKLGTPPRELYVQIDTGSDVLWVSCG--SC-NGCPQTSGLQIQLNYFDPGSSSTS 132
Query: 164 SKVPCNSTLCELQKQ-----CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
S + C C Q C + C Y +Y DG+ ++G+ V D++H A+ + +
Sbjct: 133 SLISCLDRRCRSGVQTSDASCSGRNNQCTYTFQY-GDGSGTSGYYVSDLMHFASIFEGTL 191
Query: 219 SVDSRIS--FGCGRVQTGSFLDGA-APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
+ +S S FGC +QTG A +G+FG G SV S L++QG+ P FS C
Sbjct: 192 TTNSSASVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSSQGIAPRVFSHCLKG 251
Query: 276 D--GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA-------- 325
D G G + G+ P +P L + P YN+ + +SV G V S
Sbjct: 252 DNSGGGVLVLGEIVEPNIVYSP--LVPSQPHYNLNLQSISVNGQIVRIAPSVFATSNNRG 309
Query: 326 -IFDSGTSFTYLNDPAY 341
I DSGT+ YL + AY
Sbjct: 310 TIVDSGTTLAYLAEEAY 326
>gi|449482385|ref|XP_004156266.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 491
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 84/278 (30%), Positives = 127/278 (45%), Gaps = 29/278 (10%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWL---PCDCVSCVHGLNSSSGQVIDFNIYSPNTS 160
L++T V +G PA F V +DTGSD+ W+ PCD G SSG I+ N++ S
Sbjct: 83 LYFTKVKLGNPAREFNVQIDTGSDILWVTCSPCD------GCPDSSGLGIELNLFDTTKS 136
Query: 161 STSSKVPCNSTLCE----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLH--LATDE 214
S++ +PC +C QC + +C Y Y D + ++GF V D +H + E
Sbjct: 137 SSARVLPCTDPICAAVSTTTDQCLTQTDHCSYSFHY-RDRSGTSGFYVTDSMHFDILLGE 195
Query: 215 KQSKSVDSRISFGCGRVQTGSFLDGA-APNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
+ + I FGC Q G A +G+FG G + SV S L+++G+ P FS C
Sbjct: 196 STIANSSATIVFGCSIYQYGDLTRATKALDGIFGFGQGEFSVISQLSSRGITPKVFSHCL 255
Query: 274 --GSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGG----NAVNFEFS--- 324
G +G G + G+ P +P L + P Y + + +++ G N F S
Sbjct: 256 KGGENGGGILVLGEILEPSIVYSP--LIPSQPHYTLKLQSIALSGQLFPNPTMFPISNAG 313
Query: 325 -AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTS 361
I DSGT+ YL + Y I S + + S
Sbjct: 314 ETIIDSGTTLAYLVEEVYDWIVSVITSAVSQSATPTIS 351
>gi|18390579|ref|NP_563751.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|332189782|gb|AEE27903.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 485
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 98/310 (31%), Positives = 148/310 (47%), Gaps = 37/310 (11%)
Query: 68 RYFRLRGRGLA-AQGNDKTPLTFSAGND-----TYRLNSLGFLHYTNVSVGQPALSFIVA 121
RY RL+G A + +D+ LT AG D T R + G L+Y + +G PA S+ V
Sbjct: 38 RYPRLQGSLTALKEHDDRRQLTILAGIDLPLGGTGRPDIPG-LYYAKIGIGTPAKSYYVQ 96
Query: 122 LDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCP 180
+DTGSD+ W+ C C C S+ G I+ +Y+ + S + V C+ C P
Sbjct: 97 VDTGSDIMWVNCIQCKQCPR--RSTLG--IELTLYNIDESDSGKLVSCDDDFCYQISGGP 152
Query: 181 SAG----SNCPYQVRYLSDGTMSTGFLVEDVLH---LATDEKQSKSVDSRISFGCGRVQT 233
+G +CPY Y DG+ + G+ V+DV+ +A D K +++ + + FGCG Q+
Sbjct: 153 LSGCKANMSCPYLEIY-GDGSSTAGYFVKDVVQYDSVAGDLK-TQTANGSVIFGCGARQS 210
Query: 234 GSFLDGA---APNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-GSDGTGRISFGDKGSP 289
G LD + A +G+ G G +S+ S LA+ G + F+ C G +G G + G P
Sbjct: 211 GD-LDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCLDGRNGGGIFAIGRVVQP 269
Query: 290 GQGETPFSLRQTHPTYNITITQVSVGGNAVNF---------EFSAIFDSGTSFTYLNDPA 340
TP Q H YN+ +T V VG + AI DSGT+ YL +
Sbjct: 270 KVNMTPLVPNQPH--YNVNMTAVQVGQEFLTIPADLFQPGDRKGAIIDSGTTLAYLPEII 327
Query: 341 YTQISETFNS 350
Y + + S
Sbjct: 328 YEPLVKKITS 337
>gi|242057809|ref|XP_002458050.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
gi|241930025|gb|EES03170.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
Length = 489
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 84/265 (31%), Positives = 126/265 (47%), Gaps = 24/265 (9%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
L++T + +G P + V +DTGSD+ W+ +C+SC SG +D Y P SS+
Sbjct: 83 LYFTEIKLGTPPKRYYVQVDTGSDILWV--NCISC-EKCPRKSGLGLDLTFYDPKASSSG 139
Query: 164 SKVPCNSTLCELQ--KQCPSAGSN--CPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQS 217
S V C+ C + P +N C Y V Y DG+ +TGF V D L T + Q+
Sbjct: 140 STVSCDQGFCAATYGGKLPGCTANVPCEYSVMY-GDGSSTTGFFVTDALQFDQVTGDGQT 198
Query: 218 KSVDSRISFGCGRVQTGSF-LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS- 275
+ ++ ++FGCG Q G A +G+ G G TS+ S LA G + F+ C +
Sbjct: 199 QPGNATVTFGCGAQQGGDLGSSNQALDGILGFGQANTSMLSQLAAAGKVKKIFAHCLDTI 258
Query: 276 DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVN-----FEF----SAI 326
G G + G+ P TP L P YN+ + + VGG + FE I
Sbjct: 259 KGGGIFAIGNVVQPKVKTTP--LVADMPHYNVNLKSIDVGGTTLQLPAHVFETGERKGTI 316
Query: 327 FDSGTSFTYLNDPAYTQI-SETFNS 350
DSGT+ TYL + + ++ + FN
Sbjct: 317 IDSGTTLTYLPELVFKEVMAAIFNK 341
>gi|356522749|ref|XP_003530008.1| PREDICTED: lysine-specific histone demethylase 1 homolog 1-like
[Glycine max]
Length = 1336
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 87/283 (30%), Positives = 130/283 (45%), Gaps = 31/283 (10%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
L++T + VG P S+ + +DTGSDL W+ CD C SC G + Y P S+
Sbjct: 193 LYFTILRVGNPPKSYFLDVDTGSDLTWMQCDAPCRSCGKGAHVQ---------YKPTRSN 243
Query: 162 TSSKVPCNSTLCELQKQCPSAGSN-----CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
S V +S ++QK + + C Y+++Y +D + S G LV D LHL T
Sbjct: 244 VVSSV--DSLCLDVQKNQKNGHHDESLLQCDYEIQY-ADHSSSLGVLVRDELHLVTTNGS 300
Query: 217 SKSVDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
++ + FGCG Q G L+ A +G+ GL K S+P LA++GLI N C +
Sbjct: 301 KTKLN--VVFGCGYDQEGLILNTLAKTDGIMGLSRAKVSLPYQLASKGLIKNVVGHCLSN 358
Query: 276 DGT--GRISFGDKGSP--GQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA-----I 326
DG G + GD P G P + T Y I ++ G + F+ +
Sbjct: 359 DGAGGGYMFLGDDFVPYWGMNWVPMAYTLTTDLYQTEILGINYGNRQLKFDGQSKVGKVF 418
Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
FDSG+S+TY AY + + N ++ SD C+
Sbjct: 419 FDSGSSYTYFPKEAYLDLVASLNEVSGLGLVQDDSDTTLPICW 461
>gi|449451076|ref|XP_004143288.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 488
Score = 111 bits (278), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 84/278 (30%), Positives = 127/278 (45%), Gaps = 29/278 (10%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWL---PCDCVSCVHGLNSSSGQVIDFNIYSPNTS 160
L++T V +G PA F V +DTGSD+ W+ PCD G SSG I+ N++ S
Sbjct: 83 LYFTKVKLGNPAREFNVQIDTGSDILWVTCSPCD------GCPDSSGLGIELNLFDTTKS 136
Query: 161 STSSKVPCNSTLCE----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLH--LATDE 214
S++ +PC +C QC + +C Y Y D + ++GF V D +H + E
Sbjct: 137 SSARVLPCTDPICAAVSTTTDQCLTQTDHCSYSFHY-RDRSGTSGFYVTDSMHFDILLGE 195
Query: 215 KQSKSVDSRISFGCGRVQTGSFLDGA-APNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
+ + I FGC Q G A +G+FG G + SV S L+++G+ P FS C
Sbjct: 196 STIANSSATIVFGCSIYQYGDLTRATKALDGIFGFGQGEFSVISQLSSRGITPKVFSHCL 255
Query: 274 --GSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGG----NAVNFEFS--- 324
G +G G + G+ P +P L + P Y + + +++ G N F S
Sbjct: 256 KGGENGGGILVLGEILEPSIVYSP--LIPSQPHYTLKLQSIALSGQLFPNPTMFPISNAG 313
Query: 325 -AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTS 361
I DSGT+ YL + Y I S + + S
Sbjct: 314 ETIIDSGTTLAYLVEEVYDWIVSVITSAVSQSATPTIS 351
>gi|413953655|gb|AFW86304.1| hypothetical protein ZEAMMB73_151223 [Zea mays]
Length = 535
Score = 111 bits (277), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 79/278 (28%), Positives = 132/278 (47%), Gaps = 30/278 (10%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD---CVSCVHGLNSSSGQVIDFNIYSPNTS 160
L+YT +S+G P + + +DTGS W+ CD C SC G + +Y P +
Sbjct: 159 LYYTAISLGSPPRPYFLDVDTGSHTTWVQCDAPPCASCAKGAHP---------LYRP--A 207
Query: 161 STSSKVPCNSTLCE-LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
T+ +P + LCE Q + P + C Y++ Y +DG+ S G V D + ++ + ++
Sbjct: 208 RTADALPASDPLCEGAQHENP---NQCDYEISY-ADGSSSMGVYVRDSMQFVGEDGEREN 263
Query: 220 VDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT 278
D I FGCG Q G L+ +G+ GL S+P+ LA++G+I N+F C +D +
Sbjct: 264 AD--IVFGCGYDQQGVLLNALETTDGVLGLTNKALSLPTQLASRGIISNAFGHCMSTDPS 321
Query: 279 GR---ISFGDKGSPGQGETPFSLRQ--THPTYNITITQVSVGGNAVNFE---FSAIFDSG 330
G + GD P G T +R + Q++ G +N + +FD+G
Sbjct: 322 GAGGYLFLGDDYIPRWGMTWVPIRDGPADDVRRAQVKQINHGDQQLNAQGKLTQVVFDTG 381
Query: 331 TSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYC 368
+++TY D A T++ + A + SD +C
Sbjct: 382 STYTYFPDEALTRLISSLKEAASPRFVQDDSDKTLPFC 419
>gi|224128838|ref|XP_002320434.1| predicted protein [Populus trichocarpa]
gi|222861207|gb|EEE98749.1| predicted protein [Populus trichocarpa]
Length = 485
Score = 111 bits (277), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 93/284 (32%), Positives = 137/284 (48%), Gaps = 32/284 (11%)
Query: 82 NDKTPLTFSAGND-----TYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPC-DC 135
+D+ L AG D R + LG L+Y + +G P + V +DTGSD+ W+ C C
Sbjct: 51 DDQRQLRILAGVDLPLGGIGRPDILG-LYYAKIGIGTPTKDYYVQVDTGSDIMWVNCIQC 109
Query: 136 VSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC-ELQ-KQCP--SAGSNCPYQVR 191
C SS G ID +Y+ N S T VPC+ C E+ Q P +A +CPY
Sbjct: 110 RECPK--TSSLG--IDLTLYNINESDTGKLVPCDQEFCYEINGGQLPGCTANMSCPYLEI 165
Query: 192 YLSDGTMSTGFLVEDVLHLA--TDEKQSKSVDSRISFGCGRVQTGSF--LDGAAPNGLFG 247
Y DG+ + G+ V+DV+ A + + ++ + + + FGCG Q+G + A +G+ G
Sbjct: 166 Y-GDGSSTAGYFVKDVVQYARVSGDLKTTAANGSVIFGCGARQSGDLGSSNEEALDGILG 224
Query: 248 LGMDKTSVPSILANQGLIPNSFSMCF-GSDGTGRISFGDKGSPGQGETPFSLRQTHPTYN 306
G +S+ S LA G + F+ C G++G G G P TP Q H YN
Sbjct: 225 FGKSNSSMISQLAVTGKVKKIFAHCLDGTNGGGIFVIGHVVQPKVNMTPLIPNQPH--YN 282
Query: 307 ITITQVSVGGNAVN-----FEF----SAIFDSGTSFTYLNDPAY 341
+ +T V VG ++ FE AI DSGT+ YL + Y
Sbjct: 283 VNMTAVQVGHEFLSLPTDVFEAGDRKGAIIDSGTTLAYLPEMVY 326
>gi|145324889|ref|NP_001077691.1| aspartyl protease [Arabidopsis thaliana]
gi|332194268|gb|AEE32389.1| aspartyl protease [Arabidopsis thaliana]
Length = 410
Score = 111 bits (277), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 84/288 (29%), Positives = 130/288 (45%), Gaps = 31/288 (10%)
Query: 100 SLGFLHYTNVSVGQP--ALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIY 155
+G L+YT + VG+P + + +DTGS+L W+ CD C SC G N +Y
Sbjct: 25 QMGMLYYTRILVGKPEDGQYYHLDIDTGSELTWIQCDAPCTSCAKGAN---------QLY 75
Query: 156 SP---NTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLAT 212
P N +S +L + C + C Y++ Y +D + S G L +D HL
Sbjct: 76 KPRKDNLVRSSEAFCVEVQRNQLTEHCENC-HQCDYEIEY-ADHSYSMGVLTKDKFHLKL 133
Query: 213 DEKQSKSVDSRISFGCGRVQTGSFLDGA-APNGLFGLGMDKTSVPSILANQGLIPNSFSM 271
+S I FGCG Q G L+ +G+ GL K S+PS LA++G+I N
Sbjct: 134 --HNGSLAESDIVFGCGYDQQGLLLNTLLKTDGILGLSRAKISLPSQLASRGIISNVVGH 191
Query: 272 CFGSD--GTGRISFGDKGSPGQGET--PFSLRQTHPTYNITITQVSVGGNAVNFEFS--- 324
C SD G G I G P G T P Y + +T++S G ++ +
Sbjct: 192 CLASDLNGEGYIFMGSDLVPSHGMTWVPMLHDSRLDAYQMQVTKMSYGQGMLSLDGENGR 251
Query: 325 ---AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
+FD+G+S+TY + AY+Q+ + ++ + SD C+
Sbjct: 252 VGKVLFDTGSSYTYFPNQAYSQLVTSLQEVSGLELTRDDSDETLPICW 299
>gi|297847186|ref|XP_002891474.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297337316|gb|EFH67733.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 578
Score = 111 bits (277), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 86/285 (30%), Positives = 130/285 (45%), Gaps = 33/285 (11%)
Query: 104 LHYTNVSVGQPA--LSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSP-- 157
L+YT + VG+P + + +DTGSDL W+ CD C SC G N +Y P
Sbjct: 197 LYYTRILVGKPEDGQYYHLDIDTGSDLTWIQCDAPCTSCAKGAN---------QLYKPRK 247
Query: 158 -NTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
N +S +L + C S C Y++ Y +D + S G L +D HL
Sbjct: 248 DNLVRSSEPFCVEVQRNQLTEHCESC-HQCDYEIEY-ADHSYSMGVLTKDKFHLKL--HN 303
Query: 217 SKSVDSRISFGCGRVQTGSFLDGA-APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
+S I FGCG Q G L+ +G+ GL K S+PS LA++G+I N C S
Sbjct: 304 GSLAESDIVFGCGYDQQGLLLNTLLKTDGILGLSRAKISLPSQLASRGIISNVVGHCLAS 363
Query: 276 D--GTGRISFGDKGSPGQGETPFSLRQTHP---TYNITITQVSVGGNAVNFEFS------ 324
D G G I G P G T + HP Y + +T++S G ++ +
Sbjct: 364 DLNGEGYIFMGSDLVPSHGMTWVPMLH-HPHLEVYQMQVTKMSYGNAMLSLDGENGRVGK 422
Query: 325 AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
+FD+G+S+TY + AY+Q+ + ++ + SD C+
Sbjct: 423 VLFDTGSSYTYFPNQAYSQLVTSLQEVSDLELTRDDSDEALPICW 467
>gi|388518257|gb|AFK47190.1| unknown [Lotus japonicus]
Length = 478
Score = 111 bits (277), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 91/303 (30%), Positives = 138/303 (45%), Gaps = 37/303 (12%)
Query: 73 RGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLP 132
RGR L+A + F+ G + L ++ L++T + +G P+ + V +DTGSD+ W+
Sbjct: 46 RGRILSA-------VDFNLGGNG--LPTVTGLYFTKIGLGSPSKDYYVQVDTGSDILWVN 96
Query: 133 C-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC----ELQKQCPSAGSNCP 187
C +C C S I +Y P S TS V C C E + A + CP
Sbjct: 97 CVECTRCPR----KSDIGIGLTLYDPKRSKTSEFVSCEHNFCSSTYEGRILGCKAENPCP 152
Query: 188 YQVRYLSDGTMSTGFLVEDVL--HLATDEKQSKSVDSRISFGCGRVQTGSFLDGA--APN 243
Y + Y DG+ +TG+ V+D L + + + +S I FGCG Q+G+F + A +
Sbjct: 153 YSISY-GDGSATTGYYVQDYLTFNRVNGNPHTATQNSSIIFGCGAAQSGTFASSSEEALD 211
Query: 244 GLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD-GTGRISFGDKGSPGQGETPFSLRQTH 302
G+ G G +SV S LA G + FS C ++ G G S G+ P TP H
Sbjct: 212 GIIGFGQANSSVLSQLAASGKVKKIFSHCLDTNVGGGIFSIGEVVEPKVKTTPLVPNMAH 271
Query: 303 PTYNITITQVSVGGNAVNFEFS---------AIFDSGTSFTYLNDPAYTQISETFNSLAK 353
YN+ + + V G+ + + DSGT+ YL Y Q+ LAK
Sbjct: 272 --YNVILKNIEVDGDILQLPSDTFDSENGKGTVIDSGTTLAYLPRIVYDQLMSKV--LAK 327
Query: 354 EKR 356
+ R
Sbjct: 328 QPR 330
>gi|225436397|ref|XP_002272121.1| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
vinifera]
Length = 499
Score = 110 bits (276), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 90/264 (34%), Positives = 128/264 (48%), Gaps = 24/264 (9%)
Query: 94 DTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFN 153
D YR+ L++T V +G P F V +DTGSD+ W+ C SC +G SSG I N
Sbjct: 76 DPYRVG----LYFTRVLLGSPPKEFYVQIDTGSDVLWVSCG--SC-NGCPQSSGLHIPLN 128
Query: 154 IYSPNTSSTSSKVPCNSTLCELQKQ-----CPSAGSNCPYQVRYLSDGTMSTGFLVEDVL 208
+ P +SST+S + C+ C L Q C S G+ C Y +Y DG+ ++G+ V D+L
Sbjct: 129 FFDPGSSSTASLISCSDQRCSLGVQSSDAGCSSQGNQCIYTFQY-GDGSGTSGYYVSDLL 187
Query: 209 HL-ATDEKQSKSVDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIP 266
+ A + + I FGC QTG A +G+FG G SV S +++QG+ P
Sbjct: 188 NFDAIVGSSVTNSSASIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVISQMSSQGITP 247
Query: 267 NSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGN--AVNFEFS 324
FS C DG G + L + P YN+ + +SV G A++ E
Sbjct: 248 KVFSHCLKGDGGGGGILVLGEIVEEDIVYSPLVPSQPHYNLNLQSISVNGKSLAIDPEVF 307
Query: 325 A-------IFDSGTSFTYLNDPAY 341
A I DSGT+ YL + AY
Sbjct: 308 ATSTNRGTIVDSGTTLAYLAEEAY 331
>gi|6850312|gb|AAF29389.1|AC009999_9 Contains similarity to nucellin from Hordeum vulgare gb|U87148.
ESTs gb|T22068, gb|F14251, gb|F14237, gb|F14242 come
from this gene [Arabidopsis thaliana]
Length = 388
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 98/311 (31%), Positives = 149/311 (47%), Gaps = 37/311 (11%)
Query: 68 RYFRLRGRGLA-AQGNDKTPLTFSAGND-----TYRLNSLGFLHYTNVSVGQPALSFIVA 121
RY RL+G A + +D+ LT AG D T R + G L+Y + +G PA S+ V
Sbjct: 38 RYPRLQGSLTALKEHDDRRQLTILAGIDLPLGGTGRPDIPG-LYYAKIGIGTPAKSYYVQ 96
Query: 122 LDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCP 180
+DTGSD+ W+ C C C S+ G I+ +Y+ + S + V C+ C P
Sbjct: 97 VDTGSDIMWVNCIQCKQCPR--RSTLG--IELTLYNIDESDSGKLVSCDDDFCYQISGGP 152
Query: 181 SAG----SNCPYQVRYLSDGTMSTGFLVEDVLH---LATDEKQSKSVDSRISFGCGRVQT 233
+G +CPY Y DG+ + G+ V+DV+ +A D K +++ + + FGCG Q+
Sbjct: 153 LSGCKANMSCPYLEIY-GDGSSTAGYFVKDVVQYDSVAGDLK-TQTANGSVIFGCGARQS 210
Query: 234 GSFLDGA---APNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-GSDGTGRISFGDKGSP 289
G LD + A +G+ G G +S+ S LA+ G + F+ C G +G G + G P
Sbjct: 211 GD-LDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCLDGRNGGGIFAIGRVVQP 269
Query: 290 GQGETPFSLRQTHPTYNITITQVSVGGNAVNF---------EFSAIFDSGTSFTYLNDPA 340
TP Q H YN+ +T V VG + AI DSGT+ YL +
Sbjct: 270 KVNMTPLVPNQPH--YNVNMTAVQVGQEFLTIPADLFQPGDRKGAIIDSGTTLAYLPEII 327
Query: 341 YTQISETFNSL 351
Y + + +L
Sbjct: 328 YEPLVKKEPAL 338
>gi|297734190|emb|CBI15437.3| unnamed protein product [Vitis vinifera]
Length = 473
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 85/283 (30%), Positives = 127/283 (44%), Gaps = 32/283 (11%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
L++T++ VG P + + +DTGSDL W+ CD C SC G N +Y P +
Sbjct: 100 LYFTHIFVGSPPRRYFLDMDTGSDLTWIQCDAPCTSCAKGPNP---------LYKPKKGN 150
Query: 162 TSSKVPCNSTLC-ELQKQCPSAG----SNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
VP +LC E+Q+ + C Y++ Y +D + S G L D LHL
Sbjct: 151 L---VPLKDSLCVEVQRNLKTGYCETCEQCDYEIEY-ADHSSSMGVLASDDLHLMLANGS 206
Query: 217 SKSVDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
+ I FGC Q G L+ A +G+ GL K S+PS LA+Q +I N C S
Sbjct: 207 LTKLG--IMFGCAYDQQGLLLNSLAKTDGILGLSKAKVSLPSQLASQRIINNVLGHCLTS 264
Query: 276 DGT--GRISFGDKGSPGQGETPFSLRQTH-PTYNITITQVSVGGNAVNF------EFSAI 326
D T G + GD P G + +H P Y+ I ++S G ++ +
Sbjct: 265 DATGGGYMFLGDDFVPYWGMAWVPMLNSHSPNYHSQIMKISHGSRQLSLGRQDGRTERVV 324
Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
FD+G+S+TY AY + + ++ E SD C+
Sbjct: 325 FDTGSSYTYFPKEAYYALVASLKDVSDEGLIQDGSDPTLPVCW 367
>gi|326498555|dbj|BAJ98705.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 508
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 83/274 (30%), Positives = 132/274 (48%), Gaps = 25/274 (9%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
+YT++++G PA + + +DTGS L W+ CD C +C G + + NI P S
Sbjct: 129 YYTSINIGNPARPYFLDVDTGSALTWIQCDAPCTNCTKGPHPLYKPAKE-NIVPPRDSHC 187
Query: 163 SSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
++ N C+ KQ C Y++ Y +D + S G L D + L T + + +++D
Sbjct: 188 -QELQGNQNYCDTCKQ-------CDYEIAY-ADRSSSAGVLARDNMELITADGERENMD- 237
Query: 223 RISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD--GTG 279
+ FGC Q G L A+ +G+ GL S+P+ LA QG+I N F C +D G+
Sbjct: 238 -LVFGCAHDQQGKLLGSPASSDGILGLSNGAMSLPTQLAKQGIISNVFGHCIATDPSGSA 296
Query: 280 RISFGDKGSPGQGETPFSLRQ-THPTYNITITQVSVGGNAVNFEFSA------IFDSGTS 332
+ GD P G T +R Y+ + +V+ G +N A IFDSG+S
Sbjct: 297 YMFLGDDYVPRWGMTWVPVRNGPEDVYSTVVQKVNYGCQELNVREQAGKLTQVIFDSGSS 356
Query: 333 FTYLNDPAYTQISETFNSLAKE-KRETSTSDLPF 365
+TY YT + + +++ R+ S LPF
Sbjct: 357 YTYFPHEIYTSLITSLEAVSPGFVRDESDQTLPF 390
>gi|297734873|emb|CBI17107.3| unnamed protein product [Vitis vinifera]
Length = 484
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 90/264 (34%), Positives = 128/264 (48%), Gaps = 24/264 (9%)
Query: 94 DTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFN 153
D YR+ L++T V +G P F V +DTGSD+ W+ C SC +G SSG I N
Sbjct: 61 DPYRVG----LYFTRVLLGSPPKEFYVQIDTGSDVLWVSCG--SC-NGCPQSSGLHIPLN 113
Query: 154 IYSPNTSSTSSKVPCNSTLCELQKQ-----CPSAGSNCPYQVRYLSDGTMSTGFLVEDVL 208
+ P +SST+S + C+ C L Q C S G+ C Y +Y DG+ ++G+ V D+L
Sbjct: 114 FFDPGSSSTASLISCSDQRCSLGVQSSDAGCSSQGNQCIYTFQY-GDGSGTSGYYVSDLL 172
Query: 209 HL-ATDEKQSKSVDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIP 266
+ A + + I FGC QTG A +G+FG G SV S +++QG+ P
Sbjct: 173 NFDAIVGSSVTNSSASIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVISQMSSQGITP 232
Query: 267 NSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGN--AVNFEFS 324
FS C DG G + L + P YN+ + +SV G A++ E
Sbjct: 233 KVFSHCLKGDGGGGGILVLGEIVEEDIVYSPLVPSQPHYNLNLQSISVNGKSLAIDPEVF 292
Query: 325 A-------IFDSGTSFTYLNDPAY 341
A I DSGT+ YL + AY
Sbjct: 293 ATSTNRGTIVDSGTTLAYLAEEAY 316
>gi|296089645|emb|CBI39464.3| unnamed protein product [Vitis vinifera]
Length = 477
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 94/317 (29%), Positives = 142/317 (44%), Gaps = 32/317 (10%)
Query: 70 FRLRGRGLAA-QGNDKT-PLTFSAGND-----TYRLNSLGFLHYTNVSVGQPALSFIVAL 122
F + R LAA + +D + L AG D T R ++G L+Y + +G PA + V +
Sbjct: 57 FAGQKRSLAALKAHDNSRQLRILAGVDLPLGGTGRPEAVG-LYYAKIGIGTPARDYYVQV 115
Query: 123 DTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPS 181
DTGSD+ W+ C C C SS G ++ +Y S T V C+ C P
Sbjct: 116 DTGSDIMWVNCIQCNECPK--KSSLG--MELTLYDIKESLTGKLVSCDQDFCYAINGGPP 171
Query: 182 ----AGSNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQSKSVDSRISFGCGRVQTGS 235
A +C Y Y +DG+ S G+ V D++ + + ++ S + + FGC Q+G
Sbjct: 172 SYCIANMSCSYTEIY-ADGSSSFGYFVRDIVQYDQVSGDLETTSANGSVIFGCSATQSGD 230
Query: 236 FLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-GSDGTGRISFGDKGSPGQGET 294
A +G+ G G TS+ S LA+ G + F+ C G +G G + G P T
Sbjct: 231 LSSEEALDGILGFGKSNTSMISQLASSGKVRKMFAHCLDGLNGGGIFAIGHIVQPKVNTT 290
Query: 295 PFSLRQTHPTYNITITQVSVGGNAVNF---------EFSAIFDSGTSFTYLNDPAYTQIS 345
P QTH YN+ + V VGG +N + I DSGT+ YL + Y Q+
Sbjct: 291 PLVPNQTH--YNVNMKAVEVGGYFLNLPTDVFDVGDKKGTIIDSGTTLAYLPEVVYDQLL 348
Query: 346 ETFNSLAKEKRETSTSD 362
S + + + D
Sbjct: 349 SKIFSWQSDLKVHTIHD 365
>gi|356535252|ref|XP_003536162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 475
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 79/260 (30%), Positives = 117/260 (45%), Gaps = 26/260 (10%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L++T + +G P + V +DTGSD+ W+ C C C S ID +Y P S T
Sbjct: 69 LYFTKLGLGSPPKDYYVQVDTGSDILWVNCVKCSRCPR----KSDLGIDLTLYDPKGSET 124
Query: 163 SSKVPCNSTLCELQKQCPSAGSN----CPYQVRYLSDGTMSTGFLVEDVL--HLATDEKQ 216
S + C+ C P G CPY + Y DG+ +TG+ V+D L + D +
Sbjct: 125 SELISCDQEFCSATYDGPIPGCKSEIPCPYSITY-GDGSATTGYYVQDYLTYNHVNDNLR 183
Query: 217 SKSVDSRISFGCGRVQTGSFLDGA--APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG 274
+ +S I FGCG VQ+G+ + A +G+ G G +SV S LA G + FS C
Sbjct: 184 TAPQNSSIIFGCGAVQSGTLSSSSEEALDGIIGFGQSNSSVLSQLAASGKVKKIFSHCLD 243
Query: 275 S-DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS--------- 324
+ G G + G+ P TP R H YN+ + + V + +
Sbjct: 244 NIRGGGIFAIGEVVEPKVSTTPLVPRMAH--YNVVLKSIEVDTDILQLPSDIFDSGNGKG 301
Query: 325 AIFDSGTSFTYLNDPAYTQI 344
I DSGT+ YL Y ++
Sbjct: 302 TIIDSGTTLAYLPAIVYDEL 321
>gi|225455900|ref|XP_002275943.1| PREDICTED: aspartic proteinase Asp1-like [Vitis vinifera]
Length = 686
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 85/283 (30%), Positives = 127/283 (44%), Gaps = 32/283 (11%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
L++T++ VG P + + +DTGSDL W+ CD C SC G N +Y P +
Sbjct: 313 LYFTHIFVGSPPRRYFLDMDTGSDLTWIQCDAPCTSCAKGPNP---------LYKPKKGN 363
Query: 162 TSSKVPCNSTLC-ELQKQCPSAG----SNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
VP +LC E+Q+ + C Y++ Y +D + S G L D LHL
Sbjct: 364 L---VPLKDSLCVEVQRNLKTGYCETCEQCDYEIEY-ADHSSSMGVLASDDLHLMLANGS 419
Query: 217 SKSVDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
+ I FGC Q G L+ A +G+ GL K S+PS LA+Q +I N C S
Sbjct: 420 LTKLG--IMFGCAYDQQGLLLNSLAKTDGILGLSKAKVSLPSQLASQRIINNVLGHCLTS 477
Query: 276 DGT--GRISFGDKGSPGQGETPFSLRQTH-PTYNITITQVSVGGNAVNF------EFSAI 326
D T G + GD P G + +H P Y+ I ++S G ++ +
Sbjct: 478 DATGGGYMFLGDDFVPYWGMAWVPMLNSHSPNYHSQIMKISHGSRQLSLGRQDGRTERVV 537
Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
FD+G+S+TY AY + + ++ E SD C+
Sbjct: 538 FDTGSSYTYFPKEAYYALVASLKDVSDEGLIQDGSDPTLPVCW 580
>gi|359478045|ref|XP_002267046.2| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
Length = 502
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 96/314 (30%), Positives = 144/314 (45%), Gaps = 33/314 (10%)
Query: 70 FRLRGRGLAA-QGNDKT-PLTFSAGND-----TYRLNSLGFLHYTNVSVGQPALSFIVAL 122
F + R LAA + +D + L AG D T R ++G L+Y + +G PA + V +
Sbjct: 57 FAGQKRSLAALKAHDNSRQLRILAGVDLPLGGTGRPEAVG-LYYAKIGIGTPARDYYVQV 115
Query: 123 DTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPS 181
DTGSD+ W+ C C C SS G ++ +Y S T V C+ C P
Sbjct: 116 DTGSDIMWVNCIQCNECPK--KSSLG--MELTLYDIKESLTGKLVSCDQDFCYAINGGPP 171
Query: 182 ----AGSNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQSKSVDSRISFGCGRVQTGS 235
A +C Y Y +DG+ S G+ V D++ + + ++ S + + FGC Q+G
Sbjct: 172 SYCIANMSCSYTEIY-ADGSSSFGYFVRDIVQYDQVSGDLETTSANGSVIFGCSATQSGD 230
Query: 236 FLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-GSDGTGRISFGDKGSPGQGET 294
A +G+ G G TS+ S LA+ G + F+ C G +G G + G P T
Sbjct: 231 LSSEEALDGILGFGKSNTSMISQLASSGKVRKMFAHCLDGLNGGGIFAIGHIVQPKVNTT 290
Query: 295 PFSLRQTHPTYNITITQVSVGGNAVNF---------EFSAIFDSGTSFTYLNDPAYTQ-I 344
P QTH YN+ + V VGG +N + I DSGT+ YL + Y Q +
Sbjct: 291 PLVPNQTH--YNVNMKAVEVGGYFLNLPTDVFDVGDKKGTIIDSGTTLAYLPEVVYDQLL 348
Query: 345 SETFNSLAKEKRET 358
S+ F+ + K T
Sbjct: 349 SKIFSWQSDLKVHT 362
>gi|297805182|ref|XP_002870475.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316311|gb|EFH46734.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 480
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 100/344 (29%), Positives = 151/344 (43%), Gaps = 49/344 (14%)
Query: 28 GTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPL 87
G F F+ H+++ K + +L SF + LA+ D PL
Sbjct: 26 GNFVFNVTHKFAGKEK---QLSELKSHDSFRHARMLANID-----------------LPL 65
Query: 88 TFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSS 146
G D+ R +S+G L++T + +G P + V +DTGSD+ W+ C C C + +
Sbjct: 66 ----GGDS-RADSIG-LYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKC--PVKTDL 117
Query: 147 GQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSN--CPYQVRYLSDGTMSTGFLV 204
G I ++Y SSTS V C C Q + G+ C Y V Y DG+ S G V
Sbjct: 118 G--IPLSLYDSKASSTSKNVGCEDAFCSFIMQSETCGAKKPCSYHVVY-GDGSTSDGDFV 174
Query: 205 EDVLHL--ATDEKQSKSVDSRISFGCGRVQTGSF-LDGAAPNGLFGLGMDKTSVPSILAN 261
+D + L T ++ + + FGCG+ Q+G +A +G+ G G TSV S LA
Sbjct: 175 KDNITLDQVTGNLRTAPLAQEVVFGCGKNQSGQLGQTESAVDGIMGFGQSNTSVISQLAA 234
Query: 262 QGLIPNSFSMCFGS-DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAV- 319
G + FS C + +G G + G+ SP TP Q H YN+ + + V G +
Sbjct: 235 GGSVKRIFSHCLDNMNGGGIFAIGEVESPVVKTTPLVPNQVH--YNVILKGMDVDGEPID 292
Query: 320 --------NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEK 355
N + I DSGT+ YL Y + E + + K
Sbjct: 293 LPPSLASTNGDGGTIIDSGTTLAYLPQNLYNSLIEKITAKQQVK 336
>gi|363808270|ref|NP_001242239.1| uncharacterized protein LOC100801883 [Glycine max]
gi|255641727|gb|ACU21134.1| unknown [Glycine max]
Length = 475
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 79/262 (30%), Positives = 119/262 (45%), Gaps = 26/262 (9%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L++T + +G P + V +DTGSD+ W+ C +C C S ID +Y P S T
Sbjct: 69 LYFTKLGLGSPPRDYYVQVDTGSDILWVNCVECSRCPR----KSDLGIDLTLYDPKGSET 124
Query: 163 SSKVPCNSTLCELQKQCPSAGSN----CPYQVRYLSDGTMSTGFLVEDVL--HLATDEKQ 216
S V C+ C P G CPY + Y DG+ +TG+ V+D L + +
Sbjct: 125 SDVVSCDQDFCSATFDGPIPGCKSEIPCPYSITY-GDGSATTGYYVQDYLTYNRINGNLR 183
Query: 217 SKSVDSRISFGCGRVQTGSFLDGA--APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG 274
+ +S I FGCG VQ+G+ + A +G+ G G +SV S LA G + FS C
Sbjct: 184 TSPQNSSIIFGCGAVQSGTLGSSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCLD 243
Query: 275 S-DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS--------- 324
+ G G + G+ P TP R H YN+ + + V + +
Sbjct: 244 NVRGGGIFAIGEVVEPKVSTTPLVPRMAH--YNVVLKSIEVDTDILQLPSDIFDSVNGKG 301
Query: 325 AIFDSGTSFTYLNDPAYTQISE 346
+ DSGT+ YL D Y ++ +
Sbjct: 302 TVIDSGTTLAYLPDIVYDELIQ 323
>gi|255541790|ref|XP_002511959.1| protein with unknown function [Ricinus communis]
gi|223549139|gb|EEF50628.1| protein with unknown function [Ricinus communis]
Length = 583
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 85/285 (29%), Positives = 129/285 (45%), Gaps = 35/285 (12%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
L++T + VG P + + +DT SDL W+ CD C SC G N+ +Y P +
Sbjct: 207 LYFTYILVGNPPRPYYLDIDTASDLTWIQCDAPCTSCAKGANA---------LYKPRRDN 257
Query: 162 TSSKVPCNSTLCELQKQCPSAG-----SNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
+ P +S EL + AG C Y++ Y +D + S G L D LHL
Sbjct: 258 IVT--PKDSLCVELHRN-QKAGYCETCQQCDYEIEY-ADHSSSMGVLARDELHLTM--AN 311
Query: 217 SKSVDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
S + + +FGC Q G L+ +G+ GL K S+PS LAN+G+I N C +
Sbjct: 312 GSSTNLKFNFGCAYDQQGLLLNTLVKTDGILGLSKAKVSLPSQLANRGIINNVVGHCLAN 371
Query: 276 D--GTGRISFGDKGSP--GQGETPFSLRQTHPTYNITITQ-------VSVGGNAVNFEFS 324
D G G + GD P G P + +Y I + +S+GG
Sbjct: 372 DVVGGGYMFLGDDFVPRWGMSWVPMLDSPSIDSYQTQIMKLNYGSGPLSLGGQERRVR-R 430
Query: 325 AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
+FDSG+S+TY AY+++ + ++ E TSD +C+
Sbjct: 431 IVFDSGSSYTYFTKEAYSELVASLKQVSGEALIQDTSDPTLPFCW 475
>gi|125554848|gb|EAZ00454.1| hypothetical protein OsI_22475 [Oryza sativa Indica Group]
Length = 538
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 85/277 (30%), Positives = 131/277 (47%), Gaps = 31/277 (11%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
+YT++ +G P + + +DTGSDL W+ CD C +C G + +Y P +
Sbjct: 159 YYTSMYIGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHP---------LYKPEKPNV 209
Query: 163 SSKVPCNSTLC-ELQ--KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
VP + C ELQ + C Y++ Y +D + S G L D + L T + + ++
Sbjct: 210 ---VPPRDSYCQELQGNQNYGDTSKQCDYEITY-ADRSSSMGILARDNMQLITADGEREN 265
Query: 220 VDSRISFGCGRVQTGSFLDGAA-PNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT 278
+D FGCG Q G+ L A +G+ GL S+P+ LA+QG+I N F C +D +
Sbjct: 266 LD--FVFGCGYDQQGNLLSSPANTDGILGLSNAAISLPTQLASQGIISNVFGHCIAADPS 323
Query: 279 --GRISFGDKGSPGQGETPFSLRQ-THPTYNITITQVSVGGNAVNFEFSA------IFDS 329
G + GD P G T +R Y+ + +V+ G +N A IFDS
Sbjct: 324 NGGYMFLGDDYVPRWGMTWMPIRNGPENLYSTEVQKVNYGDQQLNVRRKAGKLTQVIFDS 383
Query: 330 GTSFTYLNDPAYTQ-ISETFNSLAKEKRETSTSDLPF 365
G+S+TYL YT I+ + ++ S LPF
Sbjct: 384 GSSYTYLPHDDYTNLIASLKSLSPSLLQDESDRTLPF 420
>gi|115467508|ref|NP_001057353.1| Os06g0268700 [Oryza sativa Japonica Group]
gi|53791766|dbj|BAD53531.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
Group]
gi|53793187|dbj|BAD54393.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
Group]
gi|113595393|dbj|BAF19267.1| Os06g0268700 [Oryza sativa Japonica Group]
gi|125596798|gb|EAZ36578.1| hypothetical protein OsJ_20919 [Oryza sativa Japonica Group]
gi|215767941|dbj|BAH00170.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 538
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 85/277 (30%), Positives = 131/277 (47%), Gaps = 31/277 (11%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
+YT++ +G P + + +DTGSDL W+ CD C +C G + +Y P +
Sbjct: 159 YYTSMYIGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHP---------LYKPEKPNV 209
Query: 163 SSKVPCNSTLC-ELQ--KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
VP + C ELQ + C Y++ Y +D + S G L D + L T + + ++
Sbjct: 210 ---VPPRDSYCQELQGNQNYGDTSKQCDYEITY-ADRSSSMGILARDNMQLITADGEREN 265
Query: 220 VDSRISFGCGRVQTGSFLDGAA-PNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT 278
+D FGCG Q G+ L A +G+ GL S+P+ LA+QG+I N F C +D +
Sbjct: 266 LD--FVFGCGYDQQGNLLSSPANTDGILGLSNAAISLPTQLASQGIISNVFGHCIAADPS 323
Query: 279 --GRISFGDKGSPGQGETPFSLRQ-THPTYNITITQVSVGGNAVNFEFSA------IFDS 329
G + GD P G T +R Y+ + +V+ G +N A IFDS
Sbjct: 324 NGGYMFLGDDYVPRWGMTWMPIRNGPENLYSTEVQKVNYGDQQLNVRRKAGKLTQVIFDS 383
Query: 330 GTSFTYLNDPAYTQ-ISETFNSLAKEKRETSTSDLPF 365
G+S+TYL YT I+ + ++ S LPF
Sbjct: 384 GSSYTYLPHDDYTNLIASLKSLSPSLLQDESDRTLPF 420
>gi|168060150|ref|XP_001782061.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666472|gb|EDQ53125.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 423
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 91/306 (29%), Positives = 141/306 (46%), Gaps = 43/306 (14%)
Query: 90 SAGNDTYRLNSLGF-----LHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGL 142
S GN + R + G L+Y + +G P + + +DTGSDL W CD C +C G
Sbjct: 20 SVGNHSVRFHVGGNIYPDGLYYMALLLGSPPKLYFLDMDTGSDLTWAQCDAPCRNCAIGP 79
Query: 143 NSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC-ELQK----QCPSAGSNCPYQVRYLSDGT 197
+ +Y+P + V C+ +C ++Q+ +C S C Y+V Y +DG+
Sbjct: 80 H---------GLYNPKKAKV---VDCHLPVCAQIQQGGSYECNSDVKQCDYEVEY-ADGS 126
Query: 198 MSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVP 256
+ G LVED L + + ++ GCG Q G+ A+ +G+ GL K ++P
Sbjct: 127 STMGVLVEDTLTVRL--TNGTLIQTKAIIGCGYDQQGTLAKSPASTDGVIGLSSSKVALP 184
Query: 257 SILANQGLIPNSFSMCF--GSDGTGRISFGDKGSPGQG--ETPFSLRQTHPTYNITITQV 312
+ LA +G+I N C GS+G G + FGD+ P G TP + Y + +
Sbjct: 185 AQLAEKGIIKNVLGHCLADGSNGGGYLFFGDELVPSWGMTWTPMMGKPEMLGYQARLQSI 244
Query: 313 SVGGNAVNFE---------FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDL 363
GG+++ S +FDSGTSFTYL AY + + R S + L
Sbjct: 245 RYGGDSLVLNNDEDLTRSTSSVMFDSGTSFTYLVPQAYASVLSAVTKQSGLLRVKSDTTL 304
Query: 364 PFEYCY 369
P YC+
Sbjct: 305 P--YCW 308
>gi|357440289|ref|XP_003590422.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355479470|gb|AES60673.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 498
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 91/308 (29%), Positives = 140/308 (45%), Gaps = 38/308 (12%)
Query: 63 LAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRL------NSLGF-LHYTNVSVGQPA 115
L HR LR R G + G +R+ ++LG+ L+ T V +G P
Sbjct: 37 LNHRVEIDTLRARDRVRHG--RILRASVGGVVDFRVQGSSDPSTLGYGLYTTKVKMGTPP 94
Query: 116 LSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE 174
F V +DTGSD+ W+ C+ C +C SSG I+ N + SST++ VPC+ +C
Sbjct: 95 REFTVQIDTGSDILWINCNTCSNC----PKSSGLGIELNFFDTVGSSTAALVPCSDPMCA 150
Query: 175 -----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD----SRIS 225
QC + C Y +Y DG+ ++G V D ++ QS + + I
Sbjct: 151 SAIQGAAAQCSPQVNQCSYTFQY-EDGSGTSGVYVSDAMYFDMILGQSTPANVASSATIV 209
Query: 226 FGCGRVQTGSFLD-GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGR--IS 282
FGC Q+G A +G+ G G + SV S L+++G+ P FS C DG G +
Sbjct: 210 FGCSTYQSGDLTKTDKAVDGILGFGPGELSVVSQLSSRGITPKVFSHCLKGDGNGGGILV 269
Query: 283 FGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF---------EFSAIFDSGTSF 333
G+ P +P L + P YN+ + ++V G ++ + I DSGT+
Sbjct: 270 LGEILEPSIVYSP--LVPSQPHYNLNLQSIAVNGQVLSINPAVFATSDKRGTIIDSGTTL 327
Query: 334 TYLNDPAY 341
+YL AY
Sbjct: 328 SYLVQEAY 335
>gi|30692930|ref|NP_198475.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|66792626|gb|AAY56415.1| At5g36260 [Arabidopsis thaliana]
gi|332006680|gb|AED94063.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 482
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 99/344 (28%), Positives = 151/344 (43%), Gaps = 49/344 (14%)
Query: 28 GTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPL 87
G F F+ H+++ K + +L SF + LA+ D PL
Sbjct: 27 GNFVFNVTHKFAGKEK---QLSELKSHDSFRHARMLANID-----------------LPL 66
Query: 88 TFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSS 146
G D+ R +S+G L++T + +G P + V +DTGSD+ W+ C C C + +
Sbjct: 67 ----GGDS-RADSIG-LYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKC--PVKTDL 118
Query: 147 GQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSN--CPYQVRYLSDGTMSTGFLV 204
G I ++Y TSSTS V C C Q + G+ C Y V Y DG+ S G +
Sbjct: 119 G--IPLSLYDSKTSSTSKNVGCEDDFCSFIMQSETCGAKKPCSYHVVY-GDGSTSDGDFI 175
Query: 205 ED--VLHLATDEKQSKSVDSRISFGCGRVQTGSF-LDGAAPNGLFGLGMDKTSVPSILAN 261
+D L T ++ + + FGCG+ Q+G +A +G+ G G TS+ S LA
Sbjct: 176 KDNITLEQVTGNLRTAPLAQEVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAA 235
Query: 262 QGLIPNSFSMCFGS-DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAV- 319
G FS C + +G G + G+ SP TP Q H YN+ + + V G+ +
Sbjct: 236 GGSTKRIFSHCLDNMNGGGIFAVGEVESPVVKTTPIVPNQVH--YNVILKGMDVDGDPID 293
Query: 320 --------NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEK 355
N + I DSGT+ YL Y + E + + K
Sbjct: 294 LPPSLASTNGDGGTIIDSGTTLAYLPQNLYNSLIEKITAKQQVK 337
>gi|9759039|dbj|BAB09366.1| aspartyl protease-like [Arabidopsis thaliana]
Length = 478
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 99/344 (28%), Positives = 151/344 (43%), Gaps = 49/344 (14%)
Query: 28 GTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPL 87
G F F+ H+++ K + +L SF + LA+ D PL
Sbjct: 23 GNFVFNVTHKFAGKEK---QLSELKSHDSFRHARMLANID-----------------LPL 62
Query: 88 TFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSS 146
G D+ R +S+G L++T + +G P + V +DTGSD+ W+ C C C + +
Sbjct: 63 ----GGDS-RADSIG-LYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKC--PVKTDL 114
Query: 147 GQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSN--CPYQVRYLSDGTMSTGFLV 204
G I ++Y TSSTS V C C Q + G+ C Y V Y DG+ S G +
Sbjct: 115 G--IPLSLYDSKTSSTSKNVGCEDDFCSFIMQSETCGAKKPCSYHVVY-GDGSTSDGDFI 171
Query: 205 ED--VLHLATDEKQSKSVDSRISFGCGRVQTGSF-LDGAAPNGLFGLGMDKTSVPSILAN 261
+D L T ++ + + FGCG+ Q+G +A +G+ G G TS+ S LA
Sbjct: 172 KDNITLEQVTGNLRTAPLAQEVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAA 231
Query: 262 QGLIPNSFSMCFGS-DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAV- 319
G FS C + +G G + G+ SP TP Q H YN+ + + V G+ +
Sbjct: 232 GGSTKRIFSHCLDNMNGGGIFAVGEVESPVVKTTPIVPNQVH--YNVILKGMDVDGDPID 289
Query: 320 --------NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEK 355
N + I DSGT+ YL Y + E + + K
Sbjct: 290 LPPSLASTNGDGGTIIDSGTTLAYLPQNLYNSLIEKITAKQQVK 333
>gi|357469591|ref|XP_003605080.1| Aspartic proteinase Asp1 [Medicago truncatula]
gi|355506135|gb|AES87277.1| Aspartic proteinase Asp1 [Medicago truncatula]
Length = 425
Score = 108 bits (270), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 82/283 (28%), Positives = 123/283 (43%), Gaps = 32/283 (11%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD-----CVSCVHGLNSSSGQVIDFNIYSPN 158
L+ ++++G P + + +DTGSDL W+ CD C C + +Y PN
Sbjct: 61 LYTVSINIGNPPKPYELDIDTGSDLTWVQCDGPDAPCKGCTMPKDK---------LYKPN 111
Query: 159 TSSTSSKVPCNSTLCE-------LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLA 211
V C+ +C L + C C Y V+Y +D + G LV D +H+
Sbjct: 112 GKQV---VKCSDPICVATQSTHVLGQICSKQSPPCVYNVQY-ADHASTLGVLVRDYMHIG 167
Query: 212 TDEKQSKSVDSRISFGCGRVQ--TGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSF 269
+ +K D ++FGCG Q +G + P G+ GLG KTS+ S L + G I N
Sbjct: 168 SPSSSTK--DPLVAFGCGYEQKFSGPTPPHSKPAGILGLGNGKTSILSQLTSIGFIHNVL 225
Query: 270 SMCFGSDGTGRISFGDKGSPGQG--ETPFSLRQTHPTYNITITQVSVGGNAVNFE-FSAI 326
C ++G G + GDK P G TP YN + G + I
Sbjct: 226 GHCLSAEGGGYLFLGDKFVPSSGIVWTPIIQSSLEKHYNTGPVDLFFNGKPTPAKGLQII 285
Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
FDSG+S+TY + P YT ++ N+ K K + D C+
Sbjct: 286 FDSGSSYTYFSSPVYTIVANMVNNDLKGKPLSRVKDPSLPICW 328
>gi|356546036|ref|XP_003541438.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 486
Score = 108 bits (269), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 106/346 (30%), Positives = 163/346 (47%), Gaps = 40/346 (11%)
Query: 51 LPKKGSFAYYSALAHRDRYFRLRG-RGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNV 109
+P G +AL RDR R RG+A D FS T NS+G L+YT V
Sbjct: 30 IPPTGHRVEVAALKARDRARHARMLRGVAGGVVD-----FSV-QGTSDPNSVG-LYYTKV 82
Query: 110 SVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPC 168
+G P F V +DTGSD+ W+ C+ C +C SS I+ N + SST++ +PC
Sbjct: 83 KMGTPPKEFNVQIDTGSDILWVNCNTCSNCPQ----SSQLGIELNFFDTVGSSTAALIPC 138
Query: 169 NSTLCELQKQ-----CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
+ +C + Q C + C Y +Y DG+ ++G+ V D ++ + Q +V+S
Sbjct: 139 SDPICTSRVQGAAAECSPRVNQCSYTFQY-GDGSGTSGYYVSDAMYFSLIMGQPPAVNSS 197
Query: 224 --ISFGCGRVQTGSFLD-GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGT 278
I FGC Q+G A +G+FG G SV S L+++G+ P FS C DG
Sbjct: 198 ATIVFGCSISQSGDLTKTDKAVDGIFGFGPGPLSVVSQLSSRGITPKVFSHCLKGDGDGG 257
Query: 279 GRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGG-----NAVNFEFS-----AIFD 328
G + G+ P +P L + P YN+ + ++V G N F S I D
Sbjct: 258 GVLVLGEILEPSIVYSP--LVPSQPHYNLNLQSIAVNGQLLPINPAVFSISNNRGGTIVD 315
Query: 329 SGTSFTYLNDPAYTQISETFN-SLAKEKRETSTSDLPFEYCYVLRS 373
GT+ YL AY + N ++++ R+T++ CY++ +
Sbjct: 316 CGTTLAYLIQEAYDPLVTAINTAVSQSARQTNSKG---NQCYLVST 358
>gi|255547548|ref|XP_002514831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223545882|gb|EEF47385.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 488
Score = 108 bits (269), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 88/357 (24%), Positives = 148/357 (41%), Gaps = 51/357 (14%)
Query: 6 RNSPVCVLLILLSCCAGCCFGFGTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAH 65
R + V L++++ C G + F+ H+++ + + A+ + SA+
Sbjct: 9 RLATVLSLVVIVELGFVVCLSNGNYVFNVQHKFAGKERSLSALKQHDARRHRRILSAVD- 67
Query: 66 RDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTG 125
L G G A+ L++ + +G P + V +DTG
Sbjct: 68 ----LPLGGNGHPAEAG---------------------LYFAKIGLGNPPKDYYVQVDTG 102
Query: 126 SDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGS 184
SD+ W+ C +C C + S + +Y P +S++++++ C+ C G
Sbjct: 103 SDILWVNCANCDKC----PTKSDLGVKLTLYDPQSSTSATRIYCDDDFCAATYNGVLQGC 158
Query: 185 N----CPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQSKSVDSRISFGCGRVQTGSF-L 237
C Y V Y DG+ + GF V+D L T Q+ S + + FGCG Q+G
Sbjct: 159 TKDLPCQYSVVY-GDGSSTAGFFVKDNLQFDRVTGNLQTSSANGSVIFGCGAKQSGELGT 217
Query: 238 DGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGTGRISFGDKGSPGQGETPF 296
A +G+ G G +S+ S LA G + F+ C + G G + G+ SP TP
Sbjct: 218 SSEALDGILGFGQANSSMISQLAAAGKVKRVFAHCLDNVKGGGIFAIGEVVSPKVNTTPM 277
Query: 297 SLRQTHPTYNITITQVSVGGNAVNF---------EFSAIFDSGTSFTYLNDPAYTQI 344
Q H YN+ + ++ VGGN + I DSGT+ YL + Y +
Sbjct: 278 VPNQPH--YNVVMKEIEVGGNVLELPTDIFDTGDRRGTIIDSGTTLAYLPEVVYESM 332
>gi|359476756|ref|XP_002277082.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 2 [Vitis
vinifera]
Length = 560
Score = 108 bits (269), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 79/270 (29%), Positives = 123/270 (45%), Gaps = 24/270 (8%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L++ + +G P+ + V +DTGSD+ W+ C C C + S +D +Y S+T
Sbjct: 154 LYFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRC----PTKSDLGVDLTLYDMKASTT 209
Query: 163 SSKVPCNSTLCEL-QKQCP--SAGSNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQS 217
S V C+ C L P G C Y V Y DG+ +TG+ V+D + + Q+
Sbjct: 210 SDAVGCDDNFCSLYDGPLPGCKPGLQCLYSVLY-GDGSSTTGYFVQDFVQYNRISGNFQT 268
Query: 218 KSVDSRISFGCGRVQTGSFLDGA-APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS- 275
+ + FGCG Q+G + A +G+ G G +S+ S LA+ G + FS C +
Sbjct: 269 TPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCLDNV 328
Query: 276 DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA---------I 326
DG G + G+ P TP Q H YN+ + ++ VGG+ ++ A I
Sbjct: 329 DGGGIFAIGEVVEPKVNITPLVQNQAH--YNVVMKEIEVGGDPLDVPSDAFESGDRKGTI 386
Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKR 356
DSGT+ Y Y + E S + R
Sbjct: 387 IDSGTTLAYFPQEVYVPLIEKILSQQPDLR 416
>gi|359476754|ref|XP_002277058.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 1 [Vitis
vinifera]
Length = 561
Score = 108 bits (269), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 79/270 (29%), Positives = 123/270 (45%), Gaps = 24/270 (8%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L++ + +G P+ + V +DTGSD+ W+ C C C + S +D +Y S+T
Sbjct: 154 LYFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRC----PTKSDLGVDLTLYDMKASTT 209
Query: 163 SSKVPCNSTLCEL-QKQCP--SAGSNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQS 217
S V C+ C L P G C Y V Y DG+ +TG+ V+D + + Q+
Sbjct: 210 SDAVGCDDNFCSLYDGPLPGCKPGLQCLYSVLY-GDGSSTTGYFVQDFVQYNRISGNFQT 268
Query: 218 KSVDSRISFGCGRVQTGSFLDGA-APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS- 275
+ + FGCG Q+G + A +G+ G G +S+ S LA+ G + FS C +
Sbjct: 269 TPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCLDNV 328
Query: 276 DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA---------I 326
DG G + G+ P TP Q H YN+ + ++ VGG+ ++ A I
Sbjct: 329 DGGGIFAIGEVVEPKVNITPLVQNQAH--YNVVMKEIEVGGDPLDVPSDAFESGDRKGTI 386
Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKR 356
DSGT+ Y Y + E S + R
Sbjct: 387 IDSGTTLAYFPQEVYVPLIEKILSQQPDLR 416
>gi|79495937|ref|NP_567922.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332660833|gb|AEE86233.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 401
Score = 107 bits (268), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 89/282 (31%), Positives = 132/282 (46%), Gaps = 35/282 (12%)
Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPN 158
LG+ + T +++GQP + + LDTGSDL WL CD CV C+ + +Y P
Sbjct: 54 LGYYNVT-INIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPHP---------LYQP- 102
Query: 159 TSSTSSKVPCNSTLCEL-----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
+S +PCN LC+ ++C + C Y+V Y +DG S G LV DV + +
Sbjct: 103 ---SSDLIPCNDPLCKALHLNSNQRCETP-EQCDYEVEY-ADGGSSLGVLVRDVFSM--N 155
Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
Q + R++ GCG Q +G+ GLG K S+ S L +QG + N C
Sbjct: 156 YTQGLRLTPRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCL 215
Query: 274 GSDGTGRISFGDK--GSPGQGETPFSLRQTHPTYNITI-TQVSVGGNAVNFE-FSAIFDS 329
S G G + FGD S TP S R+ Y+ + ++ GG + +FDS
Sbjct: 216 SSLGGGILFFGDDLYDSSRVSWTPMS-REYSKHYSPAMGGELLFGGRTTGLKNLLTVFDS 274
Query: 330 GTSFTYLNDPAYTQIS-----ETFNSLAKEKRETSTSDLPFE 366
G+S+TY N AY ++ E KE R+ T L ++
Sbjct: 275 GSSYTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQ 316
>gi|4490316|emb|CAB38807.1| nucellin-like protein [Arabidopsis thaliana]
gi|7270297|emb|CAB80066.1| nucellin-like protein [Arabidopsis thaliana]
Length = 420
Score = 107 bits (268), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 89/282 (31%), Positives = 132/282 (46%), Gaps = 35/282 (12%)
Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPN 158
LG+ + T +++GQP + + LDTGSDL WL CD CV C+ + +Y P
Sbjct: 35 LGYYNVT-INIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPHP---------LYQP- 83
Query: 159 TSSTSSKVPCNSTLCEL-----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
+S +PCN LC+ ++C + C Y+V Y +DG S G LV DV + +
Sbjct: 84 ---SSDLIPCNDPLCKALHLNSNQRCETP-EQCDYEVEY-ADGGSSLGVLVRDVF--SMN 136
Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
Q + R++ GCG Q +G+ GLG K S+ S L +QG + N C
Sbjct: 137 YTQGLRLTPRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCL 196
Query: 274 GSDGTGRISFGDK--GSPGQGETPFSLRQTHPTYNITI-TQVSVGGNAVNFE-FSAIFDS 329
S G G + FGD S TP S R+ Y+ + ++ GG + +FDS
Sbjct: 197 SSLGGGILFFGDDLYDSSRVSWTPMS-REYSKHYSPAMGGELLFGGRTTGLKNLLTVFDS 255
Query: 330 GTSFTYLNDPAYTQIS-----ETFNSLAKEKRETSTSDLPFE 366
G+S+TY N AY ++ E KE R+ T L ++
Sbjct: 256 GSSYTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQ 297
>gi|297735249|emb|CBI17611.3| unnamed protein product [Vitis vinifera]
Length = 480
Score = 107 bits (268), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 79/270 (29%), Positives = 123/270 (45%), Gaps = 24/270 (8%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L++ + +G P+ + V +DTGSD+ W+ C C C + S +D +Y S+T
Sbjct: 73 LYFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRC----PTKSDLGVDLTLYDMKASTT 128
Query: 163 SSKVPCNSTLCEL-QKQCP--SAGSNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQS 217
S V C+ C L P G C Y V Y DG+ +TG+ V+D + + Q+
Sbjct: 129 SDAVGCDDNFCSLYDGPLPGCKPGLQCLYSVLY-GDGSSTTGYFVQDFVQYNRISGNFQT 187
Query: 218 KSVDSRISFGCGRVQTGSFLDGA-APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS- 275
+ + FGCG Q+G + A +G+ G G +S+ S LA+ G + FS C +
Sbjct: 188 TPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCLDNV 247
Query: 276 DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA---------I 326
DG G + G+ P TP Q H YN+ + ++ VGG+ ++ A I
Sbjct: 248 DGGGIFAIGEVVEPKVNITPLVQNQAH--YNVVMKEIEVGGDPLDVPSDAFESGDRKGTI 305
Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKR 356
DSGT+ Y Y + E S + R
Sbjct: 306 IDSGTTLAYFPQEVYVPLIEKILSQQPDLR 335
>gi|334187133|ref|NP_001190905.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|21592493|gb|AAM64443.1| nucellin-like protein [Arabidopsis thaliana]
gi|332660834|gb|AEE86234.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 425
Score = 107 bits (268), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 89/282 (31%), Positives = 132/282 (46%), Gaps = 35/282 (12%)
Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPN 158
LG+ + T +++GQP + + LDTGSDL WL CD CV C+ + +Y P
Sbjct: 57 LGYYNVT-INIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPHP---------LYQP- 105
Query: 159 TSSTSSKVPCNSTLCEL-----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
+S +PCN LC+ ++C + C Y+V Y +DG S G LV DV + +
Sbjct: 106 ---SSDLIPCNDPLCKALHLNSNQRCETP-EQCDYEVEY-ADGGSSLGVLVRDVFSM--N 158
Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
Q + R++ GCG Q +G+ GLG K S+ S L +QG + N C
Sbjct: 159 YTQGLRLTPRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCL 218
Query: 274 GSDGTGRISFGDK--GSPGQGETPFSLRQTHPTYNITI-TQVSVGGNAVNFE-FSAIFDS 329
S G G + FGD S TP S R+ Y+ + ++ GG + +FDS
Sbjct: 219 SSLGGGILFFGDDLYDSSRVSWTPMS-REYSKHYSPAMGGELLFGGRTTGLKNLLTVFDS 277
Query: 330 GTSFTYLNDPAYTQIS-----ETFNSLAKEKRETSTSDLPFE 366
G+S+TY N AY ++ E KE R+ T L ++
Sbjct: 278 GSSYTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQ 319
>gi|26452545|dbj|BAC43357.1| putative nucellin [Arabidopsis thaliana]
Length = 413
Score = 107 bits (267), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 89/282 (31%), Positives = 132/282 (46%), Gaps = 35/282 (12%)
Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPN 158
LG+ + T +++GQP + + LDTGSDL WL CD CV C+ + +Y P
Sbjct: 45 LGYYNVT-INIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPHP---------LYQP- 93
Query: 159 TSSTSSKVPCNSTLCEL-----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
+S +PCN LC+ ++C + C Y+V Y +DG S G LV DV + +
Sbjct: 94 ---SSDLIPCNDPLCKALHLNSNQRCETP-EQCDYEVEY-ADGGSSLGVLVRDVFSM--N 146
Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
Q + R++ GCG Q +G+ GLG K S+ S L +QG + N C
Sbjct: 147 YTQGLRLTPRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCL 206
Query: 274 GSDGTGRISFGDK--GSPGQGETPFSLRQTHPTYNITI-TQVSVGGNAVNFE-FSAIFDS 329
S G G + FGD S TP S R+ Y+ + ++ GG + +FDS
Sbjct: 207 SSLGGGILFFGDDLYDSSRVSWTPMS-REYSKHYSPAMGGELLFGGRTTGLKNLLTVFDS 265
Query: 330 GTSFTYLNDPAYTQIS-----ETFNSLAKEKRETSTSDLPFE 366
G+S+TY N AY ++ E KE R+ T L ++
Sbjct: 266 GSSYTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQ 307
>gi|449440161|ref|XP_004137853.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449521209|ref|XP_004167622.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 492
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 86/258 (33%), Positives = 127/258 (49%), Gaps = 25/258 (9%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
L++T V +G P + F V +DTGSD+ W+ C+ SC +G SSG I N + ++SS+S
Sbjct: 78 LYFTKVKLGTPPMEFTVQIDTGSDILWVNCN--SC-NGCPRSSGLGIQLNFFDASSSSSS 134
Query: 164 SKVP-----CNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
S V CNS QC + + C Y +Y DG+ ++G+ V + ++ QS
Sbjct: 135 SLVSCSDPICNSAFQTTATQCLTQSNQCSYTFQY-GDGSGTSGYYVSESMYFDMVMGQSM 193
Query: 219 SVDSRIS--FGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
+S S FGC Q+G A +G+FG G SV S L+ +G+ P FS C
Sbjct: 194 IANSSASVVFGCSTYQSGDLTKSDHAIDGIFGFGPGDLSVISQLSARGITPKVFSHCLKG 253
Query: 276 DGT--GRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGN---------AVNFEFS 324
+G G + G+ PG +P L + P YN+ + +SV G A +
Sbjct: 254 EGNGGGILVLGEVLEPGIVYSP--LVPSQPHYNLYLQSISVNGQTLPIDPSVFATSINRG 311
Query: 325 AIFDSGTSFTYLNDPAYT 342
I DSGT+ YL + AYT
Sbjct: 312 TIIDSGTTLAYLVEEAYT 329
>gi|357142295|ref|XP_003572524.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 494
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 83/285 (29%), Positives = 130/285 (45%), Gaps = 28/285 (9%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L++T + +G P+ + V +DTGSD+ W+ C C SC SG ID +Y P S++
Sbjct: 88 LYFTQIGIGTPSKGYYVQVDTGSDILWVNCISCDSCPR----KSGLGIDLTLYDPTASAS 143
Query: 163 SSKVPCNSTLCELQKQC---PSAGSN--CPYQVRYLSDGTMSTGFLVEDVLHL--ATDEK 215
S V C C PS +N C Y + Y DG+ +TGF V D L + +
Sbjct: 144 SKTVTCGQEFCATATNGGVPPSCAANSPCQYSITY-GDGSSTTGFFVADFLQYDQVSGDG 202
Query: 216 QSKSVDSRISFGCGRVQTGSF-LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG 274
Q+ ++ ++FGCG G+ A +G+ G G +S+ S L + G + FS C
Sbjct: 203 QTNLANASVTFGCGAKIGGALGSSNVALDGILGFGQANSSMLSQLTSAGKVTKIFSHCLD 262
Query: 275 S-DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS--------- 324
+ +G G + G+ P TP L P YN+ + + VGG+ + +
Sbjct: 263 TVNGGGIFAIGNVVQPKVKTTP--LVPGMPHYNVVLKTIDVGGSTLQLPTNIFDIGGGSR 320
Query: 325 -AIFDSGTSFTYLNDPAYTQI-SETFNSLAKEKRETSTSDLPFEY 367
I DSGT+ YL + Y + S F++ + L F+Y
Sbjct: 321 GTIIDSGTTLAYLPEVVYKAVLSAVFSNHPDVTLKNVQDFLCFQY 365
>gi|4646203|gb|AAD26876.1|AC007230_10 Belongs to PF|00026 Eukaryotic aspartyl protease family
[Arabidopsis thaliana]
Length = 449
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 85/271 (31%), Positives = 128/271 (47%), Gaps = 23/271 (8%)
Query: 97 RLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIY 155
R++S+G L++T + +G P + V +DTGSD+ W+ C C C N + +++
Sbjct: 67 RVDSVG-LYFTKIKLGSPPKEYHVQVDTGSDILWINCKPCPKCPTKTNLN----FRLSLF 121
Query: 156 SPNTSSTSSKVPCNSTLCELQKQCPSA--GSNCPYQVRYLSDGTMSTGFLVEDVLHL--A 211
N SSTS KV C+ C Q S C Y + Y +D + S G + D+L L
Sbjct: 122 DMNASSTSKKVGCDDDFCSFISQSDSCQPALGCSYHIVY-ADESTSDGKFIRDMLTLEQV 180
Query: 212 TDEKQSKSVDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFS 270
T + ++ + + FGCG Q+G +G +A +G+ G G TSV S LA G FS
Sbjct: 181 TGDLKTGPLGQEVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFS 240
Query: 271 MCFGS-DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS----- 324
C + G G + G SP TP Q H YN+ + + V G +++ S
Sbjct: 241 HCLDNVKGGGIFAVGVVDSPKVKTTPMVPNQMH--YNVMLMGMDVDGTSLDLPRSIVRNG 298
Query: 325 -AIFDSGTSFTYLNDPAYTQISETFNSLAKE 354
I DSGT+ Y Y + ET LA++
Sbjct: 299 GTIVDSGTTLAYFPKVLYDSLIETI--LARQ 327
>gi|356531884|ref|XP_003534506.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 482
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 97/362 (26%), Positives = 150/362 (41%), Gaps = 58/362 (16%)
Query: 6 RNSPVCVLLILLSCCAGCCFGFGTFGFDFHHRYSDPVKGILAVD--DLPKKGSFAYYSAL 63
R V +++ L CC F ++ P + + A+ D ++G F L
Sbjct: 4 RERLVRLVVSLFVVVQLCCHANANMVFPVVRKFKGPAENLAAIKAHDAGRRGRFLSVVDL 63
Query: 64 AHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALD 123
A L G G R S G L+YT + +G + V +D
Sbjct: 64 A-------LGGNG--------------------RPTSTG-LYYTKIGLGPN--DYYVQVD 93
Query: 124 TGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSA 182
TGSD W+ C C +C SG ++ +Y PN+S TS VPC+ C P +
Sbjct: 94 TGSDTLWVNCVGCTTC----PKKSGLGMELTLYDPNSSKTSKVVPCDDEFCTSTYDGPIS 149
Query: 183 G----SNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV--DSRISFGCGRVQTGSF 236
G +CPY + Y DG+ ++G ++D L ++V ++ + FGCG Q+G+
Sbjct: 150 GCKKDMSCPYSITY-GDGSTTSGSYIKDDLTFDRVVGDLRTVPDNTSVIFGCGSKQSGTL 208
Query: 237 --LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGTGRISFGDKGSPGQGE 293
+ +G+ G G +SV S LA G + FS C + +G G + G+ P
Sbjct: 209 SSTTDTSLDGIIGFGQANSSVLSQLAAAGKVKRVFSHCLDTVNGGGIFAIGEVVQPKVKT 268
Query: 294 TPFSLRQTHPTYNITITQVSVGGNAVNFEFS---------AIFDSGTSFTYLNDPAYTQI 344
TP R H YN+ + + V G+ + I DSGT+ YL Y Q+
Sbjct: 269 TPLVPRMAH--YNVVLKDIEVAGDPIQLPTDIFDSTSGRGTIIDSGTTLAYLPVSIYDQL 326
Query: 345 SE 346
E
Sbjct: 327 LE 328
>gi|15217887|ref|NP_176703.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
gi|118572746|sp|Q9S9K4.2|ASPL2_ARATH RecName: Full=Aspartic proteinase-like protein 2; Flags: Precursor
gi|332196226|gb|AEE34347.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
Length = 475
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 85/271 (31%), Positives = 128/271 (47%), Gaps = 23/271 (8%)
Query: 97 RLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIY 155
R++S+G L++T + +G P + V +DTGSD+ W+ C C C N + +++
Sbjct: 67 RVDSVG-LYFTKIKLGSPPKEYHVQVDTGSDILWINCKPCPKCPTKTNLN----FRLSLF 121
Query: 156 SPNTSSTSSKVPCNSTLCELQKQCPSA--GSNCPYQVRYLSDGTMSTGFLVEDVLHL--A 211
N SSTS KV C+ C Q S C Y + Y +D + S G + D+L L
Sbjct: 122 DMNASSTSKKVGCDDDFCSFISQSDSCQPALGCSYHIVY-ADESTSDGKFIRDMLTLEQV 180
Query: 212 TDEKQSKSVDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFS 270
T + ++ + + FGCG Q+G +G +A +G+ G G TSV S LA G FS
Sbjct: 181 TGDLKTGPLGQEVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFS 240
Query: 271 MCFGS-DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS----- 324
C + G G + G SP TP Q H YN+ + + V G +++ S
Sbjct: 241 HCLDNVKGGGIFAVGVVDSPKVKTTPMVPNQMH--YNVMLMGMDVDGTSLDLPRSIVRNG 298
Query: 325 -AIFDSGTSFTYLNDPAYTQISETFNSLAKE 354
I DSGT+ Y Y + ET LA++
Sbjct: 299 GTIVDSGTTLAYFPKVLYDSLIETI--LARQ 327
>gi|148907752|gb|ABR17002.1| unknown [Picea sitchensis]
Length = 454
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 90/309 (29%), Positives = 138/309 (44%), Gaps = 36/309 (11%)
Query: 59 YYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSF 118
++ L DR GR L N T D Y + L+YT + +G P F
Sbjct: 5 HFEMLKAHDR--ARHGRSL----NTIVDFTLQGTADPY----VAGLYYTRIELGTPPRPF 54
Query: 119 IVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC---- 173
V +DTGSD+ W+ C C +C +SG + N + P SST+S + C + C
Sbjct: 55 YVQIDTGSDILWVNCKPCNACPL----TSGLGVALNFFDPRGSSTASPLSCIDSKCVSSN 110
Query: 174 ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVL--HLATDEKQSKSVDSRISFGCGRV 231
++ + + C Y Y DG+ + G+ V D + ++ + + ++I+FGC
Sbjct: 111 QISESVCTTDRYCGYSFEY-GDGSGTLGYYVSDEFDYNQYVNQYVTNNASAKITFGCSYN 169
Query: 232 QTGSFLD-GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-GSD-GTGRISFGDKGS 288
Q+G A +G+FG G + SV S L +QGL P FS C G+D G G + G+
Sbjct: 170 QSGDLTKPDRAVDGIFGFGQNDLSVVSQLNSQGLAPKIFSHCLEGADPGGGILVLGEITE 229
Query: 289 PGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE---------FSAIFDSGTSFTYLNDP 339
PG TP Q H YN+ + ++V G ++ + I D GT+ YL +
Sbjct: 230 PGMVYTPIVPSQPH--YNLNLQGIAVNGQQLSIDPQVFATTNTRGTIIDCGTTLAYLAEE 287
Query: 340 AYTQISETF 348
AY T
Sbjct: 288 AYEPFVNTI 296
>gi|302756119|ref|XP_002961483.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
gi|300170142|gb|EFJ36743.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
Length = 388
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 89/278 (32%), Positives = 124/278 (44%), Gaps = 35/278 (12%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
L++T V +G P +IV +DTGSD+ W+ C S G S I +Y P SST+
Sbjct: 1 LYFTQVGLGNPVKHYIVQVDTGSDVLWVNCRPCS---GCPRKSALNIPLTMYDPRESSTT 57
Query: 164 SKVPCNSTLCE-----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS- 217
S V C+ LC + QC A +NC Y Y DG+ S G+ V D +
Sbjct: 58 SLVSCSDPLCVRGRRFAEAQCSQATNNCEYIFSY-GDGSTSEGYYVRDAMQYNVISSNGL 116
Query: 218 KSVDSRISFGCGRVQTGSF-LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD 276
+ S++ FGC QTG A +G+ G G + SVP+ LA Q IP FS C +
Sbjct: 117 ANTTSQVLFGCSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIPRVFSHCL--E 174
Query: 277 GTGR----ISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF---EFSA---- 325
G R + G PG TP H YN+ + +SV N + +FS+
Sbjct: 175 GEKRGGGILVIGGIAEPGMTYTPLVPDSVH--YNVVLRGISVNSNRLPIDAEDFSSTNDT 232
Query: 326 --IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTS 361
I DSGT+ Y AY N + RE +++
Sbjct: 233 GVIMDSGTTLAYFPSGAY-------NVFVQAIREATSA 263
>gi|18402471|ref|NP_564539.1| aspartyl protease [Arabidopsis thaliana]
gi|7770346|gb|AAF69716.1|AC016041_21 F27J15.15 [Arabidopsis thaliana]
gi|13430540|gb|AAK25892.1|AF360182_1 unknown protein [Arabidopsis thaliana]
gi|14532748|gb|AAK64075.1| unknown protein [Arabidopsis thaliana]
gi|332194267|gb|AEE32388.1| aspartyl protease [Arabidopsis thaliana]
Length = 583
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 83/284 (29%), Positives = 128/284 (45%), Gaps = 31/284 (10%)
Query: 104 LHYTNVSVGQP--ALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSP-- 157
L+YT + VG+P + + +DTGS+L W+ CD C SC G N +Y P
Sbjct: 202 LYYTRILVGKPEDGQYYHLDIDTGSELTWIQCDAPCTSCAKGAN---------QLYKPRK 252
Query: 158 -NTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
N +S +L + C + C Y++ Y +D + S G L +D HL
Sbjct: 253 DNLVRSSEAFCVEVQRNQLTEHCENC-HQCDYEIEY-ADHSYSMGVLTKDKFHLKL--HN 308
Query: 217 SKSVDSRISFGCGRVQTGSFLDGA-APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
+S I FGCG Q G L+ +G+ GL K S+PS LA++G+I N C S
Sbjct: 309 GSLAESDIVFGCGYDQQGLLLNTLLKTDGILGLSRAKISLPSQLASRGIISNVVGHCLAS 368
Query: 276 D--GTGRISFGDKGSPGQGET--PFSLRQTHPTYNITITQVSVGGNAVNFEFS------A 325
D G G I G P G T P Y + +T++S G ++ +
Sbjct: 369 DLNGEGYIFMGSDLVPSHGMTWVPMLHDSRLDAYQMQVTKMSYGQGMLSLDGENGRVGKV 428
Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
+FD+G+S+TY + AY+Q+ + ++ + SD C+
Sbjct: 429 LFDTGSSYTYFPNQAYSQLVTSLQEVSGLELTRDDSDETLPICW 472
>gi|357502759|ref|XP_003621668.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355496683|gb|AES77886.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 481
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 81/268 (30%), Positives = 131/268 (48%), Gaps = 29/268 (10%)
Query: 95 TYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFN 153
T R +S+G L+Y + +G P+ + + +DTG+D+ W+ C C C + S +D
Sbjct: 64 TGRPDSVG-LYYAKIGIGTPSKDYYLQVDTGTDMMWVNCIQCKECP----TRSNLGMDLT 118
Query: 154 IYSPNTSSTSSKVPCNSTLCE-----LQKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDV 207
+Y+ SS+ VPC+ LC+ L C S ++ CPY Y DG+ + G+ V+DV
Sbjct: 119 LYNIKESSSGKLVPCDQELCKEINGGLLTGCTSKTNDSCPYLEIY-GDGSSTAGYFVKDV 177
Query: 208 LHL--ATDEKQSKSVDSRISFGCGRVQTG--SFLDGAAPNGLFGLGMDKTSVPSILANQG 263
+ + + ++ S + + FGCG Q+G S+ + A +G+ G G S+ S L++ G
Sbjct: 178 VLFDQVSGDLKTASANGSVIFGCGARQSGDLSYSNEEALDGILGFGKANYSMISQLSSSG 237
Query: 264 LIPNSFSMCF-GSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE 322
+ F+ C G +G G + G P TP L P Y++ +T + VG +N
Sbjct: 238 KVKKMFAHCLNGVNGGGIFAIGHVVQPTVNTTP--LLPDQPHYSVNMTAIQVGHTFLNLS 295
Query: 323 FSA---------IFDSGTSFTYLNDPAY 341
A I DSGT+ YL D Y
Sbjct: 296 TDASEQRDSKGTIIDSGTTLAYLPDGIY 323
>gi|449511696|ref|XP_004164029.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 639
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 77/245 (31%), Positives = 117/245 (47%), Gaps = 29/245 (11%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
T + +G P F + +DTGS + ++PC +CV C + + + P SST
Sbjct: 91 TRLWIGSPPQEFALIVDTGSTVTYVPCSNCVQCGNHQDPR---------FQPELSSTYQP 141
Query: 166 VPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
V CN+ C C G C Y+ RY ++ + S+G L EDV+ K+S+ V R
Sbjct: 142 VKCNAD-C----NCDENGVQCTYERRY-AEMSTSSGVLAEDVMSFG---KESELVPQRAV 192
Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISF 283
FGC +++G A +G+ GLG SV L +G++ NSFS+C+G G G +
Sbjct: 193 FGCETMESGDLYTQRA-DGIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGGMDVGGGAMVL 251
Query: 284 GDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-------EFSAIFDSGTSFTYL 336
G SP S P YNI + ++ V G + ++ AI DSGT++ Y
Sbjct: 252 GGISSPPGMVFSHSDPSRSPYYNIELKEIHVAGKPLKLNPRTFDGKYGAILDSGTTYAYF 311
Query: 337 NDPAY 341
+ AY
Sbjct: 312 PEKAY 316
>gi|297798582|ref|XP_002867175.1| hypothetical protein ARALYDRAFT_328390 [Arabidopsis lyrata subsp.
lyrata]
gi|297313011|gb|EFH43434.1| hypothetical protein ARALYDRAFT_328390 [Arabidopsis lyrata subsp.
lyrata]
Length = 425
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 88/282 (31%), Positives = 132/282 (46%), Gaps = 35/282 (12%)
Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPN 158
LG+ + T +++GQP + + LDTGSDL WL CD CV C+ + +Y P
Sbjct: 57 LGYYNVT-INIGQPPRPYYLDLDTGSDLTWLQCDAPCVRCLEAPHP---------LYQP- 105
Query: 159 TSSTSSKVPCNSTLCEL-----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
+S +PCN LC+ ++C + C Y+V Y +DG S G LV DV + +
Sbjct: 106 ---SSDLIPCNDPLCKALHLNSNQRCETP-EQCDYEVEY-ADGGSSLGVLVRDVFSM--N 158
Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
+ + R++ GCG Q +G+ GLG K S+ S L +QG + N C
Sbjct: 159 YTKGLRLTPRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNVIGHCL 218
Query: 274 GSDGTGRISFGDK--GSPGQGETPFSLRQTHPTYNITI-TQVSVGGNAVNFE-FSAIFDS 329
S G G + FGD S TP S R+ Y+ + ++ GG + +FDS
Sbjct: 219 SSLGGGILFFGDDLYDSSRVSWTPMS-REYSKHYSPAMGGELLFGGRTTGLKNLLTVFDS 277
Query: 330 GTSFTYLNDPAYTQIS-----ETFNSLAKEKRETSTSDLPFE 366
G+S+TY N AY ++ E KE R+ T L ++
Sbjct: 278 GSSYTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQ 319
>gi|449447285|ref|XP_004141399.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 609
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 76/245 (31%), Positives = 116/245 (47%), Gaps = 29/245 (11%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
T + +G P F + +DTGS + ++PC +CV C + + + P SST
Sbjct: 91 TRLWIGSPPQEFALIVDTGSTVTYVPCSNCVQCGNHQDPR---------FQPELSSTYQP 141
Query: 166 VPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
V CN+ C G C Y+ RY ++ + S+G L EDV+ K+S+ V R
Sbjct: 142 VKCNADC-----NCDENGVQCTYERRY-AEMSTSSGVLAEDVMSFG---KESELVPQRAV 192
Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISF 283
FGC +++G A +G+ GLG SV L +G++ NSFS+C+G G G +
Sbjct: 193 FGCETMESGDLYTQRA-DGIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGGMDVGGGAMVL 251
Query: 284 GDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-------EFSAIFDSGTSFTYL 336
G SP S P YNI + ++ V G + ++ AI DSGT++ Y
Sbjct: 252 GGISSPPGMVFSHSDPSRSPYYNIELKEIHVAGKPLKLNPRTFDGKYGAILDSGTTYAYF 311
Query: 337 NDPAY 341
+ AY
Sbjct: 312 PEKAY 316
>gi|413953656|gb|AFW86305.1| hypothetical protein ZEAMMB73_151223 [Zea mays]
Length = 406
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 75/254 (29%), Positives = 125/254 (49%), Gaps = 30/254 (11%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD---CVSCVHGLNSSSGQVIDFNIYSPNTS 160
L+YT +S+G P + + +DTGS W+ CD C SC G + +Y P +
Sbjct: 159 LYYTAISLGSPPRPYFLDVDTGSHTTWVQCDAPPCASCAKGAHP---------LYRP--A 207
Query: 161 STSSKVPCNSTLCE-LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
T+ +P + LCE Q + P + C Y++ Y +DG+ S G V D + ++ + ++
Sbjct: 208 RTADALPASDPLCEGAQHENP---NQCDYEISY-ADGSSSMGVYVRDSMQFVGEDGEREN 263
Query: 220 VDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT 278
D I FGCG Q G L+ +G+ GL S+P+ LA++G+I N+F C +D +
Sbjct: 264 AD--IVFGCGYDQQGVLLNALETTDGVLGLTNKALSLPTQLASRGIISNAFGHCMSTDPS 321
Query: 279 GRISF---GDKGSPGQGETPFSLRQ--THPTYNITITQVSVGGNAVNFE---FSAIFDSG 330
G + GD P G T +R + Q++ G +N + +FD+G
Sbjct: 322 GAGGYLFLGDDYIPRWGMTWVPIRDGPADDVRRAQVKQINHGDQQLNAQGKLTQVVFDTG 381
Query: 331 TSFTYLNDPAYTQI 344
+++TY D A T++
Sbjct: 382 STYTYFPDEALTRL 395
>gi|297840891|ref|XP_002888327.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297334168|gb|EFH64586.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 473
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 87/273 (31%), Positives = 127/273 (46%), Gaps = 27/273 (9%)
Query: 97 RLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIY 155
R++S+G L++T + +G P + V +DTGSD+ W+ C C C N + +++
Sbjct: 67 RVDSVG-LYFTKIKLGSPPKEYHVQVDTGSDILWVNCKPCPECPSKTNLN----FHLSLF 121
Query: 156 SPNTSSTSSKVPCNSTLCELQKQC----PSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL- 210
N SSTS KV C+ C Q P+ G C Y + Y +D + S G + D L L
Sbjct: 122 DVNASSTSKKVGCDDDFCSFISQSDSCQPAVG--CSYHIVY-ADESTSEGNFIRDKLTLE 178
Query: 211 -ATDEKQSKSVDSRISFGCGRVQTGSF-LDGAAPNGLFGLGMDKTSVPSILANQGLIPNS 268
T + Q+ + + FGCG Q+G +A +G+ G G TSV S LA G
Sbjct: 179 QVTGDLQTGPLGQEVVFGCGSDQSGQLGKSDSAVDGVMGFGQSNTSVLSQLAATGDAKRV 238
Query: 269 FSMCFGS-DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS--- 324
FS C + G G + G SP TP Q H YN+ + + V G A++ S
Sbjct: 239 FSHCLDNVKGGGIFAVGVVDSPKVKTTPMVPNQMH--YNVMLMGMDVDGTALDLPPSIMR 296
Query: 325 ---AIFDSGTSFTYLNDPAYTQISETFNSLAKE 354
I DSGT+ Y Y + ET LA++
Sbjct: 297 NGGTIVDSGTTLAYFPKVLYDSLIETI--LARQ 327
>gi|225438361|ref|XP_002273988.1| PREDICTED: aspartic proteinase Asp1 [Vitis vinifera]
gi|296082608|emb|CBI21613.3| unnamed protein product [Vitis vinifera]
Length = 426
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 88/261 (33%), Positives = 123/261 (47%), Gaps = 23/261 (8%)
Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPN 158
LG+ +Y ++S+GQP + + DTGSDL WL CD CV C + +Y PN
Sbjct: 64 LGY-YYVSLSIGQPPKPYFLDPDTGSDLSWLQCDAPCVRCTKAPHP---------LYRPN 113
Query: 159 TSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
+ K P ++L +C C Y+V Y +DG S G LV+DV L +
Sbjct: 114 NNLVICKDPMCASLHPPGYKCEHP-EQCDYEVEY-ADGGSSLGVLVKDVFPL--NFTNGL 169
Query: 219 SVDSRISFGCGRVQTGSFLDGAAP-NGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDG 277
+ R++ GCG Q P +G+ GLG K+S+ S L +QG+I N C S G
Sbjct: 170 RLAPRLALGCGYDQIPG--QSYHPLDGVLGLGKGKSSIVSQLHSQGVIRNVVGHCVSSRG 227
Query: 278 TGRISFGDK--GSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE-FSAIFDSGTSFT 334
G + FGD S TP LR H Y+ ++ +GG F+ FDSG+S+T
Sbjct: 228 GGFLFFGDDLYDSSRVVWTPM-LRDQHTHYSSGYAELILGGKTTVFKNLLVTFDSGSSYT 286
Query: 335 YLNDPAYTQISETFNSLAKEK 355
YLN AY + EK
Sbjct: 287 YLNSLAYQALVHLVRKELSEK 307
>gi|110741881|dbj|BAE98882.1| predicted GPI-anchored protein [Arabidopsis thaliana]
Length = 313
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 62/160 (38%), Positives = 90/160 (56%), Gaps = 12/160 (7%)
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD 276
S SV +R+ GCG+ Q+G +LDG AP+GL GLG + SVPS L+ GL+ NSFS+CF +
Sbjct: 4 SSSVKARVVIGCGKKQSGDYLDGVAPDGLMGLGPAEISVPSFLSKAGLMRNSFSLCFDEE 63
Query: 277 GTGRISFGDKGSPGQGETPFSL--RQTHPTYNITITQVSVGGNAV-NFEFSAIFDSGTSF 333
+GRI FGD G Q TPF + Y + + +G + + F+ DSG SF
Sbjct: 64 DSGRIYFGDMGPSIQQSTPFLQLDNNKYSGYIVGVEACCIGNSCLKQTSFTTFIDSGQSF 123
Query: 334 TYLNDPAYTQISETFNSLAKEKRETSTSD----LPFEYCY 369
TYL + Y ++ +L ++ +TS + +EYCY
Sbjct: 124 TYLPEEIYRKV-----ALEIDRHINATSKNFEGVSWEYCY 158
>gi|414881575|tpg|DAA58706.1| TPA: hypothetical protein ZEAMMB73_168363 [Zea mays]
Length = 506
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 86/296 (29%), Positives = 133/296 (44%), Gaps = 36/296 (12%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
L++T + +G P + V +DTGSD+ W+ +C+SC SG +D Y P SS+
Sbjct: 86 LYFTEIKLGTPPKRYYVQVDTGSDILWV--NCISCSK-CPRKSGLGLDLTFYDPKASSSG 142
Query: 164 SKVPCNSTLCELQ--KQCPSAGSN--CPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQS 217
S V C+ C + P +N C Y V Y DG+ +TGF + D L T + Q+
Sbjct: 143 STVSCDQGFCAATYGGKLPGCTANVPCEYSVMY-GDGSSTTGFFITDALQFDQVTGDGQT 201
Query: 218 KSVDSRISFGCGRVQTGSFLD-GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS- 275
+ ++ I+FGCG Q G + A +G+ G G TS+ S LA G F+ C +
Sbjct: 202 QPGNATITFGCGAQQGGDLGNSNQALDGILGFGQANTSMLSQLAAAGKAKKIFAHCLDTI 261
Query: 276 DGTGRISFGDKGSP----------GQGETPFSLRQ----THPTYNITITQVSVGGNAVNF 321
G G + G+ P G P L + P YN+ + + VGG +
Sbjct: 262 KGGGIFAIGNVVQPKCYFVFFFAHGLLNIPLFLLVMILLSRPHYNVNLKSIDVGGTTLQL 321
Query: 322 ---------EFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSD-LPFEY 367
+ I DSGT+ TYL + + Q+ + S ++ + D L F+Y
Sbjct: 322 PAHVFETGEKKGTIIDSGTTLTYLPELVFKQVMDVVFSKHRDIAFHNLQDFLCFQY 377
>gi|255549236|ref|XP_002515672.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223545215|gb|EEF46724.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 492
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 87/284 (30%), Positives = 134/284 (47%), Gaps = 32/284 (11%)
Query: 82 NDKTPLTFSAGND-----TYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPC-DC 135
+D+ L AG D + R +++G L+Y V +G P+ + V +DTGSD+ W+ C C
Sbjct: 59 DDRRQLRILAGVDLPLGGSGRPDTVG-LYYAKVGIGTPSKDYYVQVDTGSDIMWVNCIQC 117
Query: 136 VSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCP----SAGSNCPYQVR 191
C SS G ++ +Y+ S + VPC+ C P +A +CPY
Sbjct: 118 RECPR--TSSLG--MELTLYNIKDSVSGKLVPCDEEFCYEVNGGPLSGCTANMSCPYLEI 173
Query: 192 YLSDGTMSTGFLVEDVLHL--ATDEKQSKSVDSRISFGCGRVQTGSF--LDGAAPNGLFG 247
Y DG+ + G+ V+DV+ + + Q+ S + + FGCG Q+G A +G+ G
Sbjct: 174 Y-GDGSSTAGYFVKDVVQYDRVSGDLQTTSSNGSVIFGCGARQSGDLGPTSEEALDGILG 232
Query: 248 LGMDKTSVPSILANQGLIPNSFSMCF-GSDGTGRISFGDKGSPGQGETPFSLRQTHPTYN 306
G +S+ S LA + F+ C G +G G + G P TP Q H YN
Sbjct: 233 FGKSNSSMISQLAATRKVKKIFAHCLDGINGGGIFAIGHVVQPKVNMTPLIPNQPH--YN 290
Query: 307 ITITQVSVGGNAVNF---EFS------AIFDSGTSFTYLNDPAY 341
+ +T V VG + ++ EF AI DSGT+ YL + Y
Sbjct: 291 VNMTAVQVGEDFLHLPTEEFEAGDRKGAIIDSGTTLAYLPEIVY 334
>gi|302776054|ref|XP_002971323.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
gi|300161305|gb|EFJ27921.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
Length = 395
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 97/289 (33%), Positives = 130/289 (44%), Gaps = 36/289 (12%)
Query: 73 RGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLP 132
RGR LA +G D FS G L+ G L++T V +G P +IV +DTGSD+ W+
Sbjct: 5 RGRFLA-EGVD-----FSLGGTADPLS--GGLYFTQVGLGNPVKHYIVQVDTGSDVLWVN 56
Query: 133 CDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE-----LQKQCPSAGSNCP 187
C S G S I +Y P SST+S V C+ LC + QC +NC
Sbjct: 57 CRPCS---GCPRKSALNIPLTMYDPRESSTTSLVSCSDPLCVRGRRFAEAQCSQTTNNCE 113
Query: 188 YQVRYLSDGTMSTGFLVEDVLHLATDEKQS-KSVDSRISFGCGRVQTGSF-LDGAAPNGL 245
Y Y DG+ S G+ V D + + S++ FGC QTG A +G+
Sbjct: 114 YIFSY-GDGSTSEGYYVRDAMQYNVISSNGLANTTSQVLFGCSIRQTGDLSTSQQAVDGI 172
Query: 246 FGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGR----ISFGDKGSPGQGETPFSLRQT 301
G G + SVP+ LA Q IP FS C +G R + G PG TP
Sbjct: 173 IGFGQLELSVPNQLAAQQNIPRVFSHCL--EGEKRGGGILVIGGIAEPGMTYTPLVPDSV 230
Query: 302 HPTYNITITQVSVGGNAVNF---EFSA------IFDSGTSFTYLNDPAY 341
H YN+ + +SV N + +FS+ I DSGT+ Y AY
Sbjct: 231 H--YNVVLRGISVNSNRLPIDAEDFSSTNDTGVIMDSGTTLAYFPSGAY 277
>gi|168027607|ref|XP_001766321.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162682535|gb|EDQ68953.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 381
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 91/282 (32%), Positives = 129/282 (45%), Gaps = 43/282 (15%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
L+Y + +G PA + + +DTGSDL WL CD C SC G + +Y P +
Sbjct: 22 LYYMAMLIGAPAKLYYLDMDTGSDLTWLQCDAPCRSCASGPH---------GLYDPKKAR 72
Query: 162 TSSKVPCNSTLCELQKQ-----CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLH-LATDEK 215
V C LC L +Q C C Y V Y +DG+ + G L+ED + L T+
Sbjct: 73 L---VDCRVPLCALVQQGGSYACGGPVRQCDYDVEY-ADGSSTMGVLMEDTITLLLTNGT 128
Query: 216 QSKSVDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF- 273
+SK+ GCG Q G+ A+ +G+ GL K S+PS LA +G++ N C
Sbjct: 129 RSKTT---AIIGCGYDQQGTLAQTPASTDGVMGLSSAKISLPSQLAKKGIVRNVIGHCLA 185
Query: 274 -GSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS------AI 326
GS+G G + FGD P G T + T NI GG + + + +
Sbjct: 186 GGSNGGGYLFFGDSLVPALGMTWTPIMGKSITGNI-------GGKSGDADDKTGDIGGVM 238
Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEK---RETSTSDLPF 365
FDSGTSFTYL AY + ++ R + + LPF
Sbjct: 239 FDSGTSFTYLVPEAYNAVLSAMEMQVEKSGLVRIKTDNTLPF 280
>gi|356568507|ref|XP_003552452.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 481
Score = 104 bits (260), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 80/262 (30%), Positives = 120/262 (45%), Gaps = 28/262 (10%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L+YT + +G + V +DTGSD W+ C C +C SG +D +Y PN S T
Sbjct: 75 LYYTKIGLGPK--DYYVQVDTGSDTLWVNCVGCTAC----PKKSGLGMDLTLYDPNLSKT 128
Query: 163 SSKVPCNSTLC----ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
S VPC+ C + Q + G +CPY + Y DG+ ++G ++D L +
Sbjct: 129 SKAVPCDDEFCTSTYDGQISGCTKGMSCPYSITY-GDGSTTSGSYIKDDLTFDRVVGDLR 187
Query: 219 SV--DSRISFGCGRVQTGSF--LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG 274
+V ++ + FGCG Q+G+ + +G+ G G +SV S LA G + FS C
Sbjct: 188 TVPDNTSVIFGCGSKQSGTLSSTTDTSLDGIIGFGQANSSVLSQLAAAGKVKRIFSHCLD 247
Query: 275 S-DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS--------- 324
S G G + G+ P TP L Q YN+ + + V G+ +
Sbjct: 248 SISGGGIFAIGEVVQPKVKTTP--LLQGMAHYNVVLKDIEVAGDPIQLPSDILDSSSGRG 305
Query: 325 AIFDSGTSFTYLNDPAYTQISE 346
I DSGT+ YL Y Q+ E
Sbjct: 306 TIIDSGTTLAYLPVSIYDQLLE 327
>gi|357124567|ref|XP_003563970.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 395
Score = 104 bits (259), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 81/278 (29%), Positives = 125/278 (44%), Gaps = 33/278 (11%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHG----LNSSSGQVIDFNIYSPN 158
+YT++++G P + + +DTGSD W+ CD C +C G + G+++
Sbjct: 16 YYTSINIGNPPRPYFLDIDTGSDFTWIHCDAPCTNCTKGPHPVYKPTEGKIVH------P 69
Query: 159 TSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
++ N CE KQ C Y++ Y +D + S G L D + L T + + K
Sbjct: 70 RDPLCEELQGNQNYCETCKQ-------CDYEITY-ADRSSSKGVLARDNMQLTTADGEMK 121
Query: 219 SVDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDG 277
+VD FGC Q G LD + +G+ GL S+ + LAN G+I N F C +D
Sbjct: 122 NVD--FVFGCAHNQQGKLLDSPTSTDGILGLSNGAISLSTQLANSGIISNVFGHCMATDP 179
Query: 278 T--GRISFGDKGSPGQGETPFSLRQ-THPTYNITITQVSVGGNAVNFEFSA------IFD 328
+ G + GD P G T +R Y+ + +V+ G +N A IFD
Sbjct: 180 SSGGYMFLGDDYVPRWGMTWVPIRNGPGNVYSTEVPKVNYGAQELNLRGQAGKLTQVIFD 239
Query: 329 SGTSFTYLNDPAYTQISETFNSLAKE-KRETSTSDLPF 365
SG+S+TY YT + + R+ S LPF
Sbjct: 240 SGSSYTYFPHEIYTNLIALLEDASPGFVRDESDQTLPF 277
>gi|222616728|gb|EEE52860.1| hypothetical protein OsJ_35411 [Oryza sativa Japonica Group]
Length = 395
Score = 104 bits (259), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 82/276 (29%), Positives = 120/276 (43%), Gaps = 29/276 (10%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
L+Y +S+G P + + +DTGSDL WL CD CVSC + +Y P +
Sbjct: 57 LYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSC---------SKVPHPLYRPTKNK 107
Query: 162 TSSKVPCNSTLCEL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
VPC +C + +C S C Y+++Y G+ S G LV D L
Sbjct: 108 L---VPCVDQMCAALHGGLTGRHKCDSPKQQCDYEIKYADQGS-SLGVLVTDSFALRL-- 161
Query: 215 KQSKSVDSRISFGCG-RVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
S V ++FGCG Q GS + +A +G+ GLG S+ S L G+ N C
Sbjct: 162 ANSSIVRPGLAFGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCL 221
Query: 274 GSDGTGRISFGDKGSPGQGET--PFSLRQTHPTYNITITQVSVGGNAVNFE-FSAIFDSG 330
+ G G + FGD P T P + + Y+ + GG + +FDSG
Sbjct: 222 STRGGGFLFFGDDIVPYSRATWAPMARSTSRNYYSPGSANLYFGGRPLGVRPMEVVFDSG 281
Query: 331 TSFTYLNDPAYTQISETFN-SLAKEKRETSTSDLPF 365
+SFTY + Y + + L+K +E LP
Sbjct: 282 SSFTYFSAQPYQALVDAIKGDLSKNLKEVPDHSLPL 317
>gi|218186522|gb|EEC68949.1| hypothetical protein OsI_37668 [Oryza sativa Indica Group]
Length = 421
Score = 104 bits (259), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 82/276 (29%), Positives = 120/276 (43%), Gaps = 29/276 (10%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
L+Y +S+G P + + +DTGSDL WL CD CVSC + +Y P +
Sbjct: 57 LYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCSK---------VPHPLYRPTKNK 107
Query: 162 TSSKVPCNSTLCEL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
VPC +C + +C S C Y+++Y G+ S G LV D L
Sbjct: 108 L---VPCVDQMCAALHGGLTGRHKCDSPKQQCDYEIKYADQGS-SLGVLVTDSFALRL-- 161
Query: 215 KQSKSVDSRISFGCG-RVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
S V ++FGCG Q GS + +A +G+ GLG S+ S L G+ N C
Sbjct: 162 ANSSIVRPGLAFGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCL 221
Query: 274 GSDGTGRISFGDKGSPGQGET--PFSLRQTHPTYNITITQVSVGGNAVNFE-FSAIFDSG 330
+ G G + FGD P T P + + Y+ + GG + +FDSG
Sbjct: 222 STRGGGFLFFGDDIVPYSRATWAPMARSTSRNYYSPGSANLYFGGRPLGVRPMEVVFDSG 281
Query: 331 TSFTYLNDPAYTQISETFN-SLAKEKRETSTSDLPF 365
+SFTY + Y + + L+K +E LP
Sbjct: 282 SSFTYFSAQPYQALVDAIKGDLSKNLKEVPDHSLPL 317
>gi|115448471|ref|NP_001048015.1| Os02g0730700 [Oryza sativa Japonica Group]
gi|46390468|dbj|BAD15929.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
Group]
gi|46390864|dbj|BAD16368.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
Group]
gi|113537546|dbj|BAF09929.1| Os02g0730700 [Oryza sativa Japonica Group]
gi|215697021|dbj|BAG91015.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222623612|gb|EEE57744.1| hypothetical protein OsJ_08261 [Oryza sativa Japonica Group]
Length = 573
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 82/260 (31%), Positives = 117/260 (45%), Gaps = 40/260 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
+YT++ VG P + + +DTGSDL W+ CD C +C G + +Y P +
Sbjct: 203 YYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHP---------LYKP---AK 250
Query: 163 SSKVPCNSTLCEL----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
VP LC+ Q C + C Y++ Y +D + S G L D +H+ T +
Sbjct: 251 EKIVPPKDLLCQELQGNQNYCETC-KQCDYEIEY-ADRSSSMGVLARDDMHIITTNGGRE 308
Query: 219 SVDSRISFGCGRVQTGSFLDGAAP-NGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD- 276
+D FGC Q G L A +G+ GL S+PS LANQG+I N F C D
Sbjct: 309 KLD--FVFGCAYDQQGQLLASPAKTDGILGLSSAGISLPSQLANQGIISNVFGHCITRDP 366
Query: 277 -GTGRISFGDKGSPGQGETPFSLR---------QTHPTY--NITITQVSVGGNAVNFEFS 324
G G + GD P G T +R + Y + ++ GN+V
Sbjct: 367 NGGGYMFLGDDYVPRWGMTSTPIRSAPDNLFHTEAQKVYYGDQQLSMRGASGNSVQ---- 422
Query: 325 AIFDSGTSFTYLNDPAYTQI 344
IFDSG+S+TYL D Y +
Sbjct: 423 VIFDSGSSYTYLPDEIYKNL 442
>gi|218191512|gb|EEC73939.1| hypothetical protein OsI_08807 [Oryza sativa Indica Group]
Length = 574
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 82/260 (31%), Positives = 117/260 (45%), Gaps = 40/260 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
+YT++ VG P + + +DTGSDL W+ CD C +C G + +Y P +
Sbjct: 204 YYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHP---------LYKP---AK 251
Query: 163 SSKVPCNSTLCEL----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
VP LC+ Q C + C Y++ Y +D + S G L D +H+ T +
Sbjct: 252 EKIVPPKDLLCQELQGNQNYCETC-KQCDYEIEY-ADRSSSMGVLARDDMHIITTNGGRE 309
Query: 219 SVDSRISFGCGRVQTGSFLDGAAP-NGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD- 276
+D FGC Q G L A +G+ GL S+PS LANQG+I N F C D
Sbjct: 310 KLD--FVFGCAYDQQGQLLASPAKTDGILGLSSAGISLPSQLANQGIISNVFGHCITRDP 367
Query: 277 -GTGRISFGDKGSPGQGETPFSLR---------QTHPTY--NITITQVSVGGNAVNFEFS 324
G G + GD P G T +R + Y + ++ GN+V
Sbjct: 368 NGGGYMFLGDDYVPRWGMTSTPIRSAPDNLFHTEAQKVYYGDQQLSMRGASGNSVQ---- 423
Query: 325 AIFDSGTSFTYLNDPAYTQI 344
IFDSG+S+TYL D Y +
Sbjct: 424 VIFDSGSSYTYLPDEIYKNL 443
>gi|359482287|ref|XP_002263129.2| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
vinifera]
gi|297740017|emb|CBI30199.3| unnamed protein product [Vitis vinifera]
Length = 502
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 92/296 (31%), Positives = 136/296 (45%), Gaps = 31/296 (10%)
Query: 66 RDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTG 125
R R GR L T +D Y + L++T V +G P F V +DTG
Sbjct: 51 RARDQARHGRLLRGVVGGVVDFTVYGTSDPYLVG----LYFTKVKLGSPPREFNVQIDTG 106
Query: 126 SDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP-----CNSTLCELQKQC 179
SD+ W+ C+ C C +SG I+ + + P++SST+S V C S + +C
Sbjct: 107 SDILWVTCNSCNDCPR----TSGLGIELSFFDPSSSSTTSLVSCSHPICTSLVQTTAAEC 162
Query: 180 PSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS--RISFGCGRVQTGSFL 237
+ C Y Y DG+ +TG+ V D+L+ T S +S I FGC Q+G
Sbjct: 163 SPQSNQCSYSFHY-GDGSGTTGYYVSDMLYFDTVLGDSLIANSSASIVFGCSTYQSGDLT 221
Query: 238 D-GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISFGDKGSPGQGET 294
A +G+FG G SV S L++ G+ P FS C DG G++ G+ P +
Sbjct: 222 KVDKAIDGIFGFGQQDLSVVSQLSSLGITPKVFSHCLKGEGDGGGKLVLGEILEPNIIYS 281
Query: 295 PFSLRQTHPTYNITITQVSVGGNAVNFEFSA---------IFDSGTSFTYLNDPAY 341
P Q+H YN+ + +SV G + + + I DSGT+ TYL + AY
Sbjct: 282 PLVPSQSH--YNLNLQSISVNGQLLPIDPAVFATSNNQGTIVDSGTTLTYLVETAY 335
>gi|115487628|ref|NP_001066301.1| Os12g0177500 [Oryza sativa Japonica Group]
gi|108862256|gb|ABA96613.2| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113648808|dbj|BAF29320.1| Os12g0177500 [Oryza sativa Japonica Group]
gi|215693997|dbj|BAG89196.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 421
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 82/276 (29%), Positives = 120/276 (43%), Gaps = 29/276 (10%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
L+Y +S+G P + + +DTGSDL WL CD CVSC + +Y P +
Sbjct: 57 LYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCSK---------VPHPLYRPTKNK 107
Query: 162 TSSKVPCNSTLCEL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
VPC +C + +C S C Y+++Y G+ S G LV D L
Sbjct: 108 L---VPCVDQMCAALHGGLTGRHKCDSPKQQCDYEIKYADQGS-SLGVLVTDSFALRL-- 161
Query: 215 KQSKSVDSRISFGCG-RVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
S V ++FGCG Q GS + +A +G+ GLG S+ S L G+ N C
Sbjct: 162 ANSSIVRPGLAFGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCL 221
Query: 274 GSDGTGRISFGDKGSPGQGET--PFSLRQTHPTYNITITQVSVGGNAVNFE-FSAIFDSG 330
+ G G + FGD P T P + + Y+ + GG + +FDSG
Sbjct: 222 STRGGGFLFFGDDIVPYSRATWAPMARSTSRNYYSPGSANLYFGGRPLGVRPMEVVFDSG 281
Query: 331 TSFTYLNDPAYTQISETFN-SLAKEKRETSTSDLPF 365
+SFTY + Y + + L+K +E LP
Sbjct: 282 SSFTYFSAQPYQALVDAIKGDLSKNLKEVPDHSLPL 317
>gi|307103543|gb|EFN51802.1| hypothetical protein CHLNCDRAFT_59135 [Chlorella variabilis]
Length = 746
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 80/249 (32%), Positives = 116/249 (46%), Gaps = 26/249 (10%)
Query: 106 YTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
Y + +G PA F V +DTGS + ++PC G N + P SST+S+
Sbjct: 79 YATLYLGTPAKKFAVIVDTGSTMTYVPCSSCGSGCGPNHQDA------AFDPEASSTASR 132
Query: 166 VPCNSTLCEL-QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRI 224
+ C S C +C + C Y R ++ + S+G L+EDVL L + I
Sbjct: 133 ISCTSPKCSCGSPRCGCSTQQCTY-TRSYAEQSSSSGILLEDVLAL-----HDGLPGAPI 186
Query: 225 SFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGTGRISF 283
FGC +TG A +GLFGLG SV + L G+I + FS+CFG +G G +
Sbjct: 187 IFGCETRETGEIFRQRA-DGLFGLGNSDASVVNQLVKAGVIDDVFSLCFGMVEGDGALLL 245
Query: 284 GDKGSPGQ---GETPFSLRQTHP-TYNITITQVSVGGNAVNFE-------FSAIFDSGTS 332
GD PG TP THP YN+ + ++V G + + + DSGT+
Sbjct: 246 GDAEVPGSISLQYTPLLTSTTHPFYYNVKMLSLAVEGQLLPVSQSLFDQGYGTVLDSGTT 305
Query: 333 FTYLNDPAY 341
FTY+ P +
Sbjct: 306 FTYMPSPVF 314
>gi|297849132|ref|XP_002892447.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297338289|gb|EFH68706.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 493
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 90/271 (33%), Positives = 130/271 (47%), Gaps = 24/271 (8%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
L+YT V +G P F V +DTGSD+ W+ C SC +G +S I + + P SS++
Sbjct: 83 LYYTKVKLGTPPREFNVQIDTGSDVLWV--SCTSC-NGCPKTSELQIQLSFFDPGVSSSA 139
Query: 164 SKVPCNSTLCELQKQCPSAGSN---CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
S V C+ C Q S S C Y +Y DG+ ++GF + D + T + ++
Sbjct: 140 SLVSCSDRRCYSNFQTESGCSPNNLCSYSFKY-GDGSGTSGFYISDFMSFDTVITSTLAI 198
Query: 221 DSR--ISFGCGRVQTGSFLD-GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD- 276
+S FGC +QTG A +G+FGLG SV S LA QGL P FS C D
Sbjct: 199 NSSAPFVFGCSNLQTGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCLKGDK 258
Query: 277 -GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA---------I 326
G G + G P TP L + P YN+ + ++V G + + S I
Sbjct: 259 SGGGIMVLGQIKRPDTVYTP--LVPSQPHYNVNLQSIAVNGQILPIDPSVFTIATGDGTI 316
Query: 327 FDSGTSFTYLNDPAYTQ-ISETFNSLAKEKR 356
D+GT+ YL D AY+ I N++++ R
Sbjct: 317 IDTGTTLAYLPDEAYSPFIQAIANAVSQYGR 347
>gi|108862257|gb|ABA96612.2| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
Length = 451
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 82/276 (29%), Positives = 120/276 (43%), Gaps = 29/276 (10%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
L+Y +S+G P + + +DTGSDL WL CD CVSC + +Y P +
Sbjct: 57 LYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCSK---------VPHPLYRPTKNK 107
Query: 162 TSSKVPCNSTLCEL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
VPC +C + +C S C Y+++Y G+ S G LV D L
Sbjct: 108 L---VPCVDQMCAALHGGLTGRHKCDSPKQQCDYEIKYADQGS-SLGVLVTDSFALRL-- 161
Query: 215 KQSKSVDSRISFGCG-RVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
S V ++FGCG Q GS + +A +G+ GLG S+ S L G+ N C
Sbjct: 162 ANSSIVRPGLAFGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCL 221
Query: 274 GSDGTGRISFGDKGSPGQGET--PFSLRQTHPTYNITITQVSVGGNAVNFE-FSAIFDSG 330
+ G G + FGD P T P + + Y+ + GG + +FDSG
Sbjct: 222 STRGGGFLFFGDDIVPYSRATWAPMARSTSRNYYSPGSANLYFGGRPLGVRPMEVVFDSG 281
Query: 331 TSFTYLNDPAYTQISETFN-SLAKEKRETSTSDLPF 365
+SFTY + Y + + L+K +E LP
Sbjct: 282 SSFTYFSAQPYQALVDAIKGDLSKNLKEVPDHSLPL 317
>gi|147802609|emb|CAN73001.1| hypothetical protein VITISV_037997 [Vitis vinifera]
Length = 424
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 86/260 (33%), Positives = 120/260 (46%), Gaps = 21/260 (8%)
Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPN 158
LG+ +Y ++S+GQP + + TGSDL WL CD CV C + +Y PN
Sbjct: 64 LGY-YYVSLSIGQPPXPYFLDPXTGSDLSWLQCDAPCVRCTKAXHX---------LYRPN 113
Query: 159 TSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
+ K P + L +C C Y+V Y +DG S G LV+DV L +
Sbjct: 114 NNLVICKDPMCAXLHPPGYKCEHP-EQCDYEVEY-ADGGSSLGVLVKDVFPL--NFTNGL 169
Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT 278
+ R++ GCG Q +G+ GLG K+S+ S L +QG+I N C S G
Sbjct: 170 RLAPRLALGCGYDQIPG-XSYHPLDGVLGLGKGKSSIVSQLHSQGVIRNVVGHCVSSHGG 228
Query: 279 GRISFGDK--GSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE-FSAIFDSGTSFTY 335
G + FGD S TP LR H Y+ ++ +GG F+ FDSG+S+TY
Sbjct: 229 GFLFFGDDLYDSSRVVWTPM-LRDQHTHYSSGYAELILGGKTTVFKNLLVTFDSGSSYTY 287
Query: 336 LNDPAYTQISETFNSLAKEK 355
LN AY + EK
Sbjct: 288 LNSLAYQALVHLVRKELSEK 307
>gi|312282457|dbj|BAJ34094.1| unnamed protein product [Thellungiella halophila]
Length = 424
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 87/280 (31%), Positives = 131/280 (46%), Gaps = 31/280 (11%)
Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTS 160
LG+ + T +++GQP + + LDTGSDL WL CD CVH L + +Y P
Sbjct: 54 LGYYNVT-INIGQPPRPYYLDLDTGSDLTWLQCD-APCVHCLEAPH------PLYQP--- 102
Query: 161 STSSKVPCNSTLCEL-----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEK 215
++ +PCN LC+ +C + C Y+V Y +DG S G LV DV L +
Sbjct: 103 -SNDLIPCNDPLCKALHFNGNHRCETP-EQCDYEVEY-ADGGSSLGVLVRDVFSL--NYT 157
Query: 216 QSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
+ + R++ GCG Q +G+ GLG K S+ S L +QG + N C S
Sbjct: 158 KGLRLTPRLALGCGYDQIPGASGHHPLDGVLGLGRGKVSILSQLHSQGYVKNVVGHCLSS 217
Query: 276 DGTGRISFGDK--GSPGQGETPFSLRQTHPTYNITI-TQVSVGGNAVNFE-FSAIFDSGT 331
G G + FG+ S TP + R+ Y+ + ++ GG + +FDSG+
Sbjct: 218 LGGGILFFGNDLYDSSRVSWTPMA-RENSKHYSPAMGGELLFGGRTTGLKNLLTVFDSGS 276
Query: 332 SFTYLNDPAYTQIS-----ETFNSLAKEKRETSTSDLPFE 366
S+TY N AY ++ E KE R+ T L ++
Sbjct: 277 SYTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQ 316
>gi|218194598|gb|EEC77025.1| hypothetical protein OsI_15381 [Oryza sativa Indica Group]
Length = 422
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 73/255 (28%), Positives = 120/255 (47%), Gaps = 21/255 (8%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLP-CDCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L+YT++ +G PA+ + V LDTGS FW+ C C H S + Y P +S +
Sbjct: 58 LYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPH----ESDILRKLTFYDPRSSVS 113
Query: 163 SSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLAT--DEKQSKSV 220
S +V C+ T+C + C + CPY Y +DG ++ G L D+LH Q++
Sbjct: 114 SKEVKCDDTICTSRPPC-NMTLRCPYITGY-ADGGLTMGILFTDLLHYHQLYGNGQTQPT 171
Query: 221 DSRISFGCGRVQTGSFLDGA-APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGT 278
+ ++FGCG Q+GS + A A +G+ G G + S LA G FS C S +G
Sbjct: 172 STSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTNGG 231
Query: 279 GRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS---------AIFDS 329
G + G+ P TP ++ + + + ++V G + + DS
Sbjct: 232 GIFAIGEVVEPKVKTTPI-VKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKTKGTFIDS 290
Query: 330 GTSFTYLNDPAYTQI 344
G++ YL + Y+++
Sbjct: 291 GSTLVYLPEIIYSEL 305
>gi|297723027|ref|NP_001173877.1| Os04g0336942 [Oryza sativa Japonica Group]
gi|255675342|dbj|BAH92605.1| Os04g0336942 [Oryza sativa Japonica Group]
Length = 388
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 73/255 (28%), Positives = 120/255 (47%), Gaps = 21/255 (8%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLP-CDCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L+YT++ +G PA+ + V LDTGS FW+ C C H S + Y P +S +
Sbjct: 82 LYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPH----ESDILRKLTFYDPRSSVS 137
Query: 163 SSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLAT--DEKQSKSV 220
S +V C+ T+C + C + CPY Y +DG ++ G L D+LH Q++
Sbjct: 138 SKEVKCDDTICTSRPPC-NMTLRCPYITGY-ADGGLTMGILFTDLLHYHQLYGNGQTQPT 195
Query: 221 DSRISFGCGRVQTGSFLDGA-APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGT 278
+ ++FGCG Q+GS + A A +G+ G G + S LA G FS C S +G
Sbjct: 196 STSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTNGG 255
Query: 279 GRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS---------AIFDS 329
G + G+ P TP ++ + + + ++V G + + DS
Sbjct: 256 GIFAIGEVVEPKVKTTPI-VKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKTKGTFIDS 314
Query: 330 GTSFTYLNDPAYTQI 344
G++ YL + Y+++
Sbjct: 315 GSTLVYLPEIIYSEL 329
>gi|115457772|ref|NP_001052486.1| Os04g0334700 [Oryza sativa Japonica Group]
gi|113564057|dbj|BAF14400.1| Os04g0334700 [Oryza sativa Japonica Group]
Length = 482
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 73/255 (28%), Positives = 120/255 (47%), Gaps = 21/255 (8%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLP-CDCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L+YT++ +G PA+ + V LDTGS FW+ C C H S + Y P +S +
Sbjct: 82 LYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPH----ESDILRKLTFYDPRSSVS 137
Query: 163 SSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLAT--DEKQSKSV 220
S +V C+ T+C + C + CPY Y +DG ++ G L D+LH Q++
Sbjct: 138 SKEVKCDDTICTSRPPC-NMTLRCPYITGY-ADGGLTMGILFTDLLHYHQLYGNGQTQPT 195
Query: 221 DSRISFGCGRVQTGSFLDGA-APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGT 278
+ ++FGCG Q+GS + A A +G+ G G + S LA G FS C S +G
Sbjct: 196 STSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTNGG 255
Query: 279 GRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS---------AIFDS 329
G + G+ P TP ++ + + + ++V G + + DS
Sbjct: 256 GIFAIGEVVEPKVKTTPI-VKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKTKGTFIDS 314
Query: 330 GTSFTYLNDPAYTQI 344
G++ YL + Y+++
Sbjct: 315 GSTLVYLPEIIYSEL 329
>gi|357464807|ref|XP_003602685.1| Aspartic proteinase Asp1 [Medicago truncatula]
gi|355491733|gb|AES72936.1| Aspartic proteinase Asp1 [Medicago truncatula]
Length = 440
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 94/295 (31%), Positives = 130/295 (44%), Gaps = 34/295 (11%)
Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPN 158
+GF + T +++G P + + +DTGSDL WL CD C C + +Y P
Sbjct: 82 VGFYNVT-INIGYPPRPYFLDIDTGSDLTWLQCDAPCSRCSQTPHP---------LYRP- 130
Query: 159 TSSTSSKVPCNSTLCELQKQCPS----AGSNCPYQVRYLSDGTMSTGFLVEDVLHLA-TD 213
++ VPC LC Q + C Y+V Y +D S G LV DV L T+
Sbjct: 131 ---SNDLVPCRHPLCASVHQTDNYECEVEHQCDYEVEY-ADHYSSLGVLVNDVYVLNFTN 186
Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
Q K R++ GCG Q +G+ GLG K+S+ S L QGL+ N C
Sbjct: 187 GVQLKV---RMALGCGYDQIFPDSSYHPVDGMLGLGRGKSSLISQLNGQGLVRNVVGHCL 243
Query: 274 GSDGTGRISFGD-KGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-EFSAIFDSGT 331
+ G G I FGD S TP S R + Y+ ++ +GG F A+FD+G+
Sbjct: 244 SAQGGGYIFFGDVYDSSRLAWTPMSSRD-YKHYSAGAAELVLGGKRTGFGNLLAVFDAGS 302
Query: 332 SFTYLNDPAYTQISETFNSLAKEKRETSTSDL------PFEYCYVLRSFLHLQAL 380
S+TY N AY E KE E T L PF Y ++ + AL
Sbjct: 303 SYTYFNSNAYQLTKELAGKPIKEAPEDQTLPLCWYGKRPFRSVYEVKKYFKPIAL 357
>gi|215766660|dbj|BAG98888.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 433
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 73/255 (28%), Positives = 120/255 (47%), Gaps = 21/255 (8%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLP-CDCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L+YT++ +G PA+ + V LDTGS FW+ C C H S + Y P +S +
Sbjct: 82 LYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPH----ESDILRKLTFYDPRSSVS 137
Query: 163 SSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLAT--DEKQSKSV 220
S +V C+ T+C + C + CPY Y +DG ++ G L D+LH Q++
Sbjct: 138 SKEVKCDDTICTSRPPC-NMTLRCPYITGY-ADGGLTMGILFTDLLHYHQLYGNGQTQPT 195
Query: 221 DSRISFGCGRVQTGSFLDGA-APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGT 278
+ ++FGCG Q+GS + A A +G+ G G + S LA G FS C S +G
Sbjct: 196 STSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTNGG 255
Query: 279 GRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS---------AIFDS 329
G + G+ P TP ++ + + + ++V G + + DS
Sbjct: 256 GIFAIGEVVEPKVKTTPI-VKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKTKGTFIDS 314
Query: 330 GTSFTYLNDPAYTQI 344
G++ YL + Y+++
Sbjct: 315 GSTLVYLPEIIYSEL 329
>gi|15010764|gb|AAK74041.1| AT3g51330/F24M12_370 [Arabidopsis thaliana]
gi|23505835|gb|AAN28777.1| At3g51330/F24M12_370 [Arabidopsis thaliana]
Length = 260
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 51/103 (49%), Positives = 69/103 (66%), Gaps = 2/103 (1%)
Query: 271 MCFGS--DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIFD 328
MCFG+ D GRISFGDKG Q ETP + PTY +++T+VSVGG+AV + A+FD
Sbjct: 1 MCFGNIIDVVGRISFGDKGYTDQMETPLLPTEPSPTYAVSVTEVSVGGDAVGVQLLALFD 60
Query: 329 SGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVL 371
+GTSFT+L +P Y I++ F+ +KR +LPFE+CY L
Sbjct: 61 TGTSFTHLLEPEYGLITKAFDDHVTDKRRPIDPELPFEFCYDL 103
>gi|242065058|ref|XP_002453818.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
gi|241933649|gb|EES06794.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
Length = 490
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 84/273 (30%), Positives = 128/273 (46%), Gaps = 33/273 (12%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
L+YT + +G P+ + V +DTGSD+ W+ +C+ C G ++SG I+ Y P S T+
Sbjct: 84 LYYTQIEIGSPSKGYYVQVDTGSDILWV--NCIRC-DGCPTTSGLGIELTQYDPAGSGTT 140
Query: 164 SKVPCNSTLC------ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEK 215
V C+ C L CPS S C +++ Y DG+ +TGF V D + +
Sbjct: 141 --VGCDQEFCVANSPNGLPPACPSTSSPCQFRIAY-GDGSSTTGFYVSDSVQYNQVSGNG 197
Query: 216 QSKSVDSRISFGCGRVQTGSFLDGA--APNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
Q+ ++ I+FGCG Q G L + A +G+ G G +S+ S LA + F+ C
Sbjct: 198 QTTPSNASITFGCG-AQLGGDLGSSSQALDGILGFGQADSSMLSQLAAARKVRKIFAHCL 256
Query: 274 GS-DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS-------- 324
+ G G + G+ P TP TH YN+ + +SVGG + S
Sbjct: 257 DTVHGGGIFAIGNVVQPKVKTTPLVQNVTH--YNVNLQGISVGGATLQLPSSTFDSGDSK 314
Query: 325 -AIFDSGTSFTYLNDPAY----TQISETFNSLA 352
I DSGT+ YL Y T + + + LA
Sbjct: 315 GTIIDSGTTLAYLPREVYRTLLTAVFDKYQDLA 347
>gi|32479948|emb|CAE01594.1| OSJNBa0008A08.2 [Oryza sativa Japonica Group]
gi|38347627|emb|CAE05222.2| OSJNBa0011K22.4 [Oryza sativa Japonica Group]
gi|38567678|emb|CAE75961.1| B1159F04.24 [Oryza sativa Japonica Group]
gi|116309512|emb|CAH66578.1| OSIGBa0137O04.4 [Oryza sativa Indica Group]
Length = 431
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 73/255 (28%), Positives = 120/255 (47%), Gaps = 21/255 (8%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLP-CDCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L+YT++ +G PA+ + V LDTGS FW+ C C H S + Y P +S +
Sbjct: 58 LYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPH----ESDILRKLTFYDPRSSVS 113
Query: 163 SSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLAT--DEKQSKSV 220
S +V C+ T+C + C + CPY Y +DG ++ G L D+LH Q++
Sbjct: 114 SKEVKCDDTICTSRPPC-NMTLRCPYITGY-ADGGLTMGILFTDLLHYHQLYGNGQTQPT 171
Query: 221 DSRISFGCGRVQTGSFLDGA-APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGT 278
+ ++FGCG Q+GS + A A +G+ G G + S LA G FS C S +G
Sbjct: 172 STSVTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTNGG 231
Query: 279 GRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS---------AIFDS 329
G + G+ P TP ++ + + + ++V G + + DS
Sbjct: 232 GIFAIGEVVEPKVKTTPI-VKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKTKGTFIDS 290
Query: 330 GTSFTYLNDPAYTQI 344
G++ YL + Y+++
Sbjct: 291 GSTLVYLPEIIYSEL 305
>gi|218190722|gb|EEC73149.1| hypothetical protein OsI_07179 [Oryza sativa Indica Group]
Length = 494
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 93/297 (31%), Positives = 134/297 (45%), Gaps = 28/297 (9%)
Query: 55 GSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQP 114
G + SAL D R GR LAA PL S L + L++T + +G P
Sbjct: 51 GGEGHLSALREHDG--RRHGRLLAAI---DLPLGGSG------LATETGLYFTRIGIGTP 99
Query: 115 ALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC- 173
A + V +DTGSD+ W+ +CVSC G S I+ +Y P S + V C+ C
Sbjct: 100 AKRYYVQVDTGSDILWV--NCVSC-DGCPRKSNLGIELTMYDPRGSQSGELVTCDQQFCV 156
Query: 174 ----ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQSKSVDSRISFG 227
+ C S S C Y + Y DG+ + GF V D L + + Q+ ++ +SFG
Sbjct: 157 ANYGGVLPSCTST-SPCEYSISY-GDGSSTAGFFVTDFLQYNQVSGDGQTTPANASVSFG 214
Query: 228 CGRVQTGSF-LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGTGRISFGD 285
CG G A +G+ G G +S+ S LA G + F+ C + +G G + G+
Sbjct: 215 CGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCLDTVNGGGIFAIGN 274
Query: 286 KGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYT 342
P TP L P YN+ + + VGG A+ + IFDSG S + D T
Sbjct: 275 VVQPKVKTTP--LVSDMPHYNVILKGIDVGGTALGLP-TNIFDSGNSKGTIIDSGTT 328
>gi|115446115|ref|NP_001046837.1| Os02g0473200 [Oryza sativa Japonica Group]
gi|47497549|dbj|BAD19621.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
Group]
gi|47847591|dbj|BAD21978.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
Group]
gi|113536368|dbj|BAF08751.1| Os02g0473200 [Oryza sativa Japonica Group]
Length = 494
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 93/297 (31%), Positives = 134/297 (45%), Gaps = 28/297 (9%)
Query: 55 GSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQP 114
G + SAL D R GR LAA PL S L + L++T + +G P
Sbjct: 51 GGEGHLSALREHDG--RRHGRLLAAI---DLPLGGSG------LATETGLYFTRIGIGTP 99
Query: 115 ALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC- 173
A + V +DTGSD+ W+ +CVSC G S I+ +Y P S + V C+ C
Sbjct: 100 AKRYYVQVDTGSDILWV--NCVSC-DGCPRKSNLGIELTMYDPRGSQSGELVTCDQQFCV 156
Query: 174 ----ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQSKSVDSRISFG 227
+ C S S C Y + Y DG+ + GF V D L + + Q+ ++ +SFG
Sbjct: 157 ANYGGVLPSCTST-SPCEYSISY-GDGSSTAGFFVTDFLQYNQVSGDGQTTPANASVSFG 214
Query: 228 CGRVQTGSF-LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGTGRISFGD 285
CG G A +G+ G G +S+ S LA G + F+ C + +G G + G+
Sbjct: 215 CGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCLDTVNGGGIFAIGN 274
Query: 286 KGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYT 342
P TP L P YN+ + + VGG A+ + IFDSG S + D T
Sbjct: 275 VVQPKVKTTP--LVPDMPHYNVILKGIDVGGTALGLP-TNIFDSGNSKGTIIDSGTT 328
>gi|326512066|dbj|BAJ96014.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 485
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 80/260 (30%), Positives = 120/260 (46%), Gaps = 27/260 (10%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L++T + +G PA S+ V +DTGSD+ W+ C C +C SG I+ +Y P+ SS+
Sbjct: 80 LYFTQIGIGTPAKSYYVQVDTGSDILWVNCVFCDTCPR----KSGLGIELTLYDPSGSSS 135
Query: 163 SSKVPCNSTLC-----ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEK 215
+ V C C + C A + C Y + Y DG+ +TGF V D L +
Sbjct: 136 GTGVTCGQDFCVATHGGVIPSCVPA-APCQYSISY-GDGSSTTGFFVTDFLQYNQVSGNS 193
Query: 216 QSKSVDSRISFGCGRVQTGSFLDGA-APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG 274
Q+ ++ I+FGCG G + A +G+ G G +S+ S LA G + F+ C
Sbjct: 194 QTTLANTSITFGCGAKIGGDLGSSSQALDGILGFGQSNSSMLSQLAAAGKVRKVFAHCLD 253
Query: 275 S-DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS--------- 324
+ +G G + GD P TP L P YN+ + + VGG + +
Sbjct: 254 TINGGGIFAIGDVVQPKVSTTP--LVPGMPHYNVNLEAIDVGGVKLQLPTNIFDIGESKG 311
Query: 325 AIFDSGTSFTYLNDPAYTQI 344
I DSGT+ YL Y I
Sbjct: 312 TIIDSGTTLAYLPGVVYNAI 331
>gi|224140237|ref|XP_002323490.1| predicted protein [Populus trichocarpa]
gi|222868120|gb|EEF05251.1| predicted protein [Populus trichocarpa]
Length = 478
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 83/269 (30%), Positives = 127/269 (47%), Gaps = 27/269 (10%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L++T V +G P F V +DTGSD+ W+ C+ C +C +SG I N + ++SST
Sbjct: 65 LYFTKVKLGSPPREFNVQIDTGSDVLWVCCNSCNNCPR----TSGLGIQLNFFDSSSSST 120
Query: 163 SSKVPCNSTLC-----ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
+ V C+ +C QC + C Y +Y DG+ ++G+ V D L+ +S
Sbjct: 121 AGLVHCSDPICTSAVQTTVTQCSPQTNQCSYTFQY-EDGSGTSGYYVSDTLYFDAILGES 179
Query: 218 KSVDSR--ISFGCGRVQTGSF-LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG 274
V+S I FGC Q+G + A +G+FG G + SV S L+ G+ P FS C
Sbjct: 180 LVVNSSALIVFGCSTFQSGDLTMTDKAVDGIFGFGQGELSVISQLSTHGITPRVFSHCLK 239
Query: 275 SD--GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA------- 325
+ G G + G+ PG +P L + P YN+ + ++V G + + S
Sbjct: 240 GEGIGGGILVLGEILEPGMVYSP--LVPSQPHYNLNLQSIAVNGKLLPIDPSVFATSNSQ 297
Query: 326 --IFDSGTSFTYLNDPAYTQISETFNSLA 352
I DSGT+ YL AY N +
Sbjct: 298 GTIVDSGTTLAYLVAEAYDPFVSAVNVIV 326
>gi|222622847|gb|EEE56979.1| hypothetical protein OsJ_06707 [Oryza sativa Japonica Group]
Length = 494
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 93/297 (31%), Positives = 134/297 (45%), Gaps = 28/297 (9%)
Query: 55 GSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQP 114
G + SAL D R GR LAA PL S L + L++T + +G P
Sbjct: 51 GGEGHLSALREHDG--RRHGRLLAAI---DLPLGGSG------LATETGLYFTRIGIGTP 99
Query: 115 ALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC- 173
A + V +DTGSD+ W+ +CVSC G S I+ +Y P S + V C+ C
Sbjct: 100 AKRYYVQVDTGSDILWV--NCVSC-DGCPRKSNLGIELTMYDPRGSQSGELVTCDQQFCV 156
Query: 174 ----ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQSKSVDSRISFG 227
+ C S S C Y + Y DG+ + GF V D L + + Q+ ++ +SFG
Sbjct: 157 ANYGGVLPSCTST-SPCEYSISY-GDGSSTAGFFVTDFLQYNQVSGDGQTTPANASVSFG 214
Query: 228 CGRVQTGSF-LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGTGRISFGD 285
CG G A +G+ G G +S+ S LA G + F+ C + +G G + G+
Sbjct: 215 CGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCLDTVNGGGIFAIGN 274
Query: 286 KGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYT 342
P TP L P YN+ + + VGG A+ + IFDSG S + D T
Sbjct: 275 VVQPKVKTTP--LVPDMPHYNVILKGIDVGGTALGLP-TNIFDSGNSKGTIIDSGTT 328
>gi|159464048|ref|XP_001690254.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
gi|158284242|gb|EDP09992.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
Length = 485
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 74/277 (26%), Positives = 129/277 (46%), Gaps = 32/277 (11%)
Query: 101 LGFLH------YTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFN 153
LG+ H YT + +G P +F V +DTGS + ++PC DC C G +++
Sbjct: 3 LGYRHTRHSYFYTTLKLGTPERTFSVIIDTGSTITYIPCKDCSHC--GKHTA-------E 53
Query: 154 IYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
+ P+ S+T+ K+ C LC + ++ Y R ++ + S G+++ED
Sbjct: 54 WFDPDKSTTAKKLACGDPLCNCGTPSCTCNNDRCYYSRTYAERSSSEGWMIEDTFGFPDS 113
Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
+ R+ FGC +TG A +G+ G+G + + S L + +I + FS+CF
Sbjct: 114 DSPV-----RLVFGCENGETGEIYRQMA-DGIMGMGNNHNAFQSQLVQRKVIEDVFSLCF 167
Query: 274 GSDGTGRISFGDKGSPGQGETPFSLRQTH---PTYNITITQVSVGGNAVNFE-------F 323
G G + GD P T ++ TH YN+ + ++V G + F+ +
Sbjct: 168 GYPKDGILLLGDVTLPEGANTVYTPLLTHLHLHYYNVKMDGITVNGQTLAFDASVFDRGY 227
Query: 324 SAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETST 360
+ DSGT+FTYL A+ +++ ++K ST
Sbjct: 228 GTVLDSGTTFTYLPTDAFKAMAKAVGDYVEKKGLQST 264
>gi|222637379|gb|EEE67511.1| hypothetical protein OsJ_24961 [Oryza sativa Japonica Group]
Length = 641
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 76/262 (29%), Positives = 126/262 (48%), Gaps = 25/262 (9%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFN--IYSPNTSSTSS 164
T + +G P+ F + +D+GS + ++PC S S +I+ + + P+ SST S
Sbjct: 93 TRLYIGTPSQEFALIVDSGSTVTYVPCATCEQCGNHQSESPNIIEAHDPRFQPDLSSTYS 152
Query: 165 KVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRI 224
V CN + C + S C Y+ +Y ++ + S+G L ED++ K+S+ R
Sbjct: 153 PVKCN-----VDCTCDNERSQCTYERQY-AEMSSSSGVLGEDIMSFG---KESELKPQRA 203
Query: 225 SFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFG 284
FGC +TG A +G+ GLG + S+ L +G+I +SFS+C+G G +
Sbjct: 204 VFGCENTETGDLFSQHA-DGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDVGGGTMV 262
Query: 285 DKGSPGQGETPFSLRQ--THPTYNITITQVSVGGNAV-------NFEFSAIFDSGTSFTY 335
G P + FS P YNI + ++ V G A+ N + + DSGT++ Y
Sbjct: 263 LGGMPAPPDMVFSHSNPVRSPYYNIELKEIHVAGKALRLDPKIFNSKHGTVLDSGTTYAY 322
Query: 336 LNDPAYT----QISETFNSLAK 353
L + A+ ++ NSL K
Sbjct: 323 LPEQAFVAFKDAVTNKVNSLKK 344
>gi|226492633|ref|NP_001149953.1| LOC100283580 precursor [Zea mays]
gi|195635701|gb|ACG37319.1| pepsin A [Zea mays]
Length = 491
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 104/361 (28%), Positives = 155/361 (42%), Gaps = 57/361 (15%)
Query: 12 VLLILLSCCAGCCFGFGTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRD--RY 69
+L++L + GC G F R P G +G + +AL D R+
Sbjct: 14 LLVLLFALSVGCASATGVF----QVRRKFPRHG--------GRGVAEHLAALRRHDANRH 61
Query: 70 FRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLF 129
RL G A G P DT L+YT + +G P + V +DTGSD+
Sbjct: 62 GRLLGAVDLALGGVGLP------TDT-------GLYYTRIEIGSPPKGYYVQVDTGSDIL 108
Query: 130 WLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC------ELQKQCPSAG 183
W+ +C+ C G + SG I+ Y P S T+ V C C + CPS
Sbjct: 109 WV--NCIRC-DGCPTRSGLGIELTQYDPAGSGTT--VGCEQEFCVANSAGGVPPTCPSTS 163
Query: 184 SNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQSKSVDSRISFGCGRVQTGSFLDGA- 240
S C +++ Y DG+ +TGF V D + + Q+ + ++ I+FGCG Q G L +
Sbjct: 164 SPCQFRITY-GDGSTTTGFYVTDFVQYNQVSGNGQTTTSNASITFGCG-AQLGGDLGSSN 221
Query: 241 -APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGTGRISFGDKGSPGQGETPFSL 298
A +G+ G G +S+ S LA + F+ C + G G + G+ P TP
Sbjct: 222 QALDGILGFGQSDSSMLSQLAAARRVRKIFAHCLDTVRGGGIFAIGNVVQPKVKTTPLVP 281
Query: 299 RQTHPTYNITITQVSVGGNAVNFEFS---------AIFDSGTSFTYLNDPAY-TQISETF 348
TH YN+ + +SVGG + S I DSGT+ YL Y T ++ F
Sbjct: 282 NVTH--YNVNLQGISVGGATLQLPTSTFDSGDSKGTIIDSGTTLAYLPREVYRTLLAAVF 339
Query: 349 N 349
+
Sbjct: 340 D 340
>gi|218199944|gb|EEC82371.1| hypothetical protein OsI_26705 [Oryza sativa Indica Group]
Length = 642
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 76/262 (29%), Positives = 126/262 (48%), Gaps = 25/262 (9%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFN--IYSPNTSSTSS 164
T + +G P+ F + +D+GS + ++PC S S +I+ + + P+ SST S
Sbjct: 94 TRLYIGTPSQEFALIVDSGSTVTYVPCATCEQCGNHQSESPNIIEAHDPRFQPDLSSTYS 153
Query: 165 KVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRI 224
V CN + C + S C Y+ +Y ++ + S+G L ED++ K+S+ R
Sbjct: 154 PVKCN-----VDCTCDNERSQCTYERQY-AEMSSSSGVLGEDIMSFG---KESELKPQRA 204
Query: 225 SFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFG 284
FGC +TG A +G+ GLG + S+ L +G+I +SFS+C+G G +
Sbjct: 205 VFGCENTETGDLFSQHA-DGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDVGGGTMV 263
Query: 285 DKGSPGQGETPFSLRQ--THPTYNITITQVSVGGNAV-------NFEFSAIFDSGTSFTY 335
G P + FS P YNI + ++ V G A+ N + + DSGT++ Y
Sbjct: 264 LGGMPAPPDMVFSHSNPVRSPYYNIELKEIHVAGKALRLDPKIFNSKHGTVLDSGTTYAY 323
Query: 336 LNDPAYT----QISETFNSLAK 353
L + A+ ++ NSL K
Sbjct: 324 LPEQAFVAFKDAVTNKVNSLKK 345
>gi|115484503|ref|NP_001065913.1| Os11g0183900 [Oryza sativa Japonica Group]
gi|62954888|gb|AAY23257.1| nucellin-like aspartic protease [Oryza sativa Japonica Group]
gi|77549017|gb|ABA91814.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113644617|dbj|BAF27758.1| Os11g0183900 [Oryza sativa Japonica Group]
gi|222615638|gb|EEE51770.1| hypothetical protein OsJ_33210 [Oryza sativa Japonica Group]
Length = 418
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 90/284 (31%), Positives = 127/284 (44%), Gaps = 47/284 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
+Y +++G PA + + +DTGSDL WL CD C SC + +Y P +
Sbjct: 57 YYVTMNIGDPAKPYFLDVDTGSDLTWLQCDAPCQSC---------NKVPHPLYRPTKNKL 107
Query: 163 SSKVPCNSTLCEL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEK 215
VPC +++C K+C + C YQ++Y +D S G LV D L K
Sbjct: 108 ---VPCANSICTALHSGSSPNKKC-TTQQQCDYQIKY-TDKASSLGVLVMDSFSLPLRNK 162
Query: 216 QSKSVDSRISFGCG-RVQTGSFLDGAAP---NGLFGLGMDKTSVPSILANQGLIPNSFSM 271
+V +SFGCG Q G +GAAP +GL GLG S+ S L QG+ N
Sbjct: 163 S--NVRPSLSFGCGYDQQVGK--NGAAPATTDGLLGLGRGSVSLLSQLKQQGITKNVLGH 218
Query: 272 CFGSDGTGRISFGDKGSPGQGETPFSL-RQTHPTYNITITQVSVGGNAVNFE-------- 322
C + G G + FGD P T S+ R T Y S G + F+
Sbjct: 219 CLSTSGGGFLFFGDDMVPTSRVTWVSMVRSTSGNY------YSPGSATLYFDRRSLSTKP 272
Query: 323 FSAIFDSGTSFTYLN-DPAYTQISETFNSLAKEKRETSTSDLPF 365
+FDSG+++TY + P IS SL+K ++ S LP
Sbjct: 273 MEVVFDSGSTYTYFSAQPYQATISAIKGSLSKSLKQVSDPSLPL 316
>gi|223942467|gb|ACN25317.1| unknown [Zea mays]
gi|413936886|gb|AFW71437.1| pepsin A [Zea mays]
Length = 491
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 104/361 (28%), Positives = 155/361 (42%), Gaps = 57/361 (15%)
Query: 12 VLLILLSCCAGCCFGFGTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRD--RY 69
+L++L + GC G F R P G +G + +AL D R+
Sbjct: 14 LLVLLFALSVGCASATGVF----QVRRKFPRHG--------GRGVAEHLAALRRHDANRH 61
Query: 70 FRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLF 129
RL G A G P DT L+YT + +G P + V +DTGSD+
Sbjct: 62 GRLLGAVDLALGGVGLP------TDT-------GLYYTRIEIGSPPKGYYVQVDTGSDIL 108
Query: 130 WLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC------ELQKQCPSAG 183
W+ +C+ C G + SG I+ Y P S T+ V C C + CPS
Sbjct: 109 WV--NCIRC-DGCPTRSGLGIELTQYDPAGSGTT--VGCEQEFCVANSAGGVPPTCPSTS 163
Query: 184 SNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQSKSVDSRISFGCGRVQTGSFLDGA- 240
S C +++ Y DG+ +TGF V D + + Q+ + ++ I+FGCG Q G L +
Sbjct: 164 SPCQFRITY-GDGSTTTGFYVTDFVQYNQVSGNGQTTTSNASITFGCG-AQLGGDLGSSN 221
Query: 241 -APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGTGRISFGDKGSPGQGETPFSL 298
A +G+ G G +S+ S LA + F+ C + G G + G+ P TP
Sbjct: 222 QALDGILGFGQSDSSMLSQLAAARRVRKIFAHCLDTVRGGGIFAIGNVVQPKVKTTPLVP 281
Query: 299 RQTHPTYNITITQVSVGGNAVNFEFS---------AIFDSGTSFTYLNDPAY-TQISETF 348
TH YN+ + +SVGG + S I DSGT+ YL Y T ++ F
Sbjct: 282 NVTH--YNVNLQGISVGGATLQLPTSTFDSGDSKGTIIDSGTTLAYLPREVYRTLLAAVF 339
Query: 349 N 349
+
Sbjct: 340 D 340
>gi|357137832|ref|XP_003570503.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 564
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 83/275 (30%), Positives = 124/275 (45%), Gaps = 36/275 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
+YT++ VG P + + +DTGSDL W+ CD C +C G + +Y P +
Sbjct: 194 YYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHP---------LYKP---AK 241
Query: 163 SSKVPCNSTLCEL----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
VP LC+ Q C + C Y++ Y +D + S G L +D +H+ +
Sbjct: 242 EKIVPPRDLLCQELQGDQNYCATC-KQCDYEIEY-ADRSSSMGVLAKDDMHMIATNGGRE 299
Query: 219 SVDSRISFGCGRVQTGSFLDGAAP-NGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD- 276
+D FGC Q G L A +G+ GL S+PS LA+QG+I N F C +
Sbjct: 300 KLD--FVFGCAYDQQGQLLTSPAKTDGILGLSSAAISLPSQLASQGIISNVFGHCITKEP 357
Query: 277 -GTGRISFGDKGSPGQGETPFSLR-QTHPTYNITITQVSVGGNAVNFEFSA------IFD 328
G G + GD P G T +R Y+ +V+ G + A IFD
Sbjct: 358 NGGGYMFLGDDYVPRWGMTWAPIRGGPDNLYHTEAQKVNYGDQQLRMHGQAGSSIQVIFD 417
Query: 329 SGTSFTYLNDPAY----TQISETFNSLAKEKRETS 359
SG+S+TYL D Y T I + S ++ +T+
Sbjct: 418 SGSSYTYLPDEIYKKLVTAIKYDYPSFVQDTSDTT 452
>gi|224092220|ref|XP_002309515.1| predicted protein [Populus trichocarpa]
gi|222855491|gb|EEE93038.1| predicted protein [Populus trichocarpa]
Length = 473
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 90/277 (32%), Positives = 128/277 (46%), Gaps = 31/277 (11%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+ V +G P F + DTGSDL W C+ C +D P S++
Sbjct: 133 YAVTVGLGTPKKEFTLIFDTGSDLTWTQCE--PCAKTCYKQKEPRLD-----PTKSTSYK 185
Query: 165 KVPCNSTLCEL-----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
+ C+S C+L + C S C YQV+Y DG+ S GF + L L+ S +
Sbjct: 186 NISCSSAFCKLLDTEGGESCSSP--TCLYQVQY-GDGSYSIGFFATETLTLS-----SSN 237
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTG 279
V FGCG+ +G F GAA GL GLG K S+PS A + S+ + S G
Sbjct: 238 VFKNFLFGCGQQNSGLF-RGAA--GLLGLGRTKLSLPSQTAQKYKKLFSYCLPASSSSKG 294
Query: 280 RISFGDKGSPGQGETPFSLR-QTHPTYNITITQVSVGGNAVNFEFS------AIFDSGTS 332
+SFG + S TP S ++ P Y + IT++SVGGN ++ + S + DSGT
Sbjct: 295 YLSFGGQVSKTVKFTPLSEDFKSTPFYGLDITELSVGGNKLSIDASIFSTSGTVIDSGTV 354
Query: 333 FTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
T L AY+ +S F L + T + F+ CY
Sbjct: 355 ITRLPSTAYSALSSAFQKLMTDYPSTDGYSI-FDTCY 390
>gi|356523724|ref|XP_003530485.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 488
Score = 101 bits (251), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 77/256 (30%), Positives = 121/256 (47%), Gaps = 24/256 (9%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L+Y + +G P ++ + +DTGSD+ W+ C C C + S +D +Y SS+
Sbjct: 82 LYYAKIGIGTPPKNYYLQVDTGSDIMWVNCIQCKEC----PTRSSLGMDLTLYDIKESSS 137
Query: 163 SSKVPCNSTLCE-----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVL--HLATDEK 215
VPC+ C+ L C +A +CPY Y DG+ + G+ V+D++ + +
Sbjct: 138 GKLVPCDQEFCKEINGGLLTGC-TANISCPYLEIY-GDGSSTAGYFVKDIVLYDQVSGDL 195
Query: 216 QSKSVDSRISFGCGRVQTG--SFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
++ S + I FGCG Q+G S + A +G+ G G +S+ S LA+ G + F+ C
Sbjct: 196 KTDSANGSIVFGCGARQSGDLSSSNEEALDGILGFGKANSSMISQLASSGKVKKMFAHCL 255
Query: 274 -GSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQV-------SVGGNAVNFEFSA 325
G +G G + G P TP Q H + N+T QV S +A
Sbjct: 256 NGVNGGGIFAIGHVVQPKVNMTPLLPDQPHYSVNMTAVQVGHTFLSLSTDTSAQGDRKGT 315
Query: 326 IFDSGTSFTYLNDPAY 341
I DSGT+ YL + Y
Sbjct: 316 IIDSGTTLAYLPEGIY 331
>gi|242062640|ref|XP_002452609.1| hypothetical protein SORBIDRAFT_04g028990 [Sorghum bicolor]
gi|241932440|gb|EES05585.1| hypothetical protein SORBIDRAFT_04g028990 [Sorghum bicolor]
Length = 557
Score = 101 bits (251), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 80/256 (31%), Positives = 112/256 (43%), Gaps = 32/256 (12%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
+YT++ VG P + + +DTGSDL W+ CD C +C G + +Y P
Sbjct: 187 YYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHP---------LYKPTKEKI 237
Query: 163 SSKVPCNSTLCEL----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
VP LC+ Q C + C Y++ Y +D + S G L D +HL +
Sbjct: 238 ---VPPRDLLCQELQGNQNYCETC-KQCDYEIEY-ADQSSSMGVLARDDMHLIATNGGRE 292
Query: 219 SVDSRISFGCGRVQTGSFLDGAAP-NGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD- 276
+D FGC Q G L A +G+ GL S+PS LA+ G+I N F C +
Sbjct: 293 KLD--FVFGCAYDQQGQLLSSPAKTDGILGLSNAAISLPSQLASHGIISNIFGHCITREQ 350
Query: 277 -GTGRISFGDKGSPGQGETPFSLRQ-THPTYNITITQVSVGGNAVNFEFSA------IFD 328
G G + GD P G T S+R Y+ V G + A IFD
Sbjct: 351 GGGGYMFLGDDYVPRWGITWTSIRSGPDNLYHTEAHHVKYGDQQLRMREQAGNTVQVIFD 410
Query: 329 SGTSFTYLNDPAYTQI 344
SG+S+TYL D Y +
Sbjct: 411 SGSSYTYLPDEIYENL 426
>gi|326503488|dbj|BAJ86250.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 551
Score = 101 bits (251), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 89/291 (30%), Positives = 135/291 (46%), Gaps = 41/291 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
+YT++ VG P + + +DTGSDL W+ CD C +C G + +Y P +
Sbjct: 191 YYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHP---------LYKP---AK 238
Query: 163 SSKVPCNSTLCEL----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
VP +LC+ Q C + C Y++ Y +D + S G L +D +HL +
Sbjct: 239 EKIVPPRDSLCQELQGDQNYCETC-KQCDYEIEY-ADRSSSMGVLAKDDMHLIATNGGRE 296
Query: 219 SVDSRISFGCGRVQTGSFLDGAAP-NGLFGLGMDKTSVPSILANQGLIPNSFSMCF--GS 275
+D FGC Q G L A +G+ GL S+PS LA++G+I N F C +
Sbjct: 297 KLD--FVFGCAYDQQGQLLSSPAKTDGILGLSSAAISLPSQLASKGIISNVFGHCITRET 354
Query: 276 DGTGRISFGDKGSPGQGETPFSLR-QTHPTYNITITQVSVG------GNAVNFEFSAIFD 328
+G G + GD P G T +R Y+ +V+ G GN+V IFD
Sbjct: 355 NGGGYMFLGDDYVPRWGMTWAPIRGGPDNLYHTEAQKVNYGDQELHAGNSVQ----VIFD 410
Query: 329 SGTSFTYLNDPAYTQ----ISETFNSLAKEKRETSTSDLPFEYCYVLRSFL 375
SG+S+TYL + Y I E S ++ +T T L ++ + +RSF
Sbjct: 411 SGSSYTYLPEEMYKNLIDAIKEDSPSFVQDSSDT-TLPLCWKADFSVRSFF 460
>gi|18390865|ref|NP_563808.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|11993877|gb|AAG42922.1|AF329505_1 unknown protein [Arabidopsis thaliana]
gi|20260142|gb|AAM12969.1| unknown protein [Arabidopsis thaliana]
gi|22136092|gb|AAM91124.1| unknown protein [Arabidopsis thaliana]
gi|332190140|gb|AEE28261.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 492
Score = 101 bits (251), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 88/271 (32%), Positives = 130/271 (47%), Gaps = 24/271 (8%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
L+YT V +G P F V +DTGSD+ W+ C SC +G +S I + + P SS++
Sbjct: 83 LYYTKVKLGTPPREFNVQIDTGSDVLWV--SCTSC-NGCPKTSELQIQLSFFDPGVSSSA 139
Query: 164 SKVPCNSTLCELQKQCPSAGSN---CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
S V C+ C Q S S C Y +Y DG+ ++G+ + D + T + ++
Sbjct: 140 SLVSCSDRRCYSNFQTESGCSPNNLCSYSFKY-GDGSGTSGYYISDFMSFDTVITSTLAI 198
Query: 221 DSR--ISFGCGRVQTGSFLD-GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD- 276
+S FGC +Q+G A +G+FGLG SV S LA QGL P FS C D
Sbjct: 199 NSSAPFVFGCSNLQSGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCLKGDK 258
Query: 277 -GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA---------I 326
G G + G P TP L + P YN+ + ++V G + + S I
Sbjct: 259 SGGGIMVLGQIKRPDTVYTP--LVPSQPHYNVNLQSIAVNGQILPIDPSVFTIATGDGTI 316
Query: 327 FDSGTSFTYLNDPAYTQ-ISETFNSLAKEKR 356
D+GT+ YL D AY+ I N++++ R
Sbjct: 317 IDTGTTLAYLPDEAYSPFIQAVANAVSQYGR 347
>gi|225458774|ref|XP_002283258.1| PREDICTED: aspartic proteinase-like protein 2 [Vitis vinifera]
gi|302142232|emb|CBI19435.3| unnamed protein product [Vitis vinifera]
Length = 659
Score = 100 bits (250), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 75/246 (30%), Positives = 118/246 (47%), Gaps = 29/246 (11%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
T + +G P F + +DTGS + ++PC DC C G+ D + P+ SST
Sbjct: 90 TRLWIGTPPQEFALIVDTGSTVTYVPCSDCEHC--------GKHQDPR-FQPDESSTYHP 140
Query: 166 VPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
V CN + C G NC Y+ RY ++ + S+G L ED++ QS+ V R
Sbjct: 141 VKCN-----MDCNCDHDGVNCVYERRY-AEMSSSSGVLGEDIISFGN---QSEVVPQRAV 191
Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGD 285
FGC V+TG A +G+ GLG + S+ L ++ +I +SFS+C+G G +
Sbjct: 192 FGCENVETGDLYSQRA-DGIMGLGRGQLSIVDQLVDKNVINDSFSLCYGGMHVGGGAMVL 250
Query: 286 KGSPGQGETPFSLRQTH--PTYNITITQVSVGGNAVNF-------EFSAIFDSGTSFTYL 336
G P + FS + P YNI + ++ V G + + + DSGT++ YL
Sbjct: 251 GGIPPPPDMVFSRSDPYRSPYYNIELKEIHVAGKPLKLSPSTFDRKHGTVLDSGTTYAYL 310
Query: 337 NDPAYT 342
+ A+
Sbjct: 311 PEEAFV 316
>gi|356537015|ref|XP_003537027.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 476
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 83/284 (29%), Positives = 132/284 (46%), Gaps = 32/284 (11%)
Query: 112 GQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNS 170
G F V +DTGSD+ W+ C+ C +C SS I+ N + SST++ +PC+
Sbjct: 75 GXXXXXFNVQIDTGSDILWVNCNTCSNCPQ----SSQLGIELNFFDTVGSSTAALIPCSD 130
Query: 171 TLCE-----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR-- 223
+C +C + C Y +Y DG+ ++G+ V D ++ Q +V+S
Sbjct: 131 LICTSGVQGAAAECSPRVNQCSYTFQY-GDGSGTSGYYVSDAMYFNLIMGQPPAVNSTAT 189
Query: 224 ISFGCGRVQTGSFLD-GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT--GR 280
I FGC Q+G A +G+FG G SV S L++QG+ P FS C DG G
Sbjct: 190 IVFGCSISQSGDLTKTDKAVDGIFGFGPGPLSVVSQLSSQGITPKVFSHCLKGDGNGGGI 249
Query: 281 ISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGG-----NAVNFEFS-----AIFDSG 330
+ G+ P +P L + P YN+ + ++V G N F S I D G
Sbjct: 250 LVLGEILEPSIVYSP--LVPSQPHYNLNLQSIAVNGQPLPINPAVFSISNNRGGTIVDCG 307
Query: 331 TSFTYLNDPAYTQISETFN-SLAKEKRETSTSDLPFEYCYVLRS 373
T+ YL AY + N ++++ R+T++ CY++ +
Sbjct: 308 TTLAYLIQEAYDPLVTAINTAVSQSARQTNSKG---NQCYLVST 348
>gi|218185380|gb|EEC67807.1| hypothetical protein OsI_35373 [Oryza sativa Indica Group]
Length = 418
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 89/284 (31%), Positives = 126/284 (44%), Gaps = 47/284 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
+Y +++G PA + + +DTGSDL WL CD C SC + +Y P +
Sbjct: 57 YYVTMNIGDPAKPYFLDVDTGSDLTWLQCDAPCQSC---------NKVPHPLYRPTKNKL 107
Query: 163 SSKVPCNSTLCEL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEK 215
VPC +++C K+C + C YQ++Y +D S G LV D L K
Sbjct: 108 ---VPCANSICTALHSGSSPNKKC-TTQQQCDYQIKY-TDKASSLGVLVTDSFSLPLRNK 162
Query: 216 QSKSVDSRISFGCG-RVQTGSFLDGAAP---NGLFGLGMDKTSVPSILANQGLIPNSFSM 271
+V +SFGCG Q G +GAAP +GL GLG S+ S L QG+ N
Sbjct: 163 S--NVRPSLSFGCGYDQQVGK--NGAAPATTDGLLGLGRGSVSLLSQLKQQGITKNVLGH 218
Query: 272 CFGSDGTGRISFGDKGSPGQGETPFSL-RQTHPTYNITITQVSVGGNAVNFE-------- 322
C + G G + FGD P T + R T Y S G + F+
Sbjct: 219 CLSTSGGGFLFFGDDMVPTSRVTWVPMVRSTSGNY------YSPGSATLYFDRRSLSTKP 272
Query: 323 FSAIFDSGTSFTYLN-DPAYTQISETFNSLAKEKRETSTSDLPF 365
+FDSG+++TY + P IS SL+K ++ S LP
Sbjct: 273 MEVVFDSGSTYTYFSAQPYQATISAIKGSLSKSLKQVSDPSLPL 316
>gi|356568907|ref|XP_003552649.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 490
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 76/256 (29%), Positives = 122/256 (47%), Gaps = 24/256 (9%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L+Y + +G P ++ + +DTGSD+ W+ C C C + S +D +Y SS+
Sbjct: 84 LYYAKIGIGTPPKNYYLQVDTGSDIMWVNCIQCKEC----PTRSNLGMDLTLYDIKESSS 139
Query: 163 SSKVPCNSTLCE-----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVL--HLATDEK 215
VPC+ C+ L C +A +CPY Y DG+ + G+ V+D++ + +
Sbjct: 140 GKFVPCDQEFCKEINGGLLTGC-TANISCPYLEIY-GDGSSTAGYFVKDIVLYDQVSGDL 197
Query: 216 QSKSVDSRISFGCGRVQTG--SFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
++ S + I FGCG Q+G S + A G+ G G +S+ S LA+ G + F+ C
Sbjct: 198 KTDSANGSIVFGCGARQSGDLSSSNEEALGGILGFGKANSSMISQLASSGKVKKMFAHCL 257
Query: 274 -GSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS-------A 325
G +G G + G P TP Q H + N+T QV +++ + S
Sbjct: 258 NGVNGGGIFAIGHVVQPKVNMTPLLPDQPHYSVNMTAVQVGHAFLSLSTDTSTQGDRKGT 317
Query: 326 IFDSGTSFTYLNDPAY 341
I DSGT+ YL + Y
Sbjct: 318 IIDSGTTLAYLPEGIY 333
>gi|242067689|ref|XP_002449121.1| hypothetical protein SORBIDRAFT_05g005400 [Sorghum bicolor]
gi|241934964|gb|EES08109.1| hypothetical protein SORBIDRAFT_05g005400 [Sorghum bicolor]
Length = 358
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 82/281 (29%), Positives = 123/281 (43%), Gaps = 43/281 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
+Y +++G PA + + +DTGSDL WL CD C SC + +Y P +
Sbjct: 54 YYVTMNIGNPAKPYFLDVDTGSDLTWLQCDAPCRSC---------NKVPHPLYRP---TA 101
Query: 163 SSKVPCNSTLCEL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEK 215
+S VPC + LC +CPS C YQ++Y +D S G L+ D L
Sbjct: 102 NSLVPCANALCTALHSGHGSNNKCPSP-KQCDYQIKY-TDSASSQGVLINDNFSLPM--- 156
Query: 216 QSKSVDSRISFGCGRVQT--GSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
+S ++ ++FGCG Q + AA +G+ GLG S+ S L QG+ N C
Sbjct: 157 RSSNIRPGLTFGCGYDQQVGKNGAVQAATDGMLGLGRGSVSLVSQLKQQGITKNVLGHCL 216
Query: 274 GSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE--------FSA 325
++G G + FGD P S P I+ S G + F+
Sbjct: 217 STNGGGFLFFGDD------IVPTSRVTWVPMAKISGNYYSPGSGTLYFDRRSLGVKPMEV 270
Query: 326 IFDSGTSFTYLNDPAYTQISETFNS-LAKEKRETSTSDLPF 365
+FDSG+++TY Y + S L+K ++ S LP
Sbjct: 271 VFDSGSTYTYFTAQPYQAVVSALKSGLSKSLKQVSDPSLPL 311
>gi|449459186|ref|XP_004147327.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 418
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 84/282 (29%), Positives = 122/282 (43%), Gaps = 28/282 (9%)
Query: 102 GFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNT 159
GF + T + VGQP + + DTGSDL WL CD C C L+ +Y P
Sbjct: 55 GFYNVT-LYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHP---------LYQP-- 102
Query: 160 SSTSSKVPCNSTLC-----ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
++ VPC LC + +C + C Y+V Y +DG S G LV DV L +
Sbjct: 103 --SNDLVPCKDPLCMSLHSSMDHRCENP-DQCDYEVEY-ADGGSSLGVLVRDVFPL--NL 156
Query: 215 KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG 274
+ R++ GCG Q +G+ GLG S+ S L NQG++ N CF
Sbjct: 157 TNGDPIRPRLALGCGYDQDPGSSSYHPMDGILGLGRGAVSIVSQLHNQGIVRNVVGHCFN 216
Query: 275 SDGTGRISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFE-FSAIFDSGTS 332
S G G + FGD + + +P Y+ ++ G + +FDSG+S
Sbjct: 217 SKGGGYLFFGDGIYDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSS 276
Query: 333 FTYLNDPAYTQISETFN-SLAKEKRETSTSDLPFEYCYVLRS 373
+TY N AY ++ N LA + + D C+ R
Sbjct: 277 YTYFNAQAYQVLTSLLNRELAGKPLREAMDDDTLPLCWRGRK 318
>gi|449442281|ref|XP_004138910.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449506266|ref|XP_004162699.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 482
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 89/284 (31%), Positives = 131/284 (46%), Gaps = 27/284 (9%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L++ + +G P + V +DTGSD+ W+ C C +C S I+ ++YSP++SST
Sbjct: 73 LYFAKIGLGTPVQDYYVQVDTGSDILWVNCAGCTNC----PKKSDLGIELSLYSPSSSST 128
Query: 163 SSKVPCNSTLCELQKQCPSAGSN----CPYQVRYLSDGTMSTGFLVED--VLHLATDEKQ 216
S++V CN C P G C Y+V Y DG+ + G+ V D VL T Q
Sbjct: 129 SNRVTCNQDFCTSTYDGPIPGCTPELLCEYRVAY-GDGSSTAGYFVRDHVVLDRVTGNFQ 187
Query: 217 SKSVDSRISFGCGRVQTGSF-LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
+ S + I FGCG Q+G AA +G+ G G +S+ S LA+ G + F+ C +
Sbjct: 188 TTSTNGSIVFGCGAQQSGQLGATSAALDGILGFGQANSSMISQLASSGKVKRVFAHCLDN 247
Query: 276 -DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVN---------FEFSA 325
+G G + G+ P TP +Q H YN+ + + V +N
Sbjct: 248 INGGGIFAIGEVVQPKVRTTPLVPQQAH--YNVFMKAIEVDNEVLNLPTDVFDTDLRKGT 305
Query: 326 IFDSGTSFTYLNDPAYTQ-ISETFNSLAKEKRETSTSDLP-FEY 367
I DSGT+ Y D Y IS+ F + K T FEY
Sbjct: 306 IIDSGTTLAYFPDVIYEPLISKIFARQSTLKLHTVEEQFTCFEY 349
>gi|326514838|dbj|BAJ99780.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 430
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 84/278 (30%), Positives = 125/278 (44%), Gaps = 39/278 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
+Y +++G PA + + +DTGSDL WL CD C SC + Y P +
Sbjct: 73 YYVTMNIGDPAKPYFLDVDTGSDLTWLQCDAPCQSC---------NKVPHPWYKPTKNKI 123
Query: 163 SSKVPCNSTLCE---LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
VPC ++LC K+C + C YQ++Y +D S G L+ D L+ + S +
Sbjct: 124 ---VPCAASLCTSLTPNKKC-AVPQQCDYQIKY-TDKASSLGVLIADNFTLSL--RNSST 176
Query: 220 VDSRISFGCGRVQT--GSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDG 277
V + ++FGCG Q + AA +GL GLG S+ S L QG+ N CF ++G
Sbjct: 177 VRANLTFGCGYDQQVGKNGAVQAATDGLLGLGKGAVSLLSQLKQQGVTKNVLGHCFSTNG 236
Query: 278 TGRISFGDKGSPGQGETPFSL-RQTHPTYNITITQVSVGGNAVNFE--------FSAIFD 328
G + FGD P T + R T Y S G + F+ +FD
Sbjct: 237 GGFLFFGDDIVPTSRVTWVPMARTTSGNY------YSPGSGTLYFDRRSLGMKPMEVVFD 290
Query: 329 SGTSFTYL-NDPAYTQISETFNSLAKEKRETSTSDLPF 365
SG+++ Y +P +S L+K +E S LP
Sbjct: 291 SGSTYAYFAAEPYQATVSALKAGLSKSLKEVSDVSLPL 328
>gi|449446233|ref|XP_004140876.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 498
Score = 98.6 bits (244), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 90/312 (28%), Positives = 142/312 (45%), Gaps = 39/312 (12%)
Query: 55 GSFAYYSALAHRDRYFR-LRGRGLAAQGNDKTPLTFSAGND-----TYRLNSLGFLHYTN 108
G F+ A R+R L+ ++ Q L F AG D + R +++G L+Y
Sbjct: 38 GIFSVKYKYAGRERSLSTLKAHDISRQ------LRFLAGIDIPLGGSGRPDAVG-LYYAK 90
Query: 109 VSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
+ +G P+ + V +DTGSD+ W+ C C C SS G ++ Y S+T V
Sbjct: 91 IGIGTPSKDYYVQVDTGSDIVWVNCIQCRECPR--TSSLG--MELTPYDLEESTTGKLVS 146
Query: 168 CNSTLCELQKQCPSAG----SNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQSKSVD 221
C+ C P +G +CPY ++ DG+ + G+ V+D + + + ++ + +
Sbjct: 147 CDEQFCLEVNGGPLSGCTTNMSCPY-LQIYGDGSSTAGYFVKDYVQYNRVSGDLETTAAN 205
Query: 222 SRISFGCGRVQTGSFLDGA--APNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-GSDGT 278
I FGCG Q+G A +G+ G G +S+ S LA+ + F+ C G++G
Sbjct: 206 GSIKFGCGARQSGDLGSSGEEALDGILGFGKSNSSIISQLASTRKVKKMFAHCLDGTNGG 265
Query: 279 GRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE---FSA------IFDS 329
G + G P TP Q H YN+ +T V VG +N F A I DS
Sbjct: 266 GIFAMGHVVQPKVNMTPLVPNQPH--YNVNMTGVQVGHIILNISADVFEAGDRKGTIIDS 323
Query: 330 GTSFTYLNDPAY 341
GT+ YL + Y
Sbjct: 324 GTTLAYLPELIY 335
>gi|449529533|ref|XP_004171754.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 437
Score = 98.2 bits (243), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 82/275 (29%), Positives = 125/275 (45%), Gaps = 18/275 (6%)
Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTS 160
LG+ + ++++G+ +F +D+GSDL W+ CD C H +Y PN +
Sbjct: 52 LGY-YSVSINIGKGDEAFEFDIDSGSDLTWVQCD-APCTHCTKPRE------QLYKPNNN 103
Query: 161 STSSKVP-CNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
+ + P C S C SA C Y++ Y G+ S G LV D H+
Sbjct: 104 ALNCFEPLCTSLHPITNHHCKSADDQCQYEIEYADHGS-SLGVLVND--HVPLKLTNGSL 160
Query: 220 VDSRISFGCGRVQTGSFLDGAAPN-GLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT 278
RI+FGCG S D + P G+ GLG + S S L++ G++ N C +G
Sbjct: 161 AAPRIAFGCGYDHKYSVPDSSPPTAGVLGLGNGEVSFISQLSSMGVVRNVVGHCLSDEG- 219
Query: 279 GRISFGDKGSPGQGETPFSLRQTH--PTYNITITQVSVGGNAVNF-EFSAIFDSGTSFTY 335
G + FGD+ P G T S+ Y+ +V GG A + + +FDSG+S+TY
Sbjct: 220 GFLFFGDEFVPSSGVTWTSMSHESIGSYYSSGPAEVYFGGKATGIKDLTLVFDSGSSYTY 279
Query: 336 LNDPAYTQI-SETFNSLAKEKRETSTSDLPFEYCY 369
N AY I + N+L + E + D C+
Sbjct: 280 FNSQAYNSILALVKNNLRGKPLEDAPEDKSLPVCW 314
>gi|449518783|ref|XP_004166415.1| PREDICTED: aspartic proteinase-like protein 2-like, partial
[Cucumis sativus]
Length = 420
Score = 98.2 bits (243), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 90/312 (28%), Positives = 142/312 (45%), Gaps = 39/312 (12%)
Query: 55 GSFAYYSALAHRDRYFR-LRGRGLAAQGNDKTPLTFSAGND-----TYRLNSLGFLHYTN 108
G F+ A R+R L+ ++ Q L F AG D + R +++G L+Y
Sbjct: 38 GVFSVKYKYAGRERSLSTLKAHDISRQ------LRFLAGVDIPLGGSGRPDAVG-LYYAK 90
Query: 109 VSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
+ +G P+ + V +DTGSD+ W+ C C C SS G ++ Y S+T V
Sbjct: 91 IGIGTPSKDYYVQVDTGSDIVWVNCIQCRECPR--TSSLG--MELTPYDLEESTTGKLVS 146
Query: 168 CNSTLCELQKQCPSAG----SNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQSKSVD 221
C+ C P +G +CPY ++ DG+ + G+ V+D + + + ++ + +
Sbjct: 147 CDEQFCLEVNGGPLSGCTTNMSCPY-LQIYGDGSSTAGYFVKDYVQYNRVSGDLETTAAN 205
Query: 222 SRISFGCGRVQTGSFLDGA--APNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-GSDGT 278
I FGCG Q+G A +G+ G G +S+ S LA+ + F+ C G++G
Sbjct: 206 GSIKFGCGARQSGDLGSSGEEALDGILGFGKSNSSIISQLASTRKVKKMFAHCLDGTNGG 265
Query: 279 GRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE---FSA------IFDS 329
G + G P TP Q H YN+ +T V VG +N F A I DS
Sbjct: 266 GIFAMGHVVQPKVNMTPLVPNQPH--YNVNMTGVQVGHIILNISADVFEAGDRKGTIIDS 323
Query: 330 GTSFTYLNDPAY 341
GT+ YL + Y
Sbjct: 324 GTTLAYLPELIY 335
>gi|168025534|ref|XP_001765289.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162683608|gb|EDQ70017.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 372
Score = 98.2 bits (243), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 92/306 (30%), Positives = 143/306 (46%), Gaps = 49/306 (16%)
Query: 93 NDTYRL-NSLGFLHY-TNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQV 149
N++Y S G+ + + +G P +V +DTGSDL W+ + C +C +
Sbjct: 11 NESYEFPESAGYGEFLVPIYLGTPPQKAVVIIDTGSDLTWIQSEPCRACFEQADP----- 65
Query: 150 IDFNIYSPNTSSTSSKVPCNSTLCE--LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDV 207
I+ P+ SST +K+ C+S+ C L Q SA +NC Y Y DG+++ G+ ++
Sbjct: 66 ----IFDPSKSSTYNKIACSSSACADLLGTQTCSAAANCIYAYGY-GDGSVTRGYFSKET 120
Query: 208 LHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPN 267
+ ATD + + FG TG+F D G+ GLG S+PS L + ++ N
Sbjct: 121 I-TATD-----TAGEEVKFGASVYNTGTFGDTGG-EGILGLGQGPVSMPSQLGS--VLGN 171
Query: 268 SFSMCF------GSDGTGRISFGDKGSPGQGE---TPFSLRQTHPT-YNITITQVSVGGN 317
FS C GS+ T + FGD P GE TP HPT Y I + +SVGG+
Sbjct: 172 KFSYCLVDWLSAGSE-TSTMYFGDAAVP-SGEVQYTPIVPNADHPTYYYIAVQGISVGGS 229
Query: 318 AVNFEFSA-----------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFE 366
++ + S I DSGT+ TYL + + + S + + T+TS +
Sbjct: 230 LLDIDQSVYEIDSGGSGGTIIDSGTTITYLQQEVFNALVAAYTS--QVRYPTTTSATGLD 287
Query: 367 YCYVLR 372
C+ R
Sbjct: 288 LCFNTR 293
>gi|255637574|gb|ACU19113.1| unknown [Glycine max]
Length = 290
Score = 98.2 bits (243), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 76/221 (34%), Positives = 112/221 (50%), Gaps = 16/221 (7%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
L+YT V +G P V +DTGSD+ W+ C SC +G +SG I N + P +SSTS
Sbjct: 76 LYYTKVKLGTPPRELYVQIDTGSDVLWVSCG--SC-NGCPQTSGLQIQLNYFDPGSSSTS 132
Query: 164 SKVPCNSTLCELQKQ-----CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
S + C C Q C + C Y +Y DG+ ++G+ V D++H A+ + +
Sbjct: 133 SLISCLDRRCRSGVQTSDASCSGRNNQCTYTFQY-GDGSGTSGYYVSDLMHFASIFEGTL 191
Query: 219 SVDSRIS--FGCGRVQTGSFLDGA-APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
+ +S S FGC +QTG A +G+FG G SV S L++QG+ P FS C
Sbjct: 192 TTNSSASVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSSQGIAPRVFSHCLKG 251
Query: 276 D--GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSV 314
D G G + G+ P +P L + P YN+ + +SV
Sbjct: 252 DNSGGGVLVLGEIVEPNIVYSP--LVPSQPHYNLNLQSISV 290
>gi|297805186|ref|XP_002870477.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316313|gb|EFH46736.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 287
Score = 98.2 bits (243), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 68/222 (30%), Positives = 106/222 (47%), Gaps = 18/222 (8%)
Query: 99 NSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSP 157
N + ++YT + +G P F V +DTGSD+ W+ C CV C + + + P
Sbjct: 76 NPISRIYYTTLQIGTPPREFNVVIDTGSDVLWVSCISCVGC---------PLQNVTFFDP 126
Query: 158 NTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
SS++ K+ C+ C S S Y+V Y SDG+ ++G+ + D++ T +
Sbjct: 127 GASSSAVKLACSDKRCFSDLHKKSGCSPLEYKVEY-SDGSFTSGYYISDLISFETVMSSN 185
Query: 218 KSVDSR--ISFGCGRVQTGSF-LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF- 273
+V S FGC + G L + +G+ GLG + V S L++Q L P FS+C
Sbjct: 186 LTVKSSAPFVFGCSNLHAGLISLPETSIHGIVGLGKGRLLVVSQLSSQRLAPEVFSLCLS 245
Query: 274 -GSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSV 314
G +G G I G+ P TP QTH YN+ + +V
Sbjct: 246 GGQEGGGVIILGENRLPNTVYTPLVRSQTH--YNVNLKTFAV 285
>gi|115473125|ref|NP_001060161.1| Os07g0592200 [Oryza sativa Japonica Group]
gi|29027762|dbj|BAC65898.1| putative CND41, chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|113611697|dbj|BAF22075.1| Os07g0592200 [Oryza sativa Japonica Group]
Length = 631
Score = 97.8 bits (242), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 75/261 (28%), Positives = 124/261 (47%), Gaps = 33/261 (12%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
T + +G P+ F + +D+GS + ++PC C C + + + P+ SST S
Sbjct: 93 TRLYIGTPSQEFALIVDSGSTVTYVPCATCEQCGNHQDPR---------FQPDLSSTYSP 143
Query: 166 VPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
V CN + C + S C Y+ +Y ++ + S+G L ED++ K+S+ R
Sbjct: 144 VKCN-----VDCTCDNERSQCTYERQY-AEMSSSSGVLGEDIMSFG---KESELKPQRAV 194
Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGD 285
FGC +TG A +G+ GLG + S+ L +G+I +SFS+C+G G +
Sbjct: 195 FGCENTETGDLFSQHA-DGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDVGGGTMVL 253
Query: 286 KGSPGQGETPFSLRQ--THPTYNITITQVSVGGNAV-------NFEFSAIFDSGTSFTYL 336
G P + FS P YNI + ++ V G A+ N + + DSGT++ YL
Sbjct: 254 GGMPAPPDMVFSHSNPVRSPYYNIELKEIHVAGKALRLDPKIFNSKHGTVLDSGTTYAYL 313
Query: 337 NDPAYT----QISETFNSLAK 353
+ A+ ++ NSL K
Sbjct: 314 PEQAFVAFKDAVTNKVNSLKK 334
>gi|357160697|ref|XP_003578847.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 421
Score = 97.8 bits (242), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 82/276 (29%), Positives = 115/276 (41%), Gaps = 29/276 (10%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
L+Y +S+G P + + +DTGSDL WL CD CVSC + +Y P +
Sbjct: 57 LYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSC---------NKVPHPLYRPTKNK 107
Query: 162 TSSKVPCNSTLCEL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
VPC LC + +C S C Y+++Y G+ S G L+ D A
Sbjct: 108 I---VPCVDQLCSSLHGGLSGKHKCDSPKQQCDYEIKYADQGS-SLGVLLTD--SFAVRL 161
Query: 215 KQSKSVDSRISFGCG-RVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
S V ++FGCG Q GS + A +G+ GLG S+ S L G+ N C
Sbjct: 162 ANSSIVRPSLAFGCGYDQQVGSSTEVAPTDGVLGLGSGSISLLSQLKQHGITKNVVGHCL 221
Query: 274 GSDGTGRISFGDKGSPGQGET--PFSLRQTHPTYNITITQVSVGGNAVNFE-FSAIFDSG 330
G G + FGD P T P Y+ + GG ++ + DSG
Sbjct: 222 SIRGGGFLFFGDNLVPYSRATWVPMVRSAFKNYYSPGTASLYFGGRSLGVRPMEVVLDSG 281
Query: 331 TSFTYLNDPAYTQISETFNS-LAKEKRETSTSDLPF 365
+SFTY Y + S L+K +E LP
Sbjct: 282 SSFTYFGAQPYQALVTALKSDLSKTLKEVFDPSLPL 317
>gi|225438908|ref|XP_002279194.1| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
Length = 634
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 78/259 (30%), Positives = 122/259 (47%), Gaps = 33/259 (12%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
T + +G P +F + +DTGS L ++PC C C G+ D N + P+ SST
Sbjct: 94 TRIWIGTPPQTFALIVDTGSTLTYVPCSTCEQC--------GKHQDPN-FQPDWSSTYQP 144
Query: 166 VPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
+ C+ ++ C S +C Y +Y ++ + S+G L ED++ KQS+ R
Sbjct: 145 LKCS-----MECTCDSEMMHCVYDRQY-AEMSSSSGVLGEDIVSFG---KQSELKPQRTV 195
Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISF 283
FGC V+TG A +G+ GLG S+ L +G+I NSFS+C+G G G +
Sbjct: 196 FGCENVETGDIYSQRA-DGIMGLGRGDLSIVDQLVEKGVIGNSFSLCYGGMDVGGGAMVL 254
Query: 284 GDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-------EFSAIFDSGTSFTYL 336
G P S YNI + ++ + G + ++ I DSGT++ YL
Sbjct: 255 GGISPPAGMVFTHSDPARSAYYNIDLKEIHIAGKQLPINPMVFDGKYGTILDSGTTYAYL 314
Query: 337 NDPAYT----QISETFNSL 351
+PA+ I + NSL
Sbjct: 315 PEPAFKAFKDAIMKELNSL 333
>gi|296087361|emb|CBI33735.3| unnamed protein product [Vitis vinifera]
Length = 633
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 78/259 (30%), Positives = 122/259 (47%), Gaps = 33/259 (12%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
T + +G P +F + +DTGS L ++PC C C G+ D N + P+ SST
Sbjct: 94 TRIWIGTPPQTFALIVDTGSTLTYVPCSTCEQC--------GKHQDPN-FQPDWSSTYQP 144
Query: 166 VPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
+ C+ ++ C S +C Y +Y ++ + S+G L ED++ KQS+ R
Sbjct: 145 LKCS-----MECTCDSEMMHCVYDRQY-AEMSSSSGVLGEDIVSFG---KQSELKPQRTV 195
Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISF 283
FGC V+TG A +G+ GLG S+ L +G+I NSFS+C+G G G +
Sbjct: 196 FGCENVETGDIYSQRA-DGIMGLGRGDLSIVDQLVEKGVIGNSFSLCYGGMDVGGGAMVL 254
Query: 284 GDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-------EFSAIFDSGTSFTYL 336
G P S YNI + ++ + G + ++ I DSGT++ YL
Sbjct: 255 GGISPPAGMVFTHSDPARSAYYNIDLKEIHIAGKQLPINPMVFDGKYGTILDSGTTYAYL 314
Query: 337 NDPAYT----QISETFNSL 351
+PA+ I + NSL
Sbjct: 315 PEPAFKAFKDAIMKELNSL 333
>gi|224066811|ref|XP_002302227.1| predicted protein [Populus trichocarpa]
gi|222843953|gb|EEE81500.1| predicted protein [Populus trichocarpa]
Length = 422
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 84/273 (30%), Positives = 120/273 (43%), Gaps = 32/273 (11%)
Query: 105 HYTNV-SVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
HY+ + ++G P +F + +DTGSDL W+ CD C C L+ +Y P
Sbjct: 67 HYSVILNIGNPPKAFDLDIDTGSDLTWVQCDAPCKGCTKPLDK---------LYKPK--- 114
Query: 162 TSSKVPCNSTLCEL--QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
+++VPC S+LC+ C C Y+V Y G+ S G L+ D L +
Sbjct: 115 -NNRVPCASSLCQAIQNNNCDIPTEQCDYEVEYADLGS-SLGVLLSDYFPLRLNN--GSL 170
Query: 220 VDSRISFGCGRVQTGSFLDGAAP---NGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD 276
+ RI+FGCG Q +L +P G+ GLG K S+ S L G+ N CF
Sbjct: 171 LQPRIAFGCGYDQ--KYLGPHSPPDTAGILGLGRGKASILSQLRTLGITQNVVGHCFSRV 228
Query: 277 GTGRISFGDKGSPGQG--ETPFSLRQTHPTYNITITQVSVGGNAVNFE-FSAIFDSGTSF 333
G + FGD P G TP + Y+ ++ GG + IFDSG+S+
Sbjct: 229 TGGFLFFGDHLLPPSGITWTPMLRSSSDTLYSSGPAELLFGGKPTGIKGLQLIFDSGSSY 288
Query: 334 TYLNDPAYTQISETFNSLAKEKRETSTSDLPFE 366
TY N Y I N + K+ D P E
Sbjct: 289 TYFNAQVYQSI---LNLVRKDLSGMPLKDAPEE 318
>gi|356500374|ref|XP_003519007.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase Asp1-like
[Glycine max]
Length = 454
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 90/309 (29%), Positives = 134/309 (43%), Gaps = 38/309 (12%)
Query: 78 AAQGNDKTPLTFSAGNDTYRLNSLGFL----------HYT-NVSVGQPALSFIVALDTGS 126
A N K P T + N+ +RL+S HYT ++++G P + + +D+GS
Sbjct: 26 AQPRNAKKPKTPYSDNNHHRLSSSAVFKLQGNVYPLGHYTVSLNIGYPPKLYDLDIDSGS 85
Query: 127 DLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE-----LQKQC 179
DL W+ CD C C + +Y PN + V C LC + C
Sbjct: 86 DLTWVQCDAPCKGCTKPRD---------QLYKPN----HNLVQCVDQLCSEVHLSMAYNC 132
Query: 180 PSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDG 239
PS C Y+V Y G+ S G LV D ++ V R++FGCG Q S +
Sbjct: 133 PSPDDPCDYEVEYADHGS-SLGVLVRD--YIPFQFTNGSVVRPRVAFGCGYDQKYSGSNS 189
Query: 240 A-APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSL 298
A +G+ GLG + S+ S L + GLI N C + G G + FGD P G S+
Sbjct: 190 PPATSGVLGLGNGRASILSQLHSLGLIRNVVGHCLSAQGGGFLFFGDDFIPSSGIVWTSM 249
Query: 299 RQTHPTYNITI--TQVSVGGNAVNFE-FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEK 355
+ + + ++ G A + IFDSG+S+TY N AY + + K K
Sbjct: 250 LSSSSEKHYSSGPAELVFNGKATAVKGLELIFDSGSSYTYFNSQAYQAVVDLVTKDLKGK 309
Query: 356 RETSTSDLP 364
+ +D P
Sbjct: 310 QLKRATDDP 318
>gi|147819672|emb|CAN76394.1| hypothetical protein VITISV_020864 [Vitis vinifera]
Length = 507
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 77/279 (27%), Positives = 123/279 (44%), Gaps = 34/279 (12%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L++ + +G P+ + V +DTGSD+ W+ C C C + S +D +Y S+T
Sbjct: 77 LYFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRC----PTKSDLGVDLTLYDMKASTT 132
Query: 163 SSKVPCNSTLCEL-QKQCP--SAGSNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQS 217
S V C+ C L P G C Y V Y DG+ +TG+ V+D + + Q+
Sbjct: 133 SDAVGCDDNFCSLYDGPLPGCKPGLQCLYSVLY-GDGSSTTGYFVQDFVQYNRISGNFQT 191
Query: 218 KSVDSRISFGCGRVQTGSFLDGA-APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS- 275
+ + FGCG Q+G + A +G+ G G +S+ S LA+ G + FS C +
Sbjct: 192 TPTNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCLDNV 251
Query: 276 DGTGRISFGDKGSPGQGETPFSLRQ---------THPTYNITITQVSVGGNAVNFEFSA- 325
DG G + G+ P + F L + YN+ + ++ VGG+ ++ A
Sbjct: 252 DGGGIFAIGEVVEP---KVRFLLMNSVMIVVLFLSRAHYNVVMKEIEVGGDPLDVPSDAF 308
Query: 326 --------IFDSGTSFTYLNDPAYTQISETFNSLAKEKR 356
I DSGT+ Y Y + E S + R
Sbjct: 309 ESGDRKGTIIDSGTTLAYFPQEVYVPLIEKILSQQPDLR 347
>gi|224136884|ref|XP_002326969.1| predicted protein [Populus trichocarpa]
gi|222835284|gb|EEE73719.1| predicted protein [Populus trichocarpa]
Length = 626
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 76/261 (29%), Positives = 121/261 (46%), Gaps = 32/261 (12%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
T + +G P F + +DTGS + ++PC C C G+ D + P+ SST
Sbjct: 79 TRLFIGTPPQEFALIVDTGSTVTYVPCSSCEQC--------GKHQDPR-FQPDLSSTYRP 129
Query: 166 VPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
V CN + C C G C Y+ RY ++ + S+G + EDV+ +S+ R
Sbjct: 130 VKCNPS-C----NCDDEGKQCTYERRY-AEMSSSSGVIAEDVVSFGN---ESELKPQRAV 180
Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISF 283
FGC V+TG A +G+ GLG + SV L ++G+I +SFS+C+G G G +
Sbjct: 181 FGCENVETGDLYSQRA-DGIMGLGRGRLSVVDQLVDKGVIGDSFSLCYGGMDVGGGAMVL 239
Query: 284 GDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-------EFSAIFDSGTSFTYL 336
G P S P YNI + ++ V G + + + DSGT++ Y
Sbjct: 240 GQISPPPNMVFSHSNPYRSPYYNIELKELHVAGKPLKLKPKVFDEKHGTVLDSGTTYAYF 299
Query: 337 NDPAYTQISETFNSLAKEKRE 357
+ A+ + + ++ KE R
Sbjct: 300 PEAAFHALKD---AIMKEIRH 317
>gi|224031303|gb|ACN34727.1| unknown [Zea mays]
gi|413923868|gb|AFW63800.1| hypothetical protein ZEAMMB73_012138 [Zea mays]
Length = 557
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 77/256 (30%), Positives = 112/256 (43%), Gaps = 32/256 (12%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
+YT++ +G P + + +DTGSDL W+ CD C +C G + +Y P +
Sbjct: 187 YYTSIFIGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHP---------LYKP---AK 234
Query: 163 SSKVPCNSTLCEL----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
VP LC+ Q C + C Y++ Y +D + S G L D +H+ +
Sbjct: 235 EKIVPPRDLLCQELQGNQNYCETC-KQCDYEIEY-ADQSSSMGVLARDDMHMIATNGGRE 292
Query: 219 SVDSRISFGCGRVQTGSFLDGAAP-NGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD- 276
+D FGC Q G L A +G+ GL S PS LA+ G+I N F C +
Sbjct: 293 KLD--FVFGCAYDQQGQLLSSPAKTDGILGLSSAAISFPSQLASHGIIANVFGHCITREQ 350
Query: 277 -GTGRISFGDKGSPGQGETPFSLRQ-THPTYNITITQVSVGGNAVNFEFSA------IFD 328
G G + GD P G T S+R Y+ V G + A IFD
Sbjct: 351 GGGGYMFLGDDYVPRWGVTWTSIRSGPDNLYHTQAHHVKYGDQQLRRPEQAGSTVQVIFD 410
Query: 329 SGTSFTYLNDPAYTQI 344
SG+S+TYL + Y +
Sbjct: 411 SGSSYTYLPNEIYENL 426
>gi|449464178|ref|XP_004149806.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 437
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 81/275 (29%), Positives = 124/275 (45%), Gaps = 18/275 (6%)
Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTS 160
LG+ + ++++G+ +F +D+GSDL W+ CD C H +Y PN +
Sbjct: 52 LGY-YSVSINIGKGDEAFEFDIDSGSDLTWVQCD-APCTHCTKPRE------QLYKPNNN 103
Query: 161 STSSKVP-CNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
+ + P C S C SA C Y++ Y G+ S G LV D H+
Sbjct: 104 ALNCFEPLCTSLHPITNHHCKSADDQCQYEIEYADHGS-SLGVLVND--HVPLKLTNGSL 160
Query: 220 VDSRISFGCGRVQTGSFLDGAAPN-GLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT 278
RI+FGCG S D + P G+ GLG + S S L++ G++ N C +G
Sbjct: 161 AAPRIAFGCGYDHKYSVPDSSPPTAGVLGLGNGEVSFISQLSSMGVVRNVVGHCLSDEG- 219
Query: 279 GRISFGDKGSPGQGETPFSLRQTH--PTYNITITQVSVGGNAVNF-EFSAIFDSGTSFTY 335
G + FGD+ P G T S+ Y+ +V G A + + +FDSG+S+TY
Sbjct: 220 GFLFFGDEFVPSSGVTWTSMSHESIGSYYSSGPAEVYFSGKATGIKDLTLVFDSGSSYTY 279
Query: 336 LNDPAYTQI-SETFNSLAKEKRETSTSDLPFEYCY 369
N AY I + N+L + E + D C+
Sbjct: 280 FNSQAYNSILALVKNNLRGKPLEDAPEDKSLPVCW 314
>gi|357122155|ref|XP_003562781.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 629
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 74/265 (27%), Positives = 123/265 (46%), Gaps = 27/265 (10%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
T + +G P F + +D+GS + ++PC SC N + + P+ SST S V
Sbjct: 87 TRLYIGTPPQEFALIVDSGSTVTYVPC--ASCEQCGNHQDPR------FQPDLSSTYSPV 138
Query: 167 PCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISF 226
C++ C S S C Y+ +Y ++ + S+G L ED++ T +S+ R F
Sbjct: 139 KCSADCT-----CDSDKSQCTYERQY-AEMSSSSGVLGEDIVSFGT---ESELKPQRAVF 189
Query: 227 GCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISFG 284
GC +TG A +G+ GLG + S+ L ++G+I +SFSMC+G G G + G
Sbjct: 190 GCENSETGDLFSQHA-DGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIGGGAMVLG 248
Query: 285 DKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE-------FSAIFDSGTSFTYLN 337
+P S P YNI + ++ V G A+ + + DSGT++ YL
Sbjct: 249 AMPAPPDMVFSRSDPVRSPYYNIELKEIHVAGKALRLDPRIFDSKHGTVLDSGTTYAYLP 308
Query: 338 DPAYTQISETFNSLAKEKRETSTSD 362
+ A+ + S + ++ D
Sbjct: 309 EQAFVAFKDAVTSKVRPLKKIRGPD 333
>gi|195645150|gb|ACG42043.1| aspartic proteinase Asp1 precursor [Zea mays]
Length = 415
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 81/283 (28%), Positives = 123/283 (43%), Gaps = 46/283 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
+Y +++G PA + + +DTGSDL WL CD C SC + +Y P +
Sbjct: 53 YYVTMNIGNPAKPYFLDVDTGSDLTWLQCDAPCRSC---------NKVPHPLYRP---TA 100
Query: 163 SSKVPCNSTLCEL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEK 215
+ VPC + LC +CPS C YQ++Y +D S G L+ D L
Sbjct: 101 NRLVPCANALCTALHSGQGSNNKCPSP-KQCDYQIKY-TDSASSQGVLINDSFSLPM--- 155
Query: 216 QSKSVDSRISFGCGRVQT--GSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
+S ++ ++FGCG Q + AA +G+ GLG S+ S L QG+ N C
Sbjct: 156 RSSNIRPGLTFGCGYDQQVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVGHCL 215
Query: 274 GSDGTGRISFGDKGSPGQGET--PFSLRQTHPTYNITITQVSVGGNAVNFE--------F 323
++G G + FGD P T P + R + Y S G + F+
Sbjct: 216 STNGGGFLFFGDDVVPSSRVTWVPMAQRTSGNYY-------SPGSGTLYFDRRSLGVKPM 268
Query: 324 SAIFDSGTSFTYLNDPAYTQISETFN-SLAKEKRETSTSDLPF 365
+FDSG+++TY Y + L+K ++ S LP
Sbjct: 269 EVVFDSGSTYTYFTAQPYQAVVSALKGGLSKSLKQVSDPTLPL 311
>gi|219886219|gb|ACL53484.1| unknown [Zea mays]
gi|219888509|gb|ACL54629.1| unknown [Zea mays]
gi|414588374|tpg|DAA38945.1| TPA: nucellin-like aspartic protease [Zea mays]
Length = 415
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 81/283 (28%), Positives = 123/283 (43%), Gaps = 46/283 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
+Y +++G PA + + +DTGSDL WL CD C SC + +Y P +
Sbjct: 53 YYVTMNIGNPAKPYFLDVDTGSDLTWLQCDAPCRSC---------NKVPHPLYRP---TA 100
Query: 163 SSKVPCNSTLCEL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEK 215
+ VPC + LC +CPS C YQ++Y +D S G L+ D L
Sbjct: 101 NRLVPCANALCTALHSGQGSNNKCPSP-KQCDYQIKY-TDSASSQGVLINDSFSLPM--- 155
Query: 216 QSKSVDSRISFGCGRVQT--GSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
+S ++ ++FGCG Q + AA +G+ GLG S+ S L QG+ N C
Sbjct: 156 RSSNIRPGLTFGCGYDQQVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVGHCL 215
Query: 274 GSDGTGRISFGDKGSPGQGET--PFSLRQTHPTYNITITQVSVGGNAVNFE--------F 323
++G G + FGD P T P + R + Y S G + F+
Sbjct: 216 STNGGGFLFFGDDVVPSSRVTWVPMAQRTSGNYY-------SPGSGTLYFDRRSLGVKPM 268
Query: 324 SAIFDSGTSFTYLNDPAYTQISETFN-SLAKEKRETSTSDLPF 365
+FDSG+++TY Y + L+K ++ S LP
Sbjct: 269 EVVFDSGSTYTYFTAQPYQAVVSALKGGLSKSLKQVSDPTLPL 311
>gi|168030587|ref|XP_001767804.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162680886|gb|EDQ67318.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 399
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 84/293 (28%), Positives = 131/293 (44%), Gaps = 30/293 (10%)
Query: 67 DRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTN-VSVGQPALSFIVALDTG 125
DR F RGR L + T + L +YT+ V +G P F + +DTG
Sbjct: 12 DRRFERRGRKLE-----------ESARMTLHDDLLTKGYYTSRVFIGTPPNEFALIVDTG 60
Query: 126 SDLFWLPCD-CVSCVHGLNSSSGQVI--DFNIYSPNTSSTSSKVPCNSTLCELQKQCPSA 182
S + ++PC C C H S S + + P SS+ K+ C S+ C + C S
Sbjct: 61 STVTYVPCSSCTHCGHHQASFSTHRLFCRDPRFKPENSSSYQKIGCRSSDC-ITGLCDSN 119
Query: 183 GSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAP 242
C Y+ R ++ + S G L +D+L + + +SFGC ++G A
Sbjct: 120 SHQCKYE-RMYAEMSTSKGVLGKDLLDFGPASRLQSQL---LSFGCETAESGDLYLQVA- 174
Query: 243 NGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISFGDKGSPGQGETPFSLRQ 300
+G+ GLG S+ L G I +SFS+C+G +G G + G +P S +
Sbjct: 175 DGIMGLGRGPLSIVDQLVGNGAIEDSFSLCYGGMDEGGGSMVLGAIPAPSGMVFAKSDPR 234
Query: 301 THPTYNITITQVSVGG-------NAVNFEFSAIFDSGTSFTYLNDPAYTQISE 346
YN+ +T++ V G N N +F I DSGT++ YL D A+ ++
Sbjct: 235 RSNYYNLELTEIQVQGASLKLDSNVFNGKFGTILDSGTTYAYLPDRAFEAFTD 287
>gi|356518800|ref|XP_003528065.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 438
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 89/294 (30%), Positives = 126/294 (42%), Gaps = 36/294 (12%)
Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPN 158
+GF + T +++GQP + + +DTGSDL WL CD C C + +Y P
Sbjct: 74 VGFYNVT-LNIGQPPRPYFLDIDTGSDLTWLQCDAPCSRCSQTPHP---------LYRP- 122
Query: 159 TSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRY---LSDGTMSTGFLVEDVLHLA-TDE 214
++ VPC +LC + P+Q Y +D S G L+ DV L T+
Sbjct: 123 ---SNDFVPCRHSLCASLHHSDNYDCEVPHQCDYEVQYADHYSSLGVLLHDVYTLNFTNG 179
Query: 215 KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG 274
Q K R++ GCG Q +G+ GLG KTS+ S L +QGL+ N C
Sbjct: 180 VQLKV---RMALGCGYDQIFPDPSHHPLDGMLGLGRGKTSLTSQLNSQGLVRNVIGHCLS 236
Query: 275 SDGTGRISFGD-KGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-EFSAIFDSGTS 332
+ G G I FGD S TP S R ++ GG A+FD+G+S
Sbjct: 237 AQGGGYIFFGDVYDSSRLTWTPMSSRDYKHYSAAGAAELLFGGKKSGIGSLHAVFDTGSS 296
Query: 333 FTYLNDPAYTQI-----SETFNSLAKEKRETSTSDL------PFEYCYVLRSFL 375
+TY N AY + E+ KE + T L PF Y +R +
Sbjct: 297 YTYFNPYAYQALISWLGKESGGKPLKEAHDDQTLPLCWRGRRPFRSIYEVRKYF 350
>gi|242050744|ref|XP_002463116.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
gi|241926493|gb|EER99637.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
Length = 632
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 87/311 (27%), Positives = 138/311 (44%), Gaps = 43/311 (13%)
Query: 51 LPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVS 110
LP S+ S LA R RG G A N + L + Y + T +
Sbjct: 47 LPLTRSYPNASRLAASSR----RGLGDGAHPNARMRLHDDLLTNGY--------YTTRLY 94
Query: 111 VGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNS 170
+G P F + +D+GS + ++PC SC N + + P+ SS+ S V CN
Sbjct: 95 IGTPPQEFALIVDSGSTVTYVPC--ASCEQCGNHQDPR------FQPDLSSSYSPVKCN- 145
Query: 171 TLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGR 230
+ C S C Y+ +Y ++ + S+G L ED++ ++S+ R FGC
Sbjct: 146 ----VDCTCDSDKKQCTYERQY-AEMSSSSGVLGEDIVSFG---RESELKPQRAVFGCEN 197
Query: 231 VQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPG 290
+TG A +G+ GLG + S+ L +G+I +SFS+C+G G + G P
Sbjct: 198 SETGDLFSQHA-DGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDIGGGAMVLGGVPA 256
Query: 291 QGETPFS----LRQTHPTYNITITQVSVGGNAV-------NFEFSAIFDSGTSFTYLNDP 339
+ FS LR P YNI + ++ V G A+ N + + DSGT++ YL +
Sbjct: 257 PSDMVFSHSDPLRS--PYYNIELKEIHVAGKALRVDSRVFNSKHGTVLDSGTTYAYLPEQ 314
Query: 340 AYTQISETFNS 350
A+ + S
Sbjct: 315 AFVAFKDAVTS 325
>gi|449463971|ref|XP_004149703.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 641
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 74/261 (28%), Positives = 119/261 (45%), Gaps = 33/261 (12%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
T + +G P F + +DTGS + ++PC C C + + P +SST
Sbjct: 85 TRLWIGTPPQQFALIVDTGSTVTYVPCSTCEQCGRHQDPK---------FDPESSSTYKP 135
Query: 166 VPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
+ CN + C S G C Y+ +Y ++ + S+G L EDV+ QS+ + R
Sbjct: 136 IKCN-----IDCICDSDGVQCVYERQY-AEMSTSSGVLGEDVISFGN---QSELIPQRAV 186
Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISF 283
FGC ++TG A +G+ GLG S+ L +G I +SFS+C+G G G +
Sbjct: 187 FGCENMETGDLFSQRA-DGIMGLGTGDLSLVDQLVEKGAINDSFSLCYGGMDIGGGAMVL 245
Query: 284 GDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-------EFSAIFDSGTSFTYL 336
G P +S P YN+ + ++ V G + + A+ DSGT++ YL
Sbjct: 246 GGISPPSDMIFTYSDPVRSPYYNVDLKEIHVAGKKLPLSSGIFDGRYGAVLDSGTTYAYL 305
Query: 337 NDPAYT----QISETFNSLAK 353
A++ I + +SL K
Sbjct: 306 PAEAFSAFKDAIMDEIHSLKK 326
>gi|449508297|ref|XP_004163275.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 641
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 74/261 (28%), Positives = 119/261 (45%), Gaps = 33/261 (12%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
T + +G P F + +DTGS + ++PC C C + + P +SST
Sbjct: 85 TRLWIGTPPQQFALIVDTGSTVTYVPCSTCEQCGRHQDPK---------FDPESSSTYKP 135
Query: 166 VPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
+ CN + C S G C Y+ +Y ++ + S+G L EDV+ QS+ + R
Sbjct: 136 IKCN-----IDCICDSDGVQCVYERQY-AEMSTSSGVLGEDVISFGN---QSELIPQRAV 186
Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISF 283
FGC ++TG A +G+ GLG S+ L +G I +SFS+C+G G G +
Sbjct: 187 FGCENMETGDLFSQRA-DGIMGLGTGDLSLVDQLVEKGAINDSFSLCYGGMDIGGGAMVL 245
Query: 284 GDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-------EFSAIFDSGTSFTYL 336
G P +S P YN+ + ++ V G + + A+ DSGT++ YL
Sbjct: 246 GGISPPSDMIFTYSDPVRSPYYNVDLKEIHVAGKKLPLSSGIFDGRYGAVLDSGTTYAYL 305
Query: 337 NDPAYT----QISETFNSLAK 353
A++ I + +SL K
Sbjct: 306 PAEAFSAFKDAIMDEIHSLKK 326
>gi|312283333|dbj|BAJ34532.1| unnamed protein product [Thellungiella halophila]
Length = 428
Score = 94.4 bits (233), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 86/287 (29%), Positives = 126/287 (43%), Gaps = 51/287 (17%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ ++G PA +VALDT +D W+PC CV C + ++ P+ SS+S
Sbjct: 91 YIVRANIGTPAQPMLVALDTSNDAAWVPCSGCVGCASSV-----------LFDPSKSSSS 139
Query: 164 SKVPCNSTLCELQKQCP----SAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
+ C++ C KQ P +AG +C + + Y G+ L +D L LA D +S
Sbjct: 140 RNLQCDAPQC---KQAPNPTCTAGKSCGFNMTY--GGSTIEASLTQDTLTLANDVIKS-- 192
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GS 275
+FGC TG+ L GL GLG S+ I Q L ++FS C S
Sbjct: 193 ----YTFGCISKATGTSLPA---QGLMGLGRGPLSL--ISQTQNLYMSTFSYCLPNSKSS 243
Query: 276 DGTGRISFGDKGSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEFSA-------- 325
+ +G + G K P + +T L+ + Y + + + VG V+ SA
Sbjct: 244 NFSGSLRLGPKYQPVRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDASTG 303
Query: 326 ---IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
IFDSGT FT L +PAY + F K TS F+ CY
Sbjct: 304 AGTIFDSGTVFTRLVEPAYVAVRNEFRRRIKNANATSLGG--FDTCY 348
>gi|356554625|ref|XP_003545645.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 452
Score = 94.4 bits (233), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 81/272 (29%), Positives = 121/272 (44%), Gaps = 28/272 (10%)
Query: 105 HYT-NVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
HYT ++++G P + + +D+GSDL W+ CD C C + +Y PN
Sbjct: 63 HYTVSLNIGYPPKLYDLDIDSGSDLTWVQCDAPCKGCTKPRD---------QLYKPN--- 110
Query: 162 TSSKVPCNSTLCE-----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
+ V C LC ++ C S C Y+V Y G+ S G LV D ++
Sbjct: 111 -HNLVQCVDQLCSEVQLSMEYTCASPDDQCDYEVEYADHGS-SLGVLVRD--YIPFQFTN 166
Query: 217 SKSVDSRISFGCGRVQTGSFLDGA-APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
V R++FGCG Q S + A +G+ GLG + S+ S L + GLI N C +
Sbjct: 167 GSVVRPRVAFGCGYDQKYSGSNSPPATSGVLGLGNGRASILSQLHSLGLIHNVVGHCLSA 226
Query: 276 DGTGRISFGDKGSPGQGETPFSL--RQTHPTYNITITQVSVGGNAVNFE-FSAIFDSGTS 332
G G + FGD P G S+ + Y+ ++ G A + IFDSG+S
Sbjct: 227 RGGGFLFFGDDFIPSSGIVWTSMLPSSSEKHYSSGPAELVFNGKATVVKGLELIFDSGSS 286
Query: 333 FTYLNDPAYTQISETFNSLAKEKRETSTSDLP 364
+TY N AY + + K K+ +D P
Sbjct: 287 YTYFNSQAYQAVVDLVTQDLKGKQLKRATDDP 318
>gi|168013126|ref|XP_001759252.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689565|gb|EDQ75936.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 385
Score = 94.4 bits (233), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 98/311 (31%), Positives = 132/311 (42%), Gaps = 45/311 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ VSVG P + +DTGSD+ WL C CVSC H + ++ P SST
Sbjct: 37 YFIRVSVGTPPRGMYLVMDTGSDILWLQCAPCVSCYHQCD---------EVFDPYKSSTY 87
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
S + CNS C G+ C YQV Y DG+ STG D + L + + V ++
Sbjct: 88 STLGCNSRQCLNLDVGGCVGNKCLYQVDY-GDGSFSTGEFATDAVSLNSTSGGGQVVLNK 146
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGTGR 280
I GCG G F+ A GL S P+ + ++ FS C +D T R
Sbjct: 147 IPLGCGHDNEGYFVGAAGLLGLG---KGPLSFPNQINSEN--GGRFSYCLTGRDTDSTER 201
Query: 281 IS--FGDKGSPGQGE--TPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA---------- 325
S FGD P G TP + T Y + +T +SVGG+ + SA
Sbjct: 202 SSLIFGDAAVPPAGVRFTPQASNLRVSTFYYLKMTGISVGGSILTIPTSAFQLDSLGNGG 261
Query: 326 -IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRSF---------L 375
I DSGTS T L + AY + E F + + T+ L F+ CY L L
Sbjct: 262 VIIDSGTSVTRLQNAAYASLREAFRAGTSDLVLTTEFSL-FDTCYNLSDLSSVDVPTVTL 320
Query: 376 HLQALVVLPFP 386
H Q L P
Sbjct: 321 HFQGGADLKLP 331
>gi|326511104|dbj|BAK03251.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 631
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 74/265 (27%), Positives = 123/265 (46%), Gaps = 27/265 (10%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
T + +G P F + +D+GS + ++PC SC N + + P+ SST S V
Sbjct: 90 TRLHIGTPPQEFALIVDSGSTVTYVPC--ASCEQCGNHQDPR------FQPDLSSTYSPV 141
Query: 167 PCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISF 226
CN + C S + C Y+ +Y ++ + S+G L ED++ T +S+ R F
Sbjct: 142 KCN-----VDCTCDSDKNQCTYERQY-AEMSSSSGVLGEDIVSFGT---ESELKPQRAVF 192
Query: 227 GCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISFG 284
GC +TG A +G+ GLG + S+ L ++G+I +SFSMC+G G G + G
Sbjct: 193 GCENSETGDLFSQHA-DGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIGGGAMVLG 251
Query: 285 DKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE-------FSAIFDSGTSFTYLN 337
+P S P YNI + ++ V G A+ + + DSGT++ YL
Sbjct: 252 AMPAPPGMIYTHSNAVRSPYYNIELKEMHVAGKALRVDPRIFDGKHGTVLDSGTTYAYLP 311
Query: 338 DPAYTQISETFNSLAKEKRETSTSD 362
+ A+ + +S ++ D
Sbjct: 312 EQAFVAFKDAVSSQVHPLKKIRGPD 336
>gi|326501422|dbj|BAK02500.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 631
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 74/265 (27%), Positives = 123/265 (46%), Gaps = 27/265 (10%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
T + +G P F + +D+GS + ++PC SC N + + P+ SST S V
Sbjct: 90 TRLHIGTPPQEFALIVDSGSTVTYVPC--ASCEQCGNHQDPR------FQPDLSSTYSPV 141
Query: 167 PCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISF 226
CN + C S + C Y+ +Y ++ + S+G L ED++ T +S+ R F
Sbjct: 142 KCN-----VDCTCDSDKNQCTYERQY-AEMSSSSGVLGEDIVSFGT---ESELKPQRAVF 192
Query: 227 GCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISFG 284
GC +TG A +G+ GLG + S+ L ++G+I +SFSMC+G G G + G
Sbjct: 193 GCENSETGDLFSQHA-DGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIGGGAMVLG 251
Query: 285 DKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE-------FSAIFDSGTSFTYLN 337
+P S P YNI + ++ V G A+ + + DSGT++ YL
Sbjct: 252 AMPAPPGMIYTHSNAVRSPYYNIELKEMHVAGKALRVDPRIFDGKHGTVLDSGTTYAYLP 311
Query: 338 DPAYTQISETFNSLAKEKRETSTSD 362
+ A+ + +S ++ D
Sbjct: 312 EQAFVAFKDAVSSQVHPLKKIRGPD 336
>gi|242058537|ref|XP_002458414.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
gi|241930389|gb|EES03534.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
Length = 448
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 91/309 (29%), Positives = 132/309 (42%), Gaps = 50/309 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
++ ++VG P +V +DTGSDL WL CV C H + +Y P +SST
Sbjct: 88 YFAVINVGDPPTRALVVIDTGSDLIWL--QCVPCRHCYRQVT------PLYDPRSSSTHR 139
Query: 165 KVPCNSTLCE---LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
++PC S C C + C Y V Y DG+ S+G L D L D
Sbjct: 140 RIPCASPRCRDVLRYPGCDARTGGCVYMVVY-GDGSASSGDLATDRLVFPDDTHVHN--- 195
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG------S 275
++ GCG G L+ AA GL G+G + S P+ LA + FS C G
Sbjct: 196 --VTLGCGHDNVG-LLESAA--GLLGVGRGQLSFPTQLAPA--YGHVFSYCLGDRLSRAQ 248
Query: 276 DGTGRISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGGNAVNFEFSA------ 325
+G+ + FG +P T F+ +T+P Y + + SVGG V +A
Sbjct: 249 NGSSYLVFGR--TPEPPSTAFTPLRTNPRRPSLYYVDMVGFSVGGERVTGFSNASLALNP 306
Query: 326 -------IFDSGTSFTYLNDPAYTQISETFNSLAKEK---RETSTSDLPFEYCYVLRSFL 375
+ DSGT+ + AY + + F+S A R+ +T F+ CY LR
Sbjct: 307 ATGRGGIVVDSGTAISRFARDAYAAVRDAFDSHAAAAGTMRKLATKFSVFDACYDLRGNG 366
Query: 376 HLQALVVLP 384
A V +P
Sbjct: 367 APAAAVRVP 375
>gi|242056193|ref|XP_002457242.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
gi|241929217|gb|EES02362.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
Length = 457
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 96/339 (28%), Positives = 140/339 (41%), Gaps = 38/339 (11%)
Query: 18 SCCAGCCFGFGTFGFDFHHRYS--DPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGR 75
+C A G G F DF HR S P + P A A A R + GR
Sbjct: 21 TCTASAAAGEGGFSVDFIHRDSARSPYR-------HPALSPHARALAAARRSLRGEVLGR 73
Query: 76 GLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDC 135
+ P++ + G ++ + F + V+VG P + DTGSDL W+ C
Sbjct: 74 SYSGASPAAAPVSAADGGVESKIITRSFEYLMYVNVGTPPTQLLAIADTGSDLVWVNC-- 131
Query: 136 VSCVHGLNSSSGQVIDFN-----IYSPNTSSTSSKVPCNSTLCELQKQCP-SAGSNCPYQ 189
+SS G + D + ++ P SST S++ C S C+ Q A S C YQ
Sbjct: 132 -------SSSGGGLADADAGGNVVFQPTRSSTYSQLSCQSNACQALSQASCDADSECQYQ 184
Query: 190 VRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLG 249
Y DG+ + G L + + + R++FGC G+F +GL GLG
Sbjct: 185 YSY-GDGSRTIGVLSTETFSFVDGGGKGQVRVPRVNFGCSTASAGTFRS----DGLVGLG 239
Query: 250 MDKTSVPSILANQGLIPNSFSMC----FGSDGTGRISFGDKG---SPGQGETPFSLRQTH 302
S+ S L I S C + ++ + ++FG + PG TP
Sbjct: 240 AGAFSLVSQLGATTHIDRKLSYCLIPSYDANSSSTLNFGSRAVVSEPGAASTPLVPSDVD 299
Query: 303 PTYNITITQVSVGGNAVNFEFSAIF-DSGTSFTYLNDPA 340
Y + + V+VGG V S I DSGT+ T+L DPA
Sbjct: 300 SYYTVALESVAVGGQEVATHDSRIIVDSGTTLTFL-DPA 337
>gi|297795137|ref|XP_002865453.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297311288|gb|EFH41712.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 665
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 78/257 (30%), Positives = 118/257 (45%), Gaps = 39/257 (15%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
T + +G P F + +DTGS + ++PC C C G+ D + P SS+
Sbjct: 82 TRLWIGTPPQEFALIVDTGSTVTYVPCSTCKQC--------GKHQDPK-FQPELSSSYKA 132
Query: 166 VPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
+ CN C C G C Y+ RY ++ + S+G L ED++ +S+ R
Sbjct: 133 LKCNPD-C----NCDDEGKLCVYERRY-AEMSSSSGVLSEDLISFGN---ESQLTPQRAV 183
Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISF 283
FGC V+TG A +G+ GLG K SV L ++G+I + FS+C+G G G +
Sbjct: 184 FGCENVETGDLFSQRA-DGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGMEVGGGAMVL 242
Query: 284 GDKGSPGQGET-----PFSLRQTHPTYNITITQVSVGGNAV-------NFEFSAIFDSGT 331
G K SP G PF P YNI + Q+ V G ++ N + + DSGT
Sbjct: 243 G-KISPPAGMVFSHSDPFR----SPYYNIDLKQMHVAGKSLKLNPKVFNGKHGTVLDSGT 297
Query: 332 SFTYLNDPAYTQISETF 348
++ Y A+ I +
Sbjct: 298 TYAYFPKEAFIAIKDAI 314
>gi|224130234|ref|XP_002328687.1| predicted protein [Populus trichocarpa]
gi|222838863|gb|EEE77214.1| predicted protein [Populus trichocarpa]
Length = 603
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 81/289 (28%), Positives = 123/289 (42%), Gaps = 37/289 (12%)
Query: 114 PALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNST 171
P + + DTGSDL W+ CD C SC G N+ Y P + VP
Sbjct: 199 PPQPYYLDFDTGSDLTWIQCDAPCTSCAKGANA---------WYKPRRGNI---VPPKDL 246
Query: 172 LCELQKQCPSAG-----SNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISF 226
LC ++ AG C Y++ Y +D + S G L D L L ++ F
Sbjct: 247 LCMEVQRNQKAGYCETCDQCDYEIEY-ADHSSSMGVLATDKLLLMVANGSLTKLN--FIF 303
Query: 227 GCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD--GTGRISF 283
GC Q G L +G+ GL K S+PS LA+QG+I N C +D G G +
Sbjct: 304 GCAYDQQGLLLKTLVKTDGILGLSRAKVSLPSQLASQGIINNVIGHCLTTDLGGGGYMFL 363
Query: 284 GDKGSP--GQGETPFSLRQTHPTYNITITQVSVGGNAVNF------EFSAIFDSGTSFTY 335
GD P G P + Y+ + +++ G + ++ +FDSG+S+TY
Sbjct: 364 GDDFVPRWGMAWVPMLDSPSMEFYHTEVVKLNYGSSPLSLGGMESRVKHILFDSGSSYTY 423
Query: 336 LNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYV----LRSFLHLQAL 380
AY+++ + N ++ STSD C+ +R F++ L
Sbjct: 424 FPKEAYSELVASLNEVSGAGLVQSTSDTTLPLCWRANFPIRKFIYRTEL 472
>gi|449439393|ref|XP_004137470.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
gi|449486840|ref|XP_004157418.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 570
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 78/268 (29%), Positives = 124/268 (46%), Gaps = 35/268 (13%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
L+YT + VG+P + + +DTGSDL W+ CD C SC G + +Y P +
Sbjct: 198 LYYTYIMVGEPPRPYFLDIDTGSDLTWVQCDAPCSSCGKGRSP---------LYKPRREN 248
Query: 162 TSSKVPCNSTLC-ELQK-----QCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEK 215
V +LC E+Q+ QC +A C Y+V+Y +D + S G LV+D L
Sbjct: 249 V---VSFKDSLCMEVQRNYDGDQC-AACQQCNYEVQY-ADQSSSLGVLVKDEFTLRFSNG 303
Query: 216 QSKSVDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG 274
+++ FGC Q G L+ + +G+ GL K S+PS LA++G+I N C
Sbjct: 304 SLTKLNA--IFGCAYDQQGLLLNTLSKTDGILGLSRAKVSLPSQLASRGIINNVVGHCLT 361
Query: 275 SD--GTGRISFGDKGSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEF------S 324
D G G + GD P G ++ + Y + ++ G ++ +
Sbjct: 362 GDPAGGGYLFLGDDFVPQWGMAWVAMLDSPSIDFYQTKVVRIDYGSIPLSLDTWGSSREQ 421
Query: 325 AIFDSGTSFTYLNDPAYTQISETFNSLA 352
+FDSG+S+TY AY Q+ ++
Sbjct: 422 VVFDSGSSYTYFTKEAYYQLVANLEEVS 449
>gi|12323376|gb|AAG51657.1|AC010704_1 nucellin-like protein; 27671-25467 [Arabidopsis thaliana]
Length = 427
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 81/277 (29%), Positives = 116/277 (41%), Gaps = 41/277 (14%)
Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPN 158
LG+ +Y +++G P F + +DTGSDL W+ CD C C Y PN
Sbjct: 64 LGY-YYVLLNIGNPPKLFDLDIDTGSDLTWVQCDAPCNGCTK--------------YKPN 108
Query: 159 TSSTSSKVPCNSTLCE-----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
++ +PC+ LC + C C Y++ Y SD S G LV D + L
Sbjct: 109 HNT----LPCSHILCSGLDLPQDRPCADPEDQCDYEIGY-SDHASSIGALVTDEVPLKL- 162
Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPN-GLFGLGMDKTSVPSILANQGLIPNSFSMC 272
++ R++FGCG Q P G+ GLG K + + L + G+ N C
Sbjct: 163 -ANGSIMNLRLTFGCGYDQQNPGPHPPPPTAGILGLGRGKVGLSTQLKSLGITKNVIVHC 221
Query: 273 FGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYN-------ITITQVSVGGNAVNFEFSA 325
G G +S GD+ P G T SL P+ N + + G +N
Sbjct: 222 LSHTGKGFLSIGDELVPSSGVTWTSLATNSPSKNYMAGPAELLFNDKTTGVKGIN----V 277
Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSD 362
+FDSG+S+TY N AY I + K T T D
Sbjct: 278 VFDSGSSYTYFNAEAYQAILDLIRKDLNGKPLTDTKD 314
>gi|413947545|gb|AFW80194.1| hypothetical protein ZEAMMB73_386053 [Zea mays]
Length = 456
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 95/326 (29%), Positives = 136/326 (41%), Gaps = 30/326 (9%)
Query: 28 GTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPL 87
G F DF HR D + A LP + + R GR + P+
Sbjct: 28 GGFSVDFIHR--DSARSPFAQPSLPPHARALAAARRSLRGAAL---GRYVGGASPAPGPV 82
Query: 88 TFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSG 147
+ G ++ + F + V+VG P + DTGSDL W+ +C S G +S G
Sbjct: 83 PEADGGVESKIITRSFEYLMYVNVGTPPAQMLAIADTGSDLVWV--NCSSNGGGGGASDG 140
Query: 148 QVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCP-SAGSNCPYQVRYLSDGTMSTGFLVED 206
V ++ P+ S+T S + C S C+ Q A S C YQ Y DG+ + G L +
Sbjct: 141 AV----VFHPSRSTTYSLLSCQSAACQALSQASCDADSECQYQYAY-GDGSRTIGVLSTE 195
Query: 207 VLHL--ATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGL 264
A + + R+SFGC GSF +GL GLG S+ S L
Sbjct: 196 TFSFAAAGGGGEGQVRVPRVSFGCSTGSAGSFRS----DGLVGLGAGALSLVSQLGAAAR 251
Query: 265 IPNSFSMCF-----GSDGTGRISFGDKG---SPGQGETPFSLRQTHPTYNITITQVSVGG 316
I FS C ++ + +SFG + PG TP + Y + + V+V G
Sbjct: 252 IARRFSYCLVPPYAAANSSSTLSFGARAVVSDPGAASTPLVPSEVDSYYTVALESVAVAG 311
Query: 317 NAVNFEFSA--IFDSGTSFTYLNDPA 340
V S+ I DSGT+ T+L DPA
Sbjct: 312 QDVASANSSRIIVDSGTTLTFL-DPA 336
>gi|255558640|ref|XP_002520345.1| nucellin, putative [Ricinus communis]
gi|223540564|gb|EEF42131.1| nucellin, putative [Ricinus communis]
Length = 424
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 84/272 (30%), Positives = 116/272 (42%), Gaps = 24/272 (8%)
Query: 104 LHYTNVS--VGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNT 159
L Y +VS +G P F + +DTGSDL W+ CD C C L+ ++Y P
Sbjct: 64 LGYYSVSLYIGNPPKLFELDIDTGSDLTWVQCDAPCTGCTKPLH---------HLYKPRN 114
Query: 160 SSTSSKVP-CNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
+ S P C++ QC SA C Y+++Y +G+ S G LV D L
Sbjct: 115 NLLSCIDPLCSAVQNSGTYQCQSATDQCDYEIQYADEGS-SLGVLVTDYFPLRL--MNGS 171
Query: 219 SVDSRISFGCGRVQTGSFLDGAAPN-GLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDG 277
+ +++FGCG Q P G+ GLG KTS+ S L G++ N C G
Sbjct: 172 FLRPKMTFGCGYDQKSPGPVAPPPTTGVLGLGNGKTSIISQLQALGVMGNVIGHCLSRKG 231
Query: 278 TGRISFGDKGSP--GQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA-IFDSGTSFT 334
G + FG P G P S + Y ++ GG + IFDSG+S+T
Sbjct: 232 GGFLFFGQDPVPSFGISWAPMSQKSLDKYYASGPAELLYGGKPTGTKAEEFIFDSGSSYT 291
Query: 335 YLNDPAYTQISETFNSLAKEKRETSTSDLPFE 366
Y N Y T N + KE D P E
Sbjct: 292 YFNAQVY---QSTLNLIRKELSGKPLRDAPEE 320
>gi|384252236|gb|EIE25712.1| acid protease [Coccomyxa subellipsoidea C-169]
Length = 599
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 90/308 (29%), Positives = 143/308 (46%), Gaps = 51/308 (16%)
Query: 70 FRLRGRGLAAQG---NDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGS 126
R R G+ A+G N PL + + Y Y + +G PA F V +DTGS
Sbjct: 32 LRRRDGGIIARGLLRNATLPLHGAVKDYGY--------FYATLHLGTPARQFAVIVDTGS 83
Query: 127 DLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAG--- 183
+ ++PC SC + G + P +SS+S+ + C+S C + P G
Sbjct: 84 TITYVPC--ASC----GRNCGPHHKDAAFDPASSSSSAVIGCDSDKCICGR--PPCGCSE 135
Query: 184 -SNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAP 242
C YQ Y ++ + S G LV D L L + +V+ + FGC +TG + A
Sbjct: 136 KRECTYQRTY-AEQSSSAGLLVSDQLQL-----RDGAVE--VVFGCETKETGEIYNQEA- 186
Query: 243 NGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGTGRISFGDKGSPGQGETPFSLRQT 301
+G+ GLG + S+ + LA G+I + F++CFGS +G G + GD + E +L+ T
Sbjct: 187 DGILGLGNSEVSLVNQLAGSGVIDDVFALCFGSVEGDGALMLGDVDA---AEYDVALQYT 243
Query: 302 -------HPT-YNITITQVSVGGNAVNFE-------FSAIFDSGTSFTYLNDPAYTQISE 346
HP Y++ + + VGG + + + + DSGT+FTYL A+ E
Sbjct: 244 ALLSSLAHPHYYSVQLEALWVGGQQLPVKPERYEEGYGTVLDSGTTFTYLPSEAFQLFKE 303
Query: 347 TFNSLAKE 354
++ A E
Sbjct: 304 AVSAYALE 311
>gi|30699263|ref|NP_177872.3| aspartyl protease-like protein [Arabidopsis thaliana]
gi|332197862|gb|AEE35983.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 432
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 81/277 (29%), Positives = 116/277 (41%), Gaps = 36/277 (12%)
Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPN 158
LG+ +Y +++G P F + +DTGSDL W+ CD C C Y PN
Sbjct: 64 LGY-YYVLLNIGNPPKLFDLDIDTGSDLTWVQCDAPCNGCTKPRAKQ---------YKPN 113
Query: 159 TSSTSSKVPCNSTLCE-----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
++ +PC+ LC + C C Y++ Y SD S G LV D + L
Sbjct: 114 HNT----LPCSHILCSGLDLPQDRPCADPEDQCDYEIGY-SDHASSIGALVTDEVPLKL- 167
Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPN-GLFGLGMDKTSVPSILANQGLIPNSFSMC 272
++ R++FGCG Q P G+ GLG K + + L + G+ N C
Sbjct: 168 -ANGSIMNLRLTFGCGYDQQNPGPHPPPPTAGILGLGRGKVGLSTQLKSLGITKNVIVHC 226
Query: 273 FGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYN-------ITITQVSVGGNAVNFEFSA 325
G G +S GD+ P G T SL P+ N + + G +N
Sbjct: 227 LSHTGKGFLSIGDELVPSSGVTWTSLATNSPSKNYMAGPAELLFNDKTTGVKGIN----V 282
Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSD 362
+FDSG+S+TY N AY I + K T T D
Sbjct: 283 VFDSGSSYTYFNAEAYQAILDLIRKDLNGKPLTDTKD 319
>gi|226532674|ref|NP_001151415.1| pepsin A precursor [Zea mays]
gi|195646632|gb|ACG42784.1| pepsin A [Zea mays]
Length = 492
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 83/269 (30%), Positives = 124/269 (46%), Gaps = 32/269 (11%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
L+YT + +G P + V +DTGSD+ W+ + +SC G + SG I+ Y P S T+
Sbjct: 84 LYYTRIEIGSPPKGYYVQVDTGSDILWV--NGISC-DGCPTRSGLGIELTQYDPAGSGTT 140
Query: 164 SKVPCNSTLC-------ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDE 214
V C C + CPSA S C +++ Y DG+ +TGF V D + +
Sbjct: 141 --VGCEQEFCVANSAASGVPPACPSAASPCQFRITY-GDGSSTTGFYVTDFVQYNQVSGN 197
Query: 215 KQSKSVDSRISFGCGRVQTGSFLDGA--APNGLFGLGMDKTSVPSILANQGLIPNSFSMC 272
Q+ + I+FGCG Q G L + A +G+ G G S+ S LA + F+ C
Sbjct: 198 GQTTPSNVSITFGCG-AQLGGDLGSSSQALDGILGFGQSDASMLSQLAAARKVRKIFAHC 256
Query: 273 FGS-DGTGRISFGDKGSPGQGE-TPFSLRQTHPTYNITITQVSVGGNAVNFEFS------ 324
+ G G + G+ P + TP TH YN+ + +SVGG + S
Sbjct: 257 LDTVRGGGIFAIGNVVQPPIVKTTPLVPNATH--YNVNLQGISVGGATLQLPTSTFDSGD 314
Query: 325 ---AIFDSGTSFTYLNDPAY-TQISETFN 349
I DSGT+ YL Y T ++ F+
Sbjct: 315 SKGTIIDSGTTLAYLPREVYRTLLTAVFD 343
>gi|297820186|ref|XP_002877976.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
lyrata]
gi|297323814|gb|EFH54235.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
lyrata]
Length = 425
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 83/287 (28%), Positives = 126/287 (43%), Gaps = 51/287 (17%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ ++G PA + +VALDT +D W+PC CV C + ++ P+ SS+S
Sbjct: 88 YIVRANIGTPAQAMLVALDTSNDAAWIPCSGCVGCSSSV-----------LFDPSKSSSS 136
Query: 164 SKVPCNSTLCELQKQCP----SAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
+ C + C KQ P + +C + + Y G+ +L +D L LATD
Sbjct: 137 RTLQCEAPQC---KQAPNPSCTVSKSCGFNMTY--GGSAIEAYLTQDTLTLATD------ 185
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GS 275
V +FGC +G+ L GL GLG S+ I +Q L ++FS C S
Sbjct: 186 VIPNYTFGCINKASGTSLPA---QGLMGLGRGPLSL--ISQSQNLYQSTFSYCLPNSKSS 240
Query: 276 DGTGRISFGDKGSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEFSA-------- 325
+ +G + G K P + +T L+ + Y + + + VG V+ SA
Sbjct: 241 NFSGSLRLGPKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATG 300
Query: 326 ---IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
IFDSGT +T L +PAY + F K TS F+ CY
Sbjct: 301 AGTIFDSGTVYTRLVEPAYVAMRNEFRRRVKNANATSLGG--FDTCY 345
>gi|242082978|ref|XP_002441914.1| hypothetical protein SORBIDRAFT_08g004800 [Sorghum bicolor]
gi|241942607|gb|EES15752.1| hypothetical protein SORBIDRAFT_08g004800 [Sorghum bicolor]
Length = 429
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 80/277 (28%), Positives = 116/277 (41%), Gaps = 32/277 (11%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
L+Y +++G P + + +DTGSDL WL CD C SC + +Y P +
Sbjct: 65 LYYVAMNIGNPPKPYFLDVDTGSDLTWLQCDAPCRSC---------NKVPHPLYRPTKNK 115
Query: 162 TSSKVPCNSTLC-------ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVED--VLHLAT 212
VPC LC + +C S C Y ++Y G+ STG LV D L LA
Sbjct: 116 L---VPCVDQLCASLHNGLNRKHKCDSPYEQCDYVIKYADQGS-STGVLVNDSFALRLAN 171
Query: 213 DEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC 272
V ++FGCG Q S + + +G+ GLG S+ S G+ N C
Sbjct: 172 ----GSVVRPSLAFGCGYDQQVSSGEMSPTDGVLGLGTGSVSLLSQFKQHGVTKNVVGHC 227
Query: 273 FGSDGTGRISFGDKGSPGQ--GETPFSLRQTHPTYNITITQVSVGGNAVNFEFS-AIFDS 329
G G + FGD P Q TP Y+ + G ++ + + +FDS
Sbjct: 228 LSLRGGGFLFFGDDLVPYQRVTWTPMVRSPLRNYYSPGSASLYFGDQSLRVKLTEVVFDS 287
Query: 330 GTSFTYLNDPAYTQISETFN-SLAKEKRETSTSDLPF 365
G+SFTY Y + L++ +E S LP
Sbjct: 288 GSSFTYFAAQPYQALVTALKGDLSRTLKEVSDPSLPL 324
>gi|148910602|gb|ABR18371.1| unknown [Picea sitchensis]
Length = 446
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 84/288 (29%), Positives = 125/288 (43%), Gaps = 38/288 (13%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
L+Y + VG P+ + + +D+GS+L W+ CD C+SC G + +Y S
Sbjct: 78 LYYVTMLVGNPSKPYFLDVDSGSELTWIQCDAPCISCAKGPHP---------LYKLKKGS 128
Query: 162 TSSKVPCNSTLCELQK-------QCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
VP LC + A C Y V Y +D S GFLV D +
Sbjct: 129 L---VPSKDPLCAAVQAGSGHYHNHKEASQRCDYDVAY-ADHGYSEGFLVRDSVRALLTN 184
Query: 215 KQSKSVDSRISFGCGRVQTGSF-LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
K + +S FGCG Q S + A +G+ GLG S+PS A QGLI N C
Sbjct: 185 KTVLTANS--VFGCGYNQRESLPVSDARTDGILGLGSGMASLPSQWAKQGLIKNVIGHCI 242
Query: 274 ---GSDGTGRISFGDK--GSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA--- 325
G DG G + FGD + P R + Y + Q++ G ++ +
Sbjct: 243 FGAGRDG-GYMFFGDDLVSTSAMTWVPMLGRPSIKHYYVGAAQMNFGNKPLDKDGDGKKL 301
Query: 326 ---IFDSGTSFTYLNDPAYTQ-ISETFNSLAKEKRETSTSDLPFEYCY 369
IFDSG+++TY + AY +S +L+ ++ E +SD C+
Sbjct: 302 GGIIFDSGSTYTYFTNQAYGAFLSVVKENLSGKQLEQDSSDSFLSLCW 349
>gi|30699261|ref|NP_850981.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|17065172|gb|AAL32740.1| nucellin-like protein [Arabidopsis thaliana]
gi|24899795|gb|AAN65112.1| nucellin-like protein [Arabidopsis thaliana]
gi|332197863|gb|AEE35984.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 466
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 80/277 (28%), Positives = 115/277 (41%), Gaps = 36/277 (12%)
Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPN 158
LG+ +Y +++G P F + +DTGSDL W+ CD C C Y PN
Sbjct: 64 LGY-YYVLLNIGNPPKLFDLDIDTGSDLTWVQCDAPCNGCTKPRAKQ---------YKPN 113
Query: 159 TSSTSSKVPCNSTLCE-----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
++ +PC+ LC + C C Y++ Y SD S G LV D + L
Sbjct: 114 HNT----LPCSHILCSGLDLPQDRPCADPEDQCDYEIGY-SDHASSIGALVTDEVPLKL- 167
Query: 214 EKQSKSVDSRISFGCG-RVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC 272
++ R++FGCG Q G+ GLG K + + L + G+ N C
Sbjct: 168 -ANGSIMNLRLTFGCGYDQQNPGPHPPPPTAGILGLGRGKVGLSTQLKSLGITKNVIVHC 226
Query: 273 FGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYN-------ITITQVSVGGNAVNFEFSA 325
G G +S GD+ P G T SL P+ N + + G +N
Sbjct: 227 LSHTGKGFLSIGDELVPSSGVTWTSLATNSPSKNYMAGPAELLFNDKTTGVKGIN----V 282
Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSD 362
+FDSG+S+TY N AY I + K T T D
Sbjct: 283 VFDSGSSYTYFNAEAYQAILDLIRKDLNGKPLTDTKD 319
>gi|294461400|gb|ADE76261.1| unknown [Picea sitchensis]
Length = 165
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 53/127 (41%), Positives = 69/127 (54%), Gaps = 12/127 (9%)
Query: 29 TFGFDFHHRYSDPVKGI------LAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGN 82
++ +H++S+ VK L D P +GS YY AL H D GR LA
Sbjct: 27 SYSLQMYHKFSNEVKEWMTWRHGLDTDGWPVEGSNEYYKALYHHDS--ARHGRKLA---- 80
Query: 83 DKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGL 142
D LTF GN+T + LGFL Y+ V VG P ++ VALDTGSD+FW+PCDC +C
Sbjct: 81 DHPSLTFLEGNETVEIPQLGFLFYSMVQVGTPNVTLFVALDTGSDVFWVPCDCQACAPTS 140
Query: 143 NSSSGQV 149
+S G V
Sbjct: 141 AASYGLV 147
>gi|242050426|ref|XP_002462957.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
gi|241926334|gb|EER99478.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
Length = 452
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 96/302 (31%), Positives = 133/302 (44%), Gaps = 53/302 (17%)
Query: 99 NSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSP 157
N G H +SVG P L+F +DTGSDL W C C + + +Y P
Sbjct: 91 NGAGAYHMI-LSVGTPPLAFPAIIDTGSDLTWTQCAPCTTACFAQPTP--------LYDP 141
Query: 158 NTSSTSSKVPCNSTLCELQKQCPSA-----GSNCPYQVRYLSDGTMSTGFLVEDVLHLAT 212
SST SK+PC S LC+ PSA + C Y RY + G+L D L +
Sbjct: 142 ARSSTFSKLPCASPLCQ---ALPSAFRACNATGCVYDYRYAVG--FTAGYLAADTLAIGD 196
Query: 213 DEKQSKSVDS--RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFS 270
+ + S ++FGC G +DGA +G+ GLG S S+L+ G+ FS
Sbjct: 197 GDGDGDASSSFAGVAFGCSTANGGD-MDGA--SGIVGLGR---SALSLLSQIGV--GRFS 248
Query: 271 MCFGSD---GTGRISF-------GDK-GSPGQGETPFSLRQTHPTYNITITQVSVGGNAV 319
C SD G I F GDK S P + R+ P Y + +T ++VG +
Sbjct: 249 YCLRSDADAGASPILFGALANVTGDKVQSTALLRNPVAARRRAPYYYVNLTGIAVGSTDL 308
Query: 320 -----NFEFSA------IFDSGTSFTYLNDPAYTQISETFNS-LAKEKRETSTSDLPFEY 367
F F+A I DSGT+FTYL + YT + + F S A S + F+
Sbjct: 309 PVTSSTFGFTAAGAGGVIVDSGTTFTYLAEAGYTMLRQAFLSQTAGLLTRVSGAQFDFDL 368
Query: 368 CY 369
C+
Sbjct: 369 CF 370
>gi|115479815|ref|NP_001063501.1| Os09g0482200 [Oryza sativa Japonica Group]
gi|50725878|dbj|BAD33407.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50725881|dbj|BAD33410.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113631734|dbj|BAF25415.1| Os09g0482200 [Oryza sativa Japonica Group]
gi|125606112|gb|EAZ45148.1| hypothetical protein OsJ_29786 [Oryza sativa Japonica Group]
Length = 485
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 91/286 (31%), Positives = 126/286 (44%), Gaps = 35/286 (12%)
Query: 97 RLNSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNI 154
R SLG +Y +V +G PA + V DTGSDL W+ C C C + +
Sbjct: 140 RGISLGTGNYVVSVGLGTPAKQYAVIFDTGSDLSWVQCKPCADCYEQQDP---------L 190
Query: 155 YSPNTSSTSSKVPCNSTLC-ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
+ P+ SST + V C + C EL S+ S C Y+V+Y D + + G LV D L L+
Sbjct: 191 FDPSLSSTYAAVACGAPECQELDASGCSSDSRCRYEVQY-GDQSQTDGNLVRDTLTLSAS 249
Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
+ V FGCG G F +GLFGLG +K S+PS A F+ C
Sbjct: 250 DTLPGFV-----FGCGDQNAGLF---GQVDGLFGLGREKVSLPSQGAPS--YGPGFTYCL 299
Query: 274 GSDGTGRISFGDKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFE-------FS 324
S +GR G+P T + T Y I + + VGG A+
Sbjct: 300 PSSSSGRGYLSLGGAPPANAQFTALADGATPSFYYIDLVGIKVGGRAIRIPATAFAAAGG 359
Query: 325 AIFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEYCY 369
+ DSGT T L AY + F S+A+ K+ + S L + CY
Sbjct: 360 TVIDSGTVITRLPPRAYAPLRAAFARSMAQYKKAPALSIL--DTCY 403
>gi|242062704|ref|XP_002452641.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
gi|241932472|gb|EES05617.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
Length = 466
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 85/271 (31%), Positives = 128/271 (47%), Gaps = 36/271 (13%)
Query: 99 NSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYS 156
SL L Y V +G PA++ +++DTGSD+ W+ C C C ++S ++
Sbjct: 124 TSLSTLEYVITVGIGSPAVTQTMSMDTGSDVSWVQCKPCSQCHSEVDS---------LFD 174
Query: 157 PNTSSTSSKVPCNSTLC----ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLAT 212
P+ SST S C+S C + Q+ + S C Y V Y+ DG+ +TG D L L +
Sbjct: 175 PSASSTYSPFSCSSAACVQLSQSQQGNGCSSSQCQYIVSYV-DGSSTTGTYSSDTLTLGS 233
Query: 213 DEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC 272
+ + FGC + ++G F D +GL GLG D S+ S A G +FS C
Sbjct: 234 NAIKG------FQFGCSQSESGGFSD--QTDGLMGLGGDAQSLVSQTA--GTFGKAFSYC 283
Query: 273 F--GSDGTGRISFGDKGSPGQGETPFSLRQTH-PT-YNITITQVSVGGNAVN-----FEF 323
+G ++ G G +TP LR T PT Y + + + VGG +N F
Sbjct: 284 LPPTPGSSGFLTLGAASRSGFVKTPM-LRSTQIPTYYGVLLEAIRVGGQQLNIPTSVFSA 342
Query: 324 SAIFDSGTSFTYLNDPAYTQISETFNSLAKE 354
++ DSGT T L AY+ +S F + K+
Sbjct: 343 GSVMDSGTVITRLPPTAYSALSSAFKAGMKK 373
>gi|125564143|gb|EAZ09523.1| hypothetical protein OsI_31798 [Oryza sativa Indica Group]
Length = 485
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 91/286 (31%), Positives = 126/286 (44%), Gaps = 35/286 (12%)
Query: 97 RLNSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNI 154
R SLG +Y +V +G PA + V DTGSDL W+ C C C + +
Sbjct: 140 RGISLGTGNYVVSVGLGTPAKQYAVIFDTGSDLSWVQCKPCADCYEQQDP---------L 190
Query: 155 YSPNTSSTSSKVPCNSTLC-ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
+ P+ SST + V C + C EL S+ S C Y+V+Y D + + G LV D L L+
Sbjct: 191 FDPSLSSTYAAVACGAPECQELDASGCSSDSRCRYEVQY-GDQSQTDGNLVRDTLTLSAS 249
Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
+ V FGCG G F +GLFGLG +K S+PS A F+ C
Sbjct: 250 DTLPGFV-----FGCGDQNAGLF---GQVDGLFGLGREKVSLPSQGAPS--YGPGFTYCL 299
Query: 274 GSDGTGRISFGDKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNF-------EFS 324
S +GR G+P T + T Y I + + VGG A+
Sbjct: 300 PSSSSGRGYLSLGGAPPANAQFTALADGATPSFYYIDLVGIKVGGRAIRIPATAFAAAGG 359
Query: 325 AIFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEYCY 369
+ DSGT T L AY + F S+A+ K+ + S L + CY
Sbjct: 360 TVIDSGTVITRLPPRAYAPLRAAFARSMAQYKKAPALSIL--DTCY 403
>gi|115479485|ref|NP_001063336.1| Os09g0452400 [Oryza sativa Japonica Group]
gi|51535935|dbj|BAD38017.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113631569|dbj|BAF25250.1| Os09g0452400 [Oryza sativa Japonica Group]
gi|215693279|dbj|BAG88661.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 441
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 82/292 (28%), Positives = 124/292 (42%), Gaps = 48/292 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ +V +G P F +DTGSDL W C C+ CV Q + + P S++
Sbjct: 88 YLMDVGIGSPPRYFSAMIDTGSDLIWTQCAPCLLCVE-------QPTPY--FEPAKSTSY 138
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
+ +PC+S +C + C YQ Y D S G L + T+ ++ R
Sbjct: 139 ASLPCSSAMCNALYSPLCFQNACVYQAFY-GDSASSAGVLANETFTFGTNS--TRVAVPR 195
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDGTGR 280
+SFGCG + G+ +G+ G+ G G S+ S L + FS C F S T R
Sbjct: 196 VSFGCGNMNAGTLFNGS---GMVGFGRGALSLVSQLGSP-----RFSYCLTSFMSPATSR 247
Query: 281 ISFGDKGS----------PGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA---- 325
+ FG + P Q TPF + PT Y + +T +SV G+ + + S
Sbjct: 248 LYFGAYATLNSTNTSSSGPVQ-STPFIVNPALPTMYFLNMTGISVAGDLLPIDPSVFAIN 306
Query: 326 --------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
I DSGT+ T+L PAY + F + R +T F+ C+
Sbjct: 307 ETDGTGGVIIDSGTTVTFLAQPAYAMVQGAFVAWVGLPRANATPSDTFDTCF 358
>gi|414887402|tpg|DAA63416.1| TPA: hypothetical protein ZEAMMB73_414910 [Zea mays]
Length = 407
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 71/265 (26%), Positives = 120/265 (45%), Gaps = 27/265 (10%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
T + +G P F + +D+GS + ++PC SC N + + P+ SS+ S V
Sbjct: 91 TRLYIGTPPQEFALIVDSGSTVTYVPC--ASCEQCGNHQDPR------FQPDLSSSYSPV 142
Query: 167 PCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISF 226
CN + C S C Y+ +Y ++ + S+G L ED++ ++S+ R F
Sbjct: 143 KCN-----VDCTCDSDKKQCTYERQY-AEMSSSSGVLGEDIVSFG---RESELKAQRAVF 193
Query: 227 GCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDK 286
GC +TG A +G+ GLG + S+ L +G+I +SFS+C+G G +
Sbjct: 194 GCENSETGDLFSQHA-DGIMGLGRGQLSIMDQLVEKGVINDSFSLCYGGMDIGGGAMVLG 252
Query: 287 GSPGQGETPFSLRQ--THPTYNITITQVSVGGNAVNFE-------FSAIFDSGTSFTYLN 337
G P + FS P YNI + ++ V G A+ + + DSGT++ YL
Sbjct: 253 GVPTPSDMVFSRSDPLRSPYYNIELKEIHVAGKALRVDSRIFDSKHGTVLDSGTTYAYLP 312
Query: 338 DPAYTQISETFNSLAKEKRETSTSD 362
+ A+ + S ++ D
Sbjct: 313 EQAFMAFKDAVTSKVHSLKKIRGPD 337
>gi|125563957|gb|EAZ09337.1| hypothetical protein OsI_31609 [Oryza sativa Indica Group]
gi|125605916|gb|EAZ44952.1| hypothetical protein OsJ_29595 [Oryza sativa Japonica Group]
Length = 438
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 82/292 (28%), Positives = 124/292 (42%), Gaps = 48/292 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ +V +G P F +DTGSDL W C C+ CV Q + + P S++
Sbjct: 85 YLMDVGIGSPPRYFSAMIDTGSDLIWTQCAPCLLCVE-------QPTPY--FEPAKSTSY 135
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
+ +PC+S +C + C YQ Y D S G L + T+ ++ R
Sbjct: 136 ASLPCSSAMCNALYSPLCFQNACVYQAFY-GDSASSAGVLANETFTFGTNS--TRVAVPR 192
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDGTGR 280
+SFGCG + G+ +G+ G+ G G S+ S L + FS C F S T R
Sbjct: 193 VSFGCGNMNAGTLFNGS---GMVGFGRGALSLVSQLGSP-----RFSYCLTSFMSPATSR 244
Query: 281 ISFGDKGS----------PGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA---- 325
+ FG + P Q TPF + PT Y + +T +SV G+ + + S
Sbjct: 245 LYFGAYATLNSTNTSSSGPVQ-STPFIVNPALPTMYFLNMTGISVAGDLLPIDPSVFAIN 303
Query: 326 --------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
I DSGT+ T+L PAY + F + R +T F+ C+
Sbjct: 304 ETDGTGGVIIDSGTTVTFLAQPAYAMVQGAFVAWVGLPRANATPSDTFDTCF 355
>gi|356507437|ref|XP_003522473.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 440
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 89/294 (30%), Positives = 125/294 (42%), Gaps = 36/294 (12%)
Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPN 158
+GF + T +++GQP + + +DTGSDL WL CD C C + +Y P
Sbjct: 76 VGFYNVT-LNIGQPPRPYFLDIDTGSDLTWLQCDAPCSRCSQTPHP---------LYRP- 124
Query: 159 TSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRY---LSDGTMSTGFLVEDVLHLA-TDE 214
++ VPC LC + P+Q Y +D S G L+ DV L T+
Sbjct: 125 ---SNDLVPCRHALCASLHLSDNYDCEVPHQCDYEVQYADHYSSLGVLLHDVYTLNFTNG 181
Query: 215 KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG 274
Q K R++ GCG Q +G+ GLG KTS+ S L +QGL+ N C
Sbjct: 182 VQLKV---RMALGCGYDQIFPDPSHHPLDGMLGLGRGKTSLTSQLNSQGLVRNVIGHCLS 238
Query: 275 SDGTGRISFGD-KGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-EFSAIFDSGTS 332
+ G G I FGD S TP S R ++ GG A+FD+G+S
Sbjct: 239 AQGGGYIFFGDVYDSFRLTWTPMSSRDYKHYSVAGAAELLFGGKKSGVGNLHAVFDTGSS 298
Query: 333 FTYLNDPAYTQI-----SETFNSLAKEKRETSTSDL------PFEYCYVLRSFL 375
+TY N AY + E+ KE + T L PF Y +R +
Sbjct: 299 YTYFNSYAYQVLISWLKKESGGKPLKEAHDDQTLPLCWRGRRPFRSIYEVRKYF 352
>gi|326533540|dbj|BAK05301.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 410
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 72/251 (28%), Positives = 121/251 (48%), Gaps = 25/251 (9%)
Query: 106 YTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
Y +++G+PA + + +DTGS+L WL +C VHG + Y+P + + K
Sbjct: 39 YATLNIGEPAKPYFLDVDTGSNLTWL--ECHHPVHGCKGCHPRP-PHPYYTP--ADGNLK 93
Query: 166 VPCNSTLCELQKQ----CPSAGSN----CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
V C S LC ++ P N C Y+++Y++ S G L D++ + +K+
Sbjct: 94 VVCGSPLCVAVRRDVPGIPECSRNDPHRCHYEIQYVTG--KSEGDLATDIISVNGRDKK- 150
Query: 218 KSVDSRISFGCGRVQTGSFLDGAAP-NGLFGLGMDKTSVPSILANQGLIP-NSFSMCFGS 275
RI+FGCG Q +P +G+ GLGM K + + L +I N C S
Sbjct: 151 -----RIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGLAAQLKGHKMIKENVIGHCLSS 205
Query: 276 DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAV--NFEFSAIFDSGTSF 333
G G + GD P +G T +R++ Y+ + +V + + N F A+FDSG+++
Sbjct: 206 KGKGVLYVGDFNPPTRGVTWAPMRESLFYYSPGLAEVFIDKQPIRGNPTFEAVFDSGSTY 265
Query: 334 TYLNDPAYTQI 344
T++ Y +I
Sbjct: 266 THVPAQIYNEI 276
>gi|15232503|ref|NP_191008.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|7288018|emb|CAB81805.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|17979257|gb|AAL49945.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
gi|21700851|gb|AAM70549.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
gi|332645705|gb|AEE79226.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 425
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 82/287 (28%), Positives = 125/287 (43%), Gaps = 51/287 (17%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ ++G PA +VALDT +D W+PC CV C + ++ P+ SS+S
Sbjct: 88 YIVRANIGTPAQPMLVALDTSNDAAWIPCSGCVGCSSSV-----------LFDPSKSSSS 136
Query: 164 SKVPCNSTLCELQKQCP----SAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
+ C + C KQ P + +C + + Y G+ +L +D L LA+D
Sbjct: 137 RTLQCEAPQC---KQAPNPSCTVSKSCGFNMTY--GGSTIEAYLTQDTLTLASD------ 185
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GS 275
V +FGC +G+ L GL GLG S+ I +Q L ++FS C S
Sbjct: 186 VIPNYTFGCINKASGTSLPA---QGLMGLGRGPLSL--ISQSQNLYQSTFSYCLPNSKSS 240
Query: 276 DGTGRISFGDKGSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEFSA-------- 325
+ +G + G K P + +T L+ + Y + + + VG V+ SA
Sbjct: 241 NFSGSLRLGPKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATG 300
Query: 326 ---IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
IFDSGT +T L +PAY + F K TS F+ CY
Sbjct: 301 AGTIFDSGTVYTRLVEPAYVAVRNEFRRRVKNANATSLGG--FDTCY 345
>gi|6579210|gb|AAF18253.1|AC011438_15 T23G18.7 [Arabidopsis thaliana]
Length = 566
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 82/254 (32%), Positives = 115/254 (45%), Gaps = 40/254 (15%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
L+YT V +G P F V +DTGSD+ W+ C SC +G +S I + + P SS++
Sbjct: 131 LYYTKVKLGTPPREFNVQIDTGSDVLWV--SCTSC-NGCPKTSELQIQLSFFDPGVSSSA 187
Query: 164 SKVPCNSTLCELQKQCPSAGSN---CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
S V C+ C Q S S C Y +Y DG+ ++G+ + D
Sbjct: 188 SLVSCSDRRCYSNFQTESGCSPNNLCSYSFKY-GDGSGTSGYYISD-------------- 232
Query: 221 DSRISFGCGRVQTGSFLD-GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD--G 277
F C +Q+G A +G+FGLG SV S LA QGL P FS C D G
Sbjct: 233 -----FMCSNLQSGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCLKGDKSG 287
Query: 278 TGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA---------IFD 328
G + G P TP L + P YN+ + ++V G + + S I D
Sbjct: 288 GGIMVLGQIKRPDTVYTP--LVPSQPHYNVNLQSIAVNGQILPIDPSVFTIATGDGTIID 345
Query: 329 SGTSFTYLNDPAYT 342
+GT+ YL D AY+
Sbjct: 346 TGTTLAYLPDEAYS 359
>gi|226495667|ref|NP_001146721.1| uncharacterized protein LOC100280323 [Zea mays]
gi|219888491|gb|ACL54620.1| unknown [Zea mays]
Length = 557
Score = 91.7 bits (226), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 76/256 (29%), Positives = 111/256 (43%), Gaps = 32/256 (12%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
+YT++ +G P + + +DTGSDL W+ CD C + G + +Y P +
Sbjct: 187 YYTSIFIGNPPRPYFLDVDTGSDLTWIQCDAPCTNFAKGPHP---------LYKP---AK 234
Query: 163 SSKVPCNSTLCEL----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
VP LC+ Q C + C Y++ Y +D + S G L D +H+ +
Sbjct: 235 EKIVPPRDLLCQELQGNQNYCETC-KQCDYEIEY-ADQSSSMGVLARDDMHMIATNGGRE 292
Query: 219 SVDSRISFGCGRVQTGSFLDGAAP-NGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD- 276
+D FGC Q G L A +G+ GL S PS LA+ G+I N F C +
Sbjct: 293 KLD--FVFGCAYDQQGQLLSSPAKTDGILGLSSAAISFPSQLASHGIIANVFGHCITREQ 350
Query: 277 -GTGRISFGDKGSPGQGETPFSLRQ-THPTYNITITQVSVGGNAVNFEFSA------IFD 328
G G + GD P G T S+R Y+ V G + A IFD
Sbjct: 351 GGGGYMFLGDDYVPRWGVTWTSIRSGPDNLYHTQAHHVKYGDQQLRRPEQAGSTVQVIFD 410
Query: 329 SGTSFTYLNDPAYTQI 344
SG+S+TYL + Y +
Sbjct: 411 SGSSYTYLPNEIYENL 426
>gi|357507805|ref|XP_003624191.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355499206|gb|AES80409.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 406
Score = 91.7 bits (226), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 74/235 (31%), Positives = 101/235 (42%), Gaps = 25/235 (10%)
Query: 128 LFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAG---- 183
+F L C +C SG +D +Y PN S TS+ VPC C P +G
Sbjct: 26 VFLLQLGCTAC----PKKSGLGMDLTLYDPNGSKTSNAVPCGDGFCTDTYSGPISGCKQD 81
Query: 184 SNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQSKSVDSRISFGCGRVQTGSFLDGA- 240
+CPY + Y DG+ ++G V D L + +K +S + FGCG Q+GS +
Sbjct: 82 MSCPYSITY-GDGSTTSGSFVNDSLTFDEVSGNLHTKPDNSSVIFGCGAKQSGSLSSNSD 140
Query: 241 -APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGTGRISFGDKGSPGQGETPFSL 298
A +G+ G G +SV S LA G + FS C S G G S G P TP
Sbjct: 141 EALDGIIGFGQANSSVLSQLAASGKVKRIFSHCLDSHHGGGIFSIGQVMEPKFNTTPLVP 200
Query: 299 RQTHPTYNITITQVSVGGNAVNFEF---------SAIFDSGTSFTYLNDPAYTQI 344
R H YN+ + + V G + I DSGT+ YL Y Q+
Sbjct: 201 RMAH--YNVILKDMDVDGEPILLPLYLFDSGSGRGTIIDSGTTLAYLPLSIYNQL 253
>gi|21617933|gb|AAM66983.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
Length = 425
Score = 91.7 bits (226), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 82/287 (28%), Positives = 125/287 (43%), Gaps = 51/287 (17%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ ++G PA +VALDT +D W+PC CV C + ++ P+ SS+S
Sbjct: 88 YIVRANIGTPAQPMLVALDTSNDAAWIPCSGCVGCSSSV-----------LFDPSKSSSS 136
Query: 164 SKVPCNSTLCELQKQCP----SAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
+ C + C KQ P + +C + + Y G+ +L +D L LA+D
Sbjct: 137 RTLQCEAPQC---KQAPNPSCTVSKSCGFNMTY--GGSTIEAYLTQDTLTLASD------ 185
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GS 275
V +FGC +G+ L GL GLG S+ I +Q L ++FS C S
Sbjct: 186 VIPNYTFGCINKASGTSLPA---QGLMGLGRGPLSL--ISQSQNLYQSTFSYCLPNSKSS 240
Query: 276 DGTGRISFGDKGSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEFSA-------- 325
+ +G + G K P + +T L+ + Y + + + VG V+ SA
Sbjct: 241 NFSGSLRLGPKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATG 300
Query: 326 ---IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
IFDSGT +T L +PAY + F K TS F+ CY
Sbjct: 301 AGTIFDSGTVYTRLVEPAYVAVRNEFRRRVKNANATSLGG--FDTCY 345
>gi|21805926|gb|AAM76716.1| nucellin-like aspartic protease [Zea mays]
Length = 357
Score = 91.7 bits (226), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 82/279 (29%), Positives = 120/279 (43%), Gaps = 50/279 (17%)
Query: 111 VGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPC 168
+G PA + + +DTGSDL WL CD C SC + +Y P + VPC
Sbjct: 1 IGNPAKPYFLDVDTGSDLTWLQCDAPCRSC---------NKVPHPLYRPTANRL---VPC 48
Query: 169 NSTLCEL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
+ LC +CPS C YQ++Y +D S G L+ D L +S ++
Sbjct: 49 ANALCTALHSGQGSNNKCPSP-KQCDYQIKY-TDSASSQGVLINDSFSLPM---RSSNIR 103
Query: 222 SRISFGCGRVQ----TGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDG 277
++FGCG Q G+ AA +G+ GLG S+ S L QG+ N C ++G
Sbjct: 104 PGLTFGCGYDQQVGKNGAVQ--AAIDGMLGLGRGSVSLVSQLKQQGITKNVVGHCLSTNG 161
Query: 278 TGRISFGDKGSPGQGET--PFSLRQTHPTYNITITQVSVGGNAVNFE--------FSAIF 327
G + FGD P T P + R + Y S G + F+ +F
Sbjct: 162 GGFLFFGDDVVPSSRVTWVPMAQRTSGNYY-------SPGSGTLYFDRRSLGVKPMEVVF 214
Query: 328 DSGTSFTYLN-DPAYTQISETFNSLAKEKRETSTSDLPF 365
DSG+++TY P +S L+K ++ S LP
Sbjct: 215 DSGSTYTYFTAQPYQAVVSALKGGLSKSLKQVSDPTLPL 253
>gi|357520119|ref|XP_003630348.1| Aspartic proteinase Asp1 [Medicago truncatula]
gi|355524370|gb|AET04824.1| Aspartic proteinase Asp1 [Medicago truncatula]
Length = 435
Score = 91.7 bits (226), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 80/275 (29%), Positives = 124/275 (45%), Gaps = 20/275 (7%)
Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPN 158
+GF + T +++GQP + + +DTGS+L WL CD C C + +Y P+
Sbjct: 71 VGFYNVT-LNIGQPPRPYFLDVDTGSELTWLQCDAPCSQCSETPHP---------LYKPS 120
Query: 159 TSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLA-TDEKQS 217
K P ++L + C Y+++Y +D + G L+ DV L T+ Q
Sbjct: 121 NDFIPCKDPLCASLQPTDDYTCEDPNQCDYEIKY-ADQYSTLGVLLNDVYLLNFTNGVQL 179
Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDG 277
K R++ GCG Q S +G+ GLG K S+ S L +QGL+ N C S G
Sbjct: 180 KV---RMALGCGYDQIFSPSTYHPLDGILGLGRGKASLISQLNSQGLVRNVMGHCLSSRG 236
Query: 278 TGRISFGD-KGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-EFSAIFDSGTSFTY 335
G I FG+ S TP S + Y+ ++ GG + IFD+G+S+TY
Sbjct: 237 GGYIFFGNVYDSSRMSWTPISSIDSGKHYSAGPAELVFGGRKTGVGSLNIIFDTGSSYTY 296
Query: 336 LNDPAYTQISETFN-SLAKEKRETSTSDLPFEYCY 369
N AY + N L ++ + + D C+
Sbjct: 297 FNSQAYQAMISLLNKELHRKPIKAAPDDQTLPMCW 331
>gi|226504334|ref|NP_001141706.1| uncharacterized protein LOC100273835 precursor [Zea mays]
gi|194705620|gb|ACF86894.1| unknown [Zea mays]
gi|414885968|tpg|DAA61982.1| TPA: hypothetical protein ZEAMMB73_231717 [Zea mays]
Length = 477
Score = 91.3 bits (225), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 84/284 (29%), Positives = 128/284 (45%), Gaps = 38/284 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++T++ +G PA +V LDTGSD W+ C C C + ++ P+ SST
Sbjct: 134 YFTSLRLGTPATDLLVELDTGSDQSWIQCKPCPDCYEQHEA---------LFDPSKSSTY 184
Query: 164 SKVPCNSTLCEL----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
S + C+S C+ K S+ CPY++ Y +D + + G L D L L+ +
Sbjct: 185 SDITCSSRECQELGSSHKHNCSSDKKCPYEITY-ADDSYTVGNLARDTLTLSPTDAVPGF 243
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS--DG 277
V FGCG GSF +GL GLG K S+ S +A + FS C S
Sbjct: 244 V-----FGCGHNNAGSF---GEIDGLLGLGRGKASLSSQVAAR--YGAGFSYCLPSSPSA 293
Query: 278 TGRISFG--DKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA-------IF 327
TG +SF +P + + HP+ Y + +T ++V G A+ S I
Sbjct: 294 TGYLSFSGAAAAAPTNAQFTEMVAGQHPSFYYLNLTGITVAGRAIKVPPSVFATAAGTII 353
Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVL 371
DSGT+F+ L AY + + S + +S + F+ CY L
Sbjct: 354 DSGTAFSCLPPSAYAALRSSVRSAMGRYKRAPSSTI-FDTCYDL 396
>gi|86438622|emb|CAJ26370.1| chloroplast nucleoid binding protein [Brachypodium sylvaticum]
Length = 443
Score = 91.3 bits (225), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 112/377 (29%), Positives = 155/377 (41%), Gaps = 56/377 (14%)
Query: 36 HRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDT 95
HR+ P + DD P + A D R+ A G D ++ A
Sbjct: 24 HRHG-PCSPLQTPDDAPSDADLLEHDQ-ARVDSIHRMIANETAVVGQD---VSLPA---- 74
Query: 96 YRLNSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWL---PCDCVSCVHGLNSSSGQVID 151
R S+G +Y +V +G PA V DTGSDL W+ PC C H +
Sbjct: 75 ERGISVGTGNYVVSVGLGTPARDLTVVFDTGSDLSWVQCGPCSSGGCYHQQDP------- 127
Query: 152 FNIYSPNTSSTSSKVPCNSTLCELQKQ-CPSAGSN--CPYQVRYLSDGTMSTGFLVEDVL 208
+++P++SST S V C C +Q C S+ + CPY+V Y D + + G L D L
Sbjct: 128 --LFAPSSSSTFSAVRCGEPECPRARQSCSSSPGDDRCPYEVVY-GDKSRTVGHLGNDTL 184
Query: 209 HLATDEKQSKSVDSR-----ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQG 263
L T + S ++ FGCG TG F +GLFGLG K S+ S A G
Sbjct: 185 TLGTTPSTNASENNSNKLPGFVFGCGENNTGLF---GKADGLFGLGRGKVSLSSQAA--G 239
Query: 264 LIPNSFSMCF---GSDGTGRISFGDKG-SPGQGE-TPFSLRQTHPT-YNITITQVSVGGN 317
FS C S+ G +S G +P TP R P+ Y + + + V G
Sbjct: 240 KYGEGFSYCLPSSSSNAHGYLSLGTPAPAPAHARFTPMLNRSNTPSFYYVKLVGIRVAGR 299
Query: 318 AVN-------FEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKE---KRETSTSDLPFEY 367
A+ + I DSGT T L AY+ + F S + KR S L +
Sbjct: 300 AIKVSSRPALWPAGLIVDSGTVITRLAPRAYSALRTAFLSAMGKYGYKRAPRLSIL--DT 357
Query: 368 CYVLRSFLHLQALVVLP 384
CY + H A V +P
Sbjct: 358 CYDFTA--HANATVSIP 372
>gi|2290202|gb|AAB96882.1| nucellin [Hordeum vulgare subsp. vulgare]
gi|2290204|gb|AAB96883.1| nucellin [Hordeum vulgare subsp. vulgare]
gi|45357050|gb|AAS58479.1| nucellin [Hordeum vulgare subsp. vulgare]
Length = 410
Score = 90.9 bits (224), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 72/251 (28%), Positives = 120/251 (47%), Gaps = 25/251 (9%)
Query: 106 YTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
Y +++G+PA + + +DTGS+L WL +C VHG + Y+P + + K
Sbjct: 39 YATLNIGEPAKPYFLDVDTGSNLTWL--ECHHPVHGCKGCHPRP-PHPYYTP--ADGNLK 93
Query: 166 VPCNSTLCELQKQ----CPSAGSN----CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
V C S LC ++ P N C Y+++Y++ S G L D++ + +K+
Sbjct: 94 VVCGSPLCVAVRRDVPGIPECSRNDPHRCHYEIQYVTG--KSEGDLATDIISVNGRDKK- 150
Query: 218 KSVDSRISFGCGRVQTGSFLDGAAP-NGLFGLGMDKTSVPSILANQGLIP-NSFSMCFGS 275
RI+FGCG Q +P +G+ GLGM K + L +I N C S
Sbjct: 151 -----RIAFGCGYKQEEPADSPPSPVDGILGLGMGKAGFAAQLKGHKMIKENVIGHCLSS 205
Query: 276 DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAV--NFEFSAIFDSGTSF 333
G G + GD P +G T +R++ Y+ + +V + + N F A+FDSG+++
Sbjct: 206 KGKGVLYVGDFNPPTRGVTWAPMRESLFYYSPGLAEVFIDKQPIRGNPTFEAVFDSGSTY 265
Query: 334 TYLNDPAYTQI 344
T++ Y +I
Sbjct: 266 THVPAQIYNEI 276
>gi|242073260|ref|XP_002446566.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
gi|241937749|gb|EES10894.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
Length = 452
Score = 90.9 bits (224), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 95/313 (30%), Positives = 134/313 (42%), Gaps = 61/313 (19%)
Query: 65 HRDRYFRLRGRGL-AAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALD 123
HR R G+ A G + AGN + ++ V++G PALS+ +D
Sbjct: 68 HRMSRLVARATGVKAVAGGGDLQVPVHAGNGEFLMD---------VAIGTPALSYAAIVD 118
Query: 124 TGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC-ELQKQCPS 181
TGSDL W C CV C ++ P++SST + VPC+S LC +L +
Sbjct: 119 TGSDLVWTQCKPCVDCFKQSTP---------VFDPSSSSTYATVPCSSALCSDLPTSTCT 169
Query: 182 AGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGS-FLDGA 240
+ S C Y Y D + + G L + L ++K+ V +FGCG G F GA
Sbjct: 170 SASKCGYTYTY-GDASSTQGVLASETFTLGKEKKKLPGV----AFGCGDTNEGDGFTQGA 224
Query: 241 APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS--DGTGRISFGDKG----------- 287
GL GLG S+ S L GL + FS C S DG G+ G
Sbjct: 225 ---GLVGLGRGPLSLVSQL---GL--DKFSYCLTSLDDGDGKSPLLLGGSAAAISESAAT 276
Query: 288 SPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA-----------IFDSGTSFTY 335
+P Q TP + P+ Y +++T ++VG + SA I DSGTS TY
Sbjct: 277 APVQ-TTPLVKNPSQPSFYYVSLTGLTVGSTRITLPASAFAIQDDGTGGVIVDSGTSITY 335
Query: 336 LNDPAYTQISETF 348
L Y + + F
Sbjct: 336 LELQGYRALKKAF 348
>gi|357157325|ref|XP_003577760.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 413
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 82/282 (29%), Positives = 122/282 (43%), Gaps = 43/282 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
+Y +++G PA + + +DTGSDL WL CD C SC + +Y P +
Sbjct: 52 YYVTMNIGDPAKPYFLDIDTGSDLTWLQCDAPCQSC---------NKVPHPLYKP---TK 99
Query: 163 SSKVPCNSTLCEL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEK 215
+ VPC +++C K+C + C YQ++Y +D S G LV D L +
Sbjct: 100 NKLVPCAASICTTLHSAQSPNKKC-AVPQQCDYQIKY-TDSASSLGVLVTDNFTLPL--R 155
Query: 216 QSKSVDSRISFGCGRVQT--GSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
S SV +FGCG Q + + A +GL GLG S+ S L G+ N C
Sbjct: 156 NSSSVRPSFTFGCGYDQQVGKNGVVQATTDGLLGLGKGSVSLVSQLKVLGITKNVLGHCL 215
Query: 274 GSDGTGRISFGDKGSPGQGETPFSL-RQTHPTYNITITQVSVGGNAVNFE--------FS 324
++G G + FGD P T + R T Y S G + F+
Sbjct: 216 STNGGGFLFFGDNVVPTSRATWVPMVRSTSGNY------YSPGSGTLYFDRRSLGVKPME 269
Query: 325 AIFDSGTSFTYL-NDPAYTQISETFNSLAKEKRETSTSDLPF 365
+FDSG+++TY P +S L+K ++ S LP
Sbjct: 270 VVFDSGSTYTYFAAQPYQATVSALKAGLSKSLQQVSDPSLPL 311
>gi|414885970|tpg|DAA61984.1| TPA: hypothetical protein ZEAMMB73_915310 [Zea mays]
Length = 523
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 89/301 (29%), Positives = 130/301 (43%), Gaps = 34/301 (11%)
Query: 96 YRLNSLGFLHY-TNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFN 153
+R LG +Y +V +G P +V DTGSDL W+ C C +C +
Sbjct: 178 HRGLRLGTANYIVSVGLGTPRRDLLVVFDTGSDLSWVQCKPCNNCYKQHDP--------- 228
Query: 154 IYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
++ P+ S+T S VPC + C C S C Y+V Y D + + G L D L L
Sbjct: 229 LFDPSQSTTYSAVPCGAQECLDSGTCSSG--KCRYEVVY-GDMSQTDGNLARDTLTLGPS 285
Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
Q + FGCG TG F +GLFGLG D+ S+ S A + FS C
Sbjct: 286 SDQLQG----FVFGCGDDDTGLF---GRADGLFGLGRDRVSLASQAAAR--YGAGFSYCL 336
Query: 274 GSD--GTGRISFGDKGSPGQGE-TPFSLRQTHPT-YNITITQVSVGGNAVNFE---FSA- 325
S G +S G +P + T R P+ Y + + + V G V F A
Sbjct: 337 PSSWRAEGYLSLGSAAAPPHAQFTAMVTRSDTPSFYYLDLVGIKVAGRTVRVAPAVFKAP 396
Query: 326 --IFDSGTSFTYLNDPAYTQISETFNSLAKE-KRETSTSDLPFEYCYVLRSFLHLQALVV 382
+ DSGT T L AY+ + +F + KR + S L Y + R+ + + ++ +
Sbjct: 397 GTVIDSGTVITRLPSRAYSALRSSFAGFMRRYKRAPALSILDTCYDFTGRTKVQIPSVAL 456
Query: 383 L 383
L
Sbjct: 457 L 457
>gi|15229663|ref|NP_190574.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|6522926|emb|CAB62113.1| putative protein [Arabidopsis thaliana]
gi|53828539|gb|AAU94379.1| At3g50050 [Arabidopsis thaliana]
gi|55733749|gb|AAV59271.1| At3g50050 [Arabidopsis thaliana]
gi|332645100|gb|AEE78621.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 632
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 74/242 (30%), Positives = 109/242 (45%), Gaps = 29/242 (11%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
T + +G P F + +D+GS + ++PC DC C G+ D + P SST
Sbjct: 95 TRLWIGTPPQMFALIVDSGSTVTYVPCSDCEQC--------GKHQDPK-FQPEMSSTYQP 145
Query: 166 VPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
V CN + C C Y+ Y ++ + S G L ED++ +S+ R
Sbjct: 146 VKCN-----MDCNCDDDREQCVYEREY-AEHSSSKGVLGEDLISFGN---ESQLTPQRAV 196
Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISF 283
FGC V+TG A +G+ GLG S+ L ++GLI NSF +C+G G G +
Sbjct: 197 FGCETVETGDLYSQRA-DGIIGLGQGDLSLVDQLVDKGLISNSFGLCYGGMDVGGGSMIL 255
Query: 284 GDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-------EFSAIFDSGTSFTYL 336
G P S P YNI +T + V G ++ E A+ DSGT++ YL
Sbjct: 256 GGFDYPSDMVFTDSDPDRSPYYNIDLTGIRVAGKQLSLHSRVFDGEHGAVLDSGTTYAYL 315
Query: 337 ND 338
D
Sbjct: 316 PD 317
>gi|413916291|gb|AFW56223.1| hypothetical protein ZEAMMB73_420944 [Zea mays]
Length = 383
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 80/278 (28%), Positives = 118/278 (42%), Gaps = 33/278 (11%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
L+Y +++G P + + +D+GSDL WL CD C SC + +Y P S
Sbjct: 65 LYYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPCRSC---------NEVPHPLYRPTKSK 115
Query: 162 TSSKVPCNSTLCEL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVED--VLHLAT 212
VPC LC + +C S C Y ++Y G+ STG L+ D L L
Sbjct: 116 L---VPCVHRLCASLHNGLTGKHRCDSPHEQCDYVIKYADQGS-STGVLINDSFALRLTN 171
Query: 213 DEKQSKSVDSRISFGCGRVQTGSFLDGAAP-NGLFGLGMDKTSVPSILANQGLIPNSFSM 271
SV +FGCG Q D ++P +G+ GLG S+ S L +G+ N
Sbjct: 172 GSVARPSV----AFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLKQRGVTKNVVGH 227
Query: 272 CFGSDGTGRISFGDKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFS-AIFD 328
C G G + FGD P Q TP + Y+ + G ++ + +FD
Sbjct: 228 CLSLRGGGFLFFGDDLVPYQRATWTPMARSAFRNYYSPGSASLYFGDRSLGVRLAKVVFD 287
Query: 329 SGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPF 365
SG+SFTY Y + + L++ E + LP
Sbjct: 288 SGSSFTYFAAKPYQALVTALKDGLSRTLEEEPDTSLPL 325
>gi|226509616|ref|NP_001152116.1| aspartic proteinase Asp1 precursor [Zea mays]
gi|195652765|gb|ACG45850.1| aspartic proteinase Asp1 precursor [Zea mays]
Length = 432
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 81/279 (29%), Positives = 118/279 (42%), Gaps = 34/279 (12%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
L+Y +++G P + + +D+GSDL WL CD C SC + +Y P S
Sbjct: 63 LYYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPCRSC---------NEVPHPLYRPTKSK 113
Query: 162 TSSKVPCNSTLCEL--------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVED--VLHLA 211
VPC LC + +C S C Y ++Y G+ STG LV D L L
Sbjct: 114 L---VPCVHRLCASLHNALTGGKHRCESPHEQCDYVIKYADQGS-STGVLVNDSFALRLT 169
Query: 212 TDEKQSKSVDSRISFGCGRVQTGSFLDGAAP-NGLFGLGMDKTSVPSILANQGLIPNSFS 270
SV +FGCG Q D ++P +G+ GLG S+ S L +G+ N
Sbjct: 170 NGSVARPSV----AFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLKQRGVTKNVVG 225
Query: 271 MCFGSDGTGRISFGDKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFS-AIF 327
C G G + FGD P Q TP + Y+ + G ++ + +F
Sbjct: 226 HCLSLRGGGFLFFGDDLVPYQRATWTPMARSAFRNYYSPGSASLYFGDRSLGVRLAKVVF 285
Query: 328 DSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPF 365
DSG+SFTY Y + + L++ E + LP
Sbjct: 286 DSGSSFTYFAAKPYQALVTALKDGLSRTLEEEPDTSLPL 324
>gi|212721496|ref|NP_001131929.1| uncharacterized protein LOC100193320 precursor [Zea mays]
gi|194692946|gb|ACF80557.1| unknown [Zea mays]
Length = 424
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 80/278 (28%), Positives = 118/278 (42%), Gaps = 33/278 (11%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
L+Y +++G P + + +D+GSDL WL CD C SC + +Y P S
Sbjct: 56 LYYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPCRSC---------NEVPHPLYRPTKSK 106
Query: 162 TSSKVPCNSTLCEL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVED--VLHLAT 212
VPC LC + +C S C Y ++Y G+ STG L+ D L L
Sbjct: 107 L---VPCVHRLCASLHNGLTGKHRCDSPHEQCDYVIKYADQGS-STGVLINDSFALRLTN 162
Query: 213 DEKQSKSVDSRISFGCGRVQTGSFLDGAAP-NGLFGLGMDKTSVPSILANQGLIPNSFSM 271
SV +FGCG Q D ++P +G+ GLG S+ S L +G+ N
Sbjct: 163 GSVARPSV----AFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLKQRGVTKNVVGH 218
Query: 272 CFGSDGTGRISFGDKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFS-AIFD 328
C G G + FGD P Q TP + Y+ + G ++ + +FD
Sbjct: 219 CLSLRGGGFLFFGDDLVPYQRATWTPMARSAFRNYYSPGSASLYFGDRSLGVRLAKVVFD 278
Query: 329 SGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPF 365
SG+SFTY Y + + L++ E + LP
Sbjct: 279 SGSSFTYFAAKPYQALVTALKDGLSRTLEEEPDTSLPL 316
>gi|326520736|dbj|BAJ92731.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 499
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 82/257 (31%), Positives = 110/257 (42%), Gaps = 30/257 (11%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+ V +G PA + V DTGSD W+ C CV G + D P SST +
Sbjct: 163 YVVTVGLGTPASKYTVVFDTGSDTTWV--QCRPCVVKCYKQKGPLFD-----PAKSSTYA 215
Query: 165 KVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRI 224
V C + C G +C Y V+Y DG+ + GF +D L +A D +
Sbjct: 216 NVSCTDSACADLDTNGCTGGHCLYAVQY-GDGSYTVGFFAQDTLTIAHDAIKG------F 268
Query: 225 SFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS--DGTGRIS 282
FGCG G F A GL GLG KTS+ N+ +F+ C + GTG +
Sbjct: 269 RFGCGEKNNGLFGKTA---GLMGLGRGKTSLTVQAYNK--YGGAFAYCLPALTTGTGYLD 323
Query: 283 FGDKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFSA------IFDSGTSFT 334
FG GS G TP + Y + +T + VGG V S + DSGT T
Sbjct: 324 FG-PGSAGNNARLTPMLTDKGQTFYYVGMTGIRVGGQQVPVAESVFSTAGTLVDSGTVIT 382
Query: 335 YLNDPAYTQISETFNSL 351
L AYT +S F+ +
Sbjct: 383 RLPATAYTALSSAFDKV 399
>gi|297819684|ref|XP_002877725.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297323563|gb|EFH53984.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 633
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 74/242 (30%), Positives = 109/242 (45%), Gaps = 29/242 (11%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
T + +G P F + +D+GS + ++PC DC C G+ D + P SST
Sbjct: 96 TRLWIGTPPQMFALIVDSGSTVTYVPCSDCEQC--------GKHQDPK-FQPELSSTYQP 146
Query: 166 VPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
V CN + C C Y+ Y ++ + S G L ED++ +S+ R
Sbjct: 147 VKCN-----MDCNCDDDKEQCVYEREY-AEHSSSKGVLGEDLISFGN---ESQLTPQRAV 197
Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISF 283
FGC V+TG A +G+ GLG S+ L ++GLI NSF +C+G G G +
Sbjct: 198 FGCETVETGDLYSQRA-DGIIGLGQGDLSLVDQLVDKGLISNSFGLCYGGMDVGGGSMIL 256
Query: 284 GDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-------EFSAIFDSGTSFTYL 336
G P S P YNI +T + V G ++ E A+ DSGT++ YL
Sbjct: 257 GGFDYPSDMIFTDSDPDRSPYYNIDLTGIRVAGKKLSLNSRVFDGEHGAVLDSGTTYAYL 316
Query: 337 ND 338
D
Sbjct: 317 PD 318
>gi|242045120|ref|XP_002460431.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
gi|241923808|gb|EER96952.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
Length = 481
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 87/298 (29%), Positives = 124/298 (41%), Gaps = 30/298 (10%)
Query: 101 LGFLHY-TNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPN 158
LG +Y +V +G P +V DTGSDL W+ C C C + ++ P+
Sbjct: 133 LGTANYIVSVGLGTPKRDLLVVFDTGSDLSWVQCKPCDGCYQQHDP---------LFDPS 183
Query: 159 TSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
S+T S VPC + C + C Y+V Y D + + G L D L L S
Sbjct: 184 QSTTYSAVPCGAQECRRLDSGSCSSGKCRYEVVY-GDMSQTDGNLARDTLTLGPSSSSSS 242
Query: 219 SVD-SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDG 277
S FGCG TG F +GLFGLG D+ S+ S A + FS C S
Sbjct: 243 SDQLQEFVFGCGDDDTGLF---GKADGLFGLGRDRVSLASQAAAK--YGAGFSYCLPSSS 297
Query: 278 T--GRISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA------IFD 328
T G +S G P T R P+ Y + + + V G V + + D
Sbjct: 298 TAEGYLSLGSAAPPNARFTAMVTRSDTPSFYYLNLVGIKVAGRTVRVSPAVFRTPGTVID 357
Query: 329 SGTSFTYLNDPAYTQISETFNSLAKE---KRETSTSDLPFEYCYVLRSFLHLQALVVL 383
SGT T L AY + +F L + KR + S L Y + R+ + + ++ +L
Sbjct: 358 SGTVITRLPSRAYAALRSSFAGLMRRYSYKRAPALSILDTCYDFTGRNKVQIPSVALL 415
>gi|326531454|dbj|BAJ97731.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 92/282 (32%), Positives = 124/282 (43%), Gaps = 35/282 (12%)
Query: 103 FLHYTNVSVGQPALSFIVALDTGSDLFWL---PCDCVSCVHGLNSSSGQVIDFNIYSPNT 159
F + VS+G P +S V +DTGSD+ W+ PC +C NS Q+ D P
Sbjct: 141 FQYVVTVSLGTPGVSQTVEVDTGSDVSWVQCKPCSAPAC----NSQRDQLFD-----PAK 191
Query: 160 SSTSSKVPCNSTLC-ELQ-KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
SST S VPC + C EL+ + +GS C Y V Y DG+ +TG D L LA
Sbjct: 192 SSTYSAVPCGADACSELRIYEAGCSGSQCGYVVSY-GDGSNTTGVYGSDTLALAPGNTVG 250
Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD- 276
+ FGCG Q G F A +GL LG S+ S A G FS C S
Sbjct: 251 TFL-----FGCGHAQAGMF---AGIDGLLALGRQSMSLKSQAA--GAYGGVFSYCLPSKQ 300
Query: 277 -GTGRISFGDKGSP-GQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA-----IFD 328
G ++ G S G T PT Y + +T +SVGG V SA + D
Sbjct: 301 SAAGYLTLGGPTSASGFATTGLLTAWAAPTFYMVMLTGISVGGQQVAVPASAFAGGTVVD 360
Query: 329 SGTSFTYLNDPAYTQISETFN-SLAKEKRETSTSDLPFEYCY 369
+GT T L AY + F ++A ++ ++ + CY
Sbjct: 361 TGTVITRLPPTAYAALRSAFRGAIAPYGYPSAPANGILDTCY 402
>gi|238006986|gb|ACR34528.1| unknown [Zea mays]
gi|413916290|gb|AFW56222.1| aspartic proteinase Asp1 [Zea mays]
Length = 433
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 80/278 (28%), Positives = 118/278 (42%), Gaps = 33/278 (11%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
L+Y +++G P + + +D+GSDL WL CD C SC + +Y P S
Sbjct: 65 LYYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPCRSC---------NEVPHPLYRPTKSK 115
Query: 162 TSSKVPCNSTLCEL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVED--VLHLAT 212
VPC LC + +C S C Y ++Y G+ STG L+ D L L
Sbjct: 116 L---VPCVHRLCASLHNGLTGKHRCDSPHEQCDYVIKYADQGS-STGVLINDSFALRLTN 171
Query: 213 DEKQSKSVDSRISFGCGRVQTGSFLDGAAP-NGLFGLGMDKTSVPSILANQGLIPNSFSM 271
SV +FGCG Q D ++P +G+ GLG S+ S L +G+ N
Sbjct: 172 GSVARPSV----AFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLKQRGVTKNVVGH 227
Query: 272 CFGSDGTGRISFGDKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFS-AIFD 328
C G G + FGD P Q TP + Y+ + G ++ + +FD
Sbjct: 228 CLSLRGGGFLFFGDDLVPYQRATWTPMARSAFRNYYSPGSASLYFGDRSLGVRLAKVVFD 287
Query: 329 SGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPF 365
SG+SFTY Y + + L++ E + LP
Sbjct: 288 SGSSFTYFAAKPYQALVTALKDGLSRTLEEEPDTSLPL 325
>gi|326495920|dbj|BAJ90582.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 92/282 (32%), Positives = 124/282 (43%), Gaps = 35/282 (12%)
Query: 103 FLHYTNVSVGQPALSFIVALDTGSDLFWL---PCDCVSCVHGLNSSSGQVIDFNIYSPNT 159
F + VS+G P +S V +DTGSD+ W+ PC +C NS Q+ D P
Sbjct: 141 FQYVVTVSLGTPGVSQTVEVDTGSDVSWVQCKPCSAPAC----NSQRDQLFD-----PAK 191
Query: 160 SSTSSKVPCNSTLC-ELQ-KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
SST S VPC + C EL+ + +GS C Y V Y DG+ +TG D L LA
Sbjct: 192 SSTYSAVPCGADACSELRIYEAGCSGSQCGYVVSY-GDGSNTTGVYGSDTLALAPGNTVG 250
Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD- 276
+ FGCG Q G F A +GL LG S+ S A G FS C S
Sbjct: 251 TFL-----FGCGHAQAGMF---AGIDGLLALGRQSMSLKSQAA--GAYGGVFSYCLPSKQ 300
Query: 277 -GTGRISFGDKGSP-GQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA-----IFD 328
G ++ G S G T PT Y + +T +SVGG V SA + D
Sbjct: 301 SAAGYLTLGGPSSASGFATTGLLTAWAAPTFYMVMLTGISVGGQQVAVPASAFAGGTVVD 360
Query: 329 SGTSFTYLNDPAYTQISETFN-SLAKEKRETSTSDLPFEYCY 369
+GT T L AY + F ++A ++ ++ + CY
Sbjct: 361 TGTVITRLPPTAYAALRSAFRGAIAPCGYPSAPANGILDTCY 402
>gi|357487631|ref|XP_003614103.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355515438|gb|AES97061.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 431
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 82/281 (29%), Positives = 118/281 (41%), Gaps = 33/281 (11%)
Query: 110 SVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPC 168
SVG P + +DTGSD+ WL C C C I++P+ SS+ +PC
Sbjct: 92 SVGTPPFNVYGVVDTGSDIVWLQCKPCEQCYKQTTP---------IFNPSKSSSYKNIPC 142
Query: 169 NSTLCELQKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFG 227
+S LC+ + N C Y + + SD + S G L + L L + S S + G
Sbjct: 143 SSNLCQSVRYTSCNKQNSCEYTINF-SDQSYSQGELSVETLTLDSTTGHSVSFPKTV-IG 200
Query: 228 CGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-----GSDGTGRIS 282
CG G F +G+ GLG+ S+ + L + I FS C S+ T +++
Sbjct: 201 CGHNNRGMF--QGETSGIVGLGIGPVSLTTQLKSS--IGGKFSYCLLPLLVDSNKTSKLN 256
Query: 283 FGDKG---SPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEF-------SAIFDSGTS 332
FGD G TPF + Y +T+ SVG + FE + I DSGT+
Sbjct: 257 FGDAAVVSGDGVVSTPFVKKDPQAFYYLTLEAFSVGNKRIEFEVLDDSEEGNIILDSGTT 316
Query: 333 FTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRS 373
T L YT + L K R + L CY + S
Sbjct: 317 LTLLPSHVYTNLESAVAQLVKLDRVDDPNQL-LNLCYSITS 356
>gi|302789618|ref|XP_002976577.1| hypothetical protein SELMODRAFT_416663 [Selaginella moellendorffii]
gi|300155615|gb|EFJ22246.1| hypothetical protein SELMODRAFT_416663 [Selaginella moellendorffii]
Length = 437
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 90/296 (30%), Positives = 134/296 (45%), Gaps = 36/296 (12%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L+YT + +G P V +DTGSD+ W+ C C SC+ S + +IY+ + SST
Sbjct: 82 LYYTEIGLGNPVQKLKVIVDTGSDILWVKCSPCRSCL----SKQDIIPPLSIYNLSASST 137
Query: 163 SSKVPCNSTLCELQK-QCPSAGSN--CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
SS C+ LC ++ C +G+N C Y Y D + S G V D +H + +
Sbjct: 138 SSVSSCSDPLCTGEEVVCSRSGNNSACAYVSSY-QDKSASVGAYVRDDMHYVLHGGNATT 196
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD--G 277
SRI FGC TGS+ +G+ G G+ +VP+ +A Q + FS C G + G
Sbjct: 197 --SRIFFGCATNITGSW----PVDGIMGFGLISKTVPNQIATQRNMSRVFSHCLGGEKHG 250
Query: 278 TGRISFGDKGSPGQGETPFS-LRQTHPTYNITITQVSVGGNAVNF---EFS--------- 324
G + FG+ +P E F+ L YN+ + +SV + EFS
Sbjct: 251 GGILEFGE--APNTTEMVFTPLLNVTTHYNVDLLSISVNSKVLPIDPKEFSYVRNSTNNT 308
Query: 325 -AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRSFLHLQA 379
I DSGT+F L A + + SL K L C+ L+S L ++
Sbjct: 309 GVIIDSGTTFVLLTTKANRMLFQEIKSLTTAKLGPKLEGLE---CFYLKSGLTMET 361
>gi|293334661|ref|NP_001168795.1| uncharacterized protein LOC100382594 precursor [Zea mays]
gi|223973065|gb|ACN30720.1| unknown [Zea mays]
Length = 631
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 72/265 (27%), Positives = 119/265 (44%), Gaps = 27/265 (10%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
T + +G P F + +D+GS + ++PC SC N + + P+ SS+ S V
Sbjct: 90 TRLYIGTPPQEFALIVDSGSTVTYVPCS--SCEQCGNHQDPR------FQPDLSSSYSPV 141
Query: 167 PCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISF 226
CN + C S C Y+ +Y ++ + S+G L ED++ ++S+ F
Sbjct: 142 KCN-----VDCTCDSDKKQCTYERQY-AEMSSSSGVLGEDIVSFG---RESELKPQHAIF 192
Query: 227 GCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISFG 284
GC +TG A +G+ GLG + S+ L +G+I +SFS+C+G G G + G
Sbjct: 193 GCENSETGDLFSQHA-DGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDIGGGAMVLG 251
Query: 285 DKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE-------FSAIFDSGTSFTYLN 337
+P S P YNI + ++ V G A+ E + DSGT++ YL
Sbjct: 252 GMLAPPDMIFSNSDPLRSPYYNIELKEIHVAGKALRVESRIFNSKHGTVLDSGTTYAYLP 311
Query: 338 DPAYTQISETFNSLAKEKRETSTSD 362
+ A+ E S ++ D
Sbjct: 312 EQAFVAFKEAVTSKVHSLKKIRGPD 336
>gi|255079464|ref|XP_002503312.1| predicted protein [Micromonas sp. RCC299]
gi|226518578|gb|ACO64570.1| predicted protein [Micromonas sp. RCC299]
Length = 649
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 77/261 (29%), Positives = 121/261 (46%), Gaps = 50/261 (19%)
Query: 105 HYTNVSVGQPA-LSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+Y N+++G P+ +F V +DTGS L ++PC +C + G D
Sbjct: 112 YYANIALGDPSPRTFQVIVDTGSTLTYVPC--ATCAKCGTHTGGTRFD------------ 157
Query: 164 SKVPCNSTLCELQKQCPSAG-------------SNCPYQVRYLSDGTMSTGFLVEDVLHL 210
P L +KQC +AG + C Y R ++G+ +G LV D +H
Sbjct: 158 ---PTGKWLTCQEKQCKAAGGPGICAGGRGAAANRCTYS-RTYAEGSGVSGDLVRDKMHF 213
Query: 211 ATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDK-TSVPSILANQGLIPNSF 269
D + + + FGC ++G+ D A +GL GLG ++ S+P+ LA+ +P F
Sbjct: 214 GGDIAPATNGTLDVVFGCTNAESGTIHDQEA-DGLIGLGNNQFASIPNQLADTHGLPRVF 272
Query: 270 SMCFGS-DGTGRISFGDKGSPGQGETP------FSLRQTHPTYNITIT-QVSVGGNAV-- 319
S+CFGS +G G +SFG P TP + + HP Y + T + +G AV
Sbjct: 273 SLCFGSFEGGGALSFGRL--PATPHTPPLVYTDMRVNEAHPAYYVVSTAAMKIGDVAVAT 330
Query: 320 ----NFEFSAIFDSGTSFTYL 336
+ + DSGT+FTY+
Sbjct: 331 PSDLAVGYGTVMDSGTTFTYV 351
>gi|302790323|ref|XP_002976929.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
gi|300155407|gb|EFJ22039.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
Length = 373
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 82/289 (28%), Positives = 128/289 (44%), Gaps = 42/289 (14%)
Query: 111 VGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNS 170
+G P ++ +DT S+L W+ SC N S +V FN P SS+ PC S
Sbjct: 5 IGTPPREVLLLVDTASELTWV--QGTSCT---NCSPTKVPPFN---PGLSSSFISEPCTS 56
Query: 171 TLC------ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRI 224
++C Q C + +C +QV YL DG+ + G + ++ L + + + ++ I
Sbjct: 57 SVCLGRSKLGFQSACNRSTGSCSFQVAYL-DGSEAYGVIAREIFSLQSWDGAASTLGDVI 115
Query: 225 SFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGL--IPNSFSMCFGS-----DG 277
FGC +D ++ G GL S P+ + ++ + + FS CF + +
Sbjct: 116 -FGCASKDLQRPVDFSS--GTLGLNRGSFSFPAQIGSRSKSGLSDRFSYCFPNRAEHLNS 172
Query: 278 TGRISFGDKGSPGQGETPFSLRQTHPT------YNITITQVSVGGNAVNFEFSAI----- 326
+G I FGD G P SL Q P Y + + +SVGG ++ SA
Sbjct: 173 SGVIIFGDSGIPAHHFQYLSLEQEPPIASIVDFYYVGLQGISVGGELLHIPRSAFKIDRL 232
Query: 327 ------FDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
FDSGT+ ++L +PA+T + E F TS SD E CY
Sbjct: 233 GNGGTYFDSGTTVSFLVEPAHTALVEAFGRRVLHLNRTSGSDFTKELCY 281
>gi|356565521|ref|XP_003550988.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 634
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 73/266 (27%), Positives = 119/266 (44%), Gaps = 29/266 (10%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
T + +G P F + +DTGS + ++PC C C G+ D + P +SST
Sbjct: 86 TRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQC--------GRHQDPK-FQPESSSTYQP 136
Query: 166 VPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
V C + C S C Y+ +Y ++ + S+G L ED++ QS+ R
Sbjct: 137 VKCT-----IDCNCDSDRMQCVYERQY-AEMSTSSGVLGEDLISFGN---QSELAPQRAV 187
Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISF 283
FGC V+TG A +G+ GLG S+ L ++ +I +SFS+C+G G G +
Sbjct: 188 FGCENVETGDLYSQHA-DGIMGLGRGDLSIMDQLVDKNVISDSFSLCYGGMDVGGGAMVL 246
Query: 284 GDKGSPGQGETPFSLRQTHPTYNITITQVSVGG-------NAVNFEFSAIFDSGTSFTYL 336
G P +S P YNI + ++ V G N + + + DSGT++ YL
Sbjct: 247 GGISPPSDMAFAYSDPVRSPYYNIDLKEIHVAGKRLPLNANVFDGKHGTVLDSGTTYAYL 306
Query: 337 NDPAYTQISETFNSLAKEKRETSTSD 362
+ A+ + + ++ S D
Sbjct: 307 PEAAFLAFKDAIVKELQSLKKISGPD 332
>gi|297825301|ref|XP_002880533.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
lyrata]
gi|297326372|gb|EFH56792.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
lyrata]
Length = 430
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 77/270 (28%), Positives = 116/270 (42%), Gaps = 33/270 (12%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L + N SVGQP + +DTGS L W+ C C C SS +I +++P SST
Sbjct: 67 LFFVNFSVGQPPVPQFTIMDTGSSLLWIQCHPCKHC------SSNHMIH-PVFNPALSST 119
Query: 163 SSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
+ C+ C + + C Y+ Y+S GT S G L ++ L T + V
Sbjct: 120 FVECSCDDRFCRYAPNGHCSSNKCVYEQVYIS-GTGSKGVLAKERLTFTTPNGNT-VVTQ 177
Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG-----SDG 277
I+FGCG + G L+ G+ GLG TS+ L ++ FS C G + G
Sbjct: 178 PIAFGCGH-ENGEQLESEF-TGILGLGAKPTSLAVQLGSK------FSYCIGDLANKNYG 229
Query: 278 TGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE----------FSAIF 327
++ G+ TP + Y + + +SVG +N E I
Sbjct: 230 YNQLVLGEDADILGDPTPIEFETENGIYYMNLEGISVGDKQLNIEPVVFKRRGSRTGVIL 289
Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRE 357
D+GT +T+L D AY ++ S+ K E
Sbjct: 290 DTGTLYTWLADIAYRELYNEIKSILDPKLE 319
>gi|9757837|dbj|BAB08274.1| unnamed protein product [Arabidopsis thaliana]
Length = 586
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 76/255 (29%), Positives = 117/255 (45%), Gaps = 39/255 (15%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
T + +G P F + +DTGS + ++PC C C G+ D + P S++
Sbjct: 78 TRLWIGTPPQEFALIVDTGSTVTYVPCSTCKQC--------GKHQDPK-FQPELSTSYQA 128
Query: 166 VPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
+ CN C C G C Y+ RY ++ + S+G L ED++ + + S R
Sbjct: 129 LKCNPD-C----NCDDEGKLCVYERRY-AEMSSSSGVLSEDLISFGNESQLSPQ---RAV 179
Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISF 283
FGC +TG A +G+ GLG K SV L ++G+I + FS+C+G G G +
Sbjct: 180 FGCENEETGDLFSQRA-DGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGMEVGGGAMVL 238
Query: 284 GDKGSPGQG-----ETPFSLRQTHPTYNITITQVSVGGNAV-------NFEFSAIFDSGT 331
G K SP G PF P YNI + Q+ V G ++ N + + DSGT
Sbjct: 239 G-KISPPPGMVFSHSDPFR----SPYYNIDLKQMHVAGKSLKLNPKVFNGKHGTVLDSGT 293
Query: 332 SFTYLNDPAYTQISE 346
++ Y A+ I +
Sbjct: 294 TYAYFPKEAFIAIKD 308
>gi|218185383|gb|EEC67810.1| hypothetical protein OsI_35379 [Oryza sativa Indica Group]
Length = 423
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 78/282 (27%), Positives = 124/282 (43%), Gaps = 23/282 (8%)
Query: 106 YTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSC--VHGLNSSS--GQVIDFNIYSPNT 159
+ +++G PA + + +DTGS L WL CD C++C H L G + +Y P
Sbjct: 39 FVTMNIGDPAKPYFLDIDTGSTLTWLQCDYPCINCNKAHSLFYPRLIGSFVPHGLYKPEL 98
Query: 160 --SSTSSKVPCNSTLCELQKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
+ ++ C +L+K N C Y ++Y+ G S G L+ D L
Sbjct: 99 KYAVKCTEQRCADLYADLRKPMKCGPKNQCHYGIQYV--GGSSIGVLIVDSFSLPASNGT 156
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAP-NGLFGLGMDKTSVPSILANQGLI-PNSFSMCFG 274
+ + I+FGCG Q + + P NG+ GLG K ++ S L +QG+I + C
Sbjct: 157 N---PTSIAFGCGYNQGKNNHNVPTPVNGILGLGRGKVTLLSQLKSQGVITKHVLGHCIS 213
Query: 275 SDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE---FSAIFDSGT 331
S G G + FGD P G T + + H Y+ + N+ IFDSG
Sbjct: 214 SKGKGFLFFGDAKVPTSGVTWSPMNREHKHYSPRQGTLQFNSNSKPISAAPMEVIFDSGA 273
Query: 332 SFTYLN-DPAYTQISETFNSLAKEKR---ETSTSDLPFEYCY 369
++TY P + +S ++L+KE + E D C+
Sbjct: 274 TYTYFALQPYHATLSVVKSTLSKECKFLTEVKEKDRALTVCW 315
>gi|2570402|gb|AAB97155.1| EEA1 [Hordeum vulgare subsp. vulgare]
Length = 410
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 73/251 (29%), Positives = 119/251 (47%), Gaps = 25/251 (9%)
Query: 106 YTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
Y +++G+PA + + +DTGS+L WL +C VHG + Y+P + K
Sbjct: 39 YATLNIGEPAKPYFLDVDTGSNLTWL--ECHPPVHGCKGCHPRP-PHPYYTP--ADGKLK 93
Query: 166 VPCNSTLCELQKQ----CPSAGSN----CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
V C S LC ++ P N C Y+++Y++ S G L D++ + +K+
Sbjct: 94 VVCGSPLCVAVRRDVPGIPECSRNDPHRCHYEIQYVTG--KSEGDLATDIISVNGRDKK- 150
Query: 218 KSVDSRISFGCGRVQTGSFLDGAAP-NGLFGLGMDKTSVPSILANQGLIP-NSFSMCFGS 275
RI+FGCG Q +P NG+ GLGM K + L +I N C S
Sbjct: 151 -----RIAFGCGYKQEEPPDSPPSPVNGILGLGMGKAGFAAQLKGLKMIKENVIGHCLSS 205
Query: 276 DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAV--NFEFSAIFDSGTSF 333
G G + GD P +G T +R++ Y+ + +V + + N F A+FDSG+++
Sbjct: 206 KGKGVLYVGDFNPPTRGVTWAPMRESLFYYSPGLAEVFIDKQPIRGNPTFEAVFDSGSTY 265
Query: 334 TYLNDPAYTQI 344
T++ Y +I
Sbjct: 266 THVPAQIYNEI 276
>gi|356514298|ref|XP_003525843.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 663
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 73/266 (27%), Positives = 118/266 (44%), Gaps = 29/266 (10%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
T + +G P F + +DTGS + ++PC C C G+ D + P +SST
Sbjct: 114 TRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQC--------GRHQDPK-FQPESSSTYQP 164
Query: 166 VPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
V C + C C Y+ +Y ++ + S+G L EDV+ QS+ R
Sbjct: 165 VKCT-----IDCNCDGDRMQCVYERQY-AEMSTSSGVLGEDVISFG---NQSELAPQRAV 215
Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISF 283
FGC V+TG A +G+ GLG S+ L ++ +I +SFS+C+G G G +
Sbjct: 216 FGCENVETGDLYSQHA-DGIMGLGRGDLSIMDQLVDKKVISDSFSLCYGGMDVGGGAMVL 274
Query: 284 GDKGSPGQGETPFSLRQTHPTYNITITQVSVGG-------NAVNFEFSAIFDSGTSFTYL 336
G P +S P YNI + ++ V G N + + + DSGT++ YL
Sbjct: 275 GGISPPSDMTFAYSDPDRSPYYNIDLKEMHVAGKRLPLNANVFDGKHGTVLDSGTTYAYL 334
Query: 337 NDPAYTQISETFNSLAKEKRETSTSD 362
+ A+ + + ++ S D
Sbjct: 335 PEAAFLAFKDAIVKELQSLKQISGPD 360
>gi|42568291|ref|NP_199124.3| aspartyl protease family protein [Arabidopsis thaliana]
gi|332007527|gb|AED94910.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 631
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 76/255 (29%), Positives = 117/255 (45%), Gaps = 39/255 (15%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
T + +G P F + +DTGS + ++PC C C G+ D + P S++
Sbjct: 78 TRLWIGTPPQEFALIVDTGSTVTYVPCSTCKQC--------GKHQDPK-FQPELSTSYQA 128
Query: 166 VPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
+ CN C C G C Y+ RY ++ + S+G L ED++ + + S R
Sbjct: 129 LKCNPD-C----NCDDEGKLCVYERRY-AEMSSSSGVLSEDLISFGNESQLSPQ---RAV 179
Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISF 283
FGC +TG A +G+ GLG K SV L ++G+I + FS+C+G G G +
Sbjct: 180 FGCENEETGDLFSQRA-DGIMGLGRGKLSVVDQLVDKGVIEDVFSLCYGGMEVGGGAMVL 238
Query: 284 GDKGSPGQGET-----PFSLRQTHPTYNITITQVSVGGNAV-------NFEFSAIFDSGT 331
G K SP G PF P YNI + Q+ V G ++ N + + DSGT
Sbjct: 239 G-KISPPPGMVFSHSDPFR----SPYYNIDLKQMHVAGKSLKLNPKVFNGKHGTVLDSGT 293
Query: 332 SFTYLNDPAYTQISE 346
++ Y A+ I +
Sbjct: 294 TYAYFPKEAFIAIKD 308
>gi|255543963|ref|XP_002513044.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223548055|gb|EEF49547.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 431
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 83/283 (29%), Positives = 122/283 (43%), Gaps = 45/283 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ +G PA + ++A+DT +D W+PC CV C +++ S+T
Sbjct: 96 YIVRAKIGTPAQTMLLAMDTSNDAAWIPCSGCVGC------------SSTVFNNVKSTTF 143
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
V C + C+ GS C + + Y S + L +DV+ LATD S
Sbjct: 144 KTVGCEAPQCKQVPNSKCGGSACAFNMTYGSSSIAAN--LSQDVVTLATDSIPS------ 195
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS----DGTG 279
+FGC TGS + P GL GLG S+ S Q L ++FS C S + +G
Sbjct: 196 YTFGCLTEATGSSIP---PQGLLGLGRGPMSLLS--QTQNLYQSTFSYCLPSFRSLNFSG 250
Query: 280 RISFGDKGSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEFSA-----------I 326
+ G G P + +T L+ + Y + + + VG V+ SA I
Sbjct: 251 SLRLGPVGQPKRIKTTPLLKNPRRSSLYYVNLMAIRVGRRVVDIPPSALAFNPTTGAGTI 310
Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
FDSGT FT L PAYT + + F + T TS F+ CY
Sbjct: 311 FDSGTVFTRLVAPAYTAVRDAFRK--RVGNATVTSLGGFDTCY 351
>gi|168064509|ref|XP_001784204.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664276|gb|EDQ51002.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 367
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 89/290 (30%), Positives = 128/290 (44%), Gaps = 42/290 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ V VG P + +DTGSD+ WL C C +C ++ +++P++SS+
Sbjct: 16 YFAVVGVGTPRRDMYLVVDTGSDITWLQCAPCTNCYKQKDA---------LFNPSSSSSF 66
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
+ C+S+LC + C YQ Y DG+ + G LV D + L + V +
Sbjct: 67 KVLDCSSSLCLNLDVMGCLSNKCLYQADY-GDGSFTMGELVTDNVVLDDAFGPGQVVLTN 125
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGTGR 280
I GCG G+F A G+ GLG S P+ L N FS C SD +
Sbjct: 126 IPLGCGHDNEGTFGTAA---GILGLGRGPLSFPNNL--DASTRNIFSYCLPDRESDPNHK 180
Query: 281 --ISFGDKGSP--GQGETPFSLRQTHPT----YNITITQVSVGGN------AVNFEFSA- 325
+ FGD P G F + +P Y + IT +SVGGN A F+ +
Sbjct: 181 STLVFGDAAIPHTATGSVKFIPQLRNPRVATYYYVQITGISVGGNLLTNIPASVFQLDSH 240
Query: 326 -----IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP-FEYCY 369
IFDSGT+ T L AYT + + F A TS +D F+ CY
Sbjct: 241 GNGGTIFDSGTTITRLEARAYTAVRDAFR--AATMHLTSAADFKIFDTCY 288
>gi|115468912|ref|NP_001058055.1| Os06g0610800 [Oryza sativa Japonica Group]
gi|51091964|dbj|BAD35493.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|113596095|dbj|BAF19969.1| Os06g0610800 [Oryza sativa Japonica Group]
gi|218198535|gb|EEC80962.1| hypothetical protein OsI_23679 [Oryza sativa Indica Group]
Length = 519
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 84/257 (32%), Positives = 114/257 (44%), Gaps = 31/257 (12%)
Query: 99 NSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYS 156
+LG +Y V +G PA + V DTGSD W+ C CV + ++
Sbjct: 176 RALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQRE--------KLFD 227
Query: 157 PNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
P +SST + V C + C +G +C Y V+Y DG+ S GF D L L++ +
Sbjct: 228 PASSSTYANVSCAAPACSDLDVSGCSGGHCLYGVQY-GDGSYSIGFFAMDTLTLSSYDAV 286
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--G 274
FGCG G F + A GL GLG KTS+P + G F+ C
Sbjct: 287 KG-----FRFGCGERNDGLFGEAA---GLLGLGRGKTSLP--VQTYGKYGGVFAHCLPAR 336
Query: 275 SDGTGRISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFE---FSA---IF 327
S GTG + FG P TP L PT Y + +T + VGG + F+A I
Sbjct: 337 STGTGYLDFGAGSPPATTTTPM-LTGNGPTFYYVGMTGIRVGGRLLPIAPSVFAAAGTIV 395
Query: 328 DSGTSFTYLNDPAYTQI 344
DSGT T L AY+ +
Sbjct: 396 DSGTVITRLPPAAYSSL 412
>gi|356528671|ref|XP_003532923.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 440
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 88/322 (27%), Positives = 136/322 (42%), Gaps = 45/322 (13%)
Query: 69 YFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDL 128
+ R + R +Q +D++P T + ++ + F +G P + DTGSDL
Sbjct: 62 FARSKRRLRLSQNDDRSPGTITIPDEPITEYLMRFY------IGTPPVERFAIADTGSDL 115
Query: 129 FWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCEL----QKQCPSAG 183
W+ C C CV + ++ P SST VPC+S C L Q+ C
Sbjct: 116 IWVQCAPCEKCVPQ---------NAPLFDPRKSSTFKTVPCDSQPCTLLPPSQRACVGKS 166
Query: 184 SNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPN 243
C YQ Y D T+ +G L + ++ + K +++FGC + +
Sbjct: 167 GQCYYQYIY-GDHTLVSGILGFESINFGSKNNAIKF--PKLTFGCTFSNNDTVDESKRNM 223
Query: 244 GLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGTGRISFGD----KGSPGQGETPF 296
GL GLG+ S+ S L Q I FS CF S+ T ++ FG+ K G TP
Sbjct: 224 GLVGLGVGPLSLISQLGYQ--IGRKFSYCFPPLSSNSTSKMRFGNDAIVKQIKGVVSTPL 281
Query: 297 SLRQTHPT-YNITITQVSVGGNAVNFEFSA-----IFDSGTSFTYLNDPAYTQISETFNS 350
++ P+ Y + + VS+G V S + DSGTSFT L Y + F +
Sbjct: 282 IIKSIGPSYYYLNLEGVSIGNKKVKTSESQTDGNILIDSGTSFTILKQSFYNK----FVA 337
Query: 351 LAKEKRETSTSDLP---FEYCY 369
L KE +P + +C+
Sbjct: 338 LVKEVYGVEAVKIPPLVYNFCF 359
>gi|302780643|ref|XP_002972096.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
gi|300160395|gb|EFJ27013.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
Length = 393
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 90/309 (29%), Positives = 130/309 (42%), Gaps = 33/309 (10%)
Query: 73 RGRGLAAQGNDKTPLTFSAGNDTYR-LNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWL 131
R R +AA+ N + + + D L+ G + ++SVG P F DTGSDL W+
Sbjct: 22 RVRWMAARANSSSWSSMAGTTDVESPLHPDGGGYVMDISVGTPGKRFRAIADTGSDLVWV 81
Query: 132 PCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC-ELQKQCPSAGSNCPYQ 189
+ C C G I+ P SST ++ C+S LC EL C S C Y
Sbjct: 82 QSEPCTGCSGG-----------TIFDPRQSSTFREMDCSSQLCAELPGSCEPGSSTCSYS 130
Query: 190 VRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLG 249
Y S T G D + L T S+ S + GCG V +G DG +GL GLG
Sbjct: 131 YEYGSGET--EGEFARDTISLGTTSDGSQKFPS-FAVGCGMVNSG--FDGV--DGLVGLG 183
Query: 250 MDKTSVPSILANQGLIPNSFSMCF----GSDGTGRISFGDK----GSPGQGETPFSLRQT 301
S+ S L+ I + FS C + + FG G+ Q T
Sbjct: 184 QGPVSLTSQLS--AAIDSKFSYCLVDINSQSESSPLLFGPSAALHGTGIQSTKITPPSDT 241
Query: 302 HPTYNI-TITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETST 360
+PTY + T+ ++V G + + I DSGT+ TY+ Y ++ S+ R
Sbjct: 242 YPTYYLLTVNGIAVAGQTMGSPGTTIIDSGTTLTYVPSGVYGRVLSRMESMVTLPR-VDG 300
Query: 361 SDLPFEYCY 369
S + + CY
Sbjct: 301 SSMGLDLCY 309
>gi|242066140|ref|XP_002454359.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
gi|241934190|gb|EES07335.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
Length = 460
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 99/340 (29%), Positives = 153/340 (45%), Gaps = 45/340 (13%)
Query: 34 FHHRYSD----PVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTF 89
HHR+ P K + +++D + +A R ++ G A G +++ +T
Sbjct: 61 LHHRHGPCSPLPTKKMPSLEDRLHRDQL--RAAYIKRKFSGDVKKDGQGAGGVEQSHVTV 118
Query: 90 SAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQ 148
T LN+L +L V +G PA + V +D+GSD+ W+ C C+ C ++
Sbjct: 119 PTTLGT-SLNTLEYL--ITVRLGSPAKTQTVLIDSGSDVSWVQCKPCLQCHSQVDP---- 171
Query: 149 VIDFNIYSPNTSSTSSKVPCNSTLCELQKQ----CPSAGSNCPYQVRYLSDGTMSTGFLV 204
++ P+ SST S C+S C Q C S+ S C Y VRY +DG+ +TG
Sbjct: 172 -----LFDPSLSSTYSPFSCSSAACAQLGQDGNGC-SSSSQCQYIVRY-ADGSSTTGTYS 224
Query: 205 EDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGL 264
D L L ++ S FGC V++G F D +GL GLG S+ S A G
Sbjct: 225 SDTLALGSN------TISNFQFGCSHVESG-FND--LTDGLMGLGGGAPSLASQTA--GT 273
Query: 265 IPNSFSMCF--GSDGTGRISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVN- 320
+FS C +G ++ G G+ G +TP PT Y + + + VGG ++
Sbjct: 274 FGTAFSYCLPPTPSSSGFLTLG-AGTSGFVKTPMLRSSPVPTFYGVRLEAIRVGGTQLSI 332
Query: 321 ----FEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKR 356
F + DSGT T L AY+ +S F + K+ R
Sbjct: 333 PTSVFSAGMVMDSGTIITRLPRTAYSALSSAFKAGMKQYR 372
>gi|168014188|ref|XP_001759635.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689174|gb|EDQ75547.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 485
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 81/292 (27%), Positives = 127/292 (43%), Gaps = 32/292 (10%)
Query: 67 DRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGS 126
DR F RGRGL +D L + G+ + + V +G PA F + +DTGS
Sbjct: 71 DRRFERRGRGLVEDAR------MVLHDD---LLTKGY-YTSRVFIGTPAQEFALIVDTGS 120
Query: 127 DLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSN 185
+ ++PC C C H Q + P+ SS+ V CNS C + K C +
Sbjct: 121 TVTYVPCSSCTHCGHH------QACFDPRFKPDNSSSYQTVSCNSPDC-ITKMCDARVHQ 173
Query: 186 CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGL 245
C Y+ R ++ + S G L +D+L S+ + FGC +TG A +G+
Sbjct: 174 CKYE-RVYAEMSSSKGVLGKDLLGFGNG---SRLQPHPLLFGCETAETGDLYLQHA-DGI 228
Query: 246 FGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISFGDKGSPGQGETPFSLRQTHP 303
GLG S+ L G + +SFS+C+G +G G + G P S
Sbjct: 229 MGLGRGPLSIVDQLVGTGAMEDSFSLCYGGMDEGGGSMVLGAIPPPPAMVFAKSDPNRSN 288
Query: 304 TYNITITQVSVGGNAVNF-------EFSAIFDSGTSFTYLNDPAYTQISETF 348
YN+ ++++ V G ++N + DSGT++ YL D A+ +
Sbjct: 289 YYNLELSEIQVQGVSLNVPSEVFNGRLGTVLDSGTTYAYLPDKAFDAFKDAI 340
>gi|255538124|ref|XP_002510127.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223550828|gb|EEF52314.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 641
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 74/266 (27%), Positives = 116/266 (43%), Gaps = 29/266 (10%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
T + +G P F + +DTGS + ++PC C C G+ D + P +SST
Sbjct: 90 TRLFIGTPPQEFALIVDTGSTVTYVPCSTCEQC--------GKHQDPR-FQPESSSTYKP 140
Query: 166 VPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
+ CN + C C G C Y+ RY ++ + S+G L EDVL +S+ R
Sbjct: 141 MQCNPS-C----NCDDEGKQCTYERRY-AEMSSSSGLLAEDVLSFGN---ESELTPQRAI 191
Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT--GRISF 283
FGC V+TG A +G+ GLG SV L + ++ NSFS+C+G G +
Sbjct: 192 FGCETVETGELFSQRA-DGIMGLGRGPLSVVDQLVIKEVVGNSFSLCYGGMDVVGGAMVL 250
Query: 284 GDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-------EFSAIFDSGTSFTYL 336
G+ P S YNI + ++ V G + + + DSGT++ YL
Sbjct: 251 GNIPPPPDMVFAHSDPYRSAYYNIELKELHVAGKRLKLNPRVFDGKHGTVLDSGTTYAYL 310
Query: 337 NDPAYTQISETFNSLAKEKRETSTSD 362
+ A+ + K ++ D
Sbjct: 311 PEEAFVAFKDAIIKEIKFLKQIHGPD 336
>gi|225216879|gb|ACN85177.1| aspartic proteinase nepenthesin-1 precursor [Oryza nivara]
gi|225216896|gb|ACN85193.1| aspartic proteinase nepenthesin-1 precursor [Oryza rufipogon]
Length = 515
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 84/257 (32%), Positives = 114/257 (44%), Gaps = 31/257 (12%)
Query: 99 NSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYS 156
+LG +Y V +G PA + V DTGSD W+ C CV + ++
Sbjct: 172 RALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQRE--------KLFD 223
Query: 157 PNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
P +SST + V C + C +G +C Y V+Y DG+ S GF D L L++ +
Sbjct: 224 PASSSTYANVSCAAPACSDLDVSGCSGGHCLYGVQY-GDGSYSIGFFAMDTLTLSSYDAV 282
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--G 274
FGCG G F + A GL GLG KTS+P + G F+ C
Sbjct: 283 KG-----FRFGCGERNDGLFGEAA---GLLGLGRGKTSLP--VQTYGKYGGVFAHCLPAR 332
Query: 275 SDGTGRISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFE---FSA---IF 327
S GTG + FG P TP L PT Y + +T + VGG + F+A I
Sbjct: 333 STGTGYLDFGAGSPPATTTTPM-LTGNGPTFYYVGMTGIRVGGRLLPIAPSVFAAAGTIV 391
Query: 328 DSGTSFTYLNDPAYTQI 344
DSGT T L AY+ +
Sbjct: 392 DSGTVITRLPPAAYSSL 408
>gi|225216914|gb|ACN85210.1| aspartic proteinase nepenthesin-1 precursor [Oryza glaberrima]
Length = 516
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 84/257 (32%), Positives = 114/257 (44%), Gaps = 31/257 (12%)
Query: 99 NSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYS 156
+LG +Y V +G PA + V DTGSD W+ C CV + ++
Sbjct: 173 RALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQRE--------KLFD 224
Query: 157 PNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
P +SST + V C + C +G +C Y V+Y DG+ S GF D L L++ +
Sbjct: 225 PASSSTYANVSCAAPACSDLDVSGCSGGHCLYGVQY-GDGSYSIGFFAMDTLTLSSYDAV 283
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--G 274
FGCG G F + A GL GLG KTS+P + G F+ C
Sbjct: 284 KG-----FRFGCGERNDGLFGEAA---GLLGLGRGKTSLP--VQTYGKYGGVFAHCLPPR 333
Query: 275 SDGTGRISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFE---FSA---IF 327
S GTG + FG P TP L PT Y + +T + VGG + F+A I
Sbjct: 334 STGTGYLDFGAGSPPATTTTPM-LTGNGPTFYYVGMTGIRVGGRLLPIAPSVFAAAGTIV 392
Query: 328 DSGTSFTYLNDPAYTQI 344
DSGT T L AY+ +
Sbjct: 393 DSGTVITRLPPAAYSSL 409
>gi|357154085|ref|XP_003576664.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 509
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 96/302 (31%), Positives = 132/302 (43%), Gaps = 39/302 (12%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFN-IYSPNTSSTS 163
+ +V +G PA V DTGSDL W+ C G SS G + +++P+ SST
Sbjct: 154 YVVSVGLGTPARDLTVVFDTGSDLSWVQC-------GPCSSGGCYKQQDPLFAPSDSSTF 206
Query: 164 SKVPCNSTLCELQKQCPSAGSN--CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV- 220
S V C + C ++ C + + CPY+V Y D + + G L D L L T + S
Sbjct: 207 SAVRCGARECRARQSCGGSPGDDRCPYEVVY-GDKSRTQGHLGNDTLTLGTMAPANASAE 265
Query: 221 -DSRIS---FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--- 273
D+++ FGCG TG F +GLFGLG K S+ S A G FS C
Sbjct: 266 NDNKLPGFVFGCGENNTGLF---GQADGLFGLGRGKVSLSSQAA--GKFGEGFSYCLPSS 320
Query: 274 GSDGTGRISFGDK-GSPGQGE-TPFSLRQTHPT-YNITITQVSVGGNAVNFE-----FSA 325
S G +S G +P + TP R T P+ Y + + + V G A+
Sbjct: 321 SSSAPGYLSLGTPVPAPAHAQFTPMLNRTTTPSFYYVKLVGIRVAGRAIRVSSPRVALPL 380
Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKE---KRETSTSDLPFEYCYVLRSFLHLQALVV 382
I DSGT T L AY + F S + KR S L + CY + H A V
Sbjct: 381 IVDSGTVITRLAPRAYRALRAAFLSAMGKYGYKRAPRLSIL--DTCYDFTA--HANATVS 436
Query: 383 LP 384
+P
Sbjct: 437 IP 438
>gi|357167693|ref|XP_003581287.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 468
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 86/272 (31%), Positives = 120/272 (44%), Gaps = 49/272 (18%)
Query: 108 NVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
++S+G PAL++ +DTGSDL W C CV N S+ ++ P++SST S +P
Sbjct: 121 DMSIGTPALAYAAIVDTGSDLVW--TQCKPCVECFNQST------PVFDPSSSSTYSTLP 172
Query: 168 CNSTLCE--LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
C+S+LC C SA +C Y Y D + + G L + LA K+ ++
Sbjct: 173 CSSSLCSDLPTSTCTSAAKDCGYTYTY-GDASSTQGVLAAETFTLA------KTKLPGVA 225
Query: 226 FGCGRVQTGS-FLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGTGR--- 280
FGCG G F GA GL GLG S+ S L GL FS C S D T +
Sbjct: 226 FGCGDTNEGDGFTQGA---GLVGLGRGPLSLVSQL---GL--GKFSYCLTSLDDTSKSPL 277
Query: 281 -------ISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA------- 325
IS + TP + P+ Y +T+ ++VG + SA
Sbjct: 278 LLGSLAAISTDTASAAAIQTTPLIKNPSQPSFYYVTLKALTVGSTRIPLPGSAFAVQDDG 337
Query: 326 ----IFDSGTSFTYLNDPAYTQISETFNSLAK 353
I DSGTS TYL Y + + F + K
Sbjct: 338 TGGVIVDSGTSITYLELQGYRPLKKAFAAQMK 369
>gi|302783112|ref|XP_002973329.1| hypothetical protein SELMODRAFT_413603 [Selaginella moellendorffii]
gi|300159082|gb|EFJ25703.1| hypothetical protein SELMODRAFT_413603 [Selaginella moellendorffii]
Length = 437
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 91/295 (30%), Positives = 132/295 (44%), Gaps = 36/295 (12%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L+YT + +G P V +DTGSD+ W+ C C SC+ S + +IY+ + SST
Sbjct: 82 LYYTEIGLGNPVQKLKVIVDTGSDILWVKCSPCRSCL----SKQDIIPPLSIYNLSASST 137
Query: 163 SSKVPCNSTLCE-LQKQCPSAGSN--CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
SS C+ LC Q C +GSN C Y + Y D + S G V+D +H + +
Sbjct: 138 SSVSSCSDPLCTGEQAVCSRSGSNSACAYGISY-QDKSTSIGAYVKDDMHYVL--QGGNA 194
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD--G 277
S I FGC TGS+ +G+ G G +VP+ +A Q + FS C G + G
Sbjct: 195 TTSHIFFGCAINITGSW----PADGIMGFGQISKTVPNQIATQRNMSRVFSHCLGGEKHG 250
Query: 278 TGRISFGDKGSPGQGETPFS-LRQTHPTYNITITQVSVGGNAVNF---EFS--------- 324
G + FG++ P E F+ L YN+ + +SV + EFS
Sbjct: 251 GGILEFGEE--PNTTEMVFTPLLNVTTHYNVDLLSISVNSKVLPIDSKEFSYVSNSTNET 308
Query: 325 -AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRSFLHLQ 378
I DSGTSF L A + +L K L C+ L+S L ++
Sbjct: 309 GVIIDSGTSFALLATKANRILFSEIKNLTTAKLGPKLEGLQ---CFYLKSGLTVE 360
>gi|357492389|ref|XP_003616483.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517818|gb|AES99441.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 88.2 bits (217), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 86/283 (30%), Positives = 120/283 (42%), Gaps = 35/283 (12%)
Query: 110 SVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPC 168
SVG P DTGSD+ WL C+ C C + I++P+ SS+ +PC
Sbjct: 92 SVGTPPTKIYGIADTGSDIVWLQCEPCEQCYNQTTP---------IFNPSKSSSYKNIPC 142
Query: 169 NSTLCELQKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFG 227
+S LC + + N C Y++ Y D + S G L D L L + S +I G
Sbjct: 143 SSKLCHSVRDTSCSDQNSCQYKISY-GDSSHSQGDLSVDTLSLESTSGSPVSF-PKIVIG 200
Query: 228 CGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF------GSDGTGRI 281
CG G+F G A +G+ GLG S+ + L + I FS C S+ + +
Sbjct: 201 CGTDNAGTF--GGASSGIVGLGGGPVSLITQLGSS--IGGKFSYCLVPLLNKESNASSIL 256
Query: 282 SFGDKG-SPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNF---------EFSAIFDSG 330
SFGD G G L + P Y +T+ SVG V F E + I DSG
Sbjct: 257 SFGDAAVVSGDGVVSTPLIKKDPVFYFLTLQAFSVGNKRVEFGGSSEGGDDEGNIIIDSG 316
Query: 331 TSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRS 373
T+ T + YT + L K R + F CY L+S
Sbjct: 317 TTLTLIPSDVYTNLESAVVDLVKLDR-VDDPNQQFSLCYSLKS 358
>gi|15219354|ref|NP_175079.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12320825|gb|AAG50556.1|AC074228_11 nucellin, putative [Arabidopsis thaliana]
gi|332193902|gb|AEE32023.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 405
Score = 87.8 bits (216), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 79/273 (28%), Positives = 113/273 (41%), Gaps = 30/273 (10%)
Query: 109 VSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
+ +G P +F +DTGSDL W+ CD C C N Y P + +
Sbjct: 53 MQIGSPPKAFQFDIDTGSDLTWVQCDAPCSGCTLPPNLQ---------YKPK----GNII 99
Query: 167 PCNSTLCEL-----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
PC++ +C + CP+ C Y+V+Y G+ S G LV D L +
Sbjct: 100 PCSNPICTALHWPNKPHCPNPQEQCDYEVKYADQGS-SMGALVTDQFPLKL--VNGSFMQ 156
Query: 222 SRISFGCGRVQT-GSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGR 280
++FGCG Q+ S A G+ GLG K + + L + GL N C S G G
Sbjct: 157 PPVAFGCGYDQSYPSAHPPPATAGVLGLGRGKIGLLTQLVSAGLTRNVVGHCLSSKGGGF 216
Query: 281 ISFGDKGSP--GQGETPFSLRQTHPTYNITITQVSVGGNAVNFE-FSAIFDSGTSFTYLN 337
+ FGD P G TP + H Y + G + IFD+G+S+TY N
Sbjct: 217 LFFGDNLVPSIGVAWTPLLSQDNH--YTTGPADLLFNGKPTGLKGLKLIFDTGSSYTYFN 274
Query: 338 DPAY-TQISETFNSLAKEKRETSTSDLPFEYCY 369
AY T I+ N L + + D C+
Sbjct: 275 SKAYQTIINLIGNDLKVSPLKVAKEDKTLPICW 307
>gi|225217022|gb|ACN85307.1| aspartic proteinase nepenthesin-1 precursor [Oryza ridleyi]
Length = 525
Score = 87.8 bits (216), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 84/267 (31%), Positives = 117/267 (43%), Gaps = 36/267 (13%)
Query: 99 NSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYS 156
+LG +Y + +G PA + V DTGSD W+ C+ CV + ++
Sbjct: 179 RALGTGNYVVTIGLGTPAGRYTVVFDTGSDTTWVQCEPCVVVCYEQQE--------KLFD 230
Query: 157 PNTSSTSSKVPCNSTLCE--LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
P SST + + C + C K C +G +C Y V+Y DG+ S GF D L L+
Sbjct: 231 PARSSTDANISCAAPACSDLYTKGC--SGGHCLYGVQY-GDGSYSIGFFAMDTLTLS--- 284
Query: 215 KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF- 273
S FGCG G F + A GL GLG KTS+P ++ F+ CF
Sbjct: 285 --SYDAIKGFRFGCGERNEGLFGEAA---GLLGLGRGKTSLPVQAYDK--YGGVFAHCFP 337
Query: 274 -GSDGTGRISFGDKGSPG---QGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA---- 325
S GTG + FG SP + TP + Y + +T + VGG ++ S
Sbjct: 338 ARSSGTGYLDFGPGSSPAVSTKLTTPMLVDNGLTFYYVGLTGIRVGGKLLSIPPSVFTTA 397
Query: 326 --IFDSGTSFTYLNDPAYTQISETFNS 350
I DSGT T L AY+ + F S
Sbjct: 398 GTIVDSGTVITRLPPAAYSSLRSAFAS 424
>gi|255563835|ref|XP_002522918.1| nucellin, putative [Ricinus communis]
gi|223537845|gb|EEF39461.1| nucellin, putative [Ricinus communis]
Length = 433
Score = 87.8 bits (216), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 87/280 (31%), Positives = 130/280 (46%), Gaps = 38/280 (13%)
Query: 106 YTNV--SVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
Y NV S+GQPA + + +DTGSDL WL CD C C+ + +Y P
Sbjct: 70 YYNVTLSIGQPAKPYFLDVDTGSDLTWLQCDAPCRQCIEAPHP---------LYRP---- 116
Query: 162 TSSKVPCNSTLCELQKQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
+++ V C LC Q P + C Y+V Y +DG S G LV+DV L +
Sbjct: 117 SNNLVICEDPLCA-SLQPPGVHNCQDPDQCDYEVEY-ADGGSSLGVLVKDVFVL--NFTN 172
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAP---NGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
K ++ ++ GCG Q L G + +G+ GLG +S+PS L++QGL+ N C
Sbjct: 173 GKRLNPLLALGCGYDQ----LPGRSNHPLDGILGLGRGISSIPSQLSSQGLVSNVIGHCL 228
Query: 274 GSDGTGRISFGDK--GSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE-FSAIFDSG 330
G G + FG+ S G TP S R Y+ ++ G + +FDSG
Sbjct: 229 SGRGGGFLFFGEDIYDSSGVTWTPMS-RDHLKHYSPGFAELIFDGKSTGIRNLLVVFDSG 287
Query: 331 TSFTYLNDPAYTQISETFN-SLAKEKRETSTSDLPFEYCY 369
+S+TYLN AY + + L+++ + D C+
Sbjct: 288 SSYTYLNAQAYQHLVFSLKRELSRKPISEALDDQTLPLCW 327
>gi|302781726|ref|XP_002972637.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
gi|300160104|gb|EFJ26723.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
Length = 393
Score = 87.8 bits (216), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 90/309 (29%), Positives = 130/309 (42%), Gaps = 33/309 (10%)
Query: 73 RGRGLAAQGNDKTPLTFSAGNDTYR-LNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWL 131
R R +AA+ N + + + D L+ G + ++SVG P F DTGSDL W+
Sbjct: 22 RVRWMAARANSSSWSSMAGTTDVESPLHPDGGGYVMDISVGTPGKRFRAIADTGSDLVWV 81
Query: 132 PCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC-ELQKQCPSAGSNCPYQ 189
+ C C G I+ P SST ++ C+S LC EL C S C Y
Sbjct: 82 QSEPCTGCSGG-----------TIFDPRQSSTFREMDCSSQLCTELPGSCEPGSSACSYS 130
Query: 190 VRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLG 249
Y S T G D + L T S+ S + GCG V +G DG +GL GLG
Sbjct: 131 YEYGSGET--EGEFARDTISLGTTSGGSQKFPS-FAVGCGMVNSG--FDGV--DGLVGLG 183
Query: 250 MDKTSVPSILANQGLIPNSFSMCF----GSDGTGRISFGDK----GSPGQGETPFSLRQT 301
S+ S L+ I + FS C + + FG G+ Q T
Sbjct: 184 QGPVSLTSQLS--AAIDSKFSYCLVDINSQSESSPLLFGPSAALHGTGIQSTKITPPSDT 241
Query: 302 HPTYNI-TITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETST 360
+PTY + T+ ++V G + + I DSGT+ TY+ Y ++ S+ R
Sbjct: 242 YPTYYLLTVNGIAVAGQTMGSPGTTIIDSGTTLTYVPSGVYGRVLSRMESMVTLPR-VDG 300
Query: 361 SDLPFEYCY 369
S + + CY
Sbjct: 301 SSMGLDLCY 309
>gi|225216995|gb|ACN85284.1| aspartic proteinase nepenthesin-1 precursor [Oryza australiensis]
Length = 519
Score = 87.8 bits (216), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 89/287 (31%), Positives = 124/287 (43%), Gaps = 37/287 (12%)
Query: 99 NSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYS 156
+LG +Y V +G PA + V DTGSD W+ C CV + ++
Sbjct: 173 RALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQRE--------KLFD 224
Query: 157 PNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
P SST + + C + C +G NC Y V+Y DG+ S GF D L L++ +
Sbjct: 225 PARSSTYANISCAAPACSDLDTRGCSGGNCLYGVQY-GDGSYSIGFFAMDTLTLSSYDAV 283
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--G 274
FGCG G F + A GL GLG KTS+P ++ F+ C
Sbjct: 284 KG-----FRFGCGERNEGLFGEAA---GLLGLGRGKTSLPVQTYDK--YGGVFAHCLPAR 333
Query: 275 SDGTGRISFGDKGSPGQG----ETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA---- 325
S GTG + FG GSP TP L PT Y + +T + VGG ++ S
Sbjct: 334 SSGTGYLDFG-PGSPAAAGARLTTPM-LTDNGPTFYYVGMTGIRVGGQLLSIPQSVFTTA 391
Query: 326 --IFDSGTSFTYLNDPAYTQISETFNS-LAKEKRETSTSDLPFEYCY 369
I DSGT T L AY+ + F S +A + + + + CY
Sbjct: 392 GTIVDSGTVITRLPPAAYSSLRSAFASAMAARGYKKAPAVSLLDTCY 438
>gi|255567949|ref|XP_002524952.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223535787|gb|EEF37449.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 394
Score = 87.8 bits (216), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 70/256 (27%), Positives = 115/256 (44%), Gaps = 37/256 (14%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
T + +G P +F + +DTGS + ++PC C C + + P SST
Sbjct: 92 TRIWIGTPPQTFALIVDTGSTVTYVPCSTCEQCGRHQDPK---------FEPELSSTYQP 142
Query: 166 VPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
V CN + C + C Y+ +Y ++ + S+G L ED++ QS+ V R
Sbjct: 143 VSCN-----IDCTCDNERKQCVYERQY-AEMSSSSGVLGEDIISFG---NQSELVPQRAI 193
Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISF 283
FGC +TG A +G+ GLG S+ L +G+I +SFS+C+G G G +
Sbjct: 194 FGCENQETGDLYSQRA-DGIMGLGRGDLSIVDQLVEKGVISDSFSLCYGGMDIGGGAMIL 252
Query: 284 GDKGSPGQGETPFSLRQTHPT----YNITITQVSVGGNAVNFEFS-------AIFDSGTS 332
G P + ++ P YNI + + V G ++ + S + DSGT+
Sbjct: 253 GGISPP----SGMVFAESDPVRSQYYNIDLKAIHVAGKQLHLDPSIFDGKHGTVLDSGTT 308
Query: 333 FTYLNDPAYTQISETF 348
+ YL + A+T +
Sbjct: 309 YAYLPEAAFTAFKDAM 324
>gi|326491519|dbj|BAJ94237.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326524456|dbj|BAK00611.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 499
Score = 87.8 bits (216), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 80/258 (31%), Positives = 110/258 (42%), Gaps = 32/258 (12%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ V +G PA + V DTGSD W+ C CV + ++ P SST
Sbjct: 163 YVVTVGLGTPASKYTVVFDTGSDTTWVQCRPCVVKCYKQKEP--------LFDPAKSSTY 214
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
+ V C + C G +C Y V+Y DG+ + GF +D L +A D +
Sbjct: 215 ANVSCTDSACADLDTNGCTGGHCLYAVQY-GDGSYTVGFFAQDTLTIAHDAIKG------ 267
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS--DGTGRI 281
FGCG G F A GL GLG KTS+ N+ +F+ C + GTG +
Sbjct: 268 FRFGCGEKNNGLFGKTA---GLMGLGRGKTSLTVQAYNK--YGGAFAYCLPALTTGTGYL 322
Query: 282 SFGDKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFSA------IFDSGTSF 333
FG GS G TP + Y + +T + VGG V S + DSGT
Sbjct: 323 DFG-PGSAGNNARLTPMLTDKGQTFYYVGMTGIRVGGQQVPVAESVFSTAGTLVDSGTVI 381
Query: 334 TYLNDPAYTQISETFNSL 351
T L AYT +S F+ +
Sbjct: 382 TRLPATAYTALSSAFDKV 399
>gi|224144963|ref|XP_002325476.1| predicted protein [Populus trichocarpa]
gi|222862351|gb|EEE99857.1| predicted protein [Populus trichocarpa]
Length = 372
Score = 87.8 bits (216), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 70/269 (26%), Positives = 115/269 (42%), Gaps = 53/269 (19%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L++ + +G P+ + V +DTGSD+ W+ C C C + S I +Y P +S +
Sbjct: 26 LYFAKIGLGNPSKDYYVQVDTGSDILWVNCIGCDKC----PTKSDLGIKLTLYDPASSVS 81
Query: 163 SSKVPCNSTLC---------ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL--A 211
+++V C+ C + +K+ P C Y V Y DG+ + G+ V D +
Sbjct: 82 ATRVSCDDDFCTSTYNGLLPDCKKELP-----CQYNVVY-GDGSSTAGYFVSDAVQFERV 135
Query: 212 TDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSM 271
T Q+ + ++FGCG Q+G GLG ++ IL +F+
Sbjct: 136 TGNLQTGLSNGTVTFGCGAQQSG------------GLGTSGEALDGILG-------AFAH 176
Query: 272 CFGS-DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF--------- 321
C + +G G + G+ SP TP Q H YN+ + ++ VGG +
Sbjct: 177 CLDNVNGGGIFAIGELVSPKVNTTPMVPNQAH--YNVYMKEIEVGGTVLELPTDVFDSGD 234
Query: 322 EFSAIFDSGTSFTYLNDPAYTQISETFNS 350
I DSGT+ YL + Y + S
Sbjct: 235 RRGTIIDSGTTLAYLPEVVYDSMMNEIRS 263
>gi|225216988|gb|ACN85278.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
Length = 518
Score = 87.8 bits (216), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 89/287 (31%), Positives = 126/287 (43%), Gaps = 37/287 (12%)
Query: 99 NSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYS 156
+LG +Y V +G PA + V DTGSD W+ C CV + ++
Sbjct: 172 RALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQQE--------KLFD 223
Query: 157 PNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
P SST + V C + C +G +C Y V+Y DG+ S GF D L L++ +
Sbjct: 224 PARSSTYANVSCAAPACFDLDTRGCSGGHCLYGVQY-GDGSYSIGFFAMDTLTLSSYDAV 282
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--G 274
FGCG G F + A GL GLG KTS+P ++ F+ C
Sbjct: 283 KG-----FRFGCGERNEGLFGEAA---GLLGLGRGKTSLPVQTYDK--YGGVFAHCLPAR 332
Query: 275 SDGTGRISFGDKGSPGQG----ETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA---- 325
S GTG + FG GSP TP L PT Y + +T + VGG ++ S
Sbjct: 333 SSGTGYLDFG-PGSPAAAGARLTTPM-LTDNGPTFYYVGMTGIRVGGQLLSIPQSVFATA 390
Query: 326 --IFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEYCY 369
I DSGT T L PAY+ + F +++A + + + + CY
Sbjct: 391 GTIVDSGTVITRLPPPAYSSLRSAFVSAMAARGYKKAPAVSLLDTCY 437
>gi|225217056|gb|ACN85339.1| aspartic proteinase nepenthesin-1 precursor [Oryza granulata]
Length = 521
Score = 87.4 bits (215), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 85/271 (31%), Positives = 118/271 (43%), Gaps = 34/271 (12%)
Query: 99 NSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYS 156
+LG +Y + +G PA + V DTGSD W+ C CV + ++
Sbjct: 175 RALGTGNYVVTIGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYKQQE--------KLFD 226
Query: 157 PNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
P SST + V C + C +G +C Y V+Y DG+ S GF D L L++ +
Sbjct: 227 PARSSTYANVSCAAPACSDLYTRGCSGGHCLYSVQY-GDGSYSIGFFAMDTLTLSSYDAV 285
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--G 274
FGCG G F + A GL GLG KTS+P ++ F+ C
Sbjct: 286 KG-----FRFGCGERNEGLFGEAA---GLLGLGRGKTSLPVQTYDK--YGGVFAHCLPAR 335
Query: 275 SDGTGRISFGDKGSP---GQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA----- 325
S GTG + FG GSP G +T L PT Y + +T + VGG ++ S
Sbjct: 336 SSGTGYLDFG-PGSPAAVGARQTTPMLTDNGPTFYYVGMTGIRVGGQLLSIPQSVFSTAG 394
Query: 326 -IFDSGTSFTYLNDPAYTQISETFNSLAKEK 355
I DSGT T L AY+ + F S +
Sbjct: 395 TIVDSGTVITRLPPAAYSSLRSAFASAMAAR 425
>gi|297842525|ref|XP_002889144.1| hypothetical protein ARALYDRAFT_476912 [Arabidopsis lyrata subsp.
lyrata]
gi|297334985|gb|EFH65403.1| hypothetical protein ARALYDRAFT_476912 [Arabidopsis lyrata subsp.
lyrata]
Length = 467
Score = 87.4 bits (215), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 79/277 (28%), Positives = 113/277 (40%), Gaps = 36/277 (12%)
Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPN 158
LG+ +Y +++G P F + +DTGSDL W+ CD C C Y PN
Sbjct: 65 LGY-YYVLLNIGNPPKLFDLDIDTGSDLTWVQCDAPCNGCTKPRAKQ---------YKPN 114
Query: 159 TSSTSSKVPCNSTLCE-----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
++ +PC+ LC + C C Y++ Y SD S G LV D L
Sbjct: 115 HNT----LPCSHLLCSGLDLTQNRPCDDPEDQCDYEIGY-SDHASSIGALVTDEFPLKL- 168
Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPN-GLFGLGMDKTSVPSILANQGLIPNSFSMC 272
++ ++FGCG Q P G+ GLG K + + L + G+ N C
Sbjct: 169 -ANGSIMNPHLTFGCGYDQQNPGPHPPPPTAGILGLGRGKVGISTQLKSLGITKNVIVHC 227
Query: 273 FGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYN-------ITITQVSVGGNAVNFEFSA 325
G G +S GD+ P G T SL + N + + G +N
Sbjct: 228 LSHTGKGFLSIGDELVPSSGVTWTSLATNSASKNYMTGPAELLFNDKTTGVKGIN----V 283
Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSD 362
+FDSG+S+TY N AY I + K T T D
Sbjct: 284 VFDSGSSYTYFNAEAYQAILDLIRKDLNGKPLTDTKD 320
>gi|449515031|ref|XP_004164553.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 430
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 82/265 (30%), Positives = 117/265 (44%), Gaps = 35/265 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ +VS+G P + ++ DTGSDL W C C+ C L I++P S++
Sbjct: 92 YLMSVSIGTPPVDYLGIADTGSDLTWAQCLPCLKCYQQLRP---------IFNPLKSTSF 142
Query: 164 SKVPCNSTLCEL--QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
S VPCN+ C C G C Y Y D T S G L + + + S SV
Sbjct: 143 SHVPCNTQTCHAVDDGHCGVQGV-CDYSYTY-GDRTYSKGDLGFEKITIG-----SSSVK 195
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG---SDGT 278
S I GCG +G F +G+ GLG + S+ S ++ I FS C S
Sbjct: 196 SVI--GCGHASSGGF---GFASGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHAN 250
Query: 279 GRISFGDKG---SPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS----AIFDSGT 331
G+I+FG+ PG TP + T Y IT+ +S+ GN + F+ I DSGT
Sbjct: 251 GKINFGENAVVSGPGVVSTPLISKNTVTYYYITLEAISI-GNERHMAFAKQGNVIIDSGT 309
Query: 332 SFTYLNDPAYTQISETFNSLAKEKR 356
+ T L Y + + + K KR
Sbjct: 310 TLTILPKELYDGVVSSLLKVVKAKR 334
>gi|224142013|ref|XP_002324355.1| predicted protein [Populus trichocarpa]
gi|222865789|gb|EEF02920.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 83/280 (29%), Positives = 122/280 (43%), Gaps = 35/280 (12%)
Query: 102 GFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSS 161
G + V +G P F ++ DTGSDL W C+ C+ G + D P TS+
Sbjct: 137 GGAYVVTVGLGTPKKDFTLSFDTGSDLTWTQCE--PCLGGCFPQNQPKFD-----PTTST 189
Query: 162 TSSKVPCNSTLCEL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
+ V C+S C+L + C S + C Y ++Y S T+ GFL + L +A+ +
Sbjct: 190 SYKNVSCSSEFCKLIAEGNYPAQDCIS--NTCLYGIQYGSGYTI--GFLATETLAIASSD 245
Query: 215 KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG 274
V FGC G+F GL GLG ++PS N+ N FS C
Sbjct: 246 -----VFKNFLFGCSEESRGTF---NGTTGLLGLGRSPIALPSQTTNK--YKNLFSYCLP 295
Query: 275 S--DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS---AIFDS 329
+ TG +SFG + S TP S + Y + +SV G + S I DS
Sbjct: 296 ASPSSTGHLSFGVEVSQAAKSTPISPKLKQ-LYGLNTVGISVRGRELPINGSISRTIIDS 354
Query: 330 GTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
GT+FT+L P Y+ + F + T+ + F+ CY
Sbjct: 355 GTTFTFLPSPTYSALGSAFREMMANYTLTNGTS-SFQPCY 393
>gi|158513711|sp|A2ZC67.2|ASP1_ORYSI RecName: Full=Aspartic proteinase Asp1; Short=OSAP1; Short=OsAsp1;
AltName: Full=Nucellin-like protein; Flags: Precursor
Length = 410
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 75/278 (26%), Positives = 121/278 (43%), Gaps = 28/278 (10%)
Query: 106 YTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNT--SS 161
+ +++G PA + + +DTGS L WL CD C++C + +Y P +
Sbjct: 39 FVTMNIGDPAKPYFLDIDTGSTLTWLQCDYPCINC---------NKVPHGLYKPELKYAV 89
Query: 162 TSSKVPCNSTLCELQKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
++ C +L+K N C Y ++Y+ G S G L+ D L +
Sbjct: 90 KCTEQRCADLYADLRKPMKCGPKNQCHYGIQYV--GGSSIGVLIVDSFSLPASNGTN--- 144
Query: 221 DSRISFGCGRVQTGSFLDGAAP-NGLFGLGMDKTSVPSILANQGLI-PNSFSMCFGSDGT 278
+ I+FGCG Q + + P NG+ GLG K ++ S L +QG+I + C S G
Sbjct: 145 PTSIAFGCGYNQGKNNHNVPTPVNGILGLGRGKVTLLSQLKSQGVITKHVLGHCISSKGK 204
Query: 279 GRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE---FSAIFDSGTSFTY 335
G + FGD P G T + + H Y+ + N+ IFDSG ++TY
Sbjct: 205 GFLFFGDAKVPTSGVTWSPMNREHKHYSPRQGTLQFNSNSKPISAAPMEVIFDSGATYTY 264
Query: 336 LN-DPAYTQISETFNSLAKEKR---ETSTSDLPFEYCY 369
P + +S ++L+KE + E D C+
Sbjct: 265 FALQPYHATLSVVKSTLSKECKFLTEVKEKDRALTVCW 302
>gi|356515904|ref|XP_003526637.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 421
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 86/288 (29%), Positives = 128/288 (44%), Gaps = 43/288 (14%)
Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPN 158
LG+ + ++++G P + + +DTGSDL W+ CD C C N +Y PN
Sbjct: 61 LGY-YTVSLAIGNPPKVYDLDIDTGSDLTWVQCDAPCQGCTIPRN---------RLYKPN 110
Query: 159 TSSTSSKVPCNSTLCELQKQCPS---AGSN--CPYQVRYLSDGTMSTGFLVEDVLHLA-T 212
+ V C LC+ + P+ AG N C Y+V Y G+ S G L+ D + L T
Sbjct: 111 ----GNLVKCGDPLCKAIQSAPNHHCAGPNEQCDYEVEYADQGS-SLGVLLRDNIPLKFT 165
Query: 213 DEKQSKSVDSRISFGCGRVQTGSFLD-GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSM 271
+ ++ + ++FGCG Q + A+ G+ GLG KTS+ S L + GLI N
Sbjct: 166 NGSLARPI---LAFGCGYDQKHVGHNPSASTAGVLGLGNGKTSILSQLHSLGLIRNVVGH 222
Query: 272 CFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITI---------TQVSVGGNAVNFE 322
C G G + FGD+ P G L Q+ T + SV G
Sbjct: 223 CLSERGGGFLFFGDQLVPQSGVVWTPLLQSSSTQHYKTGPADLFFDRKPTSVKG------ 276
Query: 323 FSAIFDSGTSFTYLNDPAYTQ-ISETFNSLAKEKRETSTSDLPFEYCY 369
IFDSG+S+TY N A+ ++ N L + +T D C+
Sbjct: 277 LQLIFDSGSSYTYFNSKAHKALVNLVTNDLRGKPLSRATEDSSLPICW 324
>gi|302811785|ref|XP_002987581.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
gi|300144735|gb|EFJ11417.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
Length = 511
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 90/322 (27%), Positives = 137/322 (42%), Gaps = 53/322 (16%)
Query: 98 LNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYS 156
L G +Y + VG PA+ ++ +DTGSD+ W+ C C CV L ++
Sbjct: 132 LGQAGLEYYVPLQVGTPAVEVVLIMDTGSDVSWIQCVPCKDCVPALRPP---------FN 182
Query: 157 PNTSSTSSKVPCNSTLCE-----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLA 211
P SS+ K+PC S+ C ++ C +G C + ++Y DG++S+G L + +
Sbjct: 183 PRHSSSFFKLPCASSTCTNVYQGVKPFCSPSGRTCLFSIQY-GDGSLSSGLLAMETIAGN 241
Query: 212 T----DEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPN 267
T D + K S I+ GC + GA+ GL G+ S PS L+++
Sbjct: 242 TPNFGDGEPVKL--SNITLGCADIDREGLPTGAS--GLLGMDRRPISFPSQLSSR--YAR 295
Query: 268 SFSMCFGS-----DGTGRISFG--DKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAV- 319
FS CF + +G + FG D SP TP P+ ++ V + G +V
Sbjct: 296 KFSHCFPDKIAHLNSSGLVFFGESDIISPYLRYTPLVQNPAVPSASLDYYYVGLVGISVD 355
Query: 320 ---------NFEFS-------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDL 363
NF+ I DSGT+FTYL PA+ + F LA+ D
Sbjct: 356 ESRLPLSHKNFDIDKVTGSGGTIIDSGTAFTYLKKPAFQAMRREF--LARTSHLAKVDDN 413
Query: 364 P-FEYCYVLRSFLHLQALVVLP 384
F CY + S +LP
Sbjct: 414 SGFTPCYNITSGTAALESTILP 435
>gi|225217008|gb|ACN85295.1| aspartic proteinase nepenthesin-1 precursor [Oryza coarctata]
Length = 500
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 81/270 (30%), Positives = 117/270 (43%), Gaps = 32/270 (11%)
Query: 99 NSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYS 156
++LG +Y + +G PA + V DTGSD W+ C+ CV + ++
Sbjct: 154 SALGTGNYVVTIGLGTPAGRYTVVFDTGSDTTWVQCEPCVVVCYKQQE--------KLFD 205
Query: 157 PNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
P SST + + C + C +G +C Y V+Y DG+ S GF D L L+
Sbjct: 206 PARSSTYANISCAAPACSDLYIKGCSGGHCLYGVQY-GDGSYSIGFFAMDTLTLS----- 259
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--G 274
S FGCG G + + A GL GLG KTS+P ++ F+ CF
Sbjct: 260 SYDAIKGFRFGCGERNEGLYGEAA---GLLGLGRGKTSLPVQAYDK--YGGVFAHCFPAR 314
Query: 275 SDGTGRISFGDKGSP---GQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA------ 325
S GTG + FG P + TP + Y + +T + VGG ++ S
Sbjct: 315 SSGTGYLDFGPGSLPAVSAKLTTPMLVDNGPTFYYVGLTGIRVGGKLLSIPQSVFTTSGT 374
Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEK 355
I DSGT T L AY+ + F S E+
Sbjct: 375 IVDSGTVITRLPPAAYSSLRSAFASAMAER 404
>gi|18400416|ref|NP_565559.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|20197296|gb|AAM15014.1| predicted protein [Arabidopsis thaliana]
gi|330252412|gb|AEC07506.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 458
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 80/271 (29%), Positives = 116/271 (42%), Gaps = 34/271 (12%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L N SVGQP + + +DTGS L W+ C C C SS +I +++P SST
Sbjct: 95 LFLVNFSVGQPPVPQLTIMDTGSSLLWIQCQPCKHC------SSDHMIH-PVFNPALSST 147
Query: 163 SSKVPCNSTLCELQKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
+ C+ C SN C Y+ Y+S GT S G L ++ L T + V
Sbjct: 148 FVECSCDDRFCRYAPNGHCGSSNKCVYEQVYIS-GTGSKGVLAKERLTFTTPNGNT-VVT 205
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG-----SD 276
I+FGCG + G L+ G+ GLG TS+ L ++ FS C G +
Sbjct: 206 QPIAFGCG-YENGEQLESHF-TGILGLGAKPTSLAVQLGSK------FSYCIGDLANKNY 257
Query: 277 GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE----------FSAI 326
G ++ G+ TP + Y + + +SVG +N E I
Sbjct: 258 GYNQLVLGEDADILGDPTPIEFETENSIYYMNLEGISVGDTQLNIEPVVFKRRGPRTGVI 317
Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKRE 357
DSGT +T+L D AY ++ S+ K E
Sbjct: 318 LDSGTLYTWLADIAYRELYNEIKSILDPKLE 348
>gi|359492489|ref|XP_002285867.2| PREDICTED: aspartic proteinase Asp1-like [Vitis vinifera]
Length = 453
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 82/272 (30%), Positives = 121/272 (44%), Gaps = 32/272 (11%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
++ +G P + + +D+GSDL WL CD CVSC + Y PN
Sbjct: 70 VSLRIGNPPKPYTLDIDSGSDLTWLQCDAPCVSCTKAPHPP---------YKPN----KG 116
Query: 165 KVPCNSTLCEL-----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
+ CN +C + C ++ C Y+V Y G+ S G LV D+ L +
Sbjct: 117 PITCNDPMCSALHWPSKPPCKASHEQCDYEVSYADHGS-SLGVLVHDIFSLQLTNGTLAA 175
Query: 220 VDSRISFGCGRVQTGSFLDGAAP---NGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD 276
R++FGCG Q S+ AP +G+ GLG K+S+ + L + GLI + C
Sbjct: 176 --PRLAFGCGYDQ--SYPGPNAPPFVDGVLGLGYGKSSIVTQLRSLGLIRSIVGHCLSGR 231
Query: 277 GTGRISFGDKGS--PGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE-FSAIFDSGTSF 333
G G + GD S PG TP S + Y + + G + +FDSG+S+
Sbjct: 232 GGGFLFLGDGLSTTPGIIWTPMSRKSGESAYALGPADLLFNGQNSGVKGLRLVFDSGSSY 291
Query: 334 TYLNDPAY-TQISETFNSLAKEKRETSTSDLP 364
TY N AY T +S L + +ET+ LP
Sbjct: 292 TYFNAQAYKTTLSLVRKYLNGKLKETADESLP 323
>gi|413944392|gb|AFW77041.1| hypothetical protein ZEAMMB73_800604 [Zea mays]
Length = 476
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 82/286 (28%), Positives = 123/286 (43%), Gaps = 37/286 (12%)
Query: 98 LNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFN--IY 155
L++L F+ V G PA ++ V DTGSD+ W+ C+ C SG + I+
Sbjct: 130 LDTLEFV--VTVGFGTPAQTYTVIFDTGSDVSWI--QCLPC-------SGHCYKQHDPIF 178
Query: 156 SPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEK 215
P S+T S VPC C + C Y+V Y DG+ S G L + L L +
Sbjct: 179 DPTKSATYSVVPCGHPQCAAADGSKCSNGTCLYKVEY-GDGSSSAGVLSHETLSLTSTRA 237
Query: 216 QSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
+FGCG+ G F D +GL GLG + S+ S A +FS C S
Sbjct: 238 LPG-----FAFGCGQTNLGDFGD---VDGLIGLGRGQLSLSSQAAAS--FGGTFSYCLPS 287
Query: 276 DGT--GRISFGDKGSPGQGETPFSL---RQTHPT-YNITITQVSVGGNAVNF------EF 323
D T G ++ G + ++ +Q +P+ Y + + + +GG + +
Sbjct: 288 DNTTHGYLTIGPTTPASNDDVQYTAMVQKQDYPSFYFVELVSIDIGGYILPVPPTLFTDD 347
Query: 324 SAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
DSGT TYL AYT + + F + + D PF+ CY
Sbjct: 348 GTFLDSGTILTYLPPEAYTALRDRFKFTMTQYKPAPAYD-PFDTCY 392
>gi|56692305|dbj|BAD80835.1| nucellin-like protein [Daucus carota]
Length = 426
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 79/261 (30%), Positives = 115/261 (44%), Gaps = 31/261 (11%)
Query: 102 GFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNT 159
G+ H ++GQP + + DTGSDL WL CD C+ C + +Y P
Sbjct: 65 GYYH-VQFNIGQPPKPYFLDPDTGSDLTWLQCDAPCIQCTPAPHP---------LYQPTN 114
Query: 160 SSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
K P ++L +C C Y+V Y +DG S G LV D+ +
Sbjct: 115 DLVVCKDPICASLHPDNYRCDDP-DQCDYEVEY-ADGGSSIGVLVNDLF--PVNLTSGMR 170
Query: 220 VDSRISFGCGRVQTGSFLDGAA---PNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD 276
R++ GCG Q L G A +G+ GLG +S+ + L++QGL+ N CF
Sbjct: 171 ARPRLTIGCGYDQ----LPGIAYHPLDGVLGLGRGSSSIVAQLSSQGLVRNVVGHCFSRR 226
Query: 277 GTGRISFGDK--GSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE-FSAIFDSGTSF 333
G G + FGD S TP S R Y ++ + G + + +FDSG+S+
Sbjct: 227 GGGYLFFGDDIYDSSKVIWTPMS-RDYLKHYTPGFAELILNGRSSGLKNLLVVFDSGSSY 285
Query: 334 TYLNDPAYTQISETFNSLAKE 354
TY N TQ +T S K+
Sbjct: 286 TYFN----TQTYQTLLSFIKK 302
>gi|302141796|emb|CBI18999.3| unnamed protein product [Vitis vinifera]
Length = 390
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 81/268 (30%), Positives = 121/268 (45%), Gaps = 24/268 (8%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
++ +G P + + +D+GSDL WL CD CVSC + Y PN +
Sbjct: 37 VSLRIGNPPKPYTLDIDSGSDLTWLQCDAPCVSCTKAPHPP---------YKPNKGPITC 87
Query: 165 KVP-CNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
P C++ + C ++ C Y+V Y G+ S G LV D+ L + R
Sbjct: 88 NDPMCSALHWPSKPPCKASHEQCDYEVSYADHGS-SLGVLVHDIFSLQLTNGTLAA--PR 144
Query: 224 ISFGCGRVQTGSFLDGAAP---NGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGR 280
++FGCG Q S+ AP +G+ GLG K+S+ + L + GLI + C G G
Sbjct: 145 LAFGCGYDQ--SYPGPNAPPFVDGVLGLGYGKSSIVTQLRSLGLIRSIVGHCLSGRGGGF 202
Query: 281 ISFGDKGS--PGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE-FSAIFDSGTSFTYLN 337
+ GD S PG TP S + Y + + G + +FDSG+S+TY N
Sbjct: 203 LFLGDGLSTTPGIIWTPMSRKSGESAYALGPADLLFNGQNSGVKGLRLVFDSGSSYTYFN 262
Query: 338 DPAY-TQISETFNSLAKEKRETSTSDLP 364
AY T +S L + +ET+ LP
Sbjct: 263 AQAYKTTLSLVRKYLNGKLKETADESLP 290
>gi|357469587|ref|XP_003605078.1| Aspartic proteinase Asp1 [Medicago truncatula]
gi|355506133|gb|AES87275.1| Aspartic proteinase Asp1 [Medicago truncatula]
Length = 418
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 79/282 (28%), Positives = 118/282 (41%), Gaps = 36/282 (12%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD-----CVSCVHGLNSSSGQVIDFNIYSPN 158
++ ++++G P + + +DTGSDL W+ CD C C + +Y PN
Sbjct: 61 IYTVSINIGNPPNPYELDIDTGSDLTWVQCDGPDAPCKGCT---------LPKDKLYKPN 111
Query: 159 TSSTSSKVPCNSTLCE--------LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL 210
+ V C+ +C ++C C Y+V Y +D STG L D +H+
Sbjct: 112 GNQL---VKCSDPICAAVQPPFSTFGQKCAKPIPPCVYKVEY-ADNAESTGALARDYMHI 167
Query: 211 ATDEKQSKSVDSRISFGCGRVQT-GSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSF 269
+ S S + FGCG Q + G+ GLG K S+ S L + G I N
Sbjct: 168 GS---PSGSNVPLVVFGCGYEQKFSGPTPPPSTPGVLGLGNGKISILSQLHSMGFIHNVL 224
Query: 270 SMCFGSDGTGRISFGDKGSPGQG--ETPFSLRQTHPTYNITITQVSVGGNAVNFE-FSAI 326
C ++G G + GDK P G TP Y+ + G + I
Sbjct: 225 GHCLSAEGGGYLFLGDKFIPSSGIFWTPIIQSSLEKHYSTGPVDLFFNGKPTPAKGLQII 284
Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEK---RETSTSDLPF 365
FDSG+S+TY + YT ++ N+ K K RET LP
Sbjct: 285 FDSGSSYTYFSPRVYTIVANMVNNDLKGKPLRRETKDPSLPI 326
>gi|357476337|ref|XP_003608454.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355509509|gb|AES90651.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 683
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 71/252 (28%), Positives = 113/252 (44%), Gaps = 29/252 (11%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
T + +G P F + +DTGS + ++PC C C G+ D + P+ SST
Sbjct: 83 TRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQC--------GRHQDPK-FQPDLSSTYQP 133
Query: 166 VPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
V C L C + C Y+ +Y ++ + S+G L EDV+ QS+ R
Sbjct: 134 VKCT-----LDCNCDNDRMQCVYERQY-AEMSTSSGVLGEDVVSFGN---QSELAPQRAV 184
Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISF 283
FGC V+TG A +G+ GLG S+ L ++ ++ +SFS+C+G G G +
Sbjct: 185 FGCENVETGDLYSQHA-DGIMGLGRGDLSIMDQLVDKNVVSDSFSLCYGGMDVGGGAMVL 243
Query: 284 GDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-------EFSAIFDSGTSFTYL 336
G P S P YNI + ++ V G + + ++ DSGT++ YL
Sbjct: 244 GGISPPSDMVFAQSDPVRSPYYNIDLKEIHVAGKRLPLNPSVFDGKHGSVLDSGTTYAYL 303
Query: 337 NDPAYTQISETF 348
+ A+ E
Sbjct: 304 PEEAFLAFKEAI 315
>gi|449508697|ref|XP_004163385.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase Asp1-like
[Cucumis sativus]
Length = 418
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 84/282 (29%), Positives = 121/282 (42%), Gaps = 28/282 (9%)
Query: 102 GFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNT 159
GF + T + VGQP + + DTGSDL WL CD C C L+ +Y P
Sbjct: 55 GFYNVT-LYVGQPPKPYFLDPDTGSDLTWLQCDAPCQQCTETLHP---------LYQP-- 102
Query: 160 SSTSSKVPCNSTLC-----ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
++ VPC LC + +C + C Y+V Y +DG S G LV DV L +
Sbjct: 103 --SNDLVPCKDPLCMSLHSSMDHRCENP-DQCDYEVEY-ADGGSSLGVLVRDVFPL--NL 156
Query: 215 KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG 274
+ R++ GCG Q +G+ GLG S+ S L NQG++ N CF
Sbjct: 157 TNGDPIRPRLALGCGYDQDPGSSSYHPMDGILGLGRGAVSIVSQLHNQGIVRNVVGHCFN 216
Query: 275 SDGTGRISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFE-FSAIFDSGTS 332
S G G FGD + + +P Y+ ++ G + +FDSG+S
Sbjct: 217 SKGGGYXFFGDGIYDPYRLVWTPMSRDYPKHYSPGFGELIFNGRSTGLRNLFVVFDSGSS 276
Query: 333 FTYLNDPAYTQISETFN-SLAKEKRETSTSDLPFEYCYVLRS 373
+TY N AY ++ N LA + + D C+ R
Sbjct: 277 YTYFNAQAYQVLTSLLNRELAGKPLREAMDDDTLPLCWRGRK 318
>gi|449449755|ref|XP_004142630.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
gi|449500674|ref|XP_004161165.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 413
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 79/274 (28%), Positives = 123/274 (44%), Gaps = 22/274 (8%)
Query: 105 HYTNV-SVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
H+T + ++G P+ F + +DTGSDL W+ CD C+ C + +Y P+ ++
Sbjct: 52 HFTVLLNIGNPSKVFELDIDTGSDLTWVQCDVECIGCTLPRDM---------LYRPHNNA 102
Query: 162 TSSKVPCNSTLCELQKQC-PSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
S + P + L L K + C Y+V Y G+ S G LV+D++ + K +
Sbjct: 103 VSREDPLCAALSSLGKFIFKNPNDQCAYEVEYADHGS-SVGVLVKDLVPMRL--TNGKRI 159
Query: 221 DSRISFGCGRVQ-TGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTG 279
+ FGCG Q G + G+ GL K ++ S L++ G + N C G G
Sbjct: 160 SPNLGFGCGYDQENGDLQQPPSIAGVLGLSSSKATIVSQLSDLGHVSNVVGHCLTGRGGG 219
Query: 280 RISFGDKGSP--GQGETPFSLRQTHPTYNITITQVSVGGNAVNF-EFSAIFDSGTSFTYL 336
+ FG P G TP LR + Y+ +V G AV + FDSG+S+TY
Sbjct: 220 FLFFGGDVVPSSGMSWTPI-LRNSEGKYSSGPAEVYFNGRAVGIGGLTLTFDSGSSYTYF 278
Query: 337 NDPAYTQISETF-NSLAKEKRETSTSDLPFEYCY 369
N Y I + N L + ++ D E C+
Sbjct: 279 NSQVYRAIEKLLKNDLKGNPLKLASDDKTLELCW 312
>gi|61214232|sp|Q766C2.1|NEP2_NEPGR RecName: Full=Aspartic proteinase nepenthesin-2; AltName:
Full=Nepenthesin-II; Flags: Precursor
gi|41016423|dbj|BAD07475.1| aspartic proteinase nepenthesin II [Nepenthes gracilis]
Length = 438
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 93/341 (27%), Positives = 138/341 (40%), Gaps = 62/341 (18%)
Query: 34 FHHRYSDPVKGI-LAVDDLPKKGSFAYYS----ALAHRDRYFRLRGRGLAAQGNDKTPLT 88
HH P G+ + ++ + + Y A+ +R R L + +TP+
Sbjct: 31 LHHGQKRPQPGLRVDLEQVDSGKNLTKYELIKRAIKRGERRMRSINAMLQSSSGIETPVY 90
Query: 89 FSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSG 147
AG+ Y +N V++G P SF +DTGSDL W C+ C C
Sbjct: 91 --AGDGEYLMN---------VAIGTPDSSFSAIMDTGSDLIWTQCEPCTQCFSQPTP--- 136
Query: 148 QVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDV 207
I++P SS+ S +PC S C+ + C Y Y DG+ + G++ +
Sbjct: 137 ------IFNPQDSSSFSTLPCESQYCQDLPSETCNNNECQYTYGY-GDGSTTQGYMATET 189
Query: 208 LHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPN 267
T S I+FGCG G F G GL G+G S+PS L
Sbjct: 190 FTFET------SSVPNIAFGCGEDNQG-FGQGNGA-GLIGMGWGPLSLPSQLGV-----G 236
Query: 268 SFSMC---FGSDGTGRISFGD------KGSPGQGETPFSLRQTHPTYNITITQVSVGGNA 318
FS C +GS ++ G +GSP SL T+ Y IT+ ++VGG+
Sbjct: 237 QFSYCMTSYGSSSPSTLALGSAASGVPEGSPSTTLIHSSLNPTY--YYITLQGITVGGDN 294
Query: 319 VNFEFSA-----------IFDSGTSFTYLNDPAYTQISETF 348
+ S I DSGT+ TYL AY +++ F
Sbjct: 295 LGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAF 335
>gi|302822373|ref|XP_002992845.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
gi|300139393|gb|EFJ06135.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
Length = 510
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 89/322 (27%), Positives = 137/322 (42%), Gaps = 53/322 (16%)
Query: 98 LNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYS 156
L G +Y + +G PA+ ++ +DTGSD+ W+ C C CV L ++
Sbjct: 131 LGQAGLEYYVPLQLGTPAVEVVLIMDTGSDVSWIQCVPCKDCVPALRPP---------FN 181
Query: 157 PNTSSTSSKVPCNSTLCE-----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLA 211
P SS+ K+PC S+ C ++ C +G C + ++Y DG++S+G L + +
Sbjct: 182 PRHSSSFFKLPCASSTCTNVYQGVKPFCSPSGRTCLFSIQY-GDGSLSSGLLAMETIAGN 240
Query: 212 T----DEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPN 267
T D + K S I+ GC + GA+ GL G+ S PS L+++
Sbjct: 241 TPNFGDGEPVKL--SNITLGCADIDREGLPTGAS--GLLGMDRRPISFPSQLSSR--YAR 294
Query: 268 SFSMCFGS-----DGTGRISFG--DKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAV- 319
FS CF + +G + FG D SP TP P+ ++ V + G +V
Sbjct: 295 KFSHCFPDKIAHLNSSGLVFFGESDIISPYLRYTPLVQNPAVPSASLDYYYVGLVGISVD 354
Query: 320 ---------NFEFS-------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDL 363
NF+ I DSGT+FTYL PA+ + F LA+ D
Sbjct: 355 ESRLPLSHKNFDIDKVTGSGGTIIDSGTAFTYLKKPAFQAMRREF--LARTSHLAKVDDN 412
Query: 364 P-FEYCYVLRSFLHLQALVVLP 384
F CY + S +LP
Sbjct: 413 SGFTPCYNITSGTAALESTILP 434
>gi|297841447|ref|XP_002888605.1| hypothetical protein ARALYDRAFT_475850 [Arabidopsis lyrata subsp.
lyrata]
gi|297334446|gb|EFH64864.1| hypothetical protein ARALYDRAFT_475850 [Arabidopsis lyrata subsp.
lyrata]
Length = 410
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 77/273 (28%), Positives = 110/273 (40%), Gaps = 30/273 (10%)
Query: 109 VSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
+ +G P +F +DTGSD+ W+ CD C C + Y P ++ V
Sbjct: 58 LQIGNPPKAFEFDIDTGSDITWVQCDAPCTGC---------NLPPKLQYKPKGNT----V 104
Query: 167 PCNSTLCEL-----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
PC+ +C QCP+ C Y+V Y G+ S G LV D ++
Sbjct: 105 PCSDPICLALHFPNNPQCPNPKEQCDYEVNYADQGS-SMGALVID--QFPFKLLNGSAMQ 161
Query: 222 SRISFGCGRVQT-GSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGR 280
R++FGCG Q+ S A G+ GLG K + + L + GL N C S G G
Sbjct: 162 PRLAFGCGYDQSYPSAHPPPATAGVLGLGRGKIGLLTQLVSAGLTRNVVGHCLSSKGGGY 221
Query: 281 ISFGDKGSP--GQGETPFSLRQTHPTYNITITQVSVGGNAVNFE-FSAIFDSGTSFTYLN 337
+ FGD P G TP H Y ++ G + IFD+G+S+TY N
Sbjct: 222 LFFGDTLIPSLGVAWTPLLPPDNH--YTTGPAELLFNGKPTGLKGLKLIFDTGSSYTYFN 279
Query: 338 DPAYTQISETF-NSLAKEKRETSTSDLPFEYCY 369
Y I N L + + D C+
Sbjct: 280 SKTYQTIVNLIGNDLKVSPLKVAKEDKTLPICW 312
>gi|357131735|ref|XP_003567490.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 458
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 91/312 (29%), Positives = 128/312 (41%), Gaps = 54/312 (17%)
Query: 87 LTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSS 145
+ +AG R S + +G P + +VA+D +D W+PC C+ C G +S
Sbjct: 86 VPIAAGRQILRTPS----YVARARLGTPPQTLLVAIDPSNDAAWVPCSACLGCAPGASSP 141
Query: 146 SGQVIDFNIYSPNTSSTSSKVPCNSTLC----ELQKQCPSA-GSNCPYQVRYLSDGTMST 200
S + P SST V C + C CP+ G++C + + Y S +
Sbjct: 142 S--------FDPTQSSTYRPVRCGAPQCAQVPPATPSCPAGPGASCAFNLSYASSTLHAV 193
Query: 201 GFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTS-VPSIL 259
L +D L L +D + D +FGC RV TGS P GL G G S +
Sbjct: 194 --LGQDALSL-SDSNGAAVPDDHYTFGCLRVVTGSG-GSVPPQGLVGFGRGPLSFLSQTK 249
Query: 260 ANQGLIPNSFSMCF----GSDGTGRISFGDKGSPGQGETPFSLRQTH-PT-YNITITQVS 313
A G I FS C S+ +G + G G P + +T L H P+ Y + + V
Sbjct: 250 ATYGSI---FSYCLPSYKSSNFSGTLRLGPAGQPRRIKTTPLLSNPHRPSLYYVAMVGVR 306
Query: 314 VGGNAVNFEFSA------------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTS 361
V G AV SA I D+GT FT L+ PAY + F +R S
Sbjct: 307 VNGKAVPIPASALALDAATGRGGTIVDAGTMFTRLSPPAYAALRNAF------RRGVSAP 360
Query: 362 DLP----FEYCY 369
P F+ CY
Sbjct: 361 AAPALGGFDTCY 372
>gi|255554715|ref|XP_002518395.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223542240|gb|EEF43782.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 489
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 83/301 (27%), Positives = 135/301 (44%), Gaps = 46/301 (15%)
Query: 105 HYTNVSVGQP-ALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
++ ++ +G P FI+ DTGSDL W+ C+ C SC N G+V + N SS
Sbjct: 119 YFVSIRIGTPRPQKFILVTDTGSDLTWMNCEYWCKSCPKP-NPHPGRV-----FRANDSS 172
Query: 162 TSSKVPCNSTLCELQKQ-------CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLA-TD 213
+ +PC+S C+++ Q CP+ + C + RYL +G + G + + + D
Sbjct: 173 SFRTIPCSSDDCKIELQDYFSLTECPNPNAPCLFDYRYL-NGPRAIGVFANETVTVGLND 231
Query: 214 EKQSKSVDSRISFGCGRVQTGSFLD-GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC 272
K+ + D I GC T SF + P+G+ GLG K S+ LA + N FS C
Sbjct: 232 HKKIRLFDVLI--GC----TESFNETNGFPDGVMGLGYRKHSLALRLAE--IFGNKFSYC 283
Query: 273 F-----GSDGTGRISFGDKGS---PGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS 324
S+ +SFGD P T L + Y + ++ +SVGG+ ++
Sbjct: 284 LVDHLSSSNHKNFLSFGDIPEMKLPKMQHTELLLGYINAFYPVNVSGISVGGSMLSISSD 343
Query: 325 ---------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPF--EYCYVLRS 373
I DSGTS T L AY ++ + + + ++ +LP +C+ +
Sbjct: 344 IWNVTGVGGMIVDSGTSLTMLAGEAYDKVVDALKPIFDKHKKVVPIELPELNNFCFEDKG 403
Query: 374 F 374
F
Sbjct: 404 F 404
>gi|388502342|gb|AFK39237.1| unknown [Lotus japonicus]
Length = 440
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 83/296 (28%), Positives = 127/296 (42%), Gaps = 39/296 (13%)
Query: 109 VSVGQPALSFIVALDTGSDLFWL---PCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
+ +G P++ + DTGSDL W+ PCD C + +Y P SST +
Sbjct: 100 IYIGTPSVERLAIADTGSDLTWVQCSPCDNTKCF---------AQNTPLYDPLNSSTFTL 150
Query: 166 VPCNSTLCEL----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
+PC+S C Q C G +C Y Y D + S G L D + L + +
Sbjct: 151 LPCDSQPCTQLPYSQYVCSDYG-DCIYAYTY-GDNSYSYGGLSSDSIRLMLLQLH---YN 205
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDGT 278
S+I FGCG + G+ GLG S+ S L ++ I + FS C F S+
Sbjct: 206 SKICFGCGFQNKFTADKSGKTTGIVGLGAGPLSLVSQLGDE--IGHKFSYCLLPFSSNSN 263
Query: 279 GRISFGDKG---SPGQGETPFSLRQTHPTYNITITQVSVGGNAV---NFEFSAIFDSGTS 332
++ FG+ G TP ++ P Y + + ++VG V + + I DSG++
Sbjct: 264 SKLKFGEAAIVQGNGVVSTPLIIKPDLPFYYLNLEGITVGAKTVKTGQTDGNIIIDSGST 323
Query: 333 FTYLNDPAYTQISETFNSLAKEK---RETSTSDLPFEYCYVLRSFLHLQALVVLPF 385
TYL + Y + F SL KE E PF++C+ + + VV F
Sbjct: 324 LTYLEESFYNE----FVSLVKETVAVEEDQYIPYPFDFCFTYKEGMSTPPDVVFHF 375
>gi|255585473|ref|XP_002533429.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223526717|gb|EEF28949.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 448
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 78/268 (29%), Positives = 124/268 (46%), Gaps = 44/268 (16%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L N S+GQPA + +DTGS++ W+ C C C +G ++D P+ SST
Sbjct: 98 LFLVNFSMGQPATPQLAIMDTGSNILWVRCAPCKRCTQ----QNGPLLD-----PSKSST 148
Query: 163 SSKVPCNSTLCELQKQCPSAGSN----CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
+ +PC +T+C PSA N C Y + Y + G S G L + L + ++
Sbjct: 149 YASLPCTNTMCHY---APSAYCNRLNQCGYNLSY-ATGLSSAGVLATEQLIFHSSDEGVN 204
Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD-- 276
+V S + FGC + G + D G+FGLG TS + + ++ FS C G+
Sbjct: 205 AVPS-VVFGCSH-ENGDYKDRRF-TGVFGLGKGITSFVTRMGSK------FSYCLGNIAD 255
Query: 277 ---GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF----------EF 323
G ++ FG+K + TP + H Y +T+ +SVG ++ E
Sbjct: 256 PHYGYNQLVFGEKANFEGYSTPLKVVNGH--YYVTLEGISVGEKRLDIDSTAFSMKGNEK 313
Query: 324 SAIFDSGTSFTYLNDPAYTQISETFNSL 351
SA+ DSGT+ T+L + A+ + L
Sbjct: 314 SALIDSGTALTWLAESAFRALDNEVRQL 341
>gi|348690234|gb|EGZ30048.1| pepsin-like aspartic protease A1 [Phytophthora sojae]
Length = 654
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 87/315 (27%), Positives = 129/315 (40%), Gaps = 42/315 (13%)
Query: 65 HRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDT 124
H DRY R + + PL G HYT V G P V DT
Sbjct: 38 HPDRYARRLN--IEEDAPEIVPLHLGLGT-----------HYTWVYAGTPPQRASVIADT 84
Query: 125 GSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQ-KQCPSAG 183
GS L PC S G S + Q + + SST V C+ Q K+C
Sbjct: 85 GSGLMAFPC---SGCDGCGSHTDQP-----FQADNSSTLIHVTCSQQQSHFQCKECTEKS 136
Query: 184 SNCPYQVRYLSDGTMSTGFLVEDVLHLA-----TDEKQSKSVDSRISFGCGRVQTGSFLD 238
C Y+ +G+ +VEDV++L DE + FGC +TG F+
Sbjct: 137 DTCAISQSYM-EGSSWKASVVEDVVYLGGESSFHDEAMRDRYGTHFQFGCQSSETGLFVT 195
Query: 239 GAAPNGLFGLGMDKTSVPSILANQGLIP-NSFSMCFGSDGTGRISFGDKGSPG-QGETPF 296
A +G+ GL T + + L + IP N FS+CF +G G +S G+ + +GE +
Sbjct: 196 QVA-DGIMGLSNSDTHIVAKLHRENKIPSNLFSLCFTENG-GTMSVGEPNTKAHRGEISY 253
Query: 297 SL----RQTHPTYNITITQVSVGGNAVNFEFSA------IFDSGTSFTYLNDPAYTQISE 346
+ R YN+ + + +GG ++N + A I DSGT+ +YL + +
Sbjct: 254 AKVIKDRSAGHFYNVNMKDIRIGGKSINAKEEAYTRGHYIVDSGTTDSYLPRAMKNEFLQ 313
Query: 347 TFNSLAKEKRETSTS 361
F +A + TS
Sbjct: 314 VFKEVAGRDYQVGTS 328
>gi|413924528|gb|AFW64460.1| hypothetical protein ZEAMMB73_591827 [Zea mays]
Length = 146
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 50/105 (47%), Positives = 61/105 (58%), Gaps = 10/105 (9%)
Query: 36 HRYSDPVKGILA--VDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGN 93
HR SD + + V P++GS YY AL D + + R LA K TFS GN
Sbjct: 33 HRLSDEARLEVGPRVGWWPQRGSGEYYRALVRSD--IQRQKRRLAVLSLSKGGSTFSPGN 90
Query: 94 DTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSC 138
D LG+L+Y V VG PA SF+VALDTGSDLFW+PCDC+ C
Sbjct: 91 D------LGWLYYAWVDVGTPATSFLVALDTGSDLFWVPCDCIQC 129
>gi|147814824|emb|CAN65806.1| hypothetical protein VITISV_015630 [Vitis vinifera]
Length = 449
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 71/293 (24%), Positives = 121/293 (41%), Gaps = 35/293 (11%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
++ VG P+ F++ DTGSDL W+ C C S + N + ++ ++ N SS+
Sbjct: 83 YFVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRS-RNCSNRKARRIRHKRVFHANLSSS 141
Query: 163 SSKVPCNSTLCELQ-------KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEK 215
+PC + +C+++ CP+ + C Y RY SDG+ + GF + + + E
Sbjct: 142 FKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRY-SDGSTALGFFANETVTVELKEG 200
Query: 216 QSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-- 273
+ + + + GC G A +G+ GLG K S A + FS C
Sbjct: 201 RKMKLHN-VLIGCSESFQGQSFQAA--DGVMGLGYSKYSFAIKAAEK--FGGKFSYCLVD 255
Query: 274 ---GSDGTGRISFGDKGSP-----GQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA 325
+ + ++FG S T L + Y + + +S+GG +
Sbjct: 256 HLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAMLKIPSEV 315
Query: 326 ---------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
I DSG+S T+L +PAY + + R+ P EYC+
Sbjct: 316 WDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLEYCF 368
>gi|168059885|ref|XP_001781930.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666576|gb|EDQ53226.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 355
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 79/284 (27%), Positives = 116/284 (40%), Gaps = 35/284 (12%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+ V +G P F V +DTGSDL W+ C + N S ++ PNTS++ +
Sbjct: 3 YLATVRLGTPERVFSVIVDTGSDLTWVQCSPCGTCYSQNDS--------LFIPNTSTSFT 54
Query: 165 KVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRI 224
K+ C + LC + C Y Y DG++STG V D + + Q + V
Sbjct: 55 KLACGTELCNGLPYPMCNQTTCVYWYSY-GDGSLSTGDFVYDTITMDGINGQKQQV-PNF 112
Query: 225 SFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-----GSDGTG 279
+FGCG GSF A +G+ GLG S PS L + FS C T
Sbjct: 113 AFGCGHDNEGSF---AGADGILGLGQGPLSFPSQLKT--VFNGKFSYCLVDWLAPPTQTS 167
Query: 280 RISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGGNAVNFEFSA---------- 325
+ FGD P + T+P Y + + +SVGG +N +A
Sbjct: 168 PLLFGDAAVPTFPGVKYISLLTNPKVPTYYYVKLNGISVGGKLLNISSTAFDIDSVGRAG 227
Query: 326 -IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYC 368
IFDSGT+ T L + ++ N+ + S + C
Sbjct: 228 TIFDSGTTVTQLAGEVHQEVLAAMNASTMDYPRKSDDSSGLDLC 271
>gi|293332735|ref|NP_001168472.1| uncharacterized protein LOC100382248 [Zea mays]
gi|223948487|gb|ACN28327.1| unknown [Zea mays]
Length = 434
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 76/253 (30%), Positives = 112/253 (44%), Gaps = 32/253 (12%)
Query: 109 VSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
V +G PA F V DTGSD W+ C CV+ + ++ P S+T + +
Sbjct: 100 VRLGTPAERFTVVFDTGSDTTWVQCQPCVAYCYRQKEP--------LFDPTKSATYANIS 151
Query: 168 CNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFG 227
C+S+ C +G +C Y ++Y DG+ + GF +D L LA D ++ FG
Sbjct: 152 CSSSYCSDLYVSGCSGGHCLYGIQY-GDGSYTIGFYAQDTLTLAYDTIKN------FRFG 204
Query: 228 CGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--GSDGTGRISFGD 285
CG G F A GL GLG KTS+P ++ F+ C S GTG + G
Sbjct: 205 CGEKNRGLFGRAA---GLLGLGRGKTSLPVQAYDK--YGGVFAYCLPATSAGTGFLDLG- 258
Query: 286 KGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFSA------IFDSGTSFTYLN 337
G+P TP + + Y + +T + VGG+ + S + DSGT T L
Sbjct: 259 PGAPAANARLTPMLVDRGPTFYYVGMTGIKVGGHVLPIPGSVFSTAGTLVDSGTVITRLP 318
Query: 338 DPAYTQISETFNS 350
AY + F+
Sbjct: 319 PSAYAPLRSAFSK 331
>gi|413923780|gb|AFW63712.1| hypothetical protein ZEAMMB73_689747 [Zea mays]
Length = 470
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 84/297 (28%), Positives = 124/297 (41%), Gaps = 40/297 (13%)
Query: 71 RLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFW 130
R+ GRG + K N Y + + ++ S+G P ++ + +DTGSDL W
Sbjct: 105 RVSGRGAPQLWDYKAAAATVPANWGYDIGTSNYV--VTASLGTPGMAQTLEVDTGSDLSW 162
Query: 131 L---PCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE----LQKQCPSAG 183
+ PC SC + ++ P SS+ + VPC + C C +A
Sbjct: 163 VQCKPCAAPSCYRQKDP---------LFDPAQSSSYAAVPCGRSACAGLGIYASACSAA- 212
Query: 184 SNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPN 243
C Y V Y DG+ +TG D L LA + + FGCG Q+G G +
Sbjct: 213 -QCGYVVSY-GDGSNTTGVYSSDTLTLAANATVQGFL-----FGCGHAQSGGLFTGI--D 263
Query: 244 GLFGLGMDKTSVPSILANQGLIPNSFSMCF--GSDGTGRISFGDKG--SPGQGETPFSLR 299
GL G G ++ S+ + G FS C S TG ++ G +PG T
Sbjct: 264 GLLGFGREQPSL--VQQTAGAYGGVFSYCLPTKSSTTGYLTLGGPSGVAPGFSTTQLLPS 321
Query: 300 QTHPTYNIT-ITQVSVGGNAVNFEFSA-----IFDSGTSFTYLNDPAYTQISETFNS 350
PTY + +T +SVGG ++ SA + D+GT T L AY + F S
Sbjct: 322 PNAPTYYVVMLTGISVGGQPLSVPASAFAAGTVVDTGTVITRLPPAAYAALRSAFRS 378
>gi|242044886|ref|XP_002460314.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
gi|241923691|gb|EER96835.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
Length = 444
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 90/307 (29%), Positives = 129/307 (42%), Gaps = 49/307 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ + +G PA + LDTGSDL W C C+ CV Q + + P SST
Sbjct: 92 YLMEMGIGTPARFYSAILDTGSDLIWTQCAPCLLCVD-------QPTPY--FDPANSSTY 142
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
+ C++ C C YQ Y D + G L + T++ ++ R
Sbjct: 143 RSLGCSAPACNALYYPLCYQKTCVYQYFY-GDSASTAGVLANETFTFGTND--TRVTLPR 199
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDGTGR 280
ISFGCG + GS +G+ G+ G G S+ S L + FS C F S R
Sbjct: 200 ISFGCGNLNAGSLANGS---GMVGFGRGSLSLVSQLGSP-----RFSYCLTSFLSPVRSR 251
Query: 281 ISFGDKGSPGQGE------TPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA-------- 325
+ FG + TPF + PT Y + +T +SVGGN + + +
Sbjct: 252 LYFGAYATLNSTNASTVQSTPFIINPALPTMYFLNMTGISVGGNRLPIDPAVLAINDTDG 311
Query: 326 ----IFDSGTSFTYLNDPAYTQISETF----NSLAK--EKRETSTSDLPFEYCYVLRSFL 375
I DSGT+ TYL +PAY + E F NS + ETS D F++ R +
Sbjct: 312 TGGTIIDSGTTITYLAEPAYYAVREAFVLYLNSTLPLLDVTETSVLDTCFQWPPPPRQSV 371
Query: 376 HLQALVV 382
L LV+
Sbjct: 372 TLPQLVL 378
>gi|357492401|ref|XP_003616489.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517824|gb|AES99447.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 85/283 (30%), Positives = 118/283 (41%), Gaps = 35/283 (12%)
Query: 110 SVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPC 168
SVG P DTGSD+ WL C+ C C + I++P+ SS+ +PC
Sbjct: 92 SVGTPPTKIYGIADTGSDIVWLQCEPCEQCYNQTTP---------IFNPSKSSSYKNIPC 142
Query: 169 NSTLCELQKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFG 227
S LC + + N C Y++ Y D + S G L D L L + S + G
Sbjct: 143 LSKLCHSVRDTSCSDQNSCQYKISY-GDSSHSQGDLSVDTLSLESTSGSPVSFPKTV-IG 200
Query: 228 CGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF------GSDGTGRI 281
CG G+F G A +G+ GLG S+ + L + I FS C S+ + +
Sbjct: 201 CGTDNAGTF--GGASSGIVGLGGGPVSLITQLGSS--IGGKFSYCLVPLLNKESNASSIL 256
Query: 282 SFGDKG-SPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNF---------EFSAIFDSG 330
SFGD G G L + P Y +T+ SVG V F E + I DSG
Sbjct: 257 SFGDAAVVSGDGVVSTPLIKKDPVFYFLTLQAFSVGNKRVEFGGSSEGGDDEGNIIIDSG 316
Query: 331 TSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRS 373
T+ T + YT + L K R + F CY L+S
Sbjct: 317 TTLTLIPSDVYTNLESAVVDLVKLDR-VDDPNQQFSLCYSLKS 358
>gi|37542275|gb|AAK81698.1| aspartyl proteinase [Oryza sativa]
Length = 410
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 74/278 (26%), Positives = 120/278 (43%), Gaps = 28/278 (10%)
Query: 106 YTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNT--SS 161
+ +++ PA + + +DTGS L WL CD C++C + +Y P +
Sbjct: 39 FVTMNISDPAKPYFLDIDTGSTLTWLQCDYPCINC---------NKVPHGLYKPELKYAV 89
Query: 162 TSSKVPCNSTLCELQKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
++ C +L+K N C Y ++Y+ G S G L+ D L +
Sbjct: 90 KCTEQRCADLYADLRKPMKCGPKNQCHYGIQYV--GGSSIGVLIVDSFSLPASNGTNP-- 145
Query: 221 DSRISFGCGRVQTGSFLDGAAP-NGLFGLGMDKTSVPSILANQGLI-PNSFSMCFGSDGT 278
+ I+FGCG Q + + P NG+ GLG K ++ S L +QG+I + C S G
Sbjct: 146 -TSIAFGCGYNQGKNNHNVPTPVNGILGLGRGKVTLLSQLKSQGVITKHVLGHCISSKGK 204
Query: 279 GRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE---FSAIFDSGTSFTY 335
G + FGD P G T + + H Y+ + N+ IFDSG ++TY
Sbjct: 205 GFLFFGDAKVPTSGVTWSPMNREHKHYSPRQGTLHFNSNSKPISAAPMEVIFDSGATYTY 264
Query: 336 LN-DPAYTQISETFNSLAKEKR---ETSTSDLPFEYCY 369
P + +S ++L+KE + E D C+
Sbjct: 265 FALQPYHATLSVVKSTLSKECKFLTEVKEKDRALTVCW 302
>gi|47497551|dbj|BAD19623.1| nucellin-like aspartic protease-like [Oryza sativa Japonica Group]
gi|47847593|dbj|BAD21980.1| nucellin-like aspartic protease-like [Oryza sativa Japonica Group]
Length = 297
Score = 85.1 bits (209), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 77/251 (30%), Positives = 112/251 (44%), Gaps = 25/251 (9%)
Query: 55 GSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQP 114
G + SAL D R GR LAA PL S L + L++T + +G P
Sbjct: 51 GGEGHLSALREHDG--RRHGRLLAAI---DLPLGGSG------LATETGLYFTRIGIGTP 99
Query: 115 ALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC- 173
A + V +DTGSD+ W+ +CVSC G S I+ +Y P S + V C+ C
Sbjct: 100 AKRYYVQVDTGSDILWV--NCVSC-DGCPRKSNLGIELTMYDPRGSQSGELVTCDQQFCV 156
Query: 174 ----ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQSKSVDSRISFG 227
+ C S S C Y + Y DG+ + GF V D L + + Q+ ++ +SFG
Sbjct: 157 ANYGGVLPSCTST-SPCEYSISY-GDGSSTAGFFVTDFLQYNQVSGDGQTTPANASVSFG 214
Query: 228 CGRVQTGSF-LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGTGRISFGD 285
CG G A +G+ G G +S+ S LA G + F+ C + +G G + G+
Sbjct: 215 CGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCLDTVNGGGIFAIGN 274
Query: 286 KGSPGQGETPF 296
P TP
Sbjct: 275 VVQPKVKTTPL 285
>gi|242045118|ref|XP_002460430.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
gi|241923807|gb|EER96951.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
Length = 488
Score = 85.1 bits (209), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 92/324 (28%), Positives = 144/324 (44%), Gaps = 38/324 (11%)
Query: 66 RDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTG 125
+DR +R + A+ K ++ A N L++ ++ ++ +G PA +V LDTG
Sbjct: 103 QDRVDAIRRKVTASSNKPKGGVSLLA-NWGKSLSTTNYV--ASLRLGTPATELVVELDTG 159
Query: 126 SDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE-------LQK 177
SD W+ C C C + ++ P SST S VPC + C+ +
Sbjct: 160 SDQSWVQCKPCADCYEQRDP---------VFDPTASSTYSAVPCGARECQELASSSSSRN 210
Query: 178 QCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS-VDSRISFGCGRVQTGSF 236
NCPY+V Y D + + G L D L L+ S + FGCG G+F
Sbjct: 211 CSSDNNKNCPYEVSY-DDDSHTVGDLARDTLTLSPSPSPSPADTVPGFVFGCGHSNAGTF 269
Query: 237 LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS--DGTGRISFGDKGSPGQGE- 293
+ +GL GLG+ K S+PS +A + +FS C S G +SFG + +
Sbjct: 270 GE---VDGLLGLGLGKASLPSQVAAR--YGAAFSYCLPSSPSAAGYLSFGGAAARANAQF 324
Query: 294 TPFSLRQTHPTYNITITQVSVGGNAVNFEFSA-------IFDSGTSFTYLNDPAYTQISE 346
T Q +Y + +T + V G A+ SA I DSGT+F+ L AY +
Sbjct: 325 TEMVTGQDPTSYYLNLTGIVVAGRAIKVPASAFATAAGTIIDSGTAFSRLPPSAYAALRS 384
Query: 347 TFNS-LAKEKRETSTSDLPFEYCY 369
+F S + + + + + S F+ CY
Sbjct: 385 SFRSAMGRYRYKRAPSSPIFDTCY 408
>gi|359494621|ref|XP_002265771.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 449
Score = 84.7 bits (208), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 71/288 (24%), Positives = 119/288 (41%), Gaps = 35/288 (12%)
Query: 110 SVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
VG P+ F++ DTGSDL W+ C C S + N + ++ ++ N SS+ +P
Sbjct: 88 KVGTPSQKFMLVADTGSDLTWMSCKYHCRS-RNCSNRKARRIRHKRVFHANLSSSFKTIP 146
Query: 168 CNSTLCELQ-------KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
C + +C+++ CP+ + C Y RY SDG+ + GF + + + E + +
Sbjct: 147 CLTDMCKIELMDLFSLTNCPTPLTPCGYDYRY-SDGSTALGFFANETVTVELKEGRKMKL 205
Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-----GS 275
+ + GC G A +G+ GLG K S A + FS C
Sbjct: 206 HN-VLIGCSESFQGQSFQAA--DGVMGLGYSKYSFAIKAAEK--FGGKFSYCLVDHLSHK 260
Query: 276 DGTGRISFGDKGSP-----GQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA----- 325
+ + ++FG S T L + Y + + +S+GG +
Sbjct: 261 NVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAMLKIPSEVWDVKG 320
Query: 326 ----IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
I DSG+S T+L +PAY + + R+ P EYC+
Sbjct: 321 AGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLEYCF 368
>gi|242067691|ref|XP_002449122.1| hypothetical protein SORBIDRAFT_05g005410 [Sorghum bicolor]
gi|241934965|gb|EES08110.1| hypothetical protein SORBIDRAFT_05g005410 [Sorghum bicolor]
Length = 407
Score = 84.7 bits (208), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 79/306 (25%), Positives = 125/306 (40%), Gaps = 48/306 (15%)
Query: 87 LTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-----CVSCVHG 141
+ F G D + Y +++G+PA + + +DTGS+L W+ C C +C
Sbjct: 26 MVFKLGGDVHPTGHF----YVTMNIGEPAKPYFLDIDTGSNLTWIKCHATPGPCKTC--- 78
Query: 142 LNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCEL-------QKQCPSAGSNCPYQVRYLS 194
+ +Y P VPC LC+ K C C YQ+ Y +
Sbjct: 79 ------NKVPHPLYRPK-----KLVPCADPLCDALHKDLGTTKDCREEPDQCHYQINY-A 126
Query: 195 DGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAP----NGLFGLGM 250
DGT S G L+ D L T ++ I+FGCG Q A +G+ GLG
Sbjct: 127 DGTTSLGVLLLDKFSLPTGSARN------IAFGCGYDQMQGPKKKAPEKVPVDGILGLGR 180
Query: 251 DKTSVPSILANQGLI-PNSFSMCFGSDGTGRISFGDKGSPGQGET---PFSLRQTHPTYN 306
+ S L + G + N C S G G + G++ P + + + Y+
Sbjct: 181 GSVDLVSQLKHSGAVSKNVIGHCLSSKGGGYLFIGEENVPSSHLHIIYIYCISREPNHYS 240
Query: 307 ITITQVSVGGNAVNFE-FSAIFDSGTSFTYLNDPAYTQISETFN-SLAKEK-RETSTSDL 363
+ +G N + + F AIFDSG+++TYL + + Q+ SL K + S +D
Sbjct: 241 PGQATLHLGRNPIGTKPFKAIFDSGSTYTYLPENLHAQLVSALKASLIKSSLKLVSDTDT 300
Query: 364 PFEYCY 369
C+
Sbjct: 301 RLHLCW 306
>gi|413943688|gb|AFW76337.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
gi|413943689|gb|AFW76338.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
Length = 499
Score = 84.7 bits (208), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 76/252 (30%), Positives = 112/252 (44%), Gaps = 32/252 (12%)
Query: 109 VSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
V +G PA F V DTGSD W+ C CV+ + ++ P S+T + +
Sbjct: 165 VRLGTPAERFTVVFDTGSDTTWVQCQPCVAYCYRQKEP--------LFDPTKSATYANIS 216
Query: 168 CNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFG 227
C+S+ C +G +C Y ++Y DG+ + GF +D L LA D ++ FG
Sbjct: 217 CSSSYCSDLYVSGCSGGHCLYGIQY-GDGSYTIGFYAQDTLTLAYDTIKN------FRFG 269
Query: 228 CGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--GSDGTGRISFGD 285
CG G F A GL GLG KTS+P ++ F+ C S GTG + G
Sbjct: 270 CGEKNRGLFGRAA---GLLGLGRGKTSLPVQAYDK--YGGVFAYCLPATSAGTGFLDLG- 323
Query: 286 KGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFSA------IFDSGTSFTYLN 337
G+P TP + + Y + +T + VGG+ + S + DSGT T L
Sbjct: 324 PGAPAANARLTPMLVDRGPTFYYVGMTGIKVGGHVLPIPGSVFSTAGTLVDSGTVITRLP 383
Query: 338 DPAYTQISETFN 349
AY + F+
Sbjct: 384 PSAYAPLRSAFS 395
>gi|297736090|emb|CBI24128.3| unnamed protein product [Vitis vinifera]
Length = 378
Score = 84.3 bits (207), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 71/287 (24%), Positives = 119/287 (41%), Gaps = 35/287 (12%)
Query: 111 VGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPC 168
VG P+ F++ DTGSDL W+ C C S + N + ++ ++ N SS+ +PC
Sbjct: 18 VGTPSQKFMLVADTGSDLTWMSCKYHCRS-RNCSNRKARRIRHKRVFHANLSSSFKTIPC 76
Query: 169 NSTLCELQ-------KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
+ +C+++ CP+ + C Y RY SDG+ + GF + + + E + +
Sbjct: 77 LTDMCKIELMDLFSLTNCPTPLTPCGYDYRY-SDGSTALGFFANETVTVELKEGRKMKLH 135
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-----GSD 276
+ + GC G A +G+ GLG K S A + FS C +
Sbjct: 136 N-VLIGCSESFQGQSFQAA--DGVMGLGYSKYSFAIKAAEK--FGGKFSYCLVDHLSHKN 190
Query: 277 GTGRISFGDKGSP-----GQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA------ 325
+ ++FG S T L + Y + + +S+GG +
Sbjct: 191 VSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAMLKIPSEVWDVKGA 250
Query: 326 ---IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
I DSG+S T+L +PAY + + R+ P EYC+
Sbjct: 251 GGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLEYCF 297
>gi|413936885|gb|AFW71436.1| hypothetical protein ZEAMMB73_738128, partial [Zea mays]
Length = 320
Score = 84.3 bits (207), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 85/298 (28%), Positives = 128/298 (42%), Gaps = 45/298 (15%)
Query: 12 VLLILLSCCAGCCFGFGTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRD--RY 69
+L++L + GC G F R P G +G + +AL D R+
Sbjct: 14 LLVLLFALSVGCASATGVF----QVRRKFPRHG--------GRGVAEHLAALRRHDANRH 61
Query: 70 FRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLF 129
RL G A G P DT L+YT + +G P + V +DTGSD+
Sbjct: 62 GRLLGAVDLALGGVGLP------TDTG-------LYYTRIEIGSPPKGYYVQVDTGSDIL 108
Query: 130 WLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE------LQKQCPSAG 183
W+ +C+ C G + SG I+ Y P S T+ V C C + CPS
Sbjct: 109 WV--NCIRC-DGCPTRSGLGIELTQYDPAGSGTT--VGCEQEFCVANSAGGVPPTCPSTS 163
Query: 184 SNCPYQVRYLSDGTMSTGFLVEDVLHL--ATDEKQSKSVDSRISFGCGRVQTGSFLDGA- 240
S C +++ Y DG+ +TGF V D + + Q+ + ++ I+FGCG Q G L +
Sbjct: 164 SPCQFRITY-GDGSTTTGFYVTDFVQYNQVSGNGQTTTSNASITFGCG-AQLGGDLGSSN 221
Query: 241 -APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGTGRISFGDKGSPGQGETPF 296
A +G+ G G +S+ S LA + F+ C + G G + G+ P TP
Sbjct: 222 QALDGILGFGQSDSSMLSQLAAARRVRKIFAHCLDTVRGGGIFAIGNVVQPKVKTTPL 279
>gi|238011160|gb|ACR36615.1| unknown [Zea mays]
Length = 461
Score = 84.3 bits (207), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 92/342 (26%), Positives = 132/342 (38%), Gaps = 73/342 (21%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-------CVSCVHGLNSSSGQ--------- 148
++ VG PA F++ DTGSDL W+ C + G N G
Sbjct: 55 YFVRFRVGTPARPFLLVADTGSDLTWVKCRRHAAPAPAPAPAPGYNYGYGAPASNDSSSV 114
Query: 149 ----VIDFNIYSPNTSSTSSKVPCNSTLCELQ-----KQCPSAGSNCPYQVRYLSDGTMS 199
++ P+ S T + +PC+S C CP+ GS C Y+ RY DG+ +
Sbjct: 115 SAAASSPARVFRPDRSRTWAPIPCSSDTCTASLPFSLAACPTPGSPCAYEYRY-KDGSAA 173
Query: 200 TGFLVEDVLHLA-----TDEKQSKSVDSRISFGCGRVQTG-SFLDGAAPNGLFGLGMDKT 253
G + D +A +KQ ++ + GC TG SFL A +G+ LG
Sbjct: 174 RGTVGTDSATIALSGRRAGKKQRRAKLRGVVLGCTTSYTGESFL---ASDGVLSLGYSNV 230
Query: 254 SVPSILANQGLIPNSFSMCF-----GSDGTGRISF-----------------GDKGSPGQ 291
S S A + FS C + T ++F G +PG
Sbjct: 231 SFASRAAAR--FGGRFSYCLVDHLAPRNATSYLTFGPNPAVSSASASRTACAGSAAAPGA 288
Query: 292 GETPFSL-RQTHPTYNITITQVSVGGNAVNFEF---------SAIFDSGTSFTYLNDPAY 341
+TP L + P Y + + VSV G + AI DSGTS T L PAY
Sbjct: 289 RQTPLLLDHRMRPFYAVAVNGVSVDGELLRIPRLVWDVQKGGGAILDSGTSLTVLVSPAY 348
Query: 342 TQISETFNSLAKEKRETSTSDL-PFEYCYVLRSFLHLQALVV 382
+ +L K+ + PF+YCY S L + L V
Sbjct: 349 RAV---VAALGKKLVGLPRVAMDPFDYCYNWTSPLTGEDLAV 387
>gi|242093566|ref|XP_002437273.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
gi|241915496|gb|EER88640.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
Length = 503
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 82/288 (28%), Positives = 122/288 (42%), Gaps = 35/288 (12%)
Query: 109 VSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
+ +G PA F V DTGSD W+ C CV+ + +++P S+T + +
Sbjct: 169 IRLGTPAARFTVVFDTGSDTTWVQCQPCVAYCYQQKEP--------LFTPTKSATYANIS 220
Query: 168 CNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFG 227
C S+ C +G +C Y V+Y DG+ + GF +D L L D + FG
Sbjct: 221 CTSSYCSDLDTRGCSGGHCLYAVQY-GDGSYTVGFYAQDTLTLGYDTVKD------FRFG 273
Query: 228 CGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--GSDGTGRISFGD 285
CG G F A GL GLG KTSVP ++ F+ C S GTG + FG
Sbjct: 274 CGEKNRGLFGKAA---GLMGLGRGKTSVPVQAYDK--YSGVFAYCIPATSSGTGFLDFGP 328
Query: 286 KGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNF------EFSAIFDSGTSFTYLN 337
TP + Y + +T + VGG+ ++ + A+ DSGT T L
Sbjct: 329 GAPAAANARLTPMLVDNGPTFYYVGMTGIKVGGHLLSIPATVFSDAGALVDSGTVITRLP 388
Query: 338 DPAYTQISETF-NSLAKEKRETSTSDLPFEYCYVLRSFLHLQALVVLP 384
AY + F + +T+ + + CY L + Q + LP
Sbjct: 389 PSAYEPLRSAFAKGMEGLGYKTAPAFSILDTCYDLTGY---QGSIALP 433
>gi|212722026|ref|NP_001131674.1| uncharacterized protein LOC100193034 precursor [Zea mays]
gi|194692214|gb|ACF80191.1| unknown [Zea mays]
gi|413946454|gb|AFW79103.1| hypothetical protein ZEAMMB73_752316 [Zea mays]
Length = 441
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 78/289 (26%), Positives = 112/289 (38%), Gaps = 46/289 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
++ V VG PA F + DTGS+L W+ C + GL ++ P S + +
Sbjct: 91 YFVKVLVGTPAQEFTLVADTGSELTWVKCAGGASPPGL-----------VFRPEASKSWA 139
Query: 165 KVPCNSTLCELQ-----KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
VPC+S C+L C S+ S C Y RY + G + D +A +
Sbjct: 140 PVPCSSDTCKLDVPFSLANCSSSASPCSYDYRYKEGSAGALGVVGTDSATIALPGGKVAQ 199
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-----G 274
+ + GC G +G+ LG K S S A + SFS C
Sbjct: 200 LQD-VVLGCSSTHDGQSFKSV--DGVLSLGNAKISFASRAAAR--FGGSFSYCLVDHLAP 254
Query: 275 SDGTGRISFGDKGSPGQ------GETPFSLRQTHPTYNITITQVSVGGNAVNFEFS---- 324
+ TG ++FG PGQ +T L P Y + + V V G A++
Sbjct: 255 RNATGYLAFG----PGQVPRTPATQTKLFLDPAMPFYGVKVDAVHVAGQALDIPAEVWDP 310
Query: 325 ----AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
I DSGT+ T L PAY + L + PFE+CY
Sbjct: 311 KSGGVILDSGTTLTVLATPAYKAVVAALTKLLAGVPKVDFP--PFEHCY 357
>gi|224142007|ref|XP_002324352.1| predicted protein [Populus trichocarpa]
gi|222865786|gb|EEF02917.1| predicted protein [Populus trichocarpa]
Length = 460
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 96/314 (30%), Positives = 140/314 (44%), Gaps = 42/314 (13%)
Query: 71 RLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYT-NVSVGQPALSFIVALDTGSDLF 129
RL RG+ + T L +G S+G Y V +G P F + DTGSD+
Sbjct: 91 RLSSRGMFPE-KQATTLPVQSGA------SIGAGDYVVTVGLGTPKKEFTLIFDTGSDIT 143
Query: 130 WLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCEL-------QKQCPS 181
W C+ CV + +P+TS++ + C+S LC+L + C S
Sbjct: 144 WTQCEPCVKTCYKQKEPR--------LNPSTSTSYKNISCSSALCKLVASGKKFSQSCSS 195
Query: 182 AGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAA 241
S C YQV+Y DG+ S GF + L L+ S +V FGCG+ G F A
Sbjct: 196 --STCLYQVQY-GDGSYSIGFFATETLTLS-----SSNVFKNFLFGCGQQNNGLFGGAAG 247
Query: 242 PNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLR-Q 300
GL K ++PS A S+ + S G +S G + S TP S
Sbjct: 248 LLGLG---RTKLALPSQTAKTYKKLFSYCLPASSSSKGYLSLGGQVSKSVKFTPLSADFD 304
Query: 301 THPTYNITITQVSVGGNAVNFEFSA-----IFDSGTSFTYLNDPAYTQISETFNSLAKEK 355
+ P Y + IT +SVGG ++ + SA + DSGT T L+ AY+++S F +L +
Sbjct: 305 STPFYGLDITGLSVGGRKLSIDESAFSAGTVIDSGTVITRLSPTAYSELSSAFQNLMTDY 364
Query: 356 RETSTSDLPFEYCY 369
TS + F+ CY
Sbjct: 365 PSTSGYSI-FDTCY 377
>gi|2541876|dbj|BAA22813.1| CND41, chloroplast nucleoid DNA binding protein [Nicotiana tabacum]
Length = 502
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 86/295 (29%), Positives = 120/295 (40%), Gaps = 42/295 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ NV +G P + DTGSDL W C CV + I+ P+TS T
Sbjct: 154 YIVNVGLGTPKKDLSLIFDTGSDLTWTQCQPCVKSCYAQQQP--------IFDPSTSKTY 205
Query: 164 SKVPCNSTLCELQKQCPS-----AGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
S + C S C K + SNC Y ++Y D + + GF +D L L ++
Sbjct: 206 SNISCTSAACSSLKSATGNSPGCSSSNCVYGIQY-GDSSFTIGFFAKDKLTLTQND---- 260
Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD-- 276
V FGCG+ G F A GL GLG D S+ A + FS C +
Sbjct: 261 -VFDGFMFGCGQNNKGLFGKTA---GLIGLGRDPLSIVQQTAQK--FGKYFSYCLPTSRG 314
Query: 277 GTGRISFGD----KGSP----GQGETPFSLRQTHPTYNITITQVSVGGNAVNF------E 322
G ++FG+ K S G TPF+ Q Y I + +SVGG A++
Sbjct: 315 SNGHLTFGNGNGVKASKAVKNGITFTPFASSQGTAYYFIDVLGISVGGKALSISPMLFQN 374
Query: 323 FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRSFLHL 377
I DSGT T L AY + F K T+ + + CY L ++ +
Sbjct: 375 AGTIIDSGTVITRLPSTAYGSLKSAFKQFM-SKYPTAPALSLLDTCYDLSNYTSI 428
>gi|297814776|ref|XP_002875271.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
lyrata]
gi|297321109|gb|EFH51530.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
lyrata]
Length = 451
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 75/271 (27%), Positives = 115/271 (42%), Gaps = 43/271 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ ++ +GQP S ++ DTGSDL W+ C C +C H ++ ++ P SST
Sbjct: 83 YFVDLRIGQPPQSLLLIADTGSDLVWVKCSACRNCSHHSPAT--------VFFPRHSSTF 134
Query: 164 SKVPCNSTLCELQKQCPSAG--------SNCPYQVRYLSDGTMSTGFLVEDVLHLATDEK 215
S C +C L + A S CPY+ Y +DG++++G + L T
Sbjct: 135 SPAHCYDPVCRLVPKPGRAPRCNHTRIHSTCPYEYGY-ADGSLTSGLFARETTSLKTSSG 193
Query: 216 QSKSVDSRISFGCGRVQTGSFLDGAA---PNGLFGLGMDKTSVPSILANQGLIPNSFSMC 272
+ + S ++FGCG +G + G + NG+ GLG S S L + N FS C
Sbjct: 194 KEAKLKS-VAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQLGRR--FGNKFSYC 250
Query: 273 -----FGSDGTGRISFGDKGSPGQGE--TPFSLRQTHPT-YNITITQVSVGGNAVNFEFS 324
T + GD G TP PT Y + + V V G + + S
Sbjct: 251 LMDYTLSPPPTSYLIIGDGGDAVSKLFFTPLLTNPLSPTFYYVKLKSVFVNGAKLRIDPS 310
Query: 325 -----------AIFDSGTSFTYLNDPAYTQI 344
+ DSGT+ +L DPAY +
Sbjct: 311 IWEIDDSGNGGTVMDSGTTLAFLADPAYRLV 341
>gi|326523839|dbj|BAJ96930.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 87/296 (29%), Positives = 121/296 (40%), Gaps = 46/296 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDF---NIYSPNTSS 161
++ VG PA F++ DTGSDL W+ C G +SS ++ P S
Sbjct: 110 YFVQFRVGTPAQPFVLVADTGSDLTWVKC------RGRRASSPDASPLASPRVFRPANSK 163
Query: 162 TSSKVPCNSTLCELQ-----KQCPSAGSN----CPYQVRYLSDGTMSTGFLVEDVLHLAT 212
+ + +PC+S C+ C SAG+ C Y RY D + + G + D +A
Sbjct: 164 SWAPIPCSSDTCKSYVPFSLANC-SAGTTPPAPCGYDYRY-KDKSSARGVVGTDAATIAL 221
Query: 213 DEKQS--KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFS 270
S K+ + GC G + +G+ LG S S A + FS
Sbjct: 222 SGSGSDRKAKLQEVVLGCTTSYDGQSFQSS--DGVLSLGNSNISFASRAAAR--FGGRFS 277
Query: 271 MCF-----GSDGTGRISFGDKGSP-GQGETPFSL-RQTHPTYNITITQVSVGGNAVNFEF 323
C + T ++FG G+ TP L Q P Y +T+ VSV G A+N
Sbjct: 278 YCLVDHLAPRNATSYLTFGPVGAAHSPSRTPLLLDAQVAPFYAVTVDAVSVAGKALNIPA 337
Query: 324 S---------AIFDSGTSFTYLNDPAYTQISETFNS-LAKEKRETSTSDLPFEYCY 369
AI DSGTS T L PAY + + LA+ R T PFEYCY
Sbjct: 338 EVWDVKKNGGAILDSGTSLTILATPAYKAVVAALSKQLARVPRVTMD---PFEYCY 390
>gi|357158688|ref|XP_003578209.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 443
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 76/266 (28%), Positives = 115/266 (43%), Gaps = 38/266 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ ++ +G P + LDTGSDL W C C+ CV Q F + P S +
Sbjct: 89 YLMSMGIGTPPRYYSAILDTGSDLIWTQCAPCMLCVD-------QPTPF--FDPAQSPSY 139
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
+K+PCNS +C + C YQ Y D + G L + T++ ++ R
Sbjct: 140 AKLPCNSPMCNALYYPLCYRNVCVYQYFY-GDSANTAGVLSNETFTFGTND--TRVTVPR 196
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQG--------LIPNSFSMCFGS 275
I+FGCG + GS +G+ G+ G G S+ S L + + P + FG+
Sbjct: 197 IAFGCGNLNAGSLFNGS---GMVGFGRGPLSLVSQLGSPRFSYCLTSFMSPVPSRLYFGA 253
Query: 276 DGTGRISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFS---------- 324
T + G P Q TPF + PT Y + +T +SVGG + + S
Sbjct: 254 YATLNSTSASTGEPVQ-STPFIVNPGLPTMYYLNMTGISVGGELLPIDPSVFAINDADGT 312
Query: 325 --AIFDSGTSFTYLNDPAYTQISETF 348
I DSG++ TYL AY + + F
Sbjct: 313 GGVIIDSGSTITYLARAAYDMVHQAF 338
>gi|225437854|ref|XP_002264056.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
Length = 436
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 105/393 (26%), Positives = 161/393 (40%), Gaps = 59/393 (15%)
Query: 1 MASSYRNSPVCVLLILLSCCAGCCFGFGTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYY 60
MASS + + +LL+L + F + R + +++ + G++ +
Sbjct: 1 MASSASHMIIVILLVL--AVSSALFSPAASTWRSLDRRPEKNGFRVSLRHVDSGGNYTKF 58
Query: 61 SALAHRDRYFRLRGRGLAAQGNDKTPLT---FSAGNDTYRLNSLGFLHYTNVSVGQPALS 117
L + RLR + L+A+ P AGN + +N +++G PA +
Sbjct: 59 ERLQRAVKRGRLRLQRLSAKTASFEPSVEAPVHAGNGEFLMN---------LAIGTPAET 109
Query: 118 FIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQ 176
+ +DTGSDL W C C C I+ P SS+ SK+PC+S LC +
Sbjct: 110 YSAIMDTGSDLIWTQCKPCKVCFDQPTP---------IFDPEKSSSFSKLPCSSDLC-VA 159
Query: 177 KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTG-S 235
S C Y+ Y D + + G L + SV S+I FGCG G +
Sbjct: 160 LPISSCSDGCEYRYSY-GDHSSTQGVLATETFTFG-----DASV-SKIGFGCGEDNRGRA 212
Query: 236 FLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GSDGTGRISFGDKGSPGQ 291
+ GA GL GLG S+ S L +P FS C S G + G + +
Sbjct: 213 YSQGA---GLVGLGRGPLSLISQLG----VPK-FSYCLTSIDDSKGISTLLVGSEATVKS 264
Query: 292 G-ETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA-----------IFDSGTSFTYLND 338
TP + P+ Y +++ +SVG + E S I DSGT+ TYL D
Sbjct: 265 AIPTPLIQNPSRPSFYYLSLEGISVGDTLLPIEKSTFSIQDDGSGGLIIDSGTTITYLKD 324
Query: 339 PAYTQISETFNSLAKEKRETSTSDLPFEYCYVL 371
A+ + + F S K + S S E C+ L
Sbjct: 325 SAFAALKKEFISQMKLDVDASGST-ELELCFTL 356
>gi|357130715|ref|XP_003566992.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 479
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 83/305 (27%), Positives = 121/305 (39%), Gaps = 47/305 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD---CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
++ VG PA F++ DTGSDL W+ C + +SS+ + P S
Sbjct: 95 YFVRFRVGTPAQPFLLVADTGSDLTWVKCRPAKAAAASTNSSSSASASSPRRAFRPEKSK 154
Query: 162 TSSKVPCNSTLCELQ-----KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLA----- 211
T + +PC S C CP+ GS C Y RY DG+ + G + + +A
Sbjct: 155 TWAPIPCASDTCSKSLPFSLSTCPTPGSPCAYDYRY-KDGSAARGTVGTESATIALSSSS 213
Query: 212 --TDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSF 269
+ K K+ + GC TG + A +G+ LG S S A++ F
Sbjct: 214 SSSKNKVKKAKLQGLVLGCTGSYTGPSFE--ASDGVLSLGYSNVSFASHAASR--FGGRF 269
Query: 270 SMCF-----GSDGTGRISFGDKGS----------PGQGETPFSL-RQTHPTYNITITQVS 313
S C + T ++FG + PG +TP L + P Y+++I +S
Sbjct: 270 SYCLVDHLSPRNATSYLTFGPNSALSGPCPAAAGPGARQTPLVLDSRMRPFYDVSIKAIS 329
Query: 314 VGGNAVNFEFS---------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP 364
V G + I DSGTS T L PAY + K R + P
Sbjct: 330 VDGELLKIPRDVWEVDGGGGVIVDSGTSLTVLAKPAYRAVVAALGK--KLARFPRVAMDP 387
Query: 365 FEYCY 369
FEYCY
Sbjct: 388 FEYCY 392
>gi|118486628|gb|ABK95151.1| unknown [Populus trichocarpa]
Length = 393
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 89/284 (31%), Positives = 129/284 (45%), Gaps = 39/284 (13%)
Query: 106 YTNVS--VGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
Y NV+ +GQP+ + + +DTGSDL WL CD CV C + P
Sbjct: 33 YYNVTLNIGQPSKPYFLDVDTGSDLTWLQCDAPCVQCTEAPH-------------PYYRP 79
Query: 162 TSSKVPCNSTLCEL-----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLA-TDEK 215
++ VPC +C+ +C + G C Y+V Y +DG S G LV D +L T EK
Sbjct: 80 RNNLVPCMDPICQSLHSNGDHRCENPG-QCDYEVEY-ADGGSSFGVLVTDTFNLNFTSEK 137
Query: 216 QSKSVDSRISFGCGRVQTGSFLDGA--APNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
+ + ++ GCG Q F G+ +G+ GLG K+S+ S L++ GL+ N C
Sbjct: 138 RHSPL---LALGCGYDQ---FPGGSHHPIDGVLGLGKGKSSIVSQLSSLGLVRNVIGHCL 191
Query: 274 GSDGTGRISFGDK--GSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE-FSAIFDSG 330
G G + FGD S TP S H Y+ + +++ G F+ FDSG
Sbjct: 192 SGHGGGFLFFGDDLYDSSRVAWTPMSPDAKH--YSPGLAELTFDGKTTGFKNLLTTFDSG 249
Query: 331 TSFTYLNDPAYTQ-ISETFNSLAKEKRETSTSDLPFEYCYVLRS 373
S+TYLN AY IS L+ + + D C+ R
Sbjct: 250 ASYTYLNSQAYQGLISLLKKELSGKPLREALDDQTLPLCWKGRK 293
>gi|159463556|ref|XP_001690008.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
gi|158283996|gb|EDP09746.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
Length = 547
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 63/197 (31%), Positives = 94/197 (47%), Gaps = 17/197 (8%)
Query: 100 SLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNT 159
LG+ +YT +++G P + LDTGS L PC C S +G ++ P
Sbjct: 77 ELGY-YYTYLTIGTPGQTVSGILDTGSTLPAFPCS--GCTRCGPSKTG------MFKPEL 127
Query: 160 SSTSSKVPCNSTLCEL-QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
SSTSS C+ C C C Y +RYL +G+ ++GFL ED+L + +
Sbjct: 128 SSTSSTFGCSDARCFCGANSCSCNNEQCGYSIRYL-EGSSTSGFLAEDMLAVGDGGPAAN 186
Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT 278
V FGC + ++G A +G+FG+G S+ L QG+I ++FSMCFG+
Sbjct: 187 FV-----FGCAQSESGLLYSQIA-DGVFGMGRTPASLYGQLVQQGVIDDAFSMCFGAPRE 240
Query: 279 GRISFGDKGSPGQGETP 295
G + G+ P P
Sbjct: 241 GVLLLGNVALPADAPAP 257
>gi|225216930|gb|ACN85225.1| aspartic proteinase nepenthesin-1 precursor [Oryza punctata]
gi|225216938|gb|ACN85232.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
Length = 516
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 82/269 (30%), Positives = 114/269 (42%), Gaps = 32/269 (11%)
Query: 99 NSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYS 156
+LG +Y V +G PA + V DTGSD W+ C CV + ++
Sbjct: 172 RALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQRE--------KLFD 223
Query: 157 PNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
P SST + V C + C +G +C Y V+Y DG+ S GF D L L++ +
Sbjct: 224 PARSSTYANVSCAAPACSDLDTRGCSGGHCLYGVQY-GDGSYSIGFFAMDTLTLSSYDAV 282
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--G 274
FGCG G F + A GL GLG KTS+P ++ F+ C
Sbjct: 283 KG-----FRFGCGERNEGLFGEAA---GLLGLGRGKTSLPVQTYDK--YGGVFAHCLPAR 332
Query: 275 SDGTGRISFGDKGSPGQ--GETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA------I 326
S GTG + FG GSP TP + Y + +T + VGG + S I
Sbjct: 333 STGTGYLDFG-AGSPAARLTTTPMLVDNGPTFYYVGLTGIRVGGRLLYIPQSVFATAGTI 391
Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEK 355
DSGT T L AY+ + F + +
Sbjct: 392 VDSGTVITRLPPAAYSSLRSAFAAAMSAR 420
>gi|224142009|ref|XP_002324353.1| predicted protein [Populus trichocarpa]
gi|222865787|gb|EEF02918.1| predicted protein [Populus trichocarpa]
Length = 412
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 96/314 (30%), Positives = 140/314 (44%), Gaps = 42/314 (13%)
Query: 71 RLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYT-NVSVGQPALSFIVALDTGSDLF 129
RL RG+ + T L +G S+G Y V +G P F + DTGSD+
Sbjct: 43 RLSSRGMFPE-KQATTLPVQSGA------SIGAGDYVVTVGLGTPKKEFTLIFDTGSDIT 95
Query: 130 WLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCEL-------QKQCPS 181
W C+ CV + +P+TS++ + C+S LC+L + C S
Sbjct: 96 WTQCEPCVKTCYKQKEPR--------LNPSTSTSYKNISCSSALCKLVASGKKFSQSCSS 147
Query: 182 AGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAA 241
S C YQV+Y DG+ S GF + L L+ S +V FGCG+ G F A
Sbjct: 148 --STCLYQVQY-GDGSYSIGFFATETLTLS-----SSNVFKNFLFGCGQQNNGLFGGAAG 199
Query: 242 PNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLR-Q 300
GL K ++PS A S+ + S G +S G + S TP S
Sbjct: 200 LLGLG---RTKLALPSQTAKTYKKLFSYCLPASSSSKGYLSLGGQVSKSVKFTPLSADFD 256
Query: 301 THPTYNITITQVSVGGNAVNFEFSA-----IFDSGTSFTYLNDPAYTQISETFNSLAKEK 355
+ P Y + IT +SVGG ++ + SA + DSGT T L+ AY+++S F +L +
Sbjct: 257 STPFYGLDITGLSVGGRQLSIDESAFSAGTVIDSGTVITRLSPTAYSELSSAFQNLMTDY 316
Query: 356 RETSTSDLPFEYCY 369
TS + F+ CY
Sbjct: 317 PSTSGYSI-FDTCY 329
>gi|115484513|ref|NP_001065918.1| Os11g0184800 [Oryza sativa Japonica Group]
gi|122221757|sp|Q0IU52.1|ASP1_ORYSJ RecName: Full=Aspartic proteinase Asp1; Short=OSAP1; Short=OsAsp1;
AltName: Full=Nucellin-like protein; Flags: Precursor
gi|33340111|gb|AAQ14543.1|AF308691_1 nucellin-like protein [Oryza sativa Japonica Group]
gi|33340113|gb|AAQ14544.1|AF308692_1 nucellin-like protein [Oryza sativa Japonica Group]
gi|62954898|gb|AAY23267.1| nucellin-like protein [Oryza sativa Japonica Group]
gi|77548967|gb|ABA91764.1| Aspartic proteinase Asp1 precursor, putative, expressed [Oryza
sativa Japonica Group]
gi|113644622|dbj|BAF27763.1| Os11g0184800 [Oryza sativa Japonica Group]
gi|215766817|dbj|BAG99045.1| unnamed protein product [Oryza sativa Japonica Group]
gi|385717694|gb|AFI71282.1| aspartic proteinase [Oryza sativa Japonica Group]
Length = 410
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 76/282 (26%), Positives = 119/282 (42%), Gaps = 36/282 (12%)
Query: 106 YTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ +++G PA S+ + +DTGS L WL CD C +C ++ +Y P +
Sbjct: 39 FITMNIGDPAKSYFLDIDTGSTLTWLQCDAPCTNC---------NIVPHVLYKP---TPK 86
Query: 164 SKVPCNSTLCE-------LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
V C +LC K+C S C Y ++Y+ +M G LV D L+
Sbjct: 87 KLVTCADSLCTDLYTDLGKPKRCGSQ-KQCDYVIQYVDSSSM--GVLVIDRFSLSASNGT 143
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAP-NGLFGLGMDKTSVPSILANQGLI-PNSFSMCFG 274
+ + I+FGCG Q + P + + GL K ++ S L +QG+I + C
Sbjct: 144 NPTT---IAFGCGYDQGKKNRNVPIPVDSILGLSRGKVTLLSQLKSQGVITKHVLGHCIS 200
Query: 275 SDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE---FSAIFDSGT 331
S G G + FGD P G T + + H Y+ + N+ + IFDSG
Sbjct: 201 SKGGGFLFFGDAQVPTSGVTWTPMNREHKYYSPGHGTLHFDSNSKAISAAPMAVIFDSGA 260
Query: 332 SFTYLNDPAY----TQISETFNSLAKEKRETSTSDLPFEYCY 369
++TY Y + + T NS K E + D C+
Sbjct: 261 TYTYFAAQPYQATLSVVKSTLNSECKFLTEVTEKDRALTVCW 302
>gi|242041575|ref|XP_002468182.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
gi|241922036|gb|EER95180.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
Length = 490
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 89/296 (30%), Positives = 125/296 (42%), Gaps = 50/296 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ ++VG P + ++ALDT SDL WL C C C SG V D P S++
Sbjct: 138 YIAKIAVGTPGVEALLALDTASDLTWLQCQPCRRCY----PQSGPVFD-----PRHSTSY 188
Query: 164 SKVPCNSTLCELQKQCPSAGSN---CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
++ N+ C+ + + C Y V Y DG+ + G +E+ L A +
Sbjct: 189 REMSFNAADCQALGRSGGGDAKRGTCVYTVGY-GDGSTTVGDFIEETLTFAGGVRL---- 243
Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS--DGT 278
RIS GCG G F GA G+ GLG S P+ + + G +FS C G
Sbjct: 244 -PRISIGCGHDNKGLF--GAPAAGILGLGRGLMSFPNQIDHNG----TFSYCLVDFLSGP 296
Query: 279 GRIS----FGDKG---SPGQGETPFSLRQTHPT-YNITITQVSVGGNAV----------- 319
G +S FG SP TP L PT Y + +T +SVGG V
Sbjct: 297 GSLSSTLTFGAGAVDTSPPVSFTPTVLNLNMPTFYYVRLTGISVGGVRVPGVTERDLQLD 356
Query: 320 --NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP--FEYCYVL 371
I DSGT+ T L PAYT + F ++A + + S F+ CY +
Sbjct: 357 PYTGRGGVIVDSGTAVTRLARPAYTAFRDAFRAVAVDLGQVSIGGPSGFFDTCYTV 412
>gi|37542277|gb|AAK81699.1| aspartyl proteinase [Oryza sativa]
Length = 411
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 74/279 (26%), Positives = 120/279 (43%), Gaps = 29/279 (10%)
Query: 106 YTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNT--SS 161
+ +++ PA + + +DTGS L WL CD C++C + +Y P +
Sbjct: 39 FVTMNISDPAKPYFLDIDTGSTLTWLQCDYPCINC---------NKVPHGLYKPELKYAV 89
Query: 162 TSSKVPCNSTLCELQKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
++ C +L+K N C Y ++Y+ G S G L+ D L +
Sbjct: 90 KCTEQRCADLYADLRKPMKCGPKNQCHYGIQYV--GGSSIGVLIVDSFSLPASNGTN--- 144
Query: 221 DSRISFGCGRVQTGSFLDGAAP-NGLFGLGMDKTSVPSILANQGLI-PNSFSMCFGSDGT 278
+ I+FGCG Q + + P NG+ GLG K ++ S L +QG+I + C S G
Sbjct: 145 PTSIAFGCGYNQGKNNHNVPTPVNGILGLGRGKVTLLSQLKSQGVITKHVLGHCISSKGK 204
Query: 279 GRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVN----FEFSAIFDSGTSFT 334
G + FGD P G T + + H Y+ + N + IFDSG ++T
Sbjct: 205 GFLFFGDAKVPTSGVTWSPMNREHKHYSPRQGTLHFNSNKQSPISAAPMEVIFDSGATYT 264
Query: 335 YLN-DPAYTQISETFNSLAKEKR---ETSTSDLPFEYCY 369
Y P + +S ++L+KE + E D C+
Sbjct: 265 YFALQPYHATLSVVKSTLSKECKFLTEVKEKDRALTVCW 303
>gi|224142005|ref|XP_002324351.1| predicted protein [Populus trichocarpa]
gi|222865785|gb|EEF02916.1| predicted protein [Populus trichocarpa]
Length = 472
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 96/314 (30%), Positives = 140/314 (44%), Gaps = 42/314 (13%)
Query: 71 RLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYT-NVSVGQPALSFIVALDTGSDLF 129
RL RG+ + T L +G S+G Y V +G P F + DTGSD+
Sbjct: 103 RLSSRGMFPE-KQATTLPVQSGA------SIGAGDYVVTVGLGTPKKEFTLIFDTGSDIT 155
Query: 130 WLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCEL-------QKQCPS 181
W C+ CV + +P+TS++ + C+S LC+L + C S
Sbjct: 156 WTQCEPCVKTCYKQKEPR--------LNPSTSTSYKNISCSSALCKLVASGKKFSQSCSS 207
Query: 182 AGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAA 241
S C YQV+Y DG+ S GF + L L+ S +V FGCG+ G F A
Sbjct: 208 --STCLYQVQY-GDGSYSIGFFATETLTLS-----SSNVFKNFLFGCGQQNNGLFGGAAG 259
Query: 242 PNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLR-Q 300
GL K ++PS A S+ + S G +S G + S TP S
Sbjct: 260 LLGLG---RTKLALPSQTAKTYKKLFSYCLPASSSSKGYLSLGGQVSKSVKFTPLSADFD 316
Query: 301 THPTYNITITQVSVGGNAVNFEFSA-----IFDSGTSFTYLNDPAYTQISETFNSLAKEK 355
+ P Y + IT +SVGG ++ + SA + DSGT T L+ AY+++S F +L +
Sbjct: 317 STPFYGLDITGLSVGGRKLSIDESAFSAGTVIDSGTVITRLSPTAYSELSSAFQNLMTDY 376
Query: 356 RETSTSDLPFEYCY 369
TS + F+ CY
Sbjct: 377 PSTSGYSI-FDTCY 389
>gi|222615640|gb|EEE51772.1| hypothetical protein OsJ_33215 [Oryza sativa Japonica Group]
Length = 775
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 76/282 (26%), Positives = 119/282 (42%), Gaps = 36/282 (12%)
Query: 106 YTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ +++G PA S+ + +DTGS L WL CD C +C ++ +Y P +
Sbjct: 404 FITMNIGDPAKSYFLDIDTGSTLTWLQCDAPCTNC---------NIVPHVLYKP---TPK 451
Query: 164 SKVPCNSTLCE-------LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
V C +LC K+C S C Y ++Y+ +M G LV D L+
Sbjct: 452 KLVTCADSLCTDLYTDLGKPKRCGSQ-KQCDYVIQYVDSSSM--GVLVIDRFSLSASNGT 508
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAP-NGLFGLGMDKTSVPSILANQGLI-PNSFSMCFG 274
+ + I+FGCG Q + P + + GL K ++ S L +QG+I + C
Sbjct: 509 NPTT---IAFGCGYDQGKKNRNVPIPVDSILGLSRGKVTLLSQLKSQGVITKHVLGHCIS 565
Query: 275 SDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE---FSAIFDSGT 331
S G G + FGD P G T + + H Y+ + N+ + IFDSG
Sbjct: 566 SKGGGFLFFGDAQVPTSGVTWTPMNREHKYYSPGHGTLHFDSNSKAISAAPMAVIFDSGA 625
Query: 332 SFTYLNDPAY----TQISETFNSLAKEKRETSTSDLPFEYCY 369
++TY Y + + T NS K E + D C+
Sbjct: 626 TYTYFAAQPYQATLSVVKSTLNSECKFLTEVTEKDRALTVCW 667
Score = 42.0 bits (97), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 47/167 (28%), Positives = 71/167 (42%), Gaps = 27/167 (16%)
Query: 184 SNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQ-TGSFLDGAAP 242
+ C Y+++Y +DG + G L+ D L + + FGCG Q G +P
Sbjct: 27 TQCDYEIKY-ADGASTIGALIVDQFSLP-----RIATRPNLPFGCGYNQGIGENFQQTSP 80
Query: 243 -NGLFGLGMDKTSVPSILANQGLI-PNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQ 300
NG+ GL K S S L G+I + C S G G + GD G G +L
Sbjct: 81 VNGILGLDRGKVSFVSQLKMLGIITKHVVGHCLSSGGGGLLFVGD----GDG----NLVL 132
Query: 301 THPTY------NITITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAY 341
H Y + + S+G N ++ +FDSG+++TY Y
Sbjct: 133 LHANYYSPGSATLYFDRHSLGMNPMD----VVFDSGSTYTYFTAQPY 175
>gi|356528627|ref|XP_003532901.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 430
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 83/274 (30%), Positives = 123/274 (44%), Gaps = 30/274 (10%)
Query: 110 SVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPC 168
S+G P++ + DTGSDL WL C C +C + ++ P SST VPC
Sbjct: 93 SLGTPSVERLAIFDTGSDLSWLQCTPCKTCYPQ---------EAPLFDPTQSSTYVDVPC 143
Query: 169 NSTLCEL----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL-ATDEKQSKSVDSR 223
S C L Q++C S+ C Y +Y +D + + G L D + +T Q + +
Sbjct: 144 ESQPCTLFPQNQRECGSS-KQCIYLHQYGTD-SFTIGRLGYDTISFSSTGMGQGGATFPK 201
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDGTGR 280
FGC +F NG GLG S+ S L +Q I + FS C F S TG+
Sbjct: 202 SVFGCAFYSNFTFKISTKANGFVGLGPGPLSLASQLGDQ--IGHKFSYCMVPFSSTSTGK 259
Query: 281 ISFGDKGSPGQ-GETPFSLRQTHPTYNI-TITQVSVGGNAV---NFEFSAIFDSGTSFTY 335
+ FG + TPF + ++P+Y + + ++VG V + I DS T+
Sbjct: 260 LKFGSMAPTNEVVSTPFMINPSYPSYYVLNLEGITVGQKKVLTGQIGGNIIIDSVPILTH 319
Query: 336 LNDPAYTQ-ISETFNSLAKEKRETSTSDLPFEYC 368
L YT IS ++ E E + + PFEYC
Sbjct: 320 LEQGIYTDFISSVKEAINVEVAEDAPT--PFEYC 351
>gi|24430421|dbj|BAC22609.1| 41 kD chloroplast nucleoid DNA binding protein (CND41) [Nicotiana
sylvestris]
Length = 502
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 83/295 (28%), Positives = 118/295 (40%), Gaps = 42/295 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ NV +G P + DTGSDL W C CV + I+ P+ S T
Sbjct: 154 YIVNVGLGTPKKDLSLIFDTGSDLTWTQCQPCVKSCYAQQQP--------IFDPSASKTY 205
Query: 164 SKVPCNSTLCELQKQCPS-----AGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
S + C ST C K + SNC Y ++Y D + + GF +D L L ++
Sbjct: 206 SNISCTSTACSGLKSATGNSPGCSSSNCVYGIQY-GDSSFTVGFFAKDTLTLTQND---- 260
Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD-- 276
V FGCG+ G F A GL GLG D S+ A + FS C +
Sbjct: 261 -VFDGFMFGCGQNNRGLFGKTA---GLIGLGRDPLSIVQQTAQK--FGKYFSYCLPTSRG 314
Query: 277 GTGRISFGDKGSPGQGE--------TPFSLRQTHPTYNITITQVSVGGNAVNF------E 322
G ++FG+ + TPF+ Q Y I + +SVGG A++
Sbjct: 315 SNGHLTFGNGNGVKTSKAVKNGITFTPFASSQGATFYFIDVLGISVGGKALSISPMLFQN 374
Query: 323 FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRSFLHL 377
I DSGT T L Y + TF K T+ + + CY L ++ +
Sbjct: 375 AGTIIDSGTVITRLPSTVYGSLKSTFKQFM-SKYPTAPALSLLDTCYDLSNYTSI 428
>gi|147862576|emb|CAN79341.1| hypothetical protein VITISV_006338 [Vitis vinifera]
Length = 436
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 96/339 (28%), Positives = 141/339 (41%), Gaps = 57/339 (16%)
Query: 55 GSFAYYSALAHRDRYFRLRGRGLAAQGNDKTP---LTFSAGNDTYRLNSLGFLHYTNVSV 111
G++ + L + RLR + L+A+ P AGN + +N +++
Sbjct: 53 GNYTKFERLQRAVKRGRLRLQRLSAKTASFEPSVEAPVHAGNGEFLMN---------LAI 103
Query: 112 GQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNS 170
G PA ++ +DTGSDL W C C C I+ P SS+ SK+PC+S
Sbjct: 104 GTPAETYSAIMDTGSDLIWTQCKPCKVCFDQPTP---------IFDPEKSSSFSKLPCSS 154
Query: 171 TLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGR 230
LC + S C Y+ Y D + + G L + SV S+I FGCG
Sbjct: 155 DLC-VALPISSCSDGCEYRYSY-GDHSSTQGVLATETFTFG-----DASV-SKIGFGCGE 206
Query: 231 VQTG-SFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GSDGTGRISFGD 285
G ++ GA GL GLG S+ S L +P FS C S G + G
Sbjct: 207 DNRGRAYSQGA---GLVGLGRGPLSLISQLG----VPK-FSYCLTSIDDSKGISTLLVGS 258
Query: 286 KGSPGQG-ETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA-----------IFDSGTS 332
+ + TP + P+ Y +++ +SVG + E S I DSGT+
Sbjct: 259 EATVKSAIPTPLIQNPSRPSFYYLSLEGISVGDTLLPIEKSTFSIQDDGSGGLIIDSGTT 318
Query: 333 FTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVL 371
TYL D A+ + + F S K + S S E C+ L
Sbjct: 319 ITYLKDNAFAALKKEFISQMKLDVDASGST-ELELCFTL 356
>gi|326512608|dbj|BAJ99659.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 484
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 90/281 (32%), Positives = 116/281 (41%), Gaps = 36/281 (12%)
Query: 80 QGNDKTPLTFSAGNDTYRLNSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCD-CVS 137
Q K +T A R SLG +Y ++ +G PA V DTGSDL W+ C C
Sbjct: 124 QARGKKGVTLPA----QRGISLGTGNYVVSMGLGTPARDMTVVFDTGSDLSWVQCTPCSD 179
Query: 138 CVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE-LQKQCPSAGSNCPYQVRYLSDG 196
C + ++ P SST S VPC S C+ L + S C Y+V Y D
Sbjct: 180 CYEQKDP---------LFDPARSSTYSAVPCASPECQGLDSRSCSRDKKCRYEVVY-GDQ 229
Query: 197 TMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVP 256
+ + G L D L L + V FGCG TG F G A +GL GLG +K S+
Sbjct: 230 SQTDGALARDTLTLTQSD-----VLPGFVFGCGEQDTGLF--GRA-DGLVGLGREKVSLS 281
Query: 257 SILANQGLIPNSFSMCFGS--DGTGRISFGDKGSPGQGETPFSLRQTHPT-YNITITQVS 313
S A++ FS C S G +S G T R P+ Y + + V
Sbjct: 282 SQAASK--YGAGFSYCLPSSPSAAGYLSLGGPAPANARFTAMETRHDSPSFYYVRLVGVK 339
Query: 314 VGGNAVNFE---FSA---IFDSGTSFTYLNDPAYTQISETF 348
V G V FSA + DSGT T L Y + F
Sbjct: 340 VAGRTVRVSPIVFSAAGTVIDSGTVITRLPPRVYAALRSAF 380
>gi|449462553|ref|XP_004149005.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 418
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 79/259 (30%), Positives = 113/259 (43%), Gaps = 35/259 (13%)
Query: 111 VGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCN 169
+G P + ++ DTGSDL W C C+ C L I++P S++ S VPCN
Sbjct: 86 IGTPPVDYLGIADTGSDLTWAQCLPCLKCYQQLRP---------IFNPLKSTSFSHVPCN 136
Query: 170 STLCEL--QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFG 227
+ C C G C Y Y D T S G L + + + S SV S I G
Sbjct: 137 TQTCHAVDDGHCGVQGV-CDYSYTY-GDRTYSKGDLGFEKITIG-----SSSVKSVI--G 187
Query: 228 CGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG---SDGTGRISFG 284
CG +G F +G+ GLG + S+ S ++ I FS C S G+I+FG
Sbjct: 188 CGHASSGGF---GFASGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHANGKINFG 244
Query: 285 DKG---SPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS----AIFDSGTSFTYLN 337
PG TP + T Y IT+ +S+ GN + F+ I DSGT+ ++L
Sbjct: 245 QNAVVSGPGVVSTPLISKNTVTYYYITLEAISI-GNERHMAFAKQGNVIIDSGTTLSFLP 303
Query: 338 DPAYTQISETFNSLAKEKR 356
Y + + + K KR
Sbjct: 304 KELYDGVVSSLLKVVKAKR 322
>gi|17979392|gb|AAL49921.1| unknown protein [Arabidopsis thaliana]
Length = 439
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 77/314 (24%), Positives = 124/314 (39%), Gaps = 53/314 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
++T + VG PA F V +DTGS+L W V+C + + ++ + S +
Sbjct: 84 YFTEIRVGTPAKKFRVVVDTGSELTW-----VNCRYRARGKDNR----RVFRADESKSFK 134
Query: 165 KVPCNSTLCELQ-------KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
V C + C++ CP+ + C Y RY +DG+ + G ++ + + +
Sbjct: 135 TVGCLTQTCKVDLMNLFSLTTCPTPSTPCSYDYRY-ADGSAAQGVFAKETITVGLTNGRM 193
Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC----- 272
+ + GC TG GA +G+ GL S S + L FS C
Sbjct: 194 ARLPGHL-IGCSSSFTGQSFQGA--DGVLGLAFSDFSFTSTATS--LYGAKFSYCLVDHL 248
Query: 273 ----------FGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE 322
FGS + + +F + TP L + P Y I + +S+G + ++
Sbjct: 249 SNKNVSNYLIFGSSRSTKTAF-------RRTTPLDLTRIPPFYAINVIGISLGYDMLDIP 301
Query: 323 FSA---------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRS 373
I DSGTS T L D AY Q+ E + +P EYC+ S
Sbjct: 302 SQVWDATSGGGTILDSGTSLTLLADAAYKQVVTGLARYLVELKRVKPEGVPIEYCFSFTS 361
Query: 374 FLHLQALVVLPFPL 387
++ L L F L
Sbjct: 362 GFNVSKLPQLTFHL 375
>gi|168008816|ref|XP_001757102.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162691600|gb|EDQ77961.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 406
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 92/311 (29%), Positives = 129/311 (41%), Gaps = 45/311 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ +SVG P + +DTGSD+ WL C CV+C H ++ I+ P SST
Sbjct: 58 YFIRISVGTPPRRMYLVMDTGSDILWLQCAPCVNCYHQSDA---------IFDPYKSSTY 108
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
S + C++ C + C YQV Y DG+ +TG D + L + + V ++
Sbjct: 109 STLGCSTRQCLNLDIGTCQANKCLYQVDY-GDGSFTTGEFGTDDVSLNSTSGVGQVVLNK 167
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT----- 278
I GCG G F+ A L GLG S P+ + Q FS C T
Sbjct: 168 IPLGCGHDNEGYFVGAAG---LLGLGKGPLSFPNQVDPQN--GGRFSYCLTDRETDSTEG 222
Query: 279 GRISFGDKGSPGQGE--TPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA---------- 325
+ FG+ P G TP PT Y + +T +SVGG + SA
Sbjct: 223 SSLVFGEAAVPPAGARFTPQDSNMRVPTFYYLKMTGISVGGTILTIPTSAFQLDSLGNGG 282
Query: 326 -IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRSF---------L 375
I DSGTS T L + AY + + F + + T+ L F+ CY L L
Sbjct: 283 VIIDSGTSVTRLQNAAYASLRDAFRAGTSDLAPTAGFSL-FDTCYDLSGLASVDVPTVTL 341
Query: 376 HLQALVVLPFP 386
H Q L P
Sbjct: 342 HFQGGTDLKLP 352
>gi|18461217|dbj|BAB84414.1| chloroplast nucleoid DNA-binding protein cnd41-like [Oryza sativa
Japonica Group]
Length = 446
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 99/313 (31%), Positives = 137/313 (43%), Gaps = 55/313 (17%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ V VG P+ ++ +DTGSDL WL C C C + GQV D P SST
Sbjct: 86 YFALVGVGTPSTKAMLVIDTGSDLVWLQCSPCRRCY----AQRGQVFD-----PRRSSTY 136
Query: 164 SKVPCNSTLCELQK--QCPS---AGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
+VPC+S C + C S AG C Y V Y DG+ STG L D L A D
Sbjct: 137 RRVPCSSPQCRALRFPGCDSGGAAGGGCRYMVAY-GDGSSSTGDLATDKLAFAND----- 190
Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT 278
+ + ++ GCGR G F D AA GL G+G K S+ + +A + F C G D T
Sbjct: 191 TYVNNVTLGCGRDNEGLF-DSAA--GLLGVGRGKISISTQVAPA--YGSVFEYCLG-DRT 244
Query: 279 GR------ISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGGNAVNFEFSA--- 325
R + FG +P T F+ ++P Y + + SVGG V +A
Sbjct: 245 SRSTRSSYLVFGR--TPEPPSTAFTALLSNPRRPSLYYVDMAGFSVGGERVTGFSNASLA 302
Query: 326 ----------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETST--SDLPFEYCYVLRS 373
+ DSGT+ + AY + + F++ A+ F+ CY LR
Sbjct: 303 LDTATGRGGVVVDSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHSVFDACYDLRG 362
Query: 374 FLHLQA-LVVLPF 385
A L+VL F
Sbjct: 363 RPAASAPLIVLHF 375
>gi|20466302|gb|AAM20468.1| putative aspartyl protease [Arabidopsis thaliana]
gi|23198124|gb|AAN15589.1| putative aspartyl protease [Arabidopsis thaliana]
Length = 320
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 54/160 (33%), Positives = 83/160 (51%), Gaps = 15/160 (9%)
Query: 195 DGTMSTGFLVEDVLHL--ATDEKQSKSVDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMD 251
DG+ + G+LV+DV+HL T +Q+ S + I FGCG Q+G + AA +G+ G G
Sbjct: 4 DGSSTNGYLVKDVVHLDLVTGNRQTGSTNGTIIFGCGSKQSGQLGESQAAVDGIMGFGQS 63
Query: 252 KTSVPSILANQGLIPNSFSMCF-GSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITIT 310
+S S LA+QG + SF+ C ++G G + G+ SP TP + H Y++ +
Sbjct: 64 NSSFISQLASQGKVKRSFAHCLDNNNGGGIFAIGEVVSPKVKTTPMLSKSAH--YSVNLN 121
Query: 311 QVSVGGNAVNFEFSA---------IFDSGTSFTYLNDPAY 341
+ VG + + +A I DSGT+ YL D Y
Sbjct: 122 AIEVGNSVLELSSNAFDSGDDKGVIIDSGTTLVYLPDAVY 161
>gi|30682289|ref|NP_187876.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|11994412|dbj|BAB02414.1| chloroplast nucleoid DNA binding protein-like [Arabidopsis
thaliana]
gi|332641715|gb|AEE75236.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 461
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 77/314 (24%), Positives = 124/314 (39%), Gaps = 53/314 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
++T + VG PA F V +DTGS+L W V+C + + ++ + S +
Sbjct: 106 YFTEIRVGTPAKKFRVVVDTGSELTW-----VNCRYRARGKDNR----RVFRADESKSFK 156
Query: 165 KVPCNSTLCELQ-------KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
V C + C++ CP+ + C Y RY +DG+ + G ++ + + +
Sbjct: 157 TVGCLTQTCKVDLMNLFSLTTCPTPSTPCSYDYRY-ADGSAAQGVFAKETITVGLTNGRM 215
Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC----- 272
+ + GC TG GA +G+ GL S S + L FS C
Sbjct: 216 ARLPGHL-IGCSSSFTGQSFQGA--DGVLGLAFSDFSFTSTATS--LYGAKFSYCLVDHL 270
Query: 273 ----------FGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE 322
FGS + + +F + TP L + P Y I + +S+G + ++
Sbjct: 271 SNKNVSNYLIFGSSRSTKTAF-------RRTTPLDLTRIPPFYAINVIGISLGYDMLDIP 323
Query: 323 FSA---------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRS 373
I DSGTS T L D AY Q+ E + +P EYC+ S
Sbjct: 324 SQVWDATSGGGTILDSGTSLTLLADAAYKQVVTGLARYLVELKRVKPEGVPIEYCFSFTS 383
Query: 374 FLHLQALVVLPFPL 387
++ L L F L
Sbjct: 384 GFNVSKLPQLTFHL 397
>gi|168054484|ref|XP_001779661.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162668975|gb|EDQ55572.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 419
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 86/293 (29%), Positives = 124/293 (42%), Gaps = 62/293 (21%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ + +G P F + +D+GSDL W+ C C+ C D +Y+P+ SST
Sbjct: 65 YFVDFFLGTPPQKFSLIVDSGSDLLWVQCAPCLQCY---------AQDTPLYAPSNSSTF 115
Query: 164 SKVPCNSTLCELQK---------QCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
+ VPC S C L P A C Y+ RY +D ++S G
Sbjct: 116 NPVPCLSPECLLIPATEGFPCDFHYPGA---CAYEYRY-ADTSLSKGVFA---------- 161
Query: 215 KQSKSVDS----RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFS 270
+S +VD +++FGCGR GSF AA G+ GLG S S + N F+
Sbjct: 162 YESATVDDVRIDKVAFGCGRDNQGSF---AAAGGVLGLGQGPLSFGSQVGYA--YGNKFA 216
Query: 271 MCF-----GSDGTGRISFGDKGSPGQGE---TPFSLRQTHPT-YNITITQVSVGGNAVNF 321
C + + + FGD+ + TP +PT Y + I +V VGG ++
Sbjct: 217 YCLVNYLDPTSVSSWLIFGDELISTIHDLQFTPIVSNSRNPTLYYVQIEKVMVGGESLPI 276
Query: 322 EFSA-----------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDL 363
SA IFDSGT+ TY PAY I F+ + R S L
Sbjct: 277 SHSAWSLDFLGNGGSIFDSGTTVTYWLPPAYRNILAAFDKNVRYPRAASVQGL 329
>gi|168041176|ref|XP_001773068.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675615|gb|EDQ62108.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 365
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 77/266 (28%), Positives = 113/266 (42%), Gaps = 35/266 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+ V +G P F V +DTGSDL W+ C + N + ++ PNTS++ +
Sbjct: 13 YLATVRLGTPERVFSVIVDTGSDLTWVQCSPCGKCYSQNDA--------LFLPNTSTSFT 64
Query: 165 KVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRI 224
K+ C S LC + C Y Y DG+++TG V D + + Q + V
Sbjct: 65 KLACGSALCNGLPFPMCNQTTCVYWYSY-GDGSLTTGDFVYDTITMDGINGQKQQV-PNF 122
Query: 225 SFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-----GSDGTG 279
+FGCG GSF A +G+ GLG S S L + + FS C T
Sbjct: 123 AFGCGHDNEGSF---AGADGILGLGQGPLSFHSQL--KSVYNGKFSYCLVDWLAPPTQTS 177
Query: 280 RISFGDKGSPGQGET---PFSLRQTHPT-YNITITQVSVGGNAVNFEFSA---------- 325
+ FGD P + P PT Y + + +SVG N +N +
Sbjct: 178 PLLFGDAAVPILPDVKYLPILANPKVPTYYYVKLNGISVGDNLLNISSTVFDIDSVGGAG 237
Query: 326 -IFDSGTSFTYLNDPAYTQISETFNS 350
IFDSGT+ T L + AY ++ N+
Sbjct: 238 TIFDSGTTVTQLAEAAYKEVLAAMNA 263
>gi|255548662|ref|XP_002515387.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545331|gb|EEF46836.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 463
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 85/264 (32%), Positives = 120/264 (45%), Gaps = 45/264 (17%)
Query: 100 SLGFLHY-TNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPN 158
SLG +Y ++ +G P ++ DTGSDL W C S+ + D P
Sbjct: 128 SLGTGNYIVSIGLGSPKKDLMLIFDTGSDLTWARC-----------SAAETFD-----PT 171
Query: 159 TSSTSSKVPCNSTLCELQKQC---PS--AGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
S++ + V C++ LC PS A S C Y ++Y DG+ S GFL ++ L + +
Sbjct: 172 KSTSYANVSCSTPLCSSVISATGNPSRCAASTCVYGIQY-GDGSYSIGFLGKERLTIGST 230
Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
+ + + FGCG+ G F A GL GLG DK SV S A + FS C
Sbjct: 231 D-----IFNNFYFGCGQDVDGLFGKAA---GLLGLGRDKLSVVSQTAPK--YNQLFSYCL 280
Query: 274 -GSDGTGRISFGDKGSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEFSA----- 325
S TG +SFG S TP S + P+ YN+ +T ++VGG + S
Sbjct: 281 PSSSSTGFLSFGSSQSKSAKFTPLS---SGPSSFYNLDLTGITVGGQKLAIPLSVFSTAG 337
Query: 326 -IFDSGTSFTYLNDPAYTQISETF 348
I DSGT T L AY+ + F
Sbjct: 338 TIIDSGTVVTRLPPAAYSALRSAF 361
>gi|242089623|ref|XP_002440644.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
gi|241945929|gb|EES19074.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
Length = 469
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 88/328 (26%), Positives = 126/328 (38%), Gaps = 55/328 (16%)
Query: 77 LAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCV 136
L+ D PL G Y + S+G P DTGSDL W CD
Sbjct: 81 LSNNDTDTVPLRMDGGGGAYDME---------FSIGTPPQKLTALADTGSDLIWTKCD-- 129
Query: 137 SCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQK-----QCPSAGSNCPYQVR 191
+ Y PN SST +++PC+ LC + +C + G+ C Y+
Sbjct: 130 ------AGGGAAWGGSSSYHPNASSTFTRLPCSDRLCAALRSYSLARCAAGGAECDYKYA 183
Query: 192 YL--SDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLG 249
Y D + GFL + L D + FGC G + +GA GL GLG
Sbjct: 184 YGLGDDPDFTQGFLGSETFTLGGDAVPG------VGFGCTTALEGDYGEGA---GLVGLG 234
Query: 250 MDKTSVPSILANQGLIPNSFSMCFGSDGTGR--ISFGDKGS---PGQGETPFSLRQTHPT 304
P L +Q L +F C +D + + FG + G G L +
Sbjct: 235 RG----PLSLVSQ-LDAGTFMYCLTADASKASPLLFGALATMTGAGAGVQSTGLLASTTF 289
Query: 305 YNITITQVSVGGNAV---NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTS 361
Y + + +++G +FDSGT+ TYL +PAYT+ F S + TS +
Sbjct: 290 YAVNLRSITIGSATTAGVGGPGGVVFDSGTTLTYLAEPAYTEAKAAFLS-----QTTSLT 344
Query: 362 DLP----FEYCYVLRSFLHLQALVVLPF 385
+ FE CY L +VL F
Sbjct: 345 PVEGRYGFEACYEKPDSARLIPAMVLHF 372
>gi|302757345|ref|XP_002962096.1| hypothetical protein SELMODRAFT_403622 [Selaginella moellendorffii]
gi|300170755|gb|EFJ37356.1| hypothetical protein SELMODRAFT_403622 [Selaginella moellendorffii]
Length = 506
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 73/248 (29%), Positives = 103/248 (41%), Gaps = 31/248 (12%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLN----SSSGQVIDFNIYSPNT 159
+Y + VG P +DTGSD+ W C C C N SS +Y P
Sbjct: 88 YYAQIGVGHPVQFLNAIVDTGSDILWFKCKLCQGCSSKKNVIVCSSIIMQGPITLYDPEL 147
Query: 160 SSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
S T+S C+ LC C ++C Y + Y D + STG DV+HL S
Sbjct: 148 SITASPATCSDPLCSEGGSCRGNNNSCAYDISY-EDTSSSTGIYFRDVVHLG----HKAS 202
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDG 277
+++ + GC +G + +G+ G G K SVP+ LA Q N F C +G
Sbjct: 203 LNTTMFLGCATSISGLW----PVDGIMGFGRSKVSVPNQLAAQAGSYNIFYHCLSGEKEG 258
Query: 278 TGRISFGDKGS-PGQGETPFSLRQTHPTYNITITQVSVGGNAV-----NFEFSA------ 325
G + G P TP + YN+ + +SV A+ FE++A
Sbjct: 259 GGILVLGKNDEFPEMVYTP--MLANDIVYNVKLVSLSVNSKALPIEASEFEYNATVGNGG 316
Query: 326 -IFDSGTS 332
I DSGTS
Sbjct: 317 TIIDSGTS 324
>gi|115476828|ref|NP_001062010.1| Os08g0469000 [Oryza sativa Japonica Group]
gi|42407407|dbj|BAD09565.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113623979|dbj|BAF23924.1| Os08g0469000 [Oryza sativa Japonica Group]
gi|125603713|gb|EAZ43038.1| hypothetical protein OsJ_27627 [Oryza sativa Japonica Group]
Length = 448
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 82/293 (27%), Positives = 126/293 (43%), Gaps = 48/293 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ ++++G P L + +DTGSDL W C CV C + Q + + P S+T
Sbjct: 92 YLMDLAIGTPPLRYTAMVDTGSDLIWTQCAPCVLC-------ADQPTPY--FRPARSATY 142
Query: 164 SKVPCNSTLC-ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
VPC S LC L S C YQ Y D + G L + SK + S
Sbjct: 143 RLVPCRSPLCAALPYPACFQRSVCVYQY-YYGDEASTAGVLASETFTFGA-ANSSKVMVS 200
Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDGTG 279
++FGCG + +G + +G+ GLG S+ S L P+ FS C F S
Sbjct: 201 DVAFGCGNINSGQLANS---SGMVGLGRGPLSLVSQLG-----PSRFSYCLTSFLSPEPS 252
Query: 280 RISFG-----------DKGSPGQGETPFSLRQTHPT-YNITITQVSVGGN---------A 318
R++FG GSP Q TP + P+ Y +++ +S+G A
Sbjct: 253 RLNFGVFATLNGTNASSSGSPVQ-STPLVVNAALPSLYFMSLKGISLGQKRLPIDPLVFA 311
Query: 319 VNFEFS--AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
+N + + DSGTS T+L AY + S+ + T+ +++ E C+
Sbjct: 312 INDDGTGGVFIDSGTSLTWLQQDAYDAVRRELVSVLRPLPPTNDTEIGLETCF 364
>gi|300681506|emb|CBH32600.1| pepsin A, putative, expressed [Triticum aestivum]
Length = 477
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 86/318 (27%), Positives = 127/318 (39%), Gaps = 46/318 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ VG PA F++ DTGSDL W+ C S L+ + + P S T
Sbjct: 97 YFVRFRVGTPAQPFLLVADTGSDLTWVKCRRPASANSSLSPADSGPGPGRAFRPEDSRTW 156
Query: 164 SKVPCNSTLCELQ-----KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
+ + C S C CP+ GS C Y RY DG+ + G + + +A ++ +
Sbjct: 157 APISCASDTCTKSLPFSLATCPTPGSPCAYDYRY-KDGSAARGTVGTESATIALSGREER 215
Query: 219 SVDSR-ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---- 273
+ + GC TG + A +G+ LG S S A++ FS C
Sbjct: 216 KAKLKGLVLGCSSSYTGPSFE--ASDGVLSLGYSGISFASHAASR--FGGRFSYCLVDHL 271
Query: 274 -GSDGTGRISFGDK---GSPGQG------------ETPFSL-RQTHPTYNITITQVSVGG 316
+ T ++FG SP +TP L R+ P Y++++ +SV G
Sbjct: 272 SPRNATSYLTFGPNPAVSSPRASPSSCAAAAPRARQTPLLLDRRMRPFYDVSLKAISVAG 331
Query: 317 NAVNFEFS---------AIFDSGTSFTYLNDPAYTQISETFNS-LAKEKRETSTSDLPFE 366
+ + I DSGTS T L PAY + + LA R T PFE
Sbjct: 332 EFLKIPRAVWDVEAGGGVILDSGTSLTVLAKPAYRAVVAALSKGLAGLPRVTMD---PFE 388
Query: 367 YCYVLRSFLHLQALVVLP 384
YCY S A V +P
Sbjct: 389 YCYNWTSPSGKDADVAVP 406
>gi|357118064|ref|XP_003560779.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 472
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 101/360 (28%), Positives = 147/360 (40%), Gaps = 64/360 (17%)
Query: 43 KGILAVDDLPKKGSFA--YYSALAHRDRYFR-LRGRGLAAQGNDKTPLTFSAGNDTYRLN 99
KG A D KK SFA S A D R GR + ++G + T+ G ++
Sbjct: 68 KGSSATDK--KKPSFAERLRSDRARADHILRKASGRRMMSEGGGASIPTYLGG----FVD 121
Query: 100 SLGFLHYTNVSVGQPALSFIVALDTGSDLFWL---PCDCVSCVHGLNSSSGQVIDFNIYS 156
SL ++ + +G PA+ V +DTGSDL W+ PC+ C + ++
Sbjct: 122 SLEYV--VTLGIGTPAVQQTVLIDTGSDLSWVQCKPCNASDCYPQKDP---------LFD 170
Query: 157 PNTSSTSSKVPCNSTLCELQKQCPSAG-------------SNCPYQVRYLSDGTMSTGFL 203
P+ SST + +PC S C KQ P G C Y + Y +G ++ G
Sbjct: 171 PSKSSTFATIPCASDAC---KQLPVDGYDNGCTNNTSGMPPQCGYAIEY-GNGAITEGVY 226
Query: 204 VEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQG 263
+ L L S +V FGCG Q G + +GL GLG S+ S A+
Sbjct: 227 STETLALG-----SSAVVKSFRFGCGSDQHGPYDKF---DGLLGLGGAPESLVSQTAS-- 276
Query: 264 LIPNSFSMCFG--SDGTGRISFGDKGSPGQGETPFSLRQTHP-------TYNITITQVSV 314
+ +FS C + G G ++ G S + F H Y +T+T +SV
Sbjct: 277 VYGGAFSYCLPPLNSGAGFLTLGAPNSTNNSNSGFVFTPMHAFSPKIATFYVVTLTGISV 336
Query: 315 GGNAVN-----FEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
GG A++ F I DSGT T + AY + F S E +D + CY
Sbjct: 337 GGKALDIPPAVFAKGNIVDSGTVITGIPTTAYKALRTAFRSAMAEYPLLPPADSALDTCY 396
>gi|357124468|ref|XP_003563922.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 450
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 85/268 (31%), Positives = 126/268 (47%), Gaps = 39/268 (14%)
Query: 100 SLGFLHY-TNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPN 158
S+G +Y T + +G P ++++ +D+GS L WL C C + +G +Y P
Sbjct: 102 SVGVGNYITRLGLGTPTTTYVMVVDSGSSLTWL--QCAPCAVSCHPQAGP-----LYDPR 154
Query: 159 TSSTSSKVPCNSTLC-ELQKQC--PSAGSN---CPYQVRYLSDGTMSTGFLVEDVLHLAT 212
SST + VPC++ C ELQ PS+ S C YQ Y DG+ S G+L +D + L+
Sbjct: 155 ASSTYAAVPCSAPQCAELQAATLNPSSCSGSGVCQYQASY-GDGSFSFGYLSKDTVSLS- 212
Query: 213 DEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC 272
S +GCG+ G F A GL GL +K S+ S LA + NSF+ C
Sbjct: 213 ----SSGSFPGFYYGCGQDNVGLFGRAA---GLIGLARNKLSLLSQLAPS--VGNSFAYC 263
Query: 273 F---GSDGTGRISFG---DKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFS 324
+ G +SFG D +PG+ + S Y +++ +SV G+ + S
Sbjct: 264 LPTSAAASAGYLSFGSNSDNKNPGKYSYTSMVSSSLDASLYFVSLAGMSVAGSPLAVPSS 323
Query: 325 ------AIFDSGTSFTYLNDPAYTQISE 346
I DSGT T L P YT +S+
Sbjct: 324 EYGSLPTIIDSGTVITRLPTPVYTALSK 351
>gi|125561848|gb|EAZ07296.1| hypothetical protein OsI_29544 [Oryza sativa Indica Group]
Length = 448
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 82/293 (27%), Positives = 126/293 (43%), Gaps = 48/293 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ ++++G P L + +DTGSDL W C CV C + Q + + P S+T
Sbjct: 92 YLMDLAIGTPPLRYTAMVDTGSDLIWTQCAPCVLC-------ADQPTPY--FRPARSATY 142
Query: 164 SKVPCNSTLC-ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
VPC S LC L S C YQ Y D + G L + SK + S
Sbjct: 143 RLVPCRSPLCAALPYPACFQRSVCVYQ-YYYGDEASTAGVLASETFTFGA-ANSSKVMVS 200
Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDGTG 279
++FGCG + +G + +G+ GLG S+ S L P+ FS C F S
Sbjct: 201 DVAFGCGNINSGQLANS---SGMVGLGRGPLSLVSQLG-----PSRFSYCLTSFLSPEPS 252
Query: 280 RISFG-----------DKGSPGQGETPFSLRQTHPT-YNITITQVSVGGN---------A 318
R++FG GSP Q TP + P+ Y +++ +S+G A
Sbjct: 253 RLNFGVFATLNGTNASSSGSPVQ-STPLVVNAALPSLYFMSLKGISLGQKRLPIDPLVFA 311
Query: 319 VNFEFS--AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
+N + + DSGTS T+L AY + S+ + T+ +++ E C+
Sbjct: 312 INDDGTGGVFIDSGTSLTWLQQDAYDAVRHELVSVLRPLPPTNDTEIGLETCF 364
>gi|45735840|dbj|BAD12875.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735966|dbj|BAD12995.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|125583491|gb|EAZ24422.1| hypothetical protein OsJ_08175 [Oryza sativa Japonica Group]
Length = 475
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 91/304 (29%), Positives = 124/304 (40%), Gaps = 40/304 (13%)
Query: 65 HRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLN-SLGFLHYT-NVSVGQPALSFIVAL 122
R Y + R G AA A L S+G L Y VS+G PA++ + +
Sbjct: 100 RRAEYIQRRVSGAAAAAPGMQLAGSKAATVPANLGFSIGTLQYVVTVSLGTPAVAQTLEV 159
Query: 123 DTGSDLFWL---PCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE----L 175
DTGSD+ W+ PC C + ++ P SS+ S VPC + C
Sbjct: 160 DTGSDVSWVQCKPCPSPPCYSQRDP---------LFDPTRSSSYSAVPCAAASCSQLALY 210
Query: 176 QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGS 235
C +G C Y V Y DG+ +TG D L L + FGCG Q G
Sbjct: 211 SNGC--SGGQCGYVVSY-GDGSTTTGVYSSDTLTLTGSNALKGFL-----FGCGHAQQGL 262
Query: 236 FLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--GSDGTGRISFGDKGS-PGQG 292
F A +GL GLG S+ S ++ FS C + G IS G S G
Sbjct: 263 F---AGVDGLLGLGRQGQSLVSQASST--YGGVFSYCLPPTQNSVGYISLGGPSSTAGFS 317
Query: 293 ETPFSLRQTHPTYNIT-ITQVSVGGNAVNFEFS-----AIFDSGTSFTYLNDPAYTQISE 346
TP PTY I + +SVGG ++ + S A+ D+GT T L AY+ +
Sbjct: 318 TTPLLTASNDPTYYIVMLAGISVGGQPLSIDASVFASGAVVDTGTVVTRLPPTAYSALRS 377
Query: 347 TFNS 350
F +
Sbjct: 378 AFRA 381
>gi|302753526|ref|XP_002960187.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
gi|300171126|gb|EFJ37726.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
Length = 398
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 80/273 (29%), Positives = 117/273 (42%), Gaps = 46/273 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ T +S+G PA F V DTGSDL W+ C C +C + + I+ P SS+
Sbjct: 40 YVTTISLGTPAKVFSVIADTGSDLIWIQCKPCQACFNQKDP---------IFDPEGSSSY 90
Query: 164 SKVPCNSTLCEL--QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
+ + C TLC+ +K C NC Y Y DG+ + G L + + L + + + K
Sbjct: 91 TTMSCGDTLCDSLPRKSC---SPNCDYSYGY-GDGSGTRGTLSSETVTLTSTQGE-KLAA 145
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-----GSD 276
I+FGCG + GSF D + GL GLG S S L + L + FS C
Sbjct: 146 KNIAFGCGHLNRGSFNDAS---GLVGLGRGNLSFVSQLGD--LFGHKFSYCLVPWRDAPS 200
Query: 277 GTGRISFGDKGSPGQG----ETPFSLRQTHPT----YNITITQVSVGGNAVNFEFSA--- 325
T + FGD+ S F+ +P Y + + +S+ G A+ +
Sbjct: 201 KTSPMFFGDESSSHSSGKKLHYAFTPMIHNPAMESFYYVKLKDISIAGRALRIPAGSFDI 260
Query: 326 --------IFDSGTSFTYLNDPAYTQISETFNS 350
IFDSGT+ T L D Y + S
Sbjct: 261 KPDGSGGMIFDSGTTLTLLPDAPYQIVLRALRS 293
>gi|343172998|gb|AEL99202.1| aspartyl protease family protein, partial [Silene latifolia]
Length = 584
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 68/249 (27%), Positives = 108/249 (43%), Gaps = 27/249 (10%)
Query: 111 VGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNS 170
+G P F + +DTGS + ++PC+ SC N + + P+ S T V CN
Sbjct: 2 IGTPPQEFALIVDTGSTVTYVPCN--SCDQCGNHQDPK------FQPDLSDTYHPVKCNP 53
Query: 171 TLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGR 230
C + C Y+ +Y ++ + S+G L ED++ S+ R FGC
Sbjct: 54 DCT-----CDTENDQCTYERQY-AEMSSSSGILGEDLVSFG---NMSELKPQRAVFGCEN 104
Query: 231 VQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISFGDKGS 288
+TG A +G+ GLG S+ L +G+I +SFS+C+G G G + G
Sbjct: 105 AETGDLFSQHA-DGIMGLGRGDLSIVDQLVEKGVINDSFSLCYGGMEVGGGAMVLGQISP 163
Query: 289 PGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-------EFSAIFDSGTSFTYLNDPAY 341
P S P YNI + + V G ++ + I DSGT++ YL + A+
Sbjct: 164 PSDMVFSHSDPDRSPYYNIELRGLHVAGKKLDINPQVFDGKHGTILDSGTTYAYLPEAAF 223
Query: 342 TQISETFNS 350
+ S
Sbjct: 224 LPFIQAITS 232
>gi|343172996|gb|AEL99201.1| aspartyl protease family protein, partial [Silene latifolia]
Length = 584
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 68/249 (27%), Positives = 108/249 (43%), Gaps = 27/249 (10%)
Query: 111 VGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNS 170
+G P F + +DTGS + ++PC+ SC N + + P+ S T V CN
Sbjct: 2 IGTPPQEFALIVDTGSTVTYVPCN--SCDQCGNHQDPK------FQPDLSDTYHPVKCNP 53
Query: 171 TLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGR 230
C + C Y+ +Y ++ + S+G L ED++ S+ R FGC
Sbjct: 54 DCT-----CDTENDQCTYERQY-AEMSSSSGILGEDLVSFG---NMSELKPQRAVFGCEN 104
Query: 231 VQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISFGDKGS 288
+TG A +G+ GLG S+ L +G+I +SFS+C+G G G + G
Sbjct: 105 AETGDLFSQHA-DGIMGLGRGDLSIVDQLVEKGVINDSFSLCYGGMEVGGGAMVLGQISP 163
Query: 289 PGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-------EFSAIFDSGTSFTYLNDPAY 341
P S P YNI + + V G ++ + I DSGT++ YL + A+
Sbjct: 164 PSDMVFSHSDPDRSPYYNIELRGLHVAGKKLDINPQVFDGKHGTILDSGTTYAYLPEAAF 223
Query: 342 TQISETFNS 350
+ S
Sbjct: 224 LPFIQAITS 232
>gi|409179880|gb|AFV26025.1| aspartic proteinase nepenthesin 2 [Nepenthes mirabilis]
Length = 437
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 94/342 (27%), Positives = 138/342 (40%), Gaps = 65/342 (19%)
Query: 34 FHHRYSDPVKGILAVDDLPKKG-SFAYYS----ALAHRDRYFRLRGRGLAAQGNDKTPLT 88
HH P G+ V + G + Y A+ +R R L + +TP+
Sbjct: 31 LHHGQKRPQPGLRVVLEQVDSGMNLTKYELIKRAIKRGERRMRSINAMLQSSSGIETPVY 90
Query: 89 FSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSG 147
AG+ Y +N V++G PA S +DTGSDL W C+ C C
Sbjct: 91 --AGSGEYLMN---------VAIGTPASSLSAIMDTGSDLIWTQCEPCTQCFSQPTP--- 136
Query: 148 QVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAG--SNCPYQVRYLSDGTMSTGFLVE 205
I++P SS+ S +PC S C+ PS ++C Y Y DG+ + G++
Sbjct: 137 ------IFNPQDSSSFSTLPCESQYCQ---DLPSESCYNDCQYTYGY-GDGSSTQGYMAT 186
Query: 206 DVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILA----- 260
+ T S I+FGCG G F G GL G+G S+PS L
Sbjct: 187 ETFTFET------SSVPNIAFGCGEDNQG-FGQGNGA-GLIGMGWGPLSLPSQLGVGQFS 238
Query: 261 ---NQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGN 317
+ ++ GS +G +GSP SL T+ Y IT+ ++VGG+
Sbjct: 239 YCMTSSGSSSPSTLALGSAASGV----PEGSPSTTLIHSSLNPTY--YYITLQGITVGGD 292
Query: 318 AVNFEFSA-----------IFDSGTSFTYLNDPAYTQISETF 348
+ S I DSGT+ TYL AY +++ F
Sbjct: 293 NLGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAF 334
>gi|219362525|ref|NP_001136612.1| uncharacterized protein LOC100216735 [Zea mays]
gi|194696366|gb|ACF82267.1| unknown [Zea mays]
gi|413953802|gb|AFW86451.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
Length = 411
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 90/297 (30%), Positives = 131/297 (44%), Gaps = 47/297 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFN--IYSPNTSST 162
+ VS G PA+ +V +DTGSD+ WL C SSGQ +Y P+ SST
Sbjct: 79 YVVRVSFGTPAVPQVVVIDTGSDVSWLQCK--------PCSSGQCFPQKDPLYDPSHSST 130
Query: 163 SSKVPCNSTLCE------LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
S VPC S +C+ C ++G C + + Y +DGT + G +D L LA
Sbjct: 131 YSAVPCASDVCKKLAADAYGSGC-TSGKQCGFAISY-ADGTSTVGAYSQDKLTLAPG--- 185
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD 276
++ FGCG G +G+ GLG + S+ A G + FS C S
Sbjct: 186 --AIVQNFYFGCGH---GKHAVRGLFDGVLGLGRLRE---SLGARYGGV---FSYCLPSV 234
Query: 277 GT--GRISFGDKGSP-GQGETPFSLRQTHPTYN-ITITQVSVGGNAVNFEFSA-----IF 327
+ G ++ G +P G TP PT++ +T+ ++VGG ++ SA I
Sbjct: 235 SSKPGFLALGAGKNPSGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLRPSAFSGGMIV 294
Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRSFLHLQALVVLP 384
DSGT T L AY + F + R DL + CY L + + VV+P
Sbjct: 295 DSGTVITGLQSTAYRALRSAFRKAMEAYRLLPNGDL--DTCYNLTGYKN----VVVP 345
>gi|326494754|dbj|BAJ94496.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326514480|dbj|BAJ96227.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 449
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 82/270 (30%), Positives = 117/270 (43%), Gaps = 47/270 (17%)
Query: 108 NVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
++S+G PA+++ +DTGSDL W C CV N S+ ++ P++SST + +P
Sbjct: 105 DMSIGTPAVAYAAIIDTGSDLVWTQCK--PCVECFNQST------PVFDPSSSSTYAALP 156
Query: 168 CNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFG 227
C+STLC + C Y Y D + + G L + LA K+ ++FG
Sbjct: 157 CSSTLCSDLPSSKCTSAKCGYTYTY-GDSSSTQGVLAAETFTLA------KTKLPDVAFG 209
Query: 228 CGRVQTGS-FLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGTGR----- 280
CG G F GA GL GLG S+ S L GL N FS C S D T +
Sbjct: 210 CGDTNEGDGFTQGA---GLVGLGRGPLSLVSQL---GL--NKFSYCLTSLDDTSKSPLLL 261
Query: 281 -----ISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA--------- 325
IS + TP + P+ Y + + ++VG + SA
Sbjct: 262 GSLATISESAAAASSVQTTPLIRNPSQPSFYYVNLKGLTVGSTHITLPSSAFAVQDDGTG 321
Query: 326 --IFDSGTSFTYLNDPAYTQISETFNSLAK 353
I DSGTS TYL Y + + F + K
Sbjct: 322 GVIVDSGTSITYLELQGYRALKKAFAAQMK 351
>gi|297811185|ref|XP_002873476.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
lyrata]
gi|297319313|gb|EFH49735.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
lyrata]
Length = 475
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 91/287 (31%), Positives = 124/287 (43%), Gaps = 38/287 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ V +G P + DTGSDL W C CV + I++P+ S++
Sbjct: 133 YIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQKEP--------IFNPSKSTSY 184
Query: 164 SKVPCNSTLC-ELQKQCPSAG----SNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
V C+S C L +AG SNC Y ++Y D + S GFL +D L + +
Sbjct: 185 YNVSCSSAACGSLSSATGNAGSCSASNCIYGIQY-GDQSFSVGFLAKDKFTLTSSD---- 239
Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDG- 277
V + FGCG G F A GL GLG DK S PS A FS C S
Sbjct: 240 -VFDGVYFGCGENNQGLFTGVA---GLLGLGRDKLSFPSQTATA--YNKIFSYCLPSSAS 293
Query: 278 -TGRISFGDKG-SPGQGETPFS-LRQTHPTYNITITQVSVGGNAVNFE---FS---AIFD 328
TG ++FG G S TP S + Y + I ++VGG + FS A+ D
Sbjct: 294 YTGHLTFGSAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVFSTPGALID 353
Query: 329 SGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP-FEYCYVLRSF 374
SGT T L AY + +F AK + +TS + + C+ L F
Sbjct: 354 SGTVITRLPPKAYAALRSSFK--AKMSKYPTTSGVSILDTCFDLSGF 398
>gi|413953782|gb|AFW86431.1| hypothetical protein ZEAMMB73_038825 [Zea mays]
Length = 462
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 83/285 (29%), Positives = 121/285 (42%), Gaps = 33/285 (11%)
Query: 99 NSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFN--IY 155
SLG L + V G PA ++ + DTGSD+ W+ C+ C SG + I+
Sbjct: 113 TSLGTLEFVVTVGFGTPAQTYTLMFDTGSDVSWI--QCLPC-------SGHCYKQHDPIF 163
Query: 156 SPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEK 215
P S+T S VPC C S+ C Y+V+Y DG+ + G L + L L
Sbjct: 164 DPTKSATYSAVPCGHPQCAAAGGKCSSNGTCLYKVQY-GDGSSTAGVLSHETLSLT---- 218
Query: 216 QSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
S +FGCG G F D +GL GLG + S+ S A S+ + +
Sbjct: 219 -SARALPGFAFGCGETNLGDFGDV---DGLIGLGRGQLSLSSQAAASFGAAFSYCLPSYN 274
Query: 276 DGTGRISFGD----KGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNF------EFS 324
G ++ G GS G T +Q +P+ Y + + + VGG +
Sbjct: 275 TSHGYLTIGTTTPASGSDGVRYTAMIQKQDYPSFYFVDLVSIVVGGFVLPVPPILFTRDG 334
Query: 325 AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
+ DSGT TYL AYT + + F + + D PF+ CY
Sbjct: 335 TLLDSGTVLTYLPPEAYTALRDRFKFTMTQYKPAPAYD-PFDTCY 378
>gi|125540927|gb|EAY87322.1| hypothetical protein OsI_08726 [Oryza sativa Indica Group]
Length = 464
Score = 81.6 bits (200), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 91/304 (29%), Positives = 124/304 (40%), Gaps = 40/304 (13%)
Query: 65 HRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLN-SLGFLHYT-NVSVGQPALSFIVAL 122
R Y + R G AA A L S+G L Y VS+G PA++ + +
Sbjct: 89 RRAEYIQRRVSGAAAAAPGMQLAGSKAATVPANLGFSIGTLQYVVTVSLGTPAVAQTLEV 148
Query: 123 DTGSDLFWL---PCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE----L 175
DTGSD+ W+ PC C + ++ P SS+ S VPC + C
Sbjct: 149 DTGSDVSWVQCKPCPSPPCYSQRDP---------LFDPTRSSSYSAVPCAAASCSQLALY 199
Query: 176 QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGS 235
C +G C Y V Y DG+ +TG D L L + FGCG Q G
Sbjct: 200 SNGC--SGGQCGYVVSY-GDGSTTTGVYSSDTLTLTGSNALKGFL-----FGCGHAQQGL 251
Query: 236 FLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--GSDGTGRISFGDKGS-PGQG 292
F A +GL GLG S+ S ++ FS C + G IS G S G
Sbjct: 252 F---AGVDGLLGLGRQGQSLVSQASST--YGGVFSYCLPPTQNSVGYISLGGPSSTAGFS 306
Query: 293 ETPFSLRQTHPTYNIT-ITQVSVGGNAVNFEFS-----AIFDSGTSFTYLNDPAYTQISE 346
TP PTY I + +SVGG ++ + S A+ D+GT T L AY+ +
Sbjct: 307 TTPLLTASNDPTYYIVMLAGISVGGQPLSIDASVFASGAVVDTGTVVTRLPPTAYSALRS 366
Query: 347 TFNS 350
F +
Sbjct: 367 AFRA 370
>gi|194696934|gb|ACF82551.1| unknown [Zea mays]
gi|413936470|gb|AFW71021.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
gi|413953801|gb|AFW86450.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
Length = 445
Score = 81.6 bits (200), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 90/297 (30%), Positives = 131/297 (44%), Gaps = 47/297 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFN--IYSPNTSST 162
+ VS G PA+ +V +DTGSD+ WL C SSGQ +Y P+ SST
Sbjct: 113 YVVRVSFGTPAVPQVVVIDTGSDVSWLQCK--------PCSSGQCFPQKDPLYDPSHSST 164
Query: 163 SSKVPCNSTLCE------LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
S VPC S +C+ C ++G C + + Y +DGT + G +D L LA
Sbjct: 165 YSAVPCASDVCKKLAADAYGSGC-TSGKQCGFAISY-ADGTSTVGAYSQDKLTLAPG--- 219
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD 276
++ FGCG G +G+ GLG + S+ A G + FS C S
Sbjct: 220 --AIVQNFYFGCGH---GKHAVRGLFDGVLGLGRLRE---SLGARYGGV---FSYCLPSV 268
Query: 277 GT--GRISFGDKGSP-GQGETPFSLRQTHPTYN-ITITQVSVGGNAVNFEFSA-----IF 327
+ G ++ G +P G TP PT++ +T+ ++VGG ++ SA I
Sbjct: 269 SSKPGFLALGAGKNPSGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLRPSAFSGGMIV 328
Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRSFLHLQALVVLP 384
DSGT T L AY + F + R DL + CY L + + VV+P
Sbjct: 329 DSGTVITGLQSTAYRALRSAFRKAMEAYRLLPNGDL--DTCYNLTGYKN----VVVP 379
>gi|359492937|ref|XP_002283889.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 439
Score = 81.6 bits (200), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 87/271 (32%), Positives = 110/271 (40%), Gaps = 47/271 (17%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ V +G P + LDT D W+PC DC C +SPNTSST
Sbjct: 99 YVVRVKLGTPGQLMFMVLDTSRDAAWVPCADCAGCSS------------PTFSPNTSSTY 146
Query: 164 SKVPCNSTLCELQK--QCPSAG-SNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
+ + C+ C + CP+ G + C + Y D + S L +D L LA D S
Sbjct: 147 ASLQCSVPQCTQVRGLSCPTTGTAACFFNQTYGGDSSFS-AMLSQDSLGLAVDTLPS--- 202
Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQG-LIPNSFSMCFGSDG-- 277
SFGC +GS L P GL GLG S+L+ G L FS CF S
Sbjct: 203 ---YSFGCVNAVSGSTLP---PQGLLGLGRGPM---SLLSQSGSLYSGVFSYCFPSFKSY 253
Query: 278 --TGRISFGDKGSPGQGETPFSLRQTH-PT-YNITITQVSVGGNAV-----------NFE 322
+G + G G P T LR H PT Y + +T VSVG V N
Sbjct: 254 YFSGSLRLGPLGQPKNIRTTPLLRNPHRPTLYYVNLTGVSVGRVLVPVAPELLAFDPNTG 313
Query: 323 FSAIFDSGTSFTYLNDPAYTQISETFNSLAK 353
I DSGT T +P Y I + F K
Sbjct: 314 AGTIIDSGTVITRFVEPVYAAIRDEFRKQVK 344
>gi|356532674|ref|XP_003534896.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 446
Score = 81.6 bits (200), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 80/284 (28%), Positives = 126/284 (44%), Gaps = 50/284 (17%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSS- 164
N+S+GQP++ +V +DTGSD+ W+ C+ C +C + L ++ P+ SST S
Sbjct: 103 VNLSIGQPSIPQLVVMDTGSDILWIMCNPCTNCDNHLGL---------LFDPSMSSTFSP 153
Query: 165 --KVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
K PC C+ P+ + Y+ + + S F + ++ TDE S+ D
Sbjct: 154 LCKTPCGFKGCKCDP--------IPFTISYVDNSSASGTFGRDILVFETTDEGTSQISD- 204
Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT---- 278
+ GCG F NG+ GL + P+ LA Q I FS C G+
Sbjct: 205 -VIIGCG--HNIGFNSDPGYNGILGL----NNGPNSLATQ--IGRKFSYCIGNLADPYYN 255
Query: 279 -GRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVN-----FEFS------AI 326
++ G+ TPF + H Y +T+ +SVG ++ FE I
Sbjct: 256 YNQLRLGEGADLEGYSTPFEVY--HGFYYVTMEGISVGEKRLDIALETFEMKRNGTGGVI 313
Query: 327 FDSGTSFTYLNDPAYTQI-SETFNSLAKEKRETSTSDLPFEYCY 369
DSGT+ TYL D A+ + +E N L R+ + P++ CY
Sbjct: 314 LDSGTTITYLVDSAHKLLYNEVRNLLKWSFRQVIFENAPWKLCY 357
>gi|125527523|gb|EAY75637.1| hypothetical protein OsI_03542 [Oryza sativa Indica Group]
Length = 446
Score = 81.3 bits (199), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 98/313 (31%), Positives = 136/313 (43%), Gaps = 55/313 (17%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ V VG P+ ++ +DTGSDL WL C C C + GQV D P SST
Sbjct: 86 YFALVGVGTPSTKAMLVIDTGSDLVWLQCSPCRRCY----AQRGQVFD-----PRRSSTY 136
Query: 164 SKVPCNSTLCELQK--QCPS---AGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
+VPC+S C + C S AG C Y V Y DG+ STG L D L A D
Sbjct: 137 RRVPCSSPQCRALRFPGCDSGGAAGGGCRYMVAY-GDGSSSTGELATDKLAFAND----- 190
Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT 278
+ + ++ GCGR G F D AA GL G+ K S+ + +A + F C G D T
Sbjct: 191 TYVNNVTLGCGRDNEGLF-DSAA--GLLGVARGKISISTQVAPA--YGSVFEYCLG-DRT 244
Query: 279 GR------ISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGGNAVNFEFSA--- 325
R + FG +P T F+ ++P Y + + SVGG V +A
Sbjct: 245 SRSTRSSYLVFGR--TPEPPSTAFTALLSNPRRPSLYYVDMAGFSVGGERVTGFSNASLA 302
Query: 326 ----------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETST--SDLPFEYCYVLRS 373
+ DSGT+ + AY + + F++ A+ F+ CY LR
Sbjct: 303 LDTATGRGGVVVDSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHSVFDACYDLRG 362
Query: 374 FLHLQA-LVVLPF 385
A L+VL F
Sbjct: 363 RPAASAPLIVLHF 375
>gi|125527257|gb|EAY75371.1| hypothetical protein OsI_03267 [Oryza sativa Indica Group]
Length = 484
Score = 81.3 bits (199), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 96/367 (26%), Positives = 142/367 (38%), Gaps = 74/367 (20%)
Query: 63 LAHRDR----YFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSF 118
LA DR + RGR AA+ + S+G T ++ VG PA F
Sbjct: 46 LARMDRERMAFISSRGRRRAAETASAFAMPLSSGAYTGTGQ-----YFVRFRVGTPAQPF 100
Query: 119 IVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDF-------NIYSPNTSSTSSKVPCNST 171
++ DTGSDL W+ C + + + + + P+ S T + +PC+S
Sbjct: 101 LLVADTGSDLTWVKCHRAAAAASASPRNASSLPAPAPASPRRTFRPDKSRTWAPIPCSSA 160
Query: 172 LCELQ-----KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR-IS 225
C C + + C Y RY DG+ + G + D +A + ++ R +
Sbjct: 161 TCRESLPFSLAACATPANPCAYDYRY-KDGSAARGTVGVDSATIALSGRAARKAKLRGVV 219
Query: 226 FGCGRVQTG-SFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-----GSDGTG 279
GC G SFL A +G+ LG S S A++ FS C + T
Sbjct: 220 LGCTTSYNGQSFL---ASDGVLSLGYSNISFASRAASR--FGGRFSYCLVDHLAPRNATS 274
Query: 280 RISFGDKGS-----PGQG---------------------ETPFSL-RQTHPTYNITITQV 312
++FG + P +G +TP L +T P Y +T+ V
Sbjct: 275 YLTFGPNPAFSSRRPSEGIASCKPAPAPTPAPAGAPGARQTPLVLDHRTRPFYAVTVKGV 334
Query: 313 SVGGNAVNFEFS---------AIFDSGTSFTYLNDPAYTQISETFNS-LAKEKRETSTSD 362
SV G + + AI DSGTS T L PAY + + LA R T
Sbjct: 335 SVAGELLKIPRAVWDVEQGGGAILDSGTSLTMLAKPAYRAVVAALSKRLAGLPRVTMD-- 392
Query: 363 LPFEYCY 369
PF+YCY
Sbjct: 393 -PFDYCY 398
>gi|357137345|ref|XP_003570261.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 458
Score = 81.3 bits (199), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 84/267 (31%), Positives = 122/267 (45%), Gaps = 35/267 (13%)
Query: 98 LNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSP 157
L++L ++ VS+G PA++ V +DTGSD+ W+ C + +G + F+ P
Sbjct: 120 LDTLAYV--ITVSIGTPAMTQAVMIDTGSDVSWVHCHA-------RAGAGSSLFFD---P 167
Query: 158 NTSSTSSKVPCNSTLC---ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
SST + C+S C E + S S C Y VRY DG+ +TG D L L + E
Sbjct: 168 GKSSTYTPFSCSSAACTRLEGRDNGCSLNSTCQYTVRY-GDGSNTTGTYGSDTLALNSTE 226
Query: 215 KQSKSVDSRISFGCGRV-QTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNS-FSMC 272
K FGC G LD +GL GLG PS+++ S FS C
Sbjct: 227 KVEN-----FQFGCSETSDPGEGLDEDQTDGLMGLG---GGAPSLVSQTAATYGSAFSYC 278
Query: 273 F--GSDGTGRISFG-DKGSPGQGETPFSLRQTHPTYNITITQ-VSVGGNAVN-----FEF 323
+ +G ++ G G+ G TP + PT+ I Q ++VGG+ V F
Sbjct: 279 LPATTRSSGFLTLGASTGTSGFVTTPMFRSRRAPTFYFVILQGINVGGDPVAISPTVFAA 338
Query: 324 SAIFDSGTSFTYLNDPAYTQISETFNS 350
+I DSGT T L AY+ +S F +
Sbjct: 339 GSIMDSGTIITRLPPRAYSALSAAFRA 365
>gi|242070719|ref|XP_002450636.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
gi|241936479|gb|EES09624.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
Length = 410
Score = 81.3 bits (199), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 92/298 (30%), Positives = 127/298 (42%), Gaps = 53/298 (17%)
Query: 78 AAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CV 136
AA G+ +TPL +G Y + S+G P DTGSDL W C C
Sbjct: 64 AASGSAQTPLQLDSGGGAYDMT---------FSIGTPPQELSALADTGSDLIWAKCGACT 114
Query: 137 SCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE--LQKQCPSAGSNCPYQVRY-- 192
CV + S Y PN SS+ SK+PC+ +LC QC + G+ C Y+ Y
Sbjct: 115 RCVPQGSPS---------YYPNKSSSFSKLPCSGSLCSDLPSSQCSAGGAECDYKYSYGL 165
Query: 193 LSDGTMST-GFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMD 251
SD T G+L + L +D I FGC + G + G+ +
Sbjct: 166 ASDPHHYTQGYLGSETFTLGSDAVPG------IGFGCTTMSEGGYGSGSG-------LVG 212
Query: 252 KTSVPSILANQGLIPNSFSMCFGSDG--TGRISFGDKGSPGQG--ETPFSLRQTHPTYNI 307
P L +Q L +FS C SD T + FG G G TP LR + Y +
Sbjct: 213 LGRGPLSLVSQ-LNVGAFSYCLTSDAAKTSPLLFGSGALTGAGVQSTPL-LRTSTYYYTV 270
Query: 308 TITQVSVGGNAVNFEFSA--IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDL 363
+ +S+G S+ IFDSGT+ +L +PAYT LAKE + T++L
Sbjct: 271 NLESISIGAATTAGTGSSGIIFDSGTTVAFLAEPAYT--------LAKEAVLSQTTNL 320
>gi|414880708|tpg|DAA57839.1| TPA: hypothetical protein ZEAMMB73_997765 [Zea mays]
Length = 452
Score = 81.3 bits (199), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 79/294 (26%), Positives = 114/294 (38%), Gaps = 51/294 (17%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ + VG P +V +DTGSDL WL C C C + +Y P S T
Sbjct: 92 YFAVIGVGDPPTHALVVIDTGSDLIWLQCLPCRRCYRQVTP---------LYDPRNSKTH 142
Query: 164 SKVPCNSTLCE---LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
++PC S C C + C Y V Y DG+ S+G L D L L D +
Sbjct: 143 RRIPCASPQCRGVLRYPGCDARTGGCVYMVVY-GDGSASSGDLATDTLVLPDDTRVHN-- 199
Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS----- 275
++ GCG G A GL G G + S P+ LA + FS C G
Sbjct: 200 ---VTLGCGHDNEGLLASAA---GLLGAGRGQLSFPTQLAPA--YGHVFSYCLGDRMSRA 251
Query: 276 -DGTGRISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGG-------------N 317
+ + + FG +P T F+ +T+P Y + + SVGG N
Sbjct: 252 RNSSSYLVFGR--TPELPSTAFTPLRTNPRRPSLYYVDMVGFSVGGERVAGFSNASLALN 309
Query: 318 AVNFEFSAIFDSGTSFTYLNDPAYTQISETF--NSLAKEKRETSTSDLPFEYCY 369
+ DSGT+ + AY + + F ++ A R F+ CY
Sbjct: 310 PATGRGGVVVDSGTAISRFTRDAYAAVRDAFVSHAAAAGMRRLRNKFSVFDTCY 363
>gi|326503794|dbj|BAK02683.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 456
Score = 80.9 bits (198), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 84/297 (28%), Positives = 126/297 (42%), Gaps = 43/297 (14%)
Query: 86 PLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSS 145
P T + DT + V +G PA++ + +DTGSD+ W+ C NS+
Sbjct: 117 PTTLGSALDTME-------YVITVGIGSPAVTQTMMIDTGSDVSWVRC---------NST 160
Query: 146 SGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSN--CPYQVRYLSDGTMSTGFL 203
G ++ P+ S+T + C+S C SN C Y+V+Y DG+ +TG
Sbjct: 161 DG----LTLFDPSKSTTYAPFSCSSAACAQLGNNGDGCSNSGCQYRVQY-GDGSNTTGTY 215
Query: 204 VEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQG 263
D L L+ + + FGC + DG +GL GLG D S+ S A
Sbjct: 216 SSDTLALSASDTVTD-----FHFGCSHHEED--FDGEKIDGLMGLGGDAQSLVSQTA--A 266
Query: 264 LIPNSFSMCF--GSDGTGRISFGDKGSPGQG--ETPFSLRQTHPT-YNITITQVSVGGNA 318
SFS C + +G ++FG G TP PT Y + + +SVGG
Sbjct: 267 TYGKSFSYCLPPTNRTSGFLTFGAPNGTSGGFVTTPMLRWPKAPTLYGVLLQDISVGGTP 326
Query: 319 VNFEFS-----AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP-FEYCY 369
+ + S ++ DSGT T+L AY+ +S F S R + L + CY
Sbjct: 327 LGIQPSVLSNGSVMDSGTVITWLPRRAYSALSSAFRSSMTRLRHQRAAPLGILDTCY 383
>gi|413953792|gb|AFW86441.1| hypothetical protein ZEAMMB73_342504 [Zea mays]
Length = 459
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 87/312 (27%), Positives = 135/312 (43%), Gaps = 54/312 (17%)
Query: 62 ALAHRDRYFRLRGRGL---AAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSF 118
+L+ R R R R + + A++ N P D+ + V +G PA+S
Sbjct: 81 SLSERLRRSRARSKYIMSRASKSNVSIPTHLGGSVDSLE-------YVVTVGLGTPAVSQ 133
Query: 119 IVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC-ELQK 177
++ +DTGSDL W+ C C NS++ ++ P+ SST + +PCN+ C +L +
Sbjct: 134 VLLIDTGSDLSWV--QCAPC----NSTTCYPQKDPLFDPSRSSTYAPIPCNTDACRDLTR 187
Query: 178 -----QCPS---AGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCG 229
C S G+ C Y + Y DG+ +TG + L +A FGCG
Sbjct: 188 DGYGSDCTSGSGGGAQCGYAITY-GDGSQTTGVYSNETLTMAPGVTVKD-----FHFGCG 241
Query: 230 RVQTGSFLDGAAPN----GLFGLGMDKTSVPSILANQGLIPNSFSMCF--GSDGTGRISF 283
Q G PN GL GLG S+ ++ + +FS C +D G ++
Sbjct: 242 HDQDG-------PNDKYDGLLGLGGAPESL--VVQTSSVYGGAFSYCLPAANDQAGFLAL 292
Query: 284 GDKGSPGQG--ETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA-----IFDSGTSFTYL 336
G + G TP +R+ Y + +T ++VGG ++ SA I DSGT T L
Sbjct: 293 GAPVNDASGFVFTPM-VREQQTFYVVNMTGITVGGEPIDVPPSAFSGGMIIDSGTVVTEL 351
Query: 337 NDPAYTQISETF 348
AY + F
Sbjct: 352 QHTAYAALQAAF 363
>gi|195658449|gb|ACG48692.1| hypothetical protein [Zea mays]
gi|413938915|gb|AFW73466.1| hypothetical protein ZEAMMB73_105703 [Zea mays]
Length = 149
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 55/133 (41%), Positives = 69/133 (51%), Gaps = 25/133 (18%)
Query: 29 TFGFDFHHRYSD-------PVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQG 81
TF HR SD P G+ P++GS YY AL D + + R LA +
Sbjct: 26 TFSSRMVHRLSDEARLEAGPRMGLW-----PQRGSGGYYRALLRSD--LQRQKRRLAGKN 78
Query: 82 N----DKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVS 137
K TFS GND LG+L+Y V VG P SF+VALDTGSDLFW+PCDC+
Sbjct: 79 QLLSLSKGGSTFSPGND------LGWLYYAWVDVGTPTTSFLVALDTGSDLFWVPCDCIQ 132
Query: 138 CVHGLNSSSGQVI 150
C L+S G ++
Sbjct: 133 CAP-LSSYRGNLV 144
>gi|302768196|ref|XP_002967518.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
gi|300165509|gb|EFJ32117.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
Length = 398
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 84/293 (28%), Positives = 128/293 (43%), Gaps = 49/293 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ T +S+G PA F V DTGSDL W+ C C +C + + I+ P SS+
Sbjct: 40 YVTTISLGTPAKVFSVIADTGSDLIWIQCKPCQACFNQKDP---------IFDPEGSSSY 90
Query: 164 SKVPCNSTLCEL--QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
+ + C TLC+ +K C +C Y Y DG+ + G L + + L + + + K
Sbjct: 91 TTMSCGDTLCDSLPRKSC---SPDCDYSYGY-GDGSGTRGTLSSETVTLTSTQGE-KLAA 145
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-----GSD 276
I+FGCG + GSF D + GL GLG S S L + L + FS C
Sbjct: 146 KNIAFGCGHLNRGSFNDAS---GLVGLGRGNLSFVSQLGD--LFGHKFSYCLVPWRDAPS 200
Query: 277 GTGRISFGDKGSPGQG----ETPFSLRQTHPT----YNITITQVSVGGNAVNFEFSA--- 325
T + FGD+ S F+ +P Y + + +S+ G A+ +
Sbjct: 201 KTSPMFFGDESSSHSSGKKLHYAFTPMIHNPAMESFYYVKLKDISIAGRALRIPAGSFDI 260
Query: 326 --------IFDSGTSFTYLNDPAYTQISETFNS-LAKEKRETSTSDLPFEYCY 369
IFDSGT+ T L D Y + S ++ K + S++ L + CY
Sbjct: 261 KPDGSGGMIFDSGTTLTLLPDAPYQIVLRALRSKISFPKIDGSSAGL--DLCY 311
>gi|302757235|ref|XP_002962041.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
gi|300170700|gb|EFJ37301.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
Length = 367
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 90/303 (29%), Positives = 133/303 (43%), Gaps = 58/303 (19%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ + +G P F +DTGSDL W+ C C C + IY P+ SST
Sbjct: 4 YTMEIELGSPPKKFNAIVDTGSDLVWIQCKPCSQCYSQSDP---------IYDPSASSTF 54
Query: 164 SKVPCNSTLCEL--QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
+K C+++ C+ C S+ C Y +Y D + + G + L L + SK+
Sbjct: 55 AKTSCSTSSCQSLPASGCSSSAKTCIYGYQY-GDSSSTQGDFALETLTLRSSGGSSKAFP 113
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-----GSD 276
FGCGR+ +GSF GAA G+ GLG K S+ + L + I N FS C S
Sbjct: 114 -NFQFGCGRLNSGSF-GGAA--GIVGLGQGKISLSTQLGSA--INNKFSYCLVDFDDDSS 167
Query: 277 GTGRISFGDKGSPGQGE-----TPFSLRQTHPTYNITITQVSVGGNAVNFEFSA------ 325
T + FG S G G P S R T+ Y + + +SVGG ++ A
Sbjct: 168 KTSPLIFGSSASTGSGAISTPIIPNSGRSTY--YFVGLEGISVGGKQLSLATRAIDFLSV 225
Query: 326 ------------------IFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFE 366
IFDSGT+ T L+D Y+++ F +S++ + S+S F+
Sbjct: 226 RSKKKLRVRALEVNSGGTIFDSGTTLTLLDDAVYSKVKSAFASSVSLPTVDASSSG--FD 283
Query: 367 YCY 369
CY
Sbjct: 284 LCY 286
>gi|8979711|emb|CAB96832.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
thaliana]
Length = 446
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 90/287 (31%), Positives = 123/287 (42%), Gaps = 38/287 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ V +G P + DTGSDL W C CV + I++P+ S++
Sbjct: 104 YIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQKEP--------IFNPSKSTSY 155
Query: 164 SKVPCNSTLC-ELQKQCPSAG----SNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
V C+S C L +AG SNC Y ++Y D + S GFL ++ L +
Sbjct: 156 YNVSCSSAACGSLSSATGNAGSCSASNCIYGIQY-GDQSFSVGFLAKEKFTLTNSD---- 210
Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDG- 277
V + FGCG G F A GL GLG DK S PS A FS C S
Sbjct: 211 -VFDGVYFGCGENNQGLFTGVA---GLLGLGRDKLSFPSQTATA--YNKIFSYCLPSSAS 264
Query: 278 -TGRISFGDKG-SPGQGETPFS-LRQTHPTYNITITQVSVGGNAVNFE---FS---AIFD 328
TG ++FG G S TP S + Y + I ++VGG + FS A+ D
Sbjct: 265 YTGHLTFGSAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVFSTPGALID 324
Query: 329 SGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP-FEYCYVLRSF 374
SGT T L AY + +F AK + +TS + + C+ L F
Sbjct: 325 SGTVITRLPPKAYAALRSSFK--AKMSKYPTTSGVSILDTCFDLSGF 369
>gi|225216949|gb|ACN85242.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
Length = 519
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 81/260 (31%), Positives = 114/260 (43%), Gaps = 34/260 (13%)
Query: 99 NSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYS 156
+LG +Y V +G PA + V DTGSD W+ C CV + ++
Sbjct: 173 RALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQRE--------KLFD 224
Query: 157 PNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
P SST + V C + C +G +C Y V+Y DG+ S GF D L L++ +
Sbjct: 225 PARSSTYANVSCAAPACSDLNIHGCSGGHCLYGVQY-GDGSYSIGFFAMDTLTLSSYDAV 283
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--G 274
FGCG G F + A GL GLG KTS+P ++ F+ C
Sbjct: 284 KG-----FRFGCGERNEGLFGEAA---GLLGLGRGKTSLPVQTYDK--YGGVFAHCLPAR 333
Query: 275 SDGTGRISFGD---KGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA----- 325
S GTG + FG + + TP L + PT Y + +T + VGG ++ S
Sbjct: 334 STGTGYLDFGAGSLAAARARLTTPM-LTENGPTFYYVGMTGIRVGGQLLSIPQSVFATAG 392
Query: 326 -IFDSGTSFTYLNDPAYTQI 344
I DSGT T L AY+ +
Sbjct: 393 TIVDSGTVITRLPPAAYSSL 412
>gi|224082314|ref|XP_002306645.1| predicted protein [Populus trichocarpa]
gi|222856094|gb|EEE93641.1| predicted protein [Populus trichocarpa]
Length = 410
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 75/269 (27%), Positives = 111/269 (41%), Gaps = 34/269 (12%)
Query: 109 VSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
+++G P +F +DTGSDL W+ CD C C + +Y P ++ V
Sbjct: 58 LNIGNPPKAFDFDIDTGSDLTWVQCDAPCKGCTKPRD---------KLYKPK----NNLV 104
Query: 167 PCNSTLCEL-----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
PC+++LC+ C + C Y++ Y G+ S G L+ D L +
Sbjct: 105 PCSNSLCQAVSTGENYHCDAPDDQCDYEIEYADLGS-SIGVLLSDSFPLRL--SNGTLLQ 161
Query: 222 SRISFGCGRVQTGSFLDGAAP---NGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT 278
+++FGCG Q L P G+ GLG K S+ S L G+ N CF
Sbjct: 162 PKMAFGCGYDQ--KHLGPHPPPDTAGILGLGRGKVSILSQLRTLGITQNVVGHCFSRARG 219
Query: 279 GRISFGDKGSPGQ--GETPFSLRQTHPTYNITITQVSVGGNAVNFE-FSAIFDSGTSFTY 335
G + FGD P TP + Y+ ++ GG + IFDSG+S+TY
Sbjct: 220 GFLFFGDHLFPSSRITWTPMLRSSSDTLYSSGPAELLFGGKPTGIKGLQLIFDSGSSYTY 279
Query: 336 LNDPAYTQISETFNSLAKEKRETSTSDLP 364
N Y I N + K+ D P
Sbjct: 280 FNAQVYQSI---LNLVRKDLAGKPLKDAP 305
>gi|242095586|ref|XP_002438283.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
gi|241916506|gb|EER89650.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
Length = 470
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 87/263 (33%), Positives = 123/263 (46%), Gaps = 34/263 (12%)
Query: 99 NSLGFLHY-TNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSP 157
S+G +Y T + +G PA S+ + +DTGS L WL C CV + G +Y P
Sbjct: 127 TSVGVGNYVTELGLGTPATSYAMVVDTGSSLTWL--QCSPCVVSCHRQVGP-----LYDP 179
Query: 158 NTSSTSSKVPCNSTLC-ELQKQC--PSAGS---NCPYQVRYLSDGTMSTGFLVEDVLHLA 211
SST + VPC+++ C ELQ PSA S C YQ Y D + S G+L D +
Sbjct: 180 RASSTYATVPCSASQCDELQAATLNPSACSVRNVCIYQASY-GDSSFSVGYLSRDTVSFG 238
Query: 212 TDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSM 271
+ +GCG+ G F A GL GL +K S+ LA + SFS
Sbjct: 239 SGSYP------NFYYGCGQDNEGLFGRSA---GLIGLARNKLSLLYQLAPS--LGYSFSY 287
Query: 272 CFGSDG-TGRISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNF---EFSA- 325
C + TG +S G S TP + + Y +T++ +SVGG+ + E+S+
Sbjct: 288 CLPTPASTGYLSIGPYTSGHYSYTPMASSSLDASLYFVTLSGMSVGGSPLAVSPAEYSSL 347
Query: 326 --IFDSGTSFTYLNDPAYTQISE 346
I DSGT T L YT +S+
Sbjct: 348 PTIIDSGTVITRLPTAVYTALSK 370
>gi|356527089|ref|XP_003532146.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 488
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 83/264 (31%), Positives = 117/264 (44%), Gaps = 40/264 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CV-SCVHGLNSSSGQVIDFNIYSPNTSST 162
++ V +G P + DTGSDL W C+ C SC ++ I+ P+ S++
Sbjct: 145 YFVVVGLGTPKRDLSLIFDTGSDLTWTQCEPCARSCYKQQDA---------IFDPSKSTS 195
Query: 163 SSKVPCNSTLCEL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL-ATDE 214
S + C STLC + C ++ C Y ++Y D + S G+ + L + ATD
Sbjct: 196 YSNITCTSTLCTQLSTATGNEPGCSASTKACIYGIQY-GDSSFSVGYFSRERLSVTATD- 253
Query: 215 KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF- 273
VD+ + FGCG+ G F A GL GLG S + + FS C
Sbjct: 254 ----IVDNFL-FGCGQNNQGLFGGSA---GLIGLGRHPISF--VQQTAAVYRKIFSYCLP 303
Query: 274 -GSDGTGRISFGDKGSPGQGETPFS-LRQTHPTYNITITQVSVGGNAVNFEFS------A 325
S TGR+SFG + TPFS + + Y + IT +SVGG + S A
Sbjct: 304 ATSSSTGRLSFGTTTTSYVKYTPFSTISRGSSFYGLDITGISVGGAKLPVSSSTFSTGGA 363
Query: 326 IFDSGTSFTYLNDPAYTQISETFN 349
I DSGT T L AYT + F
Sbjct: 364 IIDSGTVITRLPPTAYTALRSAFR 387
>gi|356546370|ref|XP_003541599.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 434
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 78/266 (29%), Positives = 115/266 (43%), Gaps = 32/266 (12%)
Query: 110 SVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPC 168
SVG P +DTGSD+ WL C C C + I+ P+ S+T +P
Sbjct: 91 SVGIPPFQLYGIIDTGSDMIWLQCKPCEKCYNQTT---------RIFDPSKSNTYKILPF 141
Query: 169 NSTLCELQKQCPSAGSN---CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
+ST C+ + + N C Y + Y DG+ S G L + L L + S R
Sbjct: 142 SSTTCQSVEDTSCSSDNRKMCEYTI-YYGDGSYSQGDLSVETLTLGSTNGSSVKF-RRTV 199
Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQ-GLIPNSFSMCFG--SDGTGRIS 282
GCGR T SF +G + +G+ GLG S+ + L + I FS C S+ + +++
Sbjct: 200 IGCGRNNTVSF-EGKS-SGIVGLGNGPVSLINQLRRRSSSIGRKFSYCLASMSNISSKLN 257
Query: 283 FGDKG---SPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA---------IFDSG 330
FGD G TP Y +T+ SVG N + F S+ I DSG
Sbjct: 258 FGDAAVVSGDGTVSTPIVTHDPKVFYYLTLEAFSVGNNRIEFTSSSFRFGEKGNIIIDSG 317
Query: 331 TSFTYLNDPAYTQISETFNSLAKEKR 356
T+ T L + Y+++ L + R
Sbjct: 318 TTLTLLPNDIYSKLESAVADLVELDR 343
>gi|357130848|ref|XP_003567056.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 448
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 77/293 (26%), Positives = 122/293 (41%), Gaps = 46/293 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ +V VG P ++ +DTGSD+ WL C CV C L+ +Y P SST
Sbjct: 99 YFASVGVGTPPTPALLVIDTGSDVVWLQCKPCVHCYRQLSP---------LYDPRGSSTY 149
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
++ PC+ C + C C Y++ Y D + ++G L D L + D
Sbjct: 150 AQTPCSPPQCRNPQTCDGTTGGCGYRIVY-GDASSTSGNLATDRLVFSNDTSVGN----- 203
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-----DGT 278
++ GCG G F A GL G+ S + +A+ F+ C G +
Sbjct: 204 VTLGCGHDNEGLFGSAA---GLLGVARGNNSFATQVADS--YGRYFAYCLGDRTRSGSSS 258
Query: 279 GRISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGGNAV----NFEFS------ 324
+ FG + +P + F+ +++P Y + + SVGG V N S
Sbjct: 259 SYLVFG-RTAPEPPSSVFTPLRSNPRRPSLYYVDMVGFSVGGEPVTGFSNASLSLDPATG 317
Query: 325 ---AIFDSGTSFTYLNDPAYTQISETFNSLAKE--KRETSTSDLPFEYCYVLR 372
+ DSGTS T AY + + F++ A + R+ F+ CY LR
Sbjct: 318 RGGVVVDSGTSITRFARDAYGALRDAFDARAAKVGMRKVGRGISVFDACYDLR 370
>gi|242095588|ref|XP_002438284.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
gi|241916507|gb|EER89651.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
Length = 487
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 96/321 (29%), Positives = 138/321 (42%), Gaps = 51/321 (15%)
Query: 59 YYSALAHRDRYFRLRG--RGLAAQGNDKTPLTFSAGNDTYRLNSLGF--LHYT-NVSVGQ 113
+Y+ + RDR+ R+R R L A T T A RL L F L Y + +G
Sbjct: 78 HYTGILRRDRH-RVRSIYRRLTAAETTTTTTTIPA-----RLG-LAFQSLEYVVTIGIGT 130
Query: 114 PALSFIVALDTGSDLFW---LPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNS 170
P +F V DTGSDL W LPC SC ++ P+ SST VPC++
Sbjct: 131 PPRNFTVLFDTGSDLTWVQCLPCPDSSCYPQQEP---------LFDPSKSSTYVDVPCSA 181
Query: 171 TLCEL--QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGC 228
C + +Q ++C Y V+Y D + + G L E+ L+ + + + + FGC
Sbjct: 182 PECHIGGVQQTRCGATSCEYSVKY-GDESETHGSLAEETFTLSPPSPLAPAA-TGVVFGC 239
Query: 229 GRVQTGSFLD-GAAPNGLFGLGMDKTSVPSILANQGLIPNS----FSMCFGSDG--TGRI 281
F D G GL GLG + SIL+ NS FS C G TG +
Sbjct: 240 SHEYISVFNDTGMGVAGLLGLGRGDS---SILSQTRRSINSGGGVFSYCLPPRGSSTGYL 296
Query: 282 SFGDKGSPGQGE------TPF--SLRQTHPTYNITITQVSVGGNAVN-----FEFSAIFD 328
+ G + Q + TP ++ Q Y + + VSV G AV+ F A+ D
Sbjct: 297 TIGGGAAAPQQQYSNLSFTPLITTISQLRSAYVVNLAGVSVNGAAVDIPASAFSLGAVID 356
Query: 329 SGTSFTYLNDPAYTQISETFN 349
SGT T++ AY + + F
Sbjct: 357 SGTVVTHMPAAAYYPLRDEFR 377
>gi|145356007|ref|XP_001422234.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144582474|gb|ABP00551.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 488
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 70/258 (27%), Positives = 120/258 (46%), Gaps = 41/258 (15%)
Query: 117 SFIVALDTGSDLFWLPCD-CVSC---VHGLNSSSGQVIDFNIYSPNTSSTSSKVPC---- 168
++ + +DTGS ++PC C C HG Y + S ++ C
Sbjct: 50 TYDLIVDTGSARTYVPCKGCARCGEHAHGY------------YDYDRSMEFERLDCGEAS 97
Query: 169 NSTLCE--LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISF 226
++TLCE ++ C S G C Y V Y ++G+ S G++V D + L ++ + ++F
Sbjct: 98 DATLCEETMKGTCQSDG-RCSYVVSY-AEGSSSRGYVVRDRVRLG-----EGTLSAMLAF 150
Query: 227 GCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDG----TG 279
GC +T + + A +GLFG G +V + LA+ GLI N FS C FG++G G
Sbjct: 151 GCEEAETNAIYEQKA-DGLFGFGRGTATVHAQLASAGLIENVFSFCVEGFGANGGVLTLG 209
Query: 280 RISFGDKGSPGQGETPFSLRQTHPTY-NITITQVSVGGNAVNF--EFSAIFDSGTSFTYL 336
R FG +P TP +P + N+ + +G + + ++ DSGT+FT++
Sbjct: 210 RFDFG-ADAPALARTPLVADPANPAFHNVRTSSWKLGDSLIEHLNSYTTTLDSGTTFTFV 268
Query: 337 NDPAYTQISETFNSLAKE 354
+ ++ A +
Sbjct: 269 PRSVWVSFKTRLDTQATQ 286
>gi|356495754|ref|XP_003516738.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 420
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 87/291 (29%), Positives = 126/291 (43%), Gaps = 35/291 (12%)
Query: 105 HY-TNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
HY +S+G P DTGSDL W C C +C N ++ P S+T
Sbjct: 71 HYLMELSIGTPPFKIYGIADTGSDLTWTSCVPCNNCYKQRNP---------MFDPQKSTT 121
Query: 163 SSKVPCNSTLC-ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
+ C+S LC +L S C Y Y S ++ G L ++ + L++ + +S +
Sbjct: 122 YRNISCDSKLCHKLDTGVCSPQKRCNYTYAYAS-AAITRGVLAQETITLSSTKGKSVPLK 180
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSD-- 276
I FGCG TG F D G+ GLG S+ S + + FS C F +D
Sbjct: 181 G-IVFGCGHNNTGGFNDHEM--GIIGLGGGPVSLISQMGSS-FGGKRFSQCLVPFHTDVS 236
Query: 277 GTGRISFGDKGSPGQGE----TPFSLRQTHPTYNITITQVSV-------GGNAVNFEFSA 325
+ ++SFG KGS G+ TP +Q Y +T+ +SV G++ N E
Sbjct: 237 VSSKMSFG-KGSKVSGKGVVSTPLVAKQDKTPYFVTLLGISVENTYLHFNGSSQNVEKGN 295
Query: 326 IF-DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRSFL 375
+F DSGT T L Y Q+ S K T DL + CY ++ L
Sbjct: 296 MFLDSGTPPTILPTQLYDQVVAQVRSEVAMKPVTDDPDLGPQLCYRTKNNL 346
>gi|255566835|ref|XP_002524401.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223536362|gb|EEF38012.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 455
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 80/285 (28%), Positives = 119/285 (41%), Gaps = 50/285 (17%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ ++ +G P + ++ DTGSDL W+ C C +C H S+ + S+T
Sbjct: 86 YFVSLRIGTPPQTLLLVADTGSDLIWVKCSPCRNCSHRSPGSA--------FFARHSTTY 137
Query: 164 SKVPCNSTLCELQKQCPSAGSN-------CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
S + C S C+L N C YQ Y +D + +TGF ++ L L T +
Sbjct: 138 SAIHCYSPQCQLVPHPHPNPCNRTRLHSPCRYQYTY-ADSSTTTGFFSKEALTLNTSTGK 196
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAA---PNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
K ++ +SFGCG +G L GA+ G+ GLG S S L + + FS C
Sbjct: 197 VKKLNG-LSFGCGFRISGPSLTGASFEGAQGVMGLGRAPISFSSQLGRR--FGSKFSYCL 253
Query: 274 GS-------------DGTGRISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAV 319
G ++ KG TP + PT Y I I V V G +
Sbjct: 254 MDYTLSPPPTSFLTIGGAQNVAVSKKGI--MSFTPLLINPLSPTFYYIAIKGVYVNGVKL 311
Query: 320 NFEFSA-----------IFDSGTSFTYLNDPAYTQISETFNSLAK 353
S I DSGT+ T++ +PAYT+I + F K
Sbjct: 312 PINPSVWSIDDLGNGGTIIDSGTTLTFITEPAYTEILKAFKKRVK 356
>gi|224083514|ref|XP_002307058.1| predicted protein [Populus trichocarpa]
gi|222856507|gb|EEE94054.1| predicted protein [Populus trichocarpa]
Length = 376
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 89/285 (31%), Positives = 129/285 (45%), Gaps = 40/285 (14%)
Query: 106 YTNVS--VGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
Y NV+ +GQP+ + + +DTGSDL WL CD CV C + P
Sbjct: 19 YYNVTLNIGQPSKPYFLDVDTGSDLTWLQCDAPCVQCTEAPH-------------PYYRP 65
Query: 162 TSSKVPCNSTLCEL-----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLA-TDEK 215
++ VPC +C+ +C + G C Y+V Y +DG S G LV D +L T EK
Sbjct: 66 RNNLVPCMDPICQSLHSNGDHRCENPGQ-CDYEVEY-ADGGSSFGVLVRDTFNLNFTSEK 123
Query: 216 QSKSVDSRISFG-CGRVQTGSFLDGAAP--NGLFGLGMDKTSVPSILANQGLIPNSFSMC 272
+ + ++ G CG Q F G+ +G+ GLG K+S+ S L++ GL+ N C
Sbjct: 124 RHSPL---LALGLCGYDQ---FPGGSHHPIDGVLGLGKGKSSIVSQLSSLGLVRNVIGHC 177
Query: 273 FGSDGTGRISFGDK--GSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE-FSAIFDS 329
G G + FGD S TP S H Y+ + +++ G F+ FDS
Sbjct: 178 LSGHGGGFLFFGDDLYDSSRVAWTPMSPDAKH--YSPGLAELTFDGKTTGFKNLLTTFDS 235
Query: 330 GTSFTYLNDPAYTQ-ISETFNSLAKEKRETSTSDLPFEYCYVLRS 373
G S+TYLN AY IS L+ + + D C+ R
Sbjct: 236 GASYTYLNSQAYQGLISLLKKELSGKPLREALDDQTLPLCWKGRK 280
>gi|22326716|ref|NP_196638.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|18700103|gb|AAL77663.1| AT5g10770/T30N20_40 [Arabidopsis thaliana]
gi|24111269|gb|AAN46758.1| At5g10770/T30N20_40 [Arabidopsis thaliana]
gi|332004211|gb|AED91594.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 474
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 90/287 (31%), Positives = 123/287 (42%), Gaps = 38/287 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ V +G P + DTGSDL W C CV + I++P+ S++
Sbjct: 132 YIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQKEP--------IFNPSKSTSY 183
Query: 164 SKVPCNSTLC-ELQKQCPSAG----SNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
V C+S C L +AG SNC Y ++Y D + S GFL ++ L +
Sbjct: 184 YNVSCSSAACGSLSSATGNAGSCSASNCIYGIQY-GDQSFSVGFLAKEKFTLTNSD---- 238
Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDG- 277
V + FGCG G F A GL GLG DK S PS A FS C S
Sbjct: 239 -VFDGVYFGCGENNQGLFTGVA---GLLGLGRDKLSFPSQTATA--YNKIFSYCLPSSAS 292
Query: 278 -TGRISFGDKG-SPGQGETPFS-LRQTHPTYNITITQVSVGGNAVNFE---FS---AIFD 328
TG ++FG G S TP S + Y + I ++VGG + FS A+ D
Sbjct: 293 YTGHLTFGSAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVFSTPGALID 352
Query: 329 SGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP-FEYCYVLRSF 374
SGT T L AY + +F AK + +TS + + C+ L F
Sbjct: 353 SGTVITRLPPKAYAALRSSFK--AKMSKYPTTSGVSILDTCFDLSGF 397
>gi|224143371|ref|XP_002324933.1| predicted protein [Populus trichocarpa]
gi|222866367|gb|EEF03498.1| predicted protein [Populus trichocarpa]
Length = 463
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 76/261 (29%), Positives = 111/261 (42%), Gaps = 37/261 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ NV +G P + DTGS L W C C +C + ++ P S++
Sbjct: 132 YIVNVGIGTPKKEMPLIFDTGSGLIWTQCKPCKACYPKV----------PVFDPTKSASF 181
Query: 164 SKVPCNSTLCELQKQ-CPSAGSNCPYQVRYLSDGTMSTGFLVEDVL---HLATDEKQSKS 219
+PC+S LC+ +Q C S C Y Y+ D + STG L + + HL D K
Sbjct: 182 KGLPCSSKLCQSIRQGCSSP--KCTYLTAYV-DNSSSTGTLATETISFSHLKYDFKN--- 235
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD--G 277
I GC +G L +G+ GL S+ S AN + FS C S
Sbjct: 236 ----ILIGCSDQVSGESL---GESGIMGLNRSPISLASQTAN--IYDKLFSYCIPSTPGS 286
Query: 278 TGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGG-----NAVNFEFSAIFDSGTS 332
TG ++FG K +P S Y+I +T +SVGG +A F+ ++ DSG
Sbjct: 287 TGHLTFGGKVPNDVRFSPVSKTAPSSDYDIKMTGISVGGRKLLIDASAFKIASTIDSGAV 346
Query: 333 FTYLNDPAYTQISETFNSLAK 353
T L AY+ + F + K
Sbjct: 347 LTRLPPKAYSALRSVFREMMK 367
>gi|115467014|ref|NP_001057106.1| Os06g0209100 [Oryza sativa Japonica Group]
gi|51091210|dbj|BAD35903.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113595146|dbj|BAF19020.1| Os06g0209100 [Oryza sativa Japonica Group]
gi|125554496|gb|EAZ00102.1| hypothetical protein OsI_22105 [Oryza sativa Indica Group]
Length = 454
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 76/272 (27%), Positives = 110/272 (40%), Gaps = 41/272 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+ +VSVG P + LDTGSDL W C C+ + V+D P SST +
Sbjct: 90 YLMHVSVGTPPRPVALTLDTGSDLVW--TQCAPCLDCFEQGAAPVLD-----PAASSTHA 142
Query: 165 KVPCNSTLCELQKQCPSAG-----SNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
+PC++ LC G +C Y Y D +++ G L D D+
Sbjct: 143 ALPCDAPLCRALPFTSCGGRSWGDRSCVYVYHY-GDRSLTVGQLATDSFTFGGDDNAGGL 201
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS---- 275
R++FGCG + G F A G+ G G + S+PS L SFS CF S
Sbjct: 202 AARRVTFGCGHINKGIF--QANETGIAGFGRGRWSLPSQLNV-----TSFSYCFTSMFDT 254
Query: 276 DGTGRISFGDKGSP----------GQGETPFSLRQ-THPT-YNITITQVSVGGNAV---- 319
+ ++ G + G T ++ + P+ Y + + +SVGG V
Sbjct: 255 KSSSVVTLGAAAAELLHTHHAAHTGDVRTTRLIKNPSQPSLYFVPLRGISVGGARVAVPE 314
Query: 320 -NFEFSAIFDSGTSFTYLNDPAYTQISETFNS 350
S I DSG S T L + Y + F S
Sbjct: 315 SRLRSSTIIDSGASITTLPEDVYEAVKAEFVS 346
>gi|356524287|ref|XP_003530761.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 481
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 97/331 (29%), Positives = 141/331 (42%), Gaps = 53/331 (16%)
Query: 65 HRDRYFRLR-----GRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFI 119
R +Y + R GR + + D T L +G+ N ++ V +G P
Sbjct: 96 ERVKYIQSRLSKNLGRENSVKELDSTTLPAKSGSLIGSAN-----YFVVVGLGTPKRDLS 150
Query: 120 VALDTGSDLFWLPCD-CV-SCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE--- 174
+ DTGSDL W C+ C SC ++ I+ P+ SS+ + C S+LC
Sbjct: 151 LVFDTGSDLTWTQCEPCAGSCYKQQDA---------IFDPSKSSSYINITCTSSLCTQLT 201
Query: 175 ---LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL-ATDEKQSKSVDSRISFGCGR 230
++ +C S+ + C Y ++Y D + S GFL ++ L + ATD VD + FGCG+
Sbjct: 202 SAGIKSRCSSSTTACIYGIQY-GDKSTSVGFLSQERLTITATD-----IVDDFL-FGCGQ 254
Query: 231 VQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT--GRISFGDKGS 288
G F A GL GLG S + + FS C S + G ++FG +
Sbjct: 255 DNEGLFSGSA---GLIGLGRHPISF--VQQTSSIYNKIFSYCLPSTSSSLGHLTFGASAA 309
Query: 289 PGQG--ETPFSLRQTHPT-YNITITQVSVGGNAV----NFEFSA---IFDSGTSFTYLND 338
TP S T Y + I +SVGG + + FSA I DSGT T L
Sbjct: 310 TNANLKYTPLSTISGDNTFYGLDIVGISVGGTKLPAVSSSTFSAGGSIIDSGTVITRLAP 369
Query: 339 PAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
AY + F EK + D F+ CY
Sbjct: 370 TAYAALRSAFRQ-GMEKYPVANEDGLFDTCY 399
>gi|242091327|ref|XP_002441496.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
gi|241946781|gb|EES19926.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
Length = 466
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 92/342 (26%), Positives = 127/342 (37%), Gaps = 54/342 (15%)
Query: 65 HRDRYFR------LRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSF 118
HR Y R RGR A G + S+G T ++ VG PA F
Sbjct: 60 HRHAYIRSQLASSRRGRRAAEVGASAFAMPLSSGAYTGTGQ-----YFVRFRVGTPAQPF 114
Query: 119 IVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQ-- 176
++ DTGSDL W+ C G + S ++ S + + + C+S C
Sbjct: 115 VLVADTGSDLTWVKCRGAGAAAGTGAGSPA----RVFRTAASKSWAPIACSSDTCTSYVP 170
Query: 177 ---KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR---------- 223
C S S C Y RY DG+ + G + D +A +
Sbjct: 171 FSLANCSSPASPCAYDYRY-RDGSAARGVVGTDSATIALSSGSGRGGGDSSGGRRAKLQG 229
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-----GSDGT 278
+ GC G + +G+ LG S S A + FS C + T
Sbjct: 230 VVLGCAATYDGQSFQSS--DGVLSLGNSNISFASRAAAR--FGGRFSYCLVDHLAPRNAT 285
Query: 279 GRISFGDKGSPGQGETPFSL-RQTHPTYNITITQVSVGGNA---------VNFEFSAIFD 328
++FG + +TP L R+ P Y +T+ V V G A V+ AI D
Sbjct: 286 SYLTFGPGATAPAAQTPLLLDRRMTPFYAVTVDAVYVAGEALDIPADVWDVDRNGGAILD 345
Query: 329 SGTSFTYLNDPAYTQISETFNS-LAKEKRETSTSDLPFEYCY 369
SGTS T L PAY + + LA R T PFEYCY
Sbjct: 346 SGTSLTILATPAYRAVVTALSKHLAGLPRVTMD---PFEYCY 384
>gi|15238250|ref|NP_196637.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|8979710|emb|CAB96831.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
thaliana]
gi|18176136|gb|AAL59990.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
gi|22136986|gb|AAM91722.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
gi|110740988|dbj|BAE98588.1| nucleoid DNA-binding protein cnd41 - like protein [Arabidopsis
thaliana]
gi|332004210|gb|AED91593.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 464
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 85/276 (30%), Positives = 119/276 (43%), Gaps = 33/276 (11%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ + +G P + DTGSDL W C+ C+ + S + FN P++SST
Sbjct: 132 YIVTIGIGTPKHDLSLVFDTGSDLTWTQCEPCLGSCY-----SQKEPKFN---PSSSSTY 183
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
V C+S +CE + C + SNC Y + Y D + + GFL ++ L + V
Sbjct: 184 QNVSCSSPMCEDAESC--SASNCVYSIVY-GDKSFTQGFLAKEKFTLTNSD-----VLED 235
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDGTGR 280
+ FGCG G F A GL + + + N N FS C F S+ TG
Sbjct: 236 VYFGCGENNQGLFDGVAGLLGLGPGKLSLPAQTTTTYN-----NIFSYCLPSFTSNSTGH 290
Query: 281 ISFGDKG-SPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF---EFS---AIFDSGTSF 333
++FG G S TP S + Y I I +SVG + FS AI DSGT F
Sbjct: 291 LTFGSAGISESVKFTPISSFPSAFNYGIDIIGISVGDKELAITPNSFSTEGAIIDSGTVF 350
Query: 334 TYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
T L Y ++ F + TS L F+ CY
Sbjct: 351 TRLPTKVYAELRSVFKEKMSSYKSTSGYGL-FDTCY 385
>gi|224081804|ref|XP_002306494.1| predicted protein [Populus trichocarpa]
gi|222855943|gb|EEE93490.1| predicted protein [Populus trichocarpa]
Length = 564
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 67/252 (26%), Positives = 109/252 (43%), Gaps = 29/252 (11%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
T + +G P F + +DTGS + ++PC C C + + P+ SST
Sbjct: 15 TRLWIGTPPQRFALIVDTGSSVTYVPCSSCEQCGRHQDPK---------FQPDLSSTYQS 65
Query: 166 VPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
V CN + C C Y+ +Y ++ + S+G L ED++ S R
Sbjct: 66 VKCN-----IDCNCDDEKQQCVYERQY-AEMSTSSGVLGEDIISFGN---LSALAPQRAV 116
Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGD 285
FGC ++TG A +G+ G+G S+ L ++G+I +SFS+C+G G G +
Sbjct: 117 FGCENMETGDLYSQHA-DGIMGMGRGDLSIVDHLVDKGVINDSFSLCYGGMGIGGGAMVL 175
Query: 286 KGSPGQGETPFSLRQ--THPTYNITITQVSVGGNAVNF-------EFSAIFDSGTSFTYL 336
G FS P YNI + ++ V G + + I DSGT++ YL
Sbjct: 176 GGISPPSNMVFSQSDPVRSPYYNIDLKEIHVAGKPLPLNPTVFDGKHGTILDSGTTYAYL 235
Query: 337 NDPAYTQISETF 348
+ A+ +
Sbjct: 236 PEAAFVSFKDAI 247
>gi|24417232|gb|AAN60226.1| unknown [Arabidopsis thaliana]
Length = 464
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 89/297 (29%), Positives = 126/297 (42%), Gaps = 44/297 (14%)
Query: 84 KTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGL 142
K+ +T +GN + + +G P + DTGSDL W C+ C+ +
Sbjct: 122 KSGITLGSGN-----------YIVTIGIGTPKHDLSLVFDTGSDLTWTQCEPCLGSCY-- 168
Query: 143 NSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGF 202
S + FN P++SST V C+S +CE + C + SNC Y + Y D + + GF
Sbjct: 169 ---SQKEPKFN---PSSSSTYQNVSCSSPMCEDAESC--SASNCVYSIGY-GDKSFTQGF 219
Query: 203 LVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQ 262
L ++ L + V + FGCG G F A GL + + + N
Sbjct: 220 LAKEKFTLTNSD-----VLEDVYFGCGENNQGLFDGVAGLLGLGPGKLSLPAQTTTTYN- 273
Query: 263 GLIPNSFSMC---FGSDGTGRISFGDKG-SPGQGETPFSLRQTHPTYNITITQVSVGGNA 318
N FS C F S+ TG ++FG G S TP S + Y I I +SVG
Sbjct: 274 ----NIFSYCLPSFTSNSTGHLTFGSAGISESVKFTPISSFPSAFNYGIDIIGISVGDKE 329
Query: 319 VNF---EFS---AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
+ FS AI DSGT FT L Y ++ F + TS L F+ CY
Sbjct: 330 LAITPNSFSTEGAIIDSGTVFTRLPTKVYAELRSVFKEKMSSYKSTSGYGL-FDTCY 385
>gi|302769978|ref|XP_002968408.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
gi|300164052|gb|EFJ30662.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
Length = 492
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 77/285 (27%), Positives = 124/285 (43%), Gaps = 45/285 (15%)
Query: 105 HYTN-VSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+YT+ V +G P F + +DTGS + ++PC SC H N + +SP SS+
Sbjct: 34 YYTSRVKIGTPPHEFSLIVDTGSTVTYVPCS--SCTHCGNHQDPR------FSPALSSSY 85
Query: 164 SKVPCNST----LCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
+ C S C+ ++ YQ +Y T S+G L +DV+ + S
Sbjct: 86 KPLECGSECSTGFCDGSRK---------YQRQYAEKST-SSGVLGKDVIGFS---NSSDL 132
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDG 277
R+ FGC +TG D A +G+ GLG S+ L + + + FS+C+G +G
Sbjct: 133 GGQRLVFGCETAETGDLYDQTA-DGIIGLGRGPLSIIDQLVEKNAMEDVFSLCYGGMDEG 191
Query: 278 TGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-------EFSAIFDSG 330
G + G P S P YN+ + + VGG+ + ++ + DSG
Sbjct: 192 GGAMILGGFQPPKDMVFTASDPHRSPYYNLMLKGIRVGGSPLRLKPEVFDGKYGTVLDSG 251
Query: 331 TSFTYLNDPAYTQISETFNSLAKEK----RETSTSDLPF-EYCYV 370
T++ Y A+ + F S KE+ +E D F + CY
Sbjct: 252 TTYAYFPGAAF----QAFKSAVKEQVGSLKEVPGPDEKFKDICYA 292
>gi|224142011|ref|XP_002324354.1| predicted protein [Populus trichocarpa]
gi|222865788|gb|EEF02919.1| predicted protein [Populus trichocarpa]
Length = 471
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 84/304 (27%), Positives = 126/304 (41%), Gaps = 25/304 (8%)
Query: 56 SFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPA 115
+F + + RD+ R++ N T F+ G + V +G P
Sbjct: 84 TFPSAAEILRRDQ-LRVKSIRAKHSMNSSTTGVFNEMKTRVPTTHFGGGYAVTVGLGTPK 142
Query: 116 LSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCEL 175
F + DTGSDL W C+ C G + + D + + + S PC S E
Sbjct: 143 KDFSLLFDTGSDLTWTQCE--PCSGGCFPQNDEKFDPTKSTSYKNLSCSSEPCKSIGKES 200
Query: 176 QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGS 235
+ C S+ S C Y V+Y + T+ GFL + L + + V GCG G
Sbjct: 201 AQGCSSSNS-CLYGVKYGTGYTV--GFLATETLTITPSD-----VFENFVIGCGERNGGR 252
Query: 236 FLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--GSDGTGRISFGDKGSPGQGE 293
F A GL GLG ++PS ++ N FS C S TG +SFG S
Sbjct: 253 FSGTA---GLLGLGRSPVALPSQTSST--YKNLFSYCLPASSSSTGHLSFGGGVSQAAKF 307
Query: 294 TPFSLRQTHPTYNITITQVSVGGNAVNFEFSA------IFDSGTSFTYLNDPAYTQISET 347
TP + + Y + ++ +SVGG + + S I DSGT+ TYL A++ +S
Sbjct: 308 TPIT-SKIPELYGLDVSGISVGGRKLPIDPSVFRTAGTIIDSGTTLTYLPSTAHSALSSA 366
Query: 348 FNSL 351
F +
Sbjct: 367 FQEM 370
>gi|343198386|gb|AEM05966.1| nodulin 41 [Phaseolus vulgaris]
Length = 437
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 79/275 (28%), Positives = 113/275 (41%), Gaps = 28/275 (10%)
Query: 111 VGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCN 169
+G P + + DTGSDL W+ C C SC ++ P SST C
Sbjct: 96 IGTPPVERLATADTGSDLIWVQCSPCASCFPQSTP---------LFQPLKSSTFMPTTCR 146
Query: 170 STLCEL----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
S C L QK C +G C Y +Y + S G L + L +
Sbjct: 147 SQPCTLLLPEQKGCGKSG-ECIYTYKYGDQYSFSEGLLSTETLRFDSQGGVQTVAFPNSF 205
Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDGTGRIS 282
FGCG + G+ GLG S+ S + +Q I + FS C GS T ++
Sbjct: 206 FGCGLYNNITVFPSYKLTGIMGLGAGPLSLVSQIGDQ--IGHKFSYCLLPLGSTSTSKLK 263
Query: 283 FGDKG---SPGQGETPFSLRQTHPTYN-ITITQVSVGGNAV---NFEFSAIFDSGTSFTY 335
FG++ G TP ++ PTY + + V+V V + + + I DSGT TY
Sbjct: 264 FGNESIITGEGVVSTPMIIKPWLPTYYFLNLEAVTVAQKTVPTGSTDGNVIIDSGTLLTY 323
Query: 336 LNDPAYTQISETF-NSLAKEKRETSTSDLPFEYCY 369
L + Y + + SLA E + S LPF + Y
Sbjct: 324 LGESFYYNFAASLQESLAVELVQDVLSPLPFCFPY 358
>gi|356531224|ref|XP_003534178.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 492
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 84/294 (28%), Positives = 123/294 (41%), Gaps = 41/294 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+++ V VGQP+ F + LDTGSD+ WL C C C + I+ P SS+
Sbjct: 157 YFSRVGVGQPSKPFYMVLDTGSDVNWLQCKPCSDCYQQSDP---------IFDPTASSSY 207
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
+ + C++ C+ + C YQV Y DG+ + G V + + + SV+ R
Sbjct: 208 NPLTCDAQQCQDLEMSACRNGKCLYQVSY-GDGSFTVGEYVTETVSFG-----AGSVN-R 260
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISF 283
++ GCG G F+ A GL G + TS + SFS C +G+ S
Sbjct: 261 VAIGCGHDNEGLFVGSAGLLGLGGGPLSLTS--------QIKATSFSYCLVDRDSGKSST 312
Query: 284 GDKGSPGQGET---PFSLRQTHPT-YNITITQVSVGGNAVNFEFS-----------AIFD 328
+ SP G++ P Q T Y + +T VSVGG V I D
Sbjct: 313 LEFNSPRPGDSVVAPLLKNQKVNTFYYVELTGVSVGGEIVTVPPETFAVDQSGAGGVIVD 372
Query: 329 SGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRSFLHLQALVV 382
SGT+ T L AY + + F R L F+ CY L S ++ V
Sbjct: 373 SGTAITRLRTQAYNSVRDAFKRKTSNLRPAEGVAL-FDTCYDLSSLQSVRVPTV 425
>gi|116831531|gb|ABK28718.1| unknown [Arabidopsis thaliana]
Length = 438
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 80/290 (27%), Positives = 118/290 (40%), Gaps = 47/290 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC----DCVSCVHGLNSSSGQVIDFNIYSPNTS 160
+ NVS+G P + DTGSDL W C DC + V L + P TS
Sbjct: 90 YLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPL------------FDPKTS 137
Query: 161 STSSKVPCNSTLC---ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
ST V C+S+ C E Q C + + C Y + Y D + + G + D L L + + +
Sbjct: 138 STYKDVSCSSSQCTALENQASCSTNDNTCSYSLSY-GDNSYTKGNIAVDTLTLGSSDTRP 196
Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQ--GLIPNSFSMCF-- 273
+ I GCG G+F N + P L Q I FS C
Sbjct: 197 MQL-KNIIIGCGHNNAGTF------NKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVP 249
Query: 274 ---GSDGTGRISFGDKG---SPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNF----- 321
D T +I+FG G TP + + T Y +T+ +SVG + +
Sbjct: 250 LTSKKDQTSKINFGTNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQYSGSDS 309
Query: 322 ---EFSAIFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEY 367
E + I DSGT+ T L Y+++ + +S+ EK++ S L Y
Sbjct: 310 ESSEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSLCY 359
>gi|15242803|ref|NP_198319.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75130158|sp|Q6XBF8.1|CDR1_ARATH RecName: Full=Aspartic proteinase CDR1; AltName: Full=Protein
CONSTITUTIVE DISEASE RESISTANCE 1; Flags: Precursor
gi|37935737|gb|AAP72988.1| CDR1 [Arabidopsis thaliana]
gi|91806924|gb|ABE66189.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|109946613|gb|ABG48485.1| At5g33340 [Arabidopsis thaliana]
gi|332006513|gb|AED93896.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 437
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 80/290 (27%), Positives = 118/290 (40%), Gaps = 47/290 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC----DCVSCVHGLNSSSGQVIDFNIYSPNTS 160
+ NVS+G P + DTGSDL W C DC + V L + P TS
Sbjct: 90 YLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPL------------FDPKTS 137
Query: 161 STSSKVPCNSTLC---ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
ST V C+S+ C E Q C + + C Y + Y D + + G + D L L + + +
Sbjct: 138 STYKDVSCSSSQCTALENQASCSTNDNTCSYSLSY-GDNSYTKGNIAVDTLTLGSSDTRP 196
Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQ--GLIPNSFSMCF-- 273
+ I GCG G+F N + P L Q I FS C
Sbjct: 197 MQL-KNIIIGCGHNNAGTF------NKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVP 249
Query: 274 ---GSDGTGRISFGDKG---SPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNF----- 321
D T +I+FG G TP + + T Y +T+ +SVG + +
Sbjct: 250 LTSKKDQTSKINFGTNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQYSGSDS 309
Query: 322 ---EFSAIFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEY 367
E + I DSGT+ T L Y+++ + +S+ EK++ S L Y
Sbjct: 310 ESSEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSLCY 359
>gi|242053991|ref|XP_002456141.1| hypothetical protein SORBIDRAFT_03g031170 [Sorghum bicolor]
gi|241928116|gb|EES01261.1| hypothetical protein SORBIDRAFT_03g031170 [Sorghum bicolor]
Length = 519
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 91/337 (27%), Positives = 129/337 (38%), Gaps = 83/337 (24%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-----DCVSCVHGLNSSS------------- 146
++ VG PA F++ DTGSDL W+ C D + +G + +
Sbjct: 107 YFVRFRVGTPARPFLLVADTGSDLTWVKCHRHDHDAPAPGYGYAAPASNDSSTSSLSAAA 166
Query: 147 -GQVIDFNIYSPNTSSTSSKVPCNSTLCELQ-----KQCPSAGSNCPYQVRYLSDGTMST 200
++ P+ S T + +PC+S C CP+ GS C Y RY DG+ +
Sbjct: 167 ASSSSHARVFRPDRSRTWAPIPCSSDTCTASLPFSLAACPTPGSPCAYDYRY-KDGSAAR 225
Query: 201 GFLVEDVLHLA-----TDEKQSKSVDSRISFGCGRVQTG-SFLDGAAPNGLFGLGMDKTS 254
G + D +A +KQ ++ + GC TG SFL A +G+ LG S
Sbjct: 226 GTVGTDSATIALSGRGAKKKQRQAKLRGVVLGCTTSYTGDSFL---ASDGVLSLGYSNIS 282
Query: 255 VPSILANQGLIPNSFSMCF-----GSDGTGRISFGDKGSPGQGETPFS------------ 297
S A + FS C + T ++FG +P +P S
Sbjct: 283 FASRAAAR--FGGRFSYCLVDHLAPRNATSYLTFGP--NPAVSSSPPSKTACAGGGSPAA 338
Query: 298 -------LRQT--------HPTYNITITQVSVGGNAVNFEF---------SAIFDSGTSF 333
RQT P Y +T+ +SV G + AI DSGTS
Sbjct: 339 APPGPGGARQTPLLLDHRMRPFYAVTVNGISVDGELLRIPRLVWDVAKGGGAILDSGTSL 398
Query: 334 TYLNDPAYTQISETFN-SLAKEKRETSTSDLPFEYCY 369
T L PAY + N LA R T PF+YCY
Sbjct: 399 TVLVSPAYRAVVAALNKKLAGLPRVTMD---PFDYCY 432
>gi|294461757|gb|ADE76437.1| unknown [Picea sitchensis]
Length = 325
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 76/258 (29%), Positives = 117/258 (45%), Gaps = 27/258 (10%)
Query: 122 LDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC-ELQKQC 179
+DTGSD+ W+ CD C C +S ++ P S+T +PCNST+C +LQ
Sbjct: 5 IDTGSDITWIQCDPCPQCYKQQDS---------LFQPAGSATYKPLPCNSTMCQQLQSFS 55
Query: 180 PSA-GSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLD 238
S S+C Y V Y D + + G + L L +D+ SV +FGCG G F +
Sbjct: 56 HSCLNSSCNYMVSY-GDKSTTRGDFALETLTLRSDDTILVSV-PNFAFGCGHANKGLF-N 112
Query: 239 GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDG----TGRISFGDKGSPGQGE- 293
GAA GL GLG P+ FS C S +G + FG+
Sbjct: 113 GAA--GLMGLGKSSIGFPA--QTSVAFGKVFSYCLPSVSSTIPSGILHFGEAAMLDYDVR 168
Query: 294 -TPFSLRQTHPT-YNITITQVSVGGNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSL 351
TP + P+ Y +++T ++VG + + + DSGT + AY ++ + F +
Sbjct: 169 FTPLVDSSSGPSQYFVSMTGINVGDELLPISATVMVDSGTVISRFEQSAYERLRDAFTQI 228
Query: 352 AKEKRETSTSDLPFEYCY 369
+T+ S PF+ C+
Sbjct: 229 LP-GLQTAVSVAPFDTCF 245
>gi|359497446|ref|XP_003635520.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
vinifera]
Length = 354
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 76/268 (28%), Positives = 117/268 (43%), Gaps = 36/268 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+Y V +G PA + + +DTGS L WL C CV H V ++ P+ S T
Sbjct: 13 YYVKVGLGSPARYYSMIVDTGSSLSWLQCKPCVVYCH--------VQADPLFDPSASKTY 64
Query: 164 SKVPCNSTLCEL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
+ C S+ C C ++ + C Y Y D + S G+L +D+L LA +
Sbjct: 65 KSLSCTSSQCSSLVDATLNNPLCETSSNVCVYTASY-GDSSYSMGYLSQDLLTLAPSQTL 123
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD 276
V +GCG+ G F A G+ GLG +K S+ ++++ +FS C +
Sbjct: 124 PGFV-----YGCGQDSEGLFGRAA---GILGLGRNKLSMLGQVSSK--FGYAFSYCLPTR 173
Query: 277 GTGR-ISFGDKGSPGQGE--TPFSLRQTHPT-YNITITQVSVGGNAV-----NFEFSAIF 327
G G +S G G TP + +P+ Y + +T ++VGG A+ + I
Sbjct: 174 GGGGFLSIGKASLAGSAYKFTPMTTDPGNPSLYFLRLTAITVGGRALGVAAAQYRVPTII 233
Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEK 355
DSGT T L YT + F + K
Sbjct: 234 DSGTVITRLPMSVYTPFQQAFVKIMSSK 261
>gi|225216973|gb|ACN85264.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
Length = 517
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 83/260 (31%), Positives = 112/260 (43%), Gaps = 34/260 (13%)
Query: 99 NSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYS 156
+LG +Y V +G PA + V DTGSD W+ C CV + ++
Sbjct: 171 RALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQQE--------KLFD 222
Query: 157 PNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
P SST + V C + C +G +C Y V+Y DG+ S GF D L L++ +
Sbjct: 223 PVRSSTYANVSCAAPACSDLNIHGCSGGHCLYGVQY-GDGSYSIGFFAMDTLTLSSYDAV 281
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--G 274
FGCG G F + A GL GLG KTS+P ++ F+ C
Sbjct: 282 KG-----FRFGCGERNEGLFGEAA---GLLGLGRGKTSLPVQTYDK--YGGVFAHCLPAR 331
Query: 275 SDGTGRISFGDKGSPGQG---ETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA----- 325
S GTG + FG TP L PT Y I +T + VGG ++ S
Sbjct: 332 STGTGYLDFGAGSPAAASARLTTPM-LTDNGPTFYYIGMTGIRVGGQLLSIPQSVFATAG 390
Query: 326 -IFDSGTSFTYLNDPAYTQI 344
I DSGT T L PAY+ +
Sbjct: 391 TIVDSGTVITRLPPPAYSSL 410
>gi|15230868|ref|NP_189198.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|11994761|dbj|BAB03090.1| chloroplast nucleoid DNA binding protein-like; nucellin-like
protein [Arabidopsis thaliana]
gi|189339286|gb|ACD89063.1| At3g25700 [Arabidopsis thaliana]
gi|332643533|gb|AEE77054.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 452
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 74/290 (25%), Positives = 116/290 (40%), Gaps = 63/290 (21%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ ++ +GQP S ++ DTGSDL W+ C C +C H ++ ++ P SST
Sbjct: 84 YFVDLRIGQPPQSLLLIADTGSDLVWVKCSACRNCSHHSPAT--------VFFPRHSSTF 135
Query: 164 SKVPCNSTLCELQKQCPSAG--------SNCPYQVRYLSDGTMSTGFLVEDVLHLATDEK 215
S C +C L + A S C Y+ Y +DG++++G + L T
Sbjct: 136 SPAHCYDPVCRLVPKPDRAPICNHTRIHSTCHYEYGY-ADGSLTSGLFARETTSLKTSSG 194
Query: 216 QSKSVDSRISFGCGRVQTGSFLDGAA---PNGLFGLGMDKTSVPSILANQGLIPNSFSMC 272
+ + S ++FGCG +G + G + NG+ GLG S S L + N FS C
Sbjct: 195 KEARLKS-VAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQLGRR--FGNKFSYC 251
Query: 273 F-----------------GSDGTGRISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSV 314
G DG ++ F TP PT Y + + V V
Sbjct: 252 LMDYTLSPPPTSYLIIGNGGDGISKLFF----------TPLLTNPLSPTFYYVKLKSVFV 301
Query: 315 GGNAVNFEFS-----------AIFDSGTSFTYLNDPAYTQISETFNSLAK 353
G + + S + DSGT+ +L +PAY + K
Sbjct: 302 NGAKLRIDPSIWEIDDSGNGGTVVDSGTTLAFLAEPAYRSVIAAVRRRVK 351
>gi|242091325|ref|XP_002441495.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
gi|241946780|gb|EES19925.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
Length = 466
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 75/293 (25%), Positives = 114/293 (38%), Gaps = 55/293 (18%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
++ + VG P F + DTGSDL W+ C +G ++ P TS + +
Sbjct: 116 YFVKLRVGTPVQEFTLVADTGSDLTWVKC------------AGASPPGRVFRPKTSRSWA 163
Query: 165 KVPCNSTLCELQ-----KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
+PC+S C+L C S S C Y RY + G + + +A +
Sbjct: 164 PIPCSSDTCKLDVPFTLANCSSPASPCTYDYRYKEGSAGARGIVGTESATIALPGGKVAQ 223
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-----G 274
+ + GC G A +G+ LG K S + A + SFS C
Sbjct: 224 LKD-VVLGCSSSHDGQSFRSA--DGVLSLGNAKISFATQAAAR--FGGSFSYCLVDHLAP 278
Query: 275 SDGTGRISFGDKGSPGQ------GETPFSLRQTHPTYNITITQVSVGGNAVNFEFS---- 324
+ TG ++FG PGQ +T L P Y + + + V G A++
Sbjct: 279 RNATGYLAFG----PGQVPRTPATQTKLFLDPEMPFYGVKVDAIHVAGKALDIPAEVWDA 334
Query: 325 ----AIFDSGTSFTYLNDPAYTQI----SETFNSLAKEKRETSTSDLPFEYCY 369
I DSG + T L PAY + S+ + + K S PFE+CY
Sbjct: 335 KSGGVILDSGNTLTVLAAPAYKAVVAALSKHLDGVPK------VSFPPFEHCY 381
>gi|255544139|ref|XP_002513132.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223548143|gb|EEF49635.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 481
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 78/297 (26%), Positives = 121/297 (40%), Gaps = 43/297 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ + VG P V +D+GSD+ W+ C C C H + ++ P S++
Sbjct: 142 YFIRIGVGSPPREQYVVIDSGSDIVWVQCQPCTQCYHQTDP---------VFDPADSASF 192
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
VPC+S++CE + C Y+V Y DG+ + G L + L ++V
Sbjct: 193 MGVPCSSSVCERIENAGCHAGGCRYEVMY-GDGSYTKGTLALETLTFG------RTVVRN 245
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQ--GLIPNSFSMCF---GSDGT 278
++ GCG G F+ A GL G M L Q G +FS C G+D
Sbjct: 246 VAIGCGHRNRGMFVGAAGLLGLGGGSMS-------LVGQLGGQTGGAFSYCLVSRGTDSA 298
Query: 279 GRISFGDKGSP-GQGETPFSLRQTHPT-YNITITQVSVGGNAVN-----FEFS------A 325
G + FG P G P P+ Y I ++ V VGG V F+ +
Sbjct: 299 GSLEFGRGAMPVGAAWIPLIRNPRAPSFYYIRLSGVGVGGMKVPISEDVFQLNEMGNGGV 358
Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRSFLHLQALVV 382
+ D+GT+ T + AY + F S + F+ CY L F+ ++ V
Sbjct: 359 VMDTGTAVTRIPTVAYVAFRDAFIGQTGNLPRASGVSI-FDTCYNLNGFVSVRVPTV 414
>gi|326517992|dbj|BAK07248.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 500
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 82/297 (27%), Positives = 127/297 (42%), Gaps = 43/297 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+++ V VG+PA + LDTGSD+ WL C C C + +Y P+ S++
Sbjct: 163 YFSRVGVGRPARQLYMVLDTGSDVTWLQCQPCADCYAQSDP---------VYDPSVSTSY 213
Query: 164 SKVPCNSTLCE--LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
+ V C+S C C ++ +C Y+V Y DG+ + G + L L S
Sbjct: 214 ATVGCDSPRCRDLDAAACRNSTGSCLYEVAY-GDGSYTVGDFATETLTLGDSAPVSN--- 269
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGT 278
++ GCG G F+ A L G + S PS ++ +FS C S +
Sbjct: 270 --VAIGCGHDNEGLFVGAAGLLALGGGPL---SFPSQISA-----TTFSYCLVDRDSPSS 319
Query: 279 GRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA-----------IF 327
+ FGD P +T+ Y + ++ +SVGG A++ SA I
Sbjct: 320 STLQFGDSEQPAVTAPLIRSPRTNTFYYVALSGISVGGEALSIPSSAFAMDDAGSGGVIV 379
Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVL--RSFLHLQALVV 382
DSGT+ T L AY + E F + S L F+ CY L RS + + A+ +
Sbjct: 380 DSGTAVTRLQSGAYGALREAFVQGTQSLPRASGVSL-FDTCYDLAGRSSVQVPAVAL 435
>gi|212274713|ref|NP_001130791.1| uncharacterized protein LOC100191895 precursor [Zea mays]
gi|194690124|gb|ACF79146.1| unknown [Zea mays]
gi|194708040|gb|ACF88104.1| unknown [Zea mays]
gi|223950469|gb|ACN29318.1| unknown [Zea mays]
gi|414885521|tpg|DAA61535.1| TPA: hypothetical protein ZEAMMB73_650724 [Zea mays]
Length = 500
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 97/352 (27%), Positives = 143/352 (40%), Gaps = 57/352 (16%)
Query: 35 HHRYS-DPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQ---GNDKTPLTFS 90
HH +S P D A S+L R ++RL +A+ K + S
Sbjct: 74 HHSFSPAPANSREEEADALLSTDAARVSSLQGRIEHYRLTTTSSSAEVAVTASKAQVPVS 133
Query: 91 AGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQV 149
+G RL +L ++ + G+ V +DT S+L W+ C C SC + G +
Sbjct: 134 SGA---RLRTLNYVATVGLGGGEA----TVIVDTASELTWVQCAPCESC----HDQQGPL 182
Query: 150 IDFNIYSPNTSSTSSKVPCNSTLCE-LQKQCPSAG------------SNCPYQVRYLSDG 196
D P++S + + VPC+S C+ LQ+Q + + C Y + Y DG
Sbjct: 183 FD-----PSSSPSYAAVPCDSPSCDALQQQLATGAGAGAPPCDAGRPAACSYALSY-RDG 236
Query: 197 TMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVP 256
+ S G L D L LA + +D + FGCG G G +GL GLG + S+
Sbjct: 237 SYSRGVLAHDRLSLA-----GEVIDGFV-FGCGTSNQGPPFGGT--SGLMGLGRSQLSLV 288
Query: 257 SILANQ--GLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQ---------THPTY 305
S +Q G+ + SD +G + GD S + TP P Y
Sbjct: 289 SQTVDQFGGVFSYCLPLSRESDASGSLVLGDDPSAYRNSTPVVYTSMVSNSDPLLQGPFY 348
Query: 306 NITITQVSVGGNAVN---FEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKE 354
+ +T ++VGG V F AI DSGT T L Y + F S E
Sbjct: 349 LVNLTGITVGGQEVESTGFSARAIVDSGTVITSLVPSVYNAVRAEFMSQLAE 400
>gi|50508279|dbj|BAD32128.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|125600536|gb|EAZ40112.1| hypothetical protein OsJ_24555 [Oryza sativa Japonica Group]
Length = 455
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 84/294 (28%), Positives = 125/294 (42%), Gaps = 57/294 (19%)
Query: 108 NVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
N+S+G P L F V +DTGS+L W C C C + + P SST S++
Sbjct: 94 NISLGTPPLDFPVIVDTGSNLIWAQCAPCTRCFP-------RPTPAPVLQPARSSTFSRL 146
Query: 167 PCNSTLCEL-----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
PCN + C+ + + +A + C Y Y S T G+L + L +
Sbjct: 147 PCNGSFCQYLPTSSRPRTCNATAACAYNYTYGSGYT--AGYLATETLTVG------DGTF 198
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD----G 277
+++FGC T + +D ++ G+ GLG S+ S LA FS C SD G
Sbjct: 199 PKVAFGC---STENGVDNSS--GIVGLGRGPLSLVSQLAV-----GRFSYCLRSDMADGG 248
Query: 278 TGRISFGDKGSPGQG---------ETPFSLRQTHPTYNIT-----ITQVSVGGNAVNFEF 323
I FG +G + P+ R TH N+T T++ V G+ F
Sbjct: 249 ASPILFGSLAKLTEGSVVQSTPLLKNPYLQRSTHYYVNLTGIAVDSTELPVTGSTFGFTQ 308
Query: 324 SA-----IFDSGTSFTYLNDPAYTQISETFNS-LAKEKRETSTSDLPF--EYCY 369
+ I DSGT+ TYL Y + + F S +A + T S P+ + CY
Sbjct: 309 TGLGGGTIVDSGTTLTYLAKDGYAMVKQAFQSQMANLNQTTPASGAPYDLDLCY 362
>gi|165292434|dbj|BAF98915.1| aspartic proteinase nepenthesin I [Nepenthes alata]
Length = 437
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 86/288 (29%), Positives = 123/288 (42%), Gaps = 42/288 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+ N+S+G PA F +DTGSDL W C C N S+ I++P SS+ S
Sbjct: 95 YLMNLSIGTPAQPFSAIMDTGSDLIW--TQCQPCTQCFNQST------PIFNPQGSSSFS 146
Query: 165 KVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRI 224
+PC+S LC+ + + ++C Y Y DG+ + G + + L S S+ I
Sbjct: 147 TLPCSSQLCQALQSPTCSNNSCQYTYGY-GDGSETQGSMGTETLTFG-----SVSIP-NI 199
Query: 225 SFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGTGRI 281
+FGCG G F G GL G+G S+PS L FS C GS + +
Sbjct: 200 TFGCGENNQG-FGQGNGA-GLVGMGRGPLSLPSQLD-----VTKFSYCMTPIGSSNSSTL 252
Query: 282 ---SFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGG------------NAVNFEFSA 325
S + + G T PT Y IT+ +SVG N+ N
Sbjct: 253 LLGSLANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVGSTPLPIDPSVFKLNSNNGTGGI 312
Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRS 373
I DSGT+ TY D AY + + F S +S F+ C+ + S
Sbjct: 313 IIDSGTTLTYFVDNAYQAVRQAFISQMNLSVVNGSSS-GFDLCFQMPS 359
>gi|116787398|gb|ABK24493.1| unknown [Picea sitchensis]
Length = 479
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 94/324 (29%), Positives = 129/324 (39%), Gaps = 52/324 (16%)
Query: 82 NDKTPLTFSAGNDTYRLNS---------LGFLHY-TNVSVGQPALSFIVALDTGSDLFWL 131
ND+ +S N TY S +G +Y G PA + ++ +DTGSD+ W+
Sbjct: 105 NDRLNTIWSKNNGTYSTMSNLPLQPGSKVGTGNYIVTAGFGTPAKNSLLIIDTGSDVTWI 164
Query: 132 PCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC-ELQKQCPSAGSNCPYQ 189
C C C ++ I+ P SS+ + C S+ C EL C Y+
Sbjct: 165 QCKPCSDCYSQVDP---------IFEPQQSSSYKHLSCLSSACTELTTMNHCRLGGCVYE 215
Query: 190 VRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLG 249
+ Y DG+ S G ++ L L +D S +FGCG TG F A GL GLG
Sbjct: 216 INY-GDGSRSQGDFSQETLTLGSDSFPS------FAFGCGHTNTGLFKGSA---GLLGLG 265
Query: 250 MDKTSVPSILANQGLIPNSFSMCF----GSDGTGRISFGDKGSPGQGE-TPFSLRQTHPT 304
S PS + FS C S TG S G P P +P+
Sbjct: 266 RTALSFPS--QTKSKYGGQFSYCLPDFVSSTSTGSFSVGQGSIPATATFVPLVSNSNYPS 323
Query: 305 -YNITITQVSVGGN------AVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRE 357
Y + + +SVGG AV I DSGT T L AY + +F S K
Sbjct: 324 FYFVGLNGISVGGERLSIPPAVLGRGGTIVDSGTVITRLVPQAYDALKTSFRS----KTR 379
Query: 358 TSTSDLPF---EYCYVLRSFLHLQ 378
S PF + CY L S+ ++
Sbjct: 380 NLPSAKPFSILDTCYDLSSYSQVR 403
>gi|224096119|ref|XP_002310541.1| predicted protein [Populus trichocarpa]
gi|222853444|gb|EEE90991.1| predicted protein [Populus trichocarpa]
Length = 379
Score = 78.2 bits (191), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 83/276 (30%), Positives = 123/276 (44%), Gaps = 44/276 (15%)
Query: 102 GFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-----CVSCVHGLNSSSGQVIDFNIYS 156
GF + T +++GQP+ + + +DTGSDL WL CD C H Y
Sbjct: 18 GFYNVT-LNIGQPSKPYFLDVDTGSDLTWLQCDVPRAQCTEAPH------------PYYK 64
Query: 157 PNTSSTSSKVP-CNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLA-TDE 214
P+ + + K P C S ++C + G C Y+V Y +DG S G LV+D +L T E
Sbjct: 65 PSNNLVACKDPICQSLHTGGDQRCENPG-QCDYEVEY-ADGGSSLGVLVKDAFNLNFTSE 122
Query: 215 KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG 274
K+ + + G ++ G++ +G+ GLG K S+ S L+ GL+ N C
Sbjct: 123 KRQSPLLALGLCGYDQLPGGTY---HPIDGVLGLGRGKPSIVSQLSGLGLVRNVIGHCL- 178
Query: 275 SDGTGRISFGDK------GSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE-FSAIF 327
+GR S TP S H Y+ +++ G F+ F
Sbjct: 179 ---SGRGGGFLFFGDDLYDSSRVAWTPMSPNAKH--YSPGFAELTFDGKTTGFKNLIVAF 233
Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDL 363
DSG S+TYLN +Q+ + SL KRE ST L
Sbjct: 234 DSGASYTYLN----SQVYQGLISLI--KRELSTKPL 263
>gi|409179878|gb|AFV26024.1| aspartic proteinase nepenthesin 1 [Nepenthes mirabilis]
Length = 437
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 87/290 (30%), Positives = 122/290 (42%), Gaps = 46/290 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+ N+S+G PA F +DTGSDL W C C N S+ I++P SS+ S
Sbjct: 95 YLMNLSIGTPAQPFSAIMDTGSDLIW--TQCQPCTQCFNQST------PIFNPQGSSSFS 146
Query: 165 KVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRI 224
+PC+S LC+ + + ++C Y Y DG+ + G + + L S S+ I
Sbjct: 147 TLPCSSQLCQALQSPTCSNNSCQYTYGY-GDGSETQGSMGTETLTFG-----SVSIP-NI 199
Query: 225 SFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGTGRI 281
+FGCG G F G GL G+G S+PS L FS C GS + +
Sbjct: 200 TFGCGENNQG-FGQGNGA-GLVGMGRGPLSLPSQLD-----VTKFSYCMTPIGSSTSSTL 252
Query: 282 SFGD------KGSPGQGETPFSLRQTHPTYNITITQVSVGG------------NAVNFEF 323
G GSP T Q Y IT+ +SVG N+ N
Sbjct: 253 LLGSLANSVTAGSP--NTTLIESSQIPTFYYITLNGLSVGSTPLPIDPSVFKLNSNNGTG 310
Query: 324 SAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRS 373
I DSGT+ TY D AY + + F S +S F+ C+ + S
Sbjct: 311 GIIIDSGTTLTYFADNAYQAVRQAFISQMNLSVVNGSSS-GFDLCFQMPS 359
>gi|242035209|ref|XP_002464999.1| hypothetical protein SORBIDRAFT_01g030210 [Sorghum bicolor]
gi|241918853|gb|EER91997.1| hypothetical protein SORBIDRAFT_01g030210 [Sorghum bicolor]
Length = 107
Score = 78.2 bits (191), Expect = 7e-12, Method: Composition-based stats.
Identities = 42/70 (60%), Positives = 47/70 (67%), Gaps = 3/70 (4%)
Query: 228 CGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIP-NSFSMCFGSDGTGRISFGDK 286
CG TGSFLDG A NGL GLG +K SV +L GL+ +SFSMCF D GRI+FGD
Sbjct: 20 CG--PTGSFLDGGAFNGLMGLGKEKVSVAGMLTASGLVASDSFSMCFSEDVVGRINFGDA 77
Query: 287 GSPGQGETPF 296
G GQGE PF
Sbjct: 78 GIRGQGEMPF 87
>gi|242092898|ref|XP_002436939.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
gi|241915162|gb|EER88306.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
Length = 469
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 105/410 (25%), Positives = 160/410 (39%), Gaps = 68/410 (16%)
Query: 8 SPVCVLLILLSCCAGCCFGFGTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYS-ALAHR 66
SP+ +L++L S C G GF R S + + + + + S A S +L HR
Sbjct: 3 SPLLLLVVLCSYCCYIALGGNEHGFAVVQRRSYDSETVCSASKVNLEPSSATVSMSLVHR 62
Query: 67 D--------------------RYFRLRGRGLAAQGNDKTPLTFSAGND------TYRLNS 100
R R R + +Q + + ++ D T
Sbjct: 63 YGPCAPSQYSNVPTPSISETLRRSRARTNYIMSQASKSMGMGMASTPDDDDAAVTIPTRL 122
Query: 101 LGFL----HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYS 156
GF+ + + G P++ ++ +DTGSD+ W+ C C NS+ ++
Sbjct: 123 GGFVDSLEYVVTLGFGTPSVPQVLLMDTGSDVSWV--QCTPC----NSTKCYPQKDPLFD 176
Query: 157 PNTSSTSSKVPCNSTLCE-----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLA 211
P+ SST + + CN+ C C S G+ C Y V Y +DG+ S G + L LA
Sbjct: 177 PSKSSTYAPIACNTDACRKLGDHYHNGCTSGGTQCGYSVEY-ADGSHSRGVYSNETLTLA 235
Query: 212 TDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSM 271
FGCGR Q G +GL GLG S+ ++ + +FS
Sbjct: 236 PGITVED-----FHFGCGRDQRGP---SDKYDGLLGLGGAPVSL--VVQTSSVYGGAFSY 285
Query: 272 CFGSDGTGRISFGDKGSPGQGE------TPFSLRQTHPT-YNITITQVSVGGNAVNFEFS 324
C + + F GSP G TP + T Y +T+T +SVGG ++ S
Sbjct: 286 CLPALNS-EAGFLVLGSPPSGNKSAFVFTPMRHLPGYATFYMVTMTGISVGGKPLHIPQS 344
Query: 325 A-----IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
A I DSGT T L + AY + K + D F+ CY
Sbjct: 345 AFRGGMIIDSGTVDTELPETAYNALEAALRKALKAYPLVPSDD--FDTCY 392
>gi|357143850|ref|XP_003573078.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 431
Score = 77.8 bits (190), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 90/329 (27%), Positives = 136/329 (41%), Gaps = 45/329 (13%)
Query: 45 ILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGND--TYRLNSLG 102
++ + L S Y AL H D L L + ++ L +G D + RL+S+
Sbjct: 15 LVLLTSLAVSASSGYRLALTHVDSKIGLTKTELMRRAAHRSRLRALSGYDANSPRLHSVQ 74
Query: 103 FLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
+ +++G P + F+ DTGSDL W C C C D +Y P+ SS
Sbjct: 75 VEYLMELAIGTPPVPFVALADTGSDLTWTQCQPCKLCFPQ---------DTPVYDPSASS 125
Query: 162 TSSKVPCNSTLC---ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
T S VPC+S C + C + S C Y Y SDG S G L + L L +
Sbjct: 126 TFSPVPCSSATCLPVLRSRNCSTPSSLCRYGYSY-SDGAYSAGILGTETLTLGSSVPGQA 184
Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC----FG 274
S ++FGCG G L+ G GLG S+LA G+ FS C F
Sbjct: 185 VSVSDVAFGCGTDNGGDSLNS---TGTVGLGRGTL---SLLAQLGV--GKFSYCLTDFFN 236
Query: 275 SDGTGRISFGDKG--SPGQG---ETPFSLRQTHPT-YNITITQVSVGGNAV-----NFEF 323
S G +PG G TP +P+ Y +++ +++G + F+
Sbjct: 237 STLDSPFLLGTLAELAPGPGAVQSTPLLQSPLNPSRYVVSLQGITLGDVRLPIPNKTFDL 296
Query: 324 SA------IFDSGTSFTYLNDPAYTQISE 346
A + DSGT+F+ L + + + +
Sbjct: 297 HANSTGGMVVDSGTTFSILPESGFRVVVD 325
>gi|148906646|gb|ABR16474.1| unknown [Picea sitchensis]
Length = 538
Score = 77.8 bits (190), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 80/286 (27%), Positives = 126/286 (44%), Gaps = 41/286 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++T + VG P + LDTGSD+ W+ C+ C C ++ I++P+ S++
Sbjct: 197 YFTRIGVGTPMREQYMVLDTGSDVVWIQCEPCSKCYSQVDP---------IFNPSLSASF 247
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
S + CNS +C G C Y+V Y DG+ + G ++L T ++
Sbjct: 248 STLGCNSAVCSYLDAYNCHGGGCLYKVSY-GDGSYTIGSFATEMLTFGTTSVRN------ 300
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG---SDGTGR 280
++ GCG G F+ A L GLG S PS L Q +FS C S+ +G
Sbjct: 301 VAIGCGHDNAGLFVGAAG---LLGLGAGLLSFPSQLGTQ--TGRAFSYCLVDRFSESSGT 355
Query: 281 ISFGDKGSP-GQGETPFSLRQTHPT-YNITITQVSVGGNAVN------FEFSA------- 325
+ FG + P G TP + PT Y + + +SVGG ++ F
Sbjct: 356 LEFGPESVPLGSILTPLLTNPSLPTFYYVPLISISVGGALLDSVPPDVFRIDETSGRGGF 415
Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVL 371
I DSGT+ T L P Y + + F + ++ + + F+ CY L
Sbjct: 416 IVDSGTAVTRLQTPVYDAVRDAFVAGTRQLPKAEGVSI-FDTCYDL 460
>gi|195607464|gb|ACG25562.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 478
Score = 77.8 bits (190), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 92/323 (28%), Positives = 136/323 (42%), Gaps = 35/323 (10%)
Query: 65 HRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYT-NVSVGQPALSFIVALD 123
R Y R G A Q D +A +G L+Y S+G P ++ + +D
Sbjct: 99 RRAEYILRRVSGRAPQLWDSKAAAAAATVPASWGYDIGTLNYVVTASLGTPGVAQTMEVD 158
Query: 124 TGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE---LQKQCP 180
TGSDL W+ C S S + D P SS+ + VPC +C +
Sbjct: 159 TGSDLSWVQCKPCSAAPSCYSQKDPLFD-----PAQSSSYAAVPCGGPVCAGLGIYAASA 213
Query: 181 SAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGA 240
+ + C Y V Y DG+ +TG D L L+ + S FGCG Q+G F +G
Sbjct: 214 CSAAQCGYVVSY-GDGSNTTGVYSSDTLTLS-----ASSAVQGFFFGCGHAQSGLF-NGV 266
Query: 241 APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT--GRISFGDKGSPGQGETPFSL 298
+GL GLG ++ S+ + G FS C + + G ++ G G P FS
Sbjct: 267 --DGLLGLGREQPSL--VEQTAGTYGGVFSYCLPTKPSTAGYLTLG-LGGPSGAAPGFST 321
Query: 299 RQTHPT------YNITITQVSVGGNAVNFEFSA-----IFDSGTSFTYLNDPAYTQISET 347
Q P+ Y + +T +SVGG ++ SA + D+GT T L AY +
Sbjct: 322 TQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTVVDTGTVITRLPPTAYAALRSA 381
Query: 348 FNS-LAKEKRETSTSDLPFEYCY 369
F S +A T+ S+ + CY
Sbjct: 382 FRSGMASYGYPTAPSNGILDTCY 404
>gi|255568540|ref|XP_002525244.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223535541|gb|EEF37210.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 460
Score = 77.8 bits (190), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 79/282 (28%), Positives = 122/282 (43%), Gaps = 37/282 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+Y + +G P + + LDTGS L WL C CV H +D ++ P+ S+T
Sbjct: 120 YYLKLGLGSPPKYYTMILDTGSSLSWLQCKPCVVYCH-------SQVD-PLFEPSASNTY 171
Query: 164 SKVPCNSTLCELQKQ-------CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
+ C+S+ C L K C ++G C Y Y D + S G+L D+L L
Sbjct: 172 RPLYCSSSECSLLKAATLNDPLCTASGV-CVYTASY-GDASYSMGYLSRDLLTLTP---- 225
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--- 273
S+++ S ++GCG+ G F A G+ GL DK S+ + L+ + +FS C
Sbjct: 226 SQTLPS-FTYGCGQDNEGLFGKAA---GIVGLARDKLSMLAQLSPK--YGYAFSYCLPTS 279
Query: 274 GSDGTGRISFGDKGSPGQGETPFSLRQTHPTY------NITITQVSVGGNAVNFEFSAIF 327
S G G +S G TP +P+ IT+ VG A ++ I
Sbjct: 280 TSSGGGFLSIGKISPSSYKFTPMIRNSQNPSLYFLRLAAITVAGRPVGVAAAGYQVPTII 339
Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
DSGT T L Y + E F + + E + + + C+
Sbjct: 340 DSGTVVTRLPISIYAALREAFVKIMSRRYEQAPAYSILDTCF 381
>gi|326501496|dbj|BAK02537.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 82/312 (26%), Positives = 119/312 (38%), Gaps = 59/312 (18%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
++ VG PA F++ DTGSDL W+ C + NSS + P S T +
Sbjct: 94 YFVRFRVGTPAQPFLLVADTGSDLTWVKCRRPAA----NSSESGSGSGRAFRPEDSRTWA 149
Query: 165 KVPCNSTLCELQ-----KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
+ C S C CP+ GS C Y RY DG+ + G + + +A + +
Sbjct: 150 PISCASDTCTKSLPFSLATCPTPGSPCAYDYRY-KDGSAARGTVGTESATIALSGRGREE 208
Query: 220 VDSRIS---FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--- 273
+++ GC TG + +G+ LG S S A++ FS C
Sbjct: 209 RKAKLKGLVLGCTSSYTGPSFE--VSDGVLSLGYSDVSFASHAASR--FAGRFSYCLVDH 264
Query: 274 --GSDGTGRISFGDK-----------------------GSPGQGETPFSL-RQTHPTYNI 307
+ T ++FG P +TP L R+ P Y++
Sbjct: 265 LSPRNATSYLTFGPNPAVASSSSPSSPAPASCTAAAPRPRPRARQTPLLLDRRMRPFYDV 324
Query: 308 TITQVSVGGNAVNFEFS---------AIFDSGTSFTYLNDPAYTQISETFNS-LAKEKRE 357
+ VSV G + + I DSGTS T L PAY + + LA R
Sbjct: 325 AVKAVSVAGQFLKIPRAVWDVDAGGGVILDSGTSLTVLAKPAYRAVVAALSEGLAGLPRV 384
Query: 358 TSTSDLPFEYCY 369
T PFEYCY
Sbjct: 385 TMD---PFEYCY 393
>gi|225427552|ref|XP_002266498.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 441
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 80/275 (29%), Positives = 122/275 (44%), Gaps = 39/275 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ N+S+G P + I +DTGSDL W C C C QV+ F + P SST
Sbjct: 92 YIMNLSIGTPPVPVIAIVDTGSDLTWTQCRPCTHCYK-------QVVPF--FDPKNSSTY 142
Query: 164 SKVPCNSTLCEL---QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
C ++ C + C + G C + Y +DG+ + G L + L +A+ + S
Sbjct: 143 RDSSCGTSFCLALGNDRSCRN-GKKCTFMYSY-ADGSFTGGNLAVETLTVASTAGKPVSF 200
Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-----GS 275
+FGC G F + ++ G+ GLG+ + S+ S L + I FS C S
Sbjct: 201 PG-FAFGCVHRSGGIFDEHSS--GIVGLGVAELSMISQL--KSTINGRFSYCLLPVFTDS 255
Query: 276 DGTGRISFGDKG---SPGQGETPFSLRQTHPTYN-ITITQVSVGGNAVNF---------- 321
+ RI+FG G G TP ++ Y IT+ SVG +++
Sbjct: 256 SMSSRINFGRSGIVSGAGTVSTPLVMKGPDTYYYLITLEGFSVGKKRLSYKGFSKKAEVE 315
Query: 322 EFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKR 356
E + I DSGT++TYL Y ++ E+ K KR
Sbjct: 316 EGNIIVDSGTTYTYLPLEFYVKLEESVAHSIKGKR 350
>gi|297805036|ref|XP_002870402.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316238|gb|EFH46661.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 435
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 89/351 (25%), Positives = 139/351 (39%), Gaps = 38/351 (10%)
Query: 50 DLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNV 109
D PK + + R R R + D + + S + + G + N+
Sbjct: 39 DSPKSPFYNPAETPSQRIRNAIHRSFNRVSHFTDLSEMDASLNSPQTDITPCGGEYLMNL 98
Query: 110 SVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPC 168
S+G P + DTGS+L W C C C ++ ++ P SST V C
Sbjct: 99 SLGTPPSPIMAVADTGSNLIWTQCKPCDDCYTQVDP---------LFDPKASSTYKDVSC 149
Query: 169 NSTLC---ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
+S+ C E Q C + C Y V Y +DG+ + G D L L + + + + I
Sbjct: 150 SSSQCTALENQASCSTEDKTCSYLVSY-ADGSYTMGKFAVDTLTLGSTDNRPVQL-KNII 207
Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQG-LIPNSFSMCF--GSDGTGRIS 282
GCG+ +F N G+ S++ G I FS C +D T +I+
Sbjct: 208 IGCGQNNAVTFR-----NKSSGVVGLGGGAVSLIKQLGDSIDGKFSYCLVPENDQTSKIN 262
Query: 283 FGDKG---SPGQGETPFSLRQTHPTYNITITQVSVGGNAV-----NFEFSAIFDSGTSFT 334
FG PG TP ++ Y +T+ +SVG + N + + + DSGT+ T
Sbjct: 263 FGTNAVVSGPGTVSTPLVVKSRDTFYYLTLKSISVGSKNMQTPDSNIKGNMVIDSGTTLT 322
Query: 335 YLNDPAYTQISETFNSLA---KEKRETSTSDLPFEYCYVLRSFLHLQALVV 382
L Y +I SL K K E S L CY + L++ + +
Sbjct: 323 LLPVKYYIEIENAVASLINADKSKDERIGSSL----CYNATADLNIPVITM 369
>gi|356541713|ref|XP_003539318.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 640
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 70/244 (28%), Positives = 108/244 (44%), Gaps = 27/244 (11%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
T + +G P F + +DTGS + ++PC +C H S Q F P S T V
Sbjct: 95 TRLWIGTPPQRFALIVDTGSTVTYVPCS--TCKH---CGSHQDPKFR---PEASETYQPV 146
Query: 167 PCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISF 226
C Q C C Y+ RY ++ + S+G L EDV+ QS+ R F
Sbjct: 147 KCT-----WQCNCDDDRKQCTYERRY-AEMSTSSGVLGEDVVSFGN---QSELSPQRAIF 197
Query: 227 GCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISFG 284
GC +TG + A +G+ GLG S+ L + +I ++FS+C+G G G + G
Sbjct: 198 GCENDETGDIYNQRA-DGIMGLGRGDLSIMDQLVEKKVISDAFSLCYGGMGVGGGAMVLG 256
Query: 285 DKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-------EFSAIFDSGTSFTYLN 337
P S P YNI + ++ V G ++ + + DSGT++ YL
Sbjct: 257 GISPPADMVFTHSDPVRSPYYNIDLKEIHVAGKRLHLNPKVFDGKHGTVLDSGTTYAYLP 316
Query: 338 DPAY 341
+ A+
Sbjct: 317 ESAF 320
>gi|356540371|ref|XP_003538663.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 374
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 85/293 (29%), Positives = 124/293 (42%), Gaps = 36/293 (12%)
Query: 105 HY-TNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
HY VS+G P DTGSDL W C C C N I+ P S++
Sbjct: 24 HYLMEVSIGTPPFKIYGIADTGSDLTWTSCVPCNKCYKQRNP---------IFDPQKSTS 74
Query: 163 SSKVPCNSTLC-ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
+ C+S LC +L S +C Y Y S ++ G L ++ + L++ + +S +
Sbjct: 75 YRNISCDSKLCHKLDTGVCSPQKHCNYTYAYAS-AAITQGVLAQETITLSSTKGESVPLK 133
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSD-- 276
I FGCG TG F D G+ GLG S S + + FS C F +D
Sbjct: 134 G-IVFGCGHNNTGGFNDREM--GIIGLGGGPVSFISQIGSS-FGGKRFSQCLVPFHTDVS 189
Query: 277 GTGRISFGDKGSPGQGE----TPFSLRQTHPTYNITITQVSVGGNAVNFEFSA------- 325
+ ++S G KGS G+ TP +Q Y +T+ +SVG ++F S+
Sbjct: 190 VSSKMSLG-KGSEVSGKGVVSTPLVAKQDKTPYFVTLLGISVGNTYLHFNGSSSQSVEKG 248
Query: 326 --IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRSFLH 376
DSGT T L Y ++ S K T+ DL + CY ++ L
Sbjct: 249 NVFLDSGTPPTILPTQLYDRLVAQVRSEVAMKPVTNDLDLGPQLCYRTKNNLR 301
>gi|357481199|ref|XP_003610885.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512220|gb|AES93843.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 416
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 80/284 (28%), Positives = 120/284 (42%), Gaps = 37/284 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
H + +G P + +DTGSDL W+ C C+ C + ++ P SST
Sbjct: 68 HLMEIYIGTPPIKITGLVDTGSDLIWIQCAPCLGCYKQIKP---------MFDPLKSSTY 118
Query: 164 SKVPCNSTLC-ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
+ + C+S LC +L S C Y Y D +++ G L +D ++ + S+ S
Sbjct: 119 NNISCDSPLCHKLDTGVCSPEKRCNYTYGY-GDNSLTKGVLAQDTATFTSNTGKPVSL-S 176
Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILA--------NQGLIPNSFSMCFG 274
R FGCG TG F D GL GLG TS+ S + +Q L+P +
Sbjct: 177 RFLFGCGHNNTGGFNDHEM--GLIGLGGGPTSLISQIGPLFGGKKFSQCLVPFLTDIKIS 234
Query: 275 SDGTGRISFGDKGSPGQGE----TPFSLRQTHPTYNITITQVSVGG-----NAVNFEFSA 325
S R+SFG KGS G TP R+ +Y +T+ +SV N+ + +
Sbjct: 235 S----RMSFG-KGSQVLGNGVVTTPLVPREKDTSYFVTLLGISVEDTYFPMNSTIGKANM 289
Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
+ DSGT L Y ++ + K T L + CY
Sbjct: 290 LVDSGTPPILLPQQLYDKVFAEVRNKVALKPITDDPSLGTQLCY 333
>gi|414584783|tpg|DAA35354.1| TPA: hypothetical protein ZEAMMB73_186928 [Zea mays]
Length = 464
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 74/296 (25%), Positives = 121/296 (40%), Gaps = 42/296 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ V +G P + +D+GSD+ W+ C C+ C + ++ P TS+T
Sbjct: 127 YFVRVGIGSPPTEQYLVVDSGSDVIWVQCKPCLECYAQADP---------LFDPATSATF 177
Query: 164 SKVPCNSTLCELQKQ--CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
S VPC S +C + C +G C Y+V Y DG+ + G L + L L +
Sbjct: 178 SAVPCGSAVCRTLRTSGCGDSG-GCDYEVSY-GDGSYTKGALALETLTLGGTAVEG---- 231
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRI 281
++ GCG G F+ A GL GLG S+ L +FS C S G G +
Sbjct: 232 --VAIGCGHRNRGLFVGAA---GLLGLGWGPMSLVGQLGGA--AGGAFSYCLASRGAGSL 284
Query: 282 SFGDKGSPGQGE--TPFSLRQTHPT-YNITITQVSVGGNAVNFEF-----------SAIF 327
G + +G P P+ Y + ++ + VG + + +
Sbjct: 285 VLGRSEAVPEGAVWVPLVRNPQAPSFYYVGLSGIGVGDERLPLQEDLFQLTEDGAGGVVM 344
Query: 328 DSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEYCYVLRSFLHLQALVV 382
D+GT+ T L AY + + F ++ R S L + CY L + ++ V
Sbjct: 345 DTGTAVTRLPQEAYAALRDAFVAAVGALPRAPGVSLL--DTCYDLSGYTSVRVPTV 398
>gi|242066176|ref|XP_002454377.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
gi|241934208|gb|EES07353.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
Length = 474
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 86/309 (27%), Positives = 130/309 (42%), Gaps = 49/309 (15%)
Query: 65 HRDRYF--RLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYT-NVSVGQPALSFIVA 121
R Y R+ GRG + K + + N +G L+Y VS+G P ++ +
Sbjct: 98 RRAEYILRRVSGRGTPQLWDSKAEAATATVPANWGFN-IGTLNYVVTVSLGTPGVAQTLE 156
Query: 122 LDTGSDLFWL---PCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE---- 174
+DTGSDL W+ PC +C + ++ P SS+ + VPC +C
Sbjct: 157 VDTGSDLSWVQCTPCAAPACYSQKDP---------LFDPAQSSSYAAVPCGGPVCGGLGI 207
Query: 175 LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTG 234
C +A C Y V Y DG+ +TG D L L+ ++ FGCG Q+G
Sbjct: 208 YASSCSAA--QCGYVVSY-GDGSKTTGVYSSDTLTLSPNDAVRG-----FFFGCGHAQSG 259
Query: 235 SFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS--DGTGRISFGDKGSPGQG 292
+ +GL GLG ++ S+ + G FS C + TG ++ G G G
Sbjct: 260 FTGN----DGLLGLGREEASL--VEQTAGTYGGVFSYCLPTRPSTTGYLTLG--GPSGAA 311
Query: 293 ETPFSLRQ--THPT----YNITITQVSVGGNAVN-----FEFSAIFDSGTSFTYLNDPAY 341
FS Q + P Y + +T +SVGG ++ F + D+GT T L AY
Sbjct: 312 PPGFSTTQLLSSPNAATYYVVMLTGISVGGQQLSVPSSVFAGGTVVDTGTVITRLPPTAY 371
Query: 342 TQISETFNS 350
+ F S
Sbjct: 372 AALRSAFRS 380
>gi|356537728|ref|XP_003537377.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 543
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 86/305 (28%), Positives = 125/305 (40%), Gaps = 58/305 (19%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ ++ VG P + LDTGSDL W+ CD C C S Y P SST
Sbjct: 171 YFLDMFVGTPPKHVWLILDTGSDLSWIQCDPCYDCFEQNGSH---------YYPKDSSTY 221
Query: 164 SKVPCNSTLCELQ------KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLAT----- 212
+ C C+L + C + CPY Y +DG+ +TG + +
Sbjct: 222 RNISCYDPRCQLVSSSDPLQHCKAENQTCPYFYDY-ADGSNTTGDFASETFTVNLTWPNG 280
Query: 213 DEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC 272
EK + VD + FGCG G F GA+ GL GLG S PS + Q + +SFS C
Sbjct: 281 KEKFKQVVD--VMFGCGHWNKG-FFYGAS--GLLGLGRGPISFPSQI--QSIYGHSFSYC 333
Query: 273 F-----GSDGTGRISFG-DKGSPGQGETPF-SLRQTHPT-----YNITITQVSVGGNAVN 320
+ + ++ FG DK F +L T Y + I + VGG ++
Sbjct: 334 LTDLFSNTSVSSKLIFGEDKELLNNHNLNFTTLLAGEETPDETFYYLQIKSIMVGGEVLD 393
Query: 321 -----FEFSA-----------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP 364
+ +S+ I DSG++ T+ D AY I E F K ++ + D
Sbjct: 394 ISEQTWHWSSEGAAADAGGGTIIDSGSTLTFFPDSAYDIIKEAFEKKIK-LQQIAADDFV 452
Query: 365 FEYCY 369
CY
Sbjct: 453 MSPCY 457
>gi|224130878|ref|XP_002320947.1| predicted protein [Populus trichocarpa]
gi|222861720|gb|EEE99262.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 74/294 (25%), Positives = 123/294 (41%), Gaps = 34/294 (11%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ ++S+G P + DTGSDL W C C C ++ ++ P +S T
Sbjct: 95 YLMSLSLGTPPFKIMGIADTGSDLIWTQCKPCERCYKQVDP---------LFDPKSSKTY 145
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
C++ C L Q +G+ C YQ Y D + + G + D + L + S
Sbjct: 146 RDFSCDARQCSLLDQSTCSGNICQYQYSY-GDRSYTMGNVASDTITLDSTTGSPVSFPKT 204
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-----GSDGT 278
+ GCG G+F D + G+ GLG S+ S + + + FS C + +
Sbjct: 205 V-IGCGHENDGTFSDKGS--GIVGLGAGPLSLISQMGSS--VGGKFSYCLVPLSSRAGNS 259
Query: 279 GRISFGDKG---SPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNF--------EFSAI 326
+++FG PG TP +T + Y +T+ +SVG + F E + I
Sbjct: 260 SKLNFGSNAVVSGPGVQSTPLLSSETMSSFYFLTLEAMSVGNERIKFGDSSLGTGEGNII 319
Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRSFLHLQAL 380
DSGT+ T + D ++ +S + + +R S CY S L + A+
Sbjct: 320 IDSGTTLTIVPDDFFSNLSTAVGNQVEGRRAEDPSGF-LSVCYSATSDLKVPAI 372
>gi|302758750|ref|XP_002962798.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
gi|300169659|gb|EFJ36261.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
Length = 427
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 79/300 (26%), Positives = 131/300 (43%), Gaps = 47/300 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
++ + VG PA F + +DTGSDL W+ C+ + NSSS Y ++SS+
Sbjct: 59 YFVELRVGTPAKKFPLIVDTGSDLTWIQCNPPNTT--ANSSSPPA---PWYDKSSSSSYR 113
Query: 165 KVPCNSTLCE-----LQKQCP-SAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
++PC C+ + C ++ S C Y Y SD + +TG L + + + + ++ K
Sbjct: 114 EIPCTDDECQFLPAPIGSSCSITSPSPCDYTYGY-SDQSRTTGILAYETISMKSRKRSGK 172
Query: 219 SVDSR---------ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSF 269
+ ++ GC R G+ GA+ G+ GLG S+ + + L F
Sbjct: 173 RAGNHKTRRIRIKNVALGCSRESVGASFLGAS--GVLGLGQGPISLATQTRHTAL-GGIF 229
Query: 270 SMCF-----GSDGTGRISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEF 323
S C GS+ + + G TP + Y + +T V+V G V+
Sbjct: 230 SYCLVDYLRGSNASSFLVMGRTHWRKLAHTPIVRNPAAQSFYYVNVTGVAVDGKPVDGIA 289
Query: 324 SA------------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP--FEYCY 369
S+ IFDSGT+ +YL +PAY+++ N+ R ++P FE CY
Sbjct: 290 SSDWGIDGDGNKGTIFDSGTTLSYLREPAYSKVLGALNASIYLPR---AQEIPEGFELCY 346
>gi|255576511|ref|XP_002529147.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223531426|gb|EEF33260.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 479
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 89/307 (28%), Positives = 131/307 (42%), Gaps = 50/307 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+++ V +G+P+ + LDTGSD+ W+ C C C H + I+ P +S++
Sbjct: 144 YFSRVGIGKPSSPVYMVLDTGSDVNWIQCAPCADCYHQADP---------IFEPASSTSY 194
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
S + C++ C+ + C Y+V Y DG+ + G V + + L S SVD+
Sbjct: 195 SPLSCDTKQCQSLDVSECRNNTCLYEVSY-GDGSYTVGDFVTETITLG-----SASVDN- 247
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGTGR 280
++ GCG G F+ A L GLG K S PS + +SFS C SD
Sbjct: 248 VAIGCGHNNEGLFIGAAG---LLGLGGGKLSFPSQIN-----ASSFSYCLVDRDSDSAST 299
Query: 281 ISFGDKGSPGQGETP-FSLRQTHPTYNITITQVSVGGNAVN-----FEFSA------IFD 328
+ F P P R+ Y + +T +SVGG ++ FE I D
Sbjct: 300 LEFNSALLPHAITAPLLRNRELDTFYYVGMTGLSVGGELLSIPESMFEMDESGNGGIIID 359
Query: 329 SGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRS---------FLHLQA 379
SGT+ T L AY + + F K+ TS L F+ CY L HL
Sbjct: 360 SGTAVTRLQTAAYNALRDAFVKGTKDLPVTSEVAL-FDTCYDLSRKTSVEVPTVTFHLAG 418
Query: 380 LVVLPFP 386
VLP P
Sbjct: 419 GKVLPLP 425
>gi|356498789|ref|XP_003518231.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 446
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 83/288 (28%), Positives = 124/288 (43%), Gaps = 54/288 (18%)
Query: 108 NVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
N S+G+P + + +DTGS L W+ C H +S S Q + I+ P+ SST S +
Sbjct: 96 NFSIGEPPIPQLAVMDTGSSLTWVMC------HPCSSCSQQSVP--IFDPSKSSTYSNLS 147
Query: 168 CNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFG 227
C+ +C CPY V Y+ G+ S G + L L T ++ V S I FG
Sbjct: 148 CSEC-----NKCDVVNGECPYSVEYVGSGS-SQGIYAREQLTLETIDESIIKVPSLI-FG 200
Query: 228 CGRVQTGSFLDGAAP----NGLFGLGMDKTSVPSILANQGLIP---NSFSMCFGSDGT-- 278
CGR S P NG+FGLG + S L+P FS C G+
Sbjct: 201 CGR--KFSISSNGYPYQGINGVFGLGSGRFS---------LLPSFGKKFSYCIGNLRNTN 249
Query: 279 ---GRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVN-----FEFS------ 324
R+ GDK + QG++ +L + Y + + +S+GG ++ FE S
Sbjct: 250 YKFNRLVLGDKANM-QGDST-TLNVINGLYYVNLEAISIGGRKLDIDPTLFERSITDNNS 307
Query: 325 -AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDL--PFEYCY 369
I DSG T+L + +S +L + + D P+ CY
Sbjct: 308 GVIIDSGADHTWLTKYGFEVLSFEVENLLEGVLVLAQQDKHNPYTLCY 355
>gi|242084330|ref|XP_002442590.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
gi|241943283|gb|EES16428.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
Length = 494
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 95/302 (31%), Positives = 131/302 (43%), Gaps = 55/302 (18%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ ++VG PA+ ++ALDT SDL WL C C C SG V D P S++
Sbjct: 134 YMAKIAVGTPAVQALLALDTASDLTWLQCQPCRRCY----PQSGPVFD-----PRHSTSY 184
Query: 164 SKVPCNSTLCELQKQCPSAGSN---CPYQVRYLSDG----TMSTGFLVEDVLHLATDEKQ 216
++ ++ C+ + + C Y V+Y DG + S G LVE+ L A +Q
Sbjct: 185 GEMNYDAPDCQALGRSGGGDAKRGTCIYTVQY-GDGHGSTSTSVGDLVEETLTFAGGVRQ 243
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--- 273
+ +S GCG G F GA G+ GLG + S+P +A G SFS C
Sbjct: 244 AY-----LSIGCGHDNKGLF--GAPAAGILGLGRGQISIPHQIAFLGY-NASFSYCLVDF 295
Query: 274 ----GSDGTGRISFGDKG---SPGQGETPFSLRQTHPT-YNITITQVSVGGNAV------ 319
GS + ++FG SP TP L Q PT Y + + VSVGG V
Sbjct: 296 ISGPGSP-SSTLTFGAGAVDTSPPASFTPTVLNQNMPTFYYVRLIGVSVGGVRVPGVTER 354
Query: 320 -------NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETST---SDLPFEYCY 369
I DSGT+ T L PAY + F + A + ST S L F+ CY
Sbjct: 355 DLQLDPYTGRGGVILDSGTTVTRLARPAYVAFRDAFRAAATSLGQVSTGGPSGL-FDTCY 413
Query: 370 VL 371
+
Sbjct: 414 TV 415
>gi|357143657|ref|XP_003573000.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Brachypodium distachyon]
Length = 464
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 84/288 (29%), Positives = 129/288 (44%), Gaps = 44/288 (15%)
Query: 98 LNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSP 157
LN+L ++ VS+G PA++ + +DTGSD+ WL C +Y P
Sbjct: 126 LNTLEYV--ITVSIGSPAVAXTMFIDTGSDVSWLRCKS-----------------RLYDP 166
Query: 158 NTSSTSSKVPCNSTLC-ELQKQCP--SAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
TSST + C++ C +L ++ S+GS C Y V+Y DG+ +TG D L LA
Sbjct: 167 GTSSTYAPFSCSAPACAQLGRRGTGCSSGSTCVYSVKY-GDGSNTTGTYGSDTLTLA--- 222
Query: 215 KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTS-VPSILANQGLIPNSFSMCF 273
S+ + S FGC V+ G D +GL GLG D S V A G ++FS C
Sbjct: 223 GTSEPLISGFQFGCSAVEHGFEEDNT--DGLMGLGGDAQSFVSQTAATYG---SAFSYCL 277
Query: 274 GS--DGTGRISFGDKGSPGQGETP----FSLRQTHPTYNITITQVSVGGNAVN-----FE 322
+ +G ++ G S +Q Y + + +SVGG + F
Sbjct: 278 PPTWNSSGFLTLGAPSSSTSAAFSTTPMLRSKQAATFYGLLLRGISVGGKTLEIPSSVFS 337
Query: 323 FSAIFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEYCY 369
+I DSGT T L AY +S F + +A+ + + + + C+
Sbjct: 338 AGSIVDSGTVITRLPPTAYGALSAAFRDGMARYQYQPAAPRGLLDTCF 385
>gi|357482719|ref|XP_003611646.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355512981|gb|AES94604.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 640
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 68/244 (27%), Positives = 108/244 (44%), Gaps = 27/244 (11%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
T + +G P F + +DTGS + ++PC +C H G+ D + P+ S T V
Sbjct: 91 TRLWIGTPPQRFALIVDTGSTVTYVPCS--TCEH-----CGRHQDPK-FQPDLSETYQPV 142
Query: 167 PCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISF 226
C C C + C Y +Y ++ + S+G L EDV+ S+ R F
Sbjct: 143 KCTPD-C----NCDGDTNQCMYDRQY-AEMSSSSGVLGEDVVSFG---NLSELAPQRAVF 193
Query: 227 GCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISFG 284
GC +TG A +G+ GLG S+ L ++ +I +SFS+C+G G G + G
Sbjct: 194 GCENDETGDLYSQRA-DGIMGLGRGDLSIMDQLVDKKVISDSFSLCYGGMDVGGGAMILG 252
Query: 285 DKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-------EFSAIFDSGTSFTYLN 337
P S P YNI + ++ V G + + + DSGT++ YL
Sbjct: 253 GISPPEDMVFTHSDPDRSPYYNINLKEMHVAGKKLQLNPKVFDGKHGTVLDSGTTYAYLP 312
Query: 338 DPAY 341
+ A+
Sbjct: 313 ETAF 316
>gi|325183199|emb|CCA17657.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 873
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 72/264 (27%), Positives = 115/264 (43%), Gaps = 33/264 (12%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
HY + +G P V LDTGS L PCD CV C G D P +T
Sbjct: 46 HYAELYIGIPPQRASVILDTGSGLTAFPCDKCVDC--------GTHTD-----PKFDATK 92
Query: 164 SKVPCNSTLCELQKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLA-TDEKQSKSVD 221
S N C+ ++ C + N C RY S+G+M +++D++ + D +++ +
Sbjct: 93 S-TSINFVQCKYEEGCDTCRDNLCVIHQRY-SEGSMWEAVVMQDLIWVGNVDSDRAEMIM 150
Query: 222 S----RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLI-PNSFSMCFGSD 276
R FGC +TG F+ NG+ GLG+ + ++ + + + + F++CFG
Sbjct: 151 RRYGIRFKFGCQTRETGLFI-TQVENGIMGLGIGRNNIATEMYKAKRVEEHKFALCFGQK 209
Query: 277 GTGRISFGDKGSPGQGETPFSLRQTHPTYN--ITITQVSVGGNAVNFEFS-------AIF 327
G + G S + ++ H T N I + V +GG ++ + AI
Sbjct: 210 GGSFVIGGVDYSHHTTKIAYTPLAKHGTSNYPIEVKDVRIGGISLQVDAEHFKSGRGAIV 269
Query: 328 DSGTSFTYLNDPAYTQISETFNSL 351
DSGT+ TY A T E F +
Sbjct: 270 DSGTTDTYFPSAAATPFQEAFKRI 293
>gi|357168204|ref|XP_003581534.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
2-like [Brachypodium distachyon]
Length = 436
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 75/271 (27%), Positives = 117/271 (43%), Gaps = 33/271 (12%)
Query: 96 YRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNI 154
+ L ++ L+ V +G P+ + +A TGSD+ W+PC C C + ++
Sbjct: 67 FVLEAMPGLYCITVKLGNPSRHYYLAFHTGSDVMWVPCSSCTDC----PTPDDIGFSLDL 122
Query: 155 YSPNTSSTSSKVP-----CNSTLCELQKQC---PSAGSNCPYQVRYLSDGTMSTGFLVED 206
Y P SSTSS++ C L C S+G C Y Y +TG+ V D
Sbjct: 123 YDPKNSSTSSEISCSDDRCADALKTGHAICHTSHSSGDQCGYNQIYADGVLATTGYYVSD 182
Query: 207 VLH--LATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGL 264
+H + + S + + FGC + ++G +G+ G G D S+ S L +QG
Sbjct: 183 DIHFDIFMGNESFASSSASVIFGCSKSRSGHL----QADGVIGFGKDAPSLISQLNSQG- 237
Query: 265 IPNSFSMCF--GSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE 322
+ ++FS C DG G + + G PG T SL + P YN+ + ++V V +
Sbjct: 238 VSHAFSRCLDDSDDGGGVLILDEVGEPGLEFT--SLVASRPCYNLNMKSIAVNNQNVPID 295
Query: 323 FS---------AIFDSGTSFTYLNDPAYTQI 344
S DSGTS Y D Y +
Sbjct: 296 SSLFTTSSTQGTFLDSGTSLAYFPDGVYDPV 326
>gi|302776610|ref|XP_002971459.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
gi|300160591|gb|EFJ27208.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
Length = 357
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 80/289 (27%), Positives = 125/289 (43%), Gaps = 49/289 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ V +G P + +DTGSD+ W+ C C SC ++ ++ P SS+
Sbjct: 14 YFVRVGIGSPTKLQYLVMDTGSDVPWIQCSPCKSCYKQNDA---------VFDPRASSSF 64
Query: 164 SKVPCNSTLCEL--QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
++ C++ C+L K C S + C YQV Y DG+ + G L D + S+
Sbjct: 65 RRLSCSTPQCKLLDVKACASTDNRCLYQVSY-GDGSFTVGDLASDSFSV------SRGRT 117
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-----GSD 276
S + FGCG G F+ A GLG K S PS L+++ FS C G
Sbjct: 118 SPVVFGCGHDNEGLFVGAAGLL---GLGAGKLSFPSQLSSR-----KFSYCLVSRDNGVR 169
Query: 277 GTGRISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGGNAVNFEFSA------- 325
+ + FGD P ++ +P Y ++ +S+GG ++ +A
Sbjct: 170 ASSALLFGDSALPTSASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAFKLSSST 229
Query: 326 -----IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
I DSGTS T L AYT + + F S ++ + L F+ CY
Sbjct: 230 GRGGVIIDSGTSVTRLPTYAYTVMRDAFRSATQKLPRAADFSL-FDTCY 277
>gi|449454652|ref|XP_004145068.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449470630|ref|XP_004153019.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 435
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 74/282 (26%), Positives = 123/282 (43%), Gaps = 37/282 (13%)
Query: 109 VSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
+SVG P I DTGSD+ W C+ C +C D +++P+ S+T KV
Sbjct: 89 LSVGTPPFPIIAVADTGSDIIWTQCEPCTNCYQQ---------DLPMFNPSKSTTYRKVS 139
Query: 168 CNSTLCEL--QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
C+S +C + S +C Y + Y D + S G D L + + + + R +
Sbjct: 140 CSSPVCSFTGEDNSCSFKPDCTYSISY-GDNSHSQGDFAVDTLTMGSTSGRVVAF-PRTA 197
Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSD--GTGR 280
GCG GSF A +G+ GLG+ S+ + + + FS C G+D G+ +
Sbjct: 198 IGCGHDNAGSF--DANVSGIVGLGLGPASLIKQMGSA--VGGKFSYCLTPIGNDDGGSNK 253
Query: 281 ISFGDKGS---PGQGETPFSLRQTHPT-YNITITQVSVGGNAVNF---------EFSAIF 327
++FG + G TP + + Y++ + VSVG N + + + I
Sbjct: 254 LNFGSNANVSGSGAVSTPIYISDKFKSFYSLKLKAVSVGRNNTFYSTANSILGGKANIII 313
Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
DSGT+ T L Y ++ ++ +R + EYC+
Sbjct: 314 DSGTTLTLLPVDLYHNFAKAISNSINLQRTDDPNQF-LEYCF 354
>gi|357143854|ref|XP_003573079.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 417
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 93/332 (28%), Positives = 138/332 (41%), Gaps = 47/332 (14%)
Query: 58 AYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGND--TYRLNSLGFLHYTNVSVGQPA 115
Y AL H D L + ++ L +G D + RL+S+ + +++G P
Sbjct: 17 GYRLALTHVDSKIGFTKTELMRRAAHRSRLQALSGYDANSPRLHSVQVEYLMELAIGTPP 76
Query: 116 LSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC- 173
+ F+ DTGSDL W C C C D +Y P+ SST S VPC+S C
Sbjct: 77 VPFVALADTGSDLTWTQCQPCKLCFPQ---------DTPVYDPSASSTFSPVPCSSATCL 127
Query: 174 --ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD-EKQSKSVDSRISFGCGR 230
+ C + S C Y Y SDG S G L + L + + Q+ SV S ++FGCG
Sbjct: 128 PTWRSRNCSNPSSPCRYIYSY-SDGAYSVGILGTETLTIGSSVPGQTVSVGS-VAFGCGT 185
Query: 231 VQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC----FGSDGTGRISFGDK 286
G L+ G GLG S+LA G+ FS C F S G
Sbjct: 186 DNGGDSLNS---TGTVGLGRGTL---SLLAQLGV--GKFSYCLTDFFNSTMDSPFFLGTL 237
Query: 287 G--SPGQG---ETPFSLRQTHPT-YNITITQVSVGGNAV-----NFEFSA------IFDS 329
+PG G TP +P+ Y + + +S+G + F+ A + DS
Sbjct: 238 AELAPGPGTVQSTPLLQSPLNPSRYFVNLQGISLGDVRLPIPNGTFDLRADGNGGMMVDS 297
Query: 330 GTSFTYLNDPAYTQISETFNSLAKEKRETSTS 361
GT+FT L + ++ + L + ++S
Sbjct: 298 GTTFTILAKSGFREVVDRVAQLLGQPPVNASS 329
>gi|242044888|ref|XP_002460315.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
gi|241923692|gb|EER96836.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
Length = 456
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 94/363 (25%), Positives = 137/363 (37%), Gaps = 81/363 (22%)
Query: 71 RLRGRGLAAQGNDKTPLTFSAGNDTYR--------LNSLGFLHYT-NVSVGQPALSFIVA 121
+ R L+A N FS ND R + G L Y ++++G P
Sbjct: 59 KARAAALSAVRNRAASARFSGKNDDQRTTPPTGVSVRPSGDLEYVVDLAIGTPPQPVSAL 118
Query: 122 LDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE--LQKQ 178
LDTGSDL W C C SC+ + +++P S++ + C LC L
Sbjct: 119 LDTGSDLIWTQCAPCASCLAQPDP---------LFAPGESASYEPMRCAGQLCSDILHHG 169
Query: 179 CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLD 238
C C Y+ Y DGTM+ G + T + + + FGCG + GS +
Sbjct: 170 C-EMPDTCTYRYNY-GDGTMTMGVYATERFTF-TSSGGDRLMTVPLGFGCGSMNVGSLNN 226
Query: 239 GAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRIS-----------FGDKG 287
G +G+ G G + S+ S L+ + FS C S G+GR S +GD
Sbjct: 227 G---SGIVGFGRNPLSLVSQLSIR-----RFSYCLTSYGSGRKSTLLFGSLSGGVYGDAT 278
Query: 288 SPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA-----------IFDSGTSFTY 335
P Q TP +PT Y + + ++VG + SA I DSGT+ T
Sbjct: 279 GPVQ-TTPLLQSLQNPTFYYVHLAGLTVGARRLRIPESAFALRPDGSGGVIVDSGTALTL 337
Query: 336 LNDPAYTQISETFNSL--------------------AKEKRETSTSDLPFEYCYVLRSFL 375
L ++ F A +R +STS +P V R
Sbjct: 338 LPGAVLAEVVRAFRQQLRLPFANGGNPEDGVCFLVPAAWRRSSSTSQVP-----VPRMVF 392
Query: 376 HLQ 378
H Q
Sbjct: 393 HFQ 395
>gi|168040957|ref|XP_001772959.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675692|gb|EDQ62184.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 351
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 80/270 (29%), Positives = 117/270 (43%), Gaps = 46/270 (17%)
Query: 109 VSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
+S+G P F +DTGSDL W+ C C C + ++ P SS+ S
Sbjct: 12 ISLGTPPQQFSAIVDTGSDLCWVQCAPCARCFEQPDP---------LFIPLASSSYSNAS 62
Query: 168 CNSTLCE-LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISF 226
C +LC+ L + S + C Y Y DG+ + G + + L + S +RI F
Sbjct: 63 CTDSLCDALPRPTCSMRNTCTYSYSY-GDGSNTRGDFAFETVTL------NGSTLARIGF 115
Query: 227 GCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT----GRIS 282
GCG Q G+F A +GL GLG S+PS L + + FS C T I+
Sbjct: 116 GCGHNQEGTF---AGADGLIGLGQGPLSLPSQLNSS--FTHIFSYCLVDQSTTGTFSPIT 170
Query: 283 FGDKGSPGQGE-TPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA-----------IFDS 329
FG+ + TP + +P+ Y + + +SVG V SA I DS
Sbjct: 171 FGNAAENSRASFTPLLQNEDNPSYYYVGVESISVGNRRVPTPPSAFRIDANGVGGVILDS 230
Query: 330 GTSFTYLNDPAYTQISETFNSLAKEKRETS 359
GT+ TY A+ I LA+ +R+ S
Sbjct: 231 GTTITYWRLAAFIPI------LAELRRQIS 254
>gi|317106730|dbj|BAJ53226.1| JHL06P13.6 [Jatropha curcas]
Length = 445
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 83/333 (24%), Positives = 136/333 (40%), Gaps = 49/333 (14%)
Query: 79 AQGNDKTPLTFSAGND-TYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CV 136
++ N TP + SA Y + G ++ +S+G P + +V DTGSDL W+ C C
Sbjct: 67 SRANRFTPNSVSAAKTLEYDIIPGGGEYFMRISIGTPPIEVLVIADTGSDLIWVQCQPCQ 126
Query: 137 SCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCEL----QKQCPSAG--SNCPYQV 190
C + I++P SST +V C + C + C + G C Y
Sbjct: 127 ECYKQKSP---------IFNPKQSSTYRRVLCETRYCNALNSDMRACSAHGFFKACGYSY 177
Query: 191 RYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGM 250
Y D + + G+L + + + + ++FGCG G+F + + G+
Sbjct: 178 SY-GDHSFTMGYLATERFIIGSTNNSIQ----ELAFGCGNSNGGNFDEVGS-----GIVG 227
Query: 251 DKTSVPSILANQGL-IPNSFSMCF------GSDGTGRISFGDK----GSPGQGETPFSLR 299
S+++ G I N FS C + G+I FGD GS TP +
Sbjct: 228 LGGGSLSLISQLGTKIDNKFSYCLVPILEKSNFSLGKIVFGDNSFISGSDTYVSTPLVSK 287
Query: 300 QTHPTYNITITQVSVGGNAVNFEFS----------AIFDSGTSFTYLNDPAYTQISETFN 349
+ Y +T+ +SVG + +E S I DSGT+ T+L+ Y ++ E
Sbjct: 288 EPETFYYLTLEAISVGNERLAYENSRNDGNVEKGNIIIDSGTTLTFLDSKLYNKL-ELVL 346
Query: 350 SLAKEKRETSTSDLPFEYCYVLRSFLHLQALVV 382
A E S + F C+ + + L + V
Sbjct: 347 EKAVEGERVSDPNGIFSICFRDKIGIELPIITV 379
>gi|302765224|ref|XP_002966033.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
gi|300166847|gb|EFJ33453.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
Length = 357
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 80/289 (27%), Positives = 125/289 (43%), Gaps = 49/289 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ V +G P + +DTGSD+ W+ C C SC ++ ++ P SS+
Sbjct: 14 YFVRVGIGSPTKLQYLVMDTGSDVPWIQCSPCKSCYKQNDA---------VFDPRASSSF 64
Query: 164 SKVPCNSTLCEL--QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
++ C++ C+L K C S + C YQV Y DG+ + G L D + S+
Sbjct: 65 RRLSCSTPQCKLLDVKACASTDNRCLYQVSY-GDGSFTVGDLASDSFLV------SRGRT 117
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-----GSD 276
S + FGCG G F+ A GLG K S PS L+++ FS C G
Sbjct: 118 SPVVFGCGHDNEGLFVGAAGLL---GLGAGKLSFPSQLSSR-----KFSYCLVSRDNGVR 169
Query: 277 GTGRISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGGNAVNFEFSA------- 325
+ + FGD P ++ +P Y ++ +S+GG ++ +A
Sbjct: 170 ASSALLFGDSALPTSASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAFKLSSST 229
Query: 326 -----IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
I DSGTS T L AYT + + F S ++ + L F+ CY
Sbjct: 230 GRGGVIIDSGTSVTRLPTYAYTVMRDAFRSATQKLPRAADFSL-FDTCY 277
>gi|225438315|ref|XP_002272802.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 436
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 93/339 (27%), Positives = 135/339 (39%), Gaps = 57/339 (16%)
Query: 55 GSFAYYSALAHRDRYFRLRGRGLAAQG---NDKTPLTFSAGNDTYRLNSLGFLHYTNVSV 111
G++ + L + +LR + L+A+ AGN + + +++
Sbjct: 53 GNYTKFERLQRAMKRGKLRLQRLSAKTASFESSVEAPVHAGNGEFLMK---------LAI 103
Query: 112 GQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNS 170
G PA ++ +DTGSDL W C C C I+ P SS+ SK+PC+S
Sbjct: 104 GTPAETYSAIMDTGSDLIWTQCKPCKDCFDQPTP---------IFDPKKSSSFSKLPCSS 154
Query: 171 TLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGR 230
LC S C Y Y D + + G L + SV S+I FGCG
Sbjct: 155 DLCA-ALPISSCSDGCEYLYSY-GDYSSTQGVLATETFAFG-----DASV-SKIGFGCGE 206
Query: 231 VQTGS-FLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GSDGTGRISFGD 285
GS F GA GL GLG S+ S L FS C S G + G
Sbjct: 207 DNDGSGFSQGA---GLVGLGRGPLSLISQLGEP-----KFSYCLTSMDDSKGISSLLVGS 258
Query: 286 KGSPGQG-ETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA-----------IFDSGTS 332
+ + TP + P+ Y +++ +SVG + E S I DSGT+
Sbjct: 259 EATMKNAITTPLIQNPSQPSFYYLSLEGISVGDTLLPIEKSTFSIQNDGSGGLIIDSGTT 318
Query: 333 FTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVL 371
TYL D A+ + + F S K + S S + C+ L
Sbjct: 319 ITYLEDSAFAALKKEFISQLKLDVDESGS-TGLDLCFTL 356
>gi|115458644|ref|NP_001052922.1| Os04g0448300 [Oryza sativa Japonica Group]
gi|113564493|dbj|BAF14836.1| Os04g0448300 [Oryza sativa Japonica Group]
gi|215766465|dbj|BAG98773.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767943|dbj|BAH00172.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 454
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 85/268 (31%), Positives = 115/268 (42%), Gaps = 52/268 (19%)
Query: 108 NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
+VS+G PAL++ +DTGSDL W C CV C ++ P++SST + V
Sbjct: 108 DVSIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTP---------VFDPSSSSTYATV 158
Query: 167 PCNSTLCE--LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRI 224
PC+S C +C SA S C Y Y D + + G L + LA KS +
Sbjct: 159 PCSSASCSDLPTSKCTSA-SKCGYTYTY-GDSSSTQGVLATETFTLA------KSKLPGV 210
Query: 225 SFGCGRVQTGS-FLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGTGR-- 280
FGCG G F GA GL GLG S+ S L GL + FS C S D T
Sbjct: 211 VFGCGDTNEGDGFSQGA---GLVGLGRGPLSLVSQL---GL--DKFSYCLTSLDDTNNSP 262
Query: 281 --------ISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA------ 325
IS + TP + P+ Y +++ ++VG ++ SA
Sbjct: 263 LLLGSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDD 322
Query: 326 -----IFDSGTSFTYLNDPAYTQISETF 348
I DSGTS TYL Y + + F
Sbjct: 323 GTGGVIVDSGTSITYLEVQGYRALKKAF 350
>gi|297829808|ref|XP_002882786.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
lyrata]
gi|297328626|gb|EFH59045.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
lyrata]
Length = 449
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 77/311 (24%), Positives = 122/311 (39%), Gaps = 41/311 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
++T V VG PA F V +DTGS+L W+ C G+V + ++ S +
Sbjct: 88 YFTEVRVGTPAKKFRVVVDTGSELTWVNC------RYRGRGKGKVKNRRVFRAEESKSFK 141
Query: 165 KVPCNSTLCELQ-------KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
V C + C++ CP+ + C Y RY +DG+ + G ++ + + +
Sbjct: 142 TVGCFTQTCKVDLMNLFSLSTCPTPSTPCSYDYRY-ADGSAAQGVFAKETITVGLTNGRK 200
Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---- 273
+ + GC +G GA +G+ GL S S + L S C
Sbjct: 201 ARLRGLL-VGCSSSFSGQSFQGA--DGVLGLAFSDFSFTSTATS--LFGAKLSYCLVDHL 255
Query: 274 -GSDGTGRISFG-------DKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA 325
+ + + FG K +PG+ TP L P Y I I +S+G + ++
Sbjct: 256 SNKNISNYLIFGYSSSSTSTKTAPGR-TTPLDLTLIPPFYAINIIGISIGDDMLDIPTQV 314
Query: 326 ---------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRSFLH 376
I DSGTS T L + AY + E + +P EYC+ S +
Sbjct: 315 WDATTGGGTILDSGTSLTLLAEAAYKPVVTGLARYLVELKRVKPEGIPIEYCFSSTSGFN 374
Query: 377 LQALVVLPFPL 387
L L F L
Sbjct: 375 ESKLPQLTFHL 385
>gi|242066142|ref|XP_002454360.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
gi|241934191|gb|EES07336.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
Length = 466
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 86/269 (31%), Positives = 120/269 (44%), Gaps = 36/269 (13%)
Query: 99 NSLGFLHY-TNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSP 157
SL L Y V +G P S + +DTGSD+ W+ C S H ++ P
Sbjct: 126 TSLDTLEYLITVRLGSPGKSQTMLIDTGSDVSWVQCKPCSQCHSQADP--------LFDP 177
Query: 158 NTSSTSSKVPCNSTLCELQKQ----CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
++SST S C+S C Q C S S C Y V Y DG+ +TG D L L ++
Sbjct: 178 SSSSTYSPFSCSSAACAQLGQEGNGCSS--SQCQYTVTY-GDGSSTTGTYSSDTLALGSN 234
Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
+ + FGC V++G F D +GL GLG S+ S A G +FS C
Sbjct: 235 AVR------KFQFGCSNVESG-FND--QTDGLMGLGGGAQSLVSQTA--GTFGAAFSYCL 283
Query: 274 --GSDGTGRISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVN-----FEFSA 325
S +G ++ G G+ G +TP PT Y + I + VGG ++ F
Sbjct: 284 PATSSSSGFLTLG-AGTSGFVKTPMLRSSQVPTFYGVRIQAIRVGGRQLSIPTSVFSAGT 342
Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKE 354
I DSGT T L AY+ +S F + K+
Sbjct: 343 IMDSGTVLTRLPPTAYSALSSAFKAGMKQ 371
>gi|255541796|ref|XP_002511962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223549142|gb|EEF50631.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 495
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 80/283 (28%), Positives = 124/283 (43%), Gaps = 40/283 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++T V VG PA S+ + LDTGSD+ W+ C C C + I++P SS+
Sbjct: 159 YFTRVGVGNPAKSYYMVLDTGSDINWIQCQPCSDCYQQSDP---------IFTPAASSSY 209
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
S + C+S C + C YQV Y DG+ + G V + + S +V+S
Sbjct: 210 SPLTCDSQQCNSLQMSSCRNGQCRYQVNY-GDGSFTFGDFVTETMSFG----GSGTVNS- 263
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISF 283
I+ GCG G F+ A + P L +Q L SFS C + + S
Sbjct: 264 IALGCGHDNEGLFVGAAG-------LLGLGGGPLSLTSQ-LKATSFSYCLVNRDSAASST 315
Query: 284 GDKGSPGQGETPFS--LRQTHPT--YNITITQVSVGGNAVNF-----------EFSAIFD 328
D S G++ + L+ + Y + ++ +SVGG + + I D
Sbjct: 316 LDFNSAPVGDSVIAPLLKSSKIDTFYYVGLSGMSVGGELLRIPQEVFKLDDSGDGGVIVD 375
Query: 329 SGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVL 371
GT+ T L AY + ++F S+++ R TS L F+ CY L
Sbjct: 376 CGTAITRLQSEAYNSLRDSFVSMSRHLRSTSGVAL-FDTCYDL 417
>gi|125548488|gb|EAY94310.1| hypothetical protein OsI_16079 [Oryza sativa Indica Group]
Length = 423
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 85/268 (31%), Positives = 115/268 (42%), Gaps = 52/268 (19%)
Query: 108 NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
+VS+G PAL++ +DTGSDL W C CV C ++ P++SST + V
Sbjct: 77 DVSIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTP---------VFDPSSSSTYATV 127
Query: 167 PCNSTLCE--LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRI 224
PC+S C +C SA S C Y Y D + + G L + LA KS +
Sbjct: 128 PCSSASCSDLPTSKCTSA-SKCGYTYTY-GDSSSTQGVLATETFTLA------KSKLPGV 179
Query: 225 SFGCGRVQTGS-FLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGTGR-- 280
FGCG G F GA GL GLG S+ S L GL + FS C S D T
Sbjct: 180 VFGCGDTNEGDGFSQGA---GLVGLGRGPLSLVSQL---GL--DKFSYCLTSLDDTNNSP 231
Query: 281 --------ISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA------ 325
IS + TP + P+ Y +++ ++VG ++ SA
Sbjct: 232 LLLGSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDD 291
Query: 326 -----IFDSGTSFTYLNDPAYTQISETF 348
I DSGTS TYL Y + + F
Sbjct: 292 GTGGVIVDSGTSITYLEVQGYRALKKAF 319
>gi|297805038|ref|XP_002870403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316239|gb|EFH46662.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 440
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 78/299 (26%), Positives = 123/299 (41%), Gaps = 38/299 (12%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ N+S+G P + DTGSDL W C C C ++ ++ P SST
Sbjct: 94 YLMNISLGTPPFPIMAIADTGSDLLWTQCKPCDDCYTQVDP---------LFDPKASSTY 144
Query: 164 SKVPCNSTLC---ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
V C+S+ C E Q C + + C Y Y D + + G + D L L + + + +
Sbjct: 145 KDVSCSSSQCTALENQASCSTEDNTCSYSTSY-GDRSYTKGNIAVDTLTLGSTDTRPVQL 203
Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-----GS 275
I GCG G+F G +G+ +V I I FS C +
Sbjct: 204 -KNIIIGCGHNNAGTF----NKKGSGIVGLGGGAVSLITQLGDSIDGKFSYCLVPLTSEN 258
Query: 276 DGTGRISFGDKG---SPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF--------EFS 324
D T +I+FG G TP + Y +T+ +SVG V + E +
Sbjct: 259 DRTSKINFGTNAVVSGTGVVSTPLIAKSQETFYYLTLKSISVGSKEVQYPGSDSGSGEGN 318
Query: 325 AIFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEYCYVLRSFLHLQALVV 382
I DSGT+ T L Y+++ + +S+ EK++ + L CY L + A+ +
Sbjct: 319 IIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQTGLSL--CYSATGDLKVPAITM 375
>gi|413953788|gb|AFW86437.1| hypothetical protein ZEAMMB73_618532 [Zea mays]
Length = 469
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 83/270 (30%), Positives = 107/270 (39%), Gaps = 39/270 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+ V +G PA+ + LDTGS L W+ C C NSS ++ PNTSS+ S
Sbjct: 129 YVATVGLGTPAVPQTLILDTGSSLTWV--QCKPC----NSSQCYPQRLPLFDPNTSSSYS 182
Query: 165 KVPCNSTLCELQKQ------CPSAGS-NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
VPC+S C C S G C Y++ Y S G G D L L
Sbjct: 183 PVPCDSQECRALAAGIDGDGCTSDGDWGCAYEIHYGS-GATPAGEYSTDALTLG-----P 236
Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNS---FSMCFG 274
++ R FGCG Q D A +G+ GLG +P LA Q FS C
Sbjct: 237 GAIVKRFHFGCGHHQQRGKFDMA--DGVLGLG----RLPQSLAWQASARRGGGVFSHCLP 290
Query: 275 SDGTGRISFGDKGSPGQGE----TPFSLRQTHPT-YNITITQVSVGGNAVN-----FEFS 324
G F G+P TP P Y + T +SV G ++ F
Sbjct: 291 PTGV-STGFLALGAPHDTSAFVFTPLLTMDDQPWFYQLMPTAISVAGQLLDIPPAVFREG 349
Query: 325 AIFDSGTSFTYLNDPAYTQISETFNSLAKE 354
I DSGT + L + AYT + F S E
Sbjct: 350 VITDSGTVLSALQETAYTALRTAFRSAMAE 379
>gi|302758122|ref|XP_002962484.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
gi|300169345|gb|EFJ35947.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
Length = 395
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 79/300 (26%), Positives = 129/300 (43%), Gaps = 47/300 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
++ + VG PA F + +DTGSDL W+ C+ + NSSS Y ++SS+
Sbjct: 27 YFVELRVGTPAKKFPLIIDTGSDLTWIQCNPPNTT--ANSSSPPA---PWYDKSSSSSYR 81
Query: 165 KVPCNSTLC-----ELQKQCP-SAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
++PC C + C + S C Y Y SD + +TG L + + + + ++ K
Sbjct: 82 EIPCTDDECLFLPAPIGSSCSIKSPSPCDYTYGY-SDQSRTTGILAYETISMKSRKRSGK 140
Query: 219 SVDSR---------ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSF 269
+ ++ GC R G+ GA+ G+ GLG S+ + + L F
Sbjct: 141 RAGNHKTRTIRIKNVALGCSRESVGASFLGAS--GVLGLGQGPISLATQTRHTAL-GGIF 197
Query: 270 SMCF-----GSDGTGRISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEF 323
S C GS+ + + G TP + Y + +T V+V G V+
Sbjct: 198 SYCLVDYLRGSNASSFLVMGRTRWRKLAHTPIVRNPAAQSFYYVNVTGVAVDGKPVDGIA 257
Query: 324 SA------------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP--FEYCY 369
S+ IFDSGT+ +YL +PAY+++ N+ R ++P FE CY
Sbjct: 258 SSDWGIDGDGNKGTIFDSGTTLSYLREPAYSKVLGALNASIYLPR---AQEIPEGFELCY 314
>gi|297826117|ref|XP_002880941.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297326780|gb|EFH57200.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 397
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 69/248 (27%), Positives = 111/248 (44%), Gaps = 42/248 (16%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
++ + +G P + +DTGSDL W C C +C I+ P+ SST
Sbjct: 60 IYLMRLQLGTPPFEIVAEIDTGSDLIWTQCMPCPNCYTQFAP---------IFDPSKSST 110
Query: 163 SSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
+ C+ G++CPY++ Y +D + STG L + + + + + V +
Sbjct: 111 FKEKRCH-------------GNSCPYEIIY-ADESYSTGILATETVTIQSTSGE-PFVMA 155
Query: 223 RISFGCGRVQTGSFLDG--AAPNGLFGLGMDKTSVPSILANQGL-IPNSFSMCFGSDGTG 279
S GCG + G A+ +G+ GL M + S+++ L IP S CF S GT
Sbjct: 156 ETSIGCGLNNSNLMTPGYAASSSGIVGLNMGPS---SLISQMDLPIPGLISYCFSSQGTS 212
Query: 280 RISFGDK---GSPGQGETPFSLRQTHPTYNITITQVSVG-------GNAVNFEFSAIF-D 328
+I+FG G +++ P Y + + VSVG G + + IF D
Sbjct: 213 KINFGTNAVVAGDGTVAADMFIKKDQPFYYLNLDAVSVGDKRIETLGTPFHAQDGNIFID 272
Query: 329 SGTSFTYL 336
SGT++TYL
Sbjct: 273 SGTTYTYL 280
>gi|356532672|ref|XP_003534895.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 449
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 82/283 (28%), Positives = 125/283 (44%), Gaps = 49/283 (17%)
Query: 108 NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSS-- 164
N+S+GQP + +V +DTGSD+ W+ C C +C + L ++ P+ SST S
Sbjct: 104 NISIGQPPIPQLVVMDTGSDILWVMCTPCTNCDNHLGL---------LFDPSMSSTFSPL 154
Query: 165 -KVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
K PC+ C S P+ V Y + T S F + V+ TDE S+ D
Sbjct: 155 CKTPCDFKGC-------SRCDPIPFTVTYADNSTASGMFGRDTVVFETTDEGTSRIPD-- 205
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISF 283
+ FGCG G D NG+ GL + P LA + I FS C G ++
Sbjct: 206 VLFGCGH-NIGQDTD-PGHNGILGL----NNGPDSLATK--IGQKFSYCIGDLADPYYNY 257
Query: 284 GD----KGSPGQG-ETPFSLRQTHPTYNITITQVSVGGNAVN-----FEFS------AIF 327
+G+ +G TPF + Y +T+ +SVG ++ FE I
Sbjct: 258 HQLILGEGADLEGYSTPFEVHNGF--YYVTMEGISVGEKRLDIAPETFEMKKNRTGGVII 315
Query: 328 DSGTSFTYLNDPAYTQIS-ETFNSLAKEKRETSTSDLPFEYCY 369
D+G++ T+L D + +S E N L R+T+ P+ C+
Sbjct: 316 DTGSTITFLVDSVHRLLSKEVRNLLGWSFRQTTIEKSPWMQCF 358
>gi|38344829|emb|CAD40873.2| OSJNBa0064H22.10 [Oryza sativa Japonica Group]
gi|116310063|emb|CAH67084.1| H0818E04.1 [Oryza sativa Indica Group]
gi|116310186|emb|CAH67198.1| OSIGBa0152K17.10 [Oryza sativa Indica Group]
Length = 444
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 85/268 (31%), Positives = 115/268 (42%), Gaps = 52/268 (19%)
Query: 108 NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
+VS+G PAL++ +DTGSDL W C CV C ++ P++SST + V
Sbjct: 98 DVSIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTP---------VFDPSSSSTYATV 148
Query: 167 PCNSTLCE--LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRI 224
PC+S C +C SA S C Y Y D + + G L + LA KS +
Sbjct: 149 PCSSASCSDLPTSKCTSA-SKCGYTYTY-GDSSSTQGVLATETFTLA------KSKLPGV 200
Query: 225 SFGCGRVQTGS-FLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGTGR-- 280
FGCG G F GA GL GLG S+ S L GL + FS C S D T
Sbjct: 201 VFGCGDTNEGDGFSQGA---GLVGLGRGPLSLVSQL---GL--DKFSYCLTSLDDTNNSP 252
Query: 281 --------ISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA------ 325
IS + TP + P+ Y +++ ++VG ++ SA
Sbjct: 253 LLLGSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDD 312
Query: 326 -----IFDSGTSFTYLNDPAYTQISETF 348
I DSGTS TYL Y + + F
Sbjct: 313 GTGGVIVDSGTSITYLEVQGYRALKKAF 340
>gi|168000296|ref|XP_001752852.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696015|gb|EDQ82356.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 384
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 84/288 (29%), Positives = 120/288 (41%), Gaps = 50/288 (17%)
Query: 91 AGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQV 149
AGN Y + +++G P SF V +DTGSDL W+ C C C G
Sbjct: 34 AGNGEYLMT---------LTLGSPPQSFDVIVDTGSDLNWVQCLPCRVCYQ----QPGPK 80
Query: 150 IDFNIYSPNTSSTSSKVPCNSTLCELQ----KQCPSAGSNCPYQVRYLSDGTMSTGFLVE 205
D P+ S + K C LC + K C A + C YQ Y D + + G L
Sbjct: 81 FD-----PSKSRSFRKAACTDNLCNVSALPLKAC--AANVCQYQYTY-GDQSNTNGDLAF 132
Query: 206 DVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLI 265
+ + L + ++SV + +FGCG G+F A GL GLG S+ S L++
Sbjct: 133 ETISL-NNGAGTQSVPN-FAFGCGTQNLGTFAGAA---GLVGLGQGPLSLNSQLSHT--F 185
Query: 266 PNSFSMC---FGSDGTGRISFGDKGSPGQGE-TPFSLRQTHPT-YNITITQVSVGGNAVN 320
N FS C S ++FG + + T + HPT Y + + + VGG +N
Sbjct: 186 ANKFSYCLVSLNSLSASPLTFGSIAAAANIQYTSIVVNARHPTYYYVQLNSIEVGGQPLN 245
Query: 321 FEFSA------------IFDSGTSFTYLNDPAYTQISETFNSLAKEKR 356
S I DSGT+ T L PAY+ + + S R
Sbjct: 246 LAPSVFAIDQSTGRGGTIIDSGTTITMLTLPAYSAVLRAYESFVNYPR 293
>gi|326516344|dbj|BAJ92327.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 464
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 86/319 (26%), Positives = 128/319 (40%), Gaps = 54/319 (16%)
Query: 84 KTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLN 143
+ P+T A RL +L ++ + G+ V +DT S+L W+ C+ H
Sbjct: 99 QVPVTSGA-----RLRTLNYVATVGIGGGEAT----VIVDTASELTWVQCEPCDACHDQQ 149
Query: 144 SSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQK--------QCPSAGSNCPYQVRYLSD 195
++ P++S + + VPCNS+ C+ + C + C Y + Y D
Sbjct: 150 EP--------LFDPSSSPSYAAVPCNSSSCDALRVATGMSGQACDDQPAACSYTLSY-RD 200
Query: 196 GTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSV 255
G+ S G L D L LA ++ Q FGCG G F +GL GLG + S+
Sbjct: 201 GSYSRGVLAHDRLSLAGEDIQG------FVFGCGTSNQGPF---GGTSGLMGLGRSQLSL 251
Query: 256 PSILANQGLIPNSFSMCF---GSDGTGRISFGDKGSPGQGETPF------SLRQTHPTYN 306
S +Q FS C S +G + GD S + TP S P Y
Sbjct: 252 ISQTMDQ--FGGVFSYCLPPKESGSSGSLVLGDDASVYRNSTPIVYTAMVSDPLQGPFYL 309
Query: 307 ITITQVSVGGNAVNFE-FS------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETS 359
+T ++VGG V FS AI DSGT T L Y + F S E + +
Sbjct: 310 ANLTGITVGGEDVQSPGFSAGGGGKAIVDSGTIITSLVPSVYAAVRAEFVSQLAEYPQAA 369
Query: 360 TSDLPFEYCYVLRSFLHLQ 378
+ + C+ L +Q
Sbjct: 370 PFSI-LDTCFDLTGLREVQ 387
>gi|356509401|ref|XP_003523438.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 407
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 79/286 (27%), Positives = 122/286 (42%), Gaps = 37/286 (12%)
Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPN 158
LG+ + N+++G P ++ + +DTGSDL W+ CD C C + Y P+
Sbjct: 45 LGY-YSVNLAIGNPPKAYELDIDTGSDLTWVQCDAPCKGCTLPRDRQ---------YKPH 94
Query: 159 TSSTSSKVPCNSTLCELQKQCPSA-----GSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
+ V C LC + P+ C Y+V Y G+ S G LV D++ L
Sbjct: 95 ----GNLVKCVDPLCAAIQSAPNPPCVNPNEQCDYEVEYADQGS-SLGVLVRDIIPLKL- 148
Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAP----NGLFGLGMDKTSVPSILANQGLIPNSF 269
S ++FGCG QT G P G+ GLG + S+ S L ++GLI N
Sbjct: 149 -TNGTLTHSMLAFGCGYDQTHV---GHNPPPSAAGVLGLGNGRASILSQLNSKGLIRNVV 204
Query: 270 SMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGGNAVNFE-FS 324
C G G + FGD+ P G + Q+ + Y + G A + +
Sbjct: 205 GHCLSGTGGGFLFFGDQLIPQSGVVWTPILQSSSSLLKHYKTGPADMFFNGKATSVKGLE 264
Query: 325 AIFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEYCY 369
FDSG+S+TY N A+ + + N + + +T D C+
Sbjct: 265 LTFDSGSSYTYFNSLAHKALVDLITNDIKGKPLSRATEDPSLPICW 310
>gi|356532386|ref|XP_003534754.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 463
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 83/275 (30%), Positives = 122/275 (44%), Gaps = 42/275 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+Y + +G PA F + +DTGS L WL C CV H V I++P+TS T
Sbjct: 113 YYVKIGLGTPAKYFSMIVDTGSSLSWLQCQPCVIYCH--------VQVDPIFTPSTSKTY 164
Query: 164 SKVPCNSTLCELQKQ-------CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
+PC+S+ C K C +A C Y+ Y D + S G+L +DVL L E
Sbjct: 165 KALPCSSSQCSSLKSSTLNAPGCSNATGACVYKASY-GDTSFSIGYLSQDVLTLTPSEAP 223
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD 276
S S +GCG+ G F +G+ GL DK S+ L+ + N+FS C S
Sbjct: 224 S----SGFVYGCGQDNQGLF---GRSSGIIGLANDKISMLGQLSKK--YGNAFSYCLPSS 274
Query: 277 G--------TGRISFGDKG--SPGQGETPFSLRQTHPT-YNITITQVSVGG-----NAVN 320
+G +S G S TP Q P+ Y + +T ++V G +A +
Sbjct: 275 FSAPNSSSLSGFLSIGASSLTSSPYKFTPLVKNQKIPSLYFLDLTTITVAGKPLGVSASS 334
Query: 321 FEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEK 355
+ I DSGT T L Y + ++F + +K
Sbjct: 335 YNVPTIIDSGTVITRLPVAVYNALKKSFVLIMSKK 369
>gi|125558629|gb|EAZ04165.1| hypothetical protein OsI_26307 [Oryza sativa Indica Group]
Length = 455
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 83/294 (28%), Positives = 124/294 (42%), Gaps = 57/294 (19%)
Query: 108 NVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
N+S+G P L F V +DTGS+L W C C C + + P SST S++
Sbjct: 94 NISLGTPPLDFPVIVDTGSNLIWAQCAPCTRCFP-------RPTPAPVLQPARSSTFSRL 146
Query: 167 PCNSTLCEL-----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
PCN + C+ + + +A + C Y Y S T G+L + L +
Sbjct: 147 PCNGSFCQYLPTSSRPRTCNATAACAYNYTYGSGYT--AGYLATETLTVG------DGTF 198
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD----G 277
+++FGC T + +D ++ G+ GLG S+ S LA FS C SD G
Sbjct: 199 PKVAFGC---STENGVDNSS--GIVGLGRGPLSLVSQLAV-----GRFSYCLRSDMADGG 248
Query: 278 TGRISFGDKGSPGQG---------ETPFSLRQTHPTYNIT-----ITQVSVGGNAVNFEF 323
I FG + + P+ R TH N+T T++ V G+ F
Sbjct: 249 ASPILFGSLAKLTERSVVQSTPLLKNPYLQRSTHYYVNLTGIAVDSTELPVTGSTFGFTQ 308
Query: 324 SA-----IFDSGTSFTYLNDPAYTQISETFNS-LAKEKRETSTSDLPF--EYCY 369
+ I DSGT+ TYL Y + + F S +A + T S P+ + CY
Sbjct: 309 TGLGGGTIVDSGTTLTYLAKDGYAMVKQAFQSQMANLNQTTPASGAPYDLDLCY 362
>gi|125563959|gb|EAZ09339.1| hypothetical protein OsI_31611 [Oryza sativa Indica Group]
Length = 453
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 78/277 (28%), Positives = 113/277 (40%), Gaps = 50/277 (18%)
Query: 102 GFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNT 159
G L Y +++VG P LDTGSDL W CD C +C+ + ++SP
Sbjct: 94 GDLEYVLDLAVGTPPQPITALLDTGSDLIWTQCDTCTACLRQPDP---------LFSPRM 144
Query: 160 SSTSSKVPCNSTLCE--LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
SS+ + C LC L C C Y+ Y DGT + G+ + A+ ++
Sbjct: 145 SSSYEPMRCAGQLCGDILHHSCVRP-DTCTYRYSY-GDGTTTLGYYATERFTFASSSGET 202
Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FG 274
+SV + FGCG + GS + +G+ G G D S+ S L+ + FS C +
Sbjct: 203 QSVP--LGFGCGTMNVGSLNNA---SGIVGFGRDPLSLVSQLSIR-----RFSYCLTPYA 252
Query: 275 SDGTGRISFG---------DKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFS 324
S + FG D P Q TP +PT Y + T V+VG + S
Sbjct: 253 SSRKSTLQFGSLADVGLYDDATGPVQ-TTPILQSAQNPTFYYVAFTGVTVGARRLRIPAS 311
Query: 325 A-----------IFDSGTSFTYLNDPAYTQISETFNS 350
A I DSGT+ T ++ F S
Sbjct: 312 AFALRPDGSGGVIIDSGTALTLFPAAVLAEVVRAFRS 348
>gi|449515033|ref|XP_004164554.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 430
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 91/312 (29%), Positives = 133/312 (42%), Gaps = 38/312 (12%)
Query: 60 YSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLN---SLGFLHY-TNVSVGQPA 115
+S+L+H DR R L+ T L +A N L + G Y +VS+G P
Sbjct: 46 FSSLSHYDRLTNAFRRSLS---RSATLLNRAATNGALDLQAPLTPGSGEYLMSVSIGTPP 102
Query: 116 LSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCEL 175
+ +I DTGSDL W C+ C+ S I+ P S++ S VPCNS C+
Sbjct: 103 VDYIGMADTGSDLMW--AQCLPCLKCYKQSR------PIFDPLKSTSFSHVPCNSQNCKA 154
Query: 176 --QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQT 233
C + G C Y Y D T + G L + + + S SV S I GCG
Sbjct: 155 IDDSHCGAQGV-CDYSYTY-GDQTYTKGDLGFEKITIG-----SSSVKSVI--GCGHESG 205
Query: 234 GSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG---SDGTGRISFGDKG--- 287
G F + + GLG + S+ S ++ I FS C S G+I+FG
Sbjct: 206 GGFGFASG---VIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHANGKINFGQNAVVS 262
Query: 288 SPGQGETPFSLRQTHPTYNITITQVSVGGN---AVNFEFSAIFDSGTSFTYLNDPAYTQI 344
PG TP + Y +T+ +S+G A + + I DSGT+ ++L Y +
Sbjct: 263 GPGVVSTPLISKNPVTYYYVTLEAISIGNERHMASAKQGNVIIDSGTTLSFLPKELYDGV 322
Query: 345 SETFNSLAKEKR 356
+ + K KR
Sbjct: 323 VSSLLKVVKAKR 334
>gi|357153697|ref|XP_003576537.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 474
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 86/305 (28%), Positives = 125/305 (40%), Gaps = 49/305 (16%)
Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNT 159
L L+Y +VG A V +DT S+L W+ C C SC + ++ P++
Sbjct: 115 LRTLNYV-ATVGLGAAEATVVVDTASELTWVQCQPCESCHDQQDP---------LFDPSS 164
Query: 160 SSTSSKVPCNSTLCELQKQCPSAGSN-----------CPYQVRYLSDGTMSTGFLVEDVL 208
S + + VPCNS+ C+ + +AG++ C Y + Y DG+ S G L D L
Sbjct: 165 SPSYAAVPCNSSSCDALRVAMAAGTSPCADDNEQQPACSYALSY-RDGSYSRGVLARDKL 223
Query: 209 HLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNS 268
LA + + FGCG G+ G + GL GLG S+ S +Q
Sbjct: 224 RLAGQDIEG------FVFGCGTSNQGAPFGGTS--GLMGLGRSHVSLVSQTMDQ--FGGV 273
Query: 269 FSMCF---GSDGTGRISFGDKGSPGQGETPFSLRQT--------HPTYNITITQVSVGGN 317
FS C S +G + GD S + TP P Y + +T ++VGG
Sbjct: 274 FSYCLPMRESGSSGSLVLGDDSSAYRNSTPIVYTAMVSDSGPLQGPFYFLNLTGITVGGQ 333
Query: 318 AVNFE-FSA---IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRS 373
V FSA I DSGT T L Y + F S E + + + C+ L
Sbjct: 334 EVESPWFSAGRVIIDSGTIITTLVPSVYNAVRAEFLSQLAEYPQAPAFSI-LDTCFNLTG 392
Query: 374 FLHLQ 378
+Q
Sbjct: 393 LKEVQ 397
>gi|15217764|ref|NP_176663.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|5042418|gb|AAD38257.1|AC006193_13 Hypothetical Protein [Arabidopsis thaliana]
gi|332196174|gb|AEE34295.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 431
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 78/271 (28%), Positives = 122/271 (45%), Gaps = 34/271 (12%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ N+S+G P + + DTGSDL W C+ C C + ++ P SST
Sbjct: 86 YLMNISIGTPPVPILAIADTGSDLIWTQCNPCEDCYQQTSP---------LFDPKESSTY 136
Query: 164 SKVPCNSTLCELQKQ--CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
KV C+S+ C + C + + C Y + Y D + + G + D + + + ++ S+
Sbjct: 137 RKVSCSSSQCRALEDASCSTDENTCSYTITY-GDNSYTKGDVAVDTVTMGSSGRRPVSLR 195
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDG- 277
+ I GCG TG+F A +G+ GLG TS+ S L I FS C F S+
Sbjct: 196 NMI-IGCGHENTGTF--DPAGSGIIGLGGGSTSLVSQLRKS--INGKFSYCLVPFTSETG 250
Query: 278 -TGRISFGDKG-SPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNF--------EFSA 325
T +I+FG G G G S+ + P Y + + +SVG + F E +
Sbjct: 251 LTSKINFGTNGIVSGDGVVSTSMVKKDPATYYFLNLEAISVGSKKIQFTSTIFGTGEGNI 310
Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKR 356
+ DSGT+ T L Y ++ S K +R
Sbjct: 311 VIDSGTTLTLLPSNFYYELESVVASTIKAER 341
>gi|61214233|sp|Q766C3.1|NEP1_NEPGR RecName: Full=Aspartic proteinase nepenthesin-1; AltName:
Full=Nepenthesin-I; Flags: Precursor
gi|41016421|dbj|BAD07474.1| aspartic proteinase nepenthesin I [Nepenthes gracilis]
Length = 437
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 92/337 (27%), Positives = 136/337 (40%), Gaps = 53/337 (15%)
Query: 34 FHHRYSDPVKGI-LAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFS-- 90
+HR+ V G + ++ + + + L R + L A N + + S
Sbjct: 30 LNHRHEAKVTGFQIMLEHVDSGKNLTKFQLLERAIERGSRRLQRLEAMLNGPSGVETSVY 89
Query: 91 AGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVI 150
AG+ Y +N +S+G PA F +DTGSDL W C C N S+
Sbjct: 90 AGDGEYLMN---------LSIGTPAQPFSAIMDTGSDLIW--TQCQPCTQCFNQST---- 134
Query: 151 DFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL 210
I++P SS+ S +PC+S LC+ + + C Y Y DG+ + G + + L
Sbjct: 135 --PIFNPQGSSSFSTLPCSSQLCQALSSPTCSNNFCQYTYGY-GDGSETQGSMGTETLTF 191
Query: 211 ATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFS 270
S S+ I+FGCG G F G GL G+G S+PS L FS
Sbjct: 192 G-----SVSIP-NITFGCGENNQG-FGQGNGA-GLVGMGRGPLSLPSQLD-----VTKFS 238
Query: 271 MCFGSDGTGRI------SFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEF 323
C G+ S + + G T PT Y IT+ +SVG + +
Sbjct: 239 YCMTPIGSSTPSNLLLGSLANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVGSTRLPIDP 298
Query: 324 SA------------IFDSGTSFTYLNDPAYTQISETF 348
SA I DSGT+ TY + AY + + F
Sbjct: 299 SAFALNSNNGTGGIIIDSGTTLTYFVNNAYQSVRQEF 335
>gi|302762735|ref|XP_002964789.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
gi|300167022|gb|EFJ33627.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
Length = 390
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 97/346 (28%), Positives = 143/346 (41%), Gaps = 68/346 (19%)
Query: 65 HRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDT 124
HR R RGR L + L+ +G ++ + +G P S+ + LDT
Sbjct: 20 HRHR----RGRSLLQTAQVSSGLSLGSGE-----------YFARMGIGSPQRSYYLELDT 64
Query: 125 GSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAG 183
GSD+ W+ C C SC ++ IY P+ SS+ +V C S LC+ G
Sbjct: 65 GSDVTWIQCAPCSSCYSQVDP---------IYDPSNSSSYRRVYCGSALCQALDYSACQG 115
Query: 184 SNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPN 243
C Y+V Y D + S+G L + +L + S + I+FGCG +G F A
Sbjct: 116 MGCSYRVVY-GDSSASSGDLGIESFYLGPN---SSTAMRNIAFGCGHSNSGLFRGEAGLL 171
Query: 244 GLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRIS-FGDKGSP---GQGETPFSLR 299
G+ G + S I A+ G +FS C R S + SP G+ PF+ R
Sbjct: 172 GMGGGTLSFFS--QIAASIG---PAFSYCL----VDRYSQLQSRSSPLIFGRTAIPFAAR 222
Query: 300 QT----HPT----YNITITQVSVGGNAV-----------NFEFSAIFDSGTSFTYLNDPA 340
T +P Y +T +SVGG A+ N AI DSGTS T + A
Sbjct: 223 FTPLLKNPRIDTFYYAILTGISVGGTALPIPPAQFALTGNGTGGAILDSGTSVTRVVPAA 282
Query: 341 YTQISETFNSLAKEKRETSTSDLPFEYCYVLRSFLHLQALVVLPFP 386
Y + + + R S + P Y+L + + Q L + P
Sbjct: 283 YAVLRDAY-------RAASRNLPPAPGVYLLDTCFNFQGLPTVQIP 321
>gi|326490656|dbj|BAJ89995.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 456
Score = 75.5 bits (184), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 93/343 (27%), Positives = 144/343 (41%), Gaps = 47/343 (13%)
Query: 28 GTFGFDFHHRY----SDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGND 83
G HHR+ + P ++D+ ++ +A R +Y + G +G+D
Sbjct: 55 GVVTVPLHHRHGPCSTVPSTNAPTLEDMLRRDQL--RAAYITR-KYSGVNGSAGDVEGSD 111
Query: 84 KT-PLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGL 142
T P T DT + V +G PA++ + +DTGSD+ W+ C S H
Sbjct: 112 VTVPTTLGTSLDTLE-------YLITVGMGSPAVAQTMLIDTGSDVSWVQCKPCSQCHSQ 164
Query: 143 NSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGF 202
S ++ P++SST S C S C +Q + S C Y V+Y DG+ +G
Sbjct: 165 ADS--------LFDPSSSSTYSAFSCTSAACAQLRQRGCSSSQCQYTVKY-GDGSTGSGT 215
Query: 203 LVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQ 262
D L L + S FGC + ++G+ L + G ++ LA Q
Sbjct: 216 YSSDTLALGS------STVENFQFGCSQSESGNLLQDQTAGLMGLGGGAES-----LATQ 264
Query: 263 --GLIPNSFSMCF--GSDGTGRISFGDKGSPGQGETPFSLRQTH-PT-YNITITQVSVGG 316
G +FS C +G ++ G S +TP LR T P+ Y + + + VGG
Sbjct: 265 TAGTFGKAFSYCLPPTPGSSGFLTLGASTSGFVVKTPM-LRSTQVPSYYGVLLQAIRVGG 323
Query: 317 NAVNFEFSA-----IFDSGTSFTYLNDPAYTQISETFNSLAKE 354
+N SA I DSGT T L AY+ +S F + K+
Sbjct: 324 RQLNIPASAFSAGSIMDSGTIITRLPRTAYSALSSAFKAGMKQ 366
>gi|148905906|gb|ABR16115.1| unknown [Picea sitchensis]
Length = 482
Score = 75.5 bits (184), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 81/261 (31%), Positives = 114/261 (43%), Gaps = 39/261 (14%)
Query: 112 GQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNS 170
G PA + ++ +DTGSDL W+ C C C +++ I+ P SS+ +PC S
Sbjct: 144 GTPAKNSLLIIDTGSDLTWIQCKPCADCYSQVDA---------IFEPKQSSSYKTLPCLS 194
Query: 171 TLC-EL----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
C EL P C Y++ Y DG+ S G ++ L L +D Q+ +
Sbjct: 195 ATCTELITSESNPTPCLLGGCVYEINY-GDGSSSQGDFSQETLTLGSDSFQN------FA 247
Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GSDGTGRI 281
FGCG TG F +GL GLG + S PS ++ F+ C S TG
Sbjct: 248 FGCGHTNTGLF---KGSSGLLGLGQNSLSFPS--QSKSKYGGQFAYCLPDFGSSTSTGSF 302
Query: 282 SFGDKGSPGQGE-TPFSLRQTHPT-YNITITQVSVGGN------AVNFEFSAIFDSGTSF 333
S G P TP +PT Y + + +SVGG+ AV S I DSGT
Sbjct: 303 SVGKGSIPASAVFTPLVSNFMYPTFYFVGLNGISVGGDRLSIPPAVLGRGSTIVDSGTVI 362
Query: 334 TYLNDPAYTQISETFNSLAKE 354
T L AY + +F S ++
Sbjct: 363 TRLLPQAYNALKTSFRSKTRD 383
>gi|224099307|ref|XP_002311432.1| predicted protein [Populus trichocarpa]
gi|222851252|gb|EEE88799.1| predicted protein [Populus trichocarpa]
Length = 458
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 81/286 (28%), Positives = 123/286 (43%), Gaps = 51/286 (17%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSC---VHGLNSSSGQVIDFNIYSPNTSS 161
++ ++ +G P + ++ DTGSDL W+ C +H S+ + S+
Sbjct: 83 YFVSIRLGSPPQTLLLVADTGSDLTWVRCSACKTNCSIHPPGST---------FLARHST 133
Query: 162 TSSKVPCNSTLCELQKQCPSAG--------SNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
T S C S+LC+L Q P+ S C Y+ Y SDG+ ++GF ++ L T
Sbjct: 134 TFSPTHCFSSLCQLVPQ-PNPNPCNHTRLHSTCRYEYVY-SDGSKTSGFFSKETTTLNTS 191
Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPN---GLFGLGMDKTSVPSILANQGLIPNSFS 270
+ + S I+FGCG +G L G++ N G+ GLG S S L + SFS
Sbjct: 192 SGREMKLKS-IAFGCGFHASGPSLIGSSFNGASGVMGLGRGPISFASQLGRR--FGRSFS 248
Query: 271 MC-----FGSDGTGRISFGDKGSPGQGE------TPFSLRQTHPT-YNITITQVSVGGNA 318
C T + GD S + TP + PT Y I+I V V G
Sbjct: 249 YCLLDYTLSPPPTSYLMIGDVVSTKKDNKSMMSFTPLLINPEAPTFYYISIKGVFVDGVK 308
Query: 319 VNFEFSA-----------IFDSGTSFTYLNDPAYTQISETFNSLAK 353
++ + S + DSGT+ T+L +PAY +I F K
Sbjct: 309 LHIDPSVWSLDELGNGGTVIDSGTTLTFLTEPAYREILSAFKREVK 354
>gi|413953772|gb|AFW86421.1| hypothetical protein ZEAMMB73_098827 [Zea mays]
Length = 482
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 82/312 (26%), Positives = 133/312 (42%), Gaps = 43/312 (13%)
Query: 59 YYSALAHRDRYFRLRG--RGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPAL 116
+Y+ + RD + R+R R L G+ + S G + L + + +G PA
Sbjct: 84 HYTGILRRD-HNRVRSIHRRLTGAGDTAATIPASLGLAFHSLE-----YVVTIGIGTPAR 137
Query: 117 SFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCEL 175
+F V DTGSDL W+ C C + ++ P+ SST VPC + C++
Sbjct: 138 NFTVLFDTGSDLTWVQCKPCTDSCYQQQEP--------LFDPSKSSTYVDVPCGTPQCKI 189
Query: 176 --QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQT 233
+ G+ C Y V+Y D +++ G L ++ L+ + V FGC +
Sbjct: 190 GGGQDLTCGGTTCEYSVKY-GDQSVTRGNLAQEAFTLSPSAPPAAGV----VFGCSH-EY 243
Query: 234 GSFLDGAAPN----GLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT--GRISFGDKG 287
S + GA GL GLG +S+ S +G + FS C G+ G ++ G
Sbjct: 244 SSGVKGAEEEMSVAGLLGLGRGDSSILS-QTRRGNSGDVFSYCLPPRGSSAGYLTIG-AA 301
Query: 288 SPGQGETPFSL-----RQTHPTYNITITQVSVGGNAVNFEFSA-----IFDSGTSFTYLN 337
+P Q F+ Q Y + + +SV G A+ + SA + DSGT T++
Sbjct: 302 APPQSNLSFTPLVTDNSQLSSVYVVNLVGISVSGAALPIDASAFYIGTVIDSGTVITHMP 361
Query: 338 DPAYTQISETFN 349
AY + + F
Sbjct: 362 AAAYYVLRDEFR 373
>gi|414589630|tpg|DAA40201.1| TPA: hypothetical protein ZEAMMB73_629620 [Zea mays]
Length = 443
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 79/271 (29%), Positives = 108/271 (39%), Gaps = 45/271 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ + +G P + LDTGSDL W C C+ CV Q + + P S+T
Sbjct: 90 YLMEMGIGTPTRYYSAILDTGSDLIWTQCAPCLLCVD-------QPTPY--FDPARSATY 140
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
+ C S C C YQ Y D + G L + T+E +
Sbjct: 141 RSLGCASPACNALYYPLCYQKVCVYQYFY-GDSASTAGVLANETFTFGTNETRVSL--PG 197
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDGTGR 280
ISFGCG + GS +G +G+ G G S+ S L + FS C F S R
Sbjct: 198 ISFGCGNLNAGSLANG---SGMVGFGRGSLSLVSQLGSP-----RFSYCLTSFLSPVPSR 249
Query: 281 ISFG--------DKGSPGQGETPFSLRQTHPT-YNITITQVSVGG------------NAV 319
+ FG + S TPF + PT Y + +T +SVGG N
Sbjct: 250 LYFGVYATLNSTNASSEPVQSTPFVVNPALPTMYFLNMTGISVGGYLLPIDPAVFAINDT 309
Query: 320 NFEFSAIFDSGTSFTYLNDPAYTQISETFNS 350
+ I DSGT+ TYL +PAY + F S
Sbjct: 310 DGTGGTIIDSGTTITYLAEPAYDAVRAAFAS 340
>gi|449486856|ref|XP_004157423.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 491
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 84/296 (28%), Positives = 126/296 (42%), Gaps = 44/296 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+++ V VGQPA F + LDTGSD+ WL C C C + I+ P +SS+
Sbjct: 155 YFSRVGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDP---------IFDPRSSSSF 205
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
+ +PC S C+ + S C YQV Y DG+ + G V + L + + +
Sbjct: 206 ASLPCESQQCQALETSGCRASKCLYQVSY-GDGSFTVGEFVTETLTFG-----NSGMIND 259
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGTGR 280
++ GCG G F+ A GL G + TS + +SFS C S +
Sbjct: 260 VAVGCGHDNEGLFVGSAGLLGLGGGPLSLTS--------QMKASSFSYCLVDRDSSSSSD 311
Query: 281 ISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVN-----FEFS------AIFD 328
+ F P T Y + +T +SVGG ++ F+ I D
Sbjct: 312 LEFNSAAPSDSVNAPLLKSGKVDTFYYVGLTGMSVGGQLLSIPPNLFQMDDSGYGGIIVD 371
Query: 329 SGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRSFLHLQALVVLP 384
SGT+ T L AY + + F S ++T+ L F+ CY L S Q+ V +P
Sbjct: 372 SGTAITRLQTQAYNTLRDAFVSRTPYLKKTNGFAL-FDTCYDLSS----QSRVTIP 422
>gi|356555248|ref|XP_003545946.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 94/359 (26%), Positives = 141/359 (39%), Gaps = 62/359 (17%)
Query: 40 DPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLN 99
+ VKG + D L ++ + +++ D R +G TP + R +
Sbjct: 56 EAVKGFVKRDKLRRQRMNQRWGVVSNYDS----RRKGFEMT---TTPAEVEMPMHSGRDD 108
Query: 100 SLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNT 159
+LG ++ V VG P F + +DTGS+ WL C S S + +
Sbjct: 109 ALG-EYFAEVKVGSPGQRFWLVVDTGSEFTWLNC----------SKSFEAV--------- 148
Query: 160 SSTSSKVPCNSTLCEL--QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLA-TDEKQ 216
T + C L EL CP C Y + Y +DG+ + GF D + + T+ KQ
Sbjct: 149 --TCASRKCKVDLSELFSLSVCPKPSDPCLYDISY-ADGSSAKGFFGTDSITVGLTNGKQ 205
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPN----GLFGLGMDKTSVPSILANQGLIPNSFSMC 272
K + ++ GC T S L+G N G+ GLG K S AN+ FS C
Sbjct: 206 GKL--NNLTIGC----TKSMLNGVNFNEETGGILGLGFAKDSFIDKAANK--YGAKFSYC 257
Query: 273 FGSDGTGRISFGDKGSPGQGETPF--SLRQTH-----PTYNITITQVSVGGNAV------ 319
+ R + G +R+T P Y + + +S+GG +
Sbjct: 258 LVDHLSHRSVSSNLTIGGHHNAKLLGEIRRTELILFPPFYGVNVVGISIGGQMLKIPPQV 317
Query: 320 ---NFEFSAIFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEYCYVLRSF 374
N E + DSGT+ T L PAY + E SL K KR T E+C+ F
Sbjct: 318 WDFNAEGGTLIDSGTTLTSLLLPAYEAVFEALTKSLTKVKRVTGEDFDALEFCFDAEGF 376
>gi|297828001|ref|XP_002881883.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
lyrata]
gi|297327722|gb|EFH58142.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
lyrata]
Length = 529
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 80/282 (28%), Positives = 110/282 (39%), Gaps = 46/282 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ +V VG P F + LDTGSDL WL C C C H + Y P TS++
Sbjct: 162 YFMDVLVGTPPKHFSLILDTGSDLNWLQCLPCYDCFHQNEA---------FYDPKTSASF 212
Query: 164 SKVPCNSTLCELQK------QCPSAGSNCPYQVRYLSDGTMSTGFLVED-VLHLATDEKQ 216
+ CN C L QC S +CPY Y + F VE ++L T E +
Sbjct: 213 KNITCNDPRCSLISSPEPPVQCKSDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTEGR 272
Query: 217 SKSVD-SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-- 273
S + FGCG G F + GL + +S Q L +SFS C
Sbjct: 273 SSEYKVENMMFGCGHWNRGLFSGASGLLGLGRGPLSFSS-----QLQSLYGHSFSYCLVD 327
Query: 274 ---GSDGTGRISFG-DKGSPGQGETPFS------LRQTHPTYNITITQVSVGGNAVNFEF 323
++ + ++ FG DK F+ Y I I + VGG A++
Sbjct: 328 RNSDTNVSSKLIFGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILVGGEALDIPE 387
Query: 324 S-----------AIFDSGTSFTYLNDPAYTQISETFNSLAKE 354
I DSGT+ +Y +PAY I F KE
Sbjct: 388 ETWNISPDGAGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKE 429
>gi|414881704|tpg|DAA58835.1| TPA: hypothetical protein ZEAMMB73_701358 [Zea mays]
Length = 485
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 85/300 (28%), Positives = 120/300 (40%), Gaps = 54/300 (18%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++T + VG PA ++ LDTGSD+ W+ C C C SG V D P SS+
Sbjct: 129 YFTKIGVGTPATQALMVLDTGSDVVWVQCAPCRRCYE----QSGPVFD-----PRRSSSY 179
Query: 164 SKVPCNSTLCELQKQ--CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
V C + LC C C YQV Y DG+++ G V + L A +
Sbjct: 180 GAVGCGAALCRRLDSGGCDLRRGACMYQVAY-GDGSVTAGDFVTETLTFAGGARV----- 233
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-------- 273
+R++ GCG G F+ A GL S P+ ++ + SFS C
Sbjct: 234 ARVALGCGHDNEGLFVAAAGLLGLG---RGGLSFPTQISRR--YGRSFSYCLVDRTSSGA 288
Query: 274 ----GSDGTGRISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGGNAV------ 319
GS + +SFG GS G F+ +P Y + + +SVGG V
Sbjct: 289 GAAPGSHRSSTVSFG-AGSVGASSASFTPMVRNPRMETFYYVQLVGISVGGARVPGVAES 347
Query: 320 -------NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP-FEYCYVL 371
I DSGTS T L +Y+ + + F + A S F+ CY L
Sbjct: 348 DLRLDPSTGRGGVIVDSGTSVTRLARASYSALRDAFRAAAAGGLRLSPGGFSLFDTCYDL 407
>gi|357137788|ref|XP_003570481.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 455
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 83/286 (29%), Positives = 122/286 (42%), Gaps = 34/286 (11%)
Query: 99 NSLGFLHY-TNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSP 157
S+G +Y T + +G PA +I+ +DTGS L WL C C + SG V D P
Sbjct: 110 TSVGVGNYVTRMGLGTPAKPYIMVVDTGSSLTWL--QCSPCRVSCHRQSGPVFD-----P 162
Query: 158 NTSSTSSKVPCNSTLCE------LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLA 211
TSS+ + V C+S C+ L S + C YQ Y D + S G+L +D +
Sbjct: 163 KTSSSYAAVSCSSPQCDGLSTATLNPAVCSPSNVCIYQASY-GDSSFSVGYLSKDTVSFG 221
Query: 212 TDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSM 271
+ +GCG+ G F A GL GL +K S+ LA + SFS
Sbjct: 222 ANSVP------NFYYGCGQDNEGLFGRSA---GLMGLARNKLSLLYQLAPT--LGYSFSY 270
Query: 272 CFGS-DGTGRISFGDKGSPGQGETPF-SLRQTHPTYNITITQVSVGGNAVNFEFS----- 324
C S +G +S G G TP S Y I+++ ++V G + S
Sbjct: 271 CLPSTSSSGYLSIGSYNPGGYSYTPMVSNTLDDSLYFISLSGMTVAGKPLAVSSSEYTSL 330
Query: 325 -AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
I DSGT T L YT +S+ + K + + + + C+
Sbjct: 331 PTIIDSGTVITRLPTSVYTALSKAVAAAMKGSTKRAAAYSILDTCF 376
>gi|449527515|ref|XP_004170756.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial [Cucumis
sativus]
Length = 364
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 83/280 (29%), Positives = 123/280 (43%), Gaps = 45/280 (16%)
Query: 111 VGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCN 169
+G PA + ++ALDT +D W+PC C+ C ++S + SS+ +PC
Sbjct: 32 IGTPAQTLLLALDTSNDAAWIPCSGCIGCPST-----------TVFSSDKSSSFRPLPCQ 80
Query: 170 STLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCG 229
S C +GS C + + Y S + LV+D L LATD S +FGC
Sbjct: 81 SPQCNQVPNPSCSGSACGFNLTYGSSTVAAD--LVQDNLTLATDSVPS------YTFGCI 132
Query: 230 RVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS----DGTGRISFGD 285
R TGS P GL GLG S + +Q L ++FS C S + +G + G
Sbjct: 133 RKATGS---SVPPQGLLGLGRGPLS--LLGQSQSLYQSTFSYCLPSFKSVNFSGSLRLGP 187
Query: 286 KGSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEFSA-----------IFDSGTS 332
P + + LR + Y + + + VG V+ SA + DSGT+
Sbjct: 188 VAQPIRIKYTPLLRNPRRSSLYYVNLISIRVGRKIVDIPPSALAFNSATGAGTVIDSGTT 247
Query: 333 FTYLNDPAYTQISETFNSLAKEKRETSTSDL-PFEYCYVL 371
FT L PAYT + + F + R + S L F+ CY +
Sbjct: 248 FTRLVAPAYTAVRDEFRR--RVGRNVTVSSLGGFDTCYTV 285
>gi|115479489|ref|NP_001063338.1| Os09g0452800 [Oryza sativa Japonica Group]
gi|51535938|dbj|BAD38020.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113631571|dbj|BAF25252.1| Os09g0452800 [Oryza sativa Japonica Group]
gi|125605918|gb|EAZ44954.1| hypothetical protein OsJ_29597 [Oryza sativa Japonica Group]
gi|215740967|dbj|BAG97462.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 453
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 78/277 (28%), Positives = 113/277 (40%), Gaps = 50/277 (18%)
Query: 102 GFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNT 159
G L Y +++VG P LDTGSDL W CD C +C+ + ++SP
Sbjct: 94 GDLEYVLDLAVGTPPQPITALLDTGSDLIWTQCDTCTACLRQPDP---------LFSPRM 144
Query: 160 SSTSSKVPCNSTLCE--LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
SS+ + C LC L C C Y+ Y DGT + G+ + A+ ++
Sbjct: 145 SSSYEPMRCAGQLCGDILHHSCVRP-DTCTYRYSY-GDGTTTLGYYATERFTFASSSGET 202
Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FG 274
+SV + FGCG + GS + +G+ G G D S+ S L+ + FS C +
Sbjct: 203 QSVP--LGFGCGTMNVGSLNNA---SGIVGFGRDPLSLVSQLSIR-----RFSYCLTPYA 252
Query: 275 SDGTGRISFG---------DKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFS 324
S + FG D P Q TP +PT Y + T V+VG + S
Sbjct: 253 SSRKSTLQFGSLADVGLYDDATGPVQ-TTPILQSAQNPTFYYVAFTGVTVGARRLRIPAS 311
Query: 325 A-----------IFDSGTSFTYLNDPAYTQISETFNS 350
A I DSGT+ T ++ F S
Sbjct: 312 AFALRPDGSGGVIIDSGTALTLFPVAVLAEVVRAFRS 348
>gi|293329689|dbj|BAJ04354.1| pollen allergen CPA63 [Cryptomeria japonica]
Length = 472
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 86/279 (30%), Positives = 123/279 (44%), Gaps = 44/279 (15%)
Query: 112 GQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNS 170
G P SF LDTGS++ W+PC+ C C SS Q + P+ SST + + C S
Sbjct: 131 GTPPQSFYTVLDTGSNIAWIPCNPCSGC------SSKQ----QPFEPSKSSTYNYLTCAS 180
Query: 171 TLCELQKQCPSAGS--NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGC 228
C+L + C + + NC RY G S V+++L T S+ V++ + FGC
Sbjct: 181 QQCQLLRVCTKSDNSVNCSLTQRY---GDQSE---VDEILSSETLSVGSQQVENFV-FGC 233
Query: 229 GRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC----FGSDGTGRISFG 284
G L P+ L G G + S S A L ++FS C F S TG + G
Sbjct: 234 SNAARG--LIQRTPS-LVGFGRNPLSFVSQTAT--LYDSTFSYCLPSLFSSAFTGSLLLG 288
Query: 285 DKGSPGQGE--TPFSLRQTHPT-YNITITQVSVGGNAVNFEF-----------SAIFDSG 330
+ QG TP +P+ Y + + +SVG V+ I DSG
Sbjct: 289 KEALSAQGLKFTPLLSNSRYPSFYYVGLNGISVGEELVSIPAGTLSLDESTGRGTIIDSG 348
Query: 331 TSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
T T L +PAY + ++F S S +DL F+ CY
Sbjct: 349 TVITRLVEPAYNAMRDSFRSQLSNLTMASPTDL-FDTCY 386
>gi|242086412|ref|XP_002443631.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
gi|241944324|gb|EES17469.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
Length = 507
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 96/316 (30%), Positives = 133/316 (42%), Gaps = 55/316 (17%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ ++VG PA+ ++ALDT SDL WL C C C SG V D P S++
Sbjct: 141 YIAKIAVGTPAVEALLALDTASDLTWLQCQPCRRCY----PQSGPVFD-----PRHSTSY 191
Query: 164 SKVPCNSTLCELQKQCPSAGSN---CPYQVRYLSDG------TMSTGFLVEDVLHLATDE 214
++ ++ C+ + + C Y V Y DG + S G LVE+ L A
Sbjct: 192 GEMNYDAPDCQALGRSGGGDAKRGTCIYTVLY-GDGDGHGSTSTSVGDLVEETLTFAGGV 250
Query: 215 KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF- 273
+Q+ +S GCG G F GA G+ GL + S+P +A G SFS C
Sbjct: 251 RQAY-----LSIGCGHDNKGLF--GAPAAGILGLSRGQISIPHQIAFLGY-NASFSYCLV 302
Query: 274 ------GSDGTGRISFGDKG---SPGQGETPFSLRQTHPT-YNITITQVSVGGNAV---- 319
GS + ++FG SP TP L Q PT Y + + VSVGG V
Sbjct: 303 DFISGPGSP-SSTLTFGAGAVDTSPPASFTPTVLNQNMPTFYYVRLIGVSVGGVRVPGVT 361
Query: 320 ---------NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP--FEYC 368
I DSGT+ T L PAYT + F + A + ST F+ C
Sbjct: 362 ERDLQLDPYTGHGGVILDSGTTVTRLARPAYTAFRDAFRAAATGLGQVSTGGPSGLFDTC 421
Query: 369 YVLRSFLHLQALVVLP 384
Y + L+ V +P
Sbjct: 422 YTVGGRAGLRHCVKVP 437
>gi|125536419|gb|EAY82907.1| hypothetical protein OsI_38120 [Oryza sativa Indica Group]
Length = 448
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 92/311 (29%), Positives = 125/311 (40%), Gaps = 44/311 (14%)
Query: 68 RYFRLRGRGLAAQGNDKTPLTFSAGNDT---YRLNSLGFLHYTNVSVGQPALSFIVALDT 124
R R LAA+ + + +++G T + G + S+G+P L +DT
Sbjct: 47 RTAESRNLSLAAERSRRRLSVYTSGTGTKAPVTKSQKGGKYIMQFSIGEPPLLIWAEVDT 106
Query: 125 GSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE-------LQK 177
GSDL W+ C S +G N +Y P S +S K+PC+S LC+ +
Sbjct: 107 GSDLMWVKC---SPCNGCNPPPSP-----LYDPARSRSSGKLPCSSQLCQALGRGRIISD 158
Query: 178 QCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFL 237
QC C Y Y G ST + VL T V + +SFG GS
Sbjct: 159 QCSDDPPLCGYHYAYGHSGDHST----QGVLGTETFTFGDGYVANNVSFGRSDTIDGSQF 214
Query: 238 DGAAPNGLFGLGMDKTSVPSILANQGLI------PNSFS-MCFGSDGTGRISFGDKGSPG 290
G A GL GLG S+ S L PN +S + FGS S GD S
Sbjct: 215 GGTA--GLVGLGRGHLSLVSQLGAGRFAYCLAADPNVYSTILFGSLAALDTSAGDVSSTP 272
Query: 291 QGETPFSLRQTHPTYNITITQVSVGGN---------AVNFEFSA--IFDSGTSFTYLNDP 339
P R TH Y + + +SVGG+ A+N + S FDSG T L D
Sbjct: 273 LVTNPKPDRDTH--YYVNLQGISVGGSRLPIKDGTFAINSDGSGGVFFDSGAIDTSLKDA 330
Query: 340 AYTQISETFNS 350
AY + + S
Sbjct: 331 AYQVVRQAITS 341
>gi|242081123|ref|XP_002445330.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
gi|241941680|gb|EES14825.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
Length = 543
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 94/344 (27%), Positives = 137/344 (39%), Gaps = 69/344 (20%)
Query: 35 HHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLR---GRGLAAQ---GNDKTPLT 88
H RY ++ +LA D+ R F+LR R AA G+ + PLT
Sbjct: 133 HDRY---LRRLLAADE--------------SRANSFQLRIRNDRAAAASTQSGSAEVPLT 175
Query: 89 FSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSG 147
+G LN + + S G PA + V +DTGSDL W+ C C +C +
Sbjct: 176 --SGIRFQTLNYVTTIALGGGSSGSPAANLTVIVDTGSDLTWVQCKPCSACYAQRDP--- 230
Query: 148 QVIDFNIYSPNTSSTSSKVPCNSTLCELQKQ--------CPSAGSNCPYQVRYLSDGTMS 199
++ P S+T + V CN++ C + C C Y + Y DG+ S
Sbjct: 231 ------LFDPAGSATYAAVRCNASACAASLKAATGTPGSCGGGNERCYYALAY-GDGSFS 283
Query: 200 TGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSIL 259
G L D + L S+D + FGCG G F GL GLG + S+ S
Sbjct: 284 RGVLATDTVALG-----GASLDGFV-FGCGLSNRGLF---GGTAGLMGLGRTELSLVSQT 334
Query: 260 ANQGLIPNSFSMCF----GSDGTGRISFGDKGSPGQGETPFSLRQT------HPTYNITI 309
A + FS C D +G +S G S + TP + + P Y + +
Sbjct: 335 ALR--YGGVFSYCLPATTSGDASGSLSLGGDASSYRNTTPVAYTRMIADPAQPPFYFLNV 392
Query: 310 TQVSVGGNAVNFE----FSAIFDSGTSFTYLNDPAYTQISETFN 349
T +VGG A+ + + + DSGT T L Y + F
Sbjct: 393 TGAAVGGTALAAQGLGASNVLIDSGTVITRLAPSVYRGVRAEFT 436
>gi|449449334|ref|XP_004142420.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 441
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 78/280 (27%), Positives = 122/280 (43%), Gaps = 45/280 (16%)
Query: 111 VGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCN 169
+G PA + ++ALDT +D W+PC C+ C ++S + SS+ +PC
Sbjct: 109 IGTPAQTLLLALDTSNDAAWIPCSGCIGCPST-----------TVFSSDKSSSFRPLPCQ 157
Query: 170 STLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCG 229
S C +GS C + + Y S + LV+D L LATD S +FGC
Sbjct: 158 SPQCNQVPNPSCSGSACGFNLTYGSSTVAAD--LVQDNLTLATDSVPS------YTFGCI 209
Query: 230 RVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS----DGTGRISFGD 285
R TGS + LG+ + + + +Q L ++FS C S + +G + G
Sbjct: 210 RKATGSSVPPQG-----LLGLGRGPLSLLGQSQSLYQSTFSYCLPSFKSVNFSGSLRLGP 264
Query: 286 KGSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEFSA-----------IFDSGTS 332
P + + LR + Y + + + VG V+ SA + DSGT+
Sbjct: 265 VAQPIRIKYTPLLRNPRRSSLYYVNLISIRVGRKIVDIPPSALAFNSATGAGTVIDSGTT 324
Query: 333 FTYLNDPAYTQISETFNSLAKEKRETSTSDL-PFEYCYVL 371
FT L PAYT + + F + R + S L F+ CY +
Sbjct: 325 FTRLVAPAYTAVRDEFRR--RVGRNVTVSSLGGFDTCYTV 362
>gi|224126221|ref|XP_002329620.1| predicted protein [Populus trichocarpa]
gi|222870359|gb|EEF07490.1| predicted protein [Populus trichocarpa]
Length = 357
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 85/297 (28%), Positives = 118/297 (39%), Gaps = 45/297 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++T V VG PA F + LDTGSD+ WL C C C + I+ P SST
Sbjct: 20 YFTRVGVGNPARQFYMVLDTGSDINWLQCQPCTDCYQQTDP---------IFDPTASSTY 70
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
+ V C S C + C YQV Y DG+ + G + + S SV +
Sbjct: 71 APVTCQSQQCSSLEMSSCRSGQCLYQVNY-GDGSYTFGDFATESVSFG----NSGSVKN- 124
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGTGR 280
++ GCG G F+ A + P L NQ L SFS C S G+
Sbjct: 125 VALGCGHDNEGLFVGAAG-------LLGLGGGPLSLTNQ-LKATSFSYCLVNRDSAGSST 176
Query: 281 ISFGDK--GSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA-----------IF 327
+ F G R+ Y + ++ +SVGG V+ S I
Sbjct: 177 LDFNSAQLGVDSVTAPLMKNRKIDTFYYVGLSGMSVGGQMVSIPESTFRLDESGNGGIIV 236
Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRSFLHLQALVVLP 384
D GT+ T L AY + + F + + + TS L F+ CY L QA V +P
Sbjct: 237 DCGTAITRLQTQAYNPLRDAFVRMTQNLKLTSAVAL-FDTCYDLSG----QASVRVP 288
>gi|357162717|ref|XP_003579500.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 488
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 89/306 (29%), Positives = 121/306 (39%), Gaps = 54/306 (17%)
Query: 108 NVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
+VS+G P V LDTGS L W+PC +SS + ++ P SS+S V
Sbjct: 94 SVSLGTPPQPLPVLLDTGSHLSWVPCTSSYQCRNCSSSPSAMSAMAVFHPKNSSSSRLVG 153
Query: 168 CNSTLCEL-----QKQCPSAGSN------CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
C + C C S G+N PY V Y S T +G L+ D L L+
Sbjct: 154 CRNPACRWIHSKSPSTCGSTGNNGNGDVCPPYLVVYGSGST--SGLLISDTLRLSPSSSS 211
Query: 217 SKSVDSR-ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-- 273
S R + GC V P+GL G G SVPS L +P FS C
Sbjct: 212 SAPAPFRNFAIGCSIVSVHQ-----PPSGLAGFGRGAPSVPSQLK----VPK-FSYCLLS 261
Query: 274 -----GSDGTGRISFGDKGSP-GQGETPFSL------RQTHPTYNI----TITQVSVGGN 317
S +G + GD P G+ +T + P Y++ +T +SVGG
Sbjct: 262 RRFDDNSAVSGELVLGDAMVPAGKKKTTMQYVPLLNNAASKPPYSVYYYLALTGISVGGK 321
Query: 318 AVNFEF---------SAIFDSGTSFTYLNDPAYTQISETFNSL--AKEKRETSTSD-LPF 365
VN AI DSGT+FTYL+ + ++ S + R D L
Sbjct: 322 PVNLPSRAFVPSSGGGAIIDSGTTFTYLDPTVFKPVAAAMESAVGGRYNRSRPVEDALGL 381
Query: 366 EYCYVL 371
C+ L
Sbjct: 382 RPCFAL 387
>gi|225427550|ref|XP_002266461.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 439
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 81/287 (28%), Positives = 126/287 (43%), Gaps = 39/287 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ N+ +G P + I +DTGSDL W C C C QV+ ++ P SST
Sbjct: 92 YLMNLYIGTPPVPVIAIVDTGSDLTWTQCRPCTHCYK-------QVVP--LFDPKNSSTY 142
Query: 164 SKVPCNSTLCEL--QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
C ++ C + + S C ++ Y +DG+ + G L + L + D K V
Sbjct: 143 RDSSCGTSFCLALGKDRSCSKEKKCTFRYSY-ADGSFTGGNLASETLTV--DSTAGKPVS 199
Query: 222 -SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-----GS 275
+FGCG G F + +G+ GLG + S+ S L + I FS C S
Sbjct: 200 FPGFAFGCGHSSGGIF--DKSSSGIVGLGGGELSLISQL--KSTINGLFSYCLLPVSTDS 255
Query: 276 DGTGRISFGDKGS-PGQGETPFSLRQTHPT--YNITITQVSVGGNAVNF----------E 322
+ RI+FG G G G L Q P Y +T+ +SVG + + E
Sbjct: 256 SISSRINFGASGRVSGYGTVSTPLVQKSPDTFYYLTLEGISVGKKRLPYKGYSKKTEVEE 315
Query: 323 FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
+ I DSGT++T+L Y+++ ++ + K KR + + F CY
Sbjct: 316 GNIIVDSGTTYTFLPQEFYSKLEKSVANSIKGKRVRDPNGI-FSLCY 361
>gi|356495496|ref|XP_003516613.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 645
Score = 74.7 bits (182), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 68/240 (28%), Positives = 108/240 (45%), Gaps = 27/240 (11%)
Query: 111 VGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNS 170
+G P F + +DTGS + ++PC +C H S Q F P S T V C
Sbjct: 99 IGTPPQRFALIVDTGSTVTYVPCS--TCRH---CGSHQDPKFR---PEDSETYQPVKCT- 149
Query: 171 TLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGR 230
Q C + C Y+ RY ++ + S+G L EDV+ Q++ R FGC
Sbjct: 150 ----WQCNCDNDRKQCTYERRY-AEMSTSSGALGEDVVSFGN---QTELSPQRAIFGCEN 201
Query: 231 VQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPG 290
+TG + A +G+ GLG S+ L + +I +SFS+C+G G G + G
Sbjct: 202 DETGDIYNQRA-DGIMGLGRGDLSIMDQLVEKKVISDSFSLCYGGMGVGGGAMVLGGISP 260
Query: 291 QGETPFSLRQ--THPTYNITITQVSVGGNAVNF-------EFSAIFDSGTSFTYLNDPAY 341
+ F+ P YNI + ++ V G ++ + + DSGT++ YL + A+
Sbjct: 261 PADMVFTRSDPVRSPYYNIDLKEIHVAGKRLHLNPKVFDGKHGTVLDSGTTYAYLPESAF 320
>gi|449499012|ref|XP_004160696.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 435
Score = 74.7 bits (182), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 74/282 (26%), Positives = 122/282 (43%), Gaps = 37/282 (13%)
Query: 109 VSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
+SVG P I DTGSD+ W C C +C D +++P+ S+T KV
Sbjct: 89 LSVGTPPFPIIAVADTGSDIIWTQCVPCTNCYQQ---------DLPMFNPSKSTTYRKVS 139
Query: 168 CNSTLCEL--QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
C+S +C + S +C Y + Y D + S G D L + + + + R +
Sbjct: 140 CSSPVCSFTGEDNSCSFKPDCTYSISY-GDNSHSQGDFAVDTLTMGSTSGRVVAF-PRTA 197
Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSD--GTGR 280
GCG GSF A +G+ GLG+ S+ + + + FS C G+D G+ +
Sbjct: 198 IGCGHDNAGSF--DANVSGIVGLGLGPASLIKQMGSA--VGGKFSYCLTPIGNDDGGSNK 253
Query: 281 ISFGDKGS---PGQGETPFSLRQTHPT-YNITITQVSVGGNAVNF---------EFSAIF 327
++FG + G TP + + Y++ + VSVG N + + + I
Sbjct: 254 LNFGSNANVSGSGAVSTPIYISDKFKSFYSLKLKAVSVGRNNTFYSTANSILGGKANIII 313
Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
DSGT+ T L Y ++ ++ +R + EYC+
Sbjct: 314 DSGTTLTLLPVDLYHNFAKAISNSINLQRTDDPNQF-LEYCF 354
>gi|449439383|ref|XP_004137465.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 491
Score = 74.7 bits (182), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 84/296 (28%), Positives = 126/296 (42%), Gaps = 44/296 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+++ V VGQPA F + LDTGSD+ WL C C C + I+ P +SS+
Sbjct: 155 YFSRVGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDP---------IFDPRSSSSF 205
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
+ +PC S C+ + S C YQV Y DG+ + G V + L + + +
Sbjct: 206 ASLPCESQQCQALETSGCRASKCLYQVSY-GDGSFTVGEFVIETLTFG-----NSGMINN 259
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGTGR 280
++ GCG G F+ A GL G + TS + +SFS C S +
Sbjct: 260 VAVGCGHDNEGLFVGSAGLLGLGGGSLSLTS--------QMKASSFSYCLVDRDSSSSSD 311
Query: 281 ISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVN-----FEFS------AIFD 328
+ F P T Y + +T +SVGG ++ F+ I D
Sbjct: 312 LEFNSAAPSDSVNAPLLKSGKVDTFYYVGLTGMSVGGQLLSIPPNLFQMDDSGYGGIIVD 371
Query: 329 SGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRSFLHLQALVVLP 384
SGT+ T L AY + + F S ++T+ L F+ CY L S Q+ V +P
Sbjct: 372 SGTAITRLQTQAYNTLRDAFVSRTPYLKKTNGFAL-FDTCYDLSS----QSRVTIP 422
>gi|359473000|ref|XP_002278677.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 458
Score = 74.7 bits (182), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 78/274 (28%), Positives = 117/274 (42%), Gaps = 46/274 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ ++ +G P ++ DTGSDL W+ C C +C S+ +SPN
Sbjct: 89 YFVDLRLGTPPQKLLLVADTGSDLVWVKCSACRNCTRHTPGSAFLARHSTTFSPNH---- 144
Query: 164 SKVPCNSTLCEL-----QKQCPSA--GSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
C + C+L +C A S C Y+ Y DG+ ++GF ++ L T +
Sbjct: 145 ----CYDSACQLVPLPKHHRCNHARLHSPCRYEYSY-GDGSKTSGFFSKETTTLNTSSGR 199
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPN---GLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
+ I+FGC +G + GA+ N G+ GLG S+ S L ++ N FS C
Sbjct: 200 EAKLKG-IAFGCAFRISGPSVSGASFNGAHGVMGLGRGPISLSSQLGHR--FGNKFSYCL 256
Query: 274 -----GSDGTGRISFG---DKGSPGQGE---TPFSLRQTHPT-YNITITQVSVGGNAVNF 321
T + G + +PG+ TP + PT Y I I VSV G +
Sbjct: 257 MDHDISPSPTSYLLIGSTQNDVAPGKRRMRFTPLHINPLSPTFYYIGIESVSVDGIKLPI 316
Query: 322 EFSA-----------IFDSGTSFTYLNDPAYTQI 344
S I DSGT+ T+L +PAY QI
Sbjct: 317 NPSVWALDELGNGGTIVDSGTTLTFLPEPAYLQI 350
>gi|118484458|gb|ABK94105.1| unknown [Populus trichocarpa]
Length = 499
Score = 74.7 bits (182), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 85/297 (28%), Positives = 118/297 (39%), Gaps = 45/297 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++T V VG PA F + LDTGSD+ WL C C C + I+ P SST
Sbjct: 161 YFTRVGVGNPARQFYMVLDTGSDINWLQCQPCTDCYQQTDP---------IFDPTASSTY 211
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
+ V C S C + C YQV Y DG+ + G + + S SV +
Sbjct: 212 APVTCQSQQCSSLEMSSCRSGQCLYQVNY-GDGSYTFGDFATESVSFG----NSGSVKN- 265
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGTGR 280
++ GCG G F+ A + P L NQ L SFS C S G+
Sbjct: 266 VALGCGHDNEGLFVGAAG-------LLGLGGGPLSLTNQ-LKATSFSYCLVNRDSAGSST 317
Query: 281 ISFGDK--GSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA-----------IF 327
+ F G R+ Y + ++ +SVGG V+ S I
Sbjct: 318 LDFNSAQLGVDSVTAPLMKNRKIDTFYYVGLSGMSVGGQMVSIPESTFRLDESGNGGIIV 377
Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRSFLHLQALVVLP 384
D GT+ T L AY + + F + + + TS L F+ CY L QA V +P
Sbjct: 378 DCGTAITRLQTQAYNPLRDAFVRMTQNLKLTSAVAL-FDTCYDLSG----QASVRVP 429
>gi|148907930|gb|ABR17085.1| unknown [Picea sitchensis]
Length = 498
Score = 74.7 bits (182), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 82/286 (28%), Positives = 122/286 (42%), Gaps = 41/286 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++T + VG P + LDTGSD+ W+ C+ C C + I++P+ S++
Sbjct: 157 YFTRIGVGTPTREQYMVLDTGSDVAWIQCEPCRECYSQADP---------IFNPSYSASF 207
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
S V C+S +C C Y+ Y DG+ STG + L T +
Sbjct: 208 STVGCDSAVCSQLDAYDCHSGGCLYEASY-GDGSYSTGSFATETLTFGTTSV------AN 260
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGTGR 280
++ GCG G F+ A L GLG S P+ + Q ++FS C SD +G
Sbjct: 261 VAIGCGHKNVGLFIGAAG---LLGLGAGALSFPNQIGTQ--TGHTFSYCLVDRESDSSGP 315
Query: 281 ISFGDKGSP-GQGETPFSLRQTHPT-YNITITQVSVGGNAVN------FEFSA------- 325
+ FG K P G TP PT Y +++T +SVGG ++ F
Sbjct: 316 LQFGPKSVPVGSIFTPLEKNPHLPTFYYLSVTAISVGGALLDSIPPEVFRIDETSGHGGF 375
Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVL 371
I DSGT T L AY + + F + + T + F+ CY L
Sbjct: 376 IIDSGTVVTRLVTSAYDAVRDAFVAGTGQLPRTDAVSI-FDTCYDL 420
>gi|255543383|ref|XP_002512754.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223547765|gb|EEF49257.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 414
Score = 74.3 bits (181), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 82/273 (30%), Positives = 120/273 (43%), Gaps = 46/273 (16%)
Query: 97 RLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIY 155
RL +L ++ V +G ++ IV DTGSDL W+ C C C + + ++
Sbjct: 61 RLQTLNYI--VTVEIGGRNMTVIV--DTGSDLTWVQCQPCRLCYNQQDP---------LF 107
Query: 156 SPNTSSTSSKVPCNSTLCE-LQKQ------CPSAGSNCPYQVRYLSDGTMSTGFLVEDVL 208
+P+ S + + CNS+ C+ LQ C S C Y V Y DG+ + G L + L
Sbjct: 108 NPSGSPSYQTILCNSSTCQSLQYATGNLGVCGSNTPTCNYVVNY-GDGSYTRGDLGMEQL 166
Query: 209 HLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNS 268
+L T S FGCGR G F +GL GLG K+ + + +
Sbjct: 167 NLGTTHV------SNFIFGCGRNNKGLF---GGASGLMGLG--KSDLSLVSQTSAIFEGV 215
Query: 269 FSMCF---GSDGTGRISFGDKGSPGQGETPFSLRQT-----HPT-YNITITQVSVGGNAV 319
FS C +D +G + G S + TP S + PT Y + +T +S+GG A+
Sbjct: 216 FSYCLPTTAADASGSLILGGNSSVYKNTTPISYTRMIANPQLPTFYFLNLTGISIGGVAL 275
Query: 320 ---NFEFSAIF-DSGTSFTYLNDPAYTQISETF 348
N+ S I DSGT T L P Y + F
Sbjct: 276 QAPNYRQSGILIDSGTVITRLPPPVYRDLKAEF 308
>gi|255566008|ref|XP_002523992.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536719|gb|EEF38360.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 74.3 bits (181), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 73/253 (28%), Positives = 108/253 (42%), Gaps = 32/253 (12%)
Query: 108 NVSVGQPALSFIVALDTGSDLFWLPCDCVS-CVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
+S+G P +V + TGSDL W+PC C H D + P SST V
Sbjct: 101 KISIGIPPTELLVNVATGSDLVWIPCLSFKPCTHN--------CDLRFFDPMESSTYKNV 152
Query: 167 PCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISF 226
PC+S C++ S+C Y + G L D L L + +S + F
Sbjct: 153 PCDSYRCQITNAATCQFSDCFYSCDPRHQDSCPDGDLAMDTLTLNSTTGKSFML-PNTGF 211
Query: 227 GCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDGTGRISF 283
CG G + G+ GLG S+ + +++ LI FS C + S+ T ++SF
Sbjct: 212 ICGNRIGGDY----PGVGILGLGHGSLSLLNRISH--LIDGKFSHCIVPYSSNQTSKLSF 265
Query: 284 GDKGSPGQGETPFSLR---------QTHPTYNITI--TQVSVGGNAVNFEFSAI-FDSGT 331
GDK G FS R T Y I++ +S GG ++ + + DSGT
Sbjct: 266 GDKAVV-SGSAMFSTRLDMTGGPYSYTLSFYGISVGNKSISAGGIGSDYYMNGLGMDSGT 324
Query: 332 SFTYLNDPAYTQI 344
FTY + Y+Q+
Sbjct: 325 MFTYFPEYFYSQL 337
>gi|77553049|gb|ABA95845.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
Length = 372
Score = 74.3 bits (181), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 80/290 (27%), Positives = 123/290 (42%), Gaps = 41/290 (14%)
Query: 92 GNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVI 150
G+D+ R N ++ +S+G P + +V +DTGS L W+ C +C + + +GQ
Sbjct: 16 GDDSMRKNK----YFMGISLGTPPVFNLVTIDTGSTLSWVQCKNCQIKCYDQAAKAGQ-- 69
Query: 151 DFNIYSPNTSSTSSKVPCNSTLCE-------LQKQCPSAGSNCPYQVRYLSDGTMSTGFL 203
I++P SST SKV C++ C ++ C C Y +RY S G S G+L
Sbjct: 70 ---IFNPYNSSTYSKVGCSTEACNGMHMDLAVEYGCVEEDDTCIYSLRYGS-GEYSVGYL 125
Query: 204 VEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQG 263
+D L LA++ +S+D+ I FGCG L G+ G G S + + Q
Sbjct: 126 GKDRLTLASN----RSIDNFI-FGCGEDN----LYNGVNAGIIGFGTKSYSFFNQVCQQT 176
Query: 264 LIPNSFSMCFGSD--GTGRISFGDKGSP-GQGETPFSLRQTHPTYNITITQVSVGGNAVN 320
+FS CF D G ++ G T P Y I Q+ + N +
Sbjct: 177 DY-TAFSYCFPRDHENEGSLTIGPYARDINLMWTKLIYYDHKPAY--AIQQLDMMVNGIR 233
Query: 321 FEFS--------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSD 362
E I DSGT+ TY+ P + + + + K T D
Sbjct: 234 LEIDPYIYISKMTIVDSGTADTYILSPVFDALDKAMTKEMQAKGYTRGWD 283
>gi|225436202|ref|XP_002271145.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 440
Score = 74.3 bits (181), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 95/363 (26%), Positives = 138/363 (38%), Gaps = 56/363 (15%)
Query: 28 GTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHR-DRYFRLRGRGLAAQGNDKTP 86
G F D HR D PK + A R DR+FR A + TP
Sbjct: 33 GRFSIDLIHR------------DSPKSPLYNPSETPAERLDRFFRRFMSFSEASISPNTP 80
Query: 87 LT-FSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNS 144
S+ N Y + +S+G P DTGSDL W C C+SC N
Sbjct: 81 EPPVSSNNGEYLMK---------ISIGTPPFDVYGIYDTGSDLMWTQCLPCLSCYKQKNP 131
Query: 145 SSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQK--QCPSAGSNCPYQVRYLSDGTMSTGF 202
++ P+ S++ +V C S C L C C + Y DG+++ G
Sbjct: 132 ---------MFDPSKSTSFKEVSCESQQCRLLDTVSCSQPQKLCDFSYGY-GDGSLAQGV 181
Query: 203 LVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQ 262
+ + L L ++ Q S+ I FGCG +G+F + GLFG G S+ S + +
Sbjct: 182 IATETLTLNSNSGQPTSI-LNIVFGCGHNNSGTFNENEM--GLFGTGGRPLSLTSQIMST 238
Query: 263 GLIPNSFSMC---FGSDG--TGRISFGDKGSPGQGE---TPFSLRQTHPTYNITITQVSV 314
FS C F +D T +I FG + + TP + Y +T+ +SV
Sbjct: 239 LGSGRKFSQCLVPFRTDPSITSKIIFGPEAEVSGSDVVSTPLVTKDDPTYYFVTLDGISV 298
Query: 315 GGNAVNFEFSA--------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFE 366
G F S+ D+GT T L Y ++ + A DL +
Sbjct: 299 GDKLFPFSSSSPMATKGNVFIDAGTPPTLLPRDFYNRLVQGVKE-AIPMEPVQDPDLQPQ 357
Query: 367 YCY 369
CY
Sbjct: 358 LCY 360
>gi|326506192|dbj|BAJ86414.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326516358|dbj|BAJ92334.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 492
Score = 74.3 bits (181), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 82/295 (27%), Positives = 120/295 (40%), Gaps = 48/295 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++T + VG PA ++ LDTGSD+ WL C C C SGQV D P S +
Sbjct: 140 YFTKIGVGTPATPALMVLDTGSDVVWLQCAPCRRCYE----QSGQVFD-----PRRSRSY 190
Query: 164 SKVPCNSTLCELQKQ--CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
+ V C + LC C S C YQV Y DG+++ G + L A +
Sbjct: 191 NAVGCAAPLCRRLDSGGCDLRRSACLYQVAY-GDGSVTAGDFATETLTFAGGARV----- 244
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-------- 273
+R++ GCG G F+ A GL S P+ ++ + SFS C
Sbjct: 245 ARVALGCGHDNEGLFVAAAGLLGLG---RGSLSFPTQISRR--YGRSFSYCLVDRTSSAN 299
Query: 274 GSDGTGRISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGGNAV---------- 319
+ + ++FG + F+ +P Y + + +SVGG V
Sbjct: 300 TASRSSTVTFGSGAVGSTVASSFTPMVKNPRMETFYYVQLIGISVGGARVPGVANSDLRL 359
Query: 320 ---NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVL 371
+ I DSGTS T L PAY+ + + F A R + F+ CY L
Sbjct: 360 DPSSGRGGVIVDSGTSVTRLARPAYSALRDAFRGAAAGLRLSPGGFSLFDTCYDL 414
>gi|225463766|ref|XP_002267930.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 479
Score = 74.3 bits (181), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 93/378 (24%), Positives = 149/378 (39%), Gaps = 65/378 (17%)
Query: 46 LAVDDLPKKGSFAYYSALAHRDRY---------FRLRGRGLAAQGNDKTPLTF---SAGN 93
L V + ++G + + HRD+ RL GR L L S G
Sbjct: 59 LEVSEDHEEGGEKWMMKVVHRDQLSFGNSDDHRHRLDGR-LKRDAKRVASLIRRLSSGGG 117
Query: 94 DTYRLNSLGF-----------LHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHG 141
+YR++ G ++ + VG P S + +D+GSD+ W+ C C C H
Sbjct: 118 GSYRVDDFGTDVISGMEQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCTQCYHQ 177
Query: 142 LNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTG 201
+ ++ P S++ + V C+S++C+ + C Y+V Y DG+ + G
Sbjct: 178 SDP---------VFDPADSASFTGVSCSSSVCDRLENAGCHAGRCRYEVSY-GDGSYTKG 227
Query: 202 FLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILAN 261
L + L +++ ++ GCG G F+ A GL G M S L
Sbjct: 228 TLALETLTFG------RTMVRSVAIGCGHRNRGMFVGAAGLLGLGGGSM---SFVGQLGG 278
Query: 262 QGLIPNSFSMCF---GSDGTGRISFGDKGSP-GQGETPFSLRQTHPT-YNITITQVSVGG 316
Q +FS C G+D +G + FG + P G P P+ Y I + + VGG
Sbjct: 279 Q--TGGAFSYCLVSRGTDSSGSLVFGREALPAGAAWVPLVRNPRAPSFYYIGLAGLGVGG 336
Query: 317 NAVNF-----------EFSAIFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLP 364
V + + D+GT+ T L AY + F A R T +
Sbjct: 337 IRVPISEEVFRLTELGDGGVVMDTGTAVTRLPTLAYQAFRDAFLAQTANLPRATGVA--I 394
Query: 365 FEYCYVLRSFLHLQALVV 382
F+ CY L F+ ++ V
Sbjct: 395 FDTCYDLLGFVSVRVPTV 412
>gi|328875414|gb|EGG23778.1| putative aspartyl protease [Dictyostelium fasciculatum]
Length = 507
Score = 74.3 bits (181), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 80/268 (29%), Positives = 127/268 (47%), Gaps = 54/268 (20%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
T + VG +F+V +DTGS L +P + C +CV +Y P SSTS+K
Sbjct: 124 TQIIVGNT--TFLVQVDTGSLLMAIPLEGCNTCVESR----------PVYHP--SSTSTK 169
Query: 166 VPCNSTLCELQKQCP------SAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
V C+S C+ P S+G +C +Q+RY DG+ +G++ EDV++LA
Sbjct: 170 VACSSDQCKGSGSTPPSCSRTSSGESCDFQIRY-GDGSHVSGYIYEDVVNLA-------G 221
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTS-VP----SILANQGLIPNSFSMCFG 274
+ + +FG +TG F + +G+ G G +S VP S++++ GL N F M
Sbjct: 222 LQGKANFGANDEETGDF-EYPRADGIIGFGRTCSSCVPTVWDSLVSDLGL-KNQFGMLLN 279
Query: 275 SDGTGRISFGDKGSP-----------GQGETPF-SLRQTHPTYNITITQVSVGGNAVNFE 322
+G G +S G+ + Q TPF S++ T I I ++ G+ + E
Sbjct: 280 YEGGGSLSLGEINTSYYTGDIRYTPLVQKNTPFYSVKST----GIRINDYTIPGSKLGQE 335
Query: 323 FSAIFDSGTSFTYLNDPAYTQISETFNS 350
I DSG++ L AY Q+ F +
Sbjct: 336 --VIVDSGSTALSLASGAYDQLRNYFQT 361
>gi|449451908|ref|XP_004143702.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
gi|449529900|ref|XP_004171936.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 459
Score = 74.3 bits (181), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 78/284 (27%), Positives = 116/284 (40%), Gaps = 47/284 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ ++ +G P S ++ DTGSDL W+ C C +C H SS+ + P SS+
Sbjct: 88 YFVDIRLGTPPQSLLLVADTGSDLVWVKCSACRNCSHHPPSSA--------FLPRHSSSF 139
Query: 164 SKVPCNSTLCELQKQCPSAGSN-------CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
S C C L P N C + Y +DG++S+GF ++ L +
Sbjct: 140 SPFHCFDPHCRLLPHAPHHLCNHTRLHSPCRFLYSY-ADGSLSSGFFSKETTTLKSLSGS 198
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPN---GLFGLGMDKTSVPSILANQGLIPNSFSMC- 272
+ +SFGCG +G + GA N G+ GLG S S L + N FS C
Sbjct: 199 EIHLKG-LSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFSSQLGRR--FGNKFSYCL 255
Query: 273 -----------FGSDGTGRISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGG---- 316
F G G S + TP + PT Y ITI +++ G
Sbjct: 256 MDYTLSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFYYITIHSITIDGVKLP 315
Query: 317 -NAVNFEFS------AIFDSGTSFTYLNDPAYTQISETFNSLAK 353
N +E + DSGT+ TYL AY ++ ++ K
Sbjct: 316 INPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVK 359
>gi|357515001|ref|XP_003627789.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355521811|gb|AET02265.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 415
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 79/278 (28%), Positives = 113/278 (40%), Gaps = 52/278 (18%)
Query: 110 SVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPC 168
S+G P +DTGSDL WL C+ C C + I+ P+ SS+ +PC
Sbjct: 93 SIGTPPFKVFGFVDTGSDLVWLQCEPCKQCYPQITP---------IFDPSLSSSYQNIPC 143
Query: 169 NSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGC 228
S C + ++C VR G+L + L L + S S + GC
Sbjct: 144 LSDTCHSMRT-----TSC--DVR---------GYLSVETLTLDSTTGYSVSF-PKTMIGC 186
Query: 229 GRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG---SDGTGRISFGD 285
G TG+F +G+ GLG S+PS L I FS C G + T +++FGD
Sbjct: 187 GYRNTGTF--HGPSSGIVGLGSGPMSLPSQLGTS--IGGKFSYCLGPWLPNSTSKLNFGD 242
Query: 286 KG---SPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF--------EFSAIFDSGTSFT 334
G TP + Y +T+ SVG + F E + + DSGT+FT
Sbjct: 243 AAIVYGDGAMTTPIVKKDAQSGYYLTLEAFSVGNKLIEFGGPTYGGNEGNILIDSGTTFT 302
Query: 335 YLNDPAYTQISETFNSLAKEKRETSTSDLP---FEYCY 369
+L Y + F S E + P F+ CY
Sbjct: 303 FLPYDVYYR----FESAVAEYINLEHVEDPNGTFKLCY 336
>gi|225430551|ref|XP_002283470.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 490
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 80/285 (28%), Positives = 117/285 (41%), Gaps = 33/285 (11%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ V +G P DTGSDL W C+ CV + +I+ P+TS +
Sbjct: 147 YVVTVGLGSPKRDLTFIFDTGSDLTWTQCEPCVGYCYQQRE--------HIFDPSTSLSY 198
Query: 164 SKVPCNSTLCELQKQCPS-----AGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
S V C+S CE + + S C Y +RY DG+ S GF + L L S
Sbjct: 199 SNVSCDSPSCEKLESATGNSPGCSSSTCLYGIRY-GDGSYSIGFFAREKLSLT-----ST 252
Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT 278
V + FGCG+ G F A GL GL + S+ S A + S+ + S T
Sbjct: 253 DVFNNFQFGCGQNNRGLFGGTA---GLLGLARNPLSLVSQTAQKYGKVFSYCLPSSSSST 309
Query: 279 GRISF--GDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA------IFDS 329
G +SF GD S TP + +P+ Y + + +SVG + S I DS
Sbjct: 310 GYLSFGSGDGDSKAVKFTPSEVNSDYPSFYFLDMVGISVGERKLPIPKSVFSTAGTIIDS 369
Query: 330 GTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRSF 374
GT + L Y+ + + F L + + + CY L +
Sbjct: 370 GTVISRLPPTVYSSVQKVFRELMSDYPRVKGVSI-LDTCYDLSKY 413
>gi|225430555|ref|XP_002285593.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 481
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 79/283 (27%), Positives = 124/283 (43%), Gaps = 39/283 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVS-CVHGLNSSSGQVIDFNIYSPNTSST 162
+ V +G P DTGSDL W C+ C C H I++P+ S++
Sbjct: 138 YVVTVGLGTPKRDLTFIFDTGSDLTWTQCEPCARYCYHQQEP---------IFNPSKSTS 188
Query: 163 SSKVPCNSTLCELQK----QCPS-AGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
+ + C+S C+ K PS + S C Y ++Y D + S GF +D L L + +
Sbjct: 189 YTNISCSSPTCDELKSGTGNSPSCSASTCVYGIQY-GDQSYSVGFFAQDKLALTSTD--- 244
Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--GS 275
V + FGCG+ G F+ A GL GLG + S+ S A + FS C S
Sbjct: 245 --VFNNFLFGCGQNNRGLFVGVA---GLIGLGRNALSLVSQTAQK--YGKLFSYCLPSTS 297
Query: 276 DGTGRISFGDKGSPGQGE--TPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA------I 326
TG ++FG G + TP + P+ Y + + +SVGG ++ S I
Sbjct: 298 SSTGYLTFGSGGGTSKAVKFTPSLVNSQGPSFYFLNLIAISVGGRKLSTSASVFSTAGTI 357
Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
DSGT + L AY+ + +F + + + + + + CY
Sbjct: 358 IDSGTVISRLPPTAYSDLRASFQQQMSKYPKAAPASI-LDTCY 399
>gi|357514989|ref|XP_003627783.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521805|gb|AET02259.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 416
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 80/280 (28%), Positives = 114/280 (40%), Gaps = 52/280 (18%)
Query: 110 SVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCN 169
SVG P DTGSD+ WL C+ C N ++ + + P+ SST +PC+
Sbjct: 92 SVGTPPFKLYGIADTGSDIVWLQCE--PCKECYNQTTPK------FKPSKSSTYKNIPCS 143
Query: 170 STLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCG 229
S LC+ +Q G L D L L + S + GCG
Sbjct: 144 SDLCKSGQQ----------------------GNLSVDTLTLESSTGHPISFPKTV-IGCG 180
Query: 230 RVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-----GSDGTGRISFG 284
T SF +GA+ +G+ GLG S+ + L + I FS C S+ T +++FG
Sbjct: 181 TDNTVSF-EGAS-SGIVGLGGGPASLITQLGSS--IDAKFSYCLLPNPVESNTTSKLNFG 236
Query: 285 DKG---SPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA--------IFDSGTSF 333
D G TP + Y +T+ SVG + FE S+ I DSGT+
Sbjct: 237 DTAVVSGDGVVSTPIVKKDPIVFYYLTLEAFSVGNKRIEFEGSSNGGHEGNIIIDSGTTL 296
Query: 334 TYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRS 373
T + Y + L K KR + L F CY + S
Sbjct: 297 TVIPTDVYNNLESAVLELVKLKRVNDPTRL-FNLCYSVTS 335
>gi|359496797|ref|XP_002277380.2| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
vinifera]
Length = 358
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 74/259 (28%), Positives = 113/259 (43%), Gaps = 36/259 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+Y V G PA + + +DTGS L WL C CV H V ++ P+ S T
Sbjct: 118 YYVKVGFGSPARYYSMIVDTGSSLSWLQCKPCVVYCH--------VQADPLFDPSASKTY 169
Query: 164 SKVPCNSTLCEL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
+ C S+ C C ++ + C Y Y D + S G+L +D+L LA +
Sbjct: 170 KSLSCTSSQCSSLVDATLNNPLCETSSNVCVYTASY-GDSSYSMGYLSQDLLTLAPSQTL 228
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD 276
V +GCG+ G F A G+ GLG +K S+ ++++ +FS C +
Sbjct: 229 PGFV-----YGCGQDSDGLFGRAA---GILGLGRNKLSMLGQVSSK--FGYAFSYCLPTR 278
Query: 277 GTGR-ISFGDKGSPGQGE--TPFSLRQTHPT-YNITITQVSVGGNAV-----NFEFSAIF 327
G G +S G G TP + +P+ Y + +T ++VGG A+ + I
Sbjct: 279 GGGGFLSIGKASLAGSAYKFTPMTTDPGNPSLYFLRLTAITVGGRALGVAAAQYRVPTII 338
Query: 328 DSGTSFTYLNDPAYTQISE 346
DSGT T L YT +
Sbjct: 339 DSGTVITRLPMSVYTPFQQ 357
>gi|88174567|gb|ABD39358.1| chloroplast nucleoid DNA-binding protein [Oryza glumipatula]
Length = 323
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 77/293 (26%), Positives = 126/293 (43%), Gaps = 46/293 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+ +V +G PA + IV +DTGS W+ C+C C H + + + S+T +
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCECDGC-H---------TNPRTFLQSRSTTCA 50
Query: 165 KVPCNSTLCELQKQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
KV C +++C L P +CP++V Y DG+ S G L +D L + +K
Sbjct: 51 KVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSY-QDGSASYGILYQDTLTFSDVQKIPG- 108
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC------- 272
+FGC G+ G +GL G+G + SV L + FS C
Sbjct: 109 ----FTFGCNMDSFGANEFGNV-DGLLGMGAGQMSV---LKQSSPTFDGFSYCLPLQMSE 160
Query: 273 --FGSDGTGRISFGDKGSPGQGETPF----SLRQTHPTYNITITQVSVGGNAVNFEFS-- 324
F S TG S G K + + + + + R+ + + +T +SV G + S
Sbjct: 161 RGFFSKTTGYFSLGGKIAATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSIF 220
Query: 325 ----AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRS 373
+FDSG+ +Y+ D A + +S+ L R + + CY +RS
Sbjct: 221 SRKGVVFDSGSELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRS 271
>gi|147811402|emb|CAN61225.1| hypothetical protein VITISV_006732 [Vitis vinifera]
Length = 440
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 95/363 (26%), Positives = 137/363 (37%), Gaps = 56/363 (15%)
Query: 28 GTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHR-DRYFRLRGRGLAAQGNDKTP 86
G F D HR D PK + A R DR+FR A + TP
Sbjct: 33 GRFSIDLIHR------------DSPKSPLYNPSETPAERLDRFFRRFMSFSEASISPNTP 80
Query: 87 LT-FSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNS 144
S+ N Y + +S+G P DTGSDL W C C+SC N
Sbjct: 81 EPPVSSNNGEYLMK---------ISIGTPPFDVYGIYDTGSDLMWTQCLPCLSCYKQKNP 131
Query: 145 SSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQK--QCPSAGSNCPYQVRYLSDGTMSTGF 202
++ P+ S++ +V C S C L C C + Y DG+++ G
Sbjct: 132 ---------MFDPSKSTSFKEVSCESQQCRLLDTVSCSQPQKLCDFSYGY-GDGSLAQGV 181
Query: 203 LVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQ 262
+ + L L ++ Q S+ I FGCG +G+F + GLFG G S+ S + +
Sbjct: 182 IATETLTLNSNSGQPXSI-XNIVFGCGHNNSGTFNENEM--GLFGTGGRPLSLTSQIMST 238
Query: 263 GLIPNSFSMC---FGSDG--TGRISFGDKGSPGQG---ETPFSLRQTHPTYNITITQVSV 314
FS C F +D T +I FG + TP + Y +T+ +SV
Sbjct: 239 LGSGRKFSQCLVPFRTDPSITSKIIFGPEAEVSGSXVVSTPLVTKDDPTYYFVTLDGISV 298
Query: 315 GGNAVNFEFSA--------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFE 366
G F S+ D+GT T L Y ++ + A DL +
Sbjct: 299 GDKLFPFSSSSPMATKGNVFIDAGTPPTLLPRDFYNRLVQGVKE-AIPMEPVQDPDLQPQ 357
Query: 367 YCY 369
CY
Sbjct: 358 LCY 360
>gi|308813706|ref|XP_003084159.1| Aspartyl protease (ISS) [Ostreococcus tauri]
gi|116056042|emb|CAL58575.1| Aspartyl protease (ISS) [Ostreococcus tauri]
Length = 478
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 74/255 (29%), Positives = 120/255 (47%), Gaps = 35/255 (13%)
Query: 115 ALSFIVALDTGSDLFWLPCD-CVSC-VHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTL 172
A +F + +DTGS +LPC C SC H +G+ D++ S+ S+V C S
Sbjct: 44 AQTFELIVDTGSSRTYLPCKGCASCGAH----EAGRYYDYD-----ASADFSRVEC-SAC 93
Query: 173 CELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQ 232
+ +C ++G C Y V YL +G+ S G+LV DV+ L ++ + FGC +
Sbjct: 94 AGIGGKCGTSGV-CRYDVHYL-EGSGSEGYLVRDVVSLG-----GSVGNATVVFGCEERE 146
Query: 233 TGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-------GSDGTGRISFGD 285
GS +A +GLFG G ++ + LA+ +I + FSMC G G ++ G+
Sbjct: 147 LGSIKQQSA-DGLFGFGRQAYALRAQLASASVIDDLFSMCVEGYEKLSGEHVGGLLTLGN 205
Query: 286 ----KGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA--IFDSGTSFTYLNDP 339
+P TP + + Y +T T ++G + V I DSGTS+TY+
Sbjct: 206 FDFGADAPALVYTP--MVSSAMYYQVTTTSWTLGNSVVEGSRGVLTIIDSGTSYTYVPGN 263
Query: 340 AYTQISETFNSLAKE 354
+ + + A+E
Sbjct: 264 MHARFLQLAEDAARE 278
>gi|115465373|ref|NP_001056286.1| Os05g0557100 [Oryza sativa Japonica Group]
gi|113579837|dbj|BAF18200.1| Os05g0557100 [Oryza sativa Japonica Group]
gi|125553268|gb|EAY98977.1| hypothetical protein OsI_20935 [Oryza sativa Indica Group]
Length = 494
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 83/310 (26%), Positives = 122/310 (39%), Gaps = 56/310 (18%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVH------GLNSSSGQVIDFNIYSP 157
++ VG PA F++ DTGSDL W+ C S H + S V ++ P
Sbjct: 110 YFVRFRVGTPAQPFVLIADTGSDLTWVKCRGAASPSHATATASPAAAPSPAVAPPRVFRP 169
Query: 158 NTSSTSSKVPCNSTLCELQ-----KQCPSAGSNCPYQVRYLSDGTMSTGFLVED--VLHL 210
S T S +PC+S C+ C S+ + C Y RY +D + + G + D + L
Sbjct: 170 GDSKTWSPIPCSSETCKSTIPFSLANCSSSTAACSYDYRY-NDNSAARGVVGTDSATVAL 228
Query: 211 ATDEKQSKSVDSR-----ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLI 265
+ D + + GC G + A +G+ LG S S A++
Sbjct: 229 SGGRGGGGGGDRKAKLQGVVLGCTTAHAGQGFE--ASDGVLSLGYSNISFASRAASR--F 284
Query: 266 PNSFSMCF-----GSDGTGRISFG------DKGSPGQG-ETPFSL-RQTHPTYNITITQV 312
FS C + T ++FG +P G TP L + P Y + + V
Sbjct: 285 GGRFSYCLVDHLAPRNATSYLTFGAGPDAASSSAPAPGSRTPLLLDARVRPFYAVAVDSV 344
Query: 313 SVGGNAVNFEFSA---------IFDSGTSFTYLNDPAYTQI----SETFNSLAKEKRETS 359
SV G A++ I DSGTS T L PAY + SE L + +
Sbjct: 345 SVDGVALDIPAEVWDVGSNGGTIIDSGTSLTVLATPAYKAVVAALSEQLAGLPRVAMD-- 402
Query: 360 TSDLPFEYCY 369
PF+YCY
Sbjct: 403 ----PFDYCY 408
>gi|302756591|ref|XP_002961719.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
gi|300170378|gb|EFJ36979.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
Length = 357
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 88/306 (28%), Positives = 131/306 (42%), Gaps = 53/306 (17%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ + +G P S+ + LDTGSD+ W+ C C SC ++ IY P+ SS+
Sbjct: 12 YFARMGIGNPQRSYYLELDTGSDVTWIQCAPCSSCYSQVDP---------IYDPSNSSSY 62
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
+V C S LC+ G C Y+V Y D + S+G L + +L + S +
Sbjct: 63 RRVYCGSALCQALDYSACQGMGCSYRVVY-GDSSASSGDLGIESFYLGPN---SSTAMRN 118
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRIS- 282
I+FGCG +G F A G+ G + S I A+ G +FS C R S
Sbjct: 119 IAFGCGHSNSGLFRGEAGLLGMGGGTLSFFS--QIAASIG---PAFSYCL----VDRYSQ 169
Query: 283 FGDKGSP---GQGETPFSLRQT----HPTYNI----TITQVSVGGNAV-----------N 320
+ SP G+ PF+ R T +P N +T +SVGG + N
Sbjct: 170 LQSRSSPLIFGRTAIPFAARFTPLLKNPRINTFYYAVLTGISVGGTPLPIPPAQFALTGN 229
Query: 321 FEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRSFLHLQAL 380
AI DSGTS T + PAY + + + R S + P Y+L + + Q L
Sbjct: 230 GTGGAILDSGTSVTRVVPPAYAVLRDAY-------RAASRNLPPAPGVYLLDTCFNFQGL 282
Query: 381 VVLPFP 386
+ P
Sbjct: 283 PTVQIP 288
>gi|125558631|gb|EAZ04167.1| hypothetical protein OsI_26309 [Oryza sativa Indica Group]
Length = 441
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 71/277 (25%), Positives = 111/277 (40%), Gaps = 51/277 (18%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
+ ++++G P L LDTGSDL W CD C C +Y+P S+T
Sbjct: 92 YLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQ---------PAPLYAPARSAT 142
Query: 163 SSKVPCNSTLCE-LQ---KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
+ V C S +C+ LQ +C + C Y Y DGT + G L + L +D
Sbjct: 143 YANVSCRSPMCQALQSPWSRCSPPDTGCAYYFSY-GDGTSTDGVLATETFTLGSDTAVRG 201
Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS--- 275
++FGCG GS + +GL G+G S+ S L FS CF
Sbjct: 202 -----VAFGCGTENLGSTDNS---SGLVGMGRGPLSLVSQLGV-----TRFSYCFTPFNA 248
Query: 276 --------DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS--- 324
+ R+S K +P R+ Y +++ ++VG + + +
Sbjct: 249 TAASPLFLGSSARLSSAAKTTPFVPSPSGGARRRSSYYYLSLEGITVGDTLLPIDPAVFR 308
Query: 325 --------AIFDSGTSFTYLNDPAYTQISETFNSLAK 353
I DSGT+FT L + A+ ++ S +
Sbjct: 309 LTPMGDGGVIIDSGTTFTALEESAFVALARALASRVR 345
>gi|356516413|ref|XP_003526889.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 100/371 (26%), Positives = 145/371 (39%), Gaps = 65/371 (17%)
Query: 29 TFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAA----QGNDK 84
T GF R+ D K + ++ + + + R +L LAA D+
Sbjct: 44 TNGFRVMLRHVDSGKNLTKLERV-------QHGIKRGKSRLQKLNAMVLAASSTPDSEDQ 96
Query: 85 TPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLN 143
AGN Y + +++G P +S+ LDTGSDL W C C C
Sbjct: 97 LEAPIHAGNGEYLIE---------LAIGTPPVSYPAVLDTGSDLIWTQCKPCTRCYKQPT 147
Query: 144 SSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSA--GSNCPYQVRYLSDGTMSTG 201
I+ P SS+ SKV C S+LC PS+ C Y Y D +M+ G
Sbjct: 148 P---------IFDPKKSSSFSKVSCGSSLCS---ALPSSTCSDGCEYVYSY-GDYSMTQG 194
Query: 202 FLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILAN 261
L + + ++K I FGCG G + A+ GL GLG S+ S L
Sbjct: 195 VLATETFTFG--KSKNKVSVHNIGFGCGEDNEGDGFEQAS--GLVGLGRGPLSLVSQLKE 250
Query: 262 QGLIPNSFSMCFGSDGTGRISFGDKGSPGQGE-------TPFSLRQTHPT-YNITITQVS 313
Q FS C + S GS G+ + TP P+ Y +++ +S
Sbjct: 251 Q-----RFSYCLTPIDDTKESVLLLGSLGKVKDAKEVVTTPLLKNPLQPSFYYLSLEAIS 305
Query: 314 VGGNAVNFEFS-----------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSD 362
VG ++ E S I DSGT+ TY+ AY + + F S K + TS
Sbjct: 306 VGDTRLSIEKSTFEVGDDGNGGVIIDSGTTITYVQQKAYEALKKEFISQTKLALD-KTSS 364
Query: 363 LPFEYCYVLRS 373
+ C+ L S
Sbjct: 365 TGLDLCFSLPS 375
>gi|145693992|gb|ABP93696.1| unknown protein isoform 1 [Lemna minor]
Length = 350
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 83/277 (29%), Positives = 115/277 (41%), Gaps = 35/277 (12%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
+ V G P + V DTGS++ W+ C VSC ++ P SST
Sbjct: 16 YVITVGFGTPKKNQTVIFDTGSNVNWIQCKPCVVSCYPQQEP---------LFDPTLSST 66
Query: 163 SSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
+ C S C +GS C Y V Y DG+ + GFL + LA + +V +
Sbjct: 67 YRNISCTSAACTGLSSRGCSGSTCVYGVTY-GDGSSTVGFLATETFTLA-----AGNVFN 120
Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--GSDGTGR 280
FGCG+ G F GAA GL GLG S+ S LA + N FS C S TG
Sbjct: 121 NFIFGCGQNNQGLF-TGAA--GLIGLGRSPYSLNSQLATS--LGNIFSYCLPSTSSATGY 175
Query: 281 ISFGDK-GSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNF------EFSAIFDSGTS 332
++ G+ +PG T PT Y I + +SVGG + I DSGT
Sbjct: 176 LNIGNPLRTPGY--TAMLTNSRAPTLYFIDLIGISVGGTRLALSSTVFQSVGTIIDSGTV 233
Query: 333 FTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
T L AY + F + + + + + + CY
Sbjct: 234 ITRLPPTAYGALRTAFRAAMTQYTRAAAASI-LDTCY 269
>gi|115448349|ref|NP_001047954.1| Os02g0720600 [Oryza sativa Japonica Group]
gi|45735841|dbj|BAD12876.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
[Oryza sativa Japonica Group]
gi|45735967|dbj|BAD12996.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
[Oryza sativa Japonica Group]
gi|113537485|dbj|BAF09868.1| Os02g0720600 [Oryza sativa Japonica Group]
gi|125583492|gb|EAZ24423.1| hypothetical protein OsJ_08176 [Oryza sativa Japonica Group]
gi|215740989|dbj|BAG97484.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 463
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 85/275 (30%), Positives = 120/275 (43%), Gaps = 36/275 (13%)
Query: 99 NSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWL---PCDCVSCVHGLNSSSGQVIDFNI 154
+SL L Y +V +G PA++ V +DTGSD+ W+ PC C + +G + D
Sbjct: 120 SSLDTLEYVISVGLGTPAVTQTVTIDTGSDVSWVQCNPCPNPPCY----AQTGALFD--- 172
Query: 155 YSPNTSSTSSKVPCNSTLC-ELQKQ---CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL 210
P SST V C + C +L++Q C + C Y V+Y DG+ + G D L L
Sbjct: 173 --PAKSSTYRAVSCAAAECAQLEQQGNGCGATNYECQYGVQY-GDGSTTNGTYSRDTLTL 229
Query: 211 ATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFS 270
+ K FGC V++G F D +GL GLG S+ S A NSFS
Sbjct: 230 SGASDAVKG----FQFGCSHVESG-FSD--QTDGLMGLGGGAQSLVSQTA--AAYGNSFS 280
Query: 271 MCF----GSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVN-----F 321
C GS G + G S RQ Y + ++VGG + F
Sbjct: 281 YCLPPTSGSSGFLTLGGGGGVSGFVTTRMLRSRQIPTFYGARLQDIAVGGKQLGLSPSVF 340
Query: 322 EFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKR 356
++ DSGT T L AY+ +S F + K+ R
Sbjct: 341 AAGSVVDSGTIITRLPPTAYSALSSAFKAGMKQYR 375
>gi|359806832|ref|NP_001241567.1| uncharacterized protein LOC100819698 precursor [Glycine max]
gi|255638149|gb|ACU19388.1| unknown [Glycine max]
Length = 437
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 79/285 (27%), Positives = 119/285 (41%), Gaps = 45/285 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ +G PA + ++A+DT +D W+PC CV C ++P S+T
Sbjct: 98 YIVKAKIGTPAQTLLLAMDTSNDASWVPCTACVGCSTT-----------TPFAPAKSTTF 146
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGF-LVEDVLHLATDEKQSKSVDS 222
KV C ++ C+ + GS C + Y GT S LV+D + LATD +
Sbjct: 147 KKVGCGASQCKQVRNPTCDGSACAFNFTY---GTSSVAASLVQDTVTLATDPVPA----- 198
Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT---- 278
+FGC + TGS + GL + + Q L ++FS C S T
Sbjct: 199 -YAFGCIQKVTGSSVPPQGLLGLGRGPLSLLA-----QTQKLYQSTFSYCLPSFKTLNFS 252
Query: 279 GRISFGDKGSPGQGE-TPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA----------- 325
G + G P + + TP + Y + + + VG V+ A
Sbjct: 253 GSLRLGPVAQPKRIKFTPLLKNPRRSSLYYVNLVAIRVGRRIVDIPPEALAFNANTGAGT 312
Query: 326 IFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEYCY 369
+FDSGT FT L +PAY + F +A K+ T TS F+ CY
Sbjct: 313 VFDSGTVFTRLVEPAYNAVRNEFRRRIAVHKKLTVTSLGGFDTCY 357
>gi|225216960|gb|ACN85252.1| aspartic proteinase nepenthesin-1 precursor [Oryza officinalis]
Length = 519
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 80/260 (30%), Positives = 110/260 (42%), Gaps = 34/260 (13%)
Query: 99 NSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYS 156
+LG +Y V +G P + V DTGSD W+ C CV + ++
Sbjct: 173 RALGTGNYVVTVGLGTPVSRYTVVFDTGSDTTWVQCQPCVVVCYEQRE--------KLFD 224
Query: 157 PNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
P SST + V C + C +G +C Y V+Y DG+ S GF D L L++ +
Sbjct: 225 PARSSTYANVSCAAPACSDLNIHGCSGGHCLYGVQY-GDGSYSIGFFAMDTLTLSSYDAV 283
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--G 274
FGCG G F + A GL GLG KTS+P ++ F+ C
Sbjct: 284 KG-----FRFGCGERNEGLFGEAA---GLLGLGRGKTSLPVQTYDK--YGGVFAHCLPAR 333
Query: 275 SDGTGRISFGDKGSPGQG---ETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA----- 325
S GTG + FG TP L PT Y + +T + VGG ++ S
Sbjct: 334 STGTGYLDFGAGSLAAASARLTTPM-LTDNGPTFYYVGMTGIRVGGQLLSIPQSVFATAG 392
Query: 326 -IFDSGTSFTYLNDPAYTQI 344
I DSGT T L AY+ +
Sbjct: 393 TIVDSGTVITRLPPAAYSSL 412
>gi|115472517|ref|NP_001059857.1| Os07g0533600 [Oryza sativa Japonica Group]
gi|22831047|dbj|BAC15910.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50508280|dbj|BAD32129.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113611393|dbj|BAF21771.1| Os07g0533600 [Oryza sativa Japonica Group]
gi|215766673|dbj|BAG98901.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 441
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 71/277 (25%), Positives = 111/277 (40%), Gaps = 51/277 (18%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
+ ++++G P L LDTGSDL W CD C C +Y+P S+T
Sbjct: 92 YLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQ---------PAPLYAPARSAT 142
Query: 163 SSKVPCNSTLCE-LQ---KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
+ V C S +C+ LQ +C + C Y Y DGT + G L + L +D
Sbjct: 143 YANVSCRSPMCQALQSPWSRCSPPDTGCAYYFSY-GDGTSTDGVLATETFTLGSDTAVRG 201
Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS--- 275
++FGCG GS + +GL G+G S+ S L FS CF
Sbjct: 202 -----VAFGCGTENLGSTDNS---SGLVGMGRGPLSLVSQLGV-----TRFSYCFTPFNA 248
Query: 276 --------DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS--- 324
+ R+S K +P R+ Y +++ ++VG + + +
Sbjct: 249 TAASPLFLGSSARLSSAAKTTPFVPSPSGGARRRSSYYYLSLEGITVGDTLLPIDPAVFR 308
Query: 325 --------AIFDSGTSFTYLNDPAYTQISETFNSLAK 353
I DSGT+FT L + A+ ++ S +
Sbjct: 309 LTPMGDGGVIIDSGTTFTALEERAFVALARALASRVR 345
>gi|297834758|ref|XP_002885261.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297331101|gb|EFH61520.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 500
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 76/295 (25%), Positives = 119/295 (40%), Gaps = 40/295 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+++ + VG PA + LDTGSD+ W+ C+ C C + +++P +SST
Sbjct: 162 YFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCSDCYQQSDP---------VFNPTSSSTY 212
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
+ C++ C L + + C YQV Y DG+ + G L D + K +
Sbjct: 213 KSLTCSAPQCSLLETSACRSNKCLYQVSY-GDGSFTVGELATDTVTFGNSGKIND----- 266
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISF 283
++ GCG G F A + + NQ + SFS C +G+ S
Sbjct: 267 VALGCGHDNEGLFTGAAG-------LLGLGGGALSITNQ-MKATSFSYCLVDRDSGKSSS 318
Query: 284 GDKGS----PGQGETPFSLRQTHPT-YNITITQVSVGGNAVN-----FEFSA------IF 327
D S G P Q T Y + ++ SVGG V F+ A I
Sbjct: 319 LDFNSVQLGSGDATAPLLRNQKIDTFYYVGLSGFSVGGQKVMMPDAIFDVDASGSGGVIL 378
Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRSFLHLQALVV 382
D GT+ T L AY + + F L ++ ++S F+ CY S ++ V
Sbjct: 379 DCGTAVTRLQTQAYNSLRDAFLKLTTNLKKGTSSISLFDTCYDFSSLSSVKVPTV 433
>gi|47777372|gb|AAT38006.1| unknow protein [Oryza sativa Japonica Group]
Length = 475
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 84/295 (28%), Positives = 121/295 (41%), Gaps = 47/295 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
++ V VG PA + ++ LDTGSD+ WL C C H + SG+V D P S + +
Sbjct: 122 YFAQVGVGTPATTALMVLDTGSDVVWL--QCAPCRH-CYAQSGRVFD-----PRRSRSYA 173
Query: 165 KVPCNSTLCEL--QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
V C + +C C ++C YQV Y DG+++ G + L A +
Sbjct: 174 AVDCVAPICRRLDSAGCDRRRNSCLYQVAY-GDGSVTAGDFASETLTFARGARV-----Q 227
Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--------- 273
R++ GCG G F+ A +GL GLG + S PS +A SFS C
Sbjct: 228 RVAIGCGHDNEGLFI---AASGLLGLGRGRLSFPSQIARS--FGRSFSYCLVDRTSSVRP 282
Query: 274 GSDGTGRISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGG------------- 316
S + ++FG F+ +P Y + + SVGG
Sbjct: 283 SSTRSSTVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSDLRL 342
Query: 317 NAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVL 371
N I DSGTS T L P Y + + F + A R + F+ CY L
Sbjct: 343 NPTTGRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCYNL 397
>gi|356558300|ref|XP_003547445.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 447
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 78/283 (27%), Positives = 123/283 (43%), Gaps = 50/283 (17%)
Query: 108 NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSS-- 164
N+S+GQP + +V +DTGSD+ W+ C C +C + L ++ P+ SST S
Sbjct: 104 NISIGQPPIPQLVVMDTGSDILWVMCTPCTNCDNDLGL---------LFDPSKSSTFSPL 154
Query: 165 -KVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
K PC+ C P+ V Y + T S F + V+ TDE S+ D
Sbjct: 155 CKTPCDFEGCRCDP--------IPFTVTYADNSTASGTFGRDTVVFETTDEGTSRISD-- 204
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISF 283
+ FGCG G D NG+ GL S+ + L + FS C G+ ++
Sbjct: 205 VLFGCGH-NIGHDTD-PGHNGILGLNNGPDSLVTKLGQK------FSYCIGNLADPYYNY 256
Query: 284 GD----KGSPGQG-ETPFSLRQTHPTYNITITQVSVGGNAVN-----FEFS------AIF 327
+G+ +G TPF + Y +T+ +SVG ++ FE I
Sbjct: 257 HQLILGEGADLEGYSTPFEVYNGF--YYVTMEGISVGEKRLDIAPETFEMKENRAGGVII 314
Query: 328 DSGTSFTYLNDPAYTQIS-ETFNSLAKEKRETSTSDLPFEYCY 369
D+G++ T+L D + +S E N L R+ + P+ C+
Sbjct: 315 DTGSTITFLVDSVHKLLSKEVRNLLGWSFRQATIEKSPWMQCF 357
>gi|357448247|ref|XP_003594399.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355483447|gb|AES64650.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 452
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 76/297 (25%), Positives = 124/297 (41%), Gaps = 59/297 (19%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+Y + +G P + + +DTGS WL C C H + + +++P+ S T
Sbjct: 103 YYVKMGLGSPTKYYTMIVDTGSSFSWLQCQPCTIYCH--------IQEDPVFNPSASKTY 154
Query: 164 SKVPCN---------STLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
VPC+ +TL E C + C Y+ Y D + S G+L +DVL L +
Sbjct: 155 KTVPCSSSQCSSLKSATLNE--PTCSKQSNACVYKASY-GDSSFSLGYLSQDVLTLTPSQ 211
Query: 215 KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG 274
S V +GCG+ G F +G+ GL ++ S+ S L+ G N+FS C
Sbjct: 212 TLSSFV-----YGCGQDNQGLF---GRTDGIIGLANNELSMLSQLS--GKYGNAFSYCLP 261
Query: 275 SDGTGRISFGDKGSPGQGE----------------TPFSLRQTHPT-YNITITQVSVGGN 317
+ SF SP +G TP +P+ Y I + ++V G
Sbjct: 262 T------SFSTPNSPKEGFLSIGTSSLTPSSSYKFTPLLKNPNNPSLYFIDLESITVAGR 315
Query: 318 -----AVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
A +++ I DSGT T L P YT + + ++ +K + + + C+
Sbjct: 316 PLGVAASSYKVPTIIDSGTVITRLPTPVYTTLKNAYVTILSKKYQQAPGISLLDTCF 372
>gi|297803034|ref|XP_002869401.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
gi|297315237|gb|EFH45660.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
Length = 438
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 77/260 (29%), Positives = 113/260 (43%), Gaps = 50/260 (19%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
N+S+G P ++ ++ +DT SDL WL C C++C I+ P+ S T
Sbjct: 87 VNISIGSPPVTQLLHMDTASDLLWLQCRPCINCY---------AQSLPIFDPSRSYTHRN 137
Query: 166 VPCNSTLCELQKQCPSAGSN-----CPYQVRYLSDGTMSTGFLVEDVLHLAT--DEKQSK 218
C ++ Q PS N C Y +RY+ DGT S G L +++L T DE S
Sbjct: 138 ESCRTS----QYSMPSLRFNAKTRSCEYSMRYM-DGTGSKGILAKEMLMFNTIYDESSSA 192
Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT 278
++ + FGCG G L G G+ GLG + S+ + FS CFGS
Sbjct: 193 ALHD-VVFGCGHDNYGEPLVGT---GILGLGYGEFSLVHRFGTK------FSYCFGSLDD 242
Query: 279 -----GRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE---FS------ 324
+ GD G+ G+T L + Y +TI +SV G + + F+
Sbjct: 243 PSYPHNVLVLGDDGANILGDTT-PLEIYNGFYYVTIEAISVDGIILPIDPWVFNRNHQTG 301
Query: 325 ---AIFDSGTSFTYLNDPAY 341
I D+G S T L + AY
Sbjct: 302 LGGTIIDTGNSLTSLVEEAY 321
>gi|357135412|ref|XP_003569303.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 494
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 82/295 (27%), Positives = 119/295 (40%), Gaps = 48/295 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++T + VG PA ++ LDTGSD+ WL C C C SGQV D P S +
Sbjct: 142 YFTKIGVGTPATPALMVLDTGSDVVWLQCAPCRRCYD----QSGQVFD-----PRRSRSY 192
Query: 164 SKVPCNSTLCELQKQ--CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
V C++ LC C C YQV Y DG+++ G + L A +
Sbjct: 193 GAVGCSAPLCRRLDSGGCDLRRKACLYQVAY-GDGSVTAGDFATETLTFAGGARV----- 246
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-------- 273
+RI+ GCG G F+ A GL S P+ ++ + SFS C
Sbjct: 247 ARIALGCGHDNEGLFVAAAGLLGLG---RGSLSFPAQISRR--YGRSFSYCLVDRTSSAN 301
Query: 274 GSDGTGRISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGGNAV---------- 319
+ + ++FG F+ +P Y + + +SVGG V
Sbjct: 302 PASHSSTVTFGSGAVGSTVAASFTPMVKNPRMETFYYVQLVGISVGGARVSGVADSDLRL 361
Query: 320 ---NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVL 371
+ I DSGTS T L PAY+ + + F + A R + F+ CY L
Sbjct: 362 DPSSGRGGVIVDSGTSVTRLARPAYSALRDAFRAAAAGLRLSPGGFSLFDTCYDL 416
>gi|15228044|ref|NP_181826.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|20197868|gb|AAM15292.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197965|gb|AAD21712.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330255100|gb|AEC10194.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 527
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 87/312 (27%), Positives = 121/312 (38%), Gaps = 50/312 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ +V VG P F + LDTGSDL WL C C C H +G Y P TS++
Sbjct: 160 YFMDVLVGTPPKHFSLILDTGSDLNWLQCLPCYDCFH----QNGM-----FYDPKTSASF 210
Query: 164 SKVPCNSTLCELQK------QCPSAGSNCPYQVRYLSDGTMSTGFLVED-VLHLATDEKQ 216
+ CN C L QC S +CPY Y + F VE ++L T E
Sbjct: 211 KNITCNDPRCSLISSPDPPVQCESDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTEGG 270
Query: 217 SKSVD-SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-- 273
S + FGCG G F + GL + +S Q L +SFS C
Sbjct: 271 SSEYKVGNMMFGCGHWNRGLFSGASGLLGLGRGPLSFSS-----QLQSLYGHSFSYCLVD 325
Query: 274 ---GSDGTGRISFG-DKGSPGQGETPFS------LRQTHPTYNITITQVSVGGNAVNF-- 321
++ + ++ FG DK F+ Y I I + VGG A++
Sbjct: 326 RNSNTNVSSKLIFGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILVGGKALDIPE 385
Query: 322 ---------EFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEK----RETSTSDLPFEYC 368
+ I DSGT+ +Y +PAY I F KE R+ D F
Sbjct: 386 ETWNISSDGDGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYPIFRDFPVLDPCFNVS 445
Query: 369 YVLRSFLHLQAL 380
+ + +HL L
Sbjct: 446 GIEENNIHLPEL 457
>gi|224092218|ref|XP_002309514.1| predicted protein [Populus trichocarpa]
gi|222855490|gb|EEE93037.1| predicted protein [Populus trichocarpa]
Length = 474
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 92/320 (28%), Positives = 132/320 (41%), Gaps = 47/320 (14%)
Query: 66 RDRYFRLRGRGLAAQGNDKTP----LTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVA 121
+ R ++ G G+ + K P + GN + V +G P F +
Sbjct: 103 QARLSKISGHGIFEEMVTKLPAQSGIAIGTGN-----------YVVTVGLGTPKEDFTLV 151
Query: 122 LDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCEL----QK 177
DTGS + W C C+ Q D P S++ + V C+S C L ++
Sbjct: 152 FDTGSGITWTQCQ--PCLGSCYPQKEQKFD-----PTKSTSYNNVSCSSASCNLLPTSER 204
Query: 178 QCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFL 237
C ++ S C YQ+ Y D + S GF + L ++ S V + FGCG+ G F
Sbjct: 205 GCSASNSTCLYQIIY-GDQSYSQGFFATETLTIS-----SSDVFTNFLFGCGQSNNGLFG 258
Query: 238 DGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS--DGTGRISFGDKGSPGQGETP 295
A GL GL S+PS A + FS C S TG ++FG K S G TP
Sbjct: 259 QAA---GLLGLSSSSVSLPSQTAEK--YQKQFSYCLPSTPSSTGYLNFGGKVSQTAGFTP 313
Query: 296 FSLRQTHPTYNITITQVSVGGNAVNFEFS------AIFDSGTSFTYLNDPAYTQISETFN 349
S Y I I +SV G+ + + S AI DSGT T L AY + E F+
Sbjct: 314 IS-PAFSSFYGIDIVGISVAGSQLPIDPSIFTTSGAIIDSGTVITRLPPTAYKALKEAFD 372
Query: 350 SLAKEKRETSTSDLPFEYCY 369
+T+ +L + CY
Sbjct: 373 EKMSNYPKTNGDEL-LDTCY 391
>gi|217074470|gb|ACJ85595.1| unknown [Medicago truncatula]
gi|388505166|gb|AFK40649.1| unknown [Medicago truncatula]
Length = 452
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 76/297 (25%), Positives = 124/297 (41%), Gaps = 59/297 (19%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+Y + +G P + + +DTGS WL C C H + + +++P+ S T
Sbjct: 103 YYVKMGLGSPTKYYTMIVDTGSSFSWLQCQPCTIYCH--------IQEDPVFNPSASKTY 154
Query: 164 SKVPCN---------STLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
VPC+ +TL E C + C Y+ Y D + S G+L +DVL L +
Sbjct: 155 KTVPCSSSQCSSLKSATLNE--PTCSKQSNACVYKASY-GDSSFSLGYLSQDVLTLTPSQ 211
Query: 215 KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG 274
S V +GCG+ G F +G+ GL ++ S+ S L+ G N+FS C
Sbjct: 212 TLSSFV-----YGCGQDNQGLF---GRTDGIIGLANNELSMLSQLS--GKYGNAFSYCLP 261
Query: 275 SDGTGRISFGDKGSPGQGE----------------TPFSLRQTHPT-YNITITQVSVGGN 317
+ SF SP +G TP +P+ Y I + ++V G
Sbjct: 262 T------SFSTPNSPKEGFLSIGTSSLTPSSSYKFTPLLKNPNNPSLYFIDLESITVAGR 315
Query: 318 -----AVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
A +++ I DSGT T L P YT + + ++ +K + + + C+
Sbjct: 316 PLGVAASSYKVPTIIDSGTVITRLPTPVYTTLKNAYVTILSKKYQQAPGISLLDTCF 372
>gi|115434870|ref|NP_001042193.1| Os01g0178600 [Oryza sativa Japonica Group]
gi|55296112|dbj|BAD67831.1| putative CDR1 [Oryza sativa Japonica Group]
gi|55296252|dbj|BAD67993.1| putative CDR1 [Oryza sativa Japonica Group]
gi|113531724|dbj|BAF04107.1| Os01g0178600 [Oryza sativa Japonica Group]
gi|125569253|gb|EAZ10768.1| hypothetical protein OsJ_00604 [Oryza sativa Japonica Group]
Length = 454
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 75/253 (29%), Positives = 105/253 (41%), Gaps = 22/253 (8%)
Query: 97 RLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYS 156
++ S F + V++G P S + DTGSDL W V C G N +S +
Sbjct: 93 KVVSRSFEYLMTVNLGSPPRSMLAIADTGSDLVW-----VKCKKGNNDTSSAAAPTTQFD 147
Query: 157 PNTSSTSSKVPCNSTLCE-LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEK 215
P+ SST +V C + CE L + GSNC Y Y DG+ +TG L +
Sbjct: 148 PSRSSTYGRVSCQTDACEALGRATCDDGSNCAYLYAY-GDGSNTTGVLSTETFTFDDGGS 206
Query: 216 QSKSVDSR---ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC 272
R + FGC GSF +GL GLG S+ + L + FS C
Sbjct: 207 GRSPRQVRVGGVKFGCSTATAGSF----PADGLVGLGGGAVSLVTQLGGATSLGRRFSYC 262
Query: 273 F---GSDGTGRISFG---DKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA- 325
+ + ++FG D PG TP Y + + V VG V S+
Sbjct: 263 LVPHSVNASSALNFGALADVTEPGAASTPLVAGDVDTYYTVVLDSVKVGNKTVASAASSR 322
Query: 326 -IFDSGTSFTYLN 337
I DSGT+ T+L+
Sbjct: 323 IIVDSGTTLTFLD 335
>gi|218188634|gb|EEC71061.1| hypothetical protein OsI_02803 [Oryza sativa Indica Group]
Length = 479
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 80/272 (29%), Positives = 121/272 (44%), Gaps = 34/272 (12%)
Query: 99 NSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSP 157
+SL L Y +V +G PA++ V +DTGSD+ W+ C+ ++ +G + D P
Sbjct: 128 SSLDTLEYVISVGLGSPAMTQRVVIDTGSDVSWVQCEPCPAPSPCHAHAGALFD-----P 182
Query: 158 NTSSTSSKVPCNSTLCEL-----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLAT 212
SST + C++ C + A S C Y V+Y DG+ +TG DVL L+
Sbjct: 183 AASSTYAAFNCSAAACAQLGDSGEANGCDAKSRCQYIVKY-GDGSNTTGTYSSDVLTLSG 241
Query: 213 DEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC 272
+ V FGC + G+ +D +GL GLG D S+ S A + SFS C
Sbjct: 242 SD-----VVRGFQFGCSHAELGAGMDDKT-DGLIGLGGDAQSLVSQTAAR--YGKSFSYC 293
Query: 273 FGSD--GTGRISFGDKGSPGQ------GETPFSLRQTHPTYNI-TITQVSVGGNAVN--- 320
+ +G ++ G S G TP + PTY + ++VGG +
Sbjct: 294 LPATPASSGFLTLGAPASGGGGGASRFATTPMLRSKKVPTYYFAALEDIAVGGKKLGLSP 353
Query: 321 --FEFSAIFDSGTSFTYLNDPAYTQISETFNS 350
F ++ DSGT T L AY +S F +
Sbjct: 354 SVFAAGSLVDSGTVITRLPPAAYAALSSAFRA 385
>gi|115465771|ref|NP_001056485.1| Os05g0590000 [Oryza sativa Japonica Group]
gi|49328116|gb|AAT58814.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113580036|dbj|BAF18399.1| Os05g0590000 [Oryza sativa Japonica Group]
Length = 481
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 84/295 (28%), Positives = 121/295 (41%), Gaps = 47/295 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
++ V VG PA + ++ LDTGSD+ WL C C H + SG+V D P S + +
Sbjct: 128 YFAQVGVGTPATTALMVLDTGSDVVWL--QCAPCRH-CYAQSGRVFD-----PRRSRSYA 179
Query: 165 KVPCNSTLCEL--QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
V C + +C C ++C YQV Y DG+++ G + L A +
Sbjct: 180 AVDCVAPICRRLDSAGCDRRRNSCLYQVAY-GDGSVTAGDFASETLTFARGARV-----Q 233
Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--------- 273
R++ GCG G F+ A +GL GLG + S PS +A SFS C
Sbjct: 234 RVAIGCGHDNEGLFI---AASGLLGLGRGRLSFPSQIARS--FGRSFSYCLVDRTSSVRP 288
Query: 274 GSDGTGRISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGG------------- 316
S + ++FG F+ +P Y + + SVGG
Sbjct: 289 SSTRSSTVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSDLRL 348
Query: 317 NAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVL 371
N I DSGTS T L P Y + + F + A R + F+ CY L
Sbjct: 349 NPTTGRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCYNL 403
>gi|449443786|ref|XP_004139658.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449475416|ref|XP_004154449.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 453
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 81/288 (28%), Positives = 117/288 (40%), Gaps = 43/288 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++T + VG P + LDTGSD+ WL C C C + I++P S +
Sbjct: 110 YFTRLGVGTPPRYLYMVLDTGSDVVWLQCSPCRKCYSQSDP---------IFNPYKSKSF 160
Query: 164 SKVPCNSTLCEL--QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
+ +PC+S LC C + C YQV Y DG+ +TG + L ++
Sbjct: 161 AGIPCSSPLCRRLDSSGCSTRRHTCLYQVSY-GDGSFTTGDFATETLTFRGNKI------ 213
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GSDG 277
++++ GCG G F+ A GL + S I N + FS C S
Sbjct: 214 AKVALGCGHHNEGLFVGAAGLLGLGRGRLSFPSQTGIRFN-----HKFSYCLVDRSASSK 268
Query: 278 TGRISFGDKGSPGQGE-TPFSLRQTHPT-YNITITQVSVGGNAVN------FEFSA---- 325
+ FGD TP T Y + + +SVGG V F+ +
Sbjct: 269 PSSMVFGDAAISRLARFTPLIRNPKLDTFYYVGLIGISVGGVRVRGVSPSLFKLDSAGNG 328
Query: 326 --IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVL 371
I DSGTS T L PAYT + + F A+ + L F+ CY L
Sbjct: 329 GVIIDSGTSVTRLTRPAYTALRDAFRVGARHLKRGPEFSL-FDTCYDL 375
>gi|359485189|ref|XP_002279141.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 546
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 86/298 (28%), Positives = 122/298 (40%), Gaps = 49/298 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
++ +V VG P F + LDTGSDL W+ CV C + Y P SS+
Sbjct: 181 YFIDVFVGTPPKHFSLILDTGSDLNWI--QCVPCYECFEQNGPH------YDPGQSSSYR 232
Query: 165 KVPCNSTLCEL------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDV---LHLATDEK 215
+ C+ + C L + C + CPY Y + F +E L +++ +
Sbjct: 233 NIGCHDSRCHLVSSPDPPQPCKAENQTCPYYYWYGDSSNTTGDFALETFTVNLTMSSGKP 292
Query: 216 QSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-- 273
+ + V++ + FGCG G F A L GLG S S L Q L +SFS C
Sbjct: 293 ELRRVEN-VMFGCGHWNRGLFHGAAG---LLGLGRGPLSFSSQL--QSLYGHSFSYCLVD 346
Query: 274 -GSDG--TGRISFGDKGS----PGQGETPFSLRQTHPT---YNITITQVSVGGNAVNFEF 323
SD + ++ FG+ P T + +P Y + I + VGG VN
Sbjct: 347 RNSDANVSSKLIFGEDKDLLSHPELNFTTLVAGKENPVDTFYYVQIKSIVVGGEVVNIPE 406
Query: 324 S-----------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP-FEYCY 369
I DSGT+ +Y +PAY I E F +AK K D P E CY
Sbjct: 407 EKWQIATDGSGGTIIDSGTTLSYFAEPAYQVIKEAF--MAKVKGYPVVKDFPVLEPCY 462
>gi|15229656|ref|NP_188478.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75273882|sp|Q9LS40.1|ASPG1_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 1;
Short=AtASPG1; Flags: Precursor
gi|11994113|dbj|BAB01116.1| CND41, chloroplast nucleoid DNA binding protein-like [Arabidopsis
thaliana]
gi|23297732|gb|AAN13013.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
gi|332642583|gb|AEE76104.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 500
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 77/282 (27%), Positives = 118/282 (41%), Gaps = 40/282 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+++ + VG PA + LDTGSD+ W+ C+ C C + +++P +SST
Sbjct: 162 YFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCADCYQQSDP---------VFNPTSSSTY 212
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
+ C++ C L + + C YQV Y DG+ + G L D + K +
Sbjct: 213 KSLTCSAPQCSLLETSACRSNKCLYQVSY-GDGSFTVGELATDTVTFGNSGKINN----- 266
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISF 283
++ GCG G F A GL G + + NQ + SFS C +G+ S
Sbjct: 267 VALGCGHDNEGLFTGAAGLLGLGGGVLS-------ITNQ-MKATSFSYCLVDRDSGKSSS 318
Query: 284 GDKGSP----GQGETPFSLRQTHPT-YNITITQVSVGGNAVN-----FEFSA------IF 327
D S G P + T Y + ++ SVGG V F+ A I
Sbjct: 319 LDFNSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVIL 378
Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
D GT+ T L AY + + F L ++ S+S F+ CY
Sbjct: 379 DCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCY 420
>gi|357457681|ref|XP_003599121.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355488169|gb|AES69372.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 439
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 81/295 (27%), Positives = 127/295 (43%), Gaps = 37/295 (12%)
Query: 102 GFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTS 160
G + + S+G P +DTGSD W C C C LN +S I++P+ S
Sbjct: 87 GSYYVMSYSIGTPPFQLYGVVDTGSDGIWFQCKPCKPC---LNQTSP------IFNPSKS 137
Query: 161 STSSKVPCNSTLCEL--QKQCPS-AGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
ST + C+S +C+ + +C S C Y++ YL D + S G + +D L L +++
Sbjct: 138 STYKNIRCSSPICKRGEKTRCSSNRKRKCEYEITYL-DRSGSQGDISKDTLTLNSNDGSP 196
Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-- 275
S +I GCG S +G+ G G S+ S L + I FS C S
Sbjct: 197 ISF-PKIVIGCG--HKNSLTTEGLASGIIGFGRGNFSIVSQLGSS--IGGKFSYCLASLF 251
Query: 276 ---DGTGRISFGDKGS-PGQGETPFSLRQTH--PTYNITITQVSVGGNAVNF-------- 321
+ + ++ FGD G G L Q+ Y + SVG + +
Sbjct: 252 SKANISSKLYFGDMAVVSGHGVVSTPLIQSFYVGNYFTNLEAFSVGDHIIKLKDSSLIPD 311
Query: 322 -EFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKR-ETSTSDLPFEYCYVLRSF 374
E +A+ DSG++ T L + Y+Q+ S+ K KR + T L Y L+ +
Sbjct: 312 NEGNAVIDSGSTITQLPNDVYSQLETAVISMVKLKRVKDPTQQLSLCYKTTLKKY 366
>gi|19424106|gb|AAL87345.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
Length = 500
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 79/295 (26%), Positives = 122/295 (41%), Gaps = 40/295 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+++ + VG PA + LDTGSD+ W+ C+ C C + +++P +SST
Sbjct: 162 YFSRIGVGTPAKDMYLVLDTGSDVNWIQCEPCADCYQQSDP---------VFNPTSSSTY 212
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
+ C++ C L + + C YQV Y DG+ + G L D + K +
Sbjct: 213 KSLTCSAPQCSLLETSACRSNKCLYQVSY-GDGSFTVGELATDTVTFGNSGKINN----- 266
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISF 283
++ GCG G F A GL G + + NQ + SFS C +G+ S
Sbjct: 267 VALGCGHDNEGLFTGAAGLLGLGGGVLS-------ITNQ-MKATSFSYCLVDRDSGKSSS 318
Query: 284 GDKGSP----GQGETPFSLRQTHPT-YNITITQVSVGGNAVN-----FEFSA------IF 327
D S G P + T Y + ++ SVGG V F+ A I
Sbjct: 319 LDFNSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVIL 378
Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRSFLHLQALVV 382
D GT+ T L AY + + F L ++ S+S F+ CY S ++ V
Sbjct: 379 DCGTAVTRLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLSTVKVPTV 433
>gi|357132618|ref|XP_003567926.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 468
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 80/298 (26%), Positives = 124/298 (41%), Gaps = 47/298 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
++ + VG PA F++ DTGSDL W+ C S ++S ++ P S + S
Sbjct: 104 YFVRLRVGTPAQPFVLVADTGSDLTWVKCSSPSSSSSSPAASPPQ---RVFRPAGSKSWS 160
Query: 165 KVPCNSTLCELQ-----KQCPSAGSNCPYQVRYLSDGTMSTGFLVED--VLHLATDEKQS 217
+PC+S C+ C S C Y RY D + + G + D + L+ ++
Sbjct: 161 PLPCDSDTCKSYVPFSLANCSSPPDPCSYDYRY-KDNSSARGVVGLDSATVSLSGNDGTR 219
Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---- 273
K+ + GC G + +G+ LG S S A++ FS C
Sbjct: 220 KAKLQEVVLGCTTSYDGQSFKSS--DGVLSLGNSNISFASRAASR--FGGRFSYCLVDHL 275
Query: 274 -GSDGTGRISFGDKGSPGQG-----ETPFSLRQ---THPTYNITITQVSVGGNAVN---- 320
+ T ++FG+ S TP L + T P Y +++ V+V G +
Sbjct: 276 APRNATSFLTFGNGDSSPGDDSSSRRTPLVLLEDARTRPFYFVSVDAVTVAGERLEILPD 335
Query: 321 -FEFS----AIFDSGTSFTYLNDPAY----TQISETFNSLAKEKRETSTSDLPFEYCY 369
++F AI DSGTS T L PAY IS+ F + + + PFEYCY
Sbjct: 336 VWDFRKNGGAILDSGTSLTILATPAYDAVVKAISKQFAGVPRVNMD------PFEYCY 387
>gi|293335828|ref|NP_001170221.1| uncharacterized protein LOC100384173 precursor [Zea mays]
gi|224034427|gb|ACN36289.1| unknown [Zea mays]
Length = 443
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 78/271 (28%), Positives = 107/271 (39%), Gaps = 45/271 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ + +G P + LDTGSDL W C C+ CV Q + + P S+T
Sbjct: 90 YLMEMGIGTPTRYYSAILDTGSDLIWTQCAPCLLCVD-------QPTPY--FDPARSATY 140
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
+ C S C C YQ Y D + G L + T+E +
Sbjct: 141 RSLGCASPACNALYYPLCYQKVCVYQYFY-GDSASTAGVLANETFTFGTNETRVSL--PG 197
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDGTGR 280
ISFGCG + G +G +G+ G G S+ S L + FS C F S R
Sbjct: 198 ISFGCGNLNAGLLANG---SGMVGFGRGSLSLVSQLGSP-----RFSYCLTSFLSPVPSR 249
Query: 281 ISFG--------DKGSPGQGETPFSLRQTHPT-YNITITQVSVGG------------NAV 319
+ FG + S TPF + PT Y + +T +SVGG N
Sbjct: 250 LYFGVYATLNSTNASSEPVQSTPFVVNPALPTMYFLNMTGISVGGYLLPIDPAVFAINDT 309
Query: 320 NFEFSAIFDSGTSFTYLNDPAYTQISETFNS 350
+ I DSGT+ TYL +PAY + F S
Sbjct: 310 DGTGGTIIDSGTTITYLAEPAYDAVRAAFAS 340
>gi|301119611|ref|XP_002907533.1| aspartyl protease family A01B, putative [Phytophthora infestans
T30-4]
gi|262106045|gb|EEY64097.1| aspartyl protease family A01B, putative [Phytophthora infestans
T30-4]
Length = 681
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 70/266 (26%), Positives = 111/266 (41%), Gaps = 30/266 (11%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
HYT V G P V DTGS L PC C C H + + SST
Sbjct: 67 HYTWVYAGTPPQRASVIADTGSALMAFPCSGCDGCGHHTDQP---------FQAANSSTL 117
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLA-----TDEKQSK 218
+ C K+C C Y+ +G+ +VED+++L D++
Sbjct: 118 VHITCAQKSLFQCKECHVQSDTCGISQSYM-EGSSWKASVVEDIVYLGGESSFDDKEMRN 176
Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIP-NSFSMCFGSDG 277
+ FGC + G F+ A +G+ GL + + + L + I N FS+CF +G
Sbjct: 177 RYGTHFQFGCQSSEKGLFVTQVA-DGIMGLSNTENHIIAKLHRENKIASNLFSLCFTENG 235
Query: 278 TGRISFGD-KGSPGQGETPF----SLRQTHPTYNITITQVSVGGNAVNFEFSA------I 326
G +S G + +GE + + R YN+ + + +GG ++N + A I
Sbjct: 236 -GTMSVGQPHKAAHRGEISYVKVIADRSAGHFYNVHMKDIRIGGKSINAKEEAYTRGHYI 294
Query: 327 FDSGTSFTYLNDPAYTQISETFNSLA 352
DSGT+ +YL T+ + F +A
Sbjct: 295 VDSGTTDSYLPRALKTEFLQMFKEIA 320
>gi|357124861|ref|XP_003564115.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 477
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 77/270 (28%), Positives = 115/270 (42%), Gaps = 41/270 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+ ++SVG P + LDTGSDL W C C++ + + V+D P SST +
Sbjct: 94 YLVHLSVGTPPRPVALTLDTGSDLVW--TQCAPCLNCFDQGAIPVLD-----PAASSTHA 146
Query: 165 KVPCNSTLCELQ--KQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLAT-DEKQ 216
V C++ +C C GS +C Y Y D +++ G L D D
Sbjct: 147 AVRCDAPVCRALPFTSCGRGGSSWGERSCVYVYHY-GDKSITVGKLASDRFTFGPGDNAD 205
Query: 217 SKSV-DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
V + R++FGCG G F A G+ G G + S+PS L SFS CF S
Sbjct: 206 GGGVSERRLTFGCGHFNKGIFQ--ANETGIAGFGRGRWSLPSQLGV-----TSFSYCFTS 258
Query: 276 ---DGTGRISFGDKGSP----GQGE-TPFSLRQTHPT-YNITITQVSVGGNAVNF----- 321
+ ++ G + GQ + TP + P+ Y +++ ++VG +
Sbjct: 259 MFESTSSLVTLGVAPAELHLTGQVQSTPLLRDPSQPSLYFLSLKAITVGATRIPIPERRQ 318
Query: 322 ---EFSAIFDSGTSFTYLNDPAYTQISETF 348
E SAI DSG S T L + Y + F
Sbjct: 319 RLREASAIIDSGASITTLPEDVYEAVKAEF 348
>gi|88174565|gb|ABD39357.1| chloroplast nucleoid DNA-binding protein [Oryza glumipatula]
Length = 323
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 76/293 (25%), Positives = 126/293 (43%), Gaps = 46/293 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+ +V +G P+ + IV +DTGS W+ C+C C H + + + S+T +
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSTSWVFCECDGC-H---------TNPRTFLQSRSTTCA 50
Query: 165 KVPCNSTLCELQKQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
KV C +++C L P +CP++V Y DG+ S G L +D L + +K
Sbjct: 51 KVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSY-QDGSASYGILYQDTLTFSDVQKIPG- 108
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC------- 272
+FGC G+ G +GL G+G + SV L + FS C
Sbjct: 109 ----FTFGCNMDSFGANEFGNV-DGLLGMGAGQMSV---LKQSSPTFDGFSYCLPLQMSE 160
Query: 273 --FGSDGTGRISFGDKGSPGQGETPFS----LRQTHPTYNITITQVSVGGNAVNFEFS-- 324
F S TG S G K + + + ++ R+ + + +T +SV G + S
Sbjct: 161 RGFFSKTTGYFSLGGKIAATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSIF 220
Query: 325 ----AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRS 373
+FDSG+ +Y+ D A + +S+ L R + + CY +RS
Sbjct: 221 SRKGVVFDSGSELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRS 271
>gi|255548660|ref|XP_002515386.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545330|gb|EEF46835.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 387
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 81/270 (30%), Positives = 118/270 (43%), Gaps = 36/270 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CV-SCVHGLNSSSGQVIDFNIYSPNTSST 162
+ +++G P LS +ALDTGSD+ W C+ CV SC + + P SS+
Sbjct: 45 YLVKMALGTPKLSLSLALDTGSDITWTQCEPCVGSCYRQAQTK---------FDPRKSSS 95
Query: 163 SSKVPCNSTLCELQKQCPSA----GSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
V C+S+ C + A S C Y+V+Y DG+ S GF + L ++ +
Sbjct: 96 YKNVSCSSSSCRIITDSGGARGCVSSTCIYKVQY-GDGSYSVGFFATEKLTISPSD---- 150
Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGS 275
V S FGCG+ G F A G+ + + L N F+ C F S
Sbjct: 151 -VISNFLFGCGQQNAGRFGRIAGLL-----GLGRGKLSLALQTSEKYNNLFTYCLPSFSS 204
Query: 276 DGTGRISFGDKGSPGQGETPFS-LRQTHPTYNITITQVSVGGNAVNFEFS------AIFD 328
TG ++ G + TP S + P Y I I +SVGG+ + + S AI D
Sbjct: 205 SSTGHLTLGGQVPKSVKFTPLSPAFKNTPFYGIDIKGLSVGGHVLPIDASVFSNAGAIID 264
Query: 329 SGTSFTYLNDPAYTQISETFNSLAKEKRET 358
SGT T L Y+ +S F L K+ +T
Sbjct: 265 SGTVITRLQPTVYSALSSKFQQLMKDYPKT 294
>gi|116787367|gb|ABK24480.1| unknown [Picea sitchensis]
Length = 496
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 73/263 (27%), Positives = 113/263 (42%), Gaps = 40/263 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++T + +G P + LDTGSD+ W+ C+ C C + I++P++S +
Sbjct: 154 YFTRIGIGTPTREQYMVLDTGSDVVWIQCEPCRECYSQADP---------IFNPSSSVSF 204
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
S V C+S +C G C Y+V Y DG+ + G + L T Q+
Sbjct: 205 STVGCDSAVCSQLDANDCHGGGCLYEVSY-GDGSYTVGSYATETLTFGTTSIQN------ 257
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGTGR 280
++ GCG G F+ A GL + S P+ L Q +FS C S+ +G
Sbjct: 258 VAIGCGHDNVGLFVGAAGLLGLGAGSL---SFPAQLGTQ--TGRAFSYCLVDRDSESSGT 312
Query: 281 ISFGDKGSP-GQGETPFSLRQTHPT-YNITITQVSVGGNAVN------FEFSA------- 325
+ FG + P G TP PT Y +++ +SVGG ++ F
Sbjct: 313 LEFGPESVPIGSIFTPLVANPFLPTFYYLSMVAISVGGVILDSVPSEAFRIDETTGRGGI 372
Query: 326 IFDSGTSFTYLNDPAYTQISETF 348
I DSGT+ T L AY + + F
Sbjct: 373 IIDSGTAVTRLQTSAYDALRDAF 395
>gi|449454654|ref|XP_004145069.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449470632|ref|XP_004153020.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449499016|ref|XP_004160697.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 434
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 74/256 (28%), Positives = 108/256 (42%), Gaps = 35/256 (13%)
Query: 102 GFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSS 161
G + +SVG P S + DTGSD+ W C S + N+ ++ P+ S+
Sbjct: 80 GGEYLVEISVGTPPFSIVAVADTGSDVIWTQCKPCSNCYQQNAP--------MFDPSKST 131
Query: 162 TSSKVPCNSTLCELQKQCPSAG--SNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
T V C+S +C S S C Y + Y D + S G L D + + + + +
Sbjct: 132 TYKNVACSSPVCSYSGDGSSCSDDSECLYSIAY-GDDSHSQGNLAVDTVTMQSTSGRPVA 190
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTG 279
R GCG G+F A +G+ GLG S+ + L FS C GTG
Sbjct: 191 F-PRTVIGCGHDNAGTF--NANVSGIVGLGRGPASLVTQLGPA--TGGKFSYCLIPIGTG 245
Query: 280 ------RISFGDKGS---PGQGETP-FSLRQTHPTYNITITQVSVGGNAVNF-------- 321
+++FG + G TP +S Q Y++ + VSVG NF
Sbjct: 246 STNDSTKLNFGSNANVSGSGTVSTPIYSSAQYKTFYSLKLEAVSVGDTKFNFPEGASKLG 305
Query: 322 -EFSAIFDSGTSFTYL 336
E + I DSGT+ TYL
Sbjct: 306 GESNIIIDSGTTLTYL 321
>gi|449523529|ref|XP_004168776.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 461
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 87/344 (25%), Positives = 135/344 (39%), Gaps = 60/344 (17%)
Query: 31 GFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQG---NDKTPL 87
GF ++ D VK + + L + ++R RL LAA D+
Sbjct: 50 GFRVRLKHVDHVKNLTRFERLRR-------GVARGKNRLHRLNAMVLAAANATVGDQVKA 102
Query: 88 TFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSG 147
AGN + + +++G P SF +DTGSDL W C C + S+
Sbjct: 103 PVVAGNGEFLMK---------LAIGSPPRSFSAIMDTGSDLIW--TQCKPCQQCFDQST- 150
Query: 148 QVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDV 207
I+ P SS+ K+ C+S LC + C Y Y D + + G L +
Sbjct: 151 -----PIFDPKQSSSFYKISCSSELCGALPTSTCSSDGCEYLYTY-GDSSSTQGVLAFET 204
Query: 208 LHLATDEKQSKSVDSRISFGCGRVQTGS-FLDGAAPNGLFGLGMDKTSVPSILANQGLI- 265
+ S+ + FGCG G F GA GL GLG S+ S L Q
Sbjct: 205 FTFGDSTEDQISIPG-LGFGCGNDNNGDGFSQGA---GLVGLGRGPLSLVSQLKEQKFAY 260
Query: 266 ---------PNSFSMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVG 315
P+S + ++ T + S + + TP + P+ Y +++ +SVG
Sbjct: 261 CLTAIDDSKPSSLLLGSLANITPKTSKDEMKT-----TPLIKNPSQPSFYYLSLQGISVG 315
Query: 316 GNAVN-----FEF------SAIFDSGTSFTYLNDPAYTQISETF 348
G ++ FE I DSGT+ TY+ + A+T + F
Sbjct: 316 GTQLSIPKSTFELHDDGSGGVIIDSGTTITYVENSAFTSLKNEF 359
>gi|224091849|ref|XP_002309371.1| predicted protein [Populus trichocarpa]
gi|222855347|gb|EEE92894.1| predicted protein [Populus trichocarpa]
Length = 438
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 76/266 (28%), Positives = 122/266 (45%), Gaps = 39/266 (14%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L N S+GQP + + +DTGS L W+ C C SC S Q+I ++ P+ SST
Sbjct: 101 LFLVNFSMGQPPVPQLAIMDTGSSLLWIQCAPCKSC-------SQQIIG-PMFDPSISST 152
Query: 163 SSKVPCNSTLCELQK--QCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
+ C + +C +C S+ S C Y Y+ +G S G + + L + ++ +V
Sbjct: 153 YDSLSCKNIICRYAPSGECDSS-SQCVYNQTYV-EGLPSVGVIATEQLIFGSSDEGRNAV 210
Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGR 280
++ + FGC + G++ D G+FGLG TSV NQ + + FS C G+
Sbjct: 211 NN-VLFGCSH-RNGNYKDRRF-TGVFGLGSGITSV----VNQ--MGSKFSYCIGNIADPD 261
Query: 281 ISFGD----KGSPGQG-ETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA---------- 325
S+ +G +G TP + H Y + + +SVG + + SA
Sbjct: 262 YSYNQLVLSEGVNMEGYSTPLDVVDGH--YQVILEGISVGETRLVIDPSAFKRTEKQRRV 319
Query: 326 IFDSGTSFTYLNDPAYTQISETFNSL 351
I DSGT+ T+L + Y + +L
Sbjct: 320 IIDSGTAPTWLAENEYRALEREVRNL 345
>gi|125590542|gb|EAZ30892.1| hypothetical protein OsJ_14967 [Oryza sativa Japonica Group]
Length = 516
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 83/265 (31%), Positives = 112/265 (42%), Gaps = 52/265 (19%)
Query: 111 VGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCN 169
+G PAL++ +DTGSDL W C CV C ++ P++SST + VPC+
Sbjct: 173 IGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTP---------VFDPSSSSTYATVPCS 223
Query: 170 STLCE--LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFG 227
S C +C SA S C Y Y D + + G L + LA KS + FG
Sbjct: 224 SASCSDLPTSKCTSA-SKCGYTYTY-GDSSSTQGVLATETFTLA------KSKLPGVVFG 275
Query: 228 CGRVQTGS-FLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGTGR----- 280
CG G F GA GL GLG S+ S L GL + FS C S D T
Sbjct: 276 CGDTNEGDGFSQGA---GLVGLGRGPLSLVSQL---GL--DKFSYCLTSLDDTNNSPLLL 327
Query: 281 -----ISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA--------- 325
IS + TP + P+ Y +++ ++VG ++ SA
Sbjct: 328 GSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTG 387
Query: 326 --IFDSGTSFTYLNDPAYTQISETF 348
I DSGTS TYL Y + + F
Sbjct: 388 GVIVDSGTSITYLEVQGYRALKKAF 412
>gi|297852200|ref|XP_002893981.1| hypothetical protein ARALYDRAFT_314121 [Arabidopsis lyrata subsp.
lyrata]
gi|297339823|gb|EFH70240.1| hypothetical protein ARALYDRAFT_314121 [Arabidopsis lyrata subsp.
lyrata]
Length = 354
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 62/195 (31%), Positives = 81/195 (41%), Gaps = 30/195 (15%)
Query: 109 VSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
+ +G P +F +DTGSDL W+ CD C C + Y P ++ V
Sbjct: 58 LQIGTPPKAFEFDIDTGSDLTWVQCDAPCTGCT---------LPPIRQYKPKGNT----V 104
Query: 167 PCNSTLCEL-----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
PC +C + QCP+ C Y+V Y G+ S G LV D L ++
Sbjct: 105 PCLDPICLALHFPNKPQCPNPKEQCDYEVNYADQGS-SMGALVIDQFPLKL--LNGSAMQ 161
Query: 222 SRISFGCGRVQTGSFLDGAAP----NGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDG 277
R++FGCG Q L A P G+ GLG K V L GL N C S G
Sbjct: 162 PRLAFGCGYDQ---ILPKAHPPPATAGVLGLGRGKIGVLPQLVAAGLTRNVVGHCLSSKG 218
Query: 278 TGRISFGDKGSPGQG 292
G + FGD P G
Sbjct: 219 GGYLFFGDTLIPTLG 233
>gi|125553531|gb|EAY99240.1| hypothetical protein OsI_21202 [Oryza sativa Indica Group]
Length = 475
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 83/295 (28%), Positives = 121/295 (41%), Gaps = 47/295 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
++ V VG PA + ++ LDTGSD+ WL C C H + SG+V D P S + +
Sbjct: 122 YFAQVGVGTPATTALMVLDTGSDVVWL--QCAPCRH-CYAQSGRVFD-----PRRSRSYA 173
Query: 165 KVPCNSTLCEL--QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
V C + +C C ++C YQV Y DG+++ G + L A +
Sbjct: 174 AVDCVAPICRRLDSAGCDRRRNSCLYQVAY-GDGSVTAGDFASETLTFARGARV-----Q 227
Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--------- 273
R++ GCG G F+ A +GL GLG + S P+ +A SFS C
Sbjct: 228 RVAIGCGHDNEGLFI---AASGLLGLGRGRLSFPTQIARS--FGRSFSYCLVDRTSSVRP 282
Query: 274 GSDGTGRISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGG------------- 316
S + ++FG F+ +P Y + + SVGG
Sbjct: 283 SSTRSSTVTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSDLRL 342
Query: 317 NAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVL 371
N I DSGTS T L P Y + + F + A R + F+ CY L
Sbjct: 343 NPTTGRGGVILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCYNL 397
>gi|168005153|ref|XP_001755275.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693403|gb|EDQ79755.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 429
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 89/324 (27%), Positives = 134/324 (41%), Gaps = 57/324 (17%)
Query: 66 RDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTG 125
+R RL LA +TP+ ++GN Y ++ +S G P +DTG
Sbjct: 62 HERRARLAKHVLAGDQLFETPV--ASGNGEYLID---------ISYGNPPQKSTAIVDTG 110
Query: 126 SDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE-LQKQCPSAG 183
SDL W+ C C SC L++ + P+ S++ + C S C+ L Q S
Sbjct: 111 SDLNWVQCLPCKSCYETLSAK---------FDPSKSASYKTLGCGSNFCQDLPFQ--SCA 159
Query: 184 SNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPN 243
++C Y Y DG+ ++G L D + + T + + ++FGCG G+F
Sbjct: 160 ASCQYDYMY-GDGSSTSGALSTDDVTIGTGKIPN------VAFGCGNSNLGTFAGAGG-- 210
Query: 244 GLFGLGMDKTSVPSILANQ--GLIPNSFSMC---FGSDGTGRISFGDKG-SPGQGETPFS 297
+ P L +Q G FS C GS T + GD + G TP
Sbjct: 211 -----LVGLGKGPLSLVSQLGGTATKKFSYCLVPLGSTKTSPLYIGDSTLAGGVAYTPML 265
Query: 298 LRQTHPT-YNITITQVSVGGNAVN-----FEFSA------IFDSGTSFTYLNDPAYTQIS 345
+PT Y + +SV G AVN F+ +A I DSGT+ TYL+ A+ +
Sbjct: 266 TNNNYPTFYYAELQGISVEGKAVNYPANTFDIAATGRGGLILDSGTTLTYLDVDAFNPMV 325
Query: 346 ETFNSLAKEKRETSTSDLPFEYCY 369
+ A E S EYC+
Sbjct: 326 AALKA-ALPYPEADGSFYGLEYCF 348
>gi|167997964|ref|XP_001751688.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696786|gb|EDQ83123.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 418
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 84/285 (29%), Positives = 117/285 (41%), Gaps = 46/285 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ + +G P F + +D+GSDL W+ C C C D +Y P+ SST
Sbjct: 64 YFVDFFLGTPPQKFSLIVDSGSDLLWVQCSPCRQCY---------AQDSPLYVPSNSSTF 114
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDV-LHLATDEKQSKSVD- 221
S VPC S+ C L A P RY G + +L D +S +VD
Sbjct: 115 SPVPCLSSDCLLIP----ATEGFPCDFRY--PGACAYEYLYADTSSSKGVFAYESATVDG 168
Query: 222 ---SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----- 273
+++FGCG GSF AA G+ GLG S S + N F+ C
Sbjct: 169 VRIDKVAFGCGSDNQGSF---AAAGGVLGLGQGPLSFGSQVGYA--YGNKFAYCLVNYLD 223
Query: 274 GSDGTGRISFGDK---GSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA---- 325
+ + + FGD+ TP PT Y + I +V+VGG ++ SA
Sbjct: 224 PTSVSSSLIFGDELISTIHDMQYTPIVSNPKSPTLYYVQIEKVTVGGKSLPISDSAWEID 283
Query: 326 -------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDL 363
IFDSGT+ TY AY+ I F+S R S L
Sbjct: 284 LLGNGGSIFDSGTTLTYWFPSAYSHILAAFDSGVHYPRAESVQGL 328
>gi|118482048|gb|ABK92955.1| unknown [Populus trichocarpa]
Length = 425
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 80/283 (28%), Positives = 120/283 (42%), Gaps = 45/283 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ +VG PA +F++ALDT +D W+PC+ CV C +++ TS+T
Sbjct: 90 YIVKANVGTPAQTFLMALDTSNDAAWIPCNGCVGCSS------------TVFNSVTSTTF 137
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
+ C++ C+ GS C + Y +S L D + L+TD +
Sbjct: 138 KTLGCDAPQCKQVPNPTCGGSTCTWNTTYGGSTILSN--LTRDTIALSTD------IVPG 189
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS----DGTG 279
+FGC + TGS P GL GLG S S Q L ++FS C S + +G
Sbjct: 190 YTFGCIQKTTGS---SVPPQGLLGLGRGPLSFLS--QTQDLYKSTFSYCLPSFRTLNFSG 244
Query: 280 RISFGDKGSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEFSA-----------I 326
+ G G P + +T L+ + Y + + + VG V+ SA I
Sbjct: 245 TLRLGPAGQPLRIKTTPLLKNPRRSSLYYVNLIGIRVGRKIVDIPASALAFNPTTGAGTI 304
Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
FDSGT FT L P YT + + F +S F+ CY
Sbjct: 305 FDSGTVFTRLVAPVYTAVRDEFRKRVGNAIVSSLGG--FDTCY 345
>gi|356511197|ref|XP_003524315.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 431
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 83/289 (28%), Positives = 125/289 (43%), Gaps = 49/289 (16%)
Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTS 160
+GF + T +++GQPA + + +DTGSDL WL CD C H + P
Sbjct: 68 VGFYNVT-LNIGQPARPYFLDVDTGSDLTWLQCD-APCTHCSETPH----------PLHR 115
Query: 161 STSSKVPCNSTLC-ELQKQCPSAGSNCP------YQVRYLSDGTMSTGFLVEDVLHLATD 213
++ VPC LC LQ P+ NC Y++ Y +D + G L+ DV L +
Sbjct: 116 PSNDFVPCRDPLCASLQ---PTEDYNCEHPDQCDYEINY-ADQYSTYGVLLNDVYLLNSS 171
Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
V R++ GCG Q S +GL GLG K S+ S L +QGL+ N C
Sbjct: 172 NGVQLKV--RMALGCGYDQVFSPSSYHPLDGLLGLGRGKASLISQLNSQGLVRNVIGHCL 229
Query: 274 GSDGTG-----------RISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF- 321
S G G R+++ TP S + Y+ ++ GG
Sbjct: 230 SSQGGGYIFFGNAYDSARVTW----------TPISSVDSK-HYSAGPAELVFGGRKTGVG 278
Query: 322 EFSAIFDSGTSFTYLNDPAYTQISETFN-SLAKEKRETSTSDLPFEYCY 369
+A+FD+G+S+TY N AY + N L+ + + + D C+
Sbjct: 279 SLTAVFDTGSSYTYFNSHAYQALLSWLNKELSGKPLKVAPDDQTLSLCW 327
>gi|356556809|ref|XP_003546713.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-1-like [Glycine max]
Length = 444
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 79/279 (28%), Positives = 115/279 (41%), Gaps = 44/279 (15%)
Query: 112 GQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNS 170
G PA + ++A+DT +D W+PC CV C ++P S+T KV C +
Sbjct: 113 GTPAQTLLLAMDTSNDAAWVPCTACVGCSTT-----------TPFAPPKSTTFKKVGCGA 161
Query: 171 TLCELQKQCPSAGSNCPYQVRYLSDGTMSTGF-LVEDVLHLATDEKQSKSVDSRISFGCG 229
+ C+ + GS C + Y GT S LV+D + LATD + +FGC
Sbjct: 162 SQCKQVRNPTCDGSACAFNFTY---GTSSVAASLVQDTVTLATDPVPA------YTFGCI 212
Query: 230 RVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT----GRISFGD 285
+ TGS L GL + + Q L ++FS C S T G
Sbjct: 213 QKATGSSLPPQGLLGLGRGPLSLLA-----QTQKLYQSTFSYCLPSFKTLNFSGHXDLXP 267
Query: 286 KGSPGQGETP-FSLRQTHPTYNITITQVSVGGNAVNFEFSA-----------IFDSGTSF 333
P P F + Y + + + VG V+ A +FDSGT F
Sbjct: 268 VAQPRDQVYPSFKNPRRSSLYYVNLVAIRVGRRIVDIPPEALAFNPXTGAGTVFDSGTVF 327
Query: 334 TYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEYCYVL 371
T L +PAYT + F ++ K+ T TS F+ CY +
Sbjct: 328 TRLVEPAYTAVRNEFRRRVSVHKKLTVTSLGGFDTCYTV 366
>gi|224057272|ref|XP_002299201.1| predicted protein [Populus trichocarpa]
gi|118483775|gb|ABK93780.1| unknown [Populus trichocarpa]
gi|222846459|gb|EEE84006.1| predicted protein [Populus trichocarpa]
Length = 425
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 80/283 (28%), Positives = 120/283 (42%), Gaps = 45/283 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ +VG PA +F++ALDT +D W+PC+ CV C +++ TS+T
Sbjct: 90 YIVKANVGTPAQTFLMALDTSNDAAWIPCNGCVGCSS------------TVFNSVTSTTF 137
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
+ C++ C+ GS C + Y +S L D + L+TD +
Sbjct: 138 KTLGCDAPQCKQVPNPTCGGSTCTWNTTYGGSTILSN--LTRDTIALSTD------IVPG 189
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS----DGTG 279
+FGC + TGS P GL GLG S S Q L ++FS C S + +G
Sbjct: 190 YTFGCIQKTTGS---SVPPQGLLGLGRGPLSFLS--QTQDLYKSTFSYCLPSFRTLNFSG 244
Query: 280 RISFGDKGSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEFSA-----------I 326
+ G G P + +T L+ + Y + + + VG V+ SA I
Sbjct: 245 TLRLGPAGQPLRIKTTPLLKNPRRSSLYYVNLIGIRVGRKIVDIPASALAFNPTTGAGTI 304
Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
FDSGT FT L P YT + + F +S F+ CY
Sbjct: 305 FDSGTVFTRLVAPVYTAVRDEFRKRVGNAIVSSLGG--FDTCY 345
>gi|449453902|ref|XP_004144695.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-1-like, partial [Cucumis sativus]
Length = 716
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 89/346 (25%), Positives = 137/346 (39%), Gaps = 60/346 (17%)
Query: 29 TFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQG---NDKT 85
+ GF ++ D VK + + L ++G ++R RL LAA D+
Sbjct: 303 SHGFRVRLKHVDHVKNLTRFERL-RRG------VARGKNRLHRLNAMVLAAANATVGDQV 355
Query: 86 PLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSS 145
AGN + + +++G P SF +DTGSDL W C C + S
Sbjct: 356 KAPVVAGNGEFLMK---------LAIGSPPRSFSAIMDTGSDLIW--TQCKPCQQCFDQS 404
Query: 146 SGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVE 205
+ I+ P SS+ K+ C+S LC + C Y Y D + + G L
Sbjct: 405 T------PIFDPKQSSSFYKISCSSELCGALPTSTCSSDGCEYLYTY-GDSSSTQGVLAF 457
Query: 206 DVLHLATDEKQSKSVDSRISFGCGRVQTGS-FLDGAAPNGLFGLGMDKTSVPSILANQGL 264
+ + S+ + FGCG G F GA GL GLG S+ S L Q
Sbjct: 458 ETFTFGDSTEDQISIPG-LGFGCGNDNNGDGFSQGA---GLVGLGRGPLSLVSQLKEQ-- 511
Query: 265 IPNSFSMCFGSDGTGRIS---------FGDKGSPGQGE-TPFSLRQTHPT-YNITITQVS 313
F+ C + + S K S + + TP + P+ Y +++ +S
Sbjct: 512 ---KFAYCLTAIDDSKPSSLLLGSLANITPKTSKDEMKTTPLIKNPSQPSFYYLSLQGIS 568
Query: 314 VGGNAVN-----FEF------SAIFDSGTSFTYLNDPAYTQISETF 348
VGG ++ FE I DSGT+ TY+ + A+T + F
Sbjct: 569 VGGTQLSIPKSTFELHDDGSGGVIIDSGTTITYVENSAFTSLKNEF 614
>gi|356508918|ref|XP_003523200.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 99/372 (26%), Positives = 145/372 (38%), Gaps = 64/372 (17%)
Query: 27 FGTFGFDFHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAA---QGND 83
+ T GF R+ D K + ++ + + + R RL LAA D
Sbjct: 43 YPTKGFRVMLRHVDSGKNLTKLERV-------QHGIKRGKSRLQRLNAMVLAASTLDSED 95
Query: 84 KTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGL 142
+ AGN Y + +++G P +S+ LDTGSDL W C C C
Sbjct: 96 QLEAPIHAGNGEYLME---------LAIGTPPVSYPAVLDTGSDLIWTQCKPCTQCYKQP 146
Query: 143 NSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSA--GSNCPYQVRYLSDGTMST 200
I+ P SS+ SKV C S+LC PS+ C Y Y D +M+
Sbjct: 147 TP---------IFDPKKSSSFSKVSCGSSLCS---AVPSSTCSDGCEYVYSY-GDYSMTQ 193
Query: 201 GFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILA 260
G L + + ++K I FGCG G + A+ GL GLG S+ S L
Sbjct: 194 GVLATETFTFG--KSKNKVSVHNIGFGCGEDNEGDGFEQAS--GLVGLGRGPLSLVSQLK 249
Query: 261 NQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGE-------TPFSLRQTHPT-YNITITQV 312
FS C + S GS G+ + TP P+ Y +++ +
Sbjct: 250 EP-----RFSYCLTPMDDTKESILLLGSLGKVKDAKEVVTTPLLKNPLQPSFYYLSLEGI 304
Query: 313 SVGGNAVNFEFS-----------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTS 361
SVG ++ E S I DSGT+ TY+ A+ + + F S K + TS
Sbjct: 305 SVGDTRLSIEKSTFEVGDDGNGGVIIDSGTTITYIEQKAFEALKKEFISQTKLPLD-KTS 363
Query: 362 DLPFEYCYVLRS 373
+ C+ L S
Sbjct: 364 STGLDLCFSLPS 375
>gi|326524580|dbj|BAK00673.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 461
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 78/274 (28%), Positives = 115/274 (41%), Gaps = 42/274 (15%)
Query: 102 GFLHY-TNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNT 159
G L Y ++++G P LDTGSDL W C C SC+ + +++P
Sbjct: 99 GDLEYLIDLAIGTPPQPVSALLDTGSDLIWTQCAPCASCLAQPDP---------LFAPAA 149
Query: 160 SSTSSKVPCNSTLCE--LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
SS+ + C+ LC L C C Y+ Y DGT + G + A+ +
Sbjct: 150 SSSYVPMRCSGQLCNDILHHSCQRP-DTCTYRYNY-GDGTTTLGVYATERFTFASSSGEK 207
Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQG----LIP----NSF 269
SV + FGCG + GS +G +G+ G G D S+ S L+ + L P
Sbjct: 208 LSVP--LGFGCGTMNVGSLNNG---SGIVGFGRDPLSLVSQLSIRRFSYCLTPYTSTRKS 262
Query: 270 SMCFGSDGTGRISFGDKGSPGQGETPFSL--RQTHPTYNITITQVSVGGNAVNFEFSA-- 325
++ FGS G + GD + GQ +T L RQ Y + T V+VG + SA
Sbjct: 263 TLMFGSLSDG-VFEGDDAATGQVQTTRLLQSRQNPTFYYVPFTGVTVGTRRLRIPLSAFA 321
Query: 326 ---------IFDSGTSFTYLNDPAYTQISETFNS 350
I DSGT+ T T++ F +
Sbjct: 322 LRPDGSGGVIVDSGTALTLFPAAVLTEVLRAFRA 355
>gi|356573235|ref|XP_003554768.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 472
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 85/303 (28%), Positives = 121/303 (39%), Gaps = 44/303 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++T + VG PA + LDTGSD+ WL C C C + ++ P S T
Sbjct: 129 YFTRIGVGTPARYVYMVLDTGSDVVWLQCAPCRKCYTQADP---------VFDPTKSRTY 179
Query: 164 SKVPCNSTLCELQKQ--CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
+ +PC + LC C + C YQV Y DG+ + G + L ++
Sbjct: 180 AGIPCGAPLCRRLDSPGCNNKNKVCQYQVSY-GDGSFTFGDFSTETLTF------RRTRV 232
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GSDG 277
+R++ GCG G F+ A L GLG + S P + FS C S
Sbjct: 233 TRVALGCGHDNEGLFIGAAG---LLGLGRGRLSFPVQTGRR--FNQKFSYCLVDRSASAK 287
Query: 278 TGRISFGDKGSPGQGE-TPFSLRQTHPT-YNITITQVSVGGNAVN------FEFSA---- 325
+ FGD TP T Y + + +SVGG+ V F A
Sbjct: 288 PSSVVFGDSAVSRTARFTPLIKNPKLDTFYYLELLGISVGGSPVRGLSASLFRLDAAGNG 347
Query: 326 --IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRSFLHLQA-LVV 382
I DSGTS T L PAY + + F A + + L F+ C+ L ++ VV
Sbjct: 348 GVIIDSGTSVTRLTRPAYIALRDAFRVGASHLKRAAEFSL-FDTCFDLSGLTEVKVPTVV 406
Query: 383 LPF 385
L F
Sbjct: 407 LHF 409
>gi|218186446|gb|EEC68873.1| hypothetical protein OsI_37494 [Oryza sativa Indica Group]
Length = 353
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 76/277 (27%), Positives = 117/277 (42%), Gaps = 37/277 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ +S+G P + +V +DTGS L W+ C +C + + +GQ I++P SST
Sbjct: 6 YFMGISLGTPPVFNLVTIDTGSTLSWVQCKNCQIKCYDQAAKAGQ-----IFNPYNSSTY 60
Query: 164 SKVPCNSTLCE-------LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
SKV C++ C ++ C C Y +RY S G S G+L +D L LA++
Sbjct: 61 SKVGCSTEACNGMHMDLAVEYGCVEEDDTCIYSLRYGS-GEYSVGYLGKDRLTLASN--- 116
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD 276
+S+D+ I FGCG L G+ G G S + + Q +FS CF D
Sbjct: 117 -RSIDNFI-FGCGEDN----LYNGVNAGIIGFGTKSYSFFNQVCQQTDY-TAFSYCFPRD 169
Query: 277 --GTGRISFGDKGSP-GQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS--------A 325
G ++ G T P Y I Q+ + N + E
Sbjct: 170 HENEGSLTIGPYARDINLMWTKLIYYDHKPAY--AIQQLDMMVNGIRLEIDPYIYISKMT 227
Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSD 362
I DSGT+ TY+ P + + + + K T D
Sbjct: 228 IVDSGTADTYILSPVFDALDKAMTKEMQAKGYTRGWD 264
>gi|356522504|ref|XP_003529886.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 473
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 86/300 (28%), Positives = 119/300 (39%), Gaps = 43/300 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++T + VG P + LDTGSD+ WL C C C S + Q+ D P+ S +
Sbjct: 130 YFTRLGVGTPPKYLYMVLDTGSDVVWLQCKPCTKCY----SQTDQIFD-----PSKSKSF 180
Query: 164 SKVPCNSTLCELQKQ--CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
+ +PC S LC C + C YQV Y DG+ + G + L ++
Sbjct: 181 AGIPCYSPLCRRLDSPGCSLKNNLCQYQVSY-GDGSFTFGDFSTETLTF------RRAAV 233
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GSDG 277
R++ GCG G F+ A L GLG S P+ + N FS C S
Sbjct: 234 PRVAIGCGHDNEGLFVGAAG---LLGLGRGGLSFPTQTGTR--FNNKFSYCLTDRTASAK 288
Query: 278 TGRISFGDKGSPGQGE-TPFSLRQTHPT-YNITITQVSVGGNAVN------FEFSA---- 325
I FGD TP T Y + + +SVGG V F +
Sbjct: 289 PSSIVFGDSAVSRTARFTPLVKNPKLDTFYYVELLGISVGGAPVRGISASFFRLDSTGNG 348
Query: 326 --IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRSFLHLQALVVL 383
I DSGTS T L PAY + + F A + L F+ CY L ++ V+
Sbjct: 349 GVIIDSGTSVTRLTRPAYVSLRDAFRVGASHLKRAPEFSL-FDTCYDLSGLSEVKVPTVV 407
>gi|413944596|gb|AFW77245.1| hypothetical protein ZEAMMB73_545774 [Zea mays]
gi|414876929|tpg|DAA54060.1| TPA: hypothetical protein ZEAMMB73_875469 [Zea mays]
Length = 459
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 80/289 (27%), Positives = 115/289 (39%), Gaps = 40/289 (13%)
Query: 97 RLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPC--DCVSCVHGLNSSSGQVIDFNI 154
R++ G + S+G P DTGSDL W C C + S S
Sbjct: 83 RMDDSGGAYDMEFSMGTPPQKLTALADTGSDLIWAKCGGACTTSCEPQGSPS-------- 134
Query: 155 YSPNTSSTSSKVPCNSTLCELQKQ-----CPSAGSNCPYQVRY---LSDGTMSTGFLVED 206
Y PN SST +K+PC+ LC L + C +AG+ C Y+ Y D + GFL +
Sbjct: 135 YLPNASSTFAKLPCSDRLCSLLRSDSVAWCAAAGAECDYRYSYGLGDDDHHYTQGFLARE 194
Query: 207 VLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIP 266
L D S + FGC G + G+ + P L +Q L
Sbjct: 195 TFTLGADAVPS------VRFGCTTASEGGYGSGSG-------LVGLGRGPLSLVSQ-LNA 240
Query: 267 NSFSMCFGSDGTGR--ISFGDKGS-PGQGETPFSLRQTHPTYNITITQVSVGGNA---VN 320
++F C SD + + FG S G L + Y + + +S+G V
Sbjct: 241 STFMYCLTSDASKASPLLFGSLASLTGAQVQSTGLLASTTFYAVNLRSISIGSATTPGVG 300
Query: 321 FEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
+FDSGT+ TYL +PAY++ F S + T FE C+
Sbjct: 301 EPEGVVFDSGTTLTYLAEPAYSEAKAAFLSQTSLDQVEDTDG--FEACF 347
>gi|224072901|ref|XP_002303933.1| predicted protein [Populus trichocarpa]
gi|222841365|gb|EEE78912.1| predicted protein [Populus trichocarpa]
Length = 370
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 79/286 (27%), Positives = 114/286 (39%), Gaps = 51/286 (17%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ VG P + ++ALD D W+PC CV C +++ S+T
Sbjct: 35 YIVKAKVGTPPQTLLMALDNSYDAAWIPCKGCVGC------------SSTVFNTVKSTTF 82
Query: 164 SKVPCNSTLCELQKQCPS---AGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
+ C + C KQ P+ GS C + Y S +S L D + L+ D
Sbjct: 83 KTLGCGAPQC---KQVPNPICGGSTCTWNTTYGSSTILSN--LTRDTIALSMDPV----- 132
Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS----D 276
+FGC + TGS P GL G G S S Q L ++FS C S +
Sbjct: 133 -PYYAFGCIQKATGS---SVPPQGLLGFGRGPLSFLS--QTQNLYKSTFSYCLPSFRTLN 186
Query: 277 GTGRISFGDKGSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEFSA--------- 325
+G + G G P + +T L+ + Y + + + VG V+ SA
Sbjct: 187 FSGSLRLGPVGQPPRIKTTPLLKNPRRSSLYYVKLNGIRVGRKIVDIPRSALAFNPTTGA 246
Query: 326 --IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
IFDSGT FT L PAY + F + T +S F+ CY
Sbjct: 247 GTIFDSGTVFTRLVAPAYIAVRNEFRK--RVGNATVSSLGGFDTCY 290
>gi|110738505|dbj|BAF01178.1| hypothetical protein [Arabidopsis thaliana]
Length = 284
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 54/174 (31%), Positives = 81/174 (46%), Gaps = 20/174 (11%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
T + +G P F + +D+GS + ++PC DC C G+ D + P SST
Sbjct: 95 TRLWIGTPPQMFALIVDSGSTVTYVPCSDCEQC--------GKHQDPK-FQPEMSSTYQP 145
Query: 166 VPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
V CN + C C Y+ Y ++ + S G L ED++ +S+ R
Sbjct: 146 VKCN-----MDCNCDDDREQCVYEREY-AEHSSSKGVLGEDLISFG---NESQLTPQRAV 196
Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTG 279
FGC V+TG A +G+ GLG S+ L ++GLI NSF +C+G G
Sbjct: 197 FGCETVETGDLYSQRA-DGIIGLGQGDLSLVDQLVDKGLISNSFGLCYGGMDVG 249
>gi|226508202|ref|NP_001141111.1| hypothetical protein precursor [Zea mays]
gi|194702684|gb|ACF85426.1| unknown [Zea mays]
gi|414590469|tpg|DAA41040.1| TPA: hypothetical protein ZEAMMB73_571218 [Zea mays]
Length = 439
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 88/346 (25%), Positives = 136/346 (39%), Gaps = 57/346 (16%)
Query: 52 PKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKT---PLTFSAGNDTYRLNSLGFLHYTN 108
P + + A HRD + R R LAA +D T P++ + + +
Sbjct: 39 PSVTASQFVRAALHRDMH-RHNARKLAASSSDGTVSAPVSPTTVPGEFLMT--------- 88
Query: 109 VSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVID--FNIYSPNTSSTSSKV 166
+++G P L F+ DTGSDL W C C S Q +Y+P++S+T S +
Sbjct: 89 LAIGTPPLPFLAIADTGSDLIW--TQCAPC-------SRQCFQQPTPLYNPSSSTTFSAL 139
Query: 167 PCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISF 226
PCNS+L C C Y + Y S T F + + + I+F
Sbjct: 140 PCNSSLGLCAPAC-----ACMYNMTYGSGWTYV--FQGTETFTFGSSTPADQVRVPGIAF 192
Query: 227 GCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GSDGTGRIS 282
GC +G + ++ +GL GLG S+ S L FS C ++ T +
Sbjct: 193 GCSNASSG--FNASSASGLVGLGRGSLSLVSQLGAP-----KFSYCLTPYQDTNSTSTLL 245
Query: 283 FGDKGSPGQ----GETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA-----------IF 327
G S TPF + Y + +T +S+G A+ +A I
Sbjct: 246 LGPSASLNDTGVVSSTPFVASPSSIYYYLNLTGISLGTTALPIPPNAFSLKADGTGGLII 305
Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRS 373
DSGT+ T L + AY Q+ SL ++ + C+ L S
Sbjct: 306 DSGTTITMLGNTAYQQVRAAVLSLVTLPTTDGSAATGLDLCFELPS 351
>gi|357486591|ref|XP_003613583.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355514918|gb|AES96541.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 437
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 81/278 (29%), Positives = 120/278 (43%), Gaps = 35/278 (12%)
Query: 111 VGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCN 169
+G P +DT +D W C+ C C N++S ++ P+ SST +PC+
Sbjct: 95 IGTPPFQLYGVMDTANDNIWFQCNPCKPC---FNTTSP------MFDPSKSSTYKTIPCS 145
Query: 170 STLCE--LQKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISF 226
S C+ C S C Y Y + S G L D L L ++ S + I
Sbjct: 146 SPKCKNVENTHCSSDDKKVCEYSFTYGGEA-YSQGDLSIDTLTLNSNNDTPISFKN-IVI 203
Query: 227 GCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC----FGSDG-TGRI 281
GCG G L+G +G GLG S S L + I FS C F ++G +G++
Sbjct: 204 GCGHRNKGP-LEGYV-SGNIGLGRGPLSFISQLNSS--IGGKFSYCLVPLFSNEGISGKL 259
Query: 282 SFGDKG-SPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS---------AIFDSGT 331
FGDK G G + Y+ T+ +SVG + + FE S I DSGT
Sbjct: 260 HFGDKSVVSGVGTVSTPITAGEIGYSTTLNALSVGDHIIKFENSTSKNDNLGNTIIDSGT 319
Query: 332 SFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
+ T L + Y+++ S+ K +R S + F+ CY
Sbjct: 320 TLTILPENVYSRLESIVTSMVKLERAKSPNQ-QFKLCY 356
>gi|15234607|ref|NP_194733.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|4938480|emb|CAB43839.1| putative protein [Arabidopsis thaliana]
gi|7269904|emb|CAB80997.1| putative protein [Arabidopsis thaliana]
gi|67633776|gb|AAY78812.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332660311|gb|AEE85711.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 427
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 78/292 (26%), Positives = 119/292 (40%), Gaps = 55/292 (18%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
N+S+G P ++ ++ +DT SDL W+ C C++C I+ P+ S T
Sbjct: 87 VNISIGSPPITQLLHMDTASDLLWIQCLPCINC---------YAQSLPIFDPSRSYTHRN 137
Query: 166 VPCNSTLCELQKQCPSAGSN-----CPYQVRYLSDGTMSTGFLVEDVLHLAT--DEKQSK 218
C ++ Q PS N C Y +RY+ D T S G L ++L T DE S
Sbjct: 138 ETCRTS----QYSMPSLKFNANTRSCEYSMRYVDD-TGSKGILAREMLLFNTIYDESSSA 192
Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT 278
++ + FGCG G L G G+ GLG + S+ + FS CFGS
Sbjct: 193 ALHD-VVFGCGHDNYGEPLVGT---GILGLGYGEFSLVHRFGKK------FSYCFGSLDD 242
Query: 279 -----GRISFGDKGSPGQGE-TPFSLRQTHPTYNITITQVSVGGNAVNFEFSA------- 325
+ GD G+ G+ TP + Y +TI +SV G + +
Sbjct: 243 PSYPHNVLVLGDDGANILGDTTPLEIHNGF--YYVTIEAISVDGIILPIDPRVFNRNHQT 300
Query: 326 -----IFDSGTSFTYLNDPAYTQISETFNSLAKEK---RETSTSDLPFEYCY 369
I D+G S T L + AY + + + + + S D+ CY
Sbjct: 301 GLGGTIIDTGNSLTSLVEEAYKPLKNRIEDIFEGRFTAADVSQDDMIKMECY 352
>gi|125529158|gb|EAY77272.1| hypothetical protein OsI_05246 [Oryza sativa Indica Group]
Length = 426
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 77/286 (26%), Positives = 113/286 (39%), Gaps = 46/286 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ +G PA + +VA+D +D W+PC C C S +SP SST
Sbjct: 83 YIARAGLGTPAQTLLVAIDPSNDAAWVPCSACAGCAASSPS----------FSPTQSSTY 132
Query: 164 SKVPCNSTLCEL--QKQCPS-AGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
VPC S C CP+ GS+C + + Y + + L +D L L + V
Sbjct: 133 RTVPCGSPQCAQVPSPSCPAGVGSSCGFNLTYAASTFQA--VLGQDSLALENN------V 184
Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GSD 276
+FGC RV +G + P GL G G S + + + FS C S+
Sbjct: 185 VVSYTFGCLRVVSG---NSVPPQGLIGFGRGPLSF--LSQTKDTYGSVFSYCLPNYRSSN 239
Query: 277 GTGRISFGDKGSPGQGETPFSLRQTH-PT-YNITITQVSVGGNAVNFEFSA--------- 325
+G + G G P + +T L H P+ Y + + + VG V SA
Sbjct: 240 FSGTLKLGPIGQPKRIKTTPLLYNPHRPSLYYVNMIGIRVGSKVVQVPQSALAFNPVTGS 299
Query: 326 --IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
I D+GT FT L P Y + + F + F+ CY
Sbjct: 300 GTIIDAGTMFTRLAAPVYAAVRDAFRGRVRTPVAPPLGG--FDTCY 343
>gi|148908573|gb|ABR17396.1| unknown [Picea sitchensis]
Length = 350
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 77/289 (26%), Positives = 122/289 (42%), Gaps = 41/289 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++T + +G P + LDTGSD+ W+ C+ C C + I++P++S +
Sbjct: 8 YFTRIGIGTPTREQYMVLDTGSDVVWIQCEPCRECYSQADP---------IFNPSSSVSF 58
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
S V C+S +C G C Y+V Y DG+ + G + L T Q+
Sbjct: 59 STVGCDSAVCSQLDANDCHGGGCLYEVSY-GDGSYTVGSYATETLTFGTTSIQN------ 111
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGTGR 280
++ GCG G F+ A GL + S P+ L Q +FS C S+ +G
Sbjct: 112 VAIGCGHDNVGLFVGAAGLLGLGAGSL---SFPAQLGTQ--TGRAFSYCLVDRDSESSGT 166
Query: 281 ISFGDKGSP-GQGETPFSLRQTHPT-YNITITQVSVGGNAVN------FEFSA------- 325
+ FG + P G TP PT Y +++ +SVGG ++ F
Sbjct: 167 LEFGPESVPIGSIFTPLVANPFLPTFYYLSMVAISVGGVILDSVPSEAFRIDETTGRGGI 226
Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRSF 374
I DSGT+ T L AY + + F + + + F+ CY L +
Sbjct: 227 IIDSGTAVTRLQTSAYDALRDAFIAGTQHLPRADGISI-FDTCYDLSAL 274
>gi|15228618|ref|NP_191741.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|6850873|emb|CAB71112.1| putative protein [Arabidopsis thaliana]
gi|332646739|gb|AEE80260.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 483
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 89/308 (28%), Positives = 127/308 (41%), Gaps = 53/308 (17%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ + VG PA + + LDTGSD+ WL C C +C + ++ I+ P S T
Sbjct: 135 YFMRLGVGTPATNVYMVLDTGSDVVWLQCSPCKACYNQTDA---------IFDPKKSKTF 185
Query: 164 SKVPCNSTLCEL---QKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
+ VPC S LC +C + S C YQV Y DG+ + G + L
Sbjct: 186 ATVPCGSRLCRRLDDSSECVTRRSKTCLYQVSY-GDGSFTEGDFSTETLTF-----HGAR 239
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF------ 273
VD + GCG G F+ A GLG S PS N+ FS C
Sbjct: 240 VD-HVPLGCGHDNEGLFVGAAGLL---GLGRGGLSFPSQTKNR--YNGKFSYCLVDRTSS 293
Query: 274 --GSDGTGRISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGGNAV------NF 321
S I FG+ P + F+ T+P Y + + +SVGG+ V F
Sbjct: 294 GSSSKPPSTIVFGNAAVP--KTSVFTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSESQF 351
Query: 322 EFSA------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRSFL 375
+ A I DSGTS T L PAY + + F L K + + S F+ C+ L
Sbjct: 352 KLDATGNGGVIIDSGTSVTRLTQPAYVALRDAFR-LGATKLKRAPSYSLFDTCFDLSGMT 410
Query: 376 HLQALVVL 383
++ V+
Sbjct: 411 TVKVPTVV 418
>gi|356575389|ref|XP_003555824.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 473
Score = 71.6 bits (174), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 71/295 (24%), Positives = 118/295 (40%), Gaps = 39/295 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ + VG P + V +D+GSD+ W+ C+ C C H + +++P SS+
Sbjct: 134 YFVRIGVGSPPRNQYVVIDSGSDIIWVQCEPCTQCYHQSDP---------VFNPADSSSY 184
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
+ V C ST+C C Y+V Y DG+ + G L + L +++
Sbjct: 185 AGVSCASTVCSHVDNAGCHEGRCRYEVSY-GDGSYTKGTLALETLTFG------RTLIRN 237
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDG---TGR 280
++ GCG G F+ A GL GLG S L Q +FS C S G +G
Sbjct: 238 VAIGCGHHNQGMFVGAA---GLLGLGSGPMSFVGQLGGQA--GGTFSYCLVSRGIQSSGL 292
Query: 281 ISFGDKGSP-GQGETPFSLRQTHPTY--------NITITQVSVGGNAVNF----EFSAIF 327
+ FG + P G P ++ + +V + + + +
Sbjct: 293 LQFGREAVPVGAAWVPLIHNPRAQSFYYVGLSGLGVGGLRVPISEDVFKLSELGDGGVVM 352
Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRSFLHLQALVV 382
D+GT+ T L AY + F + S + F+ CY L F+ ++ V
Sbjct: 353 DTGTAVTRLPTAAYEAFRDAFIAQTTNLPRASGVSI-FDTCYDLFGFVSVRVPTV 406
>gi|115442309|ref|NP_001045434.1| Os01g0954900 [Oryza sativa Japonica Group]
gi|20161865|dbj|BAB90778.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
gi|113534965|dbj|BAF07348.1| Os01g0954900 [Oryza sativa Japonica Group]
gi|215766867|dbj|BAG99095.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 445
Score = 71.6 bits (174), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 77/286 (26%), Positives = 113/286 (39%), Gaps = 46/286 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ +G PA + +VA+D +D W+PC C C S +SP SST
Sbjct: 102 YIARAGLGTPAQTLLVAIDPSNDAAWVPCSACAGCAASSPS----------FSPTQSSTY 151
Query: 164 SKVPCNSTLCEL--QKQCPS-AGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
VPC S C CP+ GS+C + + Y + + L +D L L + V
Sbjct: 152 RTVPCGSPQCAQVPSPSCPAGVGSSCGFNLTYAASTFQA--VLGQDSLALENN------V 203
Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GSD 276
+FGC RV +G + P GL G G S + + + FS C S+
Sbjct: 204 VVSYTFGCLRVVSG---NSVPPQGLIGFGRGPLSF--LSQTKDTYGSVFSYCLPNYRSSN 258
Query: 277 GTGRISFGDKGSPGQGETPFSLRQTH-PT-YNITITQVSVGGNAVNFEFSA--------- 325
+G + G G P + +T L H P+ Y + + + VG V SA
Sbjct: 259 FSGTLKLGPIGQPKRIKTTPLLYNPHRPSLYYVNMIGIRVGSKVVQVPQSALAFNPVTGS 318
Query: 326 --IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
I D+GT FT L P Y + + F + F+ CY
Sbjct: 319 GTIIDAGTMFTRLAAPVYAAVRDAFRGRVRTPVAPPLGG--FDTCY 362
>gi|242044724|ref|XP_002460233.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
gi|241923610|gb|EER96754.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
Length = 512
Score = 71.6 bits (174), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 99/355 (27%), Positives = 139/355 (39%), Gaps = 77/355 (21%)
Query: 40 DPVKGILAVDDLPKKGSFAYYSALAHR-DRYFRLR------GRGLAAQGNDKTPLTFSAG 92
+ V G+L+ D A S+L R DRY RL A + P+T A
Sbjct: 95 EEVDGLLSTD-------AARVSSLQRRIDRYRRLMITSSAEVAVAVAASKAQVPVTSGA- 146
Query: 93 NDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVID 151
+L +L ++ + G+ V +DT S+L W+ C C SC +
Sbjct: 147 ----KLRTLNYVATVGLGGGEAT----VIVDTASELTWVQCAPCESCHDQQDP------- 191
Query: 152 FNIYSPNTSSTSSKVPCNSTLCEL---------------QKQCPSAGSNCPYQVRYLSDG 196
++ P++S + + VPCNS+ C+ Q Q SA + C Y + Y DG
Sbjct: 192 --LFDPSSSPSYAAVPCNSSSCDALQLATGGTSGGAAACQGQDQSAAA-CSYTLSY-RDG 247
Query: 197 TMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVP 256
+ S G L D L LA + V FGCG G G + GL GLG + S+
Sbjct: 248 SYSRGVLAHDRLSLAGE------VIDGFVFGCGTSNQGPPFGGTS--GLMGLGRSQLSLV 299
Query: 257 SILANQGLIPNSFSMCF---GSDGTGRISFGDKGSPGQGETPFSLRQT------HPTYNI 307
S +Q FS C SD +G + GD S + TP P Y +
Sbjct: 300 SQTMDQ--FGGVFSYCLPLKESDSSGSLVIGDDSSVYRNSTPIVYASMVSDPLQGPFYFV 357
Query: 308 TITQVSVGGNAVN--------FEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKE 354
+T ++VGG V AI DSGT T L Y + F S E
Sbjct: 358 NLTGITVGGQEVESSGFSSGGGGGKAIIDSGTVITSLVPSIYNAVKAEFLSQFAE 412
>gi|125555058|gb|EAZ00664.1| hypothetical protein OsI_22685 [Oryza sativa Indica Group]
Length = 465
Score = 71.6 bits (174), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 85/286 (29%), Positives = 125/286 (43%), Gaps = 42/286 (14%)
Query: 95 TYRLNSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWL---PCDCVSCVHGLNSSSGQVI 150
T+ +S+ L Y + +G PA+ IV +DTGSDL W+ PC C +
Sbjct: 107 TFLGDSVDSLEYVVTLGIGTPAVQQIVLIDTGSDLSWVQCKPCGAGECYAQKDP------ 160
Query: 151 DFNIYSPNTSSTSSKVPCNSTLCE------LQKQCPS-AGSNCPYQVRYLSDGTMSTGFL 203
++ P++SS+ + VPC+S C C S A + C Y + Y + T +TG
Sbjct: 161 ---LFDPSSSSSYASVPCDSDACRKLAAGAYGHGCTSGAAALCEYGIEYGNRAT-TTGVY 216
Query: 204 VEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQG 263
+ L L + V + FGCG Q G + +GL GLG S+ S ++Q
Sbjct: 217 STETLTL-----KPGVVVADFGFGCGDHQHGPYEKF---DGLLGLGGAPESLVSQTSSQF 268
Query: 264 LIPNSFSMCFGSDGTGRISFG-------DKGSPGQGETPFSLRQTHPT-YNITITQVSVG 315
P S+ + S G G ++ G + G TP + PT Y +T+T +SVG
Sbjct: 269 GGPFSYCLPPTSGGAGFLALGAPNSSSSSTAAAGFLFTPMRRIPSVPTFYVVTLTGISVG 328
Query: 316 GNAVNFEFSA-----IFDSGTSFTYLNDPAYTQISETFNSLAKEKR 356
G + SA + DSGT T L AY + F S E R
Sbjct: 329 GAPLAVPPSAFSSGMVIDSGTVITGLPATAYAALRSAFRSAMSEYR 374
>gi|302824729|ref|XP_002994005.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
gi|300138167|gb|EFJ04945.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
Length = 462
Score = 71.6 bits (174), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 74/263 (28%), Positives = 113/263 (42%), Gaps = 33/263 (12%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+YT++ +G P I+ +DTGS+L WL C C C +++ IY S++
Sbjct: 100 YYTSIKLGSPGQEAILIVDTGSELTWLQCLPCKVCAPSVDT---------IYDAARSASY 150
Query: 164 SKVPC-NSTLCELQKQCPSA----GSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
V C NS LC Q A GS C + Y DG+ S G L D L + T
Sbjct: 151 RPVTCNNSQLCSNSSQGTYAYCARGSQCQFAAFY-GDGSFSYGSLSTDTLIMETVVGGKP 209
Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS--- 275
+FGC + GA+ G+ GL K ++P L + FS CF
Sbjct: 210 VTVQDFAFGCAQGDLELVPTGAS--GILGLNAGKMALPMQLGQR--FGWKFSHCFPDRSS 265
Query: 276 --DGTGRISFGDKGSPGQGETPFSLRQTHPT-----YNITITQVSVGGNAVNF---EFSA 325
+ TG + FG+ P + S+ T+ Y++ + VS+ + + F
Sbjct: 266 HLNSTGVVFFGNAELPHEQVQYTSVALTNSELQRKFYHVALKGVSINSHELVFLPRGSVV 325
Query: 326 IFDSGTSFTYLNDPAYTQISETF 348
I DSG+SF+ P ++Q+ E F
Sbjct: 326 ILDSGSSFSSFVRPFHSQLREAF 348
>gi|125571841|gb|EAZ13356.1| hypothetical protein OsJ_03278 [Oryza sativa Japonica Group]
Length = 447
Score = 71.6 bits (174), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 58/149 (38%), Positives = 73/149 (48%), Gaps = 24/149 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ V VG P+ ++ +DTGSDL WL C C C + GQV D P SST
Sbjct: 86 YFALVGVGTPSTKAMLVIDTGSDLVWLQCSPCRRCY----AQRGQVFD-----PRRSSTY 136
Query: 164 SKVPCNSTLCELQK--QCPS---AGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
+VPC+S C + C S AG C Y V Y DG+ STG L D L A D
Sbjct: 137 RRVPCSSPQCRALRFPGCDSGGAAGGGCRYMVAY-GDGSSSTGDLATDKLAFAND----- 190
Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFG 247
+ + ++ GCGR G F D AA GL G
Sbjct: 191 TYVNNVTLGCGRDNEGLF-DSAA--GLLG 216
>gi|302780575|ref|XP_002972062.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
gi|300160361|gb|EFJ26979.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
Length = 429
Score = 71.6 bits (174), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 82/301 (27%), Positives = 120/301 (39%), Gaps = 40/301 (13%)
Query: 101 LGFLHY-TNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNT 159
LG Y +++ G P ++ DTGSDL WL C + + +
Sbjct: 48 LGLGQYLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKA-CSRRPAFVASK 106
Query: 160 SSTSSKVPCNSTLCELQKQ--------CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLA 211
S+T S VPC++ C L P+A C Y Y +DG+ +TGFL D ++
Sbjct: 107 SATLSVVPCSAAQCLLVPAPRGHGPACSPAAPVPCGYAYDY-ADGSSTTGFLARDTATIS 165
Query: 212 TDEKQSKSVDSRISFGCG-RVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFS 270
+V ++FGCG R Q GSF + G+ GLG + S P+ + L +FS
Sbjct: 166 NGTSGGAAVRG-VAFGCGTRNQGGSF---SGTGGVIGLGQGQLSFPA--QSGSLFAQTFS 219
Query: 271 MCFGSDGTGRI----SFGDKGSPGQ----GETPFSLRQTHPT-YNITITQVSVGGNAVNF 321
C GR SF G P + TP PT Y + + + VG +
Sbjct: 220 YCLLDLEGGRRGRSSSFLFLGRPERRAAFAYTPLVSNPLAPTFYYVGVVAIRVGNRVLPV 279
Query: 322 EFS-----------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDL--PFEYC 368
S + DSG++ TYL AY + F + R S++ E C
Sbjct: 280 PGSEWAIDVLGNGGTVIDSGSTLTYLRLGAYLHLVSAFAASVHLPRIPSSATFFQGLELC 339
Query: 369 Y 369
Y
Sbjct: 340 Y 340
>gi|54290717|dbj|BAD62387.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|215734915|dbj|BAG95637.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 469
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 91/299 (30%), Positives = 128/299 (42%), Gaps = 45/299 (15%)
Query: 63 LAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHY-TNVSVGQPALSFIVA 121
L H R + G G + + PLT A S+ +Y T + +G PA S+++
Sbjct: 96 LLHGHRKKKAGGVGGSQASSSSVPLTPGA--------SVAVGNYVTRLGLGTPATSYVMV 147
Query: 122 LDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC-ELQKQC- 179
+DTGS L WL C C + +G V D P S T + V C+S+ C ELQ
Sbjct: 148 VDTGSSLTWL--QCSPCSVSCHRQAGPVFD-----PRASGTYAAVQCSSSECGELQAATL 200
Query: 180 -PSAGS---NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGS 235
PSA S C YQ Y D + S G+L +D + + +GCG+ G
Sbjct: 201 NPSACSVSNVCIYQASY-GDSSYSVGYLSKDTVSFGSGSFPG------FYYGCGQDNEGL 253
Query: 236 FLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--GSDGTGRISFGDKGSPGQ-G 292
F A GL GL +K S+ LA + +FS C S G +S G +PGQ
Sbjct: 254 FGRSA---GLIGLAKNKLSLLYQLAPS--LGYAFSYCLPTSSAAAGYLSIGSY-NPGQYS 307
Query: 293 ETPFSLRQTHPT-YNITITQVSVGGNAVNF------EFSAIFDSGTSFTYLNDPAYTQI 344
TP + + Y +T++ +SV G + I DSGT T L YT +
Sbjct: 308 YTPMASSSLDASLYFVTLSGISVAGAPLAVPPSEYRSLPTIIDSGTVITRLPPNVYTAL 366
>gi|356509399|ref|XP_003523437.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 421
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 83/285 (29%), Positives = 124/285 (43%), Gaps = 37/285 (12%)
Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPN 158
LG+ + ++++G P + + +DTGSDL W+ CD C C N +Y P+
Sbjct: 61 LGY-YTVSLAIGNPPKVYDLDIDTGSDLTWVQCDAPCKGCTLPRN---------RLYKPH 110
Query: 159 TSSTSSKVPCNSTLCELQKQCPS---AGSN--CPYQVRYLSDGTMSTGFLVEDVLHLA-T 212
V C LC + P+ AG N C Y+V Y G+ S G L+ D + L T
Sbjct: 111 ----GDLVKCVDPLCAAIQSAPNHHCAGPNEQCDYEVEYADQGS-SLGVLLRDNIPLKFT 165
Query: 213 DEKQSKSVDSRISFGCGRVQTGSFLDGAAP----NGLFGLGMDKTSVPSILANQGLIPNS 268
+ ++ + ++FGCG QT G P G+ GLG +TS+ S L + GLI N
Sbjct: 166 NGSLARPM---LAFGCGYDQTHH---GQNPPPSTAGVLGLGNGRTSILSQLHSLGLIRNV 219
Query: 269 FSMCFGSDGTGRISFGDKGSPGQG--ETPFSLRQTHPTYNITITQVSVGGNAVNFE-FSA 325
C G G + FGD+ P G TP + Y + + +
Sbjct: 220 VGHCLSGRGGGFLFFGDQLIPPSGVVWTPLLQSSSAQHYKTGPADLFFDRKTTSVKGLEL 279
Query: 326 IFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEYCY 369
IFDSG+S+TY N A+ + N L + +T D C+
Sbjct: 280 IFDSGSSYTYFNSQAHKALVNLIANDLRGKPLSRATGDPSLPICW 324
>gi|226494448|ref|NP_001141341.1| uncharacterized protein LOC100273432 precursor [Zea mays]
gi|194704078|gb|ACF86123.1| unknown [Zea mays]
gi|413953775|gb|AFW86424.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 471
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 90/283 (31%), Positives = 132/283 (46%), Gaps = 38/283 (13%)
Query: 99 NSLGFLHY-TNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSP 157
S+G +Y T + +G P+ S+ + +DTGS L WL C CV + G + D P
Sbjct: 127 TSVGVGNYVTQLGLGTPSTSYAMVVDTGSSLTWL--QCSPCVVSCHRQVGPLFD-----P 179
Query: 158 NTSSTSSKVPCNSTLC-ELQKQC--PSAGSN---CPYQVRYLSDGTMSTGFLVEDVLHLA 211
SST + V C+++ C ELQ PSA S C YQ Y D + S G+L D +
Sbjct: 180 RASSTYTSVRCSASQCDELQAATLNPSACSASNVCIYQASY-GDSSFSVGYLSTDTVSFG 238
Query: 212 TDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSM 271
+ S +GCG+ G F A GL GL +K S+ LA + SFS
Sbjct: 239 STSYPS------FYYGCGQDNEGLFGRSA---GLIGLARNKLSLLYQLAPS--LGYSFSY 287
Query: 272 CFGSDG-TGRISFGDKGSPGQ--GETPFSLRQTHPT-YNITITQVSVGGNAVNF---EFS 324
C + TG +S G + G TP + + Y IT++ +SVGG+ + E+S
Sbjct: 288 CLPTAASTGYLSIGPYNT-GHYYSYTPMASSSLDASLYFITLSGMSVGGSPLAVSPSEYS 346
Query: 325 A---IFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDL 363
+ I DSGT T L +T +S+ ++A +R + S L
Sbjct: 347 SLPTIIDSGTVITRLPTAVHTALSKAVAQAMAGAQRAPAFSIL 389
>gi|125540928|gb|EAY87323.1| hypothetical protein OsI_08727 [Oryza sativa Indica Group]
Length = 463
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 83/275 (30%), Positives = 121/275 (44%), Gaps = 36/275 (13%)
Query: 99 NSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWL---PCDCVSCVHGLNSSSGQVIDFNI 154
+SL L Y +V +G PA++ V +DTGSD+ W+ PC C ++ +G + D
Sbjct: 120 SSLDTLEYVISVGLGTPAVTQTVTIDTGSDVSWVQCNPCPNPPC----HAQTGALFD--- 172
Query: 155 YSPNTSSTSSKVPCNSTLC-ELQKQ---CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL 210
P SST V C + C +L++Q C + C Y V+Y DG+ + G D L L
Sbjct: 173 --PAKSSTYRAVSCAAAECAQLEQQGNGCGATNYECQYGVQY-GDGSTTNGTYSRDTLTL 229
Query: 211 ATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFS 270
+ K FGC +++G F D +GL GLG S+ S A NSFS
Sbjct: 230 SGASDAVKG----FQFGCSHLESG-FSD--QTDGLMGLGGGAQSLVSQTA--AAYGNSFS 280
Query: 271 MCF----GSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVN-----F 321
C GS G + G S +Q Y + ++VGG + F
Sbjct: 281 YCLPPTSGSSGFLTLGGGGGASGFVTTRMLRSKQIPTFYGARLQDIAVGGKQLGLSPSVF 340
Query: 322 EFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKR 356
++ DSGT T L AY+ +S F + K+ R
Sbjct: 341 AAGSVVDSGTIITRLPPTAYSALSSAFKAGMKQYR 375
>gi|413918484|gb|AFW58416.1| hypothetical protein ZEAMMB73_998053 [Zea mays]
Length = 475
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 83/274 (30%), Positives = 113/274 (41%), Gaps = 57/274 (20%)
Query: 108 NVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
++SVG PAL + +DTGSDL W C CV N ++ ++ P SST + +P
Sbjct: 119 DLSVGTPALPYAAIVDTGSDLVW--TQCKPCVECFNQTT------PVFDPAASSTYAALP 170
Query: 168 CNSTLCE--------LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
C+S LC SA S C Y Y D + + G L + LA +
Sbjct: 171 CSSALCADLPTSTCASSSSSSSASSPCGYTYTY-GDASSTQGVLATETFTLARQKVPG-- 227
Query: 220 VDSRISFGCGRVQTGS-FLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS--D 276
++FGCG G F GA GL GLG S+ S L + FS C S D
Sbjct: 228 ----VAFGCGDTNEGDGFTQGA---GLVGLGRGPLSLVSQLGI-----DRFSYCLTSLDD 275
Query: 277 GTGRISF----------GDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA 325
GR +P Q TP + P+ Y +++T ++VG + SA
Sbjct: 276 AAGRSPLLLGSAAGISASAATAPAQ-TTPLVKNPSQPSFYYVSLTGLTVGSTRLALPSSA 334
Query: 326 -----------IFDSGTSFTYLNDPAYTQISETF 348
I DSGTS TYL AY + + F
Sbjct: 335 FAIQDDGTGGVIVDSGTSITYLELRAYRALRKAF 368
>gi|302781668|ref|XP_002972608.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
gi|300160075|gb|EFJ26694.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
Length = 430
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 83/305 (27%), Positives = 122/305 (40%), Gaps = 40/305 (13%)
Query: 101 LGFLHY-TNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNT 159
LG Y +++ G P ++ DTGSDL WL C + + +
Sbjct: 49 LGLGQYLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKA-CSRRPAFVASK 107
Query: 160 SSTSSKVPCNSTLCELQKQ--------CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLA 211
S+T S VPC++ C L P+A C Y Y +DG+ +TGFL D ++
Sbjct: 108 SATLSVVPCSAAQCLLVPAPRGHGPSCSPAAPVPCGYAYDY-ADGSSTTGFLARDTATIS 166
Query: 212 TDEKQSKSVDSRISFGCG-RVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFS 270
+V ++FGCG R Q GSF + G+ GLG + S P+ + L +FS
Sbjct: 167 NGTSGGAAVRG-VAFGCGTRNQGGSF---SGTGGVIGLGQGQLSFPA--QSGSLFAQTFS 220
Query: 271 MCFGSDGTGRI----SFGDKGSPGQ----GETPFSLRQTHPT-YNITITQVSVGGNAVNF 321
C GR SF G P + TP PT Y + + + VG +
Sbjct: 221 YCLLDLEGGRRGRSSSFLFLGRPERRAAFAYTPLVSNPLAPTFYYVGVVAIRVGNRVLPV 280
Query: 322 EFS-----------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDL--PFEYC 368
S + DSG++ TYL AY + F + R S++ E C
Sbjct: 281 PGSEWAIDVLGNGGTVIDSGSTLTYLRLGAYLHLVSAFAASVHLPRIPSSATFFQGLELC 340
Query: 369 YVLRS 373
Y + S
Sbjct: 341 YNVSS 345
>gi|116790042|gb|ABK25480.1| unknown [Picea sitchensis]
Length = 460
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 80/260 (30%), Positives = 113/260 (43%), Gaps = 41/260 (15%)
Query: 108 NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
+++G P+LSF LDTGSDL W C C C IY P+ SST SKV
Sbjct: 118 KMAIGTPSLSFSAILDTGSDLTWTQCKPCTDCYPQPTP---------IYDPSQSSTYSKV 168
Query: 167 PCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISF 226
PC+S++C+ +G+NC Y Y D + + G L + L S+S+ I+F
Sbjct: 169 PCSSSMCQALPMYSCSGANCEYLYSY-GDQSSTQGILSYESFTLT-----SQSL-PHIAF 221
Query: 227 GCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-----DGTGRI 281
GCG Q + GL G G S+ S L + N FS C S T +
Sbjct: 222 GCG--QENEGGGFSQGGGLVGFGRGPLSLISQLGQS--LGNKFSYCLVSITDSPSKTSPL 277
Query: 282 SFGDKGSPGQ---GETPFSLRQTHPT-YNITITQVSVGGNAVN-----FEFS------AI 326
G S TP ++ PT Y +++ +SVGG ++ F+ I
Sbjct: 278 FIGKTASLNAKTVSSTPLVQSRSRPTFYYLSLEGISVGGQLLDIADGTFDLQLDGTGGVI 337
Query: 327 FDSGTSFTYLNDPAYTQISE 346
DSGT+ TYL Y + +
Sbjct: 338 IDSGTTVTYLEQSGYDVVKK 357
>gi|88174563|gb|ABD39356.1| chloroplast nucleoid DNA-binding protein [Oryza longistaminata]
Length = 323
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 76/293 (25%), Positives = 125/293 (42%), Gaps = 46/293 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+ +V +G P+ + I+ +DTGS W+ C+C C H + + + S+T +
Sbjct: 1 YVISVGLGTPSKTQILEIDTGSSTSWVFCECDGC-H---------TNPRTFLQSRSTTCA 50
Query: 165 KVPCNSTLCELQKQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
KV C +++C L P +CP++V Y DG+ S G L +D L + +K
Sbjct: 51 KVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSY-QDGSASYGILYQDTLTFSDVQKIPG- 108
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC------- 272
SFGC G+ G +GL G+G SV L + FS C
Sbjct: 109 ----FSFGCNMDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPTFDGFSYCLPLQMSE 160
Query: 273 --FGSDGTGRISFGDKGSPGQGETPF----SLRQTHPTYNITITQVSVGGNAVNFEFS-- 324
F S TG S G K + + + + + R+ + + +T +SV G + S
Sbjct: 161 RGFFSKTTGYFSLGGKIAATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSIF 220
Query: 325 ----AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRS 373
+FDSG+ +Y+ D A + +S+ L R + + CY +RS
Sbjct: 221 SRKGVVFDSGSELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRS 271
>gi|115438214|ref|NP_001043485.1| Os01g0598600 [Oryza sativa Japonica Group]
gi|15290061|dbj|BAB63755.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
Group]
gi|113533016|dbj|BAF05399.1| Os01g0598600 [Oryza sativa Japonica Group]
gi|125526702|gb|EAY74816.1| hypothetical protein OsI_02707 [Oryza sativa Indica Group]
Length = 500
Score = 71.2 bits (173), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 80/296 (27%), Positives = 115/296 (38%), Gaps = 49/296 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++T + VG P ++ LDTGSD+ WL C C C SGQ+ D P S +
Sbjct: 147 YFTKIGVGTPVTPALMVLDTGSDVVWLQCAPCRRCY----DQSGQMFD-----PRASHSY 197
Query: 164 SKVPCNSTLCELQKQ--CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
V C + LC C C YQV Y DG+++ G + L A+ +
Sbjct: 198 GAVDCAAPLCRRLDSGGCDLRRKACLYQVAY-GDGSVTAGDFATETLTFASGARV----- 251
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-------- 273
R++ GCG G F+ A GL S PS ++ + SFS C
Sbjct: 252 PRVALGCGHDNEGLFVAAAGLLGLG---RGSLSFPSQISRR--FGRSFSYCLVDRTSSSA 306
Query: 274 -GSDGTGRISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGGNAV--------- 319
+ + ++FG F+ +P Y + + +SVGG V
Sbjct: 307 SATSRSSTVTFGSGAVGPSAAASFTPMVKNPRMETFYYVQLMGISVGGARVPGVAVSDLR 366
Query: 320 ----NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVL 371
I DSGTS T L PAY + + F + A R + F+ CY L
Sbjct: 367 LDPSTGRGGVIVDSGTSVTRLARPAYAALRDAFRAAAAGLRLSPGGFSLFDTCYDL 422
>gi|225455876|ref|XP_002275164.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 496
Score = 71.2 bits (173), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 80/294 (27%), Positives = 123/294 (41%), Gaps = 40/294 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ V +G+P+ +F + +DTGSD+ WL C C C ++ I+ P +SS+
Sbjct: 160 YFLRVGIGRPSKTFYMVIDTGSDVNWLQCKPCDDCYQQVDP---------IFDPASSSSF 210
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
S++ C + C +C YQV Y DG+ + G + + S SVD +
Sbjct: 211 SRLGCQTPQCRNLDVFACRNDSCLYQVSY-GDGSYTVGDFATETVSFG----NSGSVD-K 264
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGTGR 280
++ GCG G F+ A + P L +Q + +SFS C S +
Sbjct: 265 VAIGCGHDNEGLFVGAAG-------LIGLGGGPLSLTSQ-IKASSFSYCLVNRDSVDSST 316
Query: 281 ISFGDKGSPGQGETP-FSLRQTHPTYNITITQVSVGGNAVN-----FEFSA------IFD 328
+ F P F + Y + IT +SVGG + FE I D
Sbjct: 317 LEFNSAKPSDSVTAPIFKNSKVDTFYYVGITGMSVGGEKLAIPPSIFEVDGSGKGGIIVD 376
Query: 329 SGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRSFLHLQALVV 382
GT+ T L AY + +TF L K+ TS L F+ CY L S ++ V
Sbjct: 377 CGTAVTRLQTQAYNALRDTFVKLTKDLPSTSGFAL-FDTCYNLSSRTSVRVPTV 429
>gi|449449906|ref|XP_004142705.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
gi|449500739|ref|XP_004161182.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 410
Score = 71.2 bits (173), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 77/274 (28%), Positives = 114/274 (41%), Gaps = 22/274 (8%)
Query: 105 HYT-NVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
H+T +V++G P F + +DTGSDL W+ CD C C + +Y P+ +
Sbjct: 54 HFTVSVTIGNPPKVFELDIDTGSDLTWVQCDAPCTGCT---------LPHDRLYKPHNNV 104
Query: 162 TSSKVP-CNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
P C++ + C + C Y+V Y G+ S G LV+D + L +
Sbjct: 105 VRCGEPLCSALFSASKSPCKNPNDQCDYEVEYADHGS-SIGVLVKDPVPLRL--TNGTIL 161
Query: 221 DSRISFGCGRVQT--GSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT 278
+ FGCG Q GS L G+ GLG K ++ + L+ + N CF G
Sbjct: 162 APNLGFGCGYDQHNGGSQLPPLT-AGVLGLGNSKATMATQLSALSHVRNVLGHCFSGQGG 220
Query: 279 GRISFGDKGSPGQGETPFS-LRQTHPTYNITITQVSVGGNAVNFE-FSAIFDSGTSFTYL 336
G + FG P G + LR Y+ +V GGN V FDSG+S+TY
Sbjct: 221 GFLFFGGDLVPSSGMSWMPILRTPGGKYSAGPAEVYFGGNPVGIRGLILTFDSGSSYTYF 280
Query: 337 NDPAYTQISETF-NSLAKEKRETSTSDLPFEYCY 369
N Y + N L + + D C+
Sbjct: 281 NSQVYGAVLNLLRNGLKGQPLRDAPEDKTLPICW 314
>gi|356555042|ref|XP_003545848.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 431
Score = 71.2 bits (173), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 73/266 (27%), Positives = 111/266 (41%), Gaps = 37/266 (13%)
Query: 110 SVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPC 168
S+G P +DT SD+ W+ C C +C + + ++ P+ S T +PC
Sbjct: 93 SLGTPPFPVYGIVDTASDIIWVQCQLCETCYNDTSP---------MFDPSYSKTYKNLPC 143
Query: 169 NSTLCELQKQCPSAGSN----CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRI 224
+ST C+ Q S S+ C + V Y DG+ S G L+ + + L + R
Sbjct: 144 SSTTCK-SVQGTSCSSDERKICEHTVNY-KDGSHSQGDLIVETVTLGSYNDPFVHF-PRT 200
Query: 225 SFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRIS 282
GC R SF G+ GLG S+ L++ I FS C SD + ++
Sbjct: 201 VIGCIRNTNVSF----DSIGIVGLGGGPVSLVPQLSSS--ISKKFSYCLAPISDRSSKLK 254
Query: 283 FGDKG---SPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEF---------SAIFDSG 330
FGD G T + Y +T+ SVG N + F + I DSG
Sbjct: 255 FGDAAMVSGDGTVSTRIVFKDWKKFYYLTLEAFSVGNNRIEFRSSSSRSSGKGNIIIDSG 314
Query: 331 TSFTYLNDPAYTQISETFNSLAKEKR 356
T+FT L D Y+++ + K +R
Sbjct: 315 TTFTVLPDDVYSKLESAVADVVKLER 340
>gi|357158691|ref|XP_003578210.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 459
Score = 71.2 bits (173), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 77/277 (27%), Positives = 111/277 (40%), Gaps = 47/277 (16%)
Query: 102 GFLHY-TNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNT 159
G L Y +++VG P LDTGSDL W C C SC+ + I+SP
Sbjct: 100 GDLEYLVDLAVGTPPQPVSALLDTGSDLIWTQCAPCASCLPQPDP---------IFSPGA 150
Query: 160 SSTSSKVPCNSTLCE--LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVED--VLHLATDEK 215
SS+ + C LC L C C Y+ Y DGT + G + ++
Sbjct: 151 SSSYEPMRCAGELCNDILHHSCQRP-DTCTYRYSY-GDGTTTRGVYATERFTFSSSSSGG 208
Query: 216 QSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
++ + + + FGCG + GS +G +G+ G G S+ S LA + FS C
Sbjct: 209 ETTKLSAPLGFGCGTMNKGSLNNG---SGIVGFGRAPLSLVSQLAIR-----RFSYCLTP 260
Query: 276 DGTGRIS---FG-------DKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFS 324
+GR S FG D + T + +PT Y + T V+VG + S
Sbjct: 261 YASGRKSTLLFGSLRGGVYDAATATVQTTRLLRSRQNPTFYYVPFTGVTVGARRLRIPIS 320
Query: 325 -----------AIFDSGTSFTYLNDPAYTQISETFNS 350
AI DSGT+ T P ++ F S
Sbjct: 321 AFALRPDGSGGAIVDSGTALTLFPAPVLAEVVRAFRS 357
>gi|147786881|emb|CAN62311.1| hypothetical protein VITISV_008929 [Vitis vinifera]
Length = 408
Score = 71.2 bits (173), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 79/298 (26%), Positives = 123/298 (41%), Gaps = 40/298 (13%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
N SVG+P + +V +DTGSDL W+ C C C I+ P+ SST
Sbjct: 61 VNFSVGRPPVPQLVGIDTGSDLLWVQCRPCADCFRQSTP---------IFDPSKSSTYVD 111
Query: 166 VPCNSTLCELQKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRI 224
+ +S +C Q N C Y Y +DG+ S+G L + + T ++ + +V S +
Sbjct: 112 LSYDSPICPNSPQKKYNHLNQCIYNASY-ADGSTSSGNLATEDIVFETSDQGTVTVSS-V 169
Query: 225 SFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-----DGTG 279
FGCG G F DG +G+ GL S+ S L ++ FS C G
Sbjct: 170 VFGCGHSNRGRF-DGQQ-SGILGLSAGDQSIVSRLGSR------FSYCIGDLFDPHYTHN 221
Query: 280 RISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-----------EFSAIFD 328
++ GD TPF + Y +T+ +SVG ++ + + D
Sbjct: 222 QLVLGDGVKMEGSSTPF--HTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMD 279
Query: 329 SGTSFTYLNDPAYTQISETFNSLAKEK-RETSTSDLPFEYCYVLRSFLHLQALVVLPF 385
SGT+ T+L + +S L + ++ +P CY R L+ L F
Sbjct: 280 SGTTATFLAKDGFDPLSNEIQRLVRGHFQQVIYRTIPGWLCYKGRVNEDLRGFPELAF 337
>gi|356524289|ref|XP_003530762.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Glycine max]
Length = 392
Score = 70.9 bits (172), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 98/341 (28%), Positives = 145/341 (42%), Gaps = 56/341 (16%)
Query: 65 HRDRYFRLR-----GRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFI 119
R +Y + R GR + D T L +G+ N + V +G P
Sbjct: 6 ERVKYIQSRLSKNLGRENTVKDLDSTTLPAESGSLIGSAN-----YVVVVGLGTPKRDLS 60
Query: 120 VALDTGSDLFWLPCD-CV-SCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE--- 174
+ DTGSDL W C+ C SC ++ I+ P+ SS+ + + C S+LC
Sbjct: 61 LVFDTGSDLTWTQCEPCAGSCYKQQDA---------IFDPSKSSSYTNITCTSSLCTQLT 111
Query: 175 ---LQKQCPSA-GSNCPYQVRYLSDGTMSTGFLVEDVLHL-ATDEKQSKSVDSRISFGCG 229
++ +C S+ ++C Y +Y D + S GFL ++ L + ATD VD + FGCG
Sbjct: 112 SDGIKSECSSSTDASCIYDAKY-GDNSTSVGFLSQERLTITATD-----IVDDFL-FGCG 164
Query: 230 RVQTGSFLDGAAPNGLFGLGMDKTS-VPSILANQGLIPNSFSMCF--GSDGTGRISFGDK 286
+ G F +G+A GL GLG S V +N I FS C S G ++FG
Sbjct: 165 QDNEGLF-NGSA--GLMGLGRHPISIVQQTSSNYNKI---FSYCLPATSSSLGHLTFGAS 218
Query: 287 GSPGQG--ETPFS-LRQTHPTYNITITQVSVGGNAV----NFEFSA---IFDSGTSFTYL 336
+ TP S + + Y + I +SVGG + + FSA I DSGT T L
Sbjct: 219 AATNASLIYTPLSTISGDNSFYGLDIVSISVGGTKLPAVSSSTFSAGGSIIDSGTVITRL 278
Query: 337 NDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRSFLHL 377
Y + F EK + + CY L + +
Sbjct: 279 APTVYAALRSAFRR-XMEKYPVANEAGLLDTCYDLSGYKEI 318
>gi|222616654|gb|EEE52786.1| hypothetical protein OsJ_35257 [Oryza sativa Japonica Group]
Length = 346
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 76/273 (27%), Positives = 115/273 (42%), Gaps = 37/273 (13%)
Query: 109 VSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
+S+G P + +V +DTGS L W+ C +C + + +GQ I++P SST SKV
Sbjct: 3 ISLGTPPVFNLVTIDTGSTLSWVQCKNCQIKCYDQAAKAGQ-----IFNPYNSSTYSKVG 57
Query: 168 CNSTLCE-------LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
C++ C ++ C C Y +RY S G S G+L +D L LA++ +S+
Sbjct: 58 CSTEACNGMHMDLAVEYGCVEEDDTCIYSLRYGS-GEYSVGYLGKDRLTLASN----RSI 112
Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD--GT 278
D+ I FGCG L G+ G G S + + Q +FS CF D
Sbjct: 113 DNFI-FGCGEDN----LYNGVNAGIIGFGTKSYSFFNQVCQQTDY-TAFSYCFPRDHENE 166
Query: 279 GRISFGDKGSP-GQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS--------AIFDS 329
G ++ G T P Y I Q+ + N + E I DS
Sbjct: 167 GSLTIGPYARDINLMWTKLIYYDHKPAY--AIQQLDMMVNGIRLEIDPYIYISKMTIVDS 224
Query: 330 GTSFTYLNDPAYTQISETFNSLAKEKRETSTSD 362
GT+ TY+ P + + + + K T D
Sbjct: 225 GTADTYILSPVFDALDKAMTKEMQAKGYTRGWD 257
>gi|115472515|ref|NP_001059856.1| Os07g0532800 [Oryza sativa Japonica Group]
gi|50508274|dbj|BAD32123.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113611392|dbj|BAF21770.1| Os07g0532800 [Oryza sativa Japonica Group]
Length = 436
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 83/272 (30%), Positives = 114/272 (41%), Gaps = 61/272 (22%)
Query: 108 NVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
N+SVG P L+F V DTGSDL W C C C + P +SST SK+
Sbjct: 89 NISVGTPLLTFSVVADTGSDLIWTQCAPCTKCFQQPAPP---------FQPASSSTFSKL 139
Query: 167 PCNSTLCELQ----KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
PC S+ C+ + C + G C Y +Y S T G+L + L + S
Sbjct: 140 PCTSSFCQFLPNSIRTCNATG--CVYNYKYGSGYT--AGYLATETLKVGDASFPS----- 190
Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPN----SFSMCFGSD-- 276
++FGC + G G + +G+ GLG S LIP FS C S
Sbjct: 191 -VAFGC-STENGV---GNSTSGIAGLGRGALS---------LIPQLGVGRFSYCLRSGSA 236
Query: 277 -GTGRISFGDKGSPGQGE---TPF-SLRQTHPT-YNITITQVSVGGNAV-----NFEFS- 324
G I FG + G TPF + HP+ Y + +T ++VG + F F+
Sbjct: 237 AGASPILFGSLANLTDGNVQSTPFVNNPAVHPSYYYVNLTGITVGETDLPVTTSTFGFTQ 296
Query: 325 ------AIFDSGTSFTYLNDPAYTQISETFNS 350
I DSGT+ TYL Y + + F S
Sbjct: 297 NGLGGGTIVDSGTTLTYLAKDGYEMVKQAFLS 328
>gi|296090291|emb|CBI40110.3| unnamed protein product [Vitis vinifera]
Length = 408
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 79/298 (26%), Positives = 123/298 (41%), Gaps = 40/298 (13%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
N SVG+P + +V +DTGSDL W+ C C C I+ P+ SST
Sbjct: 61 VNFSVGRPPVPQLVGIDTGSDLLWVQCRPCADCFRQSTP---------IFDPSKSSTYVD 111
Query: 166 VPCNSTLCELQKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRI 224
+ +S +C Q N C Y Y +DG+ S+G L + + T ++ + +V S +
Sbjct: 112 LSYDSPICPNSPQKKYNHLNQCIYNASY-ADGSTSSGNLATEDIVFETSDQGTVTVSS-V 169
Query: 225 SFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-----DGTG 279
FGCG G F DG +G+ GL S+ S L ++ FS C G
Sbjct: 170 VFGCGHSNRGRF-DGQQ-SGILGLSAGDQSIVSRLGSR------FSYCIGDLFDPHYTHN 221
Query: 280 RISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-----------EFSAIFD 328
++ GD TPF + Y +T+ +SVG ++ + + D
Sbjct: 222 QLVLGDGVKMEGSSTPF--HTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMD 279
Query: 329 SGTSFTYLNDPAYTQISETFNSLAKEK-RETSTSDLPFEYCYVLRSFLHLQALVVLPF 385
SGT+ T+L + +S L + ++ +P CY R L+ L F
Sbjct: 280 SGTTATFLAKDGFDPLSNEIQRLVRGHFQQVIYRTIPGWLCYKGRVNEDLRGFPELAF 337
>gi|302794412|ref|XP_002978970.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
gi|300153288|gb|EFJ19927.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
Length = 462
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 73/263 (27%), Positives = 109/263 (41%), Gaps = 33/263 (12%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+YT++ +G P I+ +DTGS+L WL C C C +++ IY S +
Sbjct: 100 YYTSIKLGSPGQEAILIVDTGSELTWLKCLPCKVCAPSVDT---------IYDAARSVSY 150
Query: 164 SKVPC-NSTLCELQKQCPSA----GSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
V C NS LC Q A GS C + Y DG+ S G L D L + T
Sbjct: 151 KPVTCNNSQLCSNSSQGTYAYCARGSQCQFAAFY-GDGSFSYGSLSTDTLIMETVVGGKP 209
Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS--- 275
+FGC + GA+ G+ GL K ++P L + FS CF
Sbjct: 210 VTVQDFAFGCAQGDLELVPTGAS--GILGLNAGKMALPMQLGQR--FGWKFSHCFPDRSS 265
Query: 276 --DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEF--------SA 325
+ TG + FG+ P + S+ T+ V++ G ++N
Sbjct: 266 HLNSTGVVFFGNAELPHEQVQYTSVALTNSELQRKFYHVALKGVSINSHELVLLPRGSVV 325
Query: 326 IFDSGTSFTYLNDPAYTQISETF 348
I DSG+SF+ P ++Q+ E F
Sbjct: 326 ILDSGSSFSSFVRPFHSQLREAF 348
>gi|225446261|ref|XP_002265547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 440
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 79/298 (26%), Positives = 123/298 (41%), Gaps = 40/298 (13%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSK 165
N SVG+P + +V +DTGSDL W+ C C C I+ P+ SST
Sbjct: 93 VNFSVGRPPVPQLVGIDTGSDLLWVQCRPCADCFRQSTP---------IFDPSKSSTYVD 143
Query: 166 VPCNSTLCELQKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRI 224
+ +S +C Q N C Y Y +DG+ S+G L + + T ++ + +V S +
Sbjct: 144 LSYDSPICPNSPQKKYNHLNQCIYNASY-ADGSTSSGNLATEDIVFETSDQGTVTVSS-V 201
Query: 225 SFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-----DGTG 279
FGCG G F DG +G+ GL S+ S L ++ FS C G
Sbjct: 202 VFGCGHSNRGRF-DGQQ-SGILGLSAGDQSIVSRLGSR------FSYCIGDLFDPHYTHN 253
Query: 280 RISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-----------EFSAIFD 328
++ GD TPF + Y +T+ +SVG ++ + + D
Sbjct: 254 QLVLGDGVKMEGSSTPF--HTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMD 311
Query: 329 SGTSFTYLNDPAYTQISETFNSLAKEK-RETSTSDLPFEYCYVLRSFLHLQALVVLPF 385
SGT+ T+L + +S L + ++ +P CY R L+ L F
Sbjct: 312 SGTTATFLAKDGFDPLSNEIQRLVRGHFQQVIYRTIPGWLCYKGRVNEDLRGFPELAF 369
>gi|414876414|tpg|DAA53545.1| TPA: hypothetical protein ZEAMMB73_483039 [Zea mays]
Length = 506
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 77/287 (26%), Positives = 119/287 (41%), Gaps = 44/287 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+++ V +G PA + LDTGSD+ W+ C C C + ++ P+ S++
Sbjct: 166 YFSRVGIGSPARQLYMVLDTGSDVTWVQCQPCADCYQQSDP---------VFDPSLSASY 216
Query: 164 SKVPCNSTLCE--LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
+ V C+S C C +A C Y+V Y DG+ + G + L L
Sbjct: 217 AAVSCDSQRCRDLDTAACRNATGACLYEVAY-GDGSYTVGDFATETLTLGDSTPVGN--- 272
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGT 278
++ GCG G F+ A L G + S PS ++ ++FS C S
Sbjct: 273 --VAIGCGHDNEGLFVGAAGLLALGGGPL---SFPSQISA-----STFSYCLVDRDSPAA 322
Query: 279 GRISFGDKGSPGQGETPFSLR--QTHPTYNITITQVSVGGNAVNFEFSA----------- 325
+ FGD + T +R +T Y + ++ +SVGG ++ SA
Sbjct: 323 STLQFGDGAAEAGTVTAPLVRSPRTSTFYYVALSGISVGGQPLSIPASAFAMDATSGSGG 382
Query: 326 -IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVL 371
I DSGT+ T L AY + + F A TS L F+ CY L
Sbjct: 383 VIVDSGTAVTRLQSAAYAALRDAFVQGAPSLPRTSGVSL-FDTCYDL 428
>gi|88174577|gb|ABD39363.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
Group]
Length = 321
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 76/291 (26%), Positives = 122/291 (41%), Gaps = 44/291 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+ T+V +G PA + IV +DTGS + W+ C+C C H + + + S+T +
Sbjct: 1 YVTSVGLGTPAKTQIVEIDTGSSISWVFCECDGC-H---------TNPRTFLQSRSTTCA 50
Query: 165 KVPCNSTLCELQKQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
KV C +++C L P +CP++V Y DG+ S G L +D L + +K
Sbjct: 51 KVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSY-QDGSASYGILYQDTLTFSDVQKIPS- 108
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC------- 272
+FGC G+ G +GL G+G SV L + FS C
Sbjct: 109 ----FTFGCNLDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPTFDGFSYCLPLQKSE 160
Query: 273 --FGSDGTGRISFGDKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFS---- 324
F S TG S G + + R+ + + + +SV G + S
Sbjct: 161 RGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSR 220
Query: 325 --AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRS 373
+FDSG+ +Y+ D A + +S+ L R + + CY +RS
Sbjct: 221 KGVVFDSGSELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRS 269
>gi|356546378|ref|XP_003541603.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 439
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 77/281 (27%), Positives = 113/281 (40%), Gaps = 37/281 (13%)
Query: 110 SVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPC 168
SVG P + +DTGSD+ WL C+ C C I+ P+ S T +PC
Sbjct: 96 SVGSPPFQVLGIVDTGSDILWLQCEPCEDCYKQTTP---------IFDPSKSKTYKTLPC 146
Query: 169 NSTLCE-LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFG 227
+S CE L+ S+ + C Y + Y DG+ S G L + L L + + S + G
Sbjct: 147 SSNTCESLRNTACSSDNVCEYSIDY-GDGSHSDGDLSVETLTLGSTDGSSVHFPKTV-IG 204
Query: 228 CGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG-----SDGTGRIS 282
CG G+F + G +G+ V I I FS C S+ + +++
Sbjct: 205 CGHNNGGTFQE----EGSGIVGLGGGPVSLISQLSSSIGGKFSYCLAPIFSESNSSSKLN 260
Query: 283 FGDKG---SPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-----------EFSAIFD 328
FGD G TP Y +T+ SVG N + F + + I D
Sbjct: 261 FGDAAVVSGRGTVSTPLDPLNGQVFYFLTLEAFSVGDNRIEFSGSSSSGSGSGDGNIIID 320
Query: 329 SGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
SGT+ T L Y + + + K +R S L CY
Sbjct: 321 SGTTLTLLPQEDYLNLESAVSDVIKLERARDPSKL-LSLCY 360
>gi|224060469|ref|XP_002300215.1| predicted protein [Populus trichocarpa]
gi|222847473|gb|EEE85020.1| predicted protein [Populus trichocarpa]
Length = 439
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 91/330 (27%), Positives = 136/330 (41%), Gaps = 57/330 (17%)
Query: 66 RDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTG 125
R+R RL+ L A + + GN + + +++G P ++ LDTG
Sbjct: 67 RNRLQRLQAMALVASSSSEIEAPVLPGNGEFLMK---------LAIGTPPETYSAILDTG 117
Query: 126 SDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGS 184
SDL W C C C H I+ P SS+ SK+ C+S LCE Q S +
Sbjct: 118 SDLIWTQCKPCTQCFHQSTP---------IFDPKKSSSFSKLSCSSQLCEALPQS-SCNN 167
Query: 185 NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGS-FLDGAAPN 243
C Y Y D + + G L + L K+ ++FGCG GS F GA
Sbjct: 168 GCEYLYSY-GDYSSTQGILASETLTFG------KASVPNVAFGCGADNEGSGFSQGA--- 217
Query: 244 GLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGT-------GRISFGDKGSPGQGETP 295
GL GLG S+ S L FS C + D T G ++ + S TP
Sbjct: 218 GLVGLGRGPLSLVSQLKEP-----KFSYCLTTVDDTKTSTLLMGSLASVNASSSAIKTTP 272
Query: 296 FSLRQTHPT-YNITITQVSVGGNAVNFEFSA-----------IFDSGTSFTYLNDPAYTQ 343
HP+ Y +++ +SVG + + S I DSGT+ TYL + A+
Sbjct: 273 LIHSPAHPSFYYLSLEGISVGDTRLPIKKSTFSLQDDGSGGLIIDSGTTITYLEESAFNL 332
Query: 344 ISETFNSLAKEKRETSTSDLPFEYCYVLRS 373
+++ F + ++S S + C+ L S
Sbjct: 333 VAKEFTAKINLPVDSSGST-GLDVCFTLPS 361
>gi|125558622|gb|EAZ04158.1| hypothetical protein OsI_26300 [Oryza sativa Indica Group]
Length = 435
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 83/272 (30%), Positives = 114/272 (41%), Gaps = 61/272 (22%)
Query: 108 NVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
N+SVG P L+F V DTGSDL W C C C + P +SST SK+
Sbjct: 89 NISVGTPLLTFPVVADTGSDLIWTQCAPCTKCFQQPAPP---------FQPASSSTFSKL 139
Query: 167 PCNSTLCELQ----KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
PC S+ C+ + C + G C Y +Y S T G+L + L + S
Sbjct: 140 PCTSSFCQFLPNSIRTCNATG--CVYNYKYGSGYT--AGYLATETLKVGDASFPS----- 190
Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPN----SFSMCFGSD-- 276
++FGC + G G + +G+ GLG S LIP FS C S
Sbjct: 191 -VAFGC-STENGV---GNSTSGIAGLGRGALS---------LIPQLGVGRFSYCLRSGSA 236
Query: 277 -GTGRISFGDKGSPGQGE---TPF-SLRQTHPT-YNITITQVSVGGNAV-----NFEFS- 324
G I FG + G TPF + HP+ Y + +T ++VG + F F+
Sbjct: 237 AGASPILFGSLANLTDGNVQSTPFVNNPAVHPSYYYVNLTGITVGETDLPVTTSTFGFTQ 296
Query: 325 ------AIFDSGTSFTYLNDPAYTQISETFNS 350
I DSGT+ TYL Y + + F S
Sbjct: 297 NGLGGGTIVDSGTTLTYLAKDGYEMVKQAFLS 328
>gi|357127505|ref|XP_003565420.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
distachyon]
Length = 466
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 71/267 (26%), Positives = 104/267 (38%), Gaps = 42/267 (15%)
Query: 103 FLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGL--------------NSSSGQ 148
F + V+VG P + F+ DTGSDL WL C+ +G+
Sbjct: 80 FEYLAAVNVGTPPVRFLAVADTGSDLVWLKCNTTQNNNGIVSSDSGNNSNSSPPPPPPEA 139
Query: 149 VIDFNIYSPNTSSTSSKVPCNSTLC---ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVE 205
V+ FN P SS+ S+V C+ C C C ++ Y DG +TG L
Sbjct: 140 VVYFN---PFDSSSYSRVGCDGPSCLALATNASCNGDSHACDFRYSY-RDGASATGLLAA 195
Query: 206 DVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLI 265
D + + + I FGC G +G+ GLG S+ S L +
Sbjct: 196 DTFTFGGNINNDTTSTASIDFGCATGTAGREFQA---DGMVGLGAGPLSLASQLGRK--- 249
Query: 266 PNSFSMCFGS----DGTGRISFGDKG---SPGQGETPFSLRQTHPT--YNITITQVSVGG 316
FS C + D + ++FG + PG TP ++ Y I+I + V G
Sbjct: 250 ---FSFCLTAYDIDDASSILNFGARAVVSDPGAATTPLIASSSNAAAYYAISIDSLKVAG 306
Query: 317 NAVNFEFS---AIFDSGTSFTYLNDPA 340
V S I D+GT T+L+ A
Sbjct: 307 QPVPGTTSVSKVIVDTGTVLTFLDRAA 333
>gi|222635451|gb|EEE65583.1| hypothetical protein OsJ_21095 [Oryza sativa Japonica Group]
Length = 441
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 92/330 (27%), Positives = 137/330 (41%), Gaps = 53/330 (16%)
Query: 62 ALAHRDRYFRLRGRGLAAQGND----KTPLTFSAGNDT----YRLNSLGFLHYT-NVSVG 112
+LA R R R R + + T L+ +AG T + +S+ L Y + +G
Sbjct: 39 SLAERLRRDRARTNYIVTKATGGRTAATALSDAAGGGTSIPTFLGDSVNSLEYVVTLGIG 98
Query: 113 QPALSFIVALDTGSDLFWL---PCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCN 169
PA+ V +DTGSDL W+ PC C + ++ P++SS+ + VPC+
Sbjct: 99 TPAVQQTVLIDTGSDLSWVQCKPCGAGECYAQKDP---------LFDPSSSSSYASVPCD 149
Query: 170 STLCELQKQCP----------SAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
S C A + C Y + Y + T +TG + L L +
Sbjct: 150 SDACRKLAAGAYGHGCTGVSGGAAALCEYGIEYGNRAT-TTGVYSTETLTL-----KPGV 203
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTG 279
V + FGCG Q G + +GL GLG S+ S ++Q P S+ + S G G
Sbjct: 204 VVADFGFGCGDHQHGPYEKF---DGLLGLGGAPESLVSQTSSQFGGPFSYCLPPTSGGAG 260
Query: 280 RISFG-------DKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA-----I 326
++ G + G TP + PT Y +T+T +SVGG + SA +
Sbjct: 261 FLTLGAPPNSSSSTAASGLSFTPMRRLPSVPTFYIVTLTGISVGGAPLAIPPSAFSSGMV 320
Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKR 356
DSGT T L AY + F S E R
Sbjct: 321 IDSGTVITGLPATAYAALRSAFRSAMSEYR 350
>gi|242095602|ref|XP_002438291.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
gi|241916514|gb|EER89658.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
Length = 464
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 84/275 (30%), Positives = 120/275 (43%), Gaps = 45/275 (16%)
Query: 100 SLGFLHYT-NVSVGQPALSFIVALDTGSDLFWL---PCDCVSCVHGLNSSSGQVIDFNIY 155
SLG Y V++G PA++ ++++DTGSD+ W+ PC SC + ++
Sbjct: 123 SLGTTEYVITVTIGTPAVTQVMSIDTGSDVSWVQCAPCAAQSCSSQKD---------KLF 173
Query: 156 SPNTSSTSSKVPCNSTLCELQKQCPSAG-----SNCPYQVRYLSDGTMSTGFLVEDVLHL 210
P S+T S C S C Q G S C Y V+Y DG+ + G D L L
Sbjct: 174 DPAMSATYSAFSCGSAQC---AQLGDEGNGCLKSQCQYIVKY-GDGSNTAGTYGSDTLSL 229
Query: 211 ATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFS 270
+ S +V S FGC G F+ +GL GLG D S+ S A +FS
Sbjct: 230 TS----SDAVKS-FQFGCSHRAAG-FV--GELDGLMGLGGDTESLVSQTA--ATYGKAFS 279
Query: 271 MCF---GSDGTGRISFGDKG---SPGQGETPFSLRQTHPT-YNITITQVSVGGNAVN--- 320
C S G G ++ G G S TP +R + PT Y + + ++V G +N
Sbjct: 280 YCLPPPSSSGGGFLTLGAAGGASSSRYSHTPM-VRFSVPTFYGVFLQGITVAGTMLNVPA 338
Query: 321 --FEFSAIFDSGTSFTYLNDPAYTQISETFNSLAK 353
F +++ DSGT T L AY + F K
Sbjct: 339 SVFSGASVVDSGTVITQLPPTAYQALRTAFKKEMK 373
>gi|242053503|ref|XP_002455897.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
gi|241927872|gb|EES01017.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
Length = 493
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 80/296 (27%), Positives = 116/296 (39%), Gaps = 49/296 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++T + VG P+ ++ LDTGSD+ WL C C C SG V D P SS+
Sbjct: 140 YFTKIGVGTPSTPALMVLDTGSDVVWLQCAPCRRCY----DQSGPVFD-----PRRSSSY 190
Query: 164 SKVPCNSTLCELQKQ--CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
V C + LC C C YQV Y DG+++ G + L A +
Sbjct: 191 GAVDCAAPLCRRLDSGGCDLRRRACLYQVAY-GDGSVTAGDFATETLTFAGGARV----- 244
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-------- 273
+R++ GCG G F+ A GL S P+ ++ + SFS C
Sbjct: 245 ARVALGCGHDNEGLFVAAAGLLGLG---RGSLSFPTQISRR--YGKSFSYCLVDRTSSSS 299
Query: 274 ----GSDGTGRISFGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAV--------- 319
+ ++FG + TP T Y + + +SVGG V
Sbjct: 300 SGAASRSRSSTVTFGPPSASAASFTPMVRNPRMETFYYVQLVGISVGGARVPGVAESDLR 359
Query: 320 ----NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVL 371
I DSGTS T L P+Y+ + + F + A R + F+ CY L
Sbjct: 360 LDPSTGRGGVIVDSGTSVTRLARPSYSALRDAFRAAAAGLRLSPGGFSLFDTCYDL 415
>gi|255548664|ref|XP_002515388.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545332|gb|EEF46837.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 494
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 81/264 (30%), Positives = 118/264 (44%), Gaps = 32/264 (12%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ V +G P F + DTGSDL W C+ CV + + I++P+ S++
Sbjct: 153 YFVTVGLGTPKKDFSLIFDTGSDLTWTQCEPCVKSCYNQKEA--------IFNPSQSTSY 204
Query: 164 SKVPCNSTLCELQKQCPS-----AGSNCPYQVRYLSDGTMSTGFLVEDVLHL-ATDEKQS 217
+ + C STLC+ A S C Y ++Y D + S GF ++ L L ATD
Sbjct: 205 ANISCGSTLCDSLASATGNIFNCASSTCVYGIQY-GDSSFSIGFFGKEKLSLTATD---- 259
Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDG 277
V + FGCG+ G F A GLG DK S+ S A + S+ + S
Sbjct: 260 --VFNDFYFGCGQNNKGLFGGAAGLL---GLGRDKLSLVSQTAQRYNKIFSYCLPSSSSS 314
Query: 278 TGRISFGDKGSPGQGETPF-SLRQTHPTYNITITQVSVGGNAVNFEFSA------IFDSG 330
TG ++FG S TP ++ Y + +T +SVGG + S I DSG
Sbjct: 315 TGFLTFGGSTSKSASFTPLATISGGSSFYGLDLTGISVGGRKLAISPSVFSTAGTIIDSG 374
Query: 331 TSFTYLNDPAYTQISETFNSLAKE 354
T T L AY+ +S TF L +
Sbjct: 375 TVITRLPPAAYSALSSTFRKLMSQ 398
>gi|326520291|dbj|BAK07404.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 464
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 77/282 (27%), Positives = 113/282 (40%), Gaps = 59/282 (20%)
Query: 105 HYTNVSVGQPALSFIVA-LDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ ++ +G P +V LDTGSDL W C C C ++ + S T
Sbjct: 94 YLIHLGIGTPRPQRVVLHLDTGSDLVWTQCACTVCFDQ---------PVPVFRASVSHTF 144
Query: 164 SKVPCNSTLCELQKQCPSAG-----SNCPYQVRYLSDGTMSTGFLVEDVLHL-ATDEKQS 217
S+VPC+ LC P +G +C Y Y+ D +++TG + ED A D +
Sbjct: 145 SRVPCSDPLCGHAVYLPLSGCAARDRSCFYAYGYM-DHSITTGKMAEDTFTFKAPDRADT 203
Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPN--GLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
+ I FGCG + G F PN G+ G G S+PS L + FS CF +
Sbjct: 204 AAAVPNIRFGCGMMNYGLF----TPNQSGIAGFGTGPLSLPSQLKVR-----RFSYCFTA 254
Query: 276 DGTGRIS---FGDKGSPGQGE---------TPFSLRQ------THPTYNITITQVSVGGN 317
R+S G G P E TPF+ + P Y +++ V+VG
Sbjct: 255 MEESRVSPVILG--GEPENIEAHATGPIQSTPFAPGPAGAPVGSQPFYFLSLRGVTVGET 312
Query: 318 AVNFEFS-----------AIFDSGTSFTYLNDPAYTQISETF 348
+ F S DSGT+ T+ + + E F
Sbjct: 313 RLPFNASTFALKGDGSGGTFIDSGTAITFFPQAVFRSLREAF 354
>gi|413923782|gb|AFW63714.1| hypothetical protein ZEAMMB73_300584 [Zea mays]
Length = 458
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 98/334 (29%), Positives = 141/334 (42%), Gaps = 50/334 (14%)
Query: 34 FHHRYSDPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQ--GNDKTPLTFSA 91
HHRY DP + P K L R R +LR + + G + +A
Sbjct: 59 LHHRY-DPCSPV------PSK----KVPTLEERLRRDQLRAAYIKRKFSGAGDIEQSDAA 107
Query: 92 GNDTYRLNSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQV 149
T SL L Y V +G PA++ +++DTGSD+ W+ C C C ++S
Sbjct: 108 TVPTTLGTSLSTLEYVITVGIGSPAVTQTMSMDTGSDVSWVQCKPCSQCHSEVDS----- 162
Query: 150 IDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSA----GSNCPYQVRYLSDGTMSTGFLVE 205
++ P++SST S C+S C Q S C Y V Y G S+
Sbjct: 163 ----LFDPSSSSTYSPFSCSSAPCAQLSQSQEGNGCMSSQCQYIVNY---GDSSSTTGTY 215
Query: 206 DVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLI 265
L S + FGC + ++G F D +GL GLG S+ S A G
Sbjct: 216 SSDTLTL----GSSAMTDFQFGCSQSESGGFNDQT--DGLMGLGGGAQSLASQTA--GTF 267
Query: 266 PNSFSMCF--GSDGTGRISFGDKGSPGQGETPFSLRQTH-PTYNITITQ-VSVGGNAVN- 320
+FS C S +G ++ G GS G +TP LR T PTY + + + + VG +N
Sbjct: 268 GTAFSYCLPPTSGSSGFLTLG-TGSSGFVKTPM-LRSTQIPTYYVVLLESIKVGSQQLNL 325
Query: 321 ----FEFSAIFDSGTSFTYLNDPAYTQISETFNS 350
F ++ DSGT T L AY+ +S F +
Sbjct: 326 PTSVFSAGSLMDSGTIITRLPPTAYSALSSAFKA 359
>gi|297603570|ref|NP_001054261.2| Os04g0677100 [Oryza sativa Japonica Group]
gi|255675885|dbj|BAF16175.2| Os04g0677100 [Oryza sativa Japonica Group]
Length = 464
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 75/298 (25%), Positives = 117/298 (39%), Gaps = 49/298 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ V VG P + +D+GSD+ W+ C C C + ++ P SS+
Sbjct: 130 YFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDP---------LFDPAASSSF 180
Query: 164 SKVPCNSTLCE----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
S V C S +C C Y V Y DG+ + G L + L L Q
Sbjct: 181 SGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTY-GDGSYTKGELALETLTLGGTAVQG-- 237
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSD 276
++ GCG +G F+ A GL GLG S+ L G FS C G+
Sbjct: 238 ----VAIGCGHRNSGLFVGAA---GLLGLGWGAMSLVGQLG--GAAGGVFSYCLASRGAG 288
Query: 277 GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS-----------A 325
G G + G + +G R+ Y + +T + VGG + + S
Sbjct: 289 GAGSLVLGRTEAVPRG------RRASSFYYVGLTGIGVGGERLPLQDSLFQLTEDGAGGV 342
Query: 326 IFDSGTSFTYLNDPAYTQISETFN-SLAKEKRETSTSDLPFEYCYVLRSFLHLQALVV 382
+ D+GT+ T L AY + F+ ++ R + S L + CY L + ++ V
Sbjct: 343 VMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLL--DTCYDLSGYASVRVPTV 398
>gi|414589628|tpg|DAA40199.1| TPA: hypothetical protein ZEAMMB73_627989 [Zea mays]
Length = 452
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 86/325 (26%), Positives = 125/325 (38%), Gaps = 73/325 (22%)
Query: 102 GFLHYT-NVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNT 159
G L Y ++++G P LDTGSDL W C C SC+ + +++P
Sbjct: 92 GDLEYVVDLAIGTPPQPVSALLDTGSDLIWTQCAPCASCLSQPDP---------LFAPGQ 142
Query: 160 SSTSSKVPCNSTLCE--LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
S++ + C TLC L C C Y+ Y DGTM+ G + A+
Sbjct: 143 SASYEPMRCAGTLCSDILHHSCERP-DTCTYRYNY-GDGTMTVGVYATERFTFASSGGGG 200
Query: 218 KSVDS-RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD 276
+ + + FGCG V GS +G +G+ G G + S+ S L+ + FS C S
Sbjct: 201 LTTTTVPLGFGCGSVNVGSLNNG---SGIVGFGRNPLSLVSQLSIR-----RFSYCLTSY 252
Query: 277 GTGRIS-----------FGDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFS 324
+ R S +GD Q TP +PT Y + T ++VG + S
Sbjct: 253 ASRRQSTLLFGSLSDGVYGDATGRVQ-TTPLLQSPQNPTFYYVHFTGLTVGARRLRIPES 311
Query: 325 A-----------IFDSGTSFTYLNDPAYTQISETFNSL--------------------AK 353
A I DSGT+ T L ++ F A
Sbjct: 312 AFALRPDGSGGVIVDSGTALTLLPAAVLAEVVRAFRQQLRLPFANGGNPEDGVCFLVPAA 371
Query: 354 EKRETSTSDLPFEYCYVLRSFLHLQ 378
+R +STS +P V R LH Q
Sbjct: 372 WRRSSSTSQMP-----VPRMVLHFQ 391
>gi|414886964|tpg|DAA62978.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 452
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 82/296 (27%), Positives = 126/296 (42%), Gaps = 50/296 (16%)
Query: 109 VSVGQPALSFIVALDTGSDLFWLPC-DCVS-CVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
+++G P L + DTGSDL W C C S C +Y+P++S+T + +
Sbjct: 96 LAIGTPPLPYQAIADTGSDLIWTQCAPCTSQCFRQPTP---------LYNPSSSTTFAVL 146
Query: 167 PCNSTLC------ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
PCNS+L P G C Y V Y S T + F + + V
Sbjct: 147 PCNSSLSVCAAALAGTGTAPPPGCACTYNVTYGSGWT--SVFQGSETFTFGSTPAGHARV 204
Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GSD 276
I+FGC +G + ++ +GL GLG + S L +Q +P FS C ++
Sbjct: 205 PG-IAFGCSTASSG--FNASSASGLVGLGRGRLS----LVSQLGVPK-FSYCLTPYQDTN 256
Query: 277 GTGRISFGD----KGSPGQGETPF-SLRQTHPT---YNITITQVSVGGNAVN-----FEF 323
T + G G+ G TPF + T P Y + +T +S+G A++ F
Sbjct: 257 STSTLLLGPSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALSIPPDAFSL 316
Query: 324 SA------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRS 373
+A I DSGT+ T L + AY Q+ SL ++D + C++L S
Sbjct: 317 NADGTGGLIIDSGTTITLLGNTAYQQVRAAVVSLVTLPTTDGSADTGLDLCFMLPS 372
>gi|242067693|ref|XP_002449123.1| hypothetical protein SORBIDRAFT_05g005430 [Sorghum bicolor]
gi|241934966|gb|EES08111.1| hypothetical protein SORBIDRAFT_05g005430 [Sorghum bicolor]
Length = 408
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 75/284 (26%), Positives = 116/284 (40%), Gaps = 36/284 (12%)
Query: 106 YTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFN-IYSPNTSSTSS 164
Y +++G+PA + + +DTGS WL C ++ G N + P T
Sbjct: 40 YVTMNIGEPAEPYFLDIDTGSSFTWLEC---------HAKDGPCKTCNKVPHPLYRLTRK 90
Query: 165 K-VPCNSTLCEL-------QKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEK 215
K VPC LC+ K+C N C Y+V+Y DG S G L+ D L T
Sbjct: 91 KLVPCADPLCDALHKDLGTTKKCTDVRKNQCDYKVKY-QDGLSSLGVLLLDKFSLPTGGA 149
Query: 216 QSKSVDSRISFGCGRVQTGSFLDGAAP----NGLFGLGMDKTSVPSILANQGLI-PNSFS 270
++ I+FGCG Q A +G+ GLG + S L + G + N
Sbjct: 150 RN------IAFGCGYDQMKGSKKKAPEKVPVDGILGLGRGSVDLASQLKHSGAVSKNVIG 203
Query: 271 MCFGSDGTGRISFGDKGSPGQGETPFSLRQTHP----TYNITITQVSVGGNAVNFE-FSA 325
C S G G + G++ P T + T P Y+ + + N + + A
Sbjct: 204 HCLSSKGGGYLFIGEENVPSSHVTWVPMAPTTPGEPNHYSPGQATLHLDSNPIGTKPLKA 263
Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
IFDSG+++TYL + + Q+ + + SD C+
Sbjct: 264 IFDSGSTYTYLPENLHAQLVSALKASLSKSSLKQVSDPALPLCW 307
>gi|449466304|ref|XP_004150866.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449485213|ref|XP_004157102.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 473
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 96/329 (29%), Positives = 134/329 (40%), Gaps = 57/329 (17%)
Query: 71 RLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFW 130
RLRG A + K+ T +GN + +V +G P + DTGSDL W
Sbjct: 109 RLRGSK-ATKIPAKSGATIGSGN-----------YIVSVGLGTPKKYLSLIFDTGSDLTW 156
Query: 131 LPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCEL-------QKQCPSA 182
C C + ++ P+ S+T S + C+S C Q C SA
Sbjct: 157 TQCQPCARYCYNQKDP--------VFVPSQSTTYSNISCSSPDCSQLESGTGNQPGC-SA 207
Query: 183 GSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAP 242
C Y ++Y D + S G+ ++ L L S V FGCG+ G F A
Sbjct: 208 ARACIYGIQY-GDQSFSVGYFAKETLTLT-----STDVIENFLFGCGQNNRGLFGSAA-- 259
Query: 243 NGLFGLGMDKTSVPSILANQGLIPNSFSMCF--GSDGTGRISFGDKGSPGQGE-TPFSLR 299
GL GLG DK S+ A + FS C S TG ++FG G G + TP +
Sbjct: 260 -GLIGLGQDKISIVKQTAQK--YGQVFSYCLPKTSSSTGYLTFGGGGGGGALKYTP--IT 314
Query: 300 QTHPT---YNITITQVSVGGNAVNFEFS------AIFDSGTSFTYLNDPAYTQISETFNS 350
+ H Y + I + VGG + S AI DSGT T L AY+ + F
Sbjct: 315 KAHGVANFYGVDIVGMKVGGTQIPISSSVFSTSGAIIDSGTVITRLPPDAYSALKSAFEK 374
Query: 351 -LAKEKRETSTSDLPFEYCYVLRSFLHLQ 378
+AK + S L + CY L + +Q
Sbjct: 375 GMAKYPKAPELSIL--DTCYDLSKYSTIQ 401
>gi|242056497|ref|XP_002457394.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
gi|241929369|gb|EES02514.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
Length = 509
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 77/287 (26%), Positives = 120/287 (41%), Gaps = 44/287 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+++ V +G PA + LDTGSD+ W+ C C C + ++ P+ S++
Sbjct: 169 YFSRVGIGSPARELYMVLDTGSDVTWVQCQPCADCYQQSDP---------VFDPSLSASY 219
Query: 164 SKVPCNSTLCE--LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
+ V C+S C C +A C Y+V Y DG+ + G + L L +
Sbjct: 220 AAVSCDSPRCRDLDTAACRNATGACLYEVAY-GDGSYTVGDFATETLTLGDSTPVTN--- 275
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGT 278
++ GCG G F+ A L G + S PS ++ ++FS C S
Sbjct: 276 --VAIGCGHDNEGLFVGAAGLLALGGGPL---SFPSQISA-----STFSYCLVDRDSPAA 325
Query: 279 GRISFGDKGSPGQGETPFSLR--QTHPTYNITITQVSVGGNAVNFEFSA----------- 325
+ FG G+ T +R +T Y + ++ +SVGG A++ SA
Sbjct: 326 STLQFGADGAEADTVTAPLVRSPRTGTFYYVALSGISVGGQALSIPSSAFAMDATSGSGG 385
Query: 326 -IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVL 371
I DSGT+ T L AY + + F TS L F+ CY L
Sbjct: 386 VIVDSGTAVTRLQSSAYAALRDAFVRGTPSLPRTSGVSL-FDTCYDL 431
>gi|224067042|ref|XP_002302336.1| predicted protein [Populus trichocarpa]
gi|222844062|gb|EEE81609.1| predicted protein [Populus trichocarpa]
Length = 438
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 86/310 (27%), Positives = 121/310 (39%), Gaps = 56/310 (18%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ V +G P + LDT +D W+PC C C + PN S+T
Sbjct: 98 YVVRVKLGTPGQQMFMVLDTSNDAAWVPCSGCTGCSS------------TTFLPNASTTL 145
Query: 164 SKVPCNSTLCELQK--QCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
+ C+ C + CP+ GS+ C + Y D ++ T LV+D + LA D V
Sbjct: 146 GSLDCSGAQCSQVRGFSCPATGSSACLFNQSYGGDSSL-TATLVQDAITLAND------V 198
Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDG--- 277
+FGC +G + P GL GLG S+ I + FS C S
Sbjct: 199 IPGFTFGCINAVSGGSI---PPQGLLGLGRGPISL--ISQAGAMYSGVFSYCLPSFKSYY 253
Query: 278 -TGRISFGDKGSPGQGETPFSLRQTH-PT-YNITITQVSVGGNAV-----------NFEF 323
+G + G G P T LR H P+ Y + +T VSVG V N
Sbjct: 254 FSGSLKLGPVGQPKSIRTTPLLRNPHRPSLYYVNLTGVSVGRIKVPIPSEQLVFDPNTGA 313
Query: 324 SAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRS-------FLH 376
I DSGT T P Y I + F K+ +S F+ C+ + LH
Sbjct: 314 GTIIDSGTVITRFVQPVYFAIRDEFR---KQVNGPISSLGAFDTCFAATNEAEAPAITLH 370
Query: 377 LQAL-VVLPF 385
+ L +VLP
Sbjct: 371 FEGLNLVLPM 380
>gi|242092878|ref|XP_002436929.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
gi|241915152|gb|EER88296.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
Length = 505
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 78/288 (27%), Positives = 122/288 (42%), Gaps = 38/288 (13%)
Query: 98 LNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSP 157
L++L F+ V G PA ++ +++DTGSD+ W+ C+ C V D P
Sbjct: 156 LDTLEFV--VTVGFGSPAQNYTLSIDTGSDVSWI--QCLPCSGHCYKQHDPVFD-----P 206
Query: 158 NTSSTSSKVPCNSTLCELQ-KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
S+T S VPC C +C ++G+ C Y+V Y DG+ + G L + L L++
Sbjct: 207 TKSATYSAVPCGHPQCAAAGGKCSNSGT-CLYKVTY-GDGSSTAGVLSHETLSLSSTRDL 264
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD 276
+FGCG+ G F GL + S+PS A +FS C S
Sbjct: 265 PG-----FAFGCGQTNLGEFGGVDGLVGLGRGAL---SLPSQAA--ATFGATFSYCLPSY 314
Query: 277 GT--GRISFGDKGSPGQGE------TPFSLRQTHPT-YNITITQVSVGG------NAVNF 321
T G ++ G + T ++ +P+ Y + + + +GG V
Sbjct: 315 DTTHGYLTMGSTTPAASNDDDDVQYTAMIQKEDYPSLYFVEVVSIDIGGYILPVPPTVFT 374
Query: 322 EFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
+FDSGT TYL AY + + F + + D PF+ CY
Sbjct: 375 RDGTLFDSGTILTYLPPEAYASLRDRFKFTMTQYKPAPAYD-PFDTCY 421
>gi|302781476|ref|XP_002972512.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
gi|300159979|gb|EFJ26598.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
Length = 496
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 86/291 (29%), Positives = 121/291 (41%), Gaps = 53/291 (18%)
Query: 113 QPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTL 172
Q LS I+ DTGS+ + C S S V D P S + +VPC S L
Sbjct: 110 QKNLSAII--DTGSEAVLVQC---------GSRSRPVFD-----PAASQSYRQVPCISQL 153
Query: 173 C-ELQKQ--------CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
C +Q+Q C ++ + C Y + Y D STG +DV+ L + ++V R
Sbjct: 154 CLAVQQQTSNGSSQPCVNSSATCTYSLSY-GDSRNSTGDFSQDVIFLNSTNSSGQAVQFR 212
Query: 224 -ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD-----G 277
++FGC G FL G+ G S+PS L ++ L + FS CF S
Sbjct: 213 DVAFGCAHSPQG-FLVDLGSLGIVGFNRGNLSLPSQLKDR-LGGSKFSYCFPSQPWQPRA 270
Query: 278 TGRISFGDKG--SPGQGETPFSLRQTHPT----YNITITQVSVGGNAVNFEFSA------ 325
TG I GD G G TP P Y + +T +SV G + SA
Sbjct: 271 TGVIFLGDSGLSKSKVGYTPLLDNPVTPARSQLYYVGLTSISVDGKTLAIPESAFKLDPS 330
Query: 326 ------IFDSGTSFTYLNDPAYTQISETFNSLAKEK-RETSTSDLPFEYCY 369
+ DSGT+FT + D AYT F + + R+ + F+ CY
Sbjct: 331 TGDGGTVLDSGTTFTRVVDDAYTAFRNAFAASNRSGLRKKVGAAAGFDDCY 381
>gi|15242307|ref|NP_199325.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|9758987|dbj|BAB09497.1| chloroplast nucleoid DNA-binding protein-like [Arabidopsis
thaliana]
gi|332007824|gb|AED95207.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 491
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 87/337 (25%), Positives = 138/337 (40%), Gaps = 73/337 (21%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-----DCVSCVHGLNSSSGQVIDFNIYSPNT 159
+ +++G P + V LDTGSDL W+PC DC+ C N+ + +++SP
Sbjct: 83 YLITLNIGTPPQAVQVYLDTGSDLTWVPCGNLSFDCIECYDLKNN---DLKSPSVFSPLH 139
Query: 160 SSTSSKVPCNSTLC-ELQKQ------CPSAGSN------------CPYQVRYLSDGTMST 200
SSTS + C S+ C E+ C AG + CP +G + +
Sbjct: 140 SSTSFRDSCASSFCVEIHSSDNPFDPCAVAGCSVSMLLKSTCVRPCPSFAYTYGEGGLIS 199
Query: 201 GFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILA 260
G L D+L T + R SFGC T ++ + P G+ G G S+PS L
Sbjct: 200 GILTRDILKARTRDV------PRFSFGC---VTSTYRE---PIGIAGFGRGLLSLPSQL- 246
Query: 261 NQGLIPNSFSMCF-------GSDGTGRISFGDKG-----SPGQGETPFSLRQTHP-TYNI 307
G + FS CF + + + G + TP +P +Y I
Sbjct: 247 --GFLEKGFSHCFLPFKFVNNPNISSPLILGASALSINLTDSLQFTPMLNTPMYPNSYYI 304
Query: 308 TITQVSVGGNAVNFEFS-------------AIFDSGTSFTYLNDPAYTQISETFNSLAKE 354
+ +++G N + + DSGT++T+L +P Y+Q+ T S
Sbjct: 305 GLESITIGTNITPTQVPLTLRQFDSQGNGGMLVDSGTTYTHLPEPFYSQLLTTLQSTITY 364
Query: 355 KRETST-SDLPFEYCYVL----RSFLHLQALVVLPFP 386
R T T S F+ CY + + L+ V++ FP
Sbjct: 365 PRATETESRTGFDLCYKVPCPNNNLTSLENDVMMIFP 401
>gi|242092892|ref|XP_002436936.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
gi|241915159|gb|EER88303.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
Length = 469
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 75/262 (28%), Positives = 113/262 (43%), Gaps = 41/262 (15%)
Query: 112 GQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNST 171
G PA+ ++ +DTGSDL W+ C NSS+ ++ P+ SST + VPC S
Sbjct: 129 GTPAVPQVLLIDTGSDLSWVQC------QPCNSSTCYPQKDPVFDPSASSTYAPVPCGSE 182
Query: 172 LCE------LQKQC---PSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
C C S S C Y ++Y +G + G + L L+ ++ +V +
Sbjct: 183 ACRDLDPDSYANGCTNSSSGASLCQYGIQY-GNGDTTVGVYSTETLTLS---PEAATVVN 238
Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQ--GLIPNSFSMCF--GSDGT 278
SFGCG VQ G F + P L +Q G +FS C G+
Sbjct: 239 NFSFGCGLVQKGVFDLFDG-------LLGLGGAPESLVSQTTGTYGGAFSYCLPAGNSTA 291
Query: 279 GRISFGDKGSPGQGE-----TPFSLRQTHPTYNITITQVSVGGNAVNFEFSA-----IFD 328
G ++ G + G TP + +T Y + +T +SVGG ++ E + I D
Sbjct: 292 GFLALGAPATGGNNTAGFQFTPLQVVETT-FYLVKLTGISVGGKQLDIEPTVFAGGMIID 350
Query: 329 SGTSFTYLNDPAYTQISETFNS 350
SGT T L + AY+ + F S
Sbjct: 351 SGTIVTGLPETAYSALRTAFRS 372
>gi|297817972|ref|XP_002876869.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297322707|gb|EFH53128.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 462
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 75/270 (27%), Positives = 111/270 (41%), Gaps = 45/270 (16%)
Query: 108 NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
+S+G PA+ + +DTGSDL W C C C I+ P SS+ SKV
Sbjct: 111 ELSIGNPAVKYAAIVDTGSDLIWTQCKPCTECFDQPTP---------IFDPEKSSSYSKV 161
Query: 167 PCNSTLCEL--QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRI 224
C+S LC + C +C Y Y D + + G L + + ++ S I
Sbjct: 162 GCSSGLCNALPRSNCNEDKDSCEYLYTY-GDYSSTRGLLATETFTF-----EDENSISGI 215
Query: 225 SFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGL---------IPNSFSMCFG 274
FGCG G DG + +GL GLG S+ S L S S+ G
Sbjct: 216 GFGCGVENEG---DGFSQGSGLVGLGRGPLSLISQLKETKFSYCLTSIEDSEASSSLFIG 272
Query: 275 SDGTGRISFGDKGSPGQGETPFSLRQT--HPT-YNITITQVSVGGNAVNFEFSA------ 325
S +G ++ G+ SL + P+ Y + + ++VG ++ E S
Sbjct: 273 SLASGIVNKTGANLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELSED 332
Query: 326 -----IFDSGTSFTYLNDPAYTQISETFNS 350
I DSGT+ TYL + A+ + E F S
Sbjct: 333 GTGGMIIDSGTTITYLEETAFKVLKEEFTS 362
>gi|357481205|ref|XP_003610888.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512223|gb|AES93846.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 413
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 80/287 (27%), Positives = 127/287 (44%), Gaps = 40/287 (13%)
Query: 111 VGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCN 169
+G P + +DTGSDL W+ C C+ C + +N ++ P SST + + C+
Sbjct: 70 IGTPPIKISGTVDTGSDLIWVQCVPCLGCYNQINP---------MFDPLKSSTYTNISCD 120
Query: 170 STLC--ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFG 227
S LC +C S C Y Y +D +++ G L ++ + L ++ + S+ I FG
Sbjct: 121 SPLCYKPYIGEC-SPEKRCDYTYGY-ADSSLTKGVLAQETVTLTSNTGKPISLQG-ILFG 177
Query: 228 CGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILA--------NQGLIPNSFSMCFGSDGTG 279
CG TG+F D GL GLG TS+ S + +Q L+P + S
Sbjct: 178 CGHNNTGNFNDHEM--GLIGLGGGPTSLVSQIGPLFGGKKFSQCLVPFLTDITISS---- 231
Query: 280 RISFGDKGSPGQGE----TPFSLRQTHPT-YNITITQVSVGG-----NAVNFEFSAIFDS 329
++SFG KGS GE TP R+ T Y +T+ +SV N+ + + + DS
Sbjct: 232 QMSFG-KGSEVLGEGVVTTPLVQREQDMTSYYVTLLGISVEDTYLPMNSTIEKGNMLVDS 290
Query: 330 GTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRSFLH 376
GT L Y ++ + + T L + CY ++ L
Sbjct: 291 GTPPNILPQQLYDRVYVEVKNKVPLEPITDDPSLGPQLCYRTQTNLK 337
>gi|302817380|ref|XP_002990366.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
gi|300141928|gb|EFJ08635.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
Length = 420
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 82/288 (28%), Positives = 119/288 (41%), Gaps = 41/288 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ + VG PA S + DTGSD+ WL C C C + I++P+ SS+
Sbjct: 81 YFARIGVGTPARSVYMVADTGSDVSWLQCSPCRKCYRQQDP---------IFNPSLSSSF 131
Query: 164 SKVPCNSTLC-ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
+ C S++C +L+ + S + C YQV Y DG+ + G + L +S
Sbjct: 132 KPLACASSICGKLKIKGCSRKNECMYQVSY-GDGSFTVGDFSTETLSFGEHAVRS----- 185
Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGTG 279
++ GCGR G F A L GLG S PS + FS C S
Sbjct: 186 -VAMGCGRNNQGLFHGAAG---LLGLGRGPLSFPSQTGTS--YASVFSYCLPRRESAIAA 239
Query: 280 RISFGDKGSPGQGETPFSL--RQTHPTYNITITQVSVGGNAVNFEFSA-----------I 326
+ FG P + L R+ Y + + ++ V G+ VN A I
Sbjct: 240 SLVFGPSAVPEKARFTKLLPNRRLDTYYYVGLARIRVAGSPVNIPPDAFAMGSRGTGGVI 299
Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRSF 374
DSGT+ + L PAYT + + F SL S F+ CY L S
Sbjct: 300 VDSGTAISRLTTPAYTALRDAFRSLVTFPSAPGIS--LFDTCYDLSSM 345
>gi|242050430|ref|XP_002462959.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
gi|241926336|gb|EER99480.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
Length = 448
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 101/401 (25%), Positives = 148/401 (36%), Gaps = 64/401 (15%)
Query: 1 MASSYRNSPVCVLLILLSCCAGCCFGFGTFGFDFHHRYSDP-VKGILAVDDLPKKGSFAY 59
M+SS +L+ L CA G + +SDP + V D ++
Sbjct: 1 MSSSTSQMASLAVLVFLVVCATLASGAASVRVGLTRIHSDPDITAPEFVRDALRRD---- 56
Query: 60 YSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFI 119
HR + L GR LA +D T ++ D G + +S+G P LS+
Sbjct: 57 ----MHRQQSRSLFGRELAE--SDGTTVSARTRKDLPN----GGEYLMTLSIGTPPLSYP 106
Query: 120 VALDTGSDLFW---LPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE-- 174
DTGSDL W PC C +Y+P +S+T +PCNS+L
Sbjct: 107 AIADTGSDLIWTQCAPCSGDQCF---------AQPAPLYNPASSTTFGVLPCNSSLSMCA 157
Query: 175 --LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQ 232
L + P G C Y Y + T G + + V I+FGC
Sbjct: 158 GVLAGKAPPPGCACMYNQTYGTGWT--AGVQGSETFTFGSAAADQARVPG-IAFGCSNAS 214
Query: 233 TGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GSDGTGRISFGDKGS 288
+ + +G+A GL GLG S+ S L FS C ++ T + G +
Sbjct: 215 SSDW-NGSA--GLVGLGRGSLSLVSQLGA-----GRFSYCLTPFQDTNSTSTLLLGPSAA 266
Query: 289 ---PGQGETPFSLRQTHPT----YNITITQVSVGGNAVNFEFSA-----------IFDSG 330
G TPF Y + +T +S+G A++ A I DSG
Sbjct: 267 LNGTGVRSTPFVASPAKAPMSTYYYLNLTGISLGAKALSISPDAFSLKADGTGGLIIDSG 326
Query: 331 TSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVL 371
T+ T L + AY Q+ SL + + CY L
Sbjct: 327 TTITSLVNAAYQQVRAAVQSLVTLPAIDGSDSTGLDLCYAL 367
>gi|226493786|ref|NP_001142400.1| uncharacterized protein LOC100274575 [Zea mays]
gi|194708650|gb|ACF88409.1| unknown [Zea mays]
Length = 392
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 82/296 (27%), Positives = 126/296 (42%), Gaps = 50/296 (16%)
Query: 109 VSVGQPALSFIVALDTGSDLFWLPC-DCVS-CVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
+++G P L + DTGSDL W C C S C +Y+P++S+T + +
Sbjct: 36 LAIGTPPLPYQAIADTGSDLIWTQCAPCTSQCFRQPTP---------LYNPSSSTTFAVL 86
Query: 167 PCNSTLC------ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
PCNS+L P G C Y V Y S T + F + + V
Sbjct: 87 PCNSSLSVCAAALAGTGTAPPPGCACTYNVTYGSGWT--SVFQGSETFTFGSTPAGHARV 144
Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GSD 276
I+FGC +G + ++ +GL GLG + S+ S L +P FS C ++
Sbjct: 145 PG-IAFGCSTASSG--FNASSASGLVGLGRGRLSLVSQLG----VPK-FSYCLTPYQDTN 196
Query: 277 GTGRISFGD----KGSPGQGETPF-SLRQTHPT---YNITITQVSVGGNAVN-----FEF 323
T + G G+ G TPF + T P Y + +T +S+G A++ F
Sbjct: 197 STSTLLLGPSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALSIPPDAFSL 256
Query: 324 SA------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRS 373
+A I DSGT+ T L + AY Q+ SL ++D + C++L S
Sbjct: 257 NADGTGGLIIDSGTTITLLGNTAYQQVRAAVVSLVTLPTTDGSADTGLDLCFMLPS 312
>gi|356557014|ref|XP_003546813.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 435
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 72/247 (29%), Positives = 109/247 (44%), Gaps = 30/247 (12%)
Query: 111 VGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCN 169
+G P + + +DTGS L WL C C +C + ++ P SST C+
Sbjct: 95 IGSPPVERLAMVDTGSSLIWLQCSPCHNCFPQ---------ETPLFEPLKSSTYKYATCD 145
Query: 170 STLCEL----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL-ATDEKQSKSVDSRI 224
S C L Q+ C G C Y + Y D + S G L + L +T Q+ S + I
Sbjct: 146 SQPCTLLQPSQRDCGKLG-QCIYGIMY-GDKSFSVGILGTETLSFGSTGGAQTVSFPNTI 203
Query: 225 SFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDGTGRI 281
FGCG + G+ GLG S+ S L Q I + FS C + S T ++
Sbjct: 204 -FGCGVDNNFTIYTSNKVMGIAGLGAGPLSLVSQLGAQ--IGHKFSYCLLPYDSTSTSKL 260
Query: 282 SFGDKG---SPGQGETPFSLRQTHPTYN-ITITQVSVGGNAVN---FEFSAIFDSGTSFT 334
FG + + G TP ++ + PTY + + V++G V+ + + + DSGT T
Sbjct: 261 KFGSEAIITTNGVVSTPLIIKPSLPTYYFLNLEAVTIGQKVVSTGQTDGNIVIDSGTPLT 320
Query: 335 YLNDPAY 341
YL + Y
Sbjct: 321 YLENTFY 327
>gi|54290731|dbj|BAD62401.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 521
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 92/330 (27%), Positives = 137/330 (41%), Gaps = 53/330 (16%)
Query: 62 ALAHRDRYFRLRGRGLAAQGND----KTPLTFSAGNDT----YRLNSLGFLHYT-NVSVG 112
+LA R R R R + + T L+ +AG T + +S+ L Y + +G
Sbjct: 119 SLAERLRRDRARTNYIVTKATGGRTAATALSDAAGGGTSIPTFLGDSVNSLEYVVTLGIG 178
Query: 113 QPALSFIVALDTGSDLFWL---PCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCN 169
PA+ V +DTGSDL W+ PC C + ++ P++SS+ + VPC+
Sbjct: 179 TPAVQQTVLIDTGSDLSWVQCKPCGAGECYAQKDP---------LFDPSSSSSYASVPCD 229
Query: 170 STLCELQKQCP----------SAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
S C A + C Y + Y + T +TG + L L +
Sbjct: 230 SDACRKLAAGAYGHGCTGVSGGAAALCEYGIEYGNRAT-TTGVYSTETLTL-----KPGV 283
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTG 279
V + FGCG Q G + +GL GLG S+ S ++Q P S+ + S G G
Sbjct: 284 VVADFGFGCGDHQHGPYEKF---DGLLGLGGAPESLVSQTSSQFGGPFSYCLPPTSGGAG 340
Query: 280 RISFG-------DKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA-----I 326
++ G + G TP + PT Y +T+T +SVGG + SA +
Sbjct: 341 FLTLGAPPNSSSSTAASGLSFTPMRRLPSVPTFYIVTLTGISVGGAPLAIPPSAFSSGMV 400
Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKR 356
DSGT T L AY + F S E R
Sbjct: 401 IDSGTVITGLPATAYAALRSAFRSAMSEYR 430
>gi|118486912|gb|ABK95290.1| unknown [Populus trichocarpa]
Length = 438
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 87/309 (28%), Positives = 125/309 (40%), Gaps = 54/309 (17%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+ V +G P + LDT +D W+PC S G +S++ + PN S+T
Sbjct: 98 YVVRVKLGTPGQQMFMVLDTSNDAAWVPC---SGCTGFSSTT--------FLPNASTTLG 146
Query: 165 KVPCNSTLCELQK--QCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
+ C+ C + CP+ GS+ C + Y D ++ T LV+D + LA D V
Sbjct: 147 SLDCSGAQCSQVRGFSCPATGSSACLFNQSYGGDSSL-TATLVQDAITLAND------VI 199
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDG---- 277
+FGC +G + P GL GLG S+ I + FS C S
Sbjct: 200 PGFTFGCINAVSGGSI---PPQGLLGLGRGPISL--ISQAGAMYSGVFSYCLPSFKSYYF 254
Query: 278 TGRISFGDKGSPGQGETPFSLRQTH-PT-YNITITQVSVGGNAV-----------NFEFS 324
+G + G G P T LR H P+ Y + +T VSVG V N
Sbjct: 255 SGSLKLGPVGQPKSIRTTPLLRNPHRPSLYYVNLTGVSVGRIKVPIPSEQLVFDPNTGAG 314
Query: 325 AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRS-------FLHL 377
I DSGT T P Y I + F K+ +S F+ C+ + LH
Sbjct: 315 TIIDSGTVITRFVQPVYFAIRDEFR---KQVNGPISSLGAFDTCFAATNEAEAPAITLHF 371
Query: 378 QAL-VVLPF 385
+ L +VLP
Sbjct: 372 EGLNLVLPM 380
>gi|116786826|gb|ABK24255.1| unknown [Picea sitchensis]
gi|148906052|gb|ABR16185.1| unknown [Picea sitchensis]
Length = 485
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 81/288 (28%), Positives = 122/288 (42%), Gaps = 42/288 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+++ + VG P ++ LDTGSD+ W+ C+ C C + IY+P SS+
Sbjct: 145 YFSRIGVGAPRRDQLMVLDTGSDVTWIQCEPCSDCYQQSDP---------IYNPALSSSY 195
Query: 164 SKVPCNSTLC-ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
V C + LC +L S +C YQV Y DG+ + G + L L Q+
Sbjct: 196 KLVGCQANLCQQLDVSGCSRNGSCLYQVSY-GDGSYTQGNFATETLTLGGAPLQN----- 249
Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQ-GLIPNSFSMCF---GSDGT 278
++ GCG G F+ A GL S PS L ++ G I FS C S+ +
Sbjct: 250 -VAIGCGHDNEGLFVGAAGLLGLG---GGSLSFPSQLTDENGKI---FSYCLVDRDSESS 302
Query: 279 GRISFGDKGSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEFS-----------A 325
+ FG P L+ + Y ++++ +SVGG ++ S
Sbjct: 303 STLQFGRAAVPNGAVLAPMLKNSRLDTFYYVSLSGISVGGKMLSISDSVFGIDASGNGGV 362
Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRS 373
I DSGT+ T L AY + + F + K T L F+ CY L S
Sbjct: 363 IVDSGTAVTRLQTAAYDSLRDAFRAGTKNLPSTDGVSL-FDTCYDLSS 409
>gi|296085498|emb|CBI29230.3| unnamed protein product [Vitis vinifera]
Length = 542
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 78/275 (28%), Positives = 120/275 (43%), Gaps = 36/275 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ N+ +G P + I +DTGSDL W C C C QV+ ++ P SST
Sbjct: 92 YLMNLYIGTPPVPVIAIVDTGSDLTWTQCRPCTHCYK-------QVVP--LFDPKNSSTY 142
Query: 164 SKVPCNSTLCEL--QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
C ++ C + + S C ++ Y +DG+ + G L + L D K V
Sbjct: 143 RDSSCGTSFCLALGKDRSCSKEKKCTFRYSY-ADGSFTGGNLASET--LTVDSTAGKPVS 199
Query: 222 -SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-----GS 275
+FGCG G F + +G+ GLG + S+ S L + I FS C S
Sbjct: 200 FPGFAFGCGHSSGGIF--DKSSSGIVGLGGGELSLISQL--KSTINGLFSYCLLPVSTDS 255
Query: 276 DGTGRISFGDKGS-PGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIFDSGTSFT 334
+ RI+FG G G G LR + Y+ T+V G + I DSGT++T
Sbjct: 256 SISSRINFGASGRVSGYGTVSTPLRLPYKGYS-KKTEVEEG--------NIIVDSGTTYT 306
Query: 335 YLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
+L Y+++ ++ + K KR + + F CY
Sbjct: 307 FLPQEFYSKLEKSVANSIKGKRVRDPNGI-FSLCY 340
>gi|357514995|ref|XP_003627786.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521808|gb|AET02262.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 436
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 75/269 (27%), Positives = 110/269 (40%), Gaps = 37/269 (13%)
Query: 110 SVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPC 168
SVG P +DTGSD+ WL C+ C C + +++P+ SS+ +PC
Sbjct: 92 SVGTPPFKLYGIVDTGSDIVWLQCEPCQECYNQTTP---------MFNPSKSSSYKNIPC 142
Query: 169 NSTLCELQKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFG 227
S LC+ + N C Y Y D + S G L D L L + + S I G
Sbjct: 143 PSKLCQSMEDTSCNDKNYCEYST-YYGDNSHSGGDLSVDTLTLESTNGLTVSF-PNIVIG 200
Query: 228 CGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---------GSDGT 278
CG S+ +GA+ +G+ G G S + L + FS C S+ T
Sbjct: 201 CGTNNILSY-EGAS-SGIVGFGSGPASFITQLGSS--TGGKFSYCLTPLFSVTNIQSNAT 256
Query: 279 GRISFGDKGS---PGQGETPFSLRQTHPTYNITITQVSVGGNAVNF--------EFSAIF 327
+++FGD + G TP + Y +T+ SVG V E + I
Sbjct: 257 SKLNFGDAATVSGDGVVTTPILKKDPETFYYLTLEAFSVGNRRVEIGGVPNGDNEGNIII 316
Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKR 356
DSGT+ T L Y+ + L K +R
Sbjct: 317 DSGTTLTSLTKDDYSFLESAVVDLVKLER 345
>gi|414584780|tpg|DAA35351.1| TPA: hypothetical protein ZEAMMB73_696016 [Zea mays]
Length = 524
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 76/308 (24%), Positives = 119/308 (38%), Gaps = 51/308 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ VSVG P + +D+GSD+ W+ C C+ C V ++ P TS+T
Sbjct: 171 YLVRVSVGSPPTEQYLVVDSGSDVMWVQCKPCLECY---------VQADPLFDPATSATF 221
Query: 164 SKVPCNSTLCELQKQCPSAGSN---CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
S V C S +C + C Y+V Y +DG+ + G L + L L +
Sbjct: 222 SGVSCGSAICRILPTSACGDGELGGCEYEVSY-ADGSYTKGALALETLTLGGTAVEG--- 277
Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDG--- 277
+ GCG G F+ A GL GLG S+ L G + +FS C S G
Sbjct: 278 ---VVIGCGHRNRGLFVGAA---GLMGLGWGPMSLVGQLG--GEVGGAFSYCLASRGGYG 329
Query: 278 -------TGRISFGDKGSPGQGE--TPFSLRQTHPT-YNITITQVSVGGNAVNFEFS--- 324
G + G + +G P P+ Y + ++ + VG + +
Sbjct: 330 SGAADDDAGWLVLGRSEAVPEGAVWVPLVRNPRAPSFYYVGLSGIEVGDERLPLQAGLFQ 389
Query: 325 --------AIFDSGTSFTYLNDPAYTQISETF-NSLAKE-KRETSTSDLPFEYCYVLRSF 374
+ D+GT+ T L AY + + F +LA R S + CY L +
Sbjct: 390 LTEDGAGDVVMDTGTTVTRLPQEAYAALRDAFVGALAGAVPRAQGVSSSVLDTCYDLSGY 449
Query: 375 LHLQALVV 382
++ V
Sbjct: 450 ASVRVPTV 457
>gi|449432044|ref|XP_004133810.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 471
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 82/292 (28%), Positives = 121/292 (41%), Gaps = 52/292 (17%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++T + VG P + LDTGSD+ WL C C +C + +++P S +
Sbjct: 129 YFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDP---------VFNPVKSGSF 179
Query: 164 SKVPCNSTLCELQKQCPSAGSN----CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
+KV C + LC ++ S G N C YQV Y DG+ +TG V + L + +
Sbjct: 180 AKVLCRTPLC---RRLESPGCNQRQTCLYQVSY-GDGSYTTGEFVTETLTFRRTKVE--- 232
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GS 275
+++ GCG G F+ A GL G+ S NQ FS C S
Sbjct: 233 ---QVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRTFNQ-----KFSYCLVDRSAS 284
Query: 276 DGTGRISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGGNAV------NFEFS- 324
+ FG+ F+ T+P Y + + +SVGG V +F+
Sbjct: 285 SKPSSVVFGNSAVSRTAR--FTPLLTNPRLDTFYYVELLGISVGGTPVSGITASHFKLDR 342
Query: 325 -----AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVL 371
I D GTS T LN PAY + + F + A + L F+ CY L
Sbjct: 343 TGNGGVIIDCGTSVTRLNKPAYIALRDAFRAGASSLKSAPEFSL-FDTCYDL 393
>gi|356504173|ref|XP_003520873.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 461
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 85/303 (28%), Positives = 119/303 (39%), Gaps = 44/303 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++T + VG PA + LDTGSD+ WL C C C + +++ P S T
Sbjct: 118 YFTRIGVGTPARYVYMVLDTGSDVVWLQCAPCRKCYTQTD---------HVFDPTKSRTY 168
Query: 164 SKVPCNSTLCELQKQ--CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
+ +PC + LC C + C YQV Y DG+ + G + L +
Sbjct: 169 AGIPCGAPLCRRLDSPGCSNKNKVCQYQVSY-GDGSFTFGDFSTETLTFRRNRV------ 221
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GSDG 277
+R++ GCG G F GL GLG + S P + + FS C S
Sbjct: 222 TRVALGCGHDNEGLF---TGAAGLLGLGRGRLSFPVQTGRR--FNHKFSYCLVDRSASAK 276
Query: 278 TGRISFGDKGSPGQGE-TPFSLRQTHPT-YNITITQVSVGGNAVN------FEFSA---- 325
+ FGD TP T Y + + +SVGG V F A
Sbjct: 277 PSSVIFGDSAVSRTAHFTPLIKNPKLDTFYYLELLGISVGGAPVRGLSASLFRLDAAGNG 336
Query: 326 --IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRSFLHLQA-LVV 382
I DSGTS T L PAY + + F A + L F+ C+ L ++ VV
Sbjct: 337 GVIIDSGTSVTRLTRPAYIALRDAFRIGASHLKRAPEFSL-FDTCFDLSGLTEVKVPTVV 395
Query: 383 LPF 385
L F
Sbjct: 396 LHF 398
>gi|190896584|gb|ACE96805.1| aspartyl protease [Populus tremula]
gi|190896586|gb|ACE96806.1| aspartyl protease [Populus tremula]
gi|190896588|gb|ACE96807.1| aspartyl protease [Populus tremula]
gi|190896590|gb|ACE96808.1| aspartyl protease [Populus tremula]
gi|190896592|gb|ACE96809.1| aspartyl protease [Populus tremula]
gi|190896594|gb|ACE96810.1| aspartyl protease [Populus tremula]
gi|190896596|gb|ACE96811.1| aspartyl protease [Populus tremula]
gi|190896598|gb|ACE96812.1| aspartyl protease [Populus tremula]
gi|190896600|gb|ACE96813.1| aspartyl protease [Populus tremula]
gi|190896602|gb|ACE96814.1| aspartyl protease [Populus tremula]
gi|190896604|gb|ACE96815.1| aspartyl protease [Populus tremula]
gi|190896606|gb|ACE96816.1| aspartyl protease [Populus tremula]
gi|190896610|gb|ACE96818.1| aspartyl protease [Populus tremula]
gi|190896612|gb|ACE96819.1| aspartyl protease [Populus tremula]
gi|190896614|gb|ACE96820.1| aspartyl protease [Populus tremula]
gi|190896616|gb|ACE96821.1| aspartyl protease [Populus tremula]
gi|190896618|gb|ACE96822.1| aspartyl protease [Populus tremula]
gi|190896620|gb|ACE96823.1| aspartyl protease [Populus tremula]
gi|190896622|gb|ACE96824.1| aspartyl protease [Populus tremula]
gi|190896624|gb|ACE96825.1| aspartyl protease [Populus tremula]
gi|190896626|gb|ACE96826.1| aspartyl protease [Populus tremula]
gi|190896628|gb|ACE96827.1| aspartyl protease [Populus tremula]
gi|190896630|gb|ACE96828.1| aspartyl protease [Populus tremula]
gi|190896632|gb|ACE96829.1| aspartyl protease [Populus tremula]
gi|190896634|gb|ACE96830.1| aspartyl protease [Populus tremula]
gi|190896636|gb|ACE96831.1| aspartyl protease [Populus tremula]
gi|190896638|gb|ACE96832.1| aspartyl protease [Populus tremula]
gi|190896640|gb|ACE96833.1| aspartyl protease [Populus tremula]
gi|190896642|gb|ACE96834.1| aspartyl protease [Populus tremula]
gi|190896644|gb|ACE96835.1| aspartyl protease [Populus tremula]
gi|190896646|gb|ACE96836.1| aspartyl protease [Populus tremula]
gi|190896648|gb|ACE96837.1| aspartyl protease [Populus tremula]
gi|190896650|gb|ACE96838.1| aspartyl protease [Populus tremula]
gi|190896652|gb|ACE96839.1| aspartyl protease [Populus tremula]
gi|190896654|gb|ACE96840.1| aspartyl protease [Populus tremula]
gi|190896656|gb|ACE96841.1| aspartyl protease [Populus tremula]
gi|190896658|gb|ACE96842.1| aspartyl protease [Populus tremula]
Length = 339
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 85/310 (27%), Positives = 121/310 (39%), Gaps = 56/310 (18%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ V +G P + LDT +D W+PC C C + PN S+T
Sbjct: 45 YVVRVKLGTPGQQMFMVLDTSNDAAWVPCSGCTGCSS------------TTFLPNASTTL 92
Query: 164 SKVPCNSTLCELQK--QCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
+ C+ C + CP+ GS+ C + Y D +++ LV+D + LA D V
Sbjct: 93 GSLDCSEAQCSQVRGFSCPATGSSACLFNQSYGGDSSLA-ATLVQDAITLAND------V 145
Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDG--- 277
+FGC +G + P GL GLG S+ I + FS C S
Sbjct: 146 IPGFTFGCINAVSGGSIP---PQGLLGLGRGPISL--ISQAGAMYSGVFSYCLPSFKSYY 200
Query: 278 -TGRISFGDKGSPGQGETPFSLRQTH-PT-YNITITQVSVGGNAV-----------NFEF 323
+G + G G P T LR H P+ Y + +T VSVG V N
Sbjct: 201 FSGSLKLGPVGQPKSIRTTPLLRNPHRPSLYYVNLTGVSVGRIKVPIPSEQLVFDPNTGA 260
Query: 324 SAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRS-------FLH 376
I DSGT T P Y I + F K+ +S F+ C+ + LH
Sbjct: 261 GTIIDSGTVITRFVQPVYFAIRDEFR---KQVNGPISSLGAFDTCFAATNEAEAPAVTLH 317
Query: 377 LQAL-VVLPF 385
+ L +VLP
Sbjct: 318 FEGLNLVLPM 327
>gi|115484725|ref|NP_001067506.1| Os11g0215400 [Oryza sativa Japonica Group]
gi|77549255|gb|ABA92052.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113644728|dbj|BAF27869.1| Os11g0215400 [Oryza sativa Japonica Group]
Length = 428
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 81/298 (27%), Positives = 126/298 (42%), Gaps = 56/298 (18%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
L+ +V +G PA + IV +DTGS W+ C+C C H + + + S+T
Sbjct: 81 LYVISVGLGTPAKTQIVEIDTGSSTSWVFCECDGC-H---------TNPRTFLQSRSTTC 130
Query: 164 SKVPCNSTLCELQKQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
+KV C +++C L P +CP++V Y DG+ S G L +D L + +K
Sbjct: 131 AKVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSY-QDGSASYGILYQDTLTFSDVQKIPG 189
Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC------ 272
SFGC G+ G +GL G+G SV L + FS C
Sbjct: 190 -----FSFGCNMDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPTFDCFSYCLPLQKS 240
Query: 273 ---FGSDGTGRISFGDKGSPGQGE-TPFSLRQTHPT-YNITITQVSVGGNAVNFEFS--- 324
F S TG S G + T R+ + + + +T +SV G + S
Sbjct: 241 ERGFFSKTTGYFSLGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVFS 300
Query: 325 ---AIFDSGTSFTYLNDPAYTQISETFNSL------AKEKRETSTSDLPFEYCYVLRS 373
+FDSG+ +Y+ D A + +S+ L A+E+ E + CY +RS
Sbjct: 301 RKGVVFDSGSELSYIPDRALSVLSQRIRELLLKRGAAEEESERN--------CYDMRS 350
>gi|125589909|gb|EAZ30259.1| hypothetical protein OsJ_14308 [Oryza sativa Japonica Group]
Length = 178
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 41/127 (32%), Positives = 65/127 (51%), Gaps = 9/127 (7%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLP-CDCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
L+YT++ +G PA+ + V LDTGS FW+ C C H S + Y P +S +
Sbjct: 58 LYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPH----ESDILRKLTFYDPRSSVS 113
Query: 163 SSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLAT--DEKQSKSV 220
S +V C+ T+C + C + CPY Y +DG ++ G L D+LH Q++
Sbjct: 114 SKEVKCDDTICTSRPPC-NMTLRCPYITGY-ADGGLTMGILFTDLLHYHQLYGNGQTQPT 171
Query: 221 DSRISFG 227
+ ++FG
Sbjct: 172 STSVTFG 178
>gi|115479237|ref|NP_001063212.1| Os09g0423500 [Oryza sativa Japonica Group]
gi|50725896|dbj|BAD33424.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|50726136|dbj|BAD33657.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|113631445|dbj|BAF25126.1| Os09g0423500 [Oryza sativa Japonica Group]
gi|215767614|dbj|BAG99842.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222641596|gb|EEE69728.1| hypothetical protein OsJ_29412 [Oryza sativa Japonica Group]
Length = 473
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 92/317 (29%), Positives = 133/317 (41%), Gaps = 55/317 (17%)
Query: 84 KTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGL 142
+ P+T A RL +L ++ + G+ V +DT S+L W+ C C SC
Sbjct: 113 RVPVTSGA-----RLRTLNYVATVGLGGGEAT----VIVDTASELTWVQCAPCASC---- 159
Query: 143 NSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE-LQKQCPSAGS--------NCPYQVRYL 193
+ G + D P +S + + +PCNS+ C+ LQ SA +C Y + Y
Sbjct: 160 HDQQGPLFD-----PASSPSYAVLPCNSSSCDALQVATGSAAGACGGGEQPSCSYTLSY- 213
Query: 194 SDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKT 253
DG+ S G L D L LA + V FGCG G F +GL GLG +
Sbjct: 214 RDGSYSQGVLAHDKLSLAGE------VIDGFVFGCGTSNQGPF---GGTSGLMGLGRSQL 264
Query: 254 SVPSILANQGLIPNSFSMCF---GSDGTGRISFGDKGSPGQGETPF------SLRQTHPT 304
S+ S +Q FS C S+ +G + GD S + TP S P
Sbjct: 265 SLISQTMDQ--FGGVFSYCLPLKESESSGSLVLGDDTSVYRNSTPIVYTTMVSDPVQGPF 322
Query: 305 YNITITQVSVGGNAVNFEFSA---IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTS 361
Y + +T +++GG V E SA I DSGT T L Y + F S E +
Sbjct: 323 YFVNLTGITIGGQEV--ESSAGKVIVDSGTIITSLVPSVYNAVKAEFLSQFAEYPQAPGF 380
Query: 362 DLPFEYCYVLRSFLHLQ 378
+ + C+ L F +Q
Sbjct: 381 SI-LDTCFNLTGFREVQ 396
>gi|125563764|gb|EAZ09144.1| hypothetical protein OsI_31414 [Oryza sativa Indica Group]
Length = 472
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 92/317 (29%), Positives = 133/317 (41%), Gaps = 55/317 (17%)
Query: 84 KTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGL 142
+ P+T A RL +L ++ + G+ V +DT S+L W+ C C SC
Sbjct: 112 RVPVTSGA-----RLRTLNYVATVGLGGGEAT----VIVDTASELTWVQCAPCASC---- 158
Query: 143 NSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE-LQKQCPSAGS--------NCPYQVRYL 193
+ G + D P +S + + +PCNS+ C+ LQ SA +C Y + Y
Sbjct: 159 HDQQGPLFD-----PASSPSYAVLPCNSSSCDALQVATGSAAGACGGGEQPSCSYTLSY- 212
Query: 194 SDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKT 253
DG+ S G L D L LA + V FGCG G F +GL GLG +
Sbjct: 213 RDGSYSQGVLAHDKLSLAGE------VIDGFVFGCGTSNQGPF---GGTSGLMGLGRSQL 263
Query: 254 SVPSILANQGLIPNSFSMCF---GSDGTGRISFGDKGSPGQGETPF------SLRQTHPT 304
S+ S +Q FS C S+ +G + GD S + TP S P
Sbjct: 264 SLISQTMDQ--FGGVFSYCLPLKESESSGSLVLGDDTSVYRNSTPIVYTTMVSDPVQGPF 321
Query: 305 YNITITQVSVGGNAVNFEFSA---IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTS 361
Y + +T +++GG V E SA I DSGT T L Y + F S E +
Sbjct: 322 YFVNLTGITIGGQEV--ESSAGKVIVDSGTIITSLVPSVYNAVKAEFLSQFAEYPQAPGF 379
Query: 362 DLPFEYCYVLRSFLHLQ 378
+ + C+ L F +Q
Sbjct: 380 SI-LDTCFNLTGFREVQ 395
>gi|302795261|ref|XP_002979394.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
gi|300153162|gb|EFJ19802.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
Length = 353
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 82/288 (28%), Positives = 119/288 (41%), Gaps = 41/288 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ + VG PA S + DTGSD+ WL C C C + I++P+ SS+
Sbjct: 14 YFARIGVGTPARSVYMVADTGSDVSWLQCSPCRKCYRQQDP---------IFNPSLSSSF 64
Query: 164 SKVPCNSTLC-ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
+ C S++C +L+ + S + C YQV Y DG+ + G + L +S
Sbjct: 65 KPLACASSICGKLKIKGCSRKNKCMYQVSY-GDGSFTVGDFSTETLSFGEHAVRS----- 118
Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGTG 279
++ GCGR G F A L GLG S PS + FS C S
Sbjct: 119 -VAMGCGRNNQGLFHGAAG---LLGLGRGPLSFPSQTGTS--YASVFSYCLPRRESAIAA 172
Query: 280 RISFGDKGSPGQGETPFSL--RQTHPTYNITITQVSVGGNAVNFEFSA-----------I 326
+ FG P + L R+ Y + + ++ V G+ VN A I
Sbjct: 173 SLVFGPSAVPEKARFTKLLPNRRLDTYYYVGLARIRVAGSPVNIPPDAFAMGSRGTGGVI 232
Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRSF 374
DSGT+ + L PAYT + + F SL S F+ CY L S
Sbjct: 233 VDSGTAISRLTTPAYTALRDAFRSLVTFPSAPGISL--FDTCYDLSSM 278
>gi|413946455|gb|AFW79104.1| hypothetical protein ZEAMMB73_209101 [Zea mays]
Length = 480
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 81/306 (26%), Positives = 116/306 (37%), Gaps = 59/306 (19%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
++ VG PA F++ DTGSDL W+ C G +G ++ S + +
Sbjct: 112 YFVRFRVGTPAQPFVLVADTGSDLTWVKCS------GAGDGTGDA-PRRVFRAAASRSWA 164
Query: 165 KVPCNSTLCELQ-----KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
+ C+S C C S S C Y RY +DG+ + G + D +A +S+
Sbjct: 165 PIACSSDTCTSYVPFSLANCSSPASPCAYDYRY-NDGSAARGVVGTDSATIALSGSESRD 223
Query: 220 VDSR------ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
R + GC G + +G+ LG S S A + FS C
Sbjct: 224 GGGRRAKLQGVVLGCTASYDGQSFQSS--DGVLSLGNSNISFASRAAAR--FGGRFSYCL 279
Query: 274 -----GSDGTGRISFGDKGSPG-----------QGETPFSL-RQTHPTYNITITQVSVGG 316
+ T ++FG G G TP L R+ P Y + + V V G
Sbjct: 280 VDHLAPRNATSYLTFGPPGPEGGAAASSSSSSAAARTPLLLDRRMSPFYAVAVDAVHVAG 339
Query: 317 NAVNFEFS---------AIFDSGTSFTYLNDPAYTQI----SETFNSLAKEKRETSTSDL 363
A++ AI DSGTS T L PAY + SE L + +
Sbjct: 340 EALDIPADVWDVARGGGAILDSGTSLTVLATPAYRAVVAALSERLAGLPRVSMD------ 393
Query: 364 PFEYCY 369
PFEYCY
Sbjct: 394 PFEYCY 399
>gi|413925432|gb|AFW65364.1| hypothetical protein ZEAMMB73_378208 [Zea mays]
Length = 418
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 93/304 (30%), Positives = 131/304 (43%), Gaps = 53/304 (17%)
Query: 60 YSALAHR--DRYFRLRGR-GLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPAL 116
++ AHR +R L R G A+ G+ ++PL +G Y + S+G P
Sbjct: 42 FTRAAHRSRERLSILATRLGAASAGSAQSPLQMDSGGGAYDMT---------FSMGTPPQ 92
Query: 117 SFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE- 174
+ DTGSDL W C C C ++S Y P SS+ SK+PC+S LC
Sbjct: 93 TLSALADTGSDLIWAKCGACKRCAPRGSAS---------YYPTKSSSFSKLPCSSALCRT 143
Query: 175 LQKQ-------CPSAGSNCPYQVRY-LSDGT--MSTGFLVEDVLHLATDEKQSKSVDSRI 224
L+ Q + G+ C Y+ Y LS + G++ + L +D Q I
Sbjct: 144 LESQSLATCGGTRARGAVCSYRYSYGLSSNPHHYTQGYMGSETFTLGSDAVQG------I 197
Query: 225 SFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD--GTGRIS 282
FGC T S + +GL GLG K S L Q L +FS C SD + +
Sbjct: 198 GFGC---TTMSEGGYGSGSGLVGLGRGKLS----LVRQ-LKVGAFSYCLTSDPSTSSPLL 249
Query: 283 FGDKG--SPGQGETPFSLRQTHPTYNITITQVSVGGNAV--NFEFSAIFDSGTSFTYLND 338
FG PG TP +T Y + + +S+G IFDSGT+ T+L +
Sbjct: 250 FGAGALTGPGVQSTPLVNLKTSTFYTVNLDSISIGAAKTPGTGRHGIIFDSGTTLTFLAE 309
Query: 339 PAYT 342
PAYT
Sbjct: 310 PAYT 313
>gi|357450869|ref|XP_003595711.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355484759|gb|AES65962.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 443
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 79/286 (27%), Positives = 116/286 (40%), Gaps = 42/286 (14%)
Query: 109 VSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
+S+G P + +DTGSDL WL C C +C LN ++ P +SST S +
Sbjct: 63 LSIGTPPVKTYAQVDTGSDLIWLQCIPCTNCYKQLNP---------MFDPQSSSTYSNIA 113
Query: 168 CNSTLCE--LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
S C C +NC Y Y D +++ G L ++ L L + + ++ I
Sbjct: 114 YGSESCSKLYSTSCSPDQNNCNYTYSY-EDDSITEGVLAQETLTLTSTTGKPVALKGVI- 171
Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-----GSDGTGR 280
FGCG G F D G+ GLG S+ S + + FS C T
Sbjct: 172 FGCGHNNNGVFNDKEM--GIIGLGRGPLSLVSQIGS-SFGGKMFSQCLVPFHTNPSITSP 228
Query: 281 ISFGDKGSPGQGE----TPFSLRQTHPT-YNITITQVSVGGNAVNFEF------------ 323
+SFG KGS G TP + TH Y +T+ +SV +N F
Sbjct: 229 MSFG-KGSEVLGNGVVSTPLVSKNTHQAFYFVTLLGISV--EDINLPFNDGSSLEPITKG 285
Query: 324 SAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
+ + DSGT T L + Y ++ E + L ++ CY
Sbjct: 286 NMVIDSGTPTTLLPEDFYHRLVEEVRNKVALDPIPIDPTLGYQLCY 331
>gi|88174575|gb|ABD39362.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
Group]
Length = 321
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 76/291 (26%), Positives = 121/291 (41%), Gaps = 44/291 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+ T+V +G PA + IV +DTGS W+ C+C C H + + + S+T +
Sbjct: 1 YVTSVGLGTPAKTQIVEIDTGSSTSWVFCECDGC-H---------TNPRTFLQSRSTTCA 50
Query: 165 KVPCNSTLCELQKQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
KV C +++C L P +CP++V Y DG+ S G L +D L + +K
Sbjct: 51 KVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSY-QDGSASYGILYQDTLTFSDVQKIPS- 108
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC------- 272
+FGC G+ G +GL G+G SV L + FS C
Sbjct: 109 ----FTFGCNLDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPTFDGFSYCLPLQKSE 160
Query: 273 --FGSDGTGRISFGDKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFS---- 324
F S TG S G + + R+ + + + +SV G + S
Sbjct: 161 RGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSR 220
Query: 325 --AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRS 373
+FDSG+ +Y+ D A + +S+ L R + + CY +RS
Sbjct: 221 KGVVFDSGSELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRS 269
>gi|255588450|ref|XP_002534607.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223524923|gb|EEF27776.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 260
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 51/167 (30%), Positives = 82/167 (49%), Gaps = 17/167 (10%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
T + +G P F + +DTGS++ ++PC C G G+ D T S+S+
Sbjct: 52 TKLYIGTPPQEFTLVVDTGSNMTFVPC----C--GSEEYCGKHEDPAF---QTESSSTYQ 102
Query: 167 PCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISF 226
P N C C S C Y++ Y DG+ S G L ED++ +S+ R+ F
Sbjct: 103 PVN---CHPSCDCDYLRSQCSYKMHY-GDGSYSRGVLAEDIISFG---NESEFAPQRLVF 155
Query: 227 GCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
GC GS A +G+ GLG ++++ L ++G+I +SFS+C+
Sbjct: 156 GCELDAIGSLYSLRA-DGIIGLGRGRSTIVDQLVDKGVISDSFSLCY 201
>gi|88174591|gb|ABD39370.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 76/291 (26%), Positives = 121/291 (41%), Gaps = 44/291 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+ T+V +G PA + IV +DTGS W+ C+C C H + + + S+T +
Sbjct: 1 YVTSVGLGTPAKTQIVEIDTGSSTSWVFCECDGC-H---------TNPRTFLQSRSTTCA 50
Query: 165 KVPCNSTLCELQKQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
KV C +++C L P +CP++V Y DG+ S G L +D L + +K
Sbjct: 51 KVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSY-QDGSASYGILYQDTLTFSDVQKIPS- 108
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC------- 272
+FGC G+ G +GL G+G SV L + FS C
Sbjct: 109 ----FTFGCNLDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPTFDGFSYCLPLQKSE 160
Query: 273 --FGSDGTGRISFGDKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFS---- 324
F S TG S G + + R+ + + + +SV G + S
Sbjct: 161 RGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSR 220
Query: 325 --AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRS 373
+FDSG+ +Y+ D A + +S+ L R + + CY +RS
Sbjct: 221 KGVVFDSGSELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRS 269
>gi|195625122|gb|ACG34391.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 471
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 90/283 (31%), Positives = 131/283 (46%), Gaps = 38/283 (13%)
Query: 99 NSLGFLHY-TNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSP 157
S+G +Y T + +G P+ S+ + +DTGS L WL C CV + G + D P
Sbjct: 127 TSVGVGNYVTQLGLGTPSTSYAMVVDTGSSLTWL--QCSPCVVSCHRQVGPLFD-----P 179
Query: 158 NTSSTSSKVPCNSTLC-ELQKQC--PSAGSN---CPYQVRYLSDGTMSTGFLVEDVLHLA 211
SST + V C+++ C ELQ PSA S C YQ Y D + S G L D +
Sbjct: 180 RASSTYASVRCSASQCDELQAATLNPSACSASNVCIYQASY-GDSSFSVGSLSTDTVSFG 238
Query: 212 TDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSM 271
+ S +GCG+ G F A GL GL +K S+ LA + SFS
Sbjct: 239 STRYPS------FYYGCGQDNEGLFGRSA---GLIGLARNKLSLLYQLAPS--LGYSFSY 287
Query: 272 CFGSDG-TGRISFGDKGSPGQ--GETPFSLRQTHPT-YNITITQVSVGGNAVNF---EFS 324
C + TG +S G + G TP + + Y IT++ +SVGG+ + E+S
Sbjct: 288 CLPTAASTGYLSIGPYNT-GHYYSYTPMASSSLDASLYFITLSGMSVGGSPLAVSPSEYS 346
Query: 325 A---IFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDL 363
+ I DSGT T L +T +S+ ++A +R + S L
Sbjct: 347 SLPTIIDSGTVITRLPTAVHTALSKAVAQAMAGAQRAPAFSIL 389
>gi|125533812|gb|EAY80360.1| hypothetical protein OsI_35532 [Oryza sativa Indica Group]
Length = 428
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 83/323 (25%), Positives = 133/323 (41%), Gaps = 55/323 (17%)
Query: 84 KTPLTFSAGNDTYRLNS----LGF-------LHYTNVSVGQPALSFIVALDTGSDLFWLP 132
+ PL N T RL++ +G+ L+ +V +G PA + IV +DTGS W+
Sbjct: 50 RIPLFRYISNKTSRLSTQAVQVGWDRGLQTSLYVISVGLGTPAKTQIVEIDTGSSTSWVF 109
Query: 133 CDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGS-----NCP 187
C+C C H + + + S+T +KV C +++C L P +CP
Sbjct: 110 CECDGC-H---------TNPRTFLQSRSTTCAKVSCGTSMCLLGGSDPHCQDSENYPDCP 159
Query: 188 YQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFG 247
++V Y DG+ S G L +D L + +K +FGC G+ G +GL G
Sbjct: 160 FRVSY-QDGSASYGILYQDTLTFSDVQKIPS-----FTFGCNLDSFGANEFGNV-DGLLG 212
Query: 248 LGMDKTSVPSILANQGLIPNSFSMC---------FGSDGTGRISFGDKGSPGQGE--TPF 296
+G SV L + FS C F S TG S G +
Sbjct: 213 MGAGPMSV---LKQSSPRFDGFSYCLPLQKSERGFFSKTTGYFSLGKVATRTDVRYTKMV 269
Query: 297 SLRQTHPTYNITITQVSVGGNAVNFEFS------AIFDSGTSFTYLNDPAYTQISETFNS 350
+ R+ + + + +SV G + S +FDSG+ +Y+ D A + +S+
Sbjct: 270 ARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDSGSELSYIPDRALSVLSQRIRE 329
Query: 351 LAKEKRETSTSDLPFEYCYVLRS 373
L R + + CY +RS
Sbjct: 330 LL--LRRGAAEEESERNCYDMRS 350
>gi|356525748|ref|XP_003531485.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 436
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 76/279 (27%), Positives = 117/279 (41%), Gaps = 40/279 (14%)
Query: 111 VGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCN 169
+G P + + DT SDL W+ C C +C D ++ P+ SST + + C+
Sbjct: 96 IGTPPVERLAIADTASDLIWVQCSPCETCFPQ---------DTPLFEPHKSSTFANLSCD 146
Query: 170 STLCELQK--QCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFG 227
S C CP G+ C Y Y DG+ + G L + +H + Q+ + I FG
Sbjct: 147 SQPCTSSNIYYCPLVGNLCLYTNTY-GDGSSTKGVLCTESIHFGS---QTVTFPKTI-FG 201
Query: 228 CGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDGTGRISFG 284
CG G+ GLG S+ S L +Q I + FS C F S T ++ FG
Sbjct: 202 CGSNNDFMHQISNKVTGIVGLGAGPLSLVSQLGDQ--IGHKFSYCLLPFTSTSTIKLKFG 259
Query: 285 -DKGSPGQG--ETPFSLRQTHPTYN-ITITQVSVGGNAVNFEFS------AIFDSGTSFT 334
D G G TP + +P+Y + + +++G + + I D GT T
Sbjct: 260 NDTTITGNGVVSTPLIIDPHYPSYYFLHLVGITIGQKMLQVRTTDHTNGNIIIDLGTVLT 319
Query: 335 YLNDPAYTQISETFNSLAKEKRETSTSD----LPFEYCY 369
YL Y F +L +E S + PF++C+
Sbjct: 320 YLEVNFY----HNFVTLLREALGISETKDDIPYPFDFCF 354
>gi|293335955|ref|NP_001168399.1| uncharacterized protein LOC100382168 precursor [Zea mays]
gi|223948009|gb|ACN28088.1| unknown [Zea mays]
gi|413922066|gb|AFW61998.1| hypothetical protein ZEAMMB73_694403 [Zea mays]
Length = 507
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 79/283 (27%), Positives = 118/283 (41%), Gaps = 52/283 (18%)
Query: 97 RLNSLGFLHYTNV--SVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFN 153
RL +L ++ ++ S G PA + V +DTGSDL W+ C C +C +
Sbjct: 138 RLQTLNYVTTISLGGSSGSPAANLTVIVDTGSDLTWVQCKPCSACYAQRDP--------- 188
Query: 154 IYSPNTSSTSSKVPCNSTLCE--------LQKQCPSAGS---NCPYQVRYLSDGTMSTGF 202
++ P S+T + V CN++ C C S G+ C Y + Y DG+ S G
Sbjct: 189 LFDPAGSATYAAVRCNASACADSLRAATGTPGSCGSTGAGSEKCYYALAY-GDGSFSRGV 247
Query: 203 LVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQ 262
L D + L S+ + FGCG G F GL GLG + S+ S A++
Sbjct: 248 LATDTVALG-----GASLGGFV-FGCGLSNRGLF---GGTAGLMGLGRTELSLVSQTASR 298
Query: 263 GLIPNSFSMCF----GSDGTGRISFG---DKGSPGQGETPFSLRQT------HPTYNITI 309
FS C D +G +S G D S + TP + + P Y + +
Sbjct: 299 --YGGVFSYCLPAATSGDASGSLSLGGGDDAASSYRNTTPVAYTRMIADPAQPPFYFLNV 356
Query: 310 TQVSVGGNAVNFE----FSAIFDSGTSFTYLNDPAYTQISETF 348
T +VGG A+ + + + DSGT T L Y + F
Sbjct: 357 TGAAVGGTALAAQGLGASNVLIDSGTVITRLAPSVYRAVRAEF 399
>gi|297810815|ref|XP_002873291.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
lyrata]
gi|297319128|gb|EFH49550.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
lyrata]
Length = 439
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 72/284 (25%), Positives = 115/284 (40%), Gaps = 42/284 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ V +G PA ++A+DT SD+ W+PC CV C +SP S++
Sbjct: 99 YIVKVLIGTPAQPLLLAMDTSSDVAWIPCSGCVGCPSN-----------TAFSPAKSTSF 147
Query: 164 SKVPCNSTLCELQKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
V C++ C+ Q P+ G+ C + + Y S + L +D + LA D ++
Sbjct: 148 KNVSCSAPQCK-QVPNPACGARACSFNLTYGSSSIAAN--LSQDTIRLAADPIKA----- 199
Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS----DGT 278
+FGC G G P LG+ + + + Q + ++FS C S +
Sbjct: 200 -FTFGCVNKVAG---GGTIPPPQGLLGLGRGPLSLMSQAQSVYKSTFSYCLPSFRSLTFS 255
Query: 279 GRISFGDKGSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEFSA----------- 325
G + G P + + LR + Y + + + VG V+ +A
Sbjct: 256 GSLRLGPTSQPQRVKYTQLLRNPRRSSLYYVNLVAIRVGRKVVDLPPAAIAFNPSTGAGT 315
Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
IFDSGT +T L P Y + F K TS F+ CY
Sbjct: 316 IFDSGTVYTRLAKPVYEAVRNEFRKRVKPPTAVVTSLGGFDTCY 359
>gi|449516339|ref|XP_004165204.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 456
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 83/293 (28%), Positives = 126/293 (43%), Gaps = 59/293 (20%)
Query: 102 GFLHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTS 160
GFL N+S+G P ++ +V +DTGS L W+ C C++C S + P S
Sbjct: 103 GFL--VNLSIGSPPVTQLVVVDTGSSLLWVQCLPCINCFQQSTS---------WFDPLKS 151
Query: 161 STSSKVPC--------NSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLAT 212
+ + C N C Q Y++RYL G S G L ++ L T
Sbjct: 152 VSFKTLGCGFPGYNYINGYKCNRFNQ-------AEYKLRYLG-GDSSQGILAKESLLFET 203
Query: 213 -DEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSI-LANQGLIPNSFS 270
DE + K S I+FGCG + + D A NG+FGLG + P I +A Q + N FS
Sbjct: 204 LDEGKIKK--SNITFGCGHMNIKTNNDDAY-NGVFGLG----AYPHITMATQ--LGNKFS 254
Query: 271 MCFGSDGT-----GRISFGDKGSPGQGE-TPFSLRQTHPTYNITITQVSVGGNAVNFEFS 324
C G + G +GS +G+ TP + H Y +T+ +SVG + + +
Sbjct: 255 YCIGDINNPLYTHNHLVLG-QGSYIEGDSTPLQIHFGH--YYVTLQSISVGSKTLKIDPN 311
Query: 325 A-----------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFE 366
A + DSG ++T L + + + + L K E + FE
Sbjct: 312 AFKISSDGSGGVLIDSGMTYTKLANGGFELLYDEIVDLMKGLLERIPTQRKFE 364
>gi|242059211|ref|XP_002458751.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
gi|241930726|gb|EES03871.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
Length = 444
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 83/315 (26%), Positives = 122/315 (38%), Gaps = 57/315 (18%)
Query: 64 AHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLH----YTNVSVGQPALSFI 119
A + R F LR R + A + P + L F H +++VG P +
Sbjct: 30 AAKPRAFPLRARQVPAGALPRPP------------SKLRFHHNVSLTVSLAVGTPPQNVT 77
Query: 120 VALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQ--- 176
+ LDTGS+L WL C + G ++ + P S+T + VPC ST C +
Sbjct: 78 MVLDTGSELSWL--LCATGRQGSAAAGAAAAMGESFRPRASATFAAVPCGSTQCSSRDLP 135
Query: 177 --KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTG 234
C A C + Y +DG+ S G L DV + ++ R +FGC
Sbjct: 136 APPSCDGASRQCHVSLSY-ADGSASDGALATDVFAVG------EAPPLRSAFGCMSTAYD 188
Query: 235 SFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGTGRISFGDKGSP--GQ 291
S DG A GL G+ S + + + FS C D G + G P
Sbjct: 189 SSPDGVATAGLLGMNRGTLSFVTQASTR-----RFSYCISDRDDAGVLLLGHSDLPFLPL 243
Query: 292 GETPFSLRQTHP-------TYNITITQVSVGGNAVNFEFSAI-----------FDSGTSF 333
TP + T P Y++ + + VGG A+ S + DSGT F
Sbjct: 244 NYTPL-YQPTLPLPYFDRVAYSVQLLGIRVGGKALPIPASVLAPDHTGAGQTMVDSGTQF 302
Query: 334 TYLNDPAYTQISETF 348
T+L AY+ + F
Sbjct: 303 TFLLGDAYSALKAEF 317
>gi|449529638|ref|XP_004171805.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like, partial
[Cucumis sativus]
Length = 384
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 82/292 (28%), Positives = 121/292 (41%), Gaps = 52/292 (17%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++T + VG P + LDTGSD+ WL C C +C + +++P S +
Sbjct: 42 YFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDP---------VFNPVKSGSF 92
Query: 164 SKVPCNSTLCELQKQCPSAGSN----CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
+KV C + LC ++ S G N C YQV Y DG+ +TG V + L + +
Sbjct: 93 AKVLCRTPLC---RRLESPGCNQRQTCLYQVSY-GDGSYTTGEFVTETLTFRRTKVE--- 145
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GS 275
+++ GCG G F+ A GL G+ S NQ FS C S
Sbjct: 146 ---QVALGCGHDNEGLFVGAAGLLGLGRGGLSFPSQAGRTFNQ-----KFSYCLVDRSAS 197
Query: 276 DGTGRISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGGNAV------NFEFS- 324
+ FG+ F+ T+P Y + + +SVGG V +F+
Sbjct: 198 SKPSSVVFGNSAVSRTAR--FTPLLTNPRLDTFYYVELLGISVGGTPVSGITASHFKLDR 255
Query: 325 -----AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVL 371
I D GTS T LN PAY + + F + A + L F+ CY L
Sbjct: 256 TGNGGVIIDCGTSVTRLNKPAYIALRDAFRAGASSLKSAPEFSL-FDTCYDL 306
>gi|297720195|ref|NP_001172459.1| Os01g0608366 [Oryza sativa Japonica Group]
gi|53792202|dbj|BAD52835.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
Group]
gi|255673454|dbj|BAH91189.1| Os01g0608366 [Oryza sativa Japonica Group]
Length = 452
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 80/273 (29%), Positives = 119/273 (43%), Gaps = 36/273 (13%)
Query: 99 NSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSP 157
+SL L Y +V +G PA++ V +DTGSD+ W+ C+ ++ +G + D P
Sbjct: 101 SSLDTLEYVISVGLGSPAVTQRVVIDTGSDVSWVQCEPCPAPSPCHAHAGALFD-----P 155
Query: 158 NTSSTSSKVPCNSTLCEL-----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLAT 212
SST + C++ C + A S C Y V+Y DG+ +TG DVL L+
Sbjct: 156 AASSTYAAFNCSAAACAQLGDSGEANGCDAKSRCQYIVKY-GDGSNTTGTYSSDVLTLSG 214
Query: 213 DEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTS-VPSILANQGLIPNSFSM 271
+ V FGC + G+ +D +GL GLG D S V A G SF
Sbjct: 215 SD-----VVRGFQFGCSHAELGAGMDDKT-DGLIGLGGDAQSPVSQTAARYG---KSFFY 265
Query: 272 CFGSD--GTGRISFGDKGSPGQ------GETPFSLRQTHPTYNI-TITQVSVGGNAVN-- 320
C + +G ++ G S G TP + PTY + ++VGG +
Sbjct: 266 CLPATPASSGFLTLGAPASGGGGGASRFATTPMLRSKKVPTYYFAALEDIAVGGKKLGLS 325
Query: 321 ---FEFSAIFDSGTSFTYLNDPAYTQISETFNS 350
F ++ DSGT T L AY +S F +
Sbjct: 326 PSVFAAGSLVDSGTVITRLPPAAYAALSSAFRA 358
>gi|190896608|gb|ACE96817.1| aspartyl protease [Populus tremula]
Length = 339
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 85/310 (27%), Positives = 121/310 (39%), Gaps = 56/310 (18%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ V +G P + LDT +D W+PC C C + PN S+T
Sbjct: 45 YVVRVKLGTPGQQMFMVLDTSNDAAWVPCSGCTGC------------SSTTFLPNASTTL 92
Query: 164 SKVPCNSTLCELQK--QCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
+ C+ C + CP+ GS+ C + Y D +++ LV+D + LA D V
Sbjct: 93 GSLDCSEAQCSQVRGFSCPATGSSACLFNQSYGGDSSLA-ATLVQDAITLAND------V 145
Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDG--- 277
+FGC +G + P GL GLG S+ I + FS C S
Sbjct: 146 IPGFTFGCINAVSGGSI---PPQGLLGLGRGPISL--ISQAGAMYSGVFSYCLPSFKSYY 200
Query: 278 -TGRISFGDKGSPGQGETPFSLRQTH-PT-YNITITQVSVGGNAV-----------NFEF 323
+G + G G P T LR H P+ Y + +T VSVG V N
Sbjct: 201 FSGSLKLGPVGQPKSIRTTPLLRNPHRPSLYYVNLTGVSVGRIKVPIPSEQLVFDPNTGA 260
Query: 324 SAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRS-------FLH 376
I DSGT T P Y I + F K+ +S F+ C+ + LH
Sbjct: 261 GTIIDSGTVITRFVQPVYFAIRDEFR---KQVNGPISSLGAFDTCFAETNEAEAPAVTLH 317
Query: 377 LQAL-VVLPF 385
+ L +VLP
Sbjct: 318 FEGLNLVLPM 327
>gi|224080963|ref|XP_002306246.1| predicted protein [Populus trichocarpa]
gi|222855695|gb|EEE93242.1| predicted protein [Populus trichocarpa]
Length = 382
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 80/298 (26%), Positives = 127/298 (42%), Gaps = 45/298 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ + VG P S + +D+GSD+ W+ C C C H + ++ P S++
Sbjct: 43 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCKPCTQCYHQTDP---------LFDPADSASF 93
Query: 164 SKVPCNSTLCELQKQCPSAGSN---CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
V C+S +C+ Q +AG N C Y+V Y DG+ + G L + L L ++V
Sbjct: 94 MGVSCSSAVCD---QVDNAGCNSGRCRYEVSY-GDGSSTKGTLALETLTLG------RTV 143
Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS---DG 277
++ GCG + G F+ A GL G M + V + +G N+FS C S +
Sbjct: 144 VQNVAIGCGHMNQGMFVGAAGLLGLGGGSM--SFVGQLSRERG---NAFSYCLVSRVTNS 198
Query: 278 TGRISFGDKGSP-GQGETPFSLRQTHPT-YNITITQVSVGGNAVN-----FEFS------ 324
G + FG + P G P P+ Y I ++ + VG V FE +
Sbjct: 199 NGFLEFGSEAMPVGAAWIPLIRNPHSPSYYYIGLSGLGVGDMKVPISEDIFELTELGNGG 258
Query: 325 AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRSFLHLQALVV 382
+ D+GT+ T AY + F S + F+ CY L FL ++ V
Sbjct: 259 VVMDTGTAVTRFPTVAYEAFRDAFIDQTGNLPRASGVSI-FDTCYNLFGFLSVRVPTV 315
>gi|413951280|gb|AFW83929.1| hypothetical protein ZEAMMB73_279135 [Zea mays]
Length = 451
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 77/286 (26%), Positives = 114/286 (39%), Gaps = 45/286 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+ +G PA + +VA+D +D W+PC + S + P SST
Sbjct: 107 YVARARLGTPAQALLVAIDPSNDAAWVPCAACAGCARAPS----------FDPTRSSTYR 156
Query: 165 KVPCNSTLCELQKQCPSA----GSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
V C + C Q PS GS+C + + Y + + L +D L L D +
Sbjct: 157 PVRCGAPQCS-QAPAPSCPGGLGSSCAFNLSYAA--STFQALLGQDALALHDDVDAVAA- 212
Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GSD 276
+FGC V TG + P GL G G S PS + + + FS C S+
Sbjct: 213 ---YTFGCLHVVTGGSVP---PQGLVGFGRGPLSFPS--QTKDVYGSVFSYCLPSYKSSN 264
Query: 277 GTGRISFGDKGSPGQGETPFSLRQTH-PT-YNITITQVSVGGNAVNFEFSA--------- 325
+G + G G P + +T L H P+ Y + + + VGG V SA
Sbjct: 265 FSGTLRLGPAGQPKRIKTTPLLSNPHRPSLYYVNMVGIRVGGRPVPVPASALAFDPTSGR 324
Query: 326 --IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
I D+GT FT L+ P Y + + F S + F+ CY
Sbjct: 325 GTIVDAGTMFTRLSAPVYAAVRDVFRSRVRAPVAGPLGG--FDTCY 368
>gi|356569916|ref|XP_003553140.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 560
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 80/276 (28%), Positives = 113/276 (40%), Gaps = 46/276 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ +V VG P F + LDTGSDL W+ C C +C + Y P SS+
Sbjct: 195 YFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCYACFEQ---------NGPYYDPKDSSSF 245
Query: 164 SKVPCNSTLCEL------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVED-VLHLATDE-K 215
+ C+ C+L + C +CPY Y + F +E ++L T E K
Sbjct: 246 KNITCHDPRCQLVSSPDPPQPCKGETQSCPYFYWYGDSSNTTGDFALETFTVNLTTPEGK 305
Query: 216 QSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-- 273
+ + FGCG G F A L GLG S + L Q L +SFS C
Sbjct: 306 PELKIVENVMFGCGHWNRGLFHGAAG---LLGLGRGPLSFATQL--QSLYGHSFSYCLVD 360
Query: 274 ---GSDGTGRISFGDKGS----PGQGETPFSLRQTHPT---YNITITQVSVGGNAVN--- 320
S + ++ FG+ P T F + +P Y + I + VGG +
Sbjct: 361 RNSNSSVSSKLIFGEDKELLSHPNLNFTSFVGGKENPVDTFYYVLIKSIMVGGEVLKIPE 420
Query: 321 --FEFSA------IFDSGTSFTYLNDPAYTQISETF 348
+ SA I DSGT+ TY +PAY I E F
Sbjct: 421 ETWHLSAQGGGGTIIDSGTTLTYFAEPAYEIIKEAF 456
>gi|357127503|ref|XP_003565419.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
distachyon]
Length = 486
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 75/297 (25%), Positives = 119/297 (40%), Gaps = 43/297 (14%)
Query: 103 FLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIY-SPNTSS 161
F + + VG P + + DTGSDL W+ C G ++ + ++Y P+ SS
Sbjct: 108 FEYLMAIEVGTPPVRVLAIADTGSDLVWVKC------KGKDNDNNSTAPPSVYFVPSASS 161
Query: 162 TSSKVPCNSTLCEL---QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
T +V C++ C C GS C Y Y DG+ ++G L + +T SK
Sbjct: 162 TYGRVGCDTKACRALSSAASCSPDGS-CEYLYSY-GDGSRASGQLSTETFTFSTIADSSK 219
Query: 219 SVD----------------SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQ 262
+ +++ FGC TG+F +GL GLG S+ S L
Sbjct: 220 TNSHGNNNNNSSSHGQVEIAKLDFGCSTTTTGTF----RADGLVGLGGGPVSLASQLGAT 275
Query: 263 GLIPNSFSMCFG----SDGTGRISFGDKG---SPGQGETPFSLRQTHPTYNITITQVSVG 315
+ FS C ++ + ++FG + PG TP + Y I + ++V
Sbjct: 276 TSLGRKFSYCLAPYANTNASSALNFGSRAVVSEPGAASTPLITGEVETYYTIALDSINVA 335
Query: 316 GN---AVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
G + I DSGT+ TYL+ T + + K R S + + CY
Sbjct: 336 GTKRPTTAAQAHIIVDSGTTLTYLDSALLTPLVKDLTRRIKLPRAESPEKI-LDLCY 391
>gi|242079447|ref|XP_002444492.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
gi|241940842|gb|EES13987.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
Length = 441
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 76/291 (26%), Positives = 116/291 (39%), Gaps = 47/291 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ ++++G P L + +DTGSDL W C C+ C + Q + + S+T
Sbjct: 89 YLVDLAIGTPPLYYTAIMDTGSDLIWTQCAPCLLC-------AAQPTPY--FDVKRSATY 139
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
+PC S+ C C YQ Y D + G L + +K +
Sbjct: 140 RALPCRSSRCAALSSPSCFKKMCVYQ-YYYGDTASTAGVLANETFTFGA-ASSTKVRAAN 197
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDGTGR 280
ISFGCG + G + +G+ G G S+ S L P+ FS C + S R
Sbjct: 198 ISFGCGSLNAGELANS---SGMVGFGRGPLSLVSQLG-----PSRFSYCLTSYLSPTPSR 249
Query: 281 ISFG----------DKGSPGQGETPFSLRQTHPT-YNITITQVSVGGN---------AVN 320
+ FG GSP Q TPF + P Y +++ +S+G A+N
Sbjct: 250 LYFGVFANLNSTNTSSGSPVQ-STPFVINPALPNMYFLSVKGISLGTKRLPIDPLVFAIN 308
Query: 321 FEFS--AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
+ + I DSGTS T+L AY + S T D+ + C+
Sbjct: 309 DDGTGGVIIDSGTSITWLQQDAYEAVRRGLASTIPLPAMNDT-DIGLDTCF 358
>gi|30678047|ref|NP_565298.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|30102688|gb|AAP21262.1| At2g03200 [Arabidopsis thaliana]
gi|110736021|dbj|BAE99983.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330250580|gb|AEC05674.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 461
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 75/270 (27%), Positives = 110/270 (40%), Gaps = 45/270 (16%)
Query: 108 NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
+S+G PA+ + +DTGSDL W C C C I+ P SS+ SKV
Sbjct: 110 ELSIGNPAVKYSAIVDTGSDLIWTQCKPCTECFDQPTP---------IFDPEKSSSYSKV 160
Query: 167 PCNSTLCEL--QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRI 224
C+S LC + C C Y Y D + + G L + + ++ S I
Sbjct: 161 GCSSGLCNALPRSNCNEDKDACEYLYTY-GDYSSTRGLLATETFTF-----EDENSISGI 214
Query: 225 SFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGL---------IPNSFSMCFG 274
FGCG G DG + +GL GLG S+ S L S S+ G
Sbjct: 215 GFGCGVENEG---DGFSQGSGLVGLGRGPLSLISQLKETKFSYCLTSIEDSEASSSLFIG 271
Query: 275 SDGTGRISFGDKGSPGQGETPFSLRQT--HPT-YNITITQVSVGGNAVNFEFSA------ 325
S +G ++ G+ SL + P+ Y + + ++VG ++ E S
Sbjct: 272 SLASGIVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELAED 331
Query: 326 -----IFDSGTSFTYLNDPAYTQISETFNS 350
I DSGT+ TYL + A+ + E F S
Sbjct: 332 GTGGMIIDSGTTITYLEETAFKVLKEEFTS 361
>gi|125592062|gb|EAZ32412.1| hypothetical protein OsJ_16623 [Oryza sativa Japonica Group]
Length = 473
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 76/301 (25%), Positives = 116/301 (38%), Gaps = 46/301 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ V VG P + +D+GSD+ W+ C C C + ++ P SS+
Sbjct: 130 YFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDP---------LFDPAASSSF 180
Query: 164 SKVPCNSTLCE----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
S V C S +C C Y V Y DG+ + G L + L L Q
Sbjct: 181 SGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTY-GDGSYTKGELALETLTLGGTAVQG-- 237
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSD 276
++ GCG +G F+ A GL GLG S+ L G FS C G+
Sbjct: 238 ----VAIGCGHRNSGLFVGAA---GLLGLGWGAMSLVGQLG--GAAGGVFSYCLASRGAG 288
Query: 277 GTGRISFGDKGSPGQGETPFSL---RQTHPTYNITITQVSVGGNAVNFEFS--------- 324
G G + G + G L Q Y + +T + VGG + + S
Sbjct: 289 GAGSLVLGRTEAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDSLFQLTEDGA 348
Query: 325 --AIFDSGTSFTYLNDPAYTQISETFN-SLAKEKRETSTSDLPFEYCYVLRSFLHLQALV 381
+ D+GT+ T L AY + F+ ++ R + S L + CY L + ++
Sbjct: 349 GGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLL--DTCYDLSGYASVRVPT 406
Query: 382 V 382
V
Sbjct: 407 V 407
>gi|224093400|ref|XP_002309912.1| predicted protein [Populus trichocarpa]
gi|222852815|gb|EEE90362.1| predicted protein [Populus trichocarpa]
Length = 382
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 75/295 (25%), Positives = 118/295 (40%), Gaps = 39/295 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ + +G P S + +D+GSD+ W+ C C C H + ++ P S++
Sbjct: 43 YFVRIGLGSPPRSQYMVIDSGSDIVWVQCKPCTQCYHQTDP---------LFDPADSASF 93
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
V C+S +C+ + C Y+V Y DG+ + G L + L ++V
Sbjct: 94 MGVSCSSAVCDRVENAGCNSGRCRYEVSY-GDGSYTKGTLALETLTFG------RTVVRN 146
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT---GR 280
++ GCG G F+ A GL G M S G N+FS C S GT G
Sbjct: 147 VAIGCGHSNRGMFVGAAGLLGLGGGSMSFMGQLS-----GQTGNAFSYCLVSRGTNTNGF 201
Query: 281 ISFGDKGSP-GQGETPFSLRQTHPT-YNITITQVSVGGNAVN-----FEFS------AIF 327
+ FG + P G P P+ Y I + + VG V F+ + +
Sbjct: 202 LEFGSEAMPVGAAWIPLVRNPRAPSFYYIRLLGLGVGDTRVPVSEDVFQLNELGSGGVVM 261
Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRSFLHLQALVV 382
D+GT+ T AY F + S + F+ CY L FL ++ V
Sbjct: 262 DTGTAVTRFPTVAYEAFRNAFIEQTQNLPRASGVSI-FDTCYNLFGFLSVRVPTV 315
>gi|302821814|ref|XP_002992568.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
gi|300139637|gb|EFJ06374.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
Length = 368
Score = 68.6 bits (166), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 86/291 (29%), Positives = 121/291 (41%), Gaps = 53/291 (18%)
Query: 113 QPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTL 172
Q LS I+ DTGS+ + C S S V D P S + +VPC S L
Sbjct: 9 QKNLSAII--DTGSEAVLVQC---------GSRSRPVFD-----PAASQSYRQVPCISQL 52
Query: 173 C-ELQKQ--------CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
C +Q+Q C ++ + C Y + Y D STG +DV+ L + S++V R
Sbjct: 53 CLAVQQQTSNGSSQPCVNSSAACTYSLSY-GDSRNSTGDFSQDVIFLNSTNSSSQAVQFR 111
Query: 224 -ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD-----G 277
++FGC G FL G+ G S+PS L ++ L + FS CF S
Sbjct: 112 DVAFGCAHSPQG-FLVDLGSLGIVGFNRGNLSLPSQLKDR-LGGSKFSYCFPSQPWQPRA 169
Query: 278 TGRISFGDKG--SPGQGETPFSLRQTHPT----YNITITQVSVGGNAVNFEFSA------ 325
TG I GD G TP P Y + +T +SV G + SA
Sbjct: 170 TGVIFLGDSGLSKSKVSYTPLLDNPVTPARSQLYYVGLTSISVDGKTLAIPESAFKLDPS 229
Query: 326 ------IFDSGTSFTYLNDPAYTQISETFNSLAKEK-RETSTSDLPFEYCY 369
+ DSGT+FT + D AYT F + + R+ + F+ CY
Sbjct: 230 TGDGGTVLDSGTTFTRVVDDAYTAFRNAFAASNRSGLRKKVGAAAGFDDCY 280
>gi|449464952|ref|XP_004150193.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
gi|449526850|ref|XP_004170426.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 476
Score = 68.6 bits (166), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 76/296 (25%), Positives = 120/296 (40%), Gaps = 41/296 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ + VG P S V +D+GSD+ W+ C C C + ++ P S+T
Sbjct: 137 YFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSDP---------VFDPAGSATY 187
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
+ + C+S++C+ C Y+V Y DG+ + G L + L + +
Sbjct: 188 AGISCDSSVCDRLDNAGCNDGRCRYEVSY-GDGSYTRGTLALETLTFG------RVLIRN 240
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGTGR 280
I+ GCG + G F+ A GL G M S L Q +FS C G++ TG
Sbjct: 241 IAIGCGHMNRGMFIGAAGLLGLGGGAM---SFVGQLGGQ--TGGAFSYCLVSRGTESTGT 295
Query: 281 ISFGDKGSP-GQGETPFSLRQTHPTY------NITITQVSVGGNAVNFEFS------AIF 327
+ FG P G P P++ + + + V FE + +
Sbjct: 296 LEFGRGAMPVGAAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVM 355
Query: 328 DSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEYCYVLRSFLHLQALVV 382
D+GT+ T L PAY +TF A R S F+ CY L F+ ++ V
Sbjct: 356 DTGTAVTRLPAPAYEAFRDTFIGQTANLPRSDRVSI--FDTCYNLNGFVSVRVPTV 409
>gi|224130548|ref|XP_002320868.1| predicted protein [Populus trichocarpa]
gi|222861641|gb|EEE99183.1| predicted protein [Populus trichocarpa]
Length = 488
Score = 68.6 bits (166), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 85/305 (27%), Positives = 124/305 (40%), Gaps = 48/305 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++T + VG PA + LDTGSD+ W+ C C+ C + ++ P S +
Sbjct: 145 YFTRLGVGTPARYVYMVLDTGSDIVWIQCAPCIKCYSQTDP---------VFDPTKSRSF 195
Query: 164 SKVPCNSTLCELQKQ--CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
+ +PC S LC C + C YQV Y DG+ + G + L
Sbjct: 196 ANIPCGSPLCRRLDYPGCSTKKQICLYQVSY-GDGSFTVGEFSTETLTFRGTRV------ 248
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG----SDG 277
R+ GCG G F+ A GLG + S PS + + + FS C G S
Sbjct: 249 GRVVLGCGHDNEGLFVGAAGLL---GLGRGRLSFPSQIGRR--FNSKFSYCLGDRSASSR 303
Query: 278 TGRISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGGNAVN------FEFSA-- 325
I FGD S T F+ ++P Y + + +SVGG V+ F+ +
Sbjct: 304 PSSIVFGD--SAISRTTRFTPLLSNPKLDTFYYVELLGISVGGTRVSGISASLFKLDSTG 361
Query: 326 ----IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRSFLHLQA-L 380
I DSGTS T L AY + + F A + L F+ C+ L ++
Sbjct: 362 NGGVIIDSGTSVTRLTRAAYVALRDAFLVGASNLKRAPEFSL-FDTCFDLSGKTEVKVPT 420
Query: 381 VVLPF 385
VVL F
Sbjct: 421 VVLHF 425
>gi|358345197|ref|XP_003636668.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355502603|gb|AES83806.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 294
Score = 68.6 bits (166), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 73/237 (30%), Positives = 103/237 (43%), Gaps = 33/237 (13%)
Query: 105 HY---TNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTS 160
HY +S+G P + DTGSDL WL C C +C LN ++ +S
Sbjct: 56 HYDYLMELSIGTPPVKIYAQADTGSDLIWLQCIPCTNCYKQLNP---------MFDSQSS 106
Query: 161 STSSKVPCNSTLCE--LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
ST S + C S C C NC Y Y+ DG+ + G L ++ L L + +
Sbjct: 107 STFSNIACGSESCSKLYSTSCSPDQINCKYNYSYV-DGSETQGVLAQETLTLTSTTGEPV 165
Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT 278
+ I FGCG G+F D G+ GLG S+ S + + L N FS C T
Sbjct: 166 AFKGVI-FGCGHNNNGAFNDKEM--GIIGLGRGPLSLVSQIGSS-LGGNMFSQCLVPFNT 221
Query: 279 G-----RISFGDKGSPGQGE----TPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA 325
+SFG KGS G TP + T+ + Y +T+ +SV +N F+A
Sbjct: 222 NPSISSPMSFG-KGSEVLGNGVVSTPLVSKTTYQSFYFVTLLGISV--EDINLPFNA 275
>gi|424513106|emb|CCO66690.1| predicted protein [Bathycoccus prasinos]
Length = 802
Score = 68.6 bits (166), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 72/267 (26%), Positives = 111/267 (41%), Gaps = 52/267 (19%)
Query: 106 YTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSC-VHGLNSSSGQVIDFNIYSPNTSSTS 163
Y V +G P F V +DTGS ++ C C SC HG N+ Y SS+
Sbjct: 139 YATVLIGTPGHQFEVIVDTGSTYTFVTCYPCASCGQHGSNAP---------YDAAKSSSY 189
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
+VPC S + C ++G C Y ++ D + G +V DV+ + R
Sbjct: 190 ERVPCGSGC--IFGACRASGL-CEYDEKFSEDSQVG-GHVVSDVIDVG-----GSLGTPR 240
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNS----FSMCFGS-DGT 278
I FGC ++T + L NG+ LG + + L + P S F +C GS +G
Sbjct: 241 IHFGCNSLET-NMLKTQKANGMIALGRAEAGLHRQLKKKAYPPGSYDGTFGLCLGSFEGG 299
Query: 279 GRISFGDKGSPGQGETPFSLRQTHPT------------YNITITQVSVGG---------- 316
G +S G P Q F R+TH + YN+ + ++ V
Sbjct: 300 GVLSLGK--LPEQHYANFVTRKTHTSTVKLVKGSKSQYYNVEVHRMFVRNTELKKPSGAE 357
Query: 317 --NAVNFEFSAIFDSGTSFTYLNDPAY 341
A + + DSGT++TYL++ +
Sbjct: 358 LMEAFRAGYGTVLDSGTTYTYLHEDVF 384
>gi|356527532|ref|XP_003532363.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 429
Score = 68.6 bits (166), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 86/291 (29%), Positives = 130/291 (44%), Gaps = 53/291 (18%)
Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTS 160
+GF + T +++GQPA + + +DTGSDL WL CD C H + +Y P
Sbjct: 66 VGFYNVT-LNIGQPARPYFLDVDTGSDLTWLQCD-APCTHCSETP------HPLYRP--- 114
Query: 161 STSSKVPCNSTLC-ELQKQCPSAGSNCP------YQVRYLSDGTMSTGFLVEDVLHLA-T 212
++ VPC LC LQ P+ NC Y++ Y +D + G L+ DV L T
Sbjct: 115 -SNDFVPCRDPLCASLQ---PTEDYNCEHPDQCDYEINY-ADQYSTFGVLLNDVYLLNFT 169
Query: 213 DEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC 272
+ Q K R++ GCG Q S +GL GLG K S+ S L +QGL+ N C
Sbjct: 170 NGVQLKV---RMALGCGYDQVFSPSSYHPLDGLLGLGRGKASLISQLNSQGLVRNVIGHC 226
Query: 273 FGSDGTG-----------RISFGDKGSPGQGETPF-SLRQTHPTYNITITQVSVGGNAVN 320
+ G G R+++ TP S+ H Y+ ++ GG
Sbjct: 227 LSAQGGGYIFFGNAYDSARVTW----------TPISSVDSKH--YSAGPAELVFGGRKTG 274
Query: 321 F-EFSAIFDSGTSFTYLNDPAYTQ-ISETFNSLAKEKRETSTSDLPFEYCY 369
+A+FD+G+S+TY N AY +S L+ + + + D C+
Sbjct: 275 VGSLTAVFDTGSSYTYFNSHAYQALLSWLKKELSGKPLKVAPDDQTLPLCW 325
>gi|297842769|ref|XP_002889266.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297335107|gb|EFH65525.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 489
Score = 68.6 bits (166), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 88/311 (28%), Positives = 132/311 (42%), Gaps = 56/311 (18%)
Query: 66 RDRYFRLRGRGLAA----QGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVA 121
R + +LR + + + Q +T + ++G +L +L ++ V +G +S IV
Sbjct: 100 RVQSLQLRIKAMTSSTTEQSVSETQIPLTSG---IKLETLNYI--VTVELGGKNMSLIV- 153
Query: 122 LDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE-----L 175
DTGSDL W+ C C SC + +Y P+ SS+ V CNS+ C+
Sbjct: 154 -DTGSDLTWVQCQPCRSCYNQQGP---------LYDPSVSSSYKTVFCNSSTCQDLVAAT 203
Query: 176 QKQCPSAGSN------CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCG 229
P G N C Y V Y DG+ + G L + + L + ++ + FGCG
Sbjct: 204 GNSGPCGGFNGVVKTTCEYVVSY-GDGSYTRGDLASESIVLGDTKLEN------LVFGCG 256
Query: 230 RVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS--DG-TGRISFGD- 285
R G F +GL GLG ++SV + FS C S DG +G +SFG+
Sbjct: 257 RNNKGLF---GGASGLMGLG--RSSVSLVSQTLKTFNGVFSYCLPSLEDGASGTLSFGND 311
Query: 286 ----KGSPGQGETPFSLR-QTHPTYNITITQVSVGG---NAVNFEFSAIFDSGTSFTYLN 337
K S TP Q Y + +T S+GG ++F + DSGT T L
Sbjct: 312 FSVYKNSTSVFYTPLVQNPQLRSFYILNLTGASIGGVELKTLSFGRGILIDSGTVITRLP 371
Query: 338 DPAYTQISETF 348
Y + F
Sbjct: 372 PSIYKAVKTEF 382
>gi|15222611|ref|NP_173922.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12321511|gb|AAG50814.1|AC079281_16 hypothetical protein [Arabidopsis thaliana]
gi|20466516|gb|AAM20575.1| unknown protein [Arabidopsis thaliana]
gi|23198172|gb|AAN15613.1| unknown protein [Arabidopsis thaliana]
gi|110736960|dbj|BAF00436.1| hypothetical protein [Arabidopsis thaliana]
gi|332192515|gb|AEE30636.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 483
Score = 68.6 bits (166), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 86/302 (28%), Positives = 129/302 (42%), Gaps = 48/302 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++T V +G+PA + LDTGSD+ WL C C C H I+ P++SS+
Sbjct: 148 YFTRVGIGKPAREVYMVLDTGSDVNWLQCTPCADCYHQTEP---------IFEPSSSSSY 198
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
+ C++ C + + C Y+V Y DG+ + G + L + + Q+
Sbjct: 199 EPLSCDTPQCNALEVSECRNATCLYEVSY-GDGSYTVGDFATETLTIGSTLVQN------ 251
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGTGR 280
++ GCG G F+ A L GLG ++PS L SFS C SD
Sbjct: 252 VAVGCGHSNEGLFVGAAG---LLGLGGGLLALPSQLNT-----TSFSYCLVDRDSDSAST 303
Query: 281 ISFGDKGSPGQGETPFSLR--QTHPTYNITITQVSVGGNAV-----NFEFSA------IF 327
+ FG SP P LR Q Y + +T +SVGG + +FE I
Sbjct: 304 VDFGTSLSPDAVVAPL-LRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGIII 362
Query: 328 DSGTSFTYLNDPAYTQISETF--NSLAKEKRETSTSDLPFEYCYVLRSFLHLQA-LVVLP 384
DSGT+ T L Y + ++F +L EK + F+ CY L + ++ V
Sbjct: 363 DSGTAVTRLQTEIYNSLRDSFVKGTLDLEK---AAGVAMFDTCYNLSAKTTVEVPTVAFH 419
Query: 385 FP 386
FP
Sbjct: 420 FP 421
>gi|20197342|gb|AAC34482.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
Length = 353
Score = 68.6 bits (166), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 75/270 (27%), Positives = 110/270 (40%), Gaps = 45/270 (16%)
Query: 108 NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
+S+G PA+ + +DTGSDL W C C C I+ P SS+ SKV
Sbjct: 2 ELSIGNPAVKYSAIVDTGSDLIWTQCKPCTECFDQPTP---------IFDPEKSSSYSKV 52
Query: 167 PCNSTLCEL--QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRI 224
C+S LC + C C Y Y D + + G L + + ++ S I
Sbjct: 53 GCSSGLCNALPRSNCNEDKDACEYLYTY-GDYSSTRGLLATETFTF-----EDENSISGI 106
Query: 225 SFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGL---------IPNSFSMCFG 274
FGCG G DG + +GL GLG S+ S L S S+ G
Sbjct: 107 GFGCGVENEG---DGFSQGSGLVGLGRGPLSLISQLKETKFSYCLTSIEDSEASSSLFIG 163
Query: 275 SDGTGRISFGDKGSPGQGETPFSLRQT--HPT-YNITITQVSVGGNAVNFEFSA------ 325
S +G ++ G+ SL + P+ Y + + ++VG ++ E S
Sbjct: 164 SLASGIVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELAED 223
Query: 326 -----IFDSGTSFTYLNDPAYTQISETFNS 350
I DSGT+ TYL + A+ + E F S
Sbjct: 224 GTGGMIIDSGTTITYLEETAFKVLKEEFTS 253
>gi|297794789|ref|XP_002865279.1| hypothetical protein ARALYDRAFT_494467 [Arabidopsis lyrata subsp.
lyrata]
gi|297311114|gb|EFH41538.1| hypothetical protein ARALYDRAFT_494467 [Arabidopsis lyrata subsp.
lyrata]
Length = 419
Score = 68.2 bits (165), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 85/337 (25%), Positives = 137/337 (40%), Gaps = 73/337 (21%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-----DCVSCVHGLNSSSGQVIDFNIYSPNT 159
+ +++G P + V +DTGSDL W+PC DC+ C + S + +I+SP
Sbjct: 11 YLITLNIGTPPQAVQVYMDTGSDLTWVPCGNLSFDCIDCN---DLKSNNLKSSSIFSPLH 67
Query: 160 SSTSSKVPCNSTLC-ELQKQ------CPSAGSN------------CPYQVRYLSDGTMST 200
SS+S + C S+ C E+ C AG + CP +G + +
Sbjct: 68 SSSSFRASCASSFCAEIHSSDNPFDPCAIAGCSVSMLLKSTCIRPCPSFAYTYGEGGLVS 127
Query: 201 GFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILA 260
G L D+L T + R SFGC T ++ + P G+ G G S+PS L
Sbjct: 128 GILTRDILKARTRDV------PRFSFGC---VTSTYHE---PIGIAGFGRGLLSLPSQL- 174
Query: 261 NQGLIPNSFSMCF-------GSDGTGRISFGDKG-----SPGQGETPFSLRQTHP-TYNI 307
G + FS CF + + + G + TP +P +Y I
Sbjct: 175 --GFLEKGFSHCFLPFKFVNNPNISSPLILGASALSINLTDSLQFTPMLNTPVYPNSYYI 232
Query: 308 TITQVSVGGNAVNFEF-------------SAIFDSGTSFTYLNDPAYTQISETFNSLAKE 354
+ +++G N + + DSGT++T+L +P Y+Q+ S
Sbjct: 233 GLESITIGTNITPTQVPLTLRQFDSQGNGGMLVDSGTTYTHLPNPFYSQLLTILQSTITY 292
Query: 355 KRETST-SDLPFEYCYVL----RSFLHLQALVVLPFP 386
R T T S F+ CY + + L+ V++ FP
Sbjct: 293 PRATETESRTGFDLCYKVPCPNNNLTSLENDVMMVFP 329
>gi|242092902|ref|XP_002436941.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
gi|241915164|gb|EER88308.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
Length = 445
Score = 68.2 bits (165), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 77/269 (28%), Positives = 110/269 (40%), Gaps = 39/269 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFN--IYSPNTSST 162
+ VS G PA+ +V +DTGSDL WL C SSGQ ++ P+ SST
Sbjct: 112 YVATVSFGTPAVPQVVVIDTGSDLTWLQCK--------PCSSGQCSPQKDPLFDPSHSST 163
Query: 163 SSKVPCNSTLCELQKQ------CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
S VPC S C+ C S G C + + Y+ DGT + G +D L LA
Sbjct: 164 YSAVPCASGECKKLAADAYGSGC-SNGQPCGFAISYV-DGTSTVGVYGKDKLTLAPG--- 218
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD 276
++ FGCG ++ + + L Q FS C +
Sbjct: 219 --AIVKDFYFGCGHSKSSLPGLFDG-------LLGLGRLSESLGAQYGGGGGFSYCLPAV 269
Query: 277 GT--GRISFGDKGSP-GQGETPFSLRQTHPTYN-ITITQVSVGGNAVNFEFSA-----IF 327
+ G ++FG +P G TP PT++ +T+ ++VGG ++ SA I
Sbjct: 270 NSKPGFLAFGAGRNPSGFVFTPMGRVPGQPTFSTVTLAGITVGGKKLDLRPSAFSGGMIV 329
Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKR 356
DSGT T L Y + F K R
Sbjct: 330 DSGTVVTVLQSTVYRALRAAFREAMKAYR 358
>gi|326507654|dbj|BAK03220.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 442
Score = 68.2 bits (165), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 95/358 (26%), Positives = 143/358 (39%), Gaps = 54/358 (15%)
Query: 44 GILAVDDLPKKGSFAYYSALAHRDRYFRLR-GRGLAAQGNDKTPLTFSAGNDTYRLNSLG 102
G+ + P + + RD + R R LA+ G D+T T + G
Sbjct: 32 GLTRIHSNPDVSATEFVRDALRRDMHRHARFTRELASSG-DRT-----VAAPTRKDLPNG 85
Query: 103 FLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
+ +++G P LS+ DTGSDL W C C +GQ Y+P++S+T
Sbjct: 86 GEYIMTLAIGTPPLSYPAIADTGSDLIW--TQCAPCGSQCFKQAGQP-----YNPSSSTT 138
Query: 163 SSKVPCNSTL---CELQKQCPSAGSNCPYQVRYLSDGTMSTGFL--VEDVLHLATDEKQS 217
+PCNS++ L P G +C Y Y GT T + VE +T Q+
Sbjct: 139 FGVLPCNSSVSMCAALAGPSPPPGCSCMYNQTY---GTGWTAGIQSVETFTFGSTPADQT 195
Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---- 273
+ I+FGC + + +G+A GL GLG S+ S L FS C
Sbjct: 196 RV--PGIAFGCSNASSDDW-NGSA--GLVGLGRGSMSLVSQLGA-----GMFSYCLTPFQ 245
Query: 274 GSDGTGRISFGDKGS---PGQGETPF----SLRQTHPTYNITITQVSVGGNAVNFEFSA- 325
++ T + G + G TPF S Y + +T +S+G A++ +A
Sbjct: 246 DANSTSTLLLGPSAALNGTGVLTTPFVASPSKAPMSTYYYLNLTGISIGTTALSIPPNAF 305
Query: 326 ----------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRS 373
I DSGT+ T L D AY Q+ SL + + C+ L S
Sbjct: 306 ALRTDGTGGLIIDSGTTITSLVDAAYQQVRAAIESLVTLPVADGSDSTGLDLCFALTS 363
>gi|224053042|ref|XP_002297678.1| predicted protein [Populus trichocarpa]
gi|222844936|gb|EEE82483.1| predicted protein [Populus trichocarpa]
Length = 482
Score = 68.2 bits (165), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 84/297 (28%), Positives = 129/297 (43%), Gaps = 49/297 (16%)
Query: 72 LRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWL 131
+ GR + + PLT RL +L ++ V +G ++ IV DTGSDL W+
Sbjct: 109 ISGRNIDDSVDAPIPLT-----SGIRLQTLNYI--VTVELGGRKMTVIV--DTGSDLSWV 159
Query: 132 PCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE-LQKQ------CPSAG 183
C C C + + +++P+TS + V C+S C+ LQ C S
Sbjct: 160 QCQPCKRCYNQQDP---------VFNPSTSPSYRTVLCSSPTCQSLQSATGNLGVCGSNP 210
Query: 184 SNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPN 243
+C Y V Y DG+ + G L + L L S +V++ I FGCGR G F +
Sbjct: 211 PSCNYVVNY-GDGSYTRGELGTEHLDLGN----STAVNNFI-FGCGRNNQGLF---GGAS 261
Query: 244 GLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGTGRISFGDKGSPGQGETPFSLRQ 300
GL GLG ++S+ I + FS C ++ +G + G S + TP S +
Sbjct: 262 GLVGLG--RSSLSLISQTSAMFGGVFSYCLPITETEASGSLVMGGNSSVYKNTTPISYTR 319
Query: 301 THPT-----YNITITQVSVGGNAVNF----EFSAIFDSGTSFTYLNDPAYTQISETF 348
P Y + +T ++VG AV + + DSGT T L Y + + F
Sbjct: 320 MIPNPQLPFYFLNLTGITVGSVAVQAPSFGKDGMMIDSGTVITRLPPSIYQALKDEF 376
>gi|357123876|ref|XP_003563633.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 503
Score = 68.2 bits (165), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 80/256 (31%), Positives = 110/256 (42%), Gaps = 39/256 (15%)
Query: 100 SLGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYS 156
SLG +Y + +G P F V DTGSD W+ C VSC + ++
Sbjct: 157 SLGTANYVVPIGLGTPPSRFTVVFDTGSDTTWVQCRPCVVSCYKQKD---------RLFD 207
Query: 157 PNTSSTSSKVPCNSTLC-ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEK 215
P SST + V C C +L +AG +C Y ++Y DG+ + GF +D L +A D
Sbjct: 208 PAKSSTYANVSCADPACADLDASGCNAG-HCLYGIQY-GDGSYTVGFFAKDTLAVAQDAI 265
Query: 216 QSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-- 273
+ FGCG G F A GL GLG TS+ ++ A + SFS C
Sbjct: 266 KG------FKFGCGEKNRGLFGQTA---GLLGLGRGPTSI-TVQAYEKY-GGSFSYCLPA 314
Query: 274 GSDGTGRISF---GDKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSAIF-- 327
S TG + F S +T L PT Y + +T + VGG + ++F
Sbjct: 315 SSAATGYLEFGPLSPSSSGSNAKTTPMLTDKGPTFYYVGLTGIRVGGKQLGAIPESVFSN 374
Query: 328 -----DSGTSFTYLND 338
DSGT T L D
Sbjct: 375 SGTLVDSGTVITRLPD 390
>gi|226503109|ref|NP_001147206.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195608496|gb|ACG26078.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|413921850|gb|AFW61782.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 441
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 77/291 (26%), Positives = 119/291 (40%), Gaps = 47/291 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ ++++G P L + +DTGSDL W C C+ C + Q + + S+T
Sbjct: 89 YLVDLAIGTPPLYYTAIMDTGSDLIWTQCAPCLLC-------ADQPTPY--FDVKKSATY 139
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
+PC S+ C C YQ Y D + G L + +K +
Sbjct: 140 RALPCRSSRCASLSSPSCFKKMCVYQ-YYYGDTASTAGVLANETFTFGA-ANSTKVRATN 197
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDGTGR 280
I+FGCG + G D A +G+ G G S+ S L P+ FS C + S R
Sbjct: 198 IAFGCGSLNAG---DLANSSGMVGFGRGPLSLVSQLG-----PSRFSYCLTSYLSATPSR 249
Query: 281 ISFG----------DKGSPGQGETPFSLRQTHPT-YNITITQVSVGGN---------AVN 320
+ FG GSP Q TPF + P Y +++ +S+G A+N
Sbjct: 250 LYFGVYANLSSTNTSSGSPVQ-STPFVINPALPNMYFLSLKAISLGTKLLPIDPLVFAIN 308
Query: 321 FEFS--AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
+ + I DSGTS T+L AY + S A + +D+ + C+
Sbjct: 309 DDGTGGVIIDSGTSITWLQQDAYEAVRRGLVS-AIPLPAMNDTDIGLDTCF 358
>gi|326532056|dbj|BAK01404.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 506
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 83/269 (30%), Positives = 117/269 (43%), Gaps = 41/269 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ +V +G P F + +DTGSDL WL C C+ C SG + D P S +
Sbjct: 149 YLVDVYLGTPPRRFRMIMDTGSDLNWLQCAPCLDCFE----QSGPIFD-----PAASISY 199
Query: 164 SKVPCNSTLCEL--------QKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
V C C L ++C S+ CPY Y D + +TG L + + +
Sbjct: 200 RNVTCGDDRCRLVSPPAESAPRECRRPRSDPCPYYYWY-GDQSNTTGDLALEAFTVNLTQ 258
Query: 215 KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLI-PNSFSMCF 273
++ VD ++FGCG G F A L GLG S S L +G+ ++FS C
Sbjct: 259 SGTRRVDG-VAFGCGHRNRGLFHGAAG---LLGLGRGPLSFASQL--RGVYGGHAFSYCL 312
Query: 274 ---GSDGTGRISFGDKGS----PGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFE--- 322
GS +I FG + P T F+ T Y + + + VGG AVN
Sbjct: 313 VEHGSAAGSKIIFGHDDALLAHPQLNYTAFAPTTDADTFYYLQLKSILVGGEAVNISSDT 372
Query: 323 FSA---IFDSGTSFTYLNDPAYTQISETF 348
SA I DSGT+ +Y +PAY I + F
Sbjct: 373 LSAGGTIIDSGTTLSYFPEPAYQAIRQAF 401
>gi|79507883|ref|NP_196320.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332003717|gb|AED91100.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 455
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 71/278 (25%), Positives = 113/278 (40%), Gaps = 42/278 (15%)
Query: 111 VGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCN 169
+G PA ++A+DT SD+ W+PC CV C +SP S++ V C+
Sbjct: 121 IGTPAQPLLLAMDTSSDVAWIPCSGCVGCPSN-----------TAFSPAKSTSFKNVSCS 169
Query: 170 STLCELQKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGC 228
+ C+ Q P+ G+ C + + Y S + L +D + LA D ++ +FGC
Sbjct: 170 APQCK-QVPNPTCGARACSFNLTYGSSSIAAN--LSQDTIRLAADPIKA------FTFGC 220
Query: 229 GRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS----DGTGRISFG 284
G G P LG+ + + + Q + ++FS C S +G + G
Sbjct: 221 VNKVAG---GGTIPPPQGLLGLGRGPLSLMSQAQSIYKSTFSYCLPSFRSLTFSGSLRLG 277
Query: 285 DKGSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEFSA-----------IFDSGT 331
P + + LR + Y + + + VG V+ +A IFDSGT
Sbjct: 278 PTSQPQRVKYTQLLRNPRRSSLYYVNLVAIRVGRKVVDLPPAAIAFNPSTGAGTIFDSGT 337
Query: 332 SFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
+T L P Y + F K TS F+ CY
Sbjct: 338 VYTRLAKPVYEAVRNEFRKRVKPTTAVVTSLGGFDTCY 375
>gi|9759559|dbj|BAB11161.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|21553652|gb|AAM62745.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|109134179|gb|ABG25087.1| At5g07030 [Arabidopsis thaliana]
Length = 439
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 71/278 (25%), Positives = 113/278 (40%), Gaps = 42/278 (15%)
Query: 111 VGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCN 169
+G PA ++A+DT SD+ W+PC CV C +SP S++ V C+
Sbjct: 105 IGTPAQPLLLAMDTSSDVAWIPCSGCVGCPSN-----------TAFSPAKSTSFKNVSCS 153
Query: 170 STLCELQKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGC 228
+ C+ Q P+ G+ C + + Y S + L +D + LA D ++ +FGC
Sbjct: 154 APQCK-QVPNPTCGARACSFNLTYGSSSIAAN--LSQDTIRLAADPIKA------FTFGC 204
Query: 229 GRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS----DGTGRISFG 284
G G P LG+ + + + Q + ++FS C S +G + G
Sbjct: 205 VNKVAG---GGTIPPPQGLLGLGRGPLSLMSQAQSIYKSTFSYCLPSFRSLTFSGSLRLG 261
Query: 285 DKGSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEFSA-----------IFDSGT 331
P + + LR + Y + + + VG V+ +A IFDSGT
Sbjct: 262 PTSQPQRVKYTQLLRNPRRSSLYYVNLVAIRVGRKVVDLPPAAIAFNPSTGAGTIFDSGT 321
Query: 332 SFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
+T L P Y + F K TS F+ CY
Sbjct: 322 VYTRLAKPVYEAVRNEFRKRVKPTTAVVTSLGGFDTCY 359
>gi|413921849|gb|AFW61781.1| hypothetical protein ZEAMMB73_702843 [Zea mays]
Length = 379
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 77/291 (26%), Positives = 119/291 (40%), Gaps = 47/291 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ ++++G P L + +DTGSDL W C C+ C + Q + + S+T
Sbjct: 89 YLVDLAIGTPPLYYTAIMDTGSDLIWTQCAPCLLC-------ADQPTPY--FDVKKSATY 139
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
+PC S+ C C YQ Y D + G L + +K +
Sbjct: 140 RALPCRSSRCASLSSPSCFKKMCVYQY-YYGDTASTAGVLANETFTFGA-ANSTKVRATN 197
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDGTGR 280
I+FGCG + G D A +G+ G G S+ S L P+ FS C + S R
Sbjct: 198 IAFGCGSLNAG---DLANSSGMVGFGRGPLSLVSQLG-----PSRFSYCLTSYLSATPSR 249
Query: 281 ISFG----------DKGSPGQGETPFSLRQTHPT-YNITITQVSVGGN---------AVN 320
+ FG GSP Q TPF + P Y +++ +S+G A+N
Sbjct: 250 LYFGVYANLSSTNTSSGSPVQ-STPFVINPALPNMYFLSLKAISLGTKLLPIDPLVFAIN 308
Query: 321 FEFS--AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
+ + I DSGTS T+L AY + S A + +D+ + C+
Sbjct: 309 DDGTGGVIIDSGTSITWLQQDAYEAVRRGLVS-AIPLTAMNDTDIGLDTCF 358
>gi|88174573|gb|ABD39361.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
Group]
Length = 321
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 75/291 (25%), Positives = 121/291 (41%), Gaps = 44/291 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+ T+V +G P+ + IV +DTGS W+ C+C C H + + + S+T +
Sbjct: 1 YVTSVGLGTPSKTQIVEIDTGSSTSWVFCECDGC-H---------TNPRTFLQSRSTTCA 50
Query: 165 KVPCNSTLCELQKQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
KV C +++C L P +CP++V Y DG+ S G L +D L + +K
Sbjct: 51 KVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSY-QDGSASYGILYQDTLTFSDVQKIPS- 108
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC------- 272
+FGC G+ G +GL G+G SV L + FS C
Sbjct: 109 ----FTFGCNLDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPTFDGFSYCLPLQKSE 160
Query: 273 --FGSDGTGRISFGDKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFS---- 324
F S TG S G + + R+ + + + +SV G + S
Sbjct: 161 RGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSR 220
Query: 325 --AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRS 373
+FDSG+ +Y+ D A + +S+ L R + + CY +RS
Sbjct: 221 KGVVFDSGSELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRS 269
>gi|294463081|gb|ADE77078.1| unknown [Picea sitchensis]
Length = 370
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 73/287 (25%), Positives = 113/287 (39%), Gaps = 62/287 (21%)
Query: 122 LDTGSDLFWLPC----DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE--- 174
+DTGSDL W+PC C++C ++S+G ++ P SS+ V C + C+
Sbjct: 1 MDTGSDLVWVPCTRNYSCINCPED-SASNG------VFLPRMSSSLHLVTCADSNCKTLY 53
Query: 175 ------LQKQCPSAGSNC-----PYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
L + C + NC PY ++Y T G L+ + L+L + + +
Sbjct: 54 GNNTELLCQSCAGSLKNCSETCPPYGIQYGRGST--AGLLLTETLNLPLENGEGARAITH 111
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS------DG 277
+ GC S + P+G+ G G S+PS L + + F+ C S +
Sbjct: 112 FAVGC------SIVSSQQPSGIAGFGRGALSMPSQLGEH-IGKDRFAYCLQSHRFDEENK 164
Query: 278 TGRISFGDKGSPGQ---GETPFSLRQTHPT-------YNITITQVSVGGNAVN------F 321
+ GDK P TPF P Y I + VS+GG +
Sbjct: 165 KSLMVLGDKALPNNIPLNYTPFLTNSRAPPSSQYGVYYYIGLRGVSIGGKRLKQLPSKLL 224
Query: 322 EFS------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSD 362
F I DSGT+FT +D + I+ F S +R D
Sbjct: 225 RFDTKGNGGTIIDSGTTFTVFSDEIFKHIAAGFASQIGYRRAGEVED 271
>gi|356557203|ref|XP_003546907.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 470
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 74/262 (28%), Positives = 115/262 (43%), Gaps = 43/262 (16%)
Query: 109 VSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
V++G + + V +DTGSDL W+ C+ C SC + + ++ P+TS + +
Sbjct: 124 VTMGLGSQNMSVIVDTGSDLTWVQCEPCRSCYNQ---------NGPLFKPSTSPSYQPIL 174
Query: 168 CNSTLCELQK--QC---PSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
CNST C+ + C PS + C Y V Y DG+ ++G L + L SV S
Sbjct: 175 CNSTTCQSLELGACGSDPSTSATCDYVVNY-GDGSYTSGELGIEKLGFG-----GISV-S 227
Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD----GT 278
FGCGR G F +GL GLG + S+ I FS C S +
Sbjct: 228 NFVFGCGRNNKGLF---GGASGLMGLGRSELSM--ISQTNATFGGVFSYCLPSTDQAGAS 282
Query: 279 GRISFGDKGSPGQGETPFSLRQTHPT------YNITITQVSVGGNAVNFEFSA------I 326
G + G++ + TP + + P Y + +T + VGG +++ + S+ I
Sbjct: 283 GSLVMGNQSGVFKNVTPIAYTRMLPNLQLSNFYILNLTGIDVGGVSLHVQASSFGNGGVI 342
Query: 327 FDSGTSFTYLNDPAYTQISETF 348
DSGT + L Y + F
Sbjct: 343 LDSGTVISRLAPSVYKALKAKF 364
>gi|357122568|ref|XP_003562987.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 455
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 89/340 (26%), Positives = 129/340 (37%), Gaps = 46/340 (13%)
Query: 44 GILAVDDLPKKGSFAYYSALAHRD--RYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSL 101
G+ + P+ + + RD R+ R LA LT A N
Sbjct: 26 GLTRIHADPEVTASEFVRGALRRDMHRHARFAREQLAPSSAAAAGLTVGAPTQKDLRN-- 83
Query: 102 GFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFN--IYSPNT 159
G + +S+G P LS+ DTGSDL W C C + + Q + +Y+P++
Sbjct: 84 GGEYIMTLSIGTPPLSYRAIADTGSDLIW--TQCAPCGDTVTDTDNQCFKQSGCLYNPSS 141
Query: 160 SSTSSKVPCNSTL---CELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
S+T +PCNS L + P G C Y Y + T G + +
Sbjct: 142 STTFGVLPCNSPLSMCAAMAGPSPPPGCACMYNQTYGTGWT--AGVQSVETFTFGSSSTP 199
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--- 273
I+FGC + + +G+A GL GLG S+ S L +FS C
Sbjct: 200 PAVRVPNIAFGCSNASSNDW-NGSA--GLVGLGRGSMSLVSQLGA-----GAFSYCLTPF 251
Query: 274 -GSDGTGRISFGD------KGSPGQGETPF----SLRQTHPTYNITITQVSVGGNAVNFE 322
++ T + G KG+ TPF S Y + +T +SVG A+
Sbjct: 252 QDANSTSTLLLGPSAAAALKGTGPVRSTPFVAGPSKAPMSTYYYLNLTGISVGETALAIP 311
Query: 323 FSA-----------IFDSGTSFTYLNDPAYTQISETFNSL 351
A I DSGT+ T L D AY Q+ SL
Sbjct: 312 PDAFSLRADGTGGLIIDSGTTITTLVDSAYQQVRAAVRSL 351
>gi|226530755|ref|NP_001150335.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195638492|gb|ACG38714.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 461
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 85/272 (31%), Positives = 121/272 (44%), Gaps = 38/272 (13%)
Query: 98 LNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSP 157
LN+L +L V +G PA S + +DTGSD+ W+ C S H ++ P
Sbjct: 123 LNTLEYL--ITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADP--------LFDP 172
Query: 158 NTSSTSSKVPCNSTLCELQKQ----CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
++SST S C S C Q C S+ S C Y V Y DG+ +TG D L L +
Sbjct: 173 SSSSTYSPFSCGSAACAQLGQEGNGC-SSSSQCQYIVTY-GDGSSTTGTYSSDTLALGSS 230
Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
+S FGC V++G F D +GL GLG S+ S A G + +FS C
Sbjct: 231 AVKS------FQFGCSNVESG-FNDQT--DGLMGLGGGAQSLVSQTA--GTLGRAFSYCL 279
Query: 274 --GSDGTGRISFGDKGSPGQG---ETPFSLRQTHPT-YNITITQVSVGGNAVN-----FE 322
+G ++ G G G +TP PT Y + + + VGG ++ F
Sbjct: 280 PPTPSSSGFLTLGAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFS 339
Query: 323 FSAIFDSGTSFTYLNDPAYTQISETFNSLAKE 354
+ DSGT T L AY+ +S F + K+
Sbjct: 340 AGTVMDSGTVITRLPPTAYSALSSAFKAGMKQ 371
>gi|226508080|ref|NP_001150678.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
gi|195641018|gb|ACG39977.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 450
Score = 68.2 bits (165), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 83/296 (28%), Positives = 125/296 (42%), Gaps = 50/296 (16%)
Query: 109 VSVGQPALSFIVALDTGSDLFWLPC-DCVS-CVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
+++G P L + DTGSDL W C C S C +Y+P++S+T + +
Sbjct: 94 LAIGTPPLPYQAIADTGSDLIWTQCAPCTSQCFRQPTP---------LYNPSSSTTFAVL 144
Query: 167 PCNSTLC------ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
PCNS+L P G C Y V Y S G S E +T QS+
Sbjct: 145 PCNSSLSVCAAALAGTGTAPPPGCACTYNVTYGS-GWTSVFQGSETFTFGSTPAGQSRV- 202
Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GSD 276
I+FGC +G + ++ +GL GLG + S L +Q +P FS C ++
Sbjct: 203 -PGIAFGCSTASSG--FNASSASGLVGLGRGRLS----LVSQLGVPK-FSYCLTPYQDTN 254
Query: 277 GTGRISFGD----KGSPGQGETPF-SLRQTHPT---YNITITQVSVGGNAVNFEFSA--- 325
T + G G+ G TPF + T P Y + +T +S+G A++ A
Sbjct: 255 STSTLLLGPSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALSIPPDAFLL 314
Query: 326 --------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRS 373
I DSGT+ T L + AY Q+ SL ++ + C++L S
Sbjct: 315 NADGTGGLIIDSGTTITLLGNTAYQQVRAAVVSLVTLPTTDGSAATGLDLCFMLPS 370
>gi|340810987|gb|AEK75420.1| S5 [Oryza rufipogon]
gi|340810989|gb|AEK75421.1| S5 [Oryza rufipogon]
gi|340810991|gb|AEK75422.1| S5 [Oryza rufipogon]
gi|340811001|gb|AEK75427.1| S5 [Oryza rufipogon]
gi|340811019|gb|AEK75436.1| S5 [Oryza rufipogon]
gi|340811104|gb|AEK75478.1| S5 [Oryza rufipogon]
gi|340811124|gb|AEK75488.1| S5 [Oryza rufipogon]
Length = 472
Score = 67.8 bits (164), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 76/265 (28%), Positives = 111/265 (41%), Gaps = 38/265 (14%)
Query: 103 FLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
FL VS+G+P + +VA+DTGS L W+ C C H ++ +G + D P S
Sbjct: 112 FLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFD-----PGRSY 166
Query: 162 TSSKVPCNSTLC-EL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
TS +V C+S C EL Q C ++C Y V Y + S G +V D L +
Sbjct: 167 TSRRVRCSSVKCGELRYDLRLQQANCMEKENSCTYSVTYGNGWAYSVGKMVTDTLRIG-- 224
Query: 214 EKQSKSVDS--RISFGCGR-VQTGSFLDGAAPNGLFGLGMDK--TSVPSILANQGLIPNS 268
DS + FGC V+ F G G + P IL+ + +
Sbjct: 225 -------DSFMDLMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYK-----A 272
Query: 269 FSMCFGSDGT--GRISFG--DKGSPGQGETPFSLRQTHPTYNITITQ-VSVGGNAVNFEF 323
FS C +D T G + G D+ + G TP PTY++T+ ++ G V
Sbjct: 273 FSYCLPTDETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRLVTSSS 332
Query: 324 SAIFDSGTSFTYLNDPAYTQISETF 348
I DSG T L + + +T
Sbjct: 333 EMIVDSGAQRTSLWPSTFALLDKTI 357
>gi|88174569|gb|ABD39359.1| chloroplast nucleoid DNA-binding protein [Oryza nivara]
Length = 321
Score = 67.8 bits (164), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 75/291 (25%), Positives = 120/291 (41%), Gaps = 44/291 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+ +V +G PA + IV +DTGS W+ C+C C H + + + S+T +
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTTWVFCECDGC-H---------TNPRTFLQSRSTTCA 50
Query: 165 KVPCNSTLCELQKQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
KV C +++C L P +CP++V Y DG+ S G L +D L + +K
Sbjct: 51 KVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSY-QDGSASYGILYQDTLTFSDVQKIPS- 108
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC------- 272
+FGC G+ G +GL G+G SV L + FS C
Sbjct: 109 ----FTFGCNLDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPTFDGFSYCLPLQKSE 160
Query: 273 --FGSDGTGRISFGDKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFS---- 324
F S TG S G + + R+ + + + +SV G + S
Sbjct: 161 RGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSR 220
Query: 325 --AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRS 373
+FDSG+ +Y+ D A + +S+ L R + + CY +RS
Sbjct: 221 KGVVFDSGSELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRS 269
>gi|224074591|ref|XP_002304395.1| predicted protein [Populus trichocarpa]
gi|222841827|gb|EEE79374.1| predicted protein [Populus trichocarpa]
Length = 443
Score = 67.8 bits (164), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 70/290 (24%), Positives = 116/290 (40%), Gaps = 43/290 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ +S+G P + IV DTGSDL W+ C C C + ++ P+ SS+
Sbjct: 94 YFMKMSIGTPLVEVIVIADTGSDLTWVQCLPCDPCYRQKSP---------LFDPSRSSSY 144
Query: 164 SKVPCNSTLCEL----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
+ C S C ++ C + C Y Y D + + G L + + + +
Sbjct: 145 RHMLCGSRFCNALDVSEQACTMDTNICEYHYSY-GDKSYTNGNLATEKFTIGSTSSRPVH 203
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQ--GLIPNSFSMCF---- 273
+ S I FGCG G+F + L + L +Q +I FS C
Sbjct: 204 L-SPIVFGCGTGNGGTF------DELGSGIVGLGGGALSLVSQLSSIIKGKFSYCLVPLS 256
Query: 274 -GSDGTGRISFGDK---GSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF-------- 321
S+ T +I FG P TP +Q Y +T+ +SVG + +
Sbjct: 257 EQSNVTSKIKFGTDSVISGPQVVSTPLVSKQPDTYYYVTLEAISVGNKRLPYTNGLLNGN 316
Query: 322 --EFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
+ + I DSGT+ T+L+ +T++ K +R + L F C+
Sbjct: 317 VEKGNVIIDSGTTLTFLDSEFFTELERVLEETVKAERVSDPRGL-FSVCF 365
>gi|340810907|gb|AEK75380.1| S5 [Oryza sativa]
Length = 472
Score = 67.8 bits (164), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 76/265 (28%), Positives = 110/265 (41%), Gaps = 38/265 (14%)
Query: 103 FLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
FL VS+G+P + +VA+DTGS L W+ C C H ++ +G + D P S
Sbjct: 112 FLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFD-----PGRSY 166
Query: 162 TSSKVPCNSTLC-EL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
TS +V C+S C EL Q C +C Y V Y + S G +V D L +
Sbjct: 167 TSRRVRCSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIG-- 224
Query: 214 EKQSKSVDS--RISFGCGR-VQTGSFLDGAAPNGLFGLGMDK--TSVPSILANQGLIPNS 268
DS + FGC V+ F G G + P IL+ + +
Sbjct: 225 -------DSFMDLMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYK-----A 272
Query: 269 FSMCFGSDGT--GRISFG--DKGSPGQGETPFSLRQTHPTYNITITQ-VSVGGNAVNFEF 323
FS C +D T G + G D+ + G TP PTY++T+ ++ G V
Sbjct: 273 FSYCLPTDETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRLVTSSS 332
Query: 324 SAIFDSGTSFTYLNDPAYTQISETF 348
I DSG T L + + +T
Sbjct: 333 EMIVDSGAQRTSLWPSTFALLDKTI 357
>gi|196212952|gb|ACG76112.1| S5 [Oryza sativa Indica Group]
gi|338809989|gb|AEJ08560.1| S5 [Oryza barthii]
gi|340810883|gb|AEK75368.1| S5 [Oryza sativa]
gi|340810885|gb|AEK75369.1| S5 [Oryza sativa]
gi|340810889|gb|AEK75371.1| S5 [Oryza sativa]
gi|340810895|gb|AEK75374.1| S5 [Oryza sativa]
gi|340810897|gb|AEK75375.1| S5 [Oryza sativa]
gi|340810905|gb|AEK75379.1| S5 [Oryza sativa]
gi|340810909|gb|AEK75381.1| S5 [Oryza sativa]
gi|340810911|gb|AEK75382.1| S5 [Oryza sativa]
gi|340810913|gb|AEK75383.1| S5 [Oryza sativa]
gi|340810923|gb|AEK75388.1| S5 [Oryza sativa]
gi|340810925|gb|AEK75389.1| S5 [Oryza sativa]
gi|340810929|gb|AEK75391.1| S5 [Oryza sativa]
gi|340810935|gb|AEK75394.1| S5 [Oryza sativa]
gi|340810937|gb|AEK75395.1| S5 [Oryza sativa]
gi|340810939|gb|AEK75396.1| S5 [Oryza sativa]
gi|340810941|gb|AEK75397.1| S5 [Oryza sativa]
gi|340810943|gb|AEK75398.1| S5 [Oryza sativa]
gi|340810951|gb|AEK75402.1| S5 [Oryza sativa]
gi|340810953|gb|AEK75403.1| S5 [Oryza sativa]
gi|340810963|gb|AEK75408.1| S5 [Oryza sativa]
gi|340810965|gb|AEK75409.1| S5 [Oryza sativa]
gi|340810973|gb|AEK75413.1| S5 [Oryza nivara]
gi|340811003|gb|AEK75428.1| S5 [Oryza rufipogon]
gi|340811005|gb|AEK75429.1| S5 [Oryza rufipogon]
gi|340811009|gb|AEK75431.1| S5 [Oryza rufipogon]
gi|340811023|gb|AEK75438.1| S5 [Oryza rufipogon]
gi|340811025|gb|AEK75439.1| S5 [Oryza nivara]
gi|340811031|gb|AEK75442.1| S5 [Oryza rufipogon]
gi|340811033|gb|AEK75443.1| S5 [Oryza rufipogon]
gi|340811035|gb|AEK75444.1| S5 [Oryza nivara]
gi|340811039|gb|AEK75446.1| S5 [Oryza rufipogon]
gi|340811049|gb|AEK75451.1| S5 [Oryza nivara]
gi|340811053|gb|AEK75453.1| S5 [Oryza rufipogon]
gi|340811055|gb|AEK75454.1| S5 [Oryza nivara]
gi|340811057|gb|AEK75455.1| S5 [Oryza rufipogon]
gi|340811059|gb|AEK75456.1| S5 [Oryza rufipogon]
gi|340811061|gb|AEK75457.1| S5 [Oryza rufipogon]
gi|340811065|gb|AEK75459.1| S5 [Oryza nivara]
gi|340811067|gb|AEK75460.1| S5 [Oryza nivara]
gi|340811069|gb|AEK75461.1| S5 [Oryza nivara]
gi|340811071|gb|AEK75462.1| S5 [Oryza rufipogon]
gi|340811081|gb|AEK75467.1| S5 [Oryza nivara]
gi|340811083|gb|AEK75468.1| S5 [Oryza nivara]
gi|340811087|gb|AEK75470.1| S5 [Oryza nivara]
gi|340811092|gb|AEK75472.1| S5 [Oryza nivara]
gi|340811102|gb|AEK75477.1| S5 [Oryza rufipogon]
gi|340811106|gb|AEK75479.1| S5 [Oryza rufipogon]
gi|340811108|gb|AEK75480.1| S5 [Oryza rufipogon]
gi|340811110|gb|AEK75481.1| S5 [Oryza rufipogon]
gi|340811112|gb|AEK75482.1| S5 [Oryza rufipogon]
gi|340811118|gb|AEK75485.1| S5 [Oryza nivara]
gi|340811120|gb|AEK75486.1| S5 [Oryza rufipogon]
Length = 472
Score = 67.8 bits (164), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 76/265 (28%), Positives = 110/265 (41%), Gaps = 38/265 (14%)
Query: 103 FLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
FL VS+G+P + +VA+DTGS L W+ C C H ++ +G + D P S
Sbjct: 112 FLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFD-----PGRSY 166
Query: 162 TSSKVPCNSTLC-EL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
TS +V C+S C EL Q C +C Y V Y + S G +V D L +
Sbjct: 167 TSRRVRCSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIG-- 224
Query: 214 EKQSKSVDS--RISFGCGR-VQTGSFLDGAAPNGLFGLGMDK--TSVPSILANQGLIPNS 268
DS + FGC V+ F G G + P IL+ + +
Sbjct: 225 -------DSFMDLMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYK-----A 272
Query: 269 FSMCFGSDGT--GRISFG--DKGSPGQGETPFSLRQTHPTYNITITQ-VSVGGNAVNFEF 323
FS C +D T G + G D+ + G TP PTY++T+ ++ G V
Sbjct: 273 FSYCLPTDETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRLVTSSS 332
Query: 324 SAIFDSGTSFTYLNDPAYTQISETF 348
I DSG T L + + +T
Sbjct: 333 EMIVDSGAQRTSLWPSTFALLDKTI 357
>gi|413923784|gb|AFW63716.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 531
Score = 67.8 bits (164), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 85/272 (31%), Positives = 121/272 (44%), Gaps = 38/272 (13%)
Query: 98 LNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSP 157
LN+L +L V +G PA S + +DTGSD+ W+ C S H ++ P
Sbjct: 193 LNTLEYL--ITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADP--------LFDP 242
Query: 158 NTSSTSSKVPCNSTLCELQKQ----CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
++SST S C S C Q C S+ S C Y V Y DG+ +TG D L L +
Sbjct: 243 SSSSTYSPFSCGSADCAQLGQEGNGC-SSSSQCQYIVTY-GDGSSTTGTYSSDTLALGSS 300
Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
+S FGC V++G F D +GL GLG S+ S A G + +FS C
Sbjct: 301 AVRS------FQFGCSNVESG-FNDQT--DGLMGLGGGAQSLVSQTA--GTLGRAFSYCL 349
Query: 274 --GSDGTGRISFGDKGSPGQG---ETPFSLRQTHPT-YNITITQVSVGGNAVN-----FE 322
+G ++ G G G +TP PT Y + + + VGG ++ F
Sbjct: 350 PPTPSSSGFLTLGAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFS 409
Query: 323 FSAIFDSGTSFTYLNDPAYTQISETFNSLAKE 354
+ DSGT T L AY+ +S F + K+
Sbjct: 410 AGTVMDSGTVITRLPPTAYSALSSAFKAGMKQ 441
>gi|326503602|dbj|BAJ86307.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 461
Score = 67.8 bits (164), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 81/315 (25%), Positives = 117/315 (37%), Gaps = 64/315 (20%)
Query: 70 FRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLH----YTNVSVGQPALSFIVALDTG 125
F LR R + A+ + P + L F H +++VG P + + LDTG
Sbjct: 58 FALRARQMPARALPRQP------------SKLRFHHNVSLTVSLAVGTPPQNVTMVLDTG 105
Query: 126 SDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQ-----KQCP 180
S+L WL C + ++ S + P SST + VPC S C + C
Sbjct: 106 SELSWLLCAPAGARNKFSAMS--------FRPRASSTFAAVPCASAQCRSRDLPSPPACD 157
Query: 181 SAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGA 240
A S C + Y +DG+ S G L DV + + R +FGC S DG
Sbjct: 158 GASSRCSVSLSY-ADGSSSDGALATDVFAVGSGPPL------RAAFGCMSSAFDSSPDGV 210
Query: 241 APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS-DGTGRISFGDKGSPG--------- 290
A GL G+ S S + + FS C D G + G P
Sbjct: 211 ASAGLLGMNRGALSFVSQASTR-----RFSYCISDRDDAGVLLLGHSDLPTFLPLNYTPM 265
Query: 291 -QGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAI-----------FDSGTSFTYLND 338
Q P Y++ + + VGG + S + DSGT FT+L
Sbjct: 266 YQPALPLPYFD-RVAYSVQLLGIRVGGKHLPIPASVLAPDHTGAGQTMVDSGTQFTFLLG 324
Query: 339 PAYTQISETFNSLAK 353
AY+ + F A+
Sbjct: 325 DAYSALKAEFTRQAR 339
>gi|340810931|gb|AEK75392.1| S5 [Oryza sativa]
gi|340810983|gb|AEK75418.1| S5 [Oryza nivara]
gi|340810985|gb|AEK75419.1| S5 [Oryza nivara]
gi|340810997|gb|AEK75425.1| S5 [Oryza nivara]
gi|340811011|gb|AEK75432.1| S5 [Oryza nivara]
gi|340811013|gb|AEK75433.1| S5 [Oryza nivara]
gi|340811041|gb|AEK75447.1| S5 [Oryza nivara]
gi|340811043|gb|AEK75448.1| S5 [Oryza nivara]
Length = 474
Score = 67.8 bits (164), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 76/265 (28%), Positives = 110/265 (41%), Gaps = 38/265 (14%)
Query: 103 FLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
FL VS+G+P + +VA+DTGS L W+ C C H ++ +G + D P S
Sbjct: 114 FLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFD-----PGRSY 168
Query: 162 TSSKVPCNSTLC-EL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
TS +V C+S C EL Q C +C Y V Y + S G +V D L +
Sbjct: 169 TSRRVRCSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIG-- 226
Query: 214 EKQSKSVDS--RISFGCGR-VQTGSFLDGAAPNGLFGLGMDK--TSVPSILANQGLIPNS 268
DS + FGC V+ F G G + P IL+ + +
Sbjct: 227 -------DSFMDLMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYK-----A 274
Query: 269 FSMCFGSDGT--GRISFG--DKGSPGQGETPFSLRQTHPTYNITITQ-VSVGGNAVNFEF 323
FS C +D T G + G D+ + G TP PTY++T+ ++ G V
Sbjct: 275 FSYCLPTDETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRLVTSSS 334
Query: 324 SAIFDSGTSFTYLNDPAYTQISETF 348
I DSG T L + + +T
Sbjct: 335 EMIVDSGAQRTSLWPSTFALLDKTI 359
>gi|293332561|ref|NP_001170100.1| uncharacterized protein LOC100384018 precursor [Zea mays]
gi|224033441|gb|ACN35796.1| unknown [Zea mays]
gi|413944035|gb|AFW76684.1| hypothetical protein ZEAMMB73_746438 [Zea mays]
Length = 456
Score = 67.8 bits (164), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 71/269 (26%), Positives = 107/269 (39%), Gaps = 42/269 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ +++VG P + LDTGSDL W C C C + P SST
Sbjct: 86 YLVHLAVGTPPRPVALTLDTGSDLVWTQCAPCRDCFDQ---------GIPLLDPAASSTY 136
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ----SKS 219
+ +PC + C G +C Y Y D +++ G + D + ++ S
Sbjct: 137 AALPCGAPRCRALPFTSCGGRSCVYVYHY-GDKSVTVGKIATDRFTFGDNGRRNGDGSLP 195
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS---D 276
R++FGCG G F G+ G G + S+PS L SFS CF S
Sbjct: 196 ATRRLTFGCGHFNKGVFQSNE--TGIAGFGRGRWSLPSQLNA-----TSFSYCFTSMFDS 248
Query: 277 GTGRISFGDKGSPG-------QGE---TPFSLRQTHPT-YNITITQVSVGGNAVNFE--- 322
+ ++ G G+P GE TP + P+ Y +++ +SVG +
Sbjct: 249 KSSIVTLG--GAPAALYSHAHSGEVRTTPLFKNPSQPSLYFLSLKGISVGKTRLPVPETK 306
Query: 323 -FSAIFDSGTSFTYLNDPAYTQISETFNS 350
S I DSG S T L + Y + F +
Sbjct: 307 FRSTIIDSGASITTLPEEVYEAVKAEFAA 335
>gi|242046812|ref|XP_002461152.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
gi|241924529|gb|EER97673.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
Length = 452
Score = 67.8 bits (164), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 82/303 (27%), Positives = 126/303 (41%), Gaps = 46/303 (15%)
Query: 64 AHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALD 123
A RD L LAA+G + ++G + + + +G P ++A+D
Sbjct: 73 ASRDASRLLYLDSLAARGKARAYAPIASGRQLLQTPT----YVVRARLGTPPQQLLLAVD 128
Query: 124 TGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCEL--QKQCP 180
T +D W+PC C C +SS D P S++ VPC S LC CP
Sbjct: 129 TSNDAAWIPCAGCAGC----PTSSAPPFD-----PAASTSYRSVPCGSPLCAQAPNAACP 179
Query: 181 SAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGA 240
G C + + Y +D ++ L +D L +A D ++ +FGC + TG+ A
Sbjct: 180 PGGKACGFSLTY-ADSSLQAA-LSQDSLAVAGDAVKT------YTFGCLQKATGT---AA 228
Query: 241 APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS----DGTGRISFGDKGSPGQGETPF 296
P GL GLG S + + + +FS C S + +G + G G P + +T
Sbjct: 229 PPQGLLGLGRGPLSF--LSQTRDMYQGTFSYCLPSFKSLNFSGTLRLGRNGQPPRIKTTP 286
Query: 297 SLRQTHPT--YNITITQVSVGGNAVNFEFSA-----------IFDSGTSFTYLNDPAYTQ 343
L H + Y + +T + VG V A + DSGT FT L PAY
Sbjct: 287 LLANPHRSSLYYVNMTGIRVGRKVVPIPPPALAFDPATGAGTVLDSGTMFTRLVAPAYVA 346
Query: 344 ISE 346
+ +
Sbjct: 347 VRD 349
>gi|224067990|ref|XP_002302634.1| predicted protein [Populus trichocarpa]
gi|222844360|gb|EEE81907.1| predicted protein [Populus trichocarpa]
Length = 490
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 85/303 (28%), Positives = 120/303 (39%), Gaps = 44/303 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++T + VG PA + LDTGSD+ W+ C C C + +++P S +
Sbjct: 147 YFTRLGVGTPARYVFMVLDTGSDVVWIQCAPCKKCYSQTDP---------VFNPTKSRSF 197
Query: 164 SKVPCNSTLCELQKQ--CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
+ +PC S LC C + C YQV Y DG+ + G + L
Sbjct: 198 ANIPCGSPLCRRLDSPGCSTKKHICLYQVSY-GDGSFTYGEFSTETLTFRGTRV------ 250
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GSDG 277
R++ GCG G F+ A L GLG + S PS + + FS C S
Sbjct: 251 GRVALGCGHDNEGLFIGAAG---LLGLGRGRLSFPSQIGRR--FSRKFSYCLVDRSASSK 305
Query: 278 TGRISFGDKGSPGQGE-TPFSLRQTHPT-YNITITQVSVGGNAVN------FEFSA---- 325
+ FGD TP T Y + + VSVGG V F+ +
Sbjct: 306 PSYMVFGDSAISRTARFTPLVSNPKLDTFYYVELLGVSVGGTRVPGITASLFKLDSTGNG 365
Query: 326 --IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRSFLHLQA-LVV 382
I DSGTS T L PAY + + F A + L F+ C+ L ++ VV
Sbjct: 366 GVIIDSGTSVTRLTRPAYVALRDAFRVGASNLKRAPEFSL-FDTCFDLSGKTEVKVPTVV 424
Query: 383 LPF 385
L F
Sbjct: 425 LHF 427
>gi|90399033|emb|CAJ86229.1| H0402C08.5 [Oryza sativa Indica Group]
gi|125550227|gb|EAY96049.1| hypothetical protein OsI_17922 [Oryza sativa Indica Group]
Length = 473
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 75/301 (24%), Positives = 115/301 (38%), Gaps = 46/301 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ V VG P + +D+GSD+ W+ C C C + ++ P SS+
Sbjct: 130 YFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDP---------LFDPAASSSF 180
Query: 164 SKVPCNSTLCE----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
S V C S +C C Y V Y DG+ + G L + L L Q
Sbjct: 181 SGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTY-GDGSYTKGELALETLTLGGTAVQG-- 237
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSD 276
++ GCG +G F+ A GL GLG S+ L G FS C G+
Sbjct: 238 ----VAIGCGHRNSGLFVGAA---GLLGLGWGAMSLIGQLG--GAAGGVFSYCLASRGAG 288
Query: 277 GTGRISFGDKGSPGQGETPFSL---RQTHPTYNITITQVSVGGNAVNFE----------- 322
G G + G + G L Q Y + +T + VGG + +
Sbjct: 289 GAGSLVLGRTEAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDGLFQLTEDGA 348
Query: 323 FSAIFDSGTSFTYLNDPAYTQISETFN-SLAKEKRETSTSDLPFEYCYVLRSFLHLQALV 381
+ D+GT+ T L AY + F+ ++ R + S L + CY L + ++
Sbjct: 349 GGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLL--DTCYDLSGYASVRVPT 406
Query: 382 V 382
V
Sbjct: 407 V 407
>gi|88174579|gb|ABD39364.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
Group]
gi|88174585|gb|ABD39367.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
Group]
gi|88174595|gb|ABD39372.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
gi|88174599|gb|ABD39374.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
gi|88174607|gb|ABD39378.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 80/297 (26%), Positives = 125/297 (42%), Gaps = 56/297 (18%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+ +V +G PA + IV +DTGS W+ C+C C H + + + S+T +
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCECDGC-H---------TNPRTFLQSRSTTCA 50
Query: 165 KVPCNSTLCELQKQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
KV C +++C L P +CP++V Y DG+ S G L +D L + +K
Sbjct: 51 KVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSY-QDGSASYGILYQDTLTFSDVQKIPG- 108
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC------- 272
SFGC G+ G +GL G+G SV L + FS C
Sbjct: 109 ----FSFGCNMDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPTFDCFSYCLPLQKSE 160
Query: 273 --FGSDGTGRISFGDKGSPGQGE-TPFSLRQTHPT-YNITITQVSVGGNAVNFEFS---- 324
F S TG S G + T R+ + + + +T +SV G + S
Sbjct: 161 RGFFSKTTGYFSLGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVFSR 220
Query: 325 --AIFDSGTSFTYLNDPAYTQISETFNSL------AKEKRETSTSDLPFEYCYVLRS 373
+FDSG+ +Y+ D A + +S+ L A+E+ E + CY +RS
Sbjct: 221 KGVVFDSGSELSYIPDRALSVLSQRIRELLLKRGAAEEESERN--------CYDMRS 269
>gi|297741705|emb|CBI32837.3| unnamed protein product [Vitis vinifera]
Length = 455
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 78/276 (28%), Positives = 119/276 (43%), Gaps = 41/276 (14%)
Query: 36 HRYSDPVKGILAVDDLPKKGSFAYYSALAH---RDRYFRLRGRGLAAQGNDKTPLTFSAG 92
H S P K + K S A +AL R Y R R + A Q D P
Sbjct: 51 HSPSSPYKNV-------KAESLAKDTALESTLSRHAYLRARQQK-ALQPADFVPPPLIRD 102
Query: 93 NDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVID 151
+ N+S+G P + V LDTGSDLFW+ C+ C C +
Sbjct: 103 KSAF---------LANLSIGNPPTNVYVVLDTGSDLFWIQCEPCDVCYKQKDP------- 146
Query: 152 FNIYSPNTSSTSSKVPCNSTLC---ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVL 208
IY+ S + +++ CN C + QC +GS C YQ Y +DG+ ++G L + +
Sbjct: 147 --IYNRTKSDSYTEMLCNEPPCLSLGREGQCSDSGS-CLYQTSY-ADGSRTSGLLSYEKV 202
Query: 209 HLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNS 268
T + +++ FGCG +Q +F+ + G+ GLG S+ S L+ G + S
Sbjct: 203 AF-TSHYSDEDKTAQVGFGCG-LQNLNFVTSSRDGGVLGLGPGLVSLVSQLSAIGKVSKS 260
Query: 269 FSMCFGS----DGTGRISFGDKGSPGQGETPFSLRQ 300
F+ CFG+ + G + FGD TP + +
Sbjct: 261 FAYCFGNLSNPNAGGFLVFGDATYLNGDMTPMVIAE 296
>gi|168043550|ref|XP_001774247.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674374|gb|EDQ60883.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 419
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 76/258 (29%), Positives = 104/258 (40%), Gaps = 41/258 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ ++S G P V +DTGSDL W C C +C N+++ + D P SST
Sbjct: 80 YLIDISFGSPPQKASVIVDTGSDLIWTQCLPCETC----NAAASVIFD-----PVKSSTY 130
Query: 164 SKVPCNSTLC-ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
V C S C L Q S ++C Y Y DG+ ++G L
Sbjct: 131 DTVSCASNFCSSLPFQ--SCTTSCKYDYMY-GDGSSTSGALS------TETVTVGTGTIP 181
Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDGTG 279
++FGCG GSF A G+ GLG S+ I + FS C GS T
Sbjct: 182 NVAFGCGHTNLGSFAGAA---GIVGLGQGPLSL--ISQASSITSKKFSYCLVPLGSTKTS 236
Query: 280 RISFGDKGSP-GQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA-----------I 326
+ GD + G T +PT Y +T +SV G AV + I
Sbjct: 237 PMLIGDSAAAGGVAYTALLTNTANPTFYYADLTGISVSGKAVTYPVGTFSIDASGQGGFI 296
Query: 327 FDSGTSFTYLNDPAYTQI 344
DSGT+ TYL A+ +
Sbjct: 297 LDSGTTLTYLETGAFNAL 314
>gi|224029721|gb|ACN33936.1| unknown [Zea mays]
gi|413946782|gb|AFW79431.1| pepsin A [Zea mays]
Length = 534
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 77/309 (24%), Positives = 119/309 (38%), Gaps = 57/309 (18%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDC--VSCVHGLNSSSGQVIDF--------- 152
++ +V +G PAL + + LDT +DL W+ C H S+GQ +
Sbjct: 124 MYLVSVRIGTPALPYNLVLDTATDLTWINCRLRRRKGKHYGRQSTGQTMSMGGEGAKEAS 183
Query: 153 -NIYSPNTSSTSSKVPCNSTLCELQK----QCPSAGSNCPYQVRYLSDGTMSTGFLVEDV 207
N Y P SS+ ++ C+ C + Q PS +C Y + DGT++ G ++
Sbjct: 184 KNWYRPAKSSSWRRIRCSQKECAVLPYNTCQSPSKAESCSY-FQKTQDGTVTIGIYGKEK 242
Query: 208 LHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPN 267
+ + + + I GC ++ G +D A +G+ LG S A +
Sbjct: 243 ATVTVSDGRMAKLPGLI-LGCSVLEAGGSVD--AHDGVLSLGNGDMSFAVHAAKR--FGQ 297
Query: 268 SFSMCFGS-----DGTGRISFGDKGS---PGQGETPFSLR-QTHPTYNITITQVSVGGNA 318
FS C S D + ++FG + PG ET P Y +T V VGG
Sbjct: 298 RFSFCLLSANSSRDASSYLTFGPNPAVMGPGTMETDILYNVDVKPAYGAQVTGVLVGGER 357
Query: 319 VNFEFS-----------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP--- 364
++ I D+ TS T L AY ++ + S LP
Sbjct: 358 LDIPDEVWDAERFVGGGVILDTSTSVTSLVPEAYAPVTAALDR--------HLSHLPRVY 409
Query: 365 ----FEYCY 369
FEYCY
Sbjct: 410 ELEGFEYCY 418
>gi|226530663|ref|NP_001146528.1| uncharacterized protein LOC100280120 [Zea mays]
gi|219887685|gb|ACL54217.1| unknown [Zea mays]
Length = 292
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 47/177 (26%), Positives = 81/177 (45%), Gaps = 11/177 (6%)
Query: 201 GFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSIL 259
G V D + ++ + ++ D I FGCG Q G L+ +G+ GL S+P+ L
Sbjct: 2 GVYVRDSMQFVGEDGERENAD--IVFGCGYDQQGVLLNALETTDGVLGLTNKALSLPTQL 59
Query: 260 ANQGLIPNSFSMCFGSDGTGR---ISFGDKGSPGQGETPFSLRQ--THPTYNITITQVSV 314
A++G+I N+F C +D +G + GD P G T +R + Q++
Sbjct: 60 ASRGIISNAFGHCMSTDPSGAGGYLFLGDDYIPRWGMTWVPIRDGPADDVRRAQVKQINH 119
Query: 315 GGNAVNFE---FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYC 368
G +N + +FD+G+++TY D A T++ + A + SD +C
Sbjct: 120 GDQQLNAQGKLTQVVFDTGSTYTYFPDEALTRLISSLKEAASPRFVQDDSDKTLPFC 176
>gi|297820902|ref|XP_002878334.1| hypothetical protein ARALYDRAFT_907565 [Arabidopsis lyrata subsp.
lyrata]
gi|297324172|gb|EFH54593.1| hypothetical protein ARALYDRAFT_907565 [Arabidopsis lyrata subsp.
lyrata]
Length = 362
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 56/190 (29%), Positives = 84/190 (44%), Gaps = 32/190 (16%)
Query: 107 TNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHG---LNSSSGQVI--------DFNI 154
T + +G P F + +D+GS + ++PC DC C L+S Q++ F I
Sbjct: 94 TRLWIGTPPQMFALIVDSGSTVTYVPCSDCEQCGKHQVMLSSPKDQILCLVSCKVQIFKI 153
Query: 155 ----------YSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLV 204
+ P SST V CN + C C Y+ Y ++ + S G L
Sbjct: 154 SYGLFDEDPKFQPELSSTYQPVKCN-----MDCNCDDDKEQCVYEREY-AEHSSSKGVLG 207
Query: 205 EDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGL 264
ED++ +S R FGC V+TG A +G+ GLG S+ L ++GL
Sbjct: 208 EDLISFGN---ESHLTPQRAVFGCKTVETGDLYSQRA-DGIIGLGQGDLSLVGQLVDKGL 263
Query: 265 IPNSFSMCFG 274
I NSF +C+G
Sbjct: 264 ISNSFGLCYG 273
>gi|118487542|gb|ABK95598.1| unknown [Populus trichocarpa]
Length = 471
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 72/284 (25%), Positives = 117/284 (41%), Gaps = 38/284 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+Y + +G P + + LDTGS L WL C C H +Y P+ S T
Sbjct: 125 YYVKLGLGTPPKYYAMILDTGSSLSWLQCQPCAVYCHAQADP--------LYDPSVSKTY 176
Query: 164 SKVPCNSTLCELQKQ-------CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
K+ C S C K C + + C Y Y D + S G+L +D+L L + +
Sbjct: 177 KKLSCASVECSRLKAATLNDPLCETDSNACLYTASY-GDTSFSIGYLSQDLLTLTSSQTL 235
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD 276
+ ++GCG+ G F A G+ GL DK S+ + L+ + ++FS C +
Sbjct: 236 -----PQFTYGCGQDNQGLFGRAA---GIIGLARDKLSMLAQLSTK--YGHAFSYCLPTA 285
Query: 277 GTGRISFGDKG----SPGQGE-TPFSLRQTHPT-YNITITQVSVGGN-----AVNFEFSA 325
+G G SP + TP +P+ Y + +T ++V G A +
Sbjct: 286 NSGSSGGGFLSIGSISPTSYKFTPMLTDSKNPSLYFLRLTAITVSGRPLDLAAAMYRVPT 345
Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
+ DSGT T L Y + + F + K + + + C+
Sbjct: 346 LIDSGTVITRLPMSMYAALRQAFVKIMSTKYAKAPAYSILDTCF 389
>gi|88174589|gb|ABD39369.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 75/291 (25%), Positives = 120/291 (41%), Gaps = 44/291 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+ +V +G PA + IV +DTGS W+ C+C C H + + + S+T +
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSASWVFCECDGC-H---------TNPRTFLQSRSTTCA 50
Query: 165 KVPCNSTLCELQKQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
KV C +++C L P +CP++V Y DG+ S G L +D L + +K
Sbjct: 51 KVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSY-QDGSASYGILYQDTLTFSDVQKIPS- 108
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC------- 272
+FGC G+ G +GL G+G SV L + FS C
Sbjct: 109 ----FTFGCNLDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPTFDGFSYCLPLQKSE 160
Query: 273 --FGSDGTGRISFGDKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFS---- 324
F S TG S G + + R+ + + + +SV G + S
Sbjct: 161 RGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSR 220
Query: 325 --AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRS 373
+FDSG+ +Y+ D A + +S+ L R + + CY +RS
Sbjct: 221 KGVVFDSGSELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRS 269
>gi|356527091|ref|XP_003532147.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 482
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 79/265 (29%), Positives = 116/265 (43%), Gaps = 45/265 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CV-SCVHGLNSSSGQVIDFNIYSPNTSST 162
+Y V +G P + DTGS L W C+ C SC + I+ P+ SS+
Sbjct: 140 YYVVVGLGTPKRDLSLIFDTGSYLTWTQCEPCAGSCYKQQDP---------IFDPSKSSS 190
Query: 163 SSKVPCNSTLCELQKQCPSAG------SNCPYQVRYLSDGTMSTGFLVEDVLHL-ATDEK 215
+ + C S+LC Q SAG ++C Y V+Y D ++S GFL ++ L + ATD
Sbjct: 191 YTNIKCTSSLC---TQFRSAGCSSSTDASCIYDVKY-GDNSISRGFLSQERLTITATD-- 244
Query: 216 QSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
+ FGCG+ G F A GL +G+ + + + + FS C S
Sbjct: 245 ----IVHDFLFGCGQDNEGLFRGTA---GL--MGLSRHPISFVQQTSSIYNKIFSYCLPS 295
Query: 276 --DGTGRISFGDKGSPGQG--ETPFS-LRQTHPTYNITITQVSVGGNAV----NFEFSA- 325
G ++FG + TPFS + + Y + I +SVGG + + FSA
Sbjct: 296 TPSSLGHLTFGASAATNANLKYTPFSTISGENSFYGLDIVGISVGGTKLPAVSSSTFSAG 355
Query: 326 --IFDSGTSFTYLNDPAYTQISETF 348
I DSGT T L AY + F
Sbjct: 356 GSIIDSGTVITRLPPTAYAALRSAF 380
>gi|88174597|gb|ABD39373.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
gi|88174601|gb|ABD39375.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
gi|88174603|gb|ABD39376.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 75/291 (25%), Positives = 120/291 (41%), Gaps = 44/291 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+ +V +G PA + IV +DTGS W+ C+C C H + + + S+T +
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCECDGC-H---------TNPRTFLQSRSTTCA 50
Query: 165 KVPCNSTLCELQKQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
KV C +++C L P +CP++V Y DG+ S G L +D L + +K
Sbjct: 51 KVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSY-QDGSASYGILYQDTLTFSDVQKIPS- 108
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC------- 272
+FGC G+ G +GL G+G SV L + FS C
Sbjct: 109 ----FTFGCNLDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPTFDGFSYCLPLQKSE 160
Query: 273 --FGSDGTGRISFGDKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFS---- 324
F S TG S G + + R+ + + + +SV G + S
Sbjct: 161 RGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSR 220
Query: 325 --AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRS 373
+FDSG+ +Y+ D A + +S+ L R + + CY +RS
Sbjct: 221 KGVVFDSGSELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRS 269
>gi|88174556|gb|ABD39353.1| chloroplast nucleoid DNA-binding protein [Oryza barthii]
Length = 321
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 77/291 (26%), Positives = 122/291 (41%), Gaps = 44/291 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+ +V +G P+ + IV +DTGS W+ C+C C H + + + S+T +
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSTSWVFCECDGC-H---------TNPRTFLQSRSTTCA 50
Query: 165 KVPCNSTLCELQKQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
KV C +++C L P +CP++V Y DG+ S G L +D L + +K
Sbjct: 51 KVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSY-QDGSASYGILYQDTLTFSDVQKIPG- 108
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC------- 272
SFGC G+ G +GL G+G SV L + FS C
Sbjct: 109 ----FSFGCNMDSFGANEFGNV-DGLLGMGAGAMSV---LKQSSPTFDCFSYCLPLQKSE 160
Query: 273 --FGSDGTGRISFGDKGSPGQGE-TPFSLRQTHPT-YNITITQVSVGGNAVNFEFS---- 324
F S TG S G + T R+ + + + +T +SV G + S
Sbjct: 161 RGFFSKTTGYFSLGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSIFSR 220
Query: 325 --AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRS 373
+FDSG+ +Y+ D A + +S+ L R + + CY +RS
Sbjct: 221 KGVVFDSGSELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRS 269
>gi|88174554|gb|ABD39352.1| chloroplast nucleoid DNA-binding protein [Oryza barthii]
Length = 321
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 77/291 (26%), Positives = 122/291 (41%), Gaps = 44/291 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+ +V +G P+ + IV +DTGS W+ C+C C H + + + S+T +
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSTSWVFCECDGC-H---------TNPRTFLQSRSTTCA 50
Query: 165 KVPCNSTLCELQKQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
KV C +++C L P +CP++V Y DG+ S G L +D L + +K
Sbjct: 51 KVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSY-QDGSASYGILYQDTLTFSDVQKIPG- 108
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC------- 272
SFGC G+ G +GL G+G SV L + FS C
Sbjct: 109 ----FSFGCNMDSFGANEFGNV-DGLLGMGAGAMSV---LKQSSPTFDCFSYCLPLQKSE 160
Query: 273 --FGSDGTGRISFGDKGSPGQGE-TPFSLRQTHPT-YNITITQVSVGGNAVNFEFS---- 324
F S TG S G + T R+ + + + +T +SV G + S
Sbjct: 161 RGFFSKTTGYFSLGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSIFSR 220
Query: 325 --AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRS 373
+FDSG+ +Y+ D A + +S+ L R + + CY +RS
Sbjct: 221 KGVVFDSGSELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRS 269
>gi|356567196|ref|XP_003551807.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 490
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 76/263 (28%), Positives = 109/263 (41%), Gaps = 37/263 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
++ V +G P + DTGSDL W C+ SC + I+ P+ S++
Sbjct: 146 YFVVVGLGTPKRDLSLIFDTGSDLTWTQCEPCARSCYKQQDV---------IFDPSKSTS 196
Query: 163 SSKVPCNSTLCELQKQ-------CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL-ATDE 214
S + C S LC C ++ C Y ++Y D + S G+ + L + ATD
Sbjct: 197 YSNITCTSALCTQLSTATGNDPGCSASTKACIYGIQY-GDSSFSVGYFSRERLTVTATD- 254
Query: 215 KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG 274
V FGCG+ G F A GL GLG S A + S+ +
Sbjct: 255 -----VVDNFLFGCGQNNQGLFGGSA---GLIGLGRHPISFVQQTAAKYRKIFSYCLPST 306
Query: 275 SDGTGRISFGDKGSPGQGE-TPFS-LRQTHPTYNITITQVSVGGNAVNFEFS------AI 326
S TG +SFG + + TPFS + + Y + IT ++VGG + S AI
Sbjct: 307 SSSTGHLSFGPAATGRYLKYTPFSTISRGSSFYGLDITAIAVGGVKLPVSSSTFSTGGAI 366
Query: 327 FDSGTSFTYLNDPAYTQISETFN 349
DSGT T L AY + F
Sbjct: 367 IDSGTVITRLPPTAYGALRSAFR 389
>gi|357456413|ref|XP_003598487.1| Peptidase A1, putative [Medicago truncatula]
gi|355487535|gb|AES68738.1| Peptidase A1, putative [Medicago truncatula]
Length = 414
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 82/286 (28%), Positives = 121/286 (42%), Gaps = 50/286 (17%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ +G P + ++A+DT +D W+PC C C L ++P S+T
Sbjct: 78 YIVRAKIGTPPQTLLLAMDTSNDAAWIPCTACDGCASTL------------FAPEKSTTF 125
Query: 164 SKVPCNSTLCELQKQCPSAG---SNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
V C + C KQ P+ G S+C + + Y S + LV+D + LATD S
Sbjct: 126 KNVSCAAPEC---KQVPNPGCGVSSCNFNLTYGSSSIAAN--LVQDTITLATDPVPS--- 177
Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS----D 276
+FGC TG+ A P GL GLG S+ S Q L ++FS C S +
Sbjct: 178 ---YTFGCVSKTTGT---SAPPQGLLGLGRGPLSLLS--QTQNLYQSTFSYCLPSFKSLN 229
Query: 277 GTGRISFGDKGSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEFSA--------- 325
+G + G P + + L+ + Y + + + VG V+ +A
Sbjct: 230 FSGSLRLGPVAQPKRIKYTPLLKNPRRSSLYYVNLEAIRVGRKVVDIPPAALAFNPTTGA 289
Query: 326 --IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
IFDSGT FT L P Y + + F K T TS F+ CY
Sbjct: 290 GTIFDSGTVFTRLVAPVYVAVRDEFRRRVGPKL-TVTSLGGFDTCY 334
>gi|326506682|dbj|BAJ91382.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326525815|dbj|BAJ88954.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 79/280 (28%), Positives = 117/280 (41%), Gaps = 34/280 (12%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+ T + +G PA +I+ +DTGS L WL C C + SG V D P TSS+ +
Sbjct: 137 YVTRMGLGTPAKPYIMVVDTGSSLTWL--QCSPCRVSCHRQSGPVFD-----PKTSSSYA 189
Query: 165 KVPCNSTLCE------LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
V C++ C L S+ C YQ Y D + S G+L +D + ++
Sbjct: 190 AVSCSTPQCNDLSTATLNPAACSSSDVCIYQASY-GDSSFSVGYLSKDTVSFGSNSVP-- 246
Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT 278
+GCG+ G F A GL GL +K S+ LA + SFS C S +
Sbjct: 247 ----NFYYGCGQDNEGLFGRSA---GLMGLARNKLSLLYQLAPT--LGYSFSYCLPSSSS 297
Query: 279 GRISFGDKGSPGQ-GETPF-SLRQTHPTYNITITQVSVGGNAVNF---EFSA---IFDSG 330
+PGQ TP S Y I ++ ++V G + E+S+ I DSG
Sbjct: 298 SGYLSIGSYNPGQYSYTPMVSSTLDDSLYFIKLSGMTVAGKPLAVSSSEYSSLPTIIDSG 357
Query: 331 TSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYV 370
T T L Y +S+ K + + + C+V
Sbjct: 358 TVITRLPTTVYDALSKAVAGAMKGTKRADAYSI-LDTCFV 396
>gi|449519146|ref|XP_004166596.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 752
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 86/298 (28%), Positives = 117/298 (39%), Gaps = 48/298 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
++ +V +G P F + LDTGSDL W+ CV C + Y P S +
Sbjct: 196 YFIDVFIGSPPKHFSLILDTGSDLNWI--QCVPCFDCFEQNGP------YYDPKDSISFR 247
Query: 165 KVPCNSTLCEL------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
+ CN C+L + C +CPY Y + F +E T K
Sbjct: 248 NITCNDPRCQLVSSPDPPRPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSTTGK 307
Query: 219 SVDSRIS---FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-- 273
S R+ FGCG G F A L GLG S S L Q L +SFS C
Sbjct: 308 SEFRRVENVMFGCGHWNRGLFHGAAG---LLGLGRGPLSFSSQL--QSLYGHSFSYCLVD 362
Query: 274 ---GSDGTGRISFGDKGS----PGQGETPFSLRQTHPT---YNITITQVSVGGNAV---- 319
+ + ++ FG+ P T + +P Y + I + VGG +
Sbjct: 363 RDSDTSVSSKLIFGEDKDLLTHPELNFTSLIAGKENPVDTFYYLQIKSIFVGGEKLQIPE 422
Query: 320 -NFEFSA------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEY-CY 369
N+ SA I DSGT+ +Y +DPAY I E F L K K D P + CY
Sbjct: 423 ENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAF--LRKVKGYKLVEDFPILHPCY 478
>gi|115448353|ref|NP_001047956.1| Os02g0720900 [Oryza sativa Japonica Group]
gi|45735844|dbj|BAD12879.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735970|dbj|BAD12999.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|113537487|dbj|BAF09870.1| Os02g0720900 [Oryza sativa Japonica Group]
gi|125540930|gb|EAY87325.1| hypothetical protein OsI_08729 [Oryza sativa Indica Group]
gi|215692622|dbj|BAG88042.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 458
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 77/275 (28%), Positives = 117/275 (42%), Gaps = 34/275 (12%)
Query: 100 SLGFLHY-TNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPN 158
S+G +Y T + +G PA +++ +DTGS L WL C C+ + SG V ++P
Sbjct: 116 SVGVGNYVTRMGLGTPATQYVMVVDTGSSLTWL--QCSPCLVSCHRQSGPV-----FNPK 168
Query: 159 TSSTSSKVPCNSTLCE------LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLAT 212
+SST + V C++ C L S+ + C YQ Y D + S G+L +D + +
Sbjct: 169 SSSTYASVGCSAQQCSDLPSATLNPSACSSSNVCIYQASY-GDSSFSVGYLSKDTVSFGS 227
Query: 213 DEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC 272
+GCG+ G F A GL GL +K S+ LA + SF+ C
Sbjct: 228 TSLP------NFYYGCGQDNEGLFGRSA---GLIGLARNKLSLLYQLAPS--LGYSFTYC 276
Query: 273 FGSDGTGRISFGDKGSPGQ-GETPF-SLRQTHPTYNITITQVSVGGNAV------NFEFS 324
S + +PGQ TP S Y I ++ ++V GN +
Sbjct: 277 LPSSSSSGYLSLGSYNPGQYSYTPMVSSSLDDSLYFIKLSGMTVAGNPLSVSSSAYSSLP 336
Query: 325 AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETS 359
I DSGT T L Y+ +S+ + K S
Sbjct: 337 TIIDSGTVITRLPTSVYSALSKAVAAAMKGTSRAS 371
>gi|356508308|ref|XP_003522900.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 439
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 82/289 (28%), Positives = 123/289 (42%), Gaps = 48/289 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ +G P + ++A+DT +D W+PC C C L ++P S+T
Sbjct: 98 YIVRAKIGSPPQTLLLAMDTSNDAAWIPCTACDGCTSTL------------FAPEKSTTF 145
Query: 164 SKVPCNSTLCELQKQCPSAG-SNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
V C S C Q PS G S C + + Y S + +V+D + LATD
Sbjct: 146 KNVSCGSPQCN-QVPNPSCGTSACTFNLTYGSSSIAAN--VVQDTVTLATDPIPD----- 197
Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS----DGT 278
+FGC TG+ A P GL GLG S+ S Q L ++FS C S + +
Sbjct: 198 -YTFGCVAKTTGA---SAPPQGLLGLGRGPLSLLS--QTQNLYQSTFSYCLPSFKSLNFS 251
Query: 279 GRISFGDKGSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEFSA----------- 325
G + G P + + L+ + Y + + + VG V+ A
Sbjct: 252 GSLRLGPVAQPIRIKYTPLLKNPRRSSLYYVNLVAIRVGRKVVDIPPEALAFNAATGAGT 311
Query: 326 IFDSGTSFTYLNDPAYTQISETFN---SLAKEKRETSTSDLPFEYCYVL 371
+FDSGT FT L PAYT + + F ++A + T TS F+ CY +
Sbjct: 312 VFDSGTVFTRLVAPAYTAVRDEFQRRVAIAAKANLTVTSLGGFDTCYTV 360
>gi|449453872|ref|XP_004144680.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
CELL 1-like [Cucumis sativus]
Length = 757
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 86/298 (28%), Positives = 117/298 (39%), Gaps = 48/298 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
++ +V +G P F + LDTGSDL W+ CV C + Y P S +
Sbjct: 196 YFIDVFIGSPPKHFSLILDTGSDLNWI--QCVPCFDCFEQNGP------YYDPKDSISFR 247
Query: 165 KVPCNSTLCEL------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
+ CN C+L + C +CPY Y + F +E T K
Sbjct: 248 NITCNDPRCQLVSSPDPPRPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSTTGK 307
Query: 219 SVDSRIS---FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-- 273
S R+ FGCG G F GL GLG S S L Q L +SFS C
Sbjct: 308 SEFRRVENVMFGCGHWNRGLF---HGAAGLLGLGRGPLSFSSQL--QSLYGHSFSYCLVD 362
Query: 274 ---GSDGTGRISFGDKGS----PGQGETPFSLRQTHPT---YNITITQVSVGGNAV---- 319
+ + ++ FG+ P T + +P Y + I + VGG +
Sbjct: 363 RDSDTSVSSKLIFGEDKDLLTHPELNFTSLIAGKENPVDTFYYLQIKSIFVGGEKLQIPE 422
Query: 320 -NFEFSA------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEY-CY 369
N+ SA I DSGT+ +Y +DPAY I E F L K K D P + CY
Sbjct: 423 ENWNLSADGAGGTIIDSGTTLSYFSDPAYRIIKEAF--LRKVKGYKLVEDFPILHPCY 478
>gi|302853254|ref|XP_002958143.1| hypothetical protein VOLCADRAFT_99354 [Volvox carteri f.
nagariensis]
gi|300256504|gb|EFJ40768.1| hypothetical protein VOLCADRAFT_99354 [Volvox carteri f.
nagariensis]
Length = 475
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 52/175 (29%), Positives = 82/175 (46%), Gaps = 16/175 (9%)
Query: 188 YQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFG 247
Y R ++ + S G++VED D+ R+ FGC +TG A +G+ G
Sbjct: 8 YYSRTYAERSSSEGWMVEDAFGFPDDQPPV-----RMVFGCENGETGEIYRQLA-DGIMG 61
Query: 248 LGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFS--LRQTH-PT 304
+G + + S L +G+I + FS+CFG G + GD P T ++ L H
Sbjct: 62 MGNNHNAFQSQLVARGVIEDVFSLCFGYPKDGILLLGDVPMPKGANTVYTPLLNNLHLHY 121
Query: 305 YNITITQVSVGG-----NAVNFE--FSAIFDSGTSFTYLNDPAYTQISETFNSLA 352
YN+ + ++V G NA F + + DSGT+FTYL A+ ++ S A
Sbjct: 122 YNVRMDGIAVNGVELSLNARIFTRGYGVVLDSGTTFTYLPTEAFNAMAAAIGSYA 176
>gi|88174581|gb|ABD39365.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
Group]
Length = 321
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 75/291 (25%), Positives = 120/291 (41%), Gaps = 44/291 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+ +V +G PA + IV +DTGS W+ C+C C H + + + S+T +
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCECDGC-H---------TNPRTFLQSRSTTCA 50
Query: 165 KVPCNSTLCELQKQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
KV C +++C L P +CP++V Y DG+ S G L +D L + +K
Sbjct: 51 KVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSY-QDGSASYGILYQDTLTFSDVQKIPS- 108
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC------- 272
+FGC G+ G +GL G+G SV L + FS C
Sbjct: 109 ----FTFGCNLDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPRFDGFSYCLPLQKSE 160
Query: 273 --FGSDGTGRISFGDKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFS---- 324
F S TG S G + + R+ + + + +SV G + S
Sbjct: 161 RGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSR 220
Query: 325 --AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRS 373
+FDSG+ +Y+ D A + +S+ L R + + CY +RS
Sbjct: 221 KGVVFDSGSELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRS 269
>gi|297798978|ref|XP_002867373.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297313209|gb|EFH43632.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 434
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 86/305 (28%), Positives = 128/305 (41%), Gaps = 43/305 (14%)
Query: 99 NSLGFLHYTNVSVGQPALSFIVALDTGSDLFW---LPCDCVSCVHGLNSSSGQVIDFNIY 155
N FL N+S+G P + ++ +DTGSDL W LPC C Q I F +
Sbjct: 84 NPAAFL--ANISIGDPPVPQLLLIDTGSDLTWIQCLPCKCYP----------QTIPF--F 129
Query: 156 SPNTSSTSSKVPCNSTLCELQKQCPSAGS-NCPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
P+ SST C S + + + NC Y +RY D + + G L ++ L T +
Sbjct: 130 HPSRSSTYRNASCESAPHAMPQIFRDEKTGNCRYHLRY-RDFSNTRGILAKEKLTFQTSD 188
Query: 215 KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG 274
+ S I FGCG+ +G +G+ GLG S+ + N G + FS CFG
Sbjct: 189 EGLIS-KPNIVFGCGQDNSGF----TQYSGVLGLGPGTFSI--VTRNFG---SKFSYCFG 238
Query: 275 S--DGTGRISFGDKGSPGQGE-TPFSLRQTHPTYNITITQVSVGGNAVNFE--------- 322
S D T +F G+ + E P L+ Y + + +S+G ++ E
Sbjct: 239 SLIDPTYPHNFLILGNGARIEGDPTPLQIFQDRYYLDLQAISLGEKLLDIEPGIFQRYRS 298
Query: 323 -FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEK-RETSTSDLPFEYCYVLRSFLHLQAL 380
+ D+G S T L AY +SE + L E R + +CY L L
Sbjct: 299 KGGTVIDTGCSPTILAREAYETLSEEIDFLLGEVLRRVKDWEQYTNHCYEGNLKLDLYGF 358
Query: 381 VVLPF 385
V+ F
Sbjct: 359 PVVTF 363
>gi|38344196|emb|CAE05761.2| OSJNBa0064G10.12 [Oryza sativa Japonica Group]
Length = 451
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 75/295 (25%), Positives = 112/295 (37%), Gaps = 56/295 (18%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ V VG P + +D+GSD+ W+ C C C + ++ P SS+
Sbjct: 130 YFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDP---------LFDPAASSSF 180
Query: 164 SKVPCNSTLCE----LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
S V C S +C C Y V Y DG+ + G L + L L Q
Sbjct: 181 SGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTY-GDGSYTKGELALETLTLGGTAVQG-- 237
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTG 279
++ GCG +G F+ A GL GLG S+ L G FS C S G G
Sbjct: 238 ----VAIGCGHRNSGLFVGAA---GLLGLGWGAMSLVGQLG--GAAGGVFSYCLASRGAG 288
Query: 280 RISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS-----------AIFD 328
G G S Y + +T + VGG + + S + D
Sbjct: 289 ----------GAGSLASSF------YYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVVMD 332
Query: 329 SGTSFTYLNDPAYTQISETFN-SLAKEKRETSTSDLPFEYCYVLRSFLHLQALVV 382
+GT+ T L AY + F+ ++ R + S L + CY L + ++ V
Sbjct: 333 TGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLL--DTCYDLSGYASVRVPTV 385
>gi|88174605|gb|ABD39377.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 75/291 (25%), Positives = 120/291 (41%), Gaps = 44/291 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+ +V +G PA + IV +DTGS W+ C+C C H + + + S+T +
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCECDGC-H---------TNPRTFLQSRSTTCA 50
Query: 165 KVPCNSTLCELQKQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
KV C +++C L P +CP++V Y DG+ S G L +D L + +K
Sbjct: 51 KVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSY-QDGSASYGILYQDTLTFSDVQKIPS- 108
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC------- 272
+FGC G+ G +GL G+G SV L + FS C
Sbjct: 109 ----FTFGCNLDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPRFDGFSYCLPLQKSE 160
Query: 273 --FGSDGTGRISFGDKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFS---- 324
F S TG S G + + R+ + + + +SV G + S
Sbjct: 161 RGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSR 220
Query: 325 --AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRS 373
+FDSG+ +Y+ D A + +S+ L R + + CY +RS
Sbjct: 221 KGVVFDSGSELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRS 269
>gi|242050432|ref|XP_002462960.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
gi|241926337|gb|EER99481.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
Length = 445
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 84/315 (26%), Positives = 132/315 (41%), Gaps = 48/315 (15%)
Query: 64 AHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALD 123
A R R R LAA ++ T T SA +++ + +++G P +S+ D
Sbjct: 50 ALRRDMHRHNARQLAASSSNGT--TVSAPT---QISPTAGEYLMTLAIGTPPVSYQAIAD 104
Query: 124 TGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTL----CELQKQC 179
TGSDL W C C SS +Y+P++S+T + +PCNS+L L
Sbjct: 105 TGSDLIW--TQCAPC-----SSQCFQQPTPLYNPSSSTTFAVLPCNSSLSMCAAALAGTT 157
Query: 180 PSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDG 239
P G C Y + Y S T + + + + +++ I+FGC G +
Sbjct: 158 PPPGCTCMYNMTYGSGWT--SVYQGSETFTFGSSTPANQTGVPGIAFGCSNASGG--FNT 213
Query: 240 AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GSDGTGRISFGDKGS----PGQ 291
++ +GL GLG S+ S L +P FS C ++ T + G S G
Sbjct: 214 SSASGLVGLGRGSLSLVSQLG----VPK-FSYCLTPYQDTNSTSTLLLGPSASLNDTGGV 268
Query: 292 GETPFSLRQTHPT----YNITITQVSVGGNAVNFEFSA-----------IFDSGTSFTYL 336
TPF + Y + +T +S+G A++ +A I DSGT+ T L
Sbjct: 269 SSTPFVASPSDAPMSTYYYLNLTGISLGTTALSIPTTALSLKADGTGGFIIDSGTTITLL 328
Query: 337 NDPAYTQISETFNSL 351
+ AY Q+ SL
Sbjct: 329 GNTAYQQVRAAVVSL 343
>gi|195626958|gb|ACG35309.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 450
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 83/299 (27%), Positives = 127/299 (42%), Gaps = 42/299 (14%)
Query: 64 AHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALD 123
A RD L LA +G + ++G L +L ++ S+G P ++A+D
Sbjct: 75 ASRDASRLLYLDSLAVRGRARAYAPIASGRQL--LQTLTYV--VRASLGTPPQQLLLAVD 130
Query: 124 TGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCEL--QKQCP 180
T +D W+PC C C +SS D P S++ VPC S LC CP
Sbjct: 131 TSNDASWIPCAGCAGC----PTSSAAPFD-----PAASASYRTVPCGSPLCAQAPNAACP 181
Query: 181 SAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGA 240
G C + + Y +D ++ L +D L +A + ++ +FGC + TG+ A
Sbjct: 182 PGGKACGFSLTY-ADSSLQAA-LSQDSLAVAGNAVKA------YTFGCLQRATGT---AA 230
Query: 241 APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS----DGTGRISFGDKGSPGQGETPF 296
P GL GLG S + + + +FS C S + +G + G G P + +T
Sbjct: 231 PPQGLLGLGRGPLSF--LSQTKDMYEATFSYCLPSFKSLNFSGTLRLGRNGQPQRIKTTP 288
Query: 297 SLRQTHPT--YNITITQVSVGGNAVNFE-------FSAIFDSGTSFTYLNDPAYTQISE 346
L H + Y + +T V VG V + DSGT FT L PAY + +
Sbjct: 289 LLANPHRSSLYYVNMTGVRVGRKVVPIPAFDPATGAGTVLDSGTMFTRLVAPAYVAVRD 347
>gi|88174571|gb|ABD39360.1| chloroplast nucleoid DNA-binding protein [Oryza nivara]
Length = 321
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 75/291 (25%), Positives = 120/291 (41%), Gaps = 44/291 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+ +V +G PA + IV +DTGS W+ C+C C H + + + S+T +
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCECDGC-H---------TNPRTFLQSRSTTCA 50
Query: 165 KVPCNSTLCELQKQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
KV C +++C L P +CP++V Y DG+ S G L +D L + +K
Sbjct: 51 KVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSY-QDGSASYGILYQDTLTFSDVQKIPS- 108
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC------- 272
+FGC G+ G +GL G+G SV L + FS C
Sbjct: 109 ----FTFGCNLDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPRFDGFSYCLPLQKSE 160
Query: 273 --FGSDGTGRISFGDKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFS---- 324
F S TG S G + + R+ + + + +SV G + S
Sbjct: 161 RGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSR 220
Query: 325 --AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRS 373
+FDSG+ +Y+ D A + +S+ L R + + CY +RS
Sbjct: 221 KGVVFDSGSELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRS 269
>gi|357118076|ref|XP_003560785.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 477
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 84/304 (27%), Positives = 125/304 (41%), Gaps = 48/304 (15%)
Query: 98 LNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVS--CVHGLNSSSGQVIDFNIY 155
L++L F+ V G PA + + LDTGSDL W+ C S C + DF+
Sbjct: 132 LDTLEFV--VVVGFGTPAQTAAIILDTGSDLSWIQCKPCSGHCYRQHDP------DFD-- 181
Query: 156 SPNTSSTSSKVPCNSTLCELQ-KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
P SS+ + VPC + +C C G+ C Y V+Y DG+ +TG L D L +
Sbjct: 182 -PAKSSSYAAVPCGTPVCAAAGGMC--NGTTCLYGVQY-GDGSSTTGVLSRDTLTFNSSS 237
Query: 215 KQSKSVDSRISFGCGRVQTGSF--LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC 272
K + +FGCG G F +DG G L + + PS FS C
Sbjct: 238 KFTG-----FTFGCGEKNIGDFGEVDGLLGLGRGKLSLPSQAAPSFGG-------VFSYC 285
Query: 273 FGSDGT--GRISFGDKGSPGQGETPFSLRQTHPTYN----ITITQVSVGG------NAVN 320
S T G ++ G ++ P Y I + +++GG +V
Sbjct: 286 LPSYNTTPGYLNIGATKPTSTVPVQYTAMIKKPQYPSFYFIELVSINIGGYILPVPPSVF 345
Query: 321 FEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRSFLHLQAL 380
+ + DSGT TYL PAYT + + F + + + P + CY Q
Sbjct: 346 TKTGTLLDSGTILTYLPPPAYTSLRDRFKFTMQGNKPAPPYE-PLDTCYDFTG----QGA 400
Query: 381 VVLP 384
+V+P
Sbjct: 401 IVIP 404
>gi|242084332|ref|XP_002442591.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
gi|241943284|gb|EES16429.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
Length = 493
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 53/173 (30%), Positives = 79/173 (45%), Gaps = 20/173 (11%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ ++VG PA+ ++A+DTGSD+ WL C C C SG V D P S++
Sbjct: 134 YMAKIAVGTPAVEALLAMDTGSDITWLQCQPCRRCY----PQSGPVFD-----PRHSTSY 184
Query: 164 SKVPCNSTLCELQKQCPSAGSN---CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
++ ++ C+ + + C Y V Y DG+ + G +E+ L A +
Sbjct: 185 REMGYDAPDCQALGRSGGGDAKRMTCVYAVGYGDDGSTTVGDFIEETLTFAGGVQV---- 240
Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
+S GCG G F AA G+ GLG + S PS +A G SFS C
Sbjct: 241 -PHMSIGCGHDNKGLFAAPAA--GILGLGRGQISCPSQIAALGYNVTSFSYCL 290
>gi|194707866|gb|ACF88017.1| unknown [Zea mays]
Length = 461
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 85/272 (31%), Positives = 121/272 (44%), Gaps = 38/272 (13%)
Query: 98 LNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSP 157
LN+L +L V +G PA S + +DTGSD+ W+ C S H ++ P
Sbjct: 123 LNTLEYL--ITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADP--------LFDP 172
Query: 158 NTSSTSSKVPCNSTLCELQKQ----CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
++SST S C S C Q C S+ S C Y V Y DG+ +TG D L L +
Sbjct: 173 SSSSTYSPFSCGSADCAQLGQEGNGC-SSSSQCQYIVTY-GDGSSTTGTYSSDTLALGSS 230
Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
+S FGC V++G F D +GL GLG S+ S A G + +FS C
Sbjct: 231 AVRS------FQFGCSNVESG-FNDQT--DGLMGLGGGAQSLVSQTA--GTLGRAFSYCL 279
Query: 274 --GSDGTGRISFGDKGSPGQG---ETPFSLRQTHPT-YNITITQVSVGGNAVN-----FE 322
+G ++ G G G +TP PT Y + + + VGG ++ F
Sbjct: 280 PPTPSSSGFLTLGAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFS 339
Query: 323 FSAIFDSGTSFTYLNDPAYTQISETFNSLAKE 354
+ DSGT T L AY+ +S F + K+
Sbjct: 340 AGTVMDSGTVITRLPPTAYSALSSAFKAGMKQ 371
>gi|51091919|dbj|BAD35188.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|125596474|gb|EAZ36254.1| hypothetical protein OsJ_20576 [Oryza sativa Japonica Group]
gi|196212950|gb|ACG76111.1| S5 [Oryza sativa Japonica Group]
gi|340810891|gb|AEK75372.1| S5 [Oryza sativa]
gi|340810893|gb|AEK75373.1| S5 [Oryza sativa]
gi|340810899|gb|AEK75376.1| S5 [Oryza sativa]
gi|340810901|gb|AEK75377.1| S5 [Oryza sativa]
gi|340810933|gb|AEK75393.1| S5 [Oryza sativa]
gi|340810947|gb|AEK75400.1| S5 [Oryza sativa]
gi|340810949|gb|AEK75401.1| S5 [Oryza sativa]
gi|340810967|gb|AEK75410.1| S5 [Oryza sativa]
gi|340810969|gb|AEK75411.1| S5 [Oryza sativa]
gi|340810999|gb|AEK75426.1| S5 [Oryza rufipogon]
gi|340811017|gb|AEK75435.1| S5 [Oryza rufipogon]
gi|340811029|gb|AEK75441.1| S5 [Oryza nivara]
gi|340811051|gb|AEK75452.1| S5 [Oryza nivara]
gi|340811075|gb|AEK75464.1| S5 [Oryza nivara]
gi|340811077|gb|AEK75465.1| S5 [Oryza rufipogon]
gi|340811085|gb|AEK75469.1| S5 [Oryza nivara]
gi|340811096|gb|AEK75474.1| S5 [Oryza rufipogon]
gi|340811100|gb|AEK75476.1| S5 [Oryza rufipogon]
gi|340811114|gb|AEK75483.1| S5 [Oryza nivara]
Length = 472
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 76/265 (28%), Positives = 109/265 (41%), Gaps = 38/265 (14%)
Query: 103 FLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
FL VS+G+P + +VA+DTGS L W+ C C H ++ +G + D P S
Sbjct: 112 FLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFD-----PGRSY 166
Query: 162 TSSKVPCNSTLC-EL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
TS +V C+S C EL Q C +C Y V Y + S G +V D L +
Sbjct: 167 TSRRVRCSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIG-- 224
Query: 214 EKQSKSVDS--RISFGCGR-VQTGSFLDGAAPNGLFGLGMDK--TSVPSILANQGLIPNS 268
DS + FGC V+ F G G + P IL+ + L
Sbjct: 225 -------DSFMDLMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKAL---- 273
Query: 269 FSMCFGSDGT--GRISFG--DKGSPGQGETPFSLRQTHPTYNITITQ-VSVGGNAVNFEF 323
S C +D T G + G D+ + G TP PTY++T+ ++ G V
Sbjct: 274 -SYCLPTDETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRLVTSSS 332
Query: 324 SAIFDSGTSFTYLNDPAYTQISETF 348
I DSG T L + + +T
Sbjct: 333 EMIVDSGAQRTSLWPSTFALLDKTI 357
>gi|340810993|gb|AEK75423.1| S5 [Oryza rufipogon]
gi|340811015|gb|AEK75434.1| S5 [Oryza nivara]
gi|340811021|gb|AEK75437.1| S5 [Oryza nivara]
Length = 474
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 76/265 (28%), Positives = 109/265 (41%), Gaps = 38/265 (14%)
Query: 103 FLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
FL VS+G+P + +VA+DTGS L W+ C C H ++ +G + D P S
Sbjct: 114 FLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFD-----PGRSY 168
Query: 162 TSSKVPCNSTLC-EL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
TS +V C+S C EL Q C +C Y V Y + S G +V D L +
Sbjct: 169 TSRRVRCSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIG-- 226
Query: 214 EKQSKSVDS--RISFGCGR-VQTGSFLDGAAPNGLFGLGMDK--TSVPSILANQGLIPNS 268
DS + FGC V+ F G G + P IL+ + L
Sbjct: 227 -------DSFMDLMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKAL---- 275
Query: 269 FSMCFGSDGT--GRISFG--DKGSPGQGETPFSLRQTHPTYNITITQ-VSVGGNAVNFEF 323
S C +D T G + G D+ + G TP PTY++T+ ++ G V
Sbjct: 276 -SYCLPTDETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRLVTSSS 334
Query: 324 SAIFDSGTSFTYLNDPAYTQISETF 348
I DSG T L + + +T
Sbjct: 335 EMIVDSGAQRTSLWPSTFALLDKTI 359
>gi|302763741|ref|XP_002965292.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
gi|300167525|gb|EFJ34130.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
Length = 423
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 87/301 (28%), Positives = 134/301 (44%), Gaps = 51/301 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ ++ VG P + + DTGSD+ WL C C SC GQ +++P+ SST
Sbjct: 81 YFVSLGVGTPPRTVNMVADTGSDVLWLQCLPCQSCY-------GQTDP--LFNPSFSSTF 131
Query: 164 SKVPCNSTLCE--LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
+ C S+LC+ L + C + C YQV Y DG+ + G + L ++ S
Sbjct: 132 QSITCGSSLCQQLLIRGCRR--NQCLYQVSY-GDGSFTVGEFSTETLSFGSNAVNS---- 184
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGT 278
++ GCG G F A L GLG S PS + L + FS C S G+
Sbjct: 185 --VAIGCGHNNQGLFTGAAG---LLGLGKGLLSFPSQVGQ--LYGSVFSYCLPTRESTGS 237
Query: 279 GRISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGGNAVNFEFSA--------- 325
+ FG++ + F+ T+P Y + + + VGG +VN +
Sbjct: 238 VPLIFGNQAVASNAQ--FTTLLTNPKLDTFYYVEMVGIKVGGTSVNIPAGSLSLDSSTGN 295
Query: 326 ---IFDSGTSFTYLNDPAYTQISETFNS-LAKEKRETSTSDLPFEYCYVL--RSFLHLQA 379
I DSGT+ T L AY + + F + + + + TS L F+ CY L RS + L A
Sbjct: 296 GGVILDSGTAVTRLVTSAYNPMRDAFRAGMPSDAKMTSGFSL-FDTCYDLSGRSSIMLPA 354
Query: 380 L 380
+
Sbjct: 355 V 355
>gi|242092900|ref|XP_002436940.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
gi|241915163|gb|EER88307.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
Length = 465
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 77/322 (23%), Positives = 130/322 (40%), Gaps = 40/322 (12%)
Query: 66 RDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTG 125
R Y + R A D +T + ++SL ++ + G P++ ++ +DTG
Sbjct: 89 RTNYIKSRASTGMASTPDDAAVTVPTRLGGF-VDSLEYM--VTLGFGTPSVPQVLLMDTG 145
Query: 126 SDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC-----ELQKQCP 180
SD+ W+ C C NS+ ++ P+ SST + + C + C + C
Sbjct: 146 SDVSWV--QCAPC----NSTECYPQKDPLFDPSKSSTYAPIACGADACNKLGDHYRNGCT 199
Query: 181 SAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGA 240
S G+ C Y+V Y DG+ + G + + A FGCG Q G
Sbjct: 200 SGGTQCGYRVEY-GDGSSTRGVYSNETITFAPGITVKD-----FHFGCGHDQRGP---SD 250
Query: 241 APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS--DGTGRISFGDKGSPGQGETPF-- 296
+GL GLG S+ ++ + +FS C + G ++ G + S + F
Sbjct: 251 KFDGLLGLGGAPESL--VVQTASVYGGAFSYCLPALNSEAGFLALGVRPSAATNTSAFVF 308
Query: 297 ----SLRQTHPTYNITITQVSVGGNAVNFEFSA-----IFDSGTSFTYLNDPAYTQISET 347
L +Y + +T +SVGG ++ SA + DSGT T L + AY ++
Sbjct: 309 TPMWHLPMDATSYMVNMTGISVGGKPLDIPRSAFRGGMLIDSGTIVTELPETAYNALNAA 368
Query: 348 FNSLAKEKRETSTSDLPFEYCY 369
++ D F+ CY
Sbjct: 369 LRKAFAAYPMVASED--FDTCY 388
>gi|145693994|gb|ABP93697.1| unknown protein isoform 2 [Lemna minor]
Length = 351
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 79/253 (31%), Positives = 105/253 (41%), Gaps = 35/253 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
+ V G P + V DTGSD+ WL C V C ++ P+ SST
Sbjct: 16 YVITVGFGTPTRTQTVVFDTGSDVNWLQCKPCAVRCYAQQEP---------LFDPSLSST 66
Query: 163 SSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
V C C + S C Y V Y DG+ + GFL D L +K +
Sbjct: 67 YRNVSCTEPACVGLSTRGCSSSTCLYGVFY-GDGSSTIGFLAMDTFMLTPAQKFKNFI-- 123
Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKT-SVPSILANQGLIPNSFSMCF--GSDGTG 279
FGCG+ TG F A GL GLG T S+ S +A + N FS C S TG
Sbjct: 124 ---FGCGQNNTGLFQGTA---GLVGLGRSSTYSLNSQVAPS--LGNVFSYCLPSTSSATG 175
Query: 280 RISFGD-KGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNF------EFSAIFDSGT 331
++ G+ + +PG R PT Y I + +SVGG ++ I DSGT
Sbjct: 176 YLNIGNPQNTPGYTAMLTDTRV--PTLYFIDLIGISVGGTRLSLSSTVFQSVGTIIDSGT 233
Query: 332 SFTYLNDPAYTQI 344
T L AY+ +
Sbjct: 234 VITRLPPTAYSAL 246
>gi|413952718|gb|AFW85367.1| hypothetical protein ZEAMMB73_231535 [Zea mays]
Length = 443
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 73/270 (27%), Positives = 109/270 (40%), Gaps = 45/270 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ ++VG P + LDTGSDL W C C C D + P SST
Sbjct: 84 YLVRLAVGTPRRPVALTLDTGSDLVWTQCAPCRDCFDQ---------DLPVLDPAASSTY 134
Query: 164 SKVPCNSTLCELQK------QCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
+ +PC + C + +C Y Y D +++ G + D
Sbjct: 135 AALPCGAARCRALPFTSCGVRTLGNHRSCIYAYHY-GDKSLTVGEIATDRFTFGDSGGSG 193
Query: 218 KSVDS-RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS- 275
+S+ + R++FGCG + G F G+ G G + S+PS L SFS CF S
Sbjct: 194 ESLHTRRLTFGCGHLNKGVFQSNE--TGIAGFGRGRWSLPSQLNV-----TSFSYCFTSM 246
Query: 276 --DGTGRISFGDKGSPG-------QGE---TPFSLRQTHPT-YNITITQVSVGGNAV--- 319
+ ++ G GSP GE TP + P+ Y +++ +SVG +
Sbjct: 247 FESKSSLVTLG--GSPAALYSHAHSGEVRTTPILKNPSQPSLYFLSLKGISVGKTRLPVP 304
Query: 320 NFEF-SAIFDSGTSFTYLNDPAYTQISETF 348
+F S I DSG S T L + Y + F
Sbjct: 305 ETKFRSTIIDSGASITTLPEEVYEAVKAEF 334
>gi|357515189|ref|XP_003627883.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521905|gb|AET02359.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 76/276 (27%), Positives = 112/276 (40%), Gaps = 43/276 (15%)
Query: 112 GQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNS 170
G P + ++ALDT SD W+PC CV C S+S ++P S++ V C S
Sbjct: 104 GTPPQTLLLALDTSSDAAWIPCSGCVGC-----STSKP------FAPIKSTSFRNVSCGS 152
Query: 171 TLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGR 230
C+ GS C + Y S ++ +V+D L LATD +FGC
Sbjct: 153 PHCKQVPNPTCGGSACAFNFTYGSSSIAAS--VVQDTLTLATDPIPG------YTFGCVN 204
Query: 231 VQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS----DGTGRISFGDK 286
TGS +AP + +Q L ++FS C S + +G + G
Sbjct: 205 KTTGS----SAPQQGLLGLGRGPLS-LLSQSQNLYKSTFSYCLPSFKSINFSGSLRLGPV 259
Query: 287 GSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEFSA-----------IFDSGTSF 333
P + + LR + Y + + + VG V+ +A IFDSGT F
Sbjct: 260 YQPKRIKYTPLLRNPRRSSLYYVNLVAIKVGRKIVDIPPAALAFNPTTGAGTIFDSGTVF 319
Query: 334 TYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
T L +P YT + F K +T F+ CY
Sbjct: 320 TRLAEPVYTAVRNEFRRRVGPKLPVTTLGG-FDTCY 354
>gi|242077672|ref|XP_002448772.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
gi|241939955|gb|EES13100.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
Length = 471
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 72/305 (23%), Positives = 120/305 (39%), Gaps = 51/305 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ V +G P + +D+GSD+ W+ C C+ C + ++ P +S+T
Sbjct: 125 YFVRVGIGSPPTEQYLVVDSGSDVIWVQCKPCLECYAQADP---------LFDPASSATF 175
Query: 164 SKVPCNSTLCELQKQ--CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
S V C S +C + C +G C Y+V Y DG+ + G L + L L +
Sbjct: 176 SAVSCGSAICRTLRTSGCGDSG-GCEYEVSY-GDGSYTKGTLALETLTLGGTAVEG---- 229
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS------ 275
++ GCG G F+ A GL GLG S+ L +FS C S
Sbjct: 230 --VAIGCGHRNRGLFVGAA---GLLGLGWGPMSLVGQLGGA--AGGAFSYCLASRGGSGS 282
Query: 276 ---DGTGRISFGDKGSPGQGE--TPFSLRQTHPT-YNITITQVSVGGNAVNFE------- 322
D G + G + +G P P+ Y + ++ + VG + +
Sbjct: 283 GAADAAGSLVLGRSEAVPEGAVWVPLVRNPQAPSFYYVGVSGIGVGDERLPLQDGLFQLT 342
Query: 323 ----FSAIFDSGTSFTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEYCYVLRSFLHL 377
+ D+GT+ T L AY + + F ++ R S L + CY L + +
Sbjct: 343 EDGGGGVVMDTGTAVTRLPQEAYAALRDAFVGAVGALPRAPGVSLL--DTCYDLSGYTSV 400
Query: 378 QALVV 382
+ V
Sbjct: 401 RVPTV 405
>gi|88174561|gb|ABD39355.1| chloroplast nucleoid DNA-binding protein [Oryza longistaminata]
Length = 321
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 76/291 (26%), Positives = 122/291 (41%), Gaps = 44/291 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+ +V +G P+ + I+ +DTGS W+ C+C C H + + + S+T +
Sbjct: 1 YVISVGLGTPSKTQILEIDTGSSTSWVFCECDGC-H---------TNPRTFLQSRSTTCA 50
Query: 165 KVPCNSTLCELQKQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
KV C +++C L P +CP++V Y DG+ S G L +D L + +K
Sbjct: 51 KVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSY-QDGSASYGILYQDTLTFSDVQKIPS- 108
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC------- 272
SFGC G+ G +GL G+G SV L + FS C
Sbjct: 109 ----FSFGCNMDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPTFDGFSYCLPLQMSE 160
Query: 273 --FGSDGTGRISFGDKGSPGQGE-TPFSLRQTHPT-YNITITQVSVGGNAVNFEFS---- 324
F S TG S G + T R+ + + + +T +SV G + S
Sbjct: 161 RGFFSKTTGYFSLGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSIFSR 220
Query: 325 --AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRS 373
+FDSG+ +Y+ D A + +S+ L R + + CY +RS
Sbjct: 221 KGVVFDSGSELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRS 269
>gi|413938607|gb|AFW73158.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 478
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 92/323 (28%), Positives = 135/323 (41%), Gaps = 35/323 (10%)
Query: 65 HRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYT-NVSVGQPALSFIVALD 123
R Y R G A Q D +A +G L+Y S+G P ++ + +D
Sbjct: 99 RRAEYILRRVSGRAPQLWDSKAAAAAATVPASWGYDIGTLNYVVTASLGTPGVAQTMEVD 158
Query: 124 TGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE---LQKQCP 180
TGSDL W+ C S S + D P SS+ + VPC +C +
Sbjct: 159 TGSDLSWVQCKPCSAAPSCYSQKDPLFD-----PAQSSSYAAVPCGGPVCAGLGIYAASA 213
Query: 181 SAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGA 240
+ + C Y V Y DG+ +TG D L L+ + S FGCG Q+G F +G
Sbjct: 214 CSAAQCGYVVSY-GDGSNTTGVYSSDTLTLS-----ASSAVQGFFFGCGHAQSGLF-NGV 266
Query: 241 APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT--GRISFGDKGSPGQGETPFSL 298
+GL GLG ++ S+ + G FS C + + G ++ G G P FS
Sbjct: 267 --DGLLGLGREQPSL--VEQTAGTYGGVFSYCLPTKPSTAGYLTLG-VGGPSGAAPGFST 321
Query: 299 RQTHPT------YNITITQVSVGGNAVNFEFSAI-----FDSGTSFTYLNDPAYTQISET 347
Q P+ Y + +T +SVGG ++ SA D+GT T L AY +
Sbjct: 322 TQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTVVDTGTVVTRLPPTAYAALRSA 381
Query: 348 FNS-LAKEKRETSTSDLPFEYCY 369
F S +A T+ S+ + CY
Sbjct: 382 FRSGMASYGYPTAPSNGILDTCY 404
>gi|357118074|ref|XP_003560784.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial
[Brachypodium distachyon]
Length = 452
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 77/256 (30%), Positives = 108/256 (42%), Gaps = 37/256 (14%)
Query: 109 VSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPC 168
V G PA + DTGSDL W+ C C V D P SS+ + VPC
Sbjct: 116 VGFGSPAQTSATMFDTGSDLSWI--QCQPCSGHCYKQHDPVFD-----PAKSSSYAVVPC 168
Query: 169 NSTLCELQ-KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFG 227
+T C +C G+ C Y V Y DG+ +TG L + L ++ + + + FG
Sbjct: 169 GTTECAAAGGEC--NGTTCVYGVEY-GDGSSTTGVLARETLTFSSSSEFTGFI-----FG 220
Query: 228 CGRVQTGSF--LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT--GRISF 283
CG G F +DG G L + + P+ G I FS C S T G +S
Sbjct: 221 CGETNLGDFGEVDGLLGLGRGSLSLSSQAAPAF----GGI---FSYCLPSYNTTPGYLSI 273
Query: 284 GDKGSPGQGETPFSLRQTHPTYN----ITITQVSVGGNAVNF---EFS---AIFDSGTSF 333
G GQ ++ P Y I + +++GG + EF+ + DSGT
Sbjct: 274 GATPVTGQIPVQYTAMVNKPDYPSFYFIELVSINIGGYVLPVPPSEFTKTGTLLDSGTIL 333
Query: 334 TYLNDPAYTQISETFN 349
TYL PAYT + + F
Sbjct: 334 TYLPPPAYTALRDRFK 349
>gi|297817208|ref|XP_002876487.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
lyrata]
gi|297322325|gb|EFH52746.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
lyrata]
Length = 520
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 80/283 (28%), Positives = 111/283 (39%), Gaps = 46/283 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ +V VG P F + LDTGSDL W+ C C C + Y P S++
Sbjct: 155 YFMDVLVGSPPKHFSLILDTGSDLNWIQCLPCHDCFQQNGA---------FYDPKASASY 205
Query: 164 SKVPCNSTLCEL------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVED-VLHLATDEKQ 216
+ CN C L K C S +CPY Y + F VE ++L T
Sbjct: 206 KNITCNDPRCNLVSPPDPPKPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTSGGS 265
Query: 217 SKSVD-SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-- 273
S+ + + FGCG G F A L GLG S S L Q L +SFS C
Sbjct: 266 SELYNVENMMFGCGHWNRGLFHGAAG---LLGLGRGPLSFSSQL--QSLYGHSFSYCLVD 320
Query: 274 ---GSDGTGRISFGDKGS----PGQGETPFSLRQTHPT---YNITITQVSVGGNAVNFEF 323
++ + ++ FG+ P T F R+ + Y + I + V G +N
Sbjct: 321 RNSDTNVSSKLIFGEDKDLLSHPNLNFTSFVARKENLVDTFYYVQIKSIIVAGEVLNIPE 380
Query: 324 S-----------AIFDSGTSFTYLNDPAYTQISETFNSLAKEK 355
I DSGT+ +Y +PAY I AK K
Sbjct: 381 ETWNISSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGK 423
>gi|293333012|ref|NP_001168410.1| uncharacterized protein LOC100382179 precursor [Zea mays]
gi|223948083|gb|ACN28125.1| unknown [Zea mays]
gi|413953783|gb|AFW86432.1| hypothetical protein ZEAMMB73_737575 [Zea mays]
Length = 466
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 80/276 (28%), Positives = 116/276 (42%), Gaps = 46/276 (16%)
Query: 100 SLGFLHYT-NVSVGQPALSFIVALDTGSDLFWL---PCDCVSCVHGLNSSSGQVIDFNIY 155
SLG Y VS+G PA++ ++++DTGSD+ W+ PC SC + ++
Sbjct: 124 SLGTPEYVITVSLGTPAVTQVMSIDTGSDVSWVQCAPCAAQSCSSQKD---------KLF 174
Query: 156 SPNTSSTSSKVPCNSTLCELQKQCPSAG-----SNCPYQVRYLSDGTMSTGFLVEDVLHL 210
P S+T S C+S C Q G S+C Y V+Y+ D + +TG D L L
Sbjct: 175 DPAKSATYSAFSCSSAQC---AQLGGEGNGCLNSHCQYIVKYV-DHSNTTGTYGSDTLGL 230
Query: 211 ATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFS 270
T + FGC G F+ +GL GLG D S+ S A +FS
Sbjct: 231 TTSDAVKN-----FQFGCSHRANG-FV--GQLDGLMGLGGDTESLVSQTA--ATYGKAFS 280
Query: 271 MCFGSDGTGRISFGDKGSPGQG-------ETPFSLRQTHPT-YNITITQVSVGGNAVN-- 320
C + F G+ G TP +R PT Y + + ++V G +N
Sbjct: 281 YCLPPSSSSAGGFLTLGAAAGGTSSSRYSRTPL-VRFNVPTFYGVFLQAITVAGTKLNVP 339
Query: 321 ---FEFSAIFDSGTSFTYLNDPAYTQISETFNSLAK 353
F +++ DSGT T L AY + F K
Sbjct: 340 ASVFSGASVVDSGTVITQLPPTAYQALRTAFKKEMK 375
>gi|125524353|gb|EAY72467.1| hypothetical protein OsI_00323 [Oryza sativa Indica Group]
Length = 500
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 74/284 (26%), Positives = 116/284 (40%), Gaps = 41/284 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+++ V VG PA + LDTGSD+ W+ C C C + ++ P+ S++
Sbjct: 163 YFSRVGVGSPARQLYMVLDTGSDVTWVQCQPCADCYQQSDP---------VFDPSLSTSY 213
Query: 164 SKVPCNSTLCE--LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
+ V C++ C C ++ C Y+V Y DG+ + G + L L S
Sbjct: 214 ASVACDNPRCHDLDAAACRNSTGACLYEVAY-GDGSYTVGDFATETLTLGDSAPVSS--- 269
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGT 278
++ GCG G F+ A L G + S PS ++ +FS C S +
Sbjct: 270 --VAIGCGHDNEGLFVGAAGLLALGGGPL---SFPSQISA-----TTFSYCLVDRDSPSS 319
Query: 279 GRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA-----------IF 327
+ FGD +T Y + ++ +SVGG ++ SA I
Sbjct: 320 STLQFGDAADAEVTAPLIRSPRTSTFYYVGLSGISVGGQILSIPPSAFAMDGTGAGGVIV 379
Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVL 371
DSGT+ T L AY + + F + TS L F+ CY L
Sbjct: 380 DSGTAVTRLQSSAYAALRDAFVRGTQSLPRTSGVSL-FDTCYDL 422
>gi|147858841|emb|CAN78694.1| hypothetical protein VITISV_037475 [Vitis vinifera]
Length = 442
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 78/276 (28%), Positives = 117/276 (42%), Gaps = 41/276 (14%)
Query: 36 HRYSDPVKGILAVDDLPKKGSFAYYSALAH---RDRYFRLRGRGLAAQGNDKTPLTFSAG 92
H S P K + K S A +AL R Y R R + A Q D P
Sbjct: 38 HSPSSPYKNV-------KAESLAKDTALESTLSRHAYLRARQQK-ALQPADFVPPPLIRD 89
Query: 93 NDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVID 151
+ N+S+G P + V LDTGSDLFW+ C+ C C +
Sbjct: 90 KSAF---------LANLSIGNPPTNVYVVLDTGSDLFWIQCEPCDVCYKQKDP------- 133
Query: 152 FNIYSPNTSSTSSKVPCNSTLCE---LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVL 208
IY+ S + +++ CN C + QC +GS C YQ Y +DG ++G L + +
Sbjct: 134 --IYNRTKSDSYTEMLCNEPPCVSLGREGQCSDSGS-CLYQTAY-ADGARTSGLLSYEKV 189
Query: 209 HLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNS 268
T + +++ FGCG +Q +F+ G+ GLG S+ S L+ G + S
Sbjct: 190 AF-TSHYSDEDKTAQVGFGCG-LQNLNFITSNRDGGVLGLGPGLVSLVSQLSAIGKVSKS 247
Query: 269 FSMCFGS----DGTGRISFGDKGSPGQGETPFSLRQ 300
F+ CFG+ + G + FGD TP + +
Sbjct: 248 FAYCFGNISNPNAGGFLVFGDATYLNGDMTPMVIAE 283
>gi|242089069|ref|XP_002440367.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
gi|241945652|gb|EES18797.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
Length = 462
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 77/275 (28%), Positives = 106/275 (38%), Gaps = 38/275 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ +V VG P ++ LDTGSD+ WL C C C + SG+V D +
Sbjct: 142 YFASVGVGTPPTPALLVLDTGSDVVWLQCAPCRQCY----AQSGRVFDPRRSRSYAAVRC 197
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
PC C C YQV Y DG+++ G L + L A + R
Sbjct: 198 GAPPCRGLDAGGGGGCDRRRGTCLYQVAY-GDGSVTAGDLATETLWFARGARVP-----R 251
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-GSDGTGRIS 282
++ GCG G F+ A GL + S+P+ A + FS CF GSD R
Sbjct: 252 VAVGCGHDNEGLFVAAAGLLGLG---RGRLSLPTQTARR--YGRRFSYCFQGSDLDHRTI 306
Query: 283 FGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS-----AIFDSGTSFTYLN 337
+R H + VG ++ + S I DSGTS T L
Sbjct: 307 ---------------IRTVHQHVGGARVR-GVGERSLRLDPSTGRGGVILDSGTSVTRLA 350
Query: 338 DPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLR 372
P Y + E F + A R F+ CY LR
Sbjct: 351 RPVYVAVREAFRAAAGGLRLAPGGFSLFDTCYDLR 385
>gi|223950123|gb|ACN29145.1| unknown [Zea mays]
gi|413923785|gb|AFW63717.1| hypothetical protein ZEAMMB73_445506 [Zea mays]
Length = 385
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 85/272 (31%), Positives = 121/272 (44%), Gaps = 38/272 (13%)
Query: 98 LNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSP 157
LN+L +L V +G PA S + +DTGSD+ W+ C S H ++ P
Sbjct: 47 LNTLEYL--ITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADP--------LFDP 96
Query: 158 NTSSTSSKVPCNSTLCELQKQ----CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
++SST S C S C Q C S+ S C Y V Y DG+ +TG D L L +
Sbjct: 97 SSSSTYSPFSCGSADCAQLGQEGNGC-SSSSQCQYIVTY-GDGSSTTGTYSSDTLALGSS 154
Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
+S FGC V++G F D +GL GLG S+ S A G + +FS C
Sbjct: 155 AVRS------FQFGCSNVESG-FND--QTDGLMGLGGGAQSLVSQTA--GTLGRAFSYCL 203
Query: 274 --GSDGTGRISFGDKGSPGQG---ETPFSLRQTHPT-YNITITQVSVGGNAVN-----FE 322
+G ++ G G G +TP PT Y + + + VGG ++ F
Sbjct: 204 PPTPSSSGFLTLGAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFS 263
Query: 323 FSAIFDSGTSFTYLNDPAYTQISETFNSLAKE 354
+ DSGT T L AY+ +S F + K+
Sbjct: 264 AGTVMDSGTVITRLPPTAYSALSSAFKAGMKQ 295
>gi|242081367|ref|XP_002445452.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
gi|241941802|gb|EES14947.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
Length = 459
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 80/312 (25%), Positives = 122/312 (39%), Gaps = 57/312 (18%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
H V VG P V LD GSDL W C V ++ Q+ ++ SS+ S
Sbjct: 107 HSLTVGVGTPPQPSKVILDLGSDLLWTQCSLV------GPTAKQLEP--VFDAARSSSFS 158
Query: 165 KVPCNSTLCEL----QKQCPSAGSNCPYQVRYLSDGTM-STGFLVEDVLHLATDEKQSKS 219
+PC+S LCE K C C Y+ Y G M +TG L +
Sbjct: 159 VLPCDSKLCEAGTFTNKTC--TDRKCAYENDY---GIMTATGVLATETFTFGAHH----G 209
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSD 276
V + ++FGCG++ G+ A +G+ GL S+ LA FS C F
Sbjct: 210 VSANLTFGCGKLANGTI---AEASGILGLSPGPLSMLKQLAI-----TKFSYCLTPFADR 261
Query: 277 GTGRISFGDKGSPGQGETPFSLRQTHPT---------YNITITQVSVGGNAVNFEFS--- 324
T + FG G+ +T + QT P Y + + +SVG ++
Sbjct: 262 KTSPVMFGAMADLGKYKTTGKV-QTIPLLKNPVEDIYYYVPMVGMSVGSKRLDVPQETLA 320
Query: 325 --------AIFDSGTSFTYLNDPAYTQISE-TFNSLAKEKRETSTSDLPFEYCYVLRSFL 375
+ DS T+ YL +PA+T++ + + S D P C+ L +
Sbjct: 321 IKPDGTGGTVLDSATTLAYLVEPAFTELKKAVMEGIKLPVANRSVDDYPV--CFELPRGM 378
Query: 376 HLQALVVLPFPL 387
++ + V P L
Sbjct: 379 SMEGVQVPPLVL 390
>gi|125561849|gb|EAZ07297.1| hypothetical protein OsI_29545 [Oryza sativa Indica Group]
Length = 451
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 75/311 (24%), Positives = 123/311 (39%), Gaps = 52/311 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFN---IYSPNTSS 161
H V +G P + +DTGSDL W C L+SS+ +Y P SS
Sbjct: 91 HSLTVGIGTPPQPRKLIVDTGSDLIWTQCK-------LSSSTAVAARHGSPPVYDPGESS 143
Query: 162 TSSKVPCNSTLCELQ----KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
T + +PC+ LC+ K C S + C Y+ Y S + G L +
Sbjct: 144 TFAFLPCSDRLCQEGQFSFKNCTSK-NRCVYEDVYGS--AAAVGVLASETFTFGA----R 196
Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FG 274
++V R+ FGCG + GS + G+ GL + S+ + L Q FS C F
Sbjct: 197 RAVSLRLGFGCGALSAGSLIGA---TGILGLSPESLSLITQLKIQ-----RFSYCLTPFA 248
Query: 275 SDGTGRISFGDKGSPGQGETPFSLRQT----HPT----YNITITQVSVGGNAVNFEFSA- 325
T + FG + +T ++ T +P Y + + +S+G + ++
Sbjct: 249 DKKTSPLLFGAMADLSRHKTTRPIQTTAIVSNPVKTVYYYVPLVGISLGHKRLAVPAASL 308
Query: 326 ----------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRSFL 375
I DSG++ YL + A+ + E + + T + +E C+VL
Sbjct: 309 AMRPDGGGGTIVDSGSTVAYLVEAAFEAVKEAVMDVVRLPVANRTVE-DYELCFVLPRRT 367
Query: 376 HLQALVVLPFP 386
A+ + P
Sbjct: 368 AAAAMEAVQVP 378
>gi|88174587|gb|ABD39368.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
Group]
Length = 321
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 74/291 (25%), Positives = 120/291 (41%), Gaps = 44/291 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+ +V +G P+ + IV +DTGS W+ C+C C H + + + S+T +
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSASWVFCECDGC-H---------TNPRTFLQSRSTTCA 50
Query: 165 KVPCNSTLCELQKQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
KV C +++C L P +CP++V Y DG+ S G L +D L + +K
Sbjct: 51 KVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSY-QDGSASYGILYQDTLTFSDVQKIPS- 108
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC------- 272
+FGC G+ G +GL G+G SV L + FS C
Sbjct: 109 ----FTFGCNLDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPTFDGFSYCLPLQKSE 160
Query: 273 --FGSDGTGRISFGDKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFS---- 324
F S TG S G + + R+ + + + +SV G + S
Sbjct: 161 RGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSR 220
Query: 325 --AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRS 373
+FDSG+ +Y+ D A + +S+ L R + + CY +RS
Sbjct: 221 KGVVFDSGSELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRS 269
>gi|312282359|dbj|BAJ34045.1| unnamed protein product [Thellungiella halophila]
Length = 484
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 86/308 (27%), Positives = 123/308 (39%), Gaps = 53/308 (17%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ + VG PA + + LDTGSD+ WL C C C + + +++P S T
Sbjct: 136 YFMRLGVGTPATNMYMVLDTGSDVVWLQCSPCKVCYNQSDP---------VFNPAKSKTF 186
Query: 164 SKVPCNSTLCEL---QKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
+ VPC S LC +C S S C YQV Y DG+ + G + L
Sbjct: 187 ATVPCGSRLCRRLDDSSECVSRRSKACLYQVSY-GDGSFTVGDFSTETLTF-----HGAR 240
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF------ 273
VD ++ GCG G F+ A GL S PS N+ FS C
Sbjct: 241 VD-HVALGCGHDNEGLFVGAAGLLGLG---RGGLSFPSQTKNR--YNGKFSYCLVDRTSS 294
Query: 274 --GSDGTGRISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGGNAV------NF 321
S I FG+ P F+ T+P Y + + +SVGG+ V F
Sbjct: 295 GSSSKPPSTIVFGNGAVPKTAV--FTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSESQF 352
Query: 322 EFSA------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRSFL 375
+ A I DSGTS T L AY + + F A + + L F+ C+ L
Sbjct: 353 KLDATGNGGVIIDSGTSVTRLTQSAYVALRDAFRLGATRLKRAPSYSL-FDTCFDLSGMT 411
Query: 376 HLQALVVL 383
++ V+
Sbjct: 412 TVKVPTVV 419
>gi|50508275|dbj|BAD32124.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
Length = 451
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 80/291 (27%), Positives = 116/291 (39%), Gaps = 54/291 (18%)
Query: 108 NVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
N+S+G P ++F V DTGS L W C C C + P +SST SK+
Sbjct: 93 NLSIGTPPVTFSVLADTGSSLIWTQCAPCTECAARPAPP---------FQPASSSTFSKL 143
Query: 167 PCNSTLCELQKQCPSAGSNCPYQVRYLSDGT-MSTGFLVEDVLHLATDEKQSKSVDSRIS 225
PC S+LC+ P N V Y G + G+L + LH+ ++
Sbjct: 144 PCASSLCQFLTS-PYLTCNATGCVYYYPYGMGFTAGYLATETLHVGGASFPG------VA 196
Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD---GTGRIS 282
FGC + G G + +G+ GLG S+++ G+ FS C SD G I
Sbjct: 197 FGC-STENGV---GNSSSGIVGLGRSPL---SLVSQVGV--GRFSYCLRSDADAGDSPIL 247
Query: 283 FGDKGSPGQGE---TPFSLRQTHPT---YNITITQVSVGG-----NAVNFEFS------- 324
FG G TP P+ Y + +T ++VG + F F+
Sbjct: 248 FGSLAKVTGGNVQSTPLLENPEMPSSSYYYVNLTGITVGATDLPVTSTTFGFTRGAGAGL 307
Query: 325 ---AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETST---SDLPFEYCY 369
I DSGT+ TYL Y + F S T+T + F+ C+
Sbjct: 308 VGGTIVDSGTTLTYLVKEGYAMVKRAFLSQMATANLTTTVNGTRFGFDLCF 358
>gi|356546372|ref|XP_003541600.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 433
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 72/266 (27%), Positives = 115/266 (43%), Gaps = 31/266 (11%)
Query: 94 DTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDF 152
+T +++LG + + SVG P+L LDTGSD+ WL C C C
Sbjct: 79 ETTVISALG-EYLISYSVGTPSLQVFGILDTGSDIIWLQCQPCKKCYEQTTP-------- 129
Query: 153 NIYSPNTSSTSSKVPCNSTLCE-LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLA 211
I+ + S T +PC S C+ +Q S+ +C Y + Y+ DG+ S G L + L L
Sbjct: 130 -IFDSSKSQTYKTLPCPSNTCQSVQGTFCSSRKHCLYSIHYV-DGSQSLGDLSVETLTLG 187
Query: 212 TDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSM 271
+ + GCGR + + G+ GLG S+ + L+ FS
Sbjct: 188 STNGSPVQFPGTV-IGCGRYNAIGIEEKNS--GIVGLGRGPMSLITQLSPS--TGGKFSY 242
Query: 272 CFG---SDGTGRISFGDKG---SPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF---- 321
C S + +++FG+ G TP + Y +T+ SVG N + F
Sbjct: 243 CLVPGLSTASSKLNFGNAAVVSGRGTVSTPLFSKNGLVFYFLTLEAFSVGRNRIEFGSPG 302
Query: 322 ---EFSAIFDSGTSFTYLNDPAYTQI 344
+ + I DSGT+ T L + Y+++
Sbjct: 303 SGGKGNIIIDSGTTLTALPNGVYSKL 328
>gi|196212948|gb|ACG76110.1| S5 [Oryza sativa Japonica Group]
gi|340810887|gb|AEK75370.1| S5 [Oryza sativa]
gi|340810903|gb|AEK75378.1| S5 [Oryza sativa]
gi|340810921|gb|AEK75387.1| S5 [Oryza sativa]
gi|340810955|gb|AEK75404.1| S5 [Oryza sativa]
gi|340811079|gb|AEK75466.1| S5 [Oryza nivara]
gi|340811090|gb|AEK75471.1| S5 [Oryza rufipogon]
gi|340811116|gb|AEK75484.1| S5 [Oryza nivara]
Length = 357
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 72/257 (28%), Positives = 106/257 (41%), Gaps = 34/257 (13%)
Query: 109 VSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
VS+G+P + +VA+DTGS L W+ C C H ++ +G + D P S TS +V
Sbjct: 3 VSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFD-----PGRSYTSRRVR 57
Query: 168 CNSTLC-EL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
C+S C EL Q C +C Y V Y + S G +V D L +
Sbjct: 58 CSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDSFMD--- 114
Query: 220 VDSRISFGCGR-VQTGSFLDGAAPNGLFGLGMDK--TSVPSILANQGLIPNSFSMCFGSD 276
+ FGC V+ F G G + P IL+ + +FS C +D
Sbjct: 115 ----LMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYK-----AFSYCLPTD 165
Query: 277 GT--GRISFG--DKGSPGQGETPFSLRQTHPTYNITITQ-VSVGGNAVNFEFSAIFDSGT 331
T G + G D+ + G TP PTY++T+ ++ G V I DSG
Sbjct: 166 ETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRLVTSSSEMIVDSGA 225
Query: 332 SFTYLNDPAYTQISETF 348
T L + + +T
Sbjct: 226 QRTSLWPSTFALLDKTI 242
>gi|414887401|tpg|DAA63415.1| TPA: hypothetical protein ZEAMMB73_414910 [Zea mays]
Length = 242
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 49/175 (28%), Positives = 80/175 (45%), Gaps = 17/175 (9%)
Query: 199 STGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSI 258
S+G L ED++ ++S+ R FGC +TG A +G+ GLG + S+
Sbjct: 4 SSGVLGEDIVSFG---RESELKAQRAVFGCENSETGDLFSQHA-DGIMGLGRGQLSIMDQ 59
Query: 259 LANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFS----LRQTHPTYNITITQVSV 314
L +G+I +SFS+C+G G + G P + FS LR P YNI + ++ V
Sbjct: 60 LVEKGVINDSFSLCYGGMDIGGGAMVLGGVPTPSDMVFSRSDPLRS--PYYNIELKEIHV 117
Query: 315 GGNAVNFE-------FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSD 362
G A+ + + DSGT++ YL + A+ + S ++ D
Sbjct: 118 AGKALRVDSRIFDSKHGTVLDSGTTYAYLPEQAFMAFKDAVTSKVHSLKKIRGPD 172
>gi|356569424|ref|XP_003552901.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 536
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 86/309 (27%), Positives = 129/309 (41%), Gaps = 53/309 (17%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ ++ VG P + LDTGSDL W+ CD C C Y+PN SS+
Sbjct: 170 YFIDMFVGTPPKHVWLILDTGSDLSWIQCDPCYDCFEQNGPH---------YNPNESSSY 220
Query: 164 SKVPCNSTLCELQ------KQCPSAGSNCPYQVRYLSDGTMSTG-FLVE----DVLHLAT 212
+ C C+L + C + CPY Y +DG+ +TG F +E ++
Sbjct: 221 RNISCYDPRCQLVSSPDPLQHCKTENQTCPYFYDY-ADGSNTTGDFALETFTVNLTWPNG 279
Query: 213 DEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC 272
EK VD + FGCG G F GL GLG S PS L Q + +SFS C
Sbjct: 280 KEKFKHVVD--VMFGCGHWNKGFF---HGAGGLLGLGRGPLSFPSQL--QSIYGHSFSYC 332
Query: 273 F-----GSDGTGRISFG-DKGSPGQGETPFS-LRQTHPT-----YNITITQVSVGGNAVN 320
+ + ++ FG DK F+ L T Y + I + VGG ++
Sbjct: 333 LTDLFSNTSVSSKLIFGEDKELLNHHNLNFTKLLAGEETPDDTFYYLQIKSIVVGGEVLD 392
Query: 321 -----FEFSA------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
+ +S+ I DSG++ T+ D AY I E F K ++ + D CY
Sbjct: 393 IPEKTWHWSSEGVGGTIIDSGSTLTFFPDSAYDVIKEAFEKKIK-LQQIAADDFIMSPCY 451
Query: 370 VLRSFLHLQ 378
+ + ++
Sbjct: 452 NVSGAMQVE 460
>gi|224111722|ref|XP_002315953.1| predicted protein [Populus trichocarpa]
gi|222864993|gb|EEF02124.1| predicted protein [Populus trichocarpa]
Length = 484
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 81/286 (28%), Positives = 119/286 (41%), Gaps = 43/286 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+++ V +G+P + LDTGSD+ W+ C C C + I+ P +S++
Sbjct: 149 YFSRVGIGKPPSQAYLILDTGSDVNWVQCAPCADCYQQADP---------IFEPASSASF 199
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
S + CN+ C C Y+V Y DG+ + G V + + L S VD+
Sbjct: 200 STLSCNTRQCRSLDVSECRNDTCLYEVSY-GDGSYTVGDFVTETITLG-----SAPVDN- 252
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGTGR 280
++ GCG G F+ A L GLG S PS + SFS C S+
Sbjct: 253 VAIGCGHNNEGLFVGAAG---LLGLGGGSLSFPSQIN-----ATSFSYCLVDRDSESAST 304
Query: 281 ISFGDKGSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEFSA-----------IF 327
+ F P P LR H Y + +T +SVGG V+ SA I
Sbjct: 305 LEFNSTLPPNAVSAPL-LRNHHLDTFYYVGLTGLSVGGELVSIPESAFQIDESGNGGVIV 363
Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRS 373
DSGT+ T L Y + + F ++ T+ L F+ CY L S
Sbjct: 364 DSGTAITRLQTDVYNSLRDAFVKRTRDLPSTNGIAL-FDTCYDLSS 408
>gi|297845610|ref|XP_002890686.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297336528|gb|EFH66945.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 486
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 83/300 (27%), Positives = 124/300 (41%), Gaps = 44/300 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++T V +G PA + LDTGSD+ WL C C C H I+ P++SS+
Sbjct: 151 YFTRVGIGNPAREVYMVLDTGSDVNWLQCTPCADCYHQTEP---------IFEPSSSSSY 201
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
+ C++ C + + C Y+V Y DG+ + G + L + + Q+
Sbjct: 202 EPLSCDTPQCNALEVSECRNATCLYEVSY-GDGSYTVGDFATETLTIGSTLVQN------ 254
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGTGR 280
++ GCG G F+ A L GLG ++PS L SFS C SD
Sbjct: 255 VAVGCGHSNEGLFVGAAG---LLGLGGGLLALPSQLNT-----TSFSYCLVDRDSDSAST 306
Query: 281 ISFGDKGSPGQGETPFSLR--QTHPTYNITITQVSVGGNAV-----NFEFSA------IF 327
+ FG P P LR Q Y + +T +SVGG + +FE I
Sbjct: 307 VEFGTSLPPDAVVAPL-LRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGIII 365
Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRSFLHLQA-LVVLPFP 386
DSGT+ T L Y + ++F E + F+ CY L + ++ V FP
Sbjct: 366 DSGTAVTRLQTGIYNSLRDSFLK-GTSDLEKAAGVAMFDTCYNLSAKTTIEVPTVAFHFP 424
>gi|356557010|ref|XP_003546811.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 437
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 84/280 (30%), Positives = 120/280 (42%), Gaps = 35/280 (12%)
Query: 111 VGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCN 169
+G P + + DTGSDL W+ C C +C D ++ P SST C+
Sbjct: 98 IGTPPVERLAIADTGSDLIWVQCSPCQNCFPQ---------DTPLFEPLKSSTFKAATCD 148
Query: 170 STLC----ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHL-ATDEKQSKSVDSRI 224
S C Q+QC G C Y Y D + + G + + L +T + Q+ S S I
Sbjct: 149 SQPCTSVPPSQRQCGKVG-QCIYSYSY-GDKSFTVGVVGTETLSFGSTGDAQTVSFPSSI 206
Query: 225 SFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDGTGRI 281
FGCG +F GL GLG S+ S L Q I FS C F S+ T ++
Sbjct: 207 -FGCGVYNNFTFHTSDKVTGLVGLGGGPLSLVSQLGPQ--IGYKFSYCLLPFSSNSTSKL 263
Query: 282 SFGDKG---SPGQGETPFSLRQTHPT-YNITITQVSVGGNAV---NFEFSAIFDSGTSFT 334
FG + + G TP ++ P+ Y + + V++G V + + I DSGT T
Sbjct: 264 KFGSEAIVTTNGVVSTPLIIKPLFPSFYFLNLEAVTIGQKVVPTGRTDGNIIIDSGTVLT 323
Query: 335 YLNDPAYTQISETFNSLAKEKRETSTSDL--PFEYCYVLR 372
YL Y SL + S DL PF++C+ R
Sbjct: 324 YLEQTFYNNFVA---SLQEVLSVESAQDLPFPFKFCFPYR 360
>gi|449457263|ref|XP_004146368.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 469
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 84/304 (27%), Positives = 127/304 (41%), Gaps = 68/304 (22%)
Query: 102 GFLHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTS 160
GFL N+S+G P ++ +V +DTGS L W+ C C++C S + P S
Sbjct: 103 GFL--VNLSIGSPPVTQLVVVDTGSSLLWVQCLPCINCFQQSTS---------WFDPLKS 151
Query: 161 STSSKVPC--------NSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLAT 212
+ + C N C Q Y++RYL G S G L ++ L T
Sbjct: 152 VSFKTLGCGFPGYNYINGYKCNRFNQ-------AEYKLRYLG-GDSSQGILAKESLLFET 203
Query: 213 -DEKQ-----------SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSI-L 259
DE + SK S I+FGCG + + D A NG+FGLG + P I +
Sbjct: 204 LDEGRVFQYNAISTQISKIKKSNITFGCGHMNIKTNNDDAY-NGVFGLG----AYPHITM 258
Query: 260 ANQGLIPNSFSMCFGSDGT-----GRISFGDKGSPGQGE-TPFSLRQTHPTYNITITQVS 313
A Q + N FS C G + G +GS +G+ TP + H Y +T+ +S
Sbjct: 259 ATQ--LGNKFSYCIGDINNPLYTHNHLVLG-QGSYIEGDSTPLQIHFGH--YYVTLQSIS 313
Query: 314 VGGNAVNFEFSA-----------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSD 362
VG + + +A + DSG ++T L + + + + L K E +
Sbjct: 314 VGSKTLKIDPNAFKISSDGSGGVLIDSGMTYTKLANGGFELLYDEIVDLMKGLLERIPTQ 373
Query: 363 LPFE 366
FE
Sbjct: 374 RKFE 377
>gi|226509408|ref|NP_001141440.1| uncharacterized protein LOC100273550 precursor [Zea mays]
gi|194704586|gb|ACF86377.1| unknown [Zea mays]
gi|413938617|gb|AFW73168.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 478
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 91/323 (28%), Positives = 134/323 (41%), Gaps = 35/323 (10%)
Query: 65 HRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYT-NVSVGQPALSFIVALD 123
R Y R G A Q D A +G L+Y S+G P ++ + +D
Sbjct: 99 RRAEYILRRVSGRAPQLWDSKAAAAVATVPASWGYDIGTLNYVVTASLGTPGVAQTMEVD 158
Query: 124 TGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE---LQKQCP 180
TGSDL W+ C + S + D P SS+ + VPC +C +
Sbjct: 159 TGSDLSWVQCKPCAAAPSCYSQKDPLFD-----PAQSSSYAAVPCGGPVCAGLGIYAASA 213
Query: 181 SAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGA 240
+ + C Y V Y DG+ +TG D L L+ + S FGCG Q+G F +G
Sbjct: 214 CSAAQCGYVVSY-GDGSNTTGVYSSDTLTLS-----ASSAVQGFFFGCGHAQSGLF-NGV 266
Query: 241 APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT--GRISFGDKGSPGQGETPFSL 298
+GL GLG ++ S+ + G FS C + + G ++ G G P FS
Sbjct: 267 --DGLLGLGREQPSL--VEQTAGTYGGVFSYCLPTKPSTAGYLTLG-VGGPSGAAPGFST 321
Query: 299 RQTHPT------YNITITQVSVGGNAVNFEFSAI-----FDSGTSFTYLNDPAYTQISET 347
Q P+ Y + +T +SVGG ++ SA D+GT T L AY +
Sbjct: 322 TQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTVVDTGTVVTRLPPTAYAALRSA 381
Query: 348 FNS-LAKEKRETSTSDLPFEYCY 369
F S +A T+ S+ + CY
Sbjct: 382 FRSGMASYGYPTAPSNGILDTCY 404
>gi|388498308|gb|AFK37220.1| unknown [Lotus japonicus]
Length = 363
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 65/213 (30%), Positives = 95/213 (44%), Gaps = 38/213 (17%)
Query: 120 VALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE-LQ- 176
V +DTGSDL W+ C+ C+SC + ++ P+TSS+ +PCNS+ C+ LQ
Sbjct: 158 VIIDTGSDLTWVQCEPCMSCYNQQGP---------VFKPSTSSSYQSIPCNSSTCQSLQL 208
Query: 177 -----KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRV 231
C S SNC Y V Y DG+ + G L + L SV S FGCG+
Sbjct: 209 TTGNAGACESNPSNCSYAVNY-GDGSYTNGELGAEHLSFG-----GISV-SNFVFGCGKN 261
Query: 232 QTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGTGRISFGDKGS 288
G F +GL GLG S+ I FS C + +G ++ G++ S
Sbjct: 262 NKGLF---GGVSGLMGLGRSNLSL--ISQTNSTFGGVFSYCLPPTDAGASGSLAMGNESS 316
Query: 289 PGQGETPFSLRQTHPT------YNITITQVSVG 315
+ TP + + P Y + +T + VG
Sbjct: 317 VFKNLTPIAYTRMVPNPQLSNFYMLNLTGIDVG 349
>gi|212275143|ref|NP_001130306.1| uncharacterized protein LOC100191400 precursor [Zea mays]
gi|194688798|gb|ACF78483.1| unknown [Zea mays]
gi|194703430|gb|ACF85799.1| unknown [Zea mays]
gi|194707192|gb|ACF87680.1| unknown [Zea mays]
gi|223944599|gb|ACN26383.1| unknown [Zea mays]
gi|223948667|gb|ACN28417.1| unknown [Zea mays]
gi|414887962|tpg|DAA63976.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 450
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 80/299 (26%), Positives = 126/299 (42%), Gaps = 42/299 (14%)
Query: 64 AHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALD 123
A RD L LA +G + ++G + + + S+G P ++A+D
Sbjct: 75 ASRDASRLLYLDSLAVRGRARAYAPIASGRQLLQTPT----YVVRASLGTPPQQLLLAVD 130
Query: 124 TGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCEL--QKQCP 180
T +D W+PC C C +SS D P +S++ VPC S LC CP
Sbjct: 131 TSNDASWIPCAGCAGC----PTSSAAPFD-----PASSASYRTVPCGSPLCAQAPNAACP 181
Query: 181 SAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGA 240
G C + + Y +D ++ L +D L +A + ++ +FGC + TG+ A
Sbjct: 182 PGGKACGFSLTY-ADSSLQAA-LSQDSLAVAGNAVKA------YTFGCLQRATGT---AA 230
Query: 241 APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS----DGTGRISFGDKGSPGQGETPF 296
P GL GLG S + + + +FS C S + +G + G G P + +T
Sbjct: 231 PPQGLLGLGRGPLSF--LSQTKDMYEATFSYCLPSFKSLNFSGTLRLGRNGQPQRIKTTP 288
Query: 297 SLRQTHPT--YNITITQVSVGGNAVNFE-------FSAIFDSGTSFTYLNDPAYTQISE 346
L H + Y + +T + VG V + DSGT FT L PAY + +
Sbjct: 289 LLANPHRSSLYYVNMTGIRVGRKVVPIPAFDPATGAGTVLDSGTMFTRLVAPAYVAVRD 347
>gi|357118738|ref|XP_003561107.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 491
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 69/271 (25%), Positives = 107/271 (39%), Gaps = 40/271 (14%)
Query: 120 VALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE----L 175
+ +DT SD+ W+ C H + +Y P+ SS+S+ PC+S C
Sbjct: 158 MVIDTASDVPWVQCAPCPAPHCHAQTD------VLYDPSKSSSSAAFPCSSPACRNLGPY 211
Query: 176 QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGR--VQT 233
C AG C Y+V+Y DG+ S G + DVL L + + S S FGC +Q
Sbjct: 212 ANGCTPAGDQCQYRVQY-PDGSASAGTYISDVLTL--NPAKPASAISEFRFGCSHALLQP 268
Query: 234 GSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD---------GTGRISFG 284
GSF + +G+ LG S+P+ + + FS C G R++
Sbjct: 269 GSFSNKT--SGIMALGRGAQSLPT--QTKATYGDVFSYCLPPTPVHSGFFILGVPRVAAS 324
Query: 285 DKGSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVN-----FEFSAIFDSGTSFTYLND 338
TP + P Y + + + V G + F A+ DS T T L
Sbjct: 325 RYAV-----TPMLRSKAAPMLYLVRLIAIEVAGKRLPVPPAVFAAGAVMDSRTIVTRLPP 379
Query: 339 PAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
AY + F + + R + + + CY
Sbjct: 380 TAYMALRAAFVAEMRAYRAAAPKEH-LDTCY 409
>gi|147859621|emb|CAN83119.1| hypothetical protein VITISV_043393 [Vitis vinifera]
Length = 431
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 80/323 (24%), Positives = 121/323 (37%), Gaps = 68/323 (21%)
Query: 70 FRLRGRGLAA-QGNDKT-PLTFSAGND-----TYRLNSLGFLHYTNVSVGQPALSFIVAL 122
F + R LAA + +D + L AG D T R ++G L+Y + +G PA + V +
Sbjct: 57 FAGQKRSLAALKAHDNSRQLRILAGVDLPLGGTGRPEAVG-LYYAKIGIGTPARDYYVQM 115
Query: 123 DTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPS- 181
+ +Y S T V C+ C P
Sbjct: 116 E----------------------------LTLYDIKESLTGKLVSCDQDFCYAINGGPPS 147
Query: 182 ---AGSNCPYQVRYLSDGTMSTGFLVE---------DVLHLATDEKQSKSVDSRISFGCG 229
A +C Y Y +DG+ S G+ V+ + HL + + C
Sbjct: 148 YCIANMSCSYTEIY-ADGSSSFGYFVKGYCTASKYNSIPHLNNNPLL------EVPLRCS 200
Query: 230 RVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-GSDGTGRISFGDKGS 288
Q+G A +G+ G G TS+ S LA+ G + F+ C G +G G + G
Sbjct: 201 ATQSGDLSSEEALDGILGFGKSNTSMISQLASSGKVRKMFAHCLDGLNGGGIFAIGHIVQ 260
Query: 289 PGQGETPFSLRQTHPTYNITITQVSVGGNAVNF---------EFSAIFDSGTSFTYLNDP 339
P TP QTH YN+ + V VGG +N + I DSGT+ YL +
Sbjct: 261 PKVNTTPLVPNQTH--YNVNMKAVEVGGYFLNLPTDVFDVGDKKGTIIDSGTTLAYLPEV 318
Query: 340 AYTQISETFNSLAKEKRETSTSD 362
Y Q+ S + + + D
Sbjct: 319 VYDQLLSKIFSWQSDLKVHTIHD 341
>gi|88174593|gb|ABD39371.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 77/297 (25%), Positives = 123/297 (41%), Gaps = 56/297 (18%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+ +V +G PA + IV +DTGS W+ C+C C H + + + S+T +
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCECDGC-H---------TNPRTFLQSRSTTCA 50
Query: 165 KVPCNSTLCELQKQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
KV C +++C L P +CP++V Y DG+ S G L +D L + +K
Sbjct: 51 KVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSY-QDGSASYGILYQDTLTFSDVQKIPG- 108
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC------- 272
SFGC G+ G +GL G+G SV L + FS C
Sbjct: 109 ----FSFGCNMDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPTFDCFSYCLPLQKSE 160
Query: 273 --FGSDGTGRISFGDKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFS---- 324
F S TG S G + + ++ + + + +SV G + S
Sbjct: 161 RGFFSKTTGYFSLGKVATRTDVRYTKMVARKKNTELFFVDLIAISVDGERLGLSPSVFSR 220
Query: 325 --AIFDSGTSFTYLNDPAYTQISETFNSL------AKEKRETSTSDLPFEYCYVLRS 373
+FDSG+ +Y+ D A + +S+ L A+E+ E + CY +RS
Sbjct: 221 KGVVFDSGSELSYIPDRALSVLSQRIRELLLKRGAAEEESERN--------CYDMRS 269
>gi|356555807|ref|XP_003546221.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 457
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 81/273 (29%), Positives = 120/273 (43%), Gaps = 38/273 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+Y + VG PA F + +DTGS L WL C CV H V I++P+ S T
Sbjct: 107 YYVKIGVGTPAKYFSMIVDTGSSLSWLQCQPCVIYCH--------VQVDPIFTPSVSKTY 158
Query: 164 SKVPCNSTLCELQKQ-------CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
+ C+S+ C K C +A C Y+ Y D + S G+L +DVL L
Sbjct: 159 KALSCSSSQCSSLKSSTLNAPGCSNATGACVYKASY-GDTSFSIGYLSQDVLTLT----P 213
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQ------GLIPNSFS 270
S + S +GCG+ G F A G+ GL DK S+ L+N+ +P+SFS
Sbjct: 214 SAAPSSGFVYGCGQDNQGLFGRSA---GIIGLANDKLSMLGQLSNKYGNAFSYCLPSSFS 270
Query: 271 MCFGSDGTGRISFGDKGSPGQGE--TPFSLRQTHPT-YNITITQVSVGG-----NAVNFE 322
S +G +S G TP P+ Y + +T ++V G +A ++
Sbjct: 271 AQPNSSVSGFLSIGASSLSSSPYKFTPLVKNPKIPSLYFLGLTTITVAGKPLGVSASSYN 330
Query: 323 FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEK 355
I DSGT T L Y + ++F + +K
Sbjct: 331 VPTIIDSGTVITRLPVAIYNALKKSFVMIMSKK 363
>gi|357143680|ref|XP_003573011.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 510
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 80/272 (29%), Positives = 112/272 (41%), Gaps = 44/272 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ V VG P F + +DTGSDL WL C C+ C G V D P S++
Sbjct: 150 YLVEVYVGTPPRRFQMIMDTGSDLNWLQCAPCLDCF----DQRGPVFD-----PMASTSY 200
Query: 164 SKVPCNSTLCEL------QKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
V C T C L + C S+ S+ CPY Y D + +TG L + +
Sbjct: 201 RNVTCGDTRCGLVSPPAAPRTCRSSRSDPCPYYYWY-GDQSNTTGDLALEAFTVNLTASS 259
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD 276
S+ VD + GCG G F A L GLG S S L + + ++FS C
Sbjct: 260 SRRVDG-VVLGCGHRNRGLFHGAAG---LLGLGRGPLSFASQL--RAVYGHAFSYCLVDH 313
Query: 277 GTG---RISFGDKG----SPGQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEFS---- 324
G+ +I FGD P T F+ T Y + + + VGG ++ +
Sbjct: 314 GSAVGSKIVFGDDNVLLSHPQLNYTAFAPSAAENTFYYVQLKGILVGGEMLDIPSNTWGV 373
Query: 325 --------AIFDSGTSFTYLNDPAYTQISETF 348
I DSGT+ +Y +PAY I + F
Sbjct: 374 SKEDGSGGTIIDSGTTLSYFPEPAYKAIRQAF 405
>gi|125600538|gb|EAZ40114.1| hypothetical protein OsJ_24557 [Oryza sativa Japonica Group]
Length = 412
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 69/256 (26%), Positives = 102/256 (39%), Gaps = 38/256 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD--CVSCVHGLNSSSGQVIDFNIYSPNTSST 162
+ ++++G P L LDTGSDL W CD C C +Y+P S+T
Sbjct: 92 YLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQ---------PAPLYAPARSAT 142
Query: 163 SSKVPCNSTLCE-LQ---KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
+ V C S +C+ LQ +C + C Y Y DGT + G L + L +D
Sbjct: 143 YANVSCRSPMCQALQSPWSRCSPPDTGCAYYFSY-GDGTSTDGVLATETFTLGSDTAVRG 201
Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT 278
++FGCG GS + +GL G+G P L +Q + C
Sbjct: 202 -----VAFGCGTENLGSTDNS---SGLVGMGRG----PLSLVSQLGVTRPRRSCRARAAA 249
Query: 279 GRISFGDKGSPGQGETPF-SLRQTHPTYNITITQVSVGGNAVNFEFSAIFDSGTSFTYLN 337
SP +G T +L P +T + GG I DSGT+FT L
Sbjct: 250 RGGGAPTTTSPLEGITVGDTLLPIDPAV-FRLTPMGDGG--------VIIDSGTTFTALE 300
Query: 338 DPAYTQISETFNSLAK 353
+ A+ ++ S +
Sbjct: 301 ERAFVALARALASRVR 316
>gi|88174583|gb|ABD39366.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
Group]
Length = 321
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 74/291 (25%), Positives = 120/291 (41%), Gaps = 44/291 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+ +V +G P+ + IV +DTGS W+ C+C C H + + + S+T +
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSTSWVFCECDGC-H---------TNPRTFLQSRSTTCA 50
Query: 165 KVPCNSTLCELQKQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
KV C +++C L P +CP++V Y DG+ S G L +D L + +K
Sbjct: 51 KVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSY-QDGSASYGILYQDTLTFSDVQKIPS- 108
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC------- 272
+FGC G+ G +GL G+G SV L + FS C
Sbjct: 109 ----FTFGCNLDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPRFDGFSYCLPLQKSE 160
Query: 273 --FGSDGTGRISFGDKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFS---- 324
F S TG S G + + R+ + + + +SV G + S
Sbjct: 161 RGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSR 220
Query: 325 --AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRS 373
+FDSG+ +Y+ D A + +S+ L R + + CY +RS
Sbjct: 221 KGVVFDSGSELSYIPDRALSVLSQRIRELL--LRRGAAEEESERNCYDMRS 269
>gi|115434442|ref|NP_001041979.1| Os01g0140100 [Oryza sativa Japonica Group]
gi|12328547|dbj|BAB21205.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113531510|dbj|BAF03893.1| Os01g0140100 [Oryza sativa Japonica Group]
gi|125568961|gb|EAZ10476.1| hypothetical protein OsJ_00309 [Oryza sativa Japonica Group]
gi|215697206|dbj|BAG91200.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 504
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 74/284 (26%), Positives = 116/284 (40%), Gaps = 41/284 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+++ V VG PA + LDTGSD+ W+ C C C + ++ P+ S++
Sbjct: 167 YFSRVGVGSPARQLYMVLDTGSDVTWVQCQPCADCYQQSDP---------VFDPSLSTSY 217
Query: 164 SKVPCNSTLCE--LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
+ V C++ C C ++ C Y+V Y DG+ + G + L L S
Sbjct: 218 ASVACDNPRCHDLDAAACRNSTGACLYEVAY-GDGSYTVGDFATETLTLGDSAPVSS--- 273
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGT 278
++ GCG G F+ A L G + S PS ++ +FS C S +
Sbjct: 274 --VAIGCGHDNEGLFVGAAGLLALGGGPL---SFPSQISA-----TTFSYCLVDRDSPSS 323
Query: 279 GRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA-----------IF 327
+ FGD +T Y + ++ +SVGG ++ SA I
Sbjct: 324 STLQFGDAADAEVTAPLIRSPRTSTFYYVGLSGLSVGGQILSIPPSAFAMDSTGAGGVIV 383
Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVL 371
DSGT+ T L AY + + F + TS L F+ CY L
Sbjct: 384 DSGTAVTRLQSSAYAALRDAFVRGTQSLPRTSGVSL-FDTCYDL 426
>gi|357481191|ref|XP_003610881.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512216|gb|AES93839.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 427
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 77/305 (25%), Positives = 127/305 (41%), Gaps = 38/305 (12%)
Query: 97 RLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIY 155
R+ S + +++G P + +DTGSDL W C C C + ++
Sbjct: 74 RVTSNNGDYLMKLTLGSPPVDIYGLVDTGSDLVWAQCTPCGGCYRQKSP---------MF 124
Query: 156 SPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEK 215
P S T S +PC S C S C Y Y +D +++ G L + + ++ +
Sbjct: 125 EPLRSKTYSPIPCESEQCSFFGYSCSPQKMCAYSYSY-ADSSVTKGVLAREAITFSSTDG 183
Query: 216 QSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVP-SILANQGLIPNS--FSMC 272
V I FGCG +G+F + + P S+++ G + S FS C
Sbjct: 184 DPVVV-GDIIFGCGHSNSGTFNENDM------GIIGMGGGPLSLVSQIGTLYGSKRFSQC 236
Query: 273 ---FGSDG--TGRISFGDKGS-PGQG--ETPFSLRQTHPTYNITITQVSVGGNAVNFEFS 324
F +D +G I+FG++ G+G TP + + +Y +T+ +SVG V F S
Sbjct: 237 LVPFHTDAHTSGTINFGEESDVSGEGVVTTPLASEEGQTSYLVTLEGISVGDTFVRFNSS 296
Query: 325 A-------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRSFLHL 377
+ DSGT TY+ Y ++ E + DL + CY RS +L
Sbjct: 297 ETLSKGNIMIDSGTPATYIPQEFYERLVEELKVQSSLLPIEDDPDLGTQLCY--RSETNL 354
Query: 378 QALVV 382
+ ++
Sbjct: 355 EGPIL 359
>gi|238007638|gb|ACR34854.1| unknown [Zea mays]
gi|413948713|gb|AFW81362.1| pepsin A [Zea mays]
Length = 538
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 73/302 (24%), Positives = 120/302 (39%), Gaps = 43/302 (14%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPC-----------DCVSCVHGLN-SSSGQVID 151
++ +V G PAL + + LDT +DL W+ C +S G + +++ +
Sbjct: 126 MYLVSVRFGTPALPYNLVLDTANDLTWINCRLRRRKGKHYGRTMSVGAGDDGAAAKEARR 185
Query: 152 FNIYSPNTSSTSSKVPCNSTLCELQK----QCPSAGSNCPYQVRYLSDGTMSTGFLVEDV 207
N Y P SS+ ++ C+ C L Q PS +C Y + + DGT++ G ++
Sbjct: 186 KNWYRPAKSSSWRRIRCSQKECALLPYNTCQSPSKAESCSY-YQQMQDGTLTMGIYGKEK 244
Query: 208 LHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPN 267
+ + + + I GC ++ G +D A +G+ LG + S A +
Sbjct: 245 ATVTVSDGRMAKLPGLI-LGCSVLEAGGSVD--AHDGVLSLGNGEMSFAVHAAKR--FGQ 299
Query: 268 SFSMCFGS-----DGTGRISFGDKGS---PGQGETPFSLR-QTHPTYNITITQVSVGGNA 318
FS C S D + ++FG + PG ET P Y +T + VGG
Sbjct: 300 RFSFCLLSANSSRDASSYLTFGPNPAVMGPGTMETDIVYNVDVKPAYGPLVTGIFVGGER 359
Query: 319 VNFEFS-----------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEY 367
++ I D+ TS T L AY ++ + D FEY
Sbjct: 360 LDIPQEIWDAEKVVGGGVILDTSTSVTSLVPEAYAAVTSALDRHLSHLPRVYELD-GFEY 418
Query: 368 CY 369
CY
Sbjct: 419 CY 420
>gi|302774174|ref|XP_002970504.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
gi|300162020|gb|EFJ28634.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
Length = 483
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 56/171 (32%), Positives = 76/171 (44%), Gaps = 22/171 (12%)
Query: 65 HRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDT 124
R R+ + + LA + D+ T G T L ++ + VG PA S + +DT
Sbjct: 90 QRVRWIESKAQ-LAGKKKDEASSTDLNGPVTSGLLYGSGEYFVRLGVGTPARSLFMVVDT 148
Query: 125 GSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC---ELQKQCP 180
GSDL WL C C SC + I+ P SS+ ++PC S LC E+
Sbjct: 149 GSDLPWLQCQPCKSCYKQADP---------IFDPRNSSSFQRIPCLSPLCKALEIHSCSG 199
Query: 181 SAG--SNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCG 229
S G S C YQV Y DG+ S G D+ L T K ++FGCG
Sbjct: 200 SRGATSRCSYQVAY-GDGSFSVGDFSSDLFTLGTGSKAMS-----VAFGCG 244
>gi|226492334|ref|NP_001147965.1| pepsin A precursor [Zea mays]
gi|195614874|gb|ACG29267.1| pepsin A [Zea mays]
Length = 538
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 73/302 (24%), Positives = 120/302 (39%), Gaps = 43/302 (14%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPC-----------DCVSCVHGLN-SSSGQVID 151
++ +V G PAL + + LDT +DL W+ C +S G + +++ +
Sbjct: 126 MYLVSVRFGTPALPYNLVLDTANDLTWINCRLRRRKGKHYGRTMSVGAGDDGAAAKEARR 185
Query: 152 FNIYSPNTSSTSSKVPCNSTLCELQK----QCPSAGSNCPYQVRYLSDGTMSTGFLVEDV 207
N Y P SS+ ++ C+ C L Q PS +C Y + + DGT++ G ++
Sbjct: 186 KNWYRPAKSSSWRRIRCSQKECALLPYNTCQSPSKAESCSY-YQQMQDGTLTMGIYGKEK 244
Query: 208 LHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPN 267
+ + + + I GC ++ G +D A +G+ LG + S A +
Sbjct: 245 ATVTVSDGRMAKLPGLI-LGCSVLEAGGSVD--AHDGVLSLGNGEMSFAVHAAKR--FGQ 299
Query: 268 SFSMCFGS-----DGTGRISFGDKGS---PGQGETPFSLR-QTHPTYNITITQVSVGGNA 318
FS C S D + ++FG + PG ET P Y +T + VGG
Sbjct: 300 RFSFCLLSANSSRDASSYLTFGPNPAVMGPGTMETDIVYNVDVKPAYGPLVTGIFVGGER 359
Query: 319 VNFEFS-----------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEY 367
++ I D+ TS T L AY ++ + D FEY
Sbjct: 360 LDIPQEIWDAEKVVGGGVILDTSTSVTSLVPEAYAAVTSALDRHLSHLPRVYELD-GFEY 418
Query: 368 CY 369
CY
Sbjct: 419 CY 420
>gi|340810981|gb|AEK75417.1| S5 [Oryza rufipogon]
Length = 357
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 72/257 (28%), Positives = 105/257 (40%), Gaps = 34/257 (13%)
Query: 109 VSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
VS+G+P + +VA+DTGS L W+ C C H ++ +G + D P S TS +V
Sbjct: 3 VSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFD-----PGRSYTSRRVR 57
Query: 168 CNSTLC-EL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
C+S C EL Q C +C Y V Y + S G +V D L +
Sbjct: 58 CSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDSFMD--- 114
Query: 220 VDSRISFGCGR-VQTGSFLDGAAPNGLFGLGMDK--TSVPSILANQGLIPNSFSMCFGSD 276
+ FGC V+ F G G + P IL+ + L S C +D
Sbjct: 115 ----LMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKAL-----SYCLPTD 165
Query: 277 GT--GRISFG--DKGSPGQGETPFSLRQTHPTYNITITQ-VSVGGNAVNFEFSAIFDSGT 331
T G + G D+ + G TP PTY++T+ ++ G V I DSG
Sbjct: 166 ETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRLVTSSSEMIVDSGA 225
Query: 332 SFTYLNDPAYTQISETF 348
T L + + +T
Sbjct: 226 QRTSLWPSTFALLDKTI 242
>gi|357450863|ref|XP_003595708.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355484756|gb|AES65959.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 407
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 84/300 (28%), Positives = 127/300 (42%), Gaps = 48/300 (16%)
Query: 109 VSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
+S+G P + DTGSDL W C C C N ++ P +SS+ + +
Sbjct: 64 LSIGTPPIKIYAEADTGSDLVWFQCIPCTKCYKQQNP---------MFDPRSSSSYTNIT 114
Query: 168 CNSTLCEL--QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
C + C C + C Y Y +D +++ G L ++ L L + + + I
Sbjct: 115 CGTESCNKLDSSLCSTDQKTCNYTYSY-ADNSITQGVLAQETLTLTSTTGEPVAFQGII- 172
Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPS-ILANQGLIPNSFSMC---FGSDG--TG 279
FGCG +G F D GL GLG S+ S I ++ G N FS C F +D T
Sbjct: 173 FGCGHNNSG-FNDREM--GLIGLGRGPLSLISQIGSSLGAGGNMFSQCLVPFNTDPSITS 229
Query: 280 RISFGDKGSP----GQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS----------- 324
+++FG KGS G TP + + Y T+ +SV +N FS
Sbjct: 230 QMNFG-KGSEVLGNGTVSTPL-ISKDGTGYFATLLGISV--EDINLPFSNGSSLGTITKG 285
Query: 325 -AIFDSGTSFTYLNDPAYTQ-ISETFNSLAKEKRETSTSDLPFEYCYVLRSFLHLQALVV 382
+ DSGT+ TYL + Y + I + N +A E +E CY + L+ L +
Sbjct: 286 NILIDSGTTITYLPEEFYHRLIEQVRNKVALEPFRIDG----YELCYQTPTNLNGPTLTI 341
>gi|242074844|ref|XP_002447358.1| hypothetical protein SORBIDRAFT_06g033560 [Sorghum bicolor]
gi|241938541|gb|EES11686.1| hypothetical protein SORBIDRAFT_06g033560 [Sorghum bicolor]
Length = 497
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 92/325 (28%), Positives = 124/325 (38%), Gaps = 75/325 (23%)
Query: 63 LAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVAL 122
L R R +G ++ G+ P T + +Y G +T S+G P V L
Sbjct: 67 LKRRGRASHHSQKGSSSGGHKSIPATAALYPHSY-----GGYAFT-ASLGTPPQPLPVLL 120
Query: 123 DTGSDLFWLPC----DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC----- 173
DTGS L W+PC DC +C SS ++ P SS+S V C + C
Sbjct: 121 DTGSQLTWVPCTSNYDCRNC------SSPFAAAVPVFHPKNSSSSRLVGCRNPSCLWVHS 174
Query: 174 -ELQKQCP---SAGSNC--------PYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
E +C S G+NC PY V Y S T G L+ D L +
Sbjct: 175 AEHVAKCRAPCSRGANCTPASNVCPPYAVVYGSGST--AGLLIADTL------RAPGRAV 226
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILA----NQGLIPNSF-------- 269
S GC V P+GL G G SVP+ L + L+ F
Sbjct: 227 SGFVLGCSLVSVHQ-----PPSGLAGFGRGAPSVPAQLGLSKFSYCLLSRRFDDNAAVSG 281
Query: 270 SMCFGSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFE------- 322
S+ G D G S + P+++ Y + ++ V+VGG AV
Sbjct: 282 SLVLGGDNDGMQYVPLVKSAAGDKQPYAV-----YYYLALSGVTVGGKAVRLPARAFAAN 336
Query: 323 ----FSAIFDSGTSFTYLNDPAYTQ 343
AI DSGT+FTYL DP Q
Sbjct: 337 AAGSGGAIVDSGTTFTYL-DPTVFQ 360
>gi|302793638|ref|XP_002978584.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
gi|300153933|gb|EFJ20570.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
Length = 407
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 53/171 (30%), Positives = 76/171 (44%), Gaps = 22/171 (12%)
Query: 65 HRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDT 124
R R+ + + LA + D+ T G T L ++ + +G PA S + +DT
Sbjct: 15 RRVRWIESKAK-LAGKKKDEASSTDLNGPVTSGLLYGSGEYFVRLGLGTPARSLFMVVDT 73
Query: 125 GSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAG 183
GSDL WL C C SC + I+ P SS+ ++PC S LC+ + +G
Sbjct: 74 GSDLPWLQCQPCKSCYKQADP---------IFDPRNSSSFQRIPCLSPLCKALEVHSCSG 124
Query: 184 -----SNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCG 229
S C YQV Y DG+ S G D+ L T K ++FGCG
Sbjct: 125 SRGATSRCSYQVAY-GDGSFSVGDFSSDLFTLGTGSKAMS-----VAFGCG 169
>gi|224072755|ref|XP_002303865.1| predicted protein [Populus trichocarpa]
gi|222841297|gb|EEE78844.1| predicted protein [Populus trichocarpa]
Length = 412
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 80/272 (29%), Positives = 120/272 (44%), Gaps = 45/272 (16%)
Query: 97 RLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIY 155
RL SL ++ V +G ++ IV DTGSDL W+ C C C + + ++
Sbjct: 60 RLQSLNYI--VTVELGGRKMTVIV--DTGSDLSWVQCQPCNRCYNQQDP---------VF 106
Query: 156 SPNTSSTSSKVPCNSTLCE-LQKQ------CPSAGSNCPYQVRYLSDGTMSTGFLVEDVL 208
+P+ S + V CNS C LQ C S C Y V Y DG+ ++G + + L
Sbjct: 107 NPSKSPSYRTVLCNSLTCRSLQLATGNSGVCGSNPPTCNYVVNY-GDGSYTSGEVGMEHL 165
Query: 209 HLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNS 268
+L + +V++ I FGCGR G F +GL GLG S+ S ++ +
Sbjct: 166 NLG-----NTTVNNFI-FGCGRKNQGLF---GGASGLVGLGRTDLSLISQISP--MFGGV 214
Query: 269 FSMCF---GSDGTGRISFGDKGSPGQGETPFSL-RQTH----PTYNITITQVSVGGNAVN 320
FS C ++ +G + G S + TP S R H P Y + +T ++VGG V
Sbjct: 215 FSYCLPTTEAEASGSLVMGGNSSVYKNTTPISYTRMIHNPLLPFYFLNLTGITVGGVEVQ 274
Query: 321 F----EFSAIFDSGTSFTYLNDPAYTQISETF 348
+ I DSGT + L Y + F
Sbjct: 275 APSFGKDRMIIDSGTVISRLPPSIYQALKAEF 306
>gi|357521081|ref|XP_003630829.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355524851|gb|AET05305.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 526
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 73/283 (25%), Positives = 111/283 (39%), Gaps = 42/283 (14%)
Query: 109 VSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
+ VG P F + D +D WL C C+ C +S I+ P+ SS+ + +
Sbjct: 191 IGVGGPPQKFYMIFDLQTDFTWLQCQPCIKCYDQPDS---------IFDPSQSSSYTLLS 241
Query: 168 CNSTLCEL--QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
C + C L C G C Y + Y DGT + G L+ + + + S VD R+S
Sbjct: 242 CETKHCNLLPNSSCSDDGY-CRYNITY-KDGTNTEGVLINETVSF----ESSGWVD-RVS 294
Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGD 285
GC G F+ +G FGLG S PS + + S+ + DG +
Sbjct: 295 LGCSNKNQGPFV---GSDGTFGLGRGSLSFPSRINASSM---SYCLVESKDGYSSSTLEF 348
Query: 286 KGSPGQGETPFSLRQT---HPTYNITITQVSVGGNAVNFEFSA-----------IFDSGT 331
P G L Q Y + + + VGG ++ S I S +
Sbjct: 349 NSPPCSGSVKAKLLQNPKAENLYYVGLKGIKVGGEKIDVPNSTFTIDPYGNGGMIVSSSS 408
Query: 332 SFTYLNDPAYTQISETFNSLAKEKR-ETSTSDLPFEYCYVLRS 373
T L + Y + + F +AK + E + L F+ CY L S
Sbjct: 409 LITMLENDTYNVVRDAF--VAKTQHLERLKAFLQFDTCYNLSS 449
>gi|449461377|ref|XP_004148418.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
gi|449518059|ref|XP_004166061.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 436
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 78/265 (29%), Positives = 108/265 (40%), Gaps = 44/265 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ V +G P + + LDT +D W PC C+ C SS+ +S SST
Sbjct: 95 YVVRVQLGTPGQTMYMVLDTSNDAAWAPCSGCIGC-----SST------TTFSAQNSSTF 143
Query: 164 SKVPCNSTLCELQK--QCPSAGS-NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
+ + C+ C + CP+ G+ +C + Y D T S LV+D LHL + V
Sbjct: 144 ATLDCSKPECTQARGLSCPTTGNVDCLFNQTYGGDSTFS-ATLVQDSLHLGPN------V 196
Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDG--- 277
SFGC +GS + P GL GLG S+ I + L FS C S
Sbjct: 197 IPNFSFGCISSASGSSI---PPQGLMGLGRGPLSL--ISQSGSLYSGLFSYCLPSFKSYY 251
Query: 278 -TGRISFGDKGSPGQGETPFSLRQTH-PT-YNITITQVSVGGNAV-----------NFEF 323
+G + G G P T L H P+ Y + +T +SVG V N
Sbjct: 252 FSGSLKLGPVGQPKAIRTTPLLHNPHRPSLYYVNLTGISVGRVLVPISPELLAFDPNTGA 311
Query: 324 SAIFDSGTSFTYLNDPAYTQISETF 348
I DSGT T YT + + F
Sbjct: 312 GTIIDSGTVITRFVPAIYTAVRDEF 336
>gi|340811098|gb|AEK75475.1| S5 [Oryza nivara]
Length = 357
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 72/257 (28%), Positives = 105/257 (40%), Gaps = 34/257 (13%)
Query: 109 VSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
VS+G+P + +VA+DTGS L W+ C C H ++ +G + D P S TS +V
Sbjct: 3 VSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFD-----PGRSYTSRRVR 57
Query: 168 CNSTLC-EL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
C+S C EL Q C +C Y V Y + S G +V D L +
Sbjct: 58 CSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDSFMD--- 114
Query: 220 VDSRISFGCGR-VQTGSFLDGAAPNGLFGLGMDK--TSVPSILANQGLIPNSFSMCFGSD 276
+ FGC V+ F G G + P IL+ + +FS C +D
Sbjct: 115 ----LMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYK-----AFSYCLPTD 165
Query: 277 GT--GRISFG--DKGSPGQGETPFSLRQTHPTYNITITQ-VSVGGNAVNFEFSAIFDSGT 331
T G + G D+ + G TP PTY++T ++ G V I DSG
Sbjct: 166 ETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTTEMLIANGQRLVTSSSEMIVDSGA 225
Query: 332 SFTYLNDPAYTQISETF 348
T L + + +T
Sbjct: 226 QRTSLWPSTFALLDKTI 242
>gi|340810961|gb|AEK75407.1| S5 [Oryza sativa]
gi|340811037|gb|AEK75445.1| S5 [Oryza rufipogon]
Length = 357
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 72/257 (28%), Positives = 105/257 (40%), Gaps = 34/257 (13%)
Query: 109 VSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
VS+G+P + +VA+DTGS L W+ C C H ++ +G + D P S TS +V
Sbjct: 3 VSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFD-----PGRSYTSRRVR 57
Query: 168 CNSTLC-EL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
C+S C EL Q C +C Y V Y + S G +V D L +
Sbjct: 58 CSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDSFMD--- 114
Query: 220 VDSRISFGCGR-VQTGSFLDGAAPNGLFGLGMDK--TSVPSILANQGLIPNSFSMCFGSD 276
+ FGC V+ F G G + P IL+ + L S C +D
Sbjct: 115 ----LMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKAL-----SYCLPTD 165
Query: 277 GT--GRISFG--DKGSPGQGETPFSLRQTHPTYNITITQ-VSVGGNAVNFEFSAIFDSGT 331
T G + G D+ + G TP PTY++T+ ++ G V I DSG
Sbjct: 166 ETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRLVTSSSEMIVDSGA 225
Query: 332 SFTYLNDPAYTQISETF 348
T L + + +T
Sbjct: 226 QRTSLWPSTFALLDKTI 242
>gi|7548466|gb|AAA34371.2| secreted aspartyl proteinase 1 [Candida albicans]
Length = 391
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 71/281 (25%), Positives = 110/281 (39%), Gaps = 56/281 (19%)
Query: 98 LNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDF----N 153
LN+ + ++++G F V +DTGS W+P V+C GQ DF
Sbjct: 57 LNNEHVSYAADITIGSNKQKFNVIVDTGSSDLWVPDASVTCDK---PRPGQSADFCKGKG 113
Query: 154 IYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
IY+P +S+TS + P+ + Y DG+ S G L +D
Sbjct: 114 IYTPKSSTTSQNL------------------GTPFNIGY-GDGSSSQGTLYKDT------ 148
Query: 214 EKQSKSVDSRISFGCGRVQTGSFLD---GAAPNGLFGLGM-------DKTSVPSILANQG 263
+ FG + F D + P G+ G+G D +VP L NQG
Sbjct: 149 ----------VGFGGASITKQVFADITKTSIPQGILGIGYKTNEAAGDYDNVPVTLKNQG 198
Query: 264 LI-PNSFSMCFGS--DGTGRISFGDKGSPGQGETPFSLRQTHP-TYNITITQVSVGGNAV 319
+I N++S+ S TG+I FG + ++ T IT+ + G +
Sbjct: 199 VIAKNAYSLYLNSPNAATGQIIFGGVDKAKYSGSLIAVPVTSDRELRITLNSLKAVGKNI 258
Query: 320 NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETST 360
N + DSGT+ TYL I + F + K + T
Sbjct: 259 NGNIDVLLDSGTTITYLQQDVAQDIIDAFQAELKSDGQGHT 299
>gi|116789442|gb|ABK25248.1| unknown [Picea sitchensis]
Length = 366
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 65/225 (28%), Positives = 99/225 (44%), Gaps = 27/225 (12%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++T + VG P + LDTGSD+ W+ C+ C C + I++P+ S++
Sbjct: 157 YFTRIGVGTPTREQYMVLDTGSDVAWIQCEPCRECYSQADP---------IFNPSYSASF 207
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
S V C+S +C C Y+ Y DG+ STG + L T +
Sbjct: 208 STVGCDSAVCSQLDAYDCHSGGCLYEASY-GDGSYSTGSFATETLTFGTTSV------AN 260
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGTGR 280
++ GCG G F+ A L GLG S P+ + Q ++FS C SD +G
Sbjct: 261 VAIGCGHKNVGLFIGAAG---LLGLGAGALSFPNQIGTQ--TGHTFSYCLVDRESDSSGP 315
Query: 281 ISFGDKGSP-GQGETPFSLRQTHPT-YNITITQVSVGGNAVNFEF 323
+ FG K P G TP PT Y +++T +S+ A + F
Sbjct: 316 LQFGPKSVPVGSIFTPLEKNPHLPTFYYLSVTAISISAIACVWSF 360
>gi|88174558|gb|ABD39354.1| chloroplast nucleoid DNA-binding protein [Oryza meridionalis]
Length = 321
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 76/297 (25%), Positives = 123/297 (41%), Gaps = 56/297 (18%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+ +V +G PA + IV +DTGS W+ C+C C H + + + S+T +
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCECDGC-H---------TNPRTFLQSRSTTCA 50
Query: 165 KVPCNSTLCELQKQCPSAGS-----NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
KV C +++C L P +CP++V Y DG+ S G L +D L + +K
Sbjct: 51 KVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSY-QDGSASYGILYQDTLTFSDVQKIPG- 108
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC------- 272
+FGC G+ G +GL G+G SV L + FS C
Sbjct: 109 ----FTFGCNLDSFGANEFGNV-DGLLGMGAGPMSV---LKQSSPTFDGFSYCLPLQMSE 160
Query: 273 --FGSDGTGRISFGDKGSPGQGE--TPFSLRQTHPTYNITITQVSVGGNAVNFEFS---- 324
F S TG S G + + ++ + + +T +SV G + S
Sbjct: 161 RGFFSKTTGYFSLGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVFSR 220
Query: 325 --AIFDSGTSFTYLNDPAYTQISETFNSL------AKEKRETSTSDLPFEYCYVLRS 373
+FDSG+ +Y+ D A + + + L A+E+ E + CY +RS
Sbjct: 221 KGVVFDSGSELSYIPDRALSVLRQRIRELLLKRGAAEEESERN--------CYDMRS 269
>gi|116787333|gb|ABK24467.1| unknown [Picea sitchensis]
Length = 497
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 78/268 (29%), Positives = 106/268 (39%), Gaps = 47/268 (17%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++T + VG P + LDTGSD+ W+ C C C + +++P SST
Sbjct: 153 YFTRLGVGTPPRYTYMVLDTGSDIMWIQCLPCAKCYGQTDP---------LFNPAASSTY 203
Query: 164 SKVPCNSTLCELQKQCPSAGSN----CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
KVPC + LC K+ +G C YQV Y DG+ + G + L
Sbjct: 204 RKVPCATPLC---KKLDISGCRNKRYCEYQVSY-GDGSFTVGDFSTETLTF------RGQ 253
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GS 275
V R++ GCG G F+ A GL S PS Q FS C S
Sbjct: 254 VIRRVALGCGHDNEGLFIGAAGLLGLG---RGSLSFPSQTGAQ--FSKRFSYCLVDRSAS 308
Query: 276 DGTGRISFGDKGSPGQGE-TP-FSLRQTHPTYNITITQVSVGGNAVN------FEFSA-- 325
+ FG P TP S + Y + + +SVGG + F A
Sbjct: 309 GTASSLIFGKAAIPKSAIFTPLLSNPKLDTFYYVELVGISVGGRRLTSIPASVFRMDATG 368
Query: 326 ----IFDSGTSFTYLNDPAYTQISETFN 349
I DSGTS T L D AY+ + + F
Sbjct: 369 NGGVIIDSGTSVTRLVDSAYSTMRDAFR 396
>gi|224085379|ref|XP_002307559.1| predicted protein [Populus trichocarpa]
gi|222857008|gb|EEE94555.1| predicted protein [Populus trichocarpa]
Length = 455
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 87/298 (29%), Positives = 123/298 (41%), Gaps = 49/298 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
++ +V +G P + + LDTGSDL W+ CV C H +G Y P SS+
Sbjct: 90 YFMDVFIGTPPKHYSLILDTGSDLNWI--QCVPC-HDCFEQNGPY-----YDPKESSSFR 141
Query: 165 KVPCNSTLCELQKQ------CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
+ C+ C L C + CPY Y D + +TG + + K
Sbjct: 142 NIGCHDPRCHLVSSPDPPLPCKAENQTCPYFYWY-GDSSNTTGDFATETFTVNLTSPTGK 200
Query: 219 SVDSRIS---FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-- 273
S R+ FGCG G F GA+ GL GLG S S L Q L +SFS C
Sbjct: 201 SEFKRVENVMFGCGHWNRGLF-HGAS--GLLGLGRGPLSFSSQL--QSLYGHSFSYCLVD 255
Query: 274 ---GSDGTGRISFG-DKGSPGQGETPFSLR---QTHPT---YNITITQVSVGGNAVNFEF 323
++ + ++ FG DK E F+ + +P Y + I + VGG +N
Sbjct: 256 RNSDTNVSSKLIFGEDKDLLNHPELNFTTLVGGKENPVDTFYYVQIKSIMVGGEVLNIPE 315
Query: 324 S-----------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP-FEYCY 369
S I DSGT+ +Y +PAY I + F + K K D P + CY
Sbjct: 316 STWNMTSDGVGGTIVDSGTTLSYFTEPAYQIIKDAF--VKKVKGYPIVQDFPILDPCY 371
>gi|388495448|gb|AFK35790.1| unknown [Medicago truncatula]
Length = 434
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 75/276 (27%), Positives = 111/276 (40%), Gaps = 43/276 (15%)
Query: 112 GQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNS 170
G P + ++ALDT SD W+PC CV C S+S ++P S++ V C S
Sbjct: 104 GTPPQTLLLALDTSSDAAWIPCSGCVGC-----STSKP------FAPIKSTSFRNVSCGS 152
Query: 171 TLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGR 230
C+ GS C + Y S ++ +V+D L LA D +FGC
Sbjct: 153 PHCKQVPNPTCGGSACAFNFTYGSSSIAAS--VVQDTLTLAADPIPG------YTFGCVN 204
Query: 231 VQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS----DGTGRISFGDK 286
TGS +AP + +Q L ++FS C S + +G + G
Sbjct: 205 KTTGS----SAPQQGLLGLGRGPLS-LLSQSQNLYKSTFSYCLPSFKSINFSGSLRLGPV 259
Query: 287 GSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEFSA-----------IFDSGTSF 333
P + + LR + Y + + + VG V+ +A IFDSGT F
Sbjct: 260 YQPKRIKYTPLLRNPRRSSLYYVNLVAIKVGRKIVDIPPAALAFNPTTGAGTIFDSGTVF 319
Query: 334 TYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
T L +P YT + F K +T F+ CY
Sbjct: 320 TRLAEPVYTAVRNEFRRRVGPKLPVTTLG-GFDTCY 354
>gi|297848386|ref|XP_002892074.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297337916|gb|EFH68333.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 485
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 87/306 (28%), Positives = 126/306 (41%), Gaps = 50/306 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++T + VG PA + LDTGSD+ WL C C C + I+ P S T
Sbjct: 142 YFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDP---------IFDPRKSKTY 192
Query: 164 SKVPCNSTLCELQKQCPSAGSN-----CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
+ +PC+S C ++ SAG N C YQV Y DG+ + G + L + +
Sbjct: 193 ATIPCSSPHC---RRLDSAGCNTRRKTCLYQVSY-GDGSFTVGDFSTETLTFRRNRVKG- 247
Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----G 274
++ GCG G F+ A L GLG K S P ++ FS C
Sbjct: 248 -----VALGCGHDNEGLFVGAAG---LLGLGKGKLSFPGQTGHR--FNQKFSYCLVDRSA 297
Query: 275 SDGTGRISFGDKGSPGQGE-TP-FSLRQTHPTYNITITQVSVGGNAVNFEFSAIF----- 327
S + FG+ TP S + Y + + +SVGG V +++F
Sbjct: 298 SSKPSSVVFGNAAVSRIARFTPLLSNPKLDTFYYVELLGISVGGTRVPGVAASLFKLDQI 357
Query: 328 -------DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRSFLHLQA- 379
DSGTS T L PAY + + F AK + L F+ C+ L + ++
Sbjct: 358 GNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKALKRAPDFSL-FDTCFDLSNMNEVKVP 416
Query: 380 LVVLPF 385
VVL F
Sbjct: 417 TVVLHF 422
>gi|21594980|gb|AAM66061.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
Length = 485
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 87/306 (28%), Positives = 126/306 (41%), Gaps = 50/306 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++T + VG PA + LDTGSD+ WL C C C + I+ P S T
Sbjct: 142 YFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDP---------IFDPRKSKTY 192
Query: 164 SKVPCNSTLCELQKQCPSAGSN-----CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
+ +PC+S C ++ SAG N C YQV Y DG+ + G + L + +
Sbjct: 193 ATIPCSSPHC---RRLDSAGCNTRRKTCLYQVSY-GDGSFTVGDFSTETLTFRRNRVKG- 247
Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----G 274
++ GCG G F+ A L GLG K S P ++ FS C
Sbjct: 248 -----VALGCGHDNEGLFVGAAG---LLGLGKGKLSFPGQTGHR--FNQKFSYCLVDRSA 297
Query: 275 SDGTGRISFGDKGSPGQGE-TP-FSLRQTHPTYNITITQVSVGGNAVNFEFSAIF----- 327
S + FG+ TP S + Y + + +SVGG V +++F
Sbjct: 298 SSKPSSVVFGNAAVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQI 357
Query: 328 -------DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRSFLHLQA- 379
DSGTS T L PAY + + F AK + L F+ C+ L + ++
Sbjct: 358 GNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPNFSL-FDTCFDLSNMNEVKVP 416
Query: 380 LVVLPF 385
VVL F
Sbjct: 417 TVVLHF 422
>gi|225464832|ref|XP_002272243.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 467
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 80/301 (26%), Positives = 120/301 (39%), Gaps = 69/301 (22%)
Query: 109 VSVGQPALSFIVALDTGSDLFWLPCD----CVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+S G P + + +DTGSDL W PC C +C ++ S NI+ P +SS+S
Sbjct: 94 LSFGTPPQTLPLIMDTGSDLVWFPCTHRYVCRNCSFSTSNPSS-----NIFIPKSSSSSK 148
Query: 165 KVPCNSTLC------ELQKQC-------PSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLA 211
+ C + C ++Q +C P+ CP + + G ++ G ++ + L L
Sbjct: 149 VLGCVNPKCGWIHGSKVQSRCRDCEPTSPNCTQICPPYLVFYGSG-ITGGIMLSETLDLP 207
Query: 212 TDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSM 271
K V + I GC S L + P G+ G G S+PS L GL FS
Sbjct: 208 -----GKGVPNFI-VGC------SVLSTSQPAGISGFGRGPPSLPSQL---GL--KKFSY 250
Query: 272 CFGS----DGTGRISFGDKGSPGQGE-------TPF----SLRQTHP---TYNITITQVS 313
C S D T S G GE TPF + H Y + + ++
Sbjct: 251 CLLSRRYDDTTESSSLVLDGESDSGEKTAGLSYTPFVQNPKVAGKHAFSVYYYLGLRHIT 310
Query: 314 VGGNAVNFEFS-----------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSD 362
VGG V + I DSGT+FTY+ + ++ F + KR T
Sbjct: 311 VGGKHVKIPYKYLIPGADGDGGTIIDSGTTFTYMKGEIFELVAAEFEKQVQSKRATEVEG 370
Query: 363 L 363
+
Sbjct: 371 I 371
>gi|224286173|gb|ACN40797.1| unknown [Picea sitchensis]
Length = 383
Score = 64.7 bits (156), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 85/303 (28%), Positives = 127/303 (41%), Gaps = 55/303 (18%)
Query: 84 KTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGL 142
+TP+T G+ Y + +++G PALS +DTGSDL W C+ C C
Sbjct: 30 ETPVTPDIGSGEYLIQ---------MAIGTPALSLSAIMDTGSDLVWTKCNPCTDCSTSS 80
Query: 143 NSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQK--QCPSAGSNCPYQVRYLSDGTMST 200
++SST SKV C S+LC+ C + G +C Y Y D + ++
Sbjct: 81 IYDP-----------SSSSTYSKVLCQSSLCQPPSIFSCNNDG-DCEYVYPY-GDRSSTS 127
Query: 201 GFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILA 260
G L ++ ++ S+S+ I+FGCG G D GL G G S+ S L
Sbjct: 128 GILSDETFSIS-----SQSL-PNITFGCGHDNQG--FDKVG--GLVGFGRGSLSLVSQLG 177
Query: 261 NQGLIPNSFSMCF----GSDGTGRISFGDKGS---PGQGETPFSLRQTHPTYNITITQVS 313
+ N FS C S T + G+ S G TP + Y +++ +S
Sbjct: 178 PS--MGNKFSYCLVSRTDSSKTSPLFIGNTASLEATTVGSTPLVQSSSTNHYYLSLEGIS 235
Query: 314 VGGNAV-----NFEFSA------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSD 362
VGG ++ F+ + I DSGT+ T+L AY + E S + D
Sbjct: 236 VGGQSLAIPTGTFDIQSDGSGGLIIDSGTTLTFLQQTAYDAVKEAMVSSINLPQADGQLD 295
Query: 363 LPF 365
L F
Sbjct: 296 LCF 298
>gi|449462551|ref|XP_004149004.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449515029|ref|XP_004164552.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 434
Score = 64.7 bits (156), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 79/281 (28%), Positives = 114/281 (40%), Gaps = 37/281 (13%)
Query: 108 NVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
++ +G P ++ I DTGSDL W C+ C N S I++P SS+ KV
Sbjct: 93 SIFIGTPPVNVIAIADTGSDLTW--TQCLPCRECFNQSQ------PIFNPRRSSSYRKVS 144
Query: 168 CNSTLCELQK--QCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
C S C + C +C Y Y D + + G L D + + + K K+V
Sbjct: 145 CASDTCRSLESYHCGPDLQSCSYGYSY-GDRSFTYGDLASDQITIGS-FKLPKTV----- 197
Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-----GSDGTGR 280
GCG G+F G + G + V + G+ P FS C ++ TG
Sbjct: 198 IGCGHQNGGTF-GGVTSGIIGLGGGSLSLVSQMRTIAGVKPR-FSYCLPTFFSNANITGT 255
Query: 281 ISFGDKGSPGQGE---TPFSLRQTHPTYNITITQVSVGG---------NAVNFEFSAIFD 328
ISFG K + TP R Y +T+ +SVG +A+ + I D
Sbjct: 256 ISFGRKAVVSGRQVVSTPLVPRSPDTFYFLTLEAISVGKKRFKAANGISAMTNHGNIIID 315
Query: 329 SGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
SGT+ T L Y + T + K KR S + E CY
Sbjct: 316 SGTTLTLLPRSLYYGVFSTLARVIKAKRVDDPSGI-LELCY 355
>gi|340810915|gb|AEK75384.1| S5 [Oryza sativa]
gi|340810917|gb|AEK75385.1| S5 [Oryza sativa]
gi|340810919|gb|AEK75386.1| S5 [Oryza sativa]
gi|340810927|gb|AEK75390.1| S5 [Oryza sativa]
gi|340810975|gb|AEK75414.1| S5 [Oryza nivara]
gi|340810979|gb|AEK75416.1| S5 [Oryza nivara]
gi|340810995|gb|AEK75424.1| S5 [Oryza nivara]
gi|340811027|gb|AEK75440.1| S5 [Oryza nivara]
gi|340811063|gb|AEK75458.1| S5 [Oryza nivara]
Length = 357
Score = 64.7 bits (156), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 71/257 (27%), Positives = 106/257 (41%), Gaps = 34/257 (13%)
Query: 109 VSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
VS+G+P + +VA+DTGS L W+ C C H ++ +G + D P S TS +V
Sbjct: 3 VSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFD-----PGRSYTSRRVR 57
Query: 168 CNSTLC-------ELQK-QCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
C+S C LQ+ C +C Y V Y + S G +V D L +
Sbjct: 58 CSSVKCGEPRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDSFMD--- 114
Query: 220 VDSRISFGCGR-VQTGSFLDGAAPNGLFGLGMDK--TSVPSILANQGLIPNSFSMCFGSD 276
+ FGC V+ F G G + P IL+ + +FS C +D
Sbjct: 115 ----LMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYK-----AFSYCLPTD 165
Query: 277 GT--GRISFG--DKGSPGQGETPFSLRQTHPTYNITITQ-VSVGGNAVNFEFSAIFDSGT 331
T G + G D+ + G TP PTY++T+ ++ G V I DSG
Sbjct: 166 ETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRLVTSSSEMIVDSGA 225
Query: 332 SFTYLNDPAYTQISETF 348
T L + + +T
Sbjct: 226 QRTSLWPSTFALLDKTI 242
>gi|226491620|ref|NP_001149154.1| pepsin A precursor [Zea mays]
gi|195625132|gb|ACG34396.1| pepsin A [Zea mays]
Length = 537
Score = 64.7 bits (156), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 77/313 (24%), Positives = 118/313 (37%), Gaps = 61/313 (19%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDC--VSCVHGLNSSSGQVIDF--------- 152
++ +V +G PAL + + LDT +DL W+ C H S GQ +
Sbjct: 123 MYLVSVRIGTPALPYNLVLDTATDLTWINCRLRRRKGKHYGRQSMGQTMSVGGEGATAAK 182
Query: 153 -----NIYSPNTSSTSSKVPCNSTLCELQK----QCPSAGSNCPYQVRYLSDGTMSTGFL 203
N Y P SS+ ++ C+ C + Q PS +C Y + DGT++ G
Sbjct: 183 KEASKNWYRPAKSSSWRRIRCSQKECAVLPYNTCQSPSKAESCSY-FQKTQDGTVTIGIY 241
Query: 204 VEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQG 263
++ + + + + I GC ++ G +D A +G+ LG S A +
Sbjct: 242 GKEKATVTVSDGRMAKLPGLI-LGCSVLEAGGSVD--AHDGVLSLGNGDMSFAVHAAKR- 297
Query: 264 LIPNSFSMCFGS-----DGTGRISFGDKGS---PGQGETPFSLR-QTHPTYNITITQVSV 314
FS C S D + ++FG + PG ET P Y +T V V
Sbjct: 298 -FGQRFSFCLLSANSSRDASSYLTFGPNPAVMGPGTMETDILYNVDVKPAYGAKVTGVLV 356
Query: 315 GGNAVNFEFS-----------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDL 363
GG ++ I D+ TS T L AY ++ + S L
Sbjct: 357 GGERLDIPDEVWDAERFVGGGVILDTSTSVTSLVPEAYAPVTAALDR--------HLSHL 408
Query: 364 P-------FEYCY 369
P FEYCY
Sbjct: 409 PRVYELEGFEYCY 421
>gi|449458736|ref|XP_004147103.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
gi|449518669|ref|XP_004166359.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 482
Score = 64.7 bits (156), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 76/303 (25%), Positives = 126/303 (41%), Gaps = 55/303 (18%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ + VG P + + +D+GSD+ W+ C C C + ++ P SS+
Sbjct: 143 YFVRIGVGSPPRNQYMVIDSGSDIVWVQCKPCSRCYQQSDP---------VFDPADSSSF 193
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
+ V C S +C+ + C Y+V Y DG+ + G L + L + + +
Sbjct: 194 AGVSCGSDVCDRLENTGCNAGRCRYEVSY-GDGSYTKGTLALETLTVG------QVMIRD 246
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGR--- 280
++ GCG G F+ A G+ S+ I G +FS C S GTG
Sbjct: 247 VAIGCGHTNQGMFIGAAGLL-----GLGGGSMSFIGQLGGQTGGAFSYCLVSRGTGSTGA 301
Query: 281 ISFGDKGSPGQGETPFSLRQT--HPT-YNITITQVSVGGNAVN-----FEFS------AI 326
+ FG +G+ G T SL + P+ Y I + + VGG V+ F+ + +
Sbjct: 302 LEFG-RGALPVGATWISLIRNPRAPSFYYIGLAGIGVGGVRVSVPEETFQLTEYGTNGVV 360
Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP-------FEYCYVLRSFLHLQA 379
D+GT+ T AY ++F + TS+LP F+ CY L F ++
Sbjct: 361 MDTGTAVTRFPTAAYVAFRDSFTA--------QTSNLPRAPGVSIFDTCYDLNGFESVRV 412
Query: 380 LVV 382
V
Sbjct: 413 PTV 415
>gi|348690233|gb|EGZ30047.1| hypothetical protein PHYSODRAFT_474645 [Phytophthora sojae]
Length = 642
Score = 64.7 bits (156), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 78/290 (26%), Positives = 125/290 (43%), Gaps = 43/290 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
HY + +G PA V +DTGS L LPC C C GQ D ++ + S+T+
Sbjct: 95 HYAEIYLGIPAQRASVIVDTGSHLTALPCSTCQGC--------GQHTD-PLFDVSKSTTA 145
Query: 164 SKVPCNS----TLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLA-----TDE 214
+ C+ CE Q +C Y + +G+M +V++++ + DE
Sbjct: 146 KYLACHDFDSCRSCE-QDRC--------YISQSYMEGSMWEAVMVDELVWVGGFSSPADE 196
Query: 215 KQS--KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLI-PNSFSM 271
+ K+ R GC +TG F+ NG+ GLG +++V S + N G + N F++
Sbjct: 197 MEGVLKTFGFRFPVGCQTKETGLFIT-QKENGIMGLGRHRSTVMSYMLNAGRVTQNLFTL 255
Query: 272 CFGSDGTGRISFG----DKGSPGQGETPFSLRQT--HPTY--NITITQVSVGGN--AVNF 321
CF DG G + FG + G TP ++ +P + +I + VS+G + +N
Sbjct: 256 CFAGDG-GELVFGGVDYSHHTSDVGYTPLLSDKSAYYPVHVKDILLNGVSLGIDTGTINS 314
Query: 322 EFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVL 371
I DSGT+ T+ + F+ A S L E L
Sbjct: 315 GRGVIVDSGTTDTFFDGKGKRAFMSAFSKAAGRDYSESRMKLTSEELAAL 364
>gi|45735845|dbj|BAD12880.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735971|dbj|BAD13000.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
Length = 333
Score = 64.7 bits (156), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 73/263 (27%), Positives = 110/263 (41%), Gaps = 33/263 (12%)
Query: 111 VGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNS 170
+G PA +++ +DTGS L WL C C+ + SG V ++P +SST + V C++
Sbjct: 3 LGTPATQYVMVVDTGSSLTWLQCS--PCLVSCHRQSGPV-----FNPKSSSTYASVGCSA 55
Query: 171 TLCE------LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRI 224
C L S+ + C YQ Y D + S G+L +D + +
Sbjct: 56 QQCSDLPSATLNPSACSSSNVCIYQASY-GDSSFSVGYLSKDTVSFGSTSLP------NF 108
Query: 225 SFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFG 284
+GCG+ G F A GL GL +K S+ LA + SF+ C S +
Sbjct: 109 YYGCGQDNEGLFGRSA---GLIGLARNKLSLLYQLAPS--LGYSFTYCLPSSSSSGYLSL 163
Query: 285 DKGSPGQ-GETPF-SLRQTHPTYNITITQVSVGGNAV------NFEFSAIFDSGTSFTYL 336
+PGQ TP S Y I ++ ++V GN + I DSGT T L
Sbjct: 164 GSYNPGQYSYTPMVSSSLDDSLYFIKLSGMTVAGNPLSVSSSAYSSLPTIIDSGTVITRL 223
Query: 337 NDPAYTQISETFNSLAKEKRETS 359
Y+ +S+ + K S
Sbjct: 224 PTSVYSALSKAVAAAMKGTSRAS 246
>gi|242079451|ref|XP_002444494.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
gi|241940844|gb|EES13989.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
Length = 445
Score = 64.7 bits (156), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 75/260 (28%), Positives = 110/260 (42%), Gaps = 44/260 (16%)
Query: 98 LNSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYS 156
+ G LH+T VS+G P + LDTGSDL W C + Q + +Y
Sbjct: 81 IRPFGRLHHTLTVSIGTPPQPRTLILDTGSDLIWTQCKLF--------DTRQHREKPLYD 132
Query: 157 PNTSSTSSKVPCNSTLCEL----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLAT 212
P SS+ + PC+ LCE K C + + C Y Y S T G L +
Sbjct: 133 PAKSSSFAAAPCDGRLCETGSFNTKNC--SRNKCIYTYNYGSATT--KGELASETFTFGE 188
Query: 213 DEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC 272
+ S S+D FGCG++ +GS L GA+ G+ G+ D+ S L +Q IP FS C
Sbjct: 189 HRRVSVSLD----FGCGKLTSGS-LPGAS--GILGISPDRLS----LVSQLQIPR-FSYC 236
Query: 273 ----FGSDGTGRISFGDKGSPGQGETPFSLRQTHPT---------YNITITQVSVGGNAV 319
+ T I FG + T ++ T Y + + +SVG +
Sbjct: 237 LTPFLDRNTTSHIFFGAMADLSKYRTTGPIQTTSLVTNPDGSNYYYYVPLIGISVGTKRL 296
Query: 320 NFEFS--AIFDSGTSFTYLN 337
N S AI G+ T+++
Sbjct: 297 NVPVSSFAIGRDGSGGTFVD 316
>gi|15226358|ref|NP_180389.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4803959|gb|AAD29831.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252998|gb|AEC08092.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 756
Score = 64.7 bits (156), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 69/249 (27%), Positives = 104/249 (41%), Gaps = 44/249 (17%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
++ + VG P + +DTGSD+ W C C +C I+ P+ SST
Sbjct: 420 IYLMKLQVGTPPFEIVAEIDTGSDIIWTQCMPCPNCYSQFAP---------IFDPSKSST 470
Query: 163 SSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
+ CN G++C Y++ Y +D T S G L + + + + + V +
Sbjct: 471 FREQRCN-------------GNSCHYEIIY-ADKTYSKGILATETVTIPSTSGE-PFVMA 515
Query: 223 RISFGCGRVQTGSFLDGAA--PNGLFGLGMDKTSVPSI--LANQGLIPNSFSMCFGSDGT 278
GCG T G A +G+ GL M S+ S L GLI S CF GT
Sbjct: 516 ETKIGCGLDNTNLQYSGFASSSSGIVGLNMGPLSLISQMDLPYPGLI----SYCFSGQGT 571
Query: 279 GRISFGDK---GSPGQGETPFSLRQTHPTYNITITQVSVGGNAV-------NFEFSAIF- 327
+I+FG G +++ +P Y + + VSV N + + E IF
Sbjct: 572 SKINFGTNAIVAGDGTVAADMFIKKDNPFYYLNLDAVSVEDNLIATLGTPFHAEDGNIFI 631
Query: 328 DSGTSFTYL 336
DSGT+ TY
Sbjct: 632 DSGTTLTYF 640
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 65/251 (25%), Positives = 104/251 (41%), Gaps = 48/251 (19%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
++ + VG P +DTGSDL W C C C + I+ P+ SST
Sbjct: 81 IYLMKLQVGTPPFEIAAEIDTGSDLIWTQCMPCPDCYSQFDP---------IFDPSKSST 131
Query: 163 SSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
++ C+ G +C Y++ Y D T S G L + + + + + V +
Sbjct: 132 FNEQRCH-------------GKSCHYEIIY-EDNTYSKGILATETVTIHSTSGE-PFVMA 176
Query: 223 RISFGCGRVQTGSFLD----GAAPNGLFGLGMDKTSVPSI--LANQGLIPNSFSMCFGSD 276
+ GCG T LD ++ +G+ GL M S+ S L GLI S CF
Sbjct: 177 ETTIGCGLHNTD--LDNSGFASSSSGIVGLNMGPRSLISQMDLPYPGLI----SYCFSGQ 230
Query: 277 GTGRISFGDK---GSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF--------EFSA 325
GT +I+FG G +++ +P Y + + VSV N + + +
Sbjct: 231 GTSKINFGTNAIVAGDGTVAADMFIKKDNPFYYLNLDAVSVEDNRIETLGTPFHAEDGNI 290
Query: 326 IFDSGTSFTYL 336
+ DSG++ TY
Sbjct: 291 VIDSGSTVTYF 301
>gi|77808087|gb|AAS48510.2| aspartic protease [Fagopyrum esculentum]
gi|82780908|gb|ABB88696.2| aspartic proteinase-like protein [Fagopyrum esculentum]
Length = 447
Score = 64.7 bits (156), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 82/289 (28%), Positives = 116/289 (40%), Gaps = 55/289 (19%)
Query: 110 SVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVID---FNIYSPNTSSTSSKV 166
S+G P + LDTGS L W PC + + + + +D IY+ N SST +
Sbjct: 79 SLGTPPQKVSLVLDTGSSLVWTPCTIPTATYTCQNCTFSGVDPTKIPIYARNKSSTVQSL 138
Query: 167 PCNSTLCEL----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
PC S C C S CPY G+ +TG LV DVL L+ K ++ D
Sbjct: 139 PCRSPKCNWVFGSDLNC-STTKRCPYYGLEYGLGS-TTGQLVSDVLGLS---KLNRIPD- 192
Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS---DGT- 278
FGC S + P G+ G G S+P+ L GL FS C S D T
Sbjct: 193 -FLFGC------SLVSNRQPEGIAGFGRGLASIPAQL---GL--TKFSYCLVSHRFDDTP 240
Query: 279 ---------GRISFGDKGSPGQGETPFS----LRQTHPTYNITITQVSVGGNAVNF---- 321
GR D + G PF+ L Y I+++++ VGG V
Sbjct: 241 QSGDLVLHRGR-RHADAAANGVAYAPFTKSPALSPYSEYYYISLSKILVGGKDVPIPPRY 299
Query: 322 -------EFSAIFDSGTSFTYLNDPAYTQIS-ETFNSLAKEKRETSTSD 362
+ I DSG++FT++ + ++ E + K KR D
Sbjct: 300 LVPSKEGDGGMIVDSGSTFTFMERIIFDPVARELEKHMTKYKRAKEIED 348
>gi|340810959|gb|AEK75406.1| S5 [Oryza sativa]
gi|340810971|gb|AEK75412.1| S5 [Oryza rufipogon]
Length = 357
Score = 64.7 bits (156), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 71/257 (27%), Positives = 106/257 (41%), Gaps = 34/257 (13%)
Query: 109 VSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
VS+G+P + +VA+DTGS L W+ C C H ++ +G + D P S TS +V
Sbjct: 3 VSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFD-----PGRSYTSRRVR 57
Query: 168 CNSTLC-------ELQK-QCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
C+S C LQ+ C +C Y V Y + S G +V D L +
Sbjct: 58 CSSVKCGEPRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDSFMD--- 114
Query: 220 VDSRISFGCGR-VQTGSFLDGAAPNGLFGLGMDK--TSVPSILANQGLIPNSFSMCFGSD 276
+ FGC V+ F G G + P IL+ + +FS C +D
Sbjct: 115 ----LMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYK-----AFSYCLPTD 165
Query: 277 GT--GRISFG--DKGSPGQGETPFSLRQTHPTYNITITQ-VSVGGNAVNFEFSAIFDSGT 331
T G + G D+ + G TP PTY++T+ ++ G V I DSG
Sbjct: 166 ETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRLVTSSSEMIVDSGA 225
Query: 332 SFTYLNDPAYTQISETF 348
T L + + +T
Sbjct: 226 QRTSLWPSTFALLDKTI 242
>gi|68475693|ref|XP_718053.1| secretory aspartyl proteinase SAP1p [Candida albicans SC5314]
gi|68475828|ref|XP_717987.1| secretory aspartyl proteinase SAP1p [Candida albicans SC5314]
gi|7548425|gb|AAA34368.2| secreted aspartyl proteinase 1 [Candida albicans]
gi|7548465|gb|AAA34370.2| secreted aspartyl proteinase 1 [Candida albicans]
gi|46439729|gb|EAK99043.1| secretory aspartyl proteinase SAP1p [Candida albicans SC5314]
gi|46439804|gb|EAK99117.1| secretory aspartyl proteinase SAP1p [Candida albicans SC5314]
Length = 391
Score = 64.7 bits (156), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 71/281 (25%), Positives = 110/281 (39%), Gaps = 56/281 (19%)
Query: 98 LNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDF----N 153
LN+ + ++++G F V +DTGS W+P V+C GQ DF
Sbjct: 57 LNNEHVSYAADITIGSNKQKFNVIVDTGSSDLWVPDASVTCDK---PRPGQSADFCKGKG 113
Query: 154 IYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
IY+P +S+TS + P+ + Y DG+ S G L +D
Sbjct: 114 IYTPKSSTTSQNL------------------GTPFYIGY-GDGSSSQGTLYKDT------ 148
Query: 214 EKQSKSVDSRISFGCGRVQTGSFLD---GAAPNGLFGLGM-------DKTSVPSILANQG 263
+ FG + F D + P G+ G+G D +VP L NQG
Sbjct: 149 ----------VGFGGASITKQVFADITKTSIPQGILGIGYKTNEAAGDYDNVPVTLKNQG 198
Query: 264 LI-PNSFSMCFGS--DGTGRISFGDKGSPGQGETPFSLRQTHP-TYNITITQVSVGGNAV 319
+I N++S+ S TG+I FG + ++ T IT+ + G +
Sbjct: 199 VIAKNAYSLYLNSPNAATGQIIFGGVDKAKYSGSLIAVPVTSDRELRITLNSLKAVGKNI 258
Query: 320 NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETST 360
N + DSGT+ TYL I + F + K + T
Sbjct: 259 NGNIDVLLDSGTTITYLQQDVAQDIIDAFQAELKSDGQGHT 299
>gi|15223368|ref|NP_171637.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|9665144|gb|AAF97328.1|AC023628_9 Unknown protein [Arabidopsis thaliana]
gi|22135930|gb|AAM91547.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
gi|30387595|gb|AAP31963.1| At1g01300 [Arabidopsis thaliana]
gi|332189147|gb|AEE27268.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 485
Score = 64.7 bits (156), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 87/306 (28%), Positives = 126/306 (41%), Gaps = 50/306 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++T + VG PA + LDTGSD+ WL C C C + I+ P S T
Sbjct: 142 YFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDP---------IFDPRKSKTY 192
Query: 164 SKVPCNSTLCELQKQCPSAGSN-----CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
+ +PC+S C ++ SAG N C YQV Y DG+ + G + L + +
Sbjct: 193 ATIPCSSPHC---RRLDSAGCNTRRKTCLYQVSY-GDGSFTVGDFSTETLTFRRNRVKG- 247
Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----G 274
++ GCG G F+ A L GLG K S P ++ FS C
Sbjct: 248 -----VALGCGHDNEGLFVGAAG---LLGLGKGKLSFPGQTGHR--FNQKFSYCLVDRSA 297
Query: 275 SDGTGRISFGDKGSPGQGE-TP-FSLRQTHPTYNITITQVSVGGNAVNFEFSAIF----- 327
S + FG+ TP S + Y + + +SVGG V +++F
Sbjct: 298 SSKPSSVVFGNAAVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQI 357
Query: 328 -------DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRSFLHLQA- 379
DSGTS T L PAY + + F AK + L F+ C+ L + ++
Sbjct: 358 GNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSL-FDTCFDLSNMNEVKVP 416
Query: 380 LVVLPF 385
VVL F
Sbjct: 417 TVVLHF 422
>gi|353678008|sp|P0CY27.1|CARP1_CANAL RecName: Full=Candidapepsin-1; AltName: Full=ACP 1; AltName:
Full=Aspartate protease 1; AltName: Full=Secreted
aspartic protease 1; Flags: Precursor
gi|7548436|gb|AAA34369.2| secreted aspartyl proteinase 1 [Candida albicans]
Length = 391
Score = 64.7 bits (156), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 71/281 (25%), Positives = 110/281 (39%), Gaps = 56/281 (19%)
Query: 98 LNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDF----N 153
LN+ + ++++G F V +DTGS W+P V+C GQ DF
Sbjct: 57 LNNEHVSYAADITIGSNKQKFNVIVDTGSSDLWVPDASVTCDK---PRPGQSADFCKGKG 113
Query: 154 IYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
IY+P +S+TS + P+ + Y DG+ S G L +D
Sbjct: 114 IYTPKSSTTSQNL------------------GTPFYIGY-GDGSSSQGTLYKDT------ 148
Query: 214 EKQSKSVDSRISFGCGRVQTGSFLD---GAAPNGLFGLGM-------DKTSVPSILANQG 263
+ FG + F D + P G+ G+G D +VP L NQG
Sbjct: 149 ----------VGFGGASITKQVFADITKTSIPQGILGIGYKTNEAAGDYDNVPVTLKNQG 198
Query: 264 LI-PNSFSMCFGS--DGTGRISFGDKGSPGQGETPFSLRQTHP-TYNITITQVSVGGNAV 319
+I N++S+ S TG+I FG + ++ T IT+ + G +
Sbjct: 199 VIAKNAYSLYLNSPNAATGQIIFGGVDKAKYSGSLIAVPVTSDRELRITLNSLKAVGKNI 258
Query: 320 NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETST 360
N + DSGT+ TYL I + F + K + T
Sbjct: 259 NGNIDVLLDSGTTITYLQQDVAQDIIDAFQAELKSDGQGHT 299
>gi|353678009|sp|C4YSF6.1|CARP1_CANAW RecName: Full=Candidapepsin-1; AltName: Full=ACP 1; AltName:
Full=Aspartate protease 1; AltName: Full=Secreted
aspartic protease 1; Flags: Precursor
gi|238883021|gb|EEQ46659.1| conserved hypothetical protein [Candida albicans WO-1]
Length = 391
Score = 64.7 bits (156), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 71/281 (25%), Positives = 110/281 (39%), Gaps = 56/281 (19%)
Query: 98 LNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDF----N 153
LN+ + ++++G F V +DTGS W+P V+C GQ DF
Sbjct: 57 LNNEHVSYAADITIGSNKQKFNVIVDTGSSDLWVPDASVTCDK---PRPGQSADFCKGKG 113
Query: 154 IYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
IY+P +S+TS + P+ + Y DG+ S G L +D
Sbjct: 114 IYTPKSSTTSQNL------------------GTPFYIGY-GDGSSSQGTLYKDT------ 148
Query: 214 EKQSKSVDSRISFGCGRVQTGSFLD---GAAPNGLFGLGM-------DKTSVPSILANQG 263
+ FG + F D + P G+ G+G D +VP L NQG
Sbjct: 149 ----------VGFGGASITKQVFADITKTSIPQGILGIGYKTNEAAGDYDNVPVTLKNQG 198
Query: 264 LI-PNSFSMCFGS--DGTGRISFGDKGSPGQGETPFSLRQTHP-TYNITITQVSVGGNAV 319
+I N++S+ S TG+I FG + ++ T IT+ + G +
Sbjct: 199 VIAKNAYSLYLNSPNAATGQIIFGGVDKAKYSGSLIAVPVTSDRELRITLNSLKAVGKNI 258
Query: 320 NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETST 360
N + DSGT+ TYL I + F + K + T
Sbjct: 259 NGNIDVLLDSGTTITYLQQDVAQDIIDAFQAELKSDGQGHT 299
>gi|297821064|ref|XP_002878415.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297324253|gb|EFH54674.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 486
Score = 64.7 bits (156), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 86/308 (27%), Positives = 123/308 (39%), Gaps = 53/308 (17%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ + VG PA + + LDTGSD+ WL C C +C + + I+ P S T
Sbjct: 138 YFMRLGVGTPATNVYMVLDTGSDVVWLQCSPCKACYNQSDV---------IFDPKKSKTF 188
Query: 164 SKVPCNSTLCEL---QKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
+ VPC S LC +C + S C YQV Y DG+ + G + L
Sbjct: 189 ATVPCGSRLCRRLDDSSECVTRRSKTCLYQVSY-GDGSFTEGDFSTETLTF-----HGAR 242
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF------ 273
VD + GCG G F+ A GL S PS + FS C
Sbjct: 243 VD-HVPLGCGHDNEGLFVGAAGLLGLG---RGGLSFPS--QTKSRYNGKFSYCLVDRTSS 296
Query: 274 --GSDGTGRISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGGNAV------NF 321
S I FG+ P + F+ T+P Y + + +SVGG+ V F
Sbjct: 297 GSSSKPPSTIVFGNDAVP--KTSVFTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSESQF 354
Query: 322 EFSA------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRSFL 375
+ A I DSGTS T L AY + + F L K + + S F+ C+ L
Sbjct: 355 KLDATGNGGVIIDSGTSVTRLTQSAYVALRDAFR-LGATKLKRAPSYSLFDTCFDLSGMT 413
Query: 376 HLQALVVL 383
++ V+
Sbjct: 414 TVKVPTVV 421
>gi|302809855|ref|XP_002986620.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
gi|300145803|gb|EFJ12477.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
Length = 423
Score = 64.7 bits (156), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 86/301 (28%), Positives = 134/301 (44%), Gaps = 51/301 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ ++ VG P + + DTGSD+ WL C C SC GQ +++P+ SST
Sbjct: 81 YFVSLGVGTPPRTVNMVADTGSDVLWLQCLPCQSCY-------GQTDP--LFNPSFSSTF 131
Query: 164 SKVPCNSTLCE--LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
+ C S+LC+ L + C + C YQV Y DG+ + G + L ++ S
Sbjct: 132 QSITCGSSLCQQLLIRGCRR--NQCLYQVSY-GDGSFTVGEFSTETLSFGSNAVNS---- 184
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGT 278
++ GCG G F A L GLG S PS + L + FS C S G+
Sbjct: 185 --VAIGCGHNNQGLFTGAAG---LLGLGKGLLSFPSQVGQ--LYGSVFSYCLPTRESTGS 237
Query: 279 GRISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGGNAVNFEFSA--------- 325
+ FG++ + F+ T+P Y + + + VGG +V+ +
Sbjct: 238 VPLIFGNQAVASNAQ--FTTLLTNPKLDTFYYVEMVGIKVGGTSVSIPAGSLSLDSSTGN 295
Query: 326 ---IFDSGTSFTYLNDPAYTQISETFNS-LAKEKRETSTSDLPFEYCYVL--RSFLHLQA 379
I DSGT+ T L AY + + F + + + + TS L F+ CY L RS + L A
Sbjct: 296 GGVILDSGTAVTRLVTSAYNPMRDAFRAGMPSDAKMTSGFSL-FDTCYDLSGRSSIMLPA 354
Query: 380 L 380
+
Sbjct: 355 V 355
>gi|224079535|ref|XP_002305886.1| predicted protein [Populus trichocarpa]
gi|222848850|gb|EEE86397.1| predicted protein [Populus trichocarpa]
Length = 436
Score = 64.7 bits (156), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 79/306 (25%), Positives = 118/306 (38%), Gaps = 65/306 (21%)
Query: 84 KTPLTFSAGND-TYRLN-SLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHG 141
KTP SA + YR + ++ +G P S + LDTGS L W+ C
Sbjct: 54 KTPALKSAASPYNYRSRFKYSMILLVSLPIGTPPQSQQMILDTGSQLSWIQCH------- 106
Query: 142 LNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE-------LQKQCPSAGSNCPYQVRYLS 194
+ ++ P+ SS+ S +PCN LC+ L C C Y Y +
Sbjct: 107 -KKVPRKPPPSTVFDPSLSSSFSVLPCNHPLCKPRIPDFTLPTSC-DLNRLCHYSYFY-A 163
Query: 195 DGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTS 254
DGT++ G LV + + +T + + GC D + G+ G+ + + S
Sbjct: 164 DGTLAEGNLVREKITFSTSQSTPPLI-----LGCAE-------DASDDKGILGMNLGRLS 211
Query: 255 VPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETP------------FSLRQTH 302
A+Q I FS C + R F GS GE P FS Q
Sbjct: 212 ----FASQAKI-TKFSYCVPTRQV-RPGFTPTGSFYLGENPNSAGFQYISLLTFSQSQRM 265
Query: 303 P-----TYNITITQVSVGGNAVNFEFSA-----------IFDSGTSFTYLNDPAYTQISE 346
P + + + + +G +N SA + DSG+ FTYL D AY ++ E
Sbjct: 266 PNLDPLAHTVALQGIRIGNKKLNIPVSAFRADPSGAGQSMIDSGSEFTYLVDVAYNKVRE 325
Query: 347 TFNSLA 352
LA
Sbjct: 326 EVVRLA 331
>gi|223949775|gb|ACN28971.1| unknown [Zea mays]
gi|414590177|tpg|DAA40748.1| TPA: hypothetical protein ZEAMMB73_257146 [Zea mays]
Length = 510
Score = 64.3 bits (155), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 80/275 (29%), Positives = 109/275 (39%), Gaps = 43/275 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ +V VG P F + +DTGSDL WL C C+ C G V D P SS+
Sbjct: 149 YLIDVYVGTPPRRFRMIMDTGSDLNWLQCAPCLDCFE----QRGPVFD-----PAASSSY 199
Query: 164 SKVPCNSTLC------ELQKQCPS-AGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
V C C E + C A +CPY Y + +E T
Sbjct: 200 RNVTCGDQRCGLVAPPEAPRACRRPAEDSCPYYYWYGDQSNTTGDLALESFTVNLTAPGA 259
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--- 273
S+ VD + FGCG G F A L GLG S S L + + ++FS C
Sbjct: 260 SRRVDG-VVFGCGHRNRGLFHGAAG---LLGLGRGPLSFASQL--RAVYGHTFSYCLVEH 313
Query: 274 GSDGTGRISFGDK----GSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEFS--- 324
GSD ++ FG+ P T F+ + Y + + V VGG+ +N
Sbjct: 314 GSDAGSKVVFGEDYLVLAHPQLKYTAFAPTSSPADTFYYVKLKGVLVGGDLLNISSDTWD 373
Query: 325 --------AIFDSGTSFTYLNDPAYTQISETFNSL 351
I DSGT+ +Y +PAY I + F L
Sbjct: 374 VGKDGSGGTIIDSGTTLSYFVEPAYQVIRQAFVDL 408
>gi|413938615|gb|AFW73166.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 386
Score = 64.3 bits (155), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 83/287 (28%), Positives = 126/287 (43%), Gaps = 35/287 (12%)
Query: 101 LGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNT 159
+G L+Y S+G P ++ + +DTGSDL W+ C + S + D P
Sbjct: 43 IGTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCAAAPSCYSQKDPLFD-----PAQ 97
Query: 160 SSTSSKVPCNSTLCE---LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
SS+ + VPC +C + + + C Y V Y DG+ +TG D L L+
Sbjct: 98 SSSYAAVPCGGPVCAGLGIYAASACSAAQCGYVVSY-GDGSNTTGVYSSDTLTLS----- 151
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD 276
+ S FGCG Q+G F +G +GL GLG ++ S+ + G FS C +
Sbjct: 152 ASSAVQGFFFGCGHAQSGLF-NGV--DGLLGLGREQPSL--VEQTAGTYGGVFSYCLPTK 206
Query: 277 GT--GRISFGDKGSPGQGETPFSLRQTHPT------YNITITQVSVGGNAVNFEFSAI-- 326
+ G ++ G G P FS Q P+ Y + +T +SVGG ++ SA
Sbjct: 207 PSTAGYLTLG-VGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAG 265
Query: 327 ---FDSGTSFTYLNDPAYTQISETFNS-LAKEKRETSTSDLPFEYCY 369
D+GT T L AY + F S +A T+ S+ + CY
Sbjct: 266 GTVVDTGTVVTRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCY 312
>gi|193885194|pdb|2QZW|A Chain A, Secreted Aspartic Proteinase (Sap) 1 From Candida Albicans
gi|193885195|pdb|2QZW|B Chain B, Secreted Aspartic Proteinase (Sap) 1 From Candida Albicans
Length = 341
Score = 64.3 bits (155), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 71/281 (25%), Positives = 110/281 (39%), Gaps = 56/281 (19%)
Query: 98 LNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDF----N 153
LN+ + ++++G F V +DTGS W+P V+C GQ DF
Sbjct: 7 LNNEHVSYAADITIGSNKQKFNVIVDTGSSDLWVPDASVTCDK---PRPGQSADFCKGKG 63
Query: 154 IYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
IY+P +S+TS + P+ + Y DG+ S G L +D
Sbjct: 64 IYTPKSSTTSQNL------------------GTPFYIGY-GDGSSSQGTLYKDT------ 98
Query: 214 EKQSKSVDSRISFGCGRVQTGSFLD---GAAPNGLFGLGM-------DKTSVPSILANQG 263
+ FG + F D + P G+ G+G D +VP L NQG
Sbjct: 99 ----------VGFGGASITKQVFADITKTSIPQGILGIGYKTNEAAGDYDNVPVTLKNQG 148
Query: 264 LI-PNSFSMCFGS--DGTGRISFGDKGSPGQGETPFSLRQTHP-TYNITITQVSVGGNAV 319
+I N++S+ S TG+I FG + ++ T IT+ + G +
Sbjct: 149 VIAKNAYSLYLNSPNAATGQIIFGGVDKAKYSGSLIAVPVTSDRELRITLNSLKAVGKNI 208
Query: 320 NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETST 360
N + DSGT+ TYL I + F + K + T
Sbjct: 209 NGNIDVLLDSGTTITYLQQDVAQDIIDAFQAELKSDGQGHT 249
>gi|125554529|gb|EAZ00135.1| hypothetical protein OsI_22138 [Oryza sativa Indica Group]
Length = 472
Score = 64.3 bits (155), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 75/265 (28%), Positives = 109/265 (41%), Gaps = 38/265 (14%)
Query: 103 FLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSS 161
FL VS+G+P + +VA+DTGS L W+ C C H ++ +G + D P S
Sbjct: 112 FLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFD-----PGRSY 166
Query: 162 TSSKVPCNSTLC-EL-------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
TS +V C+S C EL Q C +C Y V Y + S G +V D L +
Sbjct: 167 TSRRVRCSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIG-- 224
Query: 214 EKQSKSVDS--RISFGCGR-VQTGSFLDGAAPNGLFGLGMDK--TSVPSILANQGLIPNS 268
DS + FGC V+ F G G + P IL+ + +
Sbjct: 225 -------DSFMDLMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYK-----A 272
Query: 269 FSMCFGSDGT--GRISFG--DKGSPGQGETPFSLRQTHPTYNITITQ-VSVGGNAVNFEF 323
FS C +D T G + G D+ + G T PTY++T+ ++ G V
Sbjct: 273 FSYCLPTDETKPGYMILGRYDRAAMDGGYTSLFRSINRPTYSLTMEMLIANGQRLVTSSS 332
Query: 324 SAIFDSGTSFTYLNDPAYTQISETF 348
I DSG T L + + +T
Sbjct: 333 EMIVDSGAQRTSLWPSTFALLDKTI 357
>gi|242092368|ref|XP_002436674.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
gi|241914897|gb|EER88041.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
Length = 461
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 73/281 (25%), Positives = 109/281 (38%), Gaps = 54/281 (19%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ +++VG P + LDTGSDL W C C C H + P SST
Sbjct: 92 YLVHLAVGTPPRPVALTLDTGSDLVWTQCAPCRDCFHQ---------GLPLLDPAASSTY 142
Query: 164 SKVPCNSTLCELQ--KQCPSAG--------SNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
+ +PC + C C G +C Y Y D +++ G + D D
Sbjct: 143 AALPCGAPRCRALPFTSCGGGGRSSWGNGNRSCAYIYHY-GDKSVTVGEIATDRFTFGGD 201
Query: 214 --EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSM 271
+ S+ R++FGCG G F G+ G G + S+PS L +FS
Sbjct: 202 NGDGDSRLPTRRLTFGCGHFNKGVFQSNE--TGIAGFGRGRWSLPSQLNV-----TTFSY 254
Query: 272 CFGS---DGTGRISFGDKGSPG-----------QGE---TPFSLRQTHPT-YNITITQVS 313
CF S + ++ G G+P GE TP + P+ Y +++ +S
Sbjct: 255 CFTSMFESKSSLVTLG--GAPAAALLYSHAAHISGEVRTTPLLKNPSQPSLYFLSLKGIS 312
Query: 314 VGGNAVNFE----FSAIFDSGTSFTYLNDPAYTQISETFNS 350
VG + S I DSG S T L + Y + F +
Sbjct: 313 VGKTRLAVPEAKLRSTIIDSGASITTLPEAVYEAVKAEFAA 353
>gi|15234606|ref|NP_194732.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|4938479|emb|CAB43838.1| putative protein [Arabidopsis thaliana]
gi|7269903|emb|CAB80996.1| putative protein [Arabidopsis thaliana]
gi|67633774|gb|AAY78811.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332660310|gb|AEE85710.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 424
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 80/275 (29%), Positives = 114/275 (41%), Gaps = 46/275 (16%)
Query: 99 NSLGFLHYTNVSVGQPALSFIVALDTGSDLFW---LPCDCVSCVHGLNSSSGQVIDFNIY 155
N FL N+S+G P + ++ +DTGSDL W LPC C Q I F +
Sbjct: 74 NPAAFL--ANISIGNPPVPQLLLIDTGSDLTWIHCLPCKCYP----------QTIPF--F 119
Query: 156 SPNTSSTSSKVPCNSTLCEL-QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
P+ SST C S + Q NC Y +RY D + + G L E+ L T +
Sbjct: 120 HPSRSSTYRNASCVSAPHAMPQIFRDEKTGNCQYHLRY-RDFSNTRGILAEEKLTFETSD 178
Query: 215 KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG 274
S I FGCG+ +G +G+ GLG S+ + N G + FS CFG
Sbjct: 179 DGLIS-KQNIVFGCGQDNSGF----TKYSGVLGLGPGTFSI--VTRNFG---SKFSYCFG 228
Query: 275 SDGT----GRISFGDKGSPGQGE-TPFSLRQTHPTYNITITQVSVGGNAVNFE------- 322
S I G+ +G+ TP + Q Y + + +S G ++ E
Sbjct: 229 SLTNPTYPHNILILGNGAKIEGDPTPLQIFQDR--YYLDLQAISFGEKLLDIEPGTFQRY 286
Query: 323 ---FSAIFDSGTSFTYLNDPAYTQISETFNSLAKE 354
+ D+G S T L AY +SE + L E
Sbjct: 287 RSQGGTVIDTGCSPTILAREAYETLSEEIDFLLGE 321
>gi|388502484|gb|AFK39308.1| unknown [Medicago truncatula]
Length = 425
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 81/284 (28%), Positives = 119/284 (41%), Gaps = 46/284 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ +G P + ++A+DT +D W+PC C C L ++P S+T
Sbjct: 93 YIVRAKIGTPPQTLLLAMDTSNDAAWIPCTACDGCASTL------------FAPEKSTTF 140
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGF-LVEDVLHLATDEKQSKSVDS 222
V C + C KQ P+ G + L+ G+ S LV+D + LATD S
Sbjct: 141 KNVSCAAPEC---KQVPNPGCGVSSRNFNLTYGSSSIAANLVQDTITLATDPVPS----- 192
Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS----DGT 278
+FGC TG+ A P GL GLG S+ S Q L ++FS C S + +
Sbjct: 193 -YTFGCVSKTTGT---SAPPQGLLGLGRGPLSLLS--QTQNLYQSTFSYCLPSFKSLNFS 246
Query: 279 GRISFGDKGSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEFSA----------- 325
G + G P + + L+ + Y + + + VG V+ +A
Sbjct: 247 GSLRLGPVAQPKRIKYTPLLKNPRRSSLYYVNLEAIRVGRKVVDIPPAALAFNPTTGAGT 306
Query: 326 IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
IFDSGT FT L P Y + + F K T TS F+ CY
Sbjct: 307 IFDSGTVFTRLVAPVYVAVRDEFRRRVGPKL-TVTSLGGFDTCY 349
>gi|312281631|dbj|BAJ33681.1| unnamed protein product [Thellungiella halophila]
Length = 502
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 78/295 (26%), Positives = 115/295 (38%), Gaps = 40/295 (13%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+++ + VG PA V LDTGSD+ W+ C C C + I+ P +SST
Sbjct: 164 YFSRIGVGTPAKEMYVVLDTGSDVNWIQCLPCSECYQQSDP---------IFDPTSSSTF 214
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
+ C+ C + C YQV Y DG+ + G D + K +
Sbjct: 215 KSLTCSDPKCASLDVSACRSNKCLYQVSY-GDGSFTVGNYATDTVTFGESGKVND----- 268
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISF 283
++ GCG G F A GL G + T NQ + SFS C + + S
Sbjct: 269 VALGCGHDNEGLFTGAAGLLGLGGGALSMT-------NQ-IKAKSFSYCLVDRDSAKSSS 320
Query: 284 GDKGS----PGQGETPFSLRQTHPT-YNITITQVSVGGNAVN-----FEFSA------IF 327
D S G P T Y + ++ SVGG V+ FE A I
Sbjct: 321 LDFNSVQIGAGDATAPLLRNSKMDTFYYVGLSGFSVGGQQVSIPSSLFEVDASGAGGVIL 380
Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRSFLHLQALVV 382
D GT+ T L AY + + F L + ++ ++ F+ CY S ++ V
Sbjct: 381 DCGTAVTRLQTQAYNSLRDAFVKLTTDFKKGTSPISLFDTCYDFSSLSTVKVPTV 435
>gi|297822477|ref|XP_002879121.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297324960|gb|EFH55380.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 711
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 70/276 (25%), Positives = 108/276 (39%), Gaps = 54/276 (19%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ + VG P +DTGS++ W C+ CVH ++ I+ P+ SST
Sbjct: 379 VYLMKLQVGTPPFEIEAVIDTGSEITW--TQCLPCVHCYKQNAP------IFDPSKSSTF 430
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
+ C+ +CPY+V Y D T + G L D + + + + +
Sbjct: 431 KEKRCHD-------------HSCPYEVDYF-DKTYTKGTLATDTVTIHSTSGEPFVMAET 476
Query: 224 ISFGCGRVQTG---SFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGR 280
I GCGR + SF G GL S+ I G P S CF +GT +
Sbjct: 477 I-IGCGRNNSWFRPSF------EGFVGLNWGPLSL--ITQMGGEYPGLMSYCFAGNGTSK 527
Query: 281 ISFGDKGSPGQG----ETPFSLRQTHPTYNITITQVSVGGNAVN--------FEFSAIFD 328
I+FG G G T F Y + + VSVG + E + + D
Sbjct: 528 INFGTNAIVGGGGVVSTTMFVTTARPGFYYLNLDAVSVGDTRIETLGTPFHALEGNIVID 587
Query: 329 SGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP 364
SGT+ TY E++ +L ++ E +P
Sbjct: 588 SGTTLTYF--------PESYCNLVRQAVEHVVPAVP 615
Score = 43.5 bits (101), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 58/242 (23%), Positives = 92/242 (38%), Gaps = 52/242 (21%)
Query: 103 FLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
+ + + +G P LDTGS+L W C+ C+H + + I+ P+ SST
Sbjct: 63 YEYLMKLQIGTPPFEVEAVLDTGSELIW--TQCLPCLHCYDQKAP------IFDPSKSST 114
Query: 163 SSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
+ CN + +CPY++ Y D + + G L + + + + +
Sbjct: 115 FKETRCN-----------TPDHSCPYKLVY-DDKSYTQGTLATETVTIHSTSGVPFVMPE 162
Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRIS 282
I GC R +GS G P+ +G+ + S+ I G P G G +S
Sbjct: 163 TI-IGCSRNNSGS---GFRPSSSGIVGLSRGSLSLISQMGGAYP----------GDGVVS 208
Query: 283 FGDKGSPGQGETPFSLRQTHPTYNITITQVSVGG---NAVNFEFSA-----IFDSGTSFT 334
T F+ Y + + VSVG V F A + DSGT T
Sbjct: 209 ----------TTMFAKTAKRGQYYLNLDAVSVGDTRIETVGTPFHALNGNIVIDSGTPLT 258
Query: 335 YL 336
Y
Sbjct: 259 YF 260
>gi|222637180|gb|EEE67312.1| hypothetical protein OsJ_24552 [Oryza sativa Japonica Group]
Length = 420
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 78/270 (28%), Positives = 108/270 (40%), Gaps = 73/270 (27%)
Query: 108 NVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKV 166
N+SVG P L+F V DTGSDL W C C C + P +SST SK+
Sbjct: 89 NISVGTPLLTFSVVADTGSDLIWTQCAPCTKCFQQPAPP---------FQPASSSTFSKL 139
Query: 167 PCNSTLCELQ----KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
PC S+ C+ + C + G C Y +Y S T G+L + L + S
Sbjct: 140 PCTSSFCQFLPNSIRTCNATG--CVYNYKYGSGYT--AGYLATETLKVGDASFPS----- 190
Query: 223 RISFGCGRVQTGSFLDGAAPNGL--FGLGMDKTSVPSILANQGLIPNSFSMCFGSD---G 277
++FGC + NGL LG+ + FS C S G
Sbjct: 191 -VAFGC-----------STENGLGQLDLGVGR----------------FSYCLRSGSAAG 222
Query: 278 TGRISFGDKGSPGQGE---TPF-SLRQTHPT-YNITITQVSVGGNAV-----NFEFS--- 324
I FG + G TPF + HP+ Y + +T ++VG + F F+
Sbjct: 223 ASPILFGSLANLTDGNVQSTPFVNNPAVHPSYYYVNLTGITVGETDLPVTTSTFGFTQNG 282
Query: 325 ----AIFDSGTSFTYLNDPAYTQISETFNS 350
I DSGT+ TYL Y + + F S
Sbjct: 283 LGGGTIVDSGTTLTYLAKDGYEMVKQAFLS 312
>gi|357143328|ref|XP_003572882.1| PREDICTED: uncharacterized protein LOC100846829 [Brachypodium
distachyon]
Length = 836
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 81/256 (31%), Positives = 109/256 (42%), Gaps = 38/256 (14%)
Query: 99 NSLGFLHYT-NVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSP 157
+S+G L Y VS+G P ++ V +DTGSD+ W+ C + ++ P
Sbjct: 493 HSIGTLQYVVTVSLGTPGVAQTVEVDTGSDVSWVQCAPCAAPACYAQKD------QLFDP 546
Query: 158 NTSSTSSKVPCNSTLC-ELQ---KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
SS+ S VPC + C EL C +AGS C Y V Y DG+ +TG D L L
Sbjct: 547 AKSSSYSAVPCAADACSELSTYGHGC-AAGSQCGYVVSY-GDGSNTTGVYGSDTLTLTDA 604
Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGL---GMDKTSVPSILANQGLIPNSFS 270
+ + + FGCG Q G F A +GL L GM TS S G+ FS
Sbjct: 605 DAVTGFL-----FGCGHAQAGLF---AGIDGLLALGRKGMSLTSQTSGAYGGGV----FS 652
Query: 271 MCF--GSDGTGRISFGDKGSP-GQGETPFSLRQTHPT-YNITITQVSVGGN------AVN 320
C TG ++ G S G T PT Y + +T + VGG A
Sbjct: 653 YCLPPSPSSTGFLTLGGPSSASGFATTGLLTAWDVPTFYMVMLTGIGVGGQQLSGVPASA 712
Query: 321 FEFSAIFDSGTSFTYL 336
F + D+GT T L
Sbjct: 713 FAGGTVVDTGTVITRL 728
>gi|357127507|ref|XP_003565421.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
distachyon]
Length = 438
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 71/289 (24%), Positives = 112/289 (38%), Gaps = 52/289 (17%)
Query: 103 FLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
F + + V P + + DTGS L WL C + ++P SS+
Sbjct: 74 FEYLMALDVSTPPVRMLALADTGSSLVWLKCKLPAA----------------HTP-ASSS 116
Query: 163 SSKVPCNSTLCEL---QKQCPSAGSN---CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
+++PC++ C+ C + GS C Y+ + +DG+ + G + D +T
Sbjct: 117 YARLPCDAFACKALGDAASCRATGSGNNICVYRYAF-ADGSCTAGPVTVDAFTFST---- 171
Query: 217 SKSVDSRISFGCG-RVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-- 273
R+ FGC R + S D +GL GL S+ S L+ + + FS C
Sbjct: 172 ------RLDFGCATRTEGLSVPD----DGLVGLANGPISLVSQLSAKTPFAHKFSYCLVP 221
Query: 274 ---GSDGTGRISFGDKG----SPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA- 325
+ ++FG SPG TP + Y I + + V G V + +
Sbjct: 222 YSSSETVSSSLNFGSHAIVSSSPGAATTPLVAGRNKSFYTIALDSIKVAGKPVPLQTTTT 281
Query: 326 --IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLR 372
I DSGT TYL + + K R S L + CY +R
Sbjct: 282 KLIVDSGTMLTYLPKAVLDPLVAALTAAIKLPRVKSPETL-YAVCYDVR 329
>gi|449527151|ref|XP_004170576.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 523
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 74/284 (26%), Positives = 115/284 (40%), Gaps = 35/284 (12%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
++ + VGQP S+ DTGSD+ WL C +G G + D P +SS+ S
Sbjct: 184 YFARIGVGQPVQSYFFVPDTGSDVSWLQCQPCDGENGCYKQIGPIFD-----PKSSSSYS 238
Query: 165 KVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRI 224
+ C+S C L + ++C Y+V Y DG+ + G L + + S S+ +
Sbjct: 239 PLSCDSEQCHLLDEAACDANSCIYEVEY-GDGSFTVGELATETFSF----RHSNSI-PNL 292
Query: 225 SFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDGTGRI 281
GCG G F+ A + L++Q L SFS C S+ + +
Sbjct: 293 PIGCGHDNEGLFVGAAG-------LIGLGGGAISLSSQ-LEATSFSYCLVDLDSESSSTL 344
Query: 282 SFGDKGSPGQGETPFSLRQTHPTYN-ITITQVSVGGNAV-----NFEFSA------IFDS 329
F +P PT+ + + +SVGG + +FE I DS
Sbjct: 345 DFNADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDESGSGGIIVDS 404
Query: 330 GTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRS 373
GT+ T + Y + + F L K + PF+ CY L S
Sbjct: 405 GTTITEIPSDVYDVLRDAFVGLTK-NLPPAPGVSPFDTCYDLSS 447
>gi|340810977|gb|AEK75415.1| S5 [Oryza rufipogon]
Length = 357
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 71/257 (27%), Positives = 105/257 (40%), Gaps = 34/257 (13%)
Query: 109 VSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
VS+G+P + +VA+DTGS L W+ C C H ++ +G + D P S TS +V
Sbjct: 3 VSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFD-----PGRSYTSRRVR 57
Query: 168 CNSTLC-------ELQK-QCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
C+S C LQ+ C +C Y V Y + S G +V D L +
Sbjct: 58 CSSVKCGEPRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDSFMD--- 114
Query: 220 VDSRISFGCGR-VQTGSFLDGAAPNGLFGLGMDK--TSVPSILANQGLIPNSFSMCFGSD 276
+ FGC V+ F G G + P IL+ + L S C +D
Sbjct: 115 ----LMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKAL-----SYCLPTD 165
Query: 277 GT--GRISFG--DKGSPGQGETPFSLRQTHPTYNITITQ-VSVGGNAVNFEFSAIFDSGT 331
T G + G D+ + G TP PTY++T+ ++ G V I DSG
Sbjct: 166 ETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRLVTSSSEMIVDSGA 225
Query: 332 SFTYLNDPAYTQISETF 348
T L + + +T
Sbjct: 226 QRTSLWPSTFALLDKTI 242
>gi|226491934|ref|NP_001140743.1| uncharacterized protein LOC100272818 [Zea mays]
gi|194700872|gb|ACF84520.1| unknown [Zea mays]
Length = 351
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 72/252 (28%), Positives = 109/252 (43%), Gaps = 34/252 (13%)
Query: 114 PALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC 173
P + V LD+ SD+ W+ CV C + QV F Y P+ S TS+ C+S C
Sbjct: 25 PGVIQTVVLDSASDVPWV--QCVPCP--IPPCHPQVDSF--YDPSRSPTSAAFSCSSPTC 78
Query: 174 ----ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCG 229
C A + C Y VRY DG+ ++G + D+L L + + S FGC
Sbjct: 79 TALGPYANGC--ANNQCQYLVRY-PDGSSTSGAYIADLLTL-----DAGNAVSGFKFGCS 130
Query: 230 RVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSP 289
+ GSF AA G+ LG S+ S A++ N+FS C + + F G P
Sbjct: 131 HAEQGSFDARAA--GIMALGGGPESLLSQTASR--YGNAFSYCIPATASDS-GFFTLGVP 185
Query: 290 GQGETPFSL------RQTHPTYNITITQVSVGGNAVN-----FEFSAIFDSGTSFTYLND 338
+ + + + RQ Y + + ++VGG + F ++ DS T+ T L
Sbjct: 186 RRASSRYVVTPMVRFRQAATFYGVLLRTITVGGQRLGVAPAVFAAGSVLDSRTAITRLPP 245
Query: 339 PAYTQISETFNS 350
AY + F S
Sbjct: 246 TAYQALRAAFRS 257
>gi|362799904|dbj|BAL41445.1| aspartyl protease 1 [Linum grandiflorum]
Length = 449
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 82/311 (26%), Positives = 122/311 (39%), Gaps = 67/311 (21%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWL---PCDCVSCVHGLNSSSGQVIDFNIYSPNTSS 161
+ N+S+G P + DTGSDL WL PCD G I+ P+ S+
Sbjct: 80 YMMNLSIGTPPFPILAIADTGSDLTWLQSKPCDQCYPQKG-----------PIFDPSNST 128
Query: 162 TSSKVPCNSTLC----ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
T K+PC + C E + C + + C Y Y D + +TG+L D + + Q
Sbjct: 129 TFHKLPCTTAPCNALDESARSC-TDPTTCGYTYSY-GDHSYTTGYLASDTVTVGNASVQI 186
Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---- 273
++V +FGCG G+F + + G+ GLG S S L + I FS C
Sbjct: 187 RNV----AFGCGTRNGGNFDEQGS--GIVGLGGGNLSFVSQLGDT--IGKKFSYCLLPLE 238
Query: 274 --------GSDGTGRISFGDK----GSPGQG----ETPFSLRQTHPTYNITITQVSVGGN 317
S T RI FGD S G TP ++ Y +TI ++VG
Sbjct: 239 NEISSQPSDSPATSRIVFGDNPVFSSSSTNGVVFATTPLVNKEPSTYYYLTIEAITVGRK 298
Query: 318 AVNF-------------------EFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRET 358
+ + E + I DSGT+ T+L + Y + K +R
Sbjct: 299 KLLYSSSSSKTASYDSGSKSSVEEGNIIIDSGTTLTFLEEEFYGALEAALVEEIKMERVN 358
Query: 359 STSDLPFEYCY 369
+ F C+
Sbjct: 359 DVKNSMFSLCF 369
>gi|168014386|ref|XP_001759733.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689272|gb|EDQ75645.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 392
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 77/295 (26%), Positives = 117/295 (39%), Gaps = 56/295 (18%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ + S+G P F + +DTGSDL ++ C C C D +Y P+ SST
Sbjct: 34 YFVDFSLGTPEQKFHLIVDTGSDLAFVQCAPCDLCYEQ---------DGPLYQPSNSSTF 84
Query: 164 SKVPCNSTLCEL------------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLA 211
+ VPC+S C L + P G+ C Y+ RY D + + G + +
Sbjct: 85 TPVPCDSAECLLIPAPVGAPCSSSYPESPPQGA-CSYEYRY-GDNSSTVGVFAYETATVG 142
Query: 212 TDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSM 271
+ ++FGCG GSF+ G+ GLG S S N F+
Sbjct: 143 GIRV------NHVAFGCGNRNQGSFVSAG---GVLGLGQGALSFTSQAGYA--FENKFAY 191
Query: 272 CFGSDGT-----GRISFGDKGSPGQGETPFSLRQTHP----TYNITITQVSVGGNAVNFE 322
C S + + FGD + F+ ++P Y + I ++ GG +
Sbjct: 192 CLTSYLSPTSVFSSLIFGDDMMSTIHDLQFTPLVSNPLNPSVYYVQIVRICFGGETLLIP 251
Query: 323 FSA-----------IFDSGTSFTYLNDPAYTQISETFN-SLAKEKRETSTSDLPF 365
SA IFDSGT+ TY + AY +I F S+ + S LP
Sbjct: 252 DSAWKIDSVGNGGTIFDSGTTVTYWSPQAYARIIAAFEKSVPYPRAPPSPQGLPL 306
>gi|148910443|gb|ABR18297.1| unknown [Picea sitchensis]
Length = 452
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 84/346 (24%), Positives = 126/346 (36%), Gaps = 57/346 (16%)
Query: 64 AHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALD 123
A+R R+ + R N P+ +G + V G P S +D
Sbjct: 85 ANRLRFLKRTSRSSKQDANANVPVRSGSGE-----------YIIQVDFGTPKQSMYTLID 133
Query: 124 TGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC-ELQKQCPSA 182
TGSD+ W+PC H I+ P SS+ C+S C E+ C
Sbjct: 134 TGSDVAWIPCKQCQGCHSTAP---------IFDPAKSSSYKPFACDSQPCQEISGNC-GG 183
Query: 183 GSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAP 242
S C ++V Y DGT G L D + L + + SFGC S + +P
Sbjct: 184 NSKCQFEVSY-GDGTQVDGTLASDAITLGSQYLPN------FSFGCAE----SLSEDTSP 232
Query: 243 NGLFGLGMDKTSVPSILANQG-LIPNSFSMCF--GSDGTGRISFGDKGSPGQGETPFSLR 299
+ + A L +FS C S +G + G + + F+
Sbjct: 233 SPGLMGLGGGSLSLLTQAPTAELFGGTFSYCLPSSSTSSGSLVLGKEAAVSSSSLKFTTL 292
Query: 300 QTHPT----YNITITQVSVGGNAVNFEFS-------AIFDSGTSFTYLNDPAYTQISETF 348
P+ Y +T+ +SVG ++ + I DSGT+ T+L AYT + + F
Sbjct: 293 IKDPSIPTFYFVTLKAISVGNTRISVPGTNIASGGGTIIDSGTTITHLVPSAYTALRDAF 352
Query: 349 NSLAKEKRETSTSDLPFEYCYVLRS--------FLHLQALVVLPFP 386
+ T D+ + CY L S LHL V L P
Sbjct: 353 RQQLSSLQPTPVEDM--DTCYDLSSSSVDVPTITLHLDRNVDLVLP 396
>gi|449440931|ref|XP_004138237.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 523
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 73/284 (25%), Positives = 114/284 (40%), Gaps = 35/284 (12%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
++ + VGQP S+ DTGSD+ WL C +G G + D P +SS+ S
Sbjct: 184 YFARIGVGQPVQSYFFVPDTGSDVSWLQCQPCDGENGCYKQIGPIFD-----PKSSSSYS 238
Query: 165 KVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRI 224
+ C+S C L + ++C Y+V Y DG+ + G L + + S S+ +
Sbjct: 239 PLSCDSEQCHLLDEAACDANSCIYEVEY-GDGSFTVGELATETFSF----RHSNSI-PNL 292
Query: 225 SFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDGTGRI 281
GCG G F+ + L++Q L SFS C S+ + +
Sbjct: 293 PIGCGHDNEGLFVGADG-------LIGLGGGAISLSSQ-LEATSFSYCLVDLDSESSSTL 344
Query: 282 SFGDKGSPGQGETPFSLRQTHPTYN-ITITQVSVGGNAV-----NFEFSA------IFDS 329
F +P PT+ + + +SVGG + +FE I DS
Sbjct: 345 DFNADQPSDSLTSPLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDESGSGGIIVDS 404
Query: 330 GTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRS 373
GT+ T + Y + + F L K + PF+ CY L S
Sbjct: 405 GTTITEIPSDVYDVLRDAFVGLTK-NLPPAPGVSPFDTCYDLSS 447
>gi|340810945|gb|AEK75399.1| S5 [Oryza sativa]
gi|340810957|gb|AEK75405.1| S5 [Oryza sativa]
gi|340811007|gb|AEK75430.1| S5 [Oryza nivara]
gi|340811073|gb|AEK75463.1| S5 [Oryza rufipogon]
gi|340811094|gb|AEK75473.1| S5 [Oryza rufipogon]
Length = 357
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 71/257 (27%), Positives = 105/257 (40%), Gaps = 34/257 (13%)
Query: 109 VSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVP 167
VS+G+P + +VA+DTGS L W+ C C H ++ +G + D P S TS +V
Sbjct: 3 VSLGKPPVVNLVAIDTGSTLSWVQCQPCAVHCHTQSAKAGPIFD-----PGRSYTSRRVR 57
Query: 168 CNSTLC-------ELQK-QCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
C+S C LQ+ C +C Y V Y + S G +V D L +
Sbjct: 58 CSSVKCGEPRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDSFMD--- 114
Query: 220 VDSRISFGCGR-VQTGSFLDGAAPNGLFGLGMDK--TSVPSILANQGLIPNSFSMCFGSD 276
+ FGC V+ F G G + P IL+ + +FS C +D
Sbjct: 115 ----LMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYK-----AFSYCLPTD 165
Query: 277 GT--GRISFG--DKGSPGQGETPFSLRQTHPTYNITITQ-VSVGGNAVNFEFSAIFDSGT 331
T G + G D+ + G TP PTY++T ++ G V I DSG
Sbjct: 166 ETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTTEMLIANGQRLVTSSSEMIVDSGA 225
Query: 332 SFTYLNDPAYTQISETF 348
T L + + +T
Sbjct: 226 QRTSLWPSTFALLDKTI 242
>gi|353678010|sp|P0CY26.1|CARP1_CANAX RecName: Full=Candidapepsin-1; AltName: Full=ACP 1; AltName:
Full=Aspartate protease 1; AltName: Full=Secreted
aspartic protease 1; Flags: Precursor
gi|578121|emb|CAA40192.1| microbial aspartic proteinases [Candida albicans]
Length = 391
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 71/281 (25%), Positives = 110/281 (39%), Gaps = 56/281 (19%)
Query: 98 LNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDF----N 153
LN+ + ++++G F V +DTGS W+P V+C GQ DF
Sbjct: 57 LNNELVSYAADITIGSNKQKFNVIVDTGSSDLWVPDASVTCDK---PRPGQSADFCKGKG 113
Query: 154 IYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
IY+P +S+TS + P+ + Y DG+ S G L +D
Sbjct: 114 IYTPKSSTTSQNL------------------GSPFYIGY-GDGSSSQGTLYKDT------ 148
Query: 214 EKQSKSVDSRISFGCGRVQTGSFLD---GAAPNGLFGLGM-------DKTSVPSILANQG 263
+ FG + F D + P G+ G+G D +VP L NQG
Sbjct: 149 ----------VGFGGASITKQVFADITKTSIPQGILGIGYKTNEAAGDYDNVPVTLKNQG 198
Query: 264 LI-PNSFSMCFGS--DGTGRISFGDKGSPGQGETPFSLRQTHP-TYNITITQVSVGGNAV 319
+I N++S+ S TG+I FG + ++ T IT+ + G +
Sbjct: 199 VIAKNAYSLYLNSPNAATGQIIFGGVDKAKYSGSLIAVPVTSDRELRITLNSLKAVGKNI 258
Query: 320 NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETST 360
N + DSGT+ TYL I + F + K + T
Sbjct: 259 NGNIDVLLDSGTTITYLQQDVAQDIIDAFQAELKSDGQGHT 299
>gi|224286159|gb|ACN40790.1| unknown [Picea sitchensis]
Length = 452
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 84/346 (24%), Positives = 125/346 (36%), Gaps = 57/346 (16%)
Query: 64 AHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALD 123
A+R R+ + R N P+ +G + V G P S +D
Sbjct: 85 ANRLRFLKRTSRSSKEDANANVPVRSGSGE-----------YIIQVDFGTPKQSMYTLID 133
Query: 124 TGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC-ELQKQCPSA 182
TGSD+ W+PC H I+ P SS+ C+S C E+ C
Sbjct: 134 TGSDVAWIPCKQCQGCHSTAP---------IFDPAKSSSYKPFACDSQPCQEISGNC-GG 183
Query: 183 GSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGR-VQTGSFLDGAA 241
S C ++V Y DGT G L D + L + + SFGC + ++
Sbjct: 184 NSKCQFEVLY-GDGTQVDGTLASDAITLGSQYLPN------FSFGCAESLSEDTYSSPGL 236
Query: 242 PNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--GSDGTGRISFGDKGSPGQGETPFSLR 299
G T P+ L +FS C S +G + G + + F+
Sbjct: 237 MGLGGGSLSLLTQAPT----AELFGGTFSYCLPSSSTSSGSLVLGKEAAVSSSSLKFTTL 292
Query: 300 QTHPT----YNITITQVSVGGNAVNFEFS-------AIFDSGTSFTYLNDPAYTQISETF 348
P+ Y +T+ +SVG ++ + I DSGT+ TYL AY + + F
Sbjct: 293 IKDPSFPTFYFVTLKAISVGNTRISVPATNIASGGGTIIDSGTTITYLVPSAYKDLRDAF 352
Query: 349 NSLAKEKRETSTSDLPFEYCYVLRS--------FLHLQALVVLPFP 386
+ T D+ + CY L S LHL V L P
Sbjct: 353 RQQLSSLQPTPVEDM--DTCYDLSSSSVDVPTITLHLDRNVDLVLP 396
>gi|356523155|ref|XP_003530207.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 412
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 88/307 (28%), Positives = 128/307 (41%), Gaps = 52/307 (16%)
Query: 66 RDRYFRLRGRGLAAQGN---DKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVAL 122
R R + R R + + N +T + S+G + LN + V++G + + V +
Sbjct: 28 RVRSMQNRIRRVVSSHNVEASQTQIPLSSGINLQTLNYI-------VTMGLGSTNMTVII 80
Query: 123 DTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE-LQ---- 176
DTGSDL W+ C+ C+SC + I+ P+TSS+ V CNS+ C+ LQ
Sbjct: 81 DTGSDLTWVQCEPCMSCYNQQGP---------IFKPSTSSSYQSVSCNSSTCQSLQFATG 131
Query: 177 --KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTG 234
C S S C Y V Y DG+ + G L + L SV S FGCGR G
Sbjct: 132 NTGACGSNPSTCNYVVNY-GDGSYTNGELGVEQLSFG-----GVSV-SDFVFGCGRNNKG 184
Query: 235 SFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGTGRISFGDKGSPGQ 291
F +GL GLG S+ S FS C S +G + G++ S +
Sbjct: 185 LF---GGVSGLMGLGRSYLSLVS--QTNATFGGVFSYCLPTTESGASGSLVMGNESSVFK 239
Query: 292 GETPFSLRQTHPT------YNITITQVSVGGNAVNF----EFSAIFDSGTSFTYLNDPAY 341
TP + + P Y + +T + V G A+ + DSGT T L Y
Sbjct: 240 NVTPITYTRMLPNPQLSNFYILNLTGIDVDGVALQVPSFGNGGVLIDSGTVITRLPSSVY 299
Query: 342 TQISETF 348
+ F
Sbjct: 300 KALKALF 306
>gi|356495752|ref|XP_003516737.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 396
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 78/306 (25%), Positives = 127/306 (41%), Gaps = 37/306 (12%)
Query: 97 RLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIY 155
R+ S + +++G P + +DTGSDL W C C C + ++
Sbjct: 42 RVTSNNGDYLMKLTLGTPPVDVYGLVDTGSDLVWAQCTPCQGCYRQKSP---------MF 92
Query: 156 SPNTSSTSSKVPCNSTLCE-LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
P S+T + +PC+S C L S C Y Y +D +++ G L + + ++ +
Sbjct: 93 EPLRSNTYTPIPCDSEECNSLFGHSCSPQKLCAYSYAY-ADSSVTKGVLARETVTFSSTD 151
Query: 215 KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNS--FSMC 272
+ V I FGCG +G+F + G+ S+++ G + S FS C
Sbjct: 152 GEPVVV-GDIVFGCGHSNSGTFNEND-----MGIIGLGGGPLSLVSQFGNLYGSKRFSQC 205
Query: 273 ---FGSD--GTGRISFGDKGS---PGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS 324
F +D G ISFGD G TP + Y +T+ +SVG V+F S
Sbjct: 206 LVPFHADPHTLGTISFGDASDVSGEGVAATPLVSEEGQTPYLVTLEGISVGDTFVSFNSS 265
Query: 325 AIF-------DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRSFLHL 377
+ DSGT TYL Y ++ + + DL + CY RS +L
Sbjct: 266 EMLSKGNIMIDSGTPATYLPQEFYDRLVKELKVQSNMLPIDDDPDLGTQLCY--RSETNL 323
Query: 378 QALVVL 383
+ +++
Sbjct: 324 EGPILI 329
>gi|326525377|dbj|BAK07958.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 463
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 68/264 (25%), Positives = 101/264 (38%), Gaps = 36/264 (13%)
Query: 117 SFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQ 176
++ +ALD G L W+ C+ C H L S ++ P S T S +P ++T+
Sbjct: 110 NYQLALDMGGGLSWM--QCLPCRHCLLQMS------PVFDPTKSPTFSNIPAHNTVWCRP 161
Query: 177 KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSF 236
P A C + + Y D T ++G+L D + S I FGC QT F
Sbjct: 162 PYQPLANGACGFDIAY-RDNTHASGYLARDTFSFPAGNDDFVPL-SAIVFGCAH-QTEHF 218
Query: 237 LDGAAPNGLFGLGMDKTSVPSILANQGLIP---NSFSMCFGSDGTGRISFGDKGSPGQGE 293
+ A G+ GLGM P + ++P FS C G S+ GS
Sbjct: 219 KNQRAVAGILGLGMGPAGKPPTAFTKQVLPAHGGRFSYCPFVPGMSMYSYLRFGSDIPSH 278
Query: 294 TPFSL-RQTHPT---------YNITITQVSVGGNAVNFEFSAIF------------DSGT 331
P ++ RQ+ P Y + + VSVG N ++ A+F D GT
Sbjct: 279 PPPNVHRQSTPVLAPAHNSEAYFVKLAGVSVGANRLSGVTPAMFRRNAHGAGGCVVDIGT 338
Query: 332 SFTYLNDPAYTQISETFNSLAKEK 355
T AY I + +
Sbjct: 339 RMTAFIHSAYVHIDHAVRQHLQRR 362
>gi|4063754|gb|AAC98462.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197956|gb|AAM15330.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
Length = 389
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 67/275 (24%), Positives = 115/275 (41%), Gaps = 46/275 (16%)
Query: 103 FLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
+ + + +G P LDTGS+ W C+ CVH N ++ I+ P+ SST
Sbjct: 57 YEYLMKLQIGTPPFEIEAVLDTGSEHIW--TQCLPCVHCYNQTA------PIFDPSKSST 108
Query: 163 SSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
++ +C + +CPY++ Y + + G LV + + + + Q +
Sbjct: 109 FKEI-----------RCDTHDHSCPYELVY-GGKSYTKGTLVTETVTIHSTSGQPFVMPE 156
Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRIS 282
I GCGR +G F G A G+ +G+D+ I G P S CF GT +I+
Sbjct: 157 TI-IGCGRNNSG-FKPGFA--GV--VGLDRGPKSLITQMGGEYPGLMSYCFAGKGTSKIN 210
Query: 283 FGDK---GSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVN--------FEFSAIFDSG 330
FG G T ++ P Y + + VSVG + + + + DSG
Sbjct: 211 FGANAIVAGDGVVSTTVFVKTAKPGFYYLNLDAVSVGNTRIETVGTPFHALKGNIVIDSG 270
Query: 331 TSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPF 365
++ TY E++ +L ++ E + + F
Sbjct: 271 STLTYF--------PESYCNLVRKAVEQVVTAVRF 297
>gi|30683732|ref|NP_180371.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|28392898|gb|AAO41885.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|56382011|gb|AAV85724.1| At2g28040 [Arabidopsis thaliana]
gi|330252978|gb|AEC08072.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 395
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 68/275 (24%), Positives = 114/275 (41%), Gaps = 46/275 (16%)
Query: 103 FLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
+ + + +G P LDTGS+ W C+ CVH N ++ I+ P+ SST
Sbjct: 63 YEYLMKLQIGTPPFEIEAVLDTGSEHIW--TQCLPCVHCYNQTA------PIFDPSKSST 114
Query: 163 SSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
++ +C + +CPY++ Y + + G LV + + + + Q +
Sbjct: 115 FKEI-----------RCDTHDHSCPYELVY-GGKSYTKGTLVTETVTIHSTSGQPFVMPE 162
Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRIS 282
I GCGR +G F G A G+ GL D+ I G P S CF GT +I+
Sbjct: 163 TI-IGCGRNNSG-FKPGFA--GVVGL--DRGPKSLITQMGGEYPGLMSYCFAGKGTSKIN 216
Query: 283 FGDK---GSPGQGETPFSLRQTHPT-YNITITQVSVGGNAVN--------FEFSAIFDSG 330
FG G T ++ P Y + + VSVG + + + + DSG
Sbjct: 217 FGANAIVAGDGVVSTTVFVKTAKPGFYYLNLDAVSVGNTRIETVGTPFHALKGNIVIDSG 276
Query: 331 TSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPF 365
++ TY E++ +L ++ E + + F
Sbjct: 277 STLTYF--------PESYCNLVRKAVEQVVTAVRF 303
>gi|413952262|gb|AFW84911.1| hypothetical protein ZEAMMB73_904583 [Zea mays]
Length = 312
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 48/140 (34%), Positives = 68/140 (48%), Gaps = 14/140 (10%)
Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDG-AAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC 272
+Q+ + + I FGC Q+G A +G+FG G + SV S L + G+ P FS C
Sbjct: 9 NEQTANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHC 68
Query: 273 F-GSD-GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFS------ 324
GSD G G + G+ PG TP L + P YN+ + ++V G + + S
Sbjct: 69 LKGSDNGGGILVLGEIVEPGLVYTP--LVPSQPHYNLNLESIAVNGQKLPIDSSLFTTSN 126
Query: 325 ---AIFDSGTSFTYLNDPAY 341
I DSGT+ YL D AY
Sbjct: 127 TQGTIVDSGTTLAYLADGAY 146
>gi|18391062|ref|NP_563851.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|2160166|gb|AAB60729.1| F21M12.13 gene product [Arabidopsis thaliana]
gi|21593996|gb|AAM65914.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|26983826|gb|AAN86165.1| unknown protein [Arabidopsis thaliana]
gi|332190367|gb|AEE28488.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 449
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 95/330 (28%), Positives = 129/330 (39%), Gaps = 58/330 (17%)
Query: 64 AHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALD 123
+HR Y L A T + ++GN + N + +G P + LD
Sbjct: 72 SHRLTYLS----SLVAGKPKPTSVPVASGNQLHIGN-----YVVRAKLGTPPQLMFMVLD 122
Query: 124 TGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQK--QCP 180
T +D WLPC C C + S + +SST S V C++ C + CP
Sbjct: 123 TSNDAVWLPCSGCSGCSNASTSFNTN----------SSSTYSTVSCSTAQCTQARGLTCP 172
Query: 181 SAG---SNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFL 237
S+ S C + Y D + S LV+D L LA D V SFGC +G+ L
Sbjct: 173 SSSPQPSVCSFNQSYGGDSSFSAS-LVQDTLTLAPD------VIPNFSFGCINSASGNSL 225
Query: 238 DGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDG----TGRISFGDKGSPGQGE 293
P GL GLG S+ + L FS C S +G + G G P
Sbjct: 226 ---PPQGLMGLGRGPMSL--VSQTTSLYSGVFSYCLPSFRSFYFSGSLKLGLLGQPKSIR 280
Query: 294 -TPFSLRQTHPT-YNITITQVSVGG-----NAVNFEFSA------IFDSGTSFTYLNDPA 340
TP P+ Y + +T VSVG + V F A I DSGT T P
Sbjct: 281 YTPLLRNPRRPSLYYVNLTGVSVGSVQVPVDPVYLTFDANSGAGTIIDSGTVITRFAQPV 340
Query: 341 YTQISETFNSLAKEKRETSTSDL-PFEYCY 369
Y I + F K+ +S S L F+ C+
Sbjct: 341 YEAIRDEFR---KQVNVSSFSTLGAFDTCF 367
>gi|449509162|ref|XP_004163513.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 417
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 83/293 (28%), Positives = 120/293 (40%), Gaps = 61/293 (20%)
Query: 97 RLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIY 155
RL +L ++ V+VG + + +DTGSDL W+ C C C + ++
Sbjct: 60 RLQTLNYI----VTVGIGGQNSTLIVDTGSDLTWVQCLPCRLCYNQQEP---------LF 106
Query: 156 SPNTSSTSSKVPCNSTLC-ELQKQCPSAG-------SNCPYQVRYLSDGTMSTGFLVEDV 207
+P+ SS+ +PCNS C LQ S+G ++C YQ+ Y DG+ S G L +
Sbjct: 107 NPSNSSSFLSLPCNSPTCVALQPTAGSSGLCSNKNSTSCDYQIDY-GDGSYSRGELGFEK 165
Query: 208 LHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPN 267
L L E +D+ I FGCGR G F +GL GL + S+ S L +
Sbjct: 166 LTLGKTE-----IDNFI-FGCGRNNKGLF---GGASGLMGLARSELSLVS--QTSSLFGS 214
Query: 268 SFSMCFGSDGTGRISFGDKGSPGQGETPFS-LRQTHPT--------------YNITITQV 312
FS C + G G GS G FS + P Y + +T +
Sbjct: 215 VFSYCLPTTGV-----GSSGSLTLGGADFSNFKNISPISYTRMIQNPQMSNFYFLNLTGI 269
Query: 313 SVGGNAVNFE-------FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRET 358
S+GG +N ++ DSGT T L+ Y F R T
Sbjct: 270 SIGGVNLNVPRLSSNEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRTT 322
>gi|325183198|emb|CCA17656.1| aspartyl protease family A01B putative [Albugo laibachii Nc14]
Length = 656
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 76/306 (24%), Positives = 129/306 (42%), Gaps = 32/306 (10%)
Query: 74 GRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPC 133
R L + L S N+ LN HY + VG P + +DTGS + PC
Sbjct: 64 ARTLQIAKTYRRSLFTSDQNEVVPLNLGMGTHYAWIYVGTPPQRVSIIIDTGSGMTAFPC 123
Query: 134 D-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRY 192
C C + + I FN N SS+ + CN C + C R
Sbjct: 124 SGCDQCGNHTD------IPFNT---NLSSSIQPISCNHRTYFSCAYCTNPTEPC----RT 170
Query: 193 LSDGTMSTGFLVEDVLHL-----ATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFG 247
+G+ + ++ED+++L A D S +R FGC +TG F+ A +G+ G
Sbjct: 171 YMEGSSWSAKVMEDIVYLGDVASAKDTNLHHSYSTRYMFGCQNKETGLFIPQVA-DGIMG 229
Query: 248 LGMDKTSVPSILANQGLIP-NSFSMCFGSDGTGRISFGDKG-SPGQGETPFSLRQT---H 302
+ + + + L + IP N+F++CF G G + G S GE ++
Sbjct: 230 IHNNGNDIVTKLFREKKIPSNTFTLCFSPRG-GYFALGAMDTSRHAGEVTYARINDAYGE 288
Query: 303 PTYNITITQVSVGGNAVNFEFSA------IFDSGTSFTYLNDPAYTQISETFNSLAKEKR 356
Y + +T + VGG++++ + A I DSGT+ + ++ A + + + +L K
Sbjct: 289 NYYAVFMTDIRVGGHSIDIDMKATNSYRYIVDSGTTNSIISGRAGQALMDLYRNLTHLKN 348
Query: 357 ETSTSD 362
+ +D
Sbjct: 349 PLNDND 354
>gi|217073142|gb|ACJ84930.1| unknown [Medicago truncatula]
Length = 191
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 46/153 (30%), Positives = 73/153 (47%), Gaps = 21/153 (13%)
Query: 61 SALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIV 120
S + H D + RGR L++ + F+ G + L + L++T + +G P + V
Sbjct: 37 SGIKHHDHH--RRGRFLSS-------VDFNLGGNG--LPTRTGLYFTKLGLGSPKKDYYV 85
Query: 121 ALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQC 179
+DTGSD+ W+ C +C C + S +D +Y P S TS + C+ C
Sbjct: 86 QVDTGSDILWVNCVECSRCP----TKSQIGMDLTLYDPKGSHTSELISCDHEFCSSTYDG 141
Query: 180 PSAG----SNCPYQVRYLSDGTMSTGFLVEDVL 208
P G + CPY + Y DG+ +TG+ V D L
Sbjct: 142 PIPGCRAETPCPYSITY-GDGSATTGYYVRDYL 173
>gi|225217039|gb|ACN85323.1| aspartic proteinase nepenthesin-1 precursor [Oryza brachyantha]
Length = 287
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 67/204 (32%), Positives = 93/204 (45%), Gaps = 26/204 (12%)
Query: 182 AGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAA 241
+G +C Y V+Y DG+ + GF D L L++ + FGCG G F + A
Sbjct: 17 SGGHCLYGVQY-GDGSYTIGFFAMDTLTLSSHDAIKG-----FRFGCGERNEGLFGEAA- 69
Query: 242 PNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--GSDGTGRISFGDKGSPGQGE----TP 295
GL GLG KTS+P ++ F+ CF S GTG + FG SP TP
Sbjct: 70 --GLLGLGRGKTSLPVQTYDK--YGGVFAHCFPARSSGTGYLEFGPGSSPAVSAKLSTTP 125
Query: 296 FSLRQTHPT-YNITITQVSVGGNAVNFE---FSA---IFDSGTSFTYLNDPAYTQISETF 348
L T PT Y + +T + VGG + F+A I DSGT T L AY+ + F
Sbjct: 126 M-LIDTGPTFYYVGMTGIRVGGKLLPIPQSVFAAAGTIVDSGTVITRLPPAAYSSLRSAF 184
Query: 349 -NSLAKEKRETSTSDLPFEYCYVL 371
S+A + + + + CY L
Sbjct: 185 AASMAARGYKRAPALSLLDTCYDL 208
>gi|326530426|dbj|BAJ97639.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 479
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 79/314 (25%), Positives = 116/314 (36%), Gaps = 70/314 (22%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ V VG P + +D+GSD+ W+ C C C + ++ P S++
Sbjct: 133 YFVRVGVGSPPTEQYLVVDSGSDVIWIQCRPCAECYQQADP---------LFDPAASASF 183
Query: 164 SKVPCNSTLCEL----QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
+ VPC+S +C C +G+ C YQV Y DG+ + G L + L S
Sbjct: 184 TAVPCDSGVCRTLPGGSSGCADSGA-CRYQVSY-GDGSYTQGVLAMETLTFG----DSTP 237
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSD--- 276
V ++ GCG G F+ A GL GLG S+ L +FS C S
Sbjct: 238 VQG-VAIGCGHRNRGLFVGAA---GLLGLGWGPMSLVGQLGGA--AGGAFSYCLASRGAD 291
Query: 277 -GTGRISFG-DKGSP-GQGETPFSLRQTHPTYNITITQVSV------------------G 315
G G + FG D P G P P++ G
Sbjct: 292 AGAGSLVFGRDDAMPVGAVWVPLLRNAQQPSFYYVGLTGLGVGGERLPLQDGLFDLTEDG 351
Query: 316 GNAVNFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP-------FEYC 368
G V + D+GT+ T L AY + + F S T DLP + C
Sbjct: 352 GGGV------VMDTGTAVTRLPPDAYAALRDAFAS-------TIGGDLPRAPGVSLLDTC 398
Query: 369 YVLRSFLHLQALVV 382
Y L + ++ V
Sbjct: 399 YDLSGYASVRVPTV 412
>gi|302786560|ref|XP_002975051.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
gi|300157210|gb|EFJ23836.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
Length = 359
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 85/321 (26%), Positives = 126/321 (39%), Gaps = 53/321 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+ +S+G P +DTGSDL WL CD +C H G+ I F+ + SS+
Sbjct: 5 YMMELSIGTPPQLIPAMIDTGSDLVWLKCD--NCDHCDLDHHGETIFFS----DASSSYK 58
Query: 165 KVPCNSTLCELQKQC---PSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD--EKQSKS 219
K+PCNST C P C Y+ Y DG+ ++G + D + + + +S
Sbjct: 59 KLPCNSTHCSGMSSAGIGPRCEETCKYKYEY-GDGSRTSGDVGSDRISFRSHGAGEDHRS 117
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT- 278
FGCGR G D GL GLG S+ L ++ + FS C S +
Sbjct: 118 FFDGFLFGCGRKLKG---DWNFTQGLIGLGQKSHSLIQQLGDK--LGYKFSYCLVSYDSP 172
Query: 279 -GRISFGDKGSPG--QGETPFSLRQTH------PTYNITITQVSVGGNAVN--------- 320
SF GS +G S H Y + + ++VGG V
Sbjct: 173 PSAKSFLFLGSSAALRGHDVVSTPILHGDHLDQTLYYVDLQSITVGGVPVVVYDKESGHN 232
Query: 321 ------FEFSAIFDSGTSFTYLNDPAYTQISETF----------NSLAKEKRETSTSDLP 364
+ DSGT++T L P Y + ++ NS + S+ D
Sbjct: 233 TSVGPFLANKTVIDSGTTYTLLTPPVYEAMRKSIEEQVILPTLGNSAGLDLCFNSSGDTS 292
Query: 365 FEYCYVLRSFLHLQALVVLPF 385
+ + V F + Q +VLPF
Sbjct: 293 YGFPSVTFYFAN-QVQLVLPF 312
>gi|449436215|ref|XP_004135889.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 496
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 83/293 (28%), Positives = 120/293 (40%), Gaps = 61/293 (20%)
Query: 97 RLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIY 155
RL +L ++ V+VG + + +DTGSDL W+ C C C + ++
Sbjct: 139 RLQTLNYI----VTVGIGGQNSTLIVDTGSDLTWVQCLPCRLCYNQQEP---------LF 185
Query: 156 SPNTSSTSSKVPCNSTLC-ELQKQCPSAG-------SNCPYQVRYLSDGTMSTGFLVEDV 207
+P+ SS+ +PCNS C LQ S+G ++C YQ+ Y DG+ S G L +
Sbjct: 186 NPSNSSSFLSLPCNSPTCVALQPTAGSSGLCSNKNSTSCDYQIDY-GDGSYSRGELGFEK 244
Query: 208 LHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPN 267
L L E +D+ I FGCGR G F +GL GL + S+ S L +
Sbjct: 245 LTLGKTE-----IDNFI-FGCGRNNKGLF---GGASGLMGLARSELSLVS--QTSSLFGS 293
Query: 268 SFSMCFGSDGTGRISFGDKGSPGQGETPFS-LRQTHPT--------------YNITITQV 312
FS C + G G GS G FS + P Y + +T +
Sbjct: 294 VFSYCLPTTGV-----GSSGSLTLGGADFSNFKNISPISYTRMIQNPQMSNFYFLNLTGI 348
Query: 313 SVGGNAVNFE-------FSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRET 358
S+GG +N ++ DSGT T L+ Y F R T
Sbjct: 349 SIGGVNLNVPRLSSNEGVLSLLDSGTVITRLSPSIYKAFKAEFEKQFSGYRTT 401
>gi|326513755|dbj|BAJ87896.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 442
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 84/313 (26%), Positives = 119/313 (38%), Gaps = 49/313 (15%)
Query: 68 RYFRLRGRGLAAQ-----GNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQP-ALSFIVA 121
R +R R AA G P T G +NS +H +S+G P + ++
Sbjct: 53 RRMVVRSRARAANLCPYSGATARPATAPVGRANTDVNSEYLIH---LSIGAPRSQPVVLT 109
Query: 122 LDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQKQCP 180
LDTGSD+ W C+ C C + + F+ + NT + V C+ LC +
Sbjct: 110 LDTGSDVVWTQCEPCAECF------TQPLPRFDTAASNTVRS---VACSDPLCNAHSEHG 160
Query: 181 SAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGA 240
C Y Y DG++S G + D + K I FGCG G FL
Sbjct: 161 CFLHGCTYVSGY-GDGSLSFGHFLRDSFTFDDGKGGGKVTVPDIGFGCGMYNAGRFLQ-- 217
Query: 241 APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRIS------FGDKGSPGQG-- 292
G+ G G S+PS L + FS CF + + S GD + G
Sbjct: 218 TETGIAGFGRGPLSLPSQLKVR-----QFSYCFTTRFEAKSSPVFLGGAGDLKAHATGPI 272
Query: 293 -ETPFSLRQTHP-----TYNITITQVSVGGNAVNF-EFSA------IFDSGTSFTYLNDP 339
TPF +R P Y ++ V+VG + E A DSGT T D
Sbjct: 273 LSTPF-VRSLPPGTDNSHYVLSFKGVTVGKTRLPVPEIKADGSGATFIDSGTDITTFPDA 331
Query: 340 AYTQISETFNSLA 352
+ Q+ F + A
Sbjct: 332 VFRQLKSAFIAQA 344
>gi|308044575|ref|NP_001183392.1| uncharacterized protein LOC100501808 [Zea mays]
gi|238011188|gb|ACR36629.1| unknown [Zea mays]
Length = 342
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 80/285 (28%), Positives = 111/285 (38%), Gaps = 54/285 (18%)
Query: 120 VALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCEL--Q 176
+ LDTGSD+ W+ C C C SG V D P SS+ V C + LC
Sbjct: 1 MVLDTGSDVVWVQCAPCRRCYE----QSGPVFD-----PRRSSSYGAVGCGAALCRRLDS 51
Query: 177 KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSF 236
C C YQV Y DG+++ G V + L A + +R++ GCG G F
Sbjct: 52 GGCDLRRGACMYQVAY-GDGSVTAGDFVTETLTFAGGARV-----ARVALGCGHDNEGLF 105
Query: 237 LDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF------------GSDGTGRISFG 284
+ A GL S P+ ++ + SFS C GS + +SFG
Sbjct: 106 VAAAGLLGLG---RGGLSFPTQISRR--YGRSFSYCLVDRTSSGAGAAPGSHRSSTVSFG 160
Query: 285 DKGSPGQGETPFSLRQTHPT----YNITITQVSVGGNAV-------------NFEFSAIF 327
GS G F+ +P Y + + +SVGG V I
Sbjct: 161 -AGSVGASSASFTPMVRNPRMETFYYVQLVGISVGGARVPGVAESDLRLDPSTGRGGVIV 219
Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLP-FEYCYVL 371
DSGTS T L +Y+ + + F + A S F+ CY L
Sbjct: 220 DSGTSVTRLARASYSALRDAFRAAAAGGLRLSPGGFSLFDTCYDL 264
>gi|54290728|dbj|BAD62398.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 486
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 77/286 (26%), Positives = 112/286 (39%), Gaps = 34/286 (11%)
Query: 98 LNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYS 156
L++L F+ V +G PA + DTGSDL W+ C C S H ++
Sbjct: 139 LDTLEFV--VAVGLGTPAQPSALIFDTGSDLSWVQCQPCGSSGHCHPQQD------PLFD 190
Query: 157 PNTSSTSSKVPCNSTLCELQKQ-CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEK 215
P+ SST + V C C C + C Y VRY DG+ +TG L D L L +
Sbjct: 191 PSKSSTYAAVHCGEPQCAAAGDLCSEDNTTCLYLVRY-GDGSSTTGVLSRDTLALTSSRA 249
Query: 216 QSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
+ FGCG G F G L + + A+ G + FS C S
Sbjct: 250 LTG-----FPFGCGTRNLGDF--GRVDGLLGLGRGELSLPSQAAASFGAV---FSYCLPS 299
Query: 276 DG--TGRISFGDKGSPGQGETPFSLRQTHPT----YNITITQVSVGG------NAVNFEF 323
TG ++ G + G ++ P Y + + + +GG AV
Sbjct: 300 SNSTTGYLTIGATPATDTGAAQYTAMLRKPQFPSFYFVELVSIDIGGYVLPVPPAVFTRG 359
Query: 324 SAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
+ DSGT TYL AY + + F L E+ + + + CY
Sbjct: 360 GTLLDSGTVLTYLPAQAYALLRDRFR-LTMERYTPAPPNDVLDACY 404
>gi|356528623|ref|XP_003532899.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 507
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 87/403 (21%), Positives = 147/403 (36%), Gaps = 77/403 (19%)
Query: 35 HHRYS---------DPVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKT 85
H R+S + VKG + D L ++ + +++ DR R +GL +
Sbjct: 42 HERFSGGGGDVDQVEAVKGFVNRDGLRRQRMNQRW-GVSNYDR----RRKGLETTTTTEV 96
Query: 86 PLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSS 145
+ AG D ++LG ++T V VG P F +A DTGS+ W C + +
Sbjct: 97 EMPMRAGRD----DALG-EYFTEVKVGSPGQRFWLAADTGSEFTWFNCVMRNATTTATTK 151
Query: 146 SGQVIDF------------------------------NIYSPNTSSTSSKVPCNSTLCEL 175
+ ++ P+ S + V C S C++
Sbjct: 152 KTRKNKTKKKHHHHSKRNRTRTTRRTKKKKAKSNPCKGVFCPHRSKSFQAVTCASQKCKI 211
Query: 176 Q-------KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGC 228
CP C Y + Y +DG+ + GF D + + + +++ ++ GC
Sbjct: 212 DLSQLFSLSLCPKPSDPCLYDISY-ADGSSAKGFFGTDTITVDLKNGKEGKLNN-LTIGC 269
Query: 229 GRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGR--ISFGDK 286
+ G+ GLG K S A + FS C + R S+
Sbjct: 270 TKSMENGVNFNEDTGGILGLGFAKDSFIDKAAYE--YGAKFSYCLVDHLSHRNVSSYLTI 327
Query: 287 GSPGQGETPFSLRQTH-----PTYNITITQVSVGGNAV---------NFEFSAIFDSGTS 332
G + +++T P Y + + +S+GG + N + + DSGT+
Sbjct: 328 GGHHNAKLLGEIKRTELILFPPFYGVNVVGISIGGQMLKIPPQVWDFNSQGGTLIDSGTT 387
Query: 333 FTYLNDPAYTQISETF-NSLAKEKRETSTSDLPFEYCYVLRSF 374
T L PAY + E SL K KR T ++C+ F
Sbjct: 388 LTALLVPAYEPVFEALIKSLTKVKRVTGEDFGALDFCFDAEGF 430
>gi|125543638|gb|EAY89777.1| hypothetical protein OsI_11319 [Oryza sativa Indica Group]
Length = 390
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 78/280 (27%), Positives = 116/280 (41%), Gaps = 54/280 (19%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ ++++G P + LDTGSDL W C CVSC Q + + + + SST+
Sbjct: 35 YLVHLAIGTPPQPVQLTLDTGSDLIWTQCKPCVSCFD-------QPLPY--FDTSRSSTN 85
Query: 164 SKVPCNSTLCELQK------QCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
+ +PC ST C+L + C Y Y D +++ G L D
Sbjct: 86 ALLPCESTQCKLDPTVTVCVKLNQTVQTCAYYTSY-GDNSVTIGLLAADKFTFVAGTSLP 144
Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDG 277
++FGCG TG F + G+ G G S+PS L +FS CF +
Sbjct: 145 G-----VTFGCGLNNTGVF--NSNETGIAGFGRGPLSLPSQLKV-----GNFSHCFTTI- 191
Query: 278 TGRISF-------GDKGSPGQGE---TP---FSLRQTHPT-YNITITQVSVGGNAVNFEF 323
TG I D S GQG TP ++ + +PT Y +++ ++VG +
Sbjct: 192 TGAIPSTVLLDLPADLFSNGQGAVQTTPLIQYAKNEANPTLYYLSLKGITVGSTRLPVPE 251
Query: 324 SA----------IFDSGTSFTYLNDPAYTQISETFNSLAK 353
SA I DSGTS T L Y + + F + K
Sbjct: 252 SAFALTNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIK 291
>gi|255566010|ref|XP_002523993.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536720|gb|EEF38361.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 439
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 67/259 (25%), Positives = 106/259 (40%), Gaps = 40/259 (15%)
Query: 110 SVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPC 168
S+G PA + DTGSDL W C C C D ++ P +SST + C
Sbjct: 97 SLGTPAFDILAIADTGSDLIWTQCKPCDQCYEQ---------DAPLFDPKSSSTYRDISC 147
Query: 169 NSTLCELQKQ---CPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRI 224
++ C+L K+ C G+ C Y Y D + ++G + D + L + + + I
Sbjct: 148 STKQCDLLKEGASCSGEGNKTCHYSYSY-GDRSFTSGNVAADTITLGSTSGRPVLLPKAI 206
Query: 225 SFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQ--GLIPNSFSMCF-----GSDG 277
GCG GSF + + + P L +Q I FS C +
Sbjct: 207 -IGCGHNNGGSFTEKGS------GIVGLGGGPISLISQLGSTIDGKFSYCLVPLSSNATN 259
Query: 278 TGRISFGDKG---SPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF--------EFSAI 326
+ +++FG G G TP + Y +T+ VSVG + F E + I
Sbjct: 260 SSKLNFGSNGIVSGGGVQSTPLISKDPDTFYFLTLEAVSVGSERIKFPGSSFGTSEGNII 319
Query: 327 FDSGTSFTYLNDPAYTQIS 345
DSGT+ T + ++++S
Sbjct: 320 IDSGTTLTLFPEDFFSELS 338
>gi|15231625|ref|NP_191467.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|15983376|gb|AAL11556.1|AF424562_1 AT3g59080/F17J16_130 [Arabidopsis thaliana]
gi|7529751|emb|CAB86936.1| putative protein [Arabidopsis thaliana]
gi|20466704|gb|AAM20669.1| putative protein [Arabidopsis thaliana]
gi|23198236|gb|AAN15645.1| putative protein [Arabidopsis thaliana]
gi|332646352|gb|AEE79873.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 535
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 78/283 (27%), Positives = 110/283 (38%), Gaps = 46/283 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ +V VG P F + LDTGSDL W+ C C C + Y P S++
Sbjct: 170 YFMDVLVGSPPKHFSLILDTGSDLNWIQCLPCYDCFQQNGA---------FYDPKASASY 220
Query: 164 SKVPCNSTLCELQKQ------CPSAGSNCPYQVRYLSDGTMSTGFLVED-VLHLATDEKQ 216
+ CN C L C S +CPY Y + F VE ++L T+
Sbjct: 221 KNITCNDQRCNLVSSPDPPMPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTNGGS 280
Query: 217 SKSVD-SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF-- 273
S+ + + FGCG G F A L GLG S S L Q L +SFS C
Sbjct: 281 SELYNVENMMFGCGHWNRGLFHGAAG---LLGLGRGPLSFSSQL--QSLYGHSFSYCLVD 335
Query: 274 ---GSDGTGRISFGDKGS----PGQGETPFSLRQTHPT---YNITITQVSVGGNAVNFEF 323
++ + ++ FG+ P T F + + Y + I + V G +N
Sbjct: 336 RNSDTNVSSKLIFGEDKDLLSHPNLNFTSFVAGKENLVDTFYYVQIKSILVAGEVLNIPE 395
Query: 324 S-----------AIFDSGTSFTYLNDPAYTQISETFNSLAKEK 355
I DSGT+ +Y +PAY I AK K
Sbjct: 396 ETWNISSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGK 438
>gi|357143847|ref|XP_003573077.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 420
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 79/293 (26%), Positives = 117/293 (39%), Gaps = 46/293 (15%)
Query: 97 RLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIY 155
RL+S+ + +++G+P + F+ DTGSDL W C C C D +Y
Sbjct: 63 RLHSVQVEYLMELAIGKPPVPFVALADTGSDLTWTQCQPCKLCFPQ---------DTPVY 113
Query: 156 SPNTSSTSSKVPCNSTLCE--LQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATD 213
P+ SST S +PC+S C + C + S C Y+ Y DG S G L + L L
Sbjct: 114 DPSASSTFSPLPCSSATCLPIWSRNC-TPSSLCRYRYAY-GDGAYSAGILGTETLTLGPS 171
Query: 214 EKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC- 272
++FGCG G L+ G GLG S+LA G+ FS C
Sbjct: 172 SAPVSV--GGVAFGCGTDNGGDSLNS---TGTVGLGRGTL---SLLAQLGV--GKFSYCL 221
Query: 273 ---FGSDGTGRISFGDKGSPGQG-----ETPFSLRQTHPT-YNITITQVSVGGNAV---- 319
F S G G TP +P+ Y +++ +S+G +
Sbjct: 222 TDFFNSALDSPFLLGTLAELAPGPSTVQSTPLLQSPQNPSRYFVSLQGISLGDVRLPIPN 281
Query: 320 -NFEFSA------IFDSGTSFTYLNDPAYTQ-ISETFNSLAKEKRETSTSDLP 364
F+ I DSGT+FT L + + + + L + S+ D P
Sbjct: 282 GTFDLRGDGTGGMIVDSGTTFTILAESGFREVVGRVARVLGQPPVNASSLDAP 334
>gi|238010910|gb|ACR36490.1| unknown [Zea mays]
gi|413942664|gb|AFW75313.1| hypothetical protein ZEAMMB73_520329 [Zea mays]
Length = 481
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 71/252 (28%), Positives = 109/252 (43%), Gaps = 34/252 (13%)
Query: 114 PALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC 173
P + V LD+ SD+ W+ CV C + QV F Y P+ S +S+ C+S C
Sbjct: 155 PGVIQTVVLDSASDVPWV--QCVPCP--IPPCHPQVDSF--YDPSRSPSSAPFSCSSPTC 208
Query: 174 ----ELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCG 229
C A + C Y VRY DG+ ++G + D+L L + + S FGC
Sbjct: 209 TALGPYANGC--ANNQCQYLVRY-PDGSSTSGAYIADLLTL-----DAGNAVSGFKFGCS 260
Query: 230 RVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSP 289
+ GSF AA G+ LG S+ S A++ N+FS C + + F G P
Sbjct: 261 HAEQGSFDARAA--GIMALGGGPESLLSQTASR--YGNAFSYCIPATASDS-GFFTLGVP 315
Query: 290 GQGETPFSL------RQTHPTYNITITQVSVGGNAVN-----FEFSAIFDSGTSFTYLND 338
+ + + + RQ Y + + ++VGG + F ++ DS T+ T L
Sbjct: 316 RRASSRYVVTPMVRFRQAATFYGVLLRTITVGGQRLGVAPAVFAAGSVLDSRTAITRLPP 375
Query: 339 PAYTQISETFNS 350
AY + F S
Sbjct: 376 TAYQALRSAFRS 387
>gi|242084336|ref|XP_002442593.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
gi|241943286|gb|EES16431.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
Length = 482
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 79/301 (26%), Positives = 116/301 (38%), Gaps = 55/301 (18%)
Query: 105 HYTNVSVGQP-----ALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPN 158
+ ++VG P + +++ D GSD+ WL C C C H +Y+
Sbjct: 125 YIAKITVGTPYENDSSFEALLSPDMGSDVTWLQCMPCFRCYHQPGP---------VYNRL 175
Query: 159 TSSTSSKVPCNSTLCEL---QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEK 215
SS++S V C + C C + C Y+V Y DG+ S G + L +
Sbjct: 176 KSSSASDVGCYAPACRALGSSGGCVQFLNECQYKVEY-GDGSSSAGDFGVETLTFPPGVR 234
Query: 216 QSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS 275
++ GCG G F AA G+ GLG S PS +A G SFS C
Sbjct: 235 VPG-----VAIGCGSDNQGLFPAPAA--GILGLGRGSLSFPSQIA--GRYGRSFSYCLAG 285
Query: 276 DGTG----RISFGDKGSPGQGETP-------FSLRQTHPTYNITITQVSVGGNAVNF--- 321
GTG ++FG S T + + + Y + + +SVGG V
Sbjct: 286 QGTGGRSSTLTFGSGASATTTTTTPPSFTPMLTNSRMYTFYYVGLVGISVGGVRVRGVTE 345
Query: 322 ----------EFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEY---C 368
I DSGT+ T L+ PAY + F A ++ + PF + C
Sbjct: 346 SDLRLDPSTGHGGVIVDSGTAVTRLSGPAYAAFRDAFRVAAVKELGWPSPGGPFAFFDTC 405
Query: 369 Y 369
Y
Sbjct: 406 Y 406
>gi|242085924|ref|XP_002443387.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
gi|241944080|gb|EES17225.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
Length = 460
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 56/168 (33%), Positives = 73/168 (43%), Gaps = 23/168 (13%)
Query: 111 VGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNS 170
+G P +DTGS+L W C N GQ D Y P+ S T+ V CN
Sbjct: 90 IGDPPQQAAAIIDTGSNLIWTQCSTCRA----NGCFGQ--DLTFYDPSRSRTAKPVACND 143
Query: 171 TLCEL--QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGC 228
T C L + +C G C Y + GFL +V QS + ++FGC
Sbjct: 144 TACLLGSETRCARDGKACAVLTAYGAGAI--GGFLGTEVFTFG--HGQSSENNVSLAFGC 199
Query: 229 ---GRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
R+ GS LDGA+ G+ GLG K S+PS L + N FS C
Sbjct: 200 ITASRLTPGS-LDGAS--GIIGLGRGKLSLPSQLGD-----NKFSYCL 239
>gi|449441139|ref|XP_004138341.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449477464|ref|XP_004155031.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 336
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 80/278 (28%), Positives = 112/278 (40%), Gaps = 35/278 (12%)
Query: 111 VGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNS 170
VGQP LDTGSD+ WL C+ C G N Q+ I+ P SS+ + V C+S
Sbjct: 3 VGQPQQPSFFVLDTGSDVTWL--QCLPCA-GKNGCYEQITP--IFDPELSSSYNPVSCDS 57
Query: 171 TLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGR 230
C+L + ++C Y+V Y DG+ + G L + L S S+ IS GCG
Sbjct: 58 EQCQLLDEAGCNVNSCIYKVEY-GDGSFTIGELATETLTFV----HSNSI-PNISIGCGH 111
Query: 231 VQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGTGRISFGDKG 287
G F+ GL G + +S L +SFS C S + F
Sbjct: 112 DNEGLFVGADGLIGLGGGAISISS--------QLKASSFSYCLVDIDSPSFSTLDFNTDP 163
Query: 288 SPGQGETPFSLRQTHPTYN-ITITQVSVGGNAV-----NFEFSA------IFDSGTSFTY 335
+P P++ + + +SVGG + FE I DSGT+ T
Sbjct: 164 PSDSLISPLVKNDRFPSFRYVKVIGMSVGGKPLPISSSRFEIDESGLGGIIVDSGTTITQ 223
Query: 336 LNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRS 373
L Y + E F L + PF+ CY L S
Sbjct: 224 LPSDVYEVLREAFLGLTT-NLPPAPEISPFDTCYDLSS 260
>gi|255564685|ref|XP_002523337.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223537425|gb|EEF39053.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 469
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 84/303 (27%), Positives = 119/303 (39%), Gaps = 44/303 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++T + VG P + LDTGSD+ W+ C C C + S V D P S +
Sbjct: 126 YFTRIGVGTPPRYVYMVLDTGSDIVWIQCAPCKRCY----AQSDPVFD-----PRKSRSF 176
Query: 164 SKVPCNSTLCELQKQ--CPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
+ + C S LC C + C YQV Y DG+ + G + L ++
Sbjct: 177 ASIACRSPLCHRLDSPGCNTQKQTCMYQVSY-GDGSFTFGDFSTETLTF------RRTRV 229
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF----GSDG 277
+R++ GCG G F+ A GLG + S PS + + FS C S
Sbjct: 230 ARVALGCGHDNEGLFVGAAGLL---GLGRGRLSFPSQTGRR--FNHKFSYCLVDRSASSK 284
Query: 278 TGRISFGDKGSPGQGE-TPFSLRQTHPT-YNITITQVSVGGNAVN------FEFS----- 324
+ FGD TP T Y + + +SVGG V F+
Sbjct: 285 PSSMVFGDSAVSRTARFTPLVSNPKLDTFYYVELLGISVGGTRVPGITASLFKLDQTGNG 344
Query: 325 -AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRSFLHLQA-LVV 382
I DSGTS T L PAY + F + A + L F+ C+ L ++ VV
Sbjct: 345 GVIIDSGTSVTRLTRPAYIAFRDAFRAGASNLKRAPQFSL-FDTCFDLSGKTEVKVPTVV 403
Query: 383 LPF 385
L F
Sbjct: 404 LHF 406
>gi|242058093|ref|XP_002458192.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
gi|241930167|gb|EES03312.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
Length = 468
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 81/310 (26%), Positives = 130/310 (41%), Gaps = 54/310 (17%)
Query: 66 RDRYFRLR-GRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDT 124
R +Y R +G+ D + T G+ ++SL ++ V +G P++S ++ +DT
Sbjct: 90 RSKYIMSRVSKGMMGDDADVSIPTHLGGS----VDSLEYV--VTVGLGTPSVSQVLLIDT 143
Query: 125 GSDLFWL---PCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE------L 175
GSDL W+ PC+ +C + ++ P+ SST + +PCN+ C
Sbjct: 144 GSDLSWVQCQPCNSTTCYPQKDP---------LFDPSKSSTYAPIPCNTDACRDLTDDGY 194
Query: 176 QKQCPS--AGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQT 233
C S + C + + Y DG+ + G + L LA FGCG Q
Sbjct: 195 GGGCASGDGAAQCGFAITY-GDGSQTRGVYSNETLALAPGVAVKD-----FRFGCGHDQD 248
Query: 234 GSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS----------DGTGRISF 283
G+ +GL GLG S+ ++ + +FS C + G G S
Sbjct: 249 GA---NDKYDGLLGLGGAPESL--VVQTASVYGGAFSYCLPALNNQVGFLALGGGGAPSG 303
Query: 284 GDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSA-----IFDSGTSFTYLND 338
G + G TP +R+ Y + +T ++VGG ++ SA I DSGT T L
Sbjct: 304 GVVNTSGFVFTPM-IREEETFYVVNMTGITVGGEPIDVPPSAFSGGMIIDSGTVVTELQH 362
Query: 339 PAYTQISETF 348
AY + F
Sbjct: 363 TAYNALQAAF 372
>gi|115476830|ref|NP_001062011.1| Os08g0469100 [Oryza sativa Japonica Group]
gi|42407408|dbj|BAD09566.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113623980|dbj|BAF23925.1| Os08g0469100 [Oryza sativa Japonica Group]
Length = 373
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 73/296 (24%), Positives = 118/296 (39%), Gaps = 55/296 (18%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFN---IYSPNTSS 161
H V + QP + DTGSDL W C L+SS+ +Y P SS
Sbjct: 16 HSLTVGIVQPRKLIV---DTGSDLIWTQCK-------LSSSTAAAARHGSPPVYDPGESS 65
Query: 162 TSSKVPCNSTLCELQ----KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
T + +PC+ LC+ K C S + C Y+ Y S + G L +
Sbjct: 66 TFAFLPCSDRLCQEGQFSFKNCTSK-NRCVYEDVYGS--AAAVGVLASETFTFGA----R 118
Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FG 274
++V R+ FGCG + GS + G+ GL + S+ + L Q FS C F
Sbjct: 119 RAVSLRLGFGCGALSAGSLIGA---TGILGLSPESLSLITQLKIQ-----RFSYCLTPFA 170
Query: 275 SDGTGRISFGDKGSPGQGETPFSLRQT----HPT----YNITITQVSVGGNAVNFEFSA- 325
T + FG + +T ++ T +P Y + + +S+G + ++
Sbjct: 171 DKKTSPLLFGAMADLSRHKTTRPIQTTAIVSNPVETVYYYVPLVGISLGHKRLAVPAASL 230
Query: 326 ----------IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVL 371
I DSG++ YL + A+ + E + + T + +E C+VL
Sbjct: 231 AMRPDGGGGTIVDSGSTVAYLVEAAFEAVKEAVMDVVRLPVANRTVE-DYELCFVL 285
>gi|125547728|gb|EAY93550.1| hypothetical protein OsI_15341 [Oryza sativa Indica Group]
Length = 418
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 70/260 (26%), Positives = 102/260 (39%), Gaps = 43/260 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+ +++ G P + LDTGSD+ W C N + ++ P+ SS+ +
Sbjct: 88 YLVHLAAGTPPQEVQLTLDTGSDITWTQCKRCPASACFNQT------LPLFDPSASSSFA 141
Query: 165 KVPCNSTLCELQKQCPSAGSN------CPYQVRYLSDGTMSTGFLVEDVLHLA--TDEKQ 216
+PC+S CE C G N C Y + Y DG++S G + +V A T E
Sbjct: 142 SLPCSSPACETTPPC--GGGNDATSRPCNYSISY-GDGSVSRGEIGREVFTFASGTGEGS 198
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS- 275
S +V + FGCG G F G+ G G S+PS L +FS CF +
Sbjct: 199 SAAVPGLV-FGCGHANRGVFTSNE--TGIAGFGRGSLSLPSQLKV-----GNFSHCFTTI 250
Query: 276 --DGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNAVNFEFSAIFDSGTSF 333
T + G G +P R+ +Y T S +SGTS
Sbjct: 251 TGSKTSAVLLGLPGVAPPSASPLGRRRG--SYRCRSTPRSS-------------NSGTSI 295
Query: 334 TYLNDPAYTQISETFNSLAK 353
T L Y + E F + K
Sbjct: 296 TSLPPRTYRAVREEFAAQVK 315
>gi|449437856|ref|XP_004136706.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 457
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 87/324 (26%), Positives = 123/324 (37%), Gaps = 68/324 (20%)
Query: 84 KTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD----CVSCV 139
KTP + S +S G + T +S G P + + DTGS L W PC C C
Sbjct: 61 KTPKSNSVFKSPLSPHSYG-AYSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECS 119
Query: 140 HGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC------ELQKQCPSAG-------SNC 186
+G + P SS+S V C + C +++ QC S C
Sbjct: 120 FPKIDPTG----IPRFVPKLSSSSKLVGCQNPKCSWIFGPDVKSQCRSCNPKTENCTQTC 175
Query: 187 P-YQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGL 245
P Y V+Y S T G L+ + L D+K V GC SFL P+G+
Sbjct: 176 PAYVVQYGSGST--AGLLLSETLDFP-DKKIPNFV-----VGC------SFLSIHQPSGI 221
Query: 246 FGLGMDKTSVPSILANQGLIPNSFSMCFGS--------------DGTGRISFGDKGSPGQ 291
G G S+PS + GL F+ C S D TG S G +P +
Sbjct: 222 AGFGRGSESLPSQM---GL--KKFAYCLASRKFDDSPHSGQLILDSTGVKSSGLTYTPFR 276
Query: 292 GETPFSLRQTHPTYNITITQVSVGGNAVNFEFS-----------AIFDSGTSFTYLNDPA 340
S Y + I ++ VG AV + +I DSG++FT+++ P
Sbjct: 277 QNPSVSNNAYKEYYYLNIRKIIVGNQAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPV 336
Query: 341 YTQISETFNS-LAKEKRETSTSDL 363
++ F LA R T L
Sbjct: 337 LEVVAREFEKQLANWTRATDVETL 360
>gi|326517745|dbj|BAK03791.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 556
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 76/278 (27%), Positives = 113/278 (40%), Gaps = 33/278 (11%)
Query: 103 FLHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
FL + +G P + +VA+DTG+ L ++ C+ + + +G++ D P+ S +
Sbjct: 204 FLFLMPIKLGTPPVWNLVAVDTGATLSFVQCEPCTLRCHKQTDAGEIFD-----PSKSES 258
Query: 163 SSKVPCNSTLCE-------LQ-KQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDE 214
S+V C+ C LQ K C +C Y + + + S G LV D L +
Sbjct: 259 FSRVGCSENKCRTVQRALHLQSKACMEKEDSCLYSMTFGGTSSYSVGKLVRDRLAIGKYA 318
Query: 215 KQSKSVDSRISFGCGRVQTGSFLDGAAPN---GLFGLGMDKTSVPSILANQGLIPNSFSM 271
K D FGC LD GL G + S +A + +FS
Sbjct: 319 KGYSFPD--FLFGCS-------LDTEYHQYEAGLVGFADEPFSFFEQVAPL-VNYKAFSY 368
Query: 272 CFGSD--GTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGNA-VNFEFSAIFD 328
CF SD TG +S GD TP L + Y + + +V V G A V I D
Sbjct: 369 CFPSDRRKTGYLSIGDYTRVNSTYTPLFLARQQSRYALKLDEVLVNGMALVTTPSEMIVD 428
Query: 329 SGTSFTYLNDPAYTQ----ISETFNSLAKEKRETSTSD 362
SG+ +T L +TQ I+E L + SD
Sbjct: 429 SGSRWTILLSDTFTQLDAAITEAMRPLGYNRNYYRGSD 466
>gi|225427554|ref|XP_002266533.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 447
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 81/302 (26%), Positives = 122/302 (40%), Gaps = 43/302 (14%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ N+S+G P +S DTGSDL W C C SC + I+ P S T
Sbjct: 95 YLMNISLGTPPVSMHGIADTGSDLLWRQCKPCDSCYEQIEP---------IFDPAKSKTY 145
Query: 164 SKVPCNSTLCEL---QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSV 220
+ C C Q C S + C Y Y DG+ ++G L D L + + + SV
Sbjct: 146 QILSCEGKSCSNLGGQGGC-SDDNTCIYSYSY-GDGSHTSGDLAVDTLTIGSTTGRPVSV 203
Query: 221 DSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDG 277
++ FGCG G+F +G +G+ + I + LI FS C G+D
Sbjct: 204 -PKVVFGCGHNNGGTF----ELHGSGLVGLGGGPLSMISQLRPLIGGRFSYCLVPLGNDP 258
Query: 278 --TGRISFGDKG---SPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF----------- 321
+ ++ FG +G G TP + RQ Y +T+ +SVG + +
Sbjct: 259 SVSSKMHFGSRGIVSGAGAVSTPLASRQPDTFYYLTLESMSVGSKKLAYKGFSKVGSPLA 318
Query: 322 ---EFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRSFLHLQ 378
E + I DSGT+ T L Y + S K +++ F CY S L +
Sbjct: 319 DADEGNIIIDSGTTLTLLPQDFYGTLESNVVSAIGGKPVRDPNNV-FSLCYSNLSGLRIP 377
Query: 379 AL 380
+
Sbjct: 378 TI 379
>gi|21450872|gb|AAK44106.2|AF370291_1 unknown protein [Arabidopsis thaliana]
Length = 375
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 92/317 (29%), Positives = 125/317 (39%), Gaps = 54/317 (17%)
Query: 77 LAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-C 135
L A T + ++GN + N + +G P + LDT +D WLPC C
Sbjct: 7 LVAGKPKPTSVPVASGNQLHIGN-----YVVRAKLGTPPQLMFMVLDTSNDAVWLPCSGC 61
Query: 136 VSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQK--QCPSAG---SNCPYQV 190
C + S + +SST S V C++ C + CPS+ S C +
Sbjct: 62 SGCSNASTSFNTN----------SSSTYSTVSCSTAQCTQARGLTCPSSSPQPSVCSFNQ 111
Query: 191 RYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGM 250
Y D + S LV+D L LA D V SFGC +G+ L P GL GLG
Sbjct: 112 SYGGDSSFSAS-LVQDTLTLAPD------VIPNFSFGCINSASGNSL---PPQGLMGLGR 161
Query: 251 DKTSVPSILANQGLIPNSFSMCFGSDG----TGRISFGDKGSPGQGE-TPFSLRQTHPT- 304
S+ + L FS C S +G + G G P TP P+
Sbjct: 162 GPMSL--VSQTTSLYSGVFSYCLPSFRSFYFSGSLKLGLLGQPKSIRYTPLLRNPRRPSL 219
Query: 305 YNITITQVSVGG-----NAVNFEFSA------IFDSGTSFTYLNDPAYTQISETFNSLAK 353
Y + +T VSVG + V F A I DSGT T P Y I + F K
Sbjct: 220 YYVNLTGVSVGSVQVPVDPVYLTFDANSGAGTIIDSGTVITRFAQPVYEAIRDEFR---K 276
Query: 354 EKRETSTSDL-PFEYCY 369
+ +S S L F+ C+
Sbjct: 277 QVNVSSFSTLGAFDTCF 293
>gi|115465777|ref|NP_001056488.1| Os05g0591300 [Oryza sativa Japonica Group]
gi|113580039|dbj|BAF18402.1| Os05g0591300 [Oryza sativa Japonica Group]
Length = 453
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 69/233 (29%), Positives = 95/233 (40%), Gaps = 25/233 (10%)
Query: 97 RLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIY 155
+LN FL V +G PA+ ++V +DTGS L W+ C C H + G + D
Sbjct: 47 KLNDFAFL--IPVKLGTPAVQYLVTMDTGSSLSWVQCRPCTIKCHVQPAKVGPIFD---- 100
Query: 156 SPNTSSTSSKVPCNSTLCEL--------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDV 207
P+ SST V C++++C K C C Y + Y S G V D
Sbjct: 101 -PSNSSTFRHVGCSTSICSYLGRTLRIQSKACMEWEDICLYTMSYGGGWAYSVGKAVTDR 159
Query: 208 LHLATDEKQSKSVD-SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIP 266
L L E ++ + FGC S A G+FGLG S I L
Sbjct: 160 LVLGGGETTRTTLSLANFVFGCSMDTQYSTHKEA---GIFGLGTSNYSFEQIAPL--LSY 214
Query: 267 NSFSMCFGSDGT--GRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGN 317
+FS C SD G +S G S G + F P Y+I +T ++V N
Sbjct: 215 KAFSYCLPSDEAHQGYLSIGPDSSGGVPTSMFP-GTPRPVYSIGMTGLTVTVN 266
>gi|147771308|emb|CAN69536.1| hypothetical protein VITISV_043237 [Vitis vinifera]
Length = 372
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 81/285 (28%), Positives = 121/285 (42%), Gaps = 44/285 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ +G PA + ++A+DT SD+ W+PC+ C+ C L FN SP S+T
Sbjct: 36 YIVRAKIGTPAQTMLMAMDTSSDVAWIPCNGCLGCSSTL---------FN--SP-ASTTY 83
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
+ C + C+ + G C + + Y G+ L +D + LATD
Sbjct: 84 KSLGCQAAQCKQVPKPTCGGGVCSFNLTY--GGSSLAANLSQDTITLATDAVPG------ 135
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS----DGTG 279
SFGC + TG L GL + S Q L ++FS C S + +G
Sbjct: 136 YSFGCIQKATGGSLPAQGLLGLGRGPLSLLS-----QTQNLYQSTFSYCLPSFKSLNFSG 190
Query: 280 RISFGDKGSPGQGE-TPFSLRQTHPT-YNITITQVSVGGNAV-----NFEFS------AI 326
+ G G P + + TP P+ Y + + V VG V +F F+ I
Sbjct: 191 SLRLGPVGQPKRIKYTPLLKNPRRPSLYFVNLMAVRVGRRVVDVPPGSFTFNPSTGAGTI 250
Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVL 371
FDSGT FT L PAY + + F + + T TS F+ CY +
Sbjct: 251 FDSGTVFTRLVTPAYIAVRDAFRNRVG-RNLTVTSLGGFDTCYTV 294
>gi|326515172|dbj|BAK03499.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 494
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 82/297 (27%), Positives = 113/297 (38%), Gaps = 41/297 (13%)
Query: 74 GRGLAAQGNDKTPL-TFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDTGSDLFW-- 130
R D TP T S G ++ ++ T + Q A+S V +DT SD+ W
Sbjct: 124 ARSTTVSNRDYTPSSTASVGTNSGTSKTIEKSDQTATNEHQDAVSQTVVVDTSSDIPWVQ 183
Query: 131 -LPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLC-ELQKQ----CPSAGS 184
LPC C + +Y P SST + +PC S C EL C
Sbjct: 184 CLPCPIPQC---------HLQKDPLYDPAKSSTFAPIPCGSPACKELGSSYGNGCSPTTD 234
Query: 185 NCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNG 244
C Y V Y DG +TG V D L ++ V FGC GSF + A G
Sbjct: 235 ECKYIVNY-GDGKATTGTYVTDTLTMS-----PTIVVKDFRFGCSHAVRGSFSNQNA--G 286
Query: 245 LFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISFGDKGSPGQGETPFS----LRQ 300
+ LG + S+ A+ N+FS C + F G P + FS ++
Sbjct: 287 ILALGGGRGSLLEQTADA--YGNAFSYCIPKPSSA--GFLSLGGPVEASLKFSYTPLIKN 342
Query: 301 TH-PT-YNITITQVSVGGNAV-----NFEFSAIFDSGTSFTYLNDPAYTQISETFNS 350
H PT Y + + + V G + F A+ DSG T L Y + F S
Sbjct: 343 KHAPTFYIVHLEAIIVAGKQLAVPPTAFATGAVMDSGAVVTQLPPQVYAALRAAFRS 399
>gi|115473845|ref|NP_001060521.1| Os07g0658600 [Oryza sativa Japonica Group]
gi|22775625|dbj|BAC15479.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
gi|50510141|dbj|BAD31106.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
gi|113612057|dbj|BAF22435.1| Os07g0658600 [Oryza sativa Japonica Group]
Length = 449
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 83/326 (25%), Positives = 131/326 (40%), Gaps = 50/326 (15%)
Query: 64 AHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALD 123
A RD L LA +G P+ ++G + + + +G PA ++A+D
Sbjct: 72 AARDASRLLYLDSLAVKGRAYAPI--ASGRQLLQTPT----YVVRARLGTPAQQLLLAVD 125
Query: 124 TGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCEL--QKQCP 180
T +D W+PC C C + ++P S++ VPC S C L C
Sbjct: 126 TSNDAAWIPCSGCAGCPTS-----------SPFNPAASASYRPVPCGSPQCVLAPNPSCS 174
Query: 181 SAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGA 240
+C + + Y +D ++ L +D L +A D V +FGC + TG+ A
Sbjct: 175 PNAKSCGFSLSY-ADSSLQAA-LSQDTLAVAGD------VVKAYTFGCLQRATGT---AA 223
Query: 241 APNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS----DGTGRISFGDKGSPGQGETPF 296
P GL GLG S + + + +FS C S + +G + G G P + +T
Sbjct: 224 PPQGLLGLGRGPLSF--LSQTKDMYGATFSYCLPSFKSLNFSGTLRLGRNGQPRRIKTTP 281
Query: 297 SLRQTHPT--YNITITQVSVGGNAVNFEFSA-----------IFDSGTSFTYLNDPAYTQ 343
L H + Y + +T + VG V+ SA + DSGT FT L P Y
Sbjct: 282 LLANPHRSSLYYVNMTGIRVGKKVVSIPASALAFDPATGAGTVLDSGTMFTRLVAPVYLA 341
Query: 344 ISETFNSLAKEKRETSTSDLPFEYCY 369
+ + +S F+ CY
Sbjct: 342 LRDEVRRRVGAGAAAVSSLGGFDTCY 367
>gi|357127293|ref|XP_003565317.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial
[Brachypodium distachyon]
Length = 540
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 77/300 (25%), Positives = 122/300 (40%), Gaps = 46/300 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+++ + +G PA + LDTGSD+ WL C C C + ++ P SS+
Sbjct: 196 YFSRIGIGSPARQLYMVLDTGSDVTWLQCAPCADCYAQSDP---------LFDPALSSSY 246
Query: 164 SKVPCNSTLCEL------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQS 217
+ VPC+S C + S+C Y+V Y DG+ + G + L L D
Sbjct: 247 ATVPCDSPHCRALDASACHNNAANGNSSCVYEVAY-GDGSYTVGDFATETLTLGGD---G 302
Query: 218 KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---G 274
+ ++ GCG G F+ A L G + S PS ++ FS C
Sbjct: 303 SAAVHDVAIGCGHDNEGLFVGAAGLLALGGGPL---SFPSQIS-----ATEFSYCLVDRD 354
Query: 275 SDGTGRISFGDKGSPGQGETPFSLRQTHPTYNITITQVSVGGN----------AVNFEFS 324
S + FG S +++ Y + + +SVGG A++ + S
Sbjct: 355 SPSASTLQFGASDSSTVTAPLMRSPRSNTFYYVALNGISVGGETLSDIPPAAFAMDEQGS 414
Query: 325 A--IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVL--RSFLHLQAL 380
I DSGT+ T L AY+ + + F + S L F+ CY L RS + + A+
Sbjct: 415 GGVIVDSGTAVTRLQSSAYSALRDAFVRGTQALPRASGVSL-FDTCYDLAGRSSVQVPAV 473
>gi|67633548|gb|AAY78698.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 392
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 68/246 (27%), Positives = 97/246 (39%), Gaps = 42/246 (17%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
++ + VG P +DTGSDL W C C +C I+ P+ SST
Sbjct: 60 IYLMKLQVGTPPFEIEAEIDTGSDLIWTQCMPCTNCYSQYAP---------IFDPSNSST 110
Query: 163 SSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
+ CN G++C Y++ Y +D T S G L + + + + + V
Sbjct: 111 FKEKRCN-------------GNSCHYKIIY-ADTTYSKGTLATETVTIHSTSGE-PFVMP 155
Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRIS 282
+ GCG S +G+ GL +S+ I G P S CF S GT +I+
Sbjct: 156 ETTIGCGH---NSSWFKPTFSGMVGLSWGPSSL--ITQMGGEYPGLMSYCFASQGTSKIN 210
Query: 283 FGDK---GSPGQGETPFSLRQTHP-TYNITITQVSVGGNAVN--------FEFSAIFDSG 330
FG G T L P Y + + VSVG V E + I DSG
Sbjct: 211 FGTNAIVAGDGVVSTTMFLTTAKPGLYYLNLDAVSVGDTHVETMGTTFHALEGNIIIDSG 270
Query: 331 TSFTYL 336
T+ TY
Sbjct: 271 TTLTYF 276
>gi|356542355|ref|XP_003539632.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 444
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 77/295 (26%), Positives = 114/295 (38%), Gaps = 38/295 (12%)
Query: 110 SVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPC 168
SVG P + +DTGSD+ WL C C C + I+ P+ S T +PC
Sbjct: 99 SVGTPPFQILGIVDTGSDIIWLQCQPCEDCYNQTTP---------IFDPSQSKTYKTLPC 149
Query: 169 NSTLCELQKQCPSAGSN---CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRIS 225
+S +C+ + S SN C Y + Y D + S G L + L L + + S +
Sbjct: 150 SSNICQSVQSAASCSSNNDECEYTITY-GDNSHSQGDLSVETLTLGSTDGSSVQFPKTV- 207
Query: 226 FGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFG-----SDGTGR 280
GCG G+F G +G+ V I I FS C S+ + +
Sbjct: 208 IGCGHNNKGTF----QREGSGIVGLGGGPVSLISQLSSSIGGKFSYCLAPLFSQSNSSSK 263
Query: 281 ISFGDKG---SPGQGETPFSLRQTHPTYNITITQVSVGGNAV----------NFEFSAIF 327
++FGD+ G TP + Y +T+ SVG N + E + I
Sbjct: 264 LNFGDEAVVSGRGTVSTPIVPKNGLGFYFLTLEAFSVGDNRIEFGSSSFESSGGEGNIII 323
Query: 328 DSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRSFLHLQALVV 382
DSGT+ T L + Y + + +R S CY S L V+
Sbjct: 324 DSGTTLTILPEDDYLNLESAVADAIELERVEDPSKF-LRLCYRTTSSDELNVPVI 377
>gi|242046218|ref|XP_002460980.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
gi|241924357|gb|EER97501.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
Length = 517
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 76/277 (27%), Positives = 108/277 (38%), Gaps = 48/277 (17%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ +V VG P F + +DTGSDL WL C C+ C + ++ P SS+
Sbjct: 151 YLMDVYVGTPPRRFRMIMDTGSDLNWLQCAPCLDCFDQVGP---------VFDPAASSSY 201
Query: 164 SKVPCNSTLC------ELQKQCPSAGSN-CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQ 216
V C C E + C G + CPY Y + +E T
Sbjct: 202 RNVTCGDQRCGLVAPPEPPRACRRPGEDSCPYYYWYGDQSNTTGDLALESFTVNLTAPGA 261
Query: 217 SKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF--- 273
S+ VD + FGCG G F A L GLG S S L + + ++FS C
Sbjct: 262 SRRVDD-VVFGCGHWNRGLFHGAAG---LLGLGRGPLSFASQL--RAVYGHTFSYCLVDH 315
Query: 274 GSDGTGRISFGDKGS-------PGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFE-- 322
GSD ++ FG+ + P T F+ + Y + + V VGG +N
Sbjct: 316 GSDVASKVVFGEDDALALAAAHPQLNYTAFAPASSPADTFYYVKLKGVLVGGELLNISSD 375
Query: 323 -----------FSAIFDSGTSFTYLNDPAYTQISETF 348
I DSGT+ +Y +PAY I + F
Sbjct: 376 TWGVGEGEGGSGGTIIDSGTTLSYFVEPAYQVIRQAF 412
>gi|356526294|ref|XP_003531753.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 414
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 88/309 (28%), Positives = 132/309 (42%), Gaps = 54/309 (17%)
Query: 66 RDRYFRLRGRGLAAQGN---DKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVAL 122
R R + R R +A+ N +T + S+G + LN + V++G + + V +
Sbjct: 28 RVRSMQNRIRRVASTHNVEASQTQIPLSSGINLQTLNYI-------VTMGLGSKNMTVII 80
Query: 123 DTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE-LQKQCP 180
DTGSDL W+ C+ C+SC + I+ P+TSS+ V CNS+ C+ LQ
Sbjct: 81 DTGSDLTWVQCEPCMSCYNQQGP---------IFKPSTSSSYQSVSCNSSTCQSLQFATG 131
Query: 181 SAG-------SNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQT 233
+ G S C Y V Y DG+ + G L + L SV S FGCGR
Sbjct: 132 NTGACGSSNPSTCNYVVNY-GDGSYTNGELGVEALSFG-----GVSV-SDFVFGCGRNNK 184
Query: 234 GSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGTGRISFGDKGSPG 290
G F +GL GLG S+ S FS C + +G + G++ S
Sbjct: 185 GLF---GGVSGLMGLGRSYLSLVS--QTNATFGGVFSYCLPTTEAGSSGSLVMGNESSVF 239
Query: 291 QGETPFSLRQ--THPT----YNITITQVSVGGNAVNFEFS-----AIFDSGTSFTYLNDP 339
+ P + + ++P Y + +T + VGG A+ S + DSGT T L
Sbjct: 240 KNANPITYTRMLSNPQLSNFYILNLTGIDVGGVALKAPLSFGNGGILIDSGTVITRLPSS 299
Query: 340 AYTQISETF 348
Y + F
Sbjct: 300 VYKALKAEF 308
>gi|242066168|ref|XP_002454373.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
gi|241934204|gb|EES07349.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
Length = 458
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 92/324 (28%), Positives = 139/324 (42%), Gaps = 31/324 (9%)
Query: 41 PVKGILAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNS 100
P +L D + S A A R +LR RG ++ + ++ + G T S
Sbjct: 62 PFSAVL-THDHARIASLAARLAKTPSSRPTKLR-RGSSSSPDAESLASVPLGPGT----S 115
Query: 101 LGFLHY-TNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNT 159
+G +Y T + +G PA S+++ +DTGS L WL C C+ + SG V + S
Sbjct: 116 VGVGNYVTRMGLGTPAKSYVMVVDTGSSLTWL--QCSPCLVSCHRQSGPVFNPRSSSSYA 173
Query: 160 SSTSSKVPCNS-TLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSK 218
S + S C++ T L S + C YQ Y D + S G+L +D + S
Sbjct: 174 SVSCSAPQCDALTTATLNPSTCSTSNVCIYQASY-GDSSFSVGYLSKDTVSFG-----ST 227
Query: 219 SVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGT 278
SV + +GCG+ G F A GL GL +K S+ LA + SFS C + +
Sbjct: 228 SVPN-FYYGCGQDNEGLFGQSA---GLIGLARNKLSLLYQLAPS--MGYSFSYCLPTSSS 281
Query: 279 GRISFGDKG-SPGQ-GETPFSLRQTHPT-YNITITQVSVGGNAVNFEFSA------IFDS 329
+PGQ TP + + Y I +T ++V G ++ SA I DS
Sbjct: 282 SSGYLSIGSYNPGQYSYTPMAKSSLDDSLYFIKMTGITVAGKPLSVSASAYSSLPTIIDS 341
Query: 330 GTSFTYLNDPAYTQISETFNSLAK 353
GT T L Y+ +S+ K
Sbjct: 342 GTVITRLPTDVYSALSKAVAGAMK 365
>gi|15226317|ref|NP_180370.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4063755|gb|AAC98463.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197953|gb|AAM15327.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252977|gb|AEC08071.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 392
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 68/246 (27%), Positives = 97/246 (39%), Gaps = 42/246 (17%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSST 162
++ + VG P +DTGSDL W C C +C I+ P+ SST
Sbjct: 60 IYLMKLQVGTPPFEIEAEIDTGSDLIWTQCMPCTNCYSQYAP---------IFDPSNSST 110
Query: 163 SSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDS 222
+ CN G++C Y++ Y +D T S G L + + + + + V
Sbjct: 111 FKEKRCN-------------GNSCHYKIIY-ADTTYSKGTLATETVTIHSTSGE-PFVMP 155
Query: 223 RISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRIS 282
+ GCG S +G+ GL +S+ I G P S CF S GT +I+
Sbjct: 156 ETTIGCGH---NSSWFKPTFSGMVGLSWGPSSL--ITQMGGEYPGLMSYCFASQGTSKIN 210
Query: 283 FGDK---GSPGQGETPFSLRQTHP-TYNITITQVSVGGNAVN--------FEFSAIFDSG 330
FG G T L P Y + + VSVG V E + I DSG
Sbjct: 211 FGTNAIVAGDGVVSTTMFLTTAKPGLYYLNLDAVSVGDTHVETMGTTFHALEGNIIIDSG 270
Query: 331 TSFTYL 336
T+ TY
Sbjct: 271 TTLTYF 276
>gi|91806508|gb|ABE65981.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 203
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 39/117 (33%), Positives = 63/117 (53%), Gaps = 16/117 (13%)
Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSC-VHGLNSSSGQVIDFNIYSPN 158
L L+YT V +G P V +DTGSDL W+ C+ CV C +H + + P
Sbjct: 74 LSALYYTTVQIGTPPRELDVVIDTGSDLVWVSCNSCVGCPLH----------NVTFFDPG 123
Query: 159 TSSTSSKVPCNSTLC--ELQKQCP-SAGSNCPYQVRYLSDGTMSTGFLVEDVLHLAT 212
SS++ K+ C+ C +LQK+ S +C Y+V Y DG++++G+ + D++ T
Sbjct: 124 ASSSAVKLACSDKRCSSDLQKKSRCSLLESCTYKVEY-GDGSVTSGYYISDLISFDT 179
>gi|357113696|ref|XP_003558637.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 432
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 73/275 (26%), Positives = 109/275 (39%), Gaps = 55/275 (20%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ +G PA ++ALDT +D W C C +C SSG ++++P S++
Sbjct: 77 YVVRAGLGSPAQPILLALDTSADATWAHCSPCGTC-----PSSG-----SLFAPANSTSY 126
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMST-------------GFLVEDVLHL 210
+ +PC+ST+C + + G CP Q Y S + L D LHL
Sbjct: 127 APLPCSSTMCTVLQ-----GQPCPAQDPYDSSAPLPMCAFTKPFADASFQASLASDWLHL 181
Query: 211 ATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFS 270
D + +FGC +G + GL GLG ++ S + N + FS
Sbjct: 182 GKDAIPN------YAFGCVSAVSGPTAN-LPKQGLLGLGRGPMALLSQVGN--MYNGVFS 232
Query: 271 MCFGSDG----TGRISFGDKGSP-GQGETPFSLRQTHPT-YNITITQVSVGGNAV----- 319
C S +G + G G P G TP + Y + +T +SVG V
Sbjct: 233 YCLPSYKSYYFSGSLRLGAAGQPRGVRYTPMLKNPNRSSLYYVNVTGLSVGRAPVKVPAG 292
Query: 320 NFEFS------AIFDSGTSFTYLNDPAYTQISETF 348
+F F + DSGT T P Y + E F
Sbjct: 293 SFAFDPATGAGTVVDSGTVITRWTPPVYAALREEF 327
>gi|297843774|ref|XP_002889768.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
lyrata]
gi|297335610|gb|EFH66027.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
lyrata]
Length = 449
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 89/311 (28%), Positives = 121/311 (38%), Gaps = 54/311 (17%)
Query: 61 SALAHRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIV 120
S+ +HR Y L A + T + ++GN + N + +G P +
Sbjct: 70 SSDSHRFTYLS----SLVAGKSKPTSVPVASGNQLHIGN-----YVVRARLGTPPQLMFM 120
Query: 121 ALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCELQK-- 177
LDT +D WLPC C C + S + +SST S V C++T C +
Sbjct: 121 VLDTSNDAVWLPCSGCSGCSNASTSFNTN----------SSSTYSTVSCSTTQCTQARGL 170
Query: 178 QCPSAG---SNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTG 234
CPS+ S C + Y D + S LV+D L L+ D V SFGC +G
Sbjct: 171 TCPSSTPQPSICSFNQSYGGDSSFSAN-LVQDTLTLSPD------VIPNFSFGCINSASG 223
Query: 235 SFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDG----TGRISFGDKGSPG 290
+ L P GL GLG S+ + L FS C S +G + G G P
Sbjct: 224 NSL---PPQGLMGLGRGPMSL--VSQTTSLYSGVFSYCLPSFRSFYFSGSLKLGLLGQPK 278
Query: 291 QGE-TPFSLRQTHPT-YNITITQVSVGGNAV-----------NFEFSAIFDSGTSFTYLN 337
TP P+ Y + +T VSVG V N I DSGT T
Sbjct: 279 SIRYTPLLRNPRRPSLYYVNLTGVSVGSVQVPVDPVYLTFDSNSGAGTIIDSGTVITRFA 338
Query: 338 DPAYTQISETF 348
P Y I + F
Sbjct: 339 QPVYEAIRDEF 349
>gi|356499344|ref|XP_003518501.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 561
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 83/283 (29%), Positives = 110/283 (38%), Gaps = 60/283 (21%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ +V VG P F + LDTGSDL W+ C C++C SG Y P SS+
Sbjct: 197 YFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCIACFE----QSGPY-----YDPKDSSSF 247
Query: 164 SKVPCNSTLCEL------QKQCPSAGSNCPYQVRYLSDGTMSTG-FLVED-VLHLATDEK 215
+ C+ C+L K C + +CPY Y DG+ +TG F +E ++L T
Sbjct: 248 RNISCHDPRCQLVSAPDPPKPCKAENQSCPYFYWY-GDGSNTTGDFALETFTVNLTTPNG 306
Query: 216 QS--KSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF 273
S K V++ + FGCG G F A GL + S Q L SFS C
Sbjct: 307 TSELKHVEN-VMFGCGHWNRGLFHGAAGLLGLGKGPLSFAS-----QMQSLYGQSFSYCL 360
Query: 274 GSDGTGRISFGDKGSPGQGETPFSLRQTHPTYNIT-----------------ITQVSVGG 316
D S K G+ + S HP N T I V V
Sbjct: 361 -VDRNSNASVSSKLIFGEDKELLS----HPNLNFTSFGGGKDGSVDTFYYVQIKSVMVDD 415
Query: 317 NAVNFEFS-----------AIFDSGTSFTYLNDPAYTQISETF 348
+ I DSGT+ TY +PAY I E F
Sbjct: 416 EVLKIPEETWHLSSEGAGGTIIDSGTTLTYFAEPAYEIIKEAF 458
>gi|79407941|ref|NP_188636.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|75273243|sp|Q9LHE3.1|ASPG2_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 2;
Short=AtASPG2; Flags: Precursor
gi|11994777|dbj|BAB03167.1| nucleoid chloroplast DNA-binding protein-like [Arabidopsis
thaliana]
gi|28392860|gb|AAO41867.1| unknown protein [Arabidopsis thaliana]
gi|332642798|gb|AEE76319.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 470
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 71/303 (23%), Positives = 114/303 (37%), Gaps = 55/303 (18%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ + VG P + +D+GSD+ W+ C C C + ++ P S +
Sbjct: 131 YFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSDP---------VFDPAKSGSY 181
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
+ V C S++C+ + C Y+V Y DG+ + G L + L A K+V
Sbjct: 182 TGVSCGSSVCDRIENSGCHSGGCRYEVMY-GDGSYTKGTLALETLTFA------KTVVRN 234
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF---GSDGTGR 280
++ GCG G F+ A G+ G M S G +F C G+D TG
Sbjct: 235 VAMGCGHRNRGMFIGAAGLLGIGGGSMSFVGQLS-----GQTGGAFGYCLVSRGTDSTGS 289
Query: 281 ISFGDKGSP-GQGETPFSLRQTHPTYN--------------------ITITQVSVGGNAV 319
+ FG + P G P P++ +T+ GG
Sbjct: 290 LVFGREALPVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTETGDGG--- 346
Query: 320 NFEFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRSFLHLQA 379
+ D+GT+ T L AY + F S S + F+ CY L F+ ++
Sbjct: 347 -----VVMDTGTAVTRLPTAAYVAFRDGFKSQTANLPRASGVSI-FDTCYDLSGFVSVRV 400
Query: 380 LVV 382
V
Sbjct: 401 PTV 403
>gi|225465837|ref|XP_002264626.1| PREDICTED: aspartic proteinase nepenthesin-1-like isoform 1 [Vitis
vinifera]
Length = 437
Score = 61.6 bits (148), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 81/285 (28%), Positives = 121/285 (42%), Gaps = 44/285 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ +G PA + ++A+DT SD+ W+PC+ C+ C L FN SP S+T
Sbjct: 101 YIVRAKIGTPAQTMLMAMDTSSDVAWIPCNGCLGCSSTL---------FN--SP-ASTTY 148
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
+ C + C+ + G C + + Y G+ L +D + LATD
Sbjct: 149 KSLGCQAAQCKQVPKPTCGGGVCSFNLTY--GGSSLAANLSQDTITLATDAVPG------ 200
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS----DGTG 279
SFGC + TG L GL + S Q L ++FS C S + +G
Sbjct: 201 YSFGCIQKATGGSLPAQGLLGLGRGPLSLLS-----QTQNLYQSTFSYCLPSFKSLNFSG 255
Query: 280 RISFGDKGSPGQGE-TPFSLRQTHPT-YNITITQVSVGGNAV-----NFEFS------AI 326
+ G G P + + TP P+ Y + + V VG V +F F+ I
Sbjct: 256 SLRLGPVGQPKRIKYTPLLKNPRRPSLYFVNLMAVRVGRRVVDVPPGSFTFNPSTGAGTI 315
Query: 327 FDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVL 371
FDSGT FT L PAY + + F + + T TS F+ CY +
Sbjct: 316 FDSGTVFTRLVTPAYIAVRDAFRNRVG-RNLTVTSLGGFDTCYTV 359
>gi|219120658|ref|XP_002181063.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217407779|gb|EEC47715.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 448
Score = 61.6 bits (148), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 88/329 (26%), Positives = 132/329 (40%), Gaps = 58/329 (17%)
Query: 93 NDTYRL--NSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQV 149
N T RL +++ H+ +G+P + + +DTGS L C+ C C +
Sbjct: 72 NATVRLPLHAVAGTHHVTAWMGEPPQAQTLIVDTGSRLTATACEPCSQC------GTTHA 125
Query: 150 IDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLH 209
F P SST C S L ++C +A C RY ++G+ T V D
Sbjct: 126 HPFPHLDPQRSSTLRYTQCGSCLLSGIQEC-AAEQKCGINQRY-TEGSSWTAVEVSDTFV 183
Query: 210 LATDE----KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLI 265
L E +Q S +FGC + G F A NG+ GL S+ L + +I
Sbjct: 184 LGGPEISSLEQYVSFTIIFAFGCQQKVRGLFRTQYA-NGILGLERSDLSLIKRLWKENVI 242
Query: 266 P-NSFSMCFGSDGTGRISFG----DKGSPGQGETPFSLRQTHPTYNITITQVSVGG---- 316
P SFS+C + G I G DK + TPF+ T Y + + +V VG
Sbjct: 243 PRESFSLCM-TPFEGYIGLGGPLRDKHTESMKYTPFT--STQSWYAVHVVRVFVGDECLT 299
Query: 317 ----------NAVNFEFS----AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSD 362
+A+ F+ I DSGT+ TYL ++ E + L S+
Sbjct: 300 SNDQHDTVVEHALVEAFAEGKGTILDSGTTDTYLPKAVAGRMREIWARL---------SN 350
Query: 363 LPFE----YCYVLRSFLHLQALVVLPFPL 387
PF+ Y Y F ++L ++ F L
Sbjct: 351 TPFQPSSTYAYTYDEF---RSLPIVTFEL 376
>gi|302774304|ref|XP_002970569.1| hypothetical protein SELMODRAFT_93861 [Selaginella moellendorffii]
gi|300162085|gb|EFJ28699.1| hypothetical protein SELMODRAFT_93861 [Selaginella moellendorffii]
Length = 490
Score = 61.6 bits (148), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 54/197 (27%), Positives = 86/197 (43%), Gaps = 23/197 (11%)
Query: 188 YQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFG 247
YQ +Y T S+G L +DV+ + S R+ FGC +TG D A +G+ G
Sbjct: 103 YQRQYAEKST-SSGVLGKDVISFSN---SSDLGGQRLVFGCETAETGDLYDQTA-DGIIG 157
Query: 248 LGMDKTSVPSILANQGLIPNSFSMCFG--SDGTGRISFGDKGSPGQGETPFSLRQTHPTY 305
LG S+ L + + + FS+C+G +G G + G P S P Y
Sbjct: 158 LGRGPLSIIDQLVEKNAMEDVFSLCYGGMDEGGGAMILGGFQPPKDMVFTSSDPHRSPYY 217
Query: 306 NITITQVSVGGNAVNF-------EFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEK--- 355
N+ + + VGG+ + ++ + DSGT++ Y A+ + F S KE+
Sbjct: 218 NLMLKGIRVGGSPLRLKPEVFDGKYGTVLDSGTTYAYFPGAAF----QAFKSAVKEQVGS 273
Query: 356 -RETSTSDLPF-EYCYV 370
+E D F + CY
Sbjct: 274 LKEVPGPDEKFKDICYA 290
>gi|225427558|ref|XP_002266614.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 444
Score = 61.6 bits (148), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 85/338 (25%), Positives = 125/338 (36%), Gaps = 54/338 (15%)
Query: 60 YSALAHRDRYFRLRG---RGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPAL 116
Y L R LRG R + A ND S G + N+S+G P +
Sbjct: 56 YQRLQKAFRRSILRGNHFRAMRASPNDIQSDVISGGG----------AYLMNISLGTPPV 105
Query: 117 SFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCEL 175
+ DTGSDL W C C +C + ++ P S T + C++ C+
Sbjct: 106 PMLGIADTGSDLIWRQCLPCPNCYEQVEP---------LFDPKESETYKTLDCDNEFCQD 156
Query: 176 QKQCPSAGSN--CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQT 233
Q S + C Y Y D + + G L D L + + E S I+FGCG
Sbjct: 157 LGQQGSCDDDNTCTYSYSY-GDRSYTRGDLSSDTLTIGSTEGDPASFPG-IAFGCGHDNG 214
Query: 234 GSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDGT--GRISFGDKG- 287
G+F + +G+ + ++ + FS C SD T +I+FG G
Sbjct: 215 GTFNE----KDGGLIGLGGGPLSLVMQLSSEVGGQFSYCLVPLSSDSTVSSKINFGKSGV 270
Query: 288 --SPGQGETPFSLRQTHPTYNITITQVSVGGNAVNF--------------EFSAIFDSGT 331
G TP Y +T+ +SVG V F E + I DSGT
Sbjct: 271 VSGSGTVSTPLIKGTPDTFYYLTLEGLSVGSETVAFKGFSENKSSPAAVEEGNIIIDSGT 330
Query: 332 SFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
+ T L YT + + + T + + F CY
Sbjct: 331 TLTLLPQDFYTDVESALTNAIGGQTTTDPNGI-FSLCY 367
>gi|413944387|gb|AFW77036.1| hypothetical protein ZEAMMB73_461996 [Zea mays]
Length = 472
Score = 61.6 bits (148), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 87/308 (28%), Positives = 124/308 (40%), Gaps = 49/308 (15%)
Query: 65 HRDRYFRLRGRGLAAQGNDKTPLTFSAGNDTYRLNSLGFLHYTNVSVGQPALSFIVALDT 124
H R + GR + P + A D+ + + +G PA+ V +DT
Sbjct: 95 HITRKAKASGR-TTTLSDVSIPTSLGAAVDSLE-------YVVTLGIGTPAVQQTVLIDT 146
Query: 125 GSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSSKVPCNSTLCE------LQKQ 178
GSDL W+ C C NSSS +Y P SST + VPC+S C+
Sbjct: 147 GSDLSWV--QCKPC----NSSSCYPQKDPLYDPTASSTYAPVPCDSKACKDLVPDAYDHG 200
Query: 179 C--PSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSRISFGCGRVQTGSF 236
C S S C Y + Y + T + G + L L+ Q D FGCG VQ G+F
Sbjct: 201 CTNSSGTSLCQYGIEYGNRDT-TVGVYSTETLTLS---PQVSVKD--FGFGCGLVQQGTF 254
Query: 237 LDGAAPNGLFGLGMDKTSVPSILANQG--LIPNSFSMCF--GSDGTGRISFG----DKGS 288
+ P L +Q +FS C G+ TG ++ G + +
Sbjct: 255 DLFDG-------LLGLGGAPESLVSQTAETYGGAFSYCLPPGNSTTGFLALGAPTNNNDT 307
Query: 289 PGQGETPF-SLRQTHPTYNITITQVSVGGNAVNFEFSA-----IFDSGTSFTYLNDPAYT 342
G TP SL + Y + +T VSVGG ++ + I DSGT T L D AY+
Sbjct: 308 AGFLFTPLHSLPEQATFYLVNLTGVSVGGKPLDIPPTVLSGGMIIDSGTIITGLPDTAYS 367
Query: 343 QISETFNS 350
+ F +
Sbjct: 368 ALRTAFRT 375
>gi|356536463|ref|XP_003536757.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 475
Score = 61.6 bits (148), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 83/368 (22%), Positives = 134/368 (36%), Gaps = 65/368 (17%)
Query: 56 SFAYYSALAHRDR------YFRLRGRGLAAQGNDKTPLT-----FSAGNDTYRLNSLGF- 103
S Y L HRD+ Y R R A D +AG TY + G
Sbjct: 65 SAKYKLKLVHRDKVPTFNTYHDHRTRFNARMQRDTKRAASLLRRLAAGKPTYAAEAFGSD 124
Query: 104 ----------LHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDF 152
++ + VG P + V +D+GSD+ W+ C+ C C H +
Sbjct: 125 VVSGMEQGSGEYFVRIGVGSPPRNQYVVMDSGSDIIWVQCEPCTQCYHQSDP-------- 176
Query: 153 NIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLAT 212
+++P SS+ S V C ST+C C Y+V Y DG+ + G L + +
Sbjct: 177 -VFNPADSSSFSGVSCASTVCSHVDNAACHEGRCRYEVSY-GDGSYTKGTLALETITFG- 233
Query: 213 DEKQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQ--GLIPNSFS 270
+++ ++ GCG G F+ A + P Q G +FS
Sbjct: 234 -----RTLIRNVAIGCGHHNQGMFVGAAG-------LLGLGGGPMSFVGQLGGQTGGAFS 281
Query: 271 MCFGSDG---TGRISFGDKGSP-GQGETPFSLRQTHPTY--------NITITQVSVGGNA 318
C S G +G + FG + P G P ++ + +VS+ +
Sbjct: 282 YCLVSRGIESSGLLEFGREAMPVGAAWVPLIHNPRAQSFYYIGLSGLGVGGLRVSISEDV 341
Query: 319 VNF----EFSAIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCYVLRSF 374
+ + D+GT+ T L AY + F + S + F+ CY L F
Sbjct: 342 FKLSELGDGGVVMDTGTAVTRLPTVAYEAFRDGFIAQTTNLPRASGVSI-FDTCYDLFGF 400
Query: 375 LHLQALVV 382
+ ++ V
Sbjct: 401 VSVRVPTV 408
>gi|224126551|ref|XP_002329582.1| predicted protein [Populus trichocarpa]
gi|222870291|gb|EEF07422.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 61.6 bits (148), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 82/345 (23%), Positives = 131/345 (37%), Gaps = 38/345 (11%)
Query: 46 LAVDDLPKKGSFAYYSALAHRDRYFRLRGRGLAAQGN-DKTPLTFSAGNDTYRLNSLGFL 104
L D PK S Y S H R+ + R ++ + +T T S + + G
Sbjct: 35 LVHRDSPK--SPLYNSQQTHLQRWNKAMRRSVSRVHHFQRTAATVSPKEVESEIIANGGE 92
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ ++S+G P + DTGSDL W C C C + ++ P +S T
Sbjct: 93 YLMSLSLGTPPFEILAIADTGSDLIWTQCTPCDKCYKQIAP---------LFDPKSSKTY 143
Query: 164 SKVPCNSTLCELQKQCPSAGSN--CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
+ C++ C+ + S S C Y Y D + + G L D + L +
Sbjct: 144 RDLSCDTRQCQNLGESSSCSSEQLCQYSY-YYGDRSFTNGNLAVDTVTLPSTNGGPVYFP 202
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMC---FGSDGT 278
+ GCGR G+F +G+ GLG S+ S + + + FS C F S+
Sbjct: 203 KTV-IGCGRRNNGTF--DKKDSGIIGLGGGPMSLISQMGSS--VGGKFSYCLVPFSSESA 257
Query: 279 G---RISFGDKG---SPGQGETPFSLRQTHPTYNITITQVSVGGNAV--------NFEFS 324
G ++ FG G TP + Y +T+ +SVG + E +
Sbjct: 258 GNSSKLHFGRNAVVSGSGVQSTPLISKNPDTFYYLTLEAMSVGDKKIEFGGSSFGGSEGN 317
Query: 325 AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
I DSGTS T +T+ + + T + +CY
Sbjct: 318 IIIDSGTSLTLFPVNFFTEFATAVENAVINGERTQDASGLLSHCY 362
>gi|219120652|ref|XP_002181060.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217407776|gb|EEC47712.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 453
Score = 61.6 bits (148), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 86/325 (26%), Positives = 132/325 (40%), Gaps = 50/325 (15%)
Query: 93 NDTYRL--NSLGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQV 149
N T RL +++ H+ +G+P + + +DTGS L C+ C C +
Sbjct: 68 NATVRLPLHAVAGTHHVTAWMGEPPQAQTLIVDTGSRLTATACEPCSQC------GTTHA 121
Query: 150 IDFNIYSPNTSSTSSKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLH 209
F P SST C S L ++C +A C RY ++G+ T V D
Sbjct: 122 HPFPHLDPQRSSTLRYTQCGSCLLSGIQEC-AAEQKCGINQRY-TEGSSWTAVEVSDTFV 179
Query: 210 LATDE----KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLI 265
L E +Q S +FGC + G F A NG+ GL S+ L + +I
Sbjct: 180 LGGPEISSLEQYVSFTIIFAFGCQQKVRGLFRTQYA-NGILGLERSDLSLIKRLWKENVI 238
Query: 266 P-NSFSMCFGSDGTGRISFG----DKGSPGQGETPFSLRQTHPTYNITITQVSVGG---- 316
P SFS+C + G I G DK + TPF+ T Y + + +V VG
Sbjct: 239 PRESFSLCM-TPFEGYIGLGGPLRDKHTESMKYTPFT--STQSWYAVHVVRVFVGDECLT 295
Query: 317 ----------NAVNFEFS----AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSD 362
+A+ F+ I DSGT+ TYL ++ E + L+ + S++
Sbjct: 296 SNDQHDTVVEHALVEAFAEGKGTILDSGTTDTYLPKAVAGRMREIWARLSNTPFQPSST- 354
Query: 363 LPFEYCYVLRSFLHLQALVVLPFPL 387
Y Y F ++L ++ F L
Sbjct: 355 ----YAYTYDEF---RSLPIVTFEL 372
>gi|15226315|ref|NP_180368.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4510415|gb|AAD21501.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252975|gb|AEC08069.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 396
Score = 61.6 bits (148), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 64/245 (26%), Positives = 101/245 (41%), Gaps = 40/245 (16%)
Query: 104 LHYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ + VG P +DTGS++ W C+ CVH ++ I+ P+ SST
Sbjct: 64 VYLMKLQVGTPPFEIQAIIDTGSEITW--TQCLPCVHCYEQNAP------IFDPSKSSTF 115
Query: 164 SKVPCNSTLCELQKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVDSR 223
+ C+ G +CPY+V Y D T + G L + + L + + +
Sbjct: 116 KEKRCD-------------GHSCPYEVDYF-DHTYTMGTLATETITLHSTSGEPFVMPET 161
Query: 224 ISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTGRISF 283
I GCG S+ + +G+ GL +S+ I G P S CF GT +I+F
Sbjct: 162 I-IGCG--HNNSWFKPSF-SGMVGLNWGPSSL--ITQMGGEYPGLMSYCFSGQGTSKINF 215
Query: 284 GDKG-SPGQGETPFSLRQTHPT---YNITITQVSVGGNAVN--------FEFSAIFDSGT 331
G G G ++ T Y + + VSVG + E + + DSGT
Sbjct: 216 GANAIVAGDGVVSTTMFMTTAKPGFYYLNLDAVSVGNTRIETMGTTFHALEGNIVIDSGT 275
Query: 332 SFTYL 336
+ TY
Sbjct: 276 TLTYF 280
>gi|242041115|ref|XP_002467952.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
gi|241921806|gb|EER94950.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
Length = 774
Score = 61.2 bits (147), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 73/295 (24%), Positives = 115/295 (38%), Gaps = 50/295 (16%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCDCVSCVHGLNSSSGQVIDFNIYSPNTSSTSS 164
+ ++++G P + LDTGSDL W C C + + G + P+ SST
Sbjct: 415 YLVHLAIGTPPQPVQLILDTGSDLVW--TQCRPCPVCFSRALGPL------DPSNSSTFD 466
Query: 165 KVPCNSTLCELQKQCPSAGSN-----CPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKS 219
+PC+S +C+ N C Y Y +DG+++TG L + A + ++
Sbjct: 467 VLPCSSPVCDNLTWSSCGKHNWGNQTCVYVYAY-ADGSITTGHLDAETFTFAAADGTGQA 525
Query: 220 VDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGSDGTG 279
++FGCG G F G+ G G S+PS L ++FS CF +
Sbjct: 526 TVPDLAFGCGLFNNGIFTSNE--TGIAGFGRGALSLPSQLKV-----DNFSHCFTAITGS 578
Query: 280 RISFGDKGSPGQ---------GETPF-----SLRQTHPTYNITITQVSVGGNAVNFEFS- 324
S G P TP SLR Y +++ ++VG + S
Sbjct: 579 EPSSVLLGLPANLYSDADGAVQSTPLVQNFSSLR----AYYLSLKGITVGSTRLPIPEST 634
Query: 325 ----------AIFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
I DSGT T L AY + + F + + + +TS C+
Sbjct: 635 FALKQDGTGGTIIDSGTGMTTLPQDAYKLVHDAFTAQVRLPVDNATSSSLSRLCF 689
>gi|240255485|ref|NP_189841.4| aspartyl protease family protein [Arabidopsis thaliana]
gi|332644216|gb|AEE77737.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 430
Score = 61.2 bits (147), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 39/117 (33%), Positives = 63/117 (53%), Gaps = 16/117 (13%)
Query: 101 LGFLHYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSC-VHGLNSSSGQVIDFNIYSPN 158
L L+YT V +G P V +DTGSDL W+ C+ CV C +H + + P
Sbjct: 74 LSALYYTTVQIGTPPRELDVVIDTGSDLVWVSCNSCVGCPLHNV----------TFFDPG 123
Query: 159 TSSTSSKVPCNSTLC--ELQKQCP-SAGSNCPYQVRYLSDGTMSTGFLVEDVLHLAT 212
SS++ K+ C+ C +LQK+ S +C Y+V Y DG++++G+ + D++ T
Sbjct: 124 ASSSAVKLACSDKRCSSDLQKKSRCSLLESCTYKVEY-GDGSVTSGYYISDLISFDT 179
>gi|357494221|ref|XP_003617399.1| 60S ribosomal protein L18a [Medicago truncatula]
gi|355518734|gb|AET00358.1| 60S ribosomal protein L18a [Medicago truncatula]
Length = 749
Score = 61.2 bits (147), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 78/277 (28%), Positives = 113/277 (40%), Gaps = 48/277 (17%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPC-DCVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
++ +V +G P + + LDTGSDL W+ C C++C SG Y P SS+
Sbjct: 192 YFMDVFIGTPPKHYSLILDTGSDLNWIQCVPCIACFE----QSGPY-----YDPKESSSF 242
Query: 164 SKVPCNSTLCEL------QKQCPSAGSNCPYQVRYLSDGTMSTGFLVED-VLHLATD--E 214
+ C+ C+L K C CPY Y + F +E ++L T +
Sbjct: 243 ENITCHDPRCKLVSSPDPPKPCKDENQTCPYFYWYGDSSNTTGDFALETFTVNLTTPNGK 302
Query: 215 KQSKSVDSRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCF- 273
+ K V++ + FGCG G F A L GLG S S L Q + +SFS C
Sbjct: 303 SEQKHVEN-VMFGCGHWNRGLFHGAAG---LLGLGRGPLSFASQL--QSIYGHSFSYCLV 356
Query: 274 --GSDG--TGRISFGDKGS----PGQGETPFSLRQTHPT---YNITITQVSVGGNAVNFE 322
SD + ++ FG+ P T F + + Y + I + V G +
Sbjct: 357 DRNSDTSVSSKLIFGEDKELLSHPNLNFTSFVGGEENSVDTFYYVGIKSIMVDGEVLKIP 416
Query: 323 FS-----------AIFDSGTSFTYLNDPAYTQISETF 348
I DSGT+ TY +PAY I E F
Sbjct: 417 EETWHLSKEGGGGTIIDSGTTLTYFAEPAYEIIKEAF 453
>gi|222637611|gb|EEE67743.1| hypothetical protein OsJ_25435 [Oryza sativa Japonica Group]
Length = 396
Score = 61.2 bits (147), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 74/285 (25%), Positives = 116/285 (40%), Gaps = 44/285 (15%)
Query: 105 HYTNVSVGQPALSFIVALDTGSDLFWLPCD-CVSCVHGLNSSSGQVIDFNIYSPNTSSTS 163
+ +G PA ++A+DT +D W+PC C C + ++P S++
Sbjct: 54 YVVRARLGTPAQQLLLAVDTSNDAAWIPCSGCAGCPTS-----------SPFNPAASASY 102
Query: 164 SKVPCNSTLCEL--QKQCPSAGSNCPYQVRYLSDGTMSTGFLVEDVLHLATDEKQSKSVD 221
VPC S C L C +C + + Y +D ++ L +D L +A D V
Sbjct: 103 RPVPCGSPQCVLAPNPSCSPNAKSCGFSLSY-ADSSLQAA-LSQDTLAVAGD------VV 154
Query: 222 SRISFGCGRVQTGSFLDGAAPNGLFGLGMDKTSVPSILANQGLIPNSFSMCFGS----DG 277
+FGC + TG+ A P GL GLG S + + + +FS C S +
Sbjct: 155 KAYTFGCLQRATGT---AAPPQGLLGLGRGPLSF--LSQTKDMYGATFSYCLPSFKSLNF 209
Query: 278 TGRISFGDKGSPGQGETPFSLRQTHPT--YNITITQVSVGGNAVNFEFSA---------- 325
+G + G G P + +T L H + Y + +T + VG V+ SA
Sbjct: 210 SGTLRLGRNGQPRRIKTTPLLANPHRSSLYYVNMTGIRVGKKVVSIPASALAFDPATGAG 269
Query: 326 -IFDSGTSFTYLNDPAYTQISETFNSLAKEKRETSTSDLPFEYCY 369
+ DSGT FT L P Y + + +S F+ CY
Sbjct: 270 TVLDSGTMFTRLVAPVYLALRDEVRRRVGAGAAAVSSLGGFDTCY 314
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.321 0.136 0.420
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,377,384,117
Number of Sequences: 23463169
Number of extensions: 283585496
Number of successful extensions: 581446
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 332
Number of HSP's successfully gapped in prelim test: 2252
Number of HSP's that attempted gapping in prelim test: 577454
Number of HSP's gapped (non-prelim): 2907
length of query: 387
length of database: 8,064,228,071
effective HSP length: 144
effective length of query: 243
effective length of database: 8,980,499,031
effective search space: 2182261264533
effective search space used: 2182261264533
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 78 (34.7 bits)