BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 019088
(346 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255547548|ref|XP_002514831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223545882|gb|EEF47385.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 488
Score = 432 bits (1111), Expect = e-118, Method: Compositional matrix adjust.
Identities = 197/309 (63%), Positives = 254/309 (82%), Gaps = 4/309 (1%)
Query: 29 GNFVFEVENKFKAGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVG 88
GN+VF V++KF +ER+LSALKQHD RRH R+++++DL LGGNGHP+ GLYF K+G
Sbjct: 31 GNYVFNVQHKFAG---KERSLSALKQHDARRHRRILSAVDLPLGGNGHPAEAGLYFAKIG 87
Query: 89 LGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFC 148
LG P +YYVQVDTGSD+LWVNCA C +CPTKSDLG+KLTL+DP S+++ I C D+FC
Sbjct: 88 LGNPPKDYYVQVDTGSDILWVNCANCDKCPTKSDLGVKLTLYDPQSSTSATRIYCDDDFC 147
Query: 149 RTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGC 208
TYN C+ + C+Y V YGDGSST+G+FV+D +Q ++ +GNL+T+ N SVIFGC
Sbjct: 148 AATYNGVLQGCTKDLPCQYSVVYGDGSSTAGFFVKDNLQFDRVTGNLQTSSANGSVIFGC 207
Query: 209 GNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGD 268
G +QSG+LG+S++ A+DGILGFGQANSS++SQLAAAG V++ FAHCLD VKGGGIFAIG+
Sbjct: 208 GAKQSGELGTSSE-ALDGILGFGQANSSMISQLAAAGKVKRVFAHCLDNVKGGGIFAIGE 266
Query: 269 VVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPP 328
VVSPKV TTPMVPN PHYNV+++E+EVGGN L+LPT + TGD RGTIIDSGTTLAYLP
Sbjct: 267 VVSPKVNTTPMVPNQPHYNVVMKEIEVGGNVLELPTDIFDTGDRRGTIIDSGTTLAYLPE 326
Query: 329 MLYDLVLSQ 337
++Y+ ++++
Sbjct: 327 VVYESMMTK 335
>gi|359476754|ref|XP_002277058.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 1 [Vitis
vinifera]
Length = 561
Score = 417 bits (1072), Expect = e-114, Method: Compositional matrix adjust.
Identities = 196/315 (62%), Positives = 250/315 (79%), Gaps = 9/315 (2%)
Query: 27 VMGNFVFEVENKFKAGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTK 86
V GN VF V++KFK R ++L AL+ HDTRRHGR+++++DL LGGNGHPS GLYF K
Sbjct: 102 VSGNAVFRVQHKFKG---RGKSLDALRAHDTRRHGRILSAVDLPLGGNGHPSEAGLYFAK 158
Query: 87 VGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDN 146
+G+GTP+ +YYVQVDTGSD+LWVNCAGC RCPTKSDLG+ LTL+D S+TS + C DN
Sbjct: 159 IGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMKASTTSDAVGCDDN 218
Query: 147 FCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIF 206
FC + Y+ P C PG++C Y V YGDGSST+GYFV+D +Q N+ SGN +T P N +V+F
Sbjct: 219 FC-SLYDGPLPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGNFQTTPTNGTVVF 277
Query: 207 GCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAI 266
GCGN+QSG+LGSS++ A+DGILGFGQANSS+LSQLA++G V+K F+HCLD V GGGIFAI
Sbjct: 278 GCGNKQSGELGSSSE-ALDGILGFGQANSSMLSQLASSGKVKKVFSHCLDNVDGGGIFAI 336
Query: 267 GDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYL 326
G+VV PKV TP+V N HYNV+++E+EVGG+PLD+P+ +GD +GTIIDSGTTLAY
Sbjct: 337 GEVVEPKVNITPLVQNQAHYNVVMKEIEVGGDPLDVPSDAFESGDRKGTIIDSGTTLAYF 396
Query: 327 PPMLY----DLVLSQ 337
P +Y + +LSQ
Sbjct: 397 PQEVYVPLIEKILSQ 411
>gi|359476756|ref|XP_002277082.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 2 [Vitis
vinifera]
Length = 560
Score = 417 bits (1071), Expect = e-114, Method: Compositional matrix adjust.
Identities = 196/315 (62%), Positives = 250/315 (79%), Gaps = 9/315 (2%)
Query: 27 VMGNFVFEVENKFKAGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTK 86
V GN VF V++KFK R ++L AL+ HDTRRHGR+++++DL LGGNGHPS GLYF K
Sbjct: 102 VSGNAVFRVQHKFKG---RGKSLDALRAHDTRRHGRILSAVDLPLGGNGHPSEAGLYFAK 158
Query: 87 VGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDN 146
+G+GTP+ +YYVQVDTGSD+LWVNCAGC RCPTKSDLG+ LTL+D S+TS + C DN
Sbjct: 159 IGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMKASTTSDAVGCDDN 218
Query: 147 FCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIF 206
FC + Y+ P C PG++C Y V YGDGSST+GYFV+D +Q N+ SGN +T P N +V+F
Sbjct: 219 FC-SLYDGPLPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGNFQTTPTNGTVVF 277
Query: 207 GCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAI 266
GCGN+QSG+LGSS++ A+DGILGFGQANSS+LSQLA++G V+K F+HCLD V GGGIFAI
Sbjct: 278 GCGNKQSGELGSSSE-ALDGILGFGQANSSMLSQLASSGKVKKVFSHCLDNVDGGGIFAI 336
Query: 267 GDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYL 326
G+VV PKV TP+V N HYNV+++E+EVGG+PLD+P+ +GD +GTIIDSGTTLAY
Sbjct: 337 GEVVEPKVNITPLVQNQAHYNVVMKEIEVGGDPLDVPSDAFESGDRKGTIIDSGTTLAYF 396
Query: 327 PPMLY----DLVLSQ 337
P +Y + +LSQ
Sbjct: 397 PQEVYVPLIEKILSQ 411
>gi|297735249|emb|CBI17611.3| unnamed protein product [Vitis vinifera]
Length = 480
Score = 417 bits (1071), Expect = e-114, Method: Compositional matrix adjust.
Identities = 196/315 (62%), Positives = 250/315 (79%), Gaps = 9/315 (2%)
Query: 27 VMGNFVFEVENKFKAGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTK 86
V GN VF V++KFK R ++L AL+ HDTRRHGR+++++DL LGGNGHPS GLYF K
Sbjct: 21 VSGNAVFRVQHKFKG---RGKSLDALRAHDTRRHGRILSAVDLPLGGNGHPSEAGLYFAK 77
Query: 87 VGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDN 146
+G+GTP+ +YYVQVDTGSD+LWVNCAGC RCPTKSDLG+ LTL+D S+TS + C DN
Sbjct: 78 IGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMKASTTSDAVGCDDN 137
Query: 147 FCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIF 206
FC + Y+ P C PG++C Y V YGDGSST+GYFV+D +Q N+ SGN +T P N +V+F
Sbjct: 138 FC-SLYDGPLPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGNFQTTPTNGTVVF 196
Query: 207 GCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAI 266
GCGN+QSG+LGSS++ A+DGILGFGQANSS+LSQLA++G V+K F+HCLD V GGGIFAI
Sbjct: 197 GCGNKQSGELGSSSE-ALDGILGFGQANSSMLSQLASSGKVKKVFSHCLDNVDGGGIFAI 255
Query: 267 GDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYL 326
G+VV PKV TP+V N HYNV+++E+EVGG+PLD+P+ +GD +GTIIDSGTTLAY
Sbjct: 256 GEVVEPKVNITPLVQNQAHYNVVMKEIEVGGDPLDVPSDAFESGDRKGTIIDSGTTLAYF 315
Query: 327 PPMLY----DLVLSQ 337
P +Y + +LSQ
Sbjct: 316 PQEVYVPLIEKILSQ 330
>gi|449442641|ref|XP_004139089.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 478
Score = 400 bits (1029), Expect = e-109, Method: Compositional matrix adjust.
Identities = 188/302 (62%), Positives = 235/302 (77%), Gaps = 4/302 (1%)
Query: 30 NFVFEVENKFKAGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGL 89
N VFEV++KFK RER+L+ALK HD RRHGR+++ IDLELGGNGHP+ TGLY+ ++G+
Sbjct: 23 NLVFEVQHKFKG---RERSLNALKSHDVRRHGRLLSVIDLELGGNGHPAETGLYYARIGI 79
Query: 90 GTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCR 149
G+P ++++VQVDTGSD+LWVNC GCS CP KSD+G+ L L++P SSTS I C FC
Sbjct: 80 GSPPNDFHVQVDTGSDILWVNCVGCSNCPKKSDIGVDLQLYNPKSSSTSTLITCDQPFCS 139
Query: 150 TTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCG 209
TY+ P C P + C+Y V YGDGS+T+GYFV D IQL +A GN KT+ N S++FGCG
Sbjct: 140 ATYDAPIPGCKPDLLCQYKVIYGDGSATAGYFVNDYIQLQRAVGNHKTSETNGSIVFGCG 199
Query: 210 NRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDV 269
+QSG+LGSS++ A+DGILGFGQANSS++SQLAA G V+K FAHCLD + GGGIFAIG+V
Sbjct: 200 AKQSGELGSSSE-ALDGILGFGQANSSMISQLAATGKVKKIFAHCLDSISGGGIFAIGEV 258
Query: 270 VSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPM 329
V PK+KTTP+VPN HYNV+L V+VG LDLP L T +RG IIDSGTTLAYLP
Sbjct: 259 VEPKLKTTPVVPNQAHYNVVLNGVKVGDTALDLPLGLFETSYKRGAIIDSGTTLAYLPDS 318
Query: 330 LY 331
+Y
Sbjct: 319 IY 320
>gi|147819672|emb|CAN76394.1| hypothetical protein VITISV_020864 [Vitis vinifera]
Length = 507
Score = 400 bits (1028), Expect = e-109, Method: Compositional matrix adjust.
Identities = 194/323 (60%), Positives = 248/323 (76%), Gaps = 17/323 (5%)
Query: 27 VMGNFVFEVENKFKAGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTK 86
V GN VF V++KFK R ++L AL+ HDTRRHGR+++++DL LGGNGHPS GLYF K
Sbjct: 25 VSGNAVFRVQHKFKG---RGKSLDALRAHDTRRHGRILSAVDLPLGGNGHPSEAGLYFAK 81
Query: 87 VGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDN 146
+G+GTP+ +YYVQVDTGSD+LWVNCAGC RCPTKSDLG+ LTL+D S+TS + C DN
Sbjct: 82 IGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMKASTTSDAVGCDDN 141
Query: 147 FCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIF 206
FC + Y+ P C PG++C Y V YGDGSST+GYFV+D +Q N+ SGN +T P N +V+F
Sbjct: 142 FC-SLYDGPLPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGNFQTTPTNGTVVF 200
Query: 207 GCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAI 266
GCGN+QSG+LGSS++ A+DGILGFGQANSS+LSQLA++G V+K F+HCLD V GGGIFAI
Sbjct: 201 GCGNKQSGELGSSSE-ALDGILGFGQANSSMLSQLASSGKVKKVFSHCLDNVDGGGIFAI 259
Query: 267 GDVVSPKVKTTPMVPNM--------PHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIID 318
G+VV PKV+ M M HYNV+++E+EVGG+PLD+P+ +GD +GTIID
Sbjct: 260 GEVVEPKVRFLLMNSVMIVVLFLSRAHYNVVMKEIEVGGDPLDVPSDAFESGDRKGTIID 319
Query: 319 SGTTLAYLPPMLY----DLVLSQ 337
SGTTLAY P +Y + +LSQ
Sbjct: 320 SGTTLAYFPQEVYVPLIEKILSQ 342
>gi|449476186|ref|XP_004154665.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
2-like [Cucumis sativus]
Length = 478
Score = 397 bits (1019), Expect = e-108, Method: Compositional matrix adjust.
Identities = 186/302 (61%), Positives = 233/302 (77%), Gaps = 4/302 (1%)
Query: 30 NFVFEVENKFKAGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGL 89
N VFEV++KFK RER+L+ALK HD RRHGR+++ IDLELGGNGHP+ TGLY+ ++G+
Sbjct: 23 NLVFEVQHKFKG---RERSLNALKSHDVRRHGRLLSVIDLELGGNGHPAETGLYYARIGI 79
Query: 90 GTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCR 149
G+P ++++VQVDTGSD+LWVNC GCS CP KSD+G+ L L++P SSTS I C FC
Sbjct: 80 GSPPNDFHVQVDTGSDILWVNCVGCSNCPKKSDIGVDLQLYNPKSSSTSTLITCDQPFCS 139
Query: 150 TTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCG 209
TY+ P C P + C+Y V YGDGS+T+GYFV D IQL +A GN KT+ N S++FGCG
Sbjct: 140 ATYDAPIPGCKPDLLCQYKVIYGDGSATAGYFVNDYIQLQRAVGNHKTSETNGSIVFGCG 199
Query: 210 NRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDV 269
+QSG+LGSS++ A+DGILGFGQANSS++SQLAA G V+K FAHCLD + GGGIFAIG+V
Sbjct: 200 AKQSGELGSSSE-ALDGILGFGQANSSMISQLAATGKVKKIFAHCLDSISGGGIFAIGEV 258
Query: 270 VSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPM 329
V PK+ TP+VPN HYNV+L V+VG LDLP L T +RG IIDSGTTLAYLP
Sbjct: 259 VEPKLXNTPVVPNQAHYNVVLNGVKVGDTALDLPLGLFETSYKRGAIIDSGTTLAYLPES 318
Query: 330 LY 331
+Y
Sbjct: 319 IY 320
>gi|356535252|ref|XP_003536162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 475
Score = 394 bits (1012), Expect = e-107, Method: Compositional matrix adjust.
Identities = 189/332 (56%), Positives = 246/332 (74%), Gaps = 13/332 (3%)
Query: 6 LLALVVVTVAVVHQWAVGGGGVMGNFVFEVENKFKAGGERERTLSALKQHDTRRHGRMMA 65
+L LV + VA + G GNFVF VE R+R+L+A+K HD RR GR+++
Sbjct: 6 VLILVAILVAEI------GCIANGNFVFPVE-------RRKRSLNAVKAHDARRRGRILS 52
Query: 66 SIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGI 125
++DL LGGNG P+ TGLYFTK+GLG+P +YYVQVDTGSD+LWVNC CSRCP KSDLGI
Sbjct: 53 AVDLNLGGNGLPTETGLYFTKLGLGSPPKDYYVQVDTGSDILWVNCVKCSRCPRKSDLGI 112
Query: 126 KLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDI 185
LTL+DP S TS I+C FC TY+ P C + C Y +TYGDGS+T+GY+V+D
Sbjct: 113 DLTLYDPKGSETSELISCDQEFCSATYDGPIPGCKSEIPCPYSITYGDGSATTGYYVQDY 172
Query: 186 IQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAG 245
+ N + NL+TAP NSS+IFGCG QSG L SS++ A+DGI+GFGQ+NSS+LSQLAA+G
Sbjct: 173 LTYNHVNDNLRTAPQNSSIIFGCGAVQSGTLSSSSEEALDGIIGFGQSNSSVLSQLAASG 232
Query: 246 NVRKEFAHCLDVVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTS 305
V+K F+HCLD ++GGGIFAIG+VV PKV TTP+VP M HYNV+L+ +EV + L LP+
Sbjct: 233 KVKKIFSHCLDNIRGGGIFAIGEVVEPKVSTTPLVPRMAHYNVVLKSIEVDTDILQLPSD 292
Query: 306 LLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQ 337
+ +G+ +GTIIDSGTTLAYLP ++YD ++ +
Sbjct: 293 IFDSGNGKGTIIDSGTTLAYLPAIVYDELIPK 324
>gi|363808270|ref|NP_001242239.1| uncharacterized protein LOC100801883 [Glycine max]
gi|255641727|gb|ACU21134.1| unknown [Glycine max]
Length = 475
Score = 391 bits (1004), Expect = e-106, Method: Compositional matrix adjust.
Identities = 187/328 (57%), Positives = 248/328 (75%), Gaps = 9/328 (2%)
Query: 10 VVVTVAVVHQWAVGGGGVMGNFVFEVENKFKAGGERERTLSALKQHDTRRHGRMMASIDL 69
V++ VAV+ A G GN VF VE R+R+LSA++ HD RR GR+++++DL
Sbjct: 6 VLILVAVLG--AEIGSVANGNLVFPVE-------RRKRSLSAVRAHDVRRRGRILSAVDL 56
Query: 70 ELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTL 129
LGGNG P+ TGLYFTK+GLG+P +YYVQVDTGSD+LWVNC CSRCP KSDLGI LTL
Sbjct: 57 NLGGNGLPTETGLYFTKLGLGSPPRDYYVQVDTGSDILWVNCVECSRCPRKSDLGIDLTL 116
Query: 130 FDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLN 189
+DP S TS ++C +FC T++ P C + C Y +TYGDGS+T+GY+V+D + N
Sbjct: 117 YDPKGSETSDVVSCDQDFCSATFDGPIPGCKSEIPCPYSITYGDGSATTGYYVQDYLTYN 176
Query: 190 QASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRK 249
+ +GNL+T+P NSS+IFGCG QSG LGSS++ A+DGI+GFGQANSS+LSQLAA+G V+K
Sbjct: 177 RINGNLRTSPQNSSIIFGCGAVQSGTLGSSSEEALDGIIGFGQANSSVLSQLAASGKVKK 236
Query: 250 EFAHCLDVVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGT 309
F+HCLD V+GGGIFAIG+VV PKV TTP+VP M HYNV+L+ +EV + L LP+ + +
Sbjct: 237 IFSHCLDNVRGGGIFAIGEVVEPKVSTTPLVPRMAHYNVVLKSIEVDTDILQLPSDIFDS 296
Query: 310 GDERGTIIDSGTTLAYLPPMLYDLVLSQ 337
+ +GT+IDSGTTLAYLP ++YD ++ +
Sbjct: 297 VNGKGTVIDSGTTLAYLPDIVYDELIQK 324
>gi|222622847|gb|EEE56979.1| hypothetical protein OsJ_06707 [Oryza sativa Japonica Group]
Length = 494
Score = 384 bits (987), Expect = e-104, Method: Compositional matrix adjust.
Identities = 185/306 (60%), Positives = 233/306 (76%), Gaps = 2/306 (0%)
Query: 32 VFEVENKFKAGGER-ERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLG 90
VFEV+ KF G+ E LSAL++HD RRHGR++A+IDL LGG+G + TGLYFT++G+G
Sbjct: 38 VFEVQRKFTRHGDGGEGHLSALREHDGRRHGRLLAAIDLPLGGSGLATETGLYFTRIGIG 97
Query: 91 TPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRT 150
TP YYVQVDTGSD+LWVNC C CP KS+LGI+LT++DP S + + C FC
Sbjct: 98 TPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIELTMYDPRGSQSGELVTCDQQFCVA 157
Query: 151 TYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGN 210
Y PSC+ CEY ++YGDGSST+G+FV D +Q NQ SG+ +T P N+SV FGCG
Sbjct: 158 NYGGVLPSCTSTSPCEYSISYGDGSSTAGFFVTDFLQYNQVSGDGQTTPANASVSFGCGA 217
Query: 211 RQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVV 270
+ GDLGSS + A+DGILGFGQ+NSS+LSQLAAAG VRK FAHCLD V GGGIFAIG+VV
Sbjct: 218 KLGGDLGSS-NLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCLDTVNGGGIFAIGNVV 276
Query: 271 SPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPML 330
PKVKTTP+VP+MPHYNVIL+ ++VGG L LPT++ +G+ +GTIIDSGTTLAY+P +
Sbjct: 277 QPKVKTTPLVPDMPHYNVILKGIDVGGTALGLPTNIFDSGNSKGTIIDSGTTLAYVPEGV 336
Query: 331 YDLVLS 336
Y + +
Sbjct: 337 YKALFA 342
>gi|115446115|ref|NP_001046837.1| Os02g0473200 [Oryza sativa Japonica Group]
gi|47497549|dbj|BAD19621.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
Group]
gi|47847591|dbj|BAD21978.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
Group]
gi|113536368|dbj|BAF08751.1| Os02g0473200 [Oryza sativa Japonica Group]
Length = 494
Score = 384 bits (987), Expect = e-104, Method: Compositional matrix adjust.
Identities = 185/306 (60%), Positives = 233/306 (76%), Gaps = 2/306 (0%)
Query: 32 VFEVENKFKAGGER-ERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLG 90
VFEV+ KF G+ E LSAL++HD RRHGR++A+IDL LGG+G + TGLYFT++G+G
Sbjct: 38 VFEVQRKFTRHGDGGEGHLSALREHDGRRHGRLLAAIDLPLGGSGLATETGLYFTRIGIG 97
Query: 91 TPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRT 150
TP YYVQVDTGSD+LWVNC C CP KS+LGI+LT++DP S + + C FC
Sbjct: 98 TPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIELTMYDPRGSQSGELVTCDQQFCVA 157
Query: 151 TYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGN 210
Y PSC+ CEY ++YGDGSST+G+FV D +Q NQ SG+ +T P N+SV FGCG
Sbjct: 158 NYGGVLPSCTSTSPCEYSISYGDGSSTAGFFVTDFLQYNQVSGDGQTTPANASVSFGCGA 217
Query: 211 RQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVV 270
+ GDLGSS + A+DGILGFGQ+NSS+LSQLAAAG VRK FAHCLD V GGGIFAIG+VV
Sbjct: 218 KLGGDLGSS-NLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCLDTVNGGGIFAIGNVV 276
Query: 271 SPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPML 330
PKVKTTP+VP+MPHYNVIL+ ++VGG L LPT++ +G+ +GTIIDSGTTLAY+P +
Sbjct: 277 QPKVKTTPLVPDMPHYNVILKGIDVGGTALGLPTNIFDSGNSKGTIIDSGTTLAYVPEGV 336
Query: 331 YDLVLS 336
Y + +
Sbjct: 337 YKALFA 342
>gi|326512066|dbj|BAJ96014.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 485
Score = 383 bits (984), Expect = e-104, Method: Compositional matrix adjust.
Identities = 185/307 (60%), Positives = 230/307 (74%), Gaps = 2/307 (0%)
Query: 32 VFEVENKFKAGGERERTLSALKQHDTRRHGR-MMASIDLELGGNGHPSATGLYFTKVGLG 90
VFEV KF + L+ L+ HD RRHGR + A++DL LGGNG P+ TGLYFT++G+G
Sbjct: 29 VFEVRRKFPRHDGSGKHLANLRAHDARRHGRSLAAAVDLPLGGNGLPTETGLYFTQIGIG 88
Query: 91 TPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRT 150
TP YYVQVDTGSD+LWVNC C CP KS LGI+LTL+DPS SS+ + C +FC
Sbjct: 89 TPAKSYYVQVDTGSDILWVNCVFCDTCPRKSGLGIELTLYDPSGSSSGTGVTCGQDFCVA 148
Query: 151 TYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGN 210
T+ PSC P C+Y ++YGDGSST+G+FV D +Q NQ SGN +T N+S+ FGCG
Sbjct: 149 THGGVIPSCVPAAPCQYSISYGDGSSTTGFFVTDFLQYNQVSGNSQTTLANTSITFGCGA 208
Query: 211 RQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVV 270
+ GDLGSS+ A+DGILGFGQ+NSS+LSQLAAAG VRK FAHCLD + GGGIFAIGDVV
Sbjct: 209 KIGGDLGSSSQ-ALDGILGFGQSNSSMLSQLAAAGKVRKVFAHCLDTINGGGIFAIGDVV 267
Query: 271 SPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPML 330
PKV TTP+VP MPHYNV LE ++VGG L LPT++ G+ +GTIIDSGTTLAYLP ++
Sbjct: 268 QPKVSTTPLVPGMPHYNVNLEAIDVGGVKLQLPTNIFDIGESKGTIIDSGTTLAYLPGVV 327
Query: 331 YDLVLSQ 337
Y+ ++S+
Sbjct: 328 YNAIMSK 334
>gi|449442281|ref|XP_004138910.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449506266|ref|XP_004162699.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 482
Score = 382 bits (982), Expect = e-104, Method: Compositional matrix adjust.
Identities = 183/308 (59%), Positives = 240/308 (77%), Gaps = 4/308 (1%)
Query: 30 NFVFEVENKFKAGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGL 89
N V +V++KFK RER+L A K HD +R GR +++IDL+LGGNGHPS +GLYF K+GL
Sbjct: 24 NLVLKVQHKFKG---RERSLEAFKAHDIQRRGRFLSAIDLQLGGNGHPSESGLYFAKIGL 80
Query: 90 GTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCR 149
GTP +YYVQVDTGSD+LWVNCAGC+ CP KSDLGI+L+L+ PS SSTS + C+ +FC
Sbjct: 81 GTPVQDYYVQVDTGSDILWVNCAGCTNCPKKSDLGIELSLYSPSSSSTSNRVTCNQDFCT 140
Query: 150 TTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCG 209
+TY+ P C+P + CEY V YGDGSST+GYFVRD + L++ +GN +T N S++FGCG
Sbjct: 141 STYDGPIPGCTPELLCEYRVAYGDGSSTAGYFVRDHVVLDRVTGNFQTTSTNGSIVFGCG 200
Query: 210 NRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDV 269
+QSG LG +T AA+DGILGFGQANSS++SQLA++G V++ FAHCLD + GGGIFAIG+V
Sbjct: 201 AQQSGQLG-ATSAALDGILGFGQANSSMISQLASSGKVKRVFAHCLDNINGGGIFAIGEV 259
Query: 270 VSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPM 329
V PKV+TTP+VP HYNV ++ +EV L+LPT + T +GTIIDSGTTLAY P +
Sbjct: 260 VQPKVRTTPLVPQQAHYNVFMKAIEVDNEVLNLPTDVFDTDLRKGTIIDSGTTLAYFPDV 319
Query: 330 LYDLVLSQ 337
+Y+ ++S+
Sbjct: 320 IYEPLISK 327
>gi|356531884|ref|XP_003534506.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 482
Score = 382 bits (980), Expect = e-103, Method: Compositional matrix adjust.
Identities = 187/333 (56%), Positives = 240/333 (72%), Gaps = 9/333 (2%)
Query: 5 RLLALVVVTVAVVHQWAVGGGGVMGNFVFEVENKFKAGGERERTLSALKQHDTRRHGRMM 64
RL+ LVV VV N VF V KFK E L+A+K HD R GR +
Sbjct: 6 RLVRLVVSLFVVVQLCCHANA----NMVFPVVRKFKGPAEN---LAAIKAHDAGRRGRFL 58
Query: 65 ASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLG 124
+ +DL LGGNG P++TGLY+TK+GLG P D YYVQVDTGSD LWVNC GC+ CP KS LG
Sbjct: 59 SVVDLALGGNGRPTSTGLYYTKIGLG-PND-YYVQVDTGSDTLWVNCVGCTTCPKKSGLG 116
Query: 125 IKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRD 184
++LTL+DP+ S TS + C D FC +TY+ C + C Y +TYGDGS+TSG +++D
Sbjct: 117 MELTLYDPNSSKTSKVVPCDDEFCTSTYDGPISGCKKDMSCPYSITYGDGSTTSGSYIKD 176
Query: 185 IIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAA 244
+ ++ G+L+T P N+SVIFGCG++QSG L S+TD ++DGI+GFGQANSS+LSQLAAA
Sbjct: 177 DLTFDRVVGDLRTVPDNTSVIFGCGSKQSGTLSSTTDTSLDGIIGFGQANSSVLSQLAAA 236
Query: 245 GNVRKEFAHCLDVVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPT 304
G V++ F+HCLD V GGGIFAIG+VV PKVKTTP+VP M HYNV+L+++EV G+P+ LPT
Sbjct: 237 GKVKRVFSHCLDTVNGGGIFAIGEVVQPKVKTTPLVPRMAHYNVVLKDIEVAGDPIQLPT 296
Query: 305 SLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQ 337
+ + RGTIIDSGTTLAYLP +YD +L +
Sbjct: 297 DIFDSTSGRGTIIDSGTTLAYLPVSIYDQLLEK 329
>gi|218190722|gb|EEC73149.1| hypothetical protein OsI_07179 [Oryza sativa Indica Group]
Length = 494
Score = 381 bits (979), Expect = e-103, Method: Compositional matrix adjust.
Identities = 184/306 (60%), Positives = 232/306 (75%), Gaps = 2/306 (0%)
Query: 32 VFEVENKFKAGGER-ERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLG 90
VFEV+ KF G+ E LSAL++HD RRHGR++A+IDL LGG+G + TGLYFT++G+G
Sbjct: 38 VFEVQRKFTRHGDGGEGHLSALREHDGRRHGRLLAAIDLPLGGSGLATETGLYFTRIGIG 97
Query: 91 TPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRT 150
TP YYVQVDTGSD+LWVNC C CP KS+LGI+LT++DP S + + C FC
Sbjct: 98 TPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIELTMYDPRGSQSGELVTCDQQFCVA 157
Query: 151 TYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGN 210
Y PSC+ CEY ++YGDGSST+G+FV D +Q NQ SG+ +T P N+SV FGCG
Sbjct: 158 NYGGVLPSCTSTSPCEYSISYGDGSSTAGFFVTDFLQYNQVSGDGQTTPANASVSFGCGA 217
Query: 211 RQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVV 270
+ GDLGSS + A+DGILGFGQ+NSS+LSQLAAAG VRK FAHCLD V GGGIFAIG+VV
Sbjct: 218 KLGGDLGSS-NLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCLDTVNGGGIFAIGNVV 276
Query: 271 SPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPML 330
PKVKTTP+V +MPHYNVIL+ ++VGG L LPT++ +G+ +GTIIDSGTTLAY+P +
Sbjct: 277 QPKVKTTPLVSDMPHYNVILKGIDVGGTALGLPTNIFDSGNSKGTIIDSGTTLAYVPEGV 336
Query: 331 YDLVLS 336
Y + +
Sbjct: 337 YKALFA 342
>gi|242057809|ref|XP_002458050.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
gi|241930025|gb|EES03170.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
Length = 489
Score = 378 bits (970), Expect = e-102, Method: Compositional matrix adjust.
Identities = 181/307 (58%), Positives = 229/307 (74%), Gaps = 3/307 (0%)
Query: 32 VFEVENKFKAG--GERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGL 89
VF+V KF AG G +SAL+ HD RRHGR++A+ DL LGG G P+ TGLYFT++ L
Sbjct: 31 VFQVRRKFPAGVGGGASANISALRVHDGRRHGRLLAAADLPLGGLGLPTDTGLYFTEIKL 90
Query: 90 GTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCR 149
GTP YYVQVDTGSD+LWVNC C +CP KS LG+ LT +DP SS+ ++C FC
Sbjct: 91 GTPPKRYYVQVDTGSDILWVNCISCEKCPRKSGLGLDLTFYDPKASSSGSTVSCDQGFCA 150
Query: 150 TTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCG 209
TY + P C+ V CEY V YGDGSST+G+FV D +Q +Q +G+ +T P N++V FGCG
Sbjct: 151 ATYGGKLPGCTANVPCEYSVMYGDGSSTTGFFVTDALQFDQVTGDGQTQPGNATVTFGCG 210
Query: 210 NRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDV 269
+Q GDLGSS + A+DGILGFGQAN+S+LSQLAAAG V+K FAHCLD +KGGGIFAIG+V
Sbjct: 211 AQQGGDLGSS-NQALDGILGFGQANTSMLSQLAAAGKVKKIFAHCLDTIKGGGIFAIGNV 269
Query: 270 VSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPM 329
V PKVKTTP+V +MPHYNV L+ ++VGG L LP + TG+ +GTIIDSGTTL YLP +
Sbjct: 270 VQPKVKTTPLVADMPHYNVNLKSIDVGGTTLQLPAHVFETGERKGTIIDSGTTLTYLPEL 329
Query: 330 LYDLVLS 336
++ V++
Sbjct: 330 VFKEVMA 336
>gi|357507803|ref|XP_003624190.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355499205|gb|AES80408.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 476
Score = 378 bits (970), Expect = e-102, Method: Compositional matrix adjust.
Identities = 182/309 (58%), Positives = 228/309 (73%), Gaps = 3/309 (0%)
Query: 29 GNFVFEVENKFKAGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVG 88
N VF V+ KF R+L A+K HD RR GR +A+ID+ LGGNG PS+TGLY+TKVG
Sbjct: 21 ANLVFPVQRKFNG---PHRSLDAIKAHDDRRRGRFLAAIDVPLGGNGLPSSTGLYYTKVG 77
Query: 89 LGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFC 148
LG+P E+YVQVDTGSD+LWVNCAGC+ CP KS LG+ LTL+DP+ S TS + C D FC
Sbjct: 78 LGSPAKEFYVQVDTGSDILWVNCAGCTACPKKSGLGMDLTLYDPNGSKTSNAVPCGDGFC 137
Query: 149 RTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGC 208
TY+ C + C Y +TYGDGS+TSG FV D + ++ SGNL T P NSSVIFGC
Sbjct: 138 TDTYSGPISGCKQDMSCPYSITYGDGSTTSGSFVNDSLTFDEVSGNLHTKPDNSSVIFGC 197
Query: 209 GNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGD 268
G +QSG L S++D A+DGI+GFGQANSS+LSQLAA+G V++ F+HCLD GGGIF+IG
Sbjct: 198 GAKQSGSLSSNSDEALDGIIGFGQANSSVLSQLAASGKVKRIFSHCLDSHHGGGIFSIGQ 257
Query: 269 VVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPP 328
V+ PK TTP+VP M HYNVIL++++V G P+ LP L +G RGTIIDSGTTLAYLP
Sbjct: 258 VMEPKFNTTPLVPRMAHYNVILKDMDVDGEPILLPLYLFDSGSGRGTIIDSGTTLAYLPL 317
Query: 329 MLYDLVLSQ 337
+Y+ +L +
Sbjct: 318 SIYNQLLPK 326
>gi|388518257|gb|AFK47190.1| unknown [Lotus japonicus]
Length = 478
Score = 377 bits (968), Expect = e-102, Method: Compositional matrix adjust.
Identities = 174/309 (56%), Positives = 232/309 (75%), Gaps = 7/309 (2%)
Query: 29 GNFVFEVENKFKAGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVG 88
N VF V+ R+ +L+ +K HD+ R GR+++++D LGGNG P+ TGLYFTK+G
Sbjct: 22 ANLVFPVQ-------RRQASLTGIKAHDSSRRGRILSAVDFNLGGNGLPTVTGLYFTKIG 74
Query: 89 LGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFC 148
LG+P+ +YYVQVDTGSD+LWVNC C+RCP KSD+GI LTL+DP +S TS ++C NFC
Sbjct: 75 LGSPSKDYYVQVDTGSDILWVNCVECTRCPRKSDIGIGLTLYDPKRSKTSEFVSCEHNFC 134
Query: 149 RTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGC 208
+TY R C C Y ++YGDGS+T+GY+V+D + N+ +GN TA NSS+IFGC
Sbjct: 135 SSTYEGRILGCKAENPCPYSISYGDGSATTGYYVQDYLTFNRVNGNPHTATQNSSIIFGC 194
Query: 209 GNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGD 268
G QSG SS++ A+DGI+GFGQANSS+LSQLAA+G V+K F+HCLD GGGIF+IG+
Sbjct: 195 GAAQSGTFASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCLDTNVGGGIFSIGE 254
Query: 269 VVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPP 328
VV PKVKTTP+VPNM HYNVIL+ +EV G+ L LP+ + + +GT+IDSGTTLAYLP
Sbjct: 255 VVEPKVKTTPLVPNMAHYNVILKNIEVDGDILQLPSDTFDSENGKGTVIDSGTTLAYLPR 314
Query: 329 MLYDLVLSQ 337
++YD ++S+
Sbjct: 315 IVYDQLMSK 323
>gi|357148754|ref|XP_003574882.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 488
Score = 374 bits (961), Expect = e-101, Method: Compositional matrix adjust.
Identities = 183/331 (55%), Positives = 241/331 (72%), Gaps = 7/331 (2%)
Query: 7 LALVVVTVAVVHQWAVGGGGVMGNFVFEVENKF-KAGGERERTLSALKQHDTRRHGRMMA 65
+ L+ + +AVV VG V F+V KF + G + ++A HD+ R GR++A
Sbjct: 11 VVLMAMLLAVVSSHGVGATSV-----FQVRRKFPRLGSKGGGDITAHLTHDSNRRGRLLA 65
Query: 66 SIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGI 125
+ D+ LGG G P+ TGLY+T++ +GTP +Y+VQVDTGSD+LWVNC C++CP KSDLGI
Sbjct: 66 AADVPLGGLGLPTDTGLYYTEIEIGTPPKQYHVQVDTGSDILWVNCISCNKCPRKSDLGI 125
Query: 126 KLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDI 185
L L+DP SS+ ++C FC TY + P C+ + CEY V YGDGSST+GYFV D
Sbjct: 126 DLRLYDPKGSSSGSTVSCDQKFCAATYGGKLPGCAKNIPCEYSVMYGDGSSTTGYFVSDS 185
Query: 186 IQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAG 245
+Q NQ SG+ +T N+SVIFGCG +Q GDLG ST+ A+DGI+GFGQ+N+S+LSQLAAAG
Sbjct: 186 LQYNQVSGDGQTRHANASVIFGCGAQQGGDLG-STNQALDGIIGFGQSNTSMLSQLAAAG 244
Query: 246 NVRKEFAHCLDVVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTS 305
V+K F+HCLD +KGGGIFAIGDVV PKVK+TP+VP+MPHYNV LE + VGG L LP+
Sbjct: 245 EVKKIFSHCLDTIKGGGIFAIGDVVQPKVKSTPLVPDMPHYNVNLESINVGGTTLQLPSH 304
Query: 306 LLGTGDERGTIIDSGTTLAYLPPMLYDLVLS 336
+ TG+++GTIIDSGTTL YLP ++Y VL+
Sbjct: 305 MFETGEKKGTIIDSGTTLTYLPELVYKDVLA 335
>gi|356568507|ref|XP_003552452.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 481
Score = 374 bits (959), Expect = e-101, Method: Compositional matrix adjust.
Identities = 176/310 (56%), Positives = 233/310 (75%), Gaps = 5/310 (1%)
Query: 29 GNFVFEVENKFKAGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVG 88
N VF V KFK E L+A+K HD R GR ++ +D+ LGGNG P++ GLY+TK+G
Sbjct: 25 ANLVFPVVRKFKGPVEN---LAAIKAHDAGRRGRFLSVVDVALGGNGRPTSNGLYYTKIG 81
Query: 89 LGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFC 148
LG P D YYVQVDTGSD LWVNC GC+ CP KS LG+ LTL+DP+ S TS + C D FC
Sbjct: 82 LG-PKD-YYVQVDTGSDTLWVNCVGCTACPKKSGLGMDLTLYDPNLSKTSKAVPCDDEFC 139
Query: 149 RTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGC 208
+TY+ + C+ G+ C Y +TYGDGS+TSG +++D + ++ G+L+T P N+SVIFGC
Sbjct: 140 TSTYDGQISGCTKGMSCPYSITYGDGSTTSGSYIKDDLTFDRVVGDLRTVPDNTSVIFGC 199
Query: 209 GNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGD 268
G++QSG L S+TD ++DGI+GFGQANSS+LSQLAAAG V++ F+HCLD + GGGIFAIG+
Sbjct: 200 GSKQSGTLSSTTDTSLDGIIGFGQANSSVLSQLAAAGKVKRIFSHCLDSISGGGIFAIGE 259
Query: 269 VVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPP 328
VV PKVKTTP++ M HYNV+L+++EV G+P+ LP+ +L + RGTIIDSGTTLAYLP
Sbjct: 260 VVQPKVKTTPLLQGMAHYNVVLKDIEVAGDPIQLPSDILDSSSGRGTIIDSGTTLAYLPV 319
Query: 329 MLYDLVLSQF 338
+YD +L +
Sbjct: 320 SIYDQLLEKI 329
>gi|38344991|emb|CAE01597.2| OSJNBa0008A08.5 [Oryza sativa Japonica Group]
gi|116309515|emb|CAH66581.1| OSIGBa0137O04.7 [Oryza sativa Indica Group]
gi|222628622|gb|EEE60754.1| hypothetical protein OsJ_14310 [Oryza sativa Japonica Group]
Length = 494
Score = 372 bits (954), Expect = e-100, Method: Compositional matrix adjust.
Identities = 171/283 (60%), Positives = 222/283 (78%), Gaps = 1/283 (0%)
Query: 53 KQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCA 112
+ HD R GR++A+ D+ LGG G P+ TGLY+T++G+GTPT YYVQVDTGSD+LWVNC
Sbjct: 59 RAHDGSRRGRLLAAADIPLGGLGLPTDTGLYYTEIGIGTPTKRYYVQVDTGSDILWVNCI 118
Query: 113 GCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYG 172
C RCP KS LG++LTL+DP SST +++C FC TY P C+ + CEY VTYG
Sbjct: 119 SCDRCPRKSGLGLELTLYDPKDSSTGSKVSCDQGFCAATYGGLLPGCTTSLPCEYSVTYG 178
Query: 173 DGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQ 232
DGSST+GYFV D++Q +Q SG+ +T P NS+V FGCG++Q GDLGSS + A+DGI+GFGQ
Sbjct: 179 DGSSTTGYFVSDLLQFDQVSGDGQTRPANSTVTFGCGSQQGGDLGSS-NQALDGIIGFGQ 237
Query: 233 ANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEE 292
+N+S+LSQL+AAG V+K FAHCLD + GGGIFAIG+VV PKVKTTP+VPNMPHYNV L+
Sbjct: 238 SNTSMLSQLSAAGKVKKIFAHCLDTINGGGIFAIGNVVQPKVKTTPLVPNMPHYNVNLKS 297
Query: 293 VEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVL 335
++VGG L LP+ + TG+++GTIIDSGTTL YLP ++Y ++
Sbjct: 298 IDVGGTALKLPSHMFDTGEKKGTIIDSGTTLTYLPEIVYKEIM 340
>gi|242079765|ref|XP_002444651.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
gi|241941001|gb|EES14146.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
Length = 493
Score = 369 bits (946), Expect = 1e-99, Method: Compositional matrix adjust.
Identities = 183/313 (58%), Positives = 226/313 (72%), Gaps = 4/313 (1%)
Query: 26 GVMGNFVFEVENKF---KAGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGL 82
G VF+V KF GG +SAL+ HD RHGR++A+ DL LGG G P+ TGL
Sbjct: 28 GATATGVFQVRRKFPVGVGGGAAGANISALRAHDGTRHGRLLATADLPLGGLGLPTDTGL 87
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y+T+V LGTP +YVQVDTGSD+LWVNC C +CP KS LG+ LTL+DP SST +
Sbjct: 88 YYTEVRLGTPPKRFYVQVDTGSDILWVNCITCDQCPHKSGLGLDLTLYDPKASSTGSTVM 147
Query: 143 CSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNS 202
C FC T+ R P CS V CEY VTYGDGSST G FV D +Q +Q +G+ +T P N+
Sbjct: 148 CDQGFCADTFGGRLPKCSANVPCEYSVTYGDGSSTVGSFVNDALQFDQVTGDGQTQPANA 207
Query: 203 SVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGG 262
SVIFGCG +Q GDLGSS+ A+DGILGFG+AN+S+LSQLA AG V+K FAHCLD +KGGG
Sbjct: 208 SVIFGCGAQQGGDLGSSS-QALDGILGFGEANTSMLSQLATAGKVKKIFAHCLDTIKGGG 266
Query: 263 IFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTT 322
IFAIGDVV PKVKTTP+V + PHYNV L+ ++VGG L+LP + G++RGTIIDSGTT
Sbjct: 267 IFAIGDVVQPKVKTTPLVADKPHYNVNLKTIDVGGTTLELPADIFKPGEKRGTIIDSGTT 326
Query: 323 LAYLPPMLYDLVL 335
L YLP +++ V+
Sbjct: 327 LTYLPELVFKKVM 339
>gi|297828736|ref|XP_002882250.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297328090|gb|EFH58509.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 488
Score = 365 bits (938), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 173/309 (55%), Positives = 230/309 (74%), Gaps = 6/309 (1%)
Query: 30 NFVFEVENKFKAGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGL 89
N VF+V +KF G+RE+ L AL+ HD RH R++++IDL LGG+ P + GLYF K+GL
Sbjct: 34 NLVFQVRSKF--AGKREKDLGALRAHDVHRHSRLLSAIDLPLGGDSQPESIGLYFAKIGL 91
Query: 90 GTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCR 149
GTP+ +++VQVDTGSD+LWVNCAGC RCP KSDL ++LT +D SST+ ++CSDNFC
Sbjct: 92 GTPSRDFHVQVDTGSDILWVNCAGCIRCPRKSDL-VELTPYDADASSTAKSVSCSDNFC- 149
Query: 150 TTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCG 209
+Y N+ C G C+YV+ YGDGSST+GY VRD++ L+ +GN +T N ++IFGCG
Sbjct: 150 -SYVNQRSECHSGSTCQYVILYGDGSSTNGYLVRDVVHLDLVTGNRQTGSTNGTIIFGCG 208
Query: 210 NRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDV 269
++QSG LG S AAVDGI+GFGQ+NSS +SQLA+ G V++ FAHCLD GGGIFAIG+V
Sbjct: 209 SKQSGQLGES-QAAVDGIMGFGQSNSSFISQLASQGKVKRSFAHCLDNNNGGGIFAIGEV 267
Query: 270 VSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPM 329
VSPKVKTTPM+ HY+V L +EVG + L L + +GD++G IIDSGTTL YLP
Sbjct: 268 VSPKVKTTPMLSKSAHYSVNLNAIEVGNSVLQLSSDAFDSGDDKGVIIDSGTTLVYLPDA 327
Query: 330 LYDLVLSQF 338
+Y+ +++Q
Sbjct: 328 VYNPLMNQI 336
>gi|297805182|ref|XP_002870475.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316311|gb|EFH46734.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 480
Score = 365 bits (937), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 178/310 (57%), Positives = 228/310 (73%), Gaps = 6/310 (1%)
Query: 29 GNFVFEVENKFKAGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVG 88
GNFVF V +KF +E+ LS LK HD+ RH RM+A+IDL LGG+ + GLYFTK+
Sbjct: 26 GNFVFNVTHKFAG---KEKQLSELKSHDSFRHARMLANIDLPLGGDSRADSIGLYFTKIK 82
Query: 89 LGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFC 148
LG+P EYYVQVDTGSD+LWVNCA C +CP K+DLGI L+L+D SSTS + C D FC
Sbjct: 83 LGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLSLYDSKASSTSKNVGCEDAFC 142
Query: 149 RTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGC 208
++ + +C C Y V YGDGS++ G FV+D I L+Q +GNL+TAPL V+FGC
Sbjct: 143 --SFIMQSETCGAKKPCSYHVVYGDGSTSDGDFVKDNITLDQVTGNLRTAPLAQEVVFGC 200
Query: 209 GNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGD 268
G QSG LG T++AVDGI+GFGQ+N+S++SQLAA G+V++ F+HCLD + GGGIFAIG+
Sbjct: 201 GKNQSGQLGQ-TESAVDGIMGFGQSNTSVISQLAAGGSVKRIFSHCLDNMNGGGIFAIGE 259
Query: 269 VVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPP 328
V SP VKTTP+VPN HYNVIL+ ++V G P+DLP SL T + GTIIDSGTTLAYLP
Sbjct: 260 VESPVVKTTPLVPNQVHYNVILKGMDVDGEPIDLPPSLASTNGDGGTIIDSGTTLAYLPQ 319
Query: 329 MLYDLVLSQF 338
LY+ ++ +
Sbjct: 320 NLYNSLIEKI 329
>gi|30692930|ref|NP_198475.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|66792626|gb|AAY56415.1| At5g36260 [Arabidopsis thaliana]
gi|332006680|gb|AED94063.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 482
Score = 365 bits (937), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 176/310 (56%), Positives = 228/310 (73%), Gaps = 6/310 (1%)
Query: 29 GNFVFEVENKFKAGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVG 88
GNFVF V +KF +E+ LS LK HD+ RH RM+A+IDL LGG+ + GLYFTK+
Sbjct: 27 GNFVFNVTHKFAG---KEKQLSELKSHDSFRHARMLANIDLPLGGDSRADSIGLYFTKIK 83
Query: 89 LGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFC 148
LG+P EYYVQVDTGSD+LWVNCA C +CP K+DLGI L+L+D SSTS + C D+FC
Sbjct: 84 LGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLSLYDSKTSSTSKNVGCEDDFC 143
Query: 149 RTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGC 208
++ + +C C Y V YGDGS++ G F++D I L Q +GNL+TAPL V+FGC
Sbjct: 144 --SFIMQSETCGAKKPCSYHVVYGDGSTSDGDFIKDNITLEQVTGNLRTAPLAQEVVFGC 201
Query: 209 GNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGD 268
G QSG LG TD+AVDGI+GFGQ+N+S++SQLAA G+ ++ F+HCLD + GGGIFA+G+
Sbjct: 202 GKNQSGQLGQ-TDSAVDGIMGFGQSNTSIISQLAAGGSTKRIFSHCLDNMNGGGIFAVGE 260
Query: 269 VVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPP 328
V SP VKTTP+VPN HYNVIL+ ++V G+P+DLP SL T + GTIIDSGTTLAYLP
Sbjct: 261 VESPVVKTTPIVPNQVHYNVILKGMDVDGDPIDLPPSLASTNGDGGTIIDSGTTLAYLPQ 320
Query: 329 MLYDLVLSQF 338
LY+ ++ +
Sbjct: 321 NLYNSLIEKI 330
>gi|9759039|dbj|BAB09366.1| aspartyl protease-like [Arabidopsis thaliana]
Length = 478
Score = 364 bits (935), Expect = 3e-98, Method: Compositional matrix adjust.
Identities = 176/310 (56%), Positives = 228/310 (73%), Gaps = 6/310 (1%)
Query: 29 GNFVFEVENKFKAGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVG 88
GNFVF V +KF +E+ LS LK HD+ RH RM+A+IDL LGG+ + GLYFTK+
Sbjct: 23 GNFVFNVTHKFAG---KEKQLSELKSHDSFRHARMLANIDLPLGGDSRADSIGLYFTKIK 79
Query: 89 LGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFC 148
LG+P EYYVQVDTGSD+LWVNCA C +CP K+DLGI L+L+D SSTS + C D+FC
Sbjct: 80 LGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLSLYDSKTSSTSKNVGCEDDFC 139
Query: 149 RTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGC 208
++ + +C C Y V YGDGS++ G F++D I L Q +GNL+TAPL V+FGC
Sbjct: 140 --SFIMQSETCGAKKPCSYHVVYGDGSTSDGDFIKDNITLEQVTGNLRTAPLAQEVVFGC 197
Query: 209 GNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGD 268
G QSG LG TD+AVDGI+GFGQ+N+S++SQLAA G+ ++ F+HCLD + GGGIFA+G+
Sbjct: 198 GKNQSGQLGQ-TDSAVDGIMGFGQSNTSIISQLAAGGSTKRIFSHCLDNMNGGGIFAVGE 256
Query: 269 VVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPP 328
V SP VKTTP+VPN HYNVIL+ ++V G+P+DLP SL T + GTIIDSGTTLAYLP
Sbjct: 257 VESPVVKTTPIVPNQVHYNVILKGMDVDGDPIDLPPSLASTNGDGGTIIDSGTTLAYLPQ 316
Query: 329 MLYDLVLSQF 338
LY+ ++ +
Sbjct: 317 NLYNSLIEKI 326
>gi|297840891|ref|XP_002888327.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297334168|gb|EFH64586.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 473
Score = 363 bits (932), Expect = 7e-98, Method: Compositional matrix adjust.
Identities = 178/328 (54%), Positives = 235/328 (71%), Gaps = 15/328 (4%)
Query: 5 RLLALVVVTVAVVHQWAVGGGGVMGNFVFEVENKFKAGGERERTLSALKQHDTRRHGRMM 64
R L +VV +V+++A GNFVF+V++KF +E+ L K HDTRRH RM+
Sbjct: 5 RKLCIVVAVFVIVNEFA------SGNFVFKVQHKFAG---KEKKLEHFKSHDTRRHSRML 55
Query: 65 ASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLG 124
ASIDL LGG+ + GLYFTK+ LG+P EY+VQVDTGSD+LWVNC C CP+K++L
Sbjct: 56 ASIDLPLGGDSRVDSVGLYFTKIKLGSPPKEYHVQVDTGSDILWVNCKPCPECPSKTNLN 115
Query: 125 IKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRD 184
L+LFD + SSTS ++ C D+FC ++ ++ SC P V C Y + Y D S++ G F+RD
Sbjct: 116 FHLSLFDVNASSTSKKVGCDDDFC--SFISQSDSCQPAVGCSYHIVYADESTSEGNFIRD 173
Query: 185 IIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAA 244
+ L Q +G+L+T PL V+FGCG+ QSG LG S D+AVDG++GFGQ+N+S+LSQLAA
Sbjct: 174 KLTLEQVTGDLQTGPLGQEVVFGCGSDQSGQLGKS-DSAVDGVMGFGQSNTSVLSQLAAT 232
Query: 245 GNVRKEFAHCLDVVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPT 304
G+ ++ F+HCLD VKGGGIFA+G V SPKVKTTPMVPN HYNV+L ++V G LDLP
Sbjct: 233 GDAKRVFSHCLDNVKGGGIFAVGVVDSPKVKTTPMVPNQMHYNVMLMGMDVDGTALDLPP 292
Query: 305 SLLGTGDERGTIIDSGTTLAYLPPMLYD 332
S++ G GTI+DSGTTLAY P +LYD
Sbjct: 293 SIMRNG---GTIVDSGTTLAYFPKVLYD 317
>gi|15232960|ref|NP_186923.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|6728988|gb|AAF26986.1|AC018363_31 putative aspartyl protease [Arabidopsis thaliana]
gi|21593593|gb|AAM65560.1| putative aspartyl protease [Arabidopsis thaliana]
gi|332640332|gb|AEE73853.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 488
Score = 362 bits (928), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 171/309 (55%), Positives = 231/309 (74%), Gaps = 6/309 (1%)
Query: 30 NFVFEVENKFKAGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGL 89
N VFEV +KF G+R + L AL+ HD RH R++++ID+ LGG+ P + GLYF K+GL
Sbjct: 34 NLVFEVRSKF--AGKRVKDLGALRAHDVHRHSRLLSAIDIPLGGDSQPESIGLYFAKIGL 91
Query: 90 GTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCR 149
GTP+ +++VQVDTGSD+LWVNCAGC RCP KSDL ++LT +D SST+ ++CSDNFC
Sbjct: 92 GTPSRDFHVQVDTGSDILWVNCAGCIRCPRKSDL-VELTPYDVDASSTAKSVSCSDNFC- 149
Query: 150 TTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCG 209
+Y N+ C G C+YV+ YGDGSST+GY V+D++ L+ +GN +T N ++IFGCG
Sbjct: 150 -SYVNQRSECHSGSTCQYVIMYGDGSSTNGYLVKDVVHLDLVTGNRQTGSTNGTIIFGCG 208
Query: 210 NRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDV 269
++QSG LG S AAVDGI+GFGQ+NSS +SQLA+ G V++ FAHCLD GGGIFAIG+V
Sbjct: 209 SKQSGQLGES-QAAVDGIMGFGQSNSSFISQLASQGKVKRSFAHCLDNNNGGGIFAIGEV 267
Query: 270 VSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPM 329
VSPKVKTTPM+ HY+V L +EVG + L+L ++ +GD++G IIDSGTTL YLP
Sbjct: 268 VSPKVKTTPMLSKSAHYSVNLNAIEVGNSVLELSSNAFDSGDDKGVIIDSGTTLVYLPDA 327
Query: 330 LYDLVLSQF 338
+Y+ +L++
Sbjct: 328 VYNPLLNEI 336
>gi|293332531|ref|NP_001169558.1| uncharacterized protein LOC100383437 precursor [Zea mays]
gi|224030089|gb|ACN34120.1| unknown [Zea mays]
gi|413925069|gb|AFW65001.1| hypothetical protein ZEAMMB73_160528 [Zea mays]
Length = 491
Score = 358 bits (918), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 175/306 (57%), Positives = 221/306 (72%), Gaps = 3/306 (0%)
Query: 32 VFEVENKFKAGGERER--TLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGL 89
+F+V KF AG +SAL+ HD RHGR++A+ DL LGG G P+ TGLY+T++ L
Sbjct: 33 IFQVRRKFTAGVGGGAGANISALRAHDGTRHGRLLAAADLPLGGLGLPTDTGLYYTEIKL 92
Query: 90 GTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCR 149
GTP YYVQVDTGSD+LWVNC C +CP KS LG+ LTL+DP SST + C FC
Sbjct: 93 GTPPKHYYVQVDTGSDILWVNCITCEQCPHKSGLGLDLTLYDPKASSTGSMVMCDQAFCA 152
Query: 150 TTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCG 209
T+ + P C V CEY VTYGDGSST G FV D +Q +Q + + +T P N+SVIFGCG
Sbjct: 153 ATFGGKLPKCGANVPCEYSVTYGDGSSTIGSFVTDALQFDQVTRDGQTQPANASVIFGCG 212
Query: 210 NRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDV 269
+Q GDLGSS + A+DGILGFG+AN+S+LSQL AG V+K FAHCLD +KGGGIF+IGDV
Sbjct: 213 AQQGGDLGSS-NQALDGILGFGEANTSMLSQLTTAGKVKKIFAHCLDTIKGGGIFSIGDV 271
Query: 270 VSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPM 329
V PKVKTTP+V + PHYNV L+ ++VGG L LP + G+++GTIIDSGTTL YLP +
Sbjct: 272 VQPKVKTTPLVADKPHYNVNLKTIDVGGTTLQLPAHIFEPGEKKGTIIDSGTTLTYLPEL 331
Query: 330 LYDLVL 335
++ V+
Sbjct: 332 VFKEVM 337
>gi|15217887|ref|NP_176703.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
gi|118572746|sp|Q9S9K4.2|ASPL2_ARATH RecName: Full=Aspartic proteinase-like protein 2; Flags: Precursor
gi|332196226|gb|AEE34347.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
Length = 475
Score = 355 bits (911), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 168/304 (55%), Positives = 225/304 (74%), Gaps = 9/304 (2%)
Query: 29 GNFVFEVENKFKAGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVG 88
NFVF+ ++KF +++ L K HDTRRH RM+ASIDL LGG+ + GLYFTK+
Sbjct: 23 ANFVFKAQHKFAG---KKKNLEHFKSHDTRRHSRMLASIDLPLGGDSRVDSVGLYFTKIK 79
Query: 89 LGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFC 148
LG+P EY+VQVDTGSD+LW+NC C +CPTK++L +L+LFD + SSTS ++ C D+FC
Sbjct: 80 LGSPPKEYHVQVDTGSDILWINCKPCPKCPTKTNLNFRLSLFDMNASSTSKKVGCDDDFC 139
Query: 149 RTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGC 208
++ ++ SC P + C Y + Y D S++ G F+RD++ L Q +G+LKT PL V+FGC
Sbjct: 140 --SFISQSDSCQPALGCSYHIVYADESTSDGKFIRDMLTLEQVTGDLKTGPLGQEVVFGC 197
Query: 209 GNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGD 268
G+ QSG LG+ D+AVDG++GFGQ+N+S+LSQLAA G+ ++ F+HCLD VKGGGIFA+G
Sbjct: 198 GSDQSGQLGNG-DSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCLDNVKGGGIFAVGV 256
Query: 269 VVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPP 328
V SPKVKTTPMVPN HYNV+L ++V G LDLP S++ G GTI+DSGTTLAY P
Sbjct: 257 VDSPKVKTTPMVPNQMHYNVMLMGMDVDGTSLDLPRSIVRNG---GTIVDSGTTLAYFPK 313
Query: 329 MLYD 332
+LYD
Sbjct: 314 VLYD 317
>gi|359478045|ref|XP_002267046.2| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
Length = 502
Score = 355 bits (910), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 170/308 (55%), Positives = 222/308 (72%), Gaps = 3/308 (0%)
Query: 38 KFKAGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYY 97
K+K G++ R+L+ALK HD R R++A +DL LGG G P A GLY+ K+G+GTP +YY
Sbjct: 54 KYKFAGQK-RSLAALKAHDNSRQLRILAGVDLPLGGTGRPEAVGLYYAKIGIGTPARDYY 112
Query: 98 VQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYP 157
VQVDTGSD++WVNC C+ CP KS LG++LTL+D +S T ++C +FC
Sbjct: 113 VQVDTGSDIMWVNCIQCNECPKKSSLGMELTLYDIKESLTGKLVSCDQDFCYAINGGPPS 172
Query: 158 SCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLG 217
C + C Y Y DGSS+ GYFVRDI+Q +Q SG+L+T N SVIFGC QSGDL
Sbjct: 173 YCIANMSCSYTEIYADGSSSFGYFVRDIVQYDQVSGDLETTSANGSVIFGCSATQSGDL- 231
Query: 218 SSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSPKVKTT 277
S++ A+DGILGFG++N+S++SQLA++G VRK FAHCLD + GGGIFAIG +V PKV TT
Sbjct: 232 -SSEEALDGILGFGKSNTSMISQLASSGKVRKMFAHCLDGLNGGGIFAIGHIVQPKVNTT 290
Query: 278 PMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQ 337
P+VPN HYNV ++ VEVGG L+LPT + GD++GTIIDSGTTLAYLP ++YD +LS+
Sbjct: 291 PLVPNQTHYNVNMKAVEVGGYFLNLPTDVFDVGDKKGTIIDSGTTLAYLPEVVYDQLLSK 350
Query: 338 FRFWIASL 345
W + L
Sbjct: 351 IFSWQSDL 358
>gi|357142295|ref|XP_003572524.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 494
Score = 354 bits (909), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 183/322 (56%), Positives = 230/322 (71%), Gaps = 10/322 (3%)
Query: 21 AVGGGGVMGNFVFEVENKFKA----GGERERTLSALKQHDTRRHGRMMASIDLELGGNGH 76
A+G G VF+V F G E L+AL++HD RR ++ ++DL LGGNG
Sbjct: 26 ALGPGRAAATGVFQVRRNFPRHQGNGPGGEEHLAALRKHDGRR---LLTAVDLPLGGNGI 82
Query: 77 PSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSS 136
P+ TGLYFT++G+GTP+ YYVQVDTGSD+LWVNC C CP KS LGI LTL+DP+ S+
Sbjct: 83 PTDTGLYFTQIGIGTPSKGYYVQVDTGSDILWVNCISCDSCPRKSGLGIDLTLYDPTASA 142
Query: 137 TSGEIACSDNFCRTTYNNRYP-SCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNL 195
+S + C FC T N P SC+ C+Y +TYGDGSST+G+FV D +Q +Q SG+
Sbjct: 143 SSKTVTCGQEFCATATNGGVPPSCAANSPCQYSITYGDGSSTTGFFVADFLQYDQVSGDG 202
Query: 196 KTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL 255
+T N+SV FGCG + G LGSS + A+DGILGFGQANSS+LSQL +AG V K F+HCL
Sbjct: 203 QTNLANASVTFGCGAKIGGALGSS-NVALDGILGFGQANSSMLSQLTSAGKVTKIFSHCL 261
Query: 256 DVVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGT-GDERG 314
D V GGGIFAIG+VV PKVKTTP+VP MPHYNV+L+ ++VGG+ L LPT++ G RG
Sbjct: 262 DTVNGGGIFAIGNVVQPKVKTTPLVPGMPHYNVVLKTIDVGGSTLQLPTNIFDIGGGSRG 321
Query: 315 TIIDSGTTLAYLPPMLYDLVLS 336
TIIDSGTTLAYLP ++Y VLS
Sbjct: 322 TIIDSGTTLAYLPEVVYKAVLS 343
>gi|296089645|emb|CBI39464.3| unnamed protein product [Vitis vinifera]
Length = 477
Score = 354 bits (908), Expect = 4e-95, Method: Compositional matrix adjust.
Identities = 170/308 (55%), Positives = 222/308 (72%), Gaps = 3/308 (0%)
Query: 38 KFKAGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYY 97
K+K G++ R+L+ALK HD R R++A +DL LGG G P A GLY+ K+G+GTP +YY
Sbjct: 54 KYKFAGQK-RSLAALKAHDNSRQLRILAGVDLPLGGTGRPEAVGLYYAKIGIGTPARDYY 112
Query: 98 VQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYP 157
VQVDTGSD++WVNC C+ CP KS LG++LTL+D +S T ++C +FC
Sbjct: 113 VQVDTGSDIMWVNCIQCNECPKKSSLGMELTLYDIKESLTGKLVSCDQDFCYAINGGPPS 172
Query: 158 SCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLG 217
C + C Y Y DGSS+ GYFVRDI+Q +Q SG+L+T N SVIFGC QSGDL
Sbjct: 173 YCIANMSCSYTEIYADGSSSFGYFVRDIVQYDQVSGDLETTSANGSVIFGCSATQSGDL- 231
Query: 218 SSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSPKVKTT 277
S++ A+DGILGFG++N+S++SQLA++G VRK FAHCLD + GGGIFAIG +V PKV TT
Sbjct: 232 -SSEEALDGILGFGKSNTSMISQLASSGKVRKMFAHCLDGLNGGGIFAIGHIVQPKVNTT 290
Query: 278 PMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQ 337
P+VPN HYNV ++ VEVGG L+LPT + GD++GTIIDSGTTLAYLP ++YD +LS+
Sbjct: 291 PLVPNQTHYNVNMKAVEVGGYFLNLPTDVFDVGDKKGTIIDSGTTLAYLPEVVYDQLLSK 350
Query: 338 FRFWIASL 345
W + L
Sbjct: 351 IFSWQSDL 358
>gi|4646203|gb|AAD26876.1|AC007230_10 Belongs to PF|00026 Eukaryotic aspartyl protease family
[Arabidopsis thaliana]
Length = 449
Score = 354 bits (908), Expect = 4e-95, Method: Compositional matrix adjust.
Identities = 168/308 (54%), Positives = 227/308 (73%), Gaps = 9/308 (2%)
Query: 29 GNFVFEVENKFKAGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVG 88
NFVF+ ++KF +++ L K HDTRRH RM+ASIDL LGG+ + GLYFTK+
Sbjct: 23 ANFVFKAQHKFAG---KKKNLEHFKSHDTRRHSRMLASIDLPLGGDSRVDSVGLYFTKIK 79
Query: 89 LGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFC 148
LG+P EY+VQVDTGSD+LW+NC C +CPTK++L +L+LFD + SSTS ++ C D+FC
Sbjct: 80 LGSPPKEYHVQVDTGSDILWINCKPCPKCPTKTNLNFRLSLFDMNASSTSKKVGCDDDFC 139
Query: 149 RTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGC 208
++ ++ SC P + C Y + Y D S++ G F+RD++ L Q +G+LKT PL V+FGC
Sbjct: 140 --SFISQSDSCQPALGCSYHIVYADESTSDGKFIRDMLTLEQVTGDLKTGPLGQEVVFGC 197
Query: 209 GNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGD 268
G+ QSG LG+ D+AVDG++GFGQ+N+S+LSQLAA G+ ++ F+HCLD VKGGGIFA+G
Sbjct: 198 GSDQSGQLGNG-DSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCLDNVKGGGIFAVGV 256
Query: 269 VVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPP 328
V SPKVKTTPMVPN HYNV+L ++V G LDLP S++ G GTI+DSGTTLAY P
Sbjct: 257 VDSPKVKTTPMVPNQMHYNVMLMGMDVDGTSLDLPRSIVRNG---GTIVDSGTTLAYFPK 313
Query: 329 MLYDLVLS 336
+LYD ++
Sbjct: 314 VLYDSLIE 321
>gi|224128838|ref|XP_002320434.1| predicted protein [Populus trichocarpa]
gi|222861207|gb|EEE98749.1| predicted protein [Populus trichocarpa]
Length = 485
Score = 353 bits (907), Expect = 5e-95, Method: Compositional matrix adjust.
Identities = 171/309 (55%), Positives = 216/309 (69%), Gaps = 3/309 (0%)
Query: 30 NFVFEVENKFKAGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGL 89
N VF V K+K G +R+LS LK HD +R R++A +DL LGG G P GLY+ K+G+
Sbjct: 28 NGVFSV--KYKYAG-LQRSLSDLKAHDDQRQLRILAGVDLPLGGIGRPDILGLYYAKIGI 84
Query: 90 GTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCR 149
GTPT +YYVQVDTGSD++WVNC C CP S LGI LTL++ ++S T + C FC
Sbjct: 85 GTPTKDYYVQVDTGSDIMWVNCIQCRECPKTSSLGIDLTLYNINESDTGKLVPCDQEFCY 144
Query: 150 TTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCG 209
+ P C+ + C Y+ YGDGSST+GYFV+D++Q + SG+LKT N SVIFGCG
Sbjct: 145 EINGGQLPGCTANMSCPYLEIYGDGSSTAGYFVKDVVQYARVSGDLKTTAANGSVIFGCG 204
Query: 210 NRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDV 269
RQSGDLGSS + A+DGILGFG++NSS++SQLA G V+K FAHCLD GGGIF IG V
Sbjct: 205 ARQSGDLGSSNEEALDGILGFGKSNSSMISQLAVTGKVKKIFAHCLDGTNGGGIFVIGHV 264
Query: 270 VSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPM 329
V PKV TP++PN PHYNV + V+VG L LPT + GD +G IIDSGTTLAYLP M
Sbjct: 265 VQPKVNMTPLIPNQPHYNVNMTAVQVGHEFLSLPTDVFEAGDRKGAIIDSGTTLAYLPEM 324
Query: 330 LYDLVLSQF 338
+Y ++S+
Sbjct: 325 VYKPLVSKI 333
>gi|414881575|tpg|DAA58706.1| TPA: hypothetical protein ZEAMMB73_168363 [Zea mays]
Length = 506
Score = 348 bits (894), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 173/330 (52%), Positives = 228/330 (69%), Gaps = 23/330 (6%)
Query: 32 VFEVENKFKAG--GERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGL 89
+F V K AG G+ +SAL+ HD RRHGR++A+ DL LGG G P+ TGLYFT++ L
Sbjct: 34 IFRVRRKLPAGVGGDTGANISALRAHDGRRHGRLLAAADLPLGGLGLPTDTGLYFTEIKL 93
Query: 90 GTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCR 149
GTP YYVQVDTGSD+LWVNC CS+CP KS LG+ LT +DP SS+ ++C FC
Sbjct: 94 GTPPKRYYVQVDTGSDILWVNCISCSKCPRKSGLGLDLTFYDPKASSSGSTVSCDQGFCA 153
Query: 150 TTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCG 209
TY + P C+ V CEY V YGDGSST+G+F+ D +Q +Q +G+ +T P N+++ FGCG
Sbjct: 154 ATYGGKLPGCTANVPCEYSVMYGDGSSTTGFFITDALQFDQVTGDGQTQPGNATITFGCG 213
Query: 210 NRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDV 269
+Q GDLG+S + A+DGILGFGQAN+S+LSQLAAAG +K FAHCLD +KGGGIFAIG+V
Sbjct: 214 AQQGGDLGNS-NQALDGILGFGQANTSMLSQLAAAGKAKKIFAHCLDTIKGGGIFAIGNV 272
Query: 270 VSPK----------VKTTPM------VPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDER 313
V PK + P+ + + PHYNV L+ ++VGG L LP + TG+++
Sbjct: 273 VQPKCYFVFFFAHGLLNIPLFLLVMILLSRPHYNVNLKSIDVGGTTLQLPAHVFETGEKK 332
Query: 314 GTIIDSGTTLAYLPPMLY----DLVLSQFR 339
GTIIDSGTTL YLP +++ D+V S+ R
Sbjct: 333 GTIIDSGTTLTYLPELVFKQVMDVVFSKHR 362
>gi|242065058|ref|XP_002453818.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
gi|241933649|gb|EES06794.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
Length = 490
Score = 347 bits (890), Expect = 5e-93, Method: Compositional matrix adjust.
Identities = 175/310 (56%), Positives = 223/310 (71%), Gaps = 8/310 (2%)
Query: 32 VFEVENKFK---AGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVG 88
VF+V KF GG+ L+AL++HD RHGR++ ++DL LGG G P+ATGLY+T++
Sbjct: 31 VFQVRRKFPRHGGGGDVAEHLAALRRHDVGRHGRLLGAVDLPLGGVGLPTATGLYYTQIE 90
Query: 89 LGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFC 148
+G+P+ YYVQVDTGSD+LWVNC C CPT S LGI+LT +DP+ S T+ + C FC
Sbjct: 91 IGSPSKGYYVQVDTGSDILWVNCIRCDGCPTTSGLGIELTQYDPAGSGTT--VGCDQEFC 148
Query: 149 RTTYNNRYPSCSPGVR--CEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIF 206
N P P C++ + YGDGSST+G++V D +Q NQ SGN +T P N+S+ F
Sbjct: 149 VANSPNGLPPACPSTSSPCQFRIAYGDGSSTTGFYVSDSVQYNQVSGNGQTTPSNASITF 208
Query: 207 GCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAI 266
GCG + GDLGSS+ A +DGILGFGQA+SS+LSQLAAA VRK FAHCLD V GGGIFAI
Sbjct: 209 GCGAQLGGDLGSSSQA-LDGILGFGQADSSMLSQLAAARKVRKIFAHCLDTVHGGGIFAI 267
Query: 267 GDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYL 326
G+VV PKVKTTP+V N+ HYNV L+ + VGG L LP+S +GD +GTIIDSGTTLAYL
Sbjct: 268 GNVVQPKVKTTPLVQNVTHYNVNLQGISVGGATLQLPSSTFDSGDSKGTIIDSGTTLAYL 327
Query: 327 PPMLYDLVLS 336
P +Y +L+
Sbjct: 328 PREVYRTLLT 337
>gi|449446233|ref|XP_004140876.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 498
Score = 347 bits (890), Expect = 5e-93, Method: Compositional matrix adjust.
Identities = 167/317 (52%), Positives = 222/317 (70%), Gaps = 4/317 (1%)
Query: 23 GGGGVMG-NFVFEVENKFKAGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATG 81
GGGGV N +F V+ K+ RER+LS LK HD R R +A ID+ LGG+G P A G
Sbjct: 29 GGGGVYADNGIFSVKYKYAG---RERSLSTLKAHDISRQLRFLAGIDIPLGGSGRPDAVG 85
Query: 82 LYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEI 141
LY+ K+G+GTP+ +YYVQVDTGSD++WVNC C CP S LG++LT +D +S+T +
Sbjct: 86 LYYAKIGIGTPSKDYYVQVDTGSDIVWVNCIQCRECPRTSSLGMELTPYDLEESTTGKLV 145
Query: 142 ACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLN 201
+C + FC C+ + C Y+ YGDGSST+GYFV+D +Q N+ SG+L+T N
Sbjct: 146 SCDEQFCLEVNGGPLSGCTTNMSCPYLQIYGDGSSTAGYFVKDYVQYNRVSGDLETTAAN 205
Query: 202 SSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGG 261
S+ FGCG RQSGDLGSS + A+DGILGFG++NSS++SQLA+ V+K FAHCLD GG
Sbjct: 206 GSIKFGCGARQSGDLGSSGEEALDGILGFGKSNSSIISQLASTRKVKKMFAHCLDGTNGG 265
Query: 262 GIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGT 321
GIFA+G VV PKV TP+VPN PHYNV + V+VG L++ + GD +GTIIDSGT
Sbjct: 266 GIFAMGHVVQPKVNMTPLVPNQPHYNVNMTGVQVGHIILNISADVFEAGDRKGTIIDSGT 325
Query: 322 TLAYLPPMLYDLVLSQF 338
TLAYLP ++Y+ ++++
Sbjct: 326 TLAYLPELIYEPLVAKI 342
>gi|115457778|ref|NP_001052489.1| Os04g0337000 [Oryza sativa Japonica Group]
gi|113564060|dbj|BAF14403.1| Os04g0337000, partial [Oryza sativa Japonica Group]
Length = 321
Score = 347 bits (890), Expect = 5e-93, Method: Compositional matrix adjust.
Identities = 159/258 (61%), Positives = 205/258 (79%), Gaps = 1/258 (0%)
Query: 78 SATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSST 137
+AT LY+T++G+GTPT YYVQVDTGSD+LWVNC C RCP KS LG++LTL+DP SST
Sbjct: 28 TATRLYYTEIGIGTPTKRYYVQVDTGSDILWVNCISCDRCPRKSGLGLELTLYDPKDSST 87
Query: 138 SGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKT 197
+++C FC TY P C+ + CEY VTYGDGSST+GYFV D++Q +Q SG+ +T
Sbjct: 88 GSKVSCDQGFCAATYGGLLPGCTTSLPCEYSVTYGDGSSTTGYFVSDLLQFDQVSGDGQT 147
Query: 198 APLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDV 257
P NS+V FGCG++Q GDLGSS + A+DGI+GFGQ+N+S+LSQL+AAG V+K FAHCLD
Sbjct: 148 RPANSTVTFGCGSQQGGDLGSS-NQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCLDT 206
Query: 258 VKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTII 317
+ GGGIFAIG+VV PKVKTTP+VPNMPHYNV L+ ++VGG L LP+ + TG+++GTII
Sbjct: 207 INGGGIFAIGNVVQPKVKTTPLVPNMPHYNVNLKSIDVGGTALKLPSHMFDTGEKKGTII 266
Query: 318 DSGTTLAYLPPMLYDLVL 335
DSGTTL YLP ++Y ++
Sbjct: 267 DSGTTLTYLPEIVYKEIM 284
>gi|449518783|ref|XP_004166415.1| PREDICTED: aspartic proteinase-like protein 2-like, partial
[Cucumis sativus]
Length = 420
Score = 346 bits (887), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 167/317 (52%), Positives = 222/317 (70%), Gaps = 4/317 (1%)
Query: 23 GGGGVMG-NFVFEVENKFKAGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATG 81
GGGGV N VF V+ K+ RER+LS LK HD R R +A +D+ LGG+G P A G
Sbjct: 29 GGGGVYADNGVFSVKYKYAG---RERSLSTLKAHDISRQLRFLAGVDIPLGGSGRPDAVG 85
Query: 82 LYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEI 141
LY+ K+G+GTP+ +YYVQVDTGSD++WVNC C CP S LG++LT +D +S+T +
Sbjct: 86 LYYAKIGIGTPSKDYYVQVDTGSDIVWVNCIQCRECPRTSSLGMELTPYDLEESTTGKLV 145
Query: 142 ACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLN 201
+C + FC C+ + C Y+ YGDGSST+GYFV+D +Q N+ SG+L+T N
Sbjct: 146 SCDEQFCLEVNGGPLSGCTTNMSCPYLQIYGDGSSTAGYFVKDYVQYNRVSGDLETTAAN 205
Query: 202 SSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGG 261
S+ FGCG RQSGDLGSS + A+DGILGFG++NSS++SQLA+ V+K FAHCLD GG
Sbjct: 206 GSIKFGCGARQSGDLGSSGEEALDGILGFGKSNSSIISQLASTRKVKKMFAHCLDGTNGG 265
Query: 262 GIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGT 321
GIFA+G VV PKV TP+VPN PHYNV + V+VG L++ + GD +GTIIDSGT
Sbjct: 266 GIFAMGHVVQPKVNMTPLVPNQPHYNVNMTGVQVGHIILNISADVFEAGDRKGTIIDSGT 325
Query: 322 TLAYLPPMLYDLVLSQF 338
TLAYLP ++Y+ ++++
Sbjct: 326 TLAYLPELIYEPLVAKI 342
>gi|226492633|ref|NP_001149953.1| LOC100283580 precursor [Zea mays]
gi|195635701|gb|ACG37319.1| pepsin A [Zea mays]
Length = 491
Score = 345 bits (886), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 177/328 (53%), Positives = 230/328 (70%), Gaps = 7/328 (2%)
Query: 13 TVAVVHQWAVGGGGVMGNFVFEVENKFKAGGER--ERTLSALKQHDTRRHGRMMASIDLE 70
+V +V +A+ G VF+V KF G R L+AL++HD RHGR++ ++DL
Sbjct: 12 SVLLVLLFALSVGCASATGVFQVRRKFPRHGGRGVAEHLAALRRHDANRHGRLLGAVDLA 71
Query: 71 LGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLF 130
LGG G P+ TGLY+T++ +G+P YYVQVDTGSD+LWVNC C CPT+S LGI+LT +
Sbjct: 72 LGGVGLPTDTGLYYTRIEIGSPPKGYYVQVDTGSDILWVNCIRCDGCPTRSGLGIELTQY 131
Query: 131 DPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVR--CEYVVTYGDGSSTSGYFVRDIIQL 188
DP+ S T+ + C FC P P C++ +TYGDGS+T+G++V D +Q
Sbjct: 132 DPAGSGTT--VGCEQEFCVANSAGGVPPTCPSTSSPCQFRITYGDGSTTTGFYVTDFVQY 189
Query: 189 NQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVR 248
NQ SGN +T N+S+ FGCG + GDLGSS + A+DGILGFGQ++SS+LSQLAAA VR
Sbjct: 190 NQVSGNGQTTTSNASITFGCGAQLGGDLGSS-NQALDGILGFGQSDSSMLSQLAAARRVR 248
Query: 249 KEFAHCLDVVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLG 308
K FAHCLD V+GGGIFAIG+VV PKVKTTP+VPN+ HYNV L+ + VGG L LPTS
Sbjct: 249 KIFAHCLDTVRGGGIFAIGNVVQPKVKTTPLVPNVTHYNVNLQGISVGGATLQLPTSTFD 308
Query: 309 TGDERGTIIDSGTTLAYLPPMLYDLVLS 336
+GD +GTIIDSGTTLAYLP +Y +L+
Sbjct: 309 SGDSKGTIIDSGTTLAYLPREVYRTLLA 336
>gi|223942467|gb|ACN25317.1| unknown [Zea mays]
gi|413936886|gb|AFW71437.1| pepsin A [Zea mays]
Length = 491
Score = 345 bits (886), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 177/328 (53%), Positives = 230/328 (70%), Gaps = 7/328 (2%)
Query: 13 TVAVVHQWAVGGGGVMGNFVFEVENKFKAGGER--ERTLSALKQHDTRRHGRMMASIDLE 70
+V +V +A+ G VF+V KF G R L+AL++HD RHGR++ ++DL
Sbjct: 12 SVLLVLLFALSVGCASATGVFQVRRKFPRHGGRGVAEHLAALRRHDANRHGRLLGAVDLA 71
Query: 71 LGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLF 130
LGG G P+ TGLY+T++ +G+P YYVQVDTGSD+LWVNC C CPT+S LGI+LT +
Sbjct: 72 LGGVGLPTDTGLYYTRIEIGSPPKGYYVQVDTGSDILWVNCIRCDGCPTRSGLGIELTQY 131
Query: 131 DPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVR--CEYVVTYGDGSSTSGYFVRDIIQL 188
DP+ S T+ + C FC P P C++ +TYGDGS+T+G++V D +Q
Sbjct: 132 DPAGSGTT--VGCEQEFCVANSAGGVPPTCPSTSSPCQFRITYGDGSTTTGFYVTDFVQY 189
Query: 189 NQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVR 248
NQ SGN +T N+S+ FGCG + GDLGSS + A+DGILGFGQ++SS+LSQLAAA VR
Sbjct: 190 NQVSGNGQTTTSNASITFGCGAQLGGDLGSS-NQALDGILGFGQSDSSMLSQLAAARRVR 248
Query: 249 KEFAHCLDVVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLG 308
K FAHCLD V+GGGIFAIG+VV PKVKTTP+VPN+ HYNV L+ + VGG L LPTS
Sbjct: 249 KIFAHCLDTVRGGGIFAIGNVVQPKVKTTPLVPNVTHYNVNLQGISVGGATLQLPTSTFD 308
Query: 309 TGDERGTIIDSGTTLAYLPPMLYDLVLS 336
+GD +GTIIDSGTTLAYLP +Y +L+
Sbjct: 309 SGDSKGTIIDSGTTLAYLPREVYRTLLA 336
>gi|255549236|ref|XP_002515672.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223545215|gb|EEF46724.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 492
Score = 345 bits (885), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 164/306 (53%), Positives = 219/306 (71%), Gaps = 3/306 (0%)
Query: 32 VFEVENKFKAGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGT 91
VF V K++ G+ +R+LS LK HD RR R++A +DL LGG+G P GLY+ KVG+GT
Sbjct: 38 VFSV--KYRYAGQ-QRSLSDLKAHDDRRQLRILAGVDLPLGGSGRPDTVGLYYAKVGIGT 94
Query: 92 PTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTT 151
P+ +YYVQVDTGSD++WVNC C CP S LG++LTL++ S + + C + FC
Sbjct: 95 PSKDYYVQVDTGSDIMWVNCIQCRECPRTSSLGMELTLYNIKDSVSGKLVPCDEEFCYEV 154
Query: 152 YNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNR 211
C+ + C Y+ YGDGSST+GYFV+D++Q ++ SG+L+T N SVIFGCG R
Sbjct: 155 NGGPLSGCTANMSCPYLEIYGDGSSTAGYFVKDVVQYDRVSGDLQTTSSNGSVIFGCGAR 214
Query: 212 QSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVS 271
QSGDLG +++ A+DGILGFG++NSS++SQLAA V+K FAHCLD + GGGIFAIG VV
Sbjct: 215 QSGDLGPTSEEALDGILGFGKSNSSMISQLAATRKVKKIFAHCLDGINGGGIFAIGHVVQ 274
Query: 272 PKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLY 331
PKV TP++PN PHYNV + V+VG + L LPT GD +G IIDSGTTLAYLP ++Y
Sbjct: 275 PKVNMTPLIPNQPHYNVNMTAVQVGEDFLHLPTEEFEAGDRKGAIIDSGTTLAYLPEIVY 334
Query: 332 DLVLSQ 337
+ ++S+
Sbjct: 335 EPLVSK 340
>gi|297848856|ref|XP_002892309.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297338151|gb|EFH68568.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 484
Score = 344 bits (882), Expect = 4e-92, Method: Compositional matrix adjust.
Identities = 164/307 (53%), Positives = 216/307 (70%), Gaps = 3/307 (0%)
Query: 32 VFEVENKFKAGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGT 91
VF V+ ++ + +LSALK+HD RR ++A IDL LGG G P GLY+ K+G+GT
Sbjct: 32 VFNVKYRYP---RLQGSLSALKEHDDRRQLTILAGIDLPLGGTGRPDIPGLYYAKIGIGT 88
Query: 92 PTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTT 151
P YYVQVDTGSD++WVNC C +CP +S LGI+LTL++ +S + ++C D+FC
Sbjct: 89 PAKSYYVQVDTGSDIMWVNCIQCKQCPRRSTLGIELTLYNIDESDSGKLVSCDDDFCYQI 148
Query: 152 YNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNR 211
C + C Y+ YGDGSST+GYFV+D++Q + +G+LKT N SVIFGCG R
Sbjct: 149 SGGPLSGCKANMSCPYLEIYGDGSSTAGYFVKDVVQYDSVAGDLKTQTANGSVIFGCGAR 208
Query: 212 QSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVS 271
QSGDL SS + A+DGILGFG+ANSS++SQLA++G V+K FAHCLD GGGIFAIG VV
Sbjct: 209 QSGDLDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCLDGRNGGGIFAIGRVVQ 268
Query: 272 PKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLY 331
PKV TP+VPN PHYNV + V+VG L++P L GD +G IIDSGTTLAYLP ++Y
Sbjct: 269 PKVNMTPLVPNQPHYNVNMTAVQVGQEFLNIPADLFQPGDRKGAIIDSGTTLAYLPEIIY 328
Query: 332 DLVLSQF 338
+ ++ +
Sbjct: 329 EPLVKKI 335
>gi|356523724|ref|XP_003530485.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 488
Score = 344 bits (882), Expect = 4e-92, Method: Compositional matrix adjust.
Identities = 165/308 (53%), Positives = 217/308 (70%), Gaps = 10/308 (3%)
Query: 32 VFEVENKFKAGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGT 91
VF V+ K++ +R+LSALK HD RR ++A +DL LGG+G P A GLY+ K+G+GT
Sbjct: 37 VFNVKCKYQ-----DRSLSALKAHDYRRQLSLLAGVDLPLGGSGRPDAVGLYYAKIGIGT 91
Query: 92 PTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTT 151
P YY+QVDTGSD++WVNC C CPT+S LG+ LTL+D +SS+ + C FC+
Sbjct: 92 PPKNYYLQVDTGSDIMWVNCIQCKECPTRSSLGMDLTLYDIKESSSGKLVPCDQEFCKEI 151
Query: 152 YNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNR 211
C+ + C Y+ YGDGSST+GYFV+DI+ +Q SG+LKT N S++FGCG R
Sbjct: 152 NGGLLTGCTANISCPYLEIYGDGSSTAGYFVKDIVLYDQVSGDLKTDSANGSIVFGCGAR 211
Query: 212 QSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVS 271
QSGDL SS + A+DGILGFG+ANSS++SQLA++G V+K FAHCL+ V GGGIFAIG VV
Sbjct: 212 QSGDLSSSNEEALDGILGFGKANSSMISQLASSGKVKKMFAHCLNGVNGGGIFAIGHVVQ 271
Query: 272 PKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLP---- 327
PKV TP++P+ PHY+V + V+VG L L T GD +GTIIDSGTTLAYLP
Sbjct: 272 PKVNMTPLLPDQPHYSVNMTAVQVGHTFLSLSTDTSAQGDRKGTIIDSGTTLAYLPEGIY 331
Query: 328 -PMLYDLV 334
P++Y ++
Sbjct: 332 EPLVYKMI 339
>gi|218194599|gb|EEC77026.1| hypothetical protein OsI_15382 [Oryza sativa Indica Group]
Length = 409
Score = 343 bits (880), Expect = 7e-92, Method: Compositional matrix adjust.
Identities = 157/254 (61%), Positives = 202/254 (79%), Gaps = 1/254 (0%)
Query: 82 LYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEI 141
LY+T++G+GTPT YYVQVDTGSD+LWVNC C RCP KS LG++LTL+DP SST ++
Sbjct: 3 LYYTEIGIGTPTKRYYVQVDTGSDILWVNCISCDRCPRKSGLGLELTLYDPKDSSTGSKV 62
Query: 142 ACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLN 201
+C FC TY P C+ + CEY VTYGDGSST+GYFV D++Q +Q SG+ +T P N
Sbjct: 63 SCDQGFCAATYGGLLPGCTTSLPCEYSVTYGDGSSTTGYFVSDLLQFDQVSGDGQTRPAN 122
Query: 202 SSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGG 261
S+V FGCG++Q GDLGSS + A+DGI+GFGQ+N+S+LSQL+AAG V+K FAHCLD + GG
Sbjct: 123 STVTFGCGSQQGGDLGSS-NQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCLDTINGG 181
Query: 262 GIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGT 321
GIFAIG+VV PKVKTTP+VPNMPHYNV L+ ++VGG L LP+ + TG+++GTIIDSGT
Sbjct: 182 GIFAIGNVVQPKVKTTPLVPNMPHYNVNLKSIDVGGTALKLPSHMFDTGEKKGTIIDSGT 241
Query: 322 TLAYLPPMLYDLVL 335
TL YLP ++Y ++
Sbjct: 242 TLTYLPEIVYKEIM 255
>gi|6850312|gb|AAF29389.1|AC009999_9 Contains similarity to nucellin from Hordeum vulgare gb|U87148.
ESTs gb|T22068, gb|F14251, gb|F14237, gb|F14242 come
from this gene [Arabidopsis thaliana]
Length = 388
Score = 342 bits (877), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 163/306 (53%), Positives = 215/306 (70%), Gaps = 3/306 (0%)
Query: 32 VFEVENKFKAGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGT 91
VF V+ ++ + +L+ALK+HD RR ++A IDL LGG G P GLY+ K+G+GT
Sbjct: 32 VFNVKYRYP---RLQGSLTALKEHDDRRQLTILAGIDLPLGGTGRPDIPGLYYAKIGIGT 88
Query: 92 PTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTT 151
P YYVQVDTGSD++WVNC C +CP +S LGI+LTL++ +S + ++C D+FC
Sbjct: 89 PAKSYYVQVDTGSDIMWVNCIQCKQCPRRSTLGIELTLYNIDESDSGKLVSCDDDFCYQI 148
Query: 152 YNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNR 211
C + C Y+ YGDGSST+GYFV+D++Q + +G+LKT N SVIFGCG R
Sbjct: 149 SGGPLSGCKANMSCPYLEIYGDGSSTAGYFVKDVVQYDSVAGDLKTQTANGSVIFGCGAR 208
Query: 212 QSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVS 271
QSGDL SS + A+DGILGFG+ANSS++SQLA++G V+K FAHCLD GGGIFAIG VV
Sbjct: 209 QSGDLDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCLDGRNGGGIFAIGRVVQ 268
Query: 272 PKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLY 331
PKV TP+VPN PHYNV + V+VG L +P L GD +G IIDSGTTLAYLP ++Y
Sbjct: 269 PKVNMTPLVPNQPHYNVNMTAVQVGQEFLTIPADLFQPGDRKGAIIDSGTTLAYLPEIIY 328
Query: 332 DLVLSQ 337
+ ++ +
Sbjct: 329 EPLVKK 334
>gi|18390579|ref|NP_563751.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|332189782|gb|AEE27903.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 485
Score = 342 bits (877), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 163/307 (53%), Positives = 215/307 (70%), Gaps = 3/307 (0%)
Query: 32 VFEVENKFKAGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGT 91
VF V+ ++ + +L+ALK+HD RR ++A IDL LGG G P GLY+ K+G+GT
Sbjct: 32 VFNVKYRYP---RLQGSLTALKEHDDRRQLTILAGIDLPLGGTGRPDIPGLYYAKIGIGT 88
Query: 92 PTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTT 151
P YYVQVDTGSD++WVNC C +CP +S LGI+LTL++ +S + ++C D+FC
Sbjct: 89 PAKSYYVQVDTGSDIMWVNCIQCKQCPRRSTLGIELTLYNIDESDSGKLVSCDDDFCYQI 148
Query: 152 YNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNR 211
C + C Y+ YGDGSST+GYFV+D++Q + +G+LKT N SVIFGCG R
Sbjct: 149 SGGPLSGCKANMSCPYLEIYGDGSSTAGYFVKDVVQYDSVAGDLKTQTANGSVIFGCGAR 208
Query: 212 QSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVS 271
QSGDL SS + A+DGILGFG+ANSS++SQLA++G V+K FAHCLD GGGIFAIG VV
Sbjct: 209 QSGDLDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCLDGRNGGGIFAIGRVVQ 268
Query: 272 PKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLY 331
PKV TP+VPN PHYNV + V+VG L +P L GD +G IIDSGTTLAYLP ++Y
Sbjct: 269 PKVNMTPLVPNQPHYNVNMTAVQVGQEFLTIPADLFQPGDRKGAIIDSGTTLAYLPEIIY 328
Query: 332 DLVLSQF 338
+ ++ +
Sbjct: 329 EPLVKKI 335
>gi|356568907|ref|XP_003552649.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 490
Score = 342 bits (876), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 164/301 (54%), Positives = 214/301 (71%), Gaps = 5/301 (1%)
Query: 32 VFEVENKFKAGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGT 91
VF V+ K++ +RTLSALK HD RR ++A +DL LGG+G P A GLY+ K+G+GT
Sbjct: 39 VFNVKCKYQ-----DRTLSALKAHDYRRQLSLLAGVDLPLGGSGRPDAVGLYYAKIGIGT 93
Query: 92 PTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTT 151
P YY+QVDTGSD++WVNC C CPT+S+LG+ LTL+D +SS+ + C FC+
Sbjct: 94 PPKNYYLQVDTGSDIMWVNCIQCKECPTRSNLGMDLTLYDIKESSSGKFVPCDQEFCKEI 153
Query: 152 YNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNR 211
C+ + C Y+ YGDGSST+GYFV+DI+ +Q SG+LKT N S++FGCG R
Sbjct: 154 NGGLLTGCTANISCPYLEIYGDGSSTAGYFVKDIVLYDQVSGDLKTDSANGSIVFGCGAR 213
Query: 212 QSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVS 271
QSGDL SS + A+ GILGFG+ANSS++SQLA++G V+K FAHCL+ V GGGIFAIG VV
Sbjct: 214 QSGDLSSSNEEALGGILGFGKANSSMISQLASSGKVKKMFAHCLNGVNGGGIFAIGHVVQ 273
Query: 272 PKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLY 331
PKV TP++P+ PHY+V + V+VG L L T GD +GTIIDSGTTLAYLP +Y
Sbjct: 274 PKVNMTPLLPDQPHYSVNMTAVQVGHAFLSLSTDTSTQGDRKGTIIDSGTTLAYLPEGIY 333
Query: 332 D 332
+
Sbjct: 334 E 334
>gi|357133002|ref|XP_003568117.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 497
Score = 340 bits (873), Expect = 5e-91, Method: Compositional matrix adjust.
Identities = 175/315 (55%), Positives = 228/315 (72%), Gaps = 4/315 (1%)
Query: 25 GGVMGNFVFEVENKF-KAGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLY 83
GGV VF+V +F + GGE L+A HD RHGR++A+ D+ LGG G P+ TGLY
Sbjct: 28 GGVSAAGVFKVRRRFARPGGEGGGNLTAHLAHDGDRHGRLLAAADVPLGGLGLPTGTGLY 87
Query: 84 FTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIAC 143
+TK+ +GTP ++VQVDTGSD+LWVNC C +CPTKS LGI L L+DP SS+ ++C
Sbjct: 88 YTKIEIGTPPKPFHVQVDTGSDILWVNCVSCDKCPTKSGLGIDLALYDPKGSSSGSAVSC 147
Query: 144 SDNFCRTTYNN--RYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLN 201
+ FC TY + + P C+ G CEY YGDGSST+G FV D +Q NQ SGN +T
Sbjct: 148 DNKFCAATYGSGEKLPGCTAGKPCEYRAEYGDGSSTAGSFVSDSLQYNQLSGNAQTRHAK 207
Query: 202 SSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGG 261
++VIFGCG +Q GDL ST+ A+DGI+GFGQ+N+S LSQLA+AG V+K F+HCLD +KGG
Sbjct: 208 ANVIFGCGAQQGGDL-ESTNQALDGIIGFGQSNTSTLSQLASAGEVKKIFSHCLDTIKGG 266
Query: 262 GIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGT 321
GIFAIG+VV PKVK+TP++PNM HYNV L+ ++V GN L LP + T ++RGTIIDSGT
Sbjct: 267 GIFAIGEVVQPKVKSTPLLPNMSHYNVNLQSIDVAGNALQLPPHIFETSEKRGTIIDSGT 326
Query: 322 TLAYLPPMLYDLVLS 336
TL YLP ++Y +L+
Sbjct: 327 TLTYLPELVYKDILA 341
>gi|226532674|ref|NP_001151415.1| pepsin A precursor [Zea mays]
gi|195646632|gb|ACG42784.1| pepsin A [Zea mays]
Length = 492
Score = 330 bits (846), Expect = 7e-88, Method: Compositional matrix adjust.
Identities = 173/312 (55%), Positives = 225/312 (72%), Gaps = 10/312 (3%)
Query: 32 VFEVENKF--KAGGER-ERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVG 88
+F+V KF GG+ E L+AL +HD R+GR++ ++DL LGG G P+ATGLY+T++
Sbjct: 31 LFQVRRKFPRHGGGDVVEHRLAALLRHDMGRNGRLLGAVDLPLGGVGLPTATGLYYTRIE 90
Query: 89 LGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFC 148
+G+P YYVQVDTGSD+LWVN C CPT+S LGI+LT +DP+ S T+ + C FC
Sbjct: 91 IGSPPKGYYVQVDTGSDILWVNGISCDGCPTRSGLGIELTQYDPAGSGTT--VGCEQEFC 148
Query: 149 --RTTYNNRYPSC-SPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVI 205
+ + P+C S C++ +TYGDGSST+G++V D +Q NQ SGN +T P N S+
Sbjct: 149 VANSAASGVPPACPSAASPCQFRITYGDGSSTTGFYVTDFVQYNQVSGNGQTTPSNVSIT 208
Query: 206 FGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFA 265
FGCG + GDLGSS+ A+DGILGFGQ+++S+LSQLAAA VRK FAHCLD V+GGGIFA
Sbjct: 209 FGCGAQLGGDLGSSSQ-ALDGILGFGQSDASMLSQLAAARKVRKIFAHCLDTVRGGGIFA 267
Query: 266 IGDVVSPK-VKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLA 324
IG+VV P VKTTP+VPN HYNV L+ + VGG L LPTS +GD +GTIIDSGTTLA
Sbjct: 268 IGNVVQPPIVKTTPLVPNATHYNVNLQGISVGGATLQLPTSTFDSGDSKGTIIDSGTTLA 327
Query: 325 YLPPMLYDLVLS 336
YLP +Y +L+
Sbjct: 328 YLPREVYRTLLT 339
>gi|357502759|ref|XP_003621668.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355496683|gb|AES77886.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 481
Score = 330 bits (845), Expect = 8e-88, Method: Compositional matrix adjust.
Identities = 159/321 (49%), Positives = 219/321 (68%), Gaps = 9/321 (2%)
Query: 32 VFEVENKFKAGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGT 91
VF V+ KF +++R+LS LK HD RR ++ +DL LGG G P + GLY+ K+G+GT
Sbjct: 24 VFNVQYKFS--DDQQRSLSVLKAHDYRRQISLLTGVDLPLGGTGRPDSVGLYYAKIGIGT 81
Query: 92 PTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTT 151
P+ +YY+QVDTG+D++WVNC C CPT+S+LG+ LTL++ +SS+ + C C+
Sbjct: 82 PSKDYYLQVDTGTDMMWVNCIQCKECPTRSNLGMDLTLYNIKESSSGKLVPCDQELCKEI 141
Query: 152 YNNRYPSCSPGVR--CEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCG 209
C+ C Y+ YGDGSST+GYFV+D++ +Q SG+LKTA N SVIFGCG
Sbjct: 142 NGGLLTGCTSKTNDSCPYLEIYGDGSSTAGYFVKDVVLFDQVSGDLKTASANGSVIFGCG 201
Query: 210 NRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDV 269
RQSGDL S + A+DGILGFG+AN S++SQL+++G V+K FAHCL+ V GGGIFAIG V
Sbjct: 202 ARQSGDLSYSNEEALDGILGFGKANYSMISQLSSSGKVKKMFAHCLNGVNGGGIFAIGHV 261
Query: 270 VSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLP-- 327
V P V TTP++P+ PHY+V + ++VG L+L T D +GTIIDSGTTLAYLP
Sbjct: 262 VQPTVNTTPLLPDQPHYSVNMTAIQVGHTFLNLSTDASEQRDSKGTIIDSGTTLAYLPDG 321
Query: 328 ---PMLYDLVLSQFRFWIASL 345
P++Y ++ Q + +L
Sbjct: 322 IYQPLVYKILSQQPNLKVQTL 342
>gi|47497551|dbj|BAD19623.1| nucellin-like aspartic protease-like [Oryza sativa Japonica Group]
gi|47847593|dbj|BAD21980.1| nucellin-like aspartic protease-like [Oryza sativa Japonica Group]
Length = 297
Score = 324 bits (830), Expect = 4e-86, Method: Compositional matrix adjust.
Identities = 159/263 (60%), Positives = 198/263 (75%), Gaps = 4/263 (1%)
Query: 32 VFEVENKFKAGGER-ERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLG 90
VFEV+ KF G+ E LSAL++HD RRHGR++A+IDL LGG+G + TGLYFT++G+G
Sbjct: 38 VFEVQRKFTRHGDGGEGHLSALREHDGRRHGRLLAAIDLPLGGSGLATETGLYFTRIGIG 97
Query: 91 TPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRT 150
TP YYVQVDTGSD+LWVNC C CP KS+LGI+LT++DP S + + C FC
Sbjct: 98 TPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIELTMYDPRGSQSGELVTCDQQFCVA 157
Query: 151 TYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGN 210
Y PSC+ CEY ++YGDGSST+G+FV D +Q NQ SG+ +T P N+SV FGCG
Sbjct: 158 NYGGVLPSCTSTSPCEYSISYGDGSSTAGFFVTDFLQYNQVSGDGQTTPANASVSFGCGA 217
Query: 211 RQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVV 270
+ GDLGSS + A+DGILGFGQ+NSS+LSQLAAAG VRK FAHCLD V GGGIFAIG+VV
Sbjct: 218 KLGGDLGSS-NLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCLDTVNGGGIFAIGNVV 276
Query: 271 SPKVKTTPMVPNMPHYNVILEEV 293
PKVKTTP+VP+M Y +IL ++
Sbjct: 277 QPKVKTTPLVPDM--YAIILCQL 297
>gi|224144963|ref|XP_002325476.1| predicted protein [Populus trichocarpa]
gi|222862351|gb|EEE99857.1| predicted protein [Populus trichocarpa]
Length = 372
Score = 308 bits (788), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 148/258 (57%), Positives = 187/258 (72%), Gaps = 21/258 (8%)
Query: 82 LYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEI 141
LYF K+GLG P+ +YYVQVDTGSD+LWVNC GC +CPTKSDLGIKLTL+DP+ S ++ +
Sbjct: 26 LYFAKIGLGNPSKDYYVQVDTGSDILWVNCIGCDKCPTKSDLGIKLTLYDPASSVSATRV 85
Query: 142 ACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLN 201
+C D+FC +TYN P C + C+Y V YGDGSST+GYFV D +Q + +GNL+T N
Sbjct: 86 SCDDDFCTSTYNGLLPDCKKELPCQYNVVYGDGSSTAGYFVSDAVQFERVTGNLQTGLSN 145
Query: 202 SSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGG 261
+V FGCG +QSG LG+S + A+DGILG FAHCLD V GG
Sbjct: 146 GTVTFGCGAQQSGGLGTSGE-ALDGILG--------------------AFAHCLDNVNGG 184
Query: 262 GIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGT 321
GIFAIG++VSPKV TTPMVPN HYNV ++E+EVGG L+LPT + +GD RGTIIDSGT
Sbjct: 185 GIFAIGELVSPKVNTTPMVPNQAHYNVYMKEIEVGGTVLELPTDVFDSGDRRGTIIDSGT 244
Query: 322 TLAYLPPMLYDLVLSQFR 339
TLAYLP ++YD ++++ R
Sbjct: 245 TLAYLPEVVYDSMMNEIR 262
>gi|115457772|ref|NP_001052486.1| Os04g0334700 [Oryza sativa Japonica Group]
gi|113564057|dbj|BAF14400.1| Os04g0334700 [Oryza sativa Japonica Group]
Length = 482
Score = 291 bits (745), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 157/344 (45%), Positives = 214/344 (62%), Gaps = 16/344 (4%)
Query: 1 MGGLRLLALVVVTVAVVHQWAVGGGGVMGNFVFEVENKFKA--GGERERTLSALKQHDTR 58
M LL+ +++ + VV A G M N VF+V KF G + + AL+ HD
Sbjct: 1 MAAPLLLSTIILALVVV---ASSTHGTMANGVFQVRRKFHIVDGVYKGSDIGALQTHDEN 57
Query: 59 RHGR--MMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSR 116
RH R +MA+ +L LGG P TGLY+T +G+GTP +YYVQ+DTGS WVN C +
Sbjct: 58 RHRRRNLMAA-ELPLGGFNIPYGTGLYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQ 116
Query: 117 CPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSS 176
CP +SD+ KLT +DP S +S E+ C D C + P C+ +RC Y+ Y DG
Sbjct: 117 CPHESDILRKLTFYDPRSSVSSKEVKCDDTICTSR-----PPCNMTLRCPYITGYADGGL 171
Query: 177 TSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSS 236
T G D++ +Q GN +T P ++SV FGCG +QSG L +S A+DGI+GFG +N +
Sbjct: 172 TMGILFTDLLHYHQLYGNGQTQPTSTSVTFGCGLQQSGSLNNSA-VAIDGIIGFGNSNQT 230
Query: 237 LLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVI-LEEVEV 295
LSQLAAAG +K F+HCLD GGGIFAIG+VV PKVKTTP+V N Y+++ L+ + V
Sbjct: 231 ALSQLAAAGKTKKIFSHCLDSTNGGGIFAIGEVVEPKVKTTPIVKNNEVYHLVNLKSINV 290
Query: 296 GGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLY-DLVLSQF 338
G L LP ++ GT +GT IDSG+TL YLP ++Y +L+L+ F
Sbjct: 291 AGTTLQLPANIFGTTKTKGTFIDSGSTLVYLPEIIYSELILAVF 334
>gi|297723027|ref|NP_001173877.1| Os04g0336942 [Oryza sativa Japonica Group]
gi|255675342|dbj|BAH92605.1| Os04g0336942 [Oryza sativa Japonica Group]
Length = 388
Score = 291 bits (744), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 157/344 (45%), Positives = 214/344 (62%), Gaps = 16/344 (4%)
Query: 1 MGGLRLLALVVVTVAVVHQWAVGGGGVMGNFVFEVENKFKA--GGERERTLSALKQHDTR 58
M LL+ +++ + VV A G M N VF+V KF G + + AL+ HD
Sbjct: 1 MAAPLLLSTIILALVVV---ASSTHGTMANGVFQVRRKFHIVDGVYKGSDIGALQTHDEN 57
Query: 59 RHGR--MMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSR 116
RH R +MA+ +L LGG P TGLY+T +G+GTP +YYVQ+DTGS WVN C +
Sbjct: 58 RHRRRNLMAA-ELPLGGFNIPYGTGLYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQ 116
Query: 117 CPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSS 176
CP +SD+ KLT +DP S +S E+ C D C + P C+ +RC Y+ Y DG
Sbjct: 117 CPHESDILRKLTFYDPRSSVSSKEVKCDDTICTSR-----PPCNMTLRCPYITGYADGGL 171
Query: 177 TSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSS 236
T G D++ +Q GN +T P ++SV FGCG +QSG L +S A+DGI+GFG +N +
Sbjct: 172 TMGILFTDLLHYHQLYGNGQTQPTSTSVTFGCGLQQSGSLNNSA-VAIDGIIGFGNSNQT 230
Query: 237 LLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVI-LEEVEV 295
LSQLAAAG +K F+HCLD GGGIFAIG+VV PKVKTTP+V N Y+++ L+ + V
Sbjct: 231 ALSQLAAAGKTKKIFSHCLDSTNGGGIFAIGEVVEPKVKTTPIVKNNEVYHLVNLKSINV 290
Query: 296 GGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLY-DLVLSQF 338
G L LP ++ GT +GT IDSG+TL YLP ++Y +L+L+ F
Sbjct: 291 AGTTLQLPANIFGTTKTKGTFIDSGSTLVYLPEIIYSELILAVF 334
>gi|215766660|dbj|BAG98888.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 433
Score = 291 bits (744), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 157/344 (45%), Positives = 214/344 (62%), Gaps = 16/344 (4%)
Query: 1 MGGLRLLALVVVTVAVVHQWAVGGGGVMGNFVFEVENKFKA--GGERERTLSALKQHDTR 58
M LL+ +++ + VV A G M N VF+V KF G + + AL+ HD
Sbjct: 1 MAAPLLLSTIILALVVV---ASSTHGTMANGVFQVRRKFHIVDGVYKGSDIGALQTHDEN 57
Query: 59 RHGR--MMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSR 116
RH R +MA+ +L LGG P TGLY+T +G+GTP +YYVQ+DTGS WVN C +
Sbjct: 58 RHRRRNLMAA-ELPLGGFNIPYGTGLYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQ 116
Query: 117 CPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSS 176
CP +SD+ KLT +DP S +S E+ C D C + P C+ +RC Y+ Y DG
Sbjct: 117 CPHESDILRKLTFYDPRSSVSSKEVKCDDTICTSR-----PPCNMTLRCPYITGYADGGL 171
Query: 177 TSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSS 236
T G D++ +Q GN +T P ++SV FGCG +QSG L +S A+DGI+GFG +N +
Sbjct: 172 TMGILFTDLLHYHQLYGNGQTQPTSTSVTFGCGLQQSGSLNNSA-VAIDGIIGFGNSNQT 230
Query: 237 LLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVI-LEEVEV 295
LSQLAAAG +K F+HCLD GGGIFAIG+VV PKVKTTP+V N Y+++ L+ + V
Sbjct: 231 ALSQLAAAGKTKKIFSHCLDSTNGGGIFAIGEVVEPKVKTTPIVKNNEVYHLVNLKSINV 290
Query: 296 GGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLY-DLVLSQF 338
G L LP ++ GT +GT IDSG+TL YLP ++Y +L+L+ F
Sbjct: 291 AGTTLQLPANIFGTTKTKGTFIDSGSTLVYLPEIIYSELILAVF 334
>gi|413936885|gb|AFW71436.1| hypothetical protein ZEAMMB73_738128, partial [Zea mays]
Length = 320
Score = 290 bits (743), Expect = 5e-76, Method: Compositional matrix adjust.
Identities = 148/287 (51%), Positives = 197/287 (68%), Gaps = 7/287 (2%)
Query: 13 TVAVVHQWAVGGGGVMGNFVFEVENKFKAGGER--ERTLSALKQHDTRRHGRMMASIDLE 70
+V +V +A+ G VF+V KF G R L+AL++HD RHGR++ ++DL
Sbjct: 12 SVLLVLLFALSVGCASATGVFQVRRKFPRHGGRGVAEHLAALRRHDANRHGRLLGAVDLA 71
Query: 71 LGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLF 130
LGG G P+ TGLY+T++ +G+P YYVQVDTGSD+LWVNC C CPT+S LGI+LT +
Sbjct: 72 LGGVGLPTDTGLYYTRIEIGSPPKGYYVQVDTGSDILWVNCIRCDGCPTRSGLGIELTQY 131
Query: 131 DPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVR--CEYVVTYGDGSSTSGYFVRDIIQL 188
DP+ S T+ + C FC P P C++ +TYGDGS+T+G++V D +Q
Sbjct: 132 DPAGSGTT--VGCEQEFCVANSAGGVPPTCPSTSSPCQFRITYGDGSTTTGFYVTDFVQY 189
Query: 189 NQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVR 248
NQ SGN +T N+S+ FGCG + GDLGSS + A+DGILGFGQ++SS+LSQLAAA VR
Sbjct: 190 NQVSGNGQTTTSNASITFGCGAQLGGDLGSS-NQALDGILGFGQSDSSMLSQLAAARRVR 248
Query: 249 KEFAHCLDVVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEV 295
K FAHCLD V+GGGIFAIG+VV PKVKTTP+VPN+ +V+ V +
Sbjct: 249 KIFAHCLDTVRGGGIFAIGNVVQPKVKTTPLVPNVYVVSVLFSPVYI 295
>gi|218194598|gb|EEC77025.1| hypothetical protein OsI_15381 [Oryza sativa Indica Group]
Length = 422
Score = 284 bits (727), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 150/317 (47%), Positives = 202/317 (63%), Gaps = 13/317 (4%)
Query: 28 MGNFVFEVENKFKA--GGERERTLSALKQHDTRRHGR--MMASIDLELGGNGHPSATGLY 83
M N VF+V KF G + + AL+ HD RH R +MA+ +L LGG P TGLY
Sbjct: 1 MANGVFQVRRKFHIVDGVYKGSDIGALQTHDENRHRRRNLMAA-ELPLGGFNIPYGTGLY 59
Query: 84 FTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIAC 143
+T +G+GTP +YYVQ+DTGS WVN C +CP +SD+ KLT +DP S +S E+ C
Sbjct: 60 YTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTFYDPRSSVSSKEVKC 119
Query: 144 SDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSS 203
D C + P C+ +RC Y+ Y DG T G D++ +Q GN +T P ++S
Sbjct: 120 DDTICTSR-----PPCNMTLRCPYITGYADGGLTMGILFTDLLHYHQLYGNGQTQPTSTS 174
Query: 204 VIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGI 263
V FGCG +QSG L +S A+DGI+GFG +N + LSQLAAAG +K F+HCLD GGGI
Sbjct: 175 VTFGCGLQQSGSLNNSA-VAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTNGGGI 233
Query: 264 FAIGDVVSPKVKTTPMVPNMPHYNVI-LEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTT 322
FAIG+VV PKVKTTP+V N Y+++ L+ + V G L LP ++ GT +GT IDSG+T
Sbjct: 234 FAIGEVVEPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKTKGTFIDSGST 293
Query: 323 LAYLPPMLY-DLVLSQF 338
L YLP ++Y +L+L+ F
Sbjct: 294 LVYLPEIIYSELILAVF 310
>gi|32479948|emb|CAE01594.1| OSJNBa0008A08.2 [Oryza sativa Japonica Group]
gi|38347627|emb|CAE05222.2| OSJNBa0011K22.4 [Oryza sativa Japonica Group]
gi|38567678|emb|CAE75961.1| B1159F04.24 [Oryza sativa Japonica Group]
gi|116309512|emb|CAH66578.1| OSIGBa0137O04.4 [Oryza sativa Indica Group]
Length = 431
Score = 284 bits (726), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 150/317 (47%), Positives = 202/317 (63%), Gaps = 13/317 (4%)
Query: 28 MGNFVFEVENKFKA--GGERERTLSALKQHDTRRHGR--MMASIDLELGGNGHPSATGLY 83
M N VF+V KF G + + AL+ HD RH R +MA+ +L LGG P TGLY
Sbjct: 1 MANGVFQVRRKFHIVDGVYKGSDIGALQTHDENRHRRRNLMAA-ELPLGGFNIPYGTGLY 59
Query: 84 FTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIAC 143
+T +G+GTP +YYVQ+DTGS WVN C +CP +SD+ KLT +DP S +S E+ C
Sbjct: 60 YTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTFYDPRSSVSSKEVKC 119
Query: 144 SDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSS 203
D C + P C+ +RC Y+ Y DG T G D++ +Q GN +T P ++S
Sbjct: 120 DDTICTSR-----PPCNMTLRCPYITGYADGGLTMGILFTDLLHYHQLYGNGQTQPTSTS 174
Query: 204 VIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGI 263
V FGCG +QSG L +S A+DGI+GFG +N + LSQLAAAG +K F+HCLD GGGI
Sbjct: 175 VTFGCGLQQSGSLNNSA-VAIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTNGGGI 233
Query: 264 FAIGDVVSPKVKTTPMVPNMPHYNVI-LEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTT 322
FAIG+VV PKVKTTP+V N Y+++ L+ + V G L LP ++ GT +GT IDSG+T
Sbjct: 234 FAIGEVVEPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKTKGTFIDSGST 293
Query: 323 LAYLPPMLY-DLVLSQF 338
L YLP ++Y +L+L+ F
Sbjct: 294 LVYLPEIIYSELILAVF 310
>gi|357507805|ref|XP_003624191.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355499206|gb|AES80409.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 406
Score = 268 bits (684), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 129/225 (57%), Positives = 164/225 (72%)
Query: 113 GCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYG 172
GC+ CP KS LG+ LTL+DP+ S TS + C D FC TY+ C + C Y +TYG
Sbjct: 32 GCTACPKKSGLGMDLTLYDPNGSKTSNAVPCGDGFCTDTYSGPISGCKQDMSCPYSITYG 91
Query: 173 DGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQ 232
DGS+TSG FV D + ++ SGNL T P NSSVIFGCG +QSG L S++D A+DGI+GFGQ
Sbjct: 92 DGSTTSGSFVNDSLTFDEVSGNLHTKPDNSSVIFGCGAKQSGSLSSNSDEALDGIIGFGQ 151
Query: 233 ANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEE 292
ANSS+LSQLAA+G V++ F+HCLD GGGIF+IG V+ PK TTP+VP M HYNVIL++
Sbjct: 152 ANSSVLSQLAASGKVKRIFSHCLDSHHGGGIFSIGQVMEPKFNTTPLVPRMAHYNVILKD 211
Query: 293 VEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQ 337
++V G P+ LP L +G RGTIIDSGTTLAYLP +Y+ +L +
Sbjct: 212 MDVDGEPILLPLYLFDSGSGRGTIIDSGTTLAYLPLSIYNQLLPK 256
>gi|147859621|emb|CAN83119.1| hypothetical protein VITISV_043393 [Vitis vinifera]
Length = 431
Score = 263 bits (673), Expect = 8e-68, Method: Compositional matrix adjust.
Identities = 142/310 (45%), Positives = 192/310 (61%), Gaps = 31/310 (10%)
Query: 38 KFKAGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYY 97
K+K G++ R+L+ALK HD R R++A +DL LGG G P A GLY+ K+G+GTP +YY
Sbjct: 54 KYKFAGQK-RSLAALKAHDNSRQLRILAGVDLPLGGTGRPEAVGLYYAKIGIGTPARDYY 112
Query: 98 VQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYP 157
VQ++ LTL+D +S T ++C +FC
Sbjct: 113 VQME-------------------------LTLYDIKESLTGKLVSCDQDFCYAINGGPPS 147
Query: 158 SCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASG--NLKTAPLNSSVIFGCGNRQSGD 215
C + C Y Y DGSS+ GYFV+ ++ + +L PL V C QSGD
Sbjct: 148 YCIANMSCSYTEIYADGSSSFGYFVKGYCTASKYNSIPHLNNNPL-LEVPLRCSATQSGD 206
Query: 216 LGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSPKVK 275
L S++ A+DGILGFG++N+S++SQLA++G VRK FAHCLD + GGGIFAIG +V PKV
Sbjct: 207 L--SSEEALDGILGFGKSNTSMISQLASSGKVRKMFAHCLDGLNGGGIFAIGHIVQPKVN 264
Query: 276 TTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVL 335
TTP+VPN HYNV ++ VEVGG L+LPT + GD++GTIIDSGTTLAYLP ++YD +L
Sbjct: 265 TTPLVPNQTHYNVNMKAVEVGGYFLNLPTDVFDVGDKKGTIIDSGTTLAYLPEVVYDQLL 324
Query: 336 SQFRFWIASL 345
S+ W + L
Sbjct: 325 SKIFSWQSDL 334
>gi|356564743|ref|XP_003550608.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 490
Score = 261 bits (667), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 137/294 (46%), Positives = 179/294 (60%), Gaps = 7/294 (2%)
Query: 49 LSALKQHDTRRHGRMMAS----IDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGS 104
LS L+ D RH RM+ S +D + G P GLY+TKV LGTP E+ VQ+DTGS
Sbjct: 37 LSQLRARDALRHRRMLQSSNGVVDFSVQGTFDPFQVGLYYTKVQLGTPPVEFNVQIDTGS 96
Query: 105 DLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSP-GV 163
D+LWV+C CS CP S L I+L FDP SSTS IACSD C + +CS
Sbjct: 97 DVLWVSCNSCSGCPQTSGLQIQLNFFDPGSSSTSSMIACSDQRCNNGIQSSDATCSSQNN 156
Query: 164 RCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAA 223
+C Y YGDGS TSGY+V D++ LN T + V+FGC N+Q+GDL + +D A
Sbjct: 157 QCSYTFQYGDGSGTSGYYVSDMMHLNTIFEGSVTTNSTAPVVFGCSNQQTGDL-TKSDRA 215
Query: 224 VDGILGFGQANSSLLSQLAAAGNVRKEFAHCLD-VVKGGGIFAIGDVVSPKVKTTPMVPN 282
VDGI GFGQ S++SQL++ G + F+HCL GGGI +G++V P + T +VP
Sbjct: 216 VDGIFGFGQQEMSVISQLSSQGIAPRVFSHCLKGDSSGGGILVLGEIVEPNIVYTSLVPA 275
Query: 283 MPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLS 336
PHYN+ L+ + V G L + +S+ T + RGTI+DSGTTLAYL YD +S
Sbjct: 276 QPHYNLNLQSIAVNGQTLQIDSSVFATSNSRGTIVDSGTTLAYLAEEAYDPFVS 329
>gi|356543524|ref|XP_003540210.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 493
Score = 260 bits (664), Expect = 8e-67, Method: Compositional matrix adjust.
Identities = 146/332 (43%), Positives = 194/332 (58%), Gaps = 14/332 (4%)
Query: 11 VVTVAVVHQWAVGGGGVMGNFVFEVENKFKAGGERERTLSALKQHDTRRHGRMMAS---- 66
V++VA++ AV GG +E F E LS L+ D RH RM+ S
Sbjct: 9 VISVALLA--AVAGGSPA---TLTLERAFPTNHGVE--LSQLRARDELRHRRMLQSSSGV 61
Query: 67 IDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIK 126
+D + G P GLY+TKV LGTP E+ VQ+DTGSD+LWV+C C+ CP S L I+
Sbjct: 62 VDFSVQGTFDPFQVGLYYTKVQLGTPPVEFNVQIDTGSDVLWVSCNSCNGCPQTSGLQIQ 121
Query: 127 LTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSP-GVRCEYVVTYGDGSSTSGYFVRDI 185
L FDP SSTS IACSD C + +CS +C Y YGDGS TSGY+V D+
Sbjct: 122 LNFFDPGSSSTSSMIACSDQRCNNGKQSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSDM 181
Query: 186 IQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAG 245
+ LN T + V+FGC N+Q+GDL + +D AVDGI GFGQ S++SQL++ G
Sbjct: 182 MHLNTIFEGSMTTNSTAPVVFGCSNQQTGDL-TKSDRAVDGIFGFGQQEMSVISQLSSQG 240
Query: 246 NVRKEFAHCLD-VVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPT 304
+ F+HCL GGGI +G++V P + T +VP PHYN+ L+ + V G L + +
Sbjct: 241 IAPRIFSHCLKGDSSGGGILVLGEIVEPNIVYTSLVPAQPHYNLNLQSISVNGQTLQIDS 300
Query: 305 SLLGTGDERGTIIDSGTTLAYLPPMLYDLVLS 336
S+ T + RGTI+DSGTTLAYL YD +S
Sbjct: 301 SVFATSNSRGTIVDSGTTLAYLAEEAYDPFVS 332
>gi|357125326|ref|XP_003564345.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 506
Score = 257 bits (656), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 137/296 (46%), Positives = 177/296 (59%), Gaps = 16/296 (5%)
Query: 52 LKQHDTRRHGR----------MMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVD 101
LK+ D H R + +D + G+ +P GLYFT+V LG P EY+VQ+D
Sbjct: 48 LKERDGAHHARRRGLLGGAPAVAGVVDFPVEGSANPYMVGLYFTRVKLGNPAKEYFVQID 107
Query: 102 TGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSC-- 159
TGSD+LWV C+ C+ CPT S L I+L F+P SSTS I CSD+ C C
Sbjct: 108 TGSDILWVACSPCTGCPTSSGLNIQLEFFNPDSSSTSSRIPCSDDRCTAALQTGEAVCQS 167
Query: 160 --SPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLG 217
SP C Y TYGDGS TSG++V D + + GN +TA ++SV+FGC N QSGDL
Sbjct: 168 SDSPSSPCGYTFTYGDGSGTSGFYVSDTMYFDTVMGNEQTANSSASVVFGCSNSQSGDL- 226
Query: 218 SSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLD-VVKGGGIFAIGDVVSPKVKT 276
TD AVDGI GFGQ S++SQL + G K F+HCL GGGI +G++V P +
Sbjct: 227 MKTDRAVDGIFGFGQHQLSVVSQLYSLGVSPKTFSHCLKGSDNGGGILVLGEIVEPGLVF 286
Query: 277 TPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYD 332
TP+VP+ PHYN+ LE + V G L + +SL T + +GTI+DSGTTL YL YD
Sbjct: 287 TPLVPSQPHYNLNLESIAVSGQKLPIDSSLFATSNTQGTIVDSGTTLVYLVDGAYD 342
>gi|226508052|ref|NP_001150337.1| LOC100283967 precursor [Zea mays]
gi|195638522|gb|ACG38729.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 507
Score = 256 bits (653), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 136/301 (45%), Positives = 181/301 (60%), Gaps = 14/301 (4%)
Query: 49 LSALKQHDTRRH--------GRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQV 100
L L++ D RH G + +D + G+ +P GLYFT+V LG P E++VQ+
Sbjct: 47 LEELRRRDAARHRVSRRRLLGGVAGVVDFPVEGSANPYMVGLYFTRVKLGNPAKEFFVQI 106
Query: 101 DTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSC- 159
DTGSD+LWV C+ C+ CPT S L I+L F+P SST+ I CSD+ C + C
Sbjct: 107 DTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASRITCSDDRCTAGFQTGEAICQ 166
Query: 160 ---SPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDL 216
S C Y TYGDGS TSGY+V D + GN +TA ++S++FGC N QSGDL
Sbjct: 167 TSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGNEQTANSSASIVFGCSNSQSGDL 226
Query: 217 GSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLD-VVKGGGIFAIGDVVSPKVK 275
+ D AVDGI GFGQ S++SQL + G K F+HCL GGGI +G++V P +
Sbjct: 227 -TKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCLKGSDNGGGILVLGEIVEPGLV 285
Query: 276 TTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVL 335
TP+VP+ PHYN+ LE + V G L + +SL T + +GTI+DSGTTLAYL YD +
Sbjct: 286 YTPLVPSQPHYNLNLESIAVNGQKLPIDSSLFTTSNTQGTIVDSGTTLAYLADGAYDPFV 345
Query: 336 S 336
S
Sbjct: 346 S 346
>gi|413952263|gb|AFW84912.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 509
Score = 256 bits (653), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 136/301 (45%), Positives = 181/301 (60%), Gaps = 14/301 (4%)
Query: 49 LSALKQHDTRRH--------GRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQV 100
L L++ D RH G + +D + G+ +P GLYFT+V LG P E++VQ+
Sbjct: 49 LEELRRRDAARHRVSRRRLLGGVAGVVDFPVEGSANPYMVGLYFTRVKLGNPAKEFFVQI 108
Query: 101 DTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSC- 159
DTGSD+LWV C+ C+ CPT S L I+L F+P SST+ I CSD+ C + C
Sbjct: 109 DTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASRITCSDDRCTAGFQTGEAICQ 168
Query: 160 ---SPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDL 216
S C Y TYGDGS TSGY+V D + GN +TA ++S++FGC N QSGDL
Sbjct: 169 TSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGNEQTANSSASIVFGCSNSQSGDL 228
Query: 217 GSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLD-VVKGGGIFAIGDVVSPKVK 275
+ D AVDGI GFGQ S++SQL + G K F+HCL GGGI +G++V P +
Sbjct: 229 -TKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCLKGSDNGGGILVLGEIVEPGLV 287
Query: 276 TTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVL 335
TP+VP+ PHYN+ LE + V G L + +SL T + +GTI+DSGTTLAYL YD +
Sbjct: 288 YTPLVPSQPHYNLNLESIAVNGQKLPIDSSLFTTSNTQGTIVDSGTTLAYLADGAYDPFV 347
Query: 336 S 336
S
Sbjct: 348 S 348
>gi|218189149|gb|EEC71576.1| hypothetical protein OsI_03949 [Oryza sativa Indica Group]
Length = 504
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 135/292 (46%), Positives = 180/292 (61%), Gaps = 4/292 (1%)
Query: 44 ERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTG 103
ER+R + G + +D + G+ +P GLYFT+V LG+P EY+VQ+DTG
Sbjct: 52 ERDRARHGRRGLLGGGGGGVAGVVDFPVEGSANPFMVGLYFTRVKLGSPPKEYFVQIDTG 111
Query: 104 SDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSC--SP 161
SD+LWV C+ C+ CP+ S L I+L F+P SSTS +I CSD+ C C S
Sbjct: 112 SDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKIPCSDDRCTAALQTSEAVCQTSD 171
Query: 162 GVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTD 221
C Y TYGDGS TSGY+V D + + GN +TA ++S++FGC N QSGDL + TD
Sbjct: 172 NSPCGYTFTYGDGSGTSGYYVSDTMYFDSVMGNEQTANSSASIVFGCSNSQSGDL-TKTD 230
Query: 222 AAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLD-VVKGGGIFAIGDVVSPKVKTTPMV 280
AVDGI GFGQ S++SQL + G K F+HCL GGGI +G++V P + TP+V
Sbjct: 231 RAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLKGSDNGGGILVLGEIVEPGLVYTPLV 290
Query: 281 PNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYD 332
P+ PHYN+ LE + V G L + +SL T + +GTI+DSGTTLAYL YD
Sbjct: 291 PSQPHYNLNLESIVVNGQKLPIDSSLFTTSNTQGTIVDSGTTLAYLADGAYD 342
>gi|53791672|dbj|BAD53242.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
Length = 504
Score = 255 bits (651), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 135/292 (46%), Positives = 180/292 (61%), Gaps = 4/292 (1%)
Query: 44 ERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTG 103
ER+R + G + +D + G+ +P GLYFT+V LG+P EY+VQ+DTG
Sbjct: 52 ERDRARHGRRGLLGGGGGGVAGVVDFPVEGSANPFMVGLYFTRVKLGSPPKEYFVQIDTG 111
Query: 104 SDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSC--SP 161
SD+LWV C+ C+ CP+ S L I+L F+P SSTS +I CSD+ C C S
Sbjct: 112 SDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKIPCSDDRCTAALQTSEAVCQTSD 171
Query: 162 GVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTD 221
C Y TYGDGS TSGY+V D + + GN +TA ++S++FGC N QSGDL + TD
Sbjct: 172 NSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQTANSSASIVFGCSNSQSGDL-TKTD 230
Query: 222 AAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLD-VVKGGGIFAIGDVVSPKVKTTPMV 280
AVDGI GFGQ S++SQL + G K F+HCL GGGI +G++V P + TP+V
Sbjct: 231 RAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLKGSDNGGGILVLGEIVEPGLVYTPLV 290
Query: 281 PNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYD 332
P+ PHYN+ LE + V G L + +SL T + +GTI+DSGTTLAYL YD
Sbjct: 291 PSQPHYNLNLESIVVNGQKLPIDSSLFTTSNTQGTIVDSGTTLAYLADGAYD 342
>gi|297720449|ref|NP_001172586.1| Os01g0776900 [Oryza sativa Japonica Group]
gi|255673740|dbj|BAH91316.1| Os01g0776900 [Oryza sativa Japonica Group]
Length = 381
Score = 255 bits (651), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 135/292 (46%), Positives = 180/292 (61%), Gaps = 4/292 (1%)
Query: 44 ERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTG 103
ER+R + G + +D + G+ +P GLYFT+V LG+P EY+VQ+DTG
Sbjct: 52 ERDRARHGRRGLLGGGGGGVAGVVDFPVEGSANPFMVGLYFTRVKLGSPPKEYFVQIDTG 111
Query: 104 SDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSC--SP 161
SD+LWV C+ C+ CP+ S L I+L F+P SSTS +I CSD+ C C S
Sbjct: 112 SDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKIPCSDDRCTAALQTSEAVCQTSD 171
Query: 162 GVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTD 221
C Y TYGDGS TSGY+V D + + GN +TA ++S++FGC N QSGDL + TD
Sbjct: 172 NSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQTANSSASIVFGCSNSQSGDL-TKTD 230
Query: 222 AAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLD-VVKGGGIFAIGDVVSPKVKTTPMV 280
AVDGI GFGQ S++SQL + G K F+HCL GGGI +G++V P + TP+V
Sbjct: 231 RAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLKGSDNGGGILVLGEIVEPGLVYTPLV 290
Query: 281 PNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYD 332
P+ PHYN+ LE + V G L + +SL T + +GTI+DSGTTLAYL YD
Sbjct: 291 PSQPHYNLNLESIVVNGQKLPIDSSLFTTSNTQGTIVDSGTTLAYLADGAYD 342
>gi|30688682|ref|NP_197676.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|110736370|dbj|BAF00154.1| protease-like protein [Arabidopsis thaliana]
gi|332005704|gb|AED93087.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 493
Score = 252 bits (644), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 138/305 (45%), Positives = 180/305 (59%), Gaps = 9/305 (2%)
Query: 33 FEVENKFKAGGERERTLSALKQHDTRRHGRMMAS----IDLELGGNGHPSATGLYFTKVG 88
++E A E E LS LK D RHGR++ S ID + G P GLY+TK+
Sbjct: 29 LKLERVIPANHEME--LSQLKARDEARHGRLLQSLGGVIDFPVDGTFDPFVVGLYYTKLR 86
Query: 89 LGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFC 148
LGTP ++YVQVDTGSD+LWV+CA C+ CP S L I+L FDP S T+ I+CSD C
Sbjct: 87 LGTPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTASPISCSDQRC 146
Query: 149 RTTYNNRYPSCS-PGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFG 207
+ CS C Y YGDGS TSG++V D++Q + G+ + V+FG
Sbjct: 147 SWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVPNSTAPVVFG 206
Query: 208 CGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVK-GGGIFAI 266
C Q+GDL S D AVDGI GFGQ S++SQLA+ G + F+HCL GGGI +
Sbjct: 207 CSTSQTGDLVKS-DRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLKGENGGGGILVL 265
Query: 267 GDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYL 326
G++V P + TP+VP+ PHYNV L + V G L + S+ T + +GTIID+GTTLAYL
Sbjct: 266 GEIVEPNMVFTPLVPSQPHYNVNLLSISVNGQALPINPSVFSTSNGQGTIIDTGTTLAYL 325
Query: 327 PPMLY 331
Y
Sbjct: 326 SEAAY 330
>gi|10177232|dbj|BAB10606.1| protease-like protein [Arabidopsis thaliana]
Length = 539
Score = 252 bits (643), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 138/305 (45%), Positives = 180/305 (59%), Gaps = 9/305 (2%)
Query: 33 FEVENKFKAGGERERTLSALKQHDTRRHGRMMAS----IDLELGGNGHPSATGLYFTKVG 88
++E A E E LS LK D RHGR++ S ID + G P GLY+TK+
Sbjct: 29 LKLERVIPANHEME--LSQLKARDEARHGRLLQSLGGVIDFPVDGTFDPFVVGLYYTKLR 86
Query: 89 LGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFC 148
LGTP ++YVQVDTGSD+LWV+CA C+ CP S L I+L FDP S T+ I+CSD C
Sbjct: 87 LGTPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTASPISCSDQRC 146
Query: 149 RTTYNNRYPSCS-PGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFG 207
+ CS C Y YGDGS TSG++V D++Q + G+ + V+FG
Sbjct: 147 SWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVPNSTAPVVFG 206
Query: 208 CGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVK-GGGIFAI 266
C Q+GDL S D AVDGI GFGQ S++SQLA+ G + F+HCL GGGI +
Sbjct: 207 CSTSQTGDLVKS-DRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLKGENGGGGILVL 265
Query: 267 GDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYL 326
G++V P + TP+VP+ PHYNV L + V G L + S+ T + +GTIID+GTTLAYL
Sbjct: 266 GEIVEPNMVFTPLVPSQPHYNVNLLSISVNGQALPINPSVFSTSNGQGTIIDTGTTLAYL 325
Query: 327 PPMLY 331
Y
Sbjct: 326 SEAAY 330
>gi|51536458|gb|AAU05467.1| At5g22850 [Arabidopsis thaliana]
gi|55733777|gb|AAV59285.1| At5g22850 [Arabidopsis thaliana]
Length = 426
Score = 251 bits (641), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 138/305 (45%), Positives = 180/305 (59%), Gaps = 9/305 (2%)
Query: 33 FEVENKFKAGGERERTLSALKQHDTRRHGRMMAS----IDLELGGNGHPSATGLYFTKVG 88
++E A E E LS LK D RHGR++ S ID + G P GLY+TK+
Sbjct: 29 LKLERVIPANHEME--LSQLKARDEARHGRLLQSLGGVIDFPVDGTFDPFVVGLYYTKLR 86
Query: 89 LGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFC 148
LGTP ++YVQVDTGSD+LWV+CA C+ CP S L I+L FDP S T+ I+CSD C
Sbjct: 87 LGTPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTASPISCSDQRC 146
Query: 149 RTTYNNRYPSCS-PGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFG 207
+ CS C Y YGDGS TSG++V D++Q + G+ + V+FG
Sbjct: 147 SWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVPNSTAPVVFG 206
Query: 208 CGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVK-GGGIFAI 266
C Q+GDL S D AVDGI GFGQ S++SQLA+ G + F+HCL GGGI +
Sbjct: 207 CSTSQTGDLVKS-DRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLKGENGGGGILVL 265
Query: 267 GDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYL 326
G++V P + TP+VP+ PHYNV L + V G L + S+ T + +GTIID+GTTLAYL
Sbjct: 266 GEIVEPNMVFTPLVPSQPHYNVNLLSISVNGQALPINPSVFSTSNGQGTIIDTGTTLAYL 325
Query: 327 PPMLY 331
Y
Sbjct: 326 SEAAY 330
>gi|297812425|ref|XP_002874096.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297319933|gb|EFH50355.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 493
Score = 250 bits (639), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 136/305 (44%), Positives = 180/305 (59%), Gaps = 9/305 (2%)
Query: 33 FEVENKFKAGGERERTLSALKQHDTRRHGRMMAS----IDLELGGNGHPSATGLYFTKVG 88
++E A E E LS LK D RHGR++ S ID + G P GLY+TK+
Sbjct: 29 LKLERGIPANHEME--LSQLKARDKARHGRLLQSLGGVIDFPVDGTFDPFVVGLYYTKIR 86
Query: 89 LGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFC 148
LG+P ++YVQVDTGSD+LWV+CA C+ CP S L I+L FDP S T+ ++CSD C
Sbjct: 87 LGSPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTATPVSCSDQRC 146
Query: 149 RTTYNNRYPSCS-PGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFG 207
+ CS C Y YGDGS TSG++V D++Q + G+ + V+FG
Sbjct: 147 SWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVPNSTAPVVFG 206
Query: 208 CGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVK-GGGIFAI 266
C Q+GDL S D AVDGI GFGQ S++SQLA+ G + F+HCL GGGI +
Sbjct: 207 CSTSQTGDLVKS-DRAVDGIFGFGQQGMSVISQLASQGLAPRVFSHCLKGENGGGGILVL 265
Query: 267 GDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYL 326
G++V P + TP+VP+ PHYNV L + V G L + S+ T + +GTIID+GTTLAYL
Sbjct: 266 GEIVEPNMVFTPLVPSQPHYNVNLLSISVNGQALPINPSVFSTSNGQGTIIDTGTTLAYL 325
Query: 327 PPMLY 331
Y
Sbjct: 326 SEAAY 330
>gi|356505293|ref|XP_003521426.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 499
Score = 249 bits (636), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 132/319 (41%), Positives = 189/319 (59%), Gaps = 12/319 (3%)
Query: 22 VGGGGVMGNFVFEVENKFKAGGERERTLSALKQHDTRRHGRMMASI-----DLELGGNGH 76
V GG+ G F+ +E + E L AL+ D RHGR++ + D + G
Sbjct: 20 VSCGGLAGTFL-PLERAIPLNQQVE--LEALRARDRARHGRILQGVVGGVVDFSVQGTSD 76
Query: 77 PSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSS 136
P GLYFTKV LG+P ++YVQ+DTGSD+LW+NC CS CP S LGI+L FD + SS
Sbjct: 77 PYFVGLYFTKVKLGSPAKDFYVQIDTGSDILWINCITCSNCPHSSGLGIELDFFDTAGSS 136
Query: 137 TSGEIACSDNFCRTTYNNRYPSCSPGV-RCEYVVTYGDGSSTSGYFVRDIIQLNQA-SGN 194
T+ ++C+D C CS +C Y YGDGS T+GY+V D + + G
Sbjct: 137 TAALVSCADPICSYAVQTATSGCSSQANQCSYTFQYGDGSGTTGYYVSDTMYFDTVLLGQ 196
Query: 195 LKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHC 254
A +S+++FGC QSGDL + TD AVDGI GFG S++SQL++ G K F+HC
Sbjct: 197 SMVANSSSTIVFGCSTYQSGDL-TKTDKAVDGIFGFGPGALSVISQLSSRGVTPKVFSHC 255
Query: 255 LD-VVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDER 313
L GGG+ +G+++ P + +P+VP++PHYN+ L+ + V G L + +++ T + +
Sbjct: 256 LKGGENGGGVLVLGEILEPSIVYSPLVPSLPHYNLNLQSIAVNGQLLPIDSNVFATTNNQ 315
Query: 314 GTIIDSGTTLAYLPPMLYD 332
GTI+DSGTTLAYL Y+
Sbjct: 316 GTIVDSGTTLAYLVQEAYN 334
>gi|357510893|ref|XP_003625735.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355500750|gb|AES81953.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 535
Score = 249 bits (635), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 140/335 (41%), Positives = 193/335 (57%), Gaps = 18/335 (5%)
Query: 10 VVVTVAVVHQWAVGGGGVMGNFVFEVENKFKAGGERERTLSALKQHDTRRHG-RMMAS-- 66
+ VTV VV+ GG G+++ +E + E L+ LK D RHG R++
Sbjct: 1 MAVTVTVVY------GGFPGSYL-SLERTIPLNHQVE--LTTLKARDRARHGGRILQDGG 51
Query: 67 ---IDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDL 123
+D + G P GLYFTKV +G+P E+YVQ+DTGSD+LW+NC C+ CP S L
Sbjct: 52 GGILDFSVQGTSDPYLVGLYFTKVKMGSPAKEFYVQIDTGSDILWLNCNTCNNCPKSSGL 111
Query: 124 GIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGV-RCEYVVTYGDGSSTSGYFV 182
GI L FD + SST+ ++CSD C CS +C Y YGDGS TSGY+V
Sbjct: 112 GIDLNYFDTASSSTAALVSCSDPVCSYAVQTATSQCSSQANQCSYTFQYGDGSGTSGYYV 171
Query: 183 RDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLA 242
D + + G + +S+V+FGC QSGDL + T+ AVDGI GFG S++SQ++
Sbjct: 172 YDAMYFDVIMGQSVFSNSSSTVVFGCSTYQSGDL-ARTEKAVDGIFGFGPGALSVVSQVS 230
Query: 243 AAGNVRKEFAHCLD-VVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLD 301
+ G K F+HCL GGGI +G+++ P + TP+VP PHYN+ L+ + V G L
Sbjct: 231 SQGMAPKVFSHCLKGQGSGGGILVLGEILEPNIVYTPLVPLQPHYNLNLQSIAVNGQILP 290
Query: 302 LPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLS 336
+ + TG+ RGTI+DSGTTLAYL YD L+
Sbjct: 291 IDQDVFATGNNRGTIVDSGTTLAYLVQEAYDPFLN 325
>gi|356570798|ref|XP_003553571.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 500
Score = 249 bits (635), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 133/316 (42%), Positives = 186/316 (58%), Gaps = 12/316 (3%)
Query: 25 GGVMGNFVFEVENKFKAGGERERTLSALKQHDTRRHGRMMASI-----DLELGGNGHPSA 79
GG+ G F+ +E + E L AL+ D RHGR++ + D + G P
Sbjct: 23 GGLAGTFL-PLERAIPLNQQVE--LEALRARDRARHGRILQGVVGGVVDFSVQGTSDPYF 79
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSG 139
GLYFTKV LG+P E+YVQ+DTGSD+LW+NC CS CP S LGI+L FD + SST+
Sbjct: 80 VGLYFTKVKLGSPAKEFYVQIDTGSDILWINCITCSNCPHSSGLGIELDFFDTAGSSTAA 139
Query: 140 EIACSDNFCRTTYNNRYPSCSPGV-RCEYVVTYGDGSSTSGYFVRDIIQLNQA-SGNLKT 197
++C D C CS +C Y YGDGS T+GY+V D + + G
Sbjct: 140 LVSCGDPICSYAVQTATSECSSQANQCSYTFQYGDGSGTTGYYVSDTMYFDTVLLGQSVV 199
Query: 198 APLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLD- 256
A +S++IFGC QSGDL + TD AVDGI GFG S++SQL++ G K F+HCL
Sbjct: 200 ANSSSTIIFGCSTYQSGDL-TKTDKAVDGIFGFGPGALSVISQLSSRGVTPKVFSHCLKG 258
Query: 257 VVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTI 316
GGG+ +G+++ P + +P+VP+ PHYN+ L+ + V G L + +++ T + +GTI
Sbjct: 259 GENGGGVLVLGEILEPSIVYSPLVPSQPHYNLNLQSIAVNGQLLPIDSNVFATTNNQGTI 318
Query: 317 IDSGTTLAYLPPMLYD 332
+DSGTTLAYL Y+
Sbjct: 319 VDSGTTLAYLVQEAYN 334
>gi|224104765|ref|XP_002313558.1| predicted protein [Populus trichocarpa]
gi|222849966|gb|EEE87513.1| predicted protein [Populus trichocarpa]
Length = 468
Score = 248 bits (634), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 131/295 (44%), Positives = 178/295 (60%), Gaps = 8/295 (2%)
Query: 49 LSALKQHDTRRHGRMMAS-----IDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTG 103
LS LK+ D RHGRM+ S +D + G P GLY+T++ LGTP ++YVQ+DTG
Sbjct: 13 LSKLKERDRVRHGRMLQSSGVGVVDFPVQGTFDPFLVGLYYTRLQLGTPPRDFYVQIDTG 72
Query: 104 SDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGV 163
SD+LWV+C C+ CP S L I L FDP S T+ I+CSD C + CS
Sbjct: 73 SDVLWVSCGSCNGCPVNSGLHIPLNFFDPGSSPTASLISCSDQRCSLGLQSSDSVCSAQN 132
Query: 164 R-CEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDA 222
C Y YGDGS TSGY+V D++ + G ++ ++FGC Q+GDL + +D
Sbjct: 133 NLCGYNFQYGDGSGTSGYYVSDLLHFDTVLGGSVMNNSSAPIVFGCSALQTGDL-TKSDR 191
Query: 223 AVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLD-VVKGGGIFAIGDVVSPKVKTTPMVP 281
AVDGI GFGQ + S++SQLA+ G + F+HCL GGGI +G++V P + TP+VP
Sbjct: 192 AVDGIFGFGQQDMSVVSQLASQGISPRAFSHCLKGDDSGGGILVLGEIVEPNIVYTPLVP 251
Query: 282 NMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLS 336
+ PHYN+ ++ + V G L + S+ GT +GTIIDSGTTLAYL YD +S
Sbjct: 252 SQPHYNLNMQSISVNGQTLAIDPSVFGTSSSQGTIIDSGTTLAYLAEAAYDPFIS 306
>gi|255565531|ref|XP_002523756.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223537060|gb|EEF38696.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 507
Score = 248 bits (634), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 133/307 (43%), Positives = 185/307 (60%), Gaps = 14/307 (4%)
Query: 43 GERERTLSALKQHDTRRHGRMMAS------IDLELGGNGHPSATGLYFTKVGLGTPTDEY 96
+ LS LK+ D+ RH R++ S +D + G +P GLYFT+V LG+P ++
Sbjct: 38 ASHKLELSQLKERDSFRHRRILQSTTSGGVVDFPVQGTFNPFLVGLYFTRVQLGSPPKDF 97
Query: 97 YVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRY 156
YVQ+DTGSD+LWV+C+ C+ CP S L I LT FDP S+T+ ++CSD C +
Sbjct: 98 YVQIDTGSDVLWVSCSSCNGCPVTSGLQIPLTFFDPGSSTTAALVSCSDQRCTAGIQSSD 157
Query: 157 PSCSPGV-RCEYVVTYGDGSSTSGYFVRDIIQLNQ---ASGNLKT--APLNSSVIFGCGN 210
CS +C Y YGDGS TSGY+V D++ L+ +SG L +SSV F C
Sbjct: 158 SLCSSRTNQCGYTFQYGDGSGTSGYYVADLMHLDTLLLSSGELSQICQTYDSSVSFMCST 217
Query: 211 RQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLD-VVKGGGIFAIGDV 269
Q+GDL + +D AVDGI GFGQ S++SQLA+ G + F+HCL GGG+ +G++
Sbjct: 218 LQTGDL-TKSDRAVDGIFGFGQQEMSVISQLASQGITPRVFSHCLKGDDSGGGVLVLGEI 276
Query: 270 VSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPM 329
V P + TP+VP+ PHYN+ L+ + V G L + S+ G +GTI+DSGTTLAYL
Sbjct: 277 VEPNIVYTPLVPSQPHYNLYLQSISVAGQTLAIDPSVFGASSNQGTIVDSGTTLAYLAEG 336
Query: 330 LYDLVLS 336
YD +S
Sbjct: 337 AYDPFVS 343
>gi|255634819|gb|ACU17770.1| unknown [Glycine max]
Length = 354
Score = 248 bits (633), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 128/272 (47%), Positives = 168/272 (61%), Gaps = 3/272 (1%)
Query: 67 IDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIK 126
+D + G P GLY+TKV LGTP E+ VQ+DTGSD+LWV+C CS CP S L I+
Sbjct: 9 VDFSVQGTFDPFQVGLYYTKVQLGTPPVEFNVQIDTGSDVLWVSCNSCSGCPQTSGLQIQ 68
Query: 127 LTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSP-GVRCEYVVTYGDGSSTSGYFVRDI 185
L FDP SSTS IACSD C + +CS +C Y YGDGS TSGY+V D+
Sbjct: 69 LNFFDPGSSSTSSMIACSDQRCNNGIQSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSDM 128
Query: 186 IQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAG 245
+ LN T + V+FGC N+Q+GDL + +D AVDGI GFGQ S++SQL++ G
Sbjct: 129 MHLNTIFEGSVTTNSTAPVVFGCSNQQTGDL-TKSDRAVDGIFGFGQQEMSVISQLSSQG 187
Query: 246 NVRKEFAHCLD-VVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPT 304
+ F+HCL GGGI +G++V P + T +VP PHYN+ L+ + V G L + +
Sbjct: 188 IAPRVFSHCLKGDSSGGGILVLGEIVEPNIVYTSLVPAQPHYNLNLQSIAVNGQTLQIDS 247
Query: 305 SLLGTGDERGTIIDSGTTLAYLPPMLYDLVLS 336
S+ T + RGTI+DSGTTLAYL YD +S
Sbjct: 248 SVFATSNSRGTIVDSGTTLAYLAEEAYDPFVS 279
>gi|255556768|ref|XP_002519417.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223541280|gb|EEF42831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 494
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 128/304 (42%), Positives = 177/304 (58%), Gaps = 8/304 (2%)
Query: 49 LSALKQHDTRRHGRMMAS-----IDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTG 103
L+ L+ D RH R++ +D + G+ P GLYFT+V LGTP E+ VQ+DTG
Sbjct: 42 LAQLRARDHLRHARLLQGFVGGVVDFSVQGSSDPYLVGLYFTRVKLGTPPREFNVQIDTG 101
Query: 104 SDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGV 163
SD+LWV C+ CS CP S LGI+L FD + SST+ + CS C + C P
Sbjct: 102 SDVLWVTCSSCSNCPQTSGLGIQLNYFDTTSSSTARLVPCSHPICTSQIQTTATQCPPQS 161
Query: 164 -RCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDA 222
+C Y YGDGS TSGY+V D + G A +++++FGC QSGDL + TD
Sbjct: 162 NQCSYAFQYGDGSGTSGYYVSDTFYFDAVLGESLIANSSAAIVFGCSTYQSGDL-TKTDK 220
Query: 223 AVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLD-VVKGGGIFAIGDVVSPKVKTTPMVP 281
AVDGI GFGQ S++SQL++ G + F+HCL GGGI +G+++ P + +P+VP
Sbjct: 221 AVDGIFGFGQGELSVISQLSSHGITPRVFSHCLKGEDSGGGILVLGEILEPGIVYSPLVP 280
Query: 282 NMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFRFW 341
+ PHYN+ L+ + V G L + + T RGTIID+GTTLAYL YD +S
Sbjct: 281 SQPHYNLDLQSIAVSGQLLPIDPAAFATSSNRGTIIDTGTTLAYLVEEAYDPFVSAITAA 340
Query: 342 IASL 345
++ L
Sbjct: 341 VSQL 344
>gi|449456068|ref|XP_004145772.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449496218|ref|XP_004160076.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 500
Score = 246 bits (627), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 130/290 (44%), Positives = 180/290 (62%), Gaps = 7/290 (2%)
Query: 49 LSALKQHDTRRHGRMMAS----IDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGS 104
++ L+ D RHGRM+ S ID + G P GLY+T+V LG P ++YVQ+DTGS
Sbjct: 45 IAHLRSRDRVRHGRMLQSSGGVIDFSVSGTYDPFLVGLYYTRVQLGNPPKDFYVQIDTGS 104
Query: 105 DLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSC-SPGV 163
D+LWV+C C+ CP S L I L FDP S+T+ ++CSD C + +C
Sbjct: 105 DVLWVSCNSCNGCPATSGLQIPLNFFDPGSSTTASLVSCSDQICALGVQSSDSACFGQSN 164
Query: 164 RCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAA 223
+C YV YGDGS TSGY+V D+I L+ + T+ ++SV+FGC Q+GDL + +D A
Sbjct: 165 QCAYVFQYGDGSGTSGYYVMDMIHLDVVIDSSVTSNSSASVVFGCSTSQTGDL-TKSDRA 223
Query: 224 VDGILGFGQANSSLLSQLAAAGNVRKEFAHCLD-VVKGGGIFAIGDVVSPKVKTTPMVPN 282
VDGI GFGQ + S++SQL++ G K F+HCL GGGI +G++V P V TP+VP+
Sbjct: 224 VDGIFGFGQQDLSVISQLSSRGIAPKVFSHCLKGDDSGGGILVLGEIVEPNVVYTPLVPS 283
Query: 283 MPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYD 332
PHYN+ L+ + V G L + ++ T +GTIIDSGTTLAYL Y+
Sbjct: 284 QPHYNLNLQSISVNGQVLPISPAVFATSSSQGTIIDSGTTLAYLAEEAYN 333
>gi|194707632|gb|ACF87900.1| unknown [Zea mays]
Length = 423
Score = 244 bits (622), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 127/261 (48%), Positives = 165/261 (63%), Gaps = 6/261 (2%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGE 140
GLYFT+V LG P E++VQ+DTGSD+LWV C+ C+ CPT S L I+L F+P SST+
Sbjct: 3 GLYFTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASR 62
Query: 141 IACSDNFCRTTYNNRYPSC----SPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLK 196
I CSD+ C + C S C Y TYGDGS TSGY+V D + GN +
Sbjct: 63 ITCSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGNEQ 122
Query: 197 TAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLD 256
TA ++S++FGC N QSGDL + D AVDGI GFGQ S++SQL + G K F+HCL
Sbjct: 123 TANSSASIVFGCSNSQSGDL-TKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCLK 181
Query: 257 -VVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGT 315
GGGI +G++V P + TP+VP+ PHYN+ LE + V G L + +SL T + +GT
Sbjct: 182 GSDNGGGILVLGEIVEPGLVYTPLVPSQPHYNLNLESIAVNGQKLPIDSSLFTTSNTQGT 241
Query: 316 IIDSGTTLAYLPPMLYDLVLS 336
I+DSGTTLAYL YD +S
Sbjct: 242 IVDSGTTLAYLADGAYDPFVS 262
>gi|148907752|gb|ABR17002.1| unknown [Picea sitchensis]
Length = 454
Score = 244 bits (622), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 127/286 (44%), Positives = 168/286 (58%), Gaps = 3/286 (1%)
Query: 49 LSALKQHDTRRHGRMMASI-DLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLL 107
LK HD RHGR + +I D L G P GLY+T++ LGTP +YVQ+DTGSD+L
Sbjct: 6 FEMLKAHDRARHGRSLNTIVDFTLQGTADPYVAGLYYTRIELGTPPRPFYVQIDTGSDIL 65
Query: 108 WVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEY 167
WVNC C+ CP S LG+ L FDP SST+ ++C D+ C ++ C+ C Y
Sbjct: 66 WVNCKPCNACPLTSGLGVALNFFDPRGSSTASPLSCIDSKCVSSNQISESVCTTDRYCGY 125
Query: 168 VVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGI 227
YGDGS T GY+V D NQ T ++ + FGC QSGDL + D AVDGI
Sbjct: 126 SFEYGDGSGTLGYYVSDEFDYNQYVNQYVTNNASAKITFGCSYNQSGDL-TKPDRAVDGI 184
Query: 228 LGFGQANSSLLSQLAAAGNVRKEFAHCLD-VVKGGGIFAIGDVVSPKVKTTPMVPNMPHY 286
GFGQ + S++SQL + G K F+HCL+ GGGI +G++ P + TP+VP+ PHY
Sbjct: 185 FGFGQNDLSVVSQLNSQGLAPKIFSHCLEGADPGGGILVLGEITEPGMVYTPIVPSQPHY 244
Query: 287 NVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYD 332
N+ L+ + V G L + + T + RGTIID GTTLAYL Y+
Sbjct: 245 NLNLQGIAVNGQQLSIDPQVFATTNTRGTIIDCGTTLAYLAEEAYE 290
>gi|357440289|ref|XP_003590422.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355479470|gb|AES60673.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 498
Score = 243 bits (621), Expect = 8e-62, Method: Compositional matrix adjust.
Identities = 131/295 (44%), Positives = 177/295 (60%), Gaps = 12/295 (4%)
Query: 49 LSALKQHDTRRHGRMMAS-----IDLELGGNGHPSATG--LYFTKVGLGTPTDEYYVQVD 101
+ L+ D RHGR++ + +D + G+ PS G LY TKV +GTP E+ VQ+D
Sbjct: 43 IDTLRARDRVRHGRILRASVGGVVDFRVQGSSDPSTLGYGLYTTKVKMGTPPREFTVQID 102
Query: 102 TGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSP 161
TGSD+LW+NC CS CP S LGI+L FD SST+ + CSD C + CSP
Sbjct: 103 TGSDILWINCNTCSNCPKSSGLGIELNFFDTVGSSTAALVPCSDPMCASAIQGAAAQCSP 162
Query: 162 GV-RCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSS--VIFGCGNRQSGDLGS 218
V +C Y Y DGS TSG +V D + + G A + SS ++FGC QSGDL +
Sbjct: 163 QVNQCSYTFQYEDGSGTSGVYVSDAMYFDMILGQSTPANVASSATIVFGCSTYQSGDL-T 221
Query: 219 STDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLD-VVKGGGIFAIGDVVSPKVKTT 277
TD AVDGILGFG S++SQL++ G K F+HCL GGGI +G+++ P + +
Sbjct: 222 KTDKAVDGILGFGPGELSVVSQLSSRGITPKVFSHCLKGDGNGGGILVLGEILEPSIVYS 281
Query: 278 PMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYD 332
P+VP+ PHYN+ L+ + V G L + ++ T D+RGTIIDSGTTL+YL YD
Sbjct: 282 PLVPSQPHYNLNLQSIAVNGQVLSINPAVFATSDKRGTIIDSGTTLSYLVQEAYD 336
>gi|222619345|gb|EEE55477.1| hypothetical protein OsJ_03658 [Oryza sativa Japonica Group]
Length = 530
Score = 242 bits (618), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 126/253 (49%), Positives = 164/253 (64%), Gaps = 4/253 (1%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
YFT+V LG+P EY+VQ+DTGSD+LWV C+ C+ CP+ S L I+L F+P SSTS +I
Sbjct: 117 YFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKIP 176
Query: 143 CSDNFCRTTYNNRYPSC--SPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPL 200
CSD+ C C S C Y TYGDGS TSGY+V D + + GN +TA
Sbjct: 177 CSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQTANS 236
Query: 201 NSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLD-VVK 259
++S++FGC N QSGDL + TD AVDGI GFGQ S++SQL + G K F+HCL
Sbjct: 237 SASIVFGCSNSQSGDL-TKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLKGSDN 295
Query: 260 GGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDS 319
GGGI +G++V P + TP+VP+ PHYN+ LE + V G L + +SL T + +GTI+DS
Sbjct: 296 GGGILVLGEIVEPGLVYTPLVPSQPHYNLNLESIVVNGQKLPIDSSLFTTSNTQGTIVDS 355
Query: 320 GTTLAYLPPMLYD 332
GTTLAYL YD
Sbjct: 356 GTTLAYLADGAYD 368
>gi|224068901|ref|XP_002326227.1| predicted protein [Populus trichocarpa]
gi|222833420|gb|EEE71897.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 242 bits (618), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 127/295 (43%), Positives = 173/295 (58%), Gaps = 8/295 (2%)
Query: 49 LSALKQHDTRRHGRMMAS-----IDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTG 103
L L+ D RH R++ +D + G+ P GLYFTKV LG+P E+ VQ+DTG
Sbjct: 27 LHQLRARDRLRHARLLQGFVGGVVDFSVQGSSDPYLVGLYFTKVKLGSPPREFNVQIDTG 86
Query: 104 SDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGV 163
SD+LWV C C+ CP S LGI+L FD S SST+G++ CSD C + CS
Sbjct: 87 SDVLWVCCNSCNNCPRTSGLGIQLNFFDSSSSSTAGQVRCSDPICTSAVQTTATQCSSQT 146
Query: 164 -RCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDA 222
+C Y YGDGS TSGY+V D + + G ++ ++FGC QSGDL + TD
Sbjct: 147 DQCSYTFQYGDGSGTSGYYVSDTLYFDAILGQSLIDNSSALIVFGCSAYQSGDL-TKTDK 205
Query: 223 AVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLD-VVKGGGIFAIGDVVSPKVKTTPMVP 281
AVDGI GFGQ S++SQL+ G + F+HCL GGGI +G+++ P + +P+VP
Sbjct: 206 AVDGIFGFGQGELSVISQLSTRGITPRVFSHCLKGDGSGGGILVLGEILEPGIVYSPLVP 265
Query: 282 NMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLS 336
+ PHYN+ L + V G L + + T + +GTI+DSGTTLAYL YD +S
Sbjct: 266 SQPHYNLNLLSIAVNGQLLPIDPAAFATSNSQGTIVDSGTTLAYLVAEAYDPFVS 320
>gi|356539352|ref|XP_003538162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 489
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 131/290 (45%), Positives = 178/290 (61%), Gaps = 7/290 (2%)
Query: 49 LSALKQHDTRRHGRMMAS----IDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGS 104
LS L+ D+ RH RM+ S +D + G PS GLY+TKV LGTP E YVQ+DTGS
Sbjct: 39 LSELRARDSLRHRRMLQSTNYVVDFPVKGTFDPSQVGLYYTKVKLGTPPRELYVQIDTGS 98
Query: 105 DLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCS-PGV 163
D+LWV+C C+ CP S L I+L FDP SSTS I+C D CR+ SCS
Sbjct: 99 DVLWVSCGSCNGCPQTSGLQIQLNYFDPGSSSTSSLISCLDRRCRSGVQTSDASCSGRNN 158
Query: 164 RCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAA 223
+C Y YGDGS TSGY+V D++ T ++SV+FGC Q+GDL + ++ A
Sbjct: 159 QCTYTFQYGDGSGTSGYYVSDLMHFASIFEGTLTTNSSASVVFGCSILQTGDL-TKSERA 217
Query: 224 VDGILGFGQANSSLLSQLAAAGNVRKEFAHCLD-VVKGGGIFAIGDVVSPKVKTTPMVPN 282
VDGI GFGQ S++SQL++ G + F+HCL GGG+ +G++V P + +P+VP+
Sbjct: 218 VDGIFGFGQQGMSVISQLSSQGIAPRVFSHCLKGDNSGGGVLVLGEIVEPNIVYSPLVPS 277
Query: 283 MPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYD 332
PHYN+ L+ + V G + + S+ T + RGTI+DSGTTLAYL Y+
Sbjct: 278 QPHYNLNLQSISVNGQIVRIAPSVFATSNNRGTIVDSGTTLAYLAEEAYN 327
>gi|224058947|ref|XP_002299658.1| predicted protein [Populus trichocarpa]
gi|222846916|gb|EEE84463.1| predicted protein [Populus trichocarpa]
Length = 451
Score = 239 bits (611), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 133/342 (38%), Positives = 189/342 (55%), Gaps = 21/342 (6%)
Query: 10 VVVTVAVVHQWAVGGGGVMGNFVFEVENKFKAGGERERTLSALKQHDTRRHGRMMAS--- 66
+++ V V H V + +F + + + LS LK+ D RH RM+ S
Sbjct: 9 ILIAVVVFHATVV-----LSSFPATLHLERGVPASHKLKLSQLKERDRVRHSRMLQSSGG 63
Query: 67 --IDLELGGNGHPSATGLYF--------TKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSR 116
+D + G P G YF T++ LG+P ++YVQ+DTGSD+LWV+C+ C+
Sbjct: 64 GVVDFPVQGTFDPFLVGFYFGSFCRLYYTRLQLGSPPRDFYVQIDTGSDVLWVSCSSCNG 123
Query: 117 CPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSP-GVRCEYVVTYGDGS 175
CP S L I L FDP S T+ I+CSD C + C+ +C Y YGDGS
Sbjct: 124 CPVSSGLHIPLNFFDPGSSPTASLISCSDQRCSLGLQSSDSVCAAQNNQCGYTFQYGDGS 183
Query: 176 STSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANS 235
TSGY+V D++ + G ++ ++FGC Q+GDL + D AVDGI GFGQ +
Sbjct: 184 GTSGYYVSDLLHFDTILGGSVMKNSSAPIVFGCSTLQTGDL-TKPDRAVDGIFGFGQQDM 242
Query: 236 SLLSQLAAAGNVRKEFAHCLD-VVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVE 294
S++SQLA+ G + F+HCL GGGI +G++V P + TP+VP+ PHYN+ L+ +
Sbjct: 243 SVISQLASQGITPRVFSHCLKGDDSGGGILVLGEIVEPNIVYTPLVPSQPHYNLNLQSIY 302
Query: 295 VGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLS 336
V G L + S+ T +GTIIDSGTTLAYL YD +S
Sbjct: 303 VNGQTLAIDPSVFATSSNQGTIIDSGTTLAYLTEAAYDPFIS 344
>gi|42571079|ref|NP_973613.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|110737616|dbj|BAF00749.1| putative protease [Arabidopsis thaliana]
gi|330254187|gb|AEC09281.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 507
Score = 239 bits (611), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 125/299 (41%), Positives = 173/299 (57%), Gaps = 12/299 (4%)
Query: 49 LSALKQHDTRRHGRMM----------ASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYV 98
LS L+ D RH R++ +D + G+ P GLYFTKV LG+P E+ V
Sbjct: 56 LSELRARDRVRHARILLGGGRQSSVGGVVDFPVQGSSDPYLVGLYFTKVKLGSPPTEFNV 115
Query: 99 QVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPS 158
Q+DTGSD+LWV C+ CS CP S LGI L FD S T+G + CSD C + +
Sbjct: 116 QIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSVTCSDPICSSVFQTTAAQ 175
Query: 159 CSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGS 218
CS +C Y YGDGS TSGY++ D + G A ++ ++FGC QSGDL +
Sbjct: 176 CSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANSSAPIVFGCSTYQSGDL-T 234
Query: 219 STDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLD-VVKGGGIFAIGDVVSPKVKTT 277
+D AVDGI GFG+ S++SQL++ G F+HCL GGG+F +G+++ P + +
Sbjct: 235 KSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGDGSGGGVFVLGEILVPGMVYS 294
Query: 278 PMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLS 336
P+VP+ PHYN+ L + V G L L ++ + RGTI+D+GTTL YL YDL L+
Sbjct: 295 PLVPSQPHYNLNLLSIGVNGQMLPLDAAVFEASNTRGTIVDTGTTLTYLVKEAYDLFLN 353
>gi|4415912|gb|AAD20143.1| putative protease [Arabidopsis thaliana]
Length = 469
Score = 239 bits (610), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 126/308 (40%), Positives = 176/308 (57%), Gaps = 12/308 (3%)
Query: 49 LSALKQHDTRRHGRMM----------ASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYV 98
LS L+ D RH R++ +D + G+ P GLYFTKV LG+P E+ V
Sbjct: 56 LSELRARDRVRHARILLGGGRQSSVGGVVDFPVQGSSDPYLVGLYFTKVKLGSPPTEFNV 115
Query: 99 QVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPS 158
Q+DTGSD+LWV C+ CS CP S LGI L FD S T+G + CSD C + +
Sbjct: 116 QIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSVTCSDPICSSVFQTTAAQ 175
Query: 159 CSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGS 218
CS +C Y YGDGS TSGY++ D + G A ++ ++FGC QSGDL +
Sbjct: 176 CSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANSSAPIVFGCSTYQSGDL-T 234
Query: 219 STDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLD-VVKGGGIFAIGDVVSPKVKTT 277
+D AVDGI GFG+ S++SQL++ G F+HCL GGG+F +G+++ P + +
Sbjct: 235 KSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGDGSGGGVFVLGEILVPGMVYS 294
Query: 278 PMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQ 337
P+VP+ PHYN+ L + V G L L ++ + RGTI+D+GTTL YL YDL L+
Sbjct: 295 PLVPSQPHYNLNLLSIGVNGQMLPLDAAVFEASNTRGTIVDTGTTLTYLVKEAYDLFLNA 354
Query: 338 FRFWIASL 345
++ L
Sbjct: 355 ISNSVSQL 362
>gi|449440161|ref|XP_004137853.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
gi|449521209|ref|XP_004167622.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 492
Score = 238 bits (607), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 129/281 (45%), Positives = 174/281 (61%), Gaps = 4/281 (1%)
Query: 49 LSALKQHDTRRHGRMMASI-DLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLL 107
L L+ D RH R++ + D + G+ P GLYFTKV LGTP E+ VQ+DTGSD+L
Sbjct: 44 LETLRARDRLRHARILQGVVDFSVEGSSDPLLVGLYFTKVKLGTPPMEFTVQIDTGSDIL 103
Query: 108 WVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSC-SPGVRCE 166
WVNC C+ CP S LGI+L FD S SS+S ++CSD C + + C + +C
Sbjct: 104 WVNCNSCNGCPRSSGLGIQLNFFDASSSSSSSLVSCSDPICNSAFQTTATQCLTQSNQCS 163
Query: 167 YVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDG 226
Y YGDGS TSGY+V + + + G A ++SV+FGC QSGDL + +D A+DG
Sbjct: 164 YTFQYGDGSGTSGYYVSESMYFDMVMGQSMIANSSASVVFGCSTYQSGDL-TKSDHAIDG 222
Query: 227 ILGFGQANSSLLSQLAAAGNVRKEFAHCLD-VVKGGGIFAIGDVVSPKVKTTPMVPNMPH 285
I GFG + S++SQL+A G K F+HCL GGGI +G+V+ P + +P+VP+ PH
Sbjct: 223 IFGFGPGDLSVISQLSARGITPKVFSHCLKGEGNGGGILVLGEVLEPGIVYSPLVPSQPH 282
Query: 286 YNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYL 326
YN+ L+ + V G L + S+ T RGTIIDSGTTLAYL
Sbjct: 283 YNLYLQSISVNGQTLPIDPSVFATSINRGTIIDSGTTLAYL 323
>gi|302776054|ref|XP_002971323.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
gi|300161305|gb|EFJ27921.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
Length = 395
Score = 238 bits (606), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 136/287 (47%), Positives = 178/287 (62%), Gaps = 5/287 (1%)
Query: 56 DTRRHGRMMAS-IDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGC 114
D R GR +A +D LGG P + GLYFT+VGLG P Y VQVDTGSD+LWVNC C
Sbjct: 1 DRGRRGRFLAEGVDFSLGGTADPLSGGLYFTQVGLGNPVKHYIVQVDTGSDVLWVNCRPC 60
Query: 115 SRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGV-RCEYVVTYGD 173
S CP KS L I LT++DP +SST+ ++CSD C CS CEY+ +YGD
Sbjct: 61 SGCPRKSALNIPLTMYDPRESSTTSLVSCSDPLCVRGRRFAEAQCSQTTNNCEYIFSYGD 120
Query: 174 GSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQA 233
GS++ GY+VRD +Q N S N A S V+FGC RQ+GDL S++ AVDGI+GFGQ
Sbjct: 121 GSTSEGYYVRDAMQYNVISSN-GLANTTSQVLFGCSIRQTGDL-STSQQAVDGIIGFGQL 178
Query: 234 NSSLLSQLAAAGNVRKEFAHCLDVVK-GGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEE 292
S+ +QLAA N+ + F+HCL+ K GGGI IG + P + TP+VP+ HYNV+L
Sbjct: 179 ELSVPNQLAAQQNIPRVFSHCLEGEKRGGGILVIGGIAEPGMTYTPLVPDSVHYNVVLRG 238
Query: 293 VEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
+ V N L + + ++ G I+DSGTTLAY P Y++ + R
Sbjct: 239 ISVNSNRLPIDAEDFSSTNDTGVIMDSGTTLAYFPSGAYNVFVQAIR 285
>gi|356542694|ref|XP_003539801.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 489
Score = 235 bits (600), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 138/318 (43%), Positives = 188/318 (59%), Gaps = 13/318 (4%)
Query: 21 AVGGGGVMGNFVFEVENKFKAGGERERTLSALKQHDTRRHGRMMAS----IDLELGGNGH 76
AVGG V +E F + E LS L+ D+ RH RM+ S +D + G
Sbjct: 17 AVGGSPV----TLTLERAFPSNDGVE--LSELRARDSLRHRRMLQSTNYVVDFPVKGTFD 70
Query: 77 PSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSS 136
PS GLY+TKV LGTP E+YVQ+DTGSD+LWV+C C+ CP S L I+L FDP SS
Sbjct: 71 PSQVGLYYTKVKLGTPPREFYVQIDTGSDVLWVSCGSCNGCPQTSGLQIQLNYFDPRSSS 130
Query: 137 TSGEIACSDNFCRTTYNNRYPSC-SPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNL 195
TS I+CSD CR+ SC S +C Y YGDGS TSGY+V D++
Sbjct: 131 TSSLISCSDRRCRSGVQTSDASCSSQNNQCTYTFQYGDGSGTSGYYVSDLMHFAGIFEGT 190
Query: 196 KTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL 255
T ++SV+FGC Q+GDL + ++ AVDGI GFGQ S++SQL+ G + F+HCL
Sbjct: 191 LTTNSSASVVFGCSILQTGDL-TKSERAVDGIFGFGQQGMSVISQLSLQGIAPRVFSHCL 249
Query: 256 D-VVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERG 314
GGG+ +G++V P + +P+V + PHYN+ L+ + V G + + ++ T + RG
Sbjct: 250 KGDNSGGGVLVLGEIVEPNIVYSPLVQSQPHYNLNLQSISVNGQIVPIAPAVFATSNNRG 309
Query: 315 TIIDSGTTLAYLPPMLYD 332
TI+DSGTTLAYL Y+
Sbjct: 310 TIVDSGTTLAYLAEEAYN 327
>gi|359482287|ref|XP_002263129.2| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
vinifera]
gi|297740017|emb|CBI30199.3| unnamed protein product [Vitis vinifera]
Length = 502
Score = 235 bits (599), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 136/348 (39%), Positives = 200/348 (57%), Gaps = 11/348 (3%)
Query: 4 LRLLALVVVTVAVVHQWAVGGGGVMGNFVFEVENKFKAGGERERTLSALKQHDTRRHGRM 63
+ +LAL++ A++ AV G + + +E F E L L+ D RHGR+
Sbjct: 5 ISILALILAFAAILLTAAVVHCGSPASLL-TLERAFPVNQRVE--LEVLRARDQARHGRL 61
Query: 64 M-----ASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCP 118
+ +D + G P GLYFTKV LG+P E+ VQ+DTGSD+LWV C C+ CP
Sbjct: 62 LRGVVGGVVDFTVYGTSDPYLVGLYFTKVKLGSPPREFNVQIDTGSDILWVTCNSCNDCP 121
Query: 119 TKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGV-RCEYVVTYGDGSST 177
S LGI+L+ FDPS SST+ ++CS C + CSP +C Y YGDGS T
Sbjct: 122 RTSGLGIELSFFDPSSSSTTSLVSCSHPICTSLVQTTAAECSPQSNQCSYSFHYGDGSGT 181
Query: 178 SGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSL 237
+GY+V D++ + G+ A ++S++FGC QSGDL + D A+DGI GFGQ + S+
Sbjct: 182 TGYYVSDMLYFDTVLGDSLIANSSASIVFGCSTYQSGDL-TKVDKAIDGIFGFGQQDLSV 240
Query: 238 LSQLAAAGNVRKEFAHCLD-VVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVG 296
+SQL++ G K F+HCL GGG +G+++ P + +P+VP+ HYN+ L+ + V
Sbjct: 241 VSQLSSLGITPKVFSHCLKGEGDGGGKLVLGEILEPNIIYSPLVPSQSHYNLNLQSISVN 300
Query: 297 GNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFRFWIAS 344
G L + ++ T + +GTI+DSGTTL YL YD +S ++S
Sbjct: 301 GQLLPIDPAVFATSNNQGTIVDSGTTLTYLVETAYDPFVSAITATVSS 348
>gi|297827153|ref|XP_002881459.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297327298|gb|EFH57718.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 507
Score = 235 bits (599), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 122/299 (40%), Positives = 172/299 (57%), Gaps = 12/299 (4%)
Query: 49 LSALKQHDTRRHGRMM----------ASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYV 98
LS L+ D RH R++ +D + G+ P GLYFTKV LG+P E+ V
Sbjct: 56 LSELRARDRVRHARILLGGGRQSSVGGVVDFPVQGSSDPYLVGLYFTKVKLGSPPTEFNV 115
Query: 99 QVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPS 158
Q+DTGSD+LWV C+ CS CP S LGI L FD S T+G + CSD C + +
Sbjct: 116 QIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSFTAGSVTCSDPICSSVFQTTAAQ 175
Query: 159 CSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGS 218
CS +C Y YGDGS TSGY++ D + G A ++ ++FGC QSGDL +
Sbjct: 176 CSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANSSAPIVFGCSTYQSGDL-T 234
Query: 219 STDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLD-VVKGGGIFAIGDVVSPKVKTT 277
+D AVDGI GFG+ S++SQL++ G F+HCL GGG+F +G+++ P + +
Sbjct: 235 KSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGDGSGGGVFVLGEILVPGMVYS 294
Query: 278 PMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLS 336
P++P+ PHYN+ L + V G L + ++ + RGTI+D+GTTL YL YD L+
Sbjct: 295 PLLPSQPHYNLNLLSIGVNGQILPIDAAVFEASNTRGTIVDTGTTLTYLVKEAYDPFLN 353
>gi|356546036|ref|XP_003541438.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 486
Score = 234 bits (597), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 134/315 (42%), Positives = 183/315 (58%), Gaps = 11/315 (3%)
Query: 26 GVMGNFVFEVENKFKAGGERERTLSALKQHDTRRHGRMM-----ASIDLELGGNGHPSAT 80
V G F+ +E G R ++ALK D RH RM+ +D + G P++
Sbjct: 18 AVHGVFL-PLERSIPPTGHRVE-VAALKARDRARHARMLRGVAGGVVDFSVQGTSDPNSV 75
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGE 140
GLY+TKV +GTP E+ VQ+DTGSD+LWVNC CS CP S LGI+L FD SST+
Sbjct: 76 GLYYTKVKMGTPPKEFNVQIDTGSDILWVNCNTCSNCPQSSQLGIELNFFDTVGSSTAAL 135
Query: 141 IACSDNFCRTTYNNRYPSCSPGV-RCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAP 199
I CSD C + CSP V +C Y YGDGS TSGY+V D + + G
Sbjct: 136 IPCSDPICTSRVQGAAAECSPRVNQCSYTFQYGDGSGTSGYYVSDAMYFSLIMGQPPAVN 195
Query: 200 LNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLD-VV 258
+++++FGC QSGDL + TD AVDGI GFG S++SQL++ G K F+HCL
Sbjct: 196 SSATIVFGCSISQSGDL-TKTDKAVDGIFGFGPGPLSVVSQLSSRGITPKVFSHCLKGDG 254
Query: 259 KGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDER-GTII 317
GGG+ +G+++ P + +P+VP+ PHYN+ L+ + V G L + ++ + R GTI+
Sbjct: 255 DGGGVLVLGEILEPSIVYSPLVPSQPHYNLNLQSIAVNGQLLPINPAVFSISNNRGGTIV 314
Query: 318 DSGTTLAYLPPMLYD 332
D GTTLAYL YD
Sbjct: 315 DCGTTLAYLIQEAYD 329
>gi|42569679|ref|NP_181205.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|330254186|gb|AEC09280.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 512
Score = 233 bits (595), Expect = 8e-59, Method: Compositional matrix adjust.
Identities = 125/304 (41%), Positives = 173/304 (56%), Gaps = 17/304 (5%)
Query: 49 LSALKQHDTRRHGRMM----------ASIDLELGGNGHPSATG-----LYFTKVGLGTPT 93
LS L+ D RH R++ +D + G+ P G LYFTKV LG+P
Sbjct: 56 LSELRARDRVRHARILLGGGRQSSVGGVVDFPVQGSSDPYLVGSKMTMLYFTKVKLGSPP 115
Query: 94 DEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYN 153
E+ VQ+DTGSD+LWV C+ CS CP S LGI L FD S T+G + CSD C + +
Sbjct: 116 TEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSVTCSDPICSSVFQ 175
Query: 154 NRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQS 213
CS +C Y YGDGS TSGY++ D + G A ++ ++FGC QS
Sbjct: 176 TTAAQCSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANSSAPIVFGCSTYQS 235
Query: 214 GDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLD-VVKGGGIFAIGDVVSP 272
GDL + +D AVDGI GFG+ S++SQL++ G F+HCL GGG+F +G+++ P
Sbjct: 236 GDL-TKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGDGSGGGVFVLGEILVP 294
Query: 273 KVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYD 332
+ +P+VP+ PHYN+ L + V G L L ++ + RGTI+D+GTTL YL YD
Sbjct: 295 GMVYSPLVPSQPHYNLNLLSIGVNGQMLPLDAAVFEASNTRGTIVDTGTTLTYLVKEAYD 354
Query: 333 LVLS 336
L L+
Sbjct: 355 LFLN 358
>gi|224140237|ref|XP_002323490.1| predicted protein [Populus trichocarpa]
gi|222868120|gb|EEF05251.1| predicted protein [Populus trichocarpa]
Length = 478
Score = 233 bits (593), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 129/295 (43%), Positives = 175/295 (59%), Gaps = 8/295 (2%)
Query: 49 LSALKQHDTRRHGRMMAS-----IDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTG 103
LS L+ D RH R++ +D + G+ P GLYFTKV LG+P E+ VQ+DTG
Sbjct: 27 LSQLRARDRLRHARLLQGFVGGVVDFSVQGSPDPYLVGLYFTKVKLGSPPREFNVQIDTG 86
Query: 104 SDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGV 163
SD+LWV C C+ CP S LGI+L FD S SST+G + CSD C + CSP
Sbjct: 87 SDVLWVCCNSCNNCPRTSGLGIQLNFFDSSSSSTAGLVHCSDPICTSAVQTTVTQCSPQT 146
Query: 164 -RCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDA 222
+C Y Y DGS TSGY+V D + + G ++ ++FGC QSGDL + TD
Sbjct: 147 NQCSYTFQYEDGSGTSGYYVSDTLYFDAILGESLVVNSSALIVFGCSTFQSGDL-TMTDK 205
Query: 223 AVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLD-VVKGGGIFAIGDVVSPKVKTTPMVP 281
AVDGI GFGQ S++SQL+ G + F+HCL GGGI +G+++ P + +P+VP
Sbjct: 206 AVDGIFGFGQGELSVISQLSTHGITPRVFSHCLKGEGIGGGILVLGEILEPGMVYSPLVP 265
Query: 282 NMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLS 336
+ PHYN+ L+ + V G L + S+ T + +GTI+DSGTTLAYL YD +S
Sbjct: 266 SQPHYNLNLQSIAVNGKLLPIDPSVFATSNSQGTIVDSGTTLAYLVAEAYDPFVS 320
>gi|225436397|ref|XP_002272121.1| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
vinifera]
Length = 499
Score = 232 bits (591), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 130/294 (44%), Positives = 177/294 (60%), Gaps = 8/294 (2%)
Query: 49 LSALKQHDTRRHGRMMAS----IDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGS 104
L LK D RHGR + S +D + G P GLYFT+V LG+P E+YVQ+DTGS
Sbjct: 45 LDELKARDRVRHGRFLQSSVGVVDFPVEGTYDPYRVGLYFTRVLLGSPPKEFYVQIDTGS 104
Query: 105 DLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSP-GV 163
D+LWV+C C+ CP S L I L FDP SST+ I+CSD C + CS G
Sbjct: 105 DVLWVSCGSCNGCPQSSGLHIPLNFFDPGSSSTASLISCSDQRCSLGVQSSDAGCSSQGN 164
Query: 164 RCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAA 223
+C Y YGDGS TSGY+V D++ + G+ T ++S++FGC Q+GDL + +D A
Sbjct: 165 QCIYTFQYGDGSGTSGYYVSDLLNFDAIVGSSVTNS-SASIVFGCSISQTGDL-TKSDRA 222
Query: 224 VDGILGFGQANSSLLSQLAAAGNVRKEFAHC-LDVVKGGGIFAIGDVVSPKVKTTPMVPN 282
VDGI GFGQ + S++SQ+++ G K F+HC GGGI +G++V + +P+VP+
Sbjct: 223 VDGIFGFGQQDMSVISQMSSQGITPKVFSHCLKGDGGGGGILVLGEIVEEDIVYSPLVPS 282
Query: 283 MPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLS 336
PHYN+ L+ + V G L + + T RGTI+DSGTTLAYL YD +S
Sbjct: 283 QPHYNLNLQSISVNGKSLAIDPEVFATSTNRGTIVDSGTTLAYLAEEAYDPFVS 336
>gi|297734873|emb|CBI17107.3| unnamed protein product [Vitis vinifera]
Length = 484
Score = 231 bits (590), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 130/294 (44%), Positives = 177/294 (60%), Gaps = 8/294 (2%)
Query: 49 LSALKQHDTRRHGRMMAS----IDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGS 104
L LK D RHGR + S +D + G P GLYFT+V LG+P E+YVQ+DTGS
Sbjct: 30 LDELKARDRVRHGRFLQSSVGVVDFPVEGTYDPYRVGLYFTRVLLGSPPKEFYVQIDTGS 89
Query: 105 DLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSP-GV 163
D+LWV+C C+ CP S L I L FDP SST+ I+CSD C + CS G
Sbjct: 90 DVLWVSCGSCNGCPQSSGLHIPLNFFDPGSSSTASLISCSDQRCSLGVQSSDAGCSSQGN 149
Query: 164 RCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAA 223
+C Y YGDGS TSGY+V D++ + G+ T ++S++FGC Q+GDL + +D A
Sbjct: 150 QCIYTFQYGDGSGTSGYYVSDLLNFDAIVGSSVTNS-SASIVFGCSISQTGDL-TKSDRA 207
Query: 224 VDGILGFGQANSSLLSQLAAAGNVRKEFAHC-LDVVKGGGIFAIGDVVSPKVKTTPMVPN 282
VDGI GFGQ + S++SQ+++ G K F+HC GGGI +G++V + +P+VP+
Sbjct: 208 VDGIFGFGQQDMSVISQMSSQGITPKVFSHCLKGDGGGGGILVLGEIVEEDIVYSPLVPS 267
Query: 283 MPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLS 336
PHYN+ L+ + V G L + + T RGTI+DSGTTLAYL YD +S
Sbjct: 268 QPHYNLNLQSISVNGKSLAIDPEVFATSTNRGTIVDSGTTLAYLAEEAYDPFVS 321
>gi|18390865|ref|NP_563808.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|11993877|gb|AAG42922.1|AF329505_1 unknown protein [Arabidopsis thaliana]
gi|20260142|gb|AAM12969.1| unknown protein [Arabidopsis thaliana]
gi|22136092|gb|AAM91124.1| unknown protein [Arabidopsis thaliana]
gi|332190140|gb|AEE28261.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 492
Score = 228 bits (581), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 133/297 (44%), Positives = 177/297 (59%), Gaps = 16/297 (5%)
Query: 45 RERTLSALKQHDTRRHGRMMAS-----IDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQ 99
E L+ L+ D+ RHGR++ S ++ + G P GLY+TKV LGTP E+ VQ
Sbjct: 41 HELGLTELRAFDSARHGRLLQSPVGGVVNFPVDGASDPFLVGLYYTKVKLGTPPREFNVQ 100
Query: 100 VDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSC 159
+DTGSD+LWV+C C+ CP S+L I+L+ FDP SS++ ++CSD C + + C
Sbjct: 101 IDTGSDVLWVSCTSCNGCPKTSELQIQLSFFDPGVSSSASLVSCSDRRCYSNFQTE-SGC 159
Query: 160 SPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSV--IFGCGNRQSGDLG 217
SP C Y YGDGS TSGY++ D + + + T +NSS +FGC N QSGDL
Sbjct: 160 SPNNLCSYSFKYGDGSGTSGYYISDFMSFDTVITS--TLAINSSAPFVFGCSNLQSGDL- 216
Query: 218 SSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVK-GGGIFAIGDVVSPKVKT 276
AVDGI G GQ + S++SQLA G + F+HCL K GGGI +G + P
Sbjct: 217 QRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCLKGDKSGGGIMVLGQIKRPDTVY 276
Query: 277 TPMVPNMPHYNVILEEVEVGGN--PLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLY 331
TP+VP+ PHYNV L+ + V G P+D + TGD GTIID+GTTLAYLP Y
Sbjct: 277 TPLVPSQPHYNVNLQSIAVNGQILPIDPSVFTIATGD--GTIIDTGTTLAYLPDEAY 331
>gi|168042409|ref|XP_001773681.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675069|gb|EDQ61569.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 399
Score = 228 bits (580), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 130/281 (46%), Positives = 170/281 (60%), Gaps = 9/281 (3%)
Query: 49 LSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLW 108
LK HD RR + A +D L G+ P TGLY+TK+ LGTP YYVQVDTGSD+ W
Sbjct: 6 FETLKAHDRRR---LAAVVDFPLTGDDDPFVTGLYYTKIYLGTPPVGYYVQVDTGSDVTW 62
Query: 109 VNCAGCSRCPTKSDL-GIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEY 167
+NCA C+ C T++ L IKLT +DPS+SST G ++C D+ C + SC+ C Y
Sbjct: 63 LNCAPCTSCVTETQLPSIKLTTYDPSRSSTDGALSCRDSNCGAALGSNEVSCTSAGYCAY 122
Query: 168 VVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGI 227
TYGDGSST GYF++D++ + N + +SV FGCG QSG+L S+ A+DG+
Sbjct: 123 STTYGDGSSTQGYFIQDVMTFQEIHNNTQVNG-TASVYFGCGTTQSGNLLMSS-RALDGL 180
Query: 228 LGFGQANSSLLSQLAAAGNVRKEFAHCLD-VVKGGGIFAIGDVVSPKVKTTPMVPNMPHY 286
+GFGQA S+ SQLA+ G V FAHCL +GGG IG V P + TP+V + HY
Sbjct: 181 IGFGQAAVSIPSQLASMGKVGNRFAHCLQGDNQGGGTIVIGSVSEPNISYTPIV-SRNHY 239
Query: 287 NVILEEVEVGGNPLDLPTSLLGTGDER-GTIIDSGTTLAYL 326
V ++ + V G + P S T G I+DSGTTLAYL
Sbjct: 240 AVGMQNIAVNGRNVTTPASFDTTSTSAGGVIMDSGTTLAYL 280
>gi|302817726|ref|XP_002990538.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
gi|300141706|gb|EFJ08415.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
Length = 434
Score = 226 bits (577), Expect = 9e-57, Method: Compositional matrix adjust.
Identities = 125/286 (43%), Positives = 166/286 (58%), Gaps = 11/286 (3%)
Query: 49 LSALKQHDTRRHGRMMAS-IDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLL 107
+ LK HD R ++ +S + L + G P GLYFT+V LGTP Y +QVDTGSDLL
Sbjct: 1 MQLLKAHDRGRMVKLKSSAVSLPVEGVADPYIAGLYFTQVQLGTPPRTYNLQVDTGSDLL 60
Query: 108 WVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEY 167
WVNC C CP SDL I + +D S++S ++ CSD C C+ +C Y
Sbjct: 61 WVNCHPCIGCPAFSDLKIPIVPYDVKASASSSKVPCSDPSCTLITQISESGCNDQNQCGY 120
Query: 168 VVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGI 227
YGDGS T GY V D++ + ++VIFGCG +QSGDL S+++ A+DGI
Sbjct: 121 SFQYGDGSGTLGYLVEDVLHYMVNA--------TATVIFGCGFKQSGDL-STSERALDGI 171
Query: 228 LGFGQANSSLLSQLAAAGNVRKEFAHCLD-VVKGGGIFAIGDVVSPKVKTTPMVPNMPHY 286
+GFG ++ S SQLA G FAHCLD +GGGI +G+V+ P ++ TP+VP M HY
Sbjct: 172 IGFGASDLSFNSQLAKQGKTPNVFAHCLDGGERGGGILVLGNVIEPDIQYTPLVPYMSHY 231
Query: 287 NVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYD 332
NV+L+ + V L + L +GTI DSGTTLAYLP Y
Sbjct: 232 NVVLQSISVNNANLTIDPKLFSNDVMQGTIFDSGTTLAYLPDEAYQ 277
>gi|302803839|ref|XP_002983672.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
gi|300148509|gb|EFJ15168.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
Length = 388
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 125/285 (43%), Positives = 166/285 (58%), Gaps = 11/285 (3%)
Query: 49 LSALKQHDTRRHGRMMAS-IDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLL 107
+ LK HD R ++ +S + L + G P GLYFT+V LGTP Y +QVDTGSDLL
Sbjct: 1 MQLLKAHDRGRMVKLKSSAVSLPVEGVADPYIAGLYFTQVQLGTPPRTYNLQVDTGSDLL 60
Query: 108 WVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEY 167
WVNC C CP SDL I + +D S++S ++ CSD C C+ +C Y
Sbjct: 61 WVNCHPCIGCPAFSDLKIPIVPYDVKASASSSKVPCSDPSCTLITQISESGCNDQNQCGY 120
Query: 168 VVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGI 227
YGDGS T GY V D++ + ++VIFGCG +QSGDL S+++ A+DGI
Sbjct: 121 SFQYGDGSGTLGYLVEDVLHYMVNA--------TATVIFGCGFKQSGDL-STSERALDGI 171
Query: 228 LGFGQANSSLLSQLAAAGNVRKEFAHCLD-VVKGGGIFAIGDVVSPKVKTTPMVPNMPHY 286
+GFG ++ S SQLA G FAHCLD +GGGI +G+V+ P ++ TP+VP M HY
Sbjct: 172 IGFGASDLSFNSQLAKQGKTPNVFAHCLDGGERGGGILVLGNVIEPDIQYTPLVPYMYHY 231
Query: 287 NVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLY 331
NV+L+ + V L + L +GTI DSGTTLAYLP Y
Sbjct: 232 NVVLQSISVNNANLTIDPKLFSNDVMQGTIFDSGTTLAYLPDEAY 276
>gi|297849132|ref|XP_002892447.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297338289|gb|EFH68706.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 493
Score = 226 bits (575), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 131/297 (44%), Positives = 177/297 (59%), Gaps = 16/297 (5%)
Query: 45 RERTLSALKQHDTRRHGRMMAS-----IDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQ 99
E L+ L+ D+ RHGR++ S ++ + G P GLY+TKV LGTP E+ VQ
Sbjct: 41 HELGLTELRAFDSARHGRLLQSPVGGVVNFPVDGASDPFLVGLYYTKVKLGTPPREFNVQ 100
Query: 100 VDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSC 159
+DTGSD+LWV+C C+ CP S+L I+L+ FDP SS++ ++CSD C + + C
Sbjct: 101 IDTGSDVLWVSCTSCNGCPKTSELQIQLSFFDPGVSSSASLVSCSDRRCYSNFQTE-SGC 159
Query: 160 SPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSV--IFGCGNRQSGDLG 217
SP C Y YGDGS TSG+++ D + + + T +NSS +FGC N Q+GDL
Sbjct: 160 SPNNLCSYSFKYGDGSGTSGFYISDFMSFDTVITS--TLAINSSAPFVFGCSNLQTGDL- 216
Query: 218 SSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVK-GGGIFAIGDVVSPKVKT 276
AVDGI G GQ + S++SQLA G + F+HCL K GGGI +G + P
Sbjct: 217 QRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCLKGDKSGGGIMVLGQIKRPDTVY 276
Query: 277 TPMVPNMPHYNVILEEVEVGGN--PLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLY 331
TP+VP+ PHYNV L+ + V G P+D + TGD GTIID+GTTLAYLP Y
Sbjct: 277 TPLVPSQPHYNVNLQSIAVNGQILPIDPSVFTIATGD--GTIIDTGTTLAYLPDEAY 331
>gi|302756119|ref|XP_002961483.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
gi|300170142|gb|EFJ36743.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
Length = 388
Score = 221 bits (563), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 125/260 (48%), Positives = 164/260 (63%), Gaps = 4/260 (1%)
Query: 82 LYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEI 141
LYFT+VGLG P Y VQVDTGSD+LWVNC CS CP KS L I LT++DP +SST+ +
Sbjct: 1 LYFTQVGLGNPVKHYIVQVDTGSDVLWVNCRPCSGCPRKSALNIPLTMYDPRESSTTSLV 60
Query: 142 ACSDNFCRTTYNNRYPSCSPGV-RCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPL 200
+CSD C CS CEY+ +YGDGS++ GY+VRD +Q N S N A
Sbjct: 61 SCSDPLCVRGRRFAEAQCSQATNNCEYIFSYGDGSTSEGYYVRDAMQYNVISSN-GLANT 119
Query: 201 NSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVK- 259
S V+FGC RQ+GDL S++ AVDGI+GFGQ S+ +QLAA N+ + F+HCL+ K
Sbjct: 120 TSQVLFGCSIRQTGDL-STSQQAVDGIIGFGQLELSVPNQLAAQQNIPRVFSHCLEGEKR 178
Query: 260 GGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDS 319
GGGI IG + P + TP+VP+ HYNV+L + V N L + + ++ G I+DS
Sbjct: 179 GGGILVIGGIAEPGMTYTPLVPDSVHYNVVLRGISVNSNRLPIDAEDFSSTNDTGVIMDS 238
Query: 320 GTTLAYLPPMLYDLVLSQFR 339
GTTLAY P Y++ + R
Sbjct: 239 GTTLAYFPSGAYNVFVQAIR 258
>gi|356537015|ref|XP_003537027.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 476
Score = 218 bits (554), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 127/330 (38%), Positives = 175/330 (53%), Gaps = 21/330 (6%)
Query: 6 LLALVVVTVAVVHQWAVGGGGVMGNFVFEVENKFKAGGERERTLSALKQHDTRRHGRMMA 65
LLA++ V ++ VH GV +E R + + R +
Sbjct: 8 LLAVITVLLSAVH-------GVF----LPLERSIPPTSHRVEVAALRARDRARHARMLRG 56
Query: 66 SIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGI 125
+D + G P++ G+Y G + VQ+DTGSD+LWVNC CS CP S LGI
Sbjct: 57 VVDFSVQGTSDPNSVGMY------GXXXXXFNVQIDTGSDILWVNCNTCSNCPQSSQLGI 110
Query: 126 KLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGV-RCEYVVTYGDGSSTSGYFVRD 184
+L FD SST+ I CSD C + CSP V +C Y YGDGS TSGY+V D
Sbjct: 111 ELNFFDTVGSSTAALIPCSDLICTSGVQGAAAECSPRVNQCSYTFQYGDGSGTSGYYVSD 170
Query: 185 IIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAA 244
+ N G ++++FGC QSGDL + TD AVDGI GFG S++SQL++
Sbjct: 171 AMYFNLIMGQPPAVNSTATIVFGCSISQSGDL-TKTDKAVDGIFGFGPGPLSVVSQLSSQ 229
Query: 245 GNVRKEFAHCLD-VVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLP 303
G K F+HCL GGGI +G+++ P + +P+VP+ PHYN+ L+ + V G PL +
Sbjct: 230 GITPKVFSHCLKGDGNGGGILVLGEILEPSIVYSPLVPSQPHYNLNLQSIAVNGQPLPIN 289
Query: 304 TSLLGTGDER-GTIIDSGTTLAYLPPMLYD 332
++ + R GTI+D GTTLAYL YD
Sbjct: 290 PAVFSISNNRGGTIVDCGTTLAYLIQEAYD 319
>gi|6579210|gb|AAF18253.1|AC011438_15 T23G18.7 [Arabidopsis thaliana]
Length = 566
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 128/294 (43%), Positives = 167/294 (56%), Gaps = 33/294 (11%)
Query: 46 ERTLSALKQHDTRRHGRMMAS-----IDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQV 100
E L+ L+ D+ RHGR++ S ++ + G P GLY+TKV LGTP E+ VQ+
Sbjct: 90 ELGLTELRAFDSARHGRLLQSPVGGVVNFPVDGASDPFLVGLYYTKVKLGTPPREFNVQI 149
Query: 101 DTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCS 160
DTGSD+LWV+C C+ CP S+L I+L+ FDP SS++ ++CSD C + + CS
Sbjct: 150 DTGSDVLWVSCTSCNGCPKTSELQIQLSFFDPGVSSSASLVSCSDRRCYSNFQTE-SGCS 208
Query: 161 PGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSST 220
P C Y YGDGS TSGY++ D F C N QSGDL
Sbjct: 209 PNNLCSYSFKYGDGSGTSGYYISD---------------------FMCSNLQSGDL-QRP 246
Query: 221 DAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVK-GGGIFAIGDVVSPKVKTTPM 279
AVDGI G GQ + S++SQLA G + F+HCL K GGGI +G + P TP+
Sbjct: 247 RRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCLKGDKSGGGIMVLGQIKRPDTVYTPL 306
Query: 280 VPNMPHYNVILEEVEVGGN--PLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLY 331
VP+ PHYNV L+ + V G P+D + TGD GTIID+GTTLAYLP Y
Sbjct: 307 VPSQPHYNVNLQSIAVNGQILPIDPSVFTIATGD--GTIIDTGTTLAYLPDEAY 358
>gi|255637574|gb|ACU19113.1| unknown [Glycine max]
Length = 290
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 114/253 (45%), Positives = 155/253 (61%), Gaps = 7/253 (2%)
Query: 49 LSALKQHDTRRHGRMMAS----IDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGS 104
LS L+ D+ RH RM+ S +D + G PS GLY+TKV LGTP E YVQ+DTGS
Sbjct: 39 LSELRARDSLRHRRMLQSTNYVVDFPVKGTFDPSQVGLYYTKVKLGTPPRELYVQIDTGS 98
Query: 105 DLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCS-PGV 163
D+LWV+C C+ CP S L I+L FDP SSTS I+C D CR+ SCS
Sbjct: 99 DVLWVSCGSCNGCPQTSGLQIQLNYFDPGSSSTSSLISCLDRRCRSGVQTSDASCSGRNN 158
Query: 164 RCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAA 223
+C Y YGDGS TSGY+V D++ T ++SV+FGC Q+GDL + ++ A
Sbjct: 159 QCTYTFQYGDGSGTSGYYVSDLMHFASIFEGTLTTNSSASVVFGCSILQTGDL-TKSERA 217
Query: 224 VDGILGFGQANSSLLSQLAAAGNVRKEFAHCLD-VVKGGGIFAIGDVVSPKVKTTPMVPN 282
VDGI GFGQ S++SQL++ G + F+HCL GGG+ +G++V P + +P+VP+
Sbjct: 218 VDGIFGFGQQGMSVISQLSSQGIAPRVFSHCLKGDNSGGGVLVLGEIVEPNIVYSPLVPS 277
Query: 283 MPHYNVILEEVEV 295
PHYN+ L+ + V
Sbjct: 278 QPHYNLNLQSISV 290
>gi|217073142|gb|ACJ84930.1| unknown [Medicago truncatula]
Length = 191
Score = 207 bits (528), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 97/175 (55%), Positives = 123/175 (70%), Gaps = 7/175 (4%)
Query: 30 NFVFEVENKFKAGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGL 89
N VF+VE R+ TLS +K HD R GR ++S+D LGGNG P+ TGLYFTK+GL
Sbjct: 24 NLVFQVE-------RRKTTLSGIKHHDHHRRGRFLSSVDFNLGGNGLPTRTGLYFTKLGL 76
Query: 90 GTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCR 149
G+P +YYVQVDTGSD+LWVNC CSRCPTKS +G+ LTL+DP S TS I+C FC
Sbjct: 77 GSPKKDYYVQVDTGSDILWVNCVECSRCPTKSQIGMDLTLYDPKGSHTSELISCDHEFCS 136
Query: 150 TTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSV 204
+TY+ P C C Y +TYGDGS+T+GY+VRD + ++ +GNL TAP NSS+
Sbjct: 137 STYDGPIPGCRAETPCPYSITYGDGSATTGYYVRDYLTFDRINGNLHTAPQNSSI 191
>gi|168064205|ref|XP_001784055.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664441|gb|EDQ51161.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 459
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 116/301 (38%), Positives = 164/301 (54%), Gaps = 14/301 (4%)
Query: 43 GERERTLSALKQHDTRRHGRMMASI-DLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVD 101
G L++HD RR R++ + + G+ TGLY+T++ LGTP ++YV VD
Sbjct: 7 GMSSEYYRTLREHDQRRLRRILPEVVAFPISGDDDTFTTGLYYTRIYLGTPPQQFYVHVD 66
Query: 102 TGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCS- 160
TGSD+ WVNC C+ C S++ + +++FDP KS++ I+C+D C N++ CS
Sbjct: 67 TGSDVAWVNCVPCTNCKRASNVALPISIFDPEKSTSKTSISCTDEECYLASNSK---CSF 123
Query: 161 PGVRCEYVVTYGDGSSTSGYFVRDIIQLNQA-SGNLKTAPLNSSVIFGCGNRQSGDLGSS 219
+ C Y YGDGSST+GY + D++ NQ SGN + + FGCG+ Q+G
Sbjct: 124 NSMSCPYSTLYGDGSSTAGYLINDVLSFNQVPSGNSTATSGTARLTFGCGSNQTGTW--- 180
Query: 220 TDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLD-VVKGGGIFAIGDVVSPKVKTTP 278
DG++GFGQA SL SQL+ FAHCL KG G IG + P + TP
Sbjct: 181 ---LTDGLVGFGQAEVSLPSQLSKQNVSVNIFAHCLQGDNKGSGTLVIGHIREPGLVYTP 237
Query: 279 MVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
+VP HYNV L + V G + PT+ + G I+DSGTTL YL YD ++
Sbjct: 238 IVPKQSHYNVELLNIGVSGTNVTTPTA-FDLSNSGGVIMDSGTTLTYLVQPAYDQFQAKV 296
Query: 339 R 339
R
Sbjct: 297 R 297
>gi|449482385|ref|XP_004156266.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 491
Score = 205 bits (521), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 109/271 (40%), Positives = 159/271 (58%), Gaps = 4/271 (1%)
Query: 67 IDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIK 126
++ + G+ +P GLYFTKV LG P E+ VQ+DTGSD+LWV C+ C CP S LGI+
Sbjct: 69 VNFSVKGSSNP-FVGLYFTKVKLGNPAREFNVQIDTGSDILWVTCSPCDGCPDSSGLGIE 127
Query: 127 LTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDII 186
L LFD +KSS++ + C+D C + C Y Y D S TSG++V D +
Sbjct: 128 LNLFDTTKSSSARVLPCTDPICAAVSTTTDQCLTQTDHCSYSFHYRDRSGTSGFYVTDSM 187
Query: 187 QLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGN 246
+ G A +++++FGC Q GDL +T A+DGI GFGQ S++SQL++ G
Sbjct: 188 HFDILLGESTIANSSATIVFGCSIYQYGDLTRATK-ALDGIFGFGQGEFSVISQLSSRGI 246
Query: 247 VRKEFAHCLD-VVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTS 305
K F+HCL GGGI +G+++ P + +P++P+ PHY + L+ + + G PT
Sbjct: 247 TPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLKLQSIALSGQLFPNPT- 305
Query: 306 LLGTGDERGTIIDSGTTLAYLPPMLYDLVLS 336
+ + TIIDSGTTLAYL +YD ++S
Sbjct: 306 MFPISNAGETIIDSGTTLAYLVEEVYDWIVS 336
>gi|449451076|ref|XP_004143288.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 488
Score = 204 bits (520), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 109/271 (40%), Positives = 159/271 (58%), Gaps = 4/271 (1%)
Query: 67 IDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIK 126
++ + G+ +P GLYFTKV LG P E+ VQ+DTGSD+LWV C+ C CP S LGI+
Sbjct: 69 VNFSVKGSSNP-FVGLYFTKVKLGNPAREFNVQIDTGSDILWVTCSPCDGCPDSSGLGIE 127
Query: 127 LTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDII 186
L LFD +KSS++ + C+D C + C Y Y D S TSG++V D +
Sbjct: 128 LNLFDTTKSSSARVLPCTDPICAAVSTTTDQCLTQTDHCSYSFHYRDRSGTSGFYVTDSM 187
Query: 187 QLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGN 246
+ G A +++++FGC Q GDL +T A+DGI GFGQ S++SQL++ G
Sbjct: 188 HFDILLGESTIANSSATIVFGCSIYQYGDLTRATK-ALDGIFGFGQGEFSVISQLSSRGI 246
Query: 247 VRKEFAHCLD-VVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTS 305
K F+HCL GGGI +G+++ P + +P++P+ PHY + L+ + + G PT
Sbjct: 247 TPKVFSHCLKGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLKLQSIALSGQLFPNPT- 305
Query: 306 LLGTGDERGTIIDSGTTLAYLPPMLYDLVLS 336
+ + TIIDSGTTLAYL +YD ++S
Sbjct: 306 MFPISNAGETIIDSGTTLAYLVEEVYDWIVS 336
>gi|20466302|gb|AAM20468.1| putative aspartyl protease [Arabidopsis thaliana]
gi|23198124|gb|AAN15589.1| putative aspartyl protease [Arabidopsis thaliana]
Length = 320
Score = 201 bits (511), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 96/167 (57%), Positives = 129/167 (77%), Gaps = 1/167 (0%)
Query: 171 YGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGF 230
YGDGSST+GY V+D++ L+ +GN +T N ++IFGCG++QSG LG S AAVDGI+GF
Sbjct: 2 YGDGSSTNGYLVKDVVHLDLVTGNRQTGSTNGTIIFGCGSKQSGQLGES-QAAVDGIMGF 60
Query: 231 GQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVIL 290
GQ+NSS +SQLA+ G V++ FAHCLD GGGIFAIG+VVSPKVKTTPM+ HY+V L
Sbjct: 61 GQSNSSFISQLASQGKVKRSFAHCLDNNNGGGIFAIGEVVSPKVKTTPMLSKSAHYSVNL 120
Query: 291 EEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQ 337
+EVG + L+L ++ +GD++G IIDSGTTL YLP +Y+ +L++
Sbjct: 121 NAIEVGNSVLELSSNAFDSGDDKGVIIDSGTTLVYLPDAVYNPLLNE 167
>gi|168022164|ref|XP_001763610.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685103|gb|EDQ71500.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 308
Score = 194 bits (493), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 118/287 (41%), Positives = 160/287 (55%), Gaps = 17/287 (5%)
Query: 52 LKQHDTRRHGRMMASI-DLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVN 110
L++HD RR RM+ + + G+ A GLY+T++ LGTP ++YV VDTGS++ WV
Sbjct: 9 LRKHDQRRLRRMLPEVVSFPISGDNDIFAMGLYYTRISLGTPPQQFYVDVDTGSNVAWVK 68
Query: 111 CAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPG-VRCEYVV 169
CA C+ C D+ + ++ FDP KS+T I+C+D C N+ CSP + C Y +
Sbjct: 69 CAPCTGCEHSGDVPVPMSTFDPRKSTTKISISCTDAECGVL--NKKLQCSPERLSCPYSL 126
Query: 170 TYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSS-VIFGCGNRQSGDLGSSTDAAVDGIL 228
YGDGSST+GY++ D+ NQ + TA ++ ++FGCG Q+G +VDG+L
Sbjct: 127 LYGDGSSTAGYYLNDVFTFNQVPSDNSTAKSGTARLVFGCGGTQTGSW------SVDGLL 180
Query: 229 GFGQANSSLLSQLAAAGNVRKEFAHCLD-VVKGGGIFAIGDVVSPKVKTTPMVPNMPHYN 287
GFG SL +QLA FAHCL V G G IG + P + TPMV HYN
Sbjct: 181 GFGPTTVSLPNQLAQQNISVNIFAHCLQGDVSGRGSLVIGTIREPDLVYTPMVFGEDHYN 240
Query: 288 VILEEVEVGGNPLDLPTS--LLGTGDERGTIIDSGTTLAYLPPMLYD 332
V L + + G + P S L TG G IIDSGTTL YL YD
Sbjct: 241 VQLLNIGISGRNVTTPASFDLEYTG---GVIIDSGTTLTYLVQPAYD 284
>gi|357168204|ref|XP_003581534.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
2-like [Brachypodium distachyon]
Length = 436
Score = 168 bits (426), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 113/305 (37%), Positives = 162/305 (53%), Gaps = 15/305 (4%)
Query: 44 ERERTLSALKQHDTRRHGRMMASIDLELGGNGH--PSATGLYFTKVGLGTPTDEYYVQVD 101
ER +L L + R + + G G + GLY V LG P+ YY+
Sbjct: 35 ERRPSLKGLGVEELSELDRKRFAAKKQQGVTGFVLEAMPGLYCITVKLGNPSRHYYLAFH 94
Query: 102 TGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSC-- 159
TGSD++WV C+ C+ CPT D+G L L+DP SSTS EI+CSD+ C + C
Sbjct: 95 TGSDVMWVPCSSCTDCPTPDDIGFSLDLYDPKNSSTSSEISCSDDRCADALKTGHAICHT 154
Query: 160 --SPGVRCEYVVTYGDGS-STSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDL 216
S G +C Y Y DG +T+GY+V D I + GN A ++SVIFGC +SG L
Sbjct: 155 SHSSGDQCGYNQIYADGVLATTGYYVSDDIHFDIFMGNESFASSSASVIFGCSKSRSGHL 214
Query: 217 GSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL-DVVKGGGIFAIGDVVSPKVK 275
DG++GFG+ SL+SQL + G V F+ CL D GGG+ + +V P ++
Sbjct: 215 ------QADGVIGFGKDAPSLISQLNSQG-VSHAFSRCLDDSDDGGGVLILDEVGEPGLE 267
Query: 276 TTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVL 335
T +V + P YN+ ++ + V + + +SL T +GT +DSGT+LAY P +YD V+
Sbjct: 268 FTSLVASRPCYNLNMKSIAVNNQNVPIDSSLFTTSSTQGTFLDSGTSLAYFPDGVYDPVI 327
Query: 336 SQFRF 340
F
Sbjct: 328 RAILF 332
>gi|297805186|ref|XP_002870477.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316313|gb|EFH46736.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 287
Score = 147 bits (370), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 91/259 (35%), Positives = 140/259 (54%), Gaps = 21/259 (8%)
Query: 45 RERTLSALKQHDTRRHGRMMAS-------IDLELGGNGHPSATGLYFTKVGLGTPTDEYY 97
E L+ L D+ RHGRM+ S +E G N + +Y+T + +GTP E+
Sbjct: 40 HELDLTQLGAFDSARHGRMLQSHVHGAFSFPVERGTN---PISRIYYTTLQIGTPPREFN 96
Query: 98 VQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYP 157
V +DTGSD+LWV+C C CP ++ +T FDP SS++ ++ACSD C + + +
Sbjct: 97 VVIDTGSDVLWVSCISCVGCPLQN-----VTFFDPGASSSAVKLACSDKRCFSDLHKK-S 150
Query: 158 SCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLG 217
CSP EY V Y DGS TSGY++ D+I + T ++ +FGC N +G L
Sbjct: 151 GCSP---LEYKVEYSDGSFTSGYYISDLISFETVMSSNLTVKSSAPFVFGCSNLHAG-LI 206
Query: 218 SSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLD-VVKGGGIFAIGDVVSPKVKT 276
S + ++ GI+G G+ ++SQL++ + F+ CL +GGG+ +G+ P
Sbjct: 207 SLPETSIHGIVGLGKGRLLVVSQLSSQRLAPEVFSLCLSGGQEGGGVIILGENRLPNTVY 266
Query: 277 TPMVPNMPHYNVILEEVEV 295
TP+V + HYNV L+ V
Sbjct: 267 TPLVRSQTHYNVNLKTFAV 285
>gi|125589909|gb|EAZ30259.1| hypothetical protein OsJ_14308 [Oryza sativa Japonica Group]
Length = 178
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 76/183 (41%), Positives = 104/183 (56%), Gaps = 8/183 (4%)
Query: 28 MGNFVFEVENKFKA--GGERERTLSALKQHDTRRHGRM-MASIDLELGGNGHPSATGLYF 84
M N VF+V KF G + + AL+ HD RH R + + +L LGG P TGLY+
Sbjct: 1 MANGVFQVRRKFHIVDGVYKGSDIGALQTHDENRHRRRNLMAAELPLGGFNIPYGTGLYY 60
Query: 85 TKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACS 144
T +G+GTP +YYVQ+DTGS WVN C +CP +SD+ KLT +DP S +S E+ C
Sbjct: 61 TDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTFYDPRSSVSSKEVKCD 120
Query: 145 DNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSV 204
D C + P C+ +RC Y+ Y DG T G D++ +Q GN +T P ++SV
Sbjct: 121 DTICTSR-----PPCNMTLRCPYITGYADGGLTMGILFTDLLHYHQLYGNGQTQPTSTSV 175
Query: 205 IFG 207
FG
Sbjct: 176 TFG 178
>gi|302757345|ref|XP_002962096.1| hypothetical protein SELMODRAFT_403622 [Selaginella moellendorffii]
gi|300170755|gb|EFJ37356.1| hypothetical protein SELMODRAFT_403622 [Selaginella moellendorffii]
Length = 506
Score = 144 bits (362), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 106/310 (34%), Positives = 148/310 (47%), Gaps = 34/310 (10%)
Query: 34 EVENKFKAGGER---ERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLG 90
E+E K G+R E L H R R + +DL L G+ AT Y+ ++G+G
Sbjct: 38 ELEGSSKQSGKRGMSEEHFRQLMDHTRARSRRFLLEVDLMLNGSSTSDAT--YYAQIGVG 95
Query: 91 TPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGI--------KLTLFDPSKSSTSGEIA 142
P VDTGSD+LW C C C +K ++ + +TL+DP S T+
Sbjct: 96 HPVQFLNAIVDTGSDILWFKCKLCQGCSSKKNVIVCSSIIMQGPITLYDPELSITASPAT 155
Query: 143 CSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNS 202
CSD C + R + S C Y ++Y D SS++G + RD++ L A LN+
Sbjct: 156 CSDPLCSEGGSCRGNNNS----CAYDISYEDTSSSTGIYFRDVVHLGHK------ASLNT 205
Query: 203 SVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVK-GG 261
++ GC SG VDGI+GFG++ S+ +QLAA F HCL K GG
Sbjct: 206 TMFLGCATSISGLW------PVDGIMGFGRSKVSVPNQLAAQAGSYNIFYHCLSGEKEGG 259
Query: 262 GIFAIG-DVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLL---GTGDERGTII 317
GI +G + P++ TPM+ N YNV L + V L + S T GTII
Sbjct: 260 GILVLGKNDEFPEMVYTPMLANDIVYNVKLVSLSVNSKALPIEASEFEYNATVGNGGTII 319
Query: 318 DSGTTLAYLP 327
DSGT+ A P
Sbjct: 320 DSGTSSATFP 329
>gi|242062704|ref|XP_002452641.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
gi|241932472|gb|EES05617.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
Length = 466
Score = 138 bits (348), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 91/261 (34%), Positives = 133/261 (50%), Gaps = 27/261 (10%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y VG+G+P + +DTGSD+ WV C CS+C ++ D +LFDPS SST +
Sbjct: 131 YVITVGIGSPAVTQTMSMDTGSDVSWVQCKPCSQCHSEVD-----SLFDPSASSTYSPFS 185
Query: 143 CSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNS 202
CS C ++ + +C+Y+V+Y DGSST+G + D + L S +K
Sbjct: 186 CSSAACVQLSQSQQGNGCSSSQCQYIVSYVDGSSTTGTYSSDTLTLG--SNAIK------ 237
Query: 203 SVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKG-G 261
FGC +SG T DG++G G SL+SQ AG K F++CL G
Sbjct: 238 GFQFGCSQSESGGFSDQT----DGLMGLGGDAQSLVSQ--TAGTFGKAFSYCLPPTPGSS 291
Query: 262 GIFAIGDVVSPKVKTTPMV--PNMP-HYNVILEEVEVGGNPLDLPTSLLGTGDERGTIID 318
G +G TPM+ +P +Y V+LE + VGG L++PTS+ G+++D
Sbjct: 292 GFLTLGAASRSGFVKTPMLRSTQIPTYYGVLLEAIRVGGQQLNIPTSVF----SAGSVMD 347
Query: 319 SGTTLAYLPPMLYDLVLSQFR 339
SGT + LPP Y + S F+
Sbjct: 348 SGTVITRLPPTAYSALSSAFK 368
>gi|302783112|ref|XP_002973329.1| hypothetical protein SELMODRAFT_413603 [Selaginella moellendorffii]
gi|300159082|gb|EFJ25703.1| hypothetical protein SELMODRAFT_413603 [Selaginella moellendorffii]
Length = 437
Score = 137 bits (345), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 104/306 (33%), Positives = 155/306 (50%), Gaps = 26/306 (8%)
Query: 43 GERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDT 102
G + L L +H+ RR GR + I L GN S GLY+T++GLG P + V VDT
Sbjct: 46 GMSKHHLQHLVEHNDRR-GRFLQGISFPLKGN--YSDLGLYYTEIGLGNPVQKLKVIVDT 102
Query: 103 GSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCS-- 160
GSD+LWV C+ C C +K D+ L++++ S SSTS +CSD C CS
Sbjct: 103 GSDILWVKCSPCRSCLSKQDIIPPLSIYNLSASSTSSVSSCSDPLC----TGEQAVCSRS 158
Query: 161 -PGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSS 219
C Y ++Y D S++ G +V+D + GN T S + FGC +G
Sbjct: 159 GSNSACAYGISYQDKSTSIGAYVKDDMHYVLQGGNATT----SHIFFGCAINITGSW--- 211
Query: 220 TDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVK-GGGIFAIGDVV-SPKVKTT 277
DGI+GFGQ + ++ +Q+A N+ + F+HCL K GGGI G+ + ++ T
Sbjct: 212 ---PADGIMGFGQISKTVPNQIATQRNMSRVFSHCLGGEKHGGGILEFGEEPNTTEMVFT 268
Query: 278 PMVPNMPHYNVILEEVEVGGNPLDLPTSLL----GTGDERGTIIDSGTTLAYLPPMLYDL 333
P++ HYNV L + V L + + + +E G IIDSGT+ A L +
Sbjct: 269 PLLNVTTHYNVDLLSISVNSKVLPIDSKEFSYVSNSTNETGVIIDSGTSFALLATKANRI 328
Query: 334 VLSQFR 339
+ S+ +
Sbjct: 329 LFSEIK 334
>gi|226495123|ref|NP_001141522.1| uncharacterized protein LOC100273634 precursor [Zea mays]
gi|194704920|gb|ACF86544.1| unknown [Zea mays]
gi|223949445|gb|ACN28806.1| unknown [Zea mays]
gi|413924531|gb|AFW64463.1| pepsin A [Zea mays]
Length = 515
Score = 137 bits (345), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 102/311 (32%), Positives = 153/311 (49%), Gaps = 35/311 (11%)
Query: 40 KAGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGL---YFTKVGLGTPTDEY 96
+ GE R AL + D +R R +A + L GG+ L Y+ V +GTP +
Sbjct: 53 RGSGEYYR---ALVRSDIQRQKRRLAVLSLSKGGSTFSPGNDLGWLYYAWVDVGTPATSF 109
Query: 97 YVQVDTGSDLLWVNCAGCSRCPTKS----DLGIKLTLFDPSKSSTSGEIACSDNFCRTTY 152
V +DTGSDL WV C C +C S +L L ++ P++S+TS + CS C++
Sbjct: 110 LVALDTGSDLFWVPC-DCIQCAPLSGYRGNLDRDLRIYRPAESTTSRHLPCSHELCQSV- 167
Query: 153 NNRYPSCS-PGVRCEYVVTY-GDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGN 210
P C+ P C Y + Y + +++SG + D + LN ++ P+N+SVI GCG
Sbjct: 168 ----PGCTNPKQPCPYNIDYFSENTTSSGLLIEDTLHLNYREDHV---PVNASVIIGCGQ 220
Query: 211 RQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVV 270
+QSGD A DG+LG G A+ S+ S LA AG V+ F+ C G IF GD
Sbjct: 221 KQSGDYLDGI--APDGLLGLGMADISVPSFLARAGLVQNSFSMCFKEDSSGRIF-FGDQG 277
Query: 271 SPKVKTTPMVP---NMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLP 327
P ++TP VP + Y V +++ +G L+ G ++DSGT+ LP
Sbjct: 278 VPSQQSTPFVPLYGKLQTYAVNVDKSCIGHKCLE--------GTSFKALVDSGTSFTSLP 329
Query: 328 PMLYDLVLSQF 338
+Y +F
Sbjct: 330 FDVYKAFTMEF 340
>gi|413924530|gb|AFW64462.1| hypothetical protein ZEAMMB73_591827, partial [Zea mays]
Length = 469
Score = 137 bits (345), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 102/311 (32%), Positives = 153/311 (49%), Gaps = 35/311 (11%)
Query: 40 KAGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGL---YFTKVGLGTPTDEY 96
+ GE R AL + D +R R +A + L GG+ L Y+ V +GTP +
Sbjct: 53 RGSGEYYR---ALVRSDIQRQKRRLAVLSLSKGGSTFSPGNDLGWLYYAWVDVGTPATSF 109
Query: 97 YVQVDTGSDLLWVNCAGCSRCPTKS----DLGIKLTLFDPSKSSTSGEIACSDNFCRTTY 152
V +DTGSDL WV C C +C S +L L ++ P++S+TS + CS C++
Sbjct: 110 LVALDTGSDLFWVPC-DCIQCAPLSGYRGNLDRDLRIYRPAESTTSRHLPCSHELCQSV- 167
Query: 153 NNRYPSCS-PGVRCEYVVTY-GDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGN 210
P C+ P C Y + Y + +++SG + D + LN ++ P+N+SVI GCG
Sbjct: 168 ----PGCTNPKQPCPYNIDYFSENTTSSGLLIEDTLHLNYREDHV---PVNASVIIGCGQ 220
Query: 211 RQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVV 270
+QSGD A DG+LG G A+ S+ S LA AG V+ F+ C G IF GD
Sbjct: 221 KQSGDYLDGI--APDGLLGLGMADISVPSFLARAGLVQNSFSMCFKEDSSGRIF-FGDQG 277
Query: 271 SPKVKTTPMVP---NMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLP 327
P ++TP VP + Y V +++ +G L+ G ++DSGT+ LP
Sbjct: 278 VPSQQSTPFVPLYGKLQTYAVNVDKSCIGHKCLE--------GTSFKALVDSGTSFTSLP 329
Query: 328 PMLYDLVLSQF 338
+Y +F
Sbjct: 330 FDVYKAFTMEF 340
>gi|195619700|gb|ACG31680.1| pepsin A [Zea mays]
Length = 485
Score = 137 bits (345), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 102/311 (32%), Positives = 153/311 (49%), Gaps = 35/311 (11%)
Query: 40 KAGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGL---YFTKVGLGTPTDEY 96
+ GE R AL + D +R R +A + L GG+ L Y+ V +GTP +
Sbjct: 23 RGSGEYYR---ALVRSDIQRQKRRLAVLSLSKGGSTFSPGNDLGWLYYAWVDVGTPATSF 79
Query: 97 YVQVDTGSDLLWVNCAGCSRCPTKS----DLGIKLTLFDPSKSSTSGEIACSDNFCRTTY 152
V +DTGSDL WV C C +C S +L L ++ P++S+TS + CS C++
Sbjct: 80 LVALDTGSDLFWVPC-DCIQCAPLSGYRGNLDRDLRIYRPAESTTSRHLPCSHELCQSV- 137
Query: 153 NNRYPSCS-PGVRCEYVVTY-GDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGN 210
P C+ P C Y + Y + +++SG + D + LN ++ P+N+SVI GCG
Sbjct: 138 ----PGCTNPKQPCPYNIDYFSENTTSSGLLIEDTLHLNYREDHV---PVNASVIIGCGQ 190
Query: 211 RQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVV 270
+QSGD A DG+LG G A+ S+ S LA AG V+ F+ C G IF GD
Sbjct: 191 KQSGDYLDGI--APDGLLGLGMADISVPSFLARAGLVQNSFSMCFKEDSSGRIF-FGDQG 247
Query: 271 SPKVKTTPMVP---NMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLP 327
P ++TP VP + Y V +++ +G L+ G ++DSGT+ LP
Sbjct: 248 VPSQQSTPFVPLYGKLQTYAVNVDKSCIGHKCLE--------GTSFKALVDSGTSFTSLP 299
Query: 328 PMLYDLVLSQF 338
+Y +F
Sbjct: 300 LDVYKAFTMEF 310
>gi|413923780|gb|AFW63712.1| hypothetical protein ZEAMMB73_689747 [Zea mays]
Length = 470
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 102/280 (36%), Positives = 142/280 (50%), Gaps = 36/280 (12%)
Query: 75 GHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSK 134
G+ T Y LGTP ++VDTGSDL WV C C+ S K LFDP++
Sbjct: 129 GYDIGTSNYVVTASLGTPGMAQTLEVDTGSDLSWVQCKPCA---APSCYRQKDPLFDPAQ 185
Query: 135 SSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGN 194
SS+ + C + C Y S +C YVV+YGDGS+T+G + D + L
Sbjct: 186 SSSYAAVPCGRSACAGL--GIYASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLAA---- 239
Query: 195 LKTAPLNSSV---IFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAA-GNVRKE 250
N++V +FGCG+ QSG L + +DG+LGFG+ SL+ Q A A G V
Sbjct: 240 ------NATVQGFLFGCGHAQSGGLFT----GIDGLLGFGREQPSLVQQTAGAYGGV--- 286
Query: 251 FAHCLDVVKG-GGIFAIGDV--VSPKVKTTPMV--PNMP-HYNVILEEVEVGGNPLDLPT 304
F++CL G +G V+P TT ++ PN P +Y V+L + VGG PL +P
Sbjct: 287 FSYCLPTKSSTTGYLTLGGPSGVAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQPLSVPA 346
Query: 305 SLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFRFWIAS 344
S GT++D+GT + LPP Y + S FR +AS
Sbjct: 347 SAFAA----GTVVDTGTVITRLPPAAYAALRSAFRSGMAS 382
>gi|219887985|gb|ACL54367.1| unknown [Zea mays]
Length = 515
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 101/311 (32%), Positives = 152/311 (48%), Gaps = 35/311 (11%)
Query: 40 KAGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGL---YFTKVGLGTPTDEY 96
+ GE R AL + D +R R +A + L GG+ L Y+ V +GTP +
Sbjct: 53 RGSGEYYR---ALVRSDIQRQKRRLAVLSLSKGGSTFSPGNDLGWLYYAWVDVGTPATSF 109
Query: 97 YVQVDTGSDLLWVNCAGCSRCPTKS----DLGIKLTLFDPSKSSTSGEIACSDNFCRTTY 152
V +DTGSDL WV C C +C S +L L ++ P++S+TS + CS C++
Sbjct: 110 LVALDTGSDLFWVPC-DCIQCAPLSGYRGNLDRDLRIYRPAESTTSRHLPCSHELCQSV- 167
Query: 153 NNRYPSCS-PGVRCEYVVTY-GDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGN 210
P C+ P C Y + Y + +++SG + D + LN ++ P+N+SVI GCG
Sbjct: 168 ----PGCTNPKQPCPYNIDYFSENTTSSGLLIEDTLHLNYREDHV---PVNASVIIGCGQ 220
Query: 211 RQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVV 270
+QSGD A DG+L G A+ S+ S LA AG V+ F+ C G IF GD
Sbjct: 221 KQSGDYLDGI--APDGLLALGMADISVPSFLARAGLVQNSFSMCFKEDSSGRIF-FGDQG 277
Query: 271 SPKVKTTPMVP---NMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLP 327
P ++TP VP + Y V +++ +G L+ G ++DSGT+ LP
Sbjct: 278 VPSQQSTPFVPLYGKLQTYAVNVDKSCIGHKCLE--------GTSFKALVDSGTSFTSLP 329
Query: 328 PMLYDLVLSQF 338
+Y +F
Sbjct: 330 FDVYKAFTMEF 340
>gi|413952262|gb|AFW84911.1| hypothetical protein ZEAMMB73_904583 [Zea mays]
Length = 312
Score = 135 bits (340), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 72/145 (49%), Positives = 96/145 (66%), Gaps = 2/145 (1%)
Query: 193 GNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFA 252
GN +TA ++S++FGC N QSGDL + D AVDGI GFGQ S++SQL + G K F+
Sbjct: 8 GNEQTANSSASIVFGCSNSQSGDL-TKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFS 66
Query: 253 HCLD-VVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGD 311
HCL GGGI +G++V P + TP+VP+ PHYN+ LE + V G L + +SL T +
Sbjct: 67 HCLKGSDNGGGILVLGEIVEPGLVYTPLVPSQPHYNLNLESIAVNGQKLPIDSSLFTTSN 126
Query: 312 ERGTIIDSGTTLAYLPPMLYDLVLS 336
+GTI+DSGTTLAYL YD +S
Sbjct: 127 TQGTIVDSGTTLAYLADGAYDPFVS 151
>gi|302789618|ref|XP_002976577.1| hypothetical protein SELMODRAFT_416663 [Selaginella moellendorffii]
gi|300155615|gb|EFJ22246.1| hypothetical protein SELMODRAFT_416663 [Selaginella moellendorffii]
Length = 437
Score = 135 bits (340), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 105/306 (34%), Positives = 155/306 (50%), Gaps = 26/306 (8%)
Query: 43 GERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDT 102
G ++ L L +H+ RR GR + I L GN S GLY+T++GLG P + V VDT
Sbjct: 46 GMSKQHLQHLVEHNDRR-GRFLQGISFPLKGN--YSDLGLYYTEIGLGNPVQKLKVIVDT 102
Query: 103 GSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSP- 161
GSD+LWV C+ C C +K D+ L++++ S SSTS +CSD C CS
Sbjct: 103 GSDILWVKCSPCRSCLSKQDIIPPLSIYNLSASSTSSVSSCSDPLC----TGEEVVCSRS 158
Query: 162 --GVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSS 219
C YV +Y D S++ G +VRD + GN T S + FGC +G
Sbjct: 159 GNNSACAYVSSYQDKSASVGAYVRDDMHYVLHGGNATT----SRIFFGCATNITGSW--- 211
Query: 220 TDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVK-GGGIFAIGDVV-SPKVKTT 277
VDGI+GFG + ++ +Q+A N+ + F+HCL K GGGI G+ + ++ T
Sbjct: 212 ---PVDGIMGFGLISKTVPNQIATQRNMSRVFSHCLGGEKHGGGILEFGEAPNTTEMVFT 268
Query: 278 PMVPNMPHYNVILEEVEVGGN--PLDLP--TSLLGTGDERGTIIDSGTTLAYLPPMLYDL 333
P++ HYNV L + V P+D + + + + G IIDSGTT L +
Sbjct: 269 PLLNVTTHYNVDLLSISVNSKVLPIDPKEFSYVRNSTNNTGVIIDSGTTFVLLTTKANRM 328
Query: 334 VLSQFR 339
+ + +
Sbjct: 329 LFQEIK 334
>gi|212722898|ref|NP_001132197.1| pepsin A precursor [Zea mays]
gi|194693730|gb|ACF80949.1| unknown [Zea mays]
gi|195605492|gb|ACG24576.1| pepsin A [Zea mays]
gi|413938914|gb|AFW73465.1| pepsin A [Zea mays]
Length = 519
Score = 135 bits (339), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 102/303 (33%), Positives = 150/303 (49%), Gaps = 34/303 (11%)
Query: 51 ALKQHDTRRHGRMMASID--LELGGNGHPSATG-----LYFTKVGLGTPTDEYYVQVDTG 103
AL + D +R R +A + L L G + G LY+ V +GTPT + V +DTG
Sbjct: 61 ALLRSDLQRQKRRLAGKNQLLSLSKGGSTFSPGNDLGWLYYAWVDVGTPTTSFLVALDTG 120
Query: 104 SDLLWVNCAGCSRCPTKS----DLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSC 159
SDL WV C C +C S +L L ++ P++S+TS + CS C+
Sbjct: 121 SDLFWVPC-DCIQCAPLSSYRGNLDRDLGIYKPAESTTSRHLPCSHELCQPGSG----CT 175
Query: 160 SPGVRCEYVVTY-GDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGS 218
+P C Y + Y + +++SG + D + LN G+ AP+N+SVI GCG +QSGD
Sbjct: 176 NPKQPCTYNIDYFSENTTSSGLLIEDSLHLNSREGH---APVNASVIIGCGRKQSGDYLD 232
Query: 219 STDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSPKVKTTP 278
A DG+LG G A+ S+ S LA AG VR F+ C G IF GD ++TP
Sbjct: 233 GI--APDGLLGLGMADISVPSFLARAGLVRNSFSMCFKEDSSGRIF-FGDQGVSSQQSTP 289
Query: 279 MVP---NMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVL 335
VP + Y V +++ +G L+ G ++DSGT+ LPP +Y
Sbjct: 290 FVPLYGKLQTYAVNVDKSCIGHKCLE--------GSSFQALVDSGTSFTSLPPDVYKAFT 341
Query: 336 SQF 338
++F
Sbjct: 342 TEF 344
>gi|168060150|ref|XP_001782061.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666472|gb|EDQ53125.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 423
Score = 135 bits (339), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 100/298 (33%), Positives = 138/298 (46%), Gaps = 27/298 (9%)
Query: 46 ERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSD 105
ER LS L + H S+ +GGN +P GLY+ + LG+P Y++ +DTGSD
Sbjct: 10 ERDLSRLGKSSVGNH-----SVRFHVGGNIYPD--GLYYMALLLGSPPKLYFLDMDTGSD 62
Query: 106 LLWVNC-AGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVR 164
L W C A C C L++P K+ + C C C+ V+
Sbjct: 63 LTWAQCDAPCRNCAIGPH-----GLYNPKKAKV---VDCHLPVCAQIQQGGSYECNSDVK 114
Query: 165 -CEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAA 223
C+Y V Y DGSST G V D + + +G L + + I GCG Q G L S A+
Sbjct: 115 QCDYEVEYADGSSTMGVLVEDTLTVRLTNGTL----IQTKAIIGCGYDQQGTLAKSP-AS 169
Query: 224 VDGILGFGQANSSLLSQLAAAGNVRKEFAHCL-DVVKGGGIFAIGDVVSPK--VKTTPMV 280
DG++G + +L +QLA G ++ HCL D GGG GD + P + TPM+
Sbjct: 170 TDGVIGLSSSKVALPAQLAEKGIIKNVLGHCLADGSNGGGYLFFGDELVPSWGMTWTPMM 229
Query: 281 --PNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLS 336
P M Y L+ + GG+ L L T + DSGT+ YL P Y VLS
Sbjct: 230 GKPEMLGYQARLQSIRYGGDSLVLNNDEDLTRSTSSVMFDSGTSFTYLVPQAYASVLS 287
>gi|168025534|ref|XP_001765289.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162683608|gb|EDQ70017.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 372
Score = 135 bits (339), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 94/269 (34%), Positives = 138/269 (51%), Gaps = 31/269 (11%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGE 140
G + + LGTP + V +DTGSDL W+ C C ++D +FDPSKSST +
Sbjct: 23 GEFLVPIYLGTPPQKAVVIIDTGSDLTWIQSEPCRACFEQAD-----PIFDPSKSSTYNK 77
Query: 141 IACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPL 200
IACS + C + +CS C Y YGDGS T GYF ++ I +G
Sbjct: 78 IACSSSACADLLGTQ--TCSAAANCIYAYGYGDGSVTRGYFSKETITATDTAGE------ 129
Query: 201 NSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL-DVVK 259
V FG +G G D +GILG GQ S+ SQL + + +F++CL D +
Sbjct: 130 --EVKFGASVYNTGTFG---DTGGEGILGLGQGPVSMPSQLGSV--LGNKFSYCLVDWLS 182
Query: 260 GG---GIFAIGDVVSP--KVKTTPMVPNMPH---YNVILEEVEVGGNPLDLPTSL--LGT 309
G GD P +V+ TP+VPN H Y + ++ + VGG+ LD+ S+ + +
Sbjct: 183 AGSETSTMYFGDAAVPSGEVQYTPIVPNADHPTYYYIAVQGISVGGSLLDIDQSVYEIDS 242
Query: 310 GDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
G GTIIDSGTT+ YL +++ +++ +
Sbjct: 243 GGSGGTIIDSGTTITYLQQEVFNALVAAY 271
>gi|224092220|ref|XP_002309515.1| predicted protein [Populus trichocarpa]
gi|222855491|gb|EEE93038.1| predicted protein [Populus trichocarpa]
Length = 473
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 92/264 (34%), Positives = 131/264 (49%), Gaps = 26/264 (9%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSG 139
+G Y VGLGTP E+ + DTGSDL W C C++ K K DP+KS++
Sbjct: 130 SGDYAVTVGLGTPKKEFTLIFDTGSDLTWTQCEPCAKTCYKQ----KEPRLDPTKSTSYK 185
Query: 140 EIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAP 199
I+CS FC+ SCS C Y V YGDGS + G+F + + L+ ++
Sbjct: 186 NISCSSAFCKLLDTEGGESCSSPT-CLYQVQYGDGSYSIGFFATETLTLSSSN------- 237
Query: 200 LNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVK 259
+ + +FGCG + SG + G+LG G+ SL SQ A +K F++CL
Sbjct: 238 VFKNFLFGCGQQNSGLFRGAA-----GLLGLGRTKLSLPSQ--TAQKYKKLFSYCLPASS 290
Query: 260 GG-GIFAIGDVVSPKVKTTPMVPNM---PHYNVILEEVEVGGNPLDLPTSLLGTGDERGT 315
G + G VS VK TP+ + P Y + + E+ VGGN L + S+ T GT
Sbjct: 291 SSKGYLSFGGQVSKTVKFTPLSEDFKSTPFYGLDITELSVGGNKLSIDASIFSTS---GT 347
Query: 316 IIDSGTTLAYLPPMLYDLVLSQFR 339
+IDSGT + LP Y + S F+
Sbjct: 348 VIDSGTVITRLPSTAYSALSSAFQ 371
>gi|449495082|ref|XP_004159729.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
1-like [Cucumis sativus]
Length = 524
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 111/341 (32%), Positives = 161/341 (47%), Gaps = 51/341 (14%)
Query: 27 VMGNFVFEVENKFKAGGER---------ERTL---SALKQHDTRRHGRMMASIDLE---- 70
V G+F F + + + + E TL +A+ + D H R + +
Sbjct: 31 VFGSFTFNIHHLYSPAVRQILPFHSFPDEGTLDYYAAMVRTDXFVHSRRLGQVQDHRPLT 90
Query: 71 -LGGNG--HPSATG-LYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPT---KSDL 123
L GN S G LY+ +V +GTP Y V +DTGSDL W+ C C C T +
Sbjct: 91 FLSGNETLRISPLGFLYYAEVTVGTPGVPYLVALDTGSDLFWLPC-DCVNCITGLNTTQG 149
Query: 124 GIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSC-SPGVRCEYVVTY-GDGSSTSGYF 181
+ ++ P+ SSTS E+ CS + C + C SP C Y V+Y D +S++GY
Sbjct: 150 PVNFNIYSPNNSSTSKEVQCSSSLC-----SHLDQCSSPSDTCPYQVSYLSDNTSSTGYL 204
Query: 182 VRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQL 241
V DI+ L + ++++ P+N+ + GCG QSG SS AA +G+ G G N S+ S L
Sbjct: 205 VEDILHL--TTNDVQSKPVNARITLGCGKDQSGAFLSS--AAPNGLFGLGIENVSVPSIL 260
Query: 242 AAAGNVRKEFAHCLDVVKGGGIFAIGDVVSPKVKTTPMVPNM----PHYNVILEEVEVGG 297
A AG + F+ C + G I GD SP TP N+ P YNV + ++ VGG
Sbjct: 261 ANAGLISNSFSLCFGPARMGRI-EFGDKGSPGQNETPF--NLGRRHPTYNVSITQIGVGG 317
Query: 298 NPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
+ DL + I DSGT+ YL Y L +F
Sbjct: 318 HISDL---------DVAVIFDSGTSFTYLNDPAYSLFADKF 349
>gi|449456843|ref|XP_004146158.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 547
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 95/266 (35%), Positives = 135/266 (50%), Gaps = 31/266 (11%)
Query: 82 LYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPT---KSDLGIKLTLFDPSKSSTS 138
LY+ +V +GTP Y V +DTGSDL W+ C C C T + + ++ P+ SSTS
Sbjct: 129 LYYAEVTVGTPGVPYLVALDTGSDLFWLPC-DCVNCITGLNTTQGPVNFNIYSPNNSSTS 187
Query: 139 GEIACSDNFCRTTYNNRYPSC-SPGVRCEYVVTY-GDGSSTSGYFVRDIIQLNQASGNLK 196
E+ CS + C + C SP C Y V+Y D +S++GY V DI+ L + +++
Sbjct: 188 KEVQCSSSLC-----SHLDQCSSPSDTCPYQVSYLSDNTSSTGYLVEDILHL--TTNDVQ 240
Query: 197 TAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLD 256
+ P+N+ + GCG QSG SS AA +G+ G G N S+ S LA AG + F+ C
Sbjct: 241 SKPVNARITLGCGKDQSGAFLSS--AAPNGLFGLGIENVSVPSILANAGLISNSFSLCFG 298
Query: 257 VVKGGGIFAIGDVVSPKVKTTPMVPNM----PHYNVILEEVEVGGNPLDLPTSLLGTGDE 312
+ G I GD SP TP N+ P YNV + ++ VGG+ DL +
Sbjct: 299 PARMGRI-EFGDKGSPGQNETPF--NLGRRHPTYNVSITQIGVGGHISDL---------D 346
Query: 313 RGTIIDSGTTLAYLPPMLYDLVLSQF 338
I DSGT+ YL Y L +F
Sbjct: 347 VAVIFDSGTSFTYLNDPAYSLFADKF 372
>gi|125547762|gb|EAY93584.1| hypothetical protein OsI_15370 [Oryza sativa Indica Group]
Length = 202
Score = 130 bits (328), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 72/180 (40%), Positives = 99/180 (55%), Gaps = 8/180 (4%)
Query: 32 VFEVENKFK--AGGERERTLSALKQHDTRRHGRMMASIDLELGGNG--HPSATGLYFTKV 87
+F+V KF GG + + AL+ HD RH + + D LGG G S+TG Y +
Sbjct: 27 LFQVRRKFSIMGGGCKGSDIGALQTHDRNRHLSRLVAADFSLGGLGGISTSSTG-YMLQC 85
Query: 88 GLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNF 147
G+ ++ VDTGS WVNC C +CP KSD+ KLTL+DP S +S + C D F
Sbjct: 86 SFGSI---HFFLVDTGSSAFWVNCIPCKQCPRKSDILKKLTLYDPRSSVSSKVVKCDDMF 142
Query: 148 CRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFG 207
C + + P C+ + C ++ TY DG ST G FV D++ NQ SGN T N+S+ FG
Sbjct: 143 CTSPDRDVQPECNTSLLCPFIATYADGGSTIGAFVTDLVHYNQLSGNGLTQSTNTSLTFG 202
>gi|384252236|gb|EIE25712.1| acid protease [Coccomyxa subellipsoidea C-169]
Length = 599
Score = 130 bits (328), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 91/263 (34%), Positives = 134/263 (50%), Gaps = 32/263 (12%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSR-C-PTKSDLGIKLTLFDPSKSSTS 138
G ++ + LGTP ++ V VDTGS + +V CA C R C P D FDP+ SS+S
Sbjct: 60 GYFYATLHLGTPARQFAVIVDTGSTITYVPCASCGRNCGPHHKD-----AAFDPASSSSS 114
Query: 139 GEIACSDNFCRTTYNNRYP-SCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKT 197
I C + C R P CS C Y TY + SS++G V D +QL +
Sbjct: 115 AVIGCDSDKC---ICGRPPCGCSEKRECTYQRTYAEQSSSAGLLVSDQLQLRDGA----- 166
Query: 198 APLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDV 257
V+FGC +++G++ + DGILG G + SL++QLA +G + FA C
Sbjct: 167 ----VEVVFGCETKETGEI---YNQEADGILGLGNSEVSLVNQLAGSGVIDDVFALCFGS 219
Query: 258 VKGGGIFAIGDVVSPK----VKTTPMVPNMPH---YNVILEEVEVGGNPLDLPTSLLGTG 310
V+G G +GDV + + ++ T ++ ++ H Y+V LE + VGG L + G
Sbjct: 220 VEGDGALMLGDVDAAEYDVALQYTALLSSLAHPHYYSVQLEALWVGGQQLPVKPERYEEG 279
Query: 311 DERGTIIDSGTTLAYLPPMLYDL 333
GT++DSGTT YLP + L
Sbjct: 280 --YGTVLDSGTTFTYLPSEAFQL 300
>gi|218191589|gb|EEC74016.1| hypothetical protein OsI_08957 [Oryza sativa Indica Group]
Length = 520
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 99/296 (33%), Positives = 144/296 (48%), Gaps = 30/296 (10%)
Query: 54 QHDTRRHGRMMASIDLELGGNGHPSATGL---YFTKVGLGTPTDEYYVQVDTGSDLLWVN 110
Q RR G + L GG+ PS L Y+T V +GTP + V +DTGSDL WV
Sbjct: 70 QRQKRRVGGKYQLLSLSQGGSIFPSGNDLGWLYYTWVDVGTPNTSFLVALDTGSDLFWVP 129
Query: 111 CAGCSRCPTKS----DLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCE 166
C C +C S L L ++ PS+S+TS + CS C +P C
Sbjct: 130 C-DCIQCAPLSSYHGSLDRDLGIYKPSESTTSRHLPCSHELCSPASG----CTNPKQPCP 184
Query: 167 YVVTY-GDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVD 225
Y + Y + +++SG + D++ L+ G+ AP+N+SVI GCG +QSG A D
Sbjct: 185 YNIDYFSENTTSSGLLIEDMLHLDSREGH---APVNASVIIGCGKKQSGSYLEGI--APD 239
Query: 226 GILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSPKVKTTPMVP---N 282
G+LG G A+ S+ S LA AG VR F+ C G IF GD P ++TP VP
Sbjct: 240 GLLGLGMADISVPSFLARAGLVRNSFSMCFKKDDSGRIF-FGDQGVPTQQSTPFVPMNGK 298
Query: 283 MPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
+ Y V +++ +G + G G + ++D+GT+ LP Y + +F
Sbjct: 299 LQTYAVNVDKYCIGHKCTE------GAGFQ--ALVDTGTSFTSLPLDAYKSITMEF 346
>gi|115448709|ref|NP_001048134.1| Os02g0751100 [Oryza sativa Japonica Group]
gi|46390211|dbj|BAD15642.1| aspartyl protease-like [Oryza sativa Japonica Group]
gi|113537665|dbj|BAF10048.1| Os02g0751100 [Oryza sativa Japonica Group]
gi|222623681|gb|EEE57813.1| hypothetical protein OsJ_08401 [Oryza sativa Japonica Group]
Length = 520
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 99/296 (33%), Positives = 144/296 (48%), Gaps = 30/296 (10%)
Query: 54 QHDTRRHGRMMASIDLELGGNGHPSATGL---YFTKVGLGTPTDEYYVQVDTGSDLLWVN 110
Q RR G + L GG+ PS L Y+T V +GTP + V +DTGSDL WV
Sbjct: 70 QRQKRRVGGKYQLLSLSQGGSIFPSGNDLGWLYYTWVDVGTPNTSFLVALDTGSDLFWVP 129
Query: 111 CAGCSRCPTKS----DLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCE 166
C C +C S L L ++ PS+S+TS + CS C +P C
Sbjct: 130 C-DCIQCAPLSSYHGSLDRDLGIYKPSESTTSRHLPCSHELCSPASG----CTNPKQPCP 184
Query: 167 YVVTY-GDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVD 225
Y + Y + +++SG + D++ L+ G+ AP+N+SVI GCG +QSG A D
Sbjct: 185 YNIDYFSENTTSSGLLIEDMLHLDSREGH---APVNASVIIGCGKKQSGSYLEGI--APD 239
Query: 226 GILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSPKVKTTPMVP---N 282
G+LG G A+ S+ S LA AG VR F+ C G IF GD P ++TP VP
Sbjct: 240 GLLGLGMADISVPSFLARAGLVRNSFSMCFKKDDSGRIF-FGDQGVPTQQSTPFVPMNGK 298
Query: 283 MPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
+ Y V +++ +G + G G + ++D+GT+ LP Y + +F
Sbjct: 299 LQTYAVNVDKYCIGHKCTE------GAGFQ--ALVDTGTSFTSLPLDAYKSITMEF 346
>gi|45735840|dbj|BAD12875.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735966|dbj|BAD12995.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|125583491|gb|EAZ24422.1| hypothetical protein OsJ_08175 [Oryza sativa Japonica Group]
Length = 475
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 97/268 (36%), Positives = 137/268 (51%), Gaps = 32/268 (11%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y V LGTP ++VDTGSD+ WV C C P S + LFDP++SS+ +
Sbjct: 142 YVVTVSLGTPAVAQTLEVDTGSDVSWVQCKPCPSPPCYSQ---RDPLFDPTRSSSYSAVP 198
Query: 143 CSDNFCR--TTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPL 200
C+ C Y+N CS G +C YVV+YGDGS+T+G + D + L S LK
Sbjct: 199 CAAASCSQLALYSN---GCS-GGQCGYVVSYGDGSTTTGVYSSDTLTLT-GSNALK---- 249
Query: 201 NSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKG 260
+FGCG+ Q G A VDG+LG G+ SL+SQ A+ F++CL +
Sbjct: 250 --GFLFGCGHAQQGLF-----AGVDGLLGLGRQGQSLVSQ--ASSTYGGVFSYCLPPTQN 300
Query: 261 --GGIFAIGDVVSPKVKTTPMV--PNMP-HYNVILEEVEVGGNPLDLPTSLLGTGDERGT 315
G I G + TTP++ N P +Y V+L + VGG PL + S+ + G
Sbjct: 301 SVGYISLGGPSSTAGFSTTPLLTASNDPTYYIVMLAGISVGGQPLSIDASVFAS----GA 356
Query: 316 IIDSGTTLAYLPPMLYDLVLSQFRFWIA 343
++D+GT + LPP Y + S FR +A
Sbjct: 357 VVDTGTVVTRLPPTAYSALRSAFRAAMA 384
>gi|125540927|gb|EAY87322.1| hypothetical protein OsI_08726 [Oryza sativa Indica Group]
Length = 464
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 97/268 (36%), Positives = 137/268 (51%), Gaps = 32/268 (11%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y V LGTP ++VDTGSD+ WV C C P S + LFDP++SS+ +
Sbjct: 131 YVVTVSLGTPAVAQTLEVDTGSDVSWVQCKPCPSPPCYSQ---RDPLFDPTRSSSYSAVP 187
Query: 143 CSDNFCR--TTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPL 200
C+ C Y+N CS G +C YVV+YGDGS+T+G + D + L S LK
Sbjct: 188 CAAASCSQLALYSN---GCS-GGQCGYVVSYGDGSTTTGVYSSDTLTLT-GSNALK---- 238
Query: 201 NSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKG 260
+FGCG+ Q G A VDG+LG G+ SL+SQ A+ F++CL +
Sbjct: 239 --GFLFGCGHAQQGLF-----AGVDGLLGLGRQGQSLVSQ--ASSTYGGVFSYCLPPTQN 289
Query: 261 --GGIFAIGDVVSPKVKTTPMV--PNMP-HYNVILEEVEVGGNPLDLPTSLLGTGDERGT 315
G I G + TTP++ N P +Y V+L + VGG PL + S+ + G
Sbjct: 290 SVGYISLGGPSSTAGFSTTPLLTASNDPTYYIVMLAGISVGGQPLSIDASVFAS----GA 345
Query: 316 IIDSGTTLAYLPPMLYDLVLSQFRFWIA 343
++D+GT + LPP Y + S FR +A
Sbjct: 346 VVDTGTVVTRLPPTAYSALRSAFRAAMA 373
>gi|168063189|ref|XP_001783556.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664943|gb|EDQ51645.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 414
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 97/273 (35%), Positives = 128/273 (46%), Gaps = 25/273 (9%)
Query: 71 LGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC-AGCSRCPTKSDLGIKLTL 129
+GGN +P GLY+ + +G P YY+ +DTGSDL W+ C A C C L
Sbjct: 21 IGGNIYPD--GLYYMAMRIGNPAKLYYLDMDTGSDLTWLQCDAPCRSCAVGPH-----GL 73
Query: 130 FDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVR-CEYVVTYGDGSSTSGYFVRDIIQL 188
+DP ++ + C C +CS VR C+Y V Y DGSST G V D I L
Sbjct: 74 YDPKRARV---VDCRRPTCAQVQRGGQFTCSGDVRQCDYEVDYVDGSSTMGILVEDTITL 130
Query: 189 NQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVR 248
+G + + GCG Q G L + A DG++G + SL SQLAA G
Sbjct: 131 VLTNGTR----FQTRAVIGCGYDQQGTLAKAP-AVTDGVIGLSSSKISLPSQLAAKGIAN 185
Query: 249 KEFAHCLD-VVKGGGIFAIGDVVSPKV--KTTPMV--PNMPHYNVILEEVEVGGNPLDLP 303
HCL GGG GD + P + TPM+ P + Y L ++ GG L+L
Sbjct: 186 NVIGHCLAGGSNGGGYLFFGDTLVPALGMTWTPMIGRPLVEGYQARLRSIKYGGEVLELE 245
Query: 304 TSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLS 336
+ T D G + DSGT+ YL P Y VLS
Sbjct: 246 GT---TDDVGGAMFDSGTSFTYLVPNAYTAVLS 275
>gi|226504334|ref|NP_001141706.1| uncharacterized protein LOC100273835 precursor [Zea mays]
gi|194705620|gb|ACF86894.1| unknown [Zea mays]
gi|414885968|tpg|DAA61982.1| TPA: hypothetical protein ZEAMMB73_231717 [Zea mays]
Length = 477
Score = 128 bits (321), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 90/263 (34%), Positives = 127/263 (48%), Gaps = 27/263 (10%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
YFT + LGTP + V++DTGSD W+ C C C + + LFDPSKSST +I
Sbjct: 134 YFTSLRLGTPATDLLVELDTGSDQSWIQCKPCPDCYEQHE-----ALFDPSKSSTYSDIT 188
Query: 143 CSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNS 202
CS C+ ++ +CS +C Y +TY D S T G RD + L+
Sbjct: 189 CSSRECQELGSSHKHNCSSDKKCPYEITYADDSYTVGNLARDTLTLSPTDA-------VP 241
Query: 203 SVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL---DVVK 259
+FGCG+ +G G +DG+LG G+ +SL SQ+AA F++CL
Sbjct: 242 GFVFGCGHNNAGSFGE-----IDGLLGLGRGKASLSSQVAA--RYGAGFSYCLPSSPSAT 294
Query: 260 GGGIFAIGDVVSP-KVKTTPMVP--NMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTI 316
G F+ +P + T MV + Y + L + V G + +P S+ T GTI
Sbjct: 295 GYLSFSGAAAAAPTNAQFTEMVAGQHPSFYYLNLTGITVAGRAIKVPPSVFATA--AGTI 352
Query: 317 IDSGTTLAYLPPMLYDLVLSQFR 339
IDSGT + LPP Y + S R
Sbjct: 353 IDSGTAFSCLPPSAYAALRSSVR 375
>gi|147834977|emb|CAN67955.1| hypothetical protein VITISV_031916 [Vitis vinifera]
Length = 291
Score = 128 bits (321), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 71/170 (41%), Positives = 99/170 (58%), Gaps = 6/170 (3%)
Query: 44 ERERTLSALKQHDTRRHGRMMASI-----DLELGGNGHPSATGLYFTKVGLGTPTDEYYV 98
E+ L L+ D RHGR++ + D + G P GLYFTKV LG+P E+ V
Sbjct: 122 EKRVELEVLRARDQARHGRLLRGVVGGVVDFTVYGTSDPYLVGLYFTKVKLGSPPREFNV 181
Query: 99 QVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPS 158
Q+DTGSD+LWV C C+ CP S LGI+L+ FDPS SST+ ++CS C +
Sbjct: 182 QIDTGSDILWVTCNSCNDCPRTSGLGIELSFFDPSSSSTTSLVSCSHPICTSLVQTTAAE 241
Query: 159 CSPGV-RCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFG 207
CSP +C Y YGDGS T+GY+V D++ + G+ A ++S++FG
Sbjct: 242 CSPQSNQCSYSFHYGDGSGTTGYYVSDMLYFDTVLGDSLIANSSASIVFG 291
>gi|26452545|dbj|BAC43357.1| putative nucellin [Arabidopsis thaliana]
Length = 413
Score = 127 bits (320), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 96/281 (34%), Positives = 134/281 (47%), Gaps = 31/281 (11%)
Query: 59 RHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC-AGCSRC 117
R R ++S+ + GN +P G Y + +G P YY+ +DTGSDL W+ C A C RC
Sbjct: 26 RFTRAVSSVVFPVHGNVYP--LGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRC 83
Query: 118 PTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSST 177
L L+ PS S I C+D C+ + N C +C+Y V Y DG S+
Sbjct: 84 -----LEAPHPLYQPS----SDLIPCNDPLCKALHLNSNQRCETPEQCDYEVEYADGGSS 134
Query: 178 SGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSL 237
G VRD+ +N G L+ P + GCG Q G+S+ +DG+LG G+ S+
Sbjct: 135 LGVLVRDVFSMNYTQG-LRLTP---RLALGCGYDQIP--GASSHHPLDGVLGLGRGKVSI 188
Query: 238 LSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVV--SPKVKTTPMVPNM-PHYNVIL-EEV 293
LSQL + G V+ HCL + GGGI GD + S +V TPM HY+ + E+
Sbjct: 189 LSQLHSQGYVKNVIGHCLSSL-GGGILFFGDDLYDSSRVSWTPMSREYSKHYSPAMGGEL 247
Query: 294 EVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLV 334
GG L L T+ DSG++ Y Y V
Sbjct: 248 LFGGRTTGLKNLL--------TVFDSGSSYTYFNSKAYQAV 280
>gi|297798582|ref|XP_002867175.1| hypothetical protein ARALYDRAFT_328390 [Arabidopsis lyrata subsp.
lyrata]
gi|297313011|gb|EFH43434.1| hypothetical protein ARALYDRAFT_328390 [Arabidopsis lyrata subsp.
lyrata]
Length = 425
Score = 127 bits (320), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 96/281 (34%), Positives = 134/281 (47%), Gaps = 31/281 (11%)
Query: 59 RHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC-AGCSRC 117
R R ++S+ + GN +P G Y + +G P YY+ +DTGSDL W+ C A C RC
Sbjct: 38 RFTRAVSSVVFPVHGNVYP--LGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRC 95
Query: 118 PTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSST 177
L L+ PS S I C+D C+ + N C +C+Y V Y DG S+
Sbjct: 96 -----LEAPHPLYQPS----SDLIPCNDPLCKALHLNSNQRCETPEQCDYEVEYADGGSS 146
Query: 178 SGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSL 237
G VRD+ +N G L+ P + GCG Q G+S+ +DG+LG G+ S+
Sbjct: 147 LGVLVRDVFSMNYTKG-LRLTP---RLALGCGYDQIP--GASSHHPLDGVLGLGRGKVSI 200
Query: 238 LSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVV--SPKVKTTPMVPNM-PHYNVIL-EEV 293
LSQL + G V+ HCL + GGGI GD + S +V TPM HY+ + E+
Sbjct: 201 LSQLHSQGYVKNVIGHCLSSL-GGGILFFGDDLYDSSRVSWTPMSREYSKHYSPAMGGEL 259
Query: 294 EVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLV 334
GG L L T+ DSG++ Y Y V
Sbjct: 260 LFGGRTTGLKNLL--------TVFDSGSSYTYFNSKAYQAV 292
>gi|79495937|ref|NP_567922.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332660833|gb|AEE86233.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 401
Score = 127 bits (320), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 96/281 (34%), Positives = 134/281 (47%), Gaps = 31/281 (11%)
Query: 59 RHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC-AGCSRC 117
R R ++S+ + GN +P G Y + +G P YY+ +DTGSDL W+ C A C RC
Sbjct: 35 RFTRAVSSVVFPVHGNVYP--LGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRC 92
Query: 118 PTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSST 177
L L+ PS S I C+D C+ + N C +C+Y V Y DG S+
Sbjct: 93 -----LEAPHPLYQPS----SDLIPCNDPLCKALHLNSNQRCETPEQCDYEVEYADGGSS 143
Query: 178 SGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSL 237
G VRD+ +N G L+ P + GCG Q G+S+ +DG+LG G+ S+
Sbjct: 144 LGVLVRDVFSMNYTQG-LRLTP---RLALGCGYDQIP--GASSHHPLDGVLGLGRGKVSI 197
Query: 238 LSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVV--SPKVKTTPMVPNM-PHYNVIL-EEV 293
LSQL + G V+ HCL + GGGI GD + S +V TPM HY+ + E+
Sbjct: 198 LSQLHSQGYVKNVIGHCLSSL-GGGILFFGDDLYDSSRVSWTPMSREYSKHYSPAMGGEL 256
Query: 294 EVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLV 334
GG L L T+ DSG++ Y Y V
Sbjct: 257 LFGGRTTGLKNLL--------TVFDSGSSYTYFNSKAYQAV 289
>gi|334187133|ref|NP_001190905.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|21592493|gb|AAM64443.1| nucellin-like protein [Arabidopsis thaliana]
gi|332660834|gb|AEE86234.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 425
Score = 127 bits (320), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 96/281 (34%), Positives = 134/281 (47%), Gaps = 31/281 (11%)
Query: 59 RHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC-AGCSRC 117
R R ++S+ + GN +P G Y + +G P YY+ +DTGSDL W+ C A C RC
Sbjct: 38 RFTRAVSSVVFPVHGNVYP--LGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRC 95
Query: 118 PTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSST 177
L L+ PS S I C+D C+ + N C +C+Y V Y DG S+
Sbjct: 96 -----LEAPHPLYQPS----SDLIPCNDPLCKALHLNSNQRCETPEQCDYEVEYADGGSS 146
Query: 178 SGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSL 237
G VRD+ +N G L+ P + GCG Q G+S+ +DG+LG G+ S+
Sbjct: 147 LGVLVRDVFSMNYTQG-LRLTP---RLALGCGYDQIP--GASSHHPLDGVLGLGRGKVSI 200
Query: 238 LSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVV--SPKVKTTPMVPNM-PHYNVIL-EEV 293
LSQL + G V+ HCL + GGGI GD + S +V TPM HY+ + E+
Sbjct: 201 LSQLHSQGYVKNVIGHCLSSL-GGGILFFGDDLYDSSRVSWTPMSREYSKHYSPAMGGEL 259
Query: 294 EVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLV 334
GG L L T+ DSG++ Y Y V
Sbjct: 260 LFGGRTTGLKNLL--------TVFDSGSSYTYFNSKAYQAV 292
>gi|356527089|ref|XP_003532146.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 488
Score = 127 bits (319), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 91/268 (33%), Positives = 127/268 (47%), Gaps = 30/268 (11%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSR-CPTKSDLGIKLTLFDPSKSSTS 138
+G YF VGLGTP + + DTGSDL W C C+R C + D +FDPSKS++
Sbjct: 142 SGNYFVVVGLGTPKRDLSLIFDTGSDLTWTQCEPCARSCYKQQD-----AIFDPSKSTSY 196
Query: 139 GEIACSDNFCR--TTYNNRYPSCSPGVR-CEYVVTYGDGSSTSGYFVRDIIQLNQASGNL 195
I C+ C +T P CS + C Y + YGD S + GYF R+ + ++
Sbjct: 197 SNITCTSTLCTQLSTATGNEPGCSASTKACIYGIQYGDSSFSVGYFSRERL-------SV 249
Query: 196 KTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL 255
+ + +FGCG G G S G++G G+ S + Q AA RK F++CL
Sbjct: 250 TATDIVDNFLFGCGQNNQGLFGGSA-----GLIGLGRHPISFVQQTAAV--YRKIFSYCL 302
Query: 256 DVVKGG-GIFAIGDVVSPKVKTTP---MVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGD 311
G + G + VK TP + Y + + + VGG L + +S TG
Sbjct: 303 PATSSSTGRLSFGTTTTSYVKYTPFSTISRGSSFYGLDITGISVGGAKLPVSSSTFSTG- 361
Query: 312 ERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
G IIDSGT + LPP Y + S FR
Sbjct: 362 --GAIIDSGTVITRLPPTAYTALRSAFR 387
>gi|225217022|gb|ACN85307.1| aspartic proteinase nepenthesin-1 precursor [Oryza ridleyi]
Length = 525
Score = 127 bits (319), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 100/272 (36%), Positives = 125/272 (45%), Gaps = 33/272 (12%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGC-SRCPTKSDLGIKLTLFDPSKSSTS 138
TG Y +GLGTP Y V DTGSD WV C C C + + LFDP++SST
Sbjct: 183 TGNYVVTIGLGTPAGRYTVVFDTGSDTTWVQCEPCVVVCYEQQE-----KLFDPARSSTD 237
Query: 139 GEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTA 198
I+C+ C Y CS G C Y V YGDGS + G+F D + L+
Sbjct: 238 ANISCAAPACSDLYTK---GCS-GGHCLYGVQYGDGSYSIGFFAMDTLTLSSYDA----- 288
Query: 199 PLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVV 258
FGCG R G G + G+LG G+ +SL Q A FAHC
Sbjct: 289 --IKGFRFGCGERNEGLFGEAA-----GLLGLGRGKTSLPVQ--AYDKYGGVFAHCFPAR 339
Query: 259 KGG-GIFAIGDVVSPKVK---TTPMVPN--MPHYNVILEEVEVGGNPLDLPTSLLGTGDE 312
G G G SP V TTPM+ + + Y V L + VGG L +P S+ T
Sbjct: 340 SSGTGYLDFGPGSSPAVSTKLTTPMLVDNGLTFYYVGLTGIRVGGKLLSIPPSVFTTA-- 397
Query: 313 RGTIIDSGTTLAYLPPMLYDLVLSQFRFWIAS 344
GTI+DSGT + LPP Y + S F IA+
Sbjct: 398 -GTIVDSGTVITRLPPAAYSSLRSAFASAIAA 428
>gi|255586860|ref|XP_002534040.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223525947|gb|EEF28344.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 518
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 104/316 (32%), Positives = 157/316 (49%), Gaps = 37/316 (11%)
Query: 39 FKAGGERERTLSALKQHDTRRHGRMMASIDLELG---GNG--HPSATG-LYFTKVGLGTP 92
F + G E + L D GR + +++ L GN S+ G L++T V LGTP
Sbjct: 52 FPSKGSFEY-YAELAHRDQMLRGRKLYNVEAPLAFSDGNSTFRISSLGFLHYTTVELGTP 110
Query: 93 TDEYYVQVDTGSDLLWVNCAGCSRC-PTK-----SDLGIKLTLFDPSKSSTSGEIACSDN 146
++ V +DTGSDL WV C CS+C PT+ SD +L+++DP +SSTS ++ C++N
Sbjct: 111 GMKFMVALDTGSDLFWVPC-DCSKCAPTQGVAYASDF--ELSIYDPKQSSTSKKVTCNNN 167
Query: 147 FCRTTYNNRYPSCSPGVRCEYVVTYGDG-SSTSGYFVRDIIQLNQASGNLKTAPLNSSVI 205
C + NR C Y+V+Y +STSG V D++ L N ++ + + V
Sbjct: 168 LC--AHRNR--CLGTFSSCPYMVSYVSAQTSTSGILVEDVLHLTSEDSNQES--IKAYVT 221
Query: 206 FGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFA 265
FGCG QSG ++ AA +G+ G G S+ S L+ G F+ C G G +
Sbjct: 222 FGCGQVQSGSFLNT--AAPNGLFGLGMDQISVPSILSREGLTADSFSMCFG-HDGVGRIS 278
Query: 266 IGDVVSPKVKTTPM--VPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTL 323
GD SP + TP P+ P YN+ + +V VG +D+ + + DSGT+
Sbjct: 279 FGDKGSPDQEETPFNSNPSHPSYNISVTQVRVGTTLVDV---------DFTALFDSGTSF 329
Query: 324 AYLPPMLYDLVLSQFR 339
YL +Y +V F
Sbjct: 330 TYLINPIYAMVSENFH 345
>gi|226530755|ref|NP_001150335.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195638492|gb|ACG38714.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 461
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 91/267 (34%), Positives = 131/267 (49%), Gaps = 38/267 (14%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y VGLG+P + +DTGSD+ WV C CS+C +++D LFDPS SST +
Sbjct: 128 YLITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQAD-----PLFDPSSSSTYSPFS 182
Query: 143 CSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNS 202
C C CS +C+Y+VTYGDGSST+G + D + L ++
Sbjct: 183 CGSAAC-AQLGQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLALGSSAVK-------- 233
Query: 203 SVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGG 262
S FGC N +SG + DG++G G SL+SQ AG + + F++CL
Sbjct: 234 SFQFGCSNVESG-----FNDQTDGLMGLGGGAQSLVSQ--TAGTLGRAFSYCLPPTPSSS 286
Query: 263 IF----------AIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDE 312
F G V +P ++++ VP Y V L+ + VGG L +P S+
Sbjct: 287 GFLTLGAAGGSGTSGFVKTPMLRSS-QVPTF--YGVRLQAIRVGGRQLSIPASVF----S 339
Query: 313 RGTIIDSGTTLAYLPPMLYDLVLSQFR 339
GT++DSGT + LPP Y + S F+
Sbjct: 340 AGTVMDSGTVITRLPPTAYSALSSAFK 366
>gi|413923784|gb|AFW63716.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 531
Score = 125 bits (315), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 91/267 (34%), Positives = 131/267 (49%), Gaps = 38/267 (14%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y VGLG+P + +DTGSD+ WV C CS+C +++D LFDPS SST +
Sbjct: 198 YLITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQAD-----PLFDPSSSSTYSPFS 252
Query: 143 CSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNS 202
C C CS +C+Y+VTYGDGSST+G + D + L ++
Sbjct: 253 CGSADC-AQLGQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLALGSSAVR-------- 303
Query: 203 SVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGG 262
S FGC N +SG + DG++G G SL+SQ AG + + F++CL
Sbjct: 304 SFQFGCSNVESG-----FNDQTDGLMGLGGGAQSLVSQ--TAGTLGRAFSYCLPPTPSSS 356
Query: 263 IF----------AIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDE 312
F G V +P ++++ VP Y V L+ + VGG L +P S+
Sbjct: 357 GFLTLGAAGGSGTSGFVKTPMLRSS-QVPTF--YGVRLQAIRVGGRQLSIPASVFSA--- 410
Query: 313 RGTIIDSGTTLAYLPPMLYDLVLSQFR 339
GT++DSGT + LPP Y + S F+
Sbjct: 411 -GTVMDSGTVITRLPPTAYSALSSAFK 436
>gi|194707866|gb|ACF88017.1| unknown [Zea mays]
Length = 461
Score = 125 bits (313), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 91/267 (34%), Positives = 131/267 (49%), Gaps = 38/267 (14%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y VGLG+P + +DTGSD+ WV C CS+C +++D LFDPS SST +
Sbjct: 128 YLITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQAD-----PLFDPSSSSTYSPFS 182
Query: 143 CSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNS 202
C C CS +C+Y+VTYGDGSST+G + D + L ++
Sbjct: 183 CGSADC-AQLGQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLALGSSAVR-------- 233
Query: 203 SVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGG 262
S FGC N +SG + DG++G G SL+SQ AG + + F++CL
Sbjct: 234 SFQFGCSNVESG-----FNDQTDGLMGLGGGAQSLVSQ--TAGTLGRAFSYCLPPTPSSS 286
Query: 263 IF----------AIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDE 312
F G V +P ++++ VP Y V L+ + VGG L +P S+
Sbjct: 287 GFLTLGAAGGSGTSGFVKTPMLRSS-QVPTF--YGVRLQAIRVGGRQLSIPASVF----S 339
Query: 313 RGTIIDSGTTLAYLPPMLYDLVLSQFR 339
GT++DSGT + LPP Y + S F+
Sbjct: 340 AGTVMDSGTVITRLPPTAYSALSSAFK 366
>gi|255545620|ref|XP_002513870.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223546956|gb|EEF48453.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 535
Score = 125 bits (313), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 93/270 (34%), Positives = 138/270 (51%), Gaps = 31/270 (11%)
Query: 82 LYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKS-----DLGIKLTLFDPSKSS 136
L++T + +GTP + V +D GSDL WV C C +C S L L+ + PS S+
Sbjct: 101 LHYTWIDIGTPNVSFLVALDAGSDLSWVPC-DCIQCAPLSASLYKPLDRDLSEYRPSLST 159
Query: 137 TSGEIACSDNFCRT---TYNNRYPSCSPGVRCEYVVTYGD-GSSTSGYFVRDIIQLNQAS 192
TS ++C+ C N + P C Y+ Y D +S+SG+ V DI+ L S
Sbjct: 160 TSRHLSCNHQLCELGSHCKNLKDP-------CPYIADYADPNTSSSGFLVEDILHLASVS 212
Query: 193 --GNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKE 250
N + +SVI GCG +Q+G G AA DG++G G + S+ S LA AG +RK
Sbjct: 213 DDSNSTQKRVQASVILGCGRKQTG--GYLDGAAPDGVMGLGPGSISVPSLLAKAGLIRKS 270
Query: 251 FAHCLDVVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVE--VGGNPLDLPTSLLG 308
F+ C D V G G GD K+TP++P +Y+ L EVE GN + L
Sbjct: 271 FSLCFD-VNGSGTILFGDQGHTSQKSTPLLPTQGNYDAYLIEVESYCVGN-----SCLKQ 324
Query: 309 TGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
+G + ++DSG + YLP +Y+ ++ +F
Sbjct: 325 SGFK--ALVDSGASFTYLPIDVYNKIVLEF 352
>gi|326531454|dbj|BAJ97731.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 124 bits (312), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 95/272 (34%), Positives = 133/272 (48%), Gaps = 40/272 (14%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCS--RCPTKSDLGIKLTLFDPSKSSTSGE 140
Y V LGTP V+VDTGSD+ WV C CS C ++ D LFDP+KSST
Sbjct: 143 YVVTVSLGTPGVSQTVEVDTGSDVSWVQCKPCSAPACNSQRD-----QLFDPAKSSTYSA 197
Query: 141 IACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPL 200
+ C + C Y + G +C YVV+YGDGS+T+G + D + L AP
Sbjct: 198 VPCGADACSEL--RIYEAGCSGSQCGYVVSYGDGSNTTGVYGSDTLAL---------APG 246
Query: 201 NS--SVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVV 258
N+ + +FGCG+ Q+G A +DG+L G+ + SL SQ AAG F++CL
Sbjct: 247 NTVGTFLFGCGHAQAGMF-----AGIDGLLALGRQSMSLKSQ--AAGAYGGVFSYCLPSK 299
Query: 259 KG-------GGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGD 311
+ GG + + + T P Y V+L + VGG + +P S
Sbjct: 300 QSAAGYLTLGGPTSASGFATTGLLTAWAAPTF--YMVMLTGISVGGQQVAVPASAF---- 353
Query: 312 ERGTIIDSGTTLAYLPPMLYDLVLSQFRFWIA 343
GT++D+GT + LPP Y + S FR IA
Sbjct: 354 AGGTVVDTGTVITRLPPTAYAALRSAFRGAIA 385
>gi|414887402|tpg|DAA63416.1| TPA: hypothetical protein ZEAMMB73_414910 [Zea mays]
Length = 407
Score = 124 bits (312), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 89/287 (31%), Positives = 140/287 (48%), Gaps = 45/287 (15%)
Query: 62 RMMASIDLELGGNGHPSA----------TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC 111
R+ AS+ LG HP+A G Y T++ +GTP E+ + VD+GS + +V C
Sbjct: 58 RLAASLRRGLGDGAHPNARMRLHDDLLTNGYYTTRLYIGTPPQEFALIVDSGSTVTYVPC 117
Query: 112 AGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTY 171
A C +C D F P SS+ + C+ + C + + +C Y Y
Sbjct: 118 ASCEQCGNHQD-----PRFQPDLSSSYSPVKCNVD-CTCDSDKK--------QCTYERQY 163
Query: 172 GDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFG 231
+ SS+SG DI+ + S LK +FGC N ++GDL S DGI+G G
Sbjct: 164 AEMSSSSGVLGEDIVSFGRES-ELKA----QRAVFGCENSETGDLFSQ---HADGIMGLG 215
Query: 232 QANSSLLSQLAAAGNVRKEFAHC---LDVVKGGGIFAIGDVVSPK----VKTTPMVPNMP 284
+ S++ QL G + F+ C +D+ GGG +G V +P ++ P+ P
Sbjct: 216 RGQLSIMDQLVEKGVINDSFSLCYGGMDI--GGGAMVLGGVPTPSDMVFSRSDPL--RSP 271
Query: 285 HYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLY 331
+YN+ L+E+ V G L + + + + + GT++DSGTT AYLP +
Sbjct: 272 YYNIELKEIHVAGKALRVDSRIFDS--KHGTVLDSGTTYAYLPEQAF 316
>gi|223950123|gb|ACN29145.1| unknown [Zea mays]
gi|413923785|gb|AFW63717.1| hypothetical protein ZEAMMB73_445506 [Zea mays]
Length = 385
Score = 124 bits (312), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 91/267 (34%), Positives = 131/267 (49%), Gaps = 38/267 (14%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y VGLG+P + +DTGSD+ WV C CS+C +++D LFDPS SST +
Sbjct: 52 YLITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQAD-----PLFDPSSSSTYSPFS 106
Query: 143 CSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNS 202
C C CS +C+Y+VTYGDGSST+G + D + L ++
Sbjct: 107 CGSADC-AQLGQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLALGSSAVR-------- 157
Query: 203 SVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGG 262
S FGC N +SG + DG++G G SL+SQ AG + + F++CL
Sbjct: 158 SFQFGCSNVESG-----FNDQTDGLMGLGGGAQSLVSQ--TAGTLGRAFSYCLPPTPSSS 210
Query: 263 IF----------AIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDE 312
F G V +P ++++ VP Y V L+ + VGG L +P S+
Sbjct: 211 GFLTLGAAGGSGTSGFVKTPMLRSS-QVPTF--YGVRLQAIRVGGRQLSIPASVF----S 263
Query: 313 RGTIIDSGTTLAYLPPMLYDLVLSQFR 339
GT++DSGT + LPP Y + S F+
Sbjct: 264 AGTVMDSGTVITRLPPTAYSALSSAFK 290
>gi|312282765|dbj|BAJ34248.1| unnamed protein product [Thellungiella halophila]
Length = 515
Score = 124 bits (312), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 105/356 (29%), Positives = 157/356 (44%), Gaps = 50/356 (14%)
Query: 12 VTVAVVHQWAVGGGGVMGNFVFEVENKFK------------AGGERERTLSALKQHDTRR 59
+ + +V W + +G F FE ++F + + + D
Sbjct: 14 LILMLVSSWVLDRCEGLGEFGFEFHHRFSDQVVGVLPGDGLPNRDSSKYYRVMAHRDRLI 73
Query: 60 HGRMMASIDLEL----GGNG--HPSATG-LYFTKVGLGTPTDEYYVQVDTGSDLLWVNCA 112
GR +AS D L GN +A G L++ V +GTP+D + V +DTGSDL W+ C
Sbjct: 74 RGRRLASEDQSLVTFADGNETIRVNALGFLHYANVTVGTPSDWFLVALDTGSDLFWLPCD 133
Query: 113 GCSRC------PTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSC-SPGVRC 165
+ C P S L L ++ P+ SSTS ++ C+ C R C SP C
Sbjct: 134 CSTNCVRELKAPGGSSL--DLNIYSPNASSTSSKVPCNSTLC-----TRVDRCASPLSDC 186
Query: 166 EYVVTY-GDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAV 224
Y + Y +G+S++G V D++ L N K P+ + + GCG Q+G AA
Sbjct: 187 PYQIRYLSNGTSSTGVLVEDVLHLVSMEKNSK--PIRARITLGCGLVQTGVFHDG--AAP 242
Query: 225 DGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSPKVKTTPMVPNMP 284
+G+ G G + S+ S LA G F+ C G G + GD S + TP+ P
Sbjct: 243 NGLFGLGLEDISVPSVLAKEGIAANSFSMCFG-DDGAGRISFGDKGSVDQRETPLNIRQP 301
Query: 285 H--YNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
H YNV + ++ VGGN DL E + D+GT+ YL Y L+ F
Sbjct: 302 HPTYNVTVTQISVGGNTGDL---------EFDAVFDTGTSFTYLTDAPYTLISESF 348
>gi|326495920|dbj|BAJ90582.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 124 bits (311), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 95/272 (34%), Positives = 133/272 (48%), Gaps = 40/272 (14%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCS--RCPTKSDLGIKLTLFDPSKSSTSGE 140
Y V LGTP V+VDTGSD+ WV C CS C ++ D LFDP+KSST
Sbjct: 143 YVVTVSLGTPGVSQTVEVDTGSDVSWVQCKPCSAPACNSQRD-----QLFDPAKSSTYSA 197
Query: 141 IACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPL 200
+ C + C Y + G +C YVV+YGDGS+T+G + D + L AP
Sbjct: 198 VPCGADACSEL--RIYEAGCSGSQCGYVVSYGDGSNTTGVYGSDTLAL---------APG 246
Query: 201 NS--SVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVV 258
N+ + +FGCG+ Q+G A +DG+L G+ + SL SQ AAG F++CL
Sbjct: 247 NTVGTFLFGCGHAQAGMF-----AGIDGLLALGRQSMSLKSQ--AAGAYGGVFSYCLPSK 299
Query: 259 KG-------GGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGD 311
+ GG + + + T P Y V+L + VGG + +P S
Sbjct: 300 QSAAGYLTLGGPSSASGFATTGLLTAWAAPTF--YMVMLTGISVGGQQVAVPASAF---- 353
Query: 312 ERGTIIDSGTTLAYLPPMLYDLVLSQFRFWIA 343
GT++D+GT + LPP Y + S FR IA
Sbjct: 354 AGGTVVDTGTVITRLPPTAYAALRSAFRGAIA 385
>gi|242066142|ref|XP_002454360.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
gi|241934191|gb|EES07336.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
Length = 466
Score = 124 bits (311), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 96/263 (36%), Positives = 127/263 (48%), Gaps = 35/263 (13%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y V LG+P + +DTGSD+ WV C CS+C +++D LFDPS SST +
Sbjct: 133 YLITVRLGSPGKSQTMLIDTGSDVSWVQCKPCSQCHSQAD-----PLFDPSSSSTYSPFS 187
Query: 143 CSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNS 202
CS C CS +C+Y VTYGDGSST+G + D + L +
Sbjct: 188 CSSAAC-AQLGQEGNGCSSS-QCQYTVTYGDGSSTTGTYSSDTLALGSNAVR-------- 237
Query: 203 SVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVV-KGG 261
FGC N +SG + DG++G G SL+SQ AG F++CL
Sbjct: 238 KFQFGCSNVESG-----FNDQTDGLMGLGGGAQSLVSQ--TAGTFGAAFSYCLPATSSSS 290
Query: 262 GIFAIGDVVSPKVKTTPM-----VPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTI 316
G +G S VK TPM VP Y V ++ + VGG L +PTS+ GTI
Sbjct: 291 GFLTLGAGTSGFVK-TPMLRSSQVPTF--YGVRIQAIRVGGRQLSIPTSVF----SAGTI 343
Query: 317 IDSGTTLAYLPPMLYDLVLSQFR 339
+DSGT L LPP Y + S F+
Sbjct: 344 MDSGTVLTRLPPTAYSALSSAFK 366
>gi|4490316|emb|CAB38807.1| nucellin-like protein [Arabidopsis thaliana]
gi|7270297|emb|CAB80066.1| nucellin-like protein [Arabidopsis thaliana]
Length = 420
Score = 124 bits (311), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 94/276 (34%), Positives = 132/276 (47%), Gaps = 31/276 (11%)
Query: 64 MASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC-AGCSRCPTKSD 122
++S+ + GN +P G Y + +G P YY+ +DTGSDL W+ C A C RC
Sbjct: 21 VSSVVFPVHGNVYP--LGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVRC----- 73
Query: 123 LGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFV 182
L L+ PS S I C+D C+ + N C +C+Y V Y DG S+ G V
Sbjct: 74 LEAPHPLYQPS----SDLIPCNDPLCKALHLNSNQRCETPEQCDYEVEYADGGSSLGVLV 129
Query: 183 RDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLA 242
RD+ +N G L+ P + GCG Q G+S+ +DG+LG G+ S+LSQL
Sbjct: 130 RDVFSMNYTQG-LRLTP---RLALGCGYDQIP--GASSHHPLDGVLGLGRGKVSILSQLH 183
Query: 243 AAGNVRKEFAHCLDVVKGGGIFAIGDVV--SPKVKTTPMVPNM-PHYNVIL-EEVEVGGN 298
+ G V+ HCL + GGGI GD + S +V TPM HY+ + E+ GG
Sbjct: 184 SQGYVKNVIGHCLSSL-GGGILFFGDDLYDSSRVSWTPMSREYSKHYSPAMGGELLFGGR 242
Query: 299 PLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLV 334
L L T+ DSG++ Y Y V
Sbjct: 243 TTGLKNLL--------TVFDSGSSYTYFNSKAYQAV 270
>gi|224136884|ref|XP_002326969.1| predicted protein [Populus trichocarpa]
gi|222835284|gb|EEE73719.1| predicted protein [Populus trichocarpa]
Length = 626
Score = 124 bits (311), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 98/313 (31%), Positives = 141/313 (45%), Gaps = 73/313 (23%)
Query: 55 HDTRRH--------GRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDL 106
H +RRH RM DL + G Y T++ +GTP E+ + VDTGS +
Sbjct: 49 HYSRRHLQNSELPNARMRLFDDL--------LSNGYYTTRLFIGTPPQEFALIVDTGSTV 100
Query: 107 LWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCS---PGV 163
+V C+ C +C D F P SST + C+ PSC+ G
Sbjct: 101 TYVPCSSCEQCGKHQD-----PRFQPDLSSTYRPVKCN------------PSCNCDDEGK 143
Query: 164 RCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAA 223
+C Y Y + SS+SG D++ S LK +FGC N ++GDL S
Sbjct: 144 QCTYERRYAEMSSSSGVIAEDVVSFGNES-ELKP----QRAVFGCENVETGDLYSQR--- 195
Query: 224 VDGILGFGQANSSLLSQLAAAGNVRKEFAHC---LDVVKGGGIFAIGDVVSPKVKTTPMV 280
DGI+G G+ S++ QL G + F+ C +DV GGG +G + P
Sbjct: 196 ADGIMGLGRGRLSVVDQLVDKGVIGDSFSLCYGGMDV--GGGAMVLGQISPP-------- 245
Query: 281 PNM----------PHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPP-- 328
PNM P+YN+ L+E+ V G PL L + ++ GT++DSGTT AY P
Sbjct: 246 PNMVFSHSNPYRSPYYNIELKELHVAGKPLKLKPKVF--DEKHGTVLDSGTTYAYFPEAA 303
Query: 329 --MLYDLVLSQFR 339
L D ++ + R
Sbjct: 304 FHALKDAIMKEIR 316
>gi|312282457|dbj|BAJ34094.1| unnamed protein product [Thellungiella halophila]
Length = 424
Score = 124 bits (311), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 94/280 (33%), Positives = 130/280 (46%), Gaps = 29/280 (10%)
Query: 59 RHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC-AGCSRC 117
R R +S+ + GN +P G Y + +G P YY+ +DTGSDL W+ C A C C
Sbjct: 35 RFTRAASSVVFPVHGNVYP--LGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCDAPCVHC 92
Query: 118 PTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSST 177
L L+ PS I C+D C+ + N C +C+Y V Y DG S+
Sbjct: 93 -----LEAPHPLYQPSNDL----IPCNDPLCKALHFNGNHRCETPEQCDYEVEYADGGSS 143
Query: 178 SGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSL 237
G VRD+ LN G L+ P + GCG Q G+S +DG+LG G+ S+
Sbjct: 144 LGVLVRDVFSLNYTKG-LRLTP---RLALGCGYDQIP--GASGHHPLDGVLGLGRGKVSI 197
Query: 238 LSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVV-SPKVKTTPMV-PNMPHYNVIL-EEVE 294
LSQL + G V+ HCL + GG +F D+ S +V TPM N HY+ + E+
Sbjct: 198 LSQLHSQGYVKNVVGHCLSSLGGGILFFGNDLYDSSRVSWTPMARENSKHYSPAMGGELL 257
Query: 295 VGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLV 334
GG L L T+ DSG++ Y Y V
Sbjct: 258 FGGRTTGLKNLL--------TVFDSGSSYTYFNSKAYQAV 289
>gi|225217008|gb|ACN85295.1| aspartic proteinase nepenthesin-1 precursor [Oryza coarctata]
Length = 500
Score = 124 bits (311), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 105/314 (33%), Positives = 139/314 (44%), Gaps = 37/314 (11%)
Query: 36 ENKFKAGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDE 95
+N+ K+ R T + + + +R+ + + +G TG Y +GLGTP
Sbjct: 120 QNRAKSIQRRVSTTTTVSRGKPKRNRPSLPA------SSGSALGTGNYVVTIGLGTPAGR 173
Query: 96 YYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNR 155
Y V DTGSD WV C C K + LFDP++SST I+C+ C Y
Sbjct: 174 YTVVFDTGSDTTWVQCEPCVVVCYKQ----QEKLFDPARSSTYANISCAAPACSDLYIK- 228
Query: 156 YPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGD 215
CS G C Y V YGDGS + G+F D + L+ FGCG R G
Sbjct: 229 --GCSGG-HCLYGVQYGDGSYSIGFFAMDTLTLSSYDA-------IKGFRFGCGERNEGL 278
Query: 216 LGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGG-GIFAIG----DVV 270
G + G+LG G+ +SL Q A FAHC G G G V
Sbjct: 279 YGEAA-----GLLGLGRGKTSLPVQ--AYDKYGGVFAHCFPARSSGTGYLDFGPGSLPAV 331
Query: 271 SPKVKTTPMVPNMP-HYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPM 329
S K+ T +V N P Y V L + VGG L +P S+ T GTI+DSGT + LPP
Sbjct: 332 SAKLTTPMLVDNGPTFYYVGLTGIRVGGKLLSIPQSVFTT---SGTIVDSGTVITRLPPA 388
Query: 330 LYDLVLSQFRFWIA 343
Y + S F +A
Sbjct: 389 AYSSLRSAFASAMA 402
>gi|413953772|gb|AFW86421.1| hypothetical protein ZEAMMB73_098827 [Zea mays]
Length = 482
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 109/333 (32%), Positives = 154/333 (46%), Gaps = 40/333 (12%)
Query: 29 GNFVFEVENKFKAGGERERTLSALKQHDT---RRHGRMMASIDLELGGNGH-----PSAT 80
GN + V G+R +T+ H T RR + SI L G G P++
Sbjct: 59 GNTIQIVHRACLQSGDR-KTVPDHHPHYTGILRRDHNRVRSIHRRLTGAGDTAATIPASL 117
Query: 81 GL------YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSK 134
GL Y +G+GTP + V DTGSDL WV C C T S + LFDPSK
Sbjct: 118 GLAFHSLEYVVTIGIGTPARNFTVLFDTGSDLTWVQCKPC----TDSCYQQQEPLFDPSK 173
Query: 135 SSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGN 194
SST ++ C C+ + +C G CEY V YGD S T G ++ L+
Sbjct: 174 SSTYVDVPCGTPQCKIG-GGQDLTCG-GTTCEYSVKYGDQSVTRGNLAQEAFTLS----- 226
Query: 195 LKTAPLNSSVIFGCGNR-QSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAH 253
+AP + V+FGC + SG G+ + +V G+LG G+ +SS+LSQ GN F++
Sbjct: 227 -PSAPPAAGVVFGCSHEYSSGVKGAEEEMSVAGLLGLGRGDSSILSQ-TRRGNSGDVFSY 284
Query: 254 CLDVV-KGGGIFAIGDVVSPK--VKTTPMVPNMPH----YNVILEEVEVGGNPLDLPTSL 306
CL G IG P+ + TP+V + Y V L + V G L + S
Sbjct: 285 CLPPRGSSAGYLTIGAAAPPQSNLSFTPLVTDNSQLSSVYVVNLVGISVSGAALPIDASA 344
Query: 307 LGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
GT+IDSGT + ++P Y ++ +FR
Sbjct: 345 FYI----GTVIDSGTVITHMPAAAYYVLRDEFR 373
>gi|326512608|dbj|BAJ99659.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 484
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 89/263 (33%), Positives = 122/263 (46%), Gaps = 29/263 (11%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSG 139
TG Y +GLGTP + V DTGSDL WV C CS C + D LFDP++SST
Sbjct: 143 TGNYVVSMGLGTPARDMTVVFDTGSDLSWVQCTPCSDCYEQKD-----PLFDPARSSTYS 197
Query: 140 EIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAP 199
+ C+ C+ + SCS +C Y V YGD S T G RD + L Q+
Sbjct: 198 AVPCASPECQGLDSR---SCSRDKKCRYEVVYGDQSQTDGALARDTLTLTQSD------- 247
Query: 200 LNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL-DVV 258
+ +FGCG + +G G + DG++G G+ SL SQ AA F++CL
Sbjct: 248 VLPGFVFGCGEQDTGLFGRA-----DGLVGLGREKVSLSSQ--AASKYGAGFSYCLPSSP 300
Query: 259 KGGGIFAIGDVVSPKVKTTPMVPNM---PHYNVILEEVEVGGNPLDLPTSLLGTGDERGT 315
G ++G + T M Y V L V+V G + + + GT
Sbjct: 301 SAAGYLSLGGPAPANARFTAMETRHDSPSFYYVRLVGVKVAGRTVRVSPIVFSAA---GT 357
Query: 316 IIDSGTTLAYLPPMLYDLVLSQF 338
+IDSGT + LPP +Y + S F
Sbjct: 358 VIDSGTVITRLPPRVYAALRSAF 380
>gi|242066176|ref|XP_002454377.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
gi|241934208|gb|EES07353.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
Length = 474
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 95/274 (34%), Positives = 134/274 (48%), Gaps = 41/274 (14%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCS--RCPTKSDLGIKLTLFDPSKSSTSGE 140
Y V LGTP ++VDTGSDL WV C C+ C ++ D LFDP++SS+
Sbjct: 140 YVVTVSLGTPGVAQTLEVDTGSDLSWVQCTPCAAPACYSQKD-----PLFDPAQSSSYAA 194
Query: 141 IACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPL 200
+ C C Y S +C YVV+YGDGS T+G + D + L+
Sbjct: 195 VPCGGPVCGGL--GIYASSCSAAQCGYVVSYGDGSKTTGVYSSDTLTLSP---------- 242
Query: 201 NSSV---IFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDV 257
N +V FGCG+ QSG G+ DG+LG G+ +SL+ Q AG F++CL
Sbjct: 243 NDAVRGFFFGCGHAQSGFTGN------DGLLGLGREEASLVEQ--TAGTYGGVFSYCLPT 294
Query: 258 VKG-GGIFAIG---DVVSPKVKTTPMV--PNMP-HYNVILEEVEVGGNPLDLPTSLLGTG 310
G +G P TT ++ PN +Y V+L + VGG L +P+S+
Sbjct: 295 RPSTTGYLTLGGPSGAAPPGFSTTQLLSSPNAATYYVVMLTGISVGGQQLSVPSSVF--- 351
Query: 311 DERGTIIDSGTTLAYLPPMLYDLVLSQFRFWIAS 344
GT++D+GT + LPP Y + S FR +AS
Sbjct: 352 -AGGTVVDTGTVITRLPPTAYAALRSAFRSGMAS 384
>gi|225217056|gb|ACN85339.1| aspartic proteinase nepenthesin-1 precursor [Oryza granulata]
Length = 521
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 99/278 (35%), Positives = 127/278 (45%), Gaps = 33/278 (11%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPS 133
+G TG Y +GLGTP Y V DTGSD WV C C K + LFDP+
Sbjct: 173 SGRALGTGNYVVTIGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYKQ----QEKLFDPA 228
Query: 134 KSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASG 193
+SST ++C+ C Y CS G C Y V YGDGS + G+F D + L+
Sbjct: 229 RSSTYANVSCAAPACSDLYTR---GCSGG-HCLYSVQYGDGSYSIGFFAMDTLTLSSYDA 284
Query: 194 NLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQ-LAAAGNVRKEFA 252
FGCG R G G + G+LG G+ +SL Q G V FA
Sbjct: 285 -------VKGFRFGCGERNEGLFGEAA-----GLLGLGRGKTSLPVQTYDKYGGV---FA 329
Query: 253 HCLDVVKGGGIFAIGDVVSPKV----KTTPMVP-NMP-HYNVILEEVEVGGNPLDLPTSL 306
HCL G + SP +TTPM+ N P Y V + + VGG L +P S+
Sbjct: 330 HCLPARSSGTGYLDFGPGSPAAVGARQTTPMLTDNGPTFYYVGMTGIRVGGQLLSIPQSV 389
Query: 307 LGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFRFWIAS 344
T GTI+DSGT + LPP Y + S F +A+
Sbjct: 390 FSTA---GTIVDSGTVITRLPPAAYSSLRSAFASAMAA 424
>gi|357517935|ref|XP_003629256.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355523278|gb|AET03732.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 544
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 97/302 (32%), Positives = 141/302 (46%), Gaps = 32/302 (10%)
Query: 50 SALKQHDTRRHGRMMAS-----IDLELGGNGHPSATG--LYFTKVGLGTPTDEYYVQVDT 102
+A+ D HGR +A I G H A L+F V +GTP + V +DT
Sbjct: 73 AAMVHRDRVFHGRRLADDRDTPITFAAGNETHQIAAFGFLHFANVSVGTPPLWFLVALDT 132
Query: 103 GSDLLWV--NCAGCSR-CPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSC 159
GSDL W+ NC C R T++ I L +++ KSST + C+ N C+ T +
Sbjct: 133 GSDLFWLPCNCTSCVRGLKTQNGKVIDLNIYELDKSSTRKNVPCNSNMCKQTQCH----- 187
Query: 160 SPGVRCEYVVTY-GDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGS 218
S G C Y V Y + +S+SG+ V D++ L + N +T +++ + GCG Q+G +
Sbjct: 188 SSGSSCRYEVEYLSNDTSSSGFLVEDVLHL--ITDNDQTKDIDTQITIGCGQVQTGVFLN 245
Query: 219 STDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSPKVKTTP 278
AA +G+ G G N S+ S LA G + F+ C G G GD S TP
Sbjct: 246 G--AAPNGLFGLGMENVSVPSILAQKGLISDSFSMCFG-SDGSGRITFGDTGSSDQGKTP 302
Query: 279 --MVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLS 336
+ + P YNV + ++ VGG D E I DSGT+ YL Y L+
Sbjct: 303 FNLRESHPTYNVTITQIIVGGYAAD---------HEFHAIFDSGTSFTYLNDPAYTLISE 353
Query: 337 QF 338
+F
Sbjct: 354 KF 355
>gi|125564143|gb|EAZ09523.1| hypothetical protein OsI_31798 [Oryza sativa Indica Group]
Length = 485
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 94/262 (35%), Positives = 124/262 (47%), Gaps = 27/262 (10%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSG 139
TG Y VGLGTP +Y V DTGSDL WV C C+ C + D LFDPS SST
Sbjct: 146 TGNYVVSVGLGTPAKQYAVIFDTGSDLSWVQCKPCADCYEQQD-----PLFDPSLSSTYA 200
Query: 140 EIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAP 199
+AC C+ + CS RC Y V YGD S T G VRD + L+ AS L
Sbjct: 201 AVACGAPECQELDAS---GCSSDSRCRYEVQYGDQSQTDGNLVRDTLTLS-ASDTLP--- 253
Query: 200 LNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVK 259
+FGCG++ +G G VDG+ G G+ SL SQ A + F +CL
Sbjct: 254 ---GFVFGCGDQNAGLFGQ-----VDGLFGLGREKVSLPSQ--GAPSYGPGFTYCLPSSS 303
Query: 260 GG-GIFAIGDVVSPKVKTTPMV--PNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTI 316
G G ++G + T + Y + L ++VGG + +P + GT+
Sbjct: 304 SGRGYLSLGGAPPANAQFTALADGATPSFYYIDLVGIKVGGRAIRIPATAFAA--AGGTV 361
Query: 317 IDSGTTLAYLPPMLYDLVLSQF 338
IDSGT + LPP Y + + F
Sbjct: 362 IDSGTVITRLPPRAYAPLRAAF 383
>gi|449508297|ref|XP_004163275.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 641
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 99/298 (33%), Positives = 138/298 (46%), Gaps = 43/298 (14%)
Query: 37 NKFKAGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEY 96
N K R L D + RM DL L G Y T++ +GTP ++
Sbjct: 45 NSSKFISNPHRRLRQFPTSDNLSNARMRLYDDLLLNG--------YYTTRLWIGTPPQQF 96
Query: 97 YVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACS-DNFCRTTYNNR 155
+ VDTGS + +V C+ C +C D FDP SST I C+ D C
Sbjct: 97 ALIVDTGSTVTYVPCSTCEQCGRHQD-----PKFDPESSSTYKPIKCNIDCICD------ 145
Query: 156 YPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGD 215
S GV+C Y Y + S++SG D+I GN ++ + +FGC N ++GD
Sbjct: 146 ----SDGVQCVYERQYAEMSTSSGVLGEDVISF----GN-QSELIPQRAVFGCENMETGD 196
Query: 216 LGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHC---LDVVKGGGIFAIGDVVSP 272
L S DGI+G G + SL+ QL G + F+ C +D+ GGG +G + P
Sbjct: 197 LFSQR---ADGIMGLGTGDLSLVDQLVEKGAINDSFSLCYGGMDI--GGGAMVLGGISPP 251
Query: 273 K--VKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDER-GTIIDSGTTLAYLP 327
+ T P+YNV L+E+ V G L L + G D R G ++DSGTT AYLP
Sbjct: 252 SDMIFTYSDPVRSPYYNVDLKEIHVAGKKLPLSS---GIFDGRYGAVLDSGTTYAYLP 306
>gi|115479815|ref|NP_001063501.1| Os09g0482200 [Oryza sativa Japonica Group]
gi|50725878|dbj|BAD33407.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50725881|dbj|BAD33410.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113631734|dbj|BAF25415.1| Os09g0482200 [Oryza sativa Japonica Group]
gi|125606112|gb|EAZ45148.1| hypothetical protein OsJ_29786 [Oryza sativa Japonica Group]
Length = 485
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 94/262 (35%), Positives = 124/262 (47%), Gaps = 27/262 (10%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSG 139
TG Y VGLGTP +Y V DTGSDL WV C C+ C + D LFDPS SST
Sbjct: 146 TGNYVVSVGLGTPAKQYAVIFDTGSDLSWVQCKPCADCYEQQD-----PLFDPSLSSTYA 200
Query: 140 EIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAP 199
+AC C+ + CS RC Y V YGD S T G VRD + L+ AS L
Sbjct: 201 AVACGAPECQELDAS---GCSSDSRCRYEVQYGDQSQTDGNLVRDTLTLS-ASDTLP--- 253
Query: 200 LNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVK 259
+FGCG++ +G G VDG+ G G+ SL SQ A + F +CL
Sbjct: 254 ---GFVFGCGDQNAGLFGQ-----VDGLFGLGREKVSLPSQ--GAPSYGPGFTYCLPSSS 303
Query: 260 GG-GIFAIGDVVSPKVKTTPMV--PNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTI 316
G G ++G + T + Y + L ++VGG + +P + GT+
Sbjct: 304 SGRGYLSLGGAPPANAQFTALADGATPSFYYIDLVGIKVGGRAIRIPATAFAA--AGGTV 361
Query: 317 IDSGTTLAYLPPMLYDLVLSQF 338
IDSGT + LPP Y + + F
Sbjct: 362 IDSGTVITRLPPRAYAPLRAAF 383
>gi|449463971|ref|XP_004149703.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 641
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 99/298 (33%), Positives = 138/298 (46%), Gaps = 43/298 (14%)
Query: 37 NKFKAGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEY 96
N K R L D + RM DL L G Y T++ +GTP ++
Sbjct: 45 NSSKFISNPHRRLRQFPTSDNLSNARMRLYDDLLLNG--------YYTTRLWIGTPPQQF 96
Query: 97 YVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACS-DNFCRTTYNNR 155
+ VDTGS + +V C+ C +C D FDP SST I C+ D C
Sbjct: 97 ALIVDTGSTVTYVPCSTCEQCGRHQD-----PKFDPESSSTYKPIKCNIDCICD------ 145
Query: 156 YPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGD 215
S GV+C Y Y + S++SG D+I GN ++ + +FGC N ++GD
Sbjct: 146 ----SDGVQCVYERQYAEMSTSSGVLGEDVISF----GN-QSELIPQRAVFGCENMETGD 196
Query: 216 LGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHC---LDVVKGGGIFAIGDVVSP 272
L S DGI+G G + SL+ QL G + F+ C +D+ GGG +G + P
Sbjct: 197 LFSQR---ADGIMGLGTGDLSLVDQLVEKGAINDSFSLCYGGMDI--GGGAMVLGGISPP 251
Query: 273 K--VKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDER-GTIIDSGTTLAYLP 327
+ T P+YNV L+E+ V G L L + G D R G ++DSGTT AYLP
Sbjct: 252 SDMIFTYSDPVRSPYYNVDLKEIHVAGKKLPLSS---GIFDGRYGAVLDSGTTYAYLP 306
>gi|307103543|gb|EFN51802.1| hypothetical protein CHLNCDRAFT_59135 [Chlorella variabilis]
Length = 746
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 85/260 (32%), Positives = 134/260 (51%), Gaps = 30/260 (11%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGC-SRC-PTKSDLGIKLTLFDPSKSSTS 138
G ++ + LGTP ++ V VDTGS + +V C+ C S C P D FDP SST+
Sbjct: 76 GYFYATLYLGTPAKKFAVIVDTGSTMTYVPCSSCGSGCGPNHQDAA-----FDPEASSTA 130
Query: 139 GEIACSDNFCRTTYNNRYPSCSPGVR-CEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKT 197
I+C+ C P C + C Y +Y + SS+SG + D++ L+ L
Sbjct: 131 SRISCTSPKCSCGS----PRCGCSTQQCTYTRSYAEQSSSSGILLEDVLALHDG---LPG 183
Query: 198 APLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDV 257
AP +IFGC R++G++ DG+ G G +++S+++QL AG + F+ C +
Sbjct: 184 AP----IIFGCETRETGEIFRQR---ADGLFGLGNSDASVVNQLVKAGVIDDVFSLCFGM 236
Query: 258 VKGGGIFAIGDVVSP---KVKTTPMVPNMPH---YNVILEEVEVGGNPLDLPTSLLGTGD 311
V+G G +GD P ++ TP++ + H YNV + + V G L + SL G
Sbjct: 237 VEGDGALLLGDAEVPGSISLQYTPLLTSTTHPFYYNVKMLSLAVEGQLLPVSQSLFDQG- 295
Query: 312 ERGTIIDSGTTLAYLPPMLY 331
GT++DSGTT Y+P ++
Sbjct: 296 -YGTVLDSGTTFTYMPSPVF 314
>gi|218188634|gb|EEC71061.1| hypothetical protein OsI_02803 [Oryza sativa Indica Group]
Length = 479
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 90/269 (33%), Positives = 130/269 (48%), Gaps = 31/269 (11%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGC-SRCPTKSDLGIKLTLFDPSKSSTSGEI 141
Y VGLG+P V +DTGSD+ WV C C + P + G LFDP+ SST
Sbjct: 135 YVISVGLGSPAMTQRVVIDTGSDVSWVQCEPCPAPSPCHAHAG---ALFDPAASSTYAAF 191
Query: 142 ACSDNFCRTTYNN-RYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPL 200
CS C ++ C RC+Y+V YGDGS+T+G + D++ L+ + +
Sbjct: 192 NCSAAACAQLGDSGEANGCDAKSRCQYIVKYGDGSNTTGTYSSDVLTLSGSD-------V 244
Query: 201 NSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKG 260
FGC + +LG+ D DG++G G SL+SQ AA K F++CL
Sbjct: 245 VRGFQFGCSH---AELGAGMDDKTDGLIGLGGDAQSLVSQTAA--RYGKSFSYCLPATPA 299
Query: 261 GGIF-------AIGDVVSPKVKTTPMV--PNMP-HYNVILEEVEVGGNPLDLPTSLLGTG 310
F + G + + TTPM+ +P +Y LE++ VGG L L S+
Sbjct: 300 SSGFLTLGAPASGGGGGASRFATTPMLRSKKVPTYYFAALEDIAVGGKKLGLSPSVFAA- 358
Query: 311 DERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
G+++DSGT + LPP Y + S FR
Sbjct: 359 ---GSLVDSGTVITRLPPAAYAALSSAFR 384
>gi|72384474|gb|AAZ67590.1| 80A08_5 [Brassica rapa subsp. pekinensis]
Length = 632
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 89/262 (33%), Positives = 133/262 (50%), Gaps = 26/262 (9%)
Query: 82 LYFTKVGLGTPTDEYYVQVDTGSDLLWV--NCAGCSRCPTK--SDLGIK-LTLFDPSKSS 136
L++T + +GTP+ + V +D+GSDLLW+ NC C+ + S L K L FDPS S+
Sbjct: 96 LHYTWIDIGTPSVSFLVALDSGSDLLWIPCNCVQCAPLSSAYYSSLATKDLNEFDPSAST 155
Query: 137 TSGEIACSDNFCRTTYNNRYPSC-SPGVRCEYVVTYG-DGSSTSGYFVRDIIQLNQASGN 194
TS CS C + P+C SP +C Y VTY + +S+SG V D++ L ++
Sbjct: 156 TSKVFPCSHKLCESA-----PACESPKEQCPYTVTYASENTSSSGLLVEDVLHLAYSAN- 209
Query: 195 LKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHC 254
++ + + V+ GCG +QSG+ A DG++G G S+ S LA AG +R F+ C
Sbjct: 210 -ASSSVKARVVVGCGEKQSGEFLKGI--APDGVMGLGPGEISVPSFLAKAGLMRNSFSMC 266
Query: 255 LDVVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVG--GNPLDLPTSLLGTGDE 312
D G I+ GDV ++T +P + VEV GN +S
Sbjct: 267 FDEEDSGRIY-FGDVGPSTQQSTRFLPYKNEFVAYFVGVEVCCVGNSCLKQSSFT----- 320
Query: 313 RGTIIDSGTTLAYLPPMLYDLV 334
T+IDSG + +LP +Y V
Sbjct: 321 --TLIDSGQSFTFLPEEIYREV 340
>gi|242045120|ref|XP_002460431.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
gi|241923808|gb|EER96952.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
Length = 481
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 93/271 (34%), Positives = 128/271 (47%), Gaps = 28/271 (10%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPS 133
G P T Y VGLGTP + V DTGSDL WV C C C + D LFDPS
Sbjct: 129 RGVPLGTANYIVSVGLGTPKRDLLVVFDTGSDLSWVQCKPCDGCYQQHD-----PLFDPS 183
Query: 134 KSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASG 193
+S+T + C CR + SCS G +C Y V YGD S T G RD + L +S
Sbjct: 184 QSTTYSAVPCGAQECRRLDSG---SCSSG-KCRYEVVYGDMSQTDGNLARDTLTLGPSSS 239
Query: 194 NLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAH 253
+ + L +FGCG+ +G G + DG+ G G+ SL SQ AA F++
Sbjct: 240 SSSSDQLQ-EFVFGCGDDDTGLFGKA-----DGLFGLGRDRVSLASQ--AAAKYGAGFSY 291
Query: 254 CL-DVVKGGGIFAIGDVVSPKVKTTPMV-----PNMPHYNVILEEVEVGGNPLDLPTSLL 307
CL G ++G P + T MV P+ + N++ ++V G + + ++
Sbjct: 292 CLPSSSTAEGYLSLGSAAPPNARFTAMVTRSDTPSFYYLNLV--GIKVAGRTVRVSPAVF 349
Query: 308 GTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
T GT+IDSGT + LP Y + S F
Sbjct: 350 RTP---GTVIDSGTVITRLPSRAYAALRSSF 377
>gi|168027607|ref|XP_001766321.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162682535|gb|EDQ68953.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 381
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 98/283 (34%), Positives = 131/283 (46%), Gaps = 30/283 (10%)
Query: 65 ASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC-AGCSRCPTKSDL 123
A++ +L GN +P GLY+ + +G P YY+ +DTGSDL W+ C A C C +
Sbjct: 7 ATVFSQLRGNIYPD--GLYYMAMLIGAPAKLYYLDMDTGSDLTWLQCDAPCRSCASGPH- 63
Query: 124 GIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVR-CEYVVTYGDGSSTSGYFV 182
L+DP K+ + C C +C VR C+Y V Y DGSST G +
Sbjct: 64 ----GLYDPKKARL---VDCRVPLCALVQQGGSYACGGPVRQCDYDVEYADGSSTMGVLM 116
Query: 183 RDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLA 242
D I L +G ++ I GCG Q G L + T A+ DG++G A SL SQLA
Sbjct: 117 EDTITLLLTNGTRS----KTTAIIGCGYDQQGTL-AQTPASTDGVMGLSSAKISLPSQLA 171
Query: 243 AAGNVRKEFAHCLD-VVKGGGIFAIGDVVSPKV--KTTPMVPNMPHYNVILEEVEVGGNP 299
G VR HCL GGG GD + P + TP++ N +GG
Sbjct: 172 KKGIVRNVIGHCLAGGSNGGGYLFFGDSLVPALGMTWTPIMGKSITGN-------IGGKS 224
Query: 300 LDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFRFWI 342
D TGD G + DSGT+ YL P Y+ VLS +
Sbjct: 225 GDADDK---TGDIGGVMFDSGTSFTYLVPEAYNAVLSAMEMQV 264
>gi|357118064|ref|XP_003560779.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 472
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 105/335 (31%), Positives = 153/335 (45%), Gaps = 55/335 (16%)
Query: 36 ENKFKAGGERERTLSALKQHDTRR-HGRMMASIDLELGGNGHPSATG------LYFTKVG 88
+ K + ER R+ A H R+ GR M S E GG P+ G Y +G
Sbjct: 74 DKKKPSFAERLRSDRARADHILRKASGRRMMS---EGGGASIPTYLGGFVDSLEYVVTLG 130
Query: 89 LGTPTDEYYVQVDTGSDLLWVNCAGC--SRCPTKSDLGIKLTLFDPSKSSTSGEIACSDN 146
+GTP + V +DTGSDL WV C C S C + D LFDPSKSST I C+ +
Sbjct: 131 IGTPAVQQTVLIDTGSDLSWVQCKPCNASDCYPQKD-----PLFDPSKSSTFATIPCASD 185
Query: 147 FCRTT----YNNRYPSCSPGV--RCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPL 200
C+ Y+N + + G+ +C Y + YG+G+ T G + + + L ++ +
Sbjct: 186 ACKQLPVDGYDNGCTNNTSGMPPQCGYAIEYGNGAITEGVYSTETLALGSSA-------V 238
Query: 201 NSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKG 260
S FGCG+ Q G DG+LG G A SL+SQ A+ F++CL +
Sbjct: 239 VKSFRFGCGSDQHGPYDK-----FDGLLGLGGAPESLVSQTASV--YGGAFSYCLPPLNS 291
Query: 261 GGIFAI------------GDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLG 308
G F G V +P +P + Y V L + VGG LD+P ++
Sbjct: 292 GAGFLTLGAPNSTNNSNSGFVFTPMHAFSPKIATF--YVVTLTGISVGGKALDIPPAVFA 349
Query: 309 TGDERGTIIDSGTTLAYLPPMLYDLVLSQFRFWIA 343
+G I+DSGT + +P Y + + FR +A
Sbjct: 350 ----KGNIVDSGTVITGIPTTAYKALRTAFRSAMA 380
>gi|125540928|gb|EAY87323.1| hypothetical protein OsI_08727 [Oryza sativa Indica Group]
Length = 463
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 89/265 (33%), Positives = 126/265 (47%), Gaps = 31/265 (11%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y VGLGTP V +DTGSD+ WV C C P + G LFDP+KSST ++
Sbjct: 127 YVISVGLGTPAVTQTVTIDTGSDVSWVQCNPCPNPPCHAQTG---ALFDPAKSSTYRAVS 183
Query: 143 CSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNS 202
C+ C + C+Y V YGDGS+T+G + RD + L+ AS +K
Sbjct: 184 CAAAECAQLEQQGNGCGATNYECQYGVQYGDGSTTNGTYSRDTLTLSGASDAVK------ 237
Query: 203 SVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAA-GNVRKEFAHCLDVVKGG 261
FGC + +SG DG++G G SL+SQ AAA GN F++CL G
Sbjct: 238 GFQFGCSHLESG-----FSDQTDGLMGLGGGAQSLVSQTAAAYGN---SFSYCLPPTSGS 289
Query: 262 -------GIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERG 314
G V+ ++ + +P Y L+++ VGG L L S+ G
Sbjct: 290 SGFLTLGGGGGASGFVTTRMLRSKQIPTF--YGARLQDIAVGGKQLGLSPSVFAA----G 343
Query: 315 TIIDSGTTLAYLPPMLYDLVLSQFR 339
+++DSGT + LPP Y + S F+
Sbjct: 344 SVVDSGTIITRLPPTAYSALSSAFK 368
>gi|293334661|ref|NP_001168795.1| uncharacterized protein LOC100382594 precursor [Zea mays]
gi|223973065|gb|ACN30720.1| unknown [Zea mays]
Length = 631
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 88/283 (31%), Positives = 139/283 (49%), Gaps = 45/283 (15%)
Query: 62 RMMASIDLELGGNGHPSA----------TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC 111
R+ AS+ LG HP+A G Y T++ +GTP E+ + VD+GS + +V C
Sbjct: 57 RLAASLRRGLGDGVHPNARMRLHDDLLTNGYYTTRLYIGTPPQEFALIVDSGSTVTYVPC 116
Query: 112 AGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTY 171
+ C +C D F P SS+ + C+ + C + + +C Y Y
Sbjct: 117 SSCEQCGNHQD-----PRFQPDLSSSYSPVKCNVD-CTCDSDKK--------QCTYERQY 162
Query: 172 GDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFG 231
+ SS+SG DI+ + S LK IFGC N ++GDL S DGI+G G
Sbjct: 163 AEMSSSSGVLGEDIVSFGRES-ELKP----QHAIFGCENSETGDLFSQ---HADGIMGLG 214
Query: 232 QANSSLLSQLAAAGNVRKEFAHC---LDVVKGGGIFAIGDVVSPK----VKTTPMVPNMP 284
+ S++ QL G + F+ C +D+ GGG +G +++P + P+ P
Sbjct: 215 RGQLSIMDQLVEKGVISDSFSLCYGGMDI--GGGAMVLGGMLAPPDMIFSNSDPL--RSP 270
Query: 285 HYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLP 327
+YN+ L+E+ V G L + + + + + GT++DSGTT AYLP
Sbjct: 271 YYNIELKEIHVAGKALRVESRIFNS--KHGTVLDSGTTYAYLP 311
>gi|242066140|ref|XP_002454359.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
gi|241934190|gb|EES07335.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
Length = 460
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 92/263 (34%), Positives = 126/263 (47%), Gaps = 34/263 (12%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y V LG+P V +D+GSD+ WV C C +C ++ D LFDPS SST +
Sbjct: 131 YLITVRLGSPAKTQTVLIDSGSDVSWVQCKPCLQCHSQVD-----PLFDPSLSSTYSPFS 185
Query: 143 CSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNS 202
CS C + CS +C+Y+V Y DGSST+G + D + L + S
Sbjct: 186 CSSAACAQLGQDGN-GCSSSSQCQYIVRYADGSSTTGTYSSDTLALGSNT--------IS 236
Query: 203 SVIFGCGNRQSG--DLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL-DVVK 259
+ FGC + +SG DL DG++G G SL SQ AG F++CL
Sbjct: 237 NFQFGCSHVESGFNDL-------TDGLMGLGGGAPSLASQ--TAGTFGTAFSYCLPPTPS 287
Query: 260 GGGIFAIGDVVSPKVKTTPMVPNMP---HYNVILEEVEVGGNPLDLPTSLLGTGDERGTI 316
G +G S VK TPM+ + P Y V LE + VGG L +PTS+ G +
Sbjct: 288 SSGFLTLGAGTSGFVK-TPMLRSSPVPTFYGVRLEAIRVGGTQLSIPTSVF----SAGMV 342
Query: 317 IDSGTTLAYLPPMLYDLVLSQFR 339
+DSGT + LP Y + S F+
Sbjct: 343 MDSGTIITRLPRTAYSALSSAFK 365
>gi|413953792|gb|AFW86441.1| hypothetical protein ZEAMMB73_342504 [Zea mays]
Length = 459
Score = 121 bits (304), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 106/315 (33%), Positives = 146/315 (46%), Gaps = 43/315 (13%)
Query: 44 ERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGL-YFTKVGLGTPTDEYYVQVDT 102
ER R A ++ R + SI LGG S L Y VGLGTP + +DT
Sbjct: 84 ERLRRSRARSKYIMSRASKSNVSIPTHLGG----SVDSLEYVVTVGLGTPAVSQVLLIDT 139
Query: 103 GSDLLWVNCAGC--SRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPS-C 159
GSDL WV CA C + C + D LFDPS+SST I C+ + CR + Y S C
Sbjct: 140 GSDLSWVQCAPCNSTTCYPQKD-----PLFDPSRSSTYAPIPCNTDACRDLTRDGYGSDC 194
Query: 160 SP----GVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAP--LNSSVIFGCGNRQS 213
+ G +C Y +TYGDGS T+G + + + + AP FGCG+ Q
Sbjct: 195 TSGSGGGAQCGYAITYGDGSQTTGVYSNETLTM---------APGVTVKDFHFGCGHDQD 245
Query: 214 GDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKG-GGIFAIGDVVSP 272
G + DG+LG G A SL+ Q ++ F++CL G A+G V+
Sbjct: 246 G-----PNDKYDGLLGLGGAPESLVVQTSSV--YGGAFSYCLPAANDQAGFLALGAPVND 298
Query: 273 K--VKTTPMVPNMPHYNVI-LEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPM 329
TPMV + V+ + + VGG P+D+P S G IIDSGT + L
Sbjct: 299 ASGFVFTPMVREQQTFYVVNMTGITVGGEPIDVPPSAF----SGGMIIDSGTVVTELQHT 354
Query: 330 LYDLVLSQFRFWIAS 344
Y + + FR +A+
Sbjct: 355 AYAALQAAFRKAMAA 369
>gi|224096686|ref|XP_002310698.1| predicted protein [Populus trichocarpa]
gi|222853601|gb|EEE91148.1| predicted protein [Populus trichocarpa]
Length = 441
Score = 121 bits (304), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 101/305 (33%), Positives = 147/305 (48%), Gaps = 36/305 (11%)
Query: 50 SALKQHDTRRHGRMMASIDLELG---GNG--HPSATG-LYFTKVGLGTPTDEYYVQVDTG 103
+AL D GR ++ D L GN S+ G L++T V LGTP ++ V +DTG
Sbjct: 58 AALAHRDQMLRGRRLSDADASLAFSDGNSTFRISSLGFLHYTTVELGTPGVKFMVALDTG 117
Query: 104 SDLLWVNCAGCSRC-PTK-----SDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYP 157
SDL WV C CSRC PT SD +L++++P +SSTS ++ C+++ C R
Sbjct: 118 SDLFWVPC-DCSRCAPTHGASYASDF--ELSIYNPRESSTSKKVTCNNDMC----AQRNR 170
Query: 158 SCSPGVRCEYVVTYGDG-SSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDL 216
C Y+V+Y +STSG V+D++ L G + + + V FGCG QSG
Sbjct: 171 CLGTFSSCPYIVSYVSAQTSTSGILVKDVLHLTTEDGGREF--VEAYVTFGCGQVQSGSF 228
Query: 217 GSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSPKVKT 276
AA +G+ G G S+ S L+ G + F+ C G G + GD SP +
Sbjct: 229 LDI--AAPNGLFGLGMEKISVPSVLSREGLIADSFSMCFG-HDGIGRISFGDKGSPDQEE 285
Query: 277 TP--MVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLV 334
TP + P P YNV + + VG +D+ E + DSGT+ Y+ Y V
Sbjct: 286 TPFNVNPAHPTYNVTVTQARVGTMLIDV---------EFTALFDSGTSFTYMVDPAYSRV 336
Query: 335 LSQFR 339
+F
Sbjct: 337 SEKFH 341
>gi|413952261|gb|AFW84910.1| hypothetical protein ZEAMMB73_904583 [Zea mays]
Length = 298
Score = 121 bits (304), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 65/130 (50%), Positives = 84/130 (64%), Gaps = 2/130 (1%)
Query: 208 CGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLD-VVKGGGIFAI 266
C N QSGDL + D AVDGI GFGQ S++SQL + G K F+HCL GGGI +
Sbjct: 9 CSNSQSGDL-TKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCLKGSDNGGGILVL 67
Query: 267 GDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYL 326
G++V P + TP+VP+ PHYN+ LE + V G L + +SL T + +GTI+DSGTTLAYL
Sbjct: 68 GEIVEPGLVYTPLVPSQPHYNLNLESIAVNGQKLPIDSSLFTTSNTQGTIVDSGTTLAYL 127
Query: 327 PPMLYDLVLS 336
YD +S
Sbjct: 128 ADGAYDPFVS 137
>gi|242050744|ref|XP_002463116.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
gi|241926493|gb|EER99637.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
Length = 632
Score = 121 bits (304), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 89/287 (31%), Positives = 138/287 (48%), Gaps = 45/287 (15%)
Query: 62 RMMASIDLELGGNGHPSA----------TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC 111
R+ AS LG HP+A G Y T++ +GTP E+ + VD+GS + +V C
Sbjct: 58 RLAASSRRGLGDGAHPNARMRLHDDLLTNGYYTTRLYIGTPPQEFALIVDSGSTVTYVPC 117
Query: 112 AGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTY 171
A C +C D F P SS+ + C+ + C + + +C Y Y
Sbjct: 118 ASCEQCGNHQD-----PRFQPDLSSSYSPVKCNVD-CTCDSDKK--------QCTYERQY 163
Query: 172 GDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFG 231
+ SS+SG DI+ + S LK +FGC N ++GDL S DGI+G G
Sbjct: 164 AEMSSSSGVLGEDIVSFGRES-ELKP----QRAVFGCENSETGDLFSQ---HADGIMGLG 215
Query: 232 QANSSLLSQLAAAGNVRKEFAHC---LDVVKGGGIFAIGDVVSPK----VKTTPMVPNMP 284
+ S++ QL G + F+ C +D+ GGG +G V +P + P+ P
Sbjct: 216 RGQLSIMDQLVEKGVISDSFSLCYGGMDI--GGGAMVLGGVPAPSDMVFSHSDPL--RSP 271
Query: 285 HYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLY 331
+YN+ L+E+ V G L + + + + + GT++DSGTT AYLP +
Sbjct: 272 YYNIELKEIHVAGKALRVDSRVFNS--KHGTVLDSGTTYAYLPEQAF 316
>gi|357122155|ref|XP_003562781.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
distachyon]
Length = 629
Score = 121 bits (303), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 96/298 (32%), Positives = 141/298 (47%), Gaps = 51/298 (17%)
Query: 64 MASIDLELGGNGHPSA----------TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAG 113
+AS LG G PSA G Y T++ +GTP E+ + VD+GS + +V CA
Sbjct: 56 LASSRRVLGDGGRPSARMRLHDDLLTNGYYTTRLYIGTPPQEFALIVDSGSTVTYVPCAS 115
Query: 114 CSRCPTKSDLGIKLTLFDPSKSSTSGEIACS-DNFCRTTYNNRYPSCSPGVRCEYVVTYG 172
C +C D F P SST + CS D C + + +C Y Y
Sbjct: 116 CEQCGNHQD-----PRFQPDLSSTYSPVKCSADCTCDSDKS----------QCTYERQYA 160
Query: 173 DGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQ 232
+ SS+SG DI+ S LK +FGC N ++GDL S DGI+G G+
Sbjct: 161 EMSSSSGVLGEDIVSFGTES-ELKP----QRAVFGCENSETGDLFSQ---HADGIMGLGR 212
Query: 233 ANSSLLSQLAAAGNVRKEFAHC---LDVVKGGGIFAIGDVVSPK----VKTTPMVPNMPH 285
S++ QL G + F+ C +D+ GGG +G + +P ++ P+ P+
Sbjct: 213 GQLSIMDQLVDKGVIGDSFSMCYGGMDI--GGGAMVLGAMPAPPDMVFSRSDPV--RSPY 268
Query: 286 YNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLY----DLVLSQFR 339
YN+ L+E+ V G L L + + + GT++DSGTT AYLP + D V S+ R
Sbjct: 269 YNIELKEIHVAGKALRLDPRIFDS--KHGTVLDSGTTYAYLPEQAFVAFKDAVTSKVR 324
>gi|225216930|gb|ACN85225.1| aspartic proteinase nepenthesin-1 precursor [Oryza punctata]
gi|225216938|gb|ACN85232.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
Length = 516
Score = 121 bits (303), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 99/264 (37%), Positives = 125/264 (47%), Gaps = 31/264 (11%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSG 139
TG Y VGLGTP Y V DTGSD WV C C + + LFDP++SST
Sbjct: 176 TGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQ----REKLFDPARSSTYA 231
Query: 140 EIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAP 199
++C+ C + + R CS G C Y V YGDGS + G+F D + L+
Sbjct: 232 NVSCAAPAC-SDLDTR--GCSGG-HCLYGVQYGDGSYSIGFFAMDTLTLSSYDA------ 281
Query: 200 LNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQ-LAAAGNVRKEFAHCLDVV 258
FGCG R G G + G+LG G+ +SL Q G V FAHCL
Sbjct: 282 -VKGFRFGCGERNEGLFGEAA-----GLLGLGRGKTSLPVQTYDKYGGV---FAHCLPAR 332
Query: 259 KGGGIFAIGDVVSP--KVKTTPM-VPNMP-HYNVILEEVEVGGNPLDLPTSLLGTGDERG 314
G + SP ++ TTPM V N P Y V L + VGG L +P S+ T G
Sbjct: 333 STGTGYLDFGAGSPAARLTTTPMLVDNGPTFYYVGLTGIRVGGRLLYIPQSVFATA---G 389
Query: 315 TIIDSGTTLAYLPPMLYDLVLSQF 338
TI+DSGT + LPP Y + S F
Sbjct: 390 TIVDSGTVITRLPPAAYSSLRSAF 413
>gi|226499286|ref|NP_001147826.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
gi|195613980|gb|ACG28820.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
Length = 545
Score = 121 bits (303), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 106/340 (31%), Positives = 152/340 (44%), Gaps = 40/340 (11%)
Query: 16 VVHQWAVGGGGVMGNFVFEVENKFKAGGERERTLSALKQHDTRRHGRMMASID----LEL 71
VV +WA GG + +++ A G E SAL +HD R + D
Sbjct: 47 VVRRWAEARGGPL------AADRWPARGTPE-YYSALSRHDRARRALAGGADDGLLTFAA 99
Query: 72 GGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWV--NCAGCSRCPTKSDLG---IK 126
G + + S T LY+ +V LGTP + V +DTGSDL WV +C C+ P+ + G
Sbjct: 100 GNDTYQSGT-LYYAEVELGTPNATFLVALDTGSDLFWVPCDCRQCATIPSANATGPDAPP 158
Query: 127 LTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVR--CEYVVTY-GDGSSTSGYFVR 183
L + P +SSTS ++AC + C R CS C Y V Y +S+SG V+
Sbjct: 159 LRPYSPRRSSTSEQVACDNPLC-----GRRNGCSAATNGSCPYEVQYVSANTSSSGVLVQ 213
Query: 184 DIIQLNQASGNLKTA--PLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQL 241
D++ L + A L + V+FGCG Q+G AVDG++G G S+ S L
Sbjct: 214 DVLHLTRERPGPGAAGEALQAPVVFGCGQVQTGAFLDDGGGAVDGLMGLGMGKVSVPSAL 273
Query: 242 AAAGNVRKE-FAHCLDVVKGGGIFAIGDVVSPKVKTTPMVPNM--PHYNVILEEVEVGGN 298
AA+G V + F+ C G G GD S TP P YNV + +G
Sbjct: 274 AASGLVASDSFSMCFG-DDGVGRVNFGDAGSRGQAETPFTVRSLNPTYNVSFTSIGIGSE 332
Query: 299 PLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
+ E ++DSGT+ YL Y + ++F
Sbjct: 333 SV---------AAEFAAVMDSGTSFTYLSDPEYTQLATKF 363
>gi|159464048|ref|XP_001690254.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
gi|158284242|gb|EDP09992.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
Length = 485
Score = 121 bits (303), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 80/251 (31%), Positives = 122/251 (48%), Gaps = 28/251 (11%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
++T + LGTP + V +DTGS + ++ C CS C + FDP KS+T+ ++A
Sbjct: 13 FYTTLKLGTPERTFSVIIDTGSTITYIPCKDCSHCGKHT-----AEWFDPDKSTTAKKLA 67
Query: 143 CSDNFCRTTYNNRYPSCS-PGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLN 201
C D C N PSC+ RC Y TY + SS+ G+ + D + ++
Sbjct: 68 CGDPLC----NCGTPSCTCNNDRCYYSRTYAERSSSEGWMIEDTFGFPDSDSPVR----- 118
Query: 202 SSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGG 261
++FGC N ++G++ DGI+G G +++ SQL + F+ C K
Sbjct: 119 --LVFGCENGETGEIYRQ---MADGIMGMGNNHNAFQSQLVQRKVIEDVFSLCFGYPK-D 172
Query: 262 GIFAIGDVVSPKVKTTPMVPNMPH-----YNVILEEVEVGGNPLDLPTSLLGTGDERGTI 316
GI +GDV P+ T P + H YNV ++ + V G L S+ G GT+
Sbjct: 173 GILLLGDVTLPEGANTVYTPLLTHLHLHYYNVKMDGITVNGQTLAFDASVFDRG--YGTV 230
Query: 317 IDSGTTLAYLP 327
+DSGTT YLP
Sbjct: 231 LDSGTTFTYLP 241
>gi|293332735|ref|NP_001168472.1| uncharacterized protein LOC100382248 [Zea mays]
gi|223948487|gb|ACN28327.1| unknown [Zea mays]
Length = 434
Score = 120 bits (302), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 93/264 (35%), Positives = 131/264 (49%), Gaps = 32/264 (12%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGC-SRCPTKSDLGIKLTLFDPSKSSTS 138
TG Y V LGTP + + V DTGSD WV C C + C + K LFDP+KS+T
Sbjct: 93 TGNYVVPVRLGTPAERFTVVFDTGSDTTWVQCQPCVAYCYRQ-----KEPLFDPTKSATY 147
Query: 139 GEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTA 198
I+CS ++C Y + CS G C Y + YGDGS T G++ +D + L A +K
Sbjct: 148 ANISCSSSYCSDLYVS---GCS-GGHCLYGIQYGDGSYTIGFYAQDTLTL--AYDTIK-- 199
Query: 199 PLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVV 258
+ FGCG + G G + G+LG G+ +SL Q A FA+CL
Sbjct: 200 ----NFRFGCGEKNRGLFGRAA-----GLLGLGRGKTSLPVQ--AYDKYGGVFAYCLPAT 248
Query: 259 KGG-GIFAIGD-VVSPKVKTTPMVPNM--PHYNVILEEVEVGGNPLDLPTSLLGTGDERG 314
G G +G + + TPM+ + Y V + ++VGG+ L +P S+ T G
Sbjct: 249 SAGTGFLDLGPGAPAANARLTPMLVDRGPTFYYVGMTGIKVGGHVLPIPGSVFSTA---G 305
Query: 315 TIIDSGTTLAYLPPMLYDLVLSQF 338
T++DSGT + LPP Y + S F
Sbjct: 306 TLVDSGTVITRLPPSAYAPLRSAF 329
>gi|449447285|ref|XP_004141399.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 609
Score = 120 bits (302), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 88/270 (32%), Positives = 131/270 (48%), Gaps = 37/270 (13%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGE 140
G Y T++ +G+P E+ + VDTGS + +V C+ C +C D F P SST
Sbjct: 87 GYYTTRLWIGSPPQEFALIVDTGSTVTYVPCSNCVQCGNHQD-----PRFQPELSSTYQP 141
Query: 141 IACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPL 200
+ C+ + C N GV+C Y Y + S++SG D++ + S + P
Sbjct: 142 VKCNAD-CNCDEN--------GVQCTYERRYAEMSTSSGVLAEDVMSFGKES---ELVP- 188
Query: 201 NSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHC---LDV 257
+FGC +SGDL + DGI+G G+ S++ QL G V F+ C +DV
Sbjct: 189 -QRAVFGCETMESGDLYTQ---RADGIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGGMDV 244
Query: 258 VKGGGIFAIGDVVSPK--VKTTPMVPNMPHYNVILEEVEVGGNPLDL-PTSLLGTGDERG 314
GGG +G + SP V + P+YN+ L+E+ V G PL L P + G + G
Sbjct: 245 --GGGAMVLGGISSPPGMVFSHSDPSRSPYYNIELKEIHVAGKPLKLNPRTFDG---KYG 299
Query: 315 TIIDSGTTLAYLPPMLY----DLVLSQFRF 340
I+DSGTT AY P Y D ++ + F
Sbjct: 300 AILDSGTTYAYFPEKAYYAFKDAIMKKISF 329
>gi|168000296|ref|XP_001752852.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696015|gb|EDQ82356.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 384
Score = 120 bits (302), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 96/273 (35%), Positives = 134/273 (49%), Gaps = 31/273 (11%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGE 140
G Y + LG+P + V VDTGSDL WV C C C + G K FDPSKS + +
Sbjct: 37 GEYLMTLTLGSPPQSFDVIVDTGSDLNWVQCLPCRVCYQQP--GPK---FDPSKSRSFRK 91
Query: 141 IACSDNFCRTTYNNRYP--SCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTA 198
AC+DN C + P +C+ V C+Y TYGD S+T+G + I LN +G ++
Sbjct: 92 AACTDNLCNVS---ALPLKACAANV-CQYQYTYGDQSNTNGDLAFETISLNNGAGT-QSV 146
Query: 199 PLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVV 258
P + FGCG + G T A G++G GQ SL SQL+ +F++CL +
Sbjct: 147 P---NFAFGCGTQNLG-----TFAGAAGLVGLGQGPLSLNSQLSHT--FANKFSYCLVSL 196
Query: 259 K--GGGIFAIGDV-VSPKVKTTPMVPNMPH---YNVILEEVEVGGNPLDLPTSLLGTGDE 312
G + + ++ T +V N H Y V L +EVGG PL+L S+
Sbjct: 197 NSLSASPLTFGSIAAAANIQYTSIVVNARHPTYYYVQLNSIEVGGQPLNLAPSVFAIDQS 256
Query: 313 R---GTIIDSGTTLAYLPPMLYDLVLSQFRFWI 342
GTIIDSGTT+ L Y VL + ++
Sbjct: 257 TGRGGTIIDSGTTITMLTLPAYSAVLRAYESFV 289
>gi|413943688|gb|AFW76337.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
gi|413943689|gb|AFW76338.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
Length = 499
Score = 120 bits (302), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 93/264 (35%), Positives = 131/264 (49%), Gaps = 32/264 (12%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGC-SRCPTKSDLGIKLTLFDPSKSSTS 138
TG Y V LGTP + + V DTGSD WV C C + C + K LFDP+KS+T
Sbjct: 158 TGNYVVPVRLGTPAERFTVVFDTGSDTTWVQCQPCVAYCYRQ-----KEPLFDPTKSATY 212
Query: 139 GEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTA 198
I+CS ++C Y + CS G C Y + YGDGS T G++ +D + L A +K
Sbjct: 213 ANISCSSSYCSDLYVS---GCS-GGHCLYGIQYGDGSYTIGFYAQDTLTL--AYDTIK-- 264
Query: 199 PLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVV 258
+ FGCG + G G + G+LG G+ +SL Q A FA+CL
Sbjct: 265 ----NFRFGCGEKNRGLFGRAA-----GLLGLGRGKTSLPVQ--AYDKYGGVFAYCLPAT 313
Query: 259 KGG-GIFAIGD-VVSPKVKTTPMVPNM--PHYNVILEEVEVGGNPLDLPTSLLGTGDERG 314
G G +G + + TPM+ + Y V + ++VGG+ L +P S+ T G
Sbjct: 314 SAGTGFLDLGPGAPAANARLTPMLVDRGPTFYYVGMTGIKVGGHVLPIPGSVFSTA---G 370
Query: 315 TIIDSGTTLAYLPPMLYDLVLSQF 338
T++DSGT + LPP Y + S F
Sbjct: 371 TLVDSGTVITRLPPSAYAPLRSAF 394
>gi|449511696|ref|XP_004164029.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
sativus]
Length = 639
Score = 120 bits (302), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 88/270 (32%), Positives = 131/270 (48%), Gaps = 37/270 (13%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGE 140
G Y T++ +G+P E+ + VDTGS + +V C+ C +C D F P SST
Sbjct: 87 GYYTTRLWIGSPPQEFALIVDTGSTVTYVPCSNCVQCGNHQD-----PRFQPELSSTYQP 141
Query: 141 IACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPL 200
+ C+ + C N GV+C Y Y + S++SG D++ + S + P
Sbjct: 142 VKCNAD-CNCDEN--------GVQCTYERRYAEMSTSSGVLAEDVMSFGKES---ELVP- 188
Query: 201 NSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHC---LDV 257
+FGC +SGDL + DGI+G G+ S++ QL G V F+ C +DV
Sbjct: 189 -QRAVFGCETMESGDLYTQ---RADGIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGGMDV 244
Query: 258 VKGGGIFAIGDVVSPK--VKTTPMVPNMPHYNVILEEVEVGGNPLDL-PTSLLGTGDERG 314
GGG +G + SP V + P+YN+ L+E+ V G PL L P + G + G
Sbjct: 245 --GGGAMVLGGISSPPGMVFSHSDPSRSPYYNIELKEIHVAGKPLKLNPRTFDG---KYG 299
Query: 315 TIIDSGTTLAYLPPMLY----DLVLSQFRF 340
I+DSGTT AY P Y D ++ + F
Sbjct: 300 AILDSGTTYAYFPEKAYYAFKDAIMKKISF 329
>gi|356567196|ref|XP_003551807.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 490
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 90/269 (33%), Positives = 128/269 (47%), Gaps = 31/269 (11%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSR-CPTKSDLGIKLTLFDPSKSSTS 138
+G YF VGLGTP + + DTGSDL W C C+R C + D+ +FDPSKS++
Sbjct: 143 SGNYFVVVGLGTPKRDLSLIFDTGSDLTWTQCEPCARSCYKQQDV-----IFDPSKSTSY 197
Query: 139 GEIACSDNFCR--TTYNNRYPSCSPGVR-CEYVVTYGDGSSTSGYFVRDIIQLNQASGNL 195
I C+ C +T P CS + C Y + YGD S + GYF R+ + +
Sbjct: 198 SNITCTSALCTQLSTATGNDPGCSASTKACIYGIQYGDSSFSVGYFSRERLTVTATD--- 254
Query: 196 KTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL 255
+ + +FGCG G G S G++G G+ S + Q AA RK F++CL
Sbjct: 255 ----VVDNFLFGCGQNNQGLFGGSA-----GLIGLGRHPISFVQQTAA--KYRKIFSYCL 303
Query: 256 DVVKGG-GIFAIGDVVSPK-VKTTP---MVPNMPHYNVILEEVEVGGNPLDLPTSLLGTG 310
G + G + + +K TP + Y + + + VGG L + +S TG
Sbjct: 304 PSTSSSTGHLSFGPAATGRYLKYTPFSTISRGSSFYGLDITAIAVGGVKLPVSSSTFSTG 363
Query: 311 DERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
G IIDSGT + LPP Y + S FR
Sbjct: 364 ---GAIIDSGTVITRLPPTAYGALRSAFR 389
>gi|225458774|ref|XP_002283258.1| PREDICTED: aspartic proteinase-like protein 2 [Vitis vinifera]
gi|302142232|emb|CBI19435.3| unnamed protein product [Vitis vinifera]
Length = 659
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 95/303 (31%), Positives = 139/303 (45%), Gaps = 53/303 (17%)
Query: 40 KAGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSA----------TGLYFTKVGL 89
K+ G R+ A++ RRH L+ HP+A G Y T++ +
Sbjct: 47 KSSGHRQ----AIEGSYWRRH--------LKSDPYHHPNARMRLYDDLLSNGYYTTRLWI 94
Query: 90 GTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCR 149
GTP E+ + VDTGS + +V C+ C C D F P +SST + C+ + C
Sbjct: 95 GTPPQEFALIVDTGSTVTYVPCSDCEHCGKHQD-----PRFQPDESSTYHPVKCNMD-CN 148
Query: 150 TTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCG 209
++ GV C Y Y + SS+SG DII S + P +FGC
Sbjct: 149 CDHD--------GVNCVYERRYAEMSSSSGVLGEDIISFGNQS---EVVP--QRAVFGCE 195
Query: 210 NRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVK-GGGIFAIGD 268
N ++GDL S DGI+G G+ S++ QL + F+ C + GGG +G
Sbjct: 196 NVETGDLYSQR---ADGIMGLGRGQLSIVDQLVDKNVINDSFSLCYGGMHVGGGAMVLGG 252
Query: 269 VVSPK----VKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLA 324
+ P ++ P P+YN+ L+E+ V G PL L S + GT++DSGTT A
Sbjct: 253 IPPPPDMVFSRSDPY--RSPYYNIELKEIHVAGKPLKLSPSTFDR--KHGTVLDSGTTYA 308
Query: 325 YLP 327
YLP
Sbjct: 309 YLP 311
>gi|359492825|ref|XP_002284255.2| PREDICTED: aspartic proteinase-like protein 1-like [Vitis vinifera]
Length = 531
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 92/266 (34%), Positives = 135/266 (50%), Gaps = 25/266 (9%)
Query: 82 LYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKS-----DLGIKLTLFDPSKSS 136
L++T + +GTP + V +D GSDLLWV C C +C S LG L + PS SS
Sbjct: 102 LHYTWIDIGTPNVSFLVALDAGSDLLWVPC-DCMQCAPLSASYYDRLGRDLNEYSPSLSS 160
Query: 137 TSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVT-YGDGSSTSGYFVRDIIQLNQASGNL 195
TS ++C+D C + + S C Y+ + Y + +S+SG + D + L S +
Sbjct: 161 TSKPLSCNDQLCELGSDCK----SSKDPCPYLASYYSENTSSSGLLIEDRLHLAPFSEHA 216
Query: 196 KTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL 255
+ + +SVI GCG +QSG S AA DG++G G + S+ S LA AG VR F+ C
Sbjct: 217 SRSSVWASVIIGCGRKQSGAF--SDGAAPDGLMGLGPGDLSVPSLLAKAGLVRNTFSICF 274
Query: 256 DVVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVE---VGGNPLDLPTSLLGTGDE 312
D G I GD K+T VP + L EVE VG +SL G +
Sbjct: 275 DDNHSGTIL-FGDQGLVTQKSTSFVPLEGKFVTYLIEVEGYLVGS------SSLKTAGFQ 327
Query: 313 RGTIIDSGTTLAYLPPMLYDLVLSQF 338
++DSGT+ +LP +Y+ ++ +F
Sbjct: 328 --ALVDSGTSFTFLPYEIYEKIVVEF 351
>gi|302141912|emb|CBI19115.3| unnamed protein product [Vitis vinifera]
Length = 521
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 92/266 (34%), Positives = 135/266 (50%), Gaps = 25/266 (9%)
Query: 82 LYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKS-----DLGIKLTLFDPSKSS 136
L++T + +GTP + V +D GSDLLWV C C +C S LG L + PS SS
Sbjct: 92 LHYTWIDIGTPNVSFLVALDAGSDLLWVPC-DCMQCAPLSASYYDRLGRDLNEYSPSLSS 150
Query: 137 TSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVT-YGDGSSTSGYFVRDIIQLNQASGNL 195
TS ++C+D C + + S C Y+ + Y + +S+SG + D + L S +
Sbjct: 151 TSKPLSCNDQLCELGSDCK----SSKDPCPYLASYYSENTSSSGLLIEDRLHLAPFSEHA 206
Query: 196 KTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL 255
+ + +SVI GCG +QSG S AA DG++G G + S+ S LA AG VR F+ C
Sbjct: 207 SRSSVWASVIIGCGRKQSGAF--SDGAAPDGLMGLGPGDLSVPSLLAKAGLVRNTFSICF 264
Query: 256 DVVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVE---VGGNPLDLPTSLLGTGDE 312
D G I GD K+T VP + L EVE VG +SL G +
Sbjct: 265 DDNHSGTIL-FGDQGLVTQKSTSFVPLEGKFVTYLIEVEGYLVGS------SSLKTAGFQ 317
Query: 313 RGTIIDSGTTLAYLPPMLYDLVLSQF 338
++DSGT+ +LP +Y+ ++ +F
Sbjct: 318 --ALVDSGTSFTFLPYEIYEKIVVEF 341
>gi|168041176|ref|XP_001773068.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675615|gb|EDQ62108.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 365
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 94/271 (34%), Positives = 133/271 (49%), Gaps = 31/271 (11%)
Query: 78 SATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSST 137
+A G Y V LGTP + V VDTGSDL WV C+ C +C +++D LF P+ S++
Sbjct: 8 AARGEYLATVRLGTPERVFSVIVDTGSDLTWVQCSPCGKCYSQND-----ALFLPNTSTS 62
Query: 138 SGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKT 197
++AC C +P C+ C Y +YGDGS T+G FV D I ++ +G +
Sbjct: 63 FTKLACGSALCNGL---PFPMCN-QTTCVYWYSYGDGSLTTGDFVYDTITMDGINGQKQQ 118
Query: 198 APLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHC--- 254
P + FGCG+ G A DGILG GQ S SQL + N +F++C
Sbjct: 119 VP---NFAFGCGHDNEGSF-----AGADGILGLGQGPLSFHSQLKSVYN--GKFSYCLVD 168
Query: 255 -LDVVKGGGIFAIGDV---VSPKVKTTPMV--PNMP-HYNVILEEVEVGGNPLDLPTSLL 307
L GD + P VK P++ P +P +Y V L + VG N L++ +++
Sbjct: 169 WLAPPTQTSPLLFGDAAVPILPDVKYLPILANPKVPTYYYVKLNGISVGDNLLNISSTVF 228
Query: 308 GTGDE--RGTIIDSGTTLAYLPPMLYDLVLS 336
GTI DSGTT+ L Y VL+
Sbjct: 229 DIDSVGGAGTIFDSGTTVTQLAEAAYKEVLA 259
>gi|115467508|ref|NP_001057353.1| Os06g0268700 [Oryza sativa Japonica Group]
gi|53791766|dbj|BAD53531.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
Group]
gi|53793187|dbj|BAD54393.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
Group]
gi|113595393|dbj|BAF19267.1| Os06g0268700 [Oryza sativa Japonica Group]
gi|125596798|gb|EAZ36578.1| hypothetical protein OsJ_20919 [Oryza sativa Japonica Group]
gi|215767941|dbj|BAH00170.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 538
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 95/275 (34%), Positives = 135/275 (49%), Gaps = 28/275 (10%)
Query: 60 HGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC-AGCSRCP 118
R +S L + GN P G Y+T + +G P Y++ VDTGSDL W+ C A C+ C
Sbjct: 138 EARENSSALLPIRGNVFPD--GQYYTSMYIGNPPRPYFLDVDTGSDLTWIQCDAPCTNCA 195
Query: 119 TKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNR-YPSCSPGVRCEYVVTYGDGSST 177
L+ P K + + D++C+ N+ Y S +C+Y +TY D SS+
Sbjct: 196 KGPH-----PLYKPEKPNV---VPPRDSYCQELQGNQNYGDTS--KQCDYEITYADRSSS 245
Query: 178 SGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSL 237
G RD +QL A G + N +FGCG Q G+L SS A DGILG A SL
Sbjct: 246 MGILARDNMQLITADGERE----NLDFVFGCGYDQQGNLLSSP-ANTDGILGLSNAAISL 300
Query: 238 LSQLAAAGNVRKEFAHCL--DVVKGGGIFAIGDVVSPKVKTTPM-VPNMPH--YNVILEE 292
+QLA+ G + F HC+ D GG +F +GD P+ T M + N P Y+ +++
Sbjct: 301 PTQLASQGIISNVFGHCIAADPSNGGYMF-LGDDYVPRWGMTWMPIRNGPENLYSTEVQK 359
Query: 293 VEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLP 327
V G L++ G I DSG++ YLP
Sbjct: 360 VNYGDQQLNVRRK---AGKLTQVIFDSGSSYTYLP 391
>gi|224033419|gb|ACN35785.1| unknown [Zea mays]
gi|413934980|gb|AFW69531.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
Length = 543
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 106/340 (31%), Positives = 152/340 (44%), Gaps = 40/340 (11%)
Query: 16 VVHQWAVGGGGVMGNFVFEVENKFKAGGERERTLSALKQHDTRRHGRMMASID----LEL 71
VV +WA GG + +++ A G E SAL +HD R + D
Sbjct: 45 VVRRWAEARGGPL------AADQWPARGTPE-YYSALSRHDRARRALAGGADDGLLTFAA 97
Query: 72 GGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWV--NCAGCSRCPTKSDLG---IK 126
G + + S T LY+ +V LGTP + V +DTGSDL WV +C C+ P+ + G
Sbjct: 98 GNDTYQSGT-LYYAEVELGTPNATFLVALDTGSDLFWVPCDCRQCATIPSANGTGQDAPS 156
Query: 127 LTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVR--CEYVVTY-GDGSSTSGYFVR 183
L + P +SSTS ++AC + C + CS C Y V Y +S+SG V+
Sbjct: 157 LRPYSPRRSSTSKQVACDNPLC-----GQRNGCSAATNGSCPYEVQYVSANTSSSGVLVQ 211
Query: 184 DIIQLNQASGNLKTA--PLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQL 241
D++ L + A L + V+FGCG Q+G AVDG++G G S+ S L
Sbjct: 212 DVLHLTRERPGPGAAGEALQAPVVFGCGQVQTGAFLDGGGGAVDGLMGLGMGKVSVPSAL 271
Query: 242 AAAGNVRKE-FAHCLDVVKGGGIFAIGDVVSPKVKTTPMVPNM--PHYNVILEEVEVGGN 298
AA+G V + F+ C G G GD S TP P YNV + VG
Sbjct: 272 AASGLVASDSFSMCFG-DDGVGRVNFGDAGSRGQAETPFTVRSLNPTYNVSFTSIGVGSE 330
Query: 299 PLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
+ E ++DSGT+ YL Y + ++F
Sbjct: 331 SV---------AAEFAAVMDSGTSFTYLSDPEYTQLATKF 361
>gi|225431324|ref|XP_002269880.1| PREDICTED: aspartic proteinase-like protein 1 [Vitis vinifera]
gi|297739017|emb|CBI28369.3| unnamed protein product [Vitis vinifera]
Length = 518
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 109/338 (32%), Positives = 156/338 (46%), Gaps = 43/338 (12%)
Query: 17 VHQWAVGGGGVMGNFVFEVENKFKAGGERERTLSALKQHDTRRHGRMMASIDLEL---GG 73
V +W+ G G + + F+ E L D GR ++ ID L G
Sbjct: 38 VKKWSEGAGNGFPAGNWPAKGSFEYYAE-------LAHRDRALRGRRLSDIDGLLTFSDG 90
Query: 74 NG--HPSATG-LYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRC-PTK-----SDLG 124
N S+ G L++T V LGTP ++ V +DTGSDL WV C CSRC PT+ SD
Sbjct: 91 NSTFRISSLGFLHYTTVSLGTPGKKFLVALDTGSDLFWVPC-DCSRCAPTEGTTYASDF- 148
Query: 125 IKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDG-SSTSGYFVR 183
+L++++P SSTS ++ C ++ C + NR C Y+V+Y +STSG V
Sbjct: 149 -ELSIYNPKGSSTSRKVTCDNSLC--AHRNR--CLGTFSNCPYMVSYVSAETSTSGILVE 203
Query: 184 DIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAA 243
D++ L + + + V FGCG Q+G AA +G+ G G S+ S L+
Sbjct: 204 DVLHLTTEDN--RQEFVEAYVTFGCGQVQTGSFLDI--AAPNGLFGLGLEKISVPSILSK 259
Query: 244 AGNVRKEFAHCLDVVKGGGIFAIGDVVSPKVKTTPMVPNM--PHYNVILEEVEVGGNPLD 301
G F+ C G G + GD SP + TP N P YN+ + +V VG +D
Sbjct: 260 EGFTADSFSMCFG-PDGIGRISFGDKGSPDQEETPFNLNALHPTYNITVTQVRVGTTLID 318
Query: 302 LPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
L + L DSGT+ YL +Y VL F
Sbjct: 319 LDFTAL---------FDSGTSFTYLVDPIYTNVLKSFH 347
>gi|449454654|ref|XP_004145069.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449470632|ref|XP_004153020.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449499016|ref|XP_004160697.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 434
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 92/298 (30%), Positives = 139/298 (46%), Gaps = 35/298 (11%)
Query: 46 ERTLSALKQHDTRRHGRMMASIDLE--LGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTG 103
+R ++AL++ + R+ ++ S E + NG G Y ++ +GTP DTG
Sbjct: 50 DRIVNALRR-SSHRNTVVLESDTAEAPIFNNG-----GEYLVEISVGTPPFSIVAVADTG 103
Query: 104 SDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGV 163
SD++W C CS C + +FDPSKS+T +ACS C +Y+ SCS
Sbjct: 104 SDVIWTQCKPCSNCYQQ-----NAPMFDPSKSTTYKNVACSSPVC--SYSGDGSSCSDDS 156
Query: 164 RCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAA 223
C Y + YGD S + G D + + SG P + GCG+ +G +A
Sbjct: 157 ECLYSIAYGDDSHSQGNLAVDTVTMQSTSGRPVAFP---RTVIGCGHDNAGTF----NAN 209
Query: 224 VDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGG--------GIFAIGDVVSPKVK 275
V GI+G G+ +SL++QL A +F++CL + G + +V
Sbjct: 210 VSGIVGLGRGPASLVTQLGPA--TGGKFSYCLIPIGTGSTNDSTKLNFGSNANVSGSGTV 267
Query: 276 TTPMVPNMPH---YNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPML 330
+TP+ + + Y++ LE V VG + P G E IIDSGTTL YLP L
Sbjct: 268 STPIYSSAQYKTFYSLKLEAVSVGDTKFNFPEGASKLGGESNIIIDSGTTLTYLPSAL 325
>gi|195607464|gb|ACG25562.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 478
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 95/270 (35%), Positives = 129/270 (47%), Gaps = 29/270 (10%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y LGTP ++VDTGSDL WV C CS P S K LFDP++SS+ +
Sbjct: 140 YVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCSAAP--SCYSQKDPLFDPAQSSSYAAVP 197
Query: 143 CSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNS 202
C C S +C YVV+YGDGS+T+G + D + L+ +S
Sbjct: 198 CGGPVC-AGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLSASSA-------VQ 249
Query: 203 SVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKG-G 261
FGCG+ QSG VDG+LG G+ SL+ Q AG F++CL
Sbjct: 250 GFFFGCGHAQSGLFN-----GVDGLLGLGREQPSLVEQ--TAGTYGGVFSYCLPTKPSTA 302
Query: 262 GIFAIG----DVVSPKVKTTPMV--PNMP-HYNVILEEVEVGGNPLDLPTSLLGTGDERG 314
G +G +P TT ++ PN P +Y V+L + VGG L +P S G
Sbjct: 303 GYLTLGLGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAF----AGG 358
Query: 315 TIIDSGTTLAYLPPMLYDLVLSQFRFWIAS 344
T++D+GT + LPP Y + S FR +AS
Sbjct: 359 TVVDTGTVITRLPPTAYAALRSAFRSGMAS 388
>gi|125554848|gb|EAZ00454.1| hypothetical protein OsI_22475 [Oryza sativa Indica Group]
Length = 538
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 95/275 (34%), Positives = 135/275 (49%), Gaps = 28/275 (10%)
Query: 60 HGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC-AGCSRCP 118
R +S L + GN P G Y+T + +G P Y++ VDTGSDL W+ C A C+ C
Sbjct: 138 EARENSSALLPIRGNVFPD--GQYYTSMYIGNPPRPYFLDVDTGSDLTWIQCDAPCTNCA 195
Query: 119 TKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNR-YPSCSPGVRCEYVVTYGDGSST 177
L+ P K + + D++C+ N+ Y S +C+Y +TY D SS+
Sbjct: 196 KGPH-----PLYKPEKPNV---VPPRDSYCQELQGNQNYGDTS--KQCDYEITYADRSSS 245
Query: 178 SGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSL 237
G RD +QL A G + N +FGCG Q G+L SS A DGILG A SL
Sbjct: 246 MGILARDNMQLITADGERE----NLDFVFGCGYDQQGNLLSSP-ANTDGILGLSNAAISL 300
Query: 238 LSQLAAAGNVRKEFAHCL--DVVKGGGIFAIGDVVSPKVKTTPM-VPNMPH--YNVILEE 292
+QLA+ G + F HC+ D GG +F +GD P+ T M + N P Y+ +++
Sbjct: 301 PTQLASQGIISNVFGHCIAADPSNGGYMF-LGDDYVPRWGMTWMPIRNGPENLYSTEVQK 359
Query: 293 VEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLP 327
V G L++ G I DSG++ YLP
Sbjct: 360 VNYGDQQLNVRRK---AGKLTQVIFDSGSSYTYLP 391
>gi|25347778|pir||B84556 hypothetical protein At2g17760 [imported] - Arabidopsis thaliana
Length = 473
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 89/265 (33%), Positives = 129/265 (48%), Gaps = 28/265 (10%)
Query: 82 LYFTKVGLGTPTDEYYVQVDTGSDLLWV--NCAGCSR---CPTKSDLGIKLTLFDPSKSS 136
L++ V +GTP+D + V +DTGSDL W+ +C C R P S L L ++ P+ SS
Sbjct: 54 LHYANVTVGTPSDWFMVALDTGSDLFWLPCDCTNCVRELKAPGGSSL--DLNIYSPNASS 111
Query: 137 TSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTY-GDGSSTSGYFVRDIIQLNQASGNL 195
TS ++ C+ C T +R SP C Y + Y +G+S++G V D++ L +
Sbjct: 112 TSTKVPCNSTLC--TRGDR--CASPESDCPYQIRYLSNGTSSTGVLVEDVLHLVSNDKSS 167
Query: 196 KTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL 255
K P + V FGCG Q+G AA +G+ G G + S+ S LA G F+ C
Sbjct: 168 KAIP--ARVTFGCGQVQTGVFHDG--AAPNGLFGLGLEDISVPSVLAKEGIAANSFSMCF 223
Query: 256 DVVKGGGIFAIGDVVSPKVKTTPMVPNMPH--YNVILEEVEVGGNPLDLPTSLLGTGDER 313
G G + GD S + TP+ PH YN+ + ++ VGGN DL E
Sbjct: 224 G-NDGAGRISFGDKGSVDQRETPLNIRQPHPTYNITVTKISVGGNTGDL---------EF 273
Query: 314 GTIIDSGTTLAYLPPMLYDLVLSQF 338
+ DSGT+ YL Y L+ F
Sbjct: 274 DAVFDSGTSFTYLTDAAYTLISESF 298
>gi|30680102|ref|NP_849967.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|17978947|gb|AAL47439.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
gi|22655368|gb|AAM98276.1| At2g17760/At2g17760 [Arabidopsis thaliana]
gi|330251585|gb|AEC06679.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 513
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 89/266 (33%), Positives = 129/266 (48%), Gaps = 30/266 (11%)
Query: 82 LYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRC------PTKSDLGIKLTLFDPSKS 135
L++ V +GTP+D + V +DTGSDL W+ C C+ C P S L L ++ P+ S
Sbjct: 103 LHYANVTVGTPSDWFMVALDTGSDLFWLPC-DCTNCVRELKAPGGSSL--DLNIYSPNAS 159
Query: 136 STSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTY-GDGSSTSGYFVRDIIQLNQASGN 194
STS ++ C+ C T +R SP C Y + Y +G+S++G V D++ L +
Sbjct: 160 STSTKVPCNSTLC--TRGDR--CASPESDCPYQIRYLSNGTSSTGVLVEDVLHLVSNDKS 215
Query: 195 LKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHC 254
K P + V FGCG Q+G AA +G+ G G + S+ S LA G F+ C
Sbjct: 216 SKAIP--ARVTFGCGQVQTGVFHDG--AAPNGLFGLGLEDISVPSVLAKEGIAANSFSMC 271
Query: 255 LDVVKGGGIFAIGDVVSPKVKTTPMVPNMPH--YNVILEEVEVGGNPLDLPTSLLGTGDE 312
G G + GD S + TP+ PH YN+ + ++ VGGN DL E
Sbjct: 272 FG-NDGAGRISFGDKGSVDQRETPLNIRQPHPTYNITVTKISVGGNTGDL---------E 321
Query: 313 RGTIIDSGTTLAYLPPMLYDLVLSQF 338
+ DSGT+ YL Y L+ F
Sbjct: 322 FDAVFDSGTSFTYLTDAAYTLISESF 347
>gi|242095602|ref|XP_002438291.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
gi|241916514|gb|EER89658.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
Length = 464
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 91/275 (33%), Positives = 137/275 (49%), Gaps = 34/275 (12%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCS--RCPTKSDLGIKLTLFD 131
+G+ T Y V +GTP + +DTGSD+ WV CA C+ C ++ D LFD
Sbjct: 120 SGYSLGTTEYVITVTIGTPAVTQVMSIDTGSDVSWVQCAPCAAQSCSSQKD-----KLFD 174
Query: 132 PSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQA 191
P+ S+T +C C + C +C+Y+V YGDGS+T+G + D + L +
Sbjct: 175 PAMSATYSAFSCGSAQC-AQLGDEGNGCLKS-QCQYIVKYGDGSNTAGTYGSDTLSLT-S 231
Query: 192 SGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEF 251
S +K S FGC +R +G +G +DG++G G SL+SQ AA K F
Sbjct: 232 SDAVK------SFQFGCSHRAAGFVGE-----LDGLMGLGGDTESLVSQTAA--TYGKAF 278
Query: 252 AHCL--DVVKGGGIF---AIGDVVSPKVKTTPMVP-NMP-HYNVILEEVEVGGNPLDLPT 304
++CL GGG A G S + TPMV ++P Y V L+ + V G L++P
Sbjct: 279 SYCLPPPSSSGGGFLTLGAAGGASSSRYSHTPMVRFSVPTFYGVFLQGITVAGTMLNVPA 338
Query: 305 SLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
S+ +++DSGT + LPP Y + + F+
Sbjct: 339 SVF----SGASVVDSGTVITQLPPTAYQALRTAFK 369
>gi|115473125|ref|NP_001060161.1| Os07g0592200 [Oryza sativa Japonica Group]
gi|29027762|dbj|BAC65898.1| putative CND41, chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|113611697|dbj|BAF22075.1| Os07g0592200 [Oryza sativa Japonica Group]
Length = 631
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 86/254 (33%), Positives = 126/254 (49%), Gaps = 35/254 (13%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGE 140
G Y T++ +GTP+ E+ + VD+GS + +V CA C +C D F P SST
Sbjct: 89 GYYTTRLYIGTPSQEFALIVDSGSTVTYVPCATCEQCGNHQD-----PRFQPDLSSTYSP 143
Query: 141 IACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPL 200
+ C N T N R +C Y Y + SS+SG DI+ + S LK
Sbjct: 144 VKC--NVDCTCDNER-------SQCTYERQYAEMSSSSGVLGEDIMSFGKES-ELKP--- 190
Query: 201 NSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHC---LDV 257
+FGC N ++GDL S DGI+G G+ S++ QL G + F+ C +DV
Sbjct: 191 -QRAVFGCENTETGDLFSQ---HADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDV 246
Query: 258 VKGGGIFAIGDVVSPK----VKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDER 313
GGG +G + +P + P+ P+YN+ L+E+ V G L L + + +
Sbjct: 247 --GGGTMVLGGMPAPPDMVFSHSNPV--RSPYYNIELKEIHVAGKALRLDPKIFNS--KH 300
Query: 314 GTIIDSGTTLAYLP 327
GT++DSGTT AYLP
Sbjct: 301 GTVLDSGTTYAYLP 314
>gi|356540982|ref|XP_003538963.1| PREDICTED: uncharacterized protein LOC100811106 [Glycine max]
Length = 813
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 61/131 (46%), Positives = 82/131 (62%), Gaps = 31/131 (23%)
Query: 175 SSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFG--------------------------- 207
+++GY+V+D + N +GNL+TAP NSS+IFG
Sbjct: 640 KNSTGYYVQDYLTYNHVNGNLRTAPQNSSIIFGRIMPAVNVQYERIILVVNGIFILLSQL 699
Query: 208 ----CGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGI 263
CG QS SS++ A+DGI+GFGQ+NSS+LSQLAA+G V+K F+HCLD ++GGGI
Sbjct: 700 FLVMCGAVQSVTFSSSSEEALDGIIGFGQSNSSVLSQLAASGKVKKIFSHCLDNIRGGGI 759
Query: 264 FAIGDVVSPKV 274
FAIG+VV PKV
Sbjct: 760 FAIGEVVEPKV 770
>gi|326503794|dbj|BAK02683.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 456
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 93/269 (34%), Positives = 134/269 (49%), Gaps = 35/269 (13%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y VG+G+P + +DTGSD+ WV C +D LTLFDPSKS+T +
Sbjct: 129 YVITVGIGSPAVTQTMMIDTGSDVSWVRC-------NSTD---GLTLFDPSKSTTYAPFS 178
Query: 143 CSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNS 202
CS C NN + G C+Y V YGDGS+T+G + D + L+ AS + +
Sbjct: 179 CSSAACAQLGNNGDGCSNSG--CQYRVQYGDGSNTTGTYSSDTLALS-ASDTV------T 229
Query: 203 SVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL---DVVK 259
FGC + + G +DG++G G SL+SQ AA K F++CL +
Sbjct: 230 DFHFGCSHHEEDFDGEK----IDGLMGLGGDAQSLVSQTAA--TYGKSFSYCLPPTNRTS 283
Query: 260 GGGIFAIGDVVSPKVKTTPMV--PNMPH-YNVILEEVEVGGNPLDLPTSLLGTGDERGTI 316
G F + S TTPM+ P P Y V+L+++ VGG PL + S+L G++
Sbjct: 284 GFLTFGAPNGTSGGFVTTPMLRWPKAPTLYGVLLQDISVGGTPLGIQPSVL----SNGSV 339
Query: 317 IDSGTTLAYLPPMLYDLVLSQFRFWIASL 345
+DSGT + +LP Y + S FR + L
Sbjct: 340 MDSGTVITWLPRRAYSALSSAFRSSMTRL 368
>gi|297814776|ref|XP_002875271.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
lyrata]
gi|297321109|gb|EFH51530.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
lyrata]
Length = 451
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 100/285 (35%), Positives = 133/285 (46%), Gaps = 31/285 (10%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPS 133
+G S +G YF + +G P + DTGSDL+WV C+ C C S T+F P
Sbjct: 74 SGASSGSGQYFVDLRIGQPPQSLLLIADTGSDLVWVKCSACRNCSHHS----PATVFFPR 129
Query: 134 KSSTSGEIACSDNFCRTTYN-NRYPSCSP---GVRCEYVVTYGDGSSTSGYFVRDIIQLN 189
SST C D CR R P C+ C Y Y DGS TSG F R+ L
Sbjct: 130 HSSTFSPAHCYDPVCRLVPKPGRAPRCNHTRIHSTCPYEYGYADGSLTSGLFARETTSLK 189
Query: 190 QASGNLKTAPLNSSVIFGCGNRQSGDLGSSTD-AAVDGILGFGQANSSLLSQLAAA-GNV 247
+SG K A L SV FGCG R SG S T +G++G G+ S SQL GN
Sbjct: 190 TSSG--KEAKLK-SVAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQLGRRFGN- 245
Query: 248 RKEFAHCL-----------DVVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVG 296
+F++CL ++ G G A+ + + T P+ P Y V L+ V V
Sbjct: 246 --KFSYCLMDYTLSPPPTSYLIIGDGGDAVSKLFFTPLLTNPLSPTF--YYVKLKSVFVN 301
Query: 297 GNPLDLPTSLLGTGDE--RGTIIDSGTTLAYLPPMLYDLVLSQFR 339
G L + S+ D GT++DSGTTLA+L Y LV++ +
Sbjct: 302 GAKLRIDPSIWEIDDSGNGGTVMDSGTTLAFLADPAYRLVIAAVK 346
>gi|297847186|ref|XP_002891474.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297337316|gb|EFH67733.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 578
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 103/346 (29%), Positives = 161/346 (46%), Gaps = 58/346 (16%)
Query: 30 NFVFEVENKFKAGGERERTLSALKQHDTRRHGRMMASIDLEL------------------ 71
+FVF V +K +A ER L+ + +G+ + S+DLEL
Sbjct: 124 SFVFPVYHKLRAREFHERILA---EDLGLENGKFVESMDLELVNPVKVNDVLSTSAGSID 180
Query: 72 --------GGNGHPSATGLYFTKVGLGTPTD--EYYVQVDTGSDLLWVNC-AGCSRCPTK 120
GGN +P GLY+T++ +G P D Y++ +DTGSDL W+ C A C+ C
Sbjct: 181 SSTTIFPVGGNVYPD--GLYYTRILVGKPEDGQYYHLDIDTGSDLTWIQCDAPCTSCAKG 238
Query: 121 SDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRY-PSCSPGVRCEYVVTYGDGSSTSG 179
++ L+ P K + + S+ FC N+ C +C+Y + Y D S + G
Sbjct: 239 AN-----QLYKPRKDNL---VRSSEPFCVEVQRNQLTEHCESCHQCDYEIEYADHSYSMG 290
Query: 180 YFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLS 239
+D L +G+L S ++FGCG Q G L +T DGILG +A SL S
Sbjct: 291 VLTKDKFHLKLHNGSLA----ESDIVFGCGYDQQG-LLLNTLLKTDGILGLSRAKISLPS 345
Query: 240 QLAAAGNVRKEFAHCL--DVVKGGGIFAIGDVV-SPKVKTTPMV--PNMPHYNVILEEVE 294
QLA+ G + HCL D+ G IF D+V S + PM+ P++ Y + + ++
Sbjct: 346 QLASRGIISNVVGHCLASDLNGEGYIFMGSDLVPSHGMTWVPMLHHPHLEVYQMQVTKMS 405
Query: 295 VGGNPLDLPTSLLGTGDERGTII-DSGTTLAYLPPMLYDLVLSQFR 339
G L SL G G ++ D+G++ Y P Y +++ +
Sbjct: 406 YGNAML----SLDGENGRVGKVLFDTGSSYTYFPNQAYSQLVTSLQ 447
>gi|225216995|gb|ACN85284.1| aspartic proteinase nepenthesin-1 precursor [Oryza australiensis]
Length = 519
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 100/278 (35%), Positives = 128/278 (46%), Gaps = 33/278 (11%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPS 133
+G TG Y VGLGTP Y V DTGSD WV C C + + LFDP+
Sbjct: 171 SGRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQ----REKLFDPA 226
Query: 134 KSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASG 193
+SST I+C+ C + + R CS G C Y V YGDGS + G+F D + L+
Sbjct: 227 RSSTYANISCAAPAC-SDLDTR--GCSGG-NCLYGVQYGDGSYSIGFFAMDTLTLSSYDA 282
Query: 194 NLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQ-LAAAGNVRKEFA 252
FGCG R G G + G+LG G+ +SL Q G V FA
Sbjct: 283 -------VKGFRFGCGERNEGLFGEAA-----GLLGLGRGKTSLPVQTYDKYGGV---FA 327
Query: 253 HCLDVVKGGGIFAIGDVVSPKVK----TTPMVP-NMP-HYNVILEEVEVGGNPLDLPTSL 306
HCL G + SP TTPM+ N P Y V + + VGG L +P S+
Sbjct: 328 HCLPARSSGTGYLDFGPGSPAAAGARLTTPMLTDNGPTFYYVGMTGIRVGGQLLSIPQSV 387
Query: 307 LGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFRFWIAS 344
T GTI+DSGT + LPP Y + S F +A+
Sbjct: 388 FTTA---GTIVDSGTVITRLPPAAYSSLRSAFASAMAA 422
>gi|222642011|gb|EEE70143.1| hypothetical protein OsJ_30189 [Oryza sativa Japonica Group]
Length = 671
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 94/272 (34%), Positives = 132/272 (48%), Gaps = 31/272 (11%)
Query: 82 LYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRC-PTKSDL--GIKLTLFDPSKSSTS 138
L++ V LGTP + V +DTGSDL WV C C +C P +S +K ++ P++S+TS
Sbjct: 34 LHYAVVALGTPNVTFLVALDTGSDLFWVPC-DCLKCAPFQSPNYGSLKFDVYSPAQSTTS 92
Query: 139 GEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTY-GDGSSTSGYFVRDIIQLNQASGNLK- 196
++ CS N C R S S C Y + Y D +S+SG V D++ L S K
Sbjct: 93 RKVPCSSNLCDLQNACRSKSNS----CPYSIQYLSDNTSSSGVLVEDVLYLTSDSAQSKI 148
Query: 197 -TAPLNSSVIFGCGNRQSGD-LGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHC 254
TAP ++FGCG Q+G LGS AA +G+LG G + S+ S LA+ G F+ C
Sbjct: 149 VTAP----IMFGCGQVQTGSFLGS---AAPNGLLGLGMDSKSVPSLLASKGLAANSFSMC 201
Query: 255 LDVVKGGGIFAIGDVVSPKVKTTPM--VPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDE 312
G G GD S K TP+ P+YN+ + + VG + E
Sbjct: 202 FG-DDGHGRINFGDTGSSDQKETPLNVYKQNPYYNITITGITVGSKSIST---------E 251
Query: 313 RGTIIDSGTTLAYLPPMLYDLVLSQFRFWIAS 344
I+DSGT+ L +Y + S F I S
Sbjct: 252 FSAIVDSGTSFTALSDPMYTQITSSFDAQIRS 283
>gi|115448349|ref|NP_001047954.1| Os02g0720600 [Oryza sativa Japonica Group]
gi|45735841|dbj|BAD12876.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
[Oryza sativa Japonica Group]
gi|45735967|dbj|BAD12996.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
[Oryza sativa Japonica Group]
gi|113537485|dbj|BAF09868.1| Os02g0720600 [Oryza sativa Japonica Group]
gi|125583492|gb|EAZ24423.1| hypothetical protein OsJ_08176 [Oryza sativa Japonica Group]
gi|215740989|dbj|BAG97484.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 463
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 89/265 (33%), Positives = 127/265 (47%), Gaps = 31/265 (11%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y VGLGTP V +DTGSD+ WV C C P + G LFDP+KSST ++
Sbjct: 127 YVISVGLGTPAVTQTVTIDTGSDVSWVQCNPCPNPPCYAQTG---ALFDPAKSSTYRAVS 183
Query: 143 CSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNS 202
C+ C + C+Y V YGDGS+T+G + RD + L+ AS +K
Sbjct: 184 CAAAECAQLEQQGNGCGATNYECQYGVQYGDGSTTNGTYSRDTLTLSGASDAVK------ 237
Query: 203 SVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAA-GNVRKEFAHCLDVVKGG 261
FGC + +SG DG++G G SL+SQ AAA GN F++CL G
Sbjct: 238 GFQFGCSHVESG-----FSDQTDGLMGLGGGAQSLVSQTAAAYGN---SFSYCLPPTSGS 289
Query: 262 -------GIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERG 314
G + V+ ++ + +P Y L+++ VGG L L S+ G
Sbjct: 290 SGFLTLGGGGGVSGFVTTRMLRSRQIPTF--YGARLQDIAVGGKQLGLSPSVFAA----G 343
Query: 315 TIIDSGTTLAYLPPMLYDLVLSQFR 339
+++DSGT + LPP Y + S F+
Sbjct: 344 SVVDSGTIITRLPPTAYSALSSAFK 368
>gi|32526671|dbj|BAC79194.1| chloroplast nucleoid DNA-binding protein -like protein [Oryza
sativa Japonica Group]
Length = 732
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 94/272 (34%), Positives = 132/272 (48%), Gaps = 31/272 (11%)
Query: 82 LYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRC-PTKSDL--GIKLTLFDPSKSSTS 138
L++ V LGTP + V +DTGSDL WV C C +C P +S +K ++ P++S+TS
Sbjct: 98 LHYAVVALGTPNVTFLVALDTGSDLFWVPC-DCLKCAPFQSPNYGSLKFDVYSPAQSTTS 156
Query: 139 GEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTY-GDGSSTSGYFVRDIIQLNQASGNLK- 196
++ CS N C R S S C Y + Y D +S+SG V D++ L S K
Sbjct: 157 RKVPCSSNLCDLQNACRSKSNS----CPYSIQYLSDNTSSSGVLVEDVLYLTSDSAQSKI 212
Query: 197 -TAPLNSSVIFGCGNRQSGD-LGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHC 254
TAP ++FGCG Q+G LGS AA +G+LG G + S+ S LA+ G F+ C
Sbjct: 213 VTAP----IMFGCGQVQTGSFLGS---AAPNGLLGLGMDSKSVPSLLASKGLAANSFSMC 265
Query: 255 LDVVKGGGIFAIGDVVSPKVKTTPM--VPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDE 312
G G GD S K TP+ P+YN+ + + VG + E
Sbjct: 266 FG-DDGHGRINFGDTGSSDQKETPLNVYKQNPYYNITITGITVGSKSIST---------E 315
Query: 313 RGTIIDSGTTLAYLPPMLYDLVLSQFRFWIAS 344
I+DSGT+ L +Y + S F I S
Sbjct: 316 FSAIVDSGTSFTALSDPMYTQITSSFDAQIRS 347
>gi|218202547|gb|EEC84974.1| hypothetical protein OsI_32231 [Oryza sativa Indica Group]
Length = 513
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 93/272 (34%), Positives = 133/272 (48%), Gaps = 31/272 (11%)
Query: 82 LYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRC-PTKSDL--GIKLTLFDPSKSSTS 138
L++ V LGTP + V +DTGSDL WV C C +C P +S +K ++ P++S+TS
Sbjct: 98 LHYAVVALGTPNVTFLVALDTGSDLFWVPC-DCLKCAPLQSPNYGSLKFDVYSPAQSTTS 156
Query: 139 GEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTY-GDGSSTSGYFVRDIIQL--NQASGNL 195
++ CS N C R S S C Y + Y D +S+SG V D++ L + A +
Sbjct: 157 RKVPCSSNLCDLQNACRSKSNS----CPYSIQYLSDNTSSSGVLVEDVLYLTSDSAQSKI 212
Query: 196 KTAPLNSSVIFGCGNRQSGD-LGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHC 254
TAP ++FGCG Q+G LGS AA +G+LG G + S+ S LA+ G F+ C
Sbjct: 213 VTAP----IMFGCGQVQTGSFLGS---AAPNGLLGLGMDSKSVPSLLASKGLAANSFSMC 265
Query: 255 LDVVKGGGIFAIGDVVSPKVKTTPM--VPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDE 312
G G GD S K TP+ P+YN+ + + VG + E
Sbjct: 266 FG-DDGHGRINFGDTGSSDQKETPLNVYKQNPYYNITITGITVGSKSI---------STE 315
Query: 313 RGTIIDSGTTLAYLPPMLYDLVLSQFRFWIAS 344
I+DSGT+ L +Y + S F I S
Sbjct: 316 FSAIVDSGTSFTALSDPMYTQITSSFDAQIRS 347
>gi|255548662|ref|XP_002515387.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545331|gb|EEF46836.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 463
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 84/267 (31%), Positives = 124/267 (46%), Gaps = 32/267 (11%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSG 139
TG Y +GLG+P + + DTGSDL W C+ FDP+KS++
Sbjct: 131 TGNYIVSIGLGSPKKDLMLIFDTGSDLTWARCSAAET-------------FDPTKSTSYA 177
Query: 140 EIACSDNFCRTTYNNR-YPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTA 198
++CS C + + PS C Y + YGDGS + G+ ++ + + +
Sbjct: 178 NVSCSTPLCSSVISATGNPSRCAASTCVYGIQYGDGSYSIGFLGKERLTIG-------ST 230
Query: 199 PLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVV 258
+ ++ FGCG G G + G+LG G+ S++SQ A N + F++CL
Sbjct: 231 DIFNNFYFGCGQDVDGLFGKAA-----GLLGLGRDKLSVVSQTAPKYN--QLFSYCLPSS 283
Query: 259 KGGGIFAIGDVVSPKVKTTPMVPN-MPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTII 317
G + G S K TP+ YN+ L + VGG L +P S+ T GTII
Sbjct: 284 SSTGFLSFGSSQSKSAKFTPLSSGPSSFYNLDLTGITVGGQKLAIPLSVFSTA---GTII 340
Query: 318 DSGTTLAYLPPMLYDLVLSQFRFWIAS 344
DSGT + LPP Y + S FR +AS
Sbjct: 341 DSGTVVTRLPPAAYSALRSAFRKAMAS 367
>gi|15230868|ref|NP_189198.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|11994761|dbj|BAB03090.1| chloroplast nucleoid DNA binding protein-like; nucellin-like
protein [Arabidopsis thaliana]
gi|189339286|gb|ACD89063.1| At3g25700 [Arabidopsis thaliana]
gi|332643533|gb|AEE77054.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 452
Score = 118 bits (295), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 100/285 (35%), Positives = 132/285 (46%), Gaps = 31/285 (10%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPS 133
+G S +G YF + +G P + DTGSDL+WV C+ C C S T+F P
Sbjct: 75 SGAASGSGQYFVDLRIGQPPQSLLLIADTGSDLVWVKCSACRNCSHHS----PATVFFPR 130
Query: 134 KSSTSGEIACSDNFCRTTYN-NRYPSCSP---GVRCEYVVTYGDGSSTSGYFVRDIIQLN 189
SST C D CR +R P C+ C Y Y DGS TSG F R+ L
Sbjct: 131 HSSTFSPAHCYDPVCRLVPKPDRAPICNHTRIHSTCHYEYGYADGSLTSGLFARETTSLK 190
Query: 190 QASGNLKTAPLNSSVIFGCGNRQSGDLGSSTD-AAVDGILGFGQANSSLLSQLAAA-GNV 247
+SG K A L SV FGCG R SG S T +G++G G+ S SQL GN
Sbjct: 191 TSSG--KEARLK-SVAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQLGRRFGN- 246
Query: 248 RKEFAHCL-----------DVVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVG 296
+F++CL ++ G G I + + T P+ P Y V L+ V V
Sbjct: 247 --KFSYCLMDYTLSPPPTSYLIIGNGGDGISKLFFTPLLTNPLSPTF--YYVKLKSVFVN 302
Query: 297 GNPLDLPTSLLGTGDE--RGTIIDSGTTLAYLPPMLYDLVLSQFR 339
G L + S+ D GT++DSGTTLA+L Y V++ R
Sbjct: 303 GAKLRIDPSIWEIDDSGNGGTVVDSGTTLAFLAEPAYRSVIAAVR 347
>gi|293333012|ref|NP_001168410.1| uncharacterized protein LOC100382179 precursor [Zea mays]
gi|223948083|gb|ACN28125.1| unknown [Zea mays]
gi|413953783|gb|AFW86432.1| hypothetical protein ZEAMMB73_737575 [Zea mays]
Length = 466
Score = 118 bits (295), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 94/307 (30%), Positives = 143/307 (46%), Gaps = 41/307 (13%)
Query: 49 LSALKQHDTRRHGRMMASIDLELGGNGHPSATGL------YFTKVGLGTPTDEYYVQVDT 102
L A H R ++ +L+ G P+++G Y V LGTP + +DT
Sbjct: 90 LRAANIHAKLSSPRNSSAKELQQSGVTIPTSSGYSLGTPEYVITVSLGTPAVTQVMSIDT 149
Query: 103 GSDLLWVNCAGCS--RCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCS 160
GSD+ WV CA C+ C ++ D LFDP+KS+T +CS C C
Sbjct: 150 GSDVSWVQCAPCAAQSCSSQKD-----KLFDPAKSATYSAFSCSSAQC-AQLGGEGNGCL 203
Query: 161 PGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSST 220
C+Y+V Y D S+T+G + D + L T+ + FGC +R +G +G
Sbjct: 204 -NSHCQYIVKYVDHSNTTGTYGSDTL-------GLTTSDAVKNFQFGCSHRANGFVGQ-- 253
Query: 221 DAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL--DVVKGGGIFAIGDVV----SPKV 274
+DG++G G SL+SQ AA K F++CL GG +G S +
Sbjct: 254 ---LDGLMGLGGDTESLVSQTAA--TYGKAFSYCLPPSSSSAGGFLTLGAAAGGTSSSRY 308
Query: 275 KTTPMVP-NMP-HYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYD 332
TP+V N+P Y V L+ + V G L++P S+ +++DSGT + LPP Y
Sbjct: 309 SRTPLVRFNVPTFYGVFLQAITVAGTKLNVPASVF----SGASVVDSGTVITQLPPTAYQ 364
Query: 333 LVLSQFR 339
+ + F+
Sbjct: 365 ALRTAFK 371
>gi|52076082|dbj|BAD46595.1| aspartic proteinase nepenthesin II -like [Oryza sativa Japonica
Group]
Length = 476
Score = 117 bits (294), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 93/272 (34%), Positives = 133/272 (48%), Gaps = 31/272 (11%)
Query: 82 LYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRC-PTKSDL--GIKLTLFDPSKSSTS 138
L++ V LGTP + V +DTGSDL WV C C +C P +S +K ++ P++S+TS
Sbjct: 61 LHYAVVALGTPNVTFLVALDTGSDLFWVPC-DCLKCAPFQSPNYGSLKFDVYSPAQSTTS 119
Query: 139 GEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTY-GDGSSTSGYFVRDIIQL--NQASGNL 195
++ CS N C R S S C Y + Y D +S+SG V D++ L + A +
Sbjct: 120 RKVPCSSNLCDLQNACRSKSNS----CPYSIQYLSDNTSSSGVLVEDVLYLTSDSAQSKI 175
Query: 196 KTAPLNSSVIFGCGNRQSGD-LGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHC 254
TAP ++FGCG Q+G LGS AA +G+LG G + S+ S LA+ G F+ C
Sbjct: 176 VTAP----IMFGCGQVQTGSFLGS---AAPNGLLGLGMDSKSVPSLLASKGLAANSFSMC 228
Query: 255 LDVVKGGGIFAIGDVVSPKVKTTPM--VPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDE 312
G G GD S K TP+ P+YN+ + + VG + E
Sbjct: 229 FG-DDGHGRINFGDTGSSDQKETPLNVYKQNPYYNITITGITVGSKSI---------STE 278
Query: 313 RGTIIDSGTTLAYLPPMLYDLVLSQFRFWIAS 344
I+DSGT+ L +Y + S F I S
Sbjct: 279 FSAIVDSGTSFTALSDPMYTQITSSFDAQIRS 310
>gi|255576176|ref|XP_002528982.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223531572|gb|EEF33401.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 542
Score = 117 bits (294), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 91/267 (34%), Positives = 131/267 (49%), Gaps = 27/267 (10%)
Query: 82 LYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPT-----KSDLGIKLTLFDPSKSS 136
L++T + +GTP + V +D GSDLLWV C C +C S L L + PS SS
Sbjct: 112 LHYTWIDIGTPHVSFLVALDAGSDLLWVPC-DCLQCAPLSASYYSSLDRDLNEYSPSHSS 170
Query: 137 TSGEIACSDNFCRTTYNNRYPSC-SPGVRCEYVVT-YGDGSSTSGYFVRDIIQLNQASGN 194
TS ++CS C P+C SP C Y + Y + +S+SG V DI+ L N
Sbjct: 171 TSKHLSCSHQLCELG-----PNCNSPKQPCPYSMDYYTENTSSSGLLVEDILHLASNGDN 225
Query: 195 LKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHC 254
+ + + V+ GCG +QSG G A DG++G G A S+ S LA AG +R F+ C
Sbjct: 226 ALSYSVRAPVVIGCGMKQSG--GYLDGVAPDGLMGLGLAEISVPSFLAKAGLIRNSFSMC 283
Query: 255 LDVVKGGGIFAIGDVVSPKVKTTPMVP---NMPHYNVILEEVEVGGNPLDLPTSLLGTGD 311
D G IF GD ++TP + N Y V +E VG +S L
Sbjct: 284 FDEDDSGRIF-FGDQGPTTQQSTPFLTLDGNYTTYVVGVEGFCVG-------SSCLKQTS 335
Query: 312 ERGTIIDSGTTLAYLPPMLYDLVLSQF 338
R ++D+GT+ +LP +Y+ + +F
Sbjct: 336 FRA-LVDTGTSFTFLPNGVYERITEEF 361
>gi|115480451|ref|NP_001063819.1| Os09g0542100 [Oryza sativa Japonica Group]
gi|113632052|dbj|BAF25733.1| Os09g0542100, partial [Oryza sativa Japonica Group]
Length = 490
Score = 117 bits (294), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 93/272 (34%), Positives = 133/272 (48%), Gaps = 31/272 (11%)
Query: 82 LYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRC-PTKSDL--GIKLTLFDPSKSSTS 138
L++ V LGTP + V +DTGSDL WV C C +C P +S +K ++ P++S+TS
Sbjct: 75 LHYAVVALGTPNVTFLVALDTGSDLFWVPC-DCLKCAPFQSPNYGSLKFDVYSPAQSTTS 133
Query: 139 GEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTY-GDGSSTSGYFVRDIIQL--NQASGNL 195
++ CS N C R S S C Y + Y D +S+SG V D++ L + A +
Sbjct: 134 RKVPCSSNLCDLQNACRSKSNS----CPYSIQYLSDNTSSSGVLVEDVLYLTSDSAQSKI 189
Query: 196 KTAPLNSSVIFGCGNRQSGD-LGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHC 254
TAP ++FGCG Q+G LGS AA +G+LG G + S+ S LA+ G F+ C
Sbjct: 190 VTAP----IMFGCGQVQTGSFLGS---AAPNGLLGLGMDSKSVPSLLASKGLAANSFSMC 242
Query: 255 LDVVKGGGIFAIGDVVSPKVKTTPM--VPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDE 312
G G GD S K TP+ P+YN+ + + VG + E
Sbjct: 243 FG-DDGHGRINFGDTGSSDQKETPLNVYKQNPYYNITITGITVGSKSI---------STE 292
Query: 313 RGTIIDSGTTLAYLPPMLYDLVLSQFRFWIAS 344
I+DSGT+ L +Y + S F I S
Sbjct: 293 FSAIVDSGTSFTALSDPMYTQITSSFDAQIRS 324
>gi|357492389|ref|XP_003616483.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517818|gb|AES99441.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 117 bits (294), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 99/347 (28%), Positives = 154/347 (44%), Gaps = 36/347 (10%)
Query: 1 MGGLRLLALVVVTVAVVHQWAVGGGGVMGNFVFEVENKFKAGGERERTLSALKQH--DTR 58
M L L L + ++ + ++ + F E+ ++ + QH D
Sbjct: 1 MNTLSFLTLSLFSLCFIASFS---HALSNGFSVELIHRDSPKSPYYKPTENKYQHFVDAA 57
Query: 59 RHGRMMASIDLELGGNGHPSAT-----GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAG 113
R A+ + P +T G Y +GTP + Y DTGSD++W+ C
Sbjct: 58 RRSINRANHFFKDSDTSTPESTVIPDRGGYLMTYSVGTPPTKIYGIADTGSDIVWLQCEP 117
Query: 114 CSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGD 173
C +C ++ +F+PSKSS+ I CS C + R SCS C+Y ++YGD
Sbjct: 118 CEQCYNQT-----TPIFNPSKSSSYKNIPCSSKLCHSV---RDTSCSDQNSCQYKISYGD 169
Query: 174 GSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQA 233
S + G D + L SG+ + P ++ GCG +G G A GI+G G
Sbjct: 170 SSHSQGDLSVDTLSLESTSGSPVSFP---KIVIGCGTDNAGTFG----GASSGIVGLGGG 222
Query: 234 NSSLLSQLAAAGNVRKEFAHCL-----DVVKGGGIFAIGD--VVSPK-VKTTPMVPNMP- 284
SL++QL ++ + +F++CL I + GD VVS V +TP++ P
Sbjct: 223 PVSLITQLGSS--IGGKFSYCLVPLLNKESNASSILSFGDAAVVSGDGVVSTPLIKKDPV 280
Query: 285 HYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLY 331
Y + L+ VG ++ S G DE IIDSGTTL +P +Y
Sbjct: 281 FYFLTLQAFSVGNKRVEFGGSSEGGDDEGNIIIDSGTTLTLIPSDVY 327
>gi|225216988|gb|ACN85278.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
Length = 518
Score = 117 bits (294), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 99/279 (35%), Positives = 128/279 (45%), Gaps = 35/279 (12%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGC-SRCPTKSDLGIKLTLFDP 132
+G TG Y VGLGTP Y V DTGSD WV C C C + + LFDP
Sbjct: 170 SGRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQQE-----KLFDP 224
Query: 133 SKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQAS 192
++SST ++C+ C ++ CS G C Y V YGDGS + G+F D + L+
Sbjct: 225 ARSSTYANVSCAAPAC---FDLDTRGCSGG-HCLYGVQYGDGSYSIGFFAMDTLTLSSYD 280
Query: 193 GNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQ-LAAAGNVRKEF 251
FGCG R G G + G+LG G+ +SL Q G V F
Sbjct: 281 A-------VKGFRFGCGERNEGLFGEAA-----GLLGLGRGKTSLPVQTYDKYGGV---F 325
Query: 252 AHCLDVVKGGGIFAIGDVVSPKVK----TTPMVP-NMP-HYNVILEEVEVGGNPLDLPTS 305
AHCL G + SP TTPM+ N P Y V + + VGG L +P S
Sbjct: 326 AHCLPARSSGTGYLDFGPGSPAAAGARLTTPMLTDNGPTFYYVGMTGIRVGGQLLSIPQS 385
Query: 306 LLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFRFWIAS 344
+ T GTI+DSGT + LPP Y + S F +A+
Sbjct: 386 VFATA---GTIVDSGTVITRLPPPAYSSLRSAFVSAMAA 421
>gi|297720195|ref|NP_001172459.1| Os01g0608366 [Oryza sativa Japonica Group]
gi|53792202|dbj|BAD52835.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
Group]
gi|255673454|dbj|BAH91189.1| Os01g0608366 [Oryza sativa Japonica Group]
Length = 452
Score = 117 bits (294), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 89/269 (33%), Positives = 128/269 (47%), Gaps = 31/269 (11%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGC-SRCPTKSDLGIKLTLFDPSKSSTSGEI 141
Y VGLG+P V +DTGSD+ WV C C + P + G LFDP+ SST
Sbjct: 108 YVISVGLGSPAVTQRVVIDTGSDVSWVQCEPCPAPSPCHAHAG---ALFDPAASSTYAAF 164
Query: 142 ACSDNFCRTTYNN-RYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPL 200
CS C ++ C RC+Y+V YGDGS+T+G + D++ L+ + +
Sbjct: 165 NCSAAACAQLGDSGEANGCDAKSRCQYIVKYGDGSNTTGTYSSDVLTLSGSD-------V 217
Query: 201 NSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKG 260
FGC + +LG+ D DG++G G S +SQ AA K F +CL
Sbjct: 218 VRGFQFGCSH---AELGAGMDDKTDGLIGLGGDAQSPVSQTAA--RYGKSFFYCLPATPA 272
Query: 261 GGIF-------AIGDVVSPKVKTTPMV--PNMP-HYNVILEEVEVGGNPLDLPTSLLGTG 310
F + G + + TTPM+ +P +Y LE++ VGG L L S+
Sbjct: 273 SSGFLTLGAPASGGGGGASRFATTPMLRSKKVPTYYFAALEDIAVGGKKLGLSPSVFAA- 331
Query: 311 DERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
G+++DSGT + LPP Y + S FR
Sbjct: 332 ---GSLVDSGTVITRLPPAAYAALSSAFR 357
>gi|326491519|dbj|BAJ94237.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326524456|dbj|BAK00611.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 499
Score = 117 bits (293), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 92/270 (34%), Positives = 129/270 (47%), Gaps = 32/270 (11%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGC-SRCPTKSDLGIKLTLFDP 132
+G +TG Y VGLGTP +Y V DTGSD WV C C +C + K LFDP
Sbjct: 154 SGRAVSTGNYVVTVGLGTPASKYTVVFDTGSDTTWVQCRPCVVKCYKQ-----KEPLFDP 208
Query: 133 SKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQAS 192
+KSST ++C+D+ C N C+ G C Y V YGDGS T G+F +D + + A
Sbjct: 209 AKSSTYANVSCTDSACADLDTN---GCTGG-HCLYAVQYGDGSYTVGFFAQDTLTI--AH 262
Query: 193 GNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFA 252
+K FGCG + +G G + G++G G+ +SL Q A FA
Sbjct: 263 DAIK------GFRFGCGEKNNGLFGKTA-----GLMGLGRGKTSLTVQ--AYNKYGGAFA 309
Query: 253 HCLDVV-KGGGIFAIGD-VVSPKVKTTPMVPN--MPHYNVILEEVEVGGNPLDLPTSLLG 308
+CL + G G G + TPM+ + Y V + + VGG + + S+
Sbjct: 310 YCLPALTTGTGYLDFGPGSAGNNARLTPMLTDKGQTFYYVGMTGIRVGGQQVPVAESVFS 369
Query: 309 TGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
T GT++DSGT + LP Y + S F
Sbjct: 370 TA---GTLVDSGTVITRLPATAYTALSSAF 396
>gi|225216949|gb|ACN85242.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
Length = 519
Score = 117 bits (293), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 95/265 (35%), Positives = 123/265 (46%), Gaps = 33/265 (12%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPS 133
+G TG Y VGLGTP Y V DTGSD WV C C + + LFDP+
Sbjct: 171 SGRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQ----REKLFDPA 226
Query: 134 KSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASG 193
+SST ++C+ C + CS G C Y V YGDGS + G+F D + L+
Sbjct: 227 RSSTYANVSCAAPACS---DLNIHGCSGG-HCLYGVQYGDGSYSIGFFAMDTLTLSSYDA 282
Query: 194 NLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQ-LAAAGNVRKEFA 252
FGCG R G G + G+LG G+ +SL Q G V FA
Sbjct: 283 -------VKGFRFGCGERNEGLFGEAA-----GLLGLGRGKTSLPVQTYDKYGGV---FA 327
Query: 253 HCLDVVKGGG---IFAIGDVVSPKVK-TTPMV-PNMP-HYNVILEEVEVGGNPLDLPTSL 306
HCL G F G + + + + TTPM+ N P Y V + + VGG L +P S+
Sbjct: 328 HCLPARSTGTGYLDFGAGSLAAARARLTTPMLTENGPTFYYVGMTGIRVGGQLLSIPQSV 387
Query: 307 LGTGDERGTIIDSGTTLAYLPPMLY 331
T GTI+DSGT + LPP Y
Sbjct: 388 FATA---GTIVDSGTVITRLPPAAY 409
>gi|356567798|ref|XP_003552102.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 520
Score = 117 bits (293), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 97/313 (30%), Positives = 150/313 (47%), Gaps = 41/313 (13%)
Query: 35 VENKFKAGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTD 94
+ K K GG R + L HG S+ + G L++T + +GTP+
Sbjct: 63 LRRKIKVGGARYQLLFP-------SHGSKTMSLGNDFGW--------LHYTWIDIGTPST 107
Query: 95 EYYVQVDTGSDLLWVNCAGCSRCPT-----KSDLGIKLTLFDPSKSSTSGEIACSDNFCR 149
+ V +D GSDLLW+ C C +C S+L L + PS+S +S ++CS C
Sbjct: 108 SFLVALDAGSDLLWIPC-DCVQCAPLSSSYYSNLDRDLNEYSPSRSLSSKHLSCSHQLCD 166
Query: 150 TTYNNRYPSCSPGVRCEYVVTY-GDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGC 208
N + S +C Y+V+Y + +S+SG V DI+ L Q+ G+L + + + V+ GC
Sbjct: 167 KGSNCK----SSQQQCPYMVSYLSENTSSSGLLVEDILHL-QSGGSLSNSSVQAPVVLGC 221
Query: 209 GNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGD 268
G +QSG G A DG+LG G SS+ S LA +G + F+ C + G IF GD
Sbjct: 222 GMKQSG--GYLDGVAPDGLLGLGPGESSVPSFLAKSGLIHDSFSLCFNEDDSGRIF-FGD 278
Query: 269 VVSPKVKTTPMVPNMPHYNVILEEVE---VGGNPLDLPTSLLGTGDERGTIIDSGTTLAY 325
++T +P Y+ + VE VG + L + TS +DSGT+ +
Sbjct: 279 QGPTIQQSTSFLPLDGLYSTYIIGVESCCVGNSCLKM-TSF-------KVQVDSGTSFTF 330
Query: 326 LPPMLYDLVLSQF 338
LP +Y + +F
Sbjct: 331 LPGHVYGAIAEEF 343
>gi|224133616|ref|XP_002327639.1| predicted protein [Populus trichocarpa]
gi|222836724|gb|EEE75117.1| predicted protein [Populus trichocarpa]
Length = 484
Score = 117 bits (293), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 87/264 (32%), Positives = 130/264 (49%), Gaps = 24/264 (9%)
Query: 82 LYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRC--PTKSDLG-IKLTLFDPSKSSTS 138
L++ V +GTP+ + V +DTGS+LLW+ C CS C +S G + L ++ P+ SSTS
Sbjct: 61 LHYANVSVGTPSVSFLVALDTGSNLLWLPC-DCSSCVHSLRSPSGTVDLNIYSPNTSSTS 119
Query: 139 GEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTY-GDGSSTSGYFVRDIIQLNQASGNLKT 197
++ C+ C T +R P S C Y V Y +G+ST+GY V+D++ L S + ++
Sbjct: 120 EKVPCNSTLCSQTQRDRCP--SDQSNCPYQVVYLSNGTSTTGYIVQDLLHL--ISDDSQS 175
Query: 198 APLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDV 257
+++ + FGCG Q+G T A +G+ G G +N S+ S LA G F+ C
Sbjct: 176 KAVDAKITFGCGKVQTGSF--LTGGAPNGLFGLGMSNISVPSTLAHNGYTSGSFSMCFS- 232
Query: 258 VKGGGIFAIGDVVSPKVKTTPMVPNMPH---YNVILEEVEVGGNPLDLPTSLLGTGDERG 314
G G + GD S T P YN+ + + +GG DL S
Sbjct: 233 PNGIGRISFGDKGSTGQGETSFNQGQPRSSLYNISITQTSIGGQASDLVYS--------- 283
Query: 315 TIIDSGTTLAYLPPMLYDLVLSQF 338
I DSGT+ YL Y L+ F
Sbjct: 284 AIFDSGTSFTYLNDPAYTLIAESF 307
>gi|449459186|ref|XP_004147327.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 418
Score = 117 bits (293), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 92/297 (30%), Positives = 138/297 (46%), Gaps = 33/297 (11%)
Query: 44 ERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTG 103
ER+R + ++ T +SI L L GN +P+ G Y + +G P Y++ DTG
Sbjct: 23 ERKRPILSVP---TASSSFASSSIVLPLQGNVYPN--GFYNVTLYVGQPPKPYFLDPDTG 77
Query: 104 SDLLWVNC-AGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPG 162
SDL W+ C A C +C P ++ + C D C + +++ C
Sbjct: 78 SDLTWLQCDAPCQQC---------TETLHPLYQPSNDLVPCKDPLCMSLHSSMDHRCENP 128
Query: 163 VRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDA 222
+C+Y V Y DG S+ G VRD+ LN +G+ P+ + GCG Q D GSS+
Sbjct: 129 DQCDYEVEYADGGSSLGVLVRDVFPLNLTNGD----PIRPRLALGCGYDQ--DPGSSSYH 182
Query: 223 AVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGD-VVSP-KVKTTPMV 280
+DGILG G+ S++SQL G VR HC + KGGG GD + P ++ TPM
Sbjct: 183 PMDGILGLGRGAVSIVSQLHNQGIVRNVVGHCFN-SKGGGYLFFGDGIYDPYRLVWTPMS 241
Query: 281 PNMP-HYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLS 336
+ P HY+ E+ G L + + DSG++ Y Y ++ S
Sbjct: 242 RDYPKHYSPGFGELIFNGRSTGLRNLFV--------VFDSGSSYTYFNAQAYQVLTS 290
>gi|217426809|gb|ACK44517.1| AT5G10080-like protein [Arabidopsis arenosa]
Length = 506
Score = 117 bits (293), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 92/264 (34%), Positives = 133/264 (50%), Gaps = 26/264 (9%)
Query: 82 LYFTKVGLGTPTDEYYVQVDTGSDLLWV--NCAGCSRCPTK--SDLGIK-LTLFDPSKSS 136
L++T + +GTP+ + V +DTGSDLLW+ NC C+ + S L K L ++PS SS
Sbjct: 99 LHYTWIDIGTPSVSFLVALDTGSDLLWIPCNCVQCAPLTSTYYSSLATKDLNEYNPSSSS 158
Query: 137 TSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDG-SSTSGYFVRDIIQLNQASGNL 195
TS CS C + + SP +C Y V Y G +S+SG V DI+ L + N
Sbjct: 159 TSKVFLCSHKLCDSASDCE----SPKEQCPYTVNYLSGNTSSSGLLVEDILHLTYNTNNR 214
Query: 196 K---TAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFA 252
++ + + V+ GCG +QSGD A DG++G G A S+ S L+ AG +R F+
Sbjct: 215 LMNGSSSVKARVVIGCGKKQSGDYLDG--VAPDGLMGLGPAEISVPSFLSKAGLMRNSFS 272
Query: 253 HCLDVVKGGGIFAIGDVVSPKVKTTPM--VPNMPHYNVILEEVEVGGNPLDLPTSLLGTG 310
C D G I+ GD+ ++TP + N Y V +E +G + L TS
Sbjct: 273 LCFDEEDSGRIY-FGDMGPSIQQSTPFLQLENNSGYIVGVEACCIGNSCLK-QTSF---- 326
Query: 311 DERGTIIDSGTTLAYLPPMLYDLV 334
T IDSG + YLP +Y V
Sbjct: 327 ---TTFIDSGQSFTYLPEEIYRKV 347
>gi|326520736|dbj|BAJ92731.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 499
Score = 117 bits (292), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 92/270 (34%), Positives = 129/270 (47%), Gaps = 32/270 (11%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGC-SRCPTKSDLGIKLTLFDP 132
+G +TG Y VGLGTP +Y V DTGSD WV C C +C + K LFDP
Sbjct: 154 SGRAVSTGNYVVTVGLGTPASKYTVVFDTGSDTTWVQCRPCVVKCYKQ-----KGPLFDP 208
Query: 133 SKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQAS 192
+KSST ++C+D+ C N C+ G C Y V YGDGS T G+F +D + + A
Sbjct: 209 AKSSTYANVSCTDSACADLDTN---GCTGG-HCLYAVQYGDGSYTVGFFAQDTLTI--AH 262
Query: 193 GNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFA 252
+K FGCG + +G G + G++G G+ +SL Q A FA
Sbjct: 263 DAIK------GFRFGCGEKNNGLFGKTA-----GLMGLGRGKTSLTVQ--AYNKYGGAFA 309
Query: 253 HCLDVV-KGGGIFAIGD-VVSPKVKTTPMVPN--MPHYNVILEEVEVGGNPLDLPTSLLG 308
+CL + G G G + TPM+ + Y V + + VGG + + S+
Sbjct: 310 YCLPALTTGTGYLDFGPGSAGNNARLTPMLTDKGQTFYYVGMTGIRVGGQQVPVAESVFS 369
Query: 309 TGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
T GT++DSGT + LP Y + S F
Sbjct: 370 TA---GTLVDSGTVITRLPATAYTALSSAF 396
>gi|225438908|ref|XP_002279194.1| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
Length = 634
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 94/287 (32%), Positives = 137/287 (47%), Gaps = 35/287 (12%)
Query: 47 RTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDL 106
RTLS ++H R A+ + L + P G Y T++ +GTP + + VDTGS L
Sbjct: 58 RTLSHSRRHLQRSESHSTATARMPLYDDLIP--YGYYTTRIWIGTPPQTFALIVDTGSTL 115
Query: 107 LWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSC-SPGVRC 165
+V C+ C +C D F P SST + CS C +C S + C
Sbjct: 116 TYVPCSTCEQCGKHQDPN-----FQPDWSSTYQPLKCSME-C---------TCDSEMMHC 160
Query: 166 EYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVD 225
Y Y + SS+SG DI+ + S LK +FGC N ++GD+ S D
Sbjct: 161 VYDRQYAEMSSSSGVLGEDIVSFGKQS-ELKP----QRTVFGCENVETGDIYSQR---AD 212
Query: 226 GILGFGQANSSLLSQLAAAGNVRKEFAHC---LDVVKGGGIFAIGDVVSPK--VKTTPMV 280
GI+G G+ + S++ QL G + F+ C +DV GGG +G + P V T
Sbjct: 213 GIMGLGRGDLSIVDQLVEKGVIGNSFSLCYGGMDV--GGGAMVLGGISPPAGMVFTHSDP 270
Query: 281 PNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLP 327
+YN+ L+E+ + G LP + + + GTI+DSGTT AYLP
Sbjct: 271 ARSAYYNIDLKEIHIAGK--QLPINPMVFDGKYGTILDSGTTYAYLP 315
>gi|296087361|emb|CBI33735.3| unnamed protein product [Vitis vinifera]
Length = 633
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 94/287 (32%), Positives = 137/287 (47%), Gaps = 35/287 (12%)
Query: 47 RTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDL 106
RTLS ++H R A+ + L + P G Y T++ +GTP + + VDTGS L
Sbjct: 58 RTLSHSRRHLQRSESHSTATARMPLYDDLIP--YGYYTTRIWIGTPPQTFALIVDTGSTL 115
Query: 107 LWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSC-SPGVRC 165
+V C+ C +C D F P SST + CS C +C S + C
Sbjct: 116 TYVPCSTCEQCGKHQDPN-----FQPDWSSTYQPLKCSME-C---------TCDSEMMHC 160
Query: 166 EYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVD 225
Y Y + SS+SG DI+ + S LK +FGC N ++GD+ S D
Sbjct: 161 VYDRQYAEMSSSSGVLGEDIVSFGKQS-ELKP----QRTVFGCENVETGDIYSQR---AD 212
Query: 226 GILGFGQANSSLLSQLAAAGNVRKEFAHC---LDVVKGGGIFAIGDVVSPK--VKTTPMV 280
GI+G G+ + S++ QL G + F+ C +DV GGG +G + P V T
Sbjct: 213 GIMGLGRGDLSIVDQLVEKGVIGNSFSLCYGGMDV--GGGAMVLGGISPPAGMVFTHSDP 270
Query: 281 PNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLP 327
+YN+ L+E+ + G LP + + + GTI+DSGTT AYLP
Sbjct: 271 ARSAYYNIDLKEIHIAGK--QLPINPMVFDGKYGTILDSGTTYAYLP 315
>gi|449523529|ref|XP_004168776.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 461
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 88/272 (32%), Positives = 139/272 (51%), Gaps = 35/272 (12%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGE 140
G + K+ +G+P + +DTGSDL+W C C +C +S +FDP +SS+ +
Sbjct: 109 GEFLMKLAIGSPPRSFSAIMDTGSDLIWTQCKPCQQCFDQST-----PIFDPKQSSSFYK 163
Query: 141 IACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPL 200
I+CS C + +CS CEY+ TYGD SST G + ++ + + P
Sbjct: 164 ISCSSELCGALPTS---TCSSD-GCEYLYTYGDSSSTQGVLAFETFTFGDSTEDQISIP- 218
Query: 201 NSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKG 260
+ FGCGN +GD G S A G++G G+ SL+SQL ++FA+CL +
Sbjct: 219 --GLGFGCGNDNNGD-GFSQGA---GLVGLGRGPLSLVSQLK-----EQKFAYCLTAIDD 267
Query: 261 G--GIFAIGDV--VSPK-----VKTTPMV--PNMP-HYNVILEEVEVGGNPLDLPTSLLG 308
+G + ++PK +KTTP++ P+ P Y + L+ + VGG L +P S
Sbjct: 268 SKPSSLLLGSLANITPKTSKDEMKTTPLIKNPSQPSFYYLSLQGISVGGTQLSIPKSTFE 327
Query: 309 TGDE--RGTIIDSGTTLAYLPPMLYDLVLSQF 338
D+ G IIDSGTT+ Y+ + + ++F
Sbjct: 328 LHDDGSGGVIIDSGTTITYVENSAFTSLKNEF 359
>gi|326511104|dbj|BAK03251.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 631
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 92/287 (32%), Positives = 133/287 (46%), Gaps = 47/287 (16%)
Query: 71 LGGNGHPSA----------TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTK 120
L G PSA G Y T++ +GTP E+ + VD+GS + +V CA C +C
Sbjct: 66 LAEGGRPSARMRLHDDLLTNGYYTTRLHIGTPPQEFALIVDSGSTVTYVPCASCEQCGNH 125
Query: 121 SDLGIKLTLFDPSKSSTSGEIACS-DNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSG 179
D F P SST + C+ D C + N +C Y Y + SS+SG
Sbjct: 126 QD-----PRFQPDLSSTYSPVKCNVDCTCDSDKN----------QCTYERQYAEMSSSSG 170
Query: 180 YFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLS 239
DI+ S LK +FGC N ++GDL S DGI+G G+ S++
Sbjct: 171 VLGEDIVSFGTES-ELKP----QRAVFGCENSETGDLFSQ---HADGIMGLGRGQLSIMD 222
Query: 240 QLAAAGNVRKEFAHC---LDVVKGGGIFAIGDVVSPK--VKTTPMVPNMPHYNVILEEVE 294
QL G + F+ C +D+ GGG +G + +P + T P+YN+ L+E+
Sbjct: 223 QLVDKGVIGDSFSMCYGGMDI--GGGAMVLGAMPAPPGMIYTHSNAVRSPYYNIELKEMH 280
Query: 295 VGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLY----DLVLSQ 337
V G L + + + GT++DSGTT AYLP + D V SQ
Sbjct: 281 VAGKALRVDPRIF--DGKHGTVLDSGTTYAYLPEQAFVAFKDAVSSQ 325
>gi|242082978|ref|XP_002441914.1| hypothetical protein SORBIDRAFT_08g004800 [Sorghum bicolor]
gi|241942607|gb|EES15752.1| hypothetical protein SORBIDRAFT_08g004800 [Sorghum bicolor]
Length = 429
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 86/282 (30%), Positives = 135/282 (47%), Gaps = 31/282 (10%)
Query: 65 ASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLG 124
+S L G+ +P GLY+ + +G P Y++ VDTGSDL W+ C P +S
Sbjct: 50 SSAVFPLYGDVYPH--GLYYVAMNIGNPPKPYFLDVDTGSDLTWLQCDA----PCRSCNK 103
Query: 125 IKLTLFDPSKSSTSGEIACSDNFCRTTYN--NRYPSC-SPGVRCEYVVTYGDGSSTSGYF 181
+ L+ P+K+ + C D C + +N NR C SP +C+YV+ Y D S++G
Sbjct: 104 VPHPLYRPTKNKL---VPCVDQLCASLHNGLNRKHKCDSPYEQCDYVIKYADQGSSTGVL 160
Query: 182 VRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQL 241
V D L A+G++ + S+ FGCG Q + S + DG+LG G + SLLSQ
Sbjct: 161 VNDSFALRLANGSV----VRPSLAFGCGYDQQ--VSSGEMSPTDGVLGLGTGSVSLLSQF 214
Query: 242 AAAGNVRKEFAHCLDVVKGGGIFAIGDVVSP--KVKTTPMV--PNMPHYNVILEEVEVGG 297
G + HCL ++GGG GD + P +V TPMV P +Y+ + G
Sbjct: 215 KQHGVTKNVVGHCLS-LRGGGFLFFGDDLVPYQRVTWTPMVRSPLRNYYSPGSASLYFGD 273
Query: 298 NPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
L + + + + DSG++ Y Y +++ +
Sbjct: 274 QSLRVKLTEV--------VFDSGSSFTYFAAQPYQALVTALK 307
>gi|255541790|ref|XP_002511959.1| protein with unknown function [Ricinus communis]
gi|223549139|gb|EEF50628.1| protein with unknown function [Ricinus communis]
Length = 583
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 89/278 (32%), Positives = 132/278 (47%), Gaps = 33/278 (11%)
Query: 73 GNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC-AGCSRCPTKSDLGIKLTLFD 131
GN +P GLYFT + +G P YY+ +DT SDL W+ C A C+ C ++ L+
Sbjct: 200 GNVYPD--GLYFTYILVGNPPRPYYLDIDTASDLTWIQCDAPCTSCAKGAN-----ALYK 252
Query: 132 PSKSSTSGEIACSDNFCRTTYNNRYP-SCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQ 190
P + + + D+ C + N+ C +C+Y + Y D SS+ G RD + L
Sbjct: 253 PRRDNI---VTPKDSLCVELHRNQKAGYCETCQQCDYEIEYADHSSSMGVLARDELHLTM 309
Query: 191 ASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKE 250
A+G+ N FGC Q G L +T DGILG +A SL SQLA G +
Sbjct: 310 ANGSSTNLKFN----FGCAYDQQG-LLLNTLVKTDGILGLSKAKVSLPSQLANRGIINNV 364
Query: 251 FAHCL--DVVKGGGIFAIGDVVSPK--VKTTPMV--PNMPHYNVILEEVEVGGNPLDLPT 304
HCL DVV GG +F +GD P+ + PM+ P++ Y + ++ G PL L
Sbjct: 365 VGHCLANDVVGGGYMF-LGDDFVPRWGMSWVPMLDSPSIDSYQTQIMKLNYGSGPLSL-- 421
Query: 305 SLLGTGDERGT---IIDSGTTLAYLPPMLYDLVLSQFR 339
G ER + DSG++ Y Y +++ +
Sbjct: 422 ----GGQERRVRRIVFDSGSSYTYFTKEAYSELVASLK 455
>gi|326501422|dbj|BAK02500.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 631
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 87/267 (32%), Positives = 128/267 (47%), Gaps = 37/267 (13%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGE 140
G Y T++ +GTP E+ + VD+GS + +V CA C +C D F P SST
Sbjct: 86 GYYTTRLHIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQD-----PRFQPDLSSTYSP 140
Query: 141 IACS-DNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAP 199
+ C+ D C + N +C Y Y + SS+SG DI+ S LK
Sbjct: 141 VKCNVDCTCDSDKN----------QCTYERQYAEMSSSSGVLGEDIVSFGTES-ELKP-- 187
Query: 200 LNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHC---LD 256
+FGC N ++GDL S DGI+G G+ S++ QL G + F+ C +D
Sbjct: 188 --QRAVFGCENSETGDLFSQ---HADGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMD 242
Query: 257 VVKGGGIFAIGDVVSPK--VKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERG 314
+ GGG +G + +P + T P+YN+ L+E+ V G L + + + G
Sbjct: 243 I--GGGAMVLGAMPAPPGMIYTHSNAVRSPYYNIELKEMHVAGKALRVDPRIF--DGKHG 298
Query: 315 TIIDSGTTLAYLPPMLY----DLVLSQ 337
T++DSGTT AYLP + D V SQ
Sbjct: 299 TVLDSGTTYAYLPEQAFVAFKDAVSSQ 325
>gi|168040957|ref|XP_001772959.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162675692|gb|EDQ62184.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 351
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 91/276 (32%), Positives = 135/276 (48%), Gaps = 38/276 (13%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSG 139
+G Y ++ LGTP ++ VDTGSDL WV CA C+RC + D LF P SS+
Sbjct: 5 SGEYVLQISLGTPPQQFSAIVDTGSDLCWVQCAPCARCFEQPD-----PLFIPLASSSYS 59
Query: 140 EIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAP 199
+C+D+ C P+CS C Y +YGDGS+T G F + + LN ++
Sbjct: 60 NASCTDSLCDAL---PRPTCSMRNTCTYSYSYGDGSNTRGDFAFETVTLNGSTL------ 110
Query: 200 LNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVK 259
+ + FGCG+ Q G T A DG++G GQ SL SQL ++ F++CL
Sbjct: 111 --ARIGFGCGHNQEG-----TFAGADGLIGLGQGPLSLPSQLNSS--FTHIFSYCLVDQS 161
Query: 260 GGGIFA---IGDVV-SPKVKTTPMVPNM---PHYNVILEEVEVGGNPLDLPTSLL----- 307
G F+ G+ + + TP++ N +Y V +E + VG + P S
Sbjct: 162 TTGTFSPITFGNAAENSRASFTPLLQNEDNPSYYYVGVESISVGNRRVPTPPSAFRIDAN 221
Query: 308 GTGDERGTIIDSGTTLAYLPPMLYDLVLSQFRFWIA 343
G G G I+DSGTT+ Y + +L++ R I+
Sbjct: 222 GVG---GVILDSGTTITYWRLAAFIPILAELRRQIS 254
>gi|356502091|ref|XP_003519855.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 519
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 96/303 (31%), Positives = 139/303 (45%), Gaps = 32/303 (10%)
Query: 50 SALKQHDTRRHGRMMASIDLELG---GNG--HPSATG-LYFTKVGLGTPTDEYYVQVDTG 103
+ L D GR ++ ID L GN S+ G L++T V +GTP ++ V +DTG
Sbjct: 61 AELADRDRLLRGRKLSQIDAGLAFSDGNSTFRISSLGFLHYTTVQIGTPGVKFMVALDTG 120
Query: 104 SDLLWVNCAGCSRCPTKSDLG----IKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSC 159
SDL WV C C+RC L +++P+ SSTS ++ C+++ C +R
Sbjct: 121 SDLFWVPC-DCTRCAASDSTAFASDFDLNVYNPNGSSTSKKVTCNNSLC----THRSQCL 175
Query: 160 SPGVRCEYVVTYGDG-SSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGS 218
C Y+V+Y +STSG V D++ L Q + N VIFGCG QSG
Sbjct: 176 GTFSNCPYMVSYVSAETSTSGILVEDVLHLTQEDNHHDLVEAN--VIFGCGQIQSGSFLD 233
Query: 219 STDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSPKVKTTP 278
AA +G+ G G S+ S L+ G F+ C G G + GD S TP
Sbjct: 234 V--AAPNGLFGLGMEKISVPSMLSREGFTADSFSMCFG-RDGIGRISFGDKGSFDQDETP 290
Query: 279 --MVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLS 336
+ P+ P YN+ + +V VG +D+ E + DSGT+ YL Y +
Sbjct: 291 FNLNPSHPTYNITVTQVRVGTTVIDV---------EFTALFDSGTSFTYLVDPTYTRLTE 341
Query: 337 QFR 339
F
Sbjct: 342 SFH 344
>gi|449453902|ref|XP_004144695.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-1-like, partial [Cucumis sativus]
Length = 716
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 88/272 (32%), Positives = 139/272 (51%), Gaps = 35/272 (12%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGE 140
G + K+ +G+P + +DTGSDL+W C C +C +S +FDP +SS+ +
Sbjct: 364 GEFLMKLAIGSPPRSFSAIMDTGSDLIWTQCKPCQQCFDQS-----TPIFDPKQSSSFYK 418
Query: 141 IACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPL 200
I+CS C + +CS CEY+ TYGD SST G + ++ + + P
Sbjct: 419 ISCSSELCGALPTS---TCSSD-GCEYLYTYGDSSSTQGVLAFETFTFGDSTEDQISIP- 473
Query: 201 NSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKG 260
+ FGCGN +GD G S A G++G G+ SL+SQL ++FA+CL +
Sbjct: 474 --GLGFGCGNDNNGD-GFSQGA---GLVGLGRGPLSLVSQLK-----EQKFAYCLTAIDD 522
Query: 261 G--GIFAIGDV--VSPK-----VKTTPMV--PNMP-HYNVILEEVEVGGNPLDLPTSLLG 308
+G + ++PK +KTTP++ P+ P Y + L+ + VGG L +P S
Sbjct: 523 SKPSSLLLGSLANITPKTSKDEMKTTPLIKNPSQPSFYYLSLQGISVGGTQLSIPKSTFE 582
Query: 309 TGDE--RGTIIDSGTTLAYLPPMLYDLVLSQF 338
D+ G IIDSGTT+ Y+ + + ++F
Sbjct: 583 LHDDGSGGVIIDSGTTITYVENSAFTSLKNEF 614
>gi|414885970|tpg|DAA61984.1| TPA: hypothetical protein ZEAMMB73_915310 [Zea mays]
Length = 523
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 89/264 (33%), Positives = 125/264 (47%), Gaps = 32/264 (12%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSG 139
T Y VGLGTP + V DTGSDL WV C C+ C + D LFDPS+S+T
Sbjct: 185 TANYIVSVGLGTPRRDLLVVFDTGSDLSWVQCKPCNNCYKQHD-----PLFDPSQSTTYS 239
Query: 140 EIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAP 199
+ C C + +CS G +C Y V YGD S T G RD + L +S L+
Sbjct: 240 AVPCGAQECLDS-----GTCSSG-KCRYEVVYGDMSQTDGNLARDTLTLGPSSDQLQ--- 290
Query: 200 LNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL-DVV 258
+FGCG+ +G G + DG+ G G+ SL SQ AA F++CL
Sbjct: 291 ---GFVFGCGDDDTGLFGRA-----DGLFGLGRDRVSLASQ--AAARYGAGFSYCLPSSW 340
Query: 259 KGGGIFAIGDVVS-PKVKTTPMV--PNMPH-YNVILEEVEVGGNPLDLPTSLLGTGDERG 314
+ G ++G + P + T MV + P Y + L ++V G + + ++ G
Sbjct: 341 RAEGYLSLGSAAAPPHAQFTAMVTRSDTPSFYYLDLVGIKVAGRTVRVAPAVF---KAPG 397
Query: 315 TIIDSGTTLAYLPPMLYDLVLSQF 338
T+IDSGT + LP Y + S F
Sbjct: 398 TVIDSGTVITRLPSRAYSALRSSF 421
>gi|326523463|dbj|BAJ92902.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 633
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 56/111 (50%), Positives = 71/111 (63%), Gaps = 1/111 (0%)
Query: 32 VFEVENKFKAGGERERTLSALKQHDTRRHGR-MMASIDLELGGNGHPSATGLYFTKVGLG 90
VFEV KF + L+ L+ HD RRHGR + A++DL LGGN P TGLYFT++G+G
Sbjct: 86 VFEVRRKFPCHDGSGKHLANLRAHDARRHGRSLAAAVDLPLGGNALPYETGLYFTQIGIG 145
Query: 91 TPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEI 141
TP YYVQVDT SD+ WVNC C CP KS LG+ +L P + S ++
Sbjct: 146 TPAKSYYVQVDTSSDIFWVNCVFCDTCPRKSGLGVLPSLPFPLQLLCSADL 196
>gi|115468912|ref|NP_001058055.1| Os06g0610800 [Oryza sativa Japonica Group]
gi|51091964|dbj|BAD35493.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|113596095|dbj|BAF19969.1| Os06g0610800 [Oryza sativa Japonica Group]
gi|218198535|gb|EEC80962.1| hypothetical protein OsI_23679 [Oryza sativa Indica Group]
Length = 519
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 94/265 (35%), Positives = 117/265 (44%), Gaps = 28/265 (10%)
Query: 75 GHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSK 134
G TG Y VGLGTP Y V DTGSD WV C C + + LFDP+
Sbjct: 175 GRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQ----REKLFDPAS 230
Query: 135 SSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGN 194
SST ++C+ C + CS G C Y V YGDGS + G+F D + L+
Sbjct: 231 SSTYANVSCAAPACS---DLDVSGCSGG-HCLYGVQYGDGSYSIGFFAMDTLTLSSYDA- 285
Query: 195 LKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHC 254
FGCG R G G + G+LG G+ +SL Q G FAHC
Sbjct: 286 ------VKGFRFGCGERNDGLFGEAA-----GLLGLGRGKTSLPVQ--TYGKYGGVFAHC 332
Query: 255 LDVVK-GGGIFAIGDVVSPKVKTTPMVP-NMP-HYNVILEEVEVGGNPLDLPTSLLGTGD 311
L G G G P TTPM+ N P Y V + + VGG L + S+
Sbjct: 333 LPARSTGTGYLDFGAGSPPATTTTPMLTGNGPTFYYVGMTGIRVGGRLLPIAPSVFAAA- 391
Query: 312 ERGTIIDSGTTLAYLPPMLYDLVLS 336
GTI+DSGT + LPP Y + S
Sbjct: 392 --GTIVDSGTVITRLPPAAYSSLRS 414
>gi|413944392|gb|AFW77041.1| hypothetical protein ZEAMMB73_800604 [Zea mays]
Length = 476
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 91/266 (34%), Positives = 125/266 (46%), Gaps = 34/266 (12%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCS-RCPTKSDLGIKLTLFDPSKSSTSGEI 141
+ VG GTP Y V DTGSD+ W+ C CS C + D +FDP+KS+T +
Sbjct: 135 FVVTVGFGTPAQTYTVIFDTGSDVSWIQCLPCSGHCYKQHD-----PIFDPTKSATYSVV 189
Query: 142 ACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLN 201
C C ++ CS G C Y V YGDGSS++G + + L + + P
Sbjct: 190 PCGHPQCAAADGSK---CSNGT-CLYKVEYGDGSSSAGVLSHETLSLT----STRALP-- 239
Query: 202 SSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVK-G 260
FGCG GD G VDG++G G+ SL SQ AA + F++CL
Sbjct: 240 -GFAFGCGQTNLGDFGD-----VDGLIGLGRGQLSLSSQ--AAASFGGTFSYCLPSDNTT 291
Query: 261 GGIFAIGDVVSPK---VKTTPMVPNMPH---YNVILEEVEVGGNPLDLPTSLLGTGDERG 314
G IG V+ T MV + Y V L +++GG L +P +L + G
Sbjct: 292 HGYLTIGPTTPASNDDVQYTAMVQKQDYPSFYFVELVSIDIGGYILPVPPTLF---TDDG 348
Query: 315 TIIDSGTTLAYLPPMLYDLVLSQFRF 340
T +DSGT L YLPP Y + +F+F
Sbjct: 349 TFLDSGTILTYLPPEAYTALRDRFKF 374
>gi|356538031|ref|XP_003537508.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 521
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 95/313 (30%), Positives = 149/313 (47%), Gaps = 41/313 (13%)
Query: 35 VENKFKAGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTD 94
+ K K GG R + L HG S+ + G L++T + +GTP+
Sbjct: 64 LRRKIKVGGTRYQLLFP-------SHGSKTMSLGNDFGW--------LHYTWIDIGTPST 108
Query: 95 EYYVQVDTGSDLLWVNCAGCSRCPT-----KSDLGIKLTLFDPSKSSTSGEIACSDNFCR 149
+ V +D GSDLLW+ C C +C S+L L + PS+S +S ++CS C
Sbjct: 109 SFLVALDAGSDLLWIPC-DCVQCAPLSSSYYSNLDRDLNEYSPSRSLSSKHLSCSHRLCD 167
Query: 150 TTYNNRYPSCSPGVRCEYVVTY-GDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGC 208
N + S +C Y+V+Y + +S+SG V DI+ L Q+ G L + + + V+ GC
Sbjct: 168 KGSNCK----SSQQQCPYMVSYLSENTSSSGLLVEDILHL-QSGGTLSNSSVQAPVVLGC 222
Query: 209 GNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGD 268
G +QSG G A DG+LG G SS+ S LA +G + F+ C + G +F GD
Sbjct: 223 GMKQSG--GYLDGVAPDGLLGLGPGESSVPSFLAKSGLIHYSFSLCFNEDDSGRMF-FGD 279
Query: 269 VVSPKVKTTPMVPNMPHYNVILEEVE---VGGNPLDLPTSLLGTGDERGTIIDSGTTLAY 325
++T +P Y+ + VE +G + L + TS +DSGT+ +
Sbjct: 280 QGPTSQQSTSFLPLDGLYSTYIIGVESCCIGNSCLKM-TSFKAQ-------VDSGTSFTF 331
Query: 326 LPPMLYDLVLSQF 338
LP +Y + +F
Sbjct: 332 LPGHVYGAITEEF 344
>gi|225216879|gb|ACN85177.1| aspartic proteinase nepenthesin-1 precursor [Oryza nivara]
gi|225216896|gb|ACN85193.1| aspartic proteinase nepenthesin-1 precursor [Oryza rufipogon]
Length = 515
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 93/260 (35%), Positives = 116/260 (44%), Gaps = 28/260 (10%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSG 139
TG Y VGLGTP Y V DTGSD WV C C + + LFDP+ SST
Sbjct: 176 TGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQ----REKLFDPASSSTYA 231
Query: 140 EIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAP 199
++C+ C + CS G C Y V YGDGS + G+F D + L+
Sbjct: 232 NVSCAAPACS---DLDVSGCSGG-HCLYGVQYGDGSYSIGFFAMDTLTLSSYDA------ 281
Query: 200 LNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDV-V 258
FGCG R G G + G+LG G+ +SL Q G FAHCL
Sbjct: 282 -VKGFRFGCGERNDGLFGEAA-----GLLGLGRGKTSLPVQ--TYGKYGGVFAHCLPARS 333
Query: 259 KGGGIFAIGDVVSPKVKTTPMVP-NMP-HYNVILEEVEVGGNPLDLPTSLLGTGDERGTI 316
G G G P TTPM+ N P Y V + + VGG L + S+ GTI
Sbjct: 334 TGTGYLDFGAGSPPATTTTPMLTGNGPTFYYVGMTGIRVGGRLLPIAPSVFAAA---GTI 390
Query: 317 IDSGTTLAYLPPMLYDLVLS 336
+DSGT + LPP Y + S
Sbjct: 391 VDSGTVITRLPPAAYSSLRS 410
>gi|414587774|tpg|DAA38345.1| TPA: hypothetical protein ZEAMMB73_272638 [Zea mays]
Length = 520
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 81/263 (30%), Positives = 124/263 (47%), Gaps = 24/263 (9%)
Query: 82 LYFTKVGLGTPTDEYYVQVDTGSDLLWV--NCAGCSRCPTKSDLGIKLTLFDPSKSSTSG 139
L++ V +GTP + V +DTGSDL W+ C GC+ T + + T + P SSTS
Sbjct: 108 LHYALVTVGTPGQTFMVALDTGSDLFWLPCQCDGCTPPATAASGSFQATFYIPGMSSTSK 167
Query: 140 EIACSDNFCRTTYNNRYPSCSPGVRCEYVVTY-GDGSSTSGYFVRDIIQLNQASGNLKTA 198
+ C+ NFC + CS ++C Y + Y G+S+SG+ V D++ L ++ N
Sbjct: 168 AVPCNSNFC-----DLQKECSTALQCPYKMVYVSAGTSSSGFLVEDVLYL--STENAHPQ 220
Query: 199 PLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVV 258
L + ++ GCG Q+G + AA +G+ G G S+ S LA G F+ C
Sbjct: 221 ILKAQIMLGCGQTQTGSFLDA--AAPNGLFGLGIDEVSVPSILAQKGLTSNSFSMCFG-R 277
Query: 259 KGGGIFAIGDVVSPKVKTTPMVPNMPH--YNVILEEVEVGGNPLDLPTSLLGTGDERGTI 316
G G + GD S + TP+ N H Y + + + VG P D+ + TI
Sbjct: 278 DGIGRISFGDQESSDQEETPLDINRQHPTYAITISGITVGNKPTDM---------DFITI 328
Query: 317 IDSGTTLAYLPPMLYDLVLSQFR 339
D+GT+ YL Y + F
Sbjct: 329 FDTGTSFTYLADPAYTYITQSFH 351
>gi|449529194|ref|XP_004171586.1| PREDICTED: aspartic proteinase-like protein 1-like, partial
[Cucumis sativus]
Length = 417
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 86/268 (32%), Positives = 131/268 (48%), Gaps = 32/268 (11%)
Query: 82 LYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRC------PTKSDLGIKLTLFDPSKS 135
L++T V LGTP ++ V +DTGSDL WV C CSRC P SD +L+++ P KS
Sbjct: 3 LHYTTVQLGTPGTKFMVALDTGSDLFWVPC-DCSRCAPTEGSPYASDF--ELSVYSPKKS 59
Query: 136 STSGEIACSDNFCRTTYNNRYPSCSPGV-RCEYVVTYGDG-SSTSGYFVRDIIQLNQASG 193
STS + C+++ C + C+ C YVV+Y +ST+G + D++ L +
Sbjct: 60 STSKTVPCNNSLCA-----QRDQCTEAFGNCPYVVSYVSAETSTTGILIEDLLHLK--TE 112
Query: 194 NLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAH 253
N + P+ + + FGCG QSG AA +G+ G G S+ S L+ G + F+
Sbjct: 113 NKHSEPIQAYITFGCGQVQSGSFLDV--AAPNGLFGLGMEQISVPSILSREGLMANSFSM 170
Query: 254 CLDVVKGGGIFAIGDVVSPKVKTTPMVPNM--PHYNVILEEVEVGGNPLDLPTSLLGTGD 311
C G G GD S + + TP N P+YN+ + + VG +D + L
Sbjct: 171 CFS-DDGVGRINFGDKGSLEQEETPFNLNQLHPNYNITVTSIRVGTTLIDADITAL---- 225
Query: 312 ERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
DSGT+ +Y +Y + + F
Sbjct: 226 -----FDSGTSFSYFTDPIYSKLSASFH 248
>gi|147839328|emb|CAN63378.1| hypothetical protein VITISV_015700 [Vitis vinifera]
Length = 585
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 109/338 (32%), Positives = 156/338 (46%), Gaps = 51/338 (15%)
Query: 17 VHQWAVGGGGVMGNFVFEVENKFKAGGERER----TLSALKQHDTRRHGRMMASIDLEL- 71
V +W+ G G N F AG + + L D GR ++ ID L
Sbjct: 38 VKKWSEGAG-----------NGFPAGNWPAKGSFEYYAELAHRDRALRGRRLSDIDGLLT 86
Query: 72 --GGNG--HPSATG-LYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRC-PTK----- 120
GN S+ G L++T V LGTP ++ V +DTGSDL WV C CSRC PT+
Sbjct: 87 FSDGNSTFRISSLGFLHYTTVSLGTPGKKFLVALDTGSDLFWVPC-DCSRCAPTEGTTYA 145
Query: 121 SDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDG-SSTSG 179
SD +L++++P SSTS ++ C+++ C + NR C Y+V+Y +STSG
Sbjct: 146 SDF--ELSIYNPKGSSTSRKVTCNNSLC--AHRNR--CLGTFSNCPYMVSYVSAETSTSG 199
Query: 180 YFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLS 239
V D++ L + + + V FGCG Q+G AA +G+ G G S+ S
Sbjct: 200 ILVEDVLHLTTEDN--RQEFVEAYVTFGCGQVQTGSFLDI--AAPNGLFGLGLEKISVPS 255
Query: 240 QLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSPKVKTTPMVPNM--PHYNVILEEVEVGG 297
L+ G F+ C G G + GD P + TP N P YN+ + +V VG
Sbjct: 256 ILSKEGFTADSFSMCFG-PDGIGRISFGDKGGPDQEETPFNLNALHPTYNITVTQVRVGT 314
Query: 298 NPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVL 335
+DL + L DSGT+ YL +Y VL
Sbjct: 315 TLIDLDFTAL---------FDSGTSFTYLVDPIYTNVL 343
>gi|255568540|ref|XP_002525244.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223535541|gb|EEF37210.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 460
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 87/269 (32%), Positives = 130/269 (48%), Gaps = 35/269 (13%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGC-SRCPTKSDLGIKLTLFDPSKSSTS 138
+G Y+ K+GLG+P Y + +DTGS L W+ C C C ++ D LF+PS S+T
Sbjct: 117 SGNYYLKLGLGSPPKYYTMILDTGSSLSWLQCKPCVVYCHSQVD-----PLFEPSASNTY 171
Query: 139 GEIACSDNFCR----TTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGN 194
+ CS + C T N+ P C+ C Y +YGD S + GY RD++ L +
Sbjct: 172 RPLYCSSSECSLLKAATLND--PLCTASGVCVYTASYGDASYSMGYLSRDLLTLTPS--- 226
Query: 195 LKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHC 254
+T P S +GCG G G + GI+G + S+L+QL+ F++C
Sbjct: 227 -QTLP---SFTYGCGQDNEGLFGKAA-----GIVGLARDKLSMLAQLSP--KYGYAFSYC 275
Query: 255 L--DVVKGGGIFAIGDVVSPKVKTTPMVPNMPH---YNVILEEVEVGGNPLDLPTSLLGT 309
L GGG +IG + K TPM+ N + Y + L + V G P+ + +
Sbjct: 276 LPTSTSSGGGFLSIGKISPSSYKFTPMIRNSQNPSLYFLRLAAITVAGRPVGVAAA---- 331
Query: 310 GDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
G + TIIDSGT + LP +Y + F
Sbjct: 332 GYQVPTIIDSGTVVTRLPISIYAALREAF 360
>gi|414589630|tpg|DAA40201.1| TPA: hypothetical protein ZEAMMB73_629620 [Zea mays]
Length = 443
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 99/311 (31%), Positives = 144/311 (46%), Gaps = 56/311 (18%)
Query: 47 RTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDL 106
++L+AL D R+ L L +G Y ++G+GTPT Y +DTGSDL
Sbjct: 65 QSLAALAPGDAITAARI-----LVLASDGE------YLMEMGIGTPTRYYSAILDTGSDL 113
Query: 107 LWVNCAGCSRC---PTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGV 163
+W CA C C PT FDP++S+T + C+ C Y YP C V
Sbjct: 114 IWTQCAPCLLCVDQPTP--------YFDPARSATYRSLGCASPACNALY---YPLCYQKV 162
Query: 164 RCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAA 223
C Y YGD +ST+G + G +T + FGCGN +G L + +
Sbjct: 163 -CVYQYFYGDSASTAGVLANETFTF----GTNETRVSLPGISFGCGNLNAGSLANGS--- 214
Query: 224 VDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGG-------GIFAI---GDVVSPK 273
G++GFG+ + SL+SQL + F++CL G++A + S
Sbjct: 215 --GMVGFGRGSLSLVSQLGS-----PRFSYCLTSFLSPVPSRLYFGVYATLNSTNASSEP 267
Query: 274 VKTTPMV--PNMP-HYNVILEEVEVGGNPLDLPTSLLGTGDER---GTIIDSGTTLAYLP 327
V++TP V P +P Y + + + VGG L + ++ D GTIIDSGTT+ YL
Sbjct: 268 VQSTPFVVNPALPTMYFLNMTGISVGGYLLPIDPAVFAINDTDGTGGTIIDSGTTITYLA 327
Query: 328 PMLYDLVLSQF 338
YD V + F
Sbjct: 328 EPAYDAVRAAF 338
>gi|357143901|ref|XP_003573095.1| PREDICTED: aspartic proteinase-like protein 1-like [Brachypodium
distachyon]
Length = 627
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 93/269 (34%), Positives = 130/269 (48%), Gaps = 36/269 (13%)
Query: 82 LYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKS----DLGIKLTLFDPSKSST 137
LY+T V +GTP + V +DTGSDL W+ C C C S L L ++ P++S+T
Sbjct: 207 LYYTWVDVGTPNTSFMVALDTGSDLFWIPC-DCIECAPLSGYHGSLDRDLGIYKPAESTT 265
Query: 138 SGEIACSDNFC---RTTYNNRYPSCSPGVRCEYVVTY-GDGSSTSGYFVRDIIQLNQASG 193
S + CS C N + P C Y Y + +++SG V DI+ L+
Sbjct: 266 SRHLPCSHELCLLGSDCTNQKQP-------CPYNTKYLQENTTSSGLLVEDILHLDSRES 318
Query: 194 NLKTAPLNSSVIFGCGNRQSGDLGSSTDA-AVDGILGFGQANSSLLSQLAAAGNVRKEFA 252
+ AP+ +SVI GCG +QS GS D A DG+LG G A+ S+ S LA AG VR F+
Sbjct: 319 H---APVKASVIIGCGRKQS---GSYLDGIAPDGLLGLGMADISVPSFLARAGLVRNSFS 372
Query: 253 HCLDVVKGGGIFAIGDVVSPKVKTTPMVP---NMPHYNVILEEVEVGGNPLDLPTSLLGT 309
C K G GD ++TP VP + Y V +++ VG + TS
Sbjct: 373 MCF--TKDSGRIFFGDQGVSTQQSTPFVPLYGKLQTYTVNVDKSCVGHKCFE-STSFQA- 428
Query: 310 GDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
I+DSGT+ LP +Y V +F
Sbjct: 429 ------IVDSGTSFTALPLDIYKAVAIEF 451
>gi|222637379|gb|EEE67511.1| hypothetical protein OsJ_24961 [Oryza sativa Japonica Group]
Length = 641
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 85/259 (32%), Positives = 126/259 (48%), Gaps = 35/259 (13%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLT-----LFDPSKS 135
G Y T++ +GTP+ E+ + VD+GS + +V CA C +C + F P S
Sbjct: 89 GYYTTRLYIGTPSQEFALIVDSGSTVTYVPCATCEQCGNHQSESPNIIEAHDPRFQPDLS 148
Query: 136 STSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNL 195
ST + C N T N R +C Y Y + SS+SG DI+ + S L
Sbjct: 149 STYSPVKC--NVDCTCDNER-------SQCTYERQYAEMSSSSGVLGEDIMSFGKES-EL 198
Query: 196 KTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHC- 254
K +FGC N ++GDL S DGI+G G+ S++ QL G + F+ C
Sbjct: 199 KP----QRAVFGCENTETGDLFSQ---HADGIMGLGRGQLSIMDQLVEKGVISDSFSLCY 251
Query: 255 --LDVVKGGGIFAIGDVVSPK----VKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLG 308
+DV GGG +G + +P + P+ P+YN+ L+E+ V G L L +
Sbjct: 252 GGMDV--GGGTMVLGGMPAPPDMVFSHSNPV--RSPYYNIELKEIHVAGKALRLDPKIFN 307
Query: 309 TGDERGTIIDSGTTLAYLP 327
+ + GT++DSGTT AYLP
Sbjct: 308 S--KHGTVLDSGTTYAYLP 324
>gi|224083757|ref|XP_002307112.1| predicted protein [Populus trichocarpa]
gi|222856561|gb|EEE94108.1| predicted protein [Populus trichocarpa]
Length = 492
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 85/264 (32%), Positives = 132/264 (50%), Gaps = 21/264 (7%)
Query: 82 LYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPT-----KSDLGIKLTLFDPSKSS 136
L++T + +GTP + V +D+GSDL WV C C +C S L L+ + PS+SS
Sbjct: 97 LHYTWIDIGTPHVSFMVALDSGSDLFWVPC-DCVQCAPLSASHYSSLDRDLSEYSPSQSS 155
Query: 137 TSGEIACSDNFCRTTYNNRYPSC-SPGVRCEYVVT-YGDGSSTSGYFVRDIIQLNQASGN 194
TS +++CS C + P+C +P C Y + Y + +S+SG V DII L +
Sbjct: 156 TSKQLSCSHRLC-----DMGPNCKNPKQSCPYSINYYTESTSSSGLLVEDIIHLASGGDD 210
Query: 195 LKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHC 254
+ + VI GCG +QSG G A DG+LG G S+ S LA AG ++ F+ C
Sbjct: 211 TLNTSVKAPVIIGCGMKQSG--GYLDGVAPDGLLGLGLQEISVPSFLAKAGLIQNSFSMC 268
Query: 255 LDVVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERG 314
+ G IF GD ++ P + +Y + VEV + TS L
Sbjct: 269 FNEDDSGRIF-FGDQGPATQQSAPFLKLNGNYTTYIVGVEV----CCVGTSCLKQS-SFS 322
Query: 315 TIIDSGTTLAYLPPMLYDLVLSQF 338
++DSGT+ +LP +++++ +F
Sbjct: 323 ALVDSGTSFTFLPDDVFEMIAEEF 346
>gi|218199944|gb|EEC82371.1| hypothetical protein OsI_26705 [Oryza sativa Indica Group]
Length = 642
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 85/263 (32%), Positives = 127/263 (48%), Gaps = 35/263 (13%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLT-----LFDPSKS 135
G Y T++ +GTP+ E+ + VD+GS + +V CA C +C + F P S
Sbjct: 90 GYYTTRLYIGTPSQEFALIVDSGSTVTYVPCATCEQCGNHQSESPNIIEAHDPRFQPDLS 149
Query: 136 STSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNL 195
ST + C N T N R +C Y Y + SS+SG DI+ + S L
Sbjct: 150 STYSPVKC--NVDCTCDNER-------SQCTYERQYAEMSSSSGVLGEDIMSFGKES-EL 199
Query: 196 KTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHC- 254
K +FGC N ++GDL S DGI+G G+ S++ QL G + F+ C
Sbjct: 200 KP----QRAVFGCENTETGDLFSQ---HADGIMGLGRGQLSIMDQLVEKGVISDSFSLCY 252
Query: 255 --LDVVKGGGIFAIGDVVSPK----VKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLG 308
+DV GGG +G + +P + P+ P+YN+ L+E+ V G L L +
Sbjct: 253 GGMDV--GGGTMVLGGMPAPPDMVFSHSNPV--RSPYYNIELKEIHVAGKALRLDPKIFN 308
Query: 309 TGDERGTIIDSGTTLAYLPPMLY 331
+ + GT++DSGTT AYLP +
Sbjct: 309 S--KHGTVLDSGTTYAYLPEQAF 329
>gi|255586856|ref|XP_002534038.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223525945|gb|EEF28342.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 533
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 97/307 (31%), Positives = 142/307 (46%), Gaps = 38/307 (12%)
Query: 50 SALKQHDTRRHGRMMASIDLE-----LGGNG---HPSATGLYFTKVGLGTPTDEYYVQVD 101
+++ D HGR + S + GN S L++ V +GTP+ Y V +D
Sbjct: 72 ASMAHRDILIHGRKLVSDNTSTPLTFFSGNETYRFSSLGFLHYANVSIGTPSLSYLVALD 131
Query: 102 TGSDLLWVNC----AGCSR-CPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRY 156
TGSDL W+ C +GC + S I ++ P+ SSTS I C++ C + +R
Sbjct: 132 TGSDLFWLPCDCTNSGCVQGLQFPSGEQIDFNIYRPNASSTSQTIPCNNTLC--SRQSRC 189
Query: 157 PSCSPGVRCEYVVTY-GDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGD 215
PS C Y V Y +G+S++G V D++ L + + ++ L++ +IFGCG Q+G
Sbjct: 190 PSAQ--STCPYQVQYLSNGTSSTGVLVEDLLHL--TTDDAQSRALDAKIIFGCGRVQTGS 245
Query: 216 LGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSPKVK 275
AA +G+ G G N S+ S LA G F+ C G G + GD S
Sbjct: 246 FLDG--AAPNGLFGLGMTNISVPSTLAREGYTSNSFSMCFG-RDGIGRISFGDTGSSGQG 302
Query: 276 TTPMVPNM----PHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLY 331
TP N+ P YNV + ++ VGG DL E I DSGT+ YL Y
Sbjct: 303 ETPF--NLRQLHPTYNVSITKINVGGRDADL---------EFSAIFDSGTSFTYLNDPAY 351
Query: 332 DLVLSQF 338
L+ F
Sbjct: 352 TLISESF 358
>gi|148910602|gb|ABR18371.1| unknown [Picea sitchensis]
Length = 446
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 101/318 (31%), Positives = 145/318 (45%), Gaps = 31/318 (9%)
Query: 32 VFEVENKFKAG-GERERTLSALKQHDTRRHGRMMASID---LELGGNGHPSATGLYFTKV 87
V+ ++ K+ A + E + ++ DT R GR + + L GN P GLY+ +
Sbjct: 26 VYRLQPKYPAADNDEEGSKASFVSRDTNRIGRRLQAHQTAIFSLKGNVVP--YGLYYVTM 83
Query: 88 GLGTPTDEYYVQVDTGSDLLWVNC-AGCSRCPTKSDLGIKLTL--FDPSKSSTSGEIACS 144
+G P+ Y++ VD+GS+L W+ C A C C KL PSK +
Sbjct: 84 LVGNPSKPYFLDVDSGSELTWIQCDAPCISCAKGPHPLYKLKKGSLVPSKDPLCAAVQAG 143
Query: 145 DNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSV 204
Y+N + RC+Y V Y D + G+ VRD ++ + + TA +
Sbjct: 144 SGH----YHNHKEASQ---RCDYDVAYADHGYSEGFLVRDSVRALLTNKTVLTA----NS 192
Query: 205 IFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVV--KGGG 262
+FGCG Q L S DA DGILG G +SL SQ A G ++ HC+ GG
Sbjct: 193 VFGCGYNQRESLPVS-DARTDGILGLGSGMASLPSQWAKQGLIKNVIGHCIFGAGRDGGY 251
Query: 263 IFAIGDVVSPKVKT-TPMV--PNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTII-D 318
+F D+VS T PM+ P++ HY V ++ G PLD G G + G II D
Sbjct: 252 MFFGDDLVSTSAMTWVPMLGRPSIKHYYVGAAQMNFGNKPLDKD----GDGKKLGGIIFD 307
Query: 319 SGTTLAYLPPMLYDLVLS 336
SG+T Y Y LS
Sbjct: 308 SGSTYTYFTNQAYGAFLS 325
>gi|449445106|ref|XP_004140314.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
gi|449479851|ref|XP_004155727.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 523
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 90/267 (33%), Positives = 135/267 (50%), Gaps = 27/267 (10%)
Query: 82 LYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRC-PTKSD----LGIKLTLFDPSKSS 136
L++T + LGTP+ + V +D GSDLLWV C C +C P ++ L L+ ++P+ SS
Sbjct: 102 LHYTWIDLGTPSVPFLVALDVGSDLLWVPC-DCIQCAPLSANYYSVLDRDLSEYNPALSS 160
Query: 137 TSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTY-GDGSSTSGYFVRDIIQLNQASGNL 195
TS + C C + + S C Y Y D +STSG+ + D +QL S +
Sbjct: 161 TSKHLFCGHQLCAWSTTCK----SANDPCTYKRDYYSDNTSTSGFMIEDKLQLTSFSKHG 216
Query: 196 KTAPLNSSVIFGCGNRQSGDLGSSTD-AAVDGILGFGQANSSLLSQLAAAGNVRKEFAHC 254
+ L +SV+FGCG +QS GS D AA DG++G G N S+ + LA G VR F+ C
Sbjct: 217 THSLLQASVVFGCGRKQS---GSYLDGAAPDGVMGLGPGNISVPTLLAQEGLVRNTFSLC 273
Query: 255 LDVVKGGGIFAIGDVVSPKVKTTPMVP---NMPHYNVILEEVEVGGNPLDLPTSLLGTGD 311
D G G GD +TT +P Y + +E VG + L +G
Sbjct: 274 FD-NNGSGRILFGDDGPATQQTTQFLPLFGEFAAYFIGVESFCVGS------SCLQRSGF 326
Query: 312 ERGTIIDSGTTLAYLPPMLYDLVLSQF 338
+ ++DSG++ YLP +Y ++ +F
Sbjct: 327 Q--ALVDSGSSFTYLPAEVYKKIVFEF 351
>gi|449533544|ref|XP_004173734.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
1-like, partial [Cucumis sativus]
Length = 408
Score = 115 bits (287), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 89/263 (33%), Positives = 130/263 (49%), Gaps = 19/263 (7%)
Query: 82 LYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKS-----DLGIKLTLFDPSKSS 136
L++T + +GTP+ + V +D GSDLLWV C C +C S L L + PS SS
Sbjct: 102 LHYTWIDIGTPSVSFLVALDAGSDLLWVPC-NCIQCAPLSASYYGSLDKDLNEYRPSSSS 160
Query: 137 TSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTY-GDGSSTSGYFVRDIIQLNQASGNL 195
TS I+CS N C + + + SP C YV+ Y + +S+SG ++D++ L+ N
Sbjct: 161 TSKHISCSHNLCDSGQSCQ----SPKQSCPYVIDYITENTSSSGLLIQDVLHLSSGCENS 216
Query: 196 KTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL 255
+ + VI GCG +QSG G + A DG+ G G S+LS LA V+ F+ C
Sbjct: 217 SNCTIQAPVILGCGMKQSG--GYLSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCF 274
Query: 256 DVVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGT 315
+ G IF GD +TT VP Y + VG + S L +
Sbjct: 275 NEDGSGRIF-FGDEGPASQQTTSFVPLDGKYETYI----VGVEACCIENSCLKQTSFKA- 328
Query: 316 IIDSGTTLAYLPPMLYDLVLSQF 338
+IDSGT+ YLP Y+ ++ +F
Sbjct: 329 LIDSGTSFTYLPEEAYENIVIEF 351
>gi|168059885|ref|XP_001781930.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666576|gb|EDQ53226.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 355
Score = 115 bits (287), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 91/279 (32%), Positives = 130/279 (46%), Gaps = 53/279 (18%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGE 140
G Y V LGTP + V VDTGSDL WV C+ C C +++D +LF P+ S++ +
Sbjct: 1 GEYLATVRLGTPERVFSVIVDTGSDLTWVQCSPCGTCYSQND-----SLFIPNTSTSFTK 55
Query: 141 IACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPL 200
+AC C YP C+ C Y +YGDGS ++G FV D I ++ +G + P
Sbjct: 56 LACGTELCNGL---PYPMCN-QTTCVYWYSYGDGSLSTGDFVYDTITMDGINGQKQQVP- 110
Query: 201 NSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKG 260
+ FGCG+ G A DGILG GQ S SQL N +F++CL
Sbjct: 111 --NFAFGCGHDNEGSF-----AGADGILGLGQGPLSFPSQLKTVFN--GKFSYCLV---- 157
Query: 261 GGIFAIGDVVSPKVKTTPM------VPNMP---------------HYNVILEEVEVGGNP 299
D ++P +T+P+ VP P +Y V L + VGG
Sbjct: 158 -------DWLAPPTQTSPLLFGDAAVPTFPGVKYISLLTNPKVPTYYYVKLNGISVGGKL 210
Query: 300 LDLPTSLLGTGD--ERGTIIDSGTTLAYLPPMLYDLVLS 336
L++ ++ GTI DSGTT+ L ++ VL+
Sbjct: 211 LNISSTAFDIDSVGRAGTIFDSGTTVTQLAGEVHQEVLA 249
>gi|225216914|gb|ACN85210.1| aspartic proteinase nepenthesin-1 precursor [Oryza glaberrima]
Length = 516
Score = 115 bits (287), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 93/260 (35%), Positives = 116/260 (44%), Gaps = 28/260 (10%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSG 139
TG Y VGLGTP Y V DTGSD WV C C + + LFDP+ SST
Sbjct: 177 TGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQ----REKLFDPASSSTYA 232
Query: 140 EIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAP 199
++C+ C + CS G C Y V YGDGS + G+F D + L+
Sbjct: 233 NVSCAAPACS---DLDVSGCSGG-HCLYGVQYGDGSYSIGFFAMDTLTLSSYDA------ 282
Query: 200 LNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDV-V 258
FGCG R G G + G+LG G+ +SL Q G FAHCL
Sbjct: 283 -VKGFRFGCGERNDGLFGEAA-----GLLGLGRGKTSLPVQ--TYGKYGGVFAHCLPPRS 334
Query: 259 KGGGIFAIGDVVSPKVKTTPMVP-NMP-HYNVILEEVEVGGNPLDLPTSLLGTGDERGTI 316
G G G P TTPM+ N P Y V + + VGG L + S+ GTI
Sbjct: 335 TGTGYLDFGAGSPPATTTTPMLTGNGPTFYYVGMTGIRVGGRLLPIAPSVFAAA---GTI 391
Query: 317 IDSGTTLAYLPPMLYDLVLS 336
+DSGT + LPP Y + S
Sbjct: 392 VDSGTVITRLPPAAYSSLRS 411
>gi|302753526|ref|XP_002960187.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
gi|300171126|gb|EFJ37726.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
Length = 398
Score = 115 bits (287), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 92/281 (32%), Positives = 125/281 (44%), Gaps = 42/281 (14%)
Query: 78 SATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSST 137
S G Y T + LGTP + V DTGSDL+W+ C C C + D +FDP SS+
Sbjct: 35 SGGGDYVTTISLGTPAKVFSVIADTGSDLIWIQCKPCQACFNQKD-----PIFDPEGSSS 89
Query: 138 SGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKT 197
++C D C + SCSP C+Y YGDGS T G + + L G K
Sbjct: 90 YTTMSCGDTLCDSLPRK---SCSP--NCDYSYGYGDGSGTRGTLSSETVTLTSTQGE-KL 143
Query: 198 APLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL-- 255
A N + FGCG+ G ++ G++G G+ N S +SQL +F++CL
Sbjct: 144 AAKN--IAFGCGHLNRGSFNDAS-----GLVGLGRGNLSFVSQLGDL--FGHKFSYCLVP 194
Query: 256 --DVVKGGGIFAIGDVVSP-------KVKTTPMVPN---MPHYNVILEEVEVGGNPLDLP 303
D GD S TPM+ N Y V L+++ + G L +P
Sbjct: 195 WRDAPSKTSPMFFGDESSSHSSGKKLHYAFTPMIHNPAMESFYYVKLKDISIAGRALRIP 254
Query: 304 TSLLGTGDER-----GTIIDSGTTLAYLPPMLYDLVLSQFR 339
G+ D + G I DSGTTL LP Y +VL R
Sbjct: 255 A---GSFDIKPDGSGGMIFDSGTTLTLLPDAPYQIVLRALR 292
>gi|125592062|gb|EAZ32412.1| hypothetical protein OsJ_16623 [Oryza sativa Japonica Group]
Length = 473
Score = 115 bits (287), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 87/269 (32%), Positives = 124/269 (46%), Gaps = 31/269 (11%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSG 139
+G YF +VG+G+P + Y+ VD+GSD++WV C C +C ++D LFDP+ SS+
Sbjct: 127 SGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTD-----PLFDPAASSSFS 181
Query: 140 EIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAP 199
++C CRT +C+Y VTYGDGS T G + + L +
Sbjct: 182 GVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLGGTA------- 234
Query: 200 LNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQL-AAAGNVRKEFAHCLDVV 258
V GCG+R SG G+LG G SL+ QL AAG V F++CL
Sbjct: 235 -VQGVAIGCGHRNSGLF-----VGAAGLLGLGWGAMSLVGQLGGAAGGV---FSYCLASR 285
Query: 259 KGGG----IFAIGDVVSPKVKTTPMVPN---MPHYNVILEEVEVGGNPLDLPTSLLGTGD 311
GG + + V P+V N Y V L + VGG L L SL +
Sbjct: 286 GAGGAGSLVLGRTEAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDSLFQLTE 345
Query: 312 E--RGTIIDSGTTLAYLPPMLYDLVLSQF 338
+ G ++D+GT + LP Y + F
Sbjct: 346 DGAGGVVMDTGTAVTRLPREAYAALRGAF 374
>gi|449434468|ref|XP_004135018.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 568
Score = 115 bits (287), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 97/339 (28%), Positives = 149/339 (43%), Gaps = 49/339 (14%)
Query: 26 GVMGNFVFEVENKF--------KAGGERER----TLSALKQHDTRRHGRMMASIDLELG- 72
G +F F++ ++F + G E+ + + D GR +A+ D++
Sbjct: 27 GDAASFKFDIHHRFSDSIKGIFHSEGLPEKHTPGYYATMVHRDRLVRGRRLAASDVDTQL 86
Query: 73 ----GNGH---PSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLG- 124
GN P LY+ V +GTP+ ++ V +DTGSDL W+ C CS C T +
Sbjct: 87 TFAYGNDTAFIPDLGFLYYANVSVGTPSLDFLVALDTGSDLFWLPCE-CSSCFTYLNTSN 145
Query: 125 ---IKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTS-GY 180
L + P+ S+TS + C+ + C +N+ C Y + Y +++S GY
Sbjct: 146 GGKFMLNHYSPNDSTTSSTVPCTSSLCNRCTSNQN-------VCPYEMRYLSANTSSIGY 198
Query: 181 FVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQ 240
V D++ L LK P+ + + FGCG Q+G +T AA +G++G G S+ S
Sbjct: 199 LVEDVLHLATDDSLLK--PVEAKITFGCGTVQTGIF--ATTAAPNGLIGLGMEKISVPSF 254
Query: 241 LAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSPKVKTTPMVPNMPH--YNVILEEVEVGGN 298
LA G F+ C G G GD K TP + + YNV + VGG
Sbjct: 255 LADQGLTSNSFSMCFG-ADGYGRIDFGDTGPADQKQTPFNTMLEYQSYNVTFNVINVGGE 313
Query: 299 PLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQ 337
P D+P + I DSGT+ YL Y + Q
Sbjct: 314 PNDVPFT---------AIFDSGTSFTYLTEPAYSTITKQ 343
>gi|297603570|ref|NP_001054261.2| Os04g0677100 [Oryza sativa Japonica Group]
gi|255675885|dbj|BAF16175.2| Os04g0677100 [Oryza sativa Japonica Group]
Length = 464
Score = 114 bits (286), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 86/263 (32%), Positives = 123/263 (46%), Gaps = 28/263 (10%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSG 139
+G YF +VG+G+P + Y+ VD+GSD++WV C C +C ++D LFDP+ SS+
Sbjct: 127 SGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTD-----PLFDPAASSSFS 181
Query: 140 EIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAP 199
++C CRT +C+Y VTYGDGS T G + + L +
Sbjct: 182 GVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLGGTA------- 234
Query: 200 LNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQL-AAAGNVRKEFAHCLDVV 258
V GCG+R SG G+LG G SL+ QL AAG V F++CL
Sbjct: 235 -VQGVAIGCGHRNSGLF-----VGAAGLLGLGWGAMSLVGQLGGAAGGV---FSYCLASR 285
Query: 259 KGGGIFAIGDVVSPKVKTTPMVPNM-PHYNVILEEVEVGGNPLDLPTSLLGTGDE--RGT 315
GG G +V + + P Y V L + VGG L L SL ++ G
Sbjct: 286 GAGG---AGSLVLGRTEAVPRGRRASSFYYVGLTGIGVGGERLPLQDSLFQLTEDGAGGV 342
Query: 316 IIDSGTTLAYLPPMLYDLVLSQF 338
++D+GT + LP Y + F
Sbjct: 343 VMDTGTAVTRLPREAYAALRGAF 365
>gi|356559244|ref|XP_003547910.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 515
Score = 114 bits (286), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 96/303 (31%), Positives = 139/303 (45%), Gaps = 32/303 (10%)
Query: 50 SALKQHDTRRHGRMMASIDLELG---GNG--HPSATG-LYFTKVGLGTPTDEYYVQVDTG 103
+ L D GR ++ ID L GN S+ G L++T V +GTP ++ V +DTG
Sbjct: 57 AELADRDRLLRGRKLSQIDDGLAFSDGNSTFRISSLGFLHYTTVQIGTPGVKFMVALDTG 116
Query: 104 SDLLWVNCAGCSRCPTKSDLG----IKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSC 159
SDL WV C C+RC L +++P+ SSTS ++ C+++ C +R
Sbjct: 117 SDLFWVPC-DCTRCAATDSSAFASDFDLNVYNPNGSSTSKKVTCNNSLCM----HRSQCL 171
Query: 160 SPGVRCEYVVTYGDG-SSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGS 218
C Y+V+Y +STSG V D++ L Q + N VIFGCG QSG
Sbjct: 172 GTLSNCPYMVSYVSAETSTSGILVEDVLHLTQEDNHHDLVEAN--VIFGCGQIQSGSFLD 229
Query: 219 STDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSPKVKTTP 278
AA +G+ G G S+ S L+ G F+ C G G + GD S TP
Sbjct: 230 V--AAPNGLFGLGMEKISVPSMLSREGFTADSFSMCFG-RDGIGRISFGDKGSFDQDETP 286
Query: 279 --MVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLS 336
+ P+ P YN+ + +V VG +D+ E + DSGT+ YL Y +
Sbjct: 287 FNLNPSHPTYNITVTQVRVGTTLIDV---------EFTALFDSGTSFTYLVDPTYTRLTE 337
Query: 337 QFR 339
F
Sbjct: 338 SFH 340
>gi|302768196|ref|XP_002967518.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
gi|300165509|gb|EFJ32117.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
Length = 398
Score = 114 bits (286), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 93/285 (32%), Positives = 127/285 (44%), Gaps = 42/285 (14%)
Query: 78 SATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSST 137
S G Y T + LGTP + V DTGSDL+W+ C C C + D +FDP SS+
Sbjct: 35 SGGGDYVTTISLGTPAKVFSVIADTGSDLIWIQCKPCQACFNQKD-----PIFDPEGSSS 89
Query: 138 SGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKT 197
++C D C + SCSP C+Y YGDGS T G + + L G K
Sbjct: 90 YTTMSCGDTLCDSLPRK---SCSP--DCDYSYGYGDGSGTRGTLSSETVTLTSTQGE-KL 143
Query: 198 APLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL-- 255
A N + FGCG+ G ++ G++G G+ N S +SQL +F++CL
Sbjct: 144 AAKN--IAFGCGHLNRGSFNDAS-----GLVGLGRGNLSFVSQLGDL--FGHKFSYCLVP 194
Query: 256 --DVVKGGGIFAIGDVVSP-------KVKTTPMVPN---MPHYNVILEEVEVGGNPLDLP 303
D GD S TPM+ N Y V L+++ + G L +P
Sbjct: 195 WRDAPSKTSPMFFGDESSSHSSGKKLHYAFTPMIHNPAMESFYYVKLKDISIAGRALRIP 254
Query: 304 TSLLGTGDER-----GTIIDSGTTLAYLPPMLYDLVLSQFRFWIA 343
G+ D + G I DSGTTL LP Y +VL R I+
Sbjct: 255 A---GSFDIKPDGSGGMIFDSGTTLTLLPDAPYQIVLRALRSKIS 296
>gi|296082464|emb|CBI21469.3| unnamed protein product [Vitis vinifera]
Length = 530
Score = 114 bits (286), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 84/264 (31%), Positives = 129/264 (48%), Gaps = 21/264 (7%)
Query: 82 LYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKS-----DLGIKLTLFDPSKSS 136
L++T + +GTP + V +D GSDLLW+ C C +C S L L + PS SS
Sbjct: 99 LHYTWIDIGTPNISFLVALDAGSDLLWIPC-DCIQCAPLSASYYGSLDRDLNQYSPSGSS 157
Query: 137 TSGEIACSDNFCRTTYNNRYPSC-SPGVRCEYVVT-YGDGSSTSGYFVRDIIQLNQASGN 194
TS ++CS C ++ P+C SP C Y + Y + +S+SG + DI+ L +
Sbjct: 158 TSKHLSCSHQLCESS-----PNCDSPKQLCPYTINYYSENTSSSGLLIEDILHLTSGIDD 212
Query: 195 LKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHC 254
+ + + VI GCG RQ+G G A DG++G G S+ S L+ AG V+ F+ C
Sbjct: 213 ASNSSVRAPVIIGCGMRQTG--GYLDGVAPDGLMGLGLGEISVPSFLSKAGLVKNSFSLC 270
Query: 255 LDVVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERG 314
+ G IF GD +TT +P+ Y + VG + +S + R
Sbjct: 271 FNDDDSGRIF-FGDQGLATQQTTLFLPSDGKYETYI----VGVEACCIGSSCIKQTSFRA 325
Query: 315 TIIDSGTTLAYLPPMLYDLVLSQF 338
++DSG + +LP Y V+ +F
Sbjct: 326 -LVDSGASFTFLPDESYRNVVDEF 348
>gi|449451627|ref|XP_004143563.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 532
Score = 114 bits (286), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 89/263 (33%), Positives = 130/263 (49%), Gaps = 19/263 (7%)
Query: 82 LYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKS-----DLGIKLTLFDPSKSS 136
L++T + +GTP+ + V +D GSDLLWV C C +C S L L + PS SS
Sbjct: 102 LHYTWIDIGTPSVSFLVALDAGSDLLWVPC-NCIQCAPLSASYYGSLDKDLNEYRPSSSS 160
Query: 137 TSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTY-GDGSSTSGYFVRDIIQLNQASGNL 195
TS I+CS N C + + + SP C YV+ Y + +S+SG ++D++ L+ N
Sbjct: 161 TSKHISCSHNLCDSGQSCQ----SPKQSCPYVIDYITENTSSSGLLIQDVLHLSSGCENS 216
Query: 196 KTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL 255
+ + VI GCG +QSG G + A DG+ G G S+LS LA V+ F+ C
Sbjct: 217 SNCTIQAPVILGCGMKQSG--GYLSGVAPDGLFGLGLGEISVLSSLAKEELVQNSFSLCF 274
Query: 256 DVVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGT 315
+ G IF GD +TT VP Y + VG + S L +
Sbjct: 275 NEDGSGRIF-FGDEGPASQQTTSFVPLDGKYETYI----VGVEACCIENSCLKQTSFKA- 328
Query: 316 IIDSGTTLAYLPPMLYDLVLSQF 338
+IDSGT+ YLP Y+ ++ +F
Sbjct: 329 LIDSGTSFTYLPEEAYENIVIEF 351
>gi|297832400|ref|XP_002884082.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297329922|gb|EFH60341.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 513
Score = 114 bits (286), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 87/265 (32%), Positives = 127/265 (47%), Gaps = 28/265 (10%)
Query: 82 LYFTKVGLGTPTDEYYVQVDTGSDLLWV--NCAGCSR---CPTKSDLGIKLTLFDPSKSS 136
L++ V +GTP+D + V +DTGSDL W+ +C C R P S L L ++ P+ SS
Sbjct: 103 LHYANVTVGTPSDWFLVALDTGSDLFWLPCDCTNCVRELKAPGGSSL--DLNIYSPNASS 160
Query: 137 TSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTY-GDGSSTSGYFVRDIIQLNQASGNL 195
TS ++ C+ C T +R SP C Y + Y +G+S++G V D++ L +
Sbjct: 161 TSTKVPCNSTLC--TRGDR--CASPESNCPYQIRYLSNGTSSTGVLVEDVLHLVSNDKSS 216
Query: 196 KTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL 255
K P + V GCG Q+G AA +G+ G G + S+ S LA G F+ C
Sbjct: 217 KAIP--ARVTLGCGQVQTGVFHDG--AAPNGLFGLGLEDISVPSVLAKEGIAANSFSMCF 272
Query: 256 DVVKGGGIFAIGDVVSPKVKTTPMVPNMPH--YNVILEEVEVGGNPLDLPTSLLGTGDER 313
G G + GD S + TP+ PH YN+ + ++ V GN DL E
Sbjct: 273 G-NDGAGRISFGDKGSVDQRETPLNIRQPHPTYNITVTKISVEGNTGDL---------EF 322
Query: 314 GTIIDSGTTLAYLPPMLYDLVLSQF 338
+ DSGT+ YL Y L+ F
Sbjct: 323 DAVFDSGTSFTYLTDAAYTLISESF 347
>gi|116787398|gb|ABK24493.1| unknown [Picea sitchensis]
Length = 479
Score = 114 bits (285), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 92/274 (33%), Positives = 131/274 (47%), Gaps = 37/274 (13%)
Query: 75 GHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSK 134
G TG Y G GTP + +DTGSD+ W+ C CS C ++ D +F+P +
Sbjct: 130 GSKVGTGNYIVTAGFGTPAKNSLLIIDTGSDVTWIQCKPCSDCYSQVD-----PIFEPQQ 184
Query: 135 SSTSGEIACSDNFCR--TTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQAS 192
SS+ ++C + C TT N+ C G C Y + YGDGS + G F ++ + L S
Sbjct: 185 SSSYKHLSCLSSACTELTTMNH----CRLG-GCVYEINYGDGSRSQGDFSQETLTLGSDS 239
Query: 193 GNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFA 252
S FGCG+ +G S G+LG G+ S SQ + +F+
Sbjct: 240 --------FPSFAFGCGHTNTGLFKGSA-----GLLGLGRTALSFPSQTKS--KYGGQFS 284
Query: 253 HCL-DVVK--GGGIFAIGDVVSPKVKT-TPMVPNMPH---YNVILEEVEVGGNPLDLPTS 305
+CL D V G F++G P T P+V N + Y V L + VGG L +P +
Sbjct: 285 YCLPDFVSSTSTGSFSVGQGSIPATATFVPLVSNSNYPSFYFVGLNGISVGGERLSIPPA 344
Query: 306 LLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
+LG G GTI+DSGT + L P YD + + FR
Sbjct: 345 VLGRG---GTIVDSGTVITRLVPQAYDALKTSFR 375
>gi|255567949|ref|XP_002524952.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223535787|gb|EEF37449.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 394
Score = 114 bits (285), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 91/281 (32%), Positives = 135/281 (48%), Gaps = 43/281 (15%)
Query: 54 QHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAG 113
Q R + RM DL L G Y T++ +GTP + + VDTGS + +V C+
Sbjct: 69 QGSARPNARMRLYDDLLL--------NGYYTTRIWIGTPPQTFALIVDTGSTVTYVPCST 120
Query: 114 CSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGD 173
C +C D F+P SST ++C N T N R +C Y Y +
Sbjct: 121 CEQCGRHQD-----PKFEPELSSTYQPVSC--NIDCTCDNER-------KQCVYERQYAE 166
Query: 174 GSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQA 233
SS+SG DII GN ++ + IFGC N+++GDL S DGI+G G+
Sbjct: 167 MSSSSGVLGEDIISF----GN-QSELVPQRAIFGCENQETGDLYSQR---ADGIMGLGRG 218
Query: 234 NSSLLSQLAAAGNVRKEFAHC---LDVVKGGGIFAIGDVVSPK----VKTTPMVPNMPHY 286
+ S++ QL G + F+ C +D+ GGG +G + P ++ P+ +Y
Sbjct: 219 DLSIVDQLVEKGVISDSFSLCYGGMDI--GGGAMILGGISPPSGMVFAESDPVRSQ--YY 274
Query: 287 NVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLP 327
N+ L+ + V G L L S+ + GT++DSGTT AYLP
Sbjct: 275 NIDLKAIHVAGKQLHLDPSIF--DGKHGTVLDSGTTYAYLP 313
>gi|357463449|ref|XP_003602006.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355491054|gb|AES72257.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 529
Score = 114 bits (285), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 85/265 (32%), Positives = 126/265 (47%), Gaps = 22/265 (8%)
Query: 82 LYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPT-----KSDLGIKLTLFDPSKSS 136
L++T + +GTP+ + V +D GSDLLWV C C C S+L L + PS+S
Sbjct: 99 LHYTWIDIGTPSTSFLVALDAGSDLLWVPC-DCIHCAPLSASFYSNLDRDLNEYSPSRSL 157
Query: 137 TSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTY-GDGSSTSGYFVRDIIQLNQASGNL 195
+S ++CS C N + S +C Y + Y D +S+SG V DI L G+
Sbjct: 158 SSKHLSCSHRLCDMGSNCK---TSKQQQCPYTINYLSDNTSSSGLLVEDIFHLQSGDGST 214
Query: 196 KTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL 255
+ + + V+ GCG +QSG G A DG++G G SS+ S LA +G +R F+ C
Sbjct: 215 SNSSVQAPVVVGCGMKQSG--GYLDGTAPDGLIGLGPGESSVPSFLAKSGLIRDSFSLCF 272
Query: 256 DVVKGGGIFAIGDVVSPKVKTTP--MVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDER 313
+ G +F GD S ++TP +V M ++ E GN TS
Sbjct: 273 NEDDSGRLF-FGDQGSTVQQSTPFLLVDGMFSTYIVGVETCCIGNSCPKVTSF------- 324
Query: 314 GTIIDSGTTLAYLPPMLYDLVLSQF 338
DSGT+ +LP Y + +F
Sbjct: 325 NAQFDSGTSFTFLPGHAYGAIAEEF 349
>gi|293335828|ref|NP_001170221.1| uncharacterized protein LOC100384173 precursor [Zea mays]
gi|224034427|gb|ACN36289.1| unknown [Zea mays]
Length = 443
Score = 114 bits (285), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 99/311 (31%), Positives = 144/311 (46%), Gaps = 56/311 (18%)
Query: 47 RTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDL 106
++L+AL D R+ L L +G Y ++G+GTPT Y +DTGSDL
Sbjct: 65 QSLAALAPGDAITAARI-----LVLASDGE------YLMEMGIGTPTRYYSAILDTGSDL 113
Query: 107 LWVNCAGCSRC---PTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGV 163
+W CA C C PT FDP++S+T + C+ C Y YP C V
Sbjct: 114 IWTQCAPCLLCVDQPTP--------YFDPARSATYRSLGCASPACNALY---YPLCYQKV 162
Query: 164 RCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAA 223
C Y YGD +ST+G + G +T + FGCGN +G L + +
Sbjct: 163 -CVYQYFYGDSASTAGVLANETFTF----GTNETRVSLPGISFGCGNLNAGLLANGS--- 214
Query: 224 VDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGG-------GIFAI---GDVVSPK 273
G++GFG+ + SL+SQL + F++CL G++A + S
Sbjct: 215 --GMVGFGRGSLSLVSQLGS-----PRFSYCLTSFLSPVPSRLYFGVYATLNSTNASSEP 267
Query: 274 VKTTPMV--PNMP-HYNVILEEVEVGGNPLDLPTSLLGTGDER---GTIIDSGTTLAYLP 327
V++TP V P +P Y + + + VGG L + ++ D GTIIDSGTT+ YL
Sbjct: 268 VQSTPFVVNPALPTMYFLNMTGISVGGYLLPIDPAVFAINDTDGTGGTIIDSGTTITYLA 327
Query: 328 PMLYDLVLSQF 338
YD V + F
Sbjct: 328 EPAYDAVRAAF 338
>gi|242067689|ref|XP_002449121.1| hypothetical protein SORBIDRAFT_05g005400 [Sorghum bicolor]
gi|241934964|gb|EES08109.1| hypothetical protein SORBIDRAFT_05g005400 [Sorghum bicolor]
Length = 358
Score = 114 bits (285), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 81/284 (28%), Positives = 127/284 (44%), Gaps = 44/284 (15%)
Query: 69 LELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLT 128
+L GN +P TG Y+ + +G P Y++ VDTGSDL W+ C P +S +
Sbjct: 42 FQLQGNVYP--TGHYYVTMNIGNPAKPYFLDVDTGSDLTWLQCDA----PCRSCNKVPHP 95
Query: 129 LFDPSKSSTSGEIACSDNFCRTTY-----NNRYPSCSPGVRCEYVVTYGDGSSTSGYFVR 183
L+ P+ +S + C++ C + NN+ PS +C+Y + Y D +S+ G +
Sbjct: 96 LYRPTANSL---VPCANALCTALHSGHGSNNKCPSPK---QCDYQIKYTDSASSQGVLIN 149
Query: 184 DIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAA 243
D L S N++ + FGCG Q + AA DG+LG G+ + SL+SQL
Sbjct: 150 DNFSLPMRSSNIRPG-----LTFGCGYDQQVGKNGAVQAATDGMLGLGRGSVSLVSQLKQ 204
Query: 244 AGNVRKEFAHCLDVVKGGGIFAIGDVVSPKVKTT--PMVPNMPHY------NVILEEVEV 295
G + HCL GGG GD + P + T PM +Y + + +
Sbjct: 205 QGITKNVLGHCLS-TNGGGFLFFGDDIVPTSRVTWVPMAKISGNYYSPGSGTLYFDRRSL 263
Query: 296 GGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
G P+++ + DSG+T Y Y V+S +
Sbjct: 264 GVKPMEV-------------VFDSGSTYTYFTAQPYQAVVSALK 294
>gi|225438629|ref|XP_002281243.1| PREDICTED: aspartic proteinase-like protein 1-like [Vitis vinifera]
Length = 511
Score = 114 bits (285), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 84/264 (31%), Positives = 129/264 (48%), Gaps = 21/264 (7%)
Query: 82 LYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKS-----DLGIKLTLFDPSKSS 136
L++T + +GTP + V +D GSDLLW+ C C +C S L L + PS SS
Sbjct: 80 LHYTWIDIGTPNISFLVALDAGSDLLWIPC-DCIQCAPLSASYYGSLDRDLNQYSPSGSS 138
Query: 137 TSGEIACSDNFCRTTYNNRYPSC-SPGVRCEYVVT-YGDGSSTSGYFVRDIIQLNQASGN 194
TS ++CS C ++ P+C SP C Y + Y + +S+SG + DI+ L +
Sbjct: 139 TSKHLSCSHQLCESS-----PNCDSPKQLCPYTINYYSENTSSSGLLIEDILHLTSGIDD 193
Query: 195 LKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHC 254
+ + + VI GCG RQ+G G A DG++G G S+ S L+ AG V+ F+ C
Sbjct: 194 ASNSSVRAPVIIGCGMRQTG--GYLDGVAPDGLMGLGLGEISVPSFLSKAGLVKNSFSLC 251
Query: 255 LDVVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERG 314
+ G IF GD +TT +P+ Y + VG + +S + R
Sbjct: 252 FNDDDSGRIF-FGDQGLATQQTTLFLPSDGKYETYI----VGVEACCIGSSCIKQTSFRA 306
Query: 315 TIIDSGTTLAYLPPMLYDLVLSQF 338
++DSG + +LP Y V+ +F
Sbjct: 307 -LVDSGASFTFLPDESYRNVVDEF 329
>gi|242056193|ref|XP_002457242.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
gi|241929217|gb|EES02362.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
Length = 457
Score = 114 bits (285), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 87/270 (32%), Positives = 128/270 (47%), Gaps = 32/270 (11%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y V +GTP + DTGSDL+WVNC+ +D G + +F P++SST +++
Sbjct: 103 YLMYVNVGTPPTQLLAIADTGSDLVWVNCSSSGGGLADADAGGNV-VFQPTRSSTYSQLS 161
Query: 143 CSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQ--ASGNLKTAPL 200
C N C+ SC C+Y +YGDGS T G + G ++ +
Sbjct: 162 CQSNACQALSQ---ASCDADSECQYQYSYGDGSRTIGVLSTETFSFVDGGGKGQVRVPRV 218
Query: 201 NSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL----D 256
N FGC +G S DG++G G SL+SQL A ++ ++ ++CL D
Sbjct: 219 N----FGCSTASAGTFRS------DGLVGLGAGAFSLVSQLGATTHIDRKLSYCLIPSYD 268
Query: 257 VVKGGGI-FAIGDVVS-PKVKTTPMVPN--MPHYNVILEEVEVGGNPLDLPTSLLGTGDE 312
+ F VVS P +TP+VP+ +Y V LE V VGG + T D
Sbjct: 269 ANSSSTLNFGSRAVVSEPGAASTPLVPSDVDSYYTVALESVAVGGQE-------VATHDS 321
Query: 313 RGTIIDSGTTLAYLPPMLYDLVLSQFRFWI 342
R I+DSGTTL +L P L ++++ I
Sbjct: 322 R-IIVDSGTTLTFLDPALLGPLVTELERRI 350
>gi|449434466|ref|XP_004135017.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 525
Score = 114 bits (285), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 85/266 (31%), Positives = 129/266 (48%), Gaps = 32/266 (12%)
Query: 84 FTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRC------PTKSDLGIKLTLFDPSKSST 137
+T V LGTP ++ V +DTGSDL WV C CSRC P SD +L+++ P KSST
Sbjct: 113 YTTVQLGTPGTKFMVALDTGSDLFWVPC-DCSRCAPTEGSPYASDF--ELSVYSPKKSST 169
Query: 138 SGEIACSDNFCRTTYNNRYPSCSPGV-RCEYVVTYGDG-SSTSGYFVRDIIQLNQASGNL 195
S + C++N C + C+ C YVV+Y +ST+G + D++ L + +
Sbjct: 170 SKTVPCNNNLCA-----QRDQCTEAFGNCPYVVSYVSAETSTTGILIEDLLHLK--TEHK 222
Query: 196 KTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL 255
+ P+ + + FGCG QSG AA +G+ G G S+ S L+ G + F+ C
Sbjct: 223 HSEPIQAYITFGCGQVQSGSFLDV--AAPNGLFGLGMEQISVPSILSREGLMANSFSMCF 280
Query: 256 DVVKGGGIFAIGDVVSPKVKTTPMVPNM--PHYNVILEEVEVGGNPLDLPTSLLGTGDER 313
G G GD S + + TP N P+YN+ + + VG +D + L
Sbjct: 281 S-DDGVGRINFGDKGSLEQEETPFNLNQLHPNYNITVTSIRVGTTLIDADITAL------ 333
Query: 314 GTIIDSGTTLAYLPPMLYDLVLSQFR 339
DSGT+ +Y +Y + + F
Sbjct: 334 ---FDSGTSFSYFTDPIYSKLSASFH 356
>gi|449434470|ref|XP_004135019.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
gi|449517144|ref|XP_004165606.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
sativus]
Length = 508
Score = 114 bits (285), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 94/304 (30%), Positives = 135/304 (44%), Gaps = 44/304 (14%)
Query: 50 SALKQHDTRRHGRMMASID-------------LELGGNGHPSATGLYFTKVGLGTPTDEY 96
+A+ D HGR +A+ + EL G G+ LY+ V +GTP +
Sbjct: 63 AAMVHRDRLLHGRNLATTNGDTPLMFSYGNETYELSGLGN-----LYYANVSIGTPGLYF 117
Query: 97 YVQVDTGSDLLWVNCAGCSRCP---TKSDLG-IKLTLFDPSKSSTSGEIACSDNFCRTTY 152
V +DTGSDL W+ C C++CP TK D G L + + SSTS + CS + C
Sbjct: 118 LVALDTGSDLFWLPCE-CTKCPTYLTKRDNGKFWLNHYSSNASSTSIRVPCSSSLCELA- 175
Query: 153 NNRYPSCSPG-VRCEYVVTY-GDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGN 210
CS C Y Y + SS++GY V+DI+ + LK P++ V GCG
Sbjct: 176 ----NQCSSNKSSCPYQTHYLSENSSSAGYLVQDILHMATDDSQLK--PVDVKVTLGCGK 229
Query: 211 RQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVV 270
Q+G + T A +G++G G S+ S LA+ G F+ C G G GD+
Sbjct: 230 VQTGKFSNVT--APNGLIGLGMGKVSVPSFLASQGLTTDSFSMCFGYY-GYGRIDFGDIG 286
Query: 271 SPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPML 330
+ TP P YNV + ++ V P T IIDSG + YL
Sbjct: 287 PVGQRETPFNPASLSYNVTILQIIVTNRP---------TNVHLTAIIDSGASFTYLTDPF 337
Query: 331 YDLV 334
Y ++
Sbjct: 338 YSII 341
>gi|224031303|gb|ACN34727.1| unknown [Zea mays]
gi|413923868|gb|AFW63800.1| hypothetical protein ZEAMMB73_012138 [Zea mays]
Length = 557
Score = 114 bits (284), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 90/304 (29%), Positives = 137/304 (45%), Gaps = 27/304 (8%)
Query: 42 GGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVD 101
GG + R + + T R ++ L + GN P G Y+T + +G P Y++ VD
Sbjct: 151 GGRKARNRMEVAKAAT---ARTNSTALLPIKGNVFPD--GQYYTSIFIGNPPRPYFLDVD 205
Query: 102 TGSDLLWVNC-AGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCS 160
TGSDL W+ C A C+ C L+ P+K + D C+ N+ C
Sbjct: 206 TGSDLTWIQCDAPCTNCAKGPH-----PLYKPAKEKI---VPPRDLLCQELQGNQN-YCE 256
Query: 161 PGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSST 220
+C+Y + Y D SS+ G RD + + +G + +FGC Q G L SS
Sbjct: 257 TCKQCDYEIEYADQSSSMGVLARDDMHMIATNGGREKL----DFVFGCAYDQQGQLLSSP 312
Query: 221 DAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFA-IGDVVSPKVKTT-P 278
A DGILG A S SQLA+ G + F HC+ +GGG + +GD P+ T
Sbjct: 313 -AKTDGILGLSSAAISFPSQLASHGIIANVFGHCITREQGGGGYMFLGDDYVPRWGVTWT 371
Query: 279 MVPNMPH--YNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLS 336
+ + P Y+ V+ G L P G I DSG++ YLP +Y+ +++
Sbjct: 372 SIRSGPDNLYHTQAHHVKYGDQQLRRPEQ---AGSTVQVIFDSGSSYTYLPNEIYENLVA 428
Query: 337 QFRF 340
++
Sbjct: 429 AIKY 432
>gi|242092900|ref|XP_002436940.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
gi|241915163|gb|EER88307.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
Length = 465
Score = 114 bits (284), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 97/278 (34%), Positives = 129/278 (46%), Gaps = 43/278 (15%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGC--SRCPTKSDLGIKLTLFDPSKSSTSGE 140
Y +G GTP+ + +DTGSD+ WV CA C + C + D LFDPSKSST
Sbjct: 125 YMVTLGFGTPSVPQVLLMDTGSDVSWVQCAPCNSTECYPQKD-----PLFDPSKSSTYAP 179
Query: 141 IACSDNFCRTTYNNRYPSC-SPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAP 199
IAC + C ++ C S G +C Y V YGDGSST G + + I AP
Sbjct: 180 IACGADACNKLGDHYRNGCTSGGTQCGYRVEYGDGSSTRGVYSNETITF---------AP 230
Query: 200 --LNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDV 257
FGCG+ Q G DG+LG G A SL+ Q A+ F++CL
Sbjct: 231 GITVKDFHFGCGHDQRG-----PSDKFDGLLGLGGAPESLVVQTASV--YGGAFSYCLPA 283
Query: 258 VKG-GGIFAIGDVVSPKVKT-------TPM--VP-NMPHYNVILEEVEVGGNPLDLPTSL 306
+ G A+G V P T TPM +P + Y V + + VGG PLD+P S
Sbjct: 284 LNSEAGFLALG--VRPSAATNTSAFVFTPMWHLPMDATSYMVNMTGISVGGKPLDIPRSA 341
Query: 307 LGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFRFWIAS 344
G +IDSGT + LP Y+ + + R A+
Sbjct: 342 F----RGGMLIDSGTIVTELPETAYNALNAALRKAFAA 375
>gi|357137345|ref|XP_003570261.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 458
Score = 114 bits (284), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 90/265 (33%), Positives = 125/265 (47%), Gaps = 33/265 (12%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y V +GTP V +DTGSD+ WV+C +R S L FDP KSST +
Sbjct: 125 YVITVSIGTPAMTQAVMIDTGSDVSWVHCH--ARAGAGSSL-----FFDPGKSSTYTPFS 177
Query: 143 CSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASG--NLKTAPL 200
CS C T R CS C+Y V YGDGS+T+G + D + LN N +
Sbjct: 178 CSSAAC-TRLEGRDNGCSLNSTCQYTVRYGDGSNTTGTYGSDTLALNSTEKVENFQ---- 232
Query: 201 NSSVIFGCGNRQSGDLGSSTDA-AVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDV-V 258
FGC ++ D G D DG++G G SL+SQ AA F++CL
Sbjct: 233 -----FGC--SETSDPGEGLDEDQTDGLMGLGGGAPSLVSQTAA--TYGSAFSYCLPATT 283
Query: 259 KGGGIFAIGDVV-SPKVKTTPMVPNM---PHYNVILEEVEVGGNPLDLPTSLLGTGDERG 314
+ G +G + TTPM + Y VIL+ + VGG+P+ + ++ G
Sbjct: 284 RSSGFLTLGASTGTSGFVTTPMFRSRRAPTFYFVILQGINVGGDPVAISPTVFAA----G 339
Query: 315 TIIDSGTTLAYLPPMLYDLVLSQFR 339
+I+DSGT + LPP Y + + FR
Sbjct: 340 SIMDSGTIITRLPPRAYSALSAAFR 364
>gi|357517921|ref|XP_003629249.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355523271|gb|AET03725.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 553
Score = 114 bits (284), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 94/294 (31%), Positives = 136/294 (46%), Gaps = 36/294 (12%)
Query: 50 SALKQHDTRRHGRMMASIDLELG---GNG--HPSATG-LYFTKVGLGTPTDEYYVQVDTG 103
+ L D GR ++ D L GN S+ G L++T + LGTP ++ V +DTG
Sbjct: 62 AELADRDRFLRGRRLSQFDAGLAFSDGNSTFRISSLGFLHYTTIELGTPGVKFMVALDTG 121
Query: 104 SDLLWVNCAGCSRCPTK--------SDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNR 155
SDL WV C C+RC L++++P+ SSTS ++ C+++ C +R
Sbjct: 122 SDLFWVPC-DCTRCSATRSSAFASALASDFDLSVYNPNGSSTSKKVTCNNSLC----THR 176
Query: 156 YPSCSPGVRCEYVVTYGDG-SSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSG 214
C Y+V+Y +STSG V D++ L Q N N VIFGCG QSG
Sbjct: 177 NQCLGTFSNCPYMVSYVSAETSTSGILVEDVLHLTQPDDNHDLVEAN--VIFGCGQVQSG 234
Query: 215 DLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSPKV 274
AA +G+ G G S+ S L+ G F+ C G G + GD S
Sbjct: 235 SFLDV--AAPNGLFGLGMEKISVPSMLSREGFTADSFSMCFG-RDGIGRISFGDKGSLDQ 291
Query: 275 KTTP--MVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYL 326
TP + P+ P YN+ + +V VG +D+ E + DSGT+ YL
Sbjct: 292 DETPFNVNPSHPTYNITINQVRVGTTLIDV---------EFTALFDSGTSFTYL 336
>gi|357492401|ref|XP_003616489.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355517824|gb|AES99447.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 114 bits (284), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 85/260 (32%), Positives = 125/260 (48%), Gaps = 26/260 (10%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGE 140
G Y +GTP + Y DTGSD++W+ C C +C ++ +F+PSKSS+
Sbjct: 85 GGYLMTYSVGTPPTKIYGIADTGSDIVWLQCEPCEQCYNQT-----TPIFNPSKSSSYKN 139
Query: 141 IACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPL 200
I C C + R SCS C+Y ++YGD S + G D + L SG+ + P
Sbjct: 140 IPCLSKLCHSV---RDTSCSDQNSCQYKISYGDSSHSQGDLSVDTLSLESTSGSPVSFP- 195
Query: 201 NSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL----- 255
+ GCG +G G A GI+G G SL++QL ++ + +F++CL
Sbjct: 196 --KTVIGCGTDNAGTFG----GASSGIVGLGGGPVSLITQLGSS--IGGKFSYCLVPLLN 247
Query: 256 DVVKGGGIFAIGD--VVSPK-VKTTPMVPNMP-HYNVILEEVEVGGNPLDLPTSLLGTGD 311
I + GD VVS V +TP++ P Y + L+ VG ++ S G D
Sbjct: 248 KESNASSILSFGDAAVVSGDGVVSTPLIKKDPVFYFLTLQAFSVGNKRVEFGGSSEGGDD 307
Query: 312 ERGTIIDSGTTLAYLPPMLY 331
E IIDSGTTL +P +Y
Sbjct: 308 EGNIIIDSGTTLTLIPSDVY 327
>gi|357483911|ref|XP_003612242.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355513577|gb|AES95200.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 527
Score = 114 bits (284), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 86/265 (32%), Positives = 124/265 (46%), Gaps = 25/265 (9%)
Query: 82 LYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLG----IKLTLFDPSKSST 137
L+F V +GTP Y V +DTGSDL W+ C C++C L I ++D +SST
Sbjct: 112 LHFANVSVGTPASSYLVALDTGSDLFWLPC-NCTKCVHGIQLSTGQKIAFNIYDNKESST 170
Query: 138 SGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTY-GDGSSTSGYFVRDIIQLNQASGNLK 196
S +AC+ + C S S G C Y V Y + +ST+G+ V D++ L + +
Sbjct: 171 SKNVACNSSLCE---QKTQCSSSSGGTCPYQVEYLSENTSTTGFLVEDVLHL-ITDNDDQ 226
Query: 197 TAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLD 256
T N + FGCG Q+G AA +G+ G G ++ S+ S LA G F+ C
Sbjct: 227 TQHANPLITFGCGQVQTGAFLDG--AAPNGLFGLGMSDVSVPSILAKQGLTSNSFSMCF- 283
Query: 257 VVKGGGIFAIGDVVSPKVK-TTP--MVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDER 313
G G GD S + TP + P+ YN+ + ++ VGGN DL E
Sbjct: 284 AADGLGRITFGDNNSSLDQGKTPFNIRPSHSTYNITVTQIIVGGNSADL---------EF 334
Query: 314 GTIIDSGTTLAYLPPMLYDLVLSQF 338
I D+GT+ YL Y + F
Sbjct: 335 NAIFDTGTSFTYLNNPAYKQITQSF 359
>gi|168064509|ref|XP_001784204.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664276|gb|EDQ51002.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 367
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 95/275 (34%), Positives = 129/275 (46%), Gaps = 33/275 (12%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSG 139
TG YF VG+GTP + Y+ VDTGSD+ W+ CA C+ C + D LF+PS SS+
Sbjct: 13 TGEYFAVVGVGTPRRDMYLVVDTGSDITWLQCAPCTNCYKQKD-----ALFNPSSSSSFK 67
Query: 140 EIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAP 199
+ CS + C N C +C Y YGDGS T G V D + L+ A G +
Sbjct: 68 VLDCSSSLC---LNLDVMGCLSN-KCLYQADYGDGSFTMGELVTDNVVLDDAFGPGQVVL 123
Query: 200 LNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVK 259
N + GCG+ G G++ GILG G+ S + L A+ R F++CL +
Sbjct: 124 TN--IPLGCGHDNEGTFGTAA-----GILGLGRGPLSFPNNLDAS--TRNIFSYCLPDRE 174
Query: 260 GG----GIFAIGDVVSPK-----VKTTPMVPN---MPHYNVILEEVEVGGNPL-DLPTSL 306
GD P VK P + N +Y V + + VGGN L ++P S+
Sbjct: 175 SDPNHKSTLVFGDAAIPHTATGSVKFIPQLRNPRVATYYYVQITGISVGGNLLTNIPASV 234
Query: 307 --LGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
L + GTI DSGTT+ L Y V FR
Sbjct: 235 FQLDSHGNGGTIFDSGTTITRLEARAYTAVRDAFR 269
>gi|224063191|ref|XP_002301033.1| predicted protein [Populus trichocarpa]
gi|222842759|gb|EEE80306.1| predicted protein [Populus trichocarpa]
Length = 536
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 84/268 (31%), Positives = 131/268 (48%), Gaps = 27/268 (10%)
Query: 82 LYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKS------DLGIKLTLFDPSKS 135
L++T + +GTP + V +D GSDLLWV C C +C S L L+ + PS S
Sbjct: 106 LHYTWIDIGTPNVSFLVALDAGSDLLWVPC-DCIQCAPLSASYYNISLDRDLSEYSPSLS 164
Query: 136 STSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGD--GSSTSGYFVRDIIQLNQASG 193
STS ++C C N + +P C Y+ Y D ++++G+ V D + L
Sbjct: 165 STSRHLSCDHQLCEWGSNCK----NPKDPCPYIFNYDDFENTTSAGFLVEDKLHLASVGD 220
Query: 194 NLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAH 253
+ L +SV+ GCG +Q G AA DG++G G + S+ S LA AG ++ F+
Sbjct: 221 HTARKMLQASVVLGCGRKQGGSFFDG--AAPDGVMGLGPGDISVPSLLAKAGLIQNCFSL 278
Query: 254 CLDVVKGGGIFAIGDVVSPKVKTTPMVP---NMPHYNVILEEVEVGGNPLDLPTSLLGTG 310
C D G I GD ++TP +P Y V +E VG + L +G
Sbjct: 279 CFDENDSGRIL-FGDRGHASQQSTPFLPIQGTYVAYFVGVESYCVGN------SCLKRSG 331
Query: 311 DERGTIIDSGTTLAYLPPMLYDLVLSQF 338
+ ++DSG++ YLP +Y+ ++S+F
Sbjct: 332 FK--ALVDSGSSFTYLPSEVYNELVSEF 357
>gi|242095588|ref|XP_002438284.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
gi|241916507|gb|EER89651.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
Length = 487
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 99/315 (31%), Positives = 141/315 (44%), Gaps = 43/315 (13%)
Query: 54 QHDTRRHGRMMASIDLELGGNGHPSATGL------YFTKVGLGTPTDEYYVQVDTGSDLL 107
+H R R + + + P+ GL Y +G+GTP + V DTGSDL
Sbjct: 87 RHRVRSIYRRLTAAETTTTTTTIPARLGLAFQSLEYVVTIGIGTPPRNFTVLFDTGSDLT 146
Query: 108 WVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRT--TYNNRYPSCSPGVRC 165
WV C CP S + LFDPSKSST ++ CS C R + S C
Sbjct: 147 WVQCL---PCPDSSCYPQQEPLFDPSKSSTYVDVPCSAPECHIGGVQQTRCGATS----C 199
Query: 166 EYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVD 225
EY V YGD S T G + L+ S AP + V+FGC + + + + T V
Sbjct: 200 EYSVKYGDESETHGSLAEETFTLSPPS---PLAPAATGVVFGC-SHEYISVFNDTGMGVA 255
Query: 226 GILGFGQANSSLLSQ----LAAAGNVRKEFAHCLD--------VVKGGGIFAIGDVVSPK 273
G+LG G+ +SS+LSQ + + G V F++CL + GGG A S
Sbjct: 256 GLLGLGRGDSSILSQTRRSINSGGGV---FSYCLPPRGSSTGYLTIGGGAAAPQQQYS-N 311
Query: 274 VKTTPMVPNMPH----YNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPM 329
+ TP++ + Y V L V V G +D+P S G +IDSGT + ++P
Sbjct: 312 LSFTPLITTISQLRSAYVVNLAGVSVNGAAVDIPASAFSL----GAVIDSGTVVTHMPAA 367
Query: 330 LYDLVLSQFRFWIAS 344
Y + +FR + S
Sbjct: 368 AYYPLRDEFRLHMGS 382
>gi|90399033|emb|CAJ86229.1| H0402C08.5 [Oryza sativa Indica Group]
gi|125550227|gb|EAY96049.1| hypothetical protein OsI_17922 [Oryza sativa Indica Group]
Length = 473
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 86/269 (31%), Positives = 123/269 (45%), Gaps = 31/269 (11%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSG 139
+G YF +VG+G+P + Y+ VD+GSD++WV C C +C ++D LFDP+ SS+
Sbjct: 127 SGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTD-----PLFDPAASSSFS 181
Query: 140 EIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAP 199
++C CRT +C+Y VTYGDGS T G + + L +
Sbjct: 182 GVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLGGTA------- 234
Query: 200 LNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQL-AAAGNVRKEFAHCLDVV 258
V GCG+R SG G+LG G SL+ QL AAG V F++CL
Sbjct: 235 -VQGVAIGCGHRNSGLF-----VGAAGLLGLGWGAMSLIGQLGGAAGGV---FSYCLASR 285
Query: 259 KGGG----IFAIGDVVSPKVKTTPMVPN---MPHYNVILEEVEVGGNPLDLPTSLLGTGD 311
GG + + V P+V N Y V L + VGG L L L +
Sbjct: 286 GAGGAGSLVLGRTEAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDGLFQLTE 345
Query: 312 E--RGTIIDSGTTLAYLPPMLYDLVLSQF 338
+ G ++D+GT + LP Y + F
Sbjct: 346 DGAGGVVMDTGTAVTRLPREAYAALRGAF 374
>gi|326532354|dbj|BAK05106.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 564
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 99/300 (33%), Positives = 141/300 (47%), Gaps = 33/300 (11%)
Query: 51 ALKQHDTRRHGRMMASIDL-ELGGNGHPSAT--GLYFTKVGLGTPTDEYYVQVDTGSDLL 107
AL + D +R R + + E GG P LY+T V +GTP + V +DTGSDL
Sbjct: 108 ALVRSDLQRQKRKHQLLSVSEAGGIFSPGNDFGWLYYTWVDVGTPNTSFMVALDTGSDLF 167
Query: 108 WVNCAGCSRCPT----KSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGV 163
WV C C C + L L ++ P++S+TS + CS C SP
Sbjct: 168 WVPC-DCIECAPLAGYRETLDRDLGIYKPAESTTSRHLPCSHELCPPGSG----CSSPKQ 222
Query: 164 RCEYVVTY-GDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDA 222
C Y Y + +++SG + DI+ L+ + AP+ +SV+ GCG +QS GS D
Sbjct: 223 PCPYSTDYLQENTTSSGLLIEDILHLDSRESH---APVKASVVIGCGRKQS---GSYLDG 276
Query: 223 -AVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSPKVKTTPMVP 281
A DG+LG G A+ S+ S LA AG VR F+ C G F GD ++TP VP
Sbjct: 277 IAPDGLLGLGMADISVPSFLARAGLVRNSFSMCFKEDSGRIFF--GDQGVSIQQSTPFVP 334
Query: 282 ---NMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
Y V +++ VG + T E ++DSGT+ LP +Y V +F
Sbjct: 335 LYGKYQTYAVNVDKSCVGHKCFE------ATSFE--ALVDSGTSFTALPLNVYKAVAVEF 386
>gi|302781726|ref|XP_002972637.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
gi|300160104|gb|EFJ26723.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
Length = 393
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 98/279 (35%), Positives = 125/279 (44%), Gaps = 46/279 (16%)
Query: 76 HPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKS 135
HP G Y + +GTP + DTGSDL+WV C+ C T+FDP +S
Sbjct: 49 HPDGGG-YVMDISVGTPGKRFRAIADTGSDLVWVQSEPCTGCSGG-------TIFDPRQS 100
Query: 136 STSGEIACSDNFCRTTYNNRYP-SCSPGVR-CEYVVTYGDGSSTSGYFVRDIIQLNQASG 193
ST E+ CS C P SC PG C Y YG G T G F RD I L SG
Sbjct: 101 STFREMDCSSQLC-----TELPGSCEPGSSACSYSYEYGSG-ETEGEFARDTISLGTTSG 154
Query: 194 NLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAH 253
+ P S GCG SG G VDG++G GQ SL SQL+AA + +F++
Sbjct: 155 GSQKFP---SFAVGCGMVNSGFDG------VDGLVGLGQGPVSLTSQLSAA--IDSKFSY 203
Query: 254 CLDVVKG---------GGIFAIGDVVSPKVKTTPMVPNMPHYNVI-LEEVEVGGNPLDLP 303
CL + G A+ K TP P Y ++ + + V G + P
Sbjct: 204 CLVDINSQSESSPLLFGPSAALHGTGIQSTKITPPSDTYPTYYLLTVNGIAVAGQTMGSP 263
Query: 304 TSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFRFWI 342
GT TIIDSGTTL Y+P +Y VLS+ +
Sbjct: 264 ----GT-----TIIDSGTTLTYVPSGVYGRVLSRMESMV 293
>gi|359496797|ref|XP_002277380.2| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
vinifera]
Length = 358
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 98/305 (32%), Positives = 141/305 (46%), Gaps = 42/305 (13%)
Query: 45 RERTL-SALKQHDTRRHGRMMASIDLELGGN-------GHPSATGLYFTKVGLGTPTDEY 96
R +TL S L + DTR ++ D+ + G +G Y+ KVG G+P Y
Sbjct: 72 RVKTLNSRLTRKDTRFPKSVLTKKDIRFPKSVSVPLNPGASIGSGNYYVKVGFGSPARYY 131
Query: 97 YVQVDTGSDLLWVNCAGC-SRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRT----T 151
+ VDTGS L W+ C C C ++D LFDPS S T ++C+ + C + T
Sbjct: 132 SMIVDTGSSLSWLQCKPCVVYCHVQAD-----PLFDPSASKTYKSLSCTSSQCSSLVDAT 186
Query: 152 YNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNR 211
NN S V C Y +YGD S + GY +D++ L + +T P ++GCG
Sbjct: 187 LNNPLCETSSNV-CVYTASYGDSSYSMGYLSQDLLTLAPS----QTLP---GFVYGCGQD 238
Query: 212 QSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIG--DV 269
G G + GILG G+ S+L Q+++ F++CL GGG +IG +
Sbjct: 239 SDGLFGRAA-----GILGLGRNKLSMLGQVSS--KFGYAFSYCLPTRGGGGFLSIGKASL 291
Query: 270 VSPKVKTTPMV--PNMPH-YNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYL 326
K TPM P P Y + L + VGG L + + TIIDSGT + L
Sbjct: 292 AGSAYKFTPMTTDPGNPSLYFLRLTAITVGGRALGVAAAQY----RVPTIIDSGTVITRL 347
Query: 327 PPMLY 331
P +Y
Sbjct: 348 PMSVY 352
>gi|225438315|ref|XP_002272802.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 436
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 100/307 (32%), Positives = 141/307 (45%), Gaps = 45/307 (14%)
Query: 46 ERTLSALKQHDTR--RHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTG 103
ER A+K+ R R AS + + H + G + K+ +GTP + Y +DTG
Sbjct: 59 ERLQRAMKRGKLRLQRLSAKTASFESSVEAPVH-AGNGEFLMKLAIGTPAETYSAIMDTG 117
Query: 104 SDLLWVNCAGCSRC---PTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCS 160
SDL+W C C C PT +FDP KSS+ ++ CS + C SCS
Sbjct: 118 SDLIWTQCKPCKDCFDQPTP--------IFDPKKSSSFSKLPCSSDLCAAL---PISSCS 166
Query: 161 PGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSST 220
G CEY+ +YGD SST G + AS S + FGCG G G S
Sbjct: 167 DG--CEYLYSYGDYSSTQGVLATETFAFGDAS--------VSKIGFGCGEDNDGS-GFSQ 215
Query: 221 DAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL---DVVKGGGIFAIGDVVSPK-VKT 276
A G++G G+ SL+SQL +F++CL D KG +G + K T
Sbjct: 216 GA---GLVGLGRGPLSLISQLG-----EPKFSYCLTSMDDSKGISSLLVGSEATMKNAIT 267
Query: 277 TPMV--PNMP-HYNVILEEVEVGGN--PLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLY 331
TP++ P+ P Y + LE + VG P++ T + G IIDSGTT+ YL +
Sbjct: 268 TPLIQNPSQPSFYYLSLEGISVGDTLLPIEKSTFSIQNDGSGGLIIDSGTTITYLEDSAF 327
Query: 332 DLVLSQF 338
+ +F
Sbjct: 328 AALKKEF 334
>gi|225438361|ref|XP_002273988.1| PREDICTED: aspartic proteinase Asp1 [Vitis vinifera]
gi|296082608|emb|CBI21613.3| unnamed protein product [Vitis vinifera]
Length = 426
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 87/281 (30%), Positives = 134/281 (47%), Gaps = 32/281 (11%)
Query: 63 MMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC-AGCSRCPTKS 121
+ +S+ L GN +P G Y+ + +G P Y++ DTGSDL W+ C A C RC TK+
Sbjct: 49 IQSSVVFPLYGNVYP--LGYYYVSLSIGQPPKPYFLDPDTGSDLSWLQCDAPCVRC-TKA 105
Query: 122 DLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYF 181
P + + C D C + + Y C +C+Y V Y DG S+ G
Sbjct: 106 P--------HPLYRPNNNLVICKDPMCASLHPPGY-KCEHPEQCDYEVEYADGGSSLGVL 156
Query: 182 VRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQL 241
V+D+ LN +G L+ AP + GCG Q + + +DG+LG G+ SS++SQL
Sbjct: 157 VKDVFPLNFTNG-LRLAP---RLALGCGYDQ---IPGQSYHPLDGVLGLGKGKSSIVSQL 209
Query: 242 AAAGNVRKEFAHCLDVVKGGGIFAIGDVV--SPKVKTTPMVPNM-PHYNVILEEVEVGGN 298
+ G +R HC+ +GGG GD + S +V TPM+ + HY+ E+ +GG
Sbjct: 210 HSQGVIRNVVGHCVS-SRGGGFLFFGDDLYDSSRVVWTPMLRDQHTHYSSGYAELILGGK 268
Query: 299 PLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
L+ DSG++ YL + Y ++ R
Sbjct: 269 TTVFKNLLV--------TFDSGSSYTYLNSLAYQALVHLVR 301
>gi|357159746|ref|XP_003578546.1| PREDICTED: aspartic proteinase-like protein 1-like [Brachypodium
distachyon]
Length = 530
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 91/264 (34%), Positives = 126/264 (47%), Gaps = 27/264 (10%)
Query: 82 LYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRC-PTKS-DLG-IKLTLFDPSKSSTS 138
L++ V LGTP + V +DTGSDL WV C C +C P S D G +K ++ P KSSTS
Sbjct: 98 LHYAVVALGTPNVTFLVALDTGSDLFWVPC-DCIKCAPLASPDYGDLKFDMYSPRKSSTS 156
Query: 139 GEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTY-GDGSSTSGYFVRDIIQLNQASGNLKT 197
++ CS + C + S S C Y + Y + +S+ G V D++ L SG K
Sbjct: 157 RKVPCSSSLCDPQADCSAASNS----CPYSIQYLSENTSSKGVLVEDVLYLTTESGQSKI 212
Query: 198 APLNSSVIFGCGNRQSGD-LGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLD 256
+ + FGCG QSG LGS AA +G+LG G + S+ S LA+ G F+ C
Sbjct: 213 T--QAPITFGCGQVQSGSFLGS---AAPNGLLGLGMDSKSVPSLLASKGIAANSFSMCFG 267
Query: 257 VVKGGGIFAIGDVVSPKVKTTPM--VPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERG 314
G G GD S TP+ P+YN+ + VGG D S
Sbjct: 268 -EDGHGRINFGDTGSSDQLETPLNIYKQNPYYNISITGAMVGGKSFDTKFS--------- 317
Query: 315 TIIDSGTTLAYLPPMLYDLVLSQF 338
++DSGT+ L +Y + S F
Sbjct: 318 AVVDSGTSFTALSDPMYTEITSTF 341
>gi|449499012|ref|XP_004160696.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 435
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 91/304 (29%), Positives = 137/304 (45%), Gaps = 47/304 (15%)
Query: 46 ERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSD 105
R L++ + G + +++ + N G Y K+ +GTP DTGSD
Sbjct: 53 HRVADTLRRSISHNTGLVTNTVEAPIYNN-----RGEYLMKLSVGTPPFPIIAVADTGSD 107
Query: 106 LLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRC 165
++W C C+ C + L +F+PSKS+T +++CS C T + SCS C
Sbjct: 108 IIWTQCVPCTNCYQQ-----DLPMFNPSKSTTYRKVSCSSPVCSFTGEDN--SCSFKPDC 160
Query: 166 EYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVD 225
Y ++YGD S + G F D + + SG + P + GCG+ +G S DA V
Sbjct: 161 TYSISYGDNSHSQGDFAVDTLTMGSTSGRVVAFPRTA---IGCGHDNAG----SFDANVS 213
Query: 226 GILGFGQANSSLLSQLAAAGNVRKEFAHCLDV------------------VKGGGIFAIG 267
GI+G G +SL+ Q+ +A V +F++CL V G G +
Sbjct: 214 GIVGLGLGPASLIKQMGSA--VGGKFSYCLTPIGNDDGGSNKLNFGSNANVSGSGAVSTP 271
Query: 268 DVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLP 327
+S K K+ Y++ L+ V VG N T+ G + IIDSGTTL LP
Sbjct: 272 IYISDKFKS--------FYSLKLKAVSVGRNNTFYSTANSILGGKANIIIDSGTTLTLLP 323
Query: 328 PMLY 331
LY
Sbjct: 324 VDLY 327
>gi|118486628|gb|ABK95151.1| unknown [Populus trichocarpa]
Length = 393
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 88/280 (31%), Positives = 127/280 (45%), Gaps = 28/280 (10%)
Query: 62 RMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC-AGCSRCPTK 120
R+ +SI L L GN +P+ G Y + +G P+ Y++ VDTGSDL W+ C A C +C
Sbjct: 15 RVPSSIVLPLHGNVYPN--GYYNVTLNIGQPSKPYFLDVDTGSDLTWLQCDAPCVQCTEA 72
Query: 121 SDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGY 180
P + + C D C++ ++N C +C+Y V Y DG S+ G
Sbjct: 73 PH---------PYYRPRNNLVPCMDPICQSLHSNGDHRCENPGQCDYEVEYADGGSSFGV 123
Query: 181 FVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQ 240
V D LN S + +PL + GCG Q + +DG+LG G+ SS++SQ
Sbjct: 124 LVTDTFNLNFTSEK-RHSPL---LALGCGYDQ---FPGGSHHPIDGVLGLGKGKSSIVSQ 176
Query: 241 LAAAGNVRKEFAHCLD-VVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNP 299
L++ G VR HCL G F S +V TPM P+ HY+ L E+ G
Sbjct: 177 LSSLGLVRNVIGHCLSGHGGGFLFFGDDLYDSSRVAWTPMSPDAKHYSPGLAELTFDGKT 236
Query: 300 LDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
L T DSG + YL Y ++S +
Sbjct: 237 TGFKNLL--------TTFDSGASYTYLNSQAYQGLISLLK 268
>gi|242044886|ref|XP_002460314.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
gi|241923691|gb|EER96835.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
Length = 444
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 91/281 (32%), Positives = 129/281 (45%), Gaps = 43/281 (15%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRC---PTKSDLGIKLTLFDPSKSST 137
G Y ++G+GTP Y +DTGSDL+W CA C C PT FDP+ SST
Sbjct: 90 GEYLMEMGIGTPARFYSAILDTGSDLIWTQCAPCLLCVDQPTP--------YFDPANSST 141
Query: 138 SGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKT 197
+ CS C Y YP C C Y YGD +ST+G + G T
Sbjct: 142 YRSLGCSAPACNALY---YPLCYQKT-CVYQYFYGDSASTAGVLANETFTF----GTNDT 193
Query: 198 APLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL-- 255
+ FGCGN +G L + + G++GFG+ + SL+SQL + F++CL
Sbjct: 194 RVTLPRISFGCGNLNAGSLANGS-----GMVGFGRGSLSLVSQLGS-----PRFSYCLTS 243
Query: 256 ------DVVKGGGIFAIGDVVSPKVKTTPMV--PNMP-HYNVILEEVEVGGNPLDLPTSL 306
+ G + + V++TP + P +P Y + + + VGGN L + ++
Sbjct: 244 FLSPVRSRLYFGAYATLNSTNASTVQSTPFIINPALPTMYFLNMTGISVGGNRLPIDPAV 303
Query: 307 LGTGDER---GTIIDSGTTLAYLPPMLYDLVLSQFRFWIAS 344
L D GTIIDSGTT+ YL Y V F ++ S
Sbjct: 304 LAINDTDGTGGTIIDSGTTITYLAEPAYYAVREAFVLYLNS 344
>gi|357487631|ref|XP_003614103.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355515438|gb|AES97061.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 431
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 84/260 (32%), Positives = 125/260 (48%), Gaps = 28/260 (10%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGE 140
G Y +GTP Y VDTGSD++W+ C C +C ++ +F+PSKSS+
Sbjct: 85 GEYLMTYSVGTPPFNVYGVVDTGSDIVWLQCKPCEQCYKQTT-----PIFNPSKSSSYKN 139
Query: 141 IACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPL 200
I CS N C++ RY SC+ CEY + + D S + G + + L+ +G+ + P
Sbjct: 140 IPCSSNLCQSV---RYTSCNKQNSCEYTINFSDQSYSQGELSVETLTLDSTTGHSVSFP- 195
Query: 201 NSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL----- 255
+ GCG+ G T GI+G G SL +QL ++ + +F++CL
Sbjct: 196 --KTVIGCGHNNRGMFQGET----SGIVGLGIGPVSLTTQLKSS--IGGKFSYCLLPLLV 247
Query: 256 DVVKGGGI-FAIGDVVSPK-VKTTPMVPNMPH--YNVILEEVEVGGNPLDLPTSLLGTGD 311
D K + F VVS V +TP V P Y + LE VG ++ +L +
Sbjct: 248 DSNKTSKLNFGDAAVVSGDGVVSTPFVKKDPQAFYYLTLEAFSVGNKRIEF--EVLDDSE 305
Query: 312 ERGTIIDSGTTLAYLPPMLY 331
E I+DSGTTL LP +Y
Sbjct: 306 EGNIILDSGTTLTLLPSHVY 325
>gi|302757235|ref|XP_002962041.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
gi|300170700|gb|EFJ37301.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
Length = 367
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 98/286 (34%), Positives = 136/286 (47%), Gaps = 47/286 (16%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSG 139
+G Y ++ LG+P ++ VDTGSDL+W+ C CS+C ++SD ++DPS SST
Sbjct: 1 SGAYTMEIELGSPPKKFNAIVDTGSDLVWIQCKPCSQCYSQSD-----PIYDPSASSTFA 55
Query: 140 EIACSDNFCRTTYNNRYPS--CSPGVR-CEYVVTYGDGSSTSGYFVRDIIQLNQASGNLK 196
+ +CS T+ P+ CS + C Y YGD SST G F + + L + G+ K
Sbjct: 56 KTSCS-----TSSCQSLPASGCSSSAKTCIYGYQYGDSSSTQGDFALETLTLRSSGGSSK 110
Query: 197 TAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL- 255
P + FGCG SG G + GI+G GQ SL +QL +A + +F++CL
Sbjct: 111 AFP---NFQFGCGRLNSGSFGGAA-----GIVGLGQGKISLSTQLGSA--INNKFSYCLV 160
Query: 256 ----DVVKGGG-IFAIGDVVSPKVKTTPMVPN---MPHYNVILEEVEVGGNPLDLPTSLL 307
D K IF +TP++PN +Y V LE + VGG L L T +
Sbjct: 161 DFDDDSSKTSPLIFGSSASTGSGAISTPIIPNSGRSTYYFVGLEGISVGGKQLSLATRAI 220
Query: 308 GTGDER---------------GTIIDSGTTLAYLPPMLYDLVLSQF 338
R GTI DSGTTL L +Y V S F
Sbjct: 221 DFLSVRSKKKLRVRALEVNSGGTIFDSGTTLTLLDDAVYSKVKSAF 266
>gi|449454652|ref|XP_004145068.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449470630|ref|XP_004153019.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 435
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 91/304 (29%), Positives = 137/304 (45%), Gaps = 47/304 (15%)
Query: 46 ERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSD 105
R L++ + G + +++ + N G Y K+ +GTP DTGSD
Sbjct: 53 HRVADTLRRSISHNTGLVTNTVEAPIYNN-----RGEYLMKLSVGTPPFPIIAVADTGSD 107
Query: 106 LLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRC 165
++W C C+ C + L +F+PSKS+T +++CS C T + SCS C
Sbjct: 108 IIWTQCEPCTNCYQQ-----DLPMFNPSKSTTYRKVSCSSPVCSFTGEDN--SCSFKPDC 160
Query: 166 EYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVD 225
Y ++YGD S + G F D + + SG + P + GCG+ +G S DA V
Sbjct: 161 TYSISYGDNSHSQGDFAVDTLTMGSTSGRVVAFPRTA---IGCGHDNAG----SFDANVS 213
Query: 226 GILGFGQANSSLLSQLAAAGNVRKEFAHCLDV------------------VKGGGIFAIG 267
GI+G G +SL+ Q+ +A V +F++CL V G G +
Sbjct: 214 GIVGLGLGPASLIKQMGSA--VGGKFSYCLTPIGNDDGGSNKLNFGSNANVSGSGAVSTP 271
Query: 268 DVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLP 327
+S K K+ Y++ L+ V VG N T+ G + IIDSGTTL LP
Sbjct: 272 IYISDKFKS--------FYSLKLKAVSVGRNNTFYSTANSILGGKANIIIDSGTTLTLLP 323
Query: 328 PMLY 331
LY
Sbjct: 324 VDLY 327
>gi|242062640|ref|XP_002452609.1| hypothetical protein SORBIDRAFT_04g028990 [Sorghum bicolor]
gi|241932440|gb|EES05585.1| hypothetical protein SORBIDRAFT_04g028990 [Sorghum bicolor]
Length = 557
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 86/277 (31%), Positives = 129/277 (46%), Gaps = 24/277 (8%)
Query: 69 LELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC-AGCSRCPTKSDLGIKL 127
L + GN P G Y+T + +G P Y++ VDTGSDL W+ C A C+ C
Sbjct: 175 LPIKGNVFPD--GQYYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPH----- 227
Query: 128 TLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQ 187
L+ P+K + D C+ N+ C +C+Y + Y D SS+ G RD +
Sbjct: 228 PLYKPTKEKI---VPPRDLLCQELQGNQN-YCETCKQCDYEIEYADQSSSMGVLARDDMH 283
Query: 188 LNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNV 247
L +G + +FGC Q G L SS A DGILG A SL SQLA+ G +
Sbjct: 284 LIATNGGREKL----DFVFGCAYDQQGQLLSSP-AKTDGILGLSNAAISLPSQLASHGII 338
Query: 248 RKEFAHCLDVVKGGGIFA-IGDVVSPKVKTT-PMVPNMPH--YNVILEEVEVGGNPLDLP 303
F HC+ +GGG + +GD P+ T + + P Y+ V+ G L +
Sbjct: 339 SNIFGHCITREQGGGGYMFLGDDYVPRWGITWTSIRSGPDNLYHTEAHHVKYGDQQLRMR 398
Query: 304 TSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFRF 340
G+ I DSG++ YLP +Y+ +++ ++
Sbjct: 399 EQ---AGNTVQVIFDSGSSYTYLPDEIYENLVAAIKY 432
>gi|357476337|ref|XP_003608454.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355509509|gb|AES90651.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 683
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 90/282 (31%), Positives = 137/282 (48%), Gaps = 47/282 (16%)
Query: 56 DTRRH--GRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAG 113
+++RH RM DL L G Y T++ +GTP + + VDTGS + +V C+
Sbjct: 60 ESKRHPNARMRLHDDLLLNG--------YYTTRLWIGTPPQMFALIVDTGSTVTYVPCST 111
Query: 114 CSRCPTKSDLGIKLTLFDPSKSSTSGEIACS-DNFCRTTYNNRYPSCSPGVRCEYVVTYG 172
C +C D F P SST + C+ D C N+R ++C Y Y
Sbjct: 112 CEQCGRHQD-----PKFQPDLSSTYQPVKCTLDCNCD---NDR-------MQCVYERQYA 156
Query: 173 DGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQ 232
+ S++SG D++ S + AP +FGC N ++GDL S DGI+G G+
Sbjct: 157 EMSTSSGVLGEDVVSFGNQS---ELAP--QRAVFGCENVETGDLYSQ---HADGIMGLGR 208
Query: 233 ANSSLLSQLAAAGNVRKEFAHC---LDVVKGGGIFAIGDVVSPK----VKTTPMVPNMPH 285
+ S++ QL V F+ C +DV GGG +G + P ++ P+ P+
Sbjct: 209 GDLSIMDQLVDKNVVSDSFSLCYGGMDV--GGGAMVLGGISPPSDMVFAQSDPV--RSPY 264
Query: 286 YNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLP 327
YN+ L+E+ V G L L S+ + G+++DSGTT AYLP
Sbjct: 265 YNIDLKEIHVAGKRLPLNPSVF--DGKHGSVLDSGTTYAYLP 304
>gi|357168101|ref|XP_003581483.1| PREDICTED: aspartic proteinase-like protein 1-like [Brachypodium
distachyon]
Length = 510
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 83/262 (31%), Positives = 122/262 (46%), Gaps = 24/262 (9%)
Query: 82 LYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRC-PTKSDLGIKLTLFDPSKSSTSGE 140
L++ V +GTP + V +DTGSDL W+ C C C P S + + PS SSTS
Sbjct: 101 LHYALVTVGTPGHTFMVALDTGSDLFWLPCQ-CDGCPPPASGASGSASFYIPSMSSTSQA 159
Query: 141 IACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDG-SSTSGYFVRDIIQLNQASGNLKTAP 199
+ C+ +FC + CS C Y + Y +S+SG+ V D++ L+ + +
Sbjct: 160 VPCNSDFC-----DHRKDCSTTSSCPYKMVYVSADTSSSGFLVEDVLYLSTEDNHPQI-- 212
Query: 200 LNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVK 259
L + ++FGCG Q+G + AA +G+ G G S+ S LA G F+ C
Sbjct: 213 LKAQIMFGCGQVQTGSFLDA--AAPNGLFGLGIDMISVPSILAHKGLTSDSFSMCFG-RD 269
Query: 260 GGGIFAIGDVVSPKVKTTPMVPNMPH--YNVILEEVEVGGNPLDLPTSLLGTGDERGTII 317
G G + GD S + TP+ N H Y + + + VG P+DL E TI
Sbjct: 270 GIGRISFGDQGSSDQEETPLDINQKHPTYAITITGITVGTEPMDL---------EFSTIF 320
Query: 318 DSGTTLAYLPPMLYDLVLSQFR 339
D+GTT YL Y + F
Sbjct: 321 DTGTTFTYLADPAYTYITQSFH 342
>gi|357457681|ref|XP_003599121.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355488169|gb|AES69372.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 439
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 83/269 (30%), Positives = 131/269 (48%), Gaps = 34/269 (12%)
Query: 77 PSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSS 136
P A Y +GTP + Y VDTGSD +W C C C ++ +F+PSKSS
Sbjct: 84 PYAGSYYVMSYSIGTPPFQLYGVVDTGSDGIWFQCKPCKPCLNQTS-----PIFNPSKSS 138
Query: 137 TSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLK 196
T I CS C+ R S + +CEY +TY D S + G +D + LN G+
Sbjct: 139 TYKNIRCSSPICKRGEKTRC-SSNRKRKCEYEITYLDRSGSQGDISKDTLTLNSNDGSPI 197
Query: 197 TAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLD 256
+ P ++ GCG++ S +T+ GI+GFG+ N S++SQL ++ + +F++CL
Sbjct: 198 SFP---KIVIGCGHKNS----LTTEGLASGIIGFGRGNFSIVSQLGSS--IGGKFSYCL- 247
Query: 257 VVKGGGIFAIGDVVSPK------------VKTTPMVPNMP--HYNVILEEVEVGGNPLDL 302
+F+ ++ S V +TP++ + +Y LE VG + + L
Sbjct: 248 ----ASLFSKANISSKLYFGDMAVVSGHGVVSTPLIQSFYVGNYFTNLEAFSVGDHIIKL 303
Query: 303 PTSLLGTGDERGTIIDSGTTLAYLPPMLY 331
S L +E +IDSG+T+ LP +Y
Sbjct: 304 KDSSLIPDNEGNAVIDSGSTITQLPNDVY 332
>gi|224074591|ref|XP_002304395.1| predicted protein [Populus trichocarpa]
gi|222841827|gb|EEE79374.1| predicted protein [Populus trichocarpa]
Length = 443
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 96/292 (32%), Positives = 138/292 (47%), Gaps = 26/292 (8%)
Query: 46 ERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSD 105
+R +A + +R + ++D+ N G YF K+ +GTP E V DTGSD
Sbjct: 57 DRLRNAFSRSISRVNVFKTKAVDINSFQNDLVPNGGEYFMKMSIGTPLVEVIVIADTGSD 116
Query: 106 LLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRC 165
L WV C C C + K LFDPS+SS+ + C FC + C
Sbjct: 117 LTWVQCLPCDPCYRQ-----KSPLFDPSRSSSYRHMLCGSRFCNALDVSEQACTMDTNIC 171
Query: 166 EYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLN-SSVIFGCGNRQSGDLGSSTDAAV 224
EY +YGD S T+G + + G+ + P++ S ++FGCG G D
Sbjct: 172 EYHYSYGDKSYTNGNLATEKFTI----GSTSSRPVHLSPIVFGCGTGNGGTF----DELG 223
Query: 225 DGILGFGQANSSLLSQLAAAGNVRKEFAHCL------DVVKGGGIFAIGDVVS-PKVKTT 277
GI+G G SL+SQL++ ++ +F++CL V F V+S P+V +T
Sbjct: 224 SGIVGLGGGALSLVSQLSSI--IKGKFSYCLVPLSEQSNVTSKIKFGTDSVISGPQVVST 281
Query: 278 PMVPNMP--HYNVILEEVEVGGNPLDLPTSLLGTGDERG-TIIDSGTTLAYL 326
P+V P +Y V LE + VG L LL E+G IIDSGTTL +L
Sbjct: 282 PLVSKQPDTYYYVTLEAISVGNKRLPYTNGLLNGNVEKGNVIIDSGTTLTFL 333
>gi|357482719|ref|XP_003611646.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355512981|gb|AES94604.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 640
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 91/285 (31%), Positives = 135/285 (47%), Gaps = 45/285 (15%)
Query: 52 LKQHDTRRH--GRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWV 109
L++ +++RH RM DL + G Y T++ +GTP + + VDTGS + +V
Sbjct: 64 LQRSESKRHPNARMRLYDDLLING--------YYTTRLWIGTPPQRFALIVDTGSTVTYV 115
Query: 110 NCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACS-DNFCRTTYNNRYPSCSPGVRCEYV 168
C+ C C D F P S T + C+ D C N +C Y
Sbjct: 116 PCSTCEHCGRHQD-----PKFQPDLSETYQPVKCTPDCNCDGDTN----------QCMYD 160
Query: 169 VTYGDGSSTSGYFVRDIIQLNQASGNL-KTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGI 227
Y + SS+SG D++ GNL + AP +FGC N ++GDL S DGI
Sbjct: 161 RQYAEMSSSSGVLGEDVVSF----GNLSELAP--QRAVFGCENDETGDLYSQR---ADGI 211
Query: 228 LGFGQANSSLLSQLAAAGNVRKEFAHC---LDVVKGGGIFAIGDVVSPK--VKTTPMVPN 282
+G G+ + S++ QL + F+ C +DV GGG +G + P+ V T
Sbjct: 212 MGLGRGDLSIMDQLVDKKVISDSFSLCYGGMDV--GGGAMILGGISPPEDMVFTHSDPDR 269
Query: 283 MPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLP 327
P+YN+ L+E+ V G L L + + GT++DSGTT AYLP
Sbjct: 270 SPYYNINLKEMHVAGKKLQLNPKVF--DGKHGTVLDSGTTYAYLP 312
>gi|195647908|gb|ACG43422.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
gi|414587776|tpg|DAA38347.1| TPA: aspartic-type endopeptidase/ pepsin A [Zea mays]
Length = 498
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 80/262 (30%), Positives = 123/262 (46%), Gaps = 24/262 (9%)
Query: 82 LYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRC-PTKSDLGIKLTLFDPSKSSTSGE 140
L++ V +GTP + V +DTGSDL W+ C C C P + T + P SSTS
Sbjct: 108 LHYALVTVGTPGQTFMVALDTGSDLFWLPCQ-CDGCTPPATAASGSATFYIPGMSSTSKA 166
Query: 141 IACSDNFCRTTYNNRYPSCSPGVRCEYVVTY-GDGSSTSGYFVRDIIQLNQASGNLKTAP 199
+ C+ NFC + CS ++C Y + Y G+S+SG+ V D++ L+ + + +
Sbjct: 167 VPCNSNFC-----DLQKECSTALQCPYKMVYVSAGTSSSGFLVEDVLYLSTENAHPQI-- 219
Query: 200 LNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVK 259
L + ++ GCG Q+G + AA +G+ G G S+ S LA G F+ C
Sbjct: 220 LKAQIMLGCGQTQTGSFLDA--AAPNGLFGLGIDEVSVPSILAQKGLTSNSFSMCFG-RD 276
Query: 260 GGGIFAIGDVVSPKVKTTPMVPNMPH--YNVILEEVEVGGNPLDLPTSLLGTGDERGTII 317
G G + GD S + TP+ N H Y + + + VG P D+ + TI
Sbjct: 277 GIGRISFGDQESSDQEETPLDINRQHPTYAITISGITVGNKPTDM---------DFITIF 327
Query: 318 DSGTTLAYLPPMLYDLVLSQFR 339
D+GT+ YL Y + F
Sbjct: 328 DTGTSFTYLADPAYTYITQSFH 349
>gi|409179880|gb|AFV26025.1| aspartic proteinase nepenthesin 2 [Nepenthes mirabilis]
Length = 437
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 92/318 (28%), Positives = 137/318 (43%), Gaps = 50/318 (15%)
Query: 35 VENKFKAGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTD 94
++ K G R R+++A+ Q + + A +G Y V +GTP
Sbjct: 61 IKRAIKRGERRMRSINAMLQSSSGIETPVYA-------------GSGEYLMNVAIGTPAS 107
Query: 95 EYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNN 154
+DTGSDL+W C C++C ++ +F+P SS+ + C +C+
Sbjct: 108 SLSAIMDTGSDLIWTQCEPCTQCFSQ-----PTPIFNPQDSSSFSTLPCESQYCQ----- 157
Query: 155 RYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSG 214
PS S C+Y YGDGSST GY + +S P ++ FGCG G
Sbjct: 158 DLPSESCYNDCQYTYGYGDGSSTQGYMATETFTFETSS-----VP---NIAFGCGEDNQG 209
Query: 215 DLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLD--VVKGGGIFAIGDVVSP 272
G A G++G G SL SQL +F++C+ A+G S
Sbjct: 210 -FGQGNGA---GLIGMGWGPLSLPSQLGVG-----QFSYCMTSSGSSSPSTLALGSAASG 260
Query: 273 KVKTTPMVP------NMPHYNVILEEVEVGGNPLDLPTSLLGTGDE--RGTIIDSGTTLA 324
+ +P N +Y + L+ + VGG+ L +P+S D+ G IIDSGTTL
Sbjct: 261 VPEGSPSTTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLT 320
Query: 325 YLPPMLYDLVLSQFRFWI 342
YLP Y+ V F I
Sbjct: 321 YLPQDAYNAVAQAFTDQI 338
>gi|15238055|ref|NP_196570.1| aspartic proteinase-like protein 1 [Arabidopsis thaliana]
gi|75180764|sp|Q9LX20.1|ASPL1_ARATH RecName: Full=Aspartic proteinase-like protein 1; Flags: Precursor
gi|7960727|emb|CAB92049.1| putative protein [Arabidopsis thaliana]
gi|332004108|gb|AED91491.1| aspartic proteinase-like protein 1 [Arabidopsis thaliana]
Length = 528
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 90/266 (33%), Positives = 132/266 (49%), Gaps = 28/266 (10%)
Query: 82 LYFTKVGLGTPTDEYYVQVDTGSDLLWV--NCAGCSRCPTK--SDLGIK-LTLFDPSKSS 136
L++T + +GTP+ + V +DTGS+LLW+ NC C+ + S L K L ++PS SS
Sbjct: 99 LHYTWIDIGTPSVSFLVALDTGSNLLWIPCNCVQCAPLTSTYYSSLATKDLNEYNPSSSS 158
Query: 137 TSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDG-SSTSGYFVRDIIQLNQASGNL 195
TS CS C + + SP +C Y V Y G +S+SG V DI+ L + N
Sbjct: 159 TSKVFLCSHKLCDSASDCE----SPKEQCPYTVNYLSGNTSSSGLLVEDILHLTYNTNNR 214
Query: 196 K---TAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFA 252
++ + + V+ GCG +QSGD A DG++G G A S+ S L+ AG +R F+
Sbjct: 215 LMNGSSSVKARVVIGCGKKQSGDYLDG--VAPDGLMGLGPAEISVPSFLSKAGLMRNSFS 272
Query: 253 HCLDVVKGGGIFAIGDVVSPKVKTTPMVP----NMPHYNVILEEVEVGGNPLDLPTSLLG 308
C D G I+ GD+ ++TP + Y V +E +G + L TS
Sbjct: 273 LCFDEEDSGRIY-FGDMGPSIQQSTPFLQLDNNKYSGYIVGVEACCIGNSCLK-QTSFT- 329
Query: 309 TGDERGTIIDSGTTLAYLPPMLYDLV 334
T IDSG + YLP +Y V
Sbjct: 330 ------TFIDSGQSFTYLPEEIYRKV 349
>gi|194700652|gb|ACF84410.1| unknown [Zea mays]
gi|414587775|tpg|DAA38346.1| TPA: hypothetical protein ZEAMMB73_272638 [Zea mays]
Length = 500
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 80/262 (30%), Positives = 123/262 (46%), Gaps = 24/262 (9%)
Query: 82 LYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRC-PTKSDLGIKLTLFDPSKSSTSGE 140
L++ V +GTP + V +DTGSDL W+ C C C P + T + P SSTS
Sbjct: 108 LHYALVTVGTPGQTFMVALDTGSDLFWLPCQ-CDGCTPPATAASGSATFYIPGMSSTSKA 166
Query: 141 IACSDNFCRTTYNNRYPSCSPGVRCEYVVTY-GDGSSTSGYFVRDIIQLNQASGNLKTAP 199
+ C+ NFC + CS ++C Y + Y G+S+SG+ V D++ L+ + + +
Sbjct: 167 VPCNSNFC-----DLQKECSTALQCPYKMVYVSAGTSSSGFLVEDVLYLSTENAHPQI-- 219
Query: 200 LNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVK 259
L + ++ GCG Q+G + AA +G+ G G S+ S LA G F+ C
Sbjct: 220 LKAQIMLGCGQTQTGSFLDA--AAPNGLFGLGIDEVSVPSILAQKGLTSNSFSMCFG-RD 276
Query: 260 GGGIFAIGDVVSPKVKTTPMVPNMPH--YNVILEEVEVGGNPLDLPTSLLGTGDERGTII 317
G G + GD S + TP+ N H Y + + + VG P D+ + TI
Sbjct: 277 GIGRISFGDQESSDQEETPLDINRQHPTYAITISGITVGNKPTDM---------DFITIF 327
Query: 318 DSGTTLAYLPPMLYDLVLSQFR 339
D+GT+ YL Y + F
Sbjct: 328 DTGTSFTYLADPAYTYITQSFH 349
>gi|242072510|ref|XP_002446191.1| hypothetical protein SORBIDRAFT_06g003200 [Sorghum bicolor]
gi|241937374|gb|EES10519.1| hypothetical protein SORBIDRAFT_06g003200 [Sorghum bicolor]
Length = 499
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 81/262 (30%), Positives = 122/262 (46%), Gaps = 24/262 (9%)
Query: 82 LYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRC-PTKSDLGIKLTLFDPSKSSTSGE 140
L++ V +GTP + V +DTGSDL W+ C C C P + T + P SSTS
Sbjct: 107 LHYALVTVGTPGQTFMVALDTGSDLFWLPCQ-CDGCTPPATAASGSATFYIPGMSSTSKA 165
Query: 141 IACSDNFCRTTYNNRYPSCSPGVRCEYVVTY-GDGSSTSGYFVRDIIQLNQASGNLKTAP 199
+ C+ NFC + CS ++C Y + Y G+S+SG+ V D++ L ++ N
Sbjct: 166 VPCNSNFC-----DLQKECSTALQCPYKMVYVSAGTSSSGFLVEDVLYL--STENAHPQI 218
Query: 200 LNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVK 259
L + ++ GCG Q+G + AA +G+ G G S+ S LA G F+ C
Sbjct: 219 LKAQIMLGCGQTQTGSFLDA--AAPNGLFGLGIDEVSVPSILAQKGLTSNSFSMCFG-RD 275
Query: 260 GGGIFAIGDVVSPKVKTTPMVPNMPH--YNVILEEVEVGGNPLDLPTSLLGTGDERGTII 317
G G + GD S + TP+ N H Y + + + +G P DL + TI
Sbjct: 276 GIGRISFGDQGSSDQEETPLNINQQHPTYAITISGITIGNKPTDL---------DFITIF 326
Query: 318 DSGTTLAYLPPMLYDLVLSQFR 339
D+GT+ YL Y + F
Sbjct: 327 DTGTSFTYLADPAYTYITQSFH 348
>gi|125555058|gb|EAZ00664.1| hypothetical protein OsI_22685 [Oryza sativa Indica Group]
Length = 465
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 93/273 (34%), Positives = 128/273 (46%), Gaps = 39/273 (14%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGC--SRCPTKSDLGIKLTLFDPSKSSTSGE 140
Y +G+GTP + V +DTGSDL WV C C C + D LFDPS SS+
Sbjct: 118 YVVTLGIGTPAVQQIVLIDTGSDLSWVQCKPCGAGECYAQKD-----PLFDPSSSSSYAS 172
Query: 141 IACSDNFCRTTYNNRY-PSCSPGVR--CEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKT 197
+ C + CR Y C+ G CEY + YG+ ++T+G + + + LK
Sbjct: 173 VPCDSDACRKLAAGAYGHGCTSGAAALCEYGIEYGNRATTTGVYSTETL-------TLKP 225
Query: 198 APLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDV 257
+ + FGCG+ Q G DG+LG G A SL+SQ ++ F++CL
Sbjct: 226 GVVVADFGFGCGDHQHGPY-----EKFDGLLGLGGAPESLVSQTSS--QFGGPFSYCLPP 278
Query: 258 VKGG-GIFAIGDVVSPKVKT-------TPM--VPNMP-HYNVILEEVEVGGNPLDLPTSL 306
GG G A+G S T TPM +P++P Y V L + VGG PL +P S
Sbjct: 279 TSGGAGFLALGAPNSSSSSTAAAGFLFTPMRRIPSVPTFYVVTLTGISVGGAPLAVPPSA 338
Query: 307 LGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
G +IDSGT + LP Y + S FR
Sbjct: 339 F----SSGMVIDSGTVITGLPATAYAALRSAFR 367
>gi|42565826|ref|NP_190703.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332645261|gb|AEE78782.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 528
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 94/312 (30%), Positives = 139/312 (44%), Gaps = 37/312 (11%)
Query: 20 WAVGGGGVMGNFVFEVENKFKAGGERERTL-------------SALKQHDTRRHGRMMAS 66
W G F FEV + F ++ L L D GR +AS
Sbjct: 18 WGFERCEATGKFGFEVHHIFSDSVKQSLGLGDLVPEQGSLEYFKVLAHRDRLIRGRGLAS 77
Query: 67 IDLEL-----GGNGHPSAT---GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCP 118
+ E GGN S LY+ V +GTP + V +DTGSDL W+ C + C
Sbjct: 78 NNDETPITFDGGNLTVSVKLLGSLYYANVSVGTPPSSFLVALDTGSDLFWLPCNCGTTCI 137
Query: 119 TK-SDLG----IKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGD 173
D+G + L L+ P+ S+TS I CSD C + ++ S SP C Y ++Y +
Sbjct: 138 RDLEDIGVPQSVPLNLYTPNASTTSSSIRCSDKRC---FGSKKCS-SPSSICPYQISYSN 193
Query: 174 GSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQA 233
+ T G ++D++ L NL P+ ++V GCG +Q+G + +V+G+LG G
Sbjct: 194 STGTKGTLLQDVLHLATEDENL--TPVKANVTLGCGQKQTGLF--QRNNSVNGVLGLGIK 249
Query: 234 NSSLLSQLAAAGNVRKEFAHCLDVVKGG-GIFAIGDVVSPKVKTTPMVPNMPH--YNVIL 290
S+ S LA A F+ C V G G + GD + TP + P Y V +
Sbjct: 250 GYSVPSLLAKANITANSFSMCFGRVIGNVGRISFGDRGYTDQEETPFISVAPSTAYGVNI 309
Query: 291 EEVEVGGNPLDL 302
V V G+P+D+
Sbjct: 310 SGVSVAGDPVDI 321
>gi|297802338|ref|XP_002869053.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297314889|gb|EFH45312.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 522
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 85/265 (32%), Positives = 128/265 (48%), Gaps = 26/265 (9%)
Query: 82 LYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRC-PTKSDL---GIKLTLFDPSKSST 137
L++T V LGTP + V +DTGSDL WV C C +C PT+ +L++++P S+T
Sbjct: 104 LHYTTVKLGTPGMRFMVALDTGSDLFWVPC-DCGKCAPTEGATYASEFELSIYNPKISTT 162
Query: 138 SGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDG-SSTSGYFVRDIIQLNQASGNLK 196
+ ++ C+++ C R C Y+V+Y +STSG + D++ L N +
Sbjct: 163 NKKVTCNNSLCA----QRNQCLGTFSTCPYMVSYVSAQTSTSGILMEDVMHLTTEDKNPE 218
Query: 197 TAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLD 256
+ + V FGCG QSG AA +G+ G G S+ S LA G V F+ C
Sbjct: 219 R--VEAYVTFGCGQVQSGSFLDI--AAPNGLFGLGMEKISVPSVLAREGLVADSFSMCFG 274
Query: 257 VVKGGGIFAIGDVVSPKVKTTP--MVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERG 314
G G + GD S + TP + P+ P+YN+ + V VG +D DE
Sbjct: 275 -HDGVGRISFGDKGSSDQEETPFNLNPSHPNYNITVTRVRVGTTLID---------DEFT 324
Query: 315 TIIDSGTTLAYLPPMLYDLVLSQFR 339
+ D+GT+ YL +Y V F
Sbjct: 325 ALFDTGTSFTYLVDPMYTTVSESFH 349
>gi|242072067|ref|XP_002451310.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
gi|241937153|gb|EES10298.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
Length = 509
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 97/310 (31%), Positives = 137/310 (44%), Gaps = 39/310 (12%)
Query: 39 FKAGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYV 98
F G + ++ +Q DT + EL P+ATG ++ P +
Sbjct: 133 FSMGDDGTGGMAKAQQQDTHHQ------VVEELSSAADPAATG--GSRRSRLRPGVRQLM 184
Query: 99 QVDTGSDLLWVNCAGC--SRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRT--TYNN 154
+DT SD+ WV C C S+C ++D+ L+DPSKS +S ACS CR Y N
Sbjct: 185 LLDTASDVAWVQCFPCPASQCYAQTDV-----LYDPSKSRSSESFACSSPTCRQLGPYAN 239
Query: 155 RYPSCSPGV-RCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQS 213
S S +C+Y V Y DGS+TSG V D + L+ S K FGC +
Sbjct: 240 GCSSSSNSAGQCQYRVRYPDGSTTSGTLVADQLSLSPTSQVPK-------FEFGCSHAAR 292
Query: 214 GDLGSSTDAAVDGILGFGQANSSLLSQLAAA-GNVRKEFAHCLDVV---KGGGIFAIGDV 269
G S A GI+ G+ SL+SQ + G V F++C KG + +
Sbjct: 293 GSFSRSKTA---GIMALGRGVQSLVSQTSTKYGQV---FSYCFPPTASHKGFFVLGVPRR 346
Query: 270 VSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPM 329
S + TPM+ Y V LE + V G LD+P ++ G +DS T + LPP
Sbjct: 347 SSSRYAVTPMLKTPMLYQVRLEAIAVAGQRLDVPPTVFAA----GAALDSRTVITRLPPT 402
Query: 330 LYDLVLSQFR 339
Y + S FR
Sbjct: 403 AYQALRSAFR 412
>gi|357124567|ref|XP_003563970.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 395
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 87/257 (33%), Positives = 121/257 (47%), Gaps = 28/257 (10%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNC-AGCSRCPTKSDLGIKLTLFDPSKSSTSGEI 141
Y+T + +G P Y++ +DTGSD W++C A C+ C TK P T G+I
Sbjct: 16 YYTSINIGNPPRPYFLDIDTGSDFTWIHCDAPCTNC-TKGP--------HPVYKPTEGKI 66
Query: 142 A-CSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPL 200
D C N+ C +C+Y +TY D SS+ G RD +QL A G +K
Sbjct: 67 VHPRDPLCEELQGNQN-YCETCKQCDYEITYADRSSSKGVLARDNMQLTTADGEMK---- 121
Query: 201 NSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL--DVV 258
N +FGC + Q G L S + DGILG SL +QLA +G + F HC+ D
Sbjct: 122 NVDFVFGCAHNQQGKLLDSP-TSTDGILGLSNGAISLSTQLANSGIISNVFGHCMATDPS 180
Query: 259 KGGGIFAIGDVVSPKVKTTPMVP--NMPH--YNVILEEVEVGGNPLDLPTSLLGTGDERG 314
GG +F +GD P+ T VP N P Y+ + +V G L+L G
Sbjct: 181 SGGYMF-LGDDYVPRWGMT-WVPIRNGPGNVYSTEVPKVNYGAQELNLRGQ---AGKLTQ 235
Query: 315 TIIDSGTTLAYLPPMLY 331
I DSG++ Y P +Y
Sbjct: 236 VIFDSGSSYTYFPHEIY 252
>gi|357117138|ref|XP_003560331.1| PREDICTED: aspartic proteinase-like protein 1-like, partial
[Brachypodium distachyon]
Length = 509
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 95/306 (31%), Positives = 140/306 (45%), Gaps = 34/306 (11%)
Query: 50 SALKQHDTRRH----GRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSD 105
SAL HD R G+ + + G + A L++ KV LGTP + V +DTGSD
Sbjct: 46 SALSAHDRARRVLAGGKGESLLSFADGNSTTRHAGSLHYAKVALGTPNATFVVALDTGSD 105
Query: 106 LLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGV-R 164
L WV C C RC ++ L + P +SSTS + CS + C +R +C G
Sbjct: 106 LFWVPC-DCKRCAPIANTSELLKPYSPRQSSTSKPVTCSHSLC-----DRPNACGNGNGS 159
Query: 165 CEYVVTY-GDGSSTSGYFVRDIIQLNQASGNLKTA-------PLNSSVIFGCGNRQSGDL 216
C Y V Y +S+SG V D++ + + S + ++ + + V+FGCG Q+G
Sbjct: 160 CPYTVKYVSANTSSSGVLVEDVLYMTRQSSSSRSGNGGNVGEAVGARVVFGCGQEQTGAF 219
Query: 217 GSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKE-FAHCLDVVKGGGIFAIGDVVSPKVK 275
AA++G+LG G S+ S LAAAG V + F+ C G G G+ +
Sbjct: 220 LDG--AAMEGLLGLGMDRVSVPSLLAAAGLVGSDSFSMCFS-PDGNGRINFGEPSDAGAQ 276
Query: 276 T-TPMV--PNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYD 332
TP + P YN+ + V V G E ++DSGT+ YL Y
Sbjct: 277 NETPFIVSKTRPTYNISVTAVNVKGKG--------AMAAEFAAVVDSGTSFTYLNDPAYS 328
Query: 333 LVLSQF 338
L+ + F
Sbjct: 329 LLATSF 334
>gi|42567433|ref|NP_195313.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|190576481|gb|ACE79041.1| At4g35880 [Arabidopsis thaliana]
gi|222423134|dbj|BAH19546.1| AT4G35880 [Arabidopsis thaliana]
gi|332661184|gb|AEE86584.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 524
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 85/265 (32%), Positives = 128/265 (48%), Gaps = 26/265 (9%)
Query: 82 LYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRC-PTKSDL---GIKLTLFDPSKSST 137
L++T V LGTP + V +DTGSDL WV C C +C PT+ +L++++P S+T
Sbjct: 106 LHYTTVKLGTPGMRFMVALDTGSDLFWVPC-DCGKCAPTEGATYASEFELSIYNPKVSTT 164
Query: 138 SGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDG-SSTSGYFVRDIIQLNQASGNLK 196
+ ++ C+++ C R C Y+V+Y +STSG + D++ L N +
Sbjct: 165 NKKVTCNNSLCA----QRNQCLGTFSTCPYMVSYVSAQTSTSGILMEDVMHLTTEDKNPE 220
Query: 197 TAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLD 256
+ + V FGCG QSG AA +G+ G G S+ S LA G V F+ C
Sbjct: 221 R--VEAYVTFGCGQVQSGSFLDI--AAPNGLFGLGMEKISVPSVLAREGLVADSFSMCFG 276
Query: 257 VVKGGGIFAIGDVVSPKVKTTP--MVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERG 314
G G + GD S + TP + P+ P+YN+ + V VG +D DE
Sbjct: 277 -HDGVGRISFGDKGSSDQEETPFNLNPSHPNYNITVTRVRVGTTLID---------DEFT 326
Query: 315 TIIDSGTTLAYLPPMLYDLVLSQFR 339
+ D+GT+ YL +Y V F
Sbjct: 327 ALFDTGTSFTYLVDPMYTTVSESFH 351
>gi|302769978|ref|XP_002968408.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
gi|300164052|gb|EFJ30662.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
Length = 492
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 88/300 (29%), Positives = 143/300 (47%), Gaps = 29/300 (9%)
Query: 49 LSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLW 108
L + RR ++ S ++L + G Y ++V +GTP E+ + VDTGS + +
Sbjct: 3 LELVANSHRRRDRELLGSARMDL--HDDLLTKGYYTSRVKIGTPPHEFSLIVDTGSTVTY 60
Query: 109 VNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYV 168
V C+ C+ C D F P+ SS+ + C C T + + G R +Y
Sbjct: 61 VPCSSCTHCGNHQD-----PRFSPALSSSYKPLECGSE-CSTGFCD-------GSR-KYQ 106
Query: 169 VTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGIL 228
Y + S++SG +D+I + +S +L ++FGC ++GDL D DGI+
Sbjct: 107 RQYAEKSTSSGVLGKDVIGFSNSS-DLG----GQRLVFGCETAETGDL---YDQTADGII 158
Query: 229 GFGQANSSLLSQLAAAGNVRKEFAHCL-DVVKGGGIFAIGDVVSPK--VKTTPMVPNMPH 285
G G+ S++ QL + F+ C + +GGG +G PK V T P+
Sbjct: 159 GLGRGPLSIIDQLVEKNAMEDVFSLCYGGMDEGGGAMILGGFQPPKDMVFTASDPHRSPY 218
Query: 286 YNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFRFWIASL 345
YN++L+ + VGG+PL L + + GT++DSGTT AY P + S + + SL
Sbjct: 219 YNLMLKGIRVGGSPLRLKPEVF--DGKYGTVLDSGTTYAYFPGAAFQAFKSAVKEQVGSL 276
>gi|115458644|ref|NP_001052922.1| Os04g0448300 [Oryza sativa Japonica Group]
gi|113564493|dbj|BAF14836.1| Os04g0448300 [Oryza sativa Japonica Group]
gi|215766465|dbj|BAG98773.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767943|dbj|BAH00172.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 454
Score = 111 bits (278), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 89/278 (32%), Positives = 128/278 (46%), Gaps = 40/278 (14%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGE 140
G + V +GTP Y VDTGSDL+W C C C +S +FDPS SST
Sbjct: 103 GEFLMDVSIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQST-----PVFDPSSSSTYAT 157
Query: 141 IACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPL 200
+ CS C ++ C+ +C Y TYGD SST G + L ++
Sbjct: 158 VPCSSASCSDLPTSK---CTSASKCGYTYTYGDSSSTQGVLATETFTLAKSK-------- 206
Query: 201 NSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKG 260
V+FGCG+ GD G S A G++G G+ SL+SQL +F++CL +
Sbjct: 207 LPGVVFGCGDTNEGD-GFSQGA---GLVGLGRGPLSLVSQLG-----LDKFSYCLTSLDD 257
Query: 261 --------GGIFAI--GDVVSPKVKTTPMV--PNMP-HYNVILEEVEVGGNPLDLPTSLL 307
G + I + V+TTP++ P+ P Y V L+ + VG + LP+S
Sbjct: 258 TNNSPLLLGSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAF 317
Query: 308 GTGDE--RGTIIDSGTTLAYLPPMLYDLVLSQFRFWIA 343
D+ G I+DSGT++ YL Y + F +A
Sbjct: 318 AVQDDGTGGVIVDSGTSITYLEVQGYRALKKAFAAQMA 355
>gi|356565521|ref|XP_003550988.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 634
Score = 111 bits (278), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 89/282 (31%), Positives = 134/282 (47%), Gaps = 47/282 (16%)
Query: 56 DTRRH--GRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAG 113
+++RH RM DL L G Y T++ +GTP + + VDTGS + +V C+
Sbjct: 63 ESKRHPNARMRLHDDLLLNG--------YYTTRLWIGTPPQMFALIVDTGSTVTYVPCST 114
Query: 114 CSRCPTKSDLGIKLTLFDPSKSSTSGEIACS-DNFCRTTYNNRYPSCSPGVRCEYVVTYG 172
C +C D F P SST + C+ D C S ++C Y Y
Sbjct: 115 CEQCGRHQD-----PKFQPESSSTYQPVKCTIDCNCD----------SDRMQCVYERQYA 159
Query: 173 DGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQ 232
+ S++SG D+I S + AP +FGC N ++GDL S DGI+G G+
Sbjct: 160 EMSTSSGVLGEDLISFGNQS---ELAP--QRAVFGCENVETGDLYSQ---HADGIMGLGR 211
Query: 233 ANSSLLSQLAAAGNVRKEFAHC---LDVVKGGGIFAIGDVVSPK----VKTTPMVPNMPH 285
+ S++ QL + F+ C +DV GGG +G + P + P+ P+
Sbjct: 212 GDLSIMDQLVDKNVISDSFSLCYGGMDV--GGGAMVLGGISPPSDMAFAYSDPV--RSPY 267
Query: 286 YNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLP 327
YN+ L+E+ V G L L ++ + GT++DSGTT AYLP
Sbjct: 268 YNIDLKEIHVAGKRLPLNANVF--DGKHGTVLDSGTTYAYLP 307
>gi|326490656|dbj|BAJ89995.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 456
Score = 111 bits (278), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 91/264 (34%), Positives = 132/264 (50%), Gaps = 36/264 (13%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y VG+G+P + +DTGSD+ WV C CS+C +++D +LFDPS SST +
Sbjct: 127 YLITVGMGSPAVAQTMLIDTGSDVSWVQCKPCSQCHSQAD-----SLFDPSSSSTYSAFS 181
Query: 143 CSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQAS-GNLKTAPLN 201
C+ C R CS +C+Y V YGDGS+ SG + D + L ++ N +
Sbjct: 182 CTSAACA---QLRQRGCSSS-QCQYTVKYGDGSTGSGTYSSDTLALGSSTVENFQ----- 232
Query: 202 SSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKG- 260
FGC +SG+L A + G+ G + SL +Q AG K F++CL G
Sbjct: 233 ----FGCSQSESGNLLQDQTAGLMGLGGGAE---SLATQ--TAGTFGKAFSYCLPPTPGS 283
Query: 261 GGIFAIGDVVSPKVKTTPM-----VPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGT 315
G +G S V TPM VP+ +Y V+L+ + VGG L++P S G+
Sbjct: 284 SGFLTLGASTSGFVVKTPMLRSTQVPS--YYGVLLQAIRVGGRQLNIPASAF----SAGS 337
Query: 316 IIDSGTTLAYLPPMLYDLVLSQFR 339
I+DSGT + LP Y + S F+
Sbjct: 338 IMDSGTIITRLPRTAYSALSSAFK 361
>gi|61214232|sp|Q766C2.1|NEP2_NEPGR RecName: Full=Aspartic proteinase nepenthesin-2; AltName:
Full=Nepenthesin-II; Flags: Precursor
gi|41016423|dbj|BAD07475.1| aspartic proteinase nepenthesin II [Nepenthes gracilis]
Length = 438
Score = 111 bits (278), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 92/322 (28%), Positives = 139/322 (43%), Gaps = 57/322 (17%)
Query: 35 VENKFKAGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTD 94
++ K G R R+++A+ Q + + A G+G Y V +GTP
Sbjct: 61 IKRAIKRGERRMRSINAMLQSSSGIETPVYA-------GDGE------YLMNVAIGTPDS 107
Query: 95 EYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCR----T 150
+ +DTGSDL+W C C++C ++ +F+P SS+ + C +C+
Sbjct: 108 SFSAIMDTGSDLIWTQCEPCTQCFSQ-----PTPIFNPQDSSSFSTLPCESQYCQDLPSE 162
Query: 151 TYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGN 210
T NN C+Y YGDGS+T GY + +S P ++ FGCG
Sbjct: 163 TCNNN--------ECQYTYGYGDGSTTQGYMATETFTFETSS-----VP---NIAFGCGE 206
Query: 211 RQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVV--KGGGIFAIGD 268
G G A G++G G SL SQL +F++C+ A+G
Sbjct: 207 DNQG-FGQGNGA---GLIGMGWGPLSLPSQLGVG-----QFSYCMTSYGSSSPSTLALGS 257
Query: 269 VVSPKVKTTPMVP------NMPHYNVILEEVEVGGNPLDLPTSLLGTGDE--RGTIIDSG 320
S + +P N +Y + L+ + VGG+ L +P+S D+ G IIDSG
Sbjct: 258 AASGVPEGSPSTTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSG 317
Query: 321 TTLAYLPPMLYDLVLSQFRFWI 342
TTL YLP Y+ V F I
Sbjct: 318 TTLTYLPQDAYNAVAQAFTDQI 339
>gi|413947545|gb|AFW80194.1| hypothetical protein ZEAMMB73_386053 [Zea mays]
Length = 456
Score = 111 bits (278), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 82/269 (30%), Positives = 118/269 (43%), Gaps = 28/269 (10%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y V +GTP + DTGSDL+WVNC+ SD + +F PS+S+T ++
Sbjct: 100 YLMYVNVGTPPAQMLAIADTGSDLVWVNCSSNGGGGGASDGAV---VFHPSRSTTYSLLS 156
Query: 143 CSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNS 202
C C+ SC C+Y YGDGS T G + A G +
Sbjct: 157 CQSAACQALSQA---SCDADSECQYQYAYGDGSRTIGVLSTETFSFAAAGGGGEGQVRVP 213
Query: 203 SVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL----DVV 258
V FGC +G S DG++G G SL+SQL AA + + F++CL
Sbjct: 214 RVSFGCSTGSAGSFRS------DGLVGLGAGALSLVSQLGAAARIARRFSYCLVPPYAAA 267
Query: 259 KGGGIFAIGD---VVSPKVKTTPMVPNM--PHYNVILEEVEVGGNPLDLPTSLLGTGDER 313
+ G V P +TP+VP+ +Y V LE V V G + + +
Sbjct: 268 NSSSTLSFGARAVVSDPGAASTPLVPSEVDSYYTVALESVAVAGQD-------VASANSS 320
Query: 314 GTIIDSGTTLAYLPPMLYDLVLSQFRFWI 342
I+DSGTTL +L P L ++++ I
Sbjct: 321 RIIVDSGTTLTFLDPALLRPLVAELERRI 349
>gi|357158691|ref|XP_003578210.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 459
Score = 111 bits (278), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 96/331 (29%), Positives = 145/331 (43%), Gaps = 42/331 (12%)
Query: 35 VENKFKAGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTD 94
V + R LS + + + R + G PS Y + +GTP
Sbjct: 56 VRRAVQRSKARAAALSVARLGGSNKGARQQDQNQQQPGLPVRPSGDLEYLVDLAVGTPPQ 115
Query: 95 EYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNN 154
+DTGSDL+W CA C+ C + D +F P SS+ + C+ C ++
Sbjct: 116 PVSALLDTGSDLIWTQCAPCASCLPQPD-----PIFSPGASSSYEPMRCAGELCNDILHH 170
Query: 155 RYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSG 214
SC C Y +YGDG++T G + + + +S +T L++ + FGCG G
Sbjct: 171 ---SCQRPDTCTYRYSYGDGTTTRGVYATERFTFSSSSSGGETTKLSAPLGFGCGTMNKG 227
Query: 215 DLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGG----IF-----A 265
L + + GI+GFG+A SL+SQLA + F++CL G +F
Sbjct: 228 SLNNGS-----GIVGFGRAPLSLVSQLAI-----RRFSYCLTPYASGRKSTLLFGSLRGG 277
Query: 266 IGDVVSPKVKTTPMV---PNMPHYNVILEEVEVGGNPLDLPTSLL-----GTGDERGTII 317
+ D + V+TT ++ N Y V V VG L +P S G+G G I+
Sbjct: 278 VYDAATATVQTTRLLRSRQNPTFYYVPFTGVTVGARRLRIPISAFALRPDGSG---GAIV 334
Query: 318 DSGTTLAYLP-PMLYDLV---LSQFRFWIAS 344
DSGT L P P+L ++V SQ R A+
Sbjct: 335 DSGTALTLFPAPVLAEVVRAFRSQLRLPFAA 365
>gi|356548395|ref|XP_003542587.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 525
Score = 111 bits (278), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 86/265 (32%), Positives = 126/265 (47%), Gaps = 27/265 (10%)
Query: 82 LYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSD-----LGIKLTLFDPSKSS 136
L++T + +GTP + V +D GSD+LWV C C C + S L L + PS S+
Sbjct: 104 LHYTWIDIGTPNVSFLVALDAGSDMLWVPC-DCIECASLSAGNYNVLDRDLNQYRPSLSN 162
Query: 137 TSGEIACSDNFCRTTYNNRYPSCSPGVR--CEYVVTYGDG-SSTSGYFVRDIIQLNQASG 193
TS + C C S G + C Y V Y +S+SGY D + L
Sbjct: 163 TSRHLPCGHKLCDVH------SFCKGSKDPCPYEVQYASANTSSSGYVFEDKLHLTSDGK 216
Query: 194 NLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAH 253
+ + + +S+I GCG +Q+GD A DG+LG G N S+ S LA AG ++ F+
Sbjct: 217 HAEQNSVQASIILGCGRKQTGDYLHG--AGPDGVLGLGPGNISVPSLLAKAGLIQNSFSI 274
Query: 254 CLDVVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDER 313
CLD + G I GD +TP +P + Y V +E VG SL
Sbjct: 275 CLDENESGRII-FGDQGHVTQHSTPFLPIIA-YMVGVESFCVG--------SLCLKETRF 324
Query: 314 GTIIDSGTTLAYLPPMLYDLVLSQF 338
+IDSG++ +LP +Y V+++F
Sbjct: 325 QALIDSGSSFTFLPNEVYQKVVTEF 349
>gi|224083514|ref|XP_002307058.1| predicted protein [Populus trichocarpa]
gi|222856507|gb|EEE94054.1| predicted protein [Populus trichocarpa]
Length = 376
Score = 111 bits (277), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 89/281 (31%), Positives = 128/281 (45%), Gaps = 29/281 (10%)
Query: 62 RMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC-AGCSRCPTK 120
R+ +SI L L GN +P+ G Y + +G P+ Y++ VDTGSDL W+ C A C +C
Sbjct: 1 RVPSSIVLPLHGNVYPN--GYYNVTLNIGQPSKPYFLDVDTGSDLTWLQCDAPCVQCTEA 58
Query: 121 SDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGY 180
P + + C D C++ ++N C +C+Y V Y DG S+ G
Sbjct: 59 PH---------PYYRPRNNLVPCMDPICQSLHSNGDHRCENPGQCDYEVEYADGGSSFGV 109
Query: 181 FVRDIIQLNQASGNLKTAPLNSSVIFG-CGNRQSGDLGSSTDAAVDGILGFGQANSSLLS 239
VRD LN S + +PL + G CG Q + +DG+LG G+ SS++S
Sbjct: 110 LVRDTFNLNFTSEK-RHSPL---LALGLCGYDQ---FPGGSHHPIDGVLGLGKGKSSIVS 162
Query: 240 QLAAAGNVRKEFAHCLD-VVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGN 298
QL++ G VR HCL G F S +V TPM P+ HY+ L E+ G
Sbjct: 163 QLSSLGLVRNVIGHCLSGHGGGFLFFGDDLYDSSRVAWTPMSPDAKHYSPGLAELTFDGK 222
Query: 299 PLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
L T DSG + YL Y ++S +
Sbjct: 223 TTGFKNLL--------TTFDSGASYTYLNSQAYQGLISLLK 255
>gi|38344829|emb|CAD40873.2| OSJNBa0064H22.10 [Oryza sativa Japonica Group]
gi|116310063|emb|CAH67084.1| H0818E04.1 [Oryza sativa Indica Group]
gi|116310186|emb|CAH67198.1| OSIGBa0152K17.10 [Oryza sativa Indica Group]
Length = 444
Score = 111 bits (277), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 89/278 (32%), Positives = 128/278 (46%), Gaps = 40/278 (14%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGE 140
G + V +GTP Y VDTGSDL+W C C C +S +FDPS SST
Sbjct: 93 GEFLMDVSIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQST-----PVFDPSSSSTYAT 147
Query: 141 IACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPL 200
+ CS C ++ C+ +C Y TYGD SST G + L ++
Sbjct: 148 VPCSSASCSDLPTSK---CTSASKCGYTYTYGDSSSTQGVLATETFTLAKSK-------- 196
Query: 201 NSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKG 260
V+FGCG+ GD G S A G++G G+ SL+SQL +F++CL +
Sbjct: 197 LPGVVFGCGDTNEGD-GFSQGA---GLVGLGRGPLSLVSQLG-----LDKFSYCLTSLDD 247
Query: 261 --------GGIFAI--GDVVSPKVKTTPMV--PNMP-HYNVILEEVEVGGNPLDLPTSLL 307
G + I + V+TTP++ P+ P Y V L+ + VG + LP+S
Sbjct: 248 TNNSPLLLGSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAF 307
Query: 308 GTGDE--RGTIIDSGTTLAYLPPMLYDLVLSQFRFWIA 343
D+ G I+DSGT++ YL Y + F +A
Sbjct: 308 AVQDDGTGGVIVDSGTSITYLEVQGYRALKKAFAAQMA 345
>gi|225427552|ref|XP_002266498.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 441
Score = 111 bits (277), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 107/348 (30%), Positives = 147/348 (42%), Gaps = 36/348 (10%)
Query: 3 GLRLLALVVVTVAVVHQWAVG---GGGVMGNFVFEVENK---FKAGGERERTLSALKQHD 56
G+++ VVV + H VG GGG + + F R L+
Sbjct: 5 GVKIFFNVVVVGFLFHLLEVGLASGGGFSVDLIHRDSPHSPFFDPSKTRTERLTDAFHRS 64
Query: 57 TRRHGRMMASIDLELGGNGH--PSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGC 114
R GR S G PSA G Y + +GTP VDTGSDL W C C
Sbjct: 65 ASRVGRFRQSAMTSDGIQSRLVPSA-GEYIMNLSIGTPPVPVIAIVDTGSDLTWTQCRPC 123
Query: 115 SRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDG 174
+ C + + FDP SST + +C +FC N+R SC G +C ++ +Y DG
Sbjct: 124 THCYKQV-----VPFFDPKNSSTYRDSSCGTSFCLALGNDR--SCRNGKKCTFMYSYADG 176
Query: 175 SSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQAN 234
S T G + + + +G + P FGC +R G D GI+G G A
Sbjct: 177 SFTGGNLAVETLTVASTAGKPVSFP---GFAFGCVHRSGGIF----DEHSSGIVGLGVAE 229
Query: 235 SSLLSQLAAAGNVRKEFAHCL-----DVVKGGGI-FAIGDVVS-PKVKTTPMV---PNMP 284
S++SQL + N R F++CL D I F +VS +TP+V P+
Sbjct: 230 LSMISQLKSTINGR--FSYCLLPVFTDSSMSSRINFGRSGIVSGAGTVSTPLVMKGPDTY 287
Query: 285 HYNVILEEVEVGGNPLDLP-TSLLGTGDERGTIIDSGTTLAYLPPMLY 331
+Y + LE VG L S +E I+DSGTT YLP Y
Sbjct: 288 YYLITLEGFSVGKKRLSYKGFSKKAEVEEGNIIVDSGTTYTYLPLEFY 335
>gi|359497446|ref|XP_003635520.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
vinifera]
Length = 354
Score = 111 bits (277), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 89/262 (33%), Positives = 126/262 (48%), Gaps = 34/262 (12%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGC-SRCPTKSDLGIKLTLFDPSKSSTS 138
+G Y+ KVGLG+P Y + VDTGS L W+ C C C ++D LFDPS S T
Sbjct: 10 SGNYYVKVGLGSPARYYSMIVDTGSSLSWLQCKPCVVYCHVQAD-----PLFDPSASKTY 64
Query: 139 GEIACSDNFCRT----TYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGN 194
++C+ + C + T NN S V C Y +YGD S + GY +D++ L +
Sbjct: 65 KSLSCTSSQCSSLVDATLNNPLCETSSNV-CVYTASYGDSSYSMGYLSQDLLTLAPS--- 120
Query: 195 LKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHC 254
+T P ++GCG G G + GILG G+ S+L Q+++ F++C
Sbjct: 121 -QTLP---GFVYGCGQDSEGLFGRAA-----GILGLGRNKLSMLGQVSS--KFGYAFSYC 169
Query: 255 LDVVKGGGIFAIG--DVVSPKVKTTPMV--PNMPH-YNVILEEVEVGGNPLDLPTSLLGT 309
L GGG +IG + K TPM P P Y + L + VGG L + +
Sbjct: 170 LPTRGGGGFLSIGKASLAGSAYKFTPMTTDPGNPSLYFLRLTAITVGGRALGVAAAQY-- 227
Query: 310 GDERGTIIDSGTTLAYLPPMLY 331
TIIDSGT + LP +Y
Sbjct: 228 --RVPTIIDSGTVITRLPMSVY 247
>gi|125563959|gb|EAZ09339.1| hypothetical protein OsI_31611 [Oryza sativa Indica Group]
Length = 453
Score = 111 bits (277), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 86/275 (31%), Positives = 127/275 (46%), Gaps = 43/275 (15%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y + +GTP +DTGSDL+W C C+ C + D LF P SS+ +
Sbjct: 98 YVLDLAVGTPPQPITALLDTGSDLIWTQCDTCTACLRQPD-----PLFSPRMSSSYEPMR 152
Query: 143 CSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNS 202
C+ C ++ SC C Y +YGDG++T GY+ + +SG ++ PL
Sbjct: 153 CAGQLCGDILHH---SCVRPDTCTYRYSYGDGTTTLGYYATERFTFASSSGETQSVPLG- 208
Query: 203 SVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL------- 255
FGCG G L +++ GI+GFG+ SL+SQL+ + F++CL
Sbjct: 209 ---FGCGTMNVGSLNNAS-----GIVGFGRDPLSLVSQLSI-----RRFSYCLTPYASSR 255
Query: 256 -DVVKGGGIFAIG--DVVSPKVKTTPMV---PNMPHYNVILEEVEVGGNPLDLPTSLL-- 307
++ G + +G D + V+TTP++ N Y V V VG L +P S
Sbjct: 256 KSTLQFGSLADVGLYDDATGPVQTTPILQSAQNPTFYYVAFTGVTVGARRLRIPASAFAL 315
Query: 308 ---GTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
G+G G IIDSGT L P + V+ FR
Sbjct: 316 RPDGSG---GVIIDSGTALTLFPAAVLAEVVRAFR 347
>gi|357124468|ref|XP_003563922.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 450
Score = 111 bits (277), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 103/307 (33%), Positives = 145/307 (47%), Gaps = 48/307 (15%)
Query: 49 LSALKQHDTRRHGRM---MASIDLE-LGGNGHPSATGL------YFTKVGLGTPTDEYYV 98
SA HD R + +A+ D + + + P A+G Y T++GLGTPT Y +
Sbjct: 64 FSAFITHDAARIAGLASRLATKDKDWVAASSVPLASGASVGVGNYITRLGLGTPTTTYVM 123
Query: 99 QVDTGSDLLWVNCAGCS-RCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFC----RTTYN 153
VD+GS L W+ CA C+ C ++ L+DP SST + CS C T N
Sbjct: 124 VVDSGSSLTWLQCAPCAVSCHPQAG-----PLYDPRASSTYAAVPCSAPQCAELQAATLN 178
Query: 154 NRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQS 213
SCS C+Y +YGDGS + GY +D + L+ +SG+ +GCG
Sbjct: 179 PS--SCSGSGVCQYQASYGDGSFSFGYLSKDTVSLS-SSGSFP------GFYYGCGQDNV 229
Query: 214 GDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL--DVVKGGGIFAIG---D 268
G G + G++G + SLLSQLA +V FA+CL G + G D
Sbjct: 230 GLFGRAA-----GLIGLARNKLSLLSQLAP--SVGNSFAYCLPTSAAASAGYLSFGSNSD 282
Query: 269 VVSP-KVKTTPMVP---NMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLA 324
+P K T MV + Y V L + V G+PL +P+S G+ TIIDSGT +
Sbjct: 283 NKNPGKYSYTSMVSSSLDASLYFVSLAGMSVAGSPLAVPSSEYGS---LPTIIDSGTVIT 339
Query: 325 YLPPMLY 331
LP +Y
Sbjct: 340 RLPTPVY 346
>gi|226501154|ref|NP_001146408.1| uncharacterized protein LOC100279988 [Zea mays]
gi|219887047|gb|ACL53898.1| unknown [Zea mays]
gi|414587777|tpg|DAA38348.1| TPA: hypothetical protein ZEAMMB73_272638 [Zea mays]
Length = 416
Score = 111 bits (277), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 81/264 (30%), Positives = 123/264 (46%), Gaps = 24/264 (9%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRC-PTKSDLGIKLTLFDPSKSSTS 138
+ L++ V +GTP + V +DTGSDL W+ C C C P + T + P SSTS
Sbjct: 4 SSLHYALVTVGTPGQTFMVALDTGSDLFWLPCQ-CDGCTPPATAASGSATFYIPGMSSTS 62
Query: 139 GEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTY-GDGSSTSGYFVRDIIQLNQASGNLKT 197
+ C+ NFC + CS ++C Y + Y G+S+SG+ V D++ L ++ N
Sbjct: 63 KAVPCNSNFC-----DLQKECSTALQCPYKMVYVSAGTSSSGFLVEDVLYL--STENAHP 115
Query: 198 APLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDV 257
L + ++ GCG Q+G + AA +G+ G G S+ S LA G F+ C
Sbjct: 116 QILKAQIMLGCGQTQTGSFLDA--AAPNGLFGLGIDEVSVPSILAQKGLTSNSFSMCFG- 172
Query: 258 VKGGGIFAIGDVVSPKVKTTPMVPNMPH--YNVILEEVEVGGNPLDLPTSLLGTGDERGT 315
G G + GD S + TP+ N H Y + + + VG P D+ + T
Sbjct: 173 RDGIGRISFGDQESSDQEETPLDINRQHPTYAITISGITVGNKPTDM---------DFIT 223
Query: 316 IIDSGTTLAYLPPMLYDLVLSQFR 339
I D+GT+ YL Y + F
Sbjct: 224 IFDTGTSFTYLADPAYTYITQSFH 247
>gi|225430555|ref|XP_002285593.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 481
Score = 111 bits (277), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 91/269 (33%), Positives = 126/269 (46%), Gaps = 32/269 (11%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSR-CPTKSDLGIKLTLFDPSKSSTS 138
TG Y VGLGTP + DTGSDL W C C+R C + + +F+PSKS++
Sbjct: 135 TGNYVVTVGLGTPKRDLTFIFDTGSDLTWTQCEPCARYCYHQQE-----PIFNPSKSTSY 189
Query: 139 GEIACSDNFCRTTYN--NRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLK 196
I+CS C + PSCS C Y + YGD S + G+F +D + L
Sbjct: 190 TNISCSSPTCDELKSGTGNSPSCSAST-CVYGIQYGDQSYSVGFFAQDKLALT------- 241
Query: 197 TAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLD 256
+ + ++ +FGCG G V G++G G+ SL+SQ A K F++CL
Sbjct: 242 STDVFNNFLFGCGQNNRGLF-----VGVAGLIGLGRNALSLVSQTAQ--KYGKLFSYCLP 294
Query: 257 VVK---GGGIFAIGDVVSPKVKTTPMVPNM---PHYNVILEEVEVGGNPLDLPTSLLGTG 310
G F G S VK TP + N Y + L + VGG L S+ T
Sbjct: 295 STSSSTGYLTFGSGGGTSKAVKFTPSLVNSQGPSFYFLNLIAISVGGRKLSTSASVFSTA 354
Query: 311 DERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
GTIIDSGT ++ LPP Y + + F+
Sbjct: 355 ---GTIIDSGTVISRLPPTAYSDLRASFQ 380
>gi|219886219|gb|ACL53484.1| unknown [Zea mays]
gi|219888509|gb|ACL54629.1| unknown [Zea mays]
gi|414588374|tpg|DAA38945.1| TPA: nucellin-like aspartic protease [Zea mays]
Length = 415
Score = 111 bits (277), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 80/285 (28%), Positives = 128/285 (44%), Gaps = 45/285 (15%)
Query: 69 LELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLT 128
+L G+ +P TG Y+ + +G P Y++ VDTGSDL W+ C P +S +
Sbjct: 41 FQLQGDVYP--TGHYYVTMNIGNPAKPYFLDVDTGSDLTWLQCDA----PCRSCNKVPHP 94
Query: 129 LFDPSKSSTSGEIACSDNFCRTTY-----NNRYPSCSPGVRCEYVVTYGDGSSTSGYFVR 183
L+ P+ + + C++ C + NN+ PS +C+Y + Y D +S+ G +
Sbjct: 95 LYRPTANRL---VPCANALCTALHSGQGSNNKCPSPK---QCDYQIKYTDSASSQGVLIN 148
Query: 184 DIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAA 243
D L S N++ + FGCG Q + AA+DG+LG G+ + SL+SQL
Sbjct: 149 DSFSLPMRSSNIRPG-----LTFGCGYDQQVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQ 203
Query: 244 AGNVRKEFAHCLDVVKGGGIFAIGDVVSPKVKTT--PMVPNMP--HYN-----VILEEVE 294
G + HCL GGG GD V P + T PM +Y+ + +
Sbjct: 204 QGITKNVVGHCLS-TNGGGFLFFGDDVVPSSRVTWVPMAQRTSGNYYSPGSGTLYFDRRS 262
Query: 295 VGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
+G P+++ + DSG+T Y Y V+S +
Sbjct: 263 LGVKPMEV-------------VFDSGSTYTYFTAQPYQAVVSALK 294
>gi|195645150|gb|ACG42043.1| aspartic proteinase Asp1 precursor [Zea mays]
Length = 415
Score = 111 bits (277), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 80/285 (28%), Positives = 128/285 (44%), Gaps = 45/285 (15%)
Query: 69 LELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLT 128
+L G+ +P TG Y+ + +G P Y++ VDTGSDL W+ C P +S +
Sbjct: 41 FQLQGDVYP--TGHYYVTMNIGNPAKPYFLDVDTGSDLTWLQCDA----PCRSCNKVPHP 94
Query: 129 LFDPSKSSTSGEIACSDNFCRTTY-----NNRYPSCSPGVRCEYVVTYGDGSSTSGYFVR 183
L+ P+ + + C++ C + NN+ PS +C+Y + Y D +S+ G +
Sbjct: 95 LYRPTANRL---VPCANALCTALHSGQGSNNKCPSPK---QCDYQIKYTDSASSQGVLIN 148
Query: 184 DIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAA 243
D L S N++ + FGCG Q + AA+DG+LG G+ + SL+SQL
Sbjct: 149 DSFSLPMRSSNIRPG-----LTFGCGYDQQVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQ 203
Query: 244 AGNVRKEFAHCLDVVKGGGIFAIGDVVSPKVKTT--PMVPNMP--HYN-----VILEEVE 294
G + HCL GGG GD V P + T PM +Y+ + +
Sbjct: 204 QGITKNVVGHCLS-TNGGGFLFFGDDVVPSSRVTWVPMAQRTSGNYYSPGSGTLYFDRRS 262
Query: 295 VGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
+G P+++ + DSG+T Y Y V+S +
Sbjct: 263 LGVKPMEV-------------VFDSGSTYTYFTAQPYQAVVSALK 294
>gi|224096119|ref|XP_002310541.1| predicted protein [Populus trichocarpa]
gi|222853444|gb|EEE90991.1| predicted protein [Populus trichocarpa]
Length = 379
Score = 111 bits (277), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 86/283 (30%), Positives = 125/283 (44%), Gaps = 33/283 (11%)
Query: 62 RMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC----AGCSRC 117
R+ +SI L L GN +P TG Y + +G P+ Y++ VDTGSDL W+ C A C+
Sbjct: 1 RVPSSIVLPLHGNVYP--TGFYNVTLNIGQPSKPYFLDVDTGSDLTWLQCDVPRAQCTEA 58
Query: 118 PTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSST 177
P P ++ +AC D C++ + C +C+Y V Y DG S+
Sbjct: 59 P------------HPYYKPSNNLVACKDPICQSLHTGGDQRCENPGQCDYEVEYADGGSS 106
Query: 178 SGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSL 237
G V+D LN S ++ L + CG Q L T +DG+LG G+ S+
Sbjct: 107 LGVLVKDAFNLNFTSEKRQSPLLALGL---CGYDQ---LPGGTYHPIDGVLGLGRGKPSI 160
Query: 238 LSQLAAAGNVRKEFAHCLD-VVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVG 296
+SQL+ G VR HCL G F S +V TPM PN HY+ E+
Sbjct: 161 VSQLSGLGLVRNVIGHCLSGRGGGFLFFGDDLYDSSRVAWTPMSPNAKHYSPGFAELTFD 220
Query: 297 GNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
G ++ DSG + YL +Y ++S +
Sbjct: 221 GKTTGFKNLIVA--------FDSGASYTYLNSQVYQGLISLIK 255
>gi|186510920|ref|NP_190702.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332645260|gb|AEE78781.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 530
Score = 111 bits (277), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 98/343 (28%), Positives = 149/343 (43%), Gaps = 40/343 (11%)
Query: 9 LVVVTVAVVHQWAVGGGGVMGNFVFEVENKFK--------------AGGERERTLSALKQ 54
V++++ V+ W + G F FEV + F G E L
Sbjct: 8 FVLLSMLVLIFWGLERCEASGKFSFEVHHMFSDVVKQTLGFDDLVPENGSLEY-FKVLAH 66
Query: 55 HDTRRHGRMMASIDLE-----LGGNGHPSATGL---YFTKVGLGTPTDEYYVQVDTGSDL 106
D GR +AS + E +G N + L ++ V LGTP + V +DTGSDL
Sbjct: 67 RDRFIRGRGLASNNEETPLTSIGSNLTLALNFLGFLHYANVSLGTPATWFLVALDTGSDL 126
Query: 107 LWVNCAGCSRC-----PTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSP 161
W+ C + C + + L L+ P+ S+TS I CSD C + SP
Sbjct: 127 FWLPCNCGTTCIHDLKDARFSESVPLNLYTPNASTTSSSIRCSDKRCFGSGK----CSSP 182
Query: 162 GVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTD 221
C Y + + T+G ++D++ L +LK P+N++V GCG Q+G TD
Sbjct: 183 ESICPYQIALSSNTVTTGTLLQDVLHLVTEDEDLK--PVNANVTLGCGQNQTGAF--QTD 238
Query: 222 AAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL-DVVKGGGIFAIGDVVSPKVKTTPMV 280
AV+G+LG S+ S LA A F+ C ++ G + GD + TP+V
Sbjct: 239 IAVNGVLGLSMKEYSVPSLLAKANITANSFSMCFGRIISVVGRISFGDKGYTDQEETPLV 298
Query: 281 --PNMPHYNVILEEVEVGGNPLDLPT-SLLGTGDERGTIIDSG 320
Y V + V VGG P+D+P +L TG +++S
Sbjct: 299 SLETSTAYGVNVTGVSVGGVPVDVPLFALFDTGSSFTLLLESA 341
>gi|224142011|ref|XP_002324354.1| predicted protein [Populus trichocarpa]
gi|222865788|gb|EEF02919.1| predicted protein [Populus trichocarpa]
Length = 471
Score = 111 bits (277), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 83/262 (31%), Positives = 124/262 (47%), Gaps = 26/262 (9%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCS-RCPTKSDLGIKLTLFDPSKSSTSG 139
G Y VGLGTP ++ + DTGSDL W C CS C ++D FDP+KS++
Sbjct: 130 GGYAVTVGLGTPKKDFSLLFDTGSDLTWTQCEPCSGGCFPQND-----EKFDPTKSTSYK 184
Query: 140 EIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAP 199
++CS C++ CS C Y V YG G T G+ + + + +
Sbjct: 185 NLSCSSEPCKSIGKESAQGCSSSNSCLYGVKYGTG-YTVGFLATETLTITPSD------- 236
Query: 200 LNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVK 259
+ + + GCG R G + G+LG G++ +L SQ ++ + F++CL
Sbjct: 237 VFENFVIGCGERNGGRF-----SGTAGLLGLGRSPVALPSQTSST--YKNLFSYCLPASS 289
Query: 260 GG-GIFAIGDVVSPKVKTTPMVPNMPH-YNVILEEVEVGGNPLDLPTSLLGTGDERGTII 317
G + G VS K TP+ +P Y + + + VGG L + S+ T GTII
Sbjct: 290 SSTGHLSFGGGVSQAAKFTPITSKIPELYGLDVSGISVGGRKLPIDPSVFRTA---GTII 346
Query: 318 DSGTTLAYLPPMLYDLVLSQFR 339
DSGTTL YLP + + S F+
Sbjct: 347 DSGTTLTYLPSTAHSALSSAFQ 368
>gi|356559246|ref|XP_003547911.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 516
Score = 111 bits (277), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 92/294 (31%), Positives = 135/294 (45%), Gaps = 36/294 (12%)
Query: 50 SALKQHDTRRHGRMMASID------LELGGNGHPSATG--LYFTKVGLGTPTDEYYVQVD 101
+ + D GR +A D G + H A+ L+F V +GTP + V +D
Sbjct: 64 AVMAHRDRVFRGRRLAGADHHSPLTFAAGNDTHQIASSGFLHFANVSVGTPPLWFLVALD 123
Query: 102 TGSDLLWVNCAGCSRC-----PTKSDLGIKLTLFDPSKSSTSGEIACSDN-FCRTTYNNR 155
TGSDL W+ C C C T++ +K +D KSSTS E++C+++ FCR R
Sbjct: 124 TGSDLFWLPC-DCISCVHGGLRTRTGKILKFNTYDLDKSSTSNEVSCNNSTFCR----QR 178
Query: 156 YPSCSPGVRCEYVVTY-GDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSG 214
S G C Y V Y + +S+ G+ V D++ L K A ++ + FGCG Q+G
Sbjct: 179 QQCPSAGSTCRYQVDYLSNDTSSRGFVVEDVLHLITDDDQTKDA--DTRIAFGCGQVQTG 236
Query: 215 DLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSPKV 274
+ AA +G+ G G N S+ S LA G + F+ C G I GD SP
Sbjct: 237 VFLNG--AAPNGLFGLGMDNISVPSILAREGLISNSFSMCFGSDSAGRI-TFGDTGSPDQ 293
Query: 275 KTTPMVPN--MPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYL 326
+ TP P YN+ + ++ V + DL E I DSGT+ Y+
Sbjct: 294 RKTPFNVRKLHPTYNITITKIIVEDSVADL---------EFHAIFDSGTSFTYI 338
>gi|125548488|gb|EAY94310.1| hypothetical protein OsI_16079 [Oryza sativa Indica Group]
Length = 423
Score = 110 bits (276), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 89/278 (32%), Positives = 128/278 (46%), Gaps = 40/278 (14%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGE 140
G + V +GTP Y VDTGSDL+W C C C +S +FDPS SST
Sbjct: 72 GEFLMDVSIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQST-----PVFDPSSSSTYAT 126
Query: 141 IACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPL 200
+ CS C ++ C+ +C Y TYGD SST G + L ++
Sbjct: 127 VPCSSASCSDLPTSK---CTSASKCGYTYTYGDSSSTQGVLATETFTLAKSK-------- 175
Query: 201 NSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKG 260
V+FGCG+ GD G S A G++G G+ SL+SQL +F++CL +
Sbjct: 176 LPGVVFGCGDTNEGD-GFSQGA---GLVGLGRGPLSLVSQLG-----LDKFSYCLTSLDD 226
Query: 261 --------GGIFAI--GDVVSPKVKTTPMV--PNMP-HYNVILEEVEVGGNPLDLPTSLL 307
G + I + V+TTP++ P+ P Y V L+ + VG + LP+S
Sbjct: 227 TNNSPLLLGSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAF 286
Query: 308 GTGDE--RGTIIDSGTTLAYLPPMLYDLVLSQFRFWIA 343
D+ G I+DSGT++ YL Y + F +A
Sbjct: 287 AVQDDGTGGVIVDSGTSITYLEVQGYRALKKAFAAQMA 324
>gi|356540838|ref|XP_003538891.1| PREDICTED: peroxidase [Glycine max]
Length = 829
Score = 110 bits (276), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 86/262 (32%), Positives = 120/262 (45%), Gaps = 23/262 (8%)
Query: 82 LYFTKVGLGTPTDEYYVQVDTGSDLLWV--NCAGCSRCPTKSDLGIKLTLFDPSKSSTSG 139
L+F V +GTP + V +DTGSDL W+ NC C R + I ++D SSTS
Sbjct: 101 LHFANVSVGTPPLSFLVALDTGSDLFWLPCNCTKCVRGVESNGEKIAFNIYDLKGSSTSQ 160
Query: 140 EIACSDNFCRTTYNNRYPSCSPGVRCEYVVTY-GDGSSTSGYFVRDIIQLNQASGNLKTA 198
+ C+ N C R S + C Y V Y +G+ST+G+ V D++ L K A
Sbjct: 161 TVLCNSNLCEL---QRQCPSSDSI-CPYEVNYLSNGTSTTGFLVEDVLHLITDDDETKDA 216
Query: 199 PLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVV 258
++ + FGCG Q+G AA +G+ G G N S+ S LA G F+ C
Sbjct: 217 --DTRITFGCGQVQTGAFLDG--AAPNGLFGLGMGNESVPSILAKEGLTSNSFSMCFG-S 271
Query: 259 KGGGIFAIGDVVSPKVKTTP--MVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTI 316
G G GD S TP + P YN+ + ++ VGGN DL E I
Sbjct: 272 DGLGRITFGDNSSLVQGKTPFNLRALHPTYNITVTQIIVGGNAADL---------EFHAI 322
Query: 317 IDSGTTLAYLPPMLYDLVLSQF 338
DSGT+ +L Y + + F
Sbjct: 323 FDSGTSFTHLNDPAYKQITNSF 344
>gi|218186522|gb|EEC68949.1| hypothetical protein OsI_37668 [Oryza sativa Indica Group]
Length = 421
Score = 110 bits (276), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 86/288 (29%), Positives = 131/288 (45%), Gaps = 34/288 (11%)
Query: 61 GRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC-AGCSRCPT 119
G +S L G+ +P GLY+ + +G P Y++ VDTGSDL W+ C A C C
Sbjct: 38 GAEESSAVFPLYGDVYPH--GLYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCSK 95
Query: 120 KSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTY---NNRYPSCSPGVRCEYVVTYGDGSS 176
+ L+ P+K+ + C D C + R+ SP +C+Y + Y D S
Sbjct: 96 -----VPHPLYRPTKNKL---VPCVDQMCAALHGGLTGRHKCDSPKQQCDYEIKYADQGS 147
Query: 177 TSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTD-AAVDGILGFGQANS 235
+ G V D L A+ ++ + + FGCG Q +GSST+ +A DG+LG G +
Sbjct: 148 SLGVLVTDSFALRLANSSI----VRPGLAFGCGYDQQ--VGSSTEVSATDGVLGLGSGSV 201
Query: 236 SLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSPKVKTT--PMVPNMP--HYNVILE 291
SLLSQL G + HCL +GGG GD + P + T PM + +Y+
Sbjct: 202 SLLSQLKQHGITKNVVGHCLS-TRGGGFLFFGDDIVPYSRATWAPMARSTSRNYYSPGSA 260
Query: 292 EVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
+ GG PL + + DSG++ Y Y ++ +
Sbjct: 261 NLYFGGRPLGV--------RPMEVVFDSGSSFTYFSAQPYQALVDAIK 300
>gi|242073260|ref|XP_002446566.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
gi|241937749|gb|EES10894.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
Length = 452
Score = 110 bits (276), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 98/321 (30%), Positives = 137/321 (42%), Gaps = 49/321 (15%)
Query: 49 LSALKQHDTRRHGRMMASIDLELG-----GNGH-----PSATGLYFTKVGLGTPTDEYYV 98
L L++ R H RM + G G G + G + V +GTP Y
Sbjct: 56 LQLLQRAARRSHHRMSRLVARATGVKAVAGGGDLQVPVHAGNGEFLMDVAIGTPALSYAA 115
Query: 99 QVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPS 158
VDTGSDL+W C C C +S +FDPS SST + CS C + +
Sbjct: 116 IVDTGSDLVWTQCKPCVDCFKQST-----PVFDPSSSSTYATVPCSSALCSDLPTS---T 167
Query: 159 CSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGS 218
C+ +C Y TYGD SST G + L + L V FGCG+ GD G
Sbjct: 168 CTSASKCGYTYTYGDASSTQGVLASETFTLGKEKKKLP------GVAFGCGDTNEGD-GF 220
Query: 219 STDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLD-----------VVKGGGIFAIG 267
+ A G++G G+ SL+SQL +F++CL ++ G
Sbjct: 221 TQGA---GLVGLGRGPLSLVSQLGL-----DKFSYCLTSLDDGDGKSPLLLGGSAAAISE 272
Query: 268 DVVSPKVKTTPMV--PNMP-HYNVILEEVEVGGNPLDLPTSLLGTGDE--RGTIIDSGTT 322
+ V+TTP+V P+ P Y V L + VG + LP S D+ G I+DSGT+
Sbjct: 273 SAATAPVQTTPLVKNPSQPSFYYVSLTGLTVGSTRITLPASAFAIQDDGTGGVIVDSGTS 332
Query: 323 LAYLPPMLYDLVLSQFRFWIA 343
+ YL Y + F +A
Sbjct: 333 ITYLELQGYRALKKAFVAQMA 353
>gi|56692305|dbj|BAD80835.1| nucellin-like protein [Daucus carota]
Length = 426
Score = 110 bits (276), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 87/272 (31%), Positives = 128/272 (47%), Gaps = 32/272 (11%)
Query: 69 LELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC-AGCSRCPTKSDLGIKL 127
L L GN +PS G Y + +G P Y++ DTGSDL W+ C A C +C
Sbjct: 55 LPLYGNVYPS--GYYHVQFNIGQPPKPYFLDPDTGSDLTWLQCDAPCIQCTPAPH----- 107
Query: 128 TLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQ 187
L+ P T+ + C D C + + + Y C +C+Y V Y DG S+ G V D+
Sbjct: 108 PLYQP----TNDLVVCKDPICASLHPDNY-RCDDPDQCDYEVEYADGGSSIGVLVNDLFP 162
Query: 188 LNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNV 247
+N SG ++ P + GCG Q L +DG+LG G+ +SS+++QL++ G V
Sbjct: 163 VNLTSG-MRARP---RLTIGCGYDQ---LPGIAYHPLDGVLGLGRGSSSIVAQLSSQGLV 215
Query: 248 RKEFAHCLDVVKGGGIFAIGDVV--SPKVKTTPMVPN-MPHYNVILEEVEVGGNPLDLPT 304
R HC +GGG GD + S KV TPM + + HY E+ + G L
Sbjct: 216 RNVVGHCFS-RRGGGYLFFGDDIYDSSKVIWTPMSRDYLKHYTPGFAELILNGRSSGLKN 274
Query: 305 SLLGTGDERGTIIDSGTTLAYLPPMLYDLVLS 336
L+ + DSG++ Y Y +LS
Sbjct: 275 LLV--------VFDSGSSYTYFNTQTYQTLLS 298
>gi|222616728|gb|EEE52860.1| hypothetical protein OsJ_35411 [Oryza sativa Japonica Group]
Length = 395
Score = 110 bits (276), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 86/288 (29%), Positives = 131/288 (45%), Gaps = 34/288 (11%)
Query: 61 GRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC-AGCSRCPT 119
G +S L G+ +P GLY+ + +G P Y++ VDTGSDL W+ C A C C
Sbjct: 38 GAEESSAVFPLYGDVYPH--GLYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCSK 95
Query: 120 KSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTY---NNRYPSCSPGVRCEYVVTYGDGSS 176
+ L+ P+K+ + C D C + R+ SP +C+Y + Y D S
Sbjct: 96 -----VPHPLYRPTKNKL---VPCVDQMCAALHGGLTGRHKCDSPKQQCDYEIKYADQGS 147
Query: 177 TSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTD-AAVDGILGFGQANS 235
+ G V D L A+ ++ + + FGCG Q +GSST+ +A DG+LG G +
Sbjct: 148 SLGVLVTDSFALRLANSSI----VRPGLAFGCGYDQ--QVGSSTEVSATDGVLGLGSGSV 201
Query: 236 SLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSPKVKTT--PMVPNMP--HYNVILE 291
SLLSQL G + HCL +GGG GD + P + T PM + +Y+
Sbjct: 202 SLLSQLKQHGITKNVVGHCLS-TRGGGFLFFGDDIVPYSRATWAPMARSTSRNYYSPGSA 260
Query: 292 EVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
+ GG PL + + DSG++ Y Y ++ +
Sbjct: 261 NLYFGGRPLGV--------RPMEVVFDSGSSFTYFSAQPYQALVDAIK 300
>gi|115457374|ref|NP_001052287.1| Os04g0228000 [Oryza sativa Japonica Group]
gi|38346027|emb|CAE01958.2| OSJNBb0071D01.4 [Oryza sativa Japonica Group]
gi|113563858|dbj|BAF14201.1| Os04g0228000 [Oryza sativa Japonica Group]
gi|215740420|dbj|BAG97076.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222626225|gb|EEE60357.1| hypothetical protein OsJ_13479 [Oryza sativa Japonica Group]
Length = 530
Score = 110 bits (276), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 83/262 (31%), Positives = 120/262 (45%), Gaps = 24/262 (9%)
Query: 82 LYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRC-PTKSDLGIKLTLFDPSKSSTSGE 140
L++ V +GTP + V +DTGSDL W+ C C C P S + + PS SSTS
Sbjct: 115 LHYALVTVGTPGQTFMVALDTGSDLFWLPCQ-CDGCTPPASAASGSASFYIPSMSSTSQA 173
Query: 141 IACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDG-SSTSGYFVRDIIQLNQASGNLKTAP 199
+ C+ FC CS +C Y + Y +S+SG+ V D++ L+ +
Sbjct: 174 VPCNSQFCELRKE-----CSTTSQCPYKMVYVSADTSSSGFLVEDVLYLSTEDAIPQI-- 226
Query: 200 LNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVK 259
L + ++FGCG Q+G + AA +G+ G G S+ S LA G FA C
Sbjct: 227 LKAQILFGCGQVQTGSFLDA--AAPNGLFGLGIDMISIPSILAQKGLTSNSFAMCFS-RD 283
Query: 260 GGGIFAIGDVVSPKVKTTPM--VPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTII 317
G G + GD S + TP+ P P Y + + E+ VG + DL E TI
Sbjct: 284 GIGRISFGDQGSSDQEETPLDVNPQHPTYTISISEITVGNSLTDL---------EFSTIF 334
Query: 318 DSGTTLAYLPPMLYDLVLSQFR 339
D+GT+ YL Y + F
Sbjct: 335 DTGTSFTYLADPAYTYITQSFH 356
>gi|115487628|ref|NP_001066301.1| Os12g0177500 [Oryza sativa Japonica Group]
gi|108862256|gb|ABA96613.2| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113648808|dbj|BAF29320.1| Os12g0177500 [Oryza sativa Japonica Group]
gi|215693997|dbj|BAG89196.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 421
Score = 110 bits (276), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 86/288 (29%), Positives = 131/288 (45%), Gaps = 34/288 (11%)
Query: 61 GRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC-AGCSRCPT 119
G +S L G+ +P GLY+ + +G P Y++ VDTGSDL W+ C A C C
Sbjct: 38 GAEESSAVFPLYGDVYPH--GLYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCSK 95
Query: 120 KSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTY---NNRYPSCSPGVRCEYVVTYGDGSS 176
+ L+ P+K+ + C D C + R+ SP +C+Y + Y D S
Sbjct: 96 -----VPHPLYRPTKNKL---VPCVDQMCAALHGGLTGRHKCDSPKQQCDYEIKYADQGS 147
Query: 177 TSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTD-AAVDGILGFGQANS 235
+ G V D L A+ ++ + + FGCG Q +GSST+ +A DG+LG G +
Sbjct: 148 SLGVLVTDSFALRLANSSI----VRPGLAFGCGYDQQ--VGSSTEVSATDGVLGLGSGSV 201
Query: 236 SLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSPKVKTT--PMVPNMP--HYNVILE 291
SLLSQL G + HCL +GGG GD + P + T PM + +Y+
Sbjct: 202 SLLSQLKQHGITKNVVGHCLS-TRGGGFLFFGDDIVPYSRATWAPMARSTSRNYYSPGSA 260
Query: 292 EVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
+ GG PL + + DSG++ Y Y ++ +
Sbjct: 261 NLYFGGRPLGV--------RPMEVVFDSGSSFTYFSAQPYQALVDAIK 300
>gi|3805854|emb|CAA21474.1| putative protein [Arabidopsis thaliana]
gi|7270540|emb|CAB81497.1| putative protein [Arabidopsis thaliana]
Length = 455
Score = 110 bits (276), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 84/260 (32%), Positives = 127/260 (48%), Gaps = 26/260 (10%)
Query: 82 LYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRC-PTKSDL---GIKLTLFDPSKSST 137
L++T V LGTP + V +DTGSDL WV C C +C PT+ +L++++P S+T
Sbjct: 106 LHYTTVKLGTPGMRFMVALDTGSDLFWVPC-DCGKCAPTEGATYASEFELSIYNPKVSTT 164
Query: 138 SGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDG-SSTSGYFVRDIIQLNQASGNLK 196
+ ++ C+++ C R C Y+V+Y +STSG + D++ L N +
Sbjct: 165 NKKVTCNNSLCA----QRNQCLGTFSTCPYMVSYVSAQTSTSGILMEDVMHLTTEDKNPE 220
Query: 197 TAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLD 256
+ + V FGCG QSG AA +G+ G G S+ S LA G V F+ C
Sbjct: 221 R--VEAYVTFGCGQVQSGSFLDI--AAPNGLFGLGMEKISVPSVLAREGLVADSFSMCFG 276
Query: 257 VVKGGGIFAIGDVVSPKVKTTP--MVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERG 314
G G + GD S + TP + P+ P+YN+ + V VG +D DE
Sbjct: 277 -HDGVGRISFGDKGSSDQEETPFNLNPSHPNYNITVTRVRVGTTLID---------DEFT 326
Query: 315 TIIDSGTTLAYLPPMLYDLV 334
+ D+GT+ YL +Y V
Sbjct: 327 ALFDTGTSFTYLVDPMYTTV 346
>gi|222628608|gb|EEE60740.1| hypothetical protein OsJ_14268 [Oryza sativa Japonica Group]
Length = 181
Score = 110 bits (276), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 67/180 (37%), Positives = 90/180 (50%), Gaps = 29/180 (16%)
Query: 32 VFEVENKFK--AGGERERTLSALKQHDTRRHGRMMASIDLELGGNG--HPSATGLYFTKV 87
+F+V KF GG + + AL+ HD RH + + D LGG G S+TG Y +
Sbjct: 27 LFQVRRKFSIMGGGCKGSDIGALQTHDRNRHLSRLVAADFSLGGLGGISTSSTG-YMLQC 85
Query: 88 GLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNF 147
G+ ++ VDTGS WVNC C +CP KSD+ KLTL+DP S
Sbjct: 86 SFGSI---HFFLVDTGSSAFWVNCIPCKQCPRKSDILKKLTLYDPRSS------------ 130
Query: 148 CRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFG 207
P C+ + C ++ TY DG ST G FV D++ NQ SGN T N+S+ FG
Sbjct: 131 ---------PECNTSLLCPFIATYADGGSTIGAFVTDLVHYNQLSGNGLTQSTNTSLTFG 181
>gi|38344196|emb|CAE05761.2| OSJNBa0064G10.12 [Oryza sativa Japonica Group]
Length = 451
Score = 110 bits (276), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 83/262 (31%), Positives = 119/262 (45%), Gaps = 39/262 (14%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSG 139
+G YF +VG+G+P + Y+ VD+GSD++WV C C +C ++D LFDP+ SS+
Sbjct: 127 SGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTD-----PLFDPAASSSFS 181
Query: 140 EIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAP 199
++C CRT +C+Y VTYGDGS T G + + L +
Sbjct: 182 GVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLGGTA------- 234
Query: 200 LNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQL-AAAGNVRKEFAHCLDVV 258
V GCG+R SG G+LG G SL+ QL AAG V F++CL
Sbjct: 235 -VQGVAIGCGHRNSGLF-----VGAAGLLGLGWGAMSLVGQLGGAAGGV---FSYCLASR 285
Query: 259 KGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDE--RGTI 316
GG ++ Y V L + VGG L L SL ++ G +
Sbjct: 286 GAGGAGSLAS---------------SFYYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVV 330
Query: 317 IDSGTTLAYLPPMLYDLVLSQF 338
+D+GT + LP Y + F
Sbjct: 331 MDTGTAVTRLPREAYAALRGAF 352
>gi|297811185|ref|XP_002873476.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
lyrata]
gi|297319313|gb|EFH49735.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
lyrata]
Length = 475
Score = 110 bits (276), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 95/298 (31%), Positives = 134/298 (44%), Gaps = 32/298 (10%)
Query: 50 SALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWV 109
S L + T H S DL +G +G Y VGLGTP ++ + DTGSDL W
Sbjct: 101 SKLSKKLTTNHVSQSQSTDLP-AKDGSTLGSGNYIVTVGLGTPKNDLSLIFDTGSDLTWT 159
Query: 110 NCAGCSR-CPTKSDLGIKLTLFDPSKSSTSGEIACSDNFC--RTTYNNRYPSCSPGVRCE 166
C C R C + K +F+PSKS++ ++CS C ++ SCS C
Sbjct: 160 QCQPCVRTCYDQ-----KEPIFNPSKSTSYYNVSCSSAACGSLSSATGNAGSCSAS-NCI 213
Query: 167 YVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDG 226
Y + YGD S + G+ +D L ++ + V FGCG G V G
Sbjct: 214 YGIQYGDQSFSVGFLAKDKF-------TLTSSDVFDGVYFGCGENNQGLF-----TGVAG 261
Query: 227 ILGFGQANSSLLSQLAAAGNVRKEFAHCL-DVVKGGGIFAIGDV-VSPKVKTTP---MVP 281
+LG G+ S SQ A A N K F++CL G G +S VK TP +
Sbjct: 262 LLGLGRDKLSFPSQTATAYN--KIFSYCLPSSASYTGHLTFGSAGISRSVKFTPISTITD 319
Query: 282 NMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
Y + + + VGG L +P+++ T G +IDSGT + LPP Y + S F+
Sbjct: 320 GTSFYGLNIVAITVGGQKLPIPSTVFST---PGALIDSGTVITRLPPKAYAALRSSFK 374
>gi|222635451|gb|EEE65583.1| hypothetical protein OsJ_21095 [Oryza sativa Japonica Group]
Length = 441
Score = 110 bits (276), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 90/276 (32%), Positives = 126/276 (45%), Gaps = 42/276 (15%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGC--SRCPTKSDLGIKLTLFDPSKSSTSGE 140
Y +G+GTP + V +DTGSDL WV C C C + D LFDPS SS+
Sbjct: 91 YVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCGAGECYAQKD-----PLFDPSSSSSYAS 145
Query: 141 IACSDNFCRTTYNNRYPSCSPGVR------CEYVVTYGDGSSTSGYFVRDIIQLNQASGN 194
+ C + CR Y GV CEY + YG+ ++T+G + + +
Sbjct: 146 VPCDSDACRKLAAGAYGHGCTGVSGGAAALCEYGIEYGNRATTTGVYSTETL-------T 198
Query: 195 LKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHC 254
LK + + FGCG+ Q G DG+LG G A SL+SQ ++ F++C
Sbjct: 199 LKPGVVVADFGFGCGDHQHGPY-----EKFDGLLGLGGAPESLVSQTSS--QFGGPFSYC 251
Query: 255 LDVVKGG-GIFAIG-------DVVSPKVKTTPM--VPNMP-HYNVILEEVEVGGNPLDLP 303
L GG G +G + + TPM +P++P Y V L + VGG PL +P
Sbjct: 252 LPPTSGGAGFLTLGAPPNSSSSTAASGLSFTPMRRLPSVPTFYIVTLTGISVGGAPLAIP 311
Query: 304 TSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
S G +IDSGT + LP Y + S FR
Sbjct: 312 PSAF----SSGMVIDSGTVITGLPATAYAALRSAFR 343
>gi|225216960|gb|ACN85252.1| aspartic proteinase nepenthesin-1 precursor [Oryza officinalis]
Length = 519
Score = 110 bits (275), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 94/265 (35%), Positives = 120/265 (45%), Gaps = 33/265 (12%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPS 133
+G TG Y VGLGTP Y V DTGSD WV C C + + LFDP+
Sbjct: 171 SGRALGTGNYVVTVGLGTPVSRYTVVFDTGSDTTWVQCQPCVVVCYEQ----REKLFDPA 226
Query: 134 KSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASG 193
+SST ++C+ C + CS G C Y V YGDGS + G+F D + L+
Sbjct: 227 RSSTYANVSCAAPACS---DLNIHGCSGG-HCLYGVQYGDGSYSIGFFAMDTLTLSSYDA 282
Query: 194 NLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQ-LAAAGNVRKEFA 252
FGCG R G G + G+LG G+ +SL Q G V FA
Sbjct: 283 -------VKGFRFGCGERNEGLFGEAA-----GLLGLGRGKTSLPVQTYDKYGGV---FA 327
Query: 253 HCLDVVKGGGIF----AIGDVVSPKVKTTPMVP-NMP-HYNVILEEVEVGGNPLDLPTSL 306
HCL G + A + TTPM+ N P Y V + + VGG L +P S+
Sbjct: 328 HCLPARSTGTGYLDFGAGSLAAASARLTTPMLTDNGPTFYYVGMTGIRVGGQLLSIPQSV 387
Query: 307 LGTGDERGTIIDSGTTLAYLPPMLY 331
T GTI+DSGT + LPP Y
Sbjct: 388 FATA---GTIVDSGTVITRLPPAAY 409
>gi|116308959|emb|CAH66084.1| H0209A05.1 [Oryza sativa Indica Group]
Length = 530
Score = 110 bits (275), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 83/262 (31%), Positives = 120/262 (45%), Gaps = 24/262 (9%)
Query: 82 LYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRC-PTKSDLGIKLTLFDPSKSSTSGE 140
L++ V +GTP + V +DTGSDL W+ C C C P S + + PS SSTS
Sbjct: 115 LHYALVTVGTPGQTFMVALDTGSDLFWLPCQ-CDGCTPPASAASGSASFYIPSMSSTSQA 173
Query: 141 IACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDG-SSTSGYFVRDIIQLNQASGNLKTAP 199
+ C+ FC CS +C Y + Y +S+SG+ V D++ L+ +
Sbjct: 174 VPCNSQFCELRKE-----CSTTSQCPYKMVYVSADTSSSGFLVEDVLYLSTEDAIPQI-- 226
Query: 200 LNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVK 259
L + ++FGCG Q+G + AA +G+ G G S+ S LA G FA C
Sbjct: 227 LKAQILFGCGQVQTGSFLDA--AAPNGLFGLGIDMISIPSILAQKGLTSNSFAMCFS-RD 283
Query: 260 GGGIFAIGDVVSPKVKTTPM--VPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTII 317
G G + GD S + TP+ P P Y + + E+ VG + DL E TI
Sbjct: 284 GIGRISFGDQGSSDQEETPLDVNPQHPTYTISISEITVGNSLTDL---------EFSTIF 334
Query: 318 DSGTTLAYLPPMLYDLVLSQFR 339
D+GT+ YL Y + F
Sbjct: 335 DTGTSFTYLADPAYTYITQSFH 356
>gi|54290731|dbj|BAD62401.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 521
Score = 110 bits (275), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 91/276 (32%), Positives = 126/276 (45%), Gaps = 42/276 (15%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGC--SRCPTKSDLGIKLTLFDPSKSSTSGE 140
Y +G+GTP + V +DTGSDL WV C C C + D LFDPS SS+
Sbjct: 171 YVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCGAGECYAQKD-----PLFDPSSSSSYAS 225
Query: 141 IACSDNFCRTTYNNRYPSCSPGVR------CEYVVTYGDGSSTSGYFVRDIIQLNQASGN 194
+ C + CR Y GV CEY + YG+ ++T+G + + +
Sbjct: 226 VPCDSDACRKLAAGAYGHGCTGVSGGAAALCEYGIEYGNRATTTGVYSTETL-------T 278
Query: 195 LKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHC 254
LK + + FGCG+ Q G DG+LG G A SL+SQ ++ F++C
Sbjct: 279 LKPGVVVADFGFGCGDHQHGPY-----EKFDGLLGLGGAPESLVSQTSS--QFGGPFSYC 331
Query: 255 LDVVKGG-GIFAIGDVVSPKVKT-------TPM--VPNMP-HYNVILEEVEVGGNPLDLP 303
L GG G +G + T TPM +P++P Y V L + VGG PL +P
Sbjct: 332 LPPTSGGAGFLTLGAPPNSSSSTAASGLSFTPMRRLPSVPTFYIVTLTGISVGGAPLAIP 391
Query: 304 TSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
S G +IDSGT + LP Y + S FR
Sbjct: 392 PSAF----SSGMVIDSGTVITGLPATAYAALRSAFR 423
>gi|115479489|ref|NP_001063338.1| Os09g0452800 [Oryza sativa Japonica Group]
gi|51535938|dbj|BAD38020.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113631571|dbj|BAF25252.1| Os09g0452800 [Oryza sativa Japonica Group]
gi|125605918|gb|EAZ44954.1| hypothetical protein OsJ_29597 [Oryza sativa Japonica Group]
gi|215740967|dbj|BAG97462.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 453
Score = 110 bits (275), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 86/275 (31%), Positives = 127/275 (46%), Gaps = 43/275 (15%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y + +GTP +DTGSDL+W C C+ C + D LF P SS+ +
Sbjct: 98 YVLDLAVGTPPQPITALLDTGSDLIWTQCDTCTACLRQPD-----PLFSPRMSSSYEPMR 152
Query: 143 CSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNS 202
C+ C ++ SC C Y +YGDG++T GY+ + +SG ++ PL
Sbjct: 153 CAGQLCGDILHH---SCVRPDTCTYRYSYGDGTTTLGYYATERFTFASSSGETQSVPLG- 208
Query: 203 SVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL------- 255
FGCG G L +++ GI+GFG+ SL+SQL+ + F++CL
Sbjct: 209 ---FGCGTMNVGSLNNAS-----GIVGFGRDPLSLVSQLSI-----RRFSYCLTPYASSR 255
Query: 256 -DVVKGGGIFAIG--DVVSPKVKTTPMV---PNMPHYNVILEEVEVGGNPLDLPTSLL-- 307
++ G + +G D + V+TTP++ N Y V V VG L +P S
Sbjct: 256 KSTLQFGSLADVGLYDDATGPVQTTPILQSAQNPTFYYVAFTGVTVGARRLRIPASAFAL 315
Query: 308 ---GTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
G+G G IIDSGT L P + V+ FR
Sbjct: 316 RPDGSG---GVIIDSGTALTLFPVAVLAEVVRAFR 347
>gi|18402471|ref|NP_564539.1| aspartyl protease [Arabidopsis thaliana]
gi|7770346|gb|AAF69716.1|AC016041_21 F27J15.15 [Arabidopsis thaliana]
gi|13430540|gb|AAK25892.1|AF360182_1 unknown protein [Arabidopsis thaliana]
gi|14532748|gb|AAK64075.1| unknown protein [Arabidopsis thaliana]
gi|332194267|gb|AEE32388.1| aspartyl protease [Arabidopsis thaliana]
Length = 583
Score = 110 bits (275), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 100/347 (28%), Positives = 158/347 (45%), Gaps = 58/347 (16%)
Query: 29 GNFVFEVENKFKAGGERERTLSALKQHDTRRHGRMMASIDLEL----------------- 71
+FVF V +K +A ER L ++ + + S+DLEL
Sbjct: 128 ASFVFPVYHKLRAREFHERIL---EEDLGLENENFVESMDLELVNPVKVNDVLSTSAGSI 184
Query: 72 ---------GGNGHPSATGLYFTKVGLGTPTD--EYYVQVDTGSDLLWVNC-AGCSRCPT 119
GGN +P GLY+T++ +G P D Y++ +DTGS+L W+ C A C+ C
Sbjct: 185 DSSTTIFPVGGNVYPD--GLYYTRILVGKPEDGQYYHLDIDTGSELTWIQCDAPCTSCAK 242
Query: 120 KSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPS-CSPGVRCEYVVTYGDGSSTS 178
++ L+ P K + + S+ FC N+ C +C+Y + Y D S +
Sbjct: 243 GAN-----QLYKPRKDNL---VRSSEAFCVEVQRNQLTEHCENCHQCDYEIEYADHSYSM 294
Query: 179 GYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLL 238
G +D L +G+L S ++FGCG Q G L +T DGILG +A SL
Sbjct: 295 GVLTKDKFHLKLHNGSLA----ESDIVFGCGYDQQG-LLLNTLLKTDGILGLSRAKISLP 349
Query: 239 SQLAAAGNVRKEFAHCL--DVVKGGGIFAIGDVV-SPKVKTTPMVPN--MPHYNVILEEV 293
SQLA+ G + HCL D+ G IF D+V S + PM+ + + Y + + ++
Sbjct: 350 SQLASRGIISNVVGHCLASDLNGEGYIFMGSDLVPSHGMTWVPMLHDSRLDAYQMQVTKM 409
Query: 294 EVGGNPLDLPTSLLGTGDERGTII-DSGTTLAYLPPMLYDLVLSQFR 339
G L SL G G ++ D+G++ Y P Y +++ +
Sbjct: 410 SYGQGML----SLDGENGRVGKVLFDTGSSYTYFPNQAYSQLVTSLQ 452
>gi|413916291|gb|AFW56223.1| hypothetical protein ZEAMMB73_420944 [Zea mays]
Length = 383
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 76/224 (33%), Positives = 111/224 (49%), Gaps = 26/224 (11%)
Query: 65 ASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLG 124
+S L G+ +P GLY+ + +G P Y++ VD+GSDL W+ C P +S
Sbjct: 50 SSAVFPLYGDVYPH--GLYYVAMNIGNPPKPYFLDVDSGSDLTWLQCDA----PCRSCNE 103
Query: 125 IKLTLFDPSKSSTSGEIACSDNFCRTTYN---NRYPSCSPGVRCEYVVTYGDGSSTSGYF 181
+ L+ P+KS + C C + +N ++ SP +C+YV+ Y D S++G
Sbjct: 104 VPHPLYRPTKSKL---VPCVHRLCASLHNGLTGKHRCDSPHEQCDYVIKYADQGSSTGVL 160
Query: 182 VRDIIQLNQASGNLKTAPLNSSVIFGCGNRQ---SGDLGSSTDAAVDGILGFGQANSSLL 238
+ D L +G++ SV FGCG Q SGDL S T DG+LG G + SLL
Sbjct: 161 INDSFALRLTNGSVA----RPSVAFGCGYDQQVRSGDLSSPT----DGVLGLGTGSVSLL 212
Query: 239 SQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSP--KVKTTPMV 280
SQL G + HCL ++GGG GD + P + TPM
Sbjct: 213 SQLKQRGVTKNVVGHCLS-LRGGGFLFFGDDLVPYQRATWTPMA 255
>gi|226509616|ref|NP_001152116.1| aspartic proteinase Asp1 precursor [Zea mays]
gi|195652765|gb|ACG45850.1| aspartic proteinase Asp1 precursor [Zea mays]
Length = 432
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 77/225 (34%), Positives = 111/225 (49%), Gaps = 27/225 (12%)
Query: 65 ASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLG 124
+S L G+ +P GLY+ + +G P Y++ VD+GSDL W+ C P +S
Sbjct: 48 SSAVFPLYGDVYPH--GLYYVAMNIGNPPKPYFLDVDSGSDLTWLQCDA----PCRSCNE 101
Query: 125 IKLTLFDPSKSSTSGEIACSDNFCRTTYN----NRYPSCSPGVRCEYVVTYGDGSSTSGY 180
+ L+ P+KS + C C + +N ++ SP +C+YV+ Y D S++G
Sbjct: 102 VPHPLYRPTKSKL---VPCVHRLCASLHNALTGGKHRCESPHEQCDYVIKYADQGSSTGV 158
Query: 181 FVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQ---SGDLGSSTDAAVDGILGFGQANSSL 237
V D L +G++ SV FGCG Q SGDL S T DG+LG G + SL
Sbjct: 159 LVNDSFALRLTNGSVA----RPSVAFGCGYDQQVRSGDLSSPT----DGVLGLGTGSVSL 210
Query: 238 LSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSP--KVKTTPMV 280
LSQL G + HCL ++GGG GD + P + TPM
Sbjct: 211 LSQLKQRGVTKNVVGHCLS-LRGGGFLFFGDDLVPYQRATWTPMA 254
>gi|24430421|dbj|BAC22609.1| 41 kD chloroplast nucleoid DNA binding protein (CND41) [Nicotiana
sylvestris]
Length = 502
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 87/283 (30%), Positives = 124/283 (43%), Gaps = 35/283 (12%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPS 133
+G P TG Y VGLGTP + + DTGSDL W C C KS + +FDPS
Sbjct: 145 SGLPLGTGNYIVNVGLGTPKKDLSLIFDTGSDLTWTQCQPC----VKSCYAQQQPIFDPS 200
Query: 134 KSSTSGEIACSDNFCR--TTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQA 191
S T I+C+ C + P CS C Y + YGD S T G+F +D + L Q
Sbjct: 201 ASKTYSNISCTSTACSGLKSATGNSPGCSSS-NCVYGIQYGDSSFTVGFFAKDTLTLTQN 259
Query: 192 SGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEF 251
+ +FGCG G G + G++G G+ S++ Q A K F
Sbjct: 260 D-------VFDGFMFGCGQNNRGLFGKTA-----GLIGLGRDPLSIVQQTAQ--KFGKYF 305
Query: 252 AHCLDVVKG-GGIFAIGD--------VVSPKVKTTPMVPNM--PHYNVILEEVEVGGNPL 300
++CL +G G G+ V + TP + Y + + + VGG L
Sbjct: 306 SYCLPTSRGSNGHLTFGNGNGVKTSKAVKNGITFTPFASSQGATFYFIDVLGISVGGKAL 365
Query: 301 DLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFRFWIA 343
+ L GTIIDSGT + LP +Y + S F+ +++
Sbjct: 366 SISPMLF---QNAGTIIDSGTVITRLPSTVYGSLKSTFKQFMS 405
>gi|2541876|dbj|BAA22813.1| CND41, chloroplast nucleoid DNA binding protein [Nicotiana tabacum]
Length = 502
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 87/283 (30%), Positives = 125/283 (44%), Gaps = 35/283 (12%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPS 133
+G P TG Y VGLGTP + + DTGSDL W C C KS + +FDPS
Sbjct: 145 SGLPLGTGNYIVNVGLGTPKKDLSLIFDTGSDLTWTQCQPC----VKSCYAQQQPIFDPS 200
Query: 134 KSSTSGEIACSDNFCRT--TYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQA 191
S T I+C+ C + + P CS C Y + YGD S T G+F +D + L Q
Sbjct: 201 TSKTYSNISCTSAACSSLKSATGNSPGCSSS-NCVYGIQYGDSSFTIGFFAKDKLTLTQN 259
Query: 192 SGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEF 251
+ +FGCG G G + G++G G+ S++ Q A K F
Sbjct: 260 D-------VFDGFMFGCGQNNKGLFGKTA-----GLIGLGRDPLSIVQQTAQ--KFGKYF 305
Query: 252 AHCLDVVKG-GGIFAIGD--------VVSPKVKTTPMVPNM--PHYNVILEEVEVGGNPL 300
++CL +G G G+ V + TP + +Y + + + VGG L
Sbjct: 306 SYCLPTSRGSNGHLTFGNGNGVKASKAVKNGITFTPFASSQGTAYYFIDVLGISVGGKAL 365
Query: 301 DLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFRFWIA 343
+ L GTIIDSGT + LP Y + S F+ +++
Sbjct: 366 SISPMLF---QNAGTIIDSGTVITRLPSTAYGSLKSAFKQFMS 405
>gi|413923782|gb|AFW63714.1| hypothetical protein ZEAMMB73_300584 [Zea mays]
Length = 458
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 91/261 (34%), Positives = 133/261 (50%), Gaps = 28/261 (10%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y VG+G+P + +DTGSD+ WV C CS+C ++ D +LFDPS SST +
Sbjct: 122 YVITVGIGSPAVTQTMSMDTGSDVSWVQCKPCSQCHSEVD-----SLFDPSSSSTYSPFS 176
Query: 143 CSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNS 202
CS C ++ + +C+Y+V YGD SST+G + D + L ++ +
Sbjct: 177 CSSAPCAQLSQSQEGNGCMSSQCQYIVNYGDSSSTTGTYSSDTLTLGSSA--------MT 228
Query: 203 SVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKG-G 261
FGC +SG T DG++G G SL SQ AG F++CL G
Sbjct: 229 DFQFGCSQSESGGFNDQT----DGLMGLGGGAQSLASQ--TAGTFGTAFSYCLPPTSGSS 282
Query: 262 GIFAIGDVVSPKVKTTPMV--PNMP-HYNVILEEVEVGGNPLDLPTSLLGTGDERGTIID 318
G +G S VK TPM+ +P +Y V+LE ++VG L+LPTS+ G+++D
Sbjct: 283 GFLTLGTGSSGFVK-TPMLRSTQIPTYYVVLLESIKVGSQQLNLPTSVF----SAGSLMD 337
Query: 319 SGTTLAYLPPMLYDLVLSQFR 339
SGT + LPP Y + S F+
Sbjct: 338 SGTIITRLPPTAYSALSSAFK 358
>gi|168014188|ref|XP_001759635.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689174|gb|EDQ75547.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 485
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 83/262 (31%), Positives = 129/262 (49%), Gaps = 34/262 (12%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFD----PSKSS 136
G Y ++V +GTP E+ + VDTGS + +V C+ C+ C G FD P SS
Sbjct: 97 GYYTSRVFIGTPAQEFALIVDTGSTVTYVPCSSCTHC------GHHQACFDPRFKPDNSS 150
Query: 137 TSGEIACSDNFCRTTYNNRYPSCSPGV-RCEYVVTYGDGSSTSGYFVRDIIQLNQASGNL 195
+ ++C+ C T C V +C+Y Y + SS+ G +D++ S L
Sbjct: 151 SYQTVSCNSPDCITKM------CDARVHQCKYERVYAEMSSSKGVLGKDLLGFGNGS-RL 203
Query: 196 KTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL 255
+ PL +FGC ++GDL DGI+G G+ S++ QL G + F+ C
Sbjct: 204 QPHPL----LFGCETAETGDL---YLQHADGIMGLGRGPLSIVDQLVGTGAMEDSFSLCY 256
Query: 256 -DVVKGGGIFAIGDVVSPK----VKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTG 310
+ +GGG +G + P K+ P N +YN+ L E++V G L++P+ +
Sbjct: 257 GGMDEGGGSMVLGAIPPPPAMVFAKSDPNRSN--YYNLELSEIQVQGVSLNVPSEVF--N 312
Query: 311 DERGTIIDSGTTLAYLPPMLYD 332
GT++DSGTT AYLP +D
Sbjct: 313 GRLGTVLDSGTTYAYLPDKAFD 334
>gi|326498555|dbj|BAJ98705.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 508
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 80/262 (30%), Positives = 126/262 (48%), Gaps = 22/262 (8%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNC-AGCSRCPTKSDLGIKLTLFDPSKSSTSGEI 141
Y+T + +G P Y++ VDTGS L W+ C A C+ C TK L+ P+K + +
Sbjct: 129 YYTSINIGNPARPYFLDVDTGSALTWIQCDAPCTNC-TKG----PHPLYKPAKENI---V 180
Query: 142 ACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLN 201
D+ C+ N+ C +C+Y + Y D SS++G RD ++L A G + N
Sbjct: 181 PPRDSHCQELQGNQN-YCDTCKQCDYEIAYADRSSSAGVLARDNMELITADGERE----N 235
Query: 202 SSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGG 261
++FGC + Q G L S A+ DGILG SL +QLA G + F HC+ G
Sbjct: 236 MDLVFGCAHDQQGKLLGSP-ASSDGILGLSNGAMSLPTQLAKQGIISNVFGHCIATDPSG 294
Query: 262 GIFA-IGDVVSPKVKTTPM-VPNMPH--YNVILEEVEVGGNPLDLPTSLLGTGDERGTII 317
+ +GD P+ T + V N P Y+ ++++V G L++ G I
Sbjct: 295 SAYMFLGDDYVPRWGMTWVPVRNGPEDVYSTVVQKVNYGCQELNVREQ---AGKLTQVIF 351
Query: 318 DSGTTLAYLPPMLYDLVLSQFR 339
DSG++ Y P +Y +++
Sbjct: 352 DSGSSYTYFPHEIYTSLITSLE 373
>gi|108862257|gb|ABA96612.2| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
Length = 451
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 86/288 (29%), Positives = 131/288 (45%), Gaps = 34/288 (11%)
Query: 61 GRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC-AGCSRCPT 119
G +S L G+ +P GLY+ + +G P Y++ VDTGSDL W+ C A C C
Sbjct: 38 GAEESSAVFPLYGDVYPH--GLYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCSK 95
Query: 120 KSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTY---NNRYPSCSPGVRCEYVVTYGDGSS 176
+ L+ P+K+ + C D C + R+ SP +C+Y + Y D S
Sbjct: 96 -----VPHPLYRPTKNKL---VPCVDQMCAALHGGLTGRHKCDSPKQQCDYEIKYADQGS 147
Query: 177 TSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTD-AAVDGILGFGQANS 235
+ G V D L A+ ++ + + FGCG Q +GSST+ +A DG+LG G +
Sbjct: 148 SLGVLVTDSFALRLANSSI----VRPGLAFGCGYDQQ--VGSSTEVSATDGVLGLGSGSV 201
Query: 236 SLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSPKVKTT--PMVPNMP--HYNVILE 291
SLLSQL G + HCL +GGG GD + P + T PM + +Y+
Sbjct: 202 SLLSQLKQHGITKNVVGHCLS-TRGGGFLFFGDDIVPYSRATWAPMARSTSRNYYSPGSA 260
Query: 292 EVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
+ GG PL + + DSG++ Y Y ++ +
Sbjct: 261 NLYFGGRPLGV--------RPMEVVFDSGSSFTYFSAQPYQALVDAIK 300
>gi|115448471|ref|NP_001048015.1| Os02g0730700 [Oryza sativa Japonica Group]
gi|46390468|dbj|BAD15929.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
Group]
gi|46390864|dbj|BAD16368.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
Group]
gi|113537546|dbj|BAF09929.1| Os02g0730700 [Oryza sativa Japonica Group]
gi|215697021|dbj|BAG91015.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222623612|gb|EEE57744.1| hypothetical protein OsJ_08261 [Oryza sativa Japonica Group]
Length = 573
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 84/278 (30%), Positives = 131/278 (47%), Gaps = 26/278 (9%)
Query: 69 LELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC-AGCSRCPTKSDLGIKL 127
L + GN P G Y+T + +G P Y++ VDTGSDL W+ C A C+ C
Sbjct: 191 LPIKGNVFPD--GQYYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPH----- 243
Query: 128 TLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQ 187
L+ P+K + D C+ N+ C +C+Y + Y D SS+ G RD +
Sbjct: 244 PLYKPAKEKI---VPPKDLLCQELQGNQN-YCETCKQCDYEIEYADRSSSMGVLARDDMH 299
Query: 188 LNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNV 247
+ +G + +FGC Q G L +S A DGILG A SL SQLA G +
Sbjct: 300 IITTNGGREKL----DFVFGCAYDQQGQLLASP-AKTDGILGLSSAGISLPSQLANQGII 354
Query: 248 RKEFAHCLDV-VKGGGIFAIGDVVSPK--VKTTPMVPNMPH--YNVILEEVEVGGNPLDL 302
F HC+ GGG +GD P+ + +TP + + P ++ ++V G L +
Sbjct: 355 SNVFGHCITRDPNGGGYMFLGDDYVPRWGMTSTP-IRSAPDNLFHTEAQKVYYGDQQLSM 413
Query: 303 PTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFRF 340
+ +G+ I DSG++ YLP +Y +++ ++
Sbjct: 414 RGA---SGNSVQVIFDSGSSYTYLPDEIYKNLIAAIKY 448
>gi|218191512|gb|EEC73939.1| hypothetical protein OsI_08807 [Oryza sativa Indica Group]
Length = 574
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 84/278 (30%), Positives = 131/278 (47%), Gaps = 26/278 (9%)
Query: 69 LELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC-AGCSRCPTKSDLGIKL 127
L + GN P G Y+T + +G P Y++ VDTGSDL W+ C A C+ C
Sbjct: 192 LPIKGNVFPD--GQYYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPH----- 244
Query: 128 TLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQ 187
L+ P+K + D C+ N+ C +C+Y + Y D SS+ G RD +
Sbjct: 245 PLYKPAKEKI---VPPKDLLCQELQGNQN-YCETCKQCDYEIEYADRSSSMGVLARDDMH 300
Query: 188 LNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNV 247
+ +G + +FGC Q G L +S A DGILG A SL SQLA G +
Sbjct: 301 IITTNGGREKL----DFVFGCAYDQQGQLLASP-AKTDGILGLSSAGISLPSQLANQGII 355
Query: 248 RKEFAHCLDV-VKGGGIFAIGDVVSPK--VKTTPMVPNMPH--YNVILEEVEVGGNPLDL 302
F HC+ GGG +GD P+ + +TP + + P ++ ++V G L +
Sbjct: 356 SNVFGHCITRDPNGGGYMFLGDDYVPRWGMTSTP-IRSAPDNLFHTEAQKVYYGDQQLSM 414
Query: 303 PTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFRF 340
+ +G+ I DSG++ YLP +Y +++ ++
Sbjct: 415 RGA---SGNSVQVIFDSGSSYTYLPDEIYKNLIAAIKY 449
>gi|357464807|ref|XP_003602685.1| Aspartic proteinase Asp1 [Medicago truncatula]
gi|355491733|gb|AES72936.1| Aspartic proteinase Asp1 [Medicago truncatula]
Length = 440
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 87/275 (31%), Positives = 121/275 (44%), Gaps = 29/275 (10%)
Query: 62 RMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC-AGCSRCPTK 120
R +S+ + GN +P G Y + +G P Y++ +DTGSDL W+ C A CSRC
Sbjct: 66 RSGSSVVFPVHGNVYP--VGFYNVTINIGYPPRPYFLDIDTGSDLTWLQCDAPCSRCSQT 123
Query: 121 SDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGY 180
L+ PS + C C + + C +C+Y V Y D S+ G
Sbjct: 124 PH-----PLYRPSNDL----VPCRHPLCASVHQTDNYECEVEHQCDYEVEYADHYSSLGV 174
Query: 181 FVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQ 240
V D+ LN +G L + GCG Q S+ VDG+LG G+ SSL+SQ
Sbjct: 175 LVNDVYVLNFTNG----VQLKVRMALGCGYDQI--FPDSSYHPVDGMLGLGRGKSSLISQ 228
Query: 241 LAAAGNVRKEFAHCLDVVKGGGIFAIGDVV-SPKVKTTPMVP-NMPHYNVILEEVEVGGN 298
L G VR HCL GG IF GDV S ++ TPM + HY+ E+ +GG
Sbjct: 229 LNGQGLVRNVVGHCLSAQGGGYIF-FGDVYDSSRLAWTPMSSRDYKHYSAGAAELVLGGK 287
Query: 299 PLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDL 333
L + D+G++ Y Y L
Sbjct: 288 RTGFGNLL--------AVFDAGSSYTYFNSNAYQL 314
>gi|357158688|ref|XP_003578209.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 443
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 86/278 (30%), Positives = 128/278 (46%), Gaps = 40/278 (14%)
Query: 78 SATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSST 137
++ G Y +G+GTP Y +DTGSDL+W CA C C + FDP++S +
Sbjct: 84 ASEGEYLMSMGIGTPPRYYSAILDTGSDLIWTQCAPCMLCVDQ-----PTPFFDPAQSPS 138
Query: 138 SGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKT 197
++ C+ C Y YP C V C Y YGD ++T+G + G T
Sbjct: 139 YAKLPCNSPMCNALY---YPLCYRNV-CVYQYFYGDSANTAGVLSNETFTF----GTNDT 190
Query: 198 APLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDV 257
+ FGCGN +G L + + G++GFG+ SL+SQL + F++CL
Sbjct: 191 RVTVPRIAFGCGNLNAGSLFNGS-----GMVGFGRGPLSLVSQLGS-----PRFSYCLTS 240
Query: 258 VKGG-------GIFAIGDVVSPK----VKTTPMV--PNMP-HYNVILEEVEVGGNPLDLP 303
G +A + S V++TP + P +P Y + + + VGG L +
Sbjct: 241 FMSPVPSRLYFGAYATLNSTSASTGEPVQSTPFIVNPGLPTMYYLNMTGISVGGELLPID 300
Query: 304 TSLLGTGDERGT---IIDSGTTLAYLPPMLYDLVLSQF 338
S+ D GT IIDSG+T+ YL YD+V F
Sbjct: 301 PSVFAINDADGTGGVIIDSGSTITYLARAAYDMVHQAF 338
>gi|302780643|ref|XP_002972096.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
gi|300160395|gb|EFJ27013.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
Length = 393
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 96/278 (34%), Positives = 123/278 (44%), Gaps = 44/278 (15%)
Query: 76 HPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKS 135
HP G Y + +GTP + DTGSDL+WV C+ C T+FDP +S
Sbjct: 49 HPDGGG-YVMDISVGTPGKRFRAIADTGSDLVWVQSEPCTGCSGG-------TIFDPRQS 100
Query: 136 STSGEIACSDNFCRTTYNNRYPSCSPGVR-CEYVVTYGDGSSTSGYFVRDIIQLNQASGN 194
ST E+ CS C SC PG C Y YG G T G F RD I L S
Sbjct: 101 STFREMDCSSQLCAELPG----SCEPGSSTCSYSYEYGSG-ETEGEFARDTISLGTTSDG 155
Query: 195 LKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHC 254
+ P S GCG SG G VDG++G GQ SL SQL+AA + +F++C
Sbjct: 156 SQKFP---SFAVGCGMVNSGFDG------VDGLVGLGQGPVSLTSQLSAA--IDSKFSYC 204
Query: 255 LDVVKG---------GGIFAIGDVVSPKVKTTPMVPNMPHYNVI-LEEVEVGGNPLDLPT 304
L + G A+ K TP P Y ++ + + V G + P
Sbjct: 205 LVDINSQSESSPLLFGPSAALHGTGIQSTKITPPSDTYPTYYLLTVNGIAVAGQTMGSP- 263
Query: 305 SLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFRFWI 342
GT TIIDSGTTL Y+P +Y VLS+ +
Sbjct: 264 ---GT-----TIIDSGTTLTYVPSGVYGRVLSRMESMV 293
>gi|125553822|gb|EAY99427.1| hypothetical protein OsI_21398 [Oryza sativa Indica Group]
Length = 469
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 81/248 (32%), Positives = 118/248 (47%), Gaps = 29/248 (11%)
Query: 100 VDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRT--TYNNRYP 157
+DT SD+ WV C S CPT K L+DP+KSS+SG +C+ C Y N
Sbjct: 148 LDTASDVTWVQC---SPCPTPPCYPQKDVLYDPTKSSSSGVFSCNSPTCTQLGPYAN--- 201
Query: 158 SCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLG 217
C+ +C+Y V Y DG+ST+G ++ D++ + A+ S FGC + G
Sbjct: 202 GCTNNNQCQYRVRYPDGTSTAGTYISDLLTITPATA-------VRSFQFGCSHGVQGSFS 254
Query: 218 SSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIG--DVVSPKVK 275
+ AA GI+ G SL+SQ AA + F+HC G F +G V + +
Sbjct: 255 FGSSAA--GIMALGGGPESLVSQTAA--TYGRVFSHCFPPPTRRGFFTLGVPRVAAWRYV 310
Query: 276 TTPMV--PNMP--HYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLY 331
TPM+ P +P Y V LE + V G + +P ++ G +DS T + LPP Y
Sbjct: 311 LTPMLKNPAIPPTFYMVRLEAIAVAGQRIAVPPTVFAA----GAALDSRTAITRLPPTAY 366
Query: 332 DLVLSQFR 339
+ FR
Sbjct: 367 QALRQAFR 374
>gi|212721496|ref|NP_001131929.1| uncharacterized protein LOC100193320 precursor [Zea mays]
gi|194692946|gb|ACF80557.1| unknown [Zea mays]
Length = 424
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 75/220 (34%), Positives = 109/220 (49%), Gaps = 26/220 (11%)
Query: 69 LELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLT 128
L G+ +P GLY+ + +G P Y++ VD+GSDL W+ C P +S +
Sbjct: 45 FPLYGDVYPH--GLYYVAMNIGNPPKPYFLDVDSGSDLTWLQCDA----PCRSCNEVPHP 98
Query: 129 LFDPSKSSTSGEIACSDNFCRTTYN---NRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDI 185
L+ P+KS + C C + +N ++ SP +C+YV+ Y D S++G + D
Sbjct: 99 LYRPTKSKL---VPCVHRLCASLHNGLTGKHRCDSPHEQCDYVIKYADQGSSTGVLINDS 155
Query: 186 IQLNQASGNLKTAPLNSSVIFGCGNRQ---SGDLGSSTDAAVDGILGFGQANSSLLSQLA 242
L +G++ SV FGCG Q SGDL S T DG+LG G + SLLSQL
Sbjct: 156 FALRLTNGSVA----RPSVAFGCGYDQQVRSGDLSSPT----DGVLGLGTGSVSLLSQLK 207
Query: 243 AAGNVRKEFAHCLDVVKGGGIFAIGDVVSP--KVKTTPMV 280
G + HCL ++GGG GD + P + TPM
Sbjct: 208 QRGVTKNVVGHCLS-LRGGGFLFFGDDLVPYQRATWTPMA 246
>gi|297734190|emb|CBI15437.3| unnamed protein product [Vitis vinifera]
Length = 473
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 91/318 (28%), Positives = 142/318 (44%), Gaps = 33/318 (10%)
Query: 28 MGNFVFEVENKFKAGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKV 87
+G FV N K GG + S +S + G+ +P+ GLYFT +
Sbjct: 57 LGKFVDFHVNDMKPGGINKLATSV---------SAFDSSTIFPVRGDVYPN--GLYFTHI 105
Query: 88 GLGTPTDEYYVQVDTGSDLLWVNC-AGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDN 146
+G+P Y++ +DTGSDL W+ C A C+ C + L+ P K + + D+
Sbjct: 106 FVGSPPRRYFLDMDTGSDLTWIQCDAPCTSCAKGPN-----PLYKPKKGNL---VPLKDS 157
Query: 147 FCRTTYNN-RYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVI 205
C N + C +C+Y + Y D SS+ G D + L A+G+L ++
Sbjct: 158 LCVEVQRNLKTGYCETCEQCDYEIEYADHSSSMGVLASDDLHLMLANGSLTKL----GIM 213
Query: 206 FGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDV-VKGGGIF 264
FGC Q G L +S A DGILG +A SL SQLA+ + HCL GGG
Sbjct: 214 FGCAYDQQGLLLNSL-AKTDGILGLSKAKVSLPSQLASQRIINNVLGHCLTSDATGGGYM 272
Query: 265 AIGDVVSPK--VKTTPMV-PNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGT 321
+GD P + PM+ + P+Y+ + ++ G L L G + D+G+
Sbjct: 273 FLGDDFVPYWGMAWVPMLNSHSPNYHSQIMKISHGSRQLSLGRQ---DGRTERVVFDTGS 329
Query: 322 TLAYLPPMLYDLVLSQFR 339
+ Y P Y +++ +
Sbjct: 330 SYTYFPKEAYYALVASLK 347
>gi|413936884|gb|AFW71435.1| hypothetical protein ZEAMMB73_652585 [Zea mays]
Length = 287
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 60/121 (49%), Positives = 76/121 (62%), Gaps = 12/121 (9%)
Query: 22 VGGGGVMGNFVFEVENKFKAGGER--ERTLSALKQHDTRRHGRMM-ASIDLELGGNGHPS 78
VG G G VF+V KF G R L+AL++HD RHGR++ A +DL LGG G P+
Sbjct: 25 VGRAGATG--VFQVRRKFPRHGRRGVAEHLAALRRHDVGRHGRLLGAVVDLGLGGVGLPT 82
Query: 79 ATG-------LYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFD 131
A G LY+T++ +G+P YYVQVDTGSD+LWVNC C CP +S LGI+LT
Sbjct: 83 AAGCLPAQRSLYYTRIEIGSPPKGYYVQVDTGSDILWVNCIRCDGCPARSGLGIELTPLQ 142
Query: 132 P 132
P
Sbjct: 143 P 143
>gi|125546587|gb|EAY92726.1| hypothetical protein OsI_14476 [Oryza sativa Indica Group]
Length = 530
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 83/262 (31%), Positives = 120/262 (45%), Gaps = 24/262 (9%)
Query: 82 LYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRC-PTKSDLGIKLTLFDPSKSSTSGE 140
L++ V +GTP + V +DTGSDL W+ C C C P S + + PS SSTS
Sbjct: 115 LHYALVTVGTPGQTFMVALDTGSDLFWLPCQ-CDGCTPPASAASGSASFYIPSMSSTSQA 173
Query: 141 IACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDG-SSTSGYFVRDIIQLNQASGNLKTAP 199
+ C+ FC CS +C Y + Y +S+SG+ V D++ L+ +
Sbjct: 174 VPCNSQFCELRKE-----CSTTSQCPYKMVYVSADTSSSGFLVEDVLYLSTEDAIPQI-- 226
Query: 200 LNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVK 259
L + ++FGCG Q+G + AA +G+ G G S+ S LA G FA C
Sbjct: 227 LKAQILFGCGQVQTGSFLDA--AAPNGLFGLGIDMISIPSILAQKGLTSNSFAMCFS-RD 283
Query: 260 GGGIFAIGDVVSPKVKTTPM--VPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTII 317
G G + GD S + TP+ P P Y + + E+ VG + DL E TI
Sbjct: 284 GIGRISFGDQGSSDQEETPLDVNPQHPTYTISISEMTVGNSLTDL---------EFSTIF 334
Query: 318 DSGTTLAYLPPMLYDLVLSQFR 339
D+GT+ YL Y + F
Sbjct: 335 DTGTSFTYLADPAYTYITQSFH 356
>gi|238006986|gb|ACR34528.1| unknown [Zea mays]
gi|413916290|gb|AFW56222.1| aspartic proteinase Asp1 [Zea mays]
Length = 433
Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 76/224 (33%), Positives = 111/224 (49%), Gaps = 26/224 (11%)
Query: 65 ASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLG 124
+S L G+ +P GLY+ + +G P Y++ VD+GSDL W+ C P +S
Sbjct: 50 SSAVFPLYGDVYPH--GLYYVAMNIGNPPKPYFLDVDSGSDLTWLQCDA----PCRSCNE 103
Query: 125 IKLTLFDPSKSSTSGEIACSDNFCRTTYN---NRYPSCSPGVRCEYVVTYGDGSSTSGYF 181
+ L+ P+KS + C C + +N ++ SP +C+YV+ Y D S++G
Sbjct: 104 VPHPLYRPTKSKL---VPCVHRLCASLHNGLTGKHRCDSPHEQCDYVIKYADQGSSTGVL 160
Query: 182 VRDIIQLNQASGNLKTAPLNSSVIFGCGNRQ---SGDLGSSTDAAVDGILGFGQANSSLL 238
+ D L +G++ SV FGCG Q SGDL S T DG+LG G + SLL
Sbjct: 161 INDSFALRLTNGSVA----RPSVAFGCGYDQQVRSGDLSSPT----DGVLGLGTGSVSLL 212
Query: 239 SQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSP--KVKTTPMV 280
SQL G + HCL ++GGG GD + P + TPM
Sbjct: 213 SQLKQRGVTKNVVGHCLS-LRGGGFLFFGDDLVPYQRATWTPMA 255
>gi|168000300|ref|XP_001752854.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696017|gb|EDQ82358.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 525
Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 87/300 (29%), Positives = 139/300 (46%), Gaps = 40/300 (13%)
Query: 52 LKQHDTRRHGRMM------ASIDLELGGNGHPSAT----GLYFTKVGLGTPTDEYYVQVD 101
L+ HD RH R +S+D + G+ + GL+++ + +GTP ++ V +D
Sbjct: 70 LRDHDVARHTRTARRILAASSMDQYVLIQGNATEQLFGGGLHYSYIDIGTPNVQFLVVLD 129
Query: 102 TGSDLLWVNCAGCSRCP-----TKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRY 156
TGSDLLW+ C C C +K +L + PS SST+ + CSD C +
Sbjct: 130 TGSDLLWIPCE-CESCAPLSAESKDPRTSQLNPYTPSLSSTAKPVLCSDPLCEMSST--- 185
Query: 157 PSCSPGVRCEYVVTY-GDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGD 215
+P +C Y + Y +STSG D + + SG P+ V GCG Q+G
Sbjct: 186 -CMAPTDQCPYEINYVSANTSTSGALYEDYMYFMRESGG---NPVKLPVYLGCGKVQTGS 241
Query: 216 LGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSPKVK 275
L AA +G++G G + S+ ++LA+ G + F+ C+ G G GD +
Sbjct: 242 LLKG--AAPNGLMGLGTTDISVPNKLASTGQLADSFSLCIS-PGGSGTLTFGDEGPAAQR 298
Query: 276 TTPMVPN----MPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLY 331
TTP++P + Y V ++ + VG L + + L D+GT+ YL +Y
Sbjct: 299 TTPIIPKSVSMLDTYIVEIDSITVGNTNLLMASHAL---------FDTGTSFTYLSKTVY 349
>gi|326505434|dbj|BAJ95388.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 529
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 89/265 (33%), Positives = 125/265 (47%), Gaps = 29/265 (10%)
Query: 82 LYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKS--DLG-IKLTLFDPSKSSTS 138
L++ V LGTP + V +DTGSDL WV C C +C S D G +K ++ P KSSTS
Sbjct: 107 LHYAVVALGTPNVTFLVALDTGSDLFWVPC-DCLKCAPLSSPDYGNLKFDVYSPRKSSTS 165
Query: 139 GEIACSDNFCRTTYNNRYPSCSPGVR-CEYVVTY-GDGSSTSGYFVRDIIQLNQASGNLK 196
++ CS N C + CS C Y + Y D +S+ G V D++ L SG+ K
Sbjct: 166 RKVPCSSNMC-----DLQTECSAASNSCPYKIEYLSDNTSSKGVLVEDVMYLATESGHSK 220
Query: 197 TAPLNSSVIFGCGNRQSGD-LGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL 255
+ + FGCG Q+G LGS AA +G+LG G + S+ S LA+ G F+ C
Sbjct: 221 IT--QAPITFGCGQVQTGSFLGS---AAPNGLLGLGMDSKSVPSLLASQGVAANSFSMCF 275
Query: 256 DVVKGGGIFAIGDVVSPKVKTTPM--VPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDER 313
G G GD S TP+ + P+YN+ + GG S
Sbjct: 276 G-EDGHGRINFGDTGSADQLETPLNIYKHNPYYNISIVGAMAGGKTFSTKFS-------- 326
Query: 314 GTIIDSGTTLAYLPPMLYDLVLSQF 338
++DSGT+ L +Y + S F
Sbjct: 327 -AVVDSGTSFTALSDPMYTEITSAF 350
>gi|356514298|ref|XP_003525843.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 663
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 88/279 (31%), Positives = 134/279 (48%), Gaps = 41/279 (14%)
Query: 56 DTRRH--GRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAG 113
+++RH RM DL L G Y T++ +GTP + + VDTGS + +V C+
Sbjct: 91 ESKRHPNARMRLHDDLLLNG--------YYTTRLWIGTPPQMFALIVDTGSTVTYVPCST 142
Query: 114 CSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGD 173
C +C D F P SST + C+ + C + ++C Y Y +
Sbjct: 143 CEQCGRHQD-----PKFQPESSSTYQPVKCTID-CNCDGDR--------MQCVYERQYAE 188
Query: 174 GSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQA 233
S++SG D+I S + AP +FGC N ++GDL S DGI+G G+
Sbjct: 189 MSTSSGVLGEDVISFGNQS---ELAP--QRAVFGCENVETGDLYSQ---HADGIMGLGRG 240
Query: 234 NSSLLSQLAAAGNVRKEFAHC---LDVVKGGGIFAIGDVVSPKVKTTPMV-PNM-PHYNV 288
+ S++ QL + F+ C +DV GGG +G + P T P+ P+YN+
Sbjct: 241 DLSIMDQLVDKKVISDSFSLCYGGMDV--GGGAMVLGGISPPSDMTFAYSDPDRSPYYNI 298
Query: 289 ILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLP 327
L+E+ V G L L ++ + GT++DSGTT AYLP
Sbjct: 299 DLKEMHVAGKRLPLNANVF--DGKHGTVLDSGTTYAYLP 335
>gi|225455900|ref|XP_002275943.1| PREDICTED: aspartic proteinase Asp1-like [Vitis vinifera]
Length = 686
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 93/320 (29%), Positives = 144/320 (45%), Gaps = 37/320 (11%)
Query: 28 MGNFVFEVENKFKAGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKV 87
+G FV N K GG + S +S + G+ +P+ GLYFT +
Sbjct: 270 LGKFVDFHVNDMKPGGINKLATSV---------SAFDSSTIFPVRGDVYPN--GLYFTHI 318
Query: 88 GLGTPTDEYYVQVDTGSDLLWVNC-AGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDN 146
+G+P Y++ +DTGSDL W+ C A C+ C + L+ P K + + D+
Sbjct: 319 FVGSPPRRYFLDMDTGSDLTWIQCDAPCTSCAKGPN-----PLYKPKKGNL---VPLKDS 370
Query: 147 FCRTTYNN-RYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVI 205
C N + C +C+Y + Y D SS+ G D + L A+G+L ++
Sbjct: 371 LCVEVQRNLKTGYCETCEQCDYEIEYADHSSSMGVLASDDLHLMLANGSLTKL----GIM 426
Query: 206 FGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDV-VKGGGIF 264
FGC Q G L +S A DGILG +A SL SQLA+ + HCL GGG
Sbjct: 427 FGCAYDQQGLLLNSL-AKTDGILGLSKAKVSLPSQLASQRIINNVLGHCLTSDATGGGYM 485
Query: 265 AIGDVVSPK--VKTTPMV-PNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERG--TIIDS 319
+GD P + PM+ + P+Y+ + ++ G L LG D R + D+
Sbjct: 486 FLGDDFVPYWGMAWVPMLNSHSPNYHSQIMKISHGSRQLS-----LGRQDGRTERVVFDT 540
Query: 320 GTTLAYLPPMLYDLVLSQFR 339
G++ Y P Y +++ +
Sbjct: 541 GSSYTYFPKEAYYALVASLK 560
>gi|226495667|ref|NP_001146721.1| uncharacterized protein LOC100280323 [Zea mays]
gi|219888491|gb|ACL54620.1| unknown [Zea mays]
Length = 557
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 89/304 (29%), Positives = 136/304 (44%), Gaps = 27/304 (8%)
Query: 42 GGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVD 101
GG + R + + T R ++ L + GN P G Y+T + +G P Y++ VD
Sbjct: 151 GGRKARNRMEVAKAAT---ARTNSTALLPIKGNVFPD--GQYYTSIFIGNPPRPYFLDVD 205
Query: 102 TGSDLLWVNC-AGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCS 160
TGSDL W+ C A C+ L+ P+K + D C+ N+ C
Sbjct: 206 TGSDLTWIQCDAPCTNFAKGPH-----PLYKPAKEKI---VPPRDLLCQELQGNQN-YCE 256
Query: 161 PGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSST 220
+C+Y + Y D SS+ G RD + + +G + +FGC Q G L SS
Sbjct: 257 TCKQCDYEIEYADQSSSMGVLARDDMHMIATNGGREKL----DFVFGCAYDQQGQLLSSP 312
Query: 221 DAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFA-IGDVVSPKVKTT-P 278
A DGILG A S SQLA+ G + F HC+ +GGG + +GD P+ T
Sbjct: 313 -AKTDGILGLSSAAISFPSQLASHGIIANVFGHCITREQGGGGYMFLGDDYVPRWGVTWT 371
Query: 279 MVPNMPH--YNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLS 336
+ + P Y+ V+ G L P G I DSG++ YLP +Y+ +++
Sbjct: 372 SIRSGPDNLYHTQAHHVKYGDQQLRRPEQ---AGSTVQVIFDSGSSYTYLPNEIYENLVA 428
Query: 337 QFRF 340
++
Sbjct: 429 AIKY 432
>gi|449508697|ref|XP_004163385.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase Asp1-like
[Cucumis sativus]
Length = 418
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 89/296 (30%), Positives = 135/296 (45%), Gaps = 31/296 (10%)
Query: 44 ERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTG 103
ER+R + ++ T +SI L L GN +P+ G Y + +G P Y++ DTG
Sbjct: 23 ERKRPILSVP---TASSSFASSSIVLPLQGNVYPN--GFYNVTLYVGQPPKPYFLDPDTG 77
Query: 104 SDLLWVNC-AGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPG 162
SDL W+ C A C +C P ++ + C D C + +++ C
Sbjct: 78 SDLTWLQCDAPCQQC---------TETLHPLYQPSNDLVPCKDPLCMSLHSSMDHRCENP 128
Query: 163 VRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDA 222
+C+Y V Y DG S+ G VRD+ LN +G+ P+ + GCG Q D GSS+
Sbjct: 129 DQCDYEVEYADGGSSLGVLVRDVFPLNLTNGD----PIRPRLALGCGYDQ--DPGSSSYH 182
Query: 223 AVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSP-KVKTTPMVP 281
+DGILG G+ S++SQL G VR HC + GG F + P ++ TPM
Sbjct: 183 PMDGILGLGRGAVSIVSQLHNQGIVRNVVGHCFNSKGGGYXFFGDGIYDPYRLVWTPMSR 242
Query: 282 NMP-HYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLS 336
+ P HY+ E+ G L + + DSG++ Y Y ++ S
Sbjct: 243 DYPKHYSPGFGELIFNGRSTGLRNLFV--------VFDSGSSYTYFNAQAYQVLTS 290
>gi|224060469|ref|XP_002300215.1| predicted protein [Populus trichocarpa]
gi|222847473|gb|EEE85020.1| predicted protein [Populus trichocarpa]
Length = 439
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 92/270 (34%), Positives = 134/270 (49%), Gaps = 39/270 (14%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGE 140
G + K+ +GTP + Y +DTGSDL+W C C++C +S +FDP KSS+ +
Sbjct: 95 GEFLMKLAIGTPPETYSAILDTGSDLIWTQCKPCTQCFHQST-----PIFDPKKSSSFSK 149
Query: 141 IACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPL 200
++CS C + SC+ G CEY+ +YGD SST G + + +AS P
Sbjct: 150 LSCSSQLCEALPQS---SCNNG--CEYLYSYGDYSSTQGILASETLTFGKAS-----VP- 198
Query: 201 NSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKG 260
+V FGCG G G S A G++G G+ SL+SQL +F++CL V
Sbjct: 199 --NVAFGCGADNEGS-GFSQGA---GLVGLGRGPLSLVSQLK-----EPKFSYCLTTVDD 247
Query: 261 G-------GIFAIGDVVSPKVKTTPMVPNMPH---YNVILEEVEVGGNPLDLPTSLLGTG 310
G A + S +KTTP++ + H Y + LE + VG L + S
Sbjct: 248 TKTSTLLMGSLASVNASSSAIKTTPLIHSPAHPSFYYLSLEGISVGDTRLPIKKSTFSLQ 307
Query: 311 DE--RGTIIDSGTTLAYLPPMLYDLVLSQF 338
D+ G IIDSGTT+ YL ++LV +F
Sbjct: 308 DDGSGGLIIDSGTTITYLEESAFNLVAKEF 337
>gi|125558629|gb|EAZ04165.1| hypothetical protein OsI_26307 [Oryza sativa Indica Group]
Length = 455
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 96/288 (33%), Positives = 140/288 (48%), Gaps = 52/288 (18%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRC---PTKSDLGIKLTLFDPSKSST 137
G Y + LGTP ++ V VDTGS+L+W CA C+RC PT + + P++SST
Sbjct: 89 GAYNMNISLGTPPLDFPVIVDTGSNLIWAQCAPCTRCFPRPTPAP------VLQPARSST 142
Query: 138 SGEIACSDNFCRTTYNNRYP-SCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLK 196
+ C+ +FC+ + P +C+ C Y TYG G T+GY + + +
Sbjct: 143 FSRLPCNGSFCQYLPTSSRPRTCNATAACAYNYTYGSG-YTAGYLATETLTVGDG----- 196
Query: 197 TAPLNSSVIFGCGNRQSGDLGSSTDAAVD---GILGFGQANSSLLSQLAAAGNVRKEFAH 253
T P V FGC ST+ VD GI+G G+ SL+SQLA F++
Sbjct: 197 TFP---KVAFGC----------STENGVDNSSGIVGLGRGPLSLVSQLAVG-----RFSY 238
Query: 254 CL--DVVKGGG---IFAIGDVVSPK--VKTTPMVPN-----MPHYNVILEEVEVGGNPLD 301
CL D+ GG +F ++ + V++TP++ N HY V L + V L
Sbjct: 239 CLRSDMADGGASPILFGSLAKLTERSVVQSTPLLKNPYLQRSTHYYVNLTGIAVDSTELP 298
Query: 302 LPTSLLG---TGDERGTIIDSGTTLAYLPPMLYDLVLSQFRFWIASLD 346
+ S G TG GTI+DSGTTL YL Y +V F+ +A+L+
Sbjct: 299 VTGSTFGFTQTGLGGGTIVDSGTTLTYLAKDGYAMVKQAFQSQMANLN 346
>gi|115466060|ref|NP_001056629.1| Os06g0118700 [Oryza sativa Japonica Group]
gi|55296436|dbj|BAD68559.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113594669|dbj|BAF18543.1| Os06g0118700 [Oryza sativa Japonica Group]
gi|215767921|dbj|BAH00150.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 494
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 81/248 (32%), Positives = 118/248 (47%), Gaps = 29/248 (11%)
Query: 100 VDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRT--TYNNRYP 157
+DT SD+ WV C S CPT K L+DP+KSS+SG +C+ C Y N
Sbjct: 173 LDTASDVTWVQC---SPCPTPPCYPQKDVLYDPTKSSSSGVFSCNSPTCTQLGPYAN--- 226
Query: 158 SCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLG 217
C+ +C+Y V Y DG+ST+G ++ D++ + A+ S FGC + G
Sbjct: 227 GCTNNNQCQYRVRYPDGTSTAGTYISDLLTITPATA-------VRSFQFGCSHGVQGSFS 279
Query: 218 SSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIG--DVVSPKVK 275
+ AA GI+ G SL+SQ AA + F+HC G F +G V + +
Sbjct: 280 FGSSAA--GIMALGGGPESLVSQTAA--TYGRVFSHCFPPPTRRGFFTLGVPRVAAWRYV 335
Query: 276 TTPMV--PNMP--HYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLY 331
TPM+ P +P Y V LE + V G + +P ++ G +DS T + LPP Y
Sbjct: 336 LTPMLKNPAIPPTFYMVRLEAIAVAGQRIAVPPTVFAA----GAALDSRTAITRLPPTAY 391
Query: 332 DLVLSQFR 339
+ FR
Sbjct: 392 QALRQAFR 399
>gi|293332561|ref|NP_001170100.1| uncharacterized protein LOC100384018 precursor [Zea mays]
gi|224033441|gb|ACN35796.1| unknown [Zea mays]
gi|413944035|gb|AFW76684.1| hypothetical protein ZEAMMB73_746438 [Zea mays]
Length = 456
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 98/302 (32%), Positives = 132/302 (43%), Gaps = 42/302 (13%)
Query: 52 LKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC 111
L H+ R+ A + GG AT Y + +GTP + +DTGSDL+W C
Sbjct: 59 LSSHERPVRARVRAGLVAAAGGI----ATNEYLVHLAVGTPPRPVALTLDTGSDLVWTQC 114
Query: 112 AGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTY 171
A C C D GI L DP+ SST + C CR + SC G C YV Y
Sbjct: 115 APCRDC---FDQGIP--LLDPAASSTYAALPCGAPRCRAL---PFTSCG-GRSCVYVYHY 165
Query: 172 GDGSSTSGYFVRDIIQL--NQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILG 229
GD S T G D N + P + FGCG+ G S+ GI G
Sbjct: 166 GDKSVTVGKIATDRFTFGDNGRRNGDGSLPATRRLTFGCGHFNKGVFQSNE----TGIAG 221
Query: 230 FGQANSSLLSQLAAAGNVRKEFAHCLD---------VVKGGGIFAI-GDVVSPKVKTTPM 279
FG+ SL SQL A F++C V GG A+ S +V+TTP+
Sbjct: 222 FGRGRWSLPSQLNA-----TSFSYCFTSMFDSKSSIVTLGGAPAALYSHAHSGEVRTTPL 276
Query: 280 V--PNMPH-YNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLS 336
P+ P Y + L+ + VG L +P + R TIIDSG ++ LP +Y+ V +
Sbjct: 277 FKNPSQPSLYFLSLKGISVGKTRLPVPETKF-----RSTIIDSGASITTLPEEVYEAVKA 331
Query: 337 QF 338
+F
Sbjct: 332 EF 333
>gi|302762735|ref|XP_002964789.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
gi|300167022|gb|EFJ33627.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
Length = 390
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 92/310 (29%), Positives = 140/310 (45%), Gaps = 40/310 (12%)
Query: 45 RERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGS 104
R R + Q RH R + + +G +G YF ++G+G+P YY+++DTGS
Sbjct: 7 RLRWIHHRIQSSDHRHRRGRSLLQTAQVSSGLSLGSGEYFARMGIGSPQRSYYLELDTGS 66
Query: 105 DLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVR 164
D+ W+ CA CS C ++ D ++DPS SS+ + C C+ Y +C G+
Sbjct: 67 DVTWIQCAPCSSCYSQVD-----PIYDPSNSSSYRRVYCGSALCQAL---DYSACQ-GMG 117
Query: 165 CEYVVTYGDGSSTSGYFVRDI-IQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAA 223
C Y V YGD S++SG D+ I+ N TA N + FGCG+ SG
Sbjct: 118 CSYRVVYGDSSASSG----DLGIESFYLGPNSSTAMRN--IAFGCGHSNSGLFRGEAGLL 171
Query: 224 VDGILGFGQANSSLLSQLAAAGNVRKEFAHCL-----DVVKGGGIFAIGDVVSP-KVKTT 277
G S SQ+AA ++ F++CL + G P + T
Sbjct: 172 GM-----GGGTLSFFSQIAA--SIGPAFSYCLVDRYSQLQSRSSPLIFGRTAIPFAARFT 224
Query: 278 PMVPNM---PHYNVILEEVEVGGNPLDLPTSLL-----GTGDERGTIIDSGTTLAYLPPM 329
P++ N Y IL + VGG L +P + GTG G I+DSGT++ + P
Sbjct: 225 PLLKNPRIDTFYYAILTGISVGGTALPIPPAQFALTGNGTG---GAILDSGTSVTRVVPA 281
Query: 330 LYDLVLSQFR 339
Y ++ +R
Sbjct: 282 AYAVLRDAYR 291
>gi|414584783|tpg|DAA35354.1| TPA: hypothetical protein ZEAMMB73_186928 [Zea mays]
Length = 464
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 86/279 (30%), Positives = 124/279 (44%), Gaps = 31/279 (11%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPS 133
+G +G YF +VG+G+P E Y+ VD+GSD++WV C C C ++D LFDP+
Sbjct: 118 SGLDEGSGEYFVRVGIGSPPTEQYLVVDSGSDVIWVQCKPCLECYAQAD-----PLFDPA 172
Query: 134 KSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASG 193
S+T + C CRT R C C+Y V+YGDGS T G + + L +
Sbjct: 173 TSATFSAVPCGSAVCRTL---RTSGCGDSGGCDYEVSYGDGSYTKGALALETLTLGGTA- 228
Query: 194 NLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAH 253
V GCG+R G G+LG G SL+ QL F++
Sbjct: 229 -------VEGVAIGCGHRNRGLF-----VGAAGLLGLGWGPMSLVGQLGG--AAGGAFSY 274
Query: 254 CLDVVKGGGIFAIG--DVVSPKVKTTPMV--PNMP-HYNVILEEVEVGGNPLDLPTSLLG 308
CL +G G +G + V P+V P P Y V L + VG L L L
Sbjct: 275 CL-ASRGAGSLVLGRSEAVPEGAVWVPLVRNPQAPSFYYVGLSGIGVGDERLPLQEDLFQ 333
Query: 309 TGDE--RGTIIDSGTTLAYLPPMLYDLVLSQFRFWIASL 345
++ G ++D+GT + LP Y + F + +L
Sbjct: 334 LTEDGAGGVVMDTGTAVTRLPQEAYAALRDAFVAAVGAL 372
>gi|326524580|dbj|BAK00673.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 461
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 88/284 (30%), Positives = 124/284 (43%), Gaps = 45/284 (15%)
Query: 76 HPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKS 135
PS Y + +GTP +DTGSDL+W CA C+ C + D LF P+ S
Sbjct: 96 RPSGDLEYLIDLAIGTPPQPVSALLDTGSDLIWTQCAPCASCLAQPD-----PLFAPAAS 150
Query: 136 STSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNL 195
S+ + CS C ++ SC C Y YGDG++T G + + +SG
Sbjct: 151 SSYVPMRCSGQLCNDILHH---SCQRPDTCTYRYNYGDGTTTLGVYATERFTFASSSGEK 207
Query: 196 KTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL 255
+ PL FGCG G L + + GI+GFG+ SL+SQL+ + F++CL
Sbjct: 208 LSVPLG----FGCGTMNVGSLNNGS-----GIVGFGRDPLSLVSQLSI-----RRFSYCL 253
Query: 256 DVVK------------GGGIFAIGDVVSPKVKTTPMV---PNMPHYNVILEEVEVGGNPL 300
G+F D + +V+TT ++ N Y V V VG L
Sbjct: 254 TPYTSTRKSTLMFGSLSDGVFEGDDAATGQVQTTRLLQSRQNPTFYYVPFTGVTVGTRRL 313
Query: 301 DLPTSLL-----GTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
+P S G+G G I+DSGT L P + VL FR
Sbjct: 314 RIPLSAFALRPDGSG---GVIVDSGTALTLFPAAVLTEVLRAFR 354
>gi|224081804|ref|XP_002306494.1| predicted protein [Populus trichocarpa]
gi|222855943|gb|EEE93490.1| predicted protein [Populus trichocarpa]
Length = 564
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 84/253 (33%), Positives = 124/253 (49%), Gaps = 33/253 (13%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGE 140
G Y T++ +GTP + + VDTGS + +V C+ C +C D F P SST
Sbjct: 11 GYYTTRLWIGTPPQRFALIVDTGSSVTYVPCSSCEQCGRHQD-----PKFQPDLSSTYQS 65
Query: 141 IACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPL 200
+ C+ + C + +C Y Y + S++SG DII GNL +A
Sbjct: 66 VKCNID-CNCDDEKQ--------QCVYERQYAEMSTSSGVLGEDIISF----GNL-SALA 111
Query: 201 NSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKG 260
+FGC N ++GDL S DGI+G G+ + S++ L G + F+ C +
Sbjct: 112 PQRAVFGCENMETGDLYSQ---HADGIMGMGRGDLSIVDHLVDKGVINDSFSLCYGGMGI 168
Query: 261 GGIFAIGDVVSPK-----VKTTPMVPNMPHYNVILEEVEVGGNPLDL-PTSLLGTGDERG 314
GG + +SP ++ P+ P+YN+ L+E+ V G PL L PT G + G
Sbjct: 169 GGGAMVLGGISPPSNMVFSQSDPV--RSPYYNIDLKEIHVAGKPLPLNPTVFDG---KHG 223
Query: 315 TIIDSGTTLAYLP 327
TI+DSGTT AYLP
Sbjct: 224 TILDSGTTYAYLP 236
>gi|357160697|ref|XP_003578847.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 421
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 83/284 (29%), Positives = 136/284 (47%), Gaps = 34/284 (11%)
Query: 65 ASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC-AGCSRCPTKSDL 123
+S +L G+ +P GLY+ + +G P Y++ VDTGSDL W+ C A C C
Sbjct: 42 SSAVFQLYGDVYPH--GLYYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPCVSCNK---- 95
Query: 124 GIKLTLFDPSKSSTSGEIACSDNFCRTTY---NNRYPSCSPGVRCEYVVTYGDGSSTSGY 180
+ L+ P+K+ + C D C + + + ++ SP +C+Y + Y D S+ G
Sbjct: 96 -VPHPLYRPTKNKI---VPCVDQLCSSLHGGLSGKHKCDSPKQQCDYEIKYADQGSSLGV 151
Query: 181 FVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAA-VDGILGFGQANSSLLS 239
+ D + A+ ++ + S+ FGCG Q +GSST+ A DG+LG G + SLLS
Sbjct: 152 LLTDSFAVRLANSSI----VRPSLAFGCGYDQ--QVGSSTEVAPTDGVLGLGSGSISLLS 205
Query: 240 QLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSPKVKTT--PMVPNM--PHYNVILEEVEV 295
QL G + HCL ++GGG GD + P + T PMV + +Y+ +
Sbjct: 206 QLKQHGITKNVVGHCLS-IRGGGFLFFGDNLVPYSRATWVPMVRSAFKNYYSPGTASLYF 264
Query: 296 GGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
GG L + ++DSG++ Y Y +++ +
Sbjct: 265 GGRSLGV--------RPMEVVLDSGSSFTYFGAQPYQALVTALK 300
>gi|242093566|ref|XP_002437273.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
gi|241915496|gb|EER88640.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
Length = 503
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 86/265 (32%), Positives = 127/265 (47%), Gaps = 33/265 (12%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGC-SRCPTKSDLGIKLTLFDPSKSSTS 138
TG Y + LGTP + V DTGSD WV C C + C + K LF P+KS+T
Sbjct: 162 TGNYVVPIRLGTPAARFTVVFDTGSDTTWVQCQPCVAYCYQQ-----KEPLFTPTKSATY 216
Query: 139 GEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTA 198
I+C+ ++C + + R CS G C Y V YGDGS T G++ +D + L +
Sbjct: 217 ANISCTSSYC-SDLDTR--GCSGG-HCLYAVQYGDGSYTVGFYAQDTLTLGYDT------ 266
Query: 199 PLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVV 258
FGCG + G G + G++G G+ +S+ Q A FA+C+
Sbjct: 267 --VKDFRFGCGEKNRGLFGKAA-----GLMGLGRGKTSVPVQ--AYDKYSGVFAYCIPAT 317
Query: 259 KGGG---IFAIGDVVSPKVKTTPM-VPNMP-HYNVILEEVEVGGNPLDLPTSLLGTGDER 313
G F G + + TPM V N P Y V + ++VGG+ L +P ++ +
Sbjct: 318 SSGTGFLDFGPGAPAAANARLTPMLVDNGPTFYYVGMTGIKVGGHLLSIPATVF---SDA 374
Query: 314 GTIIDSGTTLAYLPPMLYDLVLSQF 338
G ++DSGT + LPP Y+ + S F
Sbjct: 375 GALVDSGTVITRLPPSAYEPLRSAF 399
>gi|125555056|gb|EAZ00662.1| hypothetical protein OsI_22683 [Oryza sativa Indica Group]
Length = 491
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 102/273 (37%), Positives = 130/273 (47%), Gaps = 41/273 (15%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGC---SRCPTKSDLGIKLTLFDPSKSSTSG 139
+ VGLGTP + DTGSDL WV C C C + D LFDPSKSST
Sbjct: 149 FVVAVGLGTPAQPSALIFDTGSDLSWVQCQPCGSSGHCHPQQD-----PLFDPSKSSTYA 203
Query: 140 EIACSDNFCRTTYNNRYPSCSP-GVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTA 198
+ C + C CS C Y+V YGDGSST+G RD + L +S L
Sbjct: 204 AVHCGEPQCAAAGG----LCSEDNTTCLYLVHYGDGSSTTGVLSRDTLALT-SSRALAGF 258
Query: 199 PLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVV 258
P FGCG R GD G VDG+LG G+ SL SQ AA + F++CL
Sbjct: 259 P------FGCGTRNLGDFGR-----VDGLLGLGRGELSLPSQ--AAASFGAVFSYCLPSS 305
Query: 259 KG-GGIFAIGDVVSPKVKT-----TPMV--PNMP-HYNVILEEVEVGGNPLDLPTSLLGT 309
G IG +P T T M+ P P Y V L +++GG L +P ++
Sbjct: 306 NSTTGYLTIG--ATPATDTGAAQYTAMLRKPQFPSFYFVELVSIDIGGYILPVPPAVFTR 363
Query: 310 GDERGTIIDSGTTLAYLPPMLYDLVLSQFRFWI 342
G GT++DSGT L YLP Y+L+ +FR +
Sbjct: 364 G---GTLLDSGTVLTYLPAQAYELLRDRFRLTM 393
>gi|326503488|dbj|BAJ86250.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 551
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 88/288 (30%), Positives = 128/288 (44%), Gaps = 44/288 (15%)
Query: 65 ASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC-AGCSRC-----P 118
+++ L + GN P G Y+T + +G P Y++ VDTGSDL W+ C A C+ C P
Sbjct: 175 STVLLPIKGNVFPD--GQYYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHP 232
Query: 119 TKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTS 178
K+ P + S E+ N+C T C +C+Y + Y D SS+
Sbjct: 233 LYKPAKEKIV---PPRDSLCQELQGDQNYCET--------CK---QCDYEIEYADRSSSM 278
Query: 179 GYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLL 238
G +D + L +G + +FGC Q G L SS A DGILG A SL
Sbjct: 279 GVLAKDDMHLIATNGGREKL----DFVFGCAYDQQGQLLSSP-AKTDGILGLSSAAISLP 333
Query: 239 SQLAAAGNVRKEFAHCL-DVVKGGGIFAIGDVVSPKVKTTPMVP------NMPHYNVILE 291
SQLA+ G + F HC+ GGG +GD P+ T P N+ Y+ +
Sbjct: 334 SQLASKGIISNVFGHCITRETNGGGYMFLGDDYVPRWGMT-WAPIRGGPDNL--YHTEAQ 390
Query: 292 EVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
+V G L G+ I DSG++ YLP +Y ++ +
Sbjct: 391 KVNYGDQELH-------AGNSVQVIFDSGSSYTYLPEEMYKNLIDAIK 431
>gi|357137788|ref|XP_003570481.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 455
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 90/281 (32%), Positives = 124/281 (44%), Gaps = 32/281 (11%)
Query: 57 TRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCS- 115
T+ G +AS+ L G + G Y T++GLGTP Y + VDTGS L W+ C+ C
Sbjct: 94 TQAAGSSLASVPLTPGTS---VGVGNYVTRMGLGTPAKPYIMVVDTGSSLTWLQCSPCRV 150
Query: 116 RCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCR--TTYNNRYPSCSPGVRCEYVVTYGD 173
C +S +FDP SS+ ++CS C +T CSP C Y +YGD
Sbjct: 151 SCHRQSG-----PVFDPKTSSSYAAVSCSSPQCDGLSTATLNPAVCSPSNVCIYQASYGD 205
Query: 174 GSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQA 233
S + GY +D + S + +GCG G G S G++G +
Sbjct: 206 SSFSVGYLSKDTVSFGANS--------VPNFYYGCGQDNEGLFGRSA-----GLMGLARN 252
Query: 234 NSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSPKVKTTPMVPNM---PHYNVIL 290
SLL QLA + F++CL G +IG TPMV N Y + L
Sbjct: 253 KLSLLYQLAP--TLGYSFSYCLPSTSSSGYLSIGSYNPGGYSYTPMVSNTLDDSLYFISL 310
Query: 291 EEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLY 331
+ V G PL + +S TIIDSGT + LP +Y
Sbjct: 311 SGMTVAGKPLAVSSSEY---TSLPTIIDSGTVITRLPTSVY 348
>gi|449515033|ref|XP_004164554.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 430
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 88/311 (28%), Positives = 142/311 (45%), Gaps = 39/311 (12%)
Query: 38 KFKAGGERERTLSALKQHDTRRH---GRMMASIDLELGGNGHPSATGLYFTKVGLGTPTD 94
+F + +R +A ++ +R R + L+L P +G Y V +GTP
Sbjct: 45 EFSSLSHYDRLTNAFRRSLSRSATLLNRAATNGALDLQAPLTP-GSGEYLMSVSIGTPPV 103
Query: 95 EYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNN 154
+Y DTGSDL+W C C +C +S +FDP KS++ + C+ C+ ++
Sbjct: 104 DYIGMADTGSDLMWAQCLPCLKCYKQSR-----PIFDPLKSTSFSHVPCNSQNCKAIDDS 158
Query: 155 RYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSG 214
C C+Y TYGD + T G + I + +S +K+ + GCG+
Sbjct: 159 H---CGAQGVCDYSYTYGDQTYTKGDLGFEKITIGSSS--VKS-------VIGCGHES-- 204
Query: 215 DLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVV----KGGGIFAIGDVV 270
G++G G SL+SQ++ + + F++CL + G F VV
Sbjct: 205 ---GGGFGFASGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHANGKINFGQNAVV 261
Query: 271 S-PKVKTTPMVPNMP--HYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLP 327
S P V +TP++ P +Y V LE + +G + + + IIDSGTTL++LP
Sbjct: 262 SGPGVVSTPLISKNPVTYYYVTLEAISIGNE------RHMASAKQGNVIIDSGTTLSFLP 315
Query: 328 PMLYDLVLSQF 338
LYD V+S
Sbjct: 316 KELYDGVVSSL 326
>gi|219362525|ref|NP_001136612.1| uncharacterized protein LOC100216735 [Zea mays]
gi|194696366|gb|ACF82267.1| unknown [Zea mays]
gi|413953802|gb|AFW86451.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
Length = 411
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 86/266 (32%), Positives = 128/266 (48%), Gaps = 37/266 (13%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCS--RCPTKSDLGIKLTLFDPSKSSTSGE 140
Y +V GTP V +DTGSD+ W+ C CS +C + D L+DPS SST
Sbjct: 79 YVVRVSFGTPAVPQVVVIDTGSDVSWLQCKPCSSGQCFPQKD-----PLYDPSHSSTYSA 133
Query: 141 IACSDNFCRTTYNNRYPS-CSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAP 199
+ C+ + C+ + Y S C+ G +C + ++Y DG+ST G + +D + L +
Sbjct: 134 VPCASDVCKKLAADAYGSGCTSGKQCGFAISYADGTSTVGAYSQDKLTLAPGA------- 186
Query: 200 LNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVK 259
+ + FGCG+ + G DG+LG G+ SL ++ F++CL V
Sbjct: 187 IVQNFYFGCGHGKHAVRG-----LFDGVLGLGRLRESLGARYGGV------FSYCLPSVS 235
Query: 260 GG-GIFAIGDVVSPK-VKTTPM--VPNMPHYN-VILEEVEVGGNPLDL-PTSLLGTGDER 313
G A+G +P TPM VP P ++ V L + VGG LDL P++ G
Sbjct: 236 SKPGFLALGAGKNPSGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLRPSAFSG----- 290
Query: 314 GTIIDSGTTLAYLPPMLYDLVLSQFR 339
G I+DSGT + L Y + S FR
Sbjct: 291 GMIVDSGTVITGLQSTAYRALRSAFR 316
>gi|168030587|ref|XP_001767804.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162680886|gb|EDQ67318.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 399
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 82/263 (31%), Positives = 133/263 (50%), Gaps = 28/263 (10%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCP------TKSDLGIKLTLFDPSK 134
G Y ++V +GTP +E+ + VDTGS + +V C+ C+ C + L + F P
Sbjct: 38 GYYTSRVFIGTPPNEFALIVDTGSTVTYVPCSSCTHCGHHQASFSTHRLFCRDPRFKPEN 97
Query: 135 SSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGN 194
SS+ +I C + C T + S +C+Y Y + S++ G +D++ AS
Sbjct: 98 SSSYQKIGCRSSDCITGLCD-----SNSHQCKYERMYAEMSTSKGVLGKDLLDFGPAS-R 151
Query: 195 LKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHC 254
L++ L+ FGC +SGDL DGI+G G+ S++ QL G + F+ C
Sbjct: 152 LQSQLLS----FGCETAESGDLYLQ---VADGIMGLGRGPLSIVDQLVGNGAIEDSFSLC 204
Query: 255 L-DVVKGGGIFAIGDVVSPK----VKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGT 309
+ +GGG +G + +P K+ P N +YN+ L E++V G L L +++
Sbjct: 205 YGGMDEGGGSMVLGAIPAPSGMVFAKSDPRRSN--YYNLELTEIQVQGASLKLDSNVF-- 260
Query: 310 GDERGTIIDSGTTLAYLPPMLYD 332
+ GTI+DSGTT AYLP ++
Sbjct: 261 NGKFGTILDSGTTYAYLPDRAFE 283
>gi|194696934|gb|ACF82551.1| unknown [Zea mays]
gi|413936470|gb|AFW71021.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
gi|413953801|gb|AFW86450.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
Length = 445
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 86/266 (32%), Positives = 128/266 (48%), Gaps = 37/266 (13%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCS--RCPTKSDLGIKLTLFDPSKSSTSGE 140
Y +V GTP V +DTGSD+ W+ C CS +C + D L+DPS SST
Sbjct: 113 YVVRVSFGTPAVPQVVVIDTGSDVSWLQCKPCSSGQCFPQKD-----PLYDPSHSSTYSA 167
Query: 141 IACSDNFCRTTYNNRYPS-CSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAP 199
+ C+ + C+ + Y S C+ G +C + ++Y DG+ST G + +D + L +
Sbjct: 168 VPCASDVCKKLAADAYGSGCTSGKQCGFAISYADGTSTVGAYSQDKLTLAPGA------- 220
Query: 200 LNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVK 259
+ + FGCG+ + G DG+LG G+ SL ++ F++CL V
Sbjct: 221 IVQNFYFGCGHGKHAVRG-----LFDGVLGLGRLRESLGARYGGV------FSYCLPSVS 269
Query: 260 GG-GIFAIGDVVSPK-VKTTPM--VPNMPHYN-VILEEVEVGGNPLDL-PTSLLGTGDER 313
G A+G +P TPM VP P ++ V L + VGG LDL P++ G
Sbjct: 270 SKPGFLALGAGKNPSGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLRPSAFSG----- 324
Query: 314 GTIIDSGTTLAYLPPMLYDLVLSQFR 339
G I+DSGT + L Y + S FR
Sbjct: 325 GMIVDSGTVITGLQSTAYRALRSAFR 350
>gi|15232503|ref|NP_191008.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|7288018|emb|CAB81805.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|17979257|gb|AAL49945.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
gi|21700851|gb|AAM70549.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
gi|332645705|gb|AEE79226.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 425
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 85/266 (31%), Positives = 123/266 (46%), Gaps = 35/266 (13%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y + +GTP V +DT +D W+ C+GC C + LFDPSKSS+S +
Sbjct: 88 YIVRANIGTPAQPMLVALDTSNDAAWIPCSGCVGCSSS-------VLFDPSKSSSSRTLQ 140
Query: 143 CSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNS 202
C C+ N PSC+ C + +TYG GS+ Y +D + L + +
Sbjct: 141 CEAPQCKQAPN---PSCTVSKSCGFNMTYG-GSTIEAYLTQDTLTL--------ASDVIP 188
Query: 203 SVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKG-- 260
+ FGC N+ SG T G++G G+ SL+SQ + + F++CL K
Sbjct: 189 NYTFGCINKASG-----TSLPAQGLMGLGRGPLSLISQ--SQNLYQSTFSYCLPNSKSSN 241
Query: 261 -GGIFAIGDVVSP-KVKTTPMVPNMPH---YNVILEEVEVGGNPLDLPTSLLG--TGDER 313
G +G P ++KTTP++ N Y V L + VG +D+PTS L
Sbjct: 242 FSGSLRLGPKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGA 301
Query: 314 GTIIDSGTTLAYLPPMLYDLVLSQFR 339
GTI DSGT L Y V ++FR
Sbjct: 302 GTIFDSGTVYTRLVEPAYVAVRNEFR 327
>gi|42568291|ref|NP_199124.3| aspartyl protease family protein [Arabidopsis thaliana]
gi|332007527|gb|AED94910.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 631
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 78/257 (30%), Positives = 121/257 (47%), Gaps = 37/257 (14%)
Query: 79 ATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTS 138
+ G Y T++ +GTP E+ + VDTGS + +V C+ C +C D F P S++
Sbjct: 72 SNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSTCKQCGKHQD-----PKFQPELSTSY 126
Query: 139 GEIACSDNFCRTTYNNRYPSCS---PGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNL 195
+ C+ P C+ G C Y Y + SS+SG D+I S
Sbjct: 127 QALKCN------------PDCNCDDEGKLCVYERRYAEMSSSSGVLSEDLISFGNES--- 171
Query: 196 KTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL 255
+ +P +FGC N ++GDL S DGI+G G+ S++ QL G + F+ C
Sbjct: 172 QLSP--QRAVFGCENEETGDLFSQR---ADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCY 226
Query: 256 DVVK-GGGIFAIGDVVSPK----VKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTG 310
++ GGG +G + P + P P+YN+ L+++ V G L L +
Sbjct: 227 GGMEVGGGAMVLGKISPPPGMVFSHSDPF--RSPYYNIDLKQMHVAGKSLKLNPKVF--N 282
Query: 311 DERGTIIDSGTTLAYLP 327
+ GT++DSGTT AY P
Sbjct: 283 GKHGTVLDSGTTYAYFP 299
>gi|9757837|dbj|BAB08274.1| unnamed protein product [Arabidopsis thaliana]
Length = 586
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 78/257 (30%), Positives = 121/257 (47%), Gaps = 37/257 (14%)
Query: 79 ATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTS 138
+ G Y T++ +GTP E+ + VDTGS + +V C+ C +C D F P S++
Sbjct: 72 SNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSTCKQCGKHQD-----PKFQPELSTSY 126
Query: 139 GEIACSDNFCRTTYNNRYPSCS---PGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNL 195
+ C+ P C+ G C Y Y + SS+SG D+I S
Sbjct: 127 QALKCN------------PDCNCDDEGKLCVYERRYAEMSSSSGVLSEDLISFGNES--- 171
Query: 196 KTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL 255
+ +P +FGC N ++GDL S DGI+G G+ S++ QL G + F+ C
Sbjct: 172 QLSP--QRAVFGCENEETGDLFSQR---ADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCY 226
Query: 256 DVVK-GGGIFAIGDVVSPK----VKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTG 310
++ GGG +G + P + P P+YN+ L+++ V G L L +
Sbjct: 227 GGMEVGGGAMVLGKISPPPGMVFSHSDPF--RSPYYNIDLKQMHVAGKSLKLNPKVF--N 282
Query: 311 DERGTIIDSGTTLAYLP 327
+ GT++DSGTT AY P
Sbjct: 283 GKHGTVLDSGTTYAYFP 299
>gi|317106730|dbj|BAJ53226.1| JHL06P13.6 [Jatropha curcas]
Length = 445
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 89/275 (32%), Positives = 129/275 (46%), Gaps = 36/275 (13%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGE 140
G YF ++ +GTP E V DTGSDL+WV C C C + K +F+P +SST
Sbjct: 92 GEYFMRISIGTPPIEVLVIADTGSDLIWVQCQPCQECYKQ-----KSPIFNPKQSSTYRR 146
Query: 141 IACSDNFCRTTYNNRYPSCSPG---VRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKT 197
+ C +C N+ +CS C Y +YGD S T GY + + + +++
Sbjct: 147 VLCETRYCNAL-NSDMRACSAHGFFKACGYSYSYGDHSFTMGYLATERFIIGSTNNSIQ- 204
Query: 198 APLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDV 257
+ FGCGN G+ D GI+G G + SL+SQL + +F++CL
Sbjct: 205 -----ELAFGCGNSNGGNF----DEVGSGIVGLGGGSLSLISQLGT--KIDNKFSYCLVP 253
Query: 258 VKGGGIFAIGDVV---------SPKVKTTPMVPNMPH--YNVILEEVEVGGNPLDLPTSL 306
+ F++G +V S +TP+V P Y + LE + VG L S
Sbjct: 254 ILEKSNFSLGKIVFGDNSFISGSDTYVSTPLVSKEPETFYYLTLEAISVGNERLAYENSR 313
Query: 307 LGTGDERGT-IIDSGTTLAYLPPMLY---DLVLSQ 337
E+G IIDSGTTL +L LY +LVL +
Sbjct: 314 NDGNVEKGNIIIDSGTTLTFLDSKLYNKLELVLEK 348
>gi|147802609|emb|CAN73001.1| hypothetical protein VITISV_037997 [Vitis vinifera]
Length = 424
Score = 108 bits (270), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 87/281 (30%), Positives = 134/281 (47%), Gaps = 32/281 (11%)
Query: 63 MMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC-AGCSRCPTKS 121
+ +S+ L GN +P G Y+ + +G P Y++ TGSDL W+ C A C RC TK+
Sbjct: 49 IQSSVVFPLYGNVYP--LGYYYVSLSIGQPPXPYFLDPXTGSDLSWLQCDAPCVRC-TKA 105
Query: 122 DLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYF 181
L+ P+ + + C D C + Y C +C+Y V Y DG S+ G
Sbjct: 106 ----XHXLYRPNNNL----VICKDPMCAXLHPPGY-KCEHPEQCDYEVEYADGGSSLGVL 156
Query: 182 VRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQL 241
V+D+ LN +G L+ AP + GCG Q + + +DG+LG G+ SS++SQL
Sbjct: 157 VKDVFPLNFTNG-LRLAP---RLALGCGYDQ---IPGXSYHPLDGVLGLGKGKSSIVSQL 209
Query: 242 AAAGNVRKEFAHCLDVVKGGGIFAIGDVV--SPKVKTTPMVPNM-PHYNVILEEVEVGGN 298
+ G +R HC+ GGG GD + S +V TPM+ + HY+ E+ +GG
Sbjct: 210 HSQGVIRNVVGHCVS-SHGGGFLFFGDDLYDSSRVVWTPMLRDQHTHYSSGYAELILGGK 268
Query: 299 PLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
L+ DSG++ YL + Y ++ R
Sbjct: 269 TTVFKNLLV--------TFDSGSSYTYLNSLAYQALVHLVR 301
>gi|148905906|gb|ABR16115.1| unknown [Picea sitchensis]
Length = 482
Score = 108 bits (270), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 92/304 (30%), Positives = 137/304 (45%), Gaps = 36/304 (11%)
Query: 46 ERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSD 105
ER + L ++ G +L L +G TG Y G GTP + +DTGSD
Sbjct: 101 ERDNARLNTIRSKNSGPYTTMSNLPLQ-SGTTVGTGNYIVTAGFGTPAKNSLLIIDTGSD 159
Query: 106 LLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCR---TTYNNRYPSCSPG 162
L W+ C C+ C ++ D +F+P +SS+ + C C T+ +N P G
Sbjct: 160 LTWIQCKPCADCYSQVD-----AIFEPKQSSSYKTLPCLSATCTELITSESNPTPCLLGG 214
Query: 163 VRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDA 222
C Y + YGDGSS+ G F ++ + L S + FGCG+ +G S+
Sbjct: 215 --CVYEINYGDGSSSQGDFSQETLTLGSDSFQ--------NFAFGCGHTNTGLFKGSS-- 262
Query: 223 AVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDV----VSPKVKTTP 278
G+LG GQ + S SQ + +FA+CL V + TP
Sbjct: 263 ---GLLGLGQNSLSFPSQ--SKSKYGGQFAYCLPDFGSSTSTGSFSVGKGSIPASAVFTP 317
Query: 279 MVPNMPH---YNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVL 335
+V N + Y V L + VGG+ L +P ++LG G TI+DSGT + L P Y+ +
Sbjct: 318 LVSNFMYPTFYFVGLNGISVGGDRLSIPPAVLGRGS---TIVDSGTVITRLLPQAYNALK 374
Query: 336 SQFR 339
+ FR
Sbjct: 375 TSFR 378
>gi|21617933|gb|AAM66983.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
Length = 425
Score = 108 bits (270), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 85/266 (31%), Positives = 123/266 (46%), Gaps = 35/266 (13%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y + +GTP V +DT +D W+ C+GC C + LFDPSKSS+S +
Sbjct: 88 YIVRANIGTPAQPMLVALDTSNDAAWIPCSGCVGCSSS-------VLFDPSKSSSSRTLQ 140
Query: 143 CSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNS 202
C C+ N PSC+ C + +TYG GS+ Y +D + L + +
Sbjct: 141 CEAPQCKQAPN---PSCTVSKSCGFNMTYG-GSTIEAYLTQDTLTL--------ASDVIP 188
Query: 203 SVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKG-- 260
+ FGC N+ SG T G++G G+ SL+SQ + + F++CL K
Sbjct: 189 NYTFGCINKASG-----TSLPAQGLMGLGRGPLSLISQ--SQNLYQSTFSYCLPNSKSSN 241
Query: 261 -GGIFAIGDVVSP-KVKTTPMVPNMPH---YNVILEEVEVGGNPLDLPTSLLG--TGDER 313
G +G P ++KTTP++ N Y V L + VG +D+PTS L
Sbjct: 242 FSGSLRLGPKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGA 301
Query: 314 GTIIDSGTTLAYLPPMLYDLVLSQFR 339
GTI DSGT L Y V ++FR
Sbjct: 302 GTIFDSGTVYTRLVEPAYVAVRNEFR 327
>gi|225216973|gb|ACN85264.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
Length = 517
Score = 108 bits (270), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 94/266 (35%), Positives = 120/266 (45%), Gaps = 35/266 (13%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGC-SRCPTKSDLGIKLTLFDP 132
+G TG Y VGLGTP Y V DTGSD WV C C C + + LFDP
Sbjct: 169 SGRALGTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQQE-----KLFDP 223
Query: 133 SKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQAS 192
+SST ++C+ C + CS G C Y V YGDGS + G+F D + L+
Sbjct: 224 VRSSTYANVSCAAPACS---DLNIHGCSGG-HCLYGVQYGDGSYSIGFFAMDTLTLSSYD 279
Query: 193 GNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQ-LAAAGNVRKEF 251
FGCG R G G + G+LG G+ +SL Q G V F
Sbjct: 280 A-------VKGFRFGCGERNEGLFGEAA-----GLLGLGRGKTSLPVQTYDKYGGV---F 324
Query: 252 AHCLDVVKGGGIF----AIGDVVSPKVKTTPMVP-NMPHYNVI-LEEVEVGGNPLDLPTS 305
AHCL G + A + TTPM+ N P + I + + VGG L +P S
Sbjct: 325 AHCLPARSTGTGYLDFGAGSPAAASARLTTPMLTDNGPTFYYIGMTGIRVGGQLLSIPQS 384
Query: 306 LLGTGDERGTIIDSGTTLAYLPPMLY 331
+ T GTI+DSGT + LPP Y
Sbjct: 385 VFATA---GTIVDSGTVITRLPPPAY 407
>gi|255538124|ref|XP_002510127.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223550828|gb|EEF52314.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 641
Score = 108 bits (270), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 92/311 (29%), Positives = 140/311 (45%), Gaps = 54/311 (17%)
Query: 50 SALKQHDTRRHGRMMASIDLELGGNGHPS------ATGLYFTKVGLGTPTDEYYVQVDTG 103
S K + H R + + DL N H + G Y T++ +GTP E+ + VDTG
Sbjct: 52 SHRKPFTSNYHRRQLHNSDLP---NAHMRLYDDLLSNGYYTTRLFIGTPPQEFALIVDTG 108
Query: 104 SDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCS--- 160
S + +V C+ C +C D F P SST + C+ PSC+
Sbjct: 109 STVTYVPCSTCEQCGKHQD-----PRFQPESSSTYKPMQCN------------PSCNCDD 151
Query: 161 PGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSST 220
G +C Y Y + SS+SG D++ S + P IFGC ++G+L S
Sbjct: 152 EGKQCTYERRYAEMSSSSGLLAEDVLSFGNES---ELTP--QRAIFGCETVETGELFSQR 206
Query: 221 DAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHC---LDVVKGGGIFAIGDVVSPK---- 273
DGI+G G+ S++ QL V F+ C +DVV GG +G++ P
Sbjct: 207 ---ADGIMGLGRGPLSVVDQLVIKEVVGNSFSLCYGGMDVV--GGAMVLGNIPPPPDMVF 261
Query: 274 VKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLY-- 331
+ P +YN+ L+E+ V G L L + + GT++DSGTT AYLP +
Sbjct: 262 AHSDPY--RSAYYNIELKELHVAGKRLKLNPRVF--DGKHGTVLDSGTTYAYLPEEAFVA 317
Query: 332 --DLVLSQFRF 340
D ++ + +F
Sbjct: 318 FKDAIIKEIKF 328
>gi|356495754|ref|XP_003516738.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 420
Score = 108 bits (270), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 90/273 (32%), Positives = 125/273 (45%), Gaps = 28/273 (10%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGE 140
G Y ++ +GTP + Y DTGSDL W +C C+ C + + +FDP KS+T
Sbjct: 70 GHYLMELSIGTPPFKIYGIADTGSDLTWTSCVPCNNCYKQRN-----PMFDPQKSTTYRN 124
Query: 141 IACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPL 200
I+C C CSP RC Y Y + T G ++ I L+ G K+ PL
Sbjct: 125 ISCDSKLCHKLDTG---VCSPQKRCNYTYAYASAAITRGVLAQETITLSSTKG--KSVPL 179
Query: 201 NSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL----- 255
++FGCG+ +G GI+G G SL+SQ+ ++ K F+ CL
Sbjct: 180 K-GIVFGCGHNNTGGFNDHE----MGIIGLGGGPVSLISQMGSSFG-GKRFSQCLVPFHT 233
Query: 256 DV-VKGGGIFAIGDVVSPK-VKTTPMVPNMPH--YNVILEEVEVGGNPLDLPTSLLGTGD 311
DV V F G VS K V +TP+V Y V L + V L S
Sbjct: 234 DVSVSSKMSFGKGSKVSGKGVVSTPLVAKQDKTPYFVTLLGISVENTYLHFNGS--SQNV 291
Query: 312 ERGTI-IDSGTTLAYLPPMLYDLVLSQFRFWIA 343
E+G + +DSGT LP LYD V++Q R +A
Sbjct: 292 EKGNMFLDSGTPPTILPTQLYDQVVAQVRSEVA 324
>gi|449439393|ref|XP_004137470.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
gi|449486840|ref|XP_004157418.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 570
Score = 108 bits (270), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 87/261 (33%), Positives = 129/261 (49%), Gaps = 25/261 (9%)
Query: 73 GNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC-AGCSRCPTKSDLGIKLTLFD 131
G+ +P GLY+T + +G P Y++ +DTGSDL WV C A CS C + L+
Sbjct: 191 GDIYPD--GLYYTYIMVGEPPRPYFLDIDTGSDLTWVQCDAPCSSCGKG-----RSPLYK 243
Query: 132 PSKSSTSGEIACSDNFCRTTYNNRY-PSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQ 190
P + + ++ D+ C N C+ +C Y V Y D SS+ G V+D L
Sbjct: 244 PRRENV---VSFKDSLCMEVQRNYDGDQCAACQQCNYEVQYADQSSSLGVLVKDEFTLRF 300
Query: 191 ASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKE 250
++G+L LN+ IFGC Q G L +T + DGILG +A SL SQLA+ G +
Sbjct: 301 SNGSLTK--LNA--IFGCAYDQQG-LLLNTLSKTDGILGLSRAKVSLPSQLASRGIINNV 355
Query: 251 FAHCLD-VVKGGGIFAIGDVVSPK--VKTTPMV--PNMPHYNVILEEVEVGGNPLDLPTS 305
HCL GGG +GD P+ + M+ P++ Y + ++ G PL L T
Sbjct: 356 VGHCLTGDPAGGGYLFLGDDFVPQWGMAWVAMLDSPSIDFYQTKVVRIDYGSIPLSLDT- 414
Query: 306 LLGTGDERGTIIDSGTTLAYL 326
G+ E+ + DSG++ Y
Sbjct: 415 -WGSSREQ-VVFDSGSSYTYF 433
>gi|54290728|dbj|BAD62398.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
Length = 486
Score = 108 bits (269), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 102/273 (37%), Positives = 130/273 (47%), Gaps = 41/273 (15%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGC---SRCPTKSDLGIKLTLFDPSKSSTSG 139
+ VGLGTP + DTGSDL WV C C C + D LFDPSKSST
Sbjct: 144 FVVAVGLGTPAQPSALIFDTGSDLSWVQCQPCGSSGHCHPQQD-----PLFDPSKSSTYA 198
Query: 140 EIACSDNFCRTTYNNRYPSCSP-GVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTA 198
+ C + C + CS C Y+V YGDGSST+G RD + L +S L
Sbjct: 199 AVHCGEPQCAAAGD----LCSEDNTTCLYLVRYGDGSSTTGVLSRDTLALT-SSRALTGF 253
Query: 199 PLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVV 258
P FGCG R GD G VDG+LG G+ SL SQ AA + F++CL
Sbjct: 254 P------FGCGTRNLGDFGR-----VDGLLGLGRGELSLPSQ--AAASFGAVFSYCLPSS 300
Query: 259 KG-GGIFAIGDVVSPKVKT-----TPMV--PNMP-HYNVILEEVEVGGNPLDLPTSLLGT 309
G IG +P T T M+ P P Y V L +++GG L +P ++
Sbjct: 301 NSTTGYLTIG--ATPATDTGAAQYTAMLRKPQFPSFYFVELVSIDIGGYVLPVPPAVFTR 358
Query: 310 GDERGTIIDSGTTLAYLPPMLYDLVLSQFRFWI 342
G GT++DSGT L YLP Y L+ +FR +
Sbjct: 359 G---GTLLDSGTVLTYLPAQAYALLRDRFRLTM 388
>gi|50508279|dbj|BAD32128.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|125600536|gb|EAZ40112.1| hypothetical protein OsJ_24555 [Oryza sativa Japonica Group]
Length = 455
Score = 108 bits (269), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 96/288 (33%), Positives = 137/288 (47%), Gaps = 52/288 (18%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRC---PTKSDLGIKLTLFDPSKSST 137
G Y + LGTP ++ V VDTGS+L+W CA C+RC PT + + P++SST
Sbjct: 89 GAYNMNISLGTPPLDFPVIVDTGSNLIWAQCAPCTRCFPRPTPAP------VLQPARSST 142
Query: 138 SGEIACSDNFCRTTYNNRYP-SCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLK 196
+ C+ +FC+ + P +C+ C Y TYG G T+GY + + +
Sbjct: 143 FSRLPCNGSFCQYLPTSSRPRTCNATAACAYNYTYGSG-YTAGYLATETLTVGDG----- 196
Query: 197 TAPLNSSVIFGCGNRQSGDLGSSTDAAVD---GILGFGQANSSLLSQLAAAGNVRKEFAH 253
T P V FGC ST+ VD GI+G G+ SL+SQLA F++
Sbjct: 197 TFP---KVAFGC----------STENGVDNSSGIVGLGRGPLSLVSQLAVG-----RFSY 238
Query: 254 CL--DVVKGGG-IFAIGDVVS----PKVKTTPMVPN-----MPHYNVILEEVEVGGNPLD 301
CL D+ GG G + V++TP++ N HY V L + V L
Sbjct: 239 CLRSDMADGGASPILFGSLAKLTEGSVVQSTPLLKNPYLQRSTHYYVNLTGIAVDSTELP 298
Query: 302 LPTSLLG---TGDERGTIIDSGTTLAYLPPMLYDLVLSQFRFWIASLD 346
+ S G TG GTI+DSGTTL YL Y +V F+ +A+L+
Sbjct: 299 VTGSTFGFTQTGLGGGTIVDSGTTLTYLAKDGYAMVKQAFQSQMANLN 346
>gi|297795137|ref|XP_002865453.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297311288|gb|EFH41712.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 665
Score = 108 bits (269), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 79/257 (30%), Positives = 120/257 (46%), Gaps = 37/257 (14%)
Query: 79 ATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTS 138
+ G Y T++ +GTP E+ + VDTGS + +V C+ C +C D F P SS+
Sbjct: 76 SNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSTCKQCGKHQD-----PKFQPELSSSY 130
Query: 139 GEIACSDNFCRTTYNNRYPSCS---PGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNL 195
+ C+ P C+ G C Y Y + SS+SG D+I S
Sbjct: 131 KALKCN------------PDCNCDDEGKLCVYERRYAEMSSSSGVLSEDLISFGNES--- 175
Query: 196 KTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL 255
+ P +FGC N ++GDL S DGI+G G+ S++ QL G + F+ C
Sbjct: 176 QLTP--QRAVFGCENVETGDLFSQR---ADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCY 230
Query: 256 DVVK-GGGIFAIGDVVSPK----VKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTG 310
++ GGG +G + P + P P+YN+ L+++ V G L L +
Sbjct: 231 GGMEVGGGAMVLGKISPPAGMVFSHSDPF--RSPYYNIDLKQMHVAGKSLKLNPKVF--N 286
Query: 311 DERGTIIDSGTTLAYLP 327
+ GT++DSGTT AY P
Sbjct: 287 GKHGTVLDSGTTYAYFP 303
>gi|413952720|gb|AFW85369.1| hypothetical protein ZEAMMB73_571116 [Zea mays]
Length = 451
Score = 108 bits (269), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 85/279 (30%), Positives = 126/279 (45%), Gaps = 33/279 (11%)
Query: 79 ATGLYFTKVGLGTPTDEYY-VQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSST 137
++G Y +GTP + + +DTGSDL+W C C C LFDPS SST
Sbjct: 83 SSGEYLIHFNIGTPRPQRVALTMDTGSDLVWTQCTPCPVC-----FDQPFPLFDPSVSST 137
Query: 138 SGEIACSDNFCRTTYNNRYPSCS-PGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLK 196
+AC D CR + +C+ RC Y+ +YGD S T+GY +D +G
Sbjct: 138 FRAVACPDPICRPSSGLSVSACALKTFRCFYLCSYGDKSITAGYIFKDTFTFMSPNGEGA 197
Query: 197 TAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL- 255
S + FGCG+ +G S+ GI GFG+ SL SQL F++CL
Sbjct: 198 PPVAVSGLAFGCGDYNTGVFASNE----SGIAGFGRGPLSLPSQLRVG-----RFSYCLT 248
Query: 256 --DVVKGGGIFAIGDVVSPK---------VKTTPMV--PNMP-HYNVILEEVEVGGN--P 299
D + A+ P ++TP++ P+ P Y + LE + VG P
Sbjct: 249 SHDETESNKTSAVFLGTPPNGLRAHSSGPFRSTPIIHSPSFPTFYYLSLEGITVGKTRLP 308
Query: 300 LDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
+D L GT+IDSGT + P +++ + ++F
Sbjct: 309 VDSSVFALKKDGSGGTVIDSGTGVTTFPAAVFEQLKNEF 347
>gi|357118738|ref|XP_003561107.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 491
Score = 108 bits (269), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 86/255 (33%), Positives = 125/255 (49%), Gaps = 41/255 (16%)
Query: 100 VDTGSDLLWVNCAGCS--RCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRT--TYNNR 155
+DT SD+ WV CA C C ++D+ L+DPSKSS+S CS CR Y N
Sbjct: 160 IDTASDVPWVQCAPCPAPHCHAQTDV-----LYDPSKSSSSAAFPCSSPACRNLGPYAN- 213
Query: 156 YPSCSP-GVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNR--Q 212
C+P G +C+Y V Y DGS+++G ++ D++ LN A K A S FGC + Q
Sbjct: 214 --GCTPAGDQCQYRVQYPDGSASAGTYISDVLTLNPA----KPASAISEFRFGCSHALLQ 267
Query: 213 SGDLGSSTDAAVDGILGFGQANSSLLSQLAAA-GNVRKEFAHCL---DVVKGGGIFAIGD 268
G + T GI+ G+ SL +Q A G+V F++CL V G I +
Sbjct: 268 PGSFSNKT----SGIMALGRGAQSLPTQTKATYGDV---FSYCLPPTPVHSGFFILGVPR 320
Query: 269 VVSPKVKTTPMV-----PNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTL 323
V + + TPM+ P + Y V L +EV G L +P ++ G ++DS T +
Sbjct: 321 VAASRYAVTPMLRSKAAPML--YLVRLIAIEVAGKRLPVPPAVFAA----GAVMDSRTIV 374
Query: 324 AYLPPMLYDLVLSQF 338
LPP Y + + F
Sbjct: 375 TRLPPTAYMALRAAF 389
>gi|15219354|ref|NP_175079.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12320825|gb|AAG50556.1|AC074228_11 nucellin, putative [Arabidopsis thaliana]
gi|332193902|gb|AEE32023.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 405
Score = 107 bits (268), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 87/276 (31%), Positives = 119/276 (43%), Gaps = 29/276 (10%)
Query: 65 ASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC-AGCSRCPTKSDL 123
+S+ L GN P G Y + +G+P + +DTGSDL WV C A CS C +L
Sbjct: 33 SSVVFPLSGNVFP--LGYYSVLMQIGSPPKAFQFDIDTGSDLTWVQCDAPCSGCTLPPNL 90
Query: 124 GIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSC-SPGVRCEYVVTYGDGSSTSGYFV 182
K I CS+ C + P C +P +C+Y V Y D S+ G V
Sbjct: 91 QYK---------PKGNIIPCSNPICTALHWPNKPHCPNPQEQCDYEVKYADQGSSMGALV 141
Query: 183 RDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLA 242
D L +G+ P V FGCG QS + A G+LG G+ LL+QL
Sbjct: 142 TDQFPLKLVNGSFMQPP----VAFGCGYDQSYP-SAHPPPATAGVLGLGRGKIGLLTQLV 196
Query: 243 AAGNVRKEFAHCLDVVKGGGIFAIGDVVSPK--VKTTPMVPNMPHYNVILEEVEVGGNPL 300
+AG R HCL KGGG GD + P V TP++ HY ++ G
Sbjct: 197 SAGLTRNVVGHCLS-SKGGGFLFFGDNLVPSIGVAWTPLLSQDNHYTTGPADLLFNGK-- 253
Query: 301 DLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLS 336
PT L G I D+G++ Y Y +++
Sbjct: 254 --PTGLKGL----KLIFDTGSSYTYFNSKAYQTIIN 283
>gi|356518800|ref|XP_003528065.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 438
Score = 107 bits (268), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 85/279 (30%), Positives = 128/279 (45%), Gaps = 30/279 (10%)
Query: 62 RMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC-AGCSRCPTK 120
R +S+ + GN +P G Y + +G P Y++ +DTGSDL W+ C A CSRC
Sbjct: 58 RAGSSVVFPVHGNVYP--VGFYNVTLNIGQPPRPYFLDIDTGSDLTWLQCDAPCSRCSQT 115
Query: 121 SDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGY 180
L+ PS + C + C + +++ C +C+Y V Y D S+ G
Sbjct: 116 PH-----PLYRPSNDF----VPCRHSLCASLHHSDNYDCEVPHQCDYEVQYADHYSSLGV 166
Query: 181 FVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQ 240
+ D+ LN +G L + GCG Q + +DG+LG G+ +SL SQ
Sbjct: 167 LLHDVYTLNFTNG----VQLKVRMALGCGYDQI--FPDPSHHPLDGMLGLGRGKTSLTSQ 220
Query: 241 LAAAGNVRKEFAHCLDVVKGGGIFAIGDVV-SPKVKTTPMVP-NMPHYNVI-LEEVEVGG 297
L + G VR HCL GG IF GDV S ++ TPM + HY+ E+ GG
Sbjct: 221 LNSQGLVRNVIGHCLSAQGGGYIF-FGDVYDSSRLTWTPMSSRDYKHYSAAGAAELLFGG 279
Query: 298 NPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLS 336
S +G+ + D+G++ Y P Y ++S
Sbjct: 280 K-----KSGIGS---LHAVFDTGSSYTYFNPYAYQALIS 310
>gi|6562286|emb|CAB62656.1| putative protein [Arabidopsis thaliana]
Length = 518
Score = 107 bits (268), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 96/332 (28%), Positives = 142/332 (42%), Gaps = 40/332 (12%)
Query: 20 WAVGGGGVMGNFVFEVENKFK--------------AGGERERTLSALKQHDTRRHGRMMA 65
W + G F FEV + F G E L D GR +A
Sbjct: 7 WGLERCEASGKFSFEVHHMFSDVVKQTLGFDDLVPENGSLEY-FKVLAHRDRFIRGRGLA 65
Query: 66 SIDLE-----LGGNGHPSATGL---YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRC 117
S + E +G N + L ++ V LGTP + V +DTGSDL W+ C + C
Sbjct: 66 SNNEETPLTSIGSNLTLALNFLGFLHYANVSLGTPATWFLVALDTGSDLFWLPCNCGTTC 125
Query: 118 -----PTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYG 172
+ + L L+ P+ S+TS I CSD C + SP C Y +
Sbjct: 126 IHDLKDARFSESVPLNLYTPNASTTSSSIRCSDKRCFGSGK----CSSPESICPYQIALS 181
Query: 173 DGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQ 232
+ T+G ++D++ L +LK P+N++V GCG Q+G TD AV+G+LG
Sbjct: 182 SNTVTTGTLLQDVLHLVTEDEDLK--PVNANVTLGCGQNQTGAF--QTDIAVNGVLGLSM 237
Query: 233 ANSSLLSQLAAAGNVRKEFAHCL-DVVKGGGIFAIGDVVSPKVKTTPMV--PNMPHYNVI 289
S+ S LA A F+ C ++ G + GD + TP+V Y V
Sbjct: 238 KEYSVPSLLAKANITANSFSMCFGRIISVVGRISFGDKGYTDQEETPLVSLETSTAYGVN 297
Query: 290 LEEVEVGGNPLDLPT-SLLGTGDERGTIIDSG 320
+ V VGG P+D+P +L TG +++S
Sbjct: 298 VTGVSVGGVPVDVPLFALFDTGSSFTLLLESA 329
>gi|242094226|ref|XP_002437603.1| hypothetical protein SORBIDRAFT_10g030330 [Sorghum bicolor]
gi|241915826|gb|EER88970.1| hypothetical protein SORBIDRAFT_10g030330 [Sorghum bicolor]
Length = 541
Score = 107 bits (268), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 101/340 (29%), Positives = 142/340 (41%), Gaps = 42/340 (12%)
Query: 16 VVHQWAVGGGGVMGNFVFEVENKFKAGGERERTLSALKQHDTRRHGRMMAS------IDL 69
VV QWA G + A G E SAL +HD R + +
Sbjct: 45 VVRQWAEARGHPFA------AQDWPARGSPEY-YSALSRHDRAVLSRRALADGADGLVTF 97
Query: 70 ELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDL----GI 125
G + LY+ V +GTP + V +DTGSDL WV C C +C + +++
Sbjct: 98 AAGNDTLQYIGSLYYAVVEVGTPNATFLVALDTGSDLFWVPC-DCKQCASIANVTGQPAT 156
Query: 126 KLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVR--CEYVVTY-GDGSSTSGYFV 182
L + P +SSTS ++ C + C +R CS C Y V Y +STSG V
Sbjct: 157 ALRPYSPRESSTSKQVTCDNALC-----DRPNGCSAATNGSCPYEVQYLSANTSTSGVLV 211
Query: 183 RDIIQLNQ---ASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLS 239
+D++ L + + L + V+FGCG Q+G AA DG++G G+ N S+ S
Sbjct: 212 QDVLHLTRERPGAAAEAGEALQAPVVFGCGQVQTGTFLDG--AAFDGLMGLGRENVSVPS 269
Query: 240 QLAAAGNVRKE-FAHCLDVVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGN 298
LA++G V + F+ C G G GD S TP YNV V V
Sbjct: 270 VLASSGLVASDSFSMCFG-DDGVGRINFGDSGSSGQGETPFTGRRTLYNVSFTAVNV--- 325
Query: 299 PLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
E +IDSGT+ YL Y + + F
Sbjct: 326 ------ETKSVAAEFAAVIDSGTSFTYLADPEYTELATNF 359
>gi|356554625|ref|XP_003545645.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 452
Score = 107 bits (268), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 88/289 (30%), Positives = 130/289 (44%), Gaps = 31/289 (10%)
Query: 53 KQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC- 111
K+ + H R+ +S ++ GN +P G Y + +G P Y + +D+GSDL WV C
Sbjct: 36 KKLSSDNHHRLSSSAVFKVQGNVYP--LGHYTVSLNIGYPPKLYDLDIDSGSDLTWVQCD 93
Query: 112 AGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFC-RTTYNNRYPSCSPGVRCEYVVT 170
A C C D L+ P+ + + C D C + Y SP +C+Y V
Sbjct: 94 APCKGCTKPRD-----QLYKPNHNL----VQCVDQLCSEVQLSMEYTCASPDDQCDYEVE 144
Query: 171 YGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGF 230
Y D S+ G VRD I +G++ + V FGCG Q GS++ A G+LG
Sbjct: 145 YADHGSSLGVLVRDYIPFQFTNGSV----VRPRVAFGCGYDQKYS-GSNSPPATSGVLGL 199
Query: 231 GQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSPK--VKTTPMVP--NMPHY 286
G +S+LSQL + G + HCL +GGG GD P + T M+P + HY
Sbjct: 200 GNGRASILSQLHSLGLIHNVVGHCLS-ARGGGFLFFGDDFIPSSGIVWTSMLPSSSEKHY 258
Query: 287 NVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVL 335
+ E+ G + + G E I DSG++ Y Y V+
Sbjct: 259 SSGPAELVFNGK------ATVVKGLE--LIFDSGSSYTYFNSQAYQAVV 299
>gi|297819834|ref|XP_002877800.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297323638|gb|EFH54059.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 531
Score = 107 bits (268), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 97/339 (28%), Positives = 150/339 (44%), Gaps = 46/339 (13%)
Query: 29 GNFVFEVENKFKAGGERERTL-------------SALKQHDTRRHGRMMASIDLEL---- 71
G F FEV + F ++ L L D GR +AS + +
Sbjct: 27 GKFGFEVHHIFSDAVKQSLGLDDLVPEQGSLEYFKVLAHRDRLIRGRGLASNNEDTPVTF 86
Query: 72 -GGNGHPSAT---GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTK-SDLG-- 124
GGN S LY+ V +GTP + V +DTGSDL W+ C + C D+G
Sbjct: 87 DGGNLTVSIKLLGSLYYANVSVGTPPSSFLVALDTGSDLFWLPCNCGTTCIRDLEDIGVP 146
Query: 125 --IKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFV 182
+ L L+ P+ S+TS I CSD C + ++ S SP C Y ++Y + + T+G +
Sbjct: 147 QSVPLNLYTPNASTTSSSIRCSDKRC---FGSKKCS-SPKSICPYQISYSNSTGTTGTLL 202
Query: 183 RDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLA 242
+D++ L NL P+ ++V GCG +Q+G + +V+G+LG G S+ S LA
Sbjct: 203 QDVLHLATEDENL--TPVKTNVTLGCGQKQTGLF--QRNNSVNGVLGLGIKGYSVPSLLA 258
Query: 243 AAGNVRKEFAHCLDVVKGG-GIFAIGDVVSPKVKTTPMVPNMPH--YNVILEEVEVGGNP 299
A F+ C V G G + GD + TP + P Y + + V VGG+P
Sbjct: 259 KANITADSFSMCFGRVIGNVGRISFGDKGYTDQEETPFISVAPSTAYGLNVTGVSVGGDP 318
Query: 300 LDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
+ G D+G++ +L Y ++ F
Sbjct: 319 V---------GTRLFAKFDTGSSFTHLMEPAYGVLTKSF 348
>gi|125590542|gb|EAZ30892.1| hypothetical protein OsJ_14967 [Oryza sativa Japonica Group]
Length = 516
Score = 107 bits (268), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 87/270 (32%), Positives = 125/270 (46%), Gaps = 40/270 (14%)
Query: 89 LGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFC 148
+GTP Y VDTGSDL+W C C C +S +FDPS SST + CS C
Sbjct: 173 IGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQS-----TPVFDPSSSSTYATVPCSSASC 227
Query: 149 RTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGC 208
++ C+ +C Y TYGD SST G + L ++ V+FGC
Sbjct: 228 SDLPTSK---CTSASKCGYTYTYGDSSSTQGVLATETFTLAKSK--------LPGVVFGC 276
Query: 209 GNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKG-------- 260
G+ GD G S A G++G G+ SL+SQL +F++CL +
Sbjct: 277 GDTNEGD-GFSQGA---GLVGLGRGPLSLVSQLGL-----DKFSYCLTSLDDTNNSPLLL 327
Query: 261 GGIFAI--GDVVSPKVKTTPMV--PNMPH-YNVILEEVEVGGNPLDLPTSLLGTGDE--R 313
G + I + V+TTP++ P+ P Y V L+ + VG + LP+S D+
Sbjct: 328 GSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTG 387
Query: 314 GTIIDSGTTLAYLPPMLYDLVLSQFRFWIA 343
G I+DSGT++ YL Y + F +A
Sbjct: 388 GVIVDSGTSITYLEVQGYRALKKAFAAQMA 417
>gi|357157325|ref|XP_003577760.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 413
Score = 107 bits (267), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 79/283 (27%), Positives = 129/283 (45%), Gaps = 40/283 (14%)
Query: 69 LELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC-AGCSRCPTKSDLGIKL 127
+L G+ +P TG Y+ + +G P Y++ +DTGSDL W+ C A C C +
Sbjct: 40 FQLNGDVYP--TGHYYVTMNIGDPAKPYFLDIDTGSDLTWLQCDAPCQSCNK-----VPH 92
Query: 128 TLFDPSKSSTSGEIACSDNFCRTTYNNRYPS--CSPGVRCEYVVTYGDGSSTSGYFVRDI 185
L+ P+K+ + C+ + C T ++ + P+ C+ +C+Y + Y D +S+ G V D
Sbjct: 93 PLYKPTKNKL---VPCAASICTTLHSAQSPNKKCAVPQQCDYQIKYTDSASSLGVLVTDN 149
Query: 186 IQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAG 245
L + ++ + S FGCG Q A DG+LG G+ + SL+SQL G
Sbjct: 150 FTLPLRN----SSSVRPSFTFGCGYDQQVGKNGVVQATTDGLLGLGKGSVSLVSQLKVLG 205
Query: 246 NVRKEFAHCLDVVKGGGIFAIGDVVSPKVKTT--PMVPNMP--HYN-----VILEEVEVG 296
+ HCL GGG GD V P + T PMV + +Y+ + + +G
Sbjct: 206 ITKNVLGHCLS-TNGGGFLFFGDNVVPTSRATWVPMVRSTSGNYYSPGSGTLYFDRRSLG 264
Query: 297 GNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
P+++ + DSG+T Y Y +S +
Sbjct: 265 VKPMEV-------------VFDSGSTYTYFAAQPYQATVSALK 294
>gi|242058093|ref|XP_002458192.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
gi|241930167|gb|EES03312.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
Length = 468
Score = 107 bits (267), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 92/280 (32%), Positives = 132/280 (47%), Gaps = 43/280 (15%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGC--SRCPTKSDLGIKLTLFDPSKSSTSGE 140
Y VGLGTP+ + +DTGSDL WV C C + C + D LFDPSKSST
Sbjct: 124 YVVTVGLGTPSVSQVLLIDTGSDLSWVQCQPCNSTTCYPQKD-----PLFDPSKSSTYAP 178
Query: 141 IACSDNFCRTTYNNRY-PSCSPG---VRCEYVVTYGDGSSTSGYFVRDIIQLNQ--ASGN 194
I C+ + CR ++ Y C+ G +C + +TYGDGS T G + + + L A +
Sbjct: 179 IPCNTDACRDLTDDGYGGGCASGDGAAQCGFAITYGDGSQTRGVYSNETLALAPGVAVKD 238
Query: 195 LKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHC 254
+ FGCG+ Q G + DG+LG G A SL+ Q A+ F++C
Sbjct: 239 FR---------FGCGHDQDG-----ANDKYDGLLGLGGAPESLVVQTASV--YGGAFSYC 282
Query: 255 L----DVVKGGGIFAIGDVVSPKVKT-----TPMVPNMPHYNVI-LEEVEVGGNPLDLPT 304
L + V + G V T TPM+ + V+ + + VGG P+D+P
Sbjct: 283 LPALNNQVGFLALGGGGAPSGGVVNTSGFVFTPMIREEETFYVVNMTGITVGGEPIDVPP 342
Query: 305 SLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFRFWIAS 344
S G IIDSGT + L Y+ + + FR +A+
Sbjct: 343 SAF----SGGMIIDSGTVVTELQHTAYNALQAAFRKAMAA 378
>gi|357167693|ref|XP_003581287.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 468
Score = 107 bits (267), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 93/284 (32%), Positives = 133/284 (46%), Gaps = 50/284 (17%)
Query: 62 RMMASIDLEL---GGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCP 118
+ A+ DL++ GNG + + +GTP Y VDTGSDL+W C C C
Sbjct: 100 KAAAAPDLQVPVHAGNGE------FLMDMSIGTPALAYAAIVDTGSDLVWTQCKPCVECF 153
Query: 119 TKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVR-CEYVVTYGDGSST 177
+S +FDPS SST + CS + C + +C+ + C Y TYGD SST
Sbjct: 154 NQST-----PVFDPSSSSTYSTLPCSSSLCSDLPTS---TCTSAAKDCGYTYTYGDASST 205
Query: 178 SGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSL 237
G + L + V FGCG+ GD G + A G++G G+ SL
Sbjct: 206 QGVLAAETFTLAKTK--------LPGVAFGCGDTNEGD-GFTQGA---GLVGLGRGPLSL 253
Query: 238 LSQLAAAGNVRKEFAHCL----DVVKG----GGIFAIG-DVVS-PKVKTTPMV--PNMP- 284
+SQL +F++CL D K G + AI D S ++TTP++ P+ P
Sbjct: 254 VSQLGLG-----KFSYCLTSLDDTSKSPLLLGSLAAISTDTASAAAIQTTPLIKNPSQPS 308
Query: 285 HYNVILEEVEVGGNPLDLPTSLLGTGDE--RGTIIDSGTTLAYL 326
Y V L+ + VG + LP S D+ G I+DSGT++ YL
Sbjct: 309 FYYVTLKALTVGSTRIPLPGSAFAVQDDGTGGVIVDSGTSITYL 352
>gi|449466304|ref|XP_004150866.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449485213|ref|XP_004157102.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 473
Score = 107 bits (267), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 87/274 (31%), Positives = 126/274 (45%), Gaps = 34/274 (12%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSR-CPTKSDLGIKLTLFDPSKSSTS 138
+G Y VGLGTP + DTGSDL W C C+R C + D +F PS+S+T
Sbjct: 128 SGNYIVSVGLGTPKKYLSLIFDTGSDLTWTQCQPCARYCYNQKD-----PVFVPSQSTTY 182
Query: 139 GEIACSDNFCRT--TYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLK 196
I+CS C + P CS C Y + YGD S + GYF ++ + L
Sbjct: 183 SNISCSSPDCSQLESGTGNQPGCSAARACIYGIQYGDQSFSVGYFAKETL-------TLT 235
Query: 197 TAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLD 256
+ + + +FGCG G GS+ G++G GQ S++ Q A + F++CL
Sbjct: 236 STDVIENFLFGCGQNNRGLFGSAA-----GLIGLGQDKISIVKQTAQ--KYGQVFSYCLP 288
Query: 257 VVKG--GGIFAIGDVVSPKVKTTPM-----VPNMPHYNVILEEVEVGGNPLDLPTSLLGT 309
G + G +K TP+ V N Y V + ++VGG + + +S+ T
Sbjct: 289 KTSSSTGYLTFGGGGGGGALKYTPITKAHGVANF--YGVDIVGMKVGGTQIPISSSVFST 346
Query: 310 GDERGTIIDSGTTLAYLPPMLYDLVLSQFRFWIA 343
G IIDSGT + LPP Y + S F +A
Sbjct: 347 ---SGAIIDSGTVITRLPPDAYSALKSAFEKGMA 377
>gi|357520119|ref|XP_003630348.1| Aspartic proteinase Asp1 [Medicago truncatula]
gi|355524370|gb|AET04824.1| Aspartic proteinase Asp1 [Medicago truncatula]
Length = 435
Score = 107 bits (267), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 91/304 (29%), Positives = 134/304 (44%), Gaps = 30/304 (9%)
Query: 37 NKFKAGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEY 96
NK K+G A+ + + +SI + GN +P G Y + +G P Y
Sbjct: 30 NKRKSGRNSILPGEAMSSRPSLMNHAAGSSIVFPIYGNVYP--VGFYNVTLNIGQPPRPY 87
Query: 97 YVQVDTGSDLLWVNC-AGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNR 155
++ VDTGS+L W+ C A CS+C L+ PS I C D C +
Sbjct: 88 FLDVDTGSELTWLQCDAPCSQCSETPH-----PLYKPSNDF----IPCKDPLCASLQPTD 138
Query: 156 YPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGD 215
+C +C+Y + Y D ST G + D+ LN +G L + GCG Q
Sbjct: 139 DYTCEDPNQCDYEIKYADQYSTLGVLLNDVYLLNFTNG----VQLKVRMALGCGYDQI-- 192
Query: 216 LGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVV-SPKV 274
ST +DGILG G+ +SL+SQL + G VR HCL +GGG G+V S ++
Sbjct: 193 FSPSTYHPLDGILGLGRGKASLISQLNSQGLVRNVMGHCLS-SRGGGYIFFGNVYDSSRM 251
Query: 275 KTTPM--VPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYD 332
TP+ + + HY+ E+ GG G G I D+G++ Y Y
Sbjct: 252 SWTPISSIDSGKHYSAGPAELVFGGRK-------TGVG-SLNIIFDTGSSYTYFNSQAYQ 303
Query: 333 LVLS 336
++S
Sbjct: 304 AMIS 307
>gi|297820186|ref|XP_002877976.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
lyrata]
gi|297323814|gb|EFH54235.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
lyrata]
Length = 425
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 84/266 (31%), Positives = 122/266 (45%), Gaps = 35/266 (13%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y + +GTP V +DT +D W+ C+GC C + LFDPSKSS+S +
Sbjct: 88 YIVRANIGTPAQAMLVALDTSNDAAWIPCSGCVGCSSS-------VLFDPSKSSSSRTLQ 140
Query: 143 CSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNS 202
C C+ N PSC+ C + +TYG GS+ Y +D + L +
Sbjct: 141 CEAPQCKQAPN---PSCTVSKSCGFNMTYG-GSAIEAYLTQDTLTL--------ATDVIP 188
Query: 203 SVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKG-- 260
+ FGC N+ SG T G++G G+ SL+SQ + + F++CL K
Sbjct: 189 NYTFGCINKASG-----TSLPAQGLMGLGRGPLSLISQ--SQNLYQSTFSYCLPNSKSSN 241
Query: 261 -GGIFAIGDVVSP-KVKTTPMVPNMPH---YNVILEEVEVGGNPLDLPTSLLG--TGDER 313
G +G P ++KTTP++ N Y V L + VG +D+PTS L
Sbjct: 242 FSGSLRLGPKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGA 301
Query: 314 GTIIDSGTTLAYLPPMLYDLVLSQFR 339
GTI DSGT L Y + ++FR
Sbjct: 302 GTIFDSGTVYTRLVEPAYVAMRNEFR 327
>gi|224130878|ref|XP_002320947.1| predicted protein [Populus trichocarpa]
gi|222861720|gb|EEE99262.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 97/298 (32%), Positives = 139/298 (46%), Gaps = 38/298 (12%)
Query: 46 ERTLSALKQHDTRRHGR---MMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDT 102
+R +AL++ +R H AS+ + + S G Y + LGTP + DT
Sbjct: 55 QRINNALRRSISRVHHFDPIAAASVSPKAAESDVTSNRGEYLMSLSLGTPPFKIMGIADT 114
Query: 103 GSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPG 162
GSDL+W C C RC + D LFDP S T + +C C + +CS G
Sbjct: 115 GSDLIWTQCKPCERCYKQVD-----PLFDPKSSKTYRDFSCDARQCSLLDQS---TCS-G 165
Query: 163 VRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSG---DLGSS 219
C+Y +YGD S T G D I L+ +G+ + P + GCG+ G D GS
Sbjct: 166 NICQYQYSYGDRSYTMGNVASDTITLDSTTGSPVSFP---KTVIGCGHENDGTFSDKGS- 221
Query: 220 TDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGG------IFAIGDVVS-P 272
GI+G G SL+SQ+ ++ V +F++CL + F VVS P
Sbjct: 222 ------GIVGLGAGPLSLISQMGSS--VGGKFSYCLVPLSSRAGNSSKLNFGSNAVVSGP 273
Query: 273 KVKTTPMVPN---MPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLP 327
V++TP++ + Y + LE + VG + S LGTG E IIDSGTTL +P
Sbjct: 274 GVQSTPLLSSETMSSFYFLTLEAMSVGNERIKFGDSSLGTG-EGNIIIDSGTTLTIVP 330
>gi|225437854|ref|XP_002264056.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
Length = 436
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 97/307 (31%), Positives = 136/307 (44%), Gaps = 45/307 (14%)
Query: 46 ERTLSALKQHDTR--RHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTG 103
ER A+K+ R R AS + + H + G + + +GTP + Y +DTG
Sbjct: 59 ERLQRAVKRGRLRLQRLSAKTASFEPSVEAPVH-AGNGEFLMNLAIGTPAETYSAIMDTG 117
Query: 104 SDLLWVNCAGCSRC---PTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCS 160
SDL+W C C C PT +FDP KSS+ ++ CS + C SCS
Sbjct: 118 SDLIWTQCKPCKVCFDQPTP--------IFDPEKSSSFSKLPCSSDLCVAL---PISSCS 166
Query: 161 PGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSST 220
G CEY +YGD SST G + AS S + FGCG G S
Sbjct: 167 DG--CEYRYSYGDHSSTQGVLATETFTFGDAS--------VSKIGFGCGEDNRGRAYSQG 216
Query: 221 DAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL---DVVKGGGIFAIGDVVSPKVKT- 276
G++G G+ SL+SQL +F++CL D KG +G + K
Sbjct: 217 ----AGLVGLGRGPLSLISQLGVP-----KFSYCLTSIDDSKGISTLLVGSEATVKSAIP 267
Query: 277 TPMV--PNMP-HYNVILEEVEVGGNPLDLPTSLLGTGDE--RGTIIDSGTTLAYLPPMLY 331
TP++ P+ P Y + LE + VG L + S D+ G IIDSGTT+ YL +
Sbjct: 268 TPLIQNPSRPSFYYLSLEGISVGDTLLPIEKSTFSIQDDGSGGLIIDSGTTITYLKDSAF 327
Query: 332 DLVLSQF 338
+ +F
Sbjct: 328 AALKKEF 334
>gi|356546370|ref|XP_003541599.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 434
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 83/260 (31%), Positives = 129/260 (49%), Gaps = 24/260 (9%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGE 140
G Y +G P + Y +DTGSD++W+ C C +C ++ +FDPSKS+T
Sbjct: 84 GEYLISYSVGIPPFQLYGIIDTGSDMIWLQCKPCEKCYNQTT-----RIFDPSKSNTYKI 138
Query: 141 IACSDNFCRTTYNNRYPSCSPGVR--CEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTA 198
+ S C++ + SCS R CEY + YGDGS + G + + L +G+ +
Sbjct: 139 LPFSSTTCQSVEDT---SCSSDNRKMCEYTIYYGDGSYSQGDLSVETLTLGSTNGS--SV 193
Query: 199 PLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQL-AAAGNVRKEFAHCLDV 257
+VI GCG + S + GI+G G SL++QL + ++ ++F++CL
Sbjct: 194 KFRRTVI-GCGRNNT----VSFEGKSSGIVGLGNGPVSLINQLRRRSSSIGRKFSYCLAS 248
Query: 258 VKG-GGIFAIGD--VVSPK-VKTTPMVPNMPH--YNVILEEVEVGGNPLDLPTSLLGTGD 311
+ GD VVS +TP+V + P Y + LE VG N ++ +S G+
Sbjct: 249 MSNISSKLNFGDAAVVSGDGTVSTPIVTHDPKVFYYLTLEAFSVGNNRIEFTSSSFRFGE 308
Query: 312 ERGTIIDSGTTLAYLPPMLY 331
+ IIDSGTTL LP +Y
Sbjct: 309 KGNIIIDSGTTLTLLPNDIY 328
>gi|326515172|dbj|BAK03499.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 494
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 84/259 (32%), Positives = 121/259 (46%), Gaps = 38/259 (14%)
Query: 98 VQVDTGSDLLWVNCAGCS--RCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCR---TTY 152
V VDT SD+ WV C C +C + D L+DP+KSST I C C+ ++Y
Sbjct: 171 VVVDTSSDIPWVQCLPCPIPQCHLQKD-----PLYDPAKSSTFAPIPCGSPACKELGSSY 225
Query: 153 NNRYPSCSPGV-RCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNR 211
N CSP C+Y+V YGDG +T+G +V D + ++ + FGC +
Sbjct: 226 GN---GCSPTTDECKYIVNYGDGKATTGTYVTDTLTMSPTI-------VVKDFRFGCSHA 275
Query: 212 QSGDLGSSTDAAVDGILGFGQANSSLLSQLA-AAGNVRKEFAHCLDVVKGGGIFAIGDVV 270
G + GIL G SLL Q A A GN F++C+ G ++G V
Sbjct: 276 VRGSFSNQN----AGILALGGGRGSLLEQTADAYGNA---FSYCIPKPSSAGFLSLGGPV 328
Query: 271 --SPKVKTTPMVPNM---PHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAY 325
S K TP++ N Y V LE + V G L +P + T G ++DSG +
Sbjct: 329 EASLKFSYTPLIKNKHAPTFYIVHLEAIIVAGKQLAVPPTAFAT----GAVMDSGAVVTQ 384
Query: 326 LPPMLYDLVLSQFRFWIAS 344
LPP +Y + + FR +A+
Sbjct: 385 LPPQVYAALRAAFRSAMAA 403
>gi|242044888|ref|XP_002460315.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
gi|241923692|gb|EER96836.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
Length = 456
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 88/285 (30%), Positives = 130/285 (45%), Gaps = 49/285 (17%)
Query: 76 HPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKS 135
PS Y + +GTP +DTGSDL+W CA C+ C + D LF P +S
Sbjct: 95 RPSGDLEYVVDLAIGTPPQPVSALLDTGSDLIWTQCAPCASCLAQPD-----PLFAPGES 149
Query: 136 STSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGN- 194
++ + C+ C ++ C C Y YGDG+ T G + + + G+
Sbjct: 150 ASYEPMRCAGQLCSDILHH---GCEMPDTCTYRYNYGDGTMTMGVYATERFTFTSSGGDR 206
Query: 195 LKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHC 254
L T PL FGCG+ G L + + GI+GFG+ SL+SQL+ + F++C
Sbjct: 207 LMTVPLG----FGCGSMNVGSLNNGS-----GIVGFGRNPLSLVSQLSI-----RRFSYC 252
Query: 255 LDVVK------------GGGIFAIGDVVSPKVKTTPMVPNMPH---YNVILEEVEVGGNP 299
L GG++ GD P V+TTP++ ++ + Y V L + VG
Sbjct: 253 LTSYGSGRKSTLLFGSLSGGVY--GDATGP-VQTTPLLQSLQNPTFYYVHLAGLTVGARR 309
Query: 300 LDLPTSLL-----GTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
L +P S G+G G I+DSGT L LP + V+ FR
Sbjct: 310 LRIPESAFALRPDGSG---GVIVDSGTALTLLPGAVLAEVVRAFR 351
>gi|240255485|ref|NP_189841.4| aspartyl protease family protein [Arabidopsis thaliana]
gi|332644216|gb|AEE77737.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 430
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 86/294 (29%), Positives = 129/294 (43%), Gaps = 60/294 (20%)
Query: 45 RERTLSALKQHDTRRHGRMMAS-----IDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQ 99
E L+ L D+ RHGR++ S + ++ + + LY+T V +GTP E V
Sbjct: 35 HELDLTQLMTFDSARHGRLLQSPVHGSFNWKVERDTSILLSALYYTTVQIGTPPRELDVV 94
Query: 100 VDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSC 159
+DTGSDL+WV+C C CP + +T FDP SS++ ++ACSD C + + C
Sbjct: 95 IDTGSDLVWVSCNSCVGCPLHN-----VTFFDPGASSSAVKLACSDKRCSSDLQKK-SRC 148
Query: 160 SPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSS 219
S C Y V YGDGS TSGY++ D+I + S A ++S + RQ +G+
Sbjct: 149 SLLESCTYKVEYGDGSVTSGYYISDLISFDTMSDWTYIAFRDNST-WHPWVRQGAIIGT- 206
Query: 220 TDAAVDGILGFGQANSSLLSQLAAAG-NVRKEFAHCLDVVKGGGIFAIGDVVSPKVKTTP 278
F S+ S +++ +F+H + V A+ D+ P
Sbjct: 207 ----------FPALCSTPCSTVSSQPLYYNPQFSHMMTV-------AVNDLRLP------ 243
Query: 279 MVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYD 332
+ S+ GTIIDSGTTL + P YD
Sbjct: 244 -----------------------IDPSVFSVAKGYGTIIDSGTTLVHFPGEAYD 274
>gi|356551638|ref|XP_003544181.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 880
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 83/268 (30%), Positives = 124/268 (46%), Gaps = 29/268 (10%)
Query: 82 LYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSD-----LGIKLTLFDPSKSS 136
L++T + +GTP + V +D GSD+LWV C C C + S L L + PS S+
Sbjct: 104 LHYTWIDIGTPNVSFLVALDAGSDMLWVPC-DCIECASLSAGNYNVLDRDLNQYRPSLSN 162
Query: 137 TSGEIACSDNFCRTTYNNRYPSCSPGVR--CEYVVTYGDG-SSTSGYFVRDIIQLNQASG 193
TS + C C S G + C Y V Y +S+SGY D + L
Sbjct: 163 TSRHLPCGHKLCDVH------SVCKGSKDPCPYAVQYSSANTSSSGYVFEDKLHLTSNGK 216
Query: 194 NLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAH 253
+ + + +S+I GCG +Q+G+ A DG+LG G N S+ S LA AG ++ F+
Sbjct: 217 HAEQNSVQASIILGCGRKQTGEYLRG--AGPDGVLGLGPGNISVPSLLAKAGLIQNSFSI 274
Query: 254 CLDVVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVE---VGGNPLDLPTSLLGTG 310
C + + G I GD +TP +P +N + VE VG SL
Sbjct: 275 CFEENESGRII-FGDQGHVTQHSTPFLPIDGKFNAYIVGVESFCVG--------SLCLKE 325
Query: 311 DERGTIIDSGTTLAYLPPMLYDLVLSQF 338
+IDSG++ +LP +Y V+ +F
Sbjct: 326 TRFQALIDSGSSFTFLPNEVYQKVVIEF 353
>gi|356558300|ref|XP_003547445.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 447
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 92/302 (30%), Positives = 136/302 (45%), Gaps = 46/302 (15%)
Query: 54 QHDTRRHGRMMASIDLELGGNGH------PSATG-LYFTKVGLGTPTDEYYVQVDTGSDL 106
QH R + A I+ L N PS TG + +G P V +DTGSD+
Sbjct: 65 QHSAARLANIQARIEGSLVSNNDYKARVSPSLTGRTIMANISIGQPPIPQLVVMDTGSDI 124
Query: 107 LWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCE 166
LWV C C+ C +DLG+ LFDPSKSST + C+T P G RC+
Sbjct: 125 LWVMCTPCTNC--DNDLGL---LFDPSKSSTFSPL------CKT------PCDFEGCRCD 167
Query: 167 ---YVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAA 223
+ VTY D S+ SG F RD + S V+FGCG+ ++G TD
Sbjct: 168 PIPFTVTYADNSTASGTFGRDTVVFETTDEGTSRI---SDVLFGCGH----NIGHDTDPG 220
Query: 224 VDGILGFGQANSSLLSQLAAAGNVRKEFAHCL----DVVKGGGIFAIGDVVSPKVKTTPM 279
+GILG SL+++L ++F++C+ D +G+ + +TP
Sbjct: 221 HNGILGLNNGPDSLVTKLG------QKFSYCIGNLADPYYNYHQLILGEGADLEGYSTPF 274
Query: 280 VPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDER--GTIIDSGTTLAYLPPMLYDLVLSQ 337
Y V +E + VG LD+ + R G IID+G+T+ +L ++ L+ +
Sbjct: 275 EVYNGFYYVTMEGISVGEKRLDIAPETFEMKENRAGGVIIDTGSTITFLVDSVHKLLSKE 334
Query: 338 FR 339
R
Sbjct: 335 VR 336
>gi|255079464|ref|XP_002503312.1| predicted protein [Micromonas sp. RCC299]
gi|226518578|gb|ACO64570.1| predicted protein [Micromonas sp. RCC299]
Length = 649
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 92/308 (29%), Positives = 142/308 (46%), Gaps = 49/308 (15%)
Query: 49 LSALKQHDTRRHGRMMASIDLELGGNGHP-----SATGLYFTKVGLGTPTDE-YYVQVDT 102
L+ L++HD R R++ S G + P G Y+ + LG P+ + V VDT
Sbjct: 73 LAHLREHDAHRRRRILESPAESPGASTFPLHGSVKEHGYYYANIALGDPSPRTFQVIVDT 132
Query: 103 GSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPG 162
GS L +V CA C++C T + T FDP T + C + C+ + PG
Sbjct: 133 GSTLTYVPCATCAKCGTHT----GGTRFDP----TGKWLTCQEKQCKA-------AGGPG 177
Query: 163 V----------RCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNS-SVIFGCGNR 211
+ RC Y TY +GS SG VRD + G++ A + V+FGC N
Sbjct: 178 ICAGGRGAAANRCTYSRTYAEGSGVSGDLVRDKMHFG---GDIAPATNGTLDVVFGCTNA 234
Query: 212 QSGDLGSSTDAAVDGILGFGQAN-SSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVV 270
+SG + D DG++G G +S+ +QLA + + F+ C +GGG + G +
Sbjct: 235 ESGTI---HDQEADGLIGLGNNQFASIPNQLADTHGLPRVFSLCFGSFEGGGALSFGRLP 291
Query: 271 ----SPKVKTTPMVPNMPH---YNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTL 323
+P + T M N H Y V +++G + P+ L G GT++DSGTT
Sbjct: 292 ATPHTPPLVYTDMRVNEAHPAYYVVSTAAMKIGDVAVATPSD-LAVG--YGTVMDSGTTF 348
Query: 324 AYLPPMLY 331
Y+P ++
Sbjct: 349 TYVPTKVF 356
>gi|242077672|ref|XP_002448772.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
gi|241939955|gb|EES13100.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
Length = 471
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 84/280 (30%), Positives = 120/280 (42%), Gaps = 38/280 (13%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPS 133
+G +G YF +VG+G+P E Y+ VD+GSD++WV C C C ++D LFDP+
Sbjct: 116 SGLDEGSGEYFVRVGIGSPPTEQYLVVDSGSDVIWVQCKPCLECYAQAD-----PLFDPA 170
Query: 134 KSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASG 193
S+T ++C CRT R C CEY V+YGDGS T G + + L +
Sbjct: 171 SSATFSAVSCGSAICRTL---RTSGCGDSGGCEYEVSYGDGSYTKGTLALETLTLGGTA- 226
Query: 194 NLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAH 253
V GCG+R G G+LG G SL+ QL F++
Sbjct: 227 -------VEGVAIGCGHRNRGLF-----VGAAGLLGLGWGPMSLVGQLGG--AAGGAFSY 272
Query: 254 CLDVVKGGG----------IFAIGDVVSPKVKTTPMV--PNMP-HYNVILEEVEVGGNPL 300
CL G G + + V P+V P P Y V + + VG L
Sbjct: 273 CLASRGGSGSGAADAAGSLVLGRSEAVPEGAVWVPLVRNPQAPSFYYVGVSGIGVGDERL 332
Query: 301 DLPTSLLGTGDE--RGTIIDSGTTLAYLPPMLYDLVLSQF 338
L L ++ G ++D+GT + LP Y + F
Sbjct: 333 PLQDGLFQLTEDGGGGVVMDTGTAVTRLPQEAYAALRDAF 372
>gi|91806508|gb|ABE65981.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 203
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 59/153 (38%), Positives = 85/153 (55%), Gaps = 11/153 (7%)
Query: 46 ERTLSALKQHDTRRHGRMMAS-----IDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQV 100
E L+ L D+ RHGR++ S + ++ + + LY+T V +GTP E V +
Sbjct: 36 ELDLTQLMTFDSARHGRLLQSPVHGSFNWKVERDTSILLSALYYTTVQIGTPPRELDVVI 95
Query: 101 DTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCS 160
DTGSDL+WV+C C CP + +T FDP SS++ ++ACSD C + + CS
Sbjct: 96 DTGSDLVWVSCNSCVGCPLHN-----VTFFDPGASSSAVKLACSDKRCSSDLQKK-SRCS 149
Query: 161 PGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASG 193
C Y V YGDGS TSGY++ D+I + SG
Sbjct: 150 LLESCTYKVEYGDGSVTSGYYISDLISFDTMSG 182
>gi|312283333|dbj|BAJ34532.1| unnamed protein product [Thellungiella halophila]
Length = 428
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 87/269 (32%), Positives = 124/269 (46%), Gaps = 35/269 (13%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y + +GTP V +DT +D WV C+GC C + LFDPSKSS+S +
Sbjct: 91 YIVRANIGTPAQPMLVALDTSNDAAWVPCSGCVGCASS-------VLFDPSKSSSSRNLQ 143
Query: 143 CSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNS 202
C C+ N P+C+ G C + +TYG GS+ +D + L A+ +K
Sbjct: 144 CDAPQCKQAPN---PTCTAGKSCGFNMTYG-GSTIEASLTQDTLTL--ANDVIK------ 191
Query: 203 SVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKG-- 260
S FGC ++ +G T G++G G+ SL+SQ F++CL K
Sbjct: 192 SYTFGCISKATG-----TSLPAQGLMGLGRGPLSLISQ--TQNLYMSTFSYCLPNSKSSN 244
Query: 261 -GGIFAIGDVVSP-KVKTTPMVPNMPH---YNVILEEVEVGGNPLDLPTSLLG--TGDER 313
G +G P ++KTTP++ N Y V L + VG +D+PTS L
Sbjct: 245 FSGSLRLGPKYQPVRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDASTGA 304
Query: 314 GTIIDSGTTLAYLPPMLYDLVLSQFRFWI 342
GTI DSGT L Y V ++FR I
Sbjct: 305 GTIFDSGTVFTRLVEPAYVAVRNEFRRRI 333
>gi|8979711|emb|CAB96832.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
thaliana]
Length = 446
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 93/298 (31%), Positives = 132/298 (44%), Gaps = 32/298 (10%)
Query: 50 SALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWV 109
S L + H S DL +G +G Y VGLGTP ++ + DTGSDL W
Sbjct: 72 SKLSKKLATDHVSESKSTDLP-AKDGSTLGSGNYIVTVGLGTPKNDLSLIFDTGSDLTWT 130
Query: 110 NCAGCSR-CPTKSDLGIKLTLFDPSKSSTSGEIACSDNFC--RTTYNNRYPSCSPGVRCE 166
C C R C + K +F+PSKS++ ++CS C ++ SCS C
Sbjct: 131 QCQPCVRTCYDQ-----KEPIFNPSKSTSYYNVSCSSAACGSLSSATGNAGSCSAS-NCI 184
Query: 167 YVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDG 226
Y + YGD S + G+ ++ L + + V FGCG G V G
Sbjct: 185 YGIQYGDQSFSVGFLAKEKFTLTNSD-------VFDGVYFGCGENNQGLF-----TGVAG 232
Query: 227 ILGFGQANSSLLSQLAAAGNVRKEFAHCL-DVVKGGGIFAIGDV-VSPKVKTTP---MVP 281
+LG G+ S SQ A A N K F++CL G G +S VK TP +
Sbjct: 233 LLGLGRDKLSFPSQTATAYN--KIFSYCLPSSASYTGHLTFGSAGISRSVKFTPISTITD 290
Query: 282 NMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
Y + + + VGG L +P+++ T G +IDSGT + LPP Y + S F+
Sbjct: 291 GTSFYGLNIVAITVGGQKLPIPSTVFST---PGALIDSGTVITRLPPKAYAALRSSFK 345
>gi|147862576|emb|CAN79341.1| hypothetical protein VITISV_006338 [Vitis vinifera]
Length = 436
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 97/307 (31%), Positives = 136/307 (44%), Gaps = 45/307 (14%)
Query: 46 ERTLSALKQHDTR--RHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTG 103
ER A+K+ R R AS + + H + G + + +GTP + Y +DTG
Sbjct: 59 ERLQRAVKRGRLRLQRLSAKTASFEPSVEAPVH-AGNGEFLMNLAIGTPAETYSAIMDTG 117
Query: 104 SDLLWVNCAGCSRC---PTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCS 160
SDL+W C C C PT +FDP KSS+ ++ CS + C SCS
Sbjct: 118 SDLIWTQCKPCKVCFDQPTP--------IFDPEKSSSFSKLPCSSDLCVAL---PISSCS 166
Query: 161 PGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSST 220
G CEY +YGD SST G + AS S + FGCG G S
Sbjct: 167 DG--CEYRYSYGDHSSTQGVLATETFTFGDAS--------VSKIGFGCGEDNRGRAYSQG 216
Query: 221 DAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL---DVVKGGGIFAIGDVVSPKVKT- 276
G++G G+ SL+SQL +F++CL D KG +G + K
Sbjct: 217 ----AGLVGLGRGPLSLISQLGVP-----KFSYCLTSIDDSKGISTLLVGSEATVKSAIP 267
Query: 277 TPMV--PNMP-HYNVILEEVEVGGNPLDLPTSLLGTGDE--RGTIIDSGTTLAYLPPMLY 331
TP++ P+ P Y + LE + VG L + S D+ G IIDSGTT+ YL +
Sbjct: 268 TPLIQNPSRPSFYYLSLEGISVGDTLLPIEKSTFSIQDDGSGGLIIDSGTTITYLKDNAF 327
Query: 332 DLVLSQF 338
+ +F
Sbjct: 328 AALKKEF 334
>gi|145693992|gb|ABP93696.1| unknown protein isoform 1 [Lemna minor]
Length = 350
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 97/267 (36%), Positives = 134/267 (50%), Gaps = 36/267 (13%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGC-SRCPTKSDLGIKLTLFDPSKSSTS 138
T Y VG GTP V DTGS++ W+ C C C + + LFDP+ SST
Sbjct: 13 TANYVITVGFGTPKKNQTVIFDTGSNVNWIQCKPCVVSCYPQQE-----PLFDPTLSSTY 67
Query: 139 GEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTA 198
I+C+ C T ++R CS G C Y VTYGDGSST G+ + L A+GN+
Sbjct: 68 RNISCTSAAC-TGLSSR--GCS-GSTCVYGVTYGDGSSTVGFLATETFTL--AAGNVF-- 119
Query: 199 PLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAA-GNVRKEFAHCLDV 257
++ IFGCG G T AA G++G G++ SL SQLA + GN+ F++CL
Sbjct: 120 ---NNFIFGCGQNNQGLF---TGAA--GLIGLGRSPYSLNSQLATSLGNI---FSYCLPS 168
Query: 258 VKGG-GIFAIGDVV-SP---KVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDE 312
G IG+ + +P + T P + Y + L + VGG L L +++
Sbjct: 169 TSSATGYLNIGNPLRTPGYTAMLTNSRAPTL--YFIDLIGISVGGTRLALSSTVF---QS 223
Query: 313 RGTIIDSGTTLAYLPPMLYDLVLSQFR 339
GTIIDSGT + LPP Y + + FR
Sbjct: 224 VGTIIDSGTVITRLPPTAYGALRTAFR 250
>gi|297841447|ref|XP_002888605.1| hypothetical protein ARALYDRAFT_475850 [Arabidopsis lyrata subsp.
lyrata]
gi|297334446|gb|EFH64864.1| hypothetical protein ARALYDRAFT_475850 [Arabidopsis lyrata subsp.
lyrata]
Length = 410
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 87/277 (31%), Positives = 122/277 (44%), Gaps = 29/277 (10%)
Query: 64 MASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC-AGCSRCPTKSD 122
++S+ L L GN P G Y + +G P + +DTGSD+ WV C A C+ C
Sbjct: 37 LSSVVLLLSGNVFP--LGYYSVLLQIGNPPKAFEFDIDTGSDITWVQCDAPCTGCNLPPK 94
Query: 123 LGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSC-SPGVRCEYVVTYGDGSSTSGYF 181
L K K +T + CSD C + P C +P +C+Y V Y D S+ G
Sbjct: 95 LQYK------PKGNT---VPCSDPICLALHFPNNPQCPNPKEQCDYEVNYADQGSSMGAL 145
Query: 182 VRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQL 241
V D +G + + + FGCG QS + A G+LG G+ LL+QL
Sbjct: 146 VIDQFPFKLLNG----SAMQPRLAFGCGYDQSYP-SAHPPPATAGVLGLGRGKIGLLTQL 200
Query: 242 AAAGNVRKEFAHCLDVVKGGGIFAIGDVVSPK--VKTTPMVPNMPHYNVILEEVEVGGNP 299
+AG R HCL KGGG GD + P V TP++P HY E+ G
Sbjct: 201 VSAGLTRNVVGHCLS-SKGGGYLFFGDTLIPSLGVAWTPLLPPDNHYTTGPAELLFNGK- 258
Query: 300 LDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLS 336
PT L G I D+G++ Y Y +++
Sbjct: 259 ---PTGLKGL----KLIFDTGSSYTYFNSKTYQTIVN 288
>gi|168043550|ref|XP_001774247.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674374|gb|EDQ60883.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 419
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 99/308 (32%), Positives = 142/308 (46%), Gaps = 45/308 (14%)
Query: 40 KAGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQ 99
K G ER LS + R +AS GNG Y + G+P + V
Sbjct: 49 KRGAERRAQLSKHILAEGRLFSTPVAS------GNGE------YLIDISFGSPPQKASVI 96
Query: 100 VDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSC 159
VDTGSDL+W C C C + + +FDP KSST ++C+ NFC + + SC
Sbjct: 97 VDTGSDLIWTQCLPCETCNAAASV-----IFDPVKSSTYDTVSCASNFCSSL---PFQSC 148
Query: 160 SPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSS 219
+ C+Y YGDGSSTSG + + T P +V FGCG+ +LGS
Sbjct: 149 T--TSCKYDYMYGDGSSTSG-----ALSTETVTVGTGTIP---NVAFGCGHT---NLGSF 195
Query: 220 TDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGI--FAIGDVVSP-KVKT 276
AA GI+G GQ SL+SQ A+ K+F++CL + IGD + V
Sbjct: 196 AGAA--GIVGLGQGPLSLISQ--ASSITSKKFSYCLVPLGSTKTSPMLIGDSAAAGGVAY 251
Query: 277 TPMVPNMPH---YNVILEEVEVGGNPLDLP--TSLLGTGDERGTIIDSGTTLAYLPPMLY 331
T ++ N + Y L + V G + P T + + G I+DSGTTL YL +
Sbjct: 252 TALLTNTANPTFYYADLTGISVSGKAVTYPVGTFSIDASGQGGFILDSGTTLTYLETGAF 311
Query: 332 DLVLSQFR 339
+ +++ +
Sbjct: 312 NALVAALK 319
>gi|255563827|ref|XP_002522914.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223537841|gb|EEF39457.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 442
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 90/273 (32%), Positives = 131/273 (47%), Gaps = 39/273 (14%)
Query: 78 SATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSST 137
S G + + +GTP + Y +DTGSDL+W C C++C + +FDP KSS+
Sbjct: 95 SGNGEFLMNLAIGTPPETYSAIMDTGSDLIWTQCKPCTQCFDQPS-----PIFDPKKSSS 149
Query: 138 SGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKT 197
+++CS C+ P S CEY+ TYGD SST G + + S
Sbjct: 150 FSKLSCSSQLCKA-----LPQSSCSDSCEYLYTYGDYSSTQGTMATETFTFGKVS----- 199
Query: 198 APLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDV 257
+V FGCG GD G + + G++G G+ SL+SQL A +F++CL
Sbjct: 200 ---IPNVGFGCGEDNEGD-GFTQGS---GLVGLGRGPLSLVSQLKEA-----KFSYCLTS 247
Query: 258 VKGG-------GIFAIGDVVSPKVKTTPMVPN--MP-HYNVILEEVEVGGNPLDLPTSLL 307
+ G A + S ++TTP++ N P Y + LE + VGG L + S
Sbjct: 248 IDDTKTSTLLMGSLASVNGTSAAIRTTPLIQNPLQPSFYYLSLEGISVGGTRLPIKESTF 307
Query: 308 GTGDE--RGTIIDSGTTLAYLPPMLYDLVLSQF 338
D+ G IIDSGTT+ YL +DLV +F
Sbjct: 308 QLQDDGTGGLIIDSGTTITYLEESAFDLVKKEF 340
>gi|22326716|ref|NP_196638.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|18700103|gb|AAL77663.1| AT5g10770/T30N20_40 [Arabidopsis thaliana]
gi|24111269|gb|AAN46758.1| At5g10770/T30N20_40 [Arabidopsis thaliana]
gi|332004211|gb|AED91594.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 474
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 93/298 (31%), Positives = 132/298 (44%), Gaps = 32/298 (10%)
Query: 50 SALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWV 109
S L + H S DL +G +G Y VGLGTP ++ + DTGSDL W
Sbjct: 100 SKLSKKLATDHVSESKSTDLP-AKDGSTLGSGNYIVTVGLGTPKNDLSLIFDTGSDLTWT 158
Query: 110 NCAGCSR-CPTKSDLGIKLTLFDPSKSSTSGEIACSDNFC--RTTYNNRYPSCSPGVRCE 166
C C R C + K +F+PSKS++ ++CS C ++ SCS C
Sbjct: 159 QCQPCVRTCYDQ-----KEPIFNPSKSTSYYNVSCSSAACGSLSSATGNAGSCSAS-NCI 212
Query: 167 YVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDG 226
Y + YGD S + G+ ++ L + + V FGCG G V G
Sbjct: 213 YGIQYGDQSFSVGFLAKEKFTLTNSD-------VFDGVYFGCGENNQGLF-----TGVAG 260
Query: 227 ILGFGQANSSLLSQLAAAGNVRKEFAHCL-DVVKGGGIFAIGDV-VSPKVKTTP---MVP 281
+LG G+ S SQ A A N K F++CL G G +S VK TP +
Sbjct: 261 LLGLGRDKLSFPSQTATAYN--KIFSYCLPSSASYTGHLTFGSAGISRSVKFTPISTITD 318
Query: 282 NMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
Y + + + VGG L +P+++ T G +IDSGT + LPP Y + S F+
Sbjct: 319 GTSFYGLNIVAITVGGQKLPIPSTVFST---PGALIDSGTVITRLPPKAYAALRSSFK 373
>gi|54290717|dbj|BAD62387.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
sativa Japonica Group]
gi|215734915|dbj|BAG95637.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 469
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 87/262 (33%), Positives = 118/262 (45%), Gaps = 34/262 (12%)
Query: 79 ATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCS-RCPTKSDLGIKLTLFDPSKSST 137
A G Y T++GLGTP Y + VDTGS L W+ C+ CS C ++ +FDP S T
Sbjct: 127 AVGNYVTRLGLGTPATSYVMVVDTGSSLTWLQCSPCSVSCHRQAG-----PVFDPRASGT 181
Query: 138 SGEIACSDNFC----RTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASG 193
+ CS + C T N +CS C Y +YGD S + GY +D + SG
Sbjct: 182 YAAVQCSSSECGELQAATLNPS--ACSVSNVCIYQASYGDSSYSVGYLSKDTVSFG--SG 237
Query: 194 NLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAH 253
+ +GCG G G S G++G + SLL QLA ++ F++
Sbjct: 238 SFP------GFYYGCGQDNEGLFGRSA-----GLIGLAKNKLSLLYQLAP--SLGYAFSY 284
Query: 254 CLDVVK-GGGIFAIGDVVSPKVKTTPMVP---NMPHYNVILEEVEVGGNPLDLPTSLLGT 309
CL G +IG + TPM + Y V L + V G PL +P S
Sbjct: 285 CLPTSSAAAGYLSIGSYNPGQYSYTPMASSSLDASLYFVTLSGISVAGAPLAVPPSEY-- 342
Query: 310 GDERGTIIDSGTTLAYLPPMLY 331
TIIDSGT + LPP +Y
Sbjct: 343 -RSLPTIIDSGTVITRLPPNVY 363
>gi|168014386|ref|XP_001759733.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689272|gb|EDQ75645.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 392
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 89/284 (31%), Positives = 126/284 (44%), Gaps = 49/284 (17%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSG 139
+G YF LGTP ++++ VDTGSDL +V CA C C + L+ PS SST
Sbjct: 31 SGQYFVDFSLGTPEQKFHLIVDTGSDLAFVQCAPCDLCYEQDG-----PLYQPSNSSTFT 85
Query: 140 EIACSDNFCR-------TTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQAS 192
+ C C ++ YP P C Y YGD SST G F + +
Sbjct: 86 PVPCDSAECLLIPAPVGAPCSSSYPESPPQGACSYEYRYGDNSSTVGVFAYETATV---- 141
Query: 193 GNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFA 252
G ++ + V FGCGNR G S+ G+LG GQ S SQ A +FA
Sbjct: 142 GGIRV----NHVAFGCGNRNQGSFVSA-----GGVLGLGQGALSFTSQAGYA--FENKFA 190
Query: 253 HCL-DVVKGGGIFA-----------IGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPL 300
+CL + +F+ I D+ + + P+ P++ Y V + + GG L
Sbjct: 191 YCLTSYLSPTSVFSSLIFGDDMMSTIHDLQFTPLVSNPLNPSV--YYVQIVRICFGGETL 248
Query: 301 DLPTSL-----LGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
+P S +G G GTI DSGTT+ Y P Y +++ F
Sbjct: 249 LIPDSAWKIDSVGNG---GTIFDSGTTVTYWSPQAYARIIAAFE 289
>gi|212722026|ref|NP_001131674.1| uncharacterized protein LOC100193034 precursor [Zea mays]
gi|194692214|gb|ACF80191.1| unknown [Zea mays]
gi|413946454|gb|AFW79103.1| hypothetical protein ZEAMMB73_752316 [Zea mays]
Length = 441
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 95/314 (30%), Positives = 133/314 (42%), Gaps = 35/314 (11%)
Query: 40 KAGGERERTLSALKQHDTRRHGRMMASIDLELGG-------NGHPSATGLYFTKVGLGTP 92
+A G+R R Q +RR GR + ++ +G + TG YF KV +GTP
Sbjct: 41 RARGDRRRHAYISAQLPSRRGGRQRVAAEVASSSAVSLPMSSGAYAGTGQYFVKVLVGTP 100
Query: 93 TDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTY 152
E+ + DTGS+L WV CAG + P +F P S + + CS + C+
Sbjct: 101 AQEFTLVADTGSELTWVKCAGGASPPG--------LVFRPEASKSWAPVPCSSDTCKLDV 152
Query: 153 NNRYPSCSPGVR-CEYVVTYGDGSSTS-GYFVRDIIQLNQASGNLKTAPLNSSVIFGCGN 210
+CS C Y Y +GS+ + G D + G K A L V+ GC +
Sbjct: 153 PFSLANCSSSASPCSYDYRYKEGSAGALGVVGTDSATIALPGG--KVAQLQ-DVVLGCSS 209
Query: 211 RQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHC----LDVVKGGGIFAI 266
G S VDG+L G A S S+ AA F++C L G A
Sbjct: 210 THDGQSFKS----VDGVLSLGNAKISFASR--AAARFGGSFSYCLVDHLAPRNATGYLAF 263
Query: 267 GDVVSPKVKTTP----MVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTT 322
G P+ T + P MP Y V ++ V V G LD+P + G I+DSGTT
Sbjct: 264 GPGQVPRTPATQTKLFLDPAMPFYGVKVDAVHVAGQALDIPAEVWDP-KSGGVILDSGTT 322
Query: 323 LAYLPPMLYDLVLS 336
L L Y V++
Sbjct: 323 LTVLATPAYKAVVA 336
>gi|7413629|emb|CAB85978.1| putative protein [Arabidopsis thaliana]
Length = 356
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 86/294 (29%), Positives = 129/294 (43%), Gaps = 60/294 (20%)
Query: 45 RERTLSALKQHDTRRHGRMMAS-----IDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQ 99
E L+ L D+ RHGR++ S + ++ + + LY+T V +GTP E V
Sbjct: 35 HELDLTQLMTFDSARHGRLLQSPVHGSFNWKVERDTSILLSALYYTTVQIGTPPRELDVV 94
Query: 100 VDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSC 159
+DTGSDL+WV+C C CP + +T FDP SS++ ++ACSD C + + C
Sbjct: 95 IDTGSDLVWVSCNSCVGCPLHN-----VTFFDPGASSSAVKLACSDKRCSSDLQKK-SRC 148
Query: 160 SPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSS 219
S C Y V YGDGS TSGY++ D+I + S A ++S + RQ +G+
Sbjct: 149 SLLESCTYKVEYGDGSVTSGYYISDLISFDTMSDWTYIAFRDNST-WHPWVRQGAIIGT- 206
Query: 220 TDAAVDGILGFGQANSSLLSQLAAAG-NVRKEFAHCLDVVKGGGIFAIGDVVSPKVKTTP 278
F S+ S +++ +F+H + V A+ D+ P
Sbjct: 207 ----------FPALCSTPCSTVSSQPLYYNPQFSHMMTV-------AVNDLRLP------ 243
Query: 279 MVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYD 332
+ S+ GTIIDSGTTL + P YD
Sbjct: 244 -----------------------IDPSVFSVAKGYGTIIDSGTTLVHFPGEAYD 274
>gi|357137832|ref|XP_003570503.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
Length = 564
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 82/281 (29%), Positives = 129/281 (45%), Gaps = 24/281 (8%)
Query: 65 ASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC-AGCSRCPTKSDL 123
+++ L + GN P G Y+T + +G P Y++ VDTGSDL W+ C A C+ C
Sbjct: 178 STVLLPIKGNVFPD--GQYYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPH- 234
Query: 124 GIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVR 183
L+ P+K + D C+ ++ C+ +C+Y + Y D SS+ G +
Sbjct: 235 ----PLYKPAKEKI---VPPRDLLCQELQGDQN-YCATCKQCDYEIEYADRSSSMGVLAK 286
Query: 184 DIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAA 243
D + + +G + +FGC Q G L +S A DGILG A SL SQLA+
Sbjct: 287 DDMHMIATNGGREKL----DFVFGCAYDQQGQLLTSP-AKTDGILGLSSAAISLPSQLAS 341
Query: 244 AGNVRKEFAHCL-DVVKGGGIFAIGDVVSPKV-KTTPMVPNMPH--YNVILEEVEVGGNP 299
G + F HC+ GGG +GD P+ T + P Y+ ++V G
Sbjct: 342 QGIISNVFGHCITKEPNGGGYMFLGDDYVPRWGMTWAPIRGGPDNLYHTEAQKVNYGDQQ 401
Query: 300 LDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFRF 340
L + G I DSG++ YLP +Y +++ ++
Sbjct: 402 LRMHGQ---AGSSIQVIFDSGSSYTYLPDEIYKKLVTAIKY 439
>gi|357448247|ref|XP_003594399.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355483447|gb|AES64650.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 452
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 93/294 (31%), Positives = 135/294 (45%), Gaps = 46/294 (15%)
Query: 56 DTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCS 115
+++ G +A I L+ +G +G Y+ K+GLG+PT Y + VDTGS W+ C C+
Sbjct: 79 SSKKVGPKLAGIPLK---SGLSMGSGNYYVKMGLGSPTKYYTMIVDTGSSFSWLQCQPCT 135
Query: 116 -RCPTKSDLGIKLTLFDPSKSSTSGEIAC----SDNFCRTTYNNRYPSCSPGVR-CEYVV 169
C + D +F+PS S T + C + T N P+CS C Y
Sbjct: 136 IYCHIQED-----PVFNPSASKTYKTVPCSSSQCSSLKSATLNE--PTCSKQSNACVYKA 188
Query: 170 TYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILG 229
+YGD S + GY +D++ L + SS ++GCG G G + DGI+G
Sbjct: 189 SYGDSSFSLGYLSQDVLTLTPSQ-------TLSSFVYGCGQDNQGLFGRT-----DGIIG 236
Query: 230 FGQANSSLLSQLAAAGNVRKEFAHCLDV------VKGGGIFAIGD---VVSPKVKTTPMV 280
S+LSQL +G F++CL G +IG S K TP++
Sbjct: 237 LANNELSMLSQL--SGKYGNAFSYCLPTSFSTPNSPKEGFLSIGTSSLTPSSSYKFTPLL 294
Query: 281 --PNMPH-YNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLY 331
PN P Y + LE + V G PL + S + TIIDSGT + LP +Y
Sbjct: 295 KNPNNPSLYFIDLESITVAGRPLGVAAS----SYKVPTIIDSGTVITRLPTPVY 344
>gi|224126751|ref|XP_002329464.1| predicted protein [Populus trichocarpa]
gi|222870144|gb|EEF07275.1| predicted protein [Populus trichocarpa]
Length = 439
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 105/338 (31%), Positives = 155/338 (45%), Gaps = 60/338 (17%)
Query: 34 EVENKFKAGGER----------ERTLSALKQ--HDTRRHGRM--MASIDLELGGNGHPSA 79
+V+N F+A + ER +K+ H +R M +AS + E+ P
Sbjct: 35 KVQNGFRAKLKHVDSGKNLTKFERIQHGVKRGRHRLQRFKAMALVASSNSEIDAPVLP-G 93
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRC---PTKSDLGIKLTLFDPSKSS 136
G + K+ +GTP + Y +DTGSDL+W C C++C PT +FDP KSS
Sbjct: 94 NGEFLMKLAIGTPPETYSAIMDTGSDLIWTQCKPCTQCFDQPTP--------IFDPKKSS 145
Query: 137 TSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLK 196
+ +++CS C + +CS G CEY+ YGD SST G + + + S
Sbjct: 146 SFSKLSCSSKLCEALPQS---TCSDG--CEYLYGYGDYSSTQGMLASETLTFGKVS---- 196
Query: 197 TAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLD 256
P V FGCG G S + G++G G+ SL+SQL +F++CL
Sbjct: 197 -VP---EVAFGCGEDNEG----SGFSQGSGLVGLGRGPLSLVSQLK-----EPKFSYCLT 243
Query: 257 VVK--GGGIFAIGDVVSPK-----VKTTPMVPNMPH---YNVILEEVEVGGNPLDLPTSL 306
V +G + S K +KTTP++ N Y + LE + VG L + S
Sbjct: 244 SVDDTKASTLLMGSLASVKASDSEIKTTPLIQNSAQPSFYYLSLEGISVGDTSLPIKKST 303
Query: 307 LGTGDE--RGTIIDSGTTLAYLPPMLYDLVLSQFRFWI 342
++ G IIDSGTT+ YL +DLV +F I
Sbjct: 304 FSLQEDGSGGLIIDSGTTITYLEQSAFDLVAKEFTSQI 341
>gi|359473000|ref|XP_002278677.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 458
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 97/287 (33%), Positives = 145/287 (50%), Gaps = 38/287 (13%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPS 133
+G + +G YF + LGTP + + DTGSDL+WV C+ C C T+ G + F
Sbjct: 80 SGASTGSGQYFVDLRLGTPPQKLLLVADTGSDLVWVKCSACRNC-TRHTPG---SAFLAR 135
Query: 134 KSSTSGEIACSDNFCRTTYNNRYPSCSPG---VRCEYVVTYGDGSSTSGYFVRDIIQLNQ 190
S+T C D+ C+ ++ C+ C Y +YGDGS TSG+F ++ LN
Sbjct: 136 HSTTFSPNHCYDSACQLVPLPKHHRCNHARLHSPCRYEYSYGDGSKTSGFFSKETTTLNT 195
Query: 191 ASGNLKTAPLNSSVIFGCGNRQSGD--LGSSTDAAVDGILGFGQANSSLLSQLAAA-GNV 247
+SG + A L + FGC R SG G+S + A G++G G+ SL SQL GN
Sbjct: 196 SSG--REAKLK-GIAFGCAFRISGPSVSGASFNGA-HGVMGLGRGPISLSSQLGHRFGN- 250
Query: 248 RKEFAHCL---DVVKGGGIFAI----GDVVSP---KVKTTPMV--PNMPHYNVI-LEEVE 294
+F++CL D+ + + + V+P +++ TP+ P P + I +E V
Sbjct: 251 --KFSYCLMDHDISPSPTSYLLIGSTQNDVAPGKRRMRFTPLHINPLSPTFYYIGIESVS 308
Query: 295 VGGNPLDLPTSL-----LGTGDERGTIIDSGTTLAYLPPMLYDLVLS 336
V G L + S+ LG G GTI+DSGTTL +LP Y +L+
Sbjct: 309 VDGIKLPINPSVWALDELGNG---GTIVDSGTTLTFLPEPAYLQILT 352
>gi|326504502|dbj|BAJ91083.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 537
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 104/339 (30%), Positives = 145/339 (42%), Gaps = 37/339 (10%)
Query: 16 VVHQWAVGGGGVMGNFVFEVENKFKAGGERERTLSALKQHDTRRH--GRMMASIDLE--- 70
VV +WA G + E E +AL +HD R H R +A D E
Sbjct: 40 VVRRWAEARGHPGAAWWAEAEGT-------PEYYAALHRHD-RAHLARRGLAEGDGEGLL 91
Query: 71 --LGGNGHPSATG-LYFTKVGLGTPTDEYYVQVDTGSDLLWV--NCAGCSRCPTKSDL-- 123
GN G L++ +V +GTP + V +DTGSDL WV +C C+ SDL
Sbjct: 92 TFASGNLTFRLEGSLHYAEVAVGTPNATFLVALDTGSDLFWVPCDCKQCAPIANASDLRG 151
Query: 124 GIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTY-GDGSSTSGYFV 182
G L + P KSSTS + C C N + + C Y V Y +S+SG V
Sbjct: 152 GPDLRPYSPGKSSTSKAVTCEHALCERP-NACAAAGNSSTSCPYTVRYVSANTSSSGVLV 210
Query: 183 RDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLA 242
D++ L++ + + + + V+ GCG Q+G AAVDG+LG G S+ S L
Sbjct: 211 EDVLHLSREAAGGASTAVTAPVVLGCGQVQTGAF--LDGAAVDGLLGLGMDKVSVPSVLH 268
Query: 243 AAGNVRKE-FAHCLDVVKGGGIFAIGDVVSPKVKTTPMV--PNMPHYNVILEEVEVGGNP 299
AAG V + F+ C G G GD TP P YN+ + + V G
Sbjct: 269 AAGLVASDSFSMCFS-PDGFGRINFGDSGRRGQAETPFTVRNTHPTYNISVTAMSVSGKE 327
Query: 300 LDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
+ E I+DSGT+ YL Y + + F
Sbjct: 328 V---------AAEFAAIVDSGTSFTYLNDPAYTELATGF 357
>gi|357143328|ref|XP_003572882.1| PREDICTED: uncharacterized protein LOC100846829 [Brachypodium
distachyon]
Length = 836
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 92/266 (34%), Positives = 131/266 (49%), Gaps = 39/266 (14%)
Query: 75 GHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSK 134
GH T Y V LGTP V+VDTGSD+ WV CA C+ + K LFDP+K
Sbjct: 492 GHSIGTLQYVVTVSLGTPGVAQTVEVDTGSDVSWVQCAPCAAPACYAQ---KDQLFDPAK 548
Query: 135 SSTSGEIACSDNFCR--TTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQAS 192
SS+ + C+ + C +TY + C+ G +C YVV+YGDGS+T+G + D + L A
Sbjct: 549 SSSYSAVPCAADACSELSTYGH---GCAAGSQCGYVVSYGDGSNTTGVYGSDTLTLTDAD 605
Query: 193 GNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAA--GNVRKE 250
+ +FGCG+ Q+G A +DG+L G+ SL SQ + A G V
Sbjct: 606 A-------VTGFLFGCGHAQAGLF-----AGIDGLLALGRKGMSLTSQTSGAYGGGV--- 650
Query: 251 FAHCLD-------VVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLD-L 302
F++CL + GG + + + T VP Y V+L + VGG L +
Sbjct: 651 FSYCLPPSPSSTGFLTLGGPSSASGFATTGLLTAWDVPTF--YMVMLTGIGVGGQQLSGV 708
Query: 303 PTSLLGTGDERGTIIDSGTTLAYLPP 328
P S GT++D+GT + LPP
Sbjct: 709 PASAFAG----GTVVDTGTVITRLPP 730
>gi|357469591|ref|XP_003605080.1| Aspartic proteinase Asp1 [Medicago truncatula]
gi|355506135|gb|AES87277.1| Aspartic proteinase Asp1 [Medicago truncatula]
Length = 425
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 87/279 (31%), Positives = 123/279 (44%), Gaps = 26/279 (9%)
Query: 63 MMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSD 122
+++S+ + GN +P GLY + +G P Y + +DTGSDL WV C G P K
Sbjct: 44 LISSLVYTIKGNVYPD--GLYTVSINIGNPPKPYELDIDTGSDLTWVQCDG-PDAPCKGC 100
Query: 123 LGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRY--PSCSP-GVRCEYVVTYGDGSSTSG 179
K L+ P+ + CSD C T + CS C Y V Y D +ST G
Sbjct: 101 TMPKDKLYKPNGKQV---VKCSDPICVATQSTHVLGQICSKQSPPCVYNVQYADHASTLG 157
Query: 180 YFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLS 239
VRD + + S + K PL V FGCG Q + + GILG G +S+LS
Sbjct: 158 VLVRDYMHIGSPSSSTKD-PL---VAFGCGYEQKFSGPTPPHSKPAGILGLGNGKTSILS 213
Query: 240 QLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSPK--VKTTPMVPNM--PHYNVILEEVEV 295
QL + G + HCL +GGG +GD P + TP++ + HYN ++
Sbjct: 214 QLTSIGFIHNVLGHCLS-AEGGGYLFLGDKFVPSSGIVWTPIIQSSLEKHYNTGPVDLFF 272
Query: 296 GGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLV 334
G P I DSG++ Y +Y +V
Sbjct: 273 NGKPT--------PAKGLQIIFDSGSSYTYFSSPVYTIV 303
>gi|242091325|ref|XP_002441495.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
gi|241946780|gb|EES19925.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
Length = 466
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 93/299 (31%), Positives = 130/299 (43%), Gaps = 37/299 (12%)
Query: 53 KQHDTRRHGRMMAS---IDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWV 109
+Q +RR +AS + L + + S TG YF K+ +GTP E+ + DTGSDL WV
Sbjct: 84 RQGGSRRVAAEVASSSAVSLPMSSGAY-SGTGQYFVKLRVGTPVQEFTLVADTGSDLTWV 142
Query: 110 NCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSC-SPGVRCEYV 168
CAG S P + +F P S + I CS + C+ +C SP C Y
Sbjct: 143 KCAGASP-PGR--------VFRPKTSRSWAPIPCSSDTCKLDVPFTLANCSSPASPCTYD 193
Query: 169 VTYGDGSSTSGYFVRDIIQLNQASGNL---KTAPLNSSVIFGCGNRQSGDLGSSTDAAVD 225
Y +GS+ + R I+ A+ L K A L V+ GC + G S D
Sbjct: 194 YRYKEGSAGA----RGIVGTESATIALPGGKVAQLK-DVVLGCSSSHDGQSFRSA----D 244
Query: 226 GILGFGQANSSLLSQLAAAGNVRKEFAHC----LDVVKGGGIFAIGDVVSPKVKTTP--- 278
G+L G A S +Q AA F++C L G A G P+ T
Sbjct: 245 GVLSLGNAKISFATQ--AAARFGGSFSYCLVDHLAPRNATGYLAFGPGQVPRTPATQTKL 302
Query: 279 -MVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLS 336
+ P MP Y V ++ + V G LD+P + G I+DSG TL L Y V++
Sbjct: 303 FLDPEMPFYGVKVDAIHVAGKALDIPAEVW-DAKSGGVILDSGNTLTVLAAPAYKAVVA 360
>gi|302756591|ref|XP_002961719.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
gi|300170378|gb|EFJ36979.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
Length = 357
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 84/274 (30%), Positives = 127/274 (46%), Gaps = 38/274 (13%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSG 139
+G YF ++G+G P YY+++DTGSD+ W+ CA CS C ++ D ++DPS SS+
Sbjct: 9 SGEYFARMGIGNPQRSYYLELDTGSDVTWIQCAPCSSCYSQVD-----PIYDPSNSSSYR 63
Query: 140 EIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAP 199
+ C C+ Y +C G+ C Y V YGD S++SG + L N TA
Sbjct: 64 RVYCGSALCQAL---DYSACQ-GMGCSYRVVYGDSSASSGDLGIESFYLGP---NSSTAM 116
Query: 200 LNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL---- 255
N + FGCG+ SG G S SQ+AA+ + F++CL
Sbjct: 117 RN--IAFGCGHSNSGLFRGEAGLLGM-----GGGTLSFFSQIAAS--IGPAFSYCLVDRY 167
Query: 256 -DVVKGGGIFAIGDVVSP-KVKTTPMVPNM---PHYNVILEEVEVGGNPLDLPTSLL--- 307
+ G P + TP++ N Y +L + VGG PL +P +
Sbjct: 168 SQLQSRSSPLIFGRTAIPFAARFTPLLKNPRINTFYYAVLTGISVGGTPLPIPPAQFALT 227
Query: 308 --GTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
GTG G I+DSGT++ + P Y ++ +R
Sbjct: 228 GNGTG---GAILDSGTSVTRVVPPAYAVLRDAYR 258
>gi|217074470|gb|ACJ85595.1| unknown [Medicago truncatula]
gi|388505166|gb|AFK40649.1| unknown [Medicago truncatula]
Length = 452
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 93/292 (31%), Positives = 134/292 (45%), Gaps = 46/292 (15%)
Query: 58 RRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCS-R 116
++ G +A I L+ +G +G Y+ K+GLG+PT Y + VDTGS W+ C C+
Sbjct: 81 KKVGPKLAGIPLK---SGLSMGSGNYYVKMGLGSPTKYYTMIVDTGSSFSWLQCQPCTIY 137
Query: 117 CPTKSDLGIKLTLFDPSKSSTSGEIAC----SDNFCRTTYNNRYPSCSPGVR-CEYVVTY 171
C + D +F+PS S T + C + T N P+CS C Y +Y
Sbjct: 138 CHIQED-----PVFNPSASKTYKTVPCSSSQCSSLKSATLNE--PTCSKQSNACVYKASY 190
Query: 172 GDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFG 231
GD S + GY +D++ L + SS ++GCG G G + DGI+G
Sbjct: 191 GDSSFSLGYLSQDVLTLTPSQ-------TLSSFVYGCGQDNQGLFGRT-----DGIIGLA 238
Query: 232 QANSSLLSQLAAAGNVRKEFAHCLDV------VKGGGIFAIGD---VVSPKVKTTPMV-- 280
S+LSQL +G F++CL G +IG S K TP++
Sbjct: 239 NNELSMLSQL--SGKYGNAFSYCLPTSFSTPNSPKEGFLSIGTSSLTPSSSYKFTPLLKN 296
Query: 281 PNMPH-YNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLY 331
PN P Y + LE + V G PL + S + TIIDSGT + LP +Y
Sbjct: 297 PNNPSLYFIDLESITVAGRPLGVAAS----SYKVPTIIDSGTVITRLPTPVY 344
>gi|343172996|gb|AEL99201.1| aspartyl protease family protein, partial [Silene latifolia]
Length = 584
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 77/242 (31%), Positives = 114/242 (47%), Gaps = 27/242 (11%)
Query: 89 LGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFC 148
+GTP E+ + VDTGS + +V C C +C D F P S T + C+ +
Sbjct: 2 IGTPPQEFALIVDTGSTVTYVPCNSCDQCGNHQD-----PKFQPDLSDTYHPVKCNPDCT 56
Query: 149 RTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGC 208
T N+ +C Y Y + SS+SG D++ S LK +FGC
Sbjct: 57 CDTEND---------QCTYERQYAEMSSSSGILGEDLVSFGNMS-ELKP----QRAVFGC 102
Query: 209 GNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVK-GGGIFAIG 267
N ++GDL S DGI+G G+ + S++ QL G + F+ C ++ GGG +G
Sbjct: 103 ENAETGDLFSQ---HADGIMGLGRGDLSIVDQLVEKGVINDSFSLCYGGMEVGGGAMVLG 159
Query: 268 DVVSPK--VKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAY 325
+ P V + P+YN+ L + V G LD+ + + GTI+DSGTT AY
Sbjct: 160 QISPPSDMVFSHSDPDRSPYYNIELRGLHVAGKKLDINPQVF--DGKHGTILDSGTTYAY 217
Query: 326 LP 327
LP
Sbjct: 218 LP 219
>gi|225430551|ref|XP_002283470.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 490
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 88/269 (32%), Positives = 123/269 (45%), Gaps = 32/269 (11%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGC-SRCPTKSDLGIKLTLFDPSKSSTS 138
+G Y VGLG+P + DTGSDL W C C C + + +FDPS S +
Sbjct: 144 SGNYVVTVGLGSPKRDLTFIFDTGSDLTWTQCEPCVGYCYQQRE-----HIFDPSTSLSY 198
Query: 139 GEIACSDNFCRT--TYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLK 196
++C C + P CS C Y + YGDGS + G+F R+ + +L
Sbjct: 199 SNVSCDSPSCEKLESATGNSPGCSSST-CLYGIRYGDGSYSIGFFAREKL-------SLT 250
Query: 197 TAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL- 255
+ + ++ FGCG G G + G+LG + SL+SQ A K F++CL
Sbjct: 251 STDVFNNFQFGCGQNNRGLFGGTA-----GLLGLARNPLSLVSQ--TAQKYGKVFSYCLP 303
Query: 256 --DVVKGGGIFAIGDVVSPKVKTTPMVPNMPH---YNVILEEVEVGGNPLDLPTSLLGTG 310
G F GD S VK TP N + Y + + + VG L +P S+ T
Sbjct: 304 SSSSSTGYLSFGSGDGDSKAVKFTPSEVNSDYPSFYFLDMVGISVGERKLPIPKSVFSTA 363
Query: 311 DERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
GTIIDSGT ++ LPP +Y V FR
Sbjct: 364 ---GTIIDSGTVISRLPPTVYSSVQKVFR 389
>gi|356522749|ref|XP_003530008.1| PREDICTED: lysine-specific histone demethylase 1 homolog 1-like
[Glycine max]
Length = 1336
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 102/325 (31%), Positives = 151/325 (46%), Gaps = 34/325 (10%)
Query: 30 NFVFEVENKFKAGGERERTLS--ALKQHD---TRRH---GRMMASID----LELGGNGHP 77
+F+F + KF G+++ L L Q + T+R G + ++D + GN +P
Sbjct: 131 SFLFPLFPKFGVLGQKDLKLQLGKLVQKEKFLTQRDVGDGSGVVAVDSSSVFPVSGNVYP 190
Query: 78 SATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC-AGCSRCPTKSDLGIKLTLFDPSKSS 136
GLYFT + +G P Y++ VDTGSDL W+ C A C C + + K P++S+
Sbjct: 191 D--GLYFTILRVGNPPKSYFLDVDTGSDLTWMQCDAPCRSCGKGAHVQYK-----PTRSN 243
Query: 137 TSGEIACSDNFCRTTYNNRYPSCSPG--VRCEYVVTYGDGSSTSGYFVRDIIQLNQASGN 194
++ D+ C N+ ++C+Y + Y D SS+ G VRD + L +G+
Sbjct: 244 V---VSSVDSLCLDVQKNQKNGHHDESLLQCDYEIQYADHSSSLGVLVRDELHLVTTNGS 300
Query: 195 LKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHC 254
LN V+FGCG Q G L +T A DGI+G +A SL QLA+ G ++ HC
Sbjct: 301 --KTKLN--VVFGCGYDQEG-LILNTLAKTDGIMGLSRAKVSLPYQLASKGLIKNVVGHC 355
Query: 255 L-DVVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEV-GGNPLDLPTSLLGTGDE 312
L + GGG +GD P VP L + E+ G N + G
Sbjct: 356 LSNDGAGGGYMFLGDDFVPYWGMN-WVPMAYTLTTDLYQTEILGINYGNRQLKFDGQSKV 414
Query: 313 RGTIIDSGTTLAYLPPMLY-DLVLS 336
DSG++ Y P Y DLV S
Sbjct: 415 GKVFFDSGSSYTYFPKEAYLDLVAS 439
>gi|356540371|ref|XP_003538663.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 374
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 85/272 (31%), Positives = 125/272 (45%), Gaps = 25/272 (9%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGE 140
G Y +V +GTP + Y DTGSDL W +C C++C + + +FDP KS++
Sbjct: 23 GHYLMEVSIGTPPFKIYGIADTGSDLTWTSCVPCNKCYKQRN-----PIFDPQKSTSYRN 77
Query: 141 IACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPL 200
I+C C CSP C Y Y + T G ++ I L+ G ++ PL
Sbjct: 78 ISCDSKLCHKLDTG---VCSPQKHCNYTYAYASAAITQGVLAQETITLSSTKG--ESVPL 132
Query: 201 NSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL----- 255
++FGCG+ +G D + GI+G G S +SQ+ ++ K F+ CL
Sbjct: 133 K-GIVFGCGHNNTGGF---NDREM-GIIGLGGGPVSFISQIGSSFG-GKRFSQCLVPFHT 186
Query: 256 DV-VKGGGIFAIGDVVSPK-VKTTPMVPNMPH--YNVILEEVEVGGNPLDLPTSLLGTGD 311
DV V G VS K V +TP+V Y V L + VG L S + +
Sbjct: 187 DVSVSSKMSLGKGSEVSGKGVVSTPLVAKQDKTPYFVTLLGISVGNTYLHFNGSSSQSVE 246
Query: 312 ERGTIIDSGTTLAYLPPMLYDLVLSQFRFWIA 343
+ +DSGT LP LYD +++Q R +A
Sbjct: 247 KGNVFLDSGTPPTILPTQLYDRLVAQVRSEVA 278
>gi|343172998|gb|AEL99202.1| aspartyl protease family protein, partial [Silene latifolia]
Length = 584
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 77/242 (31%), Positives = 114/242 (47%), Gaps = 27/242 (11%)
Query: 89 LGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFC 148
+GTP E+ + VDTGS + +V C C +C D F P S T + C+ +
Sbjct: 2 IGTPPQEFALIVDTGSTVTYVPCNSCDQCGNHQD-----PKFQPDLSDTYHPVKCNPDCT 56
Query: 149 RTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGC 208
T N+ +C Y Y + SS+SG D++ S LK +FGC
Sbjct: 57 CDTEND---------QCTYERQYAEMSSSSGILGEDLVSFGNMS-ELKP----QRAVFGC 102
Query: 209 GNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVK-GGGIFAIG 267
N ++GDL S DGI+G G+ + S++ QL G + F+ C ++ GGG +G
Sbjct: 103 ENAETGDLFSQ---HADGIMGLGRGDLSIVDQLVEKGVINDSFSLCYGGMEVGGGAMVLG 159
Query: 268 DVVSPK--VKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAY 325
+ P V + P+YN+ L + V G LD+ + + GTI+DSGTT AY
Sbjct: 160 QISPPSDMVFSHSDPDRSPYYNIELRGLHVAGKKLDINPQVF--DGKHGTILDSGTTYAY 217
Query: 326 LP 327
LP
Sbjct: 218 LP 219
>gi|356516413|ref|XP_003526889.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 88/272 (32%), Positives = 131/272 (48%), Gaps = 40/272 (14%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRC---PTKSDLGIKLTLFDPSKSST 137
G Y ++ +GTP Y +DTGSDL+W C C+RC PT +FDP KSS+
Sbjct: 106 GEYLIELAIGTPPVSYPAVLDTGSDLIWTQCKPCTRCYKQPTP--------IFDPKKSSS 157
Query: 138 SGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKT 197
+++C + C ++ +CS G CEYV +YGD S T G + ++ +
Sbjct: 158 FSKVSCGSSLCSALPSS---TCSDG--CEYVYSYGDYSMTQGVLATETFTFGKSKNKVSV 212
Query: 198 APLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL-- 255
++ FGCG GD G++G G+ SL+SQL + F++CL
Sbjct: 213 ----HNIGFGCGEDNEGD----GFEQASGLVGLGRGPLSLVSQLK-----EQRFSYCLTP 259
Query: 256 -DVVKGGGIF--AIGDVVSPK-VKTTPMVPN--MP-HYNVILEEVEVGGNPLDLPTSLLG 308
D K + ++G V K V TTP++ N P Y + LE + VG L + S
Sbjct: 260 IDDTKESVLLLGSLGKVKDAKEVVTTPLLKNPLQPSFYYLSLEAISVGDTRLSIEKSTFE 319
Query: 309 TGDE--RGTIIDSGTTLAYLPPMLYDLVLSQF 338
GD+ G IIDSGTT+ Y+ Y+ + +F
Sbjct: 320 VGDDGNGGVIIDSGTTITYVQQKAYEALKKEF 351
>gi|413953782|gb|AFW86431.1| hypothetical protein ZEAMMB73_038825 [Zea mays]
Length = 462
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 91/267 (34%), Positives = 118/267 (44%), Gaps = 35/267 (13%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCS-RCPTKSDLGIKLTLFDPSKSSTSGEI 141
+ VG GTP Y + DTGSD+ W+ C CS C + D +FDP+KS+T +
Sbjct: 120 FVVTVGFGTPAQTYTLMFDTGSDVSWIQCLPCSGHCYKQHD-----PIFDPTKSATYSAV 174
Query: 142 ACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLN 201
C C CS C Y V YGDGSST+G + + L A A
Sbjct: 175 PCGHPQCAAAGGK----CSSNGTCLYKVQYGDGSSTAGVLSHETLSLTSARALPGFA--- 227
Query: 202 SSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVK-G 260
FGCG GD G VDG++G G+ SL SQ AA+ + CL
Sbjct: 228 ----FGCGETNLGDFGD-----VDGLIGLGRGQLSLSSQAAASFGAAFSY--CLPSYNTS 276
Query: 261 GGIFAIGDVV----SPKVKTTPMVPNMPH---YNVILEEVEVGGNPLDLPTSLLGTGDER 313
G IG S V+ T M+ + Y V L + VGG L +P L
Sbjct: 277 HGYLTIGTTTPASGSDGVRYTAMIQKQDYPSFYFVDLVSIVVGGFVLPVPPILF---TRD 333
Query: 314 GTIIDSGTTLAYLPPMLYDLVLSQFRF 340
GT++DSGT L YLPP Y + +F+F
Sbjct: 334 GTLLDSGTVLTYLPPEAYTALRDRFKF 360
>gi|224072755|ref|XP_002303865.1| predicted protein [Populus trichocarpa]
gi|222841297|gb|EEE78844.1| predicted protein [Populus trichocarpa]
Length = 412
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 86/255 (33%), Positives = 127/255 (49%), Gaps = 41/255 (16%)
Query: 98 VQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRT----TYN 153
V VDTGSDL WV C C+RC + D +F+PSKS + + C+ CR+ T N
Sbjct: 79 VIVDTGSDLSWVQCQPCNRCYNQQD-----PVFNPSKSPSYRTVLCNSLTCRSLQLATGN 133
Query: 154 NRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQS 213
+ +P C YVV YGDGS TSG + + L + N + IFGCG +
Sbjct: 134 SGVCGSNPPT-CNYVVNYGDGSYTSGEVGMEHLNLGNTTVN--------NFIFGCGRKNQ 184
Query: 214 GDLGSSTDAAVDGILGFGQANSSLLSQLAAA-GNVRKEFAHCLDV--VKGGGIFAIGDVV 270
G G ++ G++G G+ + SL+SQ++ G V F++CL + G +G
Sbjct: 185 GLFGGAS-----GLVGLGRTDLSLISQISPMFGGV---FSYCLPTTEAEASGSLVMGGNS 236
Query: 271 SPKVKTTPMV-------PNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTL 323
S TTP+ P +P Y + L + VGG + P+ G +R IIDSGT +
Sbjct: 237 SVYKNTTPISYTRMIHNPLLPFYFLNLTGITVGGVEVQAPS----FGKDR-MIIDSGTVI 291
Query: 324 AYLPPMLYDLVLSQF 338
+ LPP +Y + ++F
Sbjct: 292 SRLPPSIYQALKAEF 306
>gi|356529585|ref|XP_003533370.1| PREDICTED: lysine-specific histone demethylase 1 homolog 1-like
[Glycine max]
Length = 1388
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 101/325 (31%), Positives = 151/325 (46%), Gaps = 34/325 (10%)
Query: 30 NFVFEVENKFKAGGERERTLS--ALKQHD---TRRH---GRMMASID----LELGGNGHP 77
+F+F + KF G+++ L L Q + T R G + ++D + GN +P
Sbjct: 129 SFLFPLFPKFGVLGQKDLKLQLGKLSQKEKFLTHRDDGDGSGVVAVDSSSVFPVSGNVYP 188
Query: 78 SATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC-AGCSRCPTKSDLGIKLTLFDPSKSS 136
GLYFT + +G P Y++ VDTGSDL W+ C A C C + + L+ P++S+
Sbjct: 189 D--GLYFTILRVGNPPKSYFLDVDTGSDLTWMQCDAPCISCGKGAHV-----LYKPTRSN 241
Query: 137 TSGEIACSDNFCRTTYNNRYPSCSPG--VRCEYVVTYGDGSSTSGYFVRDIIQLNQASGN 194
++ D C N+ ++C+Y + Y D SS+ G VRD + L +G+
Sbjct: 242 V---VSSVDALCLDVQKNQKNGHHDESLLQCDYEIQYADHSSSLGVLVRDELHLVTTNGS 298
Query: 195 LKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHC 254
LN V+FGCG Q+G L +T DGI+G +A SL QLA+ G ++ HC
Sbjct: 299 --KTKLN--VVFGCGYDQAG-LLLNTLGKTDGIMGLSRAKVSLPYQLASKGLIKNVVGHC 353
Query: 255 L-DVVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEV-GGNPLDLPTSLLGTGDE 312
L + GGG +GD P VP L + E+ G N + G
Sbjct: 354 LSNDGAGGGYMFLGDDFVPYWGMN-WVPMAYTLTTDLYQTEILGINYGNRQLRFDGQSKV 412
Query: 313 RGTIIDSGTTLAYLPPMLY-DLVLS 336
+ DSG++ Y P Y DLV S
Sbjct: 413 GKMVFDSGSSYTYFPKEAYLDLVAS 437
>gi|356500374|ref|XP_003519007.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase Asp1-like
[Glycine max]
Length = 454
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 73/220 (33%), Positives = 104/220 (47%), Gaps = 19/220 (8%)
Query: 55 HDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC-AG 113
+ H R+ +S +L GN +P G Y + +G P Y + +D+GSDL WV C A
Sbjct: 38 YSDNNHHRLSSSAVFKLQGNVYP--LGHYTVSLNIGYPPKLYDLDIDSGSDLTWVQCDAP 95
Query: 114 CSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSC-SPGVRCEYVVTYG 172
C C D L+ P+ + + C D C + + +C SP C+Y V Y
Sbjct: 96 CKGCTKPRD-----QLYKPNHNL----VQCVDQLCSEVHLSMAYNCPSPDDPCDYEVEYA 146
Query: 173 DGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQ 232
D S+ G VRD I +G++ + V FGCG Q GS++ A G+LG G
Sbjct: 147 DHGSSLGVLVRDYIPFQFTNGSV----VRPRVAFGCGYDQKYS-GSNSPPATSGVLGLGN 201
Query: 233 ANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSP 272
+S+LSQL + G +R HCL +GGG GD P
Sbjct: 202 GRASILSQLHSLGLIRNVVGHCLS-AQGGGFLFFGDDFIP 240
>gi|326494754|dbj|BAJ94496.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326514480|dbj|BAJ96227.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 449
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 85/261 (32%), Positives = 122/261 (46%), Gaps = 41/261 (15%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGE 140
G + + +GTP Y +DTGSDL+W C C C +S +FDPS SST
Sbjct: 100 GEFLMDMSIGTPAVAYAAIIDTGSDLVWTQCKPCVECFNQST-----PVFDPSSSSTYAA 154
Query: 141 IACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPL 200
+ CS C +++ S +C Y TYGD SST G + L +
Sbjct: 155 LPCSSTLCSDLPSSKCTS----AKCGYTYTYGDSSSTQGVLAAETFTLAKTK-------- 202
Query: 201 NSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL----D 256
V FGCG+ GD G + A G++G G+ SL+SQL +F++CL D
Sbjct: 203 LPDVAFGCGDTNEGD-GFTQGA---GLVGLGRGPLSLVSQLG-----LNKFSYCLTSLDD 253
Query: 257 VVKG----GGIFAI--GDVVSPKVKTTPMV--PNMPH-YNVILEEVEVGGNPLDLPTSLL 307
K G + I + V+TTP++ P+ P Y V L+ + VG + LP+S
Sbjct: 254 TSKSPLLLGSLATISESAAAASSVQTTPLIRNPSQPSFYYVNLKGLTVGSTHITLPSSAF 313
Query: 308 GTGDE--RGTIIDSGTTLAYL 326
D+ G I+DSGT++ YL
Sbjct: 314 AVQDDGTGGVIVDSGTSITYL 334
>gi|297807039|ref|XP_002871403.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297317240|gb|EFH47662.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 529
Score = 105 bits (261), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 91/265 (34%), Positives = 133/265 (50%), Gaps = 28/265 (10%)
Query: 82 LYFTKVGLGTPTDEYYVQVDTGSDLLWV--NCAGCSRCPTK--SDLGIK-LTLFDPSKSS 136
L++T + +GTP+ + V +DTGSDLLW+ NC C+ + S L K L ++PS SS
Sbjct: 99 LHYTWIDIGTPSVSFLVALDTGSDLLWIPCNCVQCAPLTSTYYSSLATKDLNEYNPSSSS 158
Query: 137 TSGEIACSDNFCRTTYNNRYPSC-SPGVRCEYVVTYGDG-SSTSGYFVRDIIQLNQASGN 194
+S CS C + + C SP +C Y V Y G +S+SG V DI+ L + N
Sbjct: 159 SSKVFLCSHKLCGSASD-----CDSPKEQCTYTVKYLSGNTSSSGLLVEDILHLTYNTNN 213
Query: 195 LK---TAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEF 251
++ + + V+ GCG +QSGD A DG++G G A S+ S L+ AG +R F
Sbjct: 214 RLMNGSSSVKARVVVGCGKKQSGDYLDG--VAPDGLMGLGPAEISVPSFLSKAGLMRNSF 271
Query: 252 AHCLDVVKGGGIFAIGDVVSPKVKTTPM--VPNMPHYNVILEEVEVGGNPLDLPTSLLGT 309
+ C D G I+ GD+ ++ P + N Y V +E +G + L TS
Sbjct: 272 SLCFDEEDSGRIY-FGDMGPSIQQSAPFLQLENNSGYIVGVEACCIGNSCLK-QTSF--- 326
Query: 310 GDERGTIIDSGTTLAYLPPMLYDLV 334
T IDSG + YLP +Y V
Sbjct: 327 ----TTFIDSGQSFTYLPEEIYRKV 347
>gi|255585473|ref|XP_002533429.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223526717|gb|EEF28949.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 448
Score = 105 bits (261), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 79/268 (29%), Positives = 125/268 (46%), Gaps = 35/268 (13%)
Query: 68 DLELGGNGHPSA-TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIK 126
D EL N PS L+ +G P +DTGS++LWV CA C RC ++
Sbjct: 85 DFEL--NLLPSTYEPLFLVNFSMGQPATPQLAIMDTGSNILWVRCAPCKRCTQQNG---- 138
Query: 127 LTLFDPSKSSTSGEIACSDNFCR---TTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVR 183
L DPSKSST + C++ C + Y NR +C Y ++Y G S++G
Sbjct: 139 -PLLDPSKSSTYASLPCTNTMCHYAPSAYCNRLN------QCGYNLSYATGLSSAGVLAT 191
Query: 184 DIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAA 243
+ + + + + P SV+FGC + G D G+ G G+ +S ++++ +
Sbjct: 192 EQLIFHSSDEGVNAVP---SVVFGCSHEN----GDYKDRRFTGVFGLGKGITSFVTRMGS 244
Query: 244 AGNVRKEFAHCL----DVVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNP 299
+F++CL D G G+ + + +TP+ HY V LE + VG
Sbjct: 245 ------KFSYCLGNIADPHYGYNQLVFGEKANFEGYSTPLKVVNGHYYVTLEGISVGEKR 298
Query: 300 LDL-PTSLLGTGDERGTIIDSGTTLAYL 326
LD+ T+ G+E+ +IDSGT L +L
Sbjct: 299 LDIDSTAFSMKGNEKSALIDSGTALTWL 326
>gi|356507437|ref|XP_003522473.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 440
Score = 105 bits (261), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 85/282 (30%), Positives = 127/282 (45%), Gaps = 30/282 (10%)
Query: 62 RMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC-AGCSRCPTK 120
R +S+ + GN +P G Y + +G P Y++ +DTGSDL W+ C A CSRC
Sbjct: 60 RAGSSVVFPVHGNVYP--VGFYNVTLNIGQPPRPYFLDIDTGSDLTWLQCDAPCSRCSQT 117
Query: 121 SDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGY 180
L+ PS + C C + + + C +C+Y V Y D S+ G
Sbjct: 118 PH-----PLYRPSND----LVPCRHALCASLHLSDNYDCEVPHQCDYEVQYADHYSSLGV 168
Query: 181 FVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQ 240
+ D+ LN +G L + GCG Q + +DG+LG G+ +SL SQ
Sbjct: 169 LLHDVYTLNFTNG----VQLKVRMALGCGYDQI--FPDPSHHPLDGMLGLGRGKTSLTSQ 222
Query: 241 LAAAGNVRKEFAHCLDVVKGGGIFAIGDVV-SPKVKTTPMVP-NMPHYNVI-LEEVEVGG 297
L + G VR HCL GG IF GDV S ++ TPM + HY+V E+ GG
Sbjct: 223 LNSQGLVRNVIGHCLSAQGGGYIF-FGDVYDSFRLTWTPMSSRDYKHYSVAGAAELLFGG 281
Query: 298 NPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
G G+ + D+G++ Y Y +++S +
Sbjct: 282 KK-------SGVGNLHA-VFDTGSSYTYFNSYAYQVLISWLK 315
>gi|224099307|ref|XP_002311432.1| predicted protein [Populus trichocarpa]
gi|222851252|gb|EEE88799.1| predicted protein [Populus trichocarpa]
Length = 458
Score = 105 bits (261), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 95/289 (32%), Positives = 132/289 (45%), Gaps = 35/289 (12%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPS 133
+G S +G YF + LG+P + DTGSDL WV C+ C T + + F
Sbjct: 74 SGASSGSGQYFVSIRLGSPPQTLLLVADTGSDLTWVRCSACK---TNCSIHPPGSTFLAR 130
Query: 134 KSSTSGEIACSDNFCRTTYNNRYPSCSP---GVRCEYVVTYGDGSSTSGYFVRDIIQLNQ 190
S+T C + C+ C+ C Y Y DGS TSG+F ++ LN
Sbjct: 131 HSTTFSPTHCFSSLCQLVPQPNPNPCNHTRLHSTCRYEYVYSDGSKTSGFFSKETTTLNT 190
Query: 191 ASGNLKTAPLNSSVIFGCGNRQSGD--LGSSTDAAVDGILGFGQANSSLLSQLAAAGNVR 248
+SG S+ FGCG SG +GSS + A G++G G+ S SQL
Sbjct: 191 SSGREMKL---KSIAFGCGFHASGPSLIGSSFNGA-SGVMGLGRGPISFASQLGR--RFG 244
Query: 249 KEFAHC-LDVV---KGGGIFAIGDVVSPK------VKTTPMV--PNMP-HYNVILEEVEV 295
+ F++C LD IGDVVS K + TP++ P P Y + ++ V V
Sbjct: 245 RSFSYCLLDYTLSPPPTSYLMIGDVVSTKKDNKSMMSFTPLLINPEAPTFYYISIKGVFV 304
Query: 296 GGNPLDLPTSL-----LGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
G L + S+ LG G GT+IDSGTTL +L Y +LS F+
Sbjct: 305 DGVKLHIDPSVWSLDELGNG---GTVIDSGTTLTFLTEPAYREILSAFK 350
>gi|242092898|ref|XP_002436939.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
gi|241915162|gb|EER88306.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
Length = 469
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 89/270 (32%), Positives = 123/270 (45%), Gaps = 39/270 (14%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGC--SRCPTKSDLGIKLTLFDPSKSSTSGE 140
Y +G GTP+ + +DTGSD+ WV C C ++C + D LFDPSKSST
Sbjct: 131 YVVTLGFGTPSVPQVLLMDTGSDVSWVQCTPCNSTKCYPQKD-----PLFDPSKSSTYAP 185
Query: 141 IACSDNFCRTTYNNRYPSC-SPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAP 199
IAC+ + CR ++ + C S G +C Y V Y DGS + G + + + L AP
Sbjct: 186 IACNTDACRKLGDHYHNGCTSGGTQCGYSVEYADGSHSRGVYSNETLTL---------AP 236
Query: 200 --LNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDV 257
FGCG Q G DG+LG G A SL+ Q ++ F++CL
Sbjct: 237 GITVEDFHFGCGRDQRG-----PSDKYDGLLGLGGAPVSLVVQTSSV--YGGAFSYCLPA 289
Query: 258 VKG-GGIFAIGDVVSPKVKT---TPMVPNMP----HYNVILEEVEVGGNPLDLPTSLLGT 309
+ G +G S TPM ++P Y V + + VGG PL +P S
Sbjct: 290 LNSEAGFLVLGSPPSGNKSAFVFTPMR-HLPGYATFYMVTMTGISVGGKPLHIPQSAF-- 346
Query: 310 GDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
G IIDSGT LP Y+ + + R
Sbjct: 347 --RGGMIIDSGTVDTELPETAYNALEAALR 374
>gi|225427550|ref|XP_002266461.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
Length = 439
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 87/265 (32%), Positives = 118/265 (44%), Gaps = 27/265 (10%)
Query: 77 PSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSS 136
PSA G Y + +GTP VDTGSDL W C C+ C + + LFDP SS
Sbjct: 87 PSA-GEYLMNLYIGTPPVPVIAIVDTGSDLTWTQCRPCTHCYKQV-----VPLFDPKNSS 140
Query: 137 TSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLK 196
T + +C +FC +R SCS +C + +Y DGS T G + + ++ +G
Sbjct: 141 TYRDSSCGTSFCLALGKDR--SCSKEKKCTFRYSYADGSFTGGNLASETLTVDSTAGKPV 198
Query: 197 TAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLD 256
+ P FGCG+ G D + GI+G G SL+SQL + + F++CL
Sbjct: 199 SFP---GFAFGCGHSSGGIF----DKSSSGIVGLGGGELSLISQLKS--TINGLFSYCLL 249
Query: 257 VVKGGGIF-------AIGDVVSPKVKTTPMVPNMP--HYNVILEEVEVGGNPLDLPTSLL 307
V A G V +TP+V P Y + LE + VG L
Sbjct: 250 PVSTDSSISSRINFGASGRVSGYGTVSTPLVQKSPDTFYYLTLEGISVGKKRLPYKGYSK 309
Query: 308 GTGDERGTII-DSGTTLAYLPPMLY 331
T E G II DSGTT +LP Y
Sbjct: 310 KTEVEEGNIIVDSGTTYTFLPQEFY 334
>gi|449451908|ref|XP_004143702.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
gi|449529900|ref|XP_004171936.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
Length = 459
Score = 104 bits (260), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 89/294 (30%), Positives = 137/294 (46%), Gaps = 45/294 (15%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPS 133
+G + +G YF + LGTP + DTGSDL+WV C+ C C + F P
Sbjct: 79 SGASTGSGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACRNCSHHP----PSSAFLPR 134
Query: 134 KSSTSGEIACSDNFCRTTYNNRYPSC------SPGVRCEYVVTYGDGSSTSGYFVRDIIQ 187
SS+ C D CR + + C SP C ++ +Y DGS +SG+F ++
Sbjct: 135 HSSSFSPFHCFDPHCRLLPHAPHHLCNHTRLHSP---CRFLYSYADGSLSSGFFSKETTT 191
Query: 188 LNQASG---NLKTAPLNSSVIFGCGNRQSG-DLGSSTDAAVDGILGFGQANSSLLSQLAA 243
L SG +LK + FGCG R SG + + G++G G+ + S SQL
Sbjct: 192 LKSLSGSEIHLK------GLSFGCGFRISGPSVSGAQFNGARGVMGLGRGSISFSSQLGR 245
Query: 244 A-GNVRKEFAHCLD-----------VVKGGGIFAIGDVVSPKVKTTPMV--PNMP-HYNV 288
GN +F++CL ++ GGG+ ++ + K+ TP+ P P Y +
Sbjct: 246 RFGN---KFSYCLMDYTLSPPPTSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFYYI 302
Query: 289 ILEEVEVGGNPLDLPTSLLGTGDER---GTIIDSGTTLAYLPPMLYDLVLSQFR 339
+ + + G L + ++ DE+ GT++DSGTTL YL Y+ VL R
Sbjct: 303 TIHSITIDGVKLPINPAVWEI-DEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVR 355
>gi|413953655|gb|AFW86304.1| hypothetical protein ZEAMMB73_151223 [Zea mays]
Length = 535
Score = 104 bits (260), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 82/266 (30%), Positives = 127/266 (47%), Gaps = 36/266 (13%)
Query: 71 LGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAG--CSRCPTKSDLGIKLT 128
L GN P GLY+T + LG+P Y++ VDTGS WV C C+ C +
Sbjct: 150 LAGNLFPE--GLYYTAISLGSPPRPYFLDVDTGSHTTWVQCDAPPCASCAKGAH-----P 202
Query: 129 LFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQL 188
L+ P++ T+ + SD C + +P +C+Y ++Y DGSS+ G +VRD +Q
Sbjct: 203 LYRPAR--TADALPASDPLCEGAQHE-----NPN-QCDYEISYADGSSSMGVYVRDSMQF 254
Query: 189 NQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVR 248
G + N+ ++FGCG Q G L ++ + DG+LG SL +QLA+ G +
Sbjct: 255 VGEDGERE----NADIVFGCGYDQQGVLLNALE-TTDGVLGLTNKALSLPTQLASRGIIS 309
Query: 249 KEFAHCL--DVVKGGGIFAIGDVVSPKVKTTPMVP--NMPHYNV---ILEEVEVGGNPLD 301
F HC+ D GG +GD P+ T VP + P +V ++++ G L+
Sbjct: 310 NAFGHCMSTDPSGAGGYLFLGDDYIPRWGMT-WVPIRDGPADDVRRAQVKQINHGDQQLN 368
Query: 302 LPTSLLGTGDERGTIIDSGTTLAYLP 327
G + D+G+T Y P
Sbjct: 369 ------AQGKLTQVVFDTGSTYTYFP 388
>gi|125563957|gb|EAZ09337.1| hypothetical protein OsI_31609 [Oryza sativa Indica Group]
gi|125605916|gb|EAZ44952.1| hypothetical protein OsJ_29595 [Oryza sativa Japonica Group]
Length = 438
Score = 104 bits (260), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 82/279 (29%), Positives = 126/279 (45%), Gaps = 40/279 (14%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGE 140
G Y VG+G+P + +DTGSDL+W CA C C + F+P+KS++
Sbjct: 83 GEYLMDVGIGSPPRYFSAMIDTGSDLIWTQCAPCLLCVEQ-----PTPYFEPAKSTSYAS 137
Query: 141 IACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPL 200
+ CS C Y+ P C C Y YGD +S++G + S +
Sbjct: 138 LPCSSAMCNALYS---PLCFQNA-CVYQAFYGDSASSAGVLANETFTFGTNSTRVAVP-- 191
Query: 201 NSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKG 260
V FGCGN +G L + + G++GFG+ SL+SQL + F++CL
Sbjct: 192 --RVSFGCGNMNAGTLFNGS-----GMVGFGRGALSLVSQLGS-----PRFSYCLTSFMS 239
Query: 261 G-------GIFAIGDVV----SPKVKTTPMV--PNMP-HYNVILEEVEVGGNPLDLPTSL 306
G +A + S V++TP + P +P Y + + + V G+ L + S+
Sbjct: 240 PATSRLYFGAYATLNSTNTSSSGPVQSTPFIVNPALPTMYFLNMTGISVAGDLLPIDPSV 299
Query: 307 LGTGDERGT---IIDSGTTLAYLPPMLYDLVLSQFRFWI 342
+ GT IIDSGTT+ +L Y +V F W+
Sbjct: 300 FAINETDGTGGVIIDSGTTVTFLAQPAYAMVQGAFVAWV 338
>gi|224066811|ref|XP_002302227.1| predicted protein [Populus trichocarpa]
gi|222843953|gb|EEE81500.1| predicted protein [Populus trichocarpa]
Length = 422
Score = 104 bits (260), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 88/289 (30%), Positives = 131/289 (45%), Gaps = 34/289 (11%)
Query: 57 TRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC-AGCS 115
T + R+ +S+ + GN +P TG Y + +G P + + +DTGSDL WV C A C
Sbjct: 44 TPANDRVGSSVFFRVTGNVYP--TGHYSVILNIGNPPKAFDLDIDTGSDLTWVQCDAPCK 101
Query: 116 RCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCS-PGVRCEYVVTYGDG 174
C D L+ P + + C+ + C+ NN +C P +C+Y V Y D
Sbjct: 102 GCTKPLD-----KLYKPKNN----RVPCASSLCQAIQNN---NCDIPTEQCDYEVEYADL 149
Query: 175 SSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQAN 234
S+ G + D L +G+L L + FGCG Q LG + GILG G+
Sbjct: 150 GSSLGVLLSDYFPLRLNNGSL----LQPRIAFGCGYDQKY-LGPHSPPDTAGILGLGRGK 204
Query: 235 SSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSPK--VKTTPMVPNMPH--YNVIL 290
+S+LSQL G + HC V GG +F GD + P + TPM+ + Y+
Sbjct: 205 ASILSQLRTLGITQNVVGHCFSRVTGGFLF-FGDHLLPPSGITWTPMLRSSSDTLYSSGP 263
Query: 291 EEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
E+ GG P + L I DSG++ Y +Y +L+ R
Sbjct: 264 AELLFGGKPTGIKGLQL--------IFDSGSSYTYFNAQVYQSILNLVR 304
>gi|115479485|ref|NP_001063336.1| Os09g0452400 [Oryza sativa Japonica Group]
gi|51535935|dbj|BAD38017.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113631569|dbj|BAF25250.1| Os09g0452400 [Oryza sativa Japonica Group]
gi|215693279|dbj|BAG88661.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 441
Score = 104 bits (260), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 82/279 (29%), Positives = 126/279 (45%), Gaps = 40/279 (14%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGE 140
G Y VG+G+P + +DTGSDL+W CA C C + F+P+KS++
Sbjct: 86 GEYLMDVGIGSPPRYFSAMIDTGSDLIWTQCAPCLLCVEQ-----PTPYFEPAKSTSYAS 140
Query: 141 IACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPL 200
+ CS C Y+ P C C Y YGD +S++G + S +
Sbjct: 141 LPCSSAMCNALYS---PLCFQNA-CVYQAFYGDSASSAGVLANETFTFGTNSTRVAVP-- 194
Query: 201 NSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKG 260
V FGCGN +G L + + G++GFG+ SL+SQL + F++CL
Sbjct: 195 --RVSFGCGNMNAGTLFNGS-----GMVGFGRGALSLVSQLGS-----PRFSYCLTSFMS 242
Query: 261 G-------GIFAIGDVV----SPKVKTTPMV--PNMP-HYNVILEEVEVGGNPLDLPTSL 306
G +A + S V++TP + P +P Y + + + V G+ L + S+
Sbjct: 243 PATSRLYFGAYATLNSTNTSSSGPVQSTPFIVNPALPTMYFLNMTGISVAGDLLPIDPSV 302
Query: 307 LGTGDERGT---IIDSGTTLAYLPPMLYDLVLSQFRFWI 342
+ GT IIDSGTT+ +L Y +V F W+
Sbjct: 303 FAINETDGTGGVIIDSGTTVTFLAQPAYAMVQGAFVAWV 341
>gi|357154085|ref|XP_003576664.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 509
Score = 104 bits (259), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 94/280 (33%), Positives = 121/280 (43%), Gaps = 48/280 (17%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSR--CPTKSDLGIKLTLFDPSKSST 137
TG Y VGLGTP + V DTGSDL WV C CS C + D LF PS SST
Sbjct: 151 TGNYVVSVGLGTPARDLTVVFDTGSDLSWVQCGPCSSGGCYKQQD-----PLFAPSDSST 205
Query: 138 SGEIACSDNFCRTTYNNRYPSC--SPG-VRCEYVVTYGDGSSTSGYFVRDIIQLNQASGN 194
+ C CR SC SPG RC Y V YGD S T G+ D + L
Sbjct: 206 FSAVRCGARECRARQ-----SCGGSPGDDRCPYEVVYGDKSRTQGHLGNDTLTLG----- 255
Query: 195 LKTAPLNSSV---------IFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAG 245
AP N+S +FGCG +G G + DG+ G G+ SL SQ AAG
Sbjct: 256 -TMAPANASAENDNKLPGFVFGCGENNTGLFGQA-----DGLFGLGRGKVSLSSQ--AAG 307
Query: 246 NVRKEFAHCLD--VVKGGGIFAIGDVV--SPKVKTTPMVPNM---PHYNVILEEVEVGGN 298
+ F++CL G ++G V + TPM+ Y V L + V G
Sbjct: 308 KFGEGFSYCLPSSSSSAPGYLSLGTPVPAPAHAQFTPMLNRTTTPSFYYVKLVGIRVAGR 367
Query: 299 PLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
+ + + + I+DSGT + L P Y + + F
Sbjct: 368 AIRVSSPRVAL----PLIVDSGTVITRLAPRAYRALRAAF 403
>gi|413953656|gb|AFW86305.1| hypothetical protein ZEAMMB73_151223 [Zea mays]
Length = 406
Score = 104 bits (259), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 83/265 (31%), Positives = 126/265 (47%), Gaps = 34/265 (12%)
Query: 71 LGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAG--CSRCPTKSDLGIKLT 128
L GN P GLY+T + LG+P Y++ VDTGS WV C C+ C +
Sbjct: 150 LAGNLFPE--GLYYTAISLGSPPRPYFLDVDTGSHTTWVQCDAPPCASCAKGAH-----P 202
Query: 129 LFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQL 188
L+ P++ T+ + SD C + P+ +C+Y ++Y DGSS+ G +VRD +Q
Sbjct: 203 LYRPAR--TADALPASDPLCEGAQHEN-PN-----QCDYEISYADGSSSMGVYVRDSMQF 254
Query: 189 NQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVR 248
G + N+ ++FGCG Q G L ++ + DG+LG SL +QLA+ G +
Sbjct: 255 VGEDGERE----NADIVFGCGYDQQGVLLNALE-TTDGVLGLTNKALSLPTQLASRGIIS 309
Query: 249 KEFAHCL--DVVKGGGIFAIGDVVSPKVKTTPMVP--NMPHYNVILEEVEV--GGNPLDL 302
F HC+ D GG +GD P+ T VP + P +V +V+ G+
Sbjct: 310 NAFGHCMSTDPSGAGGYLFLGDDYIPRWGMT-WVPIRDGPADDVRRAQVKQINHGD---- 364
Query: 303 PTSLLGTGDERGTIIDSGTTLAYLP 327
L G + D+G+T Y P
Sbjct: 365 -QQLNAQGKLTQVVFDTGSTYTYFP 388
>gi|15217764|ref|NP_176663.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|5042418|gb|AAD38257.1|AC006193_13 Hypothetical Protein [Arabidopsis thaliana]
gi|332196174|gb|AEE34295.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 431
Score = 104 bits (259), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 96/300 (32%), Positives = 138/300 (46%), Gaps = 33/300 (11%)
Query: 47 RTLSALKQHDTRRHGRMMASIDLELGGNGHP-----SATGLYFTKVGLGTPTDEYYVQVD 101
T S ++ RR R + P S G Y + +GTP D
Sbjct: 45 ETSSQRMRNAIRRSARSTLQFSNDDASPNSPQSFITSNRGEYLMNISIGTPPVPILAIAD 104
Query: 102 TGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSP 161
TGSDL+W C C C ++ LFDP +SST +++CS + CR + SCS
Sbjct: 105 TGSDLIWTQCNPCEDCYQQTS-----PLFDPKESSTYRKVSCSSSQCRALED---ASCST 156
Query: 162 GVR-CEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSST 220
C Y +TYGD S T G D + + +SG + N +I GCG+ +G
Sbjct: 157 DENTCSYTITYGDNSYTKGDVAVDTVTMG-SSGRRPVSLRN--MIIGCGHENTGTF---- 209
Query: 221 DAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL-DVVKGGGI-----FAIGDVVSPK- 273
D A GI+G G ++SL+SQL + + +F++CL G+ F +VS
Sbjct: 210 DPAGSGIIGLGGGSTSLVSQLRKS--INGKFSYCLVPFTSETGLTSKINFGTNGIVSGDG 267
Query: 274 VKTTPMVPNMP--HYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLY 331
V +T MV P +Y + LE + VG + +++ GTG E +IDSGTTL LP Y
Sbjct: 268 VVSTSMVKKDPATYYFLNLEAISVGSKKIQFTSTIFGTG-EGNIVIDSGTTLTLLPSNFY 326
>gi|242094534|ref|XP_002437757.1| hypothetical protein SORBIDRAFT_10g002060 [Sorghum bicolor]
gi|241915980|gb|EER89124.1| hypothetical protein SORBIDRAFT_10g002060 [Sorghum bicolor]
Length = 575
Score = 104 bits (259), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 103/348 (29%), Positives = 150/348 (43%), Gaps = 50/348 (14%)
Query: 16 VVHQWAV----GGGGVMGNFVFEVENKFKAGGERERTLSALKQHD----TRRHGRMMA-- 65
VV QW V GG GV G+ E G SAL +HD TRR G A
Sbjct: 41 VVRQWMVDARGGGHGVPGSSWLLPEEAPAVG--SPEYYSALLRHDRALFTRRRGLASAAD 98
Query: 66 --SIDLELG-GNGHPSATG--LYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTK 120
S L GN T L++ +V +GTP+ ++ V +DTGSDL W+ C C C
Sbjct: 99 GQSTTLTFADGNATRLDTYEYLHYAEVEVGTPSSKFLVALDTGSDLFWLPCE-CKLCAKN 157
Query: 121 SDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVR----CEYVVTYGDGSS 176
T++ PS SSTS + C C R +C+ + C Y V Y ++
Sbjct: 158 GS-----TMYSPSLSSTSKTVPCGHPLCE-----RPDACATAGKSSSSCPYEVKYVSANT 207
Query: 177 -TSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANS 235
+SG V D++ L G + + ++FGCG Q+G AA G++G G
Sbjct: 208 GSSGVLVEDVLHLVDGGGGGGGKAVQAPIVFGCGQVQTGAF--LRGAAAGGLMGLGLDKV 265
Query: 236 SLLSQLAAAGNVRKE-FAHCLDVVKGGGIFAIGDVVSPKVKTTPMVP----NMPHYNVIL 290
S+ S LA++G V + F+ C G G GD SP TP++ +YN+ +
Sbjct: 266 SVPSALASSGLVASDSFSMCFS-RDGVGRINFGDAGSPDQAETPLIAAGSLQPSYYNISV 324
Query: 291 EEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
+ V + + E ++DSGT+ YL Y + + F
Sbjct: 325 GAITVDSKAMAV---------EFTAVVDSGTSFTYLDDPAYTFLTTNF 363
>gi|224143371|ref|XP_002324933.1| predicted protein [Populus trichocarpa]
gi|222866367|gb|EEF03498.1| predicted protein [Populus trichocarpa]
Length = 463
Score = 103 bits (258), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 80/261 (30%), Positives = 122/261 (46%), Gaps = 33/261 (12%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y VG+GTP E + DTGS L+W C C C K+ +FDP+KS++ +
Sbjct: 132 YIVNVGIGTPKKEMPLIFDTGSGLIWTQCKPCKACYP------KVPVFDPTKSASFKGLP 185
Query: 143 CSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNS 202
CS C++ R SP +C Y+ Y D SS++G + I + + K
Sbjct: 186 CSSKLCQSI---RQGCSSP--KCTYLTAYVDNSSSTGTLATETISFSHLKYDFK------ 234
Query: 203 SVIFGCGNRQSGD-LGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGG 261
+++ GC ++ SG+ LG S GI+G ++ SL SQ A + K F++C+ G
Sbjct: 235 NILIGCSDQVSGESLGES------GIMGLNRSPISLASQTANIYD--KLFSYCIPSTPGS 286
Query: 262 -GIFAIGDVVSPKVKTTPMVPNMPH--YNVILEEVEVGGNPLDLPTSLLGTGDERGTIID 318
G G V V+ +P+ P Y++ + + VGG L + S + + ID
Sbjct: 287 TGHLTFGGKVPNDVRFSPVSKTAPSSDYDIKMTGISVGGRKLLIDASAF----KIASTID 342
Query: 319 SGTTLAYLPPMLYDLVLSQFR 339
SG L LPP Y + S FR
Sbjct: 343 SGAVLTRLPPKAYSALRSVFR 363
>gi|218185380|gb|EEC67807.1| hypothetical protein OsI_35373 [Oryza sativa Indica Group]
Length = 418
Score = 103 bits (258), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 79/282 (28%), Positives = 128/282 (45%), Gaps = 42/282 (14%)
Query: 71 LGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC-AGCSRCPTKSDLGIKLTL 129
L G+ +P TG Y+ + +G P Y++ VDTGSDL W+ C A C C + L
Sbjct: 47 LSGDVYP--TGHYYVTMNIGDPAKPYFLDVDTGSDLTWLQCDAPCQSCNK-----VPHPL 99
Query: 130 FDPSKSSTSGEIACSDNFCRTTYNNRYPS--CSPGVRCEYVVTYGDGSSTSGYFVRDIIQ 187
+ P+K+ + C+++ C ++ P+ C+ +C+Y + Y D +S+ G V D
Sbjct: 100 YRPTKNKL---VPCANSICTALHSGSSPNKKCTTQQQCDYQIKYTDKASSLGVLVTDSFS 156
Query: 188 LN-QASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGN 246
L + N++ S+ FGCG Q + A DG+LG G+ + SLLSQL G
Sbjct: 157 LPLRNKSNVR-----PSLSFGCGYDQQVGKNGAAPATTDGLLGLGRGSVSLLSQLKQQGI 211
Query: 247 VRKEFAHCLDVVKGGGIFAIGDVVSPKVKTT--PMVPNMP--HYN-----VILEEVEVGG 297
+ HCL GGG GD + P + T PMV + +Y+ + + +
Sbjct: 212 TKNVLGHCLS-TSGGGFLFFGDDMVPTSRVTWVPMVRSTSGNYYSPGSATLYFDRRSLST 270
Query: 298 NPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
P+++ + DSG+T Y Y +S +
Sbjct: 271 KPMEV-------------VFDSGSTYTYFSAQPYQATISAIK 299
>gi|125589905|gb|EAZ30255.1| hypothetical protein OsJ_14305 [Oryza sativa Japonica Group]
Length = 213
Score = 103 bits (258), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 51/97 (52%), Positives = 69/97 (71%), Gaps = 2/97 (2%)
Query: 244 AGNVRKEFAHCLDVVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVI-LEEVEVGGNPLDL 302
AG +K F+HCLD GGGIFAIG+VV PKVKTTP+V N Y+++ L+ + V G L L
Sbjct: 5 AGKTKKIFSHCLDSTNGGGIFAIGEVVEPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQL 64
Query: 303 PTSLLGTGDERGTIIDSGTTLAYLPPMLY-DLVLSQF 338
P ++ GT +GT IDSG+TL YLP ++Y +L+L+ F
Sbjct: 65 PANIFGTTKTKGTFIDSGSTLVYLPEIIYSELILAVF 101
>gi|297741705|emb|CBI32837.3| unnamed protein product [Vitis vinifera]
Length = 455
Score = 103 bits (258), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 87/262 (33%), Positives = 121/262 (46%), Gaps = 27/262 (10%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
+ + +G P YV +DTGSDL W+ C C C + D +++ +KS + E+
Sbjct: 106 FLANLSIGNPPTNVYVVLDTGSDLFWIQCEPCDVCYKQKD-----PIYNRTKSDSYTEML 160
Query: 143 CSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQL-NQASGNLKTAPLN 201
C++ C + R CS C Y +Y DGS TSG + + + S KTA
Sbjct: 161 CNEPPCLSL--GREGQCSDSGSCLYQTSYADGSRTSGLLSYEKVAFTSHYSDEDKTA--- 215
Query: 202 SSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHC---LDVV 258
V FGCG + + SS D V G+ G SL+SQL+A G V K FA+C L
Sbjct: 216 -QVGFGCGLQNLNFVTSSRDGGVLGL---GPGLVSLVSQLSAIGKVSKSFAYCFGNLSNP 271
Query: 259 KGGGIFAIGDVVSPKVKTTPMVPNMPHYNVIL------EEVEVGGNPLDLPTSLLGTGDE 312
GG GD TPMV +Y +L EE + N G+G
Sbjct: 272 NAGGFLVFGDATYLNGDMTPMVIAEFYYVNLLGIGLGVEEPRLDINSSSFERKPDGSG-- 329
Query: 313 RGTIIDSGTTLAYLPPMLYDLV 334
G IIDSG+TL+ PP +Y++V
Sbjct: 330 -GVIIDSGSTLSIFPPEVYEVV 350
>gi|167997964|ref|XP_001751688.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162696786|gb|EDQ83123.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 418
Score = 103 bits (258), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 95/278 (34%), Positives = 126/278 (45%), Gaps = 44/278 (15%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSG 139
+G YF LGTP ++ + VD+GSDLLWV C+ C +C + L+ PS SST
Sbjct: 61 SGQYFVDFFLGTPPQKFSLIVDSGSDLLWVQCSPCRQCYAQDS-----PLYVPSNSSTFS 115
Query: 140 EIACSDNFCRTTYNNRYPSCS---PGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLK 196
+ C + C C PG C Y Y D SS+ G F + ++ +
Sbjct: 116 PVPCLSSDCLLIPATEGFPCDFRYPGA-CAYEYLYADTSSSKGVFAYESATVDGVRID-- 172
Query: 197 TAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLA-AAGNVRKEFAHC- 254
V FGCG+ G AA G+LG GQ S SQ+ A GN +FA+C
Sbjct: 173 ------KVAFGCGSDNQGSF-----AAAGGVLGLGQGPLSFGSQVGYAYGN---KFAYCL 218
Query: 255 ---LDVVKGGGIFAIGDVVSPKV---KTTPMV--PNMPH-YNVILEEVEVGGNPLDLPTS 305
LD GD + + + TP+V P P Y V +E+V VGG L + S
Sbjct: 219 VNYLDPTSVSSSLIFGDELISTIHDMQYTPIVSNPKSPTLYYVQIEKVTVGGKSLPISDS 278
Query: 306 -----LLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
LLG G G+I DSGTTL Y P Y +L+ F
Sbjct: 279 AWEIDLLGNG---GSIFDSGTTLTYWFPSAYSHILAAF 313
>gi|115484513|ref|NP_001065918.1| Os11g0184800 [Oryza sativa Japonica Group]
gi|122221757|sp|Q0IU52.1|ASP1_ORYSJ RecName: Full=Aspartic proteinase Asp1; Short=OSAP1; Short=OsAsp1;
AltName: Full=Nucellin-like protein; Flags: Precursor
gi|33340111|gb|AAQ14543.1|AF308691_1 nucellin-like protein [Oryza sativa Japonica Group]
gi|33340113|gb|AAQ14544.1|AF308692_1 nucellin-like protein [Oryza sativa Japonica Group]
gi|62954898|gb|AAY23267.1| nucellin-like protein [Oryza sativa Japonica Group]
gi|77548967|gb|ABA91764.1| Aspartic proteinase Asp1 precursor, putative, expressed [Oryza
sativa Japonica Group]
gi|113644622|dbj|BAF27763.1| Os11g0184800 [Oryza sativa Japonica Group]
gi|215766817|dbj|BAG99045.1| unnamed protein product [Oryza sativa Japonica Group]
gi|385717694|gb|AFI71282.1| aspartic proteinase [Oryza sativa Japonica Group]
Length = 410
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 84/281 (29%), Positives = 127/281 (45%), Gaps = 30/281 (10%)
Query: 65 ASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC-AGCSRCPTKSDL 123
+++ LEL GN +P G +F + +G P Y++ +DTGS L W+ C A C+ C
Sbjct: 22 SAVVLELHGNVYP--IGHFFITMNIGDPAKSYFLDIDTGSTLTWLQCDAPCTNCNI---- 75
Query: 124 GIKLTLFDPSKSSTSGEIACSDNFCRTTYNN--RYPSCSPGVRCEYVVTYGDGSSTSGYF 181
+ L+ P+ + C+D+ C Y + + C +C+YV+ Y D SS+ G
Sbjct: 76 -VPHVLYKPTPKKL---VTCADSLCTDLYTDLGKPKRCGSQKQCDYVIQYVD-SSSMGVL 130
Query: 182 VRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQL 241
V D L+ ++G T P +++ FGCG Q G + VD ILG + +LLSQL
Sbjct: 131 VIDRFSLSASNG---TNP--TTIAFGCGYDQ-GKKNRNVPIPVDSILGLSRGKVTLLSQL 184
Query: 242 AAAGNVRKE-FAHCLDVVKGGGIFAIGDVVSPK--VKTTPMVPNMPHYNVILEEVEVGGN 298
+ G + K HC+ KGGG GD P V TPM +Y+ + N
Sbjct: 185 KSQGVITKHVLGHCIS-SKGGGFLFFGDAQVPTSGVTWTPMNREHKYYSPGHGTLHFDSN 243
Query: 299 PLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
+ + + I DSG T Y Y LS +
Sbjct: 244 SKAISAAPM------AVIFDSGATYTYFAAQPYQATLSVVK 278
>gi|6580159|emb|CAB62657.2| putative protein [Arabidopsis thaliana]
Length = 475
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 88/301 (29%), Positives = 131/301 (43%), Gaps = 35/301 (11%)
Query: 20 WAVGGGGVMGNFVFEVENKFKAGGERERTL-------------SALKQHDTRRHGRMMAS 66
W G F FEV + F ++ L L D GR +AS
Sbjct: 18 WGFERCEATGKFGFEVHHIFSDSVKQSLGLGDLVPEQGSLEYFKVLAHRDRLIRGRGLAS 77
Query: 67 IDLEL-----GGNGHPSAT---GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCP 118
+ E GGN S LY+ V +GTP + V +DTGSDL W+ C + C
Sbjct: 78 NNDETPITFDGGNLTVSVKLLGSLYYANVSVGTPPSSFLVALDTGSDLFWLPCNCGTTCI 137
Query: 119 TK-SDLG----IKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGD 173
D+G + L L+ P+ S+TS I CSD C + ++ S SP C Y ++Y +
Sbjct: 138 RDLEDIGVPQSVPLNLYTPNASTTSSSIRCSDKRC---FGSKKCS-SPSSICPYQISYSN 193
Query: 174 GSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQA 233
+ T G ++D++ L NL P+ ++V GCG +Q+G + +V+G+LG G
Sbjct: 194 STGTKGTLLQDVLHLATEDENL--TPVKANVTLGCGQKQTGLF--QRNNSVNGVLGLGIK 249
Query: 234 NSSLLSQLAAAGNVRKEFAHCLDVVKGG-GIFAIGDVVSPKVKTTPMVPNMPHYNVILEE 292
S+ S LA A F+ C V G G + GD + TP + P + E
Sbjct: 250 GYSVPSLLAKANITANSFSMCFGRVIGNVGRISFGDRGYTDQEETPFISVAPRRRPVDPE 309
Query: 293 V 293
+
Sbjct: 310 L 310
>gi|242092368|ref|XP_002436674.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
gi|241914897|gb|EER88041.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
Length = 461
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 89/285 (31%), Positives = 127/285 (44%), Gaps = 48/285 (16%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSG 139
T Y + +GTP + +DTGSDL+W CA C C L L DP+ SST
Sbjct: 89 TNEYLVHLAVGTPPRPVALTLDTGSDLVWTQCAPCRDC-----FHQGLPLLDPAASSTYA 143
Query: 140 EIACSDNFCRTTYNNRYPSCSPGVR---------CEYVVTYGDGSSTSGYFVRDIIQLNQ 190
+ C CR + SC G R C Y+ YGD S T G D
Sbjct: 144 ALPCGAPRCRAL---PFTSCGGGGRSSWGNGNRSCAYIYHYGDKSVTVGEIATDRFTFGG 200
Query: 191 ASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKE 250
+G+ + + FGCG+ G S+ GI GFG+ SL SQL NV
Sbjct: 201 DNGDGDSRLPTRRLTFGCGHFNKGVFQSNE----TGIAGFGRGRWSLPSQL----NV-TT 251
Query: 251 FAHCLD---------VVKGGG-----IFAIGDVVSPKVKTTPMV--PNMPH-YNVILEEV 293
F++C V GG +++ +S +V+TTP++ P+ P Y + L+ +
Sbjct: 252 FSYCFTSMFESKSSLVTLGGAPAAALLYSHAAHISGEVRTTPLLKNPSQPSLYFLSLKGI 311
Query: 294 EVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
VG L +P + L R TIIDSG ++ LP +Y+ V ++F
Sbjct: 312 SVGKTRLAVPEAKL-----RSTIIDSGASITTLPEAVYEAVKAEF 351
>gi|255563835|ref|XP_002522918.1| nucellin, putative [Ricinus communis]
gi|223537845|gb|EEF39461.1| nucellin, putative [Ricinus communis]
Length = 433
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 90/282 (31%), Positives = 131/282 (46%), Gaps = 30/282 (10%)
Query: 62 RMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC-AGCSRCPTK 120
R +S+ L GN +P+ G Y + +G P Y++ VDTGSDL W+ C A C +C
Sbjct: 52 RAGSSLVFPLHGNVYPA--GYYNVTLSIGQPAKPYFLDVDTGSDLTWLQCDAPCRQC--- 106
Query: 121 SDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGY 180
+ L+ PS + + C D C + +C +C+Y V Y DG S+ G
Sbjct: 107 --IEAPHPLYRPSNNL----VICEDPLCASLQPPGVHNCQDPDQCDYEVEYADGGSSLGV 160
Query: 181 FVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQ 240
V+D+ LN +G LN + GCG Q L ++ +DGILG G+ SS+ SQ
Sbjct: 161 LVKDVFVLNFTNGKR----LNPLLALGCGYDQ---LPGRSNHPLDGILGLGRGISSIPSQ 213
Query: 241 LAAAGNVRKEFAHCLDVVKGGGIFAIGDVV-SPKVKTTPMV-PNMPHYNVILEEVEVGGN 298
L++ G V HCL GG +F D+ S V TPM ++ HY+ E+ G
Sbjct: 214 LSSQGLVSNVIGHCLSGRGGGFLFFGEDIYDSSGVTWTPMSRDHLKHYSPGFAELIFDGK 273
Query: 299 PLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYD-LVLSQFR 339
+ L+ + DSG++ YL Y LV S R
Sbjct: 274 STGIRNLLV--------VFDSGSSYTYLNAQAYQHLVFSLKR 307
>gi|238010910|gb|ACR36490.1| unknown [Zea mays]
gi|413942664|gb|AFW75313.1| hypothetical protein ZEAMMB73_520329 [Zea mays]
Length = 481
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 96/306 (31%), Positives = 142/306 (46%), Gaps = 34/306 (11%)
Query: 41 AGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQV 100
AGGE +T L Q + G+ M+S + G +A G +K+ P V +
Sbjct: 108 AGGEDFQTNGNLLQVNYGNSGQPMSSEAQQSGVVNASAAGGGSRSKL----PGVIQTVVL 163
Query: 101 DTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCS 160
D+ SD+ WV C C P + + +DPS+S +S +CS C T Y +
Sbjct: 164 DSASDVPWVQCVPCPIPPCHPQVD---SFYDPSRSPSSAPFSCSSPTC--TALGPYANGC 218
Query: 161 PGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSST 220
+C+Y+V Y DGSSTSG ++ D++ L+ +GN S FGC + + G S
Sbjct: 219 ANNQCQYLVRYPDGSSTSGAYIADLLTLD--AGNAV-----SGFKFGCSHAEQG----SF 267
Query: 221 DAAVDGILGFGQANSSLLSQLAAA-GNVRKEFAHCLDVVKG-GGIFAIG--DVVSPKVKT 276
DA GI+ G SLLSQ A+ GN F++C+ G F +G S +
Sbjct: 268 DARAAGIMALGGGPESLLSQTASRYGNA---FSYCIPATASDSGFFTLGVPRRASSRYVV 324
Query: 277 TPMV---PNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDL 333
TPMV Y V+L + VGG L + ++ G+++DS T + LPP Y
Sbjct: 325 TPMVRFRQAATFYGVLLRTITVGGQRLGVAPAVFAA----GSVLDSRTAITRLPPTAYQA 380
Query: 334 VLSQFR 339
+ S FR
Sbjct: 381 LRSAFR 386
>gi|218192703|gb|EEC75130.1| hypothetical protein OsI_11317 [Oryza sativa Indica Group]
Length = 440
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 92/276 (33%), Positives = 129/276 (46%), Gaps = 49/276 (17%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y + +GTP + +DTGSDL+W C C+ C +S L +D S+SST +
Sbjct: 91 YLLHLAIGTPPQPVQLTLDTGSDLVWTQCQPCAVCFNQS-----LPYYDASRSSTFALPS 145
Query: 143 CSDNFCRTTYNNRYPSCSPGVR-----CEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKT 197
C C+ PS + V C + +YGD S+T G+ D+ ++ +G +
Sbjct: 146 CDSTQCKLD-----PSVTMCVNQTVQTCAFSYSYGDKSATIGFL--DVETVSFVAG--AS 196
Query: 198 APLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDV 257
P V+FGCG +G S+ GI GFG+ SL SQL GN F+HC
Sbjct: 197 VP---GVVFGCGLNNTGIFRSNE----TGIAGFGRGPLSLPSQL-KVGN----FSHCFTA 244
Query: 258 VKGGGIFAI-----GDVVSP---KVKTTPMVPNMPH---YNVILEEVEVGGNPLDLPTSL 306
V G + D+ V+TTP++ N H Y + L+ + VG L +P S
Sbjct: 245 VSGRKPSTVLFDLPADLYKNGRGTVQTTPLIKNPAHPTFYYLSLKGITVGSTRLPVPESA 304
Query: 307 L----GTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
GTG GTIIDSGT LPP +Y LV +F
Sbjct: 305 FALKNGTG---GTIIDSGTAFTSLPPRVYRLVHDEF 337
>gi|413938607|gb|AFW73158.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 478
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 93/270 (34%), Positives = 124/270 (45%), Gaps = 29/270 (10%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y LGTP ++VDTGSDL WV C CS P S K LFDP++SS+ +
Sbjct: 140 YVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCSAAP--SCYSQKDPLFDPAQSSSYAAVP 197
Query: 143 CSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNS 202
C C S +C YVV+YGDGS+T+G + D + L+ +S
Sbjct: 198 CGGPVC-AGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLSASSA-------VQ 249
Query: 203 SVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKG-G 261
FGCG+ QSG VDG+LG G+ SL+ Q AG F++CL
Sbjct: 250 GFFFGCGHAQSGLFN-----GVDGLLGLGREQPSLVEQ--TAGTYGGVFSYCLPTKPSTA 302
Query: 262 GIFAIG----DVVSPKVKTTPMV--PNMP-HYNVILEEVEVGGNPLDLPTSLLGTGDERG 314
G +G +P TT ++ PN P +Y V+L + VGG L +P S G
Sbjct: 303 GYLTLGVGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTVVD 362
Query: 315 TIIDSGTTLAYLPPMLYDLVLSQFRFWIAS 344
T T + LPP Y + S FR +AS
Sbjct: 363 TG----TVVTRLPPTAYAALRSAFRSGMAS 388
>gi|242050026|ref|XP_002462757.1| hypothetical protein SORBIDRAFT_02g031460 [Sorghum bicolor]
gi|241926134|gb|EER99278.1| hypothetical protein SORBIDRAFT_02g031460 [Sorghum bicolor]
Length = 523
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 87/264 (32%), Positives = 123/264 (46%), Gaps = 27/264 (10%)
Query: 82 LYFTKVGLGTPTDEYYVQVDTGSDLLWV--NCAGCSRCPTKSDLGIKLTLFDPSKSSTSG 139
L++ V LGTP + V +DTGSDL WV +C C+ + + +K + P KSSTS
Sbjct: 103 LHYAVVALGTPNVTFLVALDTGSDLFWVPCDCINCAPLVSPNYRDLKFDTYSPQKSSTSR 162
Query: 140 EIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLK--T 197
++ CS N C R S S EY+ D +S++G V D++ L G K T
Sbjct: 163 KVPCSSNLCDLQSACRSASSSCPYSIEYL---SDNTSSTGVLVEDVLYLITEYGQPKIVT 219
Query: 198 APLNSSVIFGCGNRQSGD-LGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLD 256
AP + FGCG Q+G LGS AA +G+LG G + S+ S LA+ G F+ C
Sbjct: 220 AP----ITFGCGRIQTGSFLGS---AAPNGLLGLGMDSISVPSLLASEGVAANSFSMCFG 272
Query: 257 VVKGGGIFAIGDVVSPKVKTTPM--VPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERG 314
G G GD S + TP+ P+YN+ + VG +
Sbjct: 273 -DDGRGRINFGDTGSSDQQETPLNIYKQNPYYNISITGAMVGSKSFNT---------NFN 322
Query: 315 TIIDSGTTLAYLPPMLYDLVLSQF 338
I+DSGT+ L +Y + S F
Sbjct: 323 AIVDSGTSFTALSDPMYSEITSSF 346
>gi|224286173|gb|ACN40797.1| unknown [Picea sitchensis]
Length = 383
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 97/314 (30%), Positives = 144/314 (45%), Gaps = 51/314 (16%)
Query: 35 VENKFKAGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTD 94
++ + ER L +T + + + ++G +G Y ++ +GTP
Sbjct: 1 MKRAIQRSQERLEKLQITSAVNTHQMKDIETPVTPDIG-------SGEYLIQMAIGTPAL 53
Query: 95 EYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNN 154
+DTGSDL+W C C+ C T S SST ++ C + C+
Sbjct: 54 SLSAIMDTGSDLVWTKCNPCTDCSTSSIYDPS-------SSSTYSKVLCQSSLCQPP--- 103
Query: 155 RYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSG 214
SC+ CEYV YGD SSTSG + ++ S P ++ FGCG+ G
Sbjct: 104 SIFSCNNDGDCEYVYPYGDRSSTSGILSDETFSISSQS-----LP---NITFGCGHDNQG 155
Query: 215 DLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL----DVVKGGGIFAIGDVV 270
V G++GFG+ + SL+SQL + + +F++CL D K +F IG+
Sbjct: 156 ------FDKVGGLVGFGRGSLSLVSQLGPS--MGNKFSYCLVSRTDSSKTSPLF-IGNTA 206
Query: 271 SPKVKT---TPMVPNMP--HYNVILEEVEVGGNPLDLPTSLLGTGDER-----GTIIDSG 320
S + T TP+V + HY + LE + VGG L +PT GT D + G IIDSG
Sbjct: 207 SLEATTVGSTPLVQSSSTNHYYLSLEGISVGGQSLAIPT---GTFDIQSDGSGGLIIDSG 263
Query: 321 TTLAYLPPMLYDLV 334
TTL +L YD V
Sbjct: 264 TTLTFLQQTAYDAV 277
>gi|242095586|ref|XP_002438283.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
gi|241916506|gb|EER89650.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
Length = 470
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 86/259 (33%), Positives = 116/259 (44%), Gaps = 33/259 (12%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGC-SRCPTKSDLGIKLTLFDPSKSSTSG 139
G Y T++GLGTP Y + VDTGS L W+ C+ C C +G L+DP SST
Sbjct: 132 GNYVTELGLGTPATSYAMVVDTGSSLTWLQCSPCVVSC--HRQVG---PLYDPRASSTYA 186
Query: 140 EIACSDNFC----RTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNL 195
+ CS + C T N +CS C Y +YGD S + GY RD + S
Sbjct: 187 TVPCSASQCDELQAATLNPS--ACSVRNVCIYQASYGDSSFSVGYLSRDTVSFGSGS--- 241
Query: 196 KTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL 255
+ +GCG G G S G++G + SLL QLA ++ F++CL
Sbjct: 242 -----YPNFYYGCGQDNEGLFGRSA-----GLIGLARNKLSLLYQLAP--SLGYSFSYCL 289
Query: 256 DVVKGGGIFAIGDVVSPKVKTTPMVP---NMPHYNVILEEVEVGGNPLDLPTSLLGTGDE 312
G +IG S TPM + Y V L + VGG+PL + +
Sbjct: 290 PTPASTGYLSIGPYTSGHYSYTPMASSSLDASLYFVTLSGMSVGGSPLAVSPAEY---SS 346
Query: 313 RGTIIDSGTTLAYLPPMLY 331
TIIDSGT + LP +Y
Sbjct: 347 LPTIIDSGTVITRLPTAVY 365
>gi|356509401|ref|XP_003523438.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 407
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 66/193 (34%), Positives = 92/193 (47%), Gaps = 18/193 (9%)
Query: 65 ASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC-AGCSRCPTKSDL 123
+SI ++ GN +P G Y + +G P Y + +DTGSDL WV C A C C D
Sbjct: 32 SSIAFQIKGNVYP--LGYYSVNLAIGNPPKAYELDIDTGSDLTWVQCDAPCKGCTLPRDR 89
Query: 124 GIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSC-SPGVRCEYVVTYGDGSSTSGYFV 182
K + C D C + P C +P +C+Y V Y D S+ G V
Sbjct: 90 QYK---------PHGNLVKCVDPLCAAIQSAPNPPCVNPNEQCDYEVEYADQGSSLGVLV 140
Query: 183 RDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLA 242
RDII L +G L +S + FGCG Q+ +G + + G+LG G +S+LSQL
Sbjct: 141 RDIIPLKLTNGTLT----HSMLAFGCGYDQT-HVGHNPPPSAAGVLGLGNGRASILSQLN 195
Query: 243 AAGNVRKEFAHCL 255
+ G +R HCL
Sbjct: 196 SKGLIRNVVGHCL 208
>gi|356515904|ref|XP_003526637.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 421
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 85/271 (31%), Positives = 123/271 (45%), Gaps = 41/271 (15%)
Query: 67 IDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC-AGCSRCPTKSDLGI 125
+ ++ GN +P G Y + +G P Y + +DTGSDL WV C A C C +
Sbjct: 50 VAFQIKGNVYP--LGYYTVSLAIGNPPKVYDLDIDTGSDLTWVQCDAPCQGCTIPRN--- 104
Query: 126 KLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCS-PGVRCEYVVTYGDGSSTSGYFVRD 184
L+ P+ + C D C+ + C+ P +C+Y V Y D S+ G +RD
Sbjct: 105 --RLYKPN----GNLVKCGDPLCKAIQSAPNHHCAGPNEQCDYEVEYADQGSSLGVLLRD 158
Query: 185 IIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAA 244
I L +G+L L FGCG Q +G + A+ G+LG G +S+LSQL +
Sbjct: 159 NIPLKFTNGSLARPIL----AFGCGYDQK-HVGHNPSASTAGVLGLGNGKTSILSQLHSL 213
Query: 245 GNVRKEFAHCLDVVKGGGIFAIGDVVSPK--VKTTPMV--PNMPHYNVILEEVEVGGNPL 300
G +R HCL +GGG GD + P+ V TP++ + HY P
Sbjct: 214 GLIRNVVGHCLS-ERGGGFLFFGDQLVPQSGVVWTPLLQSSSTQHYKT---------GPA 263
Query: 301 DL-----PTSLLGTGDERGTIIDSGTTLAYL 326
DL PTS+ G I DSG++ Y
Sbjct: 264 DLFFDRKPTSVKGL----QLIFDSGSSYTYF 290
>gi|357143657|ref|XP_003573000.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Brachypodium distachyon]
Length = 464
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 82/264 (31%), Positives = 115/264 (43%), Gaps = 37/264 (14%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y V +G+P + +DTGSD+ W+ C K L+DP SST +
Sbjct: 131 YVITVSIGSPAVAXTMFIDTGSDVSWLRC--------------KSRLYDPGTSSTYAPFS 176
Query: 143 CSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNS 202
CS C R CS G C Y V YGDGS+T+G + D + L S PL S
Sbjct: 177 CSAPAC-AQLGRRGTGCSSGSTCVYSVKYGDGSNTTGTYGSDTLTLAGTS-----EPLIS 230
Query: 203 SVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVV-KGG 261
FGC + G +T DG++G G S +SQ AA F++CL
Sbjct: 231 GFQFGCSAVEHGFEEDNT----DGLMGLGGDAQSFVSQTAA--TYGSAFSYCLPPTWNSS 284
Query: 262 GIFAIGDVVSPKVKTTPMVPNM------PHYNVILEEVEVGGNPLDLPTSLLGTGDERGT 315
G +G S P + Y ++L + VGG L++P+S+ G+
Sbjct: 285 GFLTLGAPSSSTSAAFSTTPMLRSKQAATFYGLLLRGISVGGKTLEIPSSVF----SAGS 340
Query: 316 IIDSGTTLAYLPPMLYDLVLSQFR 339
I+DSGT + LPP Y + + FR
Sbjct: 341 IVDSGTVITRLPPTAYGALSAAFR 364
>gi|15229663|ref|NP_190574.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|6522926|emb|CAB62113.1| putative protein [Arabidopsis thaliana]
gi|53828539|gb|AAU94379.1| At3g50050 [Arabidopsis thaliana]
gi|55733749|gb|AAV59271.1| At3g50050 [Arabidopsis thaliana]
gi|332645100|gb|AEE78621.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 632
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 86/273 (31%), Positives = 122/273 (44%), Gaps = 39/273 (14%)
Query: 60 HGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPT 119
H RM DL + G Y T++ +GTP + + VD+GS + +V C+ C +C
Sbjct: 78 HSRMRLYDDLLING--------YYTTRLWIGTPPQMFALIVDSGSTVTYVPCSDCEQCGK 129
Query: 120 KSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSG 179
D F P SST + C+ + C + +C Y Y + SS+ G
Sbjct: 130 HQD-----PKFQPEMSSTYQPVKCNMD-CNCDDDRE--------QCVYEREYAEHSSSKG 175
Query: 180 YFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLS 239
D+I S + P +FGC ++GDL S DGI+G GQ + SL+
Sbjct: 176 VLGEDLISFGNES---QLTP--QRAVFGCETVETGDLYSQ---RADGIIGLGQGDLSLVD 227
Query: 240 QLAAAGNVRKEFAHC---LDVVKGGGIFAIG--DVVSPKVKTTPMVPNMPHYNVILEEVE 294
QL G + F C +DV GGG +G D S V T P+YN+ L +
Sbjct: 228 QLVDKGLISNSFGLCYGGMDV--GGGSMILGGFDYPSDMVFTDSDPDRSPYYNIDLTGIR 285
Query: 295 VGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLP 327
V G L L + + E G ++DSGTT AYLP
Sbjct: 286 VAGKQLSLHSRVF--DGEHGAVLDSGTTYAYLP 316
>gi|414584780|tpg|DAA35351.1| TPA: hypothetical protein ZEAMMB73_696016 [Zea mays]
Length = 524
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 86/286 (30%), Positives = 123/286 (43%), Gaps = 47/286 (16%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPS 133
+G +G Y +V +G+P E Y+ VD+GSD++WV C C C ++D LFDP+
Sbjct: 162 SGLDEGSGEYLVRVSVGSPPTEQYLVVDSGSDVMWVQCKPCLECYVQAD-----PLFDPA 216
Query: 134 KSSTSGEIACSDNFCRTTYNNRYPSCSPGVR--CEYVVTYGDGSSTSGYFVRDIIQLNQA 191
S+T ++C CR + +C G CEY V+Y DGS T G + + L
Sbjct: 217 TSATFSGVSCGSAICRILPTS---ACGDGELGGCEYEVSYADGSYTKGALALETLTLGGT 273
Query: 192 SGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEF 251
+ V+ GCG+R G G++G G SL+ QL G V F
Sbjct: 274 A--------VEGVVIGCGHRNRGLF-----VGAAGLMGLGWGPMSLVGQL--GGEVGGAF 318
Query: 252 AHCLDVVKGGG-----------IFAIGDVVSPKVKTTPMV--PNMPH-YNVILEEVEVGG 297
++CL G G + + V P+V P P Y V L +EVG
Sbjct: 319 SYCLASRGGYGSGAADDDAGWLVLGRSEAVPEGAVWVPLVRNPRAPSFYYVGLSGIEVGD 378
Query: 298 NPLDLPTSLL-----GTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
L L L G GD ++D+GTT+ LP Y + F
Sbjct: 379 ERLPLQAGLFQLTEDGAGD---VVMDTGTTVTRLPQEAYAALRDAF 421
>gi|356495496|ref|XP_003516613.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 645
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 82/285 (28%), Positives = 131/285 (45%), Gaps = 45/285 (15%)
Query: 52 LKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC 111
LK+ D+ H + +L NG+ Y ++ +GTP + + VDTGS + +V C
Sbjct: 68 LKESDSEHHPNARMRLYDDLLRNGY------YTARLWIGTPPQRFALIVDTGSTVTYVPC 121
Query: 112 AGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTY 171
+ C C + D F P S T + C+ C + + +C Y Y
Sbjct: 122 STCRHCGSHQD-----PKFRPEDSETYQPVKCTWQ-CNCDNDRK--------QCTYERRY 167
Query: 172 GDGSSTSGYFVRDIIQL-NQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGF 230
+ S++SG D++ NQ + + A IFGC N ++GD+ + DGI+G
Sbjct: 168 AEMSTSSGALGEDVVSFGNQTELSPQRA------IFGCENDETGDI---YNQRADGIMGL 218
Query: 231 GQANSSLLSQLAAAGNVRKEFAHCLDVVKG-------GGIFAIGDVVSPKVKTTPMVPNM 283
G+ + S++ QL + F+ C + GGI D+V ++ P+
Sbjct: 219 GRGDLSIMDQLVEKKVISDSFSLCYGGMGVGGGAMVLGGISPPADMVF--TRSDPV--RS 274
Query: 284 PHYNVILEEVEVGGNPLDL-PTSLLGTGDERGTIIDSGTTLAYLP 327
P+YN+ L+E+ V G L L P G + GT++DSGTT AYLP
Sbjct: 275 PYYNIDLKEIHVAGKRLHLNPKVFDG---KHGTVLDSGTTYAYLP 316
>gi|224082314|ref|XP_002306645.1| predicted protein [Populus trichocarpa]
gi|222856094|gb|EEE93641.1| predicted protein [Populus trichocarpa]
Length = 410
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 85/292 (29%), Positives = 129/292 (44%), Gaps = 31/292 (10%)
Query: 54 QHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC-A 112
+ T + R+ +S+ + GN +P TG Y + +G P + +DTGSDL WV C A
Sbjct: 27 ESSTPANDRVGSSVFFRVTGNVYP--TGYYSVILNIGNPPKAFDFDIDTGSDLTWVQCDA 84
Query: 113 GCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSC-SPGVRCEYVVTY 171
C C D L+ P + + CS++ C+ C +P +C+Y + Y
Sbjct: 85 PCKGCTKPRD-----KLYKPKNN----LVPCSNSLCQAVSTGENYHCDAPDDQCDYEIEY 135
Query: 172 GDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFG 231
D S+ G + D L ++G L L + FGCG Q LG GILG G
Sbjct: 136 ADLGSSIGVLLSDSFPLRLSNGTL----LQPKMAFGCGYDQK-HLGPHPPPDTAGILGLG 190
Query: 232 QANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSP--KVKTTPMVPNMPH--YN 287
+ S+LSQL G + HC +GG +F GD + P ++ TPM+ + Y+
Sbjct: 191 RGKVSILSQLRTLGITQNVVGHCFSRARGGFLF-FGDHLFPSSRITWTPMLRSSSDTLYS 249
Query: 288 VILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
E+ GG P + L I DSG++ Y +Y +L+ R
Sbjct: 250 SGPAELLFGGKPTGIKGLQL--------IFDSGSSYTYFNAQVYQSILNLVR 293
>gi|357481205|ref|XP_003610888.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512223|gb|AES93846.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 413
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 86/271 (31%), Positives = 131/271 (48%), Gaps = 34/271 (12%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGE 140
G Y ++ +GTP + VDTGSDL+WV C C C + + +FDP KSST
Sbjct: 62 GQYLMELYIGTPPIKISGTVDTGSDLIWVQCVPCLGCYNQIN-----PMFDPLKSSTYTN 116
Query: 141 IACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPL 200
I+C C Y CSP RC+Y Y D S T G ++ + L +G P+
Sbjct: 117 ISCDSPLCYKPYIGE---CSPEKRCDYTYGYADSSLTKGVLAQETVTLTSNTG----KPI 169
Query: 201 N-SSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL---- 255
+ ++FGCG+ +G+ D + G++G G +SL+SQ+ +K F+ CL
Sbjct: 170 SLQGILFGCGHNNTGNFN---DHEM-GLIGLGGGPTSLVSQIGPLFGGKK-FSQCLVPFL 224
Query: 256 -DVVKGGGI-FAIG-DVVSPKVKTTPMV---PNMPHYNVILEEVEVGGNPLDLPTSLLGT 309
D+ + F G +V+ V TTP+V +M Y V L + V L + +++
Sbjct: 225 TDITISSQMSFGKGSEVLGEGVVTTPLVQREQDMTSYYVTLLGISVEDTYLPMNSTI--- 281
Query: 310 GDERGT-IIDSGTTLAYLPPMLYDLVLSQFR 339
E+G ++DSGT LP LYD V + +
Sbjct: 282 --EKGNMLVDSGTPPNILPQQLYDRVYVEVK 310
>gi|215694947|dbj|BAG90138.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 100
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 50/96 (52%), Positives = 63/96 (65%), Gaps = 2/96 (2%)
Query: 42 GGERERTLSALKQHDTRRHGRMMASIDLELGGNG--HPSATGLYFTKVGLGTPTDEYYVQ 99
GG + + AL+ HD RH + + D LGG G S+TGLY+T++G+GTP EYYVQ
Sbjct: 3 GGCKGSDIGALQTHDRNRHLSRLVAADFSLGGLGGISTSSTGLYYTEIGIGTPAMEYYVQ 62
Query: 100 VDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKS 135
VDTGS WVNC C +CP KSD+ KLTL+DP S
Sbjct: 63 VDTGSSAFWVNCIPCKQCPRKSDILKKLTLYDPRSS 98
>gi|255566835|ref|XP_002524401.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223536362|gb|EEF38012.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 455
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 93/290 (32%), Positives = 137/290 (47%), Gaps = 38/290 (13%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPS 133
+G S +G YF + +GTP + DTGSDL+WV C+ C C +S + F
Sbjct: 77 SGASSGSGQYFVSLRIGTPPQTLLLVADTGSDLIWVKCSPCRNCSHRSP----GSAFFAR 132
Query: 134 KSSTSGEIACSDNFCRTTYNNRYPSCSPGVR------CEYVVTYGDGSSTSGYFVRDIIQ 187
S+T I C C+ +P +P R C Y TY D S+T+G+F ++ +
Sbjct: 133 HSTTYSAIHCYSPQCQLV---PHPHPNPCNRTRLHSPCRYQYTYADSSTTTGFFSKEALT 189
Query: 188 LNQASGNLKTAPLNSSVIFGCGNRQSG-DLGSSTDAAVDGILGFGQANSSLLSQLAAAGN 246
LN ++G +K LN + FGCG R SG L ++ G++G G+A S SQL
Sbjct: 190 LNTSTGKVKK--LN-GLSFGCGFRISGPSLTGASFEGAQGVMGLGRAPISFSSQLGR--R 244
Query: 247 VRKEFAHCL----------DVVKGGGIFAIGDVVSPK--VKTTPMV--PNMP-HYNVILE 291
+F++CL + GG A VS K + TP++ P P Y + ++
Sbjct: 245 FGSKFSYCLMDYTLSPPPTSFLTIGG--AQNVAVSKKGIMSFTPLLINPLSPTFYYIAIK 302
Query: 292 EVEVGGNPLDLPTSLLGTGD--ERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
V V G L + S+ D GTIIDSGTTL ++ Y +L F+
Sbjct: 303 GVYVNGVKLPINPSVWSIDDLGNGGTIIDSGTTLTFITEPAYTEILKAFK 352
>gi|359492489|ref|XP_002285867.2| PREDICTED: aspartic proteinase Asp1-like [Vitis vinifera]
Length = 453
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 87/288 (30%), Positives = 120/288 (41%), Gaps = 33/288 (11%)
Query: 62 RMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC-AGCSRCPTK 120
RM ++ L GN +P G Y + +G P Y + +D+GSDL W+ C A C C TK
Sbjct: 49 RMGHTVVFPLQGNVYPQ--GFYSVSLRIGNPPKPYTLDIDSGSDLTWLQCDAPCVSC-TK 105
Query: 121 SDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPG-VRCEYVVTYGDGSSTSG 179
+ P G I C+D C + P C +C+Y V+Y D S+ G
Sbjct: 106 AP--------HPPYKPNKGPITCNDPMCSALHWPSKPPCKASHEQCDYEVSYADHGSSLG 157
Query: 180 YFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLS 239
V DI L +G L AP + FGCG QS G + VDG+LG G SS+++
Sbjct: 158 VLVHDIFSLQLTNGTL-AAP---RLAFGCGYDQSYP-GPNAPPFVDGVLGLGYGKSSIVT 212
Query: 240 QLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNP 299
QL + G +R HCL + + TTP + P E G
Sbjct: 213 QLRSLGLIRSIVGHCLSGRG-----GGFLFLGDGLSTTPGIIWTPMSRKSGESAYALG-- 265
Query: 300 LDLPTSLLGTGDERGT-----IIDSGTTLAYLPPMLYDLVLSQFRFWI 342
P LL G G + DSG++ Y Y LS R ++
Sbjct: 266 ---PADLLFNGQNSGVKGLRLVFDSGSSYTYFNAQAYKTTLSLVRKYL 310
>gi|357118076|ref|XP_003560785.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 477
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 92/264 (34%), Positives = 125/264 (47%), Gaps = 35/264 (13%)
Query: 87 VGLGTPTDEYYVQVDTGSDLLWVNCAGCS-RCPTKSDLGIKLTLFDPSKSSTSGEIACSD 145
VG GTP + +DTGSDL W+ C CS C + D FDP+KSS+ + C
Sbjct: 141 VGFGTPAQTAAIILDTGSDLSWIQCKPCSGHCYRQHDPD-----FDPAKSSSYAAVPCGT 195
Query: 146 NFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVI 205
C C+ G C Y V YGDGSST+G RD + N +S +
Sbjct: 196 PVCAAAGG----MCN-GTTCLYGVQYGDGSSTTGVLSRDTLTFNSSSK-------FTGFT 243
Query: 206 FGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGG---- 261
FGCG + GD G VDG+LG G+ SL SQ AA + F++CL
Sbjct: 244 FGCGEKNIGDFGE-----VDGLLGLGRGKLSLPSQ--AAPSFGGVFSYCLPSYNTTPGYL 296
Query: 262 GIFAIGDVVSPKVKTTPMV--PNMPHYNVI-LEEVEVGGNPLDLPTSLLGTGDERGTIID 318
I A + V+ T M+ P P + I L + +GG L +P S+ + GT++D
Sbjct: 297 NIGATKPTSTVPVQYTAMIKKPQYPSFYFIELVSINIGGYILPVPPSVF---TKTGTLLD 353
Query: 319 SGTTLAYLPPMLYDLVLSQFRFWI 342
SGT L YLPP Y + +F+F +
Sbjct: 354 SGTILTYLPPPAYTSLRDRFKFTM 377
>gi|356541713|ref|XP_003539318.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
Length = 640
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 84/296 (28%), Positives = 133/296 (44%), Gaps = 54/296 (18%)
Query: 50 SALKQHDTRRHGRMMASIDLELGGNGHPSA----------TGLYFTKVGLGTPTDEYYVQ 99
S+L + RRH + S HP+A G Y T++ +GTP + +
Sbjct: 57 SSLSHFNPRRHLQGSQS-------EHHPNARMRLFDDLLRNGYYTTRLWIGTPPQRFALI 109
Query: 100 VDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSC 159
VDTGS + +V C+ C C + D F P S T + C+ C + +
Sbjct: 110 VDTGSTVTYVPCSTCKHCGSHQD-----PKFRPEASETYQPVKCTWQ-CNCDDDRK---- 159
Query: 160 SPGVRCEYVVTYGDGSSTSGYFVRDIIQL-NQASGNLKTAPLNSSVIFGCGNRQSGDLGS 218
+C Y Y + S++SG D++ NQ+ + + A IFGC N ++GD+
Sbjct: 160 ----QCTYERRYAEMSTSSGVLGEDVVSFGNQSELSPQRA------IFGCENDETGDI-- 207
Query: 219 STDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKG-------GGIFAIGDVVS 271
+ DGI+G G+ + S++ QL + F+ C + GGI D+V
Sbjct: 208 -YNQRADGIMGLGRGDLSIMDQLVEKKVISDAFSLCYGGMGVGGGAMVLGGISPPADMVF 266
Query: 272 PKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLP 327
+ P+ P+YN+ L+E+ V G L L + + GT++DSGTT AYLP
Sbjct: 267 --THSDPV--RSPYYNIDLKEIHVAGKRLHLNPKVF--DGKHGTVLDSGTTYAYLP 316
>gi|449515031|ref|XP_004164553.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 430
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 78/266 (29%), Positives = 123/266 (46%), Gaps = 35/266 (13%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSG 139
+G Y V +GTP +Y DTGSDL W C C +C + +F+P KS++
Sbjct: 89 SGEYLMSVSIGTPPVDYLGIADTGSDLTWAQCLPCLKCYQQLR-----PIFNPLKSTSFS 143
Query: 140 EIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAP 199
+ C+ C + C C+Y TYGD + + G + I + +S +K+
Sbjct: 144 HVPCNTQTCHAVDDGH---CGVQGVCDYSYTYGDRTYSKGDLGFEKITIGSSS--VKS-- 196
Query: 200 LNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVV- 258
+ GCG+ SG G ++ G++G G SL+SQ++ + + F++CL +
Sbjct: 197 -----VIGCGHASSGGFGFAS-----GVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLL 246
Query: 259 ---KGGGIFAIGDVVS-PKVKTTPMVPN--MPHYNVILEEVEVGGNPLDLPTSLLGTGDE 312
G F VVS P V +TP++ + +Y + LE + +G + +
Sbjct: 247 SHANGKINFGENAVVSGPGVVSTPLISKNTVTYYYITLEAISIGNE------RHMAFAKQ 300
Query: 313 RGTIIDSGTTLAYLPPMLYDLVLSQF 338
IIDSGTTL LP LYD V+S
Sbjct: 301 GNVIIDSGTTLTILPKELYDGVVSSL 326
>gi|226509408|ref|NP_001141440.1| uncharacterized protein LOC100273550 precursor [Zea mays]
gi|194704586|gb|ACF86377.1| unknown [Zea mays]
gi|413938617|gb|AFW73168.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 478
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 94/270 (34%), Positives = 129/270 (47%), Gaps = 29/270 (10%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y LGTP ++VDTGSDL WV C C+ P S K LFDP++SS+ +
Sbjct: 140 YVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCAAAP--SCYSQKDPLFDPAQSSSYAAVP 197
Query: 143 CSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNS 202
C C S +C YVV+YGDGS+T+G + D + L+ +S
Sbjct: 198 CGGPVC-AGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLSASSA-------VQ 249
Query: 203 SVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKG-G 261
FGCG+ QSG VDG+LG G+ SL+ Q AG F++CL
Sbjct: 250 GFFFGCGHAQSGLFN-----GVDGLLGLGREQPSLVEQ--TAGTYGGVFSYCLPTKPSTA 302
Query: 262 GIFAIG----DVVSPKVKTTPMV--PNMP-HYNVILEEVEVGGNPLDLPTSLLGTGDERG 314
G +G +P TT ++ PN P +Y V+L + VGG L +P S G
Sbjct: 303 GYLTLGVGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAG----G 358
Query: 315 TIIDSGTTLAYLPPMLYDLVLSQFRFWIAS 344
T++D+GT + LPP Y + S FR +AS
Sbjct: 359 TVVDTGTVVTRLPPTAYAALRSAFRSGMAS 388
>gi|86438622|emb|CAJ26370.1| chloroplast nucleoid binding protein [Brachypodium sylvaticum]
Length = 443
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 94/275 (34%), Positives = 121/275 (44%), Gaps = 35/275 (12%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSR--CPTKSDLGIKLTLFDPSKSST 137
TG Y VGLGTP + V DTGSDL WV C CS C + D LF PS SST
Sbjct: 82 TGNYVVSVGLGTPARDLTVVFDTGSDLSWVQCGPCSSGGCYHQQD-----PLFAPSSSST 136
Query: 138 SGEIACSDNFCRTTYNNRYPSCSPG-VRCEYVVTYGDGSSTSGYFVRDIIQL------NQ 190
+ C + C + S SPG RC Y V YGD S T G+ D + L N
Sbjct: 137 FSAVRCGEPECPRARQSC--SSSPGDDRCPYEVVYGDKSRTVGHLGNDTLTLGTTPSTNA 194
Query: 191 ASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKE 250
+ N P +FGCG +G G + DG+ G G+ SL SQ AAG +
Sbjct: 195 SENNSNKLP---GFVFGCGENNTGLFGKA-----DGLFGLGRGKVSLSSQ--AAGKYGEG 244
Query: 251 FAHCL--DVVKGGGIFAIGD--VVSPKVKTTPMV--PNMPH-YNVILEEVEVGGNPLDLP 303
F++CL G ++G + TPM+ N P Y V L + V G + +
Sbjct: 245 FSYCLPSSSSNAHGYLSLGTPAPAPAHARFTPMLNRSNTPSFYYVKLVGIRVAGRAIKV- 303
Query: 304 TSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
S G I+DSGT + L P Y + + F
Sbjct: 304 -SSRPALWPAGLIVDSGTVITRLAPRAYSALRTAF 337
>gi|326490862|dbj|BAJ90098.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 447
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 86/275 (31%), Positives = 122/275 (44%), Gaps = 45/275 (16%)
Query: 83 YFTKVGLGTP-TDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEI 141
Y G+GTP + ++VDTGSD++W C C C T+ L FD S S T +
Sbjct: 92 YLIHFGIGTPRPQQVALEVDTGSDVVWTQCRPCFDCFTQ-----PLPRFDTSASDTVHGV 146
Query: 142 ACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLN 201
C+D CR R +C G C Y V YGD S T G +D + G T P
Sbjct: 147 LCTDPICRAL---RPHACFLG-GCTYQVNYGDNSVTIGQLAKDSFTFDGKGGGKVTVP-- 200
Query: 202 SSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVV--- 258
++FGCG +G+ S+ GI GFG+ SL QL + F++C +
Sbjct: 201 -DLVFGCGQYNTGNFHSNE----TGIAGFGRGPLSLPRQLGVS-----SFSYCFTTIFES 250
Query: 259 KGGGIF------------AIGDVVSPKVKTTPMVPNMPHYNVI-LEEVEVGGNPLDLPTS 305
K +F A G ++S TP +PN P Y + L+ + VG L +P S
Sbjct: 251 KSTPVFLGGAPADGLRAHATGPILS-----TPFLPNHPEYYYLSLKGITVGKTRLAVPES 305
Query: 306 --LLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
++ GTIIDSGT + P ++ + F
Sbjct: 306 AFVVKADGSGGTIIDSGTAITAFPRAVFRSLWEAF 340
>gi|255566010|ref|XP_002523993.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223536720|gb|EEF38361.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 439
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 94/310 (30%), Positives = 129/310 (41%), Gaps = 27/310 (8%)
Query: 46 ERTLSALKQHDTRRHG---RMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDT 102
+R +SA+++ +R H + I + + S G Y K LGTP + DT
Sbjct: 52 QRIVSAVRRSMSRVHHFSPTKNSDIFTDTAQSEMISNQGEYLMKFSLGTPAFDILAIADT 111
Query: 103 GSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPG 162
GSDL+W C C +C + LFDP SST +I+CS C S
Sbjct: 112 GSDLIWTQCKPCDQCYEQ-----DAPLFDPKSSSTYRDISCSTKQCDLLKEGASCSGEGN 166
Query: 163 VRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDA 222
C Y +YGD S TSG D I L SG P I GCG+ G
Sbjct: 167 KTCHYSYSYGDRSFTSGNVAADTITLGSTSGRPVLLP---KAIIGCGHNNGGSFTEKGSG 223
Query: 223 AVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGI------FAIGDVVS-PKVK 275
V SL+SQL + + +F++CL + F +VS V+
Sbjct: 224 IVGLG----GGPISLISQLGST--IDGKFSYCLVPLSSNATNSSKLNFGSNGIVSGGGVQ 277
Query: 276 TTPMVPNMP--HYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDL 333
+TP++ P Y + LE V VG + P S GT E IIDSGTTL P +
Sbjct: 278 STPLISKDPDTFYFLTLEAVSVGSERIKFPGSSFGTS-EGNIIIDSGTTLTLFPEDFFSE 336
Query: 334 VLSQFRFWIA 343
+ S + +A
Sbjct: 337 LSSAVQDAVA 346
>gi|357489329|ref|XP_003614952.1| Aspartic proteinase-like protein [Medicago truncatula]
gi|355516287|gb|AES97910.1| Aspartic proteinase-like protein [Medicago truncatula]
Length = 530
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 87/268 (32%), Positives = 131/268 (48%), Gaps = 28/268 (10%)
Query: 82 LYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKS-----DLGIKLTLFDPSKSS 136
L++T + +GTP + V +DTGSD+ WV C C C S L L + PS SS
Sbjct: 101 LHYTWIDIGTPNVSFLVALDTGSDMFWVPC-DCIECAPLSAAFYNALDRDLNQYSPSLSS 159
Query: 137 TSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTY-GDGSSTSGYFVRDIIQLNQASGNL 195
+S + C C N + RC Y+ Y D +S+SG+ + D + L AS N
Sbjct: 160 SSRHLPCGHQLCNQNSNCK----GFKDRCPYIKEYTSDNTSSSGFLIEDKLHL--ASNNA 213
Query: 196 KTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL 255
+ +SVI GCG +QSG AA +G+LG G + S+ + LA AG +R + CL
Sbjct: 214 TKNSIQASVILGCGRKQSGYFLEG--AAPNGMLGLGPGSISVPALLAKAGLIRNSISICL 271
Query: 256 DVVKGGGIFAIGDV-VSPKVKTTPMVPN---MPHYNVILEEVEVGGNPLDLPTSLLGTGD 311
+ KG G GD + + ++TP + + + +Y V +E VG S
Sbjct: 272 N-EKGSGRILFGDQGHATQRRSTPFLLDDGELLNYFVGVERFCVG--------SFCYKET 322
Query: 312 ERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
E ID+GT+ YLP +Y+ V+++F
Sbjct: 323 EFKAFIDTGTSFTYLPKGVYETVVAEFE 350
>gi|302141796|emb|CBI18999.3| unnamed protein product [Vitis vinifera]
Length = 390
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 87/288 (30%), Positives = 120/288 (41%), Gaps = 33/288 (11%)
Query: 62 RMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC-AGCSRCPTK 120
RM ++ L GN +P G Y + +G P Y + +D+GSDL W+ C A C C TK
Sbjct: 16 RMGHTVVFPLQGNVYPQ--GFYSVSLRIGNPPKPYTLDIDSGSDLTWLQCDAPCVSC-TK 72
Query: 121 SDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPG-VRCEYVVTYGDGSSTSG 179
+ P G I C+D C + P C +C+Y V+Y D S+ G
Sbjct: 73 AP--------HPPYKPNKGPITCNDPMCSALHWPSKPPCKASHEQCDYEVSYADHGSSLG 124
Query: 180 YFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLS 239
V DI L +G L AP + FGCG QS G + VDG+LG G SS+++
Sbjct: 125 VLVHDIFSLQLTNGTL-AAP---RLAFGCGYDQSYP-GPNAPPFVDGVLGLGYGKSSIVT 179
Query: 240 QLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNP 299
QL + G +R HCL + + TTP + P E G
Sbjct: 180 QLRSLGLIRSIVGHCLSGRG-----GGFLFLGDGLSTTPGIIWTPMSRKSGESAYALG-- 232
Query: 300 LDLPTSLLGTGDERGT-----IIDSGTTLAYLPPMLYDLVLSQFRFWI 342
P LL G G + DSG++ Y Y LS R ++
Sbjct: 233 ---PADLLFNGQNSGVKGLRLVFDSGSSYTYFNAQAYKTTLSLVRKYL 277
>gi|226491934|ref|NP_001140743.1| uncharacterized protein LOC100272818 [Zea mays]
gi|194700872|gb|ACF84520.1| unknown [Zea mays]
Length = 351
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 81/249 (32%), Positives = 119/249 (47%), Gaps = 30/249 (12%)
Query: 98 VQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYP 157
V +D+ SD+ WV C C P + + +DPS+S TS +CS C T Y
Sbjct: 31 VVLDSASDVPWVQCVPCPIPPCHPQVD---SFYDPSRSPTSAAFSCSSPTC--TALGPYA 85
Query: 158 SCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLG 217
+ +C+Y+V Y DGSSTSG ++ D++ L+ +GN S FGC + + G
Sbjct: 86 NGCANNQCQYLVRYPDGSSTSGAYIADLLTLD--AGNAV-----SGFKFGCSHAEQG--- 135
Query: 218 SSTDAAVDGILGFGQANSSLLSQLAAA-GNVRKEFAHCLDVVKG-GGIFAIG--DVVSPK 273
S DA GI+ G SLLSQ A+ GN F++C+ G F +G S +
Sbjct: 136 -SFDARAAGIMALGGGPESLLSQTASRYGNA---FSYCIPATASDSGFFTLGVPRRASSR 191
Query: 274 VKTTPMV---PNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPML 330
TPMV Y V+L + VGG L + ++ G+++DS T + LPP
Sbjct: 192 YVVTPMVRFRQAATFYGVLLRTITVGGQRLGVAPAVFAA----GSVLDSRTAITRLPPTA 247
Query: 331 YDLVLSQFR 339
Y + + FR
Sbjct: 248 YQALRAAFR 256
>gi|242067691|ref|XP_002449122.1| hypothetical protein SORBIDRAFT_05g005410 [Sorghum bicolor]
gi|241934965|gb|EES08110.1| hypothetical protein SORBIDRAFT_05g005410 [Sorghum bicolor]
Length = 407
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 83/284 (29%), Positives = 127/284 (44%), Gaps = 40/284 (14%)
Query: 69 LELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLT 128
+LGG+ HP TG ++ + +G P Y++ +DTGS+L W+ C + P K+ +
Sbjct: 28 FKLGGDVHP--TGHFYVTMNIGEPAKPYFLDIDTGSNLTWIKCHA-TPGPCKTCNKVPHP 84
Query: 129 LFDPSKSSTSGEIACSDNFCRTTYNN--RYPSCSPGV-RCEYVVTYGDGSSTSGYFVRDI 185
L+ P K + C+D C + + C +C Y + Y DG+++ G + D
Sbjct: 85 LYRPKKL-----VPCADPLCDALHKDLGTTKDCREEPDQCHYQINYADGTTSLGVLLLDK 139
Query: 186 IQLNQASGNLKTAPLNSSVIFGCG--NRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAA 243
L S ++ FGCG Q + VDGILG G+ + L+SQL
Sbjct: 140 FSLPTGSAR--------NIAFGCGYDQMQGPKKKAPEKVPVDGILGLGRGSVDLVSQLKH 191
Query: 244 AGNVRKE-FAHCLDVVKGGGIFAIGDVVSPK-------VKTTPMVPNMPHYNVILEEVEV 295
+G V K HCL KGGG IG+ P + PN HY+ + +
Sbjct: 192 SGAVSKNVIGHCLS-SKGGGYLFIGEENVPSSHLHIIYIYCISREPN--HYSPGQATLHL 248
Query: 296 GGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
G NP +GT + I DSG+T YLP L+ ++S +
Sbjct: 249 GRNP-------IGTKPFKA-IFDSGSTYTYLPENLHAQLVSALK 284
>gi|168008816|ref|XP_001757102.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162691600|gb|EDQ77961.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 406
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 102/318 (32%), Positives = 145/318 (45%), Gaps = 36/318 (11%)
Query: 43 GERERTLSALKQHDTRRHGRMMASIDLELGG-NGHPSATGLYFTKVGLGTPTDEYYVQVD 101
G +T++ L + +R + S D + +G +G YF ++ +GTP Y+ +D
Sbjct: 17 GRINQTVNGLTRSRSRDRQTKVPSQDFQAPVVSGLSLGSGEYFIRISVGTPPRRMYLVMD 76
Query: 102 TGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSP 161
TGSD+LW+ CA C C +SD +FDP KSST + CS C N +C
Sbjct: 77 TGSDILWLQCAPCVNCYHQSD-----AIFDPYKSSTYSTLGCSTRQC---LNLDIGTCQA 128
Query: 162 GVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTD 221
+C Y V YGDGS T+G F D + LN SG + LN + GCG+ G
Sbjct: 129 N-KCLYQVDYGDGSFTTGEFGTDDVSLNSTSG-VGQVVLN-KIPLGCGHDNEGYF----- 180
Query: 222 AAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL-----DVVKGGG-IFAIGDVVSPKVK 275
G+LG G+ S +Q+ R F++CL D +G +F V +
Sbjct: 181 VGAAGLLGLGKGPLSFPNQVDPQNGGR--FSYCLTDRETDSTEGSSLVFGEAAVPPAGAR 238
Query: 276 TTPMVPNM---PHYNVILEEVEVGGNPLDLPTSL-----LGTGDERGTIIDSGTTLAYLP 327
TP NM Y + + + VGG L +PTS LG G G IIDSGT++ L
Sbjct: 239 FTPQDSNMRVPTFYYLKMTGISVGGTILTIPTSAFQLDSLGNG---GVIIDSGTSVTRLQ 295
Query: 328 PMLYDLVLSQFRFWIASL 345
Y + FR + L
Sbjct: 296 NAAYASLRDAFRAGTSDL 313
>gi|225446261|ref|XP_002265547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 440
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 85/282 (30%), Positives = 125/282 (44%), Gaps = 28/282 (9%)
Query: 58 RRHGRMMASIDLELGGNGHPSATGLYF-TKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSR 116
RR R A I E+ N G F +G P V +DTGSDLLWV C C+
Sbjct: 65 RRRTRRAAFITDEIQANMVADDRGQAFLVNFSVGRPPVPQLVGIDTGSDLLWVQCRPCAD 124
Query: 117 CPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSS 176
C +S +FDPSKSST +++ C + +Y + +C Y +Y DGS+
Sbjct: 125 CFRQS-----TPIFDPSKSSTYVDLSYDSPICPNSPQKKYNHLN---QCIYNASYADGST 176
Query: 177 TSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSS 236
+SG + I + T SSV+FGCG+ G D GILG + S
Sbjct: 177 SSGNLATEDIVFETSDQGTVTV---SSVVFGCGHSNRGRF----DGQQSGILGLSAGDQS 229
Query: 237 LLSQLAAAGNVRKEFAHCL----DVVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEE 292
++S+L + F++C+ D +GD V + +TP Y V LE
Sbjct: 230 IVSRLGS------RFSYCIGDLFDPHYTHNQLVLGDGVKMEGSSTPFHTFNGFYYVTLEG 283
Query: 293 VEVGGNPLDLPTSLLGTGD--ERGTIIDSGTTLAYLPPMLYD 332
+ VG LD+ + + + G ++DSGTT +L +D
Sbjct: 284 ISVGETRLDINPEVFQRTESGQGGVVMDSGTTATFLAKDGFD 325
>gi|326500240|dbj|BAK06209.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 505
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 90/304 (29%), Positives = 134/304 (44%), Gaps = 32/304 (10%)
Query: 41 AGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQV 100
AG +R R LSA A++ + G L++ V +GTP + V +
Sbjct: 63 AGHDRHRALSAAGGRPPLTFSEGNATLKVSNLGF-------LHYALVTVGTPGHTFMVAL 115
Query: 101 DTGSDLLWV--NCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPS 158
DTGSDL W+ C GC+ S + + PS SSTS + C+ +FC
Sbjct: 116 DTGSDLFWLPCQCDGCTPP-PSSAASAPASFYIPSLSSTSQAVPCNSDFC-----GLRKE 169
Query: 159 CSPGVRCEYVVTYGDG-SSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLG 217
CS C Y + Y +S+SG+ V D++ L+ + + L + ++FGCG Q+G
Sbjct: 170 CSKTSSCPYKMVYVSADTSSSGFLVEDVLYLSTEDTHPQF--LKAQIMFGCGEVQTGSFL 227
Query: 218 SSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSPKVKTT 277
+ AA +G+ G G S+ S LA G F+ C G G + GD S + T
Sbjct: 228 DA--AAPNGLFGLGVDMISVPSILAQKGLTSNSFSMCFG-RDGIGRISFGDQGSSDQEET 284
Query: 278 PMVPNMPH--YNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVL 335
P+ N H Y + + + VG N +DL E TI D+GT+ YL Y +
Sbjct: 285 PLDINQKHPTYAITITGIAVGNNLMDL---------EVSTIFDTGTSFTYLADPAYTYIT 335
Query: 336 SQFR 339
F
Sbjct: 336 DGFH 339
>gi|297819684|ref|XP_002877725.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297323563|gb|EFH53984.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 633
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 87/283 (30%), Positives = 127/283 (44%), Gaps = 41/283 (14%)
Query: 52 LKQHDTRR--HGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWV 109
L + D++ H RM DL + G Y T++ +GTP + + VD+GS + +V
Sbjct: 69 LHKSDSKSLPHSRMRLYDDLLING--------YYTTRLWIGTPPQMFALIVDSGSTVTYV 120
Query: 110 NCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVV 169
C+ C +C D F P SST + C+ + C + +C Y
Sbjct: 121 PCSDCEQCGKHQD-----PKFQPELSSTYQPVKCNMD-CNCDDDKE--------QCVYER 166
Query: 170 TYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILG 229
Y + SS+ G D+I S + P +FGC ++GDL S DGI+G
Sbjct: 167 EYAEHSSSKGVLGEDLISFGNES---QLTP--QRAVFGCETVETGDLYSQ---RADGIIG 218
Query: 230 FGQANSSLLSQLAAAGNVRKEFAHC---LDVVKGGGIFAIG--DVVSPKVKTTPMVPNMP 284
GQ + SL+ QL G + F C +DV GGG +G D S + T P
Sbjct: 219 LGQGDLSLVDQLVDKGLISNSFGLCYGGMDV--GGGSMILGGFDYPSDMIFTDSDPDRSP 276
Query: 285 HYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLP 327
+YN+ L + V G L L + + E G ++DSGTT AYLP
Sbjct: 277 YYNIDLTGIRVAGKKLSLNSRVF--DGEHGAVLDSGTTYAYLP 317
>gi|145324889|ref|NP_001077691.1| aspartyl protease [Arabidopsis thaliana]
gi|332194268|gb|AEE32389.1| aspartyl protease [Arabidopsis thaliana]
Length = 410
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 81/268 (30%), Positives = 130/268 (48%), Gaps = 27/268 (10%)
Query: 82 LYFTKVGLGTPTD--EYYVQVDTGSDLLWVNC-AGCSRCPTKSDLGIKLTLFDPSKSSTS 138
LY+T++ +G P D Y++ +DTGS+L W+ C A C+ C ++ L+ P K +
Sbjct: 29 LYYTRILVGKPEDGQYYHLDIDTGSELTWIQCDAPCTSCAKGAN-----QLYKPRKDNL- 82
Query: 139 GEIACSDNFCRTTYNNRYPS-CSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKT 197
+ S+ FC N+ C +C+Y + Y D S + G +D L +G+L
Sbjct: 83 --VRSSEAFCVEVQRNQLTEHCENCHQCDYEIEYADHSYSMGVLTKDKFHLKLHNGSLA- 139
Query: 198 APLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL-- 255
S ++FGCG Q G L +T DGILG +A SL SQLA+ G + HCL
Sbjct: 140 ---ESDIVFGCGYDQQG-LLLNTLLKTDGILGLSRAKISLPSQLASRGIISNVVGHCLAS 195
Query: 256 DVVKGGGIFAIGDVV-SPKVKTTPMVPN--MPHYNVILEEVEVGGNPLDLPTSLLGTGDE 312
D+ G IF D+V S + PM+ + + Y + + ++ G L SL G
Sbjct: 196 DLNGEGYIFMGSDLVPSHGMTWVPMLHDSRLDAYQMQVTKMSYGQGML----SLDGENGR 251
Query: 313 RGTII-DSGTTLAYLPPMLYDLVLSQFR 339
G ++ D+G++ Y P Y +++ +
Sbjct: 252 VGKVLFDTGSSYTYFPNQAYSQLVTSLQ 279
>gi|147858841|emb|CAN78694.1| hypothetical protein VITISV_037475 [Vitis vinifera]
Length = 442
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 86/262 (32%), Positives = 125/262 (47%), Gaps = 27/262 (10%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
+ + +G P YV +DTGSDL W+ C C C + D +++ +KS + E+
Sbjct: 93 FLANLSIGNPPTNVYVVLDTGSDLFWIQCEPCDVCYKQKD-----PIYNRTKSDSYTEML 147
Query: 143 CSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQL-NQASGNLKTAPLN 201
C++ C + R CS C Y Y DG+ TSG + + + S KTA
Sbjct: 148 CNEPPCVSL--GREGQCSDSGSCLYQTAYADGARTSGLLSYEKVAFTSHYSDEDKTA--- 202
Query: 202 SSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVV--- 258
V FGCG + + S+ D V G+ G SL+SQL+A G V K FA+C +
Sbjct: 203 -QVGFGCGLQNLNFITSNRDGGVLGL---GPGLVSLVSQLSAIGKVSKSFAYCFGNISNP 258
Query: 259 KGGGIFAIGDVVSPKVKTTPMVPNMPHY-NVILEEVEVGGNPLDLPTSLL-----GTGDE 312
GG GD TPMV +Y N++ + VG LD+ +S G+G
Sbjct: 259 NAGGFLVFGDATYLNGDMTPMVIAEFYYVNLLGIGLGVGEPRLDINSSSFERKPDGSG-- 316
Query: 313 RGTIIDSGTTLAYLPPMLYDLV 334
G IIDSG+TL+ PP +Y++V
Sbjct: 317 -GVIIDSGSTLSVFPPEVYEVV 337
>gi|326514838|dbj|BAJ99780.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 430
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 75/281 (26%), Positives = 125/281 (44%), Gaps = 40/281 (14%)
Query: 69 LELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC-AGCSRCPTKSDLGIKL 127
+L G +P G Y+ + +G P Y++ VDTGSDL W+ C A C C +
Sbjct: 61 FQLQGAVYP--IGHYYVTMNIGDPAKPYFLDVDTGSDLTWLQCDAPCQSCNK-----VPH 113
Query: 128 TLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQ 187
+ P+K+ + C+ + C + N+ C+ +C+Y + Y D +S+ G + D
Sbjct: 114 PWYKPTKNKI---VPCAASLCTSLTPNK--KCAVPQQCDYQIKYTDKASSLGVLIADNFT 168
Query: 188 LNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNV 247
L+ + ++ + +++ FGCG Q + AA DG+LG G+ SLLSQL G
Sbjct: 169 LSLRN----SSTVRANLTFGCGYDQQVGKNGAVQAATDGLLGLGKGAVSLLSQLKQQGVT 224
Query: 248 RKEFAHCLDVVKGGGIFAIGDVVSPKVKTT--PMVPNMP--HYN-----VILEEVEVGGN 298
+ HC GGG GD + P + T PM +Y+ + + +G
Sbjct: 225 KNVLGHCFS-TNGGGFLFFGDDIVPTSRVTWVPMARTTSGNYYSPGSGTLYFDRRSLGMK 283
Query: 299 PLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
P+++ + DSG+T AY Y +S +
Sbjct: 284 PMEV-------------VFDSGSTYAYFAAEPYQATVSALK 311
>gi|326499199|dbj|BAK06090.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 505
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 90/304 (29%), Positives = 134/304 (44%), Gaps = 32/304 (10%)
Query: 41 AGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQV 100
AG +R R LSA A++ + G L++ V +GTP + V +
Sbjct: 63 AGHDRHRALSAAGGRPPLTFSEGNATLKVSNLGF-------LHYALVTVGTPGHTFMVAL 115
Query: 101 DTGSDLLWV--NCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPS 158
DTGSDL W+ C GC+ S + + PS SSTS + C+ +FC
Sbjct: 116 DTGSDLFWLPCQCDGCTPP-PSSAASAPASFYIPSLSSTSQAVPCNSDFC-----GLRKE 169
Query: 159 CSPGVRCEYVVTYGDG-SSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLG 217
CS C Y + Y +S+SG+ V D++ L+ + + L + ++FGCG Q+G
Sbjct: 170 CSKTSSCPYKMVYVSADTSSSGFLVEDVLYLSTEDTHPQF--LKAQIMFGCGEVQTGSFL 227
Query: 218 SSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSPKVKTT 277
+ AA +G+ G G S+ S LA G F+ C G G + GD S + T
Sbjct: 228 DA--AAPNGLFGLGVDMISVPSILAQKGLTSNSFSMCFG-RDGIGRISFGDQGSSDQEET 284
Query: 278 PMVPNMPH--YNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVL 335
P+ N H Y + + + VG N +DL E TI D+GT+ YL Y +
Sbjct: 285 PLDINQKHPTYAITITGIAVGNNLMDL---------EVSTIFDTGTSFTYLADPAYTYIT 335
Query: 336 SQFR 339
F
Sbjct: 336 DGFH 339
>gi|147786881|emb|CAN62311.1| hypothetical protein VITISV_008929 [Vitis vinifera]
Length = 408
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 85/282 (30%), Positives = 125/282 (44%), Gaps = 28/282 (9%)
Query: 58 RRHGRMMASIDLELGGNGHPSATGLYF-TKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSR 116
RR R A I E+ N G F +G P V +DTGSDLLWV C C+
Sbjct: 33 RRRTRRAAFIXDEIQANMVADDRGQAFLVNFSVGRPPVPQLVGIDTGSDLLWVQCRPCAD 92
Query: 117 CPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSS 176
C +S +FDPSKSST +++ C + +Y + +C Y +Y DGS+
Sbjct: 93 CFRQS-----TPIFDPSKSSTYVDLSYDSPICPNSPQKKYNHLN---QCIYNASYADGST 144
Query: 177 TSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSS 236
+SG + I + T SSV+FGCG+ G D GILG + S
Sbjct: 145 SSGNLATEDIVFETSDQGTVTV---SSVVFGCGHSNRGRF----DGQQSGILGLSAGDQS 197
Query: 237 LLSQLAAAGNVRKEFAHCL----DVVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEE 292
++S+L + F++C+ D +GD V + +TP Y V LE
Sbjct: 198 IVSRLGS------RFSYCIGDLFDPHYTHNQLVLGDGVKMEGSSTPFHTFNGFYYVTLEG 251
Query: 293 VEVGGNPLDLPTSLLGTGD--ERGTIIDSGTTLAYLPPMLYD 332
+ VG LD+ + + + G ++DSGTT +L +D
Sbjct: 252 ISVGETRLDINPEVFQRTESGQGGVVMDSGTTATFLAKDGFD 293
>gi|21805926|gb|AAM76716.1| nucellin-like aspartic protease [Zea mays]
Length = 357
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 74/265 (27%), Positives = 117/265 (44%), Gaps = 43/265 (16%)
Query: 89 LGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFC 148
+G P Y++ VDTGSDL W+ C P +S + L+ P+ + + C++ C
Sbjct: 1 IGNPAKPYFLDVDTGSDLTWLQCDA----PCRSCNKVPHPLYRPTANRL---VPCANALC 53
Query: 149 RTTY-----NNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSS 203
+ NN+ PS +C+Y + Y D +S+ G + D L S N++
Sbjct: 54 TALHSGQGSNNKCPSPK---QCDYQIKYTDSASSQGVLINDSFSLPMRSSNIRPG----- 105
Query: 204 VIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGI 263
+ FGCG Q + AA+DG+LG G+ + SL+SQL G + HCL GGG
Sbjct: 106 LTFGCGYDQQVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVGHCLS-TNGGGF 164
Query: 264 FAIGDVVSPKVKTT--PMVPNMP--HYN-----VILEEVEVGGNPLDLPTSLLGTGDERG 314
GD V P + T PM +Y+ + + +G P+++
Sbjct: 165 LFFGDDVVPSSRVTWVPMAQRTSGNYYSPGSGTLYFDRRSLGVKPMEV------------ 212
Query: 315 TIIDSGTTLAYLPPMLYDLVLSQFR 339
+ DSG+T Y Y V+S +
Sbjct: 213 -VFDSGSTYTYFTAQPYQAVVSALK 236
>gi|413938615|gb|AFW73166.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 386
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 94/270 (34%), Positives = 129/270 (47%), Gaps = 29/270 (10%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y LGTP ++VDTGSDL WV C C+ P S K LFDP++SS+ +
Sbjct: 48 YVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCAAAP--SCYSQKDPLFDPAQSSSYAAVP 105
Query: 143 CSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNS 202
C C S +C YVV+YGDGS+T+G + D + L+ +S
Sbjct: 106 CGGPVC-AGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLSASSA-------VQ 157
Query: 203 SVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKG-G 261
FGCG+ QSG VDG+LG G+ SL+ Q AG F++CL
Sbjct: 158 GFFFGCGHAQSGLFN-----GVDGLLGLGREQPSLVEQ--TAGTYGGVFSYCLPTKPSTA 210
Query: 262 GIFAIG----DVVSPKVKTTPMV--PNMP-HYNVILEEVEVGGNPLDLPTSLLGTGDERG 314
G +G +P TT ++ PN P +Y V+L + VGG L +P S G
Sbjct: 211 GYLTLGVGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAF----AGG 266
Query: 315 TIIDSGTTLAYLPPMLYDLVLSQFRFWIAS 344
T++D+GT + LPP Y + S FR +AS
Sbjct: 267 TVVDTGTVVTRLPPTAYAALRSAFRSGMAS 296
>gi|225436202|ref|XP_002271145.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 440
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 96/342 (28%), Positives = 144/342 (42%), Gaps = 30/342 (8%)
Query: 6 LLALVVVTVAVVHQWAVGGGGVMGNFVFEVENKFKAGGERERTLSALKQHDTRRHGRMMA 65
++AL V+VA + V G + + K E L + R A
Sbjct: 14 VIALSFVSVAHISAAEVKNGRFSIDLIHRDSPKSPLYNPSETPAERLDRFFRRFMSFSEA 73
Query: 66 SIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGI 125
SI S G Y K+ +GTP + Y DTGSDL+W C C C +
Sbjct: 74 SISPNTPEPPVSSNNGEYLMKISIGTPPFDVYGIYDTGSDLMWTQCLPCLSCYKQ----- 128
Query: 126 KLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCS-PGVRCEYVVTYGDGSSTSGYFVRD 184
K +FDPSKS++ E++C CR SCS P C++ YGDGS G +
Sbjct: 129 KNPMFDPSKSTSFKEVSCESQQCRLLDTV---SCSQPQKLCDFSYGYGDGSLAQGVIATE 185
Query: 185 IIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAA 244
+ LN SG T+ LN ++FGCG+ SG + G+ G G SL SQ+ +
Sbjct: 186 TLTLNSNSGQ-PTSILN--IVFGCGHNNSGTFNENE----MGLFGTGGRPLSLTSQIMST 238
Query: 245 GNVRKEFAHCLDVVKGGGIFAIGDVVSPK-------VKTTPMVP--NMPHYNVILEEVEV 295
++F+ CL + + P+ V +TP+V + +Y V L+ + V
Sbjct: 239 LGSGRKFSQCLVPFRTDPSITSKIIFGPEAEVSGSDVVSTPLVTKDDPTYYFVTLDGISV 298
Query: 296 GGN--PLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVL 335
G P + + G+ ID+GT LP Y+ ++
Sbjct: 299 GDKLFPFSSSSPMATKGN---VFIDAGTPPTLLPRDFYNRLV 337
>gi|296090291|emb|CBI40110.3| unnamed protein product [Vitis vinifera]
Length = 408
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 85/282 (30%), Positives = 125/282 (44%), Gaps = 28/282 (9%)
Query: 58 RRHGRMMASIDLELGGNGHPSATGLYF-TKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSR 116
RR R A I E+ N G F +G P V +DTGSDLLWV C C+
Sbjct: 33 RRRTRRAAFITDEIQANMVADDRGQAFLVNFSVGRPPVPQLVGIDTGSDLLWVQCRPCAD 92
Query: 117 CPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSS 176
C +S +FDPSKSST +++ C + +Y + +C Y +Y DGS+
Sbjct: 93 CFRQS-----TPIFDPSKSSTYVDLSYDSPICPNSPQKKYNHLN---QCIYNASYADGST 144
Query: 177 TSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSS 236
+SG + I + T SSV+FGCG+ G D GILG + S
Sbjct: 145 SSGNLATEDIVFETSDQGTVTV---SSVVFGCGHSNRGRF----DGQQSGILGLSAGDQS 197
Query: 237 LLSQLAAAGNVRKEFAHCL----DVVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEE 292
++S+L + F++C+ D +GD V + +TP Y V LE
Sbjct: 198 IVSRLGS------RFSYCIGDLFDPHYTHNQLVLGDGVKMEGSSTPFHTFNGFYYVTLEG 251
Query: 293 VEVGGNPLDLPTSLLGTGD--ERGTIIDSGTTLAYLPPMLYD 332
+ VG LD+ + + + G ++DSGTT +L +D
Sbjct: 252 ISVGETRLDINPEVFQRTESGQGGVVMDSGTTATFLAKDGFD 293
>gi|224091849|ref|XP_002309371.1| predicted protein [Populus trichocarpa]
gi|222855347|gb|EEE92894.1| predicted protein [Populus trichocarpa]
Length = 438
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 88/303 (29%), Positives = 140/303 (46%), Gaps = 31/303 (10%)
Query: 46 ERTLSALKQHDTRRHGRMMASIDL-ELGGNGHPSATG-LYFTKVGLGTPTDEYYVQVDTG 103
ERT+ A + + ++ D+ +L N HPSA+ L+ +G P +DTG
Sbjct: 63 ERTMKASLARLSYLYAKIERDFDINDLWLNLHPSASEPLFLVNFSMGQPPVPQLAIMDTG 122
Query: 104 SDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPS--CSP 161
S LLW+ CA C C + I +FDPS SST ++C + CR PS C
Sbjct: 123 SSLLWIQCAPCKSCSQQ----IIGPMFDPSISSTYDSLSCKNIICRYA-----PSGECDS 173
Query: 162 GVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTD 221
+C Y TY +G + G + QL S + +N +V+FGC +R G+ D
Sbjct: 174 SSQCVYNQTYVEGLPSVGVIATE--QLIFGSSDEGRNAVN-NVLFGCSHRN----GNYKD 226
Query: 222 AAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL----DVVKGGGIFAIGDVVSPKVKTT 277
G+ G G +S+++Q+ + +F++C+ D + + V+ + +T
Sbjct: 227 RRFTGVFGLGSGITSVVNQMGS------KFSYCIGNIADPDYSYNQLVLSEGVNMEGYST 280
Query: 278 PMVPNMPHYNVILEEVEVGGNPLDL-PTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLS 336
P+ HY VILE + VG L + P++ T +R IIDSGT +L Y +
Sbjct: 281 PLDVVDGHYQVILEGISVGETRLVIDPSAFKRTEKQRRVIIDSGTAPTWLAENEYRALER 340
Query: 337 QFR 339
+ R
Sbjct: 341 EVR 343
>gi|224286159|gb|ACN40790.1| unknown [Picea sitchensis]
Length = 452
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 82/276 (29%), Positives = 119/276 (43%), Gaps = 34/276 (12%)
Query: 78 SATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSST 137
S +G Y +V GTP Y +DTGSD+ W+ C C C + + +FDP+KSS+
Sbjct: 110 SGSGEYIIQVDFGTPKQSMYTLIDTGSDVAWIPCKQCQGCHSTA------PIFDPAKSSS 163
Query: 138 SGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQL-NQASGNLK 196
AC C+ N C +C++ V YGDG+ G D I L +Q N
Sbjct: 164 YKPFACDSQPCQEISGN----CGGNSKCQFEVLYGDGTQVDGTLASDAITLGSQYLPNFS 219
Query: 197 TAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLD 256
FGC S D SS G + ++L F++CL
Sbjct: 220 ---------FGCAESLSEDTYSSPGLMGLGGGSLSLLTQAPTAELFGG-----TFSYCLP 265
Query: 257 VVKGGGIFAI----GDVVSPKVKTTPMV--PNMP-HYNVILEEVEVGGNPLDLPTSLLGT 309
+ V S +K T ++ P+ P Y V L+ + VG + +P + + +
Sbjct: 266 SSSTSSGSLVLGKEAAVSSSSLKFTTLIKDPSFPTFYFVTLKAISVGNTRISVPATNIAS 325
Query: 310 GDERGTIIDSGTTLAYLPPMLYDLVLSQFRFWIASL 345
G GTIIDSGTT+ YL P Y + FR ++SL
Sbjct: 326 GG--GTIIDSGTTITYLVPSAYKDLRDAFRQQLSSL 359
>gi|356495752|ref|XP_003516737.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 396
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 92/307 (29%), Positives = 131/307 (42%), Gaps = 40/307 (13%)
Query: 52 LKQHDTRRHGRMMASIDLE---LGGNGH----PSATGLYFTKVGLGTPTDEYYVQVDTGS 104
L +H++ + S +L LG NG S G Y K+ LGTP + Y VDTGS
Sbjct: 12 LIRHNSPNYSPFYKSDELHMHRLGSNGVFTRVTSNNGDYLMKLTLGTPPVDVYGLVDTGS 71
Query: 105 DLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVR 164
DL+W C C C + K +F+P +S+T I C C + + + SCSP
Sbjct: 72 DLVWAQCTPCQGCYRQ-----KSPMFEPLRSNTYTPIPCDSEECNSLFGH---SCSPQKL 123
Query: 165 CEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAV 224
C Y Y D S T G R+ + + G ++FGCG+ SG + +
Sbjct: 124 CAYSYAYADSSVTKGVLARETVTFSSTDGEPVVV---GDIVFGCGHSNSGTFNENDMGII 180
Query: 225 DGILGFGQANSSLLSQLAAAGNV--RKEFAHCLDVVKGG----GIFAIG---DVVSPKVK 275
SL+SQ GN+ K F+ CL G + G DV V
Sbjct: 181 GLG----GGPLSLVSQF---GNLYGSKRFSQCLVPFHADPHTLGTISFGDASDVSGEGVA 233
Query: 276 TTPMVPN--MPHYNVILEEVEVGGNPLDLPTS-LLGTGDERGTIIDSGTTLAYLPPMLYD 332
TP+V Y V LE + VG + +S +L G+ +IDSGT YLP YD
Sbjct: 234 ATPLVSEEGQTPYLVTLEGISVGDTFVSFNSSEMLSKGN---IMIDSGTPATYLPQEFYD 290
Query: 333 LVLSQFR 339
++ + +
Sbjct: 291 RLVKELK 297
>gi|413952718|gb|AFW85367.1| hypothetical protein ZEAMMB73_231535 [Zea mays]
Length = 443
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 89/279 (31%), Positives = 125/279 (44%), Gaps = 45/279 (16%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSG 139
T Y ++ +GTP + +DTGSDL+W CA C C L + DP+ SST
Sbjct: 81 TNEYLVRLAVGTPRRPVALTLDTGSDLVWTQCAPCRDC-----FDQDLPVLDPAASSTYA 135
Query: 140 EIACSDNFCRTTYNNRYPSCSPGVR-------CEYVVTYGDGSSTSGYFVRDIIQLNQAS 192
+ C CR P S GVR C Y YGD S T G D +
Sbjct: 136 ALPCGAARCRA-----LPFTSCGVRTLGNHRSCIYAYHYGDKSLTVGEIATDRFTFGDSG 190
Query: 193 GNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFA 252
G+ ++ + FGCG+ G S+ GI GFG+ SL SQL NV F+
Sbjct: 191 GSGESL-HTRRLTFGCGHLNKGVFQSNE----TGIAGFGRGRWSLPSQL----NV-TSFS 240
Query: 253 HCLD---------VVKGGGIFAI-GDVVSPKVKTTPMV--PNMPH-YNVILEEVEVGGNP 299
+C V GG A+ S +V+TTP++ P+ P Y + L+ + VG
Sbjct: 241 YCFTSMFESKSSLVTLGGSPAALYSHAHSGEVRTTPILKNPSQPSLYFLSLKGISVGKTR 300
Query: 300 LDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
L +P + R TIIDSG ++ LP +Y+ V ++F
Sbjct: 301 LPVPETKF-----RSTIIDSGASITTLPEEVYEAVKAEF 334
>gi|356508918|ref|XP_003523200.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 97/320 (30%), Positives = 147/320 (45%), Gaps = 51/320 (15%)
Query: 35 VENKFKAGGERERTLSA--LKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTP 92
V++ K G R + L+A L ++ A I GNG Y ++ +GTP
Sbjct: 67 VQHGIKRGKSRLQRLNAMVLAASTLDSEDQLEAPIH---AGNGE------YLMELAIGTP 117
Query: 93 TDEYYVQVDTGSDLLWVNCAGCSRC---PTKSDLGIKLTLFDPSKSSTSGEIACSDNFCR 149
Y +DTGSDL+W C C++C PT +FDP KSS+ +++C + C
Sbjct: 118 PVSYPAVLDTGSDLIWTQCKPCTQCYKQPTP--------IFDPKKSSSFSKVSCGSSLCS 169
Query: 150 TTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCG 209
++ +CS G CEYV +YGD S T G + ++ + ++ FGCG
Sbjct: 170 AVPSS---TCSDG--CEYVYSYGDYSMTQGVLATETFTFGKSKNKVSV----HNIGFGCG 220
Query: 210 NRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL---DVVKGGGIF-- 264
GD G++G G+ SL+SQL F++CL D K +
Sbjct: 221 EDNEGD----GFEQASGLVGLGRGPLSLVSQLK-----EPRFSYCLTPMDDTKESILLLG 271
Query: 265 AIGDVVSPK-VKTTPMVPN--MP-HYNVILEEVEVGGNPLDLPTSLLGTGDE--RGTIID 318
++G V K V TTP++ N P Y + LE + VG L + S GD+ G IID
Sbjct: 272 SLGKVKDAKEVVTTPLLKNPLQPSFYYLSLEGISVGDTRLSIEKSTFEVGDDGNGGVIID 331
Query: 319 SGTTLAYLPPMLYDLVLSQF 338
SGTT+ Y+ ++ + +F
Sbjct: 332 SGTTITYIEQKAFEALKKEF 351
>gi|108707838|gb|ABF95633.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 391
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 93/286 (32%), Positives = 125/286 (43%), Gaps = 39/286 (13%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPS 133
NG P T Y + +GTP + +DTGSDL+W C C C L FDPS
Sbjct: 28 NGVP--TTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPAC-----FDQALPYFDPS 80
Query: 134 KSSTSGEIACSDNFCR--TTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQA 191
SST +C C+ + P P C Y +YGD S T+G+ D A
Sbjct: 81 TSSTLSLTSCDSTLCQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGA 140
Query: 192 SGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEF 251
++ V FGCG +G S+ GI GFG+ SL SQL GN F
Sbjct: 141 GASVP------GVAFGCGLFNNGVFKSNE----TGIAGFGRGPLSLPSQL-KVGN----F 185
Query: 252 AHCLDVVKGGGIFAI-----GDVVSP---KVKTTPMV------PNMPHYNVILEEVEVGG 297
+HC + G + D+ S V+TTP++ N Y + L+ + VG
Sbjct: 186 SHCFTTITGAIPSTVLLDLPADLFSNGQGAVQTTPLIQYAKNEANPTLYYLSLKGITVGS 245
Query: 298 NPLDLPTSLLG-TGDERGTIIDSGTTLAYLPPMLYDLVLSQFRFWI 342
L +P S T GTIIDSGT++ LPP +Y +V +F I
Sbjct: 246 TRLPVPESAFALTNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQI 291
>gi|222624819|gb|EEE58951.1| hypothetical protein OsJ_10630 [Oryza sativa Japonica Group]
Length = 440
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 92/276 (33%), Positives = 128/276 (46%), Gaps = 49/276 (17%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y + +GTP + +DTGS L+W C C+ C +S L +D S+SST +
Sbjct: 91 YLLHLAIGTPPQPVQLTLDTGSVLVWTQCQPCAVCFNQS-----LPYYDASRSSTFALPS 145
Query: 143 CSDNFCRTTYNNRYPSCSPGVR-----CEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKT 197
C C+ PS + V C Y +YGD S+T G+ D+ ++ +G +
Sbjct: 146 CDSTQCKLD-----PSVTMCVNQTVQTCAYSYSYGDKSATIGFL--DVETVSFVAG--AS 196
Query: 198 APLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDV 257
P V+FGCG +G S+ GI GFG+ SL SQL GN F+HC
Sbjct: 197 VP---GVVFGCGLNNTGIFRSNE----TGIAGFGRGPLSLPSQL-KVGN----FSHCFTA 244
Query: 258 VKGGGIFAI-----GDVVSP---KVKTTPMVPNMPH---YNVILEEVEVGGNPLDLPTSL 306
V G + D+ V+TTP++ N H Y + L+ + VG L +P S
Sbjct: 245 VSGRKPSTVLFDLPADLYKNGRGTVQTTPLIKNPAHPTFYYLSLKGITVGSTRLPVPESA 304
Query: 307 L----GTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
GTG GTIIDSGT LPP +Y LV +F
Sbjct: 305 FALKNGTG---GTIIDSGTAFTSLPPRVYRLVHDEF 337
>gi|326530426|dbj|BAJ97639.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 479
Score = 101 bits (251), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 82/275 (29%), Positives = 123/275 (44%), Gaps = 30/275 (10%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPS 133
+G +G YF +VG+G+P E Y+ VD+GSD++W+ C C+ C ++D LFDP+
Sbjct: 124 SGISEGSGEYFVRVGVGSPPTEQYLVVDSGSDVIWIQCRPCAECYQQAD-----PLFDPA 178
Query: 134 KSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASG 193
S++ + C CRT C+ C Y V+YGDGS T G + + ++
Sbjct: 179 ASASFTAVPCDSGVCRTLPGGS-SGCADSGACRYQVSYGDGSYTQGVLAMETLTFGDST- 236
Query: 194 NLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAH 253
P+ V GCG+R G G+LG G SL+ QL F++
Sbjct: 237 -----PVQ-GVAIGCGHRNRGLF-----VGAAGLLGLGWGPMSLVGQLGG--AAGGAFSY 283
Query: 254 CL-----DVVKGGGIFAIGDVVSPKVKTTPMVPNMPH---YNVILEEVEVGGNPLDLPTS 305
CL D G +F D + P++ N Y V L + VGG L L
Sbjct: 284 CLASRGADAGAGSLVFGRDDAMPVGAVWVPLLRNAQQPSFYYVGLTGLGVGGERLPLQDG 343
Query: 306 LLGTGDE--RGTIIDSGTTLAYLPPMLYDLVLSQF 338
L ++ G ++D+GT + LPP Y + F
Sbjct: 344 LFDLTEDGGGGVVMDTGTAVTRLPPDAYAALRDAF 378
>gi|302781668|ref|XP_002972608.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
gi|300160075|gb|EFJ26694.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
Length = 430
Score = 101 bits (251), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 96/289 (33%), Positives = 127/289 (43%), Gaps = 53/289 (18%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC----AGCSRCPTKSDLGIKLTLFDPSKSS 136
G Y + GTP E + DTGSDL+W+ C A + CP K+ + F SKS+
Sbjct: 52 GQYLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKA--CSRRPAFVASKSA 109
Query: 137 TSGEIACSDNFCRTTYNNR--YPSCSPG--VRCEYVVTYGDGSSTSGYFVRDIIQL-NQA 191
T + CS C R PSCSP V C Y Y DGSST+G+ RD + N
Sbjct: 110 TLSVVPCSAAQCLLVPAPRGHGPSCSPAAPVPCGYAYDYADGSSTTGFLARDTATISNGT 169
Query: 192 SGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEF 251
SG V FGCG R G S T G++G GQ S +Q + + F
Sbjct: 170 SGGAAV----RGVAFGCGTRNQGGSFSGT----GGVIGLGQGQLSFPAQ--SGSLFAQTF 219
Query: 252 AHCLDVVKGG-----------------GIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVE 294
++CL ++GG FA +VS P+ P Y V + +
Sbjct: 220 SYCLLDLEGGRRGRSSSFLFLGRPERRAAFAYTPLVS-----NPLAPTF--YYVGVVAIR 272
Query: 295 VGGNPLDLPTS-----LLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
VG L +P S +LG G GT+IDSG+TL YL Y ++S F
Sbjct: 273 VGNRVLPVPGSEWAIDVLGNG---GTVIDSGSTLTYLRLGAYLHLVSAF 318
>gi|356532672|ref|XP_003534895.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 449
Score = 101 bits (251), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 90/302 (29%), Positives = 134/302 (44%), Gaps = 45/302 (14%)
Query: 54 QHDTRRHGRMMASIDLELGGNGH------PSATG-LYFTKVGLGTPTDEYYVQVDTGSDL 106
QH R + A I+ L N PS TG + +G P V +DTGSD+
Sbjct: 65 QHSAARFAYIQARIEGSLVSNNEYKARVSPSLTGRTIMANISIGQPPIPQLVVMDTGSDI 124
Query: 107 LWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCE 166
LWV C C+ C + LG+ LFDPS SST + C+T + + CS RC+
Sbjct: 125 LWVMCTPCTNC--DNHLGL---LFDPSMSSTFSPL------CKTPCD--FKGCS---RCD 168
Query: 167 ---YVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAA 223
+ VTY D S+ SG F RD + P V+FGCG+ ++G TD
Sbjct: 169 PIPFTVTYADNSTASGMFGRDTVVFETTDEGTSRIP---DVLFGCGH----NIGQDTDPG 221
Query: 224 VDGILGFGQANSSLLSQLAAAGNVRKEFAHCL----DVVKGGGIFAIGDVVSPKVKTTPM 279
+GILG SL A + ++F++C+ D +G+ + +TP
Sbjct: 222 HNGILGLNNGPDSL------ATKIGQKFSYCIGDLADPYYNYHQLILGEGADLEGYSTPF 275
Query: 280 VPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDER--GTIIDSGTTLAYLPPMLYDLVLSQ 337
+ Y V +E + VG LD+ R G IID+G+T+ +L ++ L+ +
Sbjct: 276 EVHNGFYYVTMEGISVGEKRLDIAPETFEMKKNRTGGVIIDTGSTITFLVDSVHRLLSKE 335
Query: 338 FR 339
R
Sbjct: 336 VR 337
>gi|115484503|ref|NP_001065913.1| Os11g0183900 [Oryza sativa Japonica Group]
gi|62954888|gb|AAY23257.1| nucellin-like aspartic protease [Oryza sativa Japonica Group]
gi|77549017|gb|ABA91814.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113644617|dbj|BAF27758.1| Os11g0183900 [Oryza sativa Japonica Group]
gi|222615638|gb|EEE51770.1| hypothetical protein OsJ_33210 [Oryza sativa Japonica Group]
Length = 418
Score = 100 bits (250), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 67/211 (31%), Positives = 103/211 (48%), Gaps = 20/211 (9%)
Query: 71 LGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC-AGCSRCPTKSDLGIKLTL 129
L G+ +P TG Y+ + +G P Y++ VDTGSDL W+ C A C C + L
Sbjct: 47 LSGDVYP--TGHYYVTMNIGDPAKPYFLDVDTGSDLTWLQCDAPCQSCNK-----VPHPL 99
Query: 130 FDPSKSSTSGEIACSDNFCRTTYNNRYPS--CSPGVRCEYVVTYGDGSSTSGYFVRDIIQ 187
+ P+K+ + C+++ C ++ P+ C+ +C+Y + Y D +S+ G V D
Sbjct: 100 YRPTKNKL---VPCANSICTALHSGSSPNKKCTTQQQCDYQIKYTDKASSLGVLVMDSFS 156
Query: 188 LN-QASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGN 246
L + N++ S+ FGCG Q + A DG+LG G+ + SLLSQL G
Sbjct: 157 LPLRNKSNVRP-----SLSFGCGYDQQVGKNGAAPATTDGLLGLGRGSVSLLSQLKQQGI 211
Query: 247 VRKEFAHCLDVVKGGGIFAIGDVVSPKVKTT 277
+ HCL GGG GD + P + T
Sbjct: 212 TKNVLGHCLS-TSGGGFLFFGDDMVPTSRVT 241
>gi|242041115|ref|XP_002467952.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
gi|241921806|gb|EER94950.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
Length = 774
Score = 100 bits (250), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 95/298 (31%), Positives = 127/298 (42%), Gaps = 52/298 (17%)
Query: 65 ASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLG 124
A +D NG P Y + +GTP + +DTGSDL+W C C C +++
Sbjct: 399 ARVDPGPYANGVPDTE--YLVHLAIGTPPQPVQLILDTGSDLVWTQCRPCPVCFSRA--- 453
Query: 125 IKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSP----GVRCEYVVTYGDGSSTSGY 180
L DPS SST + CS C N + SC C YV Y DGS T+G+
Sbjct: 454 --LGPLDPSNSSTFDVLPCSSPVCD---NLTWSSCGKHNWGNQTCVYVYAYADGSITTGH 508
Query: 181 FVRDIIQLNQASGN-LKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLS 239
+ A G T P + FGCG +G S+ GI GFG+ SL S
Sbjct: 509 LDAETFTFAAADGTGQATVP---DLAFGCGLFNNGIFTSNE----TGIAGFGRGALSLPS 561
Query: 240 QLAAAGNVRKEFAHCLDVVKG-----------GGIFAIGDVVSPKVKTTPMVPN---MPH 285
QL F+HC + G +++ D V++TP+V N +
Sbjct: 562 QLKV-----DNFSHCFTAITGSEPSSVLLGLPANLYSDADGA---VQSTPLVQNFSSLRA 613
Query: 286 YNVILEEVEVGGNPLDLPTSLL-----GTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
Y + L+ + VG L +P S GTG GTIIDSGT + LP Y LV F
Sbjct: 614 YYLSLKGITVGSTRLPIPESTFALKQDGTG---GTIIDSGTGMTTLPQDAYKLVHDAF 668
>gi|125543638|gb|EAY89777.1| hypothetical protein OsI_11319 [Oryza sativa Indica Group]
Length = 390
Score = 100 bits (250), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 93/291 (31%), Positives = 126/291 (43%), Gaps = 50/291 (17%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPS 133
NG P T Y + +GTP + +DTGSDL+W C C C L FD S
Sbjct: 28 NGVP--TTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCKPCVSC-----FDQPLPYFDTS 80
Query: 134 KSSTSGEIACSDNFCRTTYNNRYPSCSPGVR-------CEYVVTYGDGSSTSGYFVRDII 186
+SST+ + C C+ P+ + V+ C Y +YGD S T G D
Sbjct: 81 RSSTNALLPCESTQCKLD-----PTVTVCVKLNQTVQTCAYYTSYGDNSVTIGLLAADKF 135
Query: 187 QLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGN 246
A +L V FGCG +G S+ GI GFG+ SL SQL GN
Sbjct: 136 TF-VAGTSLP------GVTFGCGLNNTGVFNSNE----TGIAGFGRGPLSLPSQL-KVGN 183
Query: 247 VRKEFAHCLDVVKGGGIFAI-----GDVVSP---KVKTTPMV------PNMPHYNVILEE 292
F+HC + G + D+ S V+TTP++ N Y + L+
Sbjct: 184 ----FSHCFTTITGAIPSTVLLDLPADLFSNGQGAVQTTPLIQYAKNEANPTLYYLSLKG 239
Query: 293 VEVGGNPLDLPTSLLG-TGDERGTIIDSGTTLAYLPPMLYDLVLSQFRFWI 342
+ VG L +P S T GTIIDSGT++ LPP +Y +V +F I
Sbjct: 240 ITVGSTRLPVPESAFALTNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQI 290
>gi|115476828|ref|NP_001062010.1| Os08g0469000 [Oryza sativa Japonica Group]
gi|42407407|dbj|BAD09565.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
gi|113623979|dbj|BAF23924.1| Os08g0469000 [Oryza sativa Japonica Group]
gi|125603713|gb|EAZ43038.1| hypothetical protein OsJ_27627 [Oryza sativa Japonica Group]
Length = 448
Score = 100 bits (250), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 83/274 (30%), Positives = 123/274 (44%), Gaps = 38/274 (13%)
Query: 78 SATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSST 137
++ G Y + +GTP Y VDTGSDL+W CA C C + F P++S+T
Sbjct: 87 ASQGEYLMDLAIGTPPLRYTAMVDTGSDLIWTQCAPCVLCADQPT-----PYFRPARSAT 141
Query: 138 SGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKT 197
+ C C YP+C C Y YGD +ST+G + A+ +
Sbjct: 142 YRLVPCRSPLCAAL---PYPACFQRSVCVYQYYYGDEASTAGVLASETFTFGAAN---SS 195
Query: 198 APLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDV 257
+ S V FGCGN SG L +S+ G++G G+ SL+SQL + F++CL
Sbjct: 196 KVMVSDVAFGCGNINSGQLANSS-----GMVGLGRGPLSLVSQLGPS-----RFSYCLTS 245
Query: 258 VKGG-------GIFAI-----GDVVSPKVKTTPMVPN--MPH-YNVILEEVEVGGNPLDL 302
G+FA V++TP+V N +P Y + L+ + +G L +
Sbjct: 246 FLSPEPSRLNFGVFATLNGTNASSSGSPVQSTPLVVNAALPSLYFMSLKGISLGQKRLPI 305
Query: 303 PTSLLGTGDE--RGTIIDSGTTLAYLPPMLYDLV 334
+ D+ G IDSGT+L +L YD V
Sbjct: 306 DPLVFAINDDGTGGVFIDSGTSLTWLQQDAYDAV 339
>gi|125561848|gb|EAZ07296.1| hypothetical protein OsI_29544 [Oryza sativa Indica Group]
Length = 448
Score = 100 bits (250), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 83/274 (30%), Positives = 123/274 (44%), Gaps = 38/274 (13%)
Query: 78 SATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSST 137
++ G Y + +GTP Y VDTGSDL+W CA C C + F P++S+T
Sbjct: 87 ASQGEYLMDLAIGTPPLRYTAMVDTGSDLIWTQCAPCVLCADQPT-----PYFRPARSAT 141
Query: 138 SGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKT 197
+ C C YP+C C Y YGD +ST+G + A+ +
Sbjct: 142 YRLVPCRSPLCAAL---PYPACFQRSVCVYQYYYGDEASTAGVLASETFTFGAAN---SS 195
Query: 198 APLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDV 257
+ S V FGCGN SG L +S+ G++G G+ SL+SQL + F++CL
Sbjct: 196 KVMVSDVAFGCGNINSGQLANSS-----GMVGLGRGPLSLVSQLGPS-----RFSYCLTS 245
Query: 258 VKGG-------GIFAI-----GDVVSPKVKTTPMVPN--MPH-YNVILEEVEVGGNPLDL 302
G+FA V++TP+V N +P Y + L+ + +G L +
Sbjct: 246 FLSPEPSRLNFGVFATLNGTNASSSGSPVQSTPLVVNAALPSLYFMSLKGISLGQKRLPI 305
Query: 303 PTSLLGTGDE--RGTIIDSGTTLAYLPPMLYDLV 334
+ D+ G IDSGT+L +L YD V
Sbjct: 306 DPLVFAINDDGTGGVFIDSGTSLTWLQQDAYDAV 339
>gi|108707835|gb|ABF95630.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
Group]
Length = 384
Score = 100 bits (250), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 92/276 (33%), Positives = 128/276 (46%), Gaps = 49/276 (17%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y + +GTP + +DTGS L+W C C+ C +S L +D S+SST +
Sbjct: 35 YLLHLAIGTPPQPVQLTLDTGSVLVWTQCQPCAVCFNQS-----LPYYDASRSSTFALPS 89
Query: 143 CSDNFCRTTYNNRYPSCSPGVR-----CEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKT 197
C C+ PS + V C Y +YGD S+T G+ D+ ++ +G +
Sbjct: 90 CDSTQCKLD-----PSVTMCVNQTVQTCAYSYSYGDKSATIGFL--DVETVSFVAG--AS 140
Query: 198 APLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDV 257
P V+FGCG +G S+ GI GFG+ SL SQL GN F+HC
Sbjct: 141 VP---GVVFGCGLNNTGIFRSNE----TGIAGFGRGPLSLPSQL-KVGN----FSHCFTA 188
Query: 258 VKGGGIFAI-----GDVVSP---KVKTTPMVPNMPH---YNVILEEVEVGGNPLDLPTSL 306
V G + D+ V+TTP++ N H Y + L+ + VG L +P S
Sbjct: 189 VSGRKPSTVLFDLPADLYKNGRGTVQTTPLIKNPAHPTFYYLSLKGITVGSTRLPVPESA 248
Query: 307 L----GTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
GTG GTIIDSGT LPP +Y LV +F
Sbjct: 249 FALKNGTG---GTIIDSGTAFTSLPPRVYRLVHDEF 281
>gi|148910443|gb|ABR18297.1| unknown [Picea sitchensis]
Length = 452
Score = 100 bits (249), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 93/342 (27%), Positives = 146/342 (42%), Gaps = 42/342 (12%)
Query: 12 VTVAVVHQWAVGGGGVMGNFVFEVENKFKAGGERERTLSALKQHDTRRHGRMMASIDLEL 71
V+ ++H ++ N +E K G+ R L LK+ T R + A+ ++ +
Sbjct: 52 VSFPLIHIYSECSPFRPPNRTWESLMSEKIRGDANR-LRFLKR--TSRSSKQDANANVPV 108
Query: 72 GGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFD 131
S +G Y +V GTP Y +DTGSD+ W+ C C C + + +FD
Sbjct: 109 R-----SGSGEYIIQVDFGTPKQSMYTLIDTGSDVAWIPCKQCQGCHSTA------PIFD 157
Query: 132 PSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQL-NQ 190
P+KSS+ AC C+ N C +C++ V+YGDG+ G D I L +Q
Sbjct: 158 PAKSSSYKPFACDSQPCQEISGN----CGGNSKCQFEVSYGDGTQVDGTLASDAITLGSQ 213
Query: 191 ASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKE 250
N FGC S D S G + ++L
Sbjct: 214 YLPNFS---------FGCAESLSEDTSPSPGLMGLGGGSLSLLTQAPTAELFGG-----T 259
Query: 251 FAHCLDVVKGGGIFAI----GDVVSPKVKTTPMV--PNMP-HYNVILEEVEVGGNPLDLP 303
F++CL + V S +K T ++ P++P Y V L+ + VG + +P
Sbjct: 260 FSYCLPSSSTSSGSLVLGKEAAVSSSSLKFTTLIKDPSIPTFYFVTLKAISVGNTRISVP 319
Query: 304 TSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFRFWIASL 345
+ + +G GTIIDSGTT+ +L P Y + FR ++SL
Sbjct: 320 GTNIASGG--GTIIDSGTTITHLVPSAYTALRDAFRQQLSSL 359
>gi|302780575|ref|XP_002972062.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
gi|300160361|gb|EFJ26979.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
Length = 429
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 95/289 (32%), Positives = 127/289 (43%), Gaps = 53/289 (18%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC----AGCSRCPTKSDLGIKLTLFDPSKSS 136
G Y + GTP E + DTGSDL+W+ C A + CP K+ + F SKS+
Sbjct: 51 GQYLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKA--CSRRPAFVASKSA 108
Query: 137 TSGEIACSDNFCRTTYNNR--YPSCSPG--VRCEYVVTYGDGSSTSGYFVRDIIQL-NQA 191
T + CS C R P+CSP V C Y Y DGSST+G+ RD + N
Sbjct: 109 TLSVVPCSAAQCLLVPAPRGHGPACSPAAPVPCGYAYDYADGSSTTGFLARDTATISNGT 168
Query: 192 SGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEF 251
SG V FGCG R G S T G++G GQ S +Q + + F
Sbjct: 169 SGGAAV----RGVAFGCGTRNQGGSFSGT----GGVIGLGQGQLSFPAQ--SGSLFAQTF 218
Query: 252 AHCLDVVKGG-----------------GIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVE 294
++CL ++GG FA +VS P+ P Y V + +
Sbjct: 219 SYCLLDLEGGRRGRSSSFLFLGRPERRAAFAYTPLVS-----NPLAPTF--YYVGVVAIR 271
Query: 295 VGGNPLDLPTS-----LLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
VG L +P S +LG G GT+IDSG+TL YL Y ++S F
Sbjct: 272 VGNRVLPVPGSEWAIDVLGNG---GTVIDSGSTLTYLRLGAYLHLVSAF 317
>gi|116790042|gb|ABK25480.1| unknown [Picea sitchensis]
Length = 460
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 104/311 (33%), Positives = 144/311 (46%), Gaps = 55/311 (17%)
Query: 46 ERTLSALKQHDTRRHGRMMASIDLELGGNGHP--SATGLYFTKVGLGTPTDEYYVQVDTG 103
ER A+K+ R ++ S+D E+ P + G + K+ +GTP+ + +DTG
Sbjct: 78 ERFKRAIKRSQDRLE-KLQMSVD-EVKAVEAPVYAGNGEFLMKMAIGTPSLSFSAILDTG 135
Query: 104 SDLLWVNCAGCSRC---PTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCS 160
SDL W C C+ C PT ++DPS+SST ++ CS + C+ SCS
Sbjct: 136 SDLTWTQCKPCTDCYPQPTP--------IYDPSQSSTYSKVPCSSSMCQAL---PMYSCS 184
Query: 161 PGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSST 220
G CEY+ +YGD SST G + L S + FGCG G S
Sbjct: 185 -GANCEYLYSYGDQSSTQGILSYESFTLTSQSL--------PHIAFGCGQENEGGGFSQG 235
Query: 221 DAAVDGILGFGQANSSLLSQLAAA-GNVRKEFAHCLDVV-----KGGGIFAIGDVVSPKV 274
V G+ SL+SQL + GN +F++CL + K +F IG S
Sbjct: 236 GGLVGF----GRGPLSLISQLGQSLGN---KFSYCLVSITDSPSKTSPLF-IGKTASLNA 287
Query: 275 KT---TPMVPNMPH---YNVILEEVEVGGNPLDLP-----TSLLGTGDERGTIIDSGTTL 323
KT TP+V + Y + LE + VGG LD+ L GTG G IIDSGTT+
Sbjct: 288 KTVSSTPLVQSRSRPTFYYLSLEGISVGGQLLDIADGTFDLQLDGTG---GVIIDSGTTV 344
Query: 324 AYLPPMLYDLV 334
YL YD+V
Sbjct: 345 TYLEQSGYDVV 355
>gi|312281631|dbj|BAJ33681.1| unnamed protein product [Thellungiella halophila]
Length = 502
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 85/279 (30%), Positives = 126/279 (45%), Gaps = 46/279 (16%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPS 133
+G +G YF+++G+GTP E YV +DTGSD+ W+ C CS C +SD +FDP+
Sbjct: 155 SGTSQGSGEYFSRIGVGTPAKEMYVVLDTGSDVNWIQCLPCSECYQQSD-----PIFDPT 209
Query: 134 KSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASG 193
SST + CSD C + +C +C Y V+YGDGS T G + D + + SG
Sbjct: 210 SSSTFKSLTCSDPKCASL---DVSACRSN-KCLYQVSYGDGSFTVGNYATDTVTFGE-SG 264
Query: 194 NLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAH 253
+ + V GCG+ G + G S+ +Q+ A K F++
Sbjct: 265 KV------NDVALGCGHDNEGLFTGAAGLLGL-----GGGALSMTNQIKA-----KSFSY 308
Query: 254 CL---DVVKGGGI------FAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPT 304
CL D K + GD +P ++ + M Y V L VGG + +P+
Sbjct: 309 CLVDRDSAKSSSLDFNSVQIGAGDATAPLLRNSKM---DTFYYVGLSGFSVGGQQVSIPS 365
Query: 305 SLL-----GTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
SL G G G I+D GT + L Y+ + F
Sbjct: 366 SLFEVDASGAG---GVILDCGTAVTRLQTQAYNSLRDAF 401
>gi|145693994|gb|ABP93697.1| unknown protein isoform 2 [Lemna minor]
Length = 351
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 93/268 (34%), Positives = 130/268 (48%), Gaps = 37/268 (13%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCS-RCPTKSDLGIKLTLFDPSKSSTS 138
+G Y VG GTPT V DTGSD+ W+ C C+ RC + + LFDPS SST
Sbjct: 13 SGNYVITVGFGTPTRTQTVVFDTGSDVNWLQCKPCAVRCYAQQE-----PLFDPSLSSTY 67
Query: 139 GEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTA 198
++C++ C + R CS C Y V YGDGSST G+ D L A K
Sbjct: 68 RNVSCTEPAC-VGLSTR--GCSSST-CLYGVFYGDGSSTIGFLAMDTFMLTPAQ-KFK-- 120
Query: 199 PLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANS-SLLSQLAAA-GNVRKEFAHCLD 256
+ IFGCG +G + G++G G++++ SL SQ+A + GNV F++CL
Sbjct: 121 ----NFIFGCGQNNTGLFQGTA-----GLVGLGRSSTYSLNSQVAPSLGNV---FSYCLP 168
Query: 257 VVKGG-GIFAIGDVVS----PKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGD 311
G IG+ + + T VP + Y + L + VGG L L +++
Sbjct: 169 STSSATGYLNIGNPQNTPGYTAMLTDTRVPTL--YFIDLIGISVGGTRLSLSSTVF---Q 223
Query: 312 ERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
GTIIDSGT + LPP Y + + R
Sbjct: 224 SVGTIIDSGTVITRLPPTAYSALKTAVR 251
>gi|147811402|emb|CAN61225.1| hypothetical protein VITISV_006732 [Vitis vinifera]
Length = 440
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 93/342 (27%), Positives = 142/342 (41%), Gaps = 30/342 (8%)
Query: 6 LLALVVVTVAVVHQWAVGGGGVMGNFVFEVENKFKAGGERERTLSALKQHDTRRHGRMMA 65
++AL V+VA + V G + + K E L + R A
Sbjct: 14 VIALSFVSVAHISAAEVKNGRFSIDLIHRDSPKSPLYNPSETPAERLDRFFRRFMSFSEA 73
Query: 66 SIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGI 125
SI S G Y K+ +GTP + Y DTGSDL+W C C C +
Sbjct: 74 SISPNTPEPPVSSNNGEYLMKISIGTPPFDVYGIYDTGSDLMWTQCLPCLSCYKQ----- 128
Query: 126 KLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCS-PGVRCEYVVTYGDGSSTSGYFVRD 184
K +FDPSKS++ E++C CR SCS P C++ YGDGS G +
Sbjct: 129 KNPMFDPSKSTSFKEVSCESQQCRLLDTV---SCSQPQKLCDFSYGYGDGSLAQGVIATE 185
Query: 185 IIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAA 244
+ LN SG + +++FGCG+ SG + G+ G G SL SQ+ +
Sbjct: 186 TLTLNSNSGQPXSI---XNIVFGCGHNNSGTFNENE----MGLFGTGGRPLSLTSQIMST 238
Query: 245 GNVRKEFAHCLDVVKGGGIFAIGDVVSPK-------VKTTPMVP--NMPHYNVILEEVEV 295
++F+ CL + + P+ V +TP+V + +Y V L+ + V
Sbjct: 239 LGSGRKFSQCLVPFRTDPSITSKIIFGPEAEVSGSXVVSTPLVTKDDPTYYFVTLDGISV 298
Query: 296 GGN--PLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVL 335
G P + + G+ ID+GT LP Y+ ++
Sbjct: 299 GDKLFPFSSSSPMATKGN---VFIDAGTPPTLLPRDFYNRLV 337
>gi|224142005|ref|XP_002324351.1| predicted protein [Populus trichocarpa]
gi|222865785|gb|EEF02916.1| predicted protein [Populus trichocarpa]
Length = 472
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 82/265 (30%), Positives = 123/265 (46%), Gaps = 29/265 (10%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGE 140
G Y VGLGTP E+ + DTGSD+ W C C + K K +PS S++
Sbjct: 129 GDYVVTVGLGTPKKEFTLIFDTGSDITWTQCEPCVKTCYKQ----KEPRLNPSTSTSYKN 184
Query: 141 IACSDNFCRTTYNNRY--PSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTA 198
I+CS C+ + + SCS C Y V YGDGS + G+F + + L+ ++
Sbjct: 185 ISCSSALCKLVASGKKFSQSCSSST-CLYQVQYGDGSYSIGFFATETLTLSSSN------ 237
Query: 199 PLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVV 258
+ + +FGCG + +G G + G+ +L SQ A +K F++CL
Sbjct: 238 -VFKNFLFGCGQQNNGLFGGAAGLLGL-----GRTKLALPSQ--TAKTYKKLFSYCLPAS 289
Query: 259 KGG-GIFAIGDVVSPKVKTTPMVPNM---PHYNVILEEVEVGGNPLDLPTSLLGTGDERG 314
G ++G VS VK TP+ + P Y + + + VGG L + S G
Sbjct: 290 SSSKGYLSLGGQVSKSVKFTPLSADFDSTPFYGLDITGLSVGGRKLSIDESAFSA----G 345
Query: 315 TIIDSGTTLAYLPPMLYDLVLSQFR 339
T+IDSGT + L P Y + S F+
Sbjct: 346 TVIDSGTVITRLSPTAYSELSSAFQ 370
>gi|224142007|ref|XP_002324352.1| predicted protein [Populus trichocarpa]
gi|222865786|gb|EEF02917.1| predicted protein [Populus trichocarpa]
Length = 460
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 82/265 (30%), Positives = 123/265 (46%), Gaps = 29/265 (10%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGE 140
G Y VGLGTP E+ + DTGSD+ W C C + K K +PS S++
Sbjct: 117 GDYVVTVGLGTPKKEFTLIFDTGSDITWTQCEPCVKTCYKQ----KEPRLNPSTSTSYKN 172
Query: 141 IACSDNFCRTTYNNRY--PSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTA 198
I+CS C+ + + SCS C Y V YGDGS + G+F + + L+ ++
Sbjct: 173 ISCSSALCKLVASGKKFSQSCSSST-CLYQVQYGDGSYSIGFFATETLTLSSSN------ 225
Query: 199 PLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVV 258
+ + +FGCG + +G G + G+ +L SQ A +K F++CL
Sbjct: 226 -VFKNFLFGCGQQNNGLFGGAAGLLGL-----GRTKLALPSQ--TAKTYKKLFSYCLPAS 277
Query: 259 KGG-GIFAIGDVVSPKVKTTPMVPNM---PHYNVILEEVEVGGNPLDLPTSLLGTGDERG 314
G ++G VS VK TP+ + P Y + + + VGG L + S G
Sbjct: 278 SSSKGYLSLGGQVSKSVKFTPLSADFDSTPFYGLDITGLSVGGRKLSIDESAFSA----G 333
Query: 315 TIIDSGTTLAYLPPMLYDLVLSQFR 339
T+IDSGT + L P Y + S F+
Sbjct: 334 TVIDSGTVITRLSPTAYSELSSAFQ 358
>gi|18409320|ref|NP_566948.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|27754243|gb|AAO22575.1| unknown protein [Arabidopsis thaliana]
gi|332645259|gb|AEE78780.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 529
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 102/349 (29%), Positives = 152/349 (43%), Gaps = 47/349 (13%)
Query: 20 WAVGGGGVMGNFVFEVENKFKAGGERE-------------RTLSALKQHDTRRHGRMMAS 66
W + G F FEV + F ++ L Q D GR +AS
Sbjct: 18 WGLERCEASGKFSFEVHHMFSDRVKQSLGLDDLVPEKGSLEYFKVLAQRDRLIRGRGLAS 77
Query: 67 IDLE-----LGGNGHPSAT---GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCP 118
+ E + GN S L++ V +GTP + V +DTGSDL W+ C S C
Sbjct: 78 NNEETPITFMRGNRTISIDLLGFLHYANVSVGTPATWFLVALDTGSDLFWLPCNCGSTCI 137
Query: 119 TK-SDLGIK----LTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTY-G 172
++G+ L L+ P+ SSTS I CSD+ C + SP C Y + Y
Sbjct: 138 RDLKEVGLSQSRPLNLYSPNTSSTSSSIRCSDDRCFGSSRCS----SPASSCPYQIQYLS 193
Query: 173 DGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQ 232
+ T+G D++ L L+ P+ +++ GCG Q+G L SS AAV+G+LG G
Sbjct: 194 KDTFTTGTLFEDVLHLVTEDEGLE--PVKANITLGCGKNQTGFLQSS--AAVNGLLGLGL 249
Query: 233 ANSSLLSQLAAAGNVRKEFAHCL-DVVKGGGIFAIGDVVSPKVKTTPMVPNMPH--YNVI 289
+ S+ S LA A F+ C +++ G + GD TP++P P Y V
Sbjct: 250 KDYSVPSILAKAKITANSFSMCFGNIIDVVGRISFGDKGYTDQMETPLLPTEPSPTYAVS 309
Query: 290 LEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
+ EV VGG+ G + + D+GT+ +L Y L+ F
Sbjct: 310 VTEVSVGGD---------AVGVQLLALFDTGTSFTHLLEPEYGLITKAF 349
>gi|357123876|ref|XP_003563633.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 503
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 84/258 (32%), Positives = 115/258 (44%), Gaps = 38/258 (14%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGC-SRCPTKSDLGIKLTLFDPSKSSTS 138
T Y +GLGTP + V DTGSD WV C C C + D LFDP+KSST
Sbjct: 160 TANYVVPIGLGTPPSRFTVVFDTGSDTTWVQCRPCVVSCYKQKD-----RLFDPAKSSTY 214
Query: 139 GEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQ-ASGNLKT 197
++C+D C + C+ G C Y + YGDGS T G+F +D + + Q A K
Sbjct: 215 ANVSCADPACA---DLDASGCNAG-HCLYGIQYGDGSYTVGFFAKDTLAVAQDAIKGFK- 269
Query: 198 APLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDV 257
FGCG + G G + G+LG G+ +S+ Q A F++CL
Sbjct: 270 --------FGCGEKNRGLFGQTA-----GLLGLGRGPTSITVQ--AYEKYGGSFSYCLPA 314
Query: 258 VKGGGIF-----AIGDVVSPKVKTTPMVPNM--PHYNVILEEVEVGGNPL-DLPTSLLGT 309
+ KTTPM+ + Y V L + VGG L +P S+
Sbjct: 315 SSAATGYLEFGPLSPSSSGSNAKTTPMLTDKGPTFYYVGLTGIRVGGKQLGAIPESVF-- 372
Query: 310 GDERGTIIDSGTTLAYLP 327
GT++DSGT + LP
Sbjct: 373 -SNSGTLVDSGTVITRLP 389
>gi|224142009|ref|XP_002324353.1| predicted protein [Populus trichocarpa]
gi|222865787|gb|EEF02918.1| predicted protein [Populus trichocarpa]
Length = 412
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 82/265 (30%), Positives = 123/265 (46%), Gaps = 29/265 (10%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGE 140
G Y VGLGTP E+ + DTGSD+ W C C + K K +PS S++
Sbjct: 69 GDYVVTVGLGTPKKEFTLIFDTGSDITWTQCEPCVKTCYKQ----KEPRLNPSTSTSYKN 124
Query: 141 IACSDNFCRTTYNNRY--PSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTA 198
I+CS C+ + + SCS C Y V YGDGS + G+F + + L+ ++
Sbjct: 125 ISCSSALCKLVASGKKFSQSCSSST-CLYQVQYGDGSYSIGFFATETLTLSSSN------ 177
Query: 199 PLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVV 258
+ + +FGCG + +G G + G+ +L SQ A +K F++CL
Sbjct: 178 -VFKNFLFGCGQQNNGLFGGAAGLLGL-----GRTKLALPSQTAK--TYKKLFSYCLPAS 229
Query: 259 KGG-GIFAIGDVVSPKVKTTPMVPNM---PHYNVILEEVEVGGNPLDLPTSLLGTGDERG 314
G ++G VS VK TP+ + P Y + + + VGG L + S G
Sbjct: 230 SSSKGYLSLGGQVSKSVKFTPLSADFDSTPFYGLDITGLSVGGRQLSIDESAFSA----G 285
Query: 315 TIIDSGTTLAYLPPMLYDLVLSQFR 339
T+IDSGT + L P Y + S F+
Sbjct: 286 TVIDSGTVITRLSPTAYSELSSAFQ 310
>gi|222618833|gb|EEE54965.1| hypothetical protein OsJ_02555 [Oryza sativa Japonica Group]
Length = 393
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 81/259 (31%), Positives = 109/259 (42%), Gaps = 68/259 (26%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGC-SRCPTKSDLGIKLTLFDPSKSSTSGEI 141
Y VGLG+P V +DTGSD+ WV C C + P + G LFDP+ SST
Sbjct: 106 YVISVGLGSPAVTQRVVIDTGSDVSWVQCEPCPAPSPCHAHAG---ALFDPAASSTYAAF 162
Query: 142 ACSDNFCRTTYNN-RYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPL 200
CS C ++ C RC+Y+V YGDGS+T+G
Sbjct: 163 NCSAAACAQLGDSGEANGCDAKSRCQYIVKYGDGSNTTG--------------------- 201
Query: 201 NSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKG 260
+ FGC + +LG+ D DG++G G SL+SQ AA
Sbjct: 202 -TGFQFGCSH---AELGAGMDDKTDGLIGLGGDAQSLVSQTAAR---------------- 241
Query: 261 GGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSG 320
S KV T +Y LE++ VGG L L S+ G+++DSG
Sbjct: 242 ----------SKKVPT--------YYFAALEDIAVGGKKLGLSPSVFAA----GSLVDSG 279
Query: 321 TTLAYLPPMLYDLVLSQFR 339
T + LPP Y + S FR
Sbjct: 280 TVITRLPPAAYAALSSAFR 298
>gi|125547728|gb|EAY93550.1| hypothetical protein OsI_15341 [Oryza sativa Indica Group]
Length = 418
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 81/269 (30%), Positives = 111/269 (41%), Gaps = 59/269 (21%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y + GTP E + +DTGSD+ W C RCP + L LFDPS SS+ +
Sbjct: 88 YLVHLAAGTPPQEVQLTLDTGSDITWTQC---KRCPASACFNQTLPLFDPSASSSFASLP 144
Query: 143 CSDNFCRTTYNNRYPSCSPG-----VRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKT 197
CS C TT P C G C Y ++YGDGS + G R++ +G +
Sbjct: 145 CSSPACETT-----PPCGGGNDATSRPCNYSISYGDGSVSRGEIGREVFTFASGTGEGSS 199
Query: 198 APLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDV 257
A + ++FGCG+ G S+ GI GFG+ + SL SQL GN F+HC
Sbjct: 200 AAV-PGLVFGCGHANRGVFTSNET----GIAGFGRGSLSLPSQLKV-GN----FSHCFTT 249
Query: 258 VKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTI- 316
+ G A+ + G P P S G RG+
Sbjct: 250 ITGSKTSAV----------------------------LLGLPGVAPPSASPLGRRRGSYR 281
Query: 317 -------IDSGTTLAYLPPMLYDLVLSQF 338
+SGT++ LPP Y V +F
Sbjct: 282 CRSTPRSSNSGTSITSLPPRTYRAVREEF 310
>gi|148906646|gb|ABR16474.1| unknown [Picea sitchensis]
Length = 538
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 100/351 (28%), Positives = 158/351 (45%), Gaps = 52/351 (14%)
Query: 13 TVAVVHQWAV---GGGGVMGNFVFEVENKFKAGGERERTLSALKQHDTRRHG------RM 63
+V VVH+ ++ ++ +E + R R L + R +
Sbjct: 115 SVQVVHRDSLLVKDAANATASYERRLEETLRRDARRVRGLEQRIEKRLRLNKDPAGSHEN 174
Query: 64 MASIDLELGG---NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTK 120
+A + E GG +G +G YFT++G+GTP E Y+ +DTGSD++W+ C CS+C ++
Sbjct: 175 VAEVAAEFGGEVVSGMAQGSGEYFTRIGVGTPMREQYMVLDTGSDVVWIQCEPCSKCYSQ 234
Query: 121 SDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGY 180
D +F+PS S++ + C+ C +Y + Y +C G C Y V+YGDGS T G
Sbjct: 235 VD-----PIFNPSLSASFSTLGCNSAVC--SYLDAY-NCH-GGGCLYKVSYGDGSYTIGS 285
Query: 181 FVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQ 240
F +++ S +V GCG+ +G G+LG G S SQ
Sbjct: 286 FATEMLTFGTTSVR--------NVAIGCGHDNAGLF-----VGAAGLLGLGAGLLSFPSQ 332
Query: 241 LAAAGNVRKEFAHCL---------DVVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILE 291
L + F++CL + G +G +++P + T P +P Y V L
Sbjct: 333 LGT--QTGRAFSYCLVDRFSESSGTLEFGPESVPLGSILTP-LLTNPSLPTF--YYVPLI 387
Query: 292 EVEVGGNPLD-LPTSLL---GTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
+ VGG LD +P + T G I+DSGT + L +YD V F
Sbjct: 388 SISVGGALLDSVPPDVFRIDETSGRGGFIVDSGTAVTRLQTPVYDAVRDAF 438
>gi|242045118|ref|XP_002460430.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
gi|241923807|gb|EER96951.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
Length = 488
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 91/276 (32%), Positives = 128/276 (46%), Gaps = 22/276 (7%)
Query: 71 LGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLF 130
L G +T Y + LGTP E V++DTGSD WV C C+ C + D +F
Sbjct: 127 LANWGKSLSTTNYVASLRLGTPATELVVELDTGSDQSWVQCKPCADCYEQRD-----PVF 181
Query: 131 DPSKSSTSGEIACSDNFCR---TTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQ 187
DP+ SST + C C+ ++ ++R S C Y V+Y D S T G RD +
Sbjct: 182 DPTASSTYSAVPCGARECQELASSSSSRNCSSDNNKNCPYEVSYDDDSHTVGDLARDTLT 241
Query: 188 LNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNV 247
L+ + + A +FGCG+ +G G VDG+LG G +SL SQ+AA
Sbjct: 242 LSPSP-SPSPADTVPGFVFGCGHSNAGTFGE-----VDGLLGLGLGKASLPSQVAA--RY 293
Query: 248 RKEFAHCL-DVVKGGGIFAIGDVVS-PKVKTTPMV--PNMPHYNVILEEVEVGGNPLDLP 303
F++CL G + G + + T MV + Y + L + V G + +P
Sbjct: 294 GAAFSYCLPSSPSAAGYLSFGGAAARANAQFTEMVTGQDPTSYYLNLTGIVVAGRAIKVP 353
Query: 304 TSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
S T GTIIDSGT + LPP Y + S FR
Sbjct: 354 ASAFATA--AGTIIDSGTAFSRLPPSAYAALRSSFR 387
>gi|357118074|ref|XP_003560784.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial
[Brachypodium distachyon]
Length = 452
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 94/265 (35%), Positives = 128/265 (48%), Gaps = 41/265 (15%)
Query: 87 VGLGTPTDEYYVQVDTGSDLLWVNCAGCS-RCPTKSDLGIKLTLFDPSKSSTSGEIACSD 145
VG G+P DTGSDL W+ C CS C + D +FDP+KSS+ + C
Sbjct: 116 VGFGSPAQTSATMFDTGSDLSWIQCQPCSGHCYKQHD-----PVFDPAKSSSYAVVPCGT 170
Query: 146 NFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVI 205
C C+ G C Y V YGDGSST+G R+ + + +S + I
Sbjct: 171 TECAAAGGE----CN-GTTCVYGVEYGDGSSTTGVLARETLTFSSSSE-------FTGFI 218
Query: 206 FGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAA-GNVRKEFAHCLDVVK-GGGI 263
FGCG GD G VDG+LG G+ + SL SQ A A G + F++CL G
Sbjct: 219 FGCGETNLGDFGE-----VDGLLGLGRGSLSLSSQAAPAFGGI---FSYCLPSYNTTPGY 270
Query: 264 FAIGDVVSP-----KVKTTPMV--PNMPHYNVI-LEEVEVGGNPLDLPTSLLGTGDERGT 315
+IG +P V+ T MV P+ P + I L + +GG L +P S + GT
Sbjct: 271 LSIG--ATPVTGQIPVQYTAMVNKPDYPSFYFIELVSINIGGYVLPVPPSEF---TKTGT 325
Query: 316 IIDSGTTLAYLPPMLYDLVLSQFRF 340
++DSGT L YLPP Y + +F+F
Sbjct: 326 LLDSGTILTYLPPPAYTALRDRFKF 350
>gi|224053042|ref|XP_002297678.1| predicted protein [Populus trichocarpa]
gi|222844936|gb|EEE82483.1| predicted protein [Populus trichocarpa]
Length = 482
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 83/254 (32%), Positives = 122/254 (48%), Gaps = 38/254 (14%)
Query: 98 VQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYN---N 154
V VDTGSDL WV C C RC + D +F+PS S + + CS C++ + N
Sbjct: 148 VIVDTGSDLSWVQCQPCKRCYNQQD-----PVFNPSTSPSYRTVLCSSPTCQSLQSATGN 202
Query: 155 RYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSG 214
S C YVV YGDGS T G +L +L + ++ IFGCG G
Sbjct: 203 LGVCGSNPPSCNYVVNYGDGSYTRG-------ELGTEHLDLGNSTAVNNFIFGCGRNNQG 255
Query: 215 DLGSSTDAAVDGILGFGQANSSLLSQLAAA-GNVRKEFAHCLDV--VKGGGIFAIGDVVS 271
G ++ G++G G+++ SL+SQ +A G V F++CL + + G +G S
Sbjct: 256 LFGGAS-----GLVGLGRSSLSLISQTSAMFGGV---FSYCLPITETEASGSLVMGGNSS 307
Query: 272 PKVKTTP-----MVPN--MPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLA 324
TTP M+PN +P Y + L + VG + P+ + G +IDSGT +
Sbjct: 308 VYKNTTPISYTRMIPNPQLPFYFLNLTGITVGSVAVQAPSF-----GKDGMMIDSGTVIT 362
Query: 325 YLPPMLYDLVLSQF 338
LPP +Y + +F
Sbjct: 363 RLPPSIYQALKDEF 376
>gi|115434870|ref|NP_001042193.1| Os01g0178600 [Oryza sativa Japonica Group]
gi|55296112|dbj|BAD67831.1| putative CDR1 [Oryza sativa Japonica Group]
gi|55296252|dbj|BAD67993.1| putative CDR1 [Oryza sativa Japonica Group]
gi|113531724|dbj|BAF04107.1| Os01g0178600 [Oryza sativa Japonica Group]
gi|125569253|gb|EAZ10768.1| hypothetical protein OsJ_00604 [Oryza sativa Japonica Group]
Length = 454
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 77/268 (28%), Positives = 115/268 (42%), Gaps = 26/268 (9%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y V LG+P DTGSDL+WV C + S T FDPS+SST G ++
Sbjct: 101 YLMTVNLGSPPRSMLAIADTGSDLVWVKCKKGNN--DTSSAAAPTTQFDPSRSSTYGRVS 158
Query: 143 CSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQ-ASGNLKTAPLN 201
C + C +C G C Y+ YGDGS+T+G + + SG
Sbjct: 159 CQTDACEALGRA---TCDDGSNCAYLYAYGDGSNTTGVLSTETFTFDDGGSGRSPRQVRV 215
Query: 202 SSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL-----D 256
V FGC +G + + G SL++QL A ++ + F++CL +
Sbjct: 216 GGVKFGCSTATAGSFPADGLVGL------GGGAVSLVTQLGGATSLGRRFSYCLVPHSVN 269
Query: 257 VVKGGGIFAIGDVVSPKVKTTPMVPN--MPHYNVILEEVEVGGNPLDLPTSLLGTGDERG 314
A+ DV P +TP+V +Y V+L+ V+VG + +
Sbjct: 270 ASSALNFGALADVTEPGAASTPLVAGDVDTYYTVVLDSVKVGNK-------TVASAASSR 322
Query: 315 TIIDSGTTLAYLPPMLYDLVLSQFRFWI 342
I+DSGTTL +L P L ++ + I
Sbjct: 323 IIVDSGTTLTFLDPSLLGPIVDELSRRI 350
>gi|449462553|ref|XP_004149005.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 418
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 75/257 (29%), Positives = 121/257 (47%), Gaps = 35/257 (13%)
Query: 89 LGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFC 148
+GTP +Y DTGSDL W C C +C + +F+P KS++ + C+ C
Sbjct: 86 IGTPPVDYLGIADTGSDLTWAQCLPCLKCYQQLR-----PIFNPLKSTSFSHVPCNTQTC 140
Query: 149 RTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGC 208
+ C C+Y TYGD + + G + I + +S +K+ + GC
Sbjct: 141 HAVDDGH---CGVQGVCDYSYTYGDRTYSKGDLGFEKITIGSSS--VKS-------VIGC 188
Query: 209 GNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVV----KGGGIF 264
G+ SG G ++ G++G G SL+SQ++ + + F++CL + G F
Sbjct: 189 GHASSGGFGFAS-----GVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHANGKINF 243
Query: 265 AIGDVVS-PKVKTTPMVPN--MPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGT 321
VVS P V +TP++ + +Y + LE + +G + + IIDSGT
Sbjct: 244 GQNAVVSGPGVVSTPLISKNTVTYYYITLEAISIGNE------RHMAFAKQGNVIIDSGT 297
Query: 322 TLAYLPPMLYDLVLSQF 338
TL++LP LYD V+S
Sbjct: 298 TLSFLPKELYDGVVSSL 314
>gi|357481199|ref|XP_003610885.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512220|gb|AES93843.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 416
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 84/270 (31%), Positives = 121/270 (44%), Gaps = 25/270 (9%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGE 140
G + ++ +GTP + VDTGSDL+W+ CA C C + IK +FDP KSST
Sbjct: 66 GQHLMEIYIGTPPIKITGLVDTGSDLIWIQCAPCLGCYKQ----IK-PMFDPLKSSTYNN 120
Query: 141 IACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPL 200
I+C C CSP RC Y YGD S T G +D +G K L
Sbjct: 121 ISCDSPLCHKLDTG---VCSPEKRCNYTYGYGDNSLTKGVLAQDTATFTSNTG--KPVSL 175
Query: 201 NSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHC----LD 256
S +FGCG+ +G G++G G +SL+SQ+ +K F+ C L
Sbjct: 176 -SRFLFGCGHNNTGGFNDHE----MGLIGLGGGPTSLISQIGPLFGGKK-FSQCLVPFLT 229
Query: 257 VVKGGGIFAIG---DVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDER 313
+K + G V+ V TTP+VP + + + + P + T +
Sbjct: 230 DIKISSRMSFGKGSQVLGNGVVTTPLVPREKDTSYFVTLLGISVEDTYFPMN--STIGKA 287
Query: 314 GTIIDSGTTLAYLPPMLYDLVLSQFRFWIA 343
++DSGT LP LYD V ++ R +A
Sbjct: 288 NMLVDSGTPPILLPQQLYDKVFAEVRNKVA 317
>gi|118487542|gb|ABK95598.1| unknown [Populus trichocarpa]
Length = 471
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 86/272 (31%), Positives = 128/272 (47%), Gaps = 38/272 (13%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCS-RCPTKSDLGIKLTLFDPSKSSTS 138
+G Y+ K+GLGTP Y + +DTGS L W+ C C+ C ++D L+DPS S T
Sbjct: 122 SGNYYVKLGLGTPPKYYAMILDTGSSLSWLQCQPCAVYCHAQAD-----PLYDPSVSKTY 176
Query: 139 GEIACSDNFCR----TTYNNRYPSCSPGVR-CEYVVTYGDGSSTSGYFVRDIIQLNQASG 193
+++C+ C T N+ P C C Y +YGD S + GY +D++ L +
Sbjct: 177 KKLSCASVECSRLKAATLND--PLCETDSNACLYTASYGDTSFSIGYLSQDLLTLTSS-- 232
Query: 194 NLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAH 253
+T P +GCG G G + GI+G + S+L+QL+ F++
Sbjct: 233 --QTLP---QFTYGCGQDNQGLFGRAA-----GIIGLARDKLSMLAQLST--KYGHAFSY 280
Query: 254 CL---DVVKGGGIFAIGDVVSP-KVKTTPMV---PNMPHYNVILEEVEVGGNPLDLPTSL 306
CL + GG F +SP K TPM+ N Y + L + V G PLDL ++
Sbjct: 281 CLPTANSGSSGGGFLSIGSISPTSYKFTPMLTDSKNPSLYFLRLTAITVSGRPLDLAAAM 340
Query: 307 LGTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
T+IDSGT + LP +Y + F
Sbjct: 341 Y----RVPTLIDSGTVITRLPMSMYAALRQAF 368
>gi|224130234|ref|XP_002328687.1| predicted protein [Populus trichocarpa]
gi|222838863|gb|EEE77214.1| predicted protein [Populus trichocarpa]
Length = 603
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 78/249 (31%), Positives = 114/249 (45%), Gaps = 27/249 (10%)
Query: 92 PTDEYYVQVDTGSDLLWVNC-AGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFC-R 149
P YY+ DTGSDL W+ C A C+ C ++ + P + + + D C
Sbjct: 199 PPQPYYLDFDTGSDLTWIQCDAPCTSCAKGAN-----AWYKPRRGNI---VPPKDLLCME 250
Query: 150 TTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCG 209
N + C +C+Y + Y D SS+ G D + L A+G+L + IFGC
Sbjct: 251 VQRNQKAGYCETCDQCDYEIEYADHSSSMGVLATDKLLLMVANGSLTKL----NFIFGCA 306
Query: 210 NRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDV-VKGGGIFAIGD 268
Q G L T DGILG +A SL SQLA+ G + HCL + GGG +GD
Sbjct: 307 YDQQG-LLLKTLVKTDGILGLSRAKVSLPSQLASQGIINNVIGHCLTTDLGGGGYMFLGD 365
Query: 269 VVSPK--VKTTPMV--PNMPHYNVILEEVEVGGNPLDLPTSLLGTGDER--GTIIDSGTT 322
P+ + PM+ P+M Y+ + ++ G +PL LG + R + DSG++
Sbjct: 366 DFVPRWGMAWVPMLDSPSMEFYHTEVVKLNYGSSPLS-----LGGMESRVKHILFDSGSS 420
Query: 323 LAYLPPMLY 331
Y P Y
Sbjct: 421 YTYFPKEAY 429
>gi|357469587|ref|XP_003605078.1| Aspartic proteinase Asp1 [Medicago truncatula]
gi|355506133|gb|AES87275.1| Aspartic proteinase Asp1 [Medicago truncatula]
Length = 418
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 86/283 (30%), Positives = 131/283 (46%), Gaps = 35/283 (12%)
Query: 63 MMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSD 122
+++S+ + GN +P G+Y + +G P + Y + +DTGSDL WV C G P K
Sbjct: 44 LISSLVYTIKGNVYPD--GIYTVSINIGNPPNPYELDIDTGSDLTWVQCDG-PDAPCKGC 100
Query: 123 LGIKLTLFDPSKSSTSGEIACSDNFC---RTTYNNRYPSCS-PGVRCEYVVTYGDGSSTS 178
K L+ P+ + + CSD C + ++ C+ P C Y V Y D + ++
Sbjct: 101 TLPKDKLYKPNGNQL---VKCSDPICAAVQPPFSTFGQKCAKPIPPCVYKVEYADNAEST 157
Query: 179 GYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLL 238
G RD + + SG+ PL V+FGCG Q + + G+LG G S+L
Sbjct: 158 GALARDYMHIGSPSGS--NVPL---VVFGCGYEQKFSG-PTPPPSTPGVLGLGNGKISIL 211
Query: 239 SQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSPK--VKTTPMVPNMPHYNVILEEVEVG 296
SQL + G + HCL +GGG +GD P + TP++ + LE+
Sbjct: 212 SQLHSMGFIHNVLGHCLS-AEGGGYLFLGDKFIPSSGIFWTPIIQSS------LEKHYST 264
Query: 297 GNPLDL-----PTSLLGTGDERGTIIDSGTTLAYLPPMLYDLV 334
G P+DL PT G I DSG++ Y P +Y +V
Sbjct: 265 G-PVDLFFNGKPTPAKGL----QIIFDSGSSYTYFSPRVYTIV 302
>gi|297805038|ref|XP_002870403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316239|gb|EFH46662.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 440
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 87/264 (32%), Positives = 120/264 (45%), Gaps = 27/264 (10%)
Query: 78 SATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSST 137
S +G Y + LGTP DTGSDLLW C C C T+ D LFDP SST
Sbjct: 89 SNSGEYLMNISLGTPPFPIMAIADTGSDLLWTQCKPCDDCYTQVD-----PLFDPKASST 143
Query: 138 SGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKT 197
+++CS + C T N+ + C Y +YGD S T G D + L G+ T
Sbjct: 144 YKDVSCSSSQC-TALENQASCSTEDNTCSYSTSYGDRSYTKGNIAVDTLTL----GSTDT 198
Query: 198 APLN-SSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLD 256
P+ ++I GCG+ +G V G G SL++QL ++ +F++CL
Sbjct: 199 RPVQLKNIIIGCGHNNAGTFNKKGSGIV----GLGGGAVSLITQL--GDSIDGKFSYCLV 252
Query: 257 VVKGGGI------FAIGDVVS-PKVKTTPMVPNMPH--YNVILEEVEVGGNPLDLPTSLL 307
+ F VVS V +TP++ Y + L+ + VG + P S
Sbjct: 253 PLTSENDRTSKINFGTNAVVSGTGVVSTPLIAKSQETFYYLTLKSISVGSKEVQYPGSDS 312
Query: 308 GTGDERGTIIDSGTTLAYLPPMLY 331
G+G E IIDSGTTL LP Y
Sbjct: 313 GSG-EGNIIIDSGTTLTLLPTEFY 335
>gi|168054484|ref|XP_001779661.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162668975|gb|EDQ55572.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 419
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 92/278 (33%), Positives = 122/278 (43%), Gaps = 44/278 (15%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSG 139
+G YF LGTP ++ + VD+GSDLLWV CA C +C + L+ PS SST
Sbjct: 62 SGQYFVDFFLGTPPQKFSLIVDSGSDLLWVQCAPCLQCYAQ-----DTPLYAPSNSSTFN 116
Query: 140 EIACSDNFCRTTYNNRYPSCS---PGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLK 196
+ C C C PG C Y Y D S + G F + ++ +
Sbjct: 117 PVPCLSPECLLIPATEGFPCDFHYPGA-CAYEYRYADTSLSKGVFAYESATVDDVRID-- 173
Query: 197 TAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLA-AAGNVRKEFAHC- 254
V FGCG G AA G+LG GQ S SQ+ A GN +FA+C
Sbjct: 174 ------KVAFGCGRDNQGSF-----AAAGGVLGLGQGPLSFGSQVGYAYGN---KFAYCL 219
Query: 255 ---LDVVKGGGIFAIGDVVSPKV---KTTPMVPNMPH---YNVILEEVEVGGNPLDLPTS 305
LD GD + + + TP+V N + Y V +E+V VGG L + S
Sbjct: 220 VNYLDPTSVSSWLIFGDELISTIHDLQFTPIVSNSRNPTLYYVQIEKVMVGGESLPISHS 279
Query: 306 -----LLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
LG G G+I DSGTT+ Y P Y +L+ F
Sbjct: 280 AWSLDFLGNG---GSIFDSGTTVTYWLPPAYRNILAAF 314
>gi|356573235|ref|XP_003554768.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 472
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 92/302 (30%), Positives = 136/302 (45%), Gaps = 38/302 (12%)
Query: 49 LSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLW 108
L+AL Q RR G +S + G +G YFT++G+GTP Y+ +DTGSD++W
Sbjct: 99 LAALNQSHARRSGSSFSSSIISGLAQG----SGEYFTRIGVGTPARYVYMVLDTGSDVVW 154
Query: 109 VNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVR-CEY 167
+ CA C +C T++D +FDP+KS T I C CR + P C+ + C+Y
Sbjct: 155 LQCAPCRKCYTQAD-----PVFDPTKSRTYAGIPCGAPLCRRLDS---PGCNNKNKVCQY 206
Query: 168 VVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGI 227
V+YGDGS T G F + + + + V GCG+ G +
Sbjct: 207 QVSYGDGSFTFGDFSTETLTFRRTR--------VTRVALGCGHDNEGLFIGAAGLLGL-- 256
Query: 228 LGFGQANSSLLSQLAAAGNVRKEFAHCL----DVVKGGGIFAIGDVVSPKVKTTPMVPNM 283
G+ S Q N ++F++CL K + VS + TP++ N
Sbjct: 257 ---GRGRLSFPVQTGRRFN--QKFSYCLVDRSASAKPSSVVFGDSAVSRTARFTPLIKNP 311
Query: 284 P---HYNVILEEVEVGGNPLD-LPTSL--LGTGDERGTIIDSGTTLAYLPPMLYDLVLSQ 337
Y + L + VGG+P+ L SL L G IIDSGT++ L Y +
Sbjct: 312 KLDTFYYLELLGISVGGSPVRGLSASLFRLDAAGNGGVIIDSGTSVTRLTRPAYIALRDA 371
Query: 338 FR 339
FR
Sbjct: 372 FR 373
>gi|414888271|tpg|DAA64285.1| TPA: hypothetical protein ZEAMMB73_923514, partial [Zea mays]
Length = 335
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 84/265 (31%), Positives = 123/265 (46%), Gaps = 31/265 (11%)
Query: 82 LYFTKVGLGTPTDEYYVQVDTGSDLLWV--NCAGCSRCPTKSDLGIKLTLFDPSKSSTSG 139
L++ V LGTP + V +DTGSDL WV +C C+ + + +K + P KSSTS
Sbjct: 87 LHYAVVALGTPNVTFLVALDTGSDLFWVPCDCINCAPLVSPNYRDLKFDTYSPQKSSTSR 146
Query: 140 EIACSDNFCRTTYNNRYPSCSPGVRCEYVVTY-GDGSSTSGYFVRDIIQLNQASG---NL 195
++ CS N C + + S C Y + Y D +S++G V D++ L G +
Sbjct: 147 KVPCSSNLC----DEQSACRSASSSCPYSIQYLSDNTSSTGVLVEDVLYLVTEYGRQPKI 202
Query: 196 KTAPLNSSVIFGCGNRQSGD-LGSSTDAAVDGILGFGQANSSLLSQLAAAG-NVRKEFAH 253
TAP + FGCG Q+G LG+ AA +G+LG G S+ S LA+ G F+
Sbjct: 203 VTAP----ITFGCGRTQTGSFLGT---AAPNGLLGLGMDTISVPSLLASQGVAAANSFSM 255
Query: 254 CLDVVKGGGIFAIGDVVSPKVKTTP--MVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGD 311
C G G GD S + TP M P+YN+ + VG +
Sbjct: 256 CF-AQDGHGRINFGDTGSSDQQETPLNMYKQNPYYNISITGATVGSKSIHT--------- 305
Query: 312 ERGTIIDSGTTLAYLPPMLYDLVLS 336
+ I+DSGT+ L +Y + S
Sbjct: 306 KFNAIVDSGTSFTALSDPMYTQITS 330
>gi|168013126|ref|XP_001759252.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689565|gb|EDQ75936.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 385
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 92/274 (33%), Positives = 128/274 (46%), Gaps = 35/274 (12%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSG 139
+G YF +V +GTP Y+ +DTGSD+LW+ CA C C + D +FDP KSST
Sbjct: 34 SGEYFIRVSVGTPPRGMYLVMDTGSDILWLQCAPCVSCYHQCD-----EVFDPYKSSTYS 88
Query: 140 EIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAP 199
+ C+ C N C G +C Y V YGDGS ++G F D + LN SG +
Sbjct: 89 TLGCNSRQC---LNLDVGGCV-GNKCLYQVDYGDGSFSTGEFATDAVSLNSTSGGGQVV- 143
Query: 200 LNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL---- 255
LN + GCG+ G G+LG G+ S +Q+ + R F++CL
Sbjct: 144 LN-KIPLGCGHDNEGYF-----VGAAGLLGLGKGPLSFPNQINSENGGR--FSYCLTGRD 195
Query: 256 --DVVKGGGIFAIGDVVSPKVKTTPMVPNM---PHYNVILEEVEVGGNPLDLPTSL---- 306
+ IF V V+ TP N+ Y + + + VGG+ L +PTS
Sbjct: 196 TDSTERSSLIFGDAAVPPAGVRFTPQASNLRVSTFYYLKMTGISVGGSILTIPTSAFQLD 255
Query: 307 -LGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
LG G G IIDSGT++ L Y + FR
Sbjct: 256 SLGNG---GVIIDSGTSVTRLQNAAYASLREAFR 286
>gi|242050426|ref|XP_002462957.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
gi|241926334|gb|EER99478.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
Length = 452
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 82/279 (29%), Positives = 127/279 (45%), Gaps = 31/279 (11%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGE 140
G Y + +GTP + +DTGSDL W CA C T + L+DP++SST +
Sbjct: 94 GAYHMILSVGTPPLAFPAIIDTGSDLTWTQCAPC----TTACFAQPTPLYDPARSSTFSK 149
Query: 141 IACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPL 200
+ C+ C+ + + +C+ C Y Y G T+GY D + + G+ +
Sbjct: 150 LPCASPLCQ-ALPSAFRACN-ATGCVYDYRYAVG-FTAGYLAADTLAIGDGDGDGDASSS 206
Query: 201 NSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKG 260
+ V FGC GD+ ++ GI+G G++ SLLSQ+ F++CL
Sbjct: 207 FAGVAFGCSTANGGDMDGAS-----GIVGLGRSALSLLSQIGVG-----RFSYCLRSDAD 256
Query: 261 GG----IF-AIGDVVSPKVKTTPMVPN-------MPHYNVILEEVEVGGNPLDLPTSLLG 308
G +F A+ +V KV++T ++ N P+Y V L + VG L + +S G
Sbjct: 257 AGASPILFGALANVTGDKVQSTALLRNPVAARRRAPYYYVNLTGIAVGSTDLPVTSSTFG 316
Query: 309 --TGDERGTIIDSGTTLAYLPPMLYDLVLSQFRFWIASL 345
G I+DSGTT YL Y ++ F A L
Sbjct: 317 FTAAGAGGVIVDSGTTFTYLAEAGYTMLRQAFLSQTAGL 355
>gi|42565828|ref|NP_190704.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332645262|gb|AEE78783.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 488
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 94/341 (27%), Positives = 148/341 (43%), Gaps = 43/341 (12%)
Query: 24 GGGVMGNFVFEVENKFKA------GGERERTLSALKQHDT---RRHGRMMASIDLE---- 70
V G+ FE+ ++F GG + +L + R GR + S +
Sbjct: 15 ASSVSGSLSFEIHHRFSEQVKTVLGGHGLPEMGSLDYYKALVHRDRGRQLTSNNNNQTTI 74
Query: 71 --LGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRC--PTKSDLG-- 124
GN + L++ V +GTP + V +DTGSDL W+ C S C ++D G
Sbjct: 75 SFAQGNSTEEISFLHYANVTIGTPAQWFLVALDTGSDLFWLPCNCNSTCVRSMETDQGER 134
Query: 125 IKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTY-GDGSSTSGYFVR 183
IKL +++PSKS +S ++ C+ C NR SP C Y + Y GS ++G V
Sbjct: 135 IKLNIYNPSKSKSSSKVTCNSTLC--ALRNR--CISPVSDCPYRIRYLSPGSKSTGVLVE 190
Query: 184 DIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAA 243
D+I ++ G + A + FGC Q LG + AV+GI+G A+ ++ + L
Sbjct: 191 DVIHMSTEEGEARDA----RITFGCSESQ---LGLFKEVAVNGIMGLAIADIAVPNMLVK 243
Query: 244 AGNVRKEFAHCLDVVKGGGIFAIGDVVSPKVKTTPMVPNMP--HYNVILEEVEVGGNPLD 301
AG F+ C G G + GD S TP+ + Y+V + + +VG +D
Sbjct: 244 AGVASDSFSMCFG-PNGKGTISFGDKGSSDQLETPLSGTISPMFYDVSITKFKVGKVTVD 302
Query: 302 LPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFRFWI 342
E DSGT + +L Y + + F +
Sbjct: 303 ---------TEFTATFDSGTAVTWLIEPYYTALTTNFHLSV 334
>gi|302776610|ref|XP_002971459.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
gi|300160591|gb|EFJ27208.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
Length = 357
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 81/273 (29%), Positives = 128/273 (46%), Gaps = 38/273 (13%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSG 139
+G YF +VG+G+PT Y+ +DTGSD+ W+ C+ C C ++D +FDP SS+
Sbjct: 11 SGEYFVRVGIGSPTKLQYLVMDTGSDVPWIQCSPCKSCYKQND-----AVFDPRASSSFR 65
Query: 140 EIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAP 199
++CS C+ + S RC Y V+YGDGS T G D +++ +T+P
Sbjct: 66 RLSCSTPQCKLL--DVKACASTDNRCLYQVSYGDGSFTVGDLASDSFSVSRG----RTSP 119
Query: 200 LNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL---- 255
V+FGCG+ G + G S SQL++ ++F++CL
Sbjct: 120 ----VVFGCGHDNEGLFVGAAGLLGLGAGKL-----SFPSQLSS-----RKFSYCLVSRD 165
Query: 256 DVVKGGGIFAIGDVVSPKVKT---TPMVPNMP---HYNVILEEVEVGGNPLDLPTS---L 306
+ V+ GD P + T ++ N Y L + +GG L +P++ L
Sbjct: 166 NGVRASSALLFGDSALPTSASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAFKL 225
Query: 307 LGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
+ G IIDSGT++ LP Y ++ FR
Sbjct: 226 SSSTGRGGVIIDSGTSVTRLPTYAYTVMRDAFR 258
>gi|115472515|ref|NP_001059856.1| Os07g0532800 [Oryza sativa Japonica Group]
gi|50508274|dbj|BAD32123.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113611392|dbj|BAF21770.1| Os07g0532800 [Oryza sativa Japonica Group]
Length = 436
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 87/270 (32%), Positives = 126/270 (46%), Gaps = 39/270 (14%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGE 140
G Y + +GTP + V DTGSDL+W CA C++C + F P+ SST +
Sbjct: 84 GGYNMNISVGTPLLTFSVVADTGSDLIWTQCAPCTKCFQQ-----PAPPFQPASSSTFSK 138
Query: 141 IACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPL 200
+ C+ +FC+ N+ + G C Y YG G T+GY + +++ AS
Sbjct: 139 LPCTSSFCQFLPNSIRTCNATG--CVYNYKYGSG-YTAGYLATETLKVGDAS-------- 187
Query: 201 NSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKG 260
SV FGC +G+ST GI G G+ SL+ QL F++CL
Sbjct: 188 FPSVAFGCSTENG--VGNST----SGIAGLGRGALSLIPQLGVG-----RFSYCLRSGSA 236
Query: 261 GG----IF-AIGDVVSPKVKTTPMVPNMP----HYNVILEEVEVGGNPLDLPTSLLG--- 308
G +F ++ ++ V++TP V N +Y V L + VG L + TS G
Sbjct: 237 AGASPILFGSLANLTDGNVQSTPFVNNPAVHPSYYYVNLTGITVGETDLPVTTSTFGFTQ 296
Query: 309 TGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
G GTI+DSGTTL YL Y++V F
Sbjct: 297 NGLGGGTIVDSGTTLTYLAKDGYEMVKQAF 326
>gi|302822373|ref|XP_002992845.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
gi|300139393|gb|EFJ06135.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
Length = 510
Score = 98.2 bits (243), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 84/279 (30%), Positives = 131/279 (46%), Gaps = 40/279 (14%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y+ + LGTP E + +DTGSD+ W+ C C C + F+P SS+ ++
Sbjct: 138 YYVPLQLGTPAVEVVLIMDTGSDVSWIQCVPCKDC-----VPALRPPFNPRHSSSFFKLP 192
Query: 143 CSDNFCRTTYNNRYPSCSPGVR-CEYVVTYGDGSSTSGYFVRDIIQLNQAS-GNLKTAPL 200
C+ + C Y P CSP R C + + YGDGS +SG + I N + G+ + L
Sbjct: 193 CASSTCTNVYQGVKPFCSPSGRTCLFSIQYGDGSLSSGLLAMETIAGNTPNFGDGEPVKL 252
Query: 201 NSSVIFGCG--NRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL-DV 257
S++ GC +R+ G+S G+LG + S SQL++ ++F+HC D
Sbjct: 253 -SNITLGCADIDREGLPTGAS------GLLGMDRRPISFPSQLSS--RYARKFSHCFPDK 303
Query: 258 V-----KGGGIFAIGDVVSPKVKTTPMVPN-------MPHYNVILEEVEVGGNPLDLP-- 303
+ G F D++SP ++ TP+V N + +Y V L + V + L L
Sbjct: 304 IAHLNSSGLVFFGESDIISPYLRYTPLVQNPAVPSASLDYYYVGLVGISVDESRLPLSHK 363
Query: 304 ----TSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
+ G+G GTIIDSGT YL + + +F
Sbjct: 364 NFDIDKVTGSG---GTIIDSGTAFTYLKKPAFQAMRREF 399
>gi|302817380|ref|XP_002990366.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
gi|300141928|gb|EFJ08635.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
Length = 420
Score = 98.2 bits (243), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 78/268 (29%), Positives = 120/268 (44%), Gaps = 31/268 (11%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSG 139
+G YF ++G+GTP Y+ DTGSD+ W+ C+ C +C + D +F+PS SS+
Sbjct: 78 SGDYFARIGVGTPARSVYMVADTGSDVSWLQCSPCRKCYRQQD-----PIFNPSLSSSFK 132
Query: 140 EIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAP 199
+AC+ + C + CS C Y V+YGDGS T G F + + + +
Sbjct: 133 PLACASSICGKL---KIKGCSRKNECMYQVSYGDGSFTVGDFSTETLSFGEHAVR----- 184
Query: 200 LNSSVIFGCGNRQSG--DLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDV 257
SV GCG G + G L F + + + + R+E A +
Sbjct: 185 ---SVAMGCGRNNQGLFHGAAGLLGLGRGPLSFPSQTGTSYASVFSYCLPRRESAIAASL 241
Query: 258 VKGGGIFAIGDVVSPKVKTTPMVPNM---PHYNVILEEVEVGGNPLDLPTSLLGTGDERG 314
V G V K + T ++PN +Y V L + V G+P+++P G RG
Sbjct: 242 VFG------PSAVPEKARFTKLLPNRRLDTYYYVGLARIRVAGSPVNIPPDAFAMG-SRG 294
Query: 315 T---IIDSGTTLAYLPPMLYDLVLSQFR 339
T I+DSGT ++ L Y + FR
Sbjct: 295 TGGVIVDSGTAISRLTTPAYTALRDAFR 322
>gi|125558622|gb|EAZ04158.1| hypothetical protein OsI_26300 [Oryza sativa Indica Group]
Length = 435
Score = 98.2 bits (243), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 87/270 (32%), Positives = 126/270 (46%), Gaps = 39/270 (14%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGE 140
G Y + +GTP + V DTGSDL+W CA C++C + F P+ SST +
Sbjct: 84 GGYNMNISVGTPLLTFPVVADTGSDLIWTQCAPCTKCFQQ-----PAPPFQPASSSTFSK 138
Query: 141 IACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPL 200
+ C+ +FC+ N+ + G C Y YG G T+GY + +++ AS
Sbjct: 139 LPCTSSFCQFLPNSIRTCNATG--CVYNYKYGSG-YTAGYLATETLKVGDAS-------- 187
Query: 201 NSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKG 260
SV FGC +G+ST GI G G+ SL+ QL F++CL
Sbjct: 188 FPSVAFGCSTENG--VGNST----SGIAGLGRGALSLIPQLGVG-----RFSYCLRSGSA 236
Query: 261 GG----IF-AIGDVVSPKVKTTPMVPNMP----HYNVILEEVEVGGNPLDLPTSLLG--- 308
G +F ++ ++ V++TP V N +Y V L + VG L + TS G
Sbjct: 237 AGASPILFGSLANLTDGNVQSTPFVNNPAVHPSYYYVNLTGITVGETDLPVTTSTFGFTQ 296
Query: 309 TGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
G GTI+DSGTTL YL Y++V F
Sbjct: 297 NGLGGGTIVDSGTTLTYLAKDGYEMVKQAF 326
>gi|302795261|ref|XP_002979394.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
gi|300153162|gb|EFJ19802.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
Length = 353
Score = 98.2 bits (243), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 78/268 (29%), Positives = 121/268 (45%), Gaps = 31/268 (11%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSG 139
+G YF ++G+GTP Y+ DTGSD+ W+ C+ C +C + D +F+PS SS+
Sbjct: 11 SGDYFARIGVGTPARSVYMVADTGSDVSWLQCSPCRKCYRQQD-----PIFNPSLSSSFK 65
Query: 140 EIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAP 199
+AC+ + C + CS +C Y V+YGDGS T G F + + + +
Sbjct: 66 PLACASSICGKL---KIKGCSRKNKCMYQVSYGDGSFTVGDFSTETLSFGEHAVR----- 117
Query: 200 LNSSVIFGCGNRQSG--DLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDV 257
SV GCG G + G L F + + + + R+E A +
Sbjct: 118 ---SVAMGCGRNNQGLFHGAAGLLGLGRGPLSFPSQTGTSYASVFSYCLPRRESAIAASL 174
Query: 258 VKGGGIFAIGDVVSPKVKTTPMVPNM---PHYNVILEEVEVGGNPLDLPTSLLGTGDERG 314
V G V K + T ++PN +Y V L + V G+P+++P G RG
Sbjct: 175 VFG------PSAVPEKARFTKLLPNRRLDTYYYVGLARIRVAGSPVNIPPDAFAMG-SRG 227
Query: 315 T---IIDSGTTLAYLPPMLYDLVLSQFR 339
T I+DSGT ++ L Y + FR
Sbjct: 228 TGGVIVDSGTAISRLTTPAYTALRDAFR 255
>gi|358346443|ref|XP_003637277.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355503212|gb|AES84415.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 98.2 bits (243), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 80/278 (28%), Positives = 135/278 (48%), Gaps = 39/278 (14%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGE 140
G Y +GTP + Y +DTGS+++W+ C C+ C ++ +F+PSKSS+
Sbjct: 87 GEYLISYSVGTPPFKVYGFMDTGSNIVWLQCQPCNTCFNQTS-----PIFNPSKSSSYKN 141
Query: 141 IACSDNFCRTTYNNRYPSCSPG-VRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAP 199
I C+ + C+ T N+ + SCS G CEY +TYG + + G D + L+ SG+ P
Sbjct: 142 IPCTSSTCKDT-NDTHISCSNGGDVCEYSITYGGDAKSQGDLSNDSLTLDSTSGSSVLFP 200
Query: 200 LNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL---- 255
+++ GCG+ S + G++G G+ SL+ Q+ ++ V +F++CL
Sbjct: 201 ---NIVIGCGHINVLQDNSQS----SGVVGMGRGPMSLIKQVGSSS-VGSKFSYCLIPYN 252
Query: 256 -------DVVKGGGIFAIGDVVSPKVKTTPMVP---NMPHYNVILEEVEVGGNPLDLPTS 305
++ G + G++ V +TPMV +Y + LE VG N ++
Sbjct: 253 SDSNSSSKLIFGEDVVVSGEI----VVSTPMVKVNGQENYYFLTLEAFSVGNNRIEYGER 308
Query: 306 LLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFRFWIA 343
+ +IDSGT L LP +L LS+ ++A
Sbjct: 309 --SNASTQNILIDSGTPLTMLP----NLFLSKLVSYVA 340
>gi|297817972|ref|XP_002876869.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297322707|gb|EFH53128.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 462
Score = 98.2 bits (243), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 88/289 (30%), Positives = 128/289 (44%), Gaps = 63/289 (21%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRC---PTKSDLGIKLTLFDPSKSS 136
+G + ++ +G P +Y VDTGSDL+W C C+ C PT +FDP KSS
Sbjct: 105 SGEFLMELSIGNPAVKYAAIVDTGSDLIWTQCKPCTECFDQPTP--------IFDPEKSS 156
Query: 137 TSGEIACSDNFC----RTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQAS 192
+ ++ CS C R+ N S CEY+ TYGD SST G + +
Sbjct: 157 SYSKVGCSSGLCNALPRSNCNEDKDS------CEYLYTYGDYSSTRGLLATETFTFEDEN 210
Query: 193 GNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFA 252
S + FGCG GD G S + G++G G+ SL+SQL +F+
Sbjct: 211 S-------ISGIGFGCGVENEGD-GFSQGS---GLVGLGRGPLSLISQLK-----ETKFS 254
Query: 253 HCLDVV---KGGGIFAIGDVVSPKV------------KTTPMV--PNMP-HYNVILEEVE 294
+CL + + IG + S V KT ++ P+ P Y + L+ +
Sbjct: 255 YCLTSIEDSEASSSLFIGSLASGIVNKTGANLDGEVTKTMSLLRNPDQPSFYYLELQGIT 314
Query: 295 VGGNPLDLPTSLL-----GTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
VG L + S GTG G IIDSGTT+ YL + ++ +F
Sbjct: 315 VGAKRLSVEKSTFELSEDGTG---GMIIDSGTTITYLEETAFKVLKEEF 360
>gi|255548664|ref|XP_002515388.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545332|gb|EEF46837.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 494
Score = 98.2 bits (243), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 83/266 (31%), Positives = 117/266 (43%), Gaps = 28/266 (10%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSG 139
+G YF VGLGTP ++ + DTGSDL W C C KS K +F+PS+S++
Sbjct: 150 SGNYFVTVGLGTPKKDFSLIFDTGSDLTWTQCEPC----VKSCYNQKEAIFNPSQSTSYA 205
Query: 140 EIACSDNFCRT--TYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKT 197
I+C C + + +C+ C Y + YGD S + G+F ++ + L
Sbjct: 206 NISCGSTLCDSLASATGNIFNCASST-CVYGIQYGDSSFSIGFFGKEKLSLTATD----- 259
Query: 198 APLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDV 257
+ + FGCG G G + G+ SL+SQ A N K F++CL
Sbjct: 260 --VFNDFYFGCGQNNKGLFGGAAGLLGL-----GRDKLSLVSQTAQRYN--KIFSYCLPS 310
Query: 258 VKGG-GIFAIGDVVSPKVKTTPMVP---NMPHYNVILEEVEVGGNPLDLPTSLLGTGDER 313
G G S TP+ Y + L + VGG L + S+ T
Sbjct: 311 SSSSTGFLTFGGSTSKSASFTPLATISGGSSFYGLDLTGISVGGRKLAISPSVFSTA--- 367
Query: 314 GTIIDSGTTLAYLPPMLYDLVLSQFR 339
GTIIDSGT + LPP Y + S FR
Sbjct: 368 GTIIDSGTVITRLPPAAYSALSSTFR 393
>gi|297819836|ref|XP_002877801.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297323639|gb|EFH54060.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 513
Score = 97.8 bits (242), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 77/272 (28%), Positives = 123/272 (45%), Gaps = 34/272 (12%)
Query: 82 LYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRC--PTKSDLG--------IKLTLFD 131
L++ V +GTP + V +DTGSDL W+ C S C ++D G I+L +++
Sbjct: 110 LHYANVTIGTPAQWFLVALDTGSDLFWLPCNCNSTCVRSMETDQGETHMNAQRIRLNIYN 169
Query: 132 PSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTY-GDGSSTSGYFVRDIIQLNQ 190
PS S++S ++ C+ C R SP C Y + Y GS ++G V D+I ++
Sbjct: 170 PSISTSSSKVTCNSTLCAL----RNRCISPLSDCPYRIRYLSPGSKSTGVLVEDVIHMST 225
Query: 191 ASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKE 250
G + A + FGC Q LG + AV+GI+G A+ ++ + L AG
Sbjct: 226 EEGEARDA----RITFGCSETQ---LGLFQEVAVNGIMGLAMADIAVPNMLVKAGVASDS 278
Query: 251 FAHCLDVVKGGGIFAIGDVVSPKVKTTPMVPNMP--HYNVILEEVEVGGNPLDLPTSLLG 308
F+ C G G + GD S TP+ + Y+V + + +VG ++ S
Sbjct: 279 FSMCFG-PNGKGTISFGDKGSSDQHETPLGGTISPLFYDVSITKFKVGKVTVETKFS--- 334
Query: 309 TGDERGTIIDSGTTLAYLPPMLYDLVLSQFRF 340
I DSGT + +L Y + + F
Sbjct: 335 ------AIFDSGTAVTWLLDPYYTALTTNFHL 360
>gi|356524287|ref|XP_003530761.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 481
Score = 97.8 bits (242), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 81/266 (30%), Positives = 117/266 (43%), Gaps = 30/266 (11%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCS-RCPTKSDLGIKLTLFDPSKSSTSGEI 141
YF VGLGTP + + DTGSDL W C C+ C + D +FDPSKSS+ I
Sbjct: 136 YFVVVGLGTPKRDLSLVFDTGSDLTWTQCEPCAGSCYKQQD-----AIFDPSKSSSYINI 190
Query: 142 ACSDNFCR--TTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAP 199
C+ + C T+ + S C Y + YGD S++ G+ L+Q +
Sbjct: 191 TCTSSLCTQLTSAGIKSRCSSSTTACIYGIQYGDKSTSVGF-------LSQERLTITATD 243
Query: 200 LNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVK 259
+ +FGCG G S G++G G+ S + Q ++ N K F++CL
Sbjct: 244 IVDDFLFGCGQDNEGLFSGSA-----GLIGLGRHPISFVQQTSSIYN--KIFSYCLPSTS 296
Query: 260 ---GGGIFAIGDVVSPKVKTTPMVP---NMPHYNVILEEVEVGGNPLDLPTSLLGTGDER 313
G F + +K TP+ + Y + + + VGG LP T
Sbjct: 297 SSLGHLTFGASAATNANLKYTPLSTISGDNTFYGLDIVGISVGGT--KLPAVSSSTFSAG 354
Query: 314 GTIIDSGTTLAYLPPMLYDLVLSQFR 339
G+IIDSGT + L P Y + S FR
Sbjct: 355 GSIIDSGTVITRLAPTAYAALRSAFR 380
>gi|357143680|ref|XP_003573011.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 510
Score = 97.8 bits (242), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 84/292 (28%), Positives = 128/292 (43%), Gaps = 35/292 (11%)
Query: 62 RMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKS 121
R++A+++ +G +G Y +V +GTP + + +DTGSDL W+ CA C C
Sbjct: 134 RLVATVE-----SGVAVGSGEYLVEVYVGTPPRRFQMIMDTGSDLNWLQCAPCLDC---- 184
Query: 122 DLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVR---CEYVVTYGDGSSTS 178
+ +FDP S++ + C D C P R C Y YGD S+T+
Sbjct: 185 -FDQRGPVFDPMASTSYRNVTCGDTRCGLVSPPAAPRTCRSSRSDPCPYYYWYGDQSNTT 243
Query: 179 GYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLL 238
G + +N + + + V+ GCG+R G + G+ S
Sbjct: 244 GDLALEAFTVNLTASSSRRV---DGVVLGCGHRNRGLFHGAAGLLGL-----GRGPLSFA 295
Query: 239 SQLAAAGNVRKEFAHCL----DVVKGGGIFAIGDVV--SPKVKTTPMVPNMPH---YNVI 289
SQL A F++CL V +F +V+ P++ T P+ Y V
Sbjct: 296 SQLRAVYG--HAFSYCLVDHGSAVGSKIVFGDDNVLLSHPQLNYTAFAPSAAENTFYYVQ 353
Query: 290 LEEVEVGGNPLDLPTSLLGTGDER---GTIIDSGTTLAYLPPMLYDLVLSQF 338
L+ + VGG LD+P++ G E GTIIDSGTTL+Y P Y + F
Sbjct: 354 LKGILVGGEMLDIPSNTWGVSKEDGSGGTIIDSGTTLSYFPEPAYKAIRQAF 405
>gi|302765224|ref|XP_002966033.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
gi|300166847|gb|EFJ33453.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
Length = 357
Score = 97.8 bits (242), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 81/273 (29%), Positives = 128/273 (46%), Gaps = 38/273 (13%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSG 139
+G YF +VG+G+PT Y+ +DTGSD+ W+ C+ C C ++D +FDP SS+
Sbjct: 11 SGEYFVRVGIGSPTKLQYLVMDTGSDVPWIQCSPCKSCYKQND-----AVFDPRASSSFR 65
Query: 140 EIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAP 199
++CS C+ + S RC Y V+YGDGS T G D +++ +T+P
Sbjct: 66 RLSCSTPQCKLL--DVKACASTDNRCLYQVSYGDGSFTVGDLASDSFLVSRG----RTSP 119
Query: 200 LNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL---- 255
V+FGCG+ G + G S SQL++ ++F++CL
Sbjct: 120 ----VVFGCGHDNEGLFVGAAGLLGLGAGKL-----SFPSQLSS-----RKFSYCLVSRD 165
Query: 256 DVVKGGGIFAIGDVVSPKVKT---TPMVPNMP---HYNVILEEVEVGGNPLDLPTS---L 306
+ V+ GD P + T ++ N Y L + +GG L +P++ L
Sbjct: 166 NGVRASSALLFGDSALPTSASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAFKL 225
Query: 307 LGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
+ G IIDSGT++ LP Y ++ FR
Sbjct: 226 SSSTGRGGVIIDSGTSVTRLPTYAYTVMRDAFR 258
>gi|356496606|ref|XP_003517157.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
Length = 508
Score = 97.8 bits (242), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 84/264 (31%), Positives = 122/264 (46%), Gaps = 26/264 (9%)
Query: 82 LYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLG----IKLTLFDPSKSST 137
L+F V +GTP + V +DTGSDL W+ C C++C L I ++D SST
Sbjct: 100 LHFANVSVGTPPLSFLVALDTGSDLFWLPC-NCTKCVHGIGLSNGEKIAFNIYDLKGSST 158
Query: 138 SGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTY-GDGSSTSGYFVRDIIQLNQASGNLK 196
S + C+ + C + PS C Y V Y +G+ST+G+ V D++ L + + K
Sbjct: 159 SQPVLCNSSLCE--LQRQCPSSD--TICPYEVNYLSNGTSTTGFLVEDVLHL--ITDDDK 212
Query: 197 TAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLD 256
T ++ + FGCG Q+G AA +G+ G G +N S+ S LA G F+ C
Sbjct: 213 TKDADTRITFGCGQVQTGAFLDG--AAPNGLFGLGMSNESVPSILAKEGLTSNSFSMCFG 270
Query: 257 VVKGGGIFAIGDVVSPKVKTTP--MVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERG 314
G G GD S TP + P YN+ + ++ VG DL E
Sbjct: 271 -SDGLGRITFGDNSSLVQGKTPFNLRALHPTYNITVTQIIVGEKVDDL---------EFH 320
Query: 315 TIIDSGTTLAYLPPMLYDLVLSQF 338
I DSGT+ YL Y + + F
Sbjct: 321 AIFDSGTSFTYLNDPAYKQITNSF 344
>gi|242092902|ref|XP_002436941.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
gi|241915164|gb|EER88308.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
Length = 445
Score = 97.4 bits (241), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 90/266 (33%), Positives = 126/266 (47%), Gaps = 35/266 (13%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCS--RCPTKSDLGIKLTLFDPSKSSTSGE 140
Y V GTP V +DTGSDL W+ C CS +C + D LFDPS SST
Sbjct: 112 YVATVSFGTPAVPQVVVIDTGSDLTWLQCKPCSSGQCSPQKD-----PLFDPSHSSTYSA 166
Query: 141 IACSDNFCRTTYNNRYPS-CSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAP 199
+ C+ C+ + Y S CS G C + ++Y DG+ST G + +D + L +
Sbjct: 167 VPCASGECKKLAADAYGSGCSNGQPCGFAISYVDGTSTVGVYGKDKLTLAPGA------- 219
Query: 200 LNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVK 259
+ FGCG+ + SS DG+LG G+ + SL +Q F++CL V
Sbjct: 220 IVKDFYFGCGHSK-----SSLPGLFDGLLGLGRLSESLGAQYGGG----GGFSYCLPAVN 270
Query: 260 GG-GIFAIGDVVSPK-VKTTPM--VPNMPHYN-VILEEVEVGGNPLDL-PTSLLGTGDER 313
G A G +P TPM VP P ++ V L + VGG LDL P++ G
Sbjct: 271 SKPGFLAFGAGRNPSGFVFTPMGRVPGQPTFSTVTLAGITVGGKKLDLRPSAFSG----- 325
Query: 314 GTIIDSGTTLAYLPPMLYDLVLSQFR 339
G I+DSGT + L +Y + + FR
Sbjct: 326 GMIVDSGTVVTVLQSTVYRALRAAFR 351
>gi|226494448|ref|NP_001141341.1| uncharacterized protein LOC100273432 precursor [Zea mays]
gi|194704078|gb|ACF86123.1| unknown [Zea mays]
gi|413953775|gb|AFW86424.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 471
Score = 97.4 bits (241), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 85/260 (32%), Positives = 118/260 (45%), Gaps = 34/260 (13%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGC-SRCPTKSDLGIKLTLFDPSKSSTSG 139
G Y T++GLGTP+ Y + VDTGS L W+ C+ C C +G LFDP SST
Sbjct: 132 GNYVTQLGLGTPSTSYAMVVDTGSSLTWLQCSPCVVSC--HRQVG---PLFDPRASSTYT 186
Query: 140 EIACSDNFC----RTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNL 195
+ CS + C T N +CS C Y +YGD S + GY D + S
Sbjct: 187 SVRCSASQCDELQAATLNPS--ACSASNVCIYQASYGDSSFSVGYLSTDTVSFGSTS--- 241
Query: 196 KTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL 255
S +GCG G G S G++G + SLL QLA ++ F++CL
Sbjct: 242 -----YPSFYYGCGQDNEGLFGRSA-----GLIGLARNKLSLLYQLAP--SLGYSFSYCL 289
Query: 256 DVVKGGGIFAIGDVVSPKVKT-TPMVP---NMPHYNVILEEVEVGGNPLDLPTSLLGTGD 311
G +IG + + TPM + Y + L + VGG+PL + S +
Sbjct: 290 PTAASTGYLSIGPYNTGHYYSYTPMASSSLDASLYFITLSGMSVGGSPLAVSPSEYSS-- 347
Query: 312 ERGTIIDSGTTLAYLPPMLY 331
TIIDSGT + LP ++
Sbjct: 348 -LPTIIDSGTVITRLPTAVH 366
>gi|218192707|gb|EEC75134.1| hypothetical protein OsI_11325 [Oryza sativa Indica Group]
Length = 401
Score = 97.4 bits (241), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 93/282 (32%), Positives = 125/282 (44%), Gaps = 42/282 (14%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPS 133
NG P T Y + +GTP + +DTGSDL+W C C C ++ L FDPS
Sbjct: 75 NGVP--TTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQA-----LPYFDPS 127
Query: 134 KSSTSGEIACSDNFCR--TTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQA 191
SST +C C+ + P P C Y +YGD S T+G+ D A
Sbjct: 128 TSSTLSLTSCDSTLCQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGA 187
Query: 192 SGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEF 251
++ V FGCG +G S+ GI GFG+ SL SQL GN F
Sbjct: 188 GASVP------GVAFGCGLFNNGVFKSNE----TGIAGFGRGPLSLPSQL-KVGN----F 232
Query: 252 AHCLDVVKGGGIFAI-----GDVVSP---KVKTTPMVPNMPH---YNVILEEVEVGGNPL 300
+HC V G + D+ V++TP++ N + Y + L+ + VG L
Sbjct: 233 SHCFTAVNGLKPSTVLLDLPADLYKSGRGAVQSTPLIQNPANPTFYYLSLKGITVGSTRL 292
Query: 301 DLPTSLL----GTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
+P S GTG GTIIDSGT + LP +Y LV F
Sbjct: 293 PVPESEFALKNGTG---GTIIDSGTAMTSLPTRVYRLVRDAF 331
>gi|449462551|ref|XP_004149004.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
gi|449515029|ref|XP_004164552.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 434
Score = 97.4 bits (241), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 84/267 (31%), Positives = 123/267 (46%), Gaps = 30/267 (11%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSG 139
+G + + +GTP DTGSDL W C C C +S +F+P +SS+
Sbjct: 87 SGEFLMSIFIGTPPVNVIAIADTGSDLTWTQCLPCRECFNQSQ-----PIFNPRRSSSYR 141
Query: 140 EIACSDNFCRTTYNNRYPSCSPGVR-CEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTA 198
+++C+ + CR+ + C P ++ C Y +YGD S T G D I + G+ K
Sbjct: 142 KVSCASDTCRSLESYH---CGPDLQSCSYGYSYGDRSFTYGDLASDQITI----GSFK-- 192
Query: 199 PLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDV- 257
L +VI GCG++ G G T + G + SL+SQ+ V+ F++CL
Sbjct: 193 -LPKTVI-GCGHQNGGTFGGVTSGIIGL----GGGSLSLVSQMRTIAGVKPRFSYCLPTF 246
Query: 258 -----VKGGGIFAIGDVVS-PKVKTTPMVPNMP--HYNVILEEVEVGGNPLDLPTSLLGT 309
+ G F VVS +V +TP+VP P Y + LE + VG +
Sbjct: 247 FSNANITGTISFGRKAVVSGRQVVSTPLVPRSPDTFYFLTLEAISVGKKRFKAANGISAM 306
Query: 310 GDERGTIIDSGTTLAYLPPMLYDLVLS 336
+ IIDSGTTL LP LY V S
Sbjct: 307 TNHGNIIIDSGTTLTLLPRSLYYGVFS 333
>gi|3036792|emb|CAA18482.1| putative protein (fragment) [Arabidopsis thaliana]
Length = 335
Score = 97.4 bits (241), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 77/244 (31%), Positives = 117/244 (47%), Gaps = 26/244 (10%)
Query: 98 VQVDTGSDLLWVNCAGCSRC-PTKSDL---GIKLTLFDPSKSSTSGEIACSDNFCRTTYN 153
V +DTGSDL WV C C +C PT+ +L++++P S+T+ ++ C+++ C
Sbjct: 2 VALDTGSDLFWVPC-DCGKCAPTEGATYASEFELSIYNPKVSTTNKKVTCNNSLCA---- 56
Query: 154 NRYPSCSPGVRCEYVVTYGDG-SSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQ 212
R C Y+V+Y +STSG + D++ L N + + + V FGCG Q
Sbjct: 57 QRNQCLGTFSTCPYMVSYVSAQTSTSGILMEDVMHLTTEDKNPER--VEAYVTFGCGQVQ 114
Query: 213 SGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSP 272
SG AA +G+ G G S+ S LA G V F+ C G G + GD S
Sbjct: 115 SGSFLDI--AAPNGLFGLGMEKISVPSVLAREGLVADSFSMCFG-HDGVGRISFGDKGSS 171
Query: 273 KVKTTP--MVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPML 330
+ TP + P+ P+YN+ + V VG +D DE + D+GT+ YL +
Sbjct: 172 DQEETPFNLNPSHPNYNITVTRVRVGTTLID---------DEFTALFDTGTSFTYLVDPM 222
Query: 331 YDLV 334
Y V
Sbjct: 223 YTTV 226
>gi|357450869|ref|XP_003595711.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355484759|gb|AES65962.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 443
Score = 97.4 bits (241), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 88/275 (32%), Positives = 128/275 (46%), Gaps = 32/275 (11%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y ++ +GTP + Y QVDTGSDL+W+ C C+ C + + +FDP SST IA
Sbjct: 59 YLMELSIGTPPVKTYAQVDTGSDLIWLQCIPCTNCYKQLN-----PMFDPQSSSTYSNIA 113
Query: 143 CSDNFCRTTYNNRYPSCSPGV-RCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLN 201
C Y+ SCSP C Y +Y D S T G ++ + L +G K L
Sbjct: 114 YGSESCSKLYST---SCSPDQNNCNYTYSYEDDSITEGVLAQETLTLTSTTG--KPVAL- 167
Query: 202 SSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL------ 255
VIFGCG+ +G D + GI+G G+ SL+SQ+ ++ K F+ CL
Sbjct: 168 KGVIFGCGHNNNGVFN---DKEM-GIIGLGRGPLSLVSQIGSSFG-GKMFSQCLVPFHTN 222
Query: 256 DVVKGGGIFAIG-DVVSPKVKTTPMVPNMPH---YNVILEEVEVGGNPLDLPT---SLLG 308
+ F G +V+ V +TP+V H Y V L + V ++LP S L
Sbjct: 223 PSITSPMSFGKGSEVLGNGVVSTPLVSKNTHQAFYFVTLLGISVED--INLPFNDGSSLE 280
Query: 309 TGDERGTIIDSGTTLAYLPPMLYDLVLSQFRFWIA 343
+ +IDSGT LP Y ++ + R +A
Sbjct: 281 PITKGNMVIDSGTPTTLLPEDFYHRLVEEVRNKVA 315
>gi|414589628|tpg|DAA40199.1| TPA: hypothetical protein ZEAMMB73_627989 [Zea mays]
Length = 452
Score = 97.4 bits (241), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 85/284 (29%), Positives = 126/284 (44%), Gaps = 45/284 (15%)
Query: 76 HPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKS 135
PS Y + +GTP +DTGSDL+W CA C+ C ++ D LF P +S
Sbjct: 89 RPSGDLEYVVDLAIGTPPQPVSALLDTGSDLIWTQCAPCASCLSQPD-----PLFAPGQS 143
Query: 136 STSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNL 195
++ + C+ C ++ SC C Y YGDG+ T G + + +SG
Sbjct: 144 ASYEPMRCAGTLCSDILHH---SCERPDTCTYRYNYGDGTMTVGVYATERFTF-ASSGGG 199
Query: 196 KTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL 255
+ FGCG+ G L + + GI+GFG+ SL+SQL+ + F++CL
Sbjct: 200 GLTTTTVPLGFGCGSVNVGSLNNGS-----GIVGFGRNPLSLVSQLSI-----RRFSYCL 249
Query: 256 DVVK------------GGGIFAIGDVVSPKVKTTPMV--PNMP-HYNVILEEVEVGGNPL 300
G++ GD +V+TTP++ P P Y V + VG L
Sbjct: 250 TSYASRRQSTLLFGSLSDGVY--GDATG-RVQTTPLLQSPQNPTFYYVHFTGLTVGARRL 306
Query: 301 DLPTSLL-----GTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
+P S G+G G I+DSGT L LP + V+ FR
Sbjct: 307 RIPESAFALRPDGSG---GVIVDSGTALTLLPAAVLAEVVRAFR 347
>gi|218185383|gb|EEC67810.1| hypothetical protein OsI_35379 [Oryza sativa Indica Group]
Length = 423
Score = 97.4 bits (241), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 86/283 (30%), Positives = 123/283 (43%), Gaps = 27/283 (9%)
Query: 65 ASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC-AGCSRCPTKSDL 123
+++ LEL GN +P G +F + +G P Y++ +DTGS L W+ C C C L
Sbjct: 22 SAVVLELHGNVYP--IGHFFVTMNIGDPAKPYFLDIDTGSTLTWLQCDYPCINCNKAHSL 79
Query: 124 GIKLTL--FDPS---KSSTSGEIACSDNFCRTTYNN-RYP-SCSPGVRCEYVVTYGDGSS 176
+ F P K + C++ C Y + R P C P +C Y + Y GSS
Sbjct: 80 FYPRLIGSFVPHGLYKPELKYAVKCTEQRCADLYADLRKPMKCGPKNQCHYGIQYVGGSS 139
Query: 177 TSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSS 236
G + D L ++G T P +S+ FGCG Q G + V+GILG G+ +
Sbjct: 140 I-GVLIVDSFSLPASNG---TNP--TSIAFGCGYNQ-GKNNHNVPTPVNGILGLGRGKVT 192
Query: 237 LLSQLAAAGNVRKE-FAHCLDVVKGGGIFAIGDVVSPK--VKTTPMVPNMPHYNVILEEV 293
LLSQL + G + K HC+ KG G GD P V +PM HY+ +
Sbjct: 193 LLSQLKSQGVITKHVLGHCIS-SKGKGFLFFGDAKVPTSGVTWSPMNREHKHYSPRQGTL 251
Query: 294 EVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLS 336
+ N + + + I DSG T Y Y LS
Sbjct: 252 QFNSNSKPISAAPM------EVIFDSGATYTYFALQPYHATLS 288
>gi|18461217|dbj|BAB84414.1| chloroplast nucleoid DNA-binding protein cnd41-like [Oryza sativa
Japonica Group]
Length = 446
Score = 97.4 bits (241), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 65/187 (34%), Positives = 97/187 (51%), Gaps = 28/187 (14%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPS 133
+G P +G YF VG+GTP+ + + +DTGSDL+W+ C+ C RC + + +FDP
Sbjct: 77 SGIPFESGEYFALVGVGTPSTKAMLVIDTGSDLVWLQCSPCRRCYAQ-----RGQVFDPR 131
Query: 134 KSSTSGEIACSDNFCRTTYNNRYPSC----SPGVRCEYVVTYGDGSSTSGYFVRDIIQLN 189
+SST + CS CR R+P C + G C Y+V YGDGSS++G D +
Sbjct: 132 RSSTYRRVPCSSPQCRAL---RFPGCDSGGAAGGGCRYMVAYGDGSSSTGDLATDKLAFA 188
Query: 190 QASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAA-GNVR 248
+ ++V GCG G S+ G+LG G+ S+ +Q+A A G+V
Sbjct: 189 NDT-------YVNNVTLGCGRDNEGLFDSAA-----GLLGVGRGKISISTQVAPAYGSV- 235
Query: 249 KEFAHCL 255
F +CL
Sbjct: 236 --FEYCL 240
>gi|449529638|ref|XP_004171805.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like, partial
[Cucumis sativus]
Length = 384
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 84/283 (29%), Positives = 128/283 (45%), Gaps = 35/283 (12%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPS 133
+G +G YFT++G+GTP Y+ +DTGSD++W+ CA C C +++D +F+P
Sbjct: 33 SGLAQGSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTD-----PVFNPV 87
Query: 134 KSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASG 193
KS + ++ C CR + P C+ C Y V+YGDGS T+G FV + + +
Sbjct: 88 KSGSFAKVLCRTPLCRRLES---PGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTK- 143
Query: 194 NLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAH 253
V GCG+ G + G+ S SQ A ++F++
Sbjct: 144 -------VEQVALGCGHDNEGLFVGAAGLLGL-----GRGGLSFPSQ--AGRTFNQKFSY 189
Query: 254 CL----DVVKGGGIFAIGDVVSPKVKTTPMVPNM---PHYNVILEEVEVGGNPLDLPTS- 305
CL K + VS + TP++ N Y V L + VGG P+ T+
Sbjct: 190 CLVDRSASSKPSSVVFGNSAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSGITAS 249
Query: 306 ---LLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFRFWIASL 345
L TG+ G IID GT++ L Y + FR +SL
Sbjct: 250 HFKLDRTGNG-GVIIDCGTSVTRLNKPAYIALRDAFRAGASSL 291
>gi|115452685|ref|NP_001049943.1| Os03g0318400 [Oryza sativa Japonica Group]
gi|108707841|gb|ABF95636.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548414|dbj|BAF11857.1| Os03g0318400 [Oryza sativa Japonica Group]
Length = 434
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 93/282 (32%), Positives = 125/282 (44%), Gaps = 42/282 (14%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPS 133
NG P T Y + +GTP + +DTGSDL+W C C C ++ L FDPS
Sbjct: 75 NGVP--TTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQA-----LPYFDPS 127
Query: 134 KSSTSGEIACSDNFCR--TTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQA 191
SST +C C+ + P P C Y +YGD S T+G+ D A
Sbjct: 128 TSSTLSLTSCDSTLCQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGA 187
Query: 192 SGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEF 251
++ V FGCG +G S+ GI GFG+ SL SQL GN F
Sbjct: 188 GASVP------GVAFGCGLFNNGVFKSNE----TGIAGFGRGPLSLPSQL-KVGN----F 232
Query: 252 AHCLDVVKGGGIFAI-----GDVVSP---KVKTTPMVPNMPH---YNVILEEVEVGGNPL 300
+HC V G + D+ V++TP++ N + Y + L+ + VG L
Sbjct: 233 SHCFTAVNGLKPSTVLLDLPADLYKSGRGAVQSTPLIQNPANPTFYYLSLKGITVGSTRL 292
Query: 301 DLPTSLL----GTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
+P S GTG GTIIDSGT + LP +Y LV F
Sbjct: 293 PVPESEFALKNGTG---GTIIDSGTAMTSLPTRVYRLVRDAF 331
>gi|326532056|dbj|BAK01404.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 506
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 85/300 (28%), Positives = 127/300 (42%), Gaps = 54/300 (18%)
Query: 62 RMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKS 121
R++A+++ +G P +G Y V LGTP + + +DTGSDL W+ CA C C +S
Sbjct: 133 RVVATVE-----SGVPVGSGEYLVDVYLGTPPRRFRMIMDTGSDLNWLQCAPCLDCFEQS 187
Query: 122 DLGIKLTLFDPSKSSTSGEIACSDNFCR--------TTYNNRYPSCSPGVRCEYVVTYGD 173
+FDP+ S + + C D+ CR R P P C Y YGD
Sbjct: 188 G-----PIFDPAASISYRNVTCGDDRCRLVSPPAESAPRECRRPRSDP---CPYYYWYGD 239
Query: 174 GSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSG------------DLGSSTD 221
S+T+G + +N + V FGCG+R G S
Sbjct: 240 QSNTTGDLALEAFTVNLTQSGTRRV---DGVAFGCGHRNRGLFHGAAGLLGLGRGPLSFA 296
Query: 222 AAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSPKVKTTPMVP 281
+ + G+ G G A S L + +A + F H D + + P++ T P
Sbjct: 297 SQLRGVYG-GHAFSYCLVEHGSAAGSKIIFGH-DDAL----------LAHPQLNYTAFAP 344
Query: 282 NM---PHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
Y + L+ + VGG +++ + L G GTIIDSGTTL+Y P Y + F
Sbjct: 345 TTDADTFYYLQLKSILVGGEAVNISSDTLSAG---GTIIDSGTTLSYFPEPAYQAIRQAF 401
>gi|125543640|gb|EAY89779.1| hypothetical protein OsI_11321 [Oryza sativa Indica Group]
Length = 434
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 93/282 (32%), Positives = 125/282 (44%), Gaps = 42/282 (14%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPS 133
NG P T Y + +GTP + +DTGSDL+W C C C ++ L FDPS
Sbjct: 75 NGVP--TTEYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQA-----LPYFDPS 127
Query: 134 KSSTSGEIACSDNFCR--TTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQA 191
SST +C C+ + P P C Y +YGD S T+G+ D A
Sbjct: 128 TSSTLSLTSCDSTLCQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGA 187
Query: 192 SGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEF 251
++ V FGCG +G S+ GI GFG+ SL SQL GN F
Sbjct: 188 GASVP------GVAFGCGLFNNGVFKSNE----TGIAGFGRGPLSLPSQL-KVGN----F 232
Query: 252 AHCLDVVKGGGIFAI-----GDVVSP---KVKTTPMVPNMPH---YNVILEEVEVGGNPL 300
+HC V G + D+ V++TP++ N + Y + L+ + VG L
Sbjct: 233 SHCFTAVNGLKPSTVLLDLPADLYKSGRGAVQSTPLIQNPANPTFYYLSLKGITVGSTRL 292
Query: 301 DLPTSLL----GTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
+P S GTG GTIIDSGT + LP +Y LV F
Sbjct: 293 PVPESEFTLKNGTG---GTIIDSGTAMTSLPTRVYRLVRDAF 331
>gi|18400416|ref|NP_565559.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|20197296|gb|AAM15014.1| predicted protein [Arabidopsis thaliana]
gi|330252412|gb|AEC07506.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 458
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 88/325 (27%), Positives = 132/325 (40%), Gaps = 53/325 (16%)
Query: 46 ERTLSALKQHDTRRHGRMMASIDLELGGNG------HPSATGLYFTKVGLGTPTDEYYVQ 99
E + L + R + SID ELG + T L+ +G P
Sbjct: 53 EDHIKHLTDISSARFKYLQNSIDKELGSSNFQVDVEQAIKTSLFLVNFSVGQPPVPQLTI 112
Query: 100 VDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSC 159
+DTGS LLW+ C C C SD I +F+P+ SST E +C D FCR N C
Sbjct: 113 MDTGSSLLWIQCQPCKHC--SSDHMIH-PVFNPALSSTFVECSCDDRFCRYAPNGH---C 166
Query: 160 SPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGN-LKTAPLNSSVIFGCGNRQSGDLGS 218
+C Y Y G+ + G ++ + +GN + T P + FGCG G
Sbjct: 167 GSSNKCVYEQVYISGTGSKGVLAKERLTFTTPNGNTVVTQP----IAFGCGYEN----GE 218
Query: 219 STDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL-----------DVVKGGGIFAIG 267
++ GILG G +SL QL + +F++C+ +V G +G
Sbjct: 219 QLESHFTGILGLGAKPTSLAVQLGS------KFSYCIGDLANKNYGYNQLVLGEDADILG 272
Query: 268 DVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDL-PTSLLGTGDERGTIIDSGTTLAYL 326
D + +T + Y + LE + VG L++ P G G I+DSGT +L
Sbjct: 273 DPTPIEFETENSI-----YYMNLEGISVGDTQLNIEPVVFKRRGPRTGVILDSGTLYTWL 327
Query: 327 PPMLYDLVLSQF---------RFWI 342
+ Y + ++ RFW
Sbjct: 328 ADIAYRELYNEIKSILDPKLERFWF 352
>gi|449449906|ref|XP_004142705.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
gi|449500739|ref|XP_004161182.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 410
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 84/281 (29%), Positives = 131/281 (46%), Gaps = 32/281 (11%)
Query: 65 ASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC-AGCSRCPTKSDL 123
+SI L + GN +P G + V +G P + + +DTGSDL WV C A C+ C D
Sbjct: 39 SSILLPVKGNVYP--LGHFTVSVTIGNPPKVFELDIDTGSDLTWVQCDAPCTGCTLPHD- 95
Query: 124 GIKLTLFDPSKSSTSGEIACSDNFCRTTYN-NRYPSCSPGVRCEYVVTYGDGSSTSGYFV 182
L+ P + + C + C ++ ++ P +P +C+Y V Y D S+ G V
Sbjct: 96 ----RLYKPHNNV----VRCGEPLCSALFSASKSPCKNPNDQCDYEVEYADHGSSIGVLV 147
Query: 183 RDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLA 242
+D + L +G + L ++ FGCG Q GS G+LG G + +++ +QL+
Sbjct: 148 KDPVPLRLTNGTI----LAPNLGFGCGYDQHNG-GSQLPPLTAGVLGLGNSKATMATQLS 202
Query: 243 AAGNVRKEFAHCLDVVKGGGIFAIGDVVSPKVKTTPMVPNMP--HYNVILEEVEVGGNPL 300
A +VR HC GG +F GD+V + + P Y+ EV GGNP+
Sbjct: 203 ALSHVRNVLGHCFSGQGGGFLFFGGDLVPSSGMSWMPILRTPGGKYSAGPAEVYFGGNPV 262
Query: 301 DLPTSLLGTGDERGTII--DSGTTLAYLPPMLYDLVLSQFR 339
+ RG I+ DSG++ Y +Y VL+ R
Sbjct: 263 GI----------RGLILTFDSGSSYTYFNSQVYGAVLNLLR 293
>gi|158513711|sp|A2ZC67.2|ASP1_ORYSI RecName: Full=Aspartic proteinase Asp1; Short=OSAP1; Short=OsAsp1;
AltName: Full=Nucellin-like protein; Flags: Precursor
Length = 410
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 84/281 (29%), Positives = 123/281 (43%), Gaps = 36/281 (12%)
Query: 65 ASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVN----CAGCSRCPTK 120
+++ LEL GN +P G +F + +G P Y++ +DTGS L W+ C C++ P
Sbjct: 22 SAVVLELHGNVYP--IGHFFVTMNIGDPAKPYFLDIDTGSTLTWLQCDYPCINCNKVPH- 78
Query: 121 SDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNN-RYP-SCSPGVRCEYVVTYGDGSSTS 178
L+ P + C++ C Y + R P C P +C Y + Y GSS
Sbjct: 79 -------GLYKPELKYA---VKCTEQRCADLYADLRKPMKCGPKNQCHYGIQYVGGSSI- 127
Query: 179 GYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLL 238
G + D L ++G T P +S+ FGCG Q G + V+GILG G+ +LL
Sbjct: 128 GVLIVDSFSLPASNG---TNP--TSIAFGCGYNQ-GKNNHNVPTPVNGILGLGRGKVTLL 181
Query: 239 SQLAAAGNVRKE-FAHCLDVVKGGGIFAIGDVVSPK--VKTTPMVPNMPHYNVILEEVEV 295
SQL + G + K HC+ KG G GD P V +PM HY+ ++
Sbjct: 182 SQLKSQGVITKHVLGHCIS-SKGKGFLFFGDAKVPTSGVTWSPMNREHKHYSPRQGTLQF 240
Query: 296 GGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLS 336
N + + + I DSG T Y Y LS
Sbjct: 241 NSNSKPISAAPM------EVIFDSGATYTYFALQPYHATLS 275
>gi|30678047|ref|NP_565298.2| aspartyl protease-like protein [Arabidopsis thaliana]
gi|30102688|gb|AAP21262.1| At2g03200 [Arabidopsis thaliana]
gi|110736021|dbj|BAE99983.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330250580|gb|AEC05674.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 461
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 87/289 (30%), Positives = 128/289 (44%), Gaps = 63/289 (21%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRC---PTKSDLGIKLTLFDPSKSS 136
+G + ++ +G P +Y VDTGSDL+W C C+ C PT +FDP KSS
Sbjct: 104 SGEFLMELSIGNPAVKYSAIVDTGSDLIWTQCKPCTECFDQPTP--------IFDPEKSS 155
Query: 137 TSGEIACSDNFC----RTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQAS 192
+ ++ CS C R+ N + CEY+ TYGD SST G + +
Sbjct: 156 SYSKVGCSSGLCNALPRSNCNEDKDA------CEYLYTYGDYSSTRGLLATETFTFEDEN 209
Query: 193 GNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFA 252
S + FGCG GD G S + G++G G+ SL+SQL +F+
Sbjct: 210 S-------ISGIGFGCGVENEGD-GFSQGS---GLVGLGRGPLSLISQLK-----ETKFS 253
Query: 253 HCLDVV---KGGGIFAIGDVVSPKV------------KTTPMV--PNMPH-YNVILEEVE 294
+CL + + IG + S V KT ++ P+ P Y + L+ +
Sbjct: 254 YCLTSIEDSEASSSLFIGSLASGIVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGIT 313
Query: 295 VGGNPLDLPTSLL-----GTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
VG L + S GTG G IIDSGTT+ YL + ++ +F
Sbjct: 314 VGAKRLSVEKSTFELAEDGTG---GMIIDSGTTITYLEETAFKVLKEEF 359
>gi|414876414|tpg|DAA53545.1| TPA: hypothetical protein ZEAMMB73_483039 [Zea mays]
Length = 506
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 82/272 (30%), Positives = 122/272 (44%), Gaps = 41/272 (15%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSG 139
+G YF++VG+G+P + Y+ +DTGSD+ WV C C+ C +SD +FDPS S++
Sbjct: 163 SGEYFSRVGIGSPARQLYMVLDTGSDVTWVQCQPCADCYQQSD-----PVFDPSLSASYA 217
Query: 140 EIACSDNFCRTTYNNRYPSCSPGV-RCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTA 198
++C CR + +C C Y V YGDGS T G F + + L ++
Sbjct: 218 AVSCDSQRCR---DLDTAACRNATGACLYEVAYGDGSYTVGDFATETLTLGDST------ 268
Query: 199 PLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL--- 255
P+ +V GCG+ G + G S SQ++A+ F++CL
Sbjct: 269 PVG-NVAIGCGHDNEGLFVGAAGLLALGGGPL-----SFPSQISAS-----TFSYCLVDR 317
Query: 256 ------DVVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLL-- 307
+ G G G V +P V+ +P Y V L + VGG PL +P S
Sbjct: 318 DSPAASTLQFGDGAAEAGTVTAPLVR-SPRTSTF--YYVALSGISVGGQPLSIPASAFAM 374
Query: 308 -GTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
T G I+DSGT + L Y + F
Sbjct: 375 DATSGSGGVIVDSGTAVTRLQSAAYAALRDAF 406
>gi|30686482|ref|NP_850251.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|122215044|sp|Q3EBM5.1|ASPR1_ARATH RecName: Full=Probable aspartic protease At2g35615; Flags:
Precursor
gi|330254036|gb|AEC09130.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 447
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 86/296 (29%), Positives = 130/296 (43%), Gaps = 37/296 (12%)
Query: 57 TRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSR 116
+RR ++ DL+ G G A G +F + +GTP + + DTGSDL WV C C +
Sbjct: 62 SRRFNHQLSQTDLQSGLIG---ADGEFFMSITIGTPPIKVFAIADTGSDLTWVQCKPCQQ 118
Query: 117 CPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSS 176
C ++ +FD KSST C C+ + C+Y +YGD S
Sbjct: 119 CYKENG-----PIFDKKKSSTYKSEPCDSRNCQALSSTERGCDESNNICKYRYSYGDQSF 173
Query: 177 TSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSS 236
+ G + + ++ ASG+ + P +FGCG G D GI+G G + S
Sbjct: 174 SKGDVATETVSIDSASGSPVSFP---GTVFGCGYNNGGTF----DETGSGIIGLGGGHLS 226
Query: 237 LLSQLAAAGNVRKEFAHCLD----VVKGGGIFAIGDVVSPK-------VKTTPMVPNMP- 284
L+SQL ++ + K+F++CL G + +G P V +TP+V P
Sbjct: 227 LISQLGSS--ISKKFSYCLSHKSATTNGTSVINLGTNSIPSSLSKDSGVVSTPLVDKEPL 284
Query: 285 -HYNVILEEVEVGGNPLDLPTSLLGTGDE-------RGTIIDSGTTLAYLPPMLYD 332
+Y + LE + VG + S D+ IIDSGTTL L +D
Sbjct: 285 TYYYLTLEAISVGKKKIPYTGSSYNPNDDGILSETSGNIIIDSGTTLTLLEAGFFD 340
>gi|297605079|ref|NP_001056639.2| Os06g0121800 [Oryza sativa Japonica Group]
gi|255676668|dbj|BAF18553.2| Os06g0121800 [Oryza sativa Japonica Group]
Length = 487
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 78/258 (30%), Positives = 117/258 (45%), Gaps = 36/258 (13%)
Query: 98 VQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYP 157
+ +DT DL W+ CA C P + LFDP +S TS + C C RY
Sbjct: 164 MSIDTSIDLPWIQCAPC---PMPECYPQQNALFDPRRSRTSAAVPCGSAACGEL--GRYG 218
Query: 158 SCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLG 217
+ +C+Y V YGDG +TSG ++ D + LN + T +N FGC + G+
Sbjct: 219 AGCSNNQCQYFVDYGDGRATSGTYMVDALTLNPS-----TVVMNFR--FGCSHAVRGNFS 271
Query: 218 SSTDAAVDGILGFGQANSSLLSQLAAA-GNVRKEFAHCLDVVKGGGIFAIGDVV------ 270
+ST G + G SLLSQ AA GN F++C+ G ++G
Sbjct: 272 AST----SGTMSLGGGRQSLLSQTAATFGNA---FSYCVPDPSSSGFLSLGGPADGGGAG 324
Query: 271 ----SPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYL 326
+P V+ ++P + Y V L +EVGG L++P + G ++DS + L
Sbjct: 325 RFARTPLVRNPSIIPTL--YLVRLRGIEVGGRRLNVPPVVFAG----GAVMDSSVIITQL 378
Query: 327 PPMLYDLVLSQFRFWIAS 344
PP Y + FR +A+
Sbjct: 379 PPTAYRALRLAFRSAMAA 396
>gi|302758750|ref|XP_002962798.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
gi|300169659|gb|EFJ36261.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
Length = 427
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 93/279 (33%), Positives = 133/279 (47%), Gaps = 34/279 (12%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSG 139
+G YF ++ +GTP ++ + VDTGSDL W+ C + T + +D S SS+
Sbjct: 56 SGQYFVELRVGTPAKKFPLIVDTGSDLTWIQCNPPNT--TANSSSPPAPWYDKSSSSSYR 113
Query: 140 EIACSDNFCRTTYNNRYPSCS--PGVRCEYVVTYGDGSSTSGYFVRDIIQLN------QA 191
EI C+D+ C+ SCS C+Y Y D S T+G + I + +
Sbjct: 114 EIPCTDDECQFLPAPIGSSCSITSPSPCDYTYGYSDQSRTTGILAYETISMKSRKRSGKR 173
Query: 192 SGNLKTAPLN-SSVIFGCGNRQSGD--LGSSTDAAVDGILGFGQANSSLLSQL--AAAGN 246
+GN KT + +V GC G LG+S G+LG GQ SL +Q A G
Sbjct: 174 AGNHKTRRIRIKNVALGCSRESVGASFLGAS------GVLGLGQGPISLATQTRHTALGG 227
Query: 247 VRKEFAHCL-DVVKG---GGIFAIGDVVSPKVKTTPMVPN---MPHYNVILEEVEVGGNP 299
+ F++CL D ++G +G K+ TP+V N Y V + V V G P
Sbjct: 228 I---FSYCLVDYLRGSNASSFLVMGRTHWRKLAHTPIVRNPAAQSFYYVNVTGVAVDGKP 284
Query: 300 LD-LPTSLLGT-GD-ERGTIIDSGTTLAYLPPMLYDLVL 335
+D + +S G GD +GTI DSGTTL+YL Y VL
Sbjct: 285 VDGIASSDWGIDGDGNKGTIFDSGTTLSYLREPAYSKVL 323
>gi|356524289|ref|XP_003530762.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Glycine max]
Length = 392
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 80/267 (29%), Positives = 117/267 (43%), Gaps = 31/267 (11%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCS-RCPTKSDLGIKLTLFDPSKSSTSGEI 141
Y VGLGTP + + DTGSDL W C C+ C + D +FDPSKSS+ I
Sbjct: 46 YVVVVGLGTPKRDLSLVFDTGSDLTWTQCEPCAGSCYKQQD-----AIFDPSKSSSYTNI 100
Query: 142 ACSDNFC-RTTYNNRYPSCSPG--VRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTA 198
C+ + C + T + CS C Y YGD S++ G+ L+Q +
Sbjct: 101 TCTSSLCTQLTSDGIKSECSSSTDASCIYDAKYGDNSTSVGF-------LSQERLTITAT 153
Query: 199 PLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVV 258
+ +FGCG G S G++G G+ S++ Q ++ N K F++CL
Sbjct: 154 DIVDDFLFGCGQDNEGLFNGSA-----GLMGLGRHPISIVQQTSS--NYNKIFSYCLPAT 206
Query: 259 K---GGGIFAIGDVVSPKVKTTPMVP---NMPHYNVILEEVEVGGNPLDLPTSLLGTGDE 312
G F + + TP+ + Y + + + VGG LP T
Sbjct: 207 SSSLGHLTFGASAATNASLIYTPLSTISGDNSFYGLDIVSISVGGTK--LPAVSSSTFSA 264
Query: 313 RGTIIDSGTTLAYLPPMLYDLVLSQFR 339
G+IIDSGT + L P +Y + S FR
Sbjct: 265 GGSIIDSGTVITRLAPTVYAALRSAFR 291
>gi|326499093|dbj|BAK06037.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 471
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 85/273 (31%), Positives = 120/273 (43%), Gaps = 36/273 (13%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCA--------GCSRCPTKSDLGIKLTLFDPSK 134
Y V +GTP DTGSDL+W+NC+ +R G++ FDPSK
Sbjct: 100 YLMAVNIGTPPTRMVAIADTGSDLIWLNCSYGGDGPGLAAARDADAQPPGVQ---FDPSK 156
Query: 135 SSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGN 194
S+T + C C SC +C Y +YGDGS TSG + A G
Sbjct: 157 STTFRLVDCDSVACSELPEA---SCGADSKCRYSYSYGDGSHTSGVLSTETFTFADAPGA 213
Query: 195 L--KTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFA 252
T ++V FGC +GSS + G+ + SL+SQL A ++ + F+
Sbjct: 214 RGDGTTTRVANVNFGCSTTF---VGSSVGDGLVGLG---GGDLSLVSQLGADTSLGRRFS 267
Query: 253 HCL--DVVKGGGIFAIGD---VVSPKVKTTPMVPNM--PHYNVILEEVEVGGNPLDLPTS 305
+CL VK G V P TTP++P+ +Y V L V+VG + P
Sbjct: 268 YCLVPYSVKASSALNFGPRAAVTDPGAVTTPLIPSQVKAYYIVELRSVKVGNKTFEAP-- 325
Query: 306 LLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
D I+DSGTTL +LP L D ++ +
Sbjct: 326 -----DRSPLIVDSGTTLTFLPEALVDPLVKEL 353
>gi|296085638|emb|CBI29432.3| unnamed protein product [Vitis vinifera]
Length = 337
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 76/238 (31%), Positives = 112/238 (47%), Gaps = 33/238 (13%)
Query: 45 RERTL-SALKQHDTRRHGRMMASIDLELGGN-------GHPSATGLYFTKVGLGTPTDEY 96
R +TL S L + DTR ++ D+ + G +G Y+ KVG G+P Y
Sbjct: 72 RVKTLNSRLTRKDTRFPKSVLTKKDIRFPKSVSVPLNPGASIGSGNYYVKVGFGSPARYY 131
Query: 97 YVQVDTGSDLLWVNCAGC-SRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRT----T 151
+ VDTGS L W+ C C C ++D LFDPS S T ++C+ + C + T
Sbjct: 132 SMIVDTGSSLSWLQCKPCVVYCHVQAD-----PLFDPSASKTYKSLSCTSSQCSSLVDAT 186
Query: 152 YNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNR 211
NN S V C Y +YGD S + GY +D++ L + +T P ++GCG
Sbjct: 187 LNNPLCETSSNV-CVYTASYGDSSYSMGYLSQDLLTLAPS----QTLP---GFVYGCGQD 238
Query: 212 QSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDV 269
G G + GILG G+ S+L Q+++ F++CL GGG +IG
Sbjct: 239 SDGLFGRAA-----GILGLGRNKLSMLGQVSS--KFGYAFSYCLPTRGGGGFLSIGKA 289
>gi|224126551|ref|XP_002329582.1| predicted protein [Populus trichocarpa]
gi|222870291|gb|EEF07422.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 90/299 (30%), Positives = 130/299 (43%), Gaps = 39/299 (13%)
Query: 46 ERTLSALKQHDTRRH--GRMMASI-----DLELGGNGHPSATGLYFTKVGLGTPTDEYYV 98
+R A+++ +R H R A++ + E+ NG G Y + LGTP E
Sbjct: 54 QRWNKAMRRSVSRVHHFQRTAATVSPKEVESEIIANG-----GEYLMSLSLGTPPFEILA 108
Query: 99 QVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPS 158
DTGSDL+W C C +C + LFDP S T +++C C+ S
Sbjct: 109 IADTGSDLIWTQCTPCDKCYKQ-----IAPLFDPKSSKTYRDLSCDTRQCQNL--GESSS 161
Query: 159 CSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGS 218
CS C+Y YGD S T+G D + L +G P + GCG R +G
Sbjct: 162 CSSEQLCQYSYYYGDRSFTNGNLAVDTVTLPSTNGGPVYFP---KTVIGCGRRNNGTF-- 216
Query: 219 STDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGI-------FAIGDVVS 271
D GI+G G SL+SQ+ ++ V +F++CL F VVS
Sbjct: 217 --DKKDSGIIGLGGGPMSLISQMGSS--VGGKFSYCLVPFSSESAGNSSKLHFGRNAVVS 272
Query: 272 -PKVKTTPMVPNMP--HYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLP 327
V++TP++ P Y + LE + VG ++ G E IIDSGT+L P
Sbjct: 273 GSGVQSTPLISKNPDTFYYLTLEAMSVGDKKIEF-GGSSFGGSEGNIIIDSGTSLTLFP 330
>gi|224142013|ref|XP_002324355.1| predicted protein [Populus trichocarpa]
gi|222865789|gb|EEF02920.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 79/269 (29%), Positives = 120/269 (44%), Gaps = 32/269 (11%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGC-SRCPTKSDLGIKLTLFDPSKSSTSG 139
G Y VGLGTP ++ + DTGSDL W C C C ++ FDP+ S++
Sbjct: 138 GAYVVTVGLGTPKKDFTLSFDTGSDLTWTQCEPCLGGCFPQNQ-----PKFDPTTSTSYK 192
Query: 140 EIACSDNFCRTTYNNRYPS--CSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKT 197
++CS FC+ YP+ C C Y + YG G T G+ + + + +
Sbjct: 193 NVSCSSEFCKLIAEGNYPAQDCISNT-CLYGIQYGSG-YTIGFLATETLA-------IAS 243
Query: 198 APLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDV 257
+ + + +FGC G +T G+LG G++ +L SQ + F++CL
Sbjct: 244 SDVFKNFLFGCSEESRGTFNGTT-----GLLGLGRSPIALPSQ--TTNKYKNLFSYCLPA 296
Query: 258 VKGG-GIFAIGDVVSPKVKTTPMVPNMPH-YNVILEEVEVGGNPLDLPTSLLGTGDERGT 315
G + G VS K+TP+ P + Y + + V G L + G T
Sbjct: 297 SPSSTGHLSFGVEVSQAAKSTPISPKLKQLYGLNTVGISVRGRELPI------NGSISRT 350
Query: 316 IIDSGTTLAYLPPMLYDLVLSQFRFWIAS 344
IIDSGTT +LP Y + S FR +A+
Sbjct: 351 IIDSGTTFTFLPSPTYSALGSAFREMMAN 379
>gi|359476193|ref|XP_003631802.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Vitis vinifera]
Length = 496
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 82/263 (31%), Positives = 115/263 (43%), Gaps = 49/263 (18%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGE 140
G + V GTP ++ + +DTGS + W C C RC L FDPS S T
Sbjct: 160 GNFLVDVAFGTPPQKFTLILDTGSSITWTQCKPCVRC-----LKASRRHFDPSASLTYSL 214
Query: 141 IACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPL 200
+C + TYN +TYGD S++ G + D + L + +
Sbjct: 215 GSCIPSTVGNTYN---------------MTYGDKSTSVGNYGCDTMTLEHSD-------V 252
Query: 201 NSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKG 260
FGCG GD GS DG+LG GQ S +SQ A+ +K F++CL
Sbjct: 253 FPKFQFGCGRNNEGDFGS----GADGMLGLGQGQLSTVSQTAS--KFKKVFSYCLPEEDS 306
Query: 261 GGIFAIGDVV---SPKVKTTPMVPNMP---------HYNVILEEVEVGGNPLDLPTSLLG 308
G G+ S +K T +V N P +Y V L ++ VG L++P+S+
Sbjct: 307 IGSLLFGEKATSQSSSLKFTSLV-NGPGTSGLEESGYYFVKLLDISVGNKRLNIPSSVFA 365
Query: 309 TGDERGTIIDSGTTLAYLPPMLY 331
+ GTIIDSGT + LP Y
Sbjct: 366 S---PGTIIDSGTVITRLPQRAY 385
>gi|302811785|ref|XP_002987581.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
gi|300144735|gb|EFJ11417.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
Length = 511
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 83/279 (29%), Positives = 131/279 (46%), Gaps = 40/279 (14%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y+ + +GTP E + +DTGSD+ W+ C C C + F+P SS+ ++
Sbjct: 139 YYVPLQVGTPAVEVVLIMDTGSDVSWIQCVPCKDC-----VPALRPPFNPRHSSSFFKLP 193
Query: 143 CSDNFCRTTYNNRYPSCSPGVR-CEYVVTYGDGSSTSGYFVRDIIQLNQAS-GNLKTAPL 200
C+ + C Y P CSP R C + + YGDGS +SG + I N + G+ + L
Sbjct: 194 CASSTCTNVYQGVKPFCSPSGRTCLFSIQYGDGSLSSGLLAMETIAGNTPNFGDGEPVKL 253
Query: 201 NSSVIFGCG--NRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL-DV 257
S++ GC +R+ G+S G+LG + S SQL++ ++F+HC D
Sbjct: 254 -SNITLGCADIDREGLPTGAS------GLLGMDRRPISFPSQLSS--RYARKFSHCFPDK 304
Query: 258 V-----KGGGIFAIGDVVSPKVKTTPMVPN-------MPHYNVILEEVEVGGNPLDLP-- 303
+ G F D++SP ++ TP+V N + +Y V L + V + L L
Sbjct: 305 IAHLNSSGLVFFGESDIISPYLRYTPLVQNPAVPSASLDYYYVGLVGISVDESRLPLSHK 364
Query: 304 ----TSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
+ G+G GTIIDSGT YL + + +F
Sbjct: 365 NFDIDKVTGSG---GTIIDSGTAFTYLKKPAFQAMRREF 400
>gi|223946655|gb|ACN27411.1| unknown [Zea mays]
Length = 378
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 70/217 (32%), Positives = 109/217 (50%), Gaps = 24/217 (11%)
Query: 127 LTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCS-PGVRCEYVVTY-GDGSSTSGYFVRD 184
L ++ P++S+TS + CS C++ P C+ P C Y + Y + +++SG + D
Sbjct: 6 LRIYRPAESTTSRHLPCSHELCQSV-----PGCTNPKQPCPYNIDYFSENTTSSGLLIED 60
Query: 185 IIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAA 244
+ LN ++ P+N+SVI GCG +QSGD A DG+LG G A+ S+ S LA A
Sbjct: 61 TLHLNYREDHV---PVNASVIIGCGQKQSGDYLDGI--APDGLLGLGMADISVPSFLARA 115
Query: 245 GNVRKEFAHCLDVVKGGGIFAIGDVVSPKVKTTPMVP---NMPHYNVILEEVEVGGNPLD 301
G V+ F+ C G IF GD P ++TP VP + Y V +++ +G L+
Sbjct: 116 GLVQNSFSMCFKEDSSGRIF-FGDQGVPSQQSTPFVPLYGKLQTYAVNVDKSCIGHKCLE 174
Query: 302 LPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
G ++DSGT+ LP +Y +F
Sbjct: 175 --------GTSFKALVDSGTSFTSLPFDVYKAFTMEF 203
>gi|326506682|dbj|BAJ91382.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326525815|dbj|BAJ88954.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 83/258 (32%), Positives = 117/258 (45%), Gaps = 29/258 (11%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCS-RCPTKSDLGIKLTLFDPSKSSTSG 139
G Y T++GLGTP Y + VDTGS L W+ C+ C C +S +FDP SS+
Sbjct: 135 GNYVTRMGLGTPAKPYIMVVDTGSSLTWLQCSPCRVSCHRQSG-----PVFDPKTSSSYA 189
Query: 140 EIACSDNFCR--TTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKT 197
++CS C +T +CS C Y +YGD S + GY +D + S
Sbjct: 190 AVSCSTPQCNDLSTATLNPAACSSSDVCIYQASYGDSSFSVGYLSKDTVSFGSNS----- 244
Query: 198 APLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDV 257
+ +GCG G G S G++G + SLL QLA + F++CL
Sbjct: 245 ---VPNFYYGCGQDNEGLFGRSA-----GLMGLARNKLSLLYQLAP--TLGYSFSYCLPS 294
Query: 258 VKGGGIFAIGDVVSPKVKTTPMVPNM---PHYNVILEEVEVGGNPLDLPTSLLGTGDERG 314
G +IG + TPMV + Y + L + V G PL + +S +
Sbjct: 295 SSSSGYLSIGSYNPGQYSYTPMVSSTLDDSLYFIKLSGMTVAGKPLAVSSSEYSS---LP 351
Query: 315 TIIDSGTTLAYLPPMLYD 332
TIIDSGT + LP +YD
Sbjct: 352 TIIDSGTVITRLPTTVYD 369
>gi|242063796|ref|XP_002453187.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
gi|241933018|gb|EES06163.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
Length = 493
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 85/270 (31%), Positives = 124/270 (45%), Gaps = 35/270 (12%)
Query: 83 YFTKVGLGTPTDEYYVQ-VDTGSDLLWVNCAGC-SRCPTKSDLGIKLTLFDPSKSSTSGE 140
Y V LG+P + +DTGSD+ WV C C +C + D LFDPS SST
Sbjct: 140 YVITVRLGSPPGKSQTMLIDTGSDISWVRCKPCWQQCRPQVD-----PLFDPSLSSTYSP 194
Query: 141 IACSDNFCRTTYNN-RYPSCSPGVRCEYVVTYGDGS-STSGYFVRDIIQLNQASGNLKTA 198
+CS C + CS +C+Y+ YGDGS T+G + D + L S +
Sbjct: 195 FSCSSAACAQLFQEGNANGCSSSGQCQYIAMYGDGSVGTTGTYSSDTLALGSNSNTV--- 251
Query: 199 PLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLD-- 256
+ S FGC + ++G G++G G SL+SQ A F++CL
Sbjct: 252 -VVSKFRFGCSHAETG-----ITGLTAGLMGLGGGAQSLVSQTAGTFGT-TAFSYCLPPT 304
Query: 257 -------VVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGT 309
+ G + G V +P ++++ VP Y V LE + VGG L +PT++
Sbjct: 305 PSSSGFLTLGAAGTSSAGFVKTPMLRSS-QVPAF--YGVRLEAIRVGGRQLSIPTTVF-- 359
Query: 310 GDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
G I+DSGT + LPP Y + S F+
Sbjct: 360 --SAGMIMDSGTVVTRLPPTAYSSLSSAFK 387
>gi|357514995|ref|XP_003627786.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521808|gb|AET02262.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 436
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 84/277 (30%), Positives = 122/277 (44%), Gaps = 47/277 (16%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGE 140
G Y +GTP + Y VDTGSD++W+ C C C ++ +F+PSKSS+
Sbjct: 85 GEYLMTYSVGTPPFKLYGIVDTGSDIVWLQCEPCQECYNQT-----TPMFNPSKSSSYKN 139
Query: 141 IACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPL 200
I C C++ + SC+ CEY YGD S + G D + L +G + P
Sbjct: 140 IPCPSKLCQSMEDT---SCNDKNYCEYSTYYGDNSHSGGDLSVDTLTLESTNGLTVSFP- 195
Query: 201 NSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVV-- 258
+++ GCG S + A GI+GFG +S ++QL ++ +F++CL +
Sbjct: 196 --NIVIGCGTNNI----LSYEGASSGIVGFGSGPASFITQLGSS--TGGKFSYCLTPLFS 247
Query: 259 ------KGGGIFAIGDVVSPK---VKTTPMVPNMPH--YNVILE-------EVEVGGNPL 300
GD + V TTP++ P Y + LE VE+GG P
Sbjct: 248 VTNIQSNATSKLNFGDAATVSGDGVVTTPILKKDPETFYYLTLEAFSVGNRRVEIGGVP- 306
Query: 301 DLPTSLLGTGDERGT-IIDSGTTLAYLPPMLYDLVLS 336
GD G IIDSGTTL L Y + S
Sbjct: 307 --------NGDNEGNIIIDSGTTLTSLTKDDYSFLES 335
>gi|218197468|gb|EEC79895.1| hypothetical protein OsI_21423 [Oryza sativa Indica Group]
Length = 471
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 78/258 (30%), Positives = 117/258 (45%), Gaps = 36/258 (13%)
Query: 98 VQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYP 157
+ +DT DL W+ CA C P + LFDP +S TS + C C RY
Sbjct: 148 MSIDTSIDLPWIQCAPC---PMPECYPQQNALFDPRRSRTSAAVPCGSAACGEL--GRYG 202
Query: 158 SCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLG 217
+ +C+Y V YGDG +TSG ++ D + LN + T +N FGC + G+
Sbjct: 203 AGCSNNQCQYFVDYGDGRATSGTYMVDALTLNPS-----TVVMNFR--FGCSHAVRGNFS 255
Query: 218 SSTDAAVDGILGFGQANSSLLSQLAAA-GNVRKEFAHCLDVVKGGGIFAIGDVV------ 270
+ST G + G SLLSQ AA GN F++C+ G ++G
Sbjct: 256 AST----SGTMSLGGGRQSLLSQTAATFGNA---FSYCVPDPSSSGFLSLGGPADGGGAG 308
Query: 271 ----SPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYL 326
+P V+ ++P + Y V L +EVGG L++P + G ++DS + L
Sbjct: 309 RFARTPLVRNPSIIPTL--YLVRLRGIEVGGRRLNVPPVVFAG----GAVMDSSVIITQL 362
Query: 327 PPMLYDLVLSQFRFWIAS 344
PP Y + FR +A+
Sbjct: 363 PPTAYRALRLAFRSAMAA 380
>gi|242081123|ref|XP_002445330.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
gi|241941680|gb|EES14825.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
Length = 543
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 88/285 (30%), Positives = 124/285 (43%), Gaps = 51/285 (17%)
Query: 83 YFTKVGLG-----TPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSST 137
Y T + LG +P V VDTGSDL WV C CS C + D LFDP+ S+T
Sbjct: 185 YVTTIALGGGSSGSPAANLTVIVDTGSDLTWVQCKPCSACYAQRD-----PLFDPAGSAT 239
Query: 138 SGEIACSDNFCRTTYNNRYPSCSPGV------RCEYVVTYGDGSSTSGYFVRDIIQLNQA 191
+ C+ + C + + + +PG RC Y + YGDGS + G D + L A
Sbjct: 240 YAAVRCNASACAASL--KAATGTPGSCGGGNERCYYALAYGDGSFSRGVLATDTVALGGA 297
Query: 192 SGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAA-GNVRKE 250
S + +FGCG G G + G++G G+ SL+SQ A G V
Sbjct: 298 SLD--------GFVFGCGLSNRGLFGGTA-----GLMGLGRTELSLVSQTALRYGGV--- 341
Query: 251 FAHCLDVVKGG---GIFAIGDVVSPKVKTTPMV--------PNMPHYNVILEEVEVGGNP 299
F++CL G G ++G S TTP+ P Y + + VGG
Sbjct: 342 FSYCLPATTSGDASGSLSLGGDASSYRNTTPVAYTRMIADPAQPPFYFLNVTGAAVGGTA 401
Query: 300 LDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFRFWIAS 344
L LG + +IDSGT + L P +Y V ++F A+
Sbjct: 402 --LAAQGLGASN---VLIDSGTVITRLAPSVYRGVRAEFTRQFAA 441
>gi|449432044|ref|XP_004133810.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 471
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 84/283 (29%), Positives = 128/283 (45%), Gaps = 35/283 (12%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPS 133
+G +G YFT++G+GTP Y+ +DTGSD++W+ CA C C +++D +F+P
Sbjct: 120 SGLAQGSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTD-----PVFNPV 174
Query: 134 KSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASG 193
KS + ++ C CR + P C+ C Y V+YGDGS T+G FV + + +
Sbjct: 175 KSGSFAKVLCRTPLCRRLES---PGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKV 231
Query: 194 NLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAH 253
V GCG+ G + G+ S SQ A ++F++
Sbjct: 232 E--------QVALGCGHDNEGLFVGAAGLLGL-----GRGGLSFPSQ--AGRTFNQKFSY 276
Query: 254 CL----DVVKGGGIFAIGDVVSPKVKTTPMVPNM---PHYNVILEEVEVGGNPLDLPTS- 305
CL K + VS + TP++ N Y V L + VGG P+ T+
Sbjct: 277 CLVDRSASSKPSSVVFGNSAVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSGITAS 336
Query: 306 ---LLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFRFWIASL 345
L TG+ G IID GT++ L Y + FR +SL
Sbjct: 337 HFKLDRTGNG-GVIIDCGTSVTRLNKPAYIALRDAFRAGASSL 378
>gi|255543383|ref|XP_002512754.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223547765|gb|EEF49257.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 414
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 84/257 (32%), Positives = 120/257 (46%), Gaps = 44/257 (17%)
Query: 98 VQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYP 157
V VDTGSDL WV C C C + D LF+PS S + I C+ + C++ +Y
Sbjct: 80 VIVDTGSDLTWVQCQPCRLCYNQQD-----PLFNPSGSPSYQTILCNSSTCQSL---QYA 131
Query: 158 SCSPGV------RCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNR 211
+ + GV C YVV YGDGS T G L NL T + S+ IFGCG
Sbjct: 132 TGNLGVCGSNTPTCNYVVNYGDGSYTRG-------DLGMEQLNLGTTHV-SNFIFGCGRN 183
Query: 212 QSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL--DVVKGGGIFAIGDV 269
G G ++ G++G G+++ SL+SQ +A F++CL G +G
Sbjct: 184 NKGLFGGAS-----GLMGLGKSDLSLVSQTSAI--FEGVFSYCLPTTAADASGSLILGGN 236
Query: 270 VSPKVKTTPMV-------PNMP-HYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGT 321
S TTP+ P +P Y + L + +GG L P + G +IDSGT
Sbjct: 237 SSVYKNTTPISYTRMIANPQLPTFYFLNLTGISIGGVALQAPNY-----RQSGILIDSGT 291
Query: 322 TLAYLPPMLYDLVLSQF 338
+ LPP +Y + ++F
Sbjct: 292 VITRLPPPVYRDLKAEF 308
>gi|326533540|dbj|BAK05301.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 410
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 79/287 (27%), Positives = 123/287 (42%), Gaps = 41/287 (14%)
Query: 66 SIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC----AGCSRCPTKS 121
+I L GN +P G ++ + +G P Y++ VDTGS+L W+ C GC C +
Sbjct: 23 AIKFPLEGNVYP--VGHFYATLNIGEPAKPYFLDVDTGSNLTWLECHHPVHGCKGCHPRP 80
Query: 122 DLGIKLTLFDPSKSSTSG--EIACSDNFCRTTYNNR--YPSCSPG--VRCEYVVTYGDGS 175
P + G ++ C C + P CS RC Y + Y G
Sbjct: 81 P--------HPYYTPADGNLKVVCGSPLCVAVRRDVPGIPECSRNDPHRCHYEIQYVTGK 132
Query: 176 STSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANS 235
S G DII +N + FGCG +Q + S + VDGILG G +
Sbjct: 133 S-EGDLATDIISVNGRD--------KKRIAFGCGYKQE-EPADSPPSPVDGILGLGMGKA 182
Query: 236 SLLSQLAAAGNVRKE-FAHCLDVVKGGGIFAIGDVVSPK--VKTTPMVPNMPHYNVILEE 292
L +QL +++ HCL KG G+ +GD P V PM ++ +Y+ L E
Sbjct: 183 GLAAQLKGHKMIKENVIGHCLS-SKGKGVLYVGDFNPPTRGVTWAPMRESLFYYSPGLAE 241
Query: 293 VEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
V + P+ + + DSG+T ++P +Y+ ++S+ R
Sbjct: 242 VFIDKQPIRGNPTF-------EAVFDSGSTYTHVPAQIYNEIVSKVR 281
>gi|357521081|ref|XP_003630829.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355524851|gb|AET05305.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 526
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 81/302 (26%), Positives = 135/302 (44%), Gaps = 44/302 (14%)
Query: 51 ALKQHDTRRHGRMMASIDLELGGN-GHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWV 109
A ++T + S+DL N G + T + ++G+G P ++Y+ D +D W+
Sbjct: 154 AASLYNTHHQHKNYYSLDLNASLNPGITTGTSNFLVQIGVGGPPQKFYMIFDLQTDFTWL 213
Query: 110 NCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVV 169
C C +C + D ++FDPS+SS+ ++C C N+ SCS C Y +
Sbjct: 214 QCQPCIKCYDQPD-----SIFDPSQSSSYTLLSCETKHCNLLPNS---SCSDDGYCRYNI 265
Query: 170 TYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILG 229
TY DG++T G + + + ++SG + L GC N+ G S DG G
Sbjct: 266 TYKDGTNTEGVLINETVSF-ESSGWVDRVSL------GCSNKNQGPFVGS-----DGTFG 313
Query: 230 FGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSP--------KVKTTPMVP 281
G+ + S S++ A+ ++CL K G + + SP K+ P
Sbjct: 314 LGRGSLSFPSRINASS-----MSYCLVESKDGYSSSTLEFNSPPCSGSVKAKLLQNPKAE 368
Query: 282 NMPHYNVILEEVEVGGNPLDLPTSLL-----GTGDERGTIIDSGTTLAYLPPMLYDLVLS 336
N+ Y V L+ ++VGG +D+P S G G G I+ S + + L Y++V
Sbjct: 369 NL--YYVGLKGIKVGGEKIDVPNSTFTIDPYGNG---GMIVSSSSLITMLENDTYNVVRD 423
Query: 337 QF 338
F
Sbjct: 424 AF 425
>gi|356532386|ref|XP_003534754.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 463
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 103/334 (30%), Positives = 146/334 (43%), Gaps = 54/334 (16%)
Query: 33 FEVENKFKAGGERERTL-SALKQHDTRRHGRMMASIDLELGGNGHPSATGL--------- 82
F + ER R L S L ++ R+ A+ D GG S T L
Sbjct: 55 FSFSDMITKDEERVRFLHSRLTNKESVRNS---ATTDKLRGGPSLVSTTPLKSGLSIGSG 111
Query: 83 -YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCS-RCPTKSDLGIKLTLFDPSKSSTSGE 140
Y+ K+GLGTP + + VDTGS L W+ C C C + D +F PS S T
Sbjct: 112 NYYVKIGLGTPAKYFSMIVDTGSSLSWLQCQPCVIYCHVQVD-----PIFTPSTSKTYKA 166
Query: 141 IACSDNFCRTTYNNRY--PSCSPGV-RCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKT 197
+ CS + C + ++ P CS C Y +YGD S + GY +D++ L +
Sbjct: 167 LPCSSSQCSSLKSSTLNAPGCSNATGACVYKASYGDTSFSIGYLSQDVLTLTPSE----- 221
Query: 198 APLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAA-GNVRKEFAHCLD 256
AP +S ++GCG G G S+ GI+G S+L QL+ GN F++CL
Sbjct: 222 AP-SSGFVYGCGQDNQGLFGRSS-----GIIGLANDKISMLGQLSKKYGNA---FSYCLP 272
Query: 257 VVKG-------GGIFAIG--DVVSPKVKTTPMVPN--MPH-YNVILEEVEVGGNPLDLPT 304
G +IG + S K TP+V N +P Y + L + V G PL +
Sbjct: 273 SSFSAPNSSSLSGFLSIGASSLTSSPYKFTPLVKNQKIPSLYFLDLTTITVAGKPLGVSA 332
Query: 305 SLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
S TIIDSGT + LP +Y+ + F
Sbjct: 333 SSYNV----PTIIDSGTVITRLPVAVYNALKKSF 362
>gi|6562285|emb|CAB62655.1| putative protein [Arabidopsis thaliana]
Length = 519
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 100/347 (28%), Positives = 150/347 (43%), Gaps = 49/347 (14%)
Query: 20 WAVGGGGVMGNFVFEVENKFKAGGERE-------------RTLSALKQHDTRRHGRMMAS 66
W + G F FEV + F ++ L Q D GR +AS
Sbjct: 18 WGLERCEASGKFSFEVHHMFSDRVKQSLGLDDLVPEKGSLEYFKVLAQRDRLIRGRGLAS 77
Query: 67 IDLE-----LGGNGHPSAT---GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCP 118
+ E + GN S L++ V +GTP + V +DTGSDL W+ C S C
Sbjct: 78 NNEETPITFMRGNRTISIDLLGFLHYANVSVGTPATWFLVALDTGSDLFWLPCNCGSTCI 137
Query: 119 TK-SDLGIK----LTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTY-G 172
++G+ L L+ P+ SSTS I CSD+ C + SP C Y + Y
Sbjct: 138 RDLKEVGLSQSRPLNLYSPNTSSTSSSIRCSDDRCFGSSRCS----SPASSCPYQIQYLS 193
Query: 173 DGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQ 232
+ T+G D++ L L+ P+ +++ GCG Q+G L SS AAV+G+LG G
Sbjct: 194 KDTFTTGTLFEDVLHLVTEDEGLE--PVKANITLGCGKNQTGFLQSS--AAVNGLLGLGL 249
Query: 233 ANSSLLSQLAAAGNVRKEFAHCL-DVVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILE 291
+ S+ S LA A F+ C +++ G + GD TP++P P +
Sbjct: 250 KDYSVPSILAKAKITANSFSMCFGNIIDVVGRISFGDKGYTDQMETPLLPTEPS----VT 305
Query: 292 EVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
EV VGG+ G + + D+GT+ +L Y L+ F
Sbjct: 306 EVSVGGD---------AVGVQLLALFDTGTSFTHLLEPEYGLITKAF 343
>gi|222615640|gb|EEE51772.1| hypothetical protein OsJ_33215 [Oryza sativa Japonica Group]
Length = 775
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 77/260 (29%), Positives = 115/260 (44%), Gaps = 28/260 (10%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNC-AGCSRCPTKSDLGIKLTLFDPSKSSTSGEI 141
+F + +G P Y++ +DTGS L W+ C A C+ C + L+ P+ +
Sbjct: 403 FFITMNIGDPAKSYFLDIDTGSTLTWLQCDAPCTNCNI-----VPHVLYKPTPKKL---V 454
Query: 142 ACSDNFCRTTYNN--RYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAP 199
C+D+ C Y + + C +C+YV+ Y D SS+ G V D L+ ++G T P
Sbjct: 455 TCADSLCTDLYTDLGKPKRCGSQKQCDYVIQYVD-SSSMGVLVIDRFSLSASNG---TNP 510
Query: 200 LNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKE-FAHCLDVV 258
+++ FGCG Q G + VD ILG + +LLSQL + G + K HC+
Sbjct: 511 --TTIAFGCGYDQ-GKKNRNVPIPVDSILGLSRGKVTLLSQLKSQGVITKHVLGHCIS-S 566
Query: 259 KGGGIFAIGDVVSPK--VKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTI 316
KGGG GD P V TPM +Y+ + N + + + I
Sbjct: 567 KGGGFLFFGDAQVPTSGVTWTPMNREHKYYSPGHGTLHFDSNSKAISAAPM------AVI 620
Query: 317 IDSGTTLAYLPPMLYDLVLS 336
DSG T Y Y LS
Sbjct: 621 FDSGATYTYFAAQPYQATLS 640
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 47/169 (27%), Positives = 71/169 (42%), Gaps = 22/169 (13%)
Query: 164 RCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAA 223
+C+Y + Y DG+ST G + D L + + T P ++ FGCG Q +
Sbjct: 28 QCDYEIKYADGASTIGALIVDQFSLPR----IATRP---NLPFGCGYNQGIGENFQQTSP 80
Query: 224 VDGILGFGQANSSLLSQLAAAGNVRKEFA-HCLDVVKGGGIFAIGDVVSPKVKTTPMVPN 282
V+GILG + S +SQL G + K HCL GGG+ +GD V +
Sbjct: 81 VNGILGLDRGKVSFVSQLKMLGIITKHVVGHCLS-SGGGGLLFVGDGDGNLVLLHANYYS 139
Query: 283 MPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLY 331
+ + +G NP+D+ + DSG+T Y Y
Sbjct: 140 PGSATLYFDRHSLGMNPMDV-------------VFDSGSTYTYFTAQPY 175
>gi|356557203|ref|XP_003546907.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 470
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 87/273 (31%), Positives = 128/273 (46%), Gaps = 47/273 (17%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y +GLG+ V VDTGSDL WV C C C ++ LF PS S + I
Sbjct: 122 YIVTMGLGS--QNMSVIVDTGSDLTWVQCEPCRSCYNQNG-----PLFKPSTSPSYQPIL 174
Query: 143 CSDNFCRTTYNNRYPSC----SPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTA 198
C+ C++ +C S C+YVV YGDGS TSG + + S
Sbjct: 175 CNSTTCQSL---ELGACGSDPSTSATCDYVVNYGDGSYTSGELGIEKLGFGGIS------ 225
Query: 199 PLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAA-GNVRKEFAHCL-- 255
S+ +FGCG G G ++ G++G G++ S++SQ A G V F++CL
Sbjct: 226 --VSNFVFGCGRNNKGLFGGAS-----GLMGLGRSELSMISQTNATFGGV---FSYCLPS 275
Query: 256 -DVVKGGGIFAIGDV------VSPKVKTTPMVPNMP---HYNVILEEVEVGGNPLDLPTS 305
D G +G+ V+P + T M+PN+ Y + L ++VGG L + S
Sbjct: 276 TDQAGASGSLVMGNQSGVFKNVTP-IAYTRMLPNLQLSNFYILNLTGIDVGGVSLHVQAS 334
Query: 306 LLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
G G G I+DSGT ++ L P +Y + ++F
Sbjct: 335 SFGNG---GVILDSGTVISRLAPSVYKALKAKF 364
>gi|357450863|ref|XP_003595708.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355484756|gb|AES65959.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 407
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 86/285 (30%), Positives = 129/285 (45%), Gaps = 53/285 (18%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y ++ +GTP + Y + DTGSDL+W C C++C + + +FDP SS+ I
Sbjct: 60 YLMELSIGTPPIKIYAEADTGSDLVWFQCIPCTKCYKQQN-----PMFDPRSSSSYTNIT 114
Query: 143 CSDNFCRTTYNNRYPS--CSPGVR-CEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAP 199
C C N+ S CS + C Y +Y D S T G ++ + L +G P
Sbjct: 115 CGTESC-----NKLDSSLCSTDQKTCNYTYSYADNSITQGVLAQETLTLTSTTGE----P 165
Query: 200 LN-SSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQ----LAAAGNVRKEFAHC 254
+ +IFGCG+ SG D + G++G G+ SL+SQ L A GN+ F+ C
Sbjct: 166 VAFQGIIFGCGHNNSG----FNDREM-GLIGLGRGPLSLISQIGSSLGAGGNM---FSQC 217
Query: 255 L-------------DVVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLD 301
L + KG + G V +P + + Y L + V ++
Sbjct: 218 LVPFNTDPSITSQMNFGKGSEVLGNGTVSTPLISK-----DGTGYFATLLGISV--EDIN 270
Query: 302 LPT---SLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFRFWIA 343
LP S LGT + +IDSGTT+ YLP Y ++ Q R +A
Sbjct: 271 LPFSNGSSLGTITKGNILIDSGTTITYLPEEFYHRLIEQVRNKVA 315
>gi|255548660|ref|XP_002515386.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223545330|gb|EEF46835.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 387
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 83/284 (29%), Positives = 126/284 (44%), Gaps = 31/284 (10%)
Query: 62 RMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGC-SRCPTK 120
M A I ++ +G P G Y K+ LGTP + +DTGSD+ W C C C +
Sbjct: 27 EMQADIPVQ---SGIPLGAGNYLVKMALGTPKLSLSLALDTGSDITWTQCEPCVGSCYRQ 83
Query: 121 SDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGY 180
+ T FDP KSS+ ++CS + CR ++ C Y V YGDGS + G+
Sbjct: 84 AQ-----TKFDPRKSSSYKNVSCSSSSCRIITDSGGARGCVSSTCIYKVQYGDGSYSVGF 138
Query: 181 FVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQ 240
F + + ++ + + S+ +FGCG + +G G G A
Sbjct: 139 FATEKLTISPSD-------VISNFLFGCGQQNAGRFGRIAGLLGLGRGKLSLA------- 184
Query: 241 LAAAGNVRKEFAHCLDVVKGG--GIFAIGDVVSPKVKTTPMVP---NMPHYNVILEEVEV 295
L + F +CL G +G V VK TP+ P N P Y + ++ + V
Sbjct: 185 LQTSEKYNNLFTYCLPSFSSSSTGHLTLGGQVPKSVKFTPLSPAFKNTPFYGIDIKGLSV 244
Query: 296 GGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
GG+ L + S+ G IIDSGT + L P +Y + S+F+
Sbjct: 245 GGHVLPIDASVFSNA---GAIIDSGTVITRLQPTVYSALSSKFQ 285
>gi|356532674|ref|XP_003534896.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 446
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 90/305 (29%), Positives = 137/305 (44%), Gaps = 52/305 (17%)
Query: 54 QHDTRRHGRMMASIDLELGGNG------HPSATG-LYFTKVGLGTPTDEYYVQVDTGSDL 106
+H R + A I+ L N PS TG + +G P+ V +DTGSD+
Sbjct: 65 EHSAARLAYIQARIEGSLVYNNDYTASVSPSLTGRTILVNLSIGQPSIPQLVVMDTGSDI 124
Query: 107 LWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCE 166
LW+ C C+ C + LG+ LFDPS SST + C+T P G +C+
Sbjct: 125 LWIMCNPCTNC--DNHLGL---LFDPSMSSTFSPL------CKT------PCGFKGCKCD 167
Query: 167 ---YVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAA 223
+ ++Y D SS SG F RDI+ S VI GCG+ ++G ++D
Sbjct: 168 PIPFTISYVDNSSASGTFGRDILVFETTDEGTSQI---SDVIIGCGH----NIGFNSDPG 220
Query: 224 VDGILGFGQANSSLLSQLAAAGNVRKEFAHCL----DVVKGGGIFAIGDVVSPKVKTTPM 279
+GILG +SL +Q+ ++F++C+ D +G+ + +TP
Sbjct: 221 YNGILGLNNGPNSLATQIG------RKFSYCIGNLADPYYNYNQLRLGEGADLEGYSTPF 274
Query: 280 VPNMPHYNVILEEVEVGGNPLDLPTSLL-----GTGDERGTIIDSGTTLAYLPPMLYDLV 334
Y V +E + VG LD+ GTG G I+DSGTT+ YL + L+
Sbjct: 275 EVYHGFYYVTMEGISVGEKRLDIALETFEMKRNGTG---GVILDSGTTITYLVDSAHKLL 331
Query: 335 LSQFR 339
++ R
Sbjct: 332 YNEVR 336
>gi|357143847|ref|XP_003573077.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 420
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 96/347 (27%), Positives = 146/347 (42%), Gaps = 63/347 (18%)
Query: 1 MGGLRLLALVVVTVAVV-----HQWAVGGGGVMGNFVFEVENKFKAGGERERTLSALKQH 55
MG L+ L+LV++T V ++ + G + + R R LS
Sbjct: 1 MGPLQALSLVLLTSLAVSAPSGYRLVLTHVDSKGGYTKTELMRRAVHRSRLRALSGYDAT 60
Query: 56 DTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCS 115
R H S+ +E Y ++ +G P + DTGSDL W C C
Sbjct: 61 SPRLH-----SVQVE------------YLMELAIGKPPVPFVALADTGSDLTWTQCQPCK 103
Query: 116 RCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGS 175
C ++DPS SST + CS C ++ +C+P C Y YGDG+
Sbjct: 104 LC-----FPQDTPVYDPSASSTFSPLPCSSATCLPIWSR---NCTPSSLCRYRYAYGDGA 155
Query: 176 STSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANS 235
++G + + L +S + V FGCG GD +ST G +G G+
Sbjct: 156 YSAGILGTETLTLGPSSAPVSVG----GVAFGCGTDNGGDSLNST-----GTVGLGRGTL 206
Query: 236 SLLSQLAAAGNVRKEFAHCLDVVKGGGI---FAIGDV--VSP---KVKTTPMV--PNMP- 284
SLL+QL +F++CL + F +G + ++P V++TP++ P P
Sbjct: 207 SLLAQLGVG-----KFSYCLTDFFNSALDSPFLLGTLAELAPGPSTVQSTPLLQSPQNPS 261
Query: 285 HYNVILEEVEVGGNPLDLPTSLLGTGDER-----GTIIDSGTTLAYL 326
Y V L+ + +G L +P GT D R G I+DSGTT L
Sbjct: 262 RYFVSLQGISLGDVRLPIPN---GTFDLRGDGTGGMIVDSGTTFTIL 305
>gi|168005153|ref|XP_001755275.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693403|gb|EDQ79755.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 429
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 92/292 (31%), Positives = 132/292 (45%), Gaps = 38/292 (13%)
Query: 46 ERTLSALKQHDTRRHGRMMASIDL--ELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTG 103
E ++A+K+ RR R+ + +L S G Y + G P + VDTG
Sbjct: 52 EIFIAAVKRGHERR-ARLAKHVLAGDQLFETPVASGNGEYLIDISYGNPPQKSTAIVDTG 110
Query: 104 SDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGV 163
SDL WV C C C L K FDPSKS++ + C NFC+ + + SC+
Sbjct: 111 SDLNWVQCLPCKSC--YETLSAK---FDPSKSASYKTLGCGSNFCQ---DLPFQSCA--A 160
Query: 164 RCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAA 223
C+Y YGDGSSTSG D + + +G + +V FGCGN G +
Sbjct: 161 SCQYDYMYGDGSSTSGALSTDDVTI--GTGKIP------NVAFGCGNSNLGTFAGAGGLV 212
Query: 224 VDGILGFGQANSSLLSQLAAAGNVRKEFAHC---LDVVKGGGIFAIGDVVSPKVKTTPMV 280
G SL+SQL G K+F++C L K ++ ++ V TPM+
Sbjct: 213 GLGKGPL-----SLVSQL--GGTATKKFSYCLVPLGSTKTSPLYIGDSTLAGGVAYTPML 265
Query: 281 PNMPH---YNVILEEVEVGGNPLDLPTS---LLGTGDERGTIIDSGTTLAYL 326
N + Y L+ + V G ++ P + + TG G I+DSGTTL YL
Sbjct: 266 TNNNYPTFYYAELQGISVEGKAVNYPANTFDIAATG-RGGLILDSGTTLTYL 316
>gi|356540369|ref|XP_003538662.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
At2g35615-like [Glycine max]
Length = 364
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 85/277 (30%), Positives = 122/277 (44%), Gaps = 46/277 (16%)
Query: 78 SATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSST 137
S G Y K+ LGTP + Y VDT SDL+W C C C + K +FDP K
Sbjct: 26 SNNGDYLMKLTLGTPPVDVYGLVDTDSDLVWAQCTPCQGCYKQ-----KNPMFDPLKE-- 78
Query: 138 SGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKT 197
C + +++ SCSP C+YV Y D S+T G ++I + G
Sbjct: 79 ----------CNSFFDH---SCSPEKACDYVYAYADDSATKGMLAKEIATFSSTDGK--- 122
Query: 198 APLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNV--RKEFAHCL 255
P+ S+IFGCG+ +G + +G LS ++ GN+ K F+ CL
Sbjct: 123 -PIVESIIFGCGHNNTGVFNEND-------MGLIGLGGGPLSLVSQMGNLYGSKRFSQCL 174
Query: 256 DVVKG----GGIFAIG---DVVSPKVKTTPMVPN--MPHYNVILEEVEVGGNPLDLPTS- 305
G ++G DV V TTP+V Y V LE + VG + +S
Sbjct: 175 VPFHADPHTSGTISLGEASDVSGEGVVTTPLVSEEGQTPYLVTLEGISVGDTFVPFNSSE 234
Query: 306 LLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFRFWI 342
+L G+ +IDSGT YLP YD ++ + + I
Sbjct: 235 MLSKGN---IMIDSGTPETYLPQEFYDRLVEELKVQI 268
>gi|413938618|gb|AFW73169.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
Length = 324
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 89/255 (34%), Positives = 124/255 (48%), Gaps = 29/255 (11%)
Query: 98 VQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYP 157
++VDTGSDL WV C C+ P S K LFDP++SS+ + C C
Sbjct: 1 MEVDTGSDLSWVQCKPCAAAP--SCYSQKDPLFDPAQSSSYAAVPCGGPVC-AGLGIYAA 57
Query: 158 SCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLG 217
S +C YVV+YGDGS+T+G + D + L+ +S FGCG+ QSG
Sbjct: 58 SACSAAQCGYVVSYGDGSNTTGVYSSDTLTLSASSA-------VQGFFFGCGHAQSGLFN 110
Query: 218 SSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKG-GGIFAIG----DVVSP 272
VDG+LG G+ SL+ Q AG F++CL G +G +P
Sbjct: 111 -----GVDGLLGLGREQPSLVEQ--TAGTYGGVFSYCLPTKPSTAGYLTLGVGGPSGAAP 163
Query: 273 KVKTTPMV--PNMP-HYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPM 329
TT ++ PN P +Y V+L + VGG L +P S GT++D+GT + LPP
Sbjct: 164 GFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAF----AGGTVVDTGTVVTRLPPT 219
Query: 330 LYDLVLSQFRFWIAS 344
Y + S FR +AS
Sbjct: 220 AYAALRSAFRSGMAS 234
>gi|125527523|gb|EAY75637.1| hypothetical protein OsI_03542 [Oryza sativa Indica Group]
Length = 446
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 64/187 (34%), Positives = 96/187 (51%), Gaps = 28/187 (14%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPS 133
+G P +G YF VG+GTP+ + + +DTGSDL+W+ C+ C RC + + +FDP
Sbjct: 77 SGIPFESGEYFALVGVGTPSTKAMLVIDTGSDLVWLQCSPCRRCYAQ-----RGQVFDPR 131
Query: 134 KSSTSGEIACSDNFCRTTYNNRYPSC----SPGVRCEYVVTYGDGSSTSGYFVRDIIQLN 189
+SST + CS CR R+P C + G C Y+V YGDGSS++G D +
Sbjct: 132 RSSTYRRVPCSSPQCRAL---RFPGCDSGGAAGGGCRYMVAYGDGSSSTGELATDKLAFA 188
Query: 190 QASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAA-GNVR 248
+ ++V GCG G S+ G+LG + S+ +Q+A A G+V
Sbjct: 189 NDT-------YVNNVTLGCGRDNEGLFDSAA-----GLLGVARGKISISTQVAPAYGSV- 235
Query: 249 KEFAHCL 255
F +CL
Sbjct: 236 --FEYCL 240
>gi|409179878|gb|AFV26024.1| aspartic proteinase nepenthesin 1 [Nepenthes mirabilis]
Length = 437
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 102/362 (28%), Positives = 151/362 (41%), Gaps = 64/362 (17%)
Query: 6 LLALVVVTVAVV-------------HQWAVGGGGVMGNFVFEVEN--KFKAGGERERTLS 50
LLAL +V + V H+ V G +M V +N KF+ ER +
Sbjct: 9 LLALSIVYIFVAPTHSTSRTALNHHHEPKVAGFQIMLEHVDSGKNLTKFEL---LERAV- 64
Query: 51 ALKQHDTRRHGRMMASIDLELGGNGHPSA-TGLYFTKVGLGTPTDEYYVQVDTGSDLLWV 109
+ +RR R+ A ++ G A G Y + +GTP + +DTGSDL+W
Sbjct: 65 ---ERGSRRLQRLEAMLNGPSGVETPVYAGDGEYLMNLSIGTPAQPFSAIMDTGSDLIWT 121
Query: 110 NCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVV 169
C C++C +S +F+P SS+ + CS C+ + P+CS C+Y
Sbjct: 122 QCQPCTQCFNQST-----PIFNPQGSSSFSTLPCSSQLCQAL---QSPTCSNN-SCQYTY 172
Query: 170 TYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILG 229
YGDGS T G + + S P ++ FGCG G G A G++G
Sbjct: 173 GYGDGSETQGSMGTETLTFGSVS-----IP---NITFGCGENNQG-FGQGNGA---GLVG 220
Query: 230 FGQANSSLLSQLAAAGNVRKEFAHCLDVV--KGGGIFAIGDVVSPKVKTTP--------M 279
G+ SL SQL +V K F++C+ + +G + + +P
Sbjct: 221 MGRGPLSLPSQL----DVTK-FSYCMTPIGSSTSSTLLLGSLANSVTAGSPNTTLIESSQ 275
Query: 280 VPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGT---IIDSGTTLAYLPPMLYDLVLS 336
+P Y + L + VG PL + S+ GT IIDSGTTL Y Y V
Sbjct: 276 IPTF--YYITLNGLSVGSTPLPIDPSVFKLNSNNGTGGIIIDSGTTLTYFADNAYQAVRQ 333
Query: 337 QF 338
F
Sbjct: 334 AF 335
>gi|413921849|gb|AFW61781.1| hypothetical protein ZEAMMB73_702843 [Zea mays]
Length = 379
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 80/273 (29%), Positives = 129/273 (47%), Gaps = 38/273 (13%)
Query: 78 SATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSST 137
+++G Y + +GTP Y +DTGSDL+W CA C C + FD KS+T
Sbjct: 84 ASSGEYLVDLAIGTPPLYYTAIMDTGSDLIWTQCAPCLLCADQ-----PTPYFDVKKSAT 138
Query: 138 SGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKT 197
+ C + C + + PSC + C Y YGD +ST+G + A+
Sbjct: 139 YRALPCRSSRCASLSS---PSCFKKM-CVYQYYYGDTASTAGVLANETFTFGAANSTKVR 194
Query: 198 APLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDV 257
A +++ FGCG+ +GDL +S+ G++GFG+ SL+SQL + F++CL
Sbjct: 195 A---TNIAFGCGSLNAGDLANSS-----GMVGFGRGPLSLVSQLGPS-----RFSYCLTS 241
Query: 258 VKGG-------GIFA----IGDVVSPKVKTTPMV--PNMPH-YNVILEEVEVGGNPLDLP 303
G++A V++TP V P +P+ Y + L+ + +G L +
Sbjct: 242 YLSATPSRLYFGVYANLSSTNTSSGSPVQSTPFVINPALPNMYFLSLKAISLGTKLLPID 301
Query: 304 TSLLGTGDE--RGTIIDSGTTLAYLPPMLYDLV 334
+ D+ G IIDSGT++ +L Y+ V
Sbjct: 302 PLVFAINDDGTGGVIIDSGTSITWLQQDAYEAV 334
>gi|356504173|ref|XP_003520873.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 461
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 85/277 (30%), Positives = 123/277 (44%), Gaps = 34/277 (12%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSG 139
+G YFT++G+GTP Y+ +DTGSD++W+ CA C +C T++D +FDP+KS T
Sbjct: 115 SGEYFTRIGVGTPARYVYMVLDTGSDVVWLQCAPCRKCYTQTD-----HVFDPTKSRTYA 169
Query: 140 EIACSDNFCRTTYNNRYPSCSPGVR-CEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTA 198
I C CR + P CS + C+Y V+YGDGS T G F + + +
Sbjct: 170 GIPCGAPLCRRLDS---PGCSNKNKVCQYQVSYGDGSFTFGDFSTETLTFRRNR------ 220
Query: 199 PLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL--- 255
+ V GCG+ G + G+ S Q N +F++CL
Sbjct: 221 --VTRVALGCGHDNEGLFTGAAGLLGL-----GRGRLSFPVQTGRRFN--HKFSYCLVDR 271
Query: 256 -DVVKGGGIFAIGDVVSPKVKTTPMVPNMP---HYNVILEEVEVGGNPLD-LPTSL--LG 308
K + VS TP++ N Y + L + VGG P+ L SL L
Sbjct: 272 SASAKPSSVIFGDSAVSRTAHFTPLIKNPKLDTFYYLELLGISVGGAPVRGLSASLFRLD 331
Query: 309 TGDERGTIIDSGTTLAYLPPMLYDLVLSQFRFWIASL 345
G IIDSGT++ L Y + FR + L
Sbjct: 332 AAGNGGVIIDSGTSVTRLTRPAYIALRDAFRIGASHL 368
>gi|297842769|ref|XP_002889266.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297335107|gb|EFH65525.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 489
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 85/256 (33%), Positives = 120/256 (46%), Gaps = 43/256 (16%)
Query: 100 VDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSC 159
VDTGSDL WV C C C + L+DPS SS+ + C+ + C+ S
Sbjct: 153 VDTGSDLTWVQCQPCRSCYNQQG-----PLYDPSVSSSYKTVFCNSSTCQDLVAATGNSG 207
Query: 160 SPG-------VRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQ 212
G CEYVV+YGDGS T G + I L G+ K L +FGCG
Sbjct: 208 PCGGFNGVVKTTCEYVVSYGDGSYTRGDLASESIVL----GDTKLENL----VFGCGRNN 259
Query: 213 SGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGG--GIFAIGDVV 270
G G ++ G++G G+++ SL+SQ N F++CL ++ G G + G+
Sbjct: 260 KGLFGGAS-----GLMGLGRSSVSLVSQTLKTFN--GVFSYCLPSLEDGASGTLSFGNDF 312
Query: 271 S-----PKVKTTPMVPN---MPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTT 322
S V TP+V N Y + L +GG ++L T G RG +IDSGT
Sbjct: 313 SVYKNSTSVFYTPLVQNPQLRSFYILNLTGASIGG--VELKTLSFG----RGILIDSGTV 366
Query: 323 LAYLPPMLYDLVLSQF 338
+ LPP +Y V ++F
Sbjct: 367 ITRLPPSIYKAVKTEF 382
>gi|165292434|dbj|BAF98915.1| aspartic proteinase nepenthesin I [Nepenthes alata]
Length = 437
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 100/362 (27%), Positives = 149/362 (41%), Gaps = 64/362 (17%)
Query: 6 LLALVVVTVAVV-------------HQWAVGGGGVMGNFVFEVEN--KFKAGGERERTLS 50
LLAL +V + V H+ V G +M V +N KF+ ER +
Sbjct: 9 LLALSIVYIFVAPTHSTSRTALNHHHEPKVAGFQIMLEHVDSGKNLTKFEL---LERAV- 64
Query: 51 ALKQHDTRRHGRMMASIDLELGGNGHPSA-TGLYFTKVGLGTPTDEYYVQVDTGSDLLWV 109
+ +RR R+ A ++ G A G Y + +GTP + +DTGSDL+W
Sbjct: 65 ---ERGSRRLQRLEAMLNGPSGVETPVYAGDGEYLMNLSIGTPAQPFSAIMDTGSDLIWT 121
Query: 110 NCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVV 169
C C++C +S +F+P SS+ + CS C+ + P+CS C+Y
Sbjct: 122 QCQPCTQCFNQST-----PIFNPQGSSSFSTLPCSSQLCQAL---QSPTCSNN-SCQYTY 172
Query: 170 TYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILG 229
YGDGS T G + + S P ++ FGCG G G A G++G
Sbjct: 173 GYGDGSETQGSMGTETLTFGSVS-----IP---NITFGCGENNQG-FGQGNGA---GLVG 220
Query: 230 FGQANSSLLSQLAAAGNVRKEFAHCLDVV--KGGGIFAIGDVVSPKVKTTP--------M 279
G+ SL SQL +F++C+ + +G + + +P
Sbjct: 221 MGRGPLSLPSQLDVT-----KFSYCMTPIGSSNSSTLLLGSLANSVTAGSPNTTLIQSSQ 275
Query: 280 VPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGT---IIDSGTTLAYLPPMLYDLVLS 336
+P Y + L + VG PL + S+ GT IIDSGTTL Y Y V
Sbjct: 276 IPTF--YYITLNGLSVGSTPLPIDPSVFKLNSNNGTGGIIIDSGTTLTYFVDNAYQAVRQ 333
Query: 337 QF 338
F
Sbjct: 334 AF 335
>gi|388502342|gb|AFK39237.1| unknown [Lotus japonicus]
Length = 440
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 89/286 (31%), Positives = 126/286 (44%), Gaps = 58/286 (20%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGC--SRCPTKSDLGIKLTLFDPSKSSTS 138
G Y ++ +GTP+ E DTGSDL WV C+ C ++C L+DP SST
Sbjct: 94 GNYLMRIYIGTPSVERLAIADTGSDLTWVQCSPCDNTKC-----FAQNTPLYDPLNSSTF 148
Query: 139 GEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTA 198
+ C C ++Y CS C Y TYGD S + G D I+L L
Sbjct: 149 TLLPCDSQPCTQLPYSQY-VCSDYGDCIYAYTYGDNSYSYGGLSSDSIRL-----MLLQL 202
Query: 199 PLNSSVIFGCG--NRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL- 255
NS + FGCG N+ + D T GI+G G SL+SQL + +F++CL
Sbjct: 203 HYNSKICFGCGFQNKFTADKSGKT----TGIVGLGAGPLSLVSQL--GDEIGHKFSYCLL 256
Query: 256 ---------------DVVKGGGIFAIGDVVSPKVKTTPMV--PNMPHYNVILEEVEVGGN 298
+V+G G V +TP++ P++P Y + LE + VG
Sbjct: 257 PFSSNSNSKLKFGEAAIVQGNG-----------VVSTPLIIKPDLPFYYLNLEGITVGAK 305
Query: 299 PLDLPTSLLGTGDERGT-IIDSGTTLAYLPPMLYDLVLSQFRFWIA 343
+ TG G IIDSG+TL YL Y+ +S + +A
Sbjct: 306 -------TVKTGQTDGNIIIDSGSTLTYLEESFYNEFVSLVKETVA 344
>gi|255564685|ref|XP_002523337.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223537425|gb|EEF39053.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 469
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 86/277 (31%), Positives = 121/277 (43%), Gaps = 34/277 (12%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPS 133
+G +G YFT++G+GTP Y+ +DTGSD++W+ CA C RC +SD +FDP
Sbjct: 117 SGLAQGSGEYFTRIGVGTPPRYVYMVLDTGSDIVWIQCAPCKRCYAQSD-----PVFDPR 171
Query: 134 KSSTSGEIACSDNFCRTTYNNRYPSCSPGVR-CEYVVTYGDGSSTSGYFVRDIIQLNQAS 192
KS + IAC C + P C+ + C Y V+YGDGS T G F + + +
Sbjct: 172 KSRSFASIACRSPLC---HRLDSPGCNTQKQTCMYQVSYGDGSFTFGDFSTETLTFRRTR 228
Query: 193 GNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFA 252
+ V GCG+ G + G+ S SQ N +F+
Sbjct: 229 --------VARVALGCGHDNEGLFVGAAGLLGL-----GRGRLSFPSQTGRRFN--HKFS 273
Query: 253 HCL---DVVKGGGIFAIGD-VVSPKVKTTPMVPNMP---HYNVILEEVEVGGNPLDLPTS 305
+CL GD VS + TP+V N Y V L + VGG + T+
Sbjct: 274 YCLVDRSASSKPSSMVFGDSAVSRTARFTPLVSNPKLDTFYYVELLGISVGGTRVPGITA 333
Query: 306 LLGTGDER---GTIIDSGTTLAYLPPMLYDLVLSQFR 339
L D+ G IIDSGT++ L Y FR
Sbjct: 334 SLFKLDQTGNGGVIIDSGTSVTRLTRPAYIAFRDAFR 370
>gi|2290202|gb|AAB96882.1| nucellin [Hordeum vulgare subsp. vulgare]
gi|2290204|gb|AAB96883.1| nucellin [Hordeum vulgare subsp. vulgare]
gi|45357050|gb|AAS58479.1| nucellin [Hordeum vulgare subsp. vulgare]
Length = 410
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 77/285 (27%), Positives = 123/285 (43%), Gaps = 37/285 (12%)
Query: 66 SIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC----AGCSRCPTKS 121
+I L GN +P G ++ + +G P Y++ VDTGS+L W+ C GC C +
Sbjct: 23 AIKFPLEGNVYP--VGHFYATLNIGEPAKPYFLDVDTGSNLTWLECHHPVHGCKGCHPRP 80
Query: 122 DLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNR--YPSCSPG--VRCEYVVTYGDGSST 177
+ P+ + ++ C C + P CS RC Y + Y G S
Sbjct: 81 ----PHPYYTPADGNL--KVVCGSPLCVAVRRDVPGIPECSRNDPHRCHYEIQYVTGKS- 133
Query: 178 SGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSL 237
G DII +N + FGCG +Q + S + VDGILG G +
Sbjct: 134 EGDLATDIISVNGRD--------KKRIAFGCGYKQE-EPADSPPSPVDGILGLGMGKAGF 184
Query: 238 LSQLAAAGNVRKE-FAHCLDVVKGGGIFAIGDVVSPK--VKTTPMVPNMPHYNVILEEVE 294
+QL +++ HCL KG G+ +GD P V PM ++ +Y+ L EV
Sbjct: 185 AAQLKGHKMIKENVIGHCLS-SKGKGVLYVGDFNPPTRGVTWAPMRESLFYYSPGLAEVF 243
Query: 295 VGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
+ P+ + + DSG+T ++P +Y+ ++S+ R
Sbjct: 244 IDKQPIRGNPTF-------EAVFDSGSTYTHVPAQIYNEIVSKVR 281
>gi|357486591|ref|XP_003613583.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355514918|gb|AES96541.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 437
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 80/256 (31%), Positives = 114/256 (44%), Gaps = 22/256 (8%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y +GTP + Y +DT +D +W C C C +FDPSKSST I
Sbjct: 89 YIISFLIGTPPFQLYGVMDTANDNIWFQCNPCKPC-----FNTTSPMFDPSKSSTYKTIP 143
Query: 143 CSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNS 202
CS C+ N S V CEY TYG + + G D + LN N T
Sbjct: 144 CSSPKCKNVENTHCSSDDKKV-CEYSFTYGGEAYSQGDLSIDTLTLNS---NNDTPISFK 199
Query: 203 SVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL------D 256
+++ GCG+R G L + V G +G G+ S +SQL ++ + +F++CL +
Sbjct: 200 NIVIGCGHRNKGPL----EGYVSGNIGLGRGPLSFISQLNSS--IGGKFSYCLVPLFSNE 253
Query: 257 VVKGGGIFAIGDVVS-PKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGT 315
+ G F VVS +TP+ Y+ L + VG + + S + T
Sbjct: 254 GISGKLHFGDKSVVSGVGTVSTPITAGEIGYSTTLNALSVGDHIIKFENSTSKNDNLGNT 313
Query: 316 IIDSGTTLAYLPPMLY 331
IIDSGTTL LP +Y
Sbjct: 314 IIDSGTTLTILPENVY 329
>gi|356575389|ref|XP_003555824.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 473
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 81/319 (25%), Positives = 131/319 (41%), Gaps = 43/319 (13%)
Query: 39 FKAGGERERTLSALKQHDTRRHG---RMMAS-----IDLELGGN---GHPSATGLYFTKV 87
F + +A Q DT+R R +A+ + G + G +G YF ++
Sbjct: 79 FNTSHDHRTRFNARMQRDTKRVAALRRHLAAGKPTYAEEAFGSDVVSGMEQGSGEYFVRI 138
Query: 88 GLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNF 147
G+G+P YV +D+GSD++WV C C++C +SD +F+P+ SS+ ++C+
Sbjct: 139 GVGSPPRNQYVVIDSGSDIIWVQCEPCTQCYHQSD-----PVFNPADSSSYAGVSCASTV 193
Query: 148 CRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFG 207
C N C G RC Y V+YGDGS T G + + + L +V G
Sbjct: 194 CSHVDN---AGCHEG-RCRYEVSYGDGSYTKGTLALETLTFGRT--------LIRNVAIG 241
Query: 208 CGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDV--VKGGGIFA 265
CG+ G G+LG G S + QL G F++CL ++ G+
Sbjct: 242 CGHHNQGMF-----VGAAGLLGLGSGPMSFVGQL--GGQAGGTFSYCLVSRGIQSSGLLQ 294
Query: 266 IGDVVSP------KVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDS 319
G P + P + + + V P+ L + G ++D+
Sbjct: 295 FGREAVPVGAAWVPLIHNPRAQSFYYVGLSGLGVGGLRVPISEDVFKLSELGDGGVVMDT 354
Query: 320 GTTLAYLPPMLYDLVLSQF 338
GT + LP Y+ F
Sbjct: 355 GTAVTRLPTAAYEAFRDAF 373
>gi|297823357|ref|XP_002879561.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297325400|gb|EFH55820.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 447
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 87/296 (29%), Positives = 130/296 (43%), Gaps = 37/296 (12%)
Query: 57 TRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSR 116
+RR +++ DL+ G G A G +F + +GTP + + DTGSDL WV C C +
Sbjct: 62 SRRLNNILSQTDLQSGLIG---ADGEFFMSITIGTPPMKVFAIADTGSDLTWVQCKPCQQ 118
Query: 117 CPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSS 176
C ++ +FD KSST C C ++ C+Y +YGD S
Sbjct: 119 CYKENG-----PIFDKKKSSTYKSEPCDSRNCHALSSSERGCDESKNVCKYRYSYGDQSF 173
Query: 177 TSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSS 236
+ G + I ++ ASG+ + P +FGCG G D GI+G G + S
Sbjct: 174 SKGDVATETISIDSASGSPVSFP---GTVFGCGYNNGGTF----DETGSGIIGLGGGHLS 226
Query: 237 LLSQLAAAGNVRKEFAHCLD----VVKGGGIFAIGDVVSPK-------VKTTPMVPNMP- 284
L+SQL ++ + K+F++CL G + +G P V +TP+V P
Sbjct: 227 LISQLGSS--ISKKFSYCLSHKSATTNGTSVINLGTNSIPSSLSKDSGVISTPLVDKEPR 284
Query: 285 -HYNVILEEVEVGGNPLDLPTSLLGTGD-------ERGTIIDSGTTLAYLPPMLYD 332
+Y + LE + VG + S D IIDSGTTL L +D
Sbjct: 285 TYYYLTLEAISVGKKKIPYTGSSYNPNDGGIFSETSGNIIIDSGTTLTLLDSGFFD 340
>gi|326523839|dbj|BAJ96930.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 83/273 (30%), Positives = 119/273 (43%), Gaps = 26/273 (9%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAG-CSRCPTKSDLGIKLTLFDPSKSSTS 138
TG YF + +GTP + + DTGSDL WV C G + P S L +F P+ S +
Sbjct: 107 TGQYFVQFRVGTPAQPFVLVADTGSDLTWVKCRGRRASSPDASPLA-SPRVFRPANSKSW 165
Query: 139 GEIACSDNFCRTTYNNRYPSCS----PGVRCEYVVTYGDGSSTSGYFVRDIIQLN-QASG 193
I CS + C++ +CS P C Y Y D SS G D + SG
Sbjct: 166 APIPCSSDTCKSYVPFSLANCSAGTTPPAPCGYDYRYKDKSSARGVVGTDAATIALSGSG 225
Query: 194 NLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAH 253
+ + A L V+ GC G S+ DG+L G +N S S+ AA R F++
Sbjct: 226 SDRKAKLQ-EVVLGCTTSYDGQSFQSS----DGVLSLGNSNISFASRAAARFGGR--FSY 278
Query: 254 CL-------DVVKGGGIFAIGDVVSPKVKTTPMVPN---MPHYNVILEEVEVGGNPLDLP 303
CL + +G SP TP++ + P Y V ++ V V G L++P
Sbjct: 279 CLVDHLAPRNATSYLTFGPVGAAHSP--SRTPLLLDAQVAPFYAVTVDAVSVAGKALNIP 336
Query: 304 TSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLS 336
+ G I+DSGT+L L Y V++
Sbjct: 337 AEVWDVKKNGGAILDSGTSLTILATPAYKAVVA 369
>gi|226503109|ref|NP_001147206.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|195608496|gb|ACG26078.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
gi|413921850|gb|AFW61782.1| aspartic proteinase nepenthesin-2 [Zea mays]
Length = 441
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 80/273 (29%), Positives = 129/273 (47%), Gaps = 38/273 (13%)
Query: 78 SATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSST 137
+++G Y + +GTP Y +DTGSDL+W CA C C + FD KS+T
Sbjct: 84 ASSGEYLVDLAIGTPPLYYTAIMDTGSDLIWTQCAPCLLCADQ-----PTPYFDVKKSAT 138
Query: 138 SGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKT 197
+ C + C + + PSC + C Y YGD +ST+G + A+
Sbjct: 139 YRALPCRSSRCASLSS---PSCFKKM-CVYQYYYGDTASTAGVLANETFTFGAANSTKVR 194
Query: 198 APLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDV 257
A +++ FGCG+ +GDL +S+ G++GFG+ SL+SQL + F++CL
Sbjct: 195 A---TNIAFGCGSLNAGDLANSS-----GMVGFGRGPLSLVSQLGPS-----RFSYCLTS 241
Query: 258 VKGG-------GIFA----IGDVVSPKVKTTPMV--PNMPH-YNVILEEVEVGGNPLDLP 303
G++A V++TP V P +P+ Y + L+ + +G L +
Sbjct: 242 YLSATPSRLYFGVYANLSSTNTSSGSPVQSTPFVINPALPNMYFLSLKAISLGTKLLPID 301
Query: 304 TSLLGTGDE--RGTIIDSGTTLAYLPPMLYDLV 334
+ D+ G IIDSGT++ +L Y+ V
Sbjct: 302 PLVFAINDDGTGGVIIDSGTSITWLQQDAYEAV 334
>gi|293335955|ref|NP_001168399.1| uncharacterized protein LOC100382168 precursor [Zea mays]
gi|223948009|gb|ACN28088.1| unknown [Zea mays]
gi|413922066|gb|AFW61998.1| hypothetical protein ZEAMMB73_694403 [Zea mays]
Length = 507
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 88/284 (30%), Positives = 123/284 (43%), Gaps = 56/284 (19%)
Query: 83 YFTKVGLG----TPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTS 138
Y T + LG +P V VDTGSDL WV C CS C + D LFDP+ S+T
Sbjct: 144 YVTTISLGGSSGSPAANLTVIVDTGSDLTWVQCKPCSACYAQRD-----PLFDPAGSATY 198
Query: 139 GEIACSDNFCRTTYNNRYPSCSPGV---------RCEYVVTYGDGSSTSGYFVRDIIQLN 189
+ C+ + C + R + +PG +C Y + YGDGS + G D + L
Sbjct: 199 AAVRCNASACADSL--RAATGTPGSCGSTGAGSEKCYYALAYGDGSFSRGVLATDTVALG 256
Query: 190 QASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAA-GNVR 248
AS +FGCG G G + G++G G+ SL+SQ A+ G V
Sbjct: 257 GAS--------LGGFVFGCGLSNRGLFGGTA-----GLMGLGRTELSLVSQTASRYGGV- 302
Query: 249 KEFAHCLDVVKGG---GIFAIG---DVVSPKVKTTPMV--------PNMPHYNVILEEVE 294
F++CL G G ++G D S TTP+ P Y + +
Sbjct: 303 --FSYCLPAATSGDASGSLSLGGGDDAASSYRNTTPVAYTRMIADPAQPPFYFLNVTGAA 360
Query: 295 VGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
VGG L LG + +IDSGT + L P +Y V ++F
Sbjct: 361 VGGTA--LAAQGLGASN---VLIDSGTVITRLAPSVYRAVRAEF 399
>gi|225463766|ref|XP_002267930.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 479
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 92/356 (25%), Positives = 146/356 (41%), Gaps = 46/356 (12%)
Query: 11 VVTVAVVHQWAVGGGGVMGNFVFEVENKFKAGGERERTLSALKQHDTRRHGRMMASIDLE 70
++ + V GG M V + F + L + D +R ++ +
Sbjct: 56 IIPLEVSEDHEEGGEKWMMKVVHRDQLSFGNSDDHRHRLDGRLKRDAKRVASLIRRLSSG 115
Query: 71 LGGN------------GHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCP 118
GG+ G +G YF ++G+G+P Y+ +D+GSD++WV C C++C
Sbjct: 116 GGGSYRVDDFGTDVISGMEQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCTQCY 175
Query: 119 TKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTS 178
+SD +FDP+ S++ ++CS + C N C G RC Y V+YGDGS T
Sbjct: 176 HQSD-----PVFDPADSASFTGVSCSSSVCDRLEN---AGCHAG-RCRYEVSYGDGSYTK 226
Query: 179 GYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLL 238
G + + + + SV GCG+R G + G + S +
Sbjct: 227 GTLALETLTFGRT--------MVRSVAIGCGHRNRGMFVGAAGLLGL-----GGGSMSFV 273
Query: 239 SQLAAAGNVRKEFAHCLDVVKG---GGIFAIGDVVSPK-VKTTPMV--PNMPHYNVI-LE 291
QL G F++CL V +G G G P P+V P P + I L
Sbjct: 274 GQL--GGQTGGAFSYCL-VSRGTDSSGSLVFGREALPAGAAWVPLVRNPRAPSFYYIGLA 330
Query: 292 EVEVGG--NPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFRFWIASL 345
+ VGG P+ L + G ++D+GT + LP + Y F A+L
Sbjct: 331 GLGVGGIRVPISEEVFRLTELGDGGVVMDTGTAVTRLPTLAYQAFRDAFLAQTANL 386
>gi|356527091|ref|XP_003532147.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 482
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 91/330 (27%), Positives = 137/330 (41%), Gaps = 46/330 (13%)
Query: 37 NKFKAGGERERTLSA--LKQHDTRRHGRMMASIDLELGGNGH---------PSATGL--- 82
++ G+ E T+S + D R + + + LGG P+ +G
Sbjct: 77 SQLNHSGKAEATISHNDIMNLDNERVKYIQSRLSKNLGGENRVKELDSTTLPAKSGRLIG 136
Query: 83 ---YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCS-RCPTKSDLGIKLTLFDPSKSSTS 138
Y+ VGLGTP + + DTGS L W C C+ C + D +FDPSKSS+
Sbjct: 137 SADYYVVVGLGTPKRDLSLIFDTGSYLTWTQCEPCAGSCYKQQD-----PIFDPSKSSSY 191
Query: 139 GEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTA 198
I C+ + C T + + S S C Y V YGD S + G+ L+Q +
Sbjct: 192 TNIKCTSSLC-TQFRSAGCSSSTDASCIYDVKYGDNSISRGF-------LSQERLTITAT 243
Query: 199 PLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVV 258
+ +FGCG G + G++G + S + Q ++ N K F++CL
Sbjct: 244 DIVHDFLFGCGQDNEGLFRGTA-----GLMGLSRHPISFVQQTSSIYN--KIFSYCLPST 296
Query: 259 K---GGGIFAIGDVVSPKVKTTPMVP---NMPHYNVILEEVEVGGNPLDLPTSLLGTGDE 312
G F + +K TP Y + + + VGG LP T
Sbjct: 297 PSSLGHLTFGASAATNANLKYTPFSTISGENSFYGLDIVGISVGGT--KLPAVSSSTFSA 354
Query: 313 RGTIIDSGTTLAYLPPMLYDLVLSQFRFWI 342
G+IIDSGT + LPP Y + S FR ++
Sbjct: 355 GGSIIDSGTVITRLPPTAYAALRSAFRQFM 384
>gi|116787367|gb|ABK24480.1| unknown [Picea sitchensis]
Length = 496
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 97/351 (27%), Positives = 146/351 (41%), Gaps = 52/351 (14%)
Query: 13 TVAVVHQWAV---GGGGVMGNFVFEVENKFKAGGERERTLS-----ALK-QHDTRRHGRM 63
+V +VH+ ++ G ++ +E K + R R L LK + D
Sbjct: 72 SVQLVHRDSLLFKGAANATASYERRLEEKLRREAARVRALEQRIERKLKLKKDPAGSYEN 131
Query: 64 MASIDLELGG---NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTK 120
+A + E G +G +G YFT++G+GTPT E Y+ +DTGSD++W+ C C C ++
Sbjct: 132 VAGVTAEFGSEVVSGMEQGSGEYFTRIGIGTPTREQYMVLDTGSDVVWIQCEPCRECYSQ 191
Query: 121 SDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGY 180
+D +F+PS S + + C C N C G C Y V+YGDGS T G
Sbjct: 192 AD-----PIFNPSSSVSFSTVGCDSAVCSQLDAN---DCHGG-GCLYEVSYGDGSYTVGS 242
Query: 181 FVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQ 240
+ + + S +V GCG+ G + G S +Q
Sbjct: 243 YATETLTFGTTS--------IQNVAIGCGHDNVGLFVGAAGLLGLGAGSL-----SFPAQ 289
Query: 241 LAAAGNVRKEFAHCL---DVVKGGGI------FAIGDVVSPKVKTTPMVPNMPHYNVILE 291
L + F++CL D G + IG + +P V P +P Y + +
Sbjct: 290 LGT--QTGRAFSYCLVDRDSESSGTLEFGPESVPIGSIFTPLV-ANPFLPTF--YYLSMV 344
Query: 292 EVEVGGNPLDLPTSLLGTGDER----GTIIDSGTTLAYLPPMLYDLVLSQF 338
+ VGG LD S DE G IIDSGT + L YD + F
Sbjct: 345 AISVGGVILDSVPSEAFRIDETTGRGGIIIDSGTAVTRLQTSAYDALRDAF 395
>gi|116787333|gb|ABK24467.1| unknown [Picea sitchensis]
Length = 497
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 86/282 (30%), Positives = 125/282 (44%), Gaps = 45/282 (15%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPS 133
+G +G YFT++G+GTP Y+ +DTGSD++W+ C C++C ++D LF+P+
Sbjct: 144 SGLAQGSGEYFTRLGVGTPPRYTYMVLDTGSDIMWIQCLPCAKCYGQTD-----PLFNPA 198
Query: 134 KSSTSGEIACSDNFCRT-----TYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQL 188
SST ++ C+ C+ N RY CEY V+YGDGS T G F + +
Sbjct: 199 ASSTYRKVPCATPLCKKLDISGCRNKRY--------CEYQVSYGDGSFTVGDFSTETLTF 250
Query: 189 NQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVR 248
+ V GCG+ G + G+ + S SQ A
Sbjct: 251 R--------GQVIRRVALGCGHDNEGLFIGAAGLLGL-----GRGSLSFPSQTGA--QFS 295
Query: 249 KEFAHCLDVVKGGGI---FAIGDVVSPKVKT-TPMVPNMP---HYNVILEEVEVGGNPL- 300
K F++CL G G PK TP++ N Y V L + VGG L
Sbjct: 296 KRFSYCLVDRSASGTASSLIFGKAAIPKSAIFTPLLSNPKLDTFYYVELVGISVGGRRLT 355
Query: 301 DLPTSLL---GTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
+P S+ TG+ G IIDSGT++ L Y + FR
Sbjct: 356 SIPASVFRMDATGNG-GVIIDSGTSVTRLVDSAYSTMRDAFR 396
>gi|115467014|ref|NP_001057106.1| Os06g0209100 [Oryza sativa Japonica Group]
gi|51091210|dbj|BAD35903.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113595146|dbj|BAF19020.1| Os06g0209100 [Oryza sativa Japonica Group]
gi|125554496|gb|EAZ00102.1| hypothetical protein OsI_22105 [Oryza sativa Indica Group]
Length = 454
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 92/284 (32%), Positives = 121/284 (42%), Gaps = 51/284 (17%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSG 139
T Y V +GTP + +DTGSDL+W CA C C + + DP+ SST
Sbjct: 87 TNEYLMHVSVGTPPRPVALTLDTGSDLVWTQCAPCLDCFEQG----AAPVLDPAASSTHA 142
Query: 140 EIACSDNFCRTTYNNRYPSC---SPGVR-CEYVVTYGDGSSTSGYFVRDIIQL--NQASG 193
+ C CR + SC S G R C YV YGD S T G D + +G
Sbjct: 143 ALPCDAPLCRAL---PFTSCGGRSWGDRSCVYVYHYGDRSLTVGQLATDSFTFGGDDNAG 199
Query: 194 NLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAH 253
L V FGCG+ G A GI GFG+ SL SQL NV F++
Sbjct: 200 GLAA----RRVTFGCGHINKGIF----QANETGIAGFGRGRWSLPSQL----NV-TSFSY 246
Query: 254 CL---------DVVKGGGIFA----------IGDVVSPKVKTTPMVPNMPHYNVILEEVE 294
C VV G A GDV + ++ P P++ Y V L +
Sbjct: 247 CFTSMFDTKSSSVVTLGAAAAELLHTHHAAHTGDVRTTRLIKNPSQPSL--YFVPLRGIS 304
Query: 295 VGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
VGG + +P S L TIIDSG ++ LP +Y+ V ++F
Sbjct: 305 VGGARVAVPESRL----RSSTIIDSGASITTLPEDVYEAVKAEF 344
>gi|115472517|ref|NP_001059857.1| Os07g0533600 [Oryza sativa Japonica Group]
gi|22831047|dbj|BAC15910.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|50508280|dbj|BAD32129.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113611393|dbj|BAF21771.1| Os07g0533600 [Oryza sativa Japonica Group]
gi|215766673|dbj|BAG98901.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 441
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 79/265 (29%), Positives = 119/265 (44%), Gaps = 39/265 (14%)
Query: 78 SATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC-AGCSRCPTKSDLGIKLTLFDPSKSS 136
++T Y + +GTP +DTGSDL+W C A C RC + L+ P++S+
Sbjct: 87 ASTATYLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQ-----PAPLYAPARSA 141
Query: 137 TSGEIACSDNFCRTTYNNRYPSCSP-GVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNL 195
T ++C C+ + + CSP C Y +YGDG+ST G + L +
Sbjct: 142 TYANVSCRSPMCQA-LQSPWSRCSPPDTGCAYYFSYGDGTSTDGVLATETFTLGSDTA-- 198
Query: 196 KTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHC- 254
V FGCG +LGS+ +++ G++G G+ SL+SQL F++C
Sbjct: 199 -----VRGVAFGCGTE---NLGSTDNSS--GLVGMGRGPLSLVSQLGV-----TRFSYCF 243
Query: 255 --LDVVKGGGIFAIGDV-VSPKVKTTPMVPN--------MPHYNVILEEVEVGGN--PLD 301
+ +F +S KTTP VP+ +Y + LE + VG P+D
Sbjct: 244 TPFNATAASPLFLGSSARLSSAAKTTPFVPSPSGGARRRSSYYYLSLEGITVGDTLLPID 303
Query: 302 LPTSLLGTGDERGTIIDSGTTLAYL 326
L + G IIDSGTT L
Sbjct: 304 PAVFRLTPMGDGGVIIDSGTTFTAL 328
>gi|302758122|ref|XP_002962484.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
gi|300169345|gb|EFJ35947.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
Length = 395
Score = 94.7 bits (234), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 92/279 (32%), Positives = 132/279 (47%), Gaps = 34/279 (12%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSG 139
+G YF ++ +GTP ++ + +DTGSDL W+ C + T + +D S SS+
Sbjct: 24 SGQYFVELRVGTPAKKFPLIIDTGSDLTWIQCNPPNT--TANSSSPPAPWYDKSSSSSYR 81
Query: 140 EIACSDNFCRTTYNNRYPSCS--PGVRCEYVVTYGDGSSTSGYFVRDIIQLN------QA 191
EI C+D+ C SCS C+Y Y D S T+G + I + +
Sbjct: 82 EIPCTDDECLFLPAPIGSSCSIKSPSPCDYTYGYSDQSRTTGILAYETISMKSRKRSGKR 141
Query: 192 SGNLKTAPLN-SSVIFGCGNRQSGD--LGSSTDAAVDGILGFGQANSSLLSQL--AAAGN 246
+GN KT + +V GC G LG+S G+LG GQ SL +Q A G
Sbjct: 142 AGNHKTRTIRIKNVALGCSRESVGASFLGAS------GVLGLGQGPISLATQTRHTALGG 195
Query: 247 VRKEFAHCL-DVVKG---GGIFAIGDVVSPKVKTTPMVPN---MPHYNVILEEVEVGGNP 299
+ F++CL D ++G +G K+ TP+V N Y V + V V G P
Sbjct: 196 I---FSYCLVDYLRGSNASSFLVMGRTRWRKLAHTPIVRNPAAQSFYYVNVTGVAVDGKP 252
Query: 300 LD-LPTSLLGT-GD-ERGTIIDSGTTLAYLPPMLYDLVL 335
+D + +S G GD +GTI DSGTTL+YL Y VL
Sbjct: 253 VDGIASSDWGIDGDGNKGTIFDSGTTLSYLREPAYSKVL 291
>gi|125558631|gb|EAZ04167.1| hypothetical protein OsI_26309 [Oryza sativa Indica Group]
Length = 441
Score = 94.7 bits (234), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 79/265 (29%), Positives = 119/265 (44%), Gaps = 39/265 (14%)
Query: 78 SATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC-AGCSRCPTKSDLGIKLTLFDPSKSS 136
++T Y + +GTP +DTGSDL+W C A C RC + L+ P++S+
Sbjct: 87 ASTATYLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQ-----PAPLYAPARSA 141
Query: 137 TSGEIACSDNFCRTTYNNRYPSCSP-GVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNL 195
T ++C C+ + + CSP C Y +YGDG+ST G + L +
Sbjct: 142 TYANVSCRSPMCQA-LQSPWSRCSPPDTGCAYYFSYGDGTSTDGVLATETFTLGSDTA-- 198
Query: 196 KTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHC- 254
V FGCG +LGS+ +++ G++G G+ SL+SQL F++C
Sbjct: 199 -----VRGVAFGCGTE---NLGSTDNSS--GLVGMGRGPLSLVSQLGV-----TRFSYCF 243
Query: 255 --LDVVKGGGIFAIGDV-VSPKVKTTPMVPN--------MPHYNVILEEVEVGGN--PLD 301
+ +F +S KTTP VP+ +Y + LE + VG P+D
Sbjct: 244 TPFNATAASPLFLGSSARLSSAAKTTPFVPSPSGGARRRSSYYYLSLEGITVGDTLLPID 303
Query: 302 LPTSLLGTGDERGTIIDSGTTLAYL 326
L + G IIDSGTT L
Sbjct: 304 PAVFRLTPMGDGGVIIDSGTTFTAL 328
>gi|356527532|ref|XP_003532363.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 429
Score = 94.7 bits (234), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 93/309 (30%), Positives = 133/309 (43%), Gaps = 36/309 (11%)
Query: 37 NKFKAGGERERTLSALKQHDTRRHGRMM----ASIDLELGGNGHPSATGLYFTKVGLGTP 92
NK K+G S L T R++ +SI L L GN +P G Y + +G P
Sbjct: 26 NKHKSGRN-----SILPSEATSSRSRLLNPAGSSIVLPLYGNVYP--VGFYNVTLNIGQP 78
Query: 93 TDEYYVQVDTGSDLLWVNC-AGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTT 151
Y++ VDTGSDL W+ C A C+ C L+ PS + C D C +
Sbjct: 79 ARPYFLDVDTGSDLTWLQCDAPCTHCSETPH-----PLYRPSNDF----VPCRDPLCASL 129
Query: 152 YNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNR 211
+C +C+Y + Y D ST G + D+ LN +G L + GCG
Sbjct: 130 QPTEDYNCEHPDQCDYEINYADQYSTFGVLLNDVYLLNFTNG----VQLKVRMALGCGYD 185
Query: 212 QSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVS 271
Q S+ +DG+LG G+ +SL+SQL + G VR HCL GG IF S
Sbjct: 186 QV--FSPSSYHPLDGLLGLGRGKASLISQLNSQGLVRNVIGHCLSAQGGGYIFFGNAYDS 243
Query: 272 PKVKTTPMVP-NMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPML 330
+V TP+ + HY+ E+ GG G G + D+G++ Y
Sbjct: 244 ARVTWTPISSVDSKHYSAGPAELVFGGRK-------TGVG-SLTAVFDTGSSYTYFNSHA 295
Query: 331 YDLVLSQFR 339
Y +LS +
Sbjct: 296 YQALLSWLK 304
>gi|148907930|gb|ABR17085.1| unknown [Picea sitchensis]
Length = 498
Score = 94.7 bits (234), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 96/353 (27%), Positives = 153/353 (43%), Gaps = 56/353 (15%)
Query: 13 TVAVVHQWAV---GGGGVMGNFVFEVENKFKA------GGER--ERTLSALKQHDTRRHG 61
+V VVH+ A+ ++ ++ K + G ER ERTL+ K R
Sbjct: 75 SVEVVHRDALLLKNAANATASYERRLKEKLRREAVRVRGLERQIERTLTLNKDPVNRYEN 134
Query: 62 RMMASIDLELGG---NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCP 118
+A +D + GG +G +G YFT++G+GTPT E Y+ +DTGSD+ W+ C C C
Sbjct: 135 --VAEVDADFGGEVVSGMEQGSGEYFTRIGVGTPTREQYMVLDTGSDVAWIQCEPCRECY 192
Query: 119 TKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTS 178
+++D +F+PS S++ + C C + Y S G C Y +YGDGS ++
Sbjct: 193 SQAD-----PIFNPSYSASFSTVGCDSAVCSQL--DAYDCHSGG--CLYEASYGDGSYST 243
Query: 179 GYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLL 238
G F + + S ++V GCG++ G + G S
Sbjct: 244 GSFATETLTFGTTS--------VANVAIGCGHKNVGLFIGAAGLLGL-----GAGALSFP 290
Query: 239 SQLAAAGNVRKEFAHCL---------DVVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVI 289
+Q+ F++CL + G +G + +P ++ P +P Y +
Sbjct: 291 NQIGT--QTGHTFSYCLVDRESDSSGPLQFGPKSVPVGSIFTP-LEKNPHLPTF--YYLS 345
Query: 290 LEEVEVGGNPLD-LPTSLL---GTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
+ + VGG LD +P + T G IIDSGT + L YD V F
Sbjct: 346 VTAISVGGALLDSIPPEVFRIDETSGHGGFIIDSGTVVTRLVTSAYDAVRDAF 398
>gi|12323376|gb|AAG51657.1|AC010704_1 nucellin-like protein; 27671-25467 [Arabidopsis thaliana]
Length = 427
Score = 94.7 bits (234), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 80/285 (28%), Positives = 126/285 (44%), Gaps = 32/285 (11%)
Query: 59 RHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC-AGCSRC 117
++ R+ +++ + GN +P G Y+ + +G P + + +DTGSDL WV C A C+ C
Sbjct: 45 QNRRLSSTVVFPVSGNVYP--LGYYYVLLNIGNPPKLFDLDIDTGSDLTWVQCDAPCNGC 102
Query: 118 PTKSDLGIKLTLFDPSKSSTSGEIACSDNFCR-TTYNNRYPSCSPGVRCEYVVTYGDGSS 176
T + P+ ++ + CS C P P +C+Y + Y D +S
Sbjct: 103 ----------TKYKPNHNT----LPCSHILCSGLDLPQDRPCADPEDQCDYEIGYSDHAS 148
Query: 177 TSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSS 236
+ G V D + L A+G++ +N + FGCG Q + G GILG G+
Sbjct: 149 SIGALVTDEVPLKLANGSI----MNLRLTFGCGYDQQ-NPGPHPPPPTAGILGLGRGKVG 203
Query: 237 LLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSPK--VKTTPMVPNMPHYNVILEEVE 294
L +QL + G + HCL G G +IGD + P V T + N P N + E
Sbjct: 204 LSTQLKSLGITKNVIVHCLSHT-GKGFLSIGDELVPSSGVTWTSLATNSPSKNYMAGPAE 262
Query: 295 VGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
+ N D T + G + DSG++ Y Y +L R
Sbjct: 263 LLFN--DKTTGVKGI----NVVFDSGSSYTYFNAEAYQAILDLIR 301
>gi|30699263|ref|NP_177872.3| aspartyl protease-like protein [Arabidopsis thaliana]
gi|332197862|gb|AEE35983.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 432
Score = 94.7 bits (234), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 81/285 (28%), Positives = 128/285 (44%), Gaps = 27/285 (9%)
Query: 59 RHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC-AGCSRC 117
++ R+ +++ + GN +P G Y+ + +G P + + +DTGSDL WV C A C+ C
Sbjct: 45 QNRRLSSTVVFPVSGNVYP--LGYYYVLLNIGNPPKLFDLDIDTGSDLTWVQCDAPCNGC 102
Query: 118 PTKSDLGIKLTLFDPSKSSTSGEIACSDNFCR-TTYNNRYPSCSPGVRCEYVVTYGDGSS 176
TK + + P+ ++ + CS C P P +C+Y + Y D +S
Sbjct: 103 -TKP----RAKQYKPNHNT----LPCSHILCSGLDLPQDRPCADPEDQCDYEIGYSDHAS 153
Query: 177 TSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSS 236
+ G V D + L A+G++ +N + FGCG Q + G GILG G+
Sbjct: 154 SIGALVTDEVPLKLANGSI----MNLRLTFGCGYDQQ-NPGPHPPPPTAGILGLGRGKVG 208
Query: 237 LLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSPK--VKTTPMVPNMPHYNVILEEVE 294
L +QL + G + HCL G G +IGD + P V T + N P N + E
Sbjct: 209 LSTQLKSLGITKNVIVHCLSHT-GKGFLSIGDELVPSSGVTWTSLATNSPSKNYMAGPAE 267
Query: 295 VGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
+ N D T + G + DSG++ Y Y +L R
Sbjct: 268 LLFN--DKTTGVKGI----NVVFDSGSSYTYFNAEAYQAILDLIR 306
>gi|297825301|ref|XP_002880533.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
lyrata]
gi|297326372|gb|EFH56792.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
lyrata]
Length = 430
Score = 94.7 bits (234), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 84/325 (25%), Positives = 134/325 (41%), Gaps = 54/325 (16%)
Query: 46 ERTLSALKQHDTRRHGRMMASIDLELGGNG------HPSATGLYFTKVGLGTPTDEYYVQ 99
E + + + R + SI ELG + T L+F +G P +
Sbjct: 25 EDHIQHMTDISSARFKYLQNSIVKELGSSDFQVDVHQAIKTSLFFVNFSVGQPPVPQFTI 84
Query: 100 VDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSC 159
+DTGS LLW+ C C C + + +F+P+ SST E +C D FCR N C
Sbjct: 85 MDTGSSLLWIQCHPCKHCSSNHMIH---PVFNPALSSTFVECSCDDRFCRYAPNGH---C 138
Query: 160 SPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGN-LKTAPLNSSVIFGCGNRQSGDLGS 218
S +C Y Y G+ + G ++ + +GN + T P + FGCG+ G
Sbjct: 139 SSN-KCVYEQVYISGTGSKGVLAKERLTFTTPNGNTVVTQP----IAFGCGHEN----GE 189
Query: 219 STDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL-----------DVVKGGGIFAIG 267
++ GILG G +SL QL + +F++C+ +V G +G
Sbjct: 190 QLESEFTGILGLGAKPTSLAVQLGS------KFSYCIGDLANKNYGYNQLVLGEDADILG 243
Query: 268 DVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDL-PTSLLGTGDERGTIIDSGTTLAYL 326
D + +T + Y + LE + VG L++ P G G I+D+GT +L
Sbjct: 244 DPTPIEFETENGI-----YYMNLEGISVGDKQLNIEPVVFKRRGSRTGVILDTGTLYTWL 298
Query: 327 PPMLYDLVLSQF---------RFWI 342
+ Y + ++ RFW
Sbjct: 299 ADIAYRELYNEIKSILDPKLERFWF 323
>gi|30699261|ref|NP_850981.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|17065172|gb|AAL32740.1| nucellin-like protein [Arabidopsis thaliana]
gi|24899795|gb|AAN65112.1| nucellin-like protein [Arabidopsis thaliana]
gi|332197863|gb|AEE35984.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 466
Score = 94.7 bits (234), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 81/285 (28%), Positives = 128/285 (44%), Gaps = 27/285 (9%)
Query: 59 RHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC-AGCSRC 117
++ R+ +++ + GN +P G Y+ + +G P + + +DTGSDL WV C A C+ C
Sbjct: 45 QNRRLSSTVVFPVSGNVYP--LGYYYVLLNIGNPPKLFDLDIDTGSDLTWVQCDAPCNGC 102
Query: 118 PTKSDLGIKLTLFDPSKSSTSGEIACSDNFCR-TTYNNRYPSCSPGVRCEYVVTYGDGSS 176
TK + + P+ ++ + CS C P P +C+Y + Y D +S
Sbjct: 103 -TKP----RAKQYKPNHNT----LPCSHILCSGLDLPQDRPCADPEDQCDYEIGYSDHAS 153
Query: 177 TSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSS 236
+ G V D + L A+G++ +N + FGCG Q + G GILG G+
Sbjct: 154 SIGALVTDEVPLKLANGSI----MNLRLTFGCGYDQQ-NPGPHPPPPTAGILGLGRGKVG 208
Query: 237 LLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSPK--VKTTPMVPNMPHYNVILEEVE 294
L +QL + G + HCL G G +IGD + P V T + N P N + E
Sbjct: 209 LSTQLKSLGITKNVIVHCLSHT-GKGFLSIGDELVPSSGVTWTSLATNSPSKNYMAGPAE 267
Query: 295 VGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
+ N D T + G + DSG++ Y Y +L R
Sbjct: 268 LLFN--DKTTGVKGI----NVVFDSGSSYTYFNAEAYQAILDLIR 306
>gi|357162717|ref|XP_003579500.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 488
Score = 94.7 bits (234), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 90/288 (31%), Positives = 128/288 (44%), Gaps = 43/288 (14%)
Query: 76 HPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAG---CSRCPTKSDLGIKLTLFDP 132
+P + G Y V LGTP V +DTGS L WV C C C + + +F P
Sbjct: 84 YPHSYGGYAFSVSLGTPPQPLPVLLDTGSHLSWVPCTSSYQCRNCSSSPSAMSAMAVFHP 143
Query: 133 SKSSTSGEIACSDNFCRTTYNNRYPSCSP------GVRCE-YVVTYGDGSSTSGYFVRDI 185
SS+S + C + CR ++ +C G C Y+V YG GS TSG + D
Sbjct: 144 KNSSSSRLVGCRNPACRWIHSKSPSTCGSTGNNGNGDVCPPYLVVYGSGS-TSGLLISDT 202
Query: 186 IQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAG 245
++L+ +S + AP + I GC S G+ GFG+ S+ SQL
Sbjct: 203 LRLSPSSSSSAPAPFRNFAI-GCSI-------VSVHQPPSGLAGFGRGAPSVPSQLKV-- 252
Query: 246 NVRKEFAHCL------DVVKGGGIFAIGDVVSP--KVKTT----PMV-------PNMPHY 286
+F++CL D G +GD + P K KTT P++ P +Y
Sbjct: 253 ---PKFSYCLLSRRFDDNSAVSGELVLGDAMVPAGKKKTTMQYVPLLNNAASKPPYSVYY 309
Query: 287 NVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLV 334
+ L + VGG P++LP+ G IIDSGTT YL P ++ V
Sbjct: 310 YLALTGISVGGKPVNLPSRAFVPSSGGGAIIDSGTTFTYLDPTVFKPV 357
>gi|359476204|ref|XP_002262813.2| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
nepenthesin-2-like [Vitis vinifera]
Length = 460
Score = 94.7 bits (234), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 84/262 (32%), Positives = 115/262 (43%), Gaps = 49/262 (18%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGE 140
G + V GTP E + +DTGS + W C C C S+ FD S SST
Sbjct: 126 GNFLVDVAFGTPXTEIXLILDTGSSITWTQCKACVNCLQDSN-----RYFDSSASSTY-- 178
Query: 141 IACSDNFCRTTYNNRYPSCSPG-VRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAP 199
+ SC P V Y +TYGD S++ G + D + L+ +
Sbjct: 179 --------------SFGSCIPSTVENNYNMTYGDDSTSVGNYGCDTM-------TLEPSD 217
Query: 200 LNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVK 259
+ FGCG GD GS VDG+LG GQ S +SQ A+ N K F++CL
Sbjct: 218 VFQKFQFGCGRNNKGDFGS----GVDGMLGLGQGQLSTVSQTASKFN--KVFSYCLPEED 271
Query: 260 GGGIFAIGDVV---SPKVKTTPMVPNMP-------HYNVILEEVEVGGNPLDLPTSLLGT 309
G G+ S +K T +V N P +Y V L ++ VG L++P+S+ +
Sbjct: 272 SIGSLLFGEKATSQSSSLKFTSLV-NGPGTLQESGYYFVNLSDISVGNERLNIPSSVFAS 330
Query: 310 GDERGTIIDSGTTLAYLPPMLY 331
GTIIDS T + LP Y
Sbjct: 331 ---PGTIIDSRTVITRLPQRAY 349
>gi|357130848|ref|XP_003567056.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 448
Score = 94.4 bits (233), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 84/267 (31%), Positives = 119/267 (44%), Gaps = 44/267 (16%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPS 133
+G P A+G YF VG+GTP + +DTGSD++W+ C C C + L+DP
Sbjct: 90 SGLPFASGEYFASVGVGTPPTPALLVIDTGSDVVWLQCKPCVHCYRQLS-----PLYDPR 144
Query: 134 KSSTSGEIACSDNFCRTTYNNRYPSCSPGVR--CEYVVTYGDGSSTSGYFVRD--IIQLN 189
SST + CS CR P G C Y + YGD SSTSG D + +
Sbjct: 145 GSSTYAQTPCSPPQCRN------PQTCDGTTGGCGYRIVYGDASSTSGNLATDRLVFSND 198
Query: 190 QASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRK 249
+ GN V GCG+ G GS+ G+LG + N+S +Q+ A + +
Sbjct: 199 TSVGN---------VTLGCGHDNEGLFGSAA-----GLLGVARGNNSFATQV--ADSYGR 242
Query: 250 EFAHCL-DVVKGGG-----IFAIGDVVSPKVKTTPMV--PNMPH-YNVILEEVEVGGNPL 300
FA+CL D + G +F P TP+ P P Y V + VGG P+
Sbjct: 243 YFAYCLGDRTRSGSSSSYLVFGRTAPEPPSSVFTPLRSNPRRPSLYYVDMVGFSVGGEPV 302
Query: 301 ----DLPTSLLGTGDERGTIIDSGTTL 323
+ SL G ++DSGT++
Sbjct: 303 TGFSNASLSLDPATGRGGVVVDSGTSI 329
>gi|297852200|ref|XP_002893981.1| hypothetical protein ARALYDRAFT_314121 [Arabidopsis lyrata subsp.
lyrata]
gi|297339823|gb|EFH70240.1| hypothetical protein ARALYDRAFT_314121 [Arabidopsis lyrata subsp.
lyrata]
Length = 354
Score = 94.4 bits (233), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 73/235 (31%), Positives = 102/235 (43%), Gaps = 29/235 (12%)
Query: 64 MASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC----AGCSRCPT 119
++S+ L L GN P G Y + +GTP + +DTGSDL WV C GC+ P
Sbjct: 37 LSSVVLPLSGNVFP--LGYYSVLLQIGTPPKAFEFDIDTGSDLTWVQCDAPCTGCTLPPI 94
Query: 120 KSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSC-SPGVRCEYVVTYGDGSSTS 178
+ + P ++ + C D C + P C +P +C+Y V Y D S+
Sbjct: 95 RQ--------YKPKGNT----VPCLDPICLALHFPNKPQCPNPKEQCDYEVNYADQGSSM 142
Query: 179 GYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLL 238
G V D L +G + + + FGCG Q + A G+LG G+ +L
Sbjct: 143 GALVIDQFPLKLLNG----SAMQPRLAFGCGYDQILP-KAHPPPATAGVLGLGRGKIGVL 197
Query: 239 SQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSPK--VKTTPMVPNMPHYNVILE 291
QL AAG R HCL KGGG GD + P V TP++ P Y
Sbjct: 198 PQLVAAGLTRNVVGHCLS-SKGGGYLFFGDTLIPTLGVAWTPLL--SPEYTFFFH 249
>gi|37542275|gb|AAK81698.1| aspartyl proteinase [Oryza sativa]
Length = 410
Score = 94.4 bits (233), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 83/281 (29%), Positives = 121/281 (43%), Gaps = 36/281 (12%)
Query: 65 ASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVN----CAGCSRCPTK 120
+++ LEL GN +P G +F + + P Y++ +DTGS L W+ C C++ P
Sbjct: 22 SAVVLELHGNVYP--IGHFFVTMNISDPAKPYFLDIDTGSTLTWLQCDYPCINCNKVPH- 78
Query: 121 SDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNN-RYP-SCSPGVRCEYVVTYGDGSSTS 178
L+ P + C++ C Y + R P C P +C Y + Y GSS
Sbjct: 79 -------GLYKPELKYA---VKCTEQRCADLYADLRKPMKCGPKNQCHYGIQYVGGSSI- 127
Query: 179 GYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLL 238
G + D L ++G T P +S+ FGCG Q G + V+GILG G+ +LL
Sbjct: 128 GVLIVDSFSLPASNG---TNP--TSIAFGCGYNQ-GKNNHNVPTPVNGILGLGRGKVTLL 181
Query: 239 SQLAAAGNVRKE-FAHCLDVVKGGGIFAIGDVVSPK--VKTTPMVPNMPHYNVILEEVEV 295
SQL + G + K HC+ KG G GD P V +PM HY+ +
Sbjct: 182 SQLKSQGVITKHVLGHCIS-SKGKGFLFFGDAKVPTSGVTWSPMNREHKHYSPRQGTLHF 240
Query: 296 GGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLS 336
N + + + I DSG T Y Y LS
Sbjct: 241 NSNSKPISAAPM------EVIFDSGATYTYFALQPYHATLS 275
>gi|255541796|ref|XP_002511962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223549142|gb|EEF50631.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 495
Score = 94.4 bits (233), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 84/275 (30%), Positives = 124/275 (45%), Gaps = 39/275 (14%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPS 133
+G +G YFT+VG+G P YY+ +DTGSD+ W+ C CS C +SD +F P+
Sbjct: 150 SGTSQGSGEYFTRVGVGNPAKSYYMVLDTGSDINWIQCQPCSDCYQQSD-----PIFTPA 204
Query: 134 KSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASG 193
SS+ + C C + + SC G +C Y V YGDGS T G FV + + SG
Sbjct: 205 ASSSYSPLTCDSQQCNSL---QMSSCRNG-QCRYQVNYGDGSFTFGDFVTETMSFG-GSG 259
Query: 194 NLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAH 253
+ S+ GCG+ G + G SL SQL A F++
Sbjct: 260 TVN------SIALGCGHDNEGLFVGAAGLLGLGGGPL-----SLTSQLKAT-----SFSY 303
Query: 254 CL---DVVKGGGI----FAIGD-VVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTS 305
CL D + +GD V++P +K++ + Y V L + VGG L +P
Sbjct: 304 CLVNRDSAASSTLDFNSAPVGDSVIAPLLKSSKI---DTFYYVGLSGMSVGGELLRIPQE 360
Query: 306 LLGTGD--ERGTIIDSGTTLAYLPPMLYDLVLSQF 338
+ D + G I+D GT + L Y+ + F
Sbjct: 361 VFKLDDSGDGGVIVDCGTAITRLQSEAYNSLRDSF 395
>gi|356555248|ref|XP_003545946.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 453
Score = 94.4 bits (233), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 92/337 (27%), Positives = 147/337 (43%), Gaps = 49/337 (14%)
Query: 18 HQWAVGGGGVMGNFVFEVENKFKAGGERERTLS----ALKQHDTRRHGRMMASIDLELG- 72
H+ GGGG + + V V+ K R + ++ + +D+RR G M + E+
Sbjct: 42 HERFAGGGGDV-DRVEAVKGFVKRDKLRRQRMNQRWGVVSNYDSRRKGFEMTTTPAEVEM 100
Query: 73 --GNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLF 130
+G A G YF +V +G+P +++ VDTGS+ W+NC
Sbjct: 101 PMHSGRDDALGEYFAEVKVGSPGQRFWLVVDTGSEFTWLNC------------------- 141
Query: 131 DPSKSSTSGEIACSDNFCRTTYNNRYPSC---SPGVRCEYVVTYGDGSSTSGYFVRDIIQ 187
S + + C+ C+ + + P C Y ++Y DGSS G+F D I
Sbjct: 142 ----SKSFEAVTCASRKCKVDLSELFSLSVCPKPSDPCLYDISYADGSSAKGFFGTDSIT 197
Query: 188 LNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNV 247
+ +G K LN+ I GC +S G + + GILG G A S + + AA
Sbjct: 198 VGLTNG--KQGKLNNLTI-GC--TKSMLNGVNFNEETGGILGLGFAKDSFIDK--AANKY 250
Query: 248 RKEFAHCL-DVVKGGGI---FAIGDVVSPK----VKTTPMVPNMPHYNVILEEVEVGGNP 299
+F++CL D + + IG + K ++ T ++ P Y V + + +GG
Sbjct: 251 GAKFSYCLVDHLSHRSVSSNLTIGGHHNAKLLGEIRRTELILFPPFYGVNVVGISIGGQM 310
Query: 300 LDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLS 336
L +P + E GT+IDSGTTL L Y+ V
Sbjct: 311 LKIPPQVWDFNAEGGTLIDSGTTLTSLLLPAYEAVFE 347
>gi|224092218|ref|XP_002309514.1| predicted protein [Populus trichocarpa]
gi|222855490|gb|EEE93037.1| predicted protein [Populus trichocarpa]
Length = 474
Score = 94.4 bits (233), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 78/263 (29%), Positives = 120/263 (45%), Gaps = 27/263 (10%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGC--SRCPTKSDLGIKLTLFDPSKSST 137
TG Y VGLGTP +++ + DTGS + W C C S P K FDP+KS++
Sbjct: 132 TGNYVVTVGLGTPKEDFTLVFDTGSGITWTQCQPCLGSCYPQKEQ------KFDPTKSTS 185
Query: 138 SGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKT 197
++CS C + + C Y + YGD S + G+F + + + +
Sbjct: 186 YNNVSCSSASCNLLPTSERGCSASNSTCLYQIIYGDQSYSQGFFATETL-------TISS 238
Query: 198 APLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL-D 256
+ + ++ +FGCG +G G + G+LG ++ SL SQ A +K+F++CL
Sbjct: 239 SDVFTNFLFGCGQSNNGLFGQAA-----GLLGLSSSSVSLPSQTAE--KYQKQFSYCLPS 291
Query: 257 VVKGGGIFAIGDVVSPKVKTTPMVPNM-PHYNVILEEVEVGGNPLDLPTSLLGTGDERGT 315
G G VS TP+ P Y + + + V G+ L + S+ T G
Sbjct: 292 TPSSTGYLNFGGKVSQTAGFTPISPAFSSFYGIDIVGISVAGSQLPIDPSIFTT---SGA 348
Query: 316 IIDSGTTLAYLPPMLYDLVLSQF 338
IIDSGT + LPP Y + F
Sbjct: 349 IIDSGTVITRLPPTAYKALKEAF 371
>gi|414880708|tpg|DAA57839.1| TPA: hypothetical protein ZEAMMB73_997765 [Zea mays]
Length = 452
Score = 94.0 bits (232), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 84/282 (29%), Positives = 122/282 (43%), Gaps = 41/282 (14%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPS 133
+G P +G YF +G+G P V +DTGSDL+W+ C C RC + L+DP
Sbjct: 83 SGVPFDSGEYFAVIGVGDPPTHALVVIDTGSDLIWLQCLPCRRCYRQ-----VTPLYDPR 137
Query: 134 KSSTSGEIACSDNFCRTTYNNRYPSCSPGV-RCEYVVTYGDGSSTSGYFVRDIIQLNQAS 192
S T I C+ CR RYP C C Y+V YGDGS++SG D + L +
Sbjct: 138 NSKTHRRIPCASPQCRGVL--RYPGCDARTGGCVYMVVYGDGSASSGDLATDTLVLPDDT 195
Query: 193 GNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAA-GNVRKEF 251
+V GCG+ G L S+ G+LG G+ S +QLA A G+V F
Sbjct: 196 -------RVHNVTLGCGHDNEGLLASAA-----GLLGAGRGQLSFPTQLAPAYGHV---F 240
Query: 252 AHCL-----------DVVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPL 300
++CL + G + ++T P P++ Y V + VGG +
Sbjct: 241 SYCLGDRMSRARNSSSYLVFGRTPELPSTAFTPLRTNPRRPSL--YYVDMVGFSVGGERV 298
Query: 301 ----DLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
+ +L G ++DSGT ++ Y V F
Sbjct: 299 AGFSNASLALNPATGRGGVVVDSGTAISRFTRDAYAAVRDAF 340
>gi|302793638|ref|XP_002978584.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
gi|300153933|gb|EFJ20570.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
Length = 407
Score = 94.0 bits (232), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 82/277 (29%), Positives = 115/277 (41%), Gaps = 36/277 (12%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSG 139
+G YF ++GLGTP ++ VDTGSDL W+ C C C ++D +FDP SS+
Sbjct: 51 SGEYFVRLGLGTPARSLFMVVDTGSDLPWLQCQPCKSCYKQAD-----PIFDPRNSSSFQ 105
Query: 140 EIACSDNFCRTTYNNRYPSCS----PGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNL 195
I C C+ + SCS RC Y V YGDGS + G F D+ L S +
Sbjct: 106 RIPCLSPLCKALEVH---SCSGSRGATSRCSYQVAYGDGSFSVGDFSSDLFTLGTGSKAM 162
Query: 196 KTAPLNSSVIFGCG--NRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAH 253
SV FGCG N + G L F S + + + F++
Sbjct: 163 -------SVAFGCGFDNEGLFAGAAGLLGLGAGKLSF----PSQIFASSTNSSTANSFSY 211
Query: 254 CL-----DVVKGGGIFAIGDVVSPKVKT-TPMVPNMP---HYNVILEEVEVGGN--PLDL 302
CL + + G P +P++ N Y + V VGG P+ L
Sbjct: 212 CLVDRSNPMTRSSSSLIFGVAAIPSTAALSPLLKNPKLDTFYYAAMIGVSVGGAQLPISL 271
Query: 303 PTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
+ L G IIDSGT++ P +Y + FR
Sbjct: 272 KSLQLSQSGSGGVIIDSGTSVTRFPTSVYATIRDAFR 308
>gi|302774174|ref|XP_002970504.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
gi|300162020|gb|EFJ28634.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
Length = 483
Score = 94.0 bits (232), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 81/277 (29%), Positives = 115/277 (41%), Gaps = 36/277 (12%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSG 139
+G YF ++G+GTP ++ VDTGSDL W+ C C C ++D +FDP SS+
Sbjct: 126 SGEYFVRLGVGTPARSLFMVVDTGSDLPWLQCQPCKSCYKQAD-----PIFDPRNSSSFQ 180
Query: 140 EIACSDNFCRTTYNNRYPSCS----PGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNL 195
I C C+ + SCS RC Y V YGDGS + G F D+ L S +
Sbjct: 181 RIPCLSPLCKALEIH---SCSGSRGATSRCSYQVAYGDGSFSVGDFSSDLFTLGTGSKAM 237
Query: 196 KTAPLNSSVIFGCG--NRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAH 253
SV FGCG N + G L F S + + + F++
Sbjct: 238 -------SVAFGCGFDNEGLFAGAAGLLGLGAGKLSF----PSQIFASSTNSSTANSFSY 286
Query: 254 CL-----DVVKGGGIFAIGDVVSPKVKT-TPMVPNMP---HYNVILEEVEVGGN--PLDL 302
CL + + G P +P++ N Y + V VGG P+ L
Sbjct: 287 CLVDRSNPMTRSSSSLIFGAAAIPSTAALSPLLKNPKLDTFYYAAMIGVSVGGAQLPISL 346
Query: 303 PTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
+ L G IIDSGT++ P +Y + FR
Sbjct: 347 KSLQLSQSGSGGVIIDSGTSVTRFPTSVYATIRDAFR 383
>gi|17979392|gb|AAL49921.1| unknown protein [Arabidopsis thaliana]
Length = 439
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 82/309 (26%), Positives = 138/309 (44%), Gaps = 36/309 (11%)
Query: 56 DTRRHGRMMASIDLELG-----GNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVN 110
D +RH + + +G G+G T YFT++ +GTP ++ V VDTGS+L WVN
Sbjct: 52 DQKRHSLISRKRNSTVGVKMDLGSGIDYGTAQYFTEIRVGTPAKKFRVVVDTGSELTWVN 111
Query: 111 CAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYP--SC-SPGVRCEY 167
C +R +F +S + + C C+ N + +C +P C Y
Sbjct: 112 CRYRARGKDNRR------VFRADESKSFKTVGCLTQTCKVDLMNLFSLTTCPTPSTPCSY 165
Query: 168 VVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGI 227
Y DGS+ G F ++ I + +G + P + + GC + +G + DG+
Sbjct: 166 DYRYADGSAAQGVFAKETITVGLTNGRMARLPGH---LIGCSSSFTGQ----SFQGADGV 218
Query: 228 LGFGQANSSLLSQLAAAGNVRKEFAHCL-DVVKGGGI---FAIGDVVSPKV---KTTPM- 279
LG ++ S S A +F++CL D + + G S K +TTP+
Sbjct: 219 LGLAFSDFSFTS--TATSLYGAKFSYCLVDHLSNKNVSNYLIFGSSRSTKTAFRRTTPLD 276
Query: 280 ---VPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLS 336
+P P Y + + + +G + LD+P+ + GTI+DSGT+L L Y V++
Sbjct: 277 LTRIP--PFYAINVIGISLGYDMLDIPSQVWDATSGGGTILDSGTSLTLLADAAYKQVVT 334
Query: 337 QFRFWIASL 345
++ L
Sbjct: 335 GLARYLVEL 343
>gi|357131735|ref|XP_003567490.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 458
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 86/275 (31%), Positives = 123/275 (44%), Gaps = 32/275 (11%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSG 139
T Y + LGTP V +D +D WV C+ C C G FDP++SST
Sbjct: 97 TPSYVARARLGTPPQTLLVAIDPSNDAAWVPCSACLGC----APGASSPSFDPTQSSTYR 152
Query: 140 EIACSDNFCRTTYNNRYPSCS--PGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKT 197
+ C C PSC PG C + ++Y S+ +D + L+ ++G
Sbjct: 153 PVRCGAPQC-AQVPPATPSCPAGPGASCAFNLSYAS-STLHAVLGQDALSLSDSNG---A 207
Query: 198 APLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAA-GNVRKEFAHCLD 256
A + FGC +G GS G++GFG+ S LSQ A G++ F++CL
Sbjct: 208 AVPDDHYTFGCLRVVTGSGGSVPP---QGLVGFGRGPLSFLSQTKATYGSI---FSYCLP 261
Query: 257 VVKG---GGIFAIGDVVSP-KVKTTPMVPNMPH----YNVILEEVEVGGNPLDLPTSLL- 307
K G +G P ++KTTP++ N PH Y V + V V G + +P S L
Sbjct: 262 SYKSSNFSGTLRLGPAGQPRRIKTTPLLSN-PHRPSLYYVAMVGVRVNGKAVPIPASALA 320
Query: 308 ---GTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
TG GTI+D+GT L P Y + + FR
Sbjct: 321 LDAATG-RGGTIVDAGTMFTRLSPPAYAALRNAFR 354
>gi|356522504|ref|XP_003529886.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 473
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 87/271 (32%), Positives = 126/271 (46%), Gaps = 34/271 (12%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSG 139
+G YFT++G+GTP Y+ +DTGSD++W+ C C++C +++D +FDPSKS +
Sbjct: 127 SGEYFTRLGVGTPPKYLYMVLDTGSDVVWLQCKPCTKCYSQTD-----QIFDPSKSKSFA 181
Query: 140 EIACSDNFCRTTYNNRYPSCS-PGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTA 198
I C CR + P CS C+Y V+YGDGS T G F + + +A+
Sbjct: 182 GIPCYSPLCRRLDS---PGCSLKNNLCQYQVSYGDGSFTFGDFSTETLTFRRAA------ 232
Query: 199 PLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLD-- 256
V GCG+ G G+LG G+ S +Q N +F++CL
Sbjct: 233 --VPRVAIGCGHDNEGLF-----VGAAGLLGLGRGGLSFPTQTGTRFN--NKFSYCLTDR 283
Query: 257 --VVKGGGIFAIGDVVSPKVKTTPMVPNMP---HYNVILEEVEVGGNPLD-LPTSL--LG 308
K I VS + TP+V N Y V L + VGG P+ + S L
Sbjct: 284 TASAKPSSIVFGDSAVSRTARFTPLVKNPKLDTFYYVELLGISVGGAPVRGISASFFRLD 343
Query: 309 TGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
+ G IIDSGT++ L Y + FR
Sbjct: 344 STGNGGVIIDSGTSVTRLTRPAYVSLRDAFR 374
>gi|30682289|ref|NP_187876.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|11994412|dbj|BAB02414.1| chloroplast nucleoid DNA binding protein-like [Arabidopsis
thaliana]
gi|332641715|gb|AEE75236.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 461
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 82/309 (26%), Positives = 138/309 (44%), Gaps = 36/309 (11%)
Query: 56 DTRRHGRMMASIDLELG-----GNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVN 110
D +RH + + +G G+G T YFT++ +GTP ++ V VDTGS+L WVN
Sbjct: 74 DQKRHSLISRKRNSTVGVKMDLGSGIDYGTAQYFTEIRVGTPAKKFRVVVDTGSELTWVN 133
Query: 111 CAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYP--SC-SPGVRCEY 167
C +R +F +S + + C C+ N + +C +P C Y
Sbjct: 134 CRYRARGKDNRR------VFRADESKSFKTVGCLTQTCKVDLMNLFSLTTCPTPSTPCSY 187
Query: 168 VVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGI 227
Y DGS+ G F ++ I + +G + P + + GC + +G + DG+
Sbjct: 188 DYRYADGSAAQGVFAKETITVGLTNGRMARLPGH---LIGCSSSFTGQ----SFQGADGV 240
Query: 228 LGFGQANSSLLSQLAAAGNVRKEFAHCL-DVVKGGGI---FAIGDVVSPKV---KTTPM- 279
LG ++ S S A +F++CL D + + G S K +TTP+
Sbjct: 241 LGLAFSDFSFTS--TATSLYGAKFSYCLVDHLSNKNVSNYLIFGSSRSTKTAFRRTTPLD 298
Query: 280 ---VPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLS 336
+P P Y + + + +G + LD+P+ + GTI+DSGT+L L Y V++
Sbjct: 299 LTRIP--PFYAINVIGISLGYDMLDIPSQVWDATSGGGTILDSGTSLTLLADAAYKQVVT 356
Query: 337 QFRFWIASL 345
++ L
Sbjct: 357 GLARYLVEL 365
>gi|20197342|gb|AAC34482.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
Length = 353
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 86/283 (30%), Positives = 125/283 (44%), Gaps = 63/283 (22%)
Query: 86 KVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRC---PTKSDLGIKLTLFDPSKSSTSGEIA 142
++ +G P +Y VDTGSDL+W C C+ C PT +FDP KSS+ ++
Sbjct: 2 ELSIGNPAVKYSAIVDTGSDLIWTQCKPCTECFDQPTP--------IFDPEKSSSYSKVG 53
Query: 143 CSDNFC----RTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTA 198
CS C R+ N + CEY+ TYGD SST G + +
Sbjct: 54 CSSGLCNALPRSNCNEDKDA------CEYLYTYGDYSSTRGLLATETFTFEDENS----- 102
Query: 199 PLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVV 258
S + FGCG GD G S + G++G G+ SL+SQL +F++CL +
Sbjct: 103 --ISGIGFGCGVENEGD-GFSQGS---GLVGLGRGPLSLISQLK-----ETKFSYCLTSI 151
Query: 259 ---KGGGIFAIGDVVSPKV------------KTTPMV--PNMP-HYNVILEEVEVGGNPL 300
+ IG + S V KT ++ P+ P Y + L+ + VG L
Sbjct: 152 EDSEASSSLFIGSLASGIVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRL 211
Query: 301 DLPTSLL-----GTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
+ S GTG G IIDSGTT+ YL + ++ +F
Sbjct: 212 SVEKSTFELAEDGTG---GMIIDSGTTITYLEETAFKVLKEEF 251
>gi|159463556|ref|XP_001690008.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
gi|158283996|gb|EDP09746.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
Length = 547
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 73/217 (33%), Positives = 107/217 (49%), Gaps = 27/217 (12%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRC-PTKSDLGIKLTLFDPSKSSTSG 139
G Y+T + +GTP +DTGS L C+GC+RC P+K+ +F P SSTS
Sbjct: 79 GYYYTYLTIGTPGQTVSGILDTGSTLPAFPCSGCTRCGPSKTG------MFKPELSSTSS 132
Query: 140 EIACSDNFCRTTYNNRYPSCS-PGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTA 198
CSD C N SCS +C Y + Y +GSSTSG+ D++ A G+ A
Sbjct: 133 TFGCSDARCFCGAN----SCSCNNEQCGYSIRYLEGSSTSGFLAEDML----AVGDGGPA 184
Query: 199 PLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVV 258
++ +FGC +SG L S DG+ G G+ +SL QL G + F+ C
Sbjct: 185 ---ANFVFGCAQSESGLLYSQI---ADGVFGMGRTPASLYGQLVQQGVIDDAFSMCFGAP 238
Query: 259 KGGGIFAIGDVV----SPKVKTTPMVPNMPHYNVILE 291
+ G+ +G+V +P TP+V N +N+ +E
Sbjct: 239 R-EGVLLLGNVALPADAPAPVVTPVVGNTNKFNIQIE 274
>gi|37542277|gb|AAK81699.1| aspartyl proteinase [Oryza sativa]
Length = 411
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 85/281 (30%), Positives = 120/281 (42%), Gaps = 35/281 (12%)
Query: 65 ASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVN----CAGCSRCPTK 120
+++ LEL GN +P G +F + + P Y++ +DTGS L W+ C C++ P
Sbjct: 22 SAVVLELHGNVYP--IGHFFVTMNISDPAKPYFLDIDTGSTLTWLQCDYPCINCNKVPH- 78
Query: 121 SDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNN-RYP-SCSPGVRCEYVVTYGDGSSTS 178
L+ P + C++ C Y + R P C P +C Y + Y GSS
Sbjct: 79 -------GLYKPELKYA---VKCTEQRCADLYADLRKPMKCGPKNQCHYGIQYVGGSSI- 127
Query: 179 GYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLL 238
G + D L ++G T P +S+ FGCG Q G + V+GILG G+ +LL
Sbjct: 128 GVLIVDSFSLPASNG---TNP--TSIAFGCGYNQ-GKNNHNVPTPVNGILGLGRGKVTLL 181
Query: 239 SQLAAAGNVRKE-FAHCLDVVKGGGIFAIGDVVSPK--VKTTPMVPNMPHYNVILEEVEV 295
SQL + G + K HC+ KG G GD P V +PM HY+ +
Sbjct: 182 SQLKSQGVITKHVLGHCIS-SKGKGFLFFGDAKVPTSGVTWSPMNREHKHYSPRQGTLHF 240
Query: 296 GGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLS 336
N P S I DSG T Y Y LS
Sbjct: 241 NSNKQS-PIS----AAPMEVIFDSGATYTYFALQPYHATLS 276
>gi|242092878|ref|XP_002436929.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
gi|241915152|gb|EER88296.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
Length = 505
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 82/269 (30%), Positives = 117/269 (43%), Gaps = 37/269 (13%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCS-RCPTKSDLGIKLTLFDPSKSSTSGEI 141
+ VG G+P Y + +DTGSD+ W+ C CS C + D +FDP+KS+T +
Sbjct: 161 FVVTVGFGSPAQNYTLSIDTGSDVSWIQCLPCSGHCYKQHD-----PVFDPTKSATYSAV 215
Query: 142 ACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLN 201
C C CS C Y VTYGDGSST+G + + L+ + + P
Sbjct: 216 PCGHPQCAAAGGK----CSNSGTCLYKVTYGDGSSTAGVLSHETLSLS----STRDLP-- 265
Query: 202 SSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL---DVV 258
FGCG G+ G G+ SL SQ AA F++CL D
Sbjct: 266 -GFAFGCGQTNLGEFGGVDGLVGL-----GRGALSLPSQ--AAATFGATFSYCLPSYDTT 317
Query: 259 KG----GGIFAIGDVVSPKVKTTPMVPNMPH---YNVILEEVEVGGNPLDLPTSLLGTGD 311
G G V+ T M+ + Y V + +++GG L +P ++
Sbjct: 318 HGYLTMGSTTPAASNDDDDVQYTAMIQKEDYPSLYFVEVVSIDIGGYILPVPPTVF---T 374
Query: 312 ERGTIIDSGTTLAYLPPMLYDLVLSQFRF 340
GT+ DSGT L YLPP Y + +F+F
Sbjct: 375 RDGTLFDSGTILTYLPPEAYASLRDRFKF 403
>gi|297723019|ref|NP_001173873.1| Os04g0331600 [Oryza sativa Japonica Group]
gi|255675338|dbj|BAH92601.1| Os04g0331600, partial [Oryza sativa Japonica Group]
Length = 72
Score = 93.6 bits (231), Expect = 1e-16, Method: Composition-based stats.
Identities = 46/72 (63%), Positives = 58/72 (80%), Gaps = 1/72 (1%)
Query: 211 RQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVV 270
+Q+G L +S + A+DGI+GFG +N +LLSQLAAAG +K F+HCLD GGGIFAIG+VV
Sbjct: 1 QQTGSLNNS-ELAIDGIIGFGNSNQTLLSQLAAAGKTKKIFSHCLDSTNGGGIFAIGEVV 59
Query: 271 SPKVKTTPMVPN 282
PKVKTTP+V N
Sbjct: 60 EPKVKTTPIVKN 71
>gi|356525748|ref|XP_003531485.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 436
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 81/267 (30%), Positives = 114/267 (42%), Gaps = 28/267 (10%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGE 140
G Y + +GTP E DT SDL+WV C+ C C + LF+P KSST
Sbjct: 88 GEYLMRFYIGTPPVERLAIADTASDLIWVQCSPCETCFPQDT-----PLFEPHKSSTFAN 142
Query: 141 IACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPL 200
++C C T +N Y G C Y TYGDGSST G + I +
Sbjct: 143 LSCDSQPC--TSSNIYYCPLVGNLCLYTNTYGDGSSTKGVLCTESIHFGSQTVTFP---- 196
Query: 201 NSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKG 260
IFGCG+ + D V GI+G G SL+SQL + +F++CL
Sbjct: 197 --KTIFGCGS--NNDFMHQISNKVTGIVGLGAGPLSLVSQL--GDQIGHKFSYCLLPFTS 250
Query: 261 GGIFAIG-----DVVSPKVKTTPMV--PNMPHYNVI-LEEVEVGGNPLDLPTSLLGTGDE 312
+ + V +TP++ P+ P Y + L + +G L + T+ G+
Sbjct: 251 TSTIKLKFGNDTTITGNGVVSTPLIIDPHYPSYYFLHLVGITIGQKMLQVRTTDHTNGN- 309
Query: 313 RGTIIDSGTTLAYLPPMLYDLVLSQFR 339
IID GT L YL Y ++ R
Sbjct: 310 --IIIDLGTVLTYLEVNFYHNFVTLLR 334
>gi|357490961|ref|XP_003615768.1| F-box protein [Medicago truncatula]
gi|355517103|gb|AES98726.1| F-box protein [Medicago truncatula]
Length = 688
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 70/198 (35%), Positives = 93/198 (46%), Gaps = 32/198 (16%)
Query: 114 CSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGD 173
C+ CP S L I+ SG I SD C S +C Y YGD
Sbjct: 360 CNGCPQTSRLQIE---------CNSG-IQLSDATCS----------SQTKQCSYTFQYGD 399
Query: 174 GSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFG-CGNRQSGDLGSSTDAAVDGILGFGQ 232
GS TSGY+V D + L+ +S G C N QSGDL + +D AVDGI GF Q
Sbjct: 400 GSGTSGYYVSDTMHLDTIFEGSDYKFFSSCSFLGDCSNEQSGDL-TKSDRAVDGIFGFWQ 458
Query: 233 ANSSLLSQLAAAGNVRKEFAHCLD-VVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILE 291
S++SQL++ G F+HCL GGGI +G++V P + TP+VP+
Sbjct: 459 QQMSVISQLSSQGIASGVFSHCLRGDSSGGGIPVLGEIVEPNIVYTPIVPS--------- 509
Query: 292 EVEVGGNPLDLPTSLLGT 309
+ V G L + S+ T
Sbjct: 510 RISVNGQALQVDPSVCAT 527
>gi|145356007|ref|XP_001422234.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144582474|gb|ABP00551.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 488
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 69/247 (27%), Positives = 118/247 (47%), Gaps = 35/247 (14%)
Query: 96 YYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSD----NFCRTT 151
Y + VDTGS +V C GC+RC + +D +S + C + C T
Sbjct: 51 YDLIVDTGSARTYVPCKGCARCGEHAH-----GYYDYDRSMEFERLDCGEASDATLCEET 105
Query: 152 YNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNR 211
+C RC YVV+Y +GSS+ GY VRD ++L + + L++ + FGC
Sbjct: 106 MKG---TCQSDGRCSYVVSYAEGSSSRGYVVRDRVRLGEGT-------LSAMLAFGC--- 152
Query: 212 QSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLD-VVKGGGIFAIGD-- 268
+ + + + DG+ GFG+ +++ +QLA+AG + F+ C++ GG+ +G
Sbjct: 153 EEAETNAIYEQKADGLFGFGRGTATVHAQLASAGLIENVFSFCVEGFGANGGVLTLGRFD 212
Query: 269 --VVSPKVKTTPMV--PNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLA 324
+P + TP+V P P ++ V + L SL+ + T +DSGTT
Sbjct: 213 FGADAPALARTPLVADPANPAFH------NVRTSSWKLGDSLIEHLNSYTTTLDSGTTFT 266
Query: 325 YLPPMLY 331
++P ++
Sbjct: 267 FVPRSVW 273
>gi|195625122|gb|ACG34391.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
Length = 471
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 83/259 (32%), Positives = 118/259 (45%), Gaps = 32/259 (12%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGE 140
G Y T++GLGTP+ Y + VDTGS L W+ C+ C +G LFDP SST
Sbjct: 132 GNYVTQLGLGTPSTSYAMVVDTGSSLTWLQCSPC-VVSCHRQVG---PLFDPRASSTYAS 187
Query: 141 IACSDNFC----RTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLK 196
+ CS + C T N +CS C Y +YGD S + G D + G+ +
Sbjct: 188 VRCSASQCDELQAATLNPS--ACSASNVCIYQASYGDSSFSVGSLSTDTVSF----GSTR 241
Query: 197 TAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLD 256
S +GCG G G S G++G + SLL QLA ++ F++CL
Sbjct: 242 ----YPSFYYGCGQDNEGLFGRSA-----GLIGLARNKLSLLYQLAP--SLGYSFSYCLP 290
Query: 257 VVKGGGIFAIGDVVSPKVKT-TPMVP---NMPHYNVILEEVEVGGNPLDLPTSLLGTGDE 312
G +IG + + TPM + Y + L + VGG+PL + S +
Sbjct: 291 TAASTGYLSIGPYNTGHYYSYTPMASSSLDASLYFITLSGMSVGGSPLAVSPSEYSS--- 347
Query: 313 RGTIIDSGTTLAYLPPMLY 331
TIIDSGT + LP ++
Sbjct: 348 LPTIIDSGTVITRLPTAVH 366
>gi|50508275|dbj|BAD32124.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
Length = 451
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 84/284 (29%), Positives = 131/284 (46%), Gaps = 43/284 (15%)
Query: 79 ATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTS 138
+ G Y + +GTP + V DTGS L+W CA C+ C + F P+ SST
Sbjct: 86 SAGAYNMNLSIGTPPVTFSVLADTGSSLIWTQCAPCTECAAR-----PAPPFQPASSSTF 140
Query: 139 GEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTA 198
++ C+ + C+ + Y +C+ C Y YG G T+GY + + + AS
Sbjct: 141 SKLPCASSLCQ-FLTSPYLTCN-ATGCVYYYPYGMG-FTAGYLATETLHVGGAS------ 191
Query: 199 PLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVV 258
V FGC +G+S+ GI+G G++ SL+SQ+ F++CL
Sbjct: 192 --FPGVAFGCSTENG--VGNSS----SGIVGLGRSPLSLVSQVGVG-----RFSYCLRSD 238
Query: 259 KGGG----IF-AIGDVVSPKVKTTPMV--PNMP---HYNVILEEVEVGGNPLDLPTSLL- 307
G +F ++ V V++TP++ P MP +Y V L + VG L + ++
Sbjct: 239 ADAGDSPILFGSLAKVTGGNVQSTPLLENPEMPSSSYYYVNLTGITVGATDLPVTSTTFG 298
Query: 308 -----GTGDERGTIIDSGTTLAYLPPMLYDLVLSQFRFWIASLD 346
G G GTI+DSGTTL YL Y +V F +A+ +
Sbjct: 299 FTRGAGAGLVGGTIVDSGTTLTYLVKEGYAMVKRAFLSQMATAN 342
>gi|242058537|ref|XP_002458414.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
gi|241930389|gb|EES03534.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
Length = 448
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 84/282 (29%), Positives = 120/282 (42%), Gaps = 41/282 (14%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPS 133
+G P +G YF + +G P V +DTGSDL+W+ C C C + L+DP
Sbjct: 79 SGVPFDSGEYFAVINVGDPPTRALVVIDTGSDLIWLQCVPCRHCYRQV-----TPLYDPR 133
Query: 134 KSSTSGEIACSDNFCRTTYNNRYPSCSPGV-RCEYVVTYGDGSSTSGYFVRDIIQLNQAS 192
SST I C+ CR RYP C C Y+V YGDGS++SG D + +
Sbjct: 134 SSSTHRRIPCASPRCRDVL--RYPGCDARTGGCVYMVVYGDGSASSGDLATDRLVFPDDT 191
Query: 193 GNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAA-GNVRKEF 251
+V GCG+ G L S+ G+LG G+ S +QLA A G+V F
Sbjct: 192 H-------VHNVTLGCGHDNVGLLESAA-----GLLGVGRGQLSFPTQLAPAYGHV---F 236
Query: 252 AHCL-----DVVKGGGIFAIGDVVSP------KVKTTPMVPNMPHYNVILEEVEVGGNPL 300
++CL G G P ++T P P++ Y V + VGG +
Sbjct: 237 SYCLGDRLSRAQNGSSYLVFGRTPEPPSTAFTPLRTNPRRPSL--YYVDMVGFSVGGERV 294
Query: 301 ----DLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
+ +L G ++DSGT ++ Y V F
Sbjct: 295 TGFSNASLALNPATGRGGIVVDSGTAISRFARDAYAAVRDAF 336
>gi|242056497|ref|XP_002457394.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
gi|241929369|gb|EES02514.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
Length = 509
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 80/271 (29%), Positives = 122/271 (45%), Gaps = 39/271 (14%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSG 139
+G YF++VG+G+P E Y+ +DTGSD+ WV C C+ C +SD +FDPS S++
Sbjct: 166 SGEYFSRVGIGSPARELYMVLDTGSDVTWVQCQPCADCYQQSD-----PVFDPSLSASYA 220
Query: 140 EIACSDNFCRTTYNNRYPSCSPGV-RCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTA 198
++C CR + +C C Y V YGDGS T G F + + L ++
Sbjct: 221 AVSCDSPRCR---DLDTAACRNATGACLYEVAYGDGSYTVGDFATETLTLGDST------ 271
Query: 199 PLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL--- 255
P+ ++V GCG+ G + G S SQ++A+ F++CL
Sbjct: 272 PV-TNVAIGCGHDNEGLFVGAAGLLALGGGPL-----SFPSQISAS-----TFSYCLVDR 320
Query: 256 -----DVVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLL--- 307
++ G A D V+ + +P Y V L + VGG L +P+S
Sbjct: 321 DSPAASTLQFGADGAEADTVTAPLVRSPRTGTF--YYVALSGISVGGQALSIPSSAFAMD 378
Query: 308 GTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
T G I+DSGT + L Y + F
Sbjct: 379 ATSGSGGVIVDSGTAVTRLQSSAYAALRDAF 409
>gi|414881704|tpg|DAA58835.1| TPA: hypothetical protein ZEAMMB73_701358 [Zea mays]
Length = 485
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 92/289 (31%), Positives = 130/289 (44%), Gaps = 47/289 (16%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPS 133
+G +G YFTK+G+GTP + + +DTGSD++WV CA C RC +S +FDP
Sbjct: 120 SGLAQGSGEYFTKIGVGTPATQALMVLDTGSDVVWVQCAPCRRCYEQSG-----PVFDPR 174
Query: 134 KSSTSGEIACSDNFCRTTYNNRYPSCSPGVR---CEYVVTYGDGSSTSGYFVRDIIQLNQ 190
+SS+ G + C CR R S +R C Y V YGDGS T+G FV + +
Sbjct: 175 RSSSYGAVGCGAALCR-----RLDSGGCDLRRGACMYQVAYGDGSVTAGDFVTETLTF-- 227
Query: 191 ASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKE 250
+G + A V GCG+ G A G+LG G+ S +Q++ +
Sbjct: 228 -AGGARVA----RVALGCGHDNEGLF-----VAAAGLLGLGRGGLSFPTQISR--RYGRS 275
Query: 251 FAHCL--DVVKGGGI-----------FAIGDVVSPKVKTTPMV--PNMP-HYNVILEEVE 294
F++CL G G F G V + TPMV P M Y V L +
Sbjct: 276 FSYCLVDRTSSGAGAAPGSHRSSTVSFGAGSVGASSASFTPMVRNPRMETFYYVQLVGIS 335
Query: 295 VGGNPL----DLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
VGG + + L + G I+DSGT++ L Y + FR
Sbjct: 336 VGGARVPGVAESDLRLDPSTGRGGVIVDSGTSVTRLARASYSALRDAFR 384
>gi|115448353|ref|NP_001047956.1| Os02g0720900 [Oryza sativa Japonica Group]
gi|45735844|dbj|BAD12879.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735970|dbj|BAD12999.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|113537487|dbj|BAF09870.1| Os02g0720900 [Oryza sativa Japonica Group]
gi|125540930|gb|EAY87325.1| hypothetical protein OsI_08729 [Oryza sativa Indica Group]
gi|215692622|dbj|BAG88042.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 458
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 86/276 (31%), Positives = 121/276 (43%), Gaps = 36/276 (13%)
Query: 64 MASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGC-SRCPTKSD 122
+AS+ L G + G Y T++GLGTP +Y + VDTGS L W+ C+ C C +S
Sbjct: 106 LASVPLSPGAS---VGVGNYVTRMGLGTPATQYVMVVDTGSSLTWLQCSPCLVSCHRQSG 162
Query: 123 LGIKLTLFDPSKSSTSGEIACSDNFC----RTTYNNRYPSCSPGVRCEYVVTYGDGSSTS 178
+F+P SST + CS C T N +CS C Y +YGD S +
Sbjct: 163 -----PVFNPKSSSTYASVGCSAQQCSDLPSATLNPS--ACSSSNVCIYQASYGDSSFSV 215
Query: 179 GYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLL 238
GY +D + S + +GCG G G S G++G + SLL
Sbjct: 216 GYLSKDTVSFGSTS--------LPNFYYGCGQDNEGLFGRSA-----GLIGLARNKLSLL 262
Query: 239 SQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSPKVKTTPMVPNM---PHYNVILEEVEV 295
QLA ++ F +CL G ++G + TPMV + Y + L + V
Sbjct: 263 YQLAP--SLGYSFTYCLPSSSSSGYLSLGSYNPGQYSYTPMVSSSLDDSLYFIKLSGMTV 320
Query: 296 GGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLY 331
GNPL + TIIDSGT + LP +Y
Sbjct: 321 AGNPL---SVSSSAYSSLPTIIDSGTVITRLPTSVY 353
>gi|326513755|dbj|BAJ87896.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 442
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 82/272 (30%), Positives = 122/272 (44%), Gaps = 39/272 (14%)
Query: 83 YFTKVGLGTPTDEYYV-QVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEI 141
Y + +G P + V +DTGSD++W C C+ C T+ L FD + S+T +
Sbjct: 92 YLIHLSIGAPRSQPVVLTLDTGSDVVWTQCEPCAECFTQ-----PLPRFDTAASNTVRSV 146
Query: 142 ACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLK-TAPL 200
ACSD C ++ + G C YV YGDGS + G+F+RD + G K T P
Sbjct: 147 ACSDPLCNA--HSEHGCFLHG--CTYVSGYGDGSLSFGHFLRDSFTFDDGKGGGKVTVP- 201
Query: 201 NSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDV--- 257
+ FGCG +G + GI GFG+ SL SQL ++F++C
Sbjct: 202 --DIGFGCGMYNAGRFLQTE----TGIAGFGRGPLSLPSQLKV-----RQFSYCFTTRFE 250
Query: 258 VKGGGIF----------AIGDVVS-PKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSL 306
K +F A G ++S P V++ P + HY + + V VG L +P
Sbjct: 251 AKSSPVFLGGAGDLKAHATGPILSTPFVRSLPPGTDNSHYVLSFKGVTVGKTRLPVPE-- 308
Query: 307 LGTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
+ T IDSGT + P ++ + S F
Sbjct: 309 IKADGSGATFIDSGTDITTFPDAVFRQLKSAF 340
>gi|357153697|ref|XP_003576537.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 474
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 84/281 (29%), Positives = 124/281 (44%), Gaps = 47/281 (16%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y VGLG E V VDT S+L WV C C C + D LFDPS S + +
Sbjct: 120 YVATVGLGAA--EATVVVDTASELTWVQCQPCESCHDQQD-----PLFDPSSSPSYAAVP 172
Query: 143 CSDNFCRTTYNNRYPSCSPGV-------RCEYVVTYGDGSSTSGYFVRDIIQLNQASGNL 195
C+ + C SP C Y ++Y DGS + G RD ++L A ++
Sbjct: 173 CNSSSCDALRVAMAAGTSPCADDNEQQPACSYALSYRDGSYSRGVLARDKLRL--AGQDI 230
Query: 196 KTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQ-LAAAGNVRKEFAHC 254
+ +FGCG G + G++G G+++ SL+SQ + G V F++C
Sbjct: 231 E------GFVFGCGTSNQG----APFGGTSGLMGLGRSHVSLVSQTMDQFGGV---FSYC 277
Query: 255 LDVVKGG--GIFAIGDVVSPKVKTTPMVPNM----------PHYNVILEEVEVGGNPLDL 302
L + + G G +GD S +TP+V P Y + L + VGG ++
Sbjct: 278 LPMRESGSSGSLVLGDDSSAYRNSTPIVYTAMVSDSGPLQGPFYFLNLTGITVGGQEVES 337
Query: 303 PTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFRFWIA 343
P G IIDSGT + L P +Y+ V ++F +A
Sbjct: 338 PWFSAGR-----VIIDSGTIITTLVPSVYNAVRAEFLSQLA 373
>gi|297742733|emb|CBI35367.3| unnamed protein product [Vitis vinifera]
Length = 521
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 87/349 (24%), Positives = 142/349 (40%), Gaps = 51/349 (14%)
Query: 11 VVTVAVVHQWAVGGGGVMGNFVFEVENKFKAGGERERTLSALKQHDTRRHGRMMASIDLE 70
++ + V GG M V + F + L + D +R ++ +
Sbjct: 117 IIPLEVSEDHEEGGEKWMMKVVHRDQLSFGNSDDHRHRLDGRLKRDAKRVASLIRRLSSG 176
Query: 71 LGGN------------GHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCP 118
GG+ G +G YF ++G+G+P Y+ +D+GSD++WV C C++C
Sbjct: 177 GGGSYRVDDFGTDVISGMEQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCTQCY 236
Query: 119 TKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTS 178
+SD +FDP+ S++ ++CS + C N C G RC Y V+YGDGS T
Sbjct: 237 HQSD-----PVFDPADSASFTGVSCSSSVCDRLEN---AGCHAG-RCRYEVSYGDGSYTK 287
Query: 179 GYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLL 238
G + + + + SV GCG+R G + G + S +
Sbjct: 288 GTLALETLTFGRT--------MVRSVAIGCGHRNRGMFVGAAGLLGL-----GGGSMSFV 334
Query: 239 SQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGG- 297
QL G F++CL + P V+ P P+ Y + L + VGG
Sbjct: 335 GQL--GGQTGGAFSYCL----------VSAAWVPLVR-NPRAPSF--YYIGLAGLGVGGI 379
Query: 298 -NPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFRFWIASL 345
P+ L + G ++D+GT + LP + Y F A+L
Sbjct: 380 RVPISEEVFRLTELGDGGVVMDTGTAVTRLPTLAYQAFRDAFLAQTANL 428
>gi|224140735|ref|XP_002323734.1| predicted protein [Populus trichocarpa]
gi|222866736|gb|EEF03867.1| predicted protein [Populus trichocarpa]
Length = 184
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 56/165 (33%), Positives = 80/165 (48%), Gaps = 13/165 (7%)
Query: 49 LSALKQHDTRRHGRMMAS-----IDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTG 103
L LK D RH R++ +D + G+ P LYFTKV LG+P E+ VQ++TG
Sbjct: 27 LHQLKARDRLRHARLLQGFVGGVVDFSVQGSSDPYLVELYFTKVKLGSPPREFNVQINTG 86
Query: 104 SDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGV 163
SD+LWV C++ P S + + P+ G CS+ C + CS
Sbjct: 87 SDVLWVCYNSCNKLPAFSSISLI-----PTAHQLLG--GCSNPICTSAVQTTATQCSSQT 139
Query: 164 -RCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFG 207
+C Y YGDGS TSGY+V D + + G A + ++FG
Sbjct: 140 DQCSYTSQYGDGSGTSGYYVSDTLYFDAILGQSLIANSSVLIVFG 184
>gi|15242803|ref|NP_198319.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75130158|sp|Q6XBF8.1|CDR1_ARATH RecName: Full=Aspartic proteinase CDR1; AltName: Full=Protein
CONSTITUTIVE DISEASE RESISTANCE 1; Flags: Precursor
gi|37935737|gb|AAP72988.1| CDR1 [Arabidopsis thaliana]
gi|91806924|gb|ABE66189.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|109946613|gb|ABG48485.1| At5g33340 [Arabidopsis thaliana]
gi|332006513|gb|AED93896.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 437
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 89/289 (30%), Positives = 122/289 (42%), Gaps = 37/289 (12%)
Query: 55 HDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGC 114
H T + IDL S +G Y V +GTP DTGSDLLW CA C
Sbjct: 69 HFTEKDNTPQPQIDLT-------SNSGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPC 121
Query: 115 SRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVR-CEYVVTYGD 173
C T+ D LFDP SST +++CS + C N SCS C Y ++YGD
Sbjct: 122 DDCYTQVD-----PLFDPKTSSTYKDVSCSSSQCTALENQA--SCSTNDNTCSYSLSYGD 174
Query: 174 GSSTSGYFVRDIIQLNQASGNLKTAPLN-SSVIFGCGNRQSGDLGSSTDAAVDGILGFGQ 232
S T G D + L G+ T P+ ++I GCG+ +G V G
Sbjct: 175 NSYTKGNIAVDTLTL----GSSDTRPMQLKNIIIGCGHNNAGTFNKKGSGIVGL----GG 226
Query: 233 ANSSLLSQLAAAGNVRKEFAHCLDVVKGGGI------FAIGDVVS-PKVKTTPMVPNMPH 285
SL+ QL ++ +F++CL + F +VS V +TP++
Sbjct: 227 GPVSLIKQL--GDSIDGKFSYCLVPLTSKKDQTSKINFGTNAIVSGSGVVSTPLIAKASQ 284
Query: 286 ---YNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLY 331
Y + L+ + VG + + E IIDSGTTL LP Y
Sbjct: 285 ETFYYLTLKSISVGSKQIQY-SGSDSESSEGNIIIDSGTTLTLLPTEFY 332
>gi|224130548|ref|XP_002320868.1| predicted protein [Populus trichocarpa]
gi|222861641|gb|EEE99183.1| predicted protein [Populus trichocarpa]
Length = 488
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 82/269 (30%), Positives = 121/269 (44%), Gaps = 34/269 (12%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPS 133
+G +G YFT++G+GTP Y+ +DTGSD++W+ CA C +C +++D +FDP+
Sbjct: 136 SGLAQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWIQCAPCIKCYSQTD-----PVFDPT 190
Query: 134 KSSTSGEIACSDNFCRTTYNNRYPSCSPGVR-CEYVVTYGDGSSTSGYFVRDIIQLNQAS 192
KS + I C CR YP CS + C Y V+YGDGS T G F + +
Sbjct: 191 KSRSFANIPCGSPLCRRL---DYPGCSTKKQICLYQVSYGDGSFTVGEFSTETLTFRGTR 247
Query: 193 GNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFA 252
V+ GCG+ G + G+ S SQ+ N +F+
Sbjct: 248 VG--------RVVLGCGHDNEGLFVGAAGLLGL-----GRGRLSFPSQIGRRFN--SKFS 292
Query: 253 HCL---DVVKGGGIFAIGD-VVSPKVKTTPMVPNMP---HYNVILEEVEVGGNPLD-LPT 304
+CL GD +S + TP++ N Y V L + VGG + +
Sbjct: 293 YCLGDRSASSRPSSIVFGDSAISRTTRFTPLLSNPKLDTFYYVELLGISVGGTRVSGISA 352
Query: 305 SL--LGTGDERGTIIDSGTTLAYLPPMLY 331
SL L + G IIDSGT++ L Y
Sbjct: 353 SLFKLDSTGNGGVIIDSGTSVTRLTRAAY 381
>gi|358345197|ref|XP_003636668.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355502603|gb|AES83806.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 294
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 61/176 (34%), Positives = 90/176 (51%), Gaps = 21/176 (11%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y ++ +GTP + Y Q DTGSDL+W+ C C+ C + + +FD SST IA
Sbjct: 59 YLMELSIGTPPVKIYAQADTGSDLIWLQCIPCTNCYKQLN-----PMFDSQSSSTFSNIA 113
Query: 143 CSDNFCRTTYNNRYPSCSPG-VRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLN 201
C C Y+ SCSP + C+Y +Y DGS T G ++ + L +G
Sbjct: 114 CGSESCSKLYST---SCSPDQINCKYNYSYVDGSETQGVLAQETLTLTSTTGEPVAF--- 167
Query: 202 SSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAA--GNVRKEFAHCL 255
VIFGCG+ +G D + GI+G G+ SL+SQ+ ++ GN+ F+ CL
Sbjct: 168 KGVIFGCGHNNNGAF---NDKEM-GIIGLGRGPLSLVSQIGSSLGGNM---FSQCL 216
>gi|116831531|gb|ABK28718.1| unknown [Arabidopsis thaliana]
Length = 438
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 89/289 (30%), Positives = 122/289 (42%), Gaps = 37/289 (12%)
Query: 55 HDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGC 114
H T + IDL S +G Y V +GTP DTGSDLLW CA C
Sbjct: 69 HFTEKDNTPQPQIDLT-------SNSGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPC 121
Query: 115 SRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVR-CEYVVTYGD 173
C T+ D LFDP SST +++CS + C N SCS C Y ++YGD
Sbjct: 122 DDCYTQVD-----PLFDPKTSSTYKDVSCSSSQCTALENQA--SCSTNDNTCSYSLSYGD 174
Query: 174 GSSTSGYFVRDIIQLNQASGNLKTAPLN-SSVIFGCGNRQSGDLGSSTDAAVDGILGFGQ 232
S T G D + L G+ T P+ ++I GCG+ +G V G
Sbjct: 175 NSYTKGNIAVDTLTL----GSSDTRPMQLKNIIIGCGHNNAGTFNKKGSGIVGL----GG 226
Query: 233 ANSSLLSQLAAAGNVRKEFAHCLDVVKGGGI------FAIGDVVSPK-VKTTPMVPNMPH 285
SL+ QL ++ +F++CL + F +VS V +TP++
Sbjct: 227 GPVSLIKQL--GDSIDGKFSYCLVPLTSKKDQTSKINFGTNAIVSGSGVVSTPLIAKASQ 284
Query: 286 ---YNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLY 331
Y + L+ + VG + + E IIDSGTTL LP Y
Sbjct: 285 ETFYYLTLKSISVGSKQIQY-SGSDSESSEGNIIIDSGTTLTLLPTEFY 332
>gi|449458736|ref|XP_004147103.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
gi|449518669|ref|XP_004166359.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 482
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 78/269 (28%), Positives = 119/269 (44%), Gaps = 38/269 (14%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPS 133
+G + +G YF ++G+G+P Y+ +D+GSD++WV C CSRC +SD +FDP+
Sbjct: 134 SGMEAGSGEYFVRIGVGSPPRNQYMVIDSGSDIVWVQCKPCSRCYQQSD-----PVFDPA 188
Query: 134 KSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASG 193
SS+ ++C + C N C+ G RC Y V+YGDGS T G + + + Q
Sbjct: 189 DSSSFAGVSCGSDVCDRLENT---GCNAG-RCRYEVSYGDGSYTKGTLALETLTVGQV-- 242
Query: 194 NLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAH 253
+ V GCG+ G + G + S + QL G F++
Sbjct: 243 ------MIRDVAIGCGHTNQGMFIGAAGLLGL-----GGGSMSFIGQL--GGQTGGAFSY 289
Query: 254 CLDVVKG---GGIFAIGDVVSPKVKT------TPMVPNMPHYNVILEEVEVGGNPLDLP- 303
CL V +G G G P T P P+ Y + L + VGG + +P
Sbjct: 290 CL-VSRGTGSTGALEFGRGALPVGATWISLIRNPRAPSF--YYIGLAGIGVGGVRVSVPE 346
Query: 304 -TSLLGTGDERGTIIDSGTTLAYLPPMLY 331
T L G ++D+GT + P Y
Sbjct: 347 ETFQLTEYGTNGVVMDTGTAVTRFPTAAY 375
>gi|147809812|emb|CAN71447.1| hypothetical protein VITISV_040904 [Vitis vinifera]
Length = 988
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 80/263 (30%), Positives = 114/263 (43%), Gaps = 49/263 (18%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGE 140
G + V GTP ++ + +DTGS + W C C C S FD SST
Sbjct: 125 GNFLVDVAFGTPPQKFKLILDTGSSITWTQCKACVHCLKDSHRH-----FDSLASSTYSF 179
Query: 141 IACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPL 200
+C + TYN +TYGD S++ G + D + L+ + +
Sbjct: 180 GSCIPSTVGNTYN---------------MTYGDKSTSVGNYGCDTM-------TLEPSDV 217
Query: 201 NSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKG 260
FGCG GD GS DG+LG GQ S +SQ A+ +K F++CL
Sbjct: 218 FQKFQFGCGRNNEGDFGS----GADGMLGLGQGQLSTVSQTASK--FKKVFSYCLPEENS 271
Query: 261 GGIFAIGDVV---SPKVKTTPMVPNMP---------HYNVILEEVEVGGNPLDLPTSLLG 308
G G+ S +K T +V N P +Y V L ++ VG L++P+S+
Sbjct: 272 IGSLLFGEKATSQSSSLKFTSLV-NGPGTSGLEESGYYFVKLLDISVGNKRLNIPSSVFA 330
Query: 309 TGDERGTIIDSGTTLAYLPPMLY 331
+ GTIIDSGT + LP Y
Sbjct: 331 SP---GTIIDSGTVITRLPQRAY 350
>gi|255558640|ref|XP_002520345.1| nucellin, putative [Ricinus communis]
gi|223540564|gb|EEF42131.1| nucellin, putative [Ricinus communis]
Length = 424
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 89/310 (28%), Positives = 127/310 (40%), Gaps = 37/310 (11%)
Query: 36 ENKFKAGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDE 95
E F A +R LK+ + +H +S+ L + GN +P G Y + +G P
Sbjct: 28 EGSFSAASQR----CTLKK--STQHSCFGSSLVLPVFGNVYP--LGYYSVSLYIGNPPKL 79
Query: 96 YYVQVDTGSDLLWVNC-AGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNN 154
+ + +DTGSDL WV C A C+ C L+ P + ++C D C N+
Sbjct: 80 FELDIDTGSDLTWVQCDAPCTGCTKPLH-----HLYKPRNNL----LSCIDPLCSAVQNS 130
Query: 155 RYPSCSPGV-RCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQS 213
C +C+Y + Y D S+ G V D L +G+ L + FGCG Q
Sbjct: 131 GTYQCQSATDQCDYEIQYADEGSSLGVLVTDYFPLRLMNGSF----LRPKMTFGCGYDQK 186
Query: 214 GDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSPK 273
G G+LG G +S++SQL A G + HCL KGGG G P
Sbjct: 187 SP-GPVAPPPTTGVLGLGNGKTSIISQLQALGVMGNVIGHCLS-RKGGGFLFFGQDPVPS 244
Query: 274 --VKTTPMVPNM--PHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPM 329
+ PM +Y E+ GG P GT E I DSG++ Y
Sbjct: 245 FGISWAPMSQKSLDKYYASGPAELLYGGKP-------TGTKAEE-FIFDSGSSYTYFNAQ 296
Query: 330 LYDLVLSQFR 339
+Y L+ R
Sbjct: 297 VYQSTLNLIR 306
>gi|125524353|gb|EAY72467.1| hypothetical protein OsI_00323 [Oryza sativa Indica Group]
Length = 500
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 84/268 (31%), Positives = 122/268 (45%), Gaps = 36/268 (13%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSG 139
+G YF++VG+G+P + Y+ +DTGSD+ WV C C+ C +SD +FDPS S++
Sbjct: 160 SGEYFSRVGVGSPARQLYMVLDTGSDVTWVQCQPCADCYQQSD-----PVFDPSLSTSYA 214
Query: 140 EIACSDNFCRTTYNNRYPSCSPGV-RCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTA 198
+AC + C ++ +C C Y V YGDGS T G F + + L +A
Sbjct: 215 SVACDNPRC---HDLDAAACRNSTGACLYEVAYGDGSYTVGDFATETLTLG------DSA 265
Query: 199 PLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL--D 256
P+ SSV GCG+ G + G S SQ++A F++CL
Sbjct: 266 PV-SSVAIGCGHDNEGLFVGAAGLLALGGGPL-----SFPSQISA-----TTFSYCLVDR 314
Query: 257 VVKGGGIFAIGDVVSPKVKTTPMVPN---MPHYNVILEEVEVGGNPLDLPTSLL---GTG 310
GD +V T P++ + Y V L + VGG L +P S GTG
Sbjct: 315 DSPSSSTLQFGDAADAEV-TAPLIRSPRTSTFYYVGLSGISVGGQILSIPPSAFAMDGTG 373
Query: 311 DERGTIIDSGTTLAYLPPMLYDLVLSQF 338
G I+DSGT + L Y + F
Sbjct: 374 -AGGVIVDSGTAVTRLQSSAYAALRDAF 400
>gi|297819828|ref|XP_002877797.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297323635|gb|EFH54056.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 530
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 97/347 (27%), Positives = 148/347 (42%), Gaps = 43/347 (12%)
Query: 20 WAVGGGGVMGNFVFEVENKFKAGGERE-------------RTLSALKQHDTRRHGRMMAS 66
W + G F FEV + F ++ L Q D GR +AS
Sbjct: 19 WGLERCEASGKFSFEVHHMFSDRVKQTLGLDDLVPEKGSLEYFKVLAQRDRLIRGRGLAS 78
Query: 67 IDLE-----LGGNGHPSATGL---YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCP 118
+ E + GN S L ++ V +GTP + V +DTGS+L W+ C S C
Sbjct: 79 NNEETPITFMRGNRTVSIDFLGFLHYANVSVGTPATWFLVALDTGSNLFWLPCNCGSTCI 138
Query: 119 TK-SDLGIK----LTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTY-G 172
D+G+ L L+ P+ SSTS I C+D+ C + SP C Y + Y
Sbjct: 139 RDLKDIGLSQSRPLNLYSPNTSSTSSSIRCNDDRCFGSSQ----CSSPASSCPYQIQYLS 194
Query: 173 DGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQ 232
+ T+G D++ L +LK P+ +++ GCG Q+G L SS AA++G+LG G
Sbjct: 195 KDTFTTGTLFEDVLHLVTEDVDLK--PVKANITLGCGRNQTGFLQSS--AAINGLLGLGM 250
Query: 233 ANSSLLSQLAAAGNVRKEFAHCL-DVVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILE 291
+ S+ S LA A F+ C +++ G + GD TP++P P +
Sbjct: 251 KDYSVPSILAKAKITANSFSMCFGNIIDVIGRISFGDKGYTDQMETPLLPTEPSPTYAVN 310
Query: 292 EVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
EV + LL + D+GT+ +L Y L+ F
Sbjct: 311 VTEVSVGGDVVGVQLLA-------LFDTGTSFTHLLEPEYGLITKAF 350
>gi|296085498|emb|CBI29230.3| unnamed protein product [Vitis vinifera]
Length = 542
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 81/262 (30%), Positives = 114/262 (43%), Gaps = 42/262 (16%)
Query: 77 PSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSS 136
PSA G Y + +GTP VDTGSDL W C C+ C + + LFDP SS
Sbjct: 87 PSA-GEYLMNLYIGTPPVPVIAIVDTGSDLTWTQCRPCTHCYKQV-----VPLFDPKNSS 140
Query: 137 TSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLK 196
T + +C +FC +R SCS +C + +Y DGS T G + + ++ +G
Sbjct: 141 TYRDSSCGTSFCLALGKDR--SCSKEKKCTFRYSYADGSFTGGNLASETLTVDSTAGKPV 198
Query: 197 TAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLD 256
+ P FGCG+ G D + GI+G G SL+SQL + N F++CL
Sbjct: 199 SFP---GFAFGCGHSSGGIF----DKSSSGIVGLGGGELSLISQLKSTIN--GLFSYCLL 249
Query: 257 VVKGGGIF-------AIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGT 309
V A G V +TP+ +P Y ++ EV
Sbjct: 250 PVSTDSSISSRINFGASGRVSGYGTVSTPL--RLP-YKGYSKKTEV-------------- 292
Query: 310 GDERGTIIDSGTTLAYLPPMLY 331
+E I+DSGTT +LP Y
Sbjct: 293 -EEGNIIVDSGTTYTFLPQEFY 313
>gi|242079447|ref|XP_002444492.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
gi|241940842|gb|EES13987.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
Length = 441
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 81/274 (29%), Positives = 131/274 (47%), Gaps = 40/274 (14%)
Query: 78 SATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSST 137
+++G Y + +GTP Y +DTGSDL+W CA C C + FD +S+T
Sbjct: 84 ASSGEYLVDLAIGTPPLYYTAIMDTGSDLIWTQCAPCLLCAAQ-----PTPYFDVKRSAT 138
Query: 138 SGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKT 197
+ C + C + PSC + C Y YGD +ST+G + AS
Sbjct: 139 YRALPCRSSRCAALSS---PSCFKKM-CVYQYYYGDTASTAGVLANETFTFGAASSTKVR 194
Query: 198 APLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDV 257
A +++ FGCG+ +G+L +S+ G++GFG+ SL+SQL + F++CL
Sbjct: 195 A---ANISFGCGSLNAGELANSS-----GMVGFGRGPLSLVSQLGPS-----RFSYCLTS 241
Query: 258 VKGG-------GIFAIGDVV-----SPKVKTTPMV--PNMPH-YNVILEEVEVGGNPLDL 302
G+FA + SP V++TP V P +P+ Y + ++ + +G L +
Sbjct: 242 YLSPTPSRLYFGVFANLNSTNTSSGSP-VQSTPFVINPALPNMYFLSVKGISLGTKRLPI 300
Query: 303 PTSLLGTGDE--RGTIIDSGTTLAYLPPMLYDLV 334
+ D+ G IIDSGT++ +L Y+ V
Sbjct: 301 DPLVFAINDDGTGGVIIDSGTSITWLQQDAYEAV 334
>gi|357135412|ref|XP_003569303.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 494
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 90/292 (30%), Positives = 127/292 (43%), Gaps = 44/292 (15%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPS 133
+G +G YFTK+G+GTP + +DTGSD++W+ CA C RC +S +FDP
Sbjct: 133 SGLAQGSGEYFTKIGVGTPATPALMVLDTGSDVVWLQCAPCRRCYDQSG-----QVFDPR 187
Query: 134 KSSTSGEIACSDNFCRTTYNNRYPSCSPGVR---CEYVVTYGDGSSTSGYFVRDIIQLNQ 190
+S + G + CS CR R S +R C Y V YGDGS T+G F + +
Sbjct: 188 RSRSYGAVGCSAPLCR-----RLDSGGCDLRRKACLYQVAYGDGSVTAGDFATETLTF-- 240
Query: 191 ASGNLKTAPLNSSVIFGCGNRQSGDL--GSSTDAAVDGILGF--------GQANSSLLSQ 240
+G + A + GCG+ G + G L F G++ S L
Sbjct: 241 -AGGARVA----RIALGCGHDNEGLFVAAAGLLGLGRGSLSFPAQISRRYGRSFSYCLVD 295
Query: 241 LAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSPKVKTTPMVPN---MPHYNVILEEVEVGG 297
++ N +H V G G A+G V+ TPMV N Y V L + VGG
Sbjct: 296 RTSSAN---PASHSSTVTFGSG--AVGSTVAASF--TPMVKNPRMETFYYVQLVGISVGG 348
Query: 298 NPL----DLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFRFWIASL 345
+ D L + G I+DSGT++ L Y + FR A L
Sbjct: 349 ARVSGVADSDLRLDPSSGRGGVIVDSGTSVTRLARPAYSALRDAFRAAAAGL 400
>gi|302774304|ref|XP_002970569.1| hypothetical protein SELMODRAFT_93861 [Selaginella moellendorffii]
gi|300162085|gb|EFJ28699.1| hypothetical protein SELMODRAFT_93861 [Selaginella moellendorffii]
Length = 490
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 84/300 (28%), Positives = 139/300 (46%), Gaps = 31/300 (10%)
Query: 49 LSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLW 108
L + RR ++ S ++L + G Y ++V +GTP E+ + VD S +
Sbjct: 3 LELVANSHRRRDRELLGSARMDL--HDDLLTKGYYTSRVKIGTPPHEFSLIVDRSS-FVS 59
Query: 109 VNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYV 168
CS ++ F P+ SS+ + C N C T + + G R +Y
Sbjct: 60 PKTMFCSF------FFLQDPRFSPALSSSYKPLECG-NECSTGFCD-------GSR-KYQ 104
Query: 169 VTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGIL 228
Y + S++SG +D+I + +S +L ++FGC ++GDL D DGI+
Sbjct: 105 RQYAEKSTSSGVLGKDVISFSNSS-DLG----GQRLVFGCETAETGDL---YDQTADGII 156
Query: 229 GFGQANSSLLSQLAAAGNVRKEFAHCL-DVVKGGGIFAIGDVVSPK--VKTTPMVPNMPH 285
G G+ S++ QL + F+ C + +GGG +G PK V T+ P+
Sbjct: 157 GLGRGPLSIIDQLVEKNAMEDVFSLCYGGMDEGGGAMILGGFQPPKDMVFTSSDPHRSPY 216
Query: 286 YNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFRFWIASL 345
YN++L+ + VGG+PL L + + GT++DSGTT AY P + S + + SL
Sbjct: 217 YNLMLKGIRVGGSPLRLKPEVF--DGKYGTVLDSGTTYAYFPGAAFQAFKSAVKEQVGSL 274
>gi|18855042|gb|AAL79734.1|AC091774_25 putative chloroplast nucleoid DNA-binding protein [Oryza sativa
Japonica Group]
gi|54291046|dbj|BAD61723.1| aspartic proteinase nepenthesin II-like [Oryza sativa Japonica
Group]
gi|125598520|gb|EAZ38300.1| hypothetical protein OsJ_22678 [Oryza sativa Japonica Group]
Length = 551
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 106/360 (29%), Positives = 151/360 (41%), Gaps = 67/360 (18%)
Query: 16 VVHQWAV--GGGGVMGNFVFEVENKFKAGGE---RERTLSALKQHDTRRHGR-------- 62
+V +WA G GV + AG E SAL +HD R
Sbjct: 38 IVQRWAEERGHAGV----------SWPAGAEVIGSPEYYSALSRHDHALFARRGLAQGDG 87
Query: 63 ----MMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCP 118
+I L L G+ L++ +V +GTP + V +DTGSDL WV C C +C
Sbjct: 88 LVTFADGNITLRLDGS-------LHYAEVAVGTPNTTFLVALDTGSDLFWVPC-DCKQCA 139
Query: 119 TKSDL-------GIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGV-RCEYVVT 170
+L G +L + PSKSSTS + C+ N C ++ +C+ C Y V
Sbjct: 140 PLGNLTAVDGGGGPELRQYSPSKSSTSKTVTCASNLC-----DQPNACATATSSCPYAVR 194
Query: 171 YG-DGSSTSGYFVRDIIQLNQAS---GNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDG 226
Y +S+SG V D++ L + A + + V+FGCG Q+G AA DG
Sbjct: 195 YAMANTSSSGELVEDVLYLTREKGAAAAAAGAAVRTPVVFGCGQVQTGSFLDG--AAADG 252
Query: 227 ILGFGQANSSLLSQLAAAGNVR-KEFAHCLDVVKGGGIFAIGDVVSPKVKTTPMVPNMPH 285
++G G S+ S LA+ G V+ F+ C G G GD S TP + H
Sbjct: 253 LMGLGMEKVSVPSILASTGVVKSNSFSMCFS-KDGLGRINFGDTGSADQSETPFIVKSTH 311
Query: 286 --YNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFRFWIA 343
YN+ + + VG +LP I DSGT+ YL Y + F I+
Sbjct: 312 SYYNISITSMSVGDK--NLPLGFYA-------IADSGTSFTYLNDPAYTAYTTNFNAQIS 362
>gi|21594980|gb|AAM66061.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
Length = 485
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 100/332 (30%), Positives = 144/332 (43%), Gaps = 60/332 (18%)
Query: 46 ERTLSALKQHDTRRHGRMMASIDLELGG-----------------NGHPSATGLYFTKVG 88
+ S+ Q D+RR R +A++ ++ G +G +G YFT++G
Sbjct: 89 QELFSSRLQRDSRRV-RSIATLAAQIPGRNVTHAPRPGGFSSSVVSGLSQGSGEYFTRLG 147
Query: 89 LGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFC 148
+GTP Y+ +DTGSD++W+ CA C RC ++SD +FDP KS T I CS C
Sbjct: 148 VGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSD-----PIFDPRKSKTYATIPCSSPHC 202
Query: 149 RTTYNNRYPSCSPGVR---CEYVVTYGDGSSTSGYFVRDIIQL--NQASGNLKTAPLNSS 203
R R S R C Y V+YGDGS T G F + + N+ G
Sbjct: 203 R-----RLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRNRVKG---------- 247
Query: 204 VIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL----DVVK 259
V GCG+ G G+LG G+ S Q N ++F++CL K
Sbjct: 248 VALGCGHDNEGLF-----VGAAGLLGLGKGKLSFPGQTGHRFN--QKFSYCLVDRSASSK 300
Query: 260 GGGIFAIGDVVSPKVKTTPMVPNMP---HYNVILEEVEVGGNPLDLPTSLLGTGDER--- 313
+ VS + TP++ N Y V L + VGG + T+ L D+
Sbjct: 301 PSSVVFGNAAVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNG 360
Query: 314 GTIIDSGTTLAYLPPMLYDLVLSQFRFWIASL 345
G IIDSGT++ L Y + FR +L
Sbjct: 361 GVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTL 392
>gi|125556778|gb|EAZ02384.1| hypothetical protein OsI_24487 [Oryza sativa Indica Group]
Length = 551
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 106/360 (29%), Positives = 151/360 (41%), Gaps = 67/360 (18%)
Query: 16 VVHQWAV--GGGGVMGNFVFEVENKFKAGGE---RERTLSALKQHDTRRHGR-------- 62
+V +WA G GV + AG E SAL +HD R
Sbjct: 38 IVQRWAEERGHAGV----------SWPAGAEVIGSPEYYSALSRHDHALFARRGLAQGDG 87
Query: 63 ----MMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCP 118
+I L L G+ L++ +V +GTP + V +DTGSDL WV C C +C
Sbjct: 88 LVTFADGNITLRLDGS-------LHYAEVAVGTPNTTFLVALDTGSDLFWVPC-DCKQCA 139
Query: 119 TKSDL-------GIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGV-RCEYVVT 170
+L G +L + PSKSSTS + C+ N C ++ +C+ C Y V
Sbjct: 140 PLGNLTAVDGGGGPELRQYSPSKSSTSKTVTCASNLC-----DQPNACATATSSCPYAVR 194
Query: 171 YG-DGSSTSGYFVRDIIQLNQAS---GNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDG 226
Y +S+SG V D++ L + A + + V+FGCG Q+G AA DG
Sbjct: 195 YAMANTSSSGELVEDVLYLTREKGAAAAAAGAAVRTPVVFGCGQVQTGSFLDG--AAADG 252
Query: 227 ILGFGQANSSLLSQLAAAGNVR-KEFAHCLDVVKGGGIFAIGDVVSPKVKTTPMVPNMPH 285
++G G S+ S LA+ G V+ F+ C G G GD S TP + H
Sbjct: 253 LMGLGMEKVSVPSILASTGVVKSNSFSMCFS-KDGLGRINFGDTGSADQSETPFIVKSTH 311
Query: 286 --YNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFRFWIA 343
YN+ + + VG +LP I DSGT+ YL Y + F I+
Sbjct: 312 SYYNISITSMSVGDK--NLPLGFYA-------IADSGTSFTYLNDPAYTAYTTNFNAQIS 362
>gi|255544139|ref|XP_002513132.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223548143|gb|EEF49635.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 481
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 83/314 (26%), Positives = 139/314 (44%), Gaps = 43/314 (13%)
Query: 33 FEVENKFKAGGERERT-LSALKQHDTRRHGRMMASIDLELGG---NGHPSATGLYFTKVG 88
++ + F A +R++ ++ L + + R S++ E G +G +G YF ++G
Sbjct: 89 YDHSHNFHARIQRDKKRVATLIRRLSPRDATSSYSVE-EFGAEVVSGMNQGSGEYFIRIG 147
Query: 89 LGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFC 148
+G+P E YV +D+GSD++WV C C++C ++D +FDP+ S++ + CS + C
Sbjct: 148 VGSPPREQYVVIDSGSDIVWVQCQPCTQCYHQTD-----PVFDPADSASFMGVPCSSSVC 202
Query: 149 RTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGC 208
N C G C Y V YGDGS T G + + + + +V GC
Sbjct: 203 ERIEN---AGCHAG-GCRYEVMYGDGSYTKGTLALETLTFGRT--------VVRNVAIGC 250
Query: 209 GNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL---------DVVK 259
G+R G + G + SL+ QL G F++CL +
Sbjct: 251 GHRNRGMFVGAAGLLGL-----GGGSMSLVGQL--GGQTGGAFSYCLVSRGTDSAGSLEF 303
Query: 260 GGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGN--PLDLPTSLLGTGDERGTII 317
G G +G P ++ P P+ Y + L V VGG P+ L G ++
Sbjct: 304 GRGAMPVGAAWIPLIR-NPRAPSF--YYIRLSGVGVGGMKVPISEDVFQLNEMGNGGVVM 360
Query: 318 DSGTTLAYLPPMLY 331
D+GT + +P + Y
Sbjct: 361 DTGTAVTRIPTVAY 374
>gi|190896584|gb|ACE96805.1| aspartyl protease [Populus tremula]
gi|190896586|gb|ACE96806.1| aspartyl protease [Populus tremula]
gi|190896588|gb|ACE96807.1| aspartyl protease [Populus tremula]
gi|190896590|gb|ACE96808.1| aspartyl protease [Populus tremula]
gi|190896592|gb|ACE96809.1| aspartyl protease [Populus tremula]
gi|190896594|gb|ACE96810.1| aspartyl protease [Populus tremula]
gi|190896596|gb|ACE96811.1| aspartyl protease [Populus tremula]
gi|190896598|gb|ACE96812.1| aspartyl protease [Populus tremula]
gi|190896600|gb|ACE96813.1| aspartyl protease [Populus tremula]
gi|190896602|gb|ACE96814.1| aspartyl protease [Populus tremula]
gi|190896604|gb|ACE96815.1| aspartyl protease [Populus tremula]
gi|190896606|gb|ACE96816.1| aspartyl protease [Populus tremula]
gi|190896610|gb|ACE96818.1| aspartyl protease [Populus tremula]
gi|190896612|gb|ACE96819.1| aspartyl protease [Populus tremula]
gi|190896614|gb|ACE96820.1| aspartyl protease [Populus tremula]
gi|190896616|gb|ACE96821.1| aspartyl protease [Populus tremula]
gi|190896618|gb|ACE96822.1| aspartyl protease [Populus tremula]
gi|190896620|gb|ACE96823.1| aspartyl protease [Populus tremula]
gi|190896622|gb|ACE96824.1| aspartyl protease [Populus tremula]
gi|190896624|gb|ACE96825.1| aspartyl protease [Populus tremula]
gi|190896626|gb|ACE96826.1| aspartyl protease [Populus tremula]
gi|190896628|gb|ACE96827.1| aspartyl protease [Populus tremula]
gi|190896630|gb|ACE96828.1| aspartyl protease [Populus tremula]
gi|190896632|gb|ACE96829.1| aspartyl protease [Populus tremula]
gi|190896634|gb|ACE96830.1| aspartyl protease [Populus tremula]
gi|190896636|gb|ACE96831.1| aspartyl protease [Populus tremula]
gi|190896638|gb|ACE96832.1| aspartyl protease [Populus tremula]
gi|190896640|gb|ACE96833.1| aspartyl protease [Populus tremula]
gi|190896642|gb|ACE96834.1| aspartyl protease [Populus tremula]
gi|190896644|gb|ACE96835.1| aspartyl protease [Populus tremula]
gi|190896646|gb|ACE96836.1| aspartyl protease [Populus tremula]
gi|190896648|gb|ACE96837.1| aspartyl protease [Populus tremula]
gi|190896650|gb|ACE96838.1| aspartyl protease [Populus tremula]
gi|190896652|gb|ACE96839.1| aspartyl protease [Populus tremula]
gi|190896654|gb|ACE96840.1| aspartyl protease [Populus tremula]
gi|190896656|gb|ACE96841.1| aspartyl protease [Populus tremula]
gi|190896658|gb|ACE96842.1| aspartyl protease [Populus tremula]
Length = 339
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 85/269 (31%), Positives = 121/269 (44%), Gaps = 39/269 (14%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y +V LGTP + ++ +DT +D WV C+GC+ C + T F P+ S+T G +
Sbjct: 45 YVVRVKLGTPGQQMFMVLDTSNDAAWVPCSGCTGCSS--------TTFLPNASTTLGSLD 96
Query: 143 CSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNS 202
CS+ C P+ C + +YG SS + V+D I L +
Sbjct: 97 CSEAQCSQVRGFSCPATGSSA-CLFNQSYGGDSSLAATLVQDAITLAND--------VIP 147
Query: 203 SVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKG-- 260
FGC N SG G+LG G+ SL+SQ A F++CL K
Sbjct: 148 GFTFGCINAVSGG-----SIPPQGLLGLGRGPISLISQ--AGAMYSGVFSYCLPSFKSYY 200
Query: 261 -GGIFAIGDVVSPK-VKTTPMVPNMPH----YNVILEEVEVGGNPLDLPTSLL----GTG 310
G +G V PK ++TTP++ N PH Y V L V VG + +P+ L TG
Sbjct: 201 FSGSLKLGPVGQPKSIRTTPLLRN-PHRPSLYYVNLTGVSVGRIKVPIPSEQLVFDPNTG 259
Query: 311 DERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
GTIIDSGT + +Y + +FR
Sbjct: 260 --AGTIIDSGTVITRFVQPVYFAIRDEFR 286
>gi|190896608|gb|ACE96817.1| aspartyl protease [Populus tremula]
Length = 339
Score = 91.7 bits (226), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 85/269 (31%), Positives = 121/269 (44%), Gaps = 39/269 (14%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y +V LGTP + ++ +DT +D WV C+GC+ C + T F P+ S+T G +
Sbjct: 45 YVVRVKLGTPGQQMFMVLDTSNDAAWVPCSGCTGCSS--------TTFLPNASTTLGSLD 96
Query: 143 CSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNS 202
CS+ C P+ C + +YG SS + V+D I L +
Sbjct: 97 CSEAQCSQVRGFSCPATGSSA-CLFNQSYGGDSSLAATLVQDAITLAND--------VIP 147
Query: 203 SVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKG-- 260
FGC N SG G+LG G+ SL+SQ A F++CL K
Sbjct: 148 GFTFGCINAVSGG-----SIPPQGLLGLGRGPISLISQ--AGAMYSGVFSYCLPSFKSYY 200
Query: 261 -GGIFAIGDVVSPK-VKTTPMVPNMPH----YNVILEEVEVGGNPLDLPTSLL----GTG 310
G +G V PK ++TTP++ N PH Y V L V VG + +P+ L TG
Sbjct: 201 FSGSLKLGPVGQPKSIRTTPLLRN-PHRPSLYYVNLTGVSVGRIKVPIPSEQLVFDPNTG 259
Query: 311 DERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
GTIIDSGT + +Y + +FR
Sbjct: 260 --AGTIIDSGTVITRFVQPVYFAIRDEFR 286
>gi|326517992|dbj|BAK07248.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 500
Score = 91.7 bits (226), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 81/268 (30%), Positives = 118/268 (44%), Gaps = 36/268 (13%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSG 139
+G YF++VG+G P + Y+ +DTGSD+ W+ C C+ C +SD ++DPS S++
Sbjct: 160 SGEYFSRVGVGRPARQLYMVLDTGSDVTWLQCQPCADCYAQSD-----PVYDPSVSTSYA 214
Query: 140 EIACSDNFCRTTYNNRYPSCSPGV-RCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTA 198
+ C CR + +C C Y V YGDGS T G F + + L +A
Sbjct: 215 TVGCDSPRCR---DLDAAACRNSTGSCLYEVAYGDGSYTVGDFATETLTLG------DSA 265
Query: 199 PLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL--D 256
P+ S+V GCG+ G + G S SQ++A F++CL
Sbjct: 266 PV-SNVAIGCGHDNEGLFVGAAGLLALGGGPL-----SFPSQISA-----TTFSYCLVDR 314
Query: 257 VVKGGGIFAIGDVVSPKVKTTPMVPNMPH----YNVILEEVEVGGNPLDLPTSLLGTGD- 311
GD P V T P++ P Y V L + VGG L +P+S D
Sbjct: 315 DSPSSSTLQFGDSEQPAV-TAPLI-RSPRTNTFYYVALSGISVGGEALSIPSSAFAMDDA 372
Query: 312 -ERGTIIDSGTTLAYLPPMLYDLVLSQF 338
G I+DSGT + L Y + F
Sbjct: 373 GSGGVIVDSGTAVTRLQSGAYGALREAF 400
>gi|356531224|ref|XP_003534178.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 492
Score = 91.7 bits (226), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 84/280 (30%), Positives = 129/280 (46%), Gaps = 36/280 (12%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPS 133
+G +G YF++VG+G P+ +Y+ +DTGSD+ W+ C CS C +SD +FDP+
Sbjct: 148 SGTAQGSGEYFSRVGVGQPSKPFYMVLDTGSDVNWLQCKPCSDCYQQSD-----PIFDPT 202
Query: 134 KSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASG 193
SS+ + C C+ + +C G +C Y V+YGDGS T G +V + + S
Sbjct: 203 ASSSYNPLTCDAQQCQ---DLEMSACRNG-KCLYQVSYGDGSFTVGEYVTETVSFGAGSV 258
Query: 194 NLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAH 253
N V GCG+ G G+LG G SL SQ+ A F++
Sbjct: 259 N--------RVAIGCGHDNEGLF-----VGSAGLLGLGGGPLSLTSQIKATS-----FSY 300
Query: 254 CLDVVKGGGIFAIGDVVSPKVKTTPMVPNMPH------YNVILEEVEVGGNPLDLPTSLL 307
CL V + G + + SP+ + + P + + Y V L V VGG + +P
Sbjct: 301 CL-VDRDSGKSSTLEFNSPRPGDSVVAPLLKNQKVNTFYYVELTGVSVGGEIVTVPPETF 359
Query: 308 GTGDE--RGTIIDSGTTLAYLPPMLYDLVLSQFRFWIASL 345
G I+DSGT + L Y+ V F+ ++L
Sbjct: 360 AVDQSGAGGVIVDSGTAITRLRTQAYNSVRDAFKRKTSNL 399
>gi|224085379|ref|XP_002307559.1| predicted protein [Populus trichocarpa]
gi|222857008|gb|EEE94555.1| predicted protein [Populus trichocarpa]
Length = 455
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 87/309 (28%), Positives = 141/309 (45%), Gaps = 35/309 (11%)
Query: 48 TLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLL 107
T ++ + + T G++MA+++ +G +G YF V +GTP Y + +DTGSDL
Sbjct: 60 TAASPESYGTGLSGQLMATLE-----SGVTLGSGEYFMDVFIGTPPKHYSLILDTGSDLN 114
Query: 108 WVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCR--TTYNNRYPSCSPGVRC 165
W+ C C C ++ +DP +SS+ I C D C ++ + P + C
Sbjct: 115 WIQCVPCHDCFEQNG-----PYYDPKESSSFRNIGCHDPRCHLVSSPDPPLPCKAENQTC 169
Query: 166 EYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTA-PLNSSVIFGCGNRQSGDLGSSTDAAV 224
Y YGD S+T+G F + +N S K+ +V+FGCG+ G ++
Sbjct: 170 PYFYWYGDSSNTTGDFATETFTVNLTSPTGKSEFKRVENVMFGCGHWNRGLFHGASGLLG 229
Query: 225 DGILGFGQANSSLLSQLAAAGNVRKEFAHCL------DVVKGGGIF-AIGDVVS-PKVKT 276
G+ S SQL + F++CL V IF D+++ P++
Sbjct: 230 L-----GRGPLSFSSQLQSL--YGHSFSYCLVDRNSDTNVSSKLIFGEDKDLLNHPELNF 282
Query: 277 TPMV-----PNMPHYNVILEEVEVGGNPLDLPTSLLGTGDE--RGTIIDSGTTLAYLPPM 329
T +V P Y V ++ + VGG L++P S + GTI+DSGTTL+Y
Sbjct: 283 TTLVGGKENPVDTFYYVQIKSIMVGGEVLNIPESTWNMTSDGVGGTIVDSGTTLSYFTEP 342
Query: 330 LYDLVLSQF 338
Y ++ F
Sbjct: 343 AYQIIKDAF 351
>gi|116786826|gb|ABK24255.1| unknown [Picea sitchensis]
gi|148906052|gb|ABR16185.1| unknown [Picea sitchensis]
Length = 485
Score = 91.3 bits (225), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 84/277 (30%), Positives = 124/277 (44%), Gaps = 37/277 (13%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPS 133
+G +G YF+++G+G P + + +DTGSD+ W+ C CS C +SD +++P+
Sbjct: 136 SGMDQGSGEYFSRIGVGAPRRDQLMVLDTGSDVTWIQCEPCSDCYQQSD-----PIYNPA 190
Query: 134 KSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASG 193
SS+ + C N C+ CS C Y V+YGDGS T G F + +
Sbjct: 191 LSSSYKLVGCQANLCQQL---DVSGCSRNGSCLYQVSYGDGSYTQGNFATETL------- 240
Query: 194 NLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAH 253
L APL +V GCG+ G G+LG G + S SQL K F++
Sbjct: 241 TLGGAPLQ-NVAIGCGHDNEGLF-----VGAAGLLGLGGGSLSFPSQLTDENG--KIFSY 292
Query: 254 CL---------DVVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPT 304
CL + G G V++P +K + + Y V L + VGG L +
Sbjct: 293 CLVDRDSESSSTLQFGRAAVPNGAVLAPMLKNSRL---DTFYYVSLSGISVGGKMLSISD 349
Query: 305 SLLG--TGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
S+ G G I+DSGT + L YD + FR
Sbjct: 350 SVFGIDASGNGGVIVDSGTAVTRLQTAAYDSLRDAFR 386
>gi|224126221|ref|XP_002329620.1| predicted protein [Populus trichocarpa]
gi|222870359|gb|EEF07490.1| predicted protein [Populus trichocarpa]
Length = 357
Score = 91.3 bits (225), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 82/274 (29%), Positives = 122/274 (44%), Gaps = 36/274 (13%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPS 133
+G +G YFT+VG+G P ++Y+ +DTGSD+ W+ C C+ C ++D +FDP+
Sbjct: 11 SGTSQGSGEYFTRVGVGNPARQFYMVLDTGSDINWLQCQPCTDCYQQTD-----PIFDPT 65
Query: 134 KSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASG 193
SST + C C + SC G +C Y V YGDGS T G F + + SG
Sbjct: 66 ASSTYAPVTCQSQQCSSL---EMSSCRSG-QCLYQVNYGDGSYTFGDFATESVSFGN-SG 120
Query: 194 NLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAH 253
++K +V GCG+ G + G SL +QL A F++
Sbjct: 121 SVK------NVALGCGHDNEGLFVGAAGLLGLGGGPL-----SLTNQLKAT-----SFSY 164
Query: 254 CLDVVKGGGIFAIGDVVSPKVK----TTPMVPNMP---HYNVILEEVEVGGNPLDLPTSL 306
CL G + D S ++ T P++ N Y V L + VGG + +P S
Sbjct: 165 CLVNRDSAGSSTL-DFNSAQLGVDSVTAPLMKNRKIDTFYYVGLSGMSVGGQMVSIPEST 223
Query: 307 --LGTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
L G I+D GT + L Y+ + F
Sbjct: 224 FRLDESGNGGIIVDCGTAITRLQTQAYNPLRDAF 257
>gi|125595861|gb|EAZ35641.1| hypothetical protein OsJ_19928 [Oryza sativa Japonica Group]
Length = 629
Score = 91.3 bits (225), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 71/253 (28%), Positives = 110/253 (43%), Gaps = 33/253 (13%)
Query: 98 VQVDTGSDLLWVNCAGCS--RCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNR 155
V +D+GSD+ WV C C C + D LFDP+ S+T + C+ C R
Sbjct: 79 VIIDSGSDVSWVQCKPCPLPMCHRQRD-----PLFDPAMSTTYAAVPCTSAACAQLGPYR 133
Query: 156 YPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGD 215
CS +C++ + YGDGS+ +G + D + L + FGC + D
Sbjct: 134 R-GCSANAQCQFGINYGDGSTATGTYSFDDLTLGPYD-------VIRGFRFGCAH---AD 182
Query: 216 LGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSPKVK 275
GS+ D V G L G + SL+ Q A + F++CL F + V + +
Sbjct: 183 RGSAFDYDVAGSLALGGGSQSLVQQTAT--RYGRVFSYCLPPTASSLGFLVLGVPPERAQ 240
Query: 276 TTPMVPNMP---------HYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYL 326
P + P Y V+L + V G PL +P ++ ++IDS T ++ L
Sbjct: 241 LIPSFVSTPLLSSSMAPTFYRVLLRAIIVAGRPLAVPPAVFSA----SSVIDSSTIISRL 296
Query: 327 PPMLYDLVLSQFR 339
PP Y + + FR
Sbjct: 297 PPTAYQALRAAFR 309
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 43/186 (23%), Positives = 74/186 (39%), Gaps = 45/186 (24%)
Query: 159 CSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFG-----CGNRQS 213
CS +C++ + YGDGS+ +G + D + L + + PL ++ +G C
Sbjct: 389 CSANAQCQFGINYGDGSTATGTYSFDDLTLGPYDVDRQGLPLRTATQYGRVFSYCIPPSP 448
Query: 214 GDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSPK 273
LG T LG ++L+ V +P
Sbjct: 449 SSLGFIT-------LGVPPQRAALVPTF---------------------------VSTPL 474
Query: 274 VKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDL 333
+ ++ M P Y V+L + V G PL +P ++ T ++I S T ++ LPP Y
Sbjct: 475 LSSSSMPPTF--YRVLLRAIIVAGRPLPVPPTVFST----SSVIASTTVISRLPPTAYQA 528
Query: 334 VLSQFR 339
+ + FR
Sbjct: 529 LRAAFR 534
>gi|225427554|ref|XP_002266533.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 447
Score = 91.3 bits (225), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 84/268 (31%), Positives = 112/268 (41%), Gaps = 30/268 (11%)
Query: 78 SATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSST 137
S G Y + LGTP + DTGSDLLW C C C + + +FDP+KS T
Sbjct: 90 SNNGEYLMNISLGTPPVSMHGIADTGSDLLWRQCKPCDSCYEQIE-----PIFDPAKSKT 144
Query: 138 SGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKT 197
++C C CS C Y +YGDGS TSG D + + +G +
Sbjct: 145 YQILSCEGKSCSNLGGQG--GCSDDNTCIYSYSYGDGSHTSGDLAVDTLTIGSTTGRPVS 202
Query: 198 APLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL-- 255
P V+FGCG+ G + G++G G S++SQL + F++CL
Sbjct: 203 VP---KVVFGCGHNNGGTF----ELHGSGLVGLGGGPLSMISQLRPL--IGGRFSYCLVP 253
Query: 256 -----DVVKGGGIFAIGDVVSPKVKTTPMVPNMP--HYNVILEEVEVGGNPLDLP----- 303
V + G V +TP+ P Y + LE + VG L
Sbjct: 254 LGNDPSVSSKMHFGSRGIVSGAGAVSTPLASRQPDTFYYLTLESMSVGSKKLAYKGFSKV 313
Query: 304 TSLLGTGDERGTIIDSGTTLAYLPPMLY 331
S L DE IIDSGTTL LP Y
Sbjct: 314 GSPLADADEGNIIIDSGTTLTLLPQDFY 341
>gi|118484458|gb|ABK94105.1| unknown [Populus trichocarpa]
Length = 499
Score = 91.3 bits (225), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 82/274 (29%), Positives = 122/274 (44%), Gaps = 36/274 (13%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPS 133
+G +G YFT+VG+G P ++Y+ +DTGSD+ W+ C C+ C ++D +FDP+
Sbjct: 152 SGTSQGSGEYFTRVGVGNPARQFYMVLDTGSDINWLQCQPCTDCYQQTD-----PIFDPT 206
Query: 134 KSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASG 193
SST + C C + SC G +C Y V YGDGS T G F + + SG
Sbjct: 207 ASSTYAPVTCQSQQCSSL---EMSSCRSG-QCLYQVNYGDGSYTFGDFATESVSFGN-SG 261
Query: 194 NLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAH 253
++K +V GCG+ G + G SL +QL A F++
Sbjct: 262 SVK------NVALGCGHDNEGLFVGAAGLLGLGGGPL-----SLTNQLKATS-----FSY 305
Query: 254 CLDVVKGGGIFAIGDVVSPKV----KTTPMVPNMP---HYNVILEEVEVGGNPLDLPTSL 306
CL G + D S ++ T P++ N Y V L + VGG + +P S
Sbjct: 306 CLVNRDSAGSSTL-DFNSAQLGVDSVTAPLMKNRKIDTFYYVGLSGMSVGGQMVSIPEST 364
Query: 307 --LGTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
L G I+D GT + L Y+ + F
Sbjct: 365 FRLDESGNGGIIVDCGTAITRLQTQAYNPLRDAF 398
>gi|356546372|ref|XP_003541600.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 433
Score = 90.9 bits (224), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 76/261 (29%), Positives = 121/261 (46%), Gaps = 26/261 (9%)
Query: 78 SATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSST 137
SA G Y +GTP+ + + +DTGSD++W+ C C +C ++ +FD SKS T
Sbjct: 84 SALGEYLISYSVGTPSLQVFGILDTGSDIIWLQCQPCKKCYEQTT-----PIFDSSKSQT 138
Query: 138 SGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKT 197
+ C N C++ CS C Y + Y DGS + G + + L +G+
Sbjct: 139 YKTLPCPSNTCQSVQGTF---CSSRKHCLYSIHYVDGSQSLGDLSVETLTLGSTNGSPVQ 195
Query: 198 APLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHC--- 254
P + GCG + + + GI+G G+ SL++QL+ + +F++C
Sbjct: 196 FP---GTVIGCGRYNAIGI----EEKNSGIVGLGRGPMSLITQLSPS--TGGKFSYCLVP 246
Query: 255 -LDVVKGGGIFAIGDVVSPK-VKTTPMVPN--MPHYNVILEEVEVGGNPLDLPTSLLGTG 310
L F VVS + +TP+ + Y + LE VG N ++ + G+G
Sbjct: 247 GLSTASSKLNFGNAAVVSGRGTVSTPLFSKNGLVFYFLTLEAFSVGRNRIEFGSP--GSG 304
Query: 311 DERGTIIDSGTTLAYLPPMLY 331
+ IIDSGTTL LP +Y
Sbjct: 305 GKGNIIIDSGTTLTALPNGVY 325
>gi|297723777|ref|NP_001174252.1| Os05g0187600 [Oryza sativa Japonica Group]
gi|255676094|dbj|BAH92980.1| Os05g0187600 [Oryza sativa Japonica Group]
Length = 340
Score = 90.9 bits (224), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 42/108 (38%), Positives = 65/108 (60%)
Query: 224 VDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSPKVKTTPMVPNM 283
VDG++G G +N+SL+ QLA + +K FAHCLD + GGIF +G +V PKV+ TP+
Sbjct: 89 VDGVMGLGPSNTSLVYQLAKSQKWKKMFAHCLDGKRSGGIFVLGHIVGPKVRKTPLDQTS 148
Query: 284 PHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLY 331
Y L E+ VG L L + + TI+++G+ ++YLP ++
Sbjct: 149 SRYRTTLLEITVGETSLSLSAGNVEIKSQNMTILETGSLISYLPEKIF 196
>gi|356511197|ref|XP_003524315.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 431
Score = 90.9 bits (224), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 82/274 (29%), Positives = 120/274 (43%), Gaps = 27/274 (9%)
Query: 65 ASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC-AGCSRCPTKSDL 123
+SI L GN +P G Y + +G P Y++ VDTGSDL W+ C A C+ C
Sbjct: 55 SSIVFPLYGNVYP--VGFYNVTLNIGQPARPYFLDVDTGSDLTWLQCDAPCTHCSETP-- 110
Query: 124 GIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVR 183
P ++ + C D C + +C +C+Y + Y D ST G +
Sbjct: 111 -------HPLHRPSNDFVPCRDPLCASLQPTEDYNCEHPDQCDYEINYADQYSTYGVLLN 163
Query: 184 DIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAA 243
D+ LN ++G L + GCG Q S+ +DG+LG G+ +SL+SQL +
Sbjct: 164 DVYLLNSSNG----VQLKVRMALGCGYDQV--FSPSSYHPLDGLLGLGRGKASLISQLNS 217
Query: 244 AGNVRKEFAHCLDVVKGGGIFAIGDVVSPKVKTTPMVP-NMPHYNVILEEVEVGGNPLDL 302
G VR HCL GG IF S +V TP+ + HY+ E+ GG
Sbjct: 218 QGLVRNVIGHCLSSQGGGYIFFGNAYDSARVTWTPISSVDSKHYSAGPAELVFGGRK--- 274
Query: 303 PTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLS 336
G G + D+G++ Y Y +LS
Sbjct: 275 ----TGVG-SLTAVFDTGSSYTYFNSHAYQALLS 303
>gi|218197465|gb|EEC79892.1| hypothetical protein OsI_21411 [Oryza sativa Indica Group]
Length = 720
Score = 90.9 bits (224), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 73/255 (28%), Positives = 112/255 (43%), Gaps = 37/255 (14%)
Query: 98 VQVDTGSDLLWVNCAGCS--RCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNR 155
V +D+GSD+ WV C C C + D LFDP+ S+T + C+ C R
Sbjct: 170 VIIDSGSDVSWVQCKPCPLPMCHRQRD-----PLFDPAMSTTYAAVPCTSAACAQLGPYR 224
Query: 156 YPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGD 215
CS +C++ + YGDGS+ +G + D + L + FGC + D
Sbjct: 225 R-GCSANAQCQFGINYGDGSTATGTYSFDDLTLGPYD-------VIRGFRFGCAH---AD 273
Query: 216 LGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGG-GIFAIG------- 267
GS+ D V G L G + SL+ Q A + F++CL G +G
Sbjct: 274 RGSAFDYDVAGSLALGGGSQSLVQQTAT--RYGRVFSYCLPPTASSLGFLVLGVPPERAQ 331
Query: 268 ---DVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLA 324
VS + ++ M P Y V+L + V G PL +P ++ ++IDS T ++
Sbjct: 332 LIPSFVSTPLLSSSMAPTF--YRVLLRAIIVAGRPLAVPPAVFSA----SSVIDSSTIIS 385
Query: 325 YLPPMLYDLVLSQFR 339
LPP Y + + FR
Sbjct: 386 RLPPTAYQALRAAFR 400
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 43/186 (23%), Positives = 74/186 (39%), Gaps = 45/186 (24%)
Query: 159 CSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFG-----CGNRQS 213
CS +C++ + YGDGS+ +G + D + L + + PL ++ +G C
Sbjct: 480 CSANAQCQFGINYGDGSTATGTYSFDDLTLGPYDVDRQGLPLRTATQYGRVFSYCIPPSP 539
Query: 214 GDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSPK 273
LG T LG ++L+ V +P
Sbjct: 540 SSLGFIT-------LGVPPQRAALVPTF---------------------------VSTPL 565
Query: 274 VKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDL 333
+ ++ M P Y V+L + V G PL +P ++ T ++I S T ++ LPP Y
Sbjct: 566 LSSSSMPPTF--YRVLLRAIIVAGRPLPVPPTVFST----SSVIASTTVISRLPPTAYQA 619
Query: 334 VLSQFR 339
+ + FR
Sbjct: 620 LRAAFR 625
>gi|115466068|ref|NP_001056633.1| Os06g0119600 [Oryza sativa Japonica Group]
gi|55296446|dbj|BAD68569.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|55296924|dbj|BAD68375.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
Group]
gi|113594673|dbj|BAF18547.1| Os06g0119600 [Oryza sativa Japonica Group]
gi|215694767|dbj|BAG89958.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215737752|dbj|BAG96882.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 495
Score = 90.9 bits (224), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 73/255 (28%), Positives = 112/255 (43%), Gaps = 37/255 (14%)
Query: 98 VQVDTGSDLLWVNCAGCS--RCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNR 155
V +D+GSD+ WV C C C + D LFDP+ S+T + C+ C R
Sbjct: 170 VIIDSGSDVSWVQCKPCPLPMCHRQRD-----PLFDPAMSTTYAAVPCTSAACAQLGPYR 224
Query: 156 YPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGD 215
CS +C++ + YGDGS+ +G + D + L + FGC + D
Sbjct: 225 R-GCSANAQCQFGINYGDGSTATGTYSFDDLTLGPYD-------VIRGFRFGCAH---AD 273
Query: 216 LGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGG-GIFAIG------- 267
GS+ D V G L G + SL+ Q A + F++CL G +G
Sbjct: 274 RGSAFDYDVAGSLALGGGSQSLVQQTAT--RYGRVFSYCLPPTASSLGFLVLGVPPERAQ 331
Query: 268 ---DVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLA 324
VS + ++ M P Y V+L + V G PL +P ++ ++IDS T ++
Sbjct: 332 LIPSFVSTPLLSSSMAPTF--YRVLLRAIIVAGRPLAVPPAVFSA----SSVIDSSTIIS 385
Query: 325 YLPPMLYDLVLSQFR 339
LPP Y + + FR
Sbjct: 386 RLPPTAYQALRAAFR 400
>gi|61214233|sp|Q766C3.1|NEP1_NEPGR RecName: Full=Aspartic proteinase nepenthesin-1; AltName:
Full=Nepenthesin-I; Flags: Precursor
gi|41016421|dbj|BAD07474.1| aspartic proteinase nepenthesin I [Nepenthes gracilis]
Length = 437
Score = 90.9 bits (224), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 101/363 (27%), Positives = 157/363 (43%), Gaps = 66/363 (18%)
Query: 6 LLALVVVTVAVV-------------HQWAVGGGGVMGNFVFEVEN--KFKAGGERERTLS 50
LLAL +V + V H+ V G +M V +N KF+ ER +
Sbjct: 9 LLALSIVYIFVAPTHSTSRTALNHRHEAKVTGFQIMLEHVDSGKNLTKFQL---LERAI- 64
Query: 51 ALKQHDTRRHGRMMASIDLELGGNGHPSAT----GLYFTKVGLGTPTDEYYVQVDTGSDL 106
+ +RR R+ A ++ G +G ++ G Y + +GTP + +DTGSDL
Sbjct: 65 ---ERGSRRLQRLEAMLN---GPSGVETSVYAGDGEYLMNLSIGTPAQPFSAIMDTGSDL 118
Query: 107 LWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCE 166
+W C C++C +S +F+P SS+ + CS C+ + P+CS C+
Sbjct: 119 IWTQCQPCTQCFNQST-----PIFNPQGSSSFSTLPCSSQLCQALSS---PTCSNNF-CQ 169
Query: 167 YVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDG 226
Y YGDGS T G + + S P ++ FGCG G G A G
Sbjct: 170 YTYGYGDGSETQGSMGTETLTFGSVS-----IP---NITFGCGENNQG-FGQGNGA---G 217
Query: 227 ILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGG-----IFAIGDVVSPKVKTTPMVP 281
++G G+ SL SQL +V K F++C+ + + ++ + V+ T ++
Sbjct: 218 LVGMGRGPLSLPSQL----DVTK-FSYCMTPIGSSTPSNLLLGSLANSVTAGSPNTTLIQ 272
Query: 282 N--MP-HYNVILEEVEVGGNPLDLPTSLLGTGDERGT---IIDSGTTLAYLPPMLYDLVL 335
+ +P Y + L + VG L + S GT IIDSGTTL Y Y V
Sbjct: 273 SSQIPTFYYITLNGLSVGSTRLPIDPSAFALNSNNGTGGIIIDSGTTLTYFVNNAYQSVR 332
Query: 336 SQF 338
+F
Sbjct: 333 QEF 335
>gi|15223368|ref|NP_171637.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|9665144|gb|AAF97328.1|AC023628_9 Unknown protein [Arabidopsis thaliana]
gi|22135930|gb|AAM91547.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
gi|30387595|gb|AAP31963.1| At1g01300 [Arabidopsis thaliana]
gi|332189147|gb|AEE27268.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 485
Score = 90.9 bits (224), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 92/287 (32%), Positives = 128/287 (44%), Gaps = 42/287 (14%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPS 133
+G +G YFT++G+GTP Y+ +DTGSD++W+ CA C RC ++SD +FDP
Sbjct: 133 SGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSD-----PIFDPR 187
Query: 134 KSSTSGEIACSDNFCRTTYNNRYPSCSPGVR---CEYVVTYGDGSSTSGYFVRDIIQL-- 188
KS T I CS CR R S R C Y V+YGDGS T G F + +
Sbjct: 188 KSKTYATIPCSSPHCR-----RLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRR 242
Query: 189 NQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVR 248
N+ G V GCG+ G G+LG G+ S Q N
Sbjct: 243 NRVKG----------VALGCGHDNEGLF-----VGAAGLLGLGKGKLSFPGQTGHRFN-- 285
Query: 249 KEFAHCL----DVVKGGGIFAIGDVVSPKVKTTPMVPNMP---HYNVILEEVEVGGNPLD 301
++F++CL K + VS + TP++ N Y V L + VGG +
Sbjct: 286 QKFSYCLVDRSASSKPSSVVFGNAAVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVP 345
Query: 302 LPTSLLGTGDER---GTIIDSGTTLAYLPPMLYDLVLSQFRFWIASL 345
T+ L D+ G IIDSGT++ L Y + FR +L
Sbjct: 346 GVTASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTL 392
>gi|308813706|ref|XP_003084159.1| Aspartyl protease (ISS) [Ostreococcus tauri]
gi|116056042|emb|CAL58575.1| Aspartyl protease (ISS) [Ostreococcus tauri]
Length = 478
Score = 90.5 bits (223), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 71/246 (28%), Positives = 113/246 (45%), Gaps = 37/246 (15%)
Query: 100 VDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSC 159
VDTGS ++ C GC+ C +D S+ + CS C C
Sbjct: 51 VDTGSSRTYLPCKGCASCGAHE----AGRYYDYDASADFSRVECSA--CAGIGGK----C 100
Query: 160 SPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSS 219
C Y V Y +GS + GY VRD++ L + G N++V+FGC R+ LGS
Sbjct: 101 GTSGVCRYDVHYLEGSGSEGYLVRDVVSLGGSVG-------NATVVFGCEERE---LGSI 150
Query: 220 TDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKG------GGIFAIGD----V 269
+ DG+ GFG+ +L +QLA+A + F+ C++ + GG+ +G+
Sbjct: 151 KQQSADGLFGFGRQAYALRAQLASASVIDDLFSMCVEGYEKLSGEHVGGLLTLGNFDFGA 210
Query: 270 VSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPM 329
+P + TPMV + +Y V +G + ++ +L TIIDSGT+ Y+P
Sbjct: 211 DAPALVYTPMVSSAMYYQVTTTSWTLGNSVVEGSRGVL-------TIIDSGTSYTYVPGN 263
Query: 330 LYDLVL 335
++ L
Sbjct: 264 MHARFL 269
>gi|357143850|ref|XP_003573078.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 431
Score = 90.5 bits (223), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 87/304 (28%), Positives = 131/304 (43%), Gaps = 50/304 (16%)
Query: 38 KFKAGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYY 97
+ A R R LS + R H S+ +E Y ++ +GTP +
Sbjct: 49 RRAAHRSRLRALSGYDANSPRLH-----SVQVE------------YLMELAIGTPPVPFV 91
Query: 98 VQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYP 157
DTGSDL W C C C ++DPS SST + CS C +R
Sbjct: 92 ALADTGSDLTWTQCQPCKLC-----FPQDTPVYDPSASSTFSPVPCSSATCLPVLRSR-- 144
Query: 158 SCS-PGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDL 216
+CS P C Y +Y DG+ ++G + + L + + S V FGCG GD
Sbjct: 145 NCSTPSSLCRYGYSYSDGAYSAGILGTETLTLGSSVPGQAVS--VSDVAFGCGTDNGGDS 202
Query: 217 GSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGI---FAIGDV--VS 271
+ST G +G G+ SLL+QL +F++CL + F +G + ++
Sbjct: 203 LNST-----GTVGLGRGTLSLLAQLGVG-----KFSYCLTDFFNSTLDSPFLLGTLAELA 252
Query: 272 P---KVKTTPMVP---NMPHYNVILEEVEVGGNPLDLP--TSLLGTGDERGTIIDSGTTL 323
P V++TP++ N Y V L+ + +G L +P T L G ++DSGTT
Sbjct: 253 PGPGAVQSTPLLQSPLNPSRYVVSLQGITLGDVRLPIPNKTFDLHANSTGGMVVDSGTTF 312
Query: 324 AYLP 327
+ LP
Sbjct: 313 SILP 316
>gi|2570402|gb|AAB97155.1| EEA1 [Hordeum vulgare subsp. vulgare]
Length = 410
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 77/287 (26%), Positives = 123/287 (42%), Gaps = 41/287 (14%)
Query: 66 SIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC----AGCSRCPTKS 121
+I+ L GN +P G ++ + +G P Y++ VDTGS+L W+ C GC C +
Sbjct: 23 AINFPLEGNVYP--VGHFYATLNIGEPAKPYFLDVDTGSNLTWLECHPPVHGCKGCHPRP 80
Query: 122 DLGIKLTLFDPSKSSTSG--EIACSDNFCRTTYNNR--YPSCSPG--VRCEYVVTYGDGS 175
P + G ++ C C + P CS RC Y + Y G
Sbjct: 81 P--------HPYYTPADGKLKVVCGSPLCVAVRRDVPGIPECSRNDPHRCHYEIQYVTGK 132
Query: 176 STSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANS 235
S G DII +N + FGCG +Q + S + V+GILG G +
Sbjct: 133 S-EGDLATDIISVNGRD--------KKRIAFGCGYKQE-EPPDSPPSPVNGILGLGMGKA 182
Query: 236 SLLSQLAAAGNVRKE-FAHCLDVVKGGGIFAIGDVVSPK--VKTTPMVPNMPHYNVILEE 292
+QL +++ HCL KG G+ +GD P V PM ++ +Y+ L E
Sbjct: 183 GFAAQLKGLKMIKENVIGHCLS-SKGKGVLYVGDFNPPTRGVTWAPMRESLFYYSPGLAE 241
Query: 293 VEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
V + P+ + + DSG+T ++P +Y+ ++S+ R
Sbjct: 242 VFIDKQPIRGNPTF-------EAVFDSGSTYTHVPAQIYNEIVSKVR 281
>gi|357124861|ref|XP_003564115.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 477
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 82/286 (28%), Positives = 118/286 (41%), Gaps = 33/286 (11%)
Query: 69 LELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLT 128
+ G G T Y + +GTP + +DTGSDL+W CA C C D G +
Sbjct: 80 VRTAGAGGGIVTNEYLVHLSVGTPPRPVALTLDTGSDLVWTQCAPCLNC---FDQG-AIP 135
Query: 129 LFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPG------VRCEYVVTYGDGSSTSGYFV 182
+ DP+ SST + C CR + SC G C YV YGD S T G
Sbjct: 136 VLDPAASSTHAAVRCDAPVCRAL---PFTSCGRGGSSWGERSCVYVYHYGDKSITVGKLA 192
Query: 183 RDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLA 242
D + FGCG+ G A GI GFG+ SL SQL
Sbjct: 193 SDRFTFGPGDNADGGGVSERRLTFGCGHFNKGIF----QANETGIAGFGRGRWSLPSQLG 248
Query: 243 AAGNVRKEFAHCLDVVKGGGIFAIGDVVSP-------KVKTTPMV--PNMPH-YNVILEE 292
F++C + + V+P +V++TP++ P+ P Y + L+
Sbjct: 249 V-----TSFSYCFTSMFESTSSLVTLGVAPAELHLTGQVQSTPLLRDPSQPSLYFLSLKA 303
Query: 293 VEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
+ VG + +P E IIDSG ++ LP +Y+ V ++F
Sbjct: 304 ITVGATRIPIPERRQRL-REASAIIDSGASITTLPEDVYEAVKAEF 348
>gi|357143854|ref|XP_003573079.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 417
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 79/261 (30%), Positives = 120/261 (45%), Gaps = 39/261 (14%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y ++ +GTP + DTGSDL W C C C ++DPS SST +
Sbjct: 66 YLMELAIGTPPVPFVALADTGSDLTWTQCQPCKLC-----FPQDTPVYDPSASSTFSPVP 120
Query: 143 CSDNFCRTTYNNRYPSCS-PGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLN 201
CS C T+ +R +CS P C Y+ +Y DG+ + G + + + + +
Sbjct: 121 CSSATCLPTWRSR--NCSNPSSPCRYIYSYSDGAYSVGILGTETLTIGSSVPGQTVS--V 176
Query: 202 SSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGG 261
SV FGCG GD +ST G +G G+ SLL+QL +F++CL
Sbjct: 177 GSVAFGCGTDNGGDSLNST-----GTVGLGRGTLSLLAQLGVG-----KFSYCLTDFFNS 226
Query: 262 GI---FAIGDV--VSP---KVKTTPMVP---NMPHYNVILEEVEVGGNPLDLPTSLLGTG 310
+ F +G + ++P V++TP++ N Y V L+ + +G L +P GT
Sbjct: 227 TMDSPFFLGTLAELAPGPGTVQSTPLLQSPLNPSRYFVNLQGISLGDVRLPIPN---GTF 283
Query: 311 DER-----GTIIDSGTTLAYL 326
D R G ++DSGTT L
Sbjct: 284 DLRADGNGGMMVDSGTTFTIL 304
>gi|255576511|ref|XP_002529147.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223531426|gb|EEF33260.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 479
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 79/273 (28%), Positives = 121/273 (44%), Gaps = 36/273 (13%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPS 133
+G +G YF++VG+G P+ Y+ +DTGSD+ W+ CA C+ C ++D +F+P+
Sbjct: 135 SGTSQGSGEYFSRVGIGKPSSPVYMVLDTGSDVNWIQCAPCADCYHQAD-----PIFEPA 189
Query: 134 KSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASG 193
S++ ++C C++ C C Y V+YGDGS T G FV + I L AS
Sbjct: 190 SSTSYSPLSCDTKQCQSL---DVSECRNNT-CLYEVSYGDGSYTVGDFVTETITLGSASV 245
Query: 194 NLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAH 253
+ +V GCG+ G + G S SQ+ A+ F++
Sbjct: 246 D--------NVAIGCGHNNEGLFIGAAGLLGLGGGKL-----SFPSQINASS-----FSY 287
Query: 254 CL--DVVKGGGIFAIGDVVSPKVKTTPMVPNMP---HYNVILEEVEVGGNPLDLPTSLLG 308
CL + P T P++ N Y V + + VGG L +P S+
Sbjct: 288 CLVDRDSDSASTLEFNSALLPHAITAPLLRNRELDTFYYVGMTGLSVGGELLSIPESMFE 347
Query: 309 TGDERGT---IIDSGTTLAYLPPMLYDLVLSQF 338
DE G IIDSGT + L Y+ + F
Sbjct: 348 M-DESGNGGIIIDSGTAVTRLQTAAYNALRDAF 379
>gi|297842525|ref|XP_002889144.1| hypothetical protein ARALYDRAFT_476912 [Arabidopsis lyrata subsp.
lyrata]
gi|297334985|gb|EFH65403.1| hypothetical protein ARALYDRAFT_476912 [Arabidopsis lyrata subsp.
lyrata]
Length = 467
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 83/291 (28%), Positives = 129/291 (44%), Gaps = 29/291 (9%)
Query: 54 QHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC-A 112
Q ++ R+ +S+ + GN +P G Y+ + +G P + + +DTGSDL WV C A
Sbjct: 41 QQVKLQNRRLGSSVVFPVSGNVYP--LGYYYVLLNIGNPPKLFDLDIDTGSDLTWVQCDA 98
Query: 113 GCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCR--TTYNNRYPSCSPGVRCEYVVT 170
C+ C TK + + P+ ++ + CS C NR P P +C+Y +
Sbjct: 99 PCNGC-TKP----RAKQYKPNHNT----LPCSHLLCSGLDLTQNR-PCDDPEDQCDYEIG 148
Query: 171 YGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGF 230
Y D +S+ G V D L A+G++ +N + FGCG Q + G GILG
Sbjct: 149 YSDHASSIGALVTDEFPLKLANGSI----MNPHLTFGCGYDQQ-NPGPHPPPPTAGILGL 203
Query: 231 GQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSPK--VKTTPMVPNMPHYNV 288
G+ + +QL + G + HCL G G +IGD + P V T + N N
Sbjct: 204 GRGKVGISTQLKSLGITKNVIVHCLSHT-GKGFLSIGDELVPSSGVTWTSLATNSASKNY 262
Query: 289 ILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
+ E+ N D T + G + DSG++ Y Y +L R
Sbjct: 263 MTGPAELLFN--DKTTGVKGI----NVVFDSGSSYTYFNAEAYQAILDLIR 307
>gi|297805036|ref|XP_002870402.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297316238|gb|EFH46661.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 435
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 83/259 (32%), Positives = 119/259 (45%), Gaps = 29/259 (11%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGE 140
G Y + LGTP DTGS+L+W C C C T+ D LFDP SST +
Sbjct: 92 GEYLMNLSLGTPPSPIMAVADTGSNLIWTQCKPCDDCYTQVD-----PLFDPKASSTYKD 146
Query: 141 IACSDNFCRTTYNNRYPSCSPGVR-CEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAP 199
++CS + C N SCS + C Y+V+Y DGS T G F D + L G+ P
Sbjct: 147 VSCSSSQCTALENQA--SCSTEDKTCSYLVSYADGSYTMGKFAVDTLTL----GSTDNRP 200
Query: 200 LN-SSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL--D 256
+ ++I GCG + + + V G SL+ QL ++ +F++CL +
Sbjct: 201 VQLKNIIIGCGQNNAVTFRNKSSGVVGL----GGGAVSLIKQL--GDSIDGKFSYCLVPE 254
Query: 257 VVKGGGI-FAIGDVVS-PKVKTTPMVPNM--PHYNVILEEVEVGGNPLDLPTSLLGTGDE 312
+ I F VVS P +TP+V Y + L+ + VG + P S + +
Sbjct: 255 NDQTSKINFGTNAVVSGPGTVSTPLVVKSRDTFYYLTLKSISVGSKNMQTPDSNI----K 310
Query: 313 RGTIIDSGTTLAYLPPMLY 331
+IDSGTTL LP Y
Sbjct: 311 GNMVIDSGTTLTLLPVKYY 329
>gi|125543634|gb|EAY89773.1| hypothetical protein OsI_11315 [Oryza sativa Indica Group]
Length = 474
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 79/279 (28%), Positives = 119/279 (42%), Gaps = 42/279 (15%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y + +GTP + +DTGSDL W CA C C +S L F+PS+S T +
Sbjct: 111 YLVHMAIGTPPQPVQLILDTGSDLTWTQCAPCVSCFRQS-----LPRFNPSRSMTFSVLP 165
Query: 143 CSDNFCRTTYNNRYPSCSP-----GVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKT 197
C CR + + SC G+ C Y Y D S T+G+ D A +
Sbjct: 166 CDLRICR---DLTWSSCGEQSWGNGI-CVYAYAYADHSITTGHLDSDTFSFASADHAIGG 221
Query: 198 APLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDV 257
A + + FGCG +G S+ GI GF + S+ +QL F++C
Sbjct: 222 ASV-PDLTFGCGLFNNGIFVSNE----TGIAGFSRGALSMPAQLKV-----DNFSYCFTA 271
Query: 258 VKGGGIFAIGDVVSPK------------VKTTPMV----PNMPHYNVILEEVEVGGNPLD 301
+ G + V P V++T ++ + Y + L+ V VG L
Sbjct: 272 ITGSEPSPVFLGVPPNLYSDAAGGGHGVVQSTALIRYHSSQLKAYYISLKGVTVGTTRLP 331
Query: 302 LPTSLLGTGDE--RGTIIDSGTTLAYLPPMLYDLVLSQF 338
+P S+ ++ GTI+DSGT + LP +Y+LV F
Sbjct: 332 IPESVFALKEDGTGGTIVDSGTGMTMLPEAVYNLVCDAF 370
>gi|224080963|ref|XP_002306246.1| predicted protein [Populus trichocarpa]
gi|222855695|gb|EEE93242.1| predicted protein [Populus trichocarpa]
Length = 382
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 78/277 (28%), Positives = 125/277 (45%), Gaps = 40/277 (14%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPS 133
+G +G YF ++G+G+P Y+ +D+GSD++WV C C++C ++D LFDP+
Sbjct: 34 SGMDQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCKPCTQCYHQTD-----PLFDPA 88
Query: 134 KSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASG 193
S++ ++CS C N C+ G RC Y V+YGDGSST G + + L +
Sbjct: 89 DSASFMGVSCSSAVCDQVDN---AGCNSG-RCRYEVSYGDGSSTKGTLALETLTLGRT-- 142
Query: 194 NLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLA-AAGNVRKEFA 252
+ +V GCG+ G + G + S + QL+ GN F+
Sbjct: 143 ------VVQNVAIGCGHMNQGMFVGAAGLLGL-----GGGSMSFVGQLSRERGNA---FS 188
Query: 253 HCL--DVVKGGGIFAIGDVVSP-KVKTTPMV--PNMPHYNVI-LEEVEVGGNPLDLPTSL 306
+CL V G G P P++ P+ P Y I L + VG + + +
Sbjct: 189 YCLVSRVTNSNGFLEFGSEAMPVGAAWIPLIRNPHSPSYYYIGLSGLGVGDMKVPISEDI 248
Query: 307 -----LGTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
LG G G ++D+GT + P + Y+ F
Sbjct: 249 FELTELGNG---GVVMDTGTAVTRFPTVAYEAFRDAF 282
>gi|356498789|ref|XP_003518231.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 446
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 81/296 (27%), Positives = 122/296 (41%), Gaps = 37/296 (12%)
Query: 44 ERERTLSALKQHDT----RRHGRMMASIDLELGGNGHPSATGLYF-TKVGLGTPTDEYYV 98
E +LS DT H + + + N PS + F +G P
Sbjct: 49 HHESSLSPYNSKDTIWDHYSHKILKQTFSNDYISNLVPSPRYVVFLMNFSIGEPPIPQLA 108
Query: 99 QVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSD-NFCRTTYNNRYP 157
+DTGS L WV C CS C +S + +FDPSKSST ++CS+ N C
Sbjct: 109 VMDTGSSLTWVMCHPCSSCSQQS-----VPIFDPSKSSTYSNLSCSECNKCDVV------ 157
Query: 158 SCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLG 217
C Y V Y S+ G + R+ + L ++ P S+IFGCG + S
Sbjct: 158 ----NGECPYSVEYVGSGSSQGIYAREQLTLETIDESIIKVP---SLIFGCGRKFSISSN 210
Query: 218 SSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGI----FAIGDVVSPK 273
++G+ G G SLL K+F++C+ ++ +GD + +
Sbjct: 211 GYPYQGINGVFGLGSGRFSLLPSFG------KKFSYCIGNLRNTNYKFNRLVLGDKANMQ 264
Query: 274 VKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLG---TGDERGTIIDSGTTLAYL 326
+T + Y V LE + +GG LD+ +L T + G IIDSG +L
Sbjct: 265 GDSTTLNVINGLYYVNLEAISIGGRKLDIDPTLFERSITDNNSGVIIDSGADHTWL 320
>gi|356526294|ref|XP_003531753.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 414
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 92/317 (29%), Positives = 143/317 (45%), Gaps = 49/317 (15%)
Query: 44 ERERTLSALKQHDTRRHGRMMASI-DLELGGNGHPSATGL------YFTKVGLGTPTDEY 96
+++ L L+ + R +AS ++E P ++G+ Y +GLG+
Sbjct: 19 QKQLILDDLRVRSMQNRIRRVASTHNVEASQTQIPLSSGINLQTLNYIVTMGLGS--KNM 76
Query: 97 YVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRT----TY 152
V +DTGSDL WV C C C + +F PS SS+ ++C+ + C++ T
Sbjct: 77 TVIIDTGSDLTWVQCEPCMSCYNQQG-----PIFKPSTSSSYQSVSCNSSTCQSLQFATG 131
Query: 153 NNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQ 212
N S C YVV YGDGS T+G + + S S +FGCG
Sbjct: 132 NTGACGSSNPSTCNYVVNYGDGSYTNGELGVEALSFGGVS--------VSDFVFGCGRNN 183
Query: 213 SGDLGSSTDAAVDGILGFGQANSSLLSQLAAA-GNVRKEFAHCLDVVKGG--GIFAIGDV 269
G G V G++G G++ SL+SQ A G V F++CL + G G +G+
Sbjct: 184 KGLFG-----GVSGLMGLGRSYLSLVSQTNATFGGV---FSYCLPTTEAGSSGSLVMGNE 235
Query: 270 VSPKVKTTPMV-------PNMPHYNVI-LEEVEVGGNPLDLPTSLLGTGDERGTIIDSGT 321
S P+ P + ++ ++ L ++VGG L P S G G G +IDSGT
Sbjct: 236 SSVFKNANPITYTRMLSNPQLSNFYILNLTGIDVGGVALKAPLS-FGNG---GILIDSGT 291
Query: 322 TLAYLPPMLYDLVLSQF 338
+ LP +Y + ++F
Sbjct: 292 VITRLPSSVYKALKAEF 308
>gi|115434442|ref|NP_001041979.1| Os01g0140100 [Oryza sativa Japonica Group]
gi|12328547|dbj|BAB21205.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
Group]
gi|113531510|dbj|BAF03893.1| Os01g0140100 [Oryza sativa Japonica Group]
gi|125568961|gb|EAZ10476.1| hypothetical protein OsJ_00309 [Oryza sativa Japonica Group]
gi|215697206|dbj|BAG91200.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 504
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 81/267 (30%), Positives = 119/267 (44%), Gaps = 34/267 (12%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSG 139
+G YF++VG+G+P + Y+ +DTGSD+ WV C C+ C +SD +FDPS S++
Sbjct: 164 SGEYFSRVGVGSPARQLYMVLDTGSDVTWVQCQPCADCYQQSD-----PVFDPSLSTSYA 218
Query: 140 EIACSDNFCRTTYNNRYPSCSPGV-RCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTA 198
+AC + C ++ +C C Y V YGDGS T G F + + L +A
Sbjct: 219 SVACDNPRC---HDLDAAACRNSTGACLYEVAYGDGSYTVGDFATETLTLG------DSA 269
Query: 199 PLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL--D 256
P+ SSV GCG+ G + G S SQ++A F++CL
Sbjct: 270 PV-SSVAIGCGHDNEGLFVGAAGLLALGGGPL-----SFPSQISA-----TTFSYCLVDR 318
Query: 257 VVKGGGIFAIGDVVSPKVKTTPMVPN---MPHYNVILEEVEVGGNPLDLPTSLLGTGD-- 311
GD +V T P++ + Y V L + VGG L +P S
Sbjct: 319 DSPSSSTLQFGDAADAEV-TAPLIRSPRTSTFYYVGLSGLSVGGQILSIPPSAFAMDSTG 377
Query: 312 ERGTIIDSGTTLAYLPPMLYDLVLSQF 338
G I+DSGT + L Y + F
Sbjct: 378 AGGVIVDSGTAVTRLQSSAYAALRDAF 404
>gi|115452683|ref|NP_001049942.1| Os03g0317300 [Oryza sativa Japonica Group]
gi|108707833|gb|ABF95628.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548413|dbj|BAF11856.1| Os03g0317300 [Oryza sativa Japonica Group]
Length = 448
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 79/279 (28%), Positives = 119/279 (42%), Gaps = 42/279 (15%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y + +GTP + +DTGSDL W CA C C +S L F+PS+S T +
Sbjct: 85 YLVHMAIGTPPQPVQLILDTGSDLTWTQCAPCVSCFRQS-----LPRFNPSRSMTFSVLP 139
Query: 143 CSDNFCRTTYNNRYPSCSP-----GVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKT 197
C CR + + SC G+ C Y Y D S T+G+ D A +
Sbjct: 140 CDLRICR---DLTWSSCGEQSWGNGI-CVYAYAYADHSITTGHLDSDTFSFASADHAIGG 195
Query: 198 APLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDV 257
A + + FGCG +G S+ GI GF + S+ +QL F++C
Sbjct: 196 ASV-PDLTFGCGLFNNGIFVSNE----TGIAGFSRGALSMPAQLKV-----DNFSYCFTA 245
Query: 258 VKGGGIFAIGDVVSPK------------VKTTPMV----PNMPHYNVILEEVEVGGNPLD 301
+ G + V P V++T ++ + Y + L+ V VG L
Sbjct: 246 ITGSEPSPVFLGVPPNLYSDAAGGGHGVVQSTALIRYHSSQLKAYYISLKGVTVGTTRLP 305
Query: 302 LPTSLLGTGDE--RGTIIDSGTTLAYLPPMLYDLVLSQF 338
+P S+ ++ GTI+DSGT + LP +Y+LV F
Sbjct: 306 IPESVFALKEDGTGGTIVDSGTGMTMLPEAVYNLVCDAF 344
>gi|125586057|gb|EAZ26721.1| hypothetical protein OsJ_10629 [Oryza sativa Japonica Group]
Length = 474
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 79/279 (28%), Positives = 119/279 (42%), Gaps = 42/279 (15%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y + +GTP + +DTGSDL W CA C C +S L F+PS+S T +
Sbjct: 111 YLVHMAIGTPPQPVQLILDTGSDLTWTQCAPCVSCFRQS-----LPRFNPSRSMTFSVLP 165
Query: 143 CSDNFCRTTYNNRYPSCSP-----GVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKT 197
C CR + + SC G+ C Y Y D S T+G+ D A +
Sbjct: 166 CDLRICR---DLTWSSCGEQSWGNGI-CVYAYAYADHSITTGHLDSDTFSFASADHAIGG 221
Query: 198 APLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDV 257
A + + FGCG +G S+ GI GF + S+ +QL F++C
Sbjct: 222 ASV-PDLTFGCGLFNNGIFVSNE----TGIAGFSRGALSMPAQLKV-----DNFSYCFTA 271
Query: 258 VKGGGIFAIGDVVSPK------------VKTTPMV----PNMPHYNVILEEVEVGGNPLD 301
+ G + V P V++T ++ + Y + L+ V VG L
Sbjct: 272 ITGSEPSPVFLGVPPNLYSDAAGGGHGVVQSTALIRYHSSQLKAYYISLKGVTVGTTRLP 331
Query: 302 LPTSLLGTGDE--RGTIIDSGTTLAYLPPMLYDLVLSQF 338
+P S+ ++ GTI+DSGT + LP +Y+LV F
Sbjct: 332 IPESVFALKEDGTGGTIVDSGTGMTMLPEAVYNLVCDAF 370
>gi|224067990|ref|XP_002302634.1| predicted protein [Populus trichocarpa]
gi|222844360|gb|EEE81907.1| predicted protein [Populus trichocarpa]
Length = 490
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 83/283 (29%), Positives = 124/283 (43%), Gaps = 34/283 (12%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPS 133
+G +G YFT++G+GTP ++ +DTGSD++W+ CA C +C +++D +F+P+
Sbjct: 138 SGLAQGSGEYFTRLGVGTPARYVFMVLDTGSDVVWIQCAPCKKCYSQTD-----PVFNPT 192
Query: 134 KSSTSGEIACSDNFCRTTYNNRYPSCSPGVR-CEYVVTYGDGSSTSGYFVRDIIQLNQAS 192
KS + I C CR + P CS C Y V+YGDGS T G F + +
Sbjct: 193 KSRSFANIPCGSPLCRRLDS---PGCSTKKHICLYQVSYGDGSFTYGEFSTETLTFRGTR 249
Query: 193 GNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFA 252
V GCG+ G + G+ S SQ+ ++F+
Sbjct: 250 VG--------RVALGCGHDNEGLFIGAAGLLGL-----GRGRLSFPSQIGR--RFSRKFS 294
Query: 253 HCL---DVVKGGGIFAIGD-VVSPKVKTTPMVPNMP---HYNVILEEVEVGGNPLDLPTS 305
+CL GD +S + TP+V N Y V L V VGG + T+
Sbjct: 295 YCLVDRSASSKPSYMVFGDSAISRTARFTPLVSNPKLDTFYYVELLGVSVGGTRVPGITA 354
Query: 306 LLGTGDER---GTIIDSGTTLAYLPPMLYDLVLSQFRFWIASL 345
L D G IIDSGT++ L Y + FR ++L
Sbjct: 355 SLFKLDSTGNGGVIIDSGTSVTRLTRPAYVALRDAFRVGASNL 397
>gi|413918484|gb|AFW58416.1| hypothetical protein ZEAMMB73_998053 [Zea mays]
Length = 475
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 86/277 (31%), Positives = 119/277 (42%), Gaps = 41/277 (14%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGE 140
G + + +GTP Y VDTGSDL+W C C C ++ +FDP+ SST
Sbjct: 114 GEFLMDLSVGTPALPYAAIVDTGSDLVWTQCKPCVECFNQTT-----PVFDPAASSTYAA 168
Query: 141 IACSDNFCRTTYNNRYPSCSPGVRCE----YVVTYGDGSSTSGYFVRDIIQLNQASGNLK 196
+ CS C + S S Y TYGD SST G + L + +
Sbjct: 169 LPCSSALCADLPTSTCASSSSSSSASSPCGYTYTYGDASSTQGVLATETFTLAR-----Q 223
Query: 197 TAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHC-- 254
P V FGCG+ GD G + A G++G G+ SL+SQL F++C
Sbjct: 224 KVP---GVAFGCGDTNEGD-GFTQGA---GLVGLGRGPLSLVSQLGI-----DRFSYCLT 271
Query: 255 -LDVVKGGGIFAI-------GDVVSPKVKTTPMV--PNMP-HYNVILEEVEVGGNPLDLP 303
LD G + + +TTP+V P+ P Y V L + VG L LP
Sbjct: 272 SLDDAAGRSPLLLGSAAGISASAATAPAQTTPLVKNPSQPSFYYVSLTGLTVGSTRLALP 331
Query: 304 TSLLGTGDE--RGTIIDSGTTLAYLPPMLYDLVLSQF 338
+S D+ G I+DSGT++ YL Y + F
Sbjct: 332 SSAFAIQDDGTGGVIVDSGTSITYLELRAYRALRKAF 368
>gi|328875414|gb|EGG23778.1| putative aspartyl protease [Dictyostelium fasciculatum]
Length = 507
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 87/291 (29%), Positives = 139/291 (47%), Gaps = 45/291 (15%)
Query: 71 LGGNGHPSATGLYF---TKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKL 127
L G + TG F T++ +G T + VQVDTGS L+ + GC+ C +
Sbjct: 107 LSGKVNQPMTGDLFQINTQIIVGNTT--FLVQVDTGSLLMAIPLEGCNTCVESRPV---- 160
Query: 128 TLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCS---PGVRCEYVVTYGDGSSTSGYFVRD 184
+ PS STS ++ACS + C+ + + PSCS G C++ + YGDGS SGY D
Sbjct: 161 --YHPS--STSTKVACSSDQCKGS-GSTPPSCSRTSSGESCDFQIRYGDGSHVSGYIYED 215
Query: 185 IIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLL----SQ 240
++ L A L FG + ++GD DGI+GFG+ SS +
Sbjct: 216 VVNL---------AGLQGKANFGANDEETGDFEY---PRADGIIGFGRTCSSCVPTVWDS 263
Query: 241 LAAAGNVRKEFAHCLDVVKGGGIFAIGDV----VSPKVKTTPMV-PNMPHYNVILEEVEV 295
L + ++ +F L+ +GGG ++G++ + ++ TP+V N P Y+V + +
Sbjct: 264 LVSDLGLKNQFGMLLN-YEGGGSLSLGEINTSYYTGDIRYTPLVQKNTPFYSVKSTGIRI 322
Query: 296 GGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFRFWIASLD 346
N +P S LG + I+DSG+T L YD + + F+ S+
Sbjct: 323 --NDYTIPGSKLG----QEVIVDSGSTALSLASGAYDQLRNYFQTHYCSIQ 367
>gi|362799904|dbj|BAL41445.1| aspartyl protease 1 [Linum grandiflorum]
Length = 449
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 84/282 (29%), Positives = 121/282 (42%), Gaps = 49/282 (17%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGE 140
G Y + +GTP DTGSDL W+ C +C + K +FDPS S+T +
Sbjct: 78 GEYMMNLSIGTPPFPILAIADTGSDLTWLQSKPCDQCYPQ-----KGPIFDPSNSTTFHK 132
Query: 141 IACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPL 200
+ C+ C + SC+ C Y +YGD S T+GY D + + AS ++
Sbjct: 133 LPCTTAPCNA-LDESARSCTDPTTCGYTYSYGDHSYTTGYLASDTVTVGNASVQIR---- 187
Query: 201 NSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL----- 255
+V FGCG R G+ D GI+G G N S +SQL + K+F++CL
Sbjct: 188 --NVAFGCGTRNGGNF----DEQGSGIVGLGGGNLSFVSQL--GDTIGKKFSYCLLPLEN 239
Query: 256 -------------DVVKGGG-IFAIGDVVSPKVKTTPMVPNMP--HYNVILEEVEVGGNP 299
+V G +F+ TTP+V P +Y + +E + VG
Sbjct: 240 EISSQPSDSPATSRIVFGDNPVFSSSSTNGVVFATTPLVNKEPSTYYYLTIEAITVGRKK 299
Query: 300 LDLPTSLLGTG----------DERGTIIDSGTTLAYLPPMLY 331
L +S T +E IIDSGTTL +L Y
Sbjct: 300 LLYSSSSSKTASYDSGSKSSVEEGNIIIDSGTTLTFLEEEFY 341
>gi|357127503|ref|XP_003565419.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
distachyon]
Length = 486
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 83/278 (29%), Positives = 119/278 (42%), Gaps = 43/278 (15%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y + +GTP DTGSDL+WV C G + + F PS SST G +
Sbjct: 110 YLMAIEVGTPPVRVLAIADTGSDLVWVKCKG--KDNDNNSTAPPSVYFVPSASSTYGRVG 167
Query: 143 CSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLN- 201
C CR + SCSP CEY+ +YGDGS SG + + + + KT
Sbjct: 168 CDTKACRAL--SSAASCSPDGSCEYLYSYGDGSRASGQLSTETFTFSTIADSSKTNSHGN 225
Query: 202 -------------SSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVR 248
+ + FGC +G + DG++G G SL SQL A ++
Sbjct: 226 NNNNSSSHGQVEIAKLDFGCSTTTTGTFRA------DGLVGLGGGPVSLASQLGATTSLG 279
Query: 249 KEFAHCLDVVKGGGI-----FAIGDVVS-PKVKTTPMVPN--MPHYNVILEEVEVGGNPL 300
++F++CL F VVS P +TP++ +Y + L+ + V G
Sbjct: 280 RKFSYCLAPYANTNASSALNFGSRAVVSEPGAASTPLITGEVETYYTIALDSINVAGT-- 337
Query: 301 DLPTSLLGTGDERGTIIDSGTTLAY-----LPPMLYDL 333
PT T + I+DSGTTL Y L P++ DL
Sbjct: 338 KRPT----TAAQAHIIVDSGTTLTYLDSALLTPLVKDL 371
>gi|115465373|ref|NP_001056286.1| Os05g0557100 [Oryza sativa Japonica Group]
gi|113579837|dbj|BAF18200.1| Os05g0557100 [Oryza sativa Japonica Group]
gi|125553268|gb|EAY98977.1| hypothetical protein OsI_20935 [Oryza sativa Indica Group]
Length = 494
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 85/303 (28%), Positives = 121/303 (39%), Gaps = 38/303 (12%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLT----- 128
+G + TG YF + +GTP + + DTGSDL WV C G + P+ +
Sbjct: 101 SGAYTGTGQYFVRFRVGTPAQPFVLIADTGSDLTWVKCRGAAS-PSHATATASPAAAPSP 159
Query: 129 ------LFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPG-VRCEYVVTYGDGSSTSGYF 181
+F P S T I CS C++T +CS C Y Y D S+ G
Sbjct: 160 AVAPPRVFRPGDSKTWSPIPCSSETCKSTIPFSLANCSSSTAACSYDYRYNDNSAARGVV 219
Query: 182 VRD--IIQLNQASGNLKTAPLNSS---VIFGCGNRQSGDLGSSTDAAVDGILGFGQANSS 236
D + L+ G + V+ GC +G A DG+L G +N S
Sbjct: 220 GTDSATVALSGGRGGGGGGDRKAKLQGVVLGCTTAHAGQ----GFEASDGVLSLGYSNIS 275
Query: 237 LLSQLAAAGNVRKEFAHCL-----------DVVKGGGIFAIGDVVSPKVKTTPMVPNM-- 283
S+ AA F++CL + G G A TP++ +
Sbjct: 276 FASR--AASRFGGRFSYCLVDHLAPRNATSYLTFGAGPDAASSSAPAPGSRTPLLLDARV 333
Query: 284 -PHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFRFWI 342
P Y V ++ V V G LD+P + G GTIIDSGT+L L Y V++ +
Sbjct: 334 RPFYAVAVDSVSVDGVALDIPAEVWDVGSNGGTIIDSGTSLTVLATPAYKAVVAALSEQL 393
Query: 343 ASL 345
A L
Sbjct: 394 AGL 396
>gi|297848386|ref|XP_002892074.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297337916|gb|EFH68333.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 485
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 91/287 (31%), Positives = 127/287 (44%), Gaps = 42/287 (14%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPS 133
+G +G YFT++G+GTP Y+ +DTGSD++W+ CA C RC ++SD +FDP
Sbjct: 133 SGLSQGSGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSD-----PIFDPR 187
Query: 134 KSSTSGEIACSDNFCRTTYNNRYPSCSPGVR---CEYVVTYGDGSSTSGYFVRDIIQL-- 188
KS T I CS CR R S R C Y V+YGDGS T G F + +
Sbjct: 188 KSKTYATIPCSSPHCR-----RLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRR 242
Query: 189 NQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVR 248
N+ G V GCG+ G G+LG G+ S Q N
Sbjct: 243 NRVKG----------VALGCGHDNEGLF-----VGAAGLLGLGKGKLSFPGQTGHRFN-- 285
Query: 249 KEFAHCL----DVVKGGGIFAIGDVVSPKVKTTPMVPNMP---HYNVILEEVEVGGNPLD 301
++F++CL K + VS + TP++ N Y V L + VGG +
Sbjct: 286 QKFSYCLVDRSASSKPSSVVFGNAAVSRIARFTPLLSNPKLDTFYYVELLGISVGGTRVP 345
Query: 302 LPTSLLGTGDER---GTIIDSGTTLAYLPPMLYDLVLSQFRFWIASL 345
+ L D+ G IIDSGT++ L Y + FR +L
Sbjct: 346 GVAASLFKLDQIGNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKAL 392
>gi|223949775|gb|ACN28971.1| unknown [Zea mays]
gi|414590177|tpg|DAA40748.1| TPA: hypothetical protein ZEAMMB73_257146 [Zea mays]
Length = 510
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 90/297 (30%), Positives = 130/297 (43%), Gaps = 44/297 (14%)
Query: 62 RMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKS 121
RM+A+++ +G +G Y V +GTP + + +DTGSDL W+ CA C C
Sbjct: 133 RMVATVE-----SGVAVGSGEYLIDVYVGTPPRRFRMIMDTGSDLNWLQCAPCLDC---- 183
Query: 122 DLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYP-SCSPGVR--CEYVVTYGDGSSTS 178
+ +FDP+ SS+ + C D C P +C C Y YGD S+T+
Sbjct: 184 -FEQRGPVFDPAASSSYRNVTCGDQRCGLVAPPEAPRACRRPAEDSCPYYYWYGDQSNTT 242
Query: 179 GYFVRDIIQLNQASGNLKTAPLNSS----VIFGCGNRQSGDLGSSTDAAVDGILGFGQAN 234
G + +N TAP S V+FGCG+R G + G+
Sbjct: 243 GDLALESFTVNL------TAPGASRRVDGVVFGCGHRNRGLFHGAAGLLGL-----GRGP 291
Query: 235 SSLLSQLAAAGNVRKEFAHCLDVVKG---GGIFAIGD----VVSPKVKTTPMVPNMP--- 284
S SQL A F++CL V G G G+ + P++K T P
Sbjct: 292 LSFASQLRAVYG--HTFSYCL-VEHGSDAGSKVVFGEDYLVLAHPQLKYTAFAPTSSPAD 348
Query: 285 -HYNVILEEVEVGGNPLDLPTSLLGTGDE--RGTIIDSGTTLAYLPPMLYDLVLSQF 338
Y V L+ V VGG+ L++ + G + GTIIDSGTTL+Y Y ++ F
Sbjct: 349 TFYYVKLKGVLVGGDLLNISSDTWDVGKDGSGGTIIDSGTTLSYFVEPAYQVIRQAF 405
>gi|224067042|ref|XP_002302336.1| predicted protein [Populus trichocarpa]
gi|222844062|gb|EEE81609.1| predicted protein [Populus trichocarpa]
Length = 438
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 85/269 (31%), Positives = 120/269 (44%), Gaps = 39/269 (14%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y +V LGTP + ++ +DT +D WV C+GC+ C + T F P+ S+T G +
Sbjct: 98 YVVRVKLGTPGQQMFMVLDTSNDAAWVPCSGCTGCSS--------TTFLPNASTTLGSLD 149
Query: 143 CSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNS 202
CS C P+ C + +YG SS + V+D I L +
Sbjct: 150 CSGAQCSQVRGFSCPATGSSA-CLFNQSYGGDSSLTATLVQDAITLAND--------VIP 200
Query: 203 SVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKG-- 260
FGC N SG G+LG G+ SL+SQ A F++CL K
Sbjct: 201 GFTFGCINAVSGG-----SIPPQGLLGLGRGPISLISQ--AGAMYSGVFSYCLPSFKSYY 253
Query: 261 -GGIFAIGDVVSPK-VKTTPMVPNMPH----YNVILEEVEVGGNPLDLPTSLL----GTG 310
G +G V PK ++TTP++ N PH Y V L V VG + +P+ L TG
Sbjct: 254 FSGSLKLGPVGQPKSIRTTPLLRN-PHRPSLYYVNLTGVSVGRIKVPIPSEQLVFDPNTG 312
Query: 311 DERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
GTIIDSGT + +Y + +FR
Sbjct: 313 --AGTIIDSGTVITRFVQPVYFAIRDEFR 339
>gi|357127293|ref|XP_003565317.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial
[Brachypodium distachyon]
Length = 540
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 83/274 (30%), Positives = 125/274 (45%), Gaps = 41/274 (14%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSG 139
+G YF+++G+G+P + Y+ +DTGSD+ W+ CA C+ C +SD LFDP+ SS+
Sbjct: 193 SGEYFSRIGIGSPARQLYMVLDTGSDVTWLQCAPCADCYAQSD-----PLFDPALSSSYA 247
Query: 140 EIACSDNFCR-----TTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGN 194
+ C CR +NN + + C Y V YGDGS T G F + + L G
Sbjct: 248 TVPCDSPHCRALDASACHNN---AANGNSSCVYEVAYGDGSYTVGDFATETLTL----GG 300
Query: 195 LKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHC 254
+A ++ V GCG+ G + G S SQ++A EF++C
Sbjct: 301 DGSAAVH-DVAIGCGHDNEGLFVGAAGLLALGGGPL-----SFPSQISA-----TEFSYC 349
Query: 255 L---DVVKGGGI-FAIGD--VVSPKVKTTPMVPNMPHYNVILEEVEVGGNPL-DLPTSLL 307
L D + F D V+ + +P Y V L + VGG L D+P +
Sbjct: 350 LVDRDSPSASTLQFGASDSSTVTAPLMRSPRSNTF--YYVALNGISVGGETLSDIPPAAF 407
Query: 308 GTGDERGT---IIDSGTTLAYLPPMLYDLVLSQF 338
DE+G+ I+DSGT + L Y + F
Sbjct: 408 AM-DEQGSGGVIVDSGTAVTRLQSSAYSALRDAF 440
>gi|255558694|ref|XP_002520371.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
gi|223540418|gb|EEF41987.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
communis]
Length = 557
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 85/296 (28%), Positives = 133/296 (44%), Gaps = 35/296 (11%)
Query: 61 GRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTK 120
G++MA+++ +G +G YF V +GTP + + +DTGSDL W+ C C C +
Sbjct: 175 GQLMATLE-----SGVSLGSGEYFMDVFIGTPPRHFSLILDTGSDLNWIQCVPCYDCFVQ 229
Query: 121 SDLGIKLTLFDPSKSSTSGEIACSDNFCR--TTYNNRYPSCSPGVRCEYVVTYGDGSSTS 178
+ +DP +SS+ I C D C ++ + P + C Y YGD S+T+
Sbjct: 230 NG-----PYYDPKESSSFKNIGCHDPRCHLVSSPDPPQPCKAENQTCPYFYWYGDSSNTT 284
Query: 179 GYFVRDIIQLNQASGNLKTA-PLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSL 237
G F + +N S K+ +V+FGCG+ G + G+ S
Sbjct: 285 GDFALETFTVNLTSPAGKSEFKRVENVMFGCGHWNRGLFHGAAGLLGL-----GRGPLSF 339
Query: 238 LSQLAAAGNVRKEFAHCL------DVVKGGGIF-AIGDVVS-PKVKTTPMV-----PNMP 284
SQL + F++CL V IF D+++ P+V T +V P
Sbjct: 340 SSQLQSL--YGHSFSYCLVDRNSDTNVSSKLIFGEDKDLLNHPEVNFTSLVAGKENPVDT 397
Query: 285 HYNVILEEVEVGGNPLDLPTSLLGTGDE--RGTIIDSGTTLAYLPPMLYDLVLSQF 338
Y V ++ + VGG L +P E GTI+DSGTTL+Y Y+++ F
Sbjct: 398 FYYVQIKSIMVGGEVLKIPEETWHLSPEGAGGTIVDSGTTLSYFAEPSYEIIKDAF 453
>gi|356537728|ref|XP_003537377.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 543
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 82/302 (27%), Positives = 135/302 (44%), Gaps = 40/302 (13%)
Query: 61 GRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTK 120
G +MA+++ +G TG YF + +GTP ++ +DTGSDL W+ C C C +
Sbjct: 154 GNIMATLE-----SGASLGTGEYFLDMFVGTPPKHVWLILDTGSDLSWIQCDPCYDCFEQ 208
Query: 121 SDLGIKLTLFDPSKSSTSGEIACSDNFCR-TTYNNRYPSC-SPGVRCEYVVTYGDGSSTS 178
+ + + P SST I+C D C+ + ++ C + C Y Y DGS+T+
Sbjct: 209 NG-----SHYYPKDSSTYRNISCYDPRCQLVSSSDPLQHCKAENQTCPYFYDYADGSNTT 263
Query: 179 GYFVRDIIQLNQASGNLKTAPLN-SSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSL 237
G F + +N N K V+FGCG+ G ++ G+LG G+ S
Sbjct: 264 GDFASETFTVNLTWPNGKEKFKQVVDVMFGCGHWNKGFFYGAS-----GLLGLGRGPISF 318
Query: 238 LSQLAAAGNVRKEFAHCL-DVVKGGGI---FAIGD----VVSPKVKTTPMV-----PNMP 284
SQ+ + F++CL D+ + G+ + + + T ++ P+
Sbjct: 319 PSQIQSI--YGHSFSYCLTDLFSNTSVSSKLIFGEDKELLNNHNLNFTTLLAGEETPDET 376
Query: 285 HYNVILEEVEVGGNPLDLPTSLLGTGDE-------RGTIIDSGTTLAYLPPMLYDLVLSQ 337
Y + ++ + VGG LD+ E GTIIDSG+TL + P YD++
Sbjct: 377 FYYLQIKSIMVGGEVLDISEQTWHWSSEGAAADAGGGTIIDSGSTLTFFPDSAYDIIKEA 436
Query: 338 FR 339
F
Sbjct: 437 FE 438
>gi|449529533|ref|XP_004171754.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 437
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 80/288 (27%), Positives = 126/288 (43%), Gaps = 38/288 (13%)
Query: 58 RRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC-AGCSR 116
+ R+++S+ L GN +P G Y + +G + + +D+GSDL WV C A C+
Sbjct: 32 KNSDRLLSSVVFPLKGNVYP--LGYYSVSINIGKGDEAFEFDIDSGSDLTWVQCDAPCTH 89
Query: 117 CPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSC-SPGVRCEYVVTYGDGS 175
C + L+ P+ ++ + C + C + + C S +C+Y + Y D
Sbjct: 90 CTKPRE-----QLYKPNNNA----LNCFEPLCTSLHPITNHHCKSADDQCQYEIEYADHG 140
Query: 176 STSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANS 235
S+ G V D + L +G+L AP + FGCG + S+ G+LG G
Sbjct: 141 SSLGVLVNDHVPLKLTNGSL-AAP---RIAFGCGYDHKYSVPDSSPPTA-GVLGLGNGEV 195
Query: 236 SLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSPKVKTTPMVPNMPH------YNVI 289
S +SQL++ G VR HCL GG GD P T +M H Y+
Sbjct: 196 SFISQLSSMGVVRNVVGHCLS--DEGGFLFFGDEFVPSSGVT--WTSMSHESIGSYYSSG 251
Query: 290 LEEVEVGGNPLDLPTSLLGTGDERGTII-DSGTTLAYLPPMLYDLVLS 336
EV GG TG + T++ DSG++ Y Y+ +L+
Sbjct: 252 PAEVYFGGK---------ATGIKDLTLVFDSGSSYTYFNSQAYNSILA 290
>gi|15222357|ref|NP_174430.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|12322538|gb|AAG51267.1|AC027135_8 chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
gi|67633408|gb|AAY78629.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332193236|gb|AEE31357.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 445
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 91/294 (30%), Positives = 127/294 (43%), Gaps = 39/294 (13%)
Query: 59 RHGRMMASIDLELG--GNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSR 116
R R DL+ G NG G YF + +GTP + + DTGSDL WV C C +
Sbjct: 64 RSRRFTTKTDLQSGLISNG-----GEYFMSISIGTPPSKVFAIADTGSDLTWVQCKPCQQ 118
Query: 117 CPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSS 176
C ++ LFD KSST +C C+ + C+Y +YGD S
Sbjct: 119 CYKQNS-----PLFDKKKSSTYKTESCDSKTCQALSEHEEGCDESKDICKYRYSYGDNSF 173
Query: 177 TSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSS 236
T G + I ++ +SG+ + P +FGCG G + GI+G G S
Sbjct: 174 TKGDVATETISIDSSSGSSVSFP---GTVFGCGYNNGGTF----EETGSGIIGLGGGPLS 226
Query: 237 LLSQLAAAGNVRKEFAHCLD----VVKGGGIFAIGDVVSPK-------VKTTPMVPNMP- 284
L+SQL ++ + K+F++CL G + +G P TTP++ P
Sbjct: 227 LVSQLGSS--IGKKFSYCLSHTAATTNGTSVINLGTNSIPSNPSKDSATLTTPLIQKDPE 284
Query: 285 -HYNVILEEVEVGGNPLDLP---TSLLGTGDER--GTIIDSGTTLAYLPPMLYD 332
+Y + LE V VG L L G +R IIDSGTTL L YD
Sbjct: 285 TYYFLTLEAVTVGKTKLPYTGGGYGLNGKSSKRTGNIIIDSGTTLTLLDSGFYD 338
>gi|357481191|ref|XP_003610881.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355512216|gb|AES93839.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 427
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 82/272 (30%), Positives = 113/272 (41%), Gaps = 30/272 (11%)
Query: 78 SATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSST 137
S G Y K+ LG+P + Y VDTGSDL+W C C C + K +F+P +S T
Sbjct: 77 SNNGDYLMKLTLGSPPVDIYGLVDTGSDLVWAQCTPCGGCYRQ-----KSPMFEPLRSKT 131
Query: 138 SGEIACSDNFCRTT-YNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLK 196
I C C Y SCSP C Y +Y D S T G R+ I + G+
Sbjct: 132 YSPIPCESEQCSFFGY-----SCSPQKMCAYSYSYADSSVTKGVLAREAITFSSTDGDPV 186
Query: 197 TAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL- 255
+IFGCG+ SG + + SL+SQ+ K F+ CL
Sbjct: 187 VV---GDIIFGCGHSNSGTFNENDMGIIGMG----GGPLSLVSQIGTLYG-SKRFSQCLV 238
Query: 256 ----DVVKGGGIF--AIGDVVSPKVKTTPMVPN--MPHYNVILEEVEVGGNPLDLPTSLL 307
D G I DV V TTP+ Y V LE + VG + +S
Sbjct: 239 PFHTDAHTSGTINFGEESDVSGEGVVTTPLASEEGQTSYLVTLEGISVGDTFVRFNSS-- 296
Query: 308 GTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
T + +IDSGT Y+P Y+ ++ + +
Sbjct: 297 ETLSKGNIMIDSGTPATYIPQEFYERLVEELK 328
>gi|255543963|ref|XP_002513044.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223548055|gb|EEF49547.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 431
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 79/266 (29%), Positives = 118/266 (44%), Gaps = 37/266 (13%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y + +GTP + +DT +D W+ C+GC +G T+F+ KS+T +
Sbjct: 96 YIVRAKIGTPAQTMLLAMDTSNDAAWIPCSGC--------VGCSSTVFNNVKSTTFKTVG 147
Query: 143 CSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNS 202
C C+ N++ C G C + +TYG SS + +D++ L A+ ++
Sbjct: 148 CEAPQCKQVPNSK---CG-GSACAFNMTYGS-SSIAANLSQDVVTL--ATDSIP------ 194
Query: 203 SVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKG-- 260
S FGC +G + G+LG G+ SLLSQ + F++CL +
Sbjct: 195 SYTFGCLTEATG-----SSIPPQGLLGLGRGPMSLLSQ--TQNLYQSTFSYCLPSFRSLN 247
Query: 261 -GGIFAIGDVVSPK-VKTTPMVPNMPH---YNVILEEVEVGGNPLDLPTSLLGTGDER-- 313
G +G V PK +KTTP++ N Y V L + VG +D+P S L
Sbjct: 248 FSGSLRLGPVGQPKRIKTTPLLKNPRRSSLYYVNLMAIRVGRRVVDIPPSALAFNPTTGA 307
Query: 314 GTIIDSGTTLAYLPPMLYDLVLSQFR 339
GTI DSGT L Y V FR
Sbjct: 308 GTIFDSGTVFTRLVAPAYTAVRDAFR 333
>gi|326516344|dbj|BAJ92327.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 464
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 82/276 (29%), Positives = 123/276 (44%), Gaps = 40/276 (14%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y VG+G E V VDT S+L WV C C C + + LFDPS S + +
Sbjct: 113 YVATVGIGG--GEATVIVDTASELTWVQCEPCDACHDQQE-----PLFDPSSSPSYAAVP 165
Query: 143 CSDNFC---RTTYNNRYPSCSPG-VRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTA 198
C+ + C R +C C Y ++Y DGS + G D +L+ A +++
Sbjct: 166 CNSSSCDALRVATGMSGQACDDQPAACSYTLSYRDGSYSRGVLAHD--RLSLAGEDIQ-- 221
Query: 199 PLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQ-LAAAGNVRKEFAHCLDV 257
+FGCG G G ++ G++G G++ SL+SQ + G V F++CL
Sbjct: 222 ----GFVFGCGTSNQGPFGGTS-----GLMGLGRSQLSLISQTMDQFGGV---FSYCLPP 269
Query: 258 VKGG--GIFAIGDVVSPKVKTTPMVPNM--------PHYNVILEEVEVGGNPLDLPTSLL 307
+ G G +GD S +TP+V P Y L + VGG + P
Sbjct: 270 KESGSSGSLVLGDDASVYRNSTPIVYTAMVSDPLQGPFYLANLTGITVGGEDVQSPGFSA 329
Query: 308 GTGDERGTIIDSGTTLAYLPPMLYDLVLSQFRFWIA 343
G G + I+DSGT + L P +Y V ++F +A
Sbjct: 330 GGGGK--AIVDSGTIITSLVPSVYAAVRAEFVSQLA 363
>gi|449464178|ref|XP_004149806.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 437
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 77/282 (27%), Positives = 126/282 (44%), Gaps = 26/282 (9%)
Query: 58 RRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC-AGCSR 116
+ R+++S+ L GN +P G Y + +G + + +D+GSDL WV C A C+
Sbjct: 32 KNSDRLLSSVVFPLKGNVYP--LGYYSVSINIGKGDEAFEFDIDSGSDLTWVQCDAPCTH 89
Query: 117 CPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSC-SPGVRCEYVVTYGDGS 175
C + L+ P+ ++ + C + C + + C S +C+Y + Y D
Sbjct: 90 CTKPRE-----QLYKPNNNA----LNCFEPLCTSLHPITNHHCKSADDQCQYEIEYADHG 140
Query: 176 STSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANS 235
S+ G V D + L +G+L AP + FGCG + S+ G+LG G
Sbjct: 141 SSLGVLVNDHVPLKLTNGSL-AAP---RIAFGCGYDHKYSVPDSSPPTA-GVLGLGNGEV 195
Query: 236 SLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEV 295
S +SQL++ G VR HCL GG GD P T +M H ++
Sbjct: 196 SFISQLSSMGVVRNVVGHCLS--DEGGFLFFGDEFVPSSGVT--WTSMSHESI---GSYY 248
Query: 296 GGNPLDLPTSLLGTGDERGTII-DSGTTLAYLPPMLYDLVLS 336
P ++ S TG + T++ DSG++ Y Y+ +L+
Sbjct: 249 SSGPAEVYFSGKATGIKDLTLVFDSGSSYTYFNSQAYNSILA 290
>gi|293329689|dbj|BAJ04354.1| pollen allergen CPA63 [Cryptomeria japonica]
Length = 472
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 78/289 (26%), Positives = 124/289 (42%), Gaps = 45/289 (15%)
Query: 73 GNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDP 132
+G ++ Y K+G GTP +Y +DTGS++ W+ C CS C +K F+P
Sbjct: 114 ASGQAISSSNYIIKLGFGTPPQSFYTVLDTGSNIAWIPCNPCSGCSSKQQ------PFEP 167
Query: 133 SKSSTSGEIACSDNFCR-----TTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQ 187
SKSST + C+ C+ T +N V C YGD S + +
Sbjct: 168 SKSSTYNYLTCASQQCQLLRVCTKSDN-------SVNCSLTQRYGDQSEVDEILSSETLS 220
Query: 188 L-NQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGN 246
+ +Q N +FGC N G + + ++GFG+ S +SQ A +
Sbjct: 221 VGSQQVENF---------VFGCSNAARGLIQRTP-----SLVGFGRNPLSFVSQTATLYD 266
Query: 247 VRKEFAHCL-----DVVKGGGIFAIGDVVSPKVKTTPMVPNMPH---YNVILEEVEVGGN 298
F++CL G + + + +K TP++ N + Y V L + VG
Sbjct: 267 --STFSYCLPSLFSSAFTGSLLLGKEALSAQGLKFTPLLSNSRYPSFYYVGLNGISVGEE 324
Query: 299 PLDLPTSLLGTGDE--RGTIIDSGTTLAYLPPMLYDLVLSQFRFWIASL 345
+ +P L + RGTIIDSGT + L Y+ + FR +++L
Sbjct: 325 LVSIPAGTLSLDESTGRGTIIDSGTVITRLVEPAYNAMRDSFRSQLSNL 373
>gi|118486912|gb|ABK95290.1| unknown [Populus trichocarpa]
Length = 438
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 85/269 (31%), Positives = 119/269 (44%), Gaps = 39/269 (14%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y +V LGTP + ++ +DT +D WV C+GC+ G T F P+ S+T G +
Sbjct: 98 YVVRVKLGTPGQQMFMVLDTSNDAAWVPCSGCT--------GFSSTTFLPNASTTLGSLD 149
Query: 143 CSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNS 202
CS C P+ C + +YG SS + V+D I L +
Sbjct: 150 CSGAQCSQVRGFSCPATGSSA-CLFNQSYGGDSSLTATLVQDAITLAND--------VIP 200
Query: 203 SVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKG-- 260
FGC N SG G+LG G+ SL+SQ A F++CL K
Sbjct: 201 GFTFGCINAVSGG-----SIPPQGLLGLGRGPISLISQ--AGAMYSGVFSYCLPSFKSYY 253
Query: 261 -GGIFAIGDVVSPK-VKTTPMVPNMPH----YNVILEEVEVGGNPLDLPTSLL----GTG 310
G +G V PK ++TTP++ N PH Y V L V VG + +P+ L TG
Sbjct: 254 FSGSLKLGPVGQPKSIRTTPLLRN-PHRPSLYYVNLTGVSVGRIKVPIPSEQLVFDPNTG 312
Query: 311 DERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
GTIIDSGT + +Y + +FR
Sbjct: 313 --AGTIIDSGTVITRFVQPVYFAIRDEFR 339
>gi|297810815|ref|XP_002873291.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
lyrata]
gi|297319128|gb|EFH49550.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
lyrata]
Length = 439
Score = 88.2 bits (217), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 95/310 (30%), Positives = 141/310 (45%), Gaps = 47/310 (15%)
Query: 47 RTLSALKQHDTRRHGRMMASIDLELGGNGHPSATG-------LYFTKVGLGTPTDEYYVQ 99
R L L Q R+ L G + P A+G Y KV +GTP +
Sbjct: 60 RVLQTLAQD----QARLQYLSSLVAGRSVVPIASGRQMLQSTTYIVKVLIGTPAQPLLLA 115
Query: 100 VDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSC 159
+DT SD+ W+ C+GC CP+ T F P+KS++ ++CS C+ N P+C
Sbjct: 116 MDTSSDVAWIPCSGCVGCPSN-------TAFSPAKSTSFKNVSCSAPQCKQVPN---PAC 165
Query: 160 SPGVR-CEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGS 218
G R C + +TYG SS + +D I+L A+ +K + FGC N+ +G
Sbjct: 166 --GARACSFNLTYGS-SSIAANLSQDTIRL--AADPIK------AFTFGCVNKVAGG--- 211
Query: 219 STDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKG---GGIFAIGDVVSP-KV 274
T G+LG G+ SL+SQ A + F++CL + G +G P +V
Sbjct: 212 GTIPPPQGLLGLGRGPLSLMSQ--AQSVYKSTFSYCLPSFRSLTFSGSLRLGPTSQPQRV 269
Query: 275 KTTPMVPNMPH---YNVILEEVEVGGNPLDLPTSLLGTGDER--GTIIDSGTTLAYLPPM 329
K T ++ N Y V L + VG +DLP + + GTI DSGT L
Sbjct: 270 KYTQLLRNPRRSSLYYVNLVAIRVGRKVVDLPPAAIAFNPSTGAGTIFDSGTVYTRLAKP 329
Query: 330 LYDLVLSQFR 339
+Y+ V ++FR
Sbjct: 330 VYEAVRNEFR 339
>gi|356509399|ref|XP_003523437.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
Length = 421
Score = 88.2 bits (217), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 68/217 (31%), Positives = 100/217 (46%), Gaps = 19/217 (8%)
Query: 67 IDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC-AGCSRCPTKSDLGI 125
+ ++ GN +P G Y + +G P Y + +DTGSDL WV C A C C +
Sbjct: 50 VAFQIKGNVYP--LGYYTVSLAIGNPPKVYDLDIDTGSDLTWVQCDAPCKGCTLPRN--- 104
Query: 126 KLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCS-PGVRCEYVVTYGDGSSTSGYFVRD 184
L+ P + C D C + C+ P +C+Y V Y D S+ G +RD
Sbjct: 105 --RLYKPH----GDLVKCVDPLCAAIQSAPNHHCAGPNEQCDYEVEYADQGSSLGVLLRD 158
Query: 185 IIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAA 244
I L +G+L L FGCG Q+ G + + G+LG G +S+LSQL +
Sbjct: 159 NIPLKFTNGSLARPML----AFGCGYDQTHH-GQNPPPSTAGVLGLGNGRTSILSQLHSL 213
Query: 245 GNVRKEFAHCLDVVKGGGIFAIGDVVSPK-VKTTPMV 280
G +R HCL GG +F ++ P V TP++
Sbjct: 214 GLIRNVVGHCLSGRGGGFLFFGDQLIPPSGVVWTPLL 250
>gi|356569424|ref|XP_003552901.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 536
Score = 88.2 bits (217), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 80/308 (25%), Positives = 135/308 (43%), Gaps = 33/308 (10%)
Query: 49 LSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLW 108
+++LK G +MA+++ +G TG YF + +GTP ++ +DTGSDL W
Sbjct: 141 VASLKSSKDEFSGNIMATLE-----SGASLGTGEYFIDMFVGTPPKHVWLILDTGSDLSW 195
Query: 109 VNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYN-NRYPSC-SPGVRCE 166
+ C C C ++ ++P++SS+ I+C D C+ + + C + C
Sbjct: 196 IQCDPCYDCFEQNG-----PHYNPNESSSYRNISCYDPRCQLVSSPDPLQHCKTENQTCP 250
Query: 167 YVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLN-SSVIFGCGNRQSGDLG-------- 217
Y Y DGS+T+G F + +N N K + V+FGCG+ G
Sbjct: 251 YFYDYADGSNTTGDFALETFTVNLTWPNGKEKFKHVVDVMFGCGHWNKGFFHGAGGLLGL 310
Query: 218 ----SSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSPK 273
S + + I +G + S L+ L + +V + D + ++ K
Sbjct: 311 GRGPLSFPSQLQSI--YGHSFSYCLTDLFSNTSVSSKLIFGED----KELLNHHNLNFTK 364
Query: 274 VKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDE--RGTIIDSGTTLAYLPPMLY 331
+ P+ Y + ++ + VGG LD+P E GTIIDSG+TL + P Y
Sbjct: 365 LLAGEETPDDTFYYLQIKSIVVGGEVLDIPEKTWHWSSEGVGGTIIDSGSTLTFFPDSAY 424
Query: 332 DLVLSQFR 339
D++ F
Sbjct: 425 DVIKEAFE 432
>gi|242092892|ref|XP_002436936.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
gi|241915159|gb|EER88303.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
Length = 469
Score = 88.2 bits (217), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 91/272 (33%), Positives = 123/272 (45%), Gaps = 37/272 (13%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGC--SRCPTKSDLGIKLTLFDPSKSSTSGE 140
Y +G GTP + +DTGSDL WV C C S C + D +FDPS SST
Sbjct: 122 YVVTLGFGTPAVPQVLLIDTGSDLSWVQCQPCNSSTCYPQKD-----PVFDPSASSTYAP 176
Query: 141 IACSDNFCR----TTYNNRYPSCSPGVR-CEYVVTYGDGSSTSGYFVRDIIQLNQASGNL 195
+ C CR +Y N + S G C+Y + YG+G +T G + + + L+
Sbjct: 177 VPCGSEACRDLDPDSYANGCTNSSSGASLCQYGIQYGNGDTTVGVYSTETLTLSP----- 231
Query: 196 KTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL 255
+ A + ++ FGCG Q G DG+LG G A SL+SQ G F++CL
Sbjct: 232 EAATVVNNFSFGCGLVQKG-----VFDLFDGLLGLGGAPESLVSQ--TTGTYGGAFSYCL 284
Query: 256 DVVKG-GGIFAIGDVVSPKVKT-----TPM-VPNMPHYNVILEEVEVGGNPLDL-PTSLL 307
G A+G + T TP+ V Y V L + VGG LD+ PT
Sbjct: 285 PAGNSTAGFLALGAPATGGNNTAGFQFTPLQVVETTFYLVKLTGISVGGKQLDIEPTVFA 344
Query: 308 GTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
G G IIDSGT + LP Y + + FR
Sbjct: 345 G-----GMIIDSGTIVTGLPETAYSALRTAFR 371
>gi|357494221|ref|XP_003617399.1| 60S ribosomal protein L18a [Medicago truncatula]
gi|355518734|gb|AET00358.1| 60S ribosomal protein L18a [Medicago truncatula]
Length = 749
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 78/278 (28%), Positives = 118/278 (42%), Gaps = 32/278 (11%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSG 139
+G YF V +GTP Y + +DTGSDL W+ C C C +S +DP +SS+
Sbjct: 189 SGEYFMDVFIGTPPKHYSLILDTGSDLNWIQCVPCIACFEQSG-----PYYDPKESSSFE 243
Query: 140 EIACSDNFCR--TTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKT 197
I C D C+ ++ + P C Y YGD S+T+G F + +N + N K+
Sbjct: 244 NITCHDPRCKLVSSPDPPKPCKDENQTCPYFYWYGDSSNTTGDFALETFTVNLTTPNGKS 303
Query: 198 APLN-SSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLD 256
+ +V+FGCG+ G + G+ S SQL + F++CL
Sbjct: 304 EQKHVENVMFGCGHWNRGLFHGAAGLLGL-----GRGPLSFASQLQSIYG--HSFSYCL- 355
Query: 257 VVKGGGIFAIGDVVSPKVKTTPMVPNM--------------PHYNVILEEVEVGGNPLDL 302
V + ++ + K PN+ Y V ++ + V G L +
Sbjct: 356 VDRNSDTSVSSKLIFGEDKELLSHPNLNFTSFVGGEENSVDTFYYVGIKSIMVDGEVLKI 415
Query: 303 PTSLLGTGDE--RGTIIDSGTTLAYLPPMLYDLVLSQF 338
P E GTIIDSGTTL Y Y+++ F
Sbjct: 416 PEETWHLSKEGGGGTIIDSGTTLTYFAEPAYEIIKEAF 453
>gi|148908573|gb|ABR17396.1| unknown [Picea sitchensis]
Length = 350
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 81/272 (29%), Positives = 115/272 (42%), Gaps = 40/272 (14%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSG 139
+G YFT++G+GTPT E Y+ +DTGSD++W+ C C C +++D +F+PS S +
Sbjct: 5 SGEYFTRIGIGTPTREQYMVLDTGSDVVWIQCEPCRECYSQAD-----PIFNPSSSVSFS 59
Query: 140 EIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAP 199
+ C C N C G C Y V+YGDGS T G + + + S
Sbjct: 60 TVGCDSAVCSQLDAN---DCHGG-GCLYEVSYGDGSYTVGSYATETLTFGTTS------- 108
Query: 200 LNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL---D 256
+V GCG+ G V G SL + F++CL D
Sbjct: 109 -IQNVAIGCGHDNVGLF-------VGAAGLLGLGAGSLSFPAQLGTQTGRAFSYCLVDRD 160
Query: 257 VVKGGGI------FAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTG 310
G + IG + +P V P +P Y + + + VGG LD S
Sbjct: 161 SESSGTLEFGPESVPIGSIFTPLVA-NPFLPTF--YYLSMVAISVGGVILDSVPSEAFRI 217
Query: 311 DER----GTIIDSGTTLAYLPPMLYDLVLSQF 338
DE G IIDSGT + L YD + F
Sbjct: 218 DETTGRGGIIIDSGTAVTRLQTSAYDALRDAF 249
>gi|125533812|gb|EAY80360.1| hypothetical protein OsI_35532 [Oryza sativa Indica Group]
Length = 428
Score = 87.8 bits (216), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 84/281 (29%), Positives = 127/281 (45%), Gaps = 52/281 (18%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSG 139
T LY VGLGTP V++DTGS WV C C C T F S+S+T
Sbjct: 79 TSLYVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNP------RTFLQSRSTTCA 131
Query: 140 EIAC---------SDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQ 190
+++C SD C+ + N YP C + V+Y DGS++ G +D + +
Sbjct: 132 KVSCGTSMCLLGGSDPHCQDSEN--YPD------CPFRVSYQDGSASYGILYQDTLTFS- 182
Query: 191 ASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKE 250
+++ P S FGC N S G++ VDG+LG G S+L Q + +
Sbjct: 183 ---DVQKIP---SFTFGC-NLDS--FGANEFGNVDGLLGMGAGPMSVLKQSSPRFD---G 230
Query: 251 FAHCLDVVKG--------GGIFAIGDVVS-PKVKTTPMV---PNMPHYNVILEEVEVGGN 298
F++CL + K G F++G V + V+ T MV N + V L + V G
Sbjct: 231 FSYCLPLQKSERGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGE 290
Query: 299 PLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
L L S+ +G + DSG+ L+Y+P ++ + R
Sbjct: 291 RLGLSPSIFS---RKGVVFDSGSELSYIPDRALSVLSQRIR 328
>gi|242067693|ref|XP_002449123.1| hypothetical protein SORBIDRAFT_05g005430 [Sorghum bicolor]
gi|241934966|gb|EES08111.1| hypothetical protein SORBIDRAFT_05g005430 [Sorghum bicolor]
Length = 408
Score = 87.8 bits (216), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 86/287 (29%), Positives = 127/287 (44%), Gaps = 42/287 (14%)
Query: 69 LELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLT 128
+L G+ +P G ++ + +G P + Y++ +DTGS W+ C P K+ +
Sbjct: 27 FKLDGSVYP--VGHFYVTMNIGEPAEPYFLDIDTGSSFTWLECHA-KDGPCKTCNKVPHP 83
Query: 129 LFDPSKSSTSGEIACSDNFCRTTYNN--RYPSCSPGVR---CEYVVTYGDGSSTSGYFVR 183
L+ ++ + C+D C + + C+ VR C+Y V Y DG S+ G +
Sbjct: 84 LYRLTRKKL---VPCADPLCDALHKDLGTTKKCT-DVRKNQCDYKVKYQDGLSSLGVLLL 139
Query: 184 DIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDA----AVDGILGFGQANSSLLS 239
D L ++ FGCG Q GS A VDGILG G+ + L S
Sbjct: 140 DKFSLPTGGAR--------NIAFGCGYDQMK--GSKKKAPEKVPVDGILGLGRGSVDLAS 189
Query: 240 QLAAAGNVRKE-FAHCLDVVKGGGIFAIGD--VVSPKVKTTPMVPNMP----HYNVILEE 292
QL +G V K HCL KGGG IG+ V S V PM P P HY+
Sbjct: 190 QLKHSGAVSKNVIGHCLS-SKGGGYLFIGEENVPSSHVTWVPMAPTTPGEPNHYSPGQAT 248
Query: 293 VEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
+ + NP +GT + I DSG+T YLP L+ ++S +
Sbjct: 249 LHLDSNP-------IGTKPLKA-IFDSGSTYTYLPENLHAQLVSALK 287
>gi|224093400|ref|XP_002309912.1| predicted protein [Populus trichocarpa]
gi|222852815|gb|EEE90362.1| predicted protein [Populus trichocarpa]
Length = 382
Score = 87.8 bits (216), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 77/277 (27%), Positives = 125/277 (45%), Gaps = 40/277 (14%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPS 133
+G +G YF ++GLG+P Y+ +D+GSD++WV C C++C ++D LFDP+
Sbjct: 34 SGMNQGSGEYFVRIGLGSPPRSQYMVIDSGSDIVWVQCKPCTQCYHQTD-----PLFDPA 88
Query: 134 KSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASG 193
S++ ++CS C N C+ G RC Y V+YGDGS T G + + +
Sbjct: 89 DSASFMGVSCSSAVCDRVEN---AGCNSG-RCRYEVSYGDGSYTKGTLALETLTFGRT-- 142
Query: 194 NLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAH 253
+ +V GCG+ G + G + S + QL +G F++
Sbjct: 143 ------VVRNVAIGCGHSNRGMFVGAAGLLGL-----GGGSMSFMGQL--SGQTGNAFSY 189
Query: 254 CLDVVKG---GGIFAIGDVVSP-KVKTTPMV--PNMP-HYNVILEEVEVGGNPLDLPTSL 306
CL V +G G G P P+V P P Y + L + VG + + +
Sbjct: 190 CL-VSRGTNTNGFLEFGSEAMPVGAAWIPLVRNPRAPSFYYIRLLGLGVGDTRVPVSEDV 248
Query: 307 -----LGTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
LG+G G ++D+GT + P + Y+ + F
Sbjct: 249 FQLNELGSG---GVVMDTGTAVTRFPTVAYEAFRNAF 282
>gi|224142001|ref|XP_002324349.1| predicted protein [Populus trichocarpa]
gi|222865783|gb|EEF02914.1| predicted protein [Populus trichocarpa]
Length = 490
Score = 87.8 bits (216), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 85/272 (31%), Positives = 119/272 (43%), Gaps = 28/272 (10%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPS 133
+G +G Y VGLGTP + + DTGSD+ W C C+R K K +FDPS
Sbjct: 140 DGSTVGSGNYIVTVGLGTPKKDLSLIFDTGSDITWTQCQPCARSCYKQ----KEQIFDPS 195
Query: 134 KSS--TSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQA 191
+S+ T+ + S T+ P C+ C Y + YGD S + G+F + + L
Sbjct: 196 QSTSYTNISCSSSICNSLTSATGNTPGCASSA-CVYGIQYGDSSFSVGFFGTEKLTLTS- 253
Query: 192 SGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEF 251
T N ++ FGCG G G S G+ S++SQ A N K F
Sbjct: 254 -----TDAFN-NIYFGCGQNNQGLFGGSAGLLGL-----GRDKLSVVSQTAQKYN--KIF 300
Query: 252 AHCLDVVKGG-GIFAIGDVVSPKVKTTPM--VPNMP-HYNVILEEVEVGGNPLDLPTSLL 307
++CL G G S K TP+ + P Y + + VGG L + S+
Sbjct: 301 SYCLPSSSSSTGFLTFGGSASKNAKFTPLSTISAGPSFYGLDFTGISVGGKKLAISASVF 360
Query: 308 GTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
T G IIDSGT + LPP Y + + FR
Sbjct: 361 STA---GAIIDSGTVITRLPPAAYSALRASFR 389
>gi|212274713|ref|NP_001130791.1| uncharacterized protein LOC100191895 precursor [Zea mays]
gi|194690124|gb|ACF79146.1| unknown [Zea mays]
gi|194708040|gb|ACF88104.1| unknown [Zea mays]
gi|223950469|gb|ACN29318.1| unknown [Zea mays]
gi|414885521|tpg|DAA61535.1| TPA: hypothetical protein ZEAMMB73_650724 [Zea mays]
Length = 500
Score = 87.8 bits (216), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 85/285 (29%), Positives = 118/285 (41%), Gaps = 52/285 (18%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y VGLG E V VDT S+L WV CA C C + + LFDPS S + +
Sbjct: 143 YVATVGLGG--GEATVIVDTASELTWVQCAPCESCHDQ-----QGPLFDPSSSPSYAAVP 195
Query: 143 CSDNFCRTTYNNR-------YPSCSPG--VRCEYVVTYGDGSSTSGYFVRDIIQLNQASG 193
C C P C G C Y ++Y DGS + G D + L
Sbjct: 196 CDSPSCDALQQQLATGAGAGAPPCDAGRPAACSYALSYRDGSYSRGVLAHDRLSL----- 250
Query: 194 NLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQ-LAAAGNVRKEFA 252
+ +FGCG G T G++G G++ SL+SQ + G V F+
Sbjct: 251 ---AGEVIDGFVFGCGTSNQGPPFGGT----SGLMGLGRSQLSLVSQTVDQFGGV---FS 300
Query: 253 HCLDVVK---GGGIFAIGDVVSPKVKTTP-----MVPNM------PHYNVILEEVEVGGN 298
+CL + + G +GD S +TP MV N P Y V L + VGG
Sbjct: 301 YCLPLSRESDASGSLVLGDDPSAYRNSTPVVYTSMVSNSDPLLQGPFYLVNLTGITVGGQ 360
Query: 299 PLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFRFWIA 343
++ TG I+DSGT + L P +Y+ V ++F +A
Sbjct: 361 EVE------STGFSARAIVDSGTVITSLVPSVYNAVRAEFMSQLA 399
>gi|326507654|dbj|BAK03220.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 442
Score = 87.8 bits (216), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 91/319 (28%), Positives = 128/319 (40%), Gaps = 57/319 (17%)
Query: 54 QHDTRRHGRMMASIDLELGGNGHPSAT----------GLYFTKVGLGTPTDEYYVQVDTG 103
+ D RH R EL +G + G Y + +GTP Y DTG
Sbjct: 53 RRDMHRHARFTR----ELASSGDRTVAAPTRKDLPNGGEYIMTLAIGTPPLSYPAIADTG 108
Query: 104 SDLLWVNCAGC-SRCPTKSDLGIKLTLFDPSKSSTSGEIAC--SDNFCRTTYNNRYPSCS 160
SDL+W CA C S+C ++ ++PS S+T G + C S + C PS
Sbjct: 109 SDLIWTQCAPCGSQCFKQAG-----QPYNPSSSTTFGVLPCNSSVSMCAALAG---PSPP 160
Query: 161 PGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSST 220
PG C Y TYG G T+G + + P + FGC N S D S
Sbjct: 161 PGCSCMYNQTYGTG-WTAGIQSVETFTFGSTPADQTRVP---GIAFGCSNASSDDWNGSA 216
Query: 221 DAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVK------------GGGIFAIGD 268
G++G G+ + SL+SQL A F++CL + + G
Sbjct: 217 -----GLVGLGRGSMSLVSQLGAG-----MFSYCLTPFQDANSTSTLLLGPSAALNGTGV 266
Query: 269 VVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTS--LLGTGDERGTIIDSGTTLAYL 326
+ +P V + P +Y + L + +G L +P + L T G IIDSGTT+
Sbjct: 267 LTTPFVASPSKAPMSTYYYLNLTGISIGTTALSIPPNAFALRTDGTGGLIIDSGTTITS- 325
Query: 327 PPMLYDLVLSQFRFWIASL 345
L D Q R I SL
Sbjct: 326 ---LVDAAYQQVRAAIESL 341
>gi|88174577|gb|ABD39363.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
Group]
Length = 321
Score = 87.8 bits (216), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 83/278 (29%), Positives = 127/278 (45%), Gaps = 52/278 (18%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y T VGLGTP V++DTGS + WV C C C T F S+S+T +++
Sbjct: 1 YVTSVGLGTPAKTQIVEIDTGSSISWVFCE-CDGCHTNP------RTFLQSRSTTCAKVS 53
Query: 143 C---------SDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASG 193
C SD C+ + N YP C + V+Y DGS++ G +D + +
Sbjct: 54 CGTSMCLLGGSDPHCQDSEN--YPD------CPFRVSYQDGSASYGILYQDTLTFS---- 101
Query: 194 NLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAH 253
+++ P S FGC N S G++ VDG+LG G S+L Q + + F++
Sbjct: 102 DVQKIP---SFTFGC-NLDS--FGANEFGNVDGLLGMGAGPMSVLKQSSPTFD---GFSY 152
Query: 254 CLDVVKG--------GGIFAIGDVVS-PKVKTTPMV---PNMPHYNVILEEVEVGGNPLD 301
CL + K G F++G V + V+ T MV N + V L + V G L
Sbjct: 153 CLPLQKSERGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLG 212
Query: 302 LPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
L S+ +G + DSG+ L+Y+P ++ + R
Sbjct: 213 LSPSIFS---RKGVVFDSGSELSYIPDRALSVLSQRIR 247
>gi|356523155|ref|XP_003530207.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 412
Score = 87.4 bits (215), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 94/305 (30%), Positives = 142/305 (46%), Gaps = 54/305 (17%)
Query: 59 RHGRMMASIDLELGGNGHPSATGL------YFTKVGLGTPTDEYYVQVDTGSDLLWVNCA 112
R R+++S ++E P ++G+ Y +GLG+ V +DTGSDL WV C
Sbjct: 35 RIRRVVSSHNVEASQTQIPLSSGINLQTLNYIVTMGLGST--NMTVIIDTGSDLTWVQCE 92
Query: 113 GCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRT----TYNNRYPSCSPGVRCEYV 168
C C + +F PS SS+ ++C+ + C++ T N +P C YV
Sbjct: 93 PCMSCYNQQG-----PIFKPSTSSSYQSVSCNSSTCQSLQFATGNTGACGSNPST-CNYV 146
Query: 169 VTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGIL 228
V YGDGS T+G + + S S +FGCG G G V G++
Sbjct: 147 VNYGDGSYTNGELGVEQLSFGGVS--------VSDFVFGCGRNNKGLFG-----GVSGLM 193
Query: 229 GFGQANSSLLSQLAAA-GNVRKEFAHCLDVVKGG--GIFAIGDVVSPKVKTTP-----MV 280
G G++ SL+SQ A G V F++CL + G G +G+ S TP M+
Sbjct: 194 GLGRSYLSLVSQTNATFGGV---FSYCLPTTESGASGSLVMGNESSVFKNVTPITYTRML 250
Query: 281 PN--MPHYNVI-LEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYD----L 333
PN + ++ ++ L ++V G L +P+ G G G +IDSGT + LP +Y L
Sbjct: 251 PNPQLSNFYILNLTGIDVDGVALQVPS--FGNG---GVLIDSGTVITRLPSSVYKALKAL 305
Query: 334 VLSQF 338
L QF
Sbjct: 306 FLKQF 310
>gi|357500973|ref|XP_003620775.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|357500991|ref|XP_003620784.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355495790|gb|AES76993.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355495799|gb|AES77002.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 438
Score = 87.4 bits (215), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 75/267 (28%), Positives = 120/267 (44%), Gaps = 31/267 (11%)
Query: 78 SATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSST 137
S G Y +GTP + Y VDTGSD++W+ C C +C ++ F+PSKSS+
Sbjct: 82 SYEGDYIMSYSVGTPPIKSYGIVDTGSDIVWLQCEPCEQCYNQT-----TPKFNPSKSSS 136
Query: 138 SGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKT 197
I+CS C++ R SC+ CEY + YG+ S + G + + L +G +
Sbjct: 137 YKNISCSSKLCQSV---RDTSCNDKKNCEYSINYGNQSHSQGDLSLETLTLESTTGRPVS 193
Query: 198 APLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL-- 255
P + GCG G + V G +SL++QL + + +F++CL
Sbjct: 194 FP---KTVIGCGTNNIGSFKRVSSGVVGL----GGGPASLITQLGPS--IGGKFSYCLVR 244
Query: 256 ------DVVKGGGIFAIGDVV---SPKVKTTPMVP--NMPHYNVILEEVEVGGNPLDLPT 304
++ G GDV V +TP+V + Y + +E VG ++
Sbjct: 245 MSITLKNMSMGSSKLNFGDVAIVSGHNVLSTPIVKKDHSFFYYLTIEAFSVGDKRVEFAG 304
Query: 305 SLLGTGDERGTIIDSGTTLAYLPPMLY 331
S G +E IIDS T + ++P +Y
Sbjct: 305 SSKGV-EEGNIIIDSSTIVTFVPSDVY 330
>gi|357122568|ref|XP_003562987.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 455
Score = 87.4 bits (215), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 87/284 (30%), Positives = 121/284 (42%), Gaps = 44/284 (15%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSD---LGIKLTLFDPSKSST 137
G Y + +GTP Y DTGSDL+W CA C T +D L++PS S+T
Sbjct: 85 GEYIMTLSIGTPPLSYRAIADTGSDLIWTQCAPCGDTVTDTDNQCFKQSGCLYNPSSSTT 144
Query: 138 SGEIACSD--NFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNL 195
G + C+ + C PS PG C Y TYG G T+G V+ + S +
Sbjct: 145 FGVLPCNSPLSMCAAMAG---PSPPPGCACMYNQTYGTG-WTAG--VQSVETFTFGSSST 198
Query: 196 KTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL 255
A ++ FGC N S D S G++G G+ + SL+SQL A F++CL
Sbjct: 199 PPAVRVPNIAFGCSNASSNDWNGSA-----GLVGLGRGSMSLVSQLGAGA-----FSYCL 248
Query: 256 DVVKGG---GIFAIGDVVSP------KVKTTPMV------PNMPHYNVILEEVEVGGNPL 300
+ +G + V++TP V P +Y + L + VG L
Sbjct: 249 TPFQDANSTSTLLLGPSAAAALKGTGPVRSTPFVAGPSKAPMSTYYYLNLTGISVGETAL 308
Query: 301 DLPTSLL-----GTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
+P GTG G IIDSGTT+ L Y V + R
Sbjct: 309 AIPPDAFSLRADGTG---GLIIDSGTTITTLVDSAYQQVRAAVR 349
>gi|224111722|ref|XP_002315953.1| predicted protein [Populus trichocarpa]
gi|222864993|gb|EEF02124.1| predicted protein [Populus trichocarpa]
Length = 484
Score = 87.4 bits (215), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 92/329 (27%), Positives = 144/329 (43%), Gaps = 62/329 (18%)
Query: 43 GERERTLSALKQHDTRRHGRMMASIDLELGG-------------------------NGHP 77
G + TLS L Q D+ R ++ +DL + +G
Sbjct: 85 GYKSLTLSRL-QRDSARVKSLVTRLDLAINSISSSDLKPLETDSEFKPEDLQSPIISGTS 143
Query: 78 SATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSST 137
+G YF++VG+G P + Y+ +DTGSD+ WV CA C+ C ++D +F+P+ S++
Sbjct: 144 QGSGEYFSRVGIGKPPSQAYLILDTGSDVNWVQCAPCADCYQQAD-----PIFEPASSAS 198
Query: 138 SGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKT 197
++C+ CR+ C C Y V+YGDGS T G FV + I L +
Sbjct: 199 FSTLSCNTRQCRSL---DVSECRNDT-CLYEVSYGDGSYTVGDFVTETITLG-------S 247
Query: 198 APLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL-- 255
AP++ +V GCG+ G G+LG G + S SQ+ A F++CL
Sbjct: 248 APVD-NVAIGCGHNNEGLF-----VGAAGLLGLGGGSLSFPSQINATS-----FSYCLVD 296
Query: 256 DVVKGGGIFAIGDVVSPKVKTTPMVPNM---PHYNVILEEVEVGGNPLDLPTSLLGTGDE 312
+ + P + P++ N Y V L + VGG + +P S DE
Sbjct: 297 RDSESASTLEFNSTLPPNAVSAPLLRNHHLDTFYYVGLTGLSVGGELVSIPESAFQI-DE 355
Query: 313 R---GTIIDSGTTLAYLPPMLYDLVLSQF 338
G I+DSGT + L +Y+ + F
Sbjct: 356 SGNGGVIVDSGTAITRLQTDVYNSLRDAF 384
>gi|297846526|ref|XP_002891144.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297336986|gb|EFH67403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 445
Score = 87.4 bits (215), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 83/270 (30%), Positives = 121/270 (44%), Gaps = 32/270 (11%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGE 140
G YF + +GTP ++ DTGSDL WV C C +C ++ LFD KSST
Sbjct: 83 GEYFMSISIGTPPSKFLAIADTGSDLTWVQCKPCQQCYKQN-----TPLFDKKKSSTYKT 137
Query: 141 IACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPL 200
+C C + C+Y +YGD S T G + I ++ +SG+ + P
Sbjct: 138 ESCDSITCNALSEHEEGCDESRNACKYRYSYGDESFTKGEVATETISIDSSSGSPVSFP- 196
Query: 201 NSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLD---- 256
FGCG G + GI+G G SL+SQL ++ + K+F++CL
Sbjct: 197 --GTAFGCGYNNGGTF----EETGSGIIGLGGGPLSLVSQLGSS--IGKKFSYCLSHTSA 248
Query: 257 VVKGGGIFAIG-DVVSPK------VKTTPMVPNMP--HYNVILEEVEVGGNPLDLP---- 303
G + +G + ++ K + TTP++ P +Y + LE + VG L
Sbjct: 249 TTNGTSVINLGTNSMTSKPSKDSAILTTPLIQKDPETYYFLTLEAITVGKTKLPYTGGGG 308
Query: 304 TSLLGTGDERGT-IIDSGTTLAYLPPMLYD 332
SL + G IIDSGTTL L YD
Sbjct: 309 YSLNRKSKKTGNIIIDSGTTLTLLDSGFYD 338
>gi|356555042|ref|XP_003545848.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 431
Score = 87.4 bits (215), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 78/264 (29%), Positives = 114/264 (43%), Gaps = 37/264 (14%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGE 140
G Y LGTP Y VDT SD++WV C C C + +FDPS S T
Sbjct: 86 GDYLMSYSLGTPPFPVYGIVDTASDIIWVQCQLCETCYNDTS-----PMFDPSYSKTYKN 140
Query: 141 IACSDNFCRTTYNNRYPSCSPGVR--CEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTA 198
+ CS C++ SCS R CE+ V Y DGS + G + + + L +
Sbjct: 141 LPCSSTTCKSVQGT---SCSSDERKICEHTVNYKDGSHSQGDLIVETVTLGSYNDPFVHF 197
Query: 199 PLNSSVIFGCGNRQSGDLGSSTDAAVD--GILGFGQANSSLLSQLAAAGNVRKEFAHCLD 256
P + GC +T+ + D GI+G G SL+ QL+++ + K+F++CL
Sbjct: 198 P---RTVIGCIR--------NTNVSFDSIGIVGLGGGPVSLVPQLSSS--ISKKFSYCLA 244
Query: 257 VVK--------GGGIFAIGD-VVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLL 307
+ G GD VS ++ Y + LE VG N ++ +S
Sbjct: 245 PISDRSSKLKFGDAAMVSGDGTVSTRIVFKDW---KKFYYLTLEAFSVGNNRIEFRSSSS 301
Query: 308 GTGDERGTIIDSGTTLAYLPPMLY 331
+ + IIDSGTT LP +Y
Sbjct: 302 RSSGKGNIIIDSGTTFTVLPDDVY 325
>gi|212275143|ref|NP_001130306.1| uncharacterized protein LOC100191400 precursor [Zea mays]
gi|194688798|gb|ACF78483.1| unknown [Zea mays]
gi|194703430|gb|ACF85799.1| unknown [Zea mays]
gi|194707192|gb|ACF87680.1| unknown [Zea mays]
gi|223944599|gb|ACN26383.1| unknown [Zea mays]
gi|223948667|gb|ACN28417.1| unknown [Zea mays]
gi|414887962|tpg|DAA63976.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
Length = 450
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 85/269 (31%), Positives = 120/269 (44%), Gaps = 36/269 (13%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSG 139
T Y + LGTP + + VDT +D W+ CAGC+ CPT S FDP+ S++
Sbjct: 109 TPTYVVRASLGTPPQQLLLAVDTSNDASWIPCAGCAGCPTSS-----AAPFDPASSASYR 163
Query: 140 EIACSDNFCRTTYNNRYPSCSPGVR-CEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTA 198
+ C C N +C PG + C + +TY D SS +D + + +GN A
Sbjct: 164 TVPCGSPLCAQAPNA---ACPPGGKACGFSLTYAD-SSLQAALSQDSLAV---AGNAVKA 216
Query: 199 PLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVV 258
FGC R +G T A G+LG G+ S LSQ F++CL
Sbjct: 217 -----YTFGCLQRATG-----TAAPPQGLLGLGRGPLSFLSQ--TKDMYEATFSYCLPSF 264
Query: 259 KG---GGIFAIGDVVSP-KVKTTPMVPNMPH----YNVILEEVEVGGNPLDLPTSLLGTG 310
K G +G P ++KTTP++ N PH Y V + + VG + +P TG
Sbjct: 265 KSLNFSGTLRLGRNGQPQRIKTTPLLAN-PHRSSLYYVNMTGIRVGRKVVPIPAFDPATG 323
Query: 311 DERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
GT++DSGT L Y V + R
Sbjct: 324 A--GTVLDSGTMFTRLVAPAYVAVRDEVR 350
>gi|356498306|ref|XP_003517994.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 484
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 81/273 (29%), Positives = 120/273 (43%), Gaps = 36/273 (13%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPS 133
+G +G YF +VG+G P + YV +DTGSD+ W+ CA CS C +SD +FDP
Sbjct: 140 SGTSQGSGEYFLRVGIGKPPSQAYVVLDTGSDVSWIQCAPCSECYQQSD-----PIFDPI 194
Query: 134 KSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASG 193
S++ I C + C++ C G C Y V+YGDGS T G F + + L A+
Sbjct: 195 SSNSYSPIRCDEPQCKSL---DLSECRNGT-CLYEVSYGDGSYTVGEFATETVTLGSAAV 250
Query: 194 NLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAH 253
+V GCG+ G + G S +Q+ A F++
Sbjct: 251 E--------NVAIGCGHNNEGLFVGAAGLLGLGGGKL-----SFPAQVNATS-----FSY 292
Query: 254 CLDVVKGGGIFAIGDVVSPKVKTTPMVPNMPH------YNVILEEVEVGGNPLDLPTS-- 305
CL V + + + SP + P M + Y + L+ + VGG L +P S
Sbjct: 293 CL-VNRDSDAVSTLEFNSPLPRNAATAPLMRNPELDTFYYLGLKGISVGGEALPIPESSF 351
Query: 306 LLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
+ G IIDSGT + L +YD + F
Sbjct: 352 EVDAIGGGGIIIDSGTAVTRLRSEVYDALRDAF 384
>gi|297834758|ref|XP_002885261.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297331101|gb|EFH61520.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 500
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 86/313 (27%), Positives = 135/313 (43%), Gaps = 46/313 (14%)
Query: 38 KFKAGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYY 97
+F G L + DTR + + + +G +G YF+++G+GTP E Y
Sbjct: 121 RFAVEGIDRSDLKPVNNEDTRYQPEALTTPVV----SGVSQGSGEYFSRIGVGTPAKEMY 176
Query: 98 VQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYP 157
+ +DTGSD+ W+ C CS C +SD +F+P+ SST + CS C +
Sbjct: 177 LVLDTGSDVNWIQCEPCSDCYQQSD-----PVFNPTSSSTYKSLTCSAPQCSLLETS--- 228
Query: 158 SCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLG 217
+C +C Y V+YGDGS T G D + SG + + V GCG+ G
Sbjct: 229 ACRSN-KCLYQVSYGDGSFTVGELATDTVTFGN-SGKI------NDVALGCGHDNEGLFT 280
Query: 218 SSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSPKV--- 274
+ G S+ +Q+ A F++CL V + G + D S ++
Sbjct: 281 GAAGLLGL-----GGGALSITNQMKAT-----SFSYCL-VDRDSGKSSSLDFNSVQLGSG 329
Query: 275 -KTTPMVPNMP---HYNVILEEVEVGGNPLDLPTSLL-----GTGDERGTIIDSGTTLAY 325
T P++ N Y V L VGG + +P ++ G+G G I+D GT +
Sbjct: 330 DATAPLLRNQKIDTFYYVGLSGFSVGGQKVMMPDAIFDVDASGSG---GVILDCGTAVTR 386
Query: 326 LPPMLYDLVLSQF 338
L Y+ + F
Sbjct: 387 LQTQAYNSLRDAF 399
>gi|242041575|ref|XP_002468182.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
gi|241922036|gb|EER95180.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
Length = 490
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 88/277 (31%), Positives = 122/277 (44%), Gaps = 40/277 (14%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSG 139
+G Y K+ +GTP E + +DT SDL W+ C C RC +S +FDP S++
Sbjct: 135 SGEYIAKIAVGTPGVEALLALDTASDLTWLQCQPCRRCYPQSG-----PVFDPRHSTSYR 189
Query: 140 EIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAP 199
E++ + C+ + G C Y V YGDGS+T G F+ + + +G ++
Sbjct: 190 EMSFNAADCQALGRSGGGDAKRGT-CVYTVGYGDGSTTVGDFIEETLTF---AGGVRLPR 245
Query: 200 LNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL-DVV 258
++ GCG+ G G A GILG G+ S +Q+ G F++CL D +
Sbjct: 246 IS----IGCGHDNKGLFG----APAAGILGLGRGLMSFPNQIDHNGT----FSYCLVDFL 293
Query: 259 KGGG------IFAIGDV-VSPKVKTTPMV--PNMP-HYNVILEEVEVGG------NPLDL 302
G G F G V SP V TP V NMP Y V L + VGG DL
Sbjct: 294 SGPGSLSSTLTFGAGAVDTSPPVSFTPTVLNLNMPTFYYVRLTGISVGGVRVPGVTERDL 353
Query: 303 PTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
L G I+DSGT + L Y FR
Sbjct: 354 --QLDPYTGRGGVIVDSGTAVTRLARPAYTAFRDAFR 388
>gi|115465837|ref|NP_001056518.1| Os05g0596000 [Oryza sativa Japonica Group]
gi|55733881|gb|AAV59388.1| unknown protein [Oryza sativa Japonica Group]
gi|57900669|gb|AAW57794.1| unknown protein [Oryza sativa Japonica Group]
gi|113580069|dbj|BAF18432.1| Os05g0596000 [Oryza sativa Japonica Group]
gi|215697162|dbj|BAG91156.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215768162|dbj|BAH00391.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 535
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 86/356 (24%), Positives = 138/356 (38%), Gaps = 59/356 (16%)
Query: 40 KAGGERERTLSALKQHDTRRHGRMMASI-------DLELGGNGHPSATGLYFTKVGLGTP 92
++G ER AL D RR R + + +L + + + G+Y V +GTP
Sbjct: 60 ESGEERREHFRALMAKDMRRMMRQVPELMSKTDMFELPMRSALNIAQVGMYVVVVRIGTP 119
Query: 93 TDEYYVQVDTGSDLLWVNCAGCSR----------CPTKSDLGIK---------------- 126
Y + ++T +++ W+NC R P + + I+
Sbjct: 120 ALPYSLALETANEVTWINCRLRRRKGKHPGRPHVPPAATTMSIQVDDDGGGGGSGGKSKV 179
Query: 127 ----LTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFV 182
+ + P+KSS+ CS C N S C Y D + TSG +
Sbjct: 180 TKVIMNWYRPAKSSSWRRFRCSQRACMDLPYNTCESPDQNTSCTYYQVMKDSTITSGIYG 239
Query: 183 RDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLA 242
++ + + G +K P ++ GC + G +S DGIL G + SS +A
Sbjct: 240 QEKATVAVSDGTMKKLP---GLVIGCSTFEHGGAVNSH----DGILSLGNSPSSF--GIA 290
Query: 243 AAGNVRKEFAHCLDVVKGG-------GIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEV 295
AA + CL G A V +P TP++ Y + + V
Sbjct: 291 AARRFGGRLSFCLLATTSGRNASSYLTFGANPAVQAPGTMETPLLYRDVAYGAHVTGILV 350
Query: 296 GGNPLDLPTSLLGTG------DERGTIIDSGTTLAYLPPMLYDLVLSQFRFWIASL 345
GG PLD+P + G E G I+D+GT++ YL +YD V + +A L
Sbjct: 351 GGQPLDIPPEVWDEGPLGNDNPEAGIILDTGTSITYLVSAVYDPVTAALDSHLAHL 406
>gi|449434646|ref|XP_004135107.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 486
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 92/331 (27%), Positives = 144/331 (43%), Gaps = 66/331 (19%)
Query: 45 RERTLSALKQHDTRRHGRMMASIDLELGG-----------------------------NG 75
+ TLS LK+ D+ R + A IDL + G +G
Sbjct: 85 KSLTLSRLKR-DSARVRSLTARIDLAIRGITGTDLEPLGNGGGGGSQFGTEDFESPIVSG 143
Query: 76 HPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKS 135
+G YF++VG+G P Y+ +DTGSD+ WV CA C+ C ++D +F+P+ S
Sbjct: 144 ASQGSGEYFSRVGIGRPPSPVYMVLDTGSDVSWVQCAPCAECYEQTD-----PIFEPTSS 198
Query: 136 STSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQAS-GN 194
++ ++C C++ C G C Y V+YGDGS T G FV + + L S GN
Sbjct: 199 ASFTSLSCETEQCKSL---DVSECRNGT-CLYEVSYGDGSYTVGDFVTETVTLGSTSLGN 254
Query: 195 LKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHC 254
+ GCG+ G G+LG G + S SQL A+ F++C
Sbjct: 255 ---------IAIGCGHNNEGLF-----IGAAGLLGLGGGSLSFPSQLNASS-----FSYC 295
Query: 255 L--DVVKGGGIFAIGDVVSPKVKTTPMV--PNMPHYNVI-LEEVEVGGNPLDLPTSLLGT 309
L ++P T P+ PN+ + + L + VGG L +P +
Sbjct: 296 LVDRDSDSTSTLDFNSPITPDAVTAPLHRNPNLDTFFYLGLTGMSVGGAVLPIPETSFQM 355
Query: 310 GDE--RGTIIDSGTTLAYLPPMLYDLVLSQF 338
++ G I+DSGT + L +Y+++ F
Sbjct: 356 SEDGNGGIIVDSGTAVTRLQTTVYNVLRDAF 386
>gi|242084330|ref|XP_002442590.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
gi|241943283|gb|EES16428.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
Length = 494
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 87/265 (32%), Positives = 119/265 (44%), Gaps = 35/265 (13%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSG 139
+G Y K+ +GTP + + +DT SDL W+ C C RC +S +FDP S++ G
Sbjct: 131 SGEYMAKIAVGTPAVQALLALDTASDLTWLQCQPCRRCYPQSG-----PVFDPRHSTSYG 185
Query: 140 EIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQA-SGNLKTA 198
E+ C+ + G C Y V YGDG ++ V D+++ +G ++ A
Sbjct: 186 EMNYDAPDCQALGRSGGGDAKRGT-CIYTVQYGDGHGSTSTSVGDLVEETLTFAGGVRQA 244
Query: 199 PLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL-DV 257
L+ GCG+ G G A GILG G+ S+ Q+A G F++CL D
Sbjct: 245 YLS----IGCGHDNKGLFG----APAAGILGLGRGQISIPHQIAFLG-YNASFSYCLVDF 295
Query: 258 VKGGG------IFAIGDV-VSPKVKTTPMV--PNMP-HYNVILEEVEVGG------NPLD 301
+ G G F G V SP TP V NMP Y V L V VGG D
Sbjct: 296 ISGPGSPSSTLTFGAGAVDTSPPASFTPTVLNQNMPTFYYVRLIGVSVGGVRVPGVTERD 355
Query: 302 LPTSLLGTGDERGTIIDSGTTLAYL 326
L L G I+DSGTT+ L
Sbjct: 356 L--QLDPYTGRGGVILDSGTTVTRL 378
>gi|125553570|gb|EAY99279.1| hypothetical protein OsI_21243 [Oryza sativa Indica Group]
gi|125605796|gb|EAZ44832.1| hypothetical protein OsJ_29469 [Oryza sativa Japonica Group]
Length = 534
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 86/356 (24%), Positives = 138/356 (38%), Gaps = 59/356 (16%)
Query: 40 KAGGERERTLSALKQHDTRRHGRMMASI-------DLELGGNGHPSATGLYFTKVGLGTP 92
++G ER AL D RR R + + +L + + + G+Y V +GTP
Sbjct: 59 ESGEERREHFRALMAKDMRRMMRQVPELMSKTDMFELPMRSALNIAQVGMYVVVVRIGTP 118
Query: 93 TDEYYVQVDTGSDLLWVNCAGCSR----------CPTKSDLGIK---------------- 126
Y + ++T +++ W+NC R P + + I+
Sbjct: 119 ALPYSLALETANEVTWINCRLRRRKGKHPGRPHVPPAATTMSIQVDDDGGGGGSGGKSKV 178
Query: 127 ----LTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFV 182
+ + P+KSS+ CS C N S C Y D + TSG +
Sbjct: 179 TKVIMNWYRPAKSSSWRRFRCSQRACMDLPYNTCESPDQNTSCTYYQVMKDSTITSGIYG 238
Query: 183 RDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLA 242
++ + + G +K P ++ GC + G +S DGIL G + SS +A
Sbjct: 239 QEKATVAVSDGTMKKLP---GLVIGCSTFEHGGAVNSH----DGILSLGNSPSSF--GIA 289
Query: 243 AAGNVRKEFAHCLDVVKGG-------GIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEV 295
AA + CL G A V +P TP++ Y + + V
Sbjct: 290 AARRFGGRLSFCLLATTSGRNASSYLTFGANPAVQAPGTMETPLLYRDVAYGAHVTGILV 349
Query: 296 GGNPLDLPTSLLGTG------DERGTIIDSGTTLAYLPPMLYDLVLSQFRFWIASL 345
GG PLD+P + G E G I+D+GT++ YL +YD V + +A L
Sbjct: 350 GGQPLDIPPEVWDEGPLGNDNPEAGIILDTGTSITYLVSAVYDPVTAALDSHLAHL 405
>gi|195626958|gb|ACG35309.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 450
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 85/266 (31%), Positives = 119/266 (44%), Gaps = 36/266 (13%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y + LGTP + + VDT +D W+ CAGC+ CPT S FDP+ S++ +
Sbjct: 112 YVVRASLGTPPQQLLLAVDTSNDASWIPCAGCAGCPTSS-----AAPFDPAASASYRTVP 166
Query: 143 CSDNFCRTTYNNRYPSCSPGVR-CEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLN 201
C C N +C PG + C + +TY D SS +D + + +GN A
Sbjct: 167 CGSPLCAQAPNA---ACPPGGKACGFSLTYAD-SSLQAALSQDSLAV---AGNAVKA--- 216
Query: 202 SSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKG- 260
FGC R +G T A G+LG G+ S LSQ F++CL K
Sbjct: 217 --YTFGCLQRATG-----TAAPPQGLLGLGRGPLSFLSQ--TKDMYEATFSYCLPSFKSL 267
Query: 261 --GGIFAIGDVVSP-KVKTTPMVPNMPH----YNVILEEVEVGGNPLDLPTSLLGTGDER 313
G +G P ++KTTP++ N PH Y V + V VG + +P TG
Sbjct: 268 NFSGTLRLGRNGQPQRIKTTPLLAN-PHRSSLYYVNMTGVRVGRKVVPIPAFDPATGA-- 324
Query: 314 GTIIDSGTTLAYLPPMLYDLVLSQFR 339
GT++DSGT L Y V + R
Sbjct: 325 GTVLDSGTMFTRLVAPAYVAVRDEVR 350
>gi|356569916|ref|XP_003553140.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 560
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 80/277 (28%), Positives = 117/277 (42%), Gaps = 30/277 (10%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSG 139
+G YF V +GTP + + +DTGSDL W+ C C C ++ +DP SS+
Sbjct: 192 SGEYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCYACFEQNG-----PYYDPKDSSSFK 246
Query: 140 EIACSDNFCRTTYNNRYPSCSPG--VRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKT 197
I C D C+ + P G C Y YGD S+T+G F + +N + K
Sbjct: 247 NITCHDPRCQLVSSPDPPQPCKGETQSCPYFYWYGDSSNTTGDFALETFTVNLTTPEGKP 306
Query: 198 A-PLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL- 255
+ +V+FGCG+ G + G+ S +QL + F++CL
Sbjct: 307 ELKIVENVMFGCGHWNRGLFHGAAGLLGL-----GRGPLSFATQLQSL--YGHSFSYCLV 359
Query: 256 -----DVVKGGGIFAIGD--VVSPKVKTTPMV-----PNMPHYNVILEEVEVGGNPLDLP 303
V IF + P + T V P Y V+++ + VGG L +P
Sbjct: 360 DRNSNSSVSSKLIFGEDKELLSHPNLNFTSFVGGKENPVDTFYYVLIKSIMVGGEVLKIP 419
Query: 304 --TSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
T L GTIIDSGTTL Y Y+++ F
Sbjct: 420 EETWHLSAQGGGGTIIDSGTTLTYFAEPAYEIIKEAF 456
>gi|218187618|gb|EEC70045.1| hypothetical protein OsI_00635 [Oryza sativa Indica Group]
Length = 570
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 70/266 (26%), Positives = 105/266 (39%), Gaps = 39/266 (14%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y V LG+P DTGSDL+WV C + S T FDPS+SST G ++
Sbjct: 101 YLMTVNLGSPPRSMLAIADTGSDLVWVKCKKGNN--DTSSAAAPTTQFDPSRSSTYGRVS 158
Query: 143 CSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQ-ASGNLKTAPLN 201
C + C +C G C Y+ YGDGS+T+G + + +G
Sbjct: 159 CQTDACEALGRA---TCDDGSNCAYLYAYGDGSNTTGVLSTETFTFDDGGAGRSPRQVRI 215
Query: 202 SSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL-----D 256
V FGC +G + + SL++QL A ++ + F++CL +
Sbjct: 216 GGVKFGCSTATAGSFPADGLVGLG------GGAVSLVTQLGGATSLGRRFSYCLVPHSVN 269
Query: 257 VVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTI 316
A+ DV P +TP+V N + + I
Sbjct: 270 ASSALNFGALADVTEPGAASTPLVGN----------------------KTVASAASSRII 307
Query: 317 IDSGTTLAYLPPMLYDLVLSQFRFWI 342
+DSGTTL +L P L ++ + I
Sbjct: 308 VDSGTTLTFLDPSLLGPIVDELSRRI 333
>gi|88174573|gb|ABD39361.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
Group]
Length = 321
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 83/278 (29%), Positives = 127/278 (45%), Gaps = 52/278 (18%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y T VGLGTP+ V++DTGS WV C C C T F S+S+T +++
Sbjct: 1 YVTSVGLGTPSKTQIVEIDTGSSTSWVFCE-CDGCHTNP------RTFLQSRSTTCAKVS 53
Query: 143 C---------SDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASG 193
C SD C+ + N YP C + V+Y DGS++ G +D + +
Sbjct: 54 CGTSMCLLGGSDPHCQDSEN--YPD------CPFRVSYQDGSASYGILYQDTLTFS---- 101
Query: 194 NLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAH 253
+++ P S FGC N S G++ VDG+LG G S+L Q + + F++
Sbjct: 102 DVQKIP---SFTFGC-NLDS--FGANEFGNVDGLLGMGAGPMSVLKQSSPTFD---GFSY 152
Query: 254 CLDVVKG--------GGIFAIGDVVS-PKVKTTPMV---PNMPHYNVILEEVEVGGNPLD 301
CL + K G F++G V + V+ T MV N + V L + V G L
Sbjct: 153 CLPLQKSERGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLG 212
Query: 302 LPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
L S+ +G + DSG+ L+Y+P ++ + R
Sbjct: 213 LSPSIFS---RKGVVFDSGSELSYIPDRALSVLSQRIR 247
>gi|449461377|ref|XP_004148418.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
gi|449518059|ref|XP_004166061.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
Length = 436
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 83/270 (30%), Positives = 119/270 (44%), Gaps = 36/270 (13%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGE 140
G Y +V LGTP Y+ +DT +D W C+GC C + T F SST
Sbjct: 93 GNYVVRVQLGTPGQTMYMVLDTSNDAAWAPCSGCIGCSST-------TTFSAQNSSTFAT 145
Query: 141 IACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPL 200
+ CS C P+ V C + TYG S+ S V+D + L +
Sbjct: 146 LDCSKPECTQARGLSCPTTG-NVDCLFNQTYGGDSTFSATLVQDSLHLGPN--------V 196
Query: 201 NSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRK-EFAHCLDVVK 259
+ FGC + SG + G++G G+ SL+SQ +G++ F++CL K
Sbjct: 197 IPNFSFGCISSASG-----SSIPPQGLMGLGRGPLSLISQ---SGSLYSGLFSYCLPSFK 248
Query: 260 G---GGIFAIGDVVSPK-VKTTPMVPNMPH----YNVILEEVEVGGNPLDLPTSLLGTGD 311
G +G V PK ++TTP++ N PH Y V L + VG + + LL
Sbjct: 249 SYYFSGSLKLGPVGQPKAIRTTPLLHN-PHRPSLYYVNLTGISVGRVLVPISPELLAFDP 307
Query: 312 ER--GTIIDSGTTLAYLPPMLYDLVLSQFR 339
GTIIDSGT + P +Y V +FR
Sbjct: 308 NTGAGTIIDSGTVITRFVPAIYTAVRDEFR 337
>gi|115484725|ref|NP_001067506.1| Os11g0215400 [Oryza sativa Japonica Group]
gi|77549255|gb|ABA92052.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113644728|dbj|BAF27869.1| Os11g0215400 [Oryza sativa Japonica Group]
Length = 428
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 84/281 (29%), Positives = 127/281 (45%), Gaps = 52/281 (18%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSG 139
T LY VGLGTP V++DTGS WV C C C T F S+S+T
Sbjct: 79 TSLYVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNP------RTFLQSRSTTCA 131
Query: 140 EIAC---------SDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQ 190
+++C SD C+ + N YP C + V+Y DGS++ G +D + +
Sbjct: 132 KVSCGTSMCLLGGSDPHCQDSEN--YPD------CPFRVSYQDGSASYGILYQDTLTFS- 182
Query: 191 ASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKE 250
+++ P S FGC N S G++ VDG+LG G S+L Q + +
Sbjct: 183 ---DVQKIPGFS---FGC-NMDS--FGANEFGNVDGLLGMGAGPMSVLKQSSPTFDC--- 230
Query: 251 FAHCLDVVKG--------GGIFAIGDVVS-PKVKTTPMV---PNMPHYNVILEEVEVGGN 298
F++CL + K G F++G V + V+ T MV N + V L + V G
Sbjct: 231 FSYCLPLQKSERGFFSKTTGYFSLGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGE 290
Query: 299 PLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
L L S+ +G + DSG+ L+Y+P ++ + R
Sbjct: 291 RLGLSPSVFS---RKGVVFDSGSELSYIPDRALSVLSQRIR 328
>gi|357118871|ref|XP_003561171.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 506
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 81/259 (31%), Positives = 115/259 (44%), Gaps = 44/259 (16%)
Query: 100 VDTGSDLLWVNCAGCSR--CPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYP 157
VDT SD+ WV CA C + C +SD+ L+DP+KS S CS CR+ RY
Sbjct: 178 VDTASDVPWVQCAPCPQPQCYAQSDV-----LYDPTKSILSAPFPCSSPQCRSL--GRYA 230
Query: 158 SCSPGV----RCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNR-- 211
+ G C+Y V Y DGS TSG +V D++ LN + K A S FGC +
Sbjct: 231 NGCTGAGNTGTCQYRVLYPDGSGTSGTYVSDLLTLN---ADPKGA--VSKFQFGCSHALL 285
Query: 212 QSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVV-KGGGIFAIG--- 267
+ G + T G + G+ SL SQ + F++CL G ++G
Sbjct: 286 RPGSFNNKT----AGFMALGRGAQSLSSQTKGTFSKGNVFSYCLPPTGSHKGFLSLGVPQ 341
Query: 268 -----DVVSP--KVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSG 320
V+P K K PM+ Y V L ++V G L +P ++ +DS
Sbjct: 342 HAASRYAVTPMLKSKMAPMI-----YMVRLIGIDVAGQRLPVPPAVFAA----NAAMDSR 392
Query: 321 TTLAYLPPMLYDLVLSQFR 339
T + LPP Y + + FR
Sbjct: 393 TIITRLPPTAYMALRAAFR 411
>gi|147814824|emb|CAN65806.1| hypothetical protein VITISV_015630 [Vitis vinifera]
Length = 449
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 75/286 (26%), Positives = 129/286 (45%), Gaps = 32/286 (11%)
Query: 76 HPSA---TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSR---CPTKSDLGIKLT- 128
HP+A G YF +GTP+ ++ + DTGSDL W++C R C + I+
Sbjct: 73 HPAADYGIGQYFVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKR 132
Query: 129 LFDPSKSSTSGEIACSDNFCRTTYNNRYP--SC-SPGVRCEYVVTYGDGSSTSGYFVRDI 185
+F + SS+ I C + C+ + + +C +P C Y Y DGS+ G+F +
Sbjct: 133 VFHANLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANET 192
Query: 186 IQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAG 245
+ + G + L+ +V+ GC G + A DG++G G + S + AA
Sbjct: 193 VTVELKEG--RKMKLH-NVLIGCSESFQGQ----SFQAADGVMGLGYSKYSF--AIKAAE 243
Query: 246 NVRKEFAHCL-DVVKGGGIFAIGDVVSPKVKTTPMVPNMPH-----------YNVILEEV 293
+F++CL D + + S + K ++ NM + Y V + +
Sbjct: 244 KFGGKFSYCLVDHLSHKNVSNYLTFGSSRSKEA-LLNNMTYTELVLGMVNSFYAVNMMGI 302
Query: 294 EVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
+GG L +P+ + GTI+DSG++L +L Y V++ R
Sbjct: 303 SIGGAMLKIPSEVWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALR 348
>gi|116789442|gb|ABK25248.1| unknown [Picea sitchensis]
Length = 366
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 63/216 (29%), Positives = 102/216 (47%), Gaps = 33/216 (15%)
Query: 13 TVAVVHQWAV---GGGGVMGNFVFEVENKFKAGGER--------ERTLSALKQHDTRRHG 61
+V VVH+ A+ ++ ++ K + R ERTL+ K R
Sbjct: 75 SVEVVHRDALLLKNAANATASYERRLKEKLRREAVRVRGLERQIERTLTLNKDPVNRYEN 134
Query: 62 RMMASIDLELGG---NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCP 118
+A +D + GG +G +G YFT++G+GTPT E Y+ +DTGSD+ W+ C C C
Sbjct: 135 --VAEVDADFGGEVVSGMEQGSGEYFTRIGVGTPTREQYMVLDTGSDVAWIQCEPCRECY 192
Query: 119 TKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTS 178
+++D +F+PS S++ + C C + Y S G C Y +YGDGS ++
Sbjct: 193 SQAD-----PIFNPSYSASFSTVGCDSAVCSQL--DAYDCHSGG--CLYEASYGDGSYST 243
Query: 179 GYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSG 214
G F + + S ++V GCG++ G
Sbjct: 244 GSFATETLTFGTTS--------VANVAIGCGHKNVG 271
>gi|77808087|gb|AAS48510.2| aspartic protease [Fagopyrum esculentum]
gi|82780908|gb|ABB88696.2| aspartic proteinase-like protein [Fagopyrum esculentum]
Length = 447
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 84/320 (26%), Positives = 138/320 (43%), Gaps = 54/320 (16%)
Query: 49 LSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLW 108
L+AL RH + ++ ++ +P + G Y LGTP + + +DTGS L+W
Sbjct: 40 LAALSSLSRARHLKRPPTLTGKVTLPAYPRSYGGYSVIFSLGTPPQKVSLVLDTGSSLVW 99
Query: 109 VNCA------GCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPG 162
C C C K+ ++ +KSST + C C + + +CS
Sbjct: 100 TPCTIPTATYTCQNCTFSGVDPTKIPIYARNKSSTVQSLPCRSPKCNWVFGSDL-NCSTT 158
Query: 163 VRCEYV-VTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGC---GNRQSGDLGS 218
RC Y + YG G ST+G V D++ L++ L P +FGC NRQ
Sbjct: 159 KRCPYYGLEYGLG-STTGQLVSDVLGLSK----LNRIP---DFLFGCSLVSNRQP----- 205
Query: 219 STDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL------DVVKGGGIF-------- 264
+GI GFG+ +S+ +QL +F++CL D + G +
Sbjct: 206 ------EGIAGFGRGLASIPAQLGLT-----KFSYCLVSHRFDDTPQSGDLVLHRGRRHA 254
Query: 265 ---AIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDE--RGTIIDS 319
A G +P K+ + P +Y + L ++ VGG + +P L E G I+DS
Sbjct: 255 DAAANGVAYAPFTKSPALSPYSEYYYISLSKILVGGKDVPIPPRYLVPSKEGDGGMIVDS 314
Query: 320 GTTLAYLPPMLYDLVLSQFR 339
G+T ++ +++D V +
Sbjct: 315 GSTFTFMERIIFDPVARELE 334
>gi|356502456|ref|XP_003520035.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 484
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 83/273 (30%), Positives = 121/273 (44%), Gaps = 36/273 (13%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPS 133
+G +G YF +VG+G P + YV +DTGSD+ W+ CA CS C +SD +FDP
Sbjct: 140 SGTSQGSGEYFLRVGIGKPPSQAYVVLDTGSDVSWIQCAPCSECYQQSD-----PIFDPV 194
Query: 134 KSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASG 193
S++ I C C++ C G C Y V+YGDGS T G F + + L A+
Sbjct: 195 SSNSYSPIRCDAPQCKSL---DLSECRNGT-CLYEVSYGDGSYTVGEFATETVTLGTAAV 250
Query: 194 NLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAH 253
+V GCG+ G + G S +Q+ A F++
Sbjct: 251 E--------NVAIGCGHNNEGLFVGAAGLLGLGGGKL-----SFPAQVNATS-----FSY 292
Query: 254 CLDVVKGGGIFAIGDVVSP---KVKTTPMVPNMP---HYNVILEEVEVGGNPLDLPTSL- 306
CL V + + + SP V T P+ N Y + L+ + VGG L +P S+
Sbjct: 293 CL-VNRDSDAVSTLEFNSPLPRNVVTAPLRRNPELDTFYYLGLKGISVGGEALPIPESIF 351
Query: 307 -LGTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
+ G IIDSGT + L +YD + F
Sbjct: 352 EVDAIGGGGIIIDSGTAVTRLRSEVYDALRDAF 384
>gi|115473845|ref|NP_001060521.1| Os07g0658600 [Oryza sativa Japonica Group]
gi|22775625|dbj|BAC15479.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
gi|50510141|dbj|BAD31106.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
gi|113612057|dbj|BAF22435.1| Os07g0658600 [Oryza sativa Japonica Group]
Length = 449
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 89/307 (28%), Positives = 135/307 (43%), Gaps = 46/307 (14%)
Query: 52 LKQHDTRRHGRMMASIDLELGGNGH-PSATG-------LYFTKVGLGTPTDEYYVQVDTG 103
L R R++ L + G + P A+G Y + LGTP + + VDT
Sbjct: 68 LADQAARDASRLLYLDSLAVKGRAYAPIASGRQLLQTPTYVVRARLGTPAQQLLLAVDTS 127
Query: 104 SDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGV 163
+D W+ C+GC+ CPT S F+P+ S++ + C C N PSCSP
Sbjct: 128 NDAAWIPCSGCAGCPTSSP-------FNPAASASYRPVPCGSPQCVLAPN---PSCSPNA 177
Query: 164 R-CEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDA 222
+ C + ++Y D SS +D + + +G++ A FGC R +G T A
Sbjct: 178 KSCGFSLSYAD-SSLQAALSQDTLAV---AGDVVKA-----YTFGCLQRATG-----TAA 223
Query: 223 AVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKG---GGIFAIGDVVSP-KVKTTP 278
G+LG G+ S LSQ F++CL K G +G P ++KTTP
Sbjct: 224 PPQGLLGLGRGPLSFLSQ--TKDMYGATFSYCLPSFKSLNFSGTLRLGRNGQPRRIKTTP 281
Query: 279 MVPNMPH----YNVILEEVEVGGNPLDLPTSLLG--TGDERGTIIDSGTTLAYLPPMLYD 332
++ N PH Y V + + VG + +P S L GT++DSGT L +Y
Sbjct: 282 LLAN-PHRSSLYYVNMTGIRVGKKVVSIPASALAFDPATGAGTVLDSGTMFTRLVAPVYL 340
Query: 333 LVLSQFR 339
+ + R
Sbjct: 341 ALRDEVR 347
>gi|125571841|gb|EAZ13356.1| hypothetical protein OsJ_03278 [Oryza sativa Japonica Group]
Length = 447
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 55/160 (34%), Positives = 80/160 (50%), Gaps = 24/160 (15%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPS 133
+G P +G YF VG+GTP+ + + +DTGSDL+W+ C+ C RC + +FDP
Sbjct: 77 SGIPFESGEYFALVGVGTPSTKAMLVIDTGSDLVWLQCSPCRRC-----YAQRGQVFDPR 131
Query: 134 KSSTSGEIACSDNFCRTTYNNRYPSC----SPGVRCEYVVTYGDGSSTSGYFVRDIIQLN 189
+SST + CS CR R+P C + G C Y+V YGDGSS++G D +
Sbjct: 132 RSSTYRRVPCSSPQCRAL---RFPGCDSGGAAGGGCRYMVAYGDGSSSTGDLATDKLAFA 188
Query: 190 QASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILG 229
+ ++V GCG G S+ G+LG
Sbjct: 189 NDT-------YVNNVTLGCGRDNEGLFDSAA-----GLLG 216
>gi|79507883|ref|NP_196320.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|332003717|gb|AED91100.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 455
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 97/331 (29%), Positives = 148/331 (44%), Gaps = 53/331 (16%)
Query: 32 VFEVENK---FKAGGE---RERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATG---- 81
+F +++ FK+ R L L Q R+ L G + P A+G
Sbjct: 55 IFHIDSPCSPFKSSSPLSWEARVLQTLAQD----QARLQYLSSLVAGRSVVPIASGRQML 110
Query: 82 ---LYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTS 138
Y K +GTP + +DT SD+ W+ C+GC CP+ T F P+KS++
Sbjct: 111 QSTTYIVKALIGTPAQPLLLAMDTSSDVAWIPCSGCVGCPSN-------TAFSPAKSTSF 163
Query: 139 GEIACSDNFCRTTYNNRYPSCSPGVR-CEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKT 197
++CS C+ N P+C G R C + +TYG SS + +D I+L A+ +K
Sbjct: 164 KNVSCSAPQCKQVPN---PTC--GARACSFNLTYGS-SSIAANLSQDTIRL--AADPIK- 214
Query: 198 APLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDV 257
+ FGC N+ +G T G+LG G+ SL+SQ A + F++CL
Sbjct: 215 -----AFTFGCVNKVAGG---GTIPPPQGLLGLGRGPLSLMSQ--AQSIYKSTFSYCLPS 264
Query: 258 VKG---GGIFAIGDVVSP-KVKTTPMVPNMPH---YNVILEEVEVGGNPLDLPTSLLGTG 310
+ G +G P +VK T ++ N Y V L + VG +DLP + +
Sbjct: 265 FRSLTFSGSLRLGPTSQPQRVKYTQLLRNPRRSSLYYVNLVAIRVGRKVVDLPPAAIAFN 324
Query: 311 DER--GTIIDSGTTLAYLPPMLYDLVLSQFR 339
GTI DSGT L +Y+ V ++FR
Sbjct: 325 PSTGAGTIFDSGTVYTRLAKPVYEAVRNEFR 355
>gi|413951979|gb|AFW84628.1| putative aspartic protease family protein [Zea mays]
Length = 435
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 78/267 (29%), Positives = 120/267 (44%), Gaps = 38/267 (14%)
Query: 87 VGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDN 146
+ +GTP + +DTGS+L W+ CA T F P S+T + C
Sbjct: 65 LAVGTPPQNVTMVLDTGSELSWLLCA------TGRAAAAAADSFRPRASATFAAVPCGSA 118
Query: 147 FCRTTYNNRYPSC-SPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVI 205
C + PSC + RC ++Y DGS++ G D+ + A PL S+
Sbjct: 119 RCSSRDLPAPPSCDAASRRCRVSLSYADGSASDGALATDVFAVGDAP------PLRSA-- 170
Query: 206 FGCGNRQSGDLGSSTDA-AVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIF 264
FGC S SS DA A G+LG + S ++Q + + F++C+ G+
Sbjct: 171 FGC---MSAAYDSSPDAVATAGLLGMNRGALSFVTQAST-----RRFSYCISDRDDAGVL 222
Query: 265 AIG--DVVSPKVKTTPM---VPNMPH-----YNVILEEVEVGGNPLDLPTSLLGTGDERG 314
+G D+ + TP+ P +P+ Y+V L + VGG PL +P S+L D G
Sbjct: 223 LLGHSDLPFLPLNYTPLYQPTPPLPYFDRVAYSVQLLGIRVGGKPLPIPPSVLAP-DHTG 281
Query: 315 ---TIIDSGTTLAYLPPMLYDLVLSQF 338
T++DSGT +L Y V ++F
Sbjct: 282 AGQTMVDSGTQFTFLLGDAYSAVKAEF 308
>gi|302802500|ref|XP_002983004.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
gi|300149157|gb|EFJ15813.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
Length = 332
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 81/271 (29%), Positives = 123/271 (45%), Gaps = 56/271 (20%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGE 140
G+Y++ + LG+P ++ + +DTGSDL WV C CS P S + FD S+T
Sbjct: 1 GVYYSTITLGSPPKDFSLVMDTGSDLTWVRCDPCS--PDCS------STFDRLASNTYKA 52
Query: 141 IACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQL-NQASGNLKTAP 199
+ C+D +Y YGDGS T G D +++ AS L+ P
Sbjct: 53 LTCAD--------------------DYSYGYGDGSFTQGDLSVDTLKMAGAASDELEEFP 92
Query: 200 LNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAA-GNVRKEFAHCL--- 255
+FGCG+ G + GIL + S SQ+ GN +F++CL
Sbjct: 93 ---GFVFGCGSLLKGLISGEV-----GILALSPGSLSFPSQIGEKYGN---KFSYCLLRQ 141
Query: 256 ----DVVKGGGIF--AIGDVVSP------KVKTTPMVPNMPHYNVILEEVEVGGNPLDLP 303
+ K +F A ++ P +++ TP+ + +Y V L+ + VG LDL
Sbjct: 142 TAQNSLKKSPMVFGEAAVELKEPGSGKLQELQYTPIGESSIYYTVRLDGISVGNQRLDLS 201
Query: 304 TSLLGTGDERGTIIDSGTTLAYLPPMLYDLV 334
S G ++ TI DSGTTL LPP + D +
Sbjct: 202 PSAFLNGQDKPTIFDSGTTLTMLPPGVCDSI 232
>gi|224072901|ref|XP_002303933.1| predicted protein [Populus trichocarpa]
gi|222841365|gb|EEE78912.1| predicted protein [Populus trichocarpa]
Length = 370
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 81/267 (30%), Positives = 115/267 (43%), Gaps = 39/267 (14%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y K +GTP + +D D W+ C GC +G T+F+ KS+T +
Sbjct: 35 YIVKAKVGTPPQTLLMALDNSYDAAWIPCKGC--------VGCSSTVFNTVKSTTFKTLG 86
Query: 143 CSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNS 202
C C+ N P C G C + TYG + S RD I L ++ P +
Sbjct: 87 CGAPQCKQVPN---PICG-GSTCTWNTTYGSSTILSN-LTRDTIAL-----SMDPVPYYA 136
Query: 203 SVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKE-FAHCLDVVKG- 260
FGC + +G + G+LGFG+ S LSQ N+ K F++CL +
Sbjct: 137 ---FGCIQKATG-----SSVPPQGLLGFGRGPLSFLSQ---TQNLYKSTFSYCLPSFRTL 185
Query: 261 --GGIFAIGDV-VSPKVKTTPMVPNMPH---YNVILEEVEVGGNPLDLPTSLLGTGDER- 313
G +G V P++KTTP++ N Y V L + VG +D+P S L
Sbjct: 186 NFSGSLRLGPVGQPPRIKTTPLLKNPRRSSLYYVKLNGIRVGRKIVDIPRSALAFNPTTG 245
Query: 314 -GTIIDSGTTLAYLPPMLYDLVLSQFR 339
GTI DSGT L Y V ++FR
Sbjct: 246 AGTIFDSGTVFTRLVAPAYIAVRNEFR 272
>gi|88174575|gb|ABD39362.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
Group]
Length = 321
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 83/278 (29%), Positives = 126/278 (45%), Gaps = 52/278 (18%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y T VGLGTP V++DTGS WV C C C T F S+S+T +++
Sbjct: 1 YVTSVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNP------RTFLQSRSTTCAKVS 53
Query: 143 C---------SDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASG 193
C SD C+ + N YP C + V+Y DGS++ G +D + +
Sbjct: 54 CGTSMCLLGGSDPHCQDSEN--YPDCP------FRVSYQDGSASYGILYQDTLTFS---- 101
Query: 194 NLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAH 253
+++ P S FGC N S G++ VDG+LG G S+L Q + + F++
Sbjct: 102 DVQKIP---SFTFGC-NLDS--FGANEFGNVDGLLGMGAGPMSVLKQSSPTFD---GFSY 152
Query: 254 CLDVVKG--------GGIFAIGDVVS-PKVKTTPMV---PNMPHYNVILEEVEVGGNPLD 301
CL + K G F++G V + V+ T MV N + V L + V G L
Sbjct: 153 CLPLQKSERGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLG 212
Query: 302 LPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
L S+ +G + DSG+ L+Y+P ++ + R
Sbjct: 213 LSPSIFS---RKGVVFDSGSELSYIPDRALSVLSQRIR 247
>gi|9759559|dbj|BAB11161.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|21553652|gb|AAM62745.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
gi|109134179|gb|ABG25087.1| At5g07030 [Arabidopsis thaliana]
Length = 439
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 94/310 (30%), Positives = 140/310 (45%), Gaps = 47/310 (15%)
Query: 47 RTLSALKQHDTRRHGRMMASIDLELGGNGHPSATG-------LYFTKVGLGTPTDEYYVQ 99
R L L Q R+ L G + P A+G Y K +GTP +
Sbjct: 60 RVLQTLAQD----QARLQYLSSLVAGRSVVPIASGRQMLQSTTYIVKALIGTPAQPLLLA 115
Query: 100 VDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSC 159
+DT SD+ W+ C+GC CP+ T F P+KS++ ++CS C+ N P+C
Sbjct: 116 MDTSSDVAWIPCSGCVGCPSN-------TAFSPAKSTSFKNVSCSAPQCKQVPN---PTC 165
Query: 160 SPGVR-CEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGS 218
G R C + +TYG SS + +D I+L A+ +K + FGC N+ +G
Sbjct: 166 --GARACSFNLTYGS-SSIAANLSQDTIRL--AADPIK------AFTFGCVNKVAGG--- 211
Query: 219 STDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKG---GGIFAIGDVVSP-KV 274
T G+LG G+ SL+SQ A + F++CL + G +G P +V
Sbjct: 212 GTIPPPQGLLGLGRGPLSLMSQ--AQSIYKSTFSYCLPSFRSLTFSGSLRLGPTSQPQRV 269
Query: 275 KTTPMVPNMPH---YNVILEEVEVGGNPLDLPTSLLGTGDER--GTIIDSGTTLAYLPPM 329
K T ++ N Y V L + VG +DLP + + GTI DSGT L
Sbjct: 270 KYTQLLRNPRRSSLYYVNLVAIRVGRKVVDLPPAAIAFNPSTGAGTIFDSGTVYTRLAKP 329
Query: 330 LYDLVLSQFR 339
+Y+ V ++FR
Sbjct: 330 VYEAVRNEFR 339
>gi|88174591|gb|ABD39370.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 83/278 (29%), Positives = 126/278 (45%), Gaps = 52/278 (18%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y T VGLGTP V++DTGS WV C C C T F S+S+T +++
Sbjct: 1 YVTSVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNP------RTFLQSRSTTCAKVS 53
Query: 143 C---------SDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASG 193
C SD C+ + N YP C + V+Y DGS++ G +D + +
Sbjct: 54 CGTSMCLLGGSDPHCQDSEN--YPDCP------FRVSYQDGSASYGILYQDTLTFS---- 101
Query: 194 NLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAH 253
+++ P S FGC N S G++ VDG+LG G S+L Q + + F++
Sbjct: 102 DVQKIP---SFTFGC-NLDS--FGANEFGNVDGLLGMGAGPMSVLKQSSPTFD---GFSY 152
Query: 254 CLDVVKG--------GGIFAIGDVVS-PKVKTTPMV---PNMPHYNVILEEVEVGGNPLD 301
CL + K G F++G V + V+ T MV N + V L + V G L
Sbjct: 153 CLPLQKSERGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLG 212
Query: 302 LPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
L S+ +G + DSG+ L+Y+P ++ + R
Sbjct: 213 LSPSIFS---RKGVVFDSGSELSYIPDRALSVLSQRIR 247
>gi|255553149|ref|XP_002517617.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223543249|gb|EEF44781.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 449
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 85/291 (29%), Positives = 127/291 (43%), Gaps = 42/291 (14%)
Query: 61 GRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTK 120
R + D+ GG G Y ++ +G P E DTGSDL+WV C C C +
Sbjct: 78 ARALVQSDIVPGG-------GEYLMRISIGNPQVEILAIADTGSDLIWVQCQPCEMCYKQ 130
Query: 121 SDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPG---VRCEYVVTYGDGSST 177
+ +FDP +SS+ + C + FC + SC C Y +YGD S +
Sbjct: 131 NS-----PIFDPRRSSSYRNVLCGNEFC-NKLDGEARSCDARGFVKTCGYTYSYGDQSFS 184
Query: 178 SGYFVRDIIQLNQASGNLKTA-PLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSS 236
G+ + + + N A V FGCG + G D GI+G G + S
Sbjct: 185 DGHLAIERFGIGSTNSNTSAAIAYFQEVAFGCGTKNGGTF----DELGSGIIGLGGGSMS 240
Query: 237 LLSQLAAAGNVRKEFAHCL-----------DVVKGGGIFAIGDVVSPKVKTTPMVPNMP- 284
L+SQL + +F++CL + G I G + V +TP++P P
Sbjct: 241 LVSQLGP--KLSGKFSYCLVPTSEQSNYTSKINFGNDINISGS--NYNVVSTPLLPKKPE 296
Query: 285 -HYNVILEEVEVGGNPLDLPTSLLGTGD-ERGT-IIDSGTTLAYLPPMLYD 332
+Y + LE + V LP + L G+ E+G IIDSGTTL +L ++
Sbjct: 297 TYYYLTLEAISVENK--RLPYTNLWNGEVEKGNIIIDSGTTLTFLDSEFFN 345
>gi|224109494|ref|XP_002315215.1| predicted protein [Populus trichocarpa]
gi|222864255|gb|EEF01386.1| predicted protein [Populus trichocarpa]
Length = 444
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 72/266 (27%), Positives = 127/266 (47%), Gaps = 40/266 (15%)
Query: 89 LGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFC 148
+GTP + +DTGS+L W+ RC + + ++F+P S T +I CS C
Sbjct: 73 IGTPPQNITMVLDTGSELSWL------RCKKEPNFT---SIFNPLASKTYTKIPCSSQTC 123
Query: 149 RT-TYNNRYP-SCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIF 206
+T T + P +C P C ++++Y D SS G+ + + G+L T P + +F
Sbjct: 124 KTRTSDLTLPVTCDPAKLCHFIISYADASSVEGHLAFETFRF----GSL-TRP---ATVF 175
Query: 207 GCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAI 266
GC + S + DA G++G + + S ++Q+ ++F++C+ + G +
Sbjct: 176 GCMDSGSSS-NTEEDAKTTGLMGMNRGSLSFVNQMGF-----RKFSYCISGLDSTGFLLL 229
Query: 267 GDVVSPKVKT---TPMVP---NMPH-----YNVILEEVEVGGNPLDLPTSLLGTGDERG- 314
G+ +K TP+V +P+ Y+V LE ++V L LP S+ D G
Sbjct: 230 GEARYSWLKPLNYTPLVQISTPLPYFDRVAYSVQLEGIKVNNKVLPLPKSVF-VPDHTGA 288
Query: 315 --TIIDSGTTLAYLPPMLYDLVLSQF 338
T++DSGT +L +Y + +F
Sbjct: 289 GQTMVDSGTQFTFLLGPVYSALRKEF 314
>gi|357113696|ref|XP_003558637.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 432
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 85/319 (26%), Positives = 139/319 (43%), Gaps = 44/319 (13%)
Query: 47 RTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGL----YFTKVGLGTPTDEYYVQVDT 102
++ AL + D R ++S G + P A+G Y + GLG+P + +DT
Sbjct: 38 ESIIALAREDDARL-LFLSSKAASTGVSSAPVASGQSPPSYVVRAGLGSPAQPILLALDT 96
Query: 103 GSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRT------TYNNRY 156
+D W +C+ C CP+ +LF P+ S++ + CS C + Y
Sbjct: 97 SADATWAHCSPCGTCPSSG------SLFAPANSTSYAPLPCSSTMCTVLQGQPCPAQDPY 150
Query: 157 PSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDL 216
S +P C + + D S + D + L K A N + FGC + SG
Sbjct: 151 DSSAPLPMCAFTKPFADASFQAS-LASDWLHLG------KDAIPNYA--FGCVSAVSGP- 200
Query: 217 GSSTDAAVDGILGFGQANSSLLSQLAAAGNVRK-EFAHCLDVVKG---GGIFAIGDVVSP 272
+ + G+LG G+ +LLSQ+ GN+ F++CL K G +G P
Sbjct: 201 --TANLPKQGLLGLGRGPMALLSQV---GNMYNGVFSYCLPSYKSYYFSGSLRLGAAGQP 255
Query: 273 K-VKTTPMVPNMPH----YNVILEEVEVGGNPLDLPTSLLG--TGDERGTIIDSGTTLAY 325
+ V+ TPM+ N P+ Y V + + VG P+ +P GT++DSGT +
Sbjct: 256 RGVRYTPMLKN-PNRSSLYYVNVTGLSVGRAPVKVPAGSFAFDPATGAGTVVDSGTVITR 314
Query: 326 LPPMLYDLVLSQFRFWIAS 344
P +Y + +FR +A+
Sbjct: 315 WTPPVYAALREEFRRHVAA 333
>gi|326517745|dbj|BAK03791.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 556
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 87/291 (29%), Positives = 126/291 (43%), Gaps = 30/291 (10%)
Query: 46 ERTLSALKQH--DTRRHG---RMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQV 100
R S ++QH +TR G M + ++L NG + L+ + LGTP V V
Sbjct: 165 HRDHSCVQQHLGNTRSSGNIVEMDLPLPIDLIQNGDIN-NFLFLMPIKLGTPPVWNLVAV 223
Query: 101 DTGSDLLWVNCAGCS-RCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSC 159
DTG+ L +V C C+ RC ++D G +FDPSKS + + CS+N CRT +
Sbjct: 224 DTGATLSFVQCEPCTLRCHKQTDAG---EIFDPSKSESFSRVGCSENKCRTVQRALHLQS 280
Query: 160 SPGVR----CEYVVTYGDGSSTS-GYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSG 214
+ C Y +T+G SS S G VRD + A G +FGC
Sbjct: 281 KACMEKEDSCLYSMTFGGTSSYSVGKLVRDRL----AIGKYAKGYSFPDFLFGCS----- 331
Query: 215 DLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDV-VKGGGIFAIGDVVSPK 273
L + G++GF S Q+A N K F++C + G +IGD
Sbjct: 332 -LDTEYHQYEAGLVGFADEPFSFFEQVAPLVNY-KAFSYCFPSDRRKTGYLSIGDYTRVN 389
Query: 274 VKTTP--MVPNMPHYNVILEEVEVGGNPL-DLPTSLLGTGDERGTIIDSGT 321
TP + Y + L+EV V G L P+ ++ R TI+ S T
Sbjct: 390 STYTPLFLARQQSRYALKLDEVLVNGMALVTTPSEMIVDSGSRWTILLSDT 440
>gi|45735845|dbj|BAD12880.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
gi|45735971|dbj|BAD13000.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
sativa Japonica Group]
Length = 333
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 79/254 (31%), Positives = 110/254 (43%), Gaps = 33/254 (12%)
Query: 87 VGLGTPTDEYYVQVDTGSDLLWVNCAGC-SRCPTKSDLGIKLTLFDPSKSSTSGEIACSD 145
+GLGTP +Y + VDTGS L W+ C+ C C +S +F+P SST + CS
Sbjct: 1 MGLGTPATQYVMVVDTGSSLTWLQCSPCLVSCHRQSG-----PVFNPKSSSTYASVGCSA 55
Query: 146 NFC----RTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLN 201
C T N +CS C Y +YGD S + GY +D + S
Sbjct: 56 QQCSDLPSATLNPS--ACSSSNVCIYQASYGDSSFSVGYLSKDTVSFGSTS--------L 105
Query: 202 SSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGG 261
+ +GCG G G S G++G + SLL QLA ++ F +CL
Sbjct: 106 PNFYYGCGQDNEGLFGRSA-----GLIGLARNKLSLLYQLAP--SLGYSFTYCLPSSSSS 158
Query: 262 GIFAIGDVVSPKVKTTPMVPNM---PHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIID 318
G ++G + TPMV + Y + L + V GNPL + TIID
Sbjct: 159 GYLSLGSYNPGQYSYTPMVSSSLDDSLYFIKLSGMTVAGNPL---SVSSSAYSSLPTIID 215
Query: 319 SGTTLAYLPPMLYD 332
SGT + LP +Y
Sbjct: 216 SGTVITRLPTSVYS 229
>gi|359485189|ref|XP_002279141.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 546
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 84/297 (28%), Positives = 137/297 (46%), Gaps = 37/297 (12%)
Query: 61 GRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTK 120
G+++A+++ +G +G YF V +GTP + + +DTGSDL W+ C C C +
Sbjct: 164 GQLIATLE-----SGVSLGSGEYFIDVFVGTPPKHFSLILDTGSDLNWIQCVPCYECFEQ 218
Query: 121 SDLGIKLTLFDPSKSSTSGEIACSDNFCR--TTYNNRYPSCSPGVRCEYVVTYGDGSSTS 178
+ +DP +SS+ I C D+ C ++ + P + C Y YGD S+T+
Sbjct: 219 NG-----PHYDPGQSSSYRNIGCHDSRCHLVSSPDPPQPCKAENQTCPYYYWYGDSSNTT 273
Query: 179 GYFVRD--IIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSS 236
G F + + L +SG + + +V+FGCG+ G + G+ S
Sbjct: 274 GDFALETFTVNLTMSSGKPELRRV-ENVMFGCGHWNRGLFHGAAGLLGL-----GRGPLS 327
Query: 237 LLSQLAAAGNVRKEFAHCL------DVVKGGGIF-AIGDVVS-PKVKTTPMV-----PNM 283
SQL + F++CL V IF D++S P++ T +V P
Sbjct: 328 FSSQLQSL--YGHSFSYCLVDRNSDANVSSKLIFGEDKDLLSHPELNFTTLVAGKENPVD 385
Query: 284 PHYNVILEEVEVGGNPLDLPTS--LLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
Y V ++ + VGG +++P + T GTIIDSGTTL+Y Y ++ F
Sbjct: 386 TFYYVQIKSIVVGGEVVNIPEEKWQIATDGSGGTIIDSGTTLSYFAEPAYQVIKEAF 442
>gi|297821064|ref|XP_002878415.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297324253|gb|EFH54674.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 486
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 85/287 (29%), Positives = 126/287 (43%), Gaps = 36/287 (12%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPS 133
+G +G YF ++G+GTP Y+ +DTGSD++W+ C+ C C +SD+ +FDP
Sbjct: 129 SGLSQGSGEYFMRLGVGTPATNVYMVLDTGSDVVWLQCSPCKACYNQSDV-----IFDPK 183
Query: 134 KSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASG 193
KS T + C CR ++ C Y V+YGDGS T G F + + + A
Sbjct: 184 KSKTFATVPCGSRLCRRLDDSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLTFHGA-- 241
Query: 194 NLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAH 253
+ PL GCG+ G G+LG G+ S SQ + N +F++
Sbjct: 242 RVDHVPL------GCGHDNEGLF-----VGAAGLLGLGRGGLSFPSQTKSRYN--GKFSY 288
Query: 254 CL-------DVVKGGGIFAIGDVVSPKVKT-TPMVPNMP---HYNVILEEVEVGGNPL-- 300
CL K G+ PK TP++ N Y + L + VGG+ +
Sbjct: 289 CLVDRTSSGSSSKPPSTIVFGNDAVPKTSVFTPLLTNPKLDTFYYLQLLGISVGGSRVPG 348
Query: 301 --DLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFRFWIASL 345
+ L TG+ G IIDSGT++ L Y + FR L
Sbjct: 349 VSESQFKLDATGNG-GVIIDSGTSVTRLTQSAYVALRDAFRLGATKL 394
>gi|449443786|ref|XP_004139658.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
gi|449475416|ref|XP_004154449.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 453
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 87/279 (31%), Positives = 125/279 (44%), Gaps = 38/279 (13%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPS 133
+G +G YFT++G+GTP Y+ +DTGSD++W+ C+ C +C ++SD +F+P
Sbjct: 101 SGLSQGSGEYFTRLGVGTPPRYLYMVLDTGSDVVWLQCSPCRKCYSQSD-----PIFNPY 155
Query: 134 KSSTSGEIACSDNFCRTTYNNRYPSCSPGVR---CEYVVTYGDGSSTSGYFVRDIIQLNQ 190
KS + I CS CR R S R C Y V+YGDGS T+G F + +
Sbjct: 156 KSKSFAGIPCSSPLCR-----RLDSSGCSTRRHTCLYQVSYGDGSFTTGDFATETLTFR- 209
Query: 191 ASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKE 250
GN K A V GCG+ G + G+ S SQ N +
Sbjct: 210 --GN-KIA----KVALGCGHHNEGLFVGAAGLLGL-----GRGRLSFPSQTGIRFN--HK 255
Query: 251 FAHCL---DVVKGGGIFAIGD-VVSPKVKTTPMVPNMP---HYNVILEEVEVGGNPLD-L 302
F++CL GD +S + TP++ N Y V L + VGG + +
Sbjct: 256 FSYCLVDRSASSKPSSMVFGDAAISRLARFTPLIRNPKLDTFYYVGLIGISVGGVRVRGV 315
Query: 303 PTSL--LGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
SL L + G IIDSGT++ L Y + FR
Sbjct: 316 SPSLFKLDSAGNGGVIIDSGTSVTRLTRPAYTALRDAFR 354
>gi|356557014|ref|XP_003546813.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 435
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 81/263 (30%), Positives = 113/263 (42%), Gaps = 32/263 (12%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGE 140
G Y + +G+P E VDTGS L+W+ C+ C C + LF+P KSST
Sbjct: 87 GEYLMRFYIGSPPVERLAMVDTGSSLIWLQCSPCHNC-----FPQETPLFEPLKSSTYKY 141
Query: 141 IACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPL 200
C C T C +C Y + YGD S + G + + G +
Sbjct: 142 ATCDSQPC-TLLQPSQRDCGKLGQCIYGIMYGDKSFSVGILGTETLSFGSTGGAQTVSFP 200
Query: 201 NSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL----- 255
N+ IFGCG + + +S V GI G G SL+SQL A + +F++CL
Sbjct: 201 NT--IFGCGVDNNFTIYTSNK--VMGIAGLGAGPLSLVSQLGA--QIGHKFSYCLLPYDS 254
Query: 256 ---DVVKGG--GIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTG 310
+K G I VVS + P +P +Y + LE V +G ++ TG
Sbjct: 255 TSTSKLKFGSEAIITTNGVVSTPLIIKPSLPT--YYFLNLEAVTIGQK-------VVSTG 305
Query: 311 DERGTI-IDSGTTLAYLPPMLYD 332
G I IDSGT L YL Y+
Sbjct: 306 QTDGNIVIDSGTPLTYLENTFYN 328
>gi|297829808|ref|XP_002882786.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
lyrata]
gi|297328626|gb|EFH59045.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
lyrata]
Length = 449
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 81/309 (26%), Positives = 136/309 (44%), Gaps = 29/309 (9%)
Query: 53 KQHD-TRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC 111
K+H R + + ++LG +G T YFT+V +GTP ++ V VDTGS+L WVNC
Sbjct: 58 KRHSLISRKRKFKGGVKMDLG-SGIDYGTAQYFTEVRVGTPAKKFRVVVDTGSELTWVNC 116
Query: 112 AGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRY--PSC-SPGVRCEYV 168
R K +F +S + + C C+ N + +C +P C Y
Sbjct: 117 RYRGRGKGKVK---NRRVFRAEESKSFKTVGCFTQTCKVDLMNLFSLSTCPTPSTPCSYD 173
Query: 169 VTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGIL 228
Y DGS+ G F ++ I + +G + A L ++ GC + S + DG+L
Sbjct: 174 YRYADGSAAQGVFAKETITVGLTNG--RKARLR-GLLVGCSSSFS----GQSFQGADGVL 226
Query: 229 GFGQANSSLLSQLAAAGNVRKEFAHCL----------DVVKGGGIFAIGDVVSPKVKTTP 278
G ++ S S A + ++CL + + G + + +TTP
Sbjct: 227 GLAFSDFSFTS--TATSLFGAKLSYCLVDHLSNKNISNYLIFGYSSSSTSTKTAPGRTTP 284
Query: 279 MVPNM--PHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLS 336
+ + P Y + + + +G + LD+PT + GTI+DSGT+L L Y V++
Sbjct: 285 LDLTLIPPFYAINIIGISIGDDMLDIPTQVWDATTGGGTILDSGTSLTLLAEAAYKPVVT 344
Query: 337 QFRFWIASL 345
++ L
Sbjct: 345 GLARYLVEL 353
>gi|15234607|ref|NP_194733.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|4938480|emb|CAB43839.1| putative protein [Arabidopsis thaliana]
gi|7269904|emb|CAB80997.1| putative protein [Arabidopsis thaliana]
gi|67633776|gb|AAY78812.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332660311|gb|AEE85711.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 427
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 83/336 (24%), Positives = 144/336 (42%), Gaps = 41/336 (12%)
Query: 4 LRLLALVVVTVAVVHQWAVGGGGVMGNFVFEVENKFKAGGERERTLSALKQHDTRRHGRM 63
L L ++ +++VVH A V + + + + + +K+ R +
Sbjct: 9 LFFLIILCFSISVVHLSA------SPTLVLNLVHSYHIYSRKPPHVYHIKEASVERLEYL 62
Query: 64 MASIDLELGGNGHPSATGL---YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTK 120
A ++ + P+ + + + +G+P + +DT SDLLW+ C C C +
Sbjct: 63 KAKTTGDIIAHLSPNVPIIPQAFLVNISIGSPPITQLLHMDTASDLLWIQCLPCINCYAQ 122
Query: 121 SDLGIKLTLFDPSKSSTSGEIACSDNFCRTT-YNNRYPSCSPGVR-CEYVVTYGDGSSTS 178
S L +FDPS+S T + CRT+ Y+ + R CEY + Y D + +
Sbjct: 123 S-----LPIFDPSRSYTH-----RNETCRTSQYSMPSLKFNANTRSCEYSMRYVDDTGSK 172
Query: 179 GYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLL 238
G R+++ N +A L+ V+FGCG+ G+ T GILG G SL+
Sbjct: 173 GILAREMLLFNTIYDESSSAALH-DVVFGCGHDNYGEPLVGT-----GILGLGYGEFSLV 226
Query: 239 SQLAAAGNVRKEFAHCL----DVVKGGGIFAIGDVVSPKV-KTTPMVPNMPHYNVILEEV 293
+ K+F++C D + +GD + + TTP+ + Y V +E +
Sbjct: 227 HRFG------KKFSYCFGSLDDPSYPHNVLVLGDDGANILGDTTPLEIHNGFYYVTIEAI 280
Query: 294 EVGGNPLDLPTSLLGTGDER---GTIIDSGTTLAYL 326
V G L + + + GTIID+G +L L
Sbjct: 281 SVDGIILPIDPRVFNRNHQTGLGGTIIDTGNSLTSL 316
>gi|238479902|ref|NP_001154646.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|332643534|gb|AEE77055.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 350
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 58/146 (39%), Positives = 70/146 (47%), Gaps = 11/146 (7%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPS 133
+G S +G YF + +G P + DTGSDL+WV C+ C C S T+F P
Sbjct: 75 SGAASGSGQYFVDLRIGQPPQSLLLIADTGSDLVWVKCSACRNCSHHS----PATVFFPR 130
Query: 134 KSSTSGEIACSDNFCRTTYN-NRYPSCSP---GVRCEYVVTYGDGSSTSGYFVRDIIQLN 189
SST C D CR +R P C+ C Y Y DGS TSG F R+ L
Sbjct: 131 HSSTFSPAHCYDPVCRLVPKPDRAPICNHTRIHSTCHYEYGYADGSLTSGLFARETTSLK 190
Query: 190 QASGNLKTAPLNSSVIFGCGNRQSGD 215
+SG K A L SV FGCG R SG
Sbjct: 191 TSSG--KEARLK-SVAFGCGFRISGQ 213
>gi|225470916|ref|XP_002263964.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
gi|147788999|emb|CAN64659.1| hypothetical protein VITISV_009613 [Vitis vinifera]
Length = 489
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 84/282 (29%), Positives = 123/282 (43%), Gaps = 33/282 (11%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPS 133
+G +G YFT++G+GTP Y+ +DTGSD++W+ CA C +C +++D +FDP
Sbjct: 138 SGLAQGSGEYFTRLGVGTPPKYVYMVLDTGSDVVWIQCAPCRKCYSQTD-----PVFDPK 192
Query: 134 KSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASG 193
KS + I+C C + P C+ C Y V YGDGS T G F + +
Sbjct: 193 KSGSFSSISCRSPLCLRLDS---PGCNSRQSCLYQVAYGDGSFTFGEFSTETLTFRGTR- 248
Query: 194 NLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAH 253
V GCG+ G G+LG G+ S +Q ++F++
Sbjct: 249 -------VPKVALGCGHDNEGLF-----VGAAGLLGLGRGRLSFPTQTGL--RFGRKFSY 294
Query: 254 CL----DVVKGGGIFAIGDVVSPKVKTTPMVPNMP---HYNVILEEVEVGGNPLD-LPTS 305
CL K + VS TP++ N Y + L + VGG + + S
Sbjct: 295 CLVDRSASSKPSSVVFGQSAVSRTAVFTPLITNPKLDTFYYLELTGISVGGARVAGITAS 354
Query: 306 L--LGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFRFWIASL 345
L L T G IIDSGT++ L Y + FR A L
Sbjct: 355 LFKLDTAGNGGVIIDSGTSVTRLTRRAYVSLRDAFRAGAADL 396
>gi|449530542|ref|XP_004172253.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
CELL 1-like [Cucumis sativus]
Length = 486
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 92/331 (27%), Positives = 143/331 (43%), Gaps = 66/331 (19%)
Query: 45 RERTLSALKQHDTRRHGRMMASIDLELGG-----------------------------NG 75
+ TLS LK+ D+ R + A IDL + G +G
Sbjct: 85 KSLTLSRLKR-DSARVRSLTARIDLAIRGITGTDLEPLGNGGGGGSQFGTEDFESPIVSG 143
Query: 76 HPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKS 135
+G YF++VG+G P Y+ +DTGSD+ WV CA C+ C ++D F+P+ S
Sbjct: 144 ASQGSGEYFSRVGIGRPPSPVYMVLDTGSDVSWVQCAPCAECYEQTD-----PXFEPTSS 198
Query: 136 STSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQAS-GN 194
++ ++C C++ C G C Y V+YGDGS T G FV + + L S GN
Sbjct: 199 ASFTSLSCETEQCKSL---DVSECRNGT-CLYEVSYGDGSYTVGDFVTETVTLGSTSLGN 254
Query: 195 LKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHC 254
+ GCG+ G G+LG G + S SQL A+ F++C
Sbjct: 255 ---------IAIGCGHNNEGLF-----IGAAGLLGLGGGSLSFPSQLNASS-----FSYC 295
Query: 255 L--DVVKGGGIFAIGDVVSPKVKTTPMV--PNMPHYNVI-LEEVEVGGNPLDLPTSLLGT 309
L ++P T P+ PN+ + + L + VGG L +P +
Sbjct: 296 LVDRDSDSTSTLDFNSPITPDAVTAPLHRNPNLDTFFYLGLTGMSVGGAVLPIPETSFQM 355
Query: 310 GDE--RGTIIDSGTTLAYLPPMLYDLVLSQF 338
++ G I+DSGT + L +Y+++ F
Sbjct: 356 SEDGNGGIIVDSGTAVTRLQTTVYNVLRDAF 386
>gi|242044724|ref|XP_002460233.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
gi|241923610|gb|EER96754.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
Length = 512
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 84/277 (30%), Positives = 120/277 (43%), Gaps = 44/277 (15%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y VGLG E V VDT S+L WV CA C C + D LFDPS S + +
Sbjct: 153 YVATVGLGG--GEATVIVDTASELTWVQCAPCESCHDQQD-----PLFDPSSSPSYAAVP 205
Query: 143 CSDNFCRTTY------NNRYPSC----SPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQAS 192
C+ + C + +C C Y ++Y DGS + G D + L
Sbjct: 206 CNSSSCDALQLATGGTSGGAAACQGQDQSAAACSYTLSYRDGSYSRGVLAHDRLSL---- 261
Query: 193 GNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQ-LAAAGNVRKEF 251
+ +FGCG G T G++G G++ SL+SQ + G V F
Sbjct: 262 ----AGEVIDGFVFGCGTSNQGPPFGGT----SGLMGLGRSQLSLVSQTMDQFGGV---F 310
Query: 252 AHCLDVVK--GGGIFAIGDVVSPKVKTTPMV-PNM-------PHYNVILEEVEVGGNPLD 301
++CL + + G IGD S +TP+V +M P Y V L + VGG ++
Sbjct: 311 SYCLPLKESDSSGSLVIGDDSSVYRNSTPIVYASMVSDPLQGPFYFVNLTGITVGGQEVE 370
Query: 302 LPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
G G + IIDSGT + L P +Y+ V ++F
Sbjct: 371 SSGFSSGGGGGK-AIIDSGTVITSLVPSIYNAVKAEF 406
>gi|115442309|ref|NP_001045434.1| Os01g0954900 [Oryza sativa Japonica Group]
gi|20161865|dbj|BAB90778.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
gi|113534965|dbj|BAF07348.1| Os01g0954900 [Oryza sativa Japonica Group]
gi|215766867|dbj|BAG99095.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 445
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 85/272 (31%), Positives = 119/272 (43%), Gaps = 44/272 (16%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y + GLGTP V +D +D WV C+ C+ C S F P++SST +
Sbjct: 102 YIARAGLGTPAQTLLVAIDPSNDAAWVPCSACAGCAASSP------SFSPTQSSTYRTVP 155
Query: 143 CSDNFCRTTYNNRYPSCSPGV--RCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPL 200
C C + PSC GV C + +TY S+ +D + L N+
Sbjct: 156 CGSPQCAQVPS---PSCPAGVGSSCGFNLTYA-ASTFQAVLGQDSLALEN---NVVV--- 205
Query: 201 NSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLA-AAGNVRKEFAHCLDVVK 259
S FGC SG+ G++GFG+ S LSQ G+V F++CL +
Sbjct: 206 --SYTFGCLRVVSGN-----SVPPQGLIGFGRGPLSFLSQTKDTYGSV---FSYCLPNYR 255
Query: 260 G---GGIFAIGDVVSPK-VKTTPMVPNMPH----YNVILEEVEVGGNPLDLPTSLLG--- 308
G +G + PK +KTTP++ N PH Y V + + VG + +P S L
Sbjct: 256 SSNFSGTLKLGPIGQPKRIKTTPLLYN-PHRPSLYYVNMIGIRVGSKVVQVPQSALAFNP 314
Query: 309 -TGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
TG GTIID+GT L +Y V FR
Sbjct: 315 VTGS--GTIIDAGTMFTRLAAPVYAAVRDAFR 344
>gi|125529158|gb|EAY77272.1| hypothetical protein OsI_05246 [Oryza sativa Indica Group]
Length = 426
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 85/272 (31%), Positives = 119/272 (43%), Gaps = 44/272 (16%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y + GLGTP V +D +D WV C+ C+ C S F P++SST +
Sbjct: 83 YIARAGLGTPAQTLLVAIDPSNDAAWVPCSACAGCAASSP------SFSPTQSSTYRTVP 136
Query: 143 CSDNFCRTTYNNRYPSCSPGV--RCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPL 200
C C + PSC GV C + +TY S+ +D + L N+
Sbjct: 137 CGSPQCAQVPS---PSCPAGVGSSCGFNLTYA-ASTFQAVLGQDSLALEN---NVVV--- 186
Query: 201 NSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLA-AAGNVRKEFAHCLDVVK 259
S FGC SG+ G++GFG+ S LSQ G+V F++CL +
Sbjct: 187 --SYTFGCLRVVSGN-----SVPPQGLIGFGRGPLSFLSQTKDTYGSV---FSYCLPNYR 236
Query: 260 G---GGIFAIGDVVSPK-VKTTPMVPNMPH----YNVILEEVEVGGNPLDLPTSLLG--- 308
G +G + PK +KTTP++ N PH Y V + + VG + +P S L
Sbjct: 237 SSNFSGTLKLGPIGQPKRIKTTPLLYN-PHRPSLYYVNMIGIRVGSKVVQVPQSALAFNP 295
Query: 309 -TGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
TG GTIID+GT L +Y V FR
Sbjct: 296 VTGS--GTIIDAGTMFTRLAAPVYAAVRDAFR 325
>gi|125527257|gb|EAY75371.1| hypothetical protein OsI_03267 [Oryza sativa Indica Group]
Length = 484
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 89/347 (25%), Positives = 133/347 (38%), Gaps = 55/347 (15%)
Query: 44 ERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTG 103
+RER ++ + RR ++ + L + + TG YF + +GTP + + DTG
Sbjct: 50 DRER-MAFISSRGRRRAAETASAFAMPLSSGAY-TGTGQYFVRFRVGTPAQPFLLVADTG 107
Query: 104 SDLLWVNC-----------AGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTY 152
SDL WV C S P + + T F P KS T I CS CR +
Sbjct: 108 SDLTWVKCHRAAAAASASPRNASSLPAPAPASPRRT-FRPDKSRTWAPIPCSSATCRESL 166
Query: 153 NNRYPSCS-PGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNR 211
+C+ P C Y Y DGS+ G D + + + A L V+ GC
Sbjct: 167 PFSLAACATPANPCAYDYRYKDGSAARGTVGVDSATIALSGRAARKAKLR-GVVLGCTTS 225
Query: 212 QSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL---------------- 255
+G + A DG+L G +N S S+ AA F++CL
Sbjct: 226 YNGQ----SFLASDGVLSLGYSNISFASR--AASRFGGRFSYCLVDHLAPRNATSYLTFG 279
Query: 256 --------------DVVKGGGIFAIGDVVSPKVKTTPMVPN---MPHYNVILEEVEVGGN 298
K +P + TP+V + P Y V ++ V V G
Sbjct: 280 PNPAFSSRRPSEGIASCKPAPAPTPAPAGAPGARQTPLVLDHRTRPFYAVTVKGVSVAGE 339
Query: 299 PLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFRFWIASL 345
L +P ++ G I+DSGT+L L Y V++ +A L
Sbjct: 340 LLKIPRAVWDVEQGGGAILDSGTSLTMLAKPAYRAVVAALSKRLAGL 386
>gi|242053503|ref|XP_002455897.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
gi|241927872|gb|EES01017.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
Length = 493
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 85/291 (29%), Positives = 120/291 (41%), Gaps = 41/291 (14%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPS 133
+G +G YFTK+G+GTP+ + +DTGSD++W+ CA C RC +S +FDP
Sbjct: 131 SGLAQGSGEYFTKIGVGTPSTPALMVLDTGSDVVWLQCAPCRRCYDQSG-----PVFDPR 185
Query: 134 KSSTSGEIACSDNFCRTTYNNRYPSCSPGVR-CEYVVTYGDGSSTSGYFVRDIIQLNQAS 192
+SS+ G + C+ CR + C R C Y V YGDGS T+G F + + +
Sbjct: 186 RSSSYGAVDCAAPLCRRLDSG---GCDLRRRACLYQVAYGDGSVTAGDFATETLTF---A 239
Query: 193 GNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFA 252
G + A V GCG+ G V G SL + K F+
Sbjct: 240 GGARVA----RVALGCGHDNEGLF-------VAAAGLLGLGRGSLSFPTQISRRYGKSFS 288
Query: 253 HCL-----------DVVKGGGIFAIGDVVSPKVKTTPMV--PNMP-HYNVILEEVEVGGN 298
+CL G + TPMV P M Y V L + VGG
Sbjct: 289 YCLVDRTSSSSSGAASRSRSSTVTFGPPSASAASFTPMVRNPRMETFYYVQLVGISVGGA 348
Query: 299 PL----DLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFRFWIASL 345
+ + L + G I+DSGT++ L Y + FR A L
Sbjct: 349 RVPGVAESDLRLDPSTGRGGVIVDSGTSVTRLARPSYSALRDAFRAAAAGL 399
>gi|356555807|ref|XP_003546221.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 457
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 86/276 (31%), Positives = 124/276 (44%), Gaps = 40/276 (14%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCS-RCPTKSDLGIKLTLFDPSKSST- 137
+G Y+ K+G+GTP + + VDTGS L W+ C C C + D +F PS S T
Sbjct: 104 SGNYYVKIGVGTPAKYFSMIVDTGSSLSWLQCQPCVIYCHVQVD-----PIFTPSVSKTY 158
Query: 138 -SGEIACSDNFCRTTYNNRYPSCSPGV-RCEYVVTYGDGSSTSGYFVRDIIQLNQASGNL 195
+ + S + P CS C Y +YGD S + GY +D++ L ++
Sbjct: 159 KALSCSSSQCSSLKSSTLNAPGCSNATGACVYKASYGDTSFSIGYLSQDVLTLTPSA--- 215
Query: 196 KTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLA-AAGNVRKEFAHC 254
AP +S ++GCG G G S GI+G S+L QL+ GN F++C
Sbjct: 216 --AP-SSGFVYGCGQDNQGLFGRSA-----GIIGLANDKLSMLGQLSNKYGNA---FSYC 264
Query: 255 LDVVKGG-------GIFAIG--DVVSPKVKTTPMV--PNMPH-YNVILEEVEVGGNPLDL 302
L G +IG + S K TP+V P +P Y + L + V G PL +
Sbjct: 265 LPSSFSAQPNSSVSGFLSIGASSLSSSPYKFTPLVKNPKIPSLYFLGLTTITVAGKPLGV 324
Query: 303 PTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
S TIIDSGT + LP +Y+ + F
Sbjct: 325 SASSYNV----PTIIDSGTVITRLPVAIYNALKKSF 356
>gi|356553775|ref|XP_003545228.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 559
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 81/280 (28%), Positives = 119/280 (42%), Gaps = 36/280 (12%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSG 139
+G YF V +GTP + + +DTGSDL W+ C C C +S +DP SS+
Sbjct: 192 SGEYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCIACFEQSG-----PYYDPKDSSSFR 246
Query: 140 EIACSDNFCR--TTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKT 197
I+C D C+ ++ + P + C Y YGDGS+T+G F + +N + N K+
Sbjct: 247 NISCHDPRCQLVSSPDPPNPCKAENQSCPYFYWYGDGSNTTGDFALETFTVNLTTPNGKS 306
Query: 198 APLN-SSVIFGCG--NRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHC 254
+ +V+FGCG NR + G L F SL Q F++C
Sbjct: 307 ELKHVENVMFGCGHWNRGLFHGAAGLLGLGKGPLSFASQMQSLYGQ---------SFSYC 357
Query: 255 LDVVKGGGIFAIGDVVSPKVKTTPMVPNM--------------PHYNVILEEVEVGGNPL 300
L V + ++ + K PN+ Y V + V V L
Sbjct: 358 L-VDRNSNASVSSKLIFGEDKELLSHPNLNFTSFGGGKDGSVDTFYYVQINSVMVDDEVL 416
Query: 301 DLP--TSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
+P T L + GTIIDSGTTL Y Y+++ F
Sbjct: 417 KIPEETWHLSSEGAGGTIIDSGTTLTYFAEPAYEIIKEAF 456
>gi|255588450|ref|XP_002534607.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223524923|gb|EEF27776.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 260
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 74/239 (30%), Positives = 106/239 (44%), Gaps = 33/239 (13%)
Query: 79 ATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSR-CPTKSDLGIKLTLFDPSKSST 137
+ G Y TK+ +GTP E+ + VDTGS++ +V C G C D F SST
Sbjct: 46 SYGYYATKLYIGTPPQEFTLVVDTGSNMTFVPCCGSEEYCGKHEDPA-----FQTESSST 100
Query: 138 SGEIACSDNFCRTTYNNRYPSCSPGV---RCEYVVTYGDGSSTSGYFVRDIIQLNQASGN 194
+ C +PSC +C Y + YGDGS + G DII S
Sbjct: 101 YQPVNC------------HPSCDCDYLRSQCSYKMHYGDGSYSRGVLAEDIISFGNES-- 146
Query: 195 LKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHC 254
+ AP ++FGC + +GS DGI+G G+ S+++ QL G + F+ C
Sbjct: 147 -EFAP--QRLVFGC---ELDAIGSLYSLRADGIIGLGRGRSTIVDQLVDKGVISDSFSLC 200
Query: 255 LDVVKGGGIFAIGDVVSPKVKTTPMVPNMP----HYNVILEEVEVGGNPLDLPTSLLGT 309
++GGG I SP + P +YNV L E++V G PL+ L T
Sbjct: 201 YGGMEGGGGHIILGSFSPPPSDMFFTYSNPGRSQYYNVELMEIQVAGKPLEXXREALNT 259
>gi|356528623|ref|XP_003532899.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 507
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 93/369 (25%), Positives = 153/369 (41%), Gaps = 59/369 (15%)
Query: 18 HQWAVGGGGVMGNFVFEVENKFKAGGERERTLS---ALKQHDTRRHG---RMMASIDLEL 71
H+ GGGG + V V+ G R + ++ + +D RR G +++ +
Sbjct: 42 HERFSGGGGDVDQ-VEAVKGFVNRDGLRRQRMNQRWGVSNYDRRRKGLETTTTTEVEMPM 100
Query: 72 GGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLT--- 128
G A G YFT+V +G+P +++ DTGS+ W NC + T + +
Sbjct: 101 RA-GRDDALGEYFTEVKVGSPGQRFWLAADTGSEFTWFNCVMRNATTTATTKKTRKNKTK 159
Query: 129 ------------------------------LFDPSKSSTSGEIACSDNFCRTTYNNRYPS 158
+F P +S + + C+ C+ + +
Sbjct: 160 KKHHHHSKRNRTRTTRRTKKKKAKSNPCKGVFCPHRSKSFQAVTCASQKCKIDLSQLFSL 219
Query: 159 C---SPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGD 215
P C Y ++Y DGSS G+F D I ++ +G K LN+ I GC +S +
Sbjct: 220 SLCPKPSDPCLYDISYADGSSAKGFFGTDTITVDLKNG--KEGKLNNLTI-GC--TKSME 274
Query: 216 LGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL-DVVKGGGI---FAIGDVVS 271
G + + GILG G A S + + AA +F++CL D + + IG +
Sbjct: 275 NGVNFNEDTGGILGLGFAKDSFIDK--AAYEYGAKFSYCLVDHLSHRNVSSYLTIGGHHN 332
Query: 272 PK----VKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLP 327
K +K T ++ P Y V + + +GG L +P + + GT+IDSGTTL L
Sbjct: 333 AKLLGEIKRTELILFPPFYGVNVVGISIGGQMLKIPPQVWDFNSQGGTLIDSGTTLTALL 392
Query: 328 PMLYDLVLS 336
Y+ V
Sbjct: 393 VPAYEPVFE 401
>gi|15229656|ref|NP_188478.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|75273882|sp|Q9LS40.1|ASPG1_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 1;
Short=AtASPG1; Flags: Precursor
gi|11994113|dbj|BAB01116.1| CND41, chloroplast nucleoid DNA binding protein-like [Arabidopsis
thaliana]
gi|23297732|gb|AAN13013.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
gi|332642583|gb|AEE76104.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 500
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 89/326 (27%), Positives = 140/326 (42%), Gaps = 52/326 (15%)
Query: 25 GGVMGNFVFEVENKFKAGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYF 84
G++ F VE G L + DTR + + + +G +G YF
Sbjct: 114 AGIVAKIRFAVE------GVDRSDLKPVYNEDTRYQTEDLTTPVV----SGASQGSGEYF 163
Query: 85 TKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACS 144
+++G+GTP E Y+ +DTGSD+ W+ C C+ C +SD +F+P+ SST + CS
Sbjct: 164 SRIGVGTPAKEMYLVLDTGSDVNWIQCEPCADCYQQSD-----PVFNPTSSSTYKSLTCS 218
Query: 145 DNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSV 204
C + +C +C Y V+YGDGS T G D + SG + ++V
Sbjct: 219 APQCSLLETS---ACRSN-KCLYQVSYGDGSFTVGELATDTVTFGN-SGKI------NNV 267
Query: 205 IFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIF 264
GCG+ G + G S+ +Q+ A F++CL V + G
Sbjct: 268 ALGCGHDNEGLFTGAAGLLGLGGGVL-----SITNQMKAT-----SFSYCL-VDRDSGKS 316
Query: 265 AIGDVVSPKV----KTTPMVPNMP---HYNVILEEVEVGGNPLDLPTSLL-----GTGDE 312
+ D S ++ T P++ N Y V L VGG + LP ++ G+G
Sbjct: 317 SSLDFNSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSG-- 374
Query: 313 RGTIIDSGTTLAYLPPMLYDLVLSQF 338
G I+D GT + L Y+ + F
Sbjct: 375 -GVILDCGTAVTRLQTQAYNSLRDAF 399
>gi|242091327|ref|XP_002441496.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
gi|241946781|gb|EES19926.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
Length = 466
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 89/326 (27%), Positives = 127/326 (38%), Gaps = 27/326 (8%)
Query: 40 KAGGERERTLSALKQHDTRRHGRMMASIDLELGG----NGHPSATGLYFTKVGLGTPTDE 95
+A + R Q + R GR A + +G + TG YF + +GTP
Sbjct: 54 RARDDLHRHAYIRSQLASSRRGRRAAEVGASAFAMPLSSGAYTGTGQYFVRFRVGTPAQP 113
Query: 96 YYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNR 155
+ + DTGSDL WV C G +F + S + IACS + C +
Sbjct: 114 FVLVADTGSDLTWVKCRGAGAAAGTGAGS-PARVFRTAASKSWAPIACSSDTCTSYVPFS 172
Query: 156 YPSC-SPGVRCEYVVTYGDGSSTSGYFVRD--IIQLNQASGNLKTAPLNSS------VIF 206
+C SP C Y Y DGS+ G D I L+ SG V+
Sbjct: 173 LANCSSPASPCAYDYRYRDGSAARGVVGTDSATIALSSGSGRGGGDSSGGRRAKLQGVVL 232
Query: 207 GCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHC----LDVVKGGG 262
GC G S+ DG+L G +N S S+ AA R F++C L
Sbjct: 233 GCAATYDGQSFQSS----DGVLSLGNSNISFASRAAARFGGR--FSYCLVDHLAPRNATS 286
Query: 263 IFAIGDVVSPKVKTTPMVPN---MPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDS 319
G + TP++ + P Y V ++ V V G LD+P + G I+DS
Sbjct: 287 YLTFGPGATAPAAQTPLLLDRRMTPFYAVTVDAVYVAGEALDIPADVWDVDRNGGAILDS 346
Query: 320 GTTLAYLPPMLYDLVLSQFRFWIASL 345
GT+L L Y V++ +A L
Sbjct: 347 GTSLTILATPAYRAVVTALSKHLAGL 372
>gi|356536463|ref|XP_003536757.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
Length = 475
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 82/318 (25%), Positives = 132/318 (41%), Gaps = 53/318 (16%)
Query: 39 FKAGGERERTLSALKQHDTRRHGRMMASIDL-------ELGGN----GHPSATGLYFTKV 87
F + +A Q DT+R ++ + E G+ G +G YF ++
Sbjct: 81 FNTYHDHRTRFNARMQRDTKRAASLLRRLAAGKPTYAAEAFGSDVVSGMEQGSGEYFVRI 140
Query: 88 GLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNF 147
G+G+P YV +D+GSD++WV C C++C +SD +F+P+ SS+ ++C+
Sbjct: 141 GVGSPPRNQYVVMDSGSDIIWVQCEPCTQCYHQSD-----PVFNPADSSSFSGVSCASTV 195
Query: 148 CRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFG 207
C N +C G RC Y V+YGDGS T G + I + L +V G
Sbjct: 196 CSHVDN---AACHEG-RCRYEVSYGDGSYTKGTLALETITFGRT--------LIRNVAIG 243
Query: 208 CGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDV--VKGGGIFA 265
CG+ G + G S + QL G F++CL ++ G+
Sbjct: 244 CGHHNQGMFVGAAGLLGLGGGPM-----SFVGQL--GGQTGGAFSYCLVSRGIESSGLLE 296
Query: 266 IGDVVSP-KVKTTPMV--PNMPHYNVI--------LEEVEVGGNPLDLPTSLLGTGDERG 314
G P P++ P + I V + + L S LG G G
Sbjct: 297 FGREAMPVGAAWVPLIHNPRAQSFYYIGLSGLGVGGLRVSISEDVFKL--SELGDG---G 351
Query: 315 TIIDSGTTLAYLPPMLYD 332
++D+GT + LP + Y+
Sbjct: 352 VVMDTGTAVTRLPTVAYE 369
>gi|302786560|ref|XP_002975051.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
gi|300157210|gb|EFJ23836.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
Length = 359
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 75/270 (27%), Positives = 113/270 (41%), Gaps = 30/270 (11%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGE 140
G Y ++ +GTP +DTGSDL+W+ C C C T+F SS+ +
Sbjct: 3 GEYMMELSIGTPPQLIPAMIDTGSDLVWLKCDNCDHCDLDHH---GETIFFSDASSSYKK 59
Query: 141 IACSDNFCRTTYNNRY-PSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAP 199
+ C+ C + P C C+Y YGDGS TSG D I
Sbjct: 60 LPCNSTHCSGMSSAGIGPRCEE--TCKYKYEYGDGSRTSGDVGSDRISFRSHGAGEDHRS 117
Query: 200 LNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL---- 255
+FGCG + GD + G++G GQ + SL+ QL + +F++CL
Sbjct: 118 FFDGFLFGCGRKLKGDWNFT-----QGLIGLGQKSHSLIQQL--GDKLGYKFSYCLVSYD 170
Query: 256 DVVKGGGIFAIGDVVSPK---VKTTPMVP----NMPHYNVILEEVEVGGNPLDLPTSLLG 308
+G + + V +TP++ + Y V L+ + VGG P+ + G
Sbjct: 171 SPPSAKSFLFLGSSAALRGHDVVSTPILHGDHLDQTLYYVDLQSITVGGVPVVVYDKESG 230
Query: 309 TGDERG------TIIDSGTTLAYLPPMLYD 332
G T+IDSGTT L P +Y+
Sbjct: 231 HNTSVGPFLANKTVIDSGTTYTLLTPPVYE 260
>gi|218196224|gb|EEC78651.1| hypothetical protein OsI_18747 [Oryza sativa Indica Group]
Length = 317
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 41/108 (37%), Positives = 62/108 (57%)
Query: 228 LGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSPKVKTTPMVPNMPHYN 287
+G G +N+SL+ QLA + +K FAHCLD + GGIF +G +V PKV+ TP+ Y
Sbjct: 1 MGLGPSNTSLVYQLAKSQKWKKMFAHCLDGKRSGGIFVLGHIVGPKVRKTPLDQTSSRYR 60
Query: 288 VILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVL 335
L E+ VG L L + + TI+++G+ ++YLP +Y L
Sbjct: 61 TTLLEITVGETSLSLSAGNVEIKSQNMTILETGSLISYLPEKVYQSFL 108
>gi|242045564|ref|XP_002460653.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
gi|241924030|gb|EER97174.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
Length = 525
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 91/310 (29%), Positives = 131/310 (42%), Gaps = 59/310 (19%)
Query: 62 RMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKS 121
RM+A+++ +G +G Y V +GTP + + +DTGSDL W+ CA C C +
Sbjct: 135 RMVATVE-----SGVAVGSGEYLMDVYVGTPPRRFRMIMDTGSDLNWLQCAPCLDCFEQ- 188
Query: 122 DLGIKLTLFDPSKSSTSGEIACSDNFC-----------RTTYNNRYPSCSPGVRCEYVVT 170
+ +FDP+ SS+ + C D+ C + R P P C Y
Sbjct: 189 ----RGPVFDPAASSSYRNVTCGDHRCGHVAPPPEPEASSPRTCRRPGEDP---CPYYYW 241
Query: 171 YGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSS----VIFGCGNRQSGDLGSSTDAAVDG 226
YGD S+T+G + +N TAP S V+FGCG+R G +
Sbjct: 242 YGDQSNTTGDLALESFTVNL------TAPGASRRVDGVVFGCGHRNRGLFHGAAGLLGL- 294
Query: 227 ILGFGQANSSLLSQLAAAGNVRKEFAHCL---------DVVKGGGIFAIGDVVSPKVKTT 277
G+ S SQL A F++CL VV G A+ P++K T
Sbjct: 295 ----GRGPLSFASQLRAVYG--HTFSYCLVDHGSDVGSKVVFGEDDDALALAAHPQLKYT 348
Query: 278 PMVPNM-------PHYNVILEEVEVGGNPLDLPTSLLGTGDE--RGTIIDSGTTLAYLPP 328
P Y V L+ V VGG L++ + G + GTIIDSGTTL+Y
Sbjct: 349 AFAPASSSSSPADTFYYVKLKGVLVGGELLNISSDTWDVGKDGSGGTIIDSGTTLSYFVE 408
Query: 329 MLYDLVLSQF 338
Y ++ F
Sbjct: 409 PAYQVIRHAF 418
>gi|222637611|gb|EEE67743.1| hypothetical protein OsJ_25435 [Oryza sativa Japonica Group]
Length = 396
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 82/271 (30%), Positives = 123/271 (45%), Gaps = 38/271 (14%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSG 139
T Y + LGTP + + VDT +D W+ C+GC+ CPT S F+P+ S++
Sbjct: 51 TPTYVVRARLGTPAQQLLLAVDTSNDAAWIPCSGCAGCPTSSP-------FNPAASASYR 103
Query: 140 EIACSDNFCRTTYNNRYPSCSPGVR-CEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTA 198
+ C C N PSCSP + C + ++Y D SS +D + + +G++ A
Sbjct: 104 PVPCGSPQCVLAPN---PSCSPNAKSCGFSLSYAD-SSLQAALSQDTLAV---AGDVVKA 156
Query: 199 PLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVV 258
FGC R +G T A G+LG G+ S LSQ F++CL
Sbjct: 157 -----YTFGCLQRATG-----TAAPPQGLLGLGRGPLSFLSQ--TKDMYGATFSYCLPSF 204
Query: 259 KG---GGIFAIGDVVSP-KVKTTPMVPNMPH----YNVILEEVEVGGNPLDLPTSLLG-- 308
K G +G P ++KTTP++ N PH Y V + + VG + +P S L
Sbjct: 205 KSLNFSGTLRLGRNGQPRRIKTTPLLAN-PHRSSLYYVNMTGIRVGKKVVSIPASALAFD 263
Query: 309 TGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
GT++DSGT L +Y + + R
Sbjct: 264 PATGAGTVLDSGTMFTRLVAPVYLALRDEVR 294
>gi|88174569|gb|ABD39359.1| chloroplast nucleoid DNA-binding protein [Oryza nivara]
Length = 321
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 82/278 (29%), Positives = 125/278 (44%), Gaps = 52/278 (18%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y VGLGTP V++DTGS WV C C C T F S+S+T +++
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTTWVFCE-CDGCHTNP------RTFLQSRSTTCAKVS 53
Query: 143 C---------SDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASG 193
C SD C+ + N YP C + V+Y DGS++ G +D + +
Sbjct: 54 CGTSMCLLGGSDPHCQDSEN--YPDCP------FRVSYQDGSASYGILYQDTLTFS---- 101
Query: 194 NLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAH 253
+++ P S FGC N S G++ VDG+LG G S+L Q + + F++
Sbjct: 102 DVQKIP---SFTFGC-NLDS--FGANEFGNVDGLLGMGAGPMSVLKQSSPTFD---GFSY 152
Query: 254 CLDVVKG--------GGIFAIGDVVS-PKVKTTPMV---PNMPHYNVILEEVEVGGNPLD 301
CL + K G F++G V + V+ T MV N + V L + V G L
Sbjct: 153 CLPLQKSERGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLG 212
Query: 302 LPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
L S+ +G + DSG+ L+Y+P ++ + R
Sbjct: 213 LSPSIFS---RKGVVFDSGSELSYIPDRALSVLSQRIR 247
>gi|413953788|gb|AFW86437.1| hypothetical protein ZEAMMB73_618532 [Zea mays]
Length = 469
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 84/284 (29%), Positives = 124/284 (43%), Gaps = 57/284 (20%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGC--SRCPTKSDLGIKLTLFDPSKSSTSGE 140
Y VGLGTP + +DTGS L WV C C S+C + +L LFDP+ SS+
Sbjct: 129 YVATVGLGTPAVPQTLILDTGSSLTWVQCKPCNSSQCYPQ-----RLPLFDPNTSSSYSP 183
Query: 141 IACSDNFCRTTYNNRYPSCSPGVR-----------CEYVVTYGDGSSTSGYFVRDIIQLN 189
+ C CR + + G+ C Y + YG G++ +G + D + L
Sbjct: 184 VPCDSQECR--------ALAAGIDGDGCTSDGDWGCAYEIHYGSGATPAGEYSTDALTLG 235
Query: 190 QASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAA--AGNV 247
+ + FGCG+ Q D A DG+LG G+ SL Q +A G V
Sbjct: 236 PGA-------IVKRFHFGCGHHQQ---RGKFDMA-DGVLGLGRLPQSLAWQASARRGGGV 284
Query: 248 RKEFAHCLDVVK-GGGIFAIGDVVSPKVKT----TPMVP--NMP-HYNVILEEVEVGGNP 299
F+HCL G A+G +P + TP++ + P Y ++ + V G
Sbjct: 285 ---FSHCLPPTGVSTGFLALG---APHDTSAFVFTPLLTMDDQPWFYQLMPTAISVAGQL 338
Query: 300 LDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFRFWIA 343
LD+P ++ G I DSGT L+ L Y + + FR +A
Sbjct: 339 LDIPPAVF----REGVITDSGTVLSALQETAYTALRTAFRSAMA 378
>gi|299471769|emb|CBN76990.1| aspartic protease PM5 [Ectocarpus siliculosus]
Length = 947
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 73/260 (28%), Positives = 114/260 (43%), Gaps = 32/260 (12%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGE 140
G +F V GTP V +DTGS C+ C C + +D +D SKS++S
Sbjct: 124 GTHFAYVYAGTPPQRVSVIIDTGSHFTAFPCSECENCGSHTD-----PHWDQSKSTSSHI 178
Query: 141 IACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDI-----IQLNQASG-N 194
+ C D C ++ C RC + Y +GSS Y V D+ + L Q+ N
Sbjct: 179 VTCED--CHGSFR-----CQKDKRCGFSQRYSEGSSWRAYQVEDVLWVGELTLQQSEKIN 231
Query: 195 LKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKE-FAH 253
+ + +FGC Q+G + DGI+G + +L+ QLA AG +++ F+
Sbjct: 232 HDESAYSVEFMFGCIESQTGLFKTQL---ADGIMGMSADSHTLVWQLAKAGKIKERTFSL 288
Query: 254 CLDVVKGGGIFAIG------DVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLL 307
C K GG IG + ++ TP + V + ++ V + ++
Sbjct: 289 CFG--KNGGTMVIGGYDTRLNKPGHEMMYTPSTKTNGWFTVQVTDITVNRVSIAQDPAIF 346
Query: 308 GTGDERGTIIDSGTTLAYLP 327
G +G I+DSGTT YLP
Sbjct: 347 QRG--KGIIVDSGTTDTYLP 364
>gi|226508202|ref|NP_001141111.1| hypothetical protein precursor [Zea mays]
gi|194702684|gb|ACF85426.1| unknown [Zea mays]
gi|414590469|tpg|DAA41040.1| TPA: hypothetical protein ZEAMMB73_571218 [Zea mays]
Length = 439
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 90/313 (28%), Positives = 132/313 (42%), Gaps = 49/313 (15%)
Query: 50 SALKQHDTRRHGRMMA--SIDLELGGNGHPSAT-GLYFTKVGLGTPTDEYYVQVDTGSDL 106
+AL + R + R +A S D + P+ G + + +GTP + DTGSDL
Sbjct: 49 AALHRDMHRHNARKLAASSSDGTVSAPVSPTTVPGEFLMTLAIGTPPLPFLAIADTGSDL 108
Query: 107 LWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCE 166
+W CA CSR L++PS S+T + C N+ C+P C
Sbjct: 109 IWTQCAPCSR----QCFQQPTPLYNPSSSTTFSALPC---------NSSLGLCAPACACM 155
Query: 167 YVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDG 226
Y +TYG G + Y + S + FGC N SG SS G
Sbjct: 156 YNMTYGSGWT---YVFQGTETFTFGSSTPADQVRVPGIAFGCSNASSGFNASS----ASG 208
Query: 227 ILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVK---GGGIFAIGDVVSPK----VKTTPM 279
++G G+ + SL+SQL A +F++CL + +G S V +TP
Sbjct: 209 LVGLGRGSLSLVSQLGA-----PKFSYCLTPYQDTNSTSTLLLGPSASLNDTGVVSSTPF 263
Query: 280 V--PNMPHYNVILEEVEVGGNPLDLPTSLL-----GTGDERGTIIDSGTTLAYLPPMLYD 332
V P+ +Y + L + +G L +P + GTG G IIDSGTT+ ML +
Sbjct: 264 VASPSSIYYYLNLTGISLGTTALPIPPNAFSLKADGTG---GLIIDSGTTIT----MLGN 316
Query: 333 LVLSQFRFWIASL 345
Q R + SL
Sbjct: 317 TAYQQVRAAVLSL 329
>gi|297834938|ref|XP_002885351.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
gi|297331191|gb|EFH61610.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
Length = 471
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 81/282 (28%), Positives = 121/282 (42%), Gaps = 36/282 (12%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPS 133
+G +G YF ++G+G+P + Y+ +D+GSD++WV C C C +SD +FDP+
Sbjct: 123 SGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSD-----PVFDPA 177
Query: 134 KSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASG 193
KS + ++C + C N+ C G C Y V YGDGS T G + +
Sbjct: 178 KSGSYTGVSCGSSVCDRIENS---GCHSG-GCRYEVMYGDGSYTKGTLALETLTF----- 228
Query: 194 NLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAH 253
KT N V GCG+R G + G + S + QL +G F +
Sbjct: 229 -AKTVVRN--VAMGCGHRNRGMFIGAAGLLGI-----GGGSMSFVGQL--SGQTGGAFGY 278
Query: 254 CL---------DVVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPL-DLP 303
CL +V G +G P V+ P P+ + + V PL D
Sbjct: 279 CLVSRGTDSTGSLVFGREALPVGASWVPLVR-NPRAPSFYYVGLKGLGVGGVRIPLPDGV 337
Query: 304 TSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFRFWIASL 345
L TGD G ++D+GT + LP Y F+ A+L
Sbjct: 338 FDLTETGDG-GVVMDTGTAVTRLPTGAYAAFRDGFKSQTANL 378
>gi|79407941|ref|NP_188636.2| aspartyl protease family protein [Arabidopsis thaliana]
gi|75273243|sp|Q9LHE3.1|ASPG2_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 2;
Short=AtASPG2; Flags: Precursor
gi|11994777|dbj|BAB03167.1| nucleoid chloroplast DNA-binding protein-like [Arabidopsis
thaliana]
gi|28392860|gb|AAO41867.1| unknown protein [Arabidopsis thaliana]
gi|332642798|gb|AEE76319.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 470
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 81/282 (28%), Positives = 121/282 (42%), Gaps = 36/282 (12%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPS 133
+G +G YF ++G+G+P + Y+ +D+GSD++WV C C C +SD +FDP+
Sbjct: 122 SGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSD-----PVFDPA 176
Query: 134 KSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASG 193
KS + ++C + C N+ C G C Y V YGDGS T G + +
Sbjct: 177 KSGSYTGVSCGSSVCDRIENS---GCHSG-GCRYEVMYGDGSYTKGTLALETLTF----- 227
Query: 194 NLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAH 253
KT N V GCG+R G + G + S + QL +G F +
Sbjct: 228 -AKTVVRN--VAMGCGHRNRGMFIGAAGLLGI-----GGGSMSFVGQL--SGQTGGAFGY 277
Query: 254 CL---------DVVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPL-DLP 303
CL +V G +G P V+ P P+ + + V PL D
Sbjct: 278 CLVSRGTDSTGSLVFGREALPVGASWVPLVR-NPRAPSFYYVGLKGLGVGGVRIPLPDGV 336
Query: 304 TSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFRFWIASL 345
L TGD G ++D+GT + LP Y F+ A+L
Sbjct: 337 FDLTETGDG-GVVMDTGTAVTRLPTAAYVAFRDGFKSQTANL 377
>gi|242046812|ref|XP_002461152.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
gi|241924529|gb|EER97673.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
Length = 452
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 84/271 (30%), Positives = 119/271 (43%), Gaps = 36/271 (13%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSG 139
T Y + LGTP + + VDT +D W+ CAGC+ CPT S FDP+ S++
Sbjct: 107 TPTYVVRARLGTPPQQLLLAVDTSNDAAWIPCAGCAGCPTSS-----APPFDPAASTSYR 161
Query: 140 EIACSDNFCRTTYNNRYPSCSPGVR-CEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTA 198
+ C C N +C PG + C + +TY D SS +D L A +KT
Sbjct: 162 SVPCGSPLCAQAPNA---ACPPGGKACGFSLTYAD-SSLQAALSQD--SLAVAGDAVKT- 214
Query: 199 PLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVV 258
FGC + +G T A G+LG G+ S LSQ + F++CL
Sbjct: 215 -----YTFGCLQKATG-----TAAPPQGLLGLGRGPLSFLSQ--TRDMYQGTFSYCLPSF 262
Query: 259 KG---GGIFAIG-DVVSPKVKTTPMVPNMPH----YNVILEEVEVGGN--PLDLPTSLLG 308
K G +G + P++KTTP++ N PH Y V + + VG P+ P
Sbjct: 263 KSLNFSGTLRLGRNGQPPRIKTTPLLAN-PHRSSLYYVNMTGIRVGRKVVPIPPPALAFD 321
Query: 309 TGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
GT++DSGT L Y V + R
Sbjct: 322 PATGAGTVLDSGTMFTRLVAPAYVAVRDEVR 352
>gi|125561849|gb|EAZ07297.1| hypothetical protein OsI_29545 [Oryza sativa Indica Group]
Length = 451
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 92/329 (27%), Positives = 143/329 (43%), Gaps = 52/329 (15%)
Query: 39 FKAGGERERTLSALKQHDTRRHGRM------------MASIDLELGGNGHPSATGL---- 82
F+A + S+L +HD RHG +A + G P+ L
Sbjct: 28 FRADLDHPYAGSSLSRHDVVRHGARASKTRAAWLTAKLAGVLSNRRGGVSPADVRLSPLS 87
Query: 83 ---YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSG 139
+ VG+GTP + VDTGSDL+W C S + G ++DP +SST
Sbjct: 88 DQGHSLTVGIGTPPQPRKLIVDTGSDLIWTQCKLSSSTAVAARHG-SPPVYDPGESSTFA 146
Query: 140 EIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAP 199
+ CSD C+ + + +C+ RC Y YG ++ G + G +
Sbjct: 147 FLPCSDRLCQEGQFS-FKNCTSKNRCVYEDVYGSAAAV-GVLASETFTF----GARRAVS 200
Query: 200 LNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL---- 255
L + FGCG +G L +T GILG + SL++QL + F++CL
Sbjct: 201 LR--LGFGCGALSAGSLIGAT-----GILGLSPESLSLITQLKI-----QRFSYCLTPFA 248
Query: 256 DVVKGGGIF-AIGDVVSPK----VKTTPMVPN---MPHYNVILEEVEVGGNPLDLPTSLL 307
D +F A+ D+ K ++TT +V N +Y V L + +G L +P + L
Sbjct: 249 DKKTSPLLFGAMADLSRHKTTRPIQTTAIVSNPVKTVYYYVPLVGISLGHKRLAVPAASL 308
Query: 308 GTGDE--RGTIIDSGTTLAYLPPMLYDLV 334
+ GTI+DSG+T+AYL ++ V
Sbjct: 309 AMRPDGGGGTIVDSGSTVAYLVEAAFEAV 337
>gi|88174587|gb|ABD39368.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
Group]
Length = 321
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 82/278 (29%), Positives = 126/278 (45%), Gaps = 52/278 (18%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y VGLGTP+ V++DTGS WV C C C T F S+S+T +++
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSASWVFCE-CDGCHTNP------RTFLQSRSTTCAKVS 53
Query: 143 C---------SDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASG 193
C SD C+ + N YP C + V+Y DGS++ G +D + +
Sbjct: 54 CGTSMCLLGGSDPHCQDSEN--YPD------CPFRVSYQDGSASYGILYQDTLTFS---- 101
Query: 194 NLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAH 253
+++ P S FGC N S G++ VDG+LG G S+L Q + + F++
Sbjct: 102 DVQKIP---SFTFGC-NLDS--FGANEFGNVDGLLGMGAGPMSVLKQSSPTFD---GFSY 152
Query: 254 CLDVVKG--------GGIFAIGDVVS-PKVKTTPMV---PNMPHYNVILEEVEVGGNPLD 301
CL + K G F++G V + V+ T MV N + V L + V G L
Sbjct: 153 CLPLQKSERGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLG 212
Query: 302 LPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
L S+ +G + DSG+ L+Y+P ++ + R
Sbjct: 213 LSPSIFS---RKGVVFDSGSELSYIPDRALSVLSQRIR 247
>gi|357439021|ref|XP_003589787.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
gi|355478835|gb|AES60038.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
Length = 456
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 74/270 (27%), Positives = 125/270 (46%), Gaps = 38/270 (14%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPS 133
+G +G YF ++G+G+P Y+ +D+GSD++W+ C C +C ++D +F+P+
Sbjct: 120 SGTEEGSGEYFVRIGIGSPAIYQYMVIDSGSDIVWIQCEPCDQCYNQTD-----PIFNPA 174
Query: 134 KSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASG 193
S++ +ACS N C ++ +C G RC Y V YGDGS T G + I +
Sbjct: 175 TSASFIGVACSSNVCNQLDDDV--ACRKG-RCGYQVAYGDGSYTKGTLALETITIG---- 227
Query: 194 NLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAH 253
+T ++++ GCG+ G + G S + QL A F +
Sbjct: 228 --RTVIQDTAI--GCGHWNEGMFVGAAGLLGLGGGPM-----SFVGQLGA--QTGGAFGY 276
Query: 254 CLDVVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSL-----LG 308
CL V + +G + P + P P+ Y V L + VGG + + + +G
Sbjct: 277 CL-VSRA---MPVGAMWVPLIH-NPFYPSF--YYVSLSGLAVGGIRVPISEQIFQLTDIG 329
Query: 309 TGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
TG G ++D+GT + LP + Y+ F
Sbjct: 330 TG---GVVMDTGTAITRLPTVAYNAFRDAF 356
>gi|242094478|ref|XP_002437729.1| hypothetical protein SORBIDRAFT_10g001440 [Sorghum bicolor]
gi|241915952|gb|EER89096.1| hypothetical protein SORBIDRAFT_10g001440 [Sorghum bicolor]
Length = 486
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 76/298 (25%), Positives = 132/298 (44%), Gaps = 40/298 (13%)
Query: 55 HDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGC 114
+T+ H + S+++ ++G++ G+ + V +DT D+ W+ C C
Sbjct: 122 EETQLHHQAAISVEVGTSQTSSEPSSGIHPAAATDGSSSPPVTVVLDTAGDVPWMRCVPC 181
Query: 115 SRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPS-CSPGVRCEY-VVTYG 172
+ + +DP++SST C+ + C+ RY + C +C+Y VVT G
Sbjct: 182 TFA--------QCADYDPTRSSTYSAFPCNSSACKQL--GRYANGCDANGQCQYMVVTAG 231
Query: 173 DGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQ 232
D +TSG + D++ +N SG+ FGC + G S + DGI+ G+
Sbjct: 232 DSFTTSGTYSSDVLTIN--SGDRV-----EGFRFGCSQNEQG----SFENQADGIMALGR 280
Query: 233 ANSSLLSQLAAAGNVRKEFAHCLDVVK-GGGIFAIGDVV--SPKVKTTPMVPN------- 282
SL++Q ++ F++CL + G F IG + S + TTPM+
Sbjct: 281 GVQSLMAQTSSTYG--DAFSYCLPPTETTKGFFQIGVPIGASYRFVTTPMLKERGGASAA 338
Query: 283 -MPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
Y +L + V G L++P + GT++DS T + LP Y + + FR
Sbjct: 339 AATLYRALLLAITVDGKELNVPAEVFAA----GTVMDSRTIITRLPVTAYGALRAAFR 392
>gi|222630453|gb|EEE62585.1| hypothetical protein OsJ_17388 [Oryza sativa Japonica Group]
Length = 275
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 40/105 (38%), Positives = 61/105 (58%)
Query: 228 LGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSPKVKTTPMVPNMPHYN 287
+G G +N+SL+ QLA + +K FAHCLD + GGIF +G +V PKV+ TP+ Y
Sbjct: 1 MGLGPSNTSLVYQLAKSQKWKKMFAHCLDGKRSGGIFVLGHIVGPKVRKTPLDQTSSRYR 60
Query: 288 VILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYD 332
L E+ VG L L + + TI+++G+ ++YLP +Y
Sbjct: 61 TTLLEITVGETSLSLSAGNVEIKSQNMTILETGSLISYLPEKVYQ 105
>gi|357456413|ref|XP_003598487.1| Peptidase A1, putative [Medicago truncatula]
gi|355487535|gb|AES68738.1| Peptidase A1, putative [Medicago truncatula]
Length = 414
Score = 84.7 bits (208), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 87/267 (32%), Positives = 122/267 (45%), Gaps = 39/267 (14%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y + +GTP + +DT +D W+ C C C + TLF P KS+T ++
Sbjct: 78 YIVRAKIGTPPQTLLLAMDTSNDAAWIPCTACDGCAS--------TLFAPEKSTTFKNVS 129
Query: 143 CSDNFCRTTYNNRYPSCSPGV-RCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLN 201
C+ C+ N P C GV C + +TYG SS + V+D I L T P+
Sbjct: 130 CAAPECKQVPN---PGC--GVSSCNFNLTYG-SSSIAANLVQDTI-------TLATDPV- 175
Query: 202 SSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKG- 260
S FGC ++ +G T A G+LG G+ SLLSQ + F++CL K
Sbjct: 176 PSYTFGCVSKTTG-----TSAPPQGLLGLGRGPLSLLSQ--TQNLYQSTFSYCLPSFKSL 228
Query: 261 --GGIFAIGDVVSPK-VKTTPMVPNMPH---YNVILEEVEVGGNPLDLPTSLLGTGDER- 313
G +G V PK +K TP++ N Y V LE + VG +D+P + L
Sbjct: 229 NFSGSLRLGPVAQPKRIKYTPLLKNPRRSSLYYVNLEAIRVGRKVVDIPPAALAFNPTTG 288
Query: 314 -GTIIDSGTTLAYLPPMLYDLVLSQFR 339
GTI DSGT L +Y V +FR
Sbjct: 289 AGTIFDSGTVFTRLVAPVYVAVRDEFR 315
>gi|125553832|gb|EAY99437.1| hypothetical protein OsI_21406 [Oryza sativa Indica Group]
Length = 409
Score = 84.7 bits (208), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 67/262 (25%), Positives = 110/262 (41%), Gaps = 34/262 (12%)
Query: 90 GTPTDEYYVQVDTGSDLLWVNCAGCSR--CPTKSDLGIKLTLFDPSKSSTSGEIACSDNF 147
GT V +D+GSD+ WV C C C + D LFDP+ S+T + CS
Sbjct: 75 GTSAVSQTVIIDSGSDVPWVQCQPCPLLVCHPQRD-----PLFDPATSTTYAAVPCSSAA 129
Query: 148 CRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFG 207
C R C +C++ +TY +G++ +G + D + L + +FG
Sbjct: 130 CARLGPYRR-GCLANSQCQFGITYANGATATGTYSSDDLTLGPYD-------VVRGFLFG 181
Query: 208 CGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIG 267
C + D GS+ V G L G + S + Q A+ + F++C+ F +
Sbjct: 182 CAH---ADQGSTFSYDVAGTLALGGGSQSFVQQTAS--QYSRVFSYCVPPSTSSFGFIMF 236
Query: 268 DVVSPKVKTTPMVPNMP----------HYNVILEEVEVGGNPLDLPTSLLGTGDERGTII 317
V + P + P Y V+L + V G PL +P ++ ++I
Sbjct: 237 GVPPQRAALVPTFVSTPLLSSSTMSPTFYRVLLRSIIVAGRPLPVPPTVFSA----SSVI 292
Query: 318 DSGTTLAYLPPMLYDLVLSQFR 339
DS T ++ +PP Y + + FR
Sbjct: 293 DSATVISRIPPTAYQALRAAFR 314
>gi|357514989|ref|XP_003627783.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521805|gb|AET02259.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 416
Score = 84.7 bits (208), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 78/270 (28%), Positives = 113/270 (41%), Gaps = 67/270 (24%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGE 140
G Y +GTP + Y DTGSD++W+ C C C ++ F PSKSST
Sbjct: 85 GEYLMTYSVGTPPFKLYGIADTGSDIVWLQCEPCKECYNQTT-----PKFKPSKSSTYKN 139
Query: 141 IACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPL 200
I CS + C+ S G D + L ++G+ + P
Sbjct: 140 IPCSSDLCK-------------------------SGQQGNLSVDTLTLESSTGHPISFP- 173
Query: 201 NSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL----- 255
+ GCG D S + A GI+G G +SL++QL ++ + +F++CL
Sbjct: 174 --KTVIGCGT----DNTVSFEGASSGIVGLGGGPASLITQLGSS--IDAKFSYCLLPNPV 225
Query: 256 -------------DVVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDL 302
VV G G+ V +P VK P+V Y + LE VG ++
Sbjct: 226 ESNTTSKLNFGDTAVVSGDGV-----VSTPIVKKDPIV----FYYLTLEAFSVGNKRIEF 276
Query: 303 PTSLLGTGDERGTIIDSGTTLAYLPPMLYD 332
S G G E IIDSGTTL +P +Y+
Sbjct: 277 EGSSNG-GHEGNIIIDSGTTLTVIPTDVYN 305
>gi|88174583|gb|ABD39366.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
Group]
Length = 321
Score = 84.7 bits (208), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 82/278 (29%), Positives = 126/278 (45%), Gaps = 52/278 (18%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y VGLGTP+ V++DTGS WV C C C T F S+S+T +++
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSTSWVFCE-CDGCHTNP------RTFLQSRSTTCAKVS 53
Query: 143 C---------SDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASG 193
C SD C+ + N YP C + V+Y DGS++ G +D + +
Sbjct: 54 CGTSMCLLGGSDPHCQDSEN--YPD------CPFRVSYQDGSASYGILYQDTLTFS---- 101
Query: 194 NLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAH 253
+++ P S FGC N S G++ VDG+LG G S+L Q + + F++
Sbjct: 102 DVQKIP---SFTFGC-NLDS--FGANEFGNVDGLLGMGAGPMSVLKQSSPRFD---GFSY 152
Query: 254 CLDVVKGG--------GIFAIGDVVS-PKVKTTPMV---PNMPHYNVILEEVEVGGNPLD 301
CL + K G F++G V + V+ T MV N + V L + V G L
Sbjct: 153 CLPLQKSERGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLG 212
Query: 302 LPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
L S+ +G + DSG+ L+Y+P ++ + R
Sbjct: 213 LSPSIFS---RKGVVFDSGSELSYIPDRALSVLSQRIR 247
>gi|356499344|ref|XP_003518501.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 561
Score = 84.7 bits (208), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 82/281 (29%), Positives = 119/281 (42%), Gaps = 38/281 (13%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSG 139
+G YF V +GTP + + +DTGSDL W+ C C C +S +DP SS+
Sbjct: 194 SGEYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPCIACFEQSG-----PYYDPKDSSSFR 248
Query: 140 EIACSDNFCR--TTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKT 197
I+C D C+ + + P + C Y YGDGS+T+G F + +N + N T
Sbjct: 249 NISCHDPRCQLVSAPDPPKPCKAENQSCPYFYWYGDGSNTTGDFALETFTVNLTTPN-GT 307
Query: 198 APLN--SSVIFGCG--NRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAH 253
+ L +V+FGCG NR + G L F SL Q F++
Sbjct: 308 SELKHVENVMFGCGHWNRGLFHGAAGLLGLGKGPLSFASQMQSLYGQ---------SFSY 358
Query: 254 CLDVVKGGGIFAIGDVVSPKVKTTPMVPNM--------------PHYNVILEEVEVGGNP 299
CL V + ++ + K PN+ Y V ++ V V
Sbjct: 359 CL-VDRNSNASVSSKLIFGEDKELLSHPNLNFTSFGGGKDGSVDTFYYVQIKSVMVDDEV 417
Query: 300 LDLP--TSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
L +P T L + GTIIDSGTTL Y Y+++ F
Sbjct: 418 LKIPEETWHLSSEGAGGTIIDSGTTLTYFAEPAYEIIKEAF 458
>gi|54287450|gb|AAV31194.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 351
Score = 84.7 bits (208), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 40/105 (38%), Positives = 61/105 (58%)
Query: 228 LGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSPKVKTTPMVPNMPHYN 287
+G G +N+SL+ QLA + +K FAHCLD + GGIF +G +V PKV+ TP+ Y
Sbjct: 1 MGLGPSNTSLVYQLAKSQKWKKMFAHCLDGKRSGGIFVLGHIVGPKVRKTPLDQTSSRYR 60
Query: 288 VILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYD 332
L E+ VG L L + + TI+++G+ ++YLP +Y
Sbjct: 61 TTLLEITVGETSLSLSAGNVEIKSQNMTILETGSLISYLPEKVYQ 105
>gi|302781476|ref|XP_002972512.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
gi|300159979|gb|EFJ26598.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
Length = 496
Score = 84.7 bits (208), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 78/267 (29%), Positives = 120/267 (44%), Gaps = 44/267 (16%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGE 140
L+ ++G+G+ +DTGS+ + V C SR +FDP+ S + +
Sbjct: 98 ALFSMQLGIGSLQKNLSAIIDTGSEAVLVQCGSRSR-----------PVFDPAASQSYRQ 146
Query: 141 IACSDNFC-----RTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNL 195
+ C C +T+ + P + C Y ++YGD +++G F +D+I LN + +
Sbjct: 147 VPCISQLCLAVQQQTSNGSSQPCVNSSATCTYSLSYGDSRNSTGDFSQDVIFLNSTNSSG 206
Query: 196 KTAPLNSSVIFGCGNRQSG---DLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFA 252
+ V FGC + G DLGS GI+GF + N SL SQL K F+
Sbjct: 207 QAVQFR-DVAFGCAHSPQGFLVDLGSL------GIVGFNRGNLSLPSQLKDRLGGSK-FS 258
Query: 253 HCLDVV----KGGGIFAIGD--VVSPKVKTTPMVPN------MPHYNVILEEVEVGGNPL 300
+C + G+ +GD + KV TP++ N Y V L + V G L
Sbjct: 259 YCFPSQPWQPRATGVIFLGDSGLSKSKVGYTPLLDNPVTPARSQLYYVGLTSISVDGKTL 318
Query: 301 DLPTSLL----GTGDERGTIIDSGTTL 323
+P S TGD GT++DSGTT
Sbjct: 319 AIPESAFKLDPSTGDG-GTVLDSGTTF 344
>gi|297826117|ref|XP_002880941.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297326780|gb|EFH57200.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 397
Score = 84.7 bits (208), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 77/261 (29%), Positives = 106/261 (40%), Gaps = 54/261 (20%)
Query: 82 LYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEI 141
+Y ++ LGTP E ++DTGSDL+W C C C T+ +FDPSKSST E
Sbjct: 60 IYLMRLQLGTPPFEIVAEIDTGSDLIWTQCMPCPNCYTQ-----FAPIFDPSKSSTFKEK 114
Query: 142 ACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLN 201
C N C Y + Y D S ++G + + + SG
Sbjct: 115 RCHGN-----------------SCPYEIIYADESYSTGILATETVTIQSTSGEPFVMAET 157
Query: 202 SSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQL------------AAAGNVRK 249
S GCG S + A+ GI+G SSL+SQ+ ++ G +
Sbjct: 158 S---IGCGLNNSNLMTPGYAASSSGIVGLNMGPSSLISQMDLPIPGLISYCFSSQGTSKI 214
Query: 250 EFAHCLDVVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGT 309
F VV G G A D+ K + P Y + L+ V VG ++ LGT
Sbjct: 215 NFGTNA-VVAGDGTVA-ADMFIKK--------DQPFYYLNLDAVSVGDKRIE----TLGT 260
Query: 310 ---GDERGTIIDSGTTLAYLP 327
+ IDSGTT YLP
Sbjct: 261 PFHAQDGNIFIDSGTTYTYLP 281
>gi|88174556|gb|ABD39353.1| chloroplast nucleoid DNA-binding protein [Oryza barthii]
Length = 321
Score = 84.7 bits (208), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 82/278 (29%), Positives = 126/278 (45%), Gaps = 52/278 (18%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y VGLGTP+ V++DTGS WV C C C T F S+S+T +++
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSTSWVFCE-CDGCHTNP------RTFLQSRSTTCAKVS 53
Query: 143 C---------SDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASG 193
C SD C+ + N YP C + V+Y DGS++ G +D + +
Sbjct: 54 CGTSMCLLGGSDPHCQDSEN--YPD------CPFRVSYQDGSASYGILYQDTLTFS---- 101
Query: 194 NLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAH 253
+++ P S FGC N S G++ VDG+LG G S+L Q + + F++
Sbjct: 102 DVQKIPGFS---FGC-NMDS--FGANEFGNVDGLLGMGAGAMSVLKQSSPTFDC---FSY 152
Query: 254 CLDVVKG--------GGIFAIGDVVS-PKVKTTPMV---PNMPHYNVILEEVEVGGNPLD 301
CL + K G F++G V + V+ T MV N + V L + V G L
Sbjct: 153 CLPLQKSERGFFSKTTGYFSLGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLG 212
Query: 302 LPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
L S+ +G + DSG+ L+Y+P ++ + R
Sbjct: 213 LSPSIFS---RKGVVFDSGSELSYIPDRALSVLSQRIR 247
>gi|225440729|ref|XP_002275391.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
gi|147789748|emb|CAN67404.1| hypothetical protein VITISV_025615 [Vitis vinifera]
Length = 450
Score = 84.7 bits (208), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 78/298 (26%), Positives = 132/298 (44%), Gaps = 53/298 (17%)
Query: 50 SALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWV 109
++L + +HG+ + L P + G + + GTP + VDTGSD++W
Sbjct: 49 ASLSRAHHLKHGKTNPPVKTSL----FPHSYGGHSISLSFGTPPQKLSFLVDTGSDVVWA 104
Query: 110 NCA---GCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTY-----------NNR 155
C C+ C + K+ +FDP SS+S + C + C +TY N
Sbjct: 105 PCTTDYTCTNCSFSAADPKKVPIFDPKLSSSSKILDCRNPKCVSTYFPYVHLGCPRCNGN 164
Query: 156 YPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGD 215
CS C Y YG G+S SGYF+ + ++ + + + + GC + +
Sbjct: 165 SKHCS--YACPYSTQYGTGAS-SGYFLLENLKFPRKTIR--------NFLLGCTTSAARE 213
Query: 216 LGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL-----DVVKGGG--IFAIGD 268
L S D + GFG++ SL Q+ K+FA+CL D + G I D
Sbjct: 214 LSS------DALAGFGRSMFSLPIQMGV-----KKFAYCLNSHDYDDTRNSGKLILDYRD 262
Query: 269 VVSPKVKTTPMVPNMP----HYNVILEEVEVGGNPLDLPTSLLGTGDE--RGTIIDSG 320
+ + TP + + P +Y++ ++++++G L +P+ L G + G IIDSG
Sbjct: 263 GKTKGLSYTPFLKSPPASAFYYHLGVKDIKIGNKLLRIPSKYLAPGSDGRSGVIIDSG 320
>gi|88174581|gb|ABD39365.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
Group]
Length = 321
Score = 84.7 bits (208), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 82/278 (29%), Positives = 125/278 (44%), Gaps = 52/278 (18%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y VGLGTP V++DTGS WV C C C T F S+S+T +++
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNP------RTFLQSRSTTCAKVS 53
Query: 143 C---------SDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASG 193
C SD C+ + N YP C + V+Y DGS++ G +D + +
Sbjct: 54 CGTSMCLLGGSDPHCQDSEN--YPDCP------FRVSYQDGSASYGILYQDTLTFS---- 101
Query: 194 NLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAH 253
+++ P S FGC N S G++ VDG+LG G S+L Q + + F++
Sbjct: 102 DVQKIP---SFTFGC-NLDS--FGANEFGNVDGLLGMGAGPMSVLKQSSPRFD---GFSY 152
Query: 254 CLDVVKGG--------GIFAIGDVVS-PKVKTTPMV---PNMPHYNVILEEVEVGGNPLD 301
CL + K G F++G V + V+ T MV N + V L + V G L
Sbjct: 153 CLPLQKSERGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLG 212
Query: 302 LPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
L S+ +G + DSG+ L+Y+P ++ + R
Sbjct: 213 LSPSIFS---RKGVVFDSGSELSYIPDRALSVLSQRIR 247
>gi|88174597|gb|ABD39373.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
gi|88174601|gb|ABD39375.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
gi|88174603|gb|ABD39376.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 84.3 bits (207), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 82/278 (29%), Positives = 125/278 (44%), Gaps = 52/278 (18%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y VGLGTP V++DTGS WV C C C T F S+S+T +++
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNP------RTFLQSRSTTCAKVS 53
Query: 143 C---------SDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASG 193
C SD C+ + N YP C + V+Y DGS++ G +D + +
Sbjct: 54 CGTSMCLLGGSDPHCQDSEN--YPD------CPFRVSYQDGSASYGILYQDTLTFS---- 101
Query: 194 NLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAH 253
+++ P S FGC N S G++ VDG+LG G S+L Q + + F++
Sbjct: 102 DVQKIP---SFTFGC-NLDS--FGANEFGNVDGLLGMGAGPMSVLKQSSPTFD---GFSY 152
Query: 254 CLDVVKG--------GGIFAIGDVVS-PKVKTTPMV---PNMPHYNVILEEVEVGGNPLD 301
CL + K G F++G V + V+ T MV N + V L + V G L
Sbjct: 153 CLPLQKSERGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLG 212
Query: 302 LPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
L S+ +G + DSG+ L+Y+P ++ + R
Sbjct: 213 LSPSIFS---RKGVVFDSGSELSYIPDRALSVLSQRIR 247
>gi|359476199|ref|XP_003631804.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 421
Score = 84.3 bits (207), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 78/262 (29%), Positives = 110/262 (41%), Gaps = 49/262 (18%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGE 140
G + V GTP + + +DTGS + W C C C S + S +
Sbjct: 126 GNFLVDVAFGTPPQNFMLILDTGSSITWTQCKACVNCLQDSHRYFNWSASSTYSSGS--- 182
Query: 141 IACSDNFCRTTYNNRYPSCSPG-VRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAP 199
C PG V Y +TYGD S++ G + D + L+ +
Sbjct: 183 ------------------CIPGTVENNYNMTYGDDSTSVGNYGCDTM-------TLEPSD 217
Query: 200 LNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVK 259
+ FGCG GD GS VDG+LG GQ S +SQ A+ N K F++CL
Sbjct: 218 VFQKFQFGCGRNNKGDFGS----GVDGMLGLGQGQLSTVSQTASKFN--KVFSYCLPEED 271
Query: 260 GGGIFAIGDVV---SPKVKTTPMVPNMP-------HYNVILEEVEVGGNPLDLPTSLLGT 309
G G+ S +K T +V N P +Y V L ++ VG L++P+S+ +
Sbjct: 272 SIGSLLFGEKATSQSSSLKFTSLV-NGPGTLQESGYYFVNLSDISVGNERLNIPSSVFAS 330
Query: 310 GDERGTIIDSGTTLAYLPPMLY 331
GTIIDS T + LP Y
Sbjct: 331 ---PGTIIDSRTVITRLPQRAY 349
>gi|88174554|gb|ABD39352.1| chloroplast nucleoid DNA-binding protein [Oryza barthii]
Length = 321
Score = 84.3 bits (207), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 82/278 (29%), Positives = 126/278 (45%), Gaps = 52/278 (18%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y VGLGTP+ V++DTGS WV C C C T F S+S+T +++
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSTSWVFCE-CDGCHTNP------RTFLQSRSTTCAKVS 53
Query: 143 C---------SDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASG 193
C SD C+ + N YP C + V+Y DGS++ G +D + +
Sbjct: 54 CGTSMCLLGGSDPHCQDSEN--YPD------CPFRVSYQDGSASYGILYQDTLTFS---- 101
Query: 194 NLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAH 253
+++ P S FGC N S G++ VDG+LG G S+L Q + + F++
Sbjct: 102 DVQKIPGFS---FGC-NMDS--FGANEFGNVDGLLGMGAGAMSVLKQSSPTFDC---FSY 152
Query: 254 CLDVVKG--------GGIFAIGDVVS-PKVKTTPMV---PNMPHYNVILEEVEVGGNPLD 301
CL + K G F++G V + V+ T MV N + V L + V G L
Sbjct: 153 CLPLQKSERGFFSKTTGYFSLGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLG 212
Query: 302 LPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
L S+ +G + DSG+ L+Y+P ++ + R
Sbjct: 213 LSPSIFS---RKGVVFDSGSELSYIPDRALSVLSQRIR 247
>gi|88174605|gb|ABD39377.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 84.3 bits (207), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 82/278 (29%), Positives = 125/278 (44%), Gaps = 52/278 (18%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y VGLGTP V++DTGS WV C C C T F S+S+T +++
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNP------RTFLQSRSTTCAKVS 53
Query: 143 C---------SDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASG 193
C SD C+ + N YP C + V+Y DGS++ G +D + +
Sbjct: 54 CGTSMCLLGGSDPHCQDSEN--YPDCP------FRVSYQDGSASYGILYQDTLTFS---- 101
Query: 194 NLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAH 253
+++ P S FGC N S G++ VDG+LG G S+L Q + + F++
Sbjct: 102 DVQKIP---SFTFGC-NLDS--FGANEFGNVDGLLGMGAGPMSVLKQSSPRFD---GFSY 152
Query: 254 CLDVVKG--------GGIFAIGDVVS-PKVKTTPMV---PNMPHYNVILEEVEVGGNPLD 301
CL + K G F++G V + V+ T MV N + V L + V G L
Sbjct: 153 CLPLQKSERGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLG 212
Query: 302 LPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
L S+ +G + DSG+ L+Y+P ++ + R
Sbjct: 213 LSPSIFS---RKGVVFDSGSELSYIPDRALSVLSQRIR 247
>gi|88174589|gb|ABD39369.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 84.3 bits (207), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 82/278 (29%), Positives = 125/278 (44%), Gaps = 52/278 (18%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y VGLGTP V++DTGS WV C C C T F S+S+T +++
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSASWVFCE-CDGCHTNP------RTFLQSRSTTCAKVS 53
Query: 143 C---------SDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASG 193
C SD C+ + N YP C + V+Y DGS++ G +D + +
Sbjct: 54 CGTSMCLLGGSDPHCQDSEN--YPD------CPFRVSYQDGSASYGILYQDTLTFS---- 101
Query: 194 NLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAH 253
+++ P S FGC N S G++ VDG+LG G S+L Q + + F++
Sbjct: 102 DVQKIP---SFTFGC-NLDS--FGANEFGNVDGLLGMGAGPMSVLKQSSPTFD---GFSY 152
Query: 254 CLDVVKG--------GGIFAIGDVVS-PKVKTTPMV---PNMPHYNVILEEVEVGGNPLD 301
CL + K G F++G V + V+ T MV N + V L + V G L
Sbjct: 153 CLPLQKSERGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLG 212
Query: 302 LPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
L S+ +G + DSG+ L+Y+P ++ + R
Sbjct: 213 LSPSIFS---RKGVVFDSGSELSYIPDRALSVLSQRIR 247
>gi|326500408|dbj|BAK06293.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 475
Score = 84.3 bits (207), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 73/264 (27%), Positives = 111/264 (42%), Gaps = 37/264 (14%)
Query: 90 GTPT--DEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNF 147
G PT + + +DT D+ W+ CA C P + LFDP+ SST+ + C
Sbjct: 140 GDPTVVSQQTMAIDTTVDVPWIQCAPC---PIPQCYPQRDPLFDPTTSSTAAAVRCRSPA 196
Query: 148 CRT--TYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLN--QASGNLKTAPLNSS 203
CR+ Y N + S C Y++ Y D +T+G ++ D + ++ A N +
Sbjct: 197 CRSLGPYGNGCSNRSANAECRYLIEYSDDRATAGTYMTDTLTISGTTAVRNFR------- 249
Query: 204 VIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAA-GNVRKEFAHCLDVVKGGG 262
FGC + G T G + G SLL+Q A + GN F++C+ G
Sbjct: 250 --FGCSHAVRGRFSDLT----AGTMSLGGGAQSLLAQTARSLGNA---FSYCVPQASASG 300
Query: 263 IFAIGDVVSPK----VKTTPMVP---NMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGT 315
+IG + TTP+V N Y V L+ + V G L +P G
Sbjct: 301 FLSIGGPATTNSTTVFATTPLVRSAINPSLYLVRLQGIVVAGRRLGIPPVAFSA----GA 356
Query: 316 IIDSGTTLAYLPPMLYDLVLSQFR 339
++DS + LPP Y + FR
Sbjct: 357 VMDSSAVITQLPPTAYRALRRAFR 380
>gi|88174571|gb|ABD39360.1| chloroplast nucleoid DNA-binding protein [Oryza nivara]
Length = 321
Score = 84.3 bits (207), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 82/278 (29%), Positives = 125/278 (44%), Gaps = 52/278 (18%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y VGLGTP V++DTGS WV C C C T F S+S+T +++
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNP------RTFLQSRSTTCAKVS 53
Query: 143 C---------SDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASG 193
C SD C+ + N YP C + V+Y DGS++ G +D + +
Sbjct: 54 CGTSMCLLGGSDPHCQDSEN--YPD------CPFRVSYQDGSASYGILYQDTLTFS---- 101
Query: 194 NLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAH 253
+++ P S FGC N S G++ VDG+LG G S+L Q + + F++
Sbjct: 102 DVQKIP---SFTFGC-NLDS--FGANEFGNVDGLLGMGAGPMSVLKQSSPRFD---GFSY 152
Query: 254 CLDVVKG--------GGIFAIGDVVS-PKVKTTPMV---PNMPHYNVILEEVEVGGNPLD 301
CL + K G F++G V + V+ T MV N + V L + V G L
Sbjct: 153 CLPLQKSERGFFSKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLG 212
Query: 302 LPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
L S+ +G + DSG+ L+Y+P ++ + R
Sbjct: 213 LSPSIFS---RKGVVFDSGSELSYIPDRALSVLSQRIR 247
>gi|115479237|ref|NP_001063212.1| Os09g0423500 [Oryza sativa Japonica Group]
gi|50725896|dbj|BAD33424.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|50726136|dbj|BAD33657.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
sativa Japonica Group]
gi|113631445|dbj|BAF25126.1| Os09g0423500 [Oryza sativa Japonica Group]
gi|215767614|dbj|BAG99842.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222641596|gb|EEE69728.1| hypothetical protein OsJ_29412 [Oryza sativa Japonica Group]
Length = 473
Score = 84.3 bits (207), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 78/278 (28%), Positives = 118/278 (42%), Gaps = 59/278 (21%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y VGLG E V VDT S+L WV CA C+ C + LFDP+ S + +
Sbjct: 127 YVATVGLGG--GEATVIVDTASELTWVQCAPCASCHDQQG-----PLFDPASSPSYAVLP 179
Query: 143 CSDNFC-----------RTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQA 191
C+ + C PSCS Y ++Y DGS + G D + L
Sbjct: 180 CNSSSCDALQVATGSAAGACGGGEQPSCS------YTLSYRDGSYSQGVLAHDKLSL--- 230
Query: 192 SGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQ-LAAAGNVRKE 250
+ +FGCG G G ++ G++G G++ SL+SQ + G V
Sbjct: 231 -----AGEVIDGFVFGCGTSNQGPFGGTS-----GLMGLGRSQLSLISQTMDQFGGV--- 277
Query: 251 FAHCLDV--VKGGGIFAIGDVVSPKVKTTPMVPNM--------PHYNVILEEVEVGGNPL 300
F++CL + + G +GD S +TP+V P Y V L + +GG +
Sbjct: 278 FSYCLPLKESESSGSLVLGDDTSVYRNSTPIVYTTMVSDPVQGPFYFVNLTGITIGGQEV 337
Query: 301 DLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
+ + I+DSGT + L P +Y+ V ++F
Sbjct: 338 ESSAGKV--------IVDSGTIITSLVPSVYNAVKAEF 367
>gi|449464952|ref|XP_004150193.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
gi|449526850|ref|XP_004170426.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 476
Score = 84.3 bits (207), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 78/276 (28%), Positives = 120/276 (43%), Gaps = 38/276 (13%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPS 133
+G +G YF ++G+G+P YV +D+GSD++WV C CS C +SD +FDP+
Sbjct: 128 SGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSD-----PVFDPA 182
Query: 134 KSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASG 193
S+T I+C + C N C+ G RC Y V+YGDGS T G + + +
Sbjct: 183 GSATYAGISCDSSVCDRLDNA---GCNDG-RCRYEVSYGDGSYTRGTLALETLTFGRV-- 236
Query: 194 NLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAH 253
L ++ GCG+ G + G S + QL G F++
Sbjct: 237 ------LIRNIAIGCGHMNRGMFIGAAGLLGLGGGAM-----SFVGQL--GGQTGGAFSY 283
Query: 254 CL---------DVVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPT 304
CL + G G +G P ++ P P+ Y V L + VGG + +P
Sbjct: 284 CLVSRGTESTGTLEFGRGAMPVGAAWVPLIR-NPRAPSF--YYVGLSGLGVGGIRVPIPE 340
Query: 305 SLLGTGD--ERGTIIDSGTTLAYLPPMLYDLVLSQF 338
+ D G ++D+GT + LP Y+ F
Sbjct: 341 QIFELTDLGYGGVVMDTGTAVTRLPAPAYEAFRDTF 376
>gi|326520291|dbj|BAK07404.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 464
Score = 84.3 bits (207), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 82/277 (29%), Positives = 113/277 (40%), Gaps = 37/277 (13%)
Query: 83 YFTKVGLGTPTDEYYV-QVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEI 141
Y +G+GTP + V +DTGSDL+W CA C+ C + +F S S T +
Sbjct: 94 YLIHLGIGTPRPQRVVLHLDTGSDLVWTQCA-CTVC-----FDQPVPVFRASVSHTFSRV 147
Query: 142 ACSDNFCRTTYNNRYPSCSPGVR-CEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPL 200
CSD C C+ R C Y Y D S T+G D +A TA
Sbjct: 148 PCSDPLCGHAVYLPLSGCAARDRSCFYAYGYMDHSITTGKMAEDTFTF-KAPDRADTAAA 206
Query: 201 NSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL----- 255
++ FGCG G + GI GFG SL SQL VR+ F++C
Sbjct: 207 VPNIRFGCGMMNYGLFTPNQ----SGIAGFGTGPLSLPSQL----KVRR-FSYCFTAMEE 257
Query: 256 ----DVVKGGGIFAIGDVVSPKVKTTPMVP--------NMPHYNVILEEVEVGGN--PLD 301
V+ GG I + +++TP P + P Y + L V VG P +
Sbjct: 258 SRVSPVILGGEPENIEAHATGPIQSTPFAPGPAGAPVGSQPFYFLSLRGVTVGETRLPFN 317
Query: 302 LPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
T L GT IDSGT + + P ++ + F
Sbjct: 318 ASTFALKGDGSGGTFIDSGTAITFFPQAVFRSLREAF 354
>gi|225423917|ref|XP_002281973.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 491
Score = 84.3 bits (207), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 83/273 (30%), Positives = 127/273 (46%), Gaps = 35/273 (12%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPS 133
+G +G YF++VG+G+P Y+ VDTGSD+ WV CA C+ C ++D +F+PS
Sbjct: 146 SGASQGSGEYFSRVGIGSPPKHVYMVVDTGSDVNWVQCAPCADCYQQAD-----PIFEPS 200
Query: 134 KSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASG 193
SS+ + C + C++ + + S C Y V+YGDGS T G F + I L+
Sbjct: 201 FSSSYAPLTCETHQCKSLDVSECRNDS----CLYEVSYGDGSYTVGDFATETITLD---- 252
Query: 194 NLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAH 253
+A LN +V GCG+ G G+LG G + S SQ+ A+ F++
Sbjct: 253 --GSASLN-NVAIGCGHDNEGLF-----VGAAGLLGLGGGSLSFPSQINASS-----FSY 299
Query: 254 CL--DVVKGGGIFAIGDVVSPKVKTTPMVPNMP---HYNVILEEVEVGGNPLDLPTSLLG 308
CL + T P++ N Y + + + VGG L +P S
Sbjct: 300 CLVNRDTDSASTLEFNSPIPSHSVTAPLLRNNQLDTFYYLGMTGIGVGGQMLSIPRSSFE 359
Query: 309 TGDERGT---IIDSGTTLAYLPPMLYDLVLSQF 338
DE G I+DSGT + L +Y+ + F
Sbjct: 360 V-DESGNGGIIVDSGTAVTRLQSDVYNSLRDSF 391
>gi|125563764|gb|EAZ09144.1| hypothetical protein OsI_31414 [Oryza sativa Indica Group]
Length = 472
Score = 84.3 bits (207), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 78/278 (28%), Positives = 118/278 (42%), Gaps = 59/278 (21%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y VGLG E V VDT S+L WV CA C+ C + LFDP+ S + +
Sbjct: 126 YVATVGLGG--GEATVIVDTASELTWVQCAPCASCHDQQG-----PLFDPASSPSYAVLP 178
Query: 143 CSDNFC-----------RTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQA 191
C+ + C PSCS Y ++Y DGS + G D + L
Sbjct: 179 CNSSSCDALQVATGSAAGACGGGEQPSCS------YTLSYRDGSYSQGVLAHDKLSL--- 229
Query: 192 SGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQ-LAAAGNVRKE 250
+ +FGCG G G ++ G++G G++ SL+SQ + G V
Sbjct: 230 -----AGEVIDGFVFGCGTSNQGPFGGTS-----GLMGLGRSQLSLISQTMDQFGGV--- 276
Query: 251 FAHCLDV--VKGGGIFAIGDVVSPKVKTTPMVPNM--------PHYNVILEEVEVGGNPL 300
F++CL + + G +GD S +TP+V P Y V L + +GG +
Sbjct: 277 FSYCLPLKESESSGSLVLGDDTSVYRNSTPIVYTTMVSDPVQGPFYFVNLTGITIGGQEV 336
Query: 301 DLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
+ + I+DSGT + L P +Y+ V ++F
Sbjct: 337 ESSAGKV--------IVDSGTIITSLVPSVYNAVKAEF 366
>gi|15228618|ref|NP_191741.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|6850873|emb|CAB71112.1| putative protein [Arabidopsis thaliana]
gi|332646739|gb|AEE80260.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 483
Score = 84.3 bits (207), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 84/287 (29%), Positives = 124/287 (43%), Gaps = 36/287 (12%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPS 133
+G +G YF ++G+GTP Y+ +DTGSD++W+ C+ C C ++D +FDP
Sbjct: 126 SGLSQGSGEYFMRLGVGTPATNVYMVLDTGSDVVWLQCSPCKACYNQTD-----AIFDPK 180
Query: 134 KSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASG 193
KS T + C CR ++ C Y V+YGDGS T G F + + + A
Sbjct: 181 KSKTFATVPCGSRLCRRLDDSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLTFHGA-- 238
Query: 194 NLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAH 253
+ PL GCG+ G G+LG G+ S SQ N +F++
Sbjct: 239 RVDHVPL------GCGHDNEGLF-----VGAAGLLGLGRGGLSFPSQTKNRYN--GKFSY 285
Query: 254 CL-------DVVKGGGIFAIGDVVSPKVKT-TPMVPNMP---HYNVILEEVEVGGNPL-- 300
CL K G+ PK TP++ N Y + L + VGG+ +
Sbjct: 286 CLVDRTSSGSSSKPPSTIVFGNAAVPKTSVFTPLLTNPKLDTFYYLQLLGISVGGSRVPG 345
Query: 301 --DLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFRFWIASL 345
+ L TG+ G IIDSGT++ L Y + FR L
Sbjct: 346 VSESQFKLDATGNG-GVIIDSGTSVTRLTQPAYVALRDAFRLGATKL 391
>gi|449441618|ref|XP_004138579.1| PREDICTED: uncharacterized protein LOC101220661 [Cucumis sativus]
Length = 2819
Score = 84.3 bits (207), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 79/299 (26%), Positives = 132/299 (44%), Gaps = 48/299 (16%)
Query: 63 MMASIDLELGGNGHPSATGLYFTKVGL------GTPTDEYYVQVDTGSDLLWVNCAGCSR 116
M+ ++ ++G PS + V L G+P + + +DTGS+L W++ C +
Sbjct: 974 MVLPLNTQMGLISQPSNKLSFHHNVTLTVSLTVGSPPQQVTMVLDTGSELSWLH---CKK 1030
Query: 117 CPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRT-TYNNRYP-SCSPGVRCEYVVTYGDG 174
P + ++F+P SS+ I CS CRT T + P +C P C +V+Y D
Sbjct: 1031 SPNLT------SVFNPLSSSSYSPIPCSSPICRTRTRDLPNPVTCDPKKLCHAIVSYADA 1084
Query: 175 SSTSGYFVRDIIQLNQASGNLKT-APLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQA 233
SS G N AS N + + +FGC + S DA G++G +
Sbjct: 1085 SSLEG---------NLASDNFRIGSSALPGTLFGCMDSGFSS-NSEEDAKTTGLMGMNRG 1134
Query: 234 NSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDV---------VSPKVKTTPMVPNMP 284
+ S ++QL +F++C+ G+ GD+ +P V+ + +P
Sbjct: 1135 SLSFVTQLGLP-----KFSYCISGRDSSGVLLFGDLHLSWLGNLTYTPLVQISTPLPYFD 1189
Query: 285 H--YNVILEEVEVGGNPLDLPTSLLGTGDERG---TIIDSGTTLAYLPPMLYDLVLSQF 338
Y V L+ + VG L LP S+ D G T++DSGT +L +Y + ++F
Sbjct: 1190 RVAYTVQLDGIRVGNKILPLPKSIFAP-DHTGAGQTMVDSGTQFTFLLGPVYTALRNEF 1247
>gi|326506192|dbj|BAJ86414.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326516358|dbj|BAJ92334.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 492
Score = 84.3 bits (207), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 90/291 (30%), Positives = 126/291 (43%), Gaps = 54/291 (18%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSG 139
+G YFTK+G+GTP + +DTGSD++W+ CA C RC +S +FDP +S +
Sbjct: 137 SGEYFTKIGVGTPATPALMVLDTGSDVVWLQCAPCRRCYEQSG-----QVFDPRRSRSYN 191
Query: 140 EIACSDNFCRTTYNNRYPSCSPGVR---CEYVVTYGDGSSTSGYFVRDIIQLNQASGNLK 196
+ C+ CR R S +R C Y V YGDGS T+G F + + +G +
Sbjct: 192 AVGCAAPLCR-----RLDSGGCDLRRSACLYQVAYGDGSVTAGDFATETLTF---AGGAR 243
Query: 197 TAPLNSSVIFGCGNRQSGDL--GSSTDAAVDGILGF--------GQANSSLLSQLAAAGN 246
A V GCG+ G + G L F G++ S L ++ N
Sbjct: 244 VA----RVALGCGHDNEGLFVAAAGLLGLGRGSLSFPTQISRRYGRSFSYCLVDRTSSAN 299
Query: 247 VRKEFAHCLDVVKGGGIFAIGDVVSPKVKTTPMVPN---MPHYNVILEEVEVGGNPL--- 300
+ V G G A+G V+ TPMV N Y V L + VGG +
Sbjct: 300 TASRSST---VTFGSG--AVGSTVASSF--TPMVKNPRMETFYYVQLIGISVGGARVPGV 352
Query: 301 ---DL---PTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFRFWIASL 345
DL P+S G G I+DSGT++ L Y + FR A L
Sbjct: 353 ANSDLRLDPSSGRG-----GVIVDSGTSVTRLARPAYSALRDAFRGAAAGL 398
>gi|300681506|emb|CBH32600.1| pepsin A, putative, expressed [Triticum aestivum]
Length = 477
Score = 84.0 bits (206), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 80/293 (27%), Positives = 120/293 (40%), Gaps = 36/293 (12%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC-----AGCSRCPTKSDLGIKLTLFDPSKS 135
G YF + +GTP + + DTGSDL WV C A S P S G F P S
Sbjct: 95 GQYFVRFRVGTPAQPFLLVADTGSDLTWVKCRRPASANSSLSPADSGPGPGRA-FRPEDS 153
Query: 136 STSGEIACSDNFCRTTYNNRYPSC-SPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGN 194
T I+C+ + C + +C +PG C Y Y DGS+ G + + +
Sbjct: 154 RTWAPISCASDTCTKSLPFSLATCPTPGSPCAYDYRYKDGSAARGTVGTESATIALSGRE 213
Query: 195 LKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHC 254
+ A L ++ GC + +G + A DG+L G + S S AA F++C
Sbjct: 214 ERKAKLK-GLVLGCSSSYTG----PSFEASDGVLSLGYSGISFASH--AASRFGGRFSYC 266
Query: 255 ----LDVVKGGGIFAIGD---VVSP------------KVKTTPMVPN---MPHYNVILEE 292
L G V SP + + TP++ + P Y+V L+
Sbjct: 267 LVDHLSPRNATSYLTFGPNPAVSSPRASPSSCAAAAPRARQTPLLLDRRMRPFYDVSLKA 326
Query: 293 VEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFRFWIASL 345
+ V G L +P ++ G I+DSGT+L L Y V++ +A L
Sbjct: 327 ISVAGEFLKIPRAVWDVEAGGGVILDSGTSLTVLAKPAYRAVVAALSKGLAGL 379
>gi|19424106|gb|AAL87345.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
thaliana]
Length = 500
Score = 84.0 bits (206), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 88/326 (26%), Positives = 140/326 (42%), Gaps = 52/326 (15%)
Query: 25 GGVMGNFVFEVENKFKAGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYF 84
G++ F VE G L + DTR + + + +G +G YF
Sbjct: 114 AGIVAKIRFAVE------GVDRSDLKPVYNEDTRYQTEDLTTPVV----SGASQGSGEYF 163
Query: 85 TKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACS 144
+++G+GTP + Y+ +DTGSD+ W+ C C+ C +SD +F+P+ SST + CS
Sbjct: 164 SRIGVGTPAKDMYLVLDTGSDVNWIQCEPCADCYQQSD-----PVFNPTSSSTYKSLTCS 218
Query: 145 DNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSV 204
C + +C +C Y V+YGDGS T G D + SG + ++V
Sbjct: 219 APQCSLLETS---ACRSN-KCLYQVSYGDGSFTVGELATDTVTFGN-SGKI------NNV 267
Query: 205 IFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIF 264
GCG+ G + G S+ +Q+ A F++CL V + G
Sbjct: 268 ALGCGHDNEGLFTGAAGLLGLGGGVL-----SITNQMKAT-----SFSYCL-VDRDSGKS 316
Query: 265 AIGDVVSPKV----KTTPMVPNMP---HYNVILEEVEVGGNPLDLPTSLL-----GTGDE 312
+ D S ++ T P++ N Y V L VGG + LP ++ G+G
Sbjct: 317 SSLDFNSVQLGGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSG-- 374
Query: 313 RGTIIDSGTTLAYLPPMLYDLVLSQF 338
G I+D GT + L Y+ + F
Sbjct: 375 -GVILDCGTAVTRLQTQAYNSLRDAF 399
>gi|297745479|emb|CBI40559.3| unnamed protein product [Vitis vinifera]
Length = 436
Score = 84.0 bits (206), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 52/159 (32%), Positives = 76/159 (47%), Gaps = 21/159 (13%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPS 133
+G +G YFT++G+GTP Y+ +DTGSD++W+ CA C +C +++D +FDP
Sbjct: 165 SGLAQGSGEYFTRLGVGTPPKYVYMVLDTGSDVVWIQCAPCRKCYSQTD-----PVFDPK 219
Query: 134 KSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASG 193
KS + I+C C + P C+ C Y V YGDGS T G F + +
Sbjct: 220 KSGSFSSISCRSPLCLRLDS---PGCNSRQSCLYQVAYGDGSFTFGEFSTETLTFRGTR- 275
Query: 194 NLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQ 232
P V GCG+ G G+LG G+
Sbjct: 276 ----VP---KVALGCGHDNEGLF-----VGAAGLLGLGR 302
>gi|359806832|ref|NP_001241567.1| uncharacterized protein LOC100819698 precursor [Glycine max]
gi|255638149|gb|ACU19388.1| unknown [Glycine max]
Length = 437
Score = 84.0 bits (206), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 93/343 (27%), Positives = 134/343 (39%), Gaps = 46/343 (13%)
Query: 18 HQWAVGGGGVMGNFVFEVENKFKAGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHP 77
HQ G + VF + F+ S LK + RM L + P
Sbjct: 27 HQHDHDGSTLQVFHVFSPCSPFRPSKPMSWEESVLKLQ-AKDQARMQYLSSLVARRSIVP 85
Query: 78 SATG-------LYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLF 130
A+G Y K +GTP + +DT +D WV C C C T T F
Sbjct: 86 IASGRQITQSPTYIVKAKIGTPAQTLLLAMDTSNDASWVPCTACVGCSTT-------TPF 138
Query: 131 DPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQ 190
P+KS+T ++ C + C+ N P+C G C + TYG SS + V+D +
Sbjct: 139 APAKSTTFKKVGCGASQCKQVRN---PTCD-GSACAFNFTYGT-SSVAASLVQDTV---- 189
Query: 191 ASGNLKTAPLNSSVIFGCGNRQSG-DLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRK 249
L T P+ + FGC + +G + + A + L Q
Sbjct: 190 ---TLATDPV-PAYAFGCIQKVTGSSVPPQGLLGLGRGPLSLLAQTQKLYQ--------S 237
Query: 250 EFAHCLDVVKG---GGIFAIGDVVSPK-VKTTPMVPNMPH---YNVILEEVEVGGNPLDL 302
F++CL K G +G V PK +K TP++ N Y V L + VG +D+
Sbjct: 238 TFSYCLPSFKTLNFSGSLRLGPVAQPKRIKFTPLLKNPRRSSLYYVNLVAIRVGRRIVDI 297
Query: 303 PTSLLGTGDER--GTIIDSGTTLAYLPPMLYDLVLSQFRFWIA 343
P L GT+ DSGT L Y+ V ++FR IA
Sbjct: 298 PPEALAFNANTGAGTVFDSGTVFTRLVEPAYNAVRNEFRRRIA 340
>gi|357116170|ref|XP_003559856.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 460
Score = 84.0 bits (206), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 91/333 (27%), Positives = 136/333 (40%), Gaps = 47/333 (14%)
Query: 35 VENKFKAGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATG-------LYFTKV 87
V+N S + +R R++ L G G P A+G Y +
Sbjct: 39 VKNSNNDAAPSSSWTSFIAAQTSRDTSRVLYLSSLASGFGGAPLASGRQLLHTPTYLVRA 98
Query: 88 GLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNF 147
LGTP + VDT +D WV CAGC CPT + F+P+ S+T + C
Sbjct: 99 SLGTPPQRLLLAVDTSNDAAWVPCAGCHGCPTTAP------SFNPASSATFRPVPCGAPP 152
Query: 148 CRTTYNNRYPSCSPGVR----CEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSS 203
C N PSC+ + C + ++YGD SS +D + + G +K
Sbjct: 153 CSQAPN---PSCTSLAKSKNSCGFSLSYGD-SSLDATLSQDNLAVTANGGVIK------G 202
Query: 204 VIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL-----DVV 258
FGC + +G + A G+LG G+ ++Q G F++CL
Sbjct: 203 YTFGCLTKSNG-----SAAPAQGLLGLGRGPLGFVAQ--TKGIYEGTFSYCLPSYYRSAA 255
Query: 259 KGGGIFAIGDVVSP---KVKTTPMV--PNMPH-YNVILEEVEVGGNPLDLPTSLLG--TG 310
G +G P K+KTTP++ P+ P Y V + V +G + +P S L
Sbjct: 256 NFSGSLTLGRKGQPAPEKMKTTPLLASPHRPSLYYVAMTGVRIGKKSVPIPPSALAFDAA 315
Query: 311 DERGTIIDSGTTLAYLPPMLYDLVLSQFRFWIA 343
GT++DSGT A L Y V + R +A
Sbjct: 316 TGAGTVLDSGTMFARLAQPAYAAVRDEVRRRVA 348
>gi|242050432|ref|XP_002462960.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
gi|241926337|gb|EER99481.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
Length = 445
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 96/328 (29%), Positives = 140/328 (42%), Gaps = 67/328 (20%)
Query: 51 ALKQHDTRRHGRMMAS-----IDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSD 105
AL++ R + R +A+ + P+A G Y + +GTP Y DTGSD
Sbjct: 50 ALRRDMHRHNARQLAASSSNGTTVSAPTQISPTA-GEYLMTLAIGTPPVSYQAIADTGSD 108
Query: 106 LLWVNCAGC-SRC---PTKSDLGIKLTLFDPSKSSTSGEIACSDNF--CRTTYNNRYPSC 159
L+W CA C S+C PT L++PS S+T + C+ + C P
Sbjct: 109 LIWTQCAPCSSQCFQQPTP--------LYNPSSSTTFAVLPCNSSLSMCAAALAGTTP-- 158
Query: 160 SPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSS----VIFGCGNRQSGD 215
PG C Y +TYG G TS Y + ++ P N + + FGC N G
Sbjct: 159 PPGCTCMYNMTYGSG-WTSVYQGSETFTFGSST------PANQTGVPGIAFGCSNASGGF 211
Query: 216 LGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVK---GGGIFAIGDVVSP 272
SS G++G G+ + SL+SQL +F++CL + +G S
Sbjct: 212 NTSS----ASGLVGLGRGSLSLVSQLGV-----PKFSYCLTPYQDTNSTSTLLLGPSASL 262
Query: 273 K----VKTTPMV------PNMPHYNVILEEVEVGGNPLDLPTSLL-----GTGDERGTII 317
V +TP V P +Y + L + +G L +PT+ L GTG G II
Sbjct: 263 NDTGGVSSTPFVASPSDAPMSTYYYLNLTGISLGTTALSIPTTALSLKADGTG---GFII 319
Query: 318 DSGTTLAYLPPMLYDLVLSQFRFWIASL 345
DSGTT+ +L + Q R + SL
Sbjct: 320 DSGTTIT----LLGNTAYQQVRAAVVSL 343
>gi|297828001|ref|XP_002881883.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
lyrata]
gi|297327722|gb|EFH58142.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
lyrata]
Length = 529
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 75/295 (25%), Positives = 131/295 (44%), Gaps = 33/295 (11%)
Query: 61 GRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTK 120
G+++A+++ +G +G YF V +GTP + + +DTGSDL W+ C C C +
Sbjct: 145 GKLIATLE-----SGMTLGSGEYFMDVLVGTPPKHFSLILDTGSDLNWLQCLPCYDCFHQ 199
Query: 121 SDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYP-SC-SPGVRCEYVVTYGDGSSTS 178
++ +DP S++ I C+D C + P C S C Y YGD S+T+
Sbjct: 200 NE-----AFYDPKTSASFKNITCNDPRCSLISSPEPPVQCKSDNQSCPYFYWYGDRSNTT 254
Query: 179 GYFVRDIIQLNQASGNLKTAPLN-SSVIFGCGNRQSGDLGSSTDAAVDGILG-------- 229
G F + +N + +++ +++FGCG+ G ++ G
Sbjct: 255 GDFAVETFTVNLTTTEGRSSEYKVENMMFGCGHWNRGLFSGASGLLGLGRGPLSFSSQLQ 314
Query: 230 --FGQANSSLLSQLAAAGNVRKE--FAHCLDVVKGGGIFAIGDVVSPKVKTTPMVPNMPH 285
+G + S L + NV + F D++ + V+ K +
Sbjct: 315 SLYGHSFSYCLVDRNSDTNVSSKLIFGEDKDLLNHTNL-NFTSFVNGKENSVETF----- 368
Query: 286 YNVILEEVEVGGNPLDLPTSLLGTGDE--RGTIIDSGTTLAYLPPMLYDLVLSQF 338
Y + ++ + VGG LD+P + GTIIDSGTTL+Y Y+++ ++F
Sbjct: 369 YYIQIKSILVGGEALDIPEETWNISPDGAGGTIIDSGTTLSYFAEPAYEIIKNKF 423
>gi|326503602|dbj|BAJ86307.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 461
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 75/256 (29%), Positives = 112/256 (43%), Gaps = 38/256 (14%)
Query: 87 VGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDN 146
+ +GTP + +DTGS+L W+ CA P + F P SST + C+
Sbjct: 89 LAVGTPPQNVTMVLDTGSELSWLLCA-----PAGARNKFSAMSFRPRASSTFAAVPCASA 143
Query: 147 FCRTTYNNRYPSCS-PGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVI 205
CR+ P+C RC ++Y DGSS+ G D+ + PL ++
Sbjct: 144 QCRSRDLPSPPACDGASSRCSVSLSYADGSSSDGALATDVFAVGSGP------PLRAA-- 195
Query: 206 FGCGNRQSGDLGSSTD-AAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIF 264
FGC S SS D A G+LG + S +SQ + + F++C+ G+
Sbjct: 196 FGC---MSSAFDSSPDGVASAGLLGMNRGALSFVSQAST-----RRFSYCISDRDDAGVL 247
Query: 265 AIGDVVSP---KVKTTPMV-PNMP-------HYNVILEEVEVGGNPLDLPTSLLGTGDER 313
+G P + TPM P +P Y+V L + VGG L +P S+L D
Sbjct: 248 LLGHSDLPTFLPLNYTPMYQPALPLPYFDRVAYSVQLLGIRVGGKHLPIPASVLAP-DHT 306
Query: 314 G---TIIDSGTTLAYL 326
G T++DSGT +L
Sbjct: 307 GAGQTMVDSGTQFTFL 322
>gi|223946005|gb|ACN27086.1| unknown [Zea mays]
Length = 336
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 74/251 (29%), Positives = 118/251 (47%), Gaps = 38/251 (15%)
Query: 100 VDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSC 159
+DTGSDL+W CA C C + FD KS+T + C + C + + PSC
Sbjct: 1 MDTGSDLIWTQCAPCLLCADQ-----PTPYFDVKKSATYRALPCRSSRCASLSS---PSC 52
Query: 160 SPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSS 219
+ C Y YGD +ST+G + A+ A +++ FGCG+ +GDL +S
Sbjct: 53 FKKM-CVYQYYYGDTASTAGVLANETFTFGAANSTKVRA---TNIAFGCGSLNAGDLANS 108
Query: 220 TDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGG-------GIFA----IGD 268
+ G++GFG+ SL+SQL + F++CL G++A
Sbjct: 109 S-----GMVGFGRGPLSLVSQLGPS-----RFSYCLTSYLSATPSRLYFGVYANLSSTNT 158
Query: 269 VVSPKVKTTPMV--PNMPH-YNVILEEVEVGGNPLDLPTSLLGTGDE--RGTIIDSGTTL 323
V++TP V P +P+ Y + L+ + +G L + + D+ G IIDSGT++
Sbjct: 159 SSGSPVQSTPFVINPALPNMYFLSLKAISLGTKLLPIDPLVFAINDDGTGGVIIDSGTSI 218
Query: 324 AYLPPMLYDLV 334
+L Y+ V
Sbjct: 219 TWLQQDAYEAV 229
>gi|357140068|ref|XP_003571594.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
distachyon]
Length = 533
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 83/274 (30%), Positives = 118/274 (43%), Gaps = 41/274 (14%)
Query: 83 YFTKVGLGTP-TDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEI 141
Y T + LG V VDTGSDL WV C CP S + LFDP+ S T +
Sbjct: 180 YVTTIALGGGGAKNLTVIVDTGSDLTWVQ---CEPCPGSSCYAQRDPLFDPAASPTFAAV 236
Query: 142 ACSDNFCRTTYNNRYPSCSPGV----------RCEYVVTYGDGSSTSGYFVRDIIQLNQA 191
C C + + + +PG RC Y ++YGDGS + G +D + L
Sbjct: 237 PCGSPACAASLKD--ATGAPGSCARSAGNSEQRCYYALSYGDGSFSRGVLAQDTLGLG-- 292
Query: 192 SGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAA-GNVRKE 250
T L+ +FGCG G G + G++G G+ + SL+SQ AA G V
Sbjct: 293 ----TTTKLD-GFVFGCGLSNRGLFGGTA-----GLMGLGRTDLSLVSQTAARFGGV--- 339
Query: 251 FAHCLDV-VKGGGIFAIGDVVS---PKVKTTPMV--PNMPHYNVILEEVEVGGNPLDLPT 304
F++CL G ++G S P + T M+ P P + I G L
Sbjct: 340 FSYCLPATTTSTGSLSLGPGPSSSFPNMAYTRMIADPTQPPFYFINITGAAVGGGAALTA 399
Query: 305 SLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
G G+ ++DSGT + L P +Y V ++F
Sbjct: 400 PGFGAGN---VLVDSGTVITRLAPSVYKAVRAEF 430
>gi|242050428|ref|XP_002462958.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
gi|241926335|gb|EER99479.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
Length = 460
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 78/272 (28%), Positives = 109/272 (40%), Gaps = 45/272 (16%)
Query: 78 SATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNC-AGCSRCPTKSDLGIKLTLFDPSKSS 136
++T Y +GTP +DTGSDL+W C A C RC L+ P++S
Sbjct: 95 ASTATYLVDFAIGTPPLALSAVLDTGSDLIWTQCDAPCRRC-----FPQPAPLYAPARSV 149
Query: 137 TSGEIACSDNFCRTTYNNRYPSCSPGVR---------CEYVVTYGDGSSTSGYFVRDIIQ 187
T ++C C + R S C Y +YGDGSST G +
Sbjct: 150 TYANVSCGSRLCDALPSLRPSSRCSASASAPAPERGGCTYYYSYGDGSSTDGVLATETFT 209
Query: 188 LNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNV 247
+ A FGCG G +S+ G++G G+ SL+SQL
Sbjct: 210 FGAGTTVHDLA-------FGCGTDNLGGTDNSS-----GLVGMGRGPLSLVSQLGV---- 253
Query: 248 RKEFAHCL----DVVKGGGIFAIGDV-VSPKVKTTPMVPN------MPHYNVILEEVEVG 296
+F++C D +F +SP K+TP VP+ +Y + LE + VG
Sbjct: 254 -TKFSYCFTPFNDTTTSSPLFLGSSASLSPAAKSTPFVPSPSGPRRSSYYYLSLEGITVG 312
Query: 297 GN--PLDLPTSLLGTGDERGTIIDSGTTLAYL 326
P+D L G IIDSGTT L
Sbjct: 313 DTLLPIDPAVFRLTASGRGGLIIDSGTTFTAL 344
>gi|224091907|ref|XP_002309394.1| predicted protein [Populus trichocarpa]
gi|222855370|gb|EEE92917.1| predicted protein [Populus trichocarpa]
Length = 469
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 86/302 (28%), Positives = 128/302 (42%), Gaps = 60/302 (19%)
Query: 77 PSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAG---CSRCPTKSDLGIKLTLFDPS 133
P + G Y + GTP +DTGS L+W C CSRC + + F P
Sbjct: 86 PRSYGGYSISLNFGTPPQTTKFVMDTGSSLVWFPCTSRYLCSRCDFPNIEVTGIPTFIPK 145
Query: 134 KSSTSGEIACSDNFCRTTYNNRYPS----CSPGVR-----CE-YVVTYGDGSSTSGYFVR 183
+SS+S I C ++ C + + S C P + C YV+ YG GS T+G +
Sbjct: 146 QSSSSNLIGCKNHKCSWLFGPKVQSKCQECDPTTQNCTQSCPPYVIQYGLGS-TAGLLLS 204
Query: 184 DIIQLNQASGNLKTAPLNSSVIFGCG---NRQSGDLGSSTDAAVDGILGFGQANSSLLSQ 240
+ + KT P + GC RQ +GI GFG++ SL SQ
Sbjct: 205 ETLDFPHK----KTIP---GFLVGCSLFSIRQP-----------EGIAGFGRSPESLPSQ 246
Query: 241 LAAAGNVRKEFAHCL------------DVVKGGGIFAIGDVVSPKVKTTPMVPN-----M 283
L K+F++CL D+V G D +P + TP N
Sbjct: 247 LGL-----KKFSYCLVSHAFDDTPASSDLVLDTGS-GSDDTKTPGLSYTPFQKNPTAAFR 300
Query: 284 PHYNVILEEVEVGGNPLDLPTSLL--GTGDERGTIIDSGTTLAYLPPMLYDLVLSQFRFW 341
+Y V+L + +G + +P L G+ GTI+DSGTT ++ +Y+LV +F
Sbjct: 301 DYYYVLLRNIVIGDTHVKVPYKFLVPGSDGNGGTIVDSGTTFTFMEKPVYELVAKEFEKQ 360
Query: 342 IA 343
+A
Sbjct: 361 VA 362
>gi|32487305|emb|CAE05796.1| OSJNBb0046K02.6 [Oryza sativa Japonica Group]
gi|38344664|emb|CAE02326.2| OSJNBb0112E13.8 [Oryza sativa Japonica Group]
gi|125547764|gb|EAY93586.1| hypothetical protein OsI_15371 [Oryza sativa Indica Group]
gi|125589862|gb|EAZ30212.1| hypothetical protein OsJ_14269 [Oryza sativa Japonica Group]
Length = 174
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 66/211 (31%), Positives = 90/211 (42%), Gaps = 41/211 (19%)
Query: 1 MGGLRLLALVVVTVAVVHQWAVGGGGVMGNFVFEVENKFKA--GGERERTLSALKQHDTR 58
M LL+ ++ + VV A G M N VF+V KF G + + AL+ HD
Sbjct: 1 MAAPLLLSTIISALVVV---ASSTRGTMANGVFQVRRKFHIVDGVYKGSDIGALQIHDGN 57
Query: 59 RHGR--MMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSR 116
RH R +MAS +L LGG P TG G + C R
Sbjct: 58 RHRRHNLMAS-ELPLGGFSIPYGTGYALIISAFG-------------------HVCVCLR 97
Query: 117 CPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSS 176
KLT +DP S +S E+ C D C + P C+ +RC Y+ Y DG
Sbjct: 98 ---------KLTFYDPRSSVSSKEVKCDDTICTSR-----PPCNMTLRCPYIAAYSDGGL 143
Query: 177 TSGYFVRDIIQLNQASGNLKTAPLNSSVIFG 207
T G D++ +Q GN +T P ++SV FG
Sbjct: 144 TMGILFTDLLHYHQLYGNGQTQPTSTSVTFG 174
>gi|15242307|ref|NP_199325.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|9758987|dbj|BAB09497.1| chloroplast nucleoid DNA-binding protein-like [Arabidopsis
thaliana]
gi|332007824|gb|AED95207.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 491
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 90/302 (29%), Positives = 128/302 (42%), Gaps = 62/302 (20%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAG----CSRCPTKSDLGIK-LTLFDPSKSST 137
Y + +GTP V +DTGSDL WV C C C + +K ++F P SST
Sbjct: 83 YLITLNIGTPPQAVQVYLDTGSDLTWVPCGNLSFDCIECYDLKNNDLKSPSVFSPLHSST 142
Query: 138 SGEIACSDNFCRTTY--NNRYPSC------------SPGVR-C-EYVVTYGDGSSTSGYF 181
S +C+ +FC + +N + C S VR C + TYG+G SG
Sbjct: 143 SFRDSCASSFCVEIHSSDNPFDPCAVAGCSVSMLLKSTCVRPCPSFAYTYGEGGLISGIL 202
Query: 182 VRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQL 241
RDI++ + P S FGC +ST GI GFG+ SL SQL
Sbjct: 203 TRDILK-----ARTRDVPRFS---FGC--------VTSTYREPIGIAGFGRGLLSLPSQL 246
Query: 242 AAAGNVRKEFAHCL-------------DVVKGGGIFAIGDVVSPKVK---TTPMVPNMPH 285
G + K F+HC ++ G +I S + TPM PN
Sbjct: 247 ---GFLEKGFSHCFLPFKFVNNPNISSPLILGASALSINLTDSLQFTPMLNTPMYPN--S 301
Query: 286 YNVILEEVEVGGN--PLDLPTSL--LGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFRFW 341
Y + LE + +G N P +P +L + G ++DSGTT +LP Y +L+ +
Sbjct: 302 YYIGLESITIGTNITPTQVPLTLRQFDSQGNGGMLVDSGTTYTHLPEPFYSQLLTTLQST 361
Query: 342 IA 343
I
Sbjct: 362 IT 363
>gi|46275851|gb|AAS86401.1| hypothetical protein [Oryza sativa Japonica Group]
Length = 197
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 41/108 (37%), Positives = 62/108 (57%)
Query: 228 LGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSPKVKTTPMVPNMPHYN 287
+G G +N+SL+ QLA + +K FAHCLD + GGIF +G +V PKV+ TP+ Y
Sbjct: 1 MGLGPSNTSLVYQLAKSQKWKKMFAHCLDGKRSGGIFVLGHIVGPKVRKTPLDQTSSRYR 60
Query: 288 VILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVL 335
L E+ VG L L + + TI+++G+ ++YLP +Y L
Sbjct: 61 TTLLEITVGETSLSLSAGNVEIKSQNMTILETGSLISYLPEKVYQSFL 108
>gi|125527370|gb|EAY75484.1| hypothetical protein OsI_03384 [Oryza sativa Indica Group]
Length = 453
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 82/283 (28%), Positives = 116/283 (40%), Gaps = 38/283 (13%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSG 139
+G Y G+GTP + DTGSDL+W C C+RC + + SS++
Sbjct: 89 SGDYAMSFGIGTPATGLSGEADTGSDLIWTKCGACARCSPRGSPSYYP-----TSSSSAA 143
Query: 140 EIACSDNFC----RTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNL 195
+AC D C R +N S C Y YG+ T Y + I + +
Sbjct: 144 FVACGDRTCGELPRPLCSNVAGGGSGSGNCSYHYAYGNARDTHHY--TEGILMTETFTFG 201
Query: 196 KTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRK------ 249
A + FGC R G G+ + G++G G+ SL++QL +
Sbjct: 202 DDAAAFPGIAFGCTLRSEGGFGTGS-----GLVGLGRGKLSLVTQLNVEAFGYRLSSDLS 256
Query: 250 -----EFAHCLDVVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPT 304
F DV G G +S + T P+V ++P Y V L + VGG + +P+
Sbjct: 257 APSPISFGSLADVTGGNG----DSFMSTPLLTNPVVQDLPFYYVGLTGISVGGKLVQIPS 312
Query: 305 ---SLLGTGDERGTIIDSGTTLAYLPPMLYDLV----LSQFRF 340
S + G I DSGTTL LP Y LV LSQ F
Sbjct: 313 GTFSFDRSTGAGGVIFDSGTTLTMLPDPAYTLVRDELLSQMGF 355
>gi|302821814|ref|XP_002992568.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
gi|300139637|gb|EFJ06374.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
Length = 368
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 78/262 (29%), Positives = 119/262 (45%), Gaps = 44/262 (16%)
Query: 86 KVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSD 145
++G+G+ +DTGS+ + V C SR +FDP+ S + ++ C
Sbjct: 2 QLGIGSLQKNLSAIIDTGSEAVLVQCGSRSR-----------PVFDPAASQSYRQVPCIS 50
Query: 146 NFC-----RTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPL 200
C +T+ + P + C Y ++YGD +++G F +D+I LN + + +
Sbjct: 51 QLCLAVQQQTSNGSSQPCVNSSAACTYSLSYGDSRNSTGDFSQDVIFLNSTNSSSQAVQF 110
Query: 201 NSSVIFGCGNRQSG---DLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDV 257
V FGC + G DLGS GI+GF + N SL SQL K F++C
Sbjct: 111 R-DVAFGCAHSPQGFLVDLGSL------GIVGFNRGNLSLPSQLKDRLGGSK-FSYCFPS 162
Query: 258 V----KGGGIFAIGD--VVSPKVKTTPMV--PNMPH----YNVILEEVEVGGNPLDLPTS 305
+ G+ +GD + KV TP++ P P Y V L + V G L +P S
Sbjct: 163 QPWQPRATGVIFLGDSGLSKSKVSYTPLLDNPVTPARSQLYYVGLTSISVDGKTLAIPES 222
Query: 306 LL----GTGDERGTIIDSGTTL 323
TGD GT++DSGTT
Sbjct: 223 AFKLDPSTGDG-GTVLDSGTTF 243
>gi|242050430|ref|XP_002462959.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
gi|241926336|gb|EER99480.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
Length = 448
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 82/286 (28%), Positives = 117/286 (40%), Gaps = 49/286 (17%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCS--RCPTKSDLGIKLTLFDPSKSSTS 138
G Y + +GTP Y DTGSDL+W CA CS +C L++P+ S+T
Sbjct: 90 GEYLMTLSIGTPPLSYPAIADTGSDLIWTQCAPCSGDQC-----FAQPAPLYNPASSTTF 144
Query: 139 GEIACSDNF--CRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLK 196
G + C+ + C + P PG C Y TYG G T+G + A+ +
Sbjct: 145 GVLPCNSSLSMCAGVLAGKAP--PPGCACMYNQTYGTG-WTAGVQGSETFTFGSAAADQA 201
Query: 197 TAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLD 256
P + FGC N S D S G++G G+ + SL+SQL A F++CL
Sbjct: 202 RVP---GIAFGCSNASSSDWNGSA-----GLVGLGRGSLSLVSQLGAG-----RFSYCLT 248
Query: 257 VVK------------GGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPT 304
+ + G +P V + P +Y + L + +G L +
Sbjct: 249 PFQDTNSTSTLLLGPSAALNGTGVRSTPFVASPAKAPMSTYYYLNLTGISLGAKALSISP 308
Query: 305 SLL-----GTGDERGTIIDSGTTLAYLPPMLYDLVLSQFRFWIASL 345
GTG G IIDSGTT+ L Y Q R + SL
Sbjct: 309 DAFSLKADGTG---GLIIDSGTTITSLVNAAYQ----QVRAAVQSL 347
>gi|357132618|ref|XP_003567926.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 468
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 77/283 (27%), Positives = 118/283 (41%), Gaps = 33/283 (11%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPS 133
+G + TG YF ++ +GTP + + DTGSDL WV C+ S + +F P+
Sbjct: 95 SGAYTGTGQYFVRLRVGTPAQPFVLVADTGSDLTWVKCSSPSSSSSSPAASPPQRVFRPA 154
Query: 134 KSSTSGEIACSDNFCRTTYNNRYPSC-SPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQAS 192
S + + C + C++ +C SP C Y Y D SS G ++ L+ A+
Sbjct: 155 GSKSWSPLPCDSDTCKSYVPFSLANCSSPPDPCSYDYRYKDNSSARG-----VVGLDSAT 209
Query: 193 GNL------KTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGN 246
+L + A L V+ GC G S+ DG+L G +N S S+ AA
Sbjct: 210 VSLSGNDGTRKAKLQ-EVVLGCTTSYDGQSFKSS----DGVLSLGNSNISFASR--AASR 262
Query: 247 VRKEFAHC----LDVVKGGGIFAIGD-----VVSPKVKTTPMV-----PNMPHYNVILEE 292
F++C L G+ + TP+V P Y V ++
Sbjct: 263 FGGRFSYCLVDHLAPRNATSFLTFGNGDSSPGDDSSSRRTPLVLLEDARTRPFYFVSVDA 322
Query: 293 VEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVL 335
V V G L++ + G I+DSGT+L L YD V+
Sbjct: 323 VTVAGERLEILPDVWDFRKNGGAILDSGTSLTILATPAYDAVV 365
>gi|125571687|gb|EAZ13202.1| hypothetical protein OsJ_03122 [Oryza sativa Japonica Group]
Length = 453
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 82/283 (28%), Positives = 116/283 (40%), Gaps = 38/283 (13%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSG 139
+G Y G+GTP + DTGSDL+W C C+RC + + SS++
Sbjct: 89 SGDYAMSFGIGTPATGLSGEADTGSDLIWTKCGACARCSPRGSPSYYP-----TSSSSAA 143
Query: 140 EIACSDNFC----RTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNL 195
+AC D C R +N S C Y YG+ T Y + I + +
Sbjct: 144 FVACGDRTCGELPRPLCSNVAGGGSGSGNCSYHYAYGNARDTHHY--TEGILMTETFTFG 201
Query: 196 KTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRK------ 249
A + FGC R G G+ + G++G G+ SL++QL +
Sbjct: 202 DDAAAFPGIAFGCTLRSEGGFGTGS-----GLVGLGRGKLSLVTQLNVEAFGYRLSSDLS 256
Query: 250 -----EFAHCLDVVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPT 304
F DV G G +S + T P+V ++P Y V L + VGG + +P+
Sbjct: 257 APSPISFGSLADVTGGNG----DSFMSTPLLTNPVVQDLPFYYVGLTGISVGGKLVQIPS 312
Query: 305 ---SLLGTGDERGTIIDSGTTLAYLPPMLYDLV----LSQFRF 340
S + G I DSGTTL LP Y LV LSQ F
Sbjct: 313 GTFSFDRSTGAGGVIFDSGTTLTMLPDPAYTLVRDELLSQMGF 355
>gi|388504358|gb|AFK40245.1| unknown [Medicago truncatula]
Length = 480
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 86/283 (30%), Positives = 130/283 (45%), Gaps = 49/283 (17%)
Query: 77 PSATGL------YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLF 130
P A+G+ Y +GLG V +DTGSDL WV C C C ++ +F
Sbjct: 121 PLASGINLETLNYIVTIGLGN--QNMTVIIDTGSDLTWVQCDPCMSCYSQQG-----PVF 173
Query: 131 DPSKSSTSGEIACSDNFCR----TTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDII 186
+PS SS+ + C+ + C+ TT N + C + V+YGDGS T G + +
Sbjct: 174 NPSNSSSYNSLLCNSSTCQNLQFTTGNTEACESNNPSSCNHTVSYGDGSFTDGELGVEHL 233
Query: 187 QLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAA-G 245
S S+ +FGCG G G V GI+G G++N S++SQ G
Sbjct: 234 SFGGISV--------SNFVFGCGRNNKGLFG-----GVSGIMGLGRSNLSMISQTNTTFG 280
Query: 246 NVRKEFAHCLDVVKGG--GIFAIGDVVSPKVKTTPMV-------PNMPHYNVI-LEEVEV 295
V F++CL G G IG+ S TP+ P + ++ V+ L ++V
Sbjct: 281 GV---FSYCLPTTDSGASGSLVIGNESSLFKNLTPIAYTSMVSNPQLSNFYVLNLTGIDV 337
Query: 296 GGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
GG + + + G G G +IDSGT + L P LY+ + ++F
Sbjct: 338 GG--VAIQDTSFGNG---GILIDSGTVITRLAPSLYNALKAEF 375
>gi|242081367|ref|XP_002445452.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
gi|241941802|gb|EES14947.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
Length = 459
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 83/259 (32%), Positives = 122/259 (47%), Gaps = 48/259 (18%)
Query: 87 VGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDN 146
VG+GTP V +D GSDLLW C+ PT L +FD ++SS+ + C
Sbjct: 111 VGVGTPPQPSKVILDLGSDLLWTQCSLVG--PTAKQLE---PVFDAARSSSFSVLPCDSK 165
Query: 147 FCRT-TYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVI 205
C T+ N+ +C+ +C Y YG ++T G + G +++++
Sbjct: 166 LCEAGTFTNK--TCT-DRKCAYENDYGIMTAT-GVLATETFTFGAHHG------VSANLT 215
Query: 206 FGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL----DVVKGG 261
FGCG L + T A GILG S+L QLA +F++CL D
Sbjct: 216 FGCGK-----LANGTIAEASGILGLSPGPLSMLKQLAIT-----KFSYCLTPFADRKTSP 265
Query: 262 GIF-AIGDV----VSPKVKTTPMVPNMP----HYNVILEEVEVGGNPLDLPTSLL----- 307
+F A+ D+ + KV+T P++ N P +Y V + + VG LD+P L
Sbjct: 266 VMFGAMADLGKYKTTGKVQTIPLLKN-PVEDIYYYVPMVGMSVGSKRLDVPQETLAIKPD 324
Query: 308 GTGDERGTIIDSGTTLAYL 326
GTG GT++DS TTLAYL
Sbjct: 325 GTG---GTVLDSATTLAYL 340
>gi|222637180|gb|EEE67312.1| hypothetical protein OsJ_24552 [Oryza sativa Japonica Group]
Length = 420
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 79/270 (29%), Positives = 116/270 (42%), Gaps = 55/270 (20%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGE 140
G Y + +GTP + V DTGSDL+W CA C++C + F P+ SST +
Sbjct: 84 GGYNMNISVGTPLLTFSVVADTGSDLIWTQCAPCTKCFQQ-----PAPPFQPASSSTFSK 138
Query: 141 IACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPL 200
+ C+ +FC+ N+ + G C Y YG G T+GY + +++ AS
Sbjct: 139 LPCTSSFCQFLPNSIRTCNATG--CVYNYKYGSG-YTAGYLATETLKVGDAS-------- 187
Query: 201 NSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKG 260
SV FGC G GQ + + F++CL
Sbjct: 188 FPSVAFGCSTEN----------------GLGQLDLGV-----------GRFSYCLRSGSA 220
Query: 261 GG----IF-AIGDVVSPKVKTTPMVPNMP----HYNVILEEVEVGGNPLDLPTSLLG--- 308
G +F ++ ++ V++TP V N +Y V L + VG L + TS G
Sbjct: 221 AGASPILFGSLANLTDGNVQSTPFVNNPAVHPSYYYVNLTGITVGETDLPVTTSTFGFTQ 280
Query: 309 TGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
G GTI+DSGTTL YL Y++V F
Sbjct: 281 NGLGGGTIVDSGTTLTYLAKDGYEMVKQAF 310
>gi|413922067|gb|AFW61999.1| hypothetical protein ZEAMMB73_694403, partial [Zea mays]
Length = 328
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 76/246 (30%), Positives = 105/246 (42%), Gaps = 51/246 (20%)
Query: 83 YFTKVGLG----TPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTS 138
Y T + LG +P V VDTGSDL WV C CS C + D LFDP+ S+T
Sbjct: 92 YVTTISLGGSSGSPAANLTVIVDTGSDLTWVQCKPCSACYAQRD-----PLFDPAGSATY 146
Query: 139 GEIACSDNFCRTTYNNRYPSCSPGV---------RCEYVVTYGDGSSTSGYFVRDIIQLN 189
+ C+ + C + R + +PG +C Y + YGDGS + G D + L
Sbjct: 147 AAVRCNASACADSL--RAATGTPGSCGSTGAGSEKCYYALAYGDGSFSRGVLATDTVALG 204
Query: 190 QASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAA-GNVR 248
AS +FGCG G G + G++G G+ SL+SQ A+ G V
Sbjct: 205 GAS--------LGGFVFGCGLSNRGLFGGTA-----GLMGLGRTELSLVSQTASRYGGV- 250
Query: 249 KEFAHCLDVVKGG---GIFAIG---DVVSPKVKTTPMV--------PNMPHYNVILEEVE 294
F++CL G G ++G D S TTP+ P Y + +
Sbjct: 251 --FSYCLPAATSGDASGSLSLGGGDDAASSYRNTTPVAYTRMIADPAQPPFYFLNVTGAA 308
Query: 295 VGGNPL 300
VGG L
Sbjct: 309 VGGTAL 314
>gi|224101015|ref|XP_002312106.1| predicted protein [Populus trichocarpa]
gi|222851926|gb|EEE89473.1| predicted protein [Populus trichocarpa]
Length = 440
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 70/265 (26%), Positives = 120/265 (45%), Gaps = 40/265 (15%)
Query: 90 GTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCR 149
GTP + +DTGS+L W++C K + ++F+P S T +I CS C
Sbjct: 74 GTPLQNITMVLDTGSELSWLHC--------KKEPNFN-SIFNPLASKTYTKIPCSSPTCE 124
Query: 150 T-TYNNRYP-SCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFG 207
T T + P SC P C ++++Y D SS G + ++ +G + +FG
Sbjct: 125 TRTRDLPLPVSCDPAKLCHFIISYADASSVEGNLAFETFRVGSVTG--------PATVFG 176
Query: 208 CGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIG 267
C + S DA G++G + + S ++Q+ ++F++C+ G+ +G
Sbjct: 177 CMDSGFSS-NSEEDAKTTGLMGMNRGSLSFVNQMGF-----RKFSYCISDRDSSGVLLLG 230
Query: 268 DVVSPKVKT---TPMVP---NMPH-----YNVILEEVEVGGNPLDLPTSLLGTGDERG-- 314
+ +K TP+V +P+ Y+V LE + V L LP S+ D G
Sbjct: 231 EASFSWLKPLNYTPLVEMSTPLPYFDRVAYSVQLEGIRVSDKVLSLPKSVF-VPDHTGAG 289
Query: 315 -TIIDSGTTLAYLPPMLYDLVLSQF 338
T++DSGT +L +Y + +F
Sbjct: 290 QTMVDSGTQFTFLLGPVYSALKQEF 314
>gi|302783200|ref|XP_002973373.1| hypothetical protein SELMODRAFT_98841 [Selaginella moellendorffii]
gi|300159126|gb|EFJ25747.1| hypothetical protein SELMODRAFT_98841 [Selaginella moellendorffii]
Length = 389
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 77/271 (28%), Positives = 120/271 (44%), Gaps = 29/271 (10%)
Query: 87 VGLGTPTDEYYVQVDTGSDLLWVNCAG-CS-RCPTKSDLGIKLTLFDPSKSSTSGEIACS 144
+ LGTP + S WV C+ C+ C T S LF P S++ ++ C
Sbjct: 3 LSLGTPPQPLNFTLAVDSGFSWVACSSSCAINCTTAS-------LFQPGLSTSHTKLPCG 55
Query: 145 DNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSV 204
C + ++ SC P C Y +YG S++G V DI ++ N K A +++
Sbjct: 56 SPSC-SAFSAVSTSCGPSSSCSYNTSYGTNFSSAGDLVSDIATMDSVR-NRKVA---ANL 110
Query: 205 IFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIF 264
GCG R SG L D + G +GF + N S + QL+A G R +F +CL G
Sbjct: 111 SLGCG-RDSGGLLELLDTS--GFVGFDKGNVSFMGQLSALG-YRSKFIYCLPSDTFRGKL 166
Query: 265 AIGDV------VSPKVKTTPMVPNMPH----YNVILEEVEVGGNPLDLPTSLLGTGDERG 314
IG+ +S + TPM+ N P Y + L + + N +P + G
Sbjct: 167 VIGNYKLRNASISSSMAYTPMITN-PQAAELYFINLSTISIDKNKFQVPIQGFLSNGTGG 225
Query: 315 TIIDSGTTLAYLPPMLYDLVLSQFRFWIASL 345
T+ID+ T L+YL Y ++ + + +L
Sbjct: 226 TVIDTTTFLSYLTSDFYTQLVQAIKNYTTNL 256
>gi|15231625|ref|NP_191467.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|15983376|gb|AAL11556.1|AF424562_1 AT3g59080/F17J16_130 [Arabidopsis thaliana]
gi|7529751|emb|CAB86936.1| putative protein [Arabidopsis thaliana]
gi|20466704|gb|AAM20669.1| putative protein [Arabidopsis thaliana]
gi|23198236|gb|AAN15645.1| putative protein [Arabidopsis thaliana]
gi|332646352|gb|AEE79873.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 535
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 82/318 (25%), Positives = 137/318 (43%), Gaps = 35/318 (11%)
Query: 35 VENKFKAGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTD 94
V K K + T + + + G+++A+++ +G +G YF V +G+P
Sbjct: 127 VSQKQKKNDKEVVTTTPVASSVEEQAGQLVATLE-----SGMTLGSGEYFMDVLVGSPPK 181
Query: 95 EYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCR--TTY 152
+ + +DTGSDL W+ C C C ++ +DP S++ I C+D C ++
Sbjct: 182 HFSLILDTGSDLNWIQCLPCYDCFQQNG-----AFYDPKASASYKNITCNDQRCNLVSSP 236
Query: 153 NNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLN-SSVIFGCGNR 211
+ P S C Y YGD S+T+G F + +N + + N +++FGCG+
Sbjct: 237 DPPMPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTNGGSSELYNVENMMFGCGHW 296
Query: 212 QSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL------DVVKGGGIF- 264
G + G+ S SQL + F++CL V IF
Sbjct: 297 NRGLFHGAAGLLGL-----GRGPLSFSSQLQSL--YGHSFSYCLVDRNSDTNVSSKLIFG 349
Query: 265 AIGDVVS-PKVKTTPMVPNMPH-----YNVILEEVEVGGNPLDLPTSLLGTGDE--RGTI 316
D++S P + T V + Y V ++ + V G L++P + GTI
Sbjct: 350 EDKDLLSHPNLNFTSFVAGKENLVDTFYYVQIKSILVAGEVLNIPEETWNISSDGAGGTI 409
Query: 317 IDSGTTLAYLPPMLYDLV 334
IDSGTTL+Y Y+ +
Sbjct: 410 IDSGTTLSYFAEPAYEFI 427
>gi|359494621|ref|XP_002265771.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 449
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 74/286 (25%), Positives = 128/286 (44%), Gaps = 32/286 (11%)
Query: 76 HPSA---TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSR---CPTKSDLGIKLT- 128
HP+A G Y +GTP+ ++ + DTGSDL W++C R C + I+
Sbjct: 73 HPAADYGIGQYSVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKR 132
Query: 129 LFDPSKSSTSGEIACSDNFCRTTYNNRYP--SC-SPGVRCEYVVTYGDGSSTSGYFVRDI 185
+F + SS+ I C + C+ + + +C +P C Y Y DGS+ G+F +
Sbjct: 133 VFHANLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANET 192
Query: 186 IQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAG 245
+ + G + L+ +V+ GC G + A DG++G G + S + AA
Sbjct: 193 VTVELKEG--RKMKLH-NVLIGCSESFQGQ----SFQAADGVMGLGYSKYSF--AIKAAE 243
Query: 246 NVRKEFAHCL-DVVKGGGIFAIGDVVSPKVKTTPMVPNMPH-----------YNVILEEV 293
+F++CL D + + S + K ++ NM + Y V + +
Sbjct: 244 KFGGKFSYCLVDHLSHKNVSNYLTFGSSRSKEA-LLNNMTYTELVLGMVNSFYAVNMMGI 302
Query: 294 EVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
+GG L +P+ + GTI+DSG++L +L Y V++ R
Sbjct: 303 SIGGAMLKIPSEVWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALR 348
>gi|356503843|ref|XP_003520712.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
Length = 474
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 85/300 (28%), Positives = 127/300 (42%), Gaps = 61/300 (20%)
Query: 75 GHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAG---CSRCPTKSDLGIKLTLFD 131
+P + G Y + LGTP +DTGS L+W C CS C + K+ F
Sbjct: 84 AYPKSYGGYSIDLNLGTPPQTSPFVLDTGSSLVWFPCTSRYLCSHCNFPNIDTTKIPTFI 143
Query: 132 PSKSSTSGEIACSDNFCRTTYNN----RYPSCSP-----GVRCE-YVVTYGDGSSTSGYF 181
P SST+ + C + C + + R P C P + C Y++ YG G ST+G+
Sbjct: 144 PKNSSTAKLLGCRNPKCGYIFGSDVQFRCPQCKPESQNCSLTCPAYIIQYGLG-STAGFL 202
Query: 182 VRDIIQLNQASGNLKTAPLNSSVIFGC---GNRQSGDLGSSTDAAVDGILGFGQANSSLL 238
+ D + KT P + GC RQ GI GFG+ SL
Sbjct: 203 LLDNLNFPG-----KTVP---QFLVGCSILSIRQP-----------SGIAGFGRGQESLP 243
Query: 239 SQLAAAGNVRKEFAHCL------DVVKGGG----IFAIGDVVSPKVKTTPMVPN------ 282
SQ+ K F++CL D + I + GD + + TP N
Sbjct: 244 SQMNL-----KRFSYCLVSHRFDDTPQSSDLVLQISSTGDTKTNGLSYTPFRSNPSTNNP 298
Query: 283 --MPHYNVILEEVEVGGNPLDLPTSLLGTGDE--RGTIIDSGTTLAYLPPMLYDLVLSQF 338
+Y + L +V VGG + +P + L G + GTI+DSG+T ++ +Y+LV +F
Sbjct: 299 AFKEYYYLTLRKVIVGGKDVKIPYTFLEPGSDGNGGTIVDSGSTFTFMERPVYNLVAQEF 358
>gi|356546378|ref|XP_003541603.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 439
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 80/297 (26%), Positives = 126/297 (42%), Gaps = 28/297 (9%)
Query: 46 ERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSD 105
+R +A+++ R + A + + + ++ G Y + +G+P + VDTGSD
Sbjct: 54 QRVANAVRRSINRGNHFKKAFVSTDSAESTVVASQGEYLMRYSVGSPPFQVLGIVDTGSD 113
Query: 106 LLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRC 165
+LW+ C C C ++ +FDPSKS T + CS N C + N +CS C
Sbjct: 114 ILWLQCEPCEDCYKQT-----TPIFDPSKSKTYKTLPCSSNTCESLRNT---ACSSDNVC 165
Query: 166 EYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVD 225
EY + YGDGS + G + + L G+ P + GCG+ G V
Sbjct: 166 EYSIDYGDGSHSDGDLSVETLTLGSTDGSSVHFP---KTVIGCGHNNGGTFQEEGSGIV- 221
Query: 226 GILGFGQANSSLLSQLAAAGNVRKEFAHCLDVV----KGGGIFAIGD--VVSPK-VKTTP 278
G + + ++ +F++CL + GD VVS + +TP
Sbjct: 222 -----GLGGGPVSLISQLSSSIGGKFSYCLAPIFSESNSSSKLNFGDAAVVSGRGTVSTP 276
Query: 279 MVP--NMPHYNVILEEVEVGGNPLDL--PTSLLGTGDERGTIIDSGTTLAYLPPMLY 331
+ P Y + LE VG N ++ +S + IIDSGTTL LP Y
Sbjct: 277 LDPLNGQVFYFLTLEAFSVGDNRIEFSGSSSSGSGSGDGNIIIDSGTTLTLLPQEDY 333
>gi|302784853|ref|XP_002974198.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
gi|300157796|gb|EFJ24420.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
Length = 359
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 73/270 (27%), Positives = 112/270 (41%), Gaps = 30/270 (11%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGE 140
G Y ++ +GTP +DTGSDL+W+ C C C T+F SS+ +
Sbjct: 3 GEYMMELSIGTPPQLIPAMIDTGSDLVWLKCDNCDHCDLDHH---GETIFFSDASSSYKK 59
Query: 141 IACSDNFCRTTYNNRY-PSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAP 199
+ C+ C + P C C+Y YGDGS TSG D I
Sbjct: 60 LPCNSTHCSGMSSAGIGPRCEE--TCKYKYEYGDGSRTSGDVGSDRISFRSHGAGEDHRS 117
Query: 200 LNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL---- 255
+FGC + GD + G++G GQ + SL+ QL + +F++CL
Sbjct: 118 FFDGFLFGCARKLKGDWNFT-----QGLIGLGQKSHSLIQQL--GDKLGYKFSYCLVSYD 170
Query: 256 DVVKGGGIFAIGDVVSPK---VKTTPMVP----NMPHYNVILEEVEVGGNPLDLPTSLLG 308
+G + + V +TP++ + Y V L+ + +GG P+ + G
Sbjct: 171 SPPSAKSFLFLGSSAALRGHDVVSTPILHGDHLDQTLYYVDLQSITIGGVPVVVYDKESG 230
Query: 309 TGDERG------TIIDSGTTLAYLPPMLYD 332
G T+IDSGTT L P +Y+
Sbjct: 231 HNTSVGPFLANKTVIDSGTTYTLLTPPVYE 260
>gi|413944387|gb|AFW77036.1| hypothetical protein ZEAMMB73_461996 [Zea mays]
Length = 472
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 87/272 (31%), Positives = 122/272 (44%), Gaps = 39/272 (14%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGC--SRCPTKSDLGIKLTLFDPSKSSTSGE 140
Y +G+GTP + V +DTGSDL WV C C S C + D L+DP+ SST
Sbjct: 127 YVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCNSSSCYPQKD-----PLYDPTASSTYAP 181
Query: 141 IACSDNFCR----TTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLK 196
+ C C+ Y++ + S C+Y + YG+ +T G + + + L+
Sbjct: 182 VPCDSKACKDLVPDAYDHGCTNSSGTSLCQYGIEYGNRDTTVGVYSTETLTLSPQVSVKD 241
Query: 197 TAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLD 256
FGCG Q G T DG+LG G A SL+SQ A F++CL
Sbjct: 242 FG-------FGCGLVQQG-----TFDLFDGLLGLGGAPESLVSQTAE--TYGGAFSYCLP 287
Query: 257 VVKG-GGIFAIGDVVSPKVKT----TPMVPNMPH----YNVILEEVEVGGNPLDLPTSLL 307
G A+G + TP+ ++P Y V L V VGG PLD+P ++L
Sbjct: 288 PGNSTTGFLALGAPTNNNDTAGFLFTPLH-SLPEQATFYLVNLTGVSVGGKPLDIPPTVL 346
Query: 308 GTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
G IIDSGT + LP Y + + FR
Sbjct: 347 ----SGGMIIDSGTIITGLPDTAYSALRTAFR 374
>gi|88174579|gb|ABD39364.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
Group]
gi|88174585|gb|ABD39367.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
Group]
gi|88174595|gb|ABD39372.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
gi|88174599|gb|ABD39374.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
gi|88174607|gb|ABD39378.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 82/278 (29%), Positives = 125/278 (44%), Gaps = 52/278 (18%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y VGLGTP V++DTGS WV C C C T F S+S+T +++
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNP------RTFLQSRSTTCAKVS 53
Query: 143 C---------SDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASG 193
C SD C+ + N YP C + V+Y DGS++ G +D + +
Sbjct: 54 CGTSMCLLGGSDPHCQDSEN--YPD------CPFRVSYQDGSASYGILYQDTLTFS---- 101
Query: 194 NLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAH 253
+++ P S FGC N S G++ VDG+LG G S+L Q + + F++
Sbjct: 102 DVQKIPGFS---FGC-NMDS--FGANEFGNVDGLLGMGAGPMSVLKQSSPTFDC---FSY 152
Query: 254 CLDVVKG--------GGIFAIGDVVS-PKVKTTPMV---PNMPHYNVILEEVEVGGNPLD 301
CL + K G F++G V + V+ T MV N + V L + V G L
Sbjct: 153 CLPLQKSERGFFSKTTGYFSLGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLG 212
Query: 302 LPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
L S+ +G + DSG+ L+Y+P ++ + R
Sbjct: 213 LSPSVFS---RKGVVFDSGSELSYIPDRALSVLSQRIR 247
>gi|297803034|ref|XP_002869401.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
gi|297315237|gb|EFH45660.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
Length = 438
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 76/293 (25%), Positives = 127/293 (43%), Gaps = 39/293 (13%)
Query: 45 RERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGS 104
+E ++ L+ + G ++A + + P + + +G+P + +DT S
Sbjct: 52 KEASVERLEYLKAKATGDIIAHLSPNV-----PIIPQAFLVNISIGSPPVTQLLHMDTAS 106
Query: 105 DLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVR 164
DLLW+ C C C +S L +FDPS+S T + CRT+ PS +
Sbjct: 107 DLLWLQCRPCINCYAQS-----LPIFDPSRSYTH-----RNESCRTS-QYSMPSLRFNAK 155
Query: 165 ---CEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTD 221
CEY + Y DG+ + G ++++ N +A L+ V+FGCG+ G+ T
Sbjct: 156 TRSCEYSMRYMDGTGSKGILAKEMLMFNTIYDESSSAALH-DVVFGCGHDNYGEPLVGT- 213
Query: 222 AAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL----DVVKGGGIFAIGDVVSPKV-KT 276
GILG G SL+ + +F++C D + +GD + + T
Sbjct: 214 ----GILGLGYGEFSLVHRFGT------KFSYCFGSLDDPSYPHNVLVLGDDGANILGDT 263
Query: 277 TPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDER---GTIIDSGTTLAYL 326
TP+ Y V +E + V G L + + + GTIID+G +L L
Sbjct: 264 TPLEIYNGFYYVTIEAISVDGIILPIDPWVFNRNHQTGLGGTIIDTGNSLTSL 316
>gi|88174561|gb|ABD39355.1| chloroplast nucleoid DNA-binding protein [Oryza longistaminata]
Length = 321
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 81/278 (29%), Positives = 126/278 (45%), Gaps = 52/278 (18%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y VGLGTP+ +++DTGS WV C C C T F S+S+T +++
Sbjct: 1 YVISVGLGTPSKTQILEIDTGSSTSWVFCE-CDGCHTNP------RTFLQSRSTTCAKVS 53
Query: 143 C---------SDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASG 193
C SD C+ + N YP C + V+Y DGS++ G +D + +
Sbjct: 54 CGTSMCLLGGSDPHCQDSEN--YPD------CPFRVSYQDGSASYGILYQDTLTFS---- 101
Query: 194 NLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAH 253
+++ P S FGC N S G++ VDG+LG G S+L Q + + F++
Sbjct: 102 DVQKIP---SFSFGC-NMDS--FGANEFGNVDGLLGMGAGPMSVLKQSSPTFD---GFSY 152
Query: 254 CLDV--------VKGGGIFAIGDVVS-PKVKTTPMV---PNMPHYNVILEEVEVGGNPLD 301
CL + K G F++G V + V+ T MV N + V L + V G L
Sbjct: 153 CLPLQMSERGFFSKTTGYFSLGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLG 212
Query: 302 LPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
L S+ +G + DSG+ L+Y+P ++ + R
Sbjct: 213 LSPSIFS---RKGVVFDSGSELSYIPDRALSVLSQRIR 247
>gi|357515189|ref|XP_003627883.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
gi|355521905|gb|AET02359.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
Length = 434
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 86/267 (32%), Positives = 122/267 (45%), Gaps = 38/267 (14%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y K GTP + +DT SD W+ C+GC C T F P KS++ ++
Sbjct: 97 YIVKAKFGTPPQTLLLALDTSSDAAWIPCSGCVGCSTSKP-------FAPIKSTSFRNVS 149
Query: 143 CSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNS 202
C C+ N P+C G C + TYG SS + V+D + L T P+
Sbjct: 150 CGSPHCKQVPN---PTCG-GSACAFNFTYG-SSSIAASVVQDTL-------TLATDPI-P 196
Query: 203 SVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKE-FAHCLDVVKG- 260
FGC N+ +G + A G+LG G+ SLLSQ + N+ K F++CL K
Sbjct: 197 GYTFGCVNKTTG-----SSAPQQGLLGLGRGPLSLLSQ---SQNLYKSTFSYCLPSFKSI 248
Query: 261 --GGIFAIGDVVSPK-VKTTPMVPNMPH---YNVILEEVEVGGNPLDLPTSLLGTGDER- 313
G +G V PK +K TP++ N Y V L ++VG +D+P + L
Sbjct: 249 NFSGSLRLGPVYQPKRIKYTPLLRNPRRSSLYYVNLVAIKVGRKIVDIPPAALAFNPTTG 308
Query: 314 -GTIIDSGTTLAYLPPMLYDLVLSQFR 339
GTI DSGT L +Y V ++FR
Sbjct: 309 AGTIFDSGTVFTRLAEPVYTAVRNEFR 335
>gi|413924529|gb|AFW64461.1| hypothetical protein ZEAMMB73_591827 [Zea mays]
Length = 217
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 56/177 (31%), Positives = 89/177 (50%), Gaps = 21/177 (11%)
Query: 40 KAGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGL---YFTKVGLGTPTDEY 96
+ GE R AL + D +R R +A + L GG+ L Y+ V +GTP +
Sbjct: 53 RGSGEYYR---ALVRSDIQRQKRRLAVLSLSKGGSTFSPGNDLGWLYYAWVDVGTPATSF 109
Query: 97 YVQVDTGSDLLWVNCAGCSRCPT----KSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTY 152
V +DTGSDL WV C C +C + +L L ++ P++S+TS + CS C++
Sbjct: 110 LVALDTGSDLFWVPC-DCIQCAPLSGYRGNLDRDLRIYRPAESTTSRHLPCSHELCQSV- 167
Query: 153 NNRYPSCS-PGVRCEYVVTY-GDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFG 207
P C+ P C Y + Y + +++SG + D + LN ++ P+N+SVI G
Sbjct: 168 ----PGCTNPKQPCPYNIDYFSENTTSSGLLIEDTLHLNYREDHV---PVNASVIIG 217
>gi|88174558|gb|ABD39354.1| chloroplast nucleoid DNA-binding protein [Oryza meridionalis]
Length = 321
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 86/284 (30%), Positives = 126/284 (44%), Gaps = 56/284 (19%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y VGLGTP V++DTGS WV C C C T F S+S+T +++
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNP------RTFLQSRSTTCAKVS 53
Query: 143 C---------SDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASG 193
C SD C+ + N YP C + V+Y DGS++ G +D + +
Sbjct: 54 CGTSMCLLGGSDPHCQDSEN--YPD------CPFRVSYQDGSASYGILYQDTLTFS---- 101
Query: 194 NLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAH 253
+++ P FGC N S G++ VDG+LG G S+L Q + + F++
Sbjct: 102 DVQKIP---GFTFGC-NLDS--FGANEFGNVDGLLGMGAGPMSVLKQSSPTFD---GFSY 152
Query: 254 CLDV--------VKGGGIFAIGDVVS-PKVKTTPMV---PNMPHYNVILEEVEVGGNPLD 301
CL + K G F++G V + V+ T MV N + V L + V G L
Sbjct: 153 CLPLQMSERGFFSKTTGYFSLGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLG 212
Query: 302 LPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFRFWIASL 345
L S+ +G + DSG+ L+Y+P D LS R I L
Sbjct: 213 LSPSVFS---RKGVVFDSGSELSYIP----DRALSVLRQRIREL 249
>gi|388502484|gb|AFK39308.1| unknown [Medicago truncatula]
Length = 425
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 85/266 (31%), Positives = 120/266 (45%), Gaps = 37/266 (13%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y + +GTP + +DT +D W+ C C C + TLF P KS+T ++
Sbjct: 93 YIVRAKIGTPPQTLLLAMDTSNDAAWIPCTACDGCAS--------TLFAPEKSTTFKNVS 144
Query: 143 CSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNS 202
C+ C+ N P C R + +TYG SS + V+D I L T P+
Sbjct: 145 CAAPECKQVPN---PGCGVSSR-NFNLTYG-SSSIAANLVQDTI-------TLATDPV-P 191
Query: 203 SVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKG-- 260
S FGC ++ +G T A G+LG G+ SLLSQ + F++CL K
Sbjct: 192 SYTFGCVSKTTG-----TSAPPQGLLGLGRGPLSLLSQ--TQNLYQSTFSYCLPSFKSLN 244
Query: 261 -GGIFAIGDVVSPK-VKTTPMVPNMPH---YNVILEEVEVGGNPLDLPTSLLGTGDER-- 313
G +G V PK +K TP++ N Y V LE + VG +D+P + L
Sbjct: 245 FSGSLRLGPVAQPKRIKYTPLLKNPRRSSLYYVNLEAIRVGRKVVDIPPAALAFNPTTGA 304
Query: 314 GTIIDSGTTLAYLPPMLYDLVLSQFR 339
GTI DSGT L +Y V +FR
Sbjct: 305 GTIFDSGTVFTRLVAPVYVAVRDEFR 330
>gi|297736090|emb|CBI24128.3| unnamed protein product [Vitis vinifera]
Length = 378
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 74/286 (25%), Positives = 128/286 (44%), Gaps = 32/286 (11%)
Query: 76 HPSA---TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSR---CPTKSDLGIKLT- 128
HP+A G Y +GTP+ ++ + DTGSDL W++C R C + I+
Sbjct: 2 HPAADYGIGQYSVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKR 61
Query: 129 LFDPSKSSTSGEIACSDNFCRTTYNNRYP--SC-SPGVRCEYVVTYGDGSSTSGYFVRDI 185
+F + SS+ I C + C+ + + +C +P C Y Y DGS+ G+F +
Sbjct: 62 VFHANLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANET 121
Query: 186 IQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAG 245
+ + G + L+ +V+ GC G + A DG++G G + S + AA
Sbjct: 122 VTVELKEG--RKMKLH-NVLIGCSESFQGQ----SFQAADGVMGLGYSKYSF--AIKAAE 172
Query: 246 NVRKEFAHCL-DVVKGGGIFAIGDVVSPKVKTTPMVPNMPH-----------YNVILEEV 293
+F++CL D + + S + K ++ NM + Y V + +
Sbjct: 173 KFGGKFSYCLVDHLSHKNVSNYLTFGSSRSKEA-LLNNMTYTELVLGMVNSFYAVNMMGI 231
Query: 294 EVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
+GG L +P+ + GTI+DSG++L +L Y V++ R
Sbjct: 232 SIGGAMLKIPSEVWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALR 277
>gi|359492937|ref|XP_002283889.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
Length = 439
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 85/270 (31%), Positives = 119/270 (44%), Gaps = 37/270 (13%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGE 140
G Y +V LGTP ++ +DT D WV CA C+ C + + F P+ SST
Sbjct: 97 GNYVVRVKLGTPGQLMFMVLDTSRDAAWVPCADCAGCSSPT--------FSPNTSSTYAS 148
Query: 141 IACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPL 200
+ CS C P+ C + TYG SS S +D + L + T P
Sbjct: 149 LQCSVPQCTQVRGLSCPTTGTAA-CFFNQTYGGDSSFSAMLSQDSLGL-----AVDTLP- 201
Query: 201 NSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRK-EFAHCLDVVK 259
S FGC N SG + G+LG G+ SLLSQ +G++ F++C K
Sbjct: 202 --SYSFGCVNAVSG-----STLPPQGLLGLGRGPMSLLSQ---SGSLYSGVFSYCFPSFK 251
Query: 260 G---GGIFAIGDVVSPK-VKTTPMVPNMPH----YNVILEEVEVGGNPLDLPTSLLGTGD 311
G +G + PK ++TTP++ N PH Y V L V VG + + LL
Sbjct: 252 SYYFSGSLRLGPLGQPKNIRTTPLLRN-PHRPTLYYVNLTGVSVGRVLVPVAPELLAFDP 310
Query: 312 ER--GTIIDSGTTLAYLPPMLYDLVLSQFR 339
GTIIDSGT + +Y + +FR
Sbjct: 311 NTGAGTIIDSGTVITRFVEPVYAAIRDEFR 340
>gi|359476195|ref|XP_002268758.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
gi|296082174|emb|CBI21179.3| unnamed protein product [Vitis vinifera]
Length = 460
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 77/268 (28%), Positives = 120/268 (44%), Gaps = 50/268 (18%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCS--RCPTKSDLGIKLTLFDPSKSSTS 138
GL+ VG GTP ++ + +DTGSD W+ C CS C K F+PS SS+
Sbjct: 127 GLFLVNVGFGTPQQKFNLIIDTGSDTTWIQCNSCSLGNCHNKKT-------FNPSLSSSY 179
Query: 139 GEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTA 198
+ C P Y + Y D S + G FV D + L
Sbjct: 180 SNRS----------------CIPSTDTNYTMKYEDNSYSKGVFVCDEVTLK--------P 215
Query: 199 PLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANS-SLLSQLAAAGNVRKEFAHCL-- 255
+ FGCG+ G+ G+++ G+LG + SL+SQ A+ +K+F++C
Sbjct: 216 DVFPKFQFGCGDSGGGEFGTAS-----GVLGLAKGEQYSLISQTAS--KFKKKFSYCFPP 268
Query: 256 -DVVKGGGIFAIGDV-VSPKVKTTPMV--PNMPHYNVILEEVEVGGNPLDLPTSLLGTGD 311
+ G +F + SP +K T ++ P+ Y V L + V L++ +SL +
Sbjct: 269 KEHTLGSLLFGEKAISASPSLKFTQLLNPPSGLGYFVELIGISVAKKRLNVSSSLFAS-- 326
Query: 312 ERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
GTIIDSGT + LP Y+ + + F+
Sbjct: 327 -PGTIIDSGTVITRLPTAAYEALRTAFQ 353
>gi|449486856|ref|XP_004157423.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 491
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 88/304 (28%), Positives = 133/304 (43%), Gaps = 40/304 (13%)
Query: 46 ERTLSALKQHDTRRHGRMMASIDLELGG-NGHPSATGLYFTKVGLGTPTDEYYVQVDTGS 104
E LS LK+ D + DL +G +G YF++VG+G P +Y+ +DTGS
Sbjct: 117 EFALSELKRSDLEPLKTEILPEDLSTPIISGTSQGSGEYFSRVGVGQPAKPFYMVLDTGS 176
Query: 105 DLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVR 164
D+ W+ C C+ C ++D +FDP SS+ + C C+ S +
Sbjct: 177 DINWLQCQPCTDCYQQTD-----PIFDPRSSSSFASLPCESQQCQALET----SGCRASK 227
Query: 165 CEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAV 224
C Y V+YGDGS T G FV + + GN + + + V GCG+ G
Sbjct: 228 CLYQVSYGDGSFTVGEFVTETLTF----GN---SGMINDVAVGCGHDNEGLF-----VGS 275
Query: 225 DGILGFGQANSSLLSQLAAAGNVRKEFAHCL--------DVVKGGGIFAIGDVVSPKVKT 276
G+LG G SL SQ+ A+ F++CL ++ V +P +K+
Sbjct: 276 AGLLGLGGGPLSLTSQMKASS-----FSYCLVDRDSSSSSDLEFNSAAPSDSVNAPLLKS 330
Query: 277 TPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDE--RGTIIDSGTTLAYLPPMLYDLV 334
+ Y V L + VGG L +P +L D G I+DSGT + L Y+ +
Sbjct: 331 GKV---DTFYYVGLTGMSVGGQLLSIPPNLFQMDDSGYGGIIVDSGTAITRLQTQAYNTL 387
Query: 335 LSQF 338
F
Sbjct: 388 RDAF 391
>gi|449519146|ref|XP_004166596.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 752
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 82/300 (27%), Positives = 129/300 (43%), Gaps = 42/300 (14%)
Query: 61 GRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTK 120
G++MA+++ +G +G YF V +G+P + + +DTGSDL W+ C C C +
Sbjct: 179 GQLMATLE-----SGVSLGSGEYFIDVFIGSPPKHFSLILDTGSDLNWIQCVPCFDCFEQ 233
Query: 121 SDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPS-CSPGVR-CEYVVTYGDGSSTS 178
+ +DP S + I C+D C+ + P C + C Y YGD S+T+
Sbjct: 234 NG-----PYYDPKDSISFRNITCNDPRCQLVSSPDPPRPCKFETQSCPYFYWYGDSSNTT 288
Query: 179 GYFVRDIIQLNQASGNLKTAPLN--SSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSS 236
G F + +N S + +V+FGCG+ G + G+ S
Sbjct: 289 GDFALETFTVNLTSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGL-----GRGPLS 343
Query: 237 LLSQLAAAGNVRKEFAHCL------DVVKGGGIFAIGD--VVSPKVKTTPMV-----PNM 283
SQL + F++CL V IF + P++ T ++ P
Sbjct: 344 FSSQLQSLYG--HSFSYCLVDRDSDTSVSSKLIFGEDKDLLTHPELNFTSLIAGKENPVD 401
Query: 284 PHYNVILEEVEVGGNPLDLPT-----SLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
Y + ++ + VGG L +P S G G GTIIDSGTTL+Y Y ++ F
Sbjct: 402 TFYYLQIKSIFVGGEKLQIPEENWNLSADGAG---GTIIDSGTTLSYFSDPAYRIIKEAF 458
>gi|18412482|ref|NP_565219.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|19699359|gb|AAL91289.1| At1g79720/F19K16_30 [Arabidopsis thaliana]
gi|26450464|dbj|BAC42346.1| unknown protein [Arabidopsis thaliana]
gi|115646741|gb|ABJ17101.1| At1g79720 [Arabidopsis thaliana]
gi|332198170|gb|AEE36291.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 484
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 85/256 (33%), Positives = 120/256 (46%), Gaps = 43/256 (16%)
Query: 100 VDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCR----TTYNNR 155
VDTGSDL WV C C C + L+DPS SS+ + C+ + C+ T N+
Sbjct: 150 VDTGSDLTWVQCQPCRSCYNQQG-----PLYDPSVSSSYKTVFCNSSTCQDLVAATSNSG 204
Query: 156 YPSCSPGV---RCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQ 212
+ GV CEYVV+YGDGS T G + I L G+ K + +FGCG
Sbjct: 205 PCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILL----GDTKL----ENFVFGCGRNN 256
Query: 213 SGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGG--GIFAIGD-- 268
G G S+ G+++ SL+SQ N F++CL ++ G G + G+
Sbjct: 257 KGLFGGSSGLMGL-----GRSSVSLVSQTLKTFN--GVFSYCLPSLEDGASGSLSFGNDS 309
Query: 269 ---VVSPKVKTTPMVPN---MPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTT 322
S V TP+V N Y + L +GG ++L +S G RG +IDSGT
Sbjct: 310 SVYTNSTSVSYTPLVQNPQLRSFYILNLTGASIGG--VELKSSSFG----RGILIDSGTV 363
Query: 323 LAYLPPMLYDLVLSQF 338
+ LPP +Y V +F
Sbjct: 364 ITRLPPSIYKAVKIEF 379
>gi|449453872|ref|XP_004144680.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
CELL 1-like [Cucumis sativus]
Length = 757
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 82/300 (27%), Positives = 129/300 (43%), Gaps = 42/300 (14%)
Query: 61 GRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTK 120
G++MA+++ +G +G YF V +G+P + + +DTGSDL W+ C C C +
Sbjct: 179 GQLMATLE-----SGVSLGSGEYFIDVFIGSPPKHFSLILDTGSDLNWIQCVPCFDCFEQ 233
Query: 121 SDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPS-CSPGVR-CEYVVTYGDGSSTS 178
+ +DP S + I C+D C+ + P C + C Y YGD S+T+
Sbjct: 234 NG-----PYYDPKDSISFRNITCNDPRCQLVSSPDPPRPCKFETQSCPYFYWYGDSSNTT 288
Query: 179 GYFVRDIIQLNQASGNLKTAPLN--SSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSS 236
G F + +N S + +V+FGCG+ G + G+ S
Sbjct: 289 GDFALETFTVNLTSSTTGKSEFRRVENVMFGCGHWNRGLFHGAAGLLGL-----GRGPLS 343
Query: 237 LLSQLAAAGNVRKEFAHCL------DVVKGGGIFAIGD--VVSPKVKTTPMV-----PNM 283
SQL + F++CL V IF + P++ T ++ P
Sbjct: 344 FSSQLQSLYG--HSFSYCLVDRDSDTSVSSKLIFGEDKDLLTHPELNFTSLIAGKENPVD 401
Query: 284 PHYNVILEEVEVGGNPLDLPT-----SLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
Y + ++ + VGG L +P S G G GTIIDSGTTL+Y Y ++ F
Sbjct: 402 TFYYLQIKSIFVGGEKLQIPEENWNLSADGAG---GTIIDSGTTLSYFSDPAYRIIKEAF 458
>gi|225427558|ref|XP_002266614.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
vinifera]
Length = 444
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 78/277 (28%), Positives = 108/277 (38%), Gaps = 38/277 (13%)
Query: 78 SATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSST 137
S G Y + LGTP DTGSDL+W C C C + + LFDP +S T
Sbjct: 89 SGGGAYLMNISLGTPPVPMLGIADTGSDLIWRQCLPCPNCYEQVE-----PLFDPKESET 143
Query: 138 SGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKT 197
+ C + FC+ + SC C Y +YGD S T G D + + G+ +
Sbjct: 144 YKTLDCDNEFCQDL--GQQGSCDDDNTCTYSYSYGDRSYTRGDLSSDTLTIGSTEGDPAS 201
Query: 198 APLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL-- 255
P + FGCG+ G + G L S++ +F++CL
Sbjct: 202 FP---GIAFGCGHDNGGTFNEKDGGLIGLGGGPLSLVMQLSSEVGG------QFSYCLVP 252
Query: 256 -----------DVVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLP- 303
+ K G + G V +P +K TP Y + LE + VG +
Sbjct: 253 LSSDSTVSSKINFGKSGVVSGSGTVSTPLIKGTPDT----FYYLTLEGLSVGSETVAFKG 308
Query: 304 ----TSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLS 336
S +E IIDSGTTL LP Y V S
Sbjct: 309 FSENKSSPAAVEEGNIIIDSGTTLTLLPQDFYTDVES 345
>gi|21595063|gb|AAM66069.1| putative aspartyl protease [Arabidopsis thaliana]
Length = 484
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 85/256 (33%), Positives = 120/256 (46%), Gaps = 43/256 (16%)
Query: 100 VDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCR----TTYNNR 155
VDTGSDL WV C C C + L+DPS SS+ + C+ + C+ T N+
Sbjct: 150 VDTGSDLTWVQCQPCRSCYNQQG-----PLYDPSVSSSYKTVFCNSSTCQDLVAATSNSG 204
Query: 156 YPSCSPGV---RCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQ 212
+ GV CEYVV+YGDGS T G + I L G+ K + +FGCG
Sbjct: 205 PCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILL----GDTKL----ENFVFGCGRNN 256
Query: 213 SGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGG--GIFAIGD-- 268
G G S+ G+++ SL+SQ N F++CL ++ G G + G+
Sbjct: 257 KGLFGGSSGLMGL-----GRSSVSLVSQTLKTFN--GVFSYCLPSLEDGASGSLSFGNDS 309
Query: 269 ---VVSPKVKTTPMVPN---MPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTT 322
S V TP+V N Y + L +GG ++L +S G RG +IDSGT
Sbjct: 310 SVYTNSTSVSYTPLVQNPQLRSFYILNLTGASIGG--VELKSSSFG----RGILIDSGTV 363
Query: 323 LAYLPPMLYDLVLSQF 338
+ LPP +Y V +F
Sbjct: 364 ITRLPPSIYKAVKIEF 379
>gi|7715602|gb|AAF68120.1|AC010793_15 F20B17.14 [Arabidopsis thaliana]
gi|12324588|gb|AAG52249.1|AC011717_17 putative aspartyl protease; 105611-106921 [Arabidopsis thaliana]
Length = 436
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 85/256 (33%), Positives = 120/256 (46%), Gaps = 43/256 (16%)
Query: 100 VDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCR----TTYNNR 155
VDTGSDL WV C C C + L+DPS SS+ + C+ + C+ T N+
Sbjct: 102 VDTGSDLTWVQCQPCRSCYNQQG-----PLYDPSVSSSYKTVFCNSSTCQDLVAATSNSG 156
Query: 156 YPSCSPGV---RCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQ 212
+ GV CEYVV+YGDGS T G + I L G+ K + +FGCG
Sbjct: 157 PCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILL----GDTKL----ENFVFGCGRNN 208
Query: 213 SGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGG--GIFAIGD-- 268
G G S+ G+++ SL+SQ N F++CL ++ G G + G+
Sbjct: 209 KGLFGGSSGLMGL-----GRSSVSLVSQTLKTFN--GVFSYCLPSLEDGASGSLSFGNDS 261
Query: 269 ---VVSPKVKTTPMVPN---MPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTT 322
S V TP+V N Y + L +GG ++L +S G RG +IDSGT
Sbjct: 262 SVYTNSTSVSYTPLVQNPQLRSFYILNLTGASIGG--VELKSSSFG----RGILIDSGTV 315
Query: 323 LAYLPPMLYDLVLSQF 338
+ LPP +Y V +F
Sbjct: 316 ITRLPPSIYKAVKIEF 331
>gi|67633548|gb|AAY78698.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 392
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 75/274 (27%), Positives = 112/274 (40%), Gaps = 48/274 (17%)
Query: 82 LYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEI 141
+Y K+ +GTP E ++DTGSDL+W C C+ C ++ +FDPS SST E
Sbjct: 60 IYLMKLQVGTPPFEIEAEIDTGSDLIWTQCMPCTNCYSQ-----YAPIFDPSNSSTFKEK 114
Query: 142 ACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLN 201
C+ N C Y + Y D + + G + + ++ SG P
Sbjct: 115 RCNGN-----------------SCHYKIIYADTTYSKGTLATETVTIHSTSGEPFVMP-- 155
Query: 202 SSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL------ 255
GCG+ S G++G SSL++Q+ G ++C
Sbjct: 156 -ETTIGCGHNSSW-----FKPTFSGMVGLSWGPSSLITQM--GGEYPGLMSYCFASQGTS 207
Query: 256 DVVKGGGIFAIGD-VVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGT---GD 311
+ G GD VVS + T P + + N L+ V VG D +GT
Sbjct: 208 KINFGTNAIVAGDGVVSTTMFLTTAKPGLYYLN--LDAVSVG----DTHVETMGTTFHAL 261
Query: 312 ERGTIIDSGTTLAYLPPMLYDLVLSQFRFWIASL 345
E IIDSGTTL Y P +LV ++ ++
Sbjct: 262 EGNIIIDSGTTLTYFPVSYCNLVREAVDHYVTAV 295
>gi|15226317|ref|NP_180370.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|4063755|gb|AAC98463.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197953|gb|AAM15327.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330252977|gb|AEC08071.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 392
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 75/274 (27%), Positives = 112/274 (40%), Gaps = 48/274 (17%)
Query: 82 LYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEI 141
+Y K+ +GTP E ++DTGSDL+W C C+ C ++ +FDPS SST E
Sbjct: 60 IYLMKLQVGTPPFEIEAEIDTGSDLIWTQCMPCTNCYSQ-----YAPIFDPSNSSTFKEK 114
Query: 142 ACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLN 201
C+ N C Y + Y D + + G + + ++ SG P
Sbjct: 115 RCNGN-----------------SCHYKIIYADTTYSKGTLATETVTIHSTSGEPFVMP-- 155
Query: 202 SSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL------ 255
GCG+ S G++G SSL++Q+ G ++C
Sbjct: 156 -ETTIGCGHNSSW-----FKPTFSGMVGLSWGPSSLITQM--GGEYPGLMSYCFASQGTS 207
Query: 256 DVVKGGGIFAIGD-VVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGT---GD 311
+ G GD VVS + T P + + N L+ V VG D +GT
Sbjct: 208 KINFGTNAIVAGDGVVSTTMFLTTAKPGLYYLN--LDAVSVG----DTHVETMGTTFHAL 261
Query: 312 ERGTIIDSGTTLAYLPPMLYDLVLSQFRFWIASL 345
E IIDSGTTL Y P +LV ++ ++
Sbjct: 262 EGNIIIDSGTTLTYFPVSYCNLVREAVDHYVTAV 295
>gi|414877221|tpg|DAA54352.1| TPA: hypothetical protein ZEAMMB73_214154 [Zea mays]
Length = 447
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 77/264 (29%), Positives = 121/264 (45%), Gaps = 46/264 (17%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y ++ +GTP + DTGSDL W C C C + ++D + SS+ +
Sbjct: 93 YLMELAIGTPPVPFVALADTGSDLTWTQCQPCKLCFPQ-----DTPIYDTAVSSSFSPVP 147
Query: 143 CSDNFCRTTYNNR--YPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPL 200
C+ C +++R S SP C Y YGDG+ ++G + + A G
Sbjct: 148 CASATCLPIWSSRNCTASSSP---CRYRYAYGDGAYSAGVLGTETLTFPGAPGVSV---- 200
Query: 201 NSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL----D 256
+ FGCG G +ST G +G G+ + SL++QL +F++CL +
Sbjct: 201 -GGIAFGCGVDNGGLSYNST-----GTVGLGRGSLSLVAQLGVG-----KFSYCLTDFFN 249
Query: 257 VVKGGGIF--AIGDVVSPK----VKTTPMV--PNMPH-YNVILEEVEVGGNPLDLPTSLL 307
G + A+ ++ +P V++TP+V P +P Y V LE + +G L +P
Sbjct: 250 TSLGSPVLFGALAELAAPSTGAAVQSTPLVQSPYVPTWYYVSLEGISLGDARLPIPN--- 306
Query: 308 GTGDER-----GTIIDSGTTLAYL 326
GT D R G I+DSGTT +L
Sbjct: 307 GTFDLRDDGSGGMIVDSGTTFTFL 330
>gi|449439383|ref|XP_004137465.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
sativus]
Length = 491
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 88/304 (28%), Positives = 135/304 (44%), Gaps = 40/304 (13%)
Query: 46 ERTLSALKQHDTRRHGRMMASIDLELGG-NGHPSATGLYFTKVGLGTPTDEYYVQVDTGS 104
E LS LK+ D + DL +G +G YF++VG+G P +Y+ +DTGS
Sbjct: 117 EFALSELKRSDLEPLKTEILPEDLSTPIISGTSQGSGEYFSRVGVGQPAKPFYMVLDTGS 176
Query: 105 DLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVR 164
D+ W+ C C+ C ++D +FDP SS+ + C C+ S +
Sbjct: 177 DINWLQCQPCTDCYQQTD-----PIFDPRSSSSFASLPCESQQCQALET----SGCRASK 227
Query: 165 CEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAV 224
C Y V+YGDGS T G FV + + GN + + ++V GCG+ G
Sbjct: 228 CLYQVSYGDGSFTVGEFVIETLTF----GN---SGMINNVAVGCGHDNEGLF-----VGS 275
Query: 225 DGILGFGQANSSLLSQLAAAGNVRKEFAHCL--------DVVKGGGIFAIGDVVSPKVKT 276
G+LG G + SL SQ+ A+ F++CL ++ V +P +K+
Sbjct: 276 AGLLGLGGGSLSLTSQMKASS-----FSYCLVDRDSSSSSDLEFNSAAPSDSVNAPLLKS 330
Query: 277 TPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDE--RGTIIDSGTTLAYLPPMLYDLV 334
+ Y V L + VGG L +P +L D G I+DSGT + L Y+ +
Sbjct: 331 GKV---DTFYYVGLTGMSVGGQLLSIPPNLFQMDDSGYGGIIVDSGTAITRLQTQAYNTL 387
Query: 335 LSQF 338
F
Sbjct: 388 RDAF 391
>gi|88174593|gb|ABD39371.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
Length = 321
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 82/278 (29%), Positives = 125/278 (44%), Gaps = 52/278 (18%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y VGLGTP V++DTGS WV C C C T F S+S+T +++
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNP------RTFLQSRSTTCAKVS 53
Query: 143 C---------SDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASG 193
C SD C+ + N YP C + V+Y DGS++ G +D + +
Sbjct: 54 CGTSMCLLGGSDPHCQDSEN--YPD------CPFRVSYQDGSASYGILYQDTLTFS---- 101
Query: 194 NLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAH 253
+++ P S FGC N S G++ VDG+LG G S+L Q + + F++
Sbjct: 102 DVQKIPGFS---FGC-NMDS--FGANEFGNVDGLLGMGAGPMSVLKQSSPTFDC---FSY 152
Query: 254 CLDVVKG--------GGIFAIGDVVS-PKVKTTPMV---PNMPHYNVILEEVEVGGNPLD 301
CL + K G F++G V + V+ T MV N + V L + V G L
Sbjct: 153 CLPLQKSERGFFSKTTGYFSLGKVATRTDVRYTKMVARKKNTELFFVDLIAISVDGERLG 212
Query: 302 LPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
L S+ +G + DSG+ L+Y+P ++ + R
Sbjct: 213 LSPSVFS---RKGVVFDSGSELSYIPDRALSVLSQRIR 247
>gi|297827577|ref|XP_002881671.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
gi|297327510|gb|EFH57930.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
Length = 438
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 75/270 (27%), Positives = 122/270 (45%), Gaps = 43/270 (15%)
Query: 87 VGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDN 146
+ +G+P + +DTGS+L W++C +LG ++F+P SST + CS
Sbjct: 65 LAVGSPPQNISMVLDTGSELSWLHCK------KSPNLG---SVFNPVSSSTYSPVPCSSP 115
Query: 147 FCRT-TYNNRYP-SCSPGVR-CEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSS 203
CRT T + P SC P C ++Y D +S G D + +
Sbjct: 116 ICRTRTRDLPIPASCDPKTHFCHVAISYADATSIEGNLAHDTFVIGSVT--------RPG 167
Query: 204 VIFGCGNR-QSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGG 262
+FGC + S D S DA G++G + + S ++QL + +F++C+ G
Sbjct: 168 TLFGCMDSGLSSD--SEEDAKSTGLMGMNRGSLSFVNQLGFS-----KFSYCISGSDSSG 220
Query: 263 IFAIGDVVSP---KVKTTPMVPN---MPH-----YNVILEEVEVGGNPLDLPTSLLGTGD 311
I +GD ++ TP+V +P+ Y V LE + VG L LP S+ D
Sbjct: 221 ILLLGDASYSWLGPIQYTPLVLQTTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVF-VPD 279
Query: 312 ERG---TIIDSGTTLAYLPPMLYDLVLSQF 338
G T++DSGT +L +Y + ++F
Sbjct: 280 HTGAGQTMVDSGTQFTFLMGPVYTALKNEF 309
>gi|449449755|ref|XP_004142630.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
gi|449500674|ref|XP_004161165.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
Length = 413
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 74/261 (28%), Positives = 128/261 (49%), Gaps = 34/261 (13%)
Query: 27 VMGNFVFEVENKFK-AGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFT 85
+ +F + +KF A E+ +TL R +S+ + GN +P G +
Sbjct: 10 LFASFAVSLSDKFLFADSEQVKTL------------RFGSSVLFPVRGNVYP--LGHFTV 55
Query: 86 KVGLGTPTDEYYVQVDTGSDLLWVNC-AGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACS 144
+ +G P+ + + +DTGSDL WV C C C D+ L+ P ++ S E
Sbjct: 56 LLNIGNPSKVFELDIDTGSDLTWVQCDVECIGCTLPRDM-----LYRPHNNAVSRE---- 106
Query: 145 DNFCRTTYN-NRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSS 203
D C + ++ +P +C Y V Y D S+ G V+D++ + +G + +P +
Sbjct: 107 DPLCAALSSLGKFIFKNPNDQCAYEVEYADHGSSVGVLVKDLVPMRLTNGK-RISP---N 162
Query: 204 VIFGCG-NRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGG 262
+ FGCG ++++GDL ++ G+LG + ++++SQL+ G+V HCL GG
Sbjct: 163 LGFGCGYDQENGDL--QQPPSIAGVLGLSSSKATIVSQLSDLGHVSNVVGHCLTGRGGGF 220
Query: 263 IFAIGDVV-SPKVKTTPMVPN 282
+F GDVV S + TP++ N
Sbjct: 221 LFFGGDVVPSSGMSWTPILRN 241
>gi|449516339|ref|XP_004165204.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 456
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 74/271 (27%), Positives = 119/271 (43%), Gaps = 45/271 (16%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
+ + +G+P V VDTGS LLWV C C C +S + FDP KS + +
Sbjct: 104 FLVNLSIGSPPVTQLVVVDTGSSLLWVQCLPCINCFQQST-----SWFDPLKSVSFKTLG 158
Query: 143 CSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQA-SGNLKTAPLN 201
C F Y N Y C+ + EY + Y G S+ G ++ + G +K
Sbjct: 159 CG--FPGYNYINGY-KCNRFNQAEYKLRYLGGDSSQGILAKESLLFETLDEGKIK----K 211
Query: 202 SSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL------ 255
S++ FGCG+ ++ ++ D A +G+ G G + A + +F++C+
Sbjct: 212 SNITFGCGHM---NIKTNNDDAYNGVFGLGA-----YPHITMATQLGNKFSYCIGDINNP 263
Query: 256 -----DVVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDL-PTSLLGT 309
+V G G + GD +TP+ + HY V L+ + VG L + P + +
Sbjct: 264 LYTHNHLVLGQGSYIEGD-------STPLQIHFGHYYVTLQSISVGSKTLKIDPNAFKIS 316
Query: 310 GD-ERGTIIDSGTTLAYLP----PMLYDLVL 335
D G +IDSG T L +LYD ++
Sbjct: 317 SDGSGGVLIDSGMTYTKLANGGFELLYDEIV 347
>gi|226508080|ref|NP_001150678.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
gi|195641018|gb|ACG39977.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
Length = 450
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 91/331 (27%), Positives = 132/331 (39%), Gaps = 52/331 (15%)
Query: 37 NKFKAGGERERTLSALKQHDTRRHGRMMAS-IDLELGGNGHPSATGLYFTKVGLGTPTDE 95
++F G R + +H+ R+ +S + P+A G Y + +GTP
Sbjct: 48 SQFVRGALRRD----MHRHNARKLALAASSGATVSAPTQNSPTA-GEYLMALAIGTPPLP 102
Query: 96 YYVQVDTGSDLLWVNCAGCS----RCPTKSDLGIKLTLFDPSKSSTSGEIACSDNF--CR 149
Y DTGSDL+W CA C+ R PT L++PS S+T + C+ + C
Sbjct: 103 YQAIADTGSDLIWTQCAPCTSQCFRQPTP--------LYNPSSSTTFAVLPCNSSLSVCA 154
Query: 150 TTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCG 209
+ PG C Y VTYG G TS + + P + FGC
Sbjct: 155 AALAGTGTAPPPGCACTYNVTYGSG-WTSVFQGSETFTFGSTPAGQSRVP---GIAFGCS 210
Query: 210 NRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVK---GGGIFAI 266
SG SS G++G G+ SL+SQL +F++CL + +
Sbjct: 211 TASSGFNASS----ASGLVGLGRGRLSLVSQLGV-----PKFSYCLTPYQDTNSTSTLLL 261
Query: 267 GDVVS----PKVKTTPMV------PNMPHYNVILEEVEVGGNPLDLPTS--LLGTGDERG 314
G S V +TP V P Y + L + +G L +P LL G
Sbjct: 262 GPSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALSIPPDAFLLNADGTGG 321
Query: 315 TIIDSGTTLAYLPPMLYDLVLSQFRFWIASL 345
IIDSGTT+ +L + Q R + SL
Sbjct: 322 LIIDSGTTIT----LLGNTAYQQVRAAVVSL 348
>gi|88174565|gb|ABD39357.1| chloroplast nucleoid DNA-binding protein [Oryza glumipatula]
Length = 323
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 80/280 (28%), Positives = 125/280 (44%), Gaps = 54/280 (19%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y VGLGTP+ V++DTGS WV C C C T F S+S+T +++
Sbjct: 1 YVISVGLGTPSKTQIVEIDTGSSTSWVFCE-CDGCHTNP------RTFLQSRSTTCAKVS 53
Query: 143 C---------SDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASG 193
C SD C+ + N YP C + V+Y DGS++ G +D + +
Sbjct: 54 CGTSMCLLGGSDPHCQDSEN--YPDCP------FRVSYQDGSASYGILYQDTLTFS---- 101
Query: 194 NLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAH 253
+++ P FGC N S G++ VDG+LG G S+L Q + + F++
Sbjct: 102 DVQKIP---GFTFGC-NMDS--FGANEFGNVDGLLGMGAGQMSVLKQSSPTFD---GFSY 152
Query: 254 CLDV--------VKGGGIFAIGDVVSP---KVKTTPMV---PNMPHYNVILEEVEVGGNP 299
CL + K G F++G ++ V+ T MV N + V L + V G
Sbjct: 153 CLPLQMSERGFFSKTTGYFSLGGKIAATRTDVRYTKMVARRKNTELFFVDLTAISVDGER 212
Query: 300 LDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
L L S+ +G + DSG+ L+Y+P ++ + R
Sbjct: 213 LGLSPSIFS---RKGVVFDSGSELSYIPDRALSVLSQRIR 249
>gi|115438214|ref|NP_001043485.1| Os01g0598600 [Oryza sativa Japonica Group]
gi|15290061|dbj|BAB63755.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
Group]
gi|113533016|dbj|BAF05399.1| Os01g0598600 [Oryza sativa Japonica Group]
gi|125526702|gb|EAY74816.1| hypothetical protein OsI_02707 [Oryza sativa Indica Group]
Length = 500
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 92/299 (30%), Positives = 128/299 (42%), Gaps = 57/299 (19%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPS 133
+G +G YFTK+G+GTP + +DTGSD++W+ CA C RC +S +FDP
Sbjct: 138 SGLAQGSGEYFTKIGVGTPVTPALMVLDTGSDVVWLQCAPCRRCYDQSG-----QMFDPR 192
Query: 134 KSSTSGEIACSDNFCRTTYNNRYPSCSPGVR---CEYVVTYGDGSSTSGYFVRDIIQLNQ 190
S + G + C+ CR R S +R C Y V YGDGS T+G F + +
Sbjct: 193 ASHSYGAVDCAAPLCR-----RLDSGGCDLRRKACLYQVAYGDGSVTAGDFATETLTF-- 245
Query: 191 ASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKE 250
ASG V GCG+ G ++ G+ + S SQ++ +
Sbjct: 246 ASGARV-----PRVALGCGHDNEGLFVAAAGLLGL-----GRGSLSFPSQISR--RFGRS 293
Query: 251 FAHCL---------------DVVKGGGIFAIGDVVSPKVKTTPMV--PNMP-HYNVILEE 292
F++CL V G G A+G S TPMV P M Y V L
Sbjct: 294 FSYCLVDRTSSSASATSRSSTVTFGSG--AVGP--SAAASFTPMVKNPRMETFYYVQLMG 349
Query: 293 VEVGGNPL------DLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFRFWIASL 345
+ VGG + DL L + G I+DSGT++ L Y + FR A L
Sbjct: 350 ISVGGARVPGVAVSDL--RLDPSTGRGGVIVDSGTSVTRLARPAYAALRDAFRAAAAGL 406
>gi|307136234|gb|ADN34070.1| aspartic proteinase nepenthesin-1 precursor [Cucumis melo subsp.
melo]
Length = 412
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 76/298 (25%), Positives = 129/298 (43%), Gaps = 46/298 (15%)
Query: 63 MMASIDLELGGNGHPSATGLYFTKVGL------GTPTDEYYVQVDTGSDLLWVNCAGCSR 116
M+ + ++G PS + V L G+P + + +DTGS+L W++ C +
Sbjct: 14 MVLPLQTQMGLISQPSNKLSFHHNVTLTVSLTVGSPPQQVTMVLDTGSELSWLH---CKK 70
Query: 117 CPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRT-TYNNRYP-SCSPGVRCEYVVTYGDG 174
P + ++F+P SS+ I CS CRT T + P +C P C +V+Y D
Sbjct: 71 SPNLT------SVFNPLSSSSYSPIPCSSPVCRTRTRDLPNPVTCDPKKLCHAIVSYADA 124
Query: 175 SSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQAN 234
SS G D ++ ++ +FGC + S DA G++G + +
Sbjct: 125 SSLEGNLASDNFRIGSSA--------LPGTLFGCMDSGFSS-NSEEDAKTTGLMGMNRGS 175
Query: 235 SSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGD---------VVSPKVKTTPMVPNMPH 285
S ++QL +F++C+ G+ GD +P V+ + +P
Sbjct: 176 LSFVTQLGLP-----KFSYCISGRDSSGVLLFGDSHLSWLGNLTYTPLVQISTPLPYFDR 230
Query: 286 --YNVILEEVEVGGNPLDLPTSLLGTGDERG---TIIDSGTTLAYLPPMLYDLVLSQF 338
Y V L+ + VG L LP S+ D G T++DSGT +L +Y + ++F
Sbjct: 231 VAYTVQLDGIRVGNKILPLPKSIFAP-DHTGAGQTMVDSGTQFTFLLGPVYTALRNEF 287
>gi|449517142|ref|XP_004165605.1| PREDICTED: aspartic proteinase-like protein 1-like, partial
[Cucumis sativus]
Length = 430
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 64/214 (29%), Positives = 95/214 (44%), Gaps = 24/214 (11%)
Query: 127 LTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTS-GYFVRDI 185
L + P+ S+TS + C+ + C +N+ C Y + Y +++S GY V D+
Sbjct: 3 LNHYSPNDSTTSSTVPCTSSLCNRCTSNQN-------VCPYEMRYLSANTSSIGYLVEDV 55
Query: 186 IQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAG 245
+ L LK P+ + + FGCG Q+G +T AA +G++G G S+ S LA G
Sbjct: 56 LHLATDDSLLK--PVEAKITFGCGTVQTGIF--ATTAAPNGLIGLGMEKISVPSFLADQG 111
Query: 246 NVRKEFAHCLDVVKGGGIFAIGDVVSPKVKTTPMVPNMPH--YNVILEEVEVGGNPLDLP 303
F+ C G G GD K TP + + YNV + VGG P D+P
Sbjct: 112 LTSNSFSMCFGA-DGYGRIDFGDTGPADQKQTPFNTMLEYQSYNVTFNVINVGGEPNDVP 170
Query: 304 TSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQ 337
+ I DSGT+ YL Y + Q
Sbjct: 171 FT---------AIFDSGTSFTYLTEPAYSTITKQ 195
>gi|242086412|ref|XP_002443631.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
gi|241944324|gb|EES17469.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
Length = 507
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 88/270 (32%), Positives = 118/270 (43%), Gaps = 43/270 (15%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSG 139
+G Y K+ +GTP E + +DT SDL W+ C C RC +S +FDP S++ G
Sbjct: 138 SGDYIAKIAVGTPAVEALLALDTASDLTWLQCQPCRRCYPQSG-----PVFDPRHSTSYG 192
Query: 140 EIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDG------SSTSGYFVRDIIQLNQASG 193
E+ C+ + G C Y V YGDG S++ G V + + +G
Sbjct: 193 EMNYDAPDCQALGRSGGGDAKRGT-CIYTVLYGDGDGHGSTSTSVGDLVEETLTF---AG 248
Query: 194 NLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAH 253
++ A L+ GCG+ G G A GILG + S+ Q+A G F++
Sbjct: 249 GVRQAYLS----IGCGHDNKGLFG----APAAGILGLSRGQISIPHQIAFLG-YNASFSY 299
Query: 254 CL-DVVKGGG------IFAIGDV-VSPKVKTTPMV--PNMP-HYNVILEEVEVGG----- 297
CL D + G G F G V SP TP V NMP Y V L V VGG
Sbjct: 300 CLVDFISGPGSPSSTLTFGAGAVDTSPPASFTPTVLNQNMPTFYYVRLIGVSVGGVRVPG 359
Query: 298 -NPLDLPTSLLGTGDERGTIIDSGTTLAYL 326
DL L G I+DSGTT+ L
Sbjct: 360 VTERDL--QLDPYTGHGGVILDSGTTVTRL 387
>gi|449457263|ref|XP_004146368.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
sativus]
Length = 469
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 75/281 (26%), Positives = 120/281 (42%), Gaps = 52/281 (18%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
+ + +G+P V VDTGS LLWV C C C +S + FDP KS + +
Sbjct: 104 FLVNLSIGSPPVTQLVVVDTGSSLLWVQCLPCINCFQQST-----SWFDPLKSVSFKTLG 158
Query: 143 CSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRD-----------IIQLNQA 191
C F Y N Y C+ + EY + Y G S+ G ++ + Q N
Sbjct: 159 CG--FPGYNYINGY-KCNRFNQAEYKLRYLGGDSSQGILAKESLLFETLDEGRVFQYNAI 215
Query: 192 SGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEF 251
S + S++ FGCG+ ++ ++ D A +G+ G G + A + +F
Sbjct: 216 STQISKIK-KSNITFGCGHM---NIKTNNDDAYNGVFGLGA-----YPHITMATQLGNKF 266
Query: 252 AHCL-----------DVVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPL 300
++C+ +V G G + GD +TP+ + HY V L+ + VG L
Sbjct: 267 SYCIGDINNPLYTHNHLVLGQGSYIEGD-------STPLQIHFGHYYVTLQSISVGSKTL 319
Query: 301 DL-PTSLLGTGD-ERGTIIDSGTTLAYLP----PMLYDLVL 335
+ P + + D G +IDSG T L +LYD ++
Sbjct: 320 KIDPNAFKISSDGSGGVLIDSGMTYTKLANGGFELLYDEIV 360
>gi|242066168|ref|XP_002454373.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
gi|241934204|gb|EES07349.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
Length = 458
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 83/275 (30%), Positives = 121/275 (44%), Gaps = 33/275 (12%)
Query: 64 MASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGC-SRCPTKSD 122
+AS+ L G G G Y T++GLGTP Y + VDTGS L W+ C+ C C +S
Sbjct: 105 LASVPL---GPGTSVGVGNYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCLVSCHRQSG 161
Query: 123 LGIKLTLFDPSKSSTSGEIACSDNFCR--TTYNNRYPSCSPGVRCEYVVTYGDGSSTSGY 180
+F+P SS+ ++CS C TT +CS C Y +YGD S + GY
Sbjct: 162 -----PVFNPRSSSSYASVSCSAPQCDALTTATLNPSTCSTSNVCIYQASYGDSSFSVGY 216
Query: 181 FVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQ 240
+D + S + +GCG G G S G++G + SLL Q
Sbjct: 217 LSKDTVSFGSTS--------VPNFYYGCGQDNEGLFGQSA-----GLIGLARNKLSLLYQ 263
Query: 241 LAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSP-KVKTTPMVPNM---PHYNVILEEVEVG 296
LA ++ F++CL + +P + TPM + Y + + + V
Sbjct: 264 LAP--SMGYSFSYCLPTSSSSSGYLSIGSYNPGQYSYTPMAKSSLDDSLYFIKMTGITVA 321
Query: 297 GNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLY 331
G PL + S + TIIDSGT + LP +Y
Sbjct: 322 GKPLSVSASAYSS---LPTIIDSGTVITRLPTDVY 353
>gi|226492391|ref|NP_001140482.1| uncharacterized protein LOC100272542 precursor [Zea mays]
gi|224030447|gb|ACN34299.1| unknown [Zea mays]
gi|414887506|tpg|DAA63520.1| TPA: hypothetical protein ZEAMMB73_432695 [Zea mays]
Length = 512
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 86/282 (30%), Positives = 119/282 (42%), Gaps = 46/282 (16%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y V +GTP + + +DTGSDL W+ CA C C + +FDP+ SS+ +
Sbjct: 146 YLMDVYVGTPPRRFQMIMDTGSDLNWLQCAPCLDC-----FEQRGPVFDPAASSSYRNLT 200
Query: 143 CSDNFC--------RTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGN 194
C D C R P P C Y YGD S+++G + +N
Sbjct: 201 CGDPRCGHVAPPEAPAPRACRRPGEDP---CPYYYWYGDQSNSTGDLALESFTVNL---- 253
Query: 195 LKTAPLNSS----VIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKE 250
TAP SS V+FGCG+R G + G+ S SQL A
Sbjct: 254 --TAPGASSRVDGVVFGCGHRNRGLFHGAAGLLGL-----GRGPLSFASQLRAVYGGHT- 305
Query: 251 FAHCL----DVVKGGGIFAIGDVVS----PKVKTTPMVP-NMP---HYNVILEEVEVGGN 298
F++CL V +F D ++ P++K T P + P Y V L V VGG
Sbjct: 306 FSYCLVDHGSDVASKVVFGEDDALALAAHPRLKYTAFAPASSPADTFYYVRLTGVLVGGE 365
Query: 299 PLDLPTSLL--GTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
L++ + G GTIIDSGTTL+Y Y ++ F
Sbjct: 366 LLNISSDTWDASEGGSGGTIIDSGTTLSYFVEPAYQVIRRAF 407
>gi|255563741|ref|XP_002522872.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
gi|223537956|gb|EEF39570.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
communis]
Length = 448
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 77/273 (28%), Positives = 116/273 (42%), Gaps = 31/273 (11%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y KV +G+P Y+ DTGS L W C C+R +F+ + S T ++
Sbjct: 91 YLVKVIIGSPGVPLYLVPDTGSGLFWTQCEPCTR-----RFRQLPPIFNSTASRTYRDLP 145
Query: 143 CSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNS 202
C FC T N C +C Y + Y GS+T+G +DI+Q S P
Sbjct: 146 CQHQFC--TNNQNVFQCRDD-KCVYRIAYAGGSATAGVAAQDILQ----SAENDRIPF-- 196
Query: 203 SVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVV---- 258
FGC + GI+G + SLL Q+ + F++CL++
Sbjct: 197 --YFGCSRDNQNFSTFESSGKGGGIIGLNMSPVSLLQQMNHI--TKNRFSYCLNLFDLSS 252
Query: 259 --KGGGIFAIGDVVSP---KVKTTPMVP--NMPHYNVILEEVEVGGNPLDLP--TSLLGT 309
+ G+ + K +TP V MP+Y + L +V V GN + +P T L
Sbjct: 253 PSHATSLLRFGNDIRKSRRKYLSTPFVSPRGMPNYFLNLIDVSVAGNRMQIPPGTFALKP 312
Query: 310 GDERGTIIDSGTTLAYLPPMLYDLVLSQFRFWI 342
GTIIDSGT + Y+ Y V++ F+ +
Sbjct: 313 DGTGGTIIDSGTAVTYISQTAYFPVITAFKNYF 345
>gi|225455876|ref|XP_002275164.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
Length = 496
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 80/274 (29%), Positives = 118/274 (43%), Gaps = 37/274 (13%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPS 133
+G +G YF +VG+G P+ +Y+ +DTGSD+ W+ C C C + D +FDP+
Sbjct: 151 SGTSQGSGEYFLRVGIGRPSKTFYMVIDTGSDVNWLQCKPCDDCYQQVD-----PIFDPA 205
Query: 134 KSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASG 193
SS+ + C CR N +C C Y V+YGDGS T G F + + +
Sbjct: 206 SSSSFSRLGCQTPQCR---NLDVFACR-NDSCLYQVSYGDGSYTVGDFATETVSFGNSGS 261
Query: 194 NLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAH 253
K A GCG+ G + G SL SQ+ A+ F++
Sbjct: 262 VDKVA-------IGCGHDNEGLFVGAAGLIGLGGGPL-----SLTSQIKAS-----SFSY 304
Query: 254 CL---DVVKGGGIFAIGDVVSPKVKTTPMVPNMP---HYNVILEEVEVGGNPLDLPTSLL 307
CL D V + S V T P+ N Y V + + VGG L +P S+
Sbjct: 305 CLVNRDSVDSSTLEFNSAKPSDSV-TAPIFKNSKVDTFYYVGITGMSVGGEKLAIPPSIF 363
Query: 308 ---GTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
G+G + G I+D GT + L Y+ + F
Sbjct: 364 EVDGSG-KGGIIVDCGTAVTRLQTQAYNALRDTF 396
>gi|15228044|ref|NP_181826.1| aspartyl protease-like protein [Arabidopsis thaliana]
gi|20197868|gb|AAM15292.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|20197965|gb|AAD21712.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
thaliana]
gi|330255100|gb|AEC10194.1| aspartyl protease-like protein [Arabidopsis thaliana]
Length = 527
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 77/300 (25%), Positives = 134/300 (44%), Gaps = 43/300 (14%)
Query: 61 GRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTK 120
G+++A+++ +G +G YF V +GTP + + +DTGSDL W+ C C C +
Sbjct: 143 GKLIATLE-----SGMTLGSGEYFMDVLVGTPPKHFSLILDTGSDLNWLQCLPCYDCFHQ 197
Query: 121 SDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCE-------YVVTYGD 173
+ + +DP S++ I C+D C S P V+CE Y YGD
Sbjct: 198 NGM-----FYDPKTSASFKNITCNDPRCSLI-----SSPDPPVQCESDNQSCPYFYWYGD 247
Query: 174 GSSTSGYFVRDIIQLNQASGNLKTAPLN-SSVIFGCGNRQSGDLGSSTDAAVDGILG--- 229
S+T+G F + +N + ++ +++FGCG+ G ++ G
Sbjct: 248 RSNTTGDFAVETFTVNLTTTEGGSSEYKVGNMMFGCGHWNRGLFSGASGLLGLGRGPLSF 307
Query: 230 -------FGQANSSLLSQLAAAGNVRKE--FAHCLDVVKGGGIFAIGDVVSPKVKTTPMV 280
+G + S L + NV + F D++ + V+ K +
Sbjct: 308 SSQLQSLYGHSFSYCLVDRNSNTNVSSKLIFGEDKDLLNHTNL-NFTSFVNGKENSVETF 366
Query: 281 PNMPHYNVILEEVEVGGNPLDLP--TSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
Y + ++ + VGG LD+P T + + + GTIIDSGTTL+Y Y+++ ++F
Sbjct: 367 -----YYIQIKSILVGGKALDIPEETWNISSDGDGGTIIDSGTTLSYFAEPAYEIIKNKF 421
>gi|414887401|tpg|DAA63415.1| TPA: hypothetical protein ZEAMMB73_414910 [Zea mays]
Length = 242
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 55/164 (33%), Positives = 86/164 (52%), Gaps = 21/164 (12%)
Query: 175 SSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQAN 234
SS+SG DI+ + S LK +FGC N ++GDL S DGI+G G+
Sbjct: 2 SSSSGVLGEDIVSFGRES-ELKA----QRAVFGCENSETGDLFSQ---HADGIMGLGRGQ 53
Query: 235 SSLLSQLAAAGNVRKEFAHC---LDVVKGGGIFAIGDVVSPK----VKTTPMVPNMPHYN 287
S++ QL G + F+ C +D+ GGG +G V +P ++ P+ P+YN
Sbjct: 54 LSIMDQLVEKGVINDSFSLCYGGMDI--GGGAMVLGGVPTPSDMVFSRSDPL--RSPYYN 109
Query: 288 VILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLY 331
+ L+E+ V G L + + + + + GT++DSGTT AYLP +
Sbjct: 110 IELKEIHVAGKALRVDSRIFDS--KHGTVLDSGTTYAYLPEQAF 151
>gi|115441003|ref|NP_001044781.1| Os01g0844500 [Oryza sativa Japonica Group]
gi|19571042|dbj|BAB86469.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|20160609|dbj|BAB89555.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
Group]
gi|113534312|dbj|BAF06695.1| Os01g0844500 [Oryza sativa Japonica Group]
gi|125572614|gb|EAZ14129.1| hypothetical protein OsJ_04051 [Oryza sativa Japonica Group]
Length = 442
Score = 81.6 bits (200), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 75/279 (26%), Positives = 123/279 (44%), Gaps = 39/279 (13%)
Query: 87 VGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDN 146
+ +GTP + +DTGS+L W+ CA + F P S T + C
Sbjct: 70 LAVGTPPQNVTMVLDTGSELSWLLCAPGGGGGGGGRSALS---FRPRASLTFASVPCDSA 126
Query: 147 FCRTTYNNRYPSCS-PGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVI 205
CR+ P+C +C ++Y DGSS+ G ++ + Q PL ++
Sbjct: 127 QCRSRDLPSPPACDGASKQCRVSLSYADGSSSDGALATEVFTVGQGP------PLRAA-- 178
Query: 206 FGCGNRQSGDLGSSTD-AAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIF 264
FGC + +S D A G+LG + S +SQ + + F++C+ G+
Sbjct: 179 FGC---MATAFDTSPDGVATAGLLGMNRGALSFVSQAST-----RRFSYCISDRDDAGVL 230
Query: 265 AIG--DVVSPKVKTTPMV-PNMP-------HYNVILEEVEVGGNPLDLPTSLLGTGDERG 314
+G D+ + TP+ P MP Y+V L + VGG PL +P S+L D G
Sbjct: 231 LLGHSDLPFLPLNYTPLYQPAMPLPYFDRVAYSVQLLGIRVGGKPLPIPASVLAP-DHTG 289
Query: 315 ---TIIDSGTTLAYLPPMLYDLVLSQF----RFWIASLD 346
T++DSGT +L Y + ++F + W+ +L+
Sbjct: 290 AGQTMVDSGTQFTFLLGDAYSALKAEFSRQTKPWLPALN 328
>gi|224124882|ref|XP_002329972.1| predicted protein [Populus trichocarpa]
gi|222871994|gb|EEF09125.1| predicted protein [Populus trichocarpa]
Length = 332
Score = 81.3 bits (199), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 77/252 (30%), Positives = 115/252 (45%), Gaps = 38/252 (15%)
Query: 100 VDTGSDLLWVNCAGCS-RCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCR----TTYNN 154
+DTGS L W+ C C+ C ++D L+DPS S T +++C+ C T N+
Sbjct: 3 LDTGSSLSWLQCQPCAVYCHAQAD-----PLYDPSVSKTYKKLSCASVECSRLKAATLND 57
Query: 155 RYPSCSPGVR-CEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQS 213
P C C Y +YGD S + GY +D++ L + +T P +GCG
Sbjct: 58 --PLCETDSNACLYTASYGDTSFSIGYLSQDLLTLTSS----QTLP---QFTYGCGQDNQ 108
Query: 214 GDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL---DVVKGGGIFAIGDVV 270
G G + GI+G + S+L+QL+ F++CL + GG F +
Sbjct: 109 GLFGRAA-----GIIGLARDKLSMLAQLST--KYGHAFSYCLPTANSGSSGGGFLSIGSI 161
Query: 271 SP-KVKTTPMV---PNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYL 326
SP K TPM+ N Y + L + V G PLDL ++ T+IDSGT + L
Sbjct: 162 SPTSYKFTPMLTDSKNPSLYFLRLTAITVSGRPLDLAAAMY----RVPTLIDSGTVITRL 217
Query: 327 PPMLYDLVLSQF 338
P +Y + F
Sbjct: 218 PMSMYAALRQAF 229
>gi|357166728|ref|XP_003580821.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 479
Score = 81.3 bits (199), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 69/224 (30%), Positives = 99/224 (44%), Gaps = 34/224 (15%)
Query: 45 RERTLSALKQHDTRRHGRMMASID----------LELGGNGHPSATGLYFTKVGLGTPTD 94
R + AL DT R + + +E GG +G Y +VG+G+P
Sbjct: 75 RRHAVLALASRDTARVAYLQRRLSPSPSPSSTSSVESGGTIVSHGSGEYLVRVGIGSPPL 134
Query: 95 EYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNN 154
E ++ DTGSD++WV C+ CS C + D LFDP+ S++ + C+ CR
Sbjct: 135 EQHLVADTGSDVIWVQCSPCSDCYAQGD-----PLFDPANSASFSPVPCNSGVCRAA--A 187
Query: 155 RY---PSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNR 211
RY G CEY V+YGD S T+G + + L+ G + V GCG+
Sbjct: 188 RYSSSSCGGGGGECEYKVSYGDKSYTNGVLALETLTLD---GGTEV----QGVAMGCGHE 240
Query: 212 QSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL 255
G A G+LG G SL+ QL A F++CL
Sbjct: 241 NRGLF-----AEAAGLLGLGWGPMSLVGQLGGA--AGGAFSYCL 277
>gi|125528357|gb|EAY76471.1| hypothetical protein OsI_04407 [Oryza sativa Indica Group]
Length = 441
Score = 81.3 bits (199), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 75/279 (26%), Positives = 123/279 (44%), Gaps = 39/279 (13%)
Query: 87 VGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDN 146
+ +GTP + +DTGS+L W+ CA + F P S T + C
Sbjct: 69 LAVGTPPQNVTMVLDTGSELSWLLCAPGGGGGGGGRSALS---FRPRASLTFASVPCGSA 125
Query: 147 FCRTTYNNRYPSCS-PGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVI 205
CR+ P+C +C ++Y DGSS+ G ++ + Q PL ++
Sbjct: 126 QCRSRDLPSPPACDGASKQCRVSLSYADGSSSDGALATEVFTVGQGP------PLRAA-- 177
Query: 206 FGCGNRQSGDLGSSTD-AAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIF 264
FGC + +S D A G+LG + S +SQ + + F++C+ G+
Sbjct: 178 FGC---MATAFDTSPDGVATAGLLGMNRGALSFVSQAST-----RRFSYCISDRDDAGVL 229
Query: 265 AIG--DVVSPKVKTTPMV-PNMP-------HYNVILEEVEVGGNPLDLPTSLLGTGDERG 314
+G D+ + TP+ P MP Y+V L + VGG PL +P S+L D G
Sbjct: 230 LLGHSDLPFLPLNYTPLYQPAMPLPYFDRVAYSVQLLGIRVGGKPLPIPASVLAP-DHTG 288
Query: 315 ---TIIDSGTTLAYLPPMLYDLVLSQF----RFWIASLD 346
T++DSGT +L Y + ++F + W+ +L+
Sbjct: 289 AGQTMVDSGTQFTFLLGDAYSALKAEFSRQTKPWLPALN 327
>gi|88174567|gb|ABD39358.1| chloroplast nucleoid DNA-binding protein [Oryza glumipatula]
Length = 323
Score = 81.3 bits (199), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 80/280 (28%), Positives = 124/280 (44%), Gaps = 54/280 (19%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y VGLGTP V++DTGS WV C C C T F S+S+T +++
Sbjct: 1 YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-CDGCHTNP------RTFLQSRSTTCAKVS 53
Query: 143 C---------SDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASG 193
C SD C+ + N YP C + V+Y DGS++ G +D + +
Sbjct: 54 CGTSMCLLGGSDPHCQDSEN--YPDCP------FRVSYQDGSASYGILYQDTLTFS---- 101
Query: 194 NLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAH 253
+++ P FGC N S G++ VDG+LG G S+L Q + + F++
Sbjct: 102 DVQKIP---GFTFGC-NMDS--FGANEFGNVDGLLGMGAGQMSVLKQSSPTFD---GFSY 152
Query: 254 CLDV--------VKGGGIFAIGDVVSP---KVKTTPMV---PNMPHYNVILEEVEVGGNP 299
CL + K G F++G ++ V+ T MV N + V L + V G
Sbjct: 153 CLPLQMSERGFFSKTTGYFSLGGKIAATRTDVRYTKMVARRKNTELFFVDLTAISVDGER 212
Query: 300 LDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
L L S+ +G + DSG+ L+Y+P ++ + R
Sbjct: 213 LGLSPSIFS---RKGVVFDSGSELSYIPDRALSVLSQRIR 249
>gi|297794789|ref|XP_002865279.1| hypothetical protein ARALYDRAFT_494467 [Arabidopsis lyrata subsp.
lyrata]
gi|297311114|gb|EFH41538.1| hypothetical protein ARALYDRAFT_494467 [Arabidopsis lyrata subsp.
lyrata]
Length = 419
Score = 81.3 bits (199), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 85/295 (28%), Positives = 129/295 (43%), Gaps = 62/295 (21%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAG----CSRC-PTKSDLGIKLTLFDPSKSST 137
Y + +GTP V +DTGSDL WV C C C KS+ ++F P SS+
Sbjct: 11 YLITLNIGTPPQAVQVYMDTGSDLTWVPCGNLSFDCIDCNDLKSNNLKSSSIFSPLHSSS 70
Query: 138 SGEIACSDNFCRTTYNNRYP-------SCSPGVRCE---------YVVTYGDGSSTSGYF 181
S +C+ +FC +++ P CS + + + TYG+G SG
Sbjct: 71 SFRASCASSFCAEIHSSDNPFDPCAIAGCSVSMLLKSTCIRPCPSFAYTYGEGGLVSGIL 130
Query: 182 VRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQL 241
RDI++ + P S FGC +ST GI GFG+ SL SQL
Sbjct: 131 TRDILK-----ARTRDVPRFS---FGC--------VTSTYHEPIGIAGFGRGLLSLPSQL 174
Query: 242 AAAGNVRKEFAHCL-------------DVVKGGGIFAIGDVVSPKVKTTPMV--PNMPH- 285
G + K F+HC ++ G +I ++ ++ TPM+ P P+
Sbjct: 175 ---GFLEKGFSHCFLPFKFVNNPNISSPLILGASALSIN--LTDSLQFTPMLNTPVYPNS 229
Query: 286 YNVILEEVEVGGN--PLDLPTSL--LGTGDERGTIIDSGTTLAYLPPMLYDLVLS 336
Y + LE + +G N P +P +L + G ++DSGTT +LP Y +L+
Sbjct: 230 YYIGLESITIGTNITPTQVPLTLRQFDSQGNGGMLVDSGTTYTHLPNPFYSQLLT 284
>gi|388495448|gb|AFK35790.1| unknown [Medicago truncatula]
Length = 434
Score = 80.9 bits (198), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 85/267 (31%), Positives = 121/267 (45%), Gaps = 38/267 (14%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y K GTP + +DT SD W+ C+GC C T F P KS++ ++
Sbjct: 97 YIVKAKFGTPPQTLLLALDTSSDAAWIPCSGCVGCSTSKP-------FAPIKSTSFRNVS 149
Query: 143 CSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNS 202
C C+ N P+C G C + TYG SS + V+D + L P+
Sbjct: 150 CGSPHCKQVPN---PTCG-GSACAFNFTYG-SSSIAASVVQDTL-------TLAADPI-P 196
Query: 203 SVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKE-FAHCLDVVKG- 260
FGC N+ +G + A G+LG G+ SLLSQ + N+ K F++CL K
Sbjct: 197 GYTFGCVNKTTG-----SSAPQQGLLGLGRGPLSLLSQ---SQNLYKSTFSYCLPSFKSI 248
Query: 261 --GGIFAIGDVVSPK-VKTTPMVPNMPH---YNVILEEVEVGGNPLDLPTSLLGTGDER- 313
G +G V PK +K TP++ N Y V L ++VG +D+P + L
Sbjct: 249 NFSGSLRLGPVYQPKRIKYTPLLRNPRRSSLYYVNLVAIKVGRKIVDIPPAALAFNPTTG 308
Query: 314 -GTIIDSGTTLAYLPPMLYDLVLSQFR 339
GTI DSGT L +Y V ++FR
Sbjct: 309 AGTIFDSGTVFTRLAEPVYTAVRNEFR 335
>gi|222624820|gb|EEE58952.1| hypothetical protein OsJ_10633 [Oryza sativa Japonica Group]
Length = 415
Score = 80.9 bits (198), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 87/280 (31%), Positives = 116/280 (41%), Gaps = 64/280 (22%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPS 133
NG P+ Y + +GTP + +DTGSDL+W C C C ++ L FDPS
Sbjct: 82 NGVPTTE--YLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQA-----LPYFDPS 134
Query: 134 KSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASG 193
SST +C C+ P R + G G+S G
Sbjct: 135 TSSTLSLTSCDSTLCQGLPVASLP------RSDKFTFVGAGASVPG-------------- 174
Query: 194 NLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAH 253
V FGCG +G S+ GI GFG+ SL SQL GN F+H
Sbjct: 175 ----------VAFGCGLFNNGVFKSNE----TGIAGFGRGPLSLPSQL-KVGN----FSH 215
Query: 254 CLDVVKGGGIFAI-----GDVVSP---KVKTTPMVPNMPH---YNVILEEVEVGGNPLDL 302
C + G + D+ S V+TTP++ N + Y + L+ + VG L +
Sbjct: 216 CFTTITGAIPSTVLLDLPADLFSNGQGAVQTTPLIQNPANPTFYYLSLKGITVGSTRLPV 275
Query: 303 PTSLL----GTGDERGTIIDSGTTLAYLPPMLYDLVLSQF 338
P S GTG GTIIDSGT + LP +Y LV F
Sbjct: 276 PESEFALKNGTG---GTIIDSGTAMTSLPTRVYRLVRDAF 312
>gi|242079449|ref|XP_002444493.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
gi|241940843|gb|EES13988.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
Length = 449
Score = 80.9 bits (198), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 79/259 (30%), Positives = 119/259 (45%), Gaps = 38/259 (14%)
Query: 87 VGLGTPTDEYYVQVDTGSDLLWVNCAGCSR--CPTKSDLGIKLTLFDPSKSSTSGEIACS 144
VG+GTP + VDTGSDL+W C+ SR S + L++P +SS+ + CS
Sbjct: 88 VGIGTPPQPRTLIVDTGSDLIWTQCSMLSRRTRTAASASRQREPLYEPRRSSSFAYLPCS 147
Query: 145 DNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTA-PLNSS 203
D C+ + Y +C+ RC Y YG + G + N K + PL
Sbjct: 148 DRLCQEGQFS-YKNCARNNRCMYDELYGSAEA-GGVLASETFTFGV---NAKVSLPLG-- 200
Query: 204 VIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVV---KG 260
FGCG +GDL G++G SL+SQL+ F++CL K
Sbjct: 201 --FGCGALSAGDL-----VGASGLMGLSPGIMSLVSQLSV-----PRFSYCLTPFAERKT 248
Query: 261 GGIF--AIGDV----VSPKVKTTPMVPN----MPHYNVILEEVEVGGNPLDLPTSLLGTG 310
+ A+ D+ + V+TT ++ N +Y V L + +G LD+P + LG
Sbjct: 249 SPLLFGAMADLRRYRTTGTVQTTSILRNPAMETAYYYVPLVGLSLGTKRLDVPATSLGMI 308
Query: 311 DER---GTIIDSGTTLAYL 326
GTI+DSG+T++YL
Sbjct: 309 KPDGSGGTIVDSGSTMSYL 327
>gi|312282359|dbj|BAJ34045.1| unnamed protein product [Thellungiella halophila]
Length = 484
Score = 80.9 bits (198), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 79/282 (28%), Positives = 120/282 (42%), Gaps = 36/282 (12%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPS 133
+G +G YF ++G+GTP Y+ +DTGSD++W+ C+ C C +SD +F+P+
Sbjct: 127 SGLSQGSGEYFMRLGVGTPATNMYMVLDTGSDVVWLQCSPCKVCYNQSD-----PVFNPA 181
Query: 134 KSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASG 193
KS T + C CR ++ C Y V+YGDGS T G F + + + A
Sbjct: 182 KSKTFATVPCGSRLCRRLDDSSECVSRRSKACLYQVSYGDGSFTVGDFSTETLTFHGARV 241
Query: 194 NLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAH 253
+ V GCG+ G + G+ S SQ N +F++
Sbjct: 242 D--------HVALGCGHDNEGLFVGAAGLLGL-----GRGGLSFPSQTKNRYN--GKFSY 286
Query: 254 CL-------DVVKGGGIFAIGDVVSPKVKT-TPMVPNMP---HYNVILEEVEVGGNPL-- 300
CL K G+ PK TP++ N Y + L + VGG+ +
Sbjct: 287 CLVDRTSSGSSSKPPSTIVFGNGAVPKTAVFTPLLTNPKLDTFYYLQLLGISVGGSRVPG 346
Query: 301 --DLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFRF 340
+ L TG+ G IIDSGT++ L Y + FR
Sbjct: 347 VSESQFKLDATGNG-GVIIDSGTSVTRLTQSAYVALRDAFRL 387
>gi|302763741|ref|XP_002965292.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
gi|300167525|gb|EFJ34130.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
Length = 423
Score = 80.9 bits (198), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 82/270 (30%), Positives = 119/270 (44%), Gaps = 35/270 (12%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSG 139
+G YF +G+GTP + DTGSD+LW+ C C C ++D LF+PS SST
Sbjct: 78 SGEYFVSLGVGTPPRTVNMVADTGSDVLWLQCLPCQSCYGQTD-----PLFNPSFSSTFQ 132
Query: 140 EIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAP 199
I C + C+ C +C Y V+YGDGS T G F + + + N
Sbjct: 133 SITCGSSLCQQLL---IRGCRRN-QCLYQVSYGDGSFTVGEFSTETLSFGSNAVN----- 183
Query: 200 LNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAA-GNVRKEFAHCLDVV 258
SV GCG+ G + G+ S SQ+ G+V F++CL
Sbjct: 184 ---SVAIGCGHNNQGLFTGAAGLLGL-----GKGLLSFPSQVGQLYGSV---FSYCLPTR 232
Query: 259 KGGG----IFAIGDVVSPKVKTTPMV-PNM-PHYNVILEEVEVGGNPLDLPT---SLLGT 309
+ G IF V S TT + P + Y V + ++VGG +++P SL +
Sbjct: 233 ESTGSVPLIFGNQAVASNAQFTTLLTNPKLDTFYYVEMVGIKVGGTSVNIPAGSLSLDSS 292
Query: 310 GDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
G I+DSGT + L Y+ + FR
Sbjct: 293 TGNGGVILDSGTAVTRLVTSAYNPMRDAFR 322
>gi|255537017|ref|XP_002509575.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
gi|223549474|gb|EEF50962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
communis]
Length = 459
Score = 80.9 bits (198), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 72/260 (27%), Positives = 121/260 (46%), Gaps = 31/260 (11%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
+ +G P Y +DTGS L W+ C C C + K L++PS SST +
Sbjct: 110 FLVNFSIGQPPVPQYAVMDTGSSLTWIQCEPCINCHQQ-----KGPLYNPSSSSTYVSCS 164
Query: 143 CSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNS 202
+F RT + + + G C Y TY D ++T G + R+ + + +
Sbjct: 165 ---DFDRT---DTTFTATHGSDCNYSQTYADKTTTRGTYAREQLLFETPDDGIT---IMH 215
Query: 203 SVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL----DVV 258
VIFGCG+ + G + A+ G+ G G + SS++S+L F++C+ D +
Sbjct: 216 DVIFGCGHNNTQLPGPTGYAS--GVFGLGDSGSSIISKLGFG------FSYCIGNIGDPL 267
Query: 259 KGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDERG---- 314
G +G+ + + +TP+VP +Y + L + +G LD+ + D G
Sbjct: 268 YGFHRLTLGNKLKIEGYSTPLVPRGLYY-ITLVGISIGQERLDIDPIVFQRVDLNGISSR 326
Query: 315 TIIDSGTTLAYLPPMLYDLV 334
+IDSG TL+Y+P Y++V
Sbjct: 327 IVIDSGATLSYIPRQAYNVV 346
>gi|359476197|ref|XP_003631803.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
Length = 414
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 74/249 (29%), Positives = 109/249 (43%), Gaps = 48/249 (19%)
Query: 106 LLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRC 165
+ W C C RC S FDPS S T +C + TYN
Sbjct: 98 ITWTQCKPCVRCLKDSH-----RHFDPSASLTYSLGSCIPSTVGNTYN------------ 140
Query: 166 EYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVD 225
+TYGD S++ G + D + L+ + + FGCG GD GS D
Sbjct: 141 ---MTYGDKSTSVGNYGCDTM-------TLEPSDVFPKFQFGCGRNNEGDFGS----GAD 186
Query: 226 GILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDVVSPK--VKTTPMVPNM 283
G+LG GQ S +SQ A+ +K F++CL G G+ + + +K T +V N
Sbjct: 187 GMLGLGQGQLSTVSQTAS--KFKKVFSYCLPEEDSIGSLLFGEKATSQSSLKFTSLV-NG 243
Query: 284 P---------HYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLPPMLYDLV 334
P +Y V L ++ VG L++P+S+ + GTIIDSGT + LP Y +
Sbjct: 244 PGTSGLEESGYYFVKLLDISVGNKRLNVPSSVFAS---PGTIIDSGTVITCLPQRAYSAL 300
Query: 335 LSQFRFWIA 343
+ F+ +A
Sbjct: 301 TAAFKKAMA 309
>gi|125543284|gb|EAY89423.1| hypothetical protein OsI_10930 [Oryza sativa Indica Group]
Length = 447
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 84/306 (27%), Positives = 126/306 (41%), Gaps = 57/306 (18%)
Query: 77 PSATGLYF-------TKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTL 129
P+A L F V +GTP + +DTGS+L W+ C G P
Sbjct: 42 PAANRLRFRHNVSLTVPVAVGTPPQNVTMVLDTGSELSWLLCNGSYAPPLTP-------A 94
Query: 130 FDPSKSSTSGEIACSDNFCRTTYNNR----YPSC--SPGVRCEYVVTYGDGSSTSGYFVR 183
F+ S SS+ G + C C + R P C P C ++Y D SS G
Sbjct: 95 FNASGSSSYGAVPCPSTACE--WRGRDLPVPPFCDTPPSNACRVSLSYADASSADGVLAT 152
Query: 184 DIIQLNQASGNLKTAPLNSSVIFGC-------GNRQSGDLGSSTDAAVDGILGFGQANSS 236
D L + P+ FGC S G+ A G+LG + S
Sbjct: 153 DTFLLTGGA-----PPVAVGAYFGCITSYSSTTATNSNGTGTDVSEAATGLLGMNRGTLS 207
Query: 237 LLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDV--VSPKVKTTPMVP---NMPH-----Y 286
++Q + FA+C+ +G G+ +GD V+P + TP++ +P+ Y
Sbjct: 208 FVTQTGT-----RRFAYCIAPGEGPGVLLLGDDGGVAPPLNYTPLIEISQPLPYFDRVAY 262
Query: 287 NVILEEVEVGGNPLDLPTSLLGTGDERG---TIIDSGTTLAYLPPMLYDLVLSQF----R 339
+V LE + VG L +P S+L T D G T++DSGT +L Y + ++F R
Sbjct: 263 SVQLEGIRVGCALLPIPKSVL-TPDHTGAGQTMVDSGTQFTFLLADAYAALKAEFTSQAR 321
Query: 340 FWIASL 345
+A L
Sbjct: 322 LLLAPL 327
>gi|326521034|dbj|BAJ92880.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 448
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 90/310 (29%), Positives = 132/310 (42%), Gaps = 46/310 (14%)
Query: 52 LKQHDTRRHGRMMASIDLELGGNGH-PSATG-------LYFTKVGLGTPTDEYYVQVDTG 103
L +R R++ L + G + P A+G Y + LGTP + + VDT
Sbjct: 69 LADQSSRDASRLLYLDSLAVAGRAYAPIASGRQLLQTPTYVVRARLGTPPQQLLLAVDTS 128
Query: 104 SDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGV 163
+D W+ C+GC+ CPT T F+P+ S + + C C N PSCS
Sbjct: 129 NDAAWIPCSGCAGCPTT-------TPFNPAASKSYRAVPCGSPACSRAPN---PSCSLNT 178
Query: 164 R-CEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDA 222
+ C + +TY D SS +D L A+ +K S FGC + +G T
Sbjct: 179 KSCGFSLTYAD-SSLEAALSQD--SLAVANDVVK------SYTFGCLQKATG-----TAT 224
Query: 223 AVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVKG---GGIFAIGDVVSP-KVKTTP 278
G+LG G+ S LSQ F++CL K G +G P ++KTTP
Sbjct: 225 PPQGLLGLGRGPLSFLSQ--TKDMYEGTFSYCLPSFKSLNFSGTLRLGRKGQPLRIKTTP 282
Query: 279 MVPNMPH----YNVILEEVEVGGNPLDLPTSLLG--TGDERGTIIDSGTTLAYLPPMLYD 332
++ N PH Y V + + VG + +P + L GT++DSGT L Y
Sbjct: 283 LLVN-PHRSSLYYVSMTGIRVGKKVVPIPPAALAFDPATGAGTVLDSGTMFTRLVAPAYV 341
Query: 333 LVLSQFRFWI 342
V + R I
Sbjct: 342 AVRDEVRRRI 351
>gi|125536419|gb|EAY82907.1| hypothetical protein OsI_38120 [Oryza sativa Indica Group]
Length = 448
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 81/272 (29%), Positives = 119/272 (43%), Gaps = 39/272 (14%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRC-PTKSDLGIKLTLFDPSKSSTSG 139
G Y + +G P + +VDTGSDL+WV C+ C+ C P S L+DP++S +SG
Sbjct: 85 GKYIMQFSIGEPPLLIWAEVDTGSDLMWVKCSPCNGCNPPPS------PLYDPARSRSSG 138
Query: 140 EIACSDNFCRTTYNNRYPS--CS--PGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNL 195
++ CS C+ R S CS P + C Y YG S V L +
Sbjct: 139 KLPCSSQLCQALGRGRIISDQCSDDPPL-CGYHYAYGHSGDHSTQGV-----LGTETFTF 192
Query: 196 KTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL 255
+ ++V FG +S + S G++G G+ + SL+SQL A FA+CL
Sbjct: 193 GDGYVANNVSFG----RSDTIDGSQFGGTAGLVGLGRGHLSLVSQLGAG-----RFAYCL 243
Query: 256 -------DVVKGGGIFAI----GDVVSPKVKTTPMVPNMPHYNVILEEVEVGGN--PLDL 302
+ G + A+ GDV S + T P HY V L+ + VGG+ P+
Sbjct: 244 AADPNVYSTILFGSLAALDTSAGDVSSTPLVTNPKPDRDTHYYVNLQGISVGGSRLPIKD 303
Query: 303 PTSLLGTGDERGTIIDSGTTLAYLPPMLYDLV 334
T + + G DSG L Y +V
Sbjct: 304 GTFAINSDGSGGVFFDSGAIDTSLKDAAYQVV 335
>gi|66357264|ref|XP_625810.1| membrane associated aspartyl protease with a transmembrane domain
at the C-terminus [Cryptosporidium parvum Iowa II]
gi|46226904|gb|EAK87870.1| membrane associated aspartyl protease with a transmembrane domain
at the C-terminus [Cryptosporidium parvum Iowa II]
Length = 550
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 79/306 (25%), Positives = 123/306 (40%), Gaps = 50/306 (16%)
Query: 65 ASIDLELGGNGHPSATGLYFTKVGLGTP-TDEYYVQVDTGSDLLWVNCAGCSRCPTKSDL 123
+I L L GN H G YF KV +G P T + + +DTGS L C+ C C T +
Sbjct: 18 KTITLPLYGNVHK--YGYYFIKVNVGFPITQQQTLIIDTGSSLTGFACSDCINCGTHENK 75
Query: 124 GIKLTLFDPSKSSTSGEIACSDNFCRTTYNNR---------------YPSCSPGV---RC 165
+ L S TS I C N T NN YP+ + +C
Sbjct: 76 PFNINL-----SDTSNIIKCKRN---NTPNNETDIINKSIHGRISMNYPNYNKSFLNNKC 127
Query: 166 EYVVTYGDGSSTSGYFVRDIIQL-NQASGNLKT-APLNSSVIFGCGNRQSGDLGSSTDAA 223
Y + Y +GS GYF D ++ N+ S NL+ + +FGC ++ +
Sbjct: 128 VYDIKYSEGSRILGYFFEDFVEFENKLSSNLEIRQKFKNKFVFGCNIIENNFFKFQKASG 187
Query: 224 VDGILGFGQAN-SSLLSQLAAAGNVRKEFAHCLDVV---KGGGIFAIGDVVSPKVK---- 275
+ G+ F + +++ + +G VRK + + + K GG G + K
Sbjct: 188 IMGLANFSNKEMNQIINYIFKSGEVRKTDSDKIISIFFEKDGGKLTFGSTCFDQTKMMNY 247
Query: 276 -----TTPMVPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDER--GTIIDSGTTLAYLPP 328
N Y + ++EV N +L T L +ER I D+GTT++ P
Sbjct: 248 PFENYNITRCINDERYCAYISKIEVDSNTRELDTKL----NERLFKAIFDTGTTISIFPA 303
Query: 329 MLYDLV 334
L+ +
Sbjct: 304 RLFKKI 309
>gi|356528627|ref|XP_003532901.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
Length = 430
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 68/224 (30%), Positives = 96/224 (42%), Gaps = 21/224 (9%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGE 140
G Y + LGTP+ E DTGSDL W+ C C C + + LFDP++SST +
Sbjct: 86 GEYLMRFSLGTPSVERLAIFDTGSDLSWLQCTPCKTCYPQ-----EAPLFDPTQSSTYVD 140
Query: 141 IACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPL 200
+ C C T + C +C Y+ YG S T G D I + A
Sbjct: 141 VPCESQPC-TLFPQNQRECGSSKQCIYLHQYGTDSFTIGRLGYDTISFSSTGMGQGGATF 199
Query: 201 NSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCL----- 255
SV FGC + ST A +G +G G SL SQL + +F++C+
Sbjct: 200 PKSV-FGCAFYSNFTFKISTKA--NGFVGLGPGPLSLASQL--GDQIGHKFSYCMVPFSS 254
Query: 256 ---DVVKGGGIFAIGDVVSPKVKTTPMVPNMPHYNVILEEVEVG 296
+K G + +VVS P P+ +Y + LE + VG
Sbjct: 255 TSTGKLKFGSMAPTNEVVSTPFMINPSYPS--YYVLNLEGITVG 296
>gi|115452187|ref|NP_001049694.1| Os03g0271900 [Oryza sativa Japonica Group]
gi|29893618|gb|AAP06872.1| hypothetical protein [Oryza sativa Japonica Group]
gi|108707424|gb|ABF95219.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|108707425|gb|ABF95220.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
sativa Japonica Group]
gi|113548165|dbj|BAF11608.1| Os03g0271900 [Oryza sativa Japonica Group]
gi|215715205|dbj|BAG94956.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215737033|dbj|BAG95962.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215740994|dbj|BAG97489.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 447
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 84/306 (27%), Positives = 126/306 (41%), Gaps = 57/306 (18%)
Query: 77 PSATGLYF-------TKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTL 129
P+A L F V +GTP + +DTGS+L W+ C G P
Sbjct: 42 PAANRLRFRHNVSLTVPVAVGTPPQNVTMVLDTGSELSWLLCNGSYAPPLTP-------A 94
Query: 130 FDPSKSSTSGEIACSDNFCRTTYNNR----YPSC--SPGVRCEYVVTYGDGSSTSGYFVR 183
F+ S SS+ G + C C + R P C P C ++Y D SS G
Sbjct: 95 FNASGSSSYGAVPCPSTACE--WRGRDLPVPPFCDTPPSNACRVSLSYADASSADGVLAT 152
Query: 184 DIIQLNQASGNLKTAPLNSSVIFGC-------GNRQSGDLGSSTDAAVDGILGFGQANSS 236
D L + P+ FGC S G+ A G+LG + S
Sbjct: 153 DTFLLTGGA-----PPVAVGAYFGCITSYSSTTATNSNGTGTDVSEAATGLLGMNRGTLS 207
Query: 237 LLSQLAAAGNVRKEFAHCLDVVKGGGIFAIGDV--VSPKVKTTPMVP---NMPH-----Y 286
++Q + FA+C+ +G G+ +GD V+P + TP++ +P+ Y
Sbjct: 208 FVTQTGT-----RRFAYCIAPGEGPGVLLLGDDGGVAPPLNYTPLIEISQPLPYFDRVAY 262
Query: 287 NVILEEVEVGGNPLDLPTSLLGTGDERG---TIIDSGTTLAYLPPMLYDLVLSQF----R 339
+V LE + VG L +P S+L T D G T++DSGT +L Y + ++F R
Sbjct: 263 SVQLEGIRVGCALLPIPKSVL-TPDHTGAGQTMVDSGTQFTFLLADAYAALKAEFTSQAR 321
Query: 340 FWIASL 345
+A L
Sbjct: 322 LLLAPL 327
>gi|255554715|ref|XP_002518395.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
gi|223542240|gb|EEF43782.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
Length = 489
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 84/322 (26%), Positives = 140/322 (43%), Gaps = 41/322 (12%)
Query: 39 FKAGGERERTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYV 98
++ R + +S+L+ H TRR ++ +G S YF + +GTP + ++
Sbjct: 76 LQSDNARRQMISSLR-HGTRRKAFEVSHTAQIPIHSGADSGQSQYFVSIRIGTPRPQKFI 134
Query: 99 QV-DTGSDLLWVNCA-GCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRY 156
V DTGSDL W+NC C CP + ++ F + SS+ I CS + C+ + +
Sbjct: 135 LVTDTGSDLTWMNCEYWCKSCPKPNPHPGRV--FRANDSSSFRTIPCSSDDCKIELQDYF 192
Query: 157 P--SC-SPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQS 213
C +P C + Y +G G F + + + + K L V+ GC
Sbjct: 193 SLTECPNPNAPCLFDYRYLNGPRAIGVFANETVTV--GLNDHKKIRL-FDVLIGCT---- 245
Query: 214 GDLGSSTDAAVDGILGFGQANSSLLSQLAAA-GNVRKEFAHCL----DVVKGGGIFAIGD 268
+ + T+ DG++G G SL +LA GN +F++CL + GD
Sbjct: 246 -ESFNETNGFPDGVMGLGYRKHSLALRLAEIFGN---KFSYCLVDHLSSSNHKNFLSFGD 301
Query: 269 VVSPKVKTTPMVPNMPH-----------YNVILEEVEVGGNPLDLPTSLLGTGDERGTII 317
+ P++K +P M H Y V + + VGG+ L + + + G I+
Sbjct: 302 I--PEMK----LPKMQHTELLLGYINAFYPVNVSGISVGGSMLSISSDIWNVTGVGGMIV 355
Query: 318 DSGTTLAYLPPMLYDLVLSQFR 339
DSGT+L L YD V+ +
Sbjct: 356 DSGTSLTMLAGEAYDKVVDALK 377
>gi|449436215|ref|XP_004135889.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
sativus]
Length = 496
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 76/274 (27%), Positives = 127/274 (46%), Gaps = 43/274 (15%)
Query: 83 YFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIA 142
Y VG+G + VDTGSDL WV C C C + + LF+PS SS+ +
Sbjct: 145 YIVTVGIGGQNST--LIVDTGSDLTWVQCLPCRLCYNQQE-----PLFNPSNSSSFLSLP 197
Query: 143 CSDNFC----RTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTA 198
C+ C T ++ S C+Y + YGDGS + G + + L + +
Sbjct: 198 CNSPTCVALQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGELGFEKLTLGKTEID---- 253
Query: 199 PLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAA-GNVRKEFAHCL-- 255
+ IFGCG G G ++ G++G ++ SL+SQ ++ G+V F++CL
Sbjct: 254 ----NFIFGCGRNNKGLFGGAS-----GLMGLARSELSLVSQTSSLFGSV---FSYCLPT 301
Query: 256 -------DVVKGGGIFAIGDVVSPKVKTTPMV--PNMPHYNVI-LEEVEVGGNPLDLPTS 305
+ GG F+ +SP + T M+ P M ++ + L + +GG L++P
Sbjct: 302 TGVGSSGSLTLGGADFSNFKNISP-ISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVPR- 359
Query: 306 LLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
L + + +++DSGT + L P +Y ++F
Sbjct: 360 -LSSNEGVLSLLDSGTVITRLSPSIYKAFKAEFE 392
>gi|15238250|ref|NP_196637.1| aspartyl protease family protein [Arabidopsis thaliana]
gi|8979710|emb|CAB96831.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
thaliana]
gi|18176136|gb|AAL59990.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
gi|22136986|gb|AAM91722.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
gi|110740988|dbj|BAE98588.1| nucleoid DNA-binding protein cnd41 - like protein [Arabidopsis
thaliana]
gi|332004210|gb|AED91593.1| aspartyl protease family protein [Arabidopsis thaliana]
Length = 464
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 76/271 (28%), Positives = 110/271 (40%), Gaps = 34/271 (12%)
Query: 80 TGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGC-SRCPTKSDLGIKLTLFDPSKSSTS 138
+G Y +G+GTP + + DTGSDL W C C C ++ K F+PS SST
Sbjct: 129 SGNYIVTIGIGTPKHDLSLVFDTGSDLTWTQCEPCLGSCYSQ-----KEPKFNPSSSSTY 183
Query: 139 GEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTA 198
++CS C SCS C Y + YGD S T G+ ++ L +
Sbjct: 184 QNVSCSSPMCEDA-----ESCSAS-NCVYSIVYGDKSFTQGFLAKEKFTLTNSD------ 231
Query: 199 PLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVV 258
+ V FGCG G D + S N+ F++CL
Sbjct: 232 -VLEDVYFGCGENNQGLF----DGVAGLLGLGPGKLSLPAQTTTTYNNI---FSYCLPSF 283
Query: 259 KGG--GIFAIGDV-VSPKVKTTPM--VPNMPHYNVILEEVEVGGNPLDLPTSLLGTGDER 313
G G +S VK TP+ P+ +Y + + + VG L + + T
Sbjct: 284 TSNSTGHLTFGSAGISESVKFTPISSFPSAFNYGIDIIGISVGDKELAITPNSFST---E 340
Query: 314 GTIIDSGTTLAYLPPMLYDLVLSQFRFWIAS 344
G IIDSGT LP +Y + S F+ ++S
Sbjct: 341 GAIIDSGTVFTRLPTKVYAELRSVFKEKMSS 371
>gi|357120129|ref|XP_003561782.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
distachyon]
Length = 452
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 89/306 (29%), Positives = 131/306 (42%), Gaps = 66/306 (21%)
Query: 67 IDLELGGNGHPSATGLYF-------TKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPT 119
+ L L P A L F V +GTP + +DTGS+L W+ C G +
Sbjct: 40 LPLRLQAASPPPANRLRFRHNVSLTVPVAVGTPPQNVTMVLDTGSELSWLLCNG-----S 94
Query: 120 KSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNR----YPSCSPGVRCEYVVTYGDGS 175
+ D FD S SS+ + CS C T+ R P C C ++Y D S
Sbjct: 95 RHD-----APFDASASSSYAPVPCSSPAC--TWLGRDLPVRPFCDSSA-CRVSLSYADAS 146
Query: 176 STSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAA---VDGILGFGQ 232
S G D L ++P+ + +FGC S SSTD + G+LG +
Sbjct: 147 SADGLLAADTFLLG-------SSPMPA--LFGCITSYS----SSTDPSETPPTGLLGMNR 193
Query: 233 ANSSLLSQLAAAGNVRKEFAHCLDVVKGGGIFAIG--DVVSP-------KVKTTPMVP-- 281
S ++Q A + FA+C+ +G GI +G D +P ++ TP+V
Sbjct: 194 GGLSFVTQTAT-----RRFAYCIAAGQGPGILLLGGNDTETPLTSPPQQQLNYTPLVEIS 248
Query: 282 -NMPH-----YNVILEEVEVGGNPLDLPTSLLGTGDERG---TIIDSGTTLAYLPPMLYD 332
+P+ Y V LE + VG L +P LL T D G T++DSGT +L P Y
Sbjct: 249 QPLPYFDRAAYTVQLEGIRVGSALLAIPKHLL-TPDHTGAGQTMVDSGTRFTFLLPDAYA 307
Query: 333 LVLSQF 338
+ ++F
Sbjct: 308 ALKAEF 313
>gi|125558627|gb|EAZ04163.1| hypothetical protein OsI_26305 [Oryza sativa Indica Group]
Length = 404
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 72/248 (29%), Positives = 114/248 (45%), Gaps = 39/248 (15%)
Query: 78 SATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSST 137
++ G Y + +GTP + V DTGS L+W CA C+ C + F P+ SST
Sbjct: 85 NSAGAYNMNLSIGTPPVTFSVLADTGSSLIWTQCAPCTECAAR-----PAPPFQPASSST 139
Query: 138 SGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKT 197
++ C+ + C+ + Y +C+ C Y YG G T+GY + + + AS
Sbjct: 140 FSKLPCASSLCQ-FLTSPYRTCN-ATGCVYYYPYGMG-FTAGYLATETLHVGGAS----- 191
Query: 198 APLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDV 257
V FGC +G+S+ GI+G G++ SL+SQ+ A F++CL
Sbjct: 192 ---FPGVTFGCSTENG--VGNSS----SGIVGLGRSPLSLVSQVGVA-----RFSYCLRS 237
Query: 258 VKGGG----IF-AIGDVVSPKVKTTPMV--PNMP---HYNVILEEVEVGGNPLDLPTSLL 307
G +F ++ V V++TP++ P MP +Y V L + VG DLP ++
Sbjct: 238 NADAGDSPILFGSLAKVTGGNVQSTPLLENPEMPSSSYYYVNLTGITVGAT--DLPMAMA 295
Query: 308 GTGDERGT 315
GT
Sbjct: 296 NLTTVNGT 303
>gi|424513106|emb|CCO66690.1| predicted protein [Bathycoccus prasinos]
Length = 802
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 77/295 (26%), Positives = 128/295 (43%), Gaps = 42/295 (14%)
Query: 62 RMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKS 121
+ +S LEL NG TG ++ V +GTP ++ V VDTGS +V C C+ C
Sbjct: 119 KQSSSAGLEL--NGKARDTGYFYATVLIGTPGHQFEVIVDTGSTYTFVTCYPCASCGQHG 176
Query: 122 DLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYF 181
+D +KSS+ + C + +C CEY + + S G+
Sbjct: 177 SNAP----YDAAKSSSYERVPCGSGCI-------FGACRASGLCEYDEKFSEDSQVGGHV 225
Query: 182 VRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQL 241
V D+I + G+L T ++ FGC + ++ L + +G++ G+A + L QL
Sbjct: 226 VSDVIDV---GGSLGTPRIH----FGCNSLETNMLKTQ---KANGMIALGRAEAGLHRQL 275
Query: 242 AAA----GNVRKEFAHCLDVVKGGGIFAIG--------DVVSPKVKTTPMV----PNMPH 285
G+ F CL +GGG+ ++G + V+ K T+ + +
Sbjct: 276 KKKAYPPGSYDGTFGLCLGSFEGGGVLSLGKLPEQHYANFVTRKTHTSTVKLVKGSKSQY 335
Query: 286 YNVILEEVEVGGNPLDLPTSLLGTGDER---GTIIDSGTTLAYLPPMLYDLVLSQ 337
YNV + + V L P+ R GT++DSGTT YL ++ +S+
Sbjct: 336 YNVEVHRMFVRNTELKKPSGAELMEAFRAGYGTVLDSGTTYTYLHEDVFIPFISE 390
>gi|297817208|ref|XP_002876487.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
lyrata]
gi|297322325|gb|EFH52746.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
lyrata]
Length = 520
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 78/298 (26%), Positives = 131/298 (43%), Gaps = 35/298 (11%)
Query: 58 RRHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRC 117
+ G+++A+++ +G +G YF V +G+P + + +DTGSDL W+ C C C
Sbjct: 135 EQAGQLVATLE-----SGMTLGSGEYFMDVLVGSPPKHFSLILDTGSDLNWIQCLPCHDC 189
Query: 118 PTKSDLGIKLTLFDPSKSSTSGEIACSDNFCR--TTYNNRYPSCSPGVRCEYVVTYGDGS 175
++ +DP S++ I C+D C + + P S C Y YGD S
Sbjct: 190 FQQNG-----AFYDPKASASYKNITCNDPRCNLVSPPDPPKPCKSDNQSCPYYYWYGDSS 244
Query: 176 STSGYFVRDIIQLNQASGNLKTAPLN-SSVIFGCGNRQSGDLGSSTDAAVDGILGFGQAN 234
+T+G F + +N + + N +++FGCG+ G + G+
Sbjct: 245 NTTGDFAVETFTVNLTTSGGSSELYNVENMMFGCGHWNRGLFHGAAGLLGL-----GRGP 299
Query: 235 SSLLSQLAAAGNVRKEFAHCL------DVVKGGGIF-AIGDVVS-PKVKTTPMVPNMPH- 285
S SQL + F++CL V IF D++S P + T V +
Sbjct: 300 LSFSSQLQSL--YGHSFSYCLVDRNSDTNVSSKLIFGEDKDLLSHPNLNFTSFVARKENL 357
Query: 286 ----YNVILEEVEVGGNPLDLPTSLLGTGDE--RGTIIDSGTTLAYLPPMLYDLVLSQ 337
Y V ++ + V G L++P + GTIIDSGTTL+Y Y+ + ++
Sbjct: 358 VDTFYYVQIKSIIVAGEVLNIPEETWNISSDGAGGTIIDSGTTLSYFAEPAYEFIKNK 415
>gi|302809855|ref|XP_002986620.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
gi|300145803|gb|EFJ12477.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
Length = 423
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 83/276 (30%), Positives = 120/276 (43%), Gaps = 35/276 (12%)
Query: 74 NGHPSATGLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPS 133
+G +G YF +G+GTP + DTGSD+LW+ C C C ++D LF+PS
Sbjct: 72 SGLSDGSGEYFVSLGVGTPPRTVNMVADTGSDVLWLQCLPCQSCYGQTD-----PLFNPS 126
Query: 134 KSSTSGEIACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTSGYFVRDIIQLNQASG 193
SST I C + C+ C +C Y V+YGDGS T G F + + +
Sbjct: 127 FSSTFQSITCGSSLCQQLL---IRGCRRN-QCLYQVSYGDGSFTVGEFSTETLSFGSNAV 182
Query: 194 NLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAA-GNVRKEFA 252
N SV GCG+ G + G+ S SQ+ G+V F+
Sbjct: 183 N--------SVAIGCGHNNQGLFTGAAGLLGL-----GKGLLSFPSQVGQLYGSV---FS 226
Query: 253 HCLDVVKGGG----IFAIGDVVSPKVKTTPMV-PNM-PHYNVILEEVEVGGNPLDLPT-- 304
+CL + G IF V S TT + P + Y V + ++VGG + +P
Sbjct: 227 YCLPTRESTGSVPLIFGNQAVASNAQFTTLLTNPKLDTFYYVEMVGIKVGGTSVSIPAGS 286
Query: 305 -SLLGTGDERGTIIDSGTTLAYLPPMLYDLVLSQFR 339
SL + G I+DSGT + L Y+ + FR
Sbjct: 287 LSLDSSTGNGGVILDSGTAVTRLVTSAYNPMRDAFR 322
>gi|343198386|gb|AEM05966.1| nodulin 41 [Phaseolus vulgaris]
Length = 437
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 83/263 (31%), Positives = 112/263 (42%), Gaps = 43/263 (16%)
Query: 81 GLYFTKVGLGTPTDEYYVQVDTGSDLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGE 140
G Y + +GTP E DTGSDL+WV C+ C+ C +S LF P KSST
Sbjct: 88 GEYLMRFYIGTPPVERLATADTGSDLIWVQCSPCASCFPQST-----PLFQPLKSSTFMP 142
Query: 141 IACSDNFCRTTYNNRYPSCSPGVRCEYVVTYGDGSSTS-GYFVRDIIQLNQASGNLKTAP 199
C C T C C Y YGD S S G + ++ + G A
Sbjct: 143 TTCRSQPC-TLLLPEQKGCGKSGECIYTYKYGDQYSFSEGLLSTETLRFDSQGGVQTVAF 201
Query: 200 LNSSVIFGCGNRQSGDLGSSTDAAVDGILGFGQANSSLLSQLAAAGNVRKEFAHCLDVVK 259
NS FGCG + + S + GI+G G SL+SQ+ + +F++CL
Sbjct: 202 PNS--FFGCGLYNNITVFPS--YKLTGIMGLGAGPLSLVSQI--GDQIGHKFSYCL---- 251
Query: 260 GGGIFAIGDVVSPKVK-------------TTPMV--PNMPHYNVI-LEEVEVGGNPLDLP 303
+G + K+K +TPM+ P +P Y + LE V V +P
Sbjct: 252 ----LPLGSTSTSKLKFGNESIITGEGVVSTPMIIKPWLPTYYFLNLEAVTVAQK--TVP 305
Query: 304 TSLLGTGDERGTIIDSGTTLAYL 326
T G+ D IIDSGT L YL
Sbjct: 306 T---GSTD-GNVIIDSGTLLTYL 324
>gi|388498308|gb|AFK37220.1| unknown [Lotus japonicus]
Length = 363
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 74/239 (30%), Positives = 107/239 (44%), Gaps = 36/239 (15%)
Query: 59 RHGRMMASIDLELGGNGHPSATGLYFTKVGLGTPTD----EYYVQVDTGSDLLWVNCAGC 114
R +M++S +E+ P A+G+ F + + + V +DTGSDL WV C C
Sbjct: 115 RLRKMVSSHSVEVSQIQIPLASGVNFQTLNYIVTMELGGQDMTVIIDTGSDLTWVQCEPC 174
Query: 115 SRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRT---TYNNRYPSCSPGVRCEYVVTY 171
C + +F PS SS+ I C+ + C++ T N S C Y V Y
Sbjct: 175 MSCYNQQG-----PVFKPSTSSSYQSIPCNSSTCQSLQLTTGNAGACESNPSNCSYAVNY 229
Query: 172 GDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVIFGCGNRQSGDLGSSTDAAVDGILGFG 231
GDGS T+G + + S S+ +FGCG G G V G++G G
Sbjct: 230 GDGSYTNGELGAEHLSFGGISV--------SNFVFGCGKNNKGLFG-----GVSGLMGLG 276
Query: 232 QANSSLLSQLAAA-GNVRKEFAHCLDVVKGG--GIFAIGDVVSPKVKTTP-----MVPN 282
++N SL+SQ + G V F++CL G G A+G+ S TP MVPN
Sbjct: 277 RSNLSLISQTNSTFGGV---FSYCLPPTDAGASGSLAMGNESSVFKNLTPIAYTRMVPN 332
>gi|325183199|emb|CCA17657.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 873
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 78/292 (26%), Positives = 128/292 (43%), Gaps = 31/292 (10%)
Query: 47 RTLSALKQHDTRRHGRMMASIDLELGGNGHPSATGL--YFTKVGLGTPTDEYYVQVDTGS 104
R L LK R+ ++ L N P GL ++ ++ +G P V +DTGS
Sbjct: 8 RNLEPLKIELKRKTRQLKNQTSPPLVYNDAPLGVGLGTHYAELYIGIPPQRASVILDTGS 67
Query: 105 DLLWVNCAGCSRCPTKSDLGIKLTLFDPSKSSTSGEIACSDNFCRTTYNNRYPSCSPGVR 164
L C C C T +D FD +KS+ S NF + Y +C +
Sbjct: 68 GLTAFPCDKCVDCGTHTD-----PKFDATKST-------SINFVQCKYEEGCDTCRDNL- 114
Query: 165 CEYVVTYGDGSSTSGYFVRDIIQLNQASGNLKTAPLNSSVI---FGCGNRQSGDLGSSTD 221
C Y +GS ++D+I + + + I FGC R++G + +
Sbjct: 115 CVIHQRYSEGSMWEAVVMQDLIWVGNVDSDRAEMIMRRYGIRFKFGCQTRETGLFITQVE 174
Query: 222 AAVDGILGFGQANSSLLSQLAAAGNVRK-EFAHCLDVVKGGGIFAIGDV----VSPKVKT 276
+GI+G G +++ +++ A V + +FA C + GG F IG V + K+
Sbjct: 175 ---NGIMGLGIGRNNIATEMYKAKRVEEHKFALCFG--QKGGSFVIGGVDYSHHTTKIAY 229
Query: 277 TPMVPN-MPHYNVILEEVEVGGNPLDLPTSLLGTGDERGTIIDSGTTLAYLP 327
TP+ + +Y + +++V +GG L + +G RG I+DSGTT Y P
Sbjct: 230 TPLAKHGTSNYPIEVKDVRIGGISLQVDAEHFKSG--RGAIVDSGTTDTYFP 279
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.318 0.137 0.413
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,832,756,049
Number of Sequences: 23463169
Number of extensions: 259754995
Number of successful extensions: 559914
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1257
Number of HSP's successfully gapped in prelim test: 2141
Number of HSP's that attempted gapping in prelim test: 552875
Number of HSP's gapped (non-prelim): 3925
length of query: 346
length of database: 8,064,228,071
effective HSP length: 143
effective length of query: 203
effective length of database: 9,003,962,200
effective search space: 1827804326600
effective search space used: 1827804326600
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 77 (34.3 bits)